v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

International Conference on Learning Representations (ICLR), 2019

26 September 2019

ArXiv (abs)PDF HTML Github (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 3,048 papers shown

A Contrastive Framework with User, Item and Review Alignment for RecommendationWeb Search and Data Mining (WSDM), 2025

Hoang V. Dong

Yuan Fang

Hady W. Lauw

661

21 Jan 2025

Assessing and Enhancing the Robustness of Large Language Models with Task Structure Variations for Logical ReasoningInternational Conference on Neural Information Processing (ICONIP), 2023

478

20 Jan 2025

Harnessing the Potential of Large Language Models in Modern Marketing Management: Applications, Future Directions, and Strategic Recommendations

255

18 Jan 2025

A Comprehensive Survey of Foundation Models in MedicineIEEE Reviews in Biomedical Engineering (RBME), 2024

766

17 Jan 2025

Bridging the Fairness Gap: Enhancing Pre-trained Models with LLM-Generated SentencesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

253

12 Jan 2025

Multi-task Visual Grounding with Coarse-to-Fine Consistency ConstraintsAAAI Conference on Artificial Intelligence (AAAI), 2025

369

12 Jan 2025

Finnish SQuAD: A Simple Approach to Machine Translation of Span Annotations

Emil Nuutinen

Iiro Rastas

Filip Ginter

180

10 Jan 2025

Merging Feed-Forward Sublayers for Compressed Transformers

373

10 Jan 2025

Clinical Insights: A Comprehensive Review of Language Models in MedicinePLOS Digital Health (PDH), 2024

529

08 Jan 2025

Trust Modeling in Counseling Conversations: A Benchmark Study

180

06 Jan 2025

Decoding News Bias: Multi Bias Detection in News ArticlesInternational Conference on Natural Language Processing and Information Retrieval (ICNLPIR), 2024

Bhushan Santosh Shah

Deven Santosh Shah

Vahida Attar

218

05 Jan 2025

Swift Cross-Dataset Pruning: Enhancing Fine-Tuning Efficiency in Natural Language UnderstandingInternational Conference on Computational Linguistics (COLING), 2025

Binh-Nguyen Nguyen

Yang He

255

05 Jan 2025

Toward Corpus Size Requirements for Training and Evaluating Depression Risk Models Using Spoken LanguageInterspeech (Interspeech), 2022

246

03 Jan 2025

Efficient support ticket resolution using Knowledge Graphs

Sherwin Varghese

James Tian

03 Jan 2025

SAFE-MEME: Structured Reasoning Framework for Robust Hate Speech Detection in Memes

Palash Nandi

Shivam Sharma

Tanmoy Chakraborty

228

31 Dec 2024

Text Classification: Neural Networks VS Machine Learning Models VS Pre-trained Models

Christos Petridis

VLM

318

31 Dec 2024

A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in MedicineInformation Fusion (Inf. Fusion), 2024

449

31 Dec 2024

AIGT: AI Generative Table Based on PromptInternational Conference on Computational Linguistics (COLING), 2024

227

24 Dec 2024

Adversarial Robustness through Dynamic Ensemble Learning

251

20 Dec 2024

COSEE: Consistency-Oriented Signal-Based Early Exiting via Calibrated Sample Weighting MechanismAAAI Conference on Artificial Intelligence (AAAI), 2024

326

17 Dec 2024

Unlocking LLMs: Addressing Scarce Data and Bias Challenges in Mental Health

Vivek Kumar

Eirini Ntoutsi

Pushpraj Singh Rajawat

Giacomo Medda

Diego Reforgiato Recupero

AI4MH

263

17 Dec 2024

Bias Vector: Mitigating Biases in Language Models with Task Arithmetic ApproachInternational Conference on Computational Linguistics (COLING), 2024

263

16 Dec 2024

One Pixel is All I Need

Deng Siqin

Zhou Xiaoyi

ViT

1.0K

14 Dec 2024

BinarySelect to Improve Accessibility of Black-Box Attack ResearchInternational Conference on Computational Linguistics (COLING), 2024

Shatarupa Ghosh

Jonathan Rusert

AAML

391

13 Dec 2024

SMMF: Square-Matricized Momentum Factorization for Memory-Efficient OptimizationAAAI Conference on Artificial Intelligence (AAAI), 2024

Kwangryeol Park

Seulki Lee

182

12 Dec 2024

Coverage-based Fairness in Multi-document SummarizationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

410

11 Dec 2024

KITE-DDI: A Knowledge graph Integrated Transformer Model for accurately predicting Drug-Drug Interaction Events from Drug SMILES and Biomedical Knowledge GraphIEEE Access (IEEE Access), 2024

Azwad Tamir

Jiann-Shiun Yuan

183

08 Dec 2024

AntLM: Bridging Causal and Masked Language Models

327

04 Dec 2024

Generative Language Models Potential for Requirement Engineering Applications: Insights into Current Strengths and Limitations

278

01 Dec 2024

DynRank: Improving Passage Retrieval with Dynamic Zero-Shot Prompting Based on Question Classification

Abdelrahman Abdallah

Jamshid Mozafari

Bhawna Piryani

Mohammed M. Abdelgwad

Adam Jatowt

372

30 Nov 2024

A Flexible Method for Behaviorally Measuring Alignment Between Human and Artificial Intelligence Using Representational Similarity Analysis

458

30 Nov 2024

Can bidirectional encoder become the ultimate winner for downstream applications of foundation models?

281

27 Nov 2024

What Differentiates Educational Literature? A Multimodal Fusion Approach of Transformers and Computational Linguistics

Jordan J. Bird

410

26 Nov 2024

Dynamic Self-Distillation via Previous Mini-batches for Fine-tuning Small Language Models

390

25 Nov 2024

Profiling Bias in LLMs: Stereotype Dimensions in Contextual Word Embeddings

Carolin M. Schuster

Maria-Alexandra Dinisor

Shashwat Ghatiwala

Georg Groh

364

25 Nov 2024

Is Training Data Quality or Quantity More Impactful to Small Language Model Performance?

Aryan Sajith

Krishna Chaitanya Rao Kathala

254

24 Nov 2024

A Comparative Analysis of Transformer and LSTM Models for Detecting Suicidal Ideation on RedditInternational Conference on Machine Learning and Applications (ICMLA), 2024

Khalid Hasan

Jamil Saquer

AI4MH

266

23 Nov 2024

IRLab@iKAT24: Learned Sparse Retrieval with Multi-aspect LLM Query Generation for Conversational Search

185

22 Nov 2024

FLARE: FP-Less PTQ and Low-ENOB ADC Based AMS-PiM for Error-Resilient, Fast, and Efficient Transformer Acceleration

214

22 Nov 2024

BERT-Based Approach for Automating Course Articulation Matrix Construction with Explainable AI

Natenaile Asmamaw Shiferaw

Simpenzwe Honore Leandre

Aman Sinha

Dillip Rout

136

21 Nov 2024

Forecasting Future International Events: A Reliable Dataset for Text-Based Event ModelingConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

260

21 Nov 2024

Mitigating Gender Bias in Contextual Word Embeddings

Navya Yarrabelly

Vinay Damodaran

Feng-Guang Su

249

18 Nov 2024

New Emerged Security and Privacy of Pre-trained Model: a Survey and Outlook

309

12 Nov 2024

Clustering in Causal Attention MaskingNeural Information Processing Systems (NeurIPS), 2024

Nikita Karagodin

Yury Polyanskiy

Philippe Rigollet

317

07 Nov 2024

PSformer: Parameter-efficient Transformer with Segment Attention for Time Series Forecasting

315

03 Nov 2024

Multi-Channel Hypergraph Contrastive Learning for Matrix Completion

291

02 Nov 2024

Human-inspired Perspectives: A Survey on AI Long-term Memory

585

01 Nov 2024

ProTransformer: Robustify Transformers via Plug-and-Play ParadigmNeural Information Processing Systems (NeurIPS), 2024

262

30 Oct 2024

Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRAInternational Conference on Learning Representations (ICLR), 2024

389

28 Oct 2024

Ensembling Finetuned Language Models for Text Classification

Sebastian Pineda Arango

Maciej Janowski

Lennart Purucker

Arber Zela

Katharina Eggensperger

Josif Grabocka

217

25 Oct 2024