ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

International Conference on Learning Representations (ICLR), 2019

26 September 2019

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 3,050 papers shown

Successor Features for Efficient Multisubject Controlled Text Generation

Mengyao Cao

Mehdi Fatemi

Jackie Chi Kit Cheung

Samira Shabanian

BDL

173

03 Nov 2023

Adapting Fake News Detection to the Era of Large Language Models

320

02 Nov 2023

Investigating Self-Supervised Deep Representations for EEG-based Auditory Attention Decoding

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Karan Thakkar

Jiarui Hai

Mounya Elhilali

223

01 Nov 2023

Latent Space Translation via Semantic Alignment

Neural Information Processing Systems (NeurIPS), 2023

Valentino Maiorca

Francesco Locatello

416

01 Nov 2023

LLMs may Dominate Information Access: Neural Retrievers are Biased Towards LLM-Generated Texts

Knowledge Discovery and Data Mining (KDD), 2023

Liang Pang

Jun Xu

279

31 Oct 2023

Do large language models solve verbal analogies like children do?

Claire E. Stevenson

Mathilde ter Veen

Rochelle Choenni

Han L. J. van der Maas

Ekaterina Shutova

LRM

170

31 Oct 2023

Learning to Play Chess from Textbooks (LEAP): a Corpus for Evaluating Chess Moves based on Sentiment Analysis

Haifa Alrdahi

Riza Batista-Navarro

195

31 Oct 2023

EELBERT: Tiny Models through Dynamic Embeddings

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

139

31 Oct 2023

Efficient Classification of Student Help Requests in Programming Courses Using Large Language Models

194

31 Oct 2023

MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks

Neural Information Processing Systems (NeurIPS), 2023

Tatsunori Hashimoto

285

30 Oct 2023

MCAD: Multi-teacher Cross-modal Alignment Distillation for efficient image-text retrieval

369

30 Oct 2023

A Lightweight Method to Generate Unanswerable Questions in English

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Vagrant Gautam

Miaoran Zhang

Dietrich Klakow

212

30 Oct 2023

BERT Lost Patience Won't Be Robust to Adversarial Slowdown

Neural Information Processing Systems (NeurIPS), 2023

331

29 Oct 2023

Stacking the Odds: Transformer-Based Ensemble for AI-Generated Text Detection

Australasian Language Technology Association Workshop (ALTA), 2023

Duke Nguyen

Khaing Myat Noe Naing

Aditya Joshi

219

29 Oct 2023

Multi-grained Evidence Inference for Multi-choice Reading Comprehension

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

Yilin Zhao

Hai Zhao

Sufeng Duan

209

27 Oct 2023

Outlier Dimensions Encode Task-Specific Knowledge

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

William Rudman

Catherine Chen

Carsten Eickhoff

293

26 Oct 2023

PETA: Evaluating the Impact of Protein Transfer Learning with Sub-word Tokenization on Downstream Applications

184

26 Oct 2023

Understanding the Role of Input Token Characters in Language Models: How Does Information Loss Affect Performance?

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

144

26 Oct 2023

Joint Entity and Relation Extraction with Span Pruning and Hypergraph Neural Networks

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

378

26 Oct 2023

Apollo: Zero-shot MultiModal Reasoning with Multiple Experts

181

25 Oct 2023

Kiki or Bouba? Sound Symbolism in Vision-and-Language Models

Neural Information Processing Systems (NeurIPS), 2023

Morris Alper

Hadar Averbuch-Elor

290

25 Oct 2023

FedTherapist: Mental Health Monitoring with User-Generated Linguistic Expressions on Smartphones via Federated Learning

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

175

25 Oct 2023

Subspace Chronicles: How Linguistic Information Emerges, Shifts and Interacts during Language Model Training

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

277

25 Oct 2023

URL-BERT: Training Webpage Representations via Social Media Engagements

Taylor Berg-Kirkpatrick

239

25 Oct 2023

CR-COPEC: Causal Rationale of Corporate Performance Changes to Learn from Financial Reports

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

260

24 Oct 2023

Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Dongyan Zhao

Rui Yan

224

24 Oct 2023

TRAMS: Training-free Memory Selection for Long-range Language Modeling

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Haofei Yu

Cunxiang Wang

Yue Zhang

Wei Bi

RALM

301

24 Oct 2023

CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

266

24 Oct 2023

PartialFormer: Modeling Part Instead of Whole for Machine Translation

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Jingbo Zhu

246

23 Oct 2023

Unveiling the Multi-Annotation Process: Examining the Influence of Annotation Quantity and Instance Difficulty on Model Performance

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Pritam Kadasi

Mayank Singh

227

23 Oct 2023

PromptCBLUE: A Chinese Prompt Tuning Benchmark for the Medical Domain

169

22 Oct 2023

Transductive Learning for Textual Few-Shot Classification in API-based Embedding Models

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

220

21 Oct 2023

A Novel Information-Theoretic Objective to Disentangle Representations for Fair Classification

International Joint Conference on Natural Language Processing (IJCNLP), 2023

284

21 Oct 2023

Plausibility Processing in Transformer Language Models: Focusing on the Role of Attention Heads in GPT

Soo Hyun Ryu

171

20 Oct 2023

The Less the Merrier? Investigating Language Representation in Multilingual Models

H. Nigatu

A. Tonja

Jugal Kalita

261

20 Oct 2023

Unsupervised Candidate Answer Extraction through Differentiable Masker-Reconstructor Model

230

19 Oct 2023

A Predictive Factor Analysis of Social Biases and Task-Performance in Pretrained Masked Language Models

Yi Zhou

Jose Camacho-Collados

Danushka Bollegala

438

19 Oct 2023

Boosting Inference Efficiency: Unleashing the Power of Parameter-Shared Pre-trained Language Models

Weize Chen

Xiaoyue Xu

Xu Han

Yankai Lin

Ruobing Xie

Zhiyuan Liu

Maosong Sun

Jie Zhou

123

19 Oct 2023

Character-level Chinese Backpack Language Models

Hao Sun

John Hewitt

150

19 Oct 2023

Time-Aware Representation Learning for Time-Sensitive Question Answering

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Jungbin Son

Alice Oh

154

19 Oct 2023

Pretraining Language Models with Text-Attributed Heterogeneous Graphs

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

282

19 Oct 2023

DepWiGNN: A Depth-wise Graph Neural Network for Multi-hop Spatial Reasoning in Text

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Shuaiyi Li

Yang Deng

Wai Lam

368

19 Oct 2023

SPEED: Speculative Pipelined Execution for Efficient Decoding

Coleman Hooper

Sehoon Kim

204

18 Oct 2023

DesignQuizzer: A Community-Powered Conversational Agent for Learning Visual Design

176

18 Oct 2023

Improving Long Document Topic Segmentation Models With Enhanced Coherence Modeling

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Qian Chen

Wen Wang

AI4TS

246

18 Oct 2023

Chain-of-Thought Tuning: Masked Language Models can also Think Step By Step in Natural Language Understanding

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

217

18 Oct 2023

Disentangling the Linguistic Competence of Privacy-Preserving BERT

BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2023

Stefan Arnold

Nils Kemmerzell

Annika Schreiner

253

17 Oct 2023

QADYNAMICS: Training Dynamics-Driven Synthetic QA Diagnostic for Zero-Shot Commonsense Question Answering

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

266

17 Oct 2023

Survey of Vulnerabilities in Large Language Models Revealed by Adversarial Attacks

476

230

16 Oct 2023

PELA: Learning Parameter-Efficient Models with Low-Rank Approximation

Computer Vision and Pattern Recognition (CVPR), 2023

Yangyang Guo

Guangzhi Wang

Mohan S. Kankanhalli

214

16 Oct 2023