ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.11942
  4. Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations
v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

International Conference on Learning Representations (ICLR), 2019
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSLAIMat
ArXiv (abs)PDFHTMLGithub (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 3,050 papers shown
Successor Features for Efficient Multisubject Controlled Text Generation
Successor Features for Efficient Multisubject Controlled Text Generation
Mengyao Cao
Mehdi Fatemi
Jackie Chi Kit Cheung
Samira Shabanian
BDL
173
0
0
03 Nov 2023
Adapting Fake News Detection to the Era of Large Language Models
Adapting Fake News Detection to the Era of Large Language Models
Jinyan Su
Claire Cardie
Preslav Nakov
DeLMO
320
35
0
02 Nov 2023
Investigating Self-Supervised Deep Representations for EEG-based
  Auditory Attention Decoding
Investigating Self-Supervised Deep Representations for EEG-based Auditory Attention DecodingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Karan Thakkar
Jiarui Hai
Mounya Elhilali
223
2
0
01 Nov 2023
Latent Space Translation via Semantic Alignment
Latent Space Translation via Semantic AlignmentNeural Information Processing Systems (NeurIPS), 2023
Valentino Maiorca
Luca Moschella
Antonio Norelli
Marco Fumero
Francesco Locatello
Emanuele Rodolà
416
36
0
01 Nov 2023
LLMs may Dominate Information Access: Neural Retrievers are Biased
  Towards LLM-Generated Texts
LLMs may Dominate Information Access: Neural Retrievers are Biased Towards LLM-Generated TextsKnowledge Discovery and Data Mining (KDD), 2023
Sunhao Dai
Yuqi Zhou
Liang Pang
Weihao Liu
Xiaolin Hu
Yong Liu
Xiao Zhang
Gang Wang
Jun Xu
279
6
0
31 Oct 2023
Do large language models solve verbal analogies like children do?
Do large language models solve verbal analogies like children do?
Claire E. Stevenson
Mathilde ter Veen
Rochelle Choenni
Han L. J. van der Maas
Ekaterina Shutova
LRM
170
12
0
31 Oct 2023
Learning to Play Chess from Textbooks (LEAP): a Corpus for Evaluating
  Chess Moves based on Sentiment Analysis
Learning to Play Chess from Textbooks (LEAP): a Corpus for Evaluating Chess Moves based on Sentiment Analysis
Haifa Alrdahi
Riza Batista-Navarro
195
2
0
31 Oct 2023
EELBERT: Tiny Models through Dynamic Embeddings
EELBERT: Tiny Models through Dynamic EmbeddingsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Gabrielle Cohn
Rishika Agarwal
Deepanshu Gupta
Siddharth Patwardhan
139
2
0
31 Oct 2023
Efficient Classification of Student Help Requests in Programming Courses
  Using Large Language Models
Efficient Classification of Student Help Requests in Programming Courses Using Large Language Models
Jaromír Šavelka
Paul Denny
Mark H. Liffiton
Brad Sheese
AI4Ed
194
8
0
31 Oct 2023
MoCa: Measuring Human-Language Model Alignment on Causal and Moral
  Judgment Tasks
MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment TasksNeural Information Processing Systems (NeurIPS), 2023
Allen Nie
Yuhui Zhang
Atharva Amdekar
Chris Piech
Tatsunori Hashimoto
Tobias Gerstenberg
285
55
0
30 Oct 2023
MCAD: Multi-teacher Cross-modal Alignment Distillation for efficient
  image-text retrieval
MCAD: Multi-teacher Cross-modal Alignment Distillation for efficient image-text retrieval
Youbo Lei
Feifei He
Chen Chen
Yingbin Mo
Sijia Li
Defeng Xie
H. Lu
VLM
369
2
0
30 Oct 2023
A Lightweight Method to Generate Unanswerable Questions in English
A Lightweight Method to Generate Unanswerable Questions in EnglishConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Vagrant Gautam
Miaoran Zhang
Dietrich Klakow
212
2
0
30 Oct 2023
BERT Lost Patience Won't Be Robust to Adversarial Slowdown
BERT Lost Patience Won't Be Robust to Adversarial SlowdownNeural Information Processing Systems (NeurIPS), 2023
Zachary Coalson
Gabriel Ritter
Rakesh Bobba
Sanghyun Hong
AAML
331
2
0
29 Oct 2023
Stacking the Odds: Transformer-Based Ensemble for AI-Generated Text
  Detection
Stacking the Odds: Transformer-Based Ensemble for AI-Generated Text DetectionAustralasian Language Technology Association Workshop (ALTA), 2023
Duke Nguyen
Khaing Myat Noe Naing
Aditya Joshi
219
6
0
29 Oct 2023
Multi-grained Evidence Inference for Multi-choice Reading Comprehension
Multi-grained Evidence Inference for Multi-choice Reading ComprehensionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Yilin Zhao
Hai Zhao
Sufeng Duan
209
2
0
27 Oct 2023
Outlier Dimensions Encode Task-Specific Knowledge
Outlier Dimensions Encode Task-Specific KnowledgeConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
William Rudman
Catherine Chen
Carsten Eickhoff
293
9
0
26 Oct 2023
PETA: Evaluating the Impact of Protein Transfer Learning with Sub-word
  Tokenization on Downstream Applications
PETA: Evaluating the Impact of Protein Transfer Learning with Sub-word Tokenization on Downstream Applications
Yang Tan
Mingchen Li
P. Tan
Ziyi Zhou
Huiqun Yu
Guisheng Fan
Liang Hong
184
0
0
26 Oct 2023
Understanding the Role of Input Token Characters in Language Models: How
  Does Information Loss Affect Performance?
Understanding the Role of Input Token Characters in Language Models: How Does Information Loss Affect Performance?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Ahmed Alajrami
Katerina Margatina
Nikolaos Aletras
AAML
144
3
0
26 Oct 2023
Joint Entity and Relation Extraction with Span Pruning and Hypergraph
  Neural Networks
Joint Entity and Relation Extraction with Span Pruning and Hypergraph Neural NetworksConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Zhaohui Yan
Aaron Courville
Wei Liu
Kewei Tu
378
29
0
26 Oct 2023
Apollo: Zero-shot MultiModal Reasoning with Multiple Experts
Apollo: Zero-shot MultiModal Reasoning with Multiple Experts
Daniela Ben-David
Tzuf Paz-Argaman
Reut Tsarfaty
MoE
181
0
0
25 Oct 2023
Kiki or Bouba? Sound Symbolism in Vision-and-Language Models
Kiki or Bouba? Sound Symbolism in Vision-and-Language ModelsNeural Information Processing Systems (NeurIPS), 2023
Morris Alper
Hadar Averbuch-Elor
290
15
0
25 Oct 2023
FedTherapist: Mental Health Monitoring with User-Generated Linguistic
  Expressions on Smartphones via Federated Learning
FedTherapist: Mental Health Monitoring with User-Generated Linguistic Expressions on Smartphones via Federated LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Jaemin Shin
Hyungjun Yoon
Seungjoo Lee
Sungjoon Park
Yunxin Liu
Jinho D. Choi
Sung-Ju Lee
175
12
0
25 Oct 2023
Subspace Chronicles: How Linguistic Information Emerges, Shifts and
  Interacts during Language Model Training
Subspace Chronicles: How Linguistic Information Emerges, Shifts and Interacts during Language Model TrainingConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Max Müller-Eberstein
Rob van der Goot
Barbara Plank
Ivan Titov
277
15
0
25 Oct 2023
URL-BERT: Training Webpage Representations via Social Media Engagements
URL-BERT: Training Webpage Representations via Social Media Engagements
A. Qamar
Chetan Verma
Ahmed El-Kishky
Sumit Binnani
Sneha Mehta
Taylor Berg-Kirkpatrick
239
0
0
25 Oct 2023
CR-COPEC: Causal Rationale of Corporate Performance Changes to Learn
  from Financial Reports
CR-COPEC: Causal Rationale of Corporate Performance Changes to Learn from Financial ReportsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Ye Eun Chun
Sunjae Kwon
Kyung-Woo Sohn
Nakwon Sung
Junyoup Lee
Byungki Seo
Kevin Compher
Seung-won Hwang
Jaesik Choi
260
1
0
24 Oct 2023
Retrieval-based Knowledge Transfer: An Effective Approach for Extreme
  Large Language Model Compression
Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model CompressionConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Jiduan Liu
Jiahao Liu
Qifan Wang
Jingang Wang
Xunliang Cai
Dongyan Zhao
Ran Wang
Rui Yan
224
6
0
24 Oct 2023
TRAMS: Training-free Memory Selection for Long-range Language Modeling
TRAMS: Training-free Memory Selection for Long-range Language ModelingConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Haofei Yu
Cunxiang Wang
Yue Zhang
Wei Bi
RALM
301
5
0
24 Oct 2023
CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without
  Full Large Language Model
CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language ModelConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Kaiyan Zhang
Ning Ding
Biqing Qi
Xuekai Zhu
Xinwei Long
Bowen Zhou
266
5
0
24 Oct 2023
PartialFormer: Modeling Part Instead of Whole for Machine Translation
PartialFormer: Modeling Part Instead of Whole for Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Tong Zheng
Bei Li
Huiwen Bao
Jiale Wang
Weiqiao Shan
Tong Xiao
Jingbo Zhu
MoEAI4CE
246
2
0
23 Oct 2023
Unveiling the Multi-Annotation Process: Examining the Influence of
  Annotation Quantity and Instance Difficulty on Model Performance
Unveiling the Multi-Annotation Process: Examining the Influence of Annotation Quantity and Instance Difficulty on Model PerformanceConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Pritam Kadasi
Mayank Singh
227
4
0
23 Oct 2023
PromptCBLUE: A Chinese Prompt Tuning Benchmark for the Medical Domain
PromptCBLUE: A Chinese Prompt Tuning Benchmark for the Medical Domain
Wei-wei Zhu
Xiaoling Wang
Huanran Zheng
Mosha Chen
Buzhou Tang
ELMLM&MA
169
48
0
22 Oct 2023
Transductive Learning for Textual Few-Shot Classification in API-based
  Embedding Models
Transductive Learning for Textual Few-Shot Classification in API-based Embedding ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Pierre Colombo
Victor Pellegrain
Malik Boudiaf
Victor Storchan
Myriam Tami
Ismail Ben Ayed
C´eline Hudelot
Pablo Piantanida
220
8
0
21 Oct 2023
A Novel Information-Theoretic Objective to Disentangle Representations
  for Fair Classification
A Novel Information-Theoretic Objective to Disentangle Representations for Fair ClassificationInternational Joint Conference on Natural Language Processing (IJCNLP), 2023
Pierre Colombo
Nathan Noiry
Guillaume Staerman
Pablo Piantanida
FaMLDRL
284
2
0
21 Oct 2023
Plausibility Processing in Transformer Language Models: Focusing on the
  Role of Attention Heads in GPT
Plausibility Processing in Transformer Language Models: Focusing on the Role of Attention Heads in GPT
Soo Hyun Ryu
171
1
0
20 Oct 2023
The Less the Merrier? Investigating Language Representation in
  Multilingual Models
The Less the Merrier? Investigating Language Representation in Multilingual Models
H. Nigatu
A. Tonja
Jugal Kalita
261
6
0
20 Oct 2023
Unsupervised Candidate Answer Extraction through Differentiable
  Masker-Reconstructor Model
Unsupervised Candidate Answer Extraction through Differentiable Masker-Reconstructor Model
Zhuoer Wang
Yicheng Wang
Ziwei Zhu
James Caverlee
230
0
0
19 Oct 2023
A Predictive Factor Analysis of Social Biases and Task-Performance in
  Pretrained Masked Language Models
A Predictive Factor Analysis of Social Biases and Task-Performance in Pretrained Masked Language Models
Yi Zhou
Jose Camacho-Collados
Danushka Bollegala
438
7
0
19 Oct 2023
Boosting Inference Efficiency: Unleashing the Power of Parameter-Shared
  Pre-trained Language Models
Boosting Inference Efficiency: Unleashing the Power of Parameter-Shared Pre-trained Language Models
Weize Chen
Xiaoyue Xu
Xu Han
Yankai Lin
Ruobing Xie
Zhiyuan Liu
Maosong Sun
Jie Zhou
123
0
0
19 Oct 2023
Character-level Chinese Backpack Language Models
Character-level Chinese Backpack Language Models
Hao Sun
John Hewitt
150
1
0
19 Oct 2023
Time-Aware Representation Learning for Time-Sensitive Question Answering
Time-Aware Representation Learning for Time-Sensitive Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Jungbin Son
Alice Oh
154
12
0
19 Oct 2023
Pretraining Language Models with Text-Attributed Heterogeneous Graphs
Pretraining Language Models with Text-Attributed Heterogeneous GraphsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Tao Zou
Le Yu
Yifei Huang
Leilei Sun
Bo Du
AI4CE
282
21
0
19 Oct 2023
DepWiGNN: A Depth-wise Graph Neural Network for Multi-hop Spatial
  Reasoning in Text
DepWiGNN: A Depth-wise Graph Neural Network for Multi-hop Spatial Reasoning in TextConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Shuaiyi Li
Yang Deng
Wai Lam
368
4
0
19 Oct 2023
SPEED: Speculative Pipelined Execution for Efficient Decoding
SPEED: Speculative Pipelined Execution for Efficient Decoding
Coleman Hooper
Sehoon Kim
Hiva Mohammadzadeh
Hasan Genç
Kurt Keutzer
A. Gholami
Y. Shao
204
48
0
18 Oct 2023
DesignQuizzer: A Community-Powered Conversational Agent for Learning
  Visual Design
DesignQuizzer: A Community-Powered Conversational Agent for Learning Visual Design
Zhenhui Peng
Qiaoyi Chen
Zhiyu Shen
Xiaojuan Ma
Antti Oulasvirta
176
12
0
18 Oct 2023
Improving Long Document Topic Segmentation Models With Enhanced
  Coherence Modeling
Improving Long Document Topic Segmentation Models With Enhanced Coherence ModelingConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Hai Yu
Chong Deng
Qinglin Zhang
Jiaqing Liu
Qian Chen
Wen Wang
AI4TS
246
20
0
18 Oct 2023
Chain-of-Thought Tuning: Masked Language Models can also Think Step By
  Step in Natural Language Understanding
Chain-of-Thought Tuning: Masked Language Models can also Think Step By Step in Natural Language UnderstandingConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Caoyun Fan
Jidong Tian
Yitian Li
Wenqing Chen
Hao He
Yaohui Jin
LRM
217
5
0
18 Oct 2023
Disentangling the Linguistic Competence of Privacy-Preserving BERT
Disentangling the Linguistic Competence of Privacy-Preserving BERTBlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2023
Stefan Arnold
Nils Kemmerzell
Annika Schreiner
253
0
0
17 Oct 2023
QADYNAMICS: Training Dynamics-Driven Synthetic QA Diagnostic for
  Zero-Shot Commonsense Question Answering
QADYNAMICS: Training Dynamics-Driven Synthetic QA Diagnostic for Zero-Shot Commonsense Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Haochen Shi
Weiqi Wang
Tianqing Fang
Baixuan Xu
Wenxuan Ding
Xin Liu
Yangqiu Song
266
7
0
17 Oct 2023
Survey of Vulnerabilities in Large Language Models Revealed by
  Adversarial Attacks
Survey of Vulnerabilities in Large Language Models Revealed by Adversarial Attacks
Erfan Shayegani
Md Abdullah Al Mamun
Yu Fu
Pedram Zaree
Yue Dong
Nael B. Abu-Ghazaleh
AAML
476
230
0
16 Oct 2023
PELA: Learning Parameter-Efficient Models with Low-Rank Approximation
PELA: Learning Parameter-Efficient Models with Low-Rank ApproximationComputer Vision and Pattern Recognition (CVPR), 2023
Yangyang Guo
Guangzhi Wang
Mohan S. Kankanhalli
214
8
0
16 Oct 2023
Previous
123...141516...596061
Next
Page 15 of 61
Pageof 61