ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.11942
  4. Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations
v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

International Conference on Learning Representations (ICLR), 2019
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSLAIMat
ArXiv (abs)PDFHTMLGithub (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 3,049 papers shown
The Future of Combating Rumors? Retrieval, Discrimination, and
  Generation
The Future of Combating Rumors? Retrieval, Discrimination, and Generation
Junhao Xu
Longdi Xian
Zening Liu
Mingliang Chen
Qiuyang Yin
Fenghua Song
161
3
0
29 Mar 2024
New Semantic Task for the French Spoken Language Understanding MEDIA
  Benchmark
New Semantic Task for the French Spoken Language Understanding MEDIA Benchmark
Nadege Alavoine
G. Laperriere
Christophe Servan
Sahar Ghannay
Sophie Rosset
VLM
320
2
0
28 Mar 2024
A Benchmark Evaluation of Clinical Named Entity Recognition in French
A Benchmark Evaluation of Clinical Named Entity Recognition in French
N. Bannour
Christophe Servan
Aurélie Névéol
Xavier Tannier
174
1
0
28 Mar 2024
mALBERT: Is a Compact Multilingual BERT Model Still Worth It?
mALBERT: Is a Compact Multilingual BERT Model Still Worth It?
Christophe Servan
Sahar Ghannay
Sophie Rosset
164
1
0
27 Mar 2024
GPTs and Language Barrier: A Cross-Lingual Legal QA Examination
GPTs and Language Barrier: A Cross-Lingual Legal QA Examination
Ha-Thanh Nguyen
Hiroaki Yamada
Ken Satoh
ELMAILaw
107
0
0
26 Mar 2024
REFeREE: A REference-FREE Model-Based Metric for Text Simplification
REFeREE: A REference-FREE Model-Based Metric for Text Simplification
Yichen Huang
Ekaterina Kochmar
204
4
0
26 Mar 2024
A Survey on Deep Learning and State-of-the-art Applications
A Survey on Deep Learning and State-of-the-art Applications
Mohd Halim Mohd Noor
A. O. Ige
AILawMLAU
211
0
0
26 Mar 2024
Opportunities and challenges in the application of large artificial
  intelligence models in radiology
Opportunities and challenges in the application of large artificial intelligence models in radiology
Liangrui Pan
Zhenyu Zhao
Ying Lu
Kewei Tang
Liyong Fu
Qingchun Liang
Shaoliang Peng
LM&MAMedImAI4CE
272
12
0
24 Mar 2024
VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for
  Vietnamese Natural Language Understanding
VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for Vietnamese Natural Language Understanding
Phong Nguyen-Thuan Do
Son Quoc Tran
Phu Gia Hoang
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
ELM
232
7
0
23 Mar 2024
Enhancing Traffic Incident Management with Large Language Models: A
  Hybrid Machine Learning Approach for Severity Classification
Enhancing Traffic Incident Management with Large Language Models: A Hybrid Machine Learning Approach for Severity Classification
Artur Grigorev
Khaled Saleh
Yuming Ou
Adriana-Simona Mihaita
252
9
0
20 Mar 2024
How Gender Interacts with Political Values: A Case Study on Czech BERT
  Models
How Gender Interacts with Political Values: A Case Study on Czech BERT Models
Adnan Al Ali
Jindvrich Libovický
161
1
0
20 Mar 2024
Adaptive Ensembles of Fine-Tuned Transformers for LLM-Generated Text
  Detection
Adaptive Ensembles of Fine-Tuned Transformers for LLM-Generated Text Detection
Zhixin Lai
Xuesheng Zhang
Suiyao Chen
DeLMO
193
46
0
20 Mar 2024
WaterVG: Waterway Visual Grounding based on Text-Guided Vision and
  mmWave Radar
WaterVG: Waterway Visual Grounding based on Text-Guided Vision and mmWave Radar
Runwei Guan
Liye Jia
Fengyufan Yang
Shanliang Yao
Erick Purwanto
...
Eng Gee Lim
Jeremy S. Smith
Ka Lok Man
Xuming Hu
Yutao Yue
378
19
0
19 Mar 2024
Simple Hack for Transformers against Heavy Long-Text Classification on a
  Time- and Memory-Limited GPU Service
Simple Hack for Transformers against Heavy Long-Text Classification on a Time- and Memory-Limited GPU Service
Mirza Alim Mutasodirin
Radityo Eko Prasojo
Achmad F. Abka
Hanif Rasyidi
VLM
157
0
0
19 Mar 2024
Improving Generalizability of Extracting Social Determinants of Health
  Using Large Language Models through Prompt-tuning
Improving Generalizability of Extracting Social Determinants of Health Using Large Language Models through Prompt-tuning
C.A.I. Peng
Zehao Yu
Kaleb E. Smith
W. Lo‐Ciganic
Jiang Bian
Yonghui Wu
LM&MA
146
2
0
19 Mar 2024
Large language models in 6G security: challenges and opportunities
Large language models in 6G security: challenges and opportunities
Tri Nguyen
Huong Nguyen
Ahmad Ijaz
Saeid Sheikhi
Athanasios V. Vasilakos
Panos Kostakos
ELM
274
27
0
18 Mar 2024
SSCAE -- Semantic, Syntactic, and Context-aware natural language
  Adversarial Examples generator
SSCAE -- Semantic, Syntactic, and Context-aware natural language Adversarial Examples generatorIEEE Transactions on Dependable and Secure Computing (IEEE TDSC), 2024
J. Asl
Mohammad H. Rafiei
Manar Alohaly
Daniel Takabi
AAMLSILM
190
5
0
18 Mar 2024
Metaphor Understanding Challenge Dataset for LLMs
Metaphor Understanding Challenge Dataset for LLMsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Xiaoyu Tong
Rochelle Choenni
Martha Lewis
Ekaterina Shutova
174
29
0
18 Mar 2024
Semantic-Enhanced Representation Learning for Road Networks with
  Temporal Dynamics
Semantic-Enhanced Representation Learning for Road Networks with Temporal DynamicsIEEE Transactions on Mobile Computing (IEEE TMC), 2024
Yile Chen
Xiucheng Li
Gao Cong
Zhifeng Bao
Cheng Long
203
7
0
18 Mar 2024
A Modified Word Saliency-Based Adversarial Attack on Text Classification
  Models
A Modified Word Saliency-Based Adversarial Attack on Text Classification Models
Hetvi Waghela
Sneha Rakshit
Jaydip Sen
AAML
202
11
0
17 Mar 2024
Rethinking Multi-view Representation Learning via Distilled
  Disentangling
Rethinking Multi-view Representation Learning via Distilled Disentangling
Guanzhou Ke
Bo Wang
Xiaoli Wang
Shengfeng He
378
22
0
16 Mar 2024
ATOM: Asynchronous Training of Massive Models for Deep Learning in a
  Decentralized Environment
ATOM: Asynchronous Training of Massive Models for Deep Learning in a Decentralized Environment
Xiaofeng Wu
Jia Rao
Wei Chen
213
5
0
15 Mar 2024
ST-LDM: A Universal Framework for Text-Grounded Object Generation in
  Real Images
ST-LDM: A Universal Framework for Text-Grounded Object Generation in Real ImagesEuropean Conference on Computer Vision (ECCV), 2024
Xiangtian Xue
Jiasong Wu
Youyong Kong
L. Senhadji
Huazhong Shu
DiffM
149
1
0
15 Mar 2024
FBPT: A Fully Binary Point Transformer
FBPT: A Fully Binary Point TransformerIEEE International Conference on Robotics and Automation (ICRA), 2024
Zhixing Hou
Yuzhang Shang
Yan Yan
MQ
233
1
0
15 Mar 2024
depyf: Open the Opaque Box of PyTorch Compiler for Machine Learning
  Researchers
depyf: Open the Opaque Box of PyTorch Compiler for Machine Learning Researchers
Kaichao You
Runsheng Bai
Meng Cao
Jianmin Wang
Ion Stoica
Mingsheng Long
VLM
234
0
0
14 Mar 2024
Rethinking Referring Object Removal
Rethinking Referring Object Removal
Xiangtian Xue
Jiasong Wu
Youyong Kong
L. Senhadji
Huazhong Shu
DiffM
203
0
0
14 Mar 2024
Language models scale reliably with over-training and on downstream
  tasks
Language models scale reliably with over-training and on downstream tasksInternational Conference on Learning Representations (ICLR), 2024
S. Gadre
Georgios Smyrnis
Vaishaal Shankar
Suchin Gururangan
Mitchell Wortsman
...
Y. Carmon
Achal Dave
Reinhard Heckel
Niklas Muennighoff
Ludwig Schmidt
ALMELMLRM
351
77
0
13 Mar 2024
Masked AutoDecoder is Effective Multi-Task Vision Generalist
Masked AutoDecoder is Effective Multi-Task Vision GeneralistComputer Vision and Pattern Recognition (CVPR), 2024
Han Qiu
Jiaxing Huang
Shiyang Feng
Lewei Lu
Xiaoqin Zhang
Shijian Lu
217
5
0
12 Mar 2024
A Logical Pattern Memory Pre-trained Model for Entailment Tree
  Generation
A Logical Pattern Memory Pre-trained Model for Entailment Tree GenerationInternational Conference on Language Resources and Evaluation (LREC), 2024
Li Yuan
Yi Cai
Haopeng Ren
Jiexin Wang
LRM
206
8
0
11 Mar 2024
LORS: Low-rank Residual Structure for Parameter-Efficient Network
  Stacking
LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking
Jialin Li
Qiang Nie
Weifu Fu
Yuhuan Lin
Guangpin Tao
Yong-Jin Liu
Chengjie Wang
239
7
0
07 Mar 2024
On the Effectiveness of Distillation in Mitigating Backdoors in
  Pre-trained Encoder
On the Effectiveness of Distillation in Mitigating Backdoors in Pre-trained Encoder
Tingxu Han
Shenghan Huang
Ziqi Ding
Weisong Sun
Yebo Feng
...
Hanwei Qian
Cong Wu
Quanjun Zhang
Yang Liu
Zhenyu Chen
185
10
0
06 Mar 2024
A General and Flexible Multi-concept Parsing Framework for Multilingual Semantic Matching
Dongyu Yao
Asaad Alghamdi
Qingrong Xia
Xiaoye Qu
Xinyu Duan
Zhefeng Wang
Yi Zheng
Baoxing Huai
Peilun Cheng
Zhou Zhao
269
0
0
05 Mar 2024
Found in the Middle: How Language Models Use Long Contexts Better via
  Plug-and-Play Positional Encoding
Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding
Zhenyu Zhang
Runjin Chen
Shiwei Liu
Zhewei Yao
Olatunji Ruwase
Beidi Chen
Xiaoxia Wu
Zinan Lin
298
65
0
05 Mar 2024
A Tutorial on the Pretrain-Finetune Paradigm for Natural Language
  Processing
A Tutorial on the Pretrain-Finetune Paradigm for Natural Language Processing
Yu Wang
Wen Qu
229
0
0
04 Mar 2024
Vision-Language Models for Medical Report Generation and Visual Question
  Answering: A Review
Vision-Language Models for Medical Report Generation and Visual Question Answering: A Review
Iryna Hartsock
Ghulam Rasool
381
170
0
04 Mar 2024
How does Architecture Influence the Base Capabilities of Pre-trained
  Language Models? A Case Study Based on FFN-Wider Transformer Models
How does Architecture Influence the Base Capabilities of Pre-trained Language Models? A Case Study Based on FFN-Wider Transformer Models
Xin Lu
Yanyan Zhao
Bing Qin
171
0
0
04 Mar 2024
Pseudo-Label Calibration Semi-supervised Multi-Modal Entity Alignment
Pseudo-Label Calibration Semi-supervised Multi-Modal Entity Alignment
Luyao Wang
Pengnian Qi
Xigang Bao
Chunlai Zhou
Biao Qin
209
17
0
02 Mar 2024
ATP: Enabling Fast LLM Serving via Attention on Top Principal Keys
ATP: Enabling Fast LLM Serving via Attention on Top Principal Keys
Yue Niu
Saurav Prakash
Salman Avestimehr
158
1
0
01 Mar 2024
Hierarchical Indexing for Retrieval-Augmented Opinion Summarization
Hierarchical Indexing for Retrieval-Augmented Opinion Summarization
Tom Hosking
Hao Tang
Mirella Lapata
315
8
0
01 Mar 2024
Rethinking Tokenization: Crafting Better Tokenizers for Large Language
  Models
Rethinking Tokenization: Crafting Better Tokenizers for Large Language Models
Jinbiao Yang
LLMAG
263
13
0
01 Mar 2024
Cause and Effect: Can Large Language Models Truly Understand Causality?
Cause and Effect: Can Large Language Models Truly Understand Causality?
Swagata Ashwani
Kshiteesh Hegde
Nishith Reddy Mannuru
Mayank Jindal
Dushyant Singh Sengar
Krishna Chaitanya Rao Kathala
Dishant Banga
Vinija Jain
Vasu Sharma
LRM
284
41
0
28 Feb 2024
Securing Reliability: A Brief Overview on Enhancing In-Context Learning
  for Foundation Models
Securing Reliability: A Brief Overview on Enhancing In-Context Learning for Foundation Models
Yunpeng Huang
Yaonan Gu
Jingwei Xu
Zhihong Zhu
Zhaorun Chen
Xiaoxing Ma
232
4
0
27 Feb 2024
Fine-Grained Natural Language Inference Based Faithfulness Evaluation
  for Diverse Summarisation Tasks
Fine-Grained Natural Language Inference Based Faithfulness Evaluation for Diverse Summarisation Tasks
Huajian Zhang
Yumo Xu
Laura Perez-Beltrachini
HILM
205
24
0
27 Feb 2024
Feature Re-Embedding: Towards Foundation Model-Level Performance in
  Computational Pathology
Feature Re-Embedding: Towards Foundation Model-Level Performance in Computational Pathology
Wenhao Tang
Fengtao Zhou
Shengyue Huang
Xiang Zhu
Yi Zhang
Bo Liu
397
67
0
27 Feb 2024
Generating Effective Ensembles for Sentiment Analysis
Generating Effective Ensembles for Sentiment Analysis
Itay Etelis
Avi Rosenfeld
Abraham Itzhak Weinberg
David Sarne
139
4
0
26 Feb 2024
Unveiling Vulnerability of Self-Attention
Unveiling Vulnerability of Self-Attention
Khai Jiet Liong
Hongqiu Wu
Haizhen Zhao
192
0
0
26 Feb 2024
Layer-wise Regularized Dropout for Neural Language Models
Layer-wise Regularized Dropout for Neural Language Models
Shiwen Ni
Min Yang
Ruifeng Xu
Chengming Li
Xiping Hu
126
0
0
26 Feb 2024
QASE Enhanced PLMs: Improved Control in Text Generation for MRC
QASE Enhanced PLMs: Improved Control in Text Generation for MRC
Lin Ai
Zheng Hui
Zizhou Liu
Julia Hirschberg
148
0
0
26 Feb 2024
OAG-Bench: A Human-Curated Benchmark for Academic Graph Mining
OAG-Bench: A Human-Curated Benchmark for Academic Graph Mining
Fanjin Zhang
Shijie Shi
Yifan Zhu
Bo Chen
Yukuo Cen
...
Huihui Yuan
Jian Song
Xiaoyan Li
Yuxiao Dong
Jie Tang
337
26
0
24 Feb 2024
Second-Order Fine-Tuning without Pain for LLMs:A Hessian Informed Zeroth-Order Optimizer
Second-Order Fine-Tuning without Pain for LLMs:A Hessian Informed Zeroth-Order Optimizer
Yanjun Zhao
Sizhe Dang
Haishan Ye
Guang Dai
Yi Qian
Ivor W.Tsang
677
29
0
23 Feb 2024
Previous
123...101112...596061
Next
Page 11 of 61
Pageof 61