ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.11942
  4. Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations
v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

International Conference on Learning Representations (ICLR), 2019
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSLAIMat
ArXiv (abs)PDFHTMLGithub (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 3,048 papers shown
Advancing Semantic Caching for LLMs with Domain-Specific Embeddings and Synthetic Data
Advancing Semantic Caching for LLMs with Domain-Specific Embeddings and Synthetic Data
Waris Gill
Justin Cechmanek
Tyler Hutcherson
Srijith Rajamohan
Jen Agarwal
Muhammad Ali Gulzar
Manvinder Singh
Benoit Dion
198
4
0
03 Apr 2025
From Text to Graph: Leveraging Graph Neural Networks for Enhanced Explainability in NLP
From Text to Graph: Leveraging Graph Neural Networks for Enhanced Explainability in NLP
Fabio Yáñez-Romero
Andrés Montoyo
Armando Suárez
Yoan Gutiérrez
Ruslan Mitkov
353
1
0
02 Apr 2025
KernelDNA: Dynamic Kernel Sharing via Decoupled Naive Adapters
KernelDNA: Dynamic Kernel Sharing via Decoupled Naive Adapters
Haiduo Huang
Yadong Zhang
Pengju Ren
Pengju Ren
368
0
0
30 Mar 2025
Evaluating Text-to-Image and Text-to-Video Synthesis with a Conditional Fréchet Distance
Evaluating Text-to-Image and Text-to-Video Synthesis with a Conditional Fréchet Distance
Jaywon Koo
J. Hernandez
Moayed Haji-Ali
Ziyan Yang
Vicente Ordonez
EGVM
323
0
0
27 Mar 2025
Cyborg Data: Merging Human with AI Generated Training Data
Cyborg Data: Merging Human with AI Generated Training Data
Kai North
Christopher Ormerod
193
1
0
26 Mar 2025
Unsupervised Acquisition of Discrete Grammatical Categories
Unsupervised Acquisition of Discrete Grammatical Categories
David Ph. Shakouri
Crit Cremers
Niels O. Schiller
196
1
0
24 Mar 2025
CoMP: Continual Multimodal Pre-training for Vision Foundation Models
CoMP: Continual Multimodal Pre-training for Vision Foundation Models
Yuxiao Chen
L. Meng
Wujian Peng
Zuxuan Wu
Yu-Gang Jiang
VLM
486
5
0
24 Mar 2025
Detection of Somali-written Fake News and Toxic Messages on the Social Media Using Transformer-based Language Models
Detection of Somali-written Fake News and Toxic Messages on the Social Media Using Transformer-based Language Models
Muhidin A. Mohamed
Shuab D. Ahmed
Yahye A. Isse
Hanad M. Mohamed
Fuad Mire Hassan
Houssein A. Assowe
243
1
0
23 Mar 2025
Deceptive Humor: A Synthetic Multilingual Benchmark Dataset for Bridging Fabricated Claims with Humorous Content
Deceptive Humor: A Synthetic Multilingual Benchmark Dataset for Bridging Fabricated Claims with Humorous Content
Sai Kartheek Reddy Kasu
Shankar Biradar
Sunil Saumya
327
1
0
20 Mar 2025
Unified Enhancement of the Generalization and Robustness of Language Models via Bi-Stage Optimization
Unified Enhancement of the Generalization and Robustness of Language Models via Bi-Stage Optimization
Yizhou Sun
Juan Yin
Juan Zhao
Fan Zhang
Yongheng Liu
Hongji Chen
248
0
0
19 Mar 2025
Model Hubs and Beyond: Analyzing Model Popularity, Performance, and Documentation
Model Hubs and Beyond: Analyzing Model Popularity, Performance, and DocumentationInternational Conference on Web and Social Media (ICWSM), 2025
Pritam Kadasi
Sriman Reddy
Srivathsa Vamsi Chaturvedula
Rudranshu Sen
Agnish Saha
Soumavo Sikdar
Sayani Sarkar
Suhani Mittal
Rohit Jindal
Mayank Singh
365
3
0
19 Mar 2025
Sentiment Analysis in SemEval: A Review of Sentiment Identification ApproachesInternational Journal of Electrical and Computer Engineering (IJECE) (IJECE), 2023
Bousselham EL HADDAOUI
R. Chiheb
R. Faizi
A. E. Afia
260
1
0
13 Mar 2025
ARLED: Leveraging LED-based ARMAN Model for Abstractive Summarization of Persian Long Documents
Samira Zangooei
Amirhossein Darmani
Hossein Farahmand Nezhad
Laya Mahmoudi
231
1
0
13 Mar 2025
DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation
DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation
Chen Chen
Rui Qian
Wenze Hu
Tsu-Jui Fu
Jialing Tong
...
Lezhi Li
Bowen Zhang
Alex Schwing
Wei Liu
Yue Yang
380
6
0
13 Mar 2025
ReSi: A Comprehensive Benchmark for Representational Similarity Measures
ReSi: A Comprehensive Benchmark for Representational Similarity MeasuresInternational Conference on Learning Representations (ICLR), 2024
Max Klabunde
Tassilo Wald
Tobias Schumacher
Klaus H. Maier-Hein
Markus Strohmaier
Adriana Iamnitchi
AI4TSVLM
493
10
0
13 Mar 2025
Talk2PC: Enhancing 3D Visual Grounding through LiDAR and Radar Point Clouds Fusion for Autonomous Driving
Talk2PC: Enhancing 3D Visual Grounding through LiDAR and Radar Point Clouds Fusion for Autonomous Driving
Runwei Guan
Tao Huang
Ningwei Ouyang
Shaofeng Liang
Daizong Liu
...
Lianqing Zheng
Ming Xu
Yutao Yue
Guoqiang Mao
Hui Xiong
326
1
0
11 Mar 2025
Large Language Model as Meta-Surrogate for Data-Driven Many-Task Optimization: A Proof-of-Principle Study
Wei Wei
Yue-Jiao Gong
Jun Zhang
300
1
0
11 Mar 2025
A Survey on Knowledge-Oriented Retrieval-Augmented Generation
A Survey on Knowledge-Oriented Retrieval-Augmented Generation
Mingyue Cheng
Yucong Luo
Jie Ouyang
Qiang Liu
Huijie Liu
...
Bohou Zhang
Jiawei Cao
Jie Ma
Daoyu Wang
Tong Xu
3DV
367
37
0
11 Mar 2025
CtrlRAG: Black-box Adversarial Attacks Based on Masked Language Models in Retrieval-Augmented Language Generation
Runqi Sui
AAML
210
3
0
10 Mar 2025
Gender Encoding Patterns in Pretrained Language Model Representations
Mahdi Zakizadeh
Mohammad Taher Pilehvar
418
0
0
09 Mar 2025
Fine-Grained Evaluation for Implicit Discourse Relation Recognition
Xinyi Cai
202
1
0
07 Mar 2025
Layer-Specific Scaling of Positional Encodings for Superior Long-Context Modeling
Zhenghua Wang
Yiran Ding
Changze Lv
Zhibo Xu
Changze Lv
Tianyuan Shi
Xiaoqing Zheng
Qi Zhang
285
1
0
06 Mar 2025
PriFFT: Privacy-preserving Federated Fine-tuning of Large Language Models via Hybrid Secret Sharing
PriFFT: Privacy-preserving Federated Fine-tuning of Large Language Models via Hybrid Secret Sharing
Zhichao You
Xuewen Dong
Ke Cheng
Xutong Mu
Jiaxuan Fu
Shiyang Ma
Qiang Qu
Yulong Shen
FedML
251
0
0
05 Mar 2025
Zero-Shot Complex Question-Answering on Long Scientific Documents
Wanting Wang
RALM
134
1
0
04 Mar 2025
Efficient or Powerful? Trade-offs Between Machine Learning and Deep Learning for Mental Illness Detection on Social MediaScientific Reports (Sci Rep), 2025
Zhanyi Ding
Zhongyan Wang
Yeyubei Zhang
Yuchen Cao
Yunchong Liu
Xiaorui Shen
Yexin Tian
Jianglai Dai
AI4MH
255
19
0
03 Mar 2025
EPEE: Towards Efficient and Effective Foundation Models in Biomedicine
Zaifu Zhan
Shuang Zhou
Huixue Zhou
Ziqiang Liu
Rui Zhang
245
1
0
03 Mar 2025
Neural ODE Transformers: Analyzing Internal Dynamics and Adaptive Fine-tuning
Neural ODE Transformers: Analyzing Internal Dynamics and Adaptive Fine-tuningInternational Conference on Learning Representations (ICLR), 2025
Anh Tong
Thanh Nguyen-Tang
Dongeun Lee
Duc Nguyen
Toan M. Tran
David Hall
Cheongwoong Kang
Jaesik Choi
451
7
0
03 Mar 2025
TimesBERT: A BERT-Style Foundation Model for Time Series Understanding
TimesBERT: A BERT-Style Foundation Model for Time Series Understanding
Haoran Zhang
Yong Liu
Yunzhong Qiu
Haixuan Liu
Zhongyi Pei
Jianmin Wang
Mingsheng Long
AI4TS
230
7
0
28 Feb 2025
Uncertainty Quantification in Retrieval Augmented Question Answering
Uncertainty Quantification in Retrieval Augmented Question Answering
Laura Perez-Beltrachini
Mirella Lapata
RALM
542
4
0
25 Feb 2025
Encryption-Friendly LLM Architecture
Encryption-Friendly LLM ArchitectureInternational Conference on Learning Representations (ICLR), 2024
Donghwan Rho
Taeseong Kim
Minje Park
Jung Woo Kim
Hyunsik Chae
Jung Hee Cheon
Ernest K. Ryu
498
18
0
24 Feb 2025
Towards Typologically Aware Rescoring to Mitigate Unfaithfulness in Lower-Resource Languages
Towards Typologically Aware Rescoring to Mitigate Unfaithfulness in Lower-Resource Languages
Tsan Tsai Chan
Xin Tong
Thi Thu Uyen Hoang
Barbare Tepnadze
Wojciech Stempniak
398
0
0
24 Feb 2025
Reasoning with Latent Thoughts: On the Power of Looped Transformers
Reasoning with Latent Thoughts: On the Power of Looped TransformersInternational Conference on Learning Representations (ICLR), 2025
Nikunj Saunshi
Nishanth Dikkala
Zhiyuan Li
Sanjiv Kumar
Sashank J. Reddi
OffRLLRMAI4CE
431
64
0
24 Feb 2025
Pay Attention to Real World Perturbations! Natural Robustness Evaluation in Machine Reading Comprehension
Pay Attention to Real World Perturbations! Natural Robustness Evaluation in Machine Reading Comprehension
Yulong Wu
Viktor Schlegel
Riza Batista-Navarro
AAML
417
1
0
23 Feb 2025
Iterative Auto-Annotation for Scientific Named Entity Recognition Using BERT-Based Models
Iterative Auto-Annotation for Scientific Named Entity Recognition Using BERT-Based Models
Kartik Gupta
133
0
0
22 Feb 2025
Robust Bias Detection in MLMs and its Application to Human Trait Ratings
Robust Bias Detection in MLMs and its Application to Human Trait RatingsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025
Ingroj Shrestha
Louis Tay
Padmini Srinivasan
350
1
0
21 Feb 2025
Comprehensive Analysis of Transparency and Accessibility of ChatGPT, DeepSeek, And other SoTA Large Language Models
Comprehensive Analysis of Transparency and Accessibility of ChatGPT, DeepSeek, And other SoTA Large Language Models
Ranjan Sapkota
Shaina Raza
Manoj Karkee
266
15
0
21 Feb 2025
A Survey of Model Architectures in Information Retrieval
A Survey of Model Architectures in Information Retrieval
Zhichao Xu
Fengran Mo
Zhiqi Huang
Crystina Zhang
Puxuan Yu
Bei Wang
Jimmy J. Lin
Vivek Srikumar
3DVKELM
582
18
0
20 Feb 2025
Hyper-SET: Designing Transformers via Hyperspherical Energy Minimization
Hyper-SET: Designing Transformers via Hyperspherical Energy Minimization
Yunzhe Hu
Difan Zou
Dong Xu
503
1
0
17 Feb 2025
The underlying structures of self-attention: symmetry, directionality, and emergent dynamics in Transformer training
The underlying structures of self-attention: symmetry, directionality, and emergent dynamics in Transformer training
Matteo Saponati
Pascal Sager
Pau Vilimelis Aceituno
Thilo Stadelmann
Benjamin Grewe
211
4
0
15 Feb 2025
LLM4GNAS: A Large Language Model Based Toolkit for Graph Neural Architecture Search
LLM4GNAS: A Large Language Model Based Toolkit for Graph Neural Architecture Search
Yang Gao
Hong Yang
Y. Chen
Junxian Wu
Peng Zhang
Haishuai Wang
244
3
0
12 Feb 2025
Enhancing Auto-regressive Chain-of-Thought through Loop-Aligned Reasoning
Enhancing Auto-regressive Chain-of-Thought through Loop-Aligned Reasoning
Qifan Yu
Zhenyu He
Sijie Li
Xun Zhou
Jun Zhang
Jingjing Xu
Di He
OffRLLRM
340
13
0
12 Feb 2025
Al-Khwarizmi: Discovering Physical Laws with Foundation Models
Al-Khwarizmi: Discovering Physical Laws with Foundation Models
Christopher E. Mower
Haitham Bou-Ammar
AI4CE
790
4
0
03 Feb 2025
SecPE: Secure Prompt Ensembling for Private and Robust Large Language Models
SecPE: Secure Prompt Ensembling for Private and Robust Large Language ModelsEuropean Conference on Artificial Intelligence (ECAI), 2025
Jiawen Zhang
Kejia Chen
Zunlei Feng
Jian Lou
Weilong Dai
Qingbin Liu
Xiaoyu Yang
AAMLSILMFedML
514
1
0
02 Feb 2025
AdditiveLLM: Large Language Models Predict Defects in Additive Manufacturing
AdditiveLLM: Large Language Models Predict Defects in Additive ManufacturingAdditive Manufacturing Letters (AML), 2025
P. Pak
A. Farimani
AI4CE
246
2
0
29 Jan 2025
Merino: Entropy-driven Design for Generative Language Models on IoT Devices
Merino: Entropy-driven Design for Generative Language Models on IoT DevicesAAAI Conference on Artificial Intelligence (AAAI), 2024
Youpeng Zhao
Ming Lin
Huadong Tang
Qiang Wu
Jun Wang
375
1
0
28 Jan 2025
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and EthicsInformation Fusion (Inf. Fusion), 2023
Kai He
Rui Mao
Qika Lin
Yucheng Ruan
Xiang Lan
Mengling Feng
Xiaoshi Zhong
LM&MAAILaw
726
269
0
28 Jan 2025
A Review on Self-Supervised Learning for Time Series Anomaly Detection: Recent Advances and Open Challenges
A Review on Self-Supervised Learning for Time Series Anomaly Detection: Recent Advances and Open Challenges
Aitor Sánchez-Ferrera
Borja Calvo
Jose A. Lozano
AI4TS
445
4
0
25 Jan 2025
EDoRA: Efficient Weight-Decomposed Low-Rank Adaptation via Singular Value Decomposition
EDoRA: Efficient Weight-Decomposed Low-Rank Adaptation via Singular Value Decomposition
Hamid Nasiri
Peter Garraghan
219
3
0
21 Jan 2025
Reference-free Evaluation Metrics for Text Generation: A Survey
Reference-free Evaluation Metrics for Text Generation: A Survey
Takumi Ito
Kees van Deemter
Jun Suzuki
ELM
345
9
0
21 Jan 2025
A Contrastive Framework with User, Item and Review Alignment for Recommendation
A Contrastive Framework with User, Item and Review Alignment for RecommendationWeb Search and Data Mining (WSDM), 2025
Hoang V. Dong
Yuan Fang
Hady W. Lauw
679
8
0
21 Jan 2025
Previous
12345...596061
Next