Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
1909.11942
Cited By
v1
v2
v3
v4
v5
v6 (latest)
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
International Conference on Learning Representations (ICLR), 2019
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Github (3271★)
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 3,048 papers shown
Title
Advancing Semantic Caching for LLMs with Domain-Specific Embeddings and Synthetic Data
Waris Gill
Justin Cechmanek
Tyler Hutcherson
Srijith Rajamohan
Jen Agarwal
Muhammad Ali Gulzar
Manvinder Singh
Benoit Dion
174
3
0
03 Apr 2025
From Text to Graph: Leveraging Graph Neural Networks for Enhanced Explainability in NLP
Fabio Yáñez-Romero
Andrés Montoyo
Armando Suárez
Yoan Gutiérrez
Ruslan Mitkov
325
0
0
02 Apr 2025
KernelDNA: Dynamic Kernel Sharing via Decoupled Naive Adapters
Haiduo Huang
Yadong Zhang
Pengju Ren
Pengju Ren
331
0
0
30 Mar 2025
Evaluating Text-to-Image and Text-to-Video Synthesis with a Conditional Fréchet Distance
Jaywon Koo
J. Hernandez
Moayed Haji-Ali
Ziyan Yang
Vicente Ordonez
EGVM
305
0
0
27 Mar 2025
Cyborg Data: Merging Human with AI Generated Training Data
Kai North
Christopher Ormerod
184
1
0
26 Mar 2025
Unsupervised Acquisition of Discrete Grammatical Categories
David Ph. Shakouri
Crit Cremers
Niels O. Schiller
157
1
0
24 Mar 2025
CoMP: Continual Multimodal Pre-training for Vision Foundation Models
Yuxiao Chen
L. Meng
Wujian Peng
Zuxuan Wu
Yu-Gang Jiang
VLM
442
4
0
24 Mar 2025
Detection of Somali-written Fake News and Toxic Messages on the Social Media Using Transformer-based Language Models
Muhidin A. Mohamed
Shuab D. Ahmed
Yahye A. Isse
Hanad M. Mohamed
Fuad Mire Hassan
Houssein A. Assowe
202
1
0
23 Mar 2025
Deceptive Humor: A Synthetic Multilingual Benchmark Dataset for Bridging Fabricated Claims with Humorous Content
Sai Kartheek Reddy Kasu
Shankar Biradar
Sunil Saumya
303
1
0
20 Mar 2025
Unified Enhancement of the Generalization and Robustness of Language Models via Bi-Stage Optimization
Yizhou Sun
Juan Yin
Juan Zhao
Fan Zhang
Yongheng Liu
Hongji Chen
225
0
0
19 Mar 2025
Model Hubs and Beyond: Analyzing Model Popularity, Performance, and Documentation
International Conference on Web and Social Media (ICWSM), 2025
Pritam Kadasi
Sriman Reddy
Srivathsa Vamsi Chaturvedula
Rudranshu Sen
Agnish Saha
Soumavo Sikdar
Sayani Sarkar
Suhani Mittal
Rohit Jindal
Mayank Singh
345
3
0
19 Mar 2025
Sentiment Analysis in SemEval: A Review of Sentiment Identification Approaches
International Journal of Electrical and Computer Engineering (IJECE) (IJECE), 2023
Bousselham EL HADDAOUI
R. Chiheb
R. Faizi
A. E. Afia
233
1
0
13 Mar 2025
ARLED: Leveraging LED-based ARMAN Model for Abstractive Summarization of Persian Long Documents
Samira Zangooei
Amirhossein Darmani
Hossein Farahmand Nezhad
Laya Mahmoudi
189
1
0
13 Mar 2025
DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation
Chen Chen
Rui Qian
Wenze Hu
Tsu-Jui Fu
Jialing Tong
...
Lezhi Li
Bowen Zhang
Alex Schwing
Wei Liu
Yue Yang
355
6
0
13 Mar 2025
ReSi: A Comprehensive Benchmark for Representational Similarity Measures
International Conference on Learning Representations (ICLR), 2024
Max Klabunde
Tassilo Wald
Tobias Schumacher
Klaus H. Maier-Hein
Markus Strohmaier
Adriana Iamnitchi
AI4TS
VLM
470
10
0
13 Mar 2025
Talk2PC: Enhancing 3D Visual Grounding through LiDAR and Radar Point Clouds Fusion for Autonomous Driving
Runwei Guan
Tao Huang
Ningwei Ouyang
Shaofeng Liang
Daizong Liu
...
Lianqing Zheng
Ming Xu
Yutao Yue
Guoqiang Mao
Hui Xiong
304
1
0
11 Mar 2025
Large Language Model as Meta-Surrogate for Data-Driven Many-Task Optimization: A Proof-of-Principle Study
Wei Wei
Yue-Jiao Gong
Jun Zhang
267
1
0
11 Mar 2025
A Survey on Knowledge-Oriented Retrieval-Augmented Generation
Mingyue Cheng
Yucong Luo
Jie Ouyang
Qiang Liu
Huijie Liu
...
Bohou Zhang
Jiawei Cao
Jie Ma
Daoyu Wang
Tong Xu
3DV
354
33
0
11 Mar 2025
CtrlRAG: Black-box Adversarial Attacks Based on Masked Language Models in Retrieval-Augmented Language Generation
Runqi Sui
AAML
176
3
0
10 Mar 2025
Gender Encoding Patterns in Pretrained Language Model Representations
Mahdi Zakizadeh
Mohammad Taher Pilehvar
399
0
0
09 Mar 2025
Fine-Grained Evaluation for Implicit Discourse Relation Recognition
Xinyi Cai
185
1
0
07 Mar 2025
Layer-Specific Scaling of Positional Encodings for Superior Long-Context Modeling
Zhenghua Wang
Yiran Ding
Changze Lv
Zhibo Xu
Changze Lv
Tianyuan Shi
Xiaoqing Zheng
Qi Zhang
254
1
0
06 Mar 2025
PriFFT: Privacy-preserving Federated Fine-tuning of Large Language Models via Hybrid Secret Sharing
Zhichao You
Xuewen Dong
Ke Cheng
Xutong Mu
Jiaxuan Fu
Shiyang Ma
Qiang Qu
Yulong Shen
FedML
238
0
0
05 Mar 2025
Zero-Shot Complex Question-Answering on Long Scientific Documents
Wanting Wang
RALM
133
1
0
04 Mar 2025
Efficient or Powerful? Trade-offs Between Machine Learning and Deep Learning for Mental Illness Detection on Social Media
Scientific Reports (Sci Rep), 2025
Zhanyi Ding
Zhongyan Wang
Yeyubei Zhang
Yuchen Cao
Yunchong Liu
Xiaorui Shen
Yexin Tian
Jianglai Dai
AI4MH
235
13
0
03 Mar 2025
EPEE: Towards Efficient and Effective Foundation Models in Biomedicine
Zaifu Zhan
Shuang Zhou
Huixue Zhou
Ziqiang Liu
Rui Zhang
233
1
0
03 Mar 2025
Neural ODE Transformers: Analyzing Internal Dynamics and Adaptive Fine-tuning
International Conference on Learning Representations (ICLR), 2025
Anh Tong
Thanh Nguyen-Tang
Dongeun Lee
Duc Nguyen
Toan M. Tran
David Hall
Cheongwoong Kang
Jaesik Choi
424
7
0
03 Mar 2025
TimesBERT: A BERT-Style Foundation Model for Time Series Understanding
Haoran Zhang
Yong Liu
Yunzhong Qiu
Haixuan Liu
Zhongyi Pei
Jianmin Wang
Mingsheng Long
AI4TS
210
7
0
28 Feb 2025
Uncertainty Quantification in Retrieval Augmented Question Answering
Laura Perez-Beltrachini
Mirella Lapata
RALM
507
4
0
25 Feb 2025
Encryption-Friendly LLM Architecture
International Conference on Learning Representations (ICLR), 2024
Donghwan Rho
Taeseong Kim
Minje Park
Jung Woo Kim
Hyunsik Chae
Jung Hee Cheon
Ernest K. Ryu
474
17
0
24 Feb 2025
Towards Typologically Aware Rescoring to Mitigate Unfaithfulness in Lower-Resource Languages
Tsan Tsai Chan
Xin Tong
Thi Thu Uyen Hoang
Barbare Tepnadze
Wojciech Stempniak
368
0
0
24 Feb 2025
Reasoning with Latent Thoughts: On the Power of Looped Transformers
International Conference on Learning Representations (ICLR), 2025
Nikunj Saunshi
Nishanth Dikkala
Zhiyuan Li
Sanjiv Kumar
Sashank J. Reddi
OffRL
LRM
AI4CE
411
62
0
24 Feb 2025
Pay Attention to Real World Perturbations! Natural Robustness Evaluation in Machine Reading Comprehension
Yulong Wu
Viktor Schlegel
Riza Batista-Navarro
AAML
396
1
0
23 Feb 2025
Iterative Auto-Annotation for Scientific Named Entity Recognition Using BERT-Based Models
Kartik Gupta
123
0
0
22 Feb 2025
Robust Bias Detection in MLMs and its Application to Human Trait Ratings
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Ingroj Shrestha
Louis Tay
Padmini Srinivasan
319
1
0
21 Feb 2025
Comprehensive Analysis of Transparency and Accessibility of ChatGPT, DeepSeek, And other SoTA Large Language Models
Ranjan Sapkota
Shaina Raza
Manoj Karkee
245
15
0
21 Feb 2025
A Survey of Model Architectures in Information Retrieval
Zhichao Xu
Fengran Mo
Zhiqi Huang
Crystina Zhang
Puxuan Yu
Bei Wang
Jimmy J. Lin
Vivek Srikumar
3DV
KELM
570
17
0
20 Feb 2025
Hyper-SET: Designing Transformers via Hyperspherical Energy Minimization
Yunzhe Hu
Difan Zou
Dong Xu
467
1
0
17 Feb 2025
The underlying structures of self-attention: symmetry, directionality, and emergent dynamics in Transformer training
Matteo Saponati
Pascal Sager
Pau Vilimelis Aceituno
Thilo Stadelmann
Benjamin Grewe
167
4
0
15 Feb 2025
LLM4GNAS: A Large Language Model Based Toolkit for Graph Neural Architecture Search
Yang Gao
Hong Yang
Y. Chen
Junxian Wu
Peng Zhang
Haishuai Wang
232
3
0
12 Feb 2025
Enhancing Auto-regressive Chain-of-Thought through Loop-Aligned Reasoning
Qifan Yu
Zhenyu He
Sijie Li
Xun Zhou
Jun Zhang
Jingjing Xu
Di He
OffRL
LRM
326
13
0
12 Feb 2025
Al-Khwarizmi: Discovering Physical Laws with Foundation Models
Christopher E. Mower
Haitham Bou-Ammar
AI4CE
762
4
0
03 Feb 2025
SecPE: Secure Prompt Ensembling for Private and Robust Large Language Models
European Conference on Artificial Intelligence (ECAI), 2025
Jiawen Zhang
Kejia Chen
Zunlei Feng
Jian Lou
Weilong Dai
Qingbin Liu
Xiaoyu Yang
AAML
SILM
FedML
474
1
0
02 Feb 2025
AdditiveLLM: Large Language Models Predict Defects in Additive Manufacturing
Additive Manufacturing Letters (AML), 2025
P. Pak
A. Farimani
AI4CE
226
2
0
29 Jan 2025
Merino: Entropy-driven Design for Generative Language Models on IoT Devices
AAAI Conference on Artificial Intelligence (AAAI), 2024
Youpeng Zhao
Ming Lin
Huadong Tang
Qiang Wu
Jun Wang
365
1
0
28 Jan 2025
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics
Information Fusion (Inf. Fusion), 2023
Kai He
Rui Mao
Qika Lin
Yucheng Ruan
Xiang Lan
Mengling Feng
Xiaoshi Zhong
LM&MA
AILaw
662
262
0
28 Jan 2025
A Review on Self-Supervised Learning for Time Series Anomaly Detection: Recent Advances and Open Challenges
Aitor Sánchez-Ferrera
Borja Calvo
Jose A. Lozano
AI4TS
421
4
0
25 Jan 2025
EDoRA: Efficient Weight-Decomposed Low-Rank Adaptation via Singular Value Decomposition
Hamid Nasiri
Peter Garraghan
202
3
0
21 Jan 2025
Reference-free Evaluation Metrics for Text Generation: A Survey
Takumi Ito
Kees van Deemter
Jun Suzuki
ELM
322
8
0
21 Jan 2025
A Contrastive Framework with User, Item and Review Alignment for Recommendation
Web Search and Data Mining (WSDM), 2025
Hoang V. Dong
Yuan Fang
Hady W. Lauw
645
8
0
21 Jan 2025
Previous
1
2
3
4
5
...
59
60
61
Next