Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1909.11942
Cited By
v1
v2
v3
v4
v5
v6 (latest)
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
International Conference on Learning Representations (ICLR), 2019
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Github (3271★)
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 3,048 papers shown
Advancing Mental Disorder Detection: A Comparative Evaluation of Transformer and LSTM Architectures on Social Media
Annual International Computer Software and Applications Conference (COMPSAC), 2025
Khalid Hasan
Jamil Saquer
Mukulika Ghosh
AI4MH
125
6
0
17 Jul 2025
Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation
Sangmin Bae
Yujin Kim
Reza Bayat
S. Kim
Jiyoun Ha
...
Adam Fisch
Hrayr Harutyunyan
Ziwei Ji
Aaron Courville
Se-Young Yun
MoE
283
24
0
14 Jul 2025
Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations
A. Bochkov
242
2
0
07 Jul 2025
DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy
Ming Dai
Wenxuan Cheng
Jiang-Jiang Liu
Sen Yang
Wenxiao Cai
Yanpeng Sun
Wankou Yang
200
6
0
02 Jul 2025
Health Sentinel: An AI Pipeline For Real-time Disease Outbreak Detection
Devesh Pant
Rishi Raj Grandhe
Vipin Samaria
Mukul Paul
Sudhir Kumar
...
Himanshu Chauhan
Pranay Verma
Neha Khandelwal
Soma S Dhavala
Minesh Mathew
94
0
0
24 Jun 2025
Beyond Parameters: Exploring Virtual Logic Depth for Scaling Laws
Ruike Zhu
Hanwen Zhang
Kevin Li
Tianyu Shi
Y. Duan
Chi Wang
Tianyi Zhou
Arindam Banerjee
Zengyi Qin
VLM
LRM
193
1
0
23 Jun 2025
All is Not Lost: LLM Recovery without Checkpoints
Nikolay Blagoev
Oğuzhan Ersoy
Lydia Yiyu Chen
219
1
0
18 Jun 2025
Enhancing Hyperbole and Metaphor Detection with Their Bidirectional Dynamic Interaction and Emotion Knowledge
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Li Zheng
Sihang Wang
Hao Fei
Zuquan Peng
Fei Li
Jianming Fu
Chong Teng
Donghong Ji
190
2
0
18 Jun 2025
FASCIST-O-METER: Classifier for Neo-fascist Discourse Online
Rudy Alexandro Garrido Veliz
Martin Semmann
Chris Biemann
Seid Muhie Yimam
250
0
0
12 Jun 2025
Latent Multi-Head Attention for Small Language Models
Sushant Mehta
Raj Abhijit Dandekar
Rajat Dandekar
Sreedath Panat
RALM
178
2
0
11 Jun 2025
semantic-features: A User-Friendly Tool for Studying Contextual Word Embeddings in Interpretable Semantic Spaces
Jwalanthi Ranganathan
Rohan Jha
Kanishka Misra
Kyle Mahowald
202
1
0
06 Jun 2025
Towards Efficient Multi-LLM Inference: Characterization and Analysis of LLM Routing and Hierarchical Techniques
Adarsh Prasad Behera
J. Champati
Roberto Morabito
Sasu Tarkoma
J. Gross
200
5
0
06 Jun 2025
Training-free AI for Earth Observation Change Detection using Physics Aware Neuromorphic Networks
Scientific Reports (Sci Rep), 2025
Stephen Smith
Cormac Purcell
Zdenka Kuncic
265
1
0
04 Jun 2025
MCFNet: A Multimodal Collaborative Fusion Network for Fine-Grained Semantic Classification
Yang Qiao
Xiaoyu Zhong
Xiaofeng Gu
Zhiguo Yu
236
0
0
29 May 2025
Improving QA Efficiency with DistilBERT: Fine-Tuning and Inference on mobile Intel CPUs
Ngeyen Yinkfu
118
0
0
28 May 2025
VeriTrail: Closed-Domain Hallucination Detection with Traceability
Dasha Metropolitansky
Jonathan Larson
HILM
252
1
0
27 May 2025
Unfolding A Few Structures for The Many: Memory-Efficient Compression of Conformer and Speech Foundation Models
Zhaoqing Li
Haoning Xu
Xurong Xie
Zengrui Jin
Tianzi Wang
Xunying Liu
192
0
0
27 May 2025
Discrete Markov Bridge
Hengli Li
Yuxuan Wang
Song-Chun Zhu
Ying Nian Wu
Zilong Zheng
DiffM
214
0
0
26 May 2025
Recurrent Self-Attention Dynamics: An Energy-Agnostic Perspective from Jacobians
Akiyoshi Tomihari
Ryo Karakida
355
2
0
26 May 2025
ResSVD: Residual Compensated SVD for Large Language Model Compression
Haolei Bai
Siyong Jian
Tuo Liang
Yu Yin
Huan Wang
321
4
0
26 May 2025
Reasoning Beyond Language: A Comprehensive Survey on Latent Chain-of-Thought Reasoning
Xinghao Chen
Anhao Zhao
Heming Xia
Xuan Lu
Hanlin Wang
Yanjun Chen
Wei Zhang
Jian Wang
W. Li
Xiaoyu Shen
ReLM
LRM
378
18
0
22 May 2025
FS-DAG: Few Shot Domain Adapting Graph Networks for Visually Rich Document Understanding
International Conference on Computational Linguistics (COLING), 2025
Amit Agarwal
Srikant Panda
Kulbhushan Pachauri
206
12
0
22 May 2025
Leveraging Large Language Models for Command Injection Vulnerability Analysis in Python: An Empirical Study on Popular Open-Source Projects
Yuxuan Wang
Jingshu Chen
Qingyang Wang
ELM
189
0
0
21 May 2025
SDLog: A Deep Learning Framework for Detecting Sensitive Information in Software Logs
Roozbeh Aghili
Xingfang Wu
Foutse Khomh
Heng Li
224
0
0
20 May 2025
Large Language Models and Their Applications in Roadway Safety and Mobility Enhancement: A Comprehensive Review
Muhammad Monjurul Karim
Yan Shi
Shucheng Zhang
Bingzhang Wang
Mehrdad Nasri
Yinhai Wang
188
11
0
19 May 2025
Self-Supervised Learning for Image Segmentation: A Comprehensive Survey
Thangarajah Akilan
Nusrat Jahan
Tianlei Wang
SSL
358
1
0
19 May 2025
On Membership Inference Attacks in Knowledge Distillation
Ziyao Cui
Minxing Zhang
Jian Pei
249
2
0
17 May 2025
Class Distillation with Mahalanobis Contrast: An Efficient Training Paradigm for Pragmatic Language Understanding Tasks
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Chenlu Wang
Weimin Lyu
Ritwik Banerjee
204
0
0
17 May 2025
Parallel Scaling Law for Language Models
Mouxiang Chen
Binyuan Hui
Zeyu Cui
Jiaxi Yang
Dayiheng Liu
Jianling Sun
Junyang Lin
Zhongxin Liu
MoE
LRM
330
20
0
15 May 2025
AI Greenferencing: Routing AI Inferencing to Green Modular Data Centers with Heron
Tella Rajashekhar Reddy
Palak
Rohan Gandhi
Anjaly Parayil
Chaojie Zhang
...
Liangcheng Yu
Jayashree Mohan
Srinivasan Iyengar
Shivkumar Kalyanaraman
Debopam Bhattacherjee
237
0
0
15 May 2025
Structural-Temporal Coupling Anomaly Detection with Dynamic Graph Transformer
Chang Zong
Yueting Zhuang
Jian Shao
Weiming Lu
325
1
0
13 May 2025
A Survey on Collaborative Mechanisms Between Large and Small Language Models
Yi Chen
JiaHao Zhao
HaoHao Han
380
10
0
12 May 2025
KDH-MLTC: Knowledge Distillation for Healthcare Multi-Label Text Classification
Hajar Sakai
Sarah Lam
VLM
350
0
0
12 May 2025
Learn to Think: Bootstrapping LLM Reasoning Capability Through Graph Representation Learning
Hang Gao
Chenhao Zhang
Tie Wang
Junsuo Zhao
Fengge Wu
Changwen Zheng
Huaping Liu
LRM
472
0
0
09 May 2025
Prediction-powered estimators for finite population statistics in highly imbalanced textual data: Public hate crime estimation
Hannes Waldetoft
Jakob Torgander
Måns Magnusson
230
2
0
05 May 2025
Parameter-Efficient Transformer Embeddings
Henry Ndubuaku
Mouad Talhi
264
0
0
04 May 2025
FineScope : Precision Pruning for Domain-Specialized Large Language Models Using SAE-Guided Self-Data Cultivation
Chaitali Bhattacharyya
Hyunsei Lee
Junyoung Lee
Shinhyoung Jang
Il hong Suh
Yeseong Kim
305
1
0
01 May 2025
MatMMFuse: Multi-Modal Fusion model for Material Property Prediction
Abhiroop Bhattacharya
Sylvain G. Cloutier
AI4CE
166
1
0
30 Apr 2025
HMI: Hierarchical Knowledge Management for Efficient Multi-Tenant Inference in Pretrained Language Models
The VLDB journal (VLDB J.), 2025
Junxuan Zhang
Jiadong Wang
Haoyang Li
Lidan Shou
Ke Chen
Gang Chen
Qin Xie
Guiming Xie
Xuejian Gong
188
1
0
24 Apr 2025
MOOSComp: Improving Lightweight Long-Context Compressor via Mitigating Over-Smoothing and Incorporating Outlier Scores
Fengwei Zhou
Jiafei Song
Wenjin Jason Li
Gengjian Xue
Zhikang Zhao
Yichao Lu
Bailin Na
321
1
0
23 Apr 2025
HYPEROFA: Expanding LLM Vocabulary to New Languages via Hypernetwork-Based Embedding Initialization
Enes Özeren
Yihong Liu
Hinrich Schütze
254
1
0
21 Apr 2025
Quantitative Clustering in Mean-Field Transformer Models
Shi Chen
Zhengjiang Lin
Yury Polyanskiy
Philippe Rigollet
392
13
0
20 Apr 2025
Q-FAKER: Query-free Hard Black-box Attack via Controlled Generation
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
CheolWon Na
YunSeok Choi
Jee-Hyong Lee
AAML
183
0
0
18 Apr 2025
WildFireCan-MMD: A Multimodal Dataset for Classification of User-Generated Content During Wildfires in Canada
Braeden Sherritt
Isar Nejadgholi
Efstratios Aivaliotis
Khaled Mslmani
Marzieh Amini
VLM
473
0
0
17 Apr 2025
Out of Sight Out of Mind, Out of Sight Out of Mind: Measuring Bias in Language Models Against Overlooked Marginalized Groups in Regional Contexts
Fatma Elsafoury
David Hartmann
273
0
0
17 Apr 2025
A new training approach for text classification in Mental Health: LatentGLoss
Korhan Sevinç
AI4MH
82
1
0
09 Apr 2025
Exploring Gradient-Guided Masked Language Model to Detect Textual Adversarial Attacks
Xiaomei Zhang
Zhaoxi Zhang
Yanjun Zhang
Xufei Zheng
L. Zhang
Shengshan Hu
Shirui Pan
AAML
238
2
0
08 Apr 2025
Pyramid-based Mamba Multi-class Unsupervised Anomaly Detection
Nasar Iqbal
Niki Martinel
Mamba
229
1
0
04 Apr 2025
StereoDetect: Detecting Stereotypes and Anti-stereotypes the Correct Way Using Social Psychological Underpinnings
Kaustubh Shivshankar Shejole
Pushpak Bhattacharyya
171
1
0
04 Apr 2025
Advancing Semantic Caching for LLMs with Domain-Specific Embeddings and Synthetic Data
Waris Gill
Justin Cechmanek
Tyler Hutcherson
Srijith Rajamohan
Jen Agarwal
Muhammad Ali Gulzar
Manvinder Singh
Benoit Dion
194
3
0
03 Apr 2025
Previous
1
2
3
4
5
6
...
59
60
61
Next