v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

International Conference on Learning Representations (ICLR), 2019

26 September 2019

ArXiv (abs)PDF HTML Github (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 3,048 papers shown

Advancing Mental Disorder Detection: A Comparative Evaluation of Transformer and LSTM Architectures on Social MediaAnnual International Computer Software and Applications Conference (COMPSAC), 2025

125

17 Jul 2025

Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation

...

283

14 Jul 2025

Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations

A. Bochkov

242

07 Jul 2025

DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy

200

02 Jul 2025

Health Sentinel: An AI Pipeline For Real-time Disease Outbreak Detection

...

24 Jun 2025

Beyond Parameters: Exploring Virtual Logic Depth for Scaling Laws

193

23 Jun 2025

All is Not Lost: LLM Recovery without Checkpoints

Nikolay Blagoev

Oğuzhan Ersoy

Lydia Yiyu Chen

219

18 Jun 2025

Enhancing Hyperbole and Metaphor Detection with Their Bidirectional Dynamic Interaction and Emotion KnowledgeAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

190

18 Jun 2025

FASCIST-O-METER: Classifier for Neo-fascist Discourse Online

Rudy Alexandro Garrido Veliz

Martin Semmann

Chris Biemann

Seid Muhie Yimam

250

12 Jun 2025

Latent Multi-Head Attention for Small Language Models

178

11 Jun 2025

semantic-features: A User-Friendly Tool for Studying Contextual Word Embeddings in Interpretable Semantic Spaces

Jwalanthi Ranganathan

Rohan Jha

Kanishka Misra

Kyle Mahowald

202

06 Jun 2025

Towards Efficient Multi-LLM Inference: Characterization and Analysis of LLM Routing and Hierarchical Techniques

200

06 Jun 2025

Training-free AI for Earth Observation Change Detection using Physics Aware Neuromorphic NetworksScientific Reports (Sci Rep), 2025

Stephen Smith

Cormac Purcell

Zdenka Kuncic

265

04 Jun 2025

MCFNet: A Multimodal Collaborative Fusion Network for Fine-Grained Semantic Classification

236

29 May 2025

Improving QA Efficiency with DistilBERT: Fine-Tuning and Inference on mobile Intel CPUs

Ngeyen Yinkfu

118

28 May 2025

VeriTrail: Closed-Domain Hallucination Detection with Traceability

Dasha Metropolitansky

Jonathan Larson

HILM

252

27 May 2025

Unfolding A Few Structures for The Many: Memory-Efficient Compression of Conformer and Speech Foundation Models

192

27 May 2025

Discrete Markov Bridge

214

26 May 2025

Recurrent Self-Attention Dynamics: An Energy-Agnostic Perspective from Jacobians

Akiyoshi Tomihari

Ryo Karakida

355

26 May 2025

ResSVD: Residual Compensated SVD for Large Language Model Compression

321

26 May 2025

Reasoning Beyond Language: A Comprehensive Survey on Latent Chain-of-Thought Reasoning

378

22 May 2025

FS-DAG: Few Shot Domain Adapting Graph Networks for Visually Rich Document UnderstandingInternational Conference on Computational Linguistics (COLING), 2025

Amit Agarwal

Srikant Panda

Kulbhushan Pachauri

206

22 May 2025

Leveraging Large Language Models for Command Injection Vulnerability Analysis in Python: An Empirical Study on Popular Open-Source Projects

189

21 May 2025

SDLog: A Deep Learning Framework for Detecting Sensitive Information in Software Logs

224

20 May 2025

Large Language Models and Their Applications in Roadway Safety and Mobility Enhancement: A Comprehensive Review

Muhammad Monjurul Karim

188

19 May 2025

Self-Supervised Learning for Image Segmentation: A Comprehensive Survey

358

19 May 2025

On Membership Inference Attacks in Knowledge Distillation

Ziyao Cui

Minxing Zhang

Jian Pei

249

17 May 2025

Class Distillation with Mahalanobis Contrast: An Efficient Training Paradigm for Pragmatic Language Understanding TasksAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Chenlu Wang

Weimin Lyu

Ritwik Banerjee

204

17 May 2025

Parallel Scaling Law for Language Models

330

15 May 2025

AI Greenferencing: Routing AI Inferencing to Green Modular Data Centers with Heron

Tella Rajashekhar Reddy

...

Shivkumar Kalyanaraman

Debopam Bhattacherjee

237

15 May 2025

Structural-Temporal Coupling Anomaly Detection with Dynamic Graph Transformer

325

13 May 2025

A Survey on Collaborative Mechanisms Between Large and Small Language Models

Yi Chen

JiaHao Zhao

HaoHao Han

380

12 May 2025

KDH-MLTC: Knowledge Distillation for Healthcare Multi-Label Text Classification

Hajar Sakai

Sarah Lam

VLM

350

12 May 2025

Learn to Think: Bootstrapping LLM Reasoning Capability Through Graph Representation Learning

472

09 May 2025

Prediction-powered estimators for finite population statistics in highly imbalanced textual data: Public hate crime estimation

Hannes Waldetoft

Jakob Torgander

Måns Magnusson

230

05 May 2025

Parameter-Efficient Transformer Embeddings

Henry Ndubuaku

Mouad Talhi

264

04 May 2025

FineScope : Precision Pruning for Domain-Specialized Large Language Models Using SAE-Guided Self-Data Cultivation

Chaitali Bhattacharyya

305

01 May 2025

MatMMFuse: Multi-Modal Fusion model for Material Property Prediction

Abhiroop Bhattacharya

Sylvain G. Cloutier

AI4CE

166

30 Apr 2025

HMI: Hierarchical Knowledge Management for Efficient Multi-Tenant Inference in Pretrained Language ModelsThe VLDB journal (VLDB J.), 2025

188

24 Apr 2025

MOOSComp: Improving Lightweight Long-Context Compressor via Mitigating Over-Smoothing and Incorporating Outlier Scores

321

23 Apr 2025

HYPEROFA: Expanding LLM Vocabulary to New Languages via Hypernetwork-Based Embedding Initialization

Enes Özeren

Yihong Liu

Hinrich Schütze

254

21 Apr 2025

Quantitative Clustering in Mean-Field Transformer Models

392

20 Apr 2025

Q-FAKER: Query-free Hard Black-box Attack via Controlled GenerationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

183

18 Apr 2025

WildFireCan-MMD: A Multimodal Dataset for Classification of User-Generated Content During Wildfires in Canada

Braeden Sherritt

Isar Nejadgholi

Efstratios Aivaliotis

Khaled Mslmani

Marzieh Amini

VLM

473

17 Apr 2025

Out of Sight Out of Mind, Out of Sight Out of Mind: Measuring Bias in Language Models Against Overlooked Marginalized Groups in Regional Contexts

Fatma Elsafoury

David Hartmann

273

17 Apr 2025

A new training approach for text classification in Mental Health: LatentGLoss

Korhan Sevinç

AI4MH

09 Apr 2025

Exploring Gradient-Guided Masked Language Model to Detect Textual Adversarial Attacks

238

08 Apr 2025

Pyramid-based Mamba Multi-class Unsupervised Anomaly Detection

Nasar Iqbal

Niki Martinel

Mamba

229

04 Apr 2025

StereoDetect: Detecting Stereotypes and Anti-stereotypes the Correct Way Using Social Psychological Underpinnings

Kaustubh Shivshankar Shejole

Pushpak Bhattacharyya

171

04 Apr 2025

Advancing Semantic Caching for LLMs with Domain-Specific Embeddings and Synthetic Data

194

03 Apr 2025