v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

International Conference on Learning Representations (ICLR), 2019

26 September 2019

ArXiv (abs)PDF HTML Github (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 3,048 papers shown

Advancing Semantic Caching for LLMs with Domain-Specific Embeddings and Synthetic Data

198

03 Apr 2025

From Text to Graph: Leveraging Graph Neural Networks for Enhanced Explainability in NLP

353

02 Apr 2025

KernelDNA: Dynamic Kernel Sharing via Decoupled Naive Adapters

Haiduo Huang

Yadong Zhang

Pengju Ren

368

30 Mar 2025

Evaluating Text-to-Image and Text-to-Video Synthesis with a Conditional Fréchet Distance

323

27 Mar 2025

Cyborg Data: Merging Human with AI Generated Training Data

Kai North

Christopher Ormerod

193

26 Mar 2025

Unsupervised Acquisition of Discrete Grammatical Categories

David Ph. Shakouri

Crit Cremers

Niels O. Schiller

196

24 Mar 2025

CoMP: Continual Multimodal Pre-training for Vision Foundation Models

486

24 Mar 2025

Detection of Somali-written Fake News and Toxic Messages on the Social Media Using Transformer-based Language Models

243

23 Mar 2025

Deceptive Humor: A Synthetic Multilingual Benchmark Dataset for Bridging Fabricated Claims with Humorous Content

Sai Kartheek Reddy Kasu

Shankar Biradar

Sunil Saumya

327

20 Mar 2025

Unified Enhancement of the Generalization and Robustness of Language Models via Bi-Stage Optimization

248

19 Mar 2025

Model Hubs and Beyond: Analyzing Model Popularity, Performance, and DocumentationInternational Conference on Web and Social Media (ICWSM), 2025

Pritam Kadasi

Sriman Reddy

Srivathsa Vamsi Chaturvedula

365

19 Mar 2025

Sentiment Analysis in SemEval: A Review of Sentiment Identification ApproachesInternational Journal of Electrical and Computer Engineering (IJECE) (IJECE), 2023

Bousselham EL HADDAOUI

R. Chiheb

R. Faizi

A. E. Afia

260

13 Mar 2025

ARLED: Leveraging LED-based ARMAN Model for Abstractive Summarization of Persian Long Documents

Samira Zangooei

Amirhossein Darmani

Hossein Farahmand Nezhad

Laya Mahmoudi

231

13 Mar 2025

DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation

...

380

13 Mar 2025

ReSi: A Comprehensive Benchmark for Representational Similarity MeasuresInternational Conference on Learning Representations (ICLR), 2024

493

13 Mar 2025

Talk2PC: Enhancing 3D Visual Grounding through LiDAR and Radar Point Clouds Fusion for Autonomous Driving

...

326

11 Mar 2025

Large Language Model as Meta-Surrogate for Data-Driven Many-Task Optimization: A Proof-of-Principle Study

Wei Wei

Yue-Jiao Gong

Jun Zhang

300

11 Mar 2025

A Survey on Knowledge-Oriented Retrieval-Augmented Generation

...

367

11 Mar 2025

CtrlRAG: Black-box Adversarial Attacks Based on Masked Language Models in Retrieval-Augmented Language Generation

Runqi Sui

AAML

210

10 Mar 2025

Gender Encoding Patterns in Pretrained Language Model Representations

Mahdi Zakizadeh

Mohammad Taher Pilehvar

418

09 Mar 2025

Fine-Grained Evaluation for Implicit Discourse Relation Recognition

Xinyi Cai

202

07 Mar 2025

Layer-Specific Scaling of Positional Encodings for Superior Long-Context Modeling

285

06 Mar 2025

PriFFT: Privacy-preserving Federated Fine-tuning of Large Language Models via Hybrid Secret Sharing

251

05 Mar 2025

Zero-Shot Complex Question-Answering on Long Scientific Documents

Wanting Wang

RALM

134

04 Mar 2025

Efficient or Powerful? Trade-offs Between Machine Learning and Deep Learning for Mental Illness Detection on Social MediaScientific Reports (Sci Rep), 2025

255

03 Mar 2025

EPEE: Towards Efficient and Effective Foundation Models in Biomedicine

245

03 Mar 2025

Neural ODE Transformers: Analyzing Internal Dynamics and Adaptive Fine-tuningInternational Conference on Learning Representations (ICLR), 2025

451

03 Mar 2025

TimesBERT: A BERT-Style Foundation Model for Time Series Understanding

230

28 Feb 2025

Uncertainty Quantification in Retrieval Augmented Question Answering

Laura Perez-Beltrachini

Mirella Lapata

RALM

542

25 Feb 2025

Encryption-Friendly LLM ArchitectureInternational Conference on Learning Representations (ICLR), 2024

498

24 Feb 2025

Towards Typologically Aware Rescoring to Mitigate Unfaithfulness in Lower-Resource Languages

398

24 Feb 2025

Reasoning with Latent Thoughts: On the Power of Looped TransformersInternational Conference on Learning Representations (ICLR), 2025

431

24 Feb 2025

Pay Attention to Real World Perturbations! Natural Robustness Evaluation in Machine Reading Comprehension

417

23 Feb 2025

Iterative Auto-Annotation for Scientific Named Entity Recognition Using BERT-Based Models

Kartik Gupta

133

22 Feb 2025

Robust Bias Detection in MLMs and its Application to Human Trait RatingsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

Ingroj Shrestha

Louis Tay

Padmini Srinivasan

350

21 Feb 2025

Comprehensive Analysis of Transparency and Accessibility of ChatGPT, DeepSeek, And other SoTA Large Language Models

Ranjan Sapkota

Shaina Raza

Manoj Karkee

266

21 Feb 2025

A Survey of Model Architectures in Information Retrieval

582

20 Feb 2025

Hyper-SET: Designing Transformers via Hyperspherical Energy Minimization

Yunzhe Hu

Difan Zou

Dong Xu

503

17 Feb 2025

The underlying structures of self-attention: symmetry, directionality, and emergent dynamics in Transformer training

Matteo Saponati

Pascal Sager

Pau Vilimelis Aceituno

Thilo Stadelmann

Benjamin Grewe

211

15 Feb 2025

LLM4GNAS: A Large Language Model Based Toolkit for Graph Neural Architecture Search

244

12 Feb 2025

Enhancing Auto-regressive Chain-of-Thought through Loop-Aligned Reasoning

340

12 Feb 2025

Al-Khwarizmi: Discovering Physical Laws with Foundation Models

Christopher E. Mower

Haitham Bou-Ammar

AI4CE

790

03 Feb 2025

SecPE: Secure Prompt Ensembling for Private and Robust Large Language ModelsEuropean Conference on Artificial Intelligence (ECAI), 2025

514

02 Feb 2025

AdditiveLLM: Large Language Models Predict Defects in Additive ManufacturingAdditive Manufacturing Letters (AML), 2025

P. Pak

A. Farimani

AI4CE

246

29 Jan 2025

Merino: Entropy-driven Design for Generative Language Models on IoT DevicesAAAI Conference on Artificial Intelligence (AAAI), 2024

375

28 Jan 2025

A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and EthicsInformation Fusion (Inf. Fusion), 2023

726

269

28 Jan 2025

A Review on Self-Supervised Learning for Time Series Anomaly Detection: Recent Advances and Open Challenges

Aitor Sánchez-Ferrera

Borja Calvo

Jose A. Lozano

AI4TS

445

25 Jan 2025

EDoRA: Efficient Weight-Decomposed Low-Rank Adaptation via Singular Value Decomposition

Hamid Nasiri

Peter Garraghan

219

21 Jan 2025

Reference-free Evaluation Metrics for Text Generation: A Survey

345

21 Jan 2025

A Contrastive Framework with User, Item and Review Alignment for RecommendationWeb Search and Data Mining (WSDM), 2025

Hoang V. Dong

Yuan Fang

Hady W. Lauw

679

21 Jan 2025