v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

International Conference on Learning Representations (ICLR), 2019

26 September 2019

ArXiv (abs)PDF HTML Github (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 3,048 papers shown

RedHerring Attack: Testing the Reliability of Attack Detection

Jonathan Rusert

AAML

25 Sep 2025

Every Character Counts: From Vulnerability to Defense in Phishing Detection

Maria Chiper

Radu Tudor Ionescu

213

24 Sep 2025

An overview of neural architectures for self-supervised audio representation learning from masked spectrograms

187

23 Sep 2025

Uncertainty in Semantic Language Modeling with PIXELS

Stefania Radu

Marco Zullich

Matias Valdenegro-Toro

143

23 Sep 2025

Modeling the Attack: Detecting AI-Generated Text by Quantifying Adversarial Perturbations

Lekkala Sai Teja

Annepaka Yadagiri

Sangam Sai Anish

Siva Gopala Krishna Nuthakki

Partha Pakray

AAML DeLMO

218

22 Sep 2025

FedEL: Federated Elastic Learning for Heterogeneous Devices

136

21 Sep 2025

DRES: Fake news detection by dynamic representation and ensemble selection

Faramarz Farhangian

Leandro A. Ensina

George D. C. Cavalcanti

Rafael M. O. Cruz

160

21 Sep 2025

Mental Multi-class Classification on Social Media: Benchmarking Transformer Architectures against LSTM Models

156

20 Sep 2025

Causality-Induced Positional Encoding for Transformer-Based Representation Learning of Non-Sequential Features

154

20 Sep 2025

Localmax dynamics for attention in transformers and its asymptotic behavior

Henri Cimetière

Maria Teresa Chiri

Bahman Gharesifard

19 Sep 2025

Combating Biomedical Misinformation through Multi-modal Claim Detection and Evidence-based VerificationAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025

125

17 Sep 2025

Efficient Hate Speech Detection: Evaluating 38 Models from Traditional Methods to TransformersACM Southeast Regional Conference (ACMSE), 2025

124

14 Sep 2025

Adversarial Attacks Against Automated Fact-Checking: A Survey

138

10 Sep 2025

Few-Shot Query Intent Detection via Relation-Aware Prompt Learning

106

06 Sep 2025

Dynamic Adaptive Shared Experts with Grouped Multi-Head Attention Mixture of Experts

102

05 Sep 2025

LMAE4Eth: Generalizable and Robust Ethereum Fraud Detection by Exploring Transaction Semantics and Masked Graph EmbeddingIEEE Transactions on Information Forensics and Security (TIFS), 2025

152

04 Sep 2025

PracMHBench: Re-evaluating Model-Heterogeneous Federated Learning Based on Practical Edge Device ConstraintsDesign Automation Conference (DAC), 2025

190

04 Sep 2025

RTQA : Recursive Thinking for Complex Temporal Knowledge Graph Question Answering with Large Language Models

104

04 Sep 2025

StructCoh: Structured Contrastive Learning for Context-Aware Text Semantic Matching

Chao Xue

Ziyuan Gao

AILaw

148

02 Sep 2025

DrDiff: Dynamic Routing Diffusion with Hierarchical Attention for Breaking the Efficiency-Quality Trade-off

152

02 Sep 2025

Bridging Thoughts and Words: Graph-Based Intent-Semantic Joint Learning for Fake News Detection

01 Sep 2025

Testing the assumptions about the geometry of sentence embedding spaces: the cosine measure need not apply

Vivi Nastase

Paola Merlo

01 Sep 2025

CaresAI at BioCreative IX Track 1 -- LLM for Biomedical QA

31 Aug 2025

Dual-Model Weight Selection and Self-Knowledge Distillation for Medical Image Classification

108

28 Aug 2025

FlowletFormer: Network Behavioral Semantic Aware Pre-training Model for Traffic Classification

162

27 Aug 2025

MahaParaphrase: A Marathi Paraphrase Detection Corpus and BERT-based Models

24 Aug 2025

SALMAN: Stability Analysis of Language Models Through the Maps Between Graph-based Manifolds

110

23 Aug 2025

CoPE: A Lightweight Complex Positional Encoding

Avinash Amballa

23 Aug 2025

Refining Contrastive Learning and Homography Relations for Multi-Modal Recommendation

116

19 Aug 2025

Incorporating Legal Logic into Deep Learning: An Intelligent Approach to Probation PredictionInternational Joint Conference on Artificial Intelligence (IJCAI), 2025

129

17 Aug 2025

Labels or Input? Rethinking Augmentation in Multimodal Hate Detection

107

15 Aug 2025

A Survey on Diffusion Language Models

281

14 Aug 2025

Enhancing Rumor Detection Methods with Propagation Structure Infused Language ModelInternational Conference on Computational Linguistics (COLING), 2025

124

10 Aug 2025

A Study of the Framework and Real-World Applications of Language Embedding for 3D Scene Understanding

Mahmoud Chick Zaouali

283

07 Aug 2025

Decision-Making with Deliberation: Meta-reviewing as a Document-grounded Dialogue

07 Aug 2025

Fine-Tuning Small Language Models (SLMs) for Autonomous Web-based Geographical Information Systems (AWebGIS)

116

06 Aug 2025

LaMPE: Length-aware Multi-grained Positional Encoding for Adaptive Long-context Scaling Without Training

141

04 Aug 2025

Zero-shot Compositional Action Recognition with Neural Logic Constraints

200

04 Aug 2025

HeQ: a Large and Diverse Hebrew Reading Comprehension BenchmarkConference on Empirical Methods in Natural Language Processing (EMNLP), 2025

104

03 Aug 2025

HT-Transformer: Event Sequences Classification by Accumulating Prefix Information with History Tokens

Ivan Karpukhin

Ivan A Kireev

AI4TS

113

02 Aug 2025

Unifying Mixture of Experts and Multi-Head Latent Attention for Efficient Language Models

154

02 Aug 2025

Object-Centric Cropping for Visual Few-Shot Classification

239

31 Jul 2025

XAutoLM: Efficient Fine-Tuning of Language Models via Meta-Learning and AutoML

Ernesto L. Estevanell-Valladares

Suilan Estevez-Velarde

Yoan Gutiérrez

Andrés Montoyo

Ruslan Mitkov

134

30 Jul 2025

Traits Run Deep: Enhancing Personality Assessment via Psychology-Guided LLM Representations and Multimodal Apparent Behaviors

30 Jul 2025

GovRelBench:A Benchmark for Government Domain Relevance

173

29 Jul 2025

Investigating Structural Pruning and Recovery Techniques for Compressing Multimodal Large Language Models: An Empirical Study

181

28 Jul 2025

Semantic IDs for Music RecommendationACM Conference on Recommender Systems (RecSys), 2025

24 Jul 2025

CompLeak: Deep Learning Model Compression Exacerbates Privacy Leakage

216

22 Jul 2025

Custom Algorithm-based Fault Tolerance for Attention Layers in Transformers

Vasileios Titopoulos

K. Alexandridis

G. Dimitrakopoulos

22 Jul 2025

A Language Model-Driven Semi-Supervised Ensemble Framework for Illicit Market Detection Across Deep/Dark Web and Social Platforms

Navid Yazdanjue

Morteza Rakhshaninejad

19 Jul 2025