ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.11942
  4. Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations
v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

International Conference on Learning Representations (ICLR), 2019
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSLAIMat
ArXiv (abs)PDFHTMLGithub (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 3,048 papers shown
RedHerring Attack: Testing the Reliability of Attack Detection
RedHerring Attack: Testing the Reliability of Attack Detection
Jonathan Rusert
AAML
85
0
0
25 Sep 2025
Every Character Counts: From Vulnerability to Defense in Phishing Detection
Every Character Counts: From Vulnerability to Defense in Phishing Detection
Maria Chiper
Radu Tudor Ionescu
213
0
0
24 Sep 2025
An overview of neural architectures for self-supervised audio representation learning from masked spectrograms
An overview of neural architectures for self-supervised audio representation learning from masked spectrograms
Sarthak Yadav
Sergios Theodoridis
Zheng-Hua Tan
Mamba
187
0
0
23 Sep 2025
Uncertainty in Semantic Language Modeling with PIXELS
Uncertainty in Semantic Language Modeling with PIXELS
Stefania Radu
Marco Zullich
Matias Valdenegro-Toro
143
0
0
23 Sep 2025
Modeling the Attack: Detecting AI-Generated Text by Quantifying Adversarial Perturbations
Modeling the Attack: Detecting AI-Generated Text by Quantifying Adversarial Perturbations
Lekkala Sai Teja
Annepaka Yadagiri
Sangam Sai Anish
Siva Gopala Krishna Nuthakki
Partha Pakray
AAMLDeLMO
218
1
0
22 Sep 2025
FedEL: Federated Elastic Learning for Heterogeneous Devices
FedEL: Federated Elastic Learning for Heterogeneous Devices
Letian Zhang
Bo Chen
Jieming Bian
Lei Wang
Jie Xu
FedML
136
0
0
21 Sep 2025
DRES: Fake news detection by dynamic representation and ensemble selection
DRES: Fake news detection by dynamic representation and ensemble selection
Faramarz Farhangian
Leandro A. Ensina
George D. C. Cavalcanti
Rafael M. O. Cruz
160
3
0
21 Sep 2025
Mental Multi-class Classification on Social Media: Benchmarking Transformer Architectures against LSTM Models
Mental Multi-class Classification on Social Media: Benchmarking Transformer Architectures against LSTM Models
Khalid Hasan
Jamil Saquer
Yifan Zhang
AI4MH
156
0
0
20 Sep 2025
Causality-Induced Positional Encoding for Transformer-Based Representation Learning of Non-Sequential Features
Causality-Induced Positional Encoding for Transformer-Based Representation Learning of Non-Sequential Features
Kaichen Xu
Yihang Du
Mianpeng Liu
Zimu Yu
Xiaobo Sun
154
0
0
20 Sep 2025
Localmax dynamics for attention in transformers and its asymptotic behavior
Localmax dynamics for attention in transformers and its asymptotic behavior
Henri Cimetière
Maria Teresa Chiri
Bahman Gharesifard
88
1
0
19 Sep 2025
Combating Biomedical Misinformation through Multi-modal Claim Detection and Evidence-based Verification
Combating Biomedical Misinformation through Multi-modal Claim Detection and Evidence-based VerificationAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Mariano Barone
Antonio Romano
Giuseppe Riccio
Marco Postiglione
V. Moscato
125
1
0
17 Sep 2025
Efficient Hate Speech Detection: Evaluating 38 Models from Traditional Methods to Transformers
Efficient Hate Speech Detection: Evaluating 38 Models from Traditional Methods to TransformersACM Southeast Regional Conference (ACMSE), 2025
Mahmoud Abusaqer
Jamil Saquer
Hazim Shatnawi
VLM
124
1
0
14 Sep 2025
Adversarial Attacks Against Automated Fact-Checking: A Survey
Adversarial Attacks Against Automated Fact-Checking: A Survey
Fanzhen Liu
A. Abuadbba
Kristen Moore
Surya Nepal
Cécile Paris
Jia Wu
Jian Yang
Quan Z. Sheng
AAML
138
1
0
10 Sep 2025
Few-Shot Query Intent Detection via Relation-Aware Prompt Learning
Few-Shot Query Intent Detection via Relation-Aware Prompt Learning
Liang Zhang
Yuan Li
Shijie Zhang
Zheng Zhang
Xitong Li
106
1
0
06 Sep 2025
Dynamic Adaptive Shared Experts with Grouped Multi-Head Attention Mixture of Experts
Dynamic Adaptive Shared Experts with Grouped Multi-Head Attention Mixture of Experts
Cheng Li
Jiexiong Liu
Yixuan Chen
Jie ji
MoE
102
0
0
05 Sep 2025
LMAE4Eth: Generalizable and Robust Ethereum Fraud Detection by Exploring Transaction Semantics and Masked Graph Embedding
LMAE4Eth: Generalizable and Robust Ethereum Fraud Detection by Exploring Transaction Semantics and Masked Graph EmbeddingIEEE Transactions on Information Forensics and Security (TIFS), 2025
Yifan Jia
Yanbin Wang
Jianguo Sun
Ye Tian
Peng Qian
152
3
0
04 Sep 2025
PracMHBench: Re-evaluating Model-Heterogeneous Federated Learning Based on Practical Edge Device Constraints
PracMHBench: Re-evaluating Model-Heterogeneous Federated Learning Based on Practical Edge Device ConstraintsDesign Automation Conference (DAC), 2025
Yuanchun Guo
Bingyan Liu
Yulong Sha
Zhensheng Xian
190
0
0
04 Sep 2025
RTQA : Recursive Thinking for Complex Temporal Knowledge Graph Question Answering with Large Language Models
RTQA : Recursive Thinking for Complex Temporal Knowledge Graph Question Answering with Large Language Models
Zhaoyan Gong
Juan Li
Zhiqiang Liu
Lei Liang
H. Chen
Wen Zhang
ReLMLRM
104
2
0
04 Sep 2025
StructCoh: Structured Contrastive Learning for Context-Aware Text Semantic Matching
StructCoh: Structured Contrastive Learning for Context-Aware Text Semantic Matching
Chao Xue
Ziyuan Gao
AILaw
148
1
0
02 Sep 2025
DrDiff: Dynamic Routing Diffusion with Hierarchical Attention for Breaking the Efficiency-Quality Trade-off
DrDiff: Dynamic Routing Diffusion with Hierarchical Attention for Breaking the Efficiency-Quality Trade-off
Jusheng Zhang
Yijia Fan
Kaitong Cai
Zimeng Huang
Xiaofei Sun
Jian Wang
Chengpei Tang
Keze Wang
DiffM
152
25
0
02 Sep 2025
Bridging Thoughts and Words: Graph-Based Intent-Semantic Joint Learning for Fake News Detection
Bridging Thoughts and Words: Graph-Based Intent-Semantic Joint Learning for Fake News Detection
Zhengjia Wang
Qiang Sheng
Danding Wang
Beizhe Hu
Juan Cao
GNN
92
2
0
01 Sep 2025
Testing the assumptions about the geometry of sentence embedding spaces: the cosine measure need not apply
Testing the assumptions about the geometry of sentence embedding spaces: the cosine measure need not apply
Vivi Nastase
Paola Merlo
84
0
0
01 Sep 2025
CaresAI at BioCreative IX Track 1 -- LLM for Biomedical QA
CaresAI at BioCreative IX Track 1 -- LLM for Biomedical QA
Reem Abdel-Salam
M. Adewunmi
M. Abayomi
LM&MA
69
0
0
31 Aug 2025
Dual-Model Weight Selection and Self-Knowledge Distillation for Medical Image Classification
Dual-Model Weight Selection and Self-Knowledge Distillation for Medical Image Classification
Ayaka Tsutsumi
Guang Li
Ren Togo
Takahiro Ogawa
Satoshi Kondo
Miki Haseyama
108
0
0
28 Aug 2025
FlowletFormer: Network Behavioral Semantic Aware Pre-training Model for Traffic Classification
FlowletFormer: Network Behavioral Semantic Aware Pre-training Model for Traffic Classification
Liming Liu
Ruoyu Li
Qing Li
Meijia Hou
Yong Jiang
Mingwei Xu
162
0
0
27 Aug 2025
MahaParaphrase: A Marathi Paraphrase Detection Corpus and BERT-based Models
MahaParaphrase: A Marathi Paraphrase Detection Corpus and BERT-based Models
Suramya Jadhav
Abhay Shanbhag
Amogh Thakurdesai
Ridhima Sinare
Ananya Joshi
Raviraj Joshi
49
0
0
24 Aug 2025
SALMAN: Stability Analysis of Language Models Through the Maps Between Graph-based Manifolds
SALMAN: Stability Analysis of Language Models Through the Maps Between Graph-based Manifolds
Wuxinlin Cheng
Yun Feng
Jinwen Wu
K. P. Subbalakshmi
Tian Han
Zhuo Feng
AAML
110
0
0
23 Aug 2025
CoPE: A Lightweight Complex Positional Encoding
CoPE: A Lightweight Complex Positional Encoding
Avinash Amballa
59
0
0
23 Aug 2025
Refining Contrastive Learning and Homography Relations for Multi-Modal Recommendation
Refining Contrastive Learning and Homography Relations for Multi-Modal Recommendation
Shouxing Ma
Yawen Zeng
Shiqing Wu
Guandong Xu
116
0
0
19 Aug 2025
Incorporating Legal Logic into Deep Learning: An Intelligent Approach to Probation Prediction
Incorporating Legal Logic into Deep Learning: An Intelligent Approach to Probation PredictionInternational Joint Conference on Artificial Intelligence (IJCAI), 2025
Qinghua Wang
Xu Zhang
Lingyan Yang
Rui Shao
Bonan Wang
Fang Wang
Cunquan Qu
AILaw
129
0
0
17 Aug 2025
Labels or Input? Rethinking Augmentation in Multimodal Hate Detection
Labels or Input? Rethinking Augmentation in Multimodal Hate Detection
Sahajpreet Singh
Rongxin Ouyang
Subhayan Mukerjee
Kokil Jaidka
VLM
107
0
0
15 Aug 2025
A Survey on Diffusion Language Models
A Survey on Diffusion Language Models
Tianyi Li
Mingda Chen
Bowei Guo
Zhiqiang Shen
281
28
0
14 Aug 2025
Enhancing Rumor Detection Methods with Propagation Structure Infused Language Model
Enhancing Rumor Detection Methods with Propagation Structure Infused Language ModelInternational Conference on Computational Linguistics (COLING), 2025
Chaoqun Cui
Siyuan Li
Kunkun Ma
Caiyan Jia
124
4
0
10 Aug 2025
A Study of the Framework and Real-World Applications of Language Embedding for 3D Scene Understanding
A Study of the Framework and Real-World Applications of Language Embedding for 3D Scene Understanding
Mahmoud Chick Zaouali
Todd Charter
Yehor Karpichev
Brandon Haworth
Homayoun Najjjaran
3DGS
283
0
0
07 Aug 2025
Decision-Making with Deliberation: Meta-reviewing as a Document-grounded Dialogue
Decision-Making with Deliberation: Meta-reviewing as a Document-grounded Dialogue
Sukannya Purkayastha
Nils Dycke
Anne Lauscher
Iryna Gurevych
92
1
0
07 Aug 2025
Fine-Tuning Small Language Models (SLMs) for Autonomous Web-based Geographical Information Systems (AWebGIS)
Fine-Tuning Small Language Models (SLMs) for Autonomous Web-based Geographical Information Systems (AWebGIS)
Mahdi Nazari Ashani
Ali Asghar Alesheikh
Saba Kazemi
Kimya Kheirkhah
Yasin Mohammadi
Fatemeh Rezaie
Amir Mahdi Manafi
Hedieh Zarkesh
116
0
0
06 Aug 2025
LaMPE: Length-aware Multi-grained Positional Encoding for Adaptive Long-context Scaling Without Training
LaMPE: Length-aware Multi-grained Positional Encoding for Adaptive Long-context Scaling Without Training
Sikui Zhang
Guangze Gao
Ziyun Gan
Chunfeng Yuan
Zefeng Lin
Houwen Peng
Bing Li
Weiming Hu
141
0
0
04 Aug 2025
Zero-shot Compositional Action Recognition with Neural Logic Constraints
Zero-shot Compositional Action Recognition with Neural Logic Constraints
Gefan Ye
Lin Li
Kexin Li
Jun Xiao
Long Chen
200
3
0
04 Aug 2025
HeQ: a Large and Diverse Hebrew Reading Comprehension Benchmark
HeQ: a Large and Diverse Hebrew Reading Comprehension BenchmarkConference on Empirical Methods in Natural Language Processing (EMNLP), 2025
Amir D. N. Cohen
Hilla Merhav
Yoav Goldberg
Reut Tsarfaty
104
11
0
03 Aug 2025
HT-Transformer: Event Sequences Classification by Accumulating Prefix Information with History Tokens
HT-Transformer: Event Sequences Classification by Accumulating Prefix Information with History Tokens
Ivan Karpukhin
Ivan A Kireev
AI4TS
113
1
0
02 Aug 2025
Unifying Mixture of Experts and Multi-Head Latent Attention for Efficient Language Models
Unifying Mixture of Experts and Multi-Head Latent Attention for Efficient Language Models
Sushant Mehta
Raj Abhijit Dandekar
Rajat Dandekar
Sreedath Panat
MoE
154
2
0
02 Aug 2025
Object-Centric Cropping for Visual Few-Shot Classification
Object-Centric Cropping for Visual Few-Shot Classification
Aymane Abdali
Bartosz Boguslawski
Lucas Drumetz
Vincent Gripon
OCL
239
0
0
31 Jul 2025
XAutoLM: Efficient Fine-Tuning of Language Models via Meta-Learning and AutoML
XAutoLM: Efficient Fine-Tuning of Language Models via Meta-Learning and AutoML
Ernesto L. Estevanell-Valladares
Suilan Estevez-Velarde
Yoan Gutiérrez
Andrés Montoyo
Ruslan Mitkov
134
0
0
30 Jul 2025
Traits Run Deep: Enhancing Personality Assessment via Psychology-Guided LLM Representations and Multimodal Apparent Behaviors
Traits Run Deep: Enhancing Personality Assessment via Psychology-Guided LLM Representations and Multimodal Apparent Behaviors
Jia Li
Yichao He
Jiacheng Xu
Tianhao Luo
Zhenzhen Hu
Richang Hong
Meng Wang
80
0
0
30 Jul 2025
GovRelBench:A Benchmark for Government Domain Relevance
GovRelBench:A Benchmark for Government Domain Relevance
Haiquan Wang
Yi Chen
Shang Zeng
Yun Bian
Zhe Cui
173
0
0
29 Jul 2025
Investigating Structural Pruning and Recovery Techniques for Compressing Multimodal Large Language Models: An Empirical Study
Investigating Structural Pruning and Recovery Techniques for Compressing Multimodal Large Language Models: An Empirical Study
Yiran Huang
Lukas Thede
Goran Frehse
Wenjia Xu
Zeynep Akata
181
0
0
28 Jul 2025
Semantic IDs for Music Recommendation
Semantic IDs for Music RecommendationACM Conference on Recommender Systems (RecSys), 2025
M. J. Mei
Florian Henkel
Samuel E. Sandberg
Oliver Bembom
Andreas Ehmann
VLM
79
2
0
24 Jul 2025
CompLeak: Deep Learning Model Compression Exacerbates Privacy Leakage
CompLeak: Deep Learning Model Compression Exacerbates Privacy Leakage
Na Li
Yansong Gao
Hongsheng Hu
Boyu Kuang
Anmin Fu
216
0
0
22 Jul 2025
Custom Algorithm-based Fault Tolerance for Attention Layers in Transformers
Custom Algorithm-based Fault Tolerance for Attention Layers in Transformers
Vasileios Titopoulos
K. Alexandridis
G. Dimitrakopoulos
91
0
0
22 Jul 2025
A Language Model-Driven Semi-Supervised Ensemble Framework for Illicit Market Detection Across Deep/Dark Web and Social Platforms
A Language Model-Driven Semi-Supervised Ensemble Framework for Illicit Market Detection Across Deep/Dark Web and Social Platforms
Navid Yazdanjue
Morteza Rakhshaninejad
Hossein Yazdanjouei
M. S. Khorshidi
Mikko S. Niemela
Fang Chen
Amir H. Gandomi
64
0
0
19 Jul 2025
Previous
12345...596061
Next