ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.11942
  4. Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations
v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

International Conference on Learning Representations (ICLR), 2019
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSLAIMat
ArXiv (abs)PDFHTMLGithub (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 3,048 papers shown
Noise-Free Explanation for Driving Action Prediction
Noise-Free Explanation for Driving Action Prediction
Hongbo Zhu
Theodor Wulff
R. S. Maharjan
Jinpei Han
Angelo Cangelosi
AAMLFAtt
286
0
0
08 Jul 2024
AI Safety in Generative AI Large Language Models: A Survey
AI Safety in Generative AI Large Language Models: A Survey
Jaymari Chua
Yun Yvonna Li
Shiyi Yang
Chen Wang
Lina Yao
LM&MA
364
37
0
06 Jul 2024
Beyond Perplexity: Multi-dimensional Safety Evaluation of LLM
  Compression
Beyond Perplexity: Multi-dimensional Safety Evaluation of LLM Compression
Zhichao Xu
Ashim Gupta
Tao Li
Oliver Bentham
Vivek Srikumar
416
20
0
06 Jul 2024
Not (yet) the whole story: Evaluating Visual Storytelling Requires More
  than Measuring Coherence, Grounding, and Repetition
Not (yet) the whole story: Evaluating Visual Storytelling Requires More than Measuring Coherence, Grounding, and Repetition
Aditya K Surikuchi
Raquel Fernández
Sandro Pezzelle
242
9
0
05 Jul 2024
Multi-modal Masked Siamese Network Improves Chest X-Ray Representation
  Learning
Multi-modal Masked Siamese Network Improves Chest X-Ray Representation Learning
Saeed Shurrab
Alejandro Guerra-Manzanares
Farah E. Shamout
242
4
0
05 Jul 2024
ESQA: Event Sequences Question Answering
ESQA: Event Sequences Question Answering
Irina Abdullaeva
Andrei Filatov
Mikhail Orlov
Ivan Karpukhin
Viacheslav Vasilev
Denis Dimitrov
Andrey Kuznetsov
Ivan A Kireev
Ivan A Kireev
226
1
0
03 Jul 2024
Aspect-Based Sentiment Analysis Techniques: A Comparative Study
Aspect-Based Sentiment Analysis Techniques: A Comparative Study
Dineth Jayakody
Koshila Isuranda
A. V. A. Malkith
Nisansa de Silva
Sachintha Rajith Ponnamperuma
G. Sandamali
K. L. Sudheera
198
6
0
03 Jul 2024
Increasing Model Capacity for Free: A Simple Strategy for Parameter
  Efficient Fine-tuning
Increasing Model Capacity for Free: A Simple Strategy for Parameter Efficient Fine-tuning
Haobo Song
Hao Zhao
Soumajit Majumder
Tao Lin
183
7
0
01 Jul 2024
Look Ahead or Look Around? A Theoretical Comparison Between
  Autoregressive and Masked Pretraining
Look Ahead or Look Around? A Theoretical Comparison Between Autoregressive and Masked Pretraining
Qi Zhang
Tianqi Du
Haotian Huang
Yifei Wang
Yisen Wang
236
6
0
01 Jul 2024
Large Language Model Enhanced Knowledge Representation Learning: A Survey
Large Language Model Enhanced Knowledge Representation Learning: A Survey
Xin Wang
Zirui Chen
Haofen Wang
Leong Hou U
Zhao Li
Wenbin Guo
KELM
520
21
0
01 Jul 2024
FLY-TTS: Fast, Lightweight and High-Quality End-to-End Text-to-Speech
  Synthesis
FLY-TTS: Fast, Lightweight and High-Quality End-to-End Text-to-Speech Synthesis
Yinlin Guo
Yening Lv
Jinqiao Dou
Yan Zhang
Yuehai Wang
200
2
0
30 Jun 2024
LegalTurk Optimized BERT for Multi-Label Text Classification and NER
LegalTurk Optimized BERT for Multi-Label Text Classification and NER
Farnaz Zeidi
Mehmet Fatih Amasyali
Çiğdem Erol
VLM
142
3
0
30 Jun 2024
"I understand why I got this grade": Automatic Short Answer Grading with Feedback
"I understand why I got this grade": Automatic Short Answer Grading with Feedback
Dishank Aggarwal
Pushpak Bhattacharyya
Bhaskaran Raman
Pushpak Bhattacharyya
227
9
0
30 Jun 2024
BioMNER: A Dataset for Biomedical Method Entity Recognition
BioMNER: A Dataset for Biomedical Method Entity Recognition
Chen Tang
Bohao Yang
Kun Zhao
Bo Lv
Chenghao Xiao
Frank Guerin
Chenghua Lin
182
0
0
28 Jun 2024
Protein Representation Learning with Sequence Information Embedding:
  Does it Always Lead to a Better Performance?
Protein Representation Learning with Sequence Information Embedding: Does it Always Lead to a Better Performance?
Y. Tan
Lirong Zheng
Bozitao Zhong
Liang Hong
Bingxin Zhou
206
8
0
28 Jun 2024
When Search Engine Services meet Large Language Models: Visions and
  Challenges
When Search Engine Services meet Large Language Models: Visions and Challenges
Haoyi Xiong
Jiang Bian
Yuchen Li
Xuhong Li
Jundong Li
Shuaiqiang Wang
D. Yin
Sumi Helal
353
81
0
28 Jun 2024
Fibottention: Inceptive Visual Representation Learning with Diverse
  Attention Across Heads
Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads
Ali Khaleghi Rahimian
Manish Kumar Govind
Subhajit Maity
Dominick Reilly
Christian Kummerle
Srijan Das
A. Dutta
237
1
0
27 Jun 2024
The Odyssey of Commonsense Causality: From Foundational Benchmarks to
  Cutting-Edge Reasoning
The Odyssey of Commonsense Causality: From Foundational Benchmarks to Cutting-Edge Reasoning
Shaobo Cui
Zhijing Jin
Bernhard Schölkopf
Boi Faltings
CMLLRM
255
7
0
27 Jun 2024
Clustering in pure-attention hardmax transformers and its role in
  sentiment analysis
Clustering in pure-attention hardmax transformers and its role in sentiment analysis
Albert Alcalde
Giovanni Fantuzzi
Enrique Zuazua
292
10
0
26 Jun 2024
Unveiling and Controlling Anomalous Attention Distribution in
  Transformers
Unveiling and Controlling Anomalous Attention Distribution in Transformers
Ruiqing Yan
Xingbo Du
Haoyu Deng
Linghan Zheng
Qiuzhuang Sun
Jifang Hu
Yuhang Shao
Penghao Jiang
Jinrong Jiang
Lian Zhao
200
1
0
26 Jun 2024
A New Benchmark Dataset and Mixture-of-Experts Language Models for Adversarial Natural Language Inference in Vietnamese
A New Benchmark Dataset and Mixture-of-Experts Language Models for Adversarial Natural Language Inference in Vietnamese
Tin Van Huynh
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
347
2
0
25 Jun 2024
Are there identifiable structural parts in the sentence embedding whole?
Are there identifiable structural parts in the sentence embedding whole?
Vivi Nastase
Paola Merlo
200
6
0
24 Jun 2024
Large Vocabulary Size Improves Large Language Models
Large Vocabulary Size Improves Large Language Models
Sho Takase
Ryokan Ri
Shun Kiyono
Takuya Kato
316
8
0
24 Jun 2024
Evaluating the Effectiveness of the Foundational Models for Q&A
  Classification in Mental Health care
Evaluating the Effectiveness of the Foundational Models for Q&A Classification in Mental Health care
Hassan Alhuzali
Ashwag Alasmari
AI4MH
262
4
0
23 Jun 2024
Intrinsic Dimension Correlation: uncovering nonlinear connections in multimodal representations
Intrinsic Dimension Correlation: uncovering nonlinear connections in multimodal representations
Lorenzo Basile
Santiago Acevedo
Luca Bortolussi
Fabio Anselmi
Alex Rodriguez
296
6
0
22 Jun 2024
Brain-Like Language Processing via a Shallow Untrained Multihead
  Attention Network
Brain-Like Language Processing via a Shallow Untrained Multihead Attention Network
Badr AlKhamissi
Greta Tuckute
Antoine Bosselut
Martin Schrimpf
223
8
0
21 Jun 2024
Text Serialization and Their Relationship with the Conventional
  Paradigms of Tabular Machine Learning
Text Serialization and Their Relationship with the Conventional Paradigms of Tabular Machine Learning
Kyoka Ono
Simon A. Lee
LMTD
215
12
0
19 Jun 2024
Fighting Randomness with Randomness: Mitigating Optimisation Instability
  of Fine-Tuning using Delayed Ensemble and Noisy Interpolation
Fighting Randomness with Randomness: Mitigating Optimisation Instability of Fine-Tuning using Delayed Ensemble and Noisy Interpolation
Branislav Pecher
Ján Cegin
Róbert Belanec
Jakub Simko
Ivan Srba
Maria Bielikova
214
1
0
18 Jun 2024
QueerBench: Quantifying Discrimination in Language Models Toward Queer
  Identities
QueerBench: Quantifying Discrimination in Language Models Toward Queer Identities
Mae Sosto
Alberto Barrón-Cedeño
210
5
0
18 Jun 2024
TroL: Traversal of Layers for Large Language and Vision Models
TroL: Traversal of Layers for Large Language and Vision Models
Byung-Kwan Lee
Sangyun Chung
Chae Won Kim
Beomchan Park
Yong Man Ro
349
12
0
18 Jun 2024
CollabStory: Multi-LLM Collaborative Story Generation and Authorship Analysis
CollabStory: Multi-LLM Collaborative Story Generation and Authorship Analysis
Saranya Venkatraman
Nafis Irtiza Tripto
Dongwon Lee
496
30
0
18 Jun 2024
A Systematic Survey of Text Summarization: From Statistical Methods to
  Large Language Models
A Systematic Survey of Text Summarization: From Statistical Methods to Large Language Models
Haopeng Zhang
Philip S. Yu
Jiawei Zhang
287
99
0
17 Jun 2024
Scaling Synthetic Logical Reasoning Datasets with Context-Sensitive
  Declarative Grammars
Scaling Synthetic Logical Reasoning Datasets with Context-Sensitive Declarative Grammars
Damien Sileo
LRMReLM
289
5
0
16 Jun 2024
Improving Large Models with Small models: Lower Costs and Better
  Performance
Improving Large Models with Small models: Lower Costs and Better Performance
Dong Chen
Shuo Zhang
Yueting Zhuang
Siliang Tang
Qidong Liu
Hua Wang
Mingliang Xu
211
13
0
15 Jun 2024
Adversarial Evasion Attack Efficiency against Large Language Models
Adversarial Evasion Attack Efficiency against Large Language Models
João Vitorino
Eva Maia
Isabel Praça
AAML
187
5
0
12 Jun 2024
Defining and Detecting Vulnerability in Human Evaluation Guidelines: A
  Preliminary Study Towards Reliable NLG Evaluation
Defining and Detecting Vulnerability in Human Evaluation Guidelines: A Preliminary Study Towards Reliable NLG Evaluation
Jie Ruan
Wenqing Wang
Xiaojun Wan
AAMLELM
227
10
0
12 Jun 2024
HOLMES: Hyper-Relational Knowledge Graphs for Multi-hop Question
  Answering using LLMs
HOLMES: Hyper-Relational Knowledge Graphs for Multi-hop Question Answering using LLMsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Pranoy Panda
Ankush Agarwal
Chaitanya Devaguptapu
Manohar Kaul
Prathosh A P
RALM
235
33
0
10 Jun 2024
Emotion-Aware Speech Self-Supervised Representation Learning with
  Intensity Knowledge
Emotion-Aware Speech Self-Supervised Representation Learning with Intensity KnowledgeInterspeech (Interspeech), 2024
Rui Liu
Zening Ma
SSL
297
2
0
10 Jun 2024
Gentle-CLIP: Exploring Aligned Semantic In Low-Quality Multimodal Data
  With Soft Alignment
Gentle-CLIP: Exploring Aligned Semantic In Low-Quality Multimodal Data With Soft Alignment
Zijia Song
Z. Zang
Yelin Wang
Guozheng Yang
Jiangbin Zheng
Kaicheng Yu
Wanyu Chen
Stan Z. Li
254
0
0
09 Jun 2024
Peer Review as A Multi-Turn and Long-Context Dialogue with Role-Based
  Interactions
Peer Review as A Multi-Turn and Long-Context Dialogue with Role-Based Interactions
Cheng Tan
Dongxin Lyu
Siyuan Li
Zhangyang Gao
Jingxuan Wei
Siqi Ma
Zicheng Liu
Stan Z. Li
LLMAG
211
29
0
09 Jun 2024
Automata Extraction from Transformers
Automata Extraction from Transformers
Yihao Zhang
Zeming Wei
Meng Sun
AI4CE
406
1
0
08 Jun 2024
Integrating Text and Image Pre-training for Multi-modal Algorithmic
  Reasoning
Integrating Text and Image Pre-training for Multi-modal Algorithmic Reasoning
Zijian Zhang
Wei Liu
261
0
0
08 Jun 2024
BERTs are Generative In-Context Learners
BERTs are Generative In-Context LearnersNeural Information Processing Systems (NeurIPS), 2024
David Samuel
231
13
0
07 Jun 2024
DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and
  Effective for LMMs
DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMsNeural Information Processing Systems (NeurIPS), 2024
Lingchen Meng
Jianwei Yang
Rui Tian
Xiyang Dai
Zuxuan Wu
Jianfeng Gao
Yu-Gang Jiang
VLM
268
30
0
06 Jun 2024
Pre-trained Transformer Uncovers Meaningful Patterns in Human Mobility
  Data
Pre-trained Transformer Uncovers Meaningful Patterns in Human Mobility Data
Alameen Najjar
225
1
0
06 Jun 2024
A Survey on Medical Large Language Models: Technology, Application,
  Trustworthiness, and Future Directions
A Survey on Medical Large Language Models: Technology, Application, Trustworthiness, and Future Directions
Lei Liu
Xiaoyan Yang
Junchi Lei
Xiaoyang Liu
Yue Shen
...
Peng Wei
Jinjie Gu
Zhixuan Chu
Zhan Qin
Kui Ren
LM&MAAILaw
264
42
0
06 Jun 2024
RadBARTsum: Domain Specific Adaption of Denoising Sequence-to-Sequence
  Models for Abstractive Radiology Report Summarization
RadBARTsum: Domain Specific Adaption of Denoising Sequence-to-Sequence Models for Abstractive Radiology Report Summarization
Jinge Wu
Abul Hasan
Honghan Wu
125
2
0
05 Jun 2024
Language Model Can Do Knowledge Tracing: Simple but Effective Method to
  Integrate Language Model and Knowledge Tracing Task
Language Model Can Do Knowledge Tracing: Simple but Effective Method to Integrate Language Model and Knowledge Tracing Task
Unggi Lee
Jiyeong Bae
Dohee Kim
Sookbun Lee
Jaekwon Park
Taekyung Ahn
Gunho Lee
Damji Stratton
Hyeoncheol Kim
AI4EdKELM
281
23
0
05 Jun 2024
Using Self-supervised Learning Can Improve Model Fairness
Using Self-supervised Learning Can Improve Model Fairness
Sofia Yfantidou
Dimitris Spathis
Marios Constantinides
Athena Vakali
Daniele Quercia
F. Kawsar
336
9
0
04 Jun 2024
Robust Interaction-based Relevance Modeling for Online E-Commerce and
  LLM-based Retrieval
Robust Interaction-based Relevance Modeling for Online E-Commerce and LLM-based Retrieval
Ben Chen
Huangyu Dai
Xiang Ma
Wen Jiang
Wei Ning
142
0
0
04 Jun 2024
Previous
123...789...596061
Next
Page 8 of 61
Pageof 61