ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.11942
  4. Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations
v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

International Conference on Learning Representations (ICLR), 2019
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSLAIMat
ArXiv (abs)PDFHTMLGithub (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 3,049 papers shown
Enhancing Language Models for Financial Relation Extraction with Named
  Entities and Part-of-Speech
Enhancing Language Models for Financial Relation Extraction with Named Entities and Part-of-Speech
Menglin Li
Kwan Hui Lim
187
1
0
02 May 2024
A Named Entity Recognition and Topic Modeling-based Solution for
  Locating and Better Assessment of Natural Disasters in Social Media
A Named Entity Recognition and Topic Modeling-based Solution for Locating and Better Assessment of Natural Disasters in Social Media
Ayaz Mehmood
Muhammad Tayyab Zamir
Muhammad Asif Ayub
Nasir Ahmad
Kashif Ahmad
109
3
0
01 May 2024
EfficientASR: Speech Recognition Network Compression via Attention
  Redundancy and Chunk-Level FFN Optimization
EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization
Jianzong Wang
Ziqi Liang
Xulong Zhang
Ning Cheng
Jing Xiao
178
1
0
30 Apr 2024
Enhancing Pre-Trained Generative Language Models with Question Attended
  Span Extraction on Machine Reading Comprehension
Enhancing Pre-Trained Generative Language Models with Question Attended Span Extraction on Machine Reading Comprehension
Lin Ai
Zheng Hui
Zizhou Liu
Julia Hirschberg
211
2
0
27 Apr 2024
Transfer Learning Enhanced Single-choice Decision for Multi-choice
  Question Answering
Transfer Learning Enhanced Single-choice Decision for Multi-choice Question Answering
Chenhao Cui
Yufan Jiang
Shuangzhi Wu
Zhoujun Li
FaML
162
0
0
27 Apr 2024
CoSD: Collaborative Stance Detection with Contrastive Heterogeneous
  Topic Graph Learning
CoSD: Collaborative Stance Detection with Contrastive Heterogeneous Topic Graph Learning
Yinghan Cheng
Tao Gui
Chongyang Shi
Liang Xiao
Shufeng Hao
Liang Hu
230
2
0
26 Apr 2024
Exploring Internal Numeracy in Language Models: A Case Study on ALBERT
Exploring Internal Numeracy in Language Models: A Case Study on ALBERT
Ulme Wennberg
G. Henter
MILM
232
2
0
25 Apr 2024
Exploring Learngene via Stage-wise Weight Sharing for Initializing
  Variable-sized Models
Exploring Learngene via Stage-wise Weight Sharing for Initializing Variable-sized Models
Shiyu Xia
Wenxuan Zhu
Xu Yang
Xin Geng
214
5
0
25 Apr 2024
Learning Long-form Video Prior via Generative Pre-Training
Learning Long-form Video Prior via Generative Pre-Training
Jinheng Xie
Jiajun Feng
Zhaoxu Tian
Kevin Qinghong Lin
Yawen Huang
...
Nanxu Gong
Xu Zuo
Jiaqi Yang
Yefeng Zheng
Mike Zheng Shou
233
8
0
24 Apr 2024
A Comprehensive Survey on Evaluating Large Language Model Applications
  in the Medical Industry
A Comprehensive Survey on Evaluating Large Language Model Applications in the Medical Industry
Yining Huang
Keke Tang
Meilian Chen
Boyuan Wang
ELMLM&MA
413
30
0
24 Apr 2024
Mapping Literature Landscapes with Data-Driven Discovery: A Case Study on MOEA/D
Mapping Literature Landscapes with Data-Driven Discovery: A Case Study on MOEA/D
Mingyu Huang
Ke Li
Ke Li
306
1
0
22 Apr 2024
Embarrassingly Simple Unsupervised Aspect Based Sentiment Tuple
  Extraction
Embarrassingly Simple Unsupervised Aspect Based Sentiment Tuple Extraction
Kevin Scaria
Abyn Scaria
Ben Scaria
CoGe
152
0
0
21 Apr 2024
PEACH: Pretrained-embedding Explanation Across Contextual and
  Hierarchical Structure
PEACH: Pretrained-embedding Explanation Across Contextual and Hierarchical Structure
Feiqi Cao
S. Han
Hyunsuk Chung
208
0
0
21 Apr 2024
Explanation based Bias Decoupling Regularization for Natural Language
  Inference
Explanation based Bias Decoupling Regularization for Natural Language Inference
Jianxiang Zang
Hui Liu
188
1
0
20 Apr 2024
Evaluating Subword Tokenization: Alien Subword Composition and OOV
  Generalization Challenge
Evaluating Subword Tokenization: Alien Subword Composition and OOV Generalization Challenge
Khuyagbaatar Batsuren
Ekaterina Vylomova
Verna Dankers
Tsetsuukhei Delgerbaatar
Omri Uzan
Yuval Pinter
Gábor Bella
182
16
0
20 Apr 2024
Transformer-Based Classification Outcome Prediction for Multimodal
  Stroke Treatment
Transformer-Based Classification Outcome Prediction for Multimodal Stroke Treatment
Danqing Ma
Meng Wang
Ao Xiang
Zongqing Qi
Qin Yang
208
23
0
19 Apr 2024
EnriCo: Enriched Representation and Globally Constrained Inference for
  Entity and Relation Extraction
EnriCo: Enriched Representation and Globally Constrained Inference for Entity and Relation Extraction
Urchade Zaratiana
Nadi Tomeh
Yann Dauxais
Pierre Holat
Thierry Charnois
219
0
0
18 Apr 2024
GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation
  Extraction
GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction
Urchade Zaratiana
Nadi Tomeh
Niama El Khbir
Pierre Holat
Thierry Charnois
324
2
0
18 Apr 2024
Enhance Robustness of Language Models Against Variation Attack through
  Graph Integration
Enhance Robustness of Language Models Against Variation Attack through Graph Integration
Ziteng Xiong
Lizhi Qing
Yangyang Kang
Jiawei Liu
Hongsong Li
Changlong Sun
Xiaozhong Liu
Wei Lu
210
2
0
18 Apr 2024
Dynamic Self-adaptive Multiscale Distillation from Pre-trained
  Multimodal Large Model for Efficient Cross-modal Representation Learning
Dynamic Self-adaptive Multiscale Distillation from Pre-trained Multimodal Large Model for Efficient Cross-modal Representation Learning
Zhengyang Liang
Meiyu Liang
Wei Huang
Yawen Li
Zhe Xue
277
1
0
16 Apr 2024
Referring Flexible Image Restoration
Referring Flexible Image Restoration
Runwei Guan
Rongsheng Hu
Zhuhao Zhou
Tianlang Xue
Ka Lok Man
Jeremy S. Smith
Eng Gee Lim
Weiping Ding
Yutao Yue
197
0
0
16 Apr 2024
On the Effects of Fine-tuning Language Models for Text-Based
  Reinforcement Learning
On the Effects of Fine-tuning Language Models for Text-Based Reinforcement Learning
Mauricio G. Gruppi
Soham Dan
K. Murugesan
Subhajit Chaudhury
LLMAG
125
0
0
15 Apr 2024
Navigating the Landscape of Large Language Models: A Comprehensive
  Review and Analysis of Paradigms and Fine-Tuning Strategies
Navigating the Landscape of Large Language Models: A Comprehensive Review and Analysis of Paradigms and Fine-Tuning Strategies
Benjue Weng
LM&MA
287
15
0
13 Apr 2024
VertAttack: Taking advantage of Text Classifiers' horizontal vision
VertAttack: Taking advantage of Text Classifiers' horizontal vision
Jonathan Rusert
AAML
250
3
0
12 Apr 2024
Emerging Property of Masked Token for Effective Pre-training
Emerging Property of Masked Token for Effective Pre-training
Hyesong Choi
Hunsang Lee
Seyoung Joung
Hyejin Park
Jiyeong Kim
Dongbo Min
170
10
0
12 Apr 2024
Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced
  Pre-training
Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training
Hyesong Choi
Hyejin Park
Kwang Moo Yi
Sungmin Cha
Dongbo Min
274
10
0
12 Apr 2024
On Unified Prompt Tuning for Request Quality Assurance in Public Code
  Review
On Unified Prompt Tuning for Request Quality Assurance in Public Code Review
Xinyu Chen
Lin Li
Rui Zhang
Peng Liang
276
1
0
11 Apr 2024
CQIL: Inference Latency Optimization with Concurrent Computation of
  Quasi-Independent Layers
CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent LayersAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Longwei Zou
Qingyang Wang
Han Zhao
Tingfeng Liu
Yi Yang
Yangdong Deng
248
1
0
10 Apr 2024
Dimensionality Reduction in Sentence Transformer Vector Databases with
  Fast Fourier Transform
Dimensionality Reduction in Sentence Transformer Vector Databases with Fast Fourier Transform
Vitaly Bulgakov
Alec Segal
135
5
0
09 Apr 2024
AnchorAL: Computationally Efficient Active Learning for Large and
  Imbalanced Datasets
AnchorAL: Computationally Efficient Active Learning for Large and Imbalanced Datasets
Pietro Lesci
Andreas Vlachos
346
7
0
08 Apr 2024
Chinese Sequence Labeling with Semi-Supervised Boundary-Aware Language
  Model Pre-training
Chinese Sequence Labeling with Semi-Supervised Boundary-Aware Language Model Pre-training
Longhui Zhang
Dingkun Long
Meishan Zhang
Yanzhao Zhang
Pengjun Xie
Min Zhang
299
3
0
08 Apr 2024
OPSD: an Offensive Persian Social media Dataset and its baseline
  evaluations
OPSD: an Offensive Persian Social media Dataset and its baseline evaluations
M. Safayani
Amir Sartipi
Amir Hossein Ahmadi
Parniyan Jalali
Amir Hossein Mansouri
Mohammad Bisheh-Niasar
Zahra Pourbahman
101
0
0
08 Apr 2024
Shortcut-connected Expert Parallelism for Accelerating Mixture-of-Experts
Shortcut-connected Expert Parallelism for Accelerating Mixture-of-Experts
Weilin Cai
Juyong Jiang
Le Qin
Junwei Cui
Sunghun Kim
Jiayi Huang
515
22
0
07 Apr 2024
What Happens When Small Is Made Smaller? Exploring the Impact of
  Compression on Small Data Pretrained Language Models
What Happens When Small Is Made Smaller? Exploring the Impact of Compression on Small Data Pretrained Language Models
Busayo Awobade
Mardiyyah Oduwole
Steven Kolawole
201
1
0
06 Apr 2024
Order-Based Pre-training Strategies for Procedural Text Understanding
Order-Based Pre-training Strategies for Procedural Text Understanding
Abhilash Nandy
Yash Kulkarni
Pawan Goyal
Niloy Ganguly
196
6
0
06 Apr 2024
A Morphology-Based Investigation of Positional Encodings
A Morphology-Based Investigation of Positional Encodings
Poulami Ghosh
Shikhar Vashishth
Mary Dabre
Pushpak Bhattacharyya
223
6
0
06 Apr 2024
Multi-modal Learning for WebAssembly Reverse Engineering
Multi-modal Learning for WebAssembly Reverse EngineeringInternational Symposium on Software Testing and Analysis (ISSTA), 2024
Hanxian Huang
Jishen Zhao
231
5
0
04 Apr 2024
Robust Pronoun Fidelity with English LLMs: Are they Reasoning,
  Repeating, or Just Biased?
Robust Pronoun Fidelity with English LLMs: Are they Reasoning, Repeating, or Just Biased?Transactions of the Association for Computational Linguistics (TACL), 2024
Vagrant Gautam
Eileen Bingert
D. Zhu
Anne Lauscher
Dietrich Klakow
325
14
0
04 Apr 2024
Revisiting subword tokenization: A case study on affixal negation in
  large language models
Revisiting subword tokenization: A case study on affixal negation in large language modelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Thinh Hung Truong
Yulia Otmakhova
Karin Verspoor
Trevor Cohn
Timothy Baldwin
210
4
0
03 Apr 2024
Linear Attention Sequence Parallelism
Linear Attention Sequence Parallelism
Weigao Sun
Zhen Qin
Dong Li
Xuyang Shen
Yu Qiao
Yiran Zhong
385
5
0
03 Apr 2024
Digital Forgetting in Large Language Models: A Survey of Unlearning
  Methods
Digital Forgetting in Large Language Models: A Survey of Unlearning MethodsArtificial Intelligence Review (Artif Intell Rev), 2024
Alberto Blanco-Justicia
N. Jebreel
Benet Manzanares-Salor
David Sánchez
Josep Domingo-Ferrer
Guillem Collell
Kuan Eeik Tan
KELMMU
336
41
0
02 Apr 2024
Deconstructing In-Context Learning: Understanding Prompts via Corruption
Deconstructing In-Context Learning: Understanding Prompts via CorruptionInternational Conference on Language Resources and Evaluation (LREC), 2024
Namrata Shivagunde
Vladislav Lialin
Sherin Muckatira
Anna Rumshisky
353
8
0
02 Apr 2024
Semantic Augmentation in Images using Language
Semantic Augmentation in Images using Language
Sahiti Yerramilli
Jayant Sravan Tamarapalli
Tanmay Girish Kulkarni
Jonathan M Francis
Eric Nyberg
DiffMVLM
231
8
0
02 Apr 2024
Green AI: Exploring Carbon Footprints, Mitigation Strategies, and Trade
  Offs in Large Language Model Training
Green AI: Exploring Carbon Footprints, Mitigation Strategies, and Trade Offs in Large Language Model Training
Vivian Liu
Yiqiao Yin
308
46
0
01 Apr 2024
Efficient Prompting Methods for Large Language Models: A Survey
Efficient Prompting Methods for Large Language Models: A Survey
Kaiyan Chang
Songcheng Xu
Chenglong Wang
Yingfeng Luo
Tong Xiao
Jingbo Zhu
LRM
408
47
0
01 Apr 2024
Efficiently Distilling LLMs for Edge Applications
Efficiently Distilling LLMs for Edge Applications
Achintya Kundu
Fabian Lim
Aaron Chew
L. Wynter
Penny Chong
Rhui Dih Lee
220
10
0
01 Apr 2024
CoUDA: Coherence Evaluation via Unified Data Augmentation
CoUDA: Coherence Evaluation via Unified Data Augmentation
Dawei Zhu
Wenhao Wu
Yifan Song
Fangwei Zhu
Ziqiang Cao
Sujian Li
147
1
0
31 Mar 2024
Addressing Both Statistical and Causal Gender Fairness in NLP Models
Addressing Both Statistical and Causal Gender Fairness in NLP Models
Hannah Chen
Yangfeng Ji
David Evans
313
5
0
30 Mar 2024
A Comprehensive Study on NLP Data Augmentation for Hate Speech
  Detection: Legacy Methods, BERT, and LLMs
A Comprehensive Study on NLP Data Augmentation for Hate Speech Detection: Legacy Methods, BERT, and LLMs
Md Saroar Jahan
Mourad Oussalah
D. Beddiar
Jhuma Kabir Mim
Nabil Arhab
182
16
0
30 Mar 2024
Classifying Conspiratorial Narratives At Scale: False Alarms and
  Erroneous Connections
Classifying Conspiratorial Narratives At Scale: False Alarms and Erroneous Connections
Ahmad Diab
Rr. Nefriana
Yu-Ru Lin
182
9
0
29 Mar 2024
Previous
123...91011...596061
Next
Page 10 of 61
Pageof 61