Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1909.11942
Cited By
v1
v2
v3
v4
v5
v6 (latest)
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
International Conference on Learning Representations (ICLR), 2019
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Github (3271★)
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 3,049 papers shown
Second-Order Fine-Tuning without Pain for LLMs:A Hessian Informed Zeroth-Order Optimizer
Yanjun Zhao
Sizhe Dang
Haishan Ye
Guang Dai
Yi Qian
Ivor W.Tsang
677
29
0
23 Feb 2024
An Explainable Transformer-based Model for Phishing Email Detection: A Large Language Model Approach
Mohammad Amaz Uddin
Md Mahiuddin
Iqbal H. Sarker
199
41
0
21 Feb 2024
EvoGrad: A Dynamic Take on the Winograd Schema Challenge with Human Adversaries
Jing Han Sun
Ali Emami
275
6
0
20 Feb 2024
Detecting misinformation through Framing Theory: the Frame Element-based Model
Guan-Hua Wang
Rebecca Frederick
Jinglong Duan
William Wong
V. Rupar
Weihua Li
Quan-wei Bai
201
8
0
19 Feb 2024
Head-wise Shareable Attention for Large Language Models
Zouying Cao
Yifei Yang
Hai Zhao
174
5
0
19 Feb 2024
Utilizing BERT for Information Retrieval: Survey, Applications, Resources, and Challenges
Jiajia Wang
Jimmy Xiangji Huang
Xinhui Tu
Junmei Wang
Angela J. Huang
Md Tahmid Rahman Laskar
Amran Bhuiyan
353
94
0
18 Feb 2024
Puzzle Solving using Reasoning of Large Language Models: A Survey
Panagiotis Giadikiaroglou
Maria Lymperaiou
Giorgos Filandrianos
Giorgos Stamou
ELM
ReLM
LRM
378
52
0
17 Feb 2024
EEG2Rep: Enhancing Self-supervised EEG Representation Through Informative Masked Inputs
Navid Mohammadi Foumani
G. Mackellar
Soheila Ghane
Saad Irtza
Nam Nguyen
Mahsa Salehi
331
39
0
17 Feb 2024
A Question Answering Based Pipeline for Comprehensive Chinese EHR Information Extraction
Huaiyuan Ying
Sheng Yu
MedIm
130
0
0
17 Feb 2024
Enhancing ESG Impact Type Identification through Early Fusion and Multilingual Models
Hariram Veeramani
Surendrabikram Thapa
Usman Naseem
162
6
0
16 Feb 2024
Understanding Survey Paper Taxonomy about Large Language Models via Graph Representation Learning
Jun Zhuang
C. Kennington
95
14
0
16 Feb 2024
Reusing Softmax Hardware Unit for GELU Computation in Transformers
C. Peltekis
K. Alexandridis
G. Dimitrakopoulos
126
9
0
15 Feb 2024
OrderBkd: Textual backdoor attack through repositioning
Irina Alekseevskaia
Konstantin Arkhipenko
228
5
0
12 Feb 2024
Large Language Models: A Survey
Shervin Minaee
Tomas Mikolov
Narjes Nikzad
M. Asgari-Chenaghlu
R. Socher
Xavier Amatriain
Jianfeng Gao
ALM
LM&MA
ELM
847
779
0
09 Feb 2024
Traditional Machine Learning Models and Bidirectional Encoder Representations From Transformer (BERT)-Based Automatic Classification of Tweets About Eating Disorders: Algorithm Development and Validation Study
J. Benítez-Andrades
José-Manuel Alija-Pérez
Maria-Esther Vidal
R. Pastor-Vargas
María Teresa García-Ordás
154
44
0
08 Feb 2024
Empowering machine learning models with contextual knowledge for enhancing the detection of eating disorders in social media posts
J. Benítez-Andrades
María Teresa García-Ordás
Mayra Russo
Ahmad Sakor
Luis Daniel Fernandes Rotger
Maria-Esther Vidal
AI4MH
235
8
0
08 Feb 2024
Improving Agent Interactions in Virtual Environments with Language Models
Jack Zhang
LLMAG
169
0
0
08 Feb 2024
Triplet Interaction Improves Graph Transformers: Accurate Molecular Graph Learning with Triplet Graph Transformers
Md Shamim Hussain
Mohammed J Zaki
D. Subramanian
ViT
383
15
0
07 Feb 2024
Lens: A Knowledge-Guided Foundation Model for Network Traffic
Qineng Wang
Chen Qian
Xiaochang Li
Ziyu Yao
Huajie Shao
Ziyu Yao
Bo Ji
Long Cheng
Gang Zhou
Huajie Shao
170
7
0
06 Feb 2024
DE
3
^3
3
-BERT: Distance-Enhanced Early Exiting for BERT based on Prototypical Networks
Jianing He
Tao Gui
Weiping Ding
Duoqian Miao
Jun Zhao
Liang Hu
LongBing Cao
200
6
0
03 Feb 2024
Fractal Patterns May Illuminate the Success of Next-Token Prediction
Ibrahim Alabdulmohsin
Vinh Q. Tran
Mostafa Dehghani
178
4
0
02 Feb 2024
Distractor Generation for Multiple-Choice Questions: A Survey of Methods, Datasets, and Evaluation
Elaf Alhazmi
Quan Z. Sheng
W. Zhang
Munazza Zaib
A. Alhazmi
AI4Ed
228
1
0
02 Feb 2024
Dive into the Chasm: Probing the Gap between In- and Cross-Topic Generalization
Andreas Waldis
Yufang Hou
Iryna Gurevych
ELM
229
9
0
02 Feb 2024
Investigating Recurrent Transformers with Dynamic Halt
Jishnu Ray Chowdhury
Cornelia Caragea
554
3
0
01 Feb 2024
Comparing Template-based and Template-free Language Model Probing
Sagi Shaier
Kevin Bennett
Lawrence E Hunter
Katharina von der Wense
ELM
291
6
0
31 Jan 2024
Desiderata for the Context Use of Question Answering Systems
Sagi Shaier
Lawrence E Hunter
Katharina von der Wense
352
6
0
31 Jan 2024
PipeNet: Question Answering with Semantic Pruning over Knowledge Graphs
Ying Su
Jipeng Zhang
Yangqiu Song
Tong Zhang
311
2
0
31 Jan 2024
Fine-tuning Transformer-based Encoder for Turkish Language Understanding Tasks
Savas Yildirim
123
15
0
30 Jan 2024
When Large Language Models Meet Vector Databases: A Survey
Zhi Jing
Yongye Su
Yikun Han
Bo Yuan
Haiyun Xu
Chunjiang Liu
Kehai Chen
Min Zhang
450
73
0
30 Jan 2024
GuReT: Distinguishing Guilt and Regret related Text
S. Butt
F. Balouchzahi
Abdul Gafar Manuel Meque
Maaz Amjad
Hector G. Ceballos Cancino
Grigori Sidorov
Alexander Gelbukh
127
2
0
29 Jan 2024
X-PEFT: eXtremely Parameter-Efficient Fine-Tuning for Extreme Multi-Profile Scenarios
Namju Kwak
Taesup Kim
MoE
109
0
0
29 Jan 2024
BPDec: Unveiling the Potential of Masked Language Modeling Decoder in BERT pretraining
International Conference on Neural Information Processing (ICONIP), 2024
Wen-Chieh Liang
Youzhi Liang
OffRL
131
2
0
29 Jan 2024
Credit Risk Meets Large Language Models: Building a Risk Indicator from Loan Descriptions in P2P Lending
Mario Sanz-Guerrero
Javier Arroyo
356
11
0
29 Jan 2024
Quantifying Stereotypes in Language
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2024
Yang Liu
213
5
0
28 Jan 2024
Semantics of Multiword Expressions in Transformer-Based Models: A Survey
Transactions of the Association for Computational Linguistics (TACL), 2024
Filip Miletić
Sabine Schulte im Walde
279
13
0
27 Jan 2024
A Comprehensive Survey of Compression Algorithms for Language Models
Seungcheol Park
Jaehyeon Choi
Sojin Lee
U. Kang
MQ
334
20
0
27 Jan 2024
Socially Aware Synthetic Data Generation for Suicidal Ideation Detection Using Large Language Models
IEEE Access (IEEE Access), 2024
Hamideh Ghanadian
I. Nejadgholi
Hussein Al Osman
SyDa
198
35
0
25 Jan 2024
A Comparative Analysis of Noise Reduction Methods in Sentiment Analysis on Noisy Bangla Texts
Kazi Toufique Elahi
Tasnuva Binte Rahman
Shakil Shahriar
Samir Sarker
Md. Tanvir Rouf Shawon
G. M. Shahariar
176
5
0
25 Jan 2024
Rethinking Patch Dependence for Masked Autoencoders
Letian Fu
Long Lian
Renhao Wang
Baifeng Shi
Xudong Wang
Adam Yala
Trevor Darrell
Alexei A. Efros
Ken Goldberg
344
32
0
25 Jan 2024
SEDAC: A CVAE-Based Data Augmentation Method for Security Bug Report Identification
Y. Liao
T. Zhang
22
0
0
22 Jan 2024
Freely Long-Thinking Transformer (FraiLT)
Akbay Tabak
114
0
0
21 Jan 2024
Robust Evaluation Measures for Evaluating Social Biases in Masked Language Models
AAAI Conference on Artificial Intelligence (AAAI), 2024
Yang Liu
137
5
0
21 Jan 2024
Instructional Fingerprinting of Large Language Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Lyne Tchapmi
Fei Wang
Mingyu Derek Ma
Pang Wei Koh
Chaowei Xiao
Muhao Chen
WaLM
278
58
0
21 Jan 2024
Finding a Needle in the Adversarial Haystack: A Targeted Paraphrasing Approach For Uncovering Edge Cases with Minimal Distribution Distortion
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2024
Aly M. Kassem
Sherif Saad
AAML
301
3
0
21 Jan 2024
Attentive Fusion: A Transformer-based Approach to Multimodal Hate Speech Detection
Atanu Mandal
Gargi Roy
Amit Barman
Indranil Dutta
S. Naskar
115
5
0
19 Jan 2024
Learning High-Quality and General-Purpose Phrase Representations
Lihu Chen
Gaël Varoquaux
Fabian M. Suchanek
295
8
0
18 Jan 2024
Preparing Lessons for Progressive Training on Language Models
AAAI Conference on Artificial Intelligence (AAAI), 2024
Yu Pan
Ye Yuan
Yichun Yin
Jiaxin Shi
Zenglin Xu
Ming Zhang
Lifeng Shang
Xin Jiang
Qun Liu
268
13
0
17 Jan 2024
CEL: A Continual Learning Model for Disease Outbreak Prediction by Leveraging Domain Adaptation via Elastic Weight Consolidation
bioRxiv (bioRxiv), 2024
Saba Aslam
Abdur Rasool
Hongyan Wu
Xiaoli Li
197
14
0
17 Jan 2024
Fixed Point Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2024
Xingjian Bai
Luke Melas-Kyriazi
233
4
0
16 Jan 2024
The What, Why, and How of Context Length Extension Techniques in Large Language Models -- A Detailed Survey
Saurav Pawar
S.M. Towhidul Islam Tonmoy
S. M. M. Zaman
Vinija Jain
Vasu Sharma
Amitava Das
215
41
0
15 Jan 2024
Previous
1
2
3
...
11
12
13
...
59
60
61
Next
Page 12 of 61
Page
of 61
Go