1909.11942
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 2,911 papers shown
Title
Pseudo-Label Calibration Semi-supervised Multi-Modal Entity Alignment
Luyao Wang
Pengnian Qi
Xigang Bao
Chunlai Zhou
Biao Qin
25
9
0
02 Mar 2024
ATP: Enabling Fast LLM Serving via Attention on Top Principal Keys
Yue Niu
Saurav Prakash
Salman Avestimehr
26
1
0
01 Mar 2024
Hierarchical Indexing for Retrieval-Augmented Opinion Summarization
Tom Hosking
Hao Tang
Mirella Lapata
29
2
0
01 Mar 2024
Rethinking Tokenization: Crafting Better Tokenizers for Large Language Models
Jinbiao Yang
LLMAG
73
11
0
01 Mar 2024
Cause and Effect: Can Large Language Models Truly Understand Causality?
Swagata Ashwani
Kshiteesh Hegde
Nishith Reddy Mannuru
Mayank Jindal
Dushyant Singh Sengar
Krishna Chaitanya Rao Kathala
Dishant Banga
Vinija Jain
Aman Chadha
LRM
40
18
0
28 Feb 2024
Securing Reliability: A Brief Overview on Enhancing In-Context Learning for Foundation Models
Yunpeng Huang
Yaonan Gu
Jingwei Xu
Zhihong Zhu
Zhaorun Chen
Xiaoxing Ma
35
3
0
27 Feb 2024
Fine-Grained Natural Language Inference Based Faithfulness Evaluation for Diverse Summarisation Tasks
Huajian Zhang
Yumo Xu
Laura Perez-Beltrachini
HILM
24
9
0
27 Feb 2024
Feature Re-Embedding: Towards Foundation Model-Level Performance in Computational Pathology
Wenhao Tang
Fengtao Zhou
Shengyue Huang
Xiang Zhu
Yi Zhang
Bo Liu
42
20
0
27 Feb 2024
Generating Effective Ensembles for Sentiment Analysis
Itay Etelis
Avi Rosenfeld
Abraham Itzhak Weinberg
David Sarne
35
2
0
26 Feb 2024
Unveiling Vulnerability of Self-Attention
Khai Jiet Liong
Hongqiu Wu
Haizhen Zhao
28
0
0
26 Feb 2024
Layer-wise Regularized Dropout for Neural Language Models
Shiwen Ni
Min Yang
Ruifeng Xu
Chengming Li
Xiping Hu
30
0
0
26 Feb 2024
QASE Enhanced PLMs: Improved Control in Text Generation for MRC
Lin Ai
Zheng Hui
Zizhou Liu
Julia Hirschberg
29
0
0
26 Feb 2024
OAG-Bench: A Human-Curated Benchmark for Academic Graph Mining
Fanjin Zhang
Shijie Shi
Yifan Zhu
Bo Chen
Yukuo Cen
...
Huihui Yuan
Jian Song
Xiaoyan Li
Yuxiao Dong
Jie Tang
42
15
0
24 Feb 2024
Prejudice and Volatility: A Statistical Framework for Measuring Social Discrimination in Large Language Models
Yiran Liu
Ke Yang
Zehan Qi
Xiao-Yang Liu
Yang Yu
U. I. Urbana-Champaign
39
1
0
23 Feb 2024
Second-Order Fine-Tuning without Pain for LLMs: A Hessian Informed Zeroth-Order Optimizer
Yanjun Zhao
Sizhe Dang
Haishan Ye
Guang Dai
Yi Qian
Ivor W. Tsang
66
8
0
23 Feb 2024
COBIAS: Assessing the Contextual Reliability of Bias Benchmarks for Language Models
Priyanshul Govil
Hemang Jain
Vamshi Bonagiri
Aman Chadha
Ponnurangam Kumaraguru
Manas Gaur
S. Dey
47
2
0
22 Feb 2024
An Explainable Transformer-based Model for Phishing Email Detection: A Large Language Model Approach
Mohammad Amaz Uddin
Iqbal H. Sarker
36
14
0
21 Feb 2024
EvoGrad: A Dynamic Take on the Winograd Schema Challenge with Human Adversaries
Jing Han Sun
Ali Emami
32
3
0
20 Feb 2024
Detecting misinformation through Framing Theory: the Frame Element-based Model
Guan-Hua Wang
Rebecca Frederick
Jinglong Duan
William Wong
V. Rupar
Weihua Li
Quan-wei Bai
27
2
0
19 Feb 2024
Head-wise Shareable Attention for Large Language Models
Zouying Cao
Yifei Yang
Hai Zhao
36
4
0
19 Feb 2024
Utilizing BERT for Information Retrieval: Survey, Applications, Resources, and Challenges
Jiajia Wang
Jimmy X. Huang
Xinhui Tu
Junmei Wang
Angela J. Huang
Md Tahmid Rahman Laskar
Amran Bhuiyan
34
28
0
18 Feb 2024
Puzzle Solving using Reasoning of Large Language Models: A Survey
Panagiotis Giadikiaroglou
Maria Lymperaiou
Giorgos Filandrianos
Giorgos Stamou
ELM
ReLM
LRM
11
24
0
17 Feb 2024
EEG2Rep: Enhancing Self-supervised EEG Representation Through Informative Masked Inputs
Navid Mohammadi Foumani
G. Mackellar
Soheila Ghane
Saad Irtza
Nam Nguyen
Mahsa Salehi
12
14
0
17 Feb 2024
A Question Answering Based Pipeline for Comprehensive Chinese EHR Information Extraction
Huaiyuan Ying
Sheng Yu
MedIm
22
0
0
17 Feb 2024
Enhancing ESG Impact Type Identification through Early Fusion and Multilingual Models
Hariram Veeramani
Surendrabikram Thapa
Usman Naseem
11
5
0
16 Feb 2024
Understanding Survey Paper Taxonomy about Large Language Models via Graph Representation Learning
Jun Zhuang
C. Kennington
16
9
0
16 Feb 2024
Reusing Softmax Hardware Unit for GELU Computation in Transformers
C. Peltekis
K. Alexandridis
G. Dimitrakopoulos
19
0
0
15 Feb 2024
OrderBkd: Textual backdoor attack through repositioning
Irina Alekseevskaia
Konstantin Arkhipenko
22
2
0
12 Feb 2024
Large Language Models: A Survey
Shervin Minaee
Tomáš Mikolov
Narjes Nikzad
M. Asgari-Chenaghlu
R. Socher
Xavier Amatriain
Jianfeng Gao
ALM
LM&MA
ELM
120
364
0
09 Feb 2024
Traditional Machine Learning Models and Bidirectional Encoder Representations From Transformer (BERT)-Based Automatic Classification of Tweets About Eating Disorders: Algorithm Development and Validation Study
J. Benítez-Andrades
José-Manuel Alija-Pérez
Maria-Esther Vidal
R. Pastor-Vargas
María Teresa García-Ordás
13
36
0
08 Feb 2024
Empowering machine learning models with contextual knowledge for enhancing the detection of eating disorders in social media posts
J. Benítez-Andrades
María Teresa García-Ordás
Mayra Russo
Ahmad Sakor
Luis Daniel Fernandes Rotger
Maria-Esther Vidal
AI4MH
82
3
0
08 Feb 2024
Improving Agent Interactions in Virtual Environments with Language Models
Jack Zhang
LLMAG
24
0
0
08 Feb 2024
Triplet Interaction Improves Graph Transformers: Accurate Molecular Graph Learning with Triplet Graph Transformers
Md Shamim Hussain
Mohammed J. Zaki
D. Subramanian
ViT
26
5
0
07 Feb 2024
DE^3-BERT: Distance-Enhanced Early Exiting for BERT based on Prototypical Networks
Jianing He
Qi Zhang
Weiping Ding
Duoqian Miao
Jun Zhao
Liang Hu
LongBing Cao
34
3
0
03 Feb 2024
Fractal Patterns May Illuminate the Success of Next-Token Prediction
Ibrahim M. Alabdulmohsin
Vinh Q. Tran
Mostafa Dehghani
29
2
0
02 Feb 2024
Distractor Generation for Multiple-Choice Questions: A Survey of Methods, Datasets, and Evaluation
Elaf Alhazmi
Quan Z. Sheng
W. Zhang
Munazza Zaib
A. Alhazmi
AI4Ed
38
1
0
02 Feb 2024
Dive into the Chasm: Probing the Gap between In- and Cross-Topic Generalization
Andreas Waldis
Yufang Hou
Iryna Gurevych
ELM
24
7
0
02 Feb 2024
Investigating Recurrent Transformers with Dynamic Halt
Jishnu Ray Chowdhury
Cornelia Caragea
39
1
0
01 Feb 2024
Comparing Template-based and Template-free Language Model Probing
Sagi Shaier
Kevin Bennett
Lawrence E Hunter
K. Wense
ELM
28
3
0
31 Jan 2024
Desiderata for the Context Use of Question Answering Systems
Sagi Shaier
Lawrence E Hunter
K. Wense
28
4
0
31 Jan 2024
PipeNet: Question Answering with Semantic Pruning over Knowledge Graphs
Ying Su
Jipeng Zhang
Yangqiu Song
Tong Zhang
30
0
0
31 Jan 2024
When Large Language Models Meet Vector Databases: A Survey
Zhi Jing
Yongye Su
Yikun Han
Bo Yuan
Haiyun Xu
Chunjiang Liu
Kehai Chen
Min Zhang
53
35
0
30 Jan 2024
Fine-tuning Transformer-based Encoder for Turkish Language Understanding Tasks
Savas Yildirim
6
5
0
30 Jan 2024
GuReT: Distinguishing Guilt and Regret related Text
S. Butt
F. Balouchzahi
Abdul Gafar Manuel Meque
Maaz Amjad
Hector G. Ceballos Cancino
Grigori Sidorov
Alexander Gelbukh
17
0
0
29 Jan 2024
X-PEFT: eXtremely Parameter-Efficient Fine-Tuning for Extreme Multi-Profile Scenarios
Namju Kwak
Taesup Kim
MoE
13
0
0
29 Jan 2024
BPDec: Unveiling the Potential of Masked Language Modeling Decoder in BERT pretraining
Wen-Chieh Liang
Youzhi Liang
OffRL
23
2
0
29 Jan 2024
Credit Risk Meets Large Language Models: Building a Risk Indicator from Loan Descriptions in P2P Lending
Mario Sanz-Guerrero
Javier Arroyo
28
4
0
29 Jan 2024
Quantifying Stereotypes in Language
Yang Liu
30
1
0
28 Jan 2024
Semantics of Multiword Expressions in Transformer-Based Models: A Survey
Filip Miletic
Sabine Schulte im Walde
40
6
0
27 Jan 2024
A Comprehensive Survey of Compression Algorithms for Language Models
Seungcheol Park
Jaehyeon Choi
Sojin Lee
U. Kang
MQ
24
12
0
27 Jan 2024