ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

International Conference on Learning Representations (ICLR), 2020

26 September 2019

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 3,049 papers shown

Second-Order Fine-Tuning without Pain for LLMs: A Hessian Informed Zeroth-Order Optimizer

677

23 Feb 2024

An Explainable Transformer-based Model for Phishing Email Detection: A Large Language Model Approach

Mohammad Amaz Uddin

Md Mahiuddin

Iqbal H. Sarker

199

21 Feb 2024

EvoGrad: A Dynamic Take on the Winograd Schema Challenge with Human Adversaries

Jing Han Sun

Ali Emami

275

20 Feb 2024

Detecting misinformation through Framing Theory: the Frame Element-based Model

201

19 Feb 2024

Head-wise Shareable Attention for Large Language Models

Zouying Cao

Yifei Yang

Hai Zhao

174

19 Feb 2024

Utilizing BERT for Information Retrieval: Survey, Applications, Resources, and Challenges

Md Tahmid Rahman Laskar

Amran Bhuiyan

353

18 Feb 2024

Puzzle Solving using Reasoning of Large Language Models: A Survey

Panagiotis Giadikiaroglou

Maria Lymperaiou

Giorgos Filandrianos

Giorgos Stamou

ELM ReLM LRM

378

17 Feb 2024

EEG2Rep: Enhancing Self-supervised EEG Representation Through Informative Masked Inputs

Navid Mohammadi Foumani

331

17 Feb 2024

A Question Answering Based Pipeline for Comprehensive Chinese EHR Information Extraction

Huaiyuan Ying

Sheng Yu

MedIm

130

17 Feb 2024

Enhancing ESG Impact Type Identification through Early Fusion and Multilingual Models

Hariram Veeramani

Surendrabikram Thapa

Usman Naseem

162

16 Feb 2024

Understanding Survey Paper Taxonomy about Large Language Models via Graph Representation Learning

Jun Zhuang

C. Kennington

16 Feb 2024

Reusing Softmax Hardware Unit for GELU Computation in Transformers

C. Peltekis

K. Alexandridis

G. Dimitrakopoulos

126

15 Feb 2024

OrderBkd: Textual backdoor attack through repositioning

Irina Alekseevskaia

Konstantin Arkhipenko

228

12 Feb 2024

Large Language Models: A Survey

847

779

09 Feb 2024

Traditional Machine Learning Models and Bidirectional Encoder Representations From Transformer (BERT)-Based Automatic Classification of Tweets About Eating Disorders: Algorithm Development and Validation Study

J. Benítez-Andrades

José-Manuel Alija-Pérez

Maria-Esther Vidal

R. Pastor-Vargas

María Teresa García-Ordás

154

08 Feb 2024

Empowering machine learning models with contextual knowledge for enhancing the detection of eating disorders in social media posts

J. Benítez-Andrades

María Teresa García-Ordás

Mayra Russo

Ahmad Sakor

Luis Daniel Fernandes Rotger

Maria-Esther Vidal

AI4MH

235

08 Feb 2024

Improving Agent Interactions in Virtual Environments with Language Models

Jack Zhang

LLMAG

169

08 Feb 2024

Triplet Interaction Improves Graph Transformers: Accurate Molecular Graph Learning with Triplet Graph Transformers

383

07 Feb 2024

Lens: A Knowledge-Guided Foundation Model for Network Traffic

Ziyu Yao

Bo Ji

Long Cheng

Gang Zhou

Huajie Shao

170

06 Feb 2024

DE^3-BERT: Distance-Enhanced Early Exiting for BERT based on Prototypical Networks

Duoqian Miao

200

03 Feb 2024

Fractal Patterns May Illuminate the Success of Next-Token Prediction

Ibrahim Alabdulmohsin

Vinh Q. Tran

Mostafa Dehghani

178

02 Feb 2024

Distractor Generation for Multiple-Choice Questions: A Survey of Methods, Datasets, and Evaluation

228

02 Feb 2024

Dive into the Chasm: Probing the Gap between In- and Cross-Topic Generalization

229

02 Feb 2024

Investigating Recurrent Transformers with Dynamic Halt

Jishnu Ray Chowdhury

Cornelia Caragea

554

01 Feb 2024

Comparing Template-based and Template-free Language Model Probing

Sagi Shaier

Kevin Bennett

Lawrence E Hunter

Katharina von der Wense

ELM

291

31 Jan 2024

Desiderata for the Context Use of Question Answering Systems

Sagi Shaier

Lawrence E Hunter

Katharina von der Wense

352

31 Jan 2024

PipeNet: Question Answering with Semantic Pruning over Knowledge Graphs

Ying Su

Jipeng Zhang

Yangqiu Song

Tong Zhang

311

31 Jan 2024

Fine-tuning Transformer-based Encoder for Turkish Language Understanding Tasks

Savas Yildirim

123

30 Jan 2024

When Large Language Models Meet Vector Databases: A Survey

450

30 Jan 2024

GuReT: Distinguishing Guilt and Regret related Text

S. Butt

F. Balouchzahi

Abdul Gafar Manuel Meque

Maaz Amjad

Hector G. Ceballos Cancino

Grigori Sidorov

Alexander Gelbukh

127

29 Jan 2024

X-PEFT: eXtremely Parameter-Efficient Fine-Tuning for Extreme Multi-Profile Scenarios

Namju Kwak

Taesup Kim

MoE

109

29 Jan 2024

BPDec: Unveiling the Potential of Masked Language Modeling Decoder in BERT pretraining

International Conference on Neural Information Processing (ICONIP), 2024

Wen-Chieh Liang

Youzhi Liang

OffRL

131

29 Jan 2024

Credit Risk Meets Large Language Models: Building a Risk Indicator from Loan Descriptions in P2P Lending

Mario Sanz-Guerrero

Javier Arroyo

356

29 Jan 2024

Quantifying Stereotypes in Language

Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2024

Yang Liu

213

28 Jan 2024

Semantics of Multiword Expressions in Transformer-Based Models: A Survey

Transactions of the Association for Computational Linguistics (TACL), 2024

Filip Miletić

Sabine Schulte im Walde

279

27 Jan 2024

A Comprehensive Survey of Compression Algorithms for Language Models

334

27 Jan 2024

Socially Aware Synthetic Data Generation for Suicidal Ideation Detection Using Large Language Models

IEEE Access, 2024

198

25 Jan 2024

A Comparative Analysis of Noise Reduction Methods in Sentiment Analysis on Noisy Bangla Texts

Md. Tanvir Rouf Shawon

G. M. Shahariar

176

25 Jan 2024

Rethinking Patch Dependence for Masked Autoencoders

Letian Fu

344

25 Jan 2024

SEDAC: A CVAE-Based Data Augmentation Method for Security Bug Report Identification

Y. Liao

T. Zhang

22 Jan 2024

Freely Long-Thinking Transformer (FraiLT)

Akbay Tabak

114

21 Jan 2024

Robust Evaluation Measures for Evaluating Social Biases in Masked Language Models

AAAI Conference on Artificial Intelligence (AAAI), 2024

Yang Liu

137

21 Jan 2024

Instructional Fingerprinting of Large Language Models

North American Chapter of the Association for Computational Linguistics (NAACL), 2024

Pang Wei Koh

278

21 Jan 2024

Finding a Needle in the Adversarial Haystack: A Targeted Paraphrasing Approach For Uncovering Edge Cases with Minimal Distribution Distortion

Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2024

Aly M. Kassem

Sherif Saad

AAML

301

21 Jan 2024

Attentive Fusion: A Transformer-based Approach to Multimodal Hate Speech Detection

115

19 Jan 2024

Learning High-Quality and General-Purpose Phrase Representations

Lihu Chen

Gaël Varoquaux

Fabian M. Suchanek

295

18 Jan 2024

Preparing Lessons for Progressive Training on Language Models

AAAI Conference on Artificial Intelligence (AAAI), 2024

Lifeng Shang

Xin Jiang

Qun Liu

268

17 Jan 2024

CEL: A Continual Learning Model for Disease Outbreak Prediction by Leveraging Domain Adaptation via Elastic Weight Consolidation

bioRxiv, 2024

197

17 Jan 2024

Fixed Point Diffusion Models

Computer Vision and Pattern Recognition (CVPR), 2024

Xingjian Bai

Luke Melas-Kyriazi

233

16 Jan 2024

The What, Why, and How of Context Length Extension Techniques in Large Language Models -- A Detailed Survey

Saurav Pawar

S.M. Towhidul Islam Tonmoy

S. M. M. Zaman

Vinija Jain

Vasu Sharma

Amitava Das

215

15 Jan 2024