v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

International Conference on Learning Representations (ICLR), 2019

26 September 2019

ArXiv (abs)PDF HTML Github (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 3,050 papers shown

Diversifying the Mixture-of-Experts Representation for Language Models with Orthogonal Optimizer

Boan Liu

Liang Ding

Li Shen

212

15 Oct 2023

CarExpert: Leveraging Large Language Models for In-Car Conversational Question Answering

Md. Rony

Christian Suess

Sinchana Ramakanth Bhat

230

14 Oct 2023

Low-Resource Clickbait Spoiling for Indonesian via Question Answering

Ni Putu Intan Maharani

Ayu Purwarianti

Alham Fikri Aji

167

12 Oct 2023

To token or not to token: A Comparative Study of Text Representations for Cross-Lingual Transfer

Md. Mushfiqur Rahman

Fardin Ahsan Sakib

Fahim Faisal

Antonios Anastasopoulos

219

12 Oct 2023

Pit One Against Many: Leveraging Attention-head Embeddings for Parameter-efficient Multi-head AttentionConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Huiyin Xue

Nikolaos Aletras

330

11 Oct 2023

On the Relationship between Sentence Analogy Identification and Sentence Structure Encoding in Large Language ModelsFindings (Findings), 2023

Thilini Wijesiriwardene

Ruwan Wickramarachchi

Vinija Jain

287

11 Oct 2023

The Temporal Structure of Language Processing in the Human Brain Corresponds to The Layered Hierarchy of Deep Language ModelsbioRxiv (bioRxiv), 2023

...

263

11 Oct 2023

Sparse Universal TransformerConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Shawn Tan

Songlin Yang

Zhenfang Chen

Aaron Courville

Chuang Gan

MoE

266

11 Oct 2023

A Comparative Study of Transformer-based Neural Text Representation Techniques on Bug TriagingInternational Conference on Automated Software Engineering (ASE), 2023

Atish Kumar Dipongkor

Kevin Moran

10 Oct 2023

P5: Plug-and-Play Persona Prompting for Personalized Response SelectionConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Joosung Lee

Min Sik Oh

Donghun Lee

205

10 Oct 2023

Model Tuning or Prompt Tuning? A Study of Large Language Models for Clinical Concept and Relation ExtractionJournal of Biomedical Informatics (JBI), 2023

Zehao Yu

Jiang Bian

206

10 Oct 2023

Evolution of Natural Language Processing Technology: Not Just Language Processing Towards General Purpose AI

Masahiro Yamamoto

187

10 Oct 2023

LLM for SoC Security: A Paradigm ShiftIEEE Access (IEEE Access), 2023

360

09 Oct 2023

IDTraffickers: An Authorship Attribution Dataset to link and connect Potential Human-Trafficking Operations on Text Escort AdvertisementsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

V. Saxena

Benjamin Bashpole

Gijs Van Dijck

Gerasimos Spanakis

287

09 Oct 2023

Empower Nested Boolean Logic via Self-Supervised Curriculum LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Min Zhang

236

09 Oct 2023

On the Zero-Shot Generalization of Machine-Generated Text DetectorsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Jingyu Zhang

Tianxing He

169

08 Oct 2023

ZooPFL: Exploring Black-box Foundation Models for Personalized Federated Learning

Xing Xie

238

08 Oct 2023

Counter Turing Test CT^2: AI-Generated Text Detection is Not as Easy as You May Think -- Introducing AI Detectability IndexConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Megha Chakraborty

S.M. Towhidul Islam Tonmoy

...

Vinija Jain

203

08 Oct 2023

Compresso: Structured Pruning with Collaborative Prompting Learns Compact Large Language Models

272

08 Oct 2023

MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot Question AnsweringAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Xiusi Chen

321

08 Oct 2023

The Troubling Emergence of Hallucination in Large Language Models -- An Extensive Definition, Quantification, and Prescriptive RemediationsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

S.M. Towhidul Islam Tonmoy

325

186

08 Oct 2023

A New Dataset for End-to-End Sign Language Translation: The Greek Elementary School Dataset

Andreas Voskou

Konstantinos P. Panousis

254

07 Oct 2023

A Comprehensive Evaluation of Large Language Models on Benchmark Biomedical Text Processing Tasks

Fangshuo Liao

Md Tahmid Rahman Laskar

Cruz Barnum

Jimmy Xiangji Huang

AI4MH LM&MA

367

119

06 Oct 2023

Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision MakingInternational Conference on Learning Representations (ICLR), 2023

311

04 Oct 2023

ResidualTransformer: Residual Low-Rank Learning with Weight-Sharing for Transformer LayersIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Yiming Wang

Jinyu Li

224

03 Oct 2023

ScaleNet: An Unsupervised Representation Learning Method for Limited InformationGerman Conference on Pattern Recognition (GCPR), 2023

Huili Huang

M. M. Roozbahani

SSL

328

786

03 Oct 2023

Selective Feature Adapter for Dense Vision Transformers

XueQing Deng

Qi Fan

Xiaojie Jin

Linjie Yang

Peng Wang

226

03 Oct 2023

Zero-Shot Continuous Prompt Transfer: Generalizing Task Semantics Across Language ModelsInternational Conference on Learning Representations (ICLR), 2023

216

02 Oct 2023

From Bricks to Bridges: Product of Invariances to Enhance Latent Space CommunicationInternational Conference on Learning Representations (ICLR), 2023

Valentino Maiorca

253

02 Oct 2023

Improving Length-Generalization in Transformers via Task Hinting

Pranjal Awasthi

Anupam Gupta

192

01 Oct 2023

RelBERT: Embedding Relations with Language ModelsArtificial Intelligence (AIJ), 2023

Asahi Ushio

Jose Camacho-Collados

Steven Schockaert

KELM

324

30 Sep 2023

KLoB: a Benchmark for Assessing Knowledge Locating Methods in Language Models

Yiming Ju

Zheng Zhang

KELM

168

28 Sep 2023

ELIP: Efficient Discriminative Language-Image Pre-training with Fewer Vision Tokens

Haoyu Zhang

276

28 Sep 2023

Question answering using deep learning in low resource Indian language Marathi

Dhiraj Amin

S. Govilkar

Sagar Kulkarni

105

27 Sep 2023

Identifying and Mitigating Privacy Risks Stemming from Language Models: A Survey

Victoria Smith

Ali Shahin Shamsabadi

Carolyn Ashurst

Adrian Weller

PILM

503

27 Sep 2023

Generative Speech Recognition Error Correction with Large Language Models and Task-Activating PromptingAutomatic Speech Recognition & Understanding (ASRU), 2023

432

27 Sep 2023

Knowledgeable In-Context Tuning: Exploring and Exploiting Factual Knowledge for In-Context Learning

Jiadong Wang

Chengyu Wang

Chuanqi Tan

Jun Huang

Ming Gao

KELM

317

26 Sep 2023

LORD: Low Rank Decomposition Of Monolingual Code LLMs For One-Shot Compression

Ayush Kaushal

Tejas Vaidhya

Irina Rish

363

25 Sep 2023

Investigating Large Language Models and Control Mechanisms to Improve Text Readability of Biomedical AbstractsIEEE International Conference on Healthcare Informatics (ICHI), 2023

Matthew Shardlow

254

22 Sep 2023

TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight InheritanceIEEE International Conference on Computer Vision (ICCV), 2023

...

260

21 Sep 2023

Towards Answering Health-related Questions from Medical Videos: Datasets and ApproachesInternational Conference on Language Resources and Evaluation (LREC), 2023

161

21 Sep 2023

BELT:Bootstrapping Electroencephalography-to-Language Decoding and Zero-Shot Sentiment Classification by Natural Language Supervision

Jinzhao Zhou

Yiqun Duan

Yu-Cheng Chang

Yu-Kai Wang

Chin-Teng Lin

222

21 Sep 2023

DimCL: Dimensional Contrastive Learning For Improving Self-Supervised LearningIEEE Access (IEEE Access), 2023

315

21 Sep 2023

Long-tail Augmented Graph Contrastive Learning for Recommendation

136

20 Sep 2023

Heterogeneous Entity Matching with Complex Attribute Associations using BERT and Neural Networks

Shitao Wang

Jiamin Lu

129

20 Sep 2023

A Family of Pretrained Transformer Language Models for RussianInternational Conference on Language Resources and Evaluation (LREC), 2023

...

Alena Fenogenova

320

19 Sep 2023

Artificial Intelligence-Enabled Intelligent Assistant for Personalized and Adaptive Learning in Higher Education

266

328

19 Sep 2023

A Neighbourhood-Aware Differential Privacy Mechanism for Static Word EmbeddingsInternational Joint Conference on Natural Language Processing (IJCNLP), 2023

Danushka Bollegala

Shuichi Otake

T. Machide

Ken-ichi Kawarabayashi

364

19 Sep 2023

Model Leeching: An Extraction Attack Targeting LLMs

Lewis Birch

William Hackett

Stefan Trawicki

N. Suri

Peter Garraghan

200

19 Sep 2023

Generative modeling, design and analysis of spider silk protein sequences for enhanced mechanical propertiesAdvanced Functional Materials (Adv. Funct. Mater.), 2023

Wei Lu

David L. Kaplan

Markus J. Buehler

173

18 Sep 2023