v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

International Conference on Learning Representations (ICLR), 2019

26 September 2019

ArXiv (abs)PDF HTML Github (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 3,049 papers shown

Enhancing Language Models for Financial Relation Extraction with Named Entities and Part-of-Speech

Menglin Li

Kwan Hui Lim

187

02 May 2024

A Named Entity Recognition and Topic Modeling-based Solution for Locating and Better Assessment of Natural Disasters in Social Media

Ayaz Mehmood

Muhammad Tayyab Zamir

Muhammad Asif Ayub

Nasir Ahmad

Kashif Ahmad

109

01 May 2024

EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization

178

30 Apr 2024

Enhancing Pre-Trained Generative Language Models with Question Attended Span Extraction on Machine Reading Comprehension

211

27 Apr 2024

Transfer Learning Enhanced Single-choice Decision for Multi-choice Question Answering

Zhoujun Li

162

27 Apr 2024

CoSD: Collaborative Stance Detection with Contrastive Heterogeneous Topic Graph Learning

230

26 Apr 2024

Exploring Internal Numeracy in Language Models: A Case Study on ALBERT

Ulme Wennberg

G. Henter

MILM

232

25 Apr 2024

Exploring Learngene via Stage-wise Weight Sharing for Initializing Variable-sized Models

214

25 Apr 2024

Learning Long-form Video Prior via Generative Pre-Training

...

233

24 Apr 2024

A Comprehensive Survey on Evaluating Large Language Model Applications in the Medical Industry

413

24 Apr 2024

Mapping Literature Landscapes with Data-Driven Discovery: A Case Study on MOEA/D

Mingyu Huang

Ke Li

306

22 Apr 2024

Embarrassingly Simple Unsupervised Aspect Based Sentiment Tuple Extraction

152

21 Apr 2024

PEACH: Pretrained-embedding Explanation Across Contextual and Hierarchical Structure

Feiqi Cao

S. Han

Hyunsuk Chung

208

21 Apr 2024

Explanation based Bias Decoupling Regularization for Natural Language Inference

Jianxiang Zang

Hui Liu

188

20 Apr 2024

Evaluating Subword Tokenization: Alien Subword Composition and OOV Generalization Challenge

Khuyagbaatar Batsuren

Ekaterina Vylomova

Verna Dankers

Tsetsuukhei Delgerbaatar

Omri Uzan

Yuval Pinter

Gábor Bella

182

20 Apr 2024

Transformer-Based Classification Outcome Prediction for Multimodal Stroke Treatment

208

19 Apr 2024

EnriCo: Enriched Representation and Globally Constrained Inference for Entity and Relation Extraction

219

18 Apr 2024

GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction

324

18 Apr 2024

Enhance Robustness of Language Models Against Variation Attack through Graph Integration

Changlong Sun

Xiaozhong Liu

Wei Lu

210

18 Apr 2024

Dynamic Self-adaptive Multiscale Distillation from Pre-trained Multimodal Large Model for Efficient Cross-modal Representation Learning

277

16 Apr 2024

Referring Flexible Image Restoration

Tianlang Xue

197

16 Apr 2024

On the Effects of Fine-tuning Language Models for Text-Based Reinforcement Learning

125

15 Apr 2024

Navigating the Landscape of Large Language Models: A Comprehensive Review and Analysis of Paradigms and Fine-Tuning Strategies

Benjue Weng

LM&MA

287

13 Apr 2024

VertAttack: Taking advantage of Text Classifiers' horizontal vision

Jonathan Rusert

AAML

250

12 Apr 2024

Emerging Property of Masked Token for Effective Pre-training

Hyesong Choi

Hunsang Lee

Seyoung Joung

Hyejin Park

Jiyeong Kim

Dongbo Min

170

12 Apr 2024

Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training

Hyesong Choi

Hyejin Park

Kwang Moo Yi

Sungmin Cha

Dongbo Min

274

12 Apr 2024

On Unified Prompt Tuning for Request Quality Assurance in Public Code Review

276

11 Apr 2024

CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent LayersAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

248

10 Apr 2024

Dimensionality Reduction in Sentence Transformer Vector Databases with Fast Fourier Transform

Vitaly Bulgakov

Alec Segal

135

09 Apr 2024

AnchorAL: Computationally Efficient Active Learning for Large and Imbalanced Datasets

Pietro Lesci

Andreas Vlachos

346

08 Apr 2024

Chinese Sequence Labeling with Semi-Supervised Boundary-Aware Language Model Pre-training

Min Zhang

299

08 Apr 2024

OPSD: an Offensive Persian Social media Dataset and its baseline evaluations

Amir Hossein Mansouri

Mohammad Bisheh-Niasar

Zahra Pourbahman

101

08 Apr 2024

Shortcut-connected Expert Parallelism for Accelerating Mixture-of-Experts

515

07 Apr 2024

What Happens When Small Is Made Smaller? Exploring the Impact of Compression on Small Data Pretrained Language Models

Busayo Awobade

Mardiyyah Oduwole

Steven Kolawole

201

06 Apr 2024

Order-Based Pre-training Strategies for Procedural Text Understanding

Abhilash Nandy

Yash Kulkarni

Pawan Goyal

Niloy Ganguly

196

06 Apr 2024

A Morphology-Based Investigation of Positional Encodings

Poulami Ghosh

Shikhar Vashishth

Mary Dabre

Pushpak Bhattacharyya

223

06 Apr 2024

Multi-modal Learning for WebAssembly Reverse EngineeringInternational Symposium on Software Testing and Analysis (ISSTA), 2024

Hanxian Huang

Jishen Zhao

231

04 Apr 2024

Robust Pronoun Fidelity with English LLMs: Are they Reasoning, Repeating, or Just Biased?Transactions of the Association for Computational Linguistics (TACL), 2024

325

04 Apr 2024

Revisiting subword tokenization: A case study on affixal negation in large language modelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

Karin Verspoor

210

03 Apr 2024

Linear Attention Sequence Parallelism

385

03 Apr 2024

Digital Forgetting in Large Language Models: A Survey of Unlearning MethodsArtificial Intelligence Review (Artif Intell Rev), 2024

Alberto Blanco-Justicia

N. Jebreel

Benet Manzanares-Salor

336

02 Apr 2024

Deconstructing In-Context Learning: Understanding Prompts via CorruptionInternational Conference on Language Resources and Evaluation (LREC), 2024

353

02 Apr 2024

Semantic Augmentation in Images using Language

Sahiti Yerramilli

Jayant Sravan Tamarapalli

Tanmay Girish Kulkarni

Jonathan M Francis

Eric Nyberg

DiffM VLM

231

02 Apr 2024

Green AI: Exploring Carbon Footprints, Mitigation Strategies, and Trade Offs in Large Language Model Training

Vivian Liu

Yiqiao Yin

308

01 Apr 2024

Efficient Prompting Methods for Large Language Models: A Survey

Jingbo Zhu

408

01 Apr 2024

Efficiently Distilling LLMs for Edge Applications

220

01 Apr 2024

CoUDA: Coherence Evaluation via Unified Data Augmentation

Sujian Li

147

31 Mar 2024

Addressing Both Statistical and Causal Gender Fairness in NLP Models

Hannah Chen

Yangfeng Ji

David Evans

313

30 Mar 2024

A Comprehensive Study on NLP Data Augmentation for Hate Speech Detection: Legacy Methods, BERT, and LLMs

182

30 Mar 2024

Classifying Conspiratorial Narratives At Scale: False Alarms and Erroneous Connections

Ahmad Diab

Rr. Nefriana

Yu-Ru Lin

182

29 Mar 2024