v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

International Conference on Learning Representations (ICLR), 2019

26 September 2019

ArXiv (abs)PDF HTML Github (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 3,050 papers shown

Unsupervised Approach to Evaluate Sentence-Level Fluency: Do We Really Need Reference?

172

03 Dec 2023

Learning to Compose SuperWeights for Neural Parameter Allocation SearchIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

287

03 Dec 2023

Adaptive Resource Allocation for Semantic Communication NetworksIEEE Transactions on Communications (IEEE Trans. Commun.), 2023

353

02 Dec 2023

The Cost of Compression: Investigating the Impact of Compression on Parametric Knowledge in Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Srinath Namburi

Makesh Narsimhan Sreedhar

Srinath Srinivasan

Frederic Sala

224

01 Dec 2023

The Efficiency Spectrum of Large Language Models: An Algorithmic Survey

Tianyi Chen

410

01 Dec 2023

Spatial-Temporal-Decoupled Masked Pre-training for Spatiotemporal ForecastingInternational Joint Conference on Artificial Intelligence (IJCAI), 2023

429

01 Dec 2023

SEPSIS: I Can Catch Your Lies -- A New Paradigm for Deception Detection

224

01 Dec 2023

Mavericks at BLP-2023 Task 1: Ensemble-based Approach Using Language Models for Violence Inciting Text Detection

Saurabh Page

Sudeep Mangalvedhekar

Kshitij Deshpande

Tanmay Chavan

S. Sonawane

123

30 Nov 2023

DisCGen: A Framework for Discourse-Informed Counterspeech GenerationInternational Joint Conference on Natural Language Processing (IJCNLP), 2023

Sabit Hassan

Malihe Alikhani

251

29 Nov 2023

TARGET: Template-Transferable Backdoor Attack Against Prompt-based NLP Models via GPT4Natural Language Processing and Chinese Computing (NLPCC), 2023

253

29 Nov 2023

LayerCollapse: Adaptive compression of neural networks

Soheil Zibakhsh Shabgahi

Mohammad Soheil Shariff

F. Koushanfar

AI4CE

225

29 Nov 2023

RACE-IT: A Reconfigurable Analog Computing Engine for In-Memory Transformer Acceleration

312

29 Nov 2023

A Survey on Prompting Techniques in LLMs

Prabin Bhandari

192

28 Nov 2023

Entity-Aspect-Opinion-Sentiment Quadruple Extraction for Fine-grained Sentiment Analysis

Jun Xu

162

28 Nov 2023

Recognizing Conditional Causal Relationships about Emotions and Their Corresponding Conditions

Xinhong Chen

Zongxi Li

Yaowei Wang

Haoran Xie

Jianping Wang

Qing Li

121

28 Nov 2023

Leveraging deep active learning to identify low-resource mobility functioning information in public clinical notes

159

27 Nov 2023

C-SAW: Self-Supervised Prompt Learning for Image Generalization in Remote SensingIndian Conference on Computer Vision, Graphics & Image Processing (ICVGIP), 2023

Avigyan Bhattacharya

Mainak Singha

Ankit Jha

Biplab Banerjee

SSL VLM

196

27 Nov 2023

A Comparative and Experimental Study on Automatic Question Answering Systems and its Robustness against Word Jumbling

Shashidhar Reddy Javaji

Haoran Hu

Sai Sameer Vennam

Vijaya Gajanan Buddhavarapu

110

27 Nov 2023

Probabilistic Transformer: A Probabilistic Dependency Model for Contextual Word RepresentationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Haoyi Wu

Kewei Tu

839

26 Nov 2023

General Phrase Debiaser: Debiasing Masked Language Models at a Multi-Token LevelIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

317

23 Nov 2023

A Multi-solution Study on GDPR AI-enabled Completeness Checking of DPAsEmpirical Software Engineering (EMSE), 2023

Muhammad Ilyas Azeem

Sallam Abualhaija

192

23 Nov 2023

Transformer-based Named Entity Recognition in Construction Supply Chain Risk Management in AustraliaIEEE Access (IEEE Access), 2023

Milad Baghalzadeh Shishehgarkhaneh

267

23 Nov 2023

Efficient Transformer Knowledge Distillation: A Performance ReviewConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

146

22 Nov 2023

Looped Transformers are Better at Learning Learning AlgorithmsInternational Conference on Learning Representations (ICLR), 2023

Liu Yang

Kangwook Lee

Robert D. Nowak

Dimitris Papailiopoulos

460

21 Nov 2023

Advancing Transformer Architecture in Long-Context Large Language Models: A Comprehensive Survey

...

383

102

21 Nov 2023

Long-MIL: Scaling Long Contextual Multiple Instance Learning for Histopathology Whole Slide Image Analysis

286

21 Nov 2023

Tensor-Aware Energy Accounting

Timur Babakol

Yu David Liu

154

19 Nov 2023

Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections

Lihan Zha

Yuchen Cui

Li-Heng Lin

Minae Kwon

Montse Gonzalez Arenas

Andy Zeng

Fei Xia

Dorsa Sadigh

337

17 Nov 2023

Generative AI for Hate Speech Detection: Evaluation and Findings

173

16 Nov 2023

Long-form Question Answering: An Iterative Planning-Retrieval-Generation Approach

Cheng Wang

Kashob Kumar Roy

Yatin Nandwani

Kevin Chen-Chuan Chang

207

15 Nov 2023

Temporal Knowledge Question Answering via Abstract Reasoning InductionAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Ziyang Chen

Dongfang Li

Xiang Zhao

Baotian Hu

Min Zhang

LRM

302

15 Nov 2023

OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient Large-scale Multilingual Continued Pretraining

233

15 Nov 2023

It Takes Two to Negotiate: Modeling Social Exchange in Online Multiplayer Games

Kokil Jaidka

Hansin Ahuja

Lynnette Ng

297

15 Nov 2023

GLiNER: Generalist Model for Named Entity Recognition using Bidirectional TransformerNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023

160

14 Nov 2023

AI-generated text boundary detection with RoFT

296

14 Nov 2023

ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language ModelsInternational Conference on Learning Representations (ICLR), 2023

...

276

13 Nov 2023

Training A Multi-stage Deep Classifier with Feedback Signals

141

12 Nov 2023

Tunable Soft Prompts are Messengers in Federated LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

196

12 Nov 2023

Early-Exit Neural Networks with Nested Prediction SetsConference on Uncertainty in Artificial Intelligence (UAI), 2023

195

10 Nov 2023

The Shape of Learning: Anisotropy and Intrinsic Dimensions in Transformer-Based ModelsFindings (Findings), 2023

323

10 Nov 2023

Hallucination-minimized Data-to-answer Framework for Financial Decision-makersBigData Congress [Services Society] (BSS), 2023

...

Pablo Martinez Serrano

Punit Agrawal

Arijit Mukherjee

173

09 Nov 2023

A Survey of Large Language Models in Medicine: Progress, Application, and Challenge

...

736

191

09 Nov 2023

Legal-HNet: Mixing Legal Long-Context Tokens with Hartley Transform

Daniele Giofré

Sneha Ghantasala

AILaw

150

09 Nov 2023

DACBERT: Leveraging Dependency Agreement for Cost-Efficient Bert Pretraining

Martin Kuo

Jianyi Zhang

Yiran Chen

08 Nov 2023

Pragmatic Reasoning Unlocks Quantifier Semantics for Foundation Models

190

08 Nov 2023

DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding

160

07 Nov 2023

mahaNLP: A Marathi Natural Language Processing LibraryInternational Joint Conference on Natural Language Processing (IJCNLP), 2023

248

05 Nov 2023

Sentiment Analysis through LLM Negotiations

Jiwei Li

200

03 Nov 2023

TCM-GPT: Efficient Pre-training of Large Language Models for Domain Adaptation in Traditional Chinese MedicineComputer Methods and Programs in Biomedicine Update (CMPB), 2023

Xiaohong Liu

03 Nov 2023

A New Korean Text Classification Benchmark for Recognizing the Political Intents in Online Newspapers

Beomjune Kim

Eunsun Lee

Dongbin Na

157

03 Nov 2023