ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

International Conference on Learning Representations (ICLR), 2019

26 September 2019

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 3,048 papers shown

Noise-Free Explanation for Driving Action Prediction

286

08 Jul 2024

AI Safety in Generative AI Large Language Models: A Survey

Lina Yao

364

06 Jul 2024

Beyond Perplexity: Multi-dimensional Safety Evaluation of LLM Compression

416

06 Jul 2024

Not (yet) the whole story: Evaluating Visual Storytelling Requires More than Measuring Coherence, Grounding, and Repetition

Aditya K Surikuchi

Raquel Fernández

Sandro Pezzelle

242

05 Jul 2024

Multi-modal Masked Siamese Network Improves Chest X-Ray Representation Learning

Saeed Shurrab

Alejandro Guerra-Manzanares

Farah E. Shamout

242

05 Jul 2024

ESQA: Event Sequences Question Answering

226

03 Jul 2024

Aspect-Based Sentiment Analysis Techniques: A Comparative Study

Sachintha Rajith Ponnamperuma

G. Sandamali

K. L. Sudheera

198

03 Jul 2024

Increasing Model Capacity for Free: A Simple Strategy for Parameter Efficient Fine-tuning

Haobo Song

Hao Zhao

Soumajit Majumder

Tao Lin

183

01 Jul 2024

Look Ahead or Look Around? A Theoretical Comparison Between Autoregressive and Masked Pretraining

Qi Zhang

Tianqi Du

Haotian Huang

Yifei Wang

Yisen Wang

236

01 Jul 2024

Large Language Model Enhanced Knowledge Representation Learning: A Survey

Xin Wang

Zirui Chen

Haofen Wang

Leong Hou U

Zhao Li

Wenbin Guo

KELM

520

01 Jul 2024

FLY-TTS: Fast, Lightweight and High-Quality End-to-End Text-to-Speech Synthesis

200

30 Jun 2024

LegalTurk Optimized BERT for Multi-Label Text Classification and NER

Farnaz Zeidi

Mehmet Fatih Amasyali

Çiğdem Erol

VLM

142

30 Jun 2024

"I understand why I got this grade": Automatic Short Answer Grading with Feedback

Dishank Aggarwal

Pushpak Bhattacharyya

Bhaskaran Raman

227

30 Jun 2024

BioMNER: A Dataset for Biomedical Method Entity Recognition

Chenghua Lin

182

28 Jun 2024

Protein Representation Learning with Sequence Information Embedding: Does it Always Lead to a Better Performance?

206

28 Jun 2024

When Search Engine Services meet Large Language Models: Visions and Challenges

353

28 Jun 2024

Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads

Ali Khaleghi Rahimian

237

27 Jun 2024

The Odyssey of Commonsense Causality: From Foundational Benchmarks to Cutting-Edge Reasoning

Zhijing Jin

255

27 Jun 2024

Clustering in pure-attention hardmax transformers and its role in sentiment analysis

Albert Alcalde

Giovanni Fantuzzi

Enrique Zuazua

292

26 Jun 2024

Unveiling and Controlling Anomalous Attention Distribution in Transformers

200

26 Jun 2024

A New Benchmark Dataset and Mixture-of-Experts Language Models for Adversarial Natural Language Inference in Vietnamese

Tin Van Huynh

Kiet Van Nguyen

Ngan Luu-Thuy Nguyen

347

25 Jun 2024

Are there identifiable structural parts in the sentence embedding whole?

Vivi Nastase

Paola Merlo

200

24 Jun 2024

Large Vocabulary Size Improves Large Language Models

316

24 Jun 2024

Evaluating the Effectiveness of the Foundational Models for Q&A Classification in Mental Health care

Hassan Alhuzali

Ashwag Alasmari

AI4MH

262

23 Jun 2024

Intrinsic Dimension Correlation: uncovering nonlinear connections in multimodal representations

296

22 Jun 2024

Brain-Like Language Processing via a Shallow Untrained Multihead Attention Network

223

21 Jun 2024

Text Serialization and Their Relationship with the Conventional Paradigms of Tabular Machine Learning

Kyoka Ono

Simon A. Lee

LMTD

215

19 Jun 2024

Fighting Randomness with Randomness: Mitigating Optimisation Instability of Fine-Tuning using Delayed Ensemble and Noisy Interpolation

Jakub Simko

214

18 Jun 2024

QueerBench: Quantifying Discrimination in Language Models Toward Queer Identities

Mae Sosto

Alberto Barrón-Cedeño

210

18 Jun 2024

TroL: Traversal of Layers for Large Language and Vision Models

Yong Man Ro

349

18 Jun 2024

CollabStory: Multi-LLM Collaborative Story Generation and Authorship Analysis

Saranya Venkatraman

Nafis Irtiza Tripto

Dongwon Lee

496

18 Jun 2024

A Systematic Survey of Text Summarization: From Statistical Methods to Large Language Models

Haopeng Zhang

Philip S. Yu

Jiawei Zhang

287

17 Jun 2024

Scaling Synthetic Logical Reasoning Datasets with Context-Sensitive Declarative Grammars

Damien Sileo

LRM ReLM

289

16 Jun 2024

Improving Large Models with Small models: Lower Costs and Better Performance

Yueting Zhuang

211

15 Jun 2024

Adversarial Evasion Attack Efficiency against Large Language Models

187

12 Jun 2024

Defining and Detecting Vulnerability in Human Evaluation Guidelines: A Preliminary Study Towards Reliable NLG Evaluation

Jie Ruan

Wenqing Wang

Xiaojun Wan

AAML ELM

227

12 Jun 2024

HOLMES: Hyper-Relational Knowledge Graphs for Multi-hop Question Answering using LLMs

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

Pranoy Panda

Ankush Agarwal

Chaitanya Devaguptapu

Manohar Kaul

Prathosh A P

RALM

235

10 Jun 2024

Emotion-Aware Speech Self-Supervised Representation Learning with Intensity Knowledge

Interspeech (Interspeech), 2024

Rui Liu

Zening Ma

SSL

297

10 Jun 2024

Gentle-CLIP: Exploring Aligned Semantic In Low-Quality Multimodal Data With Soft Alignment

Stan Z. Li

254

09 Jun 2024

Peer Review as A Multi-Turn and Long-Context Dialogue with Role-Based Interactions

Cheng Tan

Stan Z. Li

211

09 Jun 2024

Automata Extraction from Transformers

406

08 Jun 2024

Integrating Text and Image Pre-training for Multi-modal Algorithmic Reasoning

Zijian Zhang

Wei Liu

261

08 Jun 2024

BERTs are Generative In-Context Learners

Neural Information Processing Systems (NeurIPS), 2024

David Samuel

231

07 Jun 2024

DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs

Neural Information Processing Systems (NeurIPS), 2024

Yu-Gang Jiang

268

06 Jun 2024

Pre-trained Transformer Uncovers Meaningful Patterns in Human Mobility Data

Alameen Najjar

225

06 Jun 2024

A Survey on Medical Large Language Models: Technology, Application, Trustworthiness, and Future Directions

...

Zhan Qin

264

06 Jun 2024

RadBARTsum: Domain Specific Adaption of Denoising Sequence-to-Sequence Models for Abstractive Radiology Report Summarization

Jinge Wu

Abul Hasan

Honghan Wu

125

05 Jun 2024

Language Model Can Do Knowledge Tracing: Simple but Effective Method to Integrate Language Model and Knowledge Tracing Task

281

05 Jun 2024

Using Self-supervised Learning Can Improve Model Fairness

Sofia Yfantidou

Dimitris Spathis

Marios Constantinides

Athena Vakali

Daniele Quercia

F. Kawsar

336

04 Jun 2024

Robust Interaction-based Relevance Modeling for Online E-Commerce and LLM-based Retrieval

142

04 Jun 2024