v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

International Conference on Learning Representations (ICLR), 2019

26 September 2019

ArXiv (abs)PDF HTML Github (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 3,049 papers shown

Towards Effective Time-Aware Language Representation: Exploring Enhanced Temporal Understanding in Language Models

Jiexin Wang

Adam Jatowt

Yi Cai

AI4CE

274

04 Jun 2024

It's a Feature, Not a Bug: Measuring Creative Fluidity in Image Generators

Aditi Ramaswamy

Melane Navaratnarajah

Hana Chockler

EGVM

134

03 Jun 2024

Reward-based Input Construction for Cross-document Relation Extraction

153

31 May 2024

GAMedX: Generative AI-based Medical Entity Data Extractor Using Large Language Models

Mohammed-Khalil Ghali

225

31 May 2024

Entangled Relations: Leveraging NLI and Meta-analysis to Enhance Biomedical Relation Extraction

William Hogan

Jingbo Shang

245

31 May 2024

Unlocking the Potential of Large Language Models for Clinical Text Anonymization: A Comparative Study

264

29 May 2024

On the Role of Attention Masks and LayerNorm in Transformers

258

29 May 2024

Transformers Can Do Arithmetic with the Right Embeddings

...

199

27 May 2024

BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation

Ziniu Li

240

27 May 2024

Recent advances in text embedding: A Comprehensive Review of Top-Performing Methods on the MTEB Benchmark

Hongliu Cao

AI4TS

331

27 May 2024

SoK: Leveraging Transformers for Malware Analysis

443

27 May 2024

Tokenization Matters! Degrading Large Language Models through Challenging Their Tokenization

492

27 May 2024

Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration

Liang Pang

Jun Xu

345

26 May 2024

Accelerating Transformers with Spectrum-Preserving Token Merging

Duy M. Nguyen

Ngan Le

276

25 May 2024

MoEUT: Mixture-of-Experts Universal Transformers

Christopher D. Manning

MoE

258

25 May 2024

GPT is Not an Annotator: The Necessity of Human Annotation in Fairness Benchmark Construction

Virginia K. Felkner

Jennifer A. Thompson

Jonathan May

220

24 May 2024

ETTrack: Enhanced Temporal Motion Predictor for Multi-Object Tracking

254

24 May 2024

Optimizing Large Language Models for OpenAPI Code Completion

Bohdan Petryshyn

M. Lukoševičius

LLMAG ALM

198

24 May 2024

Thinking Forward: Memory-Efficient Federated Finetuning of Language Models

225

24 May 2024

CEEBERT: Cross-Domain Inference in Early Exit BERT

Divya J. Bajpai

M. Hanawal

LRM

193

23 May 2024

Super Tiny Language Models

Cheston Tan

Bobby Cheng

294

23 May 2024

A Survey on Vision-Language-Action Models for Embodied AI

910

169

23 May 2024

WeatherFormer: A Pretrained Encoder Model for Learning Robust Weather Representations from Small Datasets

173

22 May 2024

Investigating Persuasion Techniques in Arabic: An Empirical Study Leveraging Large Language Models

Abdurahmman Alzahrani

175

21 May 2024

Talk2Radar: Bridging Natural Language with 4D mmWave Radar for 3D Referring Expression Comprehension

...

654

21 May 2024

CReMa: Crisis Response through Computational Identification and Matching of Cross-Lingual Requests and Offers Shared on Social Media

133

20 May 2024

Case-Based Reasoning Approach for Solving Financial Question Answering

Yikyung Kim

Jay-Yoon Lee

AIMat

151

18 May 2024

The Future of Large Language Model Pre-training is Federated

...

450

17 May 2024

Large Language Model (LLM) for Telecommunications: A Comprehensive Survey on Principles, Key Techniques, and OpportunitiesIEEE Communications Surveys and Tutorials (COMST), 2024

Yufei Cui

...

Xue Liu

319

188

17 May 2024

A survey on fairness of large language models in e-commerce: progress, application, and challenge

306

15 May 2024

A Survey of Generative Techniques for Spatial-Temporal Data Mining

...

220

15 May 2024

From Transformers to LLMs: A Systematic Survey of Efficiency Considerations in NLP

439

15 May 2024

A Decoupling and Aggregating Framework for Joint Extraction of Entities and RelationsIEEE Access (IEEE Access), 2024

260

14 May 2024

Impact of Stickers on Multimodal Sentiment and Intent in Social Media: A New Task, Dataset and Baseline

Yuanchen Shi

Biao Ma

Fang Kong

241

14 May 2024

ViWikiFC: Fact-Checking for Vietnamese Wikipedia-Based Textual Knowledge Source

312

13 May 2024

DEPTH: Discourse Education through Pre-Training Hierarchically

322

13 May 2024

Branching Narratives: Character Decision Points Detection

Alexey Tikhonov

160

12 May 2024

ExplainableDetector: Exploring Transformer-based Language Modeling Approach for SMS Spam Detection with Explainability Analysis

Mohammad Amaz Uddin

Muhammad Nazrul Islam

Leandros A. Maglaras

Helge Janicke

Iqbal H. Sarker

173

12 May 2024

SaudiBERT: A Large Language Model Pretrained on Saudi Dialect Corpora

Faisal Qarah

210

10 May 2024

Similarity Guided Multimodal Fusion Transformer for Semantic Location
Prediction in Social Media

199

09 May 2024

Multi-level Shared Knowledge Guided Learning for Knowledge Graph Completion

277

08 May 2024

A Review on Discriminative Self-supervised Learning Methods in Computer Vision

Nikolaos Giakoumoglou

Tania Stathaki

Athanasios Gkelias

SSL

438

08 May 2024

Switchable Decision: Dynamic Neural Generation Networks

213

07 May 2024

Revisiting character-level adversarial attacks

244

07 May 2024

LingML: Linguistic-Informed Machine Learning for Enhanced Fake News Detection

Jasraj Singh

Fang Liu

130

07 May 2024

Exploring prompts to elicit memorization in masked language model-based named entity recognitionPLoS ONE (PLoS ONE), 2024

Yuxi Xia

Anastasiia Sedova

Pedro Henrique Luz de Araujo

Vasiliki Kougia

Lisa Nussbaumer

Benjamin Roth

287

05 May 2024

Enabling Patient-side Disease Prediction via the Integration of Patient NarrativesThe Web Conference (WWW), 2024

112

05 May 2024

Hoaxpedia: A Unified Wikipedia Hoax Articles Dataset

Hsuvas Borkakoty

Luis Espinosa-Anke

275

03 May 2024

Exploiting ChatGPT for Diagnosing Autism-Associated Language Disorders and Identifying Distinct FeaturesResearch Square (RS), 2024

Chuanbo Hu

Wenqi Li

Mindi Ruan

Xiangxu Yu

Lynn K. Paul

Shuo Wang

Xin Li

129

03 May 2024

Large Language Models for UAVs: Current State and Pathways to the FutureIEEE Open Journal of Vehicular Technology (OJVT), 2024

Shumaila Javaid

Nasir Saeed

Bin He

281

02 May 2024