v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

International Conference on Learning Representations (ICLR), 2019

26 September 2019

ArXiv (abs)PDF HTML Github (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 3,050 papers shown

Multi-turn Dialogue Comprehension from a Topic-aware Perspective

265

18 Sep 2023

Are You Worthy of My Trust?: A Socioethical Perspective on the Impacts of Trustworthy AI Systems on the Environment and Human Society

Jamell Dacon

SILM

217

18 Sep 2023

Pedestrian Trajectory Prediction Using Dynamics-based Deep LearningIEEE International Conference on Robotics and Automation (ICRA), 2023

Honghui Wang

Weiming Zhi

Gustavo Batista

Rohitash Chandra

194

16 Sep 2023

Fake News Detectors are Biased against Texts Generated by Large Language Models

181

15 Sep 2023

How to Handle Different Types of Out-of-Distribution Scenarios in Computational Argumentation? A Comprehensive and Fine-Grained Field StudyAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Andreas Waldis

Yufang Hou

Iryna Gurevych

329

15 Sep 2023

Do Generative Large Language Models need billions of parameters?

Sia Gholami

Marwan Omar

203

12 Sep 2023

Leveraging Large Language Models and Weak Supervision for Social Media data annotation: an evaluation using COVID-19 self-reported vaccination tweetsInteracción (HCI), 2023

Ramya Tekumalla

Juan M. Banda

184

12 Sep 2023

Balanced and Explainable Social Media Analysis for Public Health with Large Language ModelsAustralasian Database Conference (ADC), 2023

174

12 Sep 2023

Incorporating Pre-trained Model Prompting in Multimodal Stock Volume Movement Prediction

178

11 Sep 2023

Black-Box Analysis: GPTs Across Time in Legal Textual Entailment Task

121

11 Sep 2023

CrisisTransformers: Pre-trained language models and sentence encoders for crisis-related social media textsKnowledge-Based Systems (KBS), 2023

Rabindra Lamsal

M. Read

S. Karunasekera

281

11 Sep 2023

Retrieval-Augmented Meta Learning for Low-Resource Text ClassificationIEEE International Joint Conference on Neural Network (IJCNN), 2023

243

10 Sep 2023

Introducing "Forecast Utterance" for Conversational Data Science

214

07 Sep 2023

Knowledge Solver: Teaching LLMs to Search for Domain Knowledge from Knowledge Graphs

226

06 Sep 2023

One Wide Feedforward is All You NeedConference on Machine Translation (WMT), 2023

251

04 Sep 2023

FusionAI: Decentralized Training and Deploying LLMs with Massive Consumer-Level GPUs

...

247

03 Sep 2023

Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language ModelsComputational Linguistics (CL), 2023

...

733

828

03 Sep 2023

Studying the impacts of pre-training using ChatGPT-generated text on downstream tasks

Sarthak Anand

174

02 Sep 2023

RenAIssance: A Survey into AI Text-to-Image Generation in the Era of Large ModelIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

...

Pareesa Ameneh Golnari

Yuxiong He

257

02 Sep 2023

Learning to Taste: A Multimodal Wine DatasetNeural Information Processing Systems (NeurIPS), 2023

515

31 Aug 2023

ViLTA: Enhancing Vision-Language Pre-training through Textual AugmentationIEEE International Conference on Computer Vision (ICCV), 2023

Weihan Wang

Zhiyong Yang

Bin Xu

Juanzi Li

Yankui Sun

VLM

289

31 Aug 2023

Thesis Distillation: Investigating The Impact of Bias in NLP Models on Hate Speech Detection

Fatma Elsafoury

283

31 Aug 2023

ToddlerBERTa: Exploiting BabyBERTa for Grammar Learning and Language Understanding

Omer Veysel Cagatan

163

30 Aug 2023

Introducing Language Guidance in Prompt-based Continual LearningIEEE International Conference on Computer Vision (ICCV), 2023

Muhammad Gul Zain Ali Khan

Muhammad Ferjad Naeem

Luc Van Gool

D. Stricker

F. Tombari

Muhammad Zeshan Afzal

VLM CLL

217

30 Aug 2023

Cyberbullying Detection for Low-resource Languages and Dialects: Review of the State of the ArtInformation Processing & Management (IPM), 2023

169

30 Aug 2023

TransPrompt v2: A Transferable Prompting Framework for Cross-task Text Classification

Jiadong Wang

Chengyu Wang

Cen Chen

178

29 Aug 2023

Video Multimodal Emotion Recognition System for Real World ApplicationsInterspeech (Interspeech), 2023

Sun-Kyung Lee

Jong-Hwan Kim

CVBM

122

28 Aug 2023

LMSanitator: Defending Prompt-Tuning Against Task-Agnostic BackdoorsNetwork and Distributed System Security Symposium (NDSS), 2023

181

26 Aug 2023

FwdLLM: Efficient FedLLM using Forward Gradient

Mengwei Xu

256

26 Aug 2023

WellXplain: Wellness Concept Extraction and Classification in Reddit Posts for Mental Health AnalysisKnowledge-Based Systems (KBS), 2023

Muskan Garg

AI4MH

196

25 Aug 2023

TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational GraphsNeural Information Processing Systems (NeurIPS), 2023

428

25 Aug 2023

Construction Grammar and Language Models

Harish Tayyar Madabushi

Laurence Romain

P. Milin

Dagmar Divjak

411

25 Aug 2023

Use of LLMs for Illicit Purposes: Threats, Prevention Measures, and Vulnerabilities

221

107

24 Aug 2023

A Small and Fast BERT for Chinese Medical Punctuation RestorationInterspeech (Interspeech), 2023

212

24 Aug 2023

Evolution of ESG-focused DLT Research: An NLP Analysis of the LiteratureQuantitative Science Studies (QSS), 2023

Walter Hernandez Cruz

402

23 Aug 2023

GOPro: Generate and Optimize Prompts in CLIP using Self-Supervised LearningBritish Machine Vision Conference (BMVC), 2023

Mainak Singha

Ankit Jha

Biplab Banerjee

VLM

162

22 Aug 2023

GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-trainingIEEE International Conference on Computer Vision (ICCV), 2023

Hang Xu

Jianhua Han

James T. Kwok

Shen Zhao

Wei Zhang

Xiaodan Liang

CLIP VLM

211

22 Aug 2023

Systematic Offensive Stereotyping (SOS) Bias in Language Models

Fatma Elsafoury

101

21 Aug 2023

Large Language Models for Software Engineering: A Systematic Literature ReviewACM Transactions on Software Engineering and Methodology (TOSEM), 2023

Kailong Wang

Haoyu Wang

360

802

21 Aug 2023

Learning Representations on Logs for AIOpsIEEE International Conference on Cloud Computing (CLOUD), 2023

139

18 Aug 2023

BERT4CTR: An Efficient Framework to Combine Pre-trained Language Model with Non-textual Features for CTR PredictionKnowledge Discovery and Data Mining (KDD), 2023

164

17 Aug 2023

Lightweight Adaptation of Neural Language Models via Subspace EmbeddingInternational Conference on Information and Knowledge Management (CIKM), 2023

Amit Kumar Jaiswal

Haiming Liu

154

16 Aug 2023

BIOptimus: Pre-training an Optimal Biomedical Language Model with Curriculum Learning for Named Entity RecognitionWorkshop on Biomedical Natural Language Processing (BioNLP), 2023

Vera Pavlova

M. Makhlouf

235

16 Aug 2023

Finding Stakeholder-Material Information from 10-K Reports using Fine-Tuned BERT and LSTM Models

V. Z. Chen

196

15 Aug 2023

gSASRec: Reducing Overconfidence in Sequential Recommendation Trained with Negative SamplingACM Conference on Recommender Systems (RecSys), 2023

Aleksandr V. Petrov

Craig Macdonald

176

14 Aug 2023

A Novel Ehanced Move Recognition Algorithm Based on Pre-trained Models with Positional Embeddings

H. Wen

Jie Wang

Xiaodong Qiao

171

14 Aug 2023

SLoRA: Federated Parameter Efficient Fine-Tuning of Language Models

Salman Avestimehr

184

112

12 Aug 2023

GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via CipherInternational Conference on Learning Representations (ICLR), 2023

Youliang Yuan

296

400

12 Aug 2023

Identification of the Relevance of Comments in Codes Using Bag of Words and Transformer Based ModelsFire (FIRE), 2023

S. Sruthi

Tanmay Basu

102

11 Aug 2023

LittleMu: Deploying an Online Virtual Teaching Assistant via Heterogeneous Sources Integration and Chain of Teach PromptsInternational Conference on Information and Knowledge Management (CIKM), 2023

Jifan Yu

Lei Hou

Juanzi Li

159

11 Aug 2023