ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.11942
  4. Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations
v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

International Conference on Learning Representations (ICLR), 2019
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSLAIMat
ArXiv (abs)PDFHTMLGithub (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 3,050 papers shown
Multi-turn Dialogue Comprehension from a Topic-aware Perspective
Multi-turn Dialogue Comprehension from a Topic-aware Perspective
Xinbei Ma
Yi Xu
Hai Zhao
Zhuosheng Zhang
265
9
0
18 Sep 2023
Are You Worthy of My Trust?: A Socioethical Perspective on the Impacts
  of Trustworthy AI Systems on the Environment and Human Society
Are You Worthy of My Trust?: A Socioethical Perspective on the Impacts of Trustworthy AI Systems on the Environment and Human Society
Jamell Dacon
SILM
217
2
0
18 Sep 2023
Pedestrian Trajectory Prediction Using Dynamics-based Deep Learning
Pedestrian Trajectory Prediction Using Dynamics-based Deep LearningIEEE International Conference on Robotics and Automation (ICRA), 2023
Honghui Wang
Weiming Zhi
Gustavo Batista
Rohitash Chandra
194
5
0
16 Sep 2023
Fake News Detectors are Biased against Texts Generated by Large Language
  Models
Fake News Detectors are Biased against Texts Generated by Large Language Models
Jinyan Su
Terry Yue Zhuo
Jonibek Mansurov
Di Wang
Preslav Nakov
DeLMO
181
29
0
15 Sep 2023
How to Handle Different Types of Out-of-Distribution Scenarios in
  Computational Argumentation? A Comprehensive and Fine-Grained Field Study
How to Handle Different Types of Out-of-Distribution Scenarios in Computational Argumentation? A Comprehensive and Fine-Grained Field StudyAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Andreas Waldis
Yufang Hou
Iryna Gurevych
329
4
0
15 Sep 2023
Do Generative Large Language Models need billions of parameters?
Do Generative Large Language Models need billions of parameters?
Sia Gholami
Marwan Omar
203
27
0
12 Sep 2023
Leveraging Large Language Models and Weak Supervision for Social Media
  data annotation: an evaluation using COVID-19 self-reported vaccination
  tweets
Leveraging Large Language Models and Weak Supervision for Social Media data annotation: an evaluation using COVID-19 self-reported vaccination tweetsInteracción (HCI), 2023
Ramya Tekumalla
Juan M. Banda
184
16
0
12 Sep 2023
Balanced and Explainable Social Media Analysis for Public Health with
  Large Language Models
Balanced and Explainable Social Media Analysis for Public Health with Large Language ModelsAustralasian Database Conference (ADC), 2023
Yan Jiang
Ruihong Qiu
Yi Zhang
Peng Zhang
174
9
0
12 Sep 2023
Incorporating Pre-trained Model Prompting in Multimodal Stock Volume
  Movement Prediction
Incorporating Pre-trained Model Prompting in Multimodal Stock Volume Movement Prediction
Ruibo Chen
Zhiyuan Zhang
Yi Liu
Ruihan Bao
Keiko Harimoto
Xu Sun
AIFinAI4TS
178
0
0
11 Sep 2023
Black-Box Analysis: GPTs Across Time in Legal Textual Entailment Task
Black-Box Analysis: GPTs Across Time in Legal Textual Entailment Task
Nguyen Ha Thanh
Randy Goebel
Francesca Toni
Kostas Stathis
Ken Satoh
AILawELM
121
6
0
11 Sep 2023
CrisisTransformers: Pre-trained language models and sentence encoders
  for crisis-related social media texts
CrisisTransformers: Pre-trained language models and sentence encoders for crisis-related social media textsKnowledge-Based Systems (KBS), 2023
Rabindra Lamsal
M. Read
S. Karunasekera
281
22
0
11 Sep 2023
Retrieval-Augmented Meta Learning for Low-Resource Text Classification
Retrieval-Augmented Meta Learning for Low-Resource Text ClassificationIEEE International Joint Conference on Neural Network (IJCNN), 2023
Rongsheng Li
Yongqian Li
Hai-Tao Zheng
Chaiyut Luoyiching
Hai-Tao Zheng
Nannan Zhou
Hanjing Su
RALM
243
2
0
10 Sep 2023
Introducing "Forecast Utterance" for Conversational Data Science
Introducing "Forecast Utterance" for Conversational Data Science
Md. Mahadi Hassan
Alex Knipper
S. Karmaker
AI4TS
214
0
0
07 Sep 2023
Knowledge Solver: Teaching LLMs to Search for Domain Knowledge from
  Knowledge Graphs
Knowledge Solver: Teaching LLMs to Search for Domain Knowledge from Knowledge Graphs
Chao Feng
Xinyu Zhang
Zichu Fei
KELM
226
67
0
06 Sep 2023
One Wide Feedforward is All You Need
One Wide Feedforward is All You NeedConference on Machine Translation (WMT), 2023
Telmo Pires
António V. Lopes
Yannick Assogba
Hendra Setiawan
251
18
0
04 Sep 2023
FusionAI: Decentralized Training and Deploying LLMs with Massive
  Consumer-Level GPUs
FusionAI: Decentralized Training and Deploying LLMs with Massive Consumer-Level GPUs
Zhenheng Tang
Yuxin Wang
Xin He
Longteng Zhang
Xinglin Pan
...
Rongfei Zeng
Kaiyong Zhao
Shaoshuai Shi
Bingsheng He
Xiaowen Chu
247
35
0
03 Sep 2023
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language ModelsComputational Linguistics (CL), 2023
Yue Zhang
Yafu Li
Leyang Cui
Deng Cai
Lemao Liu
...
Longyue Wang
Anh Tuan Luu
Freda Shi
Shuming Shi
Shuming Shi
LRMRALMHILM
733
828
0
03 Sep 2023
Studying the impacts of pre-training using ChatGPT-generated text on
  downstream tasks
Studying the impacts of pre-training using ChatGPT-generated text on downstream tasks
Sarthak Anand
174
0
0
02 Sep 2023
RenAIssance: A Survey into AI Text-to-Image Generation in the Era of
  Large Model
RenAIssance: A Survey into AI Text-to-Image Generation in the Era of Large ModelIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Fengxiang Bie
Jianlong Wu
Zhongzhu Zhou
Adam Ghanem
Minjia Zhang
...
Pareesa Ameneh Golnari
David A. Clifton
Yuxiong He
Dacheng Tao
Shuaiwen Leon Song
EGVM
257
58
0
02 Sep 2023
Learning to Taste: A Multimodal Wine Dataset
Learning to Taste: A Multimodal Wine DatasetNeural Information Processing Systems (NeurIPS), 2023
Thoranna Bender
Simon Moe Sorensen
A. Kashani
K. E. Hjorleifsson
Grethe Hyldig
Søren Hauberg
Serge Belongie
Frederik Warburg
CoGe
515
7
0
31 Aug 2023
ViLTA: Enhancing Vision-Language Pre-training through Textual
  Augmentation
ViLTA: Enhancing Vision-Language Pre-training through Textual AugmentationIEEE International Conference on Computer Vision (ICCV), 2023
Weihan Wang
Zhiyong Yang
Bin Xu
Juanzi Li
Yankui Sun
VLM
289
9
0
31 Aug 2023
Thesis Distillation: Investigating The Impact of Bias in NLP Models on
  Hate Speech Detection
Thesis Distillation: Investigating The Impact of Bias in NLP Models on Hate Speech Detection
Fatma Elsafoury
283
5
0
31 Aug 2023
ToddlerBERTa: Exploiting BabyBERTa for Grammar Learning and Language
  Understanding
ToddlerBERTa: Exploiting BabyBERTa for Grammar Learning and Language Understanding
Omer Veysel Cagatan
163
2
0
30 Aug 2023
Introducing Language Guidance in Prompt-based Continual Learning
Introducing Language Guidance in Prompt-based Continual LearningIEEE International Conference on Computer Vision (ICCV), 2023
Muhammad Gul Zain Ali Khan
Muhammad Ferjad Naeem
Luc Van Gool
D. Stricker
F. Tombari
Muhammad Zeshan Afzal
VLMCLL
217
62
0
30 Aug 2023
Cyberbullying Detection for Low-resource Languages and Dialects: Review
  of the State of the Art
Cyberbullying Detection for Low-resource Languages and Dialects: Review of the State of the ArtInformation Processing & Management (IPM), 2023
Tanjim Mahmud
M. Ptaszynski
J. Eronen
Fumito Masui
169
83
0
30 Aug 2023
TransPrompt v2: A Transferable Prompting Framework for Cross-task Text
  Classification
TransPrompt v2: A Transferable Prompting Framework for Cross-task Text Classification
Jiadong Wang
Chengyu Wang
Cen Chen
Ming Gao
Yanjie Liang
Aoying Zhou
VLM
178
0
0
29 Aug 2023
Video Multimodal Emotion Recognition System for Real World Applications
Video Multimodal Emotion Recognition System for Real World ApplicationsInterspeech (Interspeech), 2023
Sun-Kyung Lee
Jong-Hwan Kim
CVBM
122
3
0
28 Aug 2023
LMSanitator: Defending Prompt-Tuning Against Task-Agnostic Backdoors
LMSanitator: Defending Prompt-Tuning Against Task-Agnostic BackdoorsNetwork and Distributed System Security Symposium (NDSS), 2023
Chengkun Wei
Wenlong Meng
Zhikun Zhang
M. Chen
Ming-Hui Zhao
Wenjing Fang
Lei Wang
Zihui Zhang
Wenzhi Chen
AAML
181
14
0
26 Aug 2023
FwdLLM: Efficient FedLLM using Forward Gradient
FwdLLM: Efficient FedLLM using Forward Gradient
Mengwei Xu
Dongqi Cai
Yaozong Wu
Xiang Li
Shangguang Wang
FedML
256
34
0
26 Aug 2023
WellXplain: Wellness Concept Extraction and Classification in Reddit
  Posts for Mental Health Analysis
WellXplain: Wellness Concept Extraction and Classification in Reddit Posts for Mental Health AnalysisKnowledge-Based Systems (KBS), 2023
Muskan Garg
AI4MH
196
17
0
25 Aug 2023
TpuGraphs: A Performance Prediction Dataset on Large Tensor
  Computational Graphs
TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational GraphsNeural Information Processing Systems (NeurIPS), 2023
P. Phothilimthana
Sami Abu-El-Haija
Kaidi Cao
Bahare Fatemi
Mike Burrows
Charith Mendis
Bryan Perozzi
GNNAI4TS
428
29
0
25 Aug 2023
Construction Grammar and Language Models
Construction Grammar and Language Models
Harish Tayyar Madabushi
Laurence Romain
P. Milin
Dagmar Divjak
411
7
0
25 Aug 2023
Use of LLMs for Illicit Purposes: Threats, Prevention Measures, and
  Vulnerabilities
Use of LLMs for Illicit Purposes: Threats, Prevention Measures, and Vulnerabilities
Maximilian Mozes
Xuanli He
Bennett Kleinberg
Lewis D. Griffin
221
107
0
24 Aug 2023
A Small and Fast BERT for Chinese Medical Punctuation Restoration
A Small and Fast BERT for Chinese Medical Punctuation RestorationInterspeech (Interspeech), 2023
Tongtao Ling
Chen Liao
Lei Chen
Shilei Huang
Yi Liu
MedIm
212
2
0
24 Aug 2023
Evolution of ESG-focused DLT Research: An NLP Analysis of the Literature
Evolution of ESG-focused DLT Research: An NLP Analysis of the LiteratureQuantitative Science Studies (QSS), 2023
Walter Hernandez Cruz
K. Tylinski
Alastair Moore
Niall Roche
Nikhil Vadgama
Horst Treiblmaier
J. Shangguan
Paolo Tasca
Jiahua Xu
402
5
0
23 Aug 2023
GOPro: Generate and Optimize Prompts in CLIP using Self-Supervised
  Learning
GOPro: Generate and Optimize Prompts in CLIP using Self-Supervised LearningBritish Machine Vision Conference (BMVC), 2023
Mainak Singha
Ankit Jha
Biplab Banerjee
VLM
162
5
0
22 Aug 2023
GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive
  Language-Image Pre-training
GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-trainingIEEE International Conference on Computer Vision (ICCV), 2023
Xi Deng
Han Shi
Runhu Huang
Changlin Li
Hang Xu
Jianhua Han
James T. Kwok
Shen Zhao
Wei Zhang
Xiaodan Liang
CLIPVLM
211
3
0
22 Aug 2023
Systematic Offensive Stereotyping (SOS) Bias in Language Models
Systematic Offensive Stereotyping (SOS) Bias in Language Models
Fatma Elsafoury
101
2
0
21 Aug 2023
Large Language Models for Software Engineering: A Systematic Literature
  Review
Large Language Models for Software Engineering: A Systematic Literature ReviewACM Transactions on Software Engineering and Methodology (TOSEM), 2023
Xinying Hou
Yanjie Zhao
Yue Liu
Zhou Yang
Kailong Wang
Li Li
Xiapu Luo
David Lo
John C. Grundy
Haoyu Wang
360
802
0
21 Aug 2023
Learning Representations on Logs for AIOps
Learning Representations on Logs for AIOpsIEEE International Conference on Cloud Computing (CLOUD), 2023
Pranjal Gupta
Harshit Kumar
Debanjana Kar
Karan Bhukar
Pooja Aggarwal
P. Mohapatra
139
20
0
18 Aug 2023
BERT4CTR: An Efficient Framework to Combine Pre-trained Language Model
  with Non-textual Features for CTR Prediction
BERT4CTR: An Efficient Framework to Combine Pre-trained Language Model with Non-textual Features for CTR PredictionKnowledge Discovery and Data Mining (KDD), 2023
Dong Wang
Kave Salamatian
Yunqing Xia
Weiwei Deng
Qi Zhang
164
22
0
17 Aug 2023
Lightweight Adaptation of Neural Language Models via Subspace Embedding
Lightweight Adaptation of Neural Language Models via Subspace EmbeddingInternational Conference on Information and Knowledge Management (CIKM), 2023
Amit Kumar Jaiswal
Haiming Liu
154
3
0
16 Aug 2023
BIOptimus: Pre-training an Optimal Biomedical Language Model with
  Curriculum Learning for Named Entity Recognition
BIOptimus: Pre-training an Optimal Biomedical Language Model with Curriculum Learning for Named Entity RecognitionWorkshop on Biomedical Natural Language Processing (BioNLP), 2023
Vera Pavlova
M. Makhlouf
235
3
0
16 Aug 2023
Finding Stakeholder-Material Information from 10-K Reports using
  Fine-Tuned BERT and LSTM Models
Finding Stakeholder-Material Information from 10-K Reports using Fine-Tuned BERT and LSTM Models
V. Z. Chen
196
0
0
15 Aug 2023
gSASRec: Reducing Overconfidence in Sequential Recommendation Trained
  with Negative Sampling
gSASRec: Reducing Overconfidence in Sequential Recommendation Trained with Negative SamplingACM Conference on Recommender Systems (RecSys), 2023
Aleksandr V. Petrov
Craig Macdonald
176
58
0
14 Aug 2023
A Novel Ehanced Move Recognition Algorithm Based on Pre-trained Models
  with Positional Embeddings
A Novel Ehanced Move Recognition Algorithm Based on Pre-trained Models with Positional Embeddings
H. Wen
Jie Wang
Xiaodong Qiao
171
0
0
14 Aug 2023
SLoRA: Federated Parameter Efficient Fine-Tuning of Language Models
SLoRA: Federated Parameter Efficient Fine-Tuning of Language Models
Sara Babakniya
A. Elkordy
Yahya H. Ezzeldin
Qingfeng Liu
Kee-Bong Song
Mostafa El-Khamy
Salman Avestimehr
184
112
0
12 Aug 2023
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via CipherInternational Conference on Learning Representations (ICLR), 2023
Youliang Yuan
Wenxiang Jiao
Wenxuan Wang
Shu Yang
Pinjia He
Shuming Shi
Zhaopeng Tu
SILM
296
400
0
12 Aug 2023
Identification of the Relevance of Comments in Codes Using Bag of Words
  and Transformer Based Models
Identification of the Relevance of Comments in Codes Using Bag of Words and Transformer Based ModelsFire (FIRE), 2023
S. Sruthi
Tanmay Basu
102
1
0
11 Aug 2023
LittleMu: Deploying an Online Virtual Teaching Assistant via
  Heterogeneous Sources Integration and Chain of Teach Prompts
LittleMu: Deploying an Online Virtual Teaching Assistant via Heterogeneous Sources Integration and Chain of Teach PromptsInternational Conference on Information and Knowledge Management (CIKM), 2023
Shangqing Tu
Zheyuan Zhang
Jifan Yu
Chunyang Li
Siyu Zhang
Zijun Yao
Lei Hou
Juanzi Li
159
16
0
11 Aug 2023
Previous
123...161718...596061
Next
Page 17 of 61
Pageof 61