ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.11942
  4. Cited By
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations
v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

International Conference on Learning Representations (ICLR), 2019
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
    SSLAIMat
ArXiv (abs)PDFHTMLGithub (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 3,049 papers shown
Towards Effective Time-Aware Language Representation: Exploring Enhanced Temporal Understanding in Language Models
Towards Effective Time-Aware Language Representation: Exploring Enhanced Temporal Understanding in Language Models
Jiexin Wang
Adam Jatowt
Yi Cai
AI4CE
274
5
0
04 Jun 2024
It's a Feature, Not a Bug: Measuring Creative Fluidity in Image
  Generators
It's a Feature, Not a Bug: Measuring Creative Fluidity in Image Generators
Aditi Ramaswamy
Melane Navaratnarajah
Hana Chockler
EGVM
134
1
0
03 Jun 2024
Reward-based Input Construction for Cross-document Relation Extraction
Reward-based Input Construction for Cross-document Relation Extraction
Byeonghu Na
Suhyeon Jo
Yeongmin Kim
Il-Chul Moon
153
2
0
31 May 2024
GAMedX: Generative AI-based Medical Entity Data Extractor Using Large
  Language Models
GAMedX: Generative AI-based Medical Entity Data Extractor Using Large Language Models
Mohammed-Khalil Ghali
Abdelrahman Farrag
Hajar Sakai
Hicham El Baz
Yu Jin
Sarah Lam
LM&MAMedIm
225
14
0
31 May 2024
Entangled Relations: Leveraging NLI and Meta-analysis to Enhance Biomedical Relation Extraction
Entangled Relations: Leveraging NLI and Meta-analysis to Enhance Biomedical Relation Extraction
William Hogan
Jingbo Shang
245
0
0
31 May 2024
Unlocking the Potential of Large Language Models for Clinical Text
  Anonymization: A Comparative Study
Unlocking the Potential of Large Language Models for Clinical Text Anonymization: A Comparative Study
David Pissarra
Isabel Curioso
João Alveira
Duarte Pereira
Bruno Ribeiro
Tomas Souper
Vasco Gomes
A. Carreiro
Vitor Rolla
264
11
0
29 May 2024
On the Role of Attention Masks and LayerNorm in Transformers
On the Role of Attention Masks and LayerNorm in Transformers
Xinyi Wu
A. Ajorlou
Yifei Wang
Stefanie Jegelka
Ali Jadbabaie
258
29
0
29 May 2024
Transformers Can Do Arithmetic with the Right Embeddings
Transformers Can Do Arithmetic with the Right Embeddings
Sean McLeish
Arpit Bansal
Alex Stein
Neel Jain
John Kirchenbauer
...
B. Kailkhura
A. Bhatele
Jonas Geiping
Avi Schwarzschild
Tom Goldstein
199
67
0
27 May 2024
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for
  Controllable Language Generation
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation
Chengxing Jia
Pengyuan Wang
Ziniu Li
Yi-Chen Li
Zhilong Zhang
Nan Tang
Yang Yu
OffRL
240
2
0
27 May 2024
Recent advances in text embedding: A Comprehensive Review of
  Top-Performing Methods on the MTEB Benchmark
Recent advances in text embedding: A Comprehensive Review of Top-Performing Methods on the MTEB Benchmark
Hongliu Cao
AI4TS
331
31
0
27 May 2024
SoK: Leveraging Transformers for Malware Analysis
SoK: Leveraging Transformers for Malware Analysis
Pradip Kunwar
Kshitiz Aryal
Maanak Gupta
Mahmoud Abdelsalam
Elisa Bertino
443
4
0
27 May 2024
Tokenization Matters! Degrading Large Language Models through Challenging Their Tokenization
Tokenization Matters! Degrading Large Language Models through Challenging Their Tokenization
Dixuan Wang
Yanda Li
Junyuan Jiang
Zepeng Ding
Ziqin Luo
Guochao Jiang
Jiaqing Liang
Deqing Yang
492
34
0
27 May 2024
Cocktail: A Comprehensive Information Retrieval Benchmark with
  LLM-Generated Documents Integration
Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration
Sunhao Dai
Weihao Liu
Yuqi Zhou
Liang Pang
Rongju Ruan
Gang Wang
Zhenhua Dong
Jun Xu
Jirong Wen
345
22
0
26 May 2024
Accelerating Transformers with Spectrum-Preserving Token Merging
Accelerating Transformers with Spectrum-Preserving Token Merging
Hoai-Chau Tran
D. M. Nguyen
Duy M. Nguyen
Trung Thanh Nguyen
Ngan Le
Pengtao Xie
Daniel Sonntag
James Y. Zou
Binh T. Nguyen
Mathias Niepert
276
24
0
25 May 2024
MoEUT: Mixture-of-Experts Universal Transformers
MoEUT: Mixture-of-Experts Universal Transformers
Róbert Csordás
Kazuki Irie
Jürgen Schmidhuber
Christopher Potts
Christopher D. Manning
MoE
258
28
0
25 May 2024
GPT is Not an Annotator: The Necessity of Human Annotation in Fairness
  Benchmark Construction
GPT is Not an Annotator: The Necessity of Human Annotation in Fairness Benchmark Construction
Virginia K. Felkner
Jennifer A. Thompson
Jonathan May
220
13
0
24 May 2024
ETTrack: Enhanced Temporal Motion Predictor for Multi-Object Tracking
ETTrack: Enhanced Temporal Motion Predictor for Multi-Object Tracking
Xudong Han
Nobuyuki Oishi
Yueying Tian
Elif Ucurum
R. Young
C. Chatwin
Philip Birch
254
15
0
24 May 2024
Optimizing Large Language Models for OpenAPI Code Completion
Optimizing Large Language Models for OpenAPI Code Completion
Bohdan Petryshyn
M. Lukoševičius
LLMAGALM
198
0
0
24 May 2024
Thinking Forward: Memory-Efficient Federated Finetuning of Language
  Models
Thinking Forward: Memory-Efficient Federated Finetuning of Language Models
Kunjal Panchal
Nisarg Parikh
Sunav Choudhary
Lijun Zhang
Yuriy Brun
Hui Guan
225
7
0
24 May 2024
CEEBERT: Cross-Domain Inference in Early Exit BERT
CEEBERT: Cross-Domain Inference in Early Exit BERT
Divya J. Bajpai
M. Hanawal
LRM
193
12
0
23 May 2024
Super Tiny Language Models
Super Tiny Language Models
Dylan Hillier
Leon Guertler
Cheston Tan
Palaash Agrawal
Ruirui Chen
Bobby Cheng
294
10
0
23 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
910
169
0
23 May 2024
WeatherFormer: A Pretrained Encoder Model for Learning Robust Weather
  Representations from Small Datasets
WeatherFormer: A Pretrained Encoder Model for Learning Robust Weather Representations from Small Datasets
Adib Hasan
Mardavij Roozbehani
M. Dahleh
AI4TS
173
1
0
22 May 2024
Investigating Persuasion Techniques in Arabic: An Empirical Study
  Leveraging Large Language Models
Investigating Persuasion Techniques in Arabic: An Empirical Study Leveraging Large Language Models
Abdurahmman Alzahrani
Eyad Babkier
Faisal Yanbaawi
Firas Yanbaawi
Hassan Alhuzali
175
0
0
21 May 2024
Talk2Radar: Bridging Natural Language with 4D mmWave Radar for 3D Referring Expression Comprehension
Talk2Radar: Bridging Natural Language with 4D mmWave Radar for 3D Referring Expression Comprehension
Runwei Guan
Ruixiao Zhang
Ningwei Ouyang
Tao Huang
Ka Lok Man
...
Ming Xu
Jeremy S. Smith
Eng Gee Lim
Yutao Yue
Hui Xiong
654
19
0
21 May 2024
CReMa: Crisis Response through Computational Identification and Matching
  of Cross-Lingual Requests and Offers Shared on Social Media
CReMa: Crisis Response through Computational Identification and Matching of Cross-Lingual Requests and Offers Shared on Social Media
Rabindra Lamsal
M. Read
S. Karunasekera
Muhammad Imran
133
4
0
20 May 2024
Case-Based Reasoning Approach for Solving Financial Question Answering
Case-Based Reasoning Approach for Solving Financial Question Answering
Yikyung Kim
Jay-Yoon Lee
AIMat
151
2
0
18 May 2024
The Future of Large Language Model Pre-training is Federated
The Future of Large Language Model Pre-training is Federated
Lorenzo Sani
Alexandru Iacob
Zeyu Cao
Bill Marino
Yan Gao
...
Wanru Zhao
William F. Shen
Preslav Aleksandrov
Xinchi Qiu
Nicholas D. Lane
AI4CE
450
40
0
17 May 2024
Large Language Model (LLM) for Telecommunications: A Comprehensive
  Survey on Principles, Key Techniques, and Opportunities
Large Language Model (LLM) for Telecommunications: A Comprehensive Survey on Principles, Key Techniques, and OpportunitiesIEEE Communications Surveys and Tutorials (COMST), 2024
Hao Zhou
Chengming Hu
Ye Yuan
Yufei Cui
Yili Jin
...
Di Wu
Xue Liu
Charlie Zhang
Xianbin Wang
Jiangchuan Liu
319
188
0
17 May 2024
A survey on fairness of large language models in e-commerce: progress,
  application, and challenge
A survey on fairness of large language models in e-commerce: progress, application, and challenge
Qingyang Ren
Zilin Jiang
Jinghan Cao
Sijia Li
Chiqu Li
Yiyang Liu
Shuning Huo
Tiange He
Yuan Chen
AILawFaML
306
17
0
15 May 2024
A Survey of Generative Techniques for Spatial-Temporal Data Mining
A Survey of Generative Techniques for Spatial-Temporal Data Mining
Qianru Zhang
Haixin Wang
Cheng Long
Liangcai Su
Xingwei He
...
Tailin Wu
Hongzhi Yin
Siu-Ming Yiu
Qi Tian
Christian S. Jensen
AI4TS
220
15
0
15 May 2024
From Transformers to LLMs: A Systematic Survey of Efficiency Considerations in NLP
From Transformers to LLMs: A Systematic Survey of Efficiency Considerations in NLP
Wazib Ansar
Saptarsi Goswami
Amlan Chakrabarti
MedIm
439
13
0
15 May 2024
A Decoupling and Aggregating Framework for Joint Extraction of Entities
  and Relations
A Decoupling and Aggregating Framework for Joint Extraction of Entities and RelationsIEEE Access (IEEE Access), 2024
Yao Wang
Xin Liu
Weikun Kong
Hai-tao Yu
Teeradaj Racharak
Kyoung-Sook Kim
Le-Minh Nguyen
260
1
0
14 May 2024
Impact of Stickers on Multimodal Sentiment and Intent in Social Media: A New Task, Dataset and Baseline
Impact of Stickers on Multimodal Sentiment and Intent in Social Media: A New Task, Dataset and Baseline
Yuanchen Shi
Biao Ma
Fang Kong
Fang Kong
241
0
0
14 May 2024
ViWikiFC: Fact-Checking for Vietnamese Wikipedia-Based Textual Knowledge
  Source
ViWikiFC: Fact-Checking for Vietnamese Wikipedia-Based Textual Knowledge Source
Hung Tuan Le
Long Truong To
Manh Trong Nguyen
Kiet Van Nguyen
312
5
0
13 May 2024
DEPTH: Discourse Education through Pre-Training Hierarchically
DEPTH: Discourse Education through Pre-Training Hierarchically
Zachary Bamberger
Ofek Glick
Chaim Baskin
Yonatan Belinkov
322
0
0
13 May 2024
Branching Narratives: Character Decision Points Detection
Branching Narratives: Character Decision Points Detection
Alexey Tikhonov
160
2
0
12 May 2024
ExplainableDetector: Exploring Transformer-based Language Modeling
  Approach for SMS Spam Detection with Explainability Analysis
ExplainableDetector: Exploring Transformer-based Language Modeling Approach for SMS Spam Detection with Explainability Analysis
Mohammad Amaz Uddin
Muhammad Nazrul Islam
Leandros A. Maglaras
Helge Janicke
Iqbal H. Sarker
173
13
0
12 May 2024
SaudiBERT: A Large Language Model Pretrained on Saudi Dialect Corpora
SaudiBERT: A Large Language Model Pretrained on Saudi Dialect Corpora
Faisal Qarah
210
12
0
10 May 2024
Similarity Guided Multimodal Fusion Transformer for Semantic Location
  Prediction in Social Media
Similarity Guided Multimodal Fusion Transformer for Semantic Location Prediction in Social Media
Zhizhen Zhang
Ning Wang
Haojie Li
Zhihui Wang
199
1
0
09 May 2024
Multi-level Shared Knowledge Guided Learning for Knowledge Graph
  Completion
Multi-level Shared Knowledge Guided Learning for Knowledge Graph Completion
Yongxue Shan
Jie Zhou
Jie Peng
Xin Zhou
Jiaqian Yin
Xiaodong Wang
277
4
0
08 May 2024
A Review on Discriminative Self-supervised Learning Methods in Computer Vision
A Review on Discriminative Self-supervised Learning Methods in Computer Vision
Nikolaos Giakoumoglou
Tania Stathaki
Athanasios Gkelias
SSL
438
1
0
08 May 2024
Switchable Decision: Dynamic Neural Generation Networks
Switchable Decision: Dynamic Neural Generation Networks
Shujian Zhang
Korawat Tanwisuth
Chengyue Gong
Pengcheng He
Mi Zhou
BDL
213
0
0
07 May 2024
Revisiting character-level adversarial attacks
Revisiting character-level adversarial attacks
Elias Abad Rocamora
Yongtao Wu
Fanghui Liu
Grigorios G. Chrysos
Volkan Cevher
AAML
244
6
0
07 May 2024
LingML: Linguistic-Informed Machine Learning for Enhanced Fake News
  Detection
LingML: Linguistic-Informed Machine Learning for Enhanced Fake News Detection
Jasraj Singh
Fang Liu
Hong Xu
Bee Chin Ng
Wei Zhang
AI4CE
130
2
0
07 May 2024
Exploring prompts to elicit memorization in masked language model-based
  named entity recognition
Exploring prompts to elicit memorization in masked language model-based named entity recognitionPLoS ONE (PLoS ONE), 2024
Yuxi Xia
Anastasiia Sedova
Pedro Henrique Luz de Araujo
Vasiliki Kougia
Lisa Nussbaumer
Benjamin Roth
287
1
0
05 May 2024
Enabling Patient-side Disease Prediction via the Integration of Patient
  Narratives
Enabling Patient-side Disease Prediction via the Integration of Patient NarrativesThe Web Conference (WWW), 2024
Zhixiang Su
Yinan Zhang
Jiazheng Jing
Jie Xiao
Zhiqi Shen
112
2
0
05 May 2024
Hoaxpedia: A Unified Wikipedia Hoax Articles Dataset
Hoaxpedia: A Unified Wikipedia Hoax Articles Dataset
Hsuvas Borkakoty
Luis Espinosa-Anke
275
2
0
03 May 2024
Exploiting ChatGPT for Diagnosing Autism-Associated Language Disorders
  and Identifying Distinct Features
Exploiting ChatGPT for Diagnosing Autism-Associated Language Disorders and Identifying Distinct FeaturesResearch Square (RS), 2024
Chuanbo Hu
Wenqi Li
Mindi Ruan
Xiangxu Yu
Lynn K. Paul
Shuo Wang
Xin Li
129
8
0
03 May 2024
Large Language Models for UAVs: Current State and Pathways to the Future
Large Language Models for UAVs: Current State and Pathways to the FutureIEEE Open Journal of Vehicular Technology (OJVT), 2024
Shumaila Javaid
Nasir Saeed
Bin He
281
79
0
02 May 2024
Previous
123...8910...596061
Next
Page 9 of 61
Pageof 61