ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

International Conference on Learning Representations (ICLR), 2019
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL, AIMat

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 3,044 papers shown
GMAT: Global Memory Augmentation for Transformers
Ankit Gupta
Jonathan Berant
RALM
145
52
0
05 Jun 2020
Understanding Self-Attention of Self-Supervised Audio Transformers
Shu-Wen Yang
Andy T. Liu
Hung-yi Lee
124
31
0
05 Jun 2020
Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing
Zihang Dai
Guokun Lai
Yiming Yang
Quoc V. Le
228
251
0
05 Jun 2020
Position Masking for Language Models
Andy Wagner
T. Mitra
Mrinal Iyer
Godfrey Da Costa
Marc Tremblay
36
5
0
02 Jun 2020
Subjective Question Answering: Deciphering the inner workings of Transformers in the realm of subjectivity
Lukas Muttenthaler
147
3
0
02 Jun 2020
WikiBERT models: deep transfer learning for many languages
Nordic Conference of Computational Linguistics (NODALIDA), 2020
S. Pyysalo
Jenna Kanerva
Antti Virtanen
Filip Ginter
KELM
151
39
0
02 Jun 2020
Question Answering on Scholarly Knowledge Graphs
International Conference on Theory and Practice of Digital Libraries (TPDL), 2020
M. Y. Jaradeh
M. Stocker
Sören Auer
LMTD, RALM
94
15
0
02 Jun 2020
Careful analysis of XRD patterns with Attention
Koichi Kano
T. Segi
H. Ozono
66
0
0
02 Jun 2020
A Pairwise Probe for Understanding BERT Fine-Tuning on Machine Reading Comprehension
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2020
Jie Cai
Zhengzhou Zhu
Ping Nie
Qian Liu
AAML
87
7
0
02 Jun 2020
BERT-based Ensembles for Modeling Disclosure and Support in Conversational Social Media Text
Tanvi Dadu
Kartikey Pant
R. Mamidi
79
9
0
01 Jun 2020
Emergence of Separable Manifolds in Deep Language Representations
International Conference on Machine Learning (ICML), 2020
Jonathan Mamou
Hang Le
Miguel Angel del Rio
Cory Stephenson
Hanlin Tang
Yoon Kim
SueYeon Chung
AAML, AI4CE
260
44
0
01 Jun 2020
Conversational Machine Comprehension: a Literature Review
International Conference on Computational Linguistics (COLING), 2020
Somil Gupta
Bhanu Pratap Singh Rawat
Hong Yu
172
22
0
01 Jun 2020
A Survey on Transfer Learning in Natural Language Processing
Zaid Alyafeai
Maged S. Alshaibani
Irfan Ahmad
201
84
0
31 May 2020
LRG at SemEval-2020 Task 7: Assessing the Ability of BERT and Derivative Models to Perform Short-Edits based Humor Grading
International Workshop on Semantic Evaluation (SemEval), 2020
Siddhant Mahurkar
Rajaswa Patil
114
8
0
31 May 2020
Beyond Leaderboards: A survey of methods for revealing weaknesses in Natural Language Inference data and models
Viktor Schlegel
Goran Nenadic
Riza Batista-Navarro
ELM
171
18
0
29 May 2020
ValueNet: A Natural Language-to-SQL System that Learns from Database Information
Ursin Brunner
Kurt Stockinger
110
10
0
29 May 2020
Language Models are Few-Shot Learners
Neural Information Processing Systems (NeurIPS), 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
1.9K
51,003
0
28 May 2020
Language Representation Models for Fine-Grained Sentiment Classification
Brian Cheang
Bailey Wei
David Kogan
H. Qiu
Masud Ahmed
AI4MH
121
11
0
27 May 2020
Syntactic Structure Distillation Pretraining For Bidirectional Encoders
Transactions of the Association for Computational Linguistics (TACL), 2020
A. Kuncoro
Lingpeng Kong
Daniel Fried
Dani Yogatama
Laura Rimell
Chris Dyer
Phil Blunsom
145
34
0
27 May 2020
GECToR -- Grammatical Error Correction: Tag, Not Rewrite
Workshop on Innovative Use of NLP for Building Educational Applications (UNBEA), 2020
Kostiantyn Omelianchuk
Vitaliy Atrasevych
Artem Chernodub
Oleksandr Skurzhanskyi
215
353
0
26 May 2020
ParsBERT: Transformer-based Model for Persian Language Understanding
Neural Processing Letters (NPL), 2020
Mehrdad Farahani
Mohammad Gharachorloo
Marzieh Farahani
Mohammad Manthouri
210
235
0
26 May 2020
An Audio-enriched BERT-based Framework for Spoken Multiple-choice Question Answering
Interspeech (Interspeech), 2020
Chia-Chih Kuo
Shang-Bao Luo
Kuan-Yu Chen
122
18
0
25 May 2020
NILE : Natural Language Inference with Faithful Natural Language Explanations
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Sawan Kumar
Partha P. Talukdar
XAI, LRM
251
169
0
25 May 2020
KaLM at SemEval-2020 Task 4: Knowledge-aware Language Models for Comprehension And Generation
Jiajing Wan
Xinting Huang
LRM
118
5
0
24 May 2020
Transformer-based Context-aware Sarcasm Detection in Conversation Threads from Social Media
Xiangjue Dong
Changmao Li
Jinho Choi
109
28
0
22 May 2020
Open-Retrieval Conversational Question Answering
Chen Qu
Liu Yang
Cen Chen
Minghui Qiu
W. Bruce Croft
Mohit Iyyer
RALM
193
192
0
22 May 2020
Comparative Study of Machine Learning Models and BERT on SQuAD
Devshree Patel
Param Raval
Ratnam Parikh
Yesha Shastri
78
8
0
22 May 2020
PruneNet: Channel Pruning via Global Importance
A. Khetan
Zohar Karnin
100
12
0
22 May 2020
Med-BERT: pre-trained contextualized embeddings on large-scale structured electronic health records for disease prediction
L. Rasmy
Yang Xiang
Z. Xie
Cui Tao
Degui Zhi
AI4MH, LM&MA
222
826
0
22 May 2020
Pretraining with Contrastive Sentence Objectives Improves Discourse Performance of Language Models
Dan Iter
Kelvin Guu
L. Lansing
Dan Jurafsky
154
83
0
20 May 2020
BiQGEMM: Matrix Multiplication with Lookup Table For Binary-Coding-based Quantized DNNs
Yongkweon Jeon
Baeseong Park
S. Kwon
Byeongwook Kim
Jeongin Yun
Dongsoo Lee
MQ
277
39
0
20 May 2020
FashionBERT: Text and Image Matching with Adaptive Loss for Cross-modal Retrieval
D. Gao
Linbo Jin
Ben Chen
Minghui Qiu
Peng Li
Yi Wei
Yitao Hu
Haozhe Jasper Wang
OOD
189
146
0
20 May 2020
Normalized Attention Without Probability Cage
Oliver Richter
Roger Wattenhofer
218
22
0
19 May 2020
Sketch-BERT: Learning Sketch Bidirectional Encoder Representation from Transformers by Self-supervised Learning of Sketch Gestalt
Hangyu Lin
Yanwei Fu
Yu-Gang Jiang
Xiangyang Xue
SSL
199
75
0
19 May 2020
Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio Representation
Po-Han Chi
Pei-Hung Chung
Tsung-Han Wu
Chun-Cheng Hsieh
Yen-Hao Chen
Shang-Wen Li
Hung-yi Lee
SSL
288
156
0
18 May 2020
Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory Prediction
Cunjun Yu
Xiao Ma
Jiawei Ren
Haiyu Zhao
Shuai Yi
320
561
0
18 May 2020
T-VSE: Transformer-Based Visual Semantic Embedding
M. Bastan
Arnau Ramisa
Mehmet Tek
ViT
107
7
0
17 May 2020
CS-NLP team at SemEval-2020 Task 4: Evaluation of State-of-the-art NLP Deep Learning Architectures on Commonsense Reasoning Task
Sirwe Saeedi
Ali (Aliakbar) Panahi
Seyran Saeedi
A. Fong
ReLM, ELM, LRM
213
12
0
17 May 2020
Speech Recognition and Multi-Speaker Diarization of Long Conversations
H. H. Mao
Shuyang Li
Julian McAuley
G. Cottrell
VLM
209
49
0
16 May 2020
CERT: Contrastive Self-supervised Learning for Language Understanding
Hongchao Fang
Sicheng Wang
Meng Zhou
Jiayuan Ding
P. Xie
ELM, SSL
181
368
0
16 May 2020
COVID-Twitter-BERT: A Natural Language Processing Model to Analyse COVID-19 Content on Twitter
Martin Müller
M. Salathé
P. Kummervold
VLM, MedIm, AI4MH
182
390
0
15 May 2020
Spelling Error Correction with Soft-Masked BERT
Shaohua Zhang
Haoran Huang
Jicong Liu
Hang Li
96
242
0
15 May 2020
WG-WaveNet: Real-Time High-Fidelity Speech Synthesis without GPU
Po-Chun Hsu
Hung-yi Lee
115
16
0
15 May 2020
Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond
Zhuosheng Zhang
Hai Zhao
Rui Wang
192
66
0
13 May 2020
Automated Extraction of Socio-political Events from News (AESPEN): Workshop and Shared Task Report
Ali Hürriyetoğlu
Vanni Zavarella
Hristo Tanev
E. Yoruk
Ali Safaya
Osman Mutlu
106
31
0
12 May 2020
A Report on the 2020 Sarcasm Detection Shared Task
Debanjan Ghosh
Avijit Vajpayee
Smaranda Muresan
113
63
0
12 May 2020
SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis
Hao Tian
Can Gao
Xinyan Xiao
Hao Liu
Bolei He
Hua Wu
Haifeng Wang
Feng Wu
194
265
0
12 May 2020
How Context Affects Language Models' Factual Predictions
Fabio Petroni
Patrick Lewis
Aleksandra Piktus
Tim Rocktäschel
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM
190
251
0
10 May 2020
schuBERT: Optimizing Elements of BERT
A. Khetan
Zohar Karnin
184
31
0
09 May 2020
Modeling Document Interactions for Learning to Rank with Regularized Self-Attention
Shuo Sun
Kevin Duh
93
5
0
08 May 2020