Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1909.11942
Cited By
v1
v2
v3
v4
v5
v6 (latest)
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
International Conference on Learning Representations (ICLR), 2019
26 September 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Github (3271★)
Papers citing
"ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"
50 / 3,044 papers shown
Title
GMAT: Global Memory Augmentation for Transformers
Ankit Gupta
Jonathan Berant
RALM
145
52
0
05 Jun 2020
Understanding Self-Attention of Self-Supervised Audio Transformers
Shu-Wen Yang
Andy T. Liu
Hung-yi Lee
124
31
0
05 Jun 2020
Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing
Zihang Dai
Guokun Lai
Yiming Yang
Quoc V. Le
228
251
0
05 Jun 2020
Position Masking for Language Models
Andy Wagner
T. Mitra
Mrinal Iyer
Godfrey Da Costa
Marc Tremblay
36
5
0
02 Jun 2020
Subjective Question Answering: Deciphering the inner workings of Transformers in the realm of subjectivity
Lukas Muttenthaler
147
3
0
02 Jun 2020
WikiBERT models: deep transfer learning for many languages
Nordic Conference of Computational Linguistics (NODALIDA), 2020
S. Pyysalo
Jenna Kanerva
Antti Virtanen
Filip Ginter
KELM
151
39
0
02 Jun 2020
Question Answering on Scholarly Knowledge Graphs
International Conference on Theory and Practice of Digital Libraries (TPDL), 2020
M. Y. Jaradeh
M. Stocker
Sören Auer
LMTD
RALM
94
15
0
02 Jun 2020
Careful analysis of XRD patterns with Attention
Koichi Kano
T. Segi
H. Ozono
66
0
0
02 Jun 2020
A Pairwise Probe for Understanding BERT Fine-Tuning on Machine Reading Comprehension
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2020
Jie Cai
Zhengzhou Zhu
Ping Nie
Qian Liu
AAML
87
7
0
02 Jun 2020
BERT-based Ensembles for Modeling Disclosure and Support in Conversational Social Media Text
Tanvi Dadu
Kartikey Pant
R. Mamidi
79
9
0
01 Jun 2020
Emergence of Separable Manifolds in Deep Language Representations
International Conference on Machine Learning (ICML), 2020
Jonathan Mamou
Hang Le
Miguel Angel del Rio
Cory Stephenson
Hanlin Tang
Yoon Kim
SueYeon Chung
AAML
AI4CE
260
44
0
01 Jun 2020
Conversational Machine Comprehension: a Literature Review
International Conference on Computational Linguistics (COLING), 2020
Somil Gupta
Bhanu Pratap Singh Rawat
Hong Yu
172
22
0
01 Jun 2020
A Survey on Transfer Learning in Natural Language Processing
Zaid Alyafeai
Maged S. Alshaibani
Irfan Ahmad
201
84
0
31 May 2020
LRG at SemEval-2020 Task 7: Assessing the Ability of BERT and Derivative Models to Perform Short-Edits based Humor Grading
International Workshop on Semantic Evaluation (SemEval), 2020
Siddhant Mahurkar
Rajaswa Patil
114
8
0
31 May 2020
Beyond Leaderboards: A survey of methods for revealing weaknesses in Natural Language Inference data and models
Viktor Schlegel
Goran Nenadic
Riza Batista-Navarro
ELM
171
18
0
29 May 2020
ValueNet: A Natural Language-to-SQL System that Learns from Database Information
Ursin Brunner
Kurt Stockinger
110
10
0
29 May 2020
Language Models are Few-Shot Learners
Neural Information Processing Systems (NeurIPS), 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
1.9K
51,003
0
28 May 2020
Language Representation Models for Fine-Grained Sentiment Classification
Brian Cheang
Bailey Wei
David Kogan
H. Qiu
Masud Ahmed
AI4MH
121
11
0
27 May 2020
Syntactic Structure Distillation Pretraining For Bidirectional Encoders
Transactions of the Association for Computational Linguistics (TACL), 2020
A. Kuncoro
Lingpeng Kong
Daniel Fried
Dani Yogatama
Laura Rimell
Chris Dyer
Phil Blunsom
145
34
0
27 May 2020
GECToR -- Grammatical Error Correction: Tag, Not Rewrite
Workshop on Innovative Use of NLP for Building Educational Applications (UNBEA), 2020
Kostiantyn Omelianchuk
Vitaliy Atrasevych
Artem Chernodub
Oleksandr Skurzhanskyi
215
353
0
26 May 2020
ParsBERT: Transformer-based Model for Persian Language Understanding
Neural Processing Letters (NPL), 2020
Mehrdad Farahani
Mohammad Gharachorloo
Marzieh Farahani
Mohammad Manthouri
210
235
0
26 May 2020
An Audio-enriched BERT-based Framework for Spoken Multiple-choice Question Answering
Interspeech (Interspeech), 2020
Chia-Chih Kuo
Shang-Bao Luo
Kuan-Yu Chen
122
18
0
25 May 2020
NILE : Natural Language Inference with Faithful Natural Language Explanations
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Sawan Kumar
Partha P. Talukdar
XAI
LRM
251
169
0
25 May 2020
KaLM at SemEval-2020 Task 4: Knowledge-aware Language Models for Comprehension And Generation
Jiajing Wan
Xinting Huang
LRM
118
5
0
24 May 2020
Transformer-based Context-aware Sarcasm Detection in Conversation Threads from Social Media
Xiangjue Dong
Changmao Li
Jinho Choi
109
28
0
22 May 2020
Open-Retrieval Conversational Question Answering
Chen Qu
Liu Yang
Cen Chen
Minghui Qiu
W. Bruce Croft
Mohit Iyyer
RALM
193
192
0
22 May 2020
Comparative Study of Machine Learning Models and BERT on SQuAD
Devshree Patel
Param Raval
Ratnam Parikh
Yesha Shastri
78
8
0
22 May 2020
PruneNet: Channel Pruning via Global Importance
A. Khetan
Zohar Karnin
100
12
0
22 May 2020
Med-BERT: pre-trained contextualized embeddings on large-scale structured electronic health records for disease prediction
L. Rasmy
Yang Xiang
Z. Xie
Cui Tao
Degui Zhi
AI4MH
LM&MA
222
826
0
22 May 2020
Pretraining with Contrastive Sentence Objectives Improves Discourse Performance of Language Models
Dan Iter
Kelvin Guu
L. Lansing
Dan Jurafsky
154
83
0
20 May 2020
BiQGEMM: Matrix Multiplication with Lookup Table For Binary-Coding-based Quantized DNNs
Yongkweon Jeon
Baeseong Park
S. Kwon
Byeongwook Kim
Jeongin Yun
Dongsoo Lee
MQ
277
39
0
20 May 2020
FashionBERT: Text and Image Matching with Adaptive Loss for Cross-modal Retrieval
D. Gao
Linbo Jin
Ben Chen
Minghui Qiu
Peng Li
Yi Wei
Yitao Hu
Haozhe Jasper Wang
OOD
189
146
0
20 May 2020
Normalized Attention Without Probability Cage
Oliver Richter
Roger Wattenhofer
218
22
0
19 May 2020
Sketch-BERT: Learning Sketch Bidirectional Encoder Representation from Transformers by Self-supervised Learning of Sketch Gestalt
Hangyu Lin
Yanwei Fu
Yu-Gang Jiang
Xiangyang Xue
SSL
199
75
0
19 May 2020
Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio Representation
Po-Han Chi
Pei-Hung Chung
Tsung-Han Wu
Chun-Cheng Hsieh
Yen-Hao Chen
Shang-Wen Li
Hung-yi Lee
SSL
288
156
0
18 May 2020
Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory Prediction
Cunjun Yu
Xiao Ma
Jiawei Ren
Haiyu Zhao
Shuai Yi
320
561
0
18 May 2020
T-VSE: Transformer-Based Visual Semantic Embedding
M. Bastan
Arnau Ramisa
Mehmet Tek
ViT
107
7
0
17 May 2020
CS-NLP team at SemEval-2020 Task 4: Evaluation of State-of-the-art NLP Deep Learning Architectures on Commonsense Reasoning Task
Sirwe Saeedi
Ali (Aliakbar) Panahi
Seyran Saeedi
A. Fong
ReLM
ELM
LRM
213
12
0
17 May 2020
Speech Recognition and Multi-Speaker Diarization of Long Conversations
H. H. Mao
Shuyang Li
Julian McAuley
G. Cottrell
VLM
209
49
0
16 May 2020
CERT: Contrastive Self-supervised Learning for Language Understanding
Hongchao Fang
Sicheng Wang
Meng Zhou
Jiayuan Ding
P. Xie
ELM
SSL
181
368
0
16 May 2020
COVID-Twitter-BERT: A Natural Language Processing Model to Analyse COVID-19 Content on Twitter
Martin Müller
M. Salathé
P. Kummervold
VLM
MedIm
AI4MH
182
390
0
15 May 2020
Spelling Error Correction with Soft-Masked BERT
Shaohua Zhang
Haoran Huang
Jicong Liu
Hang Li
96
242
0
15 May 2020
WG-WaveNet: Real-Time High-Fidelity Speech Synthesis without GPU
Po-Chun Hsu
Hung-yi Lee
115
16
0
15 May 2020
Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond
Zhuosheng Zhang
Hai Zhao
Rui Wang
192
66
0
13 May 2020
Automated Extraction of Socio-political Events from News (AESPEN): Workshop and Shared Task Report
Ali Hürriyetoǧlu
Vanni Zavarella
Hristo Tanev
E. Yoruk
Ali Safaya
Osman Mutlu
106
31
0
12 May 2020
A Report on the 2020 Sarcasm Detection Shared Task
Debanjan Ghosh
Avijit Vajpayee
Smaranda Muresan
113
63
0
12 May 2020
SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis
Hao Tian
Can Gao
Xinyan Xiao
Hao Liu
Bolei He
Hua Wu
Haifeng Wang
Feng Wu
194
265
0
12 May 2020
How Context Affects Language Models' Factual Predictions
Fabio Petroni
Patrick Lewis
Aleksandra Piktus
Tim Rocktaschel
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM
190
251
0
10 May 2020
schuBERT: Optimizing Elements of BERT
A. Khetan
Zohar Karnin
184
31
0
09 May 2020
Modeling Document Interactions for Learning to Rank with Regularized Self-Attention
Shuo Sun
Kevin Duh
93
5
0
08 May 2020
Previous
1
2
3
...
56
57
58
59
60
61
Next