Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 15,000 papers shown
Title
A Neighbourhood Framework for Resource-Lean Content Flagging
Sheikh Muhammad Sarwar
Dimitrina Zlatkova
Momchil Hardalov
Yoan Dinkov
Isabelle Augenstein
Preslav Nakov
16
5
0
31 Mar 2021
Deep Neural Approaches to Relation Triplets Extraction: A Comprehensive Survey
Tapas Nayak
Navonil Majumder
Pawan Goyal
Soujanya Poria
ViT
14
49
0
31 Mar 2021
Learning Generalizable Robotic Reward Functions from "In-The-Wild" Human Videos
Annie S. Chen
Suraj Nair
Chelsea Finn
30
137
0
31 Mar 2021
Dual Contrastive Loss and Attention for GANs
Ning Yu
Guilin Liu
Aysegül Dündar
Andrew Tao
Bryan Catanzaro
Larry S. Davis
Mario Fritz
GAN
24
60
0
31 Mar 2021
EnergyVis: Interactively Tracking and Exploring Energy Consumption for ML Models
Omar Shaikh
Jon Saad-Falcon
Austin P. Wright
Nilaksh Das
Scott Freitas
O. Asensio
Duen Horng Chau
24
18
0
30 Mar 2021
Grounding Dialogue Systems via Knowledge Graph Aware Decoding with Pre-trained Transformers
Debanjan Chaudhuri
Md. Rony
Jens Lehmann
13
12
0
30 Mar 2021
Autocorrect in the Process of Translation -- Multi-task Learning Improves Dialogue Machine Translation
Tao Wang
Chengqi Zhao
Mingxuan Wang
Lei Li
Deyi Xiong
20
13
0
30 Mar 2021
CaSiNo: A Corpus of Campsite Negotiation Dialogues for Automatic Negotiation Systems
Kushal Chawla
Jaysa Ramirez
Rene Clever
Gale M. Lucas
Jonathan May
Jonathan Gratch
15
50
0
29 Mar 2021
ViViT: A Video Vision Transformer
Anurag Arnab
Mostafa Dehghani
G. Heigold
Chen Sun
Mario Lucic
Cordelia Schmid
ViT
30
2,086
0
29 Mar 2021
On the Adversarial Robustness of Vision Transformers
Rulin Shao
Zhouxing Shi
Jinfeng Yi
Pin-Yu Chen
Cho-Jui Hsieh
ViT
30
137
0
29 Mar 2021
Efficient Explanations from Empirical Explainers
Robert Schwarzenberg
Nils Feldhus
Sebastian Möller
FAtt
29
9
0
29 Mar 2021
Changing the Mind of Transformers for Topically-Controllable Language Generation
Haw-Shiuan Chang
Jiaming Yuan
Mohit Iyyer
Andrew McCallum
20
9
0
29 Mar 2021
PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS
Ye Jia
Heiga Zen
Jonathan Shen
Yu Zhang
Yonghui Wu
SSL
19
81
0
28 Mar 2021
TransICD: Transformer Based Code-wise Attention Model for Explainable ICD Coding
Biplob Biswas
Thai-Hoang Pham
Ping Zhang
13
29
0
28 Mar 2021
Accurate and Reliable Forecasting using Stochastic Differential Equations
Peng Cui
Zhijie Deng
Wenbo Hu
Jun Zhu
UQCV
30
1
0
28 Mar 2021
Automated Backend-Aware Post-Training Quantization
Ziheng Jiang
Animesh Jain
An Liu
Josh Fromm
Chengqian Ma
Tianqi Chen
Luis Ceze
MQ
35
2
0
27 Mar 2021
Machine Learning Meets Natural Language Processing -- The story so far
N. Galanis
P. Vafiadis
K.-G. Mirzaev
G. Papakostas
30
6
0
27 Mar 2021
Synthesis of Compositional Animations from Textual Descriptions
Anindita Ghosh
N. Cheema
Cennet Oguz
Christian Theobalt
P. Slusallek
31
170
0
26 Mar 2021
Gated Transformer Networks for Multivariate Time Series Classification
Minghao Liu
Shengqi Ren
Siyuan Ma
Jiahui Jiao
Yizhou Chen
Zhiguang Wang
Wei Song
AI4TS
36
130
0
26 Mar 2021
Describing and Localizing Multiple Changes with Transformers
Yue Qiu
Shintaro Yamamoto
Kodai Nakashima
Ryota Suzuki
K. Iwata
Hirokatsu Kataoka
Y. Satoh
27
55
0
25 Mar 2021
High-Fidelity Pluralistic Image Completion with Transformers
Ziyu Wan
Jingbo Zhang
Dongdong Chen
Jing Liao
ViT
23
231
0
25 Mar 2021
Bertinho: Galician BERT Representations
David Vilares
Marcos Garcia
Carlos Gómez-Rodríguez
57
22
0
25 Mar 2021
Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting
A. Bhunia
Pinaki Nath Chowdhury
Yongxin Yang
Timothy M. Hospedales
Tao Xiang
Yi-Zhe Song
SSL
17
59
0
25 Mar 2021
An Approach to Improve Robustness of NLP Systems against ASR Errors
Tong Cui
Jinghui Xiao
Liangyou Li
Xin Jiang
Qun Liu
19
11
0
25 Mar 2021
Improving Online Forums Summarization via Hierarchical Unified Deep Neural Network
Sansiri Tarnpradab
Fereshteh Jafariakinabad
K. Hua
13
5
0
25 Mar 2021
Efficient Feature Transformations for Discriminative and Generative Continual Learning
Vinay K. Verma
Kevin J Liang
Nikhil Mehta
Piyush Rai
Lawrence Carin
CLL
35
76
0
25 Mar 2021
Vision Transformers for Dense Prediction
René Ranftl
Alexey Bochkovskiy
V. Koltun
ViT
MDE
42
1,659
0
24 Mar 2021
FastMoE: A Fast Mixture-of-Expert Training System
Jiaao He
J. Qiu
Aohan Zeng
Zhilin Yang
Jidong Zhai
Jie Tang
ALM
MoE
22
94
0
24 Mar 2021
Representing Numbers in NLP: a Survey and a Vision
Avijit Thawani
Jay Pujara
Pedro A. Szekely
Filip Ilievski
24
114
0
24 Mar 2021
Thinking Aloud: Dynamic Context Generation Improves Zero-Shot Reasoning Performance of GPT-2
Gregor Betz
Kyle Richardson
Christian Voigt
ReLM
LRM
16
29
0
24 Mar 2021
Czert -- Czech BERT-like Model for Language Representation
Jakub Sido
O. Pražák
P. Pribán
Jan Pasek
Michal Seják
Miloslav Konopík
16
43
0
24 Mar 2021
UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask Benchmark
Nicholas Lourie
Ronan Le Bras
Chandra Bhagavatula
Yejin Choi
LRM
22
137
0
24 Mar 2021
Multi-view 3D Reconstruction with Transformer
Dan Wang
Xinrui Cui
Xun Chen
Zhengxia Zou
Tianyang Shi
Septimiu Salcudean
Z. J. Wang
Rabab Ward
ViT
20
87
0
24 Mar 2021
Scene-Intuitive Agent for Remote Embodied Visual Grounding
Xiangru Lin
Guanbin Li
Yizhou Yu
LM&Ro
22
52
0
24 Mar 2021
Region Similarity Representation Learning
Tete Xiao
Colorado Reed
Xiaolong Wang
Kurt Keutzer
Trevor Darrell
VLM
SSL
29
116
0
24 Mar 2021
The NLP Cookbook: Modern Recipes for Transformer based Deep Learning Architectures
Sushant Singh
A. Mahmood
AI4TS
60
92
0
23 Mar 2021
Self-Supervised Pretraining Improves Self-Supervised Pretraining
Colorado Reed
Xiangyu Yue
Aniruddha Nrusimha
Sayna Ebrahimi
Vivek Vijaykumar
...
Shanghang Zhang
Devin Guillory
Sean L. Metzger
Kurt Keutzer
Trevor Darrell
25
105
0
23 Mar 2021
QuestEval: Summarization Asks for Fact-based Evaluation
Thomas Scialom
Paul-Alexis Dray
Patrick Gallinari
Sylvain Lamprier
Benjamin Piwowarski
Jacopo Staiano
Alex Jinpeng Wang
HILM
11
267
0
23 Mar 2021
How to decay your learning rate
Aitor Lewkowycz
36
24
0
23 Mar 2021
Self-supervised representation learning from 12-lead ECG data
Temesgen Mehari
Nils Strodthoff
SSL
18
141
0
23 Mar 2021
Are Neural Language Models Good Plagiarists? A Benchmark for Neural Paraphrase Detection
Jan Philip Wahle
Terry Ruas
Norman Meuschke
Bela Gipp
25
34
0
23 Mar 2021
Detecting Hate Speech with GPT-3
Ke-Li Chiu
Annie Collins
Rohan Alexander
AILaw
15
108
0
23 Mar 2021
Instance-level Image Retrieval using Reranking Transformers
Fuwen Tan
Jiangbo Yuan
Vicente Ordonez
ViT
26
89
0
22 Mar 2021
Tiny Transformers for Environmental Sound Classification at the Edge
David Elliott
Carlos E. Otero
Steven Wyatt
Evan Martino
21
15
0
22 Mar 2021
Open Domain Question Answering over Tables via Dense Retrieval
Jonathan Herzig
Thomas Müller
Syrine Krichene
Julian Martin Eisenschlos
LMTD
VLM
RALM
36
99
0
22 Mar 2021
Improving and Simplifying Pattern Exploiting Training
Derek Tam
Rakesh R Menon
Mohit Bansal
Shashank Srivastava
Colin Raffel
13
149
0
22 Mar 2021
BERT: A Review of Applications in Natural Language Processing and Understanding
M. V. Koroteev
VLM
22
194
0
22 Mar 2021
Retrieve Fast, Rerank Smart: Cooperative and Joint Approaches for Improved Cross-Modal Retrieval
Gregor Geigle
Jonas Pfeiffer
Nils Reimers
Ivan Vulić
Iryna Gurevych
27
59
0
22 Mar 2021
Identifying Machine-Paraphrased Plagiarism
Jan Philip Wahle
Terry Ruas
Tomávs Foltýnek
Norman Meuschke
Bela Gipp
11
30
0
22 Mar 2021
DeepViT: Towards Deeper Vision Transformer
Daquan Zhou
Bingyi Kang
Xiaojie Jin
Linjie Yang
Xiaochen Lian
Zihang Jiang
Qibin Hou
Jiashi Feng
ViT
42
510
0
22 Mar 2021
Previous
1
2
3
...
257
258
259
...
298
299
300
Next