Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1810.04805
Cited By
v1
v2 (latest)
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 33,017 papers shown
Title
Normalized Flat Minima: Exploring Scale Invariant Definition of Flat Minima for Neural Networks using PAC-Bayesian Analysis
Yusuke Tsuzuku
Issei Sato
Masashi Sugiyama
179
86
0
15 Jan 2019
Passage Re-ranking with BERT
Rodrigo Nogueira
Dong Wang
OOD
449
1,228
0
13 Jan 2019
EQUATE: A Benchmark Evaluation Framework for Quantitative Reasoning in Natural Language Inference
Abhilasha Ravichander
Aakanksha Naik
Carolyn Rose
Eduard H. Hovy
AIMat
ELM
183
86
0
11 Jan 2019
Linguistic Analysis of Pretrained Sentence Encoders with Acceptability Judgments
Alex Warstadt
Samuel R. Bowman
237
24
0
11 Jan 2019
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
Zihang Dai
Zhilin Yang
Yiming Yang
J. Carbonell
Quoc V. Le
Ruslan Salakhutdinov
VLM
672
4,089
0
09 Jan 2019
On the Possibilities and Limitations of Multi-hop Reasoning Under Linguistic Imperfections
Daniel Khashabi
Erfan Sadeqi Azer
Tushar Khot
Ashish Sabharwal
Dan Roth
LRM
150
9
0
08 Jan 2019
Multi-style Generative Reading Comprehension
Kyosuke Nishida
Itsumi Saito
Kosuke Nishida
Kazutoshi Shinoda
Atsushi Otsuka
Hisako Asano
J. Tomita
222
71
0
08 Jan 2019
Feature reinforcement with word embedding and parsing information in neural TTS
Huaiping Ming
Lei He
Haohan Guo
Frank Soong
322
15
0
03 Jan 2019
Judge the Judges: A Large-Scale Evaluation Study of Neural Language Models for Online Review Generation
Cristina Garbacea
Samuel Carton
Shiyan Yan
Qiaozhu Mei
ELM
209
32
0
02 Jan 2019
Text Infilling
Wanrong Zhu
Zhiting Hu
Eric Xing
309
64
0
01 Jan 2019
Multilingual Constituency Parsing with Self-Attention and Pre-Training
Nikita Kitaev
Steven Cao
Dan Klein
LRM
170
270
0
31 Dec 2018
A neural joint model for Vietnamese word segmentation, POS tagging and dependency parsing
Dat Quoc Nguyen
194
12
0
30 Dec 2018
Double Neural Counterfactual Regret Minimization
Hui Li
Kailiang Hu
Zhibang Ge
Tao Jiang
Yuan Qi
Le Song
125
53
0
27 Dec 2018
Adversarial Attack and Defense on Graph Data: A Survey
Lichao Sun
Yingtong Dou
Carl Yang
Ji Wang
Yixin Liu
Philip S. Yu
Lifang He
Yangqiu Song
GNN
AAML
336
343
0
26 Dec 2018
Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond
Mikel Artetxe
Holger Schwenk
3DV
317
1,088
0
26 Dec 2018
Deep Representation Learning for Clustering of Health Tweets
O. Gencoglu
SSL
118
10
0
25 Dec 2018
Exploiting Cross-Lingual Subword Similarities in Low-Resource Document Classification
Mozhi Zhang
Yoshinari Fujinuma
Jordan L. Boyd-Graber
379
21
0
22 Dec 2018
Joint Slot Filling and Intent Detection via Capsule Neural Networks
Chenwei Zhang
Yaliang Li
Nan Du
Wei Fan
Philip S. Yu
163
245
0
22 Dec 2018
A Survey on Deep Learning for Named Entity Recognition
Junlin Li
Aixin Sun
Jianglei Han
Chenliang Li
3DV
357
1,344
0
22 Dec 2018
Graph Neural Networks: A Review of Methods and Applications
Jie Zhou
Ganqu Cui
Shengding Hu
Zhengyan Zhang
Cheng Yang
Zhiyuan Liu
Lifeng Wang
Changcheng Li
Maosong Sun
AI4CE
GNN
1.9K
6,331
0
20 Dec 2018
Found in Translation: Learning Robust Joint Representations by Cyclic Translations Between Modalities
Hai Pham
Paul Pu Liang
Thomas Manzini
Louis-Philippe Morency
Barnabás Póczós
181
482
0
19 Dec 2018
A Tutorial on Deep Latent Variable Models of Natural Language
Yoon Kim
Sam Wiseman
Alexander M. Rush
BDL
VLM
230
45
0
17 Dec 2018
Conditional BERT Contextual Augmentation
Xing Wu
Shangwen Lv
Liangjun Zang
Jizhong Han
Songlin Hu
187
340
0
17 Dec 2018
Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual Question Answering
Shiyang Feng
Zhengkai Jiang
Haoxuan You
Pan Lu
Steven C. H. Hoi
Xiaogang Wang
Jiaming Song
AIMat
432
393
0
13 Dec 2018
Detecting weak and strong Islamophobic hate speech on social media
Bertie Vidgen
T. Yasseri
194
150
0
12 Dec 2018
SMIT: Stochastic Multi-Label Image-to-Image Translation
Andrés Romero
Pablo Arbelaez
Luc Van Gool
Radu Timofte
121
67
0
10 Dec 2018
SDNet: Contextualized Attention-based Deep Network for Conversational Question Answering
Chenguang Zhu
Michael Zeng
Xuedong Huang
189
131
0
10 Dec 2018
What is the Effect of Importance Weighting in Deep Learning?
Jonathon Byrd
Zachary Chase Lipton
421
508
0
08 Dec 2018
Auto-Encoding Scene Graphs for Image Captioning
Xu Yang
Kaihua Tang
Hanwang Zhang
Jianfei Cai
596
763
0
06 Dec 2018
Neural Abstractive Text Summarization with Sequence-to-Sequence Models
Tian Shi
Yaser Keneshloo
Naren Ramakrishnan
Chandan K. Reddy
323
252
0
05 Dec 2018
Efficient Attention: Attention with Linear Complexities
Zhuoran Shen
Mingyuan Zhang
Haiyu Zhao
Shuai Yi
Jiaming Song
602
652
0
04 Dec 2018
Practical Text Classification With Large Pre-Trained Language Models
Neel Kant
Raul Puri
Nikolai Yakovenko
Bryan Catanzaro
VLM
108
75
0
04 Dec 2018
Flexible and Scalable State Tracking Framework for Goal-Oriented Dialogue Systems
Rahul Goel
Shachi Paul
Tagyoung Chung
Jérémie Lecomte
Arindam Mandal
Dilek Z. Hakkani-Tür
89
15
0
30 Nov 2018
Visual Question Answering as Reading Comprehension
Hui Li
Peng Wang
Chunhua Shen
Anton Van Den Hengel
124
46
0
29 Nov 2018
Unsupervised Multi-modal Neural Machine Translation
Yuanhang Su
Kai Fan
Nguyen Bach
C.-C. Jay Kuo
Fei Huang
239
66
0
28 Nov 2018
From Recognition to Cognition: Visual Commonsense Reasoning
Rowan Zellers
Yonatan Bisk
Ali Farhadi
Yejin Choi
LRM
BDL
OCL
ReLM
596
984
0
27 Nov 2018
NSEEN: Neural Semantic Embedding for Entity Normalization
Shobeir Fakhraei
Joel Mathew
J. Ambite
124
19
0
19 Nov 2018
Synergistic Drug Combination Prediction by Integrating Multi-omics Data in Deep Learning Models
Tianyu Zhang
Liwei Zhang
Philip R. O. Payne
Fuhai Li
92
115
0
16 Nov 2018
Survey of Computational Approaches to Lexical Semantic Change
Nina Tahmasebi
L. Borin
Adam Jatowt
218
175
0
15 Nov 2018
Extractive Summary as Discrete Latent Variables
Aran Komatsuzaki
118
3
0
14 Nov 2018
An Introductory Survey on Attention Mechanisms in NLP Problems
Dichao Hu
AIMat
136
269
0
12 Nov 2018
Speech Intention Understanding in a Head-final Language: A Disambiguation Utilizing Intonation-dependency
Won Ik Cho
Hyeon Seung Lee
J. Yoon
Seokhwan Kim
N. Kim
312
5
0
10 Nov 2018
Densely Connected Attention Propagation for Reading Comprehension
Yi Tay
Anh Tuan Luu
S. Hui
Jian Su
178
48
0
10 Nov 2018
Effective Representation for Easy-First Dependency Parsing
Zuchao Li
Amir Vakili
Ali Montazer
166
0
0
08 Nov 2018
Language GANs Falling Short
International Conference on Learning Representations (ICLR), 2018
Massimo Caccia
Lucas Caccia
W. Fedus
Hugo Larochelle
Joelle Pineau
Laurent Charlin
522
230
0
06 Nov 2018
How Reasonable are Common-Sense Reasoning Tasks: A Case-Study on the Winograd Schema Challenge and SWAG
P. Trichelair
Ali Emami
Adam Trischler
Kaheer Suleman
Jackie C.K. Cheung
LRM
184
44
0
05 Nov 2018
Semantic Role Labeling for Knowledge Graph Extraction from Text
Mehwish Alam
Aldo Gangemi
Valentina Presutti
Diego Reforgiato Recupero
68
10
0
04 Nov 2018
Elastic CRFs for Open-ontology Slot Filling
Yinpei Dai
Yichi Zhang
Hong Liu
Zhijian Ou
Yanmeng Wang
Junlan Feng
199
2
0
04 Nov 2018
Learning to Rank Query Graphs for Complex Question Answering over Knowledge Graphs
Gaurav Maheshwari
Priyansh Trivedi
Denis Lukovnikov
Nilesh Chakraborty
Asja Fischer
Jens Lehmann
GNN
167
78
0
02 Nov 2018
Sentence Encoders on STILTs: Supplementary Training on Intermediate Labeled-data Tasks
Jason Phang
Thibault Févry
Samuel R. Bowman
324
480
0
02 Nov 2018
Previous
1
2
3
...
658
659
660
661
Next