Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.04805
Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
Re-assign community
ArXiv
PDF
HTML
Papers citing
"BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
50 / 13,105 papers shown
Title
An Effective Label Noise Model for DNN Text Classification
Ishan Jindal
Daniel Pressel
Brian Lester
M. Nokleby
NoLa
12
48
0
18 Mar 2019
ETNLP: a visual-aided systematic approach to select pre-trained embeddings for a downstream task
Xuan-Son Vu
Thanh Vu
Son N. Tran
Lili Jiang
16
6
0
11 Mar 2019
Fast Prototyping a Dialogue Comprehension System for Nurse-Patient Conversations on Symptom Monitoring
Zhengyuan Liu
Jia Hui Hazel Lim
Nur Farah Ain Binte Sahimi
Shao Chuen Tong
Sharon Ong
...
M. Macdonald
Savitha Ramasamy
Pavitra Krishnaswamy
W. Chow
Nancy F. Chen
6
24
0
08 Mar 2019
Neural Language Models as Psycholinguistic Subjects: Representations of Syntactic State
Richard Futrell
Ethan Gotlieb Wilcox
Takashi Morita
Peng Qian
Miguel Ballesteros
R. Levy
MILM
8
190
0
08 Mar 2019
Predicting Research Trends From Arxiv
Steffen Eger
Chao Li
Florian Netzer
Iryna Gurevych
11
7
0
07 Mar 2019
SemEval-2019 Task 1: Cross-lingual Semantic Parsing with UCCA
Daniel Hershcovich
Zohar Aizenbud
Leshem Choshen
Elior Sulem
A. Rappoport
Omri Abend
14
36
0
06 Mar 2019
SECNLP: A Survey of Embeddings in Clinical Natural Language Processing
Katikapalli Subramanyam Kalyan
S. Sangeetha
12
82
0
04 Mar 2019
Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data
Wei-Ye Zhao
Liang Wang
Kewei Shen
Ruoyu Jia
Jingming Liu
14
210
0
01 Mar 2019
Infer Your Enemies and Know Yourself, Learning in Real-Time Bidding with Partially Observable Opponents
Manxing Du
Alexander I. Cowen-Rivers
Ying Wen
Phu Sakulwongtana
Jun Wang
M. Brorsson
R. State
11
1
0
28 Feb 2019
Link Prediction with Mutual Attention for Text-Attributed Networks
Robin Brochier
Adrien Guille
Julien Velcin
6
12
0
28 Feb 2019
Better, Faster, Stronger Sequence Tagging Constituent Parsers
David Vilares
Mostafa Abdou
Anders Søgaard
33
22
0
28 Feb 2019
An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models
Alexandra Chronopoulou
Christos Baziotis
Alexandros Potamianos
CLL
20
128
0
27 Feb 2019
Multi-Task Learning with Contextualized Word Representations for Extented Named Entity Recognition
Thai-Hoang Pham
Khai Mai
M. T. Nguyen
Nguyen Tuan Duc
Danushka Bollegala
Ryohei Sasano
Satoshi Sekine
9
4
0
26 Feb 2019
Enhancing Clinical Concept Extraction with Contextual Embeddings
Yuqi Si
Jingqi Wang
Hua Xu
Kirk Roberts
AI4MH
13
286
0
22 Feb 2019
Improving Multilingual Sentence Embedding using Bi-directional Dual Encoder with Additive Margin Softmax
Yinfei Yang
Gustavo Hernández Ábrego
Steve Yuan
Mandy Guo
Qinlan Shen
Daniel Matthew Cer
Yun-hsuan Sung
B. Strope
R. Kurzweil
41
115
0
22 Feb 2019
Breaking the Softmax Bottleneck via Learnable Monotonic Pointwise Non-linearities
O. Ganea
Sylvain Gelly
Gary Bécigneul
Aliaksei Severyn
19
18
0
21 Feb 2019
Using Embeddings to Correct for Unobserved Confounding in Networks
Victor Veitch
Yixin Wang
David M. Blei
CML
13
56
0
11 Feb 2019
End-to-End Open-Domain Question Answering with BERTserini
Wei Yang
Yuqing Xie
Aileen Lin
Xingyu Li
Luchen Tan
Kun Xiong
Ming Li
Jimmy J. Lin
RALM
15
491
0
05 Feb 2019
A large-scale crowdsourced analysis of abuse against women journalists and politicians on Twitter
Laure Delisle
Freddie Kalaitzis
Krzysztof Majewski
A. D. Berker
M. Marin
Julien Cornebise
11
29
0
31 Jan 2019
Glyce: Glyph-vectors for Chinese Character Representations
Yuxian Meng
Wei Yu Wu
Fei Wang
Xiaoya Li
Ping Nie
J. Mei
Muyu Li
Qinghong Han
Xiaofei Sun
Jiwei Li
VLM
9
188
0
29 Jan 2019
Stiffness: A New Perspective on Generalization in Neural Networks
Stanislav Fort
Pawel Krzysztof Nowak
Stanislaw Jastrzebski
S. Narayanan
11
94
0
28 Jan 2019
Dual Co-Matching Network for Multi-choice Reading Comprehension
Shuailiang Zhang
Zhao Hai
Yuwei Wu
Zhuosheng Zhang
Xi Zhou
Xiaoping Zhou
28
131
0
27 Jan 2019
A BERT Baseline for the Natural Questions
Chris Alberti
Kenton Lee
Michael Collins
ELM
AI4MH
14
126
0
24 Jan 2019
TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents
Thomas Wolf
Victor Sanh
Julien Chaumond
Clement Delangue
17
493
0
23 Jan 2019
Cross-lingual Language Model Pretraining
Guillaume Lample
Alexis Conneau
18
2,707
0
22 Jan 2019
Physics-Constrained Deep Learning for High-dimensional Surrogate Modeling and Uncertainty Quantification without Labeled Data
Yinhao Zhu
N. Zabaras
P. Koutsourelakis
P. Perdikaris
PINN
AI4CE
26
853
0
18 Jan 2019
Sentence transition matrix: An efficient approach that preserves sentence semantics
Myeongjun Jang
Pilsung Kang
11
2
0
16 Jan 2019
Exploiting Synchronized Lyrics And Vocal Features For Music Emotion Detection
Loreto Parisi
Simone Francia
Silvio Olivastri
Maria Stella Tavella
16
11
0
15 Jan 2019
Linguistic Analysis of Pretrained Sentence Encoders with Acceptability Judgments
Alex Warstadt
Samuel R. Bowman
16
23
0
11 Jan 2019
Judge the Judges: A Large-Scale Evaluation Study of Neural Language Models for Online Review Generation
Cristina Garbacea
Samuel Carton
Shiyan Yan
Qiaozhu Mei
ELM
17
29
0
02 Jan 2019
Graph Neural Networks: A Review of Methods and Applications
Jie Zhou
Ganqu Cui
Shengding Hu
Zhengyan Zhang
Cheng Yang
Zhiyuan Liu
Lifeng Wang
Changcheng Li
Maosong Sun
AI4CE
GNN
26
5,390
0
20 Dec 2018
Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual Question Answering
Peng Gao
Zhengkai Jiang
Haoxuan You
Pan Lu
Steven C. H. Hoi
Xiaogang Wang
Hongsheng Li
AIMat
19
362
0
13 Dec 2018
Auto-Encoding Scene Graphs for Image Captioning
Xu Yang
Kaihua Tang
Hanwang Zhang
Jianfei Cai
13
692
0
06 Dec 2018
Efficient Attention: Attention with Linear Complexities
Zhuoran Shen
Mingyuan Zhang
Haiyu Zhao
Shuai Yi
Hongsheng Li
15
506
0
04 Dec 2018
An Introductory Survey on Attention Mechanisms in NLP Problems
Dichao Hu
AIMat
6
246
0
12 Nov 2018
Speech Intention Understanding in a Head-final Language: A Disambiguation Utilizing Intonation-dependency
Won Ik Cho
Hyeon Seung Lee
J. Yoon
Seokhwan Kim
N. Kim
23
5
0
10 Nov 2018
Sentence Encoders on STILTs: Supplementary Training on Intermediate Labeled-data Tasks
Jason Phang
Thibault Févry
Samuel R. Bowman
19
466
0
02 Nov 2018
Improving Machine Reading Comprehension with General Reading Strategies
Kai Sun
Dian Yu
Dong Yu
Claire Cardie
AI4CE
8
116
0
31 Oct 2018
Large-scale Hierarchical Alignment for Data-driven Text Rewriting
Nikola I. Nikolov
Richard H. R. Hahnloser
32
7
0
18 Oct 2018
A Span-Extraction Dataset for Chinese Machine Reading Comprehension
Yiming Cui
Ting Liu
Wanxiang Che
Li Xiao
Zhipeng Chen
Wentao Ma
Shijin Wang
Guoping Hu
26
181
0
17 Oct 2018
Multi-Source Cross-Lingual Model Transfer: Learning What to Share
Xilun Chen
Ahmed Hassan Awadallah
Hany Hassan
Wei Wang
Claire Cardie
34
20
0
08 Oct 2018
Neural Approaches to Conversational AI
Jianfeng Gao
Michel Galley
Lihong Li
32
668
0
21 Sep 2018
Multi-task Learning with Sample Re-weighting for Machine Reading Comprehension
Yichong Xu
Xiaodong Liu
Yelong Shen
Jingjing Liu
Jianfeng Gao
19
51
0
18 Sep 2018
RumourEval 2019: Determining Rumour Veracity and Support for Rumours
G. Gorrell
Kalina Bontcheva
Leon Derczynski
E. Kochkina
Maria Liakata
A. Zubiaga
9
213
0
18 Sep 2018
Explainable Recommendation: A Survey and New Perspectives
Yongfeng Zhang
Xu Chen
XAI
LRM
12
862
0
30 Apr 2018
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
294
6,950
0
20 Apr 2018
Interact and Decide: Medley of Sub-Attention Networks for Effective Group Recommendation
Lucas Vinh Tran
T. Pham
Yi Tay
Yiding Liu
Gao Cong
Xiaoli Li
11
93
0
12 Apr 2018
Clinical Concept Embeddings Learned from Massive Sources of Multimodal Medical Data
Andrew L. Beam
Benjamin Kompa
A. Schmaltz
Inbar Fried
G. Weber
N. Palmer
Xu Shi
Tianxi Cai
I. Kohane
8
176
0
04 Apr 2018
The Geometry of Culture: Analyzing Meaning through Word Embeddings
Austin C. Kozlowski
Matt Taddy
James A. Evans
19
375
0
25 Mar 2018
SparCML: High-Performance Sparse Communication for Machine Learning
Cédric Renggli
Saleh Ashkboos
Mehdi Aghagolzadeh
Dan Alistarh
Torsten Hoefler
10
126
0
22 Feb 2018
Previous
1
2
3
...
261
262
263
Next