BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM, SSL, SSeg

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 33,017 papers shown
Normalized Flat Minima: Exploring Scale Invariant Definition of Flat Minima for Neural Networks using PAC-Bayesian Analysis
Yusuke Tsuzuku
Issei Sato
Masashi Sugiyama
179
86
0
15 Jan 2019
Passage Re-ranking with BERT
Rodrigo Nogueira
Dong Wang
OOD
449
1,228
0
13 Jan 2019
EQUATE: A Benchmark Evaluation Framework for Quantitative Reasoning in Natural Language Inference
Abhilasha Ravichander
Aakanksha Naik
Carolyn Rose
Eduard H. Hovy
AIMat, ELM
183
86
0
11 Jan 2019
Linguistic Analysis of Pretrained Sentence Encoders with Acceptability Judgments
Alex Warstadt
Samuel R. Bowman
237
24
0
11 Jan 2019
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
Zihang Dai
Zhilin Yang
Yiming Yang
J. Carbonell
Quoc V. Le
Ruslan Salakhutdinov
VLM
672
4,089
0
09 Jan 2019
On the Possibilities and Limitations of Multi-hop Reasoning Under Linguistic Imperfections
Daniel Khashabi
Erfan Sadeqi Azer
Tushar Khot
Ashish Sabharwal
Dan Roth
LRM
150
9
0
08 Jan 2019
Multi-style Generative Reading Comprehension
Kyosuke Nishida
Itsumi Saito
Kosuke Nishida
Kazutoshi Shinoda
Atsushi Otsuka
Hisako Asano
J. Tomita
222
71
0
08 Jan 2019
Feature reinforcement with word embedding and parsing information in neural TTS
Huaiping Ming
Lei He
Haohan Guo
Frank Soong
322
15
0
03 Jan 2019
Judge the Judges: A Large-Scale Evaluation Study of Neural Language Models for Online Review Generation
Cristina Garbacea
Samuel Carton
Shiyan Yan
Qiaozhu Mei
ELM
209
32
0
02 Jan 2019
Text Infilling
Wanrong Zhu
Zhiting Hu
Eric Xing
309
64
0
01 Jan 2019
Multilingual Constituency Parsing with Self-Attention and Pre-Training
Nikita Kitaev
Steven Cao
Dan Klein
LRM
170
270
0
31 Dec 2018
A neural joint model for Vietnamese word segmentation, POS tagging and dependency parsing
Dat Quoc Nguyen
194
12
0
30 Dec 2018
Double Neural Counterfactual Regret Minimization
Hui Li
Kailiang Hu
Zhibang Ge
Tao Jiang
Yuan Qi
Le Song
125
53
0
27 Dec 2018
Adversarial Attack and Defense on Graph Data: A Survey
Lichao Sun
Yingtong Dou
Carl Yang
Ji Wang
Yixin Liu
Philip S. Yu
Lifang He
Yangqiu Song
GNN, AAML
336
343
0
26 Dec 2018
Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond
Mikel Artetxe
Holger Schwenk
3DV
317
1,088
0
26 Dec 2018
Deep Representation Learning for Clustering of Health Tweets
O. Gencoglu
SSL
118
10
0
25 Dec 2018
Exploiting Cross-Lingual Subword Similarities in Low-Resource Document Classification
Mozhi Zhang
Yoshinari Fujinuma
Jordan L. Boyd-Graber
379
21
0
22 Dec 2018
Joint Slot Filling and Intent Detection via Capsule Neural Networks
Chenwei Zhang
Yaliang Li
Nan Du
Wei Fan
Philip S. Yu
163
245
0
22 Dec 2018
A Survey on Deep Learning for Named Entity Recognition
Junlin Li
Aixin Sun
Jianglei Han
Chenliang Li
3DV
357
1,344
0
22 Dec 2018
Graph Neural Networks: A Review of Methods and Applications
Jie Zhou
Ganqu Cui
Shengding Hu
Zhengyan Zhang
Cheng Yang
Zhiyuan Liu
Lifeng Wang
Changcheng Li
Maosong Sun
AI4CE, GNN
1.9K
6,331
0
20 Dec 2018
Found in Translation: Learning Robust Joint Representations by Cyclic Translations Between Modalities
Hai Pham
Paul Pu Liang
Thomas Manzini
Louis-Philippe Morency
Barnabás Póczós
181
482
0
19 Dec 2018
A Tutorial on Deep Latent Variable Models of Natural Language
Yoon Kim
Sam Wiseman
Alexander M. Rush
BDL, VLM
230
45
0
17 Dec 2018
Conditional BERT Contextual Augmentation
Xing Wu
Shangwen Lv
Liangjun Zang
Jizhong Han
Songlin Hu
187
340
0
17 Dec 2018
Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual Question Answering
Shiyang Feng
Zhengkai Jiang
Haoxuan You
Pan Lu
Steven C. H. Hoi
Xiaogang Wang
Jiaming Song
AIMat
432
393
0
13 Dec 2018
Detecting weak and strong Islamophobic hate speech on social media
Bertie Vidgen
T. Yasseri
194
150
0
12 Dec 2018
SMIT: Stochastic Multi-Label Image-to-Image Translation
Andrés Romero
Pablo Arbelaez
Luc Van Gool
Radu Timofte
121
67
0
10 Dec 2018
SDNet: Contextualized Attention-based Deep Network for Conversational Question Answering
Chenguang Zhu
Michael Zeng
Xuedong Huang
189
131
0
10 Dec 2018
What is the Effect of Importance Weighting in Deep Learning?
Jonathon Byrd
Zachary Chase Lipton
421
508
0
08 Dec 2018
Auto-Encoding Scene Graphs for Image Captioning
Xu Yang
Kaihua Tang
Hanwang Zhang
Jianfei Cai
596
763
0
06 Dec 2018
Neural Abstractive Text Summarization with Sequence-to-Sequence Models
Tian Shi
Yaser Keneshloo
Naren Ramakrishnan
Chandan K. Reddy
323
252
0
05 Dec 2018
Efficient Attention: Attention with Linear Complexities
Zhuoran Shen
Mingyuan Zhang
Haiyu Zhao
Shuai Yi
Jiaming Song
602
652
0
04 Dec 2018
Practical Text Classification With Large Pre-Trained Language Models
Neel Kant
Raul Puri
Nikolai Yakovenko
Bryan Catanzaro
VLM
108
75
0
04 Dec 2018
Flexible and Scalable State Tracking Framework for Goal-Oriented Dialogue Systems
Rahul Goel
Shachi Paul
Tagyoung Chung
Jérémie Lecomte
Arindam Mandal
Dilek Z. Hakkani-Tür
89
15
0
30 Nov 2018
Visual Question Answering as Reading Comprehension
Hui Li
Peng Wang
Chunhua Shen
Anton Van Den Hengel
124
46
0
29 Nov 2018
Unsupervised Multi-modal Neural Machine Translation
Yuanhang Su
Kai Fan
Nguyen Bach
C.-C. Jay Kuo
Fei Huang
239
66
0
28 Nov 2018
From Recognition to Cognition: Visual Commonsense Reasoning
Rowan Zellers
Yonatan Bisk
Ali Farhadi
Yejin Choi
LRM, BDL, OCL, ReLM
596
984
0
27 Nov 2018
NSEEN: Neural Semantic Embedding for Entity Normalization
Shobeir Fakhraei
Joel Mathew
J. Ambite
124
19
0
19 Nov 2018
Synergistic Drug Combination Prediction by Integrating Multi-omics Data in Deep Learning Models
Tianyu Zhang
Liwei Zhang
Philip R. O. Payne
Fuhai Li
92
115
0
16 Nov 2018
Survey of Computational Approaches to Lexical Semantic Change
Nina Tahmasebi
L. Borin
Adam Jatowt
218
175
0
15 Nov 2018
Extractive Summary as Discrete Latent Variables
Aran Komatsuzaki
118
3
0
14 Nov 2018
An Introductory Survey on Attention Mechanisms in NLP Problems
Dichao Hu
AIMat
136
269
0
12 Nov 2018
Speech Intention Understanding in a Head-final Language: A Disambiguation Utilizing Intonation-dependency
Won Ik Cho
Hyeon Seung Lee
J. Yoon
Seokhwan Kim
N. Kim
312
5
0
10 Nov 2018
Densely Connected Attention Propagation for Reading Comprehension
Yi Tay
Anh Tuan Luu
S. Hui
Jian Su
178
48
0
10 Nov 2018
Effective Representation for Easy-First Dependency Parsing
Zuchao Li
Amir Vakili
Ali Montazer
166
0
0
08 Nov 2018
Language GANs Falling Short
International Conference on Learning Representations (ICLR), 2018
Massimo Caccia
Lucas Caccia
W. Fedus
Hugo Larochelle
Joelle Pineau
Laurent Charlin
522
230
0
06 Nov 2018
How Reasonable are Common-Sense Reasoning Tasks: A Case-Study on the Winograd Schema Challenge and SWAG
P. Trichelair
Ali Emami
Adam Trischler
Kaheer Suleman
Jackie C.K. Cheung
LRM
184
44
0
05 Nov 2018
Semantic Role Labeling for Knowledge Graph Extraction from Text
Mehwish Alam
Aldo Gangemi
Valentina Presutti
Diego Reforgiato Recupero
68
10
0
04 Nov 2018
Elastic CRFs for Open-ontology Slot Filling
Yinpei Dai
Yichi Zhang
Hong Liu
Zhijian Ou
Yanmeng Wang
Junlan Feng
199
2
0
04 Nov 2018
Learning to Rank Query Graphs for Complex Question Answering over Knowledge Graphs
Gaurav Maheshwari
Priyansh Trivedi
Denis Lukovnikov
Nilesh Chakraborty
Asja Fischer
Jens Lehmann
GNN
167
78
0
02 Nov 2018
Sentence Encoders on STILTs: Supplementary Training on Intermediate Labeled-data Tasks
Jason Phang
Thibault Févry
Samuel R. Bowman
324
480
0
02 Nov 2018