ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg
ArXivPDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 15,000 papers shown
Title
Adversarial Analysis of Natural Language Inference Systems
Adversarial Analysis of Natural Language Inference Systems
Tiffany Chien
Jugal Kalita
AAML
34
12
0
07 Dec 2019
Weak Supervision helps Emergence of Word-Object Alignment and improves
  Vision-Language Tasks
Weak Supervision helps Emergence of Word-Object Alignment and improves Vision-Language Tasks
Corentin Kervadec
G. Antipov
M. Baccouche
Christian Wolf
19
15
0
06 Dec 2019
Reading the Manual: Event Extraction as Definition Comprehension
Reading the Manual: Event Extraction as Definition Comprehension
Yunmo Chen
Tongfei Chen
Seth Ebner
Aaron Steven White
Benjamin Van Durme
21
63
0
03 Dec 2019
An Annotated Dataset of Coreference in English Literature
An Annotated Dataset of Coreference in English Literature
David Bamman
Olivia Lewke
A. Mansoor
6
105
0
03 Dec 2019
Dynamic Prosody Generation for Speech Synthesis using Linguistics-Driven
  Acoustic Embedding Selection
Dynamic Prosody Generation for Speech Synthesis using Linguistics-Driven Acoustic Embedding Selection
Shubhi Tyagi
M. Nicolis
Jonas Rohnke
Thomas Drugman
Jaime Lorenzo-Trueba
26
32
0
02 Dec 2019
Knowledge Infused Learning (K-IL): Towards Deep Incorporation of
  Knowledge in Deep Learning
Knowledge Infused Learning (K-IL): Towards Deep Incorporation of Knowledge in Deep Learning
Ugur Kursuncu
Manas Gaur
A. Sheth
NAI
18
57
0
01 Dec 2019
Deconstructing and reconstructing word embedding algorithms
Deconstructing and reconstructing word embedding algorithms
Edward Newell
Kian Kenyon-Dean
Jackie C.K. Cheung
31
4
0
29 Nov 2019
Blockwisely Supervised Neural Architecture Search with Knowledge
  Distillation
Blockwisely Supervised Neural Architecture Search with Knowledge Distillation
Changlin Li
Jiefeng Peng
Liuchun Yuan
Guangrun Wang
Xiaodan Liang
Liang Lin
Xiaojun Chang
23
179
0
29 Nov 2019
Inducing Relational Knowledge from BERT
Inducing Relational Knowledge from BERT
Zied Bouraoui
Jose Camacho-Collados
Steven Schockaert
21
166
0
28 Nov 2019
End-to-End Trainable Non-Collaborative Dialog System
End-to-End Trainable Non-Collaborative Dialog System
Yu Li
Kun Qian
Weiyan Shi
Zhou Yu
21
45
0
25 Nov 2019
Who did They Respond to? Conversation Structure Modeling using Masked
  Hierarchical Transformer
Who did They Respond to? Conversation Structure Modeling using Masked Hierarchical Transformer
Henghui Zhu
Feng Nan
Zhiguo Wang
Ramesh Nallapati
Bing Xiang
22
39
0
25 Nov 2019
Task-Oriented Dialog Systems that Consider Multiple Appropriate
  Responses under the Same Context
Task-Oriented Dialog Systems that Consider Multiple Appropriate Responses under the Same Context
Yichi Zhang
Zhijian Ou
Zhou Yu
19
182
0
24 Nov 2019
Separate and Attend in Personal Email Search
Separate and Attend in Personal Email Search
Yu Meng
Maryam Karimzadehgan
Honglei Zhuang
Donald Metzler
FedML
109
2
0
21 Nov 2019
Attention-Informed Mixed-Language Training for Zero-shot Cross-lingual
  Task-oriented Dialogue Systems
Attention-Informed Mixed-Language Training for Zero-shot Cross-lingual Task-oriented Dialogue Systems
Zihan Liu
Genta Indra Winata
Zhaojiang Lin
Peng-Tao Xu
Pascale Fung
15
98
0
21 Nov 2019
Generating Interactive Worlds with Text
Generating Interactive Worlds with Text
Angela Fan
Jack Urbanek
Pratik Ringshia
Emily Dinan
Emma Qian
...
Shrimai Prabhumoye
Douwe Kiela
Tim Rocktaschel
Arthur Szlam
Jason Weston
16
27
0
20 Nov 2019
Fine-Tuning by Curriculum Learning for Non-Autoregressive Neural Machine
  Translation
Fine-Tuning by Curriculum Learning for Non-Autoregressive Neural Machine Translation
Junliang Guo
Xu Tan
Linli Xu
Tao Qin
Enhong Chen
Tie-Yan Liu
6
85
0
20 Nov 2019
DermGAN: Synthetic Generation of Clinical Skin Images with Pathology
DermGAN: Synthetic Generation of Clinical Skin Images with Pathology
Amirata Ghorbani
Vivek Natarajan
David Coz
Yuan Liu
GAN
MedIm
19
98
0
20 Nov 2019
Global Greedy Dependency Parsing
Global Greedy Dependency Parsing
Z. Li
Zhao Hai
Kevin Parnow
31
31
0
20 Nov 2019
Towards Lingua Franca Named Entity Recognition with BERT
Towards Lingua Franca Named Entity Recognition with BERT
Taesun Moon
Parul Awasthy
Jian Ni
Radu Florian
14
29
0
19 Nov 2019
Weakly-Supervised Video Moment Retrieval via Semantic Completion Network
Weakly-Supervised Video Moment Retrieval via Semantic Completion Network
Zhijie Lin
Zhou Zhao
Zhu Zhang
Qi. Wang
Huasheng Liu
22
149
0
19 Nov 2019
REFIT: A Unified Watermark Removal Framework For Deep Learning Systems
  With Limited Data
REFIT: A Unified Watermark Removal Framework For Deep Learning Systems With Limited Data
Xinyun Chen
Wenxiao Wang
Chris Bender
Yiming Ding
R. Jia
Bo-wen Li
D. Song
AAML
19
106
0
17 Nov 2019
Understanding and Improving Layer Normalization
Understanding and Improving Layer Normalization
Jingjing Xu
Xu Sun
Zhiyuan Zhang
Guangxiang Zhao
Junyang Lin
FAtt
18
338
0
16 Nov 2019
Evaluating robustness of language models for chief complaint extraction
  from patient-generated text
Evaluating robustness of language models for chief complaint extraction from patient-generated text
Ilya Valmianski
Caleb Goodwin
Ian M. Finn
Naqi Khan
D. Zisook
22
6
0
15 Nov 2019
Sequential Recommendation with Relation-Aware Kernelized Self-Attention
Sequential Recommendation with Relation-Aware Kernelized Self-Attention
Mingi Ji
Weonyoung Joo
Kyungwoo Song
Yoon-Yeong Kim
Il-Chul Moon
AI4TS
16
30
0
15 Nov 2019
Sato: Contextual Semantic Type Detection in Tables
Sato: Contextual Semantic Type Detection in Tables
Dan Zhang
Yoshihiko Suhara
Jinfeng Li
Madelon Hulsebos
cCaugatay Demiralp
W. Tan
LMTD
16
15
0
14 Nov 2019
Enhanced Meta-Learning for Cross-lingual Named Entity Recognition with
  Minimal Resources
Enhanced Meta-Learning for Cross-lingual Named Entity Recognition with Minimal Resources
Qianhui Wu
Zijia Lin
Guoxin Wang
Hui Chen
Börje F. Karlsson
Biqing Huang
Chin-Yew Lin
11
68
0
14 Nov 2019
There is Limited Correlation between Coverage and Robustness for Deep
  Neural Networks
There is Limited Correlation between Coverage and Robustness for Deep Neural Networks
Yizhen Dong
Peixin Zhang
Jingyi Wang
Shuang Liu
Jun Sun
Jianye Hao
Xinyu Wang
Li Wang
J. Dong
Ting Dai
OOD
AAML
16
32
0
14 Nov 2019
Generating Persona Consistent Dialogues by Exploiting Natural Language
  Inference
Generating Persona Consistent Dialogues by Exploiting Natural Language Inference
Haoyu Song
Weinan Zhang
Jingwen Hu
Ting Liu
17
73
0
14 Nov 2019
What do you mean, BERT? Assessing BERT as a Distributional Semantics
  Model
What do you mean, BERT? Assessing BERT as a Distributional Semantics Model
Timothee Mickus
Denis Paperno
Mathieu Constant
Kees van Deemter
18
45
0
13 Nov 2019
Momentum Contrast for Unsupervised Visual Representation Learning
Momentum Contrast for Unsupervised Visual Representation Learning
Kaiming He
Haoqi Fan
Yuxin Wu
Saining Xie
Ross B. Girshick
SSL
48
11,842
0
13 Nov 2019
Adapting and evaluating a deep learning language model for clinical
  why-question answering
Adapting and evaluating a deep learning language model for clinical why-question answering
Andrew Wen
Mohamed Y. Elwazir
Sungrim Moon
Jungwei Fan
LM&MA
16
31
0
13 Nov 2019
Neural Duplicate Question Detection without Labeled Training Data
Neural Duplicate Question Detection without Labeled Training Data
Andreas Rucklé
N. Moosavi
Iryna Gurevych
OOD
AAML
11
11
0
13 Nov 2019
Compressive Transformers for Long-Range Sequence Modelling
Compressive Transformers for Long-Range Sequence Modelling
Jack W. Rae
Anna Potapenko
Siddhant M. Jayakumar
Timothy Lillicrap
RALM
VLM
KELM
11
620
0
13 Nov 2019
Learning Multi-Sense Word Distributions using Approximate
  Kullback-Leibler Divergence
Learning Multi-Sense Word Distributions using Approximate Kullback-Leibler Divergence
P. Jayashree
Ballijepalli Shreya
P. K. Srijith
21
2
0
12 Nov 2019
A Syntax-aware Multi-task Learning Framework for Chinese Semantic Role
  Labeling
A Syntax-aware Multi-task Learning Framework for Chinese Semantic Role Labeling
Qingrong Xia
Zhenghua Li
Min Zhang
25
17
0
12 Nov 2019
Understanding BERT performance in propaganda analysis
Understanding BERT performance in propaganda analysis
Yiqing Hua
14
16
0
11 Nov 2019
Attending to Entities for Better Text Understanding
Attending to Entities for Better Text Understanding
Pengxiang Cheng
K. Erk
LRM
19
37
0
11 Nov 2019
Deep Contextualized Self-training for Low Resource Dependency Parsing
Deep Contextualized Self-training for Low Resource Dependency Parsing
Guy Rotman
Roi Reichart
13
50
0
11 Nov 2019
A hybrid text normalization system using multi-head self-attention for
  mandarin
A hybrid text normalization system using multi-head self-attention for mandarin
Junhui Zhang
Junjie Pan
Xiang Yin
Chen Li
Shichao Liu
Yang Zhang
Yuxuan Wang
Zejun Ma
AI4CE
11
15
0
11 Nov 2019
Improving BERT Fine-tuning with Embedding Normalization
Wenxuan Zhou
Junyi Du
Xiang Ren
11
6
0
10 Nov 2019
Can Monolingual Pretrained Models Help Cross-Lingual Classification?
Can Monolingual Pretrained Models Help Cross-Lingual Classification?
Zewen Chi
Li Dong
Furu Wei
Xian-Ling Mao
Heyan Huang
LRM
VLM
30
13
0
10 Nov 2019
Effectiveness of self-supervised pre-training for speech recognition
Effectiveness of self-supervised pre-training for speech recognition
Alexei Baevski
Michael Auli
Abdel-rahman Mohamed
SSL
19
147
0
10 Nov 2019
Efficient Dialogue State Tracking by Selectively Overwriting Memory
Efficient Dialogue State Tracking by Selectively Overwriting Memory
Sungdong Kim
Sohee Yang
Gyuwan Kim
Sang-Woo Lee
18
195
0
10 Nov 2019
CamemBERT: a Tasty French Language Model
CamemBERT: a Tasty French Language Model
Louis Martin
Benjamin Muller
Pedro Ortiz Suarez
Yoann Dupont
Laurent Romary
Eric Villemonte de la Clergerie
Djamé Seddah
Benoît Sagot
16
956
0
10 Nov 2019
Pre-train and Plug-in: Flexible Conditional Text Generation with
  Variational Auto-Encoders
Pre-train and Plug-in: Flexible Conditional Text Generation with Variational Auto-Encoders
Yu Duan
Canwen Xu
Jiaxin Pei
Jialong Han
Chenliang Li
11
42
0
10 Nov 2019
Dynamic Neuro-Symbolic Knowledge Graph Construction for Zero-shot
  Commonsense Question Answering
Dynamic Neuro-Symbolic Knowledge Graph Construction for Zero-shot Commonsense Question Answering
Antoine Bosselut
Ronan Le Bras
Yejin Choi
NAI
14
41
0
10 Nov 2019
Rethinking Self-Attention: Towards Interpretability in Neural Parsing
Rethinking Self-Attention: Towards Interpretability in Neural Parsing
Khalil Mrini
Franck Dernoncourt
Quan Tran
Trung Bui
W. Chang
Ndapandula Nakashole
MILM
LRM
8
29
0
10 Nov 2019
Knowledge Guided Named Entity Recognition for BioMedical Text
Knowledge Guided Named Entity Recognition for BioMedical Text
Pratyay Banerjee
Kuntal Kumar Pal
M. Devarakonda
Chitta Baral
19
0
0
10 Nov 2019
Improving Transformer Models by Reordering their Sublayers
Improving Transformer Models by Reordering their Sublayers
Ofir Press
Noah A. Smith
Omer Levy
11
87
0
10 Nov 2019
Syntax-Infused Transformer and BERT models for Machine Translation and
  Natural Language Understanding
Syntax-Infused Transformer and BERT models for Machine Translation and Natural Language Understanding
Dhanasekar Sundararaman
Vivek Subramanian
Guoyin Wang
Shijing Si
Dinghan Shen
Dong Wang
Lawrence Carin
19
40
0
10 Nov 2019
Previous
123...289290291...298299300
Next