BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding (arXiv:1810.04805)

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 15,155 papers shown
DocRED: A Large-Scale Document-Level Relation Extraction Dataset
Yuan Yao
Deming Ye
Peng Li
Xu Han
Yankai Lin
Zhenghao Liu
Zhiyuan Liu
Lixin Huang
Jie Zhou
Maosong Sun
11
441
0
14 Jun 2019
Learning to Ask Unanswerable Questions for Machine Reading Comprehension
Haichao Zhu
Li Dong
Furu Wei
Wenhui Wang
Bing Qin
Ting Liu
RALM
16
31
0
14 Jun 2019
Image Captioning: Transforming Objects into Words
Simão Herdade
Armin Kappeler
K. Boakye
Joao Soares
ViT
28
462
0
14 Jun 2019
Sentiment analysis is not solved! Assessing and probing sentiment classification
Jeremy Barnes
Lilja Øvrelid
Erik Velldal
16
32
0
13 Jun 2019
Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index
Minjoon Seo
Jinhyuk Lee
Tom Kwiatkowski
Ankur P. Parikh
Ali Farhadi
Hannaneh Hajishirzi
RALM
10
153
0
13 Jun 2019
Learning Video Representations using Contrastive Bidirectional Transformer
Chen Sun
Fabien Baradel
Kevin Patrick Murphy
Cordelia Schmid
SSL
ViT
19
133
0
13 Jun 2019
2D Attentional Irregular Scene Text Recognizer
Pengyuan Lyu
Zhicheng Yang
Xinhang Leng
Xiaojun Wu
Ruiyu Li
Xiaoyong Shen
3DV
28
50
0
13 Jun 2019
Lattice Transformer for Speech Translation
Pei Zhang
Boxing Chen
Niyu Ge
Kai Fan
34
48
0
13 Jun 2019
Transfer Learning in Biomedical Natural Language Processing: An Evaluation of BERT and ELMo on Ten Benchmarking Datasets
Yifan Peng
Shankai Yan
Zhiyong Lu
LM&MA
AI4MH
13
830
0
13 Jun 2019
Explore, Propose, and Assemble: An Interpretable Model for Multi-Hop Reading Comprehension
Yichen Jiang
Nitish Joshi
Yen-Chun Chen
Mohit Bansal
RALM
13
39
0
12 Jun 2019
Learning the Graphical Structure of Electronic Health Records with Graph Convolutional Transformer
E. Choi
Zhen Xu
Yujia Li
Michael W. Dusenberry
Gerardo Flores
Yuan Xue
Andrew M. Dai
MedIm
19
238
0
11 Jun 2019
Retrieve, Read, Rerank: Towards End-to-End Multi-Document Reading Comprehension
Minghao Hu
Yuxing Peng
Zhen Huang
Dongsheng Li
RALM
6
58
0
11 Jun 2019
Modeling Sentiment Dependencies with Graph Convolutional Networks for Aspect-level Sentiment Classification
Pinlong Zhao
Linlin Hou
Ou Wu
GNN
29
172
0
11 Jun 2019
Future Data Helps Training: Modeling Future Contexts for Session-based Recommendation
Fajie Yuan
Xiangnan He
Haochuan Jiang
G. Guo
Jian Xiong
Zhezhao Xu
Yilin Xiong
AI4TS
13
102
0
11 Jun 2019
Lightweight and Efficient Neural Natural Language Processing with Quaternion Networks
Yi Tay
Aston Zhang
Anh Tuan Luu
J. Rao
Shuai Zhang
Shuohang Wang
Jie Fu
S. Hui
23
55
0
11 Jun 2019
What Does BERT Look At? An Analysis of BERT's Attention
Kevin Clark
Urvashi Khandelwal
Omer Levy
Christopher D. Manning
MILM
48
1,580
0
11 Jun 2019
GLTR: Statistical Detection and Visualization of Generated Text
Sebastian Gehrmann
Hendrik Strobelt
Alexander M. Rush
DeLMO
15
510
0
10 Jun 2019
Open-Domain Targeted Sentiment Analysis via Span-Based Extraction and Classification
Minghao Hu
Yuxing Peng
Zhen Huang
Dongsheng Li
Yiwei Lv
19
188
0
10 Jun 2019
Gendered Pronoun Resolution using BERT and an extractive question answering formulation
Rakesh Chada
FaML
14
10
0
09 Jun 2019
Leveraging BERT for Extractive Text Summarization on Lectures
Derek Miller
16
241
0
07 Jun 2019
Analyzing the Structure of Attention in a Transformer Language Model
Jesse Vig
Yonatan Belinkov
19
357
0
07 Jun 2019
From Caesar Cipher to Unsupervised Learning: A New Method for Classifier Parameter Estimation
Yu Liu
Li Deng
Jianshu Chen
C. Chen
SSL
18
0
0
06 Jun 2019
Cross-Lingual Syntactic Transfer through Unsupervised Adaptation of Invertible Projections
Junxian He
Zhisong Zhang
Taylor Berg-Kirkpatrick
Graham Neubig
25
21
0
06 Jun 2019
Unsupervised Pivot Translation for Distant Languages
Yichong Leng
Xu Tan
Tao Qin
Xiang-Yang Li
Tie-Yan Liu
28
29
0
06 Jun 2019
Extracting Symptoms and their Status from Clinical Conversations
Nan Du
Kai Chen
Anjuli Kannan
Linh Tran
Yuhui Chen
Izhak Shafran
12
68
0
05 Jun 2019
Large-Scale Multi-Label Text Classification on EU Legislation
Ilias Chalkidis
Manos Fergadiotis
Prodromos Malakasiotis
Ion Androutsopoulos
AILaw
11
212
0
05 Jun 2019
The Secrets of Machine Learning: Ten Things You Wish You Had Known Earlier to be More Effective at Data Analysis
Cynthia Rudin
David Carlson
HAI
17
34
0
04 Jun 2019
KERMIT: Generative Insertion-Based Modeling for Sequences
William Chan
Nikita Kitaev
Kelvin Guu
Mitchell Stern
Jakob Uszkoreit
VLM
23
65
0
04 Jun 2019
Sequence Tagging with Contextual and Non-Contextual Subword Representations: A Multilingual Evaluation
Benjamin Heinzerling
Michael Strube
6
36
0
04 Jun 2019
How multilingual is Multilingual BERT?
Telmo Pires
Eva Schlinger
Dan Garrette
LRM
VLM
57
1,371
0
04 Jun 2019
Converse Attention Knowledge Transfer for Low-Resource Named Entity Recognition
Shengfei Lyu
Linghao Sun
Huixiong Yi
Yong-jin Liu
Huanhuan Chen
Chun Miao
16
0
0
04 Jun 2019
Detecting Local Insights from Global Labels: Supervised & Zero-Shot Sequence Labeling via a Convolutional Decomposition
A. Schmaltz
19
8
0
04 Jun 2019
Episodic Memory in Lifelong Language Learning
Cyprien de Masson d'Autume
Sebastian Ruder
Lingpeng Kong
Dani Yogatama
CLL
KELM
25
280
0
03 Jun 2019
Learning Representations by Maximizing Mutual Information Across Views
Philip Bachman
R. Devon Hjelm
William Buchwalter
SSL
40
1,452
0
03 Jun 2019
BAYHENN: Combining Bayesian Deep Learning and Homomorphic Encryption for Secure DNN Inference
Peichen Xie
Bingzhe Wu
Guangyu Sun
BDL
FedML
11
33
0
03 Jun 2019
Efficient 8-Bit Quantization of Transformer Neural Machine Language Translation Model
Aishwarya Bhandare
Vamsi Sripathi
Deepthi Karkada
Vivek V. Menon
Sun Choi
Kushal Datta
V. Saletore
MQ
22
129
0
03 Jun 2019
Pretraining Methods for Dialog Context Representation Learning
Shikib Mehri
E. Razumovskaia
Tiancheng Zhao
M. Eskénazi
14
84
0
02 Jun 2019
Adversarial Generation and Encoding of Nested Texts
A. Rozental
GAN
11
0
0
01 Jun 2019
Investigating an Effective Character-level Embedding in Korean Sentence Classification
Won Ik Cho
Seokhwan Kim
N. Kim
23
8
0
31 May 2019
Fine-Grained Spoiler Detection from Large-Scale Review Corpora
Mengting Wan
Rishabh Misra
Ndapandula Nakashole
Julian McAuley
9
129
0
31 May 2019
A Lightweight Recurrent Network for Sequence Modeling
Biao Zhang
Rico Sennrich
27
7
0
30 May 2019
Unbabel's Submission to the WMT2019 APE Shared Task: BERT-based Encoder-Decoder for Automatic Post-Editing
António Vilarinho Lopes
M. Amin Farajian
Gonçalo M. Correia
Jonay Trénous
André F. T. Martins
31
35
0
30 May 2019
A Simple but Effective Method to Incorporate Multi-turn Context with BERT for Conversational Machine Comprehension
Yasuhito Ohsugi
Itsumi Saito
Kyosuke Nishida
Hisako Asano
J. Tomita
25
43
0
30 May 2019
A Generalized Framework of Sequence Generation with Application to Undirected Sequence Models
Elman Mansimov
Alex Jinpeng Wang
Sean Welleck
Kyunghyun Cho
AIMat
20
46
0
29 May 2019
Unsupervised Paraphrasing without Translation
Aurko Roy
David Grangier
BDL
LRM
11
61
0
29 May 2019
Adapting Text Embeddings for Causal Inference
Victor Veitch
Dhanya Sridhar
David M. Blei
CML
9
21
0
29 May 2019
Defending Against Neural Fake News
Rowan Zellers
Ari Holtzman
Hannah Rashkin
Yonatan Bisk
Ali Farhadi
Franziska Roesner
Yejin Choi
AAML
17
996
0
29 May 2019
Interpreting and improving natural-language processing (in machines) with natural language-processing (in the brain)
Mariya Toneva
Leila Wehbe
MILM
AI4CE
31
219
0
28 May 2019
Combating Adversarial Misspellings with Robust Word Recognition
Danish Pruthi
Bhuwan Dhingra
Zachary Chase Lipton
8
300
0
27 May 2019
STAR-GCN: Stacked and Reconstructed Graph Convolutional Networks for Recommender Systems
Jiani Zhang
Xingjian Shi
Shenglin Zhao
Irwin King
29
225
0
27 May 2019