ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg
ArXivPDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 14,229 papers shown
Title
Neural Databases
Neural Databases
James Thorne
Majid Yazdani
Marzieh Saeidi
Fabrizio Silvestri
Sebastian Riedel
A. Halevy
NAI
26
9
0
14 Oct 2020
CoRel: Seed-Guided Topical Taxonomy Construction by Concept Learning and
  Relation Transferring
CoRel: Seed-Guided Topical Taxonomy Construction by Concept Learning and Relation Transferring
Jiaxin Huang
Yiqing Xie
Yu Meng
Yunyi Zhang
Jiawei Han
39
36
0
13 Oct 2020
Controlling the Interaction Between Generation and Inference in
  Semi-Supervised Variational Autoencoders Using Importance Weighting
Controlling the Interaction Between Generation and Inference in Semi-Supervised Variational Autoencoders Using Importance Weighting
G. Felhi
Joseph Leroux
Djamé Seddah
BDL
16
1
0
13 Oct 2020
Pretrained Transformers for Text Ranking: BERT and Beyond
Pretrained Transformers for Text Ranking: BERT and Beyond
Jimmy J. Lin
Rodrigo Nogueira
Andrew Yates
VLM
219
608
0
13 Oct 2020
CAPT: Contrastive Pre-Training for Learning Denoised Sequence
  Representations
CAPT: Contrastive Pre-Training for Learning Denoised Sequence Representations
Fuli Luo
Pengcheng Yang
Shicheng Li
Xuancheng Ren
Xu Sun
VLM
SSL
13
16
0
13 Oct 2020
MixCo: Mix-up Contrastive Learning for Visual Representation
MixCo: Mix-up Contrastive Learning for Visual Representation
Sungnyun Kim
Gihun Lee
Sangmin Bae
Seyoung Yun
SSL
106
80
0
13 Oct 2020
X-FACTR: Multilingual Factual Knowledge Retrieval from Pretrained
  Language Models
X-FACTR: Multilingual Factual Knowledge Retrieval from Pretrained Language Models
Zhengbao Jiang
Antonios Anastasopoulos
Jun Araki
Haibo Ding
Graham Neubig
HILM
KELM
13
136
0
13 Oct 2020
Are Some Words Worth More than Others?
Are Some Words Worth More than Others?
Shiran Dudy
Steven Bedrick
13
14
0
12 Oct 2020
Improving Text Generation with Student-Forcing Optimal Transport
Improving Text Generation with Student-Forcing Optimal Transport
Guoyin Wang
Chunyuan Li
Jianqiao Li
Hao Fu
Yuh-Chen Lin
...
Ruiyi Zhang
Wenlin Wang
Dinghan Shen
Qian Yang
Lawrence Carin
OT
22
17
0
12 Oct 2020
HUJI-KU at MRP~2020: Two Transition-based Neural Parsers
HUJI-KU at MRP~2020: Two Transition-based Neural Parsers
Ofir Arviv
Ruixiang Cui
Daniel Hershcovich
34
10
0
12 Oct 2020
Reformulating Unsupervised Style Transfer as Paraphrase Generation
Reformulating Unsupervised Style Transfer as Paraphrase Generation
Kalpesh Krishna
John Wieting
Mohit Iyyer
19
237
0
12 Oct 2020
Improving Compositional Generalization in Semantic Parsing
Improving Compositional Generalization in Semantic Parsing
I. Oren
Jonathan Herzig
Nitish Gupta
Matt Gardner
Jonathan Berant
21
63
0
12 Oct 2020
Predicting Clinical Trial Results by Implicit Evidence Integration
Predicting Clinical Trial Results by Implicit Evidence Integration
Qiao Jin
Chuanqi Tan
Mosha Chen
Xiaozhong Liu
Songfang Huang
10
8
0
12 Oct 2020
Counterfactual Variable Control for Robust and Interpretable Question
  Answering
Counterfactual Variable Control for Robust and Interpretable Question Answering
S. Yu
Yulei Niu
Shuohang Wang
Jing Jiang
Qianru Sun
AAML
OOD
40
9
0
12 Oct 2020
Neural, Symbolic and Neural-Symbolic Reasoning on Knowledge Graphs
Neural, Symbolic and Neural-Symbolic Reasoning on Knowledge Graphs
Jing Zhang
Bo Chen
Lingxi Zhang
Xirui Ke
Haipeng Ding
NAI
23
3
0
12 Oct 2020
Quantitative Argument Summarization and Beyond: Cross-Domain Key Point
  Analysis
Quantitative Argument Summarization and Beyond: Cross-Domain Key Point Analysis
Roy Bar-Haim
Yoav Kantor
Lilach Eden
Roni Friedman
Dan Lahav
Noam Slonim
29
43
0
11 Oct 2020
We Can Detect Your Bias: Predicting the Political Ideology of News
  Articles
We Can Detect Your Bias: Predicting the Political Ideology of News Articles
R. Baly
Giovanni Da San Martino
James R. Glass
Preslav Nakov
9
143
0
11 Oct 2020
SMYRF: Efficient Attention using Asymmetric Clustering
SMYRF: Efficient Attention using Asymmetric Clustering
Giannis Daras
Nikita Kitaev
Augustus Odena
A. Dimakis
23
44
0
11 Oct 2020
Few-shot Learning for Multi-label Intent Detection
Few-shot Learning for Multi-label Intent Detection
Yutai Hou
Y. Lai
Yushan Wu
Wanxiang Che
Ting Liu
VLM
13
48
0
11 Oct 2020
End to End Binarized Neural Networks for Text Classification
End to End Binarized Neural Networks for Text Classification
Harshil Jain
Akshat Agarwal
Kumar Shridhar
Denis Kleyko
MQ
15
26
0
11 Oct 2020
PHICON: Improving Generalization of Clinical Text De-identification
  Models via Data Augmentation
PHICON: Improving Generalization of Clinical Text De-identification Models via Data Augmentation
Xiang Yue
Shuang Zhou
13
10
0
11 Oct 2020
Plan ahead: Self-Supervised Text Planning for Paragraph Completion Task
Plan ahead: Self-Supervised Text Planning for Paragraph Completion Task
Dongyeop Kang
Eduard H. Hovy
LRM
34
24
0
11 Oct 2020
CDEvalSumm: An Empirical Study of Cross-Dataset Evaluation for Neural
  Summarization Systems
CDEvalSumm: An Empirical Study of Cross-Dataset Evaluation for Neural Summarization Systems
Yiran Chen
Pengfei Liu
Ming Zhong
Zi-Yi Dou
Danqing Wang
Xipeng Qiu
Xuanjing Huang
ELM
25
24
0
11 Oct 2020
SJTU-NICT's Supervised and Unsupervised Neural Machine Translation
  Systems for the WMT20 News Translation Task
SJTU-NICT's Supervised and Unsupervised Neural Machine Translation Systems for the WMT20 News Translation Task
Z. Li
Hai Zhao
Rui Wang
Kehai Chen
Masao Utiyama
Eiichiro Sumita
29
15
0
11 Oct 2020
Leveraging Spatial Information in Radiology Reports for Ischemic Stroke
  Phenotyping
Leveraging Spatial Information in Radiology Reports for Ischemic Stroke Phenotyping
Surabhi Datta
S. Khanpara
R. Riascos
Kirk Roberts
23
0
0
10 Oct 2020
Structural Knowledge Distillation: Tractably Distilling Information for
  Structured Predictor
Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor
Xinyu Wang
Yong-jia Jiang
Zhaohui Yan
Zixia Jia
Nguyen Bach
Tao Wang
Zhongqiang Huang
Fei Huang
Kewei Tu
26
10
0
10 Oct 2020
Automated Concatenation of Embeddings for Structured Prediction
Automated Concatenation of Embeddings for Structured Prediction
Xinyu Wang
Yong-jia Jiang
Nguyen Bach
Tao Wang
Zhongqiang Huang
Fei Huang
Kewei Tu
35
172
0
10 Oct 2020
A Tensor Compiler for Unified Machine Learning Prediction Serving
A Tensor Compiler for Unified Machine Learning Prediction Serving
Supun Nakandala Karla Saur
Karla Saur
Gyeong-In Yu
Konstantinos Karanasos
Carlo Curino
Markus Weimer
Matteo Interlandi
14
53
0
09 Oct 2020
ChrEn: Cherokee-English Machine Translation for Endangered Language
  Revitalization
ChrEn: Cherokee-English Machine Translation for Endangered Language Revitalization
Shiyue Zhang
B. Frey
Mohit Bansal
28
28
0
09 Oct 2020
High-order Semantic Role Labeling
High-order Semantic Role Labeling
Z. Li
Hai Zhao
Rui-cang Wang
Kevin Parnow
16
29
0
09 Oct 2020
Causal Feature Selection with Dimension Reduction for Interpretable Text
  Classification
Causal Feature Selection with Dimension Reduction for Interpretable Text Classification
Guohou Shan
James R. Foulds
Shimei Pan
CML
OOD
15
0
0
09 Oct 2020
Style Attuned Pre-training and Parameter Efficient Fine-tuning for
  Spoken Language Understanding
Style Attuned Pre-training and Parameter Efficient Fine-tuning for Spoken Language Understanding
Jin Cao
Jun Wang
Wael Hamza
Kelly Vanee
Shang-Wen Li
17
10
0
09 Oct 2020
MIA-Prognosis: A Deep Learning Framework to Predict Therapy Response
MIA-Prognosis: A Deep Learning Framework to Predict Therapy Response
Jiancheng Yang
Jiajun Chen
Kaiming Kuang
Tiancheng Lin
Junjun He
Bingbing Ni
19
8
0
08 Oct 2020
Precise Task Formalization Matters in Winograd Schema Evaluations
Precise Task Formalization Matters in Winograd Schema Evaluations
Haokun Liu
William Huang
Dhara Mungra
Samuel R. Bowman
ReLM
17
12
0
08 Oct 2020
Two are Better than One: Joint Entity and Relation Extraction with
  Table-Sequence Encoders
Two are Better than One: Joint Entity and Relation Extraction with Table-Sequence Encoders
Jue Wang
Wei Lu
18
224
0
08 Oct 2020
Text-based RL Agents with Commonsense Knowledge: New Challenges,
  Environments and Baselines
Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines
K. Murugesan
Mattia Atzeni
Pavan Kapanipathi
Pushkar Shukla
Sadhana Kumaravel
Gerald Tesauro
Kartik Talamadupula
Mrinmaya Sachan
Murray Campbell
LM&Ro
LLMAG
OffRL
24
54
0
08 Oct 2020
Infusing Disease Knowledge into BERT for Health Question Answering,
  Medical Inference and Disease Name Recognition
Infusing Disease Knowledge into BERT for Health Question Answering, Medical Inference and Disease Name Recognition
Yun He
Ziwei Zhu
Yin Zhang
Qin Chen
James Caverlee
AI4MH
28
108
0
08 Oct 2020
Zero-Shot Stance Detection: A Dataset and Model using Generalized Topic
  Representations
Zero-Shot Stance Detection: A Dataset and Model using Generalized Topic Representations
Emily Allaway
Kathleen McKeown
11
177
0
07 Oct 2020
Representing Point Clouds with Generative Conditional Invertible Flow
  Networks
Representing Point Clouds with Generative Conditional Invertible Flow Networks
Michal Stypulkowski
Kacper Kania
M. Zamorski
Maciej Ziȩba
Tomasz Trzciñski
J. Chorowski
3DPC
16
4
0
07 Oct 2020
Low-Resource Domain Adaptation for Compositional Task-Oriented Semantic
  Parsing
Low-Resource Domain Adaptation for Compositional Task-Oriented Semantic Parsing
Xilun Chen
Asish Ghoshal
Yashar Mehdad
Luke Zettlemoyer
S. Gupta
22
89
0
07 Oct 2020
Why do you think that? Exploring Faithful Sentence-Level Rationales
  Without Supervision
Why do you think that? Exploring Faithful Sentence-Level Rationales Without Supervision
Max Glockner
Ivan Habernal
Iryna Gurevych
LRM
14
25
0
07 Oct 2020
Improving the Efficiency of Grammatical Error Correction with Erroneous
  Span Detection and Correction
Improving the Efficiency of Grammatical Error Correction with Erroneous Span Detection and Correction
M. Chen
Tao Ge
Xingxing Zhang
Furu Wei
M. Zhou
6
46
0
07 Oct 2020
Transfer Learning and Distant Supervision for Multilingual Transformer
  Models: A Study on African Languages
Transfer Learning and Distant Supervision for Multilingual Transformer Models: A Study on African Languages
Michael A. Hedderich
David Ifeoluwa Adelani
D. Zhu
Jesujoba Oluwadara Alabi
Udia Markus
Dietrich Klakow
17
71
0
07 Oct 2020
Representation Learning for Sequence Data with Deep Autoencoding
  Predictive Components
Representation Learning for Sequence Data with Deep Autoencoding Predictive Components
Junwen Bai
Weiran Wang
Yingbo Zhou
Caiming Xiong
SSL
AI4TS
18
12
0
07 Oct 2020
Beyond [CLS] through Ranking by Generation
Beyond [CLS] through Ranking by Generation
Cicero Nogueira dos Santos
Xiaofei Ma
Ramesh Nallapati
Zhiheng Huang
Bing Xiang
RALM
16
30
0
06 Oct 2020
Plug and Play Autoencoders for Conditional Text Generation
Plug and Play Autoencoders for Conditional Text Generation
Florian Mai
Nikolaos Pappas
Ivan Montero
Noah A. Smith
U. Washington
14
36
0
06 Oct 2020
Keep CALM and Explore: Language Models for Action Generation in
  Text-based Games
Keep CALM and Explore: Language Models for Action Generation in Text-based Games
Shunyu Yao
Rohan Rao
Matthew J. Hausknecht
Karthik Narasimhan
LLMAG
LM&Ro
11
126
0
06 Oct 2020
SlotRefine: A Fast Non-Autoregressive Model for Joint Intent Detection
  and Slot Filling
SlotRefine: A Fast Non-Autoregressive Model for Joint Intent Detection and Slot Filling
Di Wu
Liang Ding
Fan Lu
Jian Xie
VLM
BDL
21
80
0
06 Oct 2020
On the Sub-Layer Functionalities of Transformer Decoder
On the Sub-Layer Functionalities of Transformer Decoder
Yilin Yang
Longyue Wang
Shuming Shi
Prasad Tadepalli
Stefan Lee
Zhaopeng Tu
24
27
0
06 Oct 2020
The Multilingual Amazon Reviews Corpus
The Multilingual Amazon Reviews Corpus
Phillip Keung
Y. Lu
György Szarvas
Noah A. Smith
17
190
0
06 Oct 2020
Previous
123...259260261...283284285
Next