ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg
ArXivPDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 14,511 papers shown
Title
Multi-Head Self-Attention with Role-Guided Masks
Multi-Head Self-Attention with Role-Guided Masks
Dongsheng Wang
Casper Hansen
Lucas Chaves Lima
Christian B. Hansen
Maria Maistro
J. Simonsen
Christina Lioma
21
1
0
22 Dec 2020
Graph-Evolving Meta-Learning for Low-Resource Medical Dialogue
  Generation
Graph-Evolving Meta-Learning for Low-Resource Medical Dialogue Generation
Shuai Lin
Pan Zhou
Xiaodan Liang
Jianheng Tang
Ruihui Zhao
Ziliang Chen
Liang Lin
MedIm
20
53
0
22 Dec 2020
A Hierarchical Reasoning Graph Neural Network for The Automatic Scoring
  of Answer Transcriptions in Video Job Interviews
A Hierarchical Reasoning Graph Neural Network for The Automatic Scoring of Answer Transcriptions in Video Job Interviews
Kai Chen
M. Niu
Qingcai Chen
18
5
0
22 Dec 2020
Recognizing Emotion Cause in Conversations
Recognizing Emotion Cause in Conversations
Soujanya Poria
Navonil Majumder
Devamanyu Hazarika
Deepanway Ghosal
Rishabh Bhardwaj
...
Romila Ghosh
Abhinaba Roy
Niyati Chhaya
Alexander Gelbukh
Rada Mihalcea
43
123
0
22 Dec 2020
SChuBERT: Scholarly Document Chunks with BERT-encoding boost Citation
  Count Prediction
SChuBERT: Scholarly Document Chunks with BERT-encoding boost Citation Count Prediction
Thomas van Dongen
Gideon Maillette de Buy Wenniger
Lambert Schomaker
19
24
0
21 Dec 2020
Explaining Black-box Models for Biomedical Text Classification
Explaining Black-box Models for Biomedical Text Classification
M. Moradi
Matthias Samwald
28
21
0
20 Dec 2020
Deep Open Intent Classification with Adaptive Decision Boundary
Deep Open Intent Classification with Adaptive Decision Boundary
Hanlei Zhang
Hua Xu
Ting-En Lin
VLM
19
103
0
18 Dec 2020
Mention Extraction and Linking for SQL Query Generation
Mention Extraction and Linking for SQL Query Generation
Jianqiang Ma
Zeyu Yan
Shuai Pang
Yang Zhang
Jianping Shen
24
29
0
18 Dec 2020
NeurST: Neural Speech Translation Toolkit
NeurST: Neural Speech Translation Toolkit
Chengqi Zhao
Mingxuan Wang
Qianqian Dong
Rong Ye
Lei Li
22
32
0
18 Dec 2020
SpAtten: Efficient Sparse Attention Architecture with Cascade Token and
  Head Pruning
SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
Hanrui Wang
Zhekai Zhang
Song Han
20
373
0
17 Dec 2020
SceneFormer: Indoor Scene Generation with Transformers
SceneFormer: Indoor Scene Generation with Transformers
Xinpeng Wang
Chandan Yeshwanth
Matthias Nießner
ViT
3DPC
18
147
0
17 Dec 2020
End-to-End Human Pose and Mesh Reconstruction with Transformers
End-to-End Human Pose and Mesh Reconstruction with Transformers
Kevin Qinghong Lin
Lijuan Wang
Zicheng Liu
ViT
34
613
0
17 Dec 2020
MELINDA: A Multimodal Dataset for Biomedical Experiment Method
  Classification
MELINDA: A Multimodal Dataset for Biomedical Experiment Method Classification
Te-Lin Wu
Shikhar Singh
S. Paul
Gully A. Burns
Nanyun Peng
22
18
0
16 Dec 2020
You Are What You Tweet: Profiling Users by Past Tweets to Improve Hate
  Speech Detection
You Are What You Tweet: Profiling Users by Past Tweets to Improve Hate Speech Detection
Prateek Chaudhry
Matthew Lease
25
7
0
16 Dec 2020
Discovering New Intents with Deep Aligned Clustering
Discovering New Intents with Deep Aligned Clustering
Hanlei Zhang
Hua Xu
Ting-En Lin
Rui Lv
20
116
0
16 Dec 2020
Costs to Consider in Adopting NLP for Your Business
Costs to Consider in Adopting NLP for Your Business
Made Nindyatama Nityasya
Haryo Akbarianto Wibowo
Radityo Eko Prasojo
Alham Fikri Aji
VLM
16
3
0
16 Dec 2020
R$^2$-Net: Relation of Relation Learning Network for Sentence Semantic
  Matching
R2^22-Net: Relation of Relation Learning Network for Sentence Semantic Matching
Kun Zhang
Le Wu
Guangyi Lv
Meng Wang
Enhong Chen
Shulan Ruan
25
20
0
16 Dec 2020
Multilingual Evidence Retrieval and Fact Verification to Combat Global
  Disinformation: The Power of Polyglotism
Multilingual Evidence Retrieval and Fact Verification to Combat Global Disinformation: The Power of Polyglotism
Denisa A.O. Roberts
40
3
0
16 Dec 2020
Graph Neural Networks: Taxonomy, Advances and Trends
Graph Neural Networks: Taxonomy, Advances and Trends
Yu Zhou
Haixia Zheng
Xin Huang
Shufeng Hao
Dengao Li
Jumin Zhao
AI4TS
25
115
0
16 Dec 2020
DialogXL: All-in-One XLNet for Multi-Party Conversation Emotion
  Recognition
DialogXL: All-in-One XLNet for Multi-Party Conversation Emotion Recognition
Weizhou Shen
Junqing Chen
Xiaojun Quan
Zhixiang Xie
6
199
0
16 Dec 2020
Trex: Learning Execution Semantics from Micro-Traces for Binary
  Similarity
Trex: Learning Execution Semantics from Micro-Traces for Binary Similarity
Kexin Pei
Zhou Xuan
Junfeng Yang
Suman Jana
Baishakhi Ray
19
88
0
16 Dec 2020
Learning to Rationalize for Nonmonotonic Reasoning with Distant
  Supervision
Learning to Rationalize for Nonmonotonic Reasoning with Distant Supervision
Faeze Brahman
Vered Shwartz
Rachel Rudinger
Yejin Choi
LRM
6
42
0
14 Dec 2020
Time to Transfer: Predicting and Evaluating Machine-Human Chatting
  Handoff
Time to Transfer: Predicting and Evaluating Machine-Human Chatting Handoff
Jiawei Liu
Zhe Gao
Yangyang Kang
Zhuoren Jiang
Guoxiu He
Changlong Sun
Xiaozhong Liu
Wei Lu
16
12
0
14 Dec 2020
Parameter-Efficient Transfer Learning with Diff Pruning
Parameter-Efficient Transfer Learning with Diff Pruning
Demi Guo
Alexander M. Rush
Yoon Kim
11
383
0
14 Dec 2020
LRC-BERT: Latent-representation Contrastive Knowledge Distillation for
  Natural Language Understanding
LRC-BERT: Latent-representation Contrastive Knowledge Distillation for Natural Language Understanding
Hao Fu
Shaojun Zhou
Qihong Yang
Junjie Tang
Guiquan Liu
Kaikui Liu
Xiaolong Li
34
57
0
14 Dec 2020
Audio Captioning using Pre-Trained Large-Scale Language Model Guided by
  Audio-based Similar Caption Retrieval
Audio Captioning using Pre-Trained Large-Scale Language Model Guided by Audio-based Similar Caption Retrieval
Yuma Koizumi
Yasunori Ohishi
Daisuke Niizumi
Daiki Takeuchi
Masahiro Yasuda
22
40
0
14 Dec 2020
Contrastive Learning with Adversarial Perturbations for Conditional Text
  Generation
Contrastive Learning with Adversarial Perturbations for Conditional Text Generation
Seanie Lee
Dong Bok Lee
Sung Ju Hwang
13
106
0
14 Dec 2020
Deep Portfolio Optimization via Distributional Prediction of Residual
  Factors
Deep Portfolio Optimization via Distributional Prediction of Residual Factors
Kentaro Imajo
Kentaro Minami
Katsuya Ito
Kei Nakagawa
OOD
AIFin
6
27
0
14 Dec 2020
Improving Image Captioning by Leveraging Intra- and Inter-layer Global
  Representation in Transformer Network
Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network
Jiayi Ji
Yunpeng Luo
Xiaoshuai Sun
Fuhai Chen
Gen Luo
Yongjian Wu
Yue Gao
Rongrong Ji
ViT
41
170
0
13 Dec 2020
InferCode: Self-Supervised Learning of Code Representations by
  Predicting Subtrees
InferCode: Self-Supervised Learning of Code Representations by Predicting Subtrees
Nghi D. Q. Bui
Yijun Yu
Lingxiao Jiang
SSL
28
104
0
13 Dec 2020
Open-World Class Discovery with Kernel Networks
Open-World Class Discovery with Kernel Networks
Zifeng Wang
Batool Salehi
Andrey Gritsenko
Kaushik R. Chowdhury
Stratis Ioannidis
Jennifer Dy
17
17
0
13 Dec 2020
Yelp Review Rating Prediction: Machine Learning and Deep Learning Models
Yelp Review Rating Prediction: Machine Learning and Deep Learning Models
Zefang Liu
VLM
12
15
0
12 Dec 2020
TabTransformer: Tabular Data Modeling Using Contextual Embeddings
TabTransformer: Tabular Data Modeling Using Contextual Embeddings
Xin Huang
A. Khetan
Milan Cvitkovic
Zohar S. Karnin
ViT
LMTD
151
416
0
11 Dec 2020
Orthogonal Language and Task Adapters in Zero-Shot Cross-Lingual
  Transfer
Orthogonal Language and Task Adapters in Zero-Shot Cross-Lingual Transfer
M. Vidoni
Ivan Vulić
Goran Glavas
31
27
0
11 Dec 2020
Morphology Matters: A Multilingual Language Modeling Analysis
Morphology Matters: A Multilingual Language Modeling Analysis
Hyunji Hayley Park
Katherine J. Zhang
Coleman Haley
K. Steimel
Han Liu
Lane Schwartz
39
47
0
11 Dec 2020
Reinforced Multi-Teacher Selection for Knowledge Distillation
Reinforced Multi-Teacher Selection for Knowledge Distillation
Fei Yuan
Linjun Shou
J. Pei
Wutao Lin
Ming Gong
Yan Fu
Daxin Jiang
10
121
0
11 Dec 2020
Exploring wav2vec 2.0 on speaker verification and language
  identification
Exploring wav2vec 2.0 on speaker verification and language identification
Zhiyun Fan
Meng Li
Shiyu Zhou
Bo Xu
103
202
0
11 Dec 2020
An End-to-End Solution for Named Entity Recognition in eCommerce Search
An End-to-End Solution for Named Entity Recognition in eCommerce Search
Xiang Cheng
Mitchell Bowden
Bhushan Ramesh Bhange
Priyanka Goyal
T. Packer
F. Javed
14
19
0
11 Dec 2020
EQG-RACE: Examination-Type Question Generation
EQG-RACE: Examination-Type Question Generation
Xin Jia
Wenjie Zhou
Xu Sun
Yunfang Wu
AI4Ed
16
39
0
11 Dec 2020
Multi-Sense Language Modelling
Multi-Sense Language Modelling
Andrea Lekkas
Peter Schneider-Kamp
Isabelle Augenstein
KELM
11
2
0
10 Dec 2020
Look Before you Speak: Visually Contextualized Utterances
Look Before you Speak: Visually Contextualized Utterances
Paul Hongsuck Seo
Arsha Nagrani
Cordelia Schmid
19
66
0
10 Dec 2020
A Practical Approach towards Causality Mining in Clinical Text using
  Active Transfer Learning
A Practical Approach towards Causality Mining in Clinical Text using Active Transfer Learning
Musarrat Hussain
Fahad Ahmed Satti
Jamil Hussain
Taqdir Ali
Syed Imran Ali
H. M. Bilal
Gwang Hoon Park
Sungyoung Lee
8
7
0
10 Dec 2020
Causal BERT : Language models for causality detection between events
  expressed in text
Causal BERT : Language models for causality detection between events expressed in text
Vivek Khetan
Roshni Ramnani
M. Anand
Shubhashis Sengupta
Andrew E.Fano
22
43
0
10 Dec 2020
Infusing Finetuning with Semantic Dependencies
Infusing Finetuning with Semantic Dependencies
Zhaofeng Wu
Hao Peng
Noah A. Smith
17
36
0
10 Dec 2020
Contrastive Predictive Coding for Human Activity Recognition
Contrastive Predictive Coding for Human Activity Recognition
H. Haresamudram
Irfan Essa
Thomas Ploetz
30
118
0
09 Dec 2020
Know Your Limits: Uncertainty Estimation with ReLU Classifiers Fails at
  Reliable OOD Detection
Know Your Limits: Uncertainty Estimation with ReLU Classifiers Fails at Reliable OOD Detection
Dennis Ulmer
Giovanni Cina
OODD
29
31
0
09 Dec 2020
Topological Planning with Transformers for Vision-and-Language
  Navigation
Topological Planning with Transformers for Vision-and-Language Navigation
Kevin Chen
Junshen K. Chen
Jo Chuang
Marynel Vázquez
Silvio Savarese
LM&Ro
25
99
0
09 Dec 2020
Positional Encoding as Spatial Inductive Bias in GANs
Positional Encoding as Spatial Inductive Bias in GANs
Rui Xu
Xintao Wang
Kai-xiang Chen
Bolei Zhou
Chen Change Loy
GAN
27
89
0
09 Dec 2020
Session-Aware Query Auto-completion using Extreme Multi-label Ranking
Session-Aware Query Auto-completion using Extreme Multi-label Ranking
Nishant Yadav
Rajat Sen
Daniel N. Hill
A. Mazumdar
Inderjit S. Dhillon
8
10
0
09 Dec 2020
Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps
Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps
Qi Zhu
Chenyu Gao
Peng Wang
Qi Wu
20
54
0
09 Dec 2020
Previous
123...258259260...289290291
Next