ResearchTrend.AI
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 15,000 papers shown
A Neighbourhood Framework for Resource-Lean Content Flagging
Sheikh Muhammad Sarwar
Dimitrina Zlatkova
Momchil Hardalov
Yoan Dinkov
Isabelle Augenstein
Preslav Nakov
16
5
0
31 Mar 2021
Deep Neural Approaches to Relation Triplets Extraction: A Comprehensive Survey
Tapas Nayak
Navonil Majumder
Pawan Goyal
Soujanya Poria
ViT
14
49
0
31 Mar 2021
Learning Generalizable Robotic Reward Functions from "In-The-Wild" Human Videos
Annie S. Chen
Suraj Nair
Chelsea Finn
30
137
0
31 Mar 2021
Dual Contrastive Loss and Attention for GANs
Ning Yu
Guilin Liu
Aysegül Dündar
Andrew Tao
Bryan Catanzaro
Larry S. Davis
Mario Fritz
GAN
24
60
0
31 Mar 2021
EnergyVis: Interactively Tracking and Exploring Energy Consumption for ML Models
Omar Shaikh
Jon Saad-Falcon
Austin P. Wright
Nilaksh Das
Scott Freitas
O. Asensio
Duen Horng Chau
24
18
0
30 Mar 2021
Grounding Dialogue Systems via Knowledge Graph Aware Decoding with Pre-trained Transformers
Debanjan Chaudhuri
Md. Rony
Jens Lehmann
13
12
0
30 Mar 2021
Autocorrect in the Process of Translation -- Multi-task Learning Improves Dialogue Machine Translation
Tao Wang
Chengqi Zhao
Mingxuan Wang
Lei Li
Deyi Xiong
20
13
0
30 Mar 2021
CaSiNo: A Corpus of Campsite Negotiation Dialogues for Automatic Negotiation Systems
Kushal Chawla
Jaysa Ramirez
Rene Clever
Gale M. Lucas
Jonathan May
Jonathan Gratch
15
50
0
29 Mar 2021
ViViT: A Video Vision Transformer
Anurag Arnab
Mostafa Dehghani
G. Heigold
Chen Sun
Mario Lucic
Cordelia Schmid
ViT
30
2,086
0
29 Mar 2021
On the Adversarial Robustness of Vision Transformers
Rulin Shao
Zhouxing Shi
Jinfeng Yi
Pin-Yu Chen
Cho-Jui Hsieh
ViT
30
137
0
29 Mar 2021
Efficient Explanations from Empirical Explainers
Robert Schwarzenberg
Nils Feldhus
Sebastian Möller
FAtt
29
9
0
29 Mar 2021
Changing the Mind of Transformers for Topically-Controllable Language Generation
Haw-Shiuan Chang
Jiaming Yuan
Mohit Iyyer
Andrew McCallum
20
9
0
29 Mar 2021
PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS
Ye Jia
Heiga Zen
Jonathan Shen
Yu Zhang
Yonghui Wu
SSL
19
81
0
28 Mar 2021
TransICD: Transformer Based Code-wise Attention Model for Explainable ICD Coding
Biplob Biswas
Thai-Hoang Pham
Ping Zhang
13
29
0
28 Mar 2021
Accurate and Reliable Forecasting using Stochastic Differential Equations
Peng Cui
Zhijie Deng
Wenbo Hu
Jun Zhu
UQCV
30
1
0
28 Mar 2021
Automated Backend-Aware Post-Training Quantization
Ziheng Jiang
Animesh Jain
An Liu
Josh Fromm
Chengqian Ma
Tianqi Chen
Luis Ceze
MQ
35
2
0
27 Mar 2021
Machine Learning Meets Natural Language Processing -- The story so far
N. Galanis
P. Vafiadis
K.-G. Mirzaev
G. Papakostas
30
6
0
27 Mar 2021
Synthesis of Compositional Animations from Textual Descriptions
Anindita Ghosh
N. Cheema
Cennet Oguz
Christian Theobalt
P. Slusallek
31
170
0
26 Mar 2021
Gated Transformer Networks for Multivariate Time Series Classification
Minghao Liu
Shengqi Ren
Siyuan Ma
Jiahui Jiao
Yizhou Chen
Zhiguang Wang
Wei Song
AI4TS
36
130
0
26 Mar 2021
Describing and Localizing Multiple Changes with Transformers
Yue Qiu
Shintaro Yamamoto
Kodai Nakashima
Ryota Suzuki
K. Iwata
Hirokatsu Kataoka
Y. Satoh
27
55
0
25 Mar 2021
High-Fidelity Pluralistic Image Completion with Transformers
Ziyu Wan
Jingbo Zhang
Dongdong Chen
Jing Liao
ViT
23
231
0
25 Mar 2021
Bertinho: Galician BERT Representations
David Vilares
Marcos Garcia
Carlos Gómez-Rodríguez
57
22
0
25 Mar 2021
Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting
A. Bhunia
Pinaki Nath Chowdhury
Yongxin Yang
Timothy M. Hospedales
Tao Xiang
Yi-Zhe Song
SSL
17
59
0
25 Mar 2021
An Approach to Improve Robustness of NLP Systems against ASR Errors
Tong Cui
Jinghui Xiao
Liangyou Li
Xin Jiang
Qun Liu
19
11
0
25 Mar 2021
Improving Online Forums Summarization via Hierarchical Unified Deep Neural Network
Sansiri Tarnpradab
Fereshteh Jafariakinabad
K. Hua
13
5
0
25 Mar 2021
Efficient Feature Transformations for Discriminative and Generative Continual Learning
Vinay K. Verma
Kevin J Liang
Nikhil Mehta
Piyush Rai
Lawrence Carin
CLL
35
76
0
25 Mar 2021
Vision Transformers for Dense Prediction
René Ranftl
Alexey Bochkovskiy
V. Koltun
ViT
MDE
42
1,659
0
24 Mar 2021
FastMoE: A Fast Mixture-of-Expert Training System
Jiaao He
J. Qiu
Aohan Zeng
Zhilin Yang
Jidong Zhai
Jie Tang
ALM
MoE
22
94
0
24 Mar 2021
Representing Numbers in NLP: a Survey and a Vision
Avijit Thawani
Jay Pujara
Pedro A. Szekely
Filip Ilievski
24
114
0
24 Mar 2021
Thinking Aloud: Dynamic Context Generation Improves Zero-Shot Reasoning Performance of GPT-2
Gregor Betz
Kyle Richardson
Christian Voigt
ReLM
LRM
16
29
0
24 Mar 2021
Czert -- Czech BERT-like Model for Language Representation
Jakub Sido
O. Pražák
P. Pribán
Jan Pasek
Michal Seják
Miloslav Konopík
16
43
0
24 Mar 2021
UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask Benchmark
Nicholas Lourie
Ronan Le Bras
Chandra Bhagavatula
Yejin Choi
LRM
22
137
0
24 Mar 2021
Multi-view 3D Reconstruction with Transformer
Dan Wang
Xinrui Cui
Xun Chen
Zhengxia Zou
Tianyang Shi
Septimiu Salcudean
Z. J. Wang
Rabab Ward
ViT
20
87
0
24 Mar 2021
Scene-Intuitive Agent for Remote Embodied Visual Grounding
Xiangru Lin
Guanbin Li
Yizhou Yu
LM&Ro
22
52
0
24 Mar 2021
Region Similarity Representation Learning
Tete Xiao
Colorado Reed
Xiaolong Wang
Kurt Keutzer
Trevor Darrell
VLM
SSL
29
116
0
24 Mar 2021
The NLP Cookbook: Modern Recipes for Transformer based Deep Learning Architectures
Sushant Singh
A. Mahmood
AI4TS
60
92
0
23 Mar 2021
Self-Supervised Pretraining Improves Self-Supervised Pretraining
Colorado Reed
Xiangyu Yue
Aniruddha Nrusimha
Sayna Ebrahimi
Vivek Vijaykumar
...
Shanghang Zhang
Devin Guillory
Sean L. Metzger
Kurt Keutzer
Trevor Darrell
25
105
0
23 Mar 2021
QuestEval: Summarization Asks for Fact-based Evaluation
Thomas Scialom
Paul-Alexis Dray
Patrick Gallinari
Sylvain Lamprier
Benjamin Piwowarski
Jacopo Staiano
Alex Jinpeng Wang
HILM
11
267
0
23 Mar 2021
How to decay your learning rate
Aitor Lewkowycz
36
24
0
23 Mar 2021
Self-supervised representation learning from 12-lead ECG data
Temesgen Mehari
Nils Strodthoff
SSL
18
141
0
23 Mar 2021
Are Neural Language Models Good Plagiarists? A Benchmark for Neural Paraphrase Detection
Jan Philip Wahle
Terry Ruas
Norman Meuschke
Bela Gipp
25
34
0
23 Mar 2021
Detecting Hate Speech with GPT-3
Ke-Li Chiu
Annie Collins
Rohan Alexander
AILaw
15
108
0
23 Mar 2021
Instance-level Image Retrieval using Reranking Transformers
Fuwen Tan
Jiangbo Yuan
Vicente Ordonez
ViT
26
89
0
22 Mar 2021
Tiny Transformers for Environmental Sound Classification at the Edge
David Elliott
Carlos E. Otero
Steven Wyatt
Evan Martino
21
15
0
22 Mar 2021
Open Domain Question Answering over Tables via Dense Retrieval
Jonathan Herzig
Thomas Müller
Syrine Krichene
Julian Martin Eisenschlos
LMTD
VLM
RALM
36
99
0
22 Mar 2021
Improving and Simplifying Pattern Exploiting Training
Derek Tam
Rakesh R Menon
Mohit Bansal
Shashank Srivastava
Colin Raffel
13
149
0
22 Mar 2021
BERT: A Review of Applications in Natural Language Processing and Understanding
M. V. Koroteev
VLM
22
194
0
22 Mar 2021
Retrieve Fast, Rerank Smart: Cooperative and Joint Approaches for Improved Cross-Modal Retrieval
Gregor Geigle
Jonas Pfeiffer
Nils Reimers
Ivan Vulić
Iryna Gurevych
27
59
0
22 Mar 2021
Identifying Machine-Paraphrased Plagiarism
Jan Philip Wahle
Terry Ruas
Tomáš Foltýnek
Norman Meuschke
Bela Gipp
11
30
0
22 Mar 2021
DeepViT: Towards Deeper Vision Transformer
Daquan Zhou
Bingyi Kang
Xiaojie Jin
Linjie Yang
Xiaochen Lian
Zihang Jiang
Qibin Hou
Jiashi Feng
ViT
42
510
0
22 Mar 2021