ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg
ArXivPDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 14,226 papers shown
Title
Learning Visual-Semantic Embeddings for Reporting Abnormal Findings on
  Chest X-rays
Learning Visual-Semantic Embeddings for Reporting Abnormal Findings on Chest X-rays
Jianmo Ni
Chun-Nan Hsu
Amilcare Gentili
Julian McAuley
MedIm
16
30
0
06 Oct 2020
Efficient One-Pass End-to-End Entity Linking for Questions
Efficient One-Pass End-to-End Entity Linking for Questions
Belinda Z. Li
Sewon Min
Srini Iyer
Yashar Mehdad
Wen-tau Yih
17
141
0
06 Oct 2020
Simple and Effective Few-Shot Named Entity Recognition with Structured
  Nearest Neighbor Learning
Simple and Effective Few-Shot Named Entity Recognition with Structured Nearest Neighbor Learning
Yi Yang
Arzoo Katiyar
NAI
20
61
0
06 Oct 2020
VisualWordGrid: Information Extraction From Scanned Documents Using A
  Multimodal Approach
VisualWordGrid: Information Extraction From Scanned Documents Using A Multimodal Approach
Mohamed Kerroumi
Othmane Sayem
A. Shabou
11
21
0
05 Oct 2020
InfoBERT: Improving Robustness of Language Models from An Information
  Theoretic Perspective
InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective
Boxin Wang
Shuohang Wang
Yu Cheng
Zhe Gan
R. Jia
Bo-wen Li
Jingjing Liu
AAML
38
113
0
05 Oct 2020
Deep Anomaly Detection by Residual Adaptation
Deep Anomaly Detection by Residual Adaptation
Lucas Deecke
Lukas Ruff
Robert A. Vandermeulen
Hakan Bilen
UQCV
23
4
0
05 Oct 2020
A Fully Hyperbolic Neural Model for Hierarchical Multi-Class
  Classification
A Fully Hyperbolic Neural Model for Hierarchical Multi-Class Classification
F. López
Michael Strube
13
28
0
05 Oct 2020
X-SRL: A Parallel Cross-Lingual Semantic Role Labeling Dataset
X-SRL: A Parallel Cross-Lingual Semantic Role Labeling Dataset
Angel Daza
Anette Frank
6
30
0
05 Oct 2020
Learning from Context or Names? An Empirical Study on Neural Relation
  Extraction
Learning from Context or Names? An Empirical Study on Neural Relation Extraction
Hao Peng
Tianyu Gao
Xu Han
Yankai Lin
Peng Li
Zhiyuan Liu
Maosong Sun
Jie Zhou
11
200
0
05 Oct 2020
Linguistic Profiling of a Neural Language Model
Linguistic Profiling of a Neural Language Model
Alessio Miaschi
D. Brunato
F. Dell’Orletta
Giulia Venturi
23
46
0
05 Oct 2020
PMI-Masking: Principled masking of correlated spans
PMI-Masking: Principled masking of correlated spans
Yoav Levine
Barak Lenz
Opher Lieber
Omri Abend
Kevin Leyton-Brown
Moshe Tennenholtz
Y. Shoham
11
72
0
05 Oct 2020
How Effective is Task-Agnostic Data Augmentation for Pretrained
  Transformers?
How Effective is Task-Agnostic Data Augmentation for Pretrained Transformers?
Shayne Longpre
Yu Wang
Christopher DuBois
ViT
17
83
0
05 Oct 2020
Effective Unsupervised Domain Adaptation with Adversarially Trained
  Language Models
Effective Unsupervised Domain Adaptation with Adversarially Trained Language Models
Thuy-Trang Vu
Dinh Q. Phung
Gholamreza Haffari
6
24
0
05 Oct 2020
On Losses for Modern Language Models
On Losses for Modern Language Models
Stephane Aroca-Ouellette
Frank Rudzicz
6
33
0
04 Oct 2020
An Empirical Study on Large-Scale Multi-Label Text Classification
  Including Few and Zero-Shot Labels
An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot Labels
Ilias Chalkidis
Manos Fergadiotis
Sotiris Kotitsas
Prodromos Malakasiotis
Nikolaos Aletras
Ion Androutsopoulos
VLM
AI4TS
10
84
0
04 Oct 2020
A Survey of Unsupervised Dependency Parsing
A Survey of Unsupervised Dependency Parsing
Wenjuan Han
Yong-jia Jiang
Hwee Tou Ng
Kewei Tu
SSL
21
10
0
04 Oct 2020
Multi-turn Response Selection using Dialogue Dependency Relations
Multi-turn Response Selection using Dialogue Dependency Relations
Qi Jia
Yizhu Liu
Siyu Ren
Kenny Q. Zhu
Haifeng Tang
19
45
0
04 Oct 2020
Tell Me How to Ask Again: Question Data Augmentation with Controllable
  Rewriting in Continuous Space
Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space
Dayiheng Liu
Yeyun Gong
Jie Fu
Yu Yan
Jiusheng Chen
Jiancheng Lv
Nan Duan
M. Zhou
10
37
0
04 Oct 2020
GraphDialog: Integrating Graph Knowledge into End-to-End Task-Oriented
  Dialogue Systems
GraphDialog: Integrating Graph Knowledge into End-to-End Task-Oriented Dialogue Systems
Shiquan Yang
Rui Zhang
S. Erfani
11
60
0
04 Oct 2020
A Geometry-Inspired Attack for Generating Natural Language Adversarial
  Examples
A Geometry-Inspired Attack for Generating Natural Language Adversarial Examples
Zhao Meng
Roger Wattenhofer
GAN
AAML
19
32
0
03 Oct 2020
Personality Trait Detection Using Bagged SVM over BERT Word Embedding
  Ensembles
Personality Trait Detection Using Bagged SVM over BERT Word Embedding Ensembles
Amirmohammad Kazameini
S. Fatehi
Yash Mehta
Sauleh Eetemadi
Erik Cambria
8
55
0
03 Oct 2020
Differentially Private Representation for NLP: Formal Guarantee and An
  Empirical Study on Privacy and Fairness
Differentially Private Representation for NLP: Formal Guarantee and An Empirical Study on Privacy and Fairness
Lingjuan Lyu
Xuanli He
Yitong Li
12
89
0
03 Oct 2020
Multi-domain Clinical Natural Language Processing with MedCAT: the
  Medical Concept Annotation Toolkit
Multi-domain Clinical Natural Language Processing with MedCAT: the Medical Concept Annotation Toolkit
Z. Kraljevic
Thomas Searle
Anthony Shek
Lukasz Roguski
Kawsar Noor
...
A. Shah
W. K. Wong
Zina M. Ibrahim
J. Teo
Richard J. B. Dobson
AI4MH
12
159
0
02 Oct 2020
Cost-effective Selection of Pretraining Data: A Case Study of
  Pretraining BERT on Social Media
Cost-effective Selection of Pretraining Data: A Case Study of Pretraining BERT on Social Media
Xiang Dai
Sarvnaz Karimi
Ben Hachey
Cécile Paris
11
35
0
02 Oct 2020
Accelerating Convergence of Replica Exchange Stochastic Gradient MCMC
  via Variance Reduction
Accelerating Convergence of Replica Exchange Stochastic Gradient MCMC via Variance Reduction
Wei Deng
Qi Feng
G. Karagiannis
Guang Lin
F. Liang
14
8
0
02 Oct 2020
LUKE: Deep Contextualized Entity Representations with Entity-aware
  Self-attention
LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention
Ikuya Yamada
Akari Asai
Hiroyuki Shindo
Hideaki Takeda
Yuji Matsumoto
22
662
0
02 Oct 2020
MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on
  a Massive Scale
MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on a Massive Scale
Andreas Rucklé
Jonas Pfeiffer
Iryna Gurevych
14
37
0
02 Oct 2020
Continual Learning for Natural Language Generation in Task-oriented
  Dialog Systems
Continual Learning for Natural Language Generation in Task-oriented Dialog Systems
Fei Mi
Liangwei Chen
Mengjie Zhao
Minlie Huang
Boi Faltings
CLL
KELM
17
68
0
02 Oct 2020
Remote Sensing Image Scene Classification with Self-Supervised Paradigm
  under Limited Labeled Samples
Remote Sensing Image Scene Classification with Self-Supervised Paradigm under Limited Labeled Samples
Chao Tao
Ji Qi
Weipeng Lu
Hao Wang
Haifeng Li
SSL
13
100
0
02 Oct 2020
Data Transfer Approaches to Improve Seq-to-Seq Retrosynthesis
Data Transfer Approaches to Improve Seq-to-Seq Retrosynthesis
Katsuhiko Ishiguro
K. Ujihara
R. Sawada
Hirotaka Akita
Masaaki Kotera
22
6
0
02 Oct 2020
Enriching Word Embeddings with Temporal and Spatial Information
Enriching Word Embeddings with Temporal and Spatial Information
Hongyu Gong
S. Bhat
Pramod Viswanath
14
12
0
02 Oct 2020
How to Motivate Your Dragon: Teaching Goal-Driven Agents to Speak and
  Act in Fantasy Worlds
How to Motivate Your Dragon: Teaching Goal-Driven Agents to Speak and Act in Fantasy Worlds
Prithviraj Ammanabrolu
Jack Urbanek
Margaret Li
Arthur Szlam
Tim Rocktaschel
Jason Weston
LM&Ro
13
44
0
01 Oct 2020
Beyond The Text: Analysis of Privacy Statements through Syntactic and
  Semantic Role Labeling
Beyond The Text: Analysis of Privacy Statements through Syntactic and Semantic Role Labeling
Yan Shvartzshnaider
Ananth Balashankar
Vikas Patidar
Thomas Wies
L. Subramanian
19
4
0
01 Oct 2020
Learning Variational Word Masks to Improve the Interpretability of
  Neural Text Classifiers
Learning Variational Word Masks to Improve the Interpretability of Neural Text Classifiers
Hanjie Chen
Yangfeng Ji
AAML
VLM
13
62
0
01 Oct 2020
Predicting User Engagement Status for Online Evaluation of Intelligent
  Assistants
Predicting User Engagement Status for Online Evaluation of Intelligent Assistants
Rui Meng
Zhen Yue
A. Glass
13
2
0
01 Oct 2020
Understanding Self-supervised Learning with Dual Deep Networks
Understanding Self-supervised Learning with Dual Deep Networks
Yuandong Tian
Lantao Yu
Xinlei Chen
Surya Ganguli
SSL
13
78
0
01 Oct 2020
A Survey on Explainability in Machine Reading Comprehension
A Survey on Explainability in Machine Reading Comprehension
Mokanarangan Thayaparan
Marco Valentino
André Freitas
FaML
12
50
0
01 Oct 2020
Detecting White Supremacist Hate Speech using Domain Specific Word
  Embedding with Deep Learning and BERT
Detecting White Supremacist Hate Speech using Domain Specific Word Embedding with Deep Learning and BERT
Hind S. Alatawi
Areej M. Alhothali
K. Moria
17
85
0
01 Oct 2020
CoLAKE: Contextualized Language and Knowledge Embedding
CoLAKE: Contextualized Language and Knowledge Embedding
Tianxiang Sun
Yunfan Shao
Xipeng Qiu
Qipeng Guo
Yaru Hu
Xuanjing Huang
Zheng-Wei Zhang
KELM
18
181
0
01 Oct 2020
Phonemer at WNUT-2020 Task 2: Sequence Classification Using COVID
  Twitter BERT and Bagging Ensemble Technique based on Plurality Voting
Phonemer at WNUT-2020 Task 2: Sequence Classification Using COVID Twitter BERT and Bagging Ensemble Technique based on Plurality Voting
Anshul Wadhawan
14
7
0
01 Oct 2020
RefVOS: A Closer Look at Referring Expressions for Video Object
  Segmentation
RefVOS: A Closer Look at Referring Expressions for Video Object Segmentation
Míriam Bellver
Carles Ventura
Carina Silberer
Ioannis V. Kazakos
Jordi Torres
Xavier Giró-i-Nieto
VOS
21
32
0
01 Oct 2020
RRF102: Meeting the TREC-COVID Challenge with a 100+ Runs Ensemble
RRF102: Meeting the TREC-COVID Challenge with a 100+ Runs Ensemble
Michael Bendersky
Honglei Zhuang
Ji Ma
Shuguang Han
Keith B. Hall
Ryan T. McDonald
19
16
0
01 Oct 2020
Examining the rhetorical capacities of neural language models
Examining the rhetorical capacities of neural language models
Zining Zhu
Chuer Pan
Mohamed Abdalla
Frank Rudzicz
28
10
0
01 Oct 2020
CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Masked
  Language Models
CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Masked Language Models
Nikita Nangia
Clara Vania
Rasika Bhalerao
Samuel R. Bowman
6
641
0
30 Sep 2020
Multi-document Summarization with Maximal Marginal Relevance-guided
  Reinforcement Learning
Multi-document Summarization with Maximal Marginal Relevance-guided Reinforcement Learning
Yuning Mao
Yanru Qu
Yiqing Xie
Xiang Ren
Jiawei Han
AI4TS
15
45
0
30 Sep 2020
Rethinking Attention with Performers
Rethinking Attention with Performers
K. Choromanski
Valerii Likhosherstov
David Dohan
Xingyou Song
Andreea Gane
...
Afroz Mohiuddin
Lukasz Kaiser
David Belanger
Lucy J. Colwell
Adrian Weller
8
1,517
0
30 Sep 2020
Learning Object Detection from Captions via Textual Scene Attributes
Learning Object Detection from Captions via Textual Scene Attributes
Achiya Jerbi
Roei Herzig
Jonathan Berant
Gal Chechik
Amir Globerson
22
21
0
30 Sep 2020
Stochastic Precision Ensemble: Self-Knowledge Distillation for Quantized
  Deep Neural Networks
Stochastic Precision Ensemble: Self-Knowledge Distillation for Quantized Deep Neural Networks
Yoonho Boo
Sungho Shin
Jungwook Choi
Wonyong Sung
MQ
14
29
0
30 Sep 2020
Towards a Multi-modal, Multi-task Learning based Pre-training Framework
  for Document Representation Learning
Towards a Multi-modal, Multi-task Learning based Pre-training Framework for Document Representation Learning
Subhojeet Pramanik
Shashank Mujumdar
Hima Patel
11
31
0
30 Sep 2020
Stock2Vec: A Hybrid Deep Learning Framework for Stock Market Prediction
  with Representation Learning and Temporal Convolutional Network
Stock2Vec: A Hybrid Deep Learning Framework for Stock Market Prediction with Representation Learning and Temporal Convolutional Network
Xing Wang
Yijun Wang
Bin Weng
Aleksandr Vinel
AIFin
AI4TS
19
11
0
29 Sep 2020
Previous
123...260261262...283284285
Next