ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.04805
  4. Cited By
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
    VLM
    SSL
    SSeg
ArXivPDFHTML

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 15,155 papers shown
Title
SelfORE: Self-supervised Relational Feature Learning for Open Relation
  Extraction
SelfORE: Self-supervised Relational Feature Learning for Open Relation Extraction
Xuming Hu
Chenwei Zhang
Yusong Xu
Lijie Wen
Philip S. Yu
SSL
21
84
0
06 Apr 2020
TAPAS: Weakly Supervised Table Parsing via Pre-training
TAPAS: Weakly Supervised Table Parsing via Pre-training
Jonathan Herzig
Pawel Krzysztof Nowak
Thomas Müller
Francesco Piccinno
Julian Martin Eisenschlos
LMTD
RALM
21
633
0
05 Apr 2020
Continual Domain-Tuning for Pretrained Language Models
Continual Domain-Tuning for Pretrained Language Models
Subendhu Rongali
Abhyuday N. Jagannatha
Bhanu Pratap Singh Rawat
Hong-ye Yu
CLL
KELM
6
7
0
05 Apr 2020
Syntax-driven Iterative Expansion Language Models for Controllable Text
  Generation
Syntax-driven Iterative Expansion Language Models for Controllable Text Generation
Noe Casas
José A. R. Fonollosa
Marta R. Costa-jussá
19
11
0
05 Apr 2020
Clustering based Contrastive Learning for Improving Face Representations
Clustering based Contrastive Learning for Improving Face Representations
Vivek Sharma
Makarand Tapaswi
M. Sarfraz
Rainer Stiefelhagen
CVBM
SSL
19
46
0
05 Apr 2020
FastBERT: a Self-distilling BERT with Adaptive Inference Time
FastBERT: a Self-distilling BERT with Adaptive Inference Time
Weijie Liu
Peng Zhou
Zhe Zhao
Zhiruo Wang
Haotang Deng
Qi Ju
31
354
0
05 Apr 2020
Unsupervised Domain Clusters in Pretrained Language Models
Unsupervised Domain Clusters in Pretrained Language Models
Roee Aharoni
Yoav Goldberg
24
243
0
05 Apr 2020
Optimus: Organizing Sentences via Pre-trained Modeling of a Latent Space
Optimus: Organizing Sentences via Pre-trained Modeling of a Latent Space
Chunyuan Li
Xiang Gao
Yuan Li
Baolin Peng
Xiujun Li
Yizhe Zhang
Jianfeng Gao
SSL
DRL
32
181
0
05 Apr 2020
A Hierarchical Network for Abstractive Meeting Summarization with
  Cross-Domain Pretraining
A Hierarchical Network for Abstractive Meeting Summarization with Cross-Domain Pretraining
Chenguang Zhu
Ruochen Xu
Michael Zeng
Xuedong Huang
BDL
AI4TS
18
18
0
04 Apr 2020
Generating Hierarchical Explanations on Text Classification via Feature
  Interaction Detection
Generating Hierarchical Explanations on Text Classification via Feature Interaction Detection
Hanjie Chen
Guangtao Zheng
Yangfeng Ji
FAtt
30
91
0
04 Apr 2020
An Iterative Multi-Knowledge Transfer Network for Aspect-Based Sentiment
  Analysis
An Iterative Multi-Knowledge Transfer Network for Aspect-Based Sentiment Analysis
Yunlong Liang
Fandong Meng
Jinchao Zhang
Yufeng Chen
Jinan Xu
Jie Zhou
30
38
0
04 Apr 2020
Aligned Cross Entropy for Non-Autoregressive Machine Translation
Aligned Cross Entropy for Non-Autoregressive Machine Translation
Marjan Ghazvininejad
Vladimir Karpukhin
Luke Zettlemoyer
Omer Levy
30
115
0
03 Apr 2020
Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal
  Transformers
Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal Transformers
Zhicheng Huang
Zhaoyang Zeng
Bei Liu
Dongmei Fu
Jianlong Fu
ViT
30
436
0
02 Apr 2020
Information Leakage in Embedding Models
Information Leakage in Embedding Models
Congzheng Song
A. Raghunathan
MIACV
16
260
0
31 Mar 2020
Code Prediction by Feeding Trees to Transformers
Code Prediction by Feeding Trees to Transformers
Seohyun Kim
Jinman Zhao
Yuchi Tian
S. Chandra
33
216
0
30 Mar 2020
Sign Language Transformers: Joint End-to-end Sign Language Recognition
  and Translation
Sign Language Transformers: Joint End-to-end Sign Language Recognition and Translation
Necati Cihan Camgöz
Oscar Koller
Simon Hadfield
Richard Bowden
SLR
17
489
0
30 Mar 2020
Span-based discontinuous constituency parsing: a family of exact
  chart-based algorithms with time complexities from O(n^6) down to O(n^3)
Span-based discontinuous constituency parsing: a family of exact chart-based algorithms with time complexities from O(n^6) down to O(n^3)
Caio Corro
8
23
0
30 Mar 2020
Gossip and Attend: Context-Sensitive Graph Representation Learning
Gossip and Attend: Context-Sensitive Graph Representation Learning
Zekarias T. Kefato
Sarunas Girdzijauskas
16
7
0
30 Mar 2020
Speech2Action: Cross-modal Supervision for Action Recognition
Speech2Action: Cross-modal Supervision for Action Recognition
Arsha Nagrani
Chen Sun
David A. Ross
Rahul Sukthankar
Cordelia Schmid
Andrew Zisserman
25
54
0
30 Mar 2020
AliCoCo: Alibaba E-commerce Cognitive Concept Net
AliCoCo: Alibaba E-commerce Cognitive Concept Net
Xusheng Luo
Luxin Liu
Y. Yang
Le Bo
Yuanpeng Cao
Jinhang Wu
Qiang Li
Keping Yang
Kenny Q. Zhu
22
66
0
30 Mar 2020
Learning Interactions and Relationships between Movie Characters
Learning Interactions and Relationships between Movie Characters
Anna Kukleva
Makarand Tapaswi
Ivan Laptev
38
51
0
29 Mar 2020
Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency
  Parsing with Iterative Refinement
Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement
Alireza Mohammadshahi
James Henderson
29
33
0
29 Mar 2020
Abstractive Text Summarization based on Language Model Conditioning and
  Locality Modeling
Abstractive Text Summarization based on Language Model Conditioning and Locality Modeling
Dmitrii Aksenov
J. Moreno-Schneider
Peter Bourgonje
Robert Schwarzenberg
Leonhard Hennig
Georg Rehm
19
25
0
29 Mar 2020
Actor-Transformers for Group Activity Recognition
Actor-Transformers for Group Activity Recognition
Kirill Gavrilyuk
Ryan Sanford
Mehrsan Javan
Cees G. M. Snoek
ViT
19
178
0
28 Mar 2020
Information-Theoretic Probing with Minimum Description Length
Information-Theoretic Probing with Minimum Description Length
Elena Voita
Ivan Titov
21
270
0
27 Mar 2020
Improving Reproducibility in Machine Learning Research (A Report from
  the NeurIPS 2019 Reproducibility Program)
Improving Reproducibility in Machine Learning Research (A Report from the NeurIPS 2019 Reproducibility Program)
Joelle Pineau
Philippe Vincent-Lamarre
Koustuv Sinha
V. Larivière
A. Beygelzimer
Florence dÁlché-Buc
E. Fox
Hugo Larochelle
19
357
0
27 Mar 2020
Integrating Crowdsourcing and Active Learning for Classification of
  Work-Life Events from Tweets
Integrating Crowdsourcing and Active Learning for Classification of Work-Life Events from Tweets
Yunpeng Zhao
M. Prosperi
Tianchen Lyu
Yi Guo
Jiang Bian
13
5
0
26 Mar 2020
StrokeCoder: Path-Based Image Generation from Single Examples using
  Transformers
StrokeCoder: Path-Based Image Generation from Single Examples using Transformers
Sabine Wieluch
Friedhelm Schwenker
ViT
GAN
15
7
0
26 Mar 2020
A Survey of Deep Learning for Scientific Discovery
A Survey of Deep Learning for Scientific Discovery
M. Raghu
Erica Schmidt
OOD
AI4CE
38
120
0
26 Mar 2020
Mapping the Landscape of Artificial Intelligence Applications against
  COVID-19
Mapping the Landscape of Artificial Intelligence Applications against COVID-19
Joseph Aylett-Bullock
A. Luccioni
K. H. Pham
C. Lam
M. Luengo-Oroz
AI4CE
39
406
0
25 Mar 2020
XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating
  Cross-lingual Generalization
XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization
Junjie Hu
Sebastian Ruder
Aditya Siddhant
Graham Neubig
Orhan Firat
Melvin Johnson
ELM
57
955
0
24 Mar 2020
TextCaps: a Dataset for Image Captioning with Reading Comprehension
TextCaps: a Dataset for Image Captioning with Reading Comprehension
Oleksii Sidorov
Ronghang Hu
Marcus Rohrbach
Amanpreet Singh
25
386
0
24 Mar 2020
Machine learning as a model for cultural learning: Teaching an algorithm
  what it means to be fat
Machine learning as a model for cultural learning: Teaching an algorithm what it means to be fat
Alina Arseniev-Koehler
J. Foster
43
46
0
24 Mar 2020
Multi-Label Text Classification using Attention-based Graph Neural
  Network
Multi-Label Text Classification using Attention-based Graph Neural Network
Ankit Pal
M. Selvakumar
Malaikannan Sankarasubbu
29
80
0
22 Mar 2020
TanhExp: A Smooth Activation Function with High Convergence Speed for
  Lightweight Neural Networks
TanhExp: A Smooth Activation Function with High Convergence Speed for Lightweight Neural Networks
Xinyu Liu
Xiaoguang Di
19
59
0
22 Mar 2020
Visual Question Answering for Cultural Heritage
Visual Question Answering for Cultural Heritage
P. Bongini
Federico Becattini
Andrew D. Bagdanov
A. Bimbo
179
22
0
22 Mar 2020
NSURL-2019 Task 7: Named Entity Recognition (NER) in Farsi
NSURL-2019 Task 7: Named Entity Recognition (NER) in Farsi
Nasrin Taghizadeh
Zeinab Borhanifard
Melika GolestaniPour
Heshaam Faili
14
8
0
19 Mar 2020
Normalized and Geometry-Aware Self-Attention Network for Image
  Captioning
Normalized and Geometry-Aware Self-Attention Network for Image Captioning
Longteng Guo
Jing Liu
Xinxin Zhu
Peng Yao
Shichen Lu
Hanqing Lu
ViT
120
189
0
19 Mar 2020
Beheshti-NER: Persian Named Entity Recognition Using BERT
Beheshti-NER: Persian Named Entity Recognition Using BERT
Ehsan Taher
S. A. Hoseini
M. Shamsfard
9
34
0
19 Mar 2020
Enhancing Factual Consistency of Abstractive Summarization
Enhancing Factual Consistency of Abstractive Summarization
Chenguang Zhu
William Fu-Hinthorn
Ruochen Xu
Qingkai Zeng
Michael Zeng
Xuedong Huang
Meng-Long Jiang
HILM
KELM
190
40
0
19 Mar 2020
Diversity, Density, and Homogeneity: Quantitative Characteristic Metrics
  for Text Collections
Diversity, Density, and Homogeneity: Quantitative Characteristic Metrics for Text Collections
Yi-An Lai
Xuan Zhu
Yi Zhang
Mona T. Diab
12
21
0
19 Mar 2020
X-Stance: A Multilingual Multi-Target Dataset for Stance Detection
X-Stance: A Multilingual Multi-Target Dataset for Stance Detection
Jannis Vamvas
Rico Sennrich
19
83
0
18 Mar 2020
Distant Supervision and Noisy Label Learning for Low Resource Named
  Entity Recognition: A Study on Hausa and Yorùbá
Distant Supervision and Noisy Label Learning for Low Resource Named Entity Recognition: A Study on Hausa and Yorùbá
David Ifeoluwa Adelani
Michael A. Hedderich
D. Zhu
Esther van den Berg
Dietrich Klakow
6
11
0
18 Mar 2020
Pre-trained Models for Natural Language Processing: A Survey
Pre-trained Models for Natural Language Processing: A Survey
Xipeng Qiu
Tianxiang Sun
Yige Xu
Yunfan Shao
Ning Dai
Xuanjing Huang
LM&MA
VLM
243
1,450
0
18 Mar 2020
Transformer Networks for Trajectory Forecasting
Transformer Networks for Trajectory Forecasting
Francesco Giuliari
Irtiza Hasan
Marco Cristani
Fabio Galasso
113
371
0
18 Mar 2020
Watching the World Go By: Representation Learning from Unlabeled Videos
Watching the World Go By: Representation Learning from Unlabeled Videos
Daniel Gordon
Kiana Ehsani
D. Fox
Ali Farhadi
SSL
AI4TS
24
87
0
18 Mar 2020
Self-Supervised Log Parsing
Self-Supervised Log Parsing
S. Nedelkoski
Jasmin Bogatinovski
Alexander Acker
Jorge Cardoso
O. Kao
6
71
0
17 Mar 2020
A comprehensive study on the prediction reliability of graph neural
  networks for virtual screening
A comprehensive study on the prediction reliability of graph neural networks for virtual screening
Soojung Yang
K. Lee
Seongok Ryu
19
7
0
17 Mar 2020
XPersona: Evaluating Multilingual Personalized Chatbot
XPersona: Evaluating Multilingual Personalized Chatbot
Zhaojiang Lin
Zihan Liu
Genta Indra Winata
Samuel Cahyawijaya
Andrea Madotto
Yejin Bang
Etsuko Ishii
Pascale Fung
45
57
0
17 Mar 2020
Offensive Language Identification in Greek
Offensive Language Identification in Greek
Zeses Pitenis
Marcos Zampieri
Tharindu Ranasinghe
6
153
0
16 Mar 2020
Previous
123...287288289...302303304
Next