ResearchTrend.AI
RoBERTa: A Robustly Optimized BERT Pretraining Approach

26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
    AIMat

Papers citing "RoBERTa: A Robustly Optimized BERT Pretraining Approach"

50 / 3,476 papers shown
Title
GraphCodeBERT: Pre-training Code Representations with Data Flow
Daya Guo
Shuo Ren
Shuai Lu
Zhangyin Feng
Duyu Tang
...
Dawn Drain
Neel Sundaresan
Jian Yin
Daxin Jiang
M. Zhou
56
1,094
0
17 Sep 2020
Efficient Transformer-based Large Scale Language Representations using Hardware-friendly Block Structured Pruning
Bingbing Li
Zhenglun Kong
Tianyun Zhang
Ji Li
Z. Li
Hang Liu
Caiwen Ding
VLM
24
64
0
17 Sep 2020
Automated Source Code Generation and Auto-completion Using Deep Learning: Comparing and Discussing Current Language-Model-Related Approaches
Juan Cruz-Benito
Sanjay Vishwakarma
Francisco Martín-Fernández
Ismael Faro
22
30
0
16 Sep 2020
Reasoning about Goals, Steps, and Temporal Ordering with WikiHow
Li Zhang
Qing Lyu
Chris Callison-Burch
ReLM
LRM
11
85
0
16 Sep 2020
Critical Thinking for Language Models
Gregor Betz
Christian Voigt
Kyle Richardson
SyDa
ReLM
LRM
AI4CE
18
35
0
15 Sep 2020
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners
Timo Schick
Hinrich Schütze
22
953
0
15 Sep 2020
MLMLM: Link Prediction with Mean Likelihood Masked Language Model
Louis Clouâtre
P. Trempe
Amal Zouaq
Sarath Chandar
17
43
0
15 Sep 2020
GeDi: Generative Discriminator Guided Sequence Generation
Ben Krause
Akhilesh Deepak Gotmare
Bryan McCann
N. Keskar
Shafiq R. Joty
R. Socher
Nazneen Rajani
51
389
0
14 Sep 2020
Learning an Effective Context-Response Matching Model with Self-Supervised Tasks for Retrieval-based Dialogues
Ruijian Xu
Chongyang Tao
Daxin Jiang
Xueliang Zhao
Dongyan Zhao
Rui Yan
24
70
0
14 Sep 2020
On Robustness and Bias Analysis of BERT-based Relation Extraction
Luoqiu Li
Xiang Chen
Hongbin Ye
Zhen Bi
Shumin Deng
Ningyu Zhang
Huajun Chen
24
18
0
14 Sep 2020
Cosine meets Softmax: A tough-to-beat baseline for visual grounding
N. Rufus
U. R. Nair
K. M. Krishna
Vineet Gandhi
22
13
0
13 Sep 2020
Differentially Private Language Models Benefit from Public Pre-training
Gavin Kerrigan
Dylan Slack
Jens Tuyls
13
56
0
13 Sep 2020
Improving Machine Reading Comprehension with Contextualized Commonsense Knowledge
Kai Sun
Dian Yu
Jianshu Chen
Dong Yu
Claire Cardie
25
12
0
12 Sep 2020
CIA_NITT at WNUT-2020 Task 2: Classification of COVID-19 Tweets Using Pre-trained Language Models
Yandrapati Prakash Babu
Eswari Rajagopal
19
9
0
12 Sep 2020
Sparsifying Transformer Models with Trainable Representation Pooling
Michal Pietruszka
Łukasz Borchmann
Lukasz Garncarek
13
10
0
10 Sep 2020
Brain2Word: Decoding Brain Activity for Language Generation
Nicolas Affolter
Béni Egressy
Damian Pascual
Roger Wattenhofer
9
21
0
10 Sep 2020
Exploiting Multi-Modal Features From Pre-trained Networks for Alzheimer's Dementia Recognition
Junghyun Koo
Jie Hwan Lee
Jaewoo Pyo
Yujin Jo
Kyogu Lee
11
58
0
09 Sep 2020
kk2018 at SemEval-2020 Task 9: Adversarial Training for Code-Mixing Sentiment Classification
Jiaxiang Liu
Xuyi Chen
Shikun Feng
Shuohuan Wang
Ouyang Xuan
Yu Sun
Zhengjie Huang
Weiyue Su
27
19
0
08 Sep 2020
Accenture at CheckThat! 2020: If you say so: Post-hoc fact-checking of claims using transformer-based models
Evan Williams
Paul Rodrigues
Valerie Novak
31
42
0
05 Sep 2020
Attention Flows: Analyzing and Comparing Attention Mechanisms in Language Models
Joseph F DeRose
Jiayao Wang
M. Berger
15
83
0
03 Sep 2020
A Primer on Motion Capture with Deep Learning: Principles, Pitfalls and Perspectives
Alexander Mathis
Steffen Schneider
Jessy Lauer
Mackenzie W. Mathis
20
165
0
01 Sep 2020
A Framework For Contrastive Self-Supervised Learning And Designing A New Approach
William Falcon
Kyunghyun Cho
SSL
11
103
0
31 Aug 2020
A Survey of Evaluation Metrics Used for NLG Systems
Ananya B. Sai
Akash Kumar Mohankumar
Mitesh M. Khapra
ELM
25
228
0
27 Aug 2020
Analysis and Evaluation of Language Models for Word Sense Disambiguation
Daniel Loureiro
Kiamehr Rezaee
Mohammad Taher Pilehvar
Jose Camacho-Collados
10
13
0
26 Aug 2020
Multi-Label Sentiment Analysis on 100 Languages with Dynamic Weighting for Label Imbalance
Selim F. Yilmaz
E. Kaynak
Aykut Koç
H. Dibeklioğlu
Suleyman Serdar Kozat
29
26
0
26 Aug 2020
Conceptualized Representation Learning for Chinese Biomedical Text Mining
Ningyu Zhang
Qianghuai Jia
Kangping Yin
Liang Dong
Feng Gao
Nengwei Hua
OOD
29
65
0
25 Aug 2020
How Have We Reacted To The COVID-19 Pandemic? Analyzing Changing Indian Emotions Through The Lens of Twitter
Rajdeep Mukherjee
S. Poddar
Atharva Naik
Soham Dasgupta
17
5
0
20 Aug 2020
Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems
Andrea Madotto
Zihan Liu
Zhaojiang Lin
Pascale Fung
38
58
0
14 Aug 2020
Hybrid Ranking Network for Text-to-SQL
Qin Lyu
K. Chakrabarti
Shobhit Hathi
Souvik Kundu
Jianwen Zhang
Zheng Chen
AIMat
9
83
0
11 Aug 2020
KR-BERT: A Small-Scale Korean-Specific Language Model
Sangah Lee
Hansol Jang
Yunmee Baik
Suzi Park
Hyopil Shin
14
51
0
10 Aug 2020
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR
Hayato Futami
H. Inaguma
Sei Ueno
Masato Mimura
S. Sakai
Tatsuya Kawahara
19
50
0
09 Aug 2020
ConvBERT: Improving BERT with Span-based Dynamic Convolution
Zihang Jiang
Weihao Yu
Daquan Zhou
Yunpeng Chen
Jiashi Feng
Shuicheng Yan
32
156
0
06 Aug 2020
Aligning AI With Shared Human Values
Dan Hendrycks
Collin Burns
Steven Basart
Andrew Critch
J. Li
D. Song
Jacob Steinhardt
32
515
0
05 Aug 2020
Multilingual Translation with Extensible Multilingual Pretraining and Finetuning
Y. Tang
C. Tran
Xian Li
Peng-Jen Chen
Naman Goyal
Vishrav Chaudhary
Jiatao Gu
Angela Fan
CLL
47
445
0
02 Aug 2020
MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering
Shayne Longpre
Yi Lu
Joachim Daiber
ELM
HILM
30
151
0
30 Jul 2020
Representation Learning with Video Deep InfoMax
R. Devon Hjelm
Philip Bachman
SSL
MDE
14
28
0
27 Jul 2020
Reed at SemEval-2020 Task 9: Fine-Tuning and Bag-of-Words Approaches to Code-Mixed Sentiment Analysis
Vinay Gopalan
Mark Hopkins
20
6
0
26 Jul 2020
Named entity recognition in chemical patents using ensemble of contextual language models
J. Copara
Nona Naderi
J. Knafou
Patrick Ruch
Douglas Teodoro
21
23
0
24 Jul 2020
FiSSA at SemEval-2020 Task 9: Fine-tuned For Feelings
Bertelt Braaksma
R. Scholtens
Stan van Suijlekom
Remy Wang
A. Ustun
15
3
0
24 Jul 2020
Clustering of Social Media Messages for Humanitarian Aid Response during Crisis
Swati Padhee
T. K. Saha
Joel R. Tetreault
A. Jaimes
17
6
0
23 Jul 2020
Investigating Pretrained Language Models for Graph-to-Text Generation
Leonardo F. R. Ribeiro
Martin Schmitt
Hinrich Schütze
Iryna Gurevych
17
215
0
16 Jul 2020
LogiQA: A Challenge Dataset for Machine Reading Comprehension with Logical Reasoning
Jian Liu
Leyang Cui
Hanmeng Liu
Dandan Huang
Yile Wang
Yue Zhang
RALM
6
331
0
16 Jul 2020
Fighting the COVID-19 Infodemic in Social Media: A Holistic Perspective and a Call to Arms
Firoj Alam
Fahim Dalvi
Shaden Shaar
Nadir Durrani
Hamdy Mubarak
...
Giovanni Da San Martino
Ahmed Abdelali
Hassan Sajjad
Kareem Darwish
Preslav Nakov
14
102
0
15 Jul 2020
Deep learning models for representing out-of-vocabulary words
Johannes V. Lochter
Renato M. Silva
Tiago A. Almeida
11
15
0
14 Jul 2020
An Empirical Study on Robustness to Spurious Correlations using Pre-trained Language Models
Lifu Tu
Garima Lalwani
Spandana Gella
He He
LRM
19
184
0
14 Jul 2020
Learning Reasoning Strategies in End-to-End Differentiable Proving
Pasquale Minervini
Sebastian Riedel
Pontus Stenetorp
Edward Grefenstette
Tim Rocktäschel
LRM
37
96
0
13 Jul 2020
TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech
Andy T. Liu
Shang-Wen Li
Hung-yi Lee
SSL
48
356
0
12 Jul 2020
Learning Sparse Prototypes for Text Generation
Junxian He
Taylor Berg-Kirkpatrick
Graham Neubig
16
23
0
29 Jun 2020
Improving Sequence Tagging for Vietnamese Text Using Transformer-based Neural Models
Viet The Bui
Oanh T. K. Tran
Hong Phuong Le
17
38
0
29 Jun 2020
Evaluation of Text Generation: A Survey
Asli Celikyilmaz
Elizabeth Clark
Jianfeng Gao
ELM
LM&MA
19
376
0
26 Jun 2020