Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Re-assign community
ArXiv
PDF
HTML
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 3,476 papers shown
Title
Improving Controllability of Educational Question Generation by Keyword Provision
Ying-Hong Chan
Ho-Lam Chung
Yao-Chung Fan
19
3
0
02 Dec 2021
NER-BERT: A Pre-trained Model for Low-Resource Entity Tagging
Zihan Liu
Feijun Jiang
Yuxiang Hu
Chen Shi
Pascale Fung
22
37
0
01 Dec 2021
Interactive Model with Structural Loss for Language-based Abductive Reasoning
Linhao Li
Ming Xu
Yongfeng Dong
Xin Li
Ao Wang
12
2
0
01 Dec 2021
Dyna-bAbI: unlocking bAbI's potential with dynamic synthetic benchmarking
Ronen Tamari
Kyle Richardson
Aviad Sar-Shalom
Noam Kahlon
Nelson F. Liu
Reut Tsarfaty
Dafna Shahaf
35
5
0
30 Nov 2021
Refined Commonsense Knowledge from Large-Scale Web Contents
Tuan-Phong Nguyen
Simon Razniewski
Julien Romero
G. Weikum
28
32
0
30 Nov 2021
NLP Techniques for Water Quality Analysis in Social Media Content
Muhammad Asif Ayub
Khubaib Ahmad
Kashif Ahmad
Nasir Ahmad
Ala I. Al-Fuqaha
14
6
0
30 Nov 2021
EdiBERT, a generative model for image editing
Thibaut Issenhuth
Ugo Tanielian
Jérémie Mary
David Picard
DiffM
29
12
0
30 Nov 2021
End-to-End Referring Video Object Segmentation with Multimodal Transformers
Adam Botach
Evgenii Zheltonozhskii
Chaim Baskin
VOS
17
140
0
29 Nov 2021
Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point Modeling
Xumin Yu
Lulu Tang
Yongming Rao
Tiejun Huang
Jie Zhou
Jiwen Lu
3DPC
20
652
0
29 Nov 2021
Understanding Out-of-distribution: A Perspective of Data Dynamics
Dyah Adila
Dongyeop Kang
32
12
0
29 Nov 2021
Context Matters in Semantically Controlled Language Generation for Task-oriented Dialogue Systems
Ye Liu
Wolfgang Maier
Wolfgang Minker
Stefan Ultes
16
4
0
28 Nov 2021
LAFITE: Towards Language-Free Training for Text-to-Image Generation
Yufan Zhou
Ruiyi Zhang
Changyou Chen
Chunyuan Li
Chris Tensmeyer
Tong Yu
Jiuxiang Gu
Jinhui Xu
Tong Sun
VLM
21
162
0
27 Nov 2021
Semantic-Aware Generation for Self-Supervised Visual Representation Learning
Yunjie Tian
Lingxi Xie
Xiaopeng Zhang
Jiemin Fang
Haohang Xu
Wei Huang
Jianbin Jiao
Qi Tian
QiXiang Ye
SSL
GAN
28
16
0
25 Nov 2021
Transformer-based Korean Pretrained Language Models: A Survey on Three Years of Progress
Kichang Yang
KELM
VLM
23
11
0
25 Nov 2021
Temporal Effects on Pre-trained Models for Language Processing Tasks
Oshin Agarwal
A. Nenkova
VLM
20
52
0
24 Nov 2021
PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers
Xiaoyi Dong
Jianmin Bao
Ting Zhang
Dongdong Chen
Weiming Zhang
Lu Yuan
Dong Chen
Fang Wen
Nenghai Yu
Baining Guo
ViT
37
238
0
24 Nov 2021
VIOLET : End-to-End Video-Language Transformers with Masked Visual-token Modeling
Tsu-jui Fu
Linjie Li
Zhe Gan
Kevin Qinghong Lin
W. Wang
Lijuan Wang
Zicheng Liu
VLM
34
216
0
24 Nov 2021
Knowledge Enhanced Sports Game Summarization
Jiaan Wang
Zhixu Li
Tingyi Zhang
Duo Zheng
Jianfeng Qu
An Liu
Lei Zhao
Zhigang Chen
AI4TS
21
12
0
24 Nov 2021
UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling
Zhengyuan Yang
Zhe Gan
Jianfeng Wang
Xiaowei Hu
Faisal Ahmed
Zicheng Liu
Yumao Lu
Lijuan Wang
19
111
0
23 Nov 2021
Zero-Shot Open-Book Question Answering
Sia Gholami
M. Noori
RALM
8
10
0
22 Nov 2021
Florence: A New Foundation Model for Computer Vision
Lu Yuan
Dongdong Chen
Yi-Ling Chen
Noel Codella
Xiyang Dai
...
Zhen Xiao
Jianwei Yang
Michael Zeng
Luowei Zhou
Pengchuan Zhang
VLM
24
878
0
22 Nov 2021
Enhancing Multilingual Language Model with Massive Multilingual Knowledge Triples
Linlin Liu
Xin Li
Ruidan He
Lidong Bing
Shafiq R. Joty
Luo Si
KELM
35
18
0
22 Nov 2021
Efficient Softmax Approximation for Deep Neural Networks with Attention Mechanism
Ihor Vasyltsov
Wooseok Chang
25
12
0
21 Nov 2021
Capitalization and Punctuation Restoration: a Survey
V. Pais
D. Tufis
17
19
0
21 Nov 2021
DeepQR: Neural-based Quality Ratings for Learnersourced Multiple-Choice Questions
Lin Ni
Qiming Bao
Xiaoxuan Li
Qianqian Qi
Paul Denny
Jim Warren
Michael Witbrock
Jiamo Liu
AI4Ed
6
15
0
19 Nov 2021
SimMIM: A Simple Framework for Masked Image Modeling
Zhenda Xie
Zheng-Wei Zhang
Yue Cao
Yutong Lin
Jianmin Bao
Zhuliang Yao
Qi Dai
Han Hu
37
1,309
0
18 Nov 2021
Restormer: Efficient Transformer for High-Resolution Image Restoration
Syed Waqas Zamir
Aditya Arora
Salman Khan
Munawar Hayat
F. Khan
Ming-Hsuan Yang
ViT
32
2,120
0
18 Nov 2021
Pegasus@Dravidian-CodeMix-HASOC2021: Analyzing Social Media Content for Detection of Offensive Text
Pawan Kalyan Jada
Konthala Yasaswini
Karthik Puranik
Anbukkarasi Sampath
S. Thangasamy
K. Thamburaj
20
0
0
18 Nov 2021
Merging Models with Fisher-Weighted Averaging
Michael Matena
Colin Raffel
FedML
MoMe
27
348
0
18 Nov 2021
Seeking Common but Distinguishing Difference, A Joint Aspect-based Sentiment Analysis Model
Hongjiang Jing
Zuchao Li
Hai Zhao
Shu Jiang
17
24
0
18 Nov 2021
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing
Pengcheng He
Jianfeng Gao
Weizhu Chen
30
1,115
0
18 Nov 2021
Linking-Enhanced Pre-Training for Table Semantic Parsing
Bowen Qin
Lihan Wang
Binyuan Hui
Ruiying Geng
Zhen Cao
Min Yang
Jian Sun
Yongbin Li
27
1
0
18 Nov 2021
EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching
Yaya Shi
Xu Yang
Haiyang Xu
Chunfen Yuan
Bing Li
Weiming Hu
Zhengjun Zha
33
33
0
17 Nov 2021
Achieving Human Parity on Visual Question Answering
Ming Yan
Haiyang Xu
Chenliang Li
Junfeng Tian
Bin Bi
...
Ji Zhang
Songfang Huang
Fei Huang
Luo Si
Rong Jin
24
12
0
17 Nov 2021
Document AI: Benchmarks, Models and Applications
Lei Cui
Yiheng Xu
Tengchao Lv
Furu Wei
VLM
13
69
0
16 Nov 2021
Interpreting Language Models Through Knowledge Graph Extraction
Vinitra Swamy
Angelika Romanou
Martin Jaggi
20
20
0
16 Nov 2021
WikiContradiction: Detecting Self-Contradiction Articles on Wikipedia
Cheng-Mao Hsu
Cheng-Te Li
Diego Sáez-Trumper
Yi-Zhan Hsu
SSL
14
13
0
16 Nov 2021
Adversarially Constructed Evaluation Sets Are More Challenging, but May Not Be Fair
Jason Phang
Angelica Chen
William Huang
Samuel R. Bowman
AAML
28
13
0
16 Nov 2021
FACOS: Finding API Relevant Contents on Stack Overflow with Semantic and Syntactic Analysis
K. Luong
M. Hadi
Ferdian Thung
Fatemeh H. Fard
David Lo
16
4
0
14 Nov 2021
On Transferability of Prompt Tuning for Natural Language Processing
Yusheng Su
Xiaozhi Wang
Yujia Qin
Chi-Min Chan
Yankai Lin
...
Peng Li
Juanzi Li
Lei Hou
Maosong Sun
Jie Zhou
AAML
VLM
18
98
0
12 Nov 2021
Extraction of Medication Names from Twitter Using Augmentation and an Ensemble of Language Models
I. Kulev
Berkay Köprü
Raul Rodriguez-Esteban
Diego Saldana Miranda
Yi Huang
Alessandro La Torraca
Elif Özkirimli
MedIm
13
4
0
12 Nov 2021
AnswerSumm: A Manually-Curated Dataset and Pipeline for Answer Summarization
Alexander R. Fabbri
Xiaojian Wu
Srini Iyer
Haoran Li
Mona T. Diab
16
14
0
11 Nov 2021
A Survey of Visual Transformers
Yang Liu
Yao Zhang
Yixin Wang
Feng Hou
Jin Yuan
Jiang Tian
Yang Zhang
Zhongchao Shi
Jianping Fan
Zhiqiang He
3DGS
ViT
69
330
0
11 Nov 2021
Improving Large-scale Language Models and Resources for Filipino
Jan Christian Blaise Cruz
C. Cheng
AI4CE
24
27
0
11 Nov 2021
Kronecker Factorization for Preventing Catastrophic Forgetting in Large-scale Medical Entity Linking
Denis Jered McInerney
Luyang Kong
Kristjan Arumae
Byron C. Wallace
Parminder Bhatia
CLL
19
1
0
11 Nov 2021
Recent Advances in Automated Question Answering In Biomedical Domain
K. D. Baksi
14
0
0
10 Nov 2021
Prune Once for All: Sparse Pre-Trained Language Models
Ofir Zafrir
Ariel Larey
Guy Boudoukh
Haihao Shen
Moshe Wasserblat
VLM
23
82
0
10 Nov 2021
Critical Sentence Identification in Legal Cases Using Multi-Class Classification
Sahan Jayasinghe
Lakith Rambukkanage
Ashan Silva
Nisansa de Silva
A. Perera
AILaw
ELM
35
4
0
10 Nov 2021
Human-in-the-Loop Disinformation Detection: Stance, Sentiment, or Something Else?
Alexander Michael Daniel
23
1
0
09 Nov 2021
How does a Pre-Trained Transformer Integrate Contextual Keywords? Application to Humanitarian Computing
Valentin Barrière
Guillaume Jacquet
8
1
0
07 Nov 2021
Previous
1
2
3
...
49
50
51
...
68
69
70
Next