arXiv:1910.10683, v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Journal of Machine Learning Research (JMLR), 2019
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 12,032 papers shown
Constructing interval variables via faceted Rasch measurement and multitask deep learning: a hate speech application
Chris J. Kennedy
Geoff Bacon
A. Sahn
Claudia von Vacano
192
106
0
22 Sep 2020
An Empirical Study on Neural Keyphrase Generation
North American Chapter of the Association for Computational Linguistics (NAACL), 2020
Rui Meng
Xingdi Yuan
Tong Wang
Sanqiang Zhao
Adam Trischler
Daqing He
218
44
0
22 Sep 2020
UCD-CS at W-NUT 2020 Shared Task-3: A Text to Text Approach for COVID-19 Event Extraction on Social Media
Congcong Wang
David Lillis
123
4
0
21 Sep 2020
VirtualFlow: Decoupling Deep Learning Models from the Underlying Hardware
Conference on Machine Learning and Systems (MLSys), 2020
Andrew Or
Haoyu Zhang
M. Freedman
308
13
0
20 Sep 2020
Can questions summarize a corpus? Using question generation for characterizing COVID-19 research
Gabriela Surita
Rodrigo Nogueira
R. Lotufo
112
7
0
19 Sep 2020
Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data
International Conference on Learning Representations (ICLR), 2020
Jonathan Pilault
Amine Elhattami
C. Pal
CLL
MoE
288
102
0
19 Sep 2020
Self-Supervised Meta-Learning for Few-Shot Natural Language Classification Tasks
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Trapit Bansal
Rishikesh Jha
Tsendsuren Munkhdalai
Andrew McCallum
SSL
VLM
300
97
0
17 Sep 2020
GraphCodeBERT: Pre-training Code Representations with Data Flow
International Conference on Learning Representations (ICLR), 2020
Daya Guo
Shuo Ren
Shuai Lu
Zhangyin Feng
Duyu Tang
...
Dawn Drain
Neel Sundaresan
Jian Yin
Daxin Jiang
M. Zhou
765
1,472
0
17 Sep 2020
Self-supervised pre-training and contrastive representation learning for multiple-choice video QA
AAAI Conference on Artificial Intelligence (AAAI), 2020
Seonhoon Kim
Seohyeong Jeong
Eunbyul Kim
Inho Kang
Nojun Kwak
SSL
286
43
0
17 Sep 2020
GLUCOSE: GeneraLized and COntextualized Story Explanations
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
N. Mostafazadeh
Aditya Kalyanpur
Lori Moon
David W. Buchanan
Lauren Berkowitz
Or Biran
Jennifer Chu-Carroll
326
127
0
16 Sep 2020
Evaluating representations by the complexity of learning low-loss predictors
William F. Whitney
M. Song
David Brandfonbrener
Jaan Altosaar
Dong Wang
222
25
0
15 Sep 2020
Augmented Natural Language for Generative Sequence Labeling
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Ben Athiwaratkun
Cicero Nogueira dos Santos
Jason Krone
Bing Xiang
VLM
217
67
0
15 Sep 2020
BERT-QE: Contextualized Query Expansion for Document Re-ranking
Findings of the Association for Computational Linguistics: EMNLP (Findings), 2020
Zhi Zheng
Kai Hui
Ben He
Xianpei Han
Le Sun
Andrew Yates
215
107
0
15 Sep 2020
Autoregressive Knowledge Distillation through Imitation Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Alexander Lin
Jeremy Wohlwend
Howard Chen
Tao Lei
236
52
0
15 Sep 2020
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners
North American Chapter of the Association for Computational Linguistics (NAACL), 2020
Timo Schick
Hinrich Schütze
438
1,063
0
15 Sep 2020
Noisy Self-Knowledge Distillation for Text Summarization
North American Chapter of the Association for Computational Linguistics (NAACL), 2020
Yang Liu
S. Shen
Mirella Lapata
155
47
0
15 Sep 2020
Current Limitations of Language Models: What You Need is Retrieval
Aran Komatsuzaki
LRM
118
3
0
15 Sep 2020
Efficient Transformers: A Survey
ACM Computing Surveys (ACM CSUR), 2020
Yi Tay
Mostafa Dehghani
Dara Bahri
Donald Metzler
VLM
854
1,350
0
14 Sep 2020
Contrastive Triple Extraction with Generative Transformer
AAAI Conference on Artificial Intelligence (AAAI), 2020
Hongbin Ye
Ningyu Zhang
Shumin Deng
Mosha Chen
Chuanqi Tan
Fei Huang
Huajun Chen
306
140
0
14 Sep 2020
BoostingBERT: Integrating Multi-Class Boosting into BERT for NLP Tasks
Tongwen Huang
Qingyun She
Junlin Zhang
133
18
0
13 Sep 2020
Compressed Deep Networks: Goodbye SVD, Hello Robust Low-Rank Approximation
M. Tukan
Alaa Maalouf
Matan Weksler
Dan Feldman
215
9
0
11 Sep 2020
Generating Accurate Assert Statements for Unit Test Cases using Pretrained Transformers
International Conference/Workshop on Automation of Software Test (AST), 2020
Michele Tufano
Dawn Drain
Alexey Svyatkovskiy
Neel Sundaresan
ViT
153
110
0
11 Sep 2020
Unit Test Case Generation with Transformers and Focal Context
Michele Tufano
Dawn Drain
Alexey Svyatkovskiy
Shao Kun Deng
Neel Sundaresan
ViT
266
242
0
11 Sep 2020
Semantic Relations and Deep Learning
Vivi Nastase
Stan Szpakowicz
GNN
280
0
0
11 Sep 2020
Brain2Word: Decoding Brain Activity for Language Generation
Nicolas Affolter
Béni Egressy
Damian Pascual
Roger Wattenhofer
208
26
0
10 Sep 2020
QED: A Framework and Dataset for Explanations in Question Answering
Transactions of the Association for Computational Linguistics (TACL), 2020
Matthew Lamm
J. Palomaki
Chris Alberti
D. Andor
Eunsol Choi
Livio Baldini Soares
Michael Collins
229
80
0
08 Sep 2020
Measuring Massive Multitask Language Understanding
International Conference on Learning Representations (ICLR), 2020
Dan Hendrycks
Collin Burns
Steven Basart
Andy Zou
Mantas Mazeika
Basel Alomair
Jacob Steinhardt
ELM
RALM
2.3K
6,566
0
07 Sep 2020
Team Alex at CLEF CheckThat! 2020: Identifying Check-Worthy Tweets With Transformer Models
Conference and Labs of the Evaluation Forum (CLEF), 2020
Alex Nikolov
Giovanni Da San Martino
Ivan Koychev
Preslav Nakov
160
22
0
07 Sep 2020
Learning to summarize from human feedback
Neural Information Processing Systems (NeurIPS), 2020
Nisan Stiennon
Long Ouyang
Jeff Wu
Daniel M. Ziegler
Ryan J. Lowe
Chelsea Voss
Alec Radford
Dario Amodei
Paul Christiano
ALM
865
2,739
0
02 Sep 2020
Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition
Interspeech (Interspeech), 2020
Wei Li
James Qin
Chung-Cheng Chiu
Ruoming Pang
Yanzhang He
184
15
0
30 Aug 2020
Neural Code Search Revisited: Enhancing Code Snippet Retrieval through Natural Language Intent
Geert Heyman
Tom Van Cutsem
177
35
0
27 Aug 2020
GREEK-BERT: The Greeks visiting Sesame Street
Hellenic Conference on Artificial Intelligence (HAI), 2020
John Koutsikakis
Ilias Chalkidis
Prodromos Malakasiotis
Ion Androutsopoulos
185
105
0
27 Aug 2020
What is being transferred in transfer learning?
Neural Information Processing Systems (NeurIPS), 2020
Behnam Neyshabur
Hanie Sedghi
Chiyuan Zhang
381
587
0
26 Aug 2020
Analysis and Evaluation of Language Models for Word Sense Disambiguation
Daniel Loureiro
Kiamehr Rezaee
Mohammad Taher Pilehvar
Jose Camacho-Collados
302
14
0
26 Aug 2020
A Baseline Analysis for Podcast Abstractive Summarization
Chujie Zheng
Harry J. Wang
Kunpeng Zhang
Ling Fan
181
13
0
24 Aug 2020
Example-Based Named Entity Recognition
Morteza Ziyadi
Yuting Sun
Abhishek Goswami
Jade Huang
Weizhu Chen
121
35
0
24 Aug 2020
End to End Dialogue Transformer
Ondrej Mekota
Memduh Gökirmak
Petr Laitoch
84
1
0
24 Aug 2020
PTT5: Pretraining and validating the T5 model on Brazilian Portuguese data
Diedre Carmo
Marcos Piau
Israel Campiotti
Rodrigo Nogueira
R. Lotufo
LM&MA
130
64
0
20 Aug 2020
Scruples: A Corpus of Community Ethical Judgments on 32,000 Real-Life Anecdotes
Nicholas Lourie
Ronan Le Bras
Yejin Choi
261
134
0
20 Aug 2020
Discovering Useful Sentence Representations from Large Pretrained Language Models
Nishant Subramani
Nivedita Suresh
123
7
0
20 Aug 2020
Language Models as Knowledge Bases: On Entity Representations, Storage Capacity, and Paraphrased Queries
Benjamin Heinzerling
Kentaro Inui
KELM
242
149
0
20 Aug 2020
Lite Training Strategies for Portuguese-English and English-Portuguese Translation
Alexandre Lopes
Rodrigo Nogueira
R. Lotufo
Hélio Pedrini
100
9
0
20 Aug 2020
Adaptation Algorithms for Neural Network-Based Speech Recognition: An Overview
P. Bell
Joachim Fainberg
Ondřej Klejch
Jinyu Li
Steve Renals
P. Swietojanski
316
82
0
14 Aug 2020
Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems
Andrea Madotto
Zihan Liu
Mohammad Kachuee
Pascale Fung
319
62
0
14 Aug 2020
Compression of Deep Learning Models for Text: A Survey
ACM Transactions on Knowledge Discovery from Data (TKDD), 2020
Manish Gupta
Puneet Agrawal
VLM
MedIm
AI4CE
510
134
0
12 Aug 2020
The Language Interpretability Tool: Extensible, Interactive Visualizations and Analysis for NLP Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Ian Tenney
James Wexler
Jasmijn Bastings
Tolga Bolukbasi
Andy Coenen
...
Ellen Jiang
Mahima Pushkarna
Carey Radebaugh
Emily Reif
Ann Yuan
VLM
356
210
0
12 Aug 2020
SemEval-2020 Task 10: Emphasis Selection for Written Text in Visual Media
International Workshop on Semantic Evaluation (SemEval), 2020
Amirreza Shirani
Franck Dernoncourt
Nedim Lipka
P. Asente
J. Echevarria
Thamar Solorio
134
21
0
07 Aug 2020
aschern at SemEval-2020 Task 11: It Takes Three to Tango: RoBERTa, CRF, and Transfer Learning
International Workshop on Semantic Evaluation (SemEval), 2020
Anton Chernyavskiy
Dmitry Ilvovsky
Preslav Nakov
132
27
0
06 Aug 2020
Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2020
Patrick Lewis
Pontus Stenetorp
Sebastian Riedel
OOD
ELM
288
196
0
06 Aug 2020
Forecasting AI Progress: A Research Agenda
Ross Gruetzemacher
Florian E. Dorner
Niko Bernaola-Alvarez
Charlie Giattino
D. Manheim
AI4TS
138
39
0
04 Aug 2020