1910.10683
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
50 / 7,921 papers shown
Title
Measuring Mathematical Problem Solving With the MATH Dataset
Dan Hendrycks
Collin Burns
Saurav Kadavath
Akul Arora
Steven Basart
Eric Tang
D. Song
Jacob Steinhardt
ReLM
FaML
55
1,787
0
05 Mar 2021
Attention is Not All You Need: Pure Attention Loses Rank Doubly Exponentially with Depth
Yihe Dong
Jean-Baptiste Cordonnier
Andreas Loukas
7
372
0
05 Mar 2021
WIT: Wikipedia-based Image Text Dataset for Multimodal Multilingual Machine Learning
Krishna Srinivasan
K. Raman
Jiecao Chen
Michael Bendersky
Marc Najork
VLM
197
308
0
02 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
86
27,569
0
26 Feb 2021
A Primer on Contrastive Pretraining in Language Processing: Methods, Lessons Learned and Perspectives
Nils Rethmeier
Isabelle Augenstein
SSL
VLM
70
90
0
25 Feb 2021
Pruning the Index Contents for Memory Efficient Open-Domain QA
Martin Fajcik
Martin Docekal
Karel Ondrej
Pavel Smrz
21
8
0
21 Feb 2021
Formal Language Theory Meets Modern NLP
William Merrill
AI4CE
NAI
14
12
0
19 Feb 2021
CATE: Computation-aware Neural Architecture Encoding with Transformers
Shen Yan
Kaiqiang Song
Z. Feng
Mi Zhang
20
24
0
14 Feb 2021
Reasoning Over Virtual Knowledge Bases With Open Predicate Relations
Haitian Sun
Pat Verga
Bhuwan Dhingra
Ruslan Salakhutdinov
William W. Cohen
LRM
31
26
0
14 Feb 2021
PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them
Patrick Lewis
Yuxiang Wu
Linqing Liu
Pasquale Minervini
Heinrich Küttler
Aleksandra Piktus
Pontus Stenetorp
Sebastian Riedel
RALM
23
228
0
13 Feb 2021
Proof Artifact Co-training for Theorem Proving with Language Models
Jesse Michael Han
Jason M. Rute
Yuhuai Wu
Edward W. Ayers
Stanislas Polu
AIMat
16
120
0
11 Feb 2021
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
278
1,978
0
09 Feb 2021
Damage detection using in-domain and cross-domain transfer learning
Z. Bukhsh
N. Jansen
Aaqib Saeed
21
42
0
07 Feb 2021
Unifying Vision-and-Language Tasks via Text Generation
Jaemin Cho
Jie Lei
Hao Tan
Mohit Bansal
MLLM
249
525
0
04 Feb 2021
The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics
Sebastian Gehrmann
Tosin P. Adewumi
Karmanya Aggarwal
Pawan Sasanka Ammanamanchi
Aremu Anuoluwapo
...
Nishant Subramani
Wei-ping Xu
Diyi Yang
Akhila Yerukola
Jiawei Zhou
VLM
246
283
0
02 Feb 2021
Measuring and Improving Consistency in Pretrained Language Models
Yanai Elazar
Nora Kassner
Shauli Ravfogel
Abhilasha Ravichander
Eduard H. Hovy
Hinrich Schütze
Yoav Goldberg
HILM
258
343
0
01 Feb 2021
TruthBot: An Automated Conversational Tool for Intent Learning, Curated Information Presenting, and Fake News Alerting
Ankur Gupta
Yash Varun
Prarthana Das
Nithya Muttineni
Parth Srivastava
Hamim Zafar
Tanmoy Chakraborty
Swaprava Nath
11
6
0
31 Jan 2021
Slot Self-Attentive Dialogue State Tracking
Fanghua Ye
Jarana Manotumruksa
Qiang Zhang
Shenghui Li
Emine Yilmaz
30
63
0
22 Jan 2021
Open-Domain Conversational Search Assistant with Transformers
Rafael Ferreira
Mariana Leite
David Semedo
João Magalhães
11
11
0
20 Jan 2021
ZeRO-Offload: Democratizing Billion-Scale Model Training
Jie Ren
Samyam Rajbhandari
Reza Yazdani Aminabadi
Olatunji Ruwase
Shuangyang Yang
Minjia Zhang
Dong Li
Yuxiong He
MoE
160
413
0
18 Jan 2021
Learning to Generate Task-Specific Adapters from Task Description
Qinyuan Ye
Xiang Ren
107
29
0
02 Jan 2021
Making Pre-trained Language Models Better Few-shot Learners
Tianyu Gao
Adam Fisch
Danqi Chen
241
1,913
0
31 Dec 2020
Improving BERT with Syntax-aware Local Attention
Zhongli Li
Qingyu Zhou
Chao Li
Ke Xu
Yunbo Cao
56
44
0
30 Dec 2020
ERICA: Improving Entity and Relation Understanding for Pre-trained Language Models via Contrastive Learning
Yujia Qin
Yankai Lin
Ryuichi Takanobu
Zhiyuan Liu
Peng Li
Heng Ji
Minlie Huang
Maosong Sun
Jie Zhou
38
125
0
30 Dec 2020
Contrastive Learning with Adversarial Perturbations for Conditional Text Generation
Seanie Lee
Dong Bok Lee
Sung Ju Hwang
13
105
0
14 Dec 2020
Data Boost: Text Data Augmentation Through Reinforcement Learning Guided Conditional Generation
Ruibo Liu
Guangxuan Xu
Chenyan Jia
Weicheng Ma
Lili Wang
Soroush Vosoughi
23
107
0
05 Dec 2020
GLGE: A New General Language Generation Evaluation Benchmark
Dayiheng Liu
Yu Yan
Yeyun Gong
Weizhen Qi
Hang Zhang
...
Jiancheng Lv
Ruofei Zhang
Winnie Wu
Ming Zhou
Nan Duan
ELM
28
66
0
24 Nov 2020
Language Models not just for Pre-training: Fast Online Neural Noisy Channel Modeling
Shruti Bhosale
Kyra Yee
Sergey Edunov
Michael Auli
42
7
0
13 Nov 2020
SeqGenSQL -- A Robust Sequence Generation Model for Structured Query Language
Ning Li
Bethany Keller
M. Butler
Daniel Matthew Cer
9
8
0
07 Nov 2020
EXAMS: A Multi-Subject High School Examinations Dataset for Cross-Lingual and Multilingual Question Answering
Momchil Hardalov
Todor Mihaylov
Dimitrina Zlatkova
Yoan Dinkov
Ivan Koychev
Preslav Nakov
AI4Ed
ELM
23
50
0
05 Nov 2020
Emergent Communication Pretraining for Few-Shot Machine Translation
Yaoyiran Li
E. Ponti
Ivan Vulić
Anna Korhonen
18
19
0
02 Nov 2020
Pre-trained Summarization Distillation
Sam Shleifer
Alexander M. Rush
11
98
0
24 Oct 2020
Multilingual Speech Translation with Efficient Finetuning of Pretrained Models
Xian Li
Changhan Wang
Yun Tang
C. Tran
Yuqing Tang
J. Pino
Alexei Baevski
Alexis Conneau
Michael Auli
19
6
0
24 Oct 2020
Text Editing by Command
Felix Faltings
Michel Galley
Gerold Hintz
Chris Brockett
Chris Quirk
Jianfeng Gao
Bill Dolan
KELM
134
36
0
24 Oct 2020
Rethinking embedding coupling in pre-trained language models
Hyung Won Chung
Thibault Févry
Henry Tsai
Melvin Johnson
Sebastian Ruder
93
142
0
24 Oct 2020
COUGH: A Challenge Dataset and Models for COVID-19 FAQ Retrieval
Xinliang Frederick Zhang
Heming Sun
Xiang Yue
Simon M. Lin
Huan Sun
RALM
68
17
0
24 Oct 2020
Compositional Generalization and Natural Language Variation: Can a Semantic Parsing Approach Handle Both?
Peter Shaw
Ming-Wei Chang
Panupong Pasupat
Kristina Toutanova
CoGe
25
182
0
24 Oct 2020
Dynamic Contextualized Word Embeddings
Valentin Hofmann
J. Pierrehumbert
Hinrich Schütze
27
50
0
23 Oct 2020
Unsupervised Multi-hop Question Answering by Question Generation
Liangming Pan
Wenhu Chen
Wenhan Xiong
Min-Yen Kan
William Yang Wang
29
57
0
23 Oct 2020
Neural Passage Retrieval with Improved Negative Contrast
Jing Lu
Gustavo Hernández Ábrego
Ji Ma
Jianmo Ni
Yinfei Yang
11
25
0
23 Oct 2020
Pretraining and Fine-Tuning Strategies for Sentiment Analysis of Latvian Tweets
Gaurish Thakkar
Marcis Pinnis
50
9
0
23 Oct 2020
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding
Dongling Xiao
Yukun Li
Han Zhang
Yu Sun
Hao Tian
Hua-Hong Wu
Haifeng Wang
19
38
0
23 Oct 2020
Language Models are Open Knowledge Graphs
Chenguang Wang
Xiao Liu
D. Song
SSL
KELM
16
135
0
22 Oct 2020
Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data
Lingkai Kong
Haoming Jiang
Yuchen Zhuang
Jie Lyu
T. Zhao
Chao Zhang
OODD
6
26
0
22 Oct 2020
Open-Domain Frame Semantic Parsing Using Transformers
Aditya Kalyanpur
Or Biran
Tom Breloff
Jennifer Chu-Carroll
Ariel Diertani
Owen Rambow
Mark Sammons
21
18
0
21 Oct 2020
Emformer: Efficient Memory Transformer Based Acoustic Model For Low Latency Streaming Speech Recognition
Yangyang Shi
Yongqiang Wang
Chunyang Wu
Ching-Feng Yeh
Julian Chan
Frank Zhang
Duc Le
M. Seltzer
49
168
0
21 Oct 2020
Local Knowledge Powered Conversational Agents
Sashank Santhanam
Wei Ping
Raul Puri
M. Shoeybi
M. Patwary
Bryan Catanzaro
6
4
0
20 Oct 2020
Neural Language Modeling for Contextualized Temporal Graph Generation
Aman Madaan
Yiming Yang
33
20
0
20 Oct 2020
Anti-Distillation: Improving reproducibility of deep networks
G. Shamir
Lorenzo Coviello
34
20
0
19 Oct 2020
Neural Databases
James Thorne
Majid Yazdani
Marzieh Saeidi
Fabrizio Silvestri
Sebastian Riedel
A. Halevy
NAI
18
9
0
14 Oct 2020