ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.10683
  4. Cited By
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Journal of machine learning research (JMLR), 2019
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat
ArXiv (abs)PDFHTML

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 11,959 papers shown
Title
Neural Entity Linking: A Survey of Models Based on Deep Learning
Neural Entity Linking: A Survey of Models Based on Deep Learning
Ozge Sevgili
Artem Shelmanov
Mikhail V. Arkhipov
Sergey Petrakov
Christian Biemann
VLM3DVAI4TS
481
141
0
31 May 2020
Beyond Leaderboards: A survey of methods for revealing weaknesses in
  Natural Language Inference data and models
Beyond Leaderboards: A survey of methods for revealing weaknesses in Natural Language Inference data and models
Viktor Schlegel
Goran Nenadic
Riza Batista-Navarro
ELM
203
18
0
29 May 2020
Language Models are Few-Shot Learners
Language Models are Few-Shot LearnersNeural Information Processing Systems (NeurIPS), 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
2.0K
51,623
0
28 May 2020
Syntactic Structure Distillation Pretraining For Bidirectional Encoders
Syntactic Structure Distillation Pretraining For Bidirectional EncodersTransactions of the Association for Computational Linguistics (TACL), 2020
A. Kuncoro
Lingpeng Kong
Daniel Fried
Dani Yogatama
Laura Rimell
Chris Dyer
Phil Blunsom
174
35
0
27 May 2020
Counterfactual Detection meets Transfer Learning
Counterfactual Detection meets Transfer Learning
Kelechi Nwaike
L. Jiao
129
2
0
27 May 2020
Predict-then-Decide: A Predictive Approach for Wait or Answer Task in
  Dialogue Systems
Predict-then-Decide: A Predictive Approach for Wait or Answer Task in Dialogue SystemsIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Zehao Lin
Shaobo Cui
Guodun Li
Xiaoming Kang
Feng Ji
Feng-Lin Li
Zhongzhou Zhao
Haiqing Chen
Yin Zhang
194
3
0
27 May 2020
English Intermediate-Task Training Improves Zero-Shot Cross-Lingual
  Transfer Too
English Intermediate-Task Training Improves Zero-Shot Cross-Lingual Transfer Too
Jason Phang
Iacer Calixto
Phu Mon Htut
Yada Pruksachatkun
Haokun Liu
Clara Vania
Katharina Kann
Samuel R. Bowman
LRM
206
67
0
26 May 2020
ParsBERT: Transformer-based Model for Persian Language Understanding
ParsBERT: Transformer-based Model for Persian Language UnderstandingNeural Processing Letters (NPL), 2020
Mehrdad Farahani
Mohammad Gharachorloo
Marzieh Farahani
Mohammad Manthouri
234
235
0
26 May 2020
NILE : Natural Language Inference with Faithful Natural Language
  Explanations
NILE : Natural Language Inference with Faithful Natural Language ExplanationsAnnual Meeting of the Association for Computational Linguistics (ACL), 2020
Sawan Kumar
Partha P. Talukdar
XAILRM
259
169
0
25 May 2020
Summarizing and Exploring Tabular Data in Conversational Search
Summarizing and Exploring Tabular Data in Conversational Search
Shuo Zhang
Zhuyun Dai
K. Balog
Jamie Callan
RALMLMTD
203
44
0
23 May 2020
Text-to-Text Pre-Training for Data-to-Text Tasks
Text-to-Text Pre-Training for Data-to-Text Tasks
Mihir Kale
Abhinav Rastogi
AI4CE
293
217
0
21 May 2020
BiQGEMM: Matrix Multiplication with Lookup Table For Binary-Coding-based
  Quantized DNNs
BiQGEMM: Matrix Multiplication with Lookup Table For Binary-Coding-based Quantized DNNs
Yongkweon Jeon
Baeseong Park
S. Kwon
Byeongwook Kim
Jeongin Yun
Dongsoo Lee
MQ
289
40
0
20 May 2020
Normalized Attention Without Probability Cage
Normalized Attention Without Probability Cage
Oliver Richter
Roger Wattenhofer
230
22
0
19 May 2020
Weak-Attention Suppression For Transformer Based Speech Recognition
Weak-Attention Suppression For Transformer Based Speech Recognition
Yangyang Shi
Yongqiang Wang
Chunyang Wu
Christian Fuegen
Frank Zhang
Duc Le
Ching-Feng Yeh
M. Seltzer
219
18
0
18 May 2020
Towards Question Format Independent Numerical Reasoning: A Set of
  Prerequisite Tasks
Towards Question Format Independent Numerical Reasoning: A Set of Prerequisite Tasks
Swaroop Mishra
Arindam Mitra
Neeraj Varshney
Bhavdeep Singh Sachdeva
Chitta Baral
AIMat
111
14
0
18 May 2020
CS-NLP team at SemEval-2020 Task 4: Evaluation of State-of-the-art NLP
  Deep Learning Architectures on Commonsense Reasoning Task
CS-NLP team at SemEval-2020 Task 4: Evaluation of State-of-the-art NLP Deep Learning Architectures on Commonsense Reasoning Task
Sirwe Saeedi
Ali (Aliakbar) Panahi
Seyran Saeedi
A. Fong
ReLMELMLRM
221
12
0
17 May 2020
CERT: Contrastive Self-supervised Learning for Language Understanding
CERT: Contrastive Self-supervised Learning for Language Understanding
Hongchao Fang
Sicheng Wang
Meng Zhou
Jiayuan Ding
P. Xie
ELMSSL
185
368
0
16 May 2020
Movement Pruning: Adaptive Sparsity by Fine-Tuning
Movement Pruning: Adaptive Sparsity by Fine-Tuning
Victor Sanh
Thomas Wolf
Alexander M. Rush
307
550
0
15 May 2020
Machine Reading Comprehension: The Role of Contextualized Language
  Models and Beyond
Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond
Zhuosheng Zhang
Hai Zhao
Rui Wang
196
66
0
13 May 2020
Large Scale Multi-Actor Generative Dialog Modeling
Large Scale Multi-Actor Generative Dialog Modeling
Alex Boyd
Raul Puri
Mohammad Shoeybi
M. Patwary
Bryan Catanzaro
152
24
0
13 May 2020
SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis
SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis
Hao Tian
Can Gao
Xinyan Xiao
Hao Liu
Bolei He
Hua Wu
Haifeng Wang
Feng Wu
226
267
0
12 May 2020
Simultaneous paraphrasing and translation by fine-tuning Transformer
  models
Simultaneous paraphrasing and translation by fine-tuning Transformer models
Rakesh Chada
88
5
0
12 May 2020
Enabling Language Models to Fill in the Blanks
Enabling Language Models to Fill in the Blanks
Chris Donahue
Mina Lee
Abigail Z. Jacobs
233
202
0
11 May 2020
A Dataset for Statutory Reasoning in Tax Law Entailment and Question
  Answering
A Dataset for Statutory Reasoning in Tax Law Entailment and Question Answering
Nils Holzenberger
Andrew Blair-Stanek
Benjamin Van Durme
ELMAILaw
176
74
0
11 May 2020
Leveraging Monolingual Data with Self-Supervision for Multilingual
  Neural Machine Translation
Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation
Aditya Siddhant
Ankur Bapna
Yuan Cao
Orhan Firat
Mengzhao Chen
Sneha Kudugunta
N. Arivazhagan
Yonghui Wu
218
88
0
11 May 2020
How Context Affects Language Models' Factual Predictions
How Context Affects Language Models' Factual Predictions
Fabio Petroni
Patrick Lewis
Aleksandra Piktus
Tim Rocktaschel
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM
190
252
0
10 May 2020
Transformer Based Language Models for Similar Text Retrieval and Ranking
Transformer Based Language Models for Similar Text Retrieval and Ranking
Javed Qadrud-Din
Ashraf Bah Rabiou
Ryan S Walker
Ravindra Soni
M. Gajek
Gabriel Pack
A. Rangaraj
105
6
0
10 May 2020
Measuring the Algorithmic Efficiency of Neural Networks
Measuring the Algorithmic Efficiency of Neural Networks
Danny Hernandez
Tom B. Brown
526
107
0
08 May 2020
JASS: Japanese-specific Sequence to Sequence Pre-training for Neural
  Machine Translation
JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation
Zhuoyuan Mao
Fabien Cromierès
Mary Dabre
Israfel Salazar
Sadao Kurohashi
164
4
0
07 May 2020
Multi-Stage Conversational Passage Retrieval: An Approach to Fusing Term
  Importance Estimation and Neural Query Rewriting
Multi-Stage Conversational Passage Retrieval: An Approach to Fusing Term Importance Estimation and Neural Query Rewriting
Sheng-Chieh Lin
Jheng-Hong Yang
Rodrigo Nogueira
Ming-Feng Tsai
Chuan-Ju Wang
Jimmy J. Lin
164
25
0
05 May 2020
Towards QoS-Aware and Resource-Efficient GPU Microservices Based on
  Spatial Multitasking GPUs In Datacenters
Towards QoS-Aware and Resource-Efficient GPU Microservices Based on Spatial Multitasking GPUs In Datacenters
Wei Zhang
Quan Chen
Kaihua Fu
Ningxin Zheng
Zhiyi Huang
Jingwen Leng
Chao Li
Wenli Zheng
Minyi Guo
79
3
0
05 May 2020
Establishing Baselines for Text Classification in Low-Resource Languages
Establishing Baselines for Text Classification in Low-Resource Languages
Jan Christian Blaise Cruz
C. Cheng
167
45
0
05 May 2020
Exploring Controllable Text Generation Techniques
Exploring Controllable Text Generation TechniquesInternational Conference on Computational Linguistics (COLING), 2020
Shrimai Prabhumoye
A. Black
Ruslan Salakhutdinov
AI4CE
355
96
0
04 May 2020
Generating SOAP Notes from Doctor-Patient Conversations Using Modular
  Summarization Techniques
Generating SOAP Notes from Doctor-Patient Conversations Using Modular Summarization TechniquesAnnual Meeting of the Association for Computational Linguistics (ACL), 2020
Kundan Krishna
Sopan Khosla
Jeffrey P. Bigham
Zachary Chase Lipton
334
134
0
04 May 2020
How Can We Accelerate Progress Towards Human-like Linguistic
  Generalization?
How Can We Accelerate Progress Towards Human-like Linguistic Generalization?Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Tal Linzen
448
204
0
03 May 2020
Teaching Machine Comprehension with Compositional Explanations
Teaching Machine Comprehension with Compositional ExplanationsFindings (Findings), 2020
Qinyuan Ye
Xiao Huang
Elizabeth Boschee
Xiang Ren
LRMReLM
309
36
0
02 May 2020
ForecastQA: A Question Answering Challenge for Event Forecasting with
  Temporal Text Data
ForecastQA: A Question Answering Challenge for Event Forecasting with Temporal Text DataAnnual Meeting of the Association for Computational Linguistics (ACL), 2020
Woojeong Jin
Rahul Khanna
Suji Kim
Dong-Ho Lee
Fred Morstatter
Aram Galstyan
Xiang Ren
AI4TS
254
47
0
02 May 2020
Synthesizer: Rethinking Self-Attention in Transformer Models
Synthesizer: Rethinking Self-Attention in Transformer ModelsInternational Conference on Machine Learning (ICML), 2020
Yi Tay
Dara Bahri
Donald Metzler
Da-Cheng Juan
Zhe Zhao
Che Zheng
261
379
0
02 May 2020
UnifiedQA: Crossing Format Boundaries With a Single QA System
UnifiedQA: Crossing Format Boundaries With a Single QA SystemFindings (Findings), 2020
Daniel Khashabi
Sewon Min
Tushar Khot
Ashish Sabharwal
Oyvind Tafjord
Peter Clark
Hannaneh Hajishirzi
569
791
0
02 May 2020
Connecting the Dots: A Knowledgeable Path Generator for Commonsense
  Question Answering
Connecting the Dots: A Knowledgeable Path Generator for Commonsense Question AnsweringFindings (Findings), 2020
Peifeng Wang
Nanyun Peng
Filip Ilievski
Pedro A. Szekely
Xiang Ren
188
93
0
02 May 2020
Scalable Multi-Hop Relational Reasoning for Knowledge-Aware Question
  Answering
Scalable Multi-Hop Relational Reasoning for Knowledge-Aware Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Yanlin Feng
Xinyue Chen
Bill Yuchen Lin
Peifeng Wang
Jun Yan
Xiang Ren
LRMKELM
197
263
0
01 May 2020
Intermediate-Task Transfer Learning with Pretrained Models for Natural
  Language Understanding: When and Why Does It Work?
Intermediate-Task Transfer Learning with Pretrained Models for Natural Language Understanding: When and Why Does It Work?Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Yada Pruksachatkun
Jason Phang
Haokun Liu
Phu Mon Htut
Xiaoyi Zhang
Richard Yuanzhe Pang
Clara Vania
Katharina Kann
Samuel R. Bowman
CLLLRM
205
204
0
01 May 2020
Probing Contextual Language Models for Common Ground with Visual
  Representations
Probing Contextual Language Models for Common Ground with Visual Representations
Gabriel Ilharco
Rowan Zellers
Ali Farhadi
Hannaneh Hajishirzi
312
14
0
01 May 2020
POINTER: Constrained Progressive Text Generation via Insertion-based
  Generative Pre-training
POINTER: Constrained Progressive Text Generation via Insertion-based Generative Pre-trainingConference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Yizhe Zhang
Guoyin Wang
Chunyuan Li
Zhe Gan
Chris Brockett
Bill Dolan
219
30
0
01 May 2020
Beneath the Tip of the Iceberg: Current Challenges and New Directions in
  Sentiment Analysis Research
Beneath the Tip of the Iceberg: Current Challenges and New Directions in Sentiment Analysis ResearchIEEE Transactions on Affective Computing (IEEE TAC), 2020
Soujanya Poria
Devamanyu Hazarika
Navonil Majumder
Amélie Reymond
554
221
0
01 May 2020
HERO: Hierarchical Encoder for Video+Language Omni-representation
  Pre-training
HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-trainingConference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Linjie Li
Yen-Chun Chen
Yu Cheng
Zhe Gan
Licheng Yu
Jingjing Liu
MLLMVLMOffRLAI4TS
645
536
0
01 May 2020
Cross-Linguistic Syntactic Evaluation of Word Prediction Models
Cross-Linguistic Syntactic Evaluation of Word Prediction ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2020
Aaron Mueller
Garrett Nicolai
Panayiota Petrou-Zeniou
N. Talmina
Tal Linzen
230
63
0
01 May 2020
Progressively Pretrained Dense Corpus Index for Open-Domain Question
  Answering
Progressively Pretrained Dense Corpus Index for Open-Domain Question AnsweringConference of the European Chapter of the Association for Computational Linguistics (EACL), 2020
Wenhan Xiong
Hong Wang
Wenjie Wang
RALM
308
17
0
30 Apr 2020
TLDR: Extreme Summarization of Scientific Documents
TLDR: Extreme Summarization of Scientific DocumentsFindings (Findings), 2020
Isabel Cachola
Kyle Lo
Arman Cohan
Daniel S. Weld
268
250
0
30 Apr 2020
Template Guided Text Generation for Task-Oriented Dialogue
Template Guided Text Generation for Task-Oriented Dialogue
Mihir Kale
Abhinav Rastogi
217
12
0
30 Apr 2020
Previous
123...235236237238239240
Next