Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
    AIMat

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 8,295 papers shown
Data Boost: Text Data Augmentation Through Reinforcement Learning Guided Conditional Generation
Ruibo Liu
Guangxuan Xu
Chenyan Jia
Weicheng Ma
Lili Wang
Soroush Vosoughi
23
107
0
05 Dec 2020
WeaQA: Weak Supervision via Captions for Visual Question Answering
Pratyay Banerjee
Tejas Gokhale
Yezhou Yang
Chitta Baral
17
34
0
04 Dec 2020
Modifying Memories in Transformer Models
Chen Zhu
A. S. Rawat
Manzil Zaheer
Srinadh Bhojanapalli
Daliang Li
Felix X. Yu
Sanjiv Kumar
KELM
13
190
0
01 Dec 2020
GLGE: A New General Language Generation Evaluation Benchmark
Dayiheng Liu
Yu Yan
Yeyun Gong
Weizhen Qi
Hang Zhang
...
Jiancheng Lv
Ruofei Zhang
Winnie Wu
Ming Zhou
Nan Duan
ELM
35
66
0
24 Nov 2020
Language Models not just for Pre-training: Fast Online Neural Noisy Channel Modeling
Shruti Bhosale
Kyra Yee
Sergey Edunov
Michael Auli
50
7
0
13 Nov 2020
Generating Fact Checking Briefs
Angela Fan
Aleksandra Piktus
Fabio Petroni
Guillaume Wenzek
Marzieh Saeidi
Andreas Vlachos
Antoine Bordes
Sebastian Riedel
HILM
11
57
0
10 Nov 2020
When Do You Need Billions of Words of Pretraining Data?
Yian Zhang
Alex Warstadt
Haau-Sing Li
Samuel R. Bowman
21
136
0
10 Nov 2020
Multi-document Summarization via Deep Learning Techniques: A Survey
Congbo Ma
W. Zhang
Mingyu Guo
Hu Wang
Quan Z. Sheng
13
125
0
10 Nov 2020
SeqGenSQL -- A Robust Sequence Generation Model for Structured Query Language
Ning Li
Bethany Keller
M. Butler
Daniel Matthew Cer
11
8
0
07 Nov 2020
EXAMS: A Multi-Subject High School Examinations Dataset for Cross-Lingual and Multilingual Question Answering
Momchil Hardalov
Todor Mihaylov
Dimitrina Zlatkova
Yoan Dinkov
Ivan Koychev
Preslav Nakov
AI4Ed
ELM
31
50
0
05 Nov 2020
Language Model is All You Need: Natural Language Understanding as Question Answering
Mahdi Namazifar
Alexandros Papangelis
Gökhan Tür
Dilek Z. Hakkani-Tür
19
47
0
05 Nov 2020
Emergent Communication Pretraining for Few-Shot Machine Translation
Yaoyiran Li
E. Ponti
Ivan Vulić
Anna Korhonen
23
19
0
02 Nov 2020
ABNIRML: Analyzing the Behavior of Neural IR Models
Sean MacAvaney
Sergey Feldman
Nazli Goharian
Doug Downey
Arman Cohan
15
49
0
02 Nov 2020
Pre-trained Summarization Distillation
Sam Shleifer
Alexander M. Rush
15
98
0
24 Oct 2020
CoCo: Controllable Counterfactuals for Evaluating Dialogue State Trackers
Shiyang Li
Semih Yavuz
Kazuma Hashimoto
Jia Li
Tong Niu
Nazneen Rajani
Xifeng Yan
Yingbo Zhou
Caiming Xiong
36
62
0
24 Oct 2020
Multilingual Speech Translation with Efficient Finetuning of Pretrained Models
Xian Li
Changhan Wang
Yun Tang
C. Tran
Yuqing Tang
J. Pino
Alexei Baevski
Alexis Conneau
Michael Auli
19
6
0
24 Oct 2020
Text Editing by Command
Felix Faltings
Michel Galley
Gerold Hintz
Chris Brockett
Chris Quirk
Jianfeng Gao
Bill Dolan
KELM
139
36
0
24 Oct 2020
Rethinking embedding coupling in pre-trained language models
Hyung Won Chung
Thibault Févry
Henry Tsai
Melvin Johnson
Sebastian Ruder
93
142
0
24 Oct 2020
COUGH: A Challenge Dataset and Models for COVID-19 FAQ Retrieval
Xinliang Frederick Zhang
Heming Sun
Xiang Yue
Simon M. Lin
Huan Sun
RALM
68
17
0
24 Oct 2020
Compositional Generalization and Natural Language Variation: Can a Semantic Parsing Approach Handle Both?
Peter Shaw
Ming-Wei Chang
Panupong Pasupat
Kristina Toutanova
CoGe
25
182
0
24 Oct 2020
AQuaMuSe: Automatically Generating Datasets for Query-Based Multi-Document Summarization
Sayali Kulkarni
Sheide Chammas
Wan Zhu
Fei Sha
Eugene Ie
RALM
58
52
0
23 Oct 2020
Dynamic Contextualized Word Embeddings
Valentin Hofmann
J. Pierrehumbert
Hinrich Schütze
29
51
0
23 Oct 2020
Unsupervised Multi-hop Question Answering by Question Generation
Liangming Pan
Wenhu Chen
Wenhan Xiong
Min-Yen Kan
William Yang Wang
29
57
0
23 Oct 2020
Neural Passage Retrieval with Improved Negative Contrast
Jing Lu
Gustavo Hernández Ábrego
Ji Ma
Jianmo Ni
Yinfei Yang
21
25
0
23 Oct 2020
Pretraining and Fine-Tuning Strategies for Sentiment Analysis of Latvian Tweets
Gaurish Thakkar
Marcis Pinnis
55
9
0
23 Oct 2020
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding
Dongling Xiao
Yukun Li
Han Zhang
Yu Sun
Hao Tian
Hua-Hong Wu
Haifeng Wang
19
38
0
23 Oct 2020
Language Models are Open Knowledge Graphs
Chenguang Wang
Xiao Liu
D. Song
SSL
KELM
24
135
0
22 Oct 2020
Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data
Lingkai Kong
Haoming Jiang
Yuchen Zhuang
Jie Lyu
T. Zhao
Chao Zhang
OODD
19
26
0
22 Oct 2020
Open-Domain Frame Semantic Parsing Using Transformers
Aditya Kalyanpur
Or Biran
Tom Breloff
Jennifer Chu-Carroll
Ariel Diertani
Owen Rambow
Mark Sammons
26
18
0
21 Oct 2020
Emformer: Efficient Memory Transformer Based Acoustic Model For Low Latency Streaming Speech Recognition
Yangyang Shi
Yongqiang Wang
Chunyang Wu
Ching-Feng Yeh
Julian Chan
Frank Zhang
Duc Le
M. Seltzer
49
168
0
21 Oct 2020
An Empirical Investigation of Contextualized Number Prediction
Daniel M. Spokoyny
Taylor Berg-Kirkpatrick
AI4TS
14
34
0
20 Oct 2020
Local Knowledge Powered Conversational Agents
Sashank Santhanam
Wei Ping
Raul Puri
M. Shoeybi
M. Patwary
Bryan Catanzaro
19
4
0
20 Oct 2020
Neural Language Modeling for Contextualized Temporal Graph Generation
Aman Madaan
Yiming Yang
33
20
0
20 Oct 2020
Anti-Distillation: Improving reproducibility of deep networks
G. Shamir
Lorenzo Coviello
34
20
0
19 Oct 2020
Neural Databases
James Thorne
Majid Yazdani
Marzieh Saeidi
Fabrizio Silvestri
Sebastian Riedel
A. Halevy
NAI
26
9
0
14 Oct 2020
Pretrained Transformers for Text Ranking: BERT and Beyond
Jimmy J. Lin
Rodrigo Nogueira
Andrew Yates
VLM
219
608
0
13 Oct 2020
Reformulating Unsupervised Style Transfer as Paraphrase Generation
Kalpesh Krishna
John Wieting
Mohit Iyyer
19
237
0
12 Oct 2020
SMYRF: Efficient Attention using Asymmetric Clustering
Giannis Daras
Nikita Kitaev
Augustus Odena
A. Dimakis
23
44
0
11 Oct 2020
Style Attuned Pre-training and Parameter Efficient Fine-tuning for Spoken Language Understanding
Jin Cao
Jun Wang
Wael Hamza
Kelly Vanee
Shang-Wen Li
17
10
0
09 Oct 2020
Precise Task Formalization Matters in Winograd Schema Evaluations
Haokun Liu
William Huang
Dhara Mungra
Samuel R. Bowman
ReLM
17
12
0
08 Oct 2020
Infusing Disease Knowledge into BERT for Health Question Answering, Medical Inference and Disease Name Recognition
Yun He
Ziwei Zhu
Yin Zhang
Qin Chen
James Caverlee
AI4MH
28
108
0
08 Oct 2020
Uncovering the Limits of Adversarial Training against Norm-Bounded Adversarial Examples
Sven Gowal
Chongli Qin
J. Uesato
Timothy A. Mann
Pushmeet Kohli
AAML
17
323
0
07 Oct 2020
Improving the Efficiency of Grammatical Error Correction with Erroneous Span Detection and Correction
M. Chen
Tao Ge
Xingxing Zhang
Furu Wei
M. Zhou
6
46
0
07 Oct 2020
Local Label Point Correction for Edge Detection of Overlapping Cervical Cells
Jiawei Liu
Huijie Fan
Qiang Wang
Wentao Li
Yandong Tang
Danbo Wang
Mingyi Zhou
Li Chen
13
9
0
05 Oct 2020
PMI-Masking: Principled masking of correlated spans
Yoav Levine
Barak Lenz
Opher Lieber
Omri Abend
Kevin Leyton-Brown
Moshe Tennenholtz
Y. Shoham
14
72
0
05 Oct 2020
Effective Unsupervised Domain Adaptation with Adversarially Trained Language Models
Thuy-Trang Vu
Dinh Q. Phung
Gholamreza Haffari
6
24
0
05 Oct 2020
On Losses for Modern Language Models
Stephane Aroca-Ouellette
Frank Rudzicz
11
33
0
04 Oct 2020
Multi-View Sequence-to-Sequence Models with Conversational Structure for Abstractive Dialogue Summarization
Jiaao Chen
Diyi Yang
16
143
0
04 Oct 2020
An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot Labels
Ilias Chalkidis
Manos Fergadiotis
Sotiris Kotitsas
Prodromos Malakasiotis
Nikolaos Aletras
Ion Androutsopoulos
VLM
AI4TS
10
84
0
04 Oct 2020
LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention
Ikuya Yamada
Akari Asai
Hiroyuki Shindo
Hideaki Takeda
Yuji Matsumoto
22
662
0
02 Oct 2020