Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
1910.10683
Cited By
v1
v2
v3
v4 (latest)
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Journal of machine learning research (JMLR), 2019
23 October 2019
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
47 / 11,947 papers shown
Title
Length-controllable Abstractive Summarization by Guiding with Summary Prototype
Itsumi Saito
Kyosuke Nishida
Kosuke Nishida
Atsushi Otsuka
Hisako Asano
J. Tomita
Hiroyuki Shindo
Yuji Matsumoto
226
37
0
21 Jan 2020
A multimodal deep learning approach for named entity recognition from social media
M. Asgari-Chenaghlu
M. Feizi-Derakhshi
Leili Farzinvash
M. Balafar
C. Motamed
219
36
0
19 Jan 2020
RobBERT: a Dutch RoBERTa-based Language Model
Findings (Findings), 2020
Pieter Delobelle
Thomas Winters
Bettina Berendt
192
261
0
17 Jan 2020
ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training
Findings (Findings), 2020
Weizhen Qi
Yu Yan
Yeyun Gong
Dayiheng Liu
Nan Duan
Jiusheng Chen
Ruofei Zhang
Ming Zhou
AI4TS
322
469
0
13 Jan 2020
Learning Accurate Integer Transformer Machine-Translation Models
SN Computer Science (SN Comput. Sci.), 2020
Ephrem Wu
92
4
0
03 Jan 2020
What Does My QA Model Know? Devising Controlled Probes using Expert Knowledge
Transactions of the Association for Computational Linguistics (TACL), 2019
Kyle Richardson
Ashish Sabharwal
179
47
0
31 Dec 2019
All-in-One Image-Grounded Conversational Agents
Da Ju
Kurt Shuster
Y-Lan Boureau
Jason Weston
LLMAG
133
9
0
28 Dec 2019
Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation
International Conference on Language Resources and Evaluation (LREC), 2019
Israfel Salazar
Mary Dabre
Atsushi Fujita
Sadao Kurohashi
178
6
0
26 Dec 2019
PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization
International Conference on Machine Learning (ICML), 2019
Jingqing Zhang
Yao-Min Zhao
Mohammad Saleh
Peter J. Liu
RALM
3DGS
725
2,266
0
18 Dec 2019
Multilingual is not enough: BERT for Finnish
Antti Virtanen
Jenna Kanerva
Rami Ilo
Jouni Luoma
Juhani Luotolahti
T. Salakoski
Filip Ginter
S. Pyysalo
206
298
0
15 Dec 2019
WaLDORf: Wasteless Language-model Distillation On Reading-comprehension
J. Tian
A. Kreuzer
Pai-Hung Chen
Hans-Martin Will
VLM
162
3
0
13 Dec 2019
Extending Machine Language Models toward Human-Level Language Understanding
James L. McClelland
Felix Hill
Maja R. Rudolph
Jason Baldridge
Hinrich Schütze
LRM
143
36
0
12 Dec 2019
FlauBERT: Unsupervised Language Model Pre-training for French
International Conference on Language Resources and Evaluation (LREC), 2019
Hang Le
Loïc Vial
Jibril Frej
Vincent Segonne
Maximin Coavoux
Benjamin Lecouteux
A. Allauzen
Benoît Crabbé
Laurent Besacier
D. Schwab
AI4CE
304
429
0
11 Dec 2019
Zero-shot Text Classification With Generative Language Models
Raul Puri
Bryan Catanzaro
VLM
146
113
0
10 Dec 2019
Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline
European Conference on Computer Vision (ECCV), 2019
Vishvak Murahari
Dhruv Batra
Devi Parikh
Abhishek Das
VLM
292
119
0
05 Dec 2019
12-in-1: Multi-Task Vision and Language Representation Learning
Computer Vision and Pattern Recognition (CVPR), 2019
Jiasen Lu
Vedanuj Goswami
Marcus Rohrbach
Devi Parikh
Stefan Lee
VLM
ObjD
279
499
0
05 Dec 2019
BLiMP: The Benchmark of Linguistic Minimal Pairs for English
Transactions of the Association for Computational Linguistics (TACL), 2019
Alex Warstadt
Alicia Parrish
Haokun Liu
Anhad Mohananey
Wei Peng
Sheng-Fu Wang
Samuel R. Bowman
377
608
0
02 Dec 2019
What's Hidden in a Randomly Weighted Neural Network?
Computer Vision and Pattern Recognition (CVPR), 2019
Vivek Ramanujan
Mitchell Wortsman
Aniruddha Kembhavi
Ali Farhadi
Mohammad Rastegari
225
389
0
29 Nov 2019
Iterative Answer Prediction with Pointer-Augmented Multimodal Transformers for TextVQA
Computer Vision and Pattern Recognition (CVPR), 2019
Ronghang Hu
Amanpreet Singh
Trevor Darrell
Marcus Rohrbach
289
222
0
14 Nov 2019
KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation
Transactions of the Association for Computational Linguistics (TACL), 2019
Xiaozhi Wang
Tianyu Gao
Zhaocheng Zhu
Zhengyan Zhang
Zhiyuan Liu
Juan-Zi Li
Jian Tang
307
758
0
13 Nov 2019
CamemBERT: a Tasty French Language Model
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
Louis Martin
Benjamin Muller
Pedro Ortiz Suarez
Yoann Dupont
Laurent Romary
Eric Villemonte de la Clergerie
Djamé Seddah
Benoît Sagot
455
1,043
0
10 Nov 2019
INSET: Sentence Infilling with INter-SEntential Transformer
Yichen Huang
Yizhe Zhang
Oussama Elachqar
Yu Cheng
219
1
0
10 Nov 2019
Learning to Few-Shot Learn Across Diverse Natural Language Classification Tasks
International Conference on Computational Linguistics (COLING), 2019
Trapit Bansal
Rishikesh Jha
Andrew McCallum
SSL
232
126
0
10 Nov 2019
The Dialogue Dodecathlon: Open-Domain Knowledge and Image Grounded Conversational Agents
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
Kurt Shuster
Da Ju
Stephen Roller
Emily Dinan
Y-Lan Boureau
Jason Weston
226
84
0
09 Nov 2019
Sentence Meta-Embeddings for Unsupervised Semantic Textual Similarity
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
Nina Poerner
Ulli Waltinger
Hinrich Schütze
AI4TS
445
21
0
09 Nov 2019
SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
Haoming Jiang
Pengcheng He
Weizhu Chen
Xiaodong Liu
Jianfeng Gao
T. Zhao
564
586
0
08 Nov 2019
Contrastive Multi-document Question Generation
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2019
W. Cho
Yizhe Zhang
Sudha Rao
Asli Celikyilmaz
Chenyan Xiong
Jianfeng Gao
Mengdi Wang
Bill Dolan
SyDa
314
31
0
08 Nov 2019
BERTs of a feather do not generalize together: Large variability in generalization across models with similar test set performance
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackBoxNLP), 2019
R. Thomas McCoy
Junghyun Min
Tal Linzen
344
156
0
07 Nov 2019
Unsupervised Cross-lingual Representation Learning at Scale
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
Alexis Conneau
Kartikay Khandelwal
Naman Goyal
Vishrav Chaudhary
Guillaume Wenzek
Francisco Guzmán
Edouard Grave
Myle Ott
Luke Zettlemoyer
Veselin Stoyanov
445
7,503
0
05 Nov 2019
DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
Yizhe Zhang
Siqi Sun
Michel Galley
Yen-Chun Chen
Chris Brockett
Xiang Gao
Jianfeng Gao
Jingjing Liu
W. Dolan
VLM
596
1,645
0
01 Nov 2019
CCNet: Extracting High Quality Monolingual Datasets from Web Crawl Data
International Conference on Language Resources and Evaluation (LREC), 2019
Guillaume Wenzek
Marie-Anne Lachaux
Alexis Conneau
Vishrav Chaudhary
Francisco Guzmán
Armand Joulin
Edouard Grave
423
741
0
01 Nov 2019
Multi-Stage Document Ranking with BERT
Rodrigo Nogueira
Wei Yang
Dong Wang
Jimmy J. Lin
265
449
0
31 Oct 2019
Discourse-Aware Neural Extractive Text Summarization
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
Jiacheng Xu
Zhe Gan
Yu Cheng
Jingjing Liu
BDL
287
289
0
30 Oct 2019
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
M. Lewis
Yinhan Liu
Naman Goyal
Marjan Ghazvininejad
Abdel-rahman Mohamed
Omer Levy
Veselin Stoyanov
Luke Zettlemoyer
AIMat
VLM
751
11,966
0
29 Oct 2019
ZeRO: Memory Optimizations Toward Training Trillion Parameter Models
International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2019
Samyam Rajbhandari
Jeff Rasley
Olatunji Ruwase
Yuxiong He
ALM
AI4CE
381
1,356
0
04 Oct 2019
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
International Conference on Learning Representations (ICLR), 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
1.1K
7,057
0
26 Sep 2019
FreeLB: Enhanced Adversarial Training for Natural Language Understanding
International Conference on Learning Representations (ICLR), 2019
Chen Zhu
Yu Cheng
Zhe Gan
S. Sun
Tom Goldstein
Jingjing Liu
AAML
608
487
0
25 Sep 2019
Portuguese Named Entity Recognition using BERT-CRF
Fábio Souza
Rodrigo Nogueira
R. Lotufo
221
277
0
23 Sep 2019
TinyBERT: Distilling BERT for Natural Language Understanding
Findings (Findings), 2019
Xiaoqi Jiao
Yichun Yin
Lifeng Shang
Xin Jiang
Xiao Chen
Linlin Li
F. Wang
Qun Liu
VLM
515
2,117
0
23 Sep 2019
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
Mohammad Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
MoE
869
2,348
0
17 Sep 2019
I-MAD: Interpretable Malware Detector Using Galaxy Transformer
Computers & security (Comput. Secur.), 2019
Miles Q. Li
Benjamin C. M. Fung
P. Charland
Steven H. H. Ding
228
38
0
15 Sep 2019
Conditional Text Generation for Harmonious Human-Machine Interaction
Bin Guo
Hao Wang
Yasan Ding
Wei Wu
Shaoyang Hao
Yueqi Sun
Zhiwen Yu
159
4
0
08 Sep 2019
Taming Momentum in a Distributed Asynchronous Environment
Ido Hakimi
Saar Barkai
Moshe Gabel
Assaf Schuster
247
24
0
26 Jul 2019
Contextual Word Representations: A Contextual Introduction
Noah A. Smith
215
35
0
15 Feb 2019
Are All Layers Created Equal?
Chiyuan Zhang
Samy Bengio
Y. Singer
274
156
0
06 Feb 2019
Neural Abstractive Text Summarization with Sequence-to-Sequence Models
Tian Shi
Yaser Keneshloo
Naren Ramakrishnan
Chandan K. Reddy
303
252
0
05 Dec 2018
Deep Learning for Genomics: A Concise Overview
Tianwei Yue
Yuanxin Wang
Longxiang Zhang
Chunming Gu
Haohan Wang
Wenping Wang
Qi Lyu
Yujie Dun
AILaw
VLM
BDL
245
96
0
02 Feb 2018
Previous
1
2
3
...
237
238
239