Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

23 October 2019

Sharan Narang

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

50 / 8,450 papers shown

Title
Multilingual Speech Translation with Efficient Finetuning of Pretrained Models Xian Li Changhan Wang Yun Tang C. Tran Yuqing Tang J. Pino Alexei Baevski Alexis Conneau Michael Auli 21 6 0 24 Oct 2020
Text Editing by Command Felix Faltings Michel Galley Gerold Hintz Chris Brockett Chris Quirk Jianfeng Gao Bill Dolan KELM 147 37 0 24 Oct 2020
Rethinking embedding coupling in pre-trained language models Hyung Won Chung Thibault Févry Henry Tsai Melvin Johnson Sebastian Ruder 95 142 0 24 Oct 2020
COUGH: A Challenge Dataset and Models for COVID-19 FAQ Retrieval Xinliang Frederick Zhang Heming Sun Xiang Yue Simon M. Lin Huan Sun RALM 70 17 0 24 Oct 2020
Compositional Generalization and Natural Language Variation: Can a Semantic Parsing Approach Handle Both? Peter Shaw Ming-Wei Chang Panupong Pasupat Kristina Toutanova CoGe 25 182 0 24 Oct 2020
AQuaMuSe: Automatically Generating Datasets for Query-Based Multi-Document Summarization Sayali Kulkarni Sheide Chammas Wan Zhu Fei Sha Eugene Ie RALM 64 52 0 23 Oct 2020
Dynamic Contextualized Word Embeddings Valentin Hofmann J. Pierrehumbert Hinrich Schütze 36 51 0 23 Oct 2020
Synthetic Data Augmentation for Zero-Shot Cross-Lingual Question Answering Arij Riabi Thomas Scialom Rachel Keraron Benoît Sagot Djamé Seddah Jacopo Staiano 142 52 0 23 Oct 2020
Unsupervised Multi-hop Question Answering by Question Generation Liangming Pan Wenhu Chen Wenhan Xiong Min-Yen Kan William Yang Wang 29 58 0 23 Oct 2020
Answering Open-Domain Questions of Varying Reasoning Steps from Text Peng Qi Haejun Lee OghenetegiriTGSido Christopher D. Manning KELM RALM LRM 191 55 0 23 Oct 2020
Neural Passage Retrieval with Improved Negative Contrast Jing Lu Gustavo Hernández Ábrego Ji Ma Jianmo Ni Yinfei Yang 21 25 0 23 Oct 2020
Pretraining and Fine-Tuning Strategies for Sentiment Analysis of Latvian Tweets Gaurish Thakkar Marcis Pinnis 60 9 0 23 Oct 2020
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding Dongling Xiao Yukun Li Han Zhang Yu Sun Hao Tian Hua-Hong Wu Haifeng Wang 19 38 0 23 Oct 2020
Language Models are Open Knowledge Graphs Chenguang Wang Xiao Liu D. Song SSL KELM 24 135 0 22 Oct 2020
Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data Lingkai Kong Haoming Jiang Yuchen Zhuang Jie Lyu T. Zhao Chao Zhang OODD 19 26 0 22 Oct 2020
DuoRAT: Towards Simpler Text-to-SQL Models Torsten Scholak Raymond Li Dzmitry Bahdanau H. D. Vries C. Pal AI4TS 22 26 0 21 Oct 2020
Open-Domain Frame Semantic Parsing Using Transformers Aditya Kalyanpur Or Biran Tom Breloff Jennifer Chu-Carroll Ariel Diertani Owen Rambow Mark Sammons 26 18 0 21 Oct 2020
Emformer: Efficient Memory Transformer Based Acoustic Model For Low Latency Streaming Speech Recognition Yangyang Shi Yongqiang Wang Chunyang Wu Ching-Feng Yeh Julian Chan Frank Zhang Duc Le M. Seltzer 56 168 0 21 Oct 2020
An Empirical Investigation of Contextualized Number Prediction Daniel M. Spokoyny Taylor Berg-Kirkpatrick AI4TS 19 34 0 20 Oct 2020
Local Knowledge Powered Conversational Agents Sashank Santhanam Wei Ping Raul Puri M. Shoeybi M. Patwary Bryan Catanzaro 19 4 0 20 Oct 2020
Neural Language Modeling for Contextualized Temporal Graph Generation Aman Madaan Yiming Yang 36 20 0 20 Oct 2020
Anti-Distillation: Improving reproducibility of deep networks G. Shamir Lorenzo Coviello 42 20 0 19 Oct 2020
ChemBERTa: Large-Scale Self-Supervised Pretraining for Molecular Property Prediction Seyone Chithrananda Gabriel Grand Bharath Ramsundar AI4CE 20 388 0 19 Oct 2020
Neural Databases James Thorne Majid Yazdani Marzieh Saeidi Fabrizio Silvestri Sebastian Riedel A. Halevy NAI 26 9 0 14 Oct 2020
Pretrained Transformers for Text Ranking: BERT and Beyond Jimmy J. Lin Rodrigo Nogueira Andrew Yates VLM 219 610 0 13 Oct 2020
Reformulating Unsupervised Style Transfer as Paraphrase Generation Kalpesh Krishna John Wieting Mohit Iyyer 19 237 0 12 Oct 2020
SMYRF: Efficient Attention using Asymmetric Clustering Giannis Daras Nikita Kitaev Augustus Odena A. Dimakis 25 44 0 11 Oct 2020
Style Attuned Pre-training and Parameter Efficient Fine-tuning for Spoken Language Understanding Jin Cao Jun Wang Wael Hamza Kelly Vanee Shang-Wen Li 17 10 0 09 Oct 2020
Precise Task Formalization Matters in Winograd Schema Evaluations Haokun Liu William Huang Dhara Mungra Samuel R. Bowman ReLM 17 12 0 08 Oct 2020
Infusing Disease Knowledge into BERT for Health Question Answering, Medical Inference and Disease Name Recognition Yun He Ziwei Zhu Yin Zhang Qin Chen James Caverlee AI4MH 28 108 0 08 Oct 2020
Uncovering the Limits of Adversarial Training against Norm-Bounded Adversarial Examples Sven Gowal Chongli Qin J. Uesato Timothy A. Mann Pushmeet Kohli AAML 17 323 0 07 Oct 2020
Toward Stance-based Personas for Opinionated Dialogues Thomas Scialom Serra Sinem Tekiroğlu Jacopo Staiano Marco Guerini 20 9 0 07 Oct 2020
Improving the Efficiency of Grammatical Error Correction with Erroneous Span Detection and Correction M. Chen Tao Ge Xingxing Zhang Furu Wei M. Zhou 19 46 0 07 Oct 2020
Local Label Point Correction for Edge Detection of Overlapping Cervical Cells Jiawei Liu Huijie Fan Qiang Wang Wentao Li Yandong Tang Danbo Wang Mingyi Zhou Li Chen 13 9 0 05 Oct 2020
PMI-Masking: Principled masking of correlated spans Yoav Levine Barak Lenz Opher Lieber Omri Abend Kevin Leyton-Brown Moshe Tennenholtz Y. Shoham 14 72 0 05 Oct 2020
Effective Unsupervised Domain Adaptation with Adversarially Trained Language Models Thuy-Trang Vu Dinh Q. Phung Gholamreza Haffari 14 24 0 05 Oct 2020
On Losses for Modern Language Models Stephane Aroca-Ouellette Frank Rudzicz 14 33 0 04 Oct 2020
Multi-View Sequence-to-Sequence Models with Conversational Structure for Abstractive Dialogue Summarization Jiaao Chen Diyi Yang 27 143 0 04 Oct 2020
An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot Labels Ilias Chalkidis Manos Fergadiotis Sotiris Kotitsas Prodromos Malakasiotis Nikolaos Aletras Ion Androutsopoulos VLM AI4TS 20 84 0 04 Oct 2020
LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention Ikuya Yamada Akari Asai Hiroyuki Shindo Hideaki Takeda Yuji Matsumoto 22 662 0 02 Oct 2020
Beyond The Text: Analysis of Privacy Statements through Syntactic and Semantic Role Labeling Yan Shvartzshnaider Ananth Balashankar Vikas Patidar Thomas Wies L. Subramanian 19 4 0 01 Oct 2020
Learning Knowledge Bases with Parameters for Task-Oriented Dialogue Systems Andrea Madotto Samuel Cahyawijaya Genta Indra Winata Yan Xu Zihan Liu Zhaojiang Lin Pascale Fung 36 59 0 28 Sep 2020
KG-BART: Knowledge Graph-Augmented BART for Generative Commonsense Reasoning Ye Liu Yao Wan Lifang He Hao Peng Philip S. Yu 21 188 0 26 Sep 2020
Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data Jonathan Pilault Amine Elhattami C. Pal CLL MoE 21 89 0 19 Sep 2020
Self-Supervised Meta-Learning for Few-Shot Natural Language Classification Tasks Trapit Bansal Rishikesh Jha Tsendsuren Munkhdalai Andrew McCallum SSL VLM 20 87 0 17 Sep 2020
GraphCodeBERT: Pre-training Code Representations with Data Flow Daya Guo Shuo Ren Shuai Lu Zhangyin Feng Duyu Tang ... Dawn Drain Neel Sundaresan Jian Yin Daxin Jiang M. Zhou 56 1,094 0 17 Sep 2020
GLUCOSE: GeneraLized and COntextualized Story Explanations N. Mostafazadeh Aditya Kalyanpur Lori Moon David W. Buchanan Lauren Berkowitz Or Biran Jennifer Chu-Carroll 19 121 0 16 Sep 2020
Evaluating representations by the complexity of learning low-loss predictors William F. Whitney M. Song David Brandfonbrener Jaan Altosaar Kyunghyun Cho 23 23 0 15 Sep 2020
BERT-QE: Contextualized Query Expansion for Document Re-ranking Zhi Zheng Kai Hui Ben He Xianpei Han Le Sun Andrew Yates 19 93 0 15 Sep 2020
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners Timo Schick Hinrich Schütze 22 953 0 15 Sep 2020