1905.13453
Cited By
MultiQA: An Empirical Investigation of Generalization and Transfer in Reading Comprehension
31 May 2019
Alon Talmor
Jonathan Berant
Papers citing "MultiQA: An Empirical Investigation of Generalization and Transfer in Reading Comprehension"
38 / 38 papers shown
Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis, and LLMs Evaluations
Lifan Yuan
Yangyi Chen
Ganqu Cui
Hongcheng Gao
Fangyuan Zou
Xingyi Cheng
Heng Ji
Zhiyuan Liu
Maosong Sun
39
73
0
07 Jun 2023
Evaluating the Robustness of Machine Reading Comprehension Models to Low Resource Entity Renaming
Clemencia Siro
T. Ajayi
23
2
0
06 Apr 2023
UKP-SQuARE v3: A Platform for Multi-Agent QA Research
Haritz Puerto
Tim Baumgärtner
Rachneet Sachdeva
Haishuo Fang
Haotian Zhang
Sewin Tariverdian
Kexin Wang
Iryna Gurevych
28
2
0
31 Mar 2023
Understanding Finetuning for Factual Knowledge Extraction from Language Models
Mehran Kazemi
Sid Mittal
Deepak Ramachandran
KELM
34
10
0
26 Jan 2023
State-of-the-art generalisation research in NLP: A taxonomy and review
Dieuwke Hupkes
Mario Giulianelli
Verna Dankers
Mikel Artetxe
Yanai Elazar
...
Leila Khalatbari
Maria Ryskina
Rita Frieske
Ryan Cotterell
Zhijing Jin
116
93
0
06 Oct 2022
ATTEMPT: Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts
Akari Asai
Mohammadreza Salehi
Matthew E. Peters
Hannaneh Hajishirzi
130
100
0
24 May 2022
When to Use Multi-Task Learning vs Intermediate Fine-Tuning for Pre-Trained Encoder Transfer Learning
Orion Weller
Kevin Seppi
Matt Gardner
16
21
0
17 May 2022
Match-Prompt: Improving Multi-task Generalization Ability for Neural Text Matching via Prompt Learning
Shicheng Xu
Liang Pang
Huawei Shen
Xueqi Cheng
VLM
33
17
0
06 Apr 2022
Investigating Selective Prediction Approaches Across Several Tasks in IID, OOD, and Adversarial Settings
Neeraj Varshney
Swaroop Mishra
Chitta Baral
10
55
0
01 Mar 2022
Active Learning Over Multiple Domains in Natural Language Tasks
Shayne Longpre
Julia Reisler
E. G. Huang
Yi Lu
Andrew J. Frank
Nikhil Ramesh
Chris DuBois
OOD
24
13
0
01 Feb 2022
CommonsenseQA 2.0: Exposing the Limits of AI through Gamification
Alon Talmor
Ori Yoran
Ronan Le Bras
Chandrasekhar Bhagavatula
Yoav Goldberg
Yejin Choi
Jonathan Berant
ELM
19
141
0
14 Jan 2022
MetaQA: Combining Expert Agents for Multi-Skill Question Answering
Haritz Puerto
Gözde Gül Sahin
Iryna Gurevych
LLMAG
33
20
0
03 Dec 2021
Can Explanations Be Useful for Calibrating Black Box Models?
Xi Ye
Greg Durrett
FAtt
24
25
0
14 Oct 2021
Encoder Adaptation of Dense Passage Retrieval for Open-Domain Question Answering
Minghan Li
Jimmy J. Lin
AI4CE
25
9
0
04 Oct 2021
Single-dataset Experts for Multi-dataset Question Answering
Dan Friedman
Ben Dodge
Danqi Chen
MoMe
132
26
0
28 Sep 2021
PPT: Pre-trained Prompt Tuning for Few-shot Learning
Yuxian Gu
Xu Han
Zhiyuan Liu
Minlie Huang
VLM
51
402
0
09 Sep 2021
SyGNS: A Systematic Generalization Testbed Based on Natural Language Semantics
Hitomi Yanaka
K. Mineshima
Kentaro Inui
NAI
AI4CE
38
11
0
02 Jun 2021
ERICA: Improving Entity and Relation Understanding for Pre-trained Language Models via Contrastive Learning
Yujia Qin
Yankai Lin
Ryuichi Takanobu
Zhiyuan Liu
Peng Li
Heng Ji
Minlie Huang
Maosong Sun
Jie Zhou
55
125
0
30 Dec 2020
BERT Goes Shopping: Comparing Distributional Models for Product Representations
Federico Bianchi
Bingqing Yu
Jacopo Tagliabue
12
15
0
17 Dec 2020
Generalizing Cross-Document Event Coreference Resolution Across Multiple Corpora
M. Bugert
Nils Reimers
Iryna Gurevych
21
17
0
24 Nov 2020
XOR QA: Cross-lingual Open-Retrieval Question Answering
Akari Asai
Jungo Kasai
J. Clark
Kenton Lee
Eunsol Choi
Hannaneh Hajishirzi
14
145
0
22 Oct 2020
CDEvalSumm: An Empirical Study of Cross-Dataset Evaluation for Neural Summarization Systems
Yiran Chen
Pengfei Liu
Ming Zhong
Zi-Yi Dou
Danqing Wang
Xipeng Qiu
Xuanjing Huang
ELM
27
24
0
11 Oct 2020
On the Importance of Adaptive Data Collection for Extremely Imbalanced Pairwise Tasks
Stephen Mussmann
Robin Jia
Percy Liang
8
15
0
10 Oct 2020
MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on a Massive Scale
Andreas Rucklé
Jonas Pfeiffer
Iryna Gurevych
27
37
0
02 Oct 2020
Knowledge-Aware Procedural Text Understanding with Multi-Stage Training
Zhihan Zhang
Xiubo Geng
Tao Qin
Yunfang Wu
Daxin Jiang
29
22
0
28 Sep 2020
Transferability of Natural Language Inference to Biomedical Question Answering
Minbyul Jeong
Mujeen Sung
Gangwoo Kim
Donghyeon Kim
Wonjin Yoon
J. Yoo
Jaewoo Kang
19
37
0
01 Jul 2020
Leap-Of-Thought: Teaching Pre-Trained Models to Systematically Reason Over Implicit Knowledge
Alon Talmor
Oyvind Tafjord
Peter Clark
Yoav Goldberg
Jonathan Berant
ReLM
LRM
30
39
0
11 Jun 2020
UnifiedQA: Crossing Format Boundaries With a Single QA System
Daniel Khashabi
Sewon Min
Tushar Khot
Ashish Sabharwal
Oyvind Tafjord
Peter Clark
Hannaneh Hajishirzi
35
719
0
02 May 2020
Syntactic Data Augmentation Increases Robustness to Inference Heuristics
Junghyun Min
R. Thomas McCoy
Dipanjan Das
Emily Pitler
Tal Linzen
30
175
0
24 Apr 2020
Training Question Answering Models From Synthetic Data
Raul Puri
Ryan Spring
M. Patwary
M. Shoeybi
Bryan Catanzaro
ELM
24
159
0
22 Feb 2020
A Survey on Machine Reading Comprehension Systems
Razieh Baradaran
Razieh Ghiasi
Hossein Amirkhani
FaML
13
85
0
06 Jan 2020
SberQuAD -- Russian Reading Comprehension Dataset: Description and Analysis
Pavel Efimov
Andrey Chertok
Leonid Boytsov
Pavel Braslavski
60
59
0
20 Dec 2019
An Exploration of Data Augmentation and Sampling Techniques for Domain-Agnostic Question Answering
Shayne Longpre
Yi Lu
Zhucheng Tu
Christopher DuBois
19
70
0
04 Dec 2019
The Dialogue Dodecathlon: Open-Domain Knowledge and Image Grounded Conversational Agents
Kurt Shuster
Da Ju
Stephen Roller
Emily Dinan
Y-Lan Boureau
Jason Weston
20
81
0
09 Nov 2019
Coreference Resolution as Query-based Span Prediction
Wei Yu Wu
Fei Wang
Arianna Yuan
Fei Wu
Jiwei Li
LRM
33
180
0
05 Nov 2019
MRQA 2019 Shared Task: Evaluating Generalization in Reading Comprehension
Adam Fisch
Alon Talmor
Robin Jia
Minjoon Seo
Eunsol Choi
Danqi Chen
22
301
0
22 Oct 2019
A Constructive Prediction of the Generalization Error Across Scales
Jonathan S. Rosenfeld
Amir Rosenfeld
Yonatan Belinkov
Nir Shavit
24
205
0
27 Sep 2019
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
297
6,959
0
20 Apr 2018