ResearchTrend.AI
Understanding Dataset Design Choices for Multi-hop Reasoning

27 April 2019
Jifan Chen
Greg Durrett
    LRM

Papers citing "Understanding Dataset Design Choices for Multi-hop Reasoning"

50 / 67 papers shown
FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data
Deren Lei
Yaxi Li
Siyao Li
Mengya Hu
Rui Xu
Ken Archer
Mingyu Wang
Emily Ching
Alex Deng
SyDa, HILM, LRM
178
2
0
28 Jan 2025
Do Large Language Models Perform Latent Multi-Hop Reasoning without Exploiting Shortcuts?
Sohee Yang
Nora Kassner
E. Gribovskaya
Sebastian Riedel
Mor Geva
LRM, KELM, ReLM
219
12
0
25 Nov 2024
Evaluating Long Range Dependency Handling in Code Generation LLMs
Yannick Assogba
Donghao Ren
136
1
0
23 Jul 2024
MoreHopQA: More Than Multi-hop Reasoning
Julian Schnitzler
Xanh Ho
Jiahao Huang
Florian Boudin
Saku Sugawara
Akiko Aizawa
LRM
137
13
0
19 Jun 2024
ACCORD: Closing the Commonsense Measurability Gap
François Roewer-Després
Jinyue Feng
Zining Zhu
Frank Rudzicz
LRM
176
0
0
04 Jun 2024
Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models
Mosh Levy
Alon Jacoby
Yoav Goldberg
194
108
0
19 Feb 2024
What Else Do I Need to Know? The Effect of Background Information on Users' Reliance on QA Systems
Navita Goyal
Eleftheria Briakou
Amanda Liu
Connor Baumler
C. Bonial
J. Micher
Clare R. Voss
Marine Carpuat
Hal Daumé
108
10
0
23 May 2023
HOP, UNION, GENERATE: Explainable Multi-hop Reasoning without Rationale Supervision
Wenting Zhao
Justin T. Chiu
Claire Cardie
Alexander M. Rush
LRM
105
5
0
23 May 2023
It Takes Two to Tango: Navigating Conceptualizations of NLP Tasks and Measurements of Performance
Arjun Subramonian
Xingdi Yuan
Hal Daumé
Su Lin Blodgett
116
19
0
15 May 2023
Explicit Planning Helps Language Models in Logical Reasoning
Hongyu Zhao
Kangrui Wang
Mo Yu
Hongyuan Mei
LRM, ReLM
222
17
0
28 Mar 2023
Natural Language Reasoning, A Survey
Fei Yu
Hongbo Zhang
Prayag Tiwari
Benyou Wang
ReLM, LRM
210
82
0
26 Mar 2023
Analyzing the Effectiveness of the Underlying Reasoning Tasks in Multi-hop Question Answering
Xanh Ho
A. Nguyen
Saku Sugawara
Akiko Aizawa
LRM
119
8
0
12 Feb 2023
TASA: Deceiving Question Answering Models by Twin Answer Sentences Attack
Yu Cao
Dianqi Li
Meng Fang
Wanrong Zhu
Jun Gao
Yibing Zhan
Dacheng Tao
AAML
96
19
0
27 Oct 2022
ReasonChainQA: Text-based Complex Question Answering with Explainable Evidence Chains
Minjun Zhu
Yixuan Weng
Shizhu He
Kang Liu
Jun Zhao
LRM
93
6
0
17 Oct 2022
Assessing Out-of-Domain Language Model Performance from Few Examples
Prasann Singhal
Jarad Forristal
Xi Ye
Greg Durrett
LRM
97
5
0
13 Oct 2022
How Well Do Multi-hop Reading Comprehension Models Understand Date Information?
Xanh Ho
Saku Sugawara
Akiko Aizawa
103
2
0
11 Oct 2022
Understanding and Improving Zero-shot Multi-hop Reasoning in Generative Question Answering
Zhengbao Jiang
Jun Araki
Haibo Ding
Graham Neubig
LRM
95
12
0
09 Oct 2022
Machine Reading, Fast and Slow: When Do Models "Understand" Language?
Sagnik Ray Choudhury
Anna Rogers
Isabelle Augenstein
LRM
100
18
0
15 Sep 2022
A Survey on Measuring and Mitigating Reasoning Shortcuts in Machine Reading Comprehension
Xanh Ho
Johannes Mario Meissner
Saku Sugawara
Akiko Aizawa
OffRL
110
5
0
05 Sep 2022
MRCLens: an MRC Dataset Bias Detection Toolkit
Yifan Zhong
Haohan Wang
Eric Xing
80
0
0
18 Jul 2022
QAMPARI: An Open-domain Question Answering Benchmark for Questions with Many Answers from Multiple Paragraphs
S. Amouyal
Tomer Wolfson
Ohad Rubin
Ori Yoran
Jonathan Herzig
Jonathan Berant
RALM, VLM
183
35
0
25 May 2022
RobustLR: Evaluating Robustness to Logical Perturbation in Deductive Reasoning
Soumya Sanyal
Zeyi Liao
Xiang Ren
ELM, ReLM, LRM
182
23
0
25 May 2022
Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts
H. Trivedi
Niranjan Balasubramanian
Tushar Khot
Ashish Sabharwal
ReLM, LRM
127
11
0
25 May 2022
The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning
Xi Ye
Greg Durrett
ReLM, LRM
178
207
0
06 May 2022
Reasoning over Public and Private Data in Retrieval-Based Systems
Simran Arora
Patrick Lewis
Angela Fan
Jacob Kahn
Christopher Ré
76
28
0
14 Mar 2022
What Makes Reading Comprehension Questions Difficult?
Saku Sugawara
Nikita Nangia
Alex Warstadt
Sam Bowman
ELM, RALM
70
14
0
12 Mar 2022
Uncertainty Calibration for Ensemble-Based Debiasing Methods
Ruibin Xiong
Yimeng Chen
Liang Pang
Xueqi Chen
Yanyan Lan
80
22
0
07 Nov 2021
Can Explanations Be Useful for Calibrating Black Box Models?
Xi Ye
Greg Durrett
FAtt
105
27
0
14 Oct 2021
MuSiQue: Multihop Questions via Single-hop Question Composition
H. Trivedi
Niranjan Balasubramanian
Tushar Khot
Ashish Sabharwal
LRM
240
381
0
02 Aug 2021
Robustifying Multi-hop QA through Pseudo-Evidentiality Training
Kyungjae Lee
Seung-won Hwang
Sanghyun Han
Dohyeon Lee
OffRL
106
13
0
07 Jul 2021
Flexible Generation of Natural Language Deductions
Kaj Bostrom
Xinyu Zhao
Swarat Chaudhuri
Greg Durrett
ReLM, LRM
396
34
0
18 Apr 2021
Can NLI Models Verify QA Systems' Predictions?
Jifan Chen
Eunsol Choi
Greg Durrett
182
55
0
18 Apr 2021
Joint Passage Ranking for Diverse Multi-Answer Retrieval
Sewon Min
Kenton Lee
Ming-Wei Chang
Kristina Toutanova
Hannaneh Hajishirzi
152
43
0
17 Apr 2021
Connecting Attributions and QA Model Behavior on Realistic Counterfactuals
Xi Ye
Rohan Nair
Greg Durrett
83
25
0
09 Apr 2021
Rissanen Data Analysis: Examining Dataset Characteristics via Description Length
Ethan Perez
Douwe Kiela
Kyunghyun Cho
109
24
0
05 Mar 2021
Model Agnostic Answer Reranking System for Adversarial Question Answering
Sagnik Majumder
Chinmoy Samant
Greg Durrett
OOD, AAML
76
7
0
05 Feb 2021
Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval
Omar Khattab
Christopher Potts
Matei A. Zaharia
RALM, LRM
253
60
0
02 Jan 2021
IIRC: A Dataset of Incomplete Information Reading Comprehension Questions
James Ferguson
Matt Gardner
Hannaneh Hajishirzi
Tushar Khot
Pradeep Dasigi
RALM
96
60
0
13 Nov 2020
HoVer: A Dataset for Many-Hop Fact Extraction And Claim Verification
Yichen Jiang
Shikha Bordia
Zheng Zhong
Charles Dognin
M. Singh
Joey Tianyi Zhou
209
176
0
05 Nov 2020
Constructing A Multi-hop QA Dataset for Comprehensive Evaluation of Reasoning Steps
Xanh Ho
A. Nguyen
Saku Sugawara
Akiko Aizawa
RALM, LRM
149
613
0
02 Nov 2020
Challenges in Information-Seeking QA: Unanswerable Questions and Paragraph Retrieval
Akari Asai
Eunsol Choi
RALM
175
56
0
22 Oct 2020
Why do you think that? Exploring Faithful Sentence-Level Rationales Without Supervision
Max Glockner
Ivan Habernal
Iryna Gurevych
LRM
123
26
0
07 Oct 2020
Context Modeling with Evidence Filter for Multiple Choice Question Answering
S. Yu
Hao Zhang
Wei Jing
Jing Jiang
39
2
0
06 Oct 2020
A Survey on Explainability in Machine Reading Comprehension
Mokanarangan Thayaparan
Marco Valentino
André Freitas
FaML
151
51
0
01 Oct 2020
Multi-Hop Fact Checking of Political Claims
W. Ostrowski
Arnav Arora
Pepa Atanasova
Isabelle Augenstein
LRM
170
47
0
10 Sep 2020
Selective Question Answering under Domain Shift
Amita Kamath
Robin Jia
Percy Liang
OOD
126
227
0
16 Jun 2020
Beyond Leaderboards: A survey of methods for revealing weaknesses in Natural Language Inference data and models
Viktor Schlegel
Goran Nenadic
Riza Batista-Navarro
ELM
109
18
0
29 May 2020
Unsupervised Alignment-based Iterative Evidence Retrieval for Multi-hop Question Answering
Vikas Yadav
Steven Bethard
Mihai Surdeanu
RALM
142
53
0
04 May 2020
Is Multihop QA in DiRe Condition? Measuring and Reducing Disconnected Reasoning
H. Trivedi
Niranjan Balasubramanian
Tushar Khot
Ashish Sabharwal
AAML
62
1
0
02 May 2020
Robust Question Answering Through Sub-part Alignment
Jifan Chen
Greg Durrett
OOD
117
13
0
30 Apr 2020