1906.02900
Compositional Questions Do Not Necessitate Multi-hop Reasoning
7 June 2019
Sewon Min
Eric Wallace
Sameer Singh
Matt Gardner
Hannaneh Hajishirzi
Luke Zettlemoyer
Papers citing
"Compositional Questions Do Not Necessitate Multi-hop Reasoning"
50 / 116 papers shown
Title
Reward Hacking Mitigation using Verifiable Composite Rewards
Mirza Farhan Bin Tarek
Rahmatollah Beheshti
OffRL
LRM
15
0
0
19 Sep 2025
GRADE: Generating multi-hop QA and fine-gRAined Difficulty matrix for RAG Evaluation
Jeongsoo Lee
Daeyong Kwon
Kyohoon Jin
36
0
0
23 Aug 2025
Osiris: A Lightweight Open-Source Hallucination Detection System
Alex Shan
John Bauer
Christopher D. Manning
HILM
VLM
218
0
0
07 May 2025
SUNAR: Semantic Uncertainty based Neighborhood Aware Retrieval for Complex QA
Venktesh V
Mandeep Rathee
Avishek Anand
RALM
LRM
185
3
0
23 Mar 2025
FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data
Deren Lei
Yaxi Li
Siyao Li
Mengya Hu
Rui Xu
Ken Archer
Mingyu Wang
Emily Ching
Alex Deng
SyDa
HILM
LRM
178
2
0
28 Jan 2025
MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Tail Knowledge
Jie He
Nan Hu
Wanqiu Long
Jiaoyan Chen
Jeff Z. Pan
ELM
LRM
278
13
0
22 Dec 2024
Do Large Language Models Perform Latent Multi-Hop Reasoning without Exploiting Shortcuts?
Sohee Yang
Nora Kassner
E. Gribovskaya
Sebastian Riedel
Mor Geva
LRM
KELM
ReLM
219
12
0
25 Nov 2024
SG-FSM: A Self-Guiding Zero-Shot Prompting Paradigm for Multi-Hop Question Answering Based on Finite State Machine
Xiaochen Wang
Junqing He
Liang Chen
Reza Haf Zhe Yang
Yiru Wang
Xiangdi Meng
Kunhao Pan
Zhifang Sui
ReLM
LRM
64
1
0
22 Oct 2024
Seemingly Plausible Distractors in Multi-Hop Reasoning: Are Large Language Models Attentive Readers?
Neeladri Bhuiya
Viktor Schlegel
Stefan Winkler
LRM
124
8
0
08 Sep 2024
Hierarchical Retrieval-Augmented Generation Model with Rethink for Multi-hop Question Answering
Xiaoming Zhang
Ming Wang
Xiaocui Yang
Daling Wang
Shi Feng
Yifei Zhang
RALM
117
10
0
20 Aug 2024
Evaluating Long Range Dependency Handling in Code Generation LLMs
Yannick Assogba
Donghao Ren
136
1
0
23 Jul 2024
Meta-prompting Optimized Retrieval-augmented Generation
João Rodrigues
António Branco
RALM
113
0
0
04 Jul 2024
FSM: A Finite State Machine Based Zero-Shot Prompting Paradigm for Multi-Hop Question Answering
Xiaochen Wang
Junqing He
Zhiyong Yang
Yiru Wang
Xiangdi Meng
Kunhao Pan
Zhifang Sui
LRM
ReLM
89
6
0
03 Jul 2024
MoreHopQA: More Than Multi-hop Reasoning
Julian Schnitzler
Xanh Ho
Jiahao Huang
Florian Boudin
Saku Sugawara
Akiko Aizawa
LRM
137
13
0
19 Jun 2024
Measuring Retrieval Complexity in Question Answering Systems
Matteo Gabburo
Nicolaas Paul Jedema
Siddhant Garg
Leonardo F. R. Ribeiro
Alessandro Moschitti
92
2
0
05 Jun 2024
ACCORD: Closing the Commonsense Measurability Gap
François Roewer-Després
Jinyue Feng
Zining Zhu
Frank Rudzicz
LRM
176
0
0
04 Jun 2024
FanOutQA: A Multi-Hop, Multi-Document Question Answering Benchmark for Large Language Models
Andrew Zhu
Alyssa Hwang
Liam Dugan
Chris Callison-Burch
ELM
129
7
0
21 Feb 2024
Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models
Mosh Levy
Alon Jacoby
Yoav Goldberg
194
108
0
19 Feb 2024
Large Language Models are not Fair Evaluators
Peiyi Wang
Lei Li
Liang Chen
Zefan Cai
Dawei Zhu
Binghuai Lin
Yunbo Cao
Qi Liu
Tianyu Liu
Zhifang Sui
ALM
238
663
0
29 May 2023
What Else Do I Need to Know? The Effect of Background Information on Users' Reliance on QA Systems
Navita Goyal
Eleftheria Briakou
Amanda Liu
Connor Baumler
C. Bonial
J. Micher
Clare R. Voss
Marine Carpuat
Hal Daumé
108
10
0
23 May 2023
HOP, UNION, GENERATE: Explainable Multi-hop Reasoning without Rationale Supervision
Wenting Zhao
Justin T. Chiu
Claire Cardie
Alexander M. Rush
LRM
105
5
0
23 May 2023
Out-of-Distribution Generalization in Text Classification: Past, Present, and Future
Linyi Yang
Yangqiu Song
Xuan Ren
Chenyang Lyu
Yidong Wang
Lingqiao Liu
Yongfeng Zhang
Jennifer Foster
Yue Zhang
OOD
178
3
0
23 May 2023
Explicit Planning Helps Language Models in Logical Reasoning
Hongyu Zhao
Kangrui Wang
Mo Yu
Hongyuan Mei
LRM
ReLM
222
17
0
28 Mar 2023
Natural Language Reasoning, A Survey
Fei Yu
Hongbo Zhang
Prayag Tiwari
Benyou Wang
ReLM
LRM
210
82
0
26 Mar 2023
Analyzing the Effectiveness of the Underlying Reasoning Tasks in Multi-hop Question Answering
Xanh Ho
A. Nguyen
Saku Sugawara
Akiko Aizawa
LRM
119
8
0
12 Feb 2023
Using Focal Loss to Fight Shallow Heuristics: An Empirical Analysis of Modulated Cross-Entropy in Natural Language Inference
Frano Rajic
Ivan Stresec
Axel Marmet
Tim Postuvan
82
3
0
23 Nov 2022
GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective
Linyi Yang
Shuibai Zhang
Libo Qin
Yafu Li
Yidong Wang
Hanmeng Liu
Yongfeng Zhang
Xingxu Xie
Yue Zhang
ELM
334
87
0
15 Nov 2022
ReasonChainQA: Text-based Complex Question Answering with Explainable Evidence Chains
Minjun Zhu
Yixuan Weng
Shizhu He
Kang Liu
Jun Zhao
LRM
93
6
0
17 Oct 2022
Counterfactual Multihop QA: A Cause-Effect Approach for Reducing Disconnected Reasoning
Wangzhen Guo
Qinkang Gong
Hanjiang Lai
LRM
93
4
0
13 Oct 2022
Relational Graph Convolutional Neural Networks for Multihop Reasoning: A Comparative Study
Ieva Staliunaite
P. Gorinski
Ignacio Iacobacci
GNN
117
0
0
12 Oct 2022
How Well Do Multi-hop Reading Comprehension Models Understand Date Information?
Xanh Ho
Saku Sugawara
Akiko Aizawa
103
2
0
11 Oct 2022
Understanding and Improving Zero-shot Multi-hop Reasoning in Generative Question Answering
Zhengbao Jiang
Jun Araki
Haibo Ding
Graham Neubig
LRM
95
12
0
09 Oct 2022
GAPX: Generalized Autoregressive Paraphrase-Identification X
Yi Zhou
Renyu Li
Hayden Housen
Ser-Nam Lim
BDL
100
0
0
05 Oct 2022
Machine Reading, Fast and Slow: When Do Models "Understand" Language?
Sagnik Ray Choudhury
Anna Rogers
Isabelle Augenstein
LRM
100
18
0
15 Sep 2022
A Survey on Measuring and Mitigating Reasoning Shortcuts in Machine Reading Comprehension
Xanh Ho
Johannes Mario Meissner
Saku Sugawara
Akiko Aizawa
OffRL
110
5
0
05 Sep 2022
Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts
H. Trivedi
Niranjan Balasubramanian
Tushar Khot
Ashish Sabharwal
ReLM
LRM
127
11
0
25 May 2022
From Easy to Hard: Two-stage Selector and Reader for Multi-hop Question Answering
Xin-Yi Li
Weixian Lei
Yubin Yang
RALM
163
24
0
24 May 2022
Automated Crossword Solving
Eric Wallace
Nicholas Tomlin
Albert Xu
Kevin Kaichuang Yang
Eshaan Pathak
Matthew Ginsberg
Dan Klein
162
14
0
19 May 2022
Better Retrieval May Not Lead to Better Question Answering
Zhengzhong Liang
Tushar Khot
Steven Bethard
Mihai Surdeanu
Ashish Sabharwal
RALM
LRM
141
3
0
07 May 2022
Task-guided Disentangled Tuning for Pretrained Language Models
Jiali Zeng
Yu Jiang
Shuangzhi Wu
Yongjing Yin
Mu Li
DRL
179
3
0
22 Mar 2022
Reasoning over Public and Private Data in Retrieval-Based Systems
Simran Arora
Patrick Lewis
Angela Fan
Jacob Kahn
Christopher Ré
76
28
0
14 Mar 2022
What Makes Reading Comprehension Questions Difficult?
Saku Sugawara
Nikita Nangia
Alex Warstadt
Sam Bowman
ELM
RALM
70
14
0
12 Mar 2022
Saving Dense Retriever from Shortcut Dependency in Conversational Search
Sungdong Kim
Gangwoo Kim
100
31
0
15 Feb 2022
CommonsenseQA 2.0: Exposing the Limits of AI through Gamification
Alon Talmor
Ori Yoran
Ronan Le Bras
Chandrasekhar Bhagavatula
Yoav Goldberg
Yejin Choi
Jonathan Berant
ELM
163
153
0
14 Jan 2022
General Greedy De-bias Learning
Xinzhe Han
Shuhui Wang
Chi Su
Qingming Huang
Qi Tian
211
12
0
20 Dec 2021
QuALITY: Question Answering with Long Input Texts, Yes!
Richard Yuanzhe Pang
Alicia Parrish
Nitish Joshi
Nikita Nangia
Jason Phang
...
Vishakh Padmakumar
Johnny Ma
Jana Thompson
He He
Sam Bowman
RALM
145
171
0
16 Dec 2021
Towards Interpretable and Reliable Reading Comprehension: A Pipeline Model with Unanswerability Prediction
Kosuke Nishida
Kyosuke Nishida
Itsumi Saito
Sen Yoshida
128
7
0
17 Nov 2021
Uncertainty Calibration for Ensemble-Based Debiasing Methods
Ruibin Xiong
Yimeng Chen
Liang Pang
Xueqi Chen
Yanyan Lan
80
22
0
07 Nov 2021
Grounded Graph Decoding Improves Compositional Generalization in Question Answering
Yu Gai
Paras Jain
Wendi Zhang
Joseph E. Gonzalez
Basel Alomair
Ion Stoica
BDL
OOD
111
8
0
05 Nov 2021
Hey AI, Can You Solve Complex Tasks by Talking to Agents?
Tushar Khot
Kyle Richardson
Daniel Khashabi
Ashish Sabharwal
RALM
LRM
116
14
0
16 Oct 2021