arXiv:1906.02900
Compositional Questions Do Not Necessitate Multi-hop Reasoning


7 June 2019
Sewon Min
Eric Wallace
Sameer Singh
Matt Gardner
Hannaneh Hajishirzi
Luke Zettlemoyer

Papers citing "Compositional Questions Do Not Necessitate Multi-hop Reasoning"

50 / 116 papers shown
Reward Hacking Mitigation using Verifiable Composite Rewards
Mirza Farhan Bin Tarek
Rahmatollah Beheshti
OffRL, LRM
15
0
0
19 Sep 2025
GRADE: Generating multi-hop QA and fine-gRAined Difficulty matrix for RAG Evaluation
Jeongsoo Lee
Daeyong Kwon
Kyohoon Jin
36
0
0
23 Aug 2025
Osiris: A Lightweight Open-Source Hallucination Detection System
Alex Shan
John Bauer
Christopher D. Manning
HILM, VLM
218
0
0
07 May 2025
SUNAR: Semantic Uncertainty based Neighborhood Aware Retrieval for Complex QA
Venktesh V
Mandeep Rathee
Avishek Anand
RALM, LRM
185
3
0
23 Mar 2025
FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data
Deren Lei
Yaxi Li
Siyao Li
Mengya Hu
Rui Xu
Ken Archer
Mingyu Wang
Emily Ching
Alex Deng
SyDa, HILM, LRM
178
2
0
28 Jan 2025
MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Tail Knowledge
Jie He
Nan Hu
Wanqiu Long
Jiaoyan Chen
Jeff Z. Pan
ELM, LRM
278
13
0
22 Dec 2024
Do Large Language Models Perform Latent Multi-Hop Reasoning without Exploiting Shortcuts?
Sohee Yang
Nora Kassner
E. Gribovskaya
Sebastian Riedel
Mor Geva
LRM, KELM, ReLM
219
12
0
25 Nov 2024
SG-FSM: A Self-Guiding Zero-Shot Prompting Paradigm for Multi-Hop Question Answering Based on Finite State Machine
Xiaochen Wang
Junqing He
Liang Chen
Reza Haf
Zhe Yang
Yiru Wang
Xiangdi Meng
Kunhao Pan
Zhifang Sui
ReLM, LRM
64
1
0
22 Oct 2024
Seemingly Plausible Distractors in Multi-Hop Reasoning: Are Large Language Models Attentive Readers?
Neeladri Bhuiya
Viktor Schlegel
Stefan Winkler
LRM
124
8
0
08 Sep 2024
Hierarchical Retrieval-Augmented Generation Model with Rethink for Multi-hop Question Answering
Xiaoming Zhang
Ming Wang
Xiaocui Yang
Daling Wang
Shi Feng
Yifei Zhang
RALM
117
10
0
20 Aug 2024
Evaluating Long Range Dependency Handling in Code Generation LLMs
Yannick Assogba
Donghao Ren
136
1
0
23 Jul 2024
Meta-prompting Optimized Retrieval-augmented Generation
João Rodrigues
António Branco
RALM
113
0
0
04 Jul 2024
FSM: A Finite State Machine Based Zero-Shot Prompting Paradigm for Multi-Hop Question Answering
Xiaochen Wang
Junqing He
Zhiyong Yang
Yiru Wang
Xiangdi Meng
Kunhao Pan
Zhifang Sui
LRM, ReLM
89
6
0
03 Jul 2024
MoreHopQA: More Than Multi-hop Reasoning
Julian Schnitzler
Xanh Ho
Jiahao Huang
Florian Boudin
Saku Sugawara
Akiko Aizawa
LRM
137
13
0
19 Jun 2024
Measuring Retrieval Complexity in Question Answering Systems
Matteo Gabburo
Nicolaas Paul Jedema
Siddhant Garg
Leonardo F. R. Ribeiro
Alessandro Moschitti
92
2
0
05 Jun 2024
ACCORD: Closing the Commonsense Measurability Gap
François Roewer-Després
Jinyue Feng
Zining Zhu
Frank Rudzicz
LRM
176
0
0
04 Jun 2024
FanOutQA: A Multi-Hop, Multi-Document Question Answering Benchmark for Large Language Models
Andrew Zhu
Alyssa Hwang
Liam Dugan
Chris Callison-Burch
ELM
129
7
0
21 Feb 2024
Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models
Mosh Levy
Alon Jacoby
Yoav Goldberg
194
108
0
19 Feb 2024
Large Language Models are not Fair Evaluators
Peiyi Wang
Lei Li
Liang Chen
Zefan Cai
Dawei Zhu
Binghuai Lin
Yunbo Cao
Qi Liu
Tianyu Liu
Zhifang Sui
ALM
238
663
0
29 May 2023
What Else Do I Need to Know? The Effect of Background Information on Users' Reliance on QA Systems
Navita Goyal
Eleftheria Briakou
Amanda Liu
Connor Baumler
C. Bonial
J. Micher
Clare R. Voss
Marine Carpuat
Hal Daumé
108
10
0
23 May 2023
HOP, UNION, GENERATE: Explainable Multi-hop Reasoning without Rationale Supervision
Wenting Zhao
Justin T. Chiu
Claire Cardie
Alexander M. Rush
LRM
105
5
0
23 May 2023
Out-of-Distribution Generalization in Text Classification: Past, Present, and Future
Linyi Yang
Yangqiu Song
Xuan Ren
Chenyang Lyu
Yidong Wang
Lingqiao Liu
Yongfeng Zhang
Jennifer Foster
Yue Zhang
OOD
178
3
0
23 May 2023
Explicit Planning Helps Language Models in Logical Reasoning
Hongyu Zhao
Kangrui Wang
Mo Yu
Hongyuan Mei
LRM, ReLM
222
17
0
28 Mar 2023
Natural Language Reasoning, A Survey
Fei Yu
Hongbo Zhang
Prayag Tiwari
Benyou Wang
ReLM, LRM
210
82
0
26 Mar 2023
Analyzing the Effectiveness of the Underlying Reasoning Tasks in Multi-hop Question Answering
Xanh Ho
A. Nguyen
Saku Sugawara
Akiko Aizawa
LRM
119
8
0
12 Feb 2023
Using Focal Loss to Fight Shallow Heuristics: An Empirical Analysis of Modulated Cross-Entropy in Natural Language Inference
Frano Rajic
Ivan Stresec
Axel Marmet
Tim Postuvan
82
3
0
23 Nov 2022
GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective
Linyi Yang
Shuibai Zhang
Libo Qin
Yafu Li
Yidong Wang
Hanmeng Liu
Yongfeng Zhang
Xingxu Xie
Yue Zhang
ELM
334
87
0
15 Nov 2022
ReasonChainQA: Text-based Complex Question Answering with Explainable Evidence Chains
Minjun Zhu
Yixuan Weng
Shizhu He
Kang Liu
Jun Zhao
LRM
93
6
0
17 Oct 2022
Counterfactual Multihop QA: A Cause-Effect Approach for Reducing Disconnected Reasoning
Wangzhen Guo
Qinkang Gong
Hanjiang Lai
LRM
93
4
0
13 Oct 2022
Relational Graph Convolutional Neural Networks for Multihop Reasoning: A Comparative Study
Ieva Staliunaite
P. Gorinski
Ignacio Iacobacci
GNN
117
0
0
12 Oct 2022
How Well Do Multi-hop Reading Comprehension Models Understand Date Information?
Xanh Ho
Saku Sugawara
Akiko Aizawa
103
2
0
11 Oct 2022
Understanding and Improving Zero-shot Multi-hop Reasoning in Generative Question Answering
Zhengbao Jiang
Jun Araki
Haibo Ding
Graham Neubig
LRM
95
12
0
09 Oct 2022
GAPX: Generalized Autoregressive Paraphrase-Identification X
Yi Zhou
Renyu Li
Hayden Housen
Ser-Nam Lim
BDL
100
0
0
05 Oct 2022
Machine Reading, Fast and Slow: When Do Models "Understand" Language?
Sagnik Ray Choudhury
Anna Rogers
Isabelle Augenstein
LRM
100
18
0
15 Sep 2022
A Survey on Measuring and Mitigating Reasoning Shortcuts in Machine Reading Comprehension
Xanh Ho
Johannes Mario Meissner
Saku Sugawara
Akiko Aizawa
OffRL
110
5
0
05 Sep 2022
Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts
H. Trivedi
Niranjan Balasubramanian
Tushar Khot
Ashish Sabharwal
ReLM, LRM
127
11
0
25 May 2022
From Easy to Hard: Two-stage Selector and Reader for Multi-hop Question Answering
Xin-Yi Li
Weixian Lei
Yubin Yang
RALM
163
24
0
24 May 2022
Automated Crossword Solving
Eric Wallace
Nicholas Tomlin
Albert Xu
Kevin Kaichuang Yang
Eshaan Pathak
Matthew Ginsberg
Dan Klein
162
14
0
19 May 2022
Better Retrieval May Not Lead to Better Question Answering
Zhengzhong Liang
Tushar Khot
Steven Bethard
Mihai Surdeanu
Ashish Sabharwal
RALM, LRM
141
3
0
07 May 2022
Task-guided Disentangled Tuning for Pretrained Language Models
Jiali Zeng
Yu Jiang
Shuangzhi Wu
Yongjing Yin
Mu Li
DRL
179
3
0
22 Mar 2022
Reasoning over Public and Private Data in Retrieval-Based Systems
Simran Arora
Patrick Lewis
Angela Fan
Jacob Kahn
Christopher Ré
76
28
0
14 Mar 2022
What Makes Reading Comprehension Questions Difficult?
Saku Sugawara
Nikita Nangia
Alex Warstadt
Sam Bowman
ELM, RALM
70
14
0
12 Mar 2022
Saving Dense Retriever from Shortcut Dependency in Conversational Search
Sungdong Kim
Gangwoo Kim
100
31
0
15 Feb 2022
CommonsenseQA 2.0: Exposing the Limits of AI through Gamification
Alon Talmor
Ori Yoran
Ronan Le Bras
Chandrasekhar Bhagavatula
Yoav Goldberg
Yejin Choi
Jonathan Berant
ELM
163
153
0
14 Jan 2022
General Greedy De-bias Learning
Xinzhe Han
Shuhui Wang
Chi Su
Qingming Huang
Qi Tian
211
12
0
20 Dec 2021
QuALITY: Question Answering with Long Input Texts, Yes!
Richard Yuanzhe Pang
Alicia Parrish
Nitish Joshi
Nikita Nangia
Jason Phang
...
Vishakh Padmakumar
Johnny Ma
Jana Thompson
He He
Sam Bowman
RALM
145
171
0
16 Dec 2021
Towards Interpretable and Reliable Reading Comprehension: A Pipeline Model with Unanswerability Prediction
Kosuke Nishida
Kyosuke Nishida
Itsumi Saito
Sen Yoshida
128
7
0
17 Nov 2021
Uncertainty Calibration for Ensemble-Based Debiasing Methods
Ruibin Xiong
Yimeng Chen
Liang Pang
Xueqi Chen
Yanyan Lan
80
22
0
07 Nov 2021
Grounded Graph Decoding Improves Compositional Generalization in Question Answering
Yu Gai
Paras Jain
Wendi Zhang
Joseph E. Gonzalez
Basel Alomair
Ion Stoica
BDL, OOD
111
8
0
05 Nov 2021
Hey AI, Can You Solve Complex Tasks by Talking to Agents?
Tushar Khot
Kyle Richardson
Daniel Khashabi
Ashish Sabharwal
RALM, LRM
116
14
0
16 Oct 2021