ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.01060
  4. Cited By
Constructing A Multi-hop QA Dataset for Comprehensive Evaluation of
  Reasoning Steps
v1v2 (latest)

Constructing A Multi-hop QA Dataset for Comprehensive Evaluation of Reasoning Steps

International Conference on Computational Linguistics (COLING), 2020
2 November 2020
Xanh Ho
A. Nguyen
Saku Sugawara
Akiko Aizawa
    RALMLRM
ArXiv (abs)PDFHTML

Papers citing "Constructing A Multi-hop QA Dataset for Comprehensive Evaluation of Reasoning Steps"

50 / 590 papers shown
Chunks as Arms: Multi-Armed Bandit-Guided Sampling for Long-Context LLM Preference Optimization
Chunks as Arms: Multi-Armed Bandit-Guided Sampling for Long-Context LLM Preference Optimization
Shaohua Duan
Xinze Li
Zhenghao Liu
Xiaoyuan Yi
Shi Yu
Kaiyan Zhang
Yu Gu
Ge Yu
Maosong Sun
Maosong Sun
RALM
174
6
0
10 Apr 2026
PathFinder: MCTS and LLM Feedback-based Path Selection for Multi-Hop Question Answering
PathFinder: MCTS and LLM Feedback-based Path Selection for Multi-Hop Question Answering
Durga Prasad Maram
Kalpa Gunaratna
Vijay Srinivasan
Haris Jeelani
Srinivas Chappidi
LRM
65
0
0
05 Dec 2025
On Group Relative Policy Optimization Collapse in Agent Search: The Lazy Likelihood-Displacement
On Group Relative Policy Optimization Collapse in Agent Search: The Lazy Likelihood-Displacement
Wenlong Deng
Yushu Li
Boying Gong
Yi Ren
Christos Thrampoulidis
Xiaoxiao Li
166
7
0
03 Dec 2025
Towards Unification of Hallucination Detection and Fact Verification for Large Language Models
Towards Unification of Hallucination Detection and Fact Verification for Large Language Models
Weihang Su
Jianming Long
Changyue Wang
Shiyu Lin
Jingyan Xu
Ziyi Ye
Qingyao Ai
Yiqun Liu
HILM
148
1
0
02 Dec 2025
Agentic Policy Optimization via Instruction-Policy Co-Evolution
Agentic Policy Optimization via Instruction-Policy Co-Evolution
Han Zhou
Xingchen Wan
Ivan Vulić
Anna Korhonen
157
0
0
01 Dec 2025
From Atomic to Composite: Reinforcement Learning Enables Generalization in Complementary Reasoning
From Atomic to Composite: Reinforcement Learning Enables Generalization in Complementary Reasoning
Sitao Cheng
Xunjian Yin
Ruiwen Zhou
Yuxuan Li
Xinyi Wang
Liangming Pan
William Yang Wang
Victor Zhong
OffRLLRM
319
4
0
01 Dec 2025
Multi-chain Graph Refinement and Selection for Reliable Reasoning in Large Language Models
Multi-chain Graph Refinement and Selection for Reliable Reasoning in Large Language Models
Yujiao Yang
Jing Lian
Linhui Li
LRM
255
0
0
28 Nov 2025
Reducing Latency of LLM Search Agent via Speculation-based Algorithm-System Co-Design
Reducing Latency of LLM Search Agent via Speculation-based Algorithm-System Co-Design
Zixiao Huang
Wen Zeng
Tianyu Fu
Tengxuan Liu
Yizhou Sun
...
Y. Li
Quanlu Zhang
Guohao Dai
Zhenhua Zhu
Yu Wang
LRM
245
1
0
25 Nov 2025
Stabilizing Off-Policy Training for Long-Horizon LLM Agent via Turn-Level Importance Sampling and Clipping-Triggered Normalization
Stabilizing Off-Policy Training for Long-Horizon LLM Agent via Turn-Level Importance Sampling and Clipping-Triggered Normalization
Chenliang Li
Adel Elmahdy
Alex Boyd
Zhongruo Wang
Alfredo García
Parminder Bhatia
Taha A. Kass-Hout
Cao Xiao
Mingyi Hong
Mingyi Hong
OffRL
264
1
0
25 Nov 2025
HyperbolicRAG: Enhancing Retrieval-Augmented Generation with Hyperbolic Representations
HyperbolicRAG: Enhancing Retrieval-Augmented Generation with Hyperbolic Representations
Linxiao Cao
Ruitao Wang
Jindong Li
Zhipeng Zhou
Menglin Yang
256
1
0
24 Nov 2025
CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning
CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning
Jie He
Richard He Bai
Sinead Williamson
Jeff Z. Pan
Navdeep Jaitly
Yizhe Zhang
RALMVLMLRM
784
2
0
24 Nov 2025
Parametric Retrieval-Augmented Generation using Latent Routing of LoRA Adapters
Parametric Retrieval-Augmented Generation using Latent Routing of LoRA Adapters
Zhan Su
Fengran Mo
Jian-yun Nie
Yuchen Hui
Jiaao Sun
Jian-yun Nie
176
2
0
21 Nov 2025
ARK: Answer-Centric Retriever Tuning via KG-augmented Curriculum Learning
ARK: Answer-Centric Retriever Tuning via KG-augmented Curriculum Learning
Jiawei Zhou
Hang Ding
Haiyun Jiang
RALM
185
0
0
20 Nov 2025
MuISQA: Multi-Intent Retrieval-Augmented Generation for Scientific Question Answering
MuISQA: Multi-Intent Retrieval-Augmented Generation for Scientific Question Answering
Zhiyuan Li
Haisheng Yu
Guangchuan Guo
Nan Zhou
Jiajun Zhang
RALM
365
1
0
20 Nov 2025
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
Mingyue Cheng
Jie Ouyang
Shuo Yu
Ruiran Yan
Yucong Luo
Zirui Liu
Daoyu Wang
Qi Liu
Enhong Chen
199
25
0
18 Nov 2025
Hierarchical Token Prepending: Enhancing Information Flow in Decoder-based LLM Embeddings
Hierarchical Token Prepending: Enhancing Information Flow in Decoder-based LLM Embeddings
Xueying Ding
Xingyue Huang
Mingxuan Ju
Liam Collins
Yozen Liu
Leman Akoglu
Neil Shah
Tong Zhao
175
2
0
18 Nov 2025
CriticSearch: Fine-Grained Credit Assignment for Search Agents via a Retrospective Critic
CriticSearch: Fine-Grained Credit Assignment for Search Agents via a Retrospective Critic
Yaocheng Zhang
Haohuan Huang
Zijun Song
Yuanheng Zhu
Qichao Zhang
Zijie Zhao
Dongbin Zhao
OffRLLRM
212
9
0
15 Nov 2025
A Multifaceted Analysis of Negative Bias in Large Language Models through the Lens of Parametric Knowledge
A Multifaceted Analysis of Negative Bias in Large Language Models through the Lens of Parametric KnowledgeIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025
Jongyoon Song
Sangwon Yu
Sungroh Yoon
75
0
0
14 Nov 2025
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling
MiroMind Team
Song Bai
Lidong Bing
Carson Chen
Guanzheng Chen
...
T. Zhao
Xizhou Zhu
Yanpeng Zhou
Y. Zhang
Zhi Zhu
LLMAGLRMVLM
370
29
0
14 Nov 2025
Modeling Uncertainty Trends for Timely Retrieval in Dynamic RAG
Modeling Uncertainty Trends for Timely Retrieval in Dynamic RAG
Bo Li
Tian Tian
Zhenghua Xu
Hao Cheng
Shikun Zhang
Wei Ye
295
0
0
13 Nov 2025
CAPO: Confidence Aware Preference Optimization Learning for Multilingual Preferences
CAPO: Confidence Aware Preference Optimization Learning for Multilingual Preferences
Rhitabrat Pokharel
Yufei Tao
Ameeta Agrawal
172
3
0
10 Nov 2025
TeaRAG: A Token-Efficient Agentic Retrieval-Augmented Generation Framework
TeaRAG: A Token-Efficient Agentic Retrieval-Augmented Generation Framework
Chao Zhang
Y Samuel Wang
Derong Xu
Haoxin Zhang
Yuanjie Lyu
...
Tong Xu
Xiangyu Zhao
Yan Gao
Yao Hu
Enhong Chen
3DV
496
2
0
07 Nov 2025
Query Generation Pipeline with Enhanced Answerability Assessment for Financial Information Retrieval
Query Generation Pipeline with Enhanced Answerability Assessment for Financial Information Retrieval
H. Kim
Yeeun Yoo
Youngjun Kwak
RALM
260
1
0
07 Nov 2025
MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Learning
MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Learning
Qianhao Yuan
Jie Lou
Zichao Li
Jiawei Chen
Yaojie Lu
Hongyu Lin
Le Sun
Debing Zhang
Xianpei Han
OffRLRALM
213
6
0
04 Nov 2025
Using Span Queries to Optimize for Cache and Attention Locality
Using Span Queries to Optimize for Cache and Attention Locality
Paul C. Castro
N. Mitchell
Nathan Ordonez
Thomas Parnell
Mudhakar Srivatsa
Antoni Viros i Martin
LRM
175
0
0
04 Nov 2025
LiveSearchBench: An Automatically Constructed Benchmark for Retrieval and Reasoning over Dynamic Knowledge
LiveSearchBench: An Automatically Constructed Benchmark for Retrieval and Reasoning over Dynamic Knowledge
Heng Zhou
Ao Yu
Yuchen Fan
Jianing Shi
Li Kang
...
Y. Wu
Tiancheng He
Yiran Qin
Wenlong Zhang
Zhenfei Yin
KELMRALM
487
2
0
03 Nov 2025
PROPEX-RAG: Enhanced GraphRAG using Prompt-Driven Prompt Execution
PROPEX-RAG: Enhanced GraphRAG using Prompt-Driven Prompt Execution
Tejas Sarnaik
Manan Shah
Ravi Hegde
LRM
224
1
0
03 Nov 2025
DEEPAMBIGQA: Ambiguous Multi-hop Questions for Benchmarking LLM Answer Completeness
DEEPAMBIGQA: Ambiguous Multi-hop Questions for Benchmarking LLM Answer Completeness
Jiabao Ji
Min Li
Priyanshu Kumar
Shiyu Chang
Saloni Potdar
148
3
0
03 Nov 2025
Optimizing Native Sparse Attention with Latent Attention and Local Global Alternating Strategies
Optimizing Native Sparse Attention with Latent Attention and Local Global Alternating Strategies
Yuxuan Hu
Jianchao Tan
Jiaqi Zhang
Wen Zan
Pingwei Sun
Yifan Lu
Yerui Sun
Yuchen Xie
Xunliang Cai
Jing Zhang
305
0
0
02 Nov 2025
Prompt-R1: Collaborative Automatic Prompting Framework via End-to-end Reinforcement Learning
Prompt-R1: Collaborative Automatic Prompting Framework via End-to-end Reinforcement Learning
Wenjin Liu
Haoran Luo
X. Lin
Haoming Liu
Tiesunlong Shen
Jiapu Wang
Rui Mao
Erik Cambria
LLMAGOffRLLRM
594
4
0
02 Nov 2025
Separate the Wheat from the Chaff: Winnowing Down Divergent Views in Retrieval Augmented Generation
Separate the Wheat from the Chaff: Winnowing Down Divergent Views in Retrieval Augmented Generation
Song Wang
Zihan Chen
Peng Wang
Zhepei Wei
Zhen Tan
Yu Meng
Cong Shen
Jundong Li
236
1
0
01 Nov 2025
Interact-RAG: Reason and Interact with the Corpus, Beyond Black-Box Retrieval
Interact-RAG: Reason and Interact with the Corpus, Beyond Black-Box Retrieval
Yulong Hui
Chao Chen
Zhihang Fu
Yihao Liu
J. C. Ye
Huanchen Zhang
LRM
361
3
0
31 Oct 2025
MARAG-R1: Beyond Single Retriever via Reinforcement-Learned Multi-Tool Agentic Retrieval
MARAG-R1: Beyond Single Retriever via Reinforcement-Learned Multi-Tool Agentic Retrieval
Qi Luo
X. Li
Yuxin Wang
Tingshuo Fan
Yuan Li
Xinchi Chen
Xipeng Qiu
RALMKELMLRM
244
2
0
31 Oct 2025
InfoFlow: Reinforcing Search Agent Via Reward Density Optimization
InfoFlow: Reinforcing Search Agent Via Reward Density Optimization
Kun Luo
Hongjin Qian
Zheng Liu
Ziyi Xia
Shitao Xiao
Siqi Bao
Jun Zhao
Kang Liu
187
1
0
30 Oct 2025
Towards Global Retrieval Augmented Generation: A Benchmark for Corpus-Level Reasoning
Towards Global Retrieval Augmented Generation: A Benchmark for Corpus-Level Reasoning
Qi Luo
Xiaonan Li
Tingshuo Fan
Xinchi Chen
Xipeng Qiu
RALM3DVLRM
712
1
0
30 Oct 2025
GAP: Graph-Based Agent Planning with Parallel Tool Use and Reinforcement Learning
GAP: Graph-Based Agent Planning with Parallel Tool Use and Reinforcement Learning
Jiaqi Wu
Qinlao Zhao
Zefeng Chen
Kai Qin
Yifei Zhao
Xueqian Wang
Yuhang Yao
125
5
0
29 Oct 2025
Sharpness-Guided Group Relative Policy Optimization via Probability Shaping
Sharpness-Guided Group Relative Policy Optimization via Probability Shaping
Tue Le
Nghi D.Q.Bui
Linh Ngo Van
267
0
0
29 Oct 2025
Repurposing Synthetic Data for Fine-grained Search Agent Supervision
Repurposing Synthetic Data for Fine-grained Search Agent Supervision
Yida Zhao
Kuan Li
Xixi Wu
Liwen Zhang
Dingchu Zhang
...
Xinyu Wang
Kewei Tu
Pengjun Xie
Jingren Zhou
Yong Jiang
175
5
0
28 Oct 2025
SynthWorlds: Controlled Parallel Worlds for Disentangling Reasoning and Knowledge in Language Models
SynthWorlds: Controlled Parallel Worlds for Disentangling Reasoning and Knowledge in Language Models
Ken Gu
Advait Bhat
Mike A. Merrill
Robert West
Xin Liu
Daniel J. McDuff
Tim Althoff
KELMLRM
305
2
0
28 Oct 2025
BMGQ: A Bottom-up Method for Generating Complex Multi-hop Reasoning Questions from Semi-structured Data
BMGQ: A Bottom-up Method for Generating Complex Multi-hop Reasoning Questions from Semi-structured Data
Bingsen Qiu
Zijian Liu
Xiao Liu
H. Yang
Feier Zhang
Yixuan Qin
Feier Zhang
H. Yang
Zeren Gao
RALMLRM
350
1
0
28 Oct 2025
RaCoT: Plug-and-Play Contrastive Example Generation Mechanism for Enhanced LLM Reasoning Reliability
RaCoT: Plug-and-Play Contrastive Example Generation Mechanism for Enhanced LLM Reasoning Reliability
Kaitong Cai
Jusheng Zhang
Yijia Fan
Jing Yang
Keze Wang
LRM
202
14
0
26 Oct 2025
GlobalRAG: Enhancing Global Reasoning in Multi-hop Question Answering via Reinforcement Learning
GlobalRAG: Enhancing Global Reasoning in Multi-hop Question Answering via Reinforcement Learning
Jinchang Luo
Mingquan Cheng
Fan Wan
Ni Li
Xiaoling Xia
Shuangshuang Tian
Tingcheng Bian
Haiwei Wang
Haohuan Fu
Yan Tao
ReLMRALMLRM
550
2
0
23 Oct 2025
Think Straight, Stop Smart: Structured Reasoning for Efficient Multi-Hop RAG
Think Straight, Stop Smart: Structured Reasoning for Efficient Multi-Hop RAG
Jihwan Bang
Juntae Lee
Seunghan Yang
Sungha Choi
ReLMLRM
194
0
0
22 Oct 2025
LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts
LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts
S. S. Wang
Gaokai Zhang
Li Zhang
Ning Shang
Fan Yang
Dongyao Chen
M. Yang
OffRLRALMReLMLRM
317
6
0
22 Oct 2025
Search Self-play: Pushing the Frontier of Agent Capability without Supervision
Search Self-play: Pushing the Frontier of Agent Capability without Supervision
Hongliang Lu
Yuhang Wen
Pengyu Cheng
Ruijin Ding
Haotian Xu
Jiaqi Guo
Chutian Wang
Haonan Chen
Xiaoxi Jiang
Guanjun Jiang
LRM
168
8
0
21 Oct 2025
MENTOR: A Reinforcement Learning Framework for Enabling Tool Use in Small Models via Teacher-Optimized Rewards
MENTOR: A Reinforcement Learning Framework for Enabling Tool Use in Small Models via Teacher-Optimized Rewards
Changsu Choi
Hoyun Song
Dongyeon Kim
WooHyeon Jung
Minkyung Cho
Sunjin Park
NohHyeob Bae
Seona Yu
Kyungtae Lim
215
0
0
21 Oct 2025
WebSeer: Training Deeper Search Agents through Reinforcement Learning with Self-Reflection
WebSeer: Training Deeper Search Agents through Reinforcement Learning with Self-Reflection
Guanzhong He
Zhen Yang
Jinxin Liu
Bin Xu
Lei Hou
Juanzi Li
139
3
0
21 Oct 2025
Which LLM Multi-Agent Protocol to Choose?
Which LLM Multi-Agent Protocol to Choose?
Hongyi Du
Jiaqi Su
Jisen Li
Lijie Ding
Yingxuan Yang
Peixuan Han
Xiangru Tang
Kunlun Zhu
Jiaxuan You
253
1
0
20 Oct 2025
Towards Mixed-Modal Retrieval for Universal Retrieval-Augmented Generation
Towards Mixed-Modal Retrieval for Universal Retrieval-Augmented Generation
Chenghao Zhang
Guanting Dong
Xinyu Yang
Zhicheng Dou
VLM
116
0
0
20 Oct 2025
Annotation-Efficient Universal Honesty Alignment
Annotation-Efficient Universal Honesty Alignment
Shiyu Ni
Keping Bi
Jiafeng Guo
Minghao Tang
Jingtong Wu
Zengxin Han
Xueqi Cheng
HILM
263
1
0
20 Oct 2025
1234...101112
Next
Page 1 of 12
Pageof 12