2408.06195
Cited By
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers
12 August 2024
Zhenting Qi
Mingyuan Ma
Jiahang Xu
Li Zhang
Fan Yang
Mao Yang
ReLM
LRM
Papers citing
"Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers"
14 / 14 papers shown
Title
Self-Critique Guided Iterative Reasoning for Multi-hop Question Answering
Zheng Chu
H. Fan
Jingchang Chen
Qianyu Wang
M. Yang
...
Zhongjie Wang
Hao Li
Guo Tang
Ming Liu
Bing Qin
ReLM
LRM
57
0
0
25 May 2025
When Thinking Fails: The Pitfalls of Reasoning for Instruction-Following in LLMs
Xiaomin Li
Zhou Yu
Zhiwei Zhang
Xupeng Chen
Ziji Zhang
Yingying Zhuang
Narayanan Sadagopan
Anurag Beniwal
LRM
57
1
0
16 May 2025
Search and Refine During Think: Autonomous Retrieval-Augmented Reasoning of LLMs
Yaorui Shi
Shihan Li
Chang Wu
Zhiyuan Liu
Sihang Li
Hengxing Cai
An Zhang
Xiang Wang
ReLM
LRM
85
0
0
16 May 2025
HyperTree Planning: Enhancing LLM Reasoning via Hierarchical Thinking
Runquan Gui
Ziyi Wang
Jun Wang
Chi Ma
Huiling Zhen
Mingxuan Yuan
Jianye Hao
Defu Lian
Enhong Chen
Feng Wu
LRM
244
0
0
05 May 2025
SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning
Jiaqi Chen
Bang Zhang
Ruotian Ma
Peisong Wang
Xiaodan Liang
Zhaopeng Tu
Xuzhao Li
Kwan-Yee K. Wong
LLMAG
ReLM
LRM
119
2
0
27 Apr 2025
IRIS: Interactive Research Ideation System for Accelerating Scientific Discovery
Aniketh Garikaparthi
Manasi Patwardhan
Lovekesh Vig
Arman Cohan
VLM
LRM
85
0
0
23 Apr 2025
From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs
Jiliang Ni
Jiachen Pu
Zhongyi Yang
Kun Zhou
Hui Wang
Xiaoliang Xiao
Dakui Wang
Xin Li
Jingfeng Luo
Conggang Hu
59
0
0
18 Apr 2025
Scaling Test-Time Inference with Policy-Optimized, Dynamic Retrieval-Augmented Generation via KV Caching and Decoding
Sakhinana Sagar Srinivas
Akash Das
Shivam Gupta
Venkataramana Runkana
OffRL
80
1
0
02 Apr 2025
Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning
Xinghao Chen
Zhijing Sun
Wenjin Guo
Miaoran Zhang
Yanjun Chen
...
Hui Su
Yijie Pan
Dietrich Klakow
Wenjie Li
Xiaoyu Shen
LRM
82
6
0
25 Feb 2025
Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling
Yiwen Ding
Zhiheng Xi
Wei He
Zhuoyuan Li
Yitao Zhai
Xiaowei Shi
Xunliang Cai
Tao Gui
Qi Zhang
Xuanjing Huang
LRM
108
4
0
24 Feb 2025
Policy Guided Tree Search for Enhanced LLM Reasoning
Yang Li
LRM
130
0
0
04 Feb 2025
Boosting Multimodal Reasoning with Automated Structured Thinking
Jinyang Wu
Mingkuan Feng
Shuai Zhang
Ruihan Jin
Feihu Che
Zengqi Wen
J. Tao
Jianhua Tao
LRM
130
11
0
04 Feb 2025
Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains
Xu Chu
Zhijie Tan
Hanlin Xue
Guanyu Wang
Tong Mo
Weiping Li
LRM
ELM
78
2
0
24 Jan 2025
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Zayne Sprague
Fangcong Yin
Juan Diego Rodriguez
Dongwei Jiang
Manya Wadhwa
Prasann Singhal
Xinyu Zhao
Xi Ye
Kyle Mahowald
Greg Durrett
ReLM
LRM
155
101
0
18 Sep 2024