Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2506.08379
Cited By
Reinforce LLM Reasoning through Multi-Agent Reflection
10 June 2025
Yurun Yuan
Tengyang Xie
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Reinforce LLM Reasoning through Multi-Agent Reflection"
12 / 12 papers shown
Title
Unlocking the Power of Multi-Agent LLM for Reasoning: From Lazy Agents to Deliberation
Zhiwei Zhang
Xiaomin Li
Yudi Lin
Hui Liu
Ramraj Chandradevan
...
Minhua Lin
Fali Wang
Xianfeng Tang
Qi He
Suhang Wang
LLMAG
LRM
235
0
0
04 Nov 2025
LANPO: Bootstrapping Language and Numerical Feedback for Reinforcement Learning in LLMs
Ang Li
Yifei Wang
Zhihang Yuan
Stefanie Jegelka
Y. X. R. Wang
ALM
KELM
170
0
0
18 Oct 2025
GraphTracer: Graph-Guided Failure Tracing in LLM Agents for Robust Multi-Turn Deep Search
Heng Zhang
Yuling Shi
Xiaodong Gu
Haochen You
Zijian Zhang
Lubin Gan
Yilei Yuan
Jin Huang
120
2
0
12 Oct 2025
MetaSynth: Multi-Agent Metadata Generation from Implicit Feedback in Black-Box Systems
Shreeranjani Srirangamsridharan
Ali Abavisani
Reza Yousefi Maragheh
Ramin Giahi
Kai Zhao
Jason H. D. Cho
Sushant Kumar
112
0
0
01 Oct 2025
Interactive Learning for LLM Reasoning
Hehai Lin
Shilei Cao
Minzhi Li
Sudong Wang
Haotian Wu
Linyi Yang
Lixian Zhang
Chengwei Qin
LLMAG
LRM
277
1
0
30 Sep 2025
Another Turn, Better Output? A Turn-Wise Analysis of Iterative LLM Prompting
Shashidhar Reddy Javaji
Bhavul Gauri
Zining Zhu
LRM
187
1
0
08 Sep 2025
Bootstrapping Task Spaces for Self-Improvement
Minqi Jiang
Andrei Lupu
Yoram Bachrach
LRM
165
2
0
04 Sep 2025
Unveiling the Latent Directions of Reflection in Large Language Models
Fu-Chieh Chang
Yu-Ting Lee
Pei-Yuan Wu
LLMSV
LRM
234
0
0
23 Aug 2025
TASER: Table Agents for Schema-guided Extraction and Recommendation
Nicole Cho
Kirsty Fielding
William Watson
Sumitra Ganesh
Manuela Veloso
LMTD
192
0
0
18 Aug 2025
From MAS to MARS: Coordination Failures and Reasoning Trade-offs in Hierarchical Multi-Agent Robotic Systems within a Healthcare Scenario
Yuanchen Bai
Zijian Ding
Shaoyue Wen
Xiang Chang
Angelique Taylor
96
1
0
06 Aug 2025
A Survey on AgentOps: Categorization, Challenges, and Future Directions
Zexin Wang
Jingjing Li
Quan Zhou
Haotian Si
Yuanhao Liu
Jianhui Li
Gaogang Xie
Fei Sun
Dan Pei
Changhua Pei
LLMAG
AI4TS
166
0
0
04 Aug 2025
Maximizing Prefix-Confidence at Test-Time Efficiently Improves Mathematical Reasoning
Matthias Otth
Jonas Hübotter
Ido Hakimi
Andreas Krause
ReLM
LRM
212
2
0
24 Jul 2025
1