


Reinforce LLM Reasoning through Multi-Agent Reflection

10 June 2025
Yurun Yuan
Tengyang Xie
    LRM

Papers citing "Reinforce LLM Reasoning through Multi-Agent Reflection"

12 / 12 papers shown
Unlocking the Power of Multi-Agent LLM for Reasoning: From Lazy Agents to Deliberation
Zhiwei Zhang
Xiaomin Li
Yudi Lin
Hui Liu
Ramraj Chandradevan
...
Minhua Lin
Fali Wang
Xianfeng Tang
Qi He
Suhang Wang
LLMAG LRM
235
0
0
04 Nov 2025
LANPO: Bootstrapping Language and Numerical Feedback for Reinforcement Learning in LLMs
Ang Li
Yifei Wang
Zhihang Yuan
Stefanie Jegelka
Y. X. R. Wang
ALM KELM
170
0
0
18 Oct 2025
GraphTracer: Graph-Guided Failure Tracing in LLM Agents for Robust Multi-Turn Deep Search
Heng Zhang
Yuling Shi
Xiaodong Gu
Haochen You
Zijian Zhang
Lubin Gan
Yilei Yuan
Jin Huang
120
2
0
12 Oct 2025
MetaSynth: Multi-Agent Metadata Generation from Implicit Feedback in Black-Box Systems
Shreeranjani Srirangamsridharan
Ali Abavisani
Reza Yousefi Maragheh
Ramin Giahi
Kai Zhao
Jason H. D. Cho
Sushant Kumar
112
0
0
01 Oct 2025
Interactive Learning for LLM Reasoning
Hehai Lin
Shilei Cao
Minzhi Li
Sudong Wang
Haotian Wu
Linyi Yang
Lixian Zhang
Chengwei Qin
LLMAG LRM
277
1
0
30 Sep 2025
Another Turn, Better Output? A Turn-Wise Analysis of Iterative LLM Prompting
Shashidhar Reddy Javaji
Bhavul Gauri
Zining Zhu
LRM
187
1
0
08 Sep 2025
Bootstrapping Task Spaces for Self-Improvement
Minqi Jiang
Andrei Lupu
Yoram Bachrach
LRM
165
2
0
04 Sep 2025
Unveiling the Latent Directions of Reflection in Large Language Models
Fu-Chieh Chang
Yu-Ting Lee
Pei-Yuan Wu
LLMSV LRM
234
0
0
23 Aug 2025
TASER: Table Agents for Schema-guided Extraction and Recommendation
Nicole Cho
Kirsty Fielding
William Watson
Sumitra Ganesh
Manuela Veloso
LMTD
192
0
0
18 Aug 2025
From MAS to MARS: Coordination Failures and Reasoning Trade-offs in Hierarchical Multi-Agent Robotic Systems within a Healthcare Scenario
Yuanchen Bai
Zijian Ding
Shaoyue Wen
Xiang Chang
Angelique Taylor
96
1
0
06 Aug 2025
A Survey on AgentOps: Categorization, Challenges, and Future Directions
Zexin Wang
Jingjing Li
Quan Zhou
Haotian Si
Yuanhao Liu
Jianhui Li
Gaogang Xie
Fei Sun
Dan Pei
Changhua Pei
LLMAG AI4TS
166
0
0
04 Aug 2025
Maximizing Prefix-Confidence at Test-Time Efficiently Improves Mathematical Reasoning
Matthias Otth
Jonas Hübotter
Ido Hakimi
Andreas Krause
ReLM LRM
212
2
0
24 Jul 2025