ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.24480
  4. Cited By
Towards Effective Code-Integrated Reasoning

Towards Effective Code-Integrated Reasoning

30 May 2025
Fei Bai
Yingqian Min
Beichen Zhang
Zhipeng Chen
Wayne Xin Zhao
Lei Fang
Zheng Liu
Zhongyuan Wang
Ji-Rong Wen
    OffRLLRM
ArXiv (abs)PDFHTML

Papers citing "Towards Effective Code-Integrated Reasoning"

8 / 8 papers shown
Title
Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning
Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning
Ran Xu
Jingjing Chen
Jiayu Ye
Yu Wu
Jun Yan
Carl Yang
Hongkun Yu
ELMLRM
226
2
0
27 Oct 2025
A$^2$FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning
A2^22FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning
Qianben Chen
Jingyi Cao
Jiayu Zhang
Tianrui Qin
Xiaowan Li
...
Xin Gui
Ge Zhang
Jian Yang
Yuchen Eleanor Jiang
Wangchunshu Zhou
LLMAGLRM
296
1
0
13 Oct 2025
How Many Code and Test Cases Are Enough? Evaluating Test Cases Generation from a Binary-Matrix Perspective
How Many Code and Test Cases Are Enough? Evaluating Test Cases Generation from a Binary-Matrix Perspective
Xianzhen Luo
Jinyang Huang
Wenzhen Zheng
Qingfu Zhu
Mingzheng Xu
Yiheng Xu
YuanTao Fan
L. Qin
Wanxiang Che
80
2
0
09 Oct 2025
Toward Effective Tool-Integrated Reasoning via Self-Evolved Preference Learning
Toward Effective Tool-Integrated Reasoning via Self-Evolved Preference Learning
Yifei Chen
Guanting Dong
Zhicheng Dou
LRM
191
2
0
27 Sep 2025
Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning
Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning
Yulei Qin
Xiaoyu Tan
Zhengbao He
Gang Li
Haojia Lin
...
Yuzheng Cai
Xuan Zhang
Sheng Ye
Ke Li
Xing Sun
343
0
0
26 Sep 2025
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
Zhenghai Xue
Longtao Zheng
Qian Liu
Yingru Li
Xiaosen Zheng
Tianhao Shen
Bo An
OffRLLRM
148
48
0
02 Sep 2025
Understanding Tool-Integrated Reasoning
Understanding Tool-Integrated Reasoning
Heng Lin
Zhongwen Xu
LRM
240
14
0
26 Aug 2025
From Trial-and-Error to Improvement: A Systematic Analysis of LLM Exploration Mechanisms in RLVR
From Trial-and-Error to Improvement: A Systematic Analysis of LLM Exploration Mechanisms in RLVR
Jia Deng
Jie Chen
Zhipeng Chen
Daixuan Cheng
Fei Bai
Beichen Zhang
Yinqian Min
Y. Gao
Wayne Xin Zhao
Ji-Rong Wen
LRM
134
0
0
11 Aug 2025
1