ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2508.05592
  4. Cited By
MathSmith: Towards Extremely Hard Mathematical Reasoning by Forging Synthetic Problems with a Reinforced Policy
v1v2 (latest)

MathSmith: Towards Extremely Hard Mathematical Reasoning by Forging Synthetic Problems with a Reinforced Policy

7 August 2025
Shaoxiong Zhan
Yanlin Lai
Ziyu Lu
Dahua Lin
Ziqing Yang
Fei Tang
    LRM
ArXiv (abs)PDFHTML

Papers citing "MathSmith: Towards Extremely Hard Mathematical Reasoning by Forging Synthetic Problems with a Reinforced Policy"

8 / 8 papers shown
Auxiliary-Hyperparameter-Free Sampling: Entropy Equilibrium for Text Generation
Xiaodong Cai
Hai Lin
Shaoxiong Zhan
Weiqi Luo
Hong-Gee Kim
Hongyan Hao
Yu Yang
Hai-Tao Zheng
79
0
0
30 Nov 2025
BrokenMath: A Benchmark for Sycophancy in Theorem Proving with LLMs
BrokenMath: A Benchmark for Sycophancy in Theorem Proving with LLMs
Ivo Petrov
Jasper Dekoninck
Martin Vechev
153
4
0
06 Oct 2025
Socratic-Zero : Bootstrapping Reasoning via Data-Free Agent Co-evolution
Socratic-Zero : Bootstrapping Reasoning via Data-Free Agent Co-evolution
Shaobo Wang
Zhengbo Jiao
Zifan Zhang
Yilang Peng
Xu Ze
B. Yang
Wei Wang
Hu Wei
Linfeng Zhang
SyDaOffRLReLMLRMELM
291
6
0
29 Sep 2025
From Static to Dynamic: Adaptive Monte Carlo Search for Mathematical Process Supervision
From Static to Dynamic: Adaptive Monte Carlo Search for Mathematical Process Supervision
Jie Ma
Shihao Qi
Rui Xing
Ziang Yin
Bifan Wei
Jun Liu
Tongliang Liu
AI4TSLRM
166
0
0
29 Sep 2025
ScaleDiff: Scaling Difficult Problems for Advanced Mathematical Reasoning
ScaleDiff: Scaling Difficult Problems for Advanced Mathematical Reasoning
Qizhi Pei
Zhuoshi Pan
Honglin Lin
Xin Gao
Yu Li
Zinan Tang
Conghui He
Rui Yan
Lijun Wu
AIMatOffRLLRM
225
2
0
25 Sep 2025
Discovering New Theorems via LLMs with In-Context Proof Learning in Lean
Discovering New Theorems via LLMs with In-Context Proof Learning in Lean
Kazumi Kasaura
Naoto Onda
Yuta Oriike
Masaya Taniguchi
Akiyoshi Sannai
Sho Sonoda
LRM
122
0
0
16 Sep 2025
Merge-of-Thought Distillation
Merge-of-Thought Distillation
Zhanming Shen
Zeyu Qin
Zenan Huang
Hao Chen
J. Hu
Yihong Zhuang
Guoshan Lu
Gang Chen
Junbo Zhao
MoMeLRM
339
3
0
10 Sep 2025
ReSURE: Regularizing Supervision Unreliability for Multi-turn Dialogue Fine-tuning
ReSURE: Regularizing Supervision Unreliability for Multi-turn Dialogue Fine-tuning
Yiming Du
Yifan Xiang
Bin Liang
Dahua Lin
Kam-Fai Wong
Fei Tan
OffRL
179
1
0
27 Aug 2025
1
Page 1 of 1