2508.05592
Cited By
v1
v2 (latest)
MathSmith: Towards Extremely Hard Mathematical Reasoning by Forging Synthetic Problems with a Reinforced Policy
7 August 2025
Shaoxiong Zhan
Yanlin Lai
Ziyu Lu
Dahua Lin
Ziqing Yang
Fei Tang
LRM
Papers citing
"MathSmith: Towards Extremely Hard Mathematical Reasoning by Forging Synthetic Problems with a Reinforced Policy"
8 / 8 papers shown
Auxiliary-Hyperparameter-Free Sampling: Entropy Equilibrium for Text Generation
Xiaodong Cai
Hai Lin
Shaoxiong Zhan
Weiqi Luo
Hong-Gee Kim
Hongyan Hao
Yu Yang
Hai-Tao Zheng
79
0
0
30 Nov 2025
BrokenMath: A Benchmark for Sycophancy in Theorem Proving with LLMs
Ivo Petrov
Jasper Dekoninck
Martin Vechev
153
4
0
06 Oct 2025
Socratic-Zero: Bootstrapping Reasoning via Data-Free Agent Co-evolution
Shaobo Wang
Zhengbo Jiao
Zifan Zhang
Yilang Peng
Xu Ze
B. Yang
Wei Wang
Hu Wei
Linfeng Zhang
SyDa
OffRL
ReLM
LRM
ELM
291
6
0
29 Sep 2025
From Static to Dynamic: Adaptive Monte Carlo Search for Mathematical Process Supervision
Jie Ma
Shihao Qi
Rui Xing
Ziang Yin
Bifan Wei
Jun Liu
Tongliang Liu
AI4TS
LRM
166
0
0
29 Sep 2025
ScaleDiff: Scaling Difficult Problems for Advanced Mathematical Reasoning
Qizhi Pei
Zhuoshi Pan
Honglin Lin
Xin Gao
Yu Li
Zinan Tang
Conghui He
Rui Yan
Lijun Wu
AIMat
OffRL
LRM
225
2
0
25 Sep 2025
Discovering New Theorems via LLMs with In-Context Proof Learning in Lean
Kazumi Kasaura
Naoto Onda
Yuta Oriike
Masaya Taniguchi
Akiyoshi Sannai
Sho Sonoda
LRM
122
0
0
16 Sep 2025
Merge-of-Thought Distillation
Zhanming Shen
Zeyu Qin
Zenan Huang
Hao Chen
J. Hu
Yihong Zhuang
Guoshan Lu
Gang Chen
Junbo Zhao
MoMe
LRM
339
3
0
10 Sep 2025
ReSURE: Regularizing Supervision Unreliability for Multi-turn Dialogue Fine-tuning
Yiming Du
Yifan Xiang
Bin Liang
Dahua Lin
Kam-Fai Wong
Fei Tan
OffRL
179
1
0
27 Aug 2025