2504.04718
Cited By
T1: Tool-integrated Self-verification for Test-time Compute Scaling in Small Language Models
7 April 2025
Minki Kang
Jongwon Jeong
Jaewoong Cho
ALM
LRM
Papers citing
"T1: Tool-integrated Self-verification for Test-time Compute Scaling in Small Language Models"
2 / 2 papers shown
ToolRL: Reward is All Tool Learning Needs
Cheng Qian
Emre Can Acikgoz
Qi He
Hongru Wang
X. Chen
Dilek Hakkani-Tür
Gökhan Tür
Heng Ji
OffRL
LRM
22
3
0
16 Apr 2025
Heimdall: test-time scaling on the generative verification
Wenlei Shi
Xing Jin
LRM
18
0
0
14 Apr 2025