Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2502.04404
Cited By
Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models
6 February 2025
Xiao-Wen Yang
Xuan-Yi Zhu
Wen-Da Wei
Ding-Chu Zhang
Jie-Jing Shao
Zhi Zhou
Lan-Zhe Guo
Yu-Feng Li
KELM
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models"
5 / 5 papers shown
Title
How Much Backtracking is Enough? Exploring the Interplay of SFT and RL in Enhancing LLM Reasoning
Hongyi Cai
Junlin Wang
Xiaoyin Chen
Bhuwan Dhingra
LRM
22
0
0
30 May 2025
A Survey of Generative Categories and Techniques in Multimodal Large Language Models
Longzhen Han
Awes Mubarak
Almas Baimagambetov
Nikolaos Polatidis
Thar Baker
LRM
49
0
0
29 May 2025
Large Language Models for Planning: A Comprehensive and Systematic Survey
Pengfei Cao
Tianyi Men
Wencan Liu
Jingwen Zhang
Xuzhao Li
Xixun Lin
Dianbo Sui
Yanan Cao
Kang Liu
Jun Zhao
LLMAG
LM&Ro
OffRL
ELM
LRM
125
0
0
26 May 2025
First Finish Search: Efficient Test-Time Scaling in Large Language Models
Aradhye Agarwal
Ayan Sengupta
Tanmoy Chakraborty
ReLM
RALM
ALM
LRM
111
0
0
23 May 2025
Enhancing Web Agents with Explicit Rollback Mechanisms
Zizhuo Zhang
Tianqing Fang
Kaixin Ma
Wenhao Yu
Han Zhang
Haitao Mi
Dong Yu
KELM
130
3
0
16 Apr 2025
1