ResearchTrend.AI
Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models


6 February 2025
Xiao-Wen Yang
Xuan-Yi Zhu
Wen-Da Wei
Ding-Chu Zhang
Jie-Jing Shao
Zhi Zhou
Lan-Zhe Guo
Yu-Feng Li
KELM, LRM

Papers citing "Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models"

5 / 5 papers shown
How Much Backtracking is Enough? Exploring the Interplay of SFT and RL in Enhancing LLM Reasoning
Hongyi Cai
Junlin Wang
Xiaoyin Chen
Bhuwan Dhingra
LRM
26
0
0
30 May 2025
A Survey of Generative Categories and Techniques in Multimodal Large Language Models
Longzhen Han
Awes Mubarak
Almas Baimagambetov
Nikolaos Polatidis
Thar Baker
LRM
51
0
0
29 May 2025
Large Language Models for Planning: A Comprehensive and Systematic Survey
Pengfei Cao
Tianyi Men
Wencan Liu
Jingwen Zhang
Xuzhao Li
Xixun Lin
Dianbo Sui
Yanan Cao
Kang Liu
Jun Zhao
LLMAG, LM&Ro, OffRL, ELM, LRM
131
0
0
26 May 2025
First Finish Search: Efficient Test-Time Scaling in Large Language Models
Aradhye Agarwal
Ayan Sengupta
Tanmoy Chakraborty
ReLM, RALM, ALM, LRM
111
0
0
23 May 2025
Enhancing Web Agents with Explicit Rollback Mechanisms
Zizhuo Zhang
Tianqing Fang
Kaixin Ma
Wenhao Yu
Han Zhang
Haitao Mi
Dong Yu
KELM
130
3
0
16 Apr 2025