How Much Backtracking is Enough? Exploring the Interplay of SFT and RL in Enhancing LLM Reasoning

How Much Backtracking is Enough? Exploring the Interplay of SFT and RL in Enhancing LLM Reasoning

    LRM

Papers citing "How Much Backtracking is Enough? Exploring the Interplay of SFT and RL in Enhancing LLM Reasoning"

Title
No papers