ExPO: Unlocking Hard Reasoning with Self-Explanation-Guided Reinforcement Learning

ExPO: Unlocking Hard Reasoning with Self-Explanation-Guided Reinforcement Learning

Ruiyang Zhou
Shuozhe Li
Amy Zhang
Liu Leqi

Papers citing "ExPO: Unlocking Hard Reasoning with Self-Explanation-Guided Reinforcement Learning"

Title
No papers