B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught ReasonersInternational Conference on Learning Representations (ICLR), 2024 |
Automatic Curriculum Expert Iteration for Reliable LLM ReasoningInternational Conference on Learning Representations (ICLR), 2024 |
ReGenesis: LLMs can Grow into Reasoning Generalists via Self-ImprovementInternational Conference on Learning Representations (ICLR), 2024 |
Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling LawsInternational Conference on Machine Learning (ICML), 2023 |