All Papers
0 / 0 papers shown
Title |
|---|
Title |
|---|

Title |
|---|
![]() Can Large Models Teach Student Models to Solve Mathematical Problems Like Human Beings? A Reasoning Distillation Method via Multi-LoRA InteractionInternational Joint Conference on Artificial Intelligence (IJCAI), 2025 |
BPO: Towards Balanced Preference Optimization between Knowledge Breadth and Depth in AlignmentNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024 |
![]() Preference Optimization for Reasoning with Pseudo FeedbackInternational Conference on Learning Representations (ICLR), 2024 |
![]() Self-Training Large Language Models for Tool-Use Without DemonstrationsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025 |
![]() WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-InstructInternational Conference on Learning Representations (ICLR), 2023 |