SEER: Facilitating Structured Reasoning and Explanation via
Reinforcement LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2024 |
On the Effectiveness of Offline RL for Dialogue Response GenerationInternational Conference on Machine Learning (ICML), 2023 |