Data-Driven Policy Mapping for Safe RL-based Energy Management SystemsEnergy Reports (Energy Rep.), 2025 |
TreeRL: LLM Reinforcement Learning with On-Policy Tree SearchAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
Subgoal-Guided Policy Heuristic Search with Learned SubgoalsInternational Conference on Machine Learning (ICML), 2025 |