
Title |
|---|
Pitfall of Optimism: Distributional Reinforcement Learning by Randomizing Risk Criterion. Neural Information Processing Systems (NeurIPS), 2023 |
Variance Control for Distributional Reinforcement Learning. International Conference on Machine Learning (ICML), 2023 |
An Alternative to Variance: Gini Deviation for Risk-averse Policy Gradient. Neural Information Processing Systems (NeurIPS), 2023 |
Toward Risk-based Optimistic Exploration for Cooperative Multi-Agent Reinforcement Learning. Adaptive Agents and Multi-Agent Systems (AAMAS), 2023 |
Distributional Actor-Critic Ensemble for Uncertainty-Aware Continuous Control. IEEE International Joint Conference on Neural Networks (IJCNN), 2022 |
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure. AAAI Conference on Artificial Intelligence (AAAI), 2022 |