Efficient Planning in a Compact Latent Action SpaceInternational Conference on Learning Representations (ICLR), 2022 |
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent
Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022 |
Offline Reinforcement Learning as One Big Sequence Modeling ProblemNeural Information Processing Systems (NeurIPS), 2021 |
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator
Policy OptimizationNeural Information Processing Systems (NeurIPS), 2020 |
The Gambler's Problem and BeyondInternational Conference on Learning Representations (ICLR), 2019 |
Deterministic Value-Policy GradientsAAAI Conference on Artificial Intelligence (AAAI), 2019 |
The Divergence of Reinforcement Learning Algorithms with Value-Iteration
and Function ApproximationIEEE International Joint Conference on Neural Network (IJCNN), 2011 |