Title |
---|
On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and Finetuning. A. Abdolmaleki, Sandy H. Huang, Giulia Vezzani, Bobak Shahriari, Jost Tobias Springenberg, ... András Gyorgy, Csaba Szepesvári, R. Hadsell, N. Heess, Martin Riedmiller |
What Matters for Adversarial Imitation Learning? Manu Orsini, Anton Raichuk, Léonard Hussenot, Damien Vincent, Robert Dadashi, Sertan Girgin, M. Geist, Olivier Bachem, Olivier Pietquin, Marcin Andrychowicz |