Title |
---|
![]() Nash Learning from Human Feedback Rémi Munos Michal Valko Daniele Calandriello M. G. Azar Mark Rowland ...Nikola Momchev Olivier Bachem D. Mankowitz Doina Precup Bilal Piot |
![]() Scalable Deep Reinforcement Learning Algorithms for Mean Field Games Mathieu Laurière Sarah Perrin Sertan Girgin Paul Muller Ayush Jain ...Georgios Piliouras Julien Pérolat Romuald Élie Olivier Pietquin Matthieu Geist |