Title
Dynamic Action Interpolation: A Universal Approach for Accelerating Reinforcement Learning with Expert Guidance Wenjun Cao 52 0 0 26 Apr 2025
Object-Aware DINO (Oh-A-Dino): Enhancing Self-Supervised Representations for Multi-Object Instance Retrieval Stefan Sylvius Wagner Stefan Harmeling OCL 76 0 0 12 Mar 2025
Adaptive trajectory-constrained exploration strategy for deep reinforcement learning Guojian Wang Faguo Wu Xiao Zhang Ning Guo Zhiming Zheng 30 3 0 27 Dec 2023
Extended Intelligence D. Barack Andrew Jaegle 33 5 0 15 Sep 2022
BYOL-Explore: Exploration by Bootstrapped Prediction Z. Guo S. Thakoor Miruna Pislar Bernardo Avila-Pires Florent Altché ... Yunhao Tang Michal Valko Rémi Munos M. G. Azar Bilal Piot 22 68 0 16 Jun 2022
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress Rishabh Agarwal Max Schwarzer Pablo Samuel Castro Aaron C. Courville Marc G. Bellemare OffRL OnRL 28 63 0 03 Jun 2022
Exploration in Deep Reinforcement Learning: A Survey Pawel Ladosz Lilian Weng Minwoo Kim H. Oh OffRL 23 324 0 02 May 2022
RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning Sabela Ramos Sertan Girgin Léonard Hussenot Damien Vincent Hanna Yakubovich ... Piotr Stańczyk Raphaël Marinier Jeremiah Harmsen Olivier Pietquin Nikola Momchev OffRL 35 23 0 04 Nov 2021
Hit and Lead Discovery with Explorative RL and Fragment-based Molecule Generation Soojung Yang Doyeong Hwang Seul Lee Seongok Ryu Sung Ju Hwang 34 67 0 04 Oct 2021
A Minimalist Approach to Offline Reinforcement Learning Scott Fujimoto S. Gu OffRL 55 780 0 12 Jun 2021
What Matters for Adversarial Imitation Learning? Manu Orsini Anton Raichuk Léonard Hussenot Damien Vincent Robert Dadashi Sertan Girgin M. Geist Olivier Bachem Olivier Pietquin Marcin Andrychowicz 42 77 0 01 Jun 2021
Did I do that? Blame as a means to identify controlled effects in reinforcement learning Oriol Corcoll Youssef Mohamed Raul Vicente 18 3 0 01 Jun 2021
Offline Learning from Demonstrations and Unlabeled Experience Konrad Zolna Alexander Novikov Ksenia Konyushkova Çağlar Gülçehre Ziyun Wang Y. Aytar Misha Denil Nando de Freitas Scott E. Reed SSL OffRL 32 66 0 27 Nov 2020
Avoiding Tampering Incentives in Deep RL via Decoupled Approval J. Uesato Ramana Kumar Victoria Krakovna Tom Everitt Richard Ngo Shane Legg 26 14 0 17 Nov 2020
Forgetful Experience Replay in Hierarchical Reinforcement Learning from Demonstrations Alexey Skrynnik A. Staroverov Ermek Aitygulov Kirill Aksenov Vasilii Davydov Aleksandr I. Panov OffRL 16 4 0 17 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning Matthew W. Hoffman Bobak Shahriari John Aslanides Gabriel Barth-Maron Nikola Momchev ... Srivatsan Srinivasan A. Cowie Ziyun Wang Bilal Piot Nando de Freitas 60 225 0 01 Jun 2020
Unity: A General Platform for Intelligent Agents Arthur Juliani Vincent-Pierre Berges Esh Vckay Andrew Cohen Jonathan Harper ... Chris Goy Yuan Gao Hunter Henry Marwan Mattar Danny Lange 16 808 0 07 Sep 2018