
![]() Utilizing Reinforcement Learning for de novo Drug DesignMachine-mediated learning (ML), 2023 |
![]() Never Give Up: Learning Directed Exploration StrategiesInternational Conference on Learning Representations (ICLR), 2020 Adria Puigdomenech Badia Pablo Sprechmann Alex Vitvitskyi Daniel Guo Bilal Piot ...O. Tieleman Martín Arjovsky Alexander Pritzel Andew Bolt Charles Blundell |
![]() Unifying Count-Based Exploration and Intrinsic MotivationNeural Information Processing Systems (NeurIPS), 2016 |
![]() Adam: A Method for Stochastic OptimizationInternational Conference on Learning Representations (ICLR), 2014 Diederik P. Kingma Jimmy Ba |
![]() The KL-UCB Algorithm for Bounded Stochastic Bandits and BeyondAnnual Conference Computational Learning Theory (COLT), 2011 |