Better Exploration with Optimistic Actor-Critic

28 October 2019

Papers citing "Better Exploration with Optimistic Actor-Critic"

26 / 26 papers shown

Title
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation Eliot Xing Vernon Luk Jean Oh 84 0 0 16 Dec 2024
Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control Michal Nauman M. Ostaszewski Krzysztof Jankowski Piotr Milo's Marek Cygan OffRL 45 16 0 25 May 2024
Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning Michal Nauman Michal Bortkiewicz Piotr Milo's Tomasz Trzciñski M. Ostaszewski Marek Cygan OffRL 30 17 0 01 Mar 2024
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics Michal Nauman Marek Cygan 35 1 0 30 Oct 2023
Wasserstein Actor-Critic: Directed Exploration via Optimism for Continuous-Actions Control Amarildo Likmeta Matteo Sacco Alberto Maria Metelli Marcello Restelli OffRL 18 3 0 04 Mar 2023
Model-Based Uncertainty in Value Functions Carlos E. Luis A. Bottero Julia Vinogradska Felix Berkenkamp Jan Peters 36 13 0 24 Feb 2023
PAC-Bayesian Soft Actor-Critic Learning Bahareh Tasdighi Abdullah Akgul Manuel Haussmann Kenny Kazimirzak Brink M. Kandemir 34 3 0 30 Jan 2023
PPO-UE: Proximal Policy Optimization via Uncertainty-Aware Exploration Qisheng Zhang Zhen Guo A. Jøsang Lance M. Kaplan F. Chen Dong-Ho Jeong Jin-Hee Cho 25 0 0 13 Dec 2022
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size Alexander Nikulin Vladislav Kurenkov Denis Tarasov Dmitry Akimov Sergey Kolesnikov OffRL 33 14 0 20 Nov 2022
Revisiting Discrete Soft Actor-Critic Haibin Zhou Zichuan Lin Junyou Li Qiang Fu Wei Yang Deheng Ye 46 12 0 21 Sep 2022
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey A. Aubret L. Matignon S. Hassas 37 35 0 19 Sep 2022
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure Xing Chen Dongcui Diao Hechang Chen Hengshuai Yao Haiyin Piao Zhixiao Sun Zhiwei Yang Randy Goebel Bei Jiang Yi-Ju Chang OffRL 32 8 0 20 May 2022
Learning Pessimism for Robust and Efficient Off-Policy Reinforcement Learning Edoardo Cetin Oya Celiktutan OffRL 42 16 0 07 Oct 2021
Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble Gaon An Seungyong Moon Jang-Hyun Kim Hyun Oh Song OffRL 105 262 0 04 Oct 2021
Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates Romain Laroche Rémi Tachet des Combes 46 8 0 29 Sep 2021
On the Estimation Bias in Double Q-Learning Zhizhou Ren Guangxiang Zhu Haotian Hu Beining Han Jian-Hai Chen Chongjie Zhang 24 17 0 29 Sep 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain Jianye Hao Tianpei Yang Hongyao Tang Chenjia Bai Jinyi Liu Zhaopeng Meng Peng Liu Zhen Wang OffRL 36 92 0 14 Sep 2021
ADER:Adapting between Exploration and Robustness for Actor-Critic Methods Bo Zhou Kejiao Li Hongsheng Zeng Fan Wang Hao Tian OffRL 24 1 0 08 Sep 2021
A Survey of Exploration Methods in Reinforcement Learning Susan Amin Maziar Gomrokchi Harsh Satija H. V. Hoof Doina Precup OffRL 21 80 0 01 Sep 2021
Bayesian Bellman Operators M. Fellows Kristian Hartikainen Shimon Whiteson OffRL 42 15 0 09 Jun 2021
Enhanced Pub/Sub Communications for Massive IoT Traffic with SARSA Reinforcement Learning Carlos R. E. Arruda Pedro F. Moraes N. Agoulmine Joberto S. B. Martins 15 7 0 03 Jan 2021
Proximal Policy Optimization via Enhanced Exploration Efficiency Junwei Zhang Zhenghao Zhang Shuai Han Shuai Lu 29 41 0 11 Nov 2020
Softmax Deep Double Deterministic Policy Gradients Ling Pan Qingpeng Cai Longbo Huang 72 86 0 19 Oct 2020
Non-local Policy Optimization via Diversity-regularized Collaborative Exploration Zhenghao Peng Hao Sun Bolei Zhou 18 18 0 14 Jun 2020
Novel Policy Seeking with Constrained Optimization Hao Sun Zhenghao Peng Bo Dai Jian Guo Dahua Lin Bolei Zhou 24 13 0 21 May 2020
Effective Diversity in Population Based Reinforcement Learning Jack Parker-Holder Aldo Pacchiano K. Choromanski Stephen J. Roberts 14 158 0 03 Feb 2020