OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in
Noisy Environments

OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments

19 December 2023

Yan Zheng

Jianye Hao

Zhen Wang

Papers citing "OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments"

7 / 7 papers shown

Title
LNUCB-TA: Linear-nonlinear Hybrid Bandit Learning with Temporal Attention H. Khosravi Mohammad Reza Shafie Ahmed Shoyeb Raihan Srinjoy Das I. Imtiaz Ahmed 29 0 0 01 Mar 2025
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement Yiwen Zhu Jinyi Liu Wenya Wei Qianyi Fu Yujing Hu Zhou Fang Bo An Jianye Hao Tangjie Lv Changjie Fan 21 3 0 14 May 2024
Improving Offline-to-Online Reinforcement Learning with Q-Ensembles Kai-Wen Zhao Yi-An Ma Jianye Hao Jinyi Liu Yan Zheng Zhaopeng Meng OffRL OnRL 13 12 0 12 Jun 2023
EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model Yifu Yuan Jianye Hao Fei Ni Yao Mu Yan Zheng Yujing Hu Jinyi Liu Yingfeng Chen Changjie Fan 55 12 0 02 Oct 2022
Curious Explorer: a provable exploration strategy in Policy Learning M. Miani Maurizio Parton M. Romito 29 0 0 29 Jun 2021
Max-value Entropy Search for Efficient Bayesian Optimization Zi Wang Stefanie Jegelka 110 357 0 06 Mar 2017
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning Y. Gal Zoubin Ghahramani UQCV BDL 247 9,042 0 06 Jun 2015