ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.12145
  4. Cited By
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in
  Noisy Environments

OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments

19 December 2023
Jinyi Liu
Zhi Wang
Yan Zheng
Jianye Hao
Chenjia Bai
Junjie Ye
Zhen Wang
Haiyin Piao
Yang Sun
ArXivPDFHTML

Papers citing "OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments"

7 / 7 papers shown
Title
LNUCB-TA: Linear-nonlinear Hybrid Bandit Learning with Temporal Attention
H. Khosravi
Mohammad Reza Shafie
Ahmed Shoyeb Raihan
Srinjoy Das
I. Imtiaz Ahmed
29
0
0
01 Mar 2025
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of
  Gradient Directions for Policy Improvement
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement
Yiwen Zhu
Jinyi Liu
Wenya Wei
Qianyi Fu
Yujing Hu
Zhou Fang
Bo An
Jianye Hao
Tangjie Lv
Changjie Fan
21
3
0
14 May 2024
Improving Offline-to-Online Reinforcement Learning with Q-Ensembles
Improving Offline-to-Online Reinforcement Learning with Q-Ensembles
Kai-Wen Zhao
Yi-An Ma
Jianye Hao
Jinyi Liu
Yan Zheng
Zhaopeng Meng
OffRL
OnRL
13
12
0
12 Jun 2023
EUCLID: Towards Efficient Unsupervised Reinforcement Learning with
  Multi-choice Dynamics Model
EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model
Yifu Yuan
Jianye Hao
Fei Ni
Yao Mu
Yan Zheng
Yujing Hu
Jinyi Liu
Yingfeng Chen
Changjie Fan
55
12
0
02 Oct 2022
Curious Explorer: a provable exploration strategy in Policy Learning
Curious Explorer: a provable exploration strategy in Policy Learning
M. Miani
Maurizio Parton
M. Romito
29
0
0
29 Jun 2021
Max-value Entropy Search for Efficient Bayesian Optimization
Max-value Entropy Search for Efficient Bayesian Optimization
Zi Wang
Stefanie Jegelka
110
357
0
06 Mar 2017
Dropout as a Bayesian Approximation: Representing Model Uncertainty in
  Deep Learning
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning
Y. Gal
Zoubin Ghahramani
UQCV
BDL
247
9,042
0
06 Jun 2015
1