Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1806.02315
Cited By
v1
v2
v3 (latest)
Randomized Value Functions via Multiplicative Normalizing Flows
6 June 2018
Ahmed Touati
Harsh Satija
Joshua Romoff
Joelle Pineau
Pascal Vincent
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Randomized Value Functions via Multiplicative Normalizing Flows"
22 / 22 papers shown
Trust-MARL: Trust-Based Multi-Agent Reinforcement Learning Framework for Cooperative On-Ramp Merging Control in Heterogeneous Traffic Flow
Jie Pan
Tianyi Wang
Christian Claudel
Jing Shi
116
2
0
14 Jun 2025
Learning Uncertainty-Aware Temporally-Extended Actions
Joongkyu Lee
Seung Joon Park
Yunhao Tang
Min-hwan Oh
132
3
0
08 Feb 2024
Bayesian Exploration Networks
International Conference on Machine Learning (ICML), 2023
Matt Fellows
Brandon Kaplowitz
Christian Schroeder de Witt
Shimon Whiteson
BDL
356
4
0
24 Aug 2023
Constraining cosmological parameters from N-body simulations with Variational Bayesian Neural Networks
Frontiers in Astronomy and Space Sciences (Front. Astron. Space Sci.), 2023
Héctor J. Hortúa
L. '. García
Leonardo Castañeda C.
BDL
148
6
0
09 Jan 2023
CEIP: Combining Explicit and Implicit Priors for Reinforcement Learning with Demonstrations
Neural Information Processing Systems (NeurIPS), 2022
Kai Yan
Alex Schwing
Yu-Xiong Wang
OffRL
225
3
0
18 Oct 2022
Flow-based Recurrent Belief State Learning for POMDPs
International Conference on Machine Learning (ICML), 2022
Xiaoyu Chen
Yao Mu
Ping Luo
Sheng Li
Jianyu Chen
169
25
0
23 May 2022
TO-FLOW: Efficient Continuous Normalizing Flows with Temporal Optimization adjoint with Moving Speed
Computer Vision and Pattern Recognition (CVPR), 2022
Shian Du
Yihong Luo
Wei Chen
Jian Xu
Delu Zeng
250
9
0
19 Mar 2022
Improving the Diversity of Bootstrapped DQN by Replacing Priors With Noise
IEEE Transactions on Games (IEEE Trans. Games), 2022
Li Meng
Morten Goodwin
Anis Yazidi
P. Engelstad
278
6
0
02 Mar 2022
Exploring More When It Needs in Deep Reinforcement Learning
Youtian Guo
Qitong Gao
84
0
0
28 Sep 2021
A Survey of Exploration Methods in Reinforcement Learning
Susan Amin
Maziar Gomrokchi
Harsh Satija
H. V. Hoof
Doina Precup
OffRL
295
99
0
01 Sep 2021
Bayesian Bellman Operators
Neural Information Processing Systems (NeurIPS), 2021
M. Fellows
Kristian Hartikainen
Shimon Whiteson
OffRL
167
18
0
09 Jun 2021
Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning
International Conference on Machine Learning (ICML), 2021
Yue Wu
Shuangfei Zhai
Nitish Srivastava
J. Susskind
Jian Zhang
Ruslan Salakhutdinov
Hanlin Goh
EDL
OffRL
OnRL
246
208
0
17 May 2021
Out-of-Distribution Detection of Melanoma using Normalizing Flows
M. Valiuddin
C.G.A. Viviers
OODD
136
0
0
23 Mar 2021
Parameterized Indexed Value Function for Efficient Exploration in Reinforcement Learning
AAAI Conference on Artificial Intelligence (AAAI), 2019
Tian Tan
Zhihan Xiong
Vikranth Dwaracherla
88
5
0
23 Dec 2019
Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning
T. Doan
Bogdan Mazoure
Moloud Abdar
A. Durand
Joelle Pineau
R. Devon Hjelm
191
16
0
17 Sep 2019
Benchmarking Bonus-Based Exploration Methods on the Arcade Learning Environment
Adrien Ali Taïga
W. Fedus
Marlos C. Machado
Aaron Courville
Marc G. Bellemare
224
43
0
06 Aug 2019
Stochastic Neural Network with Kronecker Flow
International Conference on Artificial Intelligence and Statistics (AISTATS), 2019
Chin-Wei Huang
Ahmed Touati
Pascal Vincent
Gintare Karolina Dziugaite
Alexandre Lacoste
Aaron Courville
BDL
141
8
0
10 Jun 2019
Worst-Case Regret Bounds for Exploration via Randomized Value Functions
Neural Information Processing Systems (NeurIPS), 2019
Daniel Russo
OffRL
241
93
0
07 Jun 2019
Randomised Bayesian Least-Squares Policy Iteration
Nikolaos Tziortziotis
Christos Dimitrakakis
Michalis Vazirgiannis
OffRL
136
2
0
06 Apr 2019
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto
David Meger
Doina Precup
OffRL
BDL
752
1,843
0
07 Dec 2018
Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning
David Janz
Jiri Hron
Przemysław Mazur
Katja Hofmann
José Miguel Hernández-Lobato
Sebastian Tschiatschek
327
54
0
15 Oct 2018
Randomized Prior Functions for Deep Reinforcement Learning
Ian Osband
John Aslanides
Albin Cassirer
UQCV
BDL
197
405
0
08 Jun 2018
1