Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2106.10316
Cited By
Proper Value Equivalence
18 June 2021
Christopher Grimm
André Barreto
Gregory Farquhar
David Silver
Satinder Singh
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Proper Value Equivalence"
12 / 12 papers shown
Title
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
C. Voelcker
Marcel Hussing
Eric Eaton
Amir-massoud Farahmand
Igor Gilitschenski
46
2
0
11 Oct 2024
Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning
Bradley Burega
John D. Martin
Luke Kapeluck
Michael Bowling
40
0
0
27 Jun 2024
Pixel State Value Network for Combined Prediction and Planning in Interactive Environments
Sascha Rosbach
Stefan M. Leupold
S. Großjohann
Stefan Roth
27
0
0
11 Oct 2023
Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Prakash Panangaden
S. Rezaei-Shoshtari
Rosie Zhao
David Meger
Doina Precup
38
2
0
09 May 2023
Goal-oriented inference of environment from redundant observations
Kazuki Takahashi
T. Fukai
Y. Sakai
T. Takekawa
19
0
0
08 May 2023
Bayesian Reinforcement Learning with Limited Cognitive Load
Dilip Arumugam
Mark K. Ho
Noah D. Goodman
Benjamin Van Roy
OffRL
36
8
0
05 May 2023
Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function
Ruijie Zheng
Xiyao Wang
Huazhe Xu
Furong Huang
48
14
0
02 Feb 2023
On Rate-Distortion Theory in Capacity-Limited Cognition & Reinforcement Learning
Dilip Arumugam
Mark K. Ho
Noah D. Goodman
Benjamin Van Roy
31
4
0
30 Oct 2022
Continuous MDP Homomorphisms and Homomorphic Policy Gradient
S. Rezaei-Shoshtari
Rosie Zhao
Prakash Panangaden
David Meger
Doina Precup
35
18
0
15 Sep 2022
Integrating Symmetry into Differentiable Planning with Steerable Convolutions
Linfeng Zhao
Xu Zhu
Lingzhi Kong
Robin Walters
Lawson L. S. Wong
26
7
0
08 Jun 2022
Between Rate-Distortion Theory & Value Equivalence in Model-Based Reinforcement Learning
Dilip Arumugam
Benjamin Van Roy
OffRL
38
1
0
04 Jun 2022
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Angelos Filos
Eszter Vértes
Zita Marinho
Gregory Farquhar
Diana Borsa
A. Friesen
Feryal M. P. Behbahani
Tom Schaul
André Barreto
Simon Osindero
44
7
0
08 Dec 2021
1