Value Prediction Network

11 July 2017
Junhyuk Oh
Satinder Singh
Honglak Lee
arXiv:1707.03497

Papers citing "Value Prediction Network"

50 / 215 papers shown
Reinforcement Learning with Action Chunking
Qiyang Li
Zhiyuan Zhou
Sergey Levine
OffRL, OnRL
496
39
0
10 Jul 2025
Simple, Good, Fast: Self-Supervised World Models Free of Baggage
International Conference on Learning Representations (ICLR), 2025
Jan Robine
Marc Höftmann
Stefan Harmeling
DRL, OCL
356
5
0
03 Jun 2025
Calibrated Value-Aware Model Learning with Probabilistic Environment Models
C. Voelcker
Anastasiia Pedan
Arash Ahmadian
Romina Abachi
Igor Gilitschenski
Amir-massoud Farahmand
288
0
0
28 May 2025
Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps
Linfeng Zhao
Lawson L. S. Wong
423
2
0
16 Dec 2024
Policy-shaped prediction: avoiding distractions in model-based reinforcement learning
Neural Information Processing Systems (NeurIPS), 2024
Miles Hutson
Isaac Kauvar
Nick Haber
377
1
0
08 Dec 2024
Understanding World or Predicting Future? A Comprehensive Survey of World Models
ACM Computing Surveys (ACM CSUR), 2024
Jingtao Ding
Yunke Zhang
Yu Shang
Yuheng Zhang
Zefang Zong
...
Fengli Xu
Yong Li
Chen Gao
VGen, SyDa
638
17
0
21 Nov 2024
Learning World Models for Unconstrained Goal Navigation
Neural Information Processing Systems (NeurIPS), 2024
Yuanlin Duan
Wensen Mao
He Zhu
287
9
0
03 Nov 2024
Prioritized Generative Replay
International Conference on Learning Representations (ICLR), 2024
Renhao Wang
Kevin Frans
Pieter Abbeel
Sergey Levine
Alexei A. Efros
OnRL, DiffM
607
9
0
23 Oct 2024
AlphaZeroES: Direct score maximization outperforms planning loss minimization
Carlos Martin
Tuomas Sandholm
203
0
0
12 Jun 2024
A New View on Planning in Online Reinforcement Learning
Kevin Roice
Parham Mohammad Panahi
Scott M. Jordan
Adam White
Martha White
OffRL
340
0
0
03 Jun 2024
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation
Chengxing Jia
Pengyuan Wang
Ziniu Li
Yi-Chen Li
Zhilong Zhang
Nan Tang
Yang Yu
OffRL
335
3
0
27 May 2024
Feasibility Consistent Representation Learning for Safe Reinforcement Learning
Zhepeng Cen
Yi-Fan Yao
Zuxin Liu
Ding Zhao
OffRL
376
3
0
20 May 2024
Offline Model-Based Optimization via Policy-Guided Gradient Search
AAAI Conference on Artificial Intelligence (AAAI), 2024
Yassine Chemingui
Aryan Deshwal
Trong Nghia Hoang
J. Doppa
OffRL
298
21
0
08 May 2024
Point Cloud Models Improve Visual Robustness in Robotic Learners
Skand Peri
Iain Lee
Chanho Kim
Fuxin Li
Tucker Hermans
Stefan Lee
3DPC
333
16
0
29 Apr 2024
Episodic Reinforcement Learning with Expanded State-reward Space
Dayang Liang
Yaru Zhang
Yunlong Liu
OffRL
193
6
0
19 Jan 2024
Bridging State and History Representations: Understanding Self-Predictive RL
International Conference on Learning Representations (ICLR), 2024
Tianwei Ni
Benjamin Eysenbach
Erfan Seyedsalehi
Michel Ma
Clement Gehring
Aditya Mahajan
Pierre-Luc Bacon
AI4TS, AI4CE
482
48
0
17 Jan 2024
Adaptive Online Replanning with Diffusion Models
Siyuan Zhou
Yilun Du
Shun Zhang
Mengdi Xu
Yikang Shen
Wei Xiao
Dit-Yan Yeung
Chuang Gan
353
35
0
14 Oct 2023
Pixel State Value Network for Combined Prediction and Planning in Interactive Environments
Sascha Rosbach
Stefan M. Leupold
S. Großjohann
Stefan Roth
197
0
0
11 Oct 2023
A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning
Ran Wei
Nathan Lambert
Anthony D. McDonald
Alfredo Garcia
Roberto Calandra
415
9
0
10 Oct 2023
Improving Reinforcement Learning Efficiency with Auxiliary Tasks in Non-Visual Environments: A Comparison
International Conference on Machine Learning, Optimization, and Data Science (MOD), 2023
Moritz Lange
Noah Krystiniak
Raphael C. Engelhardt
Wolfgang Konen
Laurenz Wiskott
OffRL
264
2
0
06 Oct 2023
RTDK-BO: High Dimensional Bayesian Optimization with Reinforced Transformer Deep kernels
Alexander Shmakov
Avisek Naug
Vineet Gundecha
Sahand Ghorbanpour
Ricardo Luna Gutierrez
Ashwin Ramesh Babu
Antonio Guillen-Perez
Soumyendu Sarkar
551
12
0
05 Oct 2023
On Representation Complexity of Model-based and Model-free Reinforcement Learning
International Conference on Learning Representations (ICLR), 2023
Hanlin Zhu
Baihe Huang
Stuart Russell
OffRL
457
5
0
03 Oct 2023
HarmonyDream: Task Harmonization Inside World Models
International Conference on Machine Learning (ICML), 2023
Haoyu Ma
Jialong Wu
Ningya Feng
Chenjun Xiao
Dong Li
Jianye Hao
Jianmin Wang
Mingsheng Long
237
21
0
30 Sep 2023
AI planning in the imagination: High-level planning on learned abstract search spaces
Carlos Martin
Tuomas Sandholm
239
0
0
16 Aug 2023
Thinker: Learning to Plan and Act
Neural Information Processing Systems (NeurIPS), 2023
Stephen Chung
Ivan Anokhin
David M. Krueger
LLMAG, OffRL, LRM
366
12
0
27 Jul 2023
λ-models: Effective Decision-Aware Reinforcement Learning with Latent Models
C. Voelcker
Arash Ahmadian
Romina Abachi
Igor Gilitschenski
Amir-massoud Farahmand
435
0
0
30 Jun 2023
Rethinking Closed-loop Training for Autonomous Driving
European Conference on Computer Vision (ECCV), 2023
Chris Zhang
R. Guo
Wenyuan Zeng
Yuwen Xiong
Binbin Dai
Rui Hu
Mengye Ren
R. Urtasun
OffRL
299
38
0
27 Jun 2023
Simplified Temporal Consistency Reinforcement Learning
International Conference on Machine Learning (ICML), 2023
Yi Zhao
Wenshuai Zhao
Rinu Boney
Arno Solin
Joni Pajarinen
OffRL
308
17
0
15 Jun 2023
Deep Generative Models for Decision-Making and Control
Michael Janner
336
3
0
15 Jun 2023
What model does MuZero learn?
European Conference on Artificial Intelligence (ECAI), 2023
Jinke He
Thomas M. Moerland
F. Oliehoek
389
5
0
01 Jun 2023
Query-Policy Misalignment in Preference-Based Reinforcement Learning
International Conference on Learning Representations (ICLR), 2023
Xiao Hu
Jianxiong Li
Xianyuan Zhan
Qing-Shan Jia
Ya Zhang
401
15
0
27 May 2023
Decision-Aware Actor-Critic with Function Approximation and Theoretical Guarantees
Neural Information Processing Systems (NeurIPS), 2023
Sharan Vaswani
A. Kazemi
Reza Babanezhad
Nicolas Le Roux
OffRL
477
6
0
24 May 2023
Co-Learning Empirical Games and World Models
Max O. Smith
Michael P. Wellman
338
4
0
23 May 2023
Bayesian Reinforcement Learning with Limited Cognitive Load
Open Mind (OM), 2023
Dilip Arumugam
Mark K. Ho
Noah D. Goodman
Benjamin Van Roy
OffRL
258
16
0
05 May 2023
A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential Decision Making
ACM Computing Surveys (ACM Comput. Surv.), 2023
Carlos Núñez-Molina
Pablo Mesejo
Juan Fernández-Olivares
545
6
0
20 Apr 2023
Planning with Sequence Models through Iterative Energy Minimization
International Conference on Learning Representations (ICLR), 2023
Hongyi Chen
Yilun Du
Yiye Chen
J. Tenenbaum
Patricio A. Vela
198
8
0
28 Mar 2023
Model-Based Reinforcement Learning with Isolated Imaginations
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Minting Pan
Geng Chen
Yitao Zheng
Yunbo Wang
Xiaokang Yang
462
3
0
27 Mar 2023
Foundation Models for Decision Making: Problems, Methods, and Opportunities
Sherry Yang
Ofir Nachum
Yilun Du
Jason W. Wei
Pieter Abbeel
Dale Schuurmans
LM&Ro, OffRL, LRM, AI4CE
446
228
0
07 Mar 2023
Learning How to Infer Partial MDPs for In-Context Adaptation and Exploration
Chentian Jiang
Nan Rosemary Ke
Hado van Hasselt
415
4
0
08 Feb 2023
Minimal Value-Equivalent Partial Models for Scalable and Robust Planning in Lifelong Reinforcement Learning
Safa Alver
Doina Precup
OffRL
247
5
0
24 Jan 2023
Continuous Neural Algorithmic Planners
Learning on Graphs Conference (LoG), 2022
Yu He
Petar Veličković
Pietro Liò
Andreea Deac
234
7
0
29 Nov 2022
Operator Splitting Value Iteration
Neural Information Processing Systems (NeurIPS), 2022
Amin Rakhsha
Andrew Wang
Mohammad Ghavamzadeh
Amir-massoud Farahmand
OffRL
220
9
0
25 Nov 2022
Reward-Predictive Clustering
Lucas Lehnert
M. Frank
Michael L. Littman
OffRL
309
0
0
07 Nov 2022
Disentangled (Un)Controllable Features
IEEE Symposium Series on Computational Intelligence (IEEE SSCI), 2022
Jacob E. Kooi
Mark Hoogendoorn
Vincent François-Lavet
DRL
297
1
0
31 Oct 2022
On Rate-Distortion Theory in Capacity-Limited Cognition & Reinforcement Learning
Dilip Arumugam
Mark K. Ho
Noah D. Goodman
Benjamin Van Roy
301
7
0
30 Oct 2022
Scaling up and Stabilizing Differentiable Planning with Implicit Differentiation
International Conference on Learning Representations (ICLR), 2022
Linfeng Zhao
Huazhe Xu
Lawson L. S. Wong
292
10
0
24 Oct 2022
Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective
International Conference on Learning Representations (ICLR), 2022
Raj Ghugare
Homanga Bharadhwaj
Benjamin Eysenbach
Sergey Levine
Ruslan Salakhutdinov
OffRL
444
31
0
18 Sep 2022
Value Summation: A Novel Scoring Function for MPC-based Model-based Reinforcement Learning
Mehran Raisi
Amirhossein Noohian
Lucy McCutcheon
Saber Fallah
202
3
0
16 Sep 2022
A model-based approach to meta-Reinforcement Learning: Transformers and tree search
The European Symposium on Artificial Neural Networks (ESANN), 2022
Brieuc Pinon
Jean-Charles Delvenne
Raphaël Jungers
OffRL
259
4
0
24 Aug 2022
Spectral Decomposition Representation for Reinforcement Learning
International Conference on Learning Representations (ICLR), 2022
Zhaolin Ren
Tianjun Zhang
Lisa Lee
Joseph E. Gonzalez
Dale Schuurmans
Bo Dai
OffRL
269
38
0
19 Aug 2022