v1v2 (latest)

Value Prediction Network

11 July 2017

Papers citing "Value Prediction Network"

50 / 215 papers shown

Reinforcement Learning with Action Chunking

496

10 Jul 2025

Simple, Good, Fast: Self-Supervised World Models Free of BaggageInternational Conference on Learning Representations (ICLR), 2025

356

03 Jun 2025

Calibrated Value-Aware Model Learning with Probabilistic Environment Models

Amir-massoud Farahmand

288

28 May 2025

Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps

Linfeng Zhao

Lawson L. S. Wong

423

16 Dec 2024

Policy-shaped prediction: avoiding distractions in model-based reinforcement learningNeural Information Processing Systems (NeurIPS), 2024

Miles Hutson

Isaac Kauvar

Nick Haber

377

08 Dec 2024

Understanding World or Predicting Future? A Comprehensive Survey of World ModelsACM Computing Surveys (ACM CSUR), 2024

...

Chen Gao

Fengli Xu

Yong Li

VGen SyDa

638

21 Nov 2024

Learning World Models for Unconstrained Goal NavigationNeural Information Processing Systems (NeurIPS), 2024

Yuanlin Duan

Wensen Mao

He Zhu

287

03 Nov 2024

Prioritized Generative ReplayInternational Conference on Learning Representations (ICLR), 2024

607

23 Oct 2024

AlphaZeroES: Direct score maximization outperforms planning loss minimization

Carlos Martin

Tuomas Sandholm

203

12 Jun 2024

A New View on Planning in Online Reinforcement Learning

Kevin Roice

Parham Mohammad Panahi

Scott M. Jordan

Adam White

Martha White

OffRL

340

03 Jun 2024

BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation

Ziniu Li

335

27 May 2024

Feasibility Consistent Representation Learning for Safe Reinforcement Learning

Ding Zhao

376

20 May 2024

Offline Model-Based Optimization via Policy-Guided Gradient SearchAAAI Conference on Artificial Intelligence (AAAI), 2024

298

08 May 2024

Point Cloud Models Improve Visual Robustness in Robotic Learners

333

29 Apr 2024

Episodic Reinforcement Learning with Expanded State-reward Space

193

19 Jan 2024

Bridging State and History Representations: Understanding Self-Predictive RLInternational Conference on Learning Representations (ICLR), 2024

Pierre-Luc Bacon

482

17 Jan 2024

Adaptive Online Replanning with Diffusion Models

Chuang Gan

353

14 Oct 2023

Pixel State Value Network for Combined Prediction and Planning in Interactive Environments

197

11 Oct 2023

A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning

415

10 Oct 2023

Improving Reinforcement Learning Efficiency with Auxiliary Tasks in Non-Visual Environments: A ComparisonInternational Conference on Machine Learning, Optimization, and Data Science (MOD), 2023

Moritz Lange

Noah Krystiniak

Raphael C. Engelhardt

Wolfgang Konen

Laurenz Wiskott

OffRL

264

06 Oct 2023

RTDK-BO: High Dimensional Bayesian Optimization with Reinforced Transformer Deep kernels

Ricardo Luna Gutierrez

Ashwin Ramesh Babu

Antonio Guillen-Perez

Soumyendu Sarkar

551

05 Oct 2023

On Representation Complexity of Model-based and Model-free Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2023

457

03 Oct 2023

HarmonyDream: Task Harmonization Inside World ModelsInternational Conference on Machine Learning (ICML), 2023

Dong Li

Jianye Hao

Jianmin Wang

Mingsheng Long

237

30 Sep 2023

AI planning in the imagination: High-level planning on learned abstract search spaces

Carlos Martin

Tuomas Sandholm

239

16 Aug 2023

Thinker: Learning to Plan and ActNeural Information Processing Systems (NeurIPS), 2023

366

27 Jul 2023

λ

-models: Effective Decision-Aware Reinforcement Learning with Latent Models

Amir-massoud Farahmand

435

30 Jun 2023

Rethinking Closed-loop Training for Autonomous DrivingEuropean Conference on Computer Vision (ECCV), 2023

299

27 Jun 2023

Simplified Temporal Consistency Reinforcement LearningInternational Conference on Machine Learning (ICML), 2023

308

15 Jun 2023

Deep Generative Models for Decision-Making and Control

Michael Janner

336

15 Jun 2023

What model does MuZero learn?European Conference on Artificial Intelligence (ECAI), 2023

Jinke He

Thomas M. Moerland

F. Oliehoek

389

01 Jun 2023

Query-Policy Misalignment in Preference-Based Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2023

401

27 May 2023

Decision-Aware Actor-Critic with Function Approximation and Theoretical GuaranteesNeural Information Processing Systems (NeurIPS), 2023

Nicolas Le Roux

477

24 May 2023

Co-Learning Empirical Games and World Models

Max O. Smith

Michael P. Wellman

338

23 May 2023

Bayesian Reinforcement Learning with Limited Cognitive LoadOpen Mind (OM), 2023

258

05 May 2023

A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential Decision MakingACM Computing Surveys (ACM Comput. Surv.), 2023

Carlos Núnez-Molina

Pablo Mesejo

Juan Fernández-Olivares

545

20 Apr 2023

Planning with Sequence Models through Iterative Energy MinimizationInternational Conference on Learning Representations (ICLR), 2023

Patricio A. Vela

198

28 Mar 2023

Model-Based Reinforcement Learning with Isolated ImaginationsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

462

27 Mar 2023

Foundation Models for Decision Making: Problems, Methods, and Opportunities

Pieter Abbeel

LM&Ro OffRL LRM AI4CE

446

228

07 Mar 2023

Learning How to Infer Partial MDPs for In-Context Adaptation and Exploration

Chentian Jiang

Nan Rosemary Ke

Hado van Hasselt

415

08 Feb 2023

Minimal Value-Equivalent Partial Models for Scalable and Robust Planning in Lifelong Reinforcement Learning

Safa Alver

Doina Precup

OffRL

247

24 Jan 2023

Continuous Neural Algorithmic PlannersLOG IN (LOG IN), 2022

234

29 Nov 2022

Operator Splitting Value IterationNeural Information Processing Systems (NeurIPS), 2022

Amin Rakhsha

Andrew Wang

Mohammad Ghavamzadeh

Amir-massoud Farahmand

OffRL

220

25 Nov 2022

Reward-Predictive Clustering

Lucas Lehnert

M. Frank

Michael L. Littman

OffRL

309

07 Nov 2022

Disentangled (Un)Controllable FeaturesIEEE Symposium Series on Computational Intelligence (IEEE SSCI), 2022

Jacob E. Kooi

Mark Hoogendoorn

Vincent François-Lavet

DRL

297

31 Oct 2022

On Rate-Distortion Theory in Capacity-Limited Cognition & Reinforcement Learning

301

30 Oct 2022

Scaling up and Stabilizing Differentiable Planning with Implicit DifferentiationInternational Conference on Learning Representations (ICLR), 2022

Linfeng Zhao

Huazhe Xu

Lawson L. S. Wong

292

24 Oct 2022

Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One ObjectiveInternational Conference on Learning Representations (ICLR), 2022

Homanga Bharadhwaj

444

18 Sep 2022

Value Summation: A Novel Scoring Function for MPC-based Model-based Reinforcement Learning

Mehran Raisi

Amirhossein Noohian

Lucy McCutcheon

Saber Fallah

202

16 Sep 2022

A model-based approach to meta-Reinforcement Learning: Transformers and tree searchThe European Symposium on Artificial Neural Networks (ESANN), 2022

Brieuc Pinon

Jean-Charles Delvenne

Raphaël Jungers

OffRL

259

24 Aug 2022

Spectral Decomposition Representation for Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2022

Tianjun Zhang

269

19 Aug 2022