ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.12491
  4. Cited By
World Model as a Graph: Learning Latent Landmarks for Planning
v1v2v3 (latest)

World Model as a Graph: Learning Latent Landmarks for Planning

International Conference on Machine Learning (ICML), 2020
25 November 2020
Lunjun Zhang
Ge Yang
Bradly C. Stadie
    DRL
ArXiv (abs)PDFHTML

Papers citing "World Model as a Graph: Learning Latent Landmarks for Planning"

50 / 51 papers shown
Incorporating Spatial Information into Goal-Conditioned Hierarchical Reinforcement Learning via Graph Representations
Incorporating Spatial Information into Goal-Conditioned Hierarchical Reinforcement Learning via Graph Representations
Shuyuan Zhang
Zihan Wang
Xiao-Wen Chang
Doina Precup
133
2
0
14 Nov 2025
Test-Time Graph Search for Goal-Conditioned Reinforcement Learning
Test-Time Graph Search for Goal-Conditioned Reinforcement Learning
Evgenii Opryshko
Junwei Quan
C. Voelcker
Yilun Du
Igor Gilitschenski
OffRL
166
3
0
08 Oct 2025
Embodied AI: From LLMs to World Models
Embodied AI: From LLMs to World Models
Tongtong Feng
Xin Wang
Yu Jiang
Wenwu Zhu
LM&Ro
451
22
0
24 Sep 2025
Subgoal-Guided Policy Heuristic Search with Learned Subgoals
Subgoal-Guided Policy Heuristic Search with Learned SubgoalsInternational Conference on Machine Learning (ICML), 2025
Jake E. Tuero
M. Buro
Levi H. S. Lelis
237
0
0
08 Jun 2025
Flattening Hierarchies with Policy Bootstrapping
Flattening Hierarchies with Policy Bootstrapping
John L. Zhou
Jonathan C. Kao
OffRL
457
1
0
20 May 2025
Option-aware Temporally Abstracted Value for Offline Goal-Conditioned Reinforcement Learning
Option-aware Temporally Abstracted Value for Offline Goal-Conditioned Reinforcement Learning
Hongjoon Ahn
Heewoong Choi
Jisu Han
Taesup Moon
OffRL
362
4
0
19 May 2025
Learning World Models for Unconstrained Goal Navigation
Learning World Models for Unconstrained Goal NavigationNeural Information Processing Systems (NeurIPS), 2024
Yuanlin Duan
Wensen Mao
He Zhu
282
9
0
03 Nov 2024
Exploring the Edges of Latent State Clusters for Goal-Conditioned
  Reinforcement Learning
Exploring the Edges of Latent State Clusters for Goal-Conditioned Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024
Yuanlin Duan
Guofeng Cui
He Zhu
OffRL
433
1
0
03 Nov 2024
GHIL-Glue: Hierarchical Control with Filtered Subgoal Images
GHIL-Glue: Hierarchical Control with Filtered Subgoal ImagesIEEE International Conference on Robotics and Automation (ICRA), 2024
Kyle Hatch
Ashwin Balakrishna
Oier Mees
Suraj Nair
Seohong Park
...
Masha Itkina
Benjamin Eysenbach
Sergey Levine
Thomas Kollar
Benjamin Burchfiel
402
9
0
26 Oct 2024
HG2P: Hippocampus-inspired High-reward Graph and Model-Free Q-Gradient Penalty for Path Planning and Motion Control
HG2P: Hippocampus-inspired High-reward Graph and Model-Free Q-Gradient Penalty for Path Planning and Motion ControlNeural Networks (NN), 2024
Haoran Wang
Yaoru Sun
Zeshen Tang
Haibo Shi
Chenyuan Jiao
354
0
0
12 Oct 2024
Making Large Language Models into World Models with Precondition and
  Effect Knowledge
Making Large Language Models into World Models with Precondition and Effect KnowledgeInternational Conference on Computational Linguistics (COLING), 2024
Kaige Xie
Ian Yang
John Gunerli
Mark Riedl
312
16
0
18 Sep 2024
Offline Imitation Learning Through Graph Search and Retrieval
Offline Imitation Learning Through Graph Search and Retrieval
Zhao-Heng Yin
Pieter Abbeel
OffRL
291
10
0
22 Jul 2024
A New View on Planning in Online Reinforcement Learning
A New View on Planning in Online Reinforcement Learning
Kevin Roice
Parham Mohammad Panahi
Scott M. Jordan
Adam White
Martha White
OffRL
339
0
0
03 Jun 2024
World Models for Autonomous Driving: An Initial Survey
World Models for Autonomous Driving: An Initial Survey
Yanchen Guan
Haicheng Liao
Zhenning Li
Jia Hu
Runze Yuan
Yunjian Li
Guohui Zhang
Chengzhong Xu
527
99
0
05 Mar 2024
Learning Top-k Subtask Planning Tree based on Discriminative
  Representation Pre-training for Decision Making
Learning Top-k Subtask Planning Tree based on Discriminative Representation Pre-training for Decision Making
Jingqing Ruan
Kaishen Wang
Qingyang Zhang
Dengpeng Xing
Bo Xu
286
1
0
18 Dec 2023
CQM: Curriculum Reinforcement Learning with a Quantized World Model
CQM: Curriculum Reinforcement Learning with a Quantized World ModelNeural Information Processing Systems (NeurIPS), 2023
Seungjae Lee
Daesol Cho
Jonghae Park
H. J. Kim
256
15
0
26 Oct 2023
Hybrid Search for Efficient Planning with Completeness Guarantees
Hybrid Search for Efficient Planning with Completeness Guarantees
Kalle Kujanpää
Joni Pajarinen
Alexander Ilin
311
6
0
19 Oct 2023
Universal Visual Decomposer: Long-Horizon Manipulation Made Easy
Universal Visual Decomposer: Long-Horizon Manipulation Made EasyIEEE International Conference on Robotics and Automation (ICRA), 2023
Zichen Zhang
Yunshuang Li
Osbert Bastani
Abhishek Gupta
Dinesh Jayaraman
Yecheng Jason Ma
Luca Weihs
331
27
0
12 Oct 2023
Consciousness-Inspired Spatio-Temporal Abstractions for Better
  Generalization in Reinforcement Learning
Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2023
Mingde Zhao
Safa Alver
H. V. Seijen
Romain Laroche
Doina Precup
Yoshua Bengio
547
5
0
30 Sep 2023
Guided Cooperation in Hierarchical Reinforcement Learning via
  Model-based Rollout
Guided Cooperation in Hierarchical Reinforcement Learning via Model-based RolloutIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Haoran Wang
Zeshen Tang
Leya Yang
Yaoru Sun
Fang Wang
Siyu Zhang
Ye-Ting Chen
342
3
0
24 Sep 2023
Balancing Exploration and Exploitation in Hierarchical Reinforcement
  Learning via Latent Landmark Graphs
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark GraphsIEEE International Joint Conference on Neural Network (IJCNN), 2023
Qingyang Zhang
Yiming Yang
Jingqing Ruan
Xuantang Xiong
Dengpeng Xing
Bo Xu
295
5
0
22 Jul 2023
HIQL: Offline Goal-Conditioned RL with Latent States as Actions
HIQL: Offline Goal-Conditioned RL with Latent States as ActionsNeural Information Processing Systems (NeurIPS), 2023
Seohong Park
Dibya Ghosh
Benjamin Eysenbach
Sergey Levine
OffRL
686
112
0
22 Jul 2023
Goal-Conditioned Reinforcement Learning with Disentanglement-based
  Reachability Planning
Goal-Conditioned Reinforcement Learning with Disentanglement-based Reachability PlanningIEEE Robotics and Automation Letters (RA-L), 2023
Zhifeng Qian
Mingyu You
Hongjun Zhou
Xuanhui Xu
Bin He
322
6
0
20 Jul 2023
Landmark Guided Active Exploration with State-specific Balance
  Coefficient
Landmark Guided Active Exploration with State-specific Balance Coefficient
Fei Cui
Jiaojiao Fang
Mengke Yang
Guizhong Liu
201
0
0
30 Jun 2023
Towards Generalist Robots: A Promising Paradigm via Generative
  Simulation
Towards Generalist Robots: A Promising Paradigm via Generative Simulation
Zhou Xian
Théophile Gervet
Zhenjia Xu
Yi-Ling Qiao
Tsun-Hsuan Wang
Yian Wang
LM&Ro
380
15
0
17 May 2023
Learning Achievement Structure for Structured Exploration in Domains
  with Sparse Reward
Learning Achievement Structure for Structured Exploration in Domains with Sparse RewardInternational Conference on Learning Representations (ICLR), 2023
Zihan Zhou
Animesh Garg
OffRL
306
4
0
30 Apr 2023
Neural Constraint Satisfaction: Hierarchical Abstraction for
  Combinatorial Generalization in Object Rearrangement
Neural Constraint Satisfaction: Hierarchical Abstraction for Combinatorial Generalization in Object Rearrangement
Michael Chang
Alyssa Dayan
Franziska Meier
Thomas Griffiths
Sergey Levine
Amy Zhang
OCLOffRL
314
9
0
20 Mar 2023
Imitating Graph-Based Planning with Goal-Conditioned Policies
Imitating Graph-Based Planning with Goal-Conditioned PoliciesInternational Conference on Learning Representations (ICLR), 2023
Junsup Kim
Younggyo Seo
SungSoo Ahn
Kyunghwan Son
Jinwoo Shin
273
16
0
20 Mar 2023
Goal-conditioned Offline Reinforcement Learning through State Space
  Partitioning
Goal-conditioned Offline Reinforcement Learning through State Space PartitioningMachine-mediated learning (ML), 2023
Mianchu Wang
Yue Jin
Giovanni Montana
OffRL
188
4
0
16 Mar 2023
Graph schemas as abstractions for transfer learning, inference, and
  planning
Graph schemas as abstractions for transfer learning, inference, and planning
J. S. Guntupalli
Rajkumar Vasudeva Raju
Shrinu Kushagra
Carter Wendelken
Daniel P. Sawyer
Ishani Deshpande
Guangyao Zhou
Miguel Lazaro-Gredilla
Dileep George
319
14
0
14 Feb 2023
Estimation of User's World Model Using Graph2vec
Estimation of User's World Model Using Graph2vec
Tatsuya Sakai
Takayuki Nagai
229
2
0
10 Jan 2023
Discrete Factorial Representations as an Abstraction for Goal
  Conditioned Reinforcement Learning
Discrete Factorial Representations as an Abstraction for Goal Conditioned Reinforcement Learning
Riashat Islam
Hongyu Zang
Anirudh Goyal
Alex Lamb
Kenji Kawaguchi
Xin-hui Li
Romain Laroche
Yoshua Bengio
Rémi Tachet des Combes
OffRLAI4CE
275
13
0
01 Nov 2022
DHRL: A Graph-Based Approach for Long-Horizon and Sparse Hierarchical
  Reinforcement Learning
DHRL: A Graph-Based Approach for Long-Horizon and Sparse Hierarchical Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022
Seungjae Lee
Jigang Kim
Inkyu Jang
H. J. Kim
OffRL
464
21
0
11 Oct 2022
Understanding Hindsight Goal Relabeling from a Divergence Minimization
  Perspective
Understanding Hindsight Goal Relabeling from a Divergence Minimization Perspective
Lunjun Zhang
Bradly C. Stadie
265
1
0
26 Sep 2022
Interaction Modeling with Multiplex Attention
Interaction Modeling with Multiplex AttentionNeural Information Processing Systems (NeurIPS), 2022
Fan-Yun Sun
Isaac Kauvar
Ruohan Zhang
Jiachen Li
Mykel Kochenderfer
Jiajun Wu
Nick Haber
194
25
0
23 Aug 2022
Value Memory Graph: A Graph-Structured World Model for Offline
  Reinforcement Learning
Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2022
Deyao Zhu
Erran L. Li
Mohamed Elhoseiny
OffRL
294
13
0
09 Jun 2022
Goal-Space Planning with Subgoal Models
Goal-Space Planning with Subgoal Models
Chun-Ping Lo
Kevin Roice
Parham Mohammad Panahi
Scott M. Jordan
Adam White
Gábor Mihucz
Farzane Aminmansour
Martha White
440
9
0
06 Jun 2022
Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal
  Search
Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal SearchInternational Conference on Learning Representations (ICLR), 2022
Michał Zawalski
Michał Tyrolski
K. Czechowski
Tomasz Odrzygó'zd'z
Damian Stachura
Piotr Pikekos
Yuhuai Wu
Lukasz Kuciñski
Piotr Milo's
LRM
637
13
0
01 Jun 2022
A Fully Controllable Agent in the Path Planning using Goal-Conditioned
  Reinforcement Learning
A Fully Controllable Agent in the Path Planning using Goal-Conditioned Reinforcement Learning
G. Lee
188
0
0
20 May 2022
Topological Experience Replay
Topological Experience ReplayInternational Conference on Learning Representations (ICLR), 2022
Zhang-Wei Hong
Tao Chen
Yen-Chen Lin
Joni Pajarinen
Pulkit Agrawal
301
21
0
29 Mar 2022
Goal-Conditioned Reinforcement Learning: Problems and Solutions
Goal-Conditioned Reinforcement Learning: Problems and SolutionsInternational Joint Conference on Artificial Intelligence (IJCAI), 2022
Minghuan Liu
Menghui Zhu
Weinan Zhang
416
199
0
20 Jan 2022
Learning Domain Invariant Representations in Goal-conditioned Block MDPs
Learning Domain Invariant Representations in Goal-conditioned Block MDPs
Beining Han
Chongyi Zheng
Harris Chan
Keiran Paster
Michael Ruogu Zhang
Jimmy Ba
OODAI4CE
361
17
0
27 Oct 2021
Landmark-Guided Subgoal Generation in Hierarchical Reinforcement
  Learning
Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning
Junsup Kim
Younggyo Seo
Jinwoo Shin
390
77
0
26 Oct 2021
Planning from Pixels in Environments with Combinatorially Hard Search
  Spaces
Planning from Pixels in Environments with Combinatorially Hard Search SpacesNeural Information Processing Systems (NeurIPS), 2021
Marco Bagatella
Miroslav Olsák
Michal Rolínek
Georg Martius
OffRL
384
10
0
12 Oct 2021
Subgoal Search For Complex Reasoning Tasks
Subgoal Search For Complex Reasoning TasksNeural Information Processing Systems (NeurIPS), 2021
K. Czechowski
Tomasz Odrzygó'zd'z
Marek Zbysiñski
Michał Zawalski
Krzysztof Olejnik
Yuhuai Wu
Lukasz Kuciñski
Piotr Milo's
ReLMLRM
336
40
0
25 Aug 2021
Goal-Conditioned Reinforcement Learning with Imagined Subgoals
Goal-Conditioned Reinforcement Learning with Imagined Subgoals
Elliot Chane-Sane
Cordelia Schmid
Ivan Laptev
359
178
0
01 Jul 2021
DisTop: Discovering a Topological representation to learn diverse and
  rewarding skills
DisTop: Discovering a Topological representation to learn diverse and rewarding skillsIEEE Transactions on Cognitive and Developmental Systems (IEEE TCDS), 2021
A. Aubret
L. Matignon
S. Hassas
211
12
0
06 Jun 2021
A Framework of Explanation Generation toward Reliable Autonomous Robots
A Framework of Explanation Generation toward Reliable Autonomous Robots
Tatsuya Sakai
Kazuki Miyazawa
Takato Horii
Takayuki Nagai
248
8
0
06 May 2021
Explainable Autonomous Robots: A Survey and Perspective
Explainable Autonomous Robots: A Survey and Perspective
Tatsuya Sakai
Takayuki Nagai
288
86
0
06 May 2021
Rapid Exploration for Open-World Navigation with Latent Goal Models
Rapid Exploration for Open-World Navigation with Latent Goal ModelsConference on Robot Learning (CoRL), 2021
Dhruv Shah
Benjamin Eysenbach
G. Kahn
Nicholas Rhinehart
Sergey Levine
652
132
0
12 Apr 2021
12
Next
Page 1 of 2