ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1903.01599
  4. Cited By
Learning Dynamics Model in Reinforcement Learning by Incorporating the
  Long Term Future
v1v2 (latest)

Learning Dynamics Model in Reinforcement Learning by Incorporating the Long Term Future

5 March 2019
Nan Rosemary Ke
Amanpreet Singh
Ahmed Touati
Anirudh Goyal
Yoshua Bengio
Devi Parikh
Dhruv Batra
ArXiv (abs)PDFHTML

Papers citing "Learning Dynamics Model in Reinforcement Learning by Incorporating the Long Term Future"

26 / 26 papers shown
Title
HAEPO: History-Aggregated Exploratory Policy Optimization
HAEPO: History-Aggregated Exploratory Policy Optimization
Gaurish Trivedi
Alakh Sharma
Kartikey Singh Bhandari
Dhruv Kumar
Pratik Narang
Jagat Sesh Challa
48
0
0
26 Aug 2025
Disentangled Representations for Causal Cognition
Disentangled Representations for Causal Cognition
Filippo Torresan
Manuel Baltieri
CML
216
4
0
30 Jun 2024
Towards Principled Representation Learning from Videos for Reinforcement
  Learning
Towards Principled Representation Learning from Videos for Reinforcement Learning
Dipendra Kumar Misra
Akanksha Saran
Tengyang Xie
Alex Lamb
John Langford
SSLOffRL
272
7
0
20 Mar 2024
Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning
  with Goal Imagination
Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination
Liangzhou Wang
Kaiwen Zhu
Fengming Zhu
Xinghu Yao
Shujie Zhang
Deheng Ye
Haobo Fu
Qiang Fu
Wei Yang
152
4
0
05 Mar 2024
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation
DREAMWALKER: Mental Planning for Continuous Vision-Language NavigationIEEE International Conference on Computer Vision (ICCV), 2023
Hanqing Wang
Wei Liang
Luc Van Gool
Wenguan Wang
LM&Ro
168
70
0
14 Aug 2023
MABL: Bi-Level Latent-Variable World Model for Sample-Efficient
  Multi-Agent Reinforcement Learning
MABL: Bi-Level Latent-Variable World Model for Sample-Efficient Multi-Agent Reinforcement LearningAdaptive Agents and Multi-Agent Systems (AAMAS), 2023
Aravind Venugopal
Stephanie Milani
Fei Fang
Balaraman Ravindran
OffRL
193
6
0
12 Apr 2023
Habits and goals in synergy: a variational Bayesian framework for
  behavior
Habits and goals in synergy: a variational Bayesian framework for behaviorNature Communications (Nat. Commun.), 2023
Dongqi Han
Kenji Doya
Dongsheng Li
Jun Tani
BDL
194
205
0
11 Apr 2023
Learning to Forecast Aleatoric and Epistemic Uncertainties over Long
  Horizon Trajectories
Learning to Forecast Aleatoric and Epistemic Uncertainties over Long Horizon TrajectoriesIEEE International Conference on Robotics and Automation (ICRA), 2023
Aastha Acharya
Rebecca L. Russell
Nisar R. Ahmed
130
7
0
17 Feb 2023
World Models and Predictive Coding for Cognitive and Developmental
  Robotics: Frontiers and Challenges
World Models and Predictive Coding for Cognitive and Developmental Robotics: Frontiers and Challenges
T. Taniguchi
Shingo Murata
Masahiro Suzuki
D. Ognibene
Pablo Lanillos
...
L. Jamone
Tomoaki Nakamura
Alejandra Ciria
B. Lara
G. Pezzulo
231
73
0
14 Jan 2023
No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL
No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL
Han Wang
Archit Sakhadeo
Adam White
James Bell
Vincent Liu
Xutong Zhao
Puer Liu
Tadashi Kozuno
Alona Fyshe
Martha White
OffRLOnRL
192
8
0
18 May 2022
Competency Assessment for Autonomous Agents using Deep Generative Models
Competency Assessment for Autonomous Agents using Deep Generative ModelsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2022
Aastha Acharya
Rebecca L. Russell
Nisar R. Ahmed
119
13
0
23 Mar 2022
Retrieval-Augmented Reinforcement Learning
Retrieval-Augmented Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022
Anirudh Goyal
A. Friesen
Andrea Banino
T. Weber
Nan Rosemary Ke
...
Michal Valko
Simon Osindero
Timothy Lillicrap
N. Heess
Charles Blundell
OffRL
370
66
0
17 Feb 2022
Bayesian sense of time in biological and artificial brains
Bayesian sense of time in biological and artificial brains
Zafeirios Fountas
Alexey Zakharov
121
1
0
14 Jan 2022
Robust Predictable Control
Robust Predictable ControlNeural Information Processing Systems (NeurIPS), 2021
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
OffRL
169
49
0
07 Sep 2021
Foresee then Evaluate: Decomposing Value Estimation with Latent Future
  Prediction
Foresee then Evaluate: Decomposing Value Estimation with Latent Future PredictionAAAI Conference on Artificial Intelligence (AAAI), 2021
Hongyao Tang
Jianye Hao
Guangyong Chen
Pengfei Chen
Chong Chen
Yaodong Yang
Jun Liu
Wulong Liu
Zhaopeng Meng
OffRL
184
5
0
03 Mar 2021
Learning Accurate Long-term Dynamics for Model-based Reinforcement
  Learning
Learning Accurate Long-term Dynamics for Model-based Reinforcement LearningIEEE Conference on Decision and Control (CDC), 2020
Nathan Lambert
Albert Wilcox
Howard Zhang
K. Pister
Roberto Calandra
179
39
0
16 Dec 2020
Neural-iLQR: A Learning-Aided Shooting Method for Trajectory
  Optimization
Neural-iLQR: A Learning-Aided Shooting Method for Trajectory OptimizationIEEE International Conference on Robotics and Biomimetics (ROBIO), 2020
Zilong Cheng
Yuling Li
Kai Chen
Jun Ma
Tong-heng Lee
213
1
0
21 Nov 2020
Episodic Memory for Learning Subjective-Timescale Models
Episodic Memory for Learning Subjective-Timescale Models
Alexey Zakharov
Matthew Crosby
Zafeirios Fountas
90
4
0
03 Oct 2020
Emergent Social Learning via Multi-agent Reinforcement Learning
Emergent Social Learning via Multi-agent Reinforcement LearningInternational Conference on Machine Learning (ICML), 2020
Kamal Ndousse
Douglas Eck
Sergey Levine
Natasha Jaques
233
52
0
01 Oct 2020
Model-based Reinforcement Learning: A Survey
Model-based Reinforcement Learning: A Survey
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
OffRL
407
63
0
30 Jun 2020
Objective Mismatch in Model-based Reinforcement Learning
Objective Mismatch in Model-based Reinforcement LearningConference on Learning for Dynamics & Control (L4DC), 2020
Nathan Lambert
Brandon Amos
Omry Yadan
Roberto Calandra
OffRL
308
108
0
11 Feb 2020
Variational Recurrent Models for Solving Partially Observable Control
  Tasks
Variational Recurrent Models for Solving Partially Observable Control TasksInternational Conference on Learning Representations (ICLR), 2019
Dongqi Han
Kenji Doya
Jun Tani
DRLOffRL
131
71
0
23 Dec 2019
Shaping Belief States with Generative Environment Models for RL
Shaping Belief States with Generative Environment Models for RLNeural Information Processing Systems (NeurIPS), 2019
Karol Gregor
Danilo Jimenez Rezende
F. Besse
Yan Wu
Hamza Merzic
Aaron van den Oord
OffRLAI4CE
355
121
0
21 Jun 2019
Disentangling Dynamics and Returns: Value Function Decomposition with
  Future Prediction
Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction
Hongyao Tang
Jianye Hao
Guangyong Chen
Pengfei Chen
Zhaopeng Meng
Yaodong Yang
Li Wang
68
3
0
27 May 2019
Meta reinforcement learning as task inference
Meta reinforcement learning as task inference
Jan Humplik
Alexandre Galashov
Leonard Hasenclever
Pedro A. Ortega
Yee Whye Teh
N. Heess
OffRL
330
134
0
15 May 2019
Making the V in VQA Matter: Elevating the Role of Image Understanding in
  Visual Question Answering
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
Yash Goyal
Tejas Khot
D. Summers-Stay
Dhruv Batra
Devi Parikh
CoGe
924
3,734
0
02 Dec 2016
1