ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.04955
  4. Cited By
Temporal Difference Learning for Model Predictive Control

Temporal Difference Learning for Model Predictive Control

9 March 2022
Nicklas Hansen
Xiaolong Wang
H. Su
    PINN
    MU
ArXivPDFHTML

Papers citing "Temporal Difference Learning for Model Predictive Control"

50 / 161 papers shown
Title
Switching Sampling Space of Model Predictive Path-Integral Controller to
  Balance Efficiency and Safety in 4WIDS Vehicle Navigation
Switching Sampling Space of Model Predictive Path-Integral Controller to Balance Efficiency and Safety in 4WIDS Vehicle Navigation
Mizuho Aoki
Kohei Honda
H. Okuda
Tatsuya Suzuki
LLMSV
24
1
0
13 Sep 2024
MPPI-Generic: A CUDA Library for Stochastic Trajectory Optimization
MPPI-Generic: A CUDA Library for Stochastic Trajectory Optimization
Bogdan I. Vlahov
Jason Gibson
Manan S. Gandhi
Evangelos A. Theodorou
18
5
0
11 Sep 2024
Offline Policy Learning via Skill-step Abstraction for Long-horizon
  Goal-Conditioned Tasks
Offline Policy Learning via Skill-step Abstraction for Long-horizon Goal-Conditioned Tasks
Donghoon Kim
Minjong Yoo
Honguk Woo
OffRL
17
0
0
21 Aug 2024
ProSpec RL: Plan Ahead, then Execute
ProSpec RL: Plan Ahead, then Execute
Liangliang Liu
Huiyu Duan
Liu Yang
Rujia Shen
Yi Lin
Chaoran Kong
Lian Yan
P. Callet
OffRL
24
0
0
31 Jul 2024
QT-TDM: Planning with Transformer Dynamics Model and Autoregressive
  Q-Learning
QT-TDM: Planning with Transformer Dynamics Model and Autoregressive Q-Learning
Mostafa Kotb
C. Weber
Muhammad Burhan Hafez
Stefan Wermter
22
0
0
26 Jul 2024
A Simulation Benchmark for Autonomous Racing with Large-Scale Human Data
A Simulation Benchmark for Autonomous Racing with Large-Scale Human Data
Adrian Remonda
Nicklas Hansen
Ayoub Raji
Nicola Musiu
Marko Bertogna
Eduardo E. Veas
Xiaolong Wang
18
5
0
23 Jul 2024
FOSP: Fine-tuning Offline Safe Policy through World Models
FOSP: Fine-tuning Offline Safe Policy through World Models
Chenyang Cao
Yucheng Xin
Silang Wu
Longxiang He
Zichen Yan
Junbo Tan
Xueqian Wang
OffRL
42
0
0
06 Jul 2024
Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style
  Reinforcement Learning
Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style Reinforcement Learning
Zakariae El Asri
Olivier Sigaud
Nicolas Thome
18
0
0
02 Jul 2024
Text-Aware Diffusion for Policy Learning
Text-Aware Diffusion for Policy Learning
Calvin Luo
Mandy He
Zilai Zeng
Chen Sun
23
4
0
02 Jul 2024
Learning Abstract World Model for Value-preserving Planning with Options
Learning Abstract World Model for Value-preserving Planning with Options
Rafael Rodríguez-Sánchez
G. Konidaris
22
1
0
22 Jun 2024
Decentralized Transformers with Centralized Aggregation are
  Sample-Efficient Multi-Agent World Models
Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models
Yang Zhang
Chenjia Bai
Bin Zhao
Junchi Yan
Xiu Li
Xuelong Li
OffRL
19
0
0
22 Jun 2024
UniZero: Generalized and Efficient Planning with Scalable Latent World Models
UniZero: Generalized and Efficient Planning with Scalable Latent World Models
Yuan Pu
Yazhe Niu
Jiyuan Ren
Zhenjie Yang
Hongsheng Li
Yu Liu
OffRL
38
1
0
15 Jun 2024
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models
  in Decision Making
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
Zibin Dong
Yifu Yuan
Jianye Hao
Fei Ni
Yi Ma
Pengyi Li
Yan Zheng
DiffM
50
7
0
13 Jun 2024
World Models with Hints of Large Language Models for Goal Achieving
World Models with Hints of Large Language Models for Goal Achieving
Zeyuan Liu
Ziyu Huan
Xiyao Wang
Jiafei Lyu
Jian Tao
Xiu Li
Furong Huang
Huazhe Xu
LM&Ro
LRM
AI4CE
29
1
0
11 Jun 2024
Optimization of geological carbon storage operations with multimodal
  latent dynamic model and deep reinforcement learning
Optimization of geological carbon storage operations with multimodal latent dynamic model and deep reinforcement learning
Zhongzheng Wang
Yuntian Chen
Guodong Chen
Dongxiao Zhang
AI4CE
16
0
0
07 Jun 2024
iQRL -- Implicitly Quantized Representations for Sample-efficient
  Reinforcement Learning
iQRL -- Implicitly Quantized Representations for Sample-efficient Reinforcement Learning
Aidan Scannell
Kalle Kujanpää
Yi Zhao
Mohammadreza Nakhaei
Arno Solin
J. Pajarinen
SSL
32
5
0
04 Jun 2024
SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for
  Embodied Manipulation
SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation
Junjie Zhang
Chenjia Bai
Haoran He
Wenke Xia
Zhigang Wang
Bin Zhao
Xiu Li
Xuelong Li
35
12
0
30 May 2024
Hierarchical World Models as Visual Whole-Body Humanoid Controllers
Hierarchical World Models as Visual Whole-Body Humanoid Controllers
Nicklas Hansen
V. JyothirS
Vlad Sobal
Yann LeCun
Xiaolong Wang
Hao Su
VGen
38
10
0
28 May 2024
A Recipe for Unbounded Data Augmentation in Visual Reinforcement
  Learning
A Recipe for Unbounded Data Augmentation in Visual Reinforcement Learning
Abdulaziz Almuzairee
Nicklas Hansen
Henrik I. Christensen
32
6
0
27 May 2024
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for
  Controllable Language Generation
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation
Chengxing Jia
Pengyuan Wang
Ziniu Li
Yi-Chen Li
Zhilong Zhang
Nan Tang
Yang Yu
OffRL
25
1
0
27 May 2024
Constrained Ensemble Exploration for Unsupervised Skill Discovery
Constrained Ensemble Exploration for Unsupervised Skill Discovery
Chenjia Bai
Rushuai Yang
Qiaosheng Zhang
Kang Xu
Yi Chen
Ting Xiao
Xuelong Li
OffRL
37
3
0
25 May 2024
Cross-Domain Policy Adaptation by Capturing Representation Mismatch
Cross-Domain Policy Adaptation by Capturing Representation Mismatch
Jiafei Lyu
Chenjia Bai
Jingwen Yang
Zongqing Lu
Xiu Li
23
8
0
24 May 2024
iVideoGPT: Interactive VideoGPTs are Scalable World Models
iVideoGPT: Interactive VideoGPTs are Scalable World Models
Jialong Wu
Shaofeng Yin
Ningya Feng
Xu He
Dong Li
Jianye Hao
Mingsheng Long
VGen
32
22
0
24 May 2024
Efficient Imitation Learning with Conservative World Models
Efficient Imitation Learning with Conservative World Models
Victor Kolev
Rafael Rafailov
Kyle Hatch
Jiajun Wu
Chelsea Finn
OffRL
27
5
0
21 May 2024
Efficient Multi-agent Reinforcement Learning by Planning
Efficient Multi-agent Reinforcement Learning by Planning
Qihan Liu
Jianing Ye
Xiaoteng Ma
Jun Yang
Bin Liang
Chongjie Zhang
27
3
0
20 May 2024
Learning Latent Dynamic Robust Representations for World Models
Learning Latent Dynamic Robust Representations for World Models
Ruixiang Sun
Hongyu Zang
Xin-hui Li
Riashat Islam
21
4
0
10 May 2024
Point Cloud Models Improve Visual Robustness in Robotic Learners
Point Cloud Models Improve Visual Robustness in Robotic Learners
Skand Peri
Iain Lee
Chanho Kim
Fuxin Li
Tucker Hermans
Stefan Lee
3DPC
34
3
0
29 Apr 2024
Model-based Reinforcement Learning for Parameterized Action Spaces
Model-based Reinforcement Learning for Parameterized Action Spaces
Renhao Zhang
Haotian Fu
Yilin Miao
G. Konidaris
13
3
0
03 Apr 2024
Learning Off-policy with Model-based Intrinsic Motivation For Active
  Online Exploration
Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
Yibo Wang
Jiang Zhao
OffRL
OnRL
18
0
0
31 Mar 2024
EfficientZero V2: Mastering Discrete and Continuous Control with Limited
  Data
EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data
Shengjie Wang
Shaohuai Liu
Weirui Ye
Jiacheng You
Yang Gao
OffRL
15
10
0
01 Mar 2024
Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter
  Lesson of Reinforcement Learning
Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
Michal Nauman
Michal Bortkiewicz
Piotr Milo's
Tomasz Trzciñski
M. Ostaszewski
Marek Cygan
OffRL
22
16
0
01 Mar 2024
PRISE: LLM-Style Sequence Compression for Learning Temporal Action
  Abstractions in Control
PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in Control
Ruijie Zheng
Ching-An Cheng
Hal Daumé
Furong Huang
Andrey Kolobov
19
9
0
16 Feb 2024
BBSEA: An Exploration of Brain-Body Synchronization for Embodied Agents
BBSEA: An Exploration of Brain-Body Synchronization for Embodied Agents
Sizhe Yang
Qian Luo
Anumpam Pani
Yanchao Yang
22
2
0
13 Feb 2024
Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask
  Representation via Temporal Action-Driven Contrastive Loss
Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss
Ruijie Zheng
Yongyuan Liang
Xiyao Wang
Shuang Ma
Hal Daumé
Huazhe Xu
John Langford
Praveen Palanisamy
Kalyan Shankar Basu
Furong Huang
32
5
0
09 Feb 2024
DiffTORI: Differentiable Trajectory Optimization for Deep Reinforcement and Imitation Learning
DiffTORI: Differentiable Trajectory Optimization for Deep Reinforcement and Imitation Learning
Weikang Wan
Ziyu Wang
Zackory M. Erickson
David Held
David Held
26
4
0
08 Feb 2024
The Essential Role of Causality in Foundation World Models for Embodied
  AI
The Essential Role of Causality in Foundation World Models for Embodied AI
Tarun Gupta
Wenbo Gong
Chao Ma
Nick Pawlowski
Agrin Hilmkil
...
Jianfeng Gao
Stefan Bauer
Danica Kragic
Bernhard Schölkopf
Cheng Zhang
28
15
0
06 Feb 2024
Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for
  Offline Reinforcement Learning
Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning
Zihan Ding
Amy Zhang
Yuandong Tian
Qinqing Zheng
OffRL
35
17
0
05 Feb 2024
Understanding What Affects Generalization Gap in Visual Reinforcement
  Learning: Theory and Empirical Evidence
Understanding What Affects Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence
Jiafei Lyu
Le Wan
Xiu Li
Zongqing Lu
CML
OffRL
28
2
0
05 Feb 2024
DiffuserLite: Towards Real-time Diffusion Planning
DiffuserLite: Towards Real-time Diffusion Planning
Zibin Dong
Jianye Hao
Yifu Yuan
Fei Ni
Yitian Wang
Pengyi Li
Yan Zheng
69
14
0
27 Jan 2024
Locality Sensitive Sparse Encoding for Learning World Models Online
Locality Sensitive Sparse Encoding for Learning World Models Online
Zi-Yan Liu
Chao Du
Wee Sun Lee
Min-Bin Lin
KELM
CLL
OffRL
18
8
0
23 Jan 2024
Bridging Evolutionary Algorithms and Reinforcement Learning: A
  Comprehensive Survey on Hybrid Algorithms
Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey on Hybrid Algorithms
Pengyi Li
Jianye Hao
Hongyao Tang
Xian Fu
Yan Zheng
Ke Tang
27
9
0
22 Jan 2024
CivRealm: A Learning and Reasoning Odyssey in Civilization for
  Decision-Making Agents
CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents
Siyuan Qi
Shuo Chen
Yexin Li
Xiangyu Kong
Junqi Wang
...
Zhaowei Zhang
Nian Liu
Wei Wang
Yaodong Yang
Song-Chun Zhu
AI4CE
LRM
19
17
0
19 Jan 2024
Learning Hybrid Policies for MPC with Application to Drone Flight in
  Unknown Dynamic Environments
Learning Hybrid Policies for MPC with Application to Drone Flight in Unknown Dynamic Environments
Zhaohan Feng
Jie Chen
Wei Xiao
Jian-jun Sun
Bin Xin
Gang Wang
27
2
0
18 Jan 2024
Bridging State and History Representations: Understanding
  Self-Predictive RL
Bridging State and History Representations: Understanding Self-Predictive RL
Tianwei Ni
Benjamin Eysenbach
Erfan Seyedsalehi
Michel Ma
Clement Gehring
Aditya Mahajan
Pierre-Luc Bacon
AI4TS
AI4CE
17
20
0
17 Jan 2024
CoVO-MPC: Theoretical Analysis of Sampling-based MPC and Optimal
  Covariance Design
CoVO-MPC: Theoretical Analysis of Sampling-based MPC and Optimal Covariance Design
Zeji Yi
Chaoyi Pan
Guanqi He
Guannan Qu
Guanya Shi
12
10
0
14 Jan 2024
TaskMet: Task-Driven Metric Learning for Model Learning
TaskMet: Task-Driven Metric Learning for Model Learning
Dishank Bansal
Ricky T. Q. Chen
Mustafa Mukadam
Brandon Amos
FedML
17
9
0
08 Dec 2023
H-GAP: Humanoid Control with a Generalist Planner
H-GAP: Humanoid Control with a Generalist Planner
Zhengyao Jiang
Yingchen Xu
Nolan Wagener
Yicheng Luo
Michael Janner
Edward Grefenstette
Tim Rocktaschel
Yuandong Tian
AI4CE
11
5
0
05 Dec 2023
Action Inference by Maximising Evidence: Zero-Shot Imitation from
  Observation with World Models
Action Inference by Maximising Evidence: Zero-Shot Imitation from Observation with World Models
Xingyuan Zhang
Philip Becker-Ehmck
Patrick van der Smagt
Maximilian Karl
23
5
0
04 Dec 2023
Learning-Augmented Scheduling for Solar-Powered Electric Vehicle
  Charging
Learning-Augmented Scheduling for Solar-Powered Electric Vehicle Charging
Tongxin Li
11
0
0
10 Nov 2023
State-Wise Safe Reinforcement Learning With Pixel Observations
State-Wise Safe Reinforcement Learning With Pixel Observations
S. Zhan
Yixuan Wang
Qingyuan Wu
Ruochen Jiao
Chao Huang
Qi Zhu
14
10
0
03 Nov 2023
Previous
1234
Next