ResearchTrend.AI
Residual-MPPI: Online Policy Customization for Continuous Control


1 July 2024
Pengcheng Wang
Chenran Li
Catherine Weaver
Kenta Kawamoto
M. Tomizuka
Chen Tang
Wei Zhan
    OffRL

Papers citing "Residual-MPPI: Online Policy Customization for Continuous Control"

6 / 6 papers shown
Residual Policy Gradient: A Reward View of KL-regularized Objective
Pengcheng Wang
Xinghao Zhu
Yuxin Chen
Chenfeng Xu
M. Tomizuka
Chenran Li
36
0
0
14 Mar 2025
MPPI-Generic: A CUDA Library for Stochastic Trajectory Optimization
Bogdan I. Vlahov
Jason Gibson
Manan S. Gandhi
Evangelos A. Theodorou
21
5
0
11 Sep 2024
Planning with Diffusion for Flexible Behavior Synthesis
Michael Janner
Yilun Du
J. Tenenbaum
Sergey Levine
DiffM
202
622
0
20 May 2022
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
212
832
0
12 Oct 2021
What Matters in Learning from Offline Human Demonstrations for Robot Manipulation
Ajay Mandlekar
Danfei Xu
J. Wong
Soroush Nasiriany
Chen Wang
Rohun Kulkarni
Li Fei-Fei
Silvio Savarese
Yuke Zhu
Roberto Martín-Martín
OffRL
147
461
0
06 Aug 2021
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
275
1,561
0
18 Sep 2019