ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2306.14079
  4. Cited By
Fighting Uncertainty with Gradients: Offline Reinforcement Learning via
  Diffusion Score Matching

Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching

24 June 2023
H. Suh
Glen Chou
Hongkai Dai
Lujie Yang
Abhishek Gupta
Russ Tedrake
    DiffM
    OffRL
ArXivPDFHTML

Papers citing "Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching"

8 / 8 papers shown
Title
Learning a Diffusion Model Policy from Rewards via Q-Score Matching
Learning a Diffusion Model Policy from Rewards via Q-Score Matching
Michael Psenka
Alejandro Escontrela
Pieter Abbeel
Yi-An Ma
DiffM
89
22
0
17 Feb 2025
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
Jing Zhang
Linjiajie Fang
Kexin Shi
Wenjia Wang
Bing-Yi Jing
OffRL
31
0
0
27 Oct 2024
Planning with Diffusion for Flexible Behavior Synthesis
Planning with Diffusion for Flexible Behavior Synthesis
Michael Janner
Yilun Du
J. Tenenbaum
Sergey Levine
DiffM
202
622
0
20 May 2022
Offline Reinforcement Learning with Implicit Q-Learning
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
212
832
0
12 Oct 2021
Bundled Gradients through Contact via Randomized Smoothing
Bundled Gradients through Contact via Randomized Smoothing
H. Suh
Tao Pang
Russ Tedrake
76
52
0
11 Sep 2021
Model Error Propagation via Learned Contraction Metrics for Safe
  Feedback Motion Planning of Unknown Systems
Model Error Propagation via Learned Contraction Metrics for Safe Feedback Motion Planning of Unknown Systems
Glen Chou
N. Ozay
Dmitry Berenson
16
25
0
18 Apr 2021
COMBO: Conservative Offline Model-Based Policy Optimization
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
214
413
0
16 Feb 2021
Deep Dynamics Models for Learning Dexterous Manipulation
Deep Dynamics Models for Learning Dexterous Manipulation
Anusha Nagabandi
K. Konolige
Sergey Levine
Vikash Kumar
143
407
0
25 Sep 2019
1