ResearchTrend.AI
arXiv: 2312.11752
Learning a Diffusion Model Policy from Rewards via Q-Score Matching

17 February 2025
Michael Psenka
Alejandro Escontrela
Pieter Abbeel
Yi-An Ma
    DiffM

Papers citing "Learning a Diffusion Model Policy from Rewards via Q-Score Matching"

22 / 22 papers shown
Title
Adaptive Diffusion Policy Optimization for Robotic Manipulation
Huiyun Jiang
Zhuang Yang
9
0
0
13 May 2025
CHD: Coupled Hierarchical Diffusion for Long-Horizon Tasks
Ce Hao
Anxing Xiao
Zhiwei Xue
Harold Soh
33
0
0
12 May 2025
Quantization-Free Autoregressive Action Transformer
Ziyad Sheebaelhamd
Michael Tschannen
Michael Muehlebach
Claire Vernade
36
0
0
18 Mar 2025
COLSON: Controllable Learning-Based Social Navigation via Diffusion-Based Reinforcement Learning
Yuki Tomita
Kohei Matsumoto
Yuki Hyodo
Ryo Kurazume
56
0
0
18 Mar 2025
Uncertainty Comes for Free: Human-in-the-Loop Policies with Diffusion Models
Zhanpeng He
Yifeng Cao
M. Ciocarlie
55
0
0
26 Feb 2025
Maximum Entropy Reinforcement Learning with Diffusion Policy
Xiaoyi Dong
Jian Cheng
X. Zhang
36
0
0
17 Feb 2025
Habitizing Diffusion Planning for Efficient and Effective Decision Making
Haofei Lu
Yifei Shen
Dongsheng Li
Junliang Xing
Dongqi Han
59
0
0
10 Feb 2025
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
Haque Ishfaq
Guangyuan Wang
Sami Nur Islam
Doina Precup
41
2
0
29 Jan 2025
Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model
Xiu Yuan
Tongzhou Mu
Stone Tao
Yunhao Fang
Mengke Zhang
H. Su
OffRL
59
0
0
18 Dec 2024
Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone
Max Sobol Mark
Tian Gao
Georgia Gabriela Sampaio
M. K. Srirama
Archit Sharma
Chelsea Finn
Aviral Kumar
OffRL
OnRL
83
4
0
09 Dec 2024
Enhancing Exploration with Diffusion Policies in Hybrid Off-Policy RL: Application to Non-Prehensile Manipulation
Huy Le
Miroslav Gabriel
Tai Hoang
Gerhard Neumann
Ngo Anh Vien
99
1
0
22 Nov 2024
One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation
Zhendong Wang
Z. Li
Ajay Mandlekar
Zhenjia Xu
Jiaojiao Fan
...
Yuke Zhu
Yogesh Balaji
Mingyuan Zhou
Ming-Yu Liu
Yu Zeng
19
16
0
28 Oct 2024
Sampling from Energy-based Policies using Diffusion
V. Jain
Tara Akhound-Sadegh
Siamak Ravanbakhsh
DiffM
31
1
0
02 Oct 2024
Generalizing Consistency Policy to Visual RL with Prioritized Proximal Experience Regularization
Haoran Li
Zhennan Jiang
Yuhui Chen
Dongbin Zhao
OffRL
21
2
0
28 Sep 2024
Discrete Policy: Learning Disentangled Action Space for Multi-Task Robotic Manipulation
Kun Wu
Yichen Zhu
Jinming Li
Junjie Wen
Ning Liu
Zhiyuan Xu
Qinru Qiu
33
4
0
27 Sep 2024
Scaling Diffusion Policy in Transformer to 1 Billion Parameters for Robotic Manipulation
Minjie Zhu
Yichen Zhu
Jinming Li
Junjie Wen
Zhiyuan Xu
...
Ran Cheng
Chaomin Shen
Yaxin Peng
Feifei Feng
Jian Tang
28
13
0
22 Sep 2024
Diffusion Policy Policy Optimization
Allen Z. Ren
Justin Lidard
Lars L. Ankile
Anthony Simeonov
Pulkit Agrawal
Anirudha Majumdar
Benjamin Burchfiel
Hongkai Dai
Max Simchowitz
39
31
0
01 Sep 2024
Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient
Zechu Li
Rickmer Krohn
Tao Chen
Anurag Ajay
Pulkit Agrawal
Georgia Chalvatzaki
DiffM
42
7
0
02 Jun 2024
Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning
Linjiajie Fang
Ruoxue Liu
Jing Zhang
Wenjia Wang
Bing-Yi Jing
OffRL
46
1
0
31 May 2024
Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization
Shutong Ding
Ke Hu
Zhenhao Zhang
Kan Ren
Weinan Zhang
Jingyi Yu
Jingya Wang
Ye-ling Shi
29
6
0
25 May 2024
Diffusion Actor-Critic with Entropy Regulator
Yinuo Wang
Likun Wang
Yuxuan Jiang
Wenjun Zou
Tong Liu
...
Wenxuan Wang
Liming Xiao
Jiang Wu
Jingliang Duan
Shengbo Eben Li
DiffM
40
7
0
24 May 2024
Planning with Diffusion for Flexible Behavior Synthesis
Michael Janner
Yilun Du
J. Tenenbaum
Sergey Levine
DiffM
202
622
0
20 May 2022