arXiv: 2405.15177
Diffusion Actor-Critic with Entropy Regulator
24 May 2024
Yinuo Wang
Guojian Zhan
Yuxuan Jiang
Wenjun Zou
Tong Liu
Xujie Song
Wenxuan Wang
Liming Xiao
Jiang Wu
Jingliang Duan
Shengbo Eben Li
    DiffM

Papers citing "Diffusion Actor-Critic with Entropy Regulator"

24 / 24 papers shown
One-Step Generative Policies with Q-Learning: A Reformulation of MeanFlow
Zeyuan Wang
Da Li
Yulin Chen
Ye-ling Shi
Liang Bai
Tianyuan Yu
Yanwei Fu
OffRL
164
0
0
17 Nov 2025
Controllable Flow Matching for Online Reinforcement Learning
Bin Wang
Boxiang Tao
Haifeng Jing
Hongbo Dou
Zijian Wang
104
0
0
10 Nov 2025
Learning Intractable Multimodal Policies with Reparameterization and Diversity Regularization
Ziqi Wang
Jiashun Liu
L. Pan
215
0
0
03 Nov 2025
Mind Your Entropy: From Maximum Entropy to Trajectory Entropy-Constrained RL
Guojian Zhan
Likun Wang
Pengcheng Wang
Feihong Zhang
Jingliang Duan
Masayoshi Tomizuka
Shengbo Eben Li
69
0
0
25 Oct 2025
Continuous Q-Score Matching: Diffusion Guided Reinforcement Learning for Continuous-Time Control
Chengxiu Hua
Jiawen Gu
Yushun Tang
229
0
0
20 Oct 2025
A Diffusion-Refined Planner with Reinforcement Learning Priors for Confined-Space Parking
Mingyang Jiang
Yueyuan Li
Jiaru Zhang
Songan Zhang
Ming Yang
DiffM, OffRL
97
0
0
15 Oct 2025
D3P: Dynamic Denoising Diffusion Policy via Reinforcement Learning
Shu-Ang Yu
Feng Gao
Yi Wu
Chao Yu
Yu Wang
88
2
0
09 Aug 2025
One-Step Flow Policy Mirror Descent
Tianyi Chen
Haitong Ma
Na Li
Kai Wang
Bo Dai
219
0
0
31 Jul 2025
From Seeing to Experiencing: Scaling Navigation Foundation Models with Reinforcement Learning
Honglin He
Yukai Ma
Wayne Wu
Bolei Zhou
OffRL, LRM
125
5
0
29 Jul 2025
Flow-Based Policy for Online Reinforcement Learning
Lei Lv
Y. Li
Yu-Juan Luo
F. Sun
Tao Kong
Jiafeng Xu
Xiao Ma
296
7
0
15 Jun 2025
Enhanced DACER Algorithm with High Diffusion Efficiency
Yinuo Wang
Mining Tan
Wenjun Zou
Haotian Lin
Xujie Song
...
Tianze Zhu
Shiqi Liu
Jingliang Duan
Shengbo Eben Li
DiffM
285
2
0
29 May 2025
Measure gradients, not activations! Enhancing neuronal activity in deep reinforcement learning
Jiashun Liu
Zihao Wu
J. Obando-Ceron
Pablo Samuel Castro
Aaron Courville
L. Pan
184
4
0
29 May 2025
Learning Generalizable Robot Policy with Human Demonstration Video as a Prompt
Xiang Zhu
Yichen Liu
Hezhong Li
Jianyu Chen
221
1
0
27 May 2025
Confidence-Regulated Generative Diffusion Models for Reliable AI Agent Migration in Vehicular Metaverses
Yingkai Kang
Jiawen Kang
Jinbo Wen
Tianze Zhang
Zhaohui Yang
Dusit Niyato
Yan Zhang
339
2
0
19 May 2025
Adaptive Diffusion Policy Optimization for Robotic Manipulation
Huiyun Jiang
Zhuang Yang
285
0
0
13 May 2025
CHD: Coupled Hierarchical Diffusion for Long-Horizon Tasks
Ce Hao
Anxing Xiao
Zhiwei Xue
Harold Soh
459
4
0
12 May 2025
Learning a Diffusion Model Policy from Rewards via Q-Score Matching
International Conference on Machine Learning (ICML), 2023
Michael Psenka
Alejandro Escontrela
Pieter Abbeel
Yi-An Ma
DiffM
433
55
0
17 Feb 2025
Maximum Entropy Reinforcement Learning with Diffusion Policy
Xiaoyi Dong
Jian Cheng
Xinsong Zhang
401
7
0
17 Feb 2025
Habitizing Diffusion Planning for Efficient and Effective Decision Making
Haofei Lu
Yifei Shen
Dongsheng Li
Junliang Xing
Dongqi Han
390
3
0
10 Feb 2025
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
International Conference on Learning Representations (ICLR), 2025
Haque Ishfaq
Guangyuan Wang
Sami Nur Islam
Doina Precup
291
9
0
29 Jan 2025
Schedule On the Fly: Diffusion Time Prediction for Faster and Better Image Generation
Computer Vision and Pattern Recognition (CVPR), 2024
Zilyu Ye
Zhiyang Chen
Tiancheng Li
Zemin Huang
Weijian Luo
Guo-Jun Qi
DiffM
546
17
0
02 Dec 2024
AERO: Entropy-Guided Framework for Private LLM Inference
N. Jha
Brandon Reagen
462
5
0
16 Oct 2024
Sampling from Energy-based Policies using Diffusion
V. Jain
Tara Akhound-Sadegh
Siamak Ravanbakhsh
DiffM
463
5
0
02 Oct 2024
Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning
Linjiajie Fang
Ruoxue Liu
Jing Zhang
Wenjia Wang
Bing-Yi Jing
OffRL
427
13
0
31 May 2024