ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.18901
  4. Cited By
Policy Optimization for Continuous Reinforcement Learning

Policy Optimization for Continuous Reinforcement Learning

30 May 2023
Hanyang Zhao
Wenpin Tang
D. Yao
    OffRL
ArXivPDFHTML

Papers citing "Policy Optimization for Continuous Reinforcement Learning"

4 / 4 papers shown
Title
Learning a Diffusion Model Policy from Rewards via Q-Score Matching
Learning a Diffusion Model Policy from Rewards via Q-Score Matching
Michael Psenka
Alejandro Escontrela
Pieter Abbeel
Yi-An Ma
DiffM
89
23
0
17 Feb 2025
Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
Hanyang Zhao
Haoxian Chen
Ji Zhang
D. Yao
Wenpin Tang
55
0
0
03 Feb 2025
Sparks of Artificial General Intelligence: Early experiments with GPT-4
Sparks of Artificial General Intelligence: Early experiments with GPT-4
Sébastien Bubeck
Varun Chandrasekaran
Ronen Eldan
J. Gehrke
Eric Horvitz
...
Scott M. Lundberg
Harsha Nori
Hamid Palangi
Marco Tulio Ribeiro
Yi Zhang
ELM
AI4MH
AI4CE
ALM
268
3,000
0
22 Mar 2023
Deep Inventory Management
Deep Inventory Management
Dhruv Madeka
Kari Torkkola
Carson Eisenach
Anna Luo
Dean Phillips Foster
Sham M. Kakade
BDL
35
15
0
06 Oct 2022
1