2305.18901
Policy Optimization for Continuous Reinforcement Learning
30 May 2023
Hanyang Zhao
Wenpin Tang
D. Yao
OffRL
Papers citing "Policy Optimization for Continuous Reinforcement Learning"
4 / 4 papers shown
Title
Learning a Diffusion Model Policy from Rewards via Q-Score Matching
Michael Psenka
Alejandro Escontrela
Pieter Abbeel
Yi-An Ma
DiffM
89
23
0
17 Feb 2025
Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
Hanyang Zhao
Haoxian Chen
Ji Zhang
D. Yao
Wenpin Tang
55
0
0
03 Feb 2025
Sparks of Artificial General Intelligence: Early experiments with GPT-4
Sébastien Bubeck
Varun Chandrasekaran
Ronen Eldan
J. Gehrke
Eric Horvitz
...
Scott M. Lundberg
Harsha Nori
Hamid Palangi
Marco Tulio Ribeiro
Yi Zhang
ELM
AI4MH
AI4CE
ALM
268
3,000
0
22 Mar 2023
Deep Inventory Management
Dhruv Madeka
Kari Torkkola
Carson Eisenach
Anna Luo
Dean Phillips Foster
Sham M. Kakade
BDL
35
15
0
06 Oct 2022