1905.09855
Distributional Policy Optimization: An Alternative Approach for Continuous Control
Neural Information Processing Systems (NeurIPS), 2019
23 May 2019
Chen Tessler
Guy Tennenholtz
Shie Mannor
OffRL
Papers citing
"Distributional Policy Optimization: An Alternative Approach for Continuous Control"
27 / 27 papers shown
Adaptive Nesterov Accelerated Distributional Deep Hedging for Efficient Volatility Risk Management
Lei Zhao
Lin Cai
Wu-Sheng Lu
246
0
0
25 Feb 2025
Learning in complex action spaces without policy gradients
Arash Tavakoli
Sina Ghiassian
Nemanja Rakićević
OffRL
283
0
0
08 Oct 2024
Predicting Long-Term Human Behaviors in Discrete Representations via Physics-Guided Diffusion
Zhitian Zhang
Anjian Li
Angelica Lim
Mo Chen
415
5
0
29 May 2024
Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning
Conference on Robot Learning (CoRL), 2023
Jianlan Luo
Perry Dong
Jeffrey Wu
Aviral Kumar
Xinyang Geng
Sergey Levine
OffRL
337
40
0
18 Oct 2023
Distributional Soft Actor-Critic with Three Refinements
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Jingliang Duan
Wenxuan Wang
Liming Xiao
Jiaxin Gao
Shengbo Eben Li
Chang Liu
Ya-Qin Zhang
Bo Cheng
Keqiang Li
OODD
OffRL
363
4
0
09 Oct 2023
Provably Convergent Policy Optimization via Metric-aware Trust Region Methods
Jun Song
Niao He
Lijun Ding
Chaoyue Zhao
268
6
0
25 Jun 2023
Towards Optimal Pricing of Demand Response -- A Nonparametric Constrained Policy Optimization Approach
IEEE Power & Energy Society General Meeting (PESGM), 2023
Jun Song
Chaoyue Zhao
OffRL
90
1
0
24 Jun 2023
Representation-Driven Reinforcement Learning
International Conference on Machine Learning (ICML), 2023
Ofir Nabati
Guy Tennenholtz
Shie Mannor
356
3
0
31 May 2023
Latent-Conditioned Policy Gradient for Multi-Objective Deep Reinforcement Learning
International Conference on Artificial Neural Networks (ICANN), 2023
T. Kanazawa
Chetan Gupta
339
0
0
15 Mar 2023
Reinforcement Learning with History-Dependent Dynamic Contexts
International Conference on Machine Learning (ICML), 2023
Guy Tennenholtz
Nadav Merlis
Lior Shani
Martin Mladenov
Craig Boutilier
AI4CE
297
13
0
04 Feb 2023
Distillation Policy Optimization
Jianfei Ma
OffRL
621
1
0
01 Feb 2023
Coordinate Ascent for Off-Policy RL with Global Convergence Guarantees
International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Hsin-En Su
Yen-Ju Chen
Ping-Chun Hsieh
Xi Liu
OffRL
282
1
0
10 Dec 2022
Decision-making with Speculative Opponent Models
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022
Jing-rong Sun
Shuo Chen
Cong Zhang
Yining Ma
Jie Zhang
353
2
0
22 Nov 2022
Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping
Hao Sun
Lei Han
Rui Yang
Xiaoteng Ma
Jian Guo
Bolei Zhou
OffRL
OnRL
232
12
0
15 Sep 2022
Automated Reinforcement Learning: An Overview
Reza Refaei Afshar
Yingqian Zhang
Joaquin Vanschoren
U. Kaymak
Yaoxin Wu
Wen Song
OffRL
490
18
0
13 Jan 2022
Continuous Control with Action Quantization from Demonstrations
Robert Dadashi
Léonard Hussenot
Damien Vincent
Sertan Girgin
Anton Raichuk
Matthieu Geist
Olivier Pietquin
OffRL
219
32
0
19 Oct 2021
Maximum Entropy Reinforcement Learning with Mixture Policies
Nir Baram
Guy Tennenholtz
Shie Mannor
157
5
0
18 Mar 2021
A Study of Policy Gradient on a Class of Exactly Solvable Models
Gavin McCracken
Colin Daniels
Rosie Zhao
Anna M. Brandenberger
Prakash Panangaden
Doina Precup
182
0
0
03 Nov 2020
Learning to Represent Action Values as a Hypergraph on the Action Vertices
International Conference on Learning Representations (ICLR), 2020
Arash Tavakoli
Mehdi Fatemi
Petar Kormushev
230
25
0
28 Oct 2020
Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time Systems with Lipschitz Continuous Controls
Journal of Machine Learning Research (JMLR), 2020
Jeongho Kim
Jaeuk Shin
Insoon Yang
259
41
0
27 Oct 2020
Implicit Distributional Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2020
Yuguang Yue
Zhendong Wang
Mingyuan Zhou
OffRL
216
17
0
13 Jul 2020
Optimistic Distributionally Robust Policy Optimization
Jun Song
Chaoyue Zhao
205
13
0
14 Jun 2020
Non-local Policy Optimization via Diversity-regularized Collaborative Exploration
Zhenghao Peng
Hao Sun
Bolei Zhou
275
20
0
14 Jun 2020
Zeroth-Order Supervised Policy Improvement
Hao Sun
Ziping Xu
Yuhang Song
Meng Fang
Jiechao Xiong
Bo Dai
Bolei Zhou
OffRL
318
10
0
11 Jun 2020
Novel Policy Seeking with Constrained Optimization
Hao Sun
Zhenghao Peng
Bo Dai
Jian Guo
Dahua Lin
Bolei Zhou
408
15
0
21 May 2020
Convergence of Q-value in case of Gaussian rewards
Konatsu Miyamoto
Masaya Suzuki
Yuma Kigami
Kodai Satake
141
1
0
07 Mar 2020
Sample-based Distributional Policy Gradient
Conference on Learning for Dynamics & Control (L4DC), 2020
Rahul Singh
Keuntaek Lee
Yongxin Chen
176
21
0
08 Jan 2020