ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1905.09855
  4. Cited By
Distributional Policy Optimization: An Alternative Approach for
  Continuous Control
v1v2 (latest)

Distributional Policy Optimization: An Alternative Approach for Continuous Control

Neural Information Processing Systems (NeurIPS), 2019
23 May 2019
Chen Tessler
Guy Tennenholtz
Shie Mannor
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Distributional Policy Optimization: An Alternative Approach for Continuous Control"

27 / 27 papers shown
Adaptive Nesterov Accelerated Distributional Deep Hedging for Efficient Volatility Risk Management
Adaptive Nesterov Accelerated Distributional Deep Hedging for Efficient Volatility Risk Management
Lei Zhao
Lin Cai
Wu-Sheng Lu
246
0
0
25 Feb 2025
Learning in complex action spaces without policy gradients
Learning in complex action spaces without policy gradients
Arash Tavakoli
Sina Ghiassian
Nemanja Rakićević
OffRL
283
0
0
08 Oct 2024
Predicting Long-Term Human Behaviors in Discrete Representations via
  Physics-Guided Diffusion
Predicting Long-Term Human Behaviors in Discrete Representations via Physics-Guided Diffusion
Zhitian Zhang
Anjian Li
Angelica Lim
Mo Chen
415
5
0
29 May 2024
Action-Quantized Offline Reinforcement Learning for Robotic Skill
  Learning
Action-Quantized Offline Reinforcement Learning for Robotic Skill LearningConference on Robot Learning (CoRL), 2023
Jianlan Luo
Perry Dong
Jeffrey Wu
Aviral Kumar
Xinyang Geng
Sergey Levine
OffRL
337
40
0
18 Oct 2023
Distributional Soft Actor-Critic with Three Refinements
Distributional Soft Actor-Critic with Three RefinementsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Jingliang Duan
Wenxuan Wang
Liming Xiao
Jiaxin Gao
Shengbo Eben Li
Chang Liu
Ya-Qin Zhang
Bo Cheng
Keqiang Li
OODDOffRL
363
4
0
09 Oct 2023
Provably Convergent Policy Optimization via Metric-aware Trust Region
  Methods
Provably Convergent Policy Optimization via Metric-aware Trust Region Methods
Jun Song
Niao He
Lijun Ding
Chaoyue Zhao
268
6
0
25 Jun 2023
Towards Optimal Pricing of Demand Response -- A Nonparametric
  Constrained Policy Optimization Approach
Towards Optimal Pricing of Demand Response -- A Nonparametric Constrained Policy Optimization ApproachIEEE Power & Energy Society General Meeting (PESGM), 2023
Jun Song
Chaoyue Zhao
OffRL
90
1
0
24 Jun 2023
Representation-Driven Reinforcement Learning
Representation-Driven Reinforcement LearningInternational Conference on Machine Learning (ICML), 2023
Ofir Nabati
Guy Tennenholtz
Shie Mannor
356
3
0
31 May 2023
Latent-Conditioned Policy Gradient for Multi-Objective Deep
  Reinforcement Learning
Latent-Conditioned Policy Gradient for Multi-Objective Deep Reinforcement LearningInternational Conference on Artificial Neural Networks (ICANN), 2023
T. Kanazawa
Chetan Gupta
339
0
0
15 Mar 2023
Reinforcement Learning with History-Dependent Dynamic Contexts
Reinforcement Learning with History-Dependent Dynamic ContextsInternational Conference on Machine Learning (ICML), 2023
Guy Tennenholtz
Nadav Merlis
Lior Shani
Martin Mladenov
Craig Boutilier
AI4CE
297
13
0
04 Feb 2023
Distillation Policy Optimization
Distillation Policy Optimization
Jianfei Ma
OffRL
621
1
0
01 Feb 2023
Coordinate Ascent for Off-Policy RL with Global Convergence Guarantees
Coordinate Ascent for Off-Policy RL with Global Convergence GuaranteesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Hsin-En Su
Yen-Ju Chen
Ping-Chun Hsieh
Xi Liu
OffRL
282
1
0
10 Dec 2022
Decision-making with Speculative Opponent Models
Decision-making with Speculative Opponent ModelsIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022
Jing-rong Sun
Shuo Chen
Cong Zhang
Yining Ma
Jie Zhang
353
2
0
22 Nov 2022
Optimistic Curiosity Exploration and Conservative Exploitation with
  Linear Reward Shaping
Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping
Hao Sun
Lei Han
Rui Yang
Xiaoteng Ma
Jian Guo
Bolei Zhou
OffRLOnRL
232
12
0
15 Sep 2022
Automated Reinforcement Learning: An Overview
Automated Reinforcement Learning: An Overview
Reza Refaei Afshar
Yingqian Zhang
Joaquin Vanschoren
U. Kaymak
Yaoxin Wu
Wen Song
Yingqian Zhang
OffRL
490
18
0
13 Jan 2022
Continuous Control with Action Quantization from Demonstrations
Continuous Control with Action Quantization from Demonstrations
Robert Dadashi
Léonard Hussenot
Damien Vincent
Sertan Girgin
Anton Raichuk
Matthieu Geist
Olivier Pietquin
OffRL
219
32
0
19 Oct 2021
Maximum Entropy Reinforcement Learning with Mixture Policies
Maximum Entropy Reinforcement Learning with Mixture Policies
Nir Baram
Guy Tennenholtz
Shie Mannor
157
5
0
18 Mar 2021
A Study of Policy Gradient on a Class of Exactly Solvable Models
A Study of Policy Gradient on a Class of Exactly Solvable Models
Gavin McCracken
Colin Daniels
Rosie Zhao
Anna M. Brandenberger
Prakash Panangaden
Doina Precup
182
0
0
03 Nov 2020
Learning to Represent Action Values as a Hypergraph on the Action
  Vertices
Learning to Represent Action Values as a Hypergraph on the Action VerticesInternational Conference on Learning Representations (ICLR), 2020
Arash Tavakoli
Mehdi Fatemi
Petar Kormushev
230
25
0
28 Oct 2020
Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time
  Systems with Lipschitz Continuous Controls
Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time Systems with Lipschitz Continuous ControlsJournal of machine learning research (JMLR), 2020
Jeongho Kim
Jaeuk Shin
Insoon Yang
259
41
0
27 Oct 2020
Implicit Distributional Reinforcement Learning
Implicit Distributional Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2020
Yuguang Yue
Zhendong Wang
Mingyuan Zhou
OffRL
216
17
0
13 Jul 2020
Optimistic Distributionally Robust Policy Optimization
Optimistic Distributionally Robust Policy Optimization
Jun Song
Chaoyue Zhao
205
13
0
14 Jun 2020
Non-local Policy Optimization via Diversity-regularized Collaborative
  Exploration
Non-local Policy Optimization via Diversity-regularized Collaborative Exploration
Zhenghao Peng
Hao Sun
Bolei Zhou
275
20
0
14 Jun 2020
Zeroth-Order Supervised Policy Improvement
Zeroth-Order Supervised Policy Improvement
Hao Sun
Ziping Xu
Yuhang Song
Meng Fang
Jiechao Xiong
Bo Dai
Bolei Zhou
OffRL
318
10
0
11 Jun 2020
Novel Policy Seeking with Constrained Optimization
Novel Policy Seeking with Constrained Optimization
Hao Sun
Zhenghao Peng
Bo Dai
Jian Guo
Dahua Lin
Bolei Zhou
408
15
0
21 May 2020
Convergence of Q-value in case of Gaussian rewards
Convergence of Q-value in case of Gaussian rewards
Konatsu Miyamoto
Masaya Suzuki
Yuma Kigami
Kodai Satake
141
1
0
07 Mar 2020
Sample-based Distributional Policy Gradient
Sample-based Distributional Policy GradientConference on Learning for Dynamics & Control (L4DC), 2020
Rahul Singh
Keuntaek Lee
Yongxin Chen
176
21
0
08 Jan 2020
1
Page 1 of 1