ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2105.11366
  4. Cited By
GMAC: A Distributional Perspective on Actor-Critic Framework
v1v2 (latest)

GMAC: A Distributional Perspective on Actor-Critic Framework

International Conference on Machine Learning (ICML), 2021
24 May 2021
D. W. Nam
Younghoon Kim
Chan Y. Park
ArXiv (abs)PDFHTMLGithub

Papers citing "GMAC: A Distributional Perspective on Actor-Critic Framework"

15 / 15 papers shown
Risk-Aware Reinforcement Learning with Bandit-Based Adaptation for Quadrupedal Locomotion
Risk-Aware Reinforcement Learning with Bandit-Based Adaptation for Quadrupedal Locomotion
Yuanhong Zeng
Anushri Dixit
OffRL
133
0
0
16 Oct 2025
Combining AI Control Systems and Human Decision Support via Robustness
  and Criticality
Combining AI Control Systems and Human Decision Support via Robustness and Criticality
Walt Woods
Alexander Grushin
Simon Khan
Alvaro Velasquez
274
2
0
03 Jul 2024
Off-policy Distributional Q($λ$): Distributional RL without
  Importance Sampling
Off-policy Distributional Q(λλλ): Distributional RL without Importance Sampling
Yunhao Tang
Mark Rowland
Rémi Munos
Bernardo Avila-Pires
Will Dabney
OffRL
209
1
0
08 Feb 2024
A Robust Quantile Huber Loss With Interpretable Parameter Adjustment In
  Distributional Reinforcement Learning
A Robust Quantile Huber Loss With Interpretable Parameter Adjustment In Distributional Reinforcement LearningIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Parvin Malekzadeh
Konstantinos N. Plataniotis
Zissis Poulos
Zeyu Wang
265
6
0
04 Jan 2024
Learning Risk-Aware Quadrupedal Locomotion using Distributional
  Reinforcement Learning
Learning Risk-Aware Quadrupedal Locomotion using Distributional Reinforcement LearningIEEE International Conference on Robotics and Automation (ICRA), 2023
Lukas Schneider
Jonas Frey
Takahiro Miki
Marco Hutter
365
26
0
25 Sep 2023
Robust Quadrupedal Locomotion via Risk-Averse Policy Learning
Robust Quadrupedal Locomotion via Risk-Averse Policy LearningIEEE International Conference on Robotics and Automation (ICRA), 2023
Jiyuan Shi
Chenjia Bai
Haoran He
Lei Han
Dong Wang
Bin Zhao
Mingguo Zhao
Xiuyang Li
Xuelong Li
OffRLOOD
308
20
0
18 Aug 2023
PACER: A Fully Push-forward-based Distributional Reinforcement Learning
  Algorithm
PACER: A Fully Push-forward-based Distributional Reinforcement Learning AlgorithmNeurocomputing (Neurocomputing), 2023
Wensong Bai
Chao Zhang
Yichao Fu
Lingwei Peng
Hui Qian
Bin Dai
225
3
0
11 Jun 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control
  via Sample Multiple Reuse
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple ReuseInformation Sciences (Inf. Sci.), 2023
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
232
19
0
29 May 2023
Trust Region-Based Safe Distributional Reinforcement Learning for
  Multiple Constraints
Trust Region-Based Safe Distributional Reinforcement Learning for Multiple ConstraintsNeural Information Processing Systems (NeurIPS), 2023
Dohyeong Kim
Kyungjae Lee
Songhwai Oh
291
21
0
26 Jan 2023
Invariance to Quantile Selection in Distributional Continuous Control
Invariance to Quantile Selection in Distributional Continuous Control
Felix Grün
Muhammad Saif-ur-Rehman
Tobias Glasmachers
Ioannis Iossifidis
112
0
0
29 Dec 2022
Distributional Actor-Critic Ensemble for Uncertainty-Aware Continuous
  Control
Distributional Actor-Critic Ensemble for Uncertainty-Aware Continuous ControlIEEE International Joint Conference on Neural Network (IJCNN), 2022
T. Kanazawa
Haiyan Wang
Chetan Gupta
UQCV
310
7
0
27 Jul 2022
The Nature of Temporal Difference Errors in Multi-step Distributional
  Reinforcement Learning
The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022
Yunhao Tang
Mark Rowland
Rémi Munos
Bernardo Avila-Pires
Will Dabney
Marc G. Bellemare
OffRL
181
12
0
15 Jul 2022
Robust Reinforcement Learning with Distributional Risk-averse
  formulation
Robust Reinforcement Learning with Distributional Risk-averse formulation
Pierre Clavier
S. Allassonnière
E. L. Pennec
OOD
224
9
0
14 Jun 2022
Challenges to Solving Combinatorially Hard Long-Horizon Deep RL Tasks
Challenges to Solving Combinatorially Hard Long-Horizon Deep RL Tasks
Andrew C. Li
Pashootan Vaezipoor
Rodrigo Toro Icarte
Sheila A. McIlraith
OffRLLRM
221
5
0
03 Jun 2022
Revisiting Gaussian mixture critics in off-policy reinforcement
  learning: a sample-based approach
Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach
Bobak Shahriari
A. Abdolmaleki
Arunkumar Byravan
A. Friesen
Siqi Liu
Jost Tobias Springenberg
N. Heess
Matthew W. Hoffman
Martin Riedmiller
OffRL
251
9
0
21 Apr 2022
1
Page 1 of 1