Clipped Action Policy Gradient

21 February 2018
Yasuhiro Fujita
S. Maeda
    OffRL

Papers citing "Clipped Action Policy Gradient"

24 / 24 papers shown
Guided Reinforcement Learning for Omnidirectional 3D Jumping in Quadruped Robots
Riccardo Bussola
Michele Focchi
Giulio Turrisi
Claudio Semini
Luigi Palopoli
437
2
0
22 Jul 2025
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024
Wenjia Meng
Qian Zheng
Long Yang
Yilong Yin
Gang Pan
OffRL
250
0
0
04 May 2024
KnowGPT: Knowledge Graph based Prompting for Large Language Models
Qinggang Zhang
Hao-Heng Chen
Hao Chen
Daochen Zha
Zailiang Yu
Xiao Huang
KELM RALM
489
44
0
11 Dec 2023
Handling Cost and Constraints with Off-Policy Deep Reinforcement Learning
Jared Markowitz
Jesse Silverberg
Gary Collins
OffRL
237
0
0
30 Nov 2023
Clipped-Objective Policy Gradients for Pessimistic Policy Optimization
Jared Markowitz
Edward W. Staley
OffRL
281
4
0
10 Nov 2023
Policy Gradient Algorithms Implicitly Optimize by Continuation
Adrien Bolland
Gilles Louppe
D. Ernst
325
5
0
11 May 2023
Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for Robotics Control with Action Constraints
IEEE Robotics and Automation Letters (RA-L), 2023
Kazumi Kasaura
Shuwa Miura
Tadashi Kozuno
Ryo Yonetani
Kenta Hoshino
Y. Hosoe
198
21
0
18 Apr 2023
Distillation Policy Optimization
Jianfei Ma
OffRL
618
1
0
01 Feb 2023
A Risk-Sensitive Approach to Policy Optimization
AAAI Conference on Artificial Intelligence (AAAI), 2022
Jared Markowitz
Ryan W. Gardner
Ashley J. Llorens
R. Arora
I-J. Wang
OffRL
311
10
0
19 Aug 2022
Remember and Forget Experience Replay for Multi-Agent Reinforcement Learning
Pascal Weber
Daniel Wälchli
Mustafa Zeqiri
Petros Koumoutsakos
CLL OffRL
253
9
0
24 Mar 2022
Provably Efficient Convergence of Primal-Dual Actor-Critic with Nonlinear Function Approximation
Adaptive Agents and Multi-Agent Systems (AAMAS), 2022
Jing Dong
Li Shen
Ying Xu
Baoxiang Wang
253
1
0
28 Feb 2022
Pseudo-Labeled Auto-Curriculum Learning for Semi-Supervised Keypoint Localization
International Conference on Learning Representations (ICLR), 2022
Can Wang
Sheng Jin
Yingda Guan
Wentao Liu
Chao Qian
Ping Luo
Wanli Ouyang
241
17
0
21 Jan 2022
Explaining Off-Policy Actor-Critic From A Bias-Variance Perspective
Ting-Han Fan
Peter J. Ramadge
CML FAtt OffRL
254
2
0
06 Oct 2021
Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement Learning via Frank-Wolfe Policy Optimization
Conference on Uncertainty in Artificial Intelligence (UAI), 2021
Jyun-Li Lin
Wei-Ting Hung
Shangtong Yang
Ping-Chun Hsieh
Xi Liu
306
21
0
22 Feb 2021
Factored Policy Gradients: Leveraging Structure for Efficient Learning in MOMDPs
Neural Information Processing Systems (NeurIPS), 2021
Thomas Spooner
N. Vadori
Sumitra Ganesh
235
8
0
20 Feb 2021
Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Florian E. Dorner
185
13
0
09 Feb 2021
A Contraction Approach to Model-based Reinforcement Learning
International Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Ting-Han Fan
Peter J. Ramadge
OffRL
174
2
0
18 Sep 2020
SuperSuit: Simple Microwrappers for Reinforcement Learning Environments
J. K. Terry
Benjamin Black
Ananth Hari
153
26
0
17 Aug 2020
Action sequencing using visual permutations
Michael G. Burke
Kartic Subr
S. Ramamoorthy
LRM
261
4
0
03 Aug 2020
A unified view of likelihood ratio and reparameterization gradients and an optimal importance sampling scheme
Paavo Parmas
Masashi Sugiyama
166
3
0
14 Oct 2019
Striving for Simplicity and Performance in Off-Policy DRL: Output Normalization and Non-Uniform Sampling
Che Wang
Yanqiu Wu
Q. Vuong
George Andriopoulos
278
6
0
05 Oct 2019
Generalization in Transfer Learning
Robotica (Cambridge. Print) (RCP), 2019
S. E. Ada
Emre Ugur
H. L. Akin
205
24
0
03 Sep 2019
Augment-Reinforce-Merge Policy Gradient for Binary Stochastic Policy
Yunhao Tang
Mingzhang Yin
Mingyuan Zhou
123
0
0
13 Mar 2019
Understanding the impact of entropy on policy optimization
Zafarali Ahmed
Nicolas Le Roux
Mohammad Norouzi
Dale Schuurmans
410
302
0
27 Nov 2018