Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1802.07564
Cited By
v1
v2 (latest)
Clipped Action Policy Gradient
21 February 2018
Yasuhiro Fujita
S. Maeda
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Github (31★)
Papers citing
"Clipped Action Policy Gradient"
24 / 24 papers shown
Guided Reinforcement Learning for Omnidirectional 3D Jumping in Quadruped Robots
Riccardo Bussola
Michele Focchi
Giulio Turrisi
Claudio Semini
Luigi Palopoli
437
2
0
22 Jul 2025
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024
Wenjia Meng
Qian Zheng
Long Yang
Yilong Yin
Gang Pan
OffRL
250
0
0
04 May 2024
KnowGPT: Knowledge Graph based Prompting for Large Language Models
Qinggang Zhang
Hao-Heng Chen
Hao Chen
Daochen Zha
Zailiang Yu
Xiao Huang
KELM
RALM
489
44
0
11 Dec 2023
Handling Cost and Constraints with Off-Policy Deep Reinforcement Learning
Jared Markowitz
Jesse Silverberg
Gary Collins
OffRL
237
0
0
30 Nov 2023
Clipped-Objective Policy Gradients for Pessimistic Policy Optimization
Jared Markowitz
Edward W. Staley
OffRL
281
4
0
10 Nov 2023
Policy Gradient Algorithms Implicitly Optimize by Continuation
Adrien Bolland
Gilles Louppe
D. Ernst
325
5
0
11 May 2023
Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for Robotics Control with Action Constraints
IEEE Robotics and Automation Letters (RA-L), 2023
Kazumi Kasaura
Shuwa Miura
Tadashi Kozuno
Ryo Yonetani
Kenta Hoshino
Y. Hosoe
198
21
0
18 Apr 2023
Distillation Policy Optimization
Jianfei Ma
OffRL
618
1
0
01 Feb 2023
A Risk-Sensitive Approach to Policy Optimization
AAAI Conference on Artificial Intelligence (AAAI), 2022
Jared Markowitz
Ryan W. Gardner
Ashley J. Llorens
R. Arora
I-J. Wang
OffRL
311
10
0
19 Aug 2022
Remember and Forget Experience Replay for Multi-Agent Reinforcement Learning
Pascal Weber
Daniel Wälchli
Mustafa Zeqiri
Petros Koumoutsakos
CLL
OffRL
253
9
0
24 Mar 2022
Provably Efficient Convergence of Primal-Dual Actor-Critic with Nonlinear Function Approximation
Adaptive Agents and Multi-Agent Systems (AAMAS), 2022
Jing Dong
Li Shen
Ying Xu
Baoxiang Wang
253
1
0
28 Feb 2022
Pseudo-Labeled Auto-Curriculum Learning for Semi-Supervised Keypoint Localization
International Conference on Learning Representations (ICLR), 2022
Can Wang
Sheng Jin
Yingda Guan
Wentao Liu
Chao Qian
Ping Luo
Wanli Ouyang
241
17
0
21 Jan 2022
Explaining Off-Policy Actor-Critic From A Bias-Variance Perspective
Ting-Han Fan
Peter J. Ramadge
CML
FAtt
OffRL
254
2
0
06 Oct 2021
Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement Learning via Frank-Wolfe Policy Optimization
Conference on Uncertainty in Artificial Intelligence (UAI), 2021
Jyun-Li Lin
Wei-Ting Hung
Shangtong Yang
Ping-Chun Hsieh
Xi Liu
306
21
0
22 Feb 2021
Factored Policy Gradients: Leveraging Structure for Efficient Learning in MOMDPs
Neural Information Processing Systems (NeurIPS), 2021
Thomas Spooner
N. Vadori
Sumitra Ganesh
235
8
0
20 Feb 2021
Measuring Progress in Deep Reinforcement Learning Sample Efficiency
Florian E. Dorner
185
13
0
09 Feb 2021
A Contraction Approach to Model-based Reinforcement Learning
International Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Ting-Han Fan
Peter J. Ramadge
OffRL
174
2
0
18 Sep 2020
SuperSuit: Simple Microwrappers for Reinforcement Learning Environments
J. K. Terry
Benjamin Black
Ananth Hari
153
26
0
17 Aug 2020
Action sequencing using visual permutations
Michael G. Burke
Kartic Subr
S. Ramamoorthy
LRM
261
4
0
03 Aug 2020
A unified view of likelihood ratio and reparameterization gradients and an optimal importance sampling scheme
Paavo Parmas
Masashi Sugiyama
166
3
0
14 Oct 2019
Striving for Simplicity and Performance in Off-Policy DRL: Output Normalization and Non-Uniform Sampling
Che Wang
Yanqiu Wu
Q. Vuong
George Andriopoulos
278
6
0
05 Oct 2019
Generalization in Transfer Learning
Robotica (Cambridge. Print) (RCP), 2019
S. E. Ada
Emre Ugur
H. L. Akin
205
24
0
03 Sep 2019
Augment-Reinforce-Merge Policy Gradient for Binary Stochastic Policy
Yunhao Tang
Mingzhang Yin
Mingyuan Zhou
123
0
0
13 Mar 2019
Understanding the impact of entropy on policy optimization
Zafarali Ahmed
Nicolas Le Roux
Mohammad Norouzi
Dale Schuurmans
410
302
0
27 Nov 2018
1
Page 1 of 1