Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1807.00442
Cited By
Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization
2 July 2018
Xiangxiang Chu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Policy Optimization With Penalized Point Probability Distance: An Alternative To Proximal Policy Optimization"
2 / 2 papers shown
Title
Social Interpretable Reinforcement Learning
Leonardo Lucio Custode
Giovanni Iacca
OffRL
40
2
0
27 Jan 2024
Multi-Objective Reinforced Evolution in Mobile Neural Architecture Search
Xiangxiang Chu
Bo Zhang
Ruijun Xu
Hailong Ma
31
98
0
04 Jan 2019
1