Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2002.02829
Cited By
Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic with Advantage Weighted Mixture Policy(SAC-AWMP)
7 February 2020
Zhimin Hou
Kuangen Zhang
Yi Wan
Dongyu Li
Chenglong Fu
Haoyong Yu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic with Advantage Weighted Mixture Policy(SAC-AWMP)"
6 / 6 papers shown
A short methodological review on social robot navigation benchmarking
Pranup Chhetri
Alejandro Torrejon
Sergio Eslava
Luis J. Manso
170
0
0
25 Oct 2025
Evolution Guided Generative Flow Networks
Zarif Ikram
Ling Pan
Dianbo Liu
490
4
0
03 Feb 2024
Order-Preserving GFlowNets
International Conference on Learning Representations (ICLR), 2023
Yihang Chen
Lukas Mauch
543
16
0
30 Sep 2023
Multi-Agent Cooperation via Unsupervised Learning of Joint Intentions
Shanqi Liu
Weiwei Liu
Wenzhou Chen
Guanzhong Tian
Y. Liu
215
0
0
05 Jul 2023
Revisiting Discrete Soft Actor-Critic
Haibin Zhou
Zichuan Lin
Junyou Li
Qiang Fu
Wei Yang
Deheng Ye
461
19
0
21 Sep 2022
How does the structure embedded in learning policy affect learning quadruped locomotion?
Kuangen Zhang
Jongwoo Lee
Zhimin Hou
Clarence W. de Silva
Chenglong Fu
N. Hogan
284
1
0
29 Aug 2020
1
Page 1 of 1