Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2002.02829
Cited By

Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic
with Advantage Weighted Mixture Policy(SAC-AWMP)

Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic with Advantage Weighted Mixture Policy(SAC-AWMP)

7 February 2020

Yi Wan

ArXiv (abs)PDF HTML

Papers citing "Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic with Advantage Weighted Mixture Policy(SAC-AWMP)"

6 / 6 papers shown

A short methodological review on social robot navigation benchmarking

A short methodological review on social robot navigation benchmarking

Alejandro Torrejon

170

0

0

25 Oct 2025

Evolution Guided Generative Flow Networks

Evolution Guided Generative Flow Networks

490

4

0

03 Feb 2024

Order-Preserving GFlowNets

Order-Preserving GFlowNetsInternational Conference on Learning Representations (ICLR), 2023

543

16

0

30 Sep 2023

Multi-Agent Cooperation via Unsupervised Learning of Joint Intentions

Multi-Agent Cooperation via Unsupervised Learning of Joint Intentions

215

0

0

05 Jul 2023

Revisiting Discrete Soft Actor-Critic

Revisiting Discrete Soft Actor-Critic

461

19

0

21 Sep 2022

How does the structure embedded in learning policy affect learning
quadruped locomotion?

How does the structure embedded in learning policy affect learning quadruped locomotion?

Clarence W. de Silva

284

1

0

29 Aug 2020

Page 1 of 1