ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1912.05128
  4. Cited By
Marginalized State Distribution Entropy Regularization in Policy
  Optimization

Marginalized State Distribution Entropy Regularization in Policy Optimization

11 December 2019
Riashat Islam
Zafarali Ahmed
Doina Precup
ArXiv (abs)PDFHTML

Papers citing "Marginalized State Distribution Entropy Regularization in Policy Optimization"

10 / 10 papers shown
Polychromic Objectives for Reinforcement Learning
Polychromic Objectives for Reinforcement Learning
Jubayer Ibn Hamid
Ifdita Hasan Orney
Ellen Xu
Chelsea Finn
Dorsa Sadigh
OffRL
110
1
0
29 Sep 2025
Behind the Myth of Exploration in Policy Gradients
Behind the Myth of Exploration in Policy Gradients
Adrien Bolland
Gaspard Lambrechts
Damien Ernst
359
1
0
31 Jan 2024
Variational Curriculum Reinforcement Learning for Unsupervised Discovery
  of Skills
Variational Curriculum Reinforcement Learning for Unsupervised Discovery of SkillsInternational Conference on Machine Learning (ICML), 2023
Seongun Kim
Kyowoon Lee
Jaesik Choi
SSLDRL
284
16
0
30 Oct 2023
Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy
  Actor-Critic
Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-CriticInternational Conference on Machine Learning (ICML), 2023
Tianying Ji
Yuping Luo
Gang Hua
Xianyuan Zhan
Jianwei Zhang
Huazhe Xu
OffRLOnRL
408
21
0
05 Jun 2023
Hierarchical Reinforcement Learning in Complex 3D Environments
Hierarchical Reinforcement Learning in Complex 3D Environments
Bernardo Avila-Pires
Feryal M. P. Behbahani
Hubert Soyer
Kyriacos Nikiforou
Thomas Keck
Satinder Singh
OffRL
174
0
0
28 Feb 2023
Learning GFlowNets from partial episodes for improved convergence and
  stability
Learning GFlowNets from partial episodes for improved convergence and stabilityInternational Conference on Machine Learning (ICML), 2022
Kanika Madan
Jarrid Rector-Brooks
Maksym Korablyov
Emmanuel Bengio
Moksh Jain
A. Nica
Tom Bosc
Yoshua Bengio
Nikolay Malkin
240
120
0
26 Sep 2022
MADE: Exploration via Maximizing Deviation from Explored Regions
MADE: Exploration via Maximizing Deviation from Explored RegionsNeural Information Processing Systems (NeurIPS), 2021
Tianjun Zhang
Paria Rashidinejad
Jiantao Jiao
Yuandong Tian
Joseph E. Gonzalez
Stuart J. Russell
OffRL
226
47
0
18 Jun 2021
Geometric Entropic Exploration
Geometric Entropic Exploration
Z. Guo
M. G. Azar
Alaa Saade
S. Thakoor
Bilal Piot
Bernardo Avila-Pires
Michal Valko
Thomas Mesnard
Tor Lattimore
Rémi Munos
229
35
0
06 Jan 2021
Learning Compositional Neural Programs for Continuous Control
Learning Compositional Neural Programs for Continuous Control
Thomas Pierrot
Nicolas Perrin
Feryal M. P. Behbahani
Alexandre Laterre
Olivier Sigaud
Karim Beguir
Nando de Freitas
CLL
256
4
0
27 Jul 2020
Diversity Policy Gradient for Sample Efficient Quality-Diversity
  Optimization
Diversity Policy Gradient for Sample Efficient Quality-Diversity Optimization
Thomas Pierrot
Valentin Macé
Félix Chalumeau
Arthur Flajolet
Geoffrey Cideron
Karim Beguir
Antoine Cully
Olivier Sigaud
Nicolas Perrin-Gilbert
319
74
0
15 Jun 2020
1