ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.11199
  4. Cited By
Active Exploration in Markov Decision Processes

Active Exploration in Markov Decision Processes

28 February 2019
Jean Tarbouriech
A. Lazaric
ArXiv (abs)PDFHTML

Papers citing "Active Exploration in Markov Decision Processes"

32 / 32 papers shown
Title
Enhancing Diversity in Parallel Agents: A Maximum State Entropy Exploration Story
Enhancing Diversity in Parallel Agents: A Maximum State Entropy Exploration Story
Vincenzo De Paola
Riccardo Zamboni
Mirco Mutti
Marcello Restelli
116
0
0
02 May 2025
Geometric Active Exploration in Markov Decision Processes: the Benefit
  of Abstraction
Geometric Active Exploration in Markov Decision Processes: the Benefit of Abstraction
Ric De Santi
Federico Arangath Joseph
Noah Liniger
Mirco Mutti
Andreas Krause
AI4CE
93
2
0
18 Jul 2024
Global Reinforcement Learning: Beyond Linear and Convex Rewards via
  Submodular Semi-gradient Methods
Global Reinforcement Learning: Beyond Linear and Convex Rewards via Submodular Semi-gradient Methods
Ric De Santi
Manish Prajapat
Andreas Krause
91
5
0
13 Jul 2024
How to Explore with Belief: State Entropy Maximization in POMDPs
How to Explore with Belief: State Entropy Maximization in POMDPs
Riccardo Zamboni
Duilio Cirino
Marcello Restelli
Mirco Mutti
92
4
0
04 Jun 2024
Probabilistic Inference in Reinforcement Learning Done Right
Probabilistic Inference in Reinforcement Learning Done Right
Jean Tarbouriech
Tor Lattimore
Brendan O'Donoghue
BDLOffRL
79
4
0
22 Nov 2023
Submodular Reinforcement Learning
Submodular Reinforcement Learning
Manish Prajapat
Mojmír Mutný
Melanie Zeilinger
Andreas Krause
OffRL
88
14
0
25 Jul 2023
Maximum State Entropy Exploration using Predecessor and Successor
  Representations
Maximum State Entropy Exploration using Predecessor and Successor Representations
A. Jain
Lucas Lehnert
Irina Rish
Glen Berseth
85
16
0
26 Jun 2023
Cancellation-Free Regret Bounds for Lagrangian Approaches in Constrained
  Markov Decision Processes
Cancellation-Free Regret Bounds for Lagrangian Approaches in Constrained Markov Decision Processes
A. Müller
Pragnya Alatur
Giorgia Ramponi
Niao He
81
6
0
12 Jun 2023
Pretraining in Deep Reinforcement Learning: A Survey
Pretraining in Deep Reinforcement Learning: A Survey
Zhihui Xie
Zichuan Lin
Junyou Li
Shuai Li
Deheng Ye
OffRLOnRLAI4CE
81
23
0
08 Nov 2022
Active Exploration via Experiment Design in Markov Chains
Active Exploration via Experiment Design in Markov Chains
Mojmír Mutný
Tadeusz Janik
Andreas Krause
93
16
0
29 Jun 2022
BYOL-Explore: Exploration by Bootstrapped Prediction
BYOL-Explore: Exploration by Bootstrapped Prediction
Z. Guo
S. Thakoor
Miruna Pislar
Bernardo Avila-Pires
Florent Altché
...
Yunhao Tang
Michal Valko
Rémi Munos
M. G. Azar
Bilal Piot
106
73
0
16 Jun 2022
The Importance of Non-Markovianity in Maximum State Entropy Exploration
The Importance of Non-Markovianity in Maximum State Entropy Exploration
Mirco Mutti
Ric De Santi
Marcello Restelli
87
33
0
07 Feb 2022
Challenging Common Assumptions in Convex Reinforcement Learning
Challenging Common Assumptions in Convex Reinforcement Learning
Mirco Mutti
Ric De Santi
Piersilvio De Bartolomeis
Marcello Restelli
OffRL
78
23
0
03 Feb 2022
Unsupervised Reinforcement Learning in Multiple Environments
Unsupervised Reinforcement Learning in Multiple Environments
Mirco Mutti
Mattia Mancassola
Marcello Restelli
OffRL
71
26
0
16 Dec 2021
Adaptive Multi-Goal Exploration
Adaptive Multi-Goal Exploration
Jean Tarbouriech
O. D. Domingues
Pierre Ménard
Matteo Pirotta
Michal Valko
A. Lazaric
123
3
0
23 Nov 2021
Learning Altruistic Behaviours in Reinforcement Learning without
  External Rewards
Learning Altruistic Behaviours in Reinforcement Learning without External Rewards
Tim Franzmeyer
Mateusz Malinowski
João F. Henriques
42
8
0
20 Jul 2021
Markov Decision Processes with Long-Term Average Constraints
Markov Decision Processes with Long-Term Average Constraints
Mridul Agarwal
Qinbo Bai
Vaneet Aggarwal
52
6
0
12 Jun 2021
Navigating to the Best Policy in Markov Decision Processes
Navigating to the Best Policy in Markov Decision Processes
Aymen Al Marjani
Aurélien Garivier
Alexandre Proutiere
94
25
0
05 Jun 2021
MARL with General Utilities via Decentralized Shadow Reward Actor-Critic
MARL with General Utilities via Decentralized Shadow Reward Actor-Critic
Junyu Zhang
Amrit Singh Bedi
Mengdi Wang
Alec Koppel
50
8
0
29 May 2021
Improved Sample Complexity for Incremental Autonomous Exploration in
  MDPs
Improved Sample Complexity for Incremental Autonomous Exploration in MDPs
Jean Tarbouriech
Matteo Pirotta
Michal Valko
A. Lazaric
48
13
0
29 Dec 2020
Adaptive Sampling for Estimating Distributions: A Bayesian Upper
  Confidence Bound Approach
Adaptive Sampling for Estimating Distributions: A Bayesian Upper Confidence Bound Approach
D. Kartik
N. Sood
U. Mitra
T. Javidi
21
0
0
08 Dec 2020
A Provably Efficient Sample Collection Strategy for Reinforcement
  Learning
A Provably Efficient Sample Collection Strategy for Reinforcement Learning
Jean Tarbouriech
Matteo Pirotta
Michal Valko
A. Lazaric
OffRL
89
16
0
13 Jul 2020
Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State
  Entropy Estimate
Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State Entropy Estimate
Mirco Mutti
Lorenzo Pratissoli
Marcello Restelli
73
19
0
09 Jul 2020
Constrained episodic reinforcement learning in concave-convex and
  knapsack settings
Constrained episodic reinforcement learning in concave-convex and knapsack settings
Kianté Brantley
Miroslav Dudík
Thodoris Lykouris
Sobhan Miryoosefi
Max Simchowitz
Aleksandrs Slivkins
Wen Sun
OffRL
103
52
0
09 Jun 2020
Scalable First-Order Methods for Robust MDPs
Scalable First-Order Methods for Robust MDPs
Julien Grand-Clément
Christian Kroer
106
28
0
11 May 2020
Active Model Estimation in Markov Decision Processes
Active Model Estimation in Markov Decision Processes
Jean Tarbouriech
S. Shekhar
Matteo Pirotta
Mohammad Ghavamzadeh
A. Lazaric
83
25
0
06 Mar 2020
Reward-Free Exploration for Reinforcement Learning
Reward-Free Exploration for Reinforcement Learning
Chi Jin
A. Krishnamurthy
Max Simchowitz
Tiancheng Yu
OffRL
176
197
0
07 Feb 2020
Adaptive Sampling for Estimating Multiple Probability Distributions
Adaptive Sampling for Estimating Multiple Probability Distributions
S. Shekhar
T. Javidi
Mohammad Ghavamzadeh
112
1
0
28 Oct 2019
An Intrinsically-Motivated Approach for Learning Highly Exploring and
  Fast Mixing Policies
An Intrinsically-Motivated Approach for Learning Highly Exploring and Fast Mixing Policies
Mirco Mutti
Marcello Restelli
31
25
0
10 Jul 2019
Learning Multiple Markov Chains via Adaptive Allocation
Learning Multiple Markov Chains via Adaptive Allocation
M. S. Talebi
Odalric-Ambrym Maillard
50
1
0
27 May 2019
Exploration-Exploitation Trade-off in Reinforcement Learning on Online
  Markov Decision Processes with Global Concave Rewards
Exploration-Exploitation Trade-off in Reinforcement Learning on Online Markov Decision Processes with Global Concave Rewards
Wang Chi Cheung
49
18
0
15 May 2019
A Successive-Elimination Approach to Adaptive Robotic Sensing
A Successive-Elimination Approach to Adaptive Robotic Sensing
Esther Rolf
David Fridovich-Keil
Max Simchowitz
Benjamin Recht
Claire Tomlin
83
8
0
27 Sep 2018
1