v1v2 (latest)

Fast Rates for Maximum Entropy Exploration

International Conference on Machine Learning (ICML), 2023

14 March 2023

Daniele Calandriello

Pierre Menard

ArXiv (abs)PDF HTML Github (3★)

Papers citing "Fast Rates for Maximum Entropy Exploration"

16 / 16 papers shown

UAV-ON: A Benchmark for Open-World Object Goal Navigation with Aerial Agents

372

01 Aug 2025

The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning

565

125

21 May 2025

Enhancing Diversity in Parallel Agents: A Maximum State Entropy Exploration Story

523

02 May 2025

Can RLHF be More Efficient with Imperfect Reward Models? A Policy Coverage Perspective

687

26 Feb 2025

Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous AgentsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024

322

30 Oct 2024

Robot Policy Learning with Temporal Optimal Transport RewardNeural Information Processing Systems (NeurIPS), 2024

Haichao Zhang

281

29 Oct 2024

Optimizing Backward Policies in GFlowNets via Trajectory Likelihood MaximizationInternational Conference on Learning Representations (ICLR), 2024

331

20 Oct 2024

Maximum Entropy On-Policy Actor-Critic via Entropy Advantage Estimation

Jean Seong Bjorn Choe

Jong-Kook Kim

256

25 Jul 2024

The Limits of Pure Exploration in POMDPs: When the Observation Entropy is Enough

358

18 Jun 2024

How to Explore with Belief: State Entropy Maximization in POMDPs

287

04 Jun 2024

Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF

Ahmed Hassan Awadallah

Alexander Rakhlin

310

31 May 2024

Surprise-Adaptive Intrinsic Motivation for Unsupervised Reinforcement Learning

Adriana Hugessen

Roger Creus Castanyer

Faisal Mohamed

Glen Berseth

212

27 May 2024

Generalizing Machine Learning Evaluation through the Integration of Shannon Entropy and Rough Set Theory

191

18 Apr 2024

Probabilistic Inference in Reinforcement Learning Done RightNeural Information Processing Systems (NeurIPS), 2023

370

22 Nov 2023

Generative Flow Networks as Entropy-Regularized RL

408

19 Oct 2023

Minimax Optimal Q Learning with Nearest NeighborsIEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2023

Puning Zhao

Lifeng Lai

OffRL

324

03 Aug 2023