ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.08059
  4. Cited By
Fast Rates for Maximum Entropy Exploration
v1v2 (latest)

Fast Rates for Maximum Entropy Exploration

International Conference on Machine Learning (ICML), 2023
14 March 2023
D. Tiapkin
Denis Belomestny
Daniele Calandriello
Eric Moulines
Rémi Munos
A. Naumov
Pierre Perrault
Yunhao Tang
Michal Valko
Pierre Menard
ArXiv (abs)PDFHTMLGithub (3★)

Papers citing "Fast Rates for Maximum Entropy Exploration"

16 / 16 papers shown
UAV-ON: A Benchmark for Open-World Object Goal Navigation with Aerial Agents
UAV-ON: A Benchmark for Open-World Object Goal Navigation with Aerial Agents
Jianqiang Xiao
Yuexuan Sun
Yixin Shao
Boxi Gan
Rongqiang Liu
Yanjing Wu
Weili Gua
Xiang Deng
372
0
0
01 Aug 2025
The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning
The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning
Shivam Agarwal
Zimin Zhang
Lifan Yuan
Jiawei Han
Yuan Yao
565
125
0
21 May 2025
Enhancing Diversity in Parallel Agents: A Maximum State Entropy Exploration Story
Enhancing Diversity in Parallel Agents: A Maximum State Entropy Exploration Story
Vincenzo De Paola
Riccardo Zamboni
Mirco Mutti
Marcello Restelli
523
3
0
02 May 2025
Can RLHF be More Efficient with Imperfect Reward Models? A Policy Coverage Perspective
Can RLHF be More Efficient with Imperfect Reward Models? A Policy Coverage Perspective
Jiawei Huang
Bingcong Li
Christoph Dann
Niao He
OffRL
687
4
0
26 Feb 2025
Federated UCBVI: Communication-Efficient Federated Regret Minimization
  with Heterogeneous Agents
Federated UCBVI: Communication-Efficient Federated Regret Minimization with Heterogeneous AgentsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Safwan Labbi
D. Tiapkin
Lorenzo Mancini
Paul Mangold
Eric Moulines
FedML
322
5
0
30 Oct 2024
Robot Policy Learning with Temporal Optimal Transport Reward
Robot Policy Learning with Temporal Optimal Transport RewardNeural Information Processing Systems (NeurIPS), 2024
Yuwei Fu
Haichao Zhang
Di Wu
Wei Xu
Benoit Boulet
OffRL
281
6
0
29 Oct 2024
Optimizing Backward Policies in GFlowNets via Trajectory Likelihood Maximization
Optimizing Backward Policies in GFlowNets via Trajectory Likelihood MaximizationInternational Conference on Learning Representations (ICLR), 2024
Timofei Gritsaev
Nikita Morozov
S. Samsonov
D. Tiapkin
331
7
0
20 Oct 2024
Maximum Entropy On-Policy Actor-Critic via Entropy Advantage Estimation
Maximum Entropy On-Policy Actor-Critic via Entropy Advantage Estimation
Jean Seong Bjorn Choe
Jong-Kook Kim
256
5
0
25 Jul 2024
The Limits of Pure Exploration in POMDPs: When the Observation Entropy
  is Enough
The Limits of Pure Exploration in POMDPs: When the Observation Entropy is Enough
Riccardo Zamboni
Duilio Cirino
Marcello Restelli
Mirco Mutti
358
7
0
18 Jun 2024
How to Explore with Belief: State Entropy Maximization in POMDPs
How to Explore with Belief: State Entropy Maximization in POMDPs
Riccardo Zamboni
Duilio Cirino
Marcello Restelli
Mirco Mutti
287
6
0
04 Jun 2024
Exploratory Preference Optimization: Harnessing Implicit
  Q*-Approximation for Sample-Efficient RLHF
Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF
Tengyang Xie
Dylan J. Foster
Akshay Krishnamurthy
Corby Rosset
Ahmed Hassan Awadallah
Alexander Rakhlin
310
87
0
31 May 2024
Surprise-Adaptive Intrinsic Motivation for Unsupervised Reinforcement
  Learning
Surprise-Adaptive Intrinsic Motivation for Unsupervised Reinforcement Learning
Adriana Hugessen
Roger Creus Castanyer
Faisal Mohamed
Glen Berseth
212
2
0
27 May 2024
Generalizing Machine Learning Evaluation through the Integration of
  Shannon Entropy and Rough Set Theory
Generalizing Machine Learning Evaluation through the Integration of Shannon Entropy and Rough Set Theory
Olga Cherednichenko
Dmytro Chernyshov
Dmytro Sytnikov
Polina Sytnikova
191
1
0
18 Apr 2024
Probabilistic Inference in Reinforcement Learning Done Right
Probabilistic Inference in Reinforcement Learning Done RightNeural Information Processing Systems (NeurIPS), 2023
Jean Tarbouriech
Tor Lattimore
Brendan O'Donoghue
BDLOffRL
370
11
0
22 Nov 2023
Generative Flow Networks as Entropy-Regularized RL
Generative Flow Networks as Entropy-Regularized RL
D. Tiapkin
Nikita Morozov
Alexey Naumov
Dmitry Vetrov
408
58
0
19 Oct 2023
Minimax Optimal Q Learning with Nearest Neighbors
Minimax Optimal Q Learning with Nearest NeighborsIEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2023
Puning Zhao
Lifeng Lai
OffRL
324
16
0
03 Aug 2023
1
Page 1 of 1