Bandit Optimal Transport

11 February 2025
Lorenzo Croissant
Abstract

Despite the impressive progress in statistical Optimal Transport (OT) in recent years, there has been little interest in the study of the sequential learning of OT. Surprisingly so, as this problem is both practically motivated and a challenging extension of existing settings such as linear bandits. This article considers (for the first time) the stochastic bandit problem of learning to solve generic Kantorovich and entropic OT problems from repeated interactions when the marginals are known but the cost is unknown. We provide $\tilde{\mathcal{O}}(\sqrt{T})$ regret algorithms for both problems by extending linear bandits on Hilbert spaces. These results provide a reduction to infinite-dimensional linear bandits. To deal with the dimension, we provide a method to exploit the intrinsic regularity of the cost to learn, yielding corresponding regret bounds which interpolate between $\tilde{\mathcal{O}}(\sqrt{T})$ and $\tilde{\mathcal{O}}(T)$.
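
To make the link to linear bandits concrete, the following is a minimal sketch on a finite (discrete) toy problem, which is an assumption made here for illustration; the abstract concerns generic, possibly infinite-dimensional problems. It shows only that the Kantorovich objective is linear in the coupling and that a learner can always play a feasible coupling because the marginals are known. The feedback model (a noisy observation of the transport cost of the played coupling) and all names such as `independent_coupling` and `noisy_feedback` are hypothetical, and this is not the paper's algorithm.

```python
import random

def independent_coupling(mu, nu):
    """Product coupling mu x nu: always feasible for the known marginals."""
    return [[m * n for n in nu] for m in mu]

def transport_cost(cost, coupling):
    """Kantorovich objective <c, pi> = sum_ij c[i][j] * pi[i][j] (linear in pi)."""
    return sum(c * p
               for cost_row, pi_row in zip(cost, coupling)
               for c, p in zip(cost_row, pi_row))

def marginals(coupling):
    """Row and column sums of a coupling, to check feasibility."""
    rows = [sum(row) for row in coupling]
    cols = [sum(col) for col in zip(*coupling)]
    return rows, cols

def noisy_feedback(cost, coupling, rng, noise_std=0.1):
    """Assumed bandit feedback: the cost of the played coupling plus Gaussian noise."""
    return transport_cost(cost, coupling) + rng.gauss(0.0, noise_std)

# Toy instance: the learner knows mu and nu, but not true_cost.
mu = [0.5, 0.5]
nu = [0.25, 0.75]
true_cost = [[0.0, 1.0], [1.0, 0.0]]

pi = independent_coupling(mu, nu)   # one feasible action
rng = random.Random(0)
observation = noisy_feedback(true_cost, pi, rng)
```

Because `transport_cost` is linear in the coupling, the unknown cost plays the role of the unknown parameter vector in a linear bandit, and the set of couplings with the prescribed marginals plays the role of the action set.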

@article{croissant2025_2502.07397,
  title={Bandit Optimal Transport},
  author={Lorenzo Croissant},
  journal={arXiv preprint arXiv:2502.07397},
  year={2025}
}