ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2409.02416
22
0

Relative-Translation Invariant Wasserstein Distance

4 September 2024
Binshuai Wang
Qiwei Di
Ming Yin
Mengdi Wang
Quanquan Gu
Peng Wei
ArXivPDFHTML
Abstract

We introduce a new family of distances, relative-translation invariant Wasserstein distances (RWpRW_pRWp​), for measuring the similarity of two probability distributions under distribution shift. Generalizing it from the classical optimal transport model, we show that RWpRW_pRWp​ distances are also real distance metrics defined on the quotient set Pp(Rn)/∼\mathcal{P}_p(\mathbb{R}^n)/\simPp​(Rn)/∼ and invariant to distribution translations. When p=2p=2p=2, the RW2RW_2RW2​ distance enjoys more exciting properties, including decomposability of the optimal transport model, translation-invariance of the RW2RW_2RW2​ distance, and a Pythagorean relationship between RW2RW_2RW2​ and the classical quadratic Wasserstein distance (W2W_2W2​). Based on these properties, we show that a distribution shift, measured by W2W_2W2​ distance, can be explained in the bias-variance perspective. In addition, we propose a variant of the Sinkhorn algorithm, named RW2RW_2RW2​ Sinkhorn algorithm, for efficiently calculating RW2RW_2RW2​ distance, coupling solutions, as well as W2W_2W2​ distance. We also provide the analysis of numerical stability and time complexity for the proposed algorithm. Finally, we validate the RW2RW_2RW2​ distance metric and the algorithm performance with three experiments. We conduct one numerical validation for the RW2RW_2RW2​ Sinkhorn algorithm and show two real-world applications demonstrating the effectiveness of using RW2RW_2RW2​ under distribution shift: digits recognition and similar thunderstorm detection. The experimental results report that our proposed algorithm significantly improves the computational efficiency of Sinkhorn in certain practical applications, and the RW2RW_2RW2​ distance is robust to distribution translations compared with baselines.

View on arXiv
Comments on this paper