17
5

A Near-Linear Time Algorithm for the Chamfer Distance

Abstract

For any two point sets A,BRdA,B \subset \mathbb{R}^d of size up to nn, the Chamfer distance from AA to BB is defined as CH(A,B)=aAminbBdX(a,b)\text{CH}(A,B)=\sum_{a \in A} \min_{b \in B} d_X(a,b), where dXd_X is the underlying distance measure (e.g., the Euclidean or Manhattan distance). The Chamfer distance is a popular measure of dissimilarity between point clouds, used in many machine learning, computer vision, and graphics applications, and admits a straightforward O(dn2)O(d n^2)-time brute force algorithm. Further, the Chamfer distance is often used as a proxy for the more computationally demanding Earth-Mover (Optimal Transport) Distance. However, the \emph{quadratic} dependence on nn in the running time makes the naive approach intractable for large datasets. We overcome this bottleneck and present the first (1+ϵ)(1+\epsilon)-approximate algorithm for estimating the Chamfer distance with a near-linear running time. Specifically, our algorithm runs in time O(ndlog(n)/ε2)O(nd \log (n)/\varepsilon^2) and is implementable. Our experiments demonstrate that it is both accurate and fast on large high-dimensional datasets. We believe that our algorithm will open new avenues for analyzing large high-dimensional point clouds. We also give evidence that if the goal is to \emph{report} a (1+ε)(1+\varepsilon)-approximate mapping from AA to BB (as opposed to just its value), then any sub-quadratic time algorithm is unlikely to exist.

View on arXiv
Comments on this paper