Distributed Robust Subspace Recovery

We study Robust Subspace Recovery (RSR) in distributed settings. We consider a huge dataset in an ad hoc network without a central processor, where each node has access to only one chunk of the dataset. We assume that part of the whole dataset lies around a low-dimensional subspace and the other part is composed of outliers that lie away from that subspace. The goal is to recover the underlying subspace for the whole dataset without transferring the data itself between the nodes. We apply the Consensus-Based Gradient method to the Geometric Median Subspace algorithm for RSR. We propose an iterative solution for the local dual minimization problem and establish its r-linear convergence. We also explain how to implement the Reaper and Fast Median Subspace algorithms for RSR in a distributed manner. We demonstrate the competitive performance of our algorithms on both synthetic and real data.
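
To make the setting concrete, the following is a minimal sketch of the problem described above, not the method of the paper. It assumes an iteratively reweighted PCA in the spirit of Fast Median Subspace: each node builds a D x D weighted covariance from its own chunk, and only these small matrices are combined. The exact sum over nodes is a stand-in for the consensus averaging an ad hoc network would perform, and all names (make_synthetic, local_weighted_cov, distributed_irls_sketch) and parameter values are hypothetical.

```python
import numpy as np


def make_synthetic(D=10, d=2, n_in=200, n_out=100, noise=0.01, n_nodes=4, seed=0):
    """Inliers near a random d-dimensional subspace plus Gaussian outliers,
    shuffled and split into one chunk per node (illustrative values only)."""
    rng = np.random.default_rng(seed)
    U, _ = np.linalg.qr(rng.standard_normal((D, d)))  # ground-truth basis
    inliers = rng.standard_normal((n_in, d)) @ U.T
    inliers += noise * rng.standard_normal((n_in, D))
    outliers = rng.standard_normal((n_out, D))
    X = rng.permutation(np.vstack([inliers, outliers]))
    return np.array_split(X, n_nodes), U


def local_weighted_cov(X, V, delta=1e-10):
    """Computed at one node from its own chunk X (n_k x D).
    Points far from span(V) get small weights 1 / max(dist, delta)."""
    residual = X - (X @ V) @ V.T
    dist = np.linalg.norm(residual, axis=1)
    w = 1.0 / np.maximum(dist, delta)
    return (X * w[:, None]).T @ X  # D x D, independent of chunk size


def distributed_irls_sketch(chunks, d, n_iter=50, seed=1):
    """Reweighted PCA where nodes exchange only D x D matrices."""
    rng = np.random.default_rng(seed)
    D = chunks[0].shape[1]
    V, _ = np.linalg.qr(rng.standard_normal((D, d)))  # shared random start
    for _ in range(n_iter):
        # Assumption: an exact sum replaces network consensus averaging.
        C = sum(local_weighted_cov(X, V) for X in chunks)
        _, eigvecs = np.linalg.eigh(C)  # eigenvalues in ascending order
        V = eigvecs[:, -d:]             # top-d eigenvectors
    return V


chunks, U_true = make_synthetic()
V_est = distributed_irls_sketch(chunks, d=2)
# Distance between the orthogonal projectors onto the two subspaces.
err = np.linalg.norm(V_est @ V_est.T - U_true @ U_true.T)
print(f"projector distance: {err:.4f}")
```

The raw data points never leave their node in this sketch; each round communicates one D x D matrix per node, which is the property the distributed setting requires.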