252

InstaHide's Sample Complexity When Mixing Two Private Images

Abstract

Inspired by InstaHide challenge [Huang, Song, Li and Arora'20], [Chen, Song and Zhuo'20] recently provides one mathematical formulation of InstaHide attack problem under Gaussian images distribution. They show that it suffices to use O(nprivkpriv2/(kpriv+1))O(n_{\mathsf{priv}}^{k_{\mathsf{priv}} - 2/(k_{\mathsf{priv}} + 1)}) samples to recover one private image in nprivO(kpriv)+poly(npub)n_{\mathsf{priv}}^{O(k_{\mathsf{priv}})} + \mathrm{poly}(n_{\mathsf{pub}}) time for any integer kprivk_{\mathsf{priv}}, where nprivn_{\mathsf{priv}} and npubn_{\mathsf{pub}} denote the number of images used in the private and the public dataset to generate a mixed image sample. Under the current setup for the InstaHide challenge of mixing two private images (kpriv=2k_{\mathsf{priv}} = 2), this means npriv4/3n_{\mathsf{priv}}^{4/3} samples are sufficient to recover a private image. In this work, we show that nprivlog(npriv)n_{\mathsf{priv}} \log ( n_{\mathsf{priv}} ) samples are sufficient (information-theoretically) for recovering all the private images.

View on arXiv
Comments on this paper