In this paper, we investigate effective sketching schemes for high dimensional multilinear arrays or tensors. More specifically, we propose a novel tensor sparsification algorithm that retains a subset of the entries of a tensor in a judicious way, and prove that it can attain a given level of approximation accuracy in terms of tensor spectral norm with a much smaller sample complexity when compared with existing approaches. In particular, we show that for a th order cubic tensor of stable rank , the sample size requirement for achieving a relative error is, up to a logarithmic factor, of the order when is relatively large, and and essentially optimal when is sufficiently small. It is especially noteworthy that the sample size requirement for achieving a high accuracy is of an order independent of . In addition, we show that for larger , the space complexity can be improved for higher order tensors () by another sketching method via rank-one projection. To further demonstrate the utility of our techniques, we also study how higher order singular value decomposition (HOSVD) of large tensors can be efficiently approximated based on both sketching schemes.
View on arXiv