188

Low Rank Approximation for General Tensor Networks

Information Technology Convergence and Services (ITCS), 2022
Abstract

We study the problem of approximating a given tensor with qq modes ARn××nA \in \mathbb{R}^{n \times \ldots \times n} with an arbitrary tensor network of rank kk -- that is, a graph G=(V,E)G = (V, E), where V=q|V| = q, together with a collection of tensors {UvvV}\{U_v \mid v \in V\} which are contracted in the manner specified by GG to obtain a tensor TT. For each mode of UvU_v corresponding to an edge incident to vv, the dimension is kk, and we wish to find UvU_v such that the Frobenius norm distance between TT and AA is minimized. This generalizes a number of well-known tensor network decompositions, such as the Tensor Train, Tensor Ring, Tucker, and PEPS decompositions. We approximate AA by a binary tree network TT' with O(q)O(q) cores, such that the dimension on each edge of this network is at most O~(kO(dt)q/ε)\widetilde{O}(k^{O(dt)} \cdot q/\varepsilon), where dd is the maximum degree of GG and tt is its treewidth, such that ATF2(1+ε)ATF2\|A - T'\|_F^2 \leq (1 + \varepsilon) \|A - T\|_F^2. The running time of our algorithm is O(qnnz(A))+npoly(kdtq/ε)O(q \cdot \text{nnz}(A)) + n \cdot \text{poly}(k^{dt}q/\varepsilon), where nnz(A)\text{nnz}(A) is the number of nonzero entries of AA. Our algorithm is based on a new dimensionality reduction technique for tensor decomposition which may be of independent interest. We also develop fixed-parameter tractable (1+ε)(1 + \varepsilon)-approximation algorithms for Tensor Train and Tucker decompositions, improving the running time of Song, Woodruff and Zhong (SODA, 2019) and avoiding the use of generic polynomial system solvers. We show that our algorithms have a nearly optimal dependence on 1/ε1/\varepsilon assuming that there is no O(1)O(1)-approximation algorithm for the 242 \to 4 norm with better running time than brute force. Finally, we give additional results for Tucker decomposition with robust loss functions, and fixed-parameter tractable CP decomposition.

View on arXiv
Comments on this paper