On matrix estimation under monotonicity constraints

We consider the problem of estimating an unknown $n_1 \times n_2$ matrix $\theta^*$ from noisy observations under the constraint that $\theta^*$ is nondecreasing in both rows and columns. We consider the least squares estimator (LSE) in this setting and study its risk properties. We show that the worst case risk of the LSE is $n^{-1/2}$, up to multiplicative logarithmic factors, where $n = n_1 n_2$, and that the LSE is minimax rate optimal (up to logarithmic factors). We further prove that for some special $\theta^*$, the risk of the LSE could be much smaller than $n^{-1/2}$; in fact, it could even be parametric, i.e., $n^{-1}$ up to logarithmic factors. Such parametric rates occur when the number of "rectangular" blocks of $\theta^*$ is bounded from above by a constant. We derive, as a consequence, an interesting adaptation property of the LSE which we term variable adaptation: the LSE performs as well as the oracle estimator when estimating a matrix that is constant along each row/column. Our proofs borrow ideas from empirical process theory and convex geometry and are of independent interest.
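As a concrete illustration of the estimator studied here, the following is a minimal sketch of how the bimonotone LSE can be computed as a convex program (projection onto the cone of matrices nondecreasing in rows and columns). It uses cvxpy; the function name `bimonotone_lse`, the toy piecewise-constant signal, and the Gaussian noise level are illustrative assumptions, not the authors' code.

```python
import cvxpy as cp
import numpy as np

def bimonotone_lse(Y):
    """Least squares projection of Y onto the cone of matrices that are
    nondecreasing along every row and every column."""
    n1, n2 = Y.shape
    theta = cp.Variable((n1, n2))
    constraints = []
    if n1 > 1:
        constraints.append(theta[1:, :] >= theta[:-1, :])  # nondecreasing down each column
    if n2 > 1:
        constraints.append(theta[:, 1:] >= theta[:, :-1])  # nondecreasing along each row
    problem = cp.Problem(cp.Minimize(cp.sum_squares(Y - theta)), constraints)
    problem.solve()
    return theta.value

# Toy usage: a signal with two constant "rectangular" blocks plus Gaussian noise
# (assumed noise model for illustration).
rng = np.random.default_rng(0)
theta_star = np.kron(np.array([[0.0, 1.0], [0.0, 1.0]]), np.ones((5, 5)))
Y = theta_star + 0.5 * rng.standard_normal(theta_star.shape)
theta_hat = bimonotone_lse(Y)
print(np.mean((theta_hat - theta_star) ** 2))  # empirical squared-error risk proxy
```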