Optimal Transfer Learning for Missing Not-at-Random Matrix Completion

Abstract

We study transfer learning for matrix completion in a Missing Not-at-Random (MNAR) setting that is motivated by biological problems. The target matrix Q has entire rows and columns missing, making estimation impossible without side information. To address this, we use a noisy and incomplete source matrix P, which relates to Q via a feature shift in latent space. We consider both the active and passive sampling of rows and columns. We establish minimax lower bounds for entrywise estimation error in each setting. Our computationally efficient estimation framework achieves this lower bound for the active setting, which leverages the source data to query the most informative rows and columns of Q. This avoids the need for incoherence assumptions required for rate optimality in the passive sampling setting. We demonstrate the effectiveness of our approach through comparisons with existing algorithms on real-world biological datasets.

@article{jalan2025_2503.00174,
  title={Optimal Transfer Learning for Missing Not-at-Random Matrix Completion},
  author={Akhil Jalan and Yassir Jedra and Arya Mazumdar and Soumendu Sundar Mukherjee and Purnamrita Sarkar},
  journal={arXiv preprint arXiv:2503.00174},
  year={2025}
}