Enhancing Image Matting in Real-World Scenes with Mask-Guided Iterative Refinement

24 February 2025

Abstract

Real-world image matting is essential for applications in content creation and augmented reality. However, it remains challenging due to the complex nature of scenes and the scarcity of high-quality datasets. To address these limitations, we introduce Mask2Alpha, an iterative refinement framework designed to enhance semantic comprehension, instance awareness, and fine-detail recovery in image matting. Our framework leverages self-supervised Vision Transformer features as semantic priors, strengthening contextual understanding in complex scenarios. To further improve instance differentiation, we implement a mask-guided feature selection module, enabling precise targeting of objects in multi-instance settings. Additionally, a sparse convolution-based optimization scheme allows Mask2Alpha to recover high-resolution details through progressive refinement, moving from low-resolution semantic passes to high-resolution sparse reconstructions. In benchmarks across various real-world datasets, Mask2Alpha consistently achieves state-of-the-art results, demonstrating its effectiveness for accurate and efficient image matting.
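
The coarse-to-fine idea described above can be illustrated with a minimal sketch. It is not the Mask2Alpha implementation: the function names, the uncertainty thresholds, and the per-pixel `refine_fn` callback are all assumptions made for illustration, and the callback stands in for the sparse convolutions mentioned in the abstract. The sketch upsamples a low-resolution alpha map, selects only the pixels whose alpha is still ambiguous, and updates just those locations.

```python
from typing import Callable, List, Tuple

Grid = List[List[float]]
Coord = Tuple[int, int]


def upsample_nearest(alpha: Grid, factor: int) -> Grid:
    """Nearest-neighbour upsampling of a 2D alpha map by an integer factor."""
    out: Grid = []
    for row in alpha:
        wide = [v for v in row for _ in range(factor)]
        out.extend(list(wide) for _ in range(factor))
    return out


def uncertain_pixels(alpha: Grid, lo: float = 0.05, hi: float = 0.95) -> List[Coord]:
    """Return (row, col) positions whose alpha is neither clearly 0 nor clearly 1.

    The thresholds are hypothetical; they are not taken from the paper.
    """
    return [(y, x) for y, row in enumerate(alpha)
            for x, v in enumerate(row) if lo < v < hi]


def refine_sparse(alpha: Grid, coords: List[Coord],
                  refine_fn: Callable[[int, int, float], float]) -> Grid:
    """Apply refine_fn only at the selected positions, clamping to [0, 1]."""
    out = [list(row) for row in alpha]
    for y, x in coords:
        out[y][x] = min(1.0, max(0.0, refine_fn(y, x, alpha[y][x])))
    return out


def progressive_refine(coarse: Grid, factor: int,
                       refine_fn: Callable[[int, int, float], float]):
    """One coarse-to-fine step: upsample, find uncertain pixels, refine them."""
    upsampled = upsample_nearest(coarse, factor)
    coords = uncertain_pixels(upsampled)
    return refine_sparse(upsampled, coords, refine_fn), coords


if __name__ == "__main__":
    coarse = [[0.0, 0.5],
              [1.0, 1.0]]
    # Stand-in refinement rule: nudge alpha by horizontal position.
    refined, touched = progressive_refine(coarse, 2, lambda y, x, v: v + 0.1 * (x - 2))
    print(len(touched), "of", len(refined) * len(refined[0]), "pixels refined")
```

Restricting the update to uncertain pixels is what keeps the high-resolution pass cheap: confidently foreground or background regions are copied through unchanged, so the work scales with the size of the boundary region rather than with the full image.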


@article{liu2025_2502.17093,
  title={Enhancing Image Matting in Real-World Scenes with Mask-Guided Iterative Refinement},
  author={Rui Liu},
  journal={arXiv preprint arXiv:2502.17093},
  year={2025}
}
