Automatically Identify and Rectify: Robust Deep Contrastive Multi-view Clustering in Noisy Scenarios

27 May 2025

Main:8 Pages

13 Figures

Bibliography:3 Pages

9 Tables

Appendix:11 Pages

Abstract

Leveraging the powerful representation learning capabilities, deep multi-view clustering methods have demonstrated reliable performance by effectively integrating multi-source information from diverse views in recent years. Most existing methods rely on the assumption of clean views. However, noise is pervasive in real-world scenarios, leading to a significant degradation in performance. To tackle this problem, we propose a novel multi-view clustering framework for the automatic identification and rectification of noisy data, termed AIRMVC. Specifically, we reformulate noisy identification as an anomaly identification problem using GMM. We then design a hybrid rectification strategy to mitigate the adverse effects of noisy data based on the identification results. Furthermore, we introduce a noise-robust contrastive mechanism to generate reliable representations. Additionally, we provide a theoretical proof demonstrating that these representations can discard noisy information, thereby improving the performance of downstream tasks. Extensive experiments on six benchmark datasets demonstrate that AIRMVC outperforms state-of-the-art algorithms in terms of robustness in noisy scenarios. The code of AIRMVC are available atthis https URLon Github.

View on arXiv

@article{yang2025_2505.21387,
  title={ Automatically Identify and Rectify: Robust Deep Contrastive Multi-view Clustering in Noisy Scenarios },
  author={ Xihong Yang and Siwei Wang and Fangdi Wang and Jiaqi Jin and Suyuan Liu and Yue Liu and En Zhu and Xinwang Liu and Yueming Jin },
  journal={arXiv preprint arXiv:2505.21387},
  year={ 2025 }
}

Comments on this paper