14
0

OmicsCL: Unsupervised Contrastive Learning for Cancer Subtype Discovery and Survival Stratification

Abstract

Unsupervised learning of disease subtypes from multi-omics data presents a significant opportunity for advancing personalized medicine. We introduce OmicsCL, a modular contrastive learning framework that jointly embeds heterogeneous omics modalities-such as gene expression, DNA methylation, and miRNA expression-into a unified latent space. Our method incorporates a survival-aware contrastive loss that encourages the model to learn representations aligned with survival-related patterns, without relying on labeled outcomes. Evaluated on the TCGA BRCA dataset, OmicsCL uncovers clinically meaningful clusters and achieves strong unsupervised concordance with patient survival. The framework demonstrates robustness across hyperparameter configurations and can be tuned to prioritize either subtype coherence or survival stratification. Ablation studies confirm that integrating survival-aware loss significantly enhances the predictive power of learned embeddings. These results highlight the promise of contrastive objectives for biological insight discovery in high-dimensional, heterogeneous omics data.

View on arXiv
@article{karagoz2025_2505.00650,
  title={ OmicsCL: Unsupervised Contrastive Learning for Cancer Subtype Discovery and Survival Stratification },
  author={ Atahan Karagoz },
  journal={arXiv preprint arXiv:2505.00650},
  year={ 2025 }
}
Comments on this paper