A systematic review of challenges and proposed solutions in modeling multimodal data

Multimodal data modeling has emerged as a powerful approach in clinical research, enabling the integration of diverse data types such as imaging, genomics, wearable sensors, and electronic health records. Despite its potential to improve diagnostic accuracy and support personalized care, modeling such heterogeneous data presents significant technical challenges. This systematic review synthesizes findings from 69 studies to identify common obstacles, including missing modalities, limited sample sizes, dimensionality imbalance, interpretability issues, and finding the optimal fusion techniques. We highlight recent methodological advances, such as transfer learning, generative models, attention mechanisms, and neural architecture search that offer promising solutions. By mapping current trends and innovations, this review provides a comprehensive overview of the field and offers practical insights to guide future research and development in multimodal modeling for medical applications.
View on arXiv@article{farhadizadeh2025_2505.06945, title={ A systematic review of challenges and proposed solutions in modeling multimodal data }, author={ Maryam Farhadizadeh and Maria Weymann and Michael Blaß and Johann Kraus and Christopher Gundler and Sebastian Walter and Noah Hempen and Harald Binde and Nadine Binder }, journal={arXiv preprint arXiv:2505.06945}, year={ 2025 } }