P-Transformer: A Prompt-based Multimodal Transformer Architecture For Medical Tabular Data

30 March 2023

Abstract

Medical tabular data, abundant in Electronic Health Records (EHRs), is a valuable resource for diverse medical tasks such as risk prediction. While deep learning approaches, particularly transformer-based models, have shown remarkable performance in tabular data prediction, there are still problems remaining for existing work to be effectively adapted into medical domain, such as ignoring unstructured free-texts and underutilizing the textual information in structured data. To address these issues, we propose PTransformer, a \underline{P}rompt-based multimodal \underline{Transformer} architecture designed specifically for medical tabular data. This framework consists of two critical components: a tabular cell embedding generator and a tabular transformer. The former efficiently encodes diverse modalities from both structured and unstructured tabular data into a harmonized language semantic space with the help of pre-trained sentence encoder and medical prompts. The latter integrates cell representations to generate patient embeddings for various medical tasks. In comprehensive experiments on two real-world datasets for three medical tasks, PTransformer demonstrated the improvements with 10.9%/11.0% on RMSE/MAE, 0.5%/2.2% on RMSE/MAE, and 1.6%/0.8% on BACC/AUROC compared to state-of-the-art (SOTA) baselines in predictability.

View on arXiv

@article{ruan2025_2303.17408,
  title={ P-Transformer: A Prompt-based Multimodal Transformer Architecture For Medical Tabular Data },
  author={ Yucheng Ruan and Xiang Lan and Daniel J. Tan and Hairil Rizal Abdullah and Mengling Feng },
  journal={arXiv preprint arXiv:2303.17408},
  year={ 2025 }
}

Comments on this paper