MuG: A Multimodal Classification Benchmark on Game Data with Tabular,
Textual, and Visual FieldsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 |
Align, Reason and Learn: Enhancing Medical Vision-and-Language
Pre-training with KnowledgeACM Multimedia (ACM MM), 2022 |
Multi-Modal Masked Autoencoders for Medical Vision-and-Language
Pre-TrainingInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022 |