ResearchTrend.AI
Multi-modal Understanding and Generation for Medical Images and Text via Vision-Language Pre-Training

24 May 2021
Jong Hak Moon
HyunGyung Lee
W. Shin
Young-Hak Kim
E. Choi
    MedIm

Papers citing "Multi-modal Understanding and Generation for Medical Images and Text via Vision-Language Pre-Training"

19 / 19 papers shown
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics
Kai He
Rui Mao
Qika Lin
Yucheng Ruan
Xiang Lan
Mengling Feng
Erik Cambria
LM&MA
AILaw
85
148
0
28 Jan 2025
A Comprehensive Survey of Foundation Models in Medicine
Wasif Khan
Seowung Leem
Kyle B. See
Joshua K. Wong
Shaoting Zhang
R. Fang
AI4CE
LM&MA
VLM
95
16
0
17 Jan 2025
GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis
Bo Liu
K. Zou
Liming Zhan
Zexin Lu
Xiaoyu Dong
Yidi Chen
Chengqiang Xie
Jiannong Cao
Xiao-Ming Wu
Huazhu Fu
118
0
0
25 Nov 2024
RespLLM: Unifying Audio and Text with Multimodal LLMs for Generalized Respiratory Health Prediction
Yuwei Zhang
Tong Xia
Aaqib Saeed
Cecilia Mascolo
LM&MA
24
3
0
07 Oct 2024
Advancing Medical Radiograph Representation Learning: A Hybrid Pre-training Paradigm with Multilevel Semantic Granularity
Hanqi Jiang
Xixuan Hao
Yuzhou Huang
Chong Ma
Jiaxun Zhang
Yi Pan
Ruimao Zhang
MedIm
28
0
0
01 Oct 2024
Unlocking the Power of Spatial and Temporal Information in Medical Multimodal Pre-training
Jinxia Yang
Bing-Huang Su
Wayne Xin Zhao
Ji-Rong Wen
27
2
0
30 May 2024
A Survey of Deep Learning-based Radiology Report Generation Using Multimodal Data
Xinyi Wang
Grazziela Figueredo
Ruizhe Li
W. Zhang
Weitong Chen
Xin Chen
MedIm
ViT
37
2
0
21 May 2024
SkinGEN: an Explainable Dermatology Diagnosis-to-Generation Framework with Interactive Vision-Language Models
Bo Lin
Yingjing Xu
Xuanwen Bao
Zhou Zhao
Zuyong Zhang
Zhouyang Wang
54
2
0
23 Apr 2024
RJUA-MedDQA: A Multimodal Benchmark for Medical Document Question Answering and Clinical Reasoning
Congyun Jin
Ming Zhang
Xiaowei Ma
Yujiao Li
Yingbo Wang
...
Chenfei Chi
Xiangguo Lv
Fangzhou Li
Wei Xue
Yiran Huang
LM&MA
23
2
0
19 Feb 2024
AliFuse: Aligning and Fusing Multi-modal Medical Data for Computer-Aided Diagnosis
Qiuhui Chen
Yi Hong
MedIm
15
1
0
02 Jan 2024
UniChest: Conquer-and-Divide Pre-training for Multi-Source Chest X-Ray Classification
Tianjie Dai
Ruipeng Zhang
Feng Hong
Jiangchao Yao
Ya-Qin Zhang
Yanfeng Wang
12
8
0
18 Dec 2023
Missing-modality Enabled Multi-modal Fusion Architecture for Medical Data
Muyu Wang
Shiyu Fan
Yichen Li
Hui Chen
MedIm
17
1
0
27 Sep 2023
Utilizing Longitudinal Chest X-Rays and Reports to Pre-Fill Radiology Reports
Qingqing Zhu
T. Mathai
P. Mukherjee
Yifan Peng
Ronald M. Summers
Zhiyong Lu
19
17
0
14 Jun 2023
Bi-VLGM : Bi-Level Class-Severity-Aware Vision-Language Graph Matching for Text Guided Medical Image Segmentation
Wenting Chen
Jie Liu
Yixuan Yuan
VLM
16
3
0
20 May 2023
Local Contrastive Learning for Medical Image Recognition
S. A. Rizvi
Ruixiang Tang
X. Jiang
X. Ma
X. Hu
19
5
0
24 Mar 2023
LIMITR: Leveraging Local Information for Medical Image-Text Representation
Gefen Dawidowicz
Elad Hirsch
A. Tal
21
15
0
21 Mar 2023
Unified Vision-Language Pre-Training for Image Captioning and VQA
Luowei Zhou
Hamid Palangi
Lei Zhang
Houdong Hu
Jason J. Corso
Jianfeng Gao
MLLM
VLM
250
922
0
24 Sep 2019
A Survey on Deep Learning in Medical Image Analysis
G. Litjens
Thijs Kooi
B. Bejnordi
A. Setio
F. Ciompi
Mohsen Ghafoorian
Jeroen van der Laak
Bram van Ginneken
C. I. Sánchez
OOD
278
10,544
0
19 Feb 2017
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,724
0
26 Sep 2016