2112.02889
Cited By
Joint Learning of Localized Representations from Medical Images and Reports
European Conference on Computer Vision (ECCV), 2022
6 December 2021
Philip Müller
Georgios Kaissis
Congyu Zou
Daniel Rueckert
Papers citing
"Joint Learning of Localized Representations from Medical Images and Reports"
50 / 70 papers shown
Title
Medusa: Cross-Modal Transferable Adversarial Attacks on Multimodal Medical Retrieval-Augmented Generation
Yingjia Shang
Yi Liu
Huimin Wang
Furong Li
Wenfang Sun
Wu Chengyu
Yefeng Zheng
AAML
MedIm
209
0
0
24 Nov 2025
MV-MLM: Bridging Multi-View Mammography and Language for Breast Cancer Diagnosis and Risk Prediction
Shunjie-Fabian Zheng
Hyeonjun Lee
Thijs Kooi
Ali Diba
88
0
0
30 Oct 2025
Alignment, Mining and Fusion: Representation Alignment with Hard Negative Mining and Selective Knowledge Fusion for Medical Visual Question Answering
Computer Vision and Pattern Recognition (CVPR), 2025
Yuanhao Zou
Zhaozheng Yin
MedIm
160
3
0
09 Oct 2025
MedCutMix: A Data-Centric Approach to Improve Radiology Vision-Language Pre-training with Disease Awareness
Sinuo Wang
Yutong Xie
Yuyuan Liu
Qi Wu
MedIm
LM&MA
102
0
0
20 Sep 2025
The Missing Piece: A Case for Pre-Training in 3D Medical Object Detection
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Katharina Eckstein
Constantin Ulrich
Michael Baumgartner
Jessica Kächele
Dimitrios Bounias
Tassilo Wald
R. Floca
Klaus H. Maier-Hein
ViT
MedIm
114
1
0
19 Sep 2025
Data-Efficient Fine-Tuning of Vision-Language Models for Diagnosis of Alzheimer's Disease
Fangqi Cheng
Surajit Ray
Xiaochen Yang
MedIm
LM&MA
VLM
218
0
0
09 Sep 2025
A Language-Signal-Vision Multimodal Framework for Multitask Cardiac Analysis
Yuting Zhang
Tiantian Geng
Luoying Hao
Xinxing Cheng
A. Thorley
...
Sandeep S Hothi
Lei Wei
Zhaowen Qiu
D. Kotecha
Yanfu Zhang
92
0
0
18 Aug 2025
Prototype-Enhanced Confidence Modeling for Cross-Modal Medical Image-Report Retrieval
Shreyank N Gowda
Xiaobo Jin
Christian Wagner
MedIm
88
0
0
05 Aug 2025
Boosting Vision Semantic Density with Anatomy Normality Modeling for Medical Vision-language Pre-training
Weiwei Cao
Jianpeng Zhang
Zhongyi Shui
Sinuo Wang
Z. Chen
...
Le Lu
X. Ye
Tingbo Liang
Qi Zhang
L. Zhang
90
2
0
01 Aug 2025
Distribution-Based Masked Medical Vision-Language Model Using Structured Reports
Shreyank N. Gowda
Ruichi Zhang
Xiao Gu
Ying Weng
Lu Yang
VLM
212
1
0
29 Jul 2025
Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration
Jun Wang
Lixing Zhu
Xiaohan Yu
A. Bhalerao
Yulan He
270
0
0
12 Jun 2025
Enhancing Biomedical Multi-modal Representation Learning with Multi-scale Pre-training and Perturbed Report Discrimination
Conference on Algebraic Informatics (AI), 2024
Xinliu Zhong
Kayhan Batmanghelich
Li Sun
152
1
0
02 Jun 2025
Towards Scalable Language-Image Pre-training for 3D Medical Imaging
Chenhui Zhao
Yiwei Lyu
Asadur Chowdury
Edward Harake
A. Kondepudi
Akshay Rao
X. Hou
Honglak Lee
Rui Feng
MedIm
LM&MA
166
1
0
28 May 2025
Anatomy-Aware Conditional Image-Text Retrieval
Meng Zheng
Jiajin Zhang
Benjamin Planche
Zhongpai Gao
Terrence Chen
Ziyan Wu
MedIm
203
0
0
10 Mar 2025
A Shared Encoder Approach to Multimodal Representation Learning
Shuvendu Roy
Franklin Ogidi
Ali Etemad
Elham Dolatabadi
Arash Afkanpour
149
1
0
03 Mar 2025
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations
Computer Vision and Pattern Recognition (CVPR), 2025
Ziyang Zhang
Yang Yu
Yucheng Chen
Xulei Yang
S. Yeo
MedIm
476
9
0
02 Mar 2025
BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs
Sheng Zhang
Yanbo Xu
Naoto Usuyama
Hanwen Xu
J. Bagga
...
Carlo Bifulco
M. Lungren
Tristan Naumann
Sheng Wang
Hoifung Poon
LM&MA
MedIm
725
413
0
10 Jan 2025
DuSSS: Dual Semantic Similarity-Supervised Vision-Language Model for Semi-Supervised Medical Image Segmentation
AAAI Conference on Artificial Intelligence (AAAI), 2024
Qingtao Pan
Wenhao Qiao
Jingjiao Lou
Bing Ji
Shuo Li
VLM
257
5
0
17 Dec 2024
NoteContrast: Contrastive Language-Diagnostic Pretraining for Medical Text
Prajwal Kailas
Max Homilius
Rahul C. Deo
Calum A. MacRae
273
4
0
16 Dec 2024
Medical Multimodal Foundation Models in Clinical Diagnosis and Treatment: Applications, Challenges, and Future Directions
Artificial Intelligence in Medicine (AIM), 2024
Kai Sun
Siyan Xue
F. Sun
Haoran Sun
Yu-Juan Luo
...
Xinzhou Wang
Lei Yang
Shuo Jin
Jun Yan
Jiahong Dong
AI4CE
293
19
0
03 Dec 2024
Uni-Mlip: Unified Self-supervision for Medical Vision Language Pre-training
British Machine Vision Conference (BMVC), 2024
Ameera Bawazir
Kebin Wu
Wenbin Li
CLIP
283
1
0
20 Nov 2024
BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays
Neural Information Processing Systems (NeurIPS), 2024
Yang Zhou
Tan Li Hui Faith
Yanyu Xu
Sicong Leng
Xinxing Xu
Yong Liu
Rick Siow Mong Goh
SSL
VLM
LM&MA
MedIm
150
3
0
29 Oct 2024
Image-aware Evaluation of Generated Medical Reports
Neural Information Processing Systems (NeurIPS), 2024
Gefen Dawidowicz
Elad Hirsch
A. Tal
193
1
0
22 Oct 2024
VoxelPrompt: A Vision Agent for End-to-End Medical Image Analysis
Andrew Hoopes
Neel Dey
V. Butoi
John Guttag
Adrian V. Dalca
MedIm
LM&MA
353
0
0
10 Oct 2024
Advancing Medical Radiograph Representation Learning: A Hybrid Pre-training Paradigm with Multilevel Semantic Granularity
Hanqi Jiang
Xixuan Hao
Yuzhou Huang
Chong Ma
Jiaxun Zhang
Yi Pan
Ruimao Zhang
MedIm
333
1
0
01 Oct 2024
Language-guided Scale-aware MedSegmentor for Lesion Segmentation in Medical Imaging
Shuyi Ouyang
Jinyang Zhang
Xiangye Lin
Xilai Wang
Qingqing Chen
Yen-Wei Chen
Lanfen Lin
VLM
287
0
0
30 Aug 2024
HYDEN: Hyperbolic Density Representations for Medical Images and Reports
International Conference on Computational Linguistics (COLING), 2024
Zhi Qiao
Linbin Han
Xiantong Zhen
Jia-Hong Gao
Zhen Qian
176
1
0
19 Aug 2024
Masks and Manuscripts: Advancing Medical Pre-training with End-to-End Masking and Narrative Structuring
Shreyank N. Gowda
David A. Clifton
MedIm
197
3
0
23 Jul 2024
LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification Task
Khai-Nguyen Nguyen
Ryan Zhang
Ngoc Son Nguyen
Tan-Hanh Pham
Anh Dao
Ba Hung Ngo
Anh Totti Nguyen
Truong-Son Hy
MedIm
LM&MA
179
5
0
16 Jul 2024
A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery
Yu Zhang
Xiusi Chen
Bowen Jin
Sheng Wang
Shuiwang Ji
Wei Wang
Jiawei Han
344
82
0
16 Jun 2024
Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning
Shuvendu Roy
Yasaman Parhizkar
Franklin Ogidi
Vahid Reza Khazaie
Michael Colacci
Ali Etemad
Elham Dolatabadi
Arash Afkanpour
VLM
230
1
0
11 Jun 2024
Self-supervised vision-langage alignment of deep learning representations for bone X-rays analysis
A. Englebert
Anne-Sophie Collin
O. Cornu
Christophe De Vleeschouwer
192
1
0
14 May 2024
Open Challenges and Opportunities in Federated Foundation Models Towards Biomedical Healthcare
Xingyu Li
Lu Peng
Yuping Wang
Weihua Zhang
AI4CE
MedIm
LM&MA
297
28
0
10 May 2024
Optimizing Universal Lesion Segmentation: State Space Model-Guided Hierarchical Networks with Feature Importance Adjustment
Kazi Shahriar Sanjid
Md. Tanzim Hossain
Md. Shakib Shahariar Junayed
M. M. Uddin
Mamba
167
2
0
26 Apr 2024
ChEX: Interactive Localization and Region Description in Chest X-rays
Philip Müller
Georgios Kaissis
Daniel Rueckert
211
11
0
24 Apr 2024
CT-GLIP: 3D Grounded Language-Image Pretraining with CT Scans and Radiology Reports for Full-Body Scenarios
Jingyang Lin
Yingda Xia
Jianpeng Zhang
Ke Yan
Le Lu
Jiebo Luo
Ling Zhang
MedIm
VLM
LM&MA
252
11
0
23 Apr 2024
Knowledge-enhanced Visual-Language Pretraining for Computational Pathology
Xiao Zhou
Xiaoman Zhang
Chaoyi Wu
Ya Zhang
Weidi Xie
Yanfeng Wang
VLM
254
12
0
15 Apr 2024
Foundation Model for Advancing Healthcare: Challenges, Opportunities, and Future Directions
IEEE Reviews in Biomedical Engineering (RBME), 2024
Yuting He
Fuxiang Huang
Xinrui Jiang
Yuxiang Nie
Minghao Wang
Jiguang Wang
Hao Chen
LM&MA
AI4CE
295
88
0
04 Apr 2024
Decomposing Disease Descriptions for Enhanced Pathology Detection: A Multi-Aspect Vision-Language Pre-training Framework
Computer Vision and Pattern Recognition (CVPR), 2024
Vu Minh Hieu Phan
Yutong Xie
Yuankai Qi
Lingqiao Liu
Liyang Liu
Bowen Zhang
Zhibin Liao
Qi Wu
Minh-Son To
Johan Verjans
275
26
0
12 Mar 2024
Enhancing medical vision-language contrastive learning via inter-matching relation modelling
Mingjian Li
Mingyuan Meng
M. Fulham
David Dagan Feng
Lei Bi
Jinman Kim
VLM
371
6
0
19 Jan 2024
Freeze the backbones: A Parameter-Efficient Contrastive Approach to Robust Medical Vision-Language Pre-training
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Jiuming Qin
Che Liu
Sibo Cheng
Wenhan Luo
Rossella Arcucci
VLM
MedIm
118
7
0
02 Jan 2024
From Text to Pixels: A Context-Aware Semantic Synergy Solution for Infrared and Visible Image Fusion
Xingyuan Li
Yang Zou
Jinyuan Liu
Zhiying Jiang
Long Ma
Xin-Yue Fan
Risheng Liu
242
13
0
31 Dec 2023
Masked Contrastive Reconstruction for Cross-modal Medical Image-Report Retrieval
Zeqiang Wei
Kai Jin
Xiuzhuang Zhou
MedIm
284
8
0
26 Dec 2023
CLIP in Medical Imaging: A Comprehensive Survey
Medical Image Analysis (MIA), 2023
Zihao Zhao
Yuxiao Liu
Han Wu
Yonghao Li
Sheng Wang
L. Teng
Disheng Liu
Zhiming Cui
Qian Wang
Hongtu Zhu
CLIP
MedIm
LM&MA
VLM
505
35
0
12 Dec 2023
Medical Vision Language Pretraining: A survey
Prashant Shrestha
Sanskar Amgain
Bidur Khanal
Cristian A. Linte
Binod Bhattarai
VLM
246
26
0
11 Dec 2023
Unified Medical Image Pre-training in Language-Guided Common Semantic Space
European Conference on Computer Vision (ECCV), 2024
Xiaoxuan He
Yifan Yang
Xinyang Jiang
Xufang Luo
Haoji Hu
Siyun Zhao
Dongsheng Li
Yuqing Yang
Lili Qiu
309
5
0
24 Nov 2023
LT-ViT: A Vision Transformer for multi-label Chest X-ray classification
IEEE International Conference on Image Processing (ICIP), 2023
Umar Marikkar
Sara Atito
Muhammad Awais
Adam Mahdi
MedIm
ViT
164
9
0
13 Nov 2023
Enhancing Representation in Radiography-Reports Foundation Model: A Granular Alignment Algorithm Using Masked Contrastive Learning
Nature Communications (Nat. Commun.), 2023
Weijian Huang
Cheng Li
Hao Yang
Jiarun Liu
Shanshan Wang
MedIm
187
50
0
12 Sep 2023
A Foundation Language-Image Model of the Retina (FLAIR): Encoding Expert Knowledge in Text Supervision
Julio Silva-Rodríguez
H. Chakor
Riadh Kobbi
Jose Dolz
Ismail Ben Ayed
VLM
MedIm
340
77
0
15 Aug 2023
PRIOR: Prototype Representation Joint Learning from Medical Images and Reports
IEEE International Conference on Computer Vision (ICCV), 2023
Pujin Cheng
Li Lin
Junyan Lyu
Yijin Huang
Tong Lu
Xiaoying Tang
MedIm
352
78
0
24 Jul 2023