Joint Learning of Localized Representations from Medical Images and Reports

European Conference on Computer Vision (ECCV), 2022
6 December 2021
Philip Müller
Georgios Kaissis
Congyu Zou
Daniel Rueckert

Papers citing "Joint Learning of Localized Representations from Medical Images and Reports"

50 / 70 papers shown
Medusa: Cross-Modal Transferable Adversarial Attacks on Multimodal Medical Retrieval-Augmented Generation
Yingjia Shang
Yi Liu
Huimin Wang
Furong Li
Wenfang Sun
Wu Chengyu
Yefeng Zheng
AAML, MedIm
209
0
0
24 Nov 2025
MV-MLM: Bridging Multi-View Mammography and Language for Breast Cancer Diagnosis and Risk Prediction
Shunjie-Fabian Zheng
Hyeonjun Lee
Thijs Kooi
Ali Diba
88
0
0
30 Oct 2025
Alignment, Mining and Fusion: Representation Alignment with Hard Negative Mining and Selective Knowledge Fusion for Medical Visual Question Answering
Computer Vision and Pattern Recognition (CVPR), 2025
Yuanhao Zou
Zhaozheng Yin
MedIm
160
3
0
09 Oct 2025
MedCutMix: A Data-Centric Approach to Improve Radiology Vision-Language Pre-training with Disease Awareness
Sinuo Wang
Yutong Xie
Yuyuan Liu
Qi Wu
MedIm, LM&MA
102
0
0
20 Sep 2025
The Missing Piece: A Case for Pre-Training in 3D Medical Object Detection
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Katharina Eckstein
Constantin Ulrich
Michael Baumgartner
Jessica Kächele
Dimitrios Bounias
Tassilo Wald
R. Floca
Klaus H. Maier-Hein
ViT, MedIm
114
1
0
19 Sep 2025
Data-Efficient Fine-Tuning of Vision-Language Models for Diagnosis of Alzheimer's Disease
Fangqi Cheng
Surajit Ray
Xiaochen Yang
MedIm, LM&MA, VLM
218
0
0
09 Sep 2025
A Language-Signal-Vision Multimodal Framework for Multitask Cardiac Analysis
Yuting Zhang
Tiantian Geng
Luoying Hao
Xinxing Cheng
A. Thorley
...
Sandeep S Hothi
Lei Wei
Zhaowen Qiu
D. Kotecha
Yanfu Zhang
92
0
0
18 Aug 2025
Prototype-Enhanced Confidence Modeling for Cross-Modal Medical Image-Report Retrieval
Shreyank N Gowda
Xiaobo Jin
Christian Wagner
MedIm
88
0
0
05 Aug 2025
Boosting Vision Semantic Density with Anatomy Normality Modeling for Medical Vision-language Pre-training
Weiwei Cao
Jianpeng Zhang
Zhongyi Shui
Sinuo Wang
Z. Chen
...
Le Lu
X. Ye
Tingbo Liang
Qi Zhang
L. Zhang
90
2
0
01 Aug 2025
Distribution-Based Masked Medical Vision-Language Model Using Structured Reports
Shreyank N. Gowda
Ruichi Zhang
Xiao Gu
Ying Weng
Lu Yang
VLM
212
1
0
29 Jul 2025
Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration
Jun Wang
Lixing Zhu
Xiaohan Yu
A. Bhalerao
Yulan He
270
0
0
12 Jun 2025
Enhancing Biomedical Multi-modal Representation Learning with Multi-scale Pre-training and Perturbed Report Discrimination
Conference on Algebraic Informatics (AI), 2024
Xinliu Zhong
Kayhan Batmanghelich
Li Sun
152
1
0
02 Jun 2025
Towards Scalable Language-Image Pre-training for 3D Medical Imaging
Chenhui Zhao
Yiwei Lyu
Asadur Chowdury
Edward Harake
A. Kondepudi
Akshay Rao
X. Hou
Honglak Lee
Rui Feng
MedIm, LM&MA
166
1
0
28 May 2025
Anatomy-Aware Conditional Image-Text Retrieval
Meng Zheng
Jiajin Zhang
Benjamin Planche
Zhongpai Gao
Terrence Chen
Ziyan Wu
MedIm
203
0
0
10 Mar 2025
A Shared Encoder Approach to Multimodal Representation Learning
Shuvendu Roy
Franklin Ogidi
Ali Etemad
Elham Dolatabadi
Arash Afkanpour
149
1
0
03 Mar 2025
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations
Computer Vision and Pattern Recognition (CVPR), 2025
Ziyang Zhang
Yang Yu
Yucheng Chen
Xulei Yang
S. Yeo
MedIm
476
9
0
02 Mar 2025
BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs
Sheng Zhang
Yanbo Xu
Naoto Usuyama
Hanwen Xu
J. Bagga
...
Carlo Bifulco
M. Lungren
Tristan Naumann
Sheng Wang
Hoifung Poon
LM&MA, MedIm
725
413
0
10 Jan 2025
DuSSS: Dual Semantic Similarity-Supervised Vision-Language Model for Semi-Supervised Medical Image Segmentation
AAAI Conference on Artificial Intelligence (AAAI), 2024
Qingtao Pan
Wenhao Qiao
Jingjiao Lou
Bing Ji
Shuo Li
VLM
257
5
0
17 Dec 2024
NoteContrast: Contrastive Language-Diagnostic Pretraining for Medical Text
Prajwal Kailas
Max Homilius
Rahul C. Deo
Calum A. MacRae
273
4
0
16 Dec 2024
Medical Multimodal Foundation Models in Clinical Diagnosis and Treatment: Applications, Challenges, and Future Directions
Artificial Intelligence in Medicine (AIM), 2024
Kai Sun
Siyan Xue
F. Sun
Haoran Sun
Yu-Juan Luo
...
Xinzhou Wang
Lei Yang
Shuo Jin
Jun Yan
Jiahong Dong
AI4CE
293
19
0
03 Dec 2024
Uni-Mlip: Unified Self-supervision for Medical Vision Language Pre-training
British Machine Vision Conference (BMVC), 2024
Ameera Bawazir
Kebin Wu
Wenbin Li
CLIP
283
1
0
20 Nov 2024
BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays
Neural Information Processing Systems (NeurIPS), 2024
Yang Zhou
Tan Li Hui Faith
Yanyu Xu
Sicong Leng
Xinxing Xu
Yong Liu
Rick Siow Mong Goh
SSL, VLM, LM&MA, MedIm
150
3
0
29 Oct 2024
Image-aware Evaluation of Generated Medical Reports
Neural Information Processing Systems (NeurIPS), 2024
Gefen Dawidowicz
Elad Hirsch
A. Tal
193
1
0
22 Oct 2024
VoxelPrompt: A Vision Agent for End-to-End Medical Image Analysis
Andrew Hoopes
Neel Dey
V. Butoi
John Guttag
Adrian V. Dalca
MedIm, LM&MA
353
0
0
10 Oct 2024
Advancing Medical Radiograph Representation Learning: A Hybrid Pre-training Paradigm with Multilevel Semantic Granularity
Hanqi Jiang
Xixuan Hao
Yuzhou Huang
Chong Ma
Jiaxun Zhang
Yi Pan
Ruimao Zhang
MedIm
333
1
0
01 Oct 2024
Language-guided Scale-aware MedSegmentor for Lesion Segmentation in Medical Imaging
Shuyi Ouyang
Jinyang Zhang
Xiangye Lin
Xilai Wang
Qingqing Chen
Yen-Wei Chen
Lanfen Lin
VLM
287
0
0
30 Aug 2024
HYDEN: Hyperbolic Density Representations for Medical Images and Reports
International Conference on Computational Linguistics (COLING), 2024
Zhi Qiao
Linbin Han
Xiantong Zhen
Jia-Hong Gao
Zhen Qian
176
1
0
19 Aug 2024
Masks and Manuscripts: Advancing Medical Pre-training with End-to-End Masking and Narrative Structuring
Shreyank N. Gowda
David A. Clifton
MedIm
197
3
0
23 Jul 2024
LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification Task
Khai-Nguyen Nguyen
Ryan Zhang
Ngoc Son Nguyen
Tan-Hanh Pham
Anh Dao
Ba Hung Ngo
Anh Totti Nguyen
Truong-Son Hy
MedIm, LM&MA
179
5
0
16 Jul 2024
A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery
Yu Zhang
Xiusi Chen
Sara Szymkuć
Sheng Wang
Shuiwang Ji
Wei Wang
Jiawei Han
344
82
0
16 Jun 2024
Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning
Shuvendu Roy
Yasaman Parhizkar
Franklin Ogidi
Vahid Reza Khazaie
Michael Colacci
Ali Etemad
Elham Dolatabadi
Arash Afkanpour
VLM
230
1
0
11 Jun 2024
Self-supervised vision-langage alignment of deep learning representations for bone X-rays analysis
A. Englebert
Anne-Sophie Collin
O. Cornu
Christophe De Vleeschouwer
192
1
0
14 May 2024
Open Challenges and Opportunities in Federated Foundation Models Towards Biomedical Healthcare
Xingyu Li
Lu Peng
Yuping Wang
Weihua Zhang
AI4CE, MedIm, LM&MA
297
28
0
10 May 2024
Optimizing Universal Lesion Segmentation: State Space Model-Guided Hierarchical Networks with Feature Importance Adjustment
Kazi Shahriar Sanjid
Md. Tanzim Hossain
Md. Shakib Shahariar Junayed
M. M. Uddin
Mamba
167
2
0
26 Apr 2024
ChEX: Interactive Localization and Region Description in Chest X-rays
Philip Müller
Georgios Kaissis
Daniel Rueckert
211
11
0
24 Apr 2024
CT-GLIP: 3D Grounded Language-Image Pretraining with CT Scans and Radiology Reports for Full-Body Scenarios
Jingyang Lin
Yingda Xia
Jianpeng Zhang
Ke Yan
Le Lu
Jiebo Luo
Ling Zhang
MedIm, VLM, LM&MA
252
11
0
23 Apr 2024
Knowledge-enhanced Visual-Language Pretraining for Computational Pathology
Xiao Zhou
Xiaoman Zhang
Chaoyi Wu
Ya Zhang
Weidi Xie
Yanfeng Wang
VLM
254
12
0
15 Apr 2024
Foundation Model for Advancing Healthcare: Challenges, Opportunities, and Future Directions
IEEE Reviews in Biomedical Engineering (RBME), 2024
Yuting He
Fuxiang Huang
Xinrui Jiang
Yuxiang Nie
Minghao Wang
Jiguang Wang
Hao Chen
LM&MA, AI4CE
295
88
0
04 Apr 2024
Decomposing Disease Descriptions for Enhanced Pathology Detection: A Multi-Aspect Vision-Language Pre-training Framework
Computer Vision and Pattern Recognition (CVPR), 2024
Vu Minh Hieu Phan
Yutong Xie
Yuankai Qi
Lingqiao Liu
Liyang Liu
Bowen Zhang
Zhibin Liao
Qi Wu
Minh-Son To
Johan Verjans
275
26
0
12 Mar 2024
Enhancing medical vision-language contrastive learning via inter-matching relation modelling
Mingjian Li
Mingyuan Meng
M. Fulham
David Dagan Feng
Lei Bi
Jinman Kim
VLM
371
6
0
19 Jan 2024
Freeze the backbones: A Parameter-Efficient Contrastive Approach to Robust Medical Vision-Language Pre-training
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Jiuming Qin
Che Liu
Sibo Cheng
Wenhan Luo
Rossella Arcucci
VLM, MedIm
118
7
0
02 Jan 2024
From Text to Pixels: A Context-Aware Semantic Synergy Solution for Infrared and Visible Image Fusion
Xingyuan Li
Yang Zou
Jinyuan Liu
Zhiying Jiang
Long Ma
Xin-Yue Fan
Risheng Liu
242
13
0
31 Dec 2023
Masked Contrastive Reconstruction for Cross-modal Medical Image-Report Retrieval
Zeqiang Wei
Kai Jin
Xiuzhuang Zhou
MedIm
284
8
0
26 Dec 2023
CLIP in Medical Imaging: A Comprehensive Survey
Medical Image Analysis (MIA), 2023
Zihao Zhao
Yuxiao Liu
Han Wu
Yonghao Li
Sheng Wang
L. Teng
Disheng Liu
Zhiming Cui
Qian Wang
Hongtu Zhu
CLIP, MedIm, LM&MA, VLM
505
35
0
12 Dec 2023
Medical Vision Language Pretraining: A survey
Prashant Shrestha
Sanskar Amgain
Bidur Khanal
Cristian A. Linte
Binod Bhattarai
VLM
246
26
0
11 Dec 2023
Unified Medical Image Pre-training in Language-Guided Common Semantic Space
European Conference on Computer Vision (ECCV), 2023
Xiaoxuan He
Yifan Yang
Xinyang Jiang
Xufang Luo
Haoji Hu
Siyun Zhao
Dongsheng Li
Yuqing Yang
Lili Qiu
309
5
0
24 Nov 2023
LT-ViT: A Vision Transformer for multi-label Chest X-ray classification
IEEE International Conference on Image Processing (ICIP), 2023
Umar Marikkar
Sara Atito
Muhammad Awais
Adam Mahdi
MedIm, ViT
164
9
0
13 Nov 2023
Enhancing Representation in Radiography-Reports Foundation Model: A Granular Alignment Algorithm Using Masked Contrastive Learning
Nature Communications (Nat. Commun.), 2023
Weijian Huang
Cheng Li
Hao Yang
Jiarun Liu
Shanshan Wang
MedIm
187
50
0
12 Sep 2023
A Foundation Language-Image Model of the Retina (FLAIR): Encoding Expert Knowledge in Text Supervision
Julio Silva-Rodríguez
H. Chakor
Riadh Kobbi
Jose Dolz
Ismail Ben Ayed
VLM, MedIm
340
77
0
15 Aug 2023
PRIOR: Prototype Representation Joint Learning from Medical Images and Reports
IEEE International Conference on Computer Vision (ICCV), 2023
Pujin Cheng
Li Lin
Junyan Lyu
Yijin Huang
Tong Lu
Xiaoying Tang
MedIm
352
78
0
24 Jul 2023