Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2308.02463
Cited By
v1
v2
v3
v4
v5 (latest)
Towards Generalist Foundation Model for Radiology by Leveraging Web-scale 2D&3D Medical Data
4 August 2023
Chaoyi Wu
Xiaoman Zhang
Ya Zhang
Yanfeng Wang
Weidi Xie
MedIm
LM&MA
Re-assign community
ArXiv (abs)
PDF
HTML
Github (537★)
Papers citing
"Towards Generalist Foundation Model for Radiology by Leveraging Web-scale 2D&3D Medical Data"
50 / 131 papers shown
Med-VCD: Mitigating Hallucination for Medical Large Vision Language Models through Visual Contrastive Decoding
Computers in Biology and Medicine (Comput. Biol. Med.), 2025
Zahra Mahdavi
Zahra Khodakaramimaghsoud
Hooman Khaloo
Sina Bakhshandeh Taleshani
Erfan Hashemi
Javad Mirzapour Kaleybar
Omid Nejati Manzari
MLLM
VLM
304
1
0
01 Dec 2025
Robust Backdoor Removal by Reconstructing Trigger-Activated Changes in Latent Representation
Kazuki Iwahana
Yusuke Yamasaki
Akira Ito
Takayuki Miura
Toshiki Shibahara
AAML
296
5
0
12 Nov 2025
RadDiagSeg-M: A Vision Language Model for Joint Diagnosis and Multi-Target Segmentation in Radiology
Chengrun Li
Corentin Royer
Haozhe Luo
Bastian Wittmann
Xia Li
Ibrahim Ethem Hamamci
Sezgin Er
Anjany Sekuboyina
Bjoern Menze
MedIm
VLM
190
0
0
21 Oct 2025
MIMO: A medical vision language model with visual referring multimodal input and pixel grounding multimodal output
Computer Vision and Pattern Recognition (CVPR), 2025
Yanyuan Chen
Dexuan Xu
Yu Huang
Songkun Zhan
Hanpin Wang
Dongxue Chen
X. Wang
Meikang Qiu
Hang Li
324
17
0
11 Oct 2025
Enhancing 3D Medical Image Understanding with Pretraining Aided by 2D Multimodal Large Language Models
IEEE journal of biomedical and health informatics (JBHI), 2025
Qiuhui Chen
Xuancheng Yao
Huping Ye
Yi Hong
MedIm
165
1
0
11 Sep 2025
ShizhenGPT: Towards Multimodal LLMs for Traditional Chinese Medicine
Junying Chen
Zhenyang Cai
Zhiheng Liu
Yunjin Yang
Rongsheng Wang
...
Jing Guo
Xiang Wan
Guangjun Yu
Haizhou Li
Benyou Wang
LM&MA
231
4
0
20 Aug 2025
UNICON: UNIfied CONtinual Learning for Medical Foundational Models
Mohammad Areeb Qazi
Munachiso S Nwadike
Ibrahim Almakky
Mohammad Yaqub
Numan Saeed
CLL
199
0
0
19 Aug 2025
A Chain of Diagnosis Framework for Accurate and Explainable Radiology Report Generation
IEEE Transactions on Medical Imaging (IEEE TMI), 2025
Haibo Jin
Haoxuan Che
Sunan He
Hao-tao Chen
MedIm
AI4CE
298
6
0
13 Aug 2025
AMRG: Extend Vision Language Models for Automatic Mammography Report Generation
Nak-Jun Sung
Donghyun Lee
Bo Hwa Choi
Chae Jung Park
VLM
212
0
0
12 Aug 2025
CT-GRAPH: Hierarchical Graph Attention Network for Anatomy-Guided CT Report Generation
Hamza Kalisch
Fabian Horst
Jens Kleesiek
Ken Herrmann
C. Seibold
MedIm
137
4
0
07 Aug 2025
MedBLINK: Probing Basic Perception in Multimodal Language Models for Medicine
Mahtab Bigverdi
Wisdom O. Ikezogwo
Kevin Zhang
Hyewon Jeong
Mingyu Lu
Sungjae Cho
Linda G. Shapiro
Ranjay Krishna
LM&MA
MedIm
VLM
187
0
0
04 Aug 2025
Doctor Sun: A Bilingual Multimodal Large Language Model for Biomedical AI
Dong Xue
Ziyao Shao
Zhaoyang Duan
Fangzhou Liu
Bing Li
Zhongheng Zhang
LM&MA
461
0
0
30 Jul 2025
Exploring the Design Space of 3D MLLMs for CT Report Generation
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Mohammed Baharoon
Jun Ma
Congyu Fang
Augustin Toma
Bo Wang
234
1
0
26 Jun 2025
Chiron-o1: Igniting Multimodal Large Language Models towards Generalizable Medical Reasoning via Mentor-Intern Collaborative Search
Haoran Sun
Yankai Jiang
Wenjie Lou
Yujie Zhang
Wenjie Li
Lilong Wang
Mianxin Liu
Lei Liu
Xiaosong Wang
LRM
393
4
0
20 Jun 2025
CAPO: Reinforcing Consistent Reasoning in Medical Decision-Making
Songtao Jiang
Yuan Wang
Ruizhe Chen
Yan Zhang
Ruilin Luo
...
Sibo Song
Yang Feng
Jimeng Sun
Jian Wu
Zuozhu Liu
OffRL
LRM
252
5
0
15 Jun 2025
3D-RAD: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks
Xiaotang Gai
Jiaxiang Liu
Yichen Li
Zijie Meng
Jian Wu
Zuozhu Liu
VGen
516
14
0
11 Jun 2025
HSENet: Hybrid Spatial Encoding Network for 3D Medical Vision-Language Understanding
Yanzhao Shi
Xiaodan Zhang
Junzhong Ji
Haoning Jiang
Chengxin Zheng
Y. Wang
Liangqiong Qu
289
0
0
11 Jun 2025
Foundation Models in Medical Imaging: A Review and Outlook
Vivien van Veldhuizen
Vanessa Botha
C. Lu
Melis Erdal Cesur
Kevin Groot Lipman
...
Cees Snoek
Lodewyk Wessels
Ritse Mann
Eric Marcus
Jonas Teuwen
MedIm
VLM
AI4CE
547
2
0
10 Jun 2025
SurgVLM: A Large Vision-Language Model and Systematic Evaluation Benchmark for Surgical Intelligence
Zhitao Zeng
Zhu Zhuo
Xiaojun Jia
Erli Zhang
Junde Wu
...
Xiaochun Cao
Yutong Ban
Qi Dou
Yang Liu
Yueming Jin
VLM
536
22
0
03 Jun 2025
DrVD-Bench: Do Vision-Language Models Reason Like Human Doctors in Medical Image Diagnosis?
Tianhong Zhou
Yin Xu
Yingtao Zhu
Chuxi Xiao
Haiyang Bian
Lei Wei
Xuegong Zhang
LM&MA
VLM
289
6
0
30 May 2025
Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning
Jinquan Guan
Qi Chen
Lizhou Liang
Yuhang Liu
Vu Minh Hieu Phan
Minh-Son To
Jian Chen
Yutong Xie
LM&MA
LRM
224
0
0
29 May 2025
Look & Mark: Leveraging Radiologist Eye Fixations and Bounding boxes in Multimodal Large Language Models for Chest X-ray Report Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yunsoo Kim
Jinge Wu
Su-Hwan Kim
Pardeep Vasudev
Jiashu Shen
Honghan Wu
225
3
0
28 May 2025
Medical Large Vision Language Models with Multi-Image Visual Ability
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Xikai Yang
Juzheng Miao
Yuchen Yuan
Jiaze Wang
Qi Dou
Jinpeng Li
Ge Liu
277
8
0
25 May 2025
Are Vision Language Models Ready for Clinical Diagnosis? A 3D Medical Benchmark for Tumor-centric Visual Question Answering
Y. Chen
Wenjie Xiao
P. R. Bassi
Xinze Zhou
Sezgin Er
Ibrahim Ethem Hamamci
Zongwei Zhou
Yaoyao Liu
ELM
322
9
0
25 May 2025
Improving Medical Reasoning with Curriculum-Aware Reinforcement Learning
Shaohao Rui
Kaitao Chen
Weijie Ma
Xiaosong Wang
OffRL
LRM
236
0
0
25 May 2025
U2-BENCH: Benchmarking Large Vision-Language Models on Ultrasound Understanding
Anjie Le
Henan Liu
Yue Wang
Zhenyu Liu
Rongkun Zhu
...
Alison Noble
Jacques Souquet
Haoyun Zheng
Manxi Lin
Hongcheng Guo
LM&MA
ELM
VLM
428
5
0
23 May 2025
Specialized Foundation Models for Intelligent Operating Rooms
Ege Özsoy
Chantal Pellegrini
David Bani-Harouni
Kun Yuan
Matthias Keicher
Nassir Navab
295
0
0
19 May 2025
Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner
Wenchuan Zhang
Penghao Zhang
Jingru Guo
Tao Cheng
Jie Chen
Shuwan Zhang
Zhang Zhang
Yuhao Yi
Hong Bu
AI4TS
LRM
479
17
0
16 May 2025
Multi-Modal Explainable Medical AI Assistant for Trustworthy Human-AI Collaboration
Honglong Yang
Shanshan Song
Yi Qin
Lehan Wang
Haonan Wang
Xinpeng Ding
Qixiang Zhang
Bodong Du
Xuelong Li
LM&MA
290
3
0
11 May 2025
Structure Causal Models and LLMs Integration in Medical Visual Question Answering
IEEE Transactions on Medical Imaging (IEEE TMI), 2025
Zibo Xu
Qiang Li
Weizhi Nie
Weijie Wang
Anan Liu
CML
MedIm
382
6
0
05 May 2025
UniBiomed: A Universal Foundation Model for Grounded Biomedical Image Interpretation
Linshan Wu
Yuxiang Nie
Sunan He
Jiaxin Zhuang
Hao Chen
...
Hao Chen
Ronald Cheong Kin Chan
Yifan Peng
Pranav Rajpurkar
Hao Chen
LM&MA
MedIm
756
9
0
30 Apr 2025
Localizing Before Answering: A Hallucination Evaluation Benchmark for Grounded Medical Multimodal LLMs
Dung Nguyen
Minh Khoi Ho
Huy Ta
T. Nguyen
Qi Chen
...
Zhibin Liao
Minh-Son To
Johan Verjans
Phi Le Nguyen
Vu Minh Hieu Phan
659
0
0
30 Apr 2025
Multimodal Large Language Models for Medicine: A Comprehensive Survey
Jiarui Ye
Hao Tang
LM&MA
645
17
0
29 Apr 2025
SilVar-Med: A Speech-Driven Visual Language Model for Explainable Abnormality Detection in Medical Imaging
Tan-Hanh Pham
Chris Ngo
Trong-Duong Bui
Minh Luu Quang
Tan-Huong Pham
Truong-Son Hy
413
7
0
14 Apr 2025
MedM-VL: What Makes a Good Medical LVLM?
Yiming Shi
Shaoshuai Yang
Xun Zhu
Haoyu Wang
Xiangling Fu
Chenyi Guo
Ji Wu
VLM
545
3
0
06 Apr 2025
UMIT: Unifying Medical Imaging Tasks via Vision-Language Models
Haiyang Yu
Siyang Yi
Ke Niu
Minghan Zhuo
Bin Li
LM&MA
288
13
0
20 Mar 2025
Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models
Yuxiang Lai
Shitian Zhao
Ming Li
Jike Zhong
Yuheng Li
Konstantinos Psounis
Xiaofeng Yang
OffRL
LRM
LM&MA
VLM
789
112
0
18 Mar 2025
Towards All-in-One Medical Image Re-Identification
Computer Vision and Pattern Recognition (CVPR), 2025
Yuan Tian
Kaiyuan Ji
Rongzhao Zhang
Yankai Jiang
Chunyi Li
Xiaosong Wang
Guoquan Zheng
VLM
422
7
0
11 Mar 2025
GEMA-Score: Granular Explainable Multi-Agent Scoring Framework for Radiology Report Evaluation
Zhenxuan Zhang
Kinhei Lee
Weihang Deng
Weihang Deng
Zihao Jin
...
Zhifan Gao
D. C. Marshall
D. C. Marshall
G. Yang
Guang Yang
MedIm
309
1
0
07 Mar 2025
Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation
Computer Vision and Pattern Recognition (CVPR), 2025
Aishik Konwer
Zhijian Yang
Erhan Bas
Cao Xiao
Prateek Prasanna
Parminder Bhatia
Taha A. Kass-Hout
MedIm
VLM
424
11
0
06 Mar 2025
BioD2C: A Dual-level Semantic Consistency Constraint Framework for Biomedical VQA
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Zhengyang Ji
Shang Gao
Li Liu
Yifan Jia
Yutao Yue
287
1
0
04 Mar 2025
MedHallTune: An Instruction-Tuning Benchmark for Mitigating Medical Hallucination in Vision-Language Models
Qiao Yan
Yuchen Yuan
Xiaowei Hu
Yihan Wang
Jiaqi Xu
Jinpeng Li
Chi-Wing Fu
Ge Liu
MLLM
VLM
LM&MA
321
3
0
28 Feb 2025
MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Jiazhen Pan
Che Liu
Junde Wu
Fenglin Liu
Jiayuan Zhu
Hongwei Bran Li
Chen Chen
Cheng Ouyang
Daniel Rueckert
LRM
LM&MA
VLM
628
155
0
26 Feb 2025
Vision Language Models in Medicine
Beria Chingnabe Kalpelbe
Angel Gabriel Adaambiik
Wei Peng
VLM
LM&MA
446
9
0
24 Feb 2025
Reducing Hallucinations of Medical Multimodal Large Language Models with Visual Retrieval-Augmented Generation
Yun-Wei Chu
Kai Zhang
Christopher Malon
Martin Renqiang Min
VLM
208
12
0
20 Feb 2025
From large language models to multimodal AI: A scoping review on the potential of generative AI in medicine
Biomedical Engineering Letters (Biomed Eng Lett), 2025
Lukas Buess
Matthias Keicher
Nassir Navab
Andreas Maier
Soroosh Tayebi Arasteh
LM&MA
1.1K
4
0
13 Feb 2025
RadGPT: Constructing 3D Image-Text Tumor Datasets
P. R. Bassi
Mehmet Can Yavuz
Kang Wang
Xiaoxi Chen
Wenxuan Li
S. Decherchi
Andrea Cavalli
Yang Yang
Yaoyao Liu
Zongwei Zhou
LM&MA
MedIm
530
32
0
08 Jan 2025
A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in Medicine
Information Fusion (Inf. Fusion), 2024
Hanguang Xiao
Feizhong Zhou
Xianglong Liu
Tianqi Liu
Zhipeng Li
Xin Liu
Xiaoxuan Huang
AILaw
LM&MA
LRM
545
107
0
31 Dec 2024
Read Like a Radiologist: Efficient Vision-Language Model for 3D Medical Imaging Interpretation
Changsun Lee
Sangjoon Park
Cheong-Il Shin
Woo Hee Choi
Hyun Jeong Park
J. Lee
Jong Chul Ye
477
15
0
18 Dec 2024
MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization
Kangyu Zhu
Peng Xia
Yun Li
Hongtu Zhu
Sheng Wang
Huaxiu Yao
714
22
0
09 Dec 2024
1
2
3
Next
Page 1 of 3