Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2307.15189
Cited By
Med-Flamingo: a Multimodal Medical Few-shot Learner
27 July 2023
Michael Moor
Qian Huang
Shirley Wu
Michihiro Yasunaga
C. Zakka
Yashodhara Dalmia
E. Reis
Pranav Rajpurkar
J. Leskovec
LM&MA
MedIm
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Med-Flamingo: a Multimodal Medical Few-shot Learner"
50 / 147 papers shown
Title
Multi-Modal Explainable Medical AI Assistant for Trustworthy Human-AI Collaboration
Honglong Yang
Shanshan Song
Yi Qin
Lehan Wang
Haonan Wang
Xinpeng Ding
Qixiang Zhang
Bodong Du
X. Li
LM&MA
12
0
0
11 May 2025
MM-Skin: Enhancing Dermatology Vision-Language Model with an Image-Text Dataset Derived from Textbooks
Wenqi Zeng
Yuqi Sun
Chenxi Ma
Weimin Tan
Bo Yan
LM&MA
VLM
30
0
0
09 May 2025
Structure Causal Models and LLMs Integration in Medical Visual Question Answering
Zibo Xu
Qiang Li
Weizhi Nie
Weijie Wang
Anan Liu
CML
MedIm
32
0
0
05 May 2025
Localizing Before Answering: A Hallucination Evaluation Benchmark for Grounded Medical Multimodal LLMs
Dung Tien Nguyen
Minh Khoi Ho
Huy Ta
T. Nguyen
Qi Chen
...
Zhibin Liao
Minh Nguyen Nhat To
Johan W. Verjans
Phi Le Nguyen
Vu Minh Hieu Phan
21
0
0
30 Apr 2025
Multimodal Large Language Models for Medicine: A Comprehensive Survey
Jiarui Ye
Hao Tang
LM&MA
76
0
0
29 Apr 2025
OmniV-Med: Scaling Medical Vision-Language Model for Universal Visual Understanding
Songtao Jiang
Yuan Wang
Sibo Song
Y. Zhang
Zijie Meng
Bohan Lei
Jian Wu
Jimeng Sun
Zuozhu Liu
MedIm
VLM
32
0
0
20 Apr 2025
Enhancing Multi-task Learning Capability of Medical Generalist Foundation Model via Image-centric Multi-annotation Data
Xun Zhu
Fanbin Mo
Zheng Zhang
J. Wang
Yiming Shi
Ming Wu
Chuang Zhang
Miao Li
Ji Wu
22
0
0
14 Apr 2025
SilVar-Med: A Speech-Driven Visual Language Model for Explainable Abnormality Detection in Medical Imaging
Tan-Hanh Pham
Chris Ngo
Trong-Duong Bui
Minh Luu Quang
Tan-Huong Pham
Truong Son-Hy
27
0
0
14 Apr 2025
MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models
Pengfei Zhou
Fanrui Zhang
Xiaopeng Peng
Zhaopan Xu
Jiaxin Ai
...
Kai Wang
Xiaojun Chang
Wenqi Shao
Yang You
K. Zhang
ELM
LRM
30
0
0
08 Apr 2025
MedM-VL: What Makes a Good Medical LVLM?
Yiming Shi
Shaoshuai Yang
Xun Zhu
Haoyu Wang
Miao Li
Ji Wu
VLM
40
1
0
06 Apr 2025
LVMed-R2: Perception and Reflection-driven Complex Reasoning for Medical Report Generation
Hao Wang
Shuchang Ye
Jinghao Lin
Usman Naseem
Jinman Kim
LRM
27
0
0
02 Apr 2025
FaceBench: A Multi-View Multi-Level Facial Attribute VQA Dataset for Benchmarking Face Perception MLLMs
Xiaoqin Wang
Xusen Ma
Xianxu Hou
Meidan Ding
Yudong Li
Junliang Chen
Wenting Chen
Xiaoyang Peng
LinLin Shen
CVBM
68
0
0
27 Mar 2025
A Large-Scale Vision-Language Dataset Derived from Open Scientific Literature to Advance Biomedical Generalist AI
Alejandro Lozano
M. W. Sun
James Burgess
Jeffrey Nirschl
Christopher Polzak
...
Xiaohan Wang
Alfred Seunghoon Song
Chiang Chia-Chun
Robert Tibshirani
Serena Yeung-Levy
LM&MA
61
1
0
26 Mar 2025
Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face Detector
Xiao Guo
Xiufeng Song
Yue Zhang
Xiaohong Liu
X. Liu
51
1
0
26 Mar 2025
MedAgent-Pro: Towards Multi-modal Evidence-based Medical Diagnosis via Reasoning Agentic Workflow
Ziyue Wang
Junde Wu
Chang Han Low
Yueming Jin
LRM
55
1
0
21 Mar 2025
Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models
Yuxiang Lai
Jike Zhong
Ming Li
Shitian Zhao
Xiaofeng Yang
OffRL
LRM
LM&MA
VLM
68
5
0
18 Mar 2025
Can Large Vision Language Models Read Maps Like a Human?
Shuo Xing
Zezhou Sun
Shuangyu Xie
Kaiyuan Chen
Yanjia Huang
Yuping Wang
Jiachen Li
Dezhen Song
Zhengzhong Tu
56
2
0
18 Mar 2025
LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition?
Bangyan Li
Wenxuan Huang
Yunhang Shen
Y. Wang
Shaohui Lin
...
Ling You
Yinqi Zhang
Ke Li
Xing Sun
Y. Sun
45
1
0
10 Mar 2025
CLIMB: Data Foundations for Large Scale Multimodal Clinical Foundation Models
Wei Dai
Peilin Chen
Malinda Lu
Daniel Li
Haowen Wei
Hejie Cui
Paul Pu Liang
LM&MA
44
1
0
09 Mar 2025
Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation
Aishik Konwer
Zhijian Yang
Erhan Bas
Cao Xiao
Prateek Prasanna
Parminder Bhatia
Taha A. Kass-Hout
MedIm
VLM
54
0
0
06 Mar 2025
MedHEval: Benchmarking Hallucinations and Mitigation Strategies in Medical Large Vision-Language Models
Aofei Chang
Le Huang
Parminder Bhatia
Taha A. Kass-Hout
Fenglong Ma
Cao Xiao
VLM
66
0
0
04 Mar 2025
BioD2C: A Dual-level Semantic Consistency Constraint Framework for Biomedical VQA
Zhengyang Ji
Shang Gao
Li Liu
Yifan Jia
Yutao Yue
34
0
0
04 Mar 2025
MedHallTune: An Instruction-Tuning Benchmark for Mitigating Medical Hallucination in Vision-Language Models
Qiao Yan
Yuchen Yuan
Xiaowei Hu
Yihan Wang
Jiaqi Xu
Jinpeng Li
Chi-Wing Fu
Pheng Ann Heng
MLLM
VLM
LM&MA
43
0
0
28 Feb 2025
PaliGemma-CXR: A Multi-task Multimodal Model for TB Chest X-ray Interpretation
Denis Musinguzi
Andrew Katumba
Sudi Murindanyi
28
0
0
28 Feb 2025
Repurposing the scientific literature with vision-language models
Anton Alyakin
Jaden Stryker
Daniel Alber
Karl L. Sangwon
Brandon Duderstadt
...
Laura Snyder
Eric Leuthardt
Douglas Kondziolka
E. Oermann
Eric Karl Oermann
86
0
0
26 Feb 2025
Vision Language Models in Medicine
Beria Chingnabe Kalpelbe
Angel Gabriel Adaambiik
Wei Peng
VLM
LM&MA
83
2
0
24 Feb 2025
MedForge: Building Medical Foundation Models Like Open Source Software Development
Zheling Tan
Kexin Ding
Jin Gao
Mu Zhou
Dimitris N. Metaxas
Shaoting Zhang
Dequan Wang
AI4CE
42
1
0
22 Feb 2025
Reducing Hallucinations of Medical Multimodal Large Language Models with Visual Retrieval-Augmented Generation
Yun-Wei Chu
Kai Zhang
Christopher Malon
Martin Renqiang Min
VLM
36
0
0
20 Feb 2025
Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization
Shuo Xing
Yuping Wang
Peiran Li
Ruizheng Bai
Y. Wang
Chengxuan Qian
Huaxiu Yao
Zhengzhong Tu
87
6
0
18 Feb 2025
From large language models to multimodal AI: A scoping review on the potential of generative AI in medicine
Lukas Buess
Matthias Keicher
Nassir Navab
Andreas K. Maier
Soroosh Tayebi Arasteh
LM&MA
110
0
0
13 Feb 2025
Baichuan-Omni-1.5 Technical Report
Yadong Li
J. Liu
Tao Zhang
Tao Zhang
S. Chen
...
Jianhua Xu
Haoze Sun
Mingan Lin
Zenan Zhou
Weipeng Chen
AuLLM
64
10
0
28 Jan 2025
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics
Kai He
Rui Mao
Qika Lin
Yucheng Ruan
Xiang Lan
Mengling Feng
Erik Cambria
LM&MA
AILaw
71
148
0
28 Jan 2025
MedPromptX: Grounded Multimodal Prompting for Chest X-ray Diagnosis
Mai A. Shaaban
Adnan Khan
Mohammad Yaqub
LM&MA
70
2
0
28 Jan 2025
CBVLM: Training-free Explainable Concept-based Large Vision Language Models for Medical Image Classification
Cristiano Patrício
Isabel Rio-Torto
J. S. Cardoso
Luís F. Teixeira
João C. Neves
VLM
116
0
0
21 Jan 2025
PeFoMed: Parameter Efficient Fine-tuning of Multimodal Large Language Models for Medical Imaging
Gang Liu
Jinlong He
Pengfei Li
Genrong He
Zixu Zhao
Shenjun Zhong
LM&MA
65
2
0
17 Jan 2025
Visual Large Language Models for Generalized and Specialized Applications
Yifan Li
Zhixin Lai
Wentao Bao
Zhen Tan
Anh Dao
Kewei Sui
Jiayi Shen
Dong Liu
Huan Liu
Yu Kong
VLM
83
10
0
06 Jan 2025
A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in Medicine
Hanguang Xiao
Feizhong Zhou
X. Liu
Tianqi Liu
Zhipeng Li
Xin Liu
Xiaoxuan Huang
AILaw
LM&MA
LRM
59
17
0
31 Dec 2024
ACE-
M
3
M^3
M
3
: Automatic Capability Evaluator for Multimodal Medical Models
Xiechi Zhang
Shunfan Zheng
Linlin Wang
Gerard de Melo
Zhu Cao
Xiaoling Wang
Liang He
ELM
99
0
0
16 Dec 2024
MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization
Kangyu Zhu
Peng Xia
Yun-Qing Li
Hongtu Zhu
Sheng Wang
Huaxiu Yao
90
1
0
09 Dec 2024
Gla-AI4BioMed at RRG24: Visual Instruction-tuned Adaptation for Radiology Report Generation
Xi Zhang
Zaiqiao Meng
Jake Lever
Edmond S. L. Ho
LM&MA
93
2
0
06 Dec 2024
Medical Multimodal Foundation Models in Clinical Diagnosis and Treatment: Applications, Challenges, and Future Directions
Kai Sun
Siyan Xue
F. Sun
Haoran Sun
Yu-Juan Luo
...
Xinzhou Wang
Lei Yang
Shuo Jin
Jun Yan
Jiahong Dong
AI4CE
64
2
0
03 Dec 2024
ForgerySleuth: Empowering Multimodal Large Language Models for Image Manipulation Detection
Zhihao Sun
Haoran Jiang
Haoran Chen
Yixin Cao
Xipeng Qiu
Zuxuan Wu
Yu Jiang
62
1
0
29 Nov 2024
On Domain-Specific Post-Training for Multimodal Large Language Models
Daixuan Cheng
Shaohan Huang
Ziyu Zhu
Xintong Zhang
Wayne Xin Zhao
Zhongzhi Luan
Bo Dai
Zhenliang Zhang
VLM
87
2
0
29 Nov 2024
Med-2E3: A 2D-Enhanced 3D Medical Multimodal Large Language Model
Yiming Shi
Xun Zhu
Ying Hu
Chenyi Guo
Miao Li
Ji Wu
71
1
0
19 Nov 2024
VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
Vishwesh Nath
Wenqi Li
Dong Yang
Andriy Myronenko
Mingxin Zheng
...
Holger Roth
Daguang Xu
Baris Turkbey
Holger Roth
Daguang Xu
VLM
90
4
0
19 Nov 2024
Medical Adaptation of Large Language and Vision-Language Models: Are We Making Progress?
Daniel P. Jeong
Saurabh Garg
Zachary Chase Lipton
Michael Oberst
LM&MA
VLM
ELM
29
8
0
06 Nov 2024
E3D-GPT: Enhanced 3D Visual Foundation for Medical Vision-Language Model
Haoran Lai
Zihang Jiang
Qingsong Yao
Rongsheng Wang
Zhiyang He
Xiaodong Tao
Wei Wei
Weifu Lv
Shuoling Zhou
VLM
MedIm
17
3
0
18 Oct 2024
VoxelPrompt: A Vision-Language Agent for Grounded Medical Image Analysis
Andrew Hoopes
V. Butoi
John Guttag
Adrian V. Dalca
MedIm
LM&MA
26
1
0
10 Oct 2024
MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging
Noel Codella
Ying Jin
Shrey Jain
Yu Gu
Ho Hin Lee
...
Jenq-Neng Hwang
Thomas Lin
I. Tarapov
M. Lungren
Mu-Hsin Wei
LM&MA
VLM
MedIm
32
7
0
09 Oct 2024
From Generalist to Specialist: Adapting Vision Language Models via Task-Specific Visual Instruction Tuning
Yang Bai
Yang Zhou
Jun Zhou
Rick Siow Mong Goh
Daniel Ting
Yong Liu
VLM
41
0
0
09 Oct 2024
1
2
3
Next