Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.11793
Cited By
MM-Retinal: Knowledge-Enhanced Foundational Pretraining with Fundus Image-Text Expertise
20 May 2024
Ruiqi Wu
Chenran Zhang
Jianle Zhang
Yi Zhou
Tao Zhou
Huazhu Fu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"MM-Retinal: Knowledge-Enhanced Foundational Pretraining with Fundus Image-Text Expertise"
6 / 6 papers shown
Title
MM-Skin: Enhancing Dermatology Vision-Language Model with an Image-Text Dataset Derived from Textbooks
Wenqi Zeng
Yuqi Sun
Chenxi Ma
Weimin Tan
Bo Yan
LM&MA
VLM
45
0
0
09 May 2025
MicarVLMoE: A Modern Gated Cross-Aligned Vision-Language Mixture of Experts Model for Medical Image Captioning and Report Generation
Amaan Izhar
Nurul Japar
Norisma Idris
Ting Dang
MoE
64
0
0
29 Apr 2025
Delving into Out-of-Distribution Detection with Medical Vision-Language Models
Lie Ju
Sijin Zhou
Yukun Zhou
Huimin Lu
Zhuoting Zhu
P. Keane
Zongyuan Ge
VLM
40
0
0
02 Mar 2025
Visual Question Answering in Ophthalmology: A Progressive and Practical Perspective
Xiaolan Chen
Ruoyu Chen
Pusheng Xu
Weiyi Zhang
Xianwen Shang
M. He
Danli Shi
19
1
0
22 Oct 2024
MedSegDiff: Medical Image Segmentation with Diffusion Probabilistic Model
Junde Wu
Rao Fu
Huihui Fang
Yu Zhang
Yehui Yang
Haoyi Xiong
Huiying Liu
Yanwu Xu
MedIm
VLM
DiffM
98
238
0
01 Nov 2022
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
282
39,170
0
01 Sep 2014
1