Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2407.21368
Cited By
Prompting Medical Large Vision-Language Models to Diagnose Pathologies by Visual Question Answering
31 July 2024
Danfeng Guo
Sumitaka Honji
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Prompting Medical Large Vision-Language Models to Diagnose Pathologies by Visual Question Answering"
4 / 4 papers shown
Title
BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs
Sheng Zhang
Yanbo Xu
Naoto Usuyama
Hanwen Xu
J. Bagga
...
Carlo Bifulco
M. Lungren
Tristan Naumann
Sheng Wang
Hoifung Poon
LM&MA
MedIm
145
191
0
10 Jan 2025
Multi-Modal Hallucination Control by Visual Information Grounding
Alessandro Favero
L. Zancato
Matthew Trager
Siddharth Choudhary
Pramuditha Perera
Alessandro Achille
Ashwin Swaminathan
Stefano Soatto
MLLM
52
14
0
20 Mar 2024
Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding
Sicong Leng
Hang Zhang
Guanzheng Chen
Xin Li
Shijian Lu
Chunyan Miao
Li Bing
VLM
MLLM
82
196
0
28 Nov 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
244
4,186
0
30 Jan 2023
1