arXiv:2403.08967
PathM3: A Multimodal Multi-Task Multiple Instance Learning Framework for Whole Slide Image Classification and Captioning
13 March 2024
Qifeng Zhou
Wenliang Zhong
Yuzhi Guo
Michael Xiao
Hehuan Ma
Junzhou Huang
Papers citing "PathM3: A Multimodal Multi-Task Multiple Instance Learning Framework for Whole Slide Image Classification and Captioning"
3 / 3 papers shown
Title
CLIP-IT: CLIP-based Pairing for Histology Images Classification
Banafsheh Karimian
Giulia Avanzato
Soufian Belharbi
Luke McCaffrey
Mohammadhadi Shateri
Eric Granger
VLM
46
0
0
22 Apr 2025
PathAlign: A vision-language model for whole slide images in histopathology
Faruk Ahmed
Andrew Sellergren
Lin Yang
Shawn Xu
Boris Babenko
...
S. Shetty
Daniel Golden
Yun-hui Liu
David F. Steiner
Ellery Wulczyn
LM&MA
VLM
29
13
0
27 Jun 2024
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
244
4,186
0
30 Jan 2023