Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.19339
Cited By
Stitching Gaps: Fusing Situated Perceptual Knowledge with Vision Transformers for High-Level Image Classification
29 February 2024
Delfina Sol Martinez Pandiani
Nicolas Lazzari
Valentina Presutti
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Stitching Gaps: Fusing Situated Perceptual Knowledge with Vision Transformers for High-Level Image Classification"
4 / 4 papers shown
Title
ASIF: Coupled Data Turns Unimodal Models to Multimodal Without Training
Antonio Norelli
Marco Fumero
Valentino Maiorca
Luca Moschella
Emanuele Rodolà
Francesco Locatello
VLM
79
32
0
04 Oct 2022
Multimodal learning with graphs
Yasha Ektefaie
George Dasoulas
Ayush Noori
Maha Farhat
Marinka Zitnik
38
82
0
07 Sep 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
388
4,010
0
28 Jan 2022
Iterative Visual Reasoning Beyond Convolutions
Xinlei Chen
Li-Jia Li
Li Fei-Fei
Abhinav Gupta
LRM
GNN
24
210
0
29 Mar 2018
1