Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.06904
Cited By
FocusCLIP: Multimodal Subject-Level Guidance for Zero-Shot Transfer in Human-Centric Tasks
11 March 2024
Muhammad Gul Zain Ali Khan
Muhammad Ferjad Naeem
F. Tombari
Luc Van Gool
Didier Stricker
Muhammad Zeshan Afzal
VLM
CLIP
Re-assign community
ArXiv
PDF
HTML
Papers citing
"FocusCLIP: Multimodal Subject-Level Guidance for Zero-Shot Transfer in Human-Centric Tasks"
5 / 5 papers shown
Title
ChatGPT: Beginning of an End of Manual Linguistic Data Annotation? Use Case of Automatic Genre Identification
Taja Kuzman
I. Mozetič
Nikola Ljubesic
47
87
0
07 Mar 2023
Prior Knowledge-Guided Attention in Self-Supervised Vision Transformers
Kevin Miao
Akash Gokul
Raghav Singh
Suzanne Petryk
Joseph E. Gonzalez
Kurt Keutzer
Trevor Darrell
Colorado Reed
ViT
MedIm
23
6
0
07 Sep 2022
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
283
5,723
0
29 Apr 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
293
2,875
0
11 Feb 2021
Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning
Jiasen Lu
Caiming Xiong
Devi Parikh
R. Socher
83
1,440
0
06 Dec 2016
1