Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.14830
Cited By
CLIP-Decoder : ZeroShot Multilabel Classification using Multimodal CLIP Aligned Representation
21 June 2024
Muhammad Ali
Salman Khan
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CLIP-Decoder : ZeroShot Multilabel Classification using Multimodal CLIP Aligned Representation"
3 / 3 papers shown
Title
Captured by Captions: On Memorization and its Mitigation in CLIP Models
Wenhao Wang
Adam Dziedzic
Grace C. Kim
Michael Backes
Franziska Boenisch
79
0
0
11 Feb 2025
Speech-text based multi-modal training with bidirectional attention for improved speech recognition
Yuhang Yang
Haihua Xu
Hao-Ming Huang
E. Chng
Sheng Li
34
7
0
01 Nov 2022
ImageNet-21K Pretraining for the Masses
T. Ridnik
Emanuel Ben-Baruch
Asaf Noy
Lihi Zelnik-Manor
SSeg
VLM
CLIP
166
684
0
22 Apr 2021
1