CLIP-Decoder : ZeroShot Multilabel Classification using Multimodal CLIP Aligned Representation

21 June 2024

Muhammad Ali

Papers citing "CLIP-Decoder : ZeroShot Multilabel Classification using Multimodal CLIP Aligned Representation"

3 / 3 papers shown

Title
Captured by Captions: On Memorization and its Mitigation in CLIP Models Wenhao Wang Adam Dziedzic Grace C. Kim Michael Backes Franziska Boenisch 79 0 0 11 Feb 2025
Speech-text based multi-modal training with bidirectional attention for improved speech recognition Yuhang Yang Haihua Xu Hao-Ming Huang E. Chng Sheng Li 34 7 0 01 Nov 2022
ImageNet-21K Pretraining for the Masses T. Ridnik Emanuel Ben-Baruch Asaf Noy Lihi Zelnik-Manor SSeg VLM CLIP 166 684 0 22 Apr 2021