Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.10420
Cited By
CLIP-GCD: Simple Language Guided Generalized Category Discovery
17 May 2023
Rabah Ouldnoughi
Chia-Wen Kuo
Z. Kira
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CLIP-GCD: Simple Language Guided Generalized Category Discovery"
15 / 15 papers shown
Title
Agent-Centric Personalized Multiple Clustering with Multi-Modal LLMs
Ziye Chen
Yiqun Duan
Riheng Zhu
Zhenbang Sun
Mingming Gong
40
0
0
28 Mar 2025
MOS: Modeling Object-Scene Associations in Generalized Category Discovery
Zhengyuan Peng
Jinpeng Ma
Zhimin Sun
Ran Yi
Haichuan Song
Xin Tan
Lizhuang Ma
56
0
0
15 Mar 2025
GraphVL: Graph-Enhanced Semantic Modeling via Vision-Language Models for Generalized Class Discovery
Bhupendra S. Solanki
Ashwin Nair
Mainak Singha
Souradeep Mukhopadhyay
Ankit Jha
Biplab Banerjee
VLM
26
1
0
04 Nov 2024
Multimodal Generalized Category Discovery
Yuchang Su
Renping Zhou
Siyu Huang
Xingjian Li
Tianyang Wang
Ziyue Wang
Min Xu
40
0
0
18 Sep 2024
SelEx: Self-Expertise in Fine-Grained Generalized Category Discovery
Sarah Rastegar
Mohammadreza Salehi
Yuki M. Asano
Hazel Doughty
Cees G. M. Snoek
28
4
0
26 Aug 2024
GET: Unlocking the Multi-modal Potential of CLIP for Generalized Category Discovery
Enguang Wang
Zhimao Peng
Zhengyuan Xie
Fei Yang
Xialei Liu
Ming-Ming Cheng
56
3
0
15 Mar 2024
Textual Knowledge Matters: Cross-Modality Co-Teaching for Generalized Visual Class Discovery
Haiyang Zheng
Nan Pu
Wenjing Li
N. Sebe
Zhun Zhong
43
7
0
12 Mar 2024
Generalized Category Discovery in Semantic Segmentation
Zhengyuan Peng
Qijian Tian
Jianqing Xu
Yizhang Jin
Xuequan Lu
Xin Tan
Yuan Xie
Lizhuang Ma
ISeg
14
2
0
20 Nov 2023
Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment
Shengxiang Zhang
Muzammal Naseer
Guangyi Chen
Zhiqiang Shen
Salman Khan
Kun Zhang
F. Khan
VLM
58
4
0
24 Aug 2023
What's in a Name? Beyond Class Indices for Image Recognition
Kai Han
Yandong Li
S. Vaze
Jie Li
Xuhui Jia
VLM
19
7
0
05 Apr 2023
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
305
7,434
0
11 Nov 2021
CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP
Andreas Fürst
Elisabeth Rumetshofer
Johannes Lehner
Viet-Hung Tran
Fei Tang
...
David P. Kreil
Michael K Kopp
G. Klambauer
Angela Bitto-Nemling
Sepp Hochreiter
VLM
CLIP
201
102
0
21 Oct 2021
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
308
5,773
0
29 Apr 2021
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts
Soravit Changpinyo
P. Sharma
Nan Ding
Radu Soricut
VLM
273
1,081
0
17 Feb 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
298
3,693
0
11 Feb 2021
1