Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2408.09441
Cited By
CLIP-CID: Efficient CLIP Distillation via Cluster-Instance Discrimination
AAAI Conference on Artificial Intelligence (AAAI), 2024
18 August 2024
Kaicheng Yang
Tiancheng Gu
Xiang An
Haiqiang Jiang
Xiangzi Dai
Ziyong Feng
Weidong Cai
Jiankang Deng
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (2 upvotes)
Papers citing
"CLIP-CID: Efficient CLIP Distillation via Cluster-Instance Discrimination"
16 / 16 papers shown
Title
Jarvis: Towards Personalized AI Assistant via Personal KV-Cache Retrieval
Binxiao Xu
Junyu Feng
Ruichuan An
Yulin Luo
Shilin Yan
Hao Liang
Ming Lu
Wentao Zhang
97
0
0
26 Oct 2025
A Parameter-Efficient Mixture-of-Experts Framework for Cross-Modal Geo-Localization
Linfeng Li
Jian-jun Zhao
Zepeng Yang
Yuhang Song
Bojun Lin
Tianle Zhang
Yuchen Yuan
C. Zhang
Xuelong Li
MoE
132
0
0
23 Oct 2025
ProCLIP: Progressive Vision-Language Alignment via LLM-based Embedder
Xiaoxing Hu
Kaicheng Yang
Ziyang Gong
Qi Ming
Zonghao Guo
Xiang An
Ziyong Feng
Junchi Yan
Xue Yang
CLIP
VLM
155
0
0
21 Oct 2025
UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning
Tiancheng Gu
Kaicheng Yang
Kaichen Zhang
Xiang An
Ziyong Feng
Y. Zhang
Weidong Cai
Jiankang Deng
Lidong Bing
137
4
0
15 Oct 2025
Gradient-Attention Guided Dual-Masking Synergetic Framework for Robust Text-based Person Retrieval
Tianlu Zheng
Yifan Zhang
Xiang An
Ziyong Feng
Kaicheng Yang
Qichuan Ding
VLM
88
2
0
11 Sep 2025
Spotlighter: Revisiting Prompt Tuning from a Representative Mining View
Yutong Gao
Maoyuan Shao
Xinyang Huang
Chuang Zhu
Lijuan Sun
Yu Weng
Xuan Liu
Guoshun Nan
VLM
144
0
0
31 Aug 2025
ForCenNet: Foreground-Centric Network for Document Image Rectification
Peng Cai
Qiang Li
Kaicheng Yang
Dong Guo
Jia Li
Nan Zhou
Xiang An
Ninghua Yang
Jiankang Deng
89
0
0
26 Jul 2025
Multimodal Medical Image Binding via Shared Text Embeddings
Yunhao Liu
SuYang Xi
Shiqi Liu
Hong Ding
Chicheng Jin
Chong Zhong
Junjun He
Catherine C. Liu
Yiqing Shen
128
1
0
22 Jun 2025
Simple yet Effective Semi-supervised Knowledge Distillation from Vision-Language Models via Dual-Head Optimization
Seongjae Kang
Dong Bok Lee
Hyungjoon Jang
Sung Ju Hwang
VLM
328
0
0
12 May 2025
Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs
Tiancheng Gu
Kaicheng Yang
Ziyong Feng
Xingjun Wang
Yanzhao Zhang
Dingkun Long
Yingda Chen
Weidong Cai
Jiankang Deng
VLM
797
34
0
24 Apr 2025
Decoupled Global-Local Alignment for Improving Compositional Understanding
Xiaoxing Hu
Kaicheng Yang
Chao Guo
Haoran Xu
Ziyong Feng
Longji Xu
VLM
622
7
0
23 Apr 2025
RealSyn: An Effective and Scalable Multimodal Interleaved Document Transformation Paradigm
Tiancheng Gu
Kaicheng Yang
Chaoyi Zhang
Yin Xie
Xiang An
Ziyong Feng
Dongnan Liu
Weidong Cai
Jiankang Deng
CLIP
VLM
395
5
0
18 Feb 2025
CLIP-PING: Boosting Lightweight Vision-Language Models with Proximus Intrinsic Neighbors Guidance
Chu Myaet Thwal
Ye Lin Tun
Minh N. H. Nguyen
Eui-nam Huh
Choong Seon Hong
VLM
323
0
0
05 Dec 2024
ViCToR: Improving Visual Comprehension via Token Reconstruction for Pretraining LMMs
Yin Xie
Kaicheng Yang
Ninghua Yang
Weimo Deng
Xiangzi Dai
Tiancheng Gu
Yumeng Wang
Xiang An
Yongle Zhao
Ziyong Feng
MLLM
VLM
291
1
0
18 Oct 2024
Unsupervised Learning of Visual Features by Contrasting Cluster Assignments
Mathilde Caron
Ishan Misra
Julien Mairal
Priya Goyal
Piotr Bojanowski
Armand Joulin
OCL
SSL
1.1K
4,602
0
17 Jun 2020
Billion-scale similarity search with GPUs
IEEE Transactions on Big Data (TBD), 2017
Jeff Johnson
Matthijs Douze
Edouard Grave
781
4,389
0
28 Feb 2017
1