Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation

27 May 2022
Yixuan Wei
Han Hu
Zhenda Xie
Zheng Zhang
Yue Cao
Jianmin Bao
Dong Chen
B. Guo
    CLIP
ArXiv (abs) | PDF | HTML | Github (258★)

Papers citing "Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation"

50 / 94 papers shown
EoS-FM: Can an Ensemble of Specialist Models act as a Generalist Feature Extractor?
Pierre Adorni
M. Pham
Stéphane May
Sébastien Lefèvre
119
0
0
26 Nov 2025
Make me an Expert: Distilling from Generalist Black-Box Models into Specialized Models for Semantic Segmentation
Yasser Benigmim
Subhankar Roy
Khalid Oublal
Imad Eddine Marouf
S. Essid
Vicky Kalogeiton
Stéphane Lathuilière
168
0
0
30 Aug 2025
Seeing Further on the Shoulders of Giants: Knowledge Inheritance for Vision Foundation Models
Jiabo Huang
Chen Chen
Lingjuan Lyu
VLM
206
1
0
20 Aug 2025
PiPViT: Patch-based Visual Interpretable Prototypes for Retinal Image Analysis
Marzieh Oghbaie
Teresa Araújo
Hrvoje Bogunović
ViT, MedIm
341
0
0
12 Jun 2025
A Unified and Scalable Membership Inference Method for Visual Self-supervised Encoder via Part-aware Capability
Jie Zhu
Jirong Zha
Ding Li
Leye Wang
435
1
0
15 May 2025
Rip Current Segmentation: A Novel Benchmark and YOLOv8 Baseline Results
Andrei Dumitriu
Florin Tatui
Florin Miron
Radu Tudor Ionescu
Radu Timofte
386
34
0
03 Apr 2025
RipVIS: Rip Currents Video Instance Segmentation Benchmark for Beach Monitoring and Safety
Computer Vision and Pattern Recognition (CVPR), 2025
Andrei Dumitriu
Florin Tatui
Florin Miron
Aakash Ralhan
Radu Tudor Ionescu
Radu Timofte
367
1
0
01 Apr 2025
Wearable Accelerometer Foundation Models for Health via Knowledge Distillation
Salar Abbaspourazad
Anshuman Mishra
Joseph D. Futoma
Andrew C. Miller
Ian Shapiro
478
6
0
15 Dec 2024
Towards RAW Object Detection in Diverse Conditions
Computer Vision and Pattern Recognition (CVPR), 2024
Zhong-Yu Li
Xin Jin
Boyuan Sun
Chun-Le Guo
Ming-Ming Cheng
189
5
0
24 Nov 2024
Explanation for Trajectory Planning using Multi-modal Large Language Model for Autonomous Driving
Shota Yamazaki
Chenyu Zhang
Takuya Nanri
Akio Shigekane
Siyuan Wang
Jo Nishiyama
Tao Chu
Kohei Yokosawa
LRM
264
1
0
15 Nov 2024
Understanding the Role of Equivariance in Self-supervised Learning
Neural Information Processing Systems (NeurIPS), 2024
Yifei Wang
Kaiwen Hu
Sharut Gupta
Ziyu Ye
Yisen Wang
Stefanie Jegelka
SSL
316
6
0
10 Nov 2024
BlabberSeg: Real-Time Embedded Open-Vocabulary Aerial Segmentation
Haechan Mark Bong
Ricardo de Azambuja
Giovanni Beltrame
VLM
175
1
0
16 Oct 2024
PHI-S: Distribution Balancing for Label-Free Multi-Teacher Distillation
Mike Ranzinger
Jon Barker
Greg Heinrich
Pavlo Molchanov
Bryan Catanzaro
Andrew Tao
277
12
0
02 Oct 2024
LACOSTE: Exploiting stereo and temporal contexts for surgical instrument segmentation
Qiyuan Wang
Shang Zhao
Zikang Xu
S Kevin Zhou
411
0
0
14 Sep 2024
CLIP-CID: Efficient CLIP Distillation via Cluster-Instance Discrimination
AAAI Conference on Artificial Intelligence (AAAI), 2024
Kaicheng Yang
Tiancheng Gu
Xiang An
Haiqiang Jiang
Xiangzi Dai
Ziyong Feng
Weidong Cai
Jiankang Deng
VLM
305
23
0
18 Aug 2024
iiANET: Inception Inspired Attention Hybrid Network for efficient Long-Range Dependency
Haruna Yunusa
Qin Shiyin
Abdulrahman Hamman Adama Chukkol
Isah Bello
A. Lawan
285
4
0
10 Jul 2024
Bringing Masked Autoencoders Explicit Contrastive Properties for Point Cloud Self-Supervised Learning
Bin Ren
Guofeng Mei
D. Paudel
Weijie Wang
Yawei Li
Mengyuan Liu
Rita Cucchiara
Luc Van Gool
Andrii Zadaianchuk
3DPC
264
18
0
08 Jul 2024
Enhancing Vision-Language Model with Unmasked Token Alignment
Jihao Liu
Jinliang Zheng
Boxiao Liu
Yu Liu
Jiaming Song
CLIP
196
0
0
29 May 2024
How to Augment for Atmospheric Turbulence Effects on Thermal Adapted Object Detection Models?
Engin Uzun
Erdem Akagündüz
237
0
0
10 May 2024
An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training
Jin Gao
Shubo Lin
Shaoru Wang
Yutong Kou
Zeming Li
Liang Li
Congxuan Zhang
Xiaoqin Zhang
Yizheng Wang
Weiming Hu
285
6
0
18 Apr 2024
A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene
Wenbo Zhang
Yifan Zhang
Jianfeng Lin
Binqiang Huang
Jinlu Zhang
Wenhao Yu
VLM
247
2
0
17 Apr 2024
A Unified Membership Inference Method for Visual Self-supervised Encoder via Part-aware Capability
Conference on Computer and Communications Security (CCS), 2024
Jie Zhu
Jirong Zha
Ding Li
Leye Wang
289
10
0
03 Apr 2024
Learning to Rank Patches for Unbiased Image Redundancy Reduction
Yang Luo
Zhineng Chen
Peng Zhou
Zuxuan Wu
Xieping Gao
Yu-Gang Jiang
SSL
280
6
0
31 Mar 2024
Masked Modeling for Self-supervised Representation Learning on Vision and Beyond
Siyuan Li
Luyuan Zhang
Zedong Wang
Di Wu
Lirong Wu
...
Jun Xia
Cheng Tan
Yang Liu
Baigui Sun
Stan Z. Li
SSL
300
28
0
31 Dec 2023
Morphing Tokens Draw Strong Masked Image Models
International Conference on Learning Representations (ICLR), 2023
Taekyung Kim
Byeongho Heo
Dongyoon Han
794
3
0
30 Dec 2023
AM-RADIO: Agglomerative Vision Foundation Model -- Reduce All Domains Into One
Michael Ranzinger
Greg Heinrich
Jan Kautz
Pavlo Molchanov
VLM
823
121
0
10 Dec 2023
Rejuvenating image-GPT as Strong Visual Representation Learners
International Conference on Machine Learning (ICML), 2023
Sucheng Ren
Zeyu Wang
Hongru Zhu
Junfei Xiao
Yaoyao Liu
Cihang Xie
VLM
284
12
0
04 Dec 2023
Infrared Image Super-Resolution via GAN
Y. Huang
S. Omachi
GAN
321
0
0
01 Dec 2023
ViT-Lens: Towards Omni-modal Representations
Computer Vision and Pattern Recognition (CVPR), 2023
Weixian Lei
Yixiao Ge
Kun Yi
Jianfeng Zhang
Difei Gao
Dylan Sun
Yuying Ge
Ying Shan
Mike Zheng Shou
203
32
0
27 Nov 2023
EdgeFM: Leveraging Foundation Model for Open-set Learning on the Edge
Bufang Yang
Lixing He
Neiwen Ling
Zhenyu Yan
Guoliang Xing
Xian Shuai
Xiaozhe Ren
Xin Jiang
442
35
0
18 Nov 2023
FLORA: Fine-grained Low-Rank Architecture Search for Vision Transformer
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Chi-Chih Chang
Yuan-Yao Sung
Shixing Yu
N. Huang
Diana Marculescu
Kai-Chiang Wu
ViT
168
4
0
07 Nov 2023
Asymmetric Masked Distillation for Pre-Training Small Foundation Models
Computer Vision and Pattern Recognition (CVPR), 2023
Zhiyu Zhao
Bingkun Huang
Sen Xing
Gangshan Wu
Yu Qiao
Limin Wang
207
12
0
06 Nov 2023
Adaptive Multi-head Contrastive Learning
European Conference on Computer Vision (ECCV), 2023
Lei Wang
Piotr Koniusz
Tom Gedeon
Liang Zheng
349
9
0
09 Oct 2023
Symmetrical Linguistic Feature Distillation with CLIP for Scene Text Recognition
ACM Multimedia (ACM MM), 2023
Zixiao Wang
Hongtao Xie
Yuxin Wang
Jianjun Xu
Boqiang Zhang
Yongdong Zhang
325
27
0
08 Oct 2023
Masked Image Residual Learning for Scaling Deeper Vision Transformers
Neural Information Processing Systems (NeurIPS), 2023
Guoxi Huang
Hongtao Fu
A. Bors
279
8
0
25 Sep 2023
Mitigating Adversarial Attacks in Federated Learning with Trusted Execution Environments
IEEE International Conference on Distributed Computing Systems (ICDCS), 2023
Simon Queyrut
V. Schiavoni
Pascal Felber
AAML, FedML
209
15
0
13 Sep 2023
ViT-Lens: Initiating Omni-Modal Exploration through 3D Insights
Weixian Lei
Yixiao Ge
Jianfeng Zhang
Dylan Sun
Kun Yi
Ying Shan
Mike Zheng Shou
172
1
0
20 Aug 2023
Pelta: Shielding Transformers to Mitigate Evasion Attacks in Federated Learning
Simon Queyrut
Yérom-David Bromberg
V. Schiavoni
FedML, AAML
174
1
0
08 Aug 2023
CLIP Brings Better Features to Visual Aesthetics Learners
Liwu Xu
Jinjin Xu
Yuzhe Yang
Yi-Jie Huang
Yanchun Xie
Yaqian Li
VLM
215
5
0
28 Jul 2023
MOCA: Self-supervised Representation Learning by Predicting Masked Online Codebook Assignments
Spyros Gidaris
Andrei Bursuc
Oriane Siméoni
Antonín Vobecký
N. Komodakis
Matthieu Cord
Patrick Pérez
SSL, ViT
268
9
0
18 Jul 2023
IAdet: Simplest human-in-the-loop object detection
Franco Marchesoni-Acland
Gabriele Facciolo
VLM
219
2
0
04 Jul 2023
Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners
International Conference on Learning Representations (ICLR), 2023
Bowen Shi
Xiaopeng Zhang
Yaoming Wang
Jin Li
Wenrui Dai
Junni Zou
H. Xiong
Qi Tian
295
9
0
28 Jun 2023
Continual Learners are Incremental Model Generalizers
International Conference on Machine Learning (ICML), 2023
Jaehong Yoon
Sung Ju Hwang
Yu Cao
CLL
204
6
0
21 Jun 2023
Are Large Kernels Better Teachers than Transformers for ConvNets?
International Conference on Machine Learning (ICML), 2023
Tianjin Huang
Lu Yin
Zhenyu Zhang
Lijuan Shen
Meng Fang
Mykola Pechenizkiy
Zinan Lin
Shiwei Liu
219
16
0
30 May 2023
What Makes for Good Visual Tokenizers for Large Language Models?
Guangzhi Wang
Yixiao Ge
Xiaohan Ding
Mohan S. Kankanhalli
Ying Shan
MLLM, VLM
291
46
0
20 May 2023
ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Peng Wang
Shijie Wang
Junyang Lin
Shuai Bai
Xiaohuan Zhou
Jingren Zhou
Xinggang Wang
Chang Zhou
VLM, MLLM, ObjD
591
154
0
18 May 2023
ImageBind: One Embedding Space To Bind Them All
Computer Vision and Pattern Recognition (CVPR), 2023
Rohit Girdhar
Alaaeldin El-Nouby
Zhuang Liu
Mannat Singh
Kalyan Vasudev Alwala
Armand Joulin
Ishan Misra
VLM
555
1,305
0
09 May 2023
What Do Self-Supervised Vision Transformers Learn?
International Conference on Learning Representations (ICLR), 2023
Namuk Park
Wonjae Kim
Byeongho Heo
Taekyung Kim
Sangdoo Yun
SSL
301
103
1
01 May 2023
A Strong and Reproducible Object Detector with Only Public Datasets
Tianhe Ren
Jianwei Yang
Siyi Liu
Ailing Zeng
Feng Li
Hao Zhang
Hongyang Li
Zhaoyang Zeng
Lei Zhang
ObjD
169
13
0
25 Apr 2023
A Cookbook of Self-Supervised Learning
Randall Balestriero
Mark Ibrahim
Vlad Sobal
Ari S. Morcos
Shashank Shekhar
...
Pierre Fernandez
Amir Bar
Hamed Pirsiavash
Yann LeCun
Micah Goldblum
SyDa, FedML, SSL
446
362
0
24 Apr 2023