Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation

27 May 2022

Jianmin Bao

Papers citing "Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation"

50 / 94 papers shown

EoS-FM: Can an Ensemble of Specialist Models act as a Generalist Feature Extractor?

119

26 Nov 2025

Make me an Expert: Distilling from Generalist Black-Box Models into Specialized Models for Semantic Segmentation

168

30 Aug 2025

Seeing Further on the Shoulders of Giants: Knowledge Inheritance for Vision Foundation Models

206

20 Aug 2025

PiPViT: Patch-based Visual Interpretable Prototypes for Retinal Image Analysis

341

12 Jun 2025

A Unified and Scalable Membership Inference Method for Visual Self-supervised Encoder via Part-aware Capability

435

15 May 2025

Rip Current Segmentation: A Novel Benchmark and YOLOv8 Baseline Results

386

03 Apr 2025

RipVIS: Rip Currents Video Instance Segmentation Benchmark for Beach Monitoring and Safety

Computer Vision and Pattern Recognition (CVPR), 2025

367

01 Apr 2025

Wearable Accelerometer Foundation Models for Health via Knowledge Distillation

478

15 Dec 2024

Towards RAW Object Detection in Diverse Conditions

Computer Vision and Pattern Recognition (CVPR), 2024

189

24 Nov 2024

Explanation for Trajectory Planning using Multi-modal Large Language Model for Autonomous Driving

264

15 Nov 2024

Understanding the Role of Equivariance in Self-supervised Learning

Neural Information Processing Systems (NeurIPS), 2024

316

10 Nov 2024

BlabberSeg: Real-Time Embedded Open-Vocabulary Aerial Segmentation

175

16 Oct 2024

PHI-S: Distribution Balancing for Label-Free Multi-Teacher Distillation

277

02 Oct 2024

LACOSTE: Exploiting stereo and temporal contexts for surgical instrument segmentation

Qiyuan Wang

Shang Zhao

Zikang Xu

S Kevin Zhou

411

14 Sep 2024

CLIP-CID: Efficient CLIP Distillation via Cluster-Instance Discrimination

AAAI Conference on Artificial Intelligence (AAAI), 2024

Weidong Cai

Jiankang Deng

VLM

305

18 Aug 2024

iiANET: Inception Inspired Attention Hybrid Network for efficient Long-Range Dependency

Haruna Yunusa

Qin Shiyin

Abdulrahman Hamman Adama Chukkol

Isah Bello

A. Lawan

285

10 Jul 2024

Bringing Masked Autoencoders Explicit Contrastive Properties for Point Cloud Self-Supervised Learning

Weijie Wang

264

08 Jul 2024

Enhancing Vision-Language Model with Unmasked Token Alignment

196

29 May 2024

How to Augment for Atmospheric Turbulence Effects on Thermal Adapted Object Detection Models?

Engin Uzun

Erdem Akagündüz

237

10 May 2024

An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training

285

18 Apr 2024

A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene

Wenhao Yu

247

17 Apr 2024

A Unified Membership Inference Method for Visual Self-supervised Encoder via Part-aware Capability

Conference on Computer and Communications Security (CCS), 2024

Jie Zhu

Jirong Zha

Ding Li

Leye Wang

289

03 Apr 2024

Learning to Rank Patches for Unbiased Image Redundancy Reduction

Yang Luo

Zhineng Chen

Zuxuan Wu

280

31 Mar 2024

Masked Modeling for Self-supervised Representation Learning on Vision and Beyond

Siyuan Li

Luyuan Zhang

Zedong Wang

Di Wu

Lirong Wu

...

Jun Xia

Cheng Tan

Yang Liu

Baigui Sun

Stan Z. Li

SSL

300

31 Dec 2023

Morphing Tokens Draw Strong Masked Image Models

International Conference on Learning Representations (ICLR), 2023

Taekyung Kim

Byeongho Heo

Dongyoon Han

794

30 Dec 2023

AM-RADIO: Agglomerative Vision Foundation Model -- Reduce All Domains Into One

823

121

10 Dec 2023

Rejuvenating image-GPT as Strong Visual Representation Learners

International Conference on Machine Learning (ICML), 2023

Cihang Xie

284

04 Dec 2023

Infrared Image Super-Resolution via GAN

Y. Huang

S. Omachi

GAN

321

01 Dec 2023

ViT-Lens: Towards Omni-modal Representations

Computer Vision and Pattern Recognition (CVPR), 2023

Ying Shan

203

27 Nov 2023

EdgeFM: Leveraging Foundation Model for Open-set Learning on the Edge

442

18 Nov 2023

FLORA: Fine-grained Low-Rank Architecture Search for Vision Transformer

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

Chi-Chih Chang

168

07 Nov 2023

Asymmetric Masked Distillation for Pre-Training Small Foundation Models

Computer Vision and Pattern Recognition (CVPR), 2023

Zhiyu Zhao

Bingkun Huang

Sen Xing

Gangshan Wu

Yu Qiao

Limin Wang

207

06 Nov 2023

Adaptive Multi-head Contrastive Learning

European Conference on Computer Vision (ECCV), 2023

Lei Wang

Piotr Koniusz

Tom Gedeon

Liang Zheng

349

09 Oct 2023

Symmetrical Linguistic Feature Distillation with CLIP for Scene Text Recognition

ACM Multimedia (ACM MM), 2023

325

08 Oct 2023

Masked Image Residual Learning for Scaling Deeper Vision Transformers

Neural Information Processing Systems (NeurIPS), 2023

Guoxi Huang

Hongtao Fu

A. Bors

279

25 Sep 2023

Mitigating Adversarial Attacks in Federated Learning with Trusted Execution Environments

IEEE International Conference on Distributed Computing Systems (ICDCS), 2023

209

13 Sep 2023

ViT-Lens: Initiating Omni-Modal Exploration through 3D Insights

Ying Shan

172

20 Aug 2023

Pelta: Shielding Transformers to Mitigate Evasion Attacks in Federated Learning

174

08 Aug 2023

CLIP Brings Better Features to Visual Aesthetics Learners

215

28 Jul 2023

MOCA: Self-supervised Representation Learning by Predicting Masked Online Codebook Assignments

268

18 Jul 2023

IAdet: Simplest human-in-the-loop object detection

Franco Marchesoni-Acland

Gabriele Facciolo

VLM

219

04 Jul 2023

Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners

International Conference on Learning Representations (ICLR), 2023

295

28 Jun 2023

Continual Learners are Incremental Model Generalizers

International Conference on Machine Learning (ICML), 2023

204

21 Jun 2023

Are Large Kernels Better Teachers than Transformers for ConvNets?

International Conference on Machine Learning (ICML), 2023

Lu Yin

219

30 May 2023

What Makes for Good Visual Tokenizers for Large Language Models?

Ying Shan

291

20 May 2023

ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

Peng Wang

Shijie Wang

Junyang Lin

Shuai Bai

Xiaohuan Zhou

Jingren Zhou

Xinggang Wang

Chang Zhou

VLM MLLM ObjD

591

154

18 May 2023

ImageBind: One Embedding Space To Bind Them All

Computer Vision and Pattern Recognition (CVPR), 2023

Kalyan Vasudev Alwala

Armand Joulin

Ishan Misra

VLM

555

1,305

09 May 2023

What Do Self-Supervised Vision Transformers Learn?

International Conference on Learning Representations (ICLR), 2023

301

103

01 May 2023

A Strong and Reproducible Object Detector with Only Public Datasets

Jianwei Yang

Ailing Zeng

Lei Zhang

169

25 Apr 2023

A Cookbook of Self-Supervised Learning

...

Pierre Fernandez

446

362

24 Apr 2023