iBOT: Image BERT Pre-Training with Online Tokenizer

15 November 2021

Cihang Xie

Papers citing "iBOT: Image BERT Pre-Training with Online Tokenizer"

50 / 607 papers shown

Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels

Neural Information Processing Systems (NeurIPS), 2023

Jun Zhu

538

21 Feb 2023

Self-supervised learning of Split Invariant Equivariant representations

International Conference on Machine Learning (ICML), 2023

304

14 Feb 2023

Semantic Image Segmentation: Two Decades of Research

Foundations and Trends in Computer Graphics and Vision (FTCGV), 2023

273

13 Feb 2023

Anatomical Invariance Modeling and Semantic Alignment for Self-supervised Learning in 3D Medical Image Analysis

IEEE International Conference on Computer Vision (ICCV), 2023

304

11 Feb 2023

Self-supervised learning-based cervical cytology for the triage of HPV-positive women in resource-limited settings and low-data regime

Jean-Philippe Thiran

219

10 Feb 2023

Towards Geospatial Foundation Models via Continual Pretraining

IEEE International Conference on Computer Vision (ICCV), 2023

467

114

09 Feb 2023

Evaluating Self-Supervised Learning via Risk Decomposition

International Conference on Machine Learning (ICML), 2023

Yann Dubois

Tatsunori Hashimoto

Abigail Z. Jacobs

285

06 Feb 2023

AIM: Adapting Image Models for Efficient Video Action Recognition

International Conference on Learning Representations (ICLR), 2023

422

219

06 Feb 2023

Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining

International Conference on Machine Learning (ICML), 2023

Xiangyu Zhang

403

191

05 Feb 2023

MOMA: Distill from Self-Supervised Teachers

Xingtai Lv

Nandakishor Desai

M. Palaniswami

254

04 Feb 2023

Energy-Inspired Self-Supervised Pretraining for Vision Models

International Conference on Learning Representations (ICLR), 2023

Ze Wang

Jiang Wang

Zicheng Liu

Qiang Qiu

261

02 Feb 2023

A Closer Look at Few-shot Classification Again

International Conference on Machine Learning (ICML), 2023

Lianli Gao

Jingkuan Song

267

28 Jan 2023

Aerial Image Object Detection With Vision Transformer Detector (ViTDet)

IEEE International Geoscience and Remote Sensing Symposium (IGARSS), 2023

Liya Wang

A. Tien

414

28 Jan 2023

Understanding Self-Supervised Pretraining with Part-Aware Representation Learning

Mingyu Ding

Ping Luo

Leye Wang

Jingdong Wang

SSL

249

27 Jan 2023

Leveraging the Third Dimension in Contrastive Learning

207

27 Jan 2023

A Simple Recipe for Competitive Low-compute Self supervised Vision Models

Quentin Duval

Ishan Misra

Nicolas Ballas

221

23 Jan 2023

Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture

Computer Vision and Pattern Recognition (CVPR), 2023

Pascal Vincent

471

596

19 Jan 2023

Learning Customized Visual Models with Retrieval-Augmented Knowledge

Computer Vision and Pattern Recognition (CVPR), 2023

Jianwei Yang

232

17 Jan 2023

RILS: Masked Visual Reconstruction in Language Semantic Space

Computer Vision and Pattern Recognition (CVPR), 2023

Shusheng Yang

Ying Shan

194

17 Jan 2023

A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

578

366

13 Jan 2023

Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling

International Conference on Learning Representations (ICLR), 2023

281

136

09 Jan 2023

Learning Trajectory-Word Alignments for Video-Language Tasks

IEEE International Conference on Computer Vision (ICCV), 2023

Fei Huang

215

05 Jan 2023

Ego-Only: Egocentric Action Detection without Exocentric Transferring

IEEE International Conference on Computer Vision (ICCV), 2023

357

03 Jan 2023

TinyMIM: An Empirical Study of Distilling MIM Pre-trained Models

Computer Vision and Pattern Recognition (CVPR), 2023

321

03 Jan 2023

Disjoint Masking with Joint Distillation for Efficient Masked Image Modeling

IEEE Transactions on Multimedia (IEEE TMM), 2022

Chunyu Xie

351

31 Dec 2022

Masked Event Modeling: Self-Supervised Pretraining for Event Cameras

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

286

20 Dec 2022

Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?

International Conference on Learning Representations (ICLR), 2022

311

139

16 Dec 2022

Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language

International Conference on Machine Learning (ICML), 2022

364

123

14 Dec 2022

Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders

Computer Vision and Pattern Recognition (CVPR), 2022

Yu Qiao

288

184

13 Dec 2022

FastMIM: Expediting Masked Image Modeling Pre-training for Vision

198

13 Dec 2022

PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery

Computer Vision and Pattern Recognition (CVPR), 2022

Salman Khan

248

106

11 Dec 2022

SEPT: Towards Scalable and Efficient Visual Pre-Training

AAAI Conference on Artificial Intelligence (AAAI), 2022

Huaping Zhong

Conghui He

Lin Wang

199

11 Dec 2022

Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning

Computer Vision and Pattern Recognition (CVPR), 2022

Zuxuan Wu

Lu Yuan

326

120

08 Dec 2022

Group Generalized Mean Pooling for Vision Transformer

303

08 Dec 2022

Rethinking the Objectives of Vector-Quantized Tokenizers for Image Synthesis

Computer Vision and Pattern Recognition (CVPR), 2022

Ying Shan

219

06 Dec 2022

Location-Aware Self-Supervised Transformers for Semantic Segmentation

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

330

05 Dec 2022

Exploring Stochastic Autoregressive Image Modeling for Visual Representation

AAAI Conference on Artificial Intelligence (AAAI), 2022

Fan Yang

114

03 Dec 2022

Spatio-Temporal Crop Aggregation for Video Representation Learning

IEEE International Conference on Computer Vision (ICCV), 2022

Sepehr Sameni

Simon Jenni

Paolo Favaro

319

30 Nov 2022

SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation

International Conference on Machine Learning (ICML), 2022

Tianrui Li

248

196

27 Nov 2022

Self-Supervised Learning based on Heat Equation

Lu Yuan

Zicheng Liu

Youzuo Lin

154

23 Nov 2022

Fast-iTPN: Integrally Pre-Trained Transformer Pyramid Network with Token Migration

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

Yaowei Wang

205

23 Nov 2022

CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical Flow

IEEE International Conference on Computer Vision (ICCV), 2022

498

160

18 Nov 2022

Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information

Computer Vision and Pattern Recognition (CVPR), 2022

Weijie Su

Gao Huang

Yu Qiao

Xiaogang Wang

Jie Zhou

Jifeng Dai

245

17 Nov 2022

CAE v2: Context Autoencoder with CLIP Target

...

Errui Ding

Jingdong Wang

VLM CLIP

276

17 Nov 2022

MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis

Computer Vision and Pattern Recognition (CVPR), 2022

331

232

16 Nov 2022

Masked Reconstruction Contrastive Learning with Information Bottleneck Principle

Congying Han

Xuecheng Nie

149

15 Nov 2022

EVA: Exploring the Limits of Masked Visual Representation Learning at Scale

Computer Vision and Pattern Recognition (CVPR), 2022

621

901

14 Nov 2022

SSL4EO-S12: A Large-Scale Multi-Modal, Multi-Temporal Dataset for Self-Supervised Learning in Earth Observation

Yi Wang

Nassim Ait Ali Braham

Zhitong Xiong

Chenying Liu

C. Albrecht

Xiao Xiang Zhu

233

13 Nov 2022

Demystify Self-Attention in Vision Transformers from a Semantic Perspective: Analysis and Application

154

13 Nov 2022

Far Away in the Deep Space: Dense Nearest-Neighbor-Based Out-of-Distribution Detection

Silvio Galesso

Max Argus

Thomas Brox

UQCV

270

12 Nov 2022