2111.07832
iBOT: Image BERT Pre-Training with Online Tokenizer
15 November 2021
Jinghao Zhou
Chen Wei
Huiyu Wang
Wei Shen
Cihang Xie
Alan Yuille
Tao Kong
Papers citing
"iBOT: Image BERT Pre-Training with Online Tokenizer"
50 / 607 papers shown
Title
Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels
Neural Information Processing Systems (NeurIPS), 2023
Zebin You
Yong Zhong
Fan Bao
Jiacheng Sun
Chongxuan Li
Jun Zhu
DiffM
VLM
516
50
0
21 Feb 2023
Self-supervised learning of Split Invariant Equivariant representations
International Conference on Machine Learning (ICML), 2023
Q. Garrido
Laurent Najman
Yann LeCun
SSL
262
40
0
14 Feb 2023
Semantic Image Segmentation: Two Decades of Research
Foundations and Trends in Computer Graphics and Vision (FTCGV), 2023
G. Csurka
Riccardo Volpi
Boris Chidlovskii
3DV
271
76
0
13 Feb 2023
Anatomical Invariance Modeling and Semantic Alignment for Self-supervised Learning in 3D Medical Image Analysis
IEEE International Conference on Computer Vision (ICCV), 2023
Yankai Jiang
Ming Sun
Heng Guo
Xiaoyu Bai
K. Yan
Le Lu
Minfeng Xu
MedIm
249
32
0
11 Feb 2023
Self-supervised learning-based cervical cytology for the triage of HPV-positive women in resource-limited settings and low-data regime
Thomas Stegmüller
C. Abbet
Behzad Bozorgtabar
Holly E. Clarke
P. Petignat
P. Vassilakos
Jean-Philippe Thiran
181
13
0
10 Feb 2023
Towards Geospatial Foundation Models via Continual Pretraining
IEEE International Conference on Computer Vision (ICCV), 2023
Matías Mendieta
Boran Han
Xingjian Shi
Yi Zhu
Chen Chen
VLM
AI4CE
439
114
0
09 Feb 2023
Evaluating Self-Supervised Learning via Risk Decomposition
International Conference on Machine Learning (ICML), 2023
Yann Dubois
Tatsunori Hashimoto
Abigail Z. Jacobs
237
9
0
06 Feb 2023
AIM: Adapting Image Models for Efficient Video Action Recognition
International Conference on Learning Representations (ICLR), 2023
Taojiannan Yang
Yi Zhu
Yusheng Xie
Aston Zhang
Chong Chen
Mu Li
ViT
389
217
0
06 Feb 2023
Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining
International Conference on Machine Learning (ICML), 2023
Zekun Qi
Runpei Dong
Guo Fan
Zheng Ge
Xiangyu Zhang
Kaisheng Ma
Li Yi
380
186
0
05 Feb 2023
MOMA: Distill from Self-Supervised Teachers
Xingtai Lv
Nandakishor Desai
M. Palaniswami
229
5
0
04 Feb 2023
Energy-Inspired Self-Supervised Pretraining for Vision Models
International Conference on Learning Representations (ICLR), 2023
Ze Wang
Jiang Wang
Zicheng Liu
Qiang Qiu
237
10
0
02 Feb 2023
A Closer Look at Few-shot Classification Again
International Conference on Machine Learning (ICML), 2023
Xu Luo
Hao Wu
Ji Zhang
Lianli Gao
Jing Xu
Jingkuan Song
252
70
0
28 Jan 2023
Aerial Image Object Detection With Vision Transformer Detector (ViTDet)
IEEE International Geoscience and Remote Sensing Symposium (IGARSS), 2023
Liya Wang
A. Tien
390
17
0
28 Jan 2023
Understanding Self-Supervised Pretraining with Part-Aware Representation Learning
Jie Zhu
Jiyang Qi
Mingyu Ding
Xiaokang Chen
Ping Luo
Xinggang Wang
Wenyu Liu
Leye Wang
Jingdong Wang
SSL
221
9
0
27 Jan 2023
Leveraging the Third Dimension in Contrastive Learning
Sumukh K Aithal
Anirudh Goyal
Alex Lamb
Yoshua Bengio
Michael C. Mozer
MDE
185
0
0
27 Jan 2023
A Simple Recipe for Competitive Low-compute Self supervised Vision Models
Quentin Duval
Ishan Misra
Nicolas Ballas
201
11
0
23 Jan 2023
Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture
Computer Vision and Pattern Recognition (CVPR), 2023
Mahmoud Assran
Quentin Duval
Ishan Misra
Piotr Bojanowski
Pascal Vincent
Michael G. Rabbat
Yann LeCun
Nicolas Ballas
SSL
AI4TS
MDE
445
557
0
19 Jan 2023
Learning Customized Visual Models with Retrieval-Augmented Knowledge
Computer Vision and Pattern Recognition (CVPR), 2023
Haotian Liu
Kilho Son
Jianwei Yang
Ce Liu
Jianfeng Gao
Yong Jae Lee
Chunyuan Li
VLM
227
77
0
17 Jan 2023
RILS: Masked Visual Reconstruction in Language Semantic Space
Computer Vision and Pattern Recognition (CVPR), 2023
Shusheng Yang
Yixiao Ge
Kun Yi
Dian Li
Ying Shan
Xiaohu Qie
Xinggang Wang
CLIP
168
14
0
17 Jan 2023
A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Jie Gui
Tuo Chen
Jing Zhang
Qiong Cao
Zhe Sun
Haoran Luo
Dacheng Tao
556
339
0
13 Jan 2023
Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling
International Conference on Learning Representations (ICLR), 2023
Keyu Tian
Yi Jiang
Qishuai Diao
Chen Lin
Liwei Wang
Zehuan Yuan
249
135
0
09 Jan 2023
Learning Trajectory-Word Alignments for Video-Language Tasks
IEEE International Conference on Computer Vision (ICCV), 2023
Xu Yang
Zhang Li
Haiyang Xu
Hanwang Zhang
Qinghao Ye
Chenliang Li
Ming Yan
Yu Zhang
Fei Huang
Songfang Huang
181
7
0
05 Jan 2023
Ego-Only: Egocentric Action Detection without Exocentric Transferring
IEEE International Conference on Computer Vision (ICCV), 2023
Huiyu Wang
Mitesh Singh
Lorenzo Torresani
EgoV
332
34
0
03 Jan 2023
TinyMIM: An Empirical Study of Distilling MIM Pre-trained Models
Computer Vision and Pattern Recognition (CVPR), 2023
Sucheng Ren
Fangyun Wei
Zheng Zhang
Han Hu
288
50
0
03 Jan 2023
Disjoint Masking with Joint Distillation for Efficient Masked Image Modeling
IEEE transactions on multimedia (IEEE TMM), 2022
Xin Ma
Yu Xie
Chunyu Xie
Long Ye
Yafeng Deng
Xiang Ji
331
16
0
31 Dec 2022
Masked Event Modeling: Self-Supervised Pretraining for Event Cameras
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Simone Klenk
David Bonello
Lukas Koestler
Nikita Araslanov
Zorah Lähner
256
35
0
20 Dec 2022
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?
International Conference on Learning Representations (ICLR), 2022
Runpei Dong
Zekun Qi
Linfeng Zhang
Junbo Zhang
Jian-Yuan Sun
Zheng Ge
Li Yi
Kaisheng Ma
ViT
3DPC
274
137
0
16 Dec 2022
Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language
International Conference on Machine Learning (ICML), 2022
Alexei Baevski
Arun Babu
Wei-Ning Hsu
Michael Auli
VLM
SSL
308
123
0
14 Dec 2022
Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders
Computer Vision and Pattern Recognition (CVPR), 2022
Renrui Zhang
Liuhui Wang
Yu Qiao
Shiyang Feng
Jiaming Song
3DPC
254
180
0
13 Dec 2022
FastMIM: Expediting Masked Image Modeling Pre-training for Vision
Jianyuan Guo
Kai Han
Han Wu
Yehui Tang
Yunhe Wang
Chang Xu
184
15
0
13 Dec 2022
PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery
Computer Vision and Pattern Recognition (CVPR), 2022
Shengxiang Zhang
Salman Khan
Zhiqiang Shen
Muzammal Naseer
Guangyi Chen
Fahad Shahbaz Khan
CLL
VLM
231
106
0
11 Dec 2022
SEPT: Towards Scalable and Efficient Visual Pre-Training
AAAI Conference on Artificial Intelligence (AAAI), 2022
Yiqi Lin
Huabin Zheng
Huaping Zhong
Jinjing Zhu
Weijia Li
Conghui He
Lin Wang
171
2
0
11 Dec 2022
Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning
Computer Vision and Pattern Recognition (CVPR), 2022
Rui Wang
Dongdong Chen
Zuxuan Wu
Yinpeng Chen
Xiyang Dai
Mengchen Liu
Lu Yuan
Yu-Gang Jiang
VGen
290
119
0
08 Dec 2022
Group Generalized Mean Pooling for Vision Transformer
ByungSoo Ko
Han-Gyu Kim
Byeongho Heo
Sangdoo Yun
Sanghyuk Chun
Geonmo Gu
Wonjae Kim
ViT
284
3
0
08 Dec 2022
Rethinking the Objectives of Vector-Quantized Tokenizers for Image Synthesis
Computer Vision and Pattern Recognition (CVPR), 2022
Yuchao Gu
Xintao Wang
Yixiao Ge
Ying Shan
Xiaohu Qie
Mike Zheng Shou
DiffM
200
30
0
06 Dec 2022
Location-Aware Self-Supervised Transformers for Semantic Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Mathilde Caron
N. Houlsby
Cordelia Schmid
ViT
305
23
0
05 Dec 2022
Exploring Stochastic Autoregressive Image Modeling for Visual Representation
AAAI Conference on Artificial Intelligence (AAAI), 2022
Yu-Hang Qi
Fan Yang
Yousong Zhu
Yufei Liu
Liwei Wu
Rui Zhao
Wei Li
DiffM
103
16
0
03 Dec 2022
Spatio-Temporal Crop Aggregation for Video Representation Learning
IEEE International Conference on Computer Vision (ICCV), 2022
Sepehr Sameni
Simon Jenni
Paolo Favaro
279
4
0
30 Nov 2022
SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation
International Conference on Machine Learning (ICML), 2022
Huaishao Luo
Junwei Bao
Youzheng Wu
Xiaodong He
Tianrui Li
VLM
231
193
0
27 Nov 2022
Self-Supervised Learning based on Heat Equation
Yinpeng Chen
Xiyang Dai
Dongdong Chen
Mengchen Liu
Lu Yuan
Zicheng Liu
Youzuo Lin
138
5
0
23 Nov 2022
Fast-iTPN: Integrally Pre-Trained Transformer Pyramid Network with Token Migration
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Yunjie Tian
Lingxi Xie
Jihao Qiu
Jianbin Jiao
Yaowei Wang
Qi Tian
Qixiang Ye
ViT
174
19
0
23 Nov 2022
CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical Flow
IEEE International Conference on Computer Vision (ICCV), 2022
Philippe Weinzaepfel
Thomas Lucas
Vincent Leroy
Yohann Cabon
Vaibhav Arora
Romain Brégier
G. Csurka
L. Antsfeld
Boris Chidlovskii
Jérôme Revaud
ViT
402
153
0
18 Nov 2022
Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information
Computer Vision and Pattern Recognition (CVPR), 2022
Weijie Su
Xizhou Zhu
Chenxin Tao
Lewei Lu
Bin Li
Gao Huang
Yu Qiao
Xiaogang Wang
Jie Zhou
Jifeng Dai
225
54
0
17 Nov 2022
CAE v2: Context Autoencoder with CLIP Target
Xinyu Zhang
Jiahui Chen
Junkun Yuan
Qiang Chen
Jian Wang
...
Jimin Pi
Kun Yao
Junyu Han
Errui Ding
Jingdong Wang
VLM
CLIP
256
25
0
17 Nov 2022
MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis
Computer Vision and Pattern Recognition (CVPR), 2022
Tianhong Li
Huiwen Chang
Shlok Kumar Mishra
Han Zhang
Dina Katabi
Dilip Krishnan
272
225
0
16 Nov 2022
Masked Reconstruction Contrastive Learning with Information Bottleneck Principle
Ziwen Liu
Bonan Li
Congying Han
Tiande Guo
Xuecheng Nie
SSL
129
2
0
15 Nov 2022
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale
Computer Vision and Pattern Recognition (CVPR), 2022
Yuxin Fang
Wen Wang
Binhui Xie
Quan-Sen Sun
Ledell Yu Wu
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLM
CLIP
559
891
0
14 Nov 2022
SSL4EO-S12: A Large-Scale Multi-Modal, Multi-Temporal Dataset for Self-Supervised Learning in Earth Observation
Yi Wang
Nassim Ait Ali Braham
Zhitong Xiong
Chenying Liu
C. Albrecht
Xiao Xiang Zhu
213
92
0
13 Nov 2022
Demystify Self-Attention in Vision Transformers from a Semantic Perspective: Analysis and Application
Leijie Wu
Song Guo
Yaohong Ding
Junxiao Wang
Wenchao Xu
Richard Yi Da Xu
Jiewei Zhang
134
3
0
13 Nov 2022
Far Away in the Deep Space: Dense Nearest-Neighbor-Based Out-of-Distribution Detection
Silvio Galesso
Max Argus
Thomas Brox
UQCV
243
15
0
12 Nov 2022