Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2111.07832
Cited By
v1
v2
v3 (latest)
iBOT: Image BERT Pre-Training with Online Tokenizer
15 November 2021
Jinghao Zhou
Chen Wei
Huiyu Wang
Wei Shen
Cihang Xie
Alan Yuille
Tao Kong
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"iBOT: Image BERT Pre-Training with Online Tokenizer"
50 / 605 papers shown
Title
Benchmarking Pathology Foundation Models: Adaptation Strategies and Scenarios
Jeaung Lee
Jeewoo Lim
Keunho Byeon
Jin Tae Kwak
181
12
0
21 Oct 2024
ViMoE: An Empirical Study of Designing Vision Mixture-of-Experts
Xumeng Han
Longhui Wei
Bushi Liu
Zipeng Wang
Chenhui Qiang
Xin He
Yingfei Sun
Zhenjun Han
Qi Tian
MoE
397
13
0
21 Oct 2024
Upsampling DINOv2 features for unsupervised vision tasks and weakly supervised materials segmentation
Ronan Docherty
Antonis Vamvakeros
Samuel J. Cooper
348
3
0
20 Oct 2024
Fusion from Decomposition: A Self-Supervised Approach for Image Fusion and Beyond
Pengwei Liang
Junjun Jiang
Qing Ma
Xianming Liu
Jiayi Ma
184
5
0
16 Oct 2024
DRACO: A Denoising-Reconstruction Autoencoder for Cryo-EM
Neural Information Processing Systems (NeurIPS), 2024
Yingjun Shen
Haizhao Dai
Qihe Chen
Yan Zeng
Jiakai Zhang
Yuan Pei
Jingyi Yu
234
4
0
15 Oct 2024
EchoApex: A General-Purpose Vision Foundation Model for Echocardiography
A. Amadou
Yanzhe Zhang
Sebastien Piat
Paul Klein
Ingo Schmuecking
Tiziano Passerini
Puneet Sharma
225
11
0
14 Oct 2024
Browsing without Third-Party Cookies: What Do You See?
ACM/SIGCOMM Internet Measurement Conference (IMC), 2024
Maxwell Lin
Shihan Lin
Helen Wu
Karen Wang
Xiaowei Yang
BDL
462
42
0
14 Oct 2024
Locality Alignment Improves Vision-Language Models
International Conference on Learning Representations (ICLR), 2024
Ian Covert
Tony Sun
James Zou
Tatsunori Hashimoto
VLM
537
11
0
14 Oct 2024
Large-Scale 3D Medical Image Pre-training with Geometric Context Priors
Linshan Wu
Jiaxin Zhuang
Hao Chen
199
18
0
13 Oct 2024
Brain Mapping with Dense Features: Grounding Cortical Semantic Selectivity in Natural Images With Vision Transformers
International Conference on Learning Representations (ICLR), 2024
Andrew F. Luo
Jacob Yeung
Rushikesh Zawar
Shaurya Dewan
Margaret M. Henderson
Leila Wehbe
Michael J. Tarr
316
12
0
07 Oct 2024
Denoising with a Joint-Embedding Predictive Architecture
International Conference on Learning Representations (ICLR), 2024
Dengsheng Chen
Jie Hu
Xiaoming Wei
Enhua Wu
DiffM
453
5
0
02 Oct 2024
Local-to-Global Self-Supervised Representation Learning for Diabetic Retinopathy Grading
Mostafa Hajighasemloua
Samad Sheikhaei
Hamid Soltanian-Zadeha
179
0
0
01 Oct 2024
Radio Foundation Models: Pre-training Transformers for 5G-based Indoor Localization
International Conference on Indoor Positioning and Indoor Navigation (IPIN), 2024
Jonathan Ott
Jonas Pirkl
Maximilian Stahlke
Tobias Feigl
Christopher Mutschler
78
13
0
01 Oct 2024
Text-driven Human Motion Generation with Motion Masked Diffusion Model
Xingyu Chen
DiffM
VGen
148
6
0
29 Sep 2024
Harnessing Frozen Unimodal Encoders for Flexible Multimodal Alignment
Computer Vision and Pattern Recognition (CVPR), 2024
Mayug Maniparambil
Raiymbek Akshulakov
Y. A. D. Djilali
Sanath Narayan
Ankit Singh
Noel E. O'Connor
VLM
MLLM
132
0
0
28 Sep 2024
Embed and Emulate: Contrastive representations for simulation-based inference
Ruoxi Jiang
Peter Y. Lu
Rebecca Willett
186
1
0
27 Sep 2024
MEXMA: Token-level objectives improve sentence representations
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Joao Maria Janeiro
Benjamin Piwowarski
Patrick Gallinari
Loïc Barrault
110
4
0
19 Sep 2024
Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning
International Conference on Learning Representations (ICLR), 2024
Amin Karimi Monsefi
Mengxi Zhou
Nastaran Karimi Monsefi
Ser-Nam Lim
Wei-Lun Chao
R. Ramnath
280
4
0
16 Sep 2024
Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval
Engineering applications of artificial intelligence (EAAI), 2024
Amirreza Mahbod
Nematollah Saeidi
Sepideh Hatamikia
Ramona Woitek
VLM
MedIm
312
12
0
14 Sep 2024
Phikon-v2, A large and public feature extractor for biomarker prediction
Alexandre Filiot
Paul Jacob
Alice Mac Kain
Charlie Saillard
MedIm
203
59
0
13 Sep 2024
DetailCLIP: Detail-Oriented CLIP for Fine-Grained Tasks
Amin Karimi Monsefi
Kishore Prakash Sailaja
Ali Alilooee
Ser-Nam Lim
R. Ramnath
VLM
345
16
0
10 Sep 2024
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
Neural Information Processing Systems (NeurIPS), 2024
Yunze Man
Shuhong Zheng
Zhipeng Bao
M. Hebert
Liang-Yan Gui
Yu-Xiong Wang
504
31
0
05 Sep 2024
CanvOI, an Oncology Intelligence Foundation Model: Scaling FLOPS Differently
Jonathan Zalach
Inbal Gazy
Assaf Avinoam
Ron Sinai
Eran Shmuel
Inbar Gilboa
Christine Swisher
Naim Matasci
Reva Basho
David B. Agus
149
0
0
04 Sep 2024
Dual Advancement of Representation Learning and Clustering for Sparse and Noisy Images
ACM Multimedia (MM), 2024
Wenlin Li
Yucheng Xu
Xiaoqing Zheng
Suoya Han
Jun Wang
Xiaobo Sun
268
1
0
03 Sep 2024
ConDense: Consistent 2D/3D Pre-training for Dense and Sparse Features from Multi-View Images
European Conference on Computer Vision (ECCV), 2024
Xiaoshuai Zhang
Zhicheng Wang
Howard Zhou
Soham Ghosh
Danushen Gnanapragasam
Varun Jampani
Hao Su
Leonidas Guibas
DD
221
7
0
30 Aug 2024
A Survey of the Self Supervised Learning Mechanisms for Vision Transformers
Asifullah Khan
A. Sohail
Mustansar Fiaz
Mehdi Hassan
Tariq Habib Afridi
...
Muhammad Zaigham Zaheer
Kamran Ali
Tangina Sultana
Ziaurrehman Tanoli
Naeem Akhter
895
11
0
30 Aug 2024
Hierarchical Visual Categories Modeling: A Joint Representation Learning and Density Estimation Framework for Out-of-Distribution Detection
IEEE International Conference on Computer Vision (ICCV), 2023
Jinglun Li
Xinyu Zhou
Pinxue Guo
Yixuan Sun
Yiwen Huang
Weifeng Ge
Wenqiang Zhang
200
5
0
28 Aug 2024
A New Era in Computational Pathology: A Survey on Foundation and Vision-Language Models
Dibaloke Chanda
Milan Aryal
Nasim Yahya Soltani
Masoud Ganji
AI4CE
VLM
385
11
0
23 Aug 2024
Symmetric masking strategy enhances the performance of Masked Image Modeling
Khanh-Binh Nguyen
Chae Jung Park
261
0
0
23 Aug 2024
Sapiens: Foundation for Human Vision Models
European Conference on Computer Vision (ECCV), 2024
Rawal Khirodkar
Timur M. Bagautdinov
Julieta Martinez
Su Zhaoen
Austin James
Peter Selednik
Stuart Anderson
Forrest Iandola
VLM
406
162
0
22 Aug 2024
Cross-Domain Foundation Model Adaptation: Pioneering Computer Vision Models for Geophysical Data Analysis
Journal of Geophysical Research (JGR), 2024
Zhixiang Guo
Xinming Wu
Luming Liang
Hanlin Sheng
Nuo Chen
Zhengfa Bi
AI4CE
233
10
0
22 Aug 2024
PooDLe: Pooled and dense self-supervised learning from naturalistic videos
International Conference on Learning Representations (ICLR), 2024
Alex N. Wang
Christopher Hoang
Yuwen Xiong
Yann LeCun
Mengye Ren
443
4
0
20 Aug 2024
Masked Image Modeling: A Survey
International Journal of Computer Vision (IJCV), 2024
Vlad Hondru
Florinel-Alin Croitoru
Shervin Minaee
Radu Tudor Ionescu
Andrii Zadaianchuk
405
17
0
13 Aug 2024
Enhancing 3D Transformer Segmentation Model for Medical Image with Token-level Representation Learning
IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2024
Xinrong Hu
Dewen Zeng
Yawen Wu
Xueyang Li
Yiyu Shi
ViT
MedIm
124
0
0
12 Aug 2024
HySparK: Hybrid Sparse Masking for Large Scale Medical Image Pre-Training
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2024
Fenghe Tang
Ronghao Xu
Qingsong Yao
Xueming Fu
Quan Quan
Heqin Zhu
Zaiyi Liu
S. Kevin Zhou
SSL
MedIm
199
9
0
11 Aug 2024
PersonViT: Large-scale Self-supervised Vision Transformer for Person Re-Identification
Machine Vision and Applications (MVA), 2024
Bin Hu
Xinggang Wang
Wenyu Liu
ViT
208
11
0
10 Aug 2024
POA: Pre-training Once for Models of All Sizes
European Conference on Computer Vision (ECCV), 2024
Yingying Zhang
Xin Guo
Jiangwei Lao
Lei Yu
Lixiang Ru
Jian Wang
Guo Ye
Huimei He
Jingdong Chen
Ming Yang
404
2
0
02 Aug 2024
Virchow2: Scaling Self-Supervised Mixed Magnification Models in Pathology
Eric Zimmermann
Eugene Vorontsov
Julian Viret
Adam Casson
Michal Zelechowski
...
Razik Yousfi
Thomas J. Fuchs
Nicolò Fusi
Siqi Liu
Kristen Severson
MedIm
288
112
0
01 Aug 2024
MMCLIP: Cross-modal Attention Masked Modelling for Medical Language-Image Pre-Training
Biao Wu
Yutong Xie
Zeyu Zhang
Minh Hieu Phan
Qi Chen
Ling-Hao Chen
Qi Wu
LM&MA
187
9
0
28 Jul 2024
MARINE: A Computer Vision Model for Detecting Rare Predator-Prey Interactions in Animal Videos
Zsófia Katona
Seyed Sahand Mohamadi Ziabari
Fatemeh Karimi Nejadasl
253
1
0
25 Jul 2024
Parameter-Efficient Fine-Tuning for Continual Learning: A Neural Tangent Kernel Perspective
Jingren Liu
Zhong Ji
YunLong Yu
Jiale Cao
Yanwei Pang
Jungong Han
Xuelong Li
CLL
322
5
0
24 Jul 2024
SINDER: Repairing the Singular Defects of DINOv2
Haoqian Wang
Tong Zhang
Mathieu Salzmann
170
12
0
23 Jul 2024
Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning
Yibing Wei
Abhinav Gupta
Pedro Morgado
SSL
158
14
0
22 Jul 2024
ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders
Carlos Hinojosa
Shuming Liu
Guohao Li
193
8
0
17 Jul 2024
Progressive Proxy Anchor Propagation for Unsupervised Semantic Segmentation
Hyun Seok Seong
WonJun Moon
Subeen Lee
Jae-Pil Heo
213
3
0
17 Jul 2024
A Closer Look at Benchmarking Self-Supervised Pre-training with Image Classification
Markus Marks
Manuel Knott
Neehar Kondapaneni
Elijah Cole
T. Defraeye
Fernando Pérez-Cruz
Pietro Perona
SSL
374
14
0
16 Jul 2024
STARS: Self-supervised Tuning for 3D Action Recognition in Skeleton Sequences
Soroush Mehraban
Mohammad Javad Rajabi
Andrea Iaboni
Babak Taati
3DPC
523
1
0
15 Jul 2024
Rethinking Image-to-Video Adaptation: An Object-centric Perspective
Rui Qian
Shuangrui Ding
Dahua Lin
OCL
197
8
0
09 Jul 2024
A Clinical Benchmark of Public Self-Supervised Pathology Foundation Models
Gabriele Campanella
Shengjia Chen
Ruchika Verma
Jennifer Zeng
A. Stock
...
Kuan-lin Huang
Ricky Kwan
Jane Houldsworth
Adam J. Schoenfeld
Chad M. Vanderbilt
AI4MH
OOD
LM&MA
209
56
0
09 Jul 2024
Learning from Memory: Non-Parametric Memory Augmented Self-Supervised Learning of Visual Features
T. Silva
Hélio Pedrini
Adín Ramírez Rivera
SSL
151
6
0
03 Jul 2024
Previous
1
2
3
4
5
...
11
12
13
Next