Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2111.07832
Cited By
v1
v2
v3 (latest)
iBOT: Image BERT Pre-Training with Online Tokenizer
15 November 2021
Jinghao Zhou
Chen Wei
Huiyu Wang
Wei Shen
Cihang Xie
Alan Yuille
Tao Kong
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"iBOT: Image BERT Pre-Training with Online Tokenizer"
50 / 605 papers shown
Title
MAESTRO: Masked AutoEncoders for Multimodal, Multitemporal, and Multispectral Earth Observation Data
Antoine Labatie
Michael Vaccaro
Nina Lardiere
A. Garioud
Nicolas Gonthier
220
0
0
14 Aug 2025
Towards Comprehensive Cellular Characterisation of H&E slides
Benjamin Adjadj
Pierre-Antoine Bannier
Guillaume Horent
Sebastien Mandela
Aurore Lyon
...
Reda Belbahri
Benoît Schmauch
Eric Durand
Katharina Von Loga
Lucie Gillet
VLM
98
1
0
13 Aug 2025
Benchmarking Foundation Models for Mitotic Figure Classification
Jonas Ammeling
J. Ganz
Emely Rosbach
Ludwig Lausser
C. Bertram
Katharina Breininger
Marc Aubreville
OOD
116
1
0
06 Aug 2025
CoMAD: A Multiple-Teacher Self-Supervised Distillation Framework
Sriram Mandalika
Lalitha V
MoE
VLM
118
0
0
06 Aug 2025
GECO: Geometrically Consistent Embedding with Lightspeed Inference
Regine Hartwig
Dominik Muhle
Riccardo Marin
Daniel Cremers
96
0
0
01 Aug 2025
Temporally Consistent Unsupervised Segmentation for Mobile Robot Perception
Christian Ellis
Maggie B. Wigness
Craig T. Lennon
L. Fiondella
VOS
168
0
0
29 Jul 2025
Self-Guided Masked Autoencoder
Neural Information Processing Systems (NeurIPS), 2025
Jeongwoo Shin
Inseo Lee
Junho Lee
Joonseok Lee
SSL
129
9
0
26 Jul 2025
A High Magnifications Histopathology Image Dataset for Oral Squamous Cell Carcinoma Diagnosis and Prognosis
Jinquan Guan
Junhong Guo
Qi Chen
Jian Chen
Y. Cai
Yilin He
Z. Huang
Yan Wang
Yutong Xie
131
0
0
22 Jul 2025
Latent Denoising Makes Good Visual Tokenizers
Jiawei Yang
Tianhong Li
Lijie Fan
Yonglong Tian
Yue Wang
153
13
0
21 Jul 2025
Improving Joint Embedding Predictive Architecture with Diffusion Noise
Yuping Qiu
Rui Zhu
Ying-cong Chen
DiffM
159
0
0
21 Jul 2025
Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning
Shashanka Venkataramanan
Valentinos Pariza
Mohammadreza Salehi
Lukas Knobel
Spyros Gidaris
Elias Ramzi
Andrei Bursuc
Yuki M. Asano
195
7
0
18 Jul 2025
Object Retrieval for Visual Question Answering with Outside Knowledge
Shichao Kan
Yuhai Deng
Yixiong Liang
Lihui Cen
Zhe Qu
Linna Zhang
Zhihai He
Yigang Cen
223
0
0
01 Jul 2025
LW2G: Learning Whether to Grow for Prompt-based Continual Learning
Qian Feng
Dawei Zhou
Hanbin Zhao
Chao Zhang
Jiahua Dong
Dengxin Dai
Hui Qian
VLM
CLL
292
8
0
01 Jul 2025
DIP: Unsupervised Dense In-Context Post-training of Visual Representations
Sophia Sirko-Galouchenko
Spyros Gidaris
Antonín Vobecký
Andrei Bursuc
Nicolas Thome
279
1
0
23 Jun 2025
Discrete JEPA: Learning Discrete Token Representations without Reconstruction
Junyeob Baek
Hosung Lee
Christopher Hoang
Mengye Ren
Sungjin Ahn
195
0
0
17 Jun 2025
Self-supervised Representation Learning with Local Aggregation for Image-based Profiling
Siran Dai
Qianqian Xu
Peisong Wen
Yang Liu
Qingming Huang
263
2
0
17 Jun 2025
MRI-CORE: A Foundation Model for Magnetic Resonance Imaging
Haoyu Dong
Yuwen Chen
H. Gu
Nicholas Konz
Yaqian Chen
Qihang Li
Maciej A. Mazurowski
MedIm
VLM
198
6
0
13 Jun 2025
SNR and Resource Adaptive Deep JSCC for Distributed IoT Image Classification
Ali Waqas
Sinem Coleri
217
0
0
12 Jun 2025
Attention, Please! Revisiting Attentive Probing Through the Lens of Efficiency
Bill Psomas
Dionysis Christopoulos
Eirini Baltzi
Ioannis Kakogeorgiou
Tilemachos Aravanis
N. Komodakis
Konstantinos Karantzalos
Yannis Avrithis
Giorgos Tolias
265
1
0
11 Jun 2025
Foundation Models in Medical Imaging: A Review and Outlook
Vivien van Veldhuizen
Vanessa Botha
C. Lu
Melis Erdal Cesur
Kevin Groot Lipman
...
Cees Snoek
Lodewyk Wessels
Ritse Mann
Eric Marcus
Jonas Teuwen
MedIm
VLM
AI4CE
384
2
0
10 Jun 2025
Multiple Object Stitching for Unsupervised Representation Learning
Pattern Recognition (Pattern Recogn.), 2025
Chengchao Shen
Dawei Liu
Jianxin Wang
OCL
SSL
214
0
0
09 Jun 2025
When Better Features Mean Greater Risks: The Performance-Privacy Trade-Off in Contrastive Learning
ACM Asia Conference on Computer and Communications Security (AsiaCCS), 2025
Ruining Sun
Hongsheng Hu
Wei Luo
Zhaoxi Zhang
Yanjun Zhang
Haizhuan Yuan
Leo Yu Zhang
MIACV
AAML
303
1
0
06 Jun 2025
GP-MoLFormer-Sim: Test Time Molecular Optimization through Contextual Similarity Guidance
Jirí Navrátil
Jarret Ross
Payel Das
Youssef Mroueh
Samuel C. Hoffman
Vijil Chenthamarakshan
Brian M. Belgodere
183
0
0
05 Jun 2025
Object-level Self-Distillation for Vision Pretraining
Çağlar Hızlı
Çağatay Yıldız
Pekka Marttinen
OCL
VLM
277
0
0
04 Jun 2025
Random Registers for Cross-Domain Few-Shot Learning
Shuai Yi
Yixiong Zou
Yuhua Li
Ruixuan Li
205
0
0
03 Jun 2025
Vision Transformers with Self-Distilled Registers
Yinjie Chen
Zipeng Yan
Chong Zhou
Bo Dai
Andrew F. Luo
398
4
0
27 May 2025
A Contrastive Learning Foundation Model Based on Perfectly Aligned Sample Pairs for Remote Sensing Images
Hengtong Shen
Haiyan Gu
Haitao Li
Yi Yang
Agen qiu
SSL
326
0
0
26 May 2025
The Missing Point in Vision Transformers for Universal Image Segmentation
Sajjad Shahabodini
Mobina Mansoori
Farnoush Bayatmakou
J. Abouei
Konstantinos N. Plataniotis
Arash Mohammadi
ViT
ISeg
250
0
0
26 May 2025
C3R: Channel Conditioned Cell Representations for unified evaluation in microscopy imaging
Umar Marikkar
Syed Sameed Husain
Muhammad Awais
Sara Atito
187
0
0
24 May 2025
Self-Organizing Visual Prototypes for Non-Parametric Representation Learning
T. Silva
Hélio Pedrini
Adín Ramirez Rivera
165
1
0
23 May 2025
Semantic Correspondence: Unified Benchmarking and a Strong Baseline
Kaiyan Zhang
Xinghui Li
Jingyi Lu
Kai Han
3DV
363
3
0
23 May 2025
Octic Vision Transformers: Quicker ViTs Through Equivariance
David Nordström
Johan Edstedt
Fredrik Kahl
Georg Bökman
ViT
476
0
0
21 May 2025
Ditch the Denoiser: Emergence of Noise Robustness in Self-Supervised Learning from Data Curriculum
Wenquan Lu
Jiaqi Zhang
Hugues Van Assel
Randall Balestriero
198
1
0
18 May 2025
DDAE++: Enhancing Diffusion Models Towards Unified Generative and Discriminative Learning
Weilai Xiang
Hongyu Yang
Di Huang
Yunhong Wang
359
3
0
16 May 2025
Register and [CLS] tokens yield a decoupling of local and global features in large ViTs
Alexander Lappe
M. Giese
266
2
0
09 May 2025
DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception
Computer Vision and Pattern Recognition (CVPR), 2025
Junjie Wang
Bin Chen
Yulin Li
Bin Kang
Yulin Chen
Zhuotao Tian
VLM
273
5
0
07 May 2025
No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves
Dengyang Jiang
Mengmeng Wang
Liuzhuozheng Li
Lei Zhang
Haoyu Wang
Wei Wei
Guang Dai
Yanning Zhang
Jingdong Wang
DiffM
462
13
0
05 May 2025
Self-Supervision Enhances Instance-based Multiple Instance Learning Methods in Digital Pathology: A Benchmark Study
Journal of Medical Imaging (JMI), 2025
Ali Mammadov
Loic Le Folgoc
Julien Adam
Anne Buronfosse
Gilles Hayem
Guillaume Hocquet
Pietro Gori
SSL
200
4
0
02 May 2025
SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models
Computer Vision and Pattern Recognition (CVPR), 2025
Wufei Ma
Luoxin Ye
Nessa McWeeney
Celso M de Melo
Jieneng Chen
LRM
409
21
0
01 May 2025
Boosting Generative Image Modeling via Joint Image-Feature Synthesis
Theodoros Kouzelis
Efstathios Karypidis
Ioannis Kakogeorgiou
Spyros Gidaris
N. Komodakis
DiffM
245
14
0
22 Apr 2025
CytoFM: The first cytology foundation model
Vedrana Ivezić
Ashwath Radhachandran
Ekaterina Redekop
Shreeram S. Athreya
Dongwoo Lee
Vivek Sant
Corey W. Arnold
W. Speier
281
0
0
18 Apr 2025
Can Masked Autoencoders Also Listen to Birds?
Lukas Rauch
Ilyass Moummad
René Heinrich
Alexis Joly
Bernhard Sick
Christoph Scholz
461
8
0
17 Apr 2025
EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance
Computer Vision and Pattern Recognition (CVPR), 2025
Yang Yue
Yulin Wang
Haojun Jiang
Pan Liu
Qing Xiao
Gao Huang
VGen
314
6
0
17 Apr 2025
Prototype-Guided Diffusion for Digital Pathology: Achieving Foundation Model Performance with Minimal Clinical Data
Ekaterina Redekop
Mara Pleasure
Vedrana Ivezić
Zichen Wang
Kimberly Flores
Anthony Sisk
W. Speier
C. Arnold
MedIm
193
2
0
15 Apr 2025
Evolved Hierarchical Masking for Self-Supervised Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Zhanzhou Feng
Shiliang Zhang
299
1
0
12 Apr 2025
Boosting multi-demographic federated learning for chest radiograph analysis using general-purpose self-supervised representations
Mahshad Lotfinia
Arash Tayebiarasteh
Samaneh Samiei
Mehdi Joodaki
Soroosh Tayebi Arasteh
277
0
0
11 Apr 2025
Evolutionary Machine Learning meets Self-Supervised Learning: a comprehensive survey
Adriano Vinhas
João Correia
Penousal Machado
SSL
SyDa
392
0
0
09 Apr 2025
Masked Scene Modeling: Narrowing the Gap Between Supervised and Self-Supervised Learning in 3D Scene Understanding
Computer Vision and Pattern Recognition (CVPR), 2025
Pedro Hermosilla
Christian Stippel
Leon Sick
SSL
3DPC
360
0
0
09 Apr 2025
Falcon: Fractional Alternating Cut with Overcoming Minima in Unsupervised Segmentation
Xiao Zhang
Xiangyu Han
Xiwen Lai
Yao Sun
Pei Zhang
Konrad Kording
233
0
0
08 Apr 2025
Training state-of-the-art pathology foundation models with orders of magnitude less data
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Mikhail Karasikov
J. Doorn
Nicolas Kanzig
Melis Erdal Cesur
Hugo Mark Horlings
Robert Berke
Fei Tang
Sebastian Otálora
AI4CE
125
2
0
07 Apr 2025
Previous
1
2
3
4
5
...
11
12
13
Next