iBOT: Image BERT Pre-Training with Online Tokenizer

15 November 2021

Cihang Xie

Papers citing "iBOT: Image BERT Pre-Training with Online Tokenizer"

50 / 605 papers shown

Title
MAESTRO: Masked AutoEncoders for Multimodal, Multitemporal, and Multispectral Earth Observation Data Antoine Labatie Michael Vaccaro Nina Lardiere A. Garioud Nicolas Gonthier 220 0 0 14 Aug 2025
Towards Comprehensive Cellular Characterisation of H&E slides Benjamin Adjadj Pierre-Antoine Bannier Guillaume Horent Sebastien Mandela Aurore Lyon ... Reda Belbahri Benoît Schmauch Eric Durand Katharina Von Loga Lucie Gillet VLM 98 1 0 13 Aug 2025
Benchmarking Foundation Models for Mitotic Figure Classification Jonas Ammeling J. Ganz Emely Rosbach Ludwig Lausser C. Bertram Katharina Breininger Marc Aubreville OOD 116 1 0 06 Aug 2025
CoMAD: A Multiple-Teacher Self-Supervised Distillation Framework Sriram Mandalika Lalitha V MoE VLM 118 0 0 06 Aug 2025
GECO: Geometrically Consistent Embedding with Lightspeed Inference Regine Hartwig Dominik Muhle Riccardo Marin Daniel Cremers 96 0 0 01 Aug 2025
Temporally Consistent Unsupervised Segmentation for Mobile Robot Perception Christian Ellis Maggie B. Wigness Craig T. Lennon L. Fiondella VOS 168 0 0 29 Jul 2025
Self-Guided Masked Autoencoder Neural Information Processing Systems (NeurIPS), 2025 Jeongwoo Shin Inseo Lee Junho Lee Joonseok Lee SSL 129 9 0 26 Jul 2025
A High Magnifications Histopathology Image Dataset for Oral Squamous Cell Carcinoma Diagnosis and Prognosis Jinquan Guan Junhong Guo Qi Chen Jian Chen Y. Cai Yilin He Z. Huang Yan Wang Yutong Xie 131 0 0 22 Jul 2025
Latent Denoising Makes Good Visual Tokenizers Jiawei Yang Tianhong Li Lijie Fan Yonglong Tian Yue Wang 153 13 0 21 Jul 2025
Improving Joint Embedding Predictive Architecture with Diffusion Noise Yuping Qiu Rui Zhu Ying-cong Chen DiffM 159 0 0 21 Jul 2025
Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning Shashanka Venkataramanan Valentinos Pariza Mohammadreza Salehi Lukas Knobel Spyros Gidaris Elias Ramzi Andrei Bursuc Yuki M. Asano 195 7 0 18 Jul 2025
Object Retrieval for Visual Question Answering with Outside Knowledge Shichao Kan Yuhai Deng Yixiong Liang Lihui Cen Zhe Qu Linna Zhang Zhihai He Yigang Cen 223 0 0 01 Jul 2025
LW2G: Learning Whether to Grow for Prompt-based Continual Learning Qian Feng Dawei Zhou Hanbin Zhao Chao Zhang Jiahua Dong Dengxin Dai Hui Qian VLM CLL 292 8 0 01 Jul 2025
DIP: Unsupervised Dense In-Context Post-training of Visual Representations Sophia Sirko-Galouchenko Spyros Gidaris Antonín Vobecký Andrei Bursuc Nicolas Thome 279 1 0 23 Jun 2025
Discrete JEPA: Learning Discrete Token Representations without Reconstruction Junyeob Baek Hosung Lee Christopher Hoang Mengye Ren Sungjin Ahn 195 0 0 17 Jun 2025
Self-supervised Representation Learning with Local Aggregation for Image-based Profiling Siran Dai Qianqian Xu Peisong Wen Yang Liu Qingming Huang 263 2 0 17 Jun 2025
MRI-CORE: A Foundation Model for Magnetic Resonance Imaging Haoyu Dong Yuwen Chen H. Gu Nicholas Konz Yaqian Chen Qihang Li Maciej A. Mazurowski MedIm VLM 198 6 0 13 Jun 2025
SNR and Resource Adaptive Deep JSCC for Distributed IoT Image Classification Ali Waqas Sinem Coleri 217 0 0 12 Jun 2025
Attention, Please! Revisiting Attentive Probing Through the Lens of Efficiency Bill Psomas Dionysis Christopoulos Eirini Baltzi Ioannis Kakogeorgiou Tilemachos Aravanis N. Komodakis Konstantinos Karantzalos Yannis Avrithis Giorgos Tolias 265 1 0 11 Jun 2025
Foundation Models in Medical Imaging: A Review and Outlook Vivien van Veldhuizen Vanessa Botha C. Lu Melis Erdal Cesur Kevin Groot Lipman ... Cees Snoek Lodewyk Wessels Ritse Mann Eric Marcus Jonas Teuwen MedIm VLM AI4CE 384 2 0 10 Jun 2025
Multiple Object Stitching for Unsupervised Representation Learning Pattern Recognition (Pattern Recogn.), 2025 Chengchao Shen Dawei Liu Jianxin Wang OCL SSL 214 0 0 09 Jun 2025
When Better Features Mean Greater Risks: The Performance-Privacy Trade-Off in Contrastive Learning ACM Asia Conference on Computer and Communications Security (AsiaCCS), 2025 Ruining Sun Hongsheng Hu Wei Luo Zhaoxi Zhang Yanjun Zhang Haizhuan Yuan Leo Yu Zhang MIACV AAML 303 1 0 06 Jun 2025
GP-MoLFormer-Sim: Test Time Molecular Optimization through Contextual Similarity Guidance Jirí Navrátil Jarret Ross Payel Das Youssef Mroueh Samuel C. Hoffman Vijil Chenthamarakshan Brian M. Belgodere 183 0 0 05 Jun 2025
Object-level Self-Distillation for Vision Pretraining Çağlar Hızlı Çağatay Yıldız Pekka Marttinen OCL VLM 277 0 0 04 Jun 2025
Random Registers for Cross-Domain Few-Shot Learning Shuai Yi Yixiong Zou Yuhua Li Ruixuan Li 205 0 0 03 Jun 2025
Vision Transformers with Self-Distilled Registers Yinjie Chen Zipeng Yan Chong Zhou Bo Dai Andrew F. Luo 398 4 0 27 May 2025
A Contrastive Learning Foundation Model Based on Perfectly Aligned Sample Pairs for Remote Sensing Images Hengtong Shen Haiyan Gu Haitao Li Yi Yang Agen Qiu SSL 326 0 0 26 May 2025
The Missing Point in Vision Transformers for Universal Image Segmentation Sajjad Shahabodini Mobina Mansoori Farnoush Bayatmakou J. Abouei Konstantinos N. Plataniotis Arash Mohammadi ViT ISeg 250 0 0 26 May 2025
C3R: Channel Conditioned Cell Representations for unified evaluation in microscopy imaging Umar Marikkar Syed Sameed Husain Muhammad Awais Sara Atito 187 0 0 24 May 2025
Self-Organizing Visual Prototypes for Non-Parametric Representation Learning T. Silva Hélio Pedrini Adín Ramirez Rivera 165 1 0 23 May 2025
Semantic Correspondence: Unified Benchmarking and a Strong Baseline Kaiyan Zhang Xinghui Li Jingyi Lu Kai Han 3DV 363 3 0 23 May 2025
Octic Vision Transformers: Quicker ViTs Through Equivariance David Nordström Johan Edstedt Fredrik Kahl Georg Bökman ViT 476 0 0 21 May 2025
Ditch the Denoiser: Emergence of Noise Robustness in Self-Supervised Learning from Data Curriculum Wenquan Lu Jiaqi Zhang Hugues Van Assel Randall Balestriero 198 1 0 18 May 2025
DDAE++: Enhancing Diffusion Models Towards Unified Generative and Discriminative Learning Weilai Xiang Hongyu Yang Di Huang Yunhong Wang 359 3 0 16 May 2025
Register and [CLS] tokens yield a decoupling of local and global features in large ViTs Alexander Lappe M. Giese 266 2 0 09 May 2025
DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception Computer Vision and Pattern Recognition (CVPR), 2025 Junjie Wang Bin Chen Yulin Li Bin Kang Yulin Chen Zhuotao Tian VLM 273 5 0 07 May 2025
No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves Dengyang Jiang Mengmeng Wang Liuzhuozheng Li Lei Zhang Haoyu Wang Wei Wei Guang Dai Yanning Zhang Jingdong Wang DiffM 462 13 0 05 May 2025
Self-Supervision Enhances Instance-based Multiple Instance Learning Methods in Digital Pathology: A Benchmark Study Journal of Medical Imaging (JMI), 2025 Ali Mammadov Loic Le Folgoc Julien Adam Anne Buronfosse Gilles Hayem Guillaume Hocquet Pietro Gori SSL 200 4 0 02 May 2025
SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models Computer Vision and Pattern Recognition (CVPR), 2025 Wufei Ma Luoxin Ye Nessa McWeeney Celso M de Melo Jieneng Chen LRM 409 21 0 01 May 2025
Boosting Generative Image Modeling via Joint Image-Feature Synthesis Theodoros Kouzelis Efstathios Karypidis Ioannis Kakogeorgiou Spyros Gidaris N. Komodakis DiffM 245 14 0 22 Apr 2025
CytoFM: The first cytology foundation model Vedrana Ivezić Ashwath Radhachandran Ekaterina Redekop Shreeram S. Athreya Dongwoo Lee Vivek Sant Corey W. Arnold W. Speier 281 0 0 18 Apr 2025
Can Masked Autoencoders Also Listen to Birds? Lukas Rauch Ilyass Moummad René Heinrich Alexis Joly Bernhard Sick Christoph Scholz 461 8 0 17 Apr 2025
EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance Computer Vision and Pattern Recognition (CVPR), 2025 Yang Yue Yulin Wang Haojun Jiang Pan Liu Qing Xiao Gao Huang VGen 314 6 0 17 Apr 2025
Prototype-Guided Diffusion for Digital Pathology: Achieving Foundation Model Performance with Minimal Clinical Data Ekaterina Redekop Mara Pleasure Vedrana Ivezić Zichen Wang Kimberly Flores Anthony Sisk W. Speier C. Arnold MedIm 193 2 0 15 Apr 2025
Evolved Hierarchical Masking for Self-Supervised Learning IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024 Zhanzhou Feng Shiliang Zhang 299 1 0 12 Apr 2025
Boosting multi-demographic federated learning for chest radiograph analysis using general-purpose self-supervised representations Mahshad Lotfinia Arash Tayebiarasteh Samaneh Samiei Mehdi Joodaki Soroosh Tayebi Arasteh 277 0 0 11 Apr 2025
Evolutionary Machine Learning meets Self-Supervised Learning: a comprehensive survey Adriano Vinhas João Correia Penousal Machado SSL SyDa 392 0 0 09 Apr 2025
Masked Scene Modeling: Narrowing the Gap Between Supervised and Self-Supervised Learning in 3D Scene Understanding Computer Vision and Pattern Recognition (CVPR), 2025 Pedro Hermosilla Christian Stippel Leon Sick SSL 3DPC 360 0 0 09 Apr 2025
Falcon: Fractional Alternating Cut with Overcoming Minima in Unsupervised Segmentation Xiao Zhang Xiangyu Han Xiwen Lai Yao Sun Pei Zhang Konrad Kording 233 0 0 08 Apr 2025
Training state-of-the-art pathology foundation models with orders of magnitude less data International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025 Mikhail Karasikov J. Doorn Nicolas Kanzig Melis Erdal Cesur Hugo Mark Horlings Robert Berke Fei Tang Sebastian Otálora AI4CE 125 2 0 07 Apr 2025