v1v2v3 (latest)

iBOT: Image BERT Pre-Training with Online Tokenizer

15 November 2021

Cihang Xie

Papers citing "iBOT: Image BERT Pre-Training with Online Tokenizer"

50 / 605 papers shown

Title
Benchmarking Pathology Foundation Models: Adaptation Strategies and Scenarios Jeaung Lee Jeewoo Lim Keunho Byeon Jin Tae Kwak 181 12 0 21 Oct 2024
ViMoE: An Empirical Study of Designing Vision Mixture-of-Experts Xumeng Han Longhui Wei Bushi Liu Zipeng Wang Chenhui Qiang Xin He Yingfei Sun Zhenjun Han Qi Tian MoE 397 13 0 21 Oct 2024
Upsampling DINOv2 features for unsupervised vision tasks and weakly supervised materials segmentation Ronan Docherty Antonis Vamvakeros Samuel J. Cooper 348 3 0 20 Oct 2024
Fusion from Decomposition: A Self-Supervised Approach for Image Fusion and Beyond Pengwei Liang Junjun Jiang Qing Ma Xianming Liu Jiayi Ma 184 5 0 16 Oct 2024
DRACO: A Denoising-Reconstruction Autoencoder for Cryo-EMNeural Information Processing Systems (NeurIPS), 2024 Yingjun Shen Haizhao Dai Qihe Chen Yan Zeng Jiakai Zhang Yuan Pei Jingyi Yu 234 4 0 15 Oct 2024
EchoApex: A General-Purpose Vision Foundation Model for Echocardiography A. Amadou Yanzhe Zhang Sebastien Piat Paul Klein Ingo Schmuecking Tiziano Passerini Puneet Sharma 225 11 0 14 Oct 2024
Browsing without Third-Party Cookies: What Do You See?ACM/SIGCOMM Internet Measurement Conference (IMC), 2024 Maxwell Lin Shihan Lin Helen Wu Karen Wang Xiaowei Yang BDL 462 42 0 14 Oct 2024
Locality Alignment Improves Vision-Language ModelsInternational Conference on Learning Representations (ICLR), 2024 Ian Covert Tony Sun James Zou Tatsunori Hashimoto VLM 537 11 0 14 Oct 2024
Large-Scale 3D Medical Image Pre-training with Geometric Context Priors Linshan Wu Jiaxin Zhuang Hao Chen 199 18 0 13 Oct 2024
Brain Mapping with Dense Features: Grounding Cortical Semantic Selectivity in Natural Images With Vision TransformersInternational Conference on Learning Representations (ICLR), 2024 Andrew F. Luo Jacob Yeung Rushikesh Zawar Shaurya Dewan Margaret M. Henderson Leila Wehbe Michael J. Tarr 316 12 0 07 Oct 2024
Denoising with a Joint-Embedding Predictive ArchitectureInternational Conference on Learning Representations (ICLR), 2024 Dengsheng Chen Jie Hu Xiaoming Wei Enhua Wu DiffM 453 5 0 02 Oct 2024
Local-to-Global Self-Supervised Representation Learning for Diabetic Retinopathy Grading Mostafa Hajighasemloua Samad Sheikhaei Hamid Soltanian-Zadeha 179 0 0 01 Oct 2024
Radio Foundation Models: Pre-training Transformers for 5G-based Indoor LocalizationInternational Conference on Indoor Positioning and Indoor Navigation (IPIN), 2024 Jonathan Ott Jonas Pirkl Maximilian Stahlke Tobias Feigl Christopher Mutschler 78 13 0 01 Oct 2024
Text-driven Human Motion Generation with Motion Masked Diffusion Model Xingyu Chen DiffM VGen 148 6 0 29 Sep 2024
Harnessing Frozen Unimodal Encoders for Flexible Multimodal AlignmentComputer Vision and Pattern Recognition (CVPR), 2024 Mayug Maniparambil Raiymbek Akshulakov Y. A. D. Djilali Sanath Narayan Ankit Singh Noel E. O'Connor VLM MLLM 132 0 0 28 Sep 2024
Embed and Emulate: Contrastive representations for simulation-based inference Ruoxi Jiang Peter Y. Lu Rebecca Willett 186 1 0 27 Sep 2024
MEXMA: Token-level objectives improve sentence representationsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024 Joao Maria Janeiro Benjamin Piwowarski Patrick Gallinari Loïc Barrault 110 4 0 19 Sep 2024
Frequency-Guided Masking for Enhanced Vision Self-Supervised LearningInternational Conference on Learning Representations (ICLR), 2024 Amin Karimi Monsefi Mengxi Zhou Nastaran Karimi Monsefi Ser-Nam Lim Wei-Lun Chao R. Ramnath 280 4 0 16 Sep 2024
Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image RetrievalEngineering applications of artificial intelligence (EAAI), 2024 Amirreza Mahbod Nematollah Saeidi Sepideh Hatamikia Ramona Woitek VLM MedIm 312 12 0 14 Sep 2024
Phikon-v2, A large and public feature extractor for biomarker prediction Alexandre Filiot Paul Jacob Alice Mac Kain Charlie Saillard MedIm 203 59 0 13 Sep 2024
DetailCLIP: Detail-Oriented CLIP for Fine-Grained Tasks Amin Karimi Monsefi Kishore Prakash Sailaja Ali Alilooee Ser-Nam Lim R. Ramnath VLM 345 16 0 10 Sep 2024
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene UnderstandingNeural Information Processing Systems (NeurIPS), 2024 Yunze Man Shuhong Zheng Zhipeng Bao M. Hebert Liang-Yan Gui Yu-Xiong Wang 504 31 0 05 Sep 2024
CanvOI, an Oncology Intelligence Foundation Model: Scaling FLOPS Differently Jonathan Zalach Inbal Gazy Assaf Avinoam Ron Sinai Eran Shmuel Inbar Gilboa Christine Swisher Naim Matasci Reva Basho David B. Agus 149 0 0 04 Sep 2024
Dual Advancement of Representation Learning and Clustering for Sparse and Noisy ImagesACM Multimedia (MM), 2024 Wenlin Li Yucheng Xu Xiaoqing Zheng Suoya Han Jun Wang Xiaobo Sun 268 1 0 03 Sep 2024
ConDense: Consistent 2D/3D Pre-training for Dense and Sparse Features from Multi-View ImagesEuropean Conference on Computer Vision (ECCV), 2024 Xiaoshuai Zhang Zhicheng Wang Howard Zhou Soham Ghosh Danushen Gnanapragasam Varun Jampani Hao Su Leonidas Guibas DD 221 7 0 30 Aug 2024
A Survey of the Self Supervised Learning Mechanisms for Vision Transformers Asifullah Khan A. Sohail Mustansar Fiaz Mehdi Hassan Tariq Habib Afridi ... Muhammad Zaigham Zaheer Kamran Ali Tangina Sultana Ziaurrehman Tanoli Naeem Akhter 895 11 0 30 Aug 2024
Hierarchical Visual Categories Modeling: A Joint Representation Learning and Density Estimation Framework for Out-of-Distribution DetectionIEEE International Conference on Computer Vision (ICCV), 2023 Jinglun Li Xinyu Zhou Pinxue Guo Yixuan Sun Yiwen Huang Weifeng Ge Wenqiang Zhang 200 5 0 28 Aug 2024
A New Era in Computational Pathology: A Survey on Foundation and Vision-Language Models Dibaloke Chanda Milan Aryal Nasim Yahya Soltani Masoud Ganji AI4CE VLM 385 11 0 23 Aug 2024
Symmetric masking strategy enhances the performance of Masked Image Modeling Khanh-Binh Nguyen Chae Jung Park 261 0 0 23 Aug 2024
Sapiens: Foundation for Human Vision ModelsEuropean Conference on Computer Vision (ECCV), 2024 Rawal Khirodkar Timur M. Bagautdinov Julieta Martinez Su Zhaoen Austin James Peter Selednik Stuart Anderson Forrest Iandola VLM 406 162 0 22 Aug 2024
Cross-Domain Foundation Model Adaptation: Pioneering Computer Vision Models for Geophysical Data AnalysisJournal of Geophysical Research (JGR), 2024 Zhixiang Guo Xinming Wu Luming Liang Hanlin Sheng Nuo Chen Zhengfa Bi AI4CE 233 10 0 22 Aug 2024
PooDLe: Pooled and dense self-supervised learning from naturalistic videosInternational Conference on Learning Representations (ICLR), 2024 Alex N. Wang Christopher Hoang Yuwen Xiong Yann LeCun Mengye Ren 443 4 0 20 Aug 2024
Masked Image Modeling: A SurveyInternational Journal of Computer Vision (IJCV), 2024 Vlad Hondru Florinel-Alin Croitoru Shervin Minaee Radu Tudor Ionescu Andrii Zadaianchuk 405 17 0 13 Aug 2024
Enhancing 3D Transformer Segmentation Model for Medical Image with Token-level Representation LearningIEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2024 Xinrong Hu Dewen Zeng Yawen Wu Xueyang Li Yiyu Shi ViT MedIm 124 0 0 12 Aug 2024
HySparK: Hybrid Sparse Masking for Large Scale Medical Image Pre-TrainingInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2024 Fenghe Tang Ronghao Xu Qingsong Yao Xueming Fu Quan Quan Heqin Zhu Zaiyi Liu S. Kevin Zhou SSL MedIm 199 9 0 11 Aug 2024
PersonViT: Large-scale Self-supervised Vision Transformer for Person Re-IdentificationMachine Vision and Applications (MVA), 2024 Bin Hu Xinggang Wang Wenyu Liu ViT 208 11 0 10 Aug 2024
POA: Pre-training Once for Models of All SizesEuropean Conference on Computer Vision (ECCV), 2024 Yingying Zhang Xin Guo Jiangwei Lao Lei Yu Lixiang Ru Jian Wang Guo Ye Huimei He Jingdong Chen Ming Yang 404 2 0 02 Aug 2024
Virchow2: Scaling Self-Supervised Mixed Magnification Models in Pathology Eric Zimmermann Eugene Vorontsov Julian Viret Adam Casson Michal Zelechowski ... Razik Yousfi Thomas J. Fuchs Nicolò Fusi Siqi Liu Kristen Severson MedIm 288 112 0 01 Aug 2024
MMCLIP: Cross-modal Attention Masked Modelling for Medical Language-Image Pre-Training Biao Wu Yutong Xie Zeyu Zhang Minh Hieu Phan Qi Chen Ling-Hao Chen Qi Wu LM&MA 187 9 0 28 Jul 2024
MARINE: A Computer Vision Model for Detecting Rare Predator-Prey Interactions in Animal Videos Zsófia Katona Seyed Sahand Mohamadi Ziabari Fatemeh Karimi Nejadasl 253 1 0 25 Jul 2024
Parameter-Efficient Fine-Tuning for Continual Learning: A Neural Tangent Kernel Perspective Jingren Liu Zhong Ji YunLong Yu Jiale Cao Yanwei Pang Jungong Han Xuelong Li CLL 322 5 0 24 Jul 2024
SINDER: Repairing the Singular Defects of DINOv2 Haoqian Wang Tong Zhang Mathieu Salzmann 170 12 0 23 Jul 2024
Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning Yibing Wei Abhinav Gupta Pedro Morgado SSL 158 14 0 22 Jul 2024
ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders Carlos Hinojosa Shuming Liu Guohao Li 193 8 0 17 Jul 2024
Progressive Proxy Anchor Propagation for Unsupervised Semantic Segmentation Hyun Seok Seong WonJun Moon Subeen Lee Jae-Pil Heo 213 3 0 17 Jul 2024
A Closer Look at Benchmarking Self-Supervised Pre-training with Image Classification Markus Marks Manuel Knott Neehar Kondapaneni Elijah Cole T. Defraeye Fernando Pérez-Cruz Pietro Perona SSL 374 14 0 16 Jul 2024
STARS: Self-supervised Tuning for 3D Action Recognition in Skeleton Sequences Soroush Mehraban Mohammad Javad Rajabi Andrea Iaboni Babak Taati 3DPC 523 1 0 15 Jul 2024
Rethinking Image-to-Video Adaptation: An Object-centric Perspective Rui Qian Shuangrui Ding Dahua Lin OCL 197 8 0 09 Jul 2024
A Clinical Benchmark of Public Self-Supervised Pathology Foundation Models Gabriele Campanella Shengjia Chen Ruchika Verma Jennifer Zeng A. Stock ... Kuan-lin Huang Ricky Kwan Jane Houldsworth Adam J. Schoenfeld Chad M. Vanderbilt AI4MH OOD LM&MA 209 56 0 09 Jul 2024
Learning from Memory: Non-Parametric Memory Augmented Self-Supervised Learning of Visual Features T. Silva Hélio Pedrini Adín Ramírez Rivera SSL 151 6 0 03 Jul 2024