ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.09886
  4. Cited By
SimMIM: A Simple Framework for Masked Image Modeling

SimMIM: A Simple Framework for Masked Image Modeling

18 November 2021
Zhenda Xie
Zheng-Wei Zhang
Yue Cao
Yutong Lin
Jianmin Bao
Zhuliang Yao
Qi Dai
Han Hu
ArXivPDFHTML

Papers citing "SimMIM: A Simple Framework for Masked Image Modeling"

50 / 173 papers shown
Title
Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced
  Pre-training
Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training
Hyesong Choi
Hyejin Park
Kwang Moo Yi
Sungmin Cha
Dongbo Min
29
9
0
12 Apr 2024
Guided Masked Self-Distillation Modeling for Distributed Multimedia
  Sensor Event Analysis
Guided Masked Self-Distillation Modeling for Distributed Multimedia Sensor Event Analysis
Masahiro Yasuda
Noboru Harada
Yasunori Ohishi
Shoichiro Saito
Akira Nakayama
Nobutaka Ono
29
3
0
12 Apr 2024
NeuroNet: A Novel Hybrid Self-Supervised Learning Framework for Sleep
  Stage Classification Using Single-Channel EEG
NeuroNet: A Novel Hybrid Self-Supervised Learning Framework for Sleep Stage Classification Using Single-Channel EEG
Cheol-Hui Lee
Hakseung Kim
Hyun-jee Han
Min-Kyung Jung
Byung C. Yoon
Dong-Joo Kim
27
5
0
10 Apr 2024
Social-MAE: Social Masked Autoencoder for Multi-person Motion
  Representation Learning
Social-MAE: Social Masked Autoencoder for Multi-person Motion Representation Learning
Mahsa Ehsanpour
Ian Reid
Hamid Rezatofighi
ViT
27
0
0
08 Apr 2024
Dissecting Query-Key Interaction in Vision Transformers
Dissecting Query-Key Interaction in Vision Transformers
Xu Pan
Aaron Philip
Ziqian Xie
Odelia Schwartz
25
1
0
04 Apr 2024
Explore In-Context Segmentation via Latent Diffusion Models
Explore In-Context Segmentation via Latent Diffusion Models
Chaoyang Wang
Xiangtai Li
Henghui Ding
Lu Qi
Jiangning Zhang
Yunhai Tong
Chen Change Loy
Shuicheng Yan
DiffM
63
6
0
14 Mar 2024
CAMSIC: Content-aware Masked Image Modeling Transformer for Stereo Image Compression
CAMSIC: Content-aware Masked Image Modeling Transformer for Stereo Image Compression
Xinjie Zhang
Shenyuan Gao
Zhening Liu
Jiawei Shao
Xingtong Ge
Dailan He
Tongda Xu
Yan Wang
Jun Zhang
31
1
0
13 Mar 2024
Enhancing Vision-Language Pre-training with Rich Supervisions
Enhancing Vision-Language Pre-training with Rich Supervisions
Yuan Gao
Kunyu Shi
Pengkai Zhu
Edouard Belval
Oren Nuriel
Srikar Appalaraju
Shabnam Ghadar
Vijay Mahadevan
Zhuowen Tu
Stefano Soatto
VLM
CLIP
62
12
0
05 Mar 2024
Self-Supervised Learning with Generative Adversarial Networks for
  Electron Microscopy
Self-Supervised Learning with Generative Adversarial Networks for Electron Microscopy
Bashir Kazimi
Karina Ruzaeva
Stefan Sandfeld
30
4
0
28 Feb 2024
Overcoming Dimensional Collapse in Self-supervised Contrastive Learning
  for Medical Image Segmentation
Overcoming Dimensional Collapse in Self-supervised Contrastive Learning for Medical Image Segmentation
Jamshid Hassanpour
V. Srivastav
Didier Mutter
N. Padoy
SSL
31
2
0
22 Feb 2024
Learning Low-Rank Feature for Thorax Disease Classification
Learning Low-Rank Feature for Thorax Disease Classification
Rajeev Goel
Utkarsh Nath
Yancheng Wang
Alvin C. Silva
Teresa Wu
Yingzhen Yang
8
0
0
14 Feb 2024
Masked LoGoNet: Fast and Accurate 3D Image Analysis for Medical Domain
Masked LoGoNet: Fast and Accurate 3D Image Analysis for Medical Domain
Amin Karimi Monsefi
Payam Karisani
Mengxi Zhou
Stacey S. Choi
Nathan Doble
Heng Ji
Srinivasan Parthasarathy
R. Ramnath
41
5
0
09 Feb 2024
Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey
Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey
Yi Xin
Jianjiang Yang
Haodi Zhou
Junlong Du
Junlong Du
Yue Fan
Qing Li
Qing Li
Yuntao Du
VLM
59
74
0
03 Feb 2024
Deconstructing Denoising Diffusion Models for Self-Supervised Learning
Deconstructing Denoising Diffusion Models for Self-Supervised Learning
Xinlei Chen
Zhuang Liu
Saining Xie
Kaiming He
DiffM
25
52
0
25 Jan 2024
Rethinking Patch Dependence for Masked Autoencoders
Rethinking Patch Dependence for Masked Autoencoders
Letian Fu
Long Lian
Renhao Wang
Baifeng Shi
Xudong Wang
Adam Yala
Trevor Darrell
Alexei A. Efros
Ken Goldberg
26
14
0
25 Jan 2024
LDReg: Local Dimensionality Regularized Self-Supervised Learning
LDReg: Local Dimensionality Regularized Self-Supervised Learning
Hanxun Huang
R. Campello
S. Erfani
Xingjun Ma
Michael E. Houle
James Bailey
28
5
0
19 Jan 2024
Morphing Tokens Draw Strong Masked Image Models
Morphing Tokens Draw Strong Masked Image Models
Taekyung Kim
Byeongho Heo
Dongyoon Han
44
3
0
30 Dec 2023
How to Efficiently Annotate Images for Best-Performing Deep Learning Based Segmentation Models: An Empirical Study with Weak and Noisy Annotations and Segment Anything Model
How to Efficiently Annotate Images for Best-Performing Deep Learning Based Segmentation Models: An Empirical Study with Weak and Noisy Annotations and Segment Anything Model
Yixin Zhang
Shen Zhao
Han Gu
Maciej Mazurowski
VLM
30
4
0
17 Dec 2023
4M: Massively Multimodal Masked Modeling
4M: Massively Multimodal Masked Modeling
David Mizrahi
Roman Bachmann
Ouguzhan Fatih Kar
Teresa Yeo
Mingfei Gao
Afshin Dehghan
Amir Zamir
MLLM
34
62
0
11 Dec 2023
MIMIR: Masked Image Modeling for Mutual Information-based Adversarial Robustness
MIMIR: Masked Image Modeling for Mutual Information-based Adversarial Robustness
Xiaoyun Xu
Shujian Yu
Jingzheng Wu
S. Picek
AAML
33
0
0
08 Dec 2023
A brief introduction to a framework named Multilevel
  Guidance-Exploration Network
A brief introduction to a framework named Multilevel Guidance-Exploration Network
Guoqing Yang
Zhiming Luo
Jianzhe Gao
Yingxin Lai
Kun Yang
Yifan He
Shaozi Li
3DH
19
0
0
07 Dec 2023
Unsupervised Video Domain Adaptation with Masked Pre-Training and Collaborative Self-Training
Unsupervised Video Domain Adaptation with Masked Pre-Training and Collaborative Self-Training
Arun V. Reddy
William Paul
Corban Rivera
Ketul Shah
Celso M. de Melo
Rama Chellappa
32
4
0
05 Dec 2023
Learning Anatomically Consistent Embedding for Chest Radiography
Learning Anatomically Consistent Embedding for Chest Radiography
Ziyu Zhou
Haozhe Luo
Jiaxuan Pang
Xiaowei Ding
Michael B. Gotway
Jianming Liang
SSL
9
5
0
01 Dec 2023
Predicting Gradient is Better: Exploring Self-Supervised Learning for
  SAR ATR with a Joint-Embedding Predictive Architecture
Predicting Gradient is Better: Exploring Self-Supervised Learning for SAR ATR with a Joint-Embedding Predictive Architecture
Wei-Jang Li
Yang Wei
Tianpeng Liu
Yuenan Hou
Yuxuan Li
Zhen Liu
Yongxiang Liu
Li Liu
16
17
0
26 Nov 2023
SegVol: Universal and Interactive Volumetric Medical Image Segmentation
SegVol: Universal and Interactive Volumetric Medical Image Segmentation
Yuxin Du
Fan Bai
Tiejun Huang
Bo-Lu Zhao
VLM
28
37
0
22 Nov 2023
Event Camera Data Dense Pre-training
Event Camera Data Dense Pre-training
Yan Yang
Liyuan Pan
Liu Liu
25
4
0
20 Nov 2023
Florence-2: Advancing a Unified Representation for a Variety of Vision
  Tasks
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Bin Xiao
Haiping Wu
Weijian Xu
Xiyang Dai
Houdong Hu
Yumao Lu
Michael Zeng
Ce Liu
Lu Yuan
VLM
31
140
0
10 Nov 2023
Foundation Ark: Accruing and Reusing Knowledge for Superior and Robust
  Performance
Foundation Ark: Accruing and Reusing Knowledge for Superior and Robust Performance
Dongao Ma
Jiaxuan Pang
Michael B. Gotway
Jianming Liang
MedIm
OOD
12
7
0
14 Oct 2023
Perceptual MAE for Image Manipulation Localization: A High-level Vision
  Learner Focusing on Low-level Features
Perceptual MAE for Image Manipulation Localization: A High-level Vision Learner Focusing on Low-level Features
Xiaochen Ma
Jizhe Zhou
Xiong Xu
Zhuohang Jiang
Chi-Man Pun
24
0
0
10 Oct 2023
Adaptive Multi-head Contrastive Learning
Adaptive Multi-head Contrastive Learning
Lei Wang
Piotr Koniusz
Tom Gedeon
Liang Zheng
26
4
0
09 Oct 2023
Masked Feature Modelling: Feature Masking for the Unsupervised
  Pre-training of a Graph Attention Network Block for Bottom-up Video Event
  Recognition
Masked Feature Modelling: Feature Masking for the Unsupervised Pre-training of a Graph Attention Network Block for Bottom-up Video Event Recognition
Dimitrios Daskalakis
Nikolaos Gkalelis
Vasileios Mezaris
32
0
0
24 Aug 2023
Masked Momentum Contrastive Learning for Zero-shot Semantic
  Understanding
Masked Momentum Contrastive Learning for Zero-shot Semantic Understanding
Jiantao Wu
Shentong Mo
Muhammad Awais
Sara Atito
Zhenhua Feng
J. Kittler
VLM
23
4
0
22 Aug 2023
MGMAE: Motion Guided Masking for Video Masked Autoencoding
MGMAE: Motion Guided Masking for Video Masked Autoencoding
Bingkun Huang
Zhiyu Zhao
Guozhen Zhang
Yu Qiao
Limin Wang
22
29
0
21 Aug 2023
pNNCLR: Stochastic Pseudo Neighborhoods for Contrastive Learning based
  Unsupervised Representation Learning Problems
pNNCLR: Stochastic Pseudo Neighborhoods for Contrastive Learning based Unsupervised Representation Learning Problems
Momojit Biswas
Himanshu Buckchash
Dilip K. Prasad
SSL
11
7
0
14 Aug 2023
DETR Doesn't Need Multi-Scale or Locality Design
DETR Doesn't Need Multi-Scale or Locality Design
Yutong Lin
Yuhui Yuan
Zheng-Wei Zhang
Chen Li
Nanning Zheng
Han Hu
25
5
0
03 Aug 2023
HandMIM: Pose-Aware Self-Supervised Learning for 3D Hand Mesh Estimation
HandMIM: Pose-Aware Self-Supervised Learning for 3D Hand Mesh Estimation
Zuyan Liu
Gaojie Lin
Congyi Wang
Min Zheng
Feida Zhu
3DH
13
0
0
29 Jul 2023
The RoboDepth Challenge: Methods and Advancements Towards Robust Depth
  Estimation
The RoboDepth Challenge: Methods and Advancements Towards Robust Depth Estimation
Lingdong Kong
Yaru Niu
Shaoyuan Xie
Hanjiang Hu
Lai Xing Ng
...
Zhenyu Li
Runze Chen
Haiyong Luo
Fang Zhao
Jing Yu
21
13
0
27 Jul 2023
Pre-Training with Diffusion models for Dental Radiography segmentation
Pre-Training with Diffusion models for Dental Radiography segmentation
Jérémy Rousseau
C. Alaka
E. Covili
H. Mayard
L. Misrachi
Willy Au
DiffM
MedIm
AI4CE
12
4
0
26 Jul 2023
Controllable Guide-Space for Generalizable Face Forgery Detection
Controllable Guide-Space for Generalizable Face Forgery Detection
Yingjie Guo
Cheng Zhen
Pengfei Yan
CVBM
AAML
21
19
0
26 Jul 2023
DenseMP: Unsupervised Dense Pre-training for Few-shot Medical Image
  Segmentation
DenseMP: Unsupervised Dense Pre-training for Few-shot Medical Image Segmentation
Zhaoxin Fan
Puquan Pan
Zeren Zhang
C. Chen
Tianyang Wang
Si Zheng
Min Xu
VLM
26
0
0
13 Jul 2023
Test-Time Training on Video Streams
Test-Time Training on Video Streams
Renhao Wang
Yu Sun
Yossi Gandelsman
Xinlei Chen
Alexei A. Efros
Alexei A. Efros
Xiaolong Wang
TTA
ViT
3DGS
24
16
0
11 Jul 2023
Intra- & Extra-Source Exemplar-Based Style Synthesis for Improved Domain
  Generalization
Intra- & Extra-Source Exemplar-Based Style Synthesis for Improved Domain Generalization
Yumeng Li
Dan Zhang
M. Keuper
Anna Khoreva
34
10
0
02 Jul 2023
Towards AGI in Computer Vision: Lessons Learned from GPT and Large
  Language Models
Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Lingxi Xie
Longhui Wei
Xiaopeng Zhang
Kaifeng Bi
Xiaotao Gu
Jianlong Chang
Qi Tian
29
6
0
14 Jun 2023
Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training
Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training
Lorenzo Baraldi
Roberto Amoroso
Marcella Cornia
Lorenzo Baraldi
Andrea Pilzer
Rita Cucchiara
36
2
0
12 Jun 2023
Exploring Effective Mask Sampling Modeling for Neural Image Compression
Exploring Effective Mask Sampling Modeling for Neural Image Compression
Lin Liu
Mingming Zhao
Shanxin Yuan
Wenlong Lyu
Wen-gang Zhou
Houqiang Li
Yanfeng Wang
Qi Tian
11
3
0
09 Jun 2023
Evaluating The Robustness of Self-Supervised Representations to
  Background/Foreground Removal
Evaluating The Robustness of Self-Supervised Representations to Background/Foreground Removal
Xavier F. Cadet
Ranya Aloufi
A. Miranville
S. Ahmadi-Abhari
Hamed Haddadi
16
0
0
02 Jun 2023
Image as First-Order Norm+Linear Autoregression: Unveiling Mathematical
  Invariance
Image as First-Order Norm+Linear Autoregression: Unveiling Mathematical Invariance
Yinpeng Chen
Xiyang Dai
Dongdong Chen
Mengchen Liu
Lu Yuan
Zicheng Liu
Youzuo Lin
22
2
0
25 May 2023
A multimodal method based on cross-attention and convolution for
  postoperative infection diagnosis
A multimodal method based on cross-attention and convolution for postoperative infection diagnosis
Xianjie Liu
Hon-Yi Shi
19
0
0
23 May 2023
GeoMAE: Masked Geometric Target Prediction for Self-supervised Point
  Cloud Pre-Training
GeoMAE: Masked Geometric Target Prediction for Self-supervised Point Cloud Pre-Training
Xiaoyu Tian
Haoxi Ran
Yue Wang
Hang Zhao
3DPC
ViT
16
38
0
15 May 2023
Medical supervised masked autoencoders: Crafting a better masking
  strategy and efficient fine-tuning schedule for medical image classification
Medical supervised masked autoencoders: Crafting a better masking strategy and efficient fine-tuning schedule for medical image classification
Jia-ju Mao
Shu-Hua Guo
Yuan Chang
Xuesong Yin
Binling Nie
26
2
0
10 May 2023
Previous
1234
Next