Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2301.12058
Cited By
Aerial Image Object Detection With Vision Transformer Detector (ViTDet)
28 January 2023
Liya Wang
A. Tien
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Aerial Image Object Detection With Vision Transformer Detector (ViTDet)"
27 / 27 papers shown
Title
Segment Anything, Even Occluded
Wei-En Tai
Yu-Lin Shih
Cheng Sun
Y. Wang
Hwann-Tzong Chen
VLM
55
0
0
08 Mar 2025
Transformers in Small Object Detection: A Benchmark and Survey of State-of-the-Art
Aref Miri Rekavandi
Shima Rashidi
F. Boussaïd
Stephen Hoefs
Emre Akbas
Bennamoun
ViT
31
23
0
10 Sep 2023
Siamese Transition Masked Autoencoders as Uniform Unsupervised Visual Anomaly Detector
Haiming Yao
Xue Wang
Wenyong Yu
15
9
0
01 Nov 2022
A Unified View of Masked Image Modeling
Zhiliang Peng
Li Dong
Hangbo Bao
QiXiang Ye
Furu Wei
VLM
52
35
0
19 Oct 2022
It Takes Two: Masked Appearance-Motion Modeling for Self-supervised Video Transformer Pre-training
Yuxin Song
Min Yang
Wenhao Wu
Dongliang He
Fu Li
Jingdong Wang
ViT
92
8
0
11 Oct 2022
Ensemble Learning using Transformers and Convolutional Networks for Masked Face Recognition
Mohammed R. Al-Sinan
Aseel F. Haneef
H. Luqman
25
2
0
10 Oct 2022
Real-World Robot Learning with Masked Visual Pre-training
Ilija Radosavovic
Tete Xiao
Stephen James
Pieter Abbeel
Jitendra Malik
Trevor Darrell
SSL
146
238
0
06 Oct 2022
Self-Distillation for Further Pre-training of Transformers
Seanie Lee
Minki Kang
Juho Lee
Sung Ju Hwang
Kenji Kawaguchi
45
8
0
30 Sep 2022
Self-Supervised Masked Convolutional Transformer Block for Anomaly Detection
Neelu Madan
Nicolae-Cătălin Ristea
Radu Tudor Ionescu
Kamal Nasrollahi
F. Khan
T. Moeslund
M. Shah
ViT
MedIm
244
60
0
25 Sep 2022
NamedMask: Distilling Segmenters from Complementary Foundation Models
Gyungin Shin
Weidi Xie
Samuel Albanie
ISeg
VLM
56
22
0
22 Sep 2022
Masked Imitation Learning: Discovering Environment-Invariant Modalities in Multimodal Demonstrations
Yilun Hao
Ruinan Wang
Zhangjie Cao
Zihan Wang
Yuchen Cui
Dorsa Sadigh
11
2
0
16 Sep 2022
Multi-Modal Masked Autoencoders for Medical Vision-and-Language Pre-Training
Zhihong Chen
Yu Du
Jinpeng Hu
Yang Liu
Guanbin Li
Xiang Wan
Tsung-Hui Chang
79
107
0
15 Sep 2022
Exploring Target Representations for Masked Autoencoders
Xingbin Liu
Jinghao Zhou
Tao Kong
Xianming Lin
Rongrong Ji
79
49
0
08 Sep 2022
Masked Self-Supervision for Remaining Useful Lifetime Prediction in Machine Tools
Haoren Guo
H. Zhu
Jiahui Wang
P. Vadakkepat
W. Ho
T. Lee
31
12
0
04 Jul 2022
Masked World Models for Visual Control
Younggyo Seo
Danijar Hafner
Hao Liu
Fangchen Liu
Stephen James
Kimin Lee
Pieter Abbeel
OffRL
77
144
0
28 Jun 2022
MaskViT: Masked Visual Pre-Training for Video Prediction
Agrim Gupta
Stephen Tian
Yunzhi Zhang
Jiajun Wu
Roberto Martín-Martín
Li Fei-Fei
100
110
0
23 Jun 2022
SupMAE: Supervised Masked Autoencoders Are Efficient Vision Learners
Feng Liang
Yangguang Li
Diana Marculescu
SSL
TPM
ViT
40
22
0
28 May 2022
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training
Renrui Zhang
Ziyu Guo
Rongyao Fang
Bingyan Zhao
Dong Wang
Yu Qiao
Hongsheng Li
Peng Gao
3DPC
171
241
0
28 May 2022
Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation
Yixuan Wei
Han Hu
Zhenda Xie
Zheng-Wei Zhang
Yue Cao
Jianmin Bao
Dong Chen
B. Guo
CLIP
80
123
0
27 May 2022
Revealing the Dark Secrets of Masked Image Modeling
Zhenda Xie
Zigang Geng
Jingcheng Hu
Zheng-Wei Zhang
Han Hu
Yue Cao
VLM
186
105
0
26 May 2022
FaceMAE: Privacy-Preserving Face Recognition via Masked Autoencoders
K. Wang
Bo-Lu Zhao
Xiangyu Peng
Zheng Hua Zhu
Jiankang Deng
Xinchao Wang
Hakan Bilen
Yang You
PICV
38
11
0
23 May 2022
Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality
Xiang Li
Wenhai Wang
Lingfeng Yang
Jian Yang
95
73
0
20 May 2022
Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection
Feng Liu
Xiaosong Zhang
Zhiliang Peng
Zonghao Guo
Fang Wan
Xian-Wei Ji
QiXiang Ye
ObjD
43
20
0
19 May 2022
Training Vision-Language Transformers from Captions
Liangke Gui
Yingshan Chang
Qiuyuan Huang
Subhojit Som
Alexander G. Hauptmann
Jianfeng Gao
Yonatan Bisk
VLM
ViT
172
11
0
19 May 2022
Adversarial Masking for Self-Supervised Learning
Yuge Shi
N. Siddharth
Philip H. S. Torr
Adam R. Kosiorek
SSL
46
81
0
31 Jan 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
258
7,337
0
11 Nov 2021
Oriented R-CNN for Object Detection
Xingxing Xie
Gong Cheng
Jiabao Wang
Xiwen Yao
Junwei Han
ObjD
112
664
0
12 Aug 2021
1