Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2111.09883
Cited By
v1
v2 (latest)
Swin Transformer V2: Scaling Up Capacity and Resolution
18 November 2021
Ze Liu
Han Hu
Yutong Lin
Zhuliang Yao
Zhenda Xie
Yixuan Wei
Jia Ning
Yue Cao
Zheng Zhang
Li Dong
Furu Wei
B. Guo
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Github (14834★)
Papers citing
"Swin Transformer V2: Scaling Up Capacity and Resolution"
50 / 931 papers shown
Title
PEA: Improving the Performance of ReLU Networks for Free by Using Progressive Ensemble Activations
Á. Utasi
87
0
0
28 Jul 2022
Multi-Forgery Detection Challenge 2022: Push the Frontier of Unconstrained and Diverse Forgery Detection
Jianshu Li
Man Luo
Jian Liu
Tao Chen
Chengjie Wang
...
Bo Liu
Mingyu Guo
Ying Guo
Y. Ao
Pengfei Gao
95
1
0
27 Jul 2022
Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
IEEE International Conference on Computer Vision (ICCV), 2022
Qiang Chen
Xiaokang Chen
Jian Wang
Shan Zhang
Kun Yao
Haocheng Feng
Junyu Han
Errui Ding
Gang Zeng
Jingdong Wang
ViT
238
189
0
26 Jul 2022
DETRs with Hybrid Matching
Computer Vision and Pattern Recognition (CVPR), 2022
Ding Jia
Yuhui Yuan
Hao He
Xiao-pei Wu
Haojun Yu
Weihong Lin
Lei-huan Sun
Chao Zhang
Hanhua Hu
376
255
0
26 Jul 2022
Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Yang Liu
Guanbin Li
Guanbin Li
LRM
461
142
0
26 Jul 2022
Dive into Big Model Training
Qinghua Liu
Yuxiang Jiang
MoMe
AI4CE
LRM
92
3
0
25 Jul 2022
Applying Spatiotemporal Attention to Identify Distracted and Drowsy Driving with Vision Transformers
Samay Lakhani
ViT
MedIm
130
3
0
22 Jul 2022
Efficient Graph-Friendly COCO Metric Computation for Train-Time Model Evaluation
Luke Wood
François Chollet
78
9
0
21 Jul 2022
TinyViT: Fast Pretraining Distillation for Small Vision Transformers
European Conference on Computer Vision (ECCV), 2022
Kan Wu
Jinnian Zhang
Houwen Peng
Xiyang Dai
Bin Xiao
Jianlong Fu
Lu Yuan
ViT
210
376
0
21 Jul 2022
Vision Transformers: From Semantic Segmentation to Dense Prediction
International Journal of Computer Vision (IJCV), 2022
Li Zhang
Jiachen Lu
Sixiao Zheng
Xinxuan Zhao
Xiatian Zhu
Yanwei Fu
Tao Xiang
Jianfeng Feng
Philip H. S. Torr
ViT
239
15
0
19 Jul 2022
Towards Trustworthy Healthcare AI: Attention-Based Feature Learning for COVID-19 Screening With Chest Radiography
Kai Ma
Pengcheng Xi
K. Habashy
Ashkan Ebadi
Stéphane Tremblay
Alexander Wong
ViT
MedIm
95
2
0
19 Jul 2022
MonoIndoor++:Towards Better Practice of Self-Supervised Monocular Depth Estimation for Indoor Environments
Runze Li
Pan Ji
Yi Tian Xu
B. Bhanu
MDE
137
26
0
18 Jul 2022
Multi-manifold Attention for Vision Transformers
IEEE Access (IEEE Access), 2022
D. Konstantinidis
Ilias Papastratis
K. Dimitropoulos
P. Daras
ViT
175
17
0
18 Jul 2022
Current Trends in Deep Learning for Earth Observation: An Open-source Benchmark Arena for Image Classification
Isprs Journal of Photogrammetry and Remote Sensing (JIPRS), 2022
I. Dimitrovski
Ivan Kitanovski
D. Kocev
Nikola Simidjievski
VLM
202
92
0
14 Jul 2022
Rethinking Attention Mechanism in Time Series Classification
Information Sciences (Inf. Sci.), 2022
Bowen Zhao
Huanlai Xing
Xinhan Wang
Fuhong Song
Zhiwen Xiao
AI4TS
78
44
0
14 Jul 2022
Pyramid Transformer for Traffic Sign Detection
International Conference on Computer and Knowledge Engineering (ICCKE), 2022
Omid Nejati Manzari
A. Boudesh
S. B. Shokouhi
ViT
122
15
0
13 Jul 2022
MSP-Former: Multi-Scale Projection Transformer for Single Image Desnowing
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Sixiang Chen
Tian-Chun Ye
Yun-Peng Liu
Taodong Liao
Y. Ye
Erkang Chen
Peng Chen
ViT
215
65
0
12 Jul 2022
YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
Computer Vision and Pattern Recognition (CVPR), 2022
Chien-Yao Wang
Alexey Bochkovskiy
H. Liao
ObjD
438
8,940
0
06 Jul 2022
Softmax-free Linear Transformers
International Journal of Computer Vision (IJCV), 2022
Jiachen Lu
Junge Zhang
Xiatian Zhu
Jianfeng Feng
Tao Xiang
Li Zhang
ViT
204
14
0
05 Jul 2022
Spatiotemporal Feature Learning Based on Two-Step LSTM and Transformer for CT Scans
Chih-Chung Hsu
Chin-Han Tsai
Guangfeng Chen
Sin-Di Ma
Shen-Chieh Tai
MedIm
120
10
0
04 Jul 2022
Woodscape Fisheye Object Detection for Autonomous Driving -- CVPR 2022 OmniCV Workshop Challenge
Saravanabalagi Ramachandran
Ganesh Sistu
V. Kumar
J. McDonald
S. Yogamani
294
6
0
26 Jun 2022
LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs
Computer Vision and Pattern Recognition (CVPR), 2022
Yukang Chen
Jianhui Liu
Xinming Zhang
Xiaojuan Qi
Jiaya Jia
217
119
0
21 Jun 2022
HOPE: Hierarchical Spatial-temporal Network for Occupancy Flow Prediction
Yi Hu
Wenxin Shao
Bo Jiang
Jiajie Chen
Siqi Chai
Zhening Yang
Jingyu Qian
Helong Zhou
Qiang Liu
AI4CE
125
15
0
21 Jun 2022
Global Context Vision Transformers
International Conference on Machine Learning (ICML), 2022
Ali Hatamizadeh
Hongxu Yin
Greg Heinrich
Jan Kautz
Pavlo Molchanov
ViT
424
184
0
20 Jun 2022
EATFormer: Improving Vision Transformer Inspired by Evolutionary Algorithm
International Journal of Computer Vision (IJCV), 2022
Jiangning Zhang
Xiangtai Li
Yabiao Wang
Chengjie Wang
Jianlong Wu
Yong Liu
Dacheng Tao
ViT
269
46
0
19 Jun 2022
Enhanced Bi-directional Motion Estimation for Video Frame Interpolation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Xin Jin
Longhai Wu
Guotao Shen
Youxin Chen
Jie Chen
Jayoon Koo
Cheul-hee Hahm
162
25
0
17 Jun 2022
Rectify ViT Shortcut Learning by Visual Saliency
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022
Chong Ma
Lin Zhao
Yuzhong Chen
David Liu
Xi Jiang
Tuo Zhang
Xiaoyan Cai
Hongtu Zhu
Dajiang Zhu
Tianming Liu
ViT
179
25
0
17 Jun 2022
OmniMAE: Single Model Masked Pretraining on Images and Videos
Computer Vision and Pattern Recognition (CVPR), 2022
Rohit Girdhar
Alaaeldin El-Nouby
Mannat Singh
Kalyan Vasudev Alwala
Armand Joulin
Ishan Misra
ViT
251
117
0
16 Jun 2022
ChordMixer: A Scalable Neural Attention Model for Sequences with Different Lengths
International Conference on Learning Representations (ICLR), 2022
Ruslan Khalitov
Tong Yu
Lei Cheng
Zhirong Yang
189
15
0
12 Jun 2022
On Data Scaling in Masked Image Modeling
Computer Vision and Pattern Recognition (CVPR), 2022
Zhenda Xie
Zheng Zhang
Yue Cao
Yutong Lin
Yixuan Wei
Jingdong Sun
Han Hu
195
68
0
09 Jun 2022
CASS: Cross Architectural Self-Supervision for Medical Image Analysis
Pranav Singh
E. Sizikova
Jacopo Cirrone
OOD
366
11
0
08 Jun 2022
Tutel: Adaptive Mixture-of-Experts at Scale
Conference on Machine Learning and Systems (MLSys), 2022
Changho Hwang
Wei Cui
Yifan Xiong
Ziyue Yang
Ze Liu
...
Joe Chau
Peng Cheng
Fan Yang
Mao Yang
Y. Xiong
MoE
334
182
0
07 Jun 2022
Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation
Computer Vision and Pattern Recognition (CVPR), 2022
Feng Li
Hao Zhang
Hu-Sheng Xu
Siyi Liu
Lei Zhang
L. Ni
H. Shum
ISeg
329
511
0
06 Jun 2022
EfficientFormer: Vision Transformers at MobileNet Speed
Neural Information Processing Systems (NeurIPS), 2022
Yanyu Li
Geng Yuan
Yang Wen
Eric Hu
Georgios Evangelidis
Sergey Tulyakov
Yanzhi Wang
Jian Ren
ViT
593
509
0
02 Jun 2022
KPGT: Knowledge-Guided Pre-training of Graph Transformer for Molecular Property Prediction
Knowledge Discovery and Data Mining (KDD), 2022
Han Li
Dan Zhao
Jianyang Zeng
154
68
0
02 Jun 2022
Decomposing NeRF for Editing via Feature Field Distillation
Neural Information Processing Systems (NeurIPS), 2022
Sosuke Kobayashi
Eiichi Matsumoto
Vincent Sitzmann
756
405
0
31 May 2022
Exploring Advances in Transformers and CNN for Skin Lesion Diagnosis on Small Datasets
Brazilian Conference on Intelligent Systems (BRACIS), 2022
Leandro M. de Lima
R. Krohling
ViT
MedIm
124
13
0
30 May 2022
Temporal Latent Bottleneck: Synthesis of Fast and Slow Processing Mechanisms in Sequence Learning
Neural Information Processing Systems (NeurIPS), 2022
Aniket Didolkar
Kshitij Gupta
Anirudh Goyal
Nitesh B. Gundavarapu
Alex Lamb
Nan Rosemary Ke
Yoshua Bengio
AI4CE
434
21
0
30 May 2022
Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation
Yixuan Wei
Han Hu
Zhenda Xie
Zheng Zhang
Yue Cao
Jianmin Bao
Dong Chen
B. Guo
CLIP
514
138
0
27 May 2022
How Tempering Fixes Data Augmentation in Bayesian Neural Networks
International Conference on Machine Learning (ICML), 2022
Gregor Bachmann
Lorenzo Noci
Thomas Hofmann
BDL
AAML
229
11
0
27 May 2022
Green Hierarchical Vision Transformer for Masked Image Modeling
Neural Information Processing Systems (NeurIPS), 2022
Lang Huang
Shan You
Mingkai Zheng
Fei Wang
Chao Qian
T. Yamasaki
258
83
0
26 May 2022
MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers
Computer Vision and Pattern Recognition (CVPR), 2022
Jihao Liu
Xin Huang
Jinliang Zheng
Yu Liu
Jiaming Song
196
79
0
26 May 2022
Vision Transformers in 2022: An Update on Tiny ImageNet
Ethan Huynh
ViT
156
14
0
21 May 2022
DProQ: A Gated-Graph Transformer for Protein Complex Structure Assessment
Xiao Chen
Alex Morehead
Jian Liu
Jianlin Cheng
156
7
0
21 May 2022
Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality
Xiang Li
Wenhai Wang
Lingfeng Yang
Jian Yang
284
85
0
20 May 2022
Vision Transformer Adapter for Dense Predictions
International Conference on Learning Representations (ICLR), 2022
Zhe Chen
Yuchen Duan
Wenhai Wang
Junjun He
Tong Lu
Jifeng Dai
Yu Qiao
803
742
0
17 May 2022
An Effective Transformer-based Solution for RSNA Intracranial Hemorrhage Detection Competition
Fangxin Shang
Siqi Wang
Xiaorong Wang
Yehui Yang
MedIm
74
4
0
16 May 2022
Sequencer: Deep LSTM for Image Classification
Neural Information Processing Systems (NeurIPS), 2022
Yuki Tatsunami
Masato Taki
VLM
ViT
319
107
0
04 May 2022
Improving the Transferability of Adversarial Examples with Restructure Embedded Patches
Huipeng Zhou
Yu-an Tan
Yajie Wang
Haoran Lyu
Shan-Hung Wu
Yuan-zhang Li
ViT
110
5
0
27 Apr 2022
SUES-200: A Multi-height Multi-scene Cross-view Image Benchmark Across Drone and Satellite
Runzhe Zhu
Ling Yin
Mingze Yang
Fei Wu
Yunchen Yang
Wenbo Hu
202
98
0
22 Apr 2022
Previous
1
2
3
...
17
18
19
Next