2111.09883
Cited By
Swin Transformer V2: Scaling Up Capacity and Resolution
18 November 2021
Ze Liu
Han Hu
Yutong Lin
Zhuliang Yao
Zhenda Xie
Yixuan Wei
Jia Ning
Yue Cao
Zheng Zhang
Li Dong
Furu Wei
B. Guo
ViT
ArXiv (abs)
PDF
HTML
Github (14834★)
Papers citing
"Swin Transformer V2: Scaling Up Capacity and Resolution"
50 / 931 papers shown
BiFormer: Learning Bilateral Motion Estimation via Bilateral Transformer for 4K Video Frame Interpolation
Computer Vision and Pattern Recognition (CVPR), 2023
Jun-ho Park
Jintae Kim
Chang-Su Kim
150
32
0
05 Apr 2023
BugNIST -- a Large Volumetric Dataset for Object Detection under Domain Shift
European Conference on Computer Vision (ECCV), 2023
Patrick M. Jensen
Anders B. Dahl
Carsten Gundlach
Rebecca J Engberg
H. Kjer
Vedrana Andersen Dahl
137
1
0
04 Apr 2023
Exploration of Lightweight Single Image Denoising with Transformers and Truly Fair Training
International Conference on Multimedia Retrieval (ICMR), 2023
Haram Choi
Cheolwoong Na
Jinseop S. Kim
Jihoon Yang
ViT
144
3
0
04 Apr 2023
Exploring Vision-Language Models for Imbalanced Learning
International Journal of Computer Vision (IJCV), 2023
Yidong Wang
Zhuohao Yu
Yongfeng Zhang
Qiang Heng
Haoxing Chen
Wei Ye
Rui Xie
Xingxu Xie
Shi-Bo Zhang
VLM
284
52
0
04 Apr 2023
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer
Computer Vision and Pattern Recognition (CVPR), 2023
Xuanyao Chen
Zhijian Liu
Haotian Tang
Li Yi
Hang Zhao
Song Han
ViT
323
71
0
30 Mar 2023
VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Computer Vision and Pattern Recognition (CVPR), 2023
Limin Wang
Bingkun Huang
Zhiyu Zhao
Zhan Tong
Yinan He
Yi Wang
Yali Wang
Yu Qiao
VGen
319
519
0
29 Mar 2023
Scalable, Detailed and Mask-Free Universal Photometric Stereo
Computer Vision and Pattern Recognition (CVPR), 2023
Satoshi Ikehata
200
51
0
28 Mar 2023
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications
IEEE International Conference on Computer Vision (ICCV), 2023
Abdelrahman M. Shaker
Muhammad Maaz
H. Rasheed
Salman Khan
Ming-Hsuan Yang
Fahad Shahbaz Khan
ViT
317
165
0
27 Mar 2023
Vision Transformer with Quadrangle Attention
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Qiming Zhang
Jing Zhang
Yufei Xu
Dacheng Tao
ViT
162
59
0
27 Mar 2023
CP-CNN: Core-Periphery Principle Guided Convolutional Neural Network
Lin Zhao
Haixing Dai
Zihao Wu
Dajiang Zhu
Tianming Liu
167
1
0
27 Mar 2023
Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers
Computer Vision and Pattern Recognition (CVPR), 2023
Cong Wei
Brendan Duke
R. Jiang
P. Aarabi
Graham W. Taylor
Florian Shkurti
ViT
173
21
0
24 Mar 2023
WM-MoE: Weather-aware Multi-scale Mixture-of-Experts for Blind Adverse Weather Removal
Yulin Luo
Rui Zhao
Xi Wei
Jinwei Chen
Yijie Lu
Shenghao Xie
Tianyu Wang
Ruiqin Xiong
Ming Lu
Shanghang Zhang
295
8
0
24 Mar 2023
The effectiveness of MAE pre-pretraining for billion-scale pretraining
IEEE International Conference on Computer Vision (ICCV), 2023
Mannat Singh
Quentin Duval
Kalyan Vasudev Alwala
Haoqi Fan
Vaibhav Aggarwal
...
Piotr Dollár
Christoph Feichtenhofer
Ross B. Girshick
Rohit Girdhar
Ishan Misra
LRM
357
85
0
23 Mar 2023
ViC-MAE: Self-Supervised Representation Learning from Images and Video with Contrastive Masked Autoencoders
J. Hernandez
Ruben Villegas
Vicente Ordonez
SSL
152
2
0
21 Mar 2023
Human Pose as Compositional Tokens
Computer Vision and Pattern Recognition (CVPR), 2023
Zigang Geng
Chunyu Wang
Yixuan Wei
Ze Liu
Houqiang Li
Han Hu
174
69
0
21 Mar 2023
Large AI Models in Health Informatics: Applications, Challenges, and the Future
IEEE Journal of Biomedical and Health Informatics (IEEE JBHI), 2023
Jianing Qiu
Lin Li
Jiankai Sun
Jiachuan Peng
Peilun Shi
...
Bo Xiao
Wu Yuan
Ningli Wang
Dong Xu
Benny Lo
AI4MH
LM&MA
248
181
0
21 Mar 2023
EVA-02: A Visual Representation for Neon Genesis
Image and Vision Computing (IVC), 2023
Yuxin Fang
Quan-Sen Sun
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLM
ViT
CLIP
380
396
0
20 Mar 2023
Robustifying Token Attention for Vision Transformers
IEEE International Conference on Computer Vision (ICCV), 2023
Yong Guo
David Stutz
Bernt Schiele
ViT
350
34
0
20 Mar 2023
Internal Structure Attention Network for Fingerprint Presentation Attack Detection from Optical Coherence Tomography
IEEE Transactions on Biometrics, Behavior, and Identity Science (TBBIS), 2023
Hao Sun
Yilong Zhang
Peng Chen
Haixia Wang
Ronghua Liang
235
6
0
20 Mar 2023
LSwinSR: UAV Imagery Super-Resolution based on Linear Swin Transformer
IEEE Transactions on Geoscience and Remote Sensing (TGRS), 2023
Rui Li
Xiaowei Zhao
160
10
0
17 Mar 2023
MedNeXt: Transformer-driven Scaling of ConvNets for Medical Image Segmentation
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2023
Saikat Roy
Gregor Koehler
Constantin Ulrich
Michael Baumgartner
Jens Petersen
Hyunjin Park
Paul F. Jaeger
Klaus Maier-Hein
ViT
MedIm
427
281
0
17 Mar 2023
Dual-path Adaptation from Image to Video Transformers
Computer Vision and Pattern Recognition (CVPR), 2023
Jungin Park
Jiyoung Lee
Kwanghoon Sohn
ViT
234
56
0
17 Mar 2023
High Accurate and Explainable Multi-Pill Detection Framework with Graph Neural Network-Assisted Multimodal Data Fusion
PLoS ONE (PLoS ONE), 2023
Anh Duy Nguyen
H. Pham
Huynh Thanh Trung
Quoc Viet Hung Nguyen
Thao Nguyen Truong
Phi Le Nguyen
MedIm
189
8
0
17 Mar 2023
ELFIS: Expert Learning for Fine-grained Image Recognition Using Subsets
Pablo J. Villacorta
Jesús M. Rodríguez-de-Vera
Marc Bolaños
Ignacio Sarasúa
Bhalaji Nagarajan
Petia Radeva
185
2
0
16 Mar 2023
Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Reinforcement
IEEE International Conference on Computer Vision (ICCV), 2023
Fartash Faghri
Hadi Pouransari
Sachin Mehta
Mehrdad Farajtabar
Ali Farhadi
Mohammad Rastegari
Oncel Tuzel
240
15
0
15 Mar 2023
Fully neuromorphic vision and control for autonomous drone flight
Science Robotics (Sci. Robot.), 2023
Federico Paredes-Valles
J. Hagenaars
Julien Dupeyroux
S. Stroobants
Ying Xu
Guido de Croon
158
69
0
15 Mar 2023
Deep Learning for Iris Recognition: A Review
Yi Yin
Si-Liang He
Renye Zhang
Hongli Chang
Xu Han
Jinghua Zhang
PILM
155
22
0
15 Mar 2023
Exploring Resiliency to Natural Image Corruptions in Deep Learning using Design Diversity
Rafael Rosales
Pablo Munoz
Michael Paulitsch
132
2
0
15 Mar 2023
ODIN: On-demand Data Formulation to Mitigate Dataset Lock-in
SP Choi
Jihun Lee
HyeongSeok Ahn
Sanghee Jung
Bumsoo Kang
VLM
139
0
0
13 Mar 2023
Multi-metrics adaptively identifies backdoors in Federated learning
IEEE International Conference on Computer Vision (ICCV), 2023
Siquan Huang
Yijiang Li
Chong Chen
Leyu Shi
Ying Gao
AAML
215
43
0
12 Mar 2023
Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks
Computer Vision and Pattern Recognition (CVPR), 2023
Jierun Chen
Shiu-hong Kao
Hao He
Weipeng Zhuo
Song Wen
Chul-Ho Lee
Shueng-Han Gary Chan
OOD
318
1,430
0
07 Mar 2023
Fine-Grained ImageNet Classification in the Wild
Maria Lymperaiou
Konstantinos Thomas
Giorgos Stamou
VLM
145
1
0
04 Mar 2023
Unleashing Text-to-Image Diffusion Models for Visual Perception
IEEE International Conference on Computer Vision (ICCV), 2023
Wenliang Zhao
Yongming Rao
Zuyan Liu
Benlin Liu
Jie Zhou
Jiwen Lu
ObjD
VLM
MDE
920
294
0
03 Mar 2023
Visual Atoms: Pre-training Vision Transformers with Sinusoidal Waves
Computer Vision and Pattern Recognition (CVPR), 2023
Sora Takashima
Ryo Hayamizu
Nakamasa Inoue
Hirokatsu Kataoka
Rio Yokota
224
25
0
02 Mar 2023
Time Series as Images: Vision Transformer for Irregularly Sampled Time Series
Neural Information Processing Systems (NeurIPS), 2023
Zekun Li
Shiyang Li
Xifeng Yan
AI4TS
196
87
0
01 Mar 2023
Efficient and Explicit Modelling of Image Hierarchies for Image Restoration
Computer Vision and Pattern Recognition (CVPR), 2023
Yawei Li
Yuchen Fan
Xiaoyu Xiang
D. Demandolx
Rakesh Ranjan
Radu Timofte
Luc Van Gool
235
276
0
01 Mar 2023
OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System
Chao Xue
Wen Liu
Shunxing Xie
Zhenfang Wang
Jiaxing Li
...
Shi-Yong Chen
Yibing Zhan
Jing Zhang
Chaoyue Wang
Dacheng Tao
211
2
0
01 Mar 2023
Single-Cell Multimodal Prediction via Transformers
International Conference on Information and Knowledge Management (CIKM), 2023
Wenzhuo Tang
Haifang Wen
Renming Liu
Jiayuan Ding
Wei Jin
Yuying Xie
Hui Liu
Shucheng Zhou
AI4CE
192
13
0
01 Mar 2023
Learning to Generalize towards Unseen Domains via a Content-Aware Style Invariant Model for Disease Detection from Chest X-rays
IEEE Journal of Biomedical and Health Informatics (IEEE JBHI), 2023
Mohammad Zunaed
M. Haque
Taufiq Hasan
OOD
216
7
0
27 Feb 2023
ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth
S. Bhat
R. Birkl
Diana Wofk
Peter Wonka
Matthias Müller
VLM
MDE
444
736
0
23 Feb 2023
Human MotionFormer: Transferring Human Motions with Vision Transformers
International Conference on Learning Representations (ICLR), 2023
Hongyu Liu
Xintong Han
Chengbin Jin
Lihui Qian
Huawei Wei
...
Faqiang Wang
Haoye Dong
Yibing Song
Jia Xu
Qifeng Chen
135
19
0
22 Feb 2023
Hyneter: Hybrid Network Transformer for Object Detection
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Dong Chen
Duoqian Miao
Xuepeng Zhao
ViT
170
6
0
18 Feb 2023
Slapo: A Schedule Language for Progressive Optimization of Large Deep Learning Model Training
International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2023
Hongzheng Chen
Cody Hao Yu
Shuai Zheng
Zhen Zhang
Zhiru Zhang
Yida Wang
318
13
0
16 Feb 2023
CholecTriplet2022: Show me a tool and tell me the triplet -- an endoscopic vision challenge for surgical action triplet detection
C. Nwoye
Tong Yu
Saurav Sharma
Aditya Murali
Deepak Alapatt
...
Pietro Mascagni
B. Seeliger
Cristians Gonzalez
Didier Mutter
N. Padoy
208
33
0
13 Feb 2023
Team Triple-Check at Factify 2: Parameter-Efficient Large Foundation Models with Feature Representations for Multi-Modal Fact Verification
Wei-Wei Du
Hongfa Wu
Wei-Yao Wang
Chao-Han Huck Yang
165
9
0
12 Feb 2023
GMConv: Modulating Effective Receptive Fields for Convolutional Kernels
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Qi Chen
Chao Li
Jia Ning
Stephen Lin
Kun He
AAML
199
6
0
09 Feb 2023
Knowledge Distillation in Vision Transformers: A Critical Review
Gousia Habib
Tausifa Jan Saleem
Brejesh Lall
260
23
0
04 Feb 2023
Beyond Pretrained Features: Noisy Image Modeling Provides Adversarial Defense
Neural Information Processing Systems (NeurIPS), 2023
Zunzhi You
Daochang Liu
Bohyung Han
Chang Xu
AAML
VLM
418
6
0
02 Feb 2023
FCB-SwinV2 Transformer for Polyp Segmentation
Kerr Fitzgerald
B. Matuszewski
ViT
MedIm
184
17
0
02 Feb 2023
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video
International Conference on Machine Learning (ICML), 2023
Haiyang Xu
Qinghao Ye
Mingshi Yan
Yaya Shi
Jiabo Ye
...
Guohai Xu
Ji Zhang
Songfang Huang
Feiran Huang
Jingren Zhou
MLLM
VLM
MoE
235
217
0
01 Feb 2023