2103.14030
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
25 March 2021
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng-Wei Zhang
Stephen Lin
B. Guo
ViT
Papers citing "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"
50 / 2,188 papers shown
Title
ByteTrackV2: 2D and 3D Multi-Object Tracking by Associating Every Detection Box
Yifu Zhang
Xing-Hui Wang
Xiaoqing Ye
Wei Zhang
Jincheng Lu
Xiao Tan
Errui Ding
Pei Sun
Jingdong Wang
VOT
24
20
0
27 Mar 2023
Vision Transformer with Quadrangle Attention
Qiming Zhang
Jing Zhang
Yufei Xu
Dacheng Tao
ViT
19
38
0
27 Mar 2023
UniDistill: A Universal Cross-Modality Knowledge Distillation Framework for 3D Object Detection in Bird's-Eye View
Shengchao Zhou
Weizhou Liu
Chen Hu
Shuchang Zhou
Chaoxiang Ma
21
42
0
27 Mar 2023
Learned Image Compression with Mixed Transformer-CNN Architectures
Jinming Liu
Heming Sun
J. Katto
10
220
0
27 Mar 2023
Joint Person Identity, Gender and Age Estimation from Hand Images using Deep Multi-Task Representation Learning
N. L. Baisa
CVBM
32
4
0
27 Mar 2023
Global-to-Local Modeling for Video-based 3D Human Pose and Shape Estimation
Xi Shen
Zongxin Yang
Xiaohan Wang
Jianxin Ma
Chang Zhou
Yezhou Yang
ViT
3DH
21
33
0
26 Mar 2023
SDTracker: Synthetic Data Based Multi-Object Tracking
Yingda Guan
Zhengyang Feng
Huiying Chang
Kuo Du
Tingting Li
Min Wang
21
0
0
26 Mar 2023
Sector Patch Embedding: An Embedding Module Conforming to The Distortion Pattern of Fisheye Image
Dian Yang
Jiadong Tang
Yu Gao
Yi Yang
M. Fu
18
1
0
26 Mar 2023
An Evaluation of Memory Optimization Methods for Training Neural Networks
Xiaoxuan Liu
Siddharth Jha
Alvin Cheung
21
0
0
26 Mar 2023
EfficientAD: Accurate Visual Anomaly Detection at Millisecond-Level Latencies
Kilian Batzner
Lars Heckler
Rebecca König
30
129
0
25 Mar 2023
MDQE: Mining Discriminative Query Embeddings to Segment Occluded Instances on Challenging Videos
Minghan Li
Shuai Li
Wangmeng Xiang
Lei Zhang
26
9
0
25 Mar 2023
Prompt-Guided Transformers for End-to-End Open-Vocabulary Object Detection
Hwanjun Song
Jihwan Bang
VLM
ObjD
24
14
0
25 Mar 2023
Towards Accurate Post-Training Quantization for Vision Transformer
Yifu Ding
Haotong Qin
Qing-Yu Yan
Z. Chai
Junjie Liu
Xiaolin K. Wei
Xianglong Liu
MQ
54
67
0
25 Mar 2023
Learned Two-Plane Perspective Prior based Image Resampling for Efficient Object Detection
Anurag Ghosh
Dinesh Reddy Narapureddy
Christoph Mertz
S. Narasimhan
28
4
0
25 Mar 2023
FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization
Pavan Kumar Anasosalu Vasu
J. Gabriel
Jeff J. Zhu
Oncel Tuzel
Anurag Ranjan
ViT
26
149
0
24 Mar 2023
PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View
Ze Shi
Haowen Shi
Kailun Yang
Zhen-fei Yin
Yining Lin
Kaiwei Wang
24
3
0
24 Mar 2023
PFT-SSR: Parallax Fusion Transformer for Stereo Image Super-Resolution
Hansheng Guo
Juncheng Li
Guangwei Gao
Zhi Li
T. Zeng
ViT
24
12
0
24 Mar 2023
MSFA-Frequency-Aware Transformer for Hyperspectral Images Demosaicing
Haijin Zeng
Kai Feng
Shaoguang Huang
Jiezhang Cao
Yongyong Chen
Hongyan Zhang
H. Luong
Wilfried Philips
21
1
0
23 Mar 2023
Towards Better Dynamic Graph Learning: New Architecture and Unified Library
Le Yu
Leilei Sun
Bowen Du
Weifeng Lv
AI4CE
22
96
0
23 Mar 2023
Top-Down Visual Attention from Analysis by Synthesis
Baifeng Shi
Trevor Darrell
Xin Eric Wang
17
28
0
23 Mar 2023
From Knowledge Distillation to Self-Knowledge Distillation: A Unified Approach with Normalized Loss and Customized Soft Labels
Zhendong Yang
Ailing Zeng
Zhe Li
Tianke Zhang
Chun Yuan
Yu Li
21
72
0
23 Mar 2023
LiDARFormer: A Unified Transformer-based Multi-task Network for LiDAR Perception
Zixiang Zhou
Dongqiangzi Ye
Weijia Chen
Yufei Xie
Yu Wang
Panqu Wang
H. Foroosh
29
10
0
21 Mar 2023
Machine Learning for Brain Disorders: Transformers and Visual Transformers
Robin Courant
Maika Edberg
Nicolas Dufour
Vicky Kalogeiton
MedIm
ViT
27
1
0
21 Mar 2023
Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
Shihao Wang
Yingfei Liu
Tiancai Wang
Ying Li
Xiangyu Zhang
3DPC
39
190
0
21 Mar 2023
The Multiscale Surface Vision Transformer
Simon Dahan
Logan Z. J. Williams
Daniel Rueckert
E. C. Robinson
MedIm
ViT
10
2
0
21 Mar 2023
A High-Frequency Focused Network for Lightweight Single Image Super-Resolution
Xiaotian Weng
Yi Chen
Zhichao Zheng
Yanhui Gu
Junsheng Zhou
Yudong Zhang
21
0
0
21 Mar 2023
Human Pose as Compositional Tokens
Zigang Geng
Chunyu Wang
Yixuan Wei
Ze Liu
Houqiang Li
Han Hu
23
47
0
21 Mar 2023
Equiangular Basis Vectors
Yang Shen
Xuhao Sun
Xiuying Wei
33
7
0
21 Mar 2023
Learning Context-aware Classifier for Semantic Segmentation
Zhuotao Tian
Jiequan Cui
Li Jiang
Xiaojuan Qi
Xin Lai
Yixin Chen
Shu Liu
Jiaya Jia
85
22
0
21 Mar 2023
Detecting the open-world objects with the help of the Brain
Shuailei Ma
Yuefeng Wang
Ying-yu Wei
Peihao Chen
Zhixiang Ye
Jiaqi Fan
Enming Zhang
Thomas H. Li
VLM
ObjD
16
2
0
21 Mar 2023
One-to-Few Label Assignment for End-to-End Dense Detection
Shuai Li
Minghan Li
Ruihuang Li
Chenhang He
Lei Zhang
25
19
0
21 Mar 2023
Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training Efficiency
Vithursan Thangarasa
Shreyas Saxena
Abhay Gupta
Sean Lie
21
3
0
21 Mar 2023
Tracker Meets Night: A Transformer Enhancer for UAV Tracking
Junjie Ye
Changhong Fu
Ziang Cao
Shan An
Guang-Zheng Zheng
Bowen Li
32
51
0
20 Mar 2023
Multiscale Audio Spectrogram Transformer for Efficient Audio Classification
Wenjie Zhu
M. Omar
35
22
0
19 Mar 2023
Spatio-Temporal AU Relational Graph Representation Learning For Facial Action Units Detection
Zihan Wang
Siyang Song
Cheng Luo
Yuzhi Zhou
Shiling Wu
Weicheng Xie
Linlin Shen
CVBM
31
13
0
19 Mar 2023
Vision Transformer-based Model for Severity Quantification of Lung Pneumonia Using Chest X-ray Images
Bouthaina Slika
Fadi Dornaika
H. Merdji
K. Hammoudi
ViT
MedIm
26
0
0
18 Mar 2023
HDformer: A Higher Dimensional Transformer for Diabetes Detection Utilizing Long Range Vascular Signals
Ella Lan
MedIm
20
1
0
17 Mar 2023
MedNeXt: Transformer-driven Scaling of ConvNets for Medical Image Segmentation
Saikat Roy
Gregor Koehler
Constantin Ulrich
Michael Baumgartner
Jens Petersen
Fabian Isensee
Paul F. Jaeger
Klaus Maier-Hein
ViT
MedIm
24
136
0
17 Mar 2023
GNNFormer: A Graph-based Framework for Cytopathology Report Generation
Yangqiaoyu Zhou
Kai-Lang Yao
Wusuo Li
MedIm
11
1
0
17 Mar 2023
SwinVFTR: A Novel Volumetric Feature-learning Transformer for 3D OCT Fluid Segmentation
Sharif Amit Kamran
Khondker Fariha Hossain
Alireza Tavakkoli
Salah A. Baker
S. Zuckerbrod
ViT
MedIm
11
1
0
16 Mar 2023
Highly Efficient 3D Human Pose Tracking from Events with Spiking Spatiotemporal Transformer
Shihao Zou
Yuxuan Mu
X. Zuo
Zi-An Wang
Chao Li
Sen Wang
Weixin Si
Li Cheng
3DH
31
15
0
16 Mar 2023
DeDA: Deep Directed Accumulator
Hang Zhang
Rongguang Wang
Renjiu Hu
Jinwei Zhang
Jiahao Nick Li
MedIm
19
4
0
15 Mar 2023
Multi Modal Facial Expression Recognition with Transformer-Based Fusion Networks and Dynamic Sampling
Jun-Hwa Kim
Namho Kim
C. Won
CVBM
6
8
0
15 Mar 2023
Subjective and Objective Quality Assessment for in-the-Wild Computer Graphics Images
Zicheng Zhang
Wei Sun
Yingjie Zhou
Jun Jia
Zhichao Zhang
Jing Liu
Xiongkuo Min
Guangtao Zhai
23
25
0
14 Mar 2023
CAT: Causal Audio Transformer for Audio Classification
Xiaoyu Liu
Hanlin Lu
Jianbo Yuan
Xinyu Li
ViT
21
22
0
14 Mar 2023
RTMPose: Real-Time Multi-Person Pose Estimation based on MMPose
Tao Jiang
Peng Lu
Li Zhang
Ning Ma
Rui Han
Chengqi Lyu
Yining Li
Kai-xiang Chen
3DH
31
155
0
13 Mar 2023
Diffusion-Based Hierarchical Multi-Label Object Detection to Analyze Panoramic Dental X-rays
Ibrahim Ethem Hamamci
Sezgin Er
Enis Simsar
Anjany Sekuboyina
M. Gundogar
B. Stadlinger
A. Mehl
Bjoern H. Menze
DiffM
MedIm
10
25
0
11 Mar 2023
Fine-grained Visual Classification with High-temperature Refinement and Background Suppression
Po-Yung Chou
Yu-Yung Kao
Cheng-Hung Lin
23
30
0
11 Mar 2023
Semantics-Aware Dynamic Localization and Refinement for Referring Image Segmentation
Zhao Yang
Jiaqi Wang
Yansong Tang
Kai-xiang Chen
Hengshuang Zhao
Philip H. S. Torr
31
23
0
11 Mar 2023
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
Shilong Liu
Zhaoyang Zeng
Tianhe Ren
Feng Li
Hao Zhang
...
Chun-yue Li
Jianwei Yang
Hang Su
Jun Zhu
Lei Zhang
ObjD
58
1,804
0
09 Mar 2023