Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2103.14030
Cited By
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
25 March 2021
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng-Wei Zhang
Stephen Lin
B. Guo
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"
50 / 2,425 papers shown
Title
CVSNet: A Computer Implementation for Central Visual System of The Brain
Ruimin Gao
Hao-Li Zou
Zhekai Duan
26
3
0
31 May 2023
Vision Transformers for Mobile Applications: A Short Survey
Nahid Alam
Steven Kolawole
S. Sethi
Nishant Bansali
Karina Nguyen
ViT
18
3
0
30 May 2023
Client: Cross-variable Linear Integrated Enhanced Transformer for Multivariate Long-Term Time Series Forecasting
Jiaxin Gao
Wenbo Hu
Yuntian Chen
AI4TS
27
13
0
30 May 2023
MixDehazeNet : Mix Structure Block For Image Dehazing Network
Liping Lu
Qian Xiong
Duanfeng Chu
Bingrong Xu
35
27
0
28 May 2023
Graph Inductive Biases in Transformers without Message Passing
Liheng Ma
Chen Lin
Derek Lim
Adriana Romero Soriano
P. Dokania
Mark J. Coates
Philip H. S. Torr
Ser-Nam Lim
AI4CE
21
85
0
27 May 2023
COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models
Jinqi Xiao
Miao Yin
Yu Gong
Xiao Zang
Jian Ren
Bo Yuan
VLM
ViT
30
9
0
26 May 2023
Do We Really Need a Large Number of Visual Prompts?
Youngeun Kim
Yuhang Li
Abhishek Moitra
Ruokai Yin
Priyadarshini Panda
VLM
VPVLM
34
5
0
26 May 2023
Maskomaly:Zero-Shot Mask Anomaly Segmentation
J. Ackermann
Christos Sakaridis
F. I. F. Richard Yu
ISeg
24
23
0
26 May 2023
Gender, Smoking History and Age Prediction from Laryngeal Images
Tianxiao Zhang
A. Bur
Shannon M. Kraft
Hannah Kavookjian
Bryan Renslo
Xiangyu Chen
Bo Luo
Guanghui Wang
18
7
0
26 May 2023
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation
Shilin Yan
Renrui Zhang
Ziyu Guo
Wenchao Chen
Wei Zhang
Hongyang Li
Yu Qiao
Hao Dong
Zhongjiang He
Peng Gao
VOS
20
30
0
25 May 2023
NexToU: Efficient Topology-Aware U-Net for Medical Image Segmentation
P. Shi
Xutao Guo
Yanwu Yang
Chenfei Ye
H. T. Ma
22
5
0
25 May 2023
All Points Matter: Entropy-Regularized Distribution Alignment for Weakly-supervised 3D Segmentation
Liyao Tang
Zhe Chen
Shanshan Zhao
Chaoyue Wang
Dacheng Tao
21
12
0
25 May 2023
Knowledge Diffusion for Distillation
Tao Huang
Yuan Zhang
Mingkai Zheng
Shan You
Fei Wang
Chao Qian
Chang Xu
31
50
0
25 May 2023
Multi-Modal Mutual Attention and Iterative Interaction for Referring Image Segmentation
Chang Liu
Henghui Ding
Yulun Zhang
Xudong Jiang
21
47
0
24 May 2023
Continuous Cross-resolution Remote Sensing Image Change Detection
Hao Chen
Haotian Zhang
Keyan Chen
Chenyao Zhou
Songhang Chen
Zhengxia Zou
Z. Shi
27
21
0
24 May 2023
GRILL: Grounded Vision-language Pre-training via Aligning Text and Image Regions
Woojeong Jin
Subhabrata Mukherjee
Yu Cheng
Yelong Shen
Weizhu Chen
Ahmed Hassan Awadallah
Damien Jose
Xiang Ren
ObjD
VLM
25
8
0
24 May 2023
A multimodal method based on cross-attention and convolution for postoperative infection diagnosis
Xianjie Liu
Hon-Yi Shi
19
0
0
23 May 2023
Neural Image Re-Exposure
Xinyu Zhang
Hefei Huang
Xu Jia
D. Wang
Huchuan Lu
23
4
0
23 May 2023
VDD: Varied Drone Dataset for Semantic Segmentation
Wenxiao Cai
Ke Jin
Jinyan Hou
Cong Guo
Letian Wu
Wankou Yang
40
11
0
23 May 2023
Discovering Universal Geometry in Embeddings with ICA
Hiroaki Yamagiwa
Momose Oyama
Hidetoshi Shimodaira
23
13
0
22 May 2023
HGFormer: Hierarchical Grouping Transformer for Domain Generalized Semantic Segmentation
Jian Ding
Nan Xue
Guisong Xia
Bernt Schiele
Dengxin Dai
ViT
17
30
0
22 May 2023
Is Synthetic Data From Diffusion Models Ready for Knowledge Distillation?
Zheng Li
Yuxuan Li
Penghai Zhao
Renjie Song
Xiang Li
Jian Yang
29
19
0
22 May 2023
FIT: Far-reaching Interleaved Transformers
Ting-Li Chen
Lala Li
19
12
0
22 May 2023
A bioinspired three-stage model for camouflaged object detection
Tianyou Chen
Jin Xiao
Xiaoguang Hu
Guofeng Zhang
Shaojie Wang
22
0
0
22 May 2023
Advancing Referring Expression Segmentation Beyond Single Image
YiXuan Wu
Zhao Zhang
Xie Chi
Feng Zhu
Rui Zhao
VLM
34
18
0
21 May 2023
Bi-VLGM : Bi-Level Class-Severity-Aware Vision-Language Graph Matching for Text Guided Medical Image Segmentation
Wenting Chen
Jie Liu
Yixuan Yuan
VLM
26
3
0
20 May 2023
Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner
Zikang Liu
Sihan Chen
Longteng Guo
Handong Li
Xingjian He
J. Liu
13
1
0
19 May 2023
Learning Global-aware Kernel for Image Harmonization
Xintian Shen
Jiangning Zhang
Jun Chen
Shipeng Bai
Yue Han
Yabiao Wang
Chengjie Wang
Yong-Jin Liu
21
7
0
19 May 2023
Efficient Mixed Transformer for Single Image Super-Resolution
Ling Zheng
Jinchen Zhu
Jinpeng Shi
Shizhuang Weng
35
19
0
19 May 2023
ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Peng Wang
Shijie Wang
Junyang Lin
Shuai Bai
Xiaohuan Zhou
Jingren Zhou
Xinggang Wang
Chang Zhou
VLM
MLLM
ObjD
26
114
0
18 May 2023
Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment
Tianhe Wu
Shu Shi
Haoming Cai
Ming Cao
Jing Xiao
Yinqiang Zheng
Yujiu Yang
31
21
0
18 May 2023
Boost Vision Transformer with GPU-Friendly Sparsity and Quantization
Chong Yu
Tao Chen
Zhongxue Gan
Jiayuan Fan
MQ
ViT
25
23
0
18 May 2023
S
3
^3
3
Track: Self-supervised Tracking with Soft Assignment Flow
Fatemeh Azimi
Fahim Mannan
Felix Heide
VOT
24
0
0
17 May 2023
HICO-DET-SG and V-COCO-SG: New Data Splits for Evaluating the Systematic Generalization Performance of Human-Object Interaction Detection Models
Kenta Takemoto
Moyuru Yamada
Tomotake Sasaki
H. Akima
35
0
0
17 May 2023
OSDP: Optimal Sharded Data Parallel for Distributed Deep Learning
Youhe Jiang
Fangcheng Fu
Xupeng Miao
Xiaonan Nie
Bin Cui
22
11
0
17 May 2023
CageViT: Convolutional Activation Guided Efficient Vision Transformer
Hao Zheng
Jinbao Wang
Xiantong Zhen
H. Chen
Jingkuan Song
Feng Zheng
ViT
10
0
0
17 May 2023
Blind Image Quality Assessment via Transformer Predicted Error Map and Perceptual Quality Token
Jinsong Shi
Pan Gao
A. Smolic
ViT
13
10
0
16 May 2023
GeoMAE: Masked Geometric Target Prediction for Self-supervised Point Cloud Pre-Training
Xiaoyu Tian
Haoxi Ran
Yue Wang
Hang Zhao
3DPC
ViT
26
38
0
15 May 2023
Toward Moiré-Free and Detail-Preserving Demosaicking
Xuan-Yi Li
Y. Niu
Bo-Lu Zhao
Haoyuan Shi
Zitong An
26
1
0
15 May 2023
SB-VQA: A Stack-Based Video Quality Assessment Framework for Video Enhancement
Ding-Jiun Huang
Yu-Ting Kao
Tieh-Hung Chuang
Ya-Chun Tsai
Jing-Kai Lou
Shuen-Huei Guan
18
2
0
15 May 2023
Transavs: End-To-End Audio-Visual Segmentation With Transformer
Yuhang Ling
Yuxi Li
Zhenye Gan
Jiangning Zhang
M. Chi
Yabiao Wang
VOS
ViT
29
1
0
12 May 2023
OneCAD: One Classifier for All image Datasets using multimodal learning
S. Wadekar
Eugenio Culurciello
32
0
0
11 May 2023
Exploiting Diffusion Prior for Real-World Image Super-Resolution
Jianyi Wang
Zongsheng Yue
Shangchen Zhou
Kelvin C. K. Chan
Chen Change Loy
36
281
0
11 May 2023
Universal Source Separation with Weakly Labelled Data
Qiuqiang Kong
K. Chen
Haohe Liu
Xingjian Du
Taylor Berg-Kirkpatrick
Shlomo Dubnov
Mark D. Plumbley
16
17
0
11 May 2023
SalienDet: A Saliency-based Feature Enhancement Algorithm for Object Detection for Autonomous Driving
Ning Ding
Ce Zhang
A. Eskandarian
28
10
0
11 May 2023
Exploiting Fine-Grained DCT Representations for Hiding Image-Level Messages within JPEG Images
Junxue Yang
Xin Liao
28
5
0
11 May 2023
Patch-wise Mixed-Precision Quantization of Vision Transformer
Junrui Xiao
Zhikai Li
Lianwei Yang
Qingyi Gu
MQ
24
12
0
11 May 2023
Enhancing the Performance of Transformer-based Spiking Neural Networks by SNN-optimized Downsampling with Precise Gradient Backpropagation
Chenlin Zhou
Han Zhang
Zhaokun Zhou
Liutao Yu
Zhengyu Ma
Huihui Zhou
Xiaopeng Fan
Yonghong Tian
21
9
0
10 May 2023
Medical supervised masked autoencoders: Crafting a better masking strategy and efficient fine-tuning schedule for medical image classification
Jia-ju Mao
Shu-Hua Guo
Yuan Chang
Xuesong Yin
Binling Nie
26
2
0
10 May 2023
2
∗
n
2 * n
2
∗
n
is better than
n
2
n^2
n
2
: Decomposing Event Coreference Resolution into Two Tractable Problems
Shafiuddin Rehan Ahmed
Abhijnan Nath
James H. Martin
Nikhil Krishnaswamy
22
13
0
09 May 2023
Previous
1
2
3
...
21
22
23
...
47
48
49
Next