ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.14030
  4. Cited By
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

25 March 2021
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng-Wei Zhang
Stephen Lin
B. Guo
    ViT
ArXivPDFHTML

Papers citing "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"

50 / 2,425 papers shown
Title
CVSNet: A Computer Implementation for Central Visual System of The Brain
CVSNet: A Computer Implementation for Central Visual System of The Brain
Ruimin Gao
Hao-Li Zou
Zhekai Duan
26
3
0
31 May 2023
Vision Transformers for Mobile Applications: A Short Survey
Vision Transformers for Mobile Applications: A Short Survey
Nahid Alam
Steven Kolawole
S. Sethi
Nishant Bansali
Karina Nguyen
ViT
18
3
0
30 May 2023
Client: Cross-variable Linear Integrated Enhanced Transformer for
  Multivariate Long-Term Time Series Forecasting
Client: Cross-variable Linear Integrated Enhanced Transformer for Multivariate Long-Term Time Series Forecasting
Jiaxin Gao
Wenbo Hu
Yuntian Chen
AI4TS
27
13
0
30 May 2023
MixDehazeNet : Mix Structure Block For Image Dehazing Network
MixDehazeNet : Mix Structure Block For Image Dehazing Network
Liping Lu
Qian Xiong
Duanfeng Chu
Bingrong Xu
35
27
0
28 May 2023
Graph Inductive Biases in Transformers without Message Passing
Graph Inductive Biases in Transformers without Message Passing
Liheng Ma
Chen Lin
Derek Lim
Adriana Romero Soriano
P. Dokania
Mark J. Coates
Philip H. S. Torr
Ser-Nam Lim
AI4CE
21
85
0
27 May 2023
COMCAT: Towards Efficient Compression and Customization of
  Attention-Based Vision Models
COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models
Jinqi Xiao
Miao Yin
Yu Gong
Xiao Zang
Jian Ren
Bo Yuan
VLM
ViT
30
9
0
26 May 2023
Do We Really Need a Large Number of Visual Prompts?
Do We Really Need a Large Number of Visual Prompts?
Youngeun Kim
Yuhang Li
Abhishek Moitra
Ruokai Yin
Priyadarshini Panda
VLM
VPVLM
34
5
0
26 May 2023
Maskomaly:Zero-Shot Mask Anomaly Segmentation
Maskomaly:Zero-Shot Mask Anomaly Segmentation
J. Ackermann
Christos Sakaridis
F. I. F. Richard Yu
ISeg
24
23
0
26 May 2023
Gender, Smoking History and Age Prediction from Laryngeal Images
Gender, Smoking History and Age Prediction from Laryngeal Images
Tianxiao Zhang
A. Bur
Shannon M. Kraft
Hannah Kavookjian
Bryan Renslo
Xiangyu Chen
Bo Luo
Guanghui Wang
18
7
0
26 May 2023
Referred by Multi-Modality: A Unified Temporal Transformer for Video
  Object Segmentation
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation
Shilin Yan
Renrui Zhang
Ziyu Guo
Wenchao Chen
Wei Zhang
Hongyang Li
Yu Qiao
Hao Dong
Zhongjiang He
Peng Gao
VOS
20
30
0
25 May 2023
NexToU: Efficient Topology-Aware U-Net for Medical Image Segmentation
NexToU: Efficient Topology-Aware U-Net for Medical Image Segmentation
P. Shi
Xutao Guo
Yanwu Yang
Chenfei Ye
H. T. Ma
22
5
0
25 May 2023
All Points Matter: Entropy-Regularized Distribution Alignment for
  Weakly-supervised 3D Segmentation
All Points Matter: Entropy-Regularized Distribution Alignment for Weakly-supervised 3D Segmentation
Liyao Tang
Zhe Chen
Shanshan Zhao
Chaoyue Wang
Dacheng Tao
21
12
0
25 May 2023
Knowledge Diffusion for Distillation
Knowledge Diffusion for Distillation
Tao Huang
Yuan Zhang
Mingkai Zheng
Shan You
Fei Wang
Chao Qian
Chang Xu
31
50
0
25 May 2023
Multi-Modal Mutual Attention and Iterative Interaction for Referring
  Image Segmentation
Multi-Modal Mutual Attention and Iterative Interaction for Referring Image Segmentation
Chang Liu
Henghui Ding
Yulun Zhang
Xudong Jiang
21
47
0
24 May 2023
Continuous Cross-resolution Remote Sensing Image Change Detection
Continuous Cross-resolution Remote Sensing Image Change Detection
Hao Chen
Haotian Zhang
Keyan Chen
Chenyao Zhou
Songhang Chen
Zhengxia Zou
Z. Shi
27
21
0
24 May 2023
GRILL: Grounded Vision-language Pre-training via Aligning Text and Image
  Regions
GRILL: Grounded Vision-language Pre-training via Aligning Text and Image Regions
Woojeong Jin
Subhabrata Mukherjee
Yu Cheng
Yelong Shen
Weizhu Chen
Ahmed Hassan Awadallah
Damien Jose
Xiang Ren
ObjD
VLM
25
8
0
24 May 2023
A multimodal method based on cross-attention and convolution for
  postoperative infection diagnosis
A multimodal method based on cross-attention and convolution for postoperative infection diagnosis
Xianjie Liu
Hon-Yi Shi
19
0
0
23 May 2023
Neural Image Re-Exposure
Neural Image Re-Exposure
Xinyu Zhang
Hefei Huang
Xu Jia
D. Wang
Huchuan Lu
23
4
0
23 May 2023
VDD: Varied Drone Dataset for Semantic Segmentation
VDD: Varied Drone Dataset for Semantic Segmentation
Wenxiao Cai
Ke Jin
Jinyan Hou
Cong Guo
Letian Wu
Wankou Yang
40
11
0
23 May 2023
Discovering Universal Geometry in Embeddings with ICA
Discovering Universal Geometry in Embeddings with ICA
Hiroaki Yamagiwa
Momose Oyama
Hidetoshi Shimodaira
23
13
0
22 May 2023
HGFormer: Hierarchical Grouping Transformer for Domain Generalized
  Semantic Segmentation
HGFormer: Hierarchical Grouping Transformer for Domain Generalized Semantic Segmentation
Jian Ding
Nan Xue
Guisong Xia
Bernt Schiele
Dengxin Dai
ViT
17
30
0
22 May 2023
Is Synthetic Data From Diffusion Models Ready for Knowledge
  Distillation?
Is Synthetic Data From Diffusion Models Ready for Knowledge Distillation?
Zheng Li
Yuxuan Li
Penghai Zhao
Renjie Song
Xiang Li
Jian Yang
29
19
0
22 May 2023
FIT: Far-reaching Interleaved Transformers
FIT: Far-reaching Interleaved Transformers
Ting-Li Chen
Lala Li
19
12
0
22 May 2023
A bioinspired three-stage model for camouflaged object detection
A bioinspired three-stage model for camouflaged object detection
Tianyou Chen
Jin Xiao
Xiaoguang Hu
Guofeng Zhang
Shaojie Wang
22
0
0
22 May 2023
Advancing Referring Expression Segmentation Beyond Single Image
Advancing Referring Expression Segmentation Beyond Single Image
YiXuan Wu
Zhao Zhang
Xie Chi
Feng Zhu
Rui Zhao
VLM
34
18
0
21 May 2023
Bi-VLGM : Bi-Level Class-Severity-Aware Vision-Language Graph Matching
  for Text Guided Medical Image Segmentation
Bi-VLGM : Bi-Level Class-Severity-Aware Vision-Language Graph Matching for Text Guided Medical Image Segmentation
Wenting Chen
Jie Liu
Yixuan Yuan
VLM
26
3
0
20 May 2023
Enhancing Vision-Language Pre-Training with Jointly Learned Questioner
  and Dense Captioner
Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner
Zikang Liu
Sihan Chen
Longteng Guo
Handong Li
Xingjian He
J. Liu
13
1
0
19 May 2023
Learning Global-aware Kernel for Image Harmonization
Learning Global-aware Kernel for Image Harmonization
Xintian Shen
Jiangning Zhang
Jun Chen
Shipeng Bai
Yue Han
Yabiao Wang
Chengjie Wang
Yong-Jin Liu
21
7
0
19 May 2023
Efficient Mixed Transformer for Single Image Super-Resolution
Efficient Mixed Transformer for Single Image Super-Resolution
Ling Zheng
Jinchen Zhu
Jinpeng Shi
Shizhuang Weng
35
19
0
19 May 2023
ONE-PEACE: Exploring One General Representation Model Toward Unlimited
  Modalities
ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Peng Wang
Shijie Wang
Junyang Lin
Shuai Bai
Xiaohuan Zhou
Jingren Zhou
Xinggang Wang
Chang Zhou
VLM
MLLM
ObjD
26
114
0
18 May 2023
Assessor360: Multi-sequence Network for Blind Omnidirectional Image
  Quality Assessment
Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment
Tianhe Wu
Shu Shi
Haoming Cai
Ming Cao
Jing Xiao
Yinqiang Zheng
Yujiu Yang
31
21
0
18 May 2023
Boost Vision Transformer with GPU-Friendly Sparsity and Quantization
Boost Vision Transformer with GPU-Friendly Sparsity and Quantization
Chong Yu
Tao Chen
Zhongxue Gan
Jiayuan Fan
MQ
ViT
25
23
0
18 May 2023
S$^3$Track: Self-supervised Tracking with Soft Assignment Flow
S3^33Track: Self-supervised Tracking with Soft Assignment Flow
Fatemeh Azimi
Fahim Mannan
Felix Heide
VOT
24
0
0
17 May 2023
HICO-DET-SG and V-COCO-SG: New Data Splits for Evaluating the Systematic
  Generalization Performance of Human-Object Interaction Detection Models
HICO-DET-SG and V-COCO-SG: New Data Splits for Evaluating the Systematic Generalization Performance of Human-Object Interaction Detection Models
Kenta Takemoto
Moyuru Yamada
Tomotake Sasaki
H. Akima
35
0
0
17 May 2023
OSDP: Optimal Sharded Data Parallel for Distributed Deep Learning
Youhe Jiang
Fangcheng Fu
Xupeng Miao
Xiaonan Nie
Bin Cui
22
11
0
17 May 2023
CageViT: Convolutional Activation Guided Efficient Vision Transformer
CageViT: Convolutional Activation Guided Efficient Vision Transformer
Hao Zheng
Jinbao Wang
Xiantong Zhen
H. Chen
Jingkuan Song
Feng Zheng
ViT
10
0
0
17 May 2023
Blind Image Quality Assessment via Transformer Predicted Error Map and
  Perceptual Quality Token
Blind Image Quality Assessment via Transformer Predicted Error Map and Perceptual Quality Token
Jinsong Shi
Pan Gao
A. Smolic
ViT
13
10
0
16 May 2023
GeoMAE: Masked Geometric Target Prediction for Self-supervised Point
  Cloud Pre-Training
GeoMAE: Masked Geometric Target Prediction for Self-supervised Point Cloud Pre-Training
Xiaoyu Tian
Haoxi Ran
Yue Wang
Hang Zhao
3DPC
ViT
26
38
0
15 May 2023
Toward Moiré-Free and Detail-Preserving Demosaicking
Toward Moiré-Free and Detail-Preserving Demosaicking
Xuan-Yi Li
Y. Niu
Bo-Lu Zhao
Haoyuan Shi
Zitong An
26
1
0
15 May 2023
SB-VQA: A Stack-Based Video Quality Assessment Framework for Video
  Enhancement
SB-VQA: A Stack-Based Video Quality Assessment Framework for Video Enhancement
Ding-Jiun Huang
Yu-Ting Kao
Tieh-Hung Chuang
Ya-Chun Tsai
Jing-Kai Lou
Shuen-Huei Guan
18
2
0
15 May 2023
Transavs: End-To-End Audio-Visual Segmentation With Transformer
Transavs: End-To-End Audio-Visual Segmentation With Transformer
Yuhang Ling
Yuxi Li
Zhenye Gan
Jiangning Zhang
M. Chi
Yabiao Wang
VOS
ViT
29
1
0
12 May 2023
OneCAD: One Classifier for All image Datasets using multimodal learning
OneCAD: One Classifier for All image Datasets using multimodal learning
S. Wadekar
Eugenio Culurciello
32
0
0
11 May 2023
Exploiting Diffusion Prior for Real-World Image Super-Resolution
Exploiting Diffusion Prior for Real-World Image Super-Resolution
Jianyi Wang
Zongsheng Yue
Shangchen Zhou
Kelvin C. K. Chan
Chen Change Loy
36
281
0
11 May 2023
Universal Source Separation with Weakly Labelled Data
Universal Source Separation with Weakly Labelled Data
Qiuqiang Kong
K. Chen
Haohe Liu
Xingjian Du
Taylor Berg-Kirkpatrick
Shlomo Dubnov
Mark D. Plumbley
16
17
0
11 May 2023
SalienDet: A Saliency-based Feature Enhancement Algorithm for Object
  Detection for Autonomous Driving
SalienDet: A Saliency-based Feature Enhancement Algorithm for Object Detection for Autonomous Driving
Ning Ding
Ce Zhang
A. Eskandarian
28
10
0
11 May 2023
Exploiting Fine-Grained DCT Representations for Hiding Image-Level
  Messages within JPEG Images
Exploiting Fine-Grained DCT Representations for Hiding Image-Level Messages within JPEG Images
Junxue Yang
Xin Liao
28
5
0
11 May 2023
Patch-wise Mixed-Precision Quantization of Vision Transformer
Patch-wise Mixed-Precision Quantization of Vision Transformer
Junrui Xiao
Zhikai Li
Lianwei Yang
Qingyi Gu
MQ
24
12
0
11 May 2023
Enhancing the Performance of Transformer-based Spiking Neural Networks
  by SNN-optimized Downsampling with Precise Gradient Backpropagation
Enhancing the Performance of Transformer-based Spiking Neural Networks by SNN-optimized Downsampling with Precise Gradient Backpropagation
Chenlin Zhou
Han Zhang
Zhaokun Zhou
Liutao Yu
Zhengyu Ma
Huihui Zhou
Xiaopeng Fan
Yonghong Tian
21
9
0
10 May 2023
Medical supervised masked autoencoders: Crafting a better masking
  strategy and efficient fine-tuning schedule for medical image classification
Medical supervised masked autoencoders: Crafting a better masking strategy and efficient fine-tuning schedule for medical image classification
Jia-ju Mao
Shu-Hua Guo
Yuan Chang
Xuesong Yin
Binling Nie
26
2
0
10 May 2023
$2 * n$ is better than $n^2$: Decomposing Event Coreference Resolution
  into Two Tractable Problems
2∗n2 * n2∗n is better than n2n^2n2: Decomposing Event Coreference Resolution into Two Tractable Problems
Shafiuddin Rehan Ahmed
Abhijnan Nath
James H. Martin
Nikhil Krishnaswamy
22
13
0
09 May 2023
Previous
123...212223...474849
Next