ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.09883
  4. Cited By
Swin Transformer V2: Scaling Up Capacity and Resolution
v1v2 (latest)

Swin Transformer V2: Scaling Up Capacity and Resolution

18 November 2021
Ze Liu
Han Hu
Yutong Lin
Zhuliang Yao
Zhenda Xie
Yixuan Wei
Jia Ning
Yue Cao
Zheng Zhang
Li Dong
Furu Wei
B. Guo
    ViT
ArXiv (abs)PDFHTMLGithub (14834★)

Papers citing "Swin Transformer V2: Scaling Up Capacity and Resolution"

50 / 933 papers shown
PromptCIR: Blind Compressed Image Restoration with Prompt Learning
PromptCIR: Blind Compressed Image Restoration with Prompt Learning
Bingchen Li
Xin Li
Yiting Lu
Ruoyu Feng
Mengxi Guo
Shijie Zhao
Li Zhang
Zhibo Chen
335
25
0
26 Apr 2024
Detection of Peri-Pancreatic Edema using Deep Learning and Radiomics
  Techniques
Detection of Peri-Pancreatic Edema using Deep Learning and Radiomics Techniques
Ziliang Hong
Debesh Jha
Koushik Biswas
Zheyu Zhang
Yury Velichko
...
Amir Borhani
Baris Turkbey
Alpay Medetalibeyoglu
Gorkem Durak
Ulas Bagci
MedIm
185
2
0
25 Apr 2024
NTIRE 2024 Quality Assessment of AI-Generated Content Challenge
NTIRE 2024 Quality Assessment of AI-Generated Content Challenge
Xiaohong Liu
Xiongkuo Min
Guangtao Zhai
Chunyi Li
Tengchuan Kou
...
Qi Yan
Youran Qu
Xiaohui Zeng
Lele Wang
Renjie Liao
380
43
0
25 Apr 2024
Mamba-360: Survey of State Space Models as Transformer Alternative for
  Long Sequence Modelling: Methods, Applications, and Challenges
Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges
Badri N. Patro
Vijay Srinivas Agneeswaran
Mamba
368
76
0
24 Apr 2024
Unexplored Faces of Robustness and Out-of-Distribution: Covariate Shifts
  in Environment and Sensor Domains
Unexplored Faces of Robustness and Out-of-Distribution: Covariate Shifts in Environment and Sensor Domains
Eunsu Baek
Keondo Park
Jiyoon Kim
Hyung-Sin Kim
OODDOOD
395
12
0
24 Apr 2024
Vision Transformer-based Adversarial Domain Adaptation
Vision Transformer-based Adversarial Domain Adaptation
Yahan Li
Yuan Wu
ViT
199
0
0
24 Apr 2024
CKGConv: General Graph Convolution with Continuous Kernels
CKGConv: General Graph Convolution with Continuous Kernels
Liheng Ma
Soumyasundar Pal
Yitian Zhang
Jiaming Zhou
Yingxue Zhang
Mark Coates
212
8
0
21 Apr 2024
Look, Listen, and Answer: Overcoming Biases for Audio-Visual Question Answering
Look, Listen, and Answer: Overcoming Biases for Audio-Visual Question Answering
Jie Ma
Min Hu
Pinghui Wang
Wangchun Sun
Lingyun Song
Hongbin Pei
Jun Liu
Youtian Du
502
17
0
18 Apr 2024
NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods
  and Results
NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results
Xin Li
Kun Yuan
Yajing Pei
Yiting Lu
Ming Sun
...
Kele Xu
Qisheng Xu
Tao Sun
Zhi-Guo Ding
Yuhan Hu
305
44
0
17 Apr 2024
The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report
The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report
Bin Ren
Yawei Li
Nancy Mehta
Radu Timofte
Hongyuan Yu
...
P. Yashaswini
Chaitra Desai
R. Tabib
Ujwala Patil
U. Mudenagudi
SupR
260
77
0
16 Apr 2024
Masked Autoencoders for Microscopy are Scalable Learners of Cellular
  Biology
Masked Autoencoders for Microscopy are Scalable Learners of Cellular Biology
Oren Z. Kraus
Kian Kenyon-Dean
Saber Saberian
Maryam Fallah
Peter McLean
...
Chi Vicky Cheng
Kristen Morse
Maureen Makes
Ben Mabey
Berton Earnshaw
211
54
0
16 Apr 2024
Deep Learning and LLM-based Methods Applied to Stellar Lightcurve Classification
Deep Learning and LLM-based Methods Applied to Stellar Lightcurve Classification
Yu-Yang Li
Yu Bai
Cunshi Wang
Mengwei Qu
Ziteng Lu
Roberto Soria
Jifeng Liu
193
2
0
16 Apr 2024
XoFTR: Cross-modal Feature Matching Transformer
XoFTR: Cross-modal Feature Matching Transformer
Önder Tuzcuoglu
Aybora Köksal
Bugra Sofu
Sinan Kalkan
A. Aydin Alatan
ViT
168
39
0
15 Apr 2024
In My Perspective, In My Hands: Accurate Egocentric 2D Hand Pose and
  Action Recognition
In My Perspective, In My Hands: Accurate Egocentric 2D Hand Pose and Action Recognition
Wiktor Mucha
Martin Kampel
EgoV
302
10
0
14 Apr 2024
AMU-Tuning: Effective Logit Bias for CLIP-based Few-shot Learning
AMU-Tuning: Effective Logit Bias for CLIP-based Few-shot Learning
Yuwei Tang
Zhenyi Lin
Qilong Wang
Q. Hu
Qinghua Hu
204
24
0
13 Apr 2024
Megalodon: Efficient LLM Pretraining and Inference with Unlimited
  Context Length
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length
Xuezhe Ma
Xiaomeng Yang
Wenhan Xiong
Beidi Chen
Lili Yu
Hao Zhang
Jonathan May
Luke Zettlemoyer
Omer Levy
Chunting Zhou
210
51
0
12 Apr 2024
Emerging Property of Masked Token for Effective Pre-training
Emerging Property of Masked Token for Effective Pre-training
Hyesong Choi
Hunsang Lee
Seyoung Joung
Hyejin Park
Jiyeong Kim
Dongbo Min
170
10
0
12 Apr 2024
Implicit and Explicit Language Guidance for Diffusion-based Visual
  Perception
Implicit and Explicit Language Guidance for Diffusion-based Visual Perception
Hefeng Wang
Jiale Cao
Jin Xie
Aiping Yang
Yanwei Pang
VLMDiffM
269
2
0
11 Apr 2024
ConsistencyDet: A Few-step Denoising Framework for Object Detection Using the Consistency Model
ConsistencyDet: A Few-step Denoising Framework for Object Detection Using the Consistency Model
Lifan Jiang
Zhihui Wang
Changmiao Wang
Ming Li
Jiaxu Leng
DiffM
341
0
0
11 Apr 2024
Improving Facial Landmark Detection Accuracy and Efficiency with
  Knowledge Distillation
Improving Facial Landmark Detection Accuracy and Efficiency with Knowledge Distillation
Zong-Wei Hong
Yu-Chen Lin
207
2
0
09 Apr 2024
Lightweight Deep Learning for Resource-Constrained Environments: A
  Survey
Lightweight Deep Learning for Resource-Constrained Environments: A Survey
Hou-I Liu
Marco Galindo
Hongxia Xie
Lai-Kuan Wong
Hong-Han Shuai
Yung-Hui Li
Wen-Huang Cheng
369
160
0
08 Apr 2024
Bidirectional Long-Range Parser for Sequential Data Understanding
Bidirectional Long-Range Parser for Sequential Data Understanding
George Leotescu
Daniel Voinea
A. Popa
216
1
0
08 Apr 2024
HSViT: Horizontally Scalable Vision Transformer
HSViT: Horizontally Scalable Vision Transformer
Chenhao Xu
Chang-Tsun Li
Chee Peng Lim
Douglas Creighton
ViT
243
6
0
08 Apr 2024
JDEC: JPEG Decoding via Enhanced Continuous Cosine Coefficients
JDEC: JPEG Decoding via Enhanced Continuous Cosine CoefficientsComputer Vision and Pattern Recognition (CVPR), 2024
Woo Kyoung Han
Sunghoon Im
Jaedeok Kim
Kyong Hwan Jin
235
3
0
03 Apr 2024
CAPE: CAM as a Probabilistic Ensemble for Enhanced DNN Interpretation
CAPE: CAM as a Probabilistic Ensemble for Enhanced DNN InterpretationComputer Vision and Pattern Recognition (CVPR), 2024
T. Chowdhury
Kewen Liao
Vu Minh Hieu Phan
Minh-Son To
Yutong Xie
Kevin Hung
David Ross
Anton Van Den Hengel
Johan Verjans
Zhibin Liao
261
3
0
03 Apr 2024
Semi-Supervised Unconstrained Head Pose Estimation in the Wild
Semi-Supervised Unconstrained Head Pose Estimation in the Wild
Huayi Zhou
Fei Jiang
Jin Yuan
Yong Rui
Hongtao Lu
Kui Jia
553
1
0
03 Apr 2024
Scene Adaptive Sparse Transformer for Event-based Object Detection
Scene Adaptive Sparse Transformer for Event-based Object DetectionComputer Vision and Pattern Recognition (CVPR), 2024
Yansong Peng
Hebei Li
Yueyi Zhang
Xiaoyan Sun
Feng Wu
ViT
209
41
0
02 Apr 2024
DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery
DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery
Yixuan Zhu
Ao Li
Yansong Tang
Wenliang Zhao
Jie Zhou
Jiwen Lu
172
16
0
01 Apr 2024
Bridging Remote Sensors with Multisensor Geospatial Foundation Models
Bridging Remote Sensors with Multisensor Geospatial Foundation Models
Boran Han
Shuai Zhang
Xingjian Shi
Markus Reichstein
258
45
0
01 Apr 2024
Rethinking Saliency-Guided Weakly-Supervised Semantic Segmentation
Rethinking Saliency-Guided Weakly-Supervised Semantic Segmentation
Beomyoung Kim
Donghyeon Kim
Sung Ju Hwang
524
0
0
01 Apr 2024
SpiralMLP: A Lightweight Vision MLP Architecture
SpiralMLP: A Lightweight Vision MLP Architecture
Haojie Mu
Burhan Ul Tayyab
Nicholas Chua
223
1
0
31 Mar 2024
DailyMAE: Towards Pretraining Masked Autoencoders in One Day
DailyMAE: Towards Pretraining Masked Autoencoders in One Day
Jiantao Wu
Shentong Mo
Sara Atito
Zhenhua Feng
Josef Kittler
Muhammad Awais
229
4
0
31 Mar 2024
On Inherent Adversarial Robustness of Active Vision Systems
On Inherent Adversarial Robustness of Active Vision Systems
Amitangshu Mukherjee
Timur Ibrayev
Kaushik Roy
AAML
220
1
0
29 Mar 2024
MambaMixer: Efficient Selective State Space Models with Dual Token and
  Channel Selection
MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection
Ali Behrouz
Michele Santacatterina
Ramin Zabih
457
46
0
29 Mar 2024
ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth
  Estimation
ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation
Suraj Patni
Aradhye Agarwal
Chetan Arora
VLMDiffMMDE
338
50
0
27 Mar 2024
ViTAR: Vision Transformer with Any Resolution
ViTAR: Vision Transformer with Any Resolution
Qihang Fan
Quanzeng You
Xiaotian Han
Yongfei Liu
Yunzhe Tao
Huaibo Huang
Ran He
Hongxia Yang
ViT
349
20
0
27 Mar 2024
Heracles: A Hybrid SSM-Transformer Model for High-Resolution Image and
  Time-Series Analysis
Heracles: A Hybrid SSM-Transformer Model for High-Resolution Image and Time-Series Analysis
Badri N. Patro
Suhas Ranganath
Vinay P. Namboodiri
Vijay Srinivas Agneeswaran
311
4
0
26 Mar 2024
Deepfake Generation and Detection: A Benchmark and Survey
Deepfake Generation and Detection: A Benchmark and Survey
Gan Pei
Jiangning Zhang
Menghan Hu
Ying Tai
Chengjie Wang
Yunsheng Wu
Guangtao Zhai
Jian Yang
Chunhua Shen
Dacheng Tao
375
85
0
26 Mar 2024
Integrating Mamba Sequence Model and Hierarchical Upsampling Network for
  Accurate Semantic Segmentation of Multiple Sclerosis Legion
Integrating Mamba Sequence Model and Hierarchical Upsampling Network for Accurate Semantic Segmentation of Multiple Sclerosis Legion
Kazi Shahriar Sanjid
Md. Tanzim Hossain
Md. Shakib Shahariar Junayed
M. M. Uddin
Mamba
188
9
0
26 Mar 2024
PaPr: Training-Free One-Step Patch Pruning with Lightweight ConvNets for
  Faster Inference
PaPr: Training-Free One-Step Patch Pruning with Lightweight ConvNets for Faster InferenceEuropean Conference on Computer Vision (ECCV), 2024
Tanvir Mahmud
Burhaneddin Yaman
Chun-Hao Liu
Diana Marculescu
433
7
0
24 Mar 2024
SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate
  Time series
SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series
Badri N. Patro
Vijay Srinivas Agneeswaran
Mamba
351
71
0
22 Mar 2024
ParFormer: Vision Transformer Baseline with Parallel Local Global Token
  Mixer and Convolution Attention Patch Embedding
ParFormer: Vision Transformer Baseline with Parallel Local Global Token Mixer and Convolution Attention Patch Embedding
Novendra Setyawan
Ghufron Wahyu Kurniawan
Chi-Chia Sun
Jun-Wei Hsieh
Hui-Kai Su
W. Kuo
ViTMoE
259
0
0
22 Mar 2024
WeatherProof: Leveraging Language Guidance for Semantic Segmentation in
  Adverse Weather
WeatherProof: Leveraging Language Guidance for Semantic Segmentation in Adverse Weather
Blake Gella
Howard Zhang
Rishi Upadhyay
Tiffany Chang
Nathan Wei
Matthew Waliman
Yunhao Bao
C. Melo
Alex Wong
A. Kadambi
179
0
0
21 Mar 2024
Token Transformation Matters: Towards Faithful Post-hoc Explanation for
  Vision Transformer
Token Transformation Matters: Towards Faithful Post-hoc Explanation for Vision Transformer
Junyi Wu
Bin Duan
Weitai Kang
Hao Tang
Yan Yan
219
16
0
21 Mar 2024
Learning to Project for Cross-Task Knowledge Distillation
Learning to Project for Cross-Task Knowledge Distillation
Dylan Auty
Roy Miles
Benedikt Kolbeinsson
K. Mikolajczyk
229
0
0
21 Mar 2024
Style-Extracting Diffusion Models for Semi-Supervised Histopathology
  Segmentation
Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation
Mathias Öttl
Frauke Wilm
Jana Steenpass
Jingna Qiu
M. Rübner
...
Peter Fasching
Andreas Maier
R. Erber
Bernhard Kainz
Katharina Breininger
DiffMMedIm
147
5
0
21 Mar 2024
TexTile: A Differentiable Metric for Texture Tileability
TexTile: A Differentiable Metric for Texture Tileability
Carlos Rodriguez-Pardo
Dan Casas
Elena Garces
Jorge López-Moreno
DiffM
251
8
0
19 Mar 2024
DreamDA: Generative Data Augmentation with Diffusion Models
DreamDA: Generative Data Augmentation with Diffusion Models
Yunxiang Fu
Chaoqi Chen
Yu Qiao
Yizhou Yu
VLMDiffM
219
24
0
19 Mar 2024
Dynamic Tuning Towards Parameter and Inference Efficiency for ViT
  Adaptation
Dynamic Tuning Towards Parameter and Inference Efficiency for ViT AdaptationNeural Information Processing Systems (NeurIPS), 2024
Wangbo Zhao
Jiasheng Tang
Yizeng Han
Yibing Song
Kai Wang
Gao Huang
F. Wang
Yang You
335
23
0
18 Mar 2024
Gradient based Feature Attribution in Explainable AI: A Technical Review
Gradient based Feature Attribution in Explainable AI: A Technical Review
Yongjie Wang
Tong Zhang
Xu Guo
Zhiqi Shen
XAI
286
45
0
15 Mar 2024
Previous
123...789...171819
Next
Page 8 of 19
Pageof 19