Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2111.09883
Cited By

Swin Transformer V2: Scaling Up Capacity and Resolution

v1v2 (latest)

Swin Transformer V2: Scaling Up Capacity and Resolution

18 November 2021

ArXiv (abs)PDF HTML Github (14834★)

Papers citing "Swin Transformer V2: Scaling Up Capacity and Resolution"

50 / 933 papers shown

PromptCIR: Blind Compressed Image Restoration with Prompt Learning

PromptCIR: Blind Compressed Image Restoration with Prompt Learning

335

25

0

26 Apr 2024

Detection of Peri-Pancreatic Edema using Deep Learning and Radiomics
Techniques

Detection of Peri-Pancreatic Edema using Deep Learning and Radiomics Techniques

Debesh Jha

Koushik Biswas

...

Alpay Medetalibeyoglu

Gorkem Durak

185

2

0

25 Apr 2024

NTIRE 2024 Quality Assessment of AI-Generated Content Challenge

NTIRE 2024 Quality Assessment of AI-Generated Content Challenge

Xiaohong Liu

Xiongkuo Min

Guangtao Zhai

...

380

43

0

25 Apr 2024

Mamba-360: Survey of State Space Models as Transformer Alternative for
Long Sequence Modelling: Methods, Applications, and Challenges

Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges

Vijay Srinivas Agneeswaran

368

76

0

24 Apr 2024

Unexplored Faces of Robustness and Out-of-Distribution: Covariate Shifts
in Environment and Sensor Domains

Unexplored Faces of Robustness and Out-of-Distribution: Covariate Shifts in Environment and Sensor Domains

395

12

0

24 Apr 2024

Vision Transformer-based Adversarial Domain Adaptation

Vision Transformer-based Adversarial Domain Adaptation

199

0

0

24 Apr 2024

CKGConv: General Graph Convolution with Continuous Kernels

CKGConv: General Graph Convolution with Continuous Kernels

Soumyasundar Pal

Yitian Zhang

212

8

0

21 Apr 2024

Look, Listen, and Answer: Overcoming Biases for Audio-Visual Question Answering

Look, Listen, and Answer: Overcoming Biases for Audio-Visual Question Answering

Pinghui Wang

Lingyun Song

502

17

0

18 Apr 2024

NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods
and Results

NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results

Xin Li

...

305

44

0

17 Apr 2024

The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

Yawei Li

Radu Timofte

...

260

77

0

16 Apr 2024

Masked Autoencoders for Microscopy are Scalable Learners of Cellular
Biology

Masked Autoencoders for Microscopy are Scalable Learners of Cellular Biology

Kian Kenyon-Dean

...

Chi Vicky Cheng

Berton Earnshaw

211

54

0

16 Apr 2024

Deep Learning and LLM-based Methods Applied to Stellar Lightcurve Classification

Deep Learning and LLM-based Methods Applied to Stellar Lightcurve Classification

193

2

0

16 Apr 2024

XoFTR: Cross-modal Feature Matching Transformer

XoFTR: Cross-modal Feature Matching Transformer

Önder Tuzcuoglu

A. Aydin Alatan

168

39

0

15 Apr 2024

In My Perspective, In My Hands: Accurate Egocentric 2D Hand Pose and
Action Recognition

In My Perspective, In My Hands: Accurate Egocentric 2D Hand Pose and Action Recognition

Martin Kampel

302

10

0

14 Apr 2024

AMU-Tuning: Effective Logit Bias for CLIP-based Few-shot Learning

AMU-Tuning: Effective Logit Bias for CLIP-based Few-shot Learning

204

24

0

13 Apr 2024

Megalodon: Efficient LLM Pretraining and Inference with Unlimited
Context Length

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Hao Zhang

Luke Zettlemoyer

210

51

0

12 Apr 2024

Emerging Property of Masked Token for Effective Pre-training

Emerging Property of Masked Token for Effective Pre-training

Hyesong Choi

Hyejin Park

Dongbo Min

170

10

0

12 Apr 2024

Implicit and Explicit Language Guidance for Diffusion-based Visual
Perception

Implicit and Explicit Language Guidance for Diffusion-based Visual Perception

Jiale Cao

Jin Xie

269

2

0

11 Apr 2024

ConsistencyDet: A Few-step Denoising Framework for Object Detection Using the Consistency Model

ConsistencyDet: A Few-step Denoising Framework for Object Detection Using the Consistency Model

341

0

0

11 Apr 2024

Improving Facial Landmark Detection Accuracy and Efficiency with
Knowledge Distillation

Improving Facial Landmark Detection Accuracy and Efficiency with Knowledge Distillation

207

2

0

09 Apr 2024

Lightweight Deep Learning for Resource-Constrained Environments: A
Survey

Lightweight Deep Learning for Resource-Constrained Environments: A Survey

Wen-Huang Cheng

369

160

0

08 Apr 2024

Bidirectional Long-Range Parser for Sequential Data Understanding

Bidirectional Long-Range Parser for Sequential Data Understanding

George Leotescu

216

1

0

08 Apr 2024

HSViT: Horizontally Scalable Vision Transformer

HSViT: Horizontally Scalable Vision Transformer

Douglas Creighton

243

6

0

08 Apr 2024

JDEC: JPEG Decoding via Enhanced Continuous Cosine Coefficients

JDEC: JPEG Decoding via Enhanced Continuous Cosine CoefficientsComputer Vision and Pattern Recognition (CVPR), 2024

235

3

0

03 Apr 2024

CAPE: CAM as a Probabilistic Ensemble for Enhanced DNN Interpretation

CAPE: CAM as a Probabilistic Ensemble for Enhanced DNN InterpretationComputer Vision and Pattern Recognition (CVPR), 2024

Vu Minh Hieu Phan

Anton Van Den Hengel

261

3

0

03 Apr 2024

Semi-Supervised Unconstrained Head Pose Estimation in the Wild

Semi-Supervised Unconstrained Head Pose Estimation in the Wild

553

1

0

03 Apr 2024

Scene Adaptive Sparse Transformer for Event-based Object Detection

Scene Adaptive Sparse Transformer for Event-based Object DetectionComputer Vision and Pattern Recognition (CVPR), 2024

Yueyi Zhang

209

41

0

02 Apr 2024

DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery

DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery

Wenliang Zhao

172

16

0

01 Apr 2024

Bridging Remote Sensors with Multisensor Geospatial Foundation Models

Bridging Remote Sensors with Multisensor Geospatial Foundation Models

Markus Reichstein

258

45

0

01 Apr 2024

Rethinking Saliency-Guided Weakly-Supervised Semantic Segmentation

Rethinking Saliency-Guided Weakly-Supervised Semantic Segmentation

524

0

0

01 Apr 2024

SpiralMLP: A Lightweight Vision MLP Architecture

SpiralMLP: A Lightweight Vision MLP Architecture

Burhan Ul Tayyab

223

1

0

31 Mar 2024

DailyMAE: Towards Pretraining Masked Autoencoders in One Day

DailyMAE: Towards Pretraining Masked Autoencoders in One Day

Shentong Mo

229

4

0

31 Mar 2024

On Inherent Adversarial Robustness of Active Vision Systems

On Inherent Adversarial Robustness of Active Vision Systems

Amitangshu Mukherjee

220

1

0

29 Mar 2024

MambaMixer: Efficient Selective State Space Models with Dual Token and
Channel Selection

MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection

Michele Santacatterina

457

46

0

29 Mar 2024

ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth
Estimation

ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation

Aradhye Agarwal

338

50

0

27 Mar 2024

ViTAR: Vision Transformer with Any Resolution

ViTAR: Vision Transformer with Any Resolution

Hongxia Yang

349

20

0

27 Mar 2024

Heracles: A Hybrid SSM-Transformer Model for High-Resolution Image and
Time-Series Analysis

Heracles: A Hybrid SSM-Transformer Model for High-Resolution Image and Time-Series Analysis

Suhas Ranganath

Vinay P. Namboodiri

Vijay Srinivas Agneeswaran

311

4

0

26 Mar 2024

Deepfake Generation and Detection: A Benchmark and Survey

Deepfake Generation and Detection: A Benchmark and Survey

Jiangning Zhang

Chengjie Wang

Guangtao Zhai

Jian Yang

Chunhua Shen

375

85

0

26 Mar 2024

Integrating Mamba Sequence Model and Hierarchical Upsampling Network for
Accurate Semantic Segmentation of Multiple Sclerosis Legion

Integrating Mamba Sequence Model and Hierarchical Upsampling Network for Accurate Semantic Segmentation of Multiple Sclerosis Legion

Kazi Shahriar Sanjid

Md. Tanzim Hossain

Md. Shakib Shahariar Junayed

188

9

0

26 Mar 2024

PaPr: Training-Free One-Step Patch Pruning with Lightweight ConvNets for
Faster Inference

PaPr: Training-Free One-Step Patch Pruning with Lightweight ConvNets for Faster InferenceEuropean Conference on Computer Vision (ECCV), 2024

Burhaneddin Yaman

Diana Marculescu

433

7

0

24 Mar 2024

SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate
Time series

SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series

Vijay Srinivas Agneeswaran

351

71

0

22 Mar 2024

ParFormer: Vision Transformer Baseline with Parallel Local Global Token
Mixer and Convolution Attention Patch Embedding

ParFormer: Vision Transformer Baseline with Parallel Local Global Token Mixer and Convolution Attention Patch Embedding

Novendra Setyawan

Ghufron Wahyu Kurniawan

259

0

0

22 Mar 2024

WeatherProof: Leveraging Language Guidance for Semantic Segmentation in
Adverse Weather

WeatherProof: Leveraging Language Guidance for Semantic Segmentation in Adverse Weather

Matthew Waliman

179

0

0

21 Mar 2024

Token Transformation Matters: Towards Faithful Post-hoc Explanation for
Vision Transformer

Token Transformation Matters: Towards Faithful Post-hoc Explanation for Vision Transformer

Yan Yan

219

16

0

21 Mar 2024

Learning to Project for Cross-Task Knowledge Distillation

Learning to Project for Cross-Task Knowledge Distillation

Benedikt Kolbeinsson

229

0

0

21 Mar 2024

Style-Extracting Diffusion Models for Semi-Supervised Histopathology
Segmentation

Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation

Jingna Qiu

...

Katharina Breininger

147

5

0

21 Mar 2024

TexTile: A Differentiable Metric for Texture Tileability

TexTile: A Differentiable Metric for Texture Tileability

Carlos Rodriguez-Pardo

Jorge López-Moreno

251

8

0

19 Mar 2024

DreamDA: Generative Data Augmentation with Diffusion Models

DreamDA: Generative Data Augmentation with Diffusion Models

Yunxiang Fu

Chaoqi Chen

219

24

0

19 Mar 2024

Dynamic Tuning Towards Parameter and Inference Efficiency for ViT
Adaptation

Dynamic Tuning Towards Parameter and Inference Efficiency for ViT AdaptationNeural Information Processing Systems (NeurIPS), 2024

Gao Huang

Yang You

335

23

0

18 Mar 2024

Gradient based Feature Attribution in Explainable AI: A Technical Review

Gradient based Feature Attribution in Explainable AI: A Technical Review

286

45

0

15 Mar 2024

1 2 3...7 8 9...17 18 19

Page 8 of 19

Pageof 19