ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.09883
  4. Cited By
Swin Transformer V2: Scaling Up Capacity and Resolution

Swin Transformer V2: Scaling Up Capacity and Resolution

18 November 2021
Ze Liu
Han Hu
Yutong Lin
Zhuliang Yao
Zhenda Xie
Yixuan Wei
Jia Ning
Yue Cao
Zheng-Wei Zhang
Li Dong
Furu Wei
B. Guo
    ViT
ArXivPDFHTML

Papers citing "Swin Transformer V2: Scaling Up Capacity and Resolution"

50 / 821 papers shown
Title
SynID: Passport Synthetic Dataset for Presentation Attack Detection
SynID: Passport Synthetic Dataset for Presentation Attack Detection
Juan E. Tapia
Fabian Stockhardt
Lázaro J. González Soler
Christoph Busch
24
0
0
12 May 2025
Adapting a Segmentation Foundation Model for Medical Image Classification
Adapting a Segmentation Foundation Model for Medical Image Classification
Pengfei Gu
Haoteng Tang
Islam A. Ebeid
Jose Angel Nuñez
Fabian Vazquez
Diego Adame
Marcus Zhan
Huimin Li
Bin Fu
D. Z. Chen
MedIm
VLM
21
0
0
09 May 2025
ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling
ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling
Xiao Wang
Jong Youl Choi
Takuya Kurihaya
Isaac Lyngaas
Hong-Jun Yoon
...
Dali Wang
Peter Thornton
Prasanna Balaprakash
M. Ashfaq
Dan Lu
19
0
0
07 May 2025
Stow: Robotic Packing of Items into Fabric Pods
Stow: Robotic Packing of Items into Fabric Pods
Nicolas Hudson
Josh Hooks
Rahul Warrier
Curt Salisbury
Ross Hartley
...
Christine Fuller
Alex Keklak
Alex Frenkel
Lillian J. Ratliff
Aaron Parness
38
0
0
07 May 2025
Rethinking Boundary Detection in Deep Learning-Based Medical Image Segmentation
Rethinking Boundary Detection in Deep Learning-Based Medical Image Segmentation
Yi-Mou Lin
Dong-Ming Zhang
X. B. Fang
Yufan Chen
K.-T. Cheng
Hao Chen
19
0
0
06 May 2025
SCOPE-MRI: Bankart Lesion Detection as a Case Study in Data Curation and Deep Learning for Challenging Diagnoses
SCOPE-MRI: Bankart Lesion Detection as a Case Study in Data Curation and Deep Learning for Challenging Diagnoses
Sahil Sethi
Sai Reddy
Mansi Sakarvadia
Jordan Serotte
Darlington Nwaudo
Nicholas Maassen
Lewis Shi
41
0
0
29 Apr 2025
Prompt Guiding Multi-Scale Adaptive Sparse Representation-driven Network for Low-Dose CT MAR
Prompt Guiding Multi-Scale Adaptive Sparse Representation-driven Network for Low-Dose CT MAR
Baoshun Shi
Bing Chen
Shaolei Zhang
Huazhu Fu
Zhanli Hu
MedIm
33
0
0
28 Apr 2025
Examining the Impact of Optical Aberrations to Image Classification and Object Detection Models
Examining the Impact of Optical Aberrations to Image Classification and Object Detection Models
Patrick Müller
Alexander Braun
M. Keuper
50
0
0
25 Apr 2025
High-Quality Cloud-Free Optical Image Synthesis Using Multi-Temporal SAR and Contaminated Optical Data
High-Quality Cloud-Free Optical Image Synthesis Using Multi-Temporal SAR and Contaminated Optical Data
Chenxi Duan
20
0
0
23 Apr 2025
LOOPE: Learnable Optimal Patch Order in Positional Embeddings for Vision Transformers
LOOPE: Learnable Optimal Patch Order in Positional Embeddings for Vision Transformers
M. Chowdhury
Md Rifat Ur Rahman
Akil Ahmad Taki
15
0
0
19 Apr 2025
Learning from Noisy Pseudo-labels for All-Weather Land Cover Mapping
Learning from Noisy Pseudo-labels for All-Weather Land Cover Mapping
Wang Liu
Zhiyu Wang
Xin Guo
Puhong Duan
Xudong Kang
Shutao Li
22
0
0
18 Apr 2025
Towards Scale-Aware Low-Light Enhancement via Structure-Guided Transformer Design
Towards Scale-Aware Low-Light Enhancement via Structure-Guided Transformer Design
Wei Dong
Yan Min
Han Zhou
Jun Chen
ViT
31
0
0
18 Apr 2025
BeetleVerse: A study on taxonomic classification of ground beetles
BeetleVerse: A study on taxonomic classification of ground beetles
S M Rayeed
Alyson East
Samuel Stevens
Sydne Record
Charles V. Stewart
16
0
0
18 Apr 2025
NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement: Methods and Results
NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement: Methods and Results
Xin Li
Kun Yuan
B. Li
Fengbin Guan
Yizhen Shao
...
Guohua Zhang
Z. Huang
Y. Deng
Qingmiao Jiang
Lu Chen
53
7
0
17 Apr 2025
SAR Object Detection with Self-Supervised Pretraining and Curriculum-Aware Sampling
SAR Object Detection with Self-Supervised Pretraining and Curriculum-Aware Sampling
Yasin Almalioglu
Andrzej Kucik
Geoffrey French
Dafni Antotsiou
Alexander Adam
Cedric Archambeau
16
0
0
17 Apr 2025
Perception Encoder: The best visual embeddings are not at the output of the network
Perception Encoder: The best visual embeddings are not at the output of the network
Daniel Bolya
Po-Yao (Bernie) Huang
Peize Sun
Jang Hyun Cho
Andrea Madotto
...
Shiyu Dong
Nikhila Ravi
Daniel Li
Piotr Dollár
Christoph Feichtenhofer
ObjD
VOS
103
0
0
17 Apr 2025
Simplifying Graph Transformers
Simplifying Graph Transformers
Liheng Ma
Soumyasundar Pal
Yingxue Zhang
Philip H. S. Torr
Mark J. Coates
26
0
0
17 Apr 2025
Metric-Solver: Sliding Anchored Metric Depth Estimation from a Single Image
Metric-Solver: Sliding Anchored Metric Depth Estimation from a Single Image
Tao Wen
J. Wang
Y. Chen
Shugong Xu
Chi Zhang
Xuelong Li
MDE
31
0
0
16 Apr 2025
Tokenize Image Patches: Global Context Fusion for Effective Haze Removal in Large Images
Tokenize Image Patches: Global Context Fusion for Effective Haze Removal in Large Images
Jiuchen Chen
Xinyu Yan
Qizhi Xu
Kaiqi Li
VLM
22
0
0
13 Apr 2025
SD-ReID: View-aware Stable Diffusion for Aerial-Ground Person Re-Identification
SD-ReID: View-aware Stable Diffusion for Aerial-Ground Person Re-Identification
Xiang Hu
Pingping Zhang
Yuhao Wang
Bin Yan
Huchuan Lu
21
0
0
13 Apr 2025
Mixture of Group Experts for Learning Invariant Representations
Mixture of Group Experts for Learning Invariant Representations
Lei Kang
Jia Li
Mi Tian
Hua Huang
MoE
25
0
0
12 Apr 2025
Hyperlocal disaster damage assessment using bi-temporal street-view imagery and pre-trained vision models
Hyperlocal disaster damage assessment using bi-temporal street-view imagery and pre-trained vision models
Yifan Yang
Lei Zou
Bing Zhou
Daoyang Li
Binbin Lin
J. Abedin
Mingzheng Yang
21
0
0
12 Apr 2025
Heart Failure Prediction using Modal Decomposition and Masked Autoencoders for Scarce Echocardiography Databases
Heart Failure Prediction using Modal Decomposition and Masked Autoencoders for Scarce Echocardiography Databases
Andrés Bell-Navas
M. Villalba-Orero
Enrique Lara Pezzi
J. Garicano-Mena
S. L. Clainche
40
0
0
10 Apr 2025
Audio-visual Event Localization on Portrait Mode Short Videos
Audio-visual Event Localization on Portrait Mode Short Videos
Wuyang Liu
Yi Chai
Yongpeng Yan
Yanzhen Ren
21
0
0
09 Apr 2025
A Robust Real-Time Lane Detection Method with Fog-Enhanced Feature Fusion for Foggy Conditions
A Robust Real-Time Lane Detection Method with Fog-Enhanced Feature Fusion for Foggy Conditions
Ronghui Zhang
Yuhang Ma
Tengfei Li
Ziyu Lin
Yueying Wu
Junzhou Chen
Lin Zhang
Jia Hu
Tony Z. Qiu
Konghui Guo
34
0
0
08 Apr 2025
EMF: Event Meta Formers for Event-based Real-time Traffic Object Detection
EMF: Event Meta Formers for Event-based Real-time Traffic Object Detection
Muhammad Ahmed Ullah Khan
Abdul Hannan Khan
Andreas Dengel
33
0
0
05 Apr 2025
Rip Current Segmentation: A Novel Benchmark and YOLOv8 Baseline Results
Rip Current Segmentation: A Novel Benchmark and YOLOv8 Baseline Results
Andrei Dumitriu
Florin Tatui
Florin Miron
Radu Tudor Ionescu
Radu Timofte
37
19
0
03 Apr 2025
Spline-based Transformers
Spline-based Transformers
Prashanth Chandran
Agon Serifi
Markus Gross
Moritz Bächer
33
0
0
03 Apr 2025
FLAMES: A Hybrid Spiking-State Space Model for Adaptive Memory Retention in Event-Based Learning
FLAMES: A Hybrid Spiking-State Space Model for Adaptive Memory Retention in Event-Based Learning
Biswadeep Chakraborty
Saibal Mukhopadhyay
40
0
0
02 Apr 2025
GRU-AUNet: A Domain Adaptation Framework for Contactless Fingerprint Presentation Attack Detection
GRU-AUNet: A Domain Adaptation Framework for Contactless Fingerprint Presentation Attack Detection
Banafsheh Adami
Nima Karimian
31
0
0
01 Apr 2025
rPPG-SysDiaGAN: Systolic-Diastolic Feature Localization in rPPG Using Generative Adversarial Network with Multi-Domain Discriminator
rPPG-SysDiaGAN: Systolic-Diastolic Feature Localization in rPPG Using Generative Adversarial Network with Multi-Domain Discriminator
Banafsheh Adami
Nima Karimian
25
1
0
01 Apr 2025
LATex: Leveraging Attribute-based Text Knowledge for Aerial-Ground Person Re-Identification
LATex: Leveraging Attribute-based Text Knowledge for Aerial-Ground Person Re-Identification
Xiang Hu
Yuhao Wang
Pingping Zhang
Huchuan Lu
VLM
37
0
0
31 Mar 2025
Efficient Token Compression for Vision Transformer with Spatial Information Preserved
Efficient Token Compression for Vision Transformer with Spatial Information Preserved
Junzhu Mao
Yang Shen
Jinyang Guo
Yazhou Yao
Xiansheng Hua
ViT
31
0
0
30 Mar 2025
FuXi-RTM: A Physics-Guided Prediction Framework with Radiative Transfer Modeling
FuXi-RTM: A Physics-Guided Prediction Framework with Radiative Transfer Modeling
Qiusheng Huang
Xiaohui Zhong
Xu Fan
Lei Chen
Hao Li
AI4TS
AI4CE
47
0
0
25 Mar 2025
Data-driven Mesoscale Weather Forecasting Combining Swin-Unet and Diffusion Models
Data-driven Mesoscale Weather Forecasting Combining Swin-Unet and Diffusion Models
Yuta Hirabayashi
Daisuke Matsuoka
DiffM
38
0
0
25 Mar 2025
CustomKD: Customizing Large Vision Foundation for Edge Model Improvement via Knowledge Distillation
CustomKD: Customizing Large Vision Foundation for Edge Model Improvement via Knowledge Distillation
Jungsoo Lee
Debasmit Das
Munawar Hayat
Sungha Choi
Kyuwoong Hwang
Fatih Porikli
VLM
60
0
0
23 Mar 2025
Fractal-IR: A Unified Framework for Efficient and Scalable Image Restoration
Fractal-IR: A Unified Framework for Efficient and Scalable Image Restoration
Yawei Li
Bin Ren
Jingyun Liang
Rakesh Ranjan
Mengyuan Liu
N. Sebe
Ming-Hsuan Yang
Luca Benini
53
0
0
22 Mar 2025
Beyond Accuracy: What Matters in Designing Well-Behaved Models?
Beyond Accuracy: What Matters in Designing Well-Behaved Models?
Robin Hesse
Doğukan Bağcı
Bernt Schiele
Simone Schaub-Meyer
Stefan Roth
VLM
54
0
0
21 Mar 2025
From Head to Tail: Efficient Black-box Model Inversion Attack via Long-tailed Learning
From Head to Tail: Efficient Black-box Model Inversion Attack via Long-tailed Learning
Ziang Li
Hongguang Zhang
Juan Wang
Meihui Chen
Hongxin Hu
Wenzhe Yi
Xiaoyang Xu
Mengda Yang
Chenjun Ma
51
0
0
20 Mar 2025
LIFT: Latent Implicit Functions for Task- and Data-Agnostic Encoding
LIFT: Latent Implicit Functions for Task- and Data-Agnostic Encoding
A. Kazerouni
Soroush Mehraban
Michael Brudno
Babak Taati
41
0
0
19 Mar 2025
Towards Scalable Modeling of Compressed Videos for Efficient Action Recognition
Towards Scalable Modeling of Compressed Videos for Efficient Action Recognition
Shristi Das Biswas
Efstathia Soufleri
Arani Roy
Kaushik Roy
54
0
0
17 Mar 2025
MEET: A Million-Scale Dataset for Fine-Grained Geospatial Scene Classification with Zoom-Free Remote Sensing Imagery
Yansheng Li
Yuning Wu
Gong Cheng
Chao Tao
Bo Dang
...
C. Zhang
Y. Liu
X. Tang
Jiayi Ma
Yongjun Zhang
45
0
0
14 Mar 2025
Unlocking Open-Set Language Accessibility in Vision Models
Fawaz Sammani
Jonas Fischer
Nikos Deligiannis
VLM
53
0
0
14 Mar 2025
HeightFormer: Learning Height Prediction in Voxel Features for Roadside Vision Centric 3D Object Detection via Transformer
Zhang Zhang
Chao Sun
Chao Yue
Da Wen
Yujie Chen
Tianze Wang
Jianghao Leng
ViT
39
0
0
13 Mar 2025
CPAny: Couple With Any Encoder to Refer Multi-Object Tracking
Weize Li
Yunhao Du
Qixiang Yin
Zhicheng Zhao
Fei Su
Daqi Liu
59
0
0
10 Mar 2025
Dynamic Dictionary Learning for Remote Sensing Image Segmentation
Xuechao Zou
Yue Li
Shun Zhang
Kai Li
Shiying Wang
Pin Tao
Junliang Xing
Congyan Lang
48
0
0
09 Mar 2025
FEDS: Feature and Entropy-Based Distillation Strategy for Efficient Learned Image Compression
H. Fu
Jie Liang
Zhenman Fang
Jingning Han
31
0
0
09 Mar 2025
DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation
Runze Zhang
Guoguang Du
Xiaochuan Li
Qi Jia
Liang Jin
...
Zhenhua Guo
Yaqian Zhao
Xiaoli Gong
Rengang Li
Baoyu Fan
VGen
70
0
0
08 Mar 2025
Viewport-Unaware Blind Omnidirectional Image Quality Assessment: A Flexible and Effective Paradigm
Jiebin Yan
Kangcheng Wu
Junjie Chen
Ziwen Tan
Yuming Fang
55
0
0
08 Mar 2025
EvidMTL: Evidential Multi-Task Learning for Uncertainty-Aware Semantic Surface Mapping from Monocular RGB Images
Rohit Menon
Nils Dengler
Sicong Pan
Gokul Krishna Chenchani
Maren Bennewitz
EDL
86
0
0
06 Mar 2025
1234...151617
Next