
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

IEEE International Conference on Computer Vision (ICCV), 2021

25 March 2021


Papers citing "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"

50 / 8,530 papers shown

Good Representation, Better Explanation: Role of Convolutional Neural Networks in Transformer-Based Remote Sensing Image Captioning

201

22 Feb 2025

TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba

370

21 Feb 2025

Surface Vision Mamba: Leveraging Bidirectional State Space Model for Efficient Spherical Manifold Representation

International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025

734

21 Feb 2025

Tight Clusters Make Specialized Experts

International Conference on Learning Representations (ICLR), 2025

467

21 Feb 2025

Intelligent Anomaly Detection for Lane Rendering Using Transformer with Self-Supervised Pre-Training and Customized Fine-Tuning

Transportation Research Record (TRR), 2023

394

21 Feb 2025

Dissecting Human Body Representations in Deep Networks Trained for Person Identification

IEEE International Conference on Automatic Face & Gesture Recognition (FG), 2025

260

21 Feb 2025

Myna: Masking-Based Contrastive Learning of Musical Representations

372

20 Feb 2025

Bridging Text and Vision: A Multi-View Text-Vision Registration Approach for Cross-Modal Place Recognition

436

20 Feb 2025

Thicker and Quicker: A Jumbo Token for Fast Plain Vision Transformers

491

20 Feb 2025

Event-Based Video Frame Interpolation With Cross-Modal Asymmetric Bidirectional Motion Fields

Computer Vision and Pattern Recognition (CVPR), 2023

372

20 Feb 2025

Variance Reduction Methods Do Not Need to Compute Full Gradients: Improved Efficiency through Shuffling

454

20 Feb 2025

Quantifying Memorization and Parametric Response Rates in Retrieval-Augmented Vision-Language Models

395

19 Feb 2025

A Comprehensive Survey on Composed Image Retrieval

482

19 Feb 2025

MaxSup: Overcoming Representation Collapse in Label Smoothing

511

18 Feb 2025

RingFormer: Rethinking Recurrent Transformer with Adaptive Level Signals

123

18 Feb 2025

Unsupervised Structural-Counterfactual Generation under Domain Shift

Krishn Vishwas Kher

Lokesh Venkata Siva Maruthi Badisa

Saksham Mittal

Kusampudi Venkata Datta Sri Harsha

Chitneedi Geetha Sowmya

SakethaNath Jagarlapudi

OOD CML

249

17 Feb 2025

Precise GPS-Denied UAV Self-Positioning via Context-Enhanced Cross-View Geo-Localization

267

17 Feb 2025

ProMRVL-CAD: Proactive Dialogue System with Multi-Round Vision-Language Interactions for Computer-Aided Diagnosis

222

15 Feb 2025

NPSim: Nighttime Photorealistic Simulation From Daytime Images With Monocular Inverse Rendering and Ray Tracing

Shutong Zhang

357

15 Feb 2025

Harnessing Vision Models for Time Series Analysis: A Survey

International Joint Conference on Artificial Intelligence (IJCAI), 2024

505

13 Feb 2025

CFIRSTNET: Comprehensive Features for Static IR Drop Estimation with Neural Network

International Conference on Computer Aided Design (ICCAD), 2024

171

13 Feb 2025

Towards Virtual Clinical Trials of Radiology AI with Conditional Generative Modeling

281

13 Feb 2025

CoL3D: Collaborative Learning of Single-view Depth and Camera Intrinsics for Metric 3D Shape Recovery

IEEE International Conference on Robotics and Automation (ICRA), 2025

478

13 Feb 2025

Top-Theta Attention: Sparsifying Transformers by Compensated Thresholding

Konstantin Berestizshevsky

Renzo Andri

Lukas Cavigelli

442

12 Feb 2025

Uncertainty Aware Human-machine Collaboration in Camouflaged Object Detection

330

12 Feb 2025

Color-Quality Invariance for Robust Medical Image Segmentation

435

11 Feb 2025

SparseFormer: Detecting Objects in HRW Shots via Sparse Vision Transformer

ACM Multimedia (MM), 2024

472

11 Feb 2025

Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis

Amir Hosein Fadaei

M. Dehaqani

343

11 Feb 2025

The Value of Information in Human-AI Decision-making

703

10 Feb 2025

KARST: Multi-Kernel Kronecker Adaptation with Re-Scaling Transmission for Visual Classification

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

513

10 Feb 2025

Fully Exploiting Vision Foundation Model's Profound Prior Knowledge for Generalizable RGB-Depth Driving Scene Parsing

423

10 Feb 2025

Do we really have to filter out random noise in pre-training data for language models?

449

10 Feb 2025

Multi-Level Decoupled Relational Distillation for Heterogeneous Architectures

323

10 Feb 2025

Learning Clustering-based Prototypes for Compositional Zero-shot Learning

International Conference on Learning Representations (ICLR), 2025

462

10 Feb 2025

Unconstrained Body Recognition at Altitude and Range: Comparing Four Approaches

IEEE International Conference on Automatic Face & Gesture Recognition (FG), 2025

188

10 Feb 2025

Cell Nuclei Detection and Classification in Whole Slide Images with Transformers

Oscar Pina

Eduard Dorca

Verónica Vilaplana

158

10 Feb 2025

Linear Attention Modeling for Learned Image Compression

Computer Vision and Pattern Recognition (CVPR), 2025

749

09 Feb 2025

DiTASK: Multi-Task Fine-Tuning with Diffeomorphic Transformations

Computer Vision and Pattern Recognition (CVPR), 2025

Krishna Sri Ipsit Mantri

Carola-Bibiane Schönlieb

Bruno Ribeiro

Chaim Baskin

Moshe Eliasof

482

09 Feb 2025

AI-Driven HSI: Multimodality, Fusion, Challenges, and the Deep Learning Revolution

337

09 Feb 2025

Contrastive Representation Distillation via Multi-Scale Feature Decoupling

Cuipeng Wang

Tieyuan Chen

377

09 Feb 2025

A Novel Convolutional-Free Method for 3D Medical Imaging Segmentation

Canxuan Gang

MedIm ViT

272

08 Feb 2025

Drone Detection and Tracking with YOLO and a Rule-based Method

Purbaditya Bhattacharya

Patrick Nowak

361

07 Feb 2025

Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More

406

06 Feb 2025

L2GNet: Optimal Local-to-Global Representation of Anatomical Structures for Generalized Medical Image Segmentation

173

06 Feb 2025

Conditional Diffusion Models are Medical Image Classifiers that Provide Explainability and Uncertainty for Free

322

06 Feb 2025

Improving Adversarial Robustness via Phase and Amplitude-aware Prompting

267

06 Feb 2025

All-in-One Image Compression and Restoration

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025

342

05 Feb 2025

Edge Attention Module for Object Classification

Santanu Roy

Ashvath Suresh

Archit Gupta

242

05 Feb 2025

LoCA: Location-Aware Cosine Adaptation for Parameter-Efficient Fine-Tuning

International Conference on Learning Representations (ICLR), 2025

910

05 Feb 2025

Exploiting Ensemble Learning for Cross-View Isolated Sign Language Recognition

The Web Conference (WWW), 2025

466

04 Feb 2025