Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2111.09883
Cited By

Swin Transformer V2: Scaling Up Capacity and Resolution

v1v2 (latest)

Swin Transformer V2: Scaling Up Capacity and Resolution

18 November 2021

ArXiv (abs)PDF HTML Github (14834★)

Papers citing "Swin Transformer V2: Scaling Up Capacity and Resolution"

50 / 932 papers shown

Evaluating SAM2 for Video Semantic Segmentation

Evaluating SAM2 for Video Semantic Segmentation

Syed Hesham Syed Ariff

213

0

0

01 Dec 2025

MoLT: Mixture of Layer-Wise Tokens for Efficient Audio-Visual Learning

47

0

0

27 Nov 2025

When Do Domain-Specific Foundation Models Justify Their Cost? A Systematic Evaluation Across Retinal Imaging Tasks

When Do Domain-Specific Foundation Models Justify Their Cost? A Systematic Evaluation Across Retinal Imaging Tasks

Tahm Spitznagel

Gabor M. Somfai

183

0

0

27 Nov 2025

ACIT: Attention-Guided Cross-Modal Interaction Transformer for Pedestrian Crossing Intention Prediction

ACIT: Attention-Guided Cross-Modal Interaction Transformer for Pedestrian Crossing Intention Prediction

Steffen Müller

166

0

0

25 Nov 2025

Cross-Contrastive Clustering for Multimodal Attributed Graphs with Dual Graph Filtering

Cross-Contrastive Clustering for Multimodal Attributed Graphs with Dual Graph Filtering

135

0

0

25 Nov 2025

Glass Surface Detection: Leveraging Reflection Dynamics in Flash/No-flash Imagery

Glass Surface Detection: Leveraging Reflection Dynamics in Flash/No-flash Imagery

Rynson W. H. Lau

108

0

0

21 Nov 2025

A Dataset and Baseline for Deep Learning-Based Visual Quality Inspection in Remanufacturing

A Dataset and Baseline for Deep Learning-Based Visual Quality Inspection in RemanufacturingIEEE International Conference on Emerging Technologies and Factory Automation (ETFA), 2025

Johannes C. Bauer

Stephan Trattnig

84

1

0

19 Nov 2025

AdamNX: An Adam improvement algorithm based on a novel exponential decay mechanism for the second-order moment estimate

AdamNX: An Adam improvement algorithm based on a novel exponential decay mechanism for the second-order moment estimate

266

0

0

17 Nov 2025

MSLoRA: Multi-Scale Low-Rank Adaptation via Attention Reweighting

MSLoRA: Multi-Scale Low-Rank Adaptation via Attention Reweighting

122

0

0

16 Nov 2025

From Street to Orbit: Training-Free Cross-View Retrieval via Location Semantics and LLM Guidance

From Street to Orbit: Training-Free Cross-View Retrieval via Location Semantics and LLM Guidance

206

0

0

12 Nov 2025

WEDepth: Efficient Adaptation of World Knowledge for Monocular Depth Estimation

WEDepth: Efficient Adaptation of World Knowledge for Monocular Depth Estimation

141

0

0

11 Nov 2025

Hilbert-Guided Block-Sparse Local Attention

Hilbert-Guided Block-Sparse Local Attention

98

0

0

08 Nov 2025

CoMA: Complementary Masking and Hierarchical Dynamic Multi-Window Self-Attention in a Unified Pre-training Framework

CoMA: Complementary Masking and Hierarchical Dynamic Multi-Window Self-Attention in a Unified Pre-training Framework

109

0

0

08 Nov 2025

Differentiable Hierarchical Visual Tokenization

Differentiable Hierarchical Visual Tokenization

Martine Hjelkrem-Tan

Adín Ramirez Rivera

212

0

0

04 Nov 2025

SAFE: A Novel Approach to AI Weather Evaluation through Stratified Assessments of Forecasts over Earth

SAFE: A Novel Approach to AI Weather Evaluation through Stratified Assessments of Forecasts over Earth

Randall Balestriero

105

0

0

30 Oct 2025

Leveraging an Atmospheric Foundational Model for Subregional Sea Surface Temperature Forecasting

Leveraging an Atmospheric Foundational Model for Subregional Sea Surface Temperature Forecasting

Giovanny C-Londoño

Javier Sánchez

434

0

0

29 Oct 2025

Attentive Convolution: Unifying the Expressivity of Self-Attention with Convolutional Efficiency

Attentive Convolution: Unifying the Expressivity of Self-Attention with Convolutional Efficiency

141

0

0

23 Oct 2025

Interactive Hypergraph Visual Analytics for Exploring Large and Complex Image Collections

Interactive Hypergraph Visual Analytics for Exploring Large and Complex Image Collections

103

0

0

22 Oct 2025

CAGE: Curvature-Aware Gradient Estimation For Accurate Quantization-Aware Training

CAGE: Curvature-Aware Gradient Estimation For Accurate Quantization-Aware Training

Alexandra Volkova

213

0

0

21 Oct 2025

Towards Generalist Intelligence in Dentistry: Vision Foundation Models for Oral and Maxillofacial Radiology

Towards Generalist Intelligence in Dentistry: Vision Foundation Models for Oral and Maxillofacial Radiology

221

0

0

16 Oct 2025

MatchAttention: Matching the Relative Positions for High-Resolution Cross-View Matching

MatchAttention: Matching the Relative Positions for High-Resolution Cross-View Matching

219

0

0

16 Oct 2025

SkyDreamer: Interpretable End-to-End Vision-Based Drone Racing with Model-Based Reinforcement Learning

SkyDreamer: Interpretable End-to-End Vision-Based Drone Racing with Model-Based Reinforcement Learning

Aderik Verraest

Stavrow A. Bahnam

Guido C. H. E de Croon

Christophe De Wagter

178

1

0

16 Oct 2025

On the Use of Hierarchical Vision Foundation Models for Low-Cost Human Mesh Recovery and Pose Estimation

On the Use of Hierarchical Vision Foundation Models for Low-Cost Human Mesh Recovery and Pose Estimation

Shuhei Tarashima

172

0

0

14 Oct 2025

CuMPerLay: Learning Cubical Multiparameter Persistence Vectorizations

CuMPerLay: Learning Cubical Multiparameter Persistence Vectorizations

Brighton Nuwagira

Barış Coşkunuzer

134

3

0

14 Oct 2025

DREAM: A Benchmark Study for Deepfake REalism AssessMent

DREAM: A Benchmark Study for Deepfake REalism AssessMent

187

0

0

11 Oct 2025

AUREXA-SE: Audio-Visual Unified Representation Exchange Architecture with Cross-Attention and Squeezeformer for Speech Enhancement

AUREXA-SE: Audio-Visual Unified Representation Exchange Architecture with Cross-Attention and Squeezeformer for Speech Enhancement

Deepanshu Gupta

Harshith Jai Surya Ganji

Harshvardhan Choudhary

91

0

0

06 Oct 2025

A Comprehensive Review on Artificial Intelligence Empowered Solutions for Enhancing Pedestrian and Cyclist Safety

A Comprehensive Review on Artificial Intelligence Empowered Solutions for Enhancing Pedestrian and Cyclist Safety

Muhammad Monjurul Karim

158

0

0

30 Sep 2025

Swift: An Autoregressive Consistency Model for Efficient Weather Forecasting

Swift: An Autoregressive Consistency Model for Efficient Weather Forecasting

161

4

0

30 Sep 2025

Unsupervised Detection of Spatiotemporal Anomalies in PMU Data Using Transformer-Based BiGAN

Unsupervised Detection of Spatiotemporal Anomalies in PMU Data Using Transformer-Based BiGAN

Muhammad Imran Hossain

Jignesh Solanki

72

0

0

30 Sep 2025

Vid-LLM: A Compact Video-based 3D Multimodal LLM with Reconstruction-Reasoning Synergy

Vid-LLM: A Compact Video-based 3D Multimodal LLM with Reconstruction-Reasoning Synergy

146

1

0

29 Sep 2025

BRIDGE -- Building Reinforcement-Learning Depth-to-Image Data Generation Engine for Monocular Depth Estimation

BRIDGE -- Building Reinforcement-Learning Depth-to-Image Data Generation Engine for Monocular Depth Estimation

314

0

0

29 Sep 2025

DRIFT-Net: A Spectral--Coupled Neural Operator for PDEs Learning

DRIFT-Net: A Spectral--Coupled Neural Operator for PDEs Learning

126

0

0

29 Sep 2025

Variable Rate Image Compression via N-Gram Context based Swin-transformer

Variable Rate Image Compression via N-Gram Context based Swin-transformer

Priyanka Mudgal

183

0

0

28 Sep 2025

Beyond Outliers: A Study of Optimizers Under Quantization

Beyond Outliers: A Study of Optimizers Under Quantization

Georgios Vlassis

Alexandra Volkova

Torsten Hoefler

210

0

0

27 Sep 2025

HyPSAM: Hybrid Prompt-driven Segment Anything Model for RGB-Thermal Salient Object Detection

HyPSAM: Hybrid Prompt-driven Segment Anything Model for RGB-Thermal Salient Object Detection

128

1

0

23 Sep 2025

A Validation Strategy for Deep Learning Models: Evaluating and Enhancing Robustness

A Validation Strategy for Deep Learning Models: Evaluating and Enhancing Robustness

Abdul-Rauf Nuhu

Benjamin Lartey

185

0

0

23 Sep 2025

PMRT: A Training Recipe for Fast, 3D High-Resolution Aerodynamic Prediction

PMRT: A Training Recipe for Fast, 3D High-Resolution Aerodynamic Prediction

Sam Jacob Jacob

Harald Köstler

135

0

0

21 Sep 2025

Towards Interpretable and Efficient Attention: Compressing All by Contracting a Few

Towards Interpretable and Efficient Attention: Compressing All by Contracting a Few

379

0

0

21 Sep 2025

Random Direct Preference Optimization for Radiography Report Generation

Random Direct Preference Optimization for Radiography Report Generation

Valentin Samokhin

Dmitriy Umerenkov

Dmitry V. Dylov

Mikhail Belyaev

89

0

0

19 Sep 2025

Sequential Token Merging: Revisiting Hidden States

Sequential Token Merging: Revisiting Hidden States

153

0

0

19 Sep 2025

Region-Aware Deformable Convolutions

Region-Aware Deformable Convolutions

Abolfazl Saheban Maleki

146

0

0

18 Sep 2025

CAGE: Continuity-Aware edGE Network Unlocks Robust Floorplan Reconstruction

CAGE: Continuity-Aware edGE Network Unlocks Robust Floorplan Reconstruction

182

0

0

18 Sep 2025

AERIS: Argonne Earth Systems Model for Reliable and Skillful Predictions

AERIS: Argonne Earth Systems Model for Reliable and Skillful Predictions

Väinö Hatanpää

...

153

2

0

16 Sep 2025

MFAF: An EVA02-Based Multi-scale Frequency Attention Fusion Method for Cross-View Geo-Localization

MFAF: An EVA02-Based Multi-scale Frequency Attention Fusion Method for Cross-View Geo-Localization

130

0

0

16 Sep 2025

LoRA-fine-tuned Large Vision Models for Automated Assessment of Post-SBRT Lung Injury

LoRA-fine-tuned Large Vision Models for Automated Assessment of Post-SBRT Lung Injury

57

0

0

15 Sep 2025

CoAtNeXt:An Attention-Enhanced ConvNeXtV2-Transformer Hybrid Model for Gastric Tissue Classification

CoAtNeXt:An Attention-Enhanced ConvNeXtV2-Transformer Hybrid Model for Gastric Tissue Classification

Mustafa Yurdakul

Şakir Tasdemir

79

0

0

11 Sep 2025

Value bounds and Convergence Analysis for Averages of LRP attributions

Value bounds and Convergence Analysis for Averages of LRP attributions

Alexander Binder

Nastaran Takmil-Homayouni

232

0

0

10 Sep 2025

Learning spatially structured open quantum dynamics with regional-attention transformers

Learning spatially structured open quantum dynamics with regional-attention transformers

85

0

0

08 Sep 2025

IGAff: Benchmarking Adversarial Iterative and Genetic Affine Algorithms on Deep Neural Networks

IGAff: Benchmarking Adversarial Iterative and Genetic Affine Algorithms on Deep Neural Networks

Sebastian-Vasile Echim

Dumitru-Clementin Cercel

Florin-Catalin Pop

120

0

0

08 Sep 2025

Dynamic Group Detection using VLM-augmented Temporal Groupness Graph

Dynamic Group Detection using VLM-augmented Temporal Groupness Graph

Kaname Yokoyama

Chihiro Nakatani

Norimichi Ukita

100

0

0

05 Sep 2025

1 2 3 4...17 18 19