ResearchTrend.AI
arXiv: 2111.09883
Swin Transformer V2: Scaling Up Capacity and Resolution

18 November 2021
Ze Liu
Han Hu
Yutong Lin
Zhuliang Yao
Zhenda Xie
Yixuan Wei
Jia Ning
Yue Cao
Zheng-Wei Zhang
Li Dong
Furu Wei
B. Guo
    ViT

Papers citing "Swin Transformer V2: Scaling Up Capacity and Resolution"

50 / 821 papers shown
Computational Analysis of Degradation Modeling in Blind Panoramic Image Quality Assessment
Jiebin Yan
Ziwen Tan
Jiale Rao
Lei Wu
Yifan Zuo
Yuming Fang
52
0
0
05 Mar 2025
Task-Agnostic Attacks Against Vision Foundation Models
Brian Pulfer
Yury Belousov
Vitaliy Kinakh
Teddy Furon
S. Voloshynovskiy
AAML
68
0
0
05 Mar 2025
Adaptive Camera Sensor for Vision Models
Eunsu Baek
Sunghwan Han
Taesik Gong
Hyung-Sin Kim
VLM
Presented at ResearchTrend Connect | VLM on 28 Mar 2025
156
0
0
04 Mar 2025
SAR-W-MixMAE: SAR Foundation Model Training Using Backscatter Power Weighting
Ali Caglayan
Nevrez Imamoglu
T. Kouyama
55
0
0
03 Mar 2025
Enhancing Retinal Vessel Segmentation Generalization via Layout-Aware Generative Modelling
Jonathan Fhima
Jan Van Eijgen
Lennert Beeckmans
Thomas Jacobs
Moti Freiman
Luis Filipe Nakayama
Ingeborg Stalmans
Chaim Baskin
Joachim A. Behar
MedIm
57
0
0
03 Mar 2025
Investigating the contribution of terrain-following coordinates and conservation schemes in AI-driven precipitation forecasts
Yingkai Sha
John S. Schreck
William E. Chapman
David John Gagne II
30
1
0
01 Mar 2025
FLStore: Efficient Federated Learning Storage for non-training workloads
Ahmad Faraz Khan
Samuel Fountain
Ahmed M. Abdelmoniem
A. R. Butt
A. Anwar
FedML
36
0
0
01 Mar 2025
Robust and Efficient Writer-Independent IMU-Based Handwriting Recognization
Jindong Li
Tim Hamann
Jens Barth
Peter Kaempf
Dario Zanca
Bjoern M. Eskofier
29
0
0
28 Feb 2025
Explainable, Multi-modal Wound Infection Classification from Images Augmented with Generated Captions
Palawat Busaranuvong
Emmanuel O. Agu
Reza Saadati Fard
Deepak Kumar
Shefalika Gautam
B. Tulu
Diane Strong
MedIm
55
0
0
27 Feb 2025
GONet: A Generalizable Deep Learning Model for Glaucoma Detection
Or Abramovich
Hadas Pizem
Jonathan Fhima
Eran Berkowitz
Ben Gofrit
...
Meital Baskin
Jan Van Eijgen
Ingeborg Stalmans
E. Blumenthal
Joachim A. Behar
57
0
0
26 Feb 2025
MaxGlaViT: A novel lightweight vision transformer-based approach for early diagnosis of glaucoma stages from fundus images
Mustafa Yurdakul
Kubra Uyar
Şakir Taşdemir
48
1
0
24 Feb 2025
MVIP -- A Dataset and Methods for Application Oriented Multi-View and Multi-Modal Industrial Part Recognition
Paul Koch
Marian Schluter
Jörg Krüger
62
0
0
24 Feb 2025
MEX: Memory-efficient Approach to Referring Multi-Object Tracking
Huu-Thien Tran
Phuoc-Sang Pham
Thai-Son Tran
Khoa Luu
VOT
70
1
0
20 Feb 2025
Precise GPS-Denied UAV Self-Positioning via Context-Enhanced Cross-View Geo-Localization
Yuanze Xu
Ming Dai
Wenxiao Cai
Wankou Yang
67
0
0
17 Feb 2025
Without Paired Labeled Data: An End-to-End Self-Supervised Paradigm for UAV-View Geo-Localization
Zhongwei Chen
Zhao-Xu Yang
Hai-Jun Rong
SSL
56
0
0
17 Feb 2025
Amnesia as a Catalyst for Enhancing Black Box Pixel Attacks in Image Classification and Object Detection
Dongsu Song
Daehwa Ko
Jay Hoon Jung
AAML
55
0
0
10 Feb 2025
Learning Musical Representations for Music Performance Question Answering
Xingjian Diao
Chunhui Zhang
Tingxuan Wu
Ming Cheng
Z. Ouyang
Weiyi Wu
Jiang Gui
62
5
0
10 Feb 2025
Integrating Sequence and Image Modeling in Irregular Medical Time Series Through Self-Supervised Learning
Liuqing Chen
Shuhong Xiao
Shixian Ding
Shanhai Hu
Lingyun Sun
63
0
0
10 Feb 2025
Invizo: Arabic Handwritten Document Optical Character Recognition Solution
Alhossien Waly
Bassant Tarek
Ali Feteha
Rewan Yehia
Gasser Amr
Walid Gomaa
Ahmed M. Fares
48
0
0
07 Feb 2025
Addressing Out-of-Label Hazard Detection in Dashcam Videos: Insights from the COOOL Challenge
Anh-Kiet Duong
Petra Gomez-Krämer
33
2
0
27 Jan 2025
A margin-based replacement for cross-entropy loss
Michael W. Spratling
Heiko H. Schütt
61
0
0
21 Jan 2025
A Survey on Memory-Efficient Large-Scale Model Training in AI for Science
Kaiyuan Tian
Linbo Qiao
Baihui Liu
Gongqingjian Jiang
Dongsheng Li
31
0
0
21 Jan 2025
DLEN: Dual Branch of Transformer for Low-Light Image Enhancement in Dual Domains
Junyu Xia
Jiesong Bai
Yihang Dong
ViT
70
0
0
21 Jan 2025
Keypoint Aware Masked Image Modelling
Madhava Krishna
Convin.AI
63
0
0
03 Jan 2025
VMamba: Visual State Space Model
Yue Liu
Yunjie Tian
Yuzhong Zhao
Hongtian Yu
Lingxi Xie
Yaowei Wang
Qixiang Ye
Jianbin Jiao
Yunfan Liu
Mamba
106
592
0
31 Dec 2024
Adaptive Dataset Quantization
Muquan Li
Dongyang Zhang
Qiang Dong
Xiurui Xie
Ke Qin
DD
MQ
83
0
0
22 Dec 2024
MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation via Hierarchical Modality Selection
Xu Zheng
Yuanhuiyi Lyu
Lutao Jiang
Jiazhou Zhou
Lin Wang
Xuming Hu
72
4
0
22 Dec 2024
V"Mean"ba: Visual State Space Models only need 1 hidden dimension
Tien-Yu Chi
Hung-Yueh Chiang
Chi-Chih Chang
N. Huang
Kai-Chiang Wu
83
0
0
21 Dec 2024
Gesture Classification in Artworks Using Contextual Image Features
Azhar Hussian
Mathias Zinnen
Thi My Hang Tran
Andreas K. Maier
Vincent Christlein
67
0
0
04 Dec 2024
GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing
Khawar Islam
M. Zaheer
Arif Mahmood
Karthik Nandakumar
Naveed Akhtar
DiffM
72
2
0
03 Dec 2024
Noisy Ostracods: A Fine-Grained, Imbalanced Real-World Dataset for Benchmarking Robust Machine Learning and Label Correction Methods
Jiamian Hu
Yuanyuan Hong
Yihua Chen
He Wang
Moriaki Yasuhara
56
0
0
03 Dec 2024
MeasureNet: Measurement Based Celiac Disease Identification
Aayush Kumar Tyagi
Vaibhav Mishra
Ashok Tiwari
Lalita Mehra
Prasenjit Das
G. Makharia
Prathosh AP
Mausam
70
0
0
02 Dec 2024
STATIC : Surface Temporal Affine for TIme Consistency in Video Monocular Depth Estimation
Sunghun Yang
Minhyeok Lee
Suhwan Cho
Jungho Lee
Sangyoun Lee
MDE
76
0
0
02 Dec 2024
FactCheXcker: Mitigating Measurement Hallucinations in Chest X-ray Report Generation Models
Alice Heiman
Xiaoman Zhang
E. Chen
Sung Eun Kim
Pranav Rajpurkar
HILM
MedIm
72
0
0
27 Nov 2024
Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning
Hoàng-Ân Lê
P. Berg
Minh Pham
64
0
0
26 Nov 2024
GeoFormer: A Multi-Polygon Segmentation Transformer
Maxim Khomiakov
Michael Riis Andersen
J. Frellsen
57
0
0
25 Nov 2024
Nd-BiMamba2: A Unified Bidirectional Architecture for Multi-Dimensional Data Processing
Hao Liu
Mamba
AI4CE
77
1
0
22 Nov 2024
ReXrank: A Public Leaderboard for AI-Powered Radiology Report Generation
Xiaoman Zhang
Hong-Yu Zhou
Xiaoli Yang
Oishi Banerjee
J. N. Acosta
Josh Miller
Ouwen Huang
Pranav Rajpurkar
LM&MA
64
3
0
22 Nov 2024
Can Reasons Help Improve Pedestrian Intent Estimation? A Cross-Modal Approach
Vaishnavi Khindkar
V. Balasubramanian
Chetan Arora
A. Subramanian
C. V. Jawahar
66
0
0
20 Nov 2024
Emotional Images: Assessing Emotions in Images and Potential Biases in Generative Models
Maneet Mehta
Cody Buntain
EGVM
27
1
0
08 Nov 2024
Confidence Calibration of Classifiers with Many Classes
Adrien LeCoz
Stéphane Herbin
Faouzi Adjed
UQCV
31
1
0
05 Nov 2024
AM Flow: Adapters for Temporal Processing in Action Recognition
Tanay Agrawal
Abid Ali
A. Dantcheva
François Brémond
21
0
0
04 Nov 2024
MamT$^4$: Multi-view Attention Networks for Mammography Cancer Classification
Alisher Ibragimov
Sofya Senotrusova
Arsenii Litvinov
E. Ushakov
E. Karpulevich
Yury Markin
24
0
0
03 Nov 2024
IO Transformer: Evaluating SwinV2-Based Reward Models for Computer Vision
Maxwell Meyer
Jack Spruyt
ViT
21
0
0
31 Oct 2024
DiffPAD: Denoising Diffusion-based Adversarial Patch Decontamination
Jia Fu
Xiao Zhang
Sepideh Pashami
Fatemeh Rahimian
Anders Holst
DiffM
AAML
22
0
0
31 Oct 2024
Context-Aware Token Selection and Packing for Enhanced Vision Transformer
Tianyi Zhang
B. Li
Jae-sun Seo
Yu Cao
26
0
0
31 Oct 2024
Multi-Level Feature Distillation of Joint Teachers Trained on Distinct Image Datasets
Adrian Iordache
B. Alexe
Radu Tudor Ionescu
29
1
0
29 Oct 2024
SAM-Swin: SAM-Driven Dual-Swin Transformers with Adaptive Lesion Enhancement for Laryngo-Pharyngeal Tumor Detection
Jia Wei
Yun Li
Xiaomao Fan
Wenjun Ma
Meiyu Qiu
Hongyu Chen
Wenbin Lei
11
0
0
29 Oct 2024
Enhancing Community Vision Screening -- AI Driven Retinal Photography for Early Disease Detection and Patient Trust
Xiaofeng Lei
Yih-Chung Tham
Jocelyn Hui Lin Goh
Yangqin Feng
Yang Bai
Z. Soh
Rick Siow Mong Goh
Xinxing Xu
Yong Liu
Ching-Yu Cheng
11
0
0
27 Oct 2024
PESFormer: Boosting Macro- and Micro-expression Spotting with Direct Timestamp Encoding
Wang-Wang Yu
Kai-Fu Yang
Xiangrui Hu
Jingwen Jiang
Hong-Mei Yan
Yong-Jie Li
18
0
0
24 Oct 2024