ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.01697
  4. Cited By
MaxViT: Multi-Axis Vision Transformer
v1v2v3v4 (latest)

MaxViT: Multi-Axis Vision Transformer

European Conference on Computer Vision (ECCV), 2022
4 April 2022
Zhengzhong Tu
Hossein Talebi
Han Zhang
Feng Yang
P. Milanfar
A. Bovik
Yinxiao Li
    ViT
ArXiv (abs)PDFHTMLGithub (473★)

Papers citing "MaxViT: Multi-Axis Vision Transformer"

50 / 370 papers shown
DisentangleFormer: Spatial-Channel Decoupling for Multi-Channel Vision
DisentangleFormer: Spatial-Channel Decoupling for Multi-Channel Vision
Jiashu Liao
Pietro Liò
Marc de Kamps
Duygu Sarikaya
166
0
0
03 Dec 2025
Life-IQA: Boosting Blind Image Quality Assessment through GCN-enhanced Layer Interaction and MoE-based Feature Decoupling
Life-IQA: Boosting Blind Image Quality Assessment through GCN-enhanced Layer Interaction and MoE-based Feature Decoupling
Long Tang
Guoquan Zhen
Jie Hao
Jianbo Zhang
Huiyu Duan
Liang Yuan
Guangtao Zhai
175
0
0
24 Nov 2025
EVCC: Enhanced Vision Transformer-ConvNeXt-CoAtNet Fusion for Classification
EVCC: Enhanced Vision Transformer-ConvNeXt-CoAtNet Fusion for Classification
Kazi Reyazul Hasan
M. Rahman
Wasif Jalal
Sadif Ahmed
Shahriar Raj
Mubasshira Musarrat
Muhammad Abdullah Adnan
ViT
121
0
0
24 Nov 2025
A Spatial Semantics and Continuity Perception Attention for Remote Sensing Water Body Change Detection
A Spatial Semantics and Continuity Perception Attention for Remote Sensing Water Body Change Detection
Quanqing Ma
Jiaen Chen
Peng Wang
Yao Zheng
Qingzhan Zhao
Yuchen Zheng
145
1
0
20 Nov 2025
AdaptViG: Adaptive Vision GNN with Exponential Decay Gating
AdaptViG: Adaptive Vision GNN with Exponential Decay Gating
Mustafa Munir
Md Mostafijur Rahman
R. Marculescu
67
0
0
13 Nov 2025
Hilbert-Guided Sparse Local Attention
Hilbert-Guided Sparse Local Attention
Yunge Li
Lanyu Xu
164
0
0
08 Nov 2025
MACMD: Multi-dilated Contextual Attention and Channel Mixer Decoding for Medical Image Segmentation
MACMD: Multi-dilated Contextual Attention and Channel Mixer Decoding for Medical Image Segmentation
Lalit Maurya
Honghai Liu
Reyer Zwiggelaar
MedIm
188
0
0
08 Nov 2025
Precipitation nowcasting of satellite data using physically-aligned neural networks
Precipitation nowcasting of satellite data using physically-aligned neural networks
Antônio Catão
Melvin Poveda
Leonardo Voltarelli
Paulo Orenstein
192
0
0
07 Nov 2025
Attentive Convolution: Unifying the Expressivity of Self-Attention with Convolutional Efficiency
Attentive Convolution: Unifying the Expressivity of Self-Attention with Convolutional Efficiency
Hao Yu
H. G. Chen
Yan Jiang
Wei Peng
Zhaodong Sun
Samuel Kaski
Guoying Zhao
191
0
0
23 Oct 2025
Counting Hallucinations in Diffusion Models
Counting Hallucinations in Diffusion Models
Shuai Fu
Jian Zhou
Qi Chen
Huang Jing
Huy Anh Nguyen
Xiaohan Liu
Zhixiong Zeng
Lin Ma
Quanshi Zhang
Qi Wu
DiffMHILM
341
1
0
15 Oct 2025
Q-Router: Agentic Video Quality Assessment with Expert Model Routing and Artifact Localization
Q-Router: Agentic Video Quality Assessment with Expert Model Routing and Artifact Localization
Shuo Xing
Soumik Dey
Mingyang Wu
Ashirbad Mishra
Naveen Ravipati
Binbin Li
Hansi Wu
Zhengzhong Tu
265
2
0
09 Oct 2025
Universal Neural Architecture Space: Covering ConvNets, Transformers and Everything in Between
Universal Neural Architecture Space: Covering ConvNets, Transformers and Everything in Between
Ondřej Týbl
Lukáš Neumann
AI4CE
279
0
0
07 Oct 2025
A Comprehensive Review on Artificial Intelligence Empowered Solutions for Enhancing Pedestrian and Cyclist Safety
A Comprehensive Review on Artificial Intelligence Empowered Solutions for Enhancing Pedestrian and Cyclist Safety
Shucheng Zhang
Yan Shi
Bingzhang Wang
Yuang Zhang
Muhammad Monjurul Karim
Kehua Chen
Chenxi Liu
Mehrdad Nasri
Yinhai Wang
228
0
0
30 Sep 2025
Introducing Multimodal Paradigm for Learning Sleep Staging PSG via General-Purpose Model
Introducing Multimodal Paradigm for Learning Sleep Staging PSG via General-Purpose Model
Jianheng Zhou
Chenyu Liu
J. Zhou
Y. Ding
Yang Liu
Haoran Luo
Ziyu Jia
Xinliang Zhou
AI4TS
128
0
0
26 Sep 2025
Frequency-Aware Model Parameter Explorer: A new attribution method for improving explainability
Frequency-Aware Model Parameter Explorer: A new attribution method for improving explainability
Ali Yavari
Alireza Mohamadi
Elham Beydaghi
Rainer A. Leitgeb
AAML
147
0
0
25 Sep 2025
Align Where the Words Look: Cross-Attention-Guided Patch Alignment with Contrastive and Transport Regularization for Bengali Captioning
Align Where the Words Look: Cross-Attention-Guided Patch Alignment with Contrastive and Transport Regularization for Bengali Captioning
Riad Ahmed Anonto
Sardar Md. Saffat Zabin
M. Saifur Rahman
VLM
188
1
0
22 Sep 2025
Multi-Modal Sensing Aided mmWave Beamforming for V2V Communications with Transformers
Multi-Modal Sensing Aided mmWave Beamforming for V2V Communications with Transformers
Muhammad Baqer Mollah
Honggang Wang
Hua Fang
153
2
0
14 Sep 2025
CoAtNeXt:An Attention-Enhanced ConvNeXtV2-Transformer Hybrid Model for Gastric Tissue Classification
CoAtNeXt:An Attention-Enhanced ConvNeXtV2-Transformer Hybrid Model for Gastric Tissue Classification
Mustafa Yurdakul
Şakir Tasdemir
155
0
0
11 Sep 2025
Sparse Transformer for Ultra-sparse Sampled Video Compressive Sensing
Sparse Transformer for Ultra-sparse Sampled Video Compressive Sensing
Miao Cao
Siming Zheng
Lishun Wang
Ziyang Chen
D. Brady
Xin Yuan
221
0
0
10 Sep 2025
Focus Through Motion: RGB-Event Collaborative Token Sparsification for Efficient Object Detection
Focus Through Motion: RGB-Event Collaborative Token Sparsification for Efficient Object Detection
Nan Yang
Yang Wang
Zhanwen Liu
Yuchao Dai
Yang Liu
Xiangmo Zhao
155
0
0
04 Sep 2025
A Lightweight Convolution and Vision Transformer integrated model with Multi-scale Self-attention Mechanism
A Lightweight Convolution and Vision Transformer integrated model with Multi-scale Self-attention Mechanism
Yi Zhang
Lingxiao Wei
Bowei Zhang
Z. Liu
Kai Yi
Shu Hu
ViT
196
3
0
23 Aug 2025
NAT: Learning to Attack Neurons for Enhanced Adversarial Transferability
NAT: Learning to Attack Neurons for Enhanced Adversarial TransferabilityIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Krishna Kanth Nakka
Alexandre Alahi
AAML
190
2
0
23 Aug 2025
A Fully Transformer Based Multimodal Framework for Explainable Cancer Image Segmentation Using Radiology Reports
A Fully Transformer Based Multimodal Framework for Explainable Cancer Image Segmentation Using Radiology Reports
Enobong Adahada
Isabel Sassoon
Kate Hone
Yongmin Li
ViTMedIm
117
0
0
19 Aug 2025
Subjective and Objective Quality Assessment of Banding Artifacts on Compressed Videos
Subjective and Objective Quality Assessment of Banding Artifacts on Compressed VideosIEEE Transactions on Image Processing (IEEE TIP), 2025
Qi Zheng
Li-Heng Chen
Chenlong He
Neil Berkbeck
Yilin Wang
Balu Adsumilli
A. Bovik
Yibo Fan
Zhengzhong Tu
263
0
0
12 Aug 2025
CMAMRNet: A Contextual Mask-Aware Network Enhancing Mural Restoration Through Comprehensive Mask Guidance
CMAMRNet: A Contextual Mask-Aware Network Enhancing Mural Restoration Through Comprehensive Mask Guidance
Yingtie Lei
Fanghai Yi
Yihang Dong
Weihuang Liu
Xiaofeng Zhang
Zimeng Li
Chi-Man Pun
Xuhang Chen
264
0
0
10 Aug 2025
Prototype-Driven Structure Synergy Network for Remote Sensing Images Segmentation
Prototype-Driven Structure Synergy Network for Remote Sensing Images SegmentationIEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2025
Junyi Wang
Jinjiang Li
Guodong Fan
Yakun Ju
Xiang Fang
Alex C. Kot
206
2
0
06 Aug 2025
Representation Shift: Unifying Token Compression with FlashAttention
Representation Shift: Unifying Token Compression with FlashAttention
Joonmyung Choi
S. Lee
Byungoh Ko
Eunseo Kim
Jihyung Kil
Hyunwoo J. Kim
248
2
0
01 Aug 2025
SwinECAT: A Transformer-based fundus disease classification model with Shifted Window Attention and Efficient Channel Attention
SwinECAT: A Transformer-based fundus disease classification model with Shifted Window Attention and Efficient Channel Attention
Peiran Gu
Teng Yao
Mengshen He
Fuhao Duan
Feiyan Liu
RenYuan Peng
Bao Ge
MedIm
262
1
0
29 Jul 2025
EA-ViT: Efficient Adaptation for Elastic Vision Transformer
EA-ViT: Efficient Adaptation for Elastic Vision Transformer
Chen Zhu
Wangbo Zhao
Huiwen Zhang
Samir Khaki
Yuhao Zhou
...
Zhihang Yuan
Yuzhang Shang
Xiaojiang Peng
Kai Wang
Dawei Yang
229
4
0
25 Jul 2025
GVCCS: A Dataset for Contrail Identification and Tracking on Visible Whole Sky Camera Sequences
GVCCS: A Dataset for Contrail Identification and Tracking on Visible Whole Sky Camera Sequences
Gabriel Jarry
Ramon Dalmau
Philippe Very
Franck Ballerini
Stephania-Denisa Bocu
308
1
0
24 Jul 2025
Perceptual Classifiers: Detecting Generative Images using Perceptual Features
Perceptual Classifiers: Detecting Generative Images using Perceptual Features
Krishna Srikar Durbha
Asvin Kumar Venkataramanan
Rajesh Sureddi
Alan C. Bovik
218
0
0
23 Jul 2025
A2Mamba: Attention-augmented State Space Models for Visual Recognition
A2Mamba: Attention-augmented State Space Models for Visual Recognition
Meng Lou
Yunxiang Fu
Yizhou Yu
Mamba
267
0
0
22 Jul 2025
Colorectal Cancer Tumor Grade Segmentation in Digital Histopathology Images: From Giga to Mini Challenge
Colorectal Cancer Tumor Grade Segmentation in Digital Histopathology Images: From Giga to Mini Challenge
Alper Bahcekapili
Duygu Arslan
Umut Ozdemir
Berkay Ozkirli
Emre Akbas
...
Luc Téot
Fahad Alsharekh
Shahad Alghannam
Hexiang Mao
Wenhua Zhang
273
0
0
07 Jul 2025
Boosting Generative Adversarial Transferability with Self-supervised Vision Transformer Features
Boosting Generative Adversarial Transferability with Self-supervised Vision Transformer Features
Shangbo Wu
Yu-an Tan
Ruinan Ma
Wencong Ma
Dehua Zhu
Yuanzhang Li
ViT
266
3
0
26 Jun 2025
Improving Black-Box Generative Attacks via Generator Semantic Consistency
Improving Black-Box Generative Attacks via Generator Semantic Consistency
Jongoh Jeong
Hunmin Yang
Jaeseok Jeong
Kuk-Jin Yoon
AAML
503
0
0
23 Jun 2025
Polyline Path Masked Attention for Vision Transformer
Polyline Path Masked Attention for Vision Transformer
Zhongchen Zhao
Chaodong Xiao
H. Lin
Qi Xie
Lei Zhang
Deyu Meng
Mamba
389
1
0
19 Jun 2025
synth-dacl: Does Synthetic Defect Data Enhance Segmentation Accuracy and Robustness for Real-World Bridge Inspections?
synth-dacl: Does Synthetic Defect Data Enhance Segmentation Accuracy and Robustness for Real-World Bridge Inspections?
Johannes Flotzinger
Fabian Deuser
Achref Jaziri
Heiko Neumann
Norbert Oswald
Visvanathan Ramesh
T. Braml
206
0
0
17 Jun 2025
Vision Transformers for End-to-End Quark-Gluon Jet Classification from Calorimeter Images
Vision Transformers for End-to-End Quark-Gluon Jet Classification from Calorimeter Images
Md Abrar Jahin
Shahriar Soudeep
Arian Rahman Aditta
M. F. Mridha
Nafiz Fahad
Md. Jakir Hossen
ViT
232
2
0
17 Jun 2025
Detecção da Psoríase Utilizando Visão Computacional: Uma Abordagem Comparativa Entre CNNs e Vision Transformers
Detecção da Psoríase Utilizando Visão Computacional: Uma Abordagem Comparativa Entre CNNs e Vision Transformers
Natanael Lucena
Fábio S. da Silva
Ricardo Rios
ViTMedIm
218
0
0
11 Jun 2025
CXR-LT 2024: A MICCAI challenge on long-tailed, multi-label, and zero-shot disease classification from chest X-ray
CXR-LT 2024: A MICCAI challenge on long-tailed, multi-label, and zero-shot disease classification from chest X-ray
Mingquan Lin
G. Holste
Song Wang
Yiliang Zhou
Yishu Wei
...
Hao Chen
Adam Flanders
George Shih
Zhangyang Wang
Yifan Peng
LM&MA
226
5
0
09 Jun 2025
DermaCon-IN: A Multi-concept Annotated Dermatological Image Dataset of Indian Skin Disorders for Clinical AI Research
DermaCon-IN: A Multi-concept Annotated Dermatological Image Dataset of Indian Skin Disorders for Clinical AI Research
Shanawaj S Madarkar
Mahajabeen Madarkar
Madhumitha V
Teli Prakash
Konda Reddy Mopuri
...
Adarsh Kasturi
Gandla Dilip Raj
PVN Supranitha
Harsh Udai
Harsh Udai
209
0
0
06 Jun 2025
Any-Class Presence Likelihood for Robust Multi-Label Classification with Abundant Negative Data
Any-Class Presence Likelihood for Robust Multi-Label Classification with Abundant Negative Data
Dumindu Tissera
Omar Awadallah
Muhammad Umair Danish
Ayan Sadhu
Katarina Grolinger
216
0
0
06 Jun 2025
Perfecting Depth: Uncertainty-Aware Enhancement of Metric Depth
Perfecting Depth: Uncertainty-Aware Enhancement of Metric Depth
Jinyoung Jun
Lei Chu
Jiahao Li
Yan Lu
Chang-Su Kim
MDE
390
2
0
05 Jun 2025
Seeing What Tastes Good: Revisiting Multimodal Distributional Semantics in the Billion Parameter Era
Seeing What Tastes Good: Revisiting Multimodal Distributional Semantics in the Billion Parameter EraAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Dan Oneaţă
Desmond Elliott
Stella Frank
232
3
0
04 Jun 2025
Learning from Noise: Enhancing DNNs for Event-Based Vision through Controlled Noise Injection
Learning from Noise: Enhancing DNNs for Event-Based Vision through Controlled Noise Injection
M. Kowalczyk
K. Jeziorek
T. Kryjak
288
0
0
04 Jun 2025
NatADiff: Adversarial Boundary Guidance for Natural Adversarial Diffusion
NatADiff: Adversarial Boundary Guidance for Natural Adversarial Diffusion
Max Collins
Jordan Vice
T. French
Lin Wang
DiffM
329
2
0
27 May 2025
GoLF-NRT: Integrating Global Context and Local Geometry for Few-Shot View Synthesis
GoLF-NRT: Integrating Global Context and Local Geometry for Few-Shot View SynthesisComputer Vision and Pattern Recognition (CVPR), 2025
You Wang
Li Fang
Hao Zhu
Fei Hu
Long Ye
Zhan Ma
ViT
270
0
0
26 May 2025
Joint Depth and Reflectivity Estimation using Single-Photon LiDAR
Joint Depth and Reflectivity Estimation using Single-Photon LiDAR
Hashan K. Weerasooriya
Prateek Chennuri
Weijian Zhang
Istvan Gyongy
Stanley H. Chan
3DV
435
4
0
19 May 2025
RainPro-8: An Efficient Deep Learning Model to Estimate Rainfall Probabilities Over 8 Hours
RainPro-8: An Efficient Deep Learning Model to Estimate Rainfall Probabilities Over 8 Hours
Rafael Pablos-Sarabia
Joachim Nyborg
Morten Birk
Jeppe Liborius Sjørup
Anders Lillevang Vesterholt
Ira Assent
BDLAI4Cl
442
2
0
15 May 2025
FoldNet: Learning Generalizable Closed-Loop Policy for Garment Folding via Keypoint-Driven Asset and Demonstration Synthesis
FoldNet: Learning Generalizable Closed-Loop Policy for Garment Folding via Keypoint-Driven Asset and Demonstration Synthesis
Yuxing Chen
Bowen Xiao
Hongan Wang
430
2
0
14 May 2025
12345678
Next
Page 1 of 8