arXiv: 2407.08083
MambaVision: A Hybrid Mamba-Transformer Vision Backbone

10 July 2024
Ali Hatamizadeh
Jan Kautz
Mamba
ArXiv (abs) | PDF | HTML | HuggingFace (33 upvotes) | GitHub (2091★)

Papers citing "MambaVision: A Hybrid Mamba-Transformer Vision Backbone"

50 / 150 papers shown
DF-Mamba: Deformable State Space Modeling for 3D Hand Pose Estimation in Interactions
Yifan Zhou
Takehiko Ohkawa
Guwenxiao Zhou
Kanoko Goto
Takumi Hirose
Yusuke Sekikawa
Nakamasa Inoue
3DH
Mamba
495
0
0
02 Dec 2025
PointNet4D: A Lightweight 4D Point Cloud Video Backbone for Online and Offline Perception in Robotic Applications
Yunze Liu
Zifan Wang
Peiran Wu
Jiayang Ao
3DPC
213
0
0
01 Dec 2025
MambaScope: Coarse-to-Fine Scoping for Efficient Vision Mamba
Shanhui Liu
Rui Xu
Yunke Wang
Mamba
391
0
0
29 Nov 2025
PathMamba: A Hybrid Mamba-Transformer for Topologically Coherent Road Segmentation in Satellite Imagery
Jules Decaestecker
Nicolas Vigne
Mamba
409
0
0
26 Nov 2025
MambaEye: A Size-Agnostic Visual Encoder with Causal Sequential Processing
Changho Choi
Minho Kim
Jinkyu Kim
Mamba
173
0
0
25 Nov 2025
RNN as Linear Transformer: A Closer Investigation into Representational Potentials of Visual Mamba Models
Timing Yang
Guoyizhe Wei
Alan Yuille
Feng Wang
Mamba
198
0
0
23 Nov 2025
Supervised Contrastive Learning for Few-Shot AI-Generated Image Detection and Attribution
Jaime Álvarez Urueña
David Camacho
Javier Huertas-Tato
235
0
0
20 Nov 2025
MambaTrack3D: A State Space Model Framework for LiDAR-Based Object Tracking under High Temporal Variation
Shengjing Tian
Yinan Han
Xiantong Zhao
Xuehu Liu
Qi Lang
Mamba
312
0
0
19 Nov 2025
Systematic Evaluation of Time-Frequency Features for Binaural Sound Source Localization
Davoud Shariat Panah
Alessandro Ragano
Dan Barry
Jan Skoglund
Andrew Hines
153
0
0
17 Nov 2025
DensePercept-NCSSD: Vision Mamba towards Real-time Dense Visual Perception with Non-Causal State Space Duality
Tushar Anand
Advik Sinha
Abhijit Das
Mamba
159
0
0
16 Nov 2025
Application of Graph Based Vision Transformers Architectures for Accurate Temperature Prediction in Fiber Specklegram Sensors
Abhishek Sebastian
186
0
0
15 Nov 2025
Adaptive Morph-Patch Transformer for Aortic Vessel Segmentation
Zhenxi Zhang
Fuchen Zheng
Adnan Iltaf
Yifei Han
Zhenyu Cheng
Yue Du
Bin Li
Tianyong Liu
Shoujun Zhou
MedIm
243
0
0
10 Nov 2025
MVSMamba: Multi-View Stereo with State Space Model
Jianfei Jiang
Qiankun Liu
Hongyuan Liu
Haochen Yu
Liyong Wang
Jiansheng Chen
Huimin Ma
Mamba
227
0
0
03 Nov 2025
HieraMamba: Video Temporal Grounding via Hierarchical Anchor-Mamba Pooling
Joungbin An
Kristen Grauman
Mamba
295
0
0
27 Oct 2025
Simplifying Knowledge Transfer in Pretrained Models
Siddharth Jain
Shyamgopal Karthik
Vineet Gandhi
200
0
0
25 Oct 2025
StretchySnake: Flexible SSM Training Unlocks Action Recognition Across Spatio-Temporal Scales
Nyle Siddiqui
Rohit Gupta
S. Swetha
Mubarak Shah
203
0
0
17 Oct 2025
EdgeNavMamba: Mamba Optimized Object Detection for Energy Efficient Edge Devices
Romina Aalishah
Mozhgan Navardi
T. Mohsenin
Mamba
243
3
0
16 Oct 2025
End-to-End Multi-Modal Diffusion Mamba
Chunhao Lu
Qiang Lu
Meichen Dong
Jake Luo
195
4
0
15 Oct 2025
Learning Human Motion with Temporally Conditional Mamba
Quang Minh Nguyen
T. H. Le
Baoru Huang
M. Vu
Ngan Le
Thieu Vo
Anh Duc Nguyen
Mamba
289
2
0
14 Oct 2025
Catch-Only-One: Non-Transferable Examples for Model-Specific Authorization
Zihan Wang
Zhiyong Ma
Zhongkui Ma
Shuofeng Liu
Akide Liu
Derui Wang
Minhui Xue
Guangdong Bai
AAML
171
3
0
13 Oct 2025
Multimodal Learning with Augmentation Techniques for Natural Disaster Assessment
Adrian-Dinu Urse
Dumitru-Clementin Cercel
Florin-Catalin Pop
146
0
0
04 Oct 2025
Gather-Scatter Mamba: Accelerating Propagation with Efficient State Space Model
Hyun-kyu Ko
Youbin Kim
Jihyeon Park
Dongheok Park
Gyeongjin Kang
Wonjun Cho
Hyung Yi
Eunbyung Park
Mamba
250
0
0
01 Oct 2025
Can Mamba Learn In Context with Outliers? A Theoretical Generalization Analysis
Hongkang Li
Songtao Lu
Xiaodong Cui
Pin-Yu Chen
Meng Wang
MLT
210
1
0
01 Oct 2025
AttentionViG: Cross-Attention-Based Dynamic Neighbor Aggregation in Vision GNNs
Hakan Emre Gedik
Andrew Martin
Mustafa Munir
Oguzhan Baser
R. Marculescu
Sandeep Chinchali
Alan C. Bovik
ViT
150
0
0
29 Sep 2025
StableDub: Taming Diffusion Prior for Generalized and Efficient Visual Dubbing
Liyang Chen
Tianze Zhou
Xu He
Boshi Tang
Zhiyong Wu
Yang Huang
Yang Wu
Zhongqian Sun
Wei Yang
Helen M. Meng
DiffM
243
0
0
26 Sep 2025
Sequential Token Merging: Revisiting Hidden States
Yan Wen
Peng Ye
Lin Zhang
Baopu Li
Jiakang Yuan
Yaoxin Yang
Tao Chen
Mamba
178
0
0
19 Sep 2025
UM-Depth : Uncertainty Masked Self-Supervised Monocular Depth Estimation with Visual Odometry
Tae-Wook Um
Ki-Hyeon Kim
Hyun-Duck Choi
Hyo-Sung Ahn
MDE
200
0
0
17 Sep 2025
VCMamba: Bridging Convolutions with Multi-Directional Mamba for Efficient Visual Representation
Mustafa Munir
Alex Zhang
R. Marculescu
Mamba
253
1
0
04 Sep 2025
DSGC-Net: A Dual-Stream Graph Convolutional Network for Crowd Counting via Feature Correlation Mining
Yihong Wu
Jinqiao Wei
Xionghui Zhao
Yidi Li
Shaoyi Du
Bin Ren
Andrii Zadaianchuk
379
0
0
02 Sep 2025
MV-SSM: Multi-View State Space Modeling for 3D Human Pose Estimation
Computer Vision and Pattern Recognition (CVPR), 2025
Aviral Chharia
Wenbo Gou
Haoye Dong
172
6
0
31 Aug 2025
Characterizing the Behavior of Training Mamba-based State Space Models on GPUs
Trinayan Baruah
Kaustubh Shivdikar
Sara Prescott
David Kaeli
Mamba
112
1
0
25 Aug 2025
Towards Efficient Vision State Space Models via Token Merging
Jinyoung Park
Minseok Son
Changick Kim
187
0
0
19 Aug 2025
SRMA-Mamba: Spatial Reverse Mamba Attention Network for Pathological Liver Segmentation in MRI Volumes
Jun Zeng
Yannan Huang
Elif Keles
Halil Ertugrul Aktas
Gorkem Durak
Nikhil Kumar Tomar
Quoc-Huy Trinh
Deepak Ranjan Nayak
Ulas Bagci
Debesh Jha
Mamba
252
0
0
17 Aug 2025
ENA: Efficient N-dimensional Attention
Yibo Zhong
3DV
AI4TS
160
0
0
16 Aug 2025
Multi-State Tracker: Enhancing Efficient Object Tracking via Multi-State Specialization and Interaction
Shilei Wang
Gong Cheng
Pujian Lai
Dong Gao
Junwei Han
111
1
0
15 Aug 2025
Security Analysis of ChatGPT: Threats and Privacy Risks
Yushan Xiang
Zhongwen Li
Xiaoqi Li
SILM
309
10
0
13 Aug 2025
Subjective and Objective Quality Assessment of Banding Artifacts on Compressed Videos
IEEE Transactions on Image Processing (IEEE TIP), 2025
Qi Zheng
Li-Heng Chen
Chenlong He
Neil Berkbeck
Yilin Wang
Balu Adsumilli
A. Bovik
Yibo Fan
Zhengzhong Tu
244
0
0
12 Aug 2025
RoadMamba: A Dual Branch Visual State Space Model for Road Surface Classification
Tianze Wang
Zhang Zhang
Chao Yue
Nuoran Li
Chao Sun
Mamba
220
1
0
02 Aug 2025
$MV_{Hybrid}$: Improving Spatial Transcriptomics Prediction with Hybrid State Space-Vision Transformer Backbone in Pathology Vision Foundation Models
Won June Cho
Hongjun Yoon
Daeky Jeong
Hyeongyeol Lim
Yosep Chong
142
1
0
01 Aug 2025
VMatcher: State-Space Semi-Dense Local Feature Matching
Ali Youssef
Mamba
223
0
0
31 Jul 2025
RadioMamba: Breaking the Accuracy-Efficiency Trade-off in Radio Map Construction via a Hybrid Mamba-UNet
IEEE Transactions on Network Science and Engineering (IEEE TNS&E), 2025
Honggang Jia
Nan Cheng
Xiucheng Wang
Conghao Zhou
Ruijin Sun
Xuemin Shen
Mamba
217
1
0
28 Jul 2025
Onboard Hyperspectral Super-Resolution with Deep Pushbroom Neural Network
Remote Sensing (RS), 2025
Davide Piccinini
D. Valsesia
E. Magli
SupR
449
2
0
28 Jul 2025
VAMPIRE: Uncovering Vessel Directional and Morphological Information from OCTA Images for Cardiovascular Disease Risk Factor Prediction
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Lehan Wang
Hualiang Wang
Chubin Ou
Lushi Chen
Yunyi Liang
Xiaomeng Li
210
0
0
26 Jul 2025
Explaining How Visual, Textual and Multimodal Encoders Share Concepts
Clément Cornet
Romaric Besançon
Hervé Le Borgne
201
0
0
24 Jul 2025
HybridTM: Combining Transformer and Mamba for 3D Semantic Segmentation
Xinyu Wang
Jinghua Hou
Zhe Liu
Yingying Zhu
Mamba
189
0
0
24 Jul 2025
SRMambaV2: Biomimetic Attention for Sparse Point Cloud Upsampling in Autonomous Driving
Chuang Chen
Xiaolin Qin
Jing Hu
Wenyi Ge
3DPC
250
1
0
23 Jul 2025
A2Mamba: Attention-augmented State Space Models for Visual Recognition
Meng Lou
Yunxiang Fu
Yizhou Yu
Mamba
264
0
0
22 Jul 2025
ThinkingViT: Matryoshka Thinking Vision Transformer for Elastic Inference
A. Hojjat
Janek Haberer
Soren Pirk
Olaf Landsiedel
LRM
267
3
0
14 Jul 2025
A Memory-Efficient Framework for Deformable Transformer with Neural Architecture Search
Wendong Mao
Mingfan Zhao
Jianfeng Guan
Qiwei Dong
Zhongfeng Wang
161
0
0
13 Jul 2025
MUG: Pseudo Labeling Augmented Audio-Visual Mamba Network for Audio-Visual Video Parsing
Langyu Wang
Bingke Zhu
Yingying Chen
Yiyuan Zhang
Ming Tang
Jinqiao Wang
VLM
380
1
0
02 Jul 2025