ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.08083
  4. Cited By
MambaVision: A Hybrid Mamba-Transformer Vision Backbone
v1v2 (latest)

MambaVision: A Hybrid Mamba-Transformer Vision Backbone

10 July 2024
Ali Hatamizadeh
Jan Kautz
    Mamba
ArXiv (abs)PDFHTMLHuggingFace (33 upvotes)

Papers citing "MambaVision: A Hybrid Mamba-Transformer Vision Backbone"

50 / 150 papers shown
DF-Mamba: Deformable State Space Modeling for 3D Hand Pose Estimation in Interactions
DF-Mamba: Deformable State Space Modeling for 3D Hand Pose Estimation in Interactions
Yifan Zhou
Takehiko Ohkawa
Guwenxiao Zhou
Kanoko Goto
Takumi Hirose
Yusuke Sekikawa
Nakamasa Inoue
3DHMamba
488
0
0
02 Dec 2025
PointNet4D: A Lightweight 4D Point Cloud Video Backbone for Online and Offline Perception in Robotic Applications
PointNet4D: A Lightweight 4D Point Cloud Video Backbone for Online and Offline Perception in Robotic Applications
Yunze Liu
Zifan Wang
Peiran Wu
Jiayang Ao
3DPC
204
0
0
01 Dec 2025
MambaScope: Coarse-to-Fine Scoping for Efficient Vision Mamba
MambaScope: Coarse-to-Fine Scoping for Efficient Vision Mamba
Shanhui Liu
Rui Xu
Yunke Wang
Mamba
377
0
0
29 Nov 2025
PathMamba: A Hybrid Mamba-Transformer for Topologically Coherent Road Segmentation in Satellite Imagery
PathMamba: A Hybrid Mamba-Transformer for Topologically Coherent Road Segmentation in Satellite Imagery
Jules Decaestecker
Nicolas Vigne
Mamba
399
0
0
26 Nov 2025
MambaEye: A Size-Agnostic Visual Encoder with Causal Sequential Processing
MambaEye: A Size-Agnostic Visual Encoder with Causal Sequential Processing
Changho Choi
Minho Kim
Jinkyu Kim
Mamba
168
0
0
25 Nov 2025
RNN as Linear Transformer: A Closer Investigation into Representational Potentials of Visual Mamba Models
RNN as Linear Transformer: A Closer Investigation into Representational Potentials of Visual Mamba Models
Timing Yang
Guoyizhe Wei
Alan Yuille
Feng Wang
Mamba
180
0
0
23 Nov 2025
Supervised Contrastive Learning for Few-Shot AI-Generated Image Detection and Attribution
Supervised Contrastive Learning for Few-Shot AI-Generated Image Detection and Attribution
Jaime Álvarez Urueña
David Camacho
Javier Huertas-Tato
226
0
0
20 Nov 2025
MambaTrack3D: A State Space Model Framework for LiDAR-Based Object Tracking under High Temporal Variation
MambaTrack3D: A State Space Model Framework for LiDAR-Based Object Tracking under High Temporal Variation
Shengjing Tian
Yinan Han
Xiantong Zhao
Xuehu Liu
Qi Lang
Mamba
295
0
0
19 Nov 2025
Systematic Evaluation of Time-Frequency Features for Binaural Sound Source Localization
Systematic Evaluation of Time-Frequency Features for Binaural Sound Source Localization
Davoud Shariat Panah
Alessandro Ragano
Dan Barry
Jan Skoglund
Andrew Hines
127
0
0
17 Nov 2025
DensePercept-NCSSD: Vision Mamba towards Real-time Dense Visual Perception with Non-Causal State Space Duality
DensePercept-NCSSD: Vision Mamba towards Real-time Dense Visual Perception with Non-Causal State Space Duality
Tushar Anand
Advik Sinha
Abhijit Das
Mamba
152
0
0
16 Nov 2025
Application of Graph Based Vision Transformers Architectures for Accurate Temperature Prediction in Fiber Specklegram Sensors
Application of Graph Based Vision Transformers Architectures for Accurate Temperature Prediction in Fiber Specklegram Sensors
Abhishek Sebastian
175
0
0
15 Nov 2025
Adaptive Morph-Patch Transformer for Aortic Vessel Segmentation
Adaptive Morph-Patch Transformer for Aortic Vessel Segmentation
Zhenxi Zhang
Fuchen Zheng
Adnan Iltaf
Yifei Han
Zhenyu Cheng
Yue Du
Bin Li
Tianyong Liu
Shoujun Zhou
MedIm
221
0
0
10 Nov 2025
MVSMamba: Multi-View Stereo with State Space Model
MVSMamba: Multi-View Stereo with State Space Model
Jianfei Jiang
Qiankun Liu
Hongyuan Liu
Haochen Yu
Liyong Wang
Jiansheng Chen
Huimin Ma
Mamba
221
0
0
03 Nov 2025
HieraMamba: Video Temporal Grounding via Hierarchical Anchor-Mamba Pooling
HieraMamba: Video Temporal Grounding via Hierarchical Anchor-Mamba Pooling
Joungbin An
Kristen Grauman
Mamba
282
0
0
27 Oct 2025
Simplifying Knowledge Transfer in Pretrained Models
Simplifying Knowledge Transfer in Pretrained Models
Siddharth Jain
Shyamgopal Karthik
Vineet Gandhi
182
0
0
25 Oct 2025
StretchySnake: Flexible SSM Training Unlocks Action Recognition Across Spatio-Temporal Scales
StretchySnake: Flexible SSM Training Unlocks Action Recognition Across Spatio-Temporal Scales
Nyle Siddiqui
Rohit Gupta
S. Swetha
Mubarak Shah
198
0
0
17 Oct 2025
EdgeNavMamba: Mamba Optimized Object Detection for Energy Efficient Edge Devices
EdgeNavMamba: Mamba Optimized Object Detection for Energy Efficient Edge Devices
Romina Aalishah
Mozhgan Navardi
T. Mohsenin
Mamba
229
2
0
16 Oct 2025
End-to-End Multi-Modal Diffusion Mamba
End-to-End Multi-Modal Diffusion Mamba
Chunhao Lu
Qiang Lu
Meichen Dong
Jake Luo
190
4
0
15 Oct 2025
Learning Human Motion with Temporally Conditional Mamba
Learning Human Motion with Temporally Conditional Mamba
Quang Minh Nguyen
T. H. Le
Baoru Huang
M. Vu
Ngan Le
Thieu Vo
Anh Duc Nguyen
Mamba
279
2
0
14 Oct 2025
Catch-Only-One: Non-Transferable Examples for Model-Specific Authorization
Catch-Only-One: Non-Transferable Examples for Model-Specific Authorization
Zihan Wang
Zhiyong Ma
Zhongkui Ma
Shuofeng Liu
Akide Liu
Derui Wang
Minhui Xue
Guangdong Bai
AAML
152
3
0
13 Oct 2025
Multimodal Learning with Augmentation Techniques for Natural Disaster Assessment
Multimodal Learning with Augmentation Techniques for Natural Disaster Assessment
Adrian-Dinu Urse
Dumitru-Clementin Cercel
Florin-Catalin Pop
127
0
0
04 Oct 2025
Gather-Scatter Mamba: Accelerating Propagation with Efficient State Space Model
Gather-Scatter Mamba: Accelerating Propagation with Efficient State Space Model
Hyun-kyu Ko
Youbin Kim
Jihyeon Park
Dongheok Park
Gyeongjin Kang
Wonjun Cho
Hyung Yi
Eunbyung Park
Mamba
241
0
0
01 Oct 2025
Can Mamba Learn In Context with Outliers? A Theoretical Generalization Analysis
Can Mamba Learn In Context with Outliers? A Theoretical Generalization Analysis
Hongkang Li
Songtao Lu
Xiaodong Cui
Pin-Yu Chen
Meng Wang
MLT
205
1
0
01 Oct 2025
AttentionViG: Cross-Attention-Based Dynamic Neighbor Aggregation in Vision GNNs
AttentionViG: Cross-Attention-Based Dynamic Neighbor Aggregation in Vision GNNs
Hakan Emre Gedik
Andrew Martin
Mustafa Munir
Oguzhan Baser
R. Marculescu
Sandeep Chinchali
Alan C. Bovik
ViT
141
0
0
29 Sep 2025
StableDub: Taming Diffusion Prior for Generalized and Efficient Visual Dubbing
StableDub: Taming Diffusion Prior for Generalized and Efficient Visual Dubbing
Liyang Chen
Tianze Zhou
Xu He
Boshi Tang
Zhiyong Wu
Yang Huang
Yang Wu
Zhongqian Sun
Wei Yang
Helen M. Meng
DiffM
235
0
0
26 Sep 2025
Sequential Token Merging: Revisiting Hidden States
Sequential Token Merging: Revisiting Hidden States
Yan Wen
Peng Ye
Lin Zhang
Baopu Li
Jiakang Yuan
Yaoxin Yang
Tao Chen
Mamba
174
0
0
19 Sep 2025
UM-Depth : Uncertainty Masked Self-Supervised Monocular Depth Estimation with Visual Odometry
UM-Depth : Uncertainty Masked Self-Supervised Monocular Depth Estimation with Visual Odometry
Tae-Wook Um
Ki-Hyeon Kim
Hyun-Duck Choi
Hyo-Sung Ahn
MDE
190
0
0
17 Sep 2025
VCMamba: Bridging Convolutions with Multi-Directional Mamba for Efficient Visual Representation
VCMamba: Bridging Convolutions with Multi-Directional Mamba for Efficient Visual Representation
Mustafa Munir
Alex Zhang
R. Marculescu
Mamba
249
1
0
04 Sep 2025
DSGC-Net: A Dual-Stream Graph Convolutional Network for Crowd Counting via Feature Correlation Mining
DSGC-Net: A Dual-Stream Graph Convolutional Network for Crowd Counting via Feature Correlation Mining
Yihong Wu
Jinqiao Wei
Xionghui Zhao
Yidi Li
Shaoyi Du
Bin Ren
Andrii Zadaianchuk
379
0
0
02 Sep 2025
MV-SSM: Multi-View State Space Modeling for 3D Human Pose Estimation
MV-SSM: Multi-View State Space Modeling for 3D Human Pose EstimationComputer Vision and Pattern Recognition (CVPR), 2025
Aviral Chharia
Wenbo Gou
Haoye Dong
157
6
0
31 Aug 2025
Characterizing the Behavior of Training Mamba-based State Space Models on GPUs
Characterizing the Behavior of Training Mamba-based State Space Models on GPUs
Trinayan Baruah
Kaustubh Shivdikar
Sara Prescott
David Kaeli
Mamba
105
1
0
25 Aug 2025
Towards Efficient Vision State Space Models via Token Merging
Towards Efficient Vision State Space Models via Token Merging
Jinyoung Park
Minseok Son
Changick Kim
177
0
0
19 Aug 2025
SRMA-Mamba: Spatial Reverse Mamba Attention Network for Pathological Liver Segmentation in MRI Volumes
SRMA-Mamba: Spatial Reverse Mamba Attention Network for Pathological Liver Segmentation in MRI Volumes
Jun Zeng
Yannan Huang
Elif Keles
Halil Ertugrul Aktas
Gorkem Durak
Nikhil Kumar Tomar
Quoc-Huy Trinh
Deepak Ranjan Nayak
Ulas Bagci
Debesh Jha
Mamba
235
0
0
17 Aug 2025
ENA: Efficient N-dimensional Attention
ENA: Efficient N-dimensional Attention
Yibo Zhong
3DVAI4TS
156
0
0
16 Aug 2025
Multi-State Tracker: Enhancing Efficient Object Tracking via Multi-State Specialization and Interaction
Multi-State Tracker: Enhancing Efficient Object Tracking via Multi-State Specialization and Interaction
Shilei Wang
Gong Cheng
Pujian Lai
Dong Gao
Junwei Han
106
1
0
15 Aug 2025
Security Analysis of ChatGPT: Threats and Privacy Risks
Security Analysis of ChatGPT: Threats and Privacy Risks
Yushan Xiang
Zhongwen Li
Xiaoqi Li
SILM
285
9
0
13 Aug 2025
Subjective and Objective Quality Assessment of Banding Artifacts on Compressed Videos
Subjective and Objective Quality Assessment of Banding Artifacts on Compressed VideosIEEE Transactions on Image Processing (IEEE TIP), 2025
Qi Zheng
Li-Heng Chen
Chenlong He
Neil Berkbeck
Yilin Wang
Balu Adsumilli
A. Bovik
Yibo Fan
Zhengzhong Tu
230
0
0
12 Aug 2025
RoadMamba: A Dual Branch Visual State Space Model for Road Surface Classification
RoadMamba: A Dual Branch Visual State Space Model for Road Surface Classification
Tianze Wang
Zhang Zhang
Chao Yue
Nuoran Li
Chao Sun
Mamba
192
1
0
02 Aug 2025
$MV_{Hybrid}$: Improving Spatial Transcriptomics Prediction with Hybrid State Space-Vision Transformer Backbone in Pathology Vision Foundation Models
MVHybridMV_{Hybrid}MVHybrid​: Improving Spatial Transcriptomics Prediction with Hybrid State Space-Vision Transformer Backbone in Pathology Vision Foundation Models
Won June Cho
Hongjun Yoon
Daeky Jeong
Hyeongyeol Lim
Yosep Chong
137
1
0
01 Aug 2025
VMatcher: State-Space Semi-Dense Local Feature Matching
VMatcher: State-Space Semi-Dense Local Feature Matching
Ali Youssef
Mamba
214
0
0
31 Jul 2025
RadioMamba: Breaking the Accuracy-Efficiency Trade-off in Radio Map Construction via a Hybrid Mamba-UNet
RadioMamba: Breaking the Accuracy-Efficiency Trade-off in Radio Map Construction via a Hybrid Mamba-UNetIEEE Transactions on Network Science and Engineering (IEEE TNS&E), 2025
Honggang Jia
Nan Cheng
Xiucheng Wang
Conghao Zhou
Ruijin Sun
Xuemin
Shen
Mamba
204
1
0
28 Jul 2025
Onboard Hyperspectral Super-Resolution with Deep Pushbroom Neural Network
Onboard Hyperspectral Super-Resolution with Deep Pushbroom Neural NetworkRemote Sensing (RS), 2025
Davide Piccinini
D. Valsesia
E. Magli
SupR
445
2
0
28 Jul 2025
VAMPIRE: Uncovering Vessel Directional and Morphological Information from OCTA Images for Cardiovascular Disease Risk Factor Prediction
VAMPIRE: Uncovering Vessel Directional and Morphological Information from OCTA Images for Cardiovascular Disease Risk Factor PredictionInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Lehan Wang
Hualiang Wang
Chubin Ou
Lushi Chen
Yunyi Liang
Xiaomeng Li
202
0
0
26 Jul 2025
Explaining How Visual, Textual and Multimodal Encoders Share Concepts
Explaining How Visual, Textual and Multimodal Encoders Share Concepts
Clément Cornet
Romaric Besançon
Hervé Le Borgne
189
0
0
24 Jul 2025
HybridTM: Combining Transformer and Mamba for 3D Semantic Segmentation
HybridTM: Combining Transformer and Mamba for 3D Semantic Segmentation
Xinyu Wang
Jinghua Hou
Zhe Liu
Yingying Zhu
Mamba
183
0
0
24 Jul 2025
SRMambaV2: Biomimetic Attention for Sparse Point Cloud Upsampling in Autonomous Driving
SRMambaV2: Biomimetic Attention for Sparse Point Cloud Upsampling in Autonomous Driving
Chuang Chen
Xiaolin Qin
Jing Hu
Wenyi Ge
3DPC
236
1
0
23 Jul 2025
A2Mamba: Attention-augmented State Space Models for Visual Recognition
A2Mamba: Attention-augmented State Space Models for Visual Recognition
Meng Lou
Yunxiang Fu
Yizhou Yu
Mamba
257
0
0
22 Jul 2025
ThinkingViT: Matryoshka Thinking Vision Transformer for Elastic Inference
ThinkingViT: Matryoshka Thinking Vision Transformer for Elastic Inference
A. Hojjat
Janek Haberer
Soren Pirk
Olaf Landsiedel
ViTLRM
262
3
0
14 Jul 2025
A Memory-Efficient Framework for Deformable Transformer with Neural Architecture Search
A Memory-Efficient Framework for Deformable Transformer with Neural Architecture Search
Wendong Mao
Mingfan Zhao
Jianfeng Guan
Qiwei Dong
Zhongfeng Wang
147
0
0
13 Jul 2025
MUG: Pseudo Labeling Augmented Audio-Visual Mamba Network for Audio-Visual Video Parsing
MUG: Pseudo Labeling Augmented Audio-Visual Mamba Network for Audio-Visual Video Parsing
Langyu Wang
Bingke Zhu
Yingying Chen
Yiyuan Zhang
Ming Tang
Jinqiao Wang
VLM
373
1
0
02 Jul 2025
123
Next
Page 1 of 3