2205.13542
BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
26 May 2022
Zhijian Liu
Haotian Tang
Alexander Amini
Xinyu Yang
Huizi Mao
Daniela Rus
Song Han
Papers citing
"BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation"
50 / 527 papers shown
BETTY Dataset: A Multi-modal Dataset for Full-Stack Autonomy
Micah Nye
Ayoub Raji
Andrew Saba
Eidan Erlich
Robert Exley
...
Ritesh Misra
Matthew Sivaprakasam
Marko Bertogna
Deva Ramanan
Sebastian A. Scherer
18
0
0
12 May 2025
DepthFusion: Depth-Aware Hybrid Feature Fusion for LiDAR-Camera 3D Object Detection
Mingqian Ji
Jian Yang
Shanshan Zhang
3DPC
MDE
35
0
0
12 May 2025
RESAR-BEV: An Explainable Progressive Residual Autoregressive Approach for Camera-Radar Fusion in BEV Segmentation
Zhiwen Zeng
Yunfei Yin
Zheng Yuan
Argho Dey
Xianjian Bao
18
0
0
10 May 2025
DenseGrounding: Improving Dense Language-Vision Semantics for Ego-Centric 3D Visual Grounding
Henry Zheng
Hao Shi
Qihang Peng
Yong Xien Chng
Rui Huang
Yepeng Weng
Zhongchao Shi
Gao Huang
59
1
0
08 May 2025
OccCylindrical: Multi-Modal Fusion with Cylindrical Representation for 3D Semantic Occupancy Prediction
Zhenxing Ming
J. S. Berrio
Mao Shan
Yaoqi Huang
Hongyu Lyu
Nguyen Hoang Khoi Tran
Tzu-Yun Tseng
Stewart Worrall
3DPC
55
0
0
06 May 2025
Point Cloud Recombination: Systematic Real Data Augmentation Using Robotic Targets for LiDAR Perception Validation
Hubert Padusinski
Christian Steinhauser
Christian Scherl
Julian Gaal
Jacob Langner
3DPC
22
0
0
05 May 2025
DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion
Haoteng Li
Zhao Yang
Zezhong Qian
Gongpeng Zhao
Yuqi Huang
Jun-chen Yu
Huazheng Zhou
Longjun Liu
63
1
0
03 May 2025
Multimodal and Multiview Deep Fusion for Autonomous Marine Navigation
Dimitrios Dagdilelis
Panagiotis Grigoriadis
R. Galeazzi
3DPC
43
0
0
02 May 2025
Is Intermediate Fusion All You Need for UAV-based Collaborative Perception?
Jiuwu Hao
Liguo Sun
Yuting Wan
Yueyang Wu
Ti Xiang
Haolin Song
Pin Lv
59
0
0
30 Apr 2025
DiVE: Efficient Multi-View Driving Scenes Generation Based on Video Diffusion Transformer
Junpeng Jiang
Gangyi Hong
Miao Zhang
Hengtong Hu
Kun Zhan
Rui Shao
Liqiang Nie
VGen
51
0
0
28 Apr 2025
S3MOT: Monocular 3D Object Tracking with Selective State Space Model
Zhuohao Yan
Shaoquan Feng
Xingxing Li
Yuxuan Zhou
Chunxi Xia
Shengyu Li
VOT
66
0
0
25 Apr 2025
A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection
Carlo Sgaravatti
Roberto Basla
Riccardo Pieroni
Matteo Corno
S. Savaresi
Luca Magri
Giacomo Boracchi
3DPC
44
0
0
25 Apr 2025
A Data-Centric Approach to 3D Semantic Segmentation of Railway Scenes
Nicolas Münger
M. Ronecker
Xavier Diaz
Michael Karner
Daniel Watzenig
Jan Skaloud
3DPC
62
0
0
25 Apr 2025
NoiseController: Towards Consistent Multi-view Video Generation via Noise Decomposition and Collaboration
Haotian Dong
X. Wang
D. Lin
Yipeng Wu
Qin Chen
R. Liu
Kairui Yang
Ping Li
Qing-Wu Guo
VGen
42
0
0
25 Apr 2025
SignX: The Foundation Model for Sign Recognition
Sen Fang
Chunyu Sui
Hongwei Yi
C. Neidle
Dimitris N. Metaxas
SLR
30
0
0
22 Apr 2025
Lightweight LiDAR-Camera 3D Dynamic Object Detection and Multi-Class Trajectory Prediction
Yushen He
Lei Zhao
Tianchen Deng
Zipeng Fang
Weidong Chen
26
0
0
18 Apr 2025
Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving
Shumin Wang
Zhuoran Yang
L. Wang
Zhipeng Tang
Heng Li
Lehan Pan
Sha Zhang
Jie Peng
J. Ji
Y. Zhang
3DPC
41
0
0
17 Apr 2025
E2E Parking Dataset: An Open Benchmark for End-to-End Autonomous Parking
Kejia Gao
Liguo Zhou
Mingjun Liu
Alois C. Knoll
20
0
0
15 Apr 2025
FastRSR: Efficient and Accurate Road Surface Reconstruction from Bird's Eye View
Yuting Zhao
Yuheng Ji
Xiaoshuai Hao
Shuxiao Li
26
0
0
13 Apr 2025
InSPE: Rapid Evaluation of Heterogeneous Multi-Modal Infrastructure Sensor Placement
Zhaoliang Zheng
Y. Zhang
Zongling Meng
Johnson Liu
Xin Xia
Jiaqi Ma
30
0
0
11 Apr 2025
Inverse++: Vision-Centric 3D Semantic Occupancy Prediction Assisted with 3D Object Detection
Zhenxing Ming
J. S. Berrio
Mao Shan
Stewart Worrall
3DPC
37
1
0
07 Apr 2025
SSLFusion: Scale & Space Aligned Latent Fusion Model for Multimodal 3D Object Detection
Bonan Ding
J. Xie
Jing Nie
Jiale Cao
14
0
0
07 Apr 2025
ZFusion: An Effective Fuser of Camera and 4D Radar for 3D Object Perception in Autonomous Driving
Sheng Yang
Tong Zhan
Shichen Qiao
Jicheng Gong
Qing Yang
Jian Wang
Yanfeng Lu
3DPC
39
0
0
04 Apr 2025
MinkOcc: Towards real-time label-efficient semantic occupancy prediction
Samuel Sze
Daniele De Martini
Lars Kunze
3DPC
44
0
0
03 Apr 2025
Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision
Xiaofeng Han
Shunpeng Chen
Zenghuang Fu
Zhe Feng
Lue Fan
...
Li Guo
Weiliang Meng
Xiaopeng Zhang
Rongtao Xu
Shibiao Xu
60
0
0
03 Apr 2025
Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting
Shu-Wei Lu
Yi-Hsuan Tsai
Yi-Ting Chen
33
0
0
02 Apr 2025
A Benchmark for Vision-Centric HD Mapping by V2I Systems
Miao Fan
Shanshan Yu
Shengtong Xu
Kun Jiang
Haoyi Xiong
Xiangzeng Liu
3DV
44
0
0
31 Mar 2025
Cal or No Cal? -- Real-Time Miscalibration Detection of LiDAR and Camera Sensors
Ilir Tahiraj
Jeremialie Swadiryus
F. Fent
Markus Lienkamp
29
0
0
31 Mar 2025
InteractionMap: Improving Online Vectorized HDMap Construction with Interaction
Kuang Wu
Chuan Yang
Zhanbin Li
53
0
0
27 Mar 2025
Resilient Sensor Fusion under Adverse Sensor Failures via Multi-Modal Expert Fusion
Konyul Park
Yecheol Kim
Daehun Kim
Jun-Won Choi
39
0
0
25 Mar 2025
GAA-TSO: Geometry-Aware Assisted Depth Completion for Transparent and Specular Objects
Y. Liu
Tong Jia
Da Cai
Hao Wang
Dongyue Chen
39
0
0
21 Mar 2025
FrustumFusionNets: A Three-Dimensional Object Detection Network Based on Tractor Road Scene
Lili Yang
Mengshuai Chang
Xiao Guo
Yuxin Feng
Yiwen Mei
Caicong Wu
3DPC
71
0
0
18 Mar 2025
Efficient Multimodal 3D Object Detector via Instance-Level Contrastive Distillation
Zhuoqun Su
Huimin Lu
Shuaifeng Jiao
Junhao Xiao
Y. Wang
Xieyuanli Chen
3DPC
56
0
0
17 Mar 2025
Industrial-Grade Sensor Simulation via Gaussian Splatting: A Modular Framework for Scalable Editing and Full-Stack Validation
Xianming Zeng
Sicong Du
Qifeng Chen
Lizhe Liu
Haoyu Shu
...
Peng Chen
Yapeng Xue
Chunming Zhao
Sheng Yang
Qiang Li
3DGS
52
0
0
14 Mar 2025
DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation
Hongbin Lin
Zilu Guo
Y. Zhang
Shuaicheng Niu
Yafeng Li
R. Zhang
Shuguang Cui
Zhen Li
DiffM
51
0
0
14 Mar 2025
Active Learning from Scene Embeddings for End-to-End Autonomous Driving
Wenhao Jiang
Duo Li
Menghan Hu
Chao Ma
Ke Wang
Zhipeng Zhang
44
0
0
14 Mar 2025
V2X-ReaLO: An Open Online Framework and Dataset for Cooperative Perception in Reality
Hao Xiang
Zhaoliang Zheng
Xin Xia
Seth Z. Zhao
Letian Gao
Zewei Zhou
Tianhui Cai
Y. Zhang
Jiaqi Ma
53
0
0
13 Mar 2025
CoCMT: Communication-Efficient Cross-Modal Transformer for Collaborative Perception
Rujia Wang
Xiangbo Gao
Hao Xiang
Runsheng Xu
Zhengzhong Tu
47
2
0
13 Mar 2025
Dual-Domain Homogeneous Fusion with Cross-Modal Mamba and Progressive Decoder for 3D Object Detection
Xuzhong Hu
Zaipeng Duan
Pei An
Jun Zhang
Jie Ma
3DPC
82
0
0
12 Mar 2025
SparseVoxFormer: Sparse Voxel-based Transformer for Multi-modal 3D Object Detection
Hyeongseok Son
Jia He
Seung-In Park
Ying Min
Yunhao Zhang
ByungIn Yoo
50
0
0
11 Mar 2025
A Light Perspective for 3D Object Detection
M. E. Pederiva
J. M. D. Martino
A. Zimmer
3DPC
46
0
0
10 Mar 2025
Availability-aware Sensor Fusion via Unified Canonical Space for 4D Radar, LiDAR, and Camera
Dong-Hee Paek
Seung-Hyun Kong
38
1
0
10 Mar 2025
SP3D: Boosting Sparsely-Supervised 3D Object Detection via Accurate Cross-Modal Semantic Prompts
Shijia Zhao
Qiming Xia
Xusheng Guo
Pufan Zou
Maoji Zheng
Hai Wu
Chenglu Wen
Cheng-Yu Wang
3DPC
60
0
0
09 Mar 2025
OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection
Adrian Chow
Evelien Riddell
Yimu Wang
Sean Sedwards
Krzysztof Czarnecki
3DPC
46
0
0
09 Mar 2025
TransParking: A Dual-Decoder Transformer Framework with Soft Localization for End-to-End Automatic Parking
Hangyu Du
Chee-Meng Chew
ViT
41
1
0
08 Mar 2025
Fake It To Make It: Virtual Multiviews to Enhance Monocular Indoor Semantic Scene Completion
Anith Selvakumar
Manasa Bharadwaj
39
0
0
07 Mar 2025
Manboformer: Learning Gaussian Representations via Spatial-temporal Attention Mechanism
Ziyue Zhao
Qining Qi
Jianfa Ma
48
0
0
06 Mar 2025
BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop Driving
Katharina Winter
Mark Azer
Fabian B. Flohr
53
0
0
05 Mar 2025
BEVMOSNet: Multimodal Fusion for BEV Moving Object Segmentation
Hiep Truong Cong
Ajay Kumar Sigatapu
Arindam Das
Yashwanth Sharma
Venkatesh Satagopan
Ganesh Sistu
Ciarán Eising
36
0
0
05 Mar 2025
DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance
Zhao Yang
Zezhong Qian
Xiaofan Li
Weixiang Xu
Gongpeng Zhao
Ruohong Yu
Lingsi Zhu
Longjun Liu
DiffM
VGen
61
1
0
05 Mar 2025