2112.11790
Cited By
BEVDet: High-performance Multi-camera 3D Object Detection in Bird-Eye-View
22 December 2021
Junjie Huang
Guan Huang
Zheng Zhu
Yun Ye
Dalong Du
3DPC
Papers citing
"BEVDet: High-performance Multi-camera 3D Object Detection in Bird-Eye-View"
50 / 100 papers shown
Title
Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving
Zongchuang Zhao
Haoyu Fu
Dingkang Liang
Xin Zhou
Dingyuan Zhang
Hongwei Xie
Bing Wang
Xiang Bai
MLLM
VLM
39
0
0
13 May 2025
OccCylindrical: Multi-Modal Fusion with Cylindrical Representation for 3D Semantic Occupancy Prediction
Zhenxing Ming
J. S. Berrio
Mao Shan
Yaoqi Huang
Hongyu Lyu
Nguyen Hoang Khoi Tran
Tzu-Yun Tseng
Stewart Worrall
3DPC
55
0
0
06 May 2025
DiVE: Efficient Multi-View Driving Scenes Generation Based on Video Diffusion Transformer
Junpeng Jiang
Gangyi Hong
Miao Zhang
Hengtong Hu
Kun Zhan
Rui Shao
Liqiang Nie
VGen
51
0
0
28 Apr 2025
STCOcc: Sparse Spatial-Temporal Cascade Renovation for 3D Occupancy and Scene Flow Prediction
Zhimin Liao
Ping Wei
Shuaijia Chen
Haoxuan Wang
Ziyang Ren
64
0
0
28 Apr 2025
Towards Latency-Aware 3D Streaming Perception for Autonomous Driving
Jiaqi Peng
Tai Wang
Jiangmiao Pang
Yuan Shen
33
0
0
27 Apr 2025
A Review of 3D Object Detection with Vision-Language Models
Ranjan Sapkota
Konstantinos I Roumeliotis
Rahul Harsha Cheppally
Marco Flores Calero
Manoj Karkee
VLM
74
1
0
25 Apr 2025
Rethinking Temporal Fusion with a Unified Gradient Descent View for 3D Semantic Occupancy Prediction
Dubing Chen
Huan Zheng
Jin Fang
Xingping Dong
Xianfei Li
Wenlong Liao
Tao He
Pai Peng
Jianbing Shen
35
0
0
17 Apr 2025
RoPETR: Improving Temporal Camera-Only 3D Detection by Integrating Enhanced Rotary Position Embedding
Hang Ji
Tao Ni
Xufeng Huang
Tao Luo
Xin Zhan
Junbo Chen
3DPC
35
0
0
17 Apr 2025
CoGen: 3D Consistent Video Generation via Adaptive Conditioning for Autonomous Driving
Yishen Ji
Ziyue Zhu
Zhenxin Zhu
Kaixin Xiong
Ming Lu
Zhiqi Li
Lijun Zhou
Haiyang Sun
Bing Wang
Tong Lu
VGen
53
1
0
28 Mar 2025
Leveraging 3D Geometric Priors in 2D Rotation Symmetry Detection
Ahyun Seo
Minsu Cho
71
0
0
26 Mar 2025
Dual-Domain Homogeneous Fusion with Cross-Modal Mamba and Progressive Decoder for 3D Object Detection
Xuzhong Hu
Zaipeng Duan
Pei An
Jun Zhang
Jie Ma
3DPC
84
0
0
12 Mar 2025
BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance
Xin Ye
Burhaneddin Yaman
Sheng Cheng
Feng Tao
Abhirup Mallik
Liu Ren
DiffM
63
1
0
27 Feb 2025
Easy-Poly: A Easy Polyhedral Framework For 3D Multi-Object Tracking
Peng Zhang
Xin Li
Xin Lin
Liang He
VOT
78
0
0
25 Feb 2025
DeepInteraction++: Multi-Modality Interaction for Autonomous Driving
Zeyu Yang
Nan Song
Wei Li
Xiatian Zhu
L. Zhang
Philip H. S. Torr
69
4
0
24 Feb 2025
LXLv2: Enhanced LiDAR Excluded Lean 3D Object Detection with Fusion of 4D Radar and Camera
Weiyi Xiong
Zean Zou
Qiuchi Zhao
Fengchun He
Bing Zhu
64
0
0
21 Feb 2025
SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset
Goodarz Mehr
A. Eskandarian
61
1
0
04 Feb 2025
Dual-BEV Nav: Dual-layer BEV-based Heuristic Path Planning for Robotic Navigation in Unstructured Outdoor Environments
J. Zhang
Hanlin Dong
Jian Yang
J. H. Liu
Shibo Huang
Ke Li
Xuan Tang
Xian Wei
Xiong You
82
1
0
30 Jan 2025
SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation
Jianing Li
Ming Lu
Hao Wang
Chenyang Gu
Wenzhao Zheng
Li Du
S. Zhang
88
0
0
28 Jan 2025
Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving
Yu Yang
Jianbiao Mei
Yukai Ma
Siliang Du
Wenqing Chen
Yijie Qian
Yuxiang Feng
Yong-jin Liu
84
11
0
20 Jan 2025
LDMapNet-U: An End-to-End System for City-Scale Lane-Level Map Updating
Deguo Xia
Weiming Zhang
Xiyan Liu
Wei Emma Zhang
Chenting Gong
Xiao Tan
Jizhou Huang
Mengmeng Yang
D. Yang
25
0
0
06 Jan 2025
MR-Occ: Efficient Camera-LiDAR 3D Semantic Occupancy Prediction Using Hierarchical Multi-Resolution Voxel Representation
Minjae Seong
Jisong Kim
Geonho Bang
Hawook Jeong
Jun Won Choi
101
1
0
31 Dec 2024
RaCFormer: Towards High-Quality 3D Object Detection via Query-based Radar-Camera Fusion
Xiaomeng Chu
Jiajun Deng
Guoliang You
Yifan Duan
Houqiang Li
Yanyong Zhang
109
0
0
17 Dec 2024
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding
Haoyi Jiang
Liu Liu
Tianheng Cheng
Xinjie Wang
Tianwei Lin
Zhizhong Su
W. Liu
X. Wang
3DGS
ViT
108
5
0
17 Dec 2024
OmniHD-Scenes: A Next-Generation Multimodal Dataset for Autonomous Driving
Lianqing Zheng
Long Yang
Qunshu Lin
W. Ai
Minghao Liu
...
Jingyue Mo
Xiaokai Bai
Jie Bai
Zhixiong Ma
Xichan Zhu
96
6
0
14 Dec 2024
DSRC: Learning Density-insensitive and Semantic-aware Collaborative Representation against Corruptions
Jingyu Zhang
Yilei Wang
Lang Qian
Peng Sun
Zengwen Li
Sudong Jiang
Maolin Liu
Liang Song
93
1
0
14 Dec 2024
DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving
Bencheng Liao
Shaoyu Chen
Haoran Yin
Bo Jiang
Cheng Wang
...
Xinbang Zhang
Xiangyu Li
Y. Zhang
Qian Zhang
Xinggang Wang
102
14
0
22 Nov 2024
V2X-Radar: A Multi-modal Dataset with 4D Radar for Cooperative Perception
Lei Yang
X. Zhang
Jun Li
Chen Wang
Zhiying Song
...
Mo Zhou
Yang Shen
Kai Wu
Chen Lv
58
4
0
17 Nov 2024
EVT: Efficient View Transformation for Multi-Modal 3D Object Detection
Yongjin Lee
Hyeon-Mun Jeong
Yurim Jeon
Sanghyun Kim
45
0
0
16 Nov 2024
EMPERROR: A Flexible Generative Perception Error Model for Probing Self-Driving Planners
Niklas Hanselmann
Simon Doll
Marius Cordts
Hendrik P. A. Lensch
Andreas Geiger
39
0
0
12 Nov 2024
MambaBEV: An efficient 3D detection model with Mamba2
Zihan You
Hao Wang
Qichao Zhao
Jinxiang Wang
Mamba
63
4
0
16 Oct 2024
CVCP-Fusion: On Implicit Depth Estimation for 3D Bounding Box Prediction
Pranav Gupta
Rishabh Rengarajan
Viren Bankapur
Vedansh Mannem
Lakshit Ahuja
Surya Vijay
Kevin Wang
3DPC
16
0
0
15 Oct 2024
RenderWorld: World Model with Self-Supervised 3D Label
Ziyang Yan
Wenzhen Dong
Yihua Shao
Yuhang Lu
Liu Haiyang
...
Haozhe Wang
Zhe Wang
Yan Wang
Fabio Remondino
Yuexin Ma
3DV
VGen
62
11
0
17 Sep 2024
Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction
Yuan Wu
Zhiqiang Yan
Zhengxue Wang
Xiang Li
Le Hui
Jian Yang
63
4
0
12 Sep 2024
DriveScape: Towards High-Resolution Controllable Multi-View Driving Video Generation
Wei Yu Wu
Xi Guo
Weixuan Tang
Tingxuan Huang
Chiyu Wang
Dongyue Chen
C. Ding
VGen
30
6
0
09 Sep 2024
HeightLane: BEV Heightmap guided 3D Lane Detection
Chaesong Park
Eunbin Seo
Jongwoo Lim
70
2
0
15 Aug 2024
RayFormer: Improving Query-Based Multi-Camera 3D Object Detection via Ray-Centric Strategies
Xiaomeng Chu
Jiajun Deng
Guoliang You
Yifan Duan
Yao Li
Yanyong Zhang
31
3
0
20 Jul 2024
CT3D++: Improving 3D Object Detection with Keypoint-induced Channel-wise Transformer
Hualian Sheng
Sijia Cai
Na Zhao
Bing Deng
Qiao Liang
Min-Jian Zhao
Jieping Ye
3DPC
35
0
0
12 Jun 2024
Enhancing End-to-End Autonomous Driving with Latent World Model
Yingyan Li
Lue Fan
Jiawei He
Yuqi Wang
Yuntao Chen
Zhaoxiang Zhang
Tieniu Tan
70
8
0
12 Jun 2024
RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks
Zhechao Wang
Peirui Cheng
Pengju Tian
Yuchao Wang
Mingxin Chen
Shujing Duan
Zhirui Wang
Xinming Li
Xian Sun
26
2
0
11 Jun 2024
S2-Track: A Simple yet Strong Approach for End-to-End 3D Multi-Object Tracking
Lijun Zhou
Tao Tang
Pengkun Hao
Zihang He
Kalok Ho
...
Zhihui Hao
Haiyang Sun
Kun Zhan
Peng Jia
Xianpeng Lang
VOT
58
4
0
04 Jun 2024
GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction
Yuanhui Huang
Wenzhao Zheng
Yunpeng Zhang
Jie Zhou
Jiwen Lu
3DGS
38
31
0
27 May 2024
BDC-Occ: Binarized Deep Convolution Unit For Binarized Occupancy Network
Zongkai Zhang
Zidong Xu
Wenming Yang
Qingmin Liao
Jing-Hao Xue
MQ
3DV
40
1
0
27 May 2024
Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving
Shaoyuan Xie
Lingdong Kong
Wenwei Zhang
Jiawei Ren
Liang Pan
Kai-xiang Chen
Ziwei Liu
AAML
50
9
0
27 May 2024
GEOcc: Geometrically Enhanced 3D Occupancy Network with Implicit-Explicit Depth Fusion and Contextual Self-Supervision
Xin Tan
Wenbin Wu
Zhiwei Zhang
Chaojie Fan
Yong Peng
Zhizhong Zhang
Yuan Xie
Lizhuang Ma
57
9
0
17 May 2024
DuoSpaceNet: Leveraging Both Bird's-Eye-View and Perspective View Representations for 3D Object Detection
Zhe Huang
Yizhe Zhao
Hao Xiao
Chenyan Wu
Lingting Ge
3DPC
44
1
0
17 May 2024
OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks
Sophia Sirko-Galouchenko
Alexandre Boulch
Spyros Gidaris
Andrei Bursuc
Antonín Vobecký
Patrick Pérez
Renaud Marlet
3DPC
30
7
0
22 Apr 2024
MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection
Hou-I Liu
Christine Wu
Jen-Hao Cheng
Wenhao Chai
Shian-Yun Wang
...
Jenq-Neng Hwang
Hong-Han Shuai
Wen-Huang Cheng
34
2
0
07 Apr 2024
Unleashing HyDRa: Hybrid Fusion, Depth Consistency and Radar for Unified 3D Perception
Philipp Wolters
Johannes Gilg
Torben Teepe
Fabian Herzog
Anouar Laouichi
Martin Hofmann
Gerhard Rigoll
MDE
60
12
0
12 Mar 2024
POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images
Antonín Vobecký
Oriane Siméoni
David Hurych
Spyros Gidaris
Andrei Bursuc
Patrick Pérez
Josef Sivic
29
33
0
17 Jan 2024
M-BEV: Masked BEV Perception for Robust Autonomous Driving
Siran Chen
Yue Ma
Yu Qiao
Yali Wang
19
8
0
19 Dec 2023