Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2203.17270
Cited By
v1
v2 (latest)
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers
European Conference on Computer Vision (ECCV), 2022
31 March 2022
Zhiqi Li
Wenhai Wang
Hongyang Li
Enze Xie
Chonghao Sima
Tong Lu
Qiao Yu
Jifeng Dai
Re-assign community
ArXiv (abs)
PDF
HTML
Github (18★)
Papers citing
"BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers"
50 / 973 papers shown
Driving Through Uncertainty: Risk-Averse Control with LLM Commonsense for Autonomous Driving under Perception Deficits
Yuting Hu
Chenhui Xu
Ruiyang Qin
Dancheng Liu
Amir Nassereldine
Yiyu Shi
Jinjun Xiong
288
1
0
10 Mar 2025
CATPlan: Loss-based Collision Prediction in End-to-End Autonomous Driving
Ziliang Xiong
Shipeng Liu
Nathaniel Helgesen
Joakim Johnander
Per-Erik Forssén
538
1
0
10 Mar 2025
Learning A Zero-shot Occupancy Network from Vision Foundation Models via Self-supervised Adaptation
Sihao Lin
Daqi Liu
Ruochong Fu
Dongrui Liu
A. Song
Hongwei Xie
Zhihui Li
Bing Wang
Xiaojun Chang
318
0
0
10 Mar 2025
HierDAMap: Towards Universal Domain Adaptive BEV Mapping via Hierarchical Perspective Priors
Siyu Li
Yihong Cao
Hao-miao Shi
Yongsheng Zang
Xuan He
Kailun Yang
Hui Yuan
356
1
0
10 Mar 2025
TransParking: A Dual-Decoder Transformer Framework with Soft Localization for End-to-End Automatic Parking
Hangyu Du
Chee-Meng Chew
ViT
220
5
0
08 Mar 2025
DriveTransformer: Unified Transformer for Scalable End-to-End Autonomous Driving
International Conference on Learning Representations (ICLR), 2025
Xiaosong Jia
Junqi You
Zhiyuan Zhang
Junchi Yan
405
63
0
07 Mar 2025
Manboformer: Learning Gaussian Representations via Spatial-temporal Attention Mechanism
Ziyue Zhao
Qining Qi
Jianfa Ma
217
0
0
06 Mar 2025
H3O: Hyper-Efficient 3D Occupancy Prediction with Heterogeneous Supervision
IEEE International Conference on Robotics and Automation (ICRA), 2025
Y. Shi
H. Cai
Amin Ansari
Fatih Porikli
391
3
0
06 Mar 2025
DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance
Zhao Yang
Zezhong Qian
Xiaofan Li
Weixiang Xu
Gongpeng Zhao
Ruohong Yu
Lingsi Zhu
Longjun Liu
DiffM
VGen
355
4
0
05 Mar 2025
IC-Mapper: Instance-Centric Spatio-Temporal Modeling for Online Vectorized Map Construction
ACM Multimedia (MM), 2024
Jiangtong Zhu
Zhao Yang
Yinan Shi
Jianwu Fang
Jianru Xue
ISeg
407
1
0
05 Mar 2025
Dur360BEV: A Real-world 360-degree Single Camera Dataset and Benchmark for Bird-Eye View Mapping in Autonomous Driving
IEEE International Conference on Robotics and Automation (ICRA), 2025
Wenke E
Chao Yuan
Li Li
Yixin Sun
Yona Falinie A. Gaus
Amir Atapour-Abarghouei
T. Breckon
368
0
0
02 Mar 2025
BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance
Computer Vision and Pattern Recognition (CVPR), 2025
Xin Ye
Burhaneddin Yaman
Sheng Cheng
Feng Tao
Abhirup Mallik
Liu Ren
DiffM
432
10
0
27 Feb 2025
CoopDETR: A Unified Cooperative Perception Framework for 3D Detection via Object Query
IEEE International Conference on Robotics and Automation (ICRA), 2025
Liang Luo
Shaocong Xu
Xucai Zhuang
Tongda Xu
Yan Wang
Qingbin Liu
Yilun Chen
Yuanhang Zhang
379
6
0
26 Feb 2025
Glad: A Streaming Scene Generator for Autonomous Driving
International Conference on Learning Representations (ICLR), 2025
Bin Xie
Yingfei Liu
Tiancai Wang
Jiale Cao
Xinming Zhang
3DGS
VGen
294
11
0
26 Feb 2025
VLM-E2E: Enhancing End-to-End Autonomous Driving with Multimodal Driver Attention Fusion
Pei Liu
Haipeng Liu
Haichao Liu
Xin Liu
Jinxin Ni
Jun Ma
438
20
0
25 Feb 2025
DeepInteraction++: Multi-Modality Interaction for Autonomous Driving
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Zeyu Yang
Nan Song
Wei Li
Xiatian Zhu
Guang Dai
Philip H. S. Torr
472
8
0
24 Feb 2025
LCV2I: Communication-Efficient and High-Performance Collaborative Perception Framework with Low-Resolution LiDAR
Xinxin Feng
Haoran Sun
Haifeng Zheng
260
0
0
24 Feb 2025
LXLv2: Enhanced LiDAR Excluded Lean 3D Object Detection with Fusion of 4D Radar and Camera
IEEE Robotics and Automation Letters (IEEE RA-L), 2025
Weiyi Xiong
Zean Zou
Qiuchi Zhao
Fengchun He
Bing Zhu
385
10
0
21 Feb 2025
Deflickering Vision-Based Occupancy Networks through Lightweight Spatio-Temporal Correlation
Fengcheng Yu
Haoran Xu
Canming Xia
Guang Tan
Guang Tan
399
0
0
21 Feb 2025
RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning
Hao Gao
Shaoyu Chen
Bo Jiang
Bencheng Liao
Yiang Shi
...
Xinbang Zhang
Y. Zhang
Wenyu Liu
Qian Zhang
Xinggang Wang
402
41
0
18 Feb 2025
MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction
Computer Vision and Pattern Recognition (CVPR), 2025
Jingcheng Ni
Yuxin Guo
Yichen Liu
Rui Chen
Lewei Lu
Z. Wu
DiffM
VGen
304
18
0
17 Feb 2025
FeaKM: Robust Collaborative Perception under Noisy Pose Conditions
Jiuwu Hao
Liguo Sun
Ti Xiang
Yuting Wan
Haolin Song
Pin Lv
358
0
0
16 Feb 2025
PDM-SSD: Single-Stage Three-Dimensional Object Detector With Point Dilation
Ao Liang
Haiyang Hua
Jian Fang
Wenyu Chen
Huaici Zhao
3DPC
230
0
0
10 Feb 2025
SMART: Advancing Scalable Map Priors for Driving Topology Reasoning
IEEE International Conference on Robotics and Automation (ICRA), 2025
Junjie Ye
David Paz
Hengyuan Zhang
Yuliang Guo
Xinyu Huang
Henrik I. Christensen
Yue Wang
Liu Ren
LRM
363
5
0
06 Feb 2025
SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation
IEEE International Conference on Robotics and Automation (ICRA), 2025
Jianing Li
Ming Lu
Hao Wang
Chenyang Gu
Wenzhao Zheng
Li Du
Shanghang Zhang
382
1
0
28 Jan 2025
MetaOcc: Spatio-Temporal Fusion of Surround-View 4D Radar and Camera for 3D Occupancy Prediction with Dual Training Strategies
Long Yang
Lianqing Zheng
W. Ai
Minghao Liu
Sen Li
...
Shengyu Yan
Jie Bai
Zhixiong Ma
Tao Huang
Xichan Zhu
954
3
0
26 Jan 2025
mmCooper: A Multi-agent Multi-stage Communication-efficient and Collaboration-robust Cooperative Perception Framework
Bingyi Liu
Jian Teng
Hongfei Xue
Enshu Wang
Chuanhui Zhu
Pu Wang
Libing Wu
445
5
0
21 Jan 2025
A Survey of World Models for Autonomous Driving
Tuo Feng
Wenguan Wang
Yue Yang
VGen
692
21
0
20 Jan 2025
Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving
AAAI Conference on Artificial Intelligence (AAAI), 2024
Yu Yang
Jianbiao Mei
Yukai Ma
Siliang Du
Wenqing Chen
Yijie Qian
Yuxiang Feng
Yong Liu
538
41
0
20 Jan 2025
Distilling Multi-modal Large Language Models for Autonomous Driving
Computer Vision and Pattern Recognition (CVPR), 2025
Deepti Hegde
R. Yasarla
H. Cai
Shizhong Han
Apratim Bhattacharyya
Shweta Mahajan
Litian Liu
Risheek Garrepalli
Vishal M. Patel
Fatih Porikli
203
26
0
17 Jan 2025
DriveLM: Driving with Graph Visual Question Answering
European Conference on Computer Vision (ECCV), 2023
Chonghao Sima
Katrin Renz
Kashyap Chitta
Lawrence Yunliang Chen
Hanxue Zhang
Chengen Xie
Jens Beißwenger
Ping Luo
Andreas Geiger
Hongyang Li
802
355
0
17 Jan 2025
Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion
AAAI Conference on Artificial Intelligence (AAAI), 2025
Li Liang
Naveed Akhtar
Jordan Vice
Xiangrui Kong
Lin Wang
255
7
0
13 Jan 2025
MapGS: Generalizable Pretraining and Data Augmentation for Online Mapping via Novel View Synthesis
Hengyuan Zhang
David Paz
Yuliang Guo
Xinyu Huang
Henrik I. Christensen
Liu Ren
3DGS
ViT
216
4
0
11 Jan 2025
CoreNet: Conflict Resolution Network for Point-Pixel Misalignment and Sub-Task Suppression of 3D LiDAR-Camera Object Detection
Information Fusion (Inf. Fusion), 2025
Yongqian Li
Yang Yang
Zhen Lei
3DPC
269
5
0
11 Jan 2025
A Novel Vision Transformer for Camera-LiDAR Fusion based Traffic Object Segmentation
International Conference on Agents and Artificial Intelligence (ICAART), 2025
Toomas Tahves
Junyi Gu
M. Bellone
Raivo Sell
ViT
188
0
0
06 Jan 2025
LDMapNet-U: An End-to-End System for City-Scale Lane-Level Map Updating
Knowledge Discovery and Data Mining (KDD), 2025
Deguo Xia
Weiming Zhang
Xiyan Liu
Wei Emma Zhang
Chenting Gong
Xiao Tan
Jizhou Huang
Mengmeng Yang
Ke Wang
288
6
0
06 Jan 2025
Master Stability Functions in Complex Networks
Suman Acharyya
Priodyuti Pradhan
Chandrakala Meena
264
0
0
26 Dec 2024
ImagineMap: Enhanced HD Map Construction with SD Maps
Yishen Ji
Zhiqi Li
Tong Lu
309
1
0
22 Dec 2024
Anchor3DLane++: 3D Lane Detection via Sample-Adaptive Sparse 3D Anchor Regression
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Shaofei Huang
Zhenwei Shen
Zehao Huang
Yue Liao
Jizhong Han
Naiyan Wang
Si Liu
392
9
0
22 Dec 2024
A Black-Box Evaluation Framework for Semantic Robustness in Bird's Eye View Detection
AAAI Conference on Artificial Intelligence (AAAI), 2024
Fu Lee Wang
Yanghao Zhang
Xiangyu Yin
Guangliang Cheng
Zeyu Fu
Xiaowei Huang
Wenjie Ruan
AAML
436
0
0
18 Dec 2024
RaCFormer: Towards High-Quality 3D Object Detection via Query-based Radar-Camera Fusion
Computer Vision and Pattern Recognition (CVPR), 2024
Xiaomeng Chu
Jiajun Deng
Guoliang You
YiFan Duan
Houqiang Li
Yanyong Zhang
986
10
0
17 Dec 2024
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding
Computer Vision and Pattern Recognition (CVPR), 2024
Haoyi Jiang
Liu Liu
Tianheng Cheng
Xinjie Wang
Tianwei Lin
Zhizhong Su
Wen Liu
Xinyu Wang
3DGS
ViT
485
30
0
17 Dec 2024
OmniHD-Scenes: A Next-Generation Multimodal Dataset for Autonomous Driving
Lianqing Zheng
Long Yang
Qunshu Lin
W. Ai
Minghao Liu
...
Jingyue Mo
Xiaokai Bai
Jie Bai
Zhixiong Ma
Xichan Zhu
695
13
0
14 Dec 2024
DSRC: Learning Density-insensitive and Semantic-aware Collaborative Representation against Corruptions
AAAI Conference on Artificial Intelligence (AAAI), 2024
Jingyu Zhang
Yilei Wang
Lang Qian
Yang Liu
Zengwen Li
Sudong Jiang
Maolin Liu
Liang Song
438
3
0
14 Dec 2024
GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction
Computer Vision and Pattern Recognition (CVPR), 2024
Sicheng Zuo
Wenzhao Zheng
Yuanhui Huang
Jie Zhou
Jiwen Lu
3DV
3DGS
243
42
0
13 Dec 2024
PVP: Polar Representation Boost for 3D Semantic Occupancy Prediction
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Yujing Xue
Jiaxiang Liu
Jiawei Du
Qiufeng Wang
MDE
420
0
0
10 Dec 2024
HSDA: High-frequency Shuffle Data Augmentation for Bird's-Eye-View Map Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Calvin Glisson
Qiuxiao Chen
312
0
0
09 Dec 2024
Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction
Computer Vision and Pattern Recognition (CVPR), 2024
Dongxu Wei
Zhiqi Li
Peidong Liu
516
20
0
09 Dec 2024
Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection
Neural Information Processing Systems (NeurIPS), 2024
C. Zheng
Feng Wang
Naiyan Wang
Shuguang Cui
Hui Yuan
3DPC
224
4
0
06 Dec 2024
Reflective Teacher: Semi-Supervised Multimodal 3D Object Detection in Bird's-Eye-View via Uncertainty Measure
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Saheli Hazra
Sudip Das
Rohit Choudhary
Arindam Das
Ganesh Sistu
Ciarán Eising
Ujjwal Bhattacharya
303
0
0
05 Dec 2024
Previous
1
2
3
4
5
6
...
18
19
20
Next