arXiv:2203.17270 (v2, latest)
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers
European Conference on Computer Vision (ECCV), 2022
31 March 2022
Zhiqi Li
Wenhai Wang
Hongyang Li
Enze Xie
Chonghao Sima
Tong Lu
Qiao Yu
Jifeng Dai
Papers citing
"BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers"
50 / 973 papers shown
Seeing Beyond Views: Multi-View Driving Scene Video Generation with Holistic Attention
Hannan Lu
Xiaohe Wu
Shudong Wang
Xiameng Qin
Xinyu Zhang
Junyu Han
W. Zuo
Ji Tao
349
4
0
04 Dec 2024
Redundant Queries in DETR-Based 3D Detection Methods: Unnecessary and Prunable
Lizhen Xu
Zehao Wu
Wenzhao Qiu
Zehao Wu
Xiuxiu Bai
K. Mei
Jianru Xue
488
2
0
03 Dec 2024
OODFace: Benchmarking Robustness of Face Recognition under Common Corruptions and Appearance Variations
Caixin Kang
Yubo Chen
Shouwei Ruan
Shiji Zhao
Ruochen Zhang
Jiayi Wang
Shan Fu
Xingxing Wei
CVBM
579
2
0
03 Dec 2024
Epipolar Attention Field Transformers for Bird's Eye View Semantic Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Christian Witte
Jens Behley
Cyrill Stachniss
Marvin Raaijmakers
319
1
0
02 Dec 2024
HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving
Z. Wu
Jingcheng Ni
Xiaodong Wang
Yuxin Guo
Rui Chen
Lewei Lu
Jifeng Dai
Yuwen Xiong
363
17
0
02 Dec 2024
SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection
Philipp Wolters
Johannes Gilg
Torben Teepe
Fabian Herzog
Felix Fent
Gerhard Rigoll
388
5
0
29 Nov 2024
Improving Batch Normalization with TTA for Robust Object Detection in Self-Driving
Dacheng Liao
Mengshi Qi
Liang Liu
Huadong Ma
294
0
0
28 Nov 2024
Visual SLAMMOT Considering Multiple Motion Models
Peilin Tian
Hao Li
340
2
0
28 Nov 2024
FASIONAD : FAst and Slow FusION Thinking Systems for Human-Like Autonomous Driving with Adaptive Feedback
Kangan Qian
Zhikun Ma
Yangfan He
Ziang Luo
Lewei He
...
Zheng Fu
Xinyu Jiao
Yunlong Wang
Ke Wang
Takafumi Matsumaru
AI4CE
243
8
0
27 Nov 2024
D²-World: An Efficient World Model through Decoupled Dynamic Flow
Haiming Zhang
Xu Yan
Ying Xue
Zixuan Guo
Shuguang Cui
Hui Yuan
Bingbing Liu
252
0
0
26 Nov 2024
OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection
Zhongyu Xia
Jishuo Li
Zhiwei Lin
Xinhao Wang
Longji Xu
Ming-Hsuan Yang
VLM
425
8
0
26 Nov 2024
Monocular Lane Detection Based on Deep Learning: A Survey
Xin He
Haiyun Guo
Kuan Zhu
Bingke Zhu
Xu Zhao
Jianwu Fang
Jinqiao Wang
707
13
0
25 Nov 2024
Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene Completion
Computer Vision and Pattern Recognition (CVPR), 2024
Jongseong Bae
Junwoo Ha
Ha Young Kim
398
2
0
25 Nov 2024
Language Driven Occupancy Prediction
Zhu Yu
Bowen Pang
Lizhe Liu
Runmin Zhang
Qihao Peng
Maochun Luo
Maochun Luo
Mingxia Chen
Si-Yuan Cao
Hui-Liang Shen
488
7
0
25 Nov 2024
Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data
Rui Huang
Henry Zheng
Yan Wang
Zhuofan Xia
Marco Pavone
Gao Huang
3DPC
VLM
374
1
0
23 Nov 2024
DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving
Computer Vision and Pattern Recognition (CVPR), 2024
Bencheng Liao
Shaoyu Chen
Haoran Yin
Bo Jiang
Cheng Wang
...
Xinbang Zhang
Xiangyu Li
Y. Zhang
Qian Zhang
Xinggang Wang
668
153
0
22 Nov 2024
Spatiotemporal Decoupling for Efficient Vision-Based Occupancy Forecasting
Computer Vision and Pattern Recognition (CVPR), 2024
Jingyi Xu
Xieyuanli Chen
Junyi Ma
Jiawei Huang
Jintao Xu
Yanjie Wang
Ling Pei
199
4
0
21 Nov 2024
A Resource Efficient Fusion Network for Object Detection in Bird's-Eye View using Camera and Raw Radar Data
Kavin Chandrasekaran
Sorin Grigorescu
Gijs Dubbelman
P. Jancura
292
1
0
20 Nov 2024
GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving
Shaoqing Xu
Fang Li
Shengyin Jiang
Ziying Song
Li Liu
Zhi-xin Yang
3DGS
SSL
291
11
0
19 Nov 2024
Unveiling the Hidden: Online Vectorized HD Map Construction with Clip-Level Token Interaction and Propagation
Neural Information Processing Systems (NeurIPS), 2024
Nayeon Kim
Hongje Seong
Daehyun Ji
Sujin Jang
184
8
0
17 Nov 2024
V2X-Radar: A Multi-modal Dataset with 4D Radar for Cooperative Perception
Lei Yang
Wei Wei
Jun Li
Chen Wang
Zhiying Song
...
Li-e Wang
Mo Zhou
Yang Shen
Kai Wu
Chen Lv
495
20
0
17 Nov 2024
EVT: Efficient View Transformation for Multi-Modal 3D Object Detection
Yongjin Lee
Hyeon-Mun Jeong
Yurim Jeon
Sanghyun Kim
538
2
0
16 Nov 2024
EMPERROR: A Flexible Generative Perception Error Model for Probing Self-Driving Planners
IEEE Robotics and Automation Letters (RA-L), 2024
Niklas Hanselmann
Simon Doll
Marius Cordts
Hendrik P. A. Lensch
Andreas Geiger
479
2
0
12 Nov 2024
Fast and Efficient Transformer-based Method for Bird's Eye View Instance Prediction
Miguel Antunes-García
L. Bergasa
Santiago Montiel-Marín
R. Barea
Fabio Sánchez-García
Ángel Llamazares
249
1
0
11 Nov 2024
LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation
International Conference on 3D Vision (3DV), 2024
Weijie Ma
Jingwei Jiang
Yue Yang
Zhaoyu Chen
Hao Chen
313
3
0
09 Nov 2024
ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving
Neural Information Processing Systems (NeurIPS), 2024
Tao Ma
Hongbin Zhou
Qiusheng Huang
Xuemeng Yang
Jianfei Guo
Bo Zhang
Min Dou
Yu Qiao
Ding Wang
Jiaming Song
215
5
0
08 Nov 2024
CFPNet: Improving Lightweight ToF Depth Completion via Cross-zone Feature Propagation
International Conference on 3D Vision (3DV), 2024
Laiyan Ding
Hualie Jiang
Rui Xu
Rui Huang
579
4
0
07 Nov 2024
OccLoff: Learning Optimized Feature Fusion for 3D Occupancy Prediction
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Ji Zhang
Yiran Ding
Zixin Liu
3DPC
342
5
0
06 Nov 2024
CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object Detection
Neural Information Processing Systems (NeurIPS), 2024
Jisong Kim
Minjae Seong
Jun Won Choi
465
9
0
05 Nov 2024
Decoupling Fine Detail and Global Geometry for Compressed Depth Map Super-Resolution
Computer Vision and Pattern Recognition (CVPR), 2024
Huan Zheng
Wencheng Han
Jianbing Shen
459
6
0
05 Nov 2024
VQ-Map: Bird's-Eye-View Map Layout Estimation in Tokenized Discrete Space via Vector Quantization
Neural Information Processing Systems (NeurIPS), 2024
Yiwei Zhang
Jin Gao
Fudong Ge
Guan Luo
Bing Li
Zheng Zhang
Haibin Ling
Weiming Hu
166
1
0
03 Nov 2024
HeightMapNet: Explicit Height Modeling for End-to-End HD Map Learning
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Wenzhao Qiu
Zehao Wu
Hao Zhang
Jianwu Fang
Jianru Xue
177
3
0
03 Nov 2024
GAFusion: Adaptive Fusing LiDAR and Camera with Multiple Guidance for 3D Object Detection
Computer Vision and Pattern Recognition (CVPR), 2024
Xiaotian Li
Baojie Fan
Jiandong Tian
Huijie Fan
3DPC
331
22
0
01 Nov 2024
Uncertainty Estimation for 3D Object Detection via Evidential Learning
Nikita Durasov
Rafid Mahmood
Jiwoong Choi
Marc T. Law
James Lucas
Pascal Fua
Jose M. Alvarez
UQCV
EDL
3DPC
338
5
0
31 Oct 2024
EMMA: End-to-End Multimodal Model for Autonomous Driving
Jyh-Jing Hwang
Runsheng Xu
Hubert Lin
Wei-Chih Hung
Jingwei Ji
...
Benjamin Sapp
Yin Zhou
James Guo
Dragomir Anguelov
Mingxing Tan
VLM
LM&Ro
433
117
0
30 Oct 2024
Unified Domain Generalization and Adaptation for Multi-View 3D Object Detection
Neural Information Processing Systems (NeurIPS), 2024
Gyusam Chang
Jiwon Lee
Donghyun Kim
Jinkyu Kim
Dongwook Lee
Daehyun Ji
Sujin Jang
Sangpil Kim
396
9
0
29 Oct 2024
Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving
Bo Jiang
Shaoyu Chen
Bencheng Liao
Xingyu Zhang
Wei Yin
Qian Zhang
Chang Huang
Wen Liu
Xinyu Wang
VLM
MLLM
LRM
311
78
0
29 Oct 2024
BEVPose: Unveiling Scene Semantics through Pose-Guided Multi-Modal BEV Alignment
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024
M. Hosseinzadeh
Ian Reid
235
2
0
28 Oct 2024
UniDrive: Towards Universal Driving Perception Across Camera Configurations
Ye Li
Wenzhao Zheng
Xiaonan Huang
Kurt Keutzer
427
1
0
17 Oct 2024
Real-time Stereo-based 3D Object Detection for Streaming Perception
Neural Information Processing Systems (NeurIPS), 2024
Changcai Li
Zonghua Gu
Gang Chen
Libo Huang
Wei Zhang
Huihui Zhou
3DPC
214
2
0
16 Oct 2024
MambaBEV: An efficient 3D detection model with Mamba2
Zihan You
Hao Wang
Qichao Zhao
Jinxiang Wang
Jinxiang Wang
Mamba
263
4
0
16 Oct 2024
TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement
European Conference on Artificial Intelligence (ECAI), 2024
Zhiwei Lin
Hongbo Jin
Yongtao Wang
Yufei Wei
Nan Dong
284
5
0
15 Oct 2024
big.LITTLE Vision Transformer for Efficient Visual Recognition
He Guo
Yulong Wang
Zixuan Ye
Jifeng Dai
Yuwen Xiong
ViT
262
1
0
14 Oct 2024
ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object Detection
Jiwei Chen
Laiyan Ding
Chi Zhang
Feifei Li
254
1
0
14 Oct 2024
ET-Former: Efficient Triplane Deformable Attention for 3D Semantic Scene Completion From Monocular Camera
Jing Liang
He Yin
Xuewei Qi
Jong Jin Park
Min Sun
R. Madhivanan
Dinesh Manocha
3DPC
361
0
0
14 Oct 2024
SPA: 3D Spatial-Awareness Enables Effective Embodied Representation
International Conference on Learning Representations (ICLR), 2024
Haoyi Zhu
Honghui Yang
Yating Wang
Jiange Yang
Limin Wang
Tong He
3DH
384
24
0
10 Oct 2024
Progressive Multi-Modal Fusion for Robust 3D Object Detection
Conference on Robot Learning (CoRL), 2024
Rohit Mohan
Daniele Cattaneo
Florian Drews
Abhinav Valada
3DPC
328
9
0
09 Oct 2024
QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird's-Eye-View Representation
Yuxin Li
Yiheng Li
Xulei Yang
Mengying Yu
Zihang Huang
Xiaojun Wu
Chai Kiat Yeo
160
0
0
09 Oct 2024
Learning Content-Aware Multi-Modal Joint Input Pruning via Bird's-Eye-View Representation
Yuxin Li
Yiheng Li
Xulei Yang
Mengying Yu
Zihang Huang
Xiaojun Wu
Chai Kiat Yeo
191
1
0
09 Oct 2024
BEVLoc: Cross-View Localization and Matching via Birds-Eye-View Synthesis
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024
Christopher Klammer
Michael Kaess
221
2
0
08 Oct 2024