Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2203.17270
Cited By
v1
v2 (latest)
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers
European Conference on Computer Vision (ECCV), 2022
31 March 2022
Zhiqi Li
Wenhai Wang
Hongyang Li
Enze Xie
Chonghao Sima
Tong Lu
Qiao Yu
Jifeng Dai
Re-assign community
ArXiv (abs)
PDF
HTML
Github (18★)
Papers citing
"BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers"
22 / 972 papers shown
Title
SRCN3D: Sparse R-CNN 3D for Compact Convolutional Multi-View 3D Object Detection and Tracking
Xinyu Jiao
Jingyan Shen
Yifan Sun
Yunlong Wang
Jiaxin Li
Shiqi Sun
Yunlong Wang
Diange Yang
ViT
3DPC
132
4
0
29 Jun 2022
LaRa: Latents and Rays for Multi-Camera Bird's-Eye-View Semantic Segmentation
Conference on Robot Learning (CoRL), 2022
Florent Bartoccioni
Éloi Zablocki
Andrei Bursuc
Patrick Pérez
Matthieu Cord
Alahari Karteek
207
40
0
27 Jun 2022
BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object Detection
AAAI Conference on Artificial Intelligence (AAAI), 2022
Yinhao Li
Zheng Ge
Guanyi Yu
Jinrong Yang
Zengran Wang
Yukang Shi
Jian‐Yuan Sun
Zeming Li
MDE
448
803
0
21 Jun 2022
3D Object Detection for Autonomous Driving: A Comprehensive Survey
International Journal of Computer Vision (IJCV), 2022
Jiageng Mao
Shaoshuai Shi
Xiaogang Wang
Jiaming Song
3DPC
372
329
0
19 Jun 2022
Simple-BEV: What Really Matters for Multi-Sensor BEV Perception?
IEEE International Conference on Robotics and Automation (ICRA), 2022
Adam W. Harley
Zhaoyuan Fang
Jie Li
Rares Andrei Ambrus
Katerina Fragkiadaki
219
166
0
16 Jun 2022
LET-3D-AP: Longitudinal Error Tolerant 3D Average Precision for Camera-Only 3D Detection
IEEE International Conference on Robotics and Automation (ICRA), 2022
Wei-Chih Hung
Vincent Casser
Henrik Kretzschmar
Jyh-Jing Hwang
Drago Anguelov
142
36
0
15 Jun 2022
Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel Transformer
Shaoyu Chen
Tianheng Cheng
Xinggang Wang
Wenming Meng
Qian Zhang
Wenyu Liu
ViT
205
94
0
09 Jun 2022
Delving into the Pre-training Paradigm of Monocular 3D Object Detection
Zhuoling Li
Chuanrui Zhang
En Yu
Haoqian Wang
139
1
0
08 Jun 2022
PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images
IEEE International Conference on Computer Vision (ICCV), 2022
Yingfei Liu
Junjie Yan
Fan Jia
Shuailin Li
Q. Gao
Tiancai Wang
Xinming Zhang
Jian Sun
3DPC
296
449
0
02 Jun 2022
TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Kashyap Chitta
Aditya Prakash
Bernhard Jaeger
Zehao Yu
Katrin Renz
Andreas Geiger
ViT
444
489
0
31 May 2022
Benchmarking the Robustness of LiDAR-Camera Fusion for 3D Object Detection
Kaicheng Yu
Tao Tang
Hongwei Xie
Zhiwei Lin
Zhongwei Wu
...
Jiong Deng
Dayang Hao
Yongtao Wang
Xi Liang
Bing Wang
3DPC
177
67
0
30 May 2022
BEVFusion: A Simple and Robust LiDAR-Camera Fusion Framework
Neural Information Processing Systems (NeurIPS), 2022
Tingting Liang
Hongwei Xie
Kaicheng Yu
Zhongyu Xia
Zhiwei Lin
Yongtao Wang
T. Tang
Bing Wang
Zhi Tang
3DPC
239
550
0
27 May 2022
BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
IEEE International Conference on Robotics and Automation (ICRA), 2022
Zhijian Liu
Haotian Tang
Alexander Amini
Xinyu Yang
Huizi Mao
Daniela Rus
Song Han
524
1,223
0
26 May 2022
SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation
Conference on Robot Learning (CoRL), 2022
Yi Wei
Linqing Zhao
Wenzhao Zheng
Zheng Hua Zhu
Yongming Rao
Guan Huang
Jiwen Lu
Jie Zhou
MDE
308
89
0
07 Apr 2022
BEVDet4D: Exploit Temporal Cues in Multi-camera 3D Object Detection
Junjie Huang
Guan Huang
499
434
0
31 Mar 2022
EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation
Computer Vision and Pattern Recognition (CVPR), 2022
Hansheng Chen
Pichao Wang
Fan Wang
Wei Tian
Lu Xiong
Hao Li
300
1
0
24 Mar 2022
MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection
IEEE International Conference on Computer Vision (ICCV), 2022
Renrui Zhang
Han Qiu
Tai Wang
Ziyu Guo
Xuan Xu
Xuanzhuo Xu
Ziteng Cui
Shiyang Feng
Jiaming Song
Hongsheng Li
ViT
MDE
437
137
0
24 Mar 2022
PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark
European Conference on Computer Vision (ECCV), 2022
Li Chen
Chonghao Sima
Yang Li
Zehan Zheng
Jiajie Xu
...
Hongyang Li
Conghui He
Jianping Shi
Yu Qiao
Junchi Yan
3DPC
ViT
241
225
0
21 Mar 2022
BEVSegFormer: Bird's Eye View Semantic Segmentation From Arbitrary Camera Rigs
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Lang Peng
Zhirong Chen
Zhang-Hua Fu
Pengpeng Liang
Erkang Cheng
362
169
0
08 Mar 2022
Voxelized 3D Feature Aggregation for Multiview Detection
International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2021
Jiahao Ma
Jinguang Tong
Shan Wang
Wei Zhao
Zicheng Duan
Chuong H. Nguyen
186
8
0
07 Dec 2021
View Birdification in the Crowd: Ground-Plane Localization from Perceived Movements
International Journal of Computer Vision (IJCV), 2021
Mai Nishimura
S. Nobuhara
Ko Nishino
309
5
0
09 Nov 2021
PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection
International Journal of Computer Vision (IJCV), 2021
Shaoshuai Shi
Li Jiang
Jiajun Deng
Zhe Wang
Chaoxu Guo
Jianping Shi
Xiaogang Wang
Jiaming Song
3DPC
353
512
0
31 Jan 2021
Previous
1
2
3
...
18
19
20