2203.17270
Cited By
v1
v2 (latest)
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers
European Conference on Computer Vision (ECCV), 2022
31 March 2022
Zhiqi Li
Wenhai Wang
Hongyang Li
Enze Xie
Chonghao Sima
Tong Lu
Qiao Yu
Jifeng Dai
Github (18★)
Papers citing
"BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers"
50 / 973 papers shown
SuperFusion: Multilevel LiDAR-Camera Fusion for Long-Range HD Map Generation
IEEE International Conference on Robotics and Automation (ICRA), 2022
Hao Dong
Xianjing Zhang
Jintao Xu
Rui Ai
Weihao Gu
Huimin Lu
Arno Solin
Xieyuanli Chen
295
55
0
28 Nov 2022
BEV-Locator: An End-to-end Visual Semantic Localization Network Using Multi-View Images
Science China Information Sciences (Sci. China Inf. Sci.), 2022
Zhihuang Zhang
Mengze Xu
Wenqiang Zhou
T. Peng
L. Li
S. Poslad
208
31
0
27 Nov 2022
3DPPE: 3D Point Positional Encoding for Multi-Camera 3D Object Detection Transformers
Changyong Shu
Jiajun Deng
Feng Yu
Yifan Liu
3DPC
280
14
0
27 Nov 2022
3D Dual-Fusion: Dual-Domain Dual-Query Camera-LiDAR Fusion for 3D Object Detection
Yecheol Kim
Konyul Park
Minwook Kim
Dongsuk Kum
J. Choi
3DPC
228
29
0
24 Nov 2022
AeDet: Azimuth-invariant Multi-view 3D Object Detection
Computer Vision and Pattern Recognition (CVPR), 2022
Chengjian Feng
Zequn Jie
Yujie Zhong
Xiangxiang Chu
Lin Ma
3DPC
128
26
0
22 Nov 2022
Uncertainty-aware Vision-based Metric Cross-view Geolocalization
Computer Vision and Pattern Recognition (CVPR), 2022
F. Fervers
Sebastian Bullinger
C. Bodensteiner
Michael Arens
Rainer Stiefelhagen
180
60
0
22 Nov 2022
MatrixVT: Efficient Multi-Camera to BEV Transformation for 3D Perception
IEEE International Conference on Computer Vision (ICCV), 2022
Hongyu Zhou
Zheng Ge
Zeming Li
Xiangyu Zhang
155
55
0
19 Nov 2022
Sparse4D: Multi-view 3D Object Detection with Sparse Spatial-Temporal Fusion
Xuewu Lin
Tianwei Lin
Zi-Hui Pei
Lichao Huang
Zhizhong Su
3DPC
308
154
0
19 Nov 2022
BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision
Computer Vision and Pattern Recognition (CVPR), 2022
Chenyu Yang
Yuntao Chen
Haofei Tian
Chenxin Tao
Xizhou Zhu
...
Guoying Gu
Yu Qiao
Lewei Lu
Jie Zhou
Jifeng Dai
MDE
226
376
0
18 Nov 2022
BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection
International Conference on Learning Representations (ICLR), 2022
Zehui Chen
Zhenyu Li
Shiquan Zhang
Liangji Fang
Qinhong Jiang
Feng Zhao
302
80
0
17 Nov 2022
Monocular BEV Perception of Road Scenes via Front-to-Top View Projection
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Wenxi Liu
Qi Li
Weixiang Yang
Jiaxin Cai
Yuanlong Yu
Yuexin Ma
Shengfeng He
Jianxiong Pan
162
4
0
15 Nov 2022
Structured Knowledge Distillation Towards Efficient and Compact Multi-View 3D Detection
Linfeng Zhang
Yukang Shi
Hung-Shuo Tai
Zhipeng Zhang
Yuan He
Ke Wang
Kaisheng Ma
257
4
0
14 Nov 2022
Recursive Cross-View: Use Only 2D Detectors to Achieve 3D Object Detection without 3D Annotations
IEEE Robotics and Automation Letters (RA-L), 2022
Shun Gui
Yan Luximon
344
1
0
14 Nov 2022
Behavioral Intention Prediction in Driving Scenes: A Survey
Jianwu Fang
Fan Wang
Jianru Xue
Tat-Seng Chua
454
81
0
01 Nov 2022
Multi-Camera Calibration Free BEV Representation for 3D Object Detection
Hongxiang Jiang
Wenming Meng
Hongmei Zhu
Qiaosheng Zhang
Jihao Yin
186
4
0
31 Oct 2022
PlanT: Explainable Planning Transformers via Object-Level Representations
Conference on Robot Learning (CoRL), 2022
Katrin Renz
Kashyap Chitta
Otniel-Bogdan Mercea
A. Sophia Koepke
Zeynep Akata
Andreas Geiger
ViT
273
128
0
25 Oct 2022
CramNet: Camera-Radar Fusion with Ray-Constrained Cross-Attention for Robust 3D Object Detection
European Conference on Computer Vision (ECCV), 2022
Jyh-Jing Hwang
Henrik Kretzschmar
Joshua M. Manela
Sean M. Rafferty
N. Armstrong-Crews
Tiffany Chen
Drago Anguelov
3DPC
252
55
0
17 Oct 2022
Model-Based Imitation Learning for Urban Driving
Neural Information Processing Systems (NeurIPS), 2022
Anthony Hu
Gianluca Corrado
Nicolas Griffiths
Zak Murez
Corina Gurau
Hudson Yeo
Alex Kendall
R. Cipolla
Jamie Shotton
338
188
0
14 Oct 2022
X-Align: Cross-Modal Cross-View Alignment for Bird's-Eye-View Segmentation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Shubhankar Borse
Marvin Klingner
V. Kumar
H. Cai
Abdulaziz Almuzairee
S. Yogamani
Fatih Porikli
282
47
0
13 Oct 2022
Exploring Contextual Representation and Multi-Modality for End-to-End Autonomous Driving
Engineering Applications of Artificial Intelligence (EAAI), 2022
Shoaib Azam
Farzeen Munir
Ville Kyrki
M. Jeon
Witold Pedrycz
216
4
0
13 Oct 2022
BEV-LaneDet: a Simple and Effective 3D Lane Detection Baseline
Ruihao Wang
Jianbang Qin
Kai Li
Yaochen Li
Dongping Cao
Jintao Xu
248
12
0
12 Oct 2022
Depth Is All You Need for Monocular 3D Detection
IEEE International Conference on Robotics and Automation (ICRA), 2022
Dennis Park
Jie Li
Di Chen
Vitor Campagnolo Guizilini
Adrien Gaidon
3DPC
MDE
219
9
0
05 Oct 2022
Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection
International Conference on Learning Representations (ICLR), 2022
Jeongseok Lee
Chenfeng Xu
Shijia Yang
Kurt Keutzer
Kris Kitani
Masayoshi Tomizuka
Wei Zhan
291
208
0
05 Oct 2022
CrossDTR: Cross-view and Depth-guided Transformers for 3D Object Detection
IEEE International Conference on Robotics and Automation (ICRA), 2022
Ching-Yu Tseng
Yi-Rong Chen
Hsin-Ying Lee
Tsung-Han Wu
Wen-Chin Chen
Winston H. Hsu
ViT
271
16
0
27 Sep 2022
Center Feature Fusion: Selective Multi-Sensor Fusion of Center-based Objects
IEEE International Conference on Robotics and Automation (ICRA), 2022
P. Jacobson
Yiyang Zhou
Wei Zhan
Masayoshi Tomizuka
Ming Wu
3DPC
173
14
0
26 Sep 2022
FusionRCNN: LiDAR-Camera Fusion for Two-stage 3D Object Detection
Remote Sensing (RS), 2022
Xinli Xu
Shaocong Dong
Lihe Ding
Jie Wang
Tingfa Xu
Jianan Li
3DPC
296
53
0
22 Sep 2022
BEVStereo: Enhancing Depth Estimation in Multi-view 3D Object Detection with Dynamic Temporal Stereo
Yinhao Li
Han Bao
Zheng Ge
Jinrong Yang
Jian-Yuan Sun
Zeming Li
270
120
0
21 Sep 2022
GATraj: A Graph- and Attention-based Multi-Agent Trajectory Prediction Model
ISPRS Journal of Photogrammetry and Remote Sensing (JIPRS), 2022
Hao Cheng
Mengmeng Liu
Linyuan Chen
Hellward Broszio
Monika Sester
M. Yang
281
85
0
16 Sep 2022
CRAFT: Camera-Radar 3D Object Detection with Spatio-Contextual Fusion Transformer
AAAI Conference on Artificial Intelligence (AAAI), 2022
Youngseok Kim
Sanmin Kim
Junwon Choi
Dongsuk Kum
383
118
0
14 Sep 2022
M²-3DLaneNet: Exploring Multi-Modal 3D Lane Detection
Yueru Luo
Xu Yan
Chao Zheng
Chao Zheng
Shuqi Mei
Tang Kun
Shuguang Cui
Zhen Li
3DPC
169
11
0
13 Sep 2022
Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Hongyang Li
Chonghao Sima
Jifeng Dai
Wenhai Wang
Lewei Lu
...
Xiaosong Jia
Siqian Liu
Jianping Shi
Dahua Lin
Yu Qiao
321
189
0
12 Sep 2022
MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction
International Conference on Learning Representations (ICLR), 2022
Bencheng Liao
Shaoyu Chen
Xinggang Wang
Tianheng Cheng
Qian Zhang
Wenyu Liu
Chang Huang
ViT
324
330
0
30 Aug 2022
DeepInteraction: 3D Object Detection via Modality Interaction
Neural Information Processing Systems (NeurIPS), 2022
Zeyu Yang
Jia-Qing Chen
Zhenwei Miao
Wei Li
Xiatian Zhu
Li Zhang
374
189
0
23 Aug 2022
Quality Matters: Embracing Quality Clues for Robust 3D Multi-Object Tracking
Jinrong Yang
En Yu
Zeming Li
Xiaoping Li
Wenbing Tao
VOT
265
18
0
23 Aug 2022
STS: Surround-view Temporal Stereo for Multi-view 3D Detection
Zengran Wang
Chen Min
Zheng Ge
Yinhao Li
Zeming Li
Hongyu Yang
Dihe Huang
MDE
177
69
0
22 Aug 2022
A Simple Baseline for Multi-Camera 3D Object Detection
AAAI Conference on Artificial Intelligence (AAAI), 2022
Yunpeng Zhang
Wenzhao Zheng
Zhengbiao Zhu
Guan Huang
Jie Zhou
Jiwen Lu
3DPC
172
27
0
22 Aug 2022
Exploring Point-BEV Fusion for 3D Point Cloud Object Tracking with Transformer
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Zhi-Chun Luo
Changqing Zhou
Liang Pan
Gongjie Zhang
Ti Liu
Yueru Luo
Haiyu Zhao
Ziwei Liu
Shijian Lu
3DPC
210
25
0
10 Aug 2022
Vision-Centric BEV Perception: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Yuexin Ma
Tai Wang
Xuyang Bai
Huitong Yang
Yuenan Hou
Yaming Wang
Yu Qiao
Ruigang Yang
Tianyi Zhou
Xinge Zhu
545
177
0
04 Aug 2022
ViP3D: End-to-end Visual Trajectory Prediction via 3D Agent Queries
Computer Vision and Pattern Recognition (CVPR), 2022
Junru Gu
Chenxu Hu
Tian-Yi Zhang
Xuanyao Chen
Yilun Wang
Yue Wang
Hang Zhao
327
119
0
02 Aug 2022
Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
IEEE International Conference on Computer Vision (ICCV), 2022
Qiang Chen
Xiaokang Chen
Jian Wang
Shan Zhang
Kun Yao
Haocheng Feng
Junyu Han
Errui Ding
Gang Zeng
Jingdong Wang
ViT
304
195
0
26 Jul 2022
DETRs with Hybrid Matching
Computer Vision and Pattern Recognition (CVPR), 2022
Ding Jia
Yuhui Yuan
Hao He
Xiao-pei Wu
Haojun Yu
Weihong Lin
Lei-huan Sun
Chao Zhang
Hanhua Hu
452
258
0
26 Jul 2022
MV-FCOS3D++: Multi-View Camera-Only 4D Object Detection with Pretrained Monocular Backbones
Tai Wang
Qing Lian
Chenming Zhu
Xinge Zhu
Wenwei Zhang
3DPC
113
33
0
26 Jul 2022
UniFusion: Unified Multi-view Fusion Transformer for Spatial-Temporal Representation in Bird's-Eye-View
IEEE International Conference on Computer Vision (ICCV), 2022
Zequn Qin
Jingyu Chen
Chao Chen
Xiaozhi Chen
Xi Li
173
33
0
18 Jul 2022
Consistency of Implicit and Explicit Features Matters for Monocular 3D Object Detection
Qian Ye
L. Jiang
Wang Zhen
Yuyang Du
201
6
0
16 Jul 2022
ST-P3: End-to-end Vision-based Autonomous Driving via Spatial-Temporal Feature Learning
European Conference on Computer Vision (ECCV), 2022
Shengchao Hu
Li Chen
Peng Wu
Guoying Gu
Junchi Yan
Dacheng Tao
279
376
0
15 Jul 2022
Trans4Map: Revisiting Holistic Bird's-Eye-View Mapping from Egocentric Images to Allocentric Semantics with Vision Transformers
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Chang Chen
Kailai Li
Kailun Yang
Kunyu Peng
Rainer Stiefelhagen
ViT
154
8
0
13 Jul 2022
CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers
Conference on Robot Learning (CoRL), 2022
Runsheng Xu
Zhengzhong Tu
Hao Xiang
Wei Shao
Bolei Zhou
Jiaqi Ma
409
307
0
05 Jul 2022
Vision-based Uneven BEV Representation Learning with Polar Rasterization and Surface Estimation
Conference on Robot Learning (CoRL), 2022
Zhi Liu
Shaoyu Chen
Xiaojie Guo
Xinggang Wang
Tianheng Cheng
Hong Zhu
Qian Zhang
Wenyu Liu
Yi Zhang
MDE
127
34
0
05 Jul 2022
ORA3D: Overlap Region Aware Multi-view 3D Object Detection
British Machine Vision Conference (BMVC), 2022
Wonseok Roh
Gyusam Chang
Seokha Moon
Giljoo Nam
Chanyoung Kim
Younghyun Kim
Jinkyu Kim
Sangpil Kim
3DPC
238
14
0
02 Jul 2022
Masked Autoencoder for Self-Supervised Pre-training on Lidar Point Clouds
Georg Hess
Johan Jaxing
Elias Svensson
David Hagerman
Christoffer Petersson
Lennart Svensson
3DPC
ViT
283
52
0
01 Jul 2022