ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.17270
  4. Cited By
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera
  Images via Spatiotemporal Transformers
v1v2 (latest)

BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers

European Conference on Computer Vision (ECCV), 2022
31 March 2022
Zhiqi Li
Wenhai Wang
Hongyang Li
Enze Xie
Chonghao Sima
Tong Lu
Qiao Yu
Jifeng Dai
ArXiv (abs)PDFHTMLGithub (18★)

Papers citing "BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers"

50 / 973 papers shown
SuperFusion: Multilevel LiDAR-Camera Fusion for Long-Range HD Map
  Generation
SuperFusion: Multilevel LiDAR-Camera Fusion for Long-Range HD Map GenerationIEEE International Conference on Robotics and Automation (ICRA), 2022
Hao Dong
Xianjing Zhang
Jintao Xu
Rui Ai
Weihao Gu
Huimin Lu
Arno Solin
Xieyuanli Chen
295
55
0
28 Nov 2022
BEV-Locator: An End-to-end Visual Semantic Localization Network Using
  Multi-View Images
BEV-Locator: An End-to-end Visual Semantic Localization Network Using Multi-View ImagesScience China Information Sciences (Sci. China Inf. Sci.), 2022
Zhihuang Zhang
Mengze Xu
Wenqiang Zhou
T. Peng
L. Li
S. Poslad
208
31
0
27 Nov 2022
3DPPE: 3D Point Positional Encoding for Multi-Camera 3D Object Detection
  Transformers
3DPPE: 3D Point Positional Encoding for Multi-Camera 3D Object Detection Transformers
Changyong Shu
Jiajun Deng
Feng Yu
Yifan Liu
3DPC
280
14
0
27 Nov 2022
3D Dual-Fusion: Dual-Domain Dual-Query Camera-LiDAR Fusion for 3D Object
  Detection
3D Dual-Fusion: Dual-Domain Dual-Query Camera-LiDAR Fusion for 3D Object Detection
Yecheol Kim
Konyul Park
Minwook Kim
Dongsuk Kum
J. Choi
3DPC
228
29
0
24 Nov 2022
AeDet: Azimuth-invariant Multi-view 3D Object Detection
AeDet: Azimuth-invariant Multi-view 3D Object DetectionComputer Vision and Pattern Recognition (CVPR), 2022
Chengjian Feng
Zequn Jie
Yujie Zhong
Xiangxiang Chu
Lin Ma
3DPC
128
26
0
22 Nov 2022
Uncertainty-aware Vision-based Metric Cross-view Geolocalization
Uncertainty-aware Vision-based Metric Cross-view GeolocalizationComputer Vision and Pattern Recognition (CVPR), 2022
F. Fervers
Sebastian Bullinger
C. Bodensteiner
Michael Arens
Rainer Stiefelhagen
180
60
0
22 Nov 2022
MatrixVT: Efficient Multi-Camera to BEV Transformation for 3D Perception
MatrixVT: Efficient Multi-Camera to BEV Transformation for 3D PerceptionIEEE International Conference on Computer Vision (ICCV), 2022
Hongyu Zhou
Zheng Ge
Zeming Li
Xiangyu Zhang
155
55
0
19 Nov 2022
Sparse4D: Multi-view 3D Object Detection with Sparse Spatial-Temporal
  Fusion
Sparse4D: Multi-view 3D Object Detection with Sparse Spatial-Temporal Fusion
Xuewu Lin
Tianwei Lin
Zi-Hui Pei
Lichao Huang
Zhizhong Su
3DPC
308
154
0
19 Nov 2022
BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View
  Recognition via Perspective Supervision
BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective SupervisionComputer Vision and Pattern Recognition (CVPR), 2022
Chenyu Yang
Yuntao Chen
Haofei Tian
Chenxin Tao
Xizhou Zhu
...
Guoying Gu
Yu Qiao
Lewei Lu
Jie Zhou
Jifeng Dai
MDE
226
376
0
18 Nov 2022
BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object
  Detection
BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object DetectionInternational Conference on Learning Representations (ICLR), 2022
Zehui Chen
Zhenyu Li
Shiquan Zhang
Liangji Fang
Qinhong Jiang
Feng Zhao
302
80
0
17 Nov 2022
Monocular BEV Perception of Road Scenes via Front-to-Top View Projection
Monocular BEV Perception of Road Scenes via Front-to-Top View ProjectionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Wenxi Liu
Qi Li
Weixiang Yang
Jiaxin Cai
Yuanlong Yu
Yuexin Ma
Shengfeng He
Jianxiong Pan
162
4
0
15 Nov 2022
Structured Knowledge Distillation Towards Efficient and Compact
  Multi-View 3D Detection
Structured Knowledge Distillation Towards Efficient and Compact Multi-View 3D Detection
Linfeng Zhang
Yukang Shi
Hung-Shuo Tai
Zhipeng Zhang
Yuan He
Ke Wang
Kaisheng Ma
257
4
0
14 Nov 2022
Recursive Cross-View: Use Only 2D Detectors to Achieve 3D Object
  Detection without 3D Annotations
Recursive Cross-View: Use Only 2D Detectors to Achieve 3D Object Detection without 3D AnnotationsIEEE Robotics and Automation Letters (RA-L), 2022
Shun Gui
Yan Luximon
344
1
0
14 Nov 2022
Behavioral Intention Prediction in Driving Scenes: A Survey
Behavioral Intention Prediction in Driving Scenes: A Survey
Jianwu Fang
Fan Wang
Jianru Xue
Tat-Seng Chua
454
81
0
01 Nov 2022
Multi-Camera Calibration Free BEV Representation for 3D Object Detection
Multi-Camera Calibration Free BEV Representation for 3D Object Detection
Hongxiang Jiang
Wenming Meng
Hongmei Zhu
Qiaosheng Zhang
Jihao Yin
186
4
0
31 Oct 2022
PlanT: Explainable Planning Transformers via Object-Level
  Representations
PlanT: Explainable Planning Transformers via Object-Level RepresentationsConference on Robot Learning (CoRL), 2022
Katrin Renz
Kashyap Chitta
Otniel-Bogdan Mercea
A. Sophia Koepke
Zeynep Akata
Andreas Geiger
ViT
273
128
0
25 Oct 2022
CramNet: Camera-Radar Fusion with Ray-Constrained Cross-Attention for
  Robust 3D Object Detection
CramNet: Camera-Radar Fusion with Ray-Constrained Cross-Attention for Robust 3D Object DetectionEuropean Conference on Computer Vision (ECCV), 2022
Jyh-Jing Hwang
Henrik Kretzschmar
Joshua M. Manela
Sean M. Rafferty
N. Armstrong-Crews
Tiffany Chen
Drago Anguelov
3DPC
252
55
0
17 Oct 2022
Model-Based Imitation Learning for Urban Driving
Model-Based Imitation Learning for Urban DrivingNeural Information Processing Systems (NeurIPS), 2022
Anthony Hu
Gianluca Corrado
Nicolas Griffiths
Zak Murez
Corina Gurau
Hudson Yeo
Alex Kendall
R. Cipolla
Jamie Shotton
338
188
0
14 Oct 2022
X-Align: Cross-Modal Cross-View Alignment for Bird's-Eye-View
  Segmentation
X-Align: Cross-Modal Cross-View Alignment for Bird's-Eye-View SegmentationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Shubhankar Borse
Marvin Klingner
V. Kumar
H. Cai
Abdulaziz Almuzairee
S. Yogamani
Fatih Porikli
282
47
0
13 Oct 2022
Exploring Contextual Representation and Multi-Modality for End-to-End
  Autonomous Driving
Exploring Contextual Representation and Multi-Modality for End-to-End Autonomous DrivingEngineering applications of artificial intelligence (EAAI), 2022
Shoaib Azam
Farzeen Munir
Ville Kyrki
M. Jeon
Witold Pedrycz
216
4
0
13 Oct 2022
BEV-LaneDet: a Simple and Effective 3D Lane Detection Baseline
BEV-LaneDet: a Simple and Effective 3D Lane Detection Baseline
Ruihao Wang
Jianbang Qin
Kai Li
Yaochen Li
Dongping Cao
Jintao Xu
248
12
0
12 Oct 2022
Depth Is All You Need for Monocular 3D Detection
Depth Is All You Need for Monocular 3D DetectionIEEE International Conference on Robotics and Automation (ICRA), 2022
Dennis Park
Jie Li
Di Chen
Vitor Campagnolo Guizilini
Adrien Gaidon
3DPCMDE
219
9
0
05 Oct 2022
Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D
  Object Detection
Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object DetectionInternational Conference on Learning Representations (ICLR), 2022
Jeongseok Lee
Chenfeng Xu
Shijia Yang
Kurt Keutzer
Kris Kitani
Masayoshi Tomizuka
Wei Zhan
291
208
0
05 Oct 2022
CrossDTR: Cross-view and Depth-guided Transformers for 3D Object
  Detection
CrossDTR: Cross-view and Depth-guided Transformers for 3D Object DetectionIEEE International Conference on Robotics and Automation (ICRA), 2022
Ching-Yu Tseng
Yi-Rong Chen
Hsin-Ying Lee
Tsung-Han Wu
Wen-Chin Chen
Winston H. Hsu
ViT
271
16
0
27 Sep 2022
Center Feature Fusion: Selective Multi-Sensor Fusion of Center-based
  Objects
Center Feature Fusion: Selective Multi-Sensor Fusion of Center-based ObjectsIEEE International Conference on Robotics and Automation (ICRA), 2022
P. Jacobson
Yiyang Zhou
Wei Zhan
Masayoshi Tomizuka
Ming Wu
3DPC
173
14
0
26 Sep 2022
FusionRCNN: LiDAR-Camera Fusion for Two-stage 3D Object Detection
FusionRCNN: LiDAR-Camera Fusion for Two-stage 3D Object DetectionRemote Sensing (RS), 2022
Xinli Xu
Shaocong Dong
Lihe Ding
Jie Wang
Tingfa Xu
Jianan Li
3DPC
296
53
0
22 Sep 2022
BEVStereo: Enhancing Depth Estimation in Multi-view 3D Object Detection
  with Dynamic Temporal Stereo
BEVStereo: Enhancing Depth Estimation in Multi-view 3D Object Detection with Dynamic Temporal Stereo
Yinhao Li
Han Bao
Zheng Ge
Jinrong Yang
Jian‐Yuan Sun
Zeming Li
270
120
0
21 Sep 2022
GATraj: A Graph- and Attention-based Multi-Agent Trajectory Prediction
  Model
GATraj: A Graph- and Attention-based Multi-Agent Trajectory Prediction ModelIsprs Journal of Photogrammetry and Remote Sensing (JIPRS), 2022
Hao Cheng
Mengmeng Liu
Linyuan Chen
Hellward Broszio
Monika Sester
M. Yang
281
85
0
16 Sep 2022
CRAFT: Camera-Radar 3D Object Detection with Spatio-Contextual Fusion
  Transformer
CRAFT: Camera-Radar 3D Object Detection with Spatio-Contextual Fusion TransformerAAAI Conference on Artificial Intelligence (AAAI), 2022
Youngseok Kim
Sanmin Kim
Junwon Choi
Dongsuk Kum
383
118
0
14 Sep 2022
M$^2$-3DLaneNet: Exploring Multi-Modal 3D Lane Detection
M2^22-3DLaneNet: Exploring Multi-Modal 3D Lane Detection
Yueru Luo
Xu Yan
Chao Zheng
Chao Zheng
Shuqi Mei
Tang Kun
Shuguang Cui
Zhen Li
3DPC
169
11
0
13 Sep 2022
Delving into the Devils of Bird's-eye-view Perception: A Review,
  Evaluation and Recipe
Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and RecipeIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Hongyang Li
Chonghao Sima
Jifeng Dai
Wenhai Wang
Lewei Lu
...
Xiaosong Jia
Siqian Liu
Jianping Shi
Dahua Lin
Yu Qiao
321
189
0
12 Sep 2022
MapTR: Structured Modeling and Learning for Online Vectorized HD Map
  Construction
MapTR: Structured Modeling and Learning for Online Vectorized HD Map ConstructionInternational Conference on Learning Representations (ICLR), 2022
Bencheng Liao
Shaoyu Chen
Xinggang Wang
Tianheng Cheng
Qian Zhang
Wenyu Liu
Chang Huang
ViT
324
330
0
30 Aug 2022
DeepInteraction: 3D Object Detection via Modality Interaction
DeepInteraction: 3D Object Detection via Modality InteractionNeural Information Processing Systems (NeurIPS), 2022
Zeyu Yang
Jia-Qing Chen
Zhenwei Miao
Wei Li
Xiatian Zhu
Li Zhang
374
189
0
23 Aug 2022
Quality Matters: Embracing Quality Clues for Robust 3D Multi-Object
  Tracking
Quality Matters: Embracing Quality Clues for Robust 3D Multi-Object Tracking
Jinrong Yang
En Yu
Zeming Li
Xiaoping Li
Wenbing Tao
VOT
265
18
0
23 Aug 2022
STS: Surround-view Temporal Stereo for Multi-view 3D Detection
STS: Surround-view Temporal Stereo for Multi-view 3D Detection
Zengran Wang
Chen Min
Zheng Ge
Yinhao Li
Zeming Li
Hongyu Yang
Dihe Huang
MDE
177
69
0
22 Aug 2022
A Simple Baseline for Multi-Camera 3D Object Detection
A Simple Baseline for Multi-Camera 3D Object DetectionAAAI Conference on Artificial Intelligence (AAAI), 2022
Yunpeng Zhang
Wenzhao Zheng
Zhengbiao Zhu
Guan Huang
Jie Zhou
Jiwen Lu
3DPC
172
27
0
22 Aug 2022
Exploring Point-BEV Fusion for 3D Point Cloud Object Tracking with
  Transformer
Exploring Point-BEV Fusion for 3D Point Cloud Object Tracking with TransformerIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Zhi-Chun Luo
Changqing Zhou
Liang Pan
Gongjie Zhang
Ti Liu
Yueru Luo
Haiyu Zhao
Ziwei Liu
Shijian Lu
3DPC
210
25
0
10 Aug 2022
Vision-Centric BEV Perception: A Survey
Vision-Centric BEV Perception: A SurveyIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Yuexin Ma
Tai Wang
Xuyang Bai
Huitong Yang
Yuenan Hou
Yaming Wang
Yu Qiao
Ruigang Yang
Tianyi Zhou
Xinge Zhu
545
177
0
04 Aug 2022
ViP3D: End-to-end Visual Trajectory Prediction via 3D Agent Queries
ViP3D: End-to-end Visual Trajectory Prediction via 3D Agent QueriesComputer Vision and Pattern Recognition (CVPR), 2022
Junru Gu
Chenxu Hu
Tian-Yi Zhang
Xuanyao Chen
Yilun Wang
Yue Wang
Hang Zhao
327
119
0
02 Aug 2022
Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
Group DETR: Fast DETR Training with Group-Wise One-to-Many AssignmentIEEE International Conference on Computer Vision (ICCV), 2022
Qiang Chen
Xiaokang Chen
Jian Wang
Shan Zhang
Kun Yao
Haocheng Feng
Junyu Han
Errui Ding
Gang Zeng
Jingdong Wang
ViT
304
195
0
26 Jul 2022
DETRs with Hybrid Matching
DETRs with Hybrid MatchingComputer Vision and Pattern Recognition (CVPR), 2022
Ding Jia
Yuhui Yuan
Hao He
Xiao-pei Wu
Haojun Yu
Weihong Lin
Lei-huan Sun
Chao Zhang
Hanhua Hu
452
258
0
26 Jul 2022
MV-FCOS3D++: Multi-View Camera-Only 4D Object Detection with Pretrained
  Monocular Backbones
MV-FCOS3D++: Multi-View Camera-Only 4D Object Detection with Pretrained Monocular Backbones
Tai Wang
Qing Lian
Chenming Zhu
Xinge Zhu
Wenwei Zhang
3DPC
113
33
0
26 Jul 2022
UniFusion: Unified Multi-view Fusion Transformer for Spatial-Temporal
  Representation in Bird's-Eye-View
UniFusion: Unified Multi-view Fusion Transformer for Spatial-Temporal Representation in Bird's-Eye-ViewIEEE International Conference on Computer Vision (ICCV), 2022
Zequn Qin
Jingyu Chen
Chao Chen
Xiaozhi Chen
Xi Li
173
33
0
18 Jul 2022
Consistency of Implicit and Explicit Features Matters for Monocular 3D
  Object Detection
Consistency of Implicit and Explicit Features Matters for Monocular 3D Object Detection
Qian Ye
L. Jiang
Wang Zhen
Yuyang Du
201
6
0
16 Jul 2022
ST-P3: End-to-end Vision-based Autonomous Driving via Spatial-Temporal
  Feature Learning
ST-P3: End-to-end Vision-based Autonomous Driving via Spatial-Temporal Feature LearningEuropean Conference on Computer Vision (ECCV), 2022
Shengchao Hu
Li Chen
Peng Wu
Guoying Gu
Junchi Yan
Dacheng Tao
279
376
0
15 Jul 2022
Trans4Map: Revisiting Holistic Bird's-Eye-View Mapping from Egocentric
  Images to Allocentric Semantics with Vision Transformers
Trans4Map: Revisiting Holistic Bird's-Eye-View Mapping from Egocentric Images to Allocentric Semantics with Vision TransformersIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Chang Chen
Kailai Li
Kailun Yang
Kunyu Peng
Rainer Stiefelhagen
ViT
154
8
0
13 Jul 2022
CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse
  Transformers
CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse TransformersConference on Robot Learning (CoRL), 2022
Runsheng Xu
Zhengzhong Tu
Hao Xiang
Wei Shao
Bolei Zhou
Jiaqi Ma
409
307
0
05 Jul 2022
Vision-based Uneven BEV Representation Learning with Polar Rasterization
  and Surface Estimation
Vision-based Uneven BEV Representation Learning with Polar Rasterization and Surface EstimationConference on Robot Learning (CoRL), 2022
Zhi Liu
Shaoyu Chen
Xiaojie Guo
Xinggang Wang
Tianheng Cheng
Hong Zhu
Qian Zhang
Wenyu Liu
Yi Zhang
MDE
127
34
0
05 Jul 2022
ORA3D: Overlap Region Aware Multi-view 3D Object Detection
ORA3D: Overlap Region Aware Multi-view 3D Object DetectionBritish Machine Vision Conference (BMVC), 2022
Wonseok Roh
Gyusam Chang
Seokha Moon
Giljoo Nam
Chanyoung Kim
Younghyun Kim
Jinkyu Kim
Sangpil Kim
3DPC
238
14
0
02 Jul 2022
Masked Autoencoder for Self-Supervised Pre-training on Lidar Point
  Clouds
Masked Autoencoder for Self-Supervised Pre-training on Lidar Point Clouds
Georg Hess
Johan Jaxing
Elias Svensson
David Hagerman
Christoffer Petersson
Lennart Svensson
3DPCViT
283
52
0
01 Jul 2022
Previous
123...181920
Next