ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.17270
  4. Cited By
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera
  Images via Spatiotemporal Transformers
v1v2 (latest)

BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers

European Conference on Computer Vision (ECCV), 2022
31 March 2022
Zhiqi Li
Wenhai Wang
Hongyang Li
Enze Xie
Chonghao Sima
Tong Lu
Qiao Yu
Jifeng Dai
ArXiv (abs)PDFHTMLGithub (18★)

Papers citing "BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers"

50 / 973 papers shown
Seeing Beyond Views: Multi-View Driving Scene Video Generation with
  Holistic Attention
Seeing Beyond Views: Multi-View Driving Scene Video Generation with Holistic Attention
Hannan Lu
Xiaohe Wu
Shudong Wang
Xiameng Qin
Xinyu Zhang
Junyu Han
W. Zuo
Ji Tao
349
4
0
04 Dec 2024
Redundant Queries in DETR-Based 3D Detection Methods: Unnecessary and Prunable
Redundant Queries in DETR-Based 3D Detection Methods: Unnecessary and Prunable
Lizhen Xu
Zehao Wu
Wenzhao Qiu
Zehao Wu
Xiuxiu Bai
K. Mei
Jianru Xue
488
2
0
03 Dec 2024
OODFace: Benchmarking Robustness of Face Recognition under Common Corruptions and Appearance Variations
OODFace: Benchmarking Robustness of Face Recognition under Common Corruptions and Appearance Variations
Caixin Kang
Yubo Chen
Shouwei Ruan
Shiji Zhao
Ruochen Zhang
Jiayi Wang
Shan Fu
Xingxing Wei
CVBM
579
2
0
03 Dec 2024
Epipolar Attention Field Transformers for Bird's Eye View Semantic
  Segmentation
Epipolar Attention Field Transformers for Bird's Eye View Semantic SegmentationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Christian Witte
Jens Behley
Cyrill Stachniss
Marvin Raaijmakers
319
1
0
02 Dec 2024
HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for
  Autonomous Driving
HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving
Z. Wu
Jingcheng Ni
Xiaodong Wang
Yuxin Guo
Rui Chen
Lewei Lu
Jifeng Dai
Yuwen Xiong
363
17
0
02 Dec 2024
SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection
SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection
Philipp Wolters
Johannes Gilg
Torben Teepe
Fabian Herzog
Felix Fent
Gerhard Rigoll
388
5
0
29 Nov 2024
Improving Batch Normalization with TTA for Robust Object Detection in
  Self-Driving
Improving Batch Normalization with TTA for Robust Object Detection in Self-Driving
Dacheng Liao
Mengshi Qi
Liang Liu
Huadong Ma
294
0
0
28 Nov 2024
Visual SLAMMOT Considering Multiple Motion Models
Visual SLAMMOT Considering Multiple Motion Models
Peilin Tian
Hao Li
340
2
0
28 Nov 2024
FASIONAD : FAst and Slow FusION Thinking Systems for Human-Like
  Autonomous Driving with Adaptive Feedback
FASIONAD : FAst and Slow FusION Thinking Systems for Human-Like Autonomous Driving with Adaptive Feedback
Kangan Qian
Zhikun Ma
Yangfan He
Ziang Luo
Lewei He
...
Zheng Fu
Xinyu Jiao
Yunlong Wang
Ke Wang
Takafumi Matsumaru
AI4CE
243
8
0
27 Nov 2024
D$^2$-World: An Efficient World Model through Decoupled Dynamic Flow
D2^22-World: An Efficient World Model through Decoupled Dynamic Flow
Haiming Zhang
Xu Yan
Ying Xue
Zixuan Guo
Shuguang Cui
Hui Yuan
Bingbing Liu
252
0
0
26 Nov 2024
OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection
OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection
Zhongyu Xia
Jishuo Li
Zhiwei Lin
Xinhao Wang
Longji Xu
Ming-Hsuan Yang
VLM
425
8
0
26 Nov 2024
Monocular Lane Detection Based on Deep Learning: A Survey
Monocular Lane Detection Based on Deep Learning: A Survey
Xin He
Haiyun Guo
Kuan Zhu
Bingke Zhu
Xu Zhao
Jianwu Fang
Jinqiao Wang
707
13
0
25 Nov 2024
Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene Completion
Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene CompletionComputer Vision and Pattern Recognition (CVPR), 2024
Jongseong Bae
Junwoo Ha
Ha Young Kim
398
2
0
25 Nov 2024
Language Driven Occupancy Prediction
Language Driven Occupancy Prediction
Zhu Yu
Bowen Pang
Lizhe Liu
Runmin Zhang
Qihao Peng
Maochun Luo
Maochun Luo
Mingxia Chen
Si-Yuan Cao
Hui-Liang Shen
488
7
0
25 Nov 2024
Training an Open-Vocabulary Monocular 3D Object Detection Model without
  3D Data
Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data
Rui Huang
Henry Zheng
Yan Wang
Zhuofan Xia
Marco Pavone
Gao Huang
3DPCVLM
374
1
0
23 Nov 2024
DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving
DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous DrivingComputer Vision and Pattern Recognition (CVPR), 2024
Bencheng Liao
Shaoyu Chen
Haoran Yin
Bo Jiang
Cheng Wang
...
Xinbang Zhang
Xiangyu Li
Y. Zhang
Qian Zhang
Xinggang Wang
668
153
0
22 Nov 2024
Spatiotemporal Decoupling for Efficient Vision-Based Occupancy
  Forecasting
Spatiotemporal Decoupling for Efficient Vision-Based Occupancy ForecastingComputer Vision and Pattern Recognition (CVPR), 2024
Jingyi Xu
Xieyuanli Chen
Junyi Ma
Jiawei Huang
Jintao Xu
Yanjie Wang
Ling Pei
199
4
0
21 Nov 2024
A Resource Efficient Fusion Network for Object Detection in Bird's-Eye
  View using Camera and Raw Radar Data
A Resource Efficient Fusion Network for Object Detection in Bird's-Eye View using Camera and Raw Radar Data
Kavin Chandrasekaran
Sorin Grigorescu
Gijs Dubbelman
P. Jancura
292
1
0
20 Nov 2024
GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual
  Pre-training in Autonomous Driving
GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving
Shaoqing Xu
Fang Li
Shengyin Jiang
Ziying Song
Li Liu
Zhi-xin Yang
3DGSSSL
291
11
0
19 Nov 2024
Unveiling the Hidden: Online Vectorized HD Map Construction with Clip-Level Token Interaction and PropagationNeural Information Processing Systems (NeurIPS), 2024
Nayeon Kim
Hongje Seong
Daehyun Ji
Sujin Jang
184
8
0
17 Nov 2024
V2X-Radar: A Multi-modal Dataset with 4D Radar for Cooperative Perception
Lei Yang
Wei Wei
Jun Li
Chen Wang
Zhiying Song
...
Li-e Wang
Mo Zhou
Yang Shen
Kai Wu
Chen Lv
495
20
0
17 Nov 2024
EVT: Efficient View Transformation for Multi-Modal 3D Object Detection
Yongjin Lee
Hyeon-Mun Jeong
Yurim Jeon
Sanghyun Kim
538
2
0
16 Nov 2024
EMPERROR: A Flexible Generative Perception Error Model for Probing Self-Driving Planners
EMPERROR: A Flexible Generative Perception Error Model for Probing Self-Driving PlannersIEEE Robotics and Automation Letters (RA-L), 2024
Niklas Hanselmann
Simon Doll
Marius Cordts
Hendrik P. A. Lensch
Andreas Geiger
479
2
0
12 Nov 2024
Fast and Efficient Transformer-based Method for Bird's Eye View Instance
  Prediction
Fast and Efficient Transformer-based Method for Bird's Eye View Instance Prediction
Miguel Antunes-García
L. Bergasa
Santiago Montiel-Marín
R. Barea
Fabio Sánchez-García
Ángel Llamazares
249
1
0
11 Nov 2024
LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with
  Instance Representation
LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance RepresentationInternational Conference on 3D Vision (3DV), 2024
Weijie Ma
Jingwei Jiang
Yue Yang
Zhaoyu Chen
Hao Chen
313
3
0
09 Nov 2024
ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for
  Autonomous Driving
ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous DrivingNeural Information Processing Systems (NeurIPS), 2024
Tao Ma
Hongbin Zhou
Qiusheng Huang
Xuemeng Yang
Jianfei Guo
Bo Zhang
Min Dou
Yu Qiao
Ding Wang
Jiaming Song
215
5
0
08 Nov 2024
CFPNet: Improving Lightweight ToF Depth Completion via Cross-zone
  Feature Propagation
CFPNet: Improving Lightweight ToF Depth Completion via Cross-zone Feature PropagationInternational Conference on 3D Vision (3DV), 2024
Laiyan Ding
Hualie Jiang
Rui Xu
Rui Huang
579
4
0
07 Nov 2024
OccLoff: Learning Optimized Feature Fusion for 3D Occupancy Prediction
OccLoff: Learning Optimized Feature Fusion for 3D Occupancy PredictionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Ji Zhang
Yiran Ding
Zixin Liu
3DPC
342
5
0
06 Nov 2024
CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for
  3D Object Detection
CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object DetectionNeural Information Processing Systems (NeurIPS), 2024
Jisong Kim
Minjae Seong
Jun Won Choi
465
9
0
05 Nov 2024
Decoupling Fine Detail and Global Geometry for Compressed Depth Map Super-Resolution
Decoupling Fine Detail and Global Geometry for Compressed Depth Map Super-ResolutionComputer Vision and Pattern Recognition (CVPR), 2024
Huan Zheng
Wencheng Han
Jianbing Shen
459
6
0
05 Nov 2024
VQ-Map: Bird's-Eye-View Map Layout Estimation in Tokenized Discrete
  Space via Vector Quantization
VQ-Map: Bird's-Eye-View Map Layout Estimation in Tokenized Discrete Space via Vector QuantizationNeural Information Processing Systems (NeurIPS), 2024
Yiwei Zhang
Jin Gao
Fudong Ge
Guan Luo
Bing Li
Zheng Zhang
Haibin Ling
Weiming Hu
166
1
0
03 Nov 2024
HeightMapNet: Explicit Height Modeling for End-to-End HD Map Learning
HeightMapNet: Explicit Height Modeling for End-to-End HD Map LearningIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Wenzhao Qiu
Zehao Wu
Hao zhang
Jianwu Fang
Jianru Xue
177
3
0
03 Nov 2024
GAFusion: Adaptive Fusing LiDAR and Camera with Multiple Guidance for 3D
  Object Detection
GAFusion: Adaptive Fusing LiDAR and Camera with Multiple Guidance for 3D Object DetectionComputer Vision and Pattern Recognition (CVPR), 2024
Xiaotian Li
Baojie Fan
Jiandong Tian
Huijie Fan
3DPC
331
22
0
01 Nov 2024
Uncertainty Estimation for 3D Object Detection via Evidential Learning
Uncertainty Estimation for 3D Object Detection via Evidential Learning
Nikita Durasov
Rafid Mahmood
Jiwoong Choi
Marc T. Law
James Lucas
Pascal Fua
Jose M. Alvarez
UQCVEDL3DPC
338
5
0
31 Oct 2024
EMMA: End-to-End Multimodal Model for Autonomous Driving
EMMA: End-to-End Multimodal Model for Autonomous Driving
Jyh-Jing Hwang
Runsheng Xu
Hubert Lin
Wei-Chih Hung
Jingwei Ji
...
Benjamin Sapp
Yin Zhou
James Guo
Dragomir Anguelov
Mingxing Tan
VLMLM&Ro
433
117
0
30 Oct 2024
Unified Domain Generalization and Adaptation for Multi-View 3D Object
  Detection
Unified Domain Generalization and Adaptation for Multi-View 3D Object DetectionNeural Information Processing Systems (NeurIPS), 2024
Gyusam Chang
Jiwon Lee
Donghyun Kim
Jinkyu Kim
Dongwook Lee
Daehyun Ji
Sujin Jang
Sangpil Kim
396
9
0
29 Oct 2024
Senna: Bridging Large Vision-Language Models and End-to-End Autonomous
  Driving
Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving
Bo Jiang
Shaoyu Chen
Bencheng Liao
Xingyu Zhang
Wei Yin
Qian Zhang
Chang Huang
Wen Liu
Xinyu Wang
VLMMLLMLRM
311
78
0
29 Oct 2024
BEVPose: Unveiling Scene Semantics through Pose-Guided Multi-Modal BEV
  Alignment
BEVPose: Unveiling Scene Semantics through Pose-Guided Multi-Modal BEV AlignmentIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024
M. Hosseinzadeh
Ian Reid
235
2
0
28 Oct 2024
UniDrive: Towards Universal Driving Perception Across Camera Configurations
UniDrive: Towards Universal Driving Perception Across Camera Configurations
Ye Li
Wenzhao Zheng
Xiaonan Huang
Kurt Keutzer
427
1
0
17 Oct 2024
Real-time Stereo-based 3D Object Detection for Streaming Perception
Real-time Stereo-based 3D Object Detection for Streaming PerceptionNeural Information Processing Systems (NeurIPS), 2024
Changcai Li
Zonghua Gu
Gang Chen
Libo Huang
Wei Zhang
Huihui Zhou
3DPC
214
2
0
16 Oct 2024
MambaBEV: An efficient 3D detection model with Mamba2
MambaBEV: An efficient 3D detection model with Mamba2
Zihan You
Hao Wang
Qichao Zhao
Jinxiang Wang
Jinxiang Wang
Mamba
263
4
0
16 Oct 2024
TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal
  Enhancement
TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal EnhancementEuropean Conference on Artificial Intelligence (ECAI), 2024
Zhiwei Lin
Hongbo Jin
Yongtao Wang
Yufei Wei
Nan Dong
284
5
0
15 Oct 2024
big.LITTLE Vision Transformer for Efficient Visual Recognition
big.LITTLE Vision Transformer for Efficient Visual Recognition
He Guo
Yulong Wang
Zixuan Ye
Jifeng Dai
Yuwen Xiong
ViT
262
1
0
14 Oct 2024
ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object Detection
ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object Detection
Jiwei Chen
Laiyan Ding
Chi Zhang
Feifei Li
254
1
0
14 Oct 2024
ET-Former: Efficient Triplane Deformable Attention for 3D Semantic Scene Completion From Monocular Camera
ET-Former: Efficient Triplane Deformable Attention for 3D Semantic Scene Completion From Monocular Camera
Jing Liang
He Yin
Xuewei Qi
Jong Jin Park
Min Sun
R. Madhivanan
Dinesh Manocha
3DPC
361
0
0
14 Oct 2024
SPA: 3D Spatial-Awareness Enables Effective Embodied Representation
SPA: 3D Spatial-Awareness Enables Effective Embodied RepresentationInternational Conference on Learning Representations (ICLR), 2024
Haoyi Zhu
Honghui Yang
Yating Wang
Jiange Yang
Limin Wang
Tong He
3DH
384
24
0
10 Oct 2024
Progressive Multi-Modal Fusion for Robust 3D Object Detection
Progressive Multi-Modal Fusion for Robust 3D Object DetectionConference on Robot Learning (CoRL), 2024
Rohit Mohan
Daniele Cattaneo
Florian Drews
Abhinav Valada
3DPC
328
9
0
09 Oct 2024
QuadBEV: An Efficient Quadruple-Task Perception Framework via
  Bird's-Eye-View Representation
QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird's-Eye-View Representation
Yuxin Li
Yiheng Li
Xulei Yang
Mengying Yu
Zihang Huang
Xiaojun Wu
Chai Kiat Yeo
160
0
0
09 Oct 2024
Learning Content-Aware Multi-Modal Joint Input Pruning via
  Bird's-Eye-View Representation
Learning Content-Aware Multi-Modal Joint Input Pruning via Bird's-Eye-View Representation
Yuxin Li
Yiheng Li
Xulei Yang
Mengying Yu
Zihang Huang
Xiaojun Wu
Chai Kiat Yeo
191
1
0
09 Oct 2024
BEVLoc: Cross-View Localization and Matching via Birds-Eye-View
  Synthesis
BEVLoc: Cross-View Localization and Matching via Birds-Eye-View SynthesisIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024
Christopher Klammer
Michael Kaess
221
2
0
08 Oct 2024
Previous
123...567...181920
Next