ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.17270
  4. Cited By
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera
  Images via Spatiotemporal Transformers
v1v2 (latest)

BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers

European Conference on Computer Vision (ECCV), 2022
31 March 2022
Zhiqi Li
Wenhai Wang
Hongyang Li
Enze Xie
Chonghao Sima
Tong Lu
Qiao Yu
Jifeng Dai
ArXiv (abs)PDFHTMLGithub (18★)

Papers citing "BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers"

50 / 974 papers shown
MAESTRO: Task-Relevant Optimization via Adaptive Feature Enhancement and Suppression for Multi-task 3D Perception
MAESTRO: Task-Relevant Optimization via Adaptive Feature Enhancement and Suppression for Multi-task 3D Perception
C. Kang
Jisong Kim
Hongjae Shin
Junseo Park
J. Choi
127
0
0
22 Sep 2025
RCTDistill: Cross-Modal Knowledge Distillation Framework for Radar-Camera 3D Object Detection with Temporal Fusion
RCTDistill: Cross-Modal Knowledge Distillation Framework for Radar-Camera 3D Object Detection with Temporal Fusion
Geonho Bang
Minjae Seong
Jisong Kim
Geunju Baek
Daye Oh
Junhyung Kim
Junho Koh
Jun-Won Choi
176
0
0
22 Sep 2025
TinyBEV: Cross Modal Knowledge Distillation for Efficient Multi Task Bird's Eye View Perception and Planning
TinyBEV: Cross Modal Knowledge Distillation for Efficient Multi Task Bird's Eye View Perception and Planning
Reeshad Khan
John Gauch
185
0
0
22 Sep 2025
ST-GS: Vision-Based 3D Semantic Occupancy Prediction with Spatial-Temporal Gaussian Splatting
ST-GS: Vision-Based 3D Semantic Occupancy Prediction with Spatial-Temporal Gaussian Splatting
Xiaoyang Yan
Muleilan Pei
Shaojie Shen
3DGS
106
2
0
20 Sep 2025
SQS: Enhancing Sparse Perception Models via Query-based Splatting in Autonomous Driving
SQS: Enhancing Sparse Perception Models via Query-based Splatting in Autonomous Driving
Haiming Zhang
Yiyao Zhu
Wending Zhou
Xu Yan
Yingjie Cai
Bingbing Liu
Shuguang Cui
Zhen Li
160
1
0
20 Sep 2025
PAN: Pillars-Attention-Based Network for 3D Object Detection
PAN: Pillars-Attention-Based Network for 3D Object Detection
Ruan Bispo
Dane Mitrev
Letizia Mariotti
Clément Botty
Denver Humphrey
Anthony G. Scanlan
Ciarán Eising
3DPC
188
1
0
19 Sep 2025
RaFD: Flow-Guided Radar Detection for Robust Autonomous Driving
RaFD: Flow-Guided Radar Detection for Robust Autonomous Driving
Shuocheng Yang
Zikun Xu
Jiahao Wang
Shahid Nawaz
Jianqiang Wang
Shaobing Xu
121
0
0
18 Sep 2025
BEVUDA++: Geometric-aware Unsupervised Domain Adaptation for Multi-View 3D Object Detection
BEVUDA++: Geometric-aware Unsupervised Domain Adaptation for Multi-View 3D Object Detection
Rongyu Zhang
Jiaming Liu
Xiaoqi Li
Xiaowei Chi
Dan Wang
Li Du
Yuan Du
Shanghang Zhang
136
1
0
17 Sep 2025
FishBEV: Distortion-Resilient Bird's Eye View Segmentation with Surround-View Fisheye Cameras
FishBEV: Distortion-Resilient Bird's Eye View Segmentation with Surround-View Fisheye Cameras
Hang Li
Dianmo Sheng
Qiankun Dong
Z. Wang
Zhiwei Xu
Tao Li
139
0
0
17 Sep 2025
Maps for Autonomous Driving: Full-process Survey and Frontiers
Maps for Autonomous Driving: Full-process Survey and Frontiers
Pengxin Chen
Zhipeng Luo
Xiaoqi Jiang
Zhangcai Yin
Jonathan Li
139
0
0
16 Sep 2025
SPHERE: Semantic-PHysical Engaged REpresentation for 3D Semantic Scene Completion
SPHERE: Semantic-PHysical Engaged REpresentation for 3D Semantic Scene Completion
Zhiwen Yang
Yuxin Peng
3DGS
204
2
0
14 Sep 2025
CaR1: A Multi-Modal Baseline for BEV Vehicle Segmentation via Camera-Radar Fusion
CaR1: A Multi-Modal Baseline for BEV Vehicle Segmentation via Camera-Radar Fusion
Santiago Montiel-Marín
Ángel Llamazares
Miguel Antunes-García
Fabio Sánchez-García
L. Bergasa
147
0
0
12 Sep 2025
Towards Confidential and Efficient LLM Inference with Dual Privacy Protection
Towards Confidential and Efficient LLM Inference with Dual Privacy Protection
Honglan Yu
Yibin Wang
Feifei Dai
Dong Liu
Haihui Fan
Xiaoyan Gu
80
0
0
11 Sep 2025
Semantic Causality-Aware Vision-Based 3D Occupancy Prediction
Semantic Causality-Aware Vision-Based 3D Occupancy Prediction
Dubing Chen
Huan Zheng
Yucheng Zhou
Xianfei Li
Wenlong Liao
Tao He
Pai Peng
Jianbing Shen
3DPC
129
3
0
10 Sep 2025
InsFusion: Rethink Instance-level LiDAR-Camera Fusion for 3D Object Detection
InsFusion: Rethink Instance-level LiDAR-Camera Fusion for 3D Object Detection
Zhongyu Xia
Hansong Yang
Yongtao Wang
3DPC
183
0
0
10 Sep 2025
Asymmetry Vulnerability and Physical Attacks on Online Map Construction for Autonomous Driving
Asymmetry Vulnerability and Physical Attacks on Online Map Construction for Autonomous Driving
Yang Lou
Haibo Hu
Qun Song
Qian Xu
Yi Zhu
Rui Tan
Wei-Bin Lee
Jianping Wang
AAML
132
0
0
07 Sep 2025
CRAB: Camera-Radar Fusion for Reducing Depth Ambiguity in Backward Projection based View Transformation
CRAB: Camera-Radar Fusion for Reducing Depth Ambiguity in Backward Projection based View TransformationIEEE International Conference on Robotics and Automation (ICRA), 2025
In-Jae Lee
Sihwan Hwang
Youngseok Kim
Wonjune Kim
Sanmin Kim
Dongsuk Kum
136
1
0
06 Sep 2025
Vehicle-to-Infrastructure Collaborative Spatial Perception via Multimodal Large Language Models
Vehicle-to-Infrastructure Collaborative Spatial Perception via Multimodal Large Language Models
Kimia Ehsani
Walid Saad
118
0
0
04 Sep 2025
SliceSemOcc: Vertical Slice Based Multimodal 3D Semantic Occupancy Representation
SliceSemOcc: Vertical Slice Based Multimodal 3D Semantic Occupancy Representation
Han Huang
Han Sun
Ningzhong Liu
Huiyu Zhou
Jiaquan Shen
115
0
0
04 Sep 2025
Mapping like a Skeptic: Probabilistic BEV Projection for Online HD Mapping
Mapping like a Skeptic: Probabilistic BEV Projection for Online HD Mapping
Fatih Erdoğan
Merve Rabia Barın
Fatma Guney
113
0
0
29 Aug 2025
SKGE-SWIN: End-To-End Autonomous Vehicle Waypoint Prediction and Navigation Using Skip Stage Swin Transformer
SKGE-SWIN: End-To-End Autonomous Vehicle Waypoint Prediction and Navigation Using Skip Stage Swin Transformer
Fachri Najm Noer Kartiman
Rasim
Yaya Wihardi
Nurul Hasanah
Oskar Natan
Bambang Wahono
Taufik Ibnu Salim
ViT
83
1
0
28 Aug 2025
OpenM3D: Open Vocabulary Multi-view Indoor 3D Object Detection without Human Annotations
OpenM3D: Open Vocabulary Multi-view Indoor 3D Object Detection without Human Annotations
Peng-Hao Hsu
Ke Zhang
Fu-En Wang
Tao Tu
Ming-feng Li
Yu-Lun Liu
Albert Y. C. Chen
Min Sun
Cheng-Hao Kuo
3DPCVLM
111
5
0
27 Aug 2025
PseudoMapTrainer: Learning Online Mapping without HD Maps
PseudoMapTrainer: Learning Online Mapping without HD Maps
Christian Lowens
Thorben Funke
Jingchao Xie
Alexandru Paul Condurache
108
2
0
26 Aug 2025
SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather
SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse WeatherEuropean Conference on Computer Vision (ECCV), 2025
Edoardo Palladin
Roland Dietze
Praveen Narayanan
Mario Bijelic
Felix Heide
202
12
0
22 Aug 2025
RCDINO: Enhancing Radar-Camera 3D Object Detection with DINOv2 Semantic Features
RCDINO: Enhancing Radar-Camera 3D Object Detection with DINOv2 Semantic Features
Olga Matykina
Dmitry Yudin
114
0
0
21 Aug 2025
RATopo: Improving Lane Topology Reasoning via Redundancy Assignment
RATopo: Improving Lane Topology Reasoning via Redundancy Assignment
Han Li
Shaofei Huang
Longfei Xu
Yulu Gao
Beipeng Mu
Si Liu
91
0
0
21 Aug 2025
Adversarial Generation and Collaborative Evolution of Safety-Critical Scenarios for Autonomous Vehicles
Adversarial Generation and Collaborative Evolution of Safety-Critical Scenarios for Autonomous Vehicles
Jiangfan Liu
Yongkang Guo
Fangzhi Zhong
Tianyuan Zhang
Zonglei Jing
Yaning Tan
Jinyang Guo
Mingchuan Zhang
Aishan Liu
Xianglong Liu
AAML
184
1
0
20 Aug 2025
MoVieDrive: Multi-Modal Multi-View Urban Scene Video Generation
MoVieDrive: Multi-Modal Multi-View Urban Scene Video Generation
Guile Wu
David Huang
Dongfeng Bai
Bingbing Liu
VGen
137
0
0
20 Aug 2025
Self-Supervised Sparse Sensor Fusion for Long Range Perception
Self-Supervised Sparse Sensor Fusion for Long Range Perception
Edoardo Palladin
Samuel Brucker
Filippo Ghilotti
Praveen Narayanan
Mario Bijelic
Felix Heide
SSL
150
1
0
19 Aug 2025
Refine-and-Contrast: Adaptive Instance-Aware BEV Representations for Multi-UAV Collaborative Object Detection
Refine-and-Contrast: Adaptive Instance-Aware BEV Representations for Multi-UAV Collaborative Object Detection
Zhongyao Li
Peirui Cheng
Liangjin Zhao
Chen Chen
Yundu Li
Zhechao Wang
Xue Yang
Xian Sun
Zhirui Wang
102
1
0
18 Aug 2025
Neural Rendering for Sensor Adaptation in 3D Object Detection
Neural Rendering for Sensor Adaptation in 3D Object Detection
Felix Embacher
David Holtz
J. Uhrig
Marius Cordts
Markus Enzweiler
3DPC
144
0
0
18 Aug 2025
CMF-IoU: Multi-Stage Cross-Modal Fusion 3D Object Detection with IoU Joint Prediction
CMF-IoU: Multi-Stage Cross-Modal Fusion 3D Object Detection with IoU Joint Prediction
Zhiwei Ning
Zhaojiang Liu
Xuanang Gao
Yifan Zuo
Jie Yang
Yuming Fang
Wei Liu
3DPC
109
1
0
18 Aug 2025
An Initial Study of Bird's-Eye View Generation for Autonomous Vehicles using Cross-View Transformers
An Initial Study of Bird's-Eye View Generation for Autonomous Vehicles using Cross-View Transformers
Felipe Carlos dos Santos
Eric A. Antonelo
Gustavo Claudio Karl Couto
94
0
0
17 Aug 2025
OmniD: Generalizable Robot Manipulation Policy via Image-Based BEV Representation
OmniD: Generalizable Robot Manipulation Policy via Image-Based BEV Representation
Jilei Mao
Jiarui Guan
Yingjuan Tang
Qirui Hu
Zhihang Li
Junjie Yu
Yongjie Mao
Yunzhe Sun
Shuang Liu
Xiaozhu Ju
105
2
0
16 Aug 2025
CHARM3R: Towards Unseen Camera Height Robust Monocular 3D Detector
CHARM3R: Towards Unseen Camera Height Robust Monocular 3D Detector
Abhinav Kumar
Yuliang Guo
Zhihao Zhang
Xinyu Huang
Liu Ren
Xiaoming Liu
MDE
214
0
0
15 Aug 2025
CBDES MoE: Hierarchically Decoupled Mixture-of-Experts for Functional Modules in Autonomous Driving
CBDES MoE: Hierarchically Decoupled Mixture-of-Experts for Functional Modules in Autonomous Driving
Qi Xiang
Kunsong Shi
Zhigui Lin
Lei He
MoE
142
2
0
11 Aug 2025
Understanding Dynamic Scenes in Ego Centric 4D Point Clouds
Understanding Dynamic Scenes in Ego Centric 4D Point Clouds
Junsheng Huang
Shengyu Hao
Bocheng Hu
Hongwei Wang
Gaoang Wang
244
2
0
10 Aug 2025
ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting
ForeSight: Multi-View Streaming Joint Object Detection and Trajectory Forecasting
Sandro Papais
Letian Wang
Brian Cheong
Steven Waslander
156
0
0
09 Aug 2025
CLIPVehicle: A Unified Framework for Vision-based Vehicle Search
CLIPVehicle: A Unified Framework for Vision-based Vehicle Search
Likai Wang
Ruize Han
Xiangqun Zhang
Wei Feng
145
0
0
06 Aug 2025
Efficient Inter-Task Attention for Multitask Transformer Models
Efficient Inter-Task Attention for Multitask Transformer Models
Christian Bohn
Thomas Kurbiel
Klaus Friedrichs
Hasan Tercan
Tobias Meisen
150
0
0
06 Aug 2025
Occupancy Learning with Spatiotemporal Memory
Occupancy Learning with Spatiotemporal Memory
Ziyang Leng
Jiawei Yang
Wenlong Yi
Bolei Zhou
171
4
0
06 Aug 2025
BEVCon: Advancing Bird's Eye View Perception with Contrastive Learning
BEVCon: Advancing Bird's Eye View Perception with Contrastive LearningIEEE Robotics and Automation Letters (IEEE RA-L), 2025
Ziyang Leng
Jiawei Yang
Zhicheng Ren
Bolei Zhou
SSL
139
0
0
06 Aug 2025
mmWave Radar-Based Non-Line-of-Sight Pedestrian Localization at T-Junctions Utilizing Road Layout Extraction via Camera
mmWave Radar-Based Non-Line-of-Sight Pedestrian Localization at T-Junctions Utilizing Road Layout Extraction via Camera
Byeonggyu Park
Hee-Yeun Kim
Byonghyok Choi
Hansang Cho
Byungkwan Kim
Soomok Lee
Mingu Jeon
Seong-Woo Kim
114
0
0
04 Aug 2025
A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding
A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding
Zhan Shi
Song Wang
Junbo Chen
Jianke Zhu
274
0
0
02 Aug 2025
CoST: Efficient Collaborative Perception From Unified Spatiotemporal Perspective
CoST: Efficient Collaborative Perception From Unified Spatiotemporal Perspective
Zongheng Tang
Yi Liu
Yifan Sun
Yulu Gao
Jinyu Chen
Runsheng Xu
Si Liu
184
1
0
01 Aug 2025
3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection
3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection
Yung-Hsu Yang
Luigi Piccinelli
Mattia Segu
Siyuan Li
Rui Huang
Yuqian Fu
Marc Pollefeys
Hermann Blum
Z. Bauer
3DPC
256
3
0
31 Jul 2025
FASTopoWM: Fast-Slow Lane Segment Topology Reasoning with Latent World Models
FASTopoWM: Fast-Slow Lane Segment Topology Reasoning with Latent World Models
Yiming Yang
Hongbin Lin
Yueru Luo
Suzhong Fu
C. Zheng
Xinrui Yan
Shuqi Mei
Kun Tang
Shuguang Cui
Zhen Li
LRM
373
2
0
31 Jul 2025
MapDiffusion: Generative Diffusion for Vectorized Online HD Map Construction and Uncertainty Estimation in Autonomous Driving
MapDiffusion: Generative Diffusion for Vectorized Online HD Map Construction and Uncertainty Estimation in Autonomous Driving
T. Monninger
Zihan Zhang
Zhipeng Mo
Md Zafar Anwar
Steffen Staab
Sihao Ding
174
7
0
29 Jul 2025
GTAD: Global Temporal Aggregation Denoising Learning for 3D Semantic Occupancy Prediction
GTAD: Global Temporal Aggregation Denoising Learning for 3D Semantic Occupancy Prediction
Tianhao Li
Yang Li
Mengtian Li
Y. Deng
Weifeng Ge
122
0
0
28 Jul 2025
Collaborative Perceiver: Elevating Vision-based 3D Object Detection via Local Density-Aware Spatial Occupancy
Collaborative Perceiver: Elevating Vision-based 3D Object Detection via Local Density-Aware Spatial Occupancy
Jicheng Yuan
M. Duc
Qian Liu
M. Hauswirth
Danh Le-Phuoc
334
0
0
28 Jul 2025
Previous
12345...181920
Next
Page 2 of 20
Pageof 20