BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers

European Conference on Computer Vision (ECCV), 2022

31 March 2022

Papers citing "BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers"

50 / 973 papers shown

Vehicle Dynamics Embedded World Models for Autonomous Driving

150

02 Dec 2025

OpenBox: Annotate Any Bounding Boxes in 3D

121

01 Dec 2025

TGSFormer: Scalable Temporal Gaussian Splatting for Embodied Semantic Scene Completion

105

29 Nov 2025

SUPER-AD: Semantic Uncertainty-aware Planning for End-to-End Robust Autonomous Driving

144

28 Nov 2025

Map-World: Masked Action planning and Path-Integral World Model for Autonomous Driving

224

25 Nov 2025

Reasoning-VLA: A Fast and General Vision-Language-Action Reasoning Model for Autonomous Driving

171

25 Nov 2025

WPT: World-to-Policy Transfer via Online World Model Distillation

483

25 Nov 2025

DetAny4D: Detect Anything 4D Temporally in a Streaming RGB Video

144

24 Nov 2025

GuideFlow: Constraint-Guided Flow Matching for Planning in End-to-End Autonomous Driving

183

24 Nov 2025

Exploring Surround-View Fisheye Camera 3D Object Detection

199

24 Nov 2025

CubeletWorld: A New Abstraction for Scalable 3D Modeling

21 Nov 2025

Graph Query Networks for Object Detection with Automotive Radar

245

19 Nov 2025

Towards 3D Object-Centric Feature Learning for Semantic Scene Completion

260

17 Nov 2025

ExpertAD: Enhancing Autonomous Driving Systems with Mixture of Experts

210

13 Nov 2025

Twist and Compute: The Cost of Pose in 3D Generative Diffusion

150

11 Nov 2025

HD$^2$-SSC: High-Dimension High-Density Semantic Scene Completion for Autonomous Driving

Annual Conference of the IEEE Industrial Electronics Society (IECON), 2024

Zhiwen Yang

Yuxin Peng

234

11 Nov 2025

HENet++: Hybrid Encoding and Multi-task Learning for 3D Perception and End-to-end Autonomous Driving

139

10 Nov 2025

Polymap: generating high definition map based on rasterized polygons

Shiyu Gao

Hao Jiang

08 Nov 2025

Evaluating the Impact of Weather-Induced Sensor Occlusion on BEVFusion for 3D Object Detection

107

06 Nov 2025

UniLION: Towards Unified Autonomous Driving Model with Linear Group RNNs

125

03 Nov 2025

Embodied Cognition Augmented End2End Autonomous Driving

116

03 Nov 2025

136

31 Oct 2025

Token Is All You Need: Cognitive Planning through Belief-Intent Co-Evolution

Shiyao Sang

141

30 Oct 2025

WOD-E2E: Waymo Open Dataset for End-to-End Driving in Challenging Long-tail Scenarios

...

394

30 Oct 2025

World Simulation with Video Foundation Models for Physical AI

...

480

28 Oct 2025

SynAD: Enhancing Real-World End-to-End Autonomous Driving Models through Synthetic Data Integration

116

28 Oct 2025

DQ3D: Depth-guided Query for Transformer-Based 3D Object Detection in Traffic Scenarios

Ziyu Wang

Wenhao Li

Ji Wu

114

27 Oct 2025

DAMap: Distance-aware MapNet for High Quality HD Map Construction

135

26 Oct 2025

Enhanced Motion Forecasting with Plug-and-Play Multimodal Large Language Models

131

20 Oct 2025

Towards 3D Objectness Learning in an Open World

140

20 Oct 2025

Vision-Centric 4D Occupancy Forecasting and Planning via Implicit Residual World Models

413

19 Oct 2025

DriveGen3D: Boosting Feed-Forward Driving Scene Generation with Efficient Video Diffusion

...

186

17 Oct 2025

Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery

292

17 Oct 2025

FreqPDE: Rethinking Positional Depth Embedding for Multi-View 3D Object Detection Transformers

143

17 Oct 2025

MatchAttention: Matching the Relative Positions for High-Resolution Cross-View Matching

238

16 Oct 2025

Complementary Information Guided Occupancy Prediction via Multi-Level Representation Fusion

IEEE International Conference on Robotics and Automation (ICRA), 2025

191

15 Oct 2025

DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving

...

196

14 Oct 2025

CrossRay3D: Geometry and Distribution Guidance for Efficient Multimodal 3D Detection

...

219

14 Oct 2025

Bridging Perspectives: Foundation Model Guided BEV Maps for 3D Object Detection and Tracking

117

11 Oct 2025

CVD-STORM: Cross-View Video Diffusion with Spatial-Temporal Reconstruction Model for Autonomous Driving

206

09 Oct 2025

RayFusion: Ray Fusion Enhanced Collaborative Visual Perception

144

09 Oct 2025

Learning Global Representation from Queries for Vectorized HD Map Construction

122

08 Oct 2025

Progressive Gaussian Transformer with Anisotropy-aware Sampling for Open Vocabulary Occupancy Prediction

Chi Yan

Dan Xu

3DGS

207

06 Oct 2025

Flexible and Efficient Spatio-Temporal Transformer for Sequential Visual Place Recognition

130

05 Oct 2025

Sequence-Preserving Dual-FoV Defense for Traffic Sign and Light Recognition in Autonomous Vehicles

126

03 Oct 2025

FIN: Fast Inference Network for Map Segmentation

142

01 Oct 2025

EasyOcc: 3D Pseudo-Label Supervision for Fully Self-Supervised Semantic Occupancy Prediction Models

Seamie Hayes

Ganesh Sistu

Ciarán Eising

212

30 Sep 2025

DriveE2E: Closed-Loop Benchmark for End-to-End Autonomous Driving through Real-to-Simulation

127

28 Sep 2025

BEV-VLM: Trajectory Planning via Unified BEV Abstraction

118

27 Sep 2025

OmniScene: Attention-Augmented Multimodal 4D Scene Understanding for Autonomous Driving

193

24 Sep 2025