BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers

European Conference on Computer Vision (ECCV), 2022
31 March 2022
Zhiqi Li
Wenhai Wang
Hongyang Li
Enze Xie
Chonghao Sima
Tong Lu
Qiao Yu
Jifeng Dai

Papers citing "BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers"

50 / 969 papers shown
Reasoning-VLA: A Fast and General Vision-Language-Action Reasoning Model for Autonomous Driving
Dapeng Zhang
Zhenlong Yuan
Zhangquan Chen
Chih-Ting Liao
Yinda Chen
Fei Shen
Qingguo Zhou
Tat-Seng Chua
LRM
94
0
0
25 Nov 2025
Map-World: Masked Action planning and Path-Integral World Model for Autonomous Driving
Bin Hu
Zijian Lu
Haicheng Liao
Chengran Yuan
Bin Rao
Yongkang Li
Guofa Li
Zhiyong Cui
C. Xu
Zhenning Li
100
0
0
25 Nov 2025
WPT: World-to-Policy Transfer via Online World Model Distillation
Guangfeng Jiang
Yueru Luo
Jun Liu
Y. Huang
Yiyao Zhu
Z. Qu
Dave Zhenyu Chen
B. Liu
Xu Yan
OffRL, OnRL
349
0
0
25 Nov 2025
Exploring Surround-View Fisheye Camera 3D Object Detection
Changcai Li
Wenwei Lin
Zuoxun Hou
Gang Chen
Wei Zhang
Huihui Zhou
Weishi Zheng
3DPC
146
0
0
24 Nov 2025
DetAny4D: Detect Anything 4D Temporally in a Streaming RGB Video
Jiawei Hou
Shenghao Zhang
Can Wang
Zheng Gu
Yonggen Ling
Taiping Zeng
Xiangyang Xue
Jingbo Zhang
3DPC
97
0
0
24 Nov 2025
GuideFlow: Constraint-Guided Flow Matching for Planning in End-to-End Autonomous Driving
Lin Liu
Caiyan Jia
G. Yu
Ziying Song
Junqiao Li
Feiyang Jia
Peiliang Wu
Xiaoshuai Hao
Yandan Luo
52
1
0
24 Nov 2025
CubeletWorld: A New Abstraction for Scalable 3D Modeling
Azlaan Mustafa Samad
Hoang H. Nguyen
Lukas Berg
Henrik Müller
Yuan Xue
Daniel Kudenko
Zahra Ahmadi
20
0
0
21 Nov 2025
Graph Query Networks for Object Detection with Automotive Radar
Loveneet Saini
Hasan Tercan
Tobias Meisen
GNN
147
0
0
19 Nov 2025
Towards 3D Object-Centric Feature Learning for Semantic Scene Completion
Weihua Wang
Yubo Cui
Xiangru Lin
Z. Li
Zheng Fang
3DPC
151
0
0
17 Nov 2025
ExpertAD: Enhancing Autonomous Driving Systems with Mixture of Experts
Haowen Jiang
Xinyu Huang
You Lu
Dingji Wang
Yuheng Cao
Chaofeng Sha
Bihuan Chen
Keyu Chen
Xin Peng
MoE
153
0
0
13 Nov 2025
Twist and Compute: The Cost of Pose in 3D Generative Diffusion
Kyle Fogarty
Jack Foster
Boqiao Zhang
Jing Yang
Cengiz Öztireli
DiffM
111
0
0
11 Nov 2025
HD$^2$-SSC: High-Dimension High-Density Semantic Scene Completion for Autonomous Driving
Annual Conference of the IEEE Industrial Electronics Society (IECON), 2024
Zhiwen Yang
Yuxin Peng
160
0
0
11 Nov 2025
HENet++: Hybrid Encoding and Multi-task Learning for 3D Perception and End-to-end Autonomous Driving
Zhongyu Xia
Zhiwei Lin
Yongtao Wang
Ming-Hsuan Yang
100
0
0
10 Nov 2025
Polymap: generating high definition map based on rasterized polygons
Shiyu Gao
Hao Jiang
44
0
0
08 Nov 2025
Evaluating the Impact of Weather-Induced Sensor Occlusion on BEVFusion for 3D Object Detection
Sanjay Kumar
Tim Brophy
E. Grua
Ganesh Sistu
Valentina Donzella
Ciarán Eising
64
1
0
06 Nov 2025
UniLION: Towards Unified Autonomous Driving Model with Linear Group RNNs
Zhe Liu
Jinghua Hou
Xiaoqing Ye
Jingdong Wang
Hengshuang Zhao
X. Bai
77
0
0
03 Nov 2025
Embodied Cognition Augmented End2End Autonomous Driving
Ling Niu
Xiaoji Zheng
Han Wang
Chen Zheng
Ziyuan Yang
Bokui Chen
Jiangtao Gong
56
0
0
03 Nov 2025
MLPerf Automotive
Radoyeh Shojaei
Predrag Djurdjevic
Mostafa El-Khamy
James Goel
Kasper Mecklenburg
John Owens
Pınar Muyan-Özçelik
T. S. John
Jinho Suh
Arjun Suresh
VLM
74
0
0
31 Oct 2025
WOD-E2E: Waymo Open Dataset for End-to-End Driving in Challenging Long-tail Scenarios
Runsheng Xu
Hubert Lin
Wonseok Jeon
Hao Feng
Yuliang Zou
...
Brandyn White
Ben Sapp
Mingxing Tan
Jyh-Jing Hwang
Drago Anguelov
203
8
0
30 Oct 2025
Token Is All You Need: Cognitive Planning through Belief-Intent Co-Evolution
Shiyao Sang
51
0
0
30 Oct 2025
World Simulation with Video Foundation Models for Physical AI
Nvidia
A. M. Ali
Junjie Bai
Maciej Bala
Yogesh Balaji
...
Jing Zhang
Qinsheng Zhang
Kaiwen Zheng
Andrew Zhu
Yuke Zhu
VGen, PINN
299
9
0
28 Oct 2025
SynAD: Enhancing Real-World End-to-End Autonomous Driving Models through Synthetic Data Integration
Jongsuk Kim
Jaeyoung Lee
Gyojin Han
Dongjae Lee
Minki Jeong
Junmo Kim
59
0
0
28 Oct 2025
DQ3D: Depth-guided Query for Transformer-Based 3D Object Detection in Traffic Scenarios
Ziyu Wang
Wenhao Li
Ji Wu
64
0
0
27 Oct 2025
DAMap: Distance-aware MapNet for High Quality HD Map Construction
Jinpeng Dong
Chen Li
Yutong Lin
Jingwen Fu
Sanping Zhou
N. Zheng
81
0
0
26 Oct 2025
Towards 3D Objectness Learning in an Open World
Taichi Liu
Zhenyu Wang
Ruofeng Liu
Guang Wang
Desheng Zhang
3DPC, VLM
81
0
0
20 Oct 2025
Enhanced Motion Forecasting with Plug-and-Play Multimodal Large Language Models
Katie Luo
Jingwei Ji
Tong He
Runsheng Xu
Yichen Xie
Dragomir Anguelov
Mingxing Tan
84
0
0
20 Oct 2025
Vision-Centric 4D Occupancy Forecasting and Planning via Implicit Residual World Models
Jianbiao Mei
Yu Yang
Xuemeng Yang
Licheng Wen
Jiajun Lv
Botian Shi
Y. Liu
VGen
263
1
0
19 Oct 2025
FreqPDE: Rethinking Positional Depth Embedding for Multi-View 3D Object Detection Transformers
Haisheng Su
Junjie Zhang
Feixiang Song
Sanping Zhou
Wei Wu
N. Zheng
Junchi Yan
ViT, 3DPC
120
0
0
17 Oct 2025
DriveGen3D: Boosting Feed-Forward Driving Scene Generation with Efficient Video Diffusion
Weijie Wang
Jiagang Zhu
Zeyu Zhang
Xiaofeng Wang
Zheng Hua Zhu
...
Wenkang Qin
Duochao Shi
Haoyun Li
Guanghong Jia
Jiwen Lu
VGen
120
3
0
17 Oct 2025
Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery
Jie-Ying Lee
Yi-Ruei Liu
Shr-Ruei Tsai
Wei-Cheng Chang
Chung-Ho Wu
Jiewen Chan
Zhenjun Zhao
Chieh Hubert Lin
Yu-Lun Liu
3DGS
245
2
0
17 Oct 2025
MatchAttention: Matching the Relative Positions for High-Resolution Cross-View Matching
Tingman Yan
Tao Liu
Xilian Yang
Qunfei Zhao
Zeyang Xia
3DV
135
0
0
16 Oct 2025
Complementary Information Guided Occupancy Prediction via Multi-Level Representation Fusion
IEEE International Conference on Robotics and Automation (ICRA), 2025
Rongtao Xu
Jinzhou Lin
Jialei Zhou
Jiahua Dong
Changwei Wang
Ruisheng Wang
Li Guo
Shibiao Xu
Xiaodan Liang
3DPC
118
0
0
15 Oct 2025
CrossRay3D: Geometry and Distribution Guidance for Efficient Multimodal 3D Detection
Huiming Yang
Wenzhuo Liu
Yicheng Qiao
Lei Yang
Xianzhu Zeng
...
Zhiwei Li
Zijian Zeng
Zhiying Jiang
Huaping Liu
Kunfeng Wang
131
0
0
14 Oct 2025
DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving
Yingyan Li
Shuyao Shang
Weisong Liu
Bing Zhan
Haochen Wang
...
Yasong An
Chufeng Tang
Lu Hou
Lue Fan
Zhaoxiang Zhang
VLM
85
4
0
14 Oct 2025
Bridging Perspectives: Foundation Model Guided BEV Maps for 3D Object Detection and Tracking
Markus Kappeler
Özgün Çiçek
Daniele Cattaneo
Claudius Gläser
Yakov Miron
Abhinav Valada
92
0
0
11 Oct 2025
CVD-STORM: Cross-View Video Diffusion with Spatial-Temporal Reconstruction Model for Autonomous Driving
Tianrui Zhang
Yichen Liu
Zilin Guo
Yuxin Guo
Jingcheng Ni
Chenjing Ding
Dan Xu
Lewei Lu
Z. Wu
VGen
114
0
0
09 Oct 2025
RayFusion: Ray Fusion Enhanced Collaborative Visual Perception
Shaohong Wang
Bin Lu
Xinyu Xiao
Hanzhi Zhong
Bowen Pang
Tong Wang
Zhiyu Xiang
Hangguan Shan
Eryun Liu
83
0
0
09 Oct 2025
Learning Global Representation from Queries for Vectorized HD Map Construction
Shoumeng Qiu
Xinrun Li
Yang Long
Xiangyang Xue
Varun Ojha
Jian Pu
72
0
0
08 Oct 2025
Progressive Gaussian Transformer with Anisotropy-aware Sampling for Open Vocabulary Occupancy Prediction
Chi Yan
Dan Xu
3DGS
140
0
0
06 Oct 2025
Flexible and Efficient Spatio-Temporal Transformer for Sequential Visual Place Recognition
Yu Kiu
Chao-Yeh Chen
Ge Jin
Chen Feng
ViT
84
0
0
05 Oct 2025
Sequence-Preserving Dual-FoV Defense for Traffic Sign and Light Recognition in Autonomous Vehicles
Abhishek Joshi
Jahnavi Krishna Koda
Abhishek Phadke
AAML
84
0
0
03 Oct 2025
FIN: Fast Inference Network for Map Segmentation
Ruan Bispo
Tim Brophy
Reenu Mohandas
Anthony G. Scanlan
Ciarán Eising
81
0
0
01 Oct 2025
EasyOcc: 3D Pseudo-Label Supervision for Fully Self-Supervised Semantic Occupancy Prediction Models
Seamie Hayes
Ganesh Sistu
Ciarán Eising
111
1
0
30 Sep 2025
DriveE2E: Closed-Loop Benchmark for End-to-End Autonomous Driving through Real-to-Simulation
Haibao Yu
Wenxian Yang
Ruiyang Hao
C. Wang
Jiaru Zhong
Ping Luo
Zaiqing Nie
75
2
0
28 Sep 2025
BEV-VLM: Trajectory Planning via Unified BEV Abstraction
Guancheng Chen
Sheng Yang
Tong Zhan
Jian Wang
72
0
0
27 Sep 2025
OmniScene: Attention-Augmented Multimodal 4D Scene Understanding for Autonomous Driving
Pei Liu
Hongliang Lu
Haichao Liu
Haipeng Liu
Xin Liu
Ruoyu Yao
S. Li
Jun Ma
129
0
0
24 Sep 2025
TinyBEV: Cross Modal Knowledge Distillation for Efficient Multi Task Bird's Eye View Perception and Planning
Reeshad Khan
John Gauch
121
0
0
22 Sep 2025
MAESTRO: Task-Relevant Optimization via Adaptive Feature Enhancement and Suppression for Multi-task 3D Perception
C. Kang
Jisong Kim
Hongjae Shin
Junseo Park
J. Choi
76
0
0
22 Sep 2025
RCTDistill: Cross-Modal Knowledge Distillation Framework for Radar-Camera 3D Object Detection with Temporal Fusion
Geonho Bang
Minjae Seong
Jisong Kim
Geunju Baek
Daye Oh
Junhyung Kim
Junho Koh
Jun-Won Choi
100
0
0
22 Sep 2025
ST-GS: Vision-Based 3D Semantic Occupancy Prediction with Spatial-Temporal Gaussian Splatting
Xiaoyang Yan
Muleilan Pei
Shaojie Shen
3DGS
61
2
0
20 Sep 2025