End-to-End Driving with Online Trajectory Evaluation via BEV World Model

End-to-end autonomous driving has achieved remarkable progress by integrating perception, prediction, and planning into a fully differentiable framework. Yet, to fully realize its potential, an effective online trajectory evaluation is indispensable to ensure safety. By forecasting the future outcomes of a given trajectory, trajectory evaluation becomes much more effective. This goal can be achieved by employing a world model to capture environmental dynamics and predict future states. Therefore, we propose an end-to-end driving framework WoTE, which leverages a BEV World model to predict future BEV states for Trajectory Evaluation. The proposed BEV world model is latency-efficient compared to image-level world models and can be seamlessly supervised using off-the-shelf BEV-space traffic simulators. We validate our framework on both the NAVSIM benchmark and the closed-loop Bench2Drive benchmark based on the CARLA simulator, achieving state-of-the-art performance. Code is released atthis https URL.
View on arXiv@article{li2025_2504.01941, title={ End-to-End Driving with Online Trajectory Evaluation via BEV World Model }, author={ Yingyan Li and Yuqi Wang and Yang Liu and Jiawei He and Lue Fan and Zhaoxiang Zhang }, journal={arXiv preprint arXiv:2504.01941}, year={ 2025 } }