Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.17398
Cited By
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
27 May 2024
Shenyuan Gao
Jiazhi Yang
Li Chen
Kashyap Chitta
Yihang Qiu
Andreas Geiger
Jun Zhang
Hongyang Li
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability"
50 / 73 papers shown
Title
Learning 3D Persistent Embodied World Models
Siyuan Zhou
Yilun Du
Yuncong Yang
Lei Han
Peihao Chen
Dit-Yan Yeung
Chuang Gan
VGen
32
0
0
05 May 2025
PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth
Bu Jin
Weize Li
Baihan Yang
Zhenxin Zhu
Junpeng Jiang
...
Kun Zhan
Hengtong Hu
X. Zhang
Peng Jia
Hao Zhao
VGen
24
0
0
03 May 2025
A Survey of Interactive Generative Video
Jiwen Yu
Yiran Qin
Haoxuan Che
Quande Liu
X. Wang
Pengfei Wan
Di Zhang
Kun Gai
Hao Chen
Xihui Liu
VGen
53
0
0
30 Apr 2025
MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention
Yucheng Li
Huiqiang Jiang
Chengruidong Zhang
Qianhui Wu
Xufang Luo
...
Amir H. Abdi
Dongsheng Li
Jianfeng Gao
Y. Yang
Lili Qiu
26
1
0
22 Apr 2025
DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompting and Motion Alignment
X. Li
Chenming Wu
Zhao Yang
Zhihao Xu
Dingkang Liang
Y. Zhang
Ji Wan
J. Wang
VGen
62
1
0
22 Apr 2025
EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance
Yang Yue
Yulin Wang
Haojun Jiang
Pan Liu
S. Song
Gao Huang
VGen
17
0
0
17 Apr 2025
Decoupled Diffusion Sparks Adaptive Scene Generation
Yunsong Zhou
Naisheng Ye
William Ljungbergh
Tianyu Li
Jiazhi Yang
Zetong Yang
Hongzi Zhu
Christoffer Petersson
Hongyang Li
30
1
0
14 Apr 2025
End-to-End Driving with Online Trajectory Evaluation via BEV World Model
Yingyan Li
Yuqi Wang
Yang Liu
Jiawei He
Lue Fan
Zhaoxiang Zhang
OffRL
44
0
0
02 Apr 2025
Can Test-Time Scaling Improve World Foundation Model?
Wenyan Cong
Hanqing Zhu
Peihao Wang
Bangya Liu
Dejia Xu
Kevin Wang
David Z. Pan
Yan Wang
Zhiwen Fan
Z. Wang
34
0
0
31 Mar 2025
Scenario Dreamer: Vectorized Latent Diffusion for Generating Driving Simulation Environments
Luke Rowe
Roger Girgis
Anthony Gosselin
Liam Paull
C. Pal
Felix Heide
DiffM
VGen
33
1
0
28 Mar 2025
Exploring the Evolution of Physics Cognition in Video Generation: A Survey
Minghui Lin
Xiang Wang
Y. Wang
Shu Wang
Fengqi Dai
...
Cunxiang Wang
Zhengrong Zuo
Nong Sang
Siteng Huang
Donglin Wang
EGVM
VGen
75
3
0
27 Mar 2025
Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and Roadmap
Tong Nie
Jian-jun Sun
Wei Ma
58
1
0
27 Mar 2025
AdaWorld: Learning Adaptable World Models with Latent Actions
Shenyuan Gao
Siyuan Zhou
Yilun Du
Jun Zhang
Chuang Gan
VGen
51
3
0
24 Mar 2025
ReconDreamer++: Harmonizing Generative and Reconstructive Models for Driving Scene Representation
Guosheng Zhao
Xiaofeng Wang
Chaojun Ni
Zheng Zhu
Wenkang Qin
Guan Huang
Xingang Wang
44
1
0
24 Mar 2025
MiLA: Multi-view Intensive-fidelity Long-term Video Generation World Model for Autonomous Driving
Haiguang Wang
Daqi Liu
Hongwei Xie
Haisong Liu
Enhui Ma
Kaicheng Yu
Limin Wang
Bing Wang
VGen
67
0
0
20 Mar 2025
Learning-based 3D Reconstruction in Autonomous Driving: A Comprehensive Survey
Liewen Liao
Weihao Yan
Ming Yang
Songan Zhang
3DV
84
0
0
17 Mar 2025
Hydra-MDP++: Advancing End-to-End Driving via Expert-Guided Hydra-Distillation
Kailin Li
Zhenxin Li
Shiyi Lan
Yuan Xie
Zhizhong Zhang
J. Liu
Zuxuan Wu
Zhiding Yu
Jose M.Alvarez
37
1
0
17 Mar 2025
Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception
Dingkang Liang
Dingyuan Zhang
Xin Zhou
Sifan Tu
Tianrui Feng
Xiaofan Li
Yumeng Zhang
Mingyang Du
Xiao Tan
Xiang Bai
52
2
0
17 Mar 2025
Centaur: Robust End-to-End Autonomous Driving with Test-Time Training
Chonghao Sima
Kashyap Chitta
Zhiding Yu
Shiyi Lan
Ping Luo
Andreas Geiger
H. Li
Jose M. Alvarez
54
1
0
14 Mar 2025
MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction
Yingshuang Zou
Yikang Ding
Chuanrui Zhang
Jiazhe Guo
Bohan Li
Xiaoyang Lyu
Feiyang Tan
Xiaojuan Qi
Haoqian Wang
3DGS
49
0
0
13 Mar 2025
Other Vehicle Trajectories Are Also Needed: A Driving World Model Unifies Ego-Other Vehicle Trajectories in Video Latent Space
Jian Zhu
Zhengyu Jia
Tian Gao
Jiaxin Deng
Shidi Li
Fu Liu
Peng Jia
Xianpeng Lang
Xiaolong Sun
VGen
66
0
0
12 Mar 2025
Temporal Triplane Transformers as Occupancy World Models
Haoran Xu
Peixi Peng
Guang Tan
Yiqian Chang
Yisen Zhao
Yonghong Tian
36
0
0
10 Mar 2025
WorldModelBench: Judging Video Generation Models As World Models
Dacheng Li
Yunhao Fang
Yukang Chen
Shuo Yang
Shiyi Cao
...
Hongxu Yin
Joseph E. Gonzalez
Ion Stoica
Song Han
Y. Lu
VGen
52
3
0
28 Feb 2025
BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance
Xin Ye
Burhaneddin Yaman
Sheng Cheng
Feng Tao
Abhirup Mallik
Liu Ren
DiffM
58
1
0
27 Feb 2025
Glad: A Streaming Scene Generator for Autonomous Driving
Bin Xie
Yingfei Liu
Tiancai Wang
Jiale Cao
X. Zhang
3DGS
VGen
34
1
0
26 Feb 2025
VaViM and VaVAM: Autonomous Driving through Video Generative Modeling
Florent Bartoccioni
Elias Ramzi
Victor Besnier
Shashanka Venkataramanan
Tuan-Hung Vu
...
Mickael Chen
Éloi Zablocki
Andrei Bursuc
Eduardo Valle
Matthieu Cord
VGen
65
1
0
24 Feb 2025
MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction
Jingcheng Ni
Yuxin Guo
Yichen Liu
Rui Chen
Lewei Lu
Z. Wu
DiffM
VGen
51
3
0
17 Feb 2025
Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving
Yu Yang
Jianbiao Mei
Yukai Ma
Siliang Du
Wenqing Chen
Yijie Qian
Yuxiang Feng
Yong-jin Liu
58
11
0
20 Jan 2025
A Survey of World Models for Autonomous Driving
Tuo Feng
Wenguan Wang
Y. Yang
VGen
66
5
0
20 Jan 2025
Control-ITRA: Controlling the Behavior of a Driving Model
Vasileios Lioutas
Adam Scibior
Matthew Niedoba
Berend Zwartsenberg
Frank D. Wood
42
0
0
17 Jan 2025
DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT
Xiaotao Hu
Wei Yin
Mingkai Jia
Junyuan Deng
Xiaoyang Guo
Qian Zhang
Xiaoxiao Long
Ping Tan
VGen
29
10
0
31 Dec 2024
DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers
Yuntao Chen
Yuqi Wang
Zhaoxiang Zhang
43
6
0
24 Dec 2024
DINO-Foresight
\texttt{DINO-Foresight}
DINO-Foresight
: Looking into the Future with DINO
Efstathios Karypidis
Ioannis Kakogeorgiou
Spyros Gidaris
N. Komodakis
AI4CE
77
1
0
16 Dec 2024
GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control
Mariam Hassan
Sebastian Stapf
Ahmad Rahimi
Pedro M B Rezende
Yasaman Haghighi
...
Mathieu Salzmann
Davide Scaramuzza
Marc Pollefeys
Paolo Favaro
Alexandre Alahi
VLM
VGen
64
4
0
15 Dec 2024
InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models
Yifan Lu
Xuanchi Ren
Jiawei Yang
Tianchang Shen
Zhangjie Wu
...
Y. Wang
Siheng Chen
Mike Chen
Sanja Fidler
Jiahui Huang
VGen
82
5
0
05 Dec 2024
Driving Scene Synthesis on Free-form Trajectories with Generative Prior
Zeyu Yang
Zijie Pan
Yuankun Yang
Xiatian Zhu
L. Zhang
VGen
62
0
0
02 Dec 2024
InfinityDrive: Breaking Time Limits in Driving World Models
Xi Guo
C. Ding
Haoxuan Dou
Xin Zhang
Weixuan Tang
Wei Yu Wu
VGen
81
5
0
02 Dec 2024
Motion Dreamer: Boundary Conditional Motion Reasoning for Physically Coherent Video Generation
Tianshuo Xu
Zhifei Chen
Leyi Wu
Hao Lu
Yuying Chen
Lihui Jiang
Bingbing Liu
Yingcong Chen
VGen
70
0
0
30 Nov 2024
ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration
Chaojun Ni
Guosheng Zhao
Xiaofeng Wang
Zheng Hua Zhu
Wenkang Qin
...
Kun Zhan
Peng Jia
Xianpeng Lang
Xingang Wang
Wenjun Mei
VGen
86
6
0
29 Nov 2024
DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation
Tianyi Yan
Dongming Wu
Wencheng Han
Junpeng Jiang
Xia Zhou
Kun Zhan
Cheng-Zhong Xu
Jianbing Shen
24
3
0
18 Nov 2024
Evaluating World Models with LLM for Decision Making
Chang Yang
Xinrun Wang
Junzhe Jiang
Qinggang Zhang
Xiao Huang
LLMAG
ELM
29
2
0
13 Nov 2024
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
Wenqiang Sun
Shuo Chen
F. Liu
Zilong Chen
Yueqi Duan
Jun Zhang
Yikai Wang
VGen
41
31
0
07 Nov 2024
Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey
Ao Fu
Yi Zhou
Tao Zhou
Y. Yang
Bojun Gao
Qun Li
Guobin Wu
Ling Shao
VGen
45
2
0
05 Nov 2024
WorldSimBench: Towards Video Generation Models as World Simulators
Yiran Qin
Zhelun Shi
Jiwen Yu
Xijun Wang
Enshen Zhou
...
Lu Sheng
Jing Shao
Lei Bai
Wanli Ouyang
Ruimao Zhang
EGVM
VGen
113
364
0
23 Oct 2024
EVA: An Embodied World Model for Future Video Anticipation
Xiaowei Chi
Hengyuan Zhang
Chun-Kai Fan
Xingqun Qi
Rongyu Zhang
...
Chi-Min Chan
Wei Xue
Wenhan Luo
Shanghang Zhang
Yike Guo
VGen
25
4
0
20 Oct 2024
DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation
Guosheng Zhao
Chaojun Ni
Xiaofeng Wang
Zheng Zhu
X. Zhang
...
Xinze Chen
Boyuan Wang
Youyi Zhang
Wenjun Mei
Xingang Wang
VGen
66
24
0
17 Oct 2024
Asymptotic Analysis of Sample-averaged Q-learning
Saunak Kumar Panda
Ruiqi Liu
Yisha Xiang
OnRL
37
0
0
14 Oct 2024
Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation
Fanqing Meng
Jiaqi Liao
Xinyu Tan
Wenqi Shao
Quanfeng Lu
Kaipeng Zhang
Yu Cheng
Dianqi Li
Yu Qiao
Ping Luo
VGen
EGVM
19
23
0
07 Oct 2024
Efficient Driving Behavior Narration and Reasoning on Edge Device Using Large Language Models
Yizhou Huang
Yihua Cheng
Kezhi Wang
LRM
20
1
0
30 Sep 2024
DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes
Jianbiao Mei
Yukai Ma
Xuemeng Yang
Licheng Wen
Tiantian Wei
Min Dou
Yukai Ma
Min Dou
Botian Shi
Yong Liu
DiffM
VGen
42
3
0
06 Sep 2024
1
2
Next