2309.17080
GAIA-1: A Generative World Model for Autonomous Driving
29 September 2023
Anthony Hu
Lloyd Russell
Hudson Yeo
Zak Murez
George Fedoseev
Alex Kendall
Jamie Shotton
Gianluca Corrado
VGen
Papers citing "GAIA-1: A Generative World Model for Autonomous Driving"
50 / 168 papers shown
Title
iMotion-LLM: Motion Prediction Instruction Tuning
Abdulwahab Felemban
Eslam Mohamed Bakr
Xiaoqian Shen
Jian Ding
Abduallah A. Mohamed
Mohamed Elhoseiny
45
1
0
10 Jun 2024
Open-Endedness is Essential for Artificial Superhuman Intelligence
Edward Hughes
Michael Dennis
Jack Parker-Holder
Feryal M. P. Behbahani
Aditi Mavalankar
Yuge Shi
Tom Schaul
Tim Rocktäschel
LRM
32
18
0
06 Jun 2024
Shedding Light on Large Generative Networks: Estimating Epistemic Uncertainty in Diffusion Models
Lucas Berry
Axel Brando
D. Meger
16
5
0
05 Jun 2024
Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation
Enhui Ma
Lijun Zhou
Tao Tang
Zhan Zhang
Dong Han
...
Peng Jia
Xianpeng Lang
Haiyang Sun
Di Lin
Kaicheng Yu
VGen
18
20
0
03 Jun 2024
The Embodied World Model Based on LLM with Visual Information and Prediction-Oriented Prompts
Wakana Haijima
Kou Nakakubo
Masahiro Suzuki
Yutaka Matsuo
28
1
0
02 Jun 2024
OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving
Lening Wang
Wenzhao Zheng
Yilong Ren
Han Jiang
Zhiyong Cui
Haiyang Yu
Jiwen Lu
VGen
32
28
0
30 May 2024
In-Context Symmetries: Self-Supervised Learning through Contextual World Models
Sharut Gupta
Chenyu Wang
Yifei Wang
Tommi Jaakkola
Stefanie Jegelka
27
1
0
28 May 2024
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
Shenyuan Gao
Jiazhi Yang
Li Chen
Kashyap Chitta
Yihang Qiu
Andreas Geiger
Jun Zhang
Hongyang Li
60
75
0
27 May 2024
MetaEarth: A Generative Foundation Model for Global-Scale Remote Sensing Image Generation
Zhiping Yu
Chenyang Liu
Liqin Liu
Z. Shi
Zhengxia Zou
VGen
26
11
0
22 May 2024
A Survey of Robotic Language Grounding: Tradeoffs between Symbols and Embeddings
Vanya Cohen
J. Liu
Raymond J. Mooney
Stefanie Tellex
David Watkins
LM&Ro
29
12
0
21 May 2024
Diffusion for World Modeling: Visual Details Matter in Atari
Eloi Alonso
Adam Jelley
Vincent Micheli
Anssi Kanervisto
Amos Storkey
Tim Pearce
François Fleuret
39
39
0
20 May 2024
CarDreamer: Open-Source Learning Platform for World Model based Autonomous Driving
Dechen Gao
Shuangyu Cai
Hanchu Zhou
Hang Wang
Iman Soltani
Junshan Zhang
31
2
0
15 May 2024
AnoVox: A Benchmark for Multimodal Anomaly Detection in Autonomous Driving
Daniel Bogdoll
Iramm Hamdard
Lukas Namgyu Rößler
Felix Geisler
Muhammed Bayram
...
Miguel de Campos
Anushervon Tabarov
Yitian Yang
Hanno Gottschalk
J. Marius Zöllner
34
5
0
13 May 2024
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
Zheng Zhu
Xiaofeng Wang
Wangbo Zhao
Chen Min
Nianchen Deng
...
Dawei Zhao
Liang Xiao
Jian-jun Zhao
Jiwen Lu
Guan Huang
VGen
LM&Ro
79
36
0
06 May 2024
Video Diffusion Models: A Survey
Andrew Melnik
Michal Ljubljanac
Cong Lu
Qi Yan
Weiming Ren
Helge J. Ritter
VGen
66
12
0
06 May 2024
Vision-based 3D occupancy prediction in autonomous driving: a review and outlook
Yanan Zhang
Jinqing Zhang
Zengran Wang
Junhao Xu
Di Huang
23
14
0
04 May 2024
FlexiFilm: Long Video Generation with Flexible Conditions
Yichen Ouyang
Jianhao Yuan
Hao Zhao
Gaoang Wang
Bo-Lu Zhao
DiffM
42
6
0
29 Apr 2024
WorldGPT: Empowering LLM as Multimodal World Model
Zhiqi Ge
Hongzhe Huang
Mingze Zhou
Juncheng Li
Guoming Wang
Siliang Tang
Yueting Zhuang
35
26
0
28 Apr 2024
Predicting Long-horizon Futures by Conditioning on Geometry and Time
Tarasha Khurana
Deva Ramanan
AI4TS
33
0
0
17 Apr 2024
NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving
William Ljungbergh
Adam Tonderski
Joakim Johnander
Holger Caesar
Kalle Åström
Michael Felsberg
Christoffer Petersson
25
20
0
11 Apr 2024
LidarDM: Generative LiDAR Simulation in a Generated World
Vlas Zyrianov
Henry Che
Zhijian Liu
Shenlong Wang
VGen
25
20
0
03 Apr 2024
UFID: A Unified Framework for Input-level Backdoor Detection on Diffusion Models
Zihan Guan
Mengxuan Hu
Sheng R. Li
Anil Vullikanti
DiffM
AAML
34
3
0
01 Apr 2024
Urban Scene Diffusion through Semantic Occupancy Map
Junge Zhang
Qihang Zhang
Li Zhang
Ramana Rao Kompella
Gaowen Liu
Bolei Zhou
29
4
0
18 Mar 2024
Driving Style Alignment for LLM-powered Driver Agent
Ruoxuan Yang
Xinyue Zhang
Anais Fernandez-Laaksonen
Xin Ding
Jiangtao Gong
30
10
0
17 Mar 2024
Generalized Predictive Model for Autonomous Driving
Jiazhi Yang
Shenyuan Gao
Yihang Qiu
Li Chen
Tianyu Li
...
Ping Luo
Jun Zhang
Andreas Geiger
Yu Qiao
Hongyang Li
VGen
64
57
0
14 Mar 2024
Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation
Zicheng Zhang
Tong Zhang
Yi Zhu
Jian-zhuo Liu
Xiaodan Liang
QiXiang Ye
Wei Ke
VLM
44
2
0
13 Mar 2024
ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation
Guanxing Lu
Shiyi Zhang
Ziwei Wang
Changliu Liu
Jiwen Lu
Yansong Tang
44
49
0
13 Mar 2024
DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
Guosheng Zhao
Xiaofeng Wang
Zheng Zhu
Xinze Chen
Guan Huang
Xiaoyi Bao
Xingang Wang
VGen
33
14
0
11 Mar 2024
World Models for Autonomous Driving: An Initial Survey
Yanchen Guan
Haicheng Liao
Zhenning Li
Jia Hu
Runze Yuan
Yunjian Li
Guohui Zhang
Chengzhong Xu
32
31
0
05 Mar 2024
Pushing the Limits of Cross-Embodiment Learning for Manipulation and Navigation
Jonathan Yang
Catherine Glossop
Arjun Bhorkar
Dhruv Shah
Quan Vuong
Chelsea Finn
Dorsa Sadigh
Sergey Levine
36
41
0
29 Feb 2024
Video as the New Language for Real-World Decision Making
Sherry Yang
Jacob Walker
Jack Parker-Holder
Yilun Du
Jake Bruce
Andre Barreto
Pieter Abbeel
Dale Schuurmans
VGen
24
45
0
27 Feb 2024
DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models
Xiaoyu Tian
Junru Gu
Bailin Li
Yicheng Liu
Yang Wang
Chenxu Hu
Kun Zhan
Peng Jia
Xianpeng Lang
Hang Zhao
VLM
65
124
0
19 Feb 2024
Instance-Level Safety-Aware Fidelity of Synthetic Data and Its Calibration
Chih-Hong Cheng
Paul Stöckel
Xingyu Zhao
22
2
0
10 Feb 2024
Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following
Brian Yang
Huangyuan Su
N. Gkanatsios
Tsung-Wei Ke
Ayush Jain
Jeff Schneider
Katerina Fragkiadaki
DiffM
37
20
0
09 Feb 2024
Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents
Yuxi Wei
Zi Wang
Yifan Lu
Chenxin Xu
Chang-rui Liu
Hao Zhao
Siheng Chen
Yanfeng Wang
VGen
57
58
0
08 Feb 2024
The Essential Role of Causality in Foundation World Models for Embodied AI
Tarun Gupta
Wenbo Gong
Chao Ma
Nick Pawlowski
Agrin Hilmkil
...
Jianfeng Gao
Stefan Bauer
Danica Kragic
Bernhard Schölkopf
Cheng Zhang
28
15
0
06 Feb 2024
Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives
Sheng Luo
Wei-Neng Chen
Wanxin Tian
Rui Liu
Luanxuan Hou
...
Ling Shao
Yi Yang
Bojun Gao
Qun Li
Guobin Wu
47
13
0
05 Feb 2024
A Survey for Foundation Models in Autonomous Driving
Haoxiang Gao
Yaqian Li
Kaiwen Long
Ming Yang
Yiqing Shen
VLM
LRM
53
22
0
02 Feb 2024
Data-Centric Evolution in Autonomous Driving: A Comprehensive Survey of Big Data System, Data Mining, and Closed-Loop Technologies
Lincan Li
Wei Shao
Wei Dong
Yijun Tian
Qiming Zhang
Kaixiang Yang
Wenjie Zhang
18
8
0
23 Jan 2024
WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens
Xiaofeng Wang
Zheng Zhu
Guan Huang
Boyuan Wang
Xinze Chen
Jiwen Lu
VGen
27
32
0
18 Jan 2024
Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities
Xu Yan
Haiming Zhang
Yingjie Cai
Jingming Guo
Weichao Qiu
...
Lihui Jiang
Wei Zhang
Hongbo Zhang
Dengxin Dai
Bingbing Liu
54
17
0
16 Jan 2024
A Survey on Autonomous Driving Datasets: Statistics, Annotation Quality, and a Future Outlook
Mingyu Liu
Ekim Yurtsever
Jonathan Fossaert
Xingcheng Zhou
Walter Zimmer
Yuning Cui
B. L. Žagar
Alois C. Knoll
40
36
0
02 Jan 2024
Visual Point Cloud Forecasting enables Scalable Autonomous Driving
Zetong Yang
Li Chen
Yanan Sun
Hongyang Li
3DPC
25
40
0
29 Dec 2023
VideoPoet: A Large Language Model for Zero-Shot Video Generation
Dan Kondratyuk
Lijun Yu
Xiuye Gu
José Lezama
Jonathan Huang
...
Irfan Essa
Huisheng Wang
David A. Ross
Bryan Seybold
Lu Jiang
VGen
15
237
0
21 Dec 2023
Foundation Models in Robotics: Applications, Challenges, and the Future
Roya Firoozi
Johnathan Tucker
Stephen Tian
Anirudha Majumdar
Jiankai Sun
...
Brian Ichter
Danny Driess
Jiajun Wu
Cewu Lu
Mac Schwager
LM&Ro
AI4CE
LRM
VLM
35
137
0
13 Dec 2023
Language Models, Agent Models, and World Models: The LAW for Machine Reasoning and Planning
Zhiting Hu
Tianmin Shu
LLMAG
LM&Ro
LRM
102
34
0
08 Dec 2023
Prospective Role of Foundation Models in Advancing Autonomous Vehicles
Jianhua Wu
B. Gao
Jincheng Gao
Jianhao Yu
Hongqing Chu
...
Xun Gong
Yi Chang
H. E. Tseng
Hong Chen
Jie Chen
33
3
0
08 Dec 2023
Towards Knowledge-driven Autonomous Driving
Xin Li
Yeqi Bai
Pinlong Cai
Licheng Wen
Daocheng Fu
...
Yikang Li
Botian Shi
Yong-Jin Liu
Liang He
Yu Qiao
32
26
0
07 Dec 2023
Cache Me if You Can: Accelerating Diffusion Models through Block Caching
Felix Wimbauer
Bichen Wu
Edgar Schoenfeld
Xiaoliang Dai
Ji Hou
...
Jonas Kohler
Christian Rupprecht
Daniel Cremers
Peter Vajda
Jialiang Wang
DiffM
30
57
0
06 Dec 2023
Categorical Traffic Transformer: Interpretable and Diverse Behavior Prediction with Tokenized Latent
Yuxiao Chen
Sander Tonkens
Marco Pavone
25
9
0
30 Nov 2023