ResearchTrend.AI
GAIA-1: A Generative World Model for Autonomous Driving

29 September 2023
Anthony Hu
Lloyd Russell
Hudson Yeo
Zak Murez
George Fedoseev
Alex Kendall
Jamie Shotton
Gianluca Corrado
    VGen
arXiv: 2309.17080

Papers citing "GAIA-1: A Generative World Model for Autonomous Driving"

50 / 168 papers shown
Multi-agent Embodied AI: Advances and Future Directions
Zhaohan Feng
Ruiqi Xue
Lei Yuan
Yang Yu
Ning Ding
M. Liu
Bingzhao Gao
Jian-jun Sun
Gang Wang
AI4CE
52
0
0
08 May 2025
SynSHRP2: A Synthetic Multimodal Benchmark for Driving Safety-critical Events Derived from Real-world Driving Data
Liang Shi
Boyu Jiang
Zhenyuan Yuan
Miguel A. Perez
Feng Guo
24
0
0
06 May 2025
PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth
Bu Jin
Weize Li
Baihan Yang
Zhenxin Zhu
Junpeng Jiang
...
Kun Zhan
Hengtong Hu
X. Zhang
Peng Jia
Hao Zhao
VGen
81
0
0
03 May 2025
A Survey of Robotic Navigation and Manipulation with Physics Simulators in the Era of Embodied AI
Lik Hang Kenny Wong
Xueyang Kang
Kaixin Bai
Jianwei Zhang
45
0
0
01 May 2025
A Survey of Interactive Generative Video
Jiwen Yu
Yiran Qin
Haoxuan Che
Quande Liu
X. Wang
Pengfei Wan
Di Zhang
Kun Gai
Hao Chen
Xihui Liu
VGen
53
0
0
30 Apr 2025
Learning to Drive from a World Model
Mitchell Goff
Greg Hogan
George Hotz
Armand du Parc Locmaria
Kacper Raczy
Harald Schäfer
Adeeb Shihadeh
Weixing Zhang
Yassine Yousfi
34
0
0
27 Apr 2025
Generative AI in Embodied Systems: System-Level Analysis of Performance, Efficiency and Scalability
Zishen Wan
Jiayi Qian
Yuhang Du
Jason J. Jabbour
Yilun Du
Yang Katie Zhao
A. Raychowdhury
Tushar Krishna
Vijay Janapa Reddi
LM&Ro
86
0
0
26 Apr 2025
Dynamic Camera Poses and Where to Find Them
C. Rockwell
Joseph Tung
Tsung-Yi Lin
Ming-Yu Liu
David Fouhey
Chen-Hsuan Lin
35
0
0
24 Apr 2025
DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompting and Motion Alignment
X. Li
Chenming Wu
Zhao Yang
Zhihao Xu
Dingkang Liang
Y. Zhang
Ji Wan
J. Wang
VGen
67
1
0
22 Apr 2025
MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention
Yucheng Li
Huiqiang Jiang
Chengruidong Zhang
Qianhui Wu
Xufang Luo
...
Amir H. Abdi
Dongsheng Li
Jianfeng Gao
Y. Yang
Lili Qiu
31
1
0
22 Apr 2025
CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning
Yang Yue
Yulin Wang
Chenxin Tao
Pan Liu
Shiji Song
Gao Huang
MedIm
24
0
0
18 Apr 2025
EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance
Yang Yue
Yulin Wang
Haojun Jiang
Pan Liu
S. Song
Gao Huang
VGen
27
0
0
17 Apr 2025
WORLDMEM: Long-term Consistent World Simulation with Memory
Zeqi Xiao
Yushi Lan
Yifan Zhou
Wenqi Ouyang
Shuai Yang
Yanhong Zeng
Xingang Pan
73
0
0
16 Apr 2025
Exploration-Driven Generative Interactive Environments
N. Savov
Naser Kazemi
Mohammad Mahdi
Danda Pani Paudel
Xi Wang
Luc Van Gool
VGen
3DV
38
0
0
03 Apr 2025
End-to-End Driving with Online Trajectory Evaluation via BEV World Model
Yingyan Li
Yuqi Wang
Yang Liu
Jiawei He
Lue Fan
Zhaoxiang Zhang
OffRL
93
0
0
02 Apr 2025
Can Test-Time Scaling Improve World Foundation Model?
Wenyan Cong
Hanqing Zhu
Peihao Wang
Bangya Liu
Dejia Xu
Kevin Wang
David Z. Pan
Yan Wang
Zhiwen Fan
Z. Wang
34
0
0
31 Mar 2025
Sim-and-Real Co-Training: A Simple Recipe for Vision-Based Robotic Manipulation
Abhiram Maddukuri
Z. L. Jiang
L. Chen
Soroush Nasiriany
Yuqi Xie
...
Scott Reed
Ken Goldberg
Ajay Mandlekar
Linxi Fan
Yuke Zhu
59
1
0
31 Mar 2025
Scenario Dreamer: Vectorized Latent Diffusion for Generating Driving Simulation Environments
Luke Rowe
Roger Girgis
Anthony Gosselin
Liam Paull
C. Pal
Felix Heide
DiffM
VGen
33
1
0
28 Mar 2025
CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models
Qingqing Zhao
Yao Lu
Moo Jin Kim
Zipeng Fu
Zhuoyang Zhang
...
Ankur Handa
Ming-Yu Liu
Donglai Xiang
Gordon Wetzstein
Tsung-Yi Lin
LM&Ro
LRM
43
10
0
27 Mar 2025
Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and Roadmap
Tong Nie
Jian-jun Sun
Wei Ma
58
1
0
27 Mar 2025
Exploring the Evolution of Physics Cognition in Video Generation: A Survey
Minghui Lin
Xiang Wang
Y. Wang
Shu Wang
Fengqi Dai
...
Cunxiang Wang
Zhengrong Zuo
Nong Sang
Siteng Huang
Donglin Wang
EGVM
VGen
78
3
0
27 Mar 2025
AdaWorld: Learning Adaptable World Models with Latent Actions
Shenyuan Gao
Siyuan Zhou
Yilun Du
Jun Zhang
Chuang Gan
VGen
54
3
0
24 Mar 2025
MiLA: Multi-view Intensive-fidelity Long-term Video Generation World Model for Autonomous Driving
Haiguang Wang
Daqi Liu
Hongwei Xie
Haisong Liu
Enhui Ma
Kaicheng Yu
Limin Wang
Bing Wang
VGen
67
0
0
20 Mar 2025
Generating Multimodal Driving Scenes via Next-Scene Prediction
Yanhao Wu
Haoyang Zhang
Tianwei Lin
Lichao Huang
Shujie Luo
Rui Wu
Congpei Qiu
Wei Ke
Tong Zhang
VGen
46
0
0
19 Mar 2025
Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception
Dingkang Liang
Dingyuan Zhang
Xin Zhou
Sifan Tu
Tianrui Feng
Xiaofan Li
Yumeng Zhang
Mingyang Du
Xiao Tan
Xiang Bai
65
2
0
17 Mar 2025
Learning-based 3D Reconstruction in Autonomous Driving: A Comprehensive Survey
Liewen Liao
Weihao Yan
Ming Yang
Songan Zhang
3DV
84
0
0
17 Mar 2025
Centaur: Robust End-to-End Autonomous Driving with Test-Time Training
Chonghao Sima
Kashyap Chitta
Zhiding Yu
Shiyi Lan
Ping Luo
Andreas Geiger
H. Li
Jose M. Alvarez
56
1
0
14 Mar 2025
Inter-environmental world modeling for continuous and compositional dynamics
Kohei Hayashi
Masanori Koyama
Julian Jorge Andrade Guerreiro
KELM
52
0
0
13 Mar 2025
RoCo-Sim: Enhancing Roadside Collaborative Perception through Foreground Simulation
Yuwen Du
Anning Hu
Zichen Chao
Yifan Lu
Junhao Ge
Genjia Liu
Weitao Wu
Lanjun Wang
Siheng Chen
60
0
0
13 Mar 2025
Unlock the Power of Unlabeled Data in Language Driving Model
Chaoqun Wang
Jie-jin Yang
Xiaobin Hong
Ruimao Zhang
46
0
0
13 Mar 2025
Reasoning in visual navigation of end-to-end trained agents: a dynamical systems approach
Steeven Janny
Hervé Poirier
L. Antsfeld
G. Bono
G. Monaci
Boris Chidlovskii
Francesco Giuliari
Alessio Del Bue
Christian Wolf
LM&Ro
50
0
0
11 Mar 2025
Object-Centric World Model for Language-Guided Manipulation
Youngjoon Jeong
Junha Chun
S. Cha
Taesup Kim
OCL
VGen
114
1
0
08 Mar 2025
TeraSim: Uncovering Unknown Unsafe Events for Autonomous Vehicles through Generative Simulation
Haowei Sun
Xintao Yan
Zhijie Qiao
Haojie Zhu
Yihao Sun
...
Yifan Wei
Wei Zheng
Y. Sun
Yasuo Fukai
Henry X. Liu
68
1
0
05 Mar 2025
Four Principles for Physically Interpretable World Models
Jordan Peper
Zhenjiang Mao
Yuang Geng
Siyuan Pan
Ivan Ruchkin
105
1
0
04 Mar 2025
Glad: A Streaming Scene Generator for Autonomous Driving
Bin Xie
Yingfei Liu
Tiancai Wang
Jiale Cao
X. Zhang
3DGS
VGen
44
1
0
26 Feb 2025
Generalist World Model Pre-Training for Efficient Reinforcement Learning
Yi Zhao
Aidan Scannell
Yuxin Hou
Tianyu Cui
Le Chen
Dieter Buchler
Arno Solin
Juho Kannala
J. Pajarinen
OffRL
OnRL
75
1
0
26 Feb 2025
VaViM and VaVAM: Autonomous Driving through Video Generative Modeling
Florent Bartoccioni
Elias Ramzi
Victor Besnier
Shashanka Venkataramanan
Tuan-Hung Vu
...
Mickael Chen
Éloi Zablocki
Andrei Bursuc
Eduardo Valle
Matthieu Cord
VGen
78
1
0
24 Feb 2025
MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction
Jingcheng Ni
Yuxin Guo
Yichen Liu
Rui Chen
Lewei Lu
Z. Wu
DiffM
VGen
59
3
0
17 Feb 2025
DMWM: Dual-Mind World Model with Long-Term Imagination
Lingyi Wang
Rashed Shelim
Walid Saad
Naren Ramakrishnan
LRM
94
1
0
11 Feb 2025
Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving
Yu Yang
Jianbiao Mei
Yukai Ma
Siliang Du
Wenqing Chen
Yijie Qian
Yuxiang Feng
Yong-jin Liu
84
11
0
20 Jan 2025
A Survey of World Models for Autonomous Driving
Tuo Feng
Wenguan Wang
Y. Yang
VGen
75
5
0
20 Jan 2025
Towards Unraveling and Improving Generalization in World Models
Qiaoyi Fang
Weiyu Du
Hang Wang
Junshan Zhang
OOD
28
0
0
03 Jan 2025
DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT
Xiaotao Hu
Wei Yin
Mingkai Jia
Junyuan Deng
Xiaoyang Guo
Qian Zhang
Xiaoxiao Long
Ping Tan
VGen
34
10
0
31 Dec 2024
DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers
Yuntao Chen
Yuqi Wang
Zhaoxiang Zhang
89
7
0
24 Dec 2024
An Efficient Occupancy World Model via Decoupled Dynamic Flow and Image-assisted Training
Haiming Zhang
Ying Xue
Xu Yan
Jiacheng Zhang
Weichao Qiu
Dongfeng Bai
Bingbing Liu
Shuguang Cui
Z. Li
68
5
0
18 Dec 2024
DINO-Foresight: Looking into the Future with DINO
Efstathios Karypidis
Ioannis Kakogeorgiou
Spyros Gidaris
N. Komodakis
AI4CE
79
1
0
16 Dec 2024
GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control
Mariam Hassan
Sebastian Stapf
Ahmad Rahimi
Pedro M B Rezende
Yasaman Haghighi
...
Mathieu Salzmann
Davide Scaramuzza
Marc Pollefeys
Paolo Favaro
Alexandre Alahi
VLM
VGen
69
5
0
15 Dec 2024
InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models
Yifan Lu
Xuanchi Ren
Jiawei Yang
Tianchang Shen
Zhangjie Wu
...
Y. Wang
Siheng Chen
Mike Chen
Sanja Fidler
Jiahui Huang
VGen
98
5
0
05 Dec 2024
The Matrix: Infinite-Horizon World Generation with Real-Time Moving Control
Ruili Feng
Han Zhang
Zhantao Yang
Jie Xiao
Zhilei Shu
Zhiheng Liu
Andy Zheng
Yukun Huang
Yu Liu
H. Zhang
VGen
87
9
0
04 Dec 2024
Driving Scene Synthesis on Free-form Trajectories with Generative Prior
Zeyu Yang
Zijie Pan
Yuankun Yang
Xiatian Zhu
L. Zhang
VGen
67
1
0
02 Dec 2024