DreamGen: Unlocking Generalization in Robot Learning through Video World Models
19 May 2025
Joel Jang
Seonghyeon Ye
Zongyu Lin
Jiannan Xiang
Johan Bjorck
Yu Fang
F. Hu
Shijie Huang
Kaushil Kundalia
Yen-Chen Lin
Loic Magne
Ajay Mandlekar
Avnish Narayan
You Liang Tan
Guanzhi Wang
Jing Wang
Qi Wang
Yinzhen Xu
Xiaohui Zeng
Kaiyuan Zheng
Ruijie Zheng
Ming-Yuan Liu
Luke Zettlemoyer
Dieter Fox
Jan Kautz
Scott Reed
Yuke Zhu
Linxi Fan
VGen, OffRL, AI4TS

Papers citing "DreamGen: Unlocking Generalization in Robot Learning through Video World Models"

30 / 30 papers shown
AdaPower: Specializing World Foundation Models for Predictive Manipulation
Yuhang Huang
Shilong Zou
J. Zhang
Xinwang Liu
Ruizhen Hu
Kai Xu
111
1
0
03 Dec 2025
IGen: Scalable Data Generation for Robot Learning from Open-World Images
Chenghao Gu
Haolan Kang
Junchao Lin
Jinghe Wang
Duo Wu
...
Ziyang Gong
Letian Li
Hongying Zheng
Changwei Lv
Zhi Wang
VGen, LM&Ro
195
1
0
01 Dec 2025
RynnVLA-002: A Unified Vision-Language-Action and World Model
Jun Cen
Siteng Huang
Yuqian Yuan
Kehan Li
Hangjie Yuan
...
Xin Li
Hao Luo
Fan Wang
Deli Zhao
H. Chen
VGen, SyDa
361
8
0
21 Nov 2025
Robot Learning from a Physical World Model
Jiageng Mao
Sicheng He
Hao-Ning Wu
Yang You
Shuyang Sun
...
Huizhong Chen
Leonidas Guibas
Vitor Campagnolo Guizilini
Zhengyu Ma
Yue Wang
VGen, PINN
464
6
0
10 Nov 2025
OmniDexGrasp: Generalizable Dexterous Grasping via Foundation Model and Force Feedback
Yi-Lin Wei
Zhexi Luo
Yuhao Lin
Mu Lin
Zhizhao Liang
Shuoyu Chen
Wei-Shi Zheng
114
2
0
27 Oct 2025
ROPES: Robotic Pose Estimation via Score-Based Causal Representation Learning
Pranamya Kulkarni
Puranjay Datta
Burak Varıcı
Emre Acartürk
Karthikeyan Shanmugam
A. Tajer
CML
255
1
0
23 Oct 2025
Ctrl-World: A Controllable Generative World Model for Robot Manipulation
Yanjiang Guo
Lucy Xiaoyang Shi
Jianyu Chen
Chelsea Finn
VGen
229
31
0
11 Oct 2025
WristWorld: Generating Wrist-Views via 4D World Models for Robotic Manipulation
Zezhong Qian
Xiaowei Chi
Yuming Li
Shizun Wang
Zhiyuan Qin
Xiaozhu Ju
Sirui Han
Shanghang Zhang
VGen
140
4
0
08 Oct 2025
Reliable and Scalable Robot Policy Evaluation with Imperfect Simulators
Apurva Badithela
David Snyder
Lihan Zha
Joseph Mikhail
Matthew O'Kelly
Anushri Dixit
Anirudha Majumdar
164
3
0
05 Oct 2025
EgoDemoGen: Novel Egocentric Demonstration Generation Enables Viewpoint-Robust Manipulation
Yuan Xu
Jiabing Yang
X. Wang
Yixiang Chen
Zheng Zhu
...
Shuo Lu
Jing Liu
Nianfeng Liu
Yan Huang
Liang Wang
VGen
208
5
0
26 Sep 2025
LongScape: Advancing Long-Horizon Embodied World Models with Context-Aware MoE
Yu Shang
Lei Jin
Yiding Ma
Xin Zhang
Chen Gao
Wei Wu
Yong Li
DiffM, VGen
159
1
0
26 Sep 2025
RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation
Yuming Jiang
Siteng Huang
Shengke Xue
Yaxi Zhao
Jun Cen
...
Kexiang Wang
Mingxiu Chen
F. Wang
Deli Zhao
Xin Li
VGen, LM&Ro
126
11
0
18 Sep 2025
PhysicalAgent: Towards General Cognitive Robotics with Foundation World Models
Artem Lykov
Jeffrin Sam
Hung Khang Nguyen
Vladislav Kozlovskiy
Yara Mahmoud
Valerii Serpiva
Miguel Altamirano Cabrera
Mikhail Konenkov
Dzmitry Tsetserukou
LM&Ro, VGen, AI4CE
119
0
0
17 Sep 2025
Robotic Manipulation via Imitation Learning: Taxonomy, Evolution, Benchmark, and Challenges
Zezeng Li
Alexandre Chapin
Enda Xiang
Rui Yang
Bruno Machado
Na Lei
Emmanuel Dellandrea
Di Huang
Liming Chen
318
3
0
24 Aug 2025
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge
Wenyao Zhang
Hongsi Liu
Zekun Qi
Yunnan Wang
X. Yu
...
He Wang
Dongbin Zhao
Li Yi
Wenjun Zeng
Xin Jin
VLM
256
69
0
06 Jul 2025
A Survey: Learning Embodied Intelligence from Physical Simulators and World Models
Xiaoxiao Long
Qingrui Zhao
Kaiwen Zhang
Zihao Zhang
Dingrui Wang
...
Jia Pan
Qiu Shen
Ruigang Yang
X. Cao
Qionghai Dai
LM&Ro, AI4CE
339
29
0
01 Jul 2025
UniVLA: Learning to Act Anywhere with Task-centric Latent Actions
Robotics (RAS), 2025
Qingwen Bu
Yanting Yang
Jisong Cai
Shenyuan Gao
Guanghui Ren
Maoqing Yao
Ping Luo
Hongyang Li
924
157
0
09 May 2025
$π_{0.5}$: a Vision-Language-Action Model with Open-World Generalization
Physical Intelligence
Kevin Black
Noah Brown
James Darpinian
Karan Dhabalia
...
Homer Walke
Anna Walling
Haohuan Wang
Lili Yu
Ury Zhilinsky
LM&Ro, VLM
8.6K
542
0
22 Apr 2025
Solving New Tasks by Adapting Internet Video Knowledge
International Conference on Learning Representations (ICLR), 2025
Calvin Luo
Zilai Zeng
Yilun Du
Chen Sun
245
15
0
21 Apr 2025
Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets
Chuning Zhu
Raymond Yu
S. Feng
Benjamin Burchfiel
Paarth Shah
Abhishek Gupta
VGen
523
63
0
03 Apr 2025
WorldScore: A Unified Evaluation Benchmark for World Generation
Haoyi Duan
Hong-Xing Yu
Sirui Chen
L. Fei-Fei
Jiajun Wu
VGen
417
53
0
01 Apr 2025
CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models
Computer Vision and Pattern Recognition (CVPR), 2025
Qingqing Zhao
Yao Lu
Moo Jin Kim
Zipeng Fu
Zhuoyang Zhang
...
Ankur Handa
Xuan Li
Donglai Xiang
Gordon Wetzstein
Nayeon Lee
LM&Ro, LRM
366
271
0
27 Mar 2025
Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control
Nvidia
Hassan Abu Alhaija
Jose M. Alvarez
Maciej Bala
Tiffany Cai
...
Yuchong Ye
Xiaodong Yang
Boxin Wang
Fangyin Wei
Yu Zeng
VGen
539
51
0
18 Mar 2025
GR00T N1: An Open Foundation Model for Generalist Humanoid Robots
Nvidia
Johan Bjorck
Fernando Castañeda
Nikita Cherniadev
Xingye Da
...
Ao Zhang
Hao Zhang
Yizhou Zhao
Ruijie Zheng
Yuke Zhu
VLM
618
527
0
18 Mar 2025
Unified Video Action Model
Shuang Li
Yihuai Gao
Dorsa Sadigh
Shuran Song
VGen
716
87
0
28 Feb 2025
Physics-Driven Data Generation for Contact-Rich Manipulation via Trajectory Optimization
Lujie Yang
H.J. Terry Suh
Tong Zhao
B. P. Graesdal
Tarik Kelestemur
Jiuguang Wang
Tao Pang
Russ Tedrake
454
22
0
27 Feb 2025
Qwen2.5-VL Technical Report
S. Bai
Keqin Chen
Xuejing Liu
Jialin Wang
Wenbin Ge
...
Zesen Cheng
Hang Zhang
Zhibo Yang
Haiyang Xu
Junyang Lin
VLM
929
3,577
0
20 Feb 2025
Latent Action Pretraining from Videos
International Conference on Learning Representations (ICLR), 2024
Seonghyeon Ye
Joel Jang
Byeongguk Jeon
Sejune Joo
Jianwei Yang
...
Kimin Lee
J. Gao
Luke Zettlemoyer
Dieter Fox
Minjoon Seo
494
182
0
15 Oct 2024
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
International Conference on Learning Representations (ICLR), 2024
Zhuoyi Yang
Jiayan Teng
Wendi Zheng
Ming Ding
Shiyu Huang
...
Weihan Wang
Yean Cheng
Xiaotao Gu
Yuxiao Dong
Jie Tang
DiffM, VGen
1.1K
1,476
0
12 Aug 2024
DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset
Alexander Khazatsky
Karl Pertsch
Suraj Nair
Ashwin Balakrishna
Sudeep Dasari
...
Thomas Kollar
Sergey Levine
Chelsea Finn
667
565
0
19 Mar 2024