Structured World Models from Human Videos

21 August 2023

Papers citing "Structured World Models from Human Videos"

50 / 99 papers shown

ManualVLA: A Unified VLA Model for Chain-of-Thought Manual Generation and Robotic Manipulation

...

157

01 Dec 2025

SmallWorlds: Assessing Dynamics Understanding of World Models in Isolated Environments

137

28 Nov 2025

Reinforcing Action Policies by Prophesying

233

25 Nov 2025

In-N-On: Scaling Egocentric Manipulation with in-the-wild and on-task Data

356

19 Nov 2025

Robot Learning from a Physical World Model

...

Vitor Campagnolo Guizilini

Zhengyu Ma

Yue Wang

VGen PINN

421

10 Nov 2025

TwinVLA: Data-Efficient Bimanual Manipulation with Twin Single-Arm Vision-Language-Action Models

07 Nov 2025

Learning Interactive World Model for Object-Centric Reinforcement Learning

312

04 Nov 2025

XR-1: Towards Versatile Vision-Language-Action Models via Learning Unified Vision-Motion Representations

...

247

04 Nov 2025

Clone Deterministic 3D Worlds

155

30 Oct 2025

Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos

...

129

24 Oct 2025

MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation

...

124

30 Sep 2025

IGFuse: Interactive 3D Gaussian Scene Reconstruction via Multi-Scans Fusion

130

18 Aug 2025

Visuomotor Grasping with World Models for Surgical Robots

Hongbin Lin

Bin Li

K. W. S. Au

153

15 Aug 2025

Large Model Empowered Embodied AI: A Survey on Decision-Making and Embodied Learning

169

14 Aug 2025

villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models

...

361

31 Jul 2025

GR-3 Technical Report

...

320

21 Jul 2025

Latent Policy Steering with Embodiment-Agnostic Pretrained World Models

Yiqi Wang

Mrinal Verghese

Jeff Schneider

251

17 Jul 2025

A Survey: Learning Embodied Intelligence from Physical Simulators and World Models

...

301

01 Jul 2025

Goal-VLA: Image-Generative VLMs as Object-Centric World Models Empowering Zero-shot Robot Manipulation

174

30 Jun 2025

SafeMimic: Towards Safe and Autonomous Human-to-Robot Imitation for Mobile Manipulation

Arpit Bahety

Arnav Balaji

Ben Abbatematteo

Roberto Martín-Martín

145

18 Jun 2025

WorldPrediction: A Benchmark for High-level World Modeling and Long-horizon Procedural Planning

250

04 Jun 2025

What Do Latent Action Models Actually Learn?International Conference on Learning Representations (ICLR), 2024

174

27 May 2025

OSVI-WM: One-Shot Visual Imitation for Unseen Tasks using World-Model-Guided Trajectory Generation

Raktim Gautam Goswami

Prashanth Krishnamurthy

Yann LeCun

Farshad Khorrami

VGen OffRL

268

26 May 2025

WorldEval: World Model as Real-World Robot Policies Evaluator

194

25 May 2025

Imagine Beyond! Distributionally Robust Auto-Encoding for State Space Coverage in Online Reinforcement Learning

443

23 May 2025

TeleOpBench: A Simulator-Centric Benchmark for Dual-Arm Dexterous Teleoperation

...

467

19 May 2025

ReWiND: Language-Guided Rewards Teach Robot Policies without New Demonstrations

Jiahui Zhang

Yusen Luo

Abrar Anwar

Sumedh Anand Sontakke

420

16 May 2025

UniVLA: Learning to Act Anywhere with Task-centric Latent ActionsRobotics (RAS), 2025

889

102

09 May 2025

PIN-WM: Learning Physics-INformed World Models for Non-Prehensile Manipulation

412

23 Apr 2025

Novel Diffusion Models for Multimodal 3D Hand Trajectory Prediction

233

10 Apr 2025

ZeroMimic: Distilling Robotic Manipulation Skills from Web VideosIEEE International Conference on Robotics and Automation (ICRA), 2025

283

31 Mar 2025

PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction ModelComputer Vision and Pattern Recognition (CVPR), 2025

255

25 Mar 2025

AdaWorld: Learning Adaptable World Models with Latent Actions

555

24 Mar 2025

HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model

...

622

13 Mar 2025

LuciBot: Automated Robot Policy Learning from Generated Videos

316

12 Mar 2025

Other Vehicle Trajectories Are Also Needed: A Driving World Model Unifies Ego-Other Vehicle Trajectories in Video Latent Space

966

12 Mar 2025

Cross-Embodiment Robotic Manipulation Synthesis via Guided Demonstrations through CycleVAE and Human Behavior Transformer

Apan Dastider

Hao Fang

Mingjie Lin

166

11 Mar 2025

Toward Stable World Models: Measuring and Addressing World Instability in Generative Environments

272

11 Mar 2025

Four Principles for Physically Interpretable World Models

427

04 Mar 2025

Exo-ViHa: A Cross-Platform Exoskeleton System with Visual and Haptic Feedback for Efficient Dexterous Skill Learning

299

03 Mar 2025

Magma: A Foundation Model for Multimodal AI AgentsComputer Vision and Pattern Recognition (CVPR), 2025

...

355

18 Feb 2025

Learning from Massive Human Videos for Universal Humanoid Pose Control

Vitor Campagnolo Guizilini

Yue Wang

338

18 Dec 2024

Reinforcement Learning from Wild Animal Videos

952

05 Dec 2024

CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation

...

354

187

29 Nov 2024

Understanding World or Predicting Future? A Comprehensive Survey of World ModelsACM Computing Surveys (ACM CSUR), 2024

...

Chen Gao

Fengli Xu

Yong Li

VGen SyDa

517

21 Nov 2024

Grounding Video Models to Actions through Goal Conditioned ExplorationInternational Conference on Learning Representations (ICLR), 2024

Yunhao Luo

Yilun Du

LM&Ro VGen

405

11 Nov 2024

Autoregressive Models in Vision: A Survey

...

489

08 Nov 2024

DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning

375

103

07 Nov 2024

STEER: Flexible Robotic Manipulation via Dense Language GroundingIEEE International Conference on Robotics and Automation (ICRA), 2024

Laura Smith

A. Irpan

Montserrat Gonzalez Arenas

295

05 Nov 2024

Pre-trained Visual Dynamics Representations for Efficient Policy LearningEuropean Conference on Computer Vision (ECCV), 2024

Hao Luo

Bohan Zhou

Zongqing Lu

267

05 Nov 2024