Structured World Models from Human Videos

21 August 2023

Papers citing "Structured World Models from Human Videos"

50 / 99 papers shown

ManualVLA: A Unified VLA Model for Chain-of-Thought Manual Generation and Robotic Manipulation

...

161

01 Dec 2025

SmallWorlds: Assessing Dynamics Understanding of World Models in Isolated Environments

144

28 Nov 2025

Reinforcing Action Policies by Prophesying

233

25 Nov 2025

In-N-On: Scaling Egocentric Manipulation with in-the-wild and on-task Data

362

19 Nov 2025

Robot Learning from a Physical World Model

...

Vitor Campagnolo Guizilini

Zhengyu Ma

Yue Wang

VGen PINN

425

10 Nov 2025

TwinVLA: Data-Efficient Bimanual Manipulation with Twin Single-Arm Vision-Language-Action Models

07 Nov 2025

Learning Interactive World Model for Object-Centric Reinforcement Learning

313

04 Nov 2025

XR-1: Towards Versatile Vision-Language-Action Models via Learning Unified Vision-Motion Representations

...

253

04 Nov 2025

Clone Deterministic 3D Worlds

159

30 Oct 2025

Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos

...

129

24 Oct 2025

MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation

...

124

30 Sep 2025

IGFuse: Interactive 3D Gaussian Scene Reconstruction via Multi-Scans Fusion

131

18 Aug 2025

Visuomotor Grasping with World Models for Surgical Robots

Hongbin Lin

Bin Li

K. W. S. Au

159

15 Aug 2025

Large Model Empowered Embodied AI: A Survey on Decision-Making and Embodied Learning

170

14 Aug 2025

villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models

...

364

31 Jul 2025

GR-3 Technical Report

...

322

21 Jul 2025

Latent Policy Steering with Embodiment-Agnostic Pretrained World Models

Yiqi Wang

Mrinal Verghese

Jeff Schneider

255

17 Jul 2025

A Survey: Learning Embodied Intelligence from Physical Simulators and World Models

...

304

01 Jul 2025

Goal-VLA: Image-Generative VLMs as Object-Centric World Models Empowering Zero-shot Robot Manipulation

183

30 Jun 2025

SafeMimic: Towards Safe and Autonomous Human-to-Robot Imitation for Mobile Manipulation

Arpit Bahety

Arnav Balaji

Ben Abbatematteo

Roberto Martín-Martín

145

18 Jun 2025

WorldPrediction: A Benchmark for High-level World Modeling and Long-horizon Procedural Planning

250

04 Jun 2025

What Do Latent Action Models Actually Learn?International Conference on Learning Representations (ICLR), 2024

180

27 May 2025

OSVI-WM: One-Shot Visual Imitation for Unseen Tasks using World-Model-Guided Trajectory Generation

Raktim Gautam Goswami

Prashanth Krishnamurthy

Yann LeCun

Farshad Khorrami

VGen OffRL

275

26 May 2025

WorldEval: World Model as Real-World Robot Policies Evaluator

197

25 May 2025

Imagine Beyond! Distributionally Robust Auto-Encoding for State Space Coverage in Online Reinforcement Learning

451

23 May 2025

TeleOpBench: A Simulator-Centric Benchmark for Dual-Arm Dexterous Teleoperation

...

469

19 May 2025

ReWiND: Language-Guided Rewards Teach Robot Policies without New Demonstrations

Jiahui Zhang

Yusen Luo

Abrar Anwar

Sumedh Anand Sontakke

422

16 May 2025

UniVLA: Learning to Act Anywhere with Task-centric Latent ActionsRobotics (RAS), 2025

895

107

09 May 2025

PIN-WM: Learning Physics-INformed World Models for Non-Prehensile Manipulation

416

23 Apr 2025

Novel Diffusion Models for Multimodal 3D Hand Trajectory Prediction

234

10 Apr 2025

ZeroMimic: Distilling Robotic Manipulation Skills from Web VideosIEEE International Conference on Robotics and Automation (ICRA), 2025

286

31 Mar 2025

PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction ModelComputer Vision and Pattern Recognition (CVPR), 2025

256

25 Mar 2025

AdaWorld: Learning Adaptable World Models with Latent Actions

558

24 Mar 2025

HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model

...

627

100

13 Mar 2025

LuciBot: Automated Robot Policy Learning from Generated Videos

318

12 Mar 2025

Other Vehicle Trajectories Are Also Needed: A Driving World Model Unifies Ego-Other Vehicle Trajectories in Video Latent Space

970

12 Mar 2025

Cross-Embodiment Robotic Manipulation Synthesis via Guided Demonstrations through CycleVAE and Human Behavior Transformer

Apan Dastider

Hao Fang

Mingjie Lin

167

11 Mar 2025

Toward Stable World Models: Measuring and Addressing World Instability in Generative Environments

281

11 Mar 2025

Four Principles for Physically Interpretable World Models

440

04 Mar 2025

Exo-ViHa: A Cross-Platform Exoskeleton System with Visual and Haptic Feedback for Efficient Dexterous Skill Learning

305

03 Mar 2025

Magma: A Foundation Model for Multimodal AI AgentsComputer Vision and Pattern Recognition (CVPR), 2025

...

359

18 Feb 2025

Learning from Massive Human Videos for Universal Humanoid Pose Control

Vitor Campagnolo Guizilini

Yue Wang

340

18 Dec 2024

Reinforcement Learning from Wild Animal Videos

956

05 Dec 2024

CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation

...

354

191

29 Nov 2024

Understanding World or Predicting Future? A Comprehensive Survey of World ModelsACM Computing Surveys (ACM CSUR), 2024

...

Chen Gao

Fengli Xu

Yong Li

VGen SyDa

517

21 Nov 2024

Grounding Video Models to Actions through Goal Conditioned ExplorationInternational Conference on Learning Representations (ICLR), 2024

Yunhao Luo

Yilun Du

LM&Ro VGen

439

11 Nov 2024

Autoregressive Models in Vision: A Survey

...

503

08 Nov 2024

DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning

383

104

07 Nov 2024

STEER: Flexible Robotic Manipulation via Dense Language GroundingIEEE International Conference on Robotics and Automation (ICRA), 2024

Laura Smith

A. Irpan

Montserrat Gonzalez Arenas

300

05 Nov 2024

Pre-trained Visual Dynamics Representations for Efficient Policy LearningEuropean Conference on Computer Vision (ECCV), 2024

Hao Luo

Bohan Zhou

Zongqing Lu

267

05 Nov 2024