ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2310.08576
  4. Cited By
Learning to Act from Actionless Videos through Dense Correspondences

Learning to Act from Actionless Videos through Dense Correspondences

International Conference on Learning Representations (ICLR), 2023
12 October 2023
Po-Chen Ko
Jiayuan Mao
Yilun Du
Shao-Hua Sun
Josh Tenenbaum
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)

Papers citing "Learning to Act from Actionless Videos through Dense Correspondences"

50 / 69 papers shown
Video2Act: A Dual-System Video Diffusion Policy with Robotic Spatio-Motional Modeling
Video2Act: A Dual-System Video Diffusion Policy with Robotic Spatio-Motional Modeling
Yueru Jia
Jiaming Liu
Shengbang Liu
Rui Zhou
W. Yu
Yuyang Yan
Xiaowei Chi
Yandong Guo
Boxin Shi
Shanghang Zhang
VGen
298
1
0
02 Dec 2025
IGen: Scalable Data Generation for Robot Learning from Open-World Images
IGen: Scalable Data Generation for Robot Learning from Open-World Images
Chenghao Gu
Haolan Kang
Junchao Lin
Jinghe Wang
Duo Wu
...
Ziyang Gong
Letian Li
Hongying Zheng
Changwei Lv
Zhi Wang
VGenLM&Ro
145
0
0
01 Dec 2025
TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos
TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos
Seungjae Lee
Yoonkyo Jung
Inkook Chun
Yao-Chih Lee
Zikui Cai
...
Aayush Talreja
Tan Dat Dao
Yongyuan Liang
Jia-Bin Huang
Furong Huang
107
0
0
26 Nov 2025
ViPRA: Video Prediction for Robot Actions
ViPRA: Video Prediction for Robot Actions
Sandeep Routray
Hengkai Pan
Unnat Jain
Shikhar Bahl
Deepak Pathak
230
2
0
11 Nov 2025
Simulating the Visual World with Artificial Intelligence: A Roadmap
Simulating the Visual World with Artificial Intelligence: A Roadmap
Jingtong Yue
Z. Huang
Z. Chen
Xintao Wang
Pengfei Wan
Ziwei Liu
VGenLM&Ro
464
1
0
11 Nov 2025
Robot Learning from a Physical World Model
Robot Learning from a Physical World Model
Jiageng Mao
Sicheng He
Hao-Ning Wu
Yang You
Shuyang Sun
...
Huizhong Chen
Leonidas Guibas
Vitor Campagnolo Guizilini
Zhengyu Ma
Yue Wang
VGenPINN
421
0
0
10 Nov 2025
A Step Toward World Models: A Survey on Robotic Manipulation
A Step Toward World Models: A Survey on Robotic Manipulation
Peng-Fei Zhang
Ying Cheng
Xiaofan Sun
S. Wang
Lei Zhu
Lei Zhu
Heng Tao Shen
LM&Ro
745
3
0
31 Oct 2025
World-in-World: World Models in a Closed-Loop World
World-in-World: World Models in a Closed-Loop World
Jiahan Zhang
Muqing Jiang
Nanru Dai
Taiming Lu
Arda Uzunoglu
...
Rama Chellappa
Tianmin Shu
Alan Yuille
Yilun Du
Jieneng Chen
VGenVLM
234
6
0
20 Oct 2025
Implicit State Estimation via Video Replanning
Implicit State Estimation via Video Replanning
Po-Chen Ko
Jiayuan Mao
Yu-Hsiang Fu
Hsien-Jeng Yeh
Chu-Rong Chen
Wei-Chiu Ma
Yilun Du
Shao-Hua Sun
120
1
0
20 Oct 2025
MoMaps: Semantics-Aware Scene Motion Generation with Motion Maps
MoMaps: Semantics-Aware Scene Motion Generation with Motion Maps
Jiahui Lei
Kyle Genova
George Kopanas
Noah Snavely
Leonidas Guibas
121
1
0
13 Oct 2025
When a Robot is More Capable than a Human: Learning from Constrained Demonstrators
When a Robot is More Capable than a Human: Learning from Constrained Demonstrators
Xinhu Li
Ayush Jain
Zhaojing Yang
Yigit Korkmaz
Erdem Bıyık
81
0
0
10 Oct 2025
An approach for systematic decomposition of complex llm tasks
An approach for systematic decomposition of complex llm tasks
Tianle Zhou
Jiakai Xu
G. Liu
Jiaxiang Liu
Haonan Wang
Eugene Wu
148
0
0
09 Oct 2025
Vision-Language-Action Models for Robotics: A Review Towards Real-World Applications
Vision-Language-Action Models for Robotics: A Review Towards Real-World ApplicationsIEEE Access (IEEE Access), 2025
Kento Kawaharazuka
Jihoon Oh
Jun Yamada
Ingmar Posner
Yuke Zhu
LM&Ro
261
24
0
08 Oct 2025
Luth: Efficient French Specialization for Small Language Models and Cross-Lingual Transfer
Luth: Efficient French Specialization for Small Language Models and Cross-Lingual Transfer
Maxence Lasbordes
Sinoué Gad
134
0
0
07 Oct 2025
MultiModal Action Conditioned Video Generation
MultiModal Action Conditioned Video Generation
Yichen Li
Antonio Torralba
VGen
184
3
0
02 Oct 2025
PoseDiff: A Unified Diffusion Model Bridging Robot Pose Estimation and Video-to-Action Control
PoseDiff: A Unified Diffusion Model Bridging Robot Pose Estimation and Video-to-Action Control
Haozhuo Zhang
Michele Caprio
Jing Shao
Qiang Zhang
Yong Dai
Shanghang Zhang
Wei Pan
VGen
176
0
0
29 Sep 2025
Robot Learning from Any Images
Robot Learning from Any Images
Siheng Zhao
Jiageng Mao
Wei Chow
Zeyu Shangguan
Tianheng Shi
...
Daniel Seita
Leonidas Guibas
Sergey Zakharov
Vitor Campagnolo Guizilini
Yue Wang
168
4
0
26 Sep 2025
Pixel Motion Diffusion is What We Need for Robot Control
Pixel Motion Diffusion is What We Need for Robot Control
E-Ro Nguyen
Y. Zhang
Kanchana Ranasinghe
Xiang Li
Michael S. Ryoo
DiffM
140
0
0
26 Sep 2025
From Watch to Imagine: Steering Long-horizon Manipulation via Human Demonstration and Future Envisionment
From Watch to Imagine: Steering Long-horizon Manipulation via Human Demonstration and Future Envisionment
Ke Ye
Jiaming Zhou
Yuanfeng Qiu
Jiayi Liu
Shihui Zhou
Kun-Yu Lin
Junwei Liang
VGen
186
1
0
26 Sep 2025
WoW: Towards a World omniscient World model Through Embodied Interaction
WoW: Towards a World omniscient World model Through Embodied Interaction
Xiaowei Chi
Peidong Jia
Chun-Kai Fan
Xiaozhu Ju
Weishi Mi
...
Wei Xue
Sirui Han
Yike Guo
Shanghang Zhang
Yong Dai
VGen
164
2
0
26 Sep 2025
VLBiMan: Vision-Language Anchored One-Shot Demonstration Enables Generalizable Bimanual Robotic Manipulation
VLBiMan: Vision-Language Anchored One-Shot Demonstration Enables Generalizable Bimanual Robotic Manipulation
Huayi Zhou
Kui Jia
LM&Ro
191
0
0
26 Sep 2025
What Happens Next? Anticipating Future Motion by Generating Point Trajectories
What Happens Next? Anticipating Future Motion by Generating Point Trajectories
Gabrijel Boduljak
Laurynas Karazija
Iro Laina
Christian Rupprecht
Andrea Vedaldi
VGen
113
1
0
25 Sep 2025
Pure Vision Language Action (VLA) Models: A Comprehensive Survey
Pure Vision Language Action (VLA) Models: A Comprehensive Survey
Dapeng Zhang
Jin Sun
Chenghui Hu
Xiaoyan Wu
Zhenlong Yuan
R. Zhou
Fei Shen
Qingguo Zhou
LM&Ro
295
15
0
23 Sep 2025
Generative Visual Foresight Meets Task-Agnostic Pose Estimation in Robotic Table-Top Manipulation
Generative Visual Foresight Meets Task-Agnostic Pose Estimation in Robotic Table-Top Manipulation
Chuye Zhang
Xiaoxiong Zhang
Wei Pan
Linfang Zheng
Wei Zhang
192
0
0
30 Aug 2025
Learning Primitive Embodied World Models: Towards Scalable Robotic Learning
Learning Primitive Embodied World Models: Towards Scalable Robotic Learning
Qiao Sun
Liujia Yang
Wei Tang
Wei Huang
Kaixin Xu
...
Tong He
Yilun Chen
Xili Dai
Nanyang Ye
Qinying Gu
VGenLM&Ro
409
1
0
28 Aug 2025
Spatial Policy: Guiding Visuomotor Robotic Manipulation with Spatial-Aware Modeling and Reasoning
Spatial Policy: Guiding Visuomotor Robotic Manipulation with Spatial-Aware Modeling and Reasoning
Yijun Liu
Yuwei Liu
Yuan Meng
J. Zhang
Yuwei Zhou
...
Jiacheng Jiang
Kangye Ji
Shijia Ge
Zhi Wang
Wenwu Zhu
97
1
0
21 Aug 2025
Precise Action-to-Video Generation Through Visual Action Prompts
Precise Action-to-Video Generation Through Visual Action Prompts
Yuang Wang
Chao Wen
Haoyu Guo
Sida Peng
Minghan Qin
Hujun Bao
Xiaowei Zhou
Ruizhen Hu
VGen
125
3
0
18 Aug 2025
GenFlowRL: Shaping Rewards with Generative Object-Centric Flow in Visual Reinforcement Learning
GenFlowRL: Shaping Rewards with Generative Object-Centric Flow in Visual Reinforcement Learning
Kelin Yu
Sheng Zhang
Harshit Soora
Furong Huang
Heng Huang
Erfaun Noorani
Ruohan Gao
VGen
94
4
0
14 Aug 2025
Boosting Action-Information via a Variational Bottleneck on Unlabelled Robot Videos
Boosting Action-Information via a Variational Bottleneck on Unlabelled Robot Videos
Haoyu Zhang
Long Cheng
SSL
105
1
0
12 Aug 2025
VLM-SFD: VLM-Assisted Siamese Flow Diffusion Framework for Dual-Arm Cooperative Manipulation
VLM-SFD: VLM-Assisted Siamese Flow Diffusion Framework for Dual-Arm Cooperative ManipulationIEEE Robotics and Automation Letters (IEEE RA-L), 2025
Jiaming Chen
Yiyu Jiang
Aoshen Huang
Yang Li
Wei Pan
140
0
0
16 Jun 2025
Self-Adapting Improvement Loops for Robotic Learning
Self-Adapting Improvement Loops for Robotic Learning
Calvin Luo
Zilai Zeng
Mingxi Jia
Yilun Du
Chen Sun
162
1
0
07 Jun 2025
3DFlowAction: Learning Cross-Embodiment Manipulation from 3D Flow World Model
3DFlowAction: Learning Cross-Embodiment Manipulation from 3D Flow World Model
Hongyan Zhi
Peihao Chen
Siyuan Zhou
Yubo Dong
Quanxi Wu
Lei Han
Mingkui Tan
387
13
0
06 Jun 2025
Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control
Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control
Xiao Fu
Xintao Wang
Xian Liu
Jianhong Bai
R. Xu
Pengfei Wan
Di Zhang
Dahua Lin
VGen
254
13
0
02 Jun 2025
Towards a Generalizable Bimanual Foundation Policy via Flow-based Video Prediction
Towards a Generalizable Bimanual Foundation Policy via Flow-based Video Prediction
Chenyou Fan
Fangzheng Yan
Fuchun Sun
Jiepeng Wang
Fangqiu Yi
Zhen Wang
Xuelong Li
VGen
1.1K
2
0
30 May 2025
Revisiting Multi-Agent World Modeling from a Diffusion-Inspired Perspective
Revisiting Multi-Agent World Modeling from a Diffusion-Inspired Perspective
Yang Zhang
Xinran Li
Jianing Ye
Delin Qu
Delin Qu
Chongjie Zhang
Xiu Li
Chenjia Bai
341
4
0
27 May 2025
TeViR: Text-to-Video Reward with Diffusion Models for Efficient Reinforcement Learning
TeViR: Text-to-Video Reward with Diffusion Models for Efficient Reinforcement Learning
Yuhui Chen
Haoran Li
Zhennan Jiang
Haowei Wen
Dongbin Zhao
267
4
0
26 May 2025
RLVR-World: Training World Models with Reinforcement Learning
RLVR-World: Training World Models with Reinforcement Learning
Jialong Wu
Shaofeng Yin
Ningya Feng
Mingsheng Long
OffRLVGen
496
16
0
20 May 2025
DreamGen: Unlocking Generalization in Robot Learning through Video World Models
DreamGen: Unlocking Generalization in Robot Learning through Video World Models
Joel Jang
Seonghyeon Ye
Zongyu Lin
Jiannan Xiang
Johan Bjorck
...
Dieter Fox
Jan Kautz
Scott Reed
Yuke Zhu
Linxi Fan
VGenOffRLAI4TS
392
0
0
19 May 2025
Extracting Visual Plans from Unlabeled Videos via Symbolic Guidance
Extracting Visual Plans from Unlabeled Videos via Symbolic Guidance
Wenyan Yang
Ahmet Tikna
Yi Zhao
Yuying Zhang
Luigi Palopoli
Marco Roveri
Joni Pajarinen
VGen
323
1
0
13 May 2025
LaDi-WM: A Latent Diffusion-based World Model for Predictive Manipulation
LaDi-WM: A Latent Diffusion-based World Model for Predictive Manipulation
Yuhang Huang
JIazhao Zhang
SHilong Zou
Xinwang Liu
Ruizhen Hu
Kai Xu
525
7
0
13 May 2025
Pixel Motion as Universal Representation for Robot Control
Pixel Motion as Universal Representation for Robot Control
Kanchana Ranasinghe
Xiang Li
Cristina Mata
Cristina Mata
Michael S. Ryoo
Michael Ryoo
VGen
391
7
0
12 May 2025
VISTA: Generative Visual Imagination for Vision-and-Language Navigation
VISTA: Generative Visual Imagination for Vision-and-Language Navigation
Yanjia Huang
Mingyang Wu
Renjie Li
Zhengzhong Tu
LM&Ro
575
3
0
09 May 2025
CLAM: Continuous Latent Action Models for Robot Learning from Unlabeled Demonstrations
CLAM: Continuous Latent Action Models for Robot Learning from Unlabeled Demonstrations
Anthony Liang
Pavel Czempin
Matthew Hong
Yutai Zhou
Erdem Biyik
Stephen Tu
459
10
0
08 May 2025
Solving New Tasks by Adapting Internet Video Knowledge
Solving New Tasks by Adapting Internet Video KnowledgeInternational Conference on Learning Representations (ICLR), 2025
Calvin Luo
Zilai Zeng
Yilun Du
Chen Sun
235
12
0
21 Apr 2025
FlowLoss: Dynamic Flow-Conditioned Loss Strategy for Video Diffusion Models
FlowLoss: Dynamic Flow-Conditioned Loss Strategy for Video Diffusion Models
Kuanting Wu
Kei Ota
Asako Kanezaki
DiffMVGen
346
0
0
20 Apr 2025
Diffusion Models for Robotic Manipulation: A Survey
Diffusion Models for Robotic Manipulation: A SurveyFrontiers in Robotics and AI (Front. Robot. AI), 2025
Rosa Wolf
Yitian Shi
Sheng Liu
Rania Rayyes
512
25
0
11 Apr 2025
AdaWorld: Learning Adaptable World Models with Latent Actions
AdaWorld: Learning Adaptable World Models with Latent Actions
Shenyuan Gao
Siyuan Zhou
Yilun Du
Jun Zhang
Chuang Gan
VGen
555
35
0
24 Mar 2025
Unified Video Action Model
Unified Video Action Model
Shuang Li
Yihuai Gao
Dorsa Sadigh
Shuran Song
VGen
685
65
0
28 Feb 2025
Self-Consistent Model-based Adaptation for Visual Reinforcement Learning
Self-Consistent Model-based Adaptation for Visual Reinforcement LearningInternational Joint Conference on Artificial Intelligence (IJCAI), 2025
Xinning Zhou
Chengyang Ying
Yao Feng
Hang Su
Jun Zhu
225
0
0
17 Feb 2025
VILP: Imitation Learning with Latent Video Planning
VILP: Imitation Learning with Latent Video PlanningIEEE Robotics and Automation Letters (IEEE RA-L), 2025
Zhengtong Xu
Qiang Qiu
Yu She
VGen
280
5
0
03 Feb 2025
12
Next