Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2011.01968
Cited By
v1
v2 (latest)
Learning 3D Dynamic Scene Representations for Robot Manipulation
3 November 2020
Zhenjia Xu
Zhanpeng He
Jiajun Wu
Shuran Song
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Learning 3D Dynamic Scene Representations for Robot Manipulation"
44 / 44 papers shown
Visual-Geometry Diffusion Policy: Robust Generalization via Complementarity-Aware Multimodal Fusion
Yikai Tang
Haoran Geng
Sheng Zang
Pieter Abbeel
Jitendra Malik
93
2
0
27 Nov 2025
Vision-Language Memory for Spatial Reasoning
Zuntao Liu
Yi Du
Taimeng Fu
Shaoshu Su
Cherie Ho
Chen Wang
VLM
LRM
374
0
0
25 Nov 2025
CORE-3D: Context-aware Open-vocabulary Retrieval by Embeddings in 3D
Mohamad Amin Mirzaei
Pantea Amoie
Ali Ekhterachian
Matin Mirzababaei
Babak Khalaj
3DPC
311
2
0
29 Sep 2025
FUNCanon: Learning Pose-Aware Action Primitives via Functional Object Canonicalization for Generalizable Robotic Manipulation
Hongli Xu
Lei Zhang
Xiaoyue Hu
Boyang Zhong
Kaixin Bai
Zoltán-Csaba Márton
Zhenshan Bing
Zhaopeng Chen
Alois Knoll
Jianwei Zhang
LM&Ro
186
3
0
23 Sep 2025
UNO: Unifying One-stage Video Scene Graph Generation via Object-Centric Visual Representation Learning
Huy Le
Nhat Chung
Tung Kieu
Jingkang Yang
Ngan Le
VOS
OCL
575
1
0
07 Sep 2025
HyperTASR: Hypernetwork-Driven Task-Aware Scene Representations for Robust Manipulation
Li Sun
Jiefeng Wu
Feng Chen
Ruizhe Liu
Yanchao Yang
302
2
0
26 Aug 2025
AntiGrounding: Lifting Robotic Actions into VLM Representation Space for Decision Making
Wenbo Li
Shiyi Wang
Yiteng Chen
Huiping Zhuang
Qingyao Wu
362
0
0
14 Jun 2025
FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation
Jun Guo
Xiaojian Ma
Yikai Wang
Min Yang
Huaping Liu
Qing Li
VGen
387
12
0
15 May 2025
Vision-Language Model Predictive Control for Manipulation Planning and Trajectory Generation
Jiaming Chen
Wentao Zhao
Ziyu Meng
Donghui Mao
Ran Song
Wei Pan
Wei Zhang
396
2
0
07 Apr 2025
Estimating Scene Flow in Robot Surroundings with Distributed Miniaturized Time-of-Flight Sensors
Jack Sander
Giammarco Caroleo
A. Albini
P. Maiolino
376
1
0
03 Apr 2025
Leveraging Foundation Models To learn the shape of semi-fluid deformable objects
Omar El Assal
Carlos M. Mateo
Sebastien Ciron
David Fofi
304
0
0
25 Nov 2024
PhysFlow: Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation
Computer Vision and Pattern Recognition (CVPR), 2024
Zhuoman Liu
Weicai Ye
Yan Luximon
Pengfei Wan
Di Zhang
VGen
AI4CE
582
22
0
21 Nov 2024
DEL: Discrete Element Learner for Learning 3D Particle Dynamics with Neural Rendering
Neural Information Processing Systems (NeurIPS), 2024
Jiaxu Wang
Jingkai Sun
Junhao He
Ziyi Zhang
Changwei Wang
Mingyuan Sun
Zhanchen Zhu
AI4CE
355
3
0
11 Oct 2024
Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models
Xiaoyu Zhu
Hao Zhou
Pengfei Xing
Long Zhao
Hao Xu
Junwei Liang
Alex Hauptmann
Ting Liu
Andrew C. Gallagher
DiffM
416
14
0
18 Jul 2024
VLMPC: Vision-Language Model Predictive Control for Robotic Manipulation
Wentao Zhao
Jiaming Chen
Ziyu Meng
Donghui Mao
Ran Song
Wei Zhang
367
36
0
13 Jul 2024
Volumetric Environment Representation for Vision-Language Navigation
Rui Liu
Wenguan Wang
Yi Yang
322
77
0
21 Mar 2024
Grasp, See, and Place: Efficient Unknown Object Rearrangement with Policy Structure Prior
Kechun Xu
Zhongxiang Zhou
Jun Wu
Haojian Lu
Rong Xiong
Yue Wang
420
11
0
23 Feb 2024
Pyramid Diffusion for Fine 3D Large Scene Generation
Yuheng Liu
Xinke Li
Xueting Li
Lu Qi
Chongshou Li
Ming-Hsuan Yang
465
43
0
20 Nov 2023
Teaching Robots to Build Simulations of Themselves
Yuhang Hu
Jiong Lin
Hod Lipson
SSL
501
14
0
20 Nov 2023
Neural Field Dynamics Model for Granular Object Piles Manipulation
Conference on Robot Learning (CoRL), 2023
Shangjie Xue
Shuo Cheng
Pujith Kachana
Danfei Xu
AI4CE
366
15
0
01 Nov 2023
Stanford-ORB: A Real-World 3D Object Inverse Rendering Benchmark
Neural Information Processing Systems (NeurIPS), 2023
Zhengfei Kuang
Yunzhi Zhang
Hong-Xing Yu
Samir Agarwala
Shangzhe Wu
Jiajun Wu
398
55
0
24 Oct 2023
Out of Sight, Still in Mind: Reasoning and Planning about Unobserved Objects with Video Tracking Enabled Memory Models
IEEE International Conference on Robotics and Automation (ICRA), 2023
Yixuan Huang
Jialin Yuan
Chanho Kim
Pupul Pradhan
Bryan Chen
Fuxin Li
Tucker Hermans
475
12
0
26 Sep 2023
3D-IntPhys: Towards More Generalized 3D-grounded Visual Intuitive Physics under Challenging Scenes
Haotian Xue
Antonio Torralba
J. Tenenbaum
Daniel L. K. Yamins
Yunzhu Li
H. Tung
PINN
VGen
AI4CE
279
14
0
22 Apr 2023
Perceiving Unseen 3D Objects by Poking the Objects
IEEE International Conference on Robotics and Automation (ICRA), 2023
Linghao Chen
Yunzhou Song
Hujun Bao
Xiaowei Zhou
301
10
0
26 Feb 2023
Failure-aware Policy Learning for Self-assessable Robotics Tasks
IEEE International Conference on Robotics and Automation (ICRA), 2023
Kechun Xu
Runjian Chen
Shuqing Zhao
Zizhang Li
Hongxiang Yu
Ci Chen
Yue Wang
R. Xiong
308
2
0
25 Feb 2023
RoboNinja: Learning an Adaptive Cutting Policy for Multi-Material Objects
Zhenjia Xu
Zhou Xian
Xingyu Lin
Cheng Chi
Zhiao Huang
Chuang Gan
Shuran Song
268
39
0
22 Feb 2023
Object-Centric Scene Representations using Active Inference
Neural Computation (Neural Comput.), 2023
Toon Van de Maele
Tim Verbelen
Pietro Mazzaglia
Stefano Ferraro
Bart Dhoedt
OCL
BDL
288
5
0
07 Feb 2023
A Review of Scene Representations for Robot Manipulators
Carter Sifferman
LM&Ro
SSL
245
0
0
22 Dec 2022
LOPR: Latent Occupancy PRediction using Generative Models
Bernard Lange
Masha Itkina
Mykel J. Kochenderfer
AI4CE
491
9
0
03 Oct 2022
T3VIP: Transformation-based 3D Video Prediction
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2022
Iman Nematollahi
Erick Rosete-Beas
Seyed Mahdi B. Azad
Raghunandan Rajan
Katharina Eggensperger
Wolfram Burgard
VGen
362
1
0
19 Sep 2022
DualAfford: Learning Collaborative Visual Affordance for Dual-gripper Manipulation
International Conference on Learning Representations (ICLR), 2022
Yan Zhao
Kai Cheng
Zhehuan Chen
Yourong Zhang
Qingnan Fan
Kaichun Mo
Hao Dong
555
29
0
05 Jul 2022
Play it by Ear: Learning Skills amidst Occlusion through Audio-Visual Imitation Learning
Maximilian Du
Olivia Y. Lee
Suraj Nair
Chelsea Finn
OffRL
310
46
0
30 May 2022
Revealing Occlusions with 4D Neural Fields
Computer Vision and Pattern Recognition (CVPR), 2022
Basile Van Hoorick
Purva Tendulkar
Dídac Surís
Dennis Park
Simon Stent
Carl Vondrick
223
23
0
22 Apr 2022
CaDeX: Learning Canonical Deformation Coordinate Space for Dynamic Surface Representation via Neural Homeomorphism
Computer Vision and Pattern Recognition (CVPR), 2022
Jiahui Lei
Kostas Daniilidis
286
71
0
30 Mar 2022
ACID: Action-Conditional Implicit Visual Dynamics for Deformable Object Manipulation
Bokui Shen
Zhenyu Jiang
Chris Choy
Leonidas Guibas
Silvio Savarese
Anima Anandkumar
Yuke Zhu
AI4CE
410
53
0
14 Mar 2022
Iterative Residual Policy: for Goal-Conditioned Dynamic Manipulation of Deformable Objects
Cheng Chi
Benjamin Burchfiel
Eric A. Cousineau
S. Feng
Shuran Song
449
107
0
01 Mar 2022
Learning Multi-Object Dynamics with Compositional Neural Radiance Fields
Conference on Robot Learning (CoRL), 2022
Danny Driess
Zhiao Huang
Yunzhu Li
Russ Tedrake
Marc Toussaint
OCL
AI4CE
531
96
0
24 Feb 2022
A Survey on Machine Learning Approaches for Modelling Intuitive Physics
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Jiafei Duan
Arijit Dasgupta
Jason Fischer
Cheston Tan
AI4CE
LRM
417
29
0
14 Feb 2022
Scene Editing as Teleoperation: A Case Study in 6DoF Kit Assembly
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2021
Yulong Li
Shubh Agrawal
Jen-Shuo Liu
Steven K. Feiner
Shuran Song
442
14
0
09 Oct 2021
Learning Models as Functionals of Signed-Distance Fields for Manipulation Planning
Danny Driess
Jung-Su Ha
Marc Toussaint
Russ Tedrake
416
70
0
02 Oct 2021
Act the Part: Learning Interaction Strategies for Articulated Object Part Discovery
IEEE International Conference on Computer Vision (ICCV), 2021
S. Gadre
Kiana Ehsani
Shuran Song
473
64
0
03 May 2021
ManipulaTHOR: A Framework for Visual Object Manipulation
Computer Vision and Pattern Recognition (CVPR), 2021
Kiana Ehsani
Winson Han
Alvaro Herrasti
Eli VanderBilt
Luca Weihs
Eric Kolve
Aniruddha Kembhavi
Roozbeh Mottaghi
LM&Ro
978
157
0
22 Apr 2021
Multi-View Fusion for Multi-Level Robotic Scene Understanding
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2021
Yunzhi Lin
Jonathan Tremblay
Stephen Tyree
Patricio A. Vela
Stan Birchfield
3DPC
254
33
0
25 Mar 2021
Machine Learning for Robotic Manipulation
Q. Vuong
OOD
226
2
0
04 Jan 2021
1
Page 1 of 1