2204.06252
Cited By
What Matters in Language Conditioned Robotic Imitation Learning over Unstructured Data
13 April 2022
Oier Mees
Lukás Hermann
Wolfram Burgard
LM&Ro
Papers citing
"What Matters in Language Conditioned Robotic Imitation Learning over Unstructured Data"
50 / 105 papers shown
Title
ViSA-Flow: Accelerating Robot Skill Learning via Large-Scale Video Semantic Action Flow
Changhe Chen
Quantao Yang
Xiaohao Xu
Nima Fazeli
Olov Andersson
22
0
0
02 May 2025
GPA-RAM: Grasp-Pretraining Augmented Robotic Attention Mamba for Spatial Task Learning
Juyi Sheng
Yangjun Liu
Sheng Xu
Zhixin Yang
Mengyuan Liu
51
0
0
28 Apr 2025
CIVIL: Causal and Intuitive Visual Imitation Learning
Yinlong Dai
Robert Ramirez Sanchez
Ryan Jeronimus
Shahabedin Sagheb
Cara M. Nunez
Heramb Nemlekar
Dylan P. Losey
58
0
0
24 Apr 2025
Data-Agnostic Robotic Long-Horizon Manipulation with Vision-Language-Guided Closed-Loop Feedback
Y. Meng
Xiangtong Yao
Haihui Ye
Yirui Zhou
Shengqiang Zhang
Zhenshan Bing
Alois C. Knoll
LM&Ro
VLM
50
0
0
27 Mar 2025
RoboFlamingo-Plus: Fusion of Depth and RGB Perception with Vision-Language Models for Enhanced Robotic Manipulation
Sheng Wang
VLM
76
2
0
25 Mar 2025
Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy
Zhi Hou
Tianyi Zhang
Yuwen Xiong
Haonan Duan
Hengjun Pu
...
Chengyang Zhao
X. Zhu
Yu Qiao
Jifeng Dai
Y. Chen
59
0
0
25 Mar 2025
LUMOS: Language-Conditioned Imitation Learning with World Models
Iman Nematollahi
Branton DeMoss
Akshay L Chandra
Nick Hawes
Wolfram Burgard
Ingmar Posner
OffRL
43
0
0
13 Mar 2025
MetaFold: Language-Guided Multi-Category Garment Folding Framework via Trajectory Generation and Foundation Model
Haonan Chen
Junxiao Li
Ruihai Wu
Yiwei Liu
Yiwen Hou
...
Chongkai Gao
Zhenyu Wei
Shensi Xu
Jiaqi Huang
Lin Shao
AI4CE
44
1
0
11 Mar 2025
Learning Dexterous In-Hand Manipulation with Multifingered Hands via Visuomotor Diffusion
Piotr Koczy
Michael C. Welle
Danica Kragic
48
0
0
04 Mar 2025
Accelerating Vision-Language-Action Model Integrated with Action Chunking via Parallel Decoding
Wenxuan Song
Jiayi Chen
Pengxiang Ding
H. Zhao
Wei Zhao
Zhide Zhong
Zongyuan Ge
Jun Ma
Haoang Li
43
2
0
04 Mar 2025
Data Augmentation for Instruction Following Policies via Trajectory Segmentation
Niklas Höpner
Ilaria Tiddi
H. V. Hoof
39
0
0
25 Feb 2025
VLAS: Vision-Language-Action Model With Speech Instructions For Customized Robot Manipulation
Wei Zhao
Pengxiang Ding
M. Zhang
Zhefei Gong
Shuanghao Bai
H. Zhao
Donglin Wang
85
4
0
24 Feb 2025
X-IL: Exploring the Design Space of Imitation Learning Policies
Xiaogang Jia
Atalay Donat
Xi Huang
Xuan Zhao
Denis Blessing
...
Han A. Wang
Hanyi Zhang
Qian Wang
Rudolf Lioutikov
Gerhard Neumann
75
1
0
20 Feb 2025
Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models
Xinghang Li
Peiyan Li
Minghuan Liu
Dong Wang
Jirong Liu
Bingyi Kang
Xiao Ma
Tao Kong
Hanbo Zhang
Huaping Liu
LM&Ro
88
14
0
18 Dec 2024
Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning
Moritz Reuss
Jyothish Pari
Pulkit Agrawal
Rudolf Lioutikov
DiffM
MoE
74
5
0
17 Dec 2024
RoboMM: All-in-One Multimodal Large Model for Robotic Manipulation
Feng Yan
Fanfan Liu
Liming Zheng
Yufeng Zhong
Yiyang Huang
Zechao Guan
Chengjian Feng
Lin Ma
82
2
0
10 Dec 2024
Inference-Time Policy Steering through Human Interactions
Yanwei Wang
Lirui Wang
Yilun Du
Balakumar Sundaralingam
Xuning Yang
Yu-Wei Chao
Claudia Pérez-D'Arpino
Dieter Fox
Julie Shah
VGen
99
4
0
25 Nov 2024
Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics
Taowen Wang
Dongfang Liu
James Liang
Wenhao Yang
Qifan Wang
Cheng Han
Jiebo Luo
Ruixiang Tang
AAML
74
2
0
18 Nov 2024
VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation
Youpeng Wen
Junfan Lin
Y. X. Zhu
J. Han
Hang Xu
Shen Zhao
Xiaodan Liang
VGen
DiffM
40
2
0
14 Nov 2024
STEER: Flexible Robotic Manipulation via Dense Language Grounding
Laura Smith
A. Irpan
Montserrat Gonzalez Arenas
Sean Kirmani
Dmitry Kalashnikov
Dhruv Shah
Ted Xiao
LLMSV
32
1
0
05 Nov 2024
DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution
Yang Yue
Yulin Wang
Bingyi Kang
Yizeng Han
Shenzhi Wang
Shiji Song
Jiashi Feng
Gao Huang
VLM
38
16
0
04 Nov 2024
GHIL-Glue: Hierarchical Control with Filtered Subgoal Images
Kyle Hatch
Ashwin Balakrishna
Oier Mees
Suraj Nair
Seohong Park
...
Masha Itkina
Benjamin Eysenbach
Sergey Levine
Thomas Kollar
Benjamin Burchfiel
50
2
0
26 Oct 2024
Scaling Robot Policy Learning via Zero-Shot Labeling with Foundation Models
Nils Blank
Moritz Reuss
Marcel Rühle
Ömer Erdinç Yagmurlu
Fabian Wenzel
Oier Mees
Rudolf Lioutikov
LM&Ro
OffRL
29
3
0
23 Oct 2024
VL-GLUE: A Suite of Fundamental yet Challenging Visuo-Linguistic Reasoning Tasks
Shailaja Keyur Sampat
Mutsumi Nakamura
Shankar Kailas
Kartik Aggarwal
Mandy Zhou
Yezhou Yang
Chitta Baral
MLLM
CoGe
ReLM
VLM
LRM
24
0
0
17 Oct 2024
Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance
Mitsuhiko Nakamoto
Oier Mees
Aviral Kumar
Sergey Levine
OffRL
71
9
0
17 Oct 2024
Incorporating Task Progress Knowledge for Subgoal Generation in Robotic Manipulation through Image Edits
Xuhui Kang
Yen-Ling Kuo
24
3
0
14 Oct 2024
The Ingredients for Robotic Diffusion Transformers
Sudeep Dasari
Oier Mees
Sebastian Zhao
M. K. Srirama
Sergey Levine
46
19
0
14 Oct 2024
Towards Synergistic, Generalized, and Efficient Dual-System for Robotic Manipulation
Qingwen Bu
Hongyang Li
Li Chen
Jisong Cai
Jia Zeng
Heming Cui
Maoqing Yao
Yu Qiao
34
2
0
10 Oct 2024
PREDICT: Preference Reasoning by Evaluating Decomposed preferences Inferred from Candidate Trajectories
Stephane Aroca-Ouellette
Natalie Mackraz
B. Theobald
Katherine Metcalf
28
0
0
08 Oct 2024
GR-2: A Generative Video-Language-Action Model with Web-Scale Knowledge for Robot Manipulation
Chi-Lam Cheang
Guangzeng Chen
Ya Jing
Tao Kong
Hang Li
...
Hongtao Wu
Jiafeng Xu
Yichu Yang
Hanbo Zhang
Minzhao Zhu
VGen
LM&Ro
58
48
0
08 Oct 2024
LeLaN: Learning A Language-Conditioned Navigation Policy from In-the-Wild Videos
Noriaki Hirose
Catherine Glossop
A. Sridhar
Dhruv Shah
Oier Mees
Sergey Levine
LM&Ro
29
10
0
04 Oct 2024
SEAL: SEmantic-Augmented Imitation Learning via Language Model
Chengyang Gu
Yuxin Pan
Haotian Bai
Hui Xiong
Yize Chen
27
0
0
03 Oct 2024
Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning
Jianxiong Li
Zhihao Wang
Jinliang Zheng
Xiaoai Zhou
Guanming Wang
...
Yu Liu
Jingjing Liu
Ya-Qin Zhang
Junzhi Yu
Xianyuan Zhan
31
2
0
02 Oct 2024
GR-MG: Leveraging Partially Annotated Data via Multi-Modal Goal Conditioned Policy
Peiyan Li
Hongtao Wu
Yan Huang
Chilam Cheang
Liang Wang
Tao Kong
VGen
46
11
0
26 Aug 2024
All Robots in One: A New Standard and Unified Dataset for Versatile, General-Purpose Embodied Agents
Zhiqiang Wang
Hao Zheng
Yunshuang Nie
Wenjun Xu
Qingwei Wang
...
Wanxi Dong
Chang Cai
Liang Lin
Feng Zheng
Xiaodan Liang
22
4
0
20 Aug 2024
Autonomous Improvement of Instruction Following Skills via Foundation Models
Zhiyuan Zhou
P. Atreya
Abraham Lee
Homer Walke
Oier Mees
Sergey Levine
30
8
0
30 Jul 2024
LIMT: Language-Informed Multi-Task Visual World Models
Elie Aljalbout
Nikolaos Sotirakis
Patrick van der Smagt
Maximilian Karl
Nutan Chen
24
5
0
18 Jul 2024
Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals
Moritz Reuss
Ömer Erdinç Yagmurlu
Fabian Wenzel
Rudolf Lioutikov
OffRL
23
39
0
08 Jul 2024
RoboUniView: Visual-Language Model with Unified View Representation for Robotic Manipulation
Fanfan Liu
Feng Yan
Liming Zheng
Chengjian Feng
Yiyang Huang
Lin Ma
LM&Ro
21
11
0
27 Jun 2024
CogExplore: Contextual Exploration with Language-Encoded Environment Representations
Harel Biggie
Patrick Cooper
Doncey Albin
Kristen Such
Christoffer Heckman
LM&Ro
30
0
0
24 Jun 2024
Enabling robots to follow abstract instructions and complete complex dynamic tasks
Ruaridh Mon-Williams
Gen Li
Ran Long
Wenqian Du
Chris Lucas
LM&Ro
39
3
0
17 Jun 2024
Scaling Manipulation Learning with Visual Kinematic Chain Prediction
Xinyu Zhang
Yuhan Liu
Haonan Chang
Abdeslam Boularias
44
1
0
12 Jun 2024
Grasp as You Say: Language-guided Dexterous Grasp Generation
Yi-Lin Wei
Jian-Jian Jiang
Chengyi Xing
Xiantuo Tan
Xiao-Ming Wu
Hao Li
M. Cutkosky
Wei-Shi Zheng
43
13
0
29 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
64
38
0
23 May 2024
Octo: An Open-Source Generalist Robot Policy
Octo Model Team
Dibya Ghosh
Homer Walke
Karl Pertsch
Kevin Black
...
Quan Vuong
Ted Xiao
Dorsa Sadigh
Chelsea Finn
Sergey Levine
55
333
0
20 May 2024
From LLMs to Actions: Latent Codes as Bridges in Hierarchical Robot Control
Yide Shentu
Philipp Wu
Aravind Rajeswaran
Pieter Abbeel
27
9
0
08 May 2024
Learning Planning Abstractions from Language
Weiyu Liu
Geng Chen
Joy Hsu
Jiayuan Mao
Jiajun Wu
PINN
27
2
0
06 May 2024
RT-H: Action Hierarchies Using Language
Suneel Belkhale
Tianli Ding
Ted Xiao
P. Sermanet
Quan Vuong
Jonathan Tompson
Yevgen Chebotar
Debidatta Dwibedi
Dorsa Sadigh
LM&Ro
26
73
0
04 Mar 2024
DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning
Jianxiong Li
Jinliang Zheng
Yinan Zheng
Liyuan Mao
Xiaoming Hu
...
Jihao Liu
Yu Liu
Jingjing Liu
Ya-Qin Zhang
Xianyuan Zhan
LM&Ro
OffRL
29
8
0
28 Feb 2024
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations
Tsung-Wei Ke
N. Gkanatsios
Katerina Fragkiadaki
VGen
28
102
0
16 Feb 2024