Behavior Generation with Latent Actions

5 March 2024
Seungjae Lee
Yibin Wang
Haritheja Etukuru
H. J. Kim
Mahi Shafiullah
Lerrel Pinto
VGen, OffRL

Papers citing "Behavior Generation with Latent Actions"

50 / 87 papers shown
Title
MAPS: Preserving Vision-Language Representations via Module-Wise Proximity Scheduling for Better Vision-Language-Action Generalization
Chengyue Huang
Mellon M. Zhang
Robert Azarcon
Glen Chou
Z. Kira
VLM
84
0
0
25 Nov 2025
ViPRA: Video Prediction for Robot Actions
Sandeep Routray
Hengkai Pan
Unnat Jain
Shikhar Bahl
Deepak Pathak
166
0
0
11 Nov 2025
Temporal Action Selection for Action Chunking
Yueyang Weng
Xiaopeng Zhang
Yongjin Mu
Yingcong Zhu
Yanjie Li
Qi Liu
96
0
0
06 Nov 2025
XR-1: Towards Versatile Vision-Language-Action Models via Learning Unified Vision-Motion Representations
Shichao Fan
K. Wu
Zhengping Che
X. Wang
Di Wu
...
M. M. Li
Qingjie Liu
Shanghang Zhang
Min Wan
Yong Dai
172
0
0
04 Nov 2025
Efficient Vision-Language-Action Models for Embodied Manipulation: A Systematic Survey
Weifan Guan
Qinghao Hu
Aosheng Li
Jian Cheng
LM&Ro
294
3
0
20 Oct 2025
Fast Visuomotor Policy for Robotic Manipulation
Jingkai Jia
Tong Yang
Xueyao Chen
Chenhuan Liu
Wenqiang Zhang
76
0
0
14 Oct 2025
HiMaCon: Discovering Hierarchical Manipulation Concepts from Unlabeled Multi-Modal Data
Ruizhe Liu
Pei Zhou
Qian Luo
Li Sun
Jun Cen
Yibing Song
Yanchao Yang
SSL
271
0
0
13 Oct 2025
Action Deviation-Aware Inference for Low-Latency Wireless Robots
Jeyoung Park
Yeonsub Lim
Seungeun Oh
Jihong Park
Jinho Choi
Seong-Lyun Kim
78
0
0
03 Oct 2025
CroSTAta: Cross-State Transition Attention Transformer for Robotic Manipulation
Giovanni Minelli
Giulio Turrisi
Victor Barasuol
Claudio Semini
72
0
0
01 Oct 2025
HAMLET: Switch your Vision-Language-Action Model into a History-Aware Policy
Myungkyu Koo
Daewon Choi
Taeyoung Kim
Kyungmin Lee
Changyeon Kim
Younggyo Seo
Jinwoo Shin
LM&Ro, VLM
133
0
0
01 Oct 2025
PhysiAgent: An Embodied Agent Framework in Physical World
Zhihao Wang
Jianxiong Li
Jinliang Zheng
Wencong Zhang
Dongxiu Liu
Yinan Zheng
Haoyi Niu
Junzhi Yu
Xianyuan Zhan
LM&Ro
165
2
0
29 Sep 2025
Normalizing Flows are Capable Visuomotor Policy Learning Models
Simon Kristoffersson Lind
Jialong Li
Maj Stenmark
Volker Kruger
141
0
0
25 Sep 2025
World4RL: Diffusion World Models for Policy Refinement with Reinforcement Learning for Robotic Manipulation
Zhennan Jiang
Kai Liu
Yuxin Qin
Shuai Tian
Yupeng Zheng
Mingcai Zhou
Chao Yu
Haoran Li
Dongbin Zhao
73
2
0
23 Sep 2025
MV-UMI: A Scalable Multi-View Interface for Cross-Embodiment Learning
Omar Rayyan
John Abanes
Mahmoud Hafez
Anthony Tzes
Fares Abu-Dakka
52
0
0
23 Sep 2025
Learning Dexterous Manipulation with Quantized Hand State
Ying Feng
Hongjie Fang
Yinong He
Jingjing Chen
Chenxi Wang
Zihao He
Ruonan Liu
Cewu Lu
100
0
0
22 Sep 2025
GeoAware-VLA: Implicit Geometry Aware Vision-Language-Action Model
Ali Abouzeid
Malak Mansour
Zezhou Sun
Dezhen Song
272
1
0
17 Sep 2025
CogVLA: Cognition-Aligned Vision-Language-Action Model via Instruction-Driven Routing & Sparsification
Wei Li
Renshan Zhang
Rui Shao
Jie He
Liqiang Nie
VLM
165
15
0
28 Aug 2025
Robotic Manipulation via Imitation Learning: Taxonomy, Evolution, Benchmark, and Challenges
Zezeng Li
Alexandre Chapin
Enda Xiang
Rui Yang
Bruno Machado
Na Lei
Emmanuel Dellandrea
Di Huang
Liming Chen
195
2
0
24 Aug 2025
Survey of Vision-Language-Action Models for Embodied Manipulation
Haoran Li
Yuhui Chen
Wenbo Cui
Weiheng Liu
Kai Liu
Mingcai Zhou
Zhengtao Zhang
Dongbin Zhao
LM&Ro
328
3
0
21 Aug 2025
Self-Guided Action Diffusion
Rhea Malhotra
Yuejiang Liu
Chelsea Finn
48
1
0
17 Aug 2025
OmniD: Generalizable Robot Manipulation Policy via Image-Based BEV Representation
Jilei Mao
Jiarui Guan
Yingjuan Tang
Qirui Hu
Zhihang Li
Junjie Yu
Yongjie Mao
Yunzhe Sun
Shuang Liu
Xiaozhu Ju
60
2
0
16 Aug 2025
GBC: Generalized Behavior-Cloning Framework for Whole-Body Humanoid Imitation
Yifei Yao
Chengyuan Luo
Jiaheng Du
Wentao He
Jun-Guo Lu
84
0
0
13 Aug 2025
CLASS: Contrastive Learning via Action Sequence Supervision for Robot Manipulation
Sung-Wook Lee
Xuhui Kang
Brandon Yang
Yen-Ling Kuo
SSL
134
2
0
03 Aug 2025
ParticleFormer: A 3D Point Cloud World Model for Multi-Object, Multi-Material Robotic Manipulation
Suning Huang
Qianzhong Chen
Xiaohan Zhang
J. Sun
Mac Schwager
166
5
0
29 Jun 2025
Touch begins where vision ends: Generalizable policies for contact-rich manipulation
Zifan Zhao
Siddhant Haldar
Jinda Cui
Lerrel Pinto
Raunaq M. Bhirangi
OffRL
193
3
0
16 Jun 2025
Adapting by Analogy: OOD Generalization of Visuomotor Policies via Functional Correspondence
Pranay Gupta
H. Admoni
Andrea Bajcsy
117
2
0
15 Jun 2025
Real-Time Execution of Action Chunking Flow Policies
Kevin Black
Manuel Y. Galliker
Sergey Levine
OffRL
341
27
0
09 Jun 2025
BEAST: Efficient Tokenization of B-Splines Encoded Action Sequences for Imitation Learning
Hongyi Zhou
Weiran Liao
Xi Huang
Yucheng Tang
Fabian Otto
...
Qian Wang
Ömer Erdinç Yagmurlu
Nils Blank
Moritz Reuss
Rudolf Lioutikov
293
2
0
06 Jun 2025
STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization
Hao Li
Qi Lv
Rui Shao
Xiang Deng
Yinchuan Li
Jianye Hao
Liqiang Nie
362
5
0
04 Jun 2025
Normalizing Flows are Capable Models for RL
Raj Ghugare
Benjamin Eysenbach
OffRL, AI4CE
306
4
0
29 May 2025
Prior Reinforce: Mastering Agile Tasks with Limited Trials
Yihang Hu
Pingyue Sheng
Shengjie Wang
Yang Gao
196
0
0
28 May 2025
OSVI-WM: One-Shot Visual Imitation for Unseen Tasks using World-Model-Guided Trajectory Generation
Raktim Gautam Goswami
Prashanth Krishnamurthy
Yann LeCun
Farshad Khorrami
VGen, OffRL
218
4
0
26 May 2025
Grounding Bodily Awareness in Visual Representations for Efficient Policy Learning
Junlin Wang
Zhiyun Lin
1.4K
0
0
24 May 2025
ManiFeel: Benchmarking and Understanding Visuotactile Manipulation Policy Learning
Quan Khanh Luu
Pokuang Zhou
Zhengtong Xu
Zhiyuan Zhang
Qiang Qiu
Yu She
137
1
0
24 May 2025
Canonical Policy: Learning Canonical 3D Representation for SE(3)-Equivariant Policy
Zhiyuan Zhang
Zhengtong Xu
Jai Nanda Lakamsani
Yu She
SSL, LM&Ro, 3DPC
179
1
0
24 May 2025
H$^3$DP: Triply-Hierarchical Diffusion Policy for Visuomotor Learning
Yiyang Lu
Yufeng Tian
Zhecheng Yuan
Xinyu Wang
Pu Hua
Zhengrong Xue
Huazhe Xu
309
4
0
12 May 2025
UniVLA: Learning to Act Anywhere with Task-centric Latent Actions
Robotics (RAS), 2025
Qingwen Bu
Yanting Yang
Jisong Cai
Shenyuan Gao
Guanghui Ren
Maoqing Yao
Ping Luo
Hongyang Li
757
84
0
09 May 2025
Benchmarking Vision, Language, & Action Models in Procedurally Generated, Open Ended Action Environments
Pranav Guruprasad
Yangyue Wang
Sudipta Chowdhury
Harshvardhan Sikka
Paul Pu Liang
LM&Ro, VLM
1.0K
2
0
08 May 2025
Task Reconstruction and Extrapolation for $π_0$ using Text Latent
Quanyi Li
500
2
0
06 May 2025
DiffOG: Differentiable Policy Trajectory Optimization with Generalizability
Zhengtong Xu
Zichen Miao
Qiang Qiu
Zhe Zhang
Yu She
454
0
0
18 Apr 2025
Towards Forceful Robotic Foundation Models: a Literature Survey
William Xie
N. Correll
OffRL
252
5
0
16 Apr 2025
Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets
Chuning Zhu
Raymond Yu
S. Feng
Benjamin Burchfiel
Paarth Shah
Abhishek Gupta
VGen
401
34
0
03 Apr 2025
Empirical Analysis of Sim-and-Real Cotraining of Diffusion Policies for Planar Pushing from Pixels
Adam Wei
Abhinav Agarwal
Boyuan Chen
Rohan Bosworth
Nicholas Pfaff
Russ Tedrake
232
12
0
28 Mar 2025
Boosting Robotic Manipulation Generalization with Minimal Costly Data
Liming Zheng
Feng Yan
Fanfan Liu
C. Feng
Yufeng Zhong
Yiyang Huang
304
1
0
25 Mar 2025
Quantization-Free Autoregressive Action Transformer
Ziyad Sheebaelhamd
Michael Tschannen
Michael Muehlebach
Claire Vernade
208
1
0
18 Mar 2025
Curating Demonstrations using Online Experience
Annie S. Chen
Alec M. Lessing
Yuejiang Liu
Chelsea Finn
227
8
0
05 Mar 2025
Accelerating Vision-Language-Action Model Integrated with Action Chunking via Parallel Decoding
Wenxuan Song
Jiayi Chen
Pengxiang Ding
Han Zhao
Wei Zhao
Zhide Zhong
Zongyuan Ge
Jun Ma
Haoang Li
222
30
0
04 Mar 2025
Point Policy: Unifying Observations and Actions with Key Points for Robot Manipulation
Siddhant Haldar
Lerrel Pinto
3DPC
287
20
0
27 Feb 2025
VaViM and VaVAM: Autonomous Driving through Video Generative Modeling
Florent Bartoccioni
Elias Ramzi
Victor Besnier
Shashanka Venkataramanan
Tuan-Hung Vu
...
Mickael Chen
Éloi Zablocki
Andrei Bursuc
Eduardo Valle
Matthieu Cord
VGen
288
8
0
24 Feb 2025
X-IL: Exploring the Design Space of Imitation Learning Policies
Xiaogang Jia
Atalay Donat
Xi Huang
Xuan Zhao
Denis Blessing
...
Han A. Wang
Hanyi Zhang
Qian Wang
Rudolf Lioutikov
Gerhard Neumann
244
3
0
20 Feb 2025