Fast Visuomotor Policies via Partial Denoising

1 March 2025

Abstract

Diffusion policies are widely adopted in complex visuomotor tasks for their ability to capture multimodal action distributions. However, the multiple sampling steps required for action generation significantly harm real-time inference efficiency, which limits their applicability in long-horizon tasks and real-time decision-making scenarios. Existing acceleration techniques reduce sampling steps by approximating the original denoising process but inevitably introduce unacceptable performance loss. Here we propose Falcon, which mitigates this trade-off and achieves further acceleration. The core insight is that visuomotor tasks exhibit sequential dependencies between actions at consecutive time steps. Falcon leverages this property to avoid denoising from a standard normal distribution at each decision step. Instead, it starts denoising from partial denoised actions derived from historical information to significantly reduce the denoising steps while incorporating current observations to achieve performance-preserving acceleration of action generation. Importantly, Falcon is a training-free algorithm that can be applied as a plug-in to further improve decision efficiency on top of existing acceleration techniques. We validated Falcon in 46 simulated environments, demonstrating a 2-7x speedup with negligible performance degradation, offering a promising direction for efficient visuomotor policy design.

View on arXiv

@article{chen2025_2503.00339,
  title={ Fast Visuomotor Policies via Partial Denoising },
  author={ Haojun Chen and Minghao Liu and Xiaojian Ma and Zailin Ma and Huimin Wu and Chengdong Ma and Yuanpei Chen and Yifan Zhong and Mingzhi Wang and Qing Li and Yaodong Yang },
  journal={arXiv preprint arXiv:2503.00339},
  year={ 2025 }
}

Comments on this paper