Reference Free Platform Adaptive Locomotion for Quadrupedal Robots using a Dynamics Conditioned Policy

21 May 2025

Abstract

This article presents Platform Adaptive Locomotion (PAL), a unified control method for quadrupedal robots with different morphologies and dynamics. We leverage deep reinforcement learning to train a single locomotion policy on procedurally generated robots. The policy maps proprioceptive robot state information and base velocity commands into desired joint actuation targets, which are conditioned using a latent embedding of the temporally local system dynamics. We explore two conditioning strategies - one using a GRU-based dynamics encoder and another using a morphology-based property estimator - and show that morphology-aware conditioning outperforms temporal dynamics encoding regarding velocity task tracking for our hardware test on ANYmal C. Our results demonstrate that both approaches achieve robust zero-shot transfer across multiple unseen simulated quadrupeds. Furthermore, we demonstrate the need for careful robot reference modelling during training, enabling us to reduce the velocity tracking error by up to 30% compared to the baseline method. Despite PAL not surpassing the best-performing reference-free controller in all cases, our analysis uncovers critical design choices and informs improvements to the state of the art.

View on arXiv

@article{rytz2025_2505.16042,
  title={ Reference Free Platform Adaptive Locomotion for Quadrupedal Robots using a Dynamics Conditioned Policy },
  author={ David Rytz and Suyoung Choi and Wanming Yu and Wolfgang Merkt and Jemin Hwangbo and Ioannis Havoutis },
  journal={arXiv preprint arXiv:2505.16042},
  year={ 2025 }
}

Comments on this paper