20

OnlineHMR: Video-based Online World-Grounded Human Mesh Recovery

Yiwen Zhao
Ce Zheng
Yufu Wang
Hsueh-Han Daniel Yang
Liting Wen
Laszlo A. Jeni
Main:8 Pages
15 Figures
Bibliography:3 Pages
8 Tables
Appendix:4 Pages
Abstract

Human mesh recovery (HMR) models 3D human body from monocular videos, with recent works extending it to world-coordinate human trajectory and motion reconstruction. However, most existing methods remain offline, relying on future frames or global optimization, which limits their applicability in interactive feedback and perception-action loop scenarios such as AR/VR and telepresence. To address this, we propose OnlineHMR, a fully online framework that jointly satisfies four essential criteria of online processing, including system-level causality, faithfulness, temporal consistency, and efficiency. Built upon a two-branch architecture, OnlineHMR enables streaming inference via a causal key-value cache design and a curated sliding-window learning strategy. Meanwhile, a human-centric incremental SLAM provides online world-grounded alignment under physically plausible trajectory correction. Experimental results show that our method achieves performance comparable to existing chunk-based approaches on the standard EMDB benchmark and highly dynamic custom videos, while uniquely supporting online processing. Page and code are available atthis https URL.

View on arXiv
Comments on this paper