0
0

Memory-efficient Low-latency Remote Photoplethysmography through Temporal-Spatial State Space Duality

Kegang Wang
Jiankai Tang
Yuxuan Fan
Jiatong Ji
Yuanchun Shi
Yuntao Wang
Abstract

Remote photoplethysmography (rPPG), enabling non-contact physiological monitoring through facial light reflection analysis, faces critical computational bottlenecks as deep learning introduces performance gains at the cost of prohibitive resource demands. This paper proposes ME-rPPG, a memory-efficient algorithm built on temporal-spatial state space duality, which resolves the trilemma of model scalability, cross-dataset generalization, and real-time constraints. Leveraging a transferable state space, ME-rPPG efficiently captures subtle periodic variations across facial frames while maintaining minimal computational overhead, enabling training on extended video sequences and supporting low-latency inference. Achieving cross-dataset MAEs of 5.38 (MMPD), 0.70 (VitalVideo), and 0.25 (PURE), ME-rPPG outperforms all baselines with improvements ranging from 21.3% to 60.2%. Our solution enables real-time inference with only 3.6 MB memory usage and 9.46 ms latency -- surpassing existing methods by 19.5%-49.7% accuracy and 43.2% user satisfaction gains in real-world deployments. The code and demos are released for reproducibility onthis https URL.

View on arXiv
@article{wang2025_2504.01774,
  title={ Memory-efficient Low-latency Remote Photoplethysmography through Temporal-Spatial State Space Duality },
  author={ Kegang Wang and Jiankai Tang and Yuxuan Fan and Jiatong Ji and Yuanchun Shi and Yuntao Wang },
  journal={arXiv preprint arXiv:2504.01774},
  year={ 2025 }
}
Comments on this paper