Quality-Aware Framework for Video-Derived Respiratory Signals

16 December 2025

Nhi Nguyen

Constantino Álvarez Casado

Le Nguyen

Manuel Lage Cañellas

Miguel Bordallo López

ArXiv (abs)PDF HTML Github

Main:3 Pages

2 Figures

Bibliography:2 Pages

3 Tables

Appendix:1 Pages

Abstract

Video-based respiratory rate (RR) estimation is often unreliable due to inconsistent signal quality across extraction methods. We present a predictive, quality-aware framework that integrates heterogeneous signal sources with dynamic assessment of reliability. Ten signals are extracted from facial remote photoplethysmography (rPPG), upper-body motion, and deep learning pipelines, and analyzed using four spectral estimators: Welch's method, Multiple Signal Classification (MUSIC), Fast Fourier Transform (FFT), and peak detection. Segment-level quality indices are then used to train machine learning models that predict accuracy or select the most reliable signal. This enables adaptive signal fusion and quality-based segment filtering. Experiments on three public datasets (OMuSense-23, COHFACE, MAHNOB-HCI) show that the proposed framework achieves lower RR estimation errors than individual methods in most cases, with performance gains depending on dataset characteristics. These findings highlight the potential of quality-driven predictive modeling to deliver scalable and generalizable video-based respiratory monitoring solutions.

View on arXiv

Comments on this paper