Abstract
Accurate, non-contact vital-sign monitoring promises a scalable alternative to conventional sensors, yet low signal quality and long recording times have limited real-life adoption. We present a dual-modality system that combines Eulerian video magnified remote photoplethysmography (rPPG) from facial videos with optical flow-based shoulder tracking to estimate heart rate (HR) and respiratory rate (RR) from ultra-short 15 s recordings. With 200 participants, each providing 2 videos, 387 videos passed strict usability criteria, excluding flicker, blur, occlusion, and low illumination. For 15 s recordings, the HR estimates reached 98.5% accuracy within a ±10 beats per minute tolerance (MAE = 3.25, RMSE = 4.88, r = 0.93; p < 0.05) and the RR estimates achieved 98.4% accuracy within a ±5 respirations per minute tolerance (MAE = 0.69, RMSE = 0.87, r = 0.90; p < 0.05), exceeding prior studies that required 30 to 60 s recording lengths. Computational analysis on a standard home computer confirmed feasibility, with near real-time performance achievable on optimized hardware. By integrating complementary modalities and rigorous video quality control, the system overcomes low-SNR challenges, delivering high-fidelity, clinically validated vital signs monitoring. These results establish a robust, scalable, and precise framework for clinical and home care, demonstrating that accurate, contact-free HR and RR monitoring can now be achieved in seconds, making rapid, real-life vital signs assessment practical and accessible.