This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessFeature PaperArticle
A Dual-Branch Spatio-Temporal Feature Differencing Method for Robust rPPG Estimation
by
Gyumin Cho
Gyumin Cho 1
,
Man-Je Kim
Man-Je Kim 2,*
and
Chang Wook Ahn
Chang Wook Ahn 1,*
1
AI Graduate School, Gwangju Institute of Science and Technology, Gwangju 61005, Republic of Korea
2
Department of AI, Chonnam National University, Gwangju 61186, Republic of Korea
*
Authors to whom correspondence should be addressed.
Mathematics 2025, 13(23), 3830; https://doi.org/10.3390/math13233830 (registering DOI)
Submission received: 5 November 2025
/
Revised: 21 November 2025
/
Accepted: 26 November 2025
/
Published: 29 November 2025
Abstract
Remote photoplethysmography (rPPG) is a non-contact technology that estimates physiological signals, such as Heart Rate (HR), by capturing subtle skin color changes caused by periodic blood volume variations using only a standard RGB camera. While cost-effective and convenient, it suffers from a fundamental limitation: performance degrades severely in dynamic environments due to susceptibility to noise, such as abrupt illumination changes or motion blur. This study presents a deep learning framework that combines two structural modifications to ensure robustness in dynamic environments, specifically modeling movement noise and illumination change noise. The proposed framework structurally cancels global disturbances, such as illumination changes or global motion, through a dual-branch pipeline that encodes the face and background in parallel after Video Color Magnification (VCM) and then performs differencing. Subsequently, it utilizes a structure that injects a Temporal Shift Module (TSM) into the Spatio-Temporal Feature Extraction (SSFE) block to preserve long- and short-term temporal correlations and smooth noise, even amidst short and irregular movements. We measured MAE, RMSE, and correlation on the standard dataset UBFC-rPPG under four noise conditions: clean, illumination change noise, Movement Noise, Both Noise and the real-world in-vehicle dataset MR-NIRP (Stationary and Driving). Experimental results showed that the proposed method achieved consistent error reduction and correlation improvement compared to the VS-Net baseline in the illumination change noise-only and combined noise environments (UBFC-rPPG) and in the high-noise driving scenario (MR-NIRP). It maintained competitive performance in motion-only noise. Conversely, a modest performance disadvantage was observed under clean conditions (UBFC) and quasi-clean stationary conditions (MR-NIRP), interpreted as a design trade-off focused on global noise cancellation and temporal smoothing. Ablation studies demonstrated that the dual-branch pipeline is the primary contributor under illumination change noise, while TSM is the key contributor under movement noise, and that the combination of both elements achieves optimal robustness in the most complex scenarios.
Share and Cite
MDPI and ACS Style
Cho, G.; Kim, M.-J.; Ahn, C.W.
A Dual-Branch Spatio-Temporal Feature Differencing Method for Robust rPPG Estimation. Mathematics 2025, 13, 3830.
https://doi.org/10.3390/math13233830
AMA Style
Cho G, Kim M-J, Ahn CW.
A Dual-Branch Spatio-Temporal Feature Differencing Method for Robust rPPG Estimation. Mathematics. 2025; 13(23):3830.
https://doi.org/10.3390/math13233830
Chicago/Turabian Style
Cho, Gyumin, Man-Je Kim, and Chang Wook Ahn.
2025. "A Dual-Branch Spatio-Temporal Feature Differencing Method for Robust rPPG Estimation" Mathematics 13, no. 23: 3830.
https://doi.org/10.3390/math13233830
APA Style
Cho, G., Kim, M.-J., & Ahn, C. W.
(2025). A Dual-Branch Spatio-Temporal Feature Differencing Method for Robust rPPG Estimation. Mathematics, 13(23), 3830.
https://doi.org/10.3390/math13233830
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article Access Statistics
For more information on the journal statistics, click
here.
Multiple requests from the same IP address are counted as one view.