Illumination-Invariant Normalization for Robust rPPG Extraction

An, Byeong Seon; Park, Song Hee; Kim, Ye Jun; Song, Ye Rin; Cho, Geum Joon; Lee, Eui Chul

doi:10.3390/electronics15081683

Open AccessArticle

Illumination-Invariant Normalization for Robust rPPG Extraction

by

Byeong Seon An

^1,†

,

Song Hee Park

^1,†

,

Ye Jun Kim

¹,

Ye Rin Song

²,

Geum Joon Cho

³ and

Eui Chul Lee

^4,*

¹

Department of AI & Informatics, Graduate School, Sangmyung University, Hongjimun 2-gil 20, Jongno-gu, Seoul 03016, Republic of Korea

²

Department of Human-Centered Artificial Intelligence, Sangmyung University, Hongjimun 2-gil 20, Jongno-gu, Seoul 03016, Republic of Korea

³

Department of Medicine, College of Medicine, Korea University, Seoul 02841, Republic of Korea

⁴

Department of Human-Centered Artificial Intelligence, Graduate School, Sangmyung University, Hongjimun 2-gil 20, Jongno-gu, Seoul 03016, Republic of Korea

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Electronics 2026, 15(8), 1683; https://doi.org/10.3390/electronics15081683

Submission received: 12 March 2026 / Revised: 8 April 2026 / Accepted: 15 April 2026 / Published: 16 April 2026

(This article belongs to the Special Issue Advances in Digital Signal and Image Processing, Techniques, and Computations with Multidisciplinary Applications, 2nd Edition)

Download

Browse Figures

Versions Notes

Abstract

Remote photoplethysmography (rPPG) estimates heart rate by analyzing subtle blood-flow-induced color variations from camera videos; however, its performance is highly sensitive to illumination changes caused by variations in light intensity, position, and environmental conditions. To address this limitation, this study proposes a lightweight, training-free brightness normalization method that suppresses illumination-induced luminance fluctuations while preserving physiologically relevant color variations associated with blood perfusion. The proposed approach separates luminance and chrominance components from the frame-mean RGB vector and applies normalization only to the brightness component, thereby maintaining the intrinsic color direction essential for rPPG signal extraction and stabilizing temporal brightness without distorting chrominance relationships. Experimental evaluations show that channel-wise mean values vary only within

\pm 6

–

12 %

with negligible changes in standard deviation, while dynamic range and temporal stability are significantly improved. Furthermore, when combined with an SNR-based signal selection strategy, the proposed method reduces the mean absolute error (MAE) of the CHROM algorithm on the DLCN dataset from approximately 18–19 BPM to 4.87 BPM under complex illumination scenarios, with consistent improvements also observed on the MR-NIRP dataset. These results suggest that the proposed preprocessing method helps preserve blood-flow-induced temporal color variations and improves the robustness of rPPG measurement under diverse illumination conditions.

Keywords:

illumination normalization; remote photoplethysmography (rPPG); lightweight preprocessing; color-based physiological signal processing

1. Introduction

Remote photoplethysmography (rPPG) is a non-contact technique that estimates heart rate by analyzing subtle blood-volume-induced variations in skin color captured in camera videos. Because it does not require wearing sensors, rPPG can be deployed in scenarios where wearable devices are impractical or restricted, and it has therefore been actively studied for healthcare monitoring, driver-state assessment, and remote physiological sensing applications [1,2].

Despite its potential, rPPG is highly sensitive to recording conditions due to its reliance on optical reflectance. In particular, motion-induced artifacts caused by subtle subject movements, illumination imbalance between the camera and the skin, and changes in ambient lighting are major factors that degrade rPPG signal quality. These factors have long been recognized as key challenges that hinder the reliable use of rPPG in real-world environments. Motion artifacts are especially difficult to address because they comprise multiple irregular and aperiodic components, making them hard to quantify with a single signal model or to remove with a single unified strategy [3,4].

In contrast, illumination variations are relatively more amenable to physical and mathematical modeling, as the illumination component can be separated from the reflectance or chromatic components. Accordingly, a broad range of studies has explored illumination handling, including formula-driven approaches based on Retinex theory that jointly correct color and intensity [5,6] and model-based illumination normalization methods that normalize brightness or mitigate lighting effects [7,8]. More recently, incorporating illumination correction into rPPG pipelines has also been considered to improve robustness under changing lighting conditions.

However, most prior illumination-normalization-based rPPG studies primarily emphasize visual quality enhancement or color restoration accuracy. Comparatively less attention has been paid to whether these procedures preserve or enhance the temporally subtle chromatic fluctuations driven by blood perfusion—the core information source for rPPG. While normalization can stabilize global color distributions, it may simultaneously attenuate or distort the minute color variations associated with pulsatile blood flow [9].

To overcome these limitations, recent efforts have also explored applying deep learning-based illumination normalization or image restoration models to rPPG. Although learning-based approaches can be robust under complex lighting, their high computational cost and long inference latency often limit their practicality for rPPG applications that require real-time processing. This limitation is particularly critical for mobile or embedded deployment, where model compactness and latency are constraints as important as estimation accuracy [10,11].

Motivated by these considerations, this study proposes a lightweight illumination normalization method specifically designed for rPPG, with the goal of improving robustness to illumination changes while maintaining real-time feasibility. Unlike conventional illumination normalization methods that mainly target perceptual enhancement or global color restoration, the proposed method is designed to explicitly preserve the inter-channel RGB ratio structure associated with pulsatile blood-flow signals. By mathematically separating chromatic information from illumination and applying normalization only to the brightness component, the proposed method suppresses lighting-induced luminance fluctuations while preserving temporal chromatic stability that is critical for physiological signal extraction. Because it does not require training and is computationally efficient, it can serve as a practical preprocessing strategy for improving the temporal stability and reliability of rPPG signals across diverse lighting environments. Unlike conventional illumination normalization methods that treat inter-channel ratio preservation as a byproduct, the proposed method explicitly enforces this constraint as a primary design objective for rPPG signal extraction. This distinction is critical because rPPG relies on subtle inter-channel temporal variations rather than absolute intensity, and preserving this structure directly impacts physiological signal fidelity.

2. Related Works

This section reviews prior studies related to illumination normalization and low-light rPPG measurement. Table 1 summarizes representative approaches and highlights their contributions, rPPG-specific limitations, and how the proposed illumination normalization complements them.

2.1. Prior Studies on Illumination Normalization

Illumination normalization has generally evolved along two directions: (i) separating the illumination component and the reflectance component to compensate for illumination changes, and (ii) learning representations that are robust to illumination variation using deep learning.

Ref. [12] proposed an energy-minimization-based Retinex framework that simultaneously estimates illumination and reflectance, pointing out that existing methods often suffer from reduced estimation accuracy under complex lighting conditions because they rely mainly on simple

L_{2}

losses and regularization terms. Their method alternates between fixing and optimizing illumination and reflectance via alternating minimization, explicitly modeling the interaction between the two components. However, because it repeatedly solves large linear systems, the computational complexity per iteration increases to

O (M N)

, and the authors report substantial increases in storage and time consumption. Although parallel splitting-up is introduced to partially alleviate the computational burden, the approach still requires iterative optimization over the full frame, which can be costly for high-resolution facial videos or real-time rPPG applications.

Retinex-Based Fast Algorithm (RBFA) [13] proposed a method to improve the visual quality of low-light images by computing the reflectance component using the V channel in the HSV color space and an estimated illumination map, and then adjusting saturation by leveraging the difference between the mean reflectance and the mean saturation (S). This method effectively improves contrast and details by automatically balancing brightness and saturation from a single image. However, the method is primarily aimed at enhancing the visual quality of still images, and it does not discuss whether the temporally varying color changes induced by blood flow—critical in rPPG—are preserved, nor how the method relates to heart-rate (HR) estimation performance.

Ref. [14] focused on performance degradation in deep face-recognition models under severe illumination changes. They synthesized a reference image with stable illumination using an SVD-based procedure and trained a network to transform input images with large illumination variation to more closely match this reference. On Extended Yale B under severe illumination conditions (subsets 4–5), they improved the average recognition rate of ArcFace from 38.56% to 56.89%, demonstrating effectiveness for face recognition. Nevertheless, the illumination removal process may lead to substantial loss of skin-intrinsic color information, and the network architecture is complex, making real-time deployment difficult. In addition, because the model is trained to match a specific reference illumination condition, generalization to new illumination environments depends strongly on the training data distribution.

2.2. Model-Based rPPG Measurement Under Low-Light Conditions

Ref. [15] systematically evaluated representative model-based rPPG measurement methods, including DeepPhys, PhysNet, and TS-CAN, under challenging conditions such as low light and elevated heart rate. Using the newly proposed Challenging Heart Rate and Illumination (CHILL) dataset together with existing public benchmark datasets, they found that many deep learning-based rPPG models exhibit performance degradation, particularly under high heart-rate conditions. While the isolated effect of low-light conditions was relatively limited, cross-dataset evaluation revealed instability in generalization performance—for example, the HR MAE on CHILL increased by more than twofold depending on which dataset was used for training. These results suggest that existing model-based rPPG approaches have fundamental limitations in stably handling complex physiological variations and real-world environmental changes.

Ref. [16] pointed out that many prior rPPG studies rely on spatial skin perception and temporal rhythm interactions under ideal illumination, and thus their performance deteriorates in complex outdoor environments or in scenarios with extreme illumination changes. To address this, they proposed a video-transformer-based rPPG model that jointly learns spatiotemporal information, enhancing rPPG-related representations through spatiotemporal map reconstruction. They further mitigate time-varying external interference using components such as global interference sharing, subject background reference, and self-supervised disentanglement, and estimate heart rate via temporal regression that combines frequency-domain constraints with physiological priors. The proposed method showed competitive performance across diverse datasets and outdoor scenarios; however, because it relies on a learning-based structure, additional consideration is needed regarding model complexity and computational cost.

Ref. [17] noted that existing CNN-based rPPG methods cannot sufficiently model long-range spatiotemporal dependencies due to their limited receptive fields, and proposed PhysFormer, an rPPG framework based on a temporal-difference transformer. PhysFormer enhances quasi-periodic rPPG features via temporal-difference-guided global attention and improves rPPG representations by adaptively integrating local and global spatiotemporal information. It also improves training stability and generalization by combining dynamic constraints in the frequency domain with label distribution learning. Experimentally, PhysFormer achieved lower MAE and RMSE and higher correlation (r) than CNN-based baselines on OBF, MAHNOB-HCI, and MMSE-HR, demonstrating strong intra- and cross-dataset performance. The paper also discusses that, due to its architecture, the model has a relatively large number of parameters and computational demands, which can limit deployment in mobile or lightweight environments.

2.3. Retinex-Based rPPG Measurement Under Low-Light Conditions

Ref. [18] proposed a method to alleviate the sharp degradation of rPPG performance in low-light environments by estimating a G-channel-based Retinex-LIME illumination map and extracting rPPG from frames with the brightness component removed using GREEN, ICA, and POS. In evaluations on an in-house dataset spanning 1–100 lux, the method showed low-light compensation effects in some ranges; for instance, under very low illumination (1–4 lux), the HR MAE of POS decreased from 59.89 bpm to 56.84 bpm and from 23.98 bpm to 8.13 bpm when comparing the original video to the processed video. Conversely, in ranges where illumination was moderately available (approximately ≥10 lux), POS MAE was reported to increase (e.g., from 1.70 bpm to 2.63 bpm and from 1.27 bpm to 6.41 bpm). The paper reports that, under low light, pulse components in the red and blue channels nearly vanish while noise increases, causing POS—which combines three channels—to incur larger MAE than other methods. This suggests that when RGB inter-channel ratios and the chrominance structure are not stably maintained under low light, chrominance-based rPPG algorithms may be particularly vulnerable.

Ref. [19] proposed a deep learning-based image enhancement model (IEM) grounded in Retinex theory to mitigate rPPG performance degradation across diverse illumination conditions. The study emphasized that image-quality degradation caused by illumination changes directly affects rPPG signal extraction and heart-rate estimation accuracy, and applied a Retinex-inspired enhancement model as a preprocessing stage in the rPPG pipeline. The proposed IEM was integrated with a pre-trained 3D CNN-based rPPG model (PhysNet) and fine-tuned using a time-shifted negative Pearson correlation loss to alleviate temporal alignment issues between ground-truth PPG and predicted rPPG signals. Experiments on public datasets covering various lighting conditions and on an in-house dataset showed improved rPPG extraction and HR estimation accuracy under both low- and high-illumination settings, and demonstrated that the gains were attributable to the enhancement model itself rather than merely to re-training effects. However, this approach relies on a coupled structure of a learning-based enhancement model and an rPPG network, and it does not include an explicit analysis of whether temporally subtle color changes induced by blood flow are preserved during illumination correction.

Ref. [20], rPPG-FuseNet is a representative study that integrates a Retinex-based approach into the rPPG pipeline to mitigate performance degradation under non-uniform illumination. The method applies Multi-Scale Retinex (MSR) to compensate for illumination non-uniformity, and uses the original RGB video and the MSR-processed video as complementary inputs to construct spatiotemporal rPPG features. The proposed network fuses RGB and MSR signals to estimate heart rate, aiming to more effectively recover discriminative features that are obscured by illumination in low-light environments. The authors report that the fusion architecture incorporating Retinex-based preprocessing yields meaningful performance improvements over prior methods in both intra- and inter-database evaluations on public datasets. Nevertheless, because the approach depends on a learning-based network that takes MSR-processed results as input, separation of illumination components and blood-flow components is not performed explicitly, and is instead left to internal representation learning within the model.

Existing illumination correction methods often do not explicitly address the preservation of the subtle temporal color variations essential for rPPG, as they either rely on complex models, focus on still-image quality, or may distort the inter-channel RGB ratio structure through single-channel–based illumination maps. Although deep learning–based and Retinex-based approaches can improve performance under varying illumination, they are often computationally expensive, dependent on training data, and do not explicitly prioritize the preservation of pulsatile blood-flow-related chromatic information. In contrast, the proposed method is designed as an rPPG-specific normalization strategy that explicitly preserves the inter-channel RGB ratio structure associated with blood-flow-induced temporal color variations. By separating brightness and chromatic information from the frame-mean RGB vector and applying normalization only to the brightness component, the method suppresses illumination-induced luminance fluctuations while maintaining temporal chromatic stability relevant to physiological signal extraction. With an additional cost of approximately 0.16 ms per frame, the proposed method satisfies real-time constraints and showed reduced HR MAE across conventional rPPG algorithms (ICA, POS, and CHROM) under diverse illumination conditions.

3. Method

Figure 1 presents the overall framework of the proposed method, describing the end-to-end pipeline designed for robust rPPG extraction under diverse real-world illumination conditions.

3.1. Dataset

To evaluate the robustness of the proposed rPPG signal extraction method against illumination variations, this study employs two publicly available datasets constructed under distinct environmental conditions: DLCN [21] and MR-NIRP [22]. The DLCN dataset was collected in a controlled environment with systematically varying lighting conditions, whereas the MR-NIRP dataset was acquired in an uncontrolled driving environment. These contrasting settings enable a comprehensive assessment of the generalization capability of the proposed method under diverse illumination scenarios. A comparison of the key characteristics of the two datasets is summarized in Table 2. All facial images presented in this study were obtained from these publicly available datasets and were used within the scope of the datasets’ original ethical approval and informed consent procedures.

3.1.1. DLCN

The DLCN dataset is a recent publicly available benchmark constructed to quantitatively evaluate the performance of rPPG signal extraction under complex and diverse nighttime illumination conditions. It consists of RGB facial videos collected from a total of 98 participants, with all videos recorded at a resolution of 640 × 480 pixels and a frame rate of 30 fps. A distinctive characteristic of the DLCN dataset is its session-based structure, in which both illumination conditions and physiological states are systematically controlled.

For each subject, a total of eight sessions are provided. Sessions 1–4 are recorded under resting conditions, while Sessions 5–8 are captured after physical exercise, thereby incorporating physiological variability. Each session is further divided into four illumination scenarios defined by combinations of light intensity and light position, and the same scenarios are repeated under identical conditions before and after exercise. The detailed illumination scenarios are as follows:

Fixed Intensity & Fixed Position (FI & FP): A stable illumination condition in which both the light intensity and position remain constant.
Variable Intensity & Fixed Position (VI & FP): A condition where the light position is fixed while the illumination intensity varies over time, inducing brightness fluctuations.
Fixed Intensity & Variable Position (FI & VP): A condition in which the light intensity remains constant but the light position changes, resulting in continuously varying incident angles on the face.
Variable Intensity & Variable Position (VI & VP): The most complex scenario, where both the illumination intensity and position vary simultaneously, closely approximating real-world illumination conditions.

By independently or jointly varying illumination intensity and position across these four scenarios, the DLCN dataset systematically models diverse environments that induce abrupt changes in facial reflectance characteristics. Since each session is accompanied by ground-truth heart rate measurements, the dataset enables quantitative analysis of how video-based rPPG algorithms are affected by different levels of illumination variation. Owing to this well-controlled and comprehensive design, DLCN serves as a highly suitable benchmark for validating the robustness of rPPG algorithms against illumination changes.

Examples of DLCN frames under different scenarios over time are illustrated in Figure 2.

3.1.2. MR-NIRP

The MR-NIRP dataset is a publicly available benchmark consisting of videos captured in real-world driving environments along with synchronized ground-truth physiological signals, providing an important basis for evaluating rPPG performance under uncontrolled conditions. The dataset includes simultaneously recorded RGB videos and near-infrared (NIR) videos in the 940 nm and 975 nm bands from a total of 18 participants, with all videos captured at a resolution of 640 × 640 pixels and a frame rate of 30 fps.

The recording environments are broadly categorized into Driving and Garage settings. The Garage environment represents a relatively stable condition with a single illumination setup. Since this study focuses on analyzing the impact of illumination variations on rPPG signal extraction, the Garage environment—where illumination changes are minimal—is excluded, and only the Driving environment is utilized.

The Driving data are further divided into three scenarios based on motion intensity: Drive-small, Drive-still, and Drive-large. To independently assess the effects of illumination changes, this study uses only the Drive-small and Drive-still scenarios, which involve relatively limited motion. The Drive-large scenario is excluded from the analysis because it contains substantial head movements and posture changes, leading to a mixture of illumination effects and motion-induced artifacts.

Consequently, the MR-NIRP dataset used in this study comprises driving videos recorded under diverse and mixed illumination conditions, including non-uniform lighting, saturation, daytime, and nighttime scenarios. This configuration enables the validation of the robustness of the proposed method under irregular and dynamically changing illumination conditions encountered in real-world driving environments.

Examples of the MR-NIRP dataset under various environmental conditions are shown in Figure 3.

3.2. Illumination Normalization

In this study, we propose a preprocessing method that preserves color components while normalizing only the illumination component in RGB frames, thereby reducing the influence of illumination variations and stably emphasizing color changes. Specifically, each RGB frame is decomposed into an illumination component and a color component, the illumination component is independently normalized, and the color information is subsequently re-integrated. This formulation facilitates robust rPPG signal extraction under varying lighting conditions.

First, the mean RGB vector of frame t is computed as follows:

m_{t} = [\begin{matrix} R_{t} \\ G_{t} \\ B_{t} \end{matrix}],

(1)

where

R_{t}

,

G_{t}

, and

B_{t}

denote the average values of the red, green, and blue channels, respectively.

As described in Equation (1), the illumination magnitude is defined as the maximum component of the RGB vector, corresponding to the dominant channel intensity. This value is used as a global brightness scale of the frame. The illumination component

L_{t}

is computed as follows:

L_{t} = {∥ m_{t} ∥}_{\infty} = max (R_{t}, G_{t}, B_{t}) .

(2)

Using the illumination magnitude defined in Equation (2), the color component can be independently extracted by removing the illumination scale. In this study, the illumination magnitude is defined as the maximum component of the frame-mean RGB vector rather than the channel average or an

L_{2}

-norm-based magnitude, because the aim of the proposed formulation is not to reconstruct physical illumination itself, but to isolate a global brightness scale while preserving the inter-channel RGB ratio structure that is essential for chrominance-based rPPG. The use of the maximum RGB component is not intended to estimate physical illumination, but to define a dominant-channel-referenced scaling that minimizes cross-channel mixing. This is particularly important in rPPG, where pulsatile signals are encoded in subtle inter-channel relationships, and alternative formulations such as averaging or norm-based scaling may distort these relationships. Specifically, using the maximum RGB component enables brightness scaling with respect to the dominant channel, which helps preserve the original color direction of the RGB vector. In contrast, alternative definitions such as channel averaging or

L_{2}

norms may introduce cross-channel mixing effects in the scaling term, potentially distorting the relative chromatic structure. The color component

c_{t}

is computed as follows:

c_{t} = \frac{m_{t}}{L_{t} + ε},

(3)

where

ε

is a small constant introduced to avoid division by zero. As shown in Equation (3), the resulting color component represents a normalized color direction in which the maximum channel value is scaled to one. Accordingly, by using

L_{t} = max (R_{t}, G_{t}, B_{t})

, the normalized color component preserves the color direction of the original RGB vector while reducing illumination-driven magnitude variation, thereby minimizing distortion of subtle blood-flow-induced chromatic changes.

Figure 4 illustrates example images obtained by separating the illumination and color components from an input frame using Equations (2) and (3). The color component

c_{t}

represents inter-channel RGB ratios with the illumination magnitude removed. Since it encodes color direction rather than perceptual color, the resulting image may appear biased toward a single hue. In particular, human skin regions tend to appear reddish due to their inherently higher red-channel contribution.

After independently separating the color and illumination components, illumination normalization is applied. A general illumination scaling model is expressed as follows:

L_{t}^{'} = α \cdot L_{t},

(4)

where

α

denotes an illumination adjustment factor.

Given a target illumination level

L_{ref}

, the scaling factor

α

can be defined as follows:

α = \frac{L_{ref}}{L_{t}}, L_{t}^{'} = L_{ref} .

(5)

As shown in Equation (5), the illumination magnitude

L_{t}^{'}

is forced to a fixed reference value

L_{ref}

, regardless of the original frame brightness.

The normalized illumination value is then recombined with the original color direction to generate the final color vector. Importantly, the illumination component is not completely discarded, since illumination variations still carry informative cues for rPPG extraction. The final RGB vector is obtained by combining the normalized illumination magnitude with the original color component as follows:

m_{t}^{'} = L_{t}^{'} \cdot c_{t} = [\begin{matrix} R_{t}^{'} \\ G_{t}^{'} \\ B_{t}^{'} \end{matrix}],

(6)

where

m_{t}^{'}

represents an RGB vector with normalized illumination applied.

Compared to existing methods that decompose illumination and color components, the proposed approach is intended to be more suitable for rPPG tasks. Conventional techniques such as Retinex, Histogram Equalization, Gamma Correction, and Color Constancy introduce nonlinear transformations across RGB channels, which can distort or attenuate the subtle color variations that are critical for rPPG signal extraction. In contrast, the proposed method preserves the original color direction while linearly normalizing only the global illumination component. As a result, it improves robustness to illumination changes while helping preserve rPPG-related color variations, which may contribute to higher signal-to-noise ratio (SNR) and more stable heart rate estimation. A comparison with existing illumination normalization methods is summarized in Table 3.

3.3. rPPG Extraction

The rPPG processing pipeline proposed in this study consists of illumination normalization (I-norm) for obtaining color signals robust to illumination variations, facial and skin region extraction, algorithm-based rPPG signal extraction, SNR-based signal quality assessment, and heart rate (BPM) estimation. In particular, the proposed method applies I-norm to suppress brightness drift caused by illumination changes. Subsequently, an SNR-based signal quality evaluation is performed to select reliable rPPG signals, enabling stable heart rate estimation even under low-light conditions. The overall workflow of the proposed pipeline is illustrated in Figure 5.

After applying I-norm to each frame, facial preprocessing was conducted. Since the DLCN dataset is provided with pre-cropped facial regions, no additional facial preprocessing was applied. In contrast, for the MR-NIRP dataset, faces were detected and cropped from the original frames. For all frames, facial landmarks were extracted using a 3D Dense Face Alignment-based method [23], which were subsequently used to define skin regions and generate skin masks.

The skin mask was generated by constructing a facial boundary polygon based on jaw-to-brow landmarks to remove background regions. In addition, the eye, nose, and mouth regions were further masked to suppress motion-related noise. By excluding dynamic facial components such as eye blinking and mouth movements, the proposed pipeline aims to extract more reliable rPPG signals.

To analyze the effect of the proposed I-norm preprocessing on various color-based rPPG algorithms, several widely used methods were employed in this study, including POS [3], ICA [24], CHROM [4], GREEN [25], LGI [26], and PBV [27].

The Plane-Orthogonal-to-Skin (POS) method projects normalized RGB signals onto two orthogonal chrominance components and constructs the rPPG signal as follows:

x_{t} = \frac{G_{t}}{R_{t}}, y_{t} = \frac{G_{t}}{B_{t}}, {rPPG}_{t} = α x_{t} - β y_{t},

(7)

where

α

and

β

are scaling coefficients computed based on the standard deviations within a temporal window.

In Independent Component Analysis (ICA)-based rPPG, the RGB signal is assumed to be a linear mixture of statistically independent source components, and periodic blood-volume-induced variations are separated through independent component decomposition. The RGB signal vector

x (t)

is decomposed into independent components as follows:

s (t) = W x (t),

(8)

where

W

denotes the unmixing matrix. Among the resulting independent components, the one exhibiting the largest spectral energy within the heart rate frequency band is selected as the rPPG signal.

The Chrominance-based rPPG (CHROM) method constructs an illumination-robust rPPG signal by combining two chrominance difference signals. The chrominance components are defined as follows:

X_{t} = 3 R_{t} - 2 G_{t}, Y_{t} = 1.5 R_{t} + G_{t} - 1.5 B_{t},

(9)

and the final rPPG signal is obtained by combining the two components using their standard deviation ratio as follows:

{rPPG}_{t} = X_{t}^{'} - \frac{σ_{X}}{σ_{Y}} Y_{t}^{'},

(10)

where

σ_{X}

and

σ_{Y}

denote the standard deviations of

X_{t}

and

Y_{t}

within a temporal window.

The GREEN method is a simple approach that extracts the rPPG signal using only the green channel, which is known to be most sensitive to blood volume variations. The rPPG signal at frame t is defined as follows:

{rPPG}_{t} = G_{t},

(11)

followed by band-pass filtering to emphasize the heart-rate frequency band. Although computationally efficient and suitable for real-time processing, the GREEN method is relatively vulnerable to illumination changes.

Local Group Invariance (LGI) exploits instantaneous proportional changes in color signals to achieve robustness against illumination variations. Specifically, the temporal derivatives of RGB signals are normalized as follows:

g (t) = \frac{d}{d t} (\frac{{[\begin{matrix} R (t) & G (t) & B (t) \end{matrix}]}^{⊤}}{R (t) + G (t) + B (t)}),

(12)

and the three components of

g (t)

are subsequently reduced to a single rPPG signal using either principal component analysis (PCA) [28] or a weighted summation scheme.

The Pulse Blood Volume (PBV) method assumes that the relative ratios among color channels induced by blood volume changes remain constant. Based on this assumption, the rPPG signal is extracted by projecting the normalized RGB signal

x^{'} (t)

onto a specific direction as follows:

rPPG (t) = w^{⊤} x^{'} (t),

(13)

where

w

is a projection vector designed to maximize color responses associated with blood volume variations.

The raw rPPG signals obtained from each algorithm are first detrended to remove baseline drift, and a Butterworth band-pass filter with a frequency range of 0.7–3.0 Hz is applied to retain only the heart-rate band. Subsequently, an SNR (Signal-to-Noise Ratio)-based signal quality evaluation stage is introduced to assess the reliability of the rPPG signal.

The SNR quantifies how strongly heart-rate-related components appear in the rPPG signal relative to surrounding noise and is used as an indicator of signal quality. Based on the power spectral density (PSD)

P (f)

, a frequency band of ±0.1 Hz around the dominant frequency is defined as the signal band, while the remaining frequencies within the physiological range (0.7–3.0 Hz) are treated as the noise band. The SNR is computed as follows:

SNR = 10 {log}_{10} (\frac{\sum_{f \in B_{signal}} P (f)}{\sum_{f \in B_{noise}} P (f)}) .

(14)

A higher SNR value indicates that heart-rate-related frequency components are more dominant than noise components, implying higher rPPG signal quality. Therefore, in this study, the SNR value is used as a signal quality indicator to minimize the influence of noisy signal segments.

For heart rate estimation, both Contact Photoplethysmography (cPPG) and rPPG signals are resampled to 30 Hz and temporally aligned to have the same signal length. The analysis is then performed using non-overlapping 30-s windows. Within each window, the fast Fourier transform (FFT) is applied to obtain the frequency spectrum

X (f)

. The dominant frequency is determined as follows:

f_{dom} = arg max_{f} | X (f) |,

(15)

and the heart rate (BPM) is computed as follows:

HR = 60 \cdot f_{dom} .

(16)

The difference between rPPG-based BPM and cPPG-based BPM across all windows is evaluated using the mean absolute error (MAE), root mean square error (RMSE), mean absolute percentage error (MAPE), and the percentage of estimates within a ±6 BPM tolerance (PTE6).

Through this pipeline structure, illumination-robust rPPG signals can be generated, and the influence of noise can be reduced through SNR-based signal quality assessment, enabling more stable heart rate estimation.

4. Results

4.1. Signal-Level Validation of Illumination Normalization

To evaluate the effectiveness of the proposed illumination normalization method on the DLCN dataset, the AC/DC ratio was analyzed. Figure 6 illustrates the distribution of AC/DC ratios over the entire dataset, where the yellow distribution corresponds to the original signals and the blue distribution represents the normalized results. Compared to the original signals, the AC/DC ratio distribution after normalization is shifted toward lower values, suggesting that the proposed method reduces illumination-induced DC components while maintaining pulsatile information.

In addition, frame-level changes after applying illumination normalization to the DLCN dataset are illustrated in Figure 7.

In addition to overall illumination stabilization, several analyses were conducted to verify the preservation of color information. Specifically, skin-region signals of the R, G, B, Cb, and Cr channels were compared with and without illumination normalization. The comparison results are summarized in Table 4.

An increase was observed in all channels except for the Cb channel. Overall, the standard deviations exhibited minimal variation, suggesting that the color components were largely preserved after normalization. Figure 8 illustrates a comparison of the R, G, B, Cb, and Cr signals before and after illumination normalization.

To evaluate the degree to which illumination variations were suppressed, channel-wise temporal stability was analyzed in addition to color variation. The changes in the stability ratio for each channel are summarized in Table 5.

More than a twofold improvement in temporal stability was observed across all channels, suggesting that the proposed illumination normalization helps suppress temporal flicker. Beyond color preservation at the frame level, inter-channel correlations are also critical for rPPG signal extraction. To examine this aspect, changes in the correlations among the RGB channels and between the Cb and Cr channels were analyzed. The correlation changes between channels are summarized in Table 6.

As the correlation coefficients among the RGB channels increase overall, the color direction appears to become more stable, suggesting that illumination-induced common-mode variations are more consistently aligned. This stabilization may facilitatemore reliable directional color changes, which are essential for chrominance-based rPPG methods. Meanwhile, the Cb and Cr correlation increases from −0.431 to −0.322 after illumination normalization. Although the correlation value itself increases, its absolute magnitude decreases, indicating a weakened negative correlation between the chrominance channels. Since the Cb and Cr components primarily represent color-difference information that is more sensitive to illumination and color bias than to blood perfusion, this change suggests that illumination-related chrominance coupling is partially suppressed rather than directly enhancing blood-flow-induced information. Figure 9 compares the baseline rPPG signals, the signals obtained after illumination normalization (I-norm), and the reference cPPG signals for several representative methods. In the time domain, the I-norm signals generally exhibit waveform patterns that are more consistent with the reference cPPG signals than the baseline signals. In the frequency domain, the normalized PSDs show that, after illumination normalization, the dominant spectral peaks tend to move closer to the cPPG peak or become more clearly concentrated around the physiological frequency band. This tendency is particularly evident in methods such as CHROM, ICA, PBV, and POS, where the baseline signals exhibit broader or less consistent spectral distributions, whereas the I-norm signals show improved peak concentration and reduced spectral dispersion. These observations suggest that the proposed normalization reduces illumination-induced fluctuations and may help preserve pulse-related periodic components more effectively than the baseline input.

Before applying illumination normalization, the average frame processing time was 1.18 ms per frame. After incorporating the normalization process, the processing time increased to 1.34 ms per frame, corresponding to an average overhead of approximately 0.16 ms per frame. This additional computational cost is marginal and does not affect real-time processing performance.

4.2. Effect of Illumination Normalization on rPPG Extraction Performance

The effect of the proposed illumination normalization (I-norm) on rPPG signal extraction performance was analyzed. Heart rate estimation performance before (baseline) and after applying I-norm was compared on the DLCN dataset using the POS, ICA, and CHROM algorithms. The analysis was conducted using a window size of 30 s without overlap, and BPM values were computed for each window. The results are presented in Table 7.

As shown in Table 7, after applying I-norm, most algorithms exhibited reduced MAE and RMSE values and an increased PTE6 ratio. These results indicate that I-norm alone already provides consistent performance improvements over the baseline across multiple state- and scenario-based conditions, even before any signal-quality filtering is applied. In particular, the CHROM algorithm demonstrated a substantial improvement in the Rest condition, where the MAE decreased from 11.33 BPM to 5.84 BPM.

Table 8 shows that I-norm generally achieved the best performance among the compared image enhancement methods across both state- and scenario-based evaluations. In particular, I-norm yielded the lowest errors in most conditions for POS and CHROM, and maintained more stable performance than Retinex, Gamma, and Lab as illumination conditions became more challenging. Although the error increased under scenarios involving illumination position changes, I-norm still showed the most favorable overall results, indicating that it was among the most favorable enhancement methods for robust rPPG extraction under the evaluation settings considered in this study.

To evaluate the feasibility of real-time application, BPM estimation performance was further compared by varying the window size to 5, 10, 20, and 30 s with a 1-s estimation interval. The results are summarized in Table 9. The analysis indicates that the highest accuracy was achieved with a 30-s window. As the window size decreased, the estimation accuracy tended to decline because the periodic characteristics of the rPPG signal were insufficiently captured. In particular, a marked increase in MAE was observed across all algorithms when a 5-s window was used. Therefore, considering both accuracy and stability, a 30-s window was adopted as the default configuration in this study.

The same analysis was also conducted on the MR-NIRP dataset, and the results are presented in Table 10. A general trend of MAE reduction after applying I-norm was observed across most algorithms. For example, in the PBV algorithm, the MAE decreased from 13.98 to 10.27 under the Driving Small Motion condition. Similarly, the CHROM algorithm showed a substantial improvement in the Driving Still condition, where the MAE decreased from 11.16 to 7.21.

4.3. Effect of SNR-Based Signal Quality Filtering on rPPG Estimation

To further analyze the effect of I-norm, an additional signal quality evaluation based on the signal-to-noise ratio (SNR) was applied to assess the incremental effect of filtering low-quality segments after illumination normalization On the DLCN dataset, the results of Baseline + SNR and I-norm + SNR were compared, and the results are presented in Table 11.

When I-norm + SNR was applied, most algorithms showed substantial reductions in MAE, RMSE, and MAPE, along with an increase in the PTE6 ratio. In the Rest condition, the MAE of the POS algorithm decreased from 8.57 in Baseline + SNR to 2.91 after applying I-norm + SNR. Similarly, for the CHROM-based rPPG method, the MAE in the Rest condition decreased substantially from 7.70 to 2.36. A similar trend was observed in the Exercise condition, where the MAE of the CHROM algorithm decreased from 6.96 to 3.59. The SNR-based filtering step further improved performance by excluding low-quality segments. Importantly, I-norm alone already provides substantial performance improvement prior to any SNR-based filtering. For example, the CHROM MAE is significantly reduced before filtering is applied, indicating that the primary performance gain originates from illumination normalization rather than post hoc selection. Moreover, when comparing Baseline + SNR with I-norm + SNR, the I-norm-based pipeline generally achieved better overall performance, suggesting that illumination normalization contributed beyond the filtering effect.

The same analysis was also conducted on the MR-NIRP dataset, and the results are presented in Table 12. In the Driving Small Motion condition, the MAE of the POS algorithm decreased from 9.17 BPM to 4.15 BPM. In the Driving Still condition, the MAE of the ICA algorithm decreased from 7.38 BPM to 1.43 BPM.

The experimental results indicate that the proposed illumination normalization did not yield uniform improvements across all conditions, but produced more pronounced performance gains in scenarios with substantial illumination changes and in color-based rPPG algorithms. In particular, on the DLCN dataset, the improvement was more evident under complex conditions where both illumination intensity and position varied simultaneously, and the MAE of the CHROM algorithm was reduced from approximately 18–19 BPM to 4.87 BPM. Moreover, an overall reduction in error was also observed in the Drive-small and Drive-still conditions of the MR-NIRP dataset, suggesting that the proposed method can operate effectively not only in controlled illumination environments but also in more realistic conditions considered in this study. These findings suggest that the proposed method does not simply correct brightness, but also reduces frame-to-frame brightness fluctuations in a manner that is consistent with preserving the inter-channel color ratio information used for rPPG extraction.

4.4. SNR-Based Quality Filtering Strategy

The signal-to-noise ratio (SNR) is a metric that represents the ratio between the energy of the signal and that of the noise. When the SNR is greater than 0, the signal energy exceeds the noise energy, meaning that periodic patterns or dominant frequency components tend to appear more clearly. In physiological signals such as rPPG, this condition increases the likelihood that the frequency component corresponding to the heart rate can be observed in a stable manner.

In contrast, when the SNR is lower than 0, the noise energy becomes dominant over the signal energy, increasing the possibility that the frequency components of the true physiological signal are masked or distorted by noise. Therefore, SNR = 0 can be interpreted as the boundary where the signal energy is at least equal to the noise energy, and it can serve as a practical reference point for evaluating signal quality. An example of signals categorized based on SNR = 0 is illustrated in Figure 10.

When the SNR threshold (th) was set to 0, the data loss rate observed in the DLCN dataset is summarized in Table 13. The data retention rate remains high (over 93–97%), indicating that performance improvement is not achieved by discarding large portions of data.

Similarly, the data loss rate for the MR-NIRP dataset is presented in Table 14.

4.5. Comparison with Existing Benchmark Studies

The performance of the proposed I-norm + SNR-based rPPG processing pipeline was compared with previously reported results. Table 15 presents a comparison between the algorithm-based baseline results and deep learning-based model performances reported in the original DLCN study and the results obtained in this study.

The proposed method achieved lower MAE than conventional algorithm-based approaches under most illumination conditions. For example, in the FI&FP condition, the MAE of the POS algorithm decreased from 8.04 BPM in the previous study to 0.85 BPM with the proposed method. The results also show competitive performance compared with deep learning-based models.

A similar comparison was conducted on the MR-NIRP dataset, and the results are summarized in Table 16. In the Driving Small Motion condition, the proposed method achieved an MAE of 4.15 BPM using the POS algorithm, which is lower than the error reported by the baseline methods. In the Driving Still condition, the MAE reached 1.43 BPM when the ICA algorithm was applied, outperforming the existing algorithm-based approaches. The proposed method also showed meaningful performance compared with deep learning-based rPPG models. For instance, while PhysNet reported an MAE of 4.37 BPM in the Driving Still condition, the proposed method achieved a significantly lower MAE of 1.43 BPM.

An SNR value lower than 0 indicates that the noise component is greater than the pulse-related signal within the target frequency band. In the DLCN analysis, such failure segments were most frequent under VP conditions, accounting for approximately 33% ± 5, followed by VI at approximately 26% ± 3, FP at approximately 22% ± 4, and FI at approximately 19% ± 4. This can be interpreted as resulting from the fact that, when illumination position changes (VP) are involved, variations in facial shading, reflection, and local brightness imbalance become more pronounced, causing non-periodic optical changes to dominate over the periodic components of the rPPG signal. Changes in illumination intensity (VI) also led to SNR degradation, but their effect was relatively smaller than that of position changes. This tendency was consistent with the actual performance results, where the VI & VP condition, involving simultaneous changes in illumination intensity and position, emerged as the most challenging scenario. Therefore, the SNR-based selection was more effective in relatively stable illumination conditions, but it also revealed a limitation in that failure segments occurred more frequently in complex environments involving illumination position changes.

5. Discussion

In this study, only the luminance component was normalized while preserving color information as much as possible. To verify the effect of the proposed normalization, the differences in the mean values of the R, G, B, Cb, and Cr color channels before and after normalization were analyzed. Figure 11 visually illustrates the magnitude of these changes.

After applying the proposed brightness normalization technique, the mean values of all RGB and CbCr channels changed only within an approximately ±6–12% range, indicating that the channel-wise DC components were not excessively distorted and that the proposed normalization primarily compensated for global illumination bias. More importantly, these limited channel-wise shifts suggest that the method does not substantially alter the inter-channel RGB ratio structure, which is critical for preserving blood-flow-related chromatic information in rPPG. In particular, the increase in the mean values of the R, G, and B channels can be interpreted not as an artificial amplification of the absolute frame brightness, but as a slight upward adjustment of the effective signal level that includes skin reflectance components, which may enhance the responsiveness of channels sensitive to blood flow variations. In contrast, the decrease in the mean value of the Cb channel can be attributed to a shift in the blue-difference component toward a more neutral reference, mitigating illumination-induced color bias, while the increase in the Cr channel emphasizes redness variations associated with blood perfusion. Notably, these mean value changes were accompanied by negligible variations in standard deviation, suggesting that the normalization primarily adjusted the DC level while largely preserving the temporal rPPG waveform structure. This supports the interpretation that the proposed method functions not merely as a perceptual brightness correction, but as an rPPG-oriented normalization strategy that stabilizes illumination while preserving temporally subtle chromatic variations relevant to physiological signal extraction. At the same time, the dynamic range across all channels was expanded by approximately 2.4–2.6×, allowing subtle AC-component-based color variations to become more prominent and thereby contributing to an improvement in the SNR of the rPPG signal.

In addition to analyzing changes in color components, Temporal Stability was evaluated to assess the temporal stability of brightness variations. The results show that Temporal Stability improved by approximately twofold across all channels, suggesting that the proposed illumination normalization helps suppress temporal brightness instability and flicker components. Temporal flicker can induce low-frequency brightness variations that may overlap with the frequency band of rPPG signals, potentially degrading heart rate estimation accuracy. Therefore, the proposed normalization process can be interpreted as providing a more stable illumination environment for rPPG signal extraction by suppressing such flicker components.

To further verify rPPG performance under stabilized illumination conditions, inter-channel correlations were analyzed. Compared to the original video, the correlations among the R–G–B channels increased by an average of approximately

2 %

, while a relatively larger improvement of about

11 %

was observed for the Cb–Cr channel pair. This can be interpreted as a result of the corrected luminance component being consistently reflected across the RGB channels during brightness normalization, leading to improved color balance. In the chrominance domain, brightness normalization reduced mutual interference between the Cb and Cr channels and yielded a more normalized color-difference distribution, indicating reduced illumination-induced color distortion and enhanced preservation of independent chromatic information. These properties are particularly important for color-based physiological signal analysis such as rPPG, as they enable more stable separation of chromatic components and improve the overall reliability and robustness of signal extraction.

The final performance reported in this study was obtained by applying SNR-based signal selection after brightness normalization. However, the observed gains were not solely attributable to the filtering step. As shown in the comparative results, I-norm alone already yielded overall improvements over the baseline across multiple conditions, indicating that illumination normalization itself played a major role in the performance gain. The SNR-based filtering step provided additional improvement by excluding low-quality segments, thereby further enhancing the stability of BPM estimation. On the DLCN dataset, the MAE of the CHROM method under the most complex illumination condition (VI & VP) was reduced from approximately 18–19 BPM, as reported in prior benchmark studies, to

4.87

BPM, corresponding to an error reduction of over

70 %

, while PTE6 increased substantially from below

50 %

to over

80 %

. Although performance degradation was observed in some scenarios compared to the deep-learning-based CSTR-Net, this limitation can be attributed to the algorithm-based nature of the proposed approach rather than a learning-based model. Nevertheless, achieving illumination-robust performance without any additional training highlights the practical significance of the proposed method for real-time and lightweight systems. Similar trends were observed on the MR-NIRP dataset, where the MAE of the POS and ICA methods in the Driving Small Motion scenario decreased from

11.15

BPM to

4.15

BPM and from

13.88

BPM to

1.43

BPM, respectively, demonstrating competitive performance compared to prior MR-NIRP-based studies and some deep-learning-based models. These results suggest that the proposed brightness normalization and SNR-based post-processing generalize well not only to controlled environments but also to real driving conditions, indicating the potential to extend the practical limits of algorithm-based rPPG.

The findings of this study suggest that the proposed method is effective in improving robustness to illumination changes; however, its benefit varied depending on the algorithm and scenario. Larger improvements were observed for algorithms that are sensitive to inter-channel color changes, such as CHROM and POS, whereas methods such as ICA still exhibited relatively large errors under some complex conditions. This may be attributed to the fact that the proposed method primarily stabilizes the brightness component, but does not simultaneously address other performance-degrading factors, including motion, changes in reflection characteristics, and ROI tracking errors. In addition, because motion-intensive conditions were excluded from MR-NIRP in order to focus on the analysis of illumination effects, the present results have limited generalizability to real-world environments involving substantial motion. The exclusion of motion-intensive scenarios was a deliberate design choice to isolate illumination effects. Including large motion would introduce confounding factors, making it difficult to attribute performance changes specifically to illumination normalization. Broader evaluation against additional classical normalization methods remains for future work. In addition, direct comparison with alternative quality-aware selection strategies was not included in the present study, which should be addressed in future work. Therefore, future work should explore the integration of illumination normalization with motion artifact suppression and ROI stabilization techniques, while also evaluating its performance against other quality-aware selection approaches.

This study adopts an algorithm-based approach focused on compensating for illumination variations, and therefore performance limitations may arise in environments involving large motion or in scenarios requiring learning-based representations. Although the proposed method showed strong BPM estimation performance, the temporal waveform remained unstable under some low-light conditions, which may limit its applicability to waveform-based physiological indicators such as HRV (Heart Rate Variability). However, waveform fidelity itself was not quantitatively analyzed in this study, and thus the severity of this instability could not be specified numerically. In addition, direct physiological validation of hemodynamic information preservation was not provided, although improved temporal stability, preserved color-direction structure, and consistent BPM gains across multiple rPPG algorithms provide indirect evidence that the method does not severely distort blood-flow-related color variations. Moreover, while embedded hardware implementation was not directly evaluated, the proposed method consists only of lightweight, deterministic normalization operations, suggesting its feasibility as a compact preprocessing block for embedded rPPG pipelines. Future work will therefore include waveform-level validation, integration with motion compensation, and extension to deep-learning-based and embedded rPPG systems.

6. Conclusions

In this study, we proposed an illumination normalization technique and an SNR-based signal selection strategy to mitigate the degradation of rPPG signal quality caused by brightness variations. The proposed illumination normalization selectively compensates for the luminance component while preserving the relative relationships among color components. The experimental results showed that after brightness normalization, the mean variation in the color channels remained within ±6–12%, indicating that the structural characteristics of the color information were preserved. In addition, the normalization process helped suppress global brightness fluctuations and flicker components caused by illumination changes by expanding the dynamic range and improving temporal stability.

This signal-level stabilization led to significant improvements in heart rate estimation performance. On the DLCN dataset, under the most challenging illumination condition (VI&VP), the MAE of the CHROM method decreased from approximately 18–19 BPM reported in previous studies to 4.87 BPM, while the proportion of estimates within the tolerance range (PTE6) increased substantially to 82.48%. Similar improvements were observed on the MR-NIRP dataset. In the Driving Small Motion scenario, the MAE of the POS and ICA algorithms decreased to 4.15 BPM and 1.43 BPM, respectively, showing performance gains under the driving conditions considered in this study.

Although a performance gap remains in some scenarios compared with learning-based models, the proposed approach improves robustness to illumination variations without requiring additional training or increased model complexity. Therefore, the proposed method can serve as an effective preprocessing strategy for existing rPPG algorithms and has strong potential for real-time, lightweight physiological signal measurement systems.

Author Contributions

Conceptualization, B.S.A., S.H.P. and E.C.L.; methodology, B.S.A. and S.H.P.; software, B.S.A., S.H.P., Y.J.K. and Y.R.S.; validation, B.S.A. and S.H.P.; formal analysis, B.S.A. and S.H.P.; investigation, B.S.A., S.H.P., Y.J.K. and Y.R.S.; resources, G.J.C. and E.C.L.; data curation, B.S.A. and Y.J.K.; writing—original draft preparation, B.S.A. and S.H.P.; writing—review and editing, B.S.A., S.H.P. and E.C.L.; visualization, B.S.A. and S.H.P.; supervision, E.C.L.; project administration, E.C.L.; funding acquisition, E.C.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Research Foundation of Korea (NRF), funded by the Korean government, including the Ministry of Education (NRF-2024S1A5C3A02043877) and the Ministry of Science and ICT (RS-2024-00340935).

Institutional Review Board Statement

Ethical review and approval were waived for this study because it used only publicly available anonymized datasets and did not involve direct interaction or intervention with human participants.

Data Availability Statement

The data used in this study include the publicly available DLCN and MR-NIRP datasets. These datasets are available from their respective official repositories or from the authors of the original datasets. Further information regarding the data used in this study can be obtained from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Xiao, H.; Liu, T.; Sun, Y.; Li, Y.; Zhao, S.; Avolio, A. Remote photoplethysmography for heart rate measurement: A review. Biomed. Signal Process. Control 2024, 88, 105608. [Google Scholar] [CrossRef]
Chen, W.; Yi, Z.; Lim, L.J.R.; Lim, R.Q.R.; Zhang, A.; Qian, Z.; Huang, J.; He, J.; Liu, B. Deep learning and remote photoplethysmography powered advancements in contactless physiological measurement. Front. Bioeng. Biotechnol. 2024, 12, 1420100. [Google Scholar] [CrossRef]
Wang, W.; Den Brinker, A.C.; Stuijk, S.; De Haan, G. Algorithmic principles of remote PPG. IEEE Trans. Biomed. Eng. 2016, 64, 1479–1491. [Google Scholar] [CrossRef]
De Haan, G.; Jeanne, V. Robust pulse rate from chrominance-based rPPG. IEEE Trans. Biomed. Eng. 2013, 60, 2878–2886. [Google Scholar] [CrossRef]
Land, E.H.; McCann, J.J. Lightness and retinex theory. J. Opt. Soc. Am. 1971, 61, 1–11. [Google Scholar] [CrossRef]
Morel, J.M.; Petro, A.B.; Sbert, C. A PDE formalization of Retinex theory. IEEE Trans. Image Process. 2010, 19, 2825–2837. [Google Scholar] [CrossRef]
An, G.; Wu, J.; Ruan, Q. An illumination normalization model for face recognition under varied lighting conditions. Pattern Recognit. Lett. 2010, 31, 1056–1067. [Google Scholar] [CrossRef]
Lin, H.; Shi, Z. Multi-scale retinex improvement for nighttime image enhancement. Optik 2014, 125, 7143–7148. [Google Scholar] [CrossRef]
Lin, Y.C.; Lin, Y.H. A study of color illumination effect on the SNR of rPPG signals. In 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC); IEEE: Jeju, Republic of Korea, 2017; pp. 4301–4304. [Google Scholar]
Botina-Monsalve, D.; Benezeth, Y.; Miteran, J. RTrPPG: An ultra light 3DCNN for real-time remote photoplethysmography. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 19–20 June 2022; pp. 2146–2154. [Google Scholar]
Liu, X.; Hill, B.; Jiang, Z.; Patel, S.; McDuff, D. Efficientphys: Enabling simple, fast and accurate camera-based cardiac measurement. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, Waikoloa, HI, USA, 2–7 January 2023; pp. 5008–5017. [Google Scholar]
Zhang, X.; Wang, W. A variational framework based on synergy between illumination and reflectance for Retinex decomposition. Neurocomputing 2025, 647, 130480. [Google Scholar] [CrossRef]
Liu, S.; Long, W.; He, L.; Li, Y.; Ding, W. Retinex-based fast algorithm for low-light image enhancement. Entropy 2021, 23, 746. [Google Scholar] [CrossRef]
Hu, C.H.; Yu, J.; Wu, F.; Zhang, Y.; Jing, X.Y.; Lu, X.B.; Liu, P. Face illumination recovery for the deep learning feature under severe illumination variations. Pattern Recognit. 2021, 111, 107724. [Google Scholar] [CrossRef]
Acharya, B.; Saakyan, W.; Hammer, B.; Drimalla, H. The reliability of remote photoplethysmography under low illumination and elevated heart rates. npj Digit. Med. 2025, 8, 744. [Google Scholar] [CrossRef]
Shao, H.; Luo, L.; Qian, J.; Yan, M.; Chen, S.; Yang, J. Remote Photoplethysmography in Real-World and Extreme Lighting Scenarios. In Proceedings of the Computer Vision and Pattern Recognition Conference, Nashville, TN, USA, 11–15 June 2025; pp. 10858–10867. [Google Scholar]
Yu, Z.; Shen, Y.; Shi, J.; Zhao, H.; Torr, P.H.; Zhao, G. Physformer: Facial video-based physiological measurement with temporal difference transformer. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; pp. 4186–4196. [Google Scholar]
Xi, L.; Chen, W.; Zhao, C.; Wu, X.; Wang, J. Image enhancement for remote photoplethysmography in a low-light environment. In 2020 15th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2020); IEEE: Buenos Aires, Argentina, 2020; pp. 1–7. [Google Scholar]
Chen, S.; Ho, S.K.; Chin, J.W.; Luo, K.H.; Chan, T.T.; So, R.H.; Wong, K.L. Deep learning-based image enhancement for robust remote photoplethysmography in various illumination scenarios. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada, 17–24 June 2023; pp. 6077–6085. [Google Scholar]
Jaiswal, K.B.; Meenpal, T. rPPG-FuseNet: Non-contact heart rate estimation from facial video via RGB/MSR signal fusion. Biomed. Signal Process. Control 2022, 78, 104002. [Google Scholar] [CrossRef]
Li, Z.; Wang, K.; Xiao, H.; Liu, X.; Zhou, F.; Jiang, J.; Liu, T. Exploring remote physiological signal measurement under dynamic lighting conditions at night: Dataset, experiment, and analysis. arXiv 2025, arXiv:2507.04306. [Google Scholar] [CrossRef]
Nowara, E.M.; Marks, T.K.; Mansour, H.; Veeraraghavan, A. Near-Infrared Imaging Photoplethysmography During Driving. IEEE Trans. Intell. Transp. Syst. 2020, 23, 3589–3600. [Google Scholar] [CrossRef]
Guo, J.; Zhu, X.; Yang, Y.; Yang, F.; Lei, Z.; Li, S.Z. Towards fast, accurate and stable 3D dense face alignment. In Computer Vision-(ECCV); Springer: Cham, Switzerland, 2020; pp. 152–168. [Google Scholar]
Poh, M.Z.; McDuff, D.J.; Picard, R.W. Non-contact, automated cardiac pulse measurements using video imaging and blind source separation. Opt. Express 2010, 18, 10762–10774. [Google Scholar] [CrossRef]
Verkruysse, W.; Svaasand, L.O.; Nelson, J.S. Remote plethysmographic imaging using ambient light. Opt. Express 2008, 16, 21434–21445. [Google Scholar] [CrossRef]
Pilz, C.S.; Zaunseder, S.; Krajewski, J.; Blazek, V. Local group invariance for heart rate estimation from face videos in the wild. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Salt Lake City, UT, USA, 18–22 June 2018; pp. 1254–1262. [Google Scholar]
De Haan, G.; Van Leest, A. Improved motion robustness of remote-PPG by using the blood volume pulse signature. Physiol. Meas. 2014, 35, 1913–1926. [Google Scholar] [CrossRef]
Abdi, H.; Williams, L.J. Principal component analysis. Wiley Interdiscip. Rev. Comput. Stat. 2010, 2, 433–459. [Google Scholar] [CrossRef]
Xiao, H.; Jiang, J.; Liu, S.; Li, Z. CSTR-Net: A Dual-Branch Contrastive Spatio-Temporal Enhancement Network for rPPG. SSRN 2025. [Google Scholar] [CrossRef]
Bouraffa, T.; Koutsakis, D.; Zelvyte, S. Deep Learning-based rPPG Models Towards Automotive Applications: A Benchmark Study. In Proceedings of the Winter Conference on Applications of Computer Vision, Tucson, AZ, USA, 28 February–4 March 2025; pp. 1171–1180. [Google Scholar]
Kim, S.; Lee, K.; Hwang, J.; Lee, H.; Lee, E.C. Enhanced remote photoplethysmography using hybrid RGB and near-infrared cameras. J. Electron. Imaging 2024, 33, 063058. [Google Scholar] [CrossRef]

Figure 1. Overall framework of the proposed rPPG extraction method under varying illumination conditions.

Figure 2. Example frames from the DLCN dataset across different illumination scenarios over time. In the scenario labels, I and P denote the intensity and position of the light source, respectively, while F and V indicate fixed and variable conditions.

Figure 3. Example frames from the MR-NIRP dataset under diverse illumination conditions in real-world driving environments.

Figure 4. Example of illumination–color decomposition from an input RGB frame, where the color component represents illumination-invariant inter-channel RGB ratios.

Figure 5. Overall framework of the proposed I-norm + SNR based rPPG processing.

Figure 6. Distribution of the AC/DC ratio on the DLCN dataset before and after applying illumination normalization (I-norm), where yellow and blue indicate the pre- and post-normalization results, respectively.

Figure 7. Comparison of DLCN dataset frames before and after applying illumination normalization (I-norm). (a) Original frames; (b) frames after applying I-norm.

Figure 8. Comparison of R, G, B, Cb, and Cr signals before and after illumination normalization.

Figure 9. Comparison of baseline rPPG signals, illumination-normalized rPPG signals (I-norm), and reference cPPG signals for different extraction methods in both the time and frequency domains.

Figure 10. Example rPPG signal waveforms classified by the SNR = 0 criterion.

Figure 11. Differences in the mean values of the R, G, B, Cb, and Cr channels before and after brightness normalization, illustrating that relative color relationships are largely preserved. The black circle denotes the zero-change reference.

Table 1. Comparison of related work and positioning of the proposed illumination normalization.

Category	Method/Key Idea	Contribution	Limitation (rPPG)	How Our Method Complements
Prior illumination normalization	[12] — Illumination–reflectance estimation	Joint illumination/reflectance estimation with high restoration quality.	Heavy optimization cost; unsuitable for real-time facial videos.	Learning-free, frame-wise RGB normalization enabling real-time rPPG.
	[13]—HSV-based RBFA	Improves low-light visual quality via V-channel adjustment.	Single-frame focus; temporal color variation not considered.	Explicitly preserves temporal color variation relevant to rPPG.
	[14]—Illumination recovery	Improves face recognition under illumination changes.	Heavy inference; risk of losing intrinsic skin color.	Closed-form normalization preserving chromatic direction.
Model-based low-light rPPG	[15]—CHILL benchmark	Systematic evaluation of DL rPPG under low light.	Large performance drop in cross-dataset settings.	Stabilizes rPPG across environments without retraining.
	[16]—Transformer-based rPPG	Strong performance under extreme lighting.	High complexity limits real-time deployment.	Lightweight preprocessing without deep models.
	[17]—PhysFormer	Transformer with temporal and frequency modeling.	Large model size and computation.	Formula-based normalization without heavy architectures.
Retinex-based low-light rPPG	[18]—Retinex-LIME + rPPG	Improves HR estimation under very low illumination.	G-channel bias; performance degrades as illumination rises.	Three-channel ratio preservation across illumination levels.
	[19]—DL Retinex enhancement	Improves rPPG via learning-based enhancement.	No explicit analysis of temporal color preservation.	Signal-level SNR filtering without additional training.
	[20]—rPPG-FuseNet	MSR-based fusion improves low-light rPPG.	Depends on learning-based fusion.	Consistent improvement via learning-free normalization.

Table 2. Comparison of DLCN and MR-NIRP datasets.

Category	DLCN	MR-NIRP
Environment	Controlled laboratory setting	Real-world driving environment
Illumination conditions	Systematically controlled illumination	Uncontrolled, real-world illumination
Motion level	Low	Low to moderate (driving scenarios)
Scenario design	Fixed/variable intensity and position	Driving-based various lighting mixing scenarios
Evaluation purpose	Robustness to structured illumination changes	Generalization under realistic illumination

Table 3. Comparison of illumination normalization methods in terms of their suitability for rPPG signal extraction.

Method	Key Differences from an rPPG Perspective	Impact on rPPG Extraction
Retinex-based Enhancement	Nonlinear redistribution of color and contrast significantly alters RGB channel ratios	High risk of rPPG signal loss due to distortion of subtle skin color variations
Histogram Equalization	Histogram-based mapping largely changes frame-to-frame color and brightness structure	Color-based rPPG features are not preserved, and temporal instability degrades signal consistency
Gamma Correction	R, G, and B channels are nonlinearly modified at different rates depending on the gamma value	Disruption of inter-channel relative changes increases rPPG estimation errors
Color Constancy	Correction of illumination color shifts the intrinsic skin tone	Reduction in subtle inter-channel ratio variations leads to decreased rPPG sensitivity
Proposed Method	Fully preserves RGB color direction (relative ratios) while reducing temporal illumination variations	Blood-flow-induced color changes are better maintained while illumination drift is reduced.

Table 4. Statistical differences across color channels under illumination variation.

Channel	Mean Difference	Standard Deviation Difference	Range Ratio
R	+6%	±0.001	2.56
G	+7%	±0.001	2.41
B	+12%	±0.003	2.62
Cb	−6%	±0.001	2.46
Cr	+11%	±0.002	2.42

Table 5. Stability ratio values across different color channels.

Metric	R	G	B	Cb	Cr
Stability ratio	2.65	2.54	2.36	2.40	2.03

Table 6. Inter-channel correlation comparison before and after illumination normalization.

Condition	R–G	G–B	R–B	Cb–Cr
RAW	0.878	0.916	0.830	−0.431
I-norm	0.895	0.926	0.851	−0.322

Table 7. Performance comparison before and after illumination normalization on the DLCN dataset.

			POS				ICA				CHROM
Method	Category	Condition	MAE	RMSE	MAPE (%)	PTE6 (%)	MAE	RMSE	MAPE (%)	PTE6 (%)	MAE	RMSE	MAPE (%)	PTE6 (%)
Baseline	State	Rest	12.33	14.66	15.49	58.89	9.20	10.54	11.22	54.54	11.33	11.33	15.13	61.66
	State	Exercise	10.08	12.04	11.68	63.15	12.93	14.81	13.46	47.16	8.12	8.12	8.99	69.57
	Scenario	FI & FP	4.82	5.97	6.13	84.03	5.99	7.21	7.37	74.49	7.71	7.71	10.57	76.80
		VI & FP	13.01	15.52	16.16	56.16	11.91	13.44	13.67	47.67	8.21	8.21	10.74	74.88
		FI & VP	12.70	15.15	15.18	57.72	13.12	15.05	14.35	39.74	10.52	10.52	12.39	59.00
		VI & VP	14.31	16.78	16.88	46.17	13.24	15.02	13.99	41.52	12.46	12.46	14.54	50.79
I-norm	State	Rest	7.56	8.94	8.98	65.91	12.21	13.91	14.01	46.74	5.84	6.89	6.81	73.32
	State	Exercise	10.82	12.76	10.55	60.57	18.52	20.74	18.42	40.08	9.71	11.42	9.45	65.21
	Scenario	FI & FP	3.59	4.48	4.15	85.57	7.50	8.89	8.72	68.82	2.81	3.31	3.08	88.15
		VI & FP	9.62	11.41	10.47	62.30	15.58	17.50	16.77	42.05	6.42	7.69	6.81	74.35
		FI & VP	9.56	11.62	10.03	61.54	17.99	20.38	18.62	34.09	8.95	10.68	9.31	65.11
		VI & VP	13.99	15.89	14.40	43.56	20.40	22.54	20.76	28.70	12.93	14.95	13.34	49.46

Table 8. Comparison of image enhancement methods.

Category	Condition	Image Enhancement Method
		I-Norm			Retinex			Gamma			Lab
		POS	ICA	CHROM	POS	ICA	CHROM	POS	ICA	CHROM	POS	ICA	CHROM
State	Rest	7.56	12.21	5.84	9.00	13.94	7.74	8.87	13.73	7.74	10.28	14.70	8.71
State	Exercise	10.82	18.52	9.71	11.42	19.35	10.51	11.82	18.66	10.37	13.47	20.51	11.97
Scenario	FI & FP	3.59	7.50	2.81	4.90	10.17	4.06	4.87	9.88	4.80	7.15	11.47	6.95
	VI & FP	9.62	15.58	6.42	10.90	17.18	8.11	10.81	15.78	7.90	10.33	15.52	8.38
	FI & VP	9.56	17.99	8.95	12.04	19.15	10.98	11.83	19.22	11.00	14.63	21.43	12.67
	VI & VP	13.99	20.40	12.93	14.01	20.07	13.26	13.87	19.89	12.52	15.38	22.02	13.36

Table 9. BPM MAE performance across different window sizes for real-time rPPG evaluation. Lower MAE indicates better BPM estimation accuracy, and the best results are highlighted in bold.

		Window Size (Frames)
		30			20			10			5
Category	Condition	POS	ICA	CHROM	POS	ICA	CHROM	POS	ICA	CHROM	POS	ICA	CHROM
State	Rest	7.31	12.28	5.78	7.65	12.66	6.36	8.85	13.60	7.46	10.83	15.56	9.62
State	Exercise	10.13	17.61	8.97	10.61	18.21	9.27	11.89	19.12	10.23	13.77	20.57	12.06
Scenario	FI & FP	3.46	7.63	2.78	3.77	8.01	3.16	4.69	9.69	3.90	6.81	12.20	5.93
	VI & FP	8.66	14.88	6.14	9.41	15.47	6.76	10.89	16.94	8.03	12.96	18.86	10.22
	FI & VP	10.55	17.77	9.05	10.63	18.36	9.41	11.92	18.89	10.51	13.95	20.31	12.59
	VI & VP	12.21	19.50	11.54	12.72	19.90	11.94	13.98	19.93	12.95	15.48	20.90	14.62

Table 10. Performance comparison before and after illumination normalization on the MR-NIRP dataset.

	Baseline								I-Norm
	Driving Small Motion				Driving Still				Driving Small Motion				Driving Still
Algorithm	MAE	RMSE	MAPE (%)	PTE6 (%)	MAE	RMSE	MAPE (%)	PTE6 (%)	MAE	RMSE	MAPE (%)	PTE6 (%)	MAE	RMSE	MAPE (%)	PTE6 (%)
POS	13.59	14.77	14.01	42.34	12.20	13.65	12.72	48.06	11.15	11.53	12.42	56.58	9.43	10.09	10.21	68.89
ICA	16.09	17.80	17.31	34.00	10.83	12.26	11.26	54.40	13.88	14.52	14.58	41.67	11.31	11.44	12.74	50.00
CHROM	13.07	14.57	13.83	40.54	11.16	12.83	12.09	46.33	12.10	12.65	13.34	43.75	7.21	8.02	7.05	69.70
GREEN	17.13	18.63	18.28	28.31	13.83	15.40	14.78	39.13	15.11	15.93	15.66	45.37	15.68	16.19	17.28	30.00
LGI	12.86	14.43	13.17	48.70	11.05	12.79	11.56	52.49	10.79	11.23	10.09	71.67	7.68	8.61	7.91	73.23
PBV	13.98	15.34	14.41	40.78	12.69	14.34	13.37	33.73	10.27	10.71	11.22	59.80	12.54	13.32	12.54	45.27

Table 11. Performance comparison before and after applying illumination normalization with SNR filtering on the DLCN dataset.

			POS				ICA				CHROM
Method	Category	Condition	MAE	RMSE	MAPE (%)	PTE6 (%)	MAE	RMSE	MAPE (%)	PTE6 (%)	MAE	RMSE	MAPE (%)	PTE6 (%)
Baseline + SNR	State	Rest	8.57	9.70	10.91	64.27	8.64	9.75	10.51	55.31	7.70	7.70	10.20	67.02
	State	Exercise	7.32	8.43	8.27	67.28	11.73	13.10	12.30	48.94	6.96	6.96	7.82	70.87
	Scenario	FI & FP	3.47	4.09	4.33	85.83	5.44	6.28	6.68	76.03	4.73	4.73	6.32	81.18
		VI & FP	9.11	10.46	11.20	61.75	10.95	12.20	12.66	48.93	6.15	6.15	8.04	78.59
		FI & VP	8.60	9.88	10.26	63.83	12.23	13.67	13.25	40.39	8.42	8.42	9.86	61.51
		VI & VP	10.61	11.83	12.59	51.69	12.13	13.54	13.03	43.15	10.01	10.01	11.81	54.52
I-norm + SNR	State	Rest	2.91	3.04	3.26	88.85	5.83	6.44	6.71	70.52	2.36	2.46	2.76	91.62
	State	Exercise	4.30	4.44	4.54	84.82	11.51	11.55	11.02	63.91	3.59	3.71	3.83	87.59
	Scenario	FI & FP	0.85	1.00	1.00	99.66	1.92	2.12	2.28	92.71	1.08	1.21	1.23	97.15
		VI & FP	3.52	3.58	3.86	90.44	7.44	7.51	7.80	71.36	1.45	1.54	1.66	96.88
		FI & VP	2.96	3.12	3.33	90.56	12.43	12.44	12.15	52.10	4.51	4.64	4.98	81.92
		VI & VP	7.09	7.25	7.43	66.66	12.90	12.92	13.23	51.19	4.87	4.95	5.31	82.48

Table 12. Performance comparison before and after applying illumination normalization with SNR filtering on the MR-NIRP dataset.

	Baseline + SNR								I-Norm + SNR
	Driving Small Motion				Driving Still				Driving Small Motion				Driving Still
Algorithm	MAE	RMSE	MAPE (%)	PTE6 (%)	MAE	RMSE	MAPE (%)	PTE6 (%)	MAE	RMSE	MAPE (%)	PTE6 (%)	MAE	RMSE	MAPE (%)	PTE6 (%)
POS	9.17	9.45	9.05	71.43	8.52	9.40	8.91	67.00	4.15	4.75	5.18	89.58	4.88	5.67	5.84	86.11
ICA	9.57	10.20	11.21	53.12	7.38	7.97	7.89	60.91	7.36	8.46	7.70	71.43	1.43	1.67	1.82	92.86
CHROM	12.80	12.56	13.79	52.50	8.69	10.20	9.23	56.11	4.53	5.55	5.28	87.50	2.82	3.80	3.34	85.19
GREEN	11.72	12.24	13.80	40.62	9.46	9.94	10.85	46.53	8.44	9.51	9.18	68.05	5.38	5.94	6.57	75.00
LGI	5.87	6.62	6.32	77.14	6.85	7.69	7.21	72.19	4.24	4.78	4.68	89.58	5.66	6.65	6.36	78.11
PBV	8.36	9.55	10.00	62.50	12.19	12.90	11.86	46.27	4.22	4.81	5.34	84.72	4.14	5.05	4.51	84.08

Table 13. Window filtering results using the SNR = 0 threshold on the DLCN dataset.

Method	Total Windows	Remaining Windows	Loss Rate (%)
POS	1554	1515	2.51
ICA		1385	10.88
CHROM		1507	3.02

Table 14. Window filtering results using the SNR = 0 threshold on the MR-NIRP dataset.

Method	Total Windows	Remaining Windows	Loss Rate (%)
POS	171	166	2.92
ICA		167	2.34
CHROM		159	6.47
GREEN		165	3.51
LGI		167	2.34
PBV		165	3.51

Table 15. Comparison of the proposed method (I-norm + SNR) with the DLCN baseline and deep learning-based models on the DLCN dataset.

		Scenario
Category	Method	FI & FP				VI & FP				FI & VP				VI & VP
		MAE	RMSE	MAPE (%)	PTE6 (%)	MAE	RMSE	MAPE (%)	PTE6 (%)	MAE	RMSE	MAPE (%)	PTE6 (%)	MAE	RMSE	MAPE (%)	PTE6 (%)
DLCN Baseline [21]	POS	8.04	-	-	-	13.61	-	-	-	17.28	-	-	-	19.19	-	-	-
	ICA	9.26	-	-	-	24.20	-	-	-	27.01	-	-	-	29.85	-	-	-
	CHROM	11.10	-	-	-	14.64	-	-	-	18.43	-	-	-	19.21	-	-	-
Deep Learning [29]	CSTR-Net	1.49	2.52	1.75	97	-	-	-	-	1.01	1.50	1.19	99	3.05	4.28	3.46	91
Ours	I-norm + SNR	0.85	1.00	1.00	99.66	1.45	1.54	1.66	96.88	2.96	3.12	3.33	90.56	4.87	4.95	5.31	82.48

Table 16. Comparison of the proposed method (I-norm + SNR) with existing methods on the MR-NIRP dataset.

		Driving Small Motion				Driving Still
Category	Method	MAE	RMSE	MAPE (%)	PTE6 (%)	MAE	RMSE	MAPE (%)	PTE6 (%)
Algorithm-Baseline [30]	POS	8.54	13.87	12.26	-	8.14	13.97	12.37	-
	ICA	10.58	14.92	14.43	-	9.46	14.07	13.34	-
	CHROM	9.78	15.22	14.01	-	8.38	14.68	12.68	-
	GREEN	13.27	17.31	17.81	-	11.91	16.61	16.80	-
	LGI	10.68	15.41	14.36	-	8.74	14.08	12.51	-
	PBV	12.01	16.88	16.19	-	9.86	15.56	14.06	-
Model-based [30]	DeepPhys	11.79	16.49	15.71	-	7.00	12.06	9.74	-
	TS-CAN	11.73	15.98	15.58	-	7.09	11.73	9.91	-
	EfficientPhys-C	11.07	15.48	14.78	-	7.44	12.52	10.42	-
	PhysNet	7.04	11.55	9.20	-	4.37	9.00	6.11	-
	PhysFormer	9.17	12.97	12.08	-	6.35	10.81	8.57	-
Multimodal [31]	NIR + RGB (four channels)	-	-	-	56.73	-	-	-	62.36
Ours	I-norm + SNR	4.15	4.75	5.18	89.58	1.43	1.67	1.82	92.86

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

An, B.S.; Park, S.H.; Kim, Y.J.; Song, Y.R.; Cho, G.J.; Lee, E.C. Illumination-Invariant Normalization for Robust rPPG Extraction. Electronics 2026, 15, 1683. https://doi.org/10.3390/electronics15081683

AMA Style

An BS, Park SH, Kim YJ, Song YR, Cho GJ, Lee EC. Illumination-Invariant Normalization for Robust rPPG Extraction. Electronics. 2026; 15(8):1683. https://doi.org/10.3390/electronics15081683

Chicago/Turabian Style

An, Byeong Seon, Song Hee Park, Ye Jun Kim, Ye Rin Song, Geum Joon Cho, and Eui Chul Lee. 2026. "Illumination-Invariant Normalization for Robust rPPG Extraction" Electronics 15, no. 8: 1683. https://doi.org/10.3390/electronics15081683

APA Style

An, B. S., Park, S. H., Kim, Y. J., Song, Y. R., Cho, G. J., & Lee, E. C. (2026). Illumination-Invariant Normalization for Robust rPPG Extraction. Electronics, 15(8), 1683. https://doi.org/10.3390/electronics15081683

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Illumination-Invariant Normalization for Robust rPPG Extraction

Abstract

1. Introduction

2. Related Works

2.1. Prior Studies on Illumination Normalization

2.2. Model-Based rPPG Measurement Under Low-Light Conditions

2.3. Retinex-Based rPPG Measurement Under Low-Light Conditions

3. Method

3.1. Dataset

3.1.1. DLCN

3.1.2. MR-NIRP

3.2. Illumination Normalization

3.3. rPPG Extraction

4. Results

4.1. Signal-Level Validation of Illumination Normalization

4.2. Effect of Illumination Normalization on rPPG Extraction Performance

4.3. Effect of SNR-Based Signal Quality Filtering on rPPG Estimation

4.4. SNR-Based Quality Filtering Strategy

4.5. Comparison with Existing Benchmark Studies

5. Discussion

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI