1. Introduction
Stress is a physiological response to various factors that arises when an individual is unable to consciously cope with a specific situation. During a stressful situation, the sympathetic nervous system (SNS) is responsible for the fight-or-flight physiological reaction of the body, resulting in vasoconstriction and increased blood pressure and heart rate [
1]. In the long term, stress can lead to different health problems, being directly related to several physiological processes such as those involving the autonomic nervous system [
2], the immune system [
3], and the cardiovascular and respiratory systems [
4].
Stress has been extensively investigated in recent years [
1,
5] given the relevant consequences of a prolonged stressful condition on the body. Nevertheless, the detection of stress remains a challenging task since no standardized and validated methodology for stress assessment has been established as the gold standard. Among the methods used to measure stress, there are questionnaires [
6,
7], the visual analogue scale [
8], and the detection of specific biomarkers (e.g., cortisol) related to the stress level [
9,
10]. Scale-related stress assessment methods do not require expensive tools, but they are often time-consuming, subjective, and poorly suited to continuous monitoring. Conversely, biomarker detection methods allow for continuous monitoring by sensing biomarker fluctuations over time. However, these detection methods usually require at least minimally invasive and sophisticated tools, often based on non-reusable materials, even though significant advancements in biomaterials and biofabrication have made them much less invasive [
11].
Given the drawbacks of the above-mentioned detection methods, stress assessment through wearable sensors capable of acquiring physiological signals has recently emerged as a highly promising approach. Wearable devices have become increasingly small and affordable [
12], thus becoming non-intrusive tools able to handle continuous monitoring. Furthermore, the recent growth of classification and machine learning algorithms in the physiological data analysis area has profoundly improved stress evaluation [
13,
14]. In this context, the most employed biosignals are the electrocardiographic (ECG) [
15,
16], electromyographic (EMG) [
17], electroencephalographic (EEG) [
18], and photoplethysmographic (PPG) [
19] signals. These signals are often combined in a multi-domain approach that takes into account the interaction among multiple physiological signals, allowing the dynamics of each signal to be better analyzed and additional useful information to be extracted.
Starting from the acquired biosignals, several methods have been reported in the literature to determine the stress level of a subject [
13,
20], mostly based on the extraction of various features from the signals, followed by a classifier to predict the stress level. For example, Gupta and colleagues [
21] employed a support vector machine (SVM) classification on features extracted from the EEG signal. This classification approach was also used in [
22] with the combination of ECG and EMG signals, obtaining an excellent binary classification accuracy. Other studies used the K-Nearest Neighbors (KNN) algorithm for classification using either ECG [
16] or EEG [
18] signals.
Although most machine learning approaches can predict the stress level of a subject with good accuracy, they do not consider a critical aspect related to physiological data: the inter-subject variability. Physiological signals, whether acquired at rest or in response to stimuli, exhibit significant subject dependency. While the normal resting heart rate (HR) can depend on factors like fitness [
23] and age [
24], the normal physiological range for resting HR itself exhibits high variability. This range typically spans 60 to 100 beats per minute (bpm) across healthy adults [
25]. Whether measured in a clinical setting [
26] or in real-world, out-of-clinic environments, this variability persists, with little change in the upper limits: the 95th percentile of HR is typically less than 110 bpm in individuals aged 18–45, less than 100 bpm in those aged 45–60, and less than 95 bpm in individuals older than 60 years old [
27].
Furthermore, other critical subject-specific physiological features are linked to respiratory or electrodermal activity signals [
28]. For instance, the respiratory rate in adults shows large inter-subject variability, generally ranging from 12 to 20 breaths per minute [
29]. In this context, some studies aiming to detect stress have employed either feature extraction or deep learning approaches, often incorporating amplitude normalization to account for inter-subject amplitude differences. However, time-related differences among subjects are often not adequately addressed. Common normalization techniques like scaling [
30] and feature standardization [
31] do not account for individual subject dependencies in the time domain, particularly concerning raw signals.
One possible approach to overcome this limitation is to normalize features by transforming the original feature vector into a common feature space where the feature exhibits the same mean or an arbitrary value across all subjects. However, this approach has a limitation: normalization is applied to a single feature independently (e.g., the RR interval), leading to a loss of correspondence with other features derived from the same signal. For instance, in the case of normalized RR features [
32,
33], the RR series is normalized by the mean value of all RR intervals within one ECG recording. Yet the remaining extracted features are often derived from the original, unnormalized signal, thereby losing their association with the normalized RR feature.
Therefore, it is crucial to develop a normalization procedure that addresses this limitation. This procedure must allow for the extraction of all features from a signal that is already in a normalized domain, ensuring that all features remain interconnected and associated within the same normalized framework. The aim of this work is to introduce a novel normalization approach for physiological signals to optimize the entire feature extraction pipeline. This allows the features to effectively account for inter-subject variability across the entire signal. Our algorithm builds upon the previous work of Gasparini et al. [
34], which introduced a methodology for classifying PPG signal features that included an inter-subject normalization procedure to mitigate variability between individuals. As a key novelty, we propose a new interpretation of this inter-subject normalization algorithm, specifically tailored for multi-domain feature extraction within the context of multilevel stress classification. We validated our approach using an open-access database of multimodal physiological data collected during various driving conditions in a controlled environment [
30]. A key distinction from Gasparini et al.’s method is that we applied the inter-subject normalization procedure to two physiological signals: ECG and respiratory data. Notably, our novel methodology operates directly on the raw physiological signals, transforming them into a different domain. This means that any features subsequently extracted from these normalized signals are inherently normalized within a common framework. The structure of this normalization is directly determined by the specific features used in the normalization process itself—heart rate for ECG signals and breath rate for respiratory signals in this project. This ensures a strong and inherent interrelation among all extracted features.
The performance of this algorithm was tested in a feature-based classification of driving stress using an SVM classifier. The features, derived from various physiological signals, were combined during the classification step. The classification performances were then compared when either skipping or utilizing this inter-subject normalization procedure in the preprocessing step. Additionally, these results were compared with those obtained using other commonly employed feature normalization approaches.
2. Materials and Methods
2.1. Physiological Data
The data used in this study belongs to the database of J.A. Healey and colleagues [
30], a multimodal dataset of synchronized physiological data and video recordings. This dataset, previously employed in stress assessment studies [
35,
36], specifically comprises data collected from subjects under different driving conditions. Three different stress states were elicited during the experimental session, each one associated with a specific driving condition: low stress (resting, no driving), high stress (city driving) and medium stress (highway driving). The low-stress state was collected in two 15 min sessions, respectively, at the beginning and at the end of the experiment (herein referred to as resting1 and resting2). In this condition, the subjects were sitting in a garage with their eyes closed, while the car was parked and idle. The medium-stress state was monitored while driving on the highway in two sessions (highway1 and highway2). The high stress was induced by driving in the congested streets of Boston (three driving phases named city1, city2 and city3). Depending on traffic, acquisitions lasted from 50 to 90 min, including the resting phases.
The signals taken into account in this study were the ECG waveform, acquired through lead II configuration (sampling frequency of 496 Hz), and the respiration signal, recorded through an elastic Hall effect sensor monitoring the chest cavity expansion (acquired at 31 Hz). Consistent with the methodology of Lee’s study, only acquisitions with complete data, including ECG and respiratory data for each stress state and temporal division information for each driving state, were considered. The analysis thus comprises ten unique acquisitions.
2.2. Signal Preprocessing
The data preprocessing procedure prepared the raw physiological data for the successive feature extraction step. Following previous works [
14], the preprocessing in this study consisted of filtering the raw physiological data and then partitioning the data based on the labelled stress state.
The raw ECG signal was filtered using a zero-phase bandpass Butterworth filter (0.1–115 Hz cutoff frequencies). For the raw respiratory signal, its mean value was first removed, and then the signal was filtered with a zero-phase bandpass Butterworth filter (0.01–5 Hz cutoff frequencies). Both the ECG and the respiratory signals were then resampled to 250 Hz, preserving data quality while reducing the computational load. The filtered signals were subsequently organized into a structure containing the three primary stress phases based on the reported labels: resting, highway driving, and city driving.
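As an illustration of this preprocessing stage, a minimal Python/SciPy sketch is reported below (the study itself was implemented in MATLAB; the filter orders and function names are illustrative assumptions rather than details of the original pipeline):

```python
import numpy as np
from scipy.signal import butter, filtfilt, resample_poly

def preprocess_ecg(ecg_raw, fs_in=496, fs_out=250):
    """Zero-phase band-pass filtering (0.1-115 Hz) followed by resampling to 250 Hz."""
    b, a = butter(4, [0.1, 115], btype="bandpass", fs=fs_in)   # order 4 is an assumption
    ecg_filt = filtfilt(b, a, ecg_raw)                          # filtfilt -> zero-phase
    return resample_poly(ecg_filt, fs_out, fs_in)               # 496 Hz -> 250 Hz

def preprocess_resp(resp_raw, fs_in=31, fs_out=250):
    """Mean removal, zero-phase band-pass filtering (0.01-5 Hz), resampling to 250 Hz."""
    resp = resp_raw - np.mean(resp_raw)
    b, a = butter(2, [0.01, 5], btype="bandpass", fs=fs_in)     # order 2 is an assumption
    resp_filt = filtfilt(b, a, resp)
    return resample_poly(resp_filt, fs_out, fs_in)               # 31 Hz -> 250 Hz
```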
Subsequently, each signal was segmented into 20-s windows with a 5-s overlap, thus performing a data augmentation, as this increased the number of samples available for classification. The choice of a 20-s window length was made to maximize the number of windows while preserving temporal information within each window [
37,
38,
39]. The 5-s overlap was chosen to limit the dependence between consecutive samples, which, in turn, supports the model’s ability to generalize during stress state prediction. According to the study by Farias da Silva and colleagues [
40], a 5-s overlap was found to be the best compromise for the overlapping window data augmentation technique.
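A possible sketch of this windowing step, under the same illustrative assumptions (a one-dimensional preprocessed signal sampled at 250 Hz), is the following:

```python
import numpy as np

def segment_windows(signal, fs=250, win_s=20, overlap_s=5):
    """Split a signal into 20-s windows; consecutive windows share overlap_s seconds,
    so the window start advances by win_s - overlap_s = 15 s."""
    win = int(win_s * fs)
    step = int((win_s - overlap_s) * fs)
    starts = range(0, len(signal) - win + 1, step)
    return np.stack([signal[s:s + win] for s in starts])
```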
In the following analyses, two different sets of data originating from the same database were utilized to emphasize the effects of the inter-subject normalization procedure. The first dataset comprised raw signals subjected to the preprocessing procedure without any additional modifications, as described in this paragraph. The second dataset was generated by applying the inter-subject normalization process before the feature extraction step. Specifically, for each driver, the resting phases were considered for subject normalization. From now on, the term original signal will be used to identify the first dataset, while the second dataset will be identified by the term subject-normalized signal.
All the procedures, including the preprocessing steps, were conducted in the MATLAB environment, version R2024b [
41].
2.3. Inter-Subject Normalization
This study employs an inter-subject normalization algorithm to account for inherent physiological variability among individuals during stress classification. During resting states, characterized by low stress, each driver exhibits unique physiological characteristics. For instance, a healthy individual’s resting heart rate typically ranges from 60 to 100 beats per minute (bpm), while their respiratory rate can vary from 12 to 20 breaths per minute [
25,
42]. This physiological diversity persists across all stress levels and can significantly impact classification accuracy.
Consider an example: a heart rate of 80 bpm might be recorded for one subject during rest, but the same heart rate could indicate moderate stress in another subject. This observation extends to other physiological conditions and stress levels. Without addressing this inter-subject variability, classification algorithms trained and tested with inconsistent data can lead to misclassification errors.
To mitigate this issue, this paper presents a novel resampling procedure applied to raw ECG and respiratory signals. This approach extends previous work by Gasparini and colleagues [
34], offering a multidomain solution. The primary goal of this procedure is to reduce inter-subject variability in specific physiological features during the resting state. This reduction improves the ability to first identify changes in these features across different stress levels and, indirectly, extends the normalization to all other features extracted from the signals, as the signals themselves are transformed into a new normalized domain.
The normalization procedure involves selecting a specific feature from each physiological signal (ECG and respiration) to serve as the basis for normalization. Subsequently, a new sampling frequency is assigned to the original signal. This adjustment ensures consistency of the selected feature among all subjects within the resting phase. The inter-subject normalization procedure for both ECG and breath signals is further detailed in the following subsections.
2.3.1. ECG Signal
The inter-subject normalization procedure on the ECG signal was conducted considering the heart rate as the normalized feature. This ensured that, after normalization, all subjects exhibited the same average heart rate during the resting state.
Starting from the raw ECG signal with a sampling frequency $f_s$ equal to 250 Hz, the Pan–Tompkins algorithm [43] was applied to detect the R peaks during the two resting phases. Subsequently, the R-R interval (RRI) time series were extracted (Figure 1a), from which the heart rate series was derived as the inverse of the RRI time series. The mean heart rate $\overline{HR}_{rest}$ across the two resting phases was then calculated as a reference value using the following equation:

$$\overline{HR}_{rest} = \frac{1}{(N-1)+(M-1)}\left(\sum_{i=2}^{N}\frac{60{,}000}{t_i - t_{i-1}} + \sum_{j=2}^{M}\frac{60{,}000}{t_j - t_{j-1}}\right) \tag{1}$$

where $t_i$ (and analogously $t_j$) is the time (expressed in milliseconds) corresponding to the $i$-th R peak of the first (respectively, second) resting phase, and $N$ and $M$ represent the number of R peaks detected in the first and second resting phases, respectively.
The inter-subject normalization procedure aimed to transform all subjects' data into a subject-normalized domain. In this domain, each subject's resting heart rate was standardized to a predefined value $HR_{ref}$, which was set to 70 beats per minute (bpm). This normalization was achieved by resampling the raw ECG signals at a new sampling frequency $f_{r,ECG}$, which varied across subjects. The determination of $f_{r,ECG}$ was based on the ratio between the chosen resting frequency $HR_{ref}$ and each subject's mean resting heart rate $\overline{HR}_{rest}$, as follows:

$$f_{r,ECG} = f_s \cdot \frac{HR_{ref}}{\overline{HR}_{rest}} \tag{2}$$

where $f_s$ is the original sampling frequency (250 Hz). The resampling frequency $f_{r,ECG}$, determined from the resting-phase heart rate, was then consistently applied to the ECG signals recorded during the other stress phases (highway driving and city driving). This crucial step ensured that the temporal relationships and feature characteristics across different stress states were preserved following normalization.
Each subject could potentially exhibit a different heart rate during resting conditions. Therefore, following the described inter-subject normalization procedure, the raw ECG of each subject was resampled into a new domain characterized by a resampling frequency value determined by the ratio as described in (2). As an example, given a device that acquired the ECG signal with an original sampling frequency equal to 250 Hz:
If a driver exhibited a heart rate higher than 70 bpm at rest in the original signal, the value of the new resampling frequency was lower than 250 Hz.
If the driver presented a heart rate lower than 70 bpm at rest, the value of the new resampling frequency was higher than 250 Hz.
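The computation of the subject-specific resampling frequency of Equation (2) can be sketched as follows (a simple peak detector is used here as a stand-in for the Pan–Tompkins algorithm, and all names are illustrative):

```python
import numpy as np
from scipy.signal import find_peaks

FS = 250        # nominal sampling frequency after preprocessing (Hz)
HR_REF = 70.0   # common resting heart rate imposed after normalization (bpm)

def mean_resting_hr(rest1, rest2, fs=FS):
    """Mean heart rate (bpm) over the two resting phases, from detected R peaks."""
    hr = []
    for ecg in (rest1, rest2):
        peaks, _ = find_peaks(ecg, distance=int(0.4 * fs))  # >= 0.4 s between beats
        rri_ms = np.diff(peaks) / fs * 1000.0                # R-R intervals (ms)
        hr.append(60000.0 / rri_ms)                          # instantaneous HR (bpm)
    return float(np.concatenate(hr).mean())

def ecg_resampling_frequency(rest1, rest2, fs=FS, hr_ref=HR_REF):
    """Subject-specific resampling frequency of Eq. (2): f_r = f_s * HR_ref / HR_rest."""
    return fs * hr_ref / mean_resting_hr(rest1, rest2, fs)
```

Interpreting the same recorded samples at the frequency returned by this function rescales the subject's time axis so that the average resting heart rate equals 70 bpm.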
The inter-subject normalization procedure had two main consequences. Firstly, it shifted the signals of each subject into a new common domain based on the chosen feature (heart rate during resting, in the case of the ECG). In this new domain, each subject exhibited the same average value of the considered feature, while the relative intra-subject differences during both resting and stressful conditions remained unaffected. Furthermore, the length of the resampled signals differed from that of the original ones (Figure 1b), since a decrease or increase in the resampling frequency lengthens or shortens the signal, respectively.
2.3.2. Respiratory Signal
Analogous to the procedure carried out on the ECG signal, the inter-subject normalization procedure aimed to resample the original respiratory signal at a new frequency, ensuring that all subjects exhibited the same breath rate during the resting state. Therefore, the breath rate feature was chosen as the reference. Specifically, the resampling frequency $f_{r,RESP}$, defined by the following equation, was employed:

$$f_{r,RESP} = f_s \cdot \frac{BR_{ref}}{\overline{BR}_{rest}} \tag{3}$$

Here, $BR_{ref}$ is the selected resting breathing frequency (in this project, set to 14 breaths per minute), and $f_s$ is again the original sampling frequency (250 Hz). The value of $\overline{BR}_{rest}$ indicates the mean breath frequency during the two resting periods, detected on the original signal and derived as the inverse of the breath-to-breath interval (BBI) series. To obtain this value, the inspiration peak positions were first determined on the original signal (Figure 1c). Subsequently, the time differences between consecutive peaks were calculated; dividing 60,000 by each peak-to-peak interval expressed in milliseconds yields the breath rate in breaths per minute. The final $\overline{BR}_{rest}$ value was obtained by averaging all the breath rate values within the two resting phases. The value of $f_{r,RESP}$ derived in Equation (3) was then used to resample the respiratory signal across all stress phases, thereby preserving the temporal relationship between stress states for each considered feature.
Given that the original sampling frequency was consistent across different drivers, variations in the obtained resampling frequency across drivers were solely determined by the ratio between $BR_{ref}$ and $\overline{BR}_{rest}$. Consequently, a breath rate on the original signal higher than 14 breaths per minute resulted in a resampling frequency lower than 250 Hz, whereas a lower breath rate resulted in a resampling frequency higher than 250 Hz. The same implications of the inter-subject normalization outlined in the previous paragraph for the ECG traces also apply to the respiratory signals.
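The same scheme applies to the respiratory trace; a sketch under the same illustrative assumptions (inspiration peaks detected with a generic peak finder) is the following:

```python
import numpy as np
from scipy.signal import find_peaks

def resp_resampling_frequency(rest1, rest2, fs=250, br_ref=14.0):
    """Subject-specific resampling frequency of Eq. (3): f_r = f_s * BR_ref / BR_rest."""
    br = []
    for resp in (rest1, rest2):
        peaks, _ = find_peaks(resp, distance=int(2.0 * fs))  # >= 2 s between breaths
        bbi_ms = np.diff(peaks) / fs * 1000.0                 # breath-to-breath intervals (ms)
        br.append(60000.0 / bbi_ms)                           # breath rate (breaths/min)
    return fs * br_ref / float(np.concatenate(br).mean())
```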
2.4. Feature Extraction
Feature extraction was performed on 20-s windows of the signal. This methodology required the exclusive use of time-domain features, as a 20-s duration was considered inadequate for a comprehensive frequency-domain analysis. Accurate frequency-domain analysis typically requires signals of greater length and a higher density of data points to yield statistically significant results [
44,
45]. By segmenting the original signal into shorter windows, the number of resultant feature vectors available for the subsequent classification step was augmented, thereby contributing to a more generalizable algorithm. The 20-s window length was specifically chosen to balance the need for sufficient temporal information with the necessity to generate a substantial number of samples.
In the context of feature extraction in the time domain, it was essential to accurately identify key morphological points of interest in each signal. For instance, in the ECG signal, the identification of R peaks was crucial, while in the respiratory signal, the focus was on detecting respiratory peaks.
In this study, we employed a modified version of the Pan–Tompkins algorithm [46] to detect the positions of R peaks in the ECG signal from each window, resulting in the creation of the RR interval time series. From this, the obtained time-domain features were the average RR interval (RRmean), the standard deviation of the RR intervals (SDNN), and the root mean square of successive RR interval differences (RMSSD) [47]. In addition, the standard deviation of successive RR interval differences (SDSD), the number of successive RR intervals in each window differing by more than 50 milliseconds (NN50), and the associated percentage (pNN50) were calculated [48].
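A compact sketch of these HRV time-domain features, computed from the R-peak positions of one window (function and key names are illustrative):

```python
import numpy as np

def hrv_time_features(r_peaks, fs=250):
    """Time-domain HRV features from R-peak sample indices within one 20-s window."""
    rri = np.diff(r_peaks) / fs * 1000.0   # R-R intervals (ms)
    drri = np.diff(rri)                     # successive RR differences (ms)
    nn50 = int(np.sum(np.abs(drri) > 50))
    return {
        "RRmean": rri.mean(),
        "SDNN": rri.std(ddof=1),
        "RMSSD": np.sqrt(np.mean(drri ** 2)),
        "SDSD": drri.std(ddof=1),
        "NN50": nn50,
        "pNN50": 100.0 * nn50 / len(drri),
    }
```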
To extract the time-domain indices of the respiratory signal, the maximum and minimum points within each window were identified. The time interval between consecutive maxima was considered as one respiratory act. Therefore, the respiratory rate was determined by calculating the average of the inverse of the time distances between the maxima. Inspiration and expiration times were investigated by computing the average rise time and the average fall time of the breathing signal within each respiratory act. Additionally, the average inspiration and expiration areas were extracted by considering the area under the inspiration and expiration phases of the respiratory signal, respectively.
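The respiratory indices can be sketched analogously (the area features are omitted for brevity, and all names are illustrative):

```python
import numpy as np
from scipy.signal import find_peaks

def resp_time_features(resp, fs=250):
    """Time-domain respiratory features from one 20-s window."""
    maxima, _ = find_peaks(resp, distance=int(2.0 * fs))
    minima, _ = find_peaks(-resp, distance=int(2.0 * fs))
    bbi = np.diff(maxima) / fs                      # breath-to-breath intervals (s)
    resp_rate = float(np.mean(60.0 / bbi))          # breaths per minute
    rise, fall = [], []                              # inspiration / expiration times (s)
    for m in maxima:
        prev_min = minima[minima < m]
        next_min = minima[minima > m]
        if prev_min.size:
            rise.append((m - prev_min[-1]) / fs)
        if next_min.size:
            fall.append((next_min[0] - m) / fs)
    return {"resp_rate": resp_rate,
            "rise_time": float(np.mean(rise)) if rise else np.nan,
            "fall_time": float(np.mean(fall)) if fall else np.nan}
```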
It is worth noting that the difference in the length of raw signals and those subjected to the inter-subject normalization did not result in a difference in the length of the time series used in the feature extraction phase, for both the RRI and the BBI series. Additionally, any potential changes introduced by the inter-subject normalization process, particularly those associated with discrepancies in the time length between ECG signal samples and respiratory signal samples, did not affect stress classification. This is because the features were independently extracted from each physiological signal.
2.5. Data Organization and Imbalance Class Management
The feature dataset was organized in a table where each row represented a 20-s window, and each column corresponded to a specific feature. In total, 12 features were extracted from each of the 2979 windows for the original signals and each of the 2974 windows for the normalized signals. This slight difference in the number of samples was due to variations in the length of the normalized signals. These length changes resulted from different resampling frequencies applied to each subject, which in turn led to a different number of 20-s windows being obtainable from the complete signals.
An analysis of the number of windows associated with each stress label revealed a clear imbalance in class distribution. Specifically, for the time series derived from the original signals:
The resting condition had 1132 samples.
The city driving condition had 1275 samples.
The highway driving condition had 572 samples.
A similar sample distribution across the different classes was observed for the normalized signals. To address this class imbalance during classification and mitigate potential bias and inaccuracy that can arise when training a machine learning model on imbalanced data, the Synthetic Minority Oversampling Technique (SMOTE) [
49] was employed. This algorithm generates synthetic samples for classes with fewer existing samples by interpolating between points in the feature space. Applying SMOTE ensured an equal representation of samples across all classes.
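A minimal usage sketch of SMOTE on a feature table with the reported class counts is given below (synthetic stand-in data; the imbalanced-learn implementation is assumed here for illustration, not the toolbox used in the study):

```python
import numpy as np
from collections import Counter
from imblearn.over_sampling import SMOTE

rng = np.random.default_rng(0)
# Stand-in for the real feature table: 12 features per window, with the reported
# class counts (resting: 1132, city: 1275, highway: 572).
X = rng.normal(size=(1132 + 1275 + 572, 12))
y = np.array(["rest"] * 1132 + ["city"] * 1275 + ["highway"] * 572)

# SMOTE synthesizes minority-class samples by interpolating between a sample and
# one of its k nearest neighbours in feature space, equalizing all class counts.
X_bal, y_bal = SMOTE(k_neighbors=5, random_state=0).fit_resample(X, y)
print(Counter(y), "->", Counter(y_bal))
```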
2.6. Stress State Classification
To compare our findings with prior research on the same dataset, we employed a Support Vector Machine (SVM) for classification, mirroring the methodology of a previous study [
14]. Specifically, we utilized a Gaussian SVM, characterized by its Gaussian kernel function, a common choice for classification tasks [
50,
51]. The SVM was trained on 80% of the dataset, with the remaining 20% reserved for testing. This random partitioning strategy was employed to mitigate overfitting and ensure an unbiased evaluation of the model’s ability to generalize and accurately predict stress in unseen data.
Additionally, to account for potential variations in model performance due to different training and testing splits, we repeated the training and testing phases 100 times. This repetition reduces the influence of the random partitioning on the results, and the final classification performances were obtained by averaging the values across these 100 iterations.
For each set of results, a global confusion matrix was generated. This matrix illustrates the relationship between predicted and true labels, from which key performance metrics—including accuracy, sensitivity, specificity, precision, and F-measure—were derived to comprehensively evaluate the model’s performance.
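The evaluation loop can be sketched as follows (scikit-learn is used for illustration; leaving the SVM hyperparameters at their defaults is an assumption, since they are not specified here):

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.model_selection import train_test_split
from sklearn.metrics import confusion_matrix

def evaluate_svm(X, y, n_iter=100):
    """Gaussian-kernel SVM over 100 random 80/20 splits; per-split confusion
    matrices are accumulated into a single global matrix."""
    labels = np.unique(y)
    cm_total = np.zeros((labels.size, labels.size), dtype=int)
    accuracies = []
    for seed in range(n_iter):
        X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=seed)
        clf = SVC(kernel="rbf").fit(X_tr, y_tr)
        y_pred = clf.predict(X_te)
        cm_total += confusion_matrix(y_te, y_pred, labels=labels)
        accuracies.append(float(np.mean(y_pred == y_te)))
    return cm_total, float(np.mean(accuracies)), float(np.std(accuracies))
```

Per-class precision, sensitivity, and specificity can then be derived from the rows and columns of the accumulated confusion matrix.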
3. Inter-Subject Normalization Validation
The inter-subject normalization procedure was evaluated by comparing features extracted from subject-normalized signals with those from original signals. This evaluation also included comparisons to features processed using two common machine learning normalization techniques: standardization [
52] and scaling [
53].
Standardization
In the standardization procedure, each feature, derived from a window composed of a fixed number of seconds of the original signal, was normalized as follows:

$$x_{std} = \frac{x - \mu_x}{\sigma_x}$$

Here, $x$ indicates the original feature array, $\mu_x$ the mean of $x$, and $\sigma_x$ the standard deviation of $x$. This normalization resulted in a feature array with an average value equal to zero and a standard deviation equal to one.
Scaling
The scaling procedure remapped all features to the same scale, ensuring comparable ranges across features and subjects. For a single feature, the scaling procedure was represented as

$$x_{scaled} = \frac{x - \min(x)}{\max(x) - \min(x)}$$

Here, $x$ is the original feature array and $x_{scaled}$ indicates the array with the considered feature rescaled to the range [0, 1].
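Both transformations reduce to one-line operations on each feature column; a minimal sketch of the two formulas above:

```python
import numpy as np

def standardize(x):
    """Z-score standardization of a single feature array (zero mean, unit standard deviation)."""
    x = np.asarray(x, dtype=float)
    return (x - x.mean()) / x.std()

def min_max_scale(x):
    """Min-max scaling of a single feature array to the range [0, 1]."""
    x = np.asarray(x, dtype=float)
    return (x - x.min()) / (x.max() - x.min())
```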
Inter-Subject Normalization
Unlike standardization and scaling, which directly operated on the features, inter-subject normalization modified the original signal. Consequently, features derived from inter-subject-normalized signals retained their original relationships. In contrast, standardization and scaling procedures acted independently on each feature, potentially leading to a loss of inter-feature information.
To evaluate the performance of each normalization procedure, the Chi-square goodness-of-fit test was applied to assess the normality of the distributions of performance metrics (i.e., precision, sensitivity, and accuracy). These metrics were obtained from 100 iterations of training and testing an SVM model. The original data served as the baseline for comparison. Specifically, the test was conducted on performance distributions derived from the original features, comparing them with distributions from subject-normalized features, standardized features, and scaled features.
To ensure a robust statistical comparison among the four groups, the ANOVA test was employed. Additionally, the Bonferroni method was applied to account for multiple comparisons in the statistical analysis. A two-sample parametric unpaired Student’s t-test was performed to detect significant differences between groups. This test specifically aimed to compare the performances achieved using the three different normalization procedures with the performances derived from the original data. The null hypothesis for all conditions was that there were no significant differences between the original features and the features obtained after inter-subject normalization, standardization, or scaling procedures.
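The group comparison can be sketched with SciPy as follows (the chi-square normality check is omitted; variable names are placeholders for the per-iteration metric values of each condition):

```python
from scipy import stats

def compare_to_original(metric_orig, metric_norm, metric_std, metric_scaled, alpha=0.05):
    """One-way ANOVA across the four groups, then unpaired t-tests of each
    normalization against the original data with a Bonferroni-corrected threshold."""
    groups = {"subject-normalized": metric_norm,
              "standardized": metric_std,
              "scaled": metric_scaled}
    _, p_anova = stats.f_oneway(metric_orig, *groups.values())
    results = {"anova_p": p_anova}
    for name, g in groups.items():
        _, p = stats.ttest_ind(metric_orig, g)                  # two-sample unpaired t-test
        results[name] = {"p": p, "significant": p < alpha / len(groups)}
    return results
```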
4. Results
Following the inter-subject normalization procedure, both the electrocardiogram (ECG) and respiratory signals were resampled to a new domain. This resampling was performed using the individual’s resting heart rate and resting breath rate as normalization features, respectively.
Table 1 details the heart rates from the original signals and their associated resampling frequencies for each subject, while
Table 2 provides the same information for breath rates. It is important to note that a heart rate exceeding 70 bpm (or a breath rate greater than 14 breaths per minute) resulted in a resampling frequency lower than 250 Hz, and vice versa. This adaptive resampling ensured that the signals were transformed to a common domain relevant to each subject’s physiological baseline.
Starting from the inter-subject-normalized signals, and following the feature extraction and classification procedures, the developed Support Vector Machine (SVM) model was rigorously assessed. A confusion matrix was generated for each of the four experimental conditions examined in this study:
Original signals;
Subject-normalized signals;
Feature transformations through standardization;
Feature transformations using scaling procedures.
An example of a confusion matrix for a single iteration is illustrated in
Figure 2. This matrix was derived by training the SVM model on the training dataset and subsequently testing it on the unseen test data.
The performance metrics, averaged across 100 iterations for all four conditions, are presented in
Table 3 in terms of mean and standard deviation. Accuracy results are also visually represented in
Figure 3. In this figure, ‘n.s.’ as a superscript denotes no statistical significance, while three asterisks (***) indicate statistical significance (
p < 0.001) when comparing the original data to the considered group.
Our analysis revealed several key findings regarding the impact of different preprocessing techniques on model performance. When using the original, unprocessed data, the model consistently achieved precision, sensitivity, and accuracy of 68%, with specificity around 84%. On the other hand, both standardization and scaling procedures yielded results very similar to the original data, showing no substantial changes in precision, sensitivity, specificity, or accuracy. This suggests that these linear transformations alone did not significantly enhance the model’s ability to discriminate between classes in this context. In contrast, inter-subject normalization demonstrated an improvement in model performance. Precision, sensitivity, and accuracy all increased to 73%, while specificity also showed a slight improvement, reaching 86%. The statistical significance observed for the subject-normalized data, indicated by the three asterisks in
Figure 3, further underscores the effectiveness of this method. These improvements strongly suggest that inter-subject normalization holds significant promise for enhancing the overall performance of the model in similar physiological signal analysis tasks.
5. Discussion
This study investigated the feasibility of an inter-subject normalization procedure on electrocardiographic (ECG) and respiratory signals to account for physiological variability and thus improve classification performance. We selected ECG and respiratory data due to their established relevance in stress research, as evidenced by previous studies [
14,
38,
54]. These signals provide valuable insights into an individual’s physiological response to stress.
We performed a feature-based multilevel stress classification using a Support Vector Machine (SVM) classifier. This classification was conducted after applying inter-subject normalization and two widely used normalization procedures: standardization and scaling. Subsequently, we compared the classification performances derived from these procedures with those obtained from the original, unnormalized data.
As for general comments regarding the classification performance, the results from the confusion matrices (as shown in
Figure 2) showed that discriminating between high stress (city driving) and medium stress (highway driving) is more difficult than identifying the resting condition. This may be because the physiological behaviors associated with city and highway driving differ from each other less than either differs from the physiological activity at rest. To enhance this classification performance, a possible solution is to incorporate more features from other physiological signals to better capture changes in physiological behavior associated with these stress conditions. However, the main aim of this work is to present the contribution of the novel inter-subject normalization procedure, highlighting its performance against other normalization procedures. The extracted features, as well as the classification methodology adopted in the study, were kept identical across all conditions. This approach allows the study to focus entirely on the contribution of the different normalization methodologies on the same dataset, as demonstrated in
Table 3.
When using the original ECG and respiratory signals, our classification model consistently achieved a precision, sensitivity, and accuracy of 68%, while specificity was approximately 84%. Applying standardization and scaling techniques yielded comparable performance values (
Table 3), suggesting these methods do not significantly alter the model’s performance compared to using raw data. This outcome aligns with expectations, as both standardization and scaling primarily adjust data distribution without introducing substantial changes to the underlying information.
The inter-subject normalization procedure successfully reduced inter-subject variability in physiological features during the resting state, effectively aligning subjects with a common physiological domain. This normalization enhances the classification of stress levels by ensuring consistency in physiological data across resting and stressful conditions for different acquisitions. By minimizing the inherent physiological differences present in the original data, this approach allows the classification model to focus specifically on the true, stress-induced changes. While this strong mitigation of differences in the resting state could potentially introduce a risk of misinterpreting the physiological data, the sole goal of this procedure is to enhance stress assessment. It achieves this by providing a common reference point from which to highlight any differences observed under stressful conditions. The results clearly demonstrated the effectiveness of this approach, showing significant improvements over the original data, standardization, and scaling procedures. Precision, sensitivity, and accuracy increased from 68% to 73%, and specificity improved by approximately two percentage points. The enhancement in precision and sensitivity is particularly noteworthy, as these metrics directly reflect the model’s ability to accurately identify instances of stress.
Traditional feature normalization methods, such as standardization (z-score) [
52], min–max normalization [
53] and decimal-scaling [
55], are widely used in processing physiological signals. These approaches typically modify each feature independently by operating solely on the amplitude domain of the data (e.g., ECG or respiratory signal values) and ignoring the temporal relationships. While these methods often improve results compared to using the raw signal, their isolated treatment of features can disrupt the intrinsic consistency between features, since each feature ends up normalized to a different extent, potentially impacting the subsequent classification step. In contrast, the proposed inter-subject normalization procedure operates directly on the time axis of the original signals rather than their amplitude scale. This involves placing the signals into a transformed space defined by a distinct sampling frequency. This novel approach ensures that features subsequently derived from the inter-subject-normalized signals are indirectly normalized in time. Crucially, this normalization structure depends only on the specific physiological metrics used in the process (e.g., heart rate for ECG and breath rate for respiratory signals), thereby establishing a strong interconnection between all extracted features. This approach moves beyond simple, isolated feature adjustments (such as merely shifting a heart rate baseline to a fixed value like 70 bpm), in which the natural, intrinsic relationships with the other features are lost. Furthermore, since the proposed procedure primarily acts on the time axis, a common amplitude-domain normalization technique like z-score standardization could subsequently be applied to the time-normalized data. This two-step normalization would result in features being normalized in both the time axis (via the proposed inter-subject method) and the amplitude axis (via z-score), offering a potentially significant enhancement to overall classification performance.
A critical aspect of implementing this technique is identifying the most appropriate feature for defining the inter-subject normalization rescaling. In this project, we used the mean heart rate and the mean breath rate during the resting phases for the ECG and respiratory signals, respectively, given their clear physiological meaning.
Previous studies have demonstrated the use of physiological signals for stress assessment with various experimental protocols. For example, Smets and colleagues [
38] achieved approximately 82% classification accuracy using an SVM in a binary stress classification problem by combining ECG and respiratory features. Similarly, Han et al. [
54] discriminated among three stress states with 84% accuracy using similar features. These studies often incorporated both time- and frequency-domain features, which typically require longer signal windows to ensure adequate frequency resolution. More recent studies on the specific dataset we consider have employed deep learning techniques, reaching an overall accuracy of about 90%. However, these promising results were achieved without addressing inter-subject variability. The aim of our study is to validate the effectiveness of an inter-subject normalization procedure and to conduct a direct comparison with other widely used normalization techniques. By adopting the exact same data processing pipeline and only changing the data used for classification, we ensure a fair and rigorous comparison. Our work utilizes 20-s segments and the inter-subject normalization procedure to achieve efficient classification with strong performance, even without the inclusion of frequency-domain features. This is a significant advantage, as it suggests our framework could be employed for real-time stress classification based on ultra-short-term recordings. Our approach using inter-subject-normalized data could potentially be combined with deep learning techniques, providing the algorithm with more robust input than data that ignores inter-subject variability. Future studies should investigate this combination in depth.
Further investigation is required to thoroughly understand the effect of inter-subject normalization on the frequency domain and to explore the potential contribution of frequency-domain features in the stress classification step. While the framework shows promise for short recordings, considering longer time windows for analysis would be beneficial for comprehensive understanding. Additionally, it would be valuable to investigate the incorporation of additional physiological data suitable for the inter-subject normalization approach, such as electrodermal activity (EDA), electromyography (EMG), or advanced features from other wearable devices like accelerometers and skin temperature sensors; combining these various data types has the potential to enhance stress classification accuracy and offer a more comprehensive understanding of an individual’s stress response.
6. Conclusions
This study investigated the impact of a subject-normalization procedure on stress classification. It utilized electrocardiogram (ECG) and respiratory physiological signals from drivers across varying stress levels. Employing a feature-driven methodology, the analysis used a total of approximately 3000 samples.
The findings demonstrate that this novel normalization procedure significantly improves stress classification compared to traditional standardization and scaling methods. The inter-subject normalization approach effectively reduced variability between individuals, leading to consistent results in both resting and stressful states. This enhancement was reflected in improved precision, sensitivity, and accuracy, indicating a greater ability to correctly identify stress. A key advantage of inter-subject normalization is its direct operation on the original signals, fostering strong interconnectivity among features. This differs from other methods that modify features independently. This novel procedure therefore establishes a deeper link between features, as they originate from a shared signal domain.
Future research should explore incorporating additional physiological data, extracting advanced features from wearable devices, and analyzing frequency-domain features. Such efforts could further enhance stress classification accuracy and provide a more complete understanding of stress responses. Given these results, this innovative inter-subject normalization technique shows strong potential for real-time stress classification in diverse applications like healthcare and stress management.