Article

Measuring Heart Rate Variability Using Facial Video

by Gerardo H. Martinez-Delgado 1, Alfredo J. Correa-Balan 1, José A. May-Chan 1, Carlos E. Parra-Elizondo 1, Luis A. Guzman-Rangel 2 and Antonio Martinez-Torteya 3,*

1 Programa de Ingeniería Mecatrónica, Universidad de Monterrey, San Pedro Garza García 66238, Mexico
2 Programa de Maestría en Ingeniería del Producto, Universidad de Monterrey, San Pedro Garza García 66238, Mexico
3 Escuela de Ingeniería y Tecnologías, Universidad de Monterrey, San Pedro Garza García 66238, Mexico
* Author to whom correspondence should be addressed.
Sensors 2022, 22(13), 4690; https://doi.org/10.3390/s22134690
Submission received: 1 June 2022 / Revised: 13 June 2022 / Accepted: 16 June 2022 / Published: 21 June 2022
(This article belongs to the Special Issue Monitoring Technologies in Healthcare Applications)

Abstract:
Heart Rate Variability (HRV) has become an important risk assessment tool when diagnosing illnesses related to heart health. HRV is typically measured with an electrocardiogram; however, multiple studies use Photoplethysmography (PPG) instead. Measuring HRV from video is attractive as a non-invasive, hands-free, and more accessible alternative. We developed a methodology to extract HRV from video based on face detection algorithms and color augmentation, and we applied it to 45 samples. The heart rates measured from the PPG and video signals differed by a mean error of less than 1 bpm across all subjects. Furthermore, using PPG and video, we computed 61 HRV-related variables. We compared each of them with three correlation tests (i.e., Kendall, Pearson, and Spearman), adjusting the resulting p-values for multiple comparisons with the Benjamini–Hochberg method to control the false discovery rate and considering q-values lower than 0.5 statistically significant. Using these methods, we found significant correlations for 38 variables (e.g., Heart Rate, 0.991; Mean NN Interval, 0.990; and NN Interval Count, 0.955) across time-domain, frequency-domain, and non-linear methods.

1. Introduction

Heart Rate Variability (HRV), a quantitative marker of autonomic activity that measures the physiological phenomenon of variation in the time interval between heartbeats, has become a very important risk assessment tool. A reduced HRV can be associated with a poorer prognosis for multiple conditions, while more robust periodic changes in the R-R interval show signs of good health [1,2,3,4]. HRV is commonly related to heart health, but it can also provide useful information regarding blood pressure, gas exchange, and gut- and vascular-related matters [5].
According to the American Heart Association, HRV can be measured using time-domain, frequency-domain, and non-linear methods [6], with each method yielding different information for different applications. Time-domain features, such as the mean time between heartbeats (NN interval), the mean heart rate, and the standard deviation of the NN interval, are used to determine the heart rate at any point in time or the intervals between successive heartbeats. Frequency-domain methods are used to obtain information on how variance is distributed as a function of frequency. Some features of interest are the total power (i.e., the variance in NN intervals over the temporal segment) and the power in the very low- (VL), low- (L), and high- (H) frequency ranges (lower than 0.04 Hz, between 0.04 and 0.15 Hz, and between 0.15 and 0.4 Hz, respectively). Lastly, due to the unpredictability of the complex mechanisms that regulate the HRV, non-linear methods are also included because non-linear phenomena are involved.
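As an illustration of these definitions (this is not code from the paper), the following sketch computes a few representative time-domain features and the VL/L/H band powers from a hypothetical vector of NN intervals; the 4 Hz resampling rate and the Welch parameters are assumptions.

```python
# Illustrative sketch of the HRV feature types described above, assuming
# NumPy/SciPy. Band limits follow the text (VL < 0.04 Hz, L 0.04-0.15 Hz,
# H 0.15-0.4 Hz); resampling rate and Welch window length are assumptions.
import numpy as np
from scipy.interpolate import interp1d
from scipy.signal import welch

def hrv_features(nn_ms):
    """nn_ms: 1-D array of NN intervals in milliseconds."""
    nn = np.asarray(nn_ms, dtype=float)
    diff = np.diff(nn)

    # Time-domain features.
    feats = {
        "mean_nn_ms": nn.mean(),
        "mean_hr_bpm": 60000.0 / nn.mean(),
        "sdnn_ms": nn.std(ddof=1),
        "rmssd_ms": np.sqrt(np.mean(diff ** 2)),
        "pnn20_pct": 100.0 * np.mean(np.abs(diff) > 20.0),
        "nn50_count": int(np.sum(np.abs(diff) > 50.0)),
    }

    # Frequency-domain features: resample the NN series evenly (4 Hz assumed)
    # so Welch's method applies, then integrate the PSD over each band.
    t = np.cumsum(nn) / 1000.0                       # beat times in seconds
    fs = 4.0
    t_even = np.arange(t[0], t[-1], 1.0 / fs)
    nn_even = interp1d(t, nn, kind="cubic")(t_even)
    f, psd = welch(nn_even - nn_even.mean(), fs=fs,
                   nperseg=min(256, len(nn_even)))
    df = f[1] - f[0]
    for name, lo, hi in [("vl", 0.0, 0.04), ("l", 0.04, 0.15), ("h", 0.15, 0.4)]:
        band = (f >= lo) & (f < hi)
        feats[f"{name}_power_ms2"] = psd[band].sum() * df  # rectangle rule
    return feats
```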
This study aimed to find a way to obtain HRV measurements from a device other than the main tool currently used, the Holter ECG [7,8,9,10,11]. The alternative that we developed measures blood flow using video to obtain a signal similar to those obtained using Photoplethysmography (PPG), which previous research shows can be used to compute the HRV [12,13,14,15,16,17,18]. With the aid of artificial intelligence (AI), there have been large developments in diagnosis, prognosis, and treatment using visual pattern recognition as a gateway to contribute to the interpretation of images in the medical field [19]. In instances where a diagnosis is prone to human error, such as rare diseases, unclear symptoms, or omissions, AI and machine learning (ML) can play a big role in reducing the probability of a misdiagnosis or overdiagnosis [20]. According to the 2015 report of the National Academies of Sciences, Engineering, and Medicine, most people will experience at least one diagnostic error in their lifetime [21]. The implementation of AI with video, which is explored here, can be developed further with the help of technologies such as the Internet of Things (IoT), which are capable of monitoring patients in real time and providing timely, effective, and quality healthcare services related to cardiac conditions [22].
The main contribution of this work is that by performing a correlation analysis, we determined that the HRV features extracted from video were comparable in quality to those obtained through PPG when using a database with PPG measurements derived from a pulse oximeter [23]. Significant correlations were found for time- and frequency-domain features as well as for characteristics measured with non-linear methods that index the unpredictability of a time series [24].

2. Materials and Methods

To generate our dataset, we used a commercial pulse oximeter to obtain the PPG signal and heart rate of each subject. We also used a camera to record the faces of the participants throughout the entire test, at a frame rate of 30 fps, which permits smooth transitions between frames and therefore a smoother signal output, and at a resolution of 1280 × 720, a compromise between image detail and the higher computational cost that larger resolutions entail. The test was designed to last ten minutes in order to capture information in the very-low-frequency range. Because ballistocardiographic artifacts in PPG signals can degrade signal acquisition [12], we sat subjects comfortably at a distance between 30 and 50 cm from the camera in a room with ambient light, attempting to cast as few shadows as possible on their faces. The sample population consisted of 5 subjects: 2 females aged 25 and 55 and 3 males aged 17, 24, and 60, whose skin tones ranged from pearl white to fair to olive. For each subject, we recorded 9 samples at different times of day throughout a 4-month period, amounting to 45 total videos.
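For reproducibility, a minimal sketch of such a recording setup is shown below, assuming OpenCV, camera device index 0, and an arbitrary output filename; the paper does not specify its capture software, so all of these choices are illustrative.

```python
# Minimal sketch of a 10 min, 1280x720, 30 fps face recording with OpenCV.
# Device index, codec, and output filename are assumptions, not from the paper.
import cv2

FPS, WIDTH, HEIGHT, DURATION_S = 30, 1280, 720, 10 * 60

cap = cv2.VideoCapture(0)
cap.set(cv2.CAP_PROP_FRAME_WIDTH, WIDTH)
cap.set(cv2.CAP_PROP_FRAME_HEIGHT, HEIGHT)
cap.set(cv2.CAP_PROP_FPS, FPS)

writer = cv2.VideoWriter("subject_01.avi",
                         cv2.VideoWriter_fourcc(*"MJPG"), FPS, (WIDTH, HEIGHT))
for _ in range(FPS * DURATION_S):
    ok, frame = cap.read()
    if not ok:
        break
    writer.write(frame)

cap.release()
writer.release()
```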
Our methodology is shown in Figure 1. We computed HRV features using the PPG data [25] and video independently. For the video signal, we implemented face recognition and color augmentation stages. We also performed time alignment between the PPG and video signals. Once we obtained the features from both signals, we performed a correlation analysis to compute performance metrics.
We used the same methodology to extract HRV features from PPG signals as in our previous work [23], in which HRV was used to determine blood glucose concentration. There, a peak detection algorithm was developed to accurately measure the distance between consecutive peaks; it handles noise by imposing minimum peak-distance and peak-height constraints, taking advantage of the low complexity of the signal and its Gaussian behavior (analogous to the R-R interval). The vector of time intervals was then used to extract the HRV features using non-linear, time-domain, and frequency-domain methods with the help of the pyHRV toolbox [26]. The performance of the peak detection algorithm was evaluated by comparing its output to an annotated PPG database, yielding 99.89% precision. The time-domain methods resulted in 15 features, the frequency-domain methods in 36, and the non-linear methods in 12, yielding a total of 61 of the most commonly measured HRV characteristics [24].
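A minimal sketch of this kind of peak detection is shown below, assuming SciPy's find_peaks with illustrative distance and height thresholds; it is not the validated algorithm from [23], whose exact parameters are not reported here.

```python
# Sketch only: pulse-peak detection with minimum-distance and minimum-height
# constraints, producing the NN-interval vector used for HRV feature
# extraction. The 180 bpm cap and the height rule are assumptions.
import numpy as np
from scipy.signal import find_peaks

def nn_intervals_ms(signal, fs, max_hr_bpm=180):
    """signal: 1-D PPG-like waveform; fs: sampling rate in Hz."""
    sig = np.asarray(signal, dtype=float)
    # Peaks must be at least one minimum cardiac period apart and above the
    # mean plus a fraction of the spread (exploiting the roughly Gaussian
    # amplitude behavior mentioned in the text).
    min_dist = int(fs * 60.0 / max_hr_bpm)
    height = sig.mean() + 0.5 * sig.std()
    peaks, _ = find_peaks(sig, distance=min_dist, height=height)
    return np.diff(peaks) / fs * 1000.0   # inter-peak intervals in ms

# The resulting NN-interval vector can then be handed to the pyHRV toolbox
# to compute the time-domain, frequency-domain, and non-linear features.
```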
The methodology employed to extract the HRV features from the video began with a face detection stage. Using OpenCV face landmarks and the Viola–Jones variant of the Haar cascade face detector [27], we obtained a region of interest (ROI) from the video (i.e., the face) and cropped all of the frames to focus solely on it. However, in order to minimize the variations between frames, the ROI only changed position between frames when a large enough translation (with a threshold fixed experimentally) was detected.
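A sketch of this ROI logic is shown below, assuming OpenCV's bundled frontal-face Haar cascade and a hypothetical 20-pixel translation threshold (the paper fixes its threshold experimentally but does not report the value).

```python
# Assumed implementation of the face-ROI cropping described above.
import cv2
import numpy as np

cascade = cv2.CascadeClassifier(
    cv2.data.haarcascades + "haarcascade_frontalface_default.xml")

SHIFT_THRESHOLD = 20  # pixels; illustrative value only

def crop_face(frame, prev_box):
    """Return (cropped_frame, box); the box moves only on large translations."""
    gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
    faces = cascade.detectMultiScale(gray, scaleFactor=1.1, minNeighbors=5)
    box = prev_box
    if len(faces) > 0:
        x, y, w, h = max(faces, key=lambda f: f[2] * f[3])   # largest face
        # Keep the previous ROI unless the face shifted enough; this avoids
        # adding detector jitter as frame-to-frame variation.
        if prev_box is None or np.hypot(x - prev_box[0], y - prev_box[1]) > SHIFT_THRESHOLD:
            box = (x, y, w, h)
    if box is None:
        return None, None
    x, y, w, h = box
    return frame[y:y + h, x:x + w], box
```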
The next stage consisted of a color augmentation method using the Eulerian Video Magnification algorithm developed by Wu et al. [28], applied using MATLAB. This stage aimed to amplify the subtle color variations in the skin as blood flows through it. The algorithm decomposes a video sequence into different spatial frequency bands and applies temporal filters to each one, later amplifying the filtered bands and adding them back to the original signal in order to reveal previously invisible information.
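The following heavily simplified sketch illustrates the idea of Eulerian color magnification in Python; the authors used the original MATLAB implementation [28], and the single spatial level, band edges, and amplification factor used here are assumptions.

```python
# Simplified Eulerian-style color magnification: one spatial level (a blurred,
# downsampled copy) plus an ideal temporal band-pass around heart-rate
# frequencies, amplified and added back. All parameters are illustrative.
import numpy as np
import cv2

def magnify_color(frames, fps, f_lo=0.8, f_hi=3.0, alpha=50.0, scale=0.125):
    """frames: list of HxWx3 float frames in [0, 1]; fps: frame rate."""
    small = np.stack([cv2.resize(f, None, fx=scale, fy=scale,
                                 interpolation=cv2.INTER_AREA) for f in frames])
    # Ideal temporal band-pass via FFT along the time axis.
    spec = np.fft.rfft(small, axis=0)
    freqs = np.fft.rfftfreq(small.shape[0], d=1.0 / fps)
    mask = ((freqs >= f_lo) & (freqs <= f_hi))[:, None, None, None]
    band = np.fft.irfft(spec * mask, n=small.shape[0], axis=0)
    # Amplify the band-passed color variations and add them back.
    out = []
    for f, b in zip(frames, band):
        up = cv2.resize(b, (f.shape[1], f.shape[0]),
                        interpolation=cv2.INTER_LINEAR)
        out.append(np.clip(f + alpha * up, 0.0, 1.0))
    return out
```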
From the color-augmented video, we only extracted the information from the red channel, given that according to Feng et al. [29], the red channel can capture muscle movement due to the reflection of light on the skin. This signal was then processed in the same way as the PPG signal was, as previously described. Thus, two databases identical in size with information regarding HRV were created, one using PPG data derived from a commercial pulse oximeter, and one using blood-flow-related data derived from video.
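A small sketch of this step, assuming OpenCV's BGR channel ordering for the cropped frames, is shown below.

```python
# Assumed helper: turn the color-augmented, face-cropped video into a 1-D
# blood-flow signal by averaging the red channel over each frame.
import numpy as np

def red_channel_signal(cropped_frames):
    """cropped_frames: iterable of HxWx3 BGR frames (OpenCV ordering)."""
    # Index 2 is the red channel in OpenCV's BGR layout.
    return np.array([frame[:, :, 2].mean() for frame in cropped_frames])
```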
Considering that the pulse oximeter study and the video recording did not start and finish at the exact same time, there may have been some misalignment between the two signals. Therefore, we performed a time adjustment, in which the longest signal was trimmed (the same amount at the beginning and at the end of the recording) to match its counterpart. We also stratified the peak detection results on a per-minute basis in order to identify large sources of noise and possible variation between signals.
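A minimal sketch of such symmetric trimming is shown below; the helper name and the assumption that both inputs are 1-D arrays are illustrative, not taken from the paper's code.

```python
# Assumed helper: trim the longer of two signals equally at both ends so that
# both cover the same time span before comparison.
import numpy as np

def align_by_trimming(sig_a, sig_b):
    a, b = np.asarray(sig_a), np.asarray(sig_b)
    if len(a) == len(b):
        return a, b
    long, short = (a, b) if len(a) > len(b) else (b, a)
    excess = len(long) - len(short)
    start = excess // 2                      # drop half at the start...
    trimmed = long[start:start + len(short)]  # ...and the rest at the end
    return (trimmed, short) if len(a) > len(b) else (short, trimmed)
```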
The mean error (Me), root mean square error (RMSE), and standard deviation (SD) of the heart rate obtained using each database were computed. Additionally, the correlation of each variable between datasets was calculated using the Kendall, Pearson, and Spearman tests. In order to account for multiple comparisons, we used the Benjamini–Hochberg method to control the false discovery rate (FDR) by considering statistical significance when the adjusted p-value (q-value) was lower than 0.5.
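A sketch of this statistical comparison, assuming SciPy for the three correlation tests and statsmodels for the Benjamini–Hochberg adjustment, is shown below; the feature dictionaries and the alpha argument are illustrative.

```python
# Assumed implementation of the comparison metrics and correlation analysis.
import numpy as np
from scipy import stats
from statsmodels.stats.multitest import multipletests

def error_metrics(hr_ppg, hr_video):
    """Me, SD, and RMSE of the per-sample heart-rate differences."""
    d = np.asarray(hr_ppg) - np.asarray(hr_video)
    return {"Me": d.mean(), "SD": d.std(ddof=1), "RMSE": np.sqrt(np.mean(d ** 2))}

def correlate_features(ppg_feats, video_feats, alpha):
    """ppg_feats, video_feats: dicts mapping feature name -> values per sample.
    alpha: significance threshold applied to the adjusted p-values (q-values)."""
    results = {}
    for test in (stats.pearsonr, stats.spearmanr, stats.kendalltau):
        names, rs, ps = [], [], []
        for name in ppg_feats:
            r, p = test(ppg_feats[name], video_feats[name])
            names.append(name); rs.append(r); ps.append(p)
        # Benjamini-Hochberg control of the false discovery rate.
        reject, qvals, _, _ = multipletests(ps, alpha=alpha, method="fdr_bh")
        results[test.__name__] = {n: (r, q, rej)
                                  for n, r, q, rej in zip(names, rs, qvals, reject)}
    return results
```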

3. Results

In this section, we describe the results obtained at each stage of our methodology, since it is crucial to detect any errors at the face detection, color augmentation, or peak detection stages that could compromise the reliability of the HRV measurements.

3.1. Face Detection

Each video was inspected qualitatively when obtaining the ROI. All cropped videos passed our overall quality inspection and showed the forehead, eyes, nose, and part of the mouth of the subjects at almost all times, as shown in Figure 2b. However, some frames did not show a clear image of the face, as shown in Figure 2c, because the subject had turned slightly or completely away from the camera, or because movements blurred the frame, as shown in Figure 2d. Since the video recording lasted 10 min, it was expected that subjects would occasionally turn their face away from the camera.

3.2. Color Augmentation

In order to evaluate the performance of the color augmentation stage, we compared the number of peaks detected from the processed video signal to those computed from the PPG signal, both globally and locally (1 min windows). However, we first performed a qualitative inspection to make sure that the red channel of the processed video yielded the expected results (i.e., an oscillating signal with a frequency within normal heart rate values). Additionally, we visually compared the shape and period of the video and PPG signals, as shown in Figure 3.

3.3. Peak Detection

After retrieving the signals from both the pulse oximeter and the video, we located the peaks of each sample. We also trimmed the longest signal of each pair of results per observation in order to align them time-wise. Figure 4 shows the results for one observation where it can be seen that the peak detection algorithm was accurate. Using the number of peaks, the sample rate of the pulse oximeter, and the fps of the video signal, we calculated the heart rate for each sample in beats per minute (bpm).
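As a worked example of this calculation (hypothetical helper and numbers, not taken from the paper), the heart rate follows directly from the peak count and the recording duration implied by the sample rate or frame rate:

```python
# Assumed helper: beats per minute from the number of detected peaks and the
# recording duration given by the sampling rate (oximeter) or fps (video).
def heart_rate_bpm(n_peaks, n_samples, rate_hz):
    """rate_hz: oximeter sampling rate or video fps; n_samples: signal length."""
    duration_min = n_samples / rate_hz / 60.0
    return n_peaks / duration_min

# e.g., 700 peaks in a 10 min video at 30 fps (18,000 frames):
# heart_rate_bpm(700, 18000, 30) -> 70.0 bpm
```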
Table 1 shows that there was less than 1 bpm of difference on average between the heart rate measured from the pulse oximeter and the one measured from the video. In terms of total peaks, there was an average difference of 12.804 peaks per 10 min between signals.
Additionally, we performed the same comparison per 1 min window in order to detect specific time lapses in which there were significant differences between the two signals. Table 2 shows that the biggest differences occur in the first two minutes, with more than 2 bpm of difference between the two measurement methods and a standard deviation of more than 3 bpm in the first minute. The table also shows that in the last minute, the difference rises again to more than 1.8 bpm.

3.4. Correlation Analysis

After computing the overall performance metrics, we obtained the correlation of each variable between both datasets. In total, 38 features were regarded as having a significant correlation between sources under the three different tests after adjusting for multiple comparisons. Table 3 shows the five features with the highest correlations under the Pearson test, the most commonly used metric for general correlation between variables of this kind [30]; it also includes the correlation coefficients (r), q-values, and the method used to measure it.
Furthermore, Table 4 represents the five features with the lowest q-values, which also share the trait of having the highest correlation under the Kendall test; it also includes the method used, correlation coefficients (r), and q-values.
Finally, Table 5 shows the five features with the lowest q-values and highest correlation under the Spearman test, presenting the same five features as in the previous tests; it also includes the corresponding method used to measure it, correlation coefficients (r), and q-values.
For the time-domain methods, the nine features that were significant under the three tests after adjusting for multiple comparisons can be seen in Table 6.
For the frequency-domain methods, there were 26 significant features. Table 7 shows the significant features retrieved when using the Welch method to calculate the power spectral density (PSD).
Table 8 shows the significant features obtained when using the autoregressive method to calculate the PSD.
Table 9 presents the significant features obtained when using the Lomb–Scargle method to calculate the PSD.
Lastly, there were three significant features from the non-linear methods, all derived from a Poincaré plot, as seen in Table 10.

4. Discussion

Our results show that the signal derived from the video had peak locations very similar to those of the signal derived from the commercial pulse oximeter, as indicated by their average heart-rate difference of less than 1 bpm. To truly test the performance of the peak detection method we proposed, we compared it to other previously developed methods. Table 11 shows the comparison between our proposed method and others, such as those involving deep learning [31] and neural networks [32], using the metrics commonly employed to evaluate their performance [33].
From Table 11, we can see that our proposed methodology ranks first in both Me and RMSE and fourth in SD, which leads us to believe that the information and setup we are using can be reliable for computing HRV features.
A deeper analysis, in which the results were compared in 1 min windows, showed a tendency for the video to register more peaks at the start and end of each study. This is probably due to the subjects settling into their chairs at the beginning of the study or preparing to get up toward the end of it. These situations can lead to poor cropping, producing a signal that does not correspond to the skin of the subject and causing the red channel to carry false information. Nonetheless, even taking this situation into account, the heart rate computation was accurate, which means that, over the correct time intervals and after cleaning the signal, we were able to compute the heart rate of a person as well as a pulse oximeter can.
Additionally, we demonstrated that we were able to measure 38 HRV features with a significant correlation between sources of information, considering three correlation tests and p-value adjustment for multiple comparisons. The features extracted from the PPG signal derived from the pulse oximeter were significantly correlated with those derived from the video. Moreover, those features were generated using multiple methods: time-domain, frequency-domain, and non-linear. Specifically, in the frequency domain, there was a considerable number of significant correlations among the variables related to Welch’s periodogram, a method used for spectrum monitoring [36,37] and for improving HRV measurement with alternative methods [38,39]. There was also a considerable number of parameters related to autoregressive models, which mainly focus on spectral analysis [39] and can also be related to other non-parametric analyses [40]. For the non-linear metrics, there were significant correlations for both axes of the Poincaré plot and their ratio. These metrics provide valuable information about the variability of R-R intervals and other measures of variability [41,42,43] and have previously been related to PPG as well [44].
This work has some limitations. We only used information from one of the three channels of the video, while some authors recommend mixing the red and green channels to filter out the noise of sudden movements from the subject, given the wavelength dependency of reflection PPG and skin optics [29]. Another source of error could be the ROI selected in this study: the whole face of the subject was taken into account, whereas a smaller ROI focused on the forehead or another large area of the face could prevent losing information when cropping the frame. Finally, the population size is small, so these results cannot yet be generalized; we intend to increase the population size and validate them. Another area of opportunity is the assessment of natural light quality to eliminate the noise resulting from changes in lighting. Some authors recommend more robust video quality models [45,46], based on deep learning methods and statistics of natural spatiotemporal scenes, which would help with frame cropping and improve the quality of the video used.

5. Conclusions

In this work, we demonstrated that the HRV information derived from a PPG signal is significantly correlated with the same features derived from video. We performed 45 studies, each consisting of a 10 min session in which the subject was connected to a pulse oximeter and looked into a camera. Peaks were detected from the PPG signal yielded by the pulse oximeter and were validated against a manually annotated dataset. Peaks were detected from the R channel of the video using the same methodology, after face detection and color augmentation stages aimed at focusing on the face of the subjects and amplifying the subtle color changes in the skin caused by blood flow. On average, an error of less than 1 bpm was found when calculating the heart rate from each signal independently. Furthermore, HRV features were extracted from each signal using time-domain, frequency-domain, and non-linear methods. When compared using the Pearson, Spearman, and Kendall correlation tests, and after adjusting for multiple comparisons, 38 of the 61 extracted features had a significant correlation in all three tests, including features from the three types of methods tested. These results are promising in terms of possible applications, as this methodology could be employed in experiments related to the regulation of autonomic balance, blood pressure, gas exchange, and respiratory rate, as well as gut-, heart-, and vascular-related issues.

Author Contributions

Conceptualization, L.A.G.-R. and A.M.-T.; data curation, L.A.G.-R.; formal analysis, G.H.M.-D.; investigation, G.H.M.-D., A.J.C.-B., J.A.M.-C. and C.E.P.-E.; methodology, G.H.M.-D., A.J.C.-B., J.A.M.-C., C.E.P.-E., L.A.G.-R. and A.M.-T.; software, G.H.M.-D., A.J.C.-B., J.A.M.-C. and C.E.P.-E.; supervision, A.M.-T.; writing—original draft, G.H.M.-D.; writing—review and editing, A.M.-T. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Proyectos de Investigación e Innovación and the Fondo de Publicaciones grants from Universidad de Monterrey.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Ethics Committee of Universidad de Monterrey.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Thayer, J.F.; Yamamoto, S.S.; Brosschot, J.F. The relationship of autonomic imbalance, heart rate variability and cardiovascular disease risk factors. Int. J. Cardiol. 2010, 141, 122–131.
2. Bigger, J.T.; Steinman, R.C.; Rolnitzky, L.M.; Fleiss, J.L.; Albrecht, P.; Cohen, R.J. Power Law Behavior of RR-Interval Variability in Healthy Middle-Aged Persons, Patients With Recent Acute Myocardial Infarction, and Patients With Heart Transplants. Circulation 1996, 93, 2142–2151.
3. Malik, M. Task Force of the European Society of Cardiology and the North American Society of Pacing and Electrophysiology, Heart Rate Variability. Circulation 1996, 93, 1043–1065.
4. Billman, G.E. Heart Rate Variability: A Historical Perspective. Front. Physiol. 2011, 2, 86.
5. Schwartz, M.S.; Andrasik, F. Biofeedback: A Practitioner’s Guide; Guilford Publications: New York, NY, USA, 2017.
6. Malik, M.; Bigger, J.T.; Camm, A.J.; Kleiger, R.E.; Malliani, A.; Moss, A.J.; Schwartz, P.J. Heart rate variability: Standards of measurement, physiological interpretation, and clinical use. Eur. Heart J. 1996, 17, 354–381.
7. Solhjoo, S.; Haigney, M.C.; McBee, E.; Van Merrienboer, J.J.G.; Schuwirth, L.; Artino, A.R., Jr.; Battista, A.; Ratcliffe, T.A.; Lee, H.D.; Durning, S.J. Heart Rate and Heart Rate Variability Correlate with Clinical Reasoning Performance and Self-Reported Measures of Cognitive Load. Sci. Rep. 2019, 9, 14668.
8. Yeragani, V.K.; Sobolewski, E.; Kay, J.; Jampala, V.C.; Igel, G. Effect of age on long-term heart rate variability. Cardiovasc. Res. 1997, 35, 35–42.
9. Kanters, J.K.; Hojgaard, M.V.; Agner, E.; Holstein-Rathlou, N.-H. Short- and long-term variations in non-linear dynamics of heart rate variability. Cardiovasc. Res. 1996, 31, 400–409.
10. Kleiger, R.E.; Miller, J.P.; Bigger, J.T.; Moss, A.J. Decreased heart rate variability and its association with increased mortality after acute myocardial infarction. Am. J. Cardiol. 1987, 59, 256–262.
11. Kazmi, S.Z.H.; Zhang, H.; Aziz, W.; Monfredi, O.; Abbas, S.A.; Shah, S.A.; Kazmi, S.S.H.; Butt, W.H. Inverse Correlation between Heart Rate Variability and Heart Rate Demonstrated by Linear and Nonlinear Analysis. PLoS ONE 2016, 11, e0157557.
12. Moco, A.V.; Stuijk, S.; de Haan, G. Ballistocardiographic Artifacts in PPG Imaging. IEEE Trans. Biomed. Eng. 2016, 63, 1804–1811.
13. Kumar, M.; Veeraraghavan, A.; Sabharwal, A. DistancePPG: Robust non-contact vital signs monitoring using a camera. Biomed. Opt. Express 2015, 6, 1565.
14. Poh, M.-Z.; McDuff, D.J.; Picard, R.W. Non-contact, automated cardiac pulse measurements using video imaging and blind source separation. Opt. Express 2010, 18, 10762.
15. Lewandowska, M.; Rumiński, J.; Kocejko, T.; Nowak, J. Measuring pulse rate with a webcam—A non-contact method for evaluating cardiac activity. In Proceedings of the 2011 Federated Conference on Computer Science and Information Systems (FedCSIS), Szczecin, Poland, 18–21 September 2011; pp. 405–410.
16. Poh, M.-Z.; McDuff, D.J.; Picard, R.W. Advancements in Noncontact, Multiparameter Physiological Measurements Using a Webcam. IEEE Trans. Biomed. Eng. 2011, 58, 7–11.
17. de Haan, G.; van Leest, A. Improved motion robustness of remote-PPG by using the blood volume pulse signature. Physiol. Meas. 2014, 35, 1913–1926.
18. Haque, M.A.; Irani, R.; Nasrollahi, K.; Moeslund, T.B. Heartbeat Rate Measurement from Facial Video. IEEE Intell. Syst. 2016, 31, 40–48.
19. Tran, B.X.; Latkin, C.A.; Vu, G.T.; Nguyen, H.L.T.; Nghiem, S.; Tan, M.-X.; Lim, Z.-K.; Ho, C.S.; Ho, R.C. The Current Research Landscape of the Application of Artificial Intelligence in Managing Cerebrovascular and Heart Diseases: A Bibliometric and Content Analysis. Int. J. Environ. Res. Public Health 2019, 16, 2699.
20. Ahsan, M.M.; Luna, S.A.; Siddique, Z. Machine-Learning-Based Disease Diagnosis: A Comprehensive Review. Healthcare 2022, 10, 541.
21. Balogh, E.P.; Miller, B.T.; Ball, J.R. Improving Diagnosis in Health Care; The National Academies Press: Washington, DC, USA, 2015.
22. Umer, M.; Sadiq, S.; Karamti, H.; Karamti, W.; Majeed, R.; Nappi, M. IoT Based Smart Monitoring of Patients’ with Acute Heart Failure. Sensors 2022, 22, 2431.
23. Guzman, L.; Cazares, A.M.G.; Martinez-Torteya, A. Model for Glycemic Level Detection using Heart Rate Variability in a Mexican Sample. In Proceedings of the 2020 IEEE-EMBS Conference on Biomedical Engineering and Sciences (IECBES), Langkawi Island, Malaysia, 1–3 March 2021; pp. 505–510.
24. Shaffer, F.; Ginsberg, J.P. An Overview of Heart Rate Variability Metrics and Norms. Front. Public Health 2017, 5, 258.
25. Verkruysse, W.; Svaasand, L.O.; Nelson, J.S. Remote plethysmographic imaging using ambient light. Opt. Express 2008, 16, 21434.
26. Gomes, P.; Margaritoff, P.; Silva, H. pyHRV: Development and evaluation of an open-source python toolbox for heart rate variability (HRV). In Proceedings of the International Conference on Electrical, Electronic and Computing Engineering (Icetran), Veliko Gradište, Serbia, 3–6 June 2019; pp. 822–828.
27. Padilla, R.; Filho, C.F.F.C.; Costa, M.G.F. Evaluation of haar cascade classifiers designed for face detection. World Acad. Sci. Eng. Technol. 2012, 64, 362–365.
28. Wu, H.-Y.; Rubinstein, M.; Shih, E.; Guttag, J.; Durand, F.; Freeman, W. Eulerian video magnification for revealing subtle changes in the world. ACM Trans. Graph. 2012, 31, 1–8.
29. Feng, L.; Po, L.; Xu, X.; Li, Y.; Ma, R. Motion-Resistant Remote Imaging Photoplethysmography Based on the Optical Properties of Skin. IEEE Trans. Circuits Syst. Video Technol. 2015, 25, 879–891.
30. Hassan, M.; Malik, A.; Fofi, D.; Saad, N.; Karasfi, B.; Ali, Y.; Meriaudeau, F. Heart rate estimation using facial video: A review. Biomed. Signal Processing Control 2017, 38, 346–360.
31. Hsu, G.-S.J.; Xie, R.-C.; Ambikapathi, A.; Chou, K.-J. A deep learning framework for heart rate estimation from facial videos. Neurocomputing 2020, 417, 155–166.
32. Song, R.; Zhang, S.; Li, C.; Zhang, Y.; Cheng, J.; Chen, X. Heart Rate Estimation From Facial Videos Using a Spatiotemporal Representation With Convolutional Neural Networks. IEEE Trans. Instrum. Meas. 2020, 69, 7411–7421.
33. Pagano, T.P.; Ortega, L.L.; Santos, V.R.; Bonfim, Y.d.; Paranhos, J.V.D.; Sá, P.H.M.; Nascimento, L.F.S.; Winkler, I.; Nascimento, E.G.S. Machine Learning Models and Videos of Facial Regions for Estimating Heart Rate: A Review on Patents, Datasets, and Literature. Electronics 2022, 11, 1473.
34. Li, X.; Chen, J.; Zhao, G.; Pietikainen, M. Remote Heart Rate Measurement from Face Videos under Realistic Situations. In Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 23–28 June 2014; pp. 4264–4271.
35. Lam, A.; Kuno, Y. Robust Heart Rate Measurement from Video Using Select Random Patches. In Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, 7–13 December 2015; pp. 3640–3648.
36. Same, M.H.; Gandubert, G.; Gleeton, G.; Ivanov, P.; Landry, R. Simplified welch algorithm for spectrum monitoring. Appl. Sci. 2021, 11, 86.
37. Krafty, R.T.; Zhao, M.; Buysse, D.J.; Thayer, J.F.; Hall, M. Nonparametric spectral analysis of heart rate variability through penalized sum of squares. Stat. Med. 2014, 33, 1383–1394.
38. Fukunishi, M.; Mcduff, D.; Tsumura, N. Improvements in remote video based estimation of heart rate variability using the Welch FFT method. Artif. Life Robot. 2018, 23, 15–22.
39. Boardman, A.; Schlindwein, F.S.; Rocha, A.P. A study on the optimum order of autoregressive models for heart rate variability. Physiol. Meas. 2002, 23, 325.
40. Merri, M.; Farden, D.C.; Mottley, J.G.; Titlebaum, E.L. Sampling frequency of the electrocardiogram for spectral analysis of the heart rate variability. IEEE Trans. Biomed. Eng. 1990, 37, 99–106.
41. Brennan, M.; Palaniswami, M.; Kamen, P. Poincare plot interpretation using a physiological model of HRV based on a network of oscillators. Am. J. Physiol.-Heart Circ. Physiol. 2002, 283, H1873–H1886.
42. Tayel, M.B.; AlSaba, E.I. Poincaré plot for heart rate variability. Int. J. Biomed. Biol. Eng. 2015, 9, 708–711.
43. Sacha, J.; Pluta, W. Alterations of an average heart rate change heart rate variability due to mathematical reasons. Int. J. Cardiol. 2008, 128, 444–447.
44. Kazmi, S.A.; Shah, M.H.; Khan, S.; Khalifa, O.O.; Muzammil, M. Poincare based PPG signal analysis for varying physiological states. In Proceedings of the 2016 International Conference on Intelligent Systems Engineering (ICISE), Islamabad, Pakistan, 15–17 January 2016; pp. 105–110.
45. Dendi, S.V.R.; Channappayya, S.S. No-Reference Video Quality Assessment Using Natural Spatiotemporal Scene Statistics. IEEE Trans. Image Processing 2020, 29, 5612–5624.
46. Zhou, W.; Chen, Z. Deep Local and Global Spatiotemporal Feature Aggregation for Blind Video Quality Assessment. In Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing (VCIP), Macau, China, 1–4 December 2020; pp. 338–341.
Figure 1. Flowchart depicting pulse oximeter and video processing to generate the datasets that will be used to compute performance metrics.
Figure 2. (a) Frame from original video taken from camera; (b) cropped frame according to ROI; (c) image cropped left side of face; (d) blurred frame caused by sudden movements.
Figure 3. (a) Raw PPG signal; (b) red channel of the color-augmented video signal; (c) half-minute interval of the raw PPG signal; (d) half-minute interval of the color-augmented video signal.
Figure 4. (a) Raw PPG signal with peak detection; (b) color-augmented video with peak detection; (c) half-minute interval of the raw PPG signal with peak detection; (d) half-minute interval of the color-augmented video with peak detection.
Table 1. Metric computation for heart rate.
Me (bpm)    SD (bpm)    RMSE (bpm)
0.8084      7.7758      1.0971
Table 2. Metric computation for heart rate minute by minute.
Minute Interval    Me in bpm (SD)
0:00–0:59          2.5308 (3.5017)
1:00–1:59          2.0444 (2.2049)
2:00–2:59          1.7841 (1.7617)
3:00–3:59          1.9370 (2.1008)
4:00–4:59          1.6222 (1.4815)
5:00–5:59          1.6500 (1.4966)
6:00–6:59          1.6667 (1.7965)
7:00–7:59          1.5333 (1.8165)
8:00–8:59          1.8667 (1.5609)
9:00–9:59          1.9333 (2.0158)
Table 3. Top five features according to the Pearson correlation test.
Feature                           Method                              r       q-Value
Heart Rate                        Time-domain                         0.991   2.71 × 10−34
Mean NN Interval                  Time-domain                         0.990   1.85 × 10−33
NN Interval Count                 Time-domain                         0.955   4.44 × 10−21
Logarithmic VL Frequency Power    Frequency-domain (autoregressive)   0.653   3.57 × 10−5
Absolute VL Frequency Power       Frequency-domain (autoregressive)   0.652   3.57 × 10−5
Table 4. Top five features according to the Kendall correlation test.
Feature                           Method                              r       q-Value
Heart Rate                        Time-domain                         0.934   5.09 × 10−16
Mean NN Interval                  Time-domain                         0.919   8.18 × 10−16
NN Interval Count                 Time-domain                         0.879   1.44 × 10−14
Logarithmic VL Frequency Power    Frequency-domain (autoregressive)   0.507   3.93 × 10−5
Absolute VL Frequency Power       Frequency-domain (autoregressive)   0.507   3.93 × 10−5
Table 5. Top five features according to the Spearman correlation test.
Feature                           Method                              r       q-Value
Heart Rate                        Time-domain                         0.990   8.27 × 10−34
Mean NN Interval                  Time-domain                         0.987   2.55 × 10−31
NN Interval Count                 Time-domain                         0.962   1.43 × 10−22
Logarithmic VL Frequency Power    Frequency-domain (autoregressive)   0.624   1.21 × 10−4
Absolute VL Frequency Power       Frequency-domain (autoregressive)   0.624   1.21 × 10−4
Table 6. Significant features regarding time-domain methods.
No.   Feature
1     Heart Rate
2     Root Mean Square of Successive NN Interval Differences
3     SD of NN intervals
4     Percentage of Successive NN Intervals that differ by more than 20 ms
5     Successive NN Intervals that differ by more than 50 ms
6     NN interval count
7     Minimum NN interval
8     Mean NN interval
9     Mean Difference of Successive NN intervals
Table 7. Significant features regarding frequency-domain using the Welch method.
No.   Feature
1     Peak VL Frequency Power
2     Absolute VL Frequency Power
3     Relative VL Frequency Power
4     Logarithmic VL Frequency Power
5     Absolute L Frequency Power
6     Logarithmic L Frequency Power
7     Logarithmic H Frequency Power
Table 8. Significant features in frequency-domain using the autoregressive method.
No.   Feature
1     Absolute VL Frequency Power
2     Relative VL Frequency Power
3     Logarithmic VL Frequency Power
4     Logarithmic L Frequency Power
5     Absolute L Frequency Power
6     Absolute H Frequency Power
7     Relative H Frequency Power
8     Logarithmic H Frequency Power
Table 9. Significant features in frequency-domain using the Lomb–Scargle method.
No.   Feature
1     Peak VL Frequency Power
2     Absolute VL Frequency Power
3     Relative VL Frequency Power
4     Logarithmic L Frequency Power
5     Absolute L Frequency Power
6     Absolute H Frequency Power
7     Relative H Frequency Power
8     Logarithmic H Frequency Power
9     Peak H Frequency Power
Table 10. Significant features from non-linear methods.
No.   Feature
1     SD perpendicular to the line of identity (SD1)
2     SD along the line of identity (SD2)
3     SD1 to SD2 ratio
Table 11. Results of comparison with previous methods.
Citation                   Me in bpm (SD)   RMSE (bpm)
Li et al., 2014 [34]       7.14 (9.53)      12.47
Lam et al., 2015 [35]      6.49 (8.54)      10.34
Feng et al., 2015 [29]     6.64 (8.01)      10.12
Haque et al., 2016 [18]    4.69 (3.43)      5.96
Song et al., 2020 [32]     5.98 (7.31)      7.45
Hsu et al., 2020 [31]      −2.07 (4.23)     3.08
Proposed                   0.81 (7.77)      1.10
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

