Article

Noise-Robust Pulse Wave Estimation from Near-Infrared Face Video Images Using the Wiener Estimation Method †

1
Graduate School of Science and Engineering, Chiba University, Chiba 263-8522, Japan
2
Kampo Clinical Center, Hiroshima University Hospital, Hiroshima 734-8511, Japan
*
Author to whom correspondence should be addressed.
Part of this research was presented at the Color and Imaging Conference 2022: Hino, Y.; Ashida, K.; Tsumura, N. A noise-robust pulse wave estimation from NIR video using Wiener estimation method. In Proceedings of the Color and Imaging Conference, Scottsdale, AZ, USA, 13–17 November 2022.
J. Imaging 2023, 9(10), 202; https://doi.org/10.3390/jimaging9100202
Submission received: 24 August 2023 / Revised: 26 September 2023 / Accepted: 27 September 2023 / Published: 28 September 2023
(This article belongs to the Topic Computer Vision and Image Processing)

Abstract

In this paper, we propose a noise-robust method for estimating pulse waves from near-infrared face video images. Pulse wave estimation in a near-infrared environment is expected to be applied to non-contact monitoring in dark areas. The conventional method cannot consider noise when performing estimation. As a result, the accuracy of pulse wave estimation in noisy environments is not very high. This may adversely affect the accuracy of heart rate data and other data obtained from pulse wave signals. Therefore, the objective of this study is to perform pulse wave estimation robust to noise. The Wiener estimation method, which is a simple linear computation that can consider noise, was used in this study. Experimental results showed that the combination of the proposed method and signal processing (detrending and bandpass filtering) increased the SNR (signal-to-noise ratio) by more than 2.5 dB compared to the combination of the conventional method and signal processing. The correlation coefficient between the pulse wave signal measured using a pulse wave meter and the estimated pulse wave signal was 0.30 larger on average for the proposed method. Furthermore, the AER (absolute error rate) between the estimated heart rate and the heart rate measured with the pulse wave meter was 0.82% on average for the proposed method, which was lower than the value of the conventional method (12.53% on average). These results show that the proposed method is more robust to noise than the conventional method for pulse wave estimation.

1. Introduction

Part of this research was presented at the Color and Imaging Conference 2022 [1]. Recently, there has been growing demand for monitoring technology in dark areas. Examples of applications of monitoring technology in dark areas include vital signs monitoring during sleep [2]. One method to achieve this is to attach contact-type measurement devices. However, the prolonged use of contact-type devices may cause discomfort to users. In addition, contact-type devices are difficult to attach to some people, for example, newborns and people with burns [3,4]. Because of these disadvantages of contact-type devices, non-contact measurement methods are being proposed.
One non-contact measurement technique is the estimation of a pulse wave from face video images. By capturing subtle changes in the light reflected from the skin with a camera, pulse waves can be estimated [5,6]. From the estimated pulse wave, it is possible to obtain biometric information such as heart rate. Kurita et al. proposed a method for pulse wave estimation by separating RGB video images into melanin, hemoglobin, and shade components for analysis [7]. However, RGB cameras have difficulty capturing video images, and therefore estimating pulse waves, in low-light conditions. Near-infrared cameras have the advantage of being able to capture video images in low-light conditions. Garbey et al. proposed a method to measure pulse waves using a single-band mid-wavelength infrared camera [8]. Zeng et al. proposed a method to estimate the pulse rate using a single-band near-infrared camera [9]. However, with a single-band camera, pulse wave estimation is affected by the illumination in an environment where the illumination fluctuates. Mitsuhashi et al. proposed a pulse wave estimation method that can be used in dark areas by using two-band near-infrared video images [10]. This method used two-band near-infrared video images to separate hemoglobin and shade components. By separating the components, the effect of illumination was removed and the pulse wave signal was estimated. However, this method does not consider noise in pulse wave estimation; thus, the estimated pulse wave signal may have a distorted waveform. A distorted waveform may adversely affect the accuracy of physiological information obtained from pulse wave signals. In addition, hemoglobin absorption is weak in the near-infrared environment, so estimated pulse wave signals are strongly affected by noise [11,12].
There are some methods for denoising during pulse wave estimation [13,14]. However, these methods are deep-learning-based methods which can be complex in processing and difficult to implement. In addition, there are only a few types of large-scale datasets that include human biological information.
In this paper, therefore, we propose a method for estimating pulse waves in dark areas while considering the effect of noise. Our proposed method is a modified version of Mitsuhashi et al.’s method [10]. The Wiener estimation method was used for pulse wave estimation to consider noise. This estimation method can consider noise added to the video images by a linear operation. Therefore, it does not require complex processing techniques such as deep learning. Noise in images can be caused by thermal noise, illumination fluctuation or body motion. In this paper, we focus only on noise caused by thermal noise.

2. Skin Model in the Near-Infrared Environment

Light is classified into three categories: ultraviolet light, visible light and infrared light. Infrared light has a long wavelength range and is classified into near-infrared rays, mid-infrared rays and far-infrared rays according to its wavelength range. In this study, pulse wave estimation was performed using near-infrared video images in the wavelength range of approximately 780 nm to 1000 nm.
Human skin is a multilayered structure that can be divided into three main parts: the epidermis, dermis and subcutaneous tissue. The skin contains various pigments such as melanin, hemoglobin and bilirubin. Among these pigments, changes in melanin and hemoglobin have a significant effect on skin coloration. Melanin is found in the epidermis, whereas hemoglobin is found in the dermis, where capillaries are located. Kurita et al. [7] used a two-layer skin model composed of the epidermis and dermis. They simplified the skin model by assuming that the epidermis is a layer containing only melanin and the dermis is a layer containing only hemoglobin. This allows us to assume that the layer containing only melanin and the layer containing only hemoglobin are spatially independent. Therefore, under visible light illumination, human skin can be treated as a two-layer skin model consisting of a melanin layer and a hemoglobin layer, as shown in Figure 1a. In Kurita et al. [7], melanin and hemoglobin components were estimated using independent component analysis after removing shading components from the observed signal in the density space. In order to separate the melanin, hemoglobin and shade components in a visible light environment, a three-band image such as RGB image is required.
Kurita et al. [7] used an RGB camera; images obtained from an RGB camera contain only information obtained from visible light. Near-infrared light, in contrast, has a longer wavelength than visible light. It therefore penetrates deeper into a living body and is considered to be able to measure blood vessels deep inside the body, called microvessels, which exist even deeper than capillaries [15,16,17]. It can thus be assumed that when light in the near-infrared environment enters the skin, reflection occurs only in the dermis, as shown in Figure 1b [9], and that no diffuse reflection occurs in the epidermis. Therefore, the influence of the melanin component in the epidermis can be ignored in the near-infrared environment, whereas the melanin component is important in the visible light environment. Consequently, a two-band image is sufficient to separate the hemoglobin and shade components in the near-infrared environment.

3. Hemoglobin and Shade Component Separation

3.1. Conventional Method

The pixel value $v_i$ captured by the camera is expressed by the following equation:
$$v_i = \int t_i(\lambda)\, E(\lambda)\, S(\lambda)\, r(x, y, \lambda)\, d\lambda, \quad i = 1, \ldots, m, \tag{1}$$
where $t_i(\lambda)$ is the spectral transmittance of the $i$th filter, $E(\lambda)$ is the spectral radiance of the illumination, $S(\lambda)$ is the spectral sensitivity of the sensor and $r(x, y, \lambda)$ is the spectral reflectance of the object at coordinates $(x, y)$. Equation (1) can be represented graphically as in Figure 2.
In matrix form, Equation (1) can be expressed as follows:
$$v = F r, \tag{2}$$
where $v$ is a vector of pixel values, $F$ is a matrix summarizing spectral transmittance, spectral radiance and spectral sensitivity and $r$ is a vector of object reflectance. This equation is also valid when replacing $v$ with a vector of the negative logarithm of the pixel values and $r$ with a vector of hemoglobin and shade components [9]. In the following sections, we will use the above replacement.
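As an illustration of how the integral in Equation (1) becomes the matrix product in Equation (2), the integral can be discretized on a wavelength grid. The following is a minimal sketch and not the authors' implementation: the Gaussian filter shapes, the illumination spectrum, the flat sensor sensitivity and the reflectance curve are all hypothetical placeholders chosen only to make the example runnable.

```python
import numpy as np

# Wavelength grid in nm (hypothetical sampling with 5 nm steps).
dlam = 5.0
lam = np.arange(700.0, 1000.0 + dlam, dlam)

def gaussian(center, width):
    """Gaussian-shaped spectrum on the grid; a placeholder, not measured data."""
    return np.exp(-0.5 * ((lam - center) / width) ** 2)

# Assumed spectra: two band-pass filters t_i, an LED-like illumination E,
# a flat sensor sensitivity S and a smooth reflectance r.
t = np.vstack([gaussian(800.0, 15.0), gaussian(930.0, 15.0)])
E = gaussian(850.0, 40.0)
S = np.ones_like(lam)
r = 0.4 + 0.1 * np.sin(lam / 50.0)

# Row i of F collects t_i(lambda) * E(lambda) * S(lambda) * d(lambda),
# so the integral in Equation (1) becomes the product v = F r.
F = t * E * S * dlam
v = F @ r

# Direct evaluation of Equation (1) as a Riemann sum, band by band.
v_direct = np.array([np.sum(t[i] * E * S * r) * dlam for i in range(t.shape[0])])
print(np.allclose(v, v_direct))
```

In this discretized view, $F$ has one row per band and one column per wavelength sample. The 2 × 2 matrix used after the replacement of $r$ by hemoglobin and shade components plays the same role with two unknowns.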
Equation (2) is the equation for the case where no noise is added. However, noise is added to the image during actual image capture due to thermal noise and other causes. Therefore, when noise is added, Equation (2) can be expressed as follows:
$$v = F r + n, \tag{3}$$
where $n$ denotes noise.
When estimating $r$ from $v$, the following equation is used:
$$r = F^{-1} v. \tag{4}$$
In this paper, the estimation using the simple inverse matrix (Equation (4)) is referred to as the conventional method. However, Equation (4) does not consider the noise added in Equation (3). Therefore, the estimated pigment components may differ significantly from the original ones.

3.2. Proposed Method

To estimate $r$ from $v$ while considering noise, the following matrix $G$ can be used:
$$\tilde{r} = G v, \tag{5}$$
where $\tilde{r}$ is the pigment component estimated using the Wiener estimation method. It is desirable to minimize the error between the correct value $r$ and the estimated value $\tilde{r}$. For this purpose, the mean squared error between $r$ and $\tilde{r}$ is first calculated. The mean squared error can be expressed as follows:
$$MSE = \left\langle (r - \tilde{r})^{T} (r - \tilde{r}) \right\rangle, \tag{6}$$
where $\langle \cdot \rangle$ denotes the ensemble mean for the pigment component vector. The estimation matrix that minimizes the mean squared error shown in Equation (6) is then expressed in Equation (7).
$$G = R_{rv} R_{vv}^{-1}, \tag{7}$$
where $R_{rv}$ denotes the cross-correlation matrix of $r$ and $v$, and $R_{vv}$ denotes the autocorrelation matrix of $v$.
$$R_{rv} = \langle r v^{T} \rangle, \tag{8}$$
$$R_{vv} = \langle v v^{T} \rangle. \tag{9}$$
The autocorrelation matrix of $r$ is expressed as in Equation (10).
$$R_{rr} = \langle r r^{T} \rangle. \tag{10}$$
Substituting the noise-free relation $v = F r$ (Equation (2)) into Equations (8) and (9) gives $R_{rv} = R_{rr} F^{T}$ and $R_{vv} = F R_{rr} F^{T}$, so Equation (7) can also be expressed as in Equation (11).
$$G = R_{rr} F^{T} \left( F R_{rr} F^{T} \right)^{-1}. \tag{11}$$
In practice, noise is added to the images, as shown in Equation (3). Assuming that the noise is uncorrelated with $r$, the autocorrelation matrix of $v$ gains the additional term $R_{nn}$, and the estimation matrix is given by the following equation:
$$G = R_{rr} F^{T} \left( F R_{rr} F^{T} + R_{nn} \right)^{-1}, \tag{12}$$
where $R_{nn}$ denotes the autocorrelation matrix of the noise.
$$R_{nn} = \langle n n^{T} \rangle. \tag{13}$$
As described above, the Wiener estimation method gives the estimation matrix that minimizes the mean squared error between the correct and estimated values using a simple linear operation when the signal and noise statistics are known [18].
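The difference between Equation (4) and Equation (12) can be illustrated numerically. The sketch below is only a toy comparison under stated assumptions: the 2 × 2 matrix `F`, the synthetic pigment vectors, the noise level `sigma` and the helper name `wiener_matrix` are all hypothetical and are not taken from the experiment.

```python
import numpy as np

def wiener_matrix(F, R_rr, R_nn):
    """Estimation matrix of Equation (12): G = R_rr F^T (F R_rr F^T + R_nn)^-1."""
    return R_rr @ F.T @ np.linalg.inv(F @ R_rr @ F.T + R_nn)

rng = np.random.default_rng(0)
n_samples = 5000

# Hypothetical, nearly singular 2x2 system matrix (bands x components).
F = np.array([[1.0, 1.0],
              [0.9, 1.0]])

# Synthetic pigment vectors r = (hemoglobin, shade); values are placeholders.
r = np.vstack([0.05 * rng.standard_normal(n_samples),
               0.5 + 0.05 * rng.standard_normal(n_samples)])

# Observation with additive zero-mean Gaussian noise, as in Equation (3).
sigma = 0.01
v = F @ r + sigma * rng.standard_normal((2, n_samples))

R_rr = r @ r.T / n_samples       # sample estimate of Equation (10)
R_nn = sigma ** 2 * np.eye(2)    # Equation (13) for independent noise per band

r_inverse = np.linalg.inv(F) @ v               # conventional method, Equation (4)
r_wiener = wiener_matrix(F, R_rr, R_nn) @ v    # Wiener estimation, Equation (5)

# Mean squared error of each estimate, as in Equation (6).
mse_inverse = np.mean(np.sum((r - r_inverse) ** 2, axis=0))
mse_wiener = np.mean(np.sum((r - r_wiener) ** 2, axis=0))
print(f"inverse: {mse_inverse:.4f}, Wiener: {mse_wiener:.4f}")
```

Because the rows of this toy `F` are nearly parallel, the plain inverse amplifies the noise, whereas the $R_{nn}$ term in Equation (12) damps it. With $R_{nn}$ set to zero, `wiener_matrix` reduces to the inverse of `F`, in agreement with Equation (11).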

4. Experimental Setup and Methods

4.1. Experimental Setup

We performed the experiments on a laboratory test bench. The experimental setup is shown in Figure 3. Three male subjects in their 20s participated in this experiment. In a darkroom, the subject was illuminated with infrared LED lights (SA6-IR, EnergyPower, Hong Kong, China) with a wavelength of 850 nm. Face video images were captured for 30 s using a multi-band near-infrared camera (Spectral Devices, MSC2-BIO-1-A) at 66.5 frames per second. This camera had four wavelength bands centered at 735 nm, 800 nm, 865 nm and 930 nm, and the resolution of each image was 512 × 512 pixels. Imaging was performed once for each subject. Each video was saved as still images, frame by frame, to avoid the effects of compression, giving a total of 1995 frames for each subject. An artificial skin patch was captured at the same time and was used to obtain the autocorrelation matrix $R_{nn}$ of the noise added to the captured video images. Simultaneously with the imaging, a pulse wave signal was measured by attaching a photoelectric pulse wave meter (Procomp Infiniti, Thought Technology, Montreal, Canada) to the tip of the index finger of the subject's left hand. The subjects were instructed to place their chin on a chin rest to minimize head motion. In addition, the subjects were instructed to move their head and finger as little as possible during imaging.

4.2. Calculation of Autocorrelation Matrices

As described in Section 3, the noise autocorrelation matrix $R_{nn}$ is required when using the Wiener estimation method. To obtain the noise autocorrelation matrix, it is necessary to obtain the noise added to the captured video images. In this study, the noise added to the video images was obtained by capturing an artificial skin patch simultaneously with the subject's face. Ideally, the pixel values of the artificial skin patch do not change because its surface condition does not change over time. However, the pixel values of the artificial skin patch changed due to the influence of noise. This change in pixel value was used to determine the magnitude of the noise.
The ROI (region of interest) was set for the image of the artificial skin patch, as shown in Figure 4, and the temporal variation of the averaged pixel value was calculated within the ROI. The standard deviation of this temporal variation was then calculated. The above process was performed on the two bands used for imaging and the larger standard deviation was used in subsequent procedures. Then, Gaussian noise was generated by setting the mean value to 0 and the standard deviation to the obtained value. Finally, the autocorrelation matrix of the noise $R_{nn}$ was obtained using this generated noise.
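The steps above can be sketched as follows. This is a minimal illustration under assumptions: the function name `noise_autocorrelation`, the number of generated noise samples `n_draws` and the synthetic patch data are hypothetical, and the text does not state how many noise samples were generated or in which domain (raw pixel values or their negative logarithm) the standard deviation was taken.

```python
import numpy as np

def noise_autocorrelation(patch_means, n_draws=10000, seed=0):
    """Estimate R_nn from ROI-averaged patch values of shape (n_frames, n_bands)."""
    patch_means = np.asarray(patch_means, dtype=float)
    n_bands = patch_means.shape[1]
    # Temporal standard deviation per band; keep the larger one.
    sigma = patch_means.std(axis=0).max()
    # Zero-mean Gaussian noise with that standard deviation for every band.
    rng = np.random.default_rng(seed)
    noise = rng.normal(0.0, sigma, size=(n_bands, n_draws))
    # Sample autocorrelation matrix <n n^T>.
    return sigma, noise @ noise.T / n_draws

# Synthetic stand-in for the patch measurements (1995 frames, two bands).
demo_rng = np.random.default_rng(1)
patch_means = 100.0 + demo_rng.normal(0.0, [0.5, 0.2], size=(1995, 2))
sigma, R_nn = noise_autocorrelation(patch_means)
print(sigma)
print(R_nn)
```

Because the generated noise is independent across bands, the resulting matrix is close to the square of the larger standard deviation times the identity matrix.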
The autocorrelation matrix of the skin pigment $R_{rr}$ is also required when using the Wiener estimation method. To calculate $R_{rr}$, a value of $r$ is required. In this study, the value of $r$ obtained using the conventional method (Equation (4)) was used. The pigment component $r$ consists of a hemoglobin component and a shade component, and both components were calculated as the average of the values obtained from each of the three subjects. When calculating $R_{rr}$, the hemoglobin component values were varied within the range of values obtained from the three subjects, while the shade component values were fixed. This reflects the fact that the hemoglobin component fluctuates with time because of blood flow, while the shade component does not. With the above settings of $r$, $R_{rr}$ was calculated.
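One possible reading of this procedure is sketched below. It is an assumption-laden illustration: the text does not say how the hemoglobin values were sampled within their range, so a uniform sweep is used here, and the function name `pigment_autocorrelation` and all numerical values are hypothetical.

```python
import numpy as np

def pigment_autocorrelation(hemo_min, hemo_max, shade, n_points=1000):
    """R_rr = <r r^T> for r = (hemoglobin, shade) with a fixed shade value."""
    # Hemoglobin swept uniformly over the observed range (an assumption).
    hemo = np.linspace(hemo_min, hemo_max, n_points)
    # Shade component held constant.
    shade_values = np.full(n_points, shade)
    r = np.vstack([hemo, shade_values])
    return r @ r.T / n_points

# Placeholder values, not measurements from the experiment.
R_rr = pigment_autocorrelation(hemo_min=0.1, hemo_max=0.3, shade=0.5)
print(R_rr)
```

The lower-right entry of the result equals the squared shade value, and the off-diagonal entries equal the mean hemoglobin value times the shade value, since only the hemoglobin component varies.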

4.3. Acquisition of the Original Pulse Wave Signal and Signal Processing

The temporal variation in the averaged pixel values in the ROI was analyzed by selecting two bands from the multi-band near-infrared face video images taken under the imaging environment described in Section 4.1 and setting the ROI as shown in Figure 5. The nose and cheeks were selected as the ROI [19]. In a vertical direction, the ROI was set in the range shown in Figure 5 to avoid the eye blink and lip motion. The ROI size was set as large as possible within the above range because a larger ROI size reduces noise [20]. In the current study, two-band images were used: one with a central wavelength of 800 nm and the other with a central wavelength of 930 nm. These two wavelengths were selected based on a previous study [10]. By using the temporal variation in the acquired average pixel values in Equations (4) and (5), respectively, the estimation results of the original pulse wave signal by the conventional and proposed methods could be obtained.
Detrending [21] was performed on the original pulse wave signal. A bandpass filter was then applied. The passband of the bandpass filter was set to [0.75, 4.0] Hz [22,23]. The upper peak points were detected by finding the local maximum values for each waveform in the bandpass-filtered pulse wave signal. The peak points are used to estimate the heart rate from the interval between adjacent peak points, called the RR interval. The heart rate is calculated by dividing 60 by $\overline{RR_{\mathrm{interval}}}$ (the average RR interval in the signal, in seconds), as shown in Equation (14).
$$HR = \frac{60}{\overline{RR_{\mathrm{interval}}}}. \tag{14}$$
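The processing chain above can be sketched as follows. Note the substitutions, which are assumptions made to keep the example short: a linear fit stands in for the detrending method of [21], an ideal FFT mask stands in for the bandpass filter (whose type is not stated in the text), and the input is a synthetic 1.2 Hz pulse rather than a measured signal. The function names are hypothetical.

```python
import numpy as np

def bandpass_fft(x, fs, low=0.75, high=4.0):
    """Zero all FFT bins outside [low, high] Hz (ideal filter, an assumption)."""
    spectrum = np.fft.rfft(x)
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
    spectrum[(freqs < low) | (freqs > high)] = 0.0
    return np.fft.irfft(spectrum, n=len(x))

def heart_rate_from_peaks(x, fs):
    """Heart rate in beats per minute from Equation (14)."""
    # Upper peaks: samples strictly larger than both neighbours.
    peaks = np.flatnonzero((x[1:-1] > x[:-2]) & (x[1:-1] > x[2:])) + 1
    rr_intervals = np.diff(peaks) / fs   # seconds between adjacent peaks
    return 60.0 / rr_intervals.mean()

# Synthetic signal: 1.2 Hz pulse (72 beats per minute), slow drift, mild noise.
fs = 66.5
t = np.arange(1995) / fs
rng = np.random.default_rng(0)
raw = np.sin(2 * np.pi * 1.2 * t) + 0.5 * t + 0.05 * rng.standard_normal(t.size)

# Linear detrending (simplified stand-in), then bandpass, then peak-based HR.
detrended = raw - np.polyval(np.polyfit(t, raw, 1), t)
filtered = bandpass_fft(detrended, fs)
hr = heart_rate_from_peaks(filtered, fs)
print(f"estimated heart rate: {hr:.1f} bpm")
```

Simple neighbour comparison is only reliable here because the bandpass-filtered signal is smooth. On noisier data, a peak detector with a minimum peak distance would be a more cautious choice.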

4.4. Evaluation Metrics

In this paper, the correlation coefficient, SNR (signal-to-noise ratio) and AER (absolute error rate) were used as evaluation metrics for pulse wave estimation. To calculate SNR, the pulse wave signals were Fourier-transformed to obtain the “Signal” component (0.5–15 Hz) and the “Noise” component (frequencies above 15 Hz) [24]. The “Signal” and “Noise” components were used to calculate the SNR from Equation (15).
$$SNR = 20 \log_{10} \left( \frac{Signal}{Noise} \right) \ \mathrm{dB}. \tag{15}$$
The AER between the estimated heart rate and the heart rate obtained using a pulse wave meter was determined [10]. AER is expressed by the following equation:
$$AER = \frac{\left| HR_{GT} - HR_{EV} \right|}{HR_{GT}} \times 100, \tag{16}$$
where $HR_{GT}$ is the ground truth of the heart rate obtained using a pulse wave meter and $HR_{EV}$ is the estimated heart rate obtained from the pulse wave estimated by the conventional or proposed method.
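Equations (15) and (16) can be sketched in code as follows. How the “Signal” and “Noise” components are aggregated from the spectrum is not specified in the text, so summing the FFT amplitudes in each band is an assumption, as are the function names and the synthetic test signals.

```python
import numpy as np

def snr_db(x, fs, signal_band=(0.5, 15.0)):
    """SNR of Equation (15), with band amplitudes summed (an assumption)."""
    amplitude = np.abs(np.fft.rfft(x))
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
    in_band = (freqs >= signal_band[0]) & (freqs <= signal_band[1])
    signal = amplitude[in_band].sum()                 # "Signal": 0.5-15 Hz
    noise = amplitude[freqs > signal_band[1]].sum()   # "Noise": above 15 Hz
    return 20.0 * np.log10(signal / noise)

def aer_percent(hr_gt, hr_ev):
    """Absolute error rate of Equation (16), in percent."""
    return abs(hr_gt - hr_ev) / hr_gt * 100.0

# Synthetic 1.2 Hz pulse with weak and strong white noise.
fs = 66.5
t = np.arange(1995) / fs
rng = np.random.default_rng(0)
pulse = np.sin(2 * np.pi * 1.2 * t)
clean = pulse + 0.05 * rng.standard_normal(t.size)
noisy = pulse + 0.5 * rng.standard_normal(t.size)

print(snr_db(clean, fs), snr_db(noisy, fs))
print(aer_percent(60.0, 57.0))
```

With this definition, the signal carrying less broadband noise yields the higher SNR, which matches how the metric is used to compare the conventional and proposed methods.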

5. Results

5.1. Original Pulse Wave Signals

Figure 6 shows the original pulse wave signals obtained by applying the conventional and proposed methods to two-band near-infrared images and pulse wave signals obtained using a pulse wave meter. The correlation coefficient was calculated between the estimated pulse wave signals and the pulse wave signal obtained using a pulse wave meter. SNR was calculated from the pulse wave signals. Based on the results of the correlation coefficient and SNR (Table 1), the proposed method provides a stronger correlation than the conventional method. The results on SNR show that the proposed method can estimate pulse wave signals robustly regarding noise.

5.2. After Signal Processing

Detrending was performed on the original pulse wave signal and the results are shown in Figure 7. A bandpass filter was applied. Figure 8 shows the results of applying the bandpass filter.
Correlation coefficients were calculated between each pulse wave signal and the pulse wave signal obtained using a pulse wave meter. SNR was calculated from each estimated pulse wave signal. Furthermore, the heart rate was estimated from each bandpass-filtered pulse wave signal, and AER was calculated between the estimated heart rate and the heart rate obtained from the pulse wave meter. The heart rate was estimated only from the bandpass-filtered pulse wave signals because the waveform after bandpass filtering had clearer peak points. Table 2 shows the results for the correlation coefficient, SNR and AER after each signal processing; the combined use of detrend and bandpass filtering makes the pulse wave signal robust to noise and improves the accuracy of heart rate estimation using the pulse wave signal. The pulse wave signal estimated using the proposed method showed a stronger correlation coefficient with the pulse wave signal obtained from a pulse wave meter. Furthermore, for the SNR and AER the results showed that the proposed method can achieve noise-robust pulse wave estimation and highly accurate heart rate estimation.

6. Discussion

The advantage of the proposed method is that it is already robust to noise at the stage of the original pulse wave signal, as shown in Figure 6. This makes it possible to estimate the pulse wave using only a bandpass filter, even when the pulse wave is buried in noise, as shown in Figure 8a. The accuracy of heart rate estimation differed among subjects (Table 1 and Table 2). This may be due to slight differences in skin thickness [25] and in the position of blood vessels among the subjects, which affect the accuracy of pulse wave estimation.
In this paper, the sample size was small (three subjects). To provide sufficient discussion with this small amount of data, we discuss below the relationship between the method of calculating the autocorrelation matrix $R_{rr}$ used in the Wiener estimation method and the results. As described in Section 4.2, when calculating the autocorrelation matrix $R_{rr}$ for the pigment components, the hemoglobin and shade component values were the average values for all subjects obtained using the conventional method (Equation (4)). Here, we show the results obtained when calculating $R_{rr}$ using the values of the hemoglobin and shade components for each individual subject. Figure 9 shows the original pulse wave signal and the pulse wave signal after signal processing (detrending and bandpass filtering) under the above condition settings. Table 3 shows the correlation coefficient with the pulse wave signal obtained from the pulse wave meter, the SNR of each estimated pulse wave signal and the AER between the estimated heart rate and the heart rate obtained from the pulse wave meter. Comparing Figure 9 with Figure 6, Figure 7 and Figure 8, although a change in the values of the vertical axis occurred in the original pulse wave signals, there was almost no effect on the approximate shape of the original pulse wave signals. Comparing Table 3 with Table 1 and Table 2, the SNR was lowest in the case of subject 1, regardless of the method used to set the value of $R_{rr}$. The correlation coefficient, SNR and AER did not change significantly. These results indicate that the method of setting the $R_{rr}$ value does not significantly influence the pulse wave estimation results. This may be due to the similarity between the values of hemoglobin and shade components obtained from each subject and the average of the component values for all subjects.
However, the subjects in this experiment covered only a narrow range of age, race and sex, which may be why the method used to set the $R_{rr}$ value did not have a significant effect on the results. Further experiments will be performed with subjects of various ages, races and genders to examine the individual differences in the accuracy of pulse wave estimation and the effect of the $R_{rr}$ value setting method on accuracy.

7. Conclusion and Future Works

In this study, we proposed a noise-robust pulse wave estimation method for near-infrared face video images using the Wiener estimation method. We compared the proposed method with a conventional method in a near infrared environment. For pulse wave estimation in the near-infrared environment, pulse waves can be obtained by separating hemoglobin and shade components in two-band face video images. While the conventional method uses the inverse matrix of the pigment components (hemoglobin and shade component) vectors, the Wiener estimation method approximates the inverse matrix used in the conventional method by using the autocorrelation matrix of the pigment components and the autocorrelation matrix of the noise added to video images. Therefore, the estimation matrix obtained by the Wiener estimation method can consider noise, whereas the conventional method does not have any information about noise in the matrix used for component separation. The Wiener estimation method is a linear operation and can consider noise without the need for complex processing methods such as deep learning.
The pulse wave signal was estimated using the proposed method and was compared with the one estimated using the conventional method. As evaluation metrics, we used the correlation coefficient between the pulse wave signal obtained from a pulse wave meter and the estimated pulse wave signal, the SNR of each estimated pulse wave signal and the AER between the estimated heart rate (only the estimated pulse wave signals that had been detrended and bandpass filtered) and the heart rate obtained from the pulse wave meter. In the comparison, experimental results showed that the combination of the proposed method and signal processing (detrending and bandpass filtering) increased the SNR by more than 2.5 dB compared to the conventional method and signal processing. The correlation coefficient between the pulse wave signal measured using the pulse wave meter and the estimated pulse wave signal was 0.48 on average for the proposed method and 0.18 on average for the conventional method, indicating a stronger correlation with the proposed method. Furthermore, the AER with the heart rate measured using the pulse wave meter averaged 0.82% for the proposed method and 12.53% for the conventional method, indicating that the pulse wave estimated using the proposed method can be used to estimate the heart rate with high accuracy.
Although the results in the near-infrared environment were better than those obtained with the conventional method, it is necessary to verify the use of this method in environments other than the near-infrared environment, such as in RGB photography. It is also necessary to verify the method when the noise is larger than the noise added in this experiment. In this paper, only the noise caused by thermal noise was considered. However, for practical use, it will be necessary to consider other types of noise in the future.

Author Contributions

Conceptualization, Y.H. and N.T.; methodology, Y.H.; software, Y.H.; formal analysis, Y.H.; investigation, Y.H.; data curation, Y.H.; writing—original draft preparation, Y.H.; supervision, K.A. and N.T.; project administration, K.O.-O. and N.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

This study was approved by the Ethics Review Committee for Epidemiological Research, Hiroshima University Hospital (E2021-2489). All participants agreed to the publication of the results obtained from the study. All results have been decoded. Individual results cannot be tracked.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. These data are not publicly available due to ethical restrictions.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Hino, Y.; Ashida, K.; Tsumura, N. A noise-robust pulse wave estimation from NIR video using Wiener estimation method. In Proceedings of the Color and Imaging Conference, Scottsdale, AZ, USA, 13–17 November 2022.
  2. Gastel, M.V.; Stuijk, S.; Overeem, S.; Dijk, J.P.V.; Gilst, M.M.V.; Haan, G.D. Camera-based vital signs monitoring during sleep-A proof of concept study. IEEE J. Biomed. Health Inform. 2020, 25, 1409–1418.
  3. Aarts, L.A.M.; Jeanne, V.; Cleary, J.P.; Nelson, J.S.; Oetomo, S.B.; Verkruysse, W. Non-contact heart rate monitoring utilizing camera photoplethysmography in the neonatal intensive care unit—A pilot study. Early Hum. Dev. 2013, 89, 943–948.
  4. Iozzia, L.; Cerina, L.; Mainardi, L.T. Assessment of beat-to-beat heart rate detection method using a camera as contactless sensor. In Proceedings of the 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Orlando, FL, USA, 16–20 August 2016.
  5. Sun, Y.; Thakor, N. Photoplethysmography Revisited: From Contact to Noncontact, From Point to Imaging. IEEE Trans. Biomed. Eng. 2016, 63, 463–477.
  6. Verkruysse, W.; Svaasand, L.O.; Nelson, J.S. Remote plethysmographic imaging using ambient light. Opt. Express 2008, 16, 21434–21445.
  7. Kurita, K.; Yonezawa, T.; Kuroshima, M.; Tsumura, N. Non-contact Video Based Estimation for Heart Rate Variability Spectrogram using Ambient Light by Extracting Hemoglobin Information. In Proceedings of the Color and Imaging Conference, Darmstadt, Germany, 19–23 October 2015.
  8. Garbey, M.; Sun, N.; Merla, A.; Pavlidis, I. Contact-Free Measurement of Cardiac Pulse Based on the Analysis of Thermal Imagery. IEEE Trans. Biomed. Eng. 2007, 54, 1418–1426.
  9. Zeng, W.; Zhang, Q.; Zhou, Y.; Xu, G.; Liang, G. Infrared Video based Non-invasive Heart Rate Measurement. In Proceedings of the IEEE Conference on Robotics and Biomimetics, Zhuhai, China, 6–9 December 2015.
  10. Mitsuhashi, R.; Okada, G.; Kurita, K.; Kawahito, S.; Koopipat, C.; Tsumura, N. Noncontact pulse wave detection by two-band infrared video-based measurement on face without visible lighting. Artif. Life Robot. 2018, 23, 345–352.
  11. Mancini, D.M.; Bolinger, L.; Li, H.; Kendrick, K.; Chance, B.; Wilson, J.R. Validation of near-infrared spectroscopy in humans. J. Appl. Physiol. 1994, 77, 2740–2747.
  12. McDuff, D. Camera Measurement of Physiological Vital Signs. ACM Comput. Surv. 2023, 55, 1–40.
  13. Lokendra, B.; Puneet, G. AND-rPPG: A novel denoising-rPPG network for improving remote heart rate estimation. Comput. Biol. Med. 2021, 141, 105146.
  14. Lee, J.S.; Hwang, G.; Ryu, M.; Lee, S.J. LSTC-rPPG: Long Short-Term Convolutional Network for Remote Photoplethysmography. In Proceedings of the 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, Vancouver, BC, Canada, 17–24 June 2023.
  15. Ash, C.; Dubec, M.; Donne, K.; Bashford, T. Effect of wavelength and beam width on penetration in light-tissue interaction using computational methods. Lasers Med. Sci. 2017, 32, 1909–1918.
  16. Anderson, R.R.; Parrish, J.A. The optics of human skin. J. Invest. Dermatol. 1981, 77, 13–19.
  17. Liu, J.; Yan, B.P.; Dai, W.X.; Ding, X.R.; Zhang, Y.T.; Zhao, N. Multi-wavelength photoplethysmography method for skin arterial pulse extraction. Biomed. Opt. Express 2016, 7, 4313–4326.
  18. Tsumura, N.; Haneishi, H.; Miyake, Y. Estimation of Spectral Reflectances from Multi-Band Images by Multiple Regression Analysis. Jpn. J. Opt. 1998, 27, 384–391.
  19. Kumar, M.; Veeraraghavan, A.; Sabharwal, A. DistancePPG: Robust non-contact vital signs monitoring using a camera. Biomed. Opt. Express 2015, 6, 1565–1588.
  20. Kwon, S.; Kim, J.; Lee, D.; Park, K. ROI analysis for remote photoplethysmography on facial video. In Proceedings of the 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Milan, Italy, 25–29 August 2015.
  21. Tarvainen, M.P.; Ranta-Aho, P.O.; Karjalainen, P.A. An advanced detrending method with application to HRV analysis. IEEE Trans. Biomed. Eng. 2002, 49, 172–175.
  22. Poh, M.Z.; McDuff, D.J.; Picard, R.W. Non-contact, automated cardiac pulse measurements using video imaging and blind source separation. Opt. Express 2010, 18, 10762–10774.
  23. Wang, W.; Brinker, A.C.D.; Haan, G.D. Discriminative signatures for Remote-PPG. IEEE Trans. Biomed. Eng. 2020, 67, 1462–1473.
  24. Yamakoshi, T.; Lee, J.; Matsumura, K.; Yamakoshi, Y.; Rolfe, P.; Kiyohara, D.; Yamakoshi, K. Integrating Sphere Finger-Photoplethysmography: Preliminary Investigation towards Practical Non-Invasive Measurement of Blood Constituents. PLoS ONE 2015, 10, e0143506.
  25. Moco, A.V.; Stuijk, S.; Haan, G.D. New insights into the origin of remote PPG signals in visible light and infrared. Sci. Rep. 2018, 8, 8501.
Figure 1. Skin model at different light wavelengths. (a) Visible light; (b) infrared light.
Figure 2. Process flow of Equation (1).
Figure 3. Experimental setup for capturing face video images.
Figure 4. Near-infrared video images of artificial skin patch. (a) Video images of artificial skin patch; (b) averaged pixel values of (a).
Figure 5. Near-infrared face video images and ROI setting.
Figure 6. Original pulse wave signals. (a) Pulse wave meter (subject 1); (b) conventional method (subject 1); (c) proposed method (subject 1); (d) pulse wave meter (subject 2); (e) conventional method (subject 2); (f) proposed method (subject 2); (g) pulse wave meter (subject 3); (h) conventional method (subject 3); and (i) proposed method (subject 3).
Figure 7. Pulse wave signals after detrending. (a) Conventional method (subject 1); (b) proposed method (subject 1); (c) conventional method (subject 2); (d) proposed method (subject 2); (e) conventional method (subject 3); (f) proposed method (subject 3).
Figure 8. Pulse wave signals after bandpass filtering. (a) Conventional method (subject 1); (b) proposed method (subject 1); (c) conventional method (subject 2); (d) proposed method (subject 2); (e) conventional method (subject 3); (f) proposed method (subject 3).
Figure 9. Pulse wave signals when changing the configuration of the pigment component values used for $R_{rr}$. (a) Original pulse wave signal (subject 1); (b) pulse wave signal after detrending (subject 1); (c) pulse wave signal after applying bandpass filtering (subject 1); (d) original pulse wave signal (subject 2); (e) pulse wave signal after detrending (subject 2); (f) pulse wave signal after applying bandpass filtering (subject 2); (g) original pulse wave signal (subject 3); (h) pulse wave signal after detrending (subject 3); and (i) pulse wave signal after applying bandpass filtering (subject 3).
Table 1. Comparison of conventional and proposed methods (original pulse wave signals).

| Subject | Method | Correlation coefficient | SNR [dB] |
|---|---|---|---|
| Subject 1 | Conventional | −0.004 | −8.1 |
| Subject 1 | Proposed | −0.027 | −4.8 |
| Subject 2 | Conventional | −0.020 | −8.4 |
| Subject 2 | Proposed | −0.078 | −4.6 |
| Subject 3 | Conventional | −0.004 | −8.2 |
| Subject 3 | Proposed | 0.059 | −3.0 |
Table 2. Correlation coefficient, SNR and AER results after each signal processing.

| Subject | Method | Signal processing | Correlation coefficient | SNR [dB] | AER [%] |
|---|---|---|---|---|---|
| Subject 1 | Conventional | Detrend | 0.027 | −8.8 | - |
| Subject 1 | Conventional | Detrend and bandpass filter | 0.109 | −5.5 | 20.6 |
| Subject 1 | Proposed | Detrend | 0.342 | 0.3 | - |
| Subject 1 | Proposed | Detrend and bandpass filter | 0.506 | 3.3 | 0.90 |
| Subject 2 | Conventional | Detrend | 0.096 | −8.8 | - |
| Subject 2 | Conventional | Detrend and bandpass filter | 0.187 | 2.1 | 7.68 |
| Subject 2 | Proposed | Detrend | 0.353 | −4.8 | - |
| Subject 2 | Proposed | Detrend and bandpass filter | 0.411 | 5.6 | 0.06 |
| Subject 3 | Conventional | Detrend | 0.090 | −2.9 | - |
| Subject 3 | Conventional | Detrend and bandpass filter | 0.246 | 2.1 | 9.31 |
| Subject 3 | Proposed | Detrend | 0.332 | −5.1 | - |
| Subject 3 | Proposed | Detrend and bandpass filter | 0.517 | 4.7 | 1.50 |
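
The averages quoted in the abstract (mean AER of 0.82% versus 12.53%, a mean correlation gain of about 0.30, and an SNR gain above 2.5 dB) can be cross-checked against the "detrend and bandpass filter" rows of Table 2. The following is a minimal sketch of that arithmetic only; the variable and function names (`table2`, `column_means`, `snr_gains`) are illustrative assumptions and are not taken from the authors' code.

```python
from statistics import mean

# Values transcribed from the "detrend and bandpass filter" rows of Table 2.
# Each tuple is (correlation coefficient, SNR [dB], AER [%]) for subjects 1-3.
# The dictionary layout and names are hypothetical, chosen for illustration.
table2 = {
    "conventional": [(0.109, -5.5, 20.6), (0.187, 2.1, 7.68), (0.246, 2.1, 9.31)],
    "proposed": [(0.506, 3.3, 0.90), (0.411, 5.6, 0.06), (0.517, 4.7, 1.50)],
}


def column_means(rows):
    """Return the per-column means (correlation, SNR, AER) over all subjects."""
    corr, snr, aer = zip(*rows)
    return mean(corr), mean(snr), mean(aer)


conv = column_means(table2["conventional"])
prop = column_means(table2["proposed"])

# Per-subject SNR improvement of the proposed method over the conventional one.
snr_gains = [p[1] - c[1] for p, c in zip(table2["proposed"], table2["conventional"])]

print(f"mean AER: conventional {conv[2]:.2f}%, proposed {prop[2]:.2f}%")
print(f"mean correlation gain: {prop[0] - conv[0]:.2f}")
print(f"per-subject SNR gain [dB]: {[round(g, 1) for g in snr_gains]}")
```

With only three subjects, these means are descriptive summaries rather than statistical estimates, so the per-subject rows remain the more informative view.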
Table 3. Correlation coefficient, SNR and AER results after changing $R_{rr}$ configuration.

| Subject | Method | Signal processing | Correlation coefficient | SNR [dB] | AER [%] |
|---|---|---|---|---|---|
| Subject 1 | Proposed | Original pulse | −0.027 | −4.8 | - |
| Subject 1 | Proposed | Detrend | 0.342 | 0.3 | - |
| Subject 1 | Proposed | Detrend and bandpass filter | 0.506 | 3.3 | 0.90 |
| Subject 2 | Proposed | Original pulse | −0.078 | −4.6 | - |
| Subject 2 | Proposed | Detrend | 0.353 | −4.8 | - |
| Subject 2 | Proposed | Detrend and bandpass filter | 0.411 | 5.6 | 0.06 |
| Subject 3 | Proposed | Original pulse | 0.059 | −3.0 | - |
| Subject 3 | Proposed | Detrend | 0.332 | −5.1 | - |
| Subject 3 | Proposed | Detrend and bandpass filter | 0.517 | 4.7 | 1.50 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Hino, Y.; Ashida, K.; Ogawa-Ochiai, K.; Tsumura, N. Noise-Robust Pulse Wave Estimation from Near-Infrared Face Video Images Using the Wiener Estimation Method. J. Imaging 2023, 9, 202. https://doi.org/10.3390/jimaging9100202

