Next Article in Journal
Human Recognition Using Deep Neural Networks and Spatial Patterns of SSVEP Signals
Previous Article in Journal
BATMAN: A Brain-like Approach for Tracking Maritime Activity and Nuance
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

DDM-HSA: Dual Deterministic Model-Based Heart Sound Analysis for Daily Life Monitoring

1
Department of Computer and Information Engineering, Daegu University, Kyeongsan 38453, Republic of Korea
2
Department of Biomedical Engineering, School of Medicine, Keimyung University, Daegu 42601, Republic of Korea
3
Clairaudience Company Limited, Daegu 42403, Republic of Korea
4
Department of Biomedical Engineering, Graduate School of Medicine, Keimyung University, Daegu 42601, Republic of Korea
*
Author to whom correspondence should be addressed.
Sensors 2023, 23(5), 2423; https://doi.org/10.3390/s23052423
Submission received: 7 November 2022 / Revised: 13 February 2023 / Accepted: 15 February 2023 / Published: 22 February 2023
(This article belongs to the Section Physical Sensors)

Abstract

:
A sudden cardiac event in patients with heart disease can lead to a heart attack in extreme cases. Therefore, prompt interventions for the particular heart situation and periodic monitoring are critical. This study focuses on a heart sound analysis method that can be monitored daily using multimodal signals acquired with wearable devices. The dual deterministic model-based heart sound analysis is designed in a parallel structure that uses two bio-signals (PCG and PPG signals) related to the heartbeat, enabling more accurate heart sound identification. The experimental results show promising performance of the proposed Model III (DDM-HSA with window and envelope filter), which had the highest performance, and S1 and S2 showed average accuracy (unit: %) of 95.39 (±2.14) and 92.55 (±3.74), respectively. The findings of this study are anticipated to provide improved technology to detect heart sounds and analyze cardiac activities using only bio-signals that can be measured using wearable devices in a mobile environment.

1. Introduction

Sudden cardiac events in patients with heart diseases or the occurrence of a particular heart event in healthy individuals may lead to a heart attack in extreme cases [1,2,3]. Therefore, prompt interventions depending on the particular heart event and periodic monitoring are crucial. Furthermore, among various heart diseases or events, arrhythmia is a precursory symptom that can influence the signs of abnormal cardiac symptoms, such as aortic stenosis and aortic insufficiency. In addition, because arrhythmia is a factor that induces stroke, cerebral infarction, and acute myocardial infarction, which are the leading causes of sudden death, studies to identify risk factors through continuous observation and monitoring of heart conditions in daily life have increased.
Among the identification methods of cardiac risk factors through observation of heart condition, coronary angiography (CAG) is the most accurate and precise standard examination method. This method involves injecting a contrast agent by inserting a thin tube with a diameter of 2 to 3 mm into the arterial blood vessel. However, subjects undergoing CAG cannot on the day of the examination, and other inconveniences exist, such as behavioral restrictions for a certain period after the examination. In addition, CAG is an invasive examination method [4]; therefore, this method poses an economic burden, and performing fast and convenient tests using this method in daily life is impossible. Therefore, an algorithm for diagnosing abnormal heart conditions, such as arrhythmias, by measuring and analyzing changes in bio-signals as shown by heartbeats has been developed. Yildirim et al. [5] developed an arrhythmia detection algorithm by applying a convolutional neural network model and laying an electrocardiogram (ECG) signal representing a change in the potential of the myocardium. Bharti et al. [6] developed an efficient algorithm to predict heart diseases by applying various machine learning and deep-learning-based techniques using ECG signals. Sraitih et al. [7] developed an algorithm using various machine learning techniques to automatically diagnose arrhythmias from ECG data. As mentioned earlier, although many studies related to ECG-based monitoring of cardiac activity are underway, the acquisition of ECG signals in daily life is challenging, because general ECG data collection requires that the subjects not move in a lying position, after attaching electrodes and suction cups to their body. Accordingly, a data collection method and an algorithm that can analyze the cardiac activity state using a wearable device instead of the ECG method are required in a mobile environment.
In mobile environments, various wearable sensors based on two signals, PCG and PPG, to analyze cardiac activity have been developed. PCG refers to the recording of blood vibrations generated from the heart and blood vessel walls owing to myocardial contraction, valve closure, and blood flow changes, with a microphone or smart stethoscope. Conversely, PPG is a signal expressed by irradiating peripheral blood vessels with light, and measuring and recording the amount of light absorbed, exploiting the characteristic that a significant amount of blood is released into the arterial system during each systole of the heart. It can be measured on fingers, toes, and earlobes. Smart stethoscopes have been introduced as commercial wearable devices to measure and record PCG in mobile environments. One example is the eKuore smart stethoscope (eKuore, Valencia, Spain) [8], which was developed for convenient use at home. In addition, the StethoMe Stethoscope (StethoMe, Poznań, Poland) [9], was developed to conveniently measure and observe heart sounds in daily life. PPG-based cardiac activity monitoring can be performed with the Galaxy Watch Active 2 (Samsung Electronics, Seoul, South Korea) [10]. The PPG sensor is a widely used sensor because it can be built into various smartwatches. However, in cardiac activity analysis, methods to eliminate noise from PCG and PPG signals and robust algorithms that can recognize peaks are required, and the development of PCG and PPG processing is underway.
This study focuses on a heart sound analysis method that can be monitored daily using multimodal signals acquired with wearable devices, as in Figure 1. The proposed dual deterministic model-based heart sound analysis (DDM-HSA) is designed in a parallel structure that can simultaneously measure and process two bio-signals (PCG and PPG signals) related to the heartbeat, enabling more accurate identification of heart sounds. Our proposed DDM-HSA method has the following advantages:
  • A parallel structure algorithm based on multimodal methods improves the accuracy of heart sound detection.
  • The proposed DDM-HSA outperforms existing heart sound detection methods using a single signal.
  • By interpolating the S2 peak using the envelope filtering method, S2 detection accuracy can be improved.
The remainder of this paper is organized as follows. Section 2 introduces the database and presents the proposed DDM-HSA method using PCG and PPG signals. The detection performance of the proposed method is demonstrated and discussed in Section 3. Finally, Section 4 concludes the paper.

2. Materials and Methods

This section describes the database and methodology used in this study.

2.1. Database

2.1.1. Acquisition System and Protocol

A multi-channel physiological data acquisition system, MP160 (BIOPAC System, Inc., Goleta, CA, USA) with a contact acoustic transducer (TSD108A) and PPG sensor (TSD200C) was used to collect the PPG and PCG signals. The acoustic transducer TSD108A was attached to the surface of the chest to measure heart sounds, as shown in Figure 2a,b. In addition, the PPG sensor TSD200C was attached to the middle finger of the left hand to collect PPG signals, as shown in Figure 3. The sampling rate of the data was set at 2000 Hz.
To obtain quantitative data from the subjects, they were required to perform a static activity (rest), as shown in Figure 2. All tasks were performed for 5 min, and the subjects were asked to rest in a chair.

2.1.2. Subjects

The study was conducted with twenty healthy subjects (ten males and ten females, mean ± standard deviation age of 27.4 ± 3.2 years), as shown in Table 1. Subjects were asked to avoid caffeine-containing beverages and nicotine for 4 h before the experiments [11]. They were also asked to abstain from alcohol and vigorous exercise one day before each experiment [12].
All subjects reported no history of cardiac or psychiatric disorders. In this study, subjects were excluded from the experiment if they reported the following: (1) a history of previous injury or heart repair; (2) being outside the range of 20 to 35 years old; (3) a history of depression, insomnia, or chronic stress; (4) presence of any medical condition that can hinder the subject from performing the exercise; (5) and pregnancy.
Each subject was briefed on the purpose of the study. The subjects provided written informed consent before participating in the experimental procedures, and the researchers worked to ensure their safety. Ethical review and approval were waived for this study due to only observational equipment being used to observe the signal from the surface of the subject. This study was conducted without any invasive activation, drug administration, or blood collection. In addition, no vulnerable subjects participated in this study, and no personal identification information was collected.

2.2. Noise Reduction

A block diagram illustrating the filtering of PPG and PCG signals is shown in Figure 4a. The MP160 (BIOPAC System, Inc., Goleta, CA, USA) commercial device collected raw data by using a 2 kHz sampling rate and a 16 bit analog-to-digital converter. In particular, noise from PPG and PCG was eliminated through the built-in bandpass filter of the MP160 device. All data were acquired using an MP160 device, as shown in Figure 4b. In particular, the PCG signal filtered through the bandpass filter at 20–200 Hz differed from its raw PCG data, and the waveforms of the heart sound S1 and S2 were observed to be precise.

2.3. DDM-HSA

2.3.1. Overview

This study proposes a DDM-HSA through S1 and S2 automatic identification algorithms using vascular transit time (VTT). The proposed DDM-HSA algorithm was designed in a parallel structure to identify heart sounds by simultaneously processing heart sounds and pulse waves through VTT. The overall flowchart of the DDM-HSA proposed in this study is shown in Figure 5a. It is designed to identify S1 through preprocessed heart sounds and to recognize S2 waveforms around the systolic peak of pulse waves using VTT. An example of the graphs corresponding to the signal-processing steps in the flowchart is shown in Figure 5b. A detailed description of the proposed method is provided in this subsection.

2.3.2. PCG Analysis

PCG analysis is the first step in the DDM-HSA method, and it recognizes S1 and S2 for heart sound analysis and is divided into five main steps: transformation into an analytic signal, envelope filtering, Shannon entropy, normalization, and heart sound detection.
Analytic signal s a ( t ) is the nonnegative frequency component of the original signal of s ( t ) [13]. The analytic signal s a ( t ) can be defined as follows:
s a ( t ) = s ( t ) + j s ( t ) ^ ,
where j is the imaginary unit and s ( t ) ^ is the Hilbert transform of s ( t ) . The real-valued s a ( t ) with Fourier transform S a ( f ) can be defined as follows:
S a ( f ) = { 1 2 S ( f ) ,   i f ( f > 0 ) , S ( f ) ,   i f ( f = 0 ) , 0 ,   i f ( f < 0 )
Using the aforementioned functions, S a ( f ) can be defined as follows:
S a ( f ) = 1 2 S ( f ) + s g n ( f ) 1 2 S ( f ) ,
where s g n ( f ) is a sign function, f is the frequency (that is eliminating the negative frequency component by defining the frequency below 0 as zero), and s a ( t ) is an analytic signal. The analytic signal s a ( t ) is the inverse Fourier transform of S a ( f ) expressed as follows:
s a ( t ) = F ˜ { S ( f ) } + F ˜ { s g n ( f ) } F ˜ { S ( f ) } = s ( t ) + j [ 1 π t s ( t ) ] = s ( t ) + j s ( t ) ^ ,
Because multiplication in the time domain is equivalent to convolution in the frequency domain and vice versa, the inverse Fourier transform of s g n ( f ) and S ( f ) is a convolution. Currently, the obtained 1 / π t s ( t ) is equivalent to s ^ ( t ) obtained by performing the Hilbert transform of s ( t ) . The Hilbert transform generates a signal whose phase is delayed (shifted) by 90° (quadrature-phase) from the original signal (in-phase); however, energy remains unchanged, because a phase shift is not related to the energy of the signal; only the amplitude changes [14]. Therefore, it is also called a quadrature filter; therefore, the Hilbert transform is performed on s ( t ) , and an analytic signal s a ( t ) can be obtained. If ω > 0, then s ( t ) is cos ( ω t ) , and its Hilbert transform s ^ ( t ) is defined as cos ( ω t π / 2 ) , that is, sin ( ω t ) . Thus, the Hilbert transform extracts the analytic signal s a ( t ) = s ( t ) + j s ^ ( t ) . Equation (4) is converted to Euler’s formula, and we obtain the following:
s a ( t ) = cos ( ω t ) + j s i n ( ω t ) = A ( t ) · cos ( ω t ) + A ( t ) · j s i n ( ω t ) = e j ω t ,
where A ( t ) and c o s ( ω t ) are the instantaneous amplitude and phase, respectively, obtained by converting the original signal comprising real and imaginary signals. The envelope (energy) and phasor can be extracted.
Envelope filtering (EF) is the task of making the S1 and S2 peaks of the PCG signal more distinct. The heart sound is measured using a microphone or an acoustic device. The noise factor is high, making heart sound identification challenging. This is a critical factor that influences the recognition performance of heart sound identification. In addition, the amplitude of S2 is relatively lower than that of S1. There is a problem, because S2 is regarded as noise (Figure 6). Therefore, we additionally implemented EF to enhance the desired heart sound peaks (S1 and S2) from PCG signals. To calculate the EF of the PCG, we modified the original SEF method proposed in [15] and redefined the EF method for PCG processing.
First, the envelope m ( t ) and phase of the signals c o s ϕ ( t ) were extracted from the real signal s ( t ) . The envelope m ( t ) was filtered using a bandpass filter (cut-off frequency of 20–200 Hz) to filter the S1 and S2 peaks. The filtered envelope is denoted as m f ( t ) . Here, we determine the threshold to adjust the elevation of the S2 waveform as follows:
T h r e s h o l d   ( t ) = m f i l t ( t ) + δ · m f i l t ¯
where m f i l t ¯ and δ ¯ represent the mean of the filtered envelope m f ( t ) and high-handed heuristic value, respectively. Hence, if the filtered signal m f ( t ) is higher than the threshold, the filtered signal m f ( t ) is converted by the amplitude of S1, and the final envelope filtered signal z ( t ) is obtained by multiplying m f ( t ) by c o s ϕ ( t ) .
Shannon Entropy (SE) calculation is a task that enhances the peak of the EF signal. To detect heart sound waveforms, the location of S1, S2, and their peaks must be identified. This study used the Savitzky–Golay filter, because it has been proven to be more advantageous in preserving peaks compared with other envelope methods [16,17]. The Shannon entropy S E ( t ) can be obtained as follows:
S E ( t ) = ( S D ( t ) ^ ) l o g ( S D ( t ) ^ ) .
S E ( t ) was smoothed using a Savitzky–Golay filter to cover the signal S D ( t ) . The filter window size was 900, which is approximately in the interval between the S1 and S2 waveforms. In addition, the degree of the Savitzky–Golay filter was selected as 3, and these parameters were all determined based on empirical and heuristic validation using preliminary data.
Normalization refers to the process of making the value of the filtered envelope scale from 0 to 1 using a min–max method.
S1 and S2 start-point detection refers to the step of primarily detecting the peaks of S1 and S2 in the PCG signals. The normalized signal is converted into an impulse signal as follows:
I m p u l s e ( t ) { 1 ,             σ × N S E ( t ) ¯ < N S E ( t ) 0 ,                 O t h e r w i s e .
where σ and N S E indicate the constant value (added to the mean value of S E ( t ) ) and the normalized SE, respectively. An alpha value of 5 was adopted, using the value set in [18] to determine the threshold for heart sound identification. This was expressed as an impulse signal, and the heart sound time points were extracted. Here, the S2 peak was more accurately detected using the blood VTT based on PPG, which is presented in the next subsection.

2.3.3. PPG Analysis

The heart sound S2, measured during the heart’s diastole, was detected near the systolic peak of the pulse wave when observing the heart sound and the pulse wave. This is due to the fact that a considerable amount of time (called the VTT) is required for blood to flow from the heart to other body parts [19,20,21]. Generally, heart sound S1 is transmitted from the heart to the peripheral blood vessels of the finger after approximately 500 ms. On the other hand, heart sound S2 occurs at approximately 20 ms, at which the systolic peak of the pulse wave is detected [22]. Therefore, PPG was additionally used to precisely recognize S2 obtained primarily from PCG and to revalidate the peak of S2 measured within 20 ms from the systolic peak of PPG. PPG analysis is the second main step of the DDM-HSA method and serves to recognize S2 more precisely.

2.3.4. VTT Calculation and S2 Detection

In order to generate the final DDM-HSA model to improve the accuracy of S1 and S2, the formula (9) can be defined using the heart sound points, the systolic peak points, and the VTT.
H S s p = { S 1                               ( T p p g _ p e a k V T T   T p c g _ s p   T p p g _ p e a k β ) S 2                                                                                         ( o t h e r w i s e )                                                                  
where H S s p is the time points S1 and S2; T p p g _ p e a k is the time at which the systolic peak of the pulse and T p c g _ s p is the time at which the heart sound time points, respectively, are detected. VTT is the time at which blood pumped out through heart contraction is delivered to the peripheral blood vessels and is typically observed as roughly 200 ms. β was set at 100 ms, assuming that S2 occurs about 300 ms after VTT and S1 occurs [23].

2.4. Performance Measures

The proposed heartbeat sound detection method in this study was evaluated using three performance measures: accuracy (ACC), sensitivity (SEN), and specificity (SPE), which were computed from the following four parameters:
  • True positive (TP): actual heartbeat correctly detected as an actual heartbeat.
  • False negative (FN): not heartbeat detected as not heartbeat.
  • True negative (TN): not heartbeat correctly detected as an actual heartbeat.
  • False positive (FP): actual heartbeat detected as not heartbeat.
ACC is the ratio of the correctly predicted observation to the total observations using (10), and SEN is the ratio of the TP correctly detected to the number of true beats by using (11). SPE refers to the ratio of the TN detected to the total of TN and FP by using formula (12).
Accuracy   ( ACC ) = ( TP + TN ) / ( TP + TN + FP + FN ) × 100 .
Sensitivity   ( SEN ) = TP / ( TP + FN ) × 100 .
Specificity   ( SPE ) = TN / ( TN + FP ) × 100 .

3. Results

Three models were generated to evaluate the heart sound analysis algorithm, as listed in Table 2. Model I is the basic model of DDM-HSA, and Model II is a model in which the DDM-HSA method is applied after applying a window to separate the acquired signal for each segment.

3.1. Comparison Result of Models

In the case of S1 peak, because the amplitude of the peak was higher than that of S2, and the shape of the peak was clear, S1 outperformed S2 in terms of accuracy in all models. A comparison of the results for the three models is shown in Figure 7. Eventually, Model III had the highest performance, and S1 and S2 showed average accuracy (unit: %) of 95.39 (±2.14) and 92.55 (±3.74), respectively. The proposed basic model of DDM-HSA was applied in Model I, and the detection performances (unit: %) of S1 and S2 were 87.12 (±4.99) and 63.96 (±13.18), respectively. The data segmentation method was applied to the existing DDM-HSA method in Model II, and the detection accuracies (unit: %) of S1 and S2 were 94.56 (±1.79) and 80.6 (±8.07), respectively.
When comparing the average accuracies of the three models, the performance of Model III was the most robust. This is because Model III applied the EF method, and as the S2 peak was interpolated to the S1 peak, the S2 wave became more apparent. Consequently, the average accuracy difference compared with Model II without EF was approximately 11.95%.

3.2. Result of Model I (DDM-HSA)

The performances off all subjects of S1 peak detection in Model I are listed in Table 3. The sensitivity (SEN, unit: %), precision (PRE, unit: %), specificity (SPE, unit: %), and accuracy (ACC, unit: %) of S1 detection were 92.25 (±5.07), 84.05 (±6.43), 81.98 (±8.24), and 87.12 (±4.99), respectively.
As shown in Table 4, the S2 detection performance of Model I was significantly lower than that of S1. This is because the amplitude of the S2 peak needs clarification compared with that of S1; thus, the S1 peak interfered with the S2 peak. In summary, when Model I was applied, the detection performance of S2 was 56.45 (±19.68), 73.83 (±14.47), 72.49 (±12.52), and 63.96 (±13.18) in SEN, PRE, SPE, and ACC, respectively.

3.3. Effect of the Window (Model II)

The performance of S1 and S2 to which Model II is applied can be seen in Table 5 and Table 6 to analyze the results of the effect of the data segment. In data segmentation, a particular window is applied to the entire signal to subdivide the data. In the case of S1 (Table 5), the performance of Model II in terms of SEN, PRE, SPE, and ACC was 97.27 (±1.93), 92.37 (±3.03), 91.85 (±3.50), and 94.56 (±1.79), respectively. In the case of S2, SEN, PRE, SPE, and ACC of Model II were 78.39 (±9.14), 88.68 (±6.64), 83.22 (±8.45), and 80.60 (±8.07), respectively. The average accuracy of Model II was improved by approximately 16.64% compared with that of Model I.
The accuracy of Model I differs from that of Model II because when the entire signal is detected simultaneously, the amplitudes of the peaks differ; a peak with a low amplitude is regarded by other factors as noise. In summary, when a heart sound is detected after a data segment by applying a window to the overall acquired signal, the detection of peaks can be prevented from interfering with each other.

3.4. Effect of the Envelope Filtering (Model III)

The indicators of the overall performance of Model III are listed in Table 7 and Table 8. Model III is a DDM-HSA model in which envelope filtering is applied. In summary, the S1 detection performance was 96.88 (±2.79), 94.14 (±2.67), 93.90 (±2.88), and 95.39 (±2.14) in terms of SEN, PRE, SPE, and ACC, respectively. In Model III, detection accuracy was improved by about 8.27 and 0.83 compared to that of Model I and Model II, and there was no significant difference from Model II. Since the envelope filtering is a method of interpolating the relatively low amplitude of S2 according to the amplitude of S1, the performance of S1 did not have a significant effect on envelope filtering.
On the other hand, the performance of S2 in Table 8 was 93.08, 95.12, 90.59, and 92.48 in terms of SEN, PRE, SPE, and ACC, respectively, showing the best performance among the models. This is because Model III, to which the envelope filtering technique is applied, improves the detection performance of S2 by interpolating the amplitude of S2 as much as that of S1.

4. Discussion

This paper proposes a novel DDM-HSA method for heart sound analysis in PCG and PPG signals. Our database’s reliable and promising performance demonstrates that the proposed method achieves robust heart sound detection. However, some issues remain to be discussed. In addition, we will discuss the performance of the proposed method compared to other approaches.

4.1. Comparison with Other Approaches for Heart Sounds Analysis

Several studies have proposed a novel approach for heart sounds analysis and are summarized in Table 9. Although many studies have been conducted, it is difficult to directly compare the methods with our work because the database used to validate the algorithms is totally different. Therefore, in this subsection comparing other approaches, we focus on and discuss the key contributions of each study.
The study by Giordano et al. [18] presented a method with robust performance (S1: 99.6%, S2: 98.9%) by measuring the timing of heart sound components in ECG and PCG signals. The most significant difference between the study in [18] and our study is the use of ECG signals. In the study in [18], the ECG signal was used to enhance the performance of heart sound analysis. However, since the ultimate goal of our research is heart sound analysis for the mobile environment, we used PPG, which is data that can be obtained using a wearable device.
On the other hand, the study by Babu et al. [24] showed a robust performance of heart sound analysis (S1: 100%, S2: 100%) using PPG and PCG signals. They used a PCG but with a microphone condenser attached to the stethoscope’s head. In our study, the most significant difference is that the microphone was attached to the clothes, so that the user could measure the heart sounds without being constrained, as much as possible. The study by Babu et al. [24] showed successful performance because they pursued robust and reliable performance of heart sound analysis.
Finally, the study by Huang [25] focused on ‘Liveness Detection,’ collecting gyroscopes and acoustic signals. Although they also analyzed heart sounds, the results from S1 and S2 were not reported, and it is not easy to make a direct comparison with our study because they used different data from the signals used in our study. However, the study by Huang [25] successfully implemented liveness detection using deep-breath recognition and various feature extraction methods.

4.2. Limitations

We presented a novel approach to detect heart sounds S1 and S2 using our database collected from 20 subjects. Our results show promising performance using only PPG and PCG signals for mobile environments. However, there is a limitation in that our data were collected in a controlled laboratory environment, and it is still not easy to generalize our results, because our data need to be more comprehensive. Therefore, the proposed method should be improved based on a larger group of samples for practical use for patients or subjects needing heart sound analysis.
Although we conducted a test experiment of a heart sound analysis algorithm operated in a mobile environment in this paper, the ultimate primary purpose of the proposed algorithm is to apply it to the hardware developed in our preliminary study as Figure 8 [1]. The proposed algorithm was tested based on the data collected from a few subjects to be applied to the smart stethoscope. In addition, we will collect more data in a real-time environment with algorithms applied to smart stethoscopes in the future and generalize the results to overcome the limitations of this paper.

5. Conclusions

The study provided a novel approach to improve the technology for detecting heart sounds and analyzing cardiac activity using only bio-signals that can be measured through wearable devices in a mobile environment. The contributions of this study can be summarized as follows:
  • We contributed to the analysis of heart sounds in daily life by presenting a DDM-HSA that can utilize PPG and PCG, which that can be measured using wearable devices.
  • We proposed an envelope filtering method to improve the performance of S2 detection. By applying it to DDM-HSA (Model III), the performance of S2 improved by about 28.59% compared with that of the existing method (Model I).
For the scalability of the proposed approach, several well-defined studies must be considered in the future. As a future research direction, the proposed method should be considered for applications in wearable devices. In our previous studies [1,26], we developed a wearable device that can measure PCG and PPG. The proposed DDM-HSA method will be applied to the device, and clinical trials will be conducted for performance evaluation. This future study is expected to have a significant impact on the analysis of cardiovascular activities in mobile environments.

Author Contributions

Conceptualization, M.L. and Q.W.; methodology, Q.W., S.L. and M.L.; algorithm, S.L. and M.L.; validation, S.L. and M.L.; investigation, S.L. and H.P.; data curation, S.L. and M.L.; writing—original draft preparation, Q.W., M.L. and S.L.; writing—review and editing, H.P. and Q.W.; supervision, Q.W.; project administration, Q.W.; funding acquisition, Q.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Basic Research Grant Program of the Keimyung University in 2018 and by the “Leaders in Industry-university Cooperation 3.0” Project, supported by the Ministry of Education and National research Education of Korea.

Institutional Review Board Statement

Ethical review and approval were waived for this study due to only observational equipment being used to observe the signal from the surface of the subject. This study was conducted without any invasive activation, drug administration, and blood collection. In addition, no vulnerable subjects were engaged to participate in this study, and no personal identification information was collected.

Informed Consent Statement

Informed consent was obtained from all the subjects involved in the study. Participants at sites could withdraw from the study at any time.

Data Availability Statement

The dataset supporting the conclusions of this article is not available due to privacy and ethical reasons.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the study design, collection, analyses, or interpretation of data, writing of the manuscript, or decision to publish the results.

References

  1. Lee, S.; Wei, Q.; Park, H.; Na, Y.; Jeong, D.; Lim, H. Development of a finger-ring-shaped hybrid smart stethoscope for automatic s1 and s2 heart sound identification. Sensors 2021, 21, 6294. [Google Scholar] [CrossRef]
  2. Lee, M.; Song, T.G.; Lee, J.H. Heartbeat classification using local transform pattern feature and hybrid neural fuzzy-logic system based on self-organizing map. Biomed. Signal Process. Control 2020, 57, 101960. [Google Scholar] [CrossRef]
  3. Lee, M.; Lee, J.H. A robust fusion algorithm of LBP and IMF with recursive feature elimination-based ECG processing for QRS and arrhythmia detection. Appl. Intell. 2022, 52, 939–953. [Google Scholar] [CrossRef]
  4. Achenbach, S.; Daniel, W.G. Noninvasive coronary angiography—An acceptable alternative? N. Engl. J. Med. 2001, 345, 1909–1910. [Google Scholar] [CrossRef]
  5. Yıldırım, Ö.; Pławiak, P.; Tan, R.S.; Acharya, U.R. Arrhythmia detection using deep convolutional neural network with long duration ECG signals. Comput. Biol. Med. 2018, 102, 411–420. [Google Scholar] [CrossRef]
  6. Bharti, R.; Khamparia, A.; Shabaz, M.; Dhiman, G.; Pande, S.; Singh, P. Prediction of Heart Disease Using a Combination of Machine Learning and Deep Learning. Comput. Intell. Neurosci. 2021, 2021, 8387680. [Google Scholar] [CrossRef]
  7. Sraitih, M.; Jabrane, Y.; el Hassani, A.H. An automated system for ECG arrhythmia detection using machine learning techniques. J. Clin. Med. 2021, 10, 5450. [Google Scholar] [CrossRef]
  8. The Electronic Stethoscope for Telemedicine—Ekuore Smart Stethoscope. Available online: https://www.ekuore.com/electronicstethoscope-ld-coronavirus/ (accessed on 28 July 2022).
  9. A Smart Way of Asthma Monitoring—StethoMe Stethoscope. Available online: https://www.stethome.com/en-gb/ (accessed on 28 July 2022).
  10. A Smart Way of PPG Monitoring—Galaxy Watch Active. Available online: https://www.samsung.com/sec/watches/galaxy-watch-active2-r820/SM-R820NSSAKOO/ (accessed on 28 July 2022).
  11. Park, H.; Dong, S.-Y.; Lee, M.; Youn, I. The Role of Heart-Rate Variability Parameters in Activity Recognition and Energy-Expenditure Estimation Using Wearable Sensors. Sensors 2017, 17, 1698. [Google Scholar] [CrossRef] [Green Version]
  12. Nieman, D.C.; Austin, M.D.; Benezra, L.; Pearce, S.; McInnis, T.; Unick, J.; Gross, S.J. Validation of cosmed’s fitmate in measuring oxygen consumption and estimating resting metabolic rate. Res. Sports Med. 2006, 14, 89–96. [Google Scholar] [CrossRef]
  13. Melia, U.; Clariá, F.; Vallverdú, M.; Caminal, P. Filtering and thresholding the analytic signal envelope in order to improve peak and spike noise reduction in EEG signals. Med. Eng. Phys. 2014, 36, 547–553. [Google Scholar] [CrossRef] [Green Version]
  14. Abdulmunem, M.E.; Badr, A.A. Hilbert Transform and its Applications: A survey. Int. J. Sci. Eng. Res. 2017, 8, 699–704. [Google Scholar]
  15. Lee, M.; Park, D.; Dong, S.Y.; Youn, I. A novel R peak detection method for mobile environments. IEEE Access 2018, 6, 51227–51237. [Google Scholar] [CrossRef]
  16. Rastogi, N.; Mehra, R. Analysis of Savitzky-Golay filter for baseline wander cancellation in ECG using wavelets. Int. J. Eng. Sci. Emerg. Technol. 2013, 6, 15–23. [Google Scholar]
  17. Agrawal, S.; Gupta, A. Fractal and EMD based removal of baseline wander and powerline interference from ECG signals. Comput. Biol. Med. 2013, 43, 1889–1899. [Google Scholar] [CrossRef]
  18. Giordano, N.; Knaflitz, M. A novel method for measuring the timing of heart sound components through digital phonocardiography. Sensors 2019, 19, 1868. [Google Scholar] [CrossRef] [Green Version]
  19. Allen, J. Photoplethysmography and its Application in Clinical Physiological Measurement. Physiol. Meas. 2007, 28, R1. [Google Scholar] [CrossRef] [Green Version]
  20. Shelley, K.H. Photoplethysmography: Beyond the Calculation of Arterial Oxygen Saturation and Heart Rate. Anesth. Analg. 2007, 105, S31–S36. [Google Scholar] [CrossRef] [Green Version]
  21. Nam, D.H.; Lee, W.B.; Hong, Y.S.; Lee, S.S. Measurement of Spatial Pulse Wave Velocity by Using a Clip-Type Pulsimeter Equipped with a Hall Sensor and Photoplethysmography. J. Sens. Sci. Technol. 2013, 13, 4714–4723. [Google Scholar] [CrossRef]
  22. Elgendi, M.; Fletcher, R.; Liang, Y.; Howard, N.; Lovell, N.H.; Abbott, D.; Lim, K.; Ward, R. The use of photoplethysmography for assessing hypertension. NPJ Digit. Med. 2019, 2, 1–11. [Google Scholar] [CrossRef] [Green Version]
  23. Varghees, V.N.; Ramachandran, K. A novel heart sound activity detection framework for automated heart sound analysis. Biomed. Signal Process. Control 2014, 13, 174–188. [Google Scholar] [CrossRef]
  24. Babu, K.A.; Ramkumar, B.; Manikandan, M.S. Automatic identification of S1 and S2 heart sounds using simultaneous PCG and PPG recordings. IEEE Sens. J. 2018, 18, 9430–9440. [Google Scholar] [CrossRef]
  25. Huang, C.; Chen, H.; Yang, L.; Zhang, Q. BreathLive: Liveness detection for heart sound authentication with deep breathing. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 2018, 2, 12. [Google Scholar] [CrossRef]
  26. Park, H.; Wei, Q.; Lee, S.; Lee, M. Novel Design of a Multimodal Technology-Based Smart Stethoscope for Personal Cardiovascular Health Monitoring. Sensors 2022, 22, 6465. [Google Scholar] [CrossRef]
Figure 1. The overall conceptual picture of the method proposed in this paper.
Figure 1. The overall conceptual picture of the method proposed in this paper.
Sensors 23 02423 g001
Figure 2. Pictures of experiment for data acquisition: (a) Acoustic transducer TSD108A (BIOPAC System, Inc., Goleta, CA, USA); (b) PPG sensor TSD200C (BIOPAC System, Inc., Goleta, CA, USA); (c) Experimental environment.
Figure 2. Pictures of experiment for data acquisition: (a) Acoustic transducer TSD108A (BIOPAC System, Inc., Goleta, CA, USA); (b) PPG sensor TSD200C (BIOPAC System, Inc., Goleta, CA, USA); (c) Experimental environment.
Sensors 23 02423 g002
Figure 3. Example of plots of simultaneously recorded ECG, PCG, and PPG signals. The sample data in this plot is for Subject ID XY008, included in the Keimyung University-Heart Sounds Database (KMU-HSDB). The partially extracted data for the graph is from 20,000 to 40,999 of the original data based on the sample number, and the data length is 10 s (sampling rate is 2000 Hz). The top black, middle blue, and bottom red plots represent the raw data of ECG, PCG, and PPG, respectively. The x-axis represents time, measured in seconds; the y-axis is the amplitude of each signal, and the unit is volts (V).
Figure 3. Example of plots of simultaneously recorded ECG, PCG, and PPG signals. The sample data in this plot is for Subject ID XY008, included in the Keimyung University-Heart Sounds Database (KMU-HSDB). The partially extracted data for the graph is from 20,000 to 40,999 of the original data based on the sample number, and the data length is 10 s (sampling rate is 2000 Hz). The top black, middle blue, and bottom red plots represent the raw data of ECG, PCG, and PPG, respectively. The x-axis represents time, measured in seconds; the y-axis is the amplitude of each signal, and the unit is volts (V).
Sensors 23 02423 g003
Figure 4. Noise reduction: (a) block diagram for filtering (b) Example of data obtained through MP160 (BIOPAC System, Inc., Goleta, CA, USA) (Subject ID: XY08). The red, blue, green, and magenta solid lines illustrate the ECG, PPG, raw PCG, and filtered PCG signals (bandpass filter with 20–150 Hz), respectively.
Figure 4. Noise reduction: (a) block diagram for filtering (b) Example of data obtained through MP160 (BIOPAC System, Inc., Goleta, CA, USA) (Subject ID: XY08). The red, blue, green, and magenta solid lines illustrate the ECG, PPG, raw PCG, and filtered PCG signals (bandpass filter with 20–150 Hz), respectively.
Sensors 23 02423 g004
Figure 5. The proposed method of DDM-HSA for daily life monitoring: (a) flowchart (b) processing example (blue-solid line: PCG; red-solid line: PPG).
Figure 5. The proposed method of DDM-HSA for daily life monitoring: (a) flowchart (b) processing example (blue-solid line: PCG; red-solid line: PPG).
Sensors 23 02423 g005
Figure 6. Example of comparison of the filtered signal m f i l t ( t ) and the filtered envelope signal z ( t ) .
Figure 6. Example of comparison of the filtered signal m f i l t ( t ) and the filtered envelope signal z ( t ) .
Sensors 23 02423 g006
Figure 7. Performance comparison of Model I, Model II, and Model III.
Figure 7. Performance comparison of Model I, Model II, and Model III.
Sensors 23 02423 g007
Figure 8. Our preliminary research [1] of hardware development for heart sound analysis in mobile environments. Reprinted/adapted with permission from Ref. [1]. 2021, Sensors.
Figure 8. Our preliminary research [1] of hardware development for heart sound analysis in mobile environments. Reprinted/adapted with permission from Ref. [1]. 2021, Sensors.
Sensors 23 02423 g008
Table 1. Description of subject participating in this study.
Table 1. Description of subject participating in this study.
Subject IDSexAge
S1F32
S2F24
S3F31
S4F28
S5F26
S6F30
S7F28
S8F28
S9F28
S10M29
S11M30
S12M28
S13M34
S14M24
S15M25
S16M31
S17M26
S18M25
S19M27
S20M22
TOTALM = 10, F = 1027.4 (3.2)
The numbers in parentheses indicate standard deviations.
Table 2. Description of the Model.
Table 2. Description of the Model.
Estimation ModelsDescription
Model IDDM-HSA
Model IIDDM-HSA + window
Model IIIDDM-HSA + window + envelope filter (EF)
Table 3. Performance of the S1 peak detection in Model I.
Table 3. Performance of the S1 peak detection in Model I.
Sub. No.Mean HR [bpm]TPTNFPFNSENPRESPEACC
161.50353.00255.00114.0016.0095.6675.5969.1182.38
277.50453.00385.0080.0012.0097.4284.9982.8090.11
376.67386.00322.00138.0074.0083.9173.6670.0076.96
477.17376.00363.00100.0087.0081.2178.9978.4079.81
573.33419.00366.0074.0021.0095.2384.9983.1889.20
670.50355.00300.00123.0068.0083.9274.2770.9277.42
759.67342.00318.0040.0016.0095.5389.5388.8392.18
872.33411.00379.0055.0023.0094.7088.2087.3391.01
968.33388.00315.0095.0022.0094.6380.3376.8385.73
1059.17343.00251.00104.0012.0096.6276.7370.7083.66
1161.17318.00300.0067.0049.0086.6582.6081.7484.20
1267.33375.00362.0042.0029.0092.8289.9389.6091.21
1363.17355.00315.0064.0024.0093.6784.7383.1188.39
1467.00399.00289.00113.003.0099.2577.9371.8985.57
1573.00414.00367.0071.0024.0094.5285.3683.7989.16
1663.67355.00327.0055.0027.0092.9386.5985.6089.27
1786.67488.00486.0034.0032.0093.8593.4993.4693.65
1876.67422.00438.0022.0038.0091.7495.0595.2293.48
1980.17413.00448.0033.0068.0085.8692.6093.1489.50
2062.33355.00314.0060.0019.0094.9285.5483.9689.44
AVG70.00386.00345.0074.2033.2092.2584.0581.9887.12
STD7.7541.7761.5233.1823.475.076.438.244.99
HR indicates the heart rate. TP, TN, FP, and FN mean the true positive, true negative, false positive, and false negative, respectively. SEN, PRE, SPE, and ACC indicate the sensitivity, precision, specificity, and accuracy, respectively.
Table 4. Performance of the S2 peak detection in Model I.
Table 4. Performance of the S2 peak detection in Model I.
Sub. No.Mean HR [bpm]TPTNFPFNSENPRESPEACC
161.50310268275984.0191.9990.8587.05
277.5022618716123948.6058.4053.7450.80
376.6725119911120954.5769.3464.1958.44
477.1726724313119657.6767.0964.9760.93
573.33386300405487.7390.6188.2487.95
670.502811385814266.4382.8970.4167.69
759.67260111439872.6385.8172.0872.46
872.332261889920852.0769.5465.5157.42
968.331093115830126.5965.2784.2853.92
1059.171702972418547.8987.6392.5269.08
1161.17252505711568.6681.5546.7363.71
1267.33832439532120.5446.6371.8943.94
1363.1710521110127427.7050.9767.6345.73
1467.001942985620848.2677.6084.1865.08
1573.002521369418657.5372.8359.1358.08
1663.671902419319249.7467.1472.1660.20
1786.67424255599681.5487.7881.2181.41
1876.67375168438581.5289.7179.6280.92
1980.172992135418262.1684.7079.7868.45
2062.3312419712825033.1649.2160.6245.92
AVG70.00239.20212.7076.60180.0056.4573.8372.4963.96
STD7.7594.1469.2637.7477.3319.6814.4712.5213.18
HR indicates the heart rate. TP, TN, FP, and FN represent true positive, true negative, false positive, and false negative, respectively. SEN, PRE, SPE, and ACC represent the sensitivity, precision, specificity, and accuracy, respectively.
Table 5. Performance of the S1 peak detection in Model II.
Table 5. Performance of the S1 peak detection in Model II.
Sub. No.Mean HR [bpm]TPTNFPFNSENPRESPEACC
161.5036631059399.1986.1284.0191.60
277.50451427381496.9992.2391.8394.41
376.67434403572694.3588.3987.6190.98
477.17424420433991.5890.7990.7191.14
573.3343141228997.9593.9093.6495.80
670.50411368551297.1688.2087.0092.08
759.6735033226897.7793.0992.7495.25
872.33422401331297.2492.7592.4094.82
968.3340436743698.5490.3889.5194.02
1059.1735131738498.8790.2389.3094.08
1161.1736334027498.9193.0892.6495.78
1267.33388386181696.0495.5795.5495.79
1363.1737134633897.8991.8391.2994.59
1467.0040035943299.5090.2989.3094.40
1573.0043440038499.0991.9591.3295.21
1663.6737735725598.6993.7893.4696.07
1786.67500507132096.1597.4797.5096.83
1876.67438449112295.2297.5597.6196.41
1980.17464464171796.4796.4796.4796.47
2062.3336634826897.8693.3793.0595.45
AVG70.00407.25385.6533.5511.9597.2792.3791.8594.56
STD7.7540.8251.1913.949.321.933.033.501.79
HR indicates the heart rate. TP, TN, FP, and FN represent the true positive, true negative, false positive, and false negative, respectively. SEN, PRE, SPE, and ACC represent the sensitivity, precision, specificity, and accuracy, respectively.
Table 6. Performance of the S2 peak detection in Model II.
Table 6. Performance of the S2 peak detection in Model II.
Sub. No.Mean HR [bpm]TPTNFPFNSENPRESPEACC
161.50333268113690.2496.8096.0692.75
277.50400187266586.0293.9087.7986.58
376.67388199437284.3590.0282.2383.62
477.173002438616364.7977.7273.8668.56
573.33411300112993.4197.3996.4694.67
670.50355138366883.9290.7979.3182.58
759.67311111164786.8795.1187.4087.01
872.33343188349179.0390.9884.6880.95
968.33352311245885.8593.6292.8488.99
1059.17264297249174.3791.6792.5282.99
1161.1731150165684.7495.1175.7683.37
1267.332742435513067.8283.2881.5473.65
1363.172552116012467.2880.9577.8671.69
1467.002942985610873.1384.0084.1878.31
1573.002941368814467.1276.9660.7164.95
1663.672642417811869.1177.1975.5572.04
1786.67434255548683.4688.9382.5283.11
1876.67399168236186.7494.5587.9687.10
1980.173442135413771.5286.4379.7874.47
2062.332541973412067.9188.1985.2874.55
AVG70.00329.00212.7041.4590.2078.3988.6883.2280.60
STD7.7555.8869.2624.0938.519.146.648.458.07
HR indicates the heart rate. TP, TN, FP, and FN represent the true positive, true negative, false positive, and false negative, respectively. SEN, PRE, SPE, and ACC represent sensitivity, precision, specificity, and accuracy, respectively.
Table 7. Performance of S1 peak detection in Model III.
Table 7. Performance of S1 peak detection in Model III.
Sub. No.Mean HR [bpm]TPTNFPFNSENPRESPEACC
161.50359349201097.2994.7294.5895.93
277.504574641898.2899.7899.7899.03
376.67439427332195.4393.0192.8394.13
477.17433425383093.5291.9391.7992.66
573.33413422182793.8695.8295.9194.89
670.50423386370100.0091.9691.2595.63
759.67323336223590.2293.6293.8592.04
872.3343340331199.7793.3292.8696.31
968.33384370402693.6690.5790.2491.95
1059.17336319361994.6590.3289.8692.25
1161.1736134720698.3794.7594.5596.46
1267.3340138816399.2696.1696.0497.65
1363.17362342371795.5190.7390.2492.88
1467.0040136042199.7590.5289.5594.65
1573.0043241919698.6395.7995.6697.15
1663.6737636022698.4394.4794.2496.34
1786.6750651191497.3198.2598.2797.79
1876.67434446142694.3596.8896.9695.65
1980.1747846318399.3896.3796.2697.82
2062.33374349250100.0093.7393.3296.66
AVG70.00406.25394.3024.9012.9596.8894.1493.9095.39
STD7.7547.3251.5711.3011.312.792.672.882.14
HR indicates the heart rate. TP, TN, FP, and FN represent true positive, true negative, false positive, and false negative, respectively. SEN, PRE, SPE, and ACC represent sensitivity, precision, specificity, and accuracy, respectively.
Table 8. Performance of the S2 peak detection in Model III.
Table 8. Performance of the S2 peak detection in Model III.
Sub. No.Mean HR [bpm]TPTNFPFNSENPRESPEACC
161.5036526822498.9294.3292.4196.05
277.50445187272095.7094.2887.3893.08
376.67446199431496.9691.2182.2391.88
477.1745724337698.7092.5186.7994.21
573.33411300222993.4194.9293.1793.31
670.50365138155886.2996.0590.2087.33
759.67345111211396.3794.2684.0993.06
872.33346188308879.7292.0286.2481.90
968.33369311244190.0093.8992.8491.28
1059.17305297185085.9294.4394.2989.85
1161.173665012199.7396.8380.6596.97
1267.33374243183092.5795.4193.1092.78
1363.17355211132493.6796.4794.2093.86
1467.00365298233790.8094.0792.8391.70
1573.00394136184489.9595.6388.3189.53
1663.6735124193191.8897.5096.4093.67
1786.6751925512199.8197.7495.5198.35
1876.67423168133791.9697.0292.8292.20
1980.1743221394989.8197.9695.9591.75
2062.3337219716299.4795.8892.4996.93
AVG70.00390.25212.7020.1028.9593.0895.1290.5992.48
STD7.7550.2669.268.9522.605.351.894.643.64
HR indicates the heart rate. TP, TN, FP, and FN represent the true positive, true negative, false positive, and false negative, respectively. SEN, PRE, SPE, and ACC represent sensitivity, precision, specificity, and accuracy, respectively.
Table 9. Comparison of the approaches for heart sound analysis.
Table 9. Comparison of the approaches for heart sound analysis.
PublicationDatabaseApproachHighest PerformanceKey Contributions
Giordano et al. [18]Own database
(ECG and PCG)
Measuring the timing of heart sound componentsS1: 99.6 (a)
S2: 98.9 (a)
(a) Sensitivity
Robust Performance
Babu et al. [24]Own database
(PPG and PCG)
Variational mode decomposition-based heart sound endpoint determinationS1: 100 (b)
S2: 100 (b)
(b) Accuracy
Robust Performance
Huang et al. [25]Own database
(Gyroscope and Acoustic signal)
Deep-breath detection and various feature extraction methods (duration ratio, amplitude ration, correlation coefficient)S1: N/R
S2: N/R
Liveness detection
This studyOwn database
(PPG and PCG)
Dual deterministic model with window, envelope filteringS1: 95.39 (c)
S2: 92.55 (c)
(c) Accuracy
Mobile Environments
N/R denotes not reported.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lee, M.; Wei, Q.; Lee, S.; Park, H. DDM-HSA: Dual Deterministic Model-Based Heart Sound Analysis for Daily Life Monitoring. Sensors 2023, 23, 2423. https://doi.org/10.3390/s23052423

AMA Style

Lee M, Wei Q, Lee S, Park H. DDM-HSA: Dual Deterministic Model-Based Heart Sound Analysis for Daily Life Monitoring. Sensors. 2023; 23(5):2423. https://doi.org/10.3390/s23052423

Chicago/Turabian Style

Lee, Miran, Qun Wei, Soomin Lee, and Heejoon Park. 2023. "DDM-HSA: Dual Deterministic Model-Based Heart Sound Analysis for Daily Life Monitoring" Sensors 23, no. 5: 2423. https://doi.org/10.3390/s23052423

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop