You are currently viewing a new version of our website. To view the old version click .
Applied Sciences
  • Article
  • Open Access

19 November 2022

Research on the Classification of ECG and PCG Signals Based on BiLSTM-GoogLeNet-DS

,
,
,
and
1
School of Electrical Engineering, Shenyang University of Technology, Shenyang 110870, China
2
College of Telecommunication and Electronic Engineering, Qiqihar University, Qiqihar 161006, China
*
Author to whom correspondence should be addressed.

Abstract

Because a cardiac function signal cannot reflect cardiac health in all directions, we propose a classification method using ECG and PCG signals based on BiLSTM-GoogLeNet-DS. The electrocardiogram (ECG) and phonocardiogram (PCG) signals used as research objects were collected synchronously. Firstly, the obtained ECG and PCG signals were filtered, and then the ECG and PCG signals were fused and classified by using a bi-directional long short-term memory network (BiLSTM). After that, the time-frequency processing was performed on the filtered ECG and PCG signals to obtain the time-frequency diagram of each signal; the one-dimensional signal was changed into a two-dimensional image signal, and the size of each image signal was adjusted to input the improved GoogLeNet network for classification. Then we obtained the two-channel classification results. The three-channel classification results, combined with the classification results of the above BiLSTM network, were finally obtained. The classification results of these three channels were used to make a decision via the fusion strategy of the improved D-S theory. Finally, we obtained the classification results. Taking 70% of the signals in the database as training data and 30% as test data, the obtained classification accuracy was 96.13%, the sensitivity was 98.48%, the specificity was 90.8%, and the F1 score was 97.24%. From the experimental results, the method proposed in this paper obtained high classification accuracy, and the classification effect was better than a cardiac function signal, which makes up for the low accuracy of the cardiac function signal for judging cardiac disease.

1. Introduction

The existing cardiovascular disease diagnosis and treatment technology means are singular, and the diagnosis and treatment processes are complex, which inconvenience a patient’s disease examination. Clinically, a doctor’s diagnosis about cardiovascular disease is mostly based on multi-modal detection data, such as PCG, ECG, cardiac color Doppler ultrasound, and so on. ECG and PCG signals play very important roles in the early prediction and diagnosis of heart disease. Although artificial auscultation is a convenient and low-cost cardiac diagnosis technology, doctors need to have very rich clinical experiences. It is easy to make mistakes in diagnoses according to the subjective judgments of doctors, especially for some complex heart diseases. An ECG examination is a common method used in modern cardiovascular diagnosis; when the patient has symptoms such as angina pectoris or myocardial infarction, the ECG signals will change significantly [1]. However, the examination of some diseases will still be less sensitive. When the patient asks the doctor for help, the doctor often makes a preliminary judgment about the heart disease according to the patient’s condition. For more serious heart diseases, other instruments need to be checked. This process may lead to the development of the disease, with the loss of the best opportunity for treatment. Moreover, in order to fully understand the health status of the patient’s heart, currently, only the detection method of an insertable catheter can be used in clinical practice, but this method is traumatic and risky to the patient, and requires the hospital to have high technical equipment.
Nowadays, most researchers carry out signal analyses and algorithm development for cardiac function signals, such as an ECG or PCG signal, while there are few related studies based on ECG and PCG signals. Although researchers have fused PCG, ECG, and cardiac impedance signals, most of them treat one kind of signal as the reference of another kind of signal. For example, a cardiac impedance signal is processed with the help of an ECG signal, the PCG signal extracts the feature points with the help of the ECG signal, or simply fuses the ECG and PCG signals to give the heart disease diagnosis result by only a cardiac function signal. For example, Rodion Stepanov et al. determined cardiac hemodynamic parameters by using the combined features of ECG, PCG, and cardiac impedance signals. The relevant parameters of cardiac impedance were obtained with the help of ECG and PCG signals [2]. Jan Nedoma et al. compared heart rate monitoring results by synchronizing the acquisition of the ballistic cardiography, electrocardiogram, and phonocardiogram signals [3]. Mónica Martin et al. improved heart disease diagnosis accuracy by acquiring ECG and PCG signals synchronously [4]. Han Li detected coronary artery disease by a dual-input neural network using electrocardiogram and phonocardiogram signals [5]. Abdelakram Hafid et al. synchronously acquired ECG and cardiac impedance signals with the least electrodes. The parameter calculation of a cardiac function signal was completed by means of an ECG signal [6]. Chakir Fatima et al. used different classifiers to classify the signals by the synchronously collected PCG and ECG signals and compared the classification results with the PCG signals [7]. Huan Zhang proposed the detection method of coronary artery disease, which made use of multi-modal features. The multi-modal feature models were proposed using ECG and PCG signals. Finally, the classification accuracies of these modes were compared [8].
Through the above-mentioned research, it can be found that people have gradually shifted the focus from cardiac function signals to more model cardiac function signals. The simultaneous research on multiple decision-making methods shows the advantages of multi-modal signal research and makes up for the flaw of single model signal diagnosis. Therefore, this paper integrates various technologies to realize the judgments of cardiac health statuses via ECG and PCG signals. This paper proposes the classification method of ECG and PCG signals based on BiLSTM-GoogLeNet-DS (combined with the fusion technology for deep learning). This method can extract the features of long-distance dependent information based on BiLSTM, which can improve the training time and learn the features better, and can produce higher accuracy for long-span time series data. The GoogLeNet model has also achieved excellent results in the field of image recognition with its multi-layer convolutional neural network structure and inception structure. This paper, combined with the advantages of these two deep learning networks, uses the BiLSTM network and GoogLeNet network to extract and classify the features of ECG and PCG signals. The final classification results were obtained by using the fusion technology twice with the help of the improved D-S theory. The experiments show that the proposed algorithm improves the classification accuracy of ECG and PCG signals effectively, and is more accurate than single-modal cardiac function signals.
The structure of this paper is organized as follows: the methodology of the proposed algorithm is presented in Section 2. Section 3 presents the related theory about the BiLSTM network and GoogLeNet model. Section 4 presents the implementation steps of the proposed algorithm. Section 5 presents the experiments and results. The experimental dataset and the parameter settings of the network model are presents. The comparative experiments are set. The evaluation criteria of the classification accuracy, sensitivity, specificity, precision, and F1 score are given. The comparison results with other algorithms are provided. Finally, the conclusions are drawn in Section 6.

2. Methodology

This paper mainly selects the PCG and ECG signals collected synchronously as the research objects. PhysioNet published a cardiovascular disease dataset containing both PCG and ECG signals in 2016 [9]. Based on this dataset, this paper proposes the classification method based on BiLSTM-GoogLeNet-DS to predict and classify the signals in the database. Firstly, the ECG and PCG signals are preprocessed, respectively, mainly for filtering. Then the processed signals are sent to the BiLSTM network to perform network fusion and classify the signals. After that, the filtered PCG and ECG signals are, respectively, subjected to wavelet transform to obtain the time-frequency diagram of each signal. Each time-frequency diagram is resized and converted into the image size suitable for GoogLeNet network training, and then classified through the improved GoogLeNet network to obtain the classification results of PCG and ECG signals, respectively. The three-channel classification results, combined with the classification results of the above BiLSTM network, are obtained. The classification decision is made by the fusion strategy of the improved D-S theory. Calculating the probability values of ECG and PCG signals belonging to different categories can lead to the basic probability assignment (BPA) function. The local recognition credibility of each feature is estimated by using the confusion matrix. The weighted correction is carried out when constructing the BPA of the classifier. Finally, the classification results are obtained. The following is the specific implementation process, as shown in Figure 1.
Figure 1. Classification method and implementation process of ECG and PCG signals.

4. Classification of ECG and PCG Signals Based on BiLSTM-GoogLeNet-DS

The ECG and PCG signal classification method based on BiLSTM-GoogLeNet-DS is proposed. Different fusion methods, combined with the network structure characteristics of the BiLSTM network and GoogLeNet network, are used to classify the ECG and PCG signals, which make up for the inaccuracy of disease judgment by using the cardiac function signal. Using the ECG and PCG signal classification, a better classification and diagnosis effect was obtained. The implementation steps of this algorithm are as follows:
Step 1: Obtain the ECG and PCG signals. This paper uses the PCG and ECG signals synchronously collected in the PhysioNet/CINC challenge 2016 heart sound database as the research object. Because the signal acquisition time in the database is different, the length of the signal was unified in the experiment, and 10,000 sampling points of each signal were intercepted for the experiment. The sampling frequency of the signal is 2000 Hz. The database includes 113 normal signals and 292 abnormal signals (405 signals in total). Figure 7 shows the synchronously collected PCG and ECG signals.
Figure 7. PCG and ECG signals.
Step 2: Preprocess the ECG and PCG signals. The main purpose is to filter the signals. The useful information of the ECG signal is concentrated below 20 Hz. If the ECG signal has strong power frequency interference, the signal will generally produce a peak at the power frequency of 50 or 60 Hz in the spectrum. The effective frequency of the PCG signal is concentrated below 300 Hz. In this paper, the ECG signal was mainly subjected to low-pass filtering with the cutoff frequency of 20 Hz; a high-pass filter with the cutoff frequency of 25 Hz and a low-pass filter with the cutoff frequency of 400 Hz were performed on the PCG signal. Since there is too much noise in the ECG signal, the denoising results of the ECG signal are given here. Figure 8 shows the comparison of the ECG signal before and after filtering.
Figure 8. Comparison before and after filtering: (a) ECG original signal; (b) ECG signal after filtering.
Step 3: Use the BiLSTM network for training to create the network model. The filtered signals are trained and classified with a total of 405 samples. Moreover, 70% samples are used for training and 30% samples are used for the test. Then the samples are readjusted according to the labels. The abnormal samples and normal samples are placed separately in a centralized manner and then reintegrated into a new dataset. This setting is convenient for network training and results comparison during the test. The integrated ECG and PCG data are spliced and fused as the input of the BiLSTM network. The optimal network model is obtained through training. Then the classification results are obtained. The BiLSTM network architecture is shown in Figure 9.
Figure 9. BiLSTM network structure.
Step 4: Convert the filtered one-dimensional ECG and PCG signals into a two-dimensional image. The filtered signal is subjected to the continuous wavelet transform to obtain the wavelet coefficients. Then the scale sequence is converted into the frequency sequence. Finally, the wavelet time- frequency diagram of each signal is obtained by combining the time series. The size of the time-frequency diagram is adjusted to 224 × 224, as shown in Figure 10.
Figure 10. Time-frequency diagrams of signals: (a) the time-frequency diagram of the ECG signal; (b) the time-frequency diagram of the PCG signal.
Step 5: Determine the GoogLeNet network structure and improve its structure. The two-dimensional properties of the time-frequency diagram of each signal obtained above are converted into a three-dimensional form 224 × 224 × 3, which is suitable for the input of the GoogLeNet network. Then 70% of the signals are used for training and 30% of the signals are used for the test. Then determine the GoogLeNet network structure, as shown in Figure 11. The original GoogLeNet network uses different inception modules to generate a total of 144 layers of the network structure. On the basis of retaining the original 140-layer structure, this paper improves the last four-layer structure. Using the new dropout layer, the fully-connected layer, the Softmax layer, and the classification layer replace the original layers. This network structure is used to train the PCG and ECG signals, respectively. The optimal network model is obtained by continuously adjusting the network parameters. Then the classification results of the two-channel signals are obtained.
Figure 11. GoogLeNet network structure.
Step 6: The three-channel classification results obtained in the above steps are fused at the decision level using the improved D-S theory. According to the obtained classification results, the BPA function under various classification methods is obtained. Then the ECG and PCG signals are fused and determined according to the improved D-S theory. The following is the implementation process.
(1)
Classification framework establishment
The above classification results are used as the evidence of ECG and PCG signal classifications. The classification framework is established, as shown in the formula.
Θ = { ω n o r m a l , ω a b n o r m a l }
ω n o r m a l represents the normal signal with a classification result, and ω a b n o r m a l represents the abnormal signal with a classification result. The subset of the framework Θ contains { { ω n o r m a l } , { ω a b n o r m a l } , { ω n o r m a l , ω a b n o r m a l } } .
Research studies that deposit large datasets in publicly available databases should specify where the data are deposited and provide the relevant accession numbers. If the accession numbers have not yet been obtained at the time of submission, please state that they will be provided during the review. They must be provided prior to publication.
(2)
The BPA function establishment
The normal PCG signal probability value p ( ω 1 n ) and the abnormal PCG signal probability value p ( ω 1 p ) obtained in Step 5 are used as the BPA function m 1 of the PCG signal. The normal ECG signal probability value p ( ω 2 n ) and the abnormal ECG signal probability value p ( ω 2 p ) are used as the BPA function m 2 of the ECG signal. The normal ECG and PCG signal probability value p ( ω 3 n ) and the abnormal ECG and PCG signal probability value p ( ω 3 p ) obtained in Step 3 are used as the BPA function m 3 of the ECG and PCG signals, as shown in the formula.
m 1 ( ω n o r m a l ) = p ( ω 1 n ) m 1 ( ω a b n o r m a l ) = p ( ω 1 p ) m 2 ( ω n o r m a l ) = p ( ω 2 n ) m 2 ( ω a b n o r m a l ) = p ( ω 2 p ) m 3 ( ω n o r m a l ) = p ( ω 3 n ) m 3 ( ω a b n o r m a l ) = p ( ω 3 p ) m 1 ( { ω n o r m a l , ω a b n o r m a l } ) = m 2 ( { ω n o r m a l , ω a b n o r m a l } ) = m 3 ( { ω n o r m a l , ω a b n o r m a l } ) = 0
According to the D-S theory, the probability function values m ( ω n o r m a l ) and m ( ω a b n o r m a l ) of the evidence are obtained, namely, the basic credibility function, as shown in Formula (14).
m ( ω n o r m a l ) = m 1 ( ω n o r m a l ) m 2 ( ω n o r m a l ) m 3 ( ω n o r m a l ) K m ( ω a b n o r m a l ) = m 1 ( ω a b n o r m a l ) m 2 ( ω a b n o r m a l ) m 3 ( ω a b n o r m a l ) K
K is called as the fusion index. The bigger the K value, the higher the support degree belonging to the same category.
K = m 1 ( ω n o r m a l ) m 2 ( ω n o r m a l ) m 3 ( ω n o r m a l ) + m 1 ( ω a b n o r m a l ) m 2 ( ω a b n o r m a l ) m 3 ( ω a b n o r m a l )
(3)
Implementation of the improved D-S theory algorithm.
The basic principle of the algorithm is to use the method based on local credibility to perform the weighted correction. The classification probability is corrected by the confusion matrix generated previously. Then construct the BPA function of the classifier.
For the classification problem in this paper, it represents the tendency to judge as normal or abnormal. P ω ( ω i | ω j ) represents the support degree of the target belonging to class ω i . Thus, P ω ( ω i | ω j ) is called the local credibility of the classifier for class ω i . The BPA function m ( ω i ) about the target belonging to ω i is:
m ( ω i ) = P i × P ω ( ω i ω j )
P ω ( ω i | ω j ) = ω i j j = 1 k ω i j
In the formula, P i is the probability of belonging to class ω i by the previous classification output results. P ω ( ω i | ω j ) is the local credibility information. ω i j represents the number of samples in which class ω i is judged to be class ω j . k is the category number of signals. The BPA functions are all obtained after the above-mentioned weighted-fusion correction [19,20,21,22,23,24,25].
Using the obtained BPA, information fusion is carried out according to Formulas (14) and (15).
Step 7: Obtain the final classification results according to the discriminant criterion by using the previous fusion results.
The discrimination criterion is as follows:
x i = 1 , x i > = ε x i = 1 , x i < ε
where ε is the set threshold value, x i is the final classification result, “−1” represents normal sample, and “1” represents the abnormal sample.

5. Experiments and Results

5.1. Dataset

The MIT PCG Database (MTTHSDB) contributes to this study. This dataset comes from the PhysioNet/CINC challenge 2016 heart sound classification competition on the PhysioNet website. PhysioNet acquire datasets (training-a to training-f) from different research institutes. The training-a dataset is provided by MIT and contains 409 samples. Among these 409 samples, 405 samples include ECG and PCG signals, simultaneously. There are 113 normal signals and 292 abnormal signals in the database. The signals are recorded at the sampling rate of 44.1 kHz. They are resampled at 2000 Hz in the post-processing. In the experiment, 70% of the signals in the database were used as the training set, and 30% were used as the test set [26].
The experiment was performed on the computer processor Intel (R) Celeron (R) CPU N3450 1.10 GHz and RAM 4.00 GB. The experimental software was MATLAB 2021a (Natick, MA, USA).

5.2. Parameter Settings of the Network Model

  • Parameter settings of the BiLSTM network model.
The main parameters in the model include the dimension of the input vector, the number of hidden layers, the related parameters of training, and so on. The optimal parameter settings of the BiLSTM network model are shown in Table 1.
Table 1. BiLSTM network model parameters.
2.
Parameter settings of the GoogLeNet network model.
The main parameters in the model include the dimension of the input vector, the related parameters of training, and so on. The optimal parameter settings of the GoogLeNet network model are shown in Table 2.
Table 2. GoogLeNet network model parameters.

5.3. Evaluation Metrics

To test the proposed algorithm, the classifications, accuracy, sensitivity, specificity, and F1 score are used to measure the performance. Since the signals are divided into normal and abnormal signals, the abnormal signals are regarded as positive and the normal signals are regarded as negative. If the positive signal is classified correctly, it is regarded as the true positive signal (TP), otherwise it is regarded as the false negative signal (FN); if the negative signal is classified correctly, it is regarded as the true negative signal (TN), otherwise it is regarded as the false positive signal (FP) [27,28].
A c c u r a c y = T P + T N T P + F P + F N + T N × 100 %
S e n s i t i v i t y = R e c a l l = T P T P + F N × 100 %
S p e c i f i c i t y = T N T N + F P × 100 %
P r e c i s i o n = T P T P + F P × 100 %
F 1 S c o r e = 2 × R e c a l l × P r e c i s i o n R e c a l l + P r e c i s i o n × 100 %

5.4. Comparative Experimental Setup

In the experiment, the following classification models are set to evaluate the model in this paper, and the classification performance of the BiLSTM-GoogLeNet-DS model is verified by comparing the models:
(1)
BiLSTM model.
The BiLSTM model is composed of two LSTMs in different directions, which can simultaneously capture the front and backward timing information and solve the long-distance dependency problem in the classification process. After the PCG and ECG signals are filtered separately, the signal features are extracted by BiLSTM, and then input into the fully-connected layer and the Softmax layer to realize the classification of the signal. In order to verify the superiority of ECG and PCG signal classification, in addition to inputting PCG and ECG signals into the BiLSTM network model to obtain their respective classification results, the paper also splices and fuses the PCG and ECG signals into the BiLSTM network model for classification. Since the classification effects of ECG and PCG signals are better than that of single-modal cardiac function signals using the PCG or ECG signal, this paper mainly focuses on the classification results of ECG and PCG signals using the BiLSTM model for further processing.
(2)
GoogLeNet model.
The GoogLeNet model consists of several inception modules. The GoogLeNet model structure selected in this paper consists of nine layers of inception modules, namely inception-3a, inception-3b, inception-4a, inception-4b, inception-4c, inception-4d, inception-4e, inception-5a and inception-5b. Each inception module includes three convolutional kernels of 1 × 1, 3 × 3, and 5 × 5. When the image data are input into the network, 7 × 7 convolution is performed first, then the ReLU layer, pooling layer, regularization, and 3 × 3 convolution. After the ReLU layer, pooling layer, and regularization, the data are input into each inception module in turn. The filtered PCG and ECG signals are subjected to continuous wavelet transform to obtain their time-frequency diagrams. The sizes of the diagrams are transformed to be suitable for the input into the GoogLeNet network. The network is used to classify the PCG and ECG signals, respectively.
(3)
BiLSTM-GoogLeNet-DS model.
The BiLSTM-GoogLeNet-DS model first splices and fuses ECG and PCG signals into the BiLSTM network to obtain the fused classification results. Then the ECG and PCG signals are, respectively, input into the GoogLeNet network for classification. Based on the above classification results, a total of three channel classification results are obtained. The improved D-S theory is used to fuse the above classification results. Then the classification results are finally determined according to the discriminant criterion.

5.5. Experimental Results and Analysis

In order to verify the superiority of the proposed algorithm, the above network models are compared with the BiLSTM-GoogLeNet-DS model, mainly from the evaluation indicators.
This paper improves the last four layers of the GoogLeNet network, and mainly updates the random dropout layer, fully-connected layer, Softmax layer, and classification layer. Using the improved D-S theory, the BPA function is weighted by the method based on confusion matrix. The following are some experimental results. Figure 12 shows the confusion matrix obtained by different models. The confusion matrix is mainly used to weight the BPA function. It solves the problem that the BPA function of the D-S theory is difficult to determine in decision fusion.
Figure 12. Confusion matrix obtained by different models: (a) ECG-PCG-BiLSTM; (b) ECG-GoogLeNet; (c) PCG-GoogLeNet; (d) BiLSTM-GoogLeNet-DS.
Interventionary studies involving animals or humans, and other studies that require ethical approval, must list the authority that provides approval and the corresponding ethical approval code.
In order to show more intuitively and clearly the superiority of the proposed algorithm, the BiLSTM-GoogLeNet-DS model is compared with the BiLSTM model and the GoogLeNet model, respectively. The comparison results are shown in Table 3. It can be seen from Table 3 that the classification method proposed in this paper, based on BiLSTM-GoogLeNet-DS, has the best classification effect. Although the classification method based on PCG-BiLSTM obtained 100% sensitivity, 100% precision, and a 100% F1 score, the specificity is only 0%. This means that the classification method cannot identify healthy samples. Although the classification method based on PCG-GoogLeNet obtained 100% sensitivity, the specificity is only 18.29%. This means that the classification effect of this method for health samples is also very unsatisfactory.
Table 3. Comparison of several algorithm models.
In order to express the comparison results in the table more clearly, this paper uses the following histograms for graphical displays. It can be clearly seen from Figure 13 that the classification accuracy of the algorithm proposed in this paper (based on the BiLSTM-GoogLeNet-DS model) is higher than that of other classification models. In Figure 14, the sensitivity, specificity, precision, and F1 score are compared. The best classification results are obtained by using the BiLSTM-GoogLeNet-DS model. The classification accuracy, sensitivity, specificity, precision, and F1 score are 96.13%, 98.48%, 90.8%, 96.04%, and 97.24%, respectively. From a comprehensive point of view, the algorithm proposed is better than other algorithm models.
Figure 13. The comparison of accuracy.
Figure 14. The comparison of sensitivity, specificity, precision, F1 score: (a) the comparison of sensitivity; (b) the comparison of specificity; (c) the comparison of precision; (d) the comparison of F1 score.
This paper also compares the methods from other literature studies. The comparison mainly focuses on the methods based on deep learning and the classification signals. Moreover, the current research studies on ECG and PCG signals mainly focus on the database provided in the PhysioNet/CINC challenge 2016 heart sound database, so the comparative classification effect is more convincing. The comparison results are shown in Table 4. The feature form is very similar, mainly focus on time-domain, frequency-domain, and time-frequency domain. The classification process is performed by 1D or 2D convolutional layers, and so on. The researched signal is single-modal or ECG and PCG signals. From the results, the classification effect of the method proposed in this paper is better than the methods in the literature, which shows that the algorithm proposed in this paper has certain advantages. The results also show that the classification method based on ECG and PCG signals has a stronger discriminative ability than those based on single-modal signals. Among the methods based on ECG and PCG signals, the classification effect of the method proposed in this paper is the best. Although the classification effect of the proposed method is close to that of Li Han’s [5] method, it also shows that the method proposed in this paper has slight advantages.
Table 4. Comparison between this algorithm and other algorithms in the literature.

6. Conclusions

In this paper, an ECG–PCG signal classification method based on BiLSTM-GoogLeNet-DS is proposed. The method filters the obtained ECG and PCG signals, and uses the BiLSTM network to fuse and classify the two cardiac function signals. Then, the time-frequency processing of the filtered signals is performed. The time-frequency diagrams of signals are input into the GoogLeNet network for classification. Combining the above classification results provides a discriminative decision using the improved D-S theory to obtain the final classification results.
The following conclusions can be drawn from the experiments:
(1)
The proposed ECG and PCG signal classification method based on the BiLSTM- GoogLeNet-DS model is superior to the classification methods of the BiLSTM model and the GoogLeNet model. The classification method of ECG and PCG signals is better than the classification methods of single-modal signals.
(2)
This paper uses the deep learning-oriented hybrid fusion method. After the feature layer fusion of signals, the improved D-S theory is used to fuse the decision layer. The better classification results are obtained.
(3)
The BiLSTM model is used for feature layer fusion, and the obtained classification results are better than those of single-modal signals. This method can extract the features of long-distance-dependent information according to the BiLSTM network, which can improve the training time and learn the features better. It can produce higher classification accuracy for long-span time series data.
(4)
The classification effect obtained by using the GoogLeNet model to classify cardiac function signals is better than the classification method based on the BiLSTM model, which highlights the advantages of GoogLeNet in classification and recognition.
(5)
The classification results of GoogLeNet and BiLSTM are fused by the improved D-S theory for the decision-making layer, which mainly calculates the BPA function for the classification result of each channel. Using the local recognition credibility information generated by the confusion matrix to weigh the BPA function can improve the final classification result.
(6)
The best classification results are obtained by applying the ECG and PCG signal classification method based on BiLSTM-GoogLeNet-DS with 96.13% classification accuracy, 98.48% sensitivity, 90.8% specificity, 96.04% precision, and a 97.24% F1 score.
It can be concluded from the experimental results that the information fusion of ECG and PCG signals improves the performance indicators of classification and improves the classification effect. How to fuse more modal cardiac function signals to be used for disease diagnosis and treatment will be the next research direction.

Author Contributions

Conceptualization, J.L. and L.K.; methodology, J.L.; software, J.L.; validation, Q.D. and X.D.; formal analysis, X.C.; investigation, J.L.; resources, Q.D.; data curation, J.L.; writing—original draft preparation, J.L.; writing—review and editing, L.K.; visualization, X.D.; supervision, X.C.; project administration, J.L.; funding acquisition, L.K. and J.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Nature Science Foundation of China, grant number 52077143; Department of Education of Liaoning Province, grant number LZGD2020002, LJKZ0131; The Fundamental Research Funds in Heilongjiang Provincial Universities of China, grant number 135409424 and Heilongjiang Science Foundation Project, grant number ZD2019F004.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The database is supported by the PhysioNet/CinC Challenge 2016.

Acknowledgments

We acknowledge the database support from the PhysioNet/CinC Challenge 2016.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Mourad, N. ECG denoising algorithm based on group sparsity and singular spectrum analysis. Biomed. Signal Process. Control 2019, 50, 62–71. [Google Scholar] [CrossRef]
  2. Rodion, S.; Sergey, P.; Peter, F.; Dumbler, A. Beat-to-beat cardiovascular hemodynamic parameters based on wavelet spectrogram of impedance data. Biomed. Signal Process. Control. 2017, 36, 50–56. [Google Scholar]
  3. Nedoma, J.; Fajkus, M.; Martinek, R.; Kepak, S.; Cubik, J.; Zabka, S.; Vasinek, V. Comparison of BCG, PCG and ECG Signals in Application of Heart Rate Monitoring of the Human Body. In Proceedings of the 2017 40th International Conference on Telecommunications and Signal Processing (TSP), Barcelona, Spain, 5–7 July 2017; pp. 420–424. [Google Scholar]
  4. Monica, M.; Pedro, G.; Cristina, O.; Coimbra, M.; da Silva, H.P. Design and Evaluation of a Diaphragm for Electrocardiography in Electronic Stethoscopes. IEEE Trans. Biomed. Eng. 2020, 67, 391–398. [Google Scholar]
  5. Li, H.; Wang, X.; Liu, C.; Wang, Y.; Li, P.; Tang, H.; Yao, L.; Zhang, H. Dual-Input Neural Network Integrating Feature Extraction and Deep Learning for Coronary Artery Disease Detection Using Electrocardiogram and Phonocardiogram. IEEE Access 2019, 7, 146457–146469. [Google Scholar] [CrossRef]
  6. Abdelakram, H.; Sara, B.; Malika, K.; Attari, M.; Seoane, F. Simultaneous Recording of ICG and ECG Using Z-RPI Device with Minimum Number of Electrodes. J. Sens. 2018, 2018, 3269534. [Google Scholar]
  7. Fatima, C.; Abdelilah, J.; Chafik, N.; Hammouch, A. Recognition of cardiac abnormalities from synchronized ECG and PCG Signals. Phys. Eng. Sci. Med. 2020, 43, 673–677. [Google Scholar]
  8. Zhang, H.; Wang, X.; Liu, C.; Liu, Y.; Li, P.; Yao, L.; Li, H.; Wang, J.; Jiao, Y. Detection of coronary artery disease using multi-modal feature fusion and hybrid feature selection. Physiol. Meas. 2020, 41, 115007. [Google Scholar] [CrossRef]
  9. Heart Sound Database Web Site. Available online: https://www.physionet.org (accessed on 4 March 2016).
  10. Cheng, J.; Zou, Q.; Zhao, Y. ECG signal classification based on deep CNN and BiLSTM. BMC Med. Inform. Decis. Mak. 2021, 21, 365. [Google Scholar] [CrossRef]
  11. Wang, L.; Zhou, X. Detection of Congestive Heart Failure Based on LSTM-Based Deep Network via Short-Term RR Intervals. Sensors 2019, 19, 1502. [Google Scholar] [CrossRef]
  12. Xu, X.; Jeong, S.; Li, J. Interpretation of Electrocardiogram (ECG) Rhythm by Combined CNN and BiLSTM. IEEE Access 2020, 8, 125380–125388. [Google Scholar] [CrossRef]
  13. Singh, R.; Rajpal, N.; Mehta, R. An Empiric Analysis of Wavelet-Based Feature Extraction on Deep Learning and Machine Learning Algorithms for Arrhythmia Classification. Int. J. Interact. Multimedia Artif. Intell. 2021, 6, 25. [Google Scholar] [CrossRef]
  14. Alkhodari, M.; Fraiwan, L. Convolutional and recurrent neural networks for the detection of valvular heart diseases in phonocardiogram recordings. Comput. Methods Programs Biomed. 2021, 200, 105940. [Google Scholar] [CrossRef] [PubMed]
  15. AlDuwaile, D.; Islam, S. Using Convolutional Neural Network and a Single Heartbeat for ECG Biometric Recognition. Entropy 2021, 23, 733. [Google Scholar] [CrossRef]
  16. Alquran, H.; Alqudah, A.M.; Abu-Qasmieh, I.; Al-Badarneh, A.; Almashaqbeh, S. ECG classification using higher order spectral estimation and deep learning techniques. Neural Network World 2019, 4, 207–219. [Google Scholar] [CrossRef]
  17. Byeon, Y.-H.; Pan, S.-B.; Kwak, K.-C. Intelligent Deep Models Based on Scalograms of Electrocardiogram Signals for Biometrics. Sensors 2019, 19, 935. [Google Scholar] [CrossRef]
  18. Kim, J.-H.; Seo, S.-Y.; Song, C.-G.; Kim, K.-S. Assessment of Electrocardiogram Rhythms by GoogLeNet Deep Neural Network Architecture. J. Healthc. Eng. 2019, 2019, 2826901. [Google Scholar] [CrossRef]
  19. Dempster, A.P. Upper and lower probabilities induced by a multivalued mapping. Ann. Math. Stat. 1967, 38, 325–339. [Google Scholar] [CrossRef]
  20. Shafer, G.A. Mathematical Theory of Evidence; Princeton University Press: Princeton, NJ, USA, 1976. [Google Scholar]
  21. Chen, N.; Sun, F.; Ding, L.; Wang, H. An adaptive PNN-DS approach to classification using multi-sensor information fusion. Neural Comput. Appl. 2008, 31, 693–705. [Google Scholar] [CrossRef]
  22. Chen, L.; Feng, Y.; Maram, M.A.; Wang, Y.; Wu, M.; Hirota, K.; Pedrycz, W. Multi-SVM based Dempster-Shafer theory for gesture intention understanding using sparse coding feature. Appl. Soft Comput. J. 2019, 85, 105787. [Google Scholar] [CrossRef]
  23. Rajib, G.; Pradeep, K.; Pratim, P. A Dempster -Shafer theory based classifier combination for online Signature recognition andverification systems. Int. J. Mach. Learn. Cybern. 2019, 10, 2467–2482. [Google Scholar]
  24. Liu, Y.; Cheng, Y.; Zhang, Z.; Wu, J. Multi-information Fusion Fault Diagnosis Based on KNN and Improved Evidence Theory. J. Vib. Eng. Technol. 2021, 10, 841–852. [Google Scholar] [CrossRef]
  25. Xiaotao, C.; Lixin, J.; Ying, Y.; Ruiyang, H. Method of Network Representation Fusion Based on D-S Evidence Theory. Tien Tzu Hsueh Pao/Acta Electron. Sin. 2020, 48, 854–860. [Google Scholar]
  26. Liu, C.; Springer, D.; Li, Q.; Moody, B.; Juan, R.A.; Chorro, F.J.; Castells, F.; Roig, J.M.; Silva, I.; Johnson, A.E. An open access database for the evaluation of heart sound algorithms. Physiol. Meas. 2016, 37, 2181–2213. [Google Scholar] [CrossRef] [PubMed]
  27. Hyung, L.J.; Young, K.S.; Chun, O.P.; Gi, K.K.; Jin, S.D. Heart Sound Classification Using Multi Modal Data Representation and Deep Learning. J. Med. Imaging Health Inform. 2020, 10, 537–543. [Google Scholar]
  28. Li, J.; Ke, L.; Du, Q.; Chen, X.; Xiaodi, D. ECG and PCG signals classification algorithm based on improved D-S evidence theory. Biomed. Signal Process. Control. 2021, 71, 103078. [Google Scholar] [CrossRef]
  29. Tschannen, M.; Kramer, T.; Marti, G.; Heinzmann, M.; Wiatowski, T. Heart sound classification using deep structured features. In Proceedings of the Computing in Cardiology Conference (CinC), Vancouver, BC, Canada, 11–14 September 2016; pp. 565–568. [Google Scholar]
  30. Rubin, J.; Abreu, R.; Ganguli, A.; Nelaturi, S.; Matei, I.; Sricharan, K. Classifying heart sound recordings using deep convolutional neural networks and mel-frequency cepstral coefficients. In Proceedings of the Computing in Cardiology Conference (CinC), Vancouver, BC, Canada, 11–14 September 2016; pp. 813–816. [Google Scholar]
  31. Noman, F.; Ting, C.-M.; Salleh, S.-H.; Ombao, H. Short-segment heart sound classification using an ensemble of deep convolutional neural networks. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, 12–17 May 2019; pp. 1318–1322. [Google Scholar]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.