1. Introduction
Supraventricular arrhythmia (SVA) is a general term that encompasses several types of heart rhythm disorders that originate above the ventricles in the electrical conduction system of the heart. These arrhythmias include atrial fibrillation, atrial flutter, atrial tachycardia, and other less common forms. In fact, the prevalence of atrial fibrillation has increased three-fold in the last 50 years. Moreover, approximately 90,000 cases of supraventricular tachycardia are detected annually in the United States, and about 25% of all emergency department visits for supraventricular tachycardia result in hospitalization [
1,
2].
Supraventricular extrasystoles (SVEs), another type of SVA, are a common finding in persons with no pathological history specifically, with a greater proportion appearing in the elderly. They can be recognized in the electrocardiogram by the appearance of a premature narrow QRS complex, preceded or not by a P wave. This alteration originates from an electrical signal and does not produce any symptomatologic event in the person, but over time, without treatment and with greater frequency, they can trigger the weakening of the heart and subsequent heart failure [
3]. Although SVEs alone are not usually dangerous, they can be a risk marker for other types of more serious supraventricular arrhythmias [
4,
5].
Multiple experiments have consistently demonstrated that each algorithm for detecting SVEs exhibits remarkable accuracy when applied to sampling frequencies of 120 Hz and higher [
6]. In particular, the utilization of a single lead from an electrocardiogram (ECG), specifically the second lead, has proven to be exceptionally effective, obviating the need for additional leads in the monitoring process [
3,
7] Moreover, recent studies have demonstrated the potential of machine learning and deep learning algorithms in the detection and prediction of supraventricular arrhythmias, including those associated with supraventricular extrasystoles. For instance, a deep learning model was developed to address the identification of patients with unrecognized paroxysmal supraventricular tachycardia (PSVT) using sinus rhythm electrocardiograms [
8]. The model demonstrated remarkable accuracy in detecting PSVT cases that might have otherwise been overlooked, providing a valuable tool for early diagnosis and intervention. Similarly, deep learning methodologies have been employed to diagnose ECG records under both normal conditions and supraventricular arrhythmia [
9], showcasing the potential of deep learning techniques in accurately classifying different ECG patterns and distinguishing between normal and abnormal cardiac rhythms. Furthermore, innovative applications of machine learning models for the prediction of arrhythmia occurrence after acute myocardial infarction have been developed, incorporating various structural regularization techniques [
10]. These studies highlight the potential of machine learning and deep learning algorithms in enhancing the detection and prediction of supraventricular arrhythmias, thereby contributing to the development of more effective and efficient telemonitoring systems for cardiac health.
Given the aforementioned considerations, the primary objective of this study was to devise a deep learning algorithm specifically designed for the identification and segmentation of supraventricular extrasystoles in ECG signals obtained from the second lead. The intended application of this algorithm is in the realm of telemonitoring for cardiac health purposes. To ensure the availability of a robust and consistent database for ECG signal extraction, the renowned and public MIT-BIH Supraventricular Arrhythmia Database was used. MIT-BIH is available online at Physionet, comprising 78 recordings of 30 min ECG data [
11]. This database offers a comprehensive and diverse range of ECG readings, enabling the algorithm to be trained and tested on a wide variety of supraventricular arrhythmias, thereby enhancing its accuracy and reliability.
2. Materials and Methods
2.1. Data Acquisition
For cloud-based ECG signal extraction, the dataset of choice is the MIT-BIH Supraventricular Arrhythmia Database, which is publicly available on Physionet [
11], encompassing a collection of 78 half-hour ECG records sampled at 360 Hz. This widely recognized database serves as a fundamental resource for the retrieval and analysis of ECG signals in a cloud computing environment. The dataset is composed of signals with clinical annotations (SVEs, for example), giving a wide range of future applications. By utilizing this extensive dataset, researchers and practitioners can access a diverse range of ECG recordings, enabling them to conduct in-depth investigations and develop advanced algorithms for enhanced detection and characterization of supraventricular arrhythmias. The wealth of data contained within the MIT-BIH Supraventricular Arrhythmia Database ensures the accuracy and reliability of cloud-based ECG signal extraction processes. In order to load and visualize the signals, and for the following steps in the methodology, Python programming language was used.
2.2. Preprocessing
Several signal preprocessing techniques were employed to ensure proper training. Firstly, a 50 Hz notch filter was applied to remove electrical noise. Additionally, frequencies above 150 Hz were eliminated using an FIR low-pass filter. To reduce computational costs, the entire signal was downsampled by a factor of 2, resulting in a sampling frequency of 180 Hz. Finally, signal normalization was applied, scaling the signal values from 0 to 1 to standardize the inputs for the model.
From that, the preprocessing focused on the selection of significant ECG characteristic annotations associated with EVS, including premature beats or supraventricular ectopics. These annotations were employed as labels for the subsequent stages of the project. Furthermore, the ECG signals from the MIT-BIH database were segmented into a matrix format, comprising a total of 512 samples, with each sample containing 12,288 observations.
2.3. Proposed Model
The artificial intelligence algorithm was developed using the TensorFlow open-source library, specifically utilizing a convolutional neural network model (see
Figure 1) based on a UNET for 1 dimension (conv1D), taking, as a start, 64 filters, with 512 filters in last convolutional layer [
12]. We considered using that approach because, for the required task (segmentation), it could be helpful to use a model that has good results in similar tasks (images) as UNET, extrapolating it to 1 dimension. As final activation function, a sigmoid function was used. We used Adam optimizer and binary cross-entropy loss function as proposed hyperparameters. As post-processing for prediction output to enhance the prediction accuracy, a threshold-based segmentation method was employed by averaging the peaks of the ECG signals.
2.4. Evaluation Metrics
K-fold cross-validation, with k = 50, was used for training and testing of the proposed model. The average metric results will be described next. Dice coefficient, precision, and recall were used as desired metrics for evaluation.
Additionally, an interval accuracy test (called norm_accuracy, which differs from accuracy well-known metric tests) was implemented to assess the algorithm’s ability to identify true positive cases of supraventricular extrasystoles, using the norm between the difference of ground truth with predicted signal (annotations). This metric is different from the Dice coefficient because we can observe, in a more intuitive way, whether there is a good segmentation or not. The formula for this custom metric is the following:
This comprehensive approach, combining data selection, preprocessing, algorithm design, and accuracy assessment, formed the foundation of the project’s methodology. Project workflow is represented in
Figure 2.
3. Results
After the neural network was trained, an average Dice coefficient of 79.01% was achieved. To provide a visual representation, example results of the interval segmented and detected for SVEs and its proper ECG signal superposition are observed in
Figure 3 and
Figure 4, enabling the identification and visualization of sections where the R-R peak exhibited abnormal durations indicative of supraventricular extrasystoles (SVEs). The obtained results were rigorously validated against the annotations of the MIT-BIH database, confirming the neural network’s capability to accurately identify and segment supraventricular extrasystoles.
Following proper training of the neural network, an overall norm accuracy of 92.57% was obtained. Precision and recall were obtained with 80.96% and 86.56%, respectively, as shown in
Table 1. To enhance result visualization, the Matplotlib and NumPy libraries were employed to determine a variable threshold. This threshold facilitated the complete identification of sections where the R-R peak displayed abnormal durations, indicating the presence of supraventricular extrasystoles. The validity of the results was duly verified against the annotations from the MIT-BIH database, further confirming the neural network’s proficiency in detecting and segmenting supraventricular extrasystoles.
4. Discussion
The prevalence of supraventricular arrhythmias (SVAs) has been on the rise in recent years, with atrial fibrillation alone showing a three-fold increase over the past years [
2]. SVAs, including supraventricular extrasystoles (SVEs), can have significant implications for cardiac health assessment, necessitating effective early detection and monitoring strategies [
7]. In this study, we aimed to develop a deep learning algorithm specifically tailored for the identification and segmentation of SVEs in ECG signals obtained from the second lead, with the ultimate goal of facilitating telemonitoring applications for cardiac health.
Although good results were obtained in general for the detection of arrhythmias other than supraventricular extrasystoles, false positives can be a problem in certain cases, and efforts should be made to reduce this error. The use of functions that allow estimation of error (by normalization, for example) and precision to contrast the training dataset are therefore indispensable.
Currently, studies using artificial intelligence to support the diagnosis of supraventricular arrhythmias have not focused on supraventricular extrasystole. Despite not being the focus, the detection of SVE is useful as a marker of future more serious complications [
4,
5]. Therefore, our solution contrasts in methodology with the state of the art, from marker selection to the training signal and the trained model. We used only one ECG channel supported by the literature and performed a simplification of convolutional neural networks [
3]. With a one-dimensional UNET architecture, we obtained results comparable with 12-channel models and with networks that are more complex [
8,
10].
In comparison to the state of the art, the model proposed in this study demonstrates similar overall accuracy, recall, and precision to studies detecting supraventricular tachycardia through deep learning (95.50%) and machine learning (97.00%) [
8,
13]. Moreover, the Dice coefficient of 79.01% indicates that there is a high degree of overlap between the predicted and actual instances of SVEs.
The results obtained from our algorithm were highly promising. The visualization of the results was facilitated by establishing a threshold, enabling the identification of sections where the R-R peak exhibited abnormal durations indicative of SVEs. As shown in
Figure 5, the validation of our results against the annotations of the MIT-BIH database further confirmed the algorithm’s ability to accurately identify and segment SVEs.
Our findings align with recent studies that have demonstrated the potential of machine learning and deep learning algorithms in the detection and prediction of supraventricular arrhythmias [
8,
9,
10]. The utilization of deep learning methodologies, such as convolutional neural networks, has proven to be effective in enhancing the detection and characterization of these arrhythmias. By leveraging these advancements, our algorithm contributes to the development of more efficient and accurate telemonitoring systems for cardiac health.
Future research should focus on addressing this limitation and optimizing the algorithm’s performance in terms of precision. Additionally, the generalizability of our algorithm to other datasets and populations should be evaluated to assess its applicability in real-world scenarios.
In addition to its usefulness in the field of telemonitoring for cardiac health, an accurate and reliable method for the detection of supraventricular extrasystoles (SVEs) may have potential applications as a predictor of severe supraventricular arrhythmias in patients undergoing Holter monitoring. Holter monitoring, which allows continuous recording of the heart rhythm for 24 h or more, is a widely used clinical tool for assessing the presence and severity of cardiac arrhythmias. By incorporating an SVE detection algorithm into the analysis of Holter recordings, those patients at increased risk of developing more severe supraventricular arrhythmias could be identified early, allowing for more timely and personalized intervention. This predictive and early warning capability could contribute significantly to the management and treatment of supraventricular arrhythmias, improving patients’ quality of life and reducing the risk of cardiac complications.
5. Conclusions
The integration of convolutional neural networks in deep learning techniques has revolutionized the computational analysis of electrocardiogram (ECG) signals, enabling the identification and detection of previously overlooked arrhythmias, including the often-asymptomatic supraventricular extrasystole. In this research endeavor, our focus was on the implementation of a sequential convolutional neural network, along with sophisticated biological signal filters, efficient management of the MIT-BIH Arrhythmia Database, advanced processing algorithms, and an interval accuracy test to evaluate the sensitivity of SVE prediction in ECG signals. The collective findings of this study demonstrate the significant potential of our approach as a preparatory tool for future high-precision diagnostic support for SVE. By harnessing the power of deep learning and leveraging biomedical devices such as electrocardiographs for heart rate monitoring, our method can aid healthcare professionals in the early detection and accurate characterization of SVEs, facilitating timely intervention and personalized treatment strategies. This groundbreaking research opens new avenues for enhancing cardiac health monitoring and improving patient outcomes in the realm of supraventricular arrhythmias.