Comparative Analysis of Feature Extraction Methods for ECG Arrhythmia Classification Using Ensemble Learning
Abstract
1. Introduction
2. Materials and Methods
2.1. Dataset and Experimental Design
2.2. Signal Preprocessing Pipeline
2.3. Feature Extraction Methods
2.3.1. Discrete Wavelet Transform (DWT)
Aj[k] = Σn x[n] · gj[2k − n]
2.3.2. Mel-Frequency Cepstral Coefficients (MFCC)
2.3.3. Hilbert–Huang Transform (HHT)
2.3.4. Synchrosqueezing Wavelet Transform (SSWT)
2.4. Classification Architecture
2.5. Class Balancing and Data Partitioning
2.6. Evaluation Framework
2.7. Implementation
3. Results
3.1. Overall Performance Comparison
3.2. Class-Specific Performance Analysis
3.3. Computational Efficiency Analysis
4. Discussion
4.1. DWT Superiority: Signal Processing Foundations
4.2. MFCC Performance: Unexpected Competitiveness
4.3. SSWT and HHT Limitations: Theory-Practice Gaps
4.4. Clinical Implications and Deployment Considerations
4.5. Interpretability Advantages and Limitations
4.6. Study Limitations
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| ECG | Electrocardiogram |
| P-wave | ECG waveform component (atrial depolarization) |
| QRS complex | ECG waveform component (ventricular depolarization) |
| T-wave | ECG waveform component (ventricular repolarization) |
| RR interval | Time between consecutive R-peaks |
| MLII | Modified Limb Lead II |
| N | Normal beat |
| S | Supraventricular Ectopic Beat (SVEB) |
| V | Ventricular Ectopic Beat (VEB) |
| F | Fusion Beat |
| Q | Unknown/Quiet Beat |
| MFCC | Mel-Frequency Cepstral Coefficients |
| DWT | Discrete Wavelet Transform |
| CWT | Continuous Wavelet Transform |
| SSWT | Synchrosqueezing Wavelet Transform |
| HHT | Hilbert–Huang Transform |
| STFT | Short-Time Fourier Transform |
| FFT/Fourier | Fast Fourier Transform |
| DCT | Discrete Cosine Transform |
| EMD | Empirical Mode Decomposition |
| IMF | Intrinsic Mode Function |
| Wavelets | Mathematical functions for time–frequency signal analysis |
| db4 | Daubechies 4 wavelet |
| db6 | Daubechies 6 wavelet |
| CF | Cascade Forest |
| gcForest | Grains Cascade Forest |
| SMOTE | Synthetic Minority Over-sampling Technique |
| SVM | Support Vector Machine |
| CNN | Convolutional Neural Network |
| k-NN | k-Nearest Neighbours |
| F1 | F1 score (harmonic mean of precision and recall) |
| AI | Artificial Intelligence |
| ANSI/AAMI | American National Standards Institute/Association for the Advancement of Medical Instrumentation |
| MIT-BIH | Massachusetts Institute of Technology—Beth Israel Hospital (Arrhythmia Database) |
| PTB-XL | Physikalisch-Technische Bundesanstalt ECG Database |
| CPSC | China Physiological Signal Challenge |
| RQ | Research Question |
| Hz | Hertz (unit of frequency) |
| ms | Milliseconds (unit of time) |
References
- Martin, S.S.; Aday, A.W.; Almarzooq, Z.I.; Anderson, C.A.M.; Arora, P.; Avery, C.L.; Baker-Smith, C.M.; Barone Gibbs, B.; Beaton, A.Z.; Boehme, A.K.; et al. 2024 Heart Disease and Stroke Statistics: A Report of US and Global Data From the American Heart Association. Circulation 2024, 149, E347–E913. [Google Scholar] [CrossRef]
- Kazemi Lichaee, F.; Salari, A.; Jalili, J.; Beikmohammad Dalivand, S.; Roshanfekr Rad, M.; Mojarad, M. Advancements in Artificial Intelligence for ECG Signal Analysis and Arrhythmia Detection: A Review. Int. J. Cardiovasc. Pract. 2024, 8, e143437. [Google Scholar] [CrossRef]
- Liu, J.; Li, Z.; Jin, Y.; Liu, Y.; Liu, C.; Zhao, L.; Chen, X. A Review of Arrhythmia Detection Based on Electrocardiogram with Artificial Intelligence. Expert Rev. Med. Devices 2022, 19, 549–560. [Google Scholar] [CrossRef]
- He, X.; Hu, C.; Ma, K.; Huang, J.; He, H. ECG-Based Arrhythmia Classification Using Discrete Wavelet Transform and Attention-Enhanced CNN-BiGRU Model. Phys. Eng. Sci. Med. 2025, 48, 1995–2009. [Google Scholar] [CrossRef]
- Ye, Y.; Chipusu, K.; Ashraf, M.A.; Ding, B.; Huang, Y.; Huang, J. Hybrid CNN-BLSTM Architecture for Classification and Detection of Arrhythmia in ECG Signals. Sci. Rep. 2025, 15, 34510. [Google Scholar] [CrossRef]
- Petch, J.; Di, S.; Nelson, W. Opening the Black Box: The Promise and Limitations of Explainable Machine Learning in Cardiology. Can. J. Cardiol. 2022, 38, 204–213. [Google Scholar] [CrossRef]
- Kovalchuk, O.; Barmak, O.; Radiuk, P.; Klymenko, L.; Krak, I. Towards Transparent AI in Medicine: ECG-Based Arrhythmia Detection with Explainable Deep Learning. Technologies 2025, 13, 34. [Google Scholar] [CrossRef]
- Singh, A.K.; Krishnan, S. ECG Signal Feature Extraction Trends in Methods and Applications. Biomed. Eng. OnLine 2023, 22, 22. [Google Scholar] [CrossRef]
- Pradhan, B.K.; Neelappu, B.C.; Sivaraman, J.; Kim, D.; Pal, K. A Review on the Applications of Time-Frequency Methods in ECG Analysis. J. Healthc. Eng. 2023, 2023, 3145483. [Google Scholar] [CrossRef]
- Lin, M.; Hong, Y.; Hong, S.; Zhang, S. Discrete Wavelet Transform Based ECG Classification Using gcForest: A Deep Ensemble Method. Technol. Health Care 2024, 32, 95–105. [Google Scholar] [CrossRef]
- Marchi, E.; Cervetto, M.; Galarza, C. Adaptive Synchrosqueezing Wavelet Transform for Real-Time Applications. Digit. Signal Process. 2023, 140, 104133. [Google Scholar] [CrossRef]
- Herry, C.L.; Frasch, M.; Seely, A.J.; Wu, H. Heart Beat Classification from Single-Lead ECG Using the Synchrosqueezing Transform. Physiol. Meas. 2017, 38, 171–187. [Google Scholar] [CrossRef]
- Huang, N.E.; Shen, Z.; Long, S.R.; Wu, M.C.; Shih, H.H.; Zheng, Q.; Yen, N.-C.; Tung, C.C.; Liu, H.H. The Empirical Mode Decomposition and the Hilbert Spectrum for Nonlinear and Non-Stationary Time Series Analysis. Proc. R. Soc. Lond. Ser. Math. Phys. Eng. Sci. 1998, 454, 903–995. [Google Scholar] [CrossRef]
- Wang, C.-L.; Wei, C.-C.; Tsai, C.-T.; Lee, Y.-H.; Liu, L.Y.-M.; Chen, K.-Y.; Lin, Y.-J.; Lin, P.-L. Early Detection of Myocardial Ischemia in Resting ECG: Analysis by HHT. Biomed. Eng. OnLine 2023, 22, 23. [Google Scholar] [CrossRef]
- Alodia Yusuf, S.A.; Hidayat, R. MFCC Feature Extraction and KNN Classification in ECG Signals. In Proceedings of the 2019 6th International Conference on Information Technology, Computer and Electrical Engineering (ICITACEE), Semarang, Indonesia, 26–27 September 2019; IEEE: New York, NY, USA, 2019; pp. 1–5. [Google Scholar]
- Zhang, D.; Yuan, X.; Zhang, P. Interpretable Deep Learning for Automatic Diagnosis of 12-Lead Electrocardiogram. iScience 2020, 24, 102373. [Google Scholar] [CrossRef]
- Song, M.-S.; Lee, S.-B. Comparative Study of Time-Frequency Transformation Methods for ECG Signal Classification. Front. Signal Process. 2024, 4, 1322334. [Google Scholar] [CrossRef]
- Moody, G.B.; Mark, R.G. The Impact of the MIT-BIH Arrhythmia Database. IEEE Eng. Med. Biol. Mag. 2001, 20, 45–50. [Google Scholar] [CrossRef]
- Luz, E.J.D.S.; Schwartz, W.R.; Cámara-Chávez, G.; Menotti, D. ECG-Based Heartbeat Classification for Arrhythmia Detection: A Survey. Comput. Methods Programs Biomed. 2016, 127, 144–164. [Google Scholar] [CrossRef]
- Association for the Advancement of Medical Instrumentation. ANSI/AAMI EC57:1998; Testing and Reporting Performance Results of Cardiac Rhythm and ST Segment Measurement Algorithms; Association for the Advancement of Medical Instrumentation: Arlington, VA, USA, 1998. [Google Scholar]
- Shah, A.; Singh, D.; Mohamed, H.G.; Bharany, S.; Rehman, A.U.; Hussen, S. Electrocardiogram Analysis for Cardiac Arrhythmia Classification and Prediction through Self Attention Based Auto Encoder. Sci. Rep. 2025, 15, 9230. [Google Scholar] [CrossRef]
- Zhou, Z.H.; Feng, J. Deep Forest: Towards An Alternative to Deep Neural Networks. arXiv 2017, arXiv:1702.08835. [Google Scholar] [CrossRef]
- Lundberg, S.M.; Erion, G.; Chen, H.; DeGrave, A.; Prutkin, J.M.; Nair, B.; Katz, R.; Himmelfarb, J.; Bansal, N.; Lee, S.-I. From Local Explanations to Global Understanding with Explainable AI for Trees. Nat. Mach. Intell. 2020, 2, 56–67. [Google Scholar] [CrossRef]





| a. Representative single-method ECG arrhythmia classification studies (without direct feature comparison) | |||||
| Study | Feature Method | Classifier | Dataset | Reported Performance | Key Limitation |
| [10] | DWT | gcForest | MIT-BIH | Acc: 98.55%, F1: 98.46% | No alternative feature methods evaluated |
| [15] | MFCC | k-NN | MIT-BIH | Acc: 93.50% | No comparison with time–frequency methods |
| [14] | HHT (EMD + Hilbert) | SVM | MIT-BIH | Sensitivity: 94.2% (ischemia) | Focused on ischemia; no arrhythmia comparison |
| [12] | SSWT | Neural Network | MIT-BIH | Acc: 98.10% | No systematic comparison with standard wavelets |
| b. Multi-method studies with confounding factors | |||||
| Study | Feature Methods | Classifiers | Dataset | Performance Summary | Key Limitation |
| [17] | STFT, CWT, SSWT | CNN (varying architectures) | MIT-BIH, PTB-XL | Performance varied across feature–classifier pairs | Feature and classifier varied simultaneously |
| [8] | Time, Frequency, Time–Frequency | — | Multiple | — | Review only; no experiments |
| [9] | DWT, EMD, WVD, STFT | — | Multiple | — | Theoretical comparison only |
| [16] | MFCC + Raw Signal | Deep CNN | Chapman, CPSC | Acc: 95.3%, F1: 93.7% | MFCC used as auxiliary input, not primary comparison |
| Class | AAMI Category | Beat Annotations | Description |
|---|---|---|---|
| N | Normal | ‘N’, ‘L’, ‘R’, ‘e’, ‘j’ | Normal and bundle branch block beats |
| S | Supraventricular Ectopic Beat (SVEB) | ‘A’, ‘a’, ‘J’, ‘S’ | Atrial or nodal premature beats |
| V | Ventricular Ectopic Beat (VEB) | ‘V’, ‘E’ | Premature ventricular contractions |
| F | Fusion Beat | ‘F’ | Fusion of ventricular and normal beats |
| Q | Unknown Beat | ‘/’, ‘f’, ‘Q’ | Paced, unclassifiable, or artefact beats |
| Method | Model | No Features | Exec. Time (s) | Accuracy (%) | Macro-Precision (%) | Macro-Recall (%) | Macro-F1 (%) |
|---|---|---|---|---|---|---|---|
| Baseline | CF | 7 | 1737.39 | 95.92 | 78.08 | 82.57 | 79.91 |
| SSWT | CF | 71 | 9711.54 | 97.49 | 86.63 | 82.72 | 84.54 |
| MFCC | CF | 27 | 3776.84 | 98.00 | 89.99 | 86.92 | 88.30 |
| DWT | CF | 212 | 10,050.11 | 98.79 | 94.39 | 91.67 | 92.93 |
| HHT | CF | 67 | 6237.78 | 96.94 | 86.10 | 81.98 | 83.59 |
| Class | SSWT (P/R/F1) | MFCC (P/R/F1) | DWT (P/R/F1) | HHT (P/R/F1) | Baseline (P/R/F1) |
|---|---|---|---|---|---|
| N | 0.98/0.99/0.99 | 0.99/0.99/0.99 | 0.99/0.99/0.99 | 0.98/0.98/0.98 | 0.98/0.97/0.98 |
| S | 0.93/0.84/0.88 | 0.91/0.88/0.90 | 0.91/0.90/0.90 | 0.85/0.83/0.84 | 0.85/0.87/0.86 |
| V | 0.90/0.88/0.89 | 0.91/0.93/0.92 | 0.96/0.97/0.96 | 0.88/0.91/0.89 | 0.82/0.86/0.84 |
| F | 0.53/0.44/0.48 | 0.69/0.56/0.62 | 0.87/0.74/0.80 | 0.63/0.42/0.50 | 0.28/0.45/0.34 |
| Q | 0.99/0.99/0.99 | 0.99/0.98/0.98 | 0.99/0.98/0.99 | 0.96/0.96/0.96 | 0.97/0.97/0.97 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Adeleye, V.; Elbattah, M. Comparative Analysis of Feature Extraction Methods for ECG Arrhythmia Classification Using Ensemble Learning. BioMedInformatics 2026, 6, 33. https://doi.org/10.3390/biomedinformatics6030033
Adeleye V, Elbattah M. Comparative Analysis of Feature Extraction Methods for ECG Arrhythmia Classification Using Ensemble Learning. BioMedInformatics. 2026; 6(3):33. https://doi.org/10.3390/biomedinformatics6030033
Chicago/Turabian StyleAdeleye, Victor, and Mahmoud Elbattah. 2026. "Comparative Analysis of Feature Extraction Methods for ECG Arrhythmia Classification Using Ensemble Learning" BioMedInformatics 6, no. 3: 33. https://doi.org/10.3390/biomedinformatics6030033
APA StyleAdeleye, V., & Elbattah, M. (2026). Comparative Analysis of Feature Extraction Methods for ECG Arrhythmia Classification Using Ensemble Learning. BioMedInformatics, 6(3), 33. https://doi.org/10.3390/biomedinformatics6030033

