Next Article in Journal
Optimizing Route Planning via the Weighted Sum Method and Multi-Criteria Decision-Making
Previous Article in Journal
Efficient AI-Driven Query Optimization in Large-Scale Databases: A Reinforcement Learning and Graph-Based Approach
Previous Article in Special Issue
A Novel Approach to Evaluating Industrial Spatial Structure Upgrading: Evidence from 284 Cities and 96 Sub-Industries in China (1978–2022)
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

A Hybrid Algorithm with a Data Augmentation Method to Enhance the Performance of the Zero-Inflated Bernoulli Model

1
Department of Management Sciences, Tamkang University, Tamsui District, New Taipei City 251301, Taiwan
2
Department of Statistics, Tamkang University, Tamsui District, New Taipei City 251301, Taiwan
3
Department of Mathematical Sciences, University of South Dakota, Vermillion, SD 57069, USA
*
Author to whom correspondence should be addressed.
Mathematics 2025, 13(11), 1702; https://doi.org/10.3390/math13111702
Submission received: 11 April 2025 / Revised: 11 May 2025 / Accepted: 21 May 2025 / Published: 22 May 2025

Abstract

The zero-inflated Bernoulli model, enhanced with elastic net regularization, effectively handles binary classification for zero-inflated datasets. This zero-inflated structure significantly contributes to data imbalance. To improve the ZIBer model’s ability to accurately identify minority classes, we explore the use of momentum and Nesterov’s gradient descent methods, particle swarm optimization, and a novel hybrid algorithm combining particle swarm optimization with Nesterov’s accelerated gradient techniques. Additionally, the synthesized minority oversampling technique is employed for data augmentation and training the model. Extensive simulations using holdout cross-validation reveal that the proposed hybrid algorithm with data augmentation excels in identifying true positive cases. Conversely, the hybrid algorithm without data augmentation is preferable when aiming for a balance between the metrics of recall and precision. Two case studies about diabetes and biopsy are provided to demonstrate the model’s effectiveness, with performance assessed through K-fold cross-validation.
Keywords: data augmentation; gradient descent method; Monte Carlo simulation; particle swarm optimization; SMOTE data augmentation; gradient descent method; Monte Carlo simulation; particle swarm optimization; SMOTE

Share and Cite

MDPI and ACS Style

Su, C.-J.; Chen, I.-F.; Tsai, T.-R.; Lio, Y. A Hybrid Algorithm with a Data Augmentation Method to Enhance the Performance of the Zero-Inflated Bernoulli Model. Mathematics 2025, 13, 1702. https://doi.org/10.3390/math13111702

AMA Style

Su C-J, Chen I-F, Tsai T-R, Lio Y. A Hybrid Algorithm with a Data Augmentation Method to Enhance the Performance of the Zero-Inflated Bernoulli Model. Mathematics. 2025; 13(11):1702. https://doi.org/10.3390/math13111702

Chicago/Turabian Style

Su, Chih-Jen, I-Fei Chen, Tzong-Ru Tsai, and Yuhlong Lio. 2025. "A Hybrid Algorithm with a Data Augmentation Method to Enhance the Performance of the Zero-Inflated Bernoulli Model" Mathematics 13, no. 11: 1702. https://doi.org/10.3390/math13111702

APA Style

Su, C.-J., Chen, I.-F., Tsai, T.-R., & Lio, Y. (2025). A Hybrid Algorithm with a Data Augmentation Method to Enhance the Performance of the Zero-Inflated Bernoulli Model. Mathematics, 13(11), 1702. https://doi.org/10.3390/math13111702

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop