Previous Article in Journal
Physics-Informed Machine Learning for Mechanical Performance Prediction of ECC-Strengthened Reinforced Concrete Beams: An Empirical-Guided Framework
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Comparison of the Robust Zero-Inflated and Hurdle Models with an Application to Maternal Mortality

Department of Computational Sciences, University of Fort Hare, Alice 5700, South Africa
*
Author to whom correspondence should be addressed.
Math. Comput. Appl. 2025, 30(5), 95; https://doi.org/10.3390/mca30050095
Submission received: 29 May 2025 / Revised: 19 August 2025 / Accepted: 1 September 2025 / Published: 2 September 2025

Abstract

This study evaluates the performance of count regression models in the presence of zero inflation, outliers, and overdispersion using both simulated and real-world maternal mortality dataset. Traditional Poisson and negative binomial regression models often struggle to account for the complexities introduced by excess zeros and outliers. To address these limitations, this study compares the performance of robust zero-inflated (RZI) and robust hurdle (RH) models against conventional models using the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) to determine the best-fitting model. Results indicate that the robust zero-inflated Poisson (RZIP) model performs best overall. The simulation study considers various scenarios, including different levels of zero inflation (50%, 70%, and 80%), outlier proportions (0%, 5%, 10%, and 15%), dispersion values (1, 3, and 5), and sample sizes (50, 200, and 500). Based on AIC comparisons, the robust zero-inflated Poisson (RZIP) and robust hurdle Poisson (RHP) models demonstrate superior performance when outliers are absent or limited to 5%, particularly when dispersion is low (5). However, as outlier levels and dispersion increase, the robust zero-inflated negative binomial (RZINB) and robust hurdle negative binomial (RHNB) models outperform robust zero-inflated Poisson (RZIP) and robust hurdle Poisson (RHP) across all levels of zero inflation and sample sizes considered in the study.

1. Introduction

Excess zeros are common in numerous fields, including healthcare, agriculture, ecology, and manufacturing industries [1,2]. A Zero-Inflated (ZI) model [3] argues that zero counts arise from a combination of two distributions; one distribution generates structural zeros, representing subjects who are not at all at risk of experiencing the event (and thus consistently produce zero counts). For instance, when quantifying specific high-risk behaviors, certain individuals may record a score of zero due to their lack of susceptibility to such health-risk behaviors [4]. The second distribution generates sampling zeros, representing individuals who are at risk but did not experience or report the event during the study period. The justification for categorizing zeros into two categories is that a high proportion of zeros frequently results from the presence of a subgroup of patients who are not at risk for certain outcomes during the study period [3]. The probability of belonging to the zero inflation component is assessed using a Bernoulli model, while a standard count distribution, such as Poisson or negative binomial (NB), represents the counts in the count component [5]. Conversely, a hurdle model [6] claims that all zero observations originate from a singular structural source, comprising a binary component to determine whether the response variable is zero or positive, alongside a truncated model, such as a truncated Poisson or truncated NB distribution, for the positive data. Excess zeros indicate more zeros than the distribution we are modeling would expect. Zero-inflated and hurdle models have been used extensively in various research domains to model such data [2,7]. The traditional models, such as the standard Poisson and NB models, often fail to address the complexity of outliers, overdispersion, and excess zeros simultaneously. The understanding and useful comparison of the robust zero-inflated (RZI) and robust hurdle (RH) models remain largely unexplored, despite substantial advances in statistical modeling methods for count data [1,2]. Although both models address the frequent excess zeros seen in the count data, their fundamental tenets and estimating strategies are different.
Maternal mortality data often exhibit zero inflation, characterized by an excess of zero counts or units reporting no maternal deaths, compared to what standard count models such as Poisson or NB would predict [8,9]. Identifying factors that contribute to maternal mortality and formulating effective interventions are essential to minimize the increase in maternal mortality. In this context, there is a need to employ statistical models that can adequately handle the excessive zeros, overdispersion, and potential outliers that are often observed in maternal mortality [10]. Previous studies that have looked at factors affecting maternal death have used different analysis strategies, with some ignoring the underlying zero inflation [11,12]. There has been considerable interest in the literature in comparing zero-inflated and hurdle models for analyzing count data with excess zeros and outliers [2,10,13]. However, the robustness to outliers that these comparisons frequently neglect can have significant effects on the accuracy and reliability of the models’ results. Robust zero-inflated models and robust hurdle models have been proposed to handle outliers in addition to zero inflation. There is a lack of comprehensive studies that directly examine the performance of robust hurdle and zero-inflated models in the presence of outliers. In addition, there has been little research on the application of these models to maternal mortality data [1,2,12], an important public health indicator. Using simulated data and data on maternal mortality, this study attempts to close these gaps by comparing the robust zero-inflated and hurdle models in the presence of outliers. Through this investigation, the study hopes to shed light on how well these models handle excessive zeros and outliers in count data, with an emphasis on enhancing modeling accuracy and forecasting rates of maternal death.

2. Methods

2.1. Overview of Count Data Models

This section outlines the statistical frameworks employed to model count data, addressing key challenges such as zero inflation, overdispersion, and outliers. Introduces robust extensions of conventional models, specifically robust zero-inflated and hurdle models, which are designed to improve estimation accuracy in the presence of deviations from model assumptions and outliers. These robust models provide a more reliable analytical framework for real-world datasets, particularly in maternal mortality research, where data irregularities are common.

2.2. Classical Zero-Inflated and Hurdle Models

2.2.1. Standard Zero-Inflated (ZI) Models

In the ZI model introduced by [3], zero outcomes can originate from two distinct sources. We have structural zeros, which occur due to an inherent process, and sampling zeros, which arise from a standard count distribution such as Poisson or NB. The general form of a ZI model is as follows:
P ( Y i = y i ) = π i + ( 1 π i ) p ( y i = 0 ; μ i ) if y i = 0 , ( 1 π i ) p ( y i ; μ i ) if y i > 0 ,
which combines a degenerate distribution at zero and an untruncated count distribution parameterized by μ i . When the count component follows a Poisson distribution, the model becomes a Zero-Inflated Poisson (ZIP) model:
P ( Y i = y i ) = π i + ( 1 π i ) e μ i if y i = 0 , ( 1 π i ) e μ i μ i y i y i ! if y i > 0 ,
here, μ i is the Poisson mean and π i is the probability of structural zeros. The mean and variance of the ZIP model are then given by the following:
E ( Y i ) = E [ E ( y i | z i ) ] = ( 1 π i ) μ i ,
Var ( Y i ) = E [ V a r ( y i | z i ) ] + V a r [ E ( y i | z i ) ] = ( 1 π i ) μ i ( 1 + π i μ i ) .
To model the count part of a zero inflation, Poisson regression assumes that the mean and variance are equal. However, real-world data often show overdispersion, where the variance exceeds the mean. This may result from unobserved heterogeneity, omitted variables, clustering, or outliers [14,15]. In such cases, the NB model is more appropriate. The ZINB model, which accounts for both excess zeros and overdispersion, is specified as
P ( Y i = y i ) = π i + ( 1 π i ) r μ i + r r if y i = 0 , ( 1 π i ) Γ ( y i + r ) Γ ( r ) y i ! μ i μ i + r y i r μ i + r r if y i > 0 ,
where μ i is the NB mean, π i is the structural zero probability, r is the dispersion parameter, and Γ is the gamma function. The mean and variance are as follows:
E ( Y i ) = ( 1 π i ) μ i ,
Var ( Y i ) = ( 1 π i ) μ i 1 + μ i r + π i μ i .
As r , the ZINB model converges to the ZIP model. Smaller values of r indicate higher overdispersion. In practical applications, both μ i and π i are modeled using explanatory variables. The ZI models can thus be parameterized as
log ( μ i ) = x i β , logit ( π i ) = z i α ,
where x i and z i are covariate vectors for the count and zero components, respectively, and α and β are their corresponding coefficients. The covariates for μ i and π i need not be the same.

2.2.2. Standard Hurdle Models

As an alternative to ZI models, hurdle models [6] are formulated as two component mixture models. They consist of a component that models excess zeros and another that models positive counts using a truncated count distribution, such as the truncated Poisson or truncated NB distribution. Let Y i denote the count response for the i-th observation, where i = 1 , , n , and n is the total number of observations. The general form of a hurdle model is expressed as
P ( Y i = y i ) = π i if y i = 0 , ( 1 π i ) p ( y i ; μ i ) 1 p ( 0 ; μ i ) if y i > 0 ,
where π i is the probability of observing a structural zero, and p ( y i ; μ i ) is the probability mass function (PMF) of a standard count distribution parameterized by vector μ i . The denominator, 1 p ( 0 ; μ i ) , normalizes the distribution to reflect that only positive counts are considered. If the count component follows a Poisson distribution, the corresponding hurdle Poisson model is
P ( Y i = y i ) = π i if y i = 0 , ( 1 π i ) e μ i μ i y i / y i ! 1 e μ i if y i > 0 .
where the π i denotes the probability of a zero count, μ i is the mean of the underlying Poisson distribution. The mean and variance of the HP are then given by
E ( Y ) = ( 1 π ) · μ 1 e μ
Var ( Y ) = ( 1 π ) μ ( 1 + μ e μ ) ( 1 e μ ) 2 + π μ 1 e μ 2 ,
to handle overdispersion in count data, the positive count component can alternatively follow an NB distribution. The HNB model is then given by
P ( Y i = y i ) = π i if y i = 0 , 1 π i 1 r μ i + r r · Γ ( y i + r ) Γ ( r ) y i ! μ i μ i + r y i r μ i + r r if y i > 0 ,
where r is the dispersion parameter and μ i is the mean of the NB distribution, is the Γ is the gamma function, and π i is the structural zero probability. The mean and variance of the HNB model are given by
E ( Y ) = ( 1 π i ) · μ 1 r μ + r r ,
Var ( Y i ) = ( 1 π i ) · μ i + μ i 2 r 1 r r + μ i r + π i ( 1 π i ) μ i 1 r r + μ i r 2 .
As with ZI models, covariates can be incorporated into both the zero and count components. The hurdle model can thus be parameterized as
log ( μ i ) = x i β , logit ( π i ) = z i α ,
where α and β are vectors of regression coefficients for the covariates x i and z i , respectively.

2.3. Robust Extensions of Zero-Inflated and Hurdle Models

2.3.1. Robust Zero-Inflated Models

Although ZIP and ZINB models handle excess zeros, they can be sensitive to outliers. The RZIP and RZINB models improve the standard ZI models by incorporating robust estimation techniques. The robust zero-inflated models (RZIP and RZINB), first proposed by [5] and later refined by [16], integrate Huber’s M-estimation into standard ZIP/ZINB structures. The robust approach uses the Huber Ψ -function [17], to downweight extreme values during parameter estimation. In the RZI models, the parameters are estimated using a robust Expectation–Maximization (EM) algorithm by [5]. In mixture models like ZI, the EM algorithm is a natural approach for computing maximum likelihood estimates (MLEs) [3]. This algorithm works by introducing latent variables that indicate whether a zero outcome came from the excess zero component or from the count distribution. According to [5] we define τ i = 1 when the observation y i originates from the zero inflation component, and τ i = 0 for the non-zero state
τ i = 1 if y i is from the zero state , 0 if y i is from the non - zero state .
With this, the complete data log-likelihood can be expressed as
c ( α , β ; y , τ ) = i = 1 n τ i z i α log ( 1 + e z i α ) + i = 1 n ( 1 τ i ) log f ( y i ; x i β ) ,
where τ i = ( τ 1 , , τ n ) . Maximizing this log-likelihood is straightforward because the complete data log-likelihood functions for α and β , denoted by α c ( α ; y , τ ) and β c ( β ; y , τ ) , can be optimized separately with respect to their respective parameters using standard methods. The EM algorithm maximizes the log-likelihood of models (10) and (13) through an iterative process consisting of two main steps which is the E-step and the M-step. In the E-step, each latent variable z i is replaced by its conditional expectation, given the current estimates of ( α , β ) . In the M-step, these expected values are treated as fixed, and the complete data log-likelihood c ( β , α ; y , z ) is maximized with respect to α and β . These two steps are repeated alternately until the parameter estimates converge. The algorithm begins with initial values θ ( 0 ) = ( α ( 0 ) , β ( 0 ) ) and proceeds iteratively through the following three steps until convergence.
E-step: Estimate τ i by its conditional expectation τ i t given the current estimates α ( r ) and β ( r ) .
τ i ( t ) = P ( y i zero state ) P ( zero state ) P ( y i zero state ) P ( zero state ) + P ( y i non - zero state ) P ( non - zero state ) .
M-step for α . Find α ( t + 1 ) by maximizing α c ( α ; y , τ ( t ) ) , equivalent to solving
1 n i = 1 n τ i ( t ) logit 1 z i α z i = 0 .
M-step for β . For the non-degenerate component, we find β ( t + 1 ) by maximizing α c ( α ; y , τ ( t ) ) which is equivalent to solving the estimation β ;
1 n i = 1 n ( 1 τ i ( t ) ) y i e x i β x i = 0 .
Hall [5] proposed in the RES approach that the estimating functions (18) and (19) from the M-step of the EM algorithm can be achieved with robustified estimating functions. That is, we downweight observations in the extreme tails of the Poisson or NB component using a bounded influence function ψ c , and apply leverage-based weights ω ( · ) . To robustify the estimation, we replace the standard score functions in Equations (18) and (19) with bounded influence versions that combine residual trimming through ψ c and leverage weighting via ω ( · ) . Specifically, solving for the parameter α ( t + 1 ) and β ( t + 1 ) in the following Equations (20) and (21). The estimation equation for the zero inflation component parameter α is expressed as follows:
1 n i = 1 n ω ( z i ) τ i ( t ) logit 1 ( z i α ) z i = 0 ,
1 n i = 1 n ( 1 τ i ( t ) ) ω ( x i ) ψ c ( y i ) o i ( β , c ) x i = 0 ,
where for the non-degenerate Poisson component, o i ( β , c ) = E ψ c ( Y i ) | Y i Poisson ( μ i ) = j 1 P ( Y i < j 1 ) + μ i P ( j 1 Y i j 2 ) + j 2 P ( Y i > j 2 ) , and for the negative binomial component, o i NB ( β , r , c ) = j 1 P ( Y i < j 1 ) + E [ Y i j 1 Y i j 2 ] · P ( j 1 Y i j 2 ) + j 2 P ( Y i > j 2 ) , where Y i NB ( μ i , r ) . The function ψ c ( y ) truncates extreme responses:
ψ c ( y ) = j 1 , y < j 1 , y , y [ j 1 , j 2 ] , j 2 , y > j 2 ,
with j 1 and j 2 being the c and ( 1 c ) quantiles of the non-degenerate distribution (Poisson or NB). The downweighting function ω ( · ) is used to reduce the influence of high-leverage points, and is taken as ω ( z i ) = 1 h i , where h i is the leverage from the design matrix z i , with an analogous definition for x i .

2.3.2. Robust Hurdle Models

The robust hurdle models used in this study build upon the methodology developed by [18], who proposed a robust version of the hurdle models that are based on standard hurdle models by incorporating bounded influence estimation through the Ψ -Huber function to mitigate the influence of outliers [19]. For the binary hurdle component, we replace the standard logistic regression with a robust logistic regression using the Ψ -Huber function [17]. This ensures that outliers in the binary classification of zeros and non-zeros have less influence.
i = 1 n ψ c ( r i ) w ( x i ) 1 v μ i μ i a ( β ) = i = 1 n ψ ( y i , μ i ) = 0 ,
where r i = ( y i μ i ) / v μ i are the Pearson residuals, v μ i = V a r ( Y i | Y i > 0 ) denotes the conditional variance of the positive count component, i.e., the variance of the truncated Poisson or truncated NB distribution, ψ c ( r ) = min ( c , max ( r , c ) ) is the Huber function, μ i = ( / β ) μ i , and where
a ( β ) = ψ c ( r ) w ( x ) 1 v μ i μ d F ( y | x ) d F n ( x ) = 1 n i = 1 n E [ ψ c ( r i ) ] w ( x i ) 1 v μ i μ i ,
ensures that Fisher is consistent with β . We have this for the truncated Poisson component
E trunc ψ c ( r i ) = 1 1 e μ i c P ( Y i j 2 + 1 ) P ( Y i j 1 ) + μ i v μ i P ( Y i = j 1 ) P ( Y i = j 2 ) ,
where Y i Poisson ( μ i ) , variance v μ i , and this for the truncated NB component
E trunc ψ c ( r i ) = 1 1 r μ i + r r c P ( Y i j 2 + 1 ) P ( Y i j 1 ) + μ i v μ i P ( Y i = j 1 ) P ( Y i = j 2 ) ,
where Y i NB ( μ i , r ) , variance v μ i , and cutoff points
j 1 = μ i c v μ i , j 2 = μ i + c v μ i .
The classical estimating equations of the quasi-likelihood estimator for GLM imply the structure of (22). These would utilize ψ ( y i , μ i ) = ( ( y i μ i ) / v μ i ) μ i . The function ψ c ( r i ) in (22) is responsible for managing deviations in the y-space. To manage deviations in the design space, leverage points are downweighted by adding weights w ( x i ) . For instance, w ( x i ) = 1 h i , where h i is the ith diagonal element of the hat matrix. According to the use in linear regression, the estimator produced from (22) is called a Mallows-type estimator. It reduces to a Huber-type estimator for all i when w ( x i ) = 1 . Note that it is easy to rewrite the estimating equations (22) as
i = 1 n w ˜ ( r i ) w ( x i ) 1 v μ i r i μ i a ( β ) = 0 .
The function w ˜ ( r i ) = ψ c ( r i ) / r i reflects the structure of weighted estimating equations involving two types of weights: w ˜ ( r i ) applied to the residuals and w ( x i ) applied to the design matrix.

2.4. Model Comparison

To assess the performance of the six different models (Poisson, NB, RZIP, RZINB, RHP, and RHNB) under different simulation scenarios, the study adopted the widely used model selection criteria: Akaike’s information criteria (AIC) developed by [20]. AIC has been presented as a model selection criterion for comparing non-nested models based on maximum likelihood, utilizing the fitted log-likelihood function to identify the optimal model. The AIC evaluates the relative quality of a statistical model by rewarding goodness of fit while imposing a penalty that increases with the number of estimated parameters. As the log-likelihood is anticipated to rise with the addition of parameters to a model, the AIC criterion imposes a penalty on models with a greater number of parameters (q). The penalty function may also depend on n, the number of observations. This penalty prevents overfitting. Consequently, the AIC is defined as
AIC = 2 log ( L ) + 2 q ,
where L is the maximal likelihood function value for the estimated model, q is the number of degrees of freedom, and 2 is a tuning parameter to balance the information in the model with the residuals, according to the number of degrees of freedom. It is desirable to choose a model with the lowest AIC. Although AIC penalizes the complexity of the model using a constant factor of 2 per parameter, another widely used criterion, the Bayesian Information Criterion (BIC), takes a different approach. Like AIC, BIC rewards goodness of fit but introduces a stronger penalty for model complexity, which makes BIC more conservative. The Bayesian Information Criterion (BIC) is defined as follows:
BIC = 2 log ( L ) + k log ( n ) ,
where L is the likelihood of the model, k is the number of parameters, and n is the sample size. When fitting models, it is possible to increase the likelihood by adding parameters, but doing so may result in overfitting. The BIC addresses this issue by introducing a penalty term for the number of parameters in the model. The penalty term in BIC is larger than the penalty term in AIC.

3. Simulation Study

To assess the performance of standard and robust count regression models under varying levels of overdispersion, zero inflation, and contamination, a structured simulation study was conducted. The dataset was generated from the ZINB model, which is characterized by a mean variance of this form
V a r ( Y i ) = μ i ( 1 + μ i r ) ,
that allows the variance to be greater than the mean, a common choice for modeling overdispersed count data such as maternal mortality in this study. The simulation design considered combinations of four key factors which are sample size (50, 200, and 500), dispersion levels ( r = 1 , 3 , 5 ) , i.e., 1 is considered as high dispersion, 3 as moderate, and 5 as low in this study, structural zero proportions (0.5, 0.7, and 0.8), and outlier proportions (0.00, 0.05, 0.10, and 0.15). A summary of the simulation scenarios considered in the study is shown below in Table 1. This produced 3 × 3 × 3 × 4 = 108 distinct simulation scenarios in all. Each scenario was run 1000 times to reduce the influence of simulation error, and the AIC was averaged over the 1000 runs.
For each observation i = 1 , , n , the response variable was generated from the ZINB model:
Y i ZINB ( π i , μ i , r ) ,
Y i = π i + ( 1 π i ) · P NB ( 0 | μ i , r ) y i = 0 ( 1 π i ) · P NB ( y | μ i , r ) y i > 0
with the mean μ i , π i is the structural zero probability, and dispersion r where log ( μ i ) = x i · α with a single continuous covariate x i drawn independently from a uniform distribution x i Uniform ( 0 , 1 ) . The intercept was fixed at 1, and the slope coefficient at 1.5, where x i = ( 1 , x i ) and α = ( 1 , 1.5 ) T . To assess the robustness of the models to extreme values, outliers were introduced in the simulated data by randomly selecting a proportion of the observations determined by the specified outlier level (0, 0.05, 0.10, or 0.15), and inflating their outcome values by a factor of 10. Specifically, for a given sample size n, a total of n × outlier level indices were randomly sampled, and the corresponding response values were multiplied by 10. This approach simulates extreme count values that may arise due to data entry errors or rare events, allowing us to evaluate the model’s performance in the presence of such anomalies.

3.1. Simulation Study Findings

3.1.1. Initial Assessment of the Simulation Data

The NB model in Table A2 consistently achieves the lowest AIC across all sample sizes for the initial assessment of the simulation, where the parameters are set to zero, such as the inflation and outliers, confirming it as the best fit for data generated from a ZINB model. As expected, the NB model outperforms the Poisson model, since the data was simulated using an NB model, which is also reflected in its lower AIC value. While robust models like RZIP, RZINB, RHP, and RHNB are designed to handle complexities such as zero inflation and outliers, their AIC values remain higher in this scenario, where no excess zeros or outliers were introduced. These results emphasize the importance of selecting models aligned with the data’s underlying structure, with the NB model excelling in cases of overdispersed count data.

3.1.2. AIC Comparison Across Regression Models

In this section, we describe the analysis of data generated with the ZINB regression model in sixteen different simulation scenarios with varying sample sizes (50, 200, and 500), different levels of outliers (0.0, 0.05, 0.10, and 0.15), and magnitude of dispersion (r = 1, 3, 5). Table A3, Table A4, Table A5, Table A6, Table A7, Table A8, Table A9, Table A10 and Table A11 display the averaged AIC model fit statistics for Poisson, NB, RZIP, RZINB, RHP, and RHNB regression models fitted on data generated with the ZINB regression model with different magnitudes of dispersion, outliers, and sample sizes. The AIC values reveal that the RZINB and RHNB models generally provide the best fit for data generated from a ZINB model, especially under conditions of higher proportions of zeros and levels of outliers. For example, with 5 as the low dispersion level, 0.0% outliers, and a 50% proportion of zeros, the RZIP model achieves a lower AIC than the RZINB model. As the proportion of zeros increases and outliers increases, the RZINB and RHNB models consistently show improved performance, indicating their robustness in handling zero-inflated data. Conversely, the Poisson model and NB have higher AIC values than the other models, reflecting their limitations in scenarios characterized by overdispersion and excess zeros. These results highlight the importance of selecting appropriate modeling techniques that align with the underlying data distribution and characteristics, particularly in the presence of zeros and outliers.

3.1.3. Performance Under Low Outlier Levels and Increasing Dispersion

The RZIP and RHP models, when the dispersion is 5, express superior performance in terms of model fit, particularly under conditions with no outliers (0% outlier level) or a low level of outliers (5% outlier level). This suggests that these models effectively handle zero-inflated datasets with minimal noise. Among them, the RZIP model consistently outperformed the RHP model across varying proportions of zeros (0.5, 0.7, 0.8), achieving the lowest AIC values. This highlights the robustness of RZIP in modeling highly zero-inflated data. These findings emphasize the efficacy of RZIP in accommodating zero inflation and managing datasets with either no or low extreme values. However, when the dispersion reached 3 and the sample size became 200 and above, the RZIP and RHP models demonstrated sensitivity to sample size and data characteristics as they were now outperformed by the RZINB and RHNB models. This reinforces their reliability in scenarios with increased dispersion and low levels of outliers, as evidenced by the results summarized in Table A6, Table A7, Table A8, Table A9, Table A10 and Table A11.

3.1.4. Performance Under Moderate Outlier Levels and Increasing Dispersion

As the level of outliers in the data increases to a moderate threshold of 10–15%, both the RZIP and RHP models show a noticeable decline in performance when compared to more resilient models like the RZINB and RHNB. This trend is evident even when sample size and dispersion remain constant, highlighting the limitations of the RZIP and RHP models in handling moderate to high levels of outliers with low dispersion. As outliers continue to increase and reach 15%, RZIP and RHP persistently lag, suggesting that these models are less effective under conditions with a substantial presence of outliers. In Table A3, Table A4, Table A5, Table A6, Table A7, Table A8, Table A9, Table A10 and Table A11, findings consistently show that RZINB and RHNB outperform other models, especially under moderate and high outlier conditions. This higher performance is consistent across different percentages of zeros and sample sizes, making RZINB and RHNB the most reliable options for dealing with outlier laden data.
As the dispersion parameter in the dataset increased from 3 to 1 and the proportion of outliers escalated from 0% to 5%, 10%, and 15%, notable patterns emerged in model performance. The RZINB model achieved the lowest AIC at outlier thresholds of 0%, 5%, and 10%. However, as the outlier threshold increased to 15%, the RHNB model outperformed RZINB, particularly under higher zero-inflation scenarios. RHNB emerged as the most reliable model under conditions of severe contamination, characterized by high proportions of outliers and substantial overdispersion. Its ability to handle excess zeros while maintaining robustness against noise induced by extreme values highlights its utility in challenging scenarios.

3.1.5. Influence of Sample Size on Model Performance

Examining the performance of RZIP, RHP, RZINB, and RHNB models across sample sizes of 50, 200, and 500 reveals key trends. Model performance varies significantly with sample size, dispersion, and outliers. RZIP and RHP perform well with smaller sample sizes (50) and dispersion of 5, handling outliers up to 5% as shown in Table A3. However, as the sample size and dispersion increase or the outlier proportion exceeds 5%, these models lose accuracy. In contrast, RZINB and RHNB excel under these more challenging conditions, demonstrating superior stability and fit with larger samples, higher dispersion, and more outliers. This comparison emphasizes the importance of aligning model choice with data characteristics. RZIP and RHP are suited for simpler datasets, while RZINB and RHNB are more effective for complex datasets with greater variability and contamination.

4. Application to Maternal Death

4.1. Description of the Study Sample

The real-world dataset comprises records from 222 healthcare facilities in Nairobi, Kenya, which reported maternal health outcomes between October 2021 and January 2022. Each record represents aggregated data at the facility level, not at the patient level. The outcome variable of interest, MaternalDeaths, refers to the number of maternal deaths recorded at each facility, making it appropriate to model this as a count variable. The definition of the variables in the dataset that the study uses includes maternal health outcomes and predictor variables related to delivery characteristics, maternal complications, antenatal care, and stillbirth. The variables used in this study, including maternal health outcomes and predictors related to delivery characteristics, maternal complications, antenatal care, and stillbirths, are described in detail in Table A1 in the Appendix A.

4.2. Exploratory Data Analysis (EDA)

4.2.1. Maternal Mortality Data Overview

The high number of zeros observed in Table 2 justifies the application of robust zero-inflated and hurdle models in this study. These models are designed to handle datasets with excess zeros, outliers, and overdispersion, ensuring a more accurate and robust analysis of maternal mortality data. Maternal death rates are a critical indicator of healthcare quality and access, particularly for women during pregnancy, childbirth, and the postpartum period. Analyzing the frequency distribution of maternal death counts helps us to understand patterns and identify areas for intervention. In this analysis, the study examines a dataset that records the frequency of different maternal death counts. A frequency table of maternal deaths is given in Table 2.
The average maternal death count is 1.3198. However, the variance of 14.118 shows a wide range around the mean, indicating significant data dispersion. The median maternal death count is 0, suggesting that there were no maternal deaths in at least half of the recorded instances. This is supported by the mode of zero, which indicates that the most common outcome in the sample is an absence of maternal deaths. With 61.7% of the dependent variable, the number of maternal deaths, being zero, the dataset contains 4.95% outliers, determined by an algorithm based on the 3 σ rule, which classifies values over three standard deviations from the mean as outliers. Additionally, since the sample variance of the outcome variable (14.118) is higher than the sample mean (1.3198), there is evidence of overdispersion in the data. Thus, our sample dataset clearly shows zero inflation and overdispersion.
In examining the boxplot visualization of the maternal dataset in Figure 1, outliers are observed within the count variable, indicating instances where maternal deaths deviate significantly from the typical distribution. It is essential to recognize and comprehend these outliers, as they may affect the reliability and accuracy of inferences made from the dataset. To provide an accurate evaluation of what is impacting maternal mortality rates using robust count statistical modeling, it is vital to address the existence of outliers.

4.2.2. Assessment of Multicollinearity Test

The analysis identified substantial correlations among several independent variables, indicating multicollinearity (see Figure 2). This issue arises when predictor variables are highly correlated, leading to redundancy and inflated standard errors in the regression coefficients. As a result, estimates become unstable, and theoretically important variables may appear statistically insignificant, obscuring their true relationships with the outcome.
To mitigate multicollinearity between explanatory variables, a correlation-based feature selection technique was applied [21]. Variables exhibiting high linear correlation (|r| > 0.7) were examined, and from each highly correlated pair, one variable was excluded based on theoretical relevance, interpretability, and contribution to model performance. The updated correlation matrix shown in Figure 3 indicates a significant reduction in multicollinearity, confirming the effectiveness of the selection process. By removing highly correlated variables, the updated model, now free from the detrimental effects of high multicollinearity, provided more reliable and interpretable estimates. The reduction in multicollinearity is expected to improve the stability and reliability of the coefficient estimates. However, it is important to approach the interpretation of the results with caution. The relationships between the predictors and the outcome variable should be evaluated in the context of the selected variables.

4.2.3. Adjustment for Facility Size Through an Offset Term

To account for differences in the number of births handled by each healthcare facility, an offset term was included in all fitted count models. Facilities serving larger populations are naturally expected to record more maternal deaths, independent of underlying risk factors. Without this adjustment, model comparisons could be biased toward facilities with higher patient volumes. The offset term was defined as the natural logarithm of the total number of births recorded at each facility during the study period, denoted as log ( TotalBirths i ) . Including this offset allows the models to estimate maternal mortality rates per birth rather than raw counts, ensuring fair comparisons across facilities of different sizes. Mathematically, the count component of the model becomes the following:
log ( μ i ) = β 0 + β 1 x i 1 + + β p x i p + log ( TotalBirths i ) ,
where μ i is the expected number of maternal deaths at facility i, x i j are the predictor variables, and log ( TotalBirths i ) is the fixed offset term. For interpretability, the offset can also be scaled to express rates per 1000 births as log ( TotalBirths i / 1000 ) . This adjustment ensures that model estimates reflect differences in maternal mortality risk, rather than differences in the volume of births handled by each facility.

4.3. Comparison of the Fitted Count Data Models

This is a comparison of all fitted models for maternal death counts using Akaike information criteria and Bayesian information criteria. Table 3 below provides the AIC and BIC values used to choose the model that best fits the data.
The comparison of the performance of the model in Table 3 reveals that the RZIP model provided the best fit to the data, outperforming all other models based on AIC and BIC. The NB model showed considerable improvement over the standard Poisson model, indicating its ability to handle overdispersion in the data more effectively. Among the robust models, the RZINB and RHP models showed competitive performance, especially the RZINB, but were still outperformed by the RZIP model. The RHNB model showed a slightly poorer fit compared to the RZIP, RHP, and RZINB models but performed better than the standard count (ZINB and HNB) and traditional (Poisson and NB) models. Looking at the ZINB, showed a better fit over the HNB and traditional models. These findings generally emphasize the suitability of the RZIP model for this dataset, demonstrating its ability to effectively address zero inflation and outliers at approximately 5%.
Figure 4 shows the observed histogram of maternal deaths overlaid with fitted probability curves from six different count regression models (Poisson, NB, RZIP, RZINB, RHP, and RHNB) that included covariates for assisted deliveries, breech delivery, early teen pregnancy, late teen pregnancy, attending at least four ANC visits (ANC4Visits), cesarean section (CS), and stillbirth. For each model, fitted probabilities were computed for individual observations using the estimated parameters and included covariates and then averaged across all observations to generate the marginal predicted distribution. This approach allows for a fair comparison with the empirical distribution of maternal deaths. The observed distribution exhibits significant excess zeros beyond what would be expected under standard count model assumptions, as also shown in Table 2. This zero inflation necessitates the evaluation of models explicitly designed to handle both structural and sampling zeros. The standard Poisson model performs poorly due to its assumption of equidispersion (equal mean and variance), leading it to severely underestimate the frequency of zero outcomes. Although the NB model partially addresses this by accommodating overdispersion, it still lacks a mechanism to model excess structural zeros. In contrast, the robust hurdle models (RHNB and RHP) and robust zero-inflated models (RZINB and RZIP) offer superior fits by explicitly modeling the dual processes that generate zero and positive counts separately. Among these, the RZIP model provides the best overall fit. Its two-part structure, separating the not-at-risk group (structural zeros) from the at-risk group (count component), enables it to accurately capture both the high frequency of zeros and the distribution of non-zero maternal deaths. These results are consistent with those in Table 3, where the RZIP achieved the lowest AIC and BIC scores.

4.4. Robust Count Regression Estimation Results

A significant frequency of zero counts and outliers was observed in the maternal mortality data. These observations deviate substantially from the expected patterns of standard count data distributions such as the Poisson and NB models. Such deviations suggest that the data may arise from two latent subpopulations, one that is not at risk of maternal mortality and thus always yields a count of zero, and another that is at risk and generates counts according to a standard count process (possibly zero or positive). In maternal mortality analysis, at-risk facilities are those that conduct deliveries with potential complications, high-risk pregnancies, or limited access to advanced obstetric care, making maternal deaths possible. Not-at-risk facilities are structurally unlikely to report maternal deaths because they either do not offer delivery services, handle only low-risk births, transfer complicated cases, or operate solely as antenatal/postnatal clinics. Furthermore, the presence of outliers in the data requires the use of robust estimation techniques, which the standard count models cannot account for. Given these challenges, an RZIP model is identified as the most appropriate for this analysis, as shown in Figure 4 and supported by model comparison results in Table 3. In the following Table 4, the study displays parameter estimates and the robust standard error of the best-performing model, which is RZIP.
Table 4 displays the estimated coefficients of the RZIP model for two parts, namely the count model and the zero model. The zero model explains the excess zeros in the data, while the count model concentrates on the count (frequency) of the event, which is maternal mortality in our case. Each component describes a distinct feature of the data. In the count component, the RZIP model estimates the expected number of maternal deaths per 1000 births among the at-risk group. The estimated intercept is 0.2002 ( S E = 0.2722 ) , which corresponds to an expected count of 0.4245 maternal deaths when all predictors are at their baseline levels. This significantly lower baseline suggests that, in the absence of specific risk factors, maternal mortality remains below the mean of the sample. Among predictors, BreachDelivery has a statistically significant positive effect on maternal mortality among the at-risk group, with an estimated coefficient of 0.2813 (p = 0.0251), corresponding to a 32.48 % increase in the expected number of maternal deaths per 1000 births ( e 0.2813 1 = 0.3248 ) , reinforcing its role as a critical risk factor. Conversely, AssistedDeliveries do not exhibit a meaningful impact, with an estimated effect of 0.5130 ( S E = 0.0417 ,  p = 0.8435), in the at-risk group ( e 0.5130 1 = 0.4013 ) , corresponding to a 40.13% decrease in the expected number of maternal deaths per 1000 births of an individual being in an at-risk group in maternal mortality, suggesting a limited influence on maternal mortality among individuals at risk. EarlyTeenPreg, however, shows a notable positive association in the at-risk group, with an estimated coefficient of 0.1621 ( S E = 0.0304 , p = 0.0185), indicating a 17 . 59 % increase in the expected count of maternal deaths per 1000 births among individuals at risk ( e 0.1621 1 = 0.1759 ) . The statistical significance of this effect suggests that early teenage pregnancy is an important predictor of maternal mortality within the at-risk group.
LateTeenPreg, on the other hand, has an estimated effect of 0.0863 ( S E = 0.0325 , p = 0.7900), in the at-risk group corresponding to a 8.26% decrease in the expected count of maternal deaths per 1000 births ( e 0.0863 1 = 0.0826 ) . However, this effect is not statistically significant ( p > 0.05), suggesting that late teenage pregnancy may not be a strong predictor of maternal mortality among individuals at risk. ANC4Visits and CS are both negatively associated with maternal mortality among individuals at risk, based on the count component of the model. ANC4Visits shows a statistically significant effect, with a 19.46 % decrease in the expected count of maternal deaths per 1000 births ( e 0.2161 1 = 0.1946 ) , Although CS is associated with a 5.31 % decrease in the expected number of maternal death per 1000 births ( e 0.0546 1 = 0.0531 ) , this effect is not statistically significant, indicating limited evidence of its independent contribution within the at-risk population. In contrast, StillBirth shows a positive significant effect, with a 32.77 % increase in the expected count of maternal deaths per 1000 births ( e 0.2835 1 = 0.3277 ) . This finding strongly strengthens the critical role of stillbirth as a key risk factor for maternal mortality among at-risk individuals and emphasizes the need for targeted interventions addressing this factor to reduce maternal deaths.
The zero inflation component models the log-odds of belonging to the not-at-risk group, those for whom maternal mortality cannot occur (i.e., they always yield a zero count). Among predictors, BreachDelivery has an estimated effect of 0.5700 ( S E = 0.1646 , p = 0.5500), implying a 43.45 % decreased odds of an individual being in the not-at-risk group ( e 1.8413 1 = 0.8413 ) , suggesting a limited influence on maternal mortality among individuals that are not-at-risk group. Similarly, AssistedDeliveries has an estimated effect of 0.1189 ( S E = 0.2146 , p = 0.8560), suggesting a limited influence on maternal mortality among individuals who are not-at-risk group. EarlyTeenPreg has an estimated effect of 0.6093 ( S E = 4903 , p = 0.2140), suggesting that for each unit increase in early teenage pregnancies, the odds of belonging to the not-at-risk group ( e 0.6093 1 = 0.8391 ) , but there is limited influence on maternal mortality among individuals not-at-risk ( p > 0.05) indicating no strong evidence. In contrast, LateTeenPreg demonstrates a significant association in the zero-inflation component, with an estimated coefficient of 0.0704 ( S E = 0.2177 , p = 0.0306), indicating a 7.29 % increase in the odds of being in the not-at-risk group ( e 0.0704 1 = 0.0729 ) . This implies that regions with higher late teenage pregnancy rates may be more likely to fall into a subgroup structurally unlikely to report maternal deaths, possibly reflecting reporting differences or underlying protective mechanisms.
ANC4Visits has a statistically significant positive effect, with an estimated coefficient of 2.2527 ( S E = 0.9386 , p = 0.0164), suggesting that for each unit increase in ANC4Visits pregnancies, the odds of belonging to the not-at-risk group ( e 2.2527 1 = 8.5133 ) . This result suggests that higher ANC4Visits is associated with a substantially greater probability of avoiding maternal death, indicating that antenatal care may play a protective role in identifying or preventing risk before it manifests. The significance of this effect strengthens the importance of ANC4Visits in reducing the likelihood of being at risk for maternal mortality. CS has an estimated coefficient of 0.6830 ( S E = 2.0994 , p = 0.0015 ) , corresponding to a 97.98 % increase in the odds of being in the not-at-risk group per unit increase ( e 0.6830 1 = 0.9798 ) . This indicates that individuals with higher values of CS are significantly more likely to be in the not-at-risk group. Similarly, StillBirth has a negative estimated coefficient of 2.9196 ( S E = 9.6433 , p = 0.9862 ) , suggesting a 94.60 % decrease in the odds of being in the not-at-risk group ( e 2.9196 1 = 0.9460 ) . This estimate is also not statistically significant, and the large standard error reflects substantial uncertainty. Thus, there is no reliable evidence that StillBirth affects the likelihood of being in the not-at-risk group.

4.5. The RZIP Model Validation

The chi-squared test in Table 5 was used to validate the model. The test statistic is 220.203 with 214 degrees of freedom. The p-value is 0.371, which is greater than the typical significance level of 0.05. This indicates that there is no evidence to reject the null hypothesis. Thus, the fit of the model to the data is adequate, and there is no significant lack of fit based on this test.

4.6. Overview of Application Results

Count part (Poisson mean with offset):
log ( μ i ) = 1.6043 + 0.2813 · BreachDelivery i 0.5130 · AssistedDeliveries i + 0.1621 · EarlyTeenPreg i 0.0086 · LateTeenPreg i 0.2161 · ANC 4 Visits i 0.0546 · CS i + 0.2835 · StillBirth i + log ( TotalBirths i ) .
Zero-inflation part (logistic model):
log π i 1 π i = 11.0051 0.5700 · BreachDelivery i + 0.1189 · AssistedDeliveries i + 0.6093 · EarlyTeenPreg i + 0.0704 · LateTeenPreg i + 2.2527 · ANC 4 Visits i + 0.6830 · CS i 2.9196 · StillBirth i .
Probability model:
P ( Y i = y i ) = π i + ( 1 π i ) · e μ i , y i = 0 , ( 1 π i ) · μ i y i e μ i y i ! , y i > 0 ,
where μ i and π i are defined above, and log ( TotalBirths i ) is included as an offset to adjust for differences in facility size.

5. Conclusions

This study examined the performance of various count regression models using both simulated and real maternal mortality data. The findings revealed that the RZIP model was the best performer when outlier (0–5%) and dispersion ( r = 5 ) levels were low, making it the most suitable model for such conditions. This result was consistent across both real and simulated data. However, as outlier levels and dispersion increased, the RZINB and RHNB models provided a better fit and predictive accuracy. These models effectively handled datasets with extreme zero inflation and severe outliers, outperforming traditional Poisson and NB models. The AIC and BIC further confirmed the importance of robust zero-inflated models like RZIP in capturing data complexities. In general, RZIP is recommended for datasets with minimal outliers (≤5%), while RZINB and RHNB are better suited for highly dispersed data with severe outliers. These insights highlight the need for researchers to carefully assess model performance and assumptions when analyzing complex count data.
Analysis of the RZIP model showed some statistical significance of maternal mortality. Early teenage pregnancy was associated with a significantly increased risk of maternal death per 1000 births among the at-risk group. This emphasizes a lack of physiological and psychological readiness for childbirth from these young mothers, especially in low-resource settings where access to advanced obstetric care is limited. In alignment with Sustainable Development Goal (SDG) 3.2 [22], which aims to end preventable deaths of newborns and children under five, greater attention must be directed toward maternal health interventions that prioritize early screening and safe delivery methods, such as cesarean sections where medically appropriate. Moreover, the study confirmed that ANC4Visits were statistically significant in both components, positively associated with maternal death in the at-risk group, but also showed a significant increase in the odds of being in the not-at-risk group. In pursuit of the broader SDG 3 goals, health systems should implement tailored strategies that provide adolescent-friendly maternal health services, including education, early risk assessment, and access to skilled birth attendants. Future work should explore the development of a robust hybrid zero inflation model, which combines the strengths of zero-inflated and hurdle frameworks with robust estimation to better handle extreme zero inflation, overdispersion, and outliers. Addressing multicollinearity through techniques such as principal component analysis (PCA) or regularization can further enhance model accuracy.

Author Contributions

Conceptualization, P.P., R.T.C. and C.S.M.; methodology, P.P. and R.T.C.; software, P.P. and R.T.C.; validation, P.P., R.T.C. and C.S.M.; formal analysis, P.P.; investigation, P.P.; resources, P.P.; data curation, P.P.; writing original draft preparation, P.P.; writing review and editing, P.P., R.T.C. and C.S.M.; visualization, P.P.; supervision, R.T.C. and C.S.M.; project administration, R.T.C. and C.S.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by a supervisor-linked bursary from the University’s Research and Innovation Office, which also contributed partial funding for the article processing charges (APC).

Data Availability Statement

Supporting data is accessible at: https://doi.org/10.5061/dryad.zs7h44jdc.

Acknowledgments

The authors would like to express their sincere gratitude to the reviewers and the Editor for their insightful comments and constructive feedback, which significantly contributed in improving the quality of this paper.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Appendix A.1

The description of the variables used in this study is defined in the following Table A1.
Table A1. Descriptions of variables used in the study.
Table A1. Descriptions of variables used in the study.
VariableDescription
MaternalDeathsTotal number of maternal deaths recorded.
AssistedDeliveriesNumber of vaginal deliveries conducted with assistance.
BreachDeliveryNumber of deliveries in breech position.
CSNumber of caesarean section deliveries.
LiveBirthsTotal number of live births.
EarlyTeenPregPregnant adolescents aged 10–14 years at their first antenatal care (ANC) visit.
LateTeenPregPregnant adolescents aged 15–19 years at their first ANC visit.
NormalDeliveriesNumber of deliveries without surgical or assisted interventions.
ANC4VisitsNumber of women who had at least four ANC visits during pregnancy.
Uterotonics3stgWomen who received uterotonics during or immediately after the third stage of labor.
CarbatosinMothers given Carbatosin within one minute after birth.
OxytocinMothers given Oxytocin within one minute after birth.
EclampsiaNumber of reported cases of eclampsia.
AntHaemorrageNumber of cases of antepartum haemorrhage.
PostHaemorrageNumber of cases of postpartum haemorrhage.
ObstructedLabourDeliveries complicated by obstructed labor.
RupturedUterusNumber of cases involving uterine rupture.
SepsisNumber of maternal sepsis cases.
FGMComplicatonsDelivery complications attributed to Female Genital Mutilation (FGM).
StillbirthNumber of macerated stillbirths recorded.

Appendix A.2

Note: The bold numbers denote the lowest AIC (which determines the best model), while the ones in the block reflect the second-lowest AIC (which determines the second-best model).
Table A2. Comparing AIC values for models fitted to data generated using an NB model.
Table A2. Comparing AIC values for models fitted to data generated using an NB model.
Sample SizePoissonNBRZIPRZINBRHPRHNB
50266.960153.870268.480199.260268.020199.130
200589.190461.600489.710470.840488.640469.150
5001896.7601389.0601650.3401401.3001649.5901400.290
Table A3. AIC fit statistics with varying parameters, a fixed sample size of 50, and a dispersion of 5.
Table A3. AIC fit statistics with varying parameters, a fixed sample size of 50, and a dispersion of 5.
OutliersProp of ZeroPoissonNBRZIPRZINBRHPRHNB
0.00.5496.377307.548209.245244.406209.406244.489
0.7382.985295.174203.395250.639203.697250.616
0.8299.579250.422164.671189.534166.136189.748
0.050.5892.394619.407221.731408.289225.545408.386
0.7651.145462.172355.345253.034355.238253.397
0.8431.770335.132111.974238.511113.412238.386
0.100.5985.508792.451335.400234.011392.386233.324
0.7590.654489.585264.072259.122264.009259.544
0.8720.789440.900324.987216.363324.780217.951
0.150.5805.736635.367524.872442.714524.771441.866
0.7864.878786.379529.374362.166529.329362.849
0.8802.551656.557506.557321.393506.352319.657
Table A4. AIC fit statistics with varying parameters, a fixed sample size of 200, and a dispersion of 5.
Table A4. AIC fit statistics with varying parameters, a fixed sample size of 200, and a dispersion of 5.
OutliersProp of ZeroPoissonNBRZIPRZINBRHPRHNB
0.00.5790.969673.619626.241628.711637.4850639.259
0.7634.526478.498459.158462.408459.250460.426
0.8508.746344.618343.351355.219343.578355.186
0.050.51358.8951081.735699.238705.374703.072781.635
0.71071.338602.092490.695594.989500.646595.087
0.8767.973499.480362.790482.297368.665482.345
0.100.51847.0621399.119750.727753.169899.971743.987
0.71383.3691119.060887.516519.300887.508521.188
0.81273.901985.569720.022501.215719.685499.567
0.150.52100.4911650.317792.049782.836796.301783.722
0.71758.8361065.086550.029545.644552.289510.068
0.81302.0951999.867707.985425.134707.974410.876
Table A5. AIC fit statistics for models with varying parameters, given a fixed sample size of 500 and dispersion of 5.
Table A5. AIC fit statistics for models with varying parameters, given a fixed sample size of 500 and dispersion of 5.
OutliersProp of ZeroPoissonNBRZIPRZINBRHPRHNB
0.00.51952.2801684.0401466.4601579.4201466.6801580.030
0.71587.8791208.6071145.7251198.6741146.9821198.491
0.81283.602962.424857.373890.434890.491859.216
0.050.53352.6702659.2601743.1332173.1541757.5322173.380
0.72626.2481847.1961246.2351311.1591244.0101311.525
0.82113.3941338.705960.9051120.507990.6351119.966
0.100.53666.6952676.7512456.7831854.2942456.7011873.620
0.73593.7612316.1441817.9781310.9831709.4061314.567
0.82753.6191978.5931598.1491076.6591598.0861077.034
0.150.55709.1573974.5482186.3901972.9432186.6291951.884
0.74480.3171367.3432745.1521365.1662745.6531361.781
0.83471.3942008.5941899.4321007.2101895.4441006.745
Table A6. AIC fit statistics for models with varying parameters, given a fixed sample size of 50 and dispersion of 3.
Table A6. AIC fit statistics for models with varying parameters, given a fixed sample size of 50 and dispersion of 3.
OutliersProp of ZeroPoissonNBRZIPRZINBRHPRHNB
0.00.5464.842351.282249.946251.093250.849251.078
0.7532.040410.725309.592310.436309.739310.684
0.8608.652485.119383.096384.573383.288384.762
0.050.5895.515672.718442.356542.138573.728497.223
0.7782.124650.333413.902496.301442.678499.847
0.8694.543472.150328.243370.745399.654369.982
0.100.5954.332742.886593.001332.668512.795416.232
0.7912.765731.921572.581419.562573.478452.482
0.8821.889642.357428.340388.123456.178325.847
0.150.51064.543834.251673.892489.328578.652390.412
0.7892.145702.123523.459462.317548.712389.439
0.8781.473601.314489.762384.908412.658368.434
Table A7. AIC fit statistics for models with varying parameters, given a fixed sample size of 200 and dispersion of 3.
Table A7. AIC fit statistics for models with varying parameters, given a fixed sample size of 200 and dispersion of 3.
OutliersProp of ZeroPoissonNBRZIPRZINBRHPRHNB
0.00.51456.8251302.7171204.1501154.2341205.4651156.786
0.71267.5471054.892983.328935.659982.531937.426
0.81098.231856.499785.289751.235783.984652.948
0.050.52103.4511507.1221376.4531318.2241378.9451321.736
0.71896.7571459.4781238.9921189.6301240.7131187.521
0.81768.3431230.5491124.3251070.6591126.2771074.380
0.100.52450.9211623.4501509.3821459.5161512.8721434.170
0.72238.1701498.2141387.4151329.4251390.4451327.236
0.82023.4601357.9181243.3561196.2321246.1451193.704
0.150.52697.3542078.6491670.4751595.9761673.8321599.870
0.72489.9981925.231512.3191453.1781515.5341450.142
0.82267.1021673.891378.6721319.9821380.3241317.945
Table A8. AIC fit statistics for models with varying parameters, given a fixed sample size of 500 and dispersion of 3.
Table A8. AIC fit statistics for models with varying parameters, given a fixed sample size of 500 and dispersion of 3.
OutliersProp of ZeroPoissonNBRZIPRZINBRHPRHNB
0.00.51903.4151723.1871567.1201517.3241569.6871519.436
0.71765.3261498.5541329.7621285.3821331.4751287.767
0.81593.7601245.9801176.3401130.4801178.3401123.210
0.050.52403.8091824.7101698.3251629.8901701.3461634.780
0.72176.4501689.3401568.7301514.3801570.5681512.097
0.81978.2301534.9781432.1991386.7861435.4131380.672
0.100.52732.4352019.8731902.3541825.3641905.1521820.458
0.72508.6791875.7681768.4351703.5231780.5471716.352
0.82304.5601703.2341612.9581550.5481615.3401554.352
0.150.52954.7872154.3222050.1531969.3742052.781947.894
0.72732.2342013.3541898.2611823.5431901.6451816.976
0.82534.8891845.9781743.5841675.3741746.1821643.389
Table A9. AIC fit statistics for models with varying parameters, given a fixed sample size of 50 and dispersion of 1.
Table A9. AIC fit statistics for models with varying parameters, given a fixed sample size of 50 and dispersion of 1.
OutliersProp of ZeroPoissonNBRZIPRZINBRHPRHNB
0.00.5364.842301.282275.946251.093278.849269.078
0.7532.104410.725349.592310.436345.739315.684
0.8608.652485.119398.096384.573389.288379.762
0.050.5545.315435.218388.256300.223372.178342.138
0.7482.124410.333343.902276.847330.678296.301
0.8434.543372.150328.243269.982309.654289.745
0.100.5554.332442.886393.001382.668376.232312.795
0.7512.765431.921372.581272.482349.562303.478
0.8421.889372.357338.34285.847318.123306.178
0.150.5664.543534.251473.892390.412459.328328.652
0.7592.145502.123423.459399.439408.712372.317
0.8481.473401.314389.762334.908362.658318.434
Table A10. AIC fit statistics for models with varying parameters, given a fixed sample size of 200 and dispersion of 1.
Table A10. AIC fit statistics for models with varying parameters, given a fixed sample size of 200 and dispersion of 1.
OutliersProp of ZeroPoissonNBRZIPRZINBRHPRHNB
0.00.51567.1341328.7621256.4551196.5461258.0121198.832
0.71389.3451149.2111087.5431030.2131089.9181032.314
0.81221.788987.485923.678870.568926.34878.845
0.050.52032.3941534.2911420.7891361.8991423.1291364.329
0.71854.6781398.6541289.6321234.4561291.8761231.986
0.81698.4551245.5891143.5561101.7651146.3451087.554
0.100.52345.6871658.3841556.4851487.8321558.8981489.132
0.72187.4251523.9581423.7601365.3341426.1421368.233
0.81999.8971376.4351278.3221223.9281280.6071222.485
0.150.52598.1321787.3441698.4511627.9821701.1261629.232
0.72387.4451645.9281545.7681489.5341548.6781487.761
0.82198.3421498.7261398.2451343.1321400.8171341.627
Table A11. AIC fit statistics for models with varying parameters, given a fixed sample size of 500 and dispersion of 1.
Table A11. AIC fit statistics for models with varying parameters, given a fixed sample size of 500 and dispersion of 1.
OutliersProp of ZeroPoissonNBRZIPRZINBRHPRHNB
0.00.54154.7483576.3542454.1622397.1922456.6172398.760
0.72899.4452321.6872145.3142204.9892143.9382207.523
0.82643.1922045.8981934.7581889.7661937.3741787.564
0.050.53521.1782676.4352554.1522489.3422557.8392487.554
0.73254.1202439.6272321.4952263.9812324.7662265.321
0.82978.4052178.3242067.5142013.4252070.3542015.236
0.100.53845.6122789.5672678.4582618.1922681.4502607.341
0.73576.3342556.7082434.1202356.9802437.7612375.154
0.83321.4152287.8542176.3842110.2132178.9882116.697
0.150.54154.8982923.6672812.4572732.7282815.3142724.189
0.73898.4752689.2322576.7782512.3452579.9892511.238
0.83627.5342435.4122321.6452267.6892324.7812265.671

References

  1. Shahsavari, S.; Yaseri, M.; Hosseini, M.; Moghimbeigi, A. Robust Inference for Zero-Inflated Models with Outliers Applied to the Number of Involved Lymph Nodes in Patients with Breast Cancer. 2023. Available online: https://doi.org/10.21203/rs.3.rs-2692063/v1 (accessed on 29 May 2025).
  2. Feng, C.X. A comparison of zero-inflated and hurdle models for modeling zero-inflated count data. J. Stat. Distrib. Appl. 2021, 8, 8. [Google Scholar] [CrossRef] [PubMed]
  3. Lambert, D. Zero-inflated Poisson regression, with an application to defects in manufacturing. Technometrics 1992, 34, 1–14. [Google Scholar] [CrossRef]
  4. Min, Y.; Agresti, A. Random effect models for repeated measures of zero-inflated count data. Stat. Model. 2005, 5, 1–19. [Google Scholar] [CrossRef]
  5. Hall, D.B. Robust estimation for zero-inflated Poisson regression. Scand. J. Stat. 2010, 37, 237–252. [Google Scholar] [CrossRef]
  6. Mullahy, J. Specification and testing of some modified count data models. J. Econom. 1986, 33, 341–365. [Google Scholar] [CrossRef]
  7. Tüzen, F. A simulation study for count data models under varying degrees of outliers and zeros. Commun. Stat.—Simul. Comput. 2018, 49, 1078–1088. [Google Scholar] [CrossRef]
  8. Tawiah, K.; Iddi, S.; Lotsi, A. On Zero-Inflated Hierarchical Poisson Models with Application to Maternal Mortality Data. Int. J. Math. Math. Sci. 2020, 2020, 1407320. [Google Scholar] [CrossRef]
  9. Bassey, U.E.; Akinyemi, M.I.; Njoku, K.F. On Zero inflated models with applications to maternal healthcare utilization. Int. J. Math. Sci. Optim. Theory Appl. 2021, 7, 65–75. [Google Scholar] [CrossRef]
  10. Abonazel, M.R.; El-sayed, S.M.; Saber, O.M. Performance of robust count regression estimators in the case of overdispersion, zero inflated, and outliers: Simulation study and application to German health data. Commun. Math. Biol. Neurosci. 2021, 2021, 55. [Google Scholar] [CrossRef]
  11. Adehi, M.; Yakasai, A.; Dikko, H.; Asiribo, E.; Dahiru, T. Risk of maternal mortality using relative risk ratios obtained from poisson regression analysis. Int. J. Dev. Res. 2017, 7, 15405–15409. [Google Scholar]
  12. Okello, S.; Otieno Omondi, E.; Odhiambo, C.O. Improving performance of hurdle models using rare-event weighted logistic regression: An application to maternal mortality data. R. Soc. Open Sci. 2023, 10, 221226. [Google Scholar] [CrossRef] [PubMed]
  13. Chau, A.M.H.; Lo, E.C.M.; Wong, M.C.M.; Chu, C.H. Interpreting poisson regression models in dental caries studies. Caries Res. 2018, 52, 339–345. [Google Scholar] [CrossRef] [PubMed]
  14. Dean, C.B.; Lundy, E.R. Overdispersion. Wiley StatsRef Stat. Ref. Online 2016, 1–9. [Google Scholar] [CrossRef]
  15. Payne, E.H.; Hardin, J.W.; Egede, L.E.; Ramakrishnan, V.; Selassie, A.W.; Gebregziabher, M. Approaches for dealing with various sources of overdispersion in modeling count data: Scale adjustment versus modeling. Stat. Methods Med. Res. 2015, 26, 1802–1823. [Google Scholar] [CrossRef] [PubMed]
  16. Shahsavari, S.; Moghimbeigi, A.; Kalhor, R.; Jafari, A.M.; Bagherpour-kalo, M.; Yaseri, M.; Hosseini, M. Zero-Inflated Count Regression Models in Solving Challenges Posed by Outlier-Prone Data; an Application to Length of Hospital Stay. Arch. Acad. Emerg. Med. 2024, 12, e13. [Google Scholar] [PubMed]
  17. Huber, P.J. Robust estimation of a location parameter. In Breakthroughs in Statistics: Methodology and Distribution; Springer: Berlin/Heidelberg, Germany, 1992; pp. 492–518. [Google Scholar]
  18. Cantoni, E.; Zedini, A. A robust version of the hurdlemodel. J. Stat. Plan. Inference 2011, 141, 1214–1223. [Google Scholar] [CrossRef]
  19. Miranda, M.; Miranda, M.C.; Gomes, M.I. A robust hurdle poisson model in the estimation of the extremal index. In Recent Developments in Statistics and Data Science; Springer: Cham, Switzerland, 2022; pp. 15–28. [Google Scholar] [CrossRef]
  20. Akaike, H. A new look at the statistical model identification. IEEE Trans. Autom. Control. 1974, 19, 716–723. [Google Scholar] [CrossRef]
  21. Hall, M.A. Correlation-based feature selection of discrete and numeric class machine learning. In Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, 29 June–2 July 2000. [Google Scholar]
  22. GBD 2019 Under-5 Mortality Collaborators. Global, regional, and national progress towards Sustainable Development Goal 3.2 for neonatal and child health: All-cause and cause-specific mortality findings from the Global Burden of Disease Study 2019. Lancet 2021, 398, 870–905. [Google Scholar]
Figure 1. Visualization of outliers.
Figure 1. Visualization of outliers.
Mca 30 00095 g001
Figure 2. Visualization indicating multicollinearity patterns.
Figure 2. Visualization indicating multicollinearity patterns.
Mca 30 00095 g002
Figure 3. Retained variables after multicollinearity screening.
Figure 3. Retained variables after multicollinearity screening.
Mca 30 00095 g003
Figure 4. Probability models for the number of maternal deaths.
Figure 4. Probability models for the number of maternal deaths.
Mca 30 00095 g004
Table 1. Simulation scenarios considered in the simulation study.
Table 1. Simulation scenarios considered in the simulation study.
Sample SizeProp of ZerosDispersionOutliers
0.00
500.510.05
2000.730.10
5000.850.15
Table 2. Frequency table of maternal deaths.
Table 2. Frequency table of maternal deaths.
CountFrequencyPercentage (%)Cumulative (%)
013761.7161.71
12913.0674.77
22812.6187.39
383.6090.99
473.1594.14
552.2596.40
720.9097.30
810.4597.75
1310.4598.20
1410.4598.65
1510.4599.10
2810.4599.55
3910.45100.00
Total222100.00
Table 3. Estimated AIC and BIC for the maternal mortality data.
Table 3. Estimated AIC and BIC for the maternal mortality data.
ModelAICBIC
Poisson249.039265.880
NB248.600263.569
ZINB231.808268.242
HNB240.032276.466
RZIP129.871164.161
RZINB131.804168.237
RHP136.419170.709
RHNB138.419174.853
Table 4. Model estimation of coefficients using robust zero-inflated models.
Table 4. Model estimation of coefficients using robust zero-inflated models.
ParameterRZIP (Count Model)RZIP (Zero Model)
Estimate (SE) p-Value Estimate (SE) p-Value
Intercept1.6043 (0.1131)-11.0051 (6.8591)-
BreachDelivery0.2813 (0.0293) 0.0251 −0.5700 (0.1646)0.5500
AssistedDeliveries−0.5130 (0.0417) 0.8435 0.1189 (0.2146) 0.8560
EarlyTeenPreg0.1621 (0.0304) 0.0185 0.6093 (0.4903)0.2140
LateTeenPreg−0.0086 (0.0325)0.79000.0704 (0.2177)0.0306
ANC4Visits−0.2161 (0.0424) 0.0319 2.2527 (0.9386)0.0164
CS−0.0546 (0.0273) 0.6580 0.6830 (2.0994)0.0015
Stillbirth0.2835 (0.0305) 0.0140 −2.9196 (9.6433)0.9862
Table 5. Model validation results.
Table 5. Model validation results.
StatisticValue
Chi-Square Statistic220.203
Degrees of Freedom214
p-value0.371
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Pitsha, P.; Chiruka, R.T.; Marange, C.S. A Comparison of the Robust Zero-Inflated and Hurdle Models with an Application to Maternal Mortality. Math. Comput. Appl. 2025, 30, 95. https://doi.org/10.3390/mca30050095

AMA Style

Pitsha P, Chiruka RT, Marange CS. A Comparison of the Robust Zero-Inflated and Hurdle Models with an Application to Maternal Mortality. Mathematical and Computational Applications. 2025; 30(5):95. https://doi.org/10.3390/mca30050095

Chicago/Turabian Style

Pitsha, Phelo, Raymond T. Chiruka, and Chioneso S. Marange. 2025. "A Comparison of the Robust Zero-Inflated and Hurdle Models with an Application to Maternal Mortality" Mathematical and Computational Applications 30, no. 5: 95. https://doi.org/10.3390/mca30050095

APA Style

Pitsha, P., Chiruka, R. T., & Marange, C. S. (2025). A Comparison of the Robust Zero-Inflated and Hurdle Models with an Application to Maternal Mortality. Mathematical and Computational Applications, 30(5), 95. https://doi.org/10.3390/mca30050095

Article Metrics

Back to TopTop