1. Introduction
A discretization process refers to the reduction of a probability density into a finite number of representative mass points. This is particularly useful when evaluating models that are computationally expensive or when complete distributional knowledge is inaccessible. The discretization process is generally implemented in two steps: first, selecting the discretization points—often based on percentiles of the continuous distribution—and second, assigning probability masses accordingly. Such approaches are well-suited for practical data, for instance, lifetimes measured in days for medical patients, counts of units until system failure in engineering, or frequency data from accidents, species occurrence, insurance claims, and longevity studies; see Keefer and Bodily [
1], Zaino and D’Errico [
2], Hammond and Bickel [
3], among others. Discrete probability models have thus become increasingly relevant in medical, engineering, reliability, and survival contexts. However, most existing studies focus on continuous formulations or generic discretization approaches, such as the ZLindley (ZL) model by Saaidia et al. [
4], overlooking the complexity of distributions. To address this gap, researchers have proposed new discretization methods that extend the theory of reliability while preserving essential structural properties of the continuous parent distribution. For example, see Afify et al. [
5], Alotaibi et al. [
6], Chesneau et al. [
7], Haj Ahmad and Elshahhat [
8], Elshahhat et al. [
9], and references therein.
The discretization approach proposed by Roy and Gupta [
10] constructs a discrete probability model directly from the survival function of a continuous distribution by defining the probability mass at an integer point as the difference between successive survival probabilities. This mechanism guarantees non-negativity and unit total mass while preserving important reliability features of the parent distribution, such as monotonic hazard behavior and tail characteristics. As a result, the obtained discrete model remains closely connected to its continuous analog and is particularly suitable for survival and reliability analysis involving integer-valued lifetimes. Among various approaches, survival-based discretization has gained particular prominence because it preserves essential reliability characteristics of the underlying continuous model, such as monotone hazard behavior, tail properties, and stochastic ordering, while yielding analytically tractable probability mass functions that are well-suited for likelihood-based inference and censoring analysis. For more details, see Roy and Gupta [
10] (for discrete normal), Roy and Ghosh [
11] (for discrete Rayleigh), and Bebbington et al. [
12] (for discrete additive Weibull model).
In this paper, we use the surviving discretization approach to convert the continuous ZL distribution to its discrete counterpart. Real-world applications often require data that has been filtered or collected at discrete intervals. This need arises even though the ZL distribution can effectively reproduce an increasing hazard rate shape within a continuous framework. The survival function technique for discretizing the ZL distribution not only retains crucial statistical properties like percentiles and quantiles, but it also makes it easier to evaluate data with Type-II censoring and other constraints. Through comprehensive simulation studies and carefully designed inferential procedures, we address the computational challenges associated with numerical optimization and Bayesian sampling of the discrete ZL (DZL) model that naturally arise in survival-based discrete models, while maintaining analytical tractability and reliable estimation performance. Although the DZL model outperforms other models in comparative analysis, it may be susceptible to the effects of specific data patterns, especially if the failure rate differs from the model that features a monotonic risk rate structure designed to accommodate it. Let n be the number of objects that are subjected to a life-testing experiment; only the first r failure events (denoted as ) are recorded, and when the rth failure occurs, items are withdrawn, thereby stopping the examination process.
The DZL model distinguishes itself from other discrete distributions with one, two, or three parameters by providing an increasing hazard rate function. It also possesses several key statistical properties, including closed-form expressions for the probability mass function, survival function, moments, stress–strength reliability, and mean residual life, which ensure analytical tractability and practical interpretability. In light of its hazard rates, the proposed model can replicate a wide range of datasets, including those with negative skew. In two real-world applications, the DZL distribution outperformed nine distinct discrete distribution models described in the literature. Moreover, these properties facilitate reliable inference under censoring and small-sample settings, making the DZL model particularly suitable for discrete lifetime and reliability data encountered in real-world applications. Bootstrapping, Bayesian, and maximum-likelihood approaches are used to estimate the DZL() parameter using censoring and uncensoring procedures. Monte Carlo simulations assess the performance of trained estimators by utilizing accuracy metrics, including squared error, absolute bias, interval length, and coverage percentage. Collecting datasets from both engineering and healthcare applications, the DZL features are assessing their adaptability, predictive capability, and applied significance. In addition, the analysis illustrates how the model’s inferential results can inform practical processes and systematically compares its performance with that of widely recognized discrete probability models, for example, discrete Weibull, discrete gamma, discrete Nadarajah–Haghighi, and geometric, among others.
The remainder of the paper is structured as follows: The DZL distribution and its features are explored in
Section 2 and
Section 3, respectively. The point and interval estimates are created in
Section 4 and
Section 5, respectively.
Section 6 displays and interprets the Monte Carlo results, while
Section 7 examines data groups. The results and conclusions are discussed in
Section 8. In Abbreviations Section, all abbreviations used in the paper are defined and listed.
2. New DZL Distribution
The Lindley distribution, originally introduced by Lindley [
13] as a mixture of exponential and gamma laws, has found wide application in reliability and survival analysis. Despite its usefulness, the Lindley law is limited by its monotonically decreasing hazard function, which restricts its flexibility in modeling diverse lifetime behaviors. Subsequently, the ZL model has been introduced to enhance the flexibility of traditional models like the Lindley distribution. It is designed to handle both symmetric and left-skewed data, and is particularly suitable for reliability and survival analysis. Building upon this advancement, the present work develops the discrete analog of the ZL distribution. A new DZL model retains the flexibility of its continuous counterpart while being tailored for integer-valued data, thereby extending its applicability to reliability studies, count processes, and real-world phenomena where observations naturally occur in discrete form. We next introduce the probabilistic structure of the one-parameter ZL
distribution (
is a scale) by specifying its probability density function, denoted by
, and its corresponding survival function, denoted by
, for a random variable
X that follows this model, as
and
see, for more details, Augusto Taconeli and Rodrigues de Lara [
14].
Saaidia et al. [
4] stated that the ZL distribution is simpler and more tractable than many generalized models. It is also more flexible than the standard Lindley distribution and other competing distributions, such as XLindley, Xgamma, and Zegghdoudi. It is suitable for datasets that are not well-modeled using conventional distributions. In short, the ZL distribution is an innovative and practical model for age data, offering greater mathematical simplicity and flexibility than many traditional distributions. Recently, two new versions of the ZL distribution were proposed by Mahnashi and Zaagan [
15] and Zeghbib et al. [
16], respectively. Discretizing the ZL distribution provides a powerful extension of its continuous version, allowing its use in many real-world situations that involve count data or event occurrences observed in discrete time units. This transformation is especially useful in practical fields like reliability analysis, actuarial science, and queuing systems, where the concept of continuity may not align with the way data is captured, as they are often in integers. By using a survival-function-based discretization approach, the resulting model maintains key structural properties, such as flexibility in skewness and tail behavior, while offering a better fit than traditional discrete formulations like Poisson, geometric, or negative binomial. Numerous techniques have been developed for discretizing continuous distribution models, among which the method based on the survival function stands out as particularly influential. This survival-based discretization framework serves as the foundation for formulating the discrete analog of the model. Following this technique, we introduce a DZL
model and some of its essential characteristics. Consistent with the approach outlined by Roy and Gupta [
10], the probability mass function (PMF,
) of the DZL distribution, constructed via survival discretization, is
where
.
Consequently, from (
3), the cumulative distribution function (CDF,
), survival function (SF,
), and hazard rate function (HRF,
) of
Y, respectively, are
and
For various configurations of
in (
4) and (
6),
Figure 1a illustrates that the DZL probability mass function is always decreasing, highlighting its suitability for modeling monotonically declining count data. In contrast,
Figure 1b demonstrates that the DZL hazard rate function is monotonically increasing, indicating an increasing risk or failure rate over time—a behavior that is desirable in many reliability and survival analysis contexts.
3. Statistical Functions
This section presents a detailed examination of several key statistical properties of the DZL distribution, including its quantile function, moment structure, order statistics, and stress–strength, among others.
3.1. Quantile
The inverse of the CDF for a discrete distribution defines the quantile function
, which is commonly used in simulation studies to generate random samples. Starting from the CDF given in Equation (
4), we derive, through algebraic manipulation, the closed-form expression for the quantile function of the DZL distribution:
hence,
To solve Equation (
7), we substitute
,
The symbolic quantile function
, after some simplifications, is
Since y exists in both the linear term and the exponent, there is generally no closed-form solution. Usually, a numerical iterative solution is required.
3.2. Moments
This part introduces the rth non-central moment of the DZL model in turn to develop the other central moments and associated coefficients.
Theorem 1. Let . The rth non-central moment of Y (say, ) is given byprovided the series converges. Proof. For a discrete random variable, the
rth non-central moment of
Y can be written as
Applying a standard summation-by-parts argument for discrete sequences yields
Using the binomial expansion, the finite difference term can be expressed as
Substituting (
13) into (
12), we get
Hence, from (
5) and (
14), the
rth non-central moment of the DZL distribution can be expressed as
which completes the proof. □
3.3. Descriptive and Dispersion Measures
Setting
in Theorem 1, the mean
(or
) of
y, a non-negative random variable that follows the
distribution, is given by
Using the following summation identities into (
16), where
, we get
which is equivalent to
Similarly, setting
into Theorem 1, the second non-central moment (say,
) becomes
Again, using the following summation identities into (
16), where
, we get
Thus, from (
20), the moment
can be formulated as follows:
As a result, the DZL distribution’s variance (say,
) is
where
provided in (
16) and (
21), respectively.
To evaluate and compare the variability of two independent samples, the coefficient of variation () serves as an appropriate relative measure of dispersion. It is formally defined as the quotient of the standard deviation and the corresponding mean, It is desirable because it is scale-invariant and dimensionless, which allows for meaningful comparisons of variability across datasets that differ in magnitude or measurement units. In practice, this makes a widely used index in reliability studies, survival analysis, and quality control.
Unlike
, the dispersion index (DI),
, is not scale-invariant and is mainly employed in the context of non-negative discrete data. It is a fundamental diagnostic tool for count data models, where it serves to characterize departures from the equi-dispersion property of the Poisson distribution. The dispersion index
characterizes the variability pattern of the data, where
indicates
over-dispersion, meaning that the observed variability exceeds the Poisson benchmark,
corresponds to
equi-dispersion, which is consistent with the Poisson case, and
signifies
under-dispersion, where the variability is lower than expected under Poisson assumptions. A comprehensive numerical analysis is summarized in
Table 1, which reports the mean (
), variance (
), dispersion index (
), coefficient of variation (
), skewness (
), and kurtosis (
) for various options of
. As
grows, the results in
Table 1 reveal the following trends:
For : The mean decreases progressively, indicating a leftward shift in the central tendency of the distribution;
For : A decreasing variance reflects a reduction in the overall dispersion of the data;
For : The decreasing index of dispersion suggests the distribution transitions from substantial over-dispersion toward a more equi-dispersed structure;
For : The increasing coefficient of variation implies that relative variability becomes more pronounced as the mean diminishes;
For : The upward trend in skewness indicates that the distribution becomes increasingly asymmetric, with a longer right tail;
For : Rising kurtosis values signal a growing peakedness and heavier tails, suggesting an increased likelihood of extreme outcomes in the distribution.
For further examination,
Figure 2 visualizes the actual behavior of
,
,
, and
and supports the numerical findings in
Table 1. In conclusion, as
increases, the distribution becomes more concentrated around smaller values while exhibiting higher relative variability, skewness, and kurtosis.
3.4. Stress–Strength Reliability
Let
(for
) be two independent discrete random variables following DZL distributions with parameters
and
, respectively. The stress–strength reliability (SSR) parameter (say,
) is defined as
which represents the probability that the strength
exceeds the applied stress
; see Singh et al. [
17] for further details.
Theorem 2. If and are independent, then the SSR parameter is given bywhere denotes the SF of . Proof. Taking two independent discrete random variables
(for
), the SSR index is given by
Since
are independent,
Therefore, the expression of
is defined as
For the DZL distribution, the PMF of
is
Consequently, from (
5) and (
27), the SSR of the DZL model can be represented as
where
and
□
It is worth noticing here that the SSR of the DZL distribution is exact and converges for all for . The SSR can be efficiently evaluated numerically and reduced to simpler expressions in special cases, such as . This reliability measure plays an important role in reliability engineering and lifetime data analysis involving discrete survival models.
3.5. Mean Residual Life
The mean residual life (MRL) function is an essential dependability metric that indicates a unit’s predicted remaining lifetime after surviving up to a specified time point; see Tang et al. [
18].
Theorem 3. The MRL function of a discrete random variable Y, given that it follows the , iswhere denotes the SF of Y. Proof. The MRL, also known as the average remaining life span, refers to a component that survived for a specific duration
, and is defined as
Since,
hence, with simple algebraic manipulations, we obtain
where the change of variable
has been applied.
Consequently, from (
5) and (
33), the MRL of the DZL model can be represented as
which completes the proof. □
The MRL (
34) exists and is finite for all
. Its explicit representation highlights the analytical tractability of the DZL model and its suitability for reliability analysis involving discrete lifetime data. To illustrate the behavior of the SSR and MRL functions derived from Equations (
29) and (
34), three-dimensional surface plots are presented. These plots depict the SSR for various values of the stress/strength parameters
(for
) and the MRL for different time values
, as shown in
Figure 3.
Figure 3a illustrates that the SSR decreases monotonically with increasing stress parameter
and increases with the strength parameter
, confirming the expected reliability behavior of the DZL model. The mean residual life surface shows a clear decreasing pattern with respect to time
and the model parameter
, indicating an aging property consistent with wear-out characteristics in discrete lifetime data.
3.6. Order Statistics
Let
be the CDF of a single DZL random variable. For the
l-th order statistic
from a sample
we have the CDF
Using the negative binomial theorem, the CDF can be represented as the following series:
Hence, from (
4), Equation (
36) becomes
The PMFs of the
l-th order statistics for the DZL model are given by
7. Real-World Applications
This part analyzes two distinct genuine datasets originating from engineering and clinical sectors, with the aim of (i) assessing the versatility, efficacy, and practical relevance of the DZL model; (ii) demonstrating the utility of the model’s inferential outcomes in supporting real-world decision-making; and (iii) benchmarking its performance against nine existing discrete distributions.
Table 6 outlines the following datasets employed in this comparative analysis:
Electronic Devices: The reliability analysis of failure time data is fundamental in industrial engineering, particularly when assessing the performance and durability of electronic components. This application examines the failure records (in hours) of 18 electronic systems presented by Wang [
25]. This dataset (say, Data-1) has been widely used to evaluate the applicability of lifetime distributions in modeling real-world reliability behavior; see Elshahhat and Abu El Azm [
26].
Leukemia Patients: Analysis of remission duration in leukemia patients following treatment is a critical aspect of evaluating therapeutic efficacy. In this application, remission times in weeks were recorded for a cohort of 20 patients suffering from leukemia who were randomly allocated to a certain therapy program. This dataset (say, Data-2), presented by Bakouch et al. [
27] and later reanalyzed by Eliwa and El-Morshedy [
28], serves as a valuable scenario for analyzing discrete lifetime data.
The proposed two datasets, Data-
i for
, provide a varied and practical basis for evaluating the proposed model. Their distinct origins underscore the model’s versatility across different fields. Importantly, both datasets present realistic features such as discrete outcomes and non-normal behavior, which introduce genuine challenges for statistical modeling and help to rigorously test the model’s performance.
Table 7 summarizes key descriptive statistics for Data-
i,
, including the minimum, maximum, quartiles (
), mean, mode, and standard deviation (SD), among others. Data-1 displays a wider range (5 to 420) and considerably greater variability (SD ≃ 131.5), along with moderate right skewness (skewness ≃ 0.3), suggesting the presence of relatively extreme values. In contrast, Data-2 spans a narrower range (1 to 49), shows lower dispersion (SD ≃ 14.7), and is moderately right-skewed (skewness ≃ 0.65). All datasets exhibit mesokurtic behavior (kurtosis
), indicating tail weight similar to that of the normal distribution.
Figure 9 shows violin plots for Data-
i for
to visualize the data distribution comprehensively.
Figure 9a exhibits a high degree of spread and skewness, suggesting the presence of extreme values or heavy-tailed behavior. In contrast,
Figure 9b shows a more symmetric shape and more centralized data around a lower value range, indicating less variability and more concentration near the median. As a result, the subplots in
Figure 9 show central clustering; the left dataset is more dispersed and irregular, whereas the right one suggests a tighter, more balanced structure around the median.
To further illustrate the versatility, robustness, and practical utility of the proposed DZL distribution, an extensive comparative study is conducted. For each dataset listed in
Table 6, the goodness-of-fit of the DZL model is rigorously examined and compared against well-established discrete distributions from the literature, as outlined in
Table 8.
To determine the most appropriate model among the proposed DZL distribution and its competitors, a thorough model comparison is conducted using a suite of fit criteria. These include the negative log-likelihood (NLL), Akaike criterion (AC), consistent AC (CAC), Bayesian criterion (BC), Hannan–Quinn criterion (HQC), and the Kolmogorov–Smirnov (KS) statistic, along with its associated
-value.
Table 9 summarizes the MLEs (along with their corresponding standard errors (Std.Es)) for all model parameters across both datasets. It also reports the computed values of each model selection criterion for the competing distributions listed in
Table 8. Notably, the results in
Table 9 indicate that the proposed DZL distribution consistently yields the lowest values across all information-based metrics, suggesting a superior fit. Furthermore, the DZL model yields the highest
-value and the smallest KS value compared to the others. According to the outcomes, the DZL distribution provides the best overall fit and is therefore the most suitable model for analyzing all given datasets.
Visualization approaches for fitting are important diagnostic tools for determining the level of convergence between empirical data and a proposed theoretical distribution.
Figure 10 shows how the DZL model outperforms other models using three important graphical assessment methods: (i) fitted PMF curves overlaid on the empirical histograms; (ii) plots of the fitted survival (reliability) functions; and (iii) probability–probability (PP) plots. These visualizations provide intuitive and informative evidence supporting the adequacy of the DZL distribution in capturing the underlying structure of the observed data.
Figure 10 provides visual confirmation of the numerical outcomes; see
Table 9, further highlighting the superior fitting capability of the proposed DZL model in comparison to a variety of existing traditional (or novel) discrete distributions.
To calculate the estimators for the DZL(
) parameter, different Type-II right-censored samples were generated from each dataset listed in
Table 6. Estimation was performed using likelihood, bootstrap (with 10,000 replications), and Bayesian (with
and
) methods. For each sample, both point estimates (with Std.Es) and 95% interval estimates (with Int.Ws) for
were computed, as summarized in
Table 10. Inspection of
Table 10 reveals that Bayesian point estimates consistently attain smaller Std.Es than the frequentist (MLE-based) estimates, indicating greater precision. Similarly, Bayesian interval estimates—constructed using BCI or HPD methods—generally produce narrower Int.Ws compared to frequentist intervals, including BP/BT and ACI[NA]/ACI[NL]. Moreover, as
r increases, both Std.Es and Int.Ws decrease, reflecting the enhanced efficiency of the estimators. A critical consideration in applying the Markov chain methodology is ensuring that the generated samples have achieved adequate convergence.
Figure 11 illustrates that the M–H algorithm attains stable convergence and that the posterior distributions of the DZL model are approximately symmetric across samples from both Data-
i (
). In addition,
Table 11 presents a detailed summary of the key posterior statistics for
, based on 30,000 MCMC iterations, providing numerical confirmation of the algorithm’s reliability. These numerical results are close to those reported in
Table 10 and visually affirmed by the patterns observed in
Figure 11, collectively supporting the reliability of the Bayesian inference framework employed.
Lastly, the analysis of both datasets (Data-i, ) confirms that the DZL distribution exhibits strong advantages of features, delivering robust findings, high flexibility, and satisfactory fit relative to both classical and contemporary discrete models. Its utility across diverse application domains, coupled with its stable inferential performance, particularly within the Bayesian setup, highlights its utility as both a practical and theoretically rigorous model for discrete data analysis. These findings emphasize the relevance of newly proposed discrete distributions in capturing complex real-world phenomena and demonstrate their potential for advancing model assessment and parameter estimation in clinical and industrial contexts.
To sum up, although the real-data applications considered in this study involve relatively small sample sizes, such settings are common in many applied fields where data collection is inherently limited. The satisfactory performance of the proposed DZL distribution under these conditions highlights its robustness and practical relevance. As demonstrated in the proposed numerical analyses, the inferential procedures remain stable even in small-sample scenarios. These findings may be viewed as a first step toward validating the adaptability of the proposed distribution to larger-scale real-world phenomena, while further applications to larger datasets are left for future investigation.
8. Conclusions
In this study, we proposed and systematically investigated the DZL distribution, a novel survival-function-based discretization of the continuous ZLindley law. The proposed model inherits the analytical tractability and flexibility of its continuous counterpart, while addressing the practical need to model count-based, integer-valued, and censored lifetime data. Through rigorous mathematical development, we derived essential distributional properties, including moments, quantiles, order statistics, stress–strength, mean residual life, and hazard-rate structures, thereby establishing a comprehensive theoretical foundation. From an inferential perspective, both maximum likelihood and Bayesian estimation frameworks were developed and supplemented with resampling- and simulation-based interval estimation procedures. Comparative Monte Carlo experiments demonstrated that the Bayesian estimators—particularly under different prior specifications—consistently outperform their frequentist counterparts, especially in the presence of Type-II censoring. Moreover, applications to two real-world datasets showed that the DZL distribution provides superior fits compared to nine competing discrete lifetime models, thereby reinforcing its empirical relevance. Beyond offering a robust alternative to classical discrete distributions, the DZL model enriches discrete reliability theory by enabling realistic modeling of increasing hazard-rate patterns, a limitation shared by many existing discrete families. Its demonstrated superiority in both simulated and applied settings suggests broad applicability in fields such as reliability engineering, actuarial science, survival analysis, and queueing systems, where discrete data frequently arise. Future research directions include extending the DZL framework to multi-parameter generalizations, regression-based formulations, and bivariate or multivariate extensions, as well as developing computationally efficient Bayesian inference methods for large-scale applications. Overall, the DZL distribution establishes itself as a versatile, analytically tractable, and practically powerful addition to modern discrete distribution theory.