Next Article in Journal
A Mixture Model for Survival Data with Both Latent and Non-Latent Cure Fractions
Previous Article in Journal
Statistical Tools Application for Literature Review: A Case on Maintenance Management Decision-Making in the Steel Industry
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Unit-Modified Weibull Distribution: Theory, Estimation, and Real-World Applications

1
Mathematics Department, Faculty of Science, Mansoura University, Mansoura 35516, Egypt
2
Department of Mathematics and Statistics, Dalhousie University, Halifax, NS B3H 4R2, Canada
3
Department of Statistics & Operation Research, College of science, King Saud University, Riyadh P.O. Box 11451, Saudi Arabia
*
Author to whom correspondence should be addressed.
Stats 2025, 8(3), 81; https://doi.org/10.3390/stats8030081
Submission received: 13 August 2025 / Revised: 4 September 2025 / Accepted: 8 September 2025 / Published: 12 September 2025

Abstract

This paper introduces the Unit-Modified Weibull (UMW) distribution, a novel probability model defined on the unit interval ( 0 ,   1 ) . We derive its key statistical properties and estimate its parameters using the maximum likelihood method. The performance of the estimators is assessed via a simulation study based on mean squared error, coverage probability, and average confidence interval length. To evaluate the practical utility of the model, we analyze three real-world data sets. Both parametric and nonparametric goodness-of-fit techniques are employed to compare the UMW distribution with several well-established competing models. In addition, nonparametric diagnostic tools such as total time on test transform plots and violin plots are used to explore the data’s behavior and assess the adequacy of the proposed model. Results indicate that the UMW distribution offers a competitive and flexible alternative for modeling bounded data.

1. Introduction

In many disciplines, statisticians regularly search for new, or at least relatively recent, statistical models to better model data sets because statistical models play an essential role in data analysis. Modeling new phenomena is particularly important in the era of big data and data science because new trends and data sets require more sophisticated analytical tools. Statistical modeling is at the heart of real-world data sciences as scientists continue to develop new distributions to match rapidly changing applications and real-world events.
Since some fundamental probability distributions still exist, however, they rarely capture the full range of complex and diverse phenomena that we observe in more modern data sets. As a result, researchers have developed many different strategies (generalization, modification, compounding, transformation) to address existing models and their limitations. However, many such adjustments provide only partial solutions. The more effective means, however, will be to design new distribution models that do not inherit any of the constraints that are present in existing models.
Among these developments, a particularly promising approach is the development and application of unit distributions, which refers to specialized models designed for data confined to the interval ( 0 ,   1 ) , offering enhanced flexibility and accuracy in modeling proportions, probabilities, percentages, and other bounded outcomes across diverse fields such as biological studies, mortality and recovery rates, health risks, economics, measurement sciences, and machine learning. Unlike traditional models, which often fail to naturally accommodate bounded data, unit distributions provide enhanced flexibility and precision in capturing their behavior. By adapting these distributions to modern data sets’ unique features, including asymmetry, heavy tails, and multimodality, researchers can improve both the accuracy and interpretability of statistical inference. As a result, unit distributions represent a critical and evolving frontier in probability modeling, expanding the toolkit for contemporary data science. Over the past few years, several researchers have proposed novel probability distributions designed for data on the unit interval. Among these are unit-Gamma distribution [1], log–Lindley distribution [2], Topp–Leone distribution [3], unit-Weibull distribution [4], log-xgamma distribution [5], unit-Gompertz distribution [6], unit log-logistic distribution [7], unit Burr-XII distribution [8], unit-modified Burr-III distribution [9], two-parameter Mirra distribution [10], unit-Chen distribution [11], and unit-exponentiated Weibull distribution [12].
Sarhan and Zaindin [13] proposed a new generalization to the Weibull distribution named Modified Weibull distribution with three parameters α , β , γ (MW ( α , β , γ ) or MW for short). This model exhibits either a decreasing or unimodal probability density function (PDF). Additionally, its hazard function can take multiple forms, including increasing (with an initial value of α ), decreasing, or constant, making it adaptable to various lifetime data scenarios. The cumulative distribution function (CDF) and PDF of the random variable X M W ( α , β , γ ) are given by
F X ( x ; α , β , γ ) = 1 e α x β x γ
and
f X ( x ; α , β , γ ) = ( α + β γ x γ 1 ) e α x β x γ ,
respectively, where α , β 0 , such that α + β > 0 , and γ > 0 .
In this paper, we present a novel three-parameter probability distribution defined on the unit interval, derived through transformation of MW distribution. The proposed UMW distribution offers exceptional flexibility in modeling bounded data, making it a valuable addition to the family of unit-interval-based probability distributions. By leveraging the structural properties of the MW distribution, the UMW distribution not only inherits its strengths but also extends its applicability to scenarios where data is constrained within the ( 0 ,   1 ) range, such as proportions, rates, and percentages.
While many unit distributions exist, a significant gap remains for a model that offers both flexibility in hazard shapes and tractability in estimation. Specifically, existing models often struggle to capture complex, non-monotonic hazard rates like bathtub or unimodal shapes, which are common in reliability and survival data, without becoming overly parameterized and computationally complex.
Our proposed Unit-Modified Weibull (UMW) distribution directly addresses this gap. It is derived from the Modified Weibull distribution, which is renowned for its versatile hazard rate shapes. By transforming it to the unit interval, the UMW distribution inherits this flexibility, capable of producing increasing, decreasing, bathtub, and unimodal hazard functions, while remaining a parsimonious three-parameter model. This combination of a rich class of hazard shapes with a relatively simple structure provides a more effective tool for modeling proportional data where failure patterns are complex, thus filling a distinct niche that many existing unit distributions do not.
This capability to generate a diverse array of hazard rate functions is one of the most compelling features of the UMW distribution. This versatility makes it particularly suitable for modeling complex real-world phenomena where traditional bounded distributions may fall short. Furthermore, the inclusion of three parameters allows for greater adaptability in capturing various data behaviors, enhancing its utility in statistical modeling and applied research.
The remainder of this paper is structured as follows. Section 2 introduces the CDF, PDF, survival function, hazard rate function, and reversed hazard rate function of the proposed UMW distribution. Section 3 presents key statistical properties of the UMW distribution. In Section 4, we discuss parameter estimation using the maximum likelihood method. Section 5 provides a simulation study to assess the performance of the estimators. Section 6 demonstrates the applicability of the UMW model by analyzing three real-world data sets and comparing its fit with that of several established distributions. Finally, Section 7 concludes the paper and outlines potential directions for future research.

2. Unit-Modified Weibull Distribution

Let X MW ( α , β , γ ) distribution, and Y = e X , which implies X = log Y , mapping the support of X from ( 0 ,   ) to the unit interval ( 0 , 1 ) for Y. The transformed variable Y will follow the UMW distribution with parameters ( α , β , γ ) . The CDF of the UMW distribution is derived as follows:
F Y ( y ; α , β , γ ) = P ( Y y ) = P ( e X y ) = P ( X log y ) = 1 F X ( log y ; α , β , γ ) = exp α ( log y ) β ( log y ) γ = exp log y α β ( log y ) γ = y α · e β ( log y ) γ ,
where 0 < y < 1 ; α , β 0 , such that α + β > 0 , and γ > 0 . Moreover, using the transformation method, the PDF of UMW distribution can be derived as follows:
f Y ( y ; α , β , γ ) = f X ( log y ; α , β , γ ) · d d y ( log y ) = 1 y f X ( log y ; α , β , γ ) = α + β γ ( log y ) γ 1 · y α 1 · e β ( log y ) γ .
Moreover, the hazard rate function (HRF) of the UMW distribution can be written as
h Y ( y ; α , β , γ ) = α + β γ ( log y ) γ 1 · y α 1 · e β ( log y ) γ 1 y α · e β ( log y ) γ
and the reversed hazard rate function (RHRF) is given by
r Y ( y ; α , β , γ ) = 1 y α + β γ ( log y ) γ 1 .
This distribution encompasses several well-established models commonly employed in survival analysis as special cases. A comprehensive overview of these sub-models, which highlights the flexibility and broad applicability of the UMW distribution, is presented in Table 1.

Limit Behavior of the PDF and HRF

The limit behavior of f Y ( y ; α , β , γ ) at y 0 can be studied as follows. As y 0 , we have log y , which implies log y . Moreover, we have ( log y ) γ 1 if γ > 1 and y α 1 if α < 1 , and 0 if α > 1 . Thus, the behavior of f Y ( y ; α , β , γ ) as y 0 + depends on the interplay between the term y α 1 , the exponential decay e β ( log y ) γ , and the polynomial growth ( log y ) γ 1 . Consequently, we have
lim y 0 + f Y ( y ; α , β , γ ) = 0 , ( α < 1 and γ > 1 ) or ( α + β > 1 a n d γ = 1 ) or ( α 1 a n d γ < 1 ) ; constant , α + β = 1 a n d γ = 1 ; , ( α + β 1 a n d γ = 1 ) or ( α < 1 a n d γ < 1 ) .
We now investigate the limiting behavior of the PDF as y 1 . Observing that log y 0 , it follows that ( log y ) γ 1 0 for γ > 1 and ( log y ) γ 1 for γ < 1 , while e β ( log y ) γ 1 . Now, we have
lim y 1 f Y ( y ; α , β , γ ) = α , γ > 1 ; α + β , γ = 1 ; , γ < 1 .
Similarly, we can derive the limiting behavior of the HRF as y 0 + in the form
lim y 0 + h Y ( y ; α , β , γ ) = 0 , γ > 1 or ( α + β > 1 a n d γ = 1 ) or ( α 1 a n d γ < 1 ) ; , ( α + β 1 a n d γ = 1 ) or ( α < 1 a n d γ < 1 ) .
Moreover, as F ( y ; α , β , γ ) = y α e β ( log y ) γ 1 when y 1 , it follows that 1 F ( y ; α , β , γ ) 0 , causing the denominator of the hazard rate to vanish while the numerator stays finite, thereby leading to the remarkable conclusion that lim y 1 h Y ( y ; α , β , γ ) = , where α , β 0 such that α + β > 0 , γ > 0 .
Figure 1 illustrates the behavior and shape of the PDF and HRF by assuming specific numerical values for the model parameters. The graphical representations highlight the flexibility and adaptability of the proposed model in capturing diverse data patterns. Based on the results presented in Figure 1, several key observations can be made:
  • The proposed PDF exhibits a wide range of shapes, including decreasing, unimodal, and increasing forms. This versatility is particularly advantageous because it allows the model to accommodate various types of real-world data. For instance, the decreasing form may be suitable for modeling phenomena with high initial probabilities that taper off over time, while the unimodal shape can represent processes with a peak probability at a certain point. The increasing form, on the other hand, is useful for scenarios where the likelihood of an event grows over time. Such flexibility ensures that the model can be applied to different statistical and practical contexts.
  • The corresponding HRF demonstrates even more diverse behaviors, including bathtub-shaped, increasing, increasing–decreasing–increasing, and upside-down bathtub followed by bathtub patterns. These variations are significant because they reflect different failure or risk trends commonly observed in reliability engineering, survival analysis, and other fields. For example, the bathtub-shaped HRF is characteristic of systems with high initial failure rates (early failures), a stable period (useful life), and an eventual increase due to wear-out (aging). Meanwhile, the increasing–decreasing–increasing pattern captures more complex scenarios where risk fluctuates non-monotonically. Such adaptability makes the model highly effective in fitting a broad spectrum of real-world data sets, enhancing its practical utility.
Overall, the proposed model’s ability to generate these varied PDF and HRF shapes underscores its robustness and wide applicability. By encompassing multiple forms, the model can more effectively represent empirical data, making it a valuable tool for statistical modeling and data analysis across various disciplines.

3. Mathematical Properties

This section delivers a rigorous and insightful exploration of pivotal theoretical advancements pertaining to the UMW distribution, elucidating its intrinsic structural and probabilistic characteristics through a meticulous analysis of quantiles, moments, the moment-generating function, skewness, and kurtosis, thereby enriching the foundational understanding of its behavior and broadening its theoretical applicability.

3.1. Quantiles

Given the CDF F Y ( y ; α , β , γ ) defined by Equation (3), the quantile function of the UMW distribution can be derived as Q Y ( q ) = F 1 ( q ) , q ( 0 ,   1 ) . Due to the analytical intractability of inverting the distribution function in closed form, the quantile values can be efficiently and accurately obtained numerically by solving the following non-linear equation.
α log ( y ) β ( log y ) γ log ( q ) = 0 ,
where 0 < y < 1 ; α , β 0 such that α + β > 0 , γ > 0 , and q ( 0 ,   1 ) . This equation can be solved easily using the bisection method.

3.2. Moments

The r t h moment, around zero, for a random variable Y U M W ( α , β , γ ) distribution is given by
μ r = k = 0 ( 1 ) k β k k ! ( r + α ) γ k + 1 α Γ ( γ k + 1 ) + β γ ( r + α ) 1 α Γ ( γ k + γ ) ,
where r = 0 , 1 , 2 , and Γ ( . ) is the gamma function. Moreover, the moment-generating function can be reported as
M Y ( t ) = s = 0 k = 0 ( 1 ) k t s β k s ! k ! ( s + α ) γ k + 1 α Γ ( γ k + 1 ) + β γ ( s + α ) 1 α Γ ( γ k + γ ) .
Using Equation (7), the mean, variance, skewness ( S k ( Y ) ), and kurtosis ( K u ( Y ) ) can be given by the respective equations listed below:
μ 1 = k = 0 ( 1 ) k β k k ! ( 1 + α ) γ k + 1 α Γ ( γ k + 1 ) + β γ ( 1 + α ) 1 α Γ ( γ k + γ ) ,
V ( Y ) = k = 0 ( 1 ) k β k k ! ( 2 + α ) γ k + 1 α Γ ( γ k + 1 ) + β γ ( 2 + α ) 1 α Γ ( γ k + γ ) μ 1 2 ,
S k ( Y ) = μ 3 3 μ 1 μ 2 + 3 μ 1 3 V ( Y ) 3 / 2
and
K u ( Y ) = μ 4 4 μ 1 μ 3 + 6 μ 1 2 μ 2 3 μ 1 4 V ( Y ) 2 .
Deriving closed-form solutions for many statistical measures, such as moments, skewness, and kurtosis, is often analytically intractable, particularly for complex or newly proposed probability distributions. When exact mathematical expressions are either too cumbersome or impossible to obtain, researchers frequently turn to numerical methods to explore the behavior of these quantities. Computational techniques provide a practical and efficient means to assess the convergence of associated series, validate theoretical results, and ensure the robustness of statistical models.
Given the challenges in obtaining explicit formulas, numerical approximation becomes indispensable. By evaluating key statistical measures across different parameter configurations, researchers can gain deeper insights into the distribution’s properties, including its central tendency, dispersion, asymmetry, and tail behavior. This approach not only supports theoretical derivations but also enhances the interpretability and applicability of the model in real-world scenarios.
Table 2, Table 3 and Table 4 provide comprehensive numerical evaluations of key statistical measures, including the mean ( E ( Y ) ), variance ( V ( Y ) ), S k ( Y ) , and K u ( Y ) for the UMW distribution across varying values of the parameters, α , β , and γ . The numerical results demonstrate that the UMW distribution exhibits remarkable shape flexibility, controlled by the parameters α , β , and γ as follows:
(i)
Increasing the α parameter consistently elevates the mean while reducing variance, accompanied by a pronounced increase in both skewness and kurtosis indicating a shift toward extreme right-skewed, leptokurtic distributions;
(ii)
An increase in the β parameter produces a nuanced effect: it elevates the mean and skewness while paradoxically reducing both variance and kurtosis, highlighting a complex interplay between central tendency and tail behavior;
(iii)
The γ parameter serves as a moderating factor: as γ increases, both the mean and skewness systematically reduce across all configurations. Meanwhile, its effects on variance and kurtosis exhibit β -dependent behavior, sometimes counteracting and sometimes amplifying the influence of the remaining parameters.
This sophisticated interplay of parameters endows the UMW distribution with exceptional adaptability, making it particularly suitable for modeling diverse real-world phenomena that exhibit varying degrees of central tendency, dispersion, and tail heaviness. The observed patterns not only validate the distribution’s theoretical properties but also provide practical guidance for parameter selection in applied statistical modeling scenarios.

4. Parameter Estimation

Parameter estimation serves as a cornerstone of statistical inference, enabling researchers to deduce unknown model parameters from observed data. This process is fundamental to predictive modeling, hypothesis testing, and uncertainty quantification, with applications spanning scientific research, machine learning, and econometrics. Consider a simple random sample Y 1 , Y 2 , , Y n drawn from the UMW ( θ ) distribution with PDF given in Equation (4), where θ = ( α , β , γ ) denotes the vector of unknown parameters. In this section, we derive estimates for θ using a maximum likelihood estimation (MLE) framework. Additionally, we derive asymptotic confidence intervals for these parameters, offering a robust measure of estimation uncertainty under large-sample conditions.

4.1. Maximum Likelihood Method

The maximum likelihood estimates (MLEs) of the parameters are obtained by maximizing the likelihood function. Let y 1 , y 2 , , y n denote the observed value of random sample of size n from the UMW distribution for the vector of parameters θ = ( α , β , γ ) . The likelihood function is given as
l ( θ ) = i = 1 n f ( y i ; θ ) ,
and the MLEs of the components of θ are the values of θ , say θ ^ , which maximize the likelihood function. That is, θ ^ satisfies
θ ^ MLE = arg max θ l ( θ ) .
Equivalently, the MLEs of the parameters are obtained by maximizing the log-likelihood function, which can be written as
L ( θ ) = ( γ 1 ) i = 1 n log y i β i = 1 n ( log y i ) γ + i = 1 n log ( α + β γ ( log y i ) γ 1 ) .
To maximize the log-likelihood function, we solve the system of non-linear likelihood equations obtained by differentiating Equation (13). The required score vector components for these equations are given below:
S 1 ( α , β , γ ) = L α = i = 1 n 1 α + β γ K ( y i ; γ ) ;
S 2 ( α , β , γ ) = L β = i = 1 n γ K ( y i ; γ ) α + β γ K ( y i ; γ ) i = 1 n ( log y i ) γ ;
S 3 ( α , β , γ ) = L γ = β i = 1 n K ( y i ; γ ) + γ K ( y i ; γ ) M ( y i ) α + β γ K ( y i ; γ ) + i = 1 n log y i β i = 1 n ( log y i ) γ M ( y i ) .
where K ( y i ; γ ) = ( log y i ) γ 1 and M ( y i ) = log ( log y i ) . The information matrix is a three-by-three symmetric matrix given by
I ( α , β , γ ) = I 11 ( α , β , γ ) I 12 ( α , β , γ ) I 13 ( α , β , γ ) I 12 ( α , β , γ ) I 22 ( α , β , γ ) I 23 ( α , β , γ ) I 13 ( α , β , γ ) I 23 ( α , β , γ ) I 33 ( α , β , γ )
where
I 11 = S 1 α = i = 1 n 1 α + β γ K ( y i ; γ ) 2 ; I 12 = S 1 β = i = 1 n γ K ( y i ; γ ) α + β γ K ( y i ; γ ) 2 ; I 13 = S 1 γ = i = 1 n β K ( y i ; γ ) + β γ K ( y i ; γ ) M ( y i ) α + β γ K ( y i ; γ ) 2 ; I 22 = S 2 β = i = 1 n γ 2 K 2 ( y i ; γ ) α + β γ K ( y i ; γ ) 2 ;
I 23 = S 2 γ = i = 1 n { K ( y i ; γ ) + γ K ( y i ; γ ) M ( y i ) α + β γ K ( y i ; γ ) β γ K ( y i ; γ ) + γ K ( y i ; γ ) M ( y i ) K ( y i ; γ ) α + β γ K ( y i ; γ ) 2 } + i = 1 n ( log y i ) γ M ( y i ) ;
I 33 = S 3 γ = i = 1 n β γ M 2 ( y i ) K ( y i ; γ ) + 2 β M ( y i ) α + β γ K ( y i ; γ ) + β K ( y i ; γ ) + β γ M ( y i ) K ( y i ; γ ) × β K ( y i ; γ ) α + β γ K ( y i ; γ ) 2 + β γ M ( y i ) K ( y i ; γ ) α + β γ K ( y i ; γ ) 2 + β i = 1 n M 2 ( y i ) ( log ( y ( i ) ) ) γ .
To derive the MLEs of the model parameters α , β , and γ , we solve the following system of equations:
S 1 ( α , β , γ ) = 0 , S 2 ( α , β , γ ) = 0 and S 3 ( α , β , γ ) = 0 .
The MLEs of the model parameters are the solution of the likelihood equations such that the information matrix is positive definite. Using advanced numerical methods in R, we can efficiently solve these equations to high precision, uncovering their roots with computational rigor.

4.2. Asymptotic Confidence Intervals

When MLEs of model parameters lack closed-form solutions, their exact sampling distributions become analytically intractable, making precise confidence intervals unavailable. In such cases, we must resort to the asymptotic properties of MLEs to construct approximate confidence intervals.
The MLE θ ^ = ( α ^ , β ^ , γ ^ ) of the parameter vector θ = ( α , β , γ ) follows an asymptotic normal distribution:
θ ^ approx N θ , I 1 ( θ ) ,
where I ( θ ) is the Fisher information matrix. The ( 1 ϑ ) 100 % confidence interval for each parameter θ j is given by
θ ^ j ± z ϑ / 2 · SE ( θ ^ j ) , j = 1 , 2 , 3 ,
where
  • SE ( θ ^ j ) = Var ( θ ^ j ) is the standard error of θ ^ j , ;
  • Var ( θ ^ j ) is the j-th diagonal element of the inverse Fisher information matrix I 1 ( θ ^ ) ;
  • z ϑ / 2 is the critical value from the standard normal distribution such that P ( Z > z ϑ / 2 ) = ϑ / 2 .

5. Simulation Studies

A comprehensive simulation study is conducted using R statistical software to evaluate the performance of the MLEs for the parameters of the UMW distribution. The study is designed to assess the consistency, efficiency, and asymptotic properties of the estimators under varying sample sizes and parameter configurations. The simulation framework employed a systematic approach to ensure robustness, incorporating 10,000 iterations per sample size to mitigate stochastic variability and provide reliable estimates of the statistical properties of the estimators.
The investigation spanned nine sample sizes (ranging from small to large), namely n = 20 , 50 , 80 , 100 , 150 , 200 , 300 , 400 , 500 , enabling a thorough examination of the estimators’ behavior across different data availability scenarios. Four distinct parameter configurations are selected to test the estimators under diverse distributional shapes and scales:
  • α = 0.5 , β = 0.5 , γ = 0.8
  • α = 0.8 , β = 1 , γ = 1.5
  • α = 1.2 , β = 1.5 , γ = 3
  • α = 1.2 , β = 0.5 , γ = 0.2
These configurations are chosen to represent a spectrum of plausible real-world scenarios, ensuring that the evaluation is not limited to a single parametric regime. To ensure the robustness and reliability of the numerical optimization, a sensitivity analysis was performed. For each configuration and sample size, the estimation algorithm was initiated from multiple starting points within a reasonable neighborhood of the true parameters. The results demonstrated a high degree of convergence to the same solution, indicating that the MLE for the UMW distribution is robust to the choice of initial values across the tested scenarios. The iterative algorithms consistently converged without computational issues, confirming the numerical stability of the estimation procedure. This robustness provides confidence in the reliability of the presented average of MLE (AMLE), mean square error (MSE), average interval length (AIL), and coverage probability (CP) results. The simulation’s performance is rigorously scrutinized through a selection of pivotal metrics, AMLE, MSE, AIL, and CP, each illuminating a distinct facet of estimator quality. The MSE stands as the bedrock of predictive accuracy, quantifying average squared deviation between the estimated and true parameter values, serving as a composite indicator of bias and variance. A declining MSE with increasing sample size signals consistency and efficiency of the MLE. Complementing this, the AIL reflects the precision of the confidence intervals for the model parameters. Narrower intervals indicate higher precision while maintaining valid coverage. Yet, precision alone is insufficient without validity; this is where CP ascends as the definitive arbiter of interval trustworthiness. By assessing the empirical frequency with which these intervals encompass the true parameter, a CP aligning with the nominal level underscores the methodological rigor and reliability of the inferential framework. Together, these metrics forge a comprehensive evaluation paradigm, balancing accuracy, precision, and statistical fidelity. The simulation outcomes are meticulously documented in Table 5 and Table 6, with the corresponding MSE and AIL trends visually synthesized in Figure 2, Figure 3, Figure 4 and Figure 5. A rigorous analysis of these tabulated results and graphical representations yields the following key insights:
  • The MSE exhibited a monotonic decrease as the sample size increased across all parameter configurations. This trend robustly supports the asymptotic unbiasedness and consistency of the MLE method, as estimators converge toward their true values with larger data sets.
  • The AIL contracted systematically with larger sample sizes, indicating sharper and more informative confidence intervals.
  • The CP consistently approached the nominal 95 % level across all sample sizes, particularly for n > 100 . This alignment confirms that the MLE-based confidence intervals are statistically valid, even for smaller samples in most cases. The coverage probability for the parameter β is notably low (88.2%) when n = 20 in the fourth configuration ( α = 1.2 , β = 0.5 , γ = 0.2 ). This occurs because a small shape parameter ( γ = 0.2 ) causes more extreme distributional behavior, making estimation difficult with very limited data. The standard normal approximation used for confidence intervals becomes less reliable here, leading to under-coverage. Fortunately, this is only a small-sample issue, the coverage quickly improves to 92.9% at n = 50 and reaches acceptable levels (≥93.2%) for n 150 . This suggests that although MLE is generally reliable, users should be cautious when interpreting results from very small samples, especially in skewed models. Future studies might improve small-sample inference using bias-corrected or profile likelihood methods.
The simulation outcomes underscore the theoretical robustness of MLE for the UMW distribution, with empirical results aligning closely with asymptotic expectations. The observed MSE decay and AIL reduction validate the efficiency of the estimators, while the CP adherence reinforces their reliability for practical inference.

6. Real Data Analysis

This section evaluates the flexibility and performance of the UMW model using three real-world data sets, comparing it against seven established models: UW, ULFR, UR, UE, Beta, Kumaraswamy (Kum), and Topp–Leone (TL).
The assessment employs multiple goodness-of-fit criteria: the negative log-likelihood (-L), Akaike information criterion (AIC), consistent AIC (CAIC), Hannan–Quinn information criterion (HQIC), Anderson–Darling ( A * ), and Cramér–von Mises ( W * ). The Kolmogorov–Smirnov (K–S) test and its p-value further evaluate the fit between empirical and theoretical distributions. The results confirm the UMW model’s strong performance, supporting its suitability for modeling unit interval data.
The first data set, from [16], records kidney dialysis patients’ infection times (in months). The second, from [17], captures the failure times of an airplane’s air conditioning system (in hours). The third, from [18], comprises 22 observations of P3 algorithm computing times.
Table 7 summarizes the data sets’ descriptive statistics. Kidney dialysis times show the lowest skewness and variability, while air conditioning data exhibits extreme right-skewness and outliers. The P3 algorithm times display high variance but moderate skewness, reflecting distinct analytical requirements. Nonparametric visualizations including total time on test (TTT) plot and violin plot are presented in Figure 6 and Figure 7 to illustrate the data distributions.
The TTT plots in Figure 6 provide insights into the hazard rate behavior of the three data sets:
  • Kidney Dialysis Data: The TTT plot initially displays a concave shape, gradually transitioning to a linear trend. This suggests a decreasing hazard rate in the early phase, followed by a roughly constant hazard rate.
  • Air Conditioning Data: The plot is approximately linear throughout, indicating a constant hazard rate, which is typical of an exponential distribution.
  • P3 Algorithm Data: This plot shows a convex shape initially, followed by a concave trend. Such a pattern is characteristic of a bathtub-shaped hazard rate, implying an increasing failure rate at the beginning and a decreasing rate thereafter. This indicates that the UMW with γ > 1 may be appropriate.
The violin plots in Figure 7 provide insight into the distributional characteristics of three real-world data sets:
  • Kidney Dialysis Data: The distribution is approximately symmetric with a moderate spread, centered around the median. The density appears fairly uniform, indicating that the data is evenly distributed without notable skewness or extreme outliers.
  • Air Conditioning Data: This data set exhibits strong right-skewness. The density is concentrated near lower values, with a long tail stretching toward higher values. A few visible outliers highlight the presence of extreme observations.
  • P3 Algorithm Data: The data is also right-skewed, with most values clustered at the lower end. Unlike the air conditioning data, it lacks prominent outliers and displays a smoother decline in density toward the upper range.
Table 8, Table 9, Table 10, Table 11, Table 12 and Table 13 present the MLEs with their corresponding standard errors (SEs) along with the goodness-of-fit criteria for the competing models applied to the three data sets. The results demonstrate that the UMW distribution consistently outperforms its counterparts across key statistical measures, including the log-likelihood, AIC, BIC, CAIC, HQIC, and other nonparametric criteria. These comparative assessments highlight the UMW distribution’s superior flexibility and accuracy in modeling all three real data sets. Notably, its ability to provide a tighter fit compared to other distributions suggests that it is a more suitable choice for capturing the underlying patterns in the data sets. Consequently, the UMW model emerges as the most robust candidate among the evaluated models, reinforcing its practical utility in statistical applications.
Moreover, graphical analyses provide comprehensive validation of our results, as summarized below:
(1)
Parameter Estimation Analysis: Figure 8, Figure 9 and Figure 10 present
  • Profile likelihood function for each individual of the model parameters, α , β , and γ .
  • Well-defined peaks in the likelihood curves, confirming the uniqueness of the MLEs.
(2)
P-P Plots for Model Fit Assessment: Figure 11, Figure 12 and Figure 13 illustrate
  • P-P plots for the UMW distribution and the competing distributions.
  • Close adherence of the P-P plots, for UMW, to the 45-degree line, indicating excellent agreement between empirical and theoretical probabilities. This further supports the UMW distribution’s superior fit over competing models.
(3)
Q-Q Diagnostic Plots: Figure 14, Figure 15 and Figure 16 validate the fit through
  • Q-Q plots comparing empirical and theoretical quantiles.
  • Near-perfect alignment of data points with the reference line, reinforcing the UMW distribution’s robustness in capturing the empirical data structure.
(4)
Goodness-of-Fit Visualizations: Figure 17, Figure 18 and Figure 19 and Figure 20, Figure 21 and Figure 22 depict
  • Histograms of the data sets overlaid with the fitted UMW density function.
  • Empirical versus estimated CDFs, highlighting the UMW model’s accuracy in replicating the observed data trends.
Collectively, the graphical diagnostics, including parameter estimation plots, P–P and Q–Q plots, and distributional visualizations, consistently support the suitability of the UMW distribution for modeling all three real data sets.
For further comparisons between the UMW model and its sub-models, as well as the ULFR model and its sub-models, we utilize the likelihood ratio test statistic, defined as
Λ = 2 L Full L Sub χ d f 2
where L Full and L Sub are the log-likelihood functions evaluated at the maximum likelihood estimates (MLEs) of the parameters for the full and sub-model, respectively, and df is the difference in the number of parameters between the two models. Table 14 presents the values of the likelihood ratio test statistic and corresponding p-values across all tested hypotheses, based on the three data sets. From the results, we observe the following:
  • Data I: Although the UMW model provides a better fit compared to the UW, ULFR, and UR sub-models, none of these sub-models can be rejected at conventional significance levels. Notably, the UE model is significantly rejected in favor of both UMW and ULFR.
  • Data II: All sub-models are significantly rejected compared to UMW, except for ULFR and UR, which do not show significant differences. Additionally, UE is significantly rejected in favor of ULFR.
  • Data III: All sub-models are significantly rejected in favor of UMW at significance levels corresponding to p-values less than 0.07, and UR and UE are also significantly rejected in favor of ULFR.
Finally, to assess the adequacy of the proposed UMW distribution in modeling the three lifetime data sets, we employ Cox–Snell residuals [19], a standard diagnostic tool in lifetime analysis. For a correctly specified model, these residuals are expected to follow a standard exponential distribution, i.e., Exp ( 1 ) . The Cox–Snell residual for the i-th observation is defined as
r i = log S ^ ( y i ; θ ^ ) ,
where S ^ ( y i ; θ ^ ) is the fitted survival function evaluated at the observed data point y i using the MLE θ ^ of the parameter vector.
The validation is performed graphically using Q-Q and P-P plots of the residuals against the theoretical Exp ( 1 ) distribution. We provide those plots for the three real data sets, as shown in Figure 23.
The collective evidence from both diagnostic plots indicates that the Cox–Snell residuals for the UMW model conform well to the Exp ( 1 ) distribution for the three data sets. While no model is perfect, the minor deviations in the extreme upper tail of the Q-Q plot, for data I, are not substantial enough to invalidate the model. Therefore, we conclude that the UMW distribution provides an adequate and reasonable fit to the three data sets.

7. Concluding Remarks

This paper presents the Unit-Modified Weibull (UMW) distribution, a new three-parameter model defined on the unit interval ( 0 ,   1 ) for analyzing proportional and fractional data. Developed by transforming the Modified Weibull distribution, the UMW effectively captures diverse hazard rate shapes, including increasing, decreasing, bathtub, and unimodal forms, making it especially useful for modeling bounded data in fields like engineering, medicine, and economics.
We derived the key statistical properties of the distribution and estimated its parameters using the maximum likelihood method. Simulation studies confirmed that the estimators are consistent and reliable across various sample sizes. The model’s practical utility was illustrated through three real-data applications, where it demonstrated better fit than many established unit distributions.
The UMW distribution opens up several avenues for future work, including Bayesian approaches, regression modeling, and more complex reliability applications. Its flexibility and performance suggest it will be a valuable tool for researchers and practitioners working with unit-interval data.

Author Contributions

Conceptualization, A.M.S.; Methodology, M.E.S. and T.M.; Software, A.M.S.; Validation, A.M.S.; Formal analysis, T.M.; Investigation, M.E.S.; Resources, T.M.; Data curation, M.E.S. and T.M.; Writing—original draft, M.E.S.; Writing—review & editing, A.M.S.; Visualization, A.M.S. and T.M.; Supervision, A.M.S.; Project administration, T.M. All authors have read and agreed to the published version of the manuscript.

Funding

This paper is funded by Ongoing Research Funding program, (ORF-2025-1454), King Saud University, Riyadh, Saudi Arabia.

Acknowledgment

The authors gratefully acknowledge the support of King Saud University through the Ongoing Research Funding Program (ORF-2025-1454), Riyadh, Saudi Arabia.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Consul, P.C.; Jain, G.C. On the log-gamma distribution and its properties. Stat. Hefte 1971, 12, 100–106. [Google Scholar] [CrossRef]
  2. Gómez-Déniz, E.; Sordo, M.A.; Calderín-Ojeda, E. The Log–Lindley distribution as an alternative to the beta regression model with applications in insurance. Insur. Math. Econ. 2014, 54, 49–57. [Google Scholar] [CrossRef]
  3. Condino, F.; Domma, F. A new distribution function with bounded support: The reflected generalized Topp-Leone power series distribution. Metron 2017, 75, 51–68. [Google Scholar] [CrossRef]
  4. Mazucheli, J.; Menezes, A.F.B.; Ghitany, M.E. The unit-Weibull distribution and associated inference. J. Appl. Probab. Stat. 2018, 13, 1–22. [Google Scholar]
  5. Altun, E.; Hamedani, G.G. The log-xgamma distribution with inference and application. J. Société Française Stat. 2018, 159, 40–55. [Google Scholar]
  6. Mazucheli, J.; Menezes, A.F.; Dey, S. Unit-Gompertz distribution with applications. Statistica 2019, 79, 25–43. [Google Scholar]
  7. Ribeiro-Reis, L.D. Unit log-logistic distribution and unit log-logistic regression model. J. Indian Soc. Probab. Stat. 2021, 22, 375–388. [Google Scholar] [CrossRef]
  8. Korkmaz, M.Ç.; Chesneau, C. On the unit Burr-XII distribution with the quantile regression modeling and applications. Comput. Appl. Math. 2021, 40, 29. [Google Scholar] [CrossRef]
  9. Haq, M.A.U.; Hashmi, S.; Aidi, K.; Ramos, P.L.; Louzada, F. Unit modified Burr-III distribution: Estimation, characterizations and validation test. Ann. Data Sci. 2023, 10, 415–440. [Google Scholar] [CrossRef]
  10. Al-Omari, A.I.; Alanzi, A.R.; Alshqaq, S.S. The unit two parameters Mirra distribution: Reliability analysis, properties, estimation and applications. Alex. Eng. J. 2024, 92, 238–253. [Google Scholar] [CrossRef]
  11. Sarhan, A.M. Unit-Chen distribution and its quantile regression model with applications. Sci. Afr. 2025, 27, e02555. [Google Scholar] [CrossRef]
  12. Sarhan, A.M.; Sobh, M.E. Unit exponentiated weibull model with applications. Sci. Afr. 2025, 27, e02606. [Google Scholar] [CrossRef]
  13. Sarhan, A.M.; Zaindin, M. Modified Weibull distribution. Appl. Sci. 2009, 11, 123–136. [Google Scholar]
  14. Bakouch, H.S.; Hussain, T.; Tošić, M.; Stojanović, V.S.; Qarmalah, N. Unit exponential probability distribution: Characterization and applications in environmental and engineering data modeling. Mathematics 2023, 11, 4207. [Google Scholar] [CrossRef]
  15. Bantan, R.A.; Chesneau, C.; Jamal, F.; Elgarhy, M.; Tahir, M.H.; Ali, A.; Anam, S. Some new facts about the unit-Rayleigh distribution with applications. Mathematics 2020, 8, 1954. [Google Scholar] [CrossRef]
  16. Alotaibi, N.; Hashem, A.F.; Elbatal, I.; Alyami, S.A.; Al-Moisheer, A.S.; Elgarhy, M. Inference for a Kavya–Manoharan inverse length biased exponential distribution under progressive-stress model based on progressive type-II censoring. Entropy 2022, 24, 1033. [Google Scholar] [CrossRef] [PubMed]
  17. Linhart, H.; Zucchini, W. Model Selection; Wiley: New York, NY, USA, 1986. [Google Scholar]
  18. Caramanis, M.; Stremel, J.; Fleck, W.; Daniel, S. Probabilistic production costing: An investigation of alternative algorithms. Int. J. Electr. Power Energy Syst. 1983, 5, 75–86. [Google Scholar] [CrossRef]
  19. Cox, D.R.; Snell, E.J. A general definition of residuals. J. R. Stat. Soc. Ser. (Methodol.) 1968, 30, 248–275. [Google Scholar] [CrossRef]
Figure 1. The PDF and HRF of UMW distribution for different values of α , β , γ .
Figure 1. The PDF and HRF of UMW distribution for different values of α , β , γ .
Stats 08 00081 g001
Figure 2. The MSE of the MLE obtained via simulation, with true parameter values α = 0.5 , β = 0.5 , and γ = 0.8 .
Figure 2. The MSE of the MLE obtained via simulation, with true parameter values α = 0.5 , β = 0.5 , and γ = 0.8 .
Stats 08 00081 g002
Figure 3. The MSE of the MLE obtained via simulation, with true parameter values α = 0.8 , β = 1 , and γ = 1.5 .
Figure 3. The MSE of the MLE obtained via simulation, with true parameter values α = 0.8 , β = 1 , and γ = 1.5 .
Stats 08 00081 g003
Figure 4. The MSE of the MLE obtained via simulation, with true parameter values α = 1.2 , β = 1.5 , and γ = 3 .
Figure 4. The MSE of the MLE obtained via simulation, with true parameter values α = 1.2 , β = 1.5 , and γ = 3 .
Stats 08 00081 g004
Figure 5. The MSE of the MLE obtained via simulation, with true parameter values α = 1.2 , β = 0.5 , and γ = 0.2 .
Figure 5. The MSE of the MLE obtained via simulation, with true parameter values α = 1.2 , β = 0.5 , and γ = 0.2 .
Stats 08 00081 g005
Figure 6. TTT plot for the three data sets.
Figure 6. TTT plot for the three data sets.
Stats 08 00081 g006
Figure 7. The violin plots for the three data sets.
Figure 7. The violin plots for the three data sets.
Stats 08 00081 g007
Figure 8. Profile likelihood functions vs. the UMW’s parameters for the kidney dialysis data.
Figure 8. Profile likelihood functions vs. the UMW’s parameters for the kidney dialysis data.
Stats 08 00081 g008
Figure 9. Profile likelihood functions vs. the UMW’s parameters for air conditioning data.
Figure 9. Profile likelihood functions vs. the UMW’s parameters for air conditioning data.
Stats 08 00081 g009
Figure 10. Profile likelihood functions vs. the UMW’s parameters for P3 algorithm data.
Figure 10. Profile likelihood functions vs. the UMW’s parameters for P3 algorithm data.
Stats 08 00081 g010
Figure 11. The P-P plots for the kidney dialysis data using the eight competitive models.
Figure 11. The P-P plots for the kidney dialysis data using the eight competitive models.
Stats 08 00081 g011
Figure 12. The P-P plots for the air conditioning data using all used models.
Figure 12. The P-P plots for the air conditioning data using all used models.
Stats 08 00081 g012
Figure 13. The P-P plots for the P3 algorithm data using all models.
Figure 13. The P-P plots for the P3 algorithm data using all models.
Stats 08 00081 g013
Figure 14. The Q-Q plots for the kidney dialysis data using all models.
Figure 14. The Q-Q plots for the kidney dialysis data using all models.
Stats 08 00081 g014
Figure 15. The Q-Q plots for the air conditioning data using all models.
Figure 15. The Q-Q plots for the air conditioning data using all models.
Stats 08 00081 g015
Figure 16. The Q-Q plots for the P3 algorithm data using all models.
Figure 16. The Q-Q plots for the P3 algorithm data using all models.
Stats 08 00081 g016
Figure 17. Histogram of the kidney dialysis data with the fitted PDFs of all compared models.
Figure 17. Histogram of the kidney dialysis data with the fitted PDFs of all compared models.
Stats 08 00081 g017
Figure 18. Histogram of the air conditioning data with the fitted PDFs of all compared models.
Figure 18. Histogram of the air conditioning data with the fitted PDFs of all compared models.
Stats 08 00081 g018
Figure 19. Histogram of the P3 algorithm data with the fitted PDFs of all compared models.
Figure 19. Histogram of the P3 algorithm data with the fitted PDFs of all compared models.
Stats 08 00081 g019
Figure 20. The empirical and fitted CDF for the kidney dialysis data using the competitive models.
Figure 20. The empirical and fitted CDF for the kidney dialysis data using the competitive models.
Stats 08 00081 g020
Figure 21. The empirical and fitted CDF for the air conditioning data using the competitive models.
Figure 21. The empirical and fitted CDF for the air conditioning data using the competitive models.
Stats 08 00081 g021
Figure 22. The empirical and fitted CDF for the P3 algorithm data using the competitive models.
Figure 22. The empirical and fitted CDF for the P3 algorithm data using the competitive models.
Stats 08 00081 g022
Figure 23. (a) Q-Q plots of Cox–Snell residuals against the theoretical Exp(1) distribution for the three data sets. (b) P-P plots of the empirical CDF of the residuals against the theoretical Exp(1) CDF.
Figure 23. (a) Q-Q plots of Cox–Snell residuals against the theoretical Exp(1) distribution for the three data sets. (b) P-P plots of the empirical CDF of the residuals against the theoretical Exp(1) CDF.
Stats 08 00081 g023
Table 1. The sub-models of the UMW distribution.
Table 1. The sub-models of the UMW distribution.
Model α β γ Reference
Unit exponential (UE)-0-[14]
Unit exponential (UE)--1[14]
Unit Rayleigh (UR)0-2[15]
Unit Weibull (UW)0--[4]
Unit linear failure rate (ULFR)--2New model
Uniform10--
Table 2. Descriptive statistics of the UMW distribution at β = 0.1 .
Table 2. Descriptive statistics of the UMW distribution at β = 0.1 .
Measure γ α 0.10.51.02.03.04.07.0
E ( Y ) 0.50.16380.37940.53010.68320.76080.80770.8789
1.00.16670.37500.52380.67740.75610.80390.8765
3.00.23580.39510.52540.67290.75220.80090.8752
5.00.28750.41890.53440.67390.75190.80060.8750
7.00.31510.43370.54110.67520.75210.80060.8750
Var ( Y ) 0.50.07680.09930.08530.05480.03670.02600.0119
1.00.06310.09010.08050.05330.03620.02580.0119
3.00.03530.06790.06990.05150.03610.02610.0121
5.00.02290.05490.06220.04970.03580.02610.0121
7.00.01790.04840.05760.04830.03550.02610.0121
S k ( Y ) 0.51.95732.21145.881424.226461.7552124.4307524.0429
1.01.95022.41106.215124.849961.9803124.0099517.9699
3.03.61864.12537.921625.607461.2560120.8403503.4262
5.08.95226.586910.057427.164961.8812120.4489500.8145
7.015.70478.690611.776428.598362.8389120.8426500.2055
K u ( Y ) 0.54.81071.88791.79782.48193.19593.80275.0836
1.04.71431.94691.83162.47203.16003.75095.0119
3.05.68452.30381.83652.29672.97553.59164.9244
5.08.32342.62911.79542.11572.82563.49914.9221
7.010.86842.85761.78382.00022.71643.42714.9197
Table 3. Descriptive statistics of the UMW distribution at β = 1 .
Table 3. Descriptive statistics of the UMW distribution at β = 1 .
Measure γ α 0.10.51.02.03.04.07.0
E ( Y ) 0.50.57320.65460.71910.79330.83530.86270.9074
1.00.52380.60000.66670.75000.80000.83330.8889
3.00.45320.52920.60250.70220.76490.80720.8763
5.00.43120.51030.58730.69280.75920.80370.8754
7.00.42080.50200.58110.68940.75740.80260.8752
Var ( Y ) 0.50.12950.09590.06950.04140.02750.01970.0093
1.00.08050.06860.05560.03750.02670.01980.0099
3.00.02440.03690.04130.03690.02920.02270.0115
5.00.01490.03260.04060.03850.03090.02390.0119
7.00.01190.03180.04110.03950.03170.02450.0120
S k ( Y ) 0.53.70798.812919.434958.1719126.4021231.2504836.4035
1.06.215111.689822.061757.2337116.5257205.8796714.1188
3.025.225821.438426.195248.564788.9606152.5219542.5166
5.045.717023.562325.158343.787780.0642139.0163515.6736
7.059.298123.549724.096641.532076.5179134.1807508.5252
K u ( Y ) 0.51.58532.07532.67093.67374.43955.03656.2121
1.01.83162.05052.40003.09533.69644.20005.2841
3.03.44512.51112.02722.11802.59553.14404.5663
5.06.82183.11081.99221.87382.37292.99924.6226
7.010.49353.40001.96901.78782.31592.98314.6866
Table 4. Descriptive statistics of the UMW distribution at β = 5 .
Table 4. Descriptive statistics of the UMW distribution at β = 5 .
Measure γ α 0.10.51.02.03.04.07.0
E ( Y ) 0.50.93520.93890.94280.94900.95370.95750.9654
1.00.83610.84620.85710.87500.88890.90000.9231
3.00.61390.65030.68880.74790.79070.82230.8803
5.00.53440.58570.63860.71710.77120.80980.8764
7.00.49610.55590.61680.70530.76460.80600.8756
Var ( Y ) 0.50.01260.01080.00910.00680.00540.00440.0027
1.00.01930.01740.01530.01220.00990.00820.0051
3.00.01450.01890.02160.02180.01950.01670.0099
5.00.01020.02030.02650.02810.02480.02060.0113
7.00.00870.02220.03010.03180.02740.02230.0117
S k ( Y ) 0.5577.0437735.1459960.90911511.59062203.50793047.15356589.5472
1.0216.6983263.6965331.2708498.6863714.1188983.56352175.9447
3.0133.6461106.0717103.3127129.5327180.9329257.3317683.0323
5.0149.843470.343960.992378.1086117.3836179.4241557.9673
7.0152.711753.134845.549961.737798.1447156.8770528.2465
K u ( Y ) 0.515.273415.230915.154914.968414.771414.579914.0764
1.04.24564.42024.62224.97925.28415.54706.1542
3.02.89092.61792.25522.10372.31082.65773.8681
5.05.84253.47882.25561.80562.06062.52734.0928
7.09.99573.86222.19831.70972.03972.59074.3129
Table 5. Simulation results for the UMW parameters.
Table 5. Simulation results for the UMW parameters.
nParameter α = 0.5 , β = 0.5 , γ = 0.8 α = 0.8 , β = 1 , γ = 1.5
AMLEMSEAILCPAMLEMSEAILCP
α 0.52650.09450.461595.1%0.85150.15420.746495.4%
20 β 0.50530.12270.581292.5%1.05550.21800.973293.9%
γ 0.85730.12970.591194.8%1.61160.24991.114494.2%
α 0.50930.05510.282395.7%0.80980.08950.448995.5%
50 β 0.50160.07430.367494.2 %1.01280.12260.591494.4%
γ 0.81820.07620.355594.3%1.54640.13790.672395.8%
α 0.50790.04460.222696.4%1.54640.13790.672395.8%
80 β 0.49990.05910.290094.7%1.00980.09680.465994.4%
γ 0.81350.05850.278995.1%1.52390.11080.522494.1%
α 0.50530.04150.198194.8%0.80560.06290.315895.7%
100 β 0.49860.05160.259195.2%1.00290.08750.414493.9%
γ 0.81350.05160.249295.7%1.51950.09490.465496.4%
α 0.50390.03340.161394.9%0.80450.05220.257594.7%
150 β 0.50080.04390.212295.5%1.00340.07080.338295.1%
γ 0.81040.04250.202595.1%1.51310.07600.378495.4 %
α 0.50480.02900.139995.2%0.80820.04660.224094.3%
200 β 0.50140.03800.183994.3%1.00050.05930.292294.9%
γ 0.80360.03510.173895.4%1.50690.06860.326195.0%
α 0.50040.02330.113294.5%0.80160.03700.181495.1%
300 β 0.50140.03130.150295.1%1.00120.04890.238694.9%
γ 0.80340.02940.141896.1%1.50240.05480.265494.3%
α 0.50100.01970.098294.4%0.80340.03250.157595.1%
400 β 0.50080.02650.129994.2%0.99960.04040.206495.2%
γ 0.80140.02490.122595.6%1.50310.04690.229893.8%
α 0.50140.01780.087994.6%0.80190.02830.140695.2%
500 β 0.50020.02390.116294.7%1.00300.03820.185194.9%
γ 0.80260.02450.109894.4%1.50420.04180.205794.8%
Table 6. Simulation results for the UMW parameters.
Table 6. Simulation results for the UMW parameters.
nParameter α = 1.2 , β = 1.5 , γ = 3 α = 1.2 , β = 0.5 , γ = 0.2
AMLEMSEAILCPAMLEMSEAILCP
α 1.25540.22221.100495.2%1.25150.21541.096995.8%
20 β 1.58410.29691.417594.7%0.19150.068170.311988.2%
γ 3.21730.47772.226594.6%0.54680.08820.378394.7%
α 1.23140.14280.682694.3%1.21660.13790.674494.7%
50 β 1.55380.17890.867295.8%0.19690.04140.203992.9%
γ 3.07210.27081.332794.4%0.51560.04660.223895.0%
α 1.21740.10910.533595.8%1.21400.10390.532195.7%
80 β 1.51880.13650.668395.2%0.19970.03300.163394.2%
γ 3.04660.20691.043895.3%0.50830.03620.174394.5%
α 1.21270.09650.475496.3%1.21290.09720.475495.3%
100 β 1.51220.12370.594794.4%0.19910.03120.145893.5%
γ 3.04250.19110.932295.4%0.50740.03430.155494.0%
α 1.20760.07840.386594.7%1.21170.08130.387894.8%
150 β 1.51440.09830.485796.3%0.19940.02510.119493.2%
γ 3.02770.15560.757596.2%0.50540.02670.126494.5%
α 1.20090.06680.332995.0%1.20550.06800.334195.6%
200 β 1.50930.08280.418995.8%0.19980.02090.103693.9%
γ 3.02390.13780.654195.4%0.50380.02270.108993.9%
α 1.20690.05490.273194.8%1.20450.05620.272695.3%
300 β 1.50670.06680.341496.4%0.20000.01750.084794.3%
γ 3.01590.10710.532794.7 %0.50110.01840.088594.3%
α 1.20280.04810.235795.4%1.20080.04810.235394.7%
400 β 1.50260.06130.294893.4 %0.20080.01480.073694.2%
γ 3.00990.09920.460195.0 %0.50090.01560.076694.1%
α 1.20180.04210.210795.2%1.20130.04390.210694.3%
500 β 1.50260.05560.263695.4%0.20020.01370.065794.7%
γ 3.00950.08290.411595.2%0.50110.01430.068594.9%
Table 7. Descriptive statistics of the three real data sets.
Table 7. Descriptive statistics of the three real data sets.
DataMin.MeanQ1MedianQ3VarSkewnessKurtosisMax.
Kidney dialysis data0.08330.37740.20830.30000.48330.06110.76512.42190.9167
Air conditioning data0.00380.22490.04720.08300.31320.07361.69364.96670.9849
P3 algorithm data0.01000.30390.04700.11800.54350.10100.71141.88380.8740
Table 8. The MLE estimates and corresponding SE for the kidney dialysis data.
Table 8. The MLE estimates and corresponding SE for the kidney dialysis data.
Model α ^ β ^ γ ^ SE ( α ^ ) SE ( β ^ ) SE ( γ ^ )
UMW0.43710.08973.61390.19600.12431.4790
UW0.61251.6991-0.14240.2669-
ULFR0.20640.3931-0.18150.1334-
UR0.5222--0.0987--
UE0.7623--0.1441--
Beta1.35682.1058-0.33320.5497-
kum1.26512.0797-0.25440.5714-
TL1.3779--0.2604--
Table 9. The MLE estimates and corresponding SE for the air conditioning data.
Table 9. The MLE estimates and corresponding SE for the air conditioning data.
Model α ^ β ^ γ ^ SE ( α ^ ) SE ( β ^ ) SE ( γ ^ )
UMW0.20710.02502.94260.09990.03480.8629
UW0.27871.4561-0.08580.2262-
ULFR0.14850.1004-0.08860.0335-
UR0.1498--0.0274--
UE0.3351--0.0612--
Beta0.51421.3430-0.11180.3643-
kum0.54511.3837-0.11480.3362-
TL0.6017--0.1099--
Table 10. The MLE estimates and corresponding SE for the P3 algorithm data.
Table 10. The MLE estimates and corresponding SE for the P3 algorithm data.
Model α ^ β ^ γ ^ SE ( α ^ ) SE ( β ^ ) SE ( γ ^ )
UMW0.35710.00543.81850.11340.00891.1726
UW0.40031.2259-0.12630.2213-
ULFR0.30420.0662-0.13900.0432-
UR0.1672--0.0356--
UE0.8194--0.1747--
Beta0.55391.2198-0.14230.3758-
kum0.57181.2306-0.14780.3483-
TL0.6778--0.1445--
Table 11. The L , AIC, BIC, CAIC, HQIC, W * , A * , K S , and corresponding p-values derived from using eight competitive models for the kidney dialysis data.
Table 11. The L , AIC, BIC, CAIC, HQIC, W * , A * , K S , and corresponding p-values derived from using eight competitive models for the kidney dialysis data.
Model L AICBICCAICHQIC W * A * K S p-Value
UMW−6.3169−6.6339−2.6373−5.6339−5.41210.03810.28190.08790.9820
UW−5.0611−6.1223−3.4579−5.6423−5.30770.06590.44150.12400.7824
ULFR−5.4608−6.9215−4.2571−6.4415−6.10690.04830.34890.11140.8779
UR−4.4825−6.9650−5.6328−6.8112−6.55770.05570.38330.15790.4869
UE1.89335.78667.11885.94046.19380.26061.46950.27580.0283
Beta−3.7776−3.5552−0.8908−3.0752−2.74070.11010.68590.14120.6321
kum−3.6625−3.3249−0.6606−2.8449−2.51040.11360.70490.13770.6629
TL−3.8524−5.7049−4.3727−5.5510−5.29760.10670.66790.14420.6056
Table 12. The L , AIC, BIC, CAIC, HQIC, W * , A * , K S , and corresponding p-values derived from using eight competitive models for the air conditioning system data.
Table 12. The L , AIC, BIC, CAIC, HQIC, W * , A * , K S , and corresponding p-values derived from using eight competitive models for the air conditioning system data.
Model L AICBICCAICHQIC W * A * K S p-Value
UMW−18.3687−30.7375−26.5339−29.8144−29.39270.07540.46250.11020.8596
UW−15.1924−26.3847−23.5823−25.9403−25.48820.15931.01920.17410.3228
ULFR−17.4207−30.8414−28.0390−30.3969−29.94480.10550.65460.15150.4969
UR−12.7731−23.5462−22.1449−23.4033−23.09790.12540.79330.12620.7260
UE35.933573.867175.268374.009974.31530.61643.41080.6626<0.0001
Beta−13.2463−22.4926−19.6902−22.0482−21.59610.21731.38590.19580.2003
kum−13.5389−23.0778−20.2754−22.6334−22.18130.21091.34700.18790.2403
TL−11.9802−21.9604−20.5592−21.8175−21.51210.23791.51030.19390.2095
Table 13. The L , AIC, BIC, CAIC, HQIC, W * , A * , K S , and corresponding p-values derived from using eight competitive models for the computing time of P3 algorithm data.
Table 13. The L , AIC, BIC, CAIC, HQIC, W * , A * , K S , and corresponding p-values derived from using eight competitive models for the computing time of P3 algorithm data.
Model L AICBICCAICHQIC W * A * K S p-Value
UMW−9.5357−13.0714−9.7982−11.7380−11.73800.04250.28790.12280.8944
UW−7.1755−10.3509−8.1689−9.7194−9.83690.10480.64060.18330.4508
ULFR−7.9048−11.8096−9.6275−11.1779−11.29550.08270.51460.15970.6286
UR−2.7357−3.4713−2.3803−3.2713−3.21430.07950.49660.24230.1511
UE3.07588.15169.24278.35168.40860.16210.95820.44160.0004
Beta−6.7819−9.5639−7.3818−8.9323−9.04980.11730.71230.20020.3413
kum−6.8436−9.6872−7.5052−9.0557−9.17320.11580.70370.19630.3650
TL−5.4982−8.9965−7.9055−8.7965−8.73950.12490.75570.18480.4401
Table 14. Testing of hypothesis results.
Table 14. Testing of hypothesis results.
FullSubdfData IData IIData III
ModelModel Λ p-Value Λ p-Value Λ p-Value
UMWUW12.51160.113016.35260.01174.72040.0298
ULFR11.71220.19071.8960.16853.26180.0709
UR23.66880.1597111.19120.003713.60.0011
UE216.42040.00027108.60440.000025.2230.0000
ULFRUR11.95660.161889.29520.002310.33820.0013
UE114.70820.00013106.70840.0000021.96120.0000
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Sarhan, A.M.; Manshi, T.; Sobh, M.E. The Unit-Modified Weibull Distribution: Theory, Estimation, and Real-World Applications. Stats 2025, 8, 81. https://doi.org/10.3390/stats8030081

AMA Style

Sarhan AM, Manshi T, Sobh ME. The Unit-Modified Weibull Distribution: Theory, Estimation, and Real-World Applications. Stats. 2025; 8(3):81. https://doi.org/10.3390/stats8030081

Chicago/Turabian Style

Sarhan, Ammar M., Thamer Manshi, and M. E. Sobh. 2025. "The Unit-Modified Weibull Distribution: Theory, Estimation, and Real-World Applications" Stats 8, no. 3: 81. https://doi.org/10.3390/stats8030081

APA Style

Sarhan, A. M., Manshi, T., & Sobh, M. E. (2025). The Unit-Modified Weibull Distribution: Theory, Estimation, and Real-World Applications. Stats, 8(3), 81. https://doi.org/10.3390/stats8030081

Article Metrics

Back to TopTop