The UG-EM Lifetime Model: Analysis and Application to Symmetric and Asymmetric Survival Data

Omalsad H. Odhah; Saba M. Alwan; Sarah Aljohani

doi:10.3390/sym17122027

¹

Department of Mathematical Sciences, College of Science, Princess Nourah Bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia

²

Department of Mathematics and Sciences, Prince Sultan University, Riyadh 11586, Saudi Arabia

^*

Author to whom correspondence should be addressed.

Symmetry 2025, 17(12), 2027; https://doi.org/10.3390/sym17122027

This article belongs to the Section Mathematics

Abstract

This paper introduces the UG-EM (Unconditional Gamma-Exponential Model) as a new compound lifetime model designed to enhance flexibility in tail behavior compared to traditional distributions. The UG-EM model provides a unified framework for analyzing deviations from symmetry in survival data, effectively capturing right-skewed patterns, which are commonly observed in real-world lifetime phenomena. The main analytical properties are derived, including the probability density, cumulative distribution, hazard and reversed-hazard functions, mean residual life, and several measures of dispersion and uncertainty. The effects of the UG-EM parameters (α and λ) are examined, showing that increasing either parameter can cause a temporary reduction in entropy H(T) at early times followed by a long-term increase; in some cases, the influence of α is stronger than that of λ. Parameter estimation is carried out using the maximum likelihood method and assessed through Monte Carlo simulations to evaluate estimator bias and variability, highlighting the significant role of sample size in estimation accuracy. The proposed model is applied to three survival datasets (Lung, Veteran, and Kidney) and compared with classical alternatives such as Exponential, Weibull, and Log-normal distributions using standard goodness-of-fit criteria. Results indicate that the UG-EM model offers superior flexibility and can capture patterns that simpler models fail to represent, although the empirical results do not demonstrate a clear, consistent superiority over standard competitors across all tested datasets. The paper also discusses identifiability issues, estimation challenges, and practical implications for reliability and medical survival analysis. Recommendations for further theoretical development and broader model comparison are provided.

Keywords:

Unconditional Gamma-Exponential (UG-EM); symmetry; compound lifetime distributions; survival analysis; hazard function; maximum likelihood estimation (MLE); mean residual life (MRL)

1. Introduction

Mixture models and unconditional distribution models have been widely applied in the analysis of lifetime and survival data, particularly for capturing unobserved heterogeneity.

Pickles in [1] reviewed some of the main approaches to the analysis of multivariate censored survival data. They compared the performance of conditional and mixture likelihood approaches in estimating models with frailty effects in censored bivariate survival data and found that the mixture methods were surprisingly robust against misspecification of the frailty distribution. Building on the idea of model flexibility, Ref. [2] proposed a generalized Weibull distribution (the exponentiated Weibull), providing greater flexibility in modeling various shapes of hazard functions (increasing, decreasing, and bathtub-shaped). Similarly, Ref. [3] developed a mixture model based on the extended exponential–geometric distribution to describe heterogeneous survival data, where the maximum likelihood method was used to estimate the model parameters. In contrast, Ref. [4] developed survival models derived from stable distributions of the positive numbers—the gamma, the degenerate, and the inverse Gaussian distributions—to describe heterogeneity in populations and to show how these models affect hazard and survival functions.

Lawless in [5] provides a comprehensive framework for modeling and analyzing lifetime data. The book covers classical lifetime distributions, such as Weibull, Exponential, and Gamma, along with methods for parameter estimation. In addition to that, Ref. [6] addresses censored and truncated data, life testing, hazard models, and diagnostic tools for assessing system performance based on empirical data.

Another important contribution to the field of reliability engineering emphasizing practical and lifetime modeling was made by [7], while Ref. [8] presented a mathematical and probabilistic treatment of lifetime distributions, including Weibull models and other theoretical aspects.

Kuo and Peng in [9] introduced a mixture model approach to analyze beetle data that included both exact observations and interval-censored cases. Building on this line of research, later contributions have sought to develop more flexible lifetime models. For instance, Rubio and Hong [10] proposed a log two-piece model as a flexible class of lifetime distribution. They estimated its parameters via maximum likelihood and evaluated the model using information criteria such as AIC. The applicability of their method was further demonstrated with real datasets, including Mayo primary biliary cirrhosis and lung cancer studies.

In addition to mixture and frailty approaches, considerable attention has been given to extending simple one-parameter models, such as exponential distribution, by additional parameters, often to provide greater flexibility in the tail behavior. Ref. [11] was among the first to formalize this idea, proposing the Beta-G family of distributions by embedding the Beta distribution to generate new probability models. This framework was subsequently broadened by [12], which developed the mathematical properties of these generated families, including their density and distribution functions, moments, and reliability characteristics.

Building on the work of Kumaraswamy, Cordeiro and de Castro [13] introduced a new family of generalized distributions extending classical models such as the Weibull and Gamma. This idea was further developed through the generalized beta-generated family [14], enhancing flexibility in modeling hazard behaviors. Later, Torabi and Montazari [15] proposed the logistic-uniform distribution, adding further adaptability to lifetime modeling. Collectively, these studies advanced the theoretical foundation for developing more flexible and realistic reliability models.

Many flexible lifetime models have been proposed by introducing extra shape parameters, compounding techniques, and hierarchical structures. Study [16] introduced the beta exponential distribution, an extension of the classical exponential model obtained by applying the beta generator to provide more flexibility in modeling lifetime data with various hazard rate shapes. The study provided a comprehensive treatment of the mathematical properties of the distribution. Similarly, Ref. [17] proposed a four-parameter lifetime model, called the gamma-extended Fréchet distribution, which is a new lifetime model that generalizes the traditional Fréchet distribution. Later, Ref. [18] developed a new general method for generating families of continuous distributions based on transformations of random variables. Collectively, these contributions have advanced the development of flexible distribution families for lifetime and reliability modeling.

Kundu and Gupta [19] investigated the Marshall–Olkin bivariate Weibull distribution, developed Bayesian estimation methods for its parameters, and provided a comprehensive framework for analyzing dependent lifetime data, enhancing reliability analysis in multicomponent systems. In contrast, Ref. [20] proposed a generalized modified Weibull distribution, extending the classical Weibull model to capture a wide variety of hazard rate shapes. These models are able to capture various hazard patterns, including bathtub-shaped, unimodal, and other non-monotonic forms often observed in reliability and medical data.

Ghitany, Atieh, and Nadarajah [21] examined the Lindley distribution as an alternative to the exponential model, complementing the exponential–geometric model earlier proposed by Adamidis and Loukas in 1998 and demonstrating its effectiveness for lifetime data with non-constant hazard rates. Study [22] introduced the complementary exponential–geometric distribution, further enhancing flexibility in modeling heterogeneous survival data. Complementing these distributional developments, Ref. [23] presented a comprehensive framework for Bayesian survival analysis, offering powerful inferential tools for lifetime modeling. Meanwhile, the book by Ref. [24] focused on the analysis of multivariate survival data, addressing dependence structures among correlated lifetimes. Its strong emphasis on conceptual foundations and modeling strategies makes it an equally valuable reference for both applied statisticians and practitioners.

Ref. [25] proposed a modified Weibull extension to model bathtub-shaped failure rates, addressing early failures, stable periods, and wear-out phases. Building on methodological developments in survival analysis, Ref. [26] provided tools for analyzing interval-censored failure time data. Extending ideas introduced by Lehmann in 1953, Refs. [16,27] introduced a class of exponentiated generalized distributions, highlighting their properties and real-data applications. Similarly, Ref. [28] developed the beta generalized exponential distribution to flexibly model diverse lifetime behaviors. Ref. [29] presented the generalized additive models for location, scale, and shape, offering a flexible framework for modeling univariate response variables. Collectively, these studies enriched the statistical framework for analyzing complex survival and reliability data.

Building on the foundational distributional frameworks, recent studies have utilized established estimation and analytical techniques to investigate the properties of newly developed lifetime models. Ref. [30] proposed shrinkage-type estimators and compared them with the standard maximum likelihood estimation (MLE) method in reliability analysis. Ref. [31] examined the mathematical properties of two newly introduced lifetime distributions, deriving survival and hazard functions, moments, moment-generating functions, mean deviation, Rényi entropy, and quantile functions, and demonstrated the consistency of MLE through Monte Carlo simulations. Similarly, Refs. [32,33] introduced new families of compound probability distributions and analyzed their statistical characteristics using MLE. Ref. [34] derived analytical expressions for the PDF, CDF, survival, and hazard functions, mean residual life, and several entropy measures for an entropy-transformed Weibull model. Ref. [35] focused on cumulative residual entropy and its dynamics for residual lifetimes, while Ref. [36] examined generalized entropy measures to assess information loss in reliability systems. Expanding on these developments, Ref. [37] proposed extended concepts of cumulative residual entropy and formulated expressions for residual and cumulative entropies for continuous distributions. Collectively, these studies highlight how established statistical tools continue to enhance the analysis, characterization, and understanding of complex modern lifetime distributions.

Mixture models combine two or more probability distributions to represent heterogeneous populations. The mixed distribution describes the overall population, while the mixing distribution assigns weights to each component. Mathematically, it is expressed as

M (x) = \int k (x | u) P (u) d u,

where $M (x)$ denotes the mixture distribution, $k (x | u)$ is the conditional (component) density function of $x$ given $u$, and $P (u)$ is the mixing distribution, determining the relative contribution of each component [38].

2. Model Formulation

2.1. Unconditional Model

Suppose that, conditional on $β$, T has a Gamma distribution with mean $α / β$, and $β$ has an exponential distribution with mean $1 / λ$. In this case, the unconditional distribution of T is a mixture model, which is called the Unconditional Gamma–Exponential Model (UG-EM).

Let the conditional (on $β$) probability density function of T be given by

f_{con.} (t | β) = \frac{β^{α}}{Γ (α)} t^{α - 1} e^{- β t}, α > 0, β > 0, t > 0

(1)

with shape $α$ and rate $β$. Suppose that $β$ itself follows an exponential distribution with rate $λ$:

g (β) = λ e^{- λ β}, λ > 0, β > 0

(2)

The unconditional (marginal) distribution of T is

f (t) = \int_{0}^{\infty} f_{con.} (t | β) \cdot g (β) d β = \frac{λ t^{α - 1}}{Γ (α)} \int_{0}^{\infty} β^{α} e^{- β (t + λ)} d β = \frac{λ t^{α - 1}}{{(t + λ)}^{α + 1}} \cdot \frac{Γ (α + 1)}{Γ (α)} = \frac{α λ t^{α - 1}}{{(t + λ)}^{α + 1}}, t > 0, α > 0, λ > 0

(3)

where $α$ and $λ$ are the shape and scale parameters, respectively, of the distribution. Usually, $f_{con.} (t | β)$ is called the mixed distribution, while $g (β)$ is called the mixing distribution. To verify that $f (t)$ is a valid probability density function, we check that it integrates to one. Let

u = \frac{t}{t + λ} \to t = \frac{λ u}{1 - u} \to d t = \frac{λ}{{(1 - u)}^{2}} d u

Then

\int_{0}^{\infty} \frac{α λ t^{α - 1}}{{(t + λ)}^{α + 1}} d t = \int_{0}^{1} \frac{α λ {(\frac{λ u}{1 - u})}^{α - 1}}{{(\frac{λ}{1 - u})}^{α + 1}} \cdot \frac{λ}{{(1 - u)}^{2}} d u = \int_{0}^{1} α u^{α - 1} d u = 1

2.2. Special Cases

-: In UG-EM, if $α = 1$,

f (t) = \frac{λ}{{(t + λ)}^{2}}, t \geq 0, λ > 0

The Lomax (Pareto Type II) density with shape $α$ and scale $λ$ is

f_{Lomax} (t) = \frac{α}{λ} {(1 + \frac{t}{λ})}^{- (α + 1)}, t \geq 0, α > 0, λ > 0,

which, for $α = 1$, reduces to

f_{Lomax} (t) = \frac{λ}{{(t + λ)}^{2}}, t \geq 0, λ > 0

Thus, when $α = 1$, the UG-EM density coincides with the Lomax (Pareto Type II) density with unit shape parameter. Lomax, in [39], introduced the Lomax distribution, also known as the Pareto Type II distribution, which is a two-parameter model widely used in reliability and economics.

The primary motivation for introducing the UG-EM model is its compound probabilistic structure. It is explicitly derived as an Unconditional Gamma–Exponential (UG-EM) lifetime model, utilizing the Gamma–Exponential methodology to model unobserved heterogeneity. This provides a robust statistical framework for lifetime analysis. Furthermore, we note that the Lomax (or Pareto Type II) distribution with unit shape parameter is a special case within the broader UG-EM family, occurring when $α = 1$, whereas the UG-EM model operates on a broader parameter space ($α > 0$).

-: The UG-EM density can be written as

f (t) = \frac{α}{λ} \cdot {(\frac{t}{λ})}^{α - 1} {(1 + \frac{t}{λ})}^{- α - 1}

If we put $x = \frac{t}{λ}$, we get

f (t) = \frac{α}{λ} \cdot x^{α - 1} {(1 + x)}^{- α - 1}

(4)

Compare this with the Beta-prime density

f_{BP} (x) = \frac{x^{p - 1} {(1 + x)}^{- p - q}}{B (p, q)}, x \geq 0,

where $B (p, q)$ denotes the Beta function. With $p = α, q = 1$, and noting that $B (α, 1) = 1 / α$, we get $f (t) = \frac{1}{λ} f_{BP} (\frac{t}{λ})$; that is, UG-EM is a scaled version of the Beta-prime distribution with parameters $p = α, q = 1$.

We note that the analytical expression for the probability density function of the UG-EM model is algebraically equivalent to that of a scaled Beta-prime distribution in [40]. However, the derivation of these functions in this section serves a crucial methodological and analytical purpose.

2.3. Cumulative Distribution (CDF) of (UG-EM)

F (t) = \int_{0}^{t} α λ \cdot \frac{u^{α - 1}}{{(u + λ)}^{α + 1}} d u = {(\frac{t}{t + λ})}^{α}, t > 0, α, λ > 0

(5)

It is easy to check

f (t) = d F (t) / d t

.
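The compounding construction behind Equations (3) and (5) can be checked by simulation. The following is a minimal sketch, not part of the original analysis: it draws $β$ from an exponential distribution with rate $λ$, then $T$ from a Gamma distribution with shape $α$ and rate $β$, and compares the empirical distribution function with Equation (5). The function names, evaluation points, sample size, and seed are illustrative assumptions.

```python
import random


def ugem_pdf(t, alpha, lam):
    """UG-EM density, Equation (3)."""
    return alpha * lam * t ** (alpha - 1) / (t + lam) ** (alpha + 1)


def ugem_cdf(t, alpha, lam):
    """UG-EM distribution function, Equation (5)."""
    return (t / (t + lam)) ** alpha


def sample_compound(alpha, lam, rng):
    """Draw T by compounding: beta ~ Exp(rate lam), T | beta ~ Gamma(shape alpha, rate beta)."""
    beta = rng.expovariate(lam)
    # random.gammavariate is parameterized by (shape, scale), so scale = 1 / beta.
    return rng.gammavariate(alpha, 1.0 / beta)


def empirical_cdf_gap(alpha, lam, n=50_000, seed=1):
    """Largest gap between the empirical and the closed-form CDF at a few points."""
    rng = random.Random(seed)
    draws = [sample_compound(alpha, lam, rng) for _ in range(n)]
    gaps = []
    for t in (0.5, 1.0, 2.0, 5.0, 20.0):  # illustrative evaluation points
        empirical = sum(x <= t for x in draws) / n
        gaps.append(abs(empirical - ugem_cdf(t, alpha, lam)))
    return max(gaps)


print(empirical_cdf_gap(alpha=2.0, lam=1.5))
```

A small gap (of the order of the Monte Carlo error) is what one would expect if the marginal distribution of the simulated $T$ agrees with Equation (5).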

A graphical illustration is shown in Figure 1, in which we present both the probability density function $f (t)$ and the cumulative distribution function $F (t)$ for several values of the parameters α and λ. This helps visualize the flexibility of the model and its ability to represent various types of lifetime data.

Figure 1. Graphical illustration of $f (t)$ and $F (t)$ for several values of parameters.

3. Model Analysis: Reliability and Statistical Properties

3.1. Reliability and Hazard Rate Functions

The reliability function, denoted by R(t), measures the probability that a system or component will still operate without failure during a specified period of time t. The reliability function can be expressed as

R (t) = P (T > t) = 1 - F (t) = 1 - {(\frac{t}{t + λ})}^{α} = \frac{{(t + λ)}^{α} - t^{α}}{{(t + λ)}^{α}}, t > 0, α > 0, λ > 0 .

(6)

Clearly,

-: Large $t$ , $R (t) \to 0$ (high chance of failure)
-: Small $t$ , $R (t) \to 1$ (high chance of survival)

The hazard rate function (HR) represents the instantaneous rate of occurrence of an event (such as failure or death) at a specific time t given that the event has not occurred up to that time. For a given distribution, $h (t)$ is defined simply as

h (t) = f (t) / R (t),

where $f (t)$ and $R (t)$ are the density and reliability functions for this distribution. Substituting the density and reliability functions of our model from Equations (3) and (6) into $h (t)$, we get the following form:

h (t) = \frac{\frac{α λ t^{α - 1}}{{(t + λ)}^{α + 1}}}{\frac{{(t + λ)}^{α} - t^{α}}{{(t + λ)}^{α}}} = \frac{α λ t^{α - 1}}{(t + λ) [{(t + λ)}^{α} - t^{α}]}, α > 0, λ > 0, t > 0

(7)

It is clear that at $α = λ = 1$, we get

R (t) = h (t) = \frac{1}{t + 1},

representing a hyperbolic decline over time. This coincidence is unusual, since in general, $R (t)$ and $h (t)$ exhibit very different shapes.
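Equations (6) and (7) are straightforward to evaluate numerically. The sketch below is an illustration under assumed function names and test points, not code from the study; it implements both functions and checks the special case $α = λ = 1$, where both reduce to $1 / (t + 1)$.

```python
def ugem_pdf(t, alpha, lam):
    """UG-EM density, Equation (3)."""
    return alpha * lam * t ** (alpha - 1) / (t + lam) ** (alpha + 1)


def reliability(t, alpha, lam):
    """Reliability function R(t), Equation (6)."""
    return 1.0 - (t / (t + lam)) ** alpha


def hazard(t, alpha, lam):
    """Hazard rate h(t), Equation (7)."""
    numerator = alpha * lam * t ** (alpha - 1)
    denominator = (t + lam) * ((t + lam) ** alpha - t ** alpha)
    return numerator / denominator


# Special case alpha = lam = 1: R(t) and h(t) both equal 1 / (t + 1).
for t in (0.1, 1.0, 5.0):
    print(t, reliability(t, 1.0, 1.0), hazard(t, 1.0, 1.0), 1.0 / (t + 1.0))
```

Evaluating `hazard` on a grid of $t$ values for a few choices of $α$ is a quick way to inspect the shape of the hazard curve without relying on a plot.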

The reliability function $R (t)$ and hazard rate $h (t)$ are used to describe the lifetime behavior of the proposed model, allowing us to analyze how the system’s survival probability and risk of failure evolve over time. Figure 2 illustrates $F (t), f (t), R (t),$ and $h (t)$ for different values of the parameters. Figure 3 illustrates the reliability function $R (t)$ and hazard rate $h (t)$ and how survival and risk evolve over time with different values of parameters α (shape parameter) and λ (scale parameter). The curve $R (t)$ declines from unity toward zero; by Equation (6), the decline is slower for larger values of α or λ. By Equation (7), the hazard curve $h (t)$ decreases over time when α does not exceed unity (for α equal to unity it reduces to $1 / (t + λ)$), whereas for α above unity it first rises from zero and eventually decays roughly like $1 / t$.

Figure 2. $F (t), f (t), R (t),$ and $h (t)$ for different times with (a) $α = 1$ and $λ = 1$, (b) $α = 2$ and $λ = 1.5$.

Figure 3. Reliability and hazard functions for varying shape (α) and scale (λ) parameters.

3.2. Reversed Hazard Rate

The reversed hazard rate (RHR) is an important concept with diverse applications in actuarial sciences, forensic studies, and various other fields. It describes the instantaneous failure rate at time $t$, conditional on the event having occurred at or before t. Mathematically, it is defined as

r (t) = \frac{f (t)}{F (t)}, t > 0

where $f (t)$ and $F (t)$ are the probability density function (pdf) and cumulative distribution function (cdf) of the non-negative random variable T, respectively. In the UG-EM model, the RHR formula is obtained directly from the general definition:

r (t) = \frac{\frac{α λ t^{α - 1}}{{(t + λ)}^{α + 1}}}{\frac{t^{α}}{{(t + λ)}^{α}}} = \frac{α λ}{t (t + λ)}, α > 0, λ > 0, t > 0

(8)

which is strictly decreasing in $t$ for all $α > 0$ and $λ > 0$. Figure 4 presents the pattern of $r (t)$ and $h (t)$ for different values of these parameters. The curves illustrate how $α$ and $λ$ influence the failure dynamics: $r (t)$ is proportional to $α$ and behaves like $α / t$ near the origin and like $α λ / t^{2}$ for large $t$, whereas $h (t)$ may increase over an initial period when $α > 1$ before declining. This contrast highlights that the two rates do not always move in the same direction, and thus it is important to take both into account when a general reliability assessment is needed. The hazard rate and the reversed hazard rate are complementary concepts: together, they provide a complete picture of the failure behavior of a system, capturing both future and past perspectives of risk.
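As a quick numerical check of Equation (8), the following sketch compares the closed form with the ratio $f (t) / F (t)$ computed from Equations (3) and (5). The parameter values and evaluation points are arbitrary choices made for illustration.

```python
def ugem_pdf(t, alpha, lam):
    """UG-EM density, Equation (3)."""
    return alpha * lam * t ** (alpha - 1) / (t + lam) ** (alpha + 1)


def ugem_cdf(t, alpha, lam):
    """UG-EM distribution function, Equation (5)."""
    return (t / (t + lam)) ** alpha


def reversed_hazard(t, alpha, lam):
    """Reversed hazard rate r(t), Equation (8)."""
    return alpha * lam / (t * (t + lam))


alpha, lam = 2.5, 1.5  # illustrative parameter values
for t in (0.1, 0.5, 1.0, 5.0, 50.0):
    ratio = ugem_pdf(t, alpha, lam) / ugem_cdf(t, alpha, lam)
    print(t, reversed_hazard(t, alpha, lam), ratio)
```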

Figure 4. Different curves of $r (t)$ with $h (t)$ of UG-EM for different values of parameters $α$ and $λ$.

3.3. Effect of Parameters

In the following, we present a graphical illustration of these parameters’ effects on reliability function

R (t)

, hazard rate

h (t),

and mean residual life

r (t) .

Figure 5 illustrates how the reliability function

R (t)

, hazard rate

h (t)

, and mean residual life

r (t)

vary with parameters α and λ. As shown, the system exhibits clear parametric sensitivity, with noticeable gradients in all three plots. In particular, by Equation (6), the reliability at a fixed time increases with both parameters, while the hazard rate and residual life respond nonlinearly, indicating the complex behavior of the model under different parameter regimes.

Figure 5. Illustration of the effects of parameters α and λ on reliability function R(t), hazard rate h(t), and mean residual life r(t).

3.4. The Mean Residual Life (MRL)

The Mean Residual Life (MRL), which is also referred to as remaining lifetime, is a survival analysis concept. It tells us the expected time left until an event occurs, given that a system has survived up to a certain point. Mathematically, the mean residual life at time t, denoted as MRL(t), is defined as follows [41].

M R L (t) = μ (t) = E (T - t | T > t) = \frac{1}{R (t)} \int_{t}^{\infty} R (x) d x = \frac{{(t + λ)}^{α}}{{(t + λ)}^{α} - t^{α}} \int_{t}^{\infty} [1 - {(\frac{x}{x + λ})}^{α}] d x

(9)

We use a binomial expansion to examine the behavior of $M R L (t)$. For large $x$, we have $\frac{λ}{x} \to 0$, and therefore

{(\frac{x}{x + λ})}^{α} = {(1 + \frac{λ}{x})}^{- α} \approx 1 - α \frac{λ}{x} + \frac{α (α + 1)}{2!} {(\frac{λ}{x})}^{2} - \frac{α (α + 1) (α + 2)}{3!} {(\frac{λ}{x})}^{3} + \dots

Consequently,

1 - {(\frac{x}{x + λ})}^{α} \approx α \frac{λ}{x} - \frac{α (α + 1)}{2!} {(\frac{λ}{x})}^{2} + \frac{α (α + 1) (α + 2)}{3!} {(\frac{λ}{x})}^{3} - \dots

Hence, for sufficiently large $x$,

1 - {(\frac{x}{x + λ})}^{α} \approx \frac{α λ}{x}

Since

\int_{t}^{\infty} \frac{α λ}{x} d x = α λ \lim_{c \to \infty} [\ln c - \ln t] = \infty,

the tail integral of $R (x)$ diverges by comparison, and thus

M R L (t) = \frac{1}{R (t)} \int_{t}^{\infty} [1 - {(\frac{x}{x + λ})}^{α}] d x = \infty

So, $M R L (t)$ diverges for any $t > 0, α > 0, λ > 0$.
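The divergence can be made concrete by truncating the integral in Equation (9) at a finite cutoff $c$. The following is a minimal sketch under our own assumptions (Simpson's rule on a logarithmic grid, the function names, and the chosen cutoffs are not taken from the paper). For $α = 1$ the truncated value has the closed form $(t + λ) \ln \frac{c + λ}{t + λ}$, which grows logarithmically in $c$ and offers a convenient check.

```python
import math


def reliability(x, alpha, lam):
    """R(x) = 1 - (x / (x + lam))**alpha, written to stay accurate for large x."""
    return -math.expm1(alpha * math.log1p(-lam / (x + lam)))


def truncated_mrl(t, alpha, lam, c, n=20_000):
    """(1 / R(t)) * integral of R(x) over [t, c], by Simpson's rule (n must be even)."""
    # Substituting x = exp(s) spreads the slowly decaying tail evenly in s.
    a, b = math.log(t), math.log(c)
    h = (b - a) / n

    def g(s):
        x = math.exp(s)
        return reliability(x, alpha, lam) * x  # dx = x ds

    total = g(a) + g(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return (h / 3.0) * total / reliability(t, alpha, lam)


for c in (1e2, 1e4, 1e6):
    print(c, truncated_mrl(t=2.0, alpha=2.0, lam=1.5, c=c))
```

The printed values keep growing with $c$, which is the numerical counterpart of the logarithmic divergence derived above.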

In the following, Table 1 presents the pattern of MRL for different values of parameters at fixed time.

Table 1. The MRL of UG-EM at $t = 2$ and $5$, for different values of α and $λ$.

Clearly, the truncated MRL increases as the cutoff C increases. This is illustrated graphically in Figure 6.

Figure 6. The MRL of UG-EM with increasing values of C.

4. The Moments of UG-EM

This section focuses on the moments of the UG-EM distribution, which are key statistical measures used to describe its central tendency, variability, and shape.

4.1. The $r$-th Moments

E (T^{r}) = \int_{0}^{\infty} \frac{α λ t^{α + r - 1}}{{(t + λ)}^{α + 1}} d t = \lim_{c \to \infty} \int_{0}^{c} \frac{α λ t^{α + r - 1}}{{(t + λ)}^{α + 1}} d t, r = 1, 2, 3, \dots

We analyzed the limit behavior of this integral as the upper limit $c$ approaches infinity for $r \geq 1$; regrettably, we found that the integral does not converge to a finite value. Consider first the mean

E (T) = \int_{0}^{\infty} \frac{α λ t^{α}}{{(t + λ)}^{α + 1}} d t

(10)

Let $t = λ x$; then $d t = λ d x$ and

E (T) = \int_{0}^{\infty} \frac{α λ {(λ x)}^{α}}{{(λ x + λ)}^{α + 1}} λ d x = α λ \int_{0}^{\infty} \frac{x^{α}}{{(x + 1)}^{α + 1}} d x = α λ I (α)

(11)

where

I (α) = \int_{0}^{\infty} \frac{x^{α}}{{(x + 1)}^{α + 1}} d x

Let us split $I (α)$ as

I (α) = I_{1} (α) + I_{2} (α)

(12)

where

I_{1} (α) = \int_{0}^{1} \frac{x^{α}}{{(x + 1)}^{α + 1}} d x and I_{2} (α) = \int_{1}^{\infty} \frac{x^{α}}{{(x + 1)}^{α + 1}} d x

For the first part,

I_{1} (α) = \int_{0}^{1} \frac{x^{α}}{{(x + 1)}^{α + 1}} d x \leq \int_{0}^{1} \frac{x^{α}}{1^{α + 1}} d x = \int_{0}^{1} x^{α} d x = \frac{1}{α + 1} < \infty, \forall α > 0

so $I_{1} (α)$ is finite. For $I_{2} (α)$, we have

\frac{x^{α}}{{(x + 1)}^{α + 1}} = \frac{x^{α}}{x^{α + 1} {(1 + \frac{1}{x})}^{α + 1}} = \frac{1}{x} \cdot \frac{1}{{(1 + \frac{1}{x})}^{α + 1}}

\lim_{x \to \infty} \frac{\frac{x^{α}}{{(x + 1)}^{α + 1}}}{\frac{1}{x}} = \lim_{x \to \infty} \frac{1}{{(1 + \frac{1}{x})}^{α + 1}} = 1

So,

\frac{x^{α}}{{(x + 1)}^{α + 1}} \sim \frac{1}{x} as x \to \infty

Moreover, since ${(1 + \frac{1}{x})}^{α + 1} \leq 2^{α + 1}$ for $x \geq 1$,

I_{2} (α) = \lim_{c \to \infty} \int_{1}^{c} \frac{x^{α}}{{(x + 1)}^{α + 1}} d x \geq 2^{- (α + 1)} \lim_{c \to \infty} \int_{1}^{c} \frac{1}{x} d x = \infty

Therefore,

I (α) = I_{1} (α) + I_{2} (α) = \infty,

and hence

E (T) = \infty

For the variance, we have

V a r (T) = E (T^{2}) - {[E (T)]}^{2}

(13)

where

E (T^{2}) = α λ \int_{0}^{\infty} \frac{t^{α + 1}}{{(t + λ)}^{α + 1}} d t

The integrand tends to 1 as $t \to \infty$, so the same asymptotic argument used for $E (T)$ gives $E (T^{2}) = \infty$; therefore, $V a r (T)$ is not finite.

In such cases, asymptotic analysis, numerical evaluation, or simulation-based approaches can be employed to provide clearer insights.

4.2. Simulation and Numerical Approximation

In this section, we present a numerical approximation of the truncated integrals in

E (T) = α λ \lim_{c \to \infty} \int_{0}^{c} \frac{t^{α}}{{(t + λ)}^{α + 1}} d t,

E (T^{2}) = α λ \lim_{c \to \infty} \int_{0}^{c} \frac{t^{α + 1}}{{(t + λ)}^{α + 1}} d t

and

V a r (T) = E (T^{2}) - {(E (T))}^{2}

as the upper limit c increases, for fixed values of the parameters α and λ. Table 2 presents numerical approximations of the mean and variance of the UG-EM model at cutoff c. A graphical illustration is presented in Figure 7.

Table 2. Approximate values of $E (T), E (T^{2}),$ and $V a r (T)$ for different values of α, λ, and cutoff c, based on numerical integration.

Figure 7. Growth of $E (T), E (T^{2}),$ and $V a r (T)$ over time for the UG-EM model at different values of the model parameters.

Clearly, as the integration limit c increases, both E(T) and Var(T) increase steadily. In the limit as c→∞, the values grow without bound, in agreement with the divergence established in Section 4.1.

The functions increase steadily with c, because the integrand is always positive, so adding more area as c grows naturally leads to a larger total value.
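One way such truncated moments could be computed is sketched below. This is not the procedure used to produce Table 2; it is an assumption-laden illustration. The substitution $t = λ (e^{s} - 1)$, which turns the truncated $r$-th moment into $α λ^{r} \int_{0}^{\ln (1 + c / λ)} {(1 - e^{- s})}^{α + r - 1} e^{(r - 1) s} d s$, is our own choice made so that Simpson's rule behaves well on a long integration range, and all function names are hypothetical.

```python
import math


def simpson(g, a, b, n=20_000):
    """Composite Simpson's rule on [a, b] with an even number n of subintervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3.0


def truncated_moment(r, alpha, lam, c):
    """alpha * lam * integral over [0, c] of t**(alpha + r - 1) / (t + lam)**(alpha + 1)."""
    upper = math.log1p(c / lam)  # image of the cutoff c under t = lam * (exp(s) - 1)

    def g(s):
        return (-math.expm1(-s)) ** (alpha + r - 1) * math.exp((r - 1) * s)

    return alpha * lam ** r * simpson(g, 0.0, upper)


alpha, lam = 2.0, 1.5  # illustrative parameter values
for c in (1e2, 1e4, 1e6):
    m1 = truncated_moment(1, alpha, lam, c)
    m2 = truncated_moment(2, alpha, lam, c)
    print(c, m1, m2, m2 - m1 ** 2)
```

For $α = 1$ the truncated mean has the closed form $λ [\ln \frac{c + λ}{λ} + \frac{λ}{c + λ} - 1]$, which can serve as a sanity check on any numerical scheme.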

4.3. The Mode and the Median of UG-EM

-: The median $t_{med}$ is the solution of the equation

P (T < t) = 0.5

\to {(\frac{t}{t + λ})}^{α} = 0.5

\to t = t_{med} = \frac{λ \cdot {0.5}^{1 / α}}{1 - {0.5}^{1 / α}}

(14)

-: The mode of the UG-EM is the value $t_{mod}$ that satisfies $f' (t) = 0$, that is,

\frac{α (α - 1) λ t^{α - 2} {(t + λ)}^{α + 1} - α (α + 1) λ t^{α - 1} {(t + λ)}^{α}}{{(t + λ)}^{2 (α + 1)}} = 0

By solving this equation, we get

t = t_{mod} = \frac{λ (α - 1)}{2}, α \geq 1

(15)
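Equations (14) and (15) translate directly into code. The sketch below is illustrative only: it generalizes Equation (14) to an arbitrary quantile by inverting Equation (5), an extension we add for convenience, and the function names and parameter values are assumptions.

```python
def ugem_pdf(t, alpha, lam):
    """UG-EM density, Equation (3)."""
    return alpha * lam * t ** (alpha - 1) / (t + lam) ** (alpha + 1)


def ugem_cdf(t, alpha, lam):
    """UG-EM distribution function, Equation (5)."""
    return (t / (t + lam)) ** alpha


def ugem_quantile(p, alpha, lam):
    """Solve F(t) = p for t; p = 0.5 gives the median of Equation (14)."""
    q = p ** (1.0 / alpha)
    return lam * q / (1.0 - q)


def ugem_median(alpha, lam):
    return ugem_quantile(0.5, alpha, lam)


def ugem_mode(alpha, lam):
    """Mode from Equation (15), stated there for alpha >= 1."""
    if alpha < 1:
        raise ValueError("Equation (15) is stated for alpha >= 1")
    return lam * (alpha - 1) / 2.0


alpha, lam = 3.0, 2.0  # illustrative parameter values
med, mod = ugem_median(alpha, lam), ugem_mode(alpha, lam)
print(med, ugem_cdf(med, alpha, lam))  # the second value should be close to 0.5
print(mod, ugem_pdf(mod, alpha, lam))
```

The same `ugem_quantile` function can be used for inverse-transform sampling by feeding it uniform random numbers.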

In the following, Table 3 presents numerical computations of the median and mode of the UG-EM model for different parameter values.

Table 3. Median and mode of UG-EM for different values of its parameters.

Figure 8, below, presents how the parameters affect the patterns of the median and mode of the model.

Figure 8. Effect of both α and λ on the median and mode of the UG-EM.

As evident from the data presented above in Table 3 and Figure 8, the median and mode exhibit a noticeable rise as the values of α or λ increase.

5. Entropy

In information theory, entropy is a measure of the amount of information or uncertainty in a system. For discrete distributions it is non-negative, whereas the differential entropy of a continuous distribution, used here, can also take negative values; in both cases it depends on the probability distribution of events or outcomes. High entropy means more randomness and unpredictability in the data, while low entropy implies more predictability, less information content, and less surprise. It finds applications in diverse scientific and engineering contexts. The entropy $H (T)$ of the random variable $T$, with a density function $f (t)$, is defined as the expectation of the function $- \ln f (T)$:

H (T) = - E [\ln f (T)] = - \int_{- \infty}^{\infty} f (t) \cdot \ln f (t) d t

For the UG-EM,

H (T) = - \int_{0}^{\infty} \ln \frac{α λ t^{α - 1}}{{(t + λ)}^{α + 1}} \cdot \frac{α λ t^{α - 1}}{{(t + λ)}^{α + 1}} d t = - \lim_{c \to \infty} \int_{0}^{c} \ln \frac{α λ t^{α - 1}}{{(t + λ)}^{α + 1}} \cdot \frac{α λ t^{α - 1}}{{(t + λ)}^{α + 1}} d t

(16)
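The integral in Equation (16) can also be evaluated over the whole range without a cutoff. The sketch below is our own illustration rather than the method behind Table 4: it assumes the substitution $u = t / (t + λ)$ from Section 2.1, under which $f (t) d t = α u^{α - 1} d u$ and $f (t) = \frac{α}{λ} u^{α - 1} {(1 - u)}^{2}$, and applies a simple midpoint rule on $(0, 1)$. The function name and grid size are assumptions.

```python
import math


def ugem_entropy(alpha, lam, n=100_000):
    """Midpoint-rule approximation of H(T) = -E[ln f(T)] after substituting u = t / (t + lam)."""
    h = 1.0 / n
    total = 0.0
    for i in range(n):
        u = (i + 0.5) * h  # midpoints avoid the endpoint singularities at u = 0 and u = 1
        log_f = math.log(alpha / lam) + (alpha - 1.0) * math.log(u) + 2.0 * math.log1p(-u)
        total += alpha * u ** (alpha - 1.0) * log_f
    return -total * h


# For alpha = 1 the model is the unit-shape Lomax case of Section 2.2, whose
# differential entropy is ln(lam) + 2; this gives a reference value for the check.
lam = 1.5
print(ugem_entropy(1.0, lam), math.log(lam) + 2.0)
```

The midpoint rule converges slowly near the logarithmic singularities, so for small $α$ a finer grid or an adaptive quadrature would be a safer choice.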

Entropy shows how uncertain or spread out the outcomes are, while expectation shows the average outcome at the center of the data. Putting them side by side in a table therefore makes the analysis clearer and more insightful. In Table 4 and Figure 9, below, we present the expectation and entropy for the UG-EM model at different values of $α$ and $λ$.

Table 4. Expectation and entropy evolution in UG-EM over time for varying α and λ.

Figure 9. Expectation and entropy evolution in UG-EM over time for varying α and λ.

Table 4 and Figure 9 illustrate the time evolution of the expectation $E (T)$ and entropy $H (T)$ for UG-EM for various values of α and λ. The two quantities increase with time, which means the system is developing and becoming more uncertain. Still, entropy $H (T)$ grows more slowly. Changes in α affect the system much more strongly than changes in λ, something we will see more clearly below. These results are numerical approximations intended to illustrate how the system’s behavior changes with the parameters, even in cases where the theoretical integrals do not converge.

When fitting the data for E(T) and H(T) in Table 4, a quadratic model yields $R^{2}$ values of 99.9% and 98.9%, respectively. This indicates that both α and λ have a significant effect on increases or decreases in the system mean or entropy. Such influence reflects a higher degree of disorder and unpredictability within the UG-EM model. Figure 10 presents these effects.

Figure 10. The effect of parameters $α$ and $λ$ on E(T) and H(T) at a fixed time.

Clearly, E(T) increases with both α and λ. Conversely, H(T) exhibits a decreasing trend with increasing values of both parameters. These plots capture the behavior at a specific moment in time.

It is important to note, from Table 4 and Figure 10, that both the mean E(T) and entropy H(T) increase over time, indicating system growth and rising uncertainty. However, at fixed times there are small transient effects: increasing α or λ can produce a temporary concentration (slight decrease) in H(T) at early times, while their net effect over longer time horizons is an increase in H(T). The influence of α is stronger than that of λ.

6. Order Statistics

Consider an i.i.d. random sample of size $m$, denoted as $X_{1}, X_{2}, \dots, X_{m}$, selected from a continuous distribution with probability density function $f (x)$ and cumulative distribution function $F (x)$. Let $X_{1 : m} \leq X_{2 : m} \leq \dots \leq X_{m : m}$ represent the corresponding order statistics. The probability density function (pdf) of the $k$-th order statistic, denoted as $X_{k : m}$, $1 \leq k \leq m$, is given by [42]

f_{k : m} (t) = \frac{m!}{(k - 1)! (m - k)!} f (t) {[F (t)]}^{k - 1} {[1 - F (t)]}^{m - k}

f_{k : m} (t, α, λ) = \frac{m!}{(k - 1)! (m - k)!} \cdot \frac{α λ t^{α - 1}}{{(t + λ)}^{α + 1}} {(\frac{t}{t + λ})}^{α (k - 1)} {[\frac{{(t + λ)}^{α} - t^{α}}{{(t + λ)}^{α}}]}^{m - k} = \frac{m!}{(k - 1)! (m - k)!} \cdot \frac{α λ t^{α k - 1} {[{(t + λ)}^{α} - t^{α}]}^{m - k}}{{(t + λ)}^{α m + 1}}

(17)

and corresponding cumulative distribution function (cdf)

F_{k . m} (t) = \sum_{i = k}^{m} (\begin{matrix} m \\ i \end{matrix}) {[F (t)]}^{i} {[1 - F (t)]}^{m - i} = \sum_{i = k}^{m} (\begin{matrix} m \\ i \end{matrix}) {[F (t)]}^{i} {[R (t)]}^{m - i}, 1 < k < m, t > 0

F_{k . m} (t, α, λ) = \sum_{i = k}^{m} (\begin{matrix} m \\ i \end{matrix}) {(\frac{t}{t + λ})}^{α i} {[\frac{{{(t + λ)}^{α} - t}^{α}}{{(t + λ)}^{α}}]}^{m - i}

(18)

As a special case, we have

F_{\max}(t) = [F(t)]^{m} = \left(\frac{t}{t+λ}\right)^{α m}

(19)

F_{\min}(t) = 1 - [R(t)]^{m} = 1 - \left[\frac{(t+λ)^{α} - t^{α}}{(t+λ)^{α}}\right]^{m}

(20)

These expressions hold for $1 < k < m$, $t > 0$, $α > 0$, and $λ > 0$.
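The order-statistic formulas above can be checked numerically. The following is a minimal sketch, assuming the UG-EM cdf $F(t) = (t/(t+λ))^{α}$ and the corresponding pdf used in Equation (17); the function names (`ugem_cdf`, `order_stat_pdf`, and so on) are illustrative and not taken from the paper. It evaluates Equations (17) and (18) and compares the cases $k = m$ and $k = 1$ with Equations (19) and (20).

```python
import math


def ugem_cdf(t, alpha, lam):
    """UG-EM cdf F(t) = (t / (t + lam))**alpha for t > 0."""
    return (t / (t + lam)) ** alpha


def ugem_pdf(t, alpha, lam):
    """UG-EM pdf f(t) = alpha*lam*t**(alpha-1) / (t + lam)**(alpha+1)."""
    return alpha * lam * t ** (alpha - 1) / (t + lam) ** (alpha + 1)


def order_stat_pdf(t, k, m, alpha, lam):
    """Eq. (17): pdf of the k-th order statistic in a sample of size m."""
    F = ugem_cdf(t, alpha, lam)
    const = math.factorial(m) / (math.factorial(k - 1) * math.factorial(m - k))
    return const * ugem_pdf(t, alpha, lam) * F ** (k - 1) * (1.0 - F) ** (m - k)


def order_stat_cdf(t, k, m, alpha, lam):
    """Eq. (18): cdf of the k-th order statistic (binomial tail sum)."""
    F = ugem_cdf(t, alpha, lam)
    return sum(math.comb(m, i) * F ** i * (1.0 - F) ** (m - i)
               for i in range(k, m + 1))


# Illustrative values only (the same m, alpha, lambda as in Figure 11).
alpha, lam, m, t = 3.5, 2.0, 9, 3.0
F = ugem_cdf(t, alpha, lam)
print("k = m:", order_stat_cdf(t, m, m, alpha, lam), "vs Eq. (19):", F ** m)
print("k = 1:", order_stat_cdf(t, 1, m, alpha, lam), "vs Eq. (20):", 1 - (1 - F) ** m)

# The pdf in Eq. (17) should be the derivative of the cdf in Eq. (18).
h = 1e-5
slope = (order_stat_cdf(t + h, 4, m, alpha, lam)
         - order_stat_cdf(t - h, 4, m, alpha, lam)) / (2 * h)
print("numerical d/dt of cdf:", slope, "pdf:", order_stat_pdf(t, 4, m, alpha, lam))
```

The finite-difference comparison in the last lines is only a sanity check that the two closed forms are consistent with each other.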

Figure 11 presents the pdf of the $k$-th order statistic for specific parameter values.

The plot in Figure 11 illustrates the behavior of the probability density functions (PDFs) of the order statistics $T_{k:m}$ from a UG-EM distribution for sample size $m = 9$, with shape parameter $α = 3.5$ and scale parameter $λ = 2$. The peak of each curve moves to the right and becomes lower as $k$ increases. In plain terms, the smaller order statistics concentrate near low $t$-values (towards the sample minimum), while the larger ones appear at higher $t$-values and are more spread out. These patterns match the usual behavior of ordered samples.
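The shift of the peaks described here can be reproduced with a simple grid search. The sketch below is an illustration under stated assumptions: it uses the closed form on the right-hand side of Equation (17) with $m = 9$, $α = 3.5$, and $λ = 2$, and the helper `peak_on_grid`, the grid range, and the step size are arbitrary choices made for this example.

```python
import math


def order_stat_pdf(t, k, m, alpha, lam):
    """Closed form of Eq. (17) for the UG-EM k-th order statistic pdf."""
    const = math.factorial(m) / (math.factorial(k - 1) * math.factorial(m - k))
    survival_part = ((t + lam) ** alpha - t ** alpha) ** (m - k)
    return (const * alpha * lam * t ** (alpha * k - 1) * survival_part
            / (t + lam) ** (alpha * m + 1))


def peak_on_grid(k, m, alpha, lam, t_max=100.0, step=0.05):
    """Return (t, pdf value) at the grid point where the pdf is largest."""
    n_points = round(t_max / step)
    grid = [step * (j + 1) for j in range(n_points)]
    return max(((t, order_stat_pdf(t, k, m, alpha, lam)) for t in grid),
               key=lambda pair: pair[1])


m, alpha, lam = 9, 3.5, 2.0
peaks = [peak_on_grid(k, m, alpha, lam) for k in range(1, m + 1)]
for k, (t_star, height) in enumerate(peaks, start=1):
    print(f"k={k}: peak near t={t_star:.2f}, height={height:.4f}")
```

For the sample maximum ($k = m$) the mode can also be found by hand: setting the derivative of $\ln f_{m:m}$ to zero gives $t = λ(αm - 1)/2$, which is 30.5 for these values, so the grid result for $k = 9$ can be compared against it.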

Figure 11. Illustration of the probability density behavior of the k-th order statistics for a fixed sample size and fixed parameters.

7. Statistical Inference (Estimation)

7.1. Maximum Likelihood Estimation (MLE)

By maximizing the likelihood function (LF), we seek the parameter values that make the observed data most likely to occur. This approach ensures that our estimated model aligns well with the actual data we have at hand. We consider a random sample $t_{i}$ $(i = 1, \dots, n)$ drawn from the UG-EM in (1), with the following joint probability function (the likelihood function)

L(t_{1}, \dots, t_{n} | Θ) = \prod_{i=1}^{n} f(t_{i}; Θ), \quad Θ = (α, λ)

L(t_{1}, \dots, t_{n} | α, λ) = (α λ)^{n} \prod_{i=1}^{n} \frac{t_{i}^{α-1}}{(t_{i}+λ)^{α+1}} = (α λ)^{n} \frac{\prod_{i=1}^{n} t_{i}^{α-1}}{\prod_{i=1}^{n} (t_{i}+λ)^{α+1}}

(21)

By taking the natural logarithm of $L(t; α, λ)$ in Equation (21), we get

ℓ(t_{1}, \dots, t_{n} | Θ) = n \ln α + n \ln λ + (α - 1) \sum_{i=1}^{n} \ln t_{i} - (α + 1) \sum_{i=1}^{n} \ln(t_{i} + λ)

(22)

When $α$ is known, say $α = α_{0}$, the MLE of $λ$ is the solution of the following likelihood equation

\frac{\partial ℓ}{\partial λ} = \frac{n}{λ} - (α_{0} + 1) \sum_{i=1}^{n} \frac{1}{t_{i} + λ} = 0

(23)

Similarly, when $λ$ is known, say $λ = λ_{0}$, the MLE of $α$ is the solution of the following likelihood equation

\frac{\partial ℓ}{\partial α} = \frac{n}{α} + \sum_{i=1}^{n} \ln t_{i} - \sum_{i=1}^{n} \ln(t_{i} + λ_{0}) = 0

(24)

which can be solved explicitly:

\frac{1}{α} = \frac{1}{n} \sum_{i=1}^{n} \ln(t_{i} + λ_{0}) - \frac{1}{n} \sum_{i=1}^{n} \ln t_{i}

\frac{1}{\hat{α}} = \bar{T}_{1} - \bar{T}_{2} \Rightarrow \hat{α} = (\bar{T}_{1} - \bar{T}_{2})^{-1}

where $\bar{T}_{1}$ and $\bar{T}_{2}$ are the sample means of $T_{1} = \ln(t + λ_{0})$ and $T_{2} = \ln t$, respectively.
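The closed-form estimator for known $λ_{0}$ is easy to try on simulated data. The following is a minimal sketch, assuming the UG-EM cdf $F(t) = (t/(t+λ))^{α}$, which can be inverted to give $t = λ u^{1/α} / (1 - u^{1/α})$ for a uniform draw $u$; the sampler `sample_ugem`, the seed, and the sample size are illustrative assumptions rather than the authors' setup.

```python
import math
import random


def sample_ugem(alpha, lam, n, rng):
    """Inverse-transform sampling: solve (t/(t+lam))**alpha = u for t."""
    draws = []
    while len(draws) < n:
        v = rng.random() ** (1.0 / alpha)   # v = t / (t + lam)
        if 0.0 < v < 1.0:                   # skip the degenerate endpoints
            draws.append(lam * v / (1.0 - v))
    return draws


def alpha_hat_known_lambda(times, lam0):
    """alpha-hat = 1 / (mean ln(t + lam0) - mean ln t), the root of Eq. (24)."""
    n = len(times)
    mean_t1 = sum(math.log(t + lam0) for t in times) / n
    mean_t2 = sum(math.log(t) for t in times) / n
    return 1.0 / (mean_t1 - mean_t2)


rng = random.Random(2025)
times = sample_ugem(alpha=2.0, lam=3.5, n=20000, rng=rng)
print("alpha-hat with lambda0 = 3.5:", alpha_hat_known_lambda(times, 3.5))
```

With $λ_{0}$ fixed, $\ln(1 + λ_{0}/T)$ is exponentially distributed with rate $α$, so this estimator is the usual exponential-rate MLE applied to a transformed sample.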

When both $α$ and $λ$ are unknown, the maximum likelihood estimates (MLEs) of these parameters are the simultaneous solutions of the following system of equations

\left. \begin{matrix} \frac{\partial ℓ}{\partial α} = \frac{n}{α} + \sum_{i=1}^{n} \ln t_{i} - \sum_{i=1}^{n} \ln(t_{i} + λ) = 0 \\ \frac{\partial ℓ}{\partial λ} = \frac{n}{λ} - (α + 1) \sum_{i=1}^{n} \frac{1}{t_{i} + λ} = 0 \end{matrix} \right\}

(25)

This system of nonlinear equations has no closed-form solution; it is therefore solved numerically using an appropriate iterative method, such as the Newton–Raphson method or gradient descent, as is done in the simulation study below.
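One way to solve system (25) numerically is sketched below. It is not the Newton–Raphson or gradient-descent routine mentioned in the text: as a swapped-in alternative, it uses the fact that the first equation of (25) gives $α$ in closed form for any fixed $λ$, and then maximises the resulting profile log-likelihood over $\ln λ$ by golden-section search. The function names, the search bounds, and the simulated data are assumptions made for illustration, and the search presumes that the profile has a single interior maximum.

```python
import math
import random


def sample_ugem(alpha, lam, n, rng):
    """Inverse-transform draws from F(t) = (t / (t + lam))**alpha."""
    draws = []
    while len(draws) < n:
        v = rng.random() ** (1.0 / alpha)
        if 0.0 < v < 1.0:
            draws.append(lam * v / (1.0 - v))
    return draws


def log_likelihood(alpha, lam, times):
    """Log-likelihood of Eq. (22)."""
    n = len(times)
    return (n * math.log(alpha) + n * math.log(lam)
            + (alpha - 1.0) * sum(math.log(t) for t in times)
            - (alpha + 1.0) * sum(math.log(t + lam) for t in times))


def alpha_given_lambda(lam, times):
    """Closed-form root of the alpha equation in (25) for a fixed lambda."""
    return len(times) / sum(math.log1p(lam / t) for t in times)


def score_lambda(alpha, lam, times):
    """Left-hand side of the lambda equation in (25)."""
    return len(times) / lam - (alpha + 1.0) * sum(1.0 / (t + lam) for t in times)


def fit_ugem(times, lam_lo=1e-3, lam_hi=1e3, iters=60):
    """Golden-section search for the maximum of the profile over log(lambda)."""
    def profile(log_lam):
        lam = math.exp(log_lam)
        return log_likelihood(alpha_given_lambda(lam, times), lam, times)

    ratio = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = math.log(lam_lo), math.log(lam_hi)
    c, d = b - ratio * (b - a), a + ratio * (b - a)
    fc, fd = profile(c), profile(d)
    for _ in range(iters):
        if fc < fd:                      # maximum lies in [c, b]
            a, c, fc = c, d, fd
            d = a + ratio * (b - a)
            fd = profile(d)
        else:                            # maximum lies in [a, d]
            b, d, fd = d, c, fc
            c = b - ratio * (b - a)
            fc = profile(c)
    lam_hat = math.exp(0.5 * (a + b))
    return alpha_given_lambda(lam_hat, times), lam_hat


rng = random.Random(7)
times = sample_ugem(2.0, 3.5, 5000, rng)
alpha_hat, lam_hat = fit_ugem(times)
print("alpha-hat:", alpha_hat, "lambda-hat:", lam_hat)
print("lambda score at the estimate:", score_lambda(alpha_hat, lam_hat, times))
```

Reducing the problem to one dimension avoids choosing starting values for a two-parameter iteration, at the cost of re-evaluating the sums at every step.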

7.2. Fisher Information and Confidence Intervals

The inverse of the Fisher information matrix provides the asymptotic covariance matrix of the maximum likelihood estimates (MLEs) of the parameters $θ = (α, λ)$. This matrix is an important tool in statistical analysis because it describes how the uncertainty of the estimates changes as the sample size increases, and hence how precise the parameter estimates are. The observed Fisher information matrix is

I_{O}(θ) = I(θ)|_{MLEs} = \left[\begin{matrix} - \frac{\partial^{2} ℓ}{\partial α^{2}} & - \frac{\partial^{2} ℓ}{\partial α \partial λ} \\ - \frac{\partial^{2} ℓ}{\partial λ \partial α} & - \frac{\partial^{2} ℓ}{\partial λ^{2}} \end{matrix}\right]_{α = \hat{α}, λ = \hat{λ}}

= \left[\begin{matrix} \frac{n}{α^{2}} & \sum_{i=1}^{n} \frac{1}{t_{i} + λ} \\ \sum_{i=1}^{n} \frac{1}{t_{i} + λ} & \frac{n}{λ^{2}} - (α + 1) \sum_{i=1}^{n} \frac{1}{(t_{i} + λ)^{2}} \end{matrix}\right]_{α = \hat{α}, λ = \hat{λ}}

(26)

where $ℓ$ is defined in Equation (22). The asymptotic covariance matrix is then $I_{O}^{-1}(θ)$. The approximate $(1 - ξ)100\%$ confidence limits (CLs) for the parameters $α$ and $λ$ are, respectively,

\hat{α} \pm Z_{ξ/2} \sqrt{σ_{O}^{2}(\hat{α})}; \quad \hat{λ} \pm Z_{ξ/2} \sqrt{σ_{O}^{2}(\hat{λ})}

where $σ_{O}^{2}(\hat{α})$ and $σ_{O}^{2}(\hat{λ})$ are the variances of $\hat{α}$ and $\hat{λ}$, calculated from the observed data and given by the diagonal elements of $I_{O}^{-1}(θ)$, and $Z_{ξ/2}$ is the upper $ξ/2$ quantile of the standard normal distribution.
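The confidence limits can be computed directly from the negative second derivatives of the log-likelihood in Equation (22). The sketch below is an illustration under stated assumptions: the matrix entries are obtained by differentiating Equation (22) twice, the data are simulated, and, to keep the example short, the generating values $(α, λ) = (2, 3.5)$ are plugged in as stand-ins for fitted MLEs. The names `observed_information` and `wald_limits` are hypothetical.

```python
import math
import random
from statistics import NormalDist


def sample_ugem(alpha, lam, n, rng):
    """Inverse-transform draws from F(t) = (t / (t + lam))**alpha."""
    draws = []
    while len(draws) < n:
        v = rng.random() ** (1.0 / alpha)
        if 0.0 < v < 1.0:
            draws.append(lam * v / (1.0 - v))
    return draws


def log_likelihood(alpha, lam, times):
    """Log-likelihood of Eq. (22)."""
    n = len(times)
    return (n * math.log(alpha) + n * math.log(lam)
            + (alpha - 1.0) * math.fsum(math.log(t) for t in times)
            - (alpha + 1.0) * math.fsum(math.log(t + lam) for t in times))


def observed_information(alpha, lam, times):
    """Negative Hessian of the log-likelihood, ordered as (alpha, lambda)."""
    n = len(times)
    s1 = math.fsum(1.0 / (t + lam) for t in times)
    s2 = math.fsum(1.0 / (t + lam) ** 2 for t in times)
    return [[n / alpha ** 2, s1],
            [s1, n / lam ** 2 - (alpha + 1.0) * s2]]


def wald_limits(alpha, lam, times, xi=0.05):
    """Approximate (1 - xi)100% limits from the inverse observed information."""
    (i_aa, i_al), (_, i_ll) = observed_information(alpha, lam, times)
    det = i_aa * i_ll - i_al ** 2
    if det <= 0.0 or i_ll <= 0.0:
        raise ValueError("observed information is not positive definite")
    var_alpha, var_lam = i_ll / det, i_aa / det   # diagonal of the 2x2 inverse
    z = NormalDist().inv_cdf(1.0 - xi / 2.0)
    half_a, half_l = z * math.sqrt(var_alpha), z * math.sqrt(var_lam)
    return (alpha - half_a, alpha + half_a), (lam - half_l, lam + half_l)


rng = random.Random(11)
times = sample_ugem(2.0, 3.5, 20000, rng)
# Stand-in estimates: the generating values are used here instead of fitted MLEs.
ci_alpha, ci_lam = wald_limits(2.0, 3.5, times)
print("95% limits for alpha:", ci_alpha)
print("95% limits for lambda:", ci_lam)
```

In practice the matrix would be evaluated at $(\hat{α}, \hat{λ})$; the explicit check on the determinant is there because the two parameters are strongly correlated and the matrix can be close to singular in small samples.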

7.3. Simulation

To study the numerical solution of the nonlinear system in Equation (25), we generated 100 different samples from the UG-EM model using various assumed parameter values, $(α, λ) = (2, 3.5)$, $(3, 2)$, and $(4, 3.5)$, and different sample sizes ($n = 50, 100, 200, 300,$ and $500$). Each case was repeated 100 times to obtain the empirical means, biases, and mean squared errors (MSE) of the estimators. In addition, bootstrap confidence intervals were computed at the 90% and 95% levels to evaluate the precision and reliability of the estimates.
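The structure of such a Monte Carlo study can be sketched as follows. This is a simplified illustration, not the authors' code: it assumes inverse-transform sampling from $F(t) = (t/(t+λ))^{α}$, fits each sample by a profile-likelihood search (a swapped-in optimiser, not necessarily the one used for Table 5), and reports only the mean, bias, and MSE for a single configuration. All function names, the seed, and the search bounds are assumptions.

```python
import math
import random


def sample_ugem(alpha, lam, n, rng):
    """Inverse-transform draws: t = lam * v / (1 - v) with v = u**(1/alpha)."""
    draws = []
    while len(draws) < n:
        v = rng.random() ** (1.0 / alpha)
        if 0.0 < v < 1.0:
            draws.append(lam * v / (1.0 - v))
    return draws


def fit_ugem(times, lam_lo=1e-3, lam_hi=1e3, iters=40):
    """Profile-likelihood MLE: alpha has a closed form for each lambda."""
    n = len(times)

    def profile(log_lam):
        lam = math.exp(log_lam)
        s = sum(math.log1p(lam / t) for t in times)
        # Eq. (22) with alpha = n / s plugged in, up to an additive constant.
        return n * (log_lam - math.log(s)) - s

    ratio = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = math.log(lam_lo), math.log(lam_hi)
    c, d = b - ratio * (b - a), a + ratio * (b - a)
    fc, fd = profile(c), profile(d)
    for _ in range(iters):
        if fc < fd:
            a, c, fc = c, d, fd
            d = a + ratio * (b - a)
            fd = profile(d)
        else:
            b, d, fd = d, c, fc
            c = b - ratio * (b - a)
            fc = profile(c)
    lam_hat = math.exp(0.5 * (a + b))
    alpha_hat = n / sum(math.log1p(lam_hat / t) for t in times)
    return alpha_hat, lam_hat


def monte_carlo(alpha, lam, n, reps, seed=0):
    """Empirical mean, bias, and MSE of the MLEs over `reps` simulated samples."""
    rng = random.Random(seed)
    fits = [fit_ugem(sample_ugem(alpha, lam, n, rng)) for _ in range(reps)]
    summary = {}
    for name, truth, values in (("alpha", alpha, [f[0] for f in fits]),
                                ("lambda", lam, [f[1] for f in fits])):
        mean = sum(values) / reps
        mse = sum((v - truth) ** 2 for v in values) / reps
        summary[name] = {"mean": mean, "bias": mean - truth, "mse": mse}
    return summary


result = monte_carlo(alpha=2.0, lam=3.5, n=500, reps=100, seed=1)
for name, stats in result.items():
    print(name, {key: round(value, 3) for key, value in stats.items()})
```

Looping `monte_carlo` over the parameter pairs and sample sizes listed above would give a table with the same layout as Table 5, though the numbers will differ with the optimiser and the random seed.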

The simulation results in Table 5 show that the MLEs of the UG-EM parameters improve as the sample size increases. The bias and MSE decrease with larger samples, indicating consistency of the estimators and showing how strongly the sample size affects the estimation process. In small samples the estimates are unstable: for $n = 50$ and $n = 100$ the averages and MSEs are very large (for example, an average $\hat{α}$ of 919.55 against a true $α = 2$ at $n = 50$), and for $(α, λ) = (4, 3.5)$ this persists up to $n = 200$. The confidence intervals generally become narrower at larger sample sizes, confirming greater precision and reliability of the estimates. The QQ plots in Figure 12 further illustrate this behavior by comparing the sample quantiles of the simulated data with the corresponding theoretical quantiles of the UG-EM distribution for several values of α and λ at different sample sizes. As the sample size increases, the points align more closely with the 45° reference line, reflecting improved agreement between the simulated and theoretical distributions. The plots confirm that the UG-EM distribution fits the simulated data well and that larger samples provide more accurate representations of the theoretical distribution.
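As a rough cross-check on Table 5, the expected Fisher information of the UG-EM can be worked out in closed form. The sketch below is not part of the original derivation. It assumes the cdf $F(t) = (t/(t+λ))^{α}$, so that $W = T/(T+λ)$ satisfies $W^{α} \sim U(0,1)$ and $E[W^{r}] = α/(α+r)$, and it takes expectations of the negative second derivatives of the log-likelihood in Equation (22).

```latex
\begin{aligned}
E\left[\frac{1}{T+\lambda}\right] &= \frac{1 - E[W]}{\lambda} = \frac{1}{\lambda(\alpha+1)}, \\
E\left[\frac{1}{(T+\lambda)^{2}}\right] &= \frac{E[(1-W)^{2}]}{\lambda^{2}} = \frac{2}{\lambda^{2}(\alpha+1)(\alpha+2)}, \\
I(\alpha,\lambda) &= n \begin{bmatrix}
  \dfrac{1}{\alpha^{2}} & \dfrac{1}{\lambda(\alpha+1)} \\[2ex]
  \dfrac{1}{\lambda(\alpha+1)} & \dfrac{\alpha}{\lambda^{2}(\alpha+2)}
\end{bmatrix}, \qquad
\det I = \frac{n^{2}}{\alpha\,\lambda^{2}(\alpha+1)^{2}(\alpha+2)}, \\
\operatorname{Var}(\hat{\alpha}) &\approx \frac{\alpha^{2}(\alpha+1)^{2}}{n}, \qquad
\operatorname{Var}(\hat{\lambda}) \approx \frac{\lambda^{2}(\alpha+1)^{2}(\alpha+2)}{\alpha\, n}.
\end{aligned}
```

For $(α, λ) = (2, 3.5)$ and $n = 500$ these approximations give $36/500 ≈ 0.072$ and $220.5/500 ≈ 0.441$, close to the MSEs of 0.07 and 0.45 reported in Table 5. For $(α, λ) = (4, 3.5)$ the same formula gives $400/500 = 0.8$ for $\hat{α}$, well below the reported 3.66, which suggests that the asymptotic regime is reached more slowly for larger $α$.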

Table 5. Estimation results for UG-EM parameters Θ = (α, λ): MLEs averages, biases, MSEs, and CIs for different sample sizes.

Figure 12. QQ plots for the UG-EM distribution for selected parameter values (α and λ) across different sample sizes (n = 100, 200, 300, 500).

8. Application

The fit of the UG-EM model is assessed through its application to three well-known survival datasets available in the R survival package. The Lung data come from a study of patients with advanced lung cancer [43] and record the survival time in days until death or censoring. The Kidney data are catheter survival data for kidney patients [45] and record the time from catheter insertion to infection or the end of observation. The Veteran data come from a trial comparing two therapies for lung cancer [44] and record the survival time in days from entry until death or censoring.

Note: It should be noted that the current analysis assumes complete observations. Right-censored data are not explicitly handled in this study, and the reported results should be interpreted accordingly. Future work will extend the model to properly account for censoring in survival datasets.

Data Sources and Ethical Statement: The datasets used in this study (Lung, Veteran, and Kidney) are publicly available through the R survival package (2025.09.0+387). All datasets are fully de-identified and therefore exempt from ethical approval. The Lung dataset originates from the North Central Cancer Treatment Group (NCCTG) study on prognostic variables in advanced lung cancer patients [43]. The Veteran dataset is based on the data described in Kalbfleisch and Prentice [44]. The Kidney dataset corresponds to catheter survival data analyzed using frailty models [45].

According to the AIC and BIC values in Table 6, the Weibull model is the best-fitting model for both the Veteran and Lung datasets, attaining the minimum of both criteria. For the Veteran data the Log-normal distribution is a close second (ΔAIC ≈ 0.84), whereas for the Lung data the Exponential model ranks second and the Log-normal third. The Log-normal model gives the best fit for the Kidney data. The UG-EM model did not outperform the other candidates in any of the datasets: it has the largest AIC for the Lung and Veteran data, although it is competitive for the Kidney data (ΔAIC ≈ 3.85). Notably, the Gamma model failed to fit the data well, so it was excluded from the final comparisons.
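The criteria in Table 6 follow from the log-likelihoods by the standard definitions $AIC = 2\,df - 2\,ℓ$ and $BIC = df \ln n - 2\,ℓ$. The short sketch below recomputes them for the Lung rows, using the log-likelihoods, parameter counts, and sample size transcribed from Table 6; small differences in the last digits are expected because the table reports rounded log-likelihoods.

```python
import math


def aic(loglik, df):
    """Akaike information criterion."""
    return 2.0 * df - 2.0 * loglik


def bic(loglik, df, n):
    """Bayesian information criterion."""
    return df * math.log(n) - 2.0 * loglik


# (log-likelihood, number of parameters) transcribed from Table 6, Lung data.
lung = {"Exponential": (-1096.5, 1), "Weibull": (-1085.27, 2),
        "Log-normal": (-1105.91, 2), "UG-EM": (-1125.96, 2)}
n_lung = 165

scores = {name: (aic(ll, df), bic(ll, df, n_lung)) for name, (ll, df) in lung.items()}
best_aic = min(a for a, _ in scores.values())
best_bic = min(b for _, b in scores.values())
ranking = sorted(scores, key=lambda name: scores[name][0])

for name in ranking:
    a, b = scores[name]
    print(f"{name:12s} AIC={a:9.3f} dAIC={a - best_aic:7.3f} "
          f"BIC={b:9.3f} dBIC={b - best_bic:7.3f}")
```

Sorting by AIC makes the ordering of the candidate models explicit, which is the comparison the discussion of Table 6 relies on.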

Table 6. Comparison of exponential, Weibull, Log-normal, and UG-EM models on Lung, Veteran, and Kidney data.

The UG-EM model is best suited to data with long survival times whose distribution decays noticeably more slowly than an exponential curve, that is, data with a heavy, slowly vanishing right tail.
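The heaviness of the right tail can be made explicit with a short expansion. The lines below are a sketch under the assumption that the reliability function is $R(t) = 1 - (t/(t+λ))^{α}$, as used in Equation (20); the truncated integrals are written with a cutoff $c$ on the assumption that this is how the entries of Table 2 were computed.

```latex
\begin{aligned}
R(t) &= 1 - \left(1 + \frac{\lambda}{t}\right)^{-\alpha}
      = \frac{\alpha\lambda}{t} - \frac{\alpha(\alpha+1)\lambda^{2}}{2t^{2}} + O(t^{-3}),
      \qquad t \to \infty, \\
f(t) &= \frac{\alpha\lambda\, t^{\alpha-1}}{(t+\lambda)^{\alpha+1}} \sim \frac{\alpha\lambda}{t^{2}}, \\
\int_{0}^{c} t\, f(t)\, dt &\approx \alpha\lambda \ln c + \text{const}, \qquad
\int_{0}^{c} t^{2} f(t)\, dt \approx \alpha\lambda\, c \qquad (c \to \infty).
\end{aligned}
```

The survival function therefore decays like $1/t$, whereas an exponential survival function decays like $e^{-θt}$. The pattern is visible in Table 2: for $α = 0.5$ and $λ = 2$ (so $αλ = 1$), E(T) rises by about 2.30, which is close to $\ln 10$, for each tenfold increase in $c$, and $E(T^{2})$ is 99,968 at $c = 100{,}000$.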

Figure 13 presents a graphical comparison of the Exponential, Weibull, Log-normal, and UG-EM models based on the Lung, Veteran, and Kidney data.

Figure 13. Comparison of exponential, Weibull, Log-normal, and UG-EM Models on Lung, Veteran, and Kidney data.

9. Conclusions

We proposed the UG-EM (Unconditional Gamma–Exponential) model as a flexible compound lifetime distribution for analyzing right-skewed survival data. The study presents a complete analytical development, including the density, cumulative, hazard, and reversed-hazard functions, along with measures of dispersion, entropy, and mean residual life. Simulation studies, including convergence diagnostics and bootstrap confidence intervals, indicated that the MLEs are consistent and become stable at larger sample sizes. Applications to real datasets (Lung, Veteran, Kidney) showed that the model can capture deviations from symmetry and provides meaningful insights into parameter behavior, although it did not outperform the Weibull or Log-normal models on these data. Overall, the UG-EM model offers a useful and analytically tractable framework for lifetime modeling, complementing existing distributions without claiming to universally outperform them. Future studies could extend the UG-EM distribution to more generalized forms, assess its performance on larger and more diverse datasets, and compare it with advanced survival models. Deeper mathematical investigation of its properties and evaluation of its behavior under complex censoring schemes also represent promising directions for future research.

Author Contributions

Methodology, S.M.A.; Software, S.M.A.; Validation, O.H.O. and S.M.A.; Formal analysis, S.M.A.; Investigation, S.M.A.; Data curation, S.M.A.; Writing—original draft, O.H.O., S.M.A. and S.A.; Writing—review & editing, O.H.O., S.M.A. and S.A.; Visualization, S.M.A.; Supervision, S.M.A.; Project administration, S.M.A.; Funding acquisition, O.H.O. and S.A. All authors have read and agreed to the published version of the manuscript.

Funding

Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2025R743), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Acknowledgments

The authors would like to extend their sincere appreciation to the Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2025R743), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia. Also, the authors would like to acknowledge the support of Prince Sultan University for their financial support.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

UG-EM	Unconditional Gamma-Exponential Model
HRF	Hazard rate function
RF	Reliability function
Sk	Coefficient of skewness
Ku	Coefficient of kurtosis
LF	Likelihood function
PDF/pdf	Probability density function
CDF/cdf	Cumulative distribution function
RHR	Reversed hazard rate
MRL	Mean residual life
MTTF	Mean time to failure
MLEs	Maximum likelihood estimates
IC	Information criteria
AIC	Akaike information criterion
BIC	Bayesian information criterion
CLs	Confidence limits (confidence intervals)
MSE	Mean Square Error
SD	Standard Deviation

References

1. Pickles, A.; Crouchley, R. A comparison of frailty models for multivariate survival data. Stat. Med. 1995, 14, 1447–1461.
2. Mudholkar, G.S.; Srivastava, D.K.; Kollia, G.D. A generalization of the Weibull distribution with application to the analysis of survival data. J. Am. Stat. Assoc. 1996, 91, 1575–1583.
3. Erisoglu, Ü.; Erol, H. Modeling heterogeneous survival data using mixture of extended exponential–geometric distributions. Commun. Stat.-Simul. Comput. 2010, 39, 1726–1739.
4. Hougaard, P. Survival models for heterogeneous populations derived from stable distributions. Biometrika 1986, 73, 387–396.
5. Lawless, J.F. Statistical Models and Methods for Lifetime Data; John Wiley & Sons: Hoboken, NJ, USA, 2011.
6. Meeker, W.Q.; Escobar, L.A.; Pascual, F.G. Statistical Methods for Reliability Data; John Wiley & Sons: Hoboken, NJ, USA, 2021.
7. Kapur, K.C.; Pecht, M.G. Reliability Engineering; John Wiley & Sons: Hoboken, NJ, USA, 2014.
8. Murthy, D.P.; Xie, M.; Jiang, R. Weibull Models; John Wiley & Sons: Hoboken, NJ, USA, 2004.
9. Kuo, L.; Peng, F. A mixture-model approach to the analysis of survival data. BIOSTATISTICS 2000, 5, 255–272.
10. Rubio, F.J.; Hong, Y. Survival and lifetime data analysis with a flexible class of distributions. J. Appl. Stat. 2016, 43, 1794–1813.
11. Eugene, N.; Lee, C.; Famoye, F. Beta-Normal Distribution and Its Applications. Commun. Stat. Theory Methods 2002, 31, 497–512.
12. Jones, M.C. Families of distributions arising from distributions of order statistics. Test 2004, 13, 1–43.
13. Cordeiro, G.M.; de Castro, M. A new family of generalized distributions. J. Stat. Comput. Simul. 2011, 81, 883–898.
14. Alexander, C.; Cordeiro, G.M.; Ortega, E.M.M.; Sarabia, J.M. Generalized beta-generated distributions. Comput. Stat. Data Anal. 2012, 56, 1880–1897.
15. Torabi, H.; Montazari, N.H. The Logistic-Uniform Distribution and Its Applications. Commun. Stat. Simul. Comput. 2014, 43, 2551–2569.
16. Nadarajah, S.; Kotz, S. The beta exponential distribution. Reliab. Eng. Syst. Saf. 2006, 91, 689–697.
17. Da Silva, R.V.; de Andrade, T.A.; Maciel, D.B.; Campos, R.P.; Cordeiro, G.M. A new lifetime model: The gamma extended Fréchet distribution. J. Stat. Theory Appl. 2013, 12, 39–54.
18. Alzaatreh, A.; Lee, C.; Famoye, F. A new method for generating families of continuous distributions. Metron 2013, 71, 63–79.
19. Kundu, D.; Gupta, A.K. Bayes estimation for the Marshall–Olkin bivariate Weibull distribution. Comput. Stat. Data Anal. 2013, 57, 271–281.
20. Carrasco, J.M.; Ortega, E.M.; Cordeiro, G.M. A generalized modified Weibull distribution for lifetime modeling. Comput. Stat. Data Anal. 2008, 53, 450–462.
21. Ghitany, M.E.; Atieh, B.; Nadarajah, S. Lindley distribution and its application. Math. Comput. Simul. 2007, 78, 493–506.
22. Louzada, F.; Roman, M.; Cancho, V.G. The complementary exponential geometric distribution: Model, properties, and a comparison with its counterpart. Comput. Stat. Data Anal. 2011, 55, 2516–2524.
23. Ibrahim, J.G.; Chen, M.H.; Sinha, D. Bayesian Survival Analysis; Springer: Berlin/Heidelberg, Germany, 2005.
24. Hougaard, P. Analysis of Multivariate Survival Data; Springer: New York, NY, USA, 2000; Volume 564.
25. Xie, M.; Tang, Y.; Goh, T.N. A modified Weibull extension with bathtub-shaped failure rate function. Reliab. Eng. Syst. Saf. 2002, 76, 279–285.
26. Sun, J. The Statistical Analysis of Interval-Censored Failure Time Data; Springer: Berlin/Heidelberg, Germany, 2007.
27. Cordeiro, G.M.; Ortega, E.M.; Da Cunha, D.C. The exponentiated generalized class of distributions. J. Data Sci. 2013, 11, 1–27.
28. Barreto-Souza, W.; Santos, A.H.S.; Cordeiro, G.M. The beta generalized exponential distribution. J. Stat. Comput. Simul. 2010, 80, 159–172.
29. Rigby, R.A.; Stasinopoulos, D.M.; Lane, P.W. Generalized additive models for location, scale and shape. J. R. Stat. Soc. Ser. C (Appl. Stat.) 2005, 54, 507–554.
30. Ahmed, S.E.; Belaghi, R.A.; Hussein, A.; Safariyan, A. New and efficient estimators of reliability characteristics for a family of lifetime distributions under progressive censoring. Mathematics 2024, 12, 1599.
31. Dutta, S.; Yadav, R. Generating new lifetime distributions using parsimonious transformation: Properties and applications. Int. J. Stat. Data Anal. 2025, 11, 45–58.
32. Kneib, T.; Schlüter, J.C.; Wacker, B. Revisiting Maximum Log-Likelihood Parameter Estimation for Two-Parameter Weibull Distributions: Theory and Applications. Results Math 2024, 79, 224.
33. Khedr, A.E.; Abdelrahman, A.A.; El-Dawoody, A. A novel family of compound probability distributions: Properties, copulas, risk analysis and assessment under a reinsurance revenues data set. Thai J. Stat. 2025, 23, 615–642.
34. Sindhu, T.N.; Shafiq, A.; Lone, S.A.; Al-Mdallal, Q.M.; Abushal, T.A. Distributional properties of the entropy transformed Weibull distribution and applications to various scientific fields. Sci. Rep. 2024, 14, 31827.
35. Kayid, M.; Alshehri, M.A. Cumulative Residual Entropy of the Residual Lifetime of a Mixed System at the System Level. Entropy 2023, 25, 1033.
36. Tanak, A.K.; Najafi, M.; Borzadaran, G.M. A new lifetime distribution by maximizing entropy: Properties and applications. Probab. Eng. Informational Sci. 2024, 38, 189–206.
37. Sakr, H.H.; Mohamed, M.S. On residual cumulative generalized exponential entropy and its application in human health. Electron. Res. Arch. 2025, 33, 1633–1666.
38. Dixit, V.; Martin, R. Revisiting consistency of a recursive estimator of mixing distributions. Electron. J. Stat. 2023, 17, 1007–1042.
39. Lomax, K.S. Business failures: Another example of the analysis of failure data. J. Am. Stat. Assoc. 1954, 49, 847–852.
40. Johnson, N.L.; Kotz, S.; Balakrishnan, N. Continuous Univariate Distributions; John Wiley & Sons: Hoboken, NJ, USA, 1995; Volume 2.
41. Modarres, M.; Kaminskiy, M.P.; Krivtsov, V. Reliability Engineering and Risk Analysis: A Practical Guide; CRC Press: Boca Raton, FL, USA, 2016.
42. Mitzenmacher, M.; Upfal, E. Probability and Computing: Randomization and Probabilistic Techniques in Algorithms and Data Analysis, 2nd ed.; Cambridge University Press: Cambridge, UK, 2017.
43. Loprinzi, C.L.; Laurie, J.A.; Wieand, H.S.; Krook, J.E.; Novotny, P.J.; Kugler, J.W.; Bartel, J.; Law, M.; Bateman, M.; Klatt, N.E.; et al. Prospective evaluation of prognostic variables from patient-completed questionnaires. North Central Cancer Treatment Group. J. Clin. Oncol. 1994, 12, 601–607. Available online: https://stat.ethz.ch/R-manual/R-devel/library/survival/html/lung.html (accessed on 20 November 2025).
44. Kalbfleisch, J.D.; Prentice, R.L. The Statistical Analysis of Failure Time Data; Wiley: New York, NY, USA, 1980; Available online: https://stat.ethz.ch/R-manual/R-devel/RHOME/library/survival/html/veteran.html (accessed on 20 November 2025).
45. McGilchrist, C.A.; Aisbett, C.W. Regression with frailty in survival analysis. Biometrics 1991, 47, 461–466. Available online: https://stat.ethz.ch/R-manual/R-devel/RHOME/library/survival/html/kidney.html (accessed on 20 November 2025).

Figure 1. Graphical illustration of $f(t)$ and $F(t)$ for several values of parameters.

Figure 2. $F(t)$, $f(t)$, $R(t)$, and $h(t)$ for different times with (a) $α = 1$ and $λ = 1$, (b) $α = 2$ and $λ = 1.5$.

Figure 3. Reliability and hazard functions for varying shape (α) and scale (λ) parameters.

Figure 4. Different curves of $r(t)$ with $h(t)$ of UG-EM for different values of parameters $α$ and $λ$.

Figure 5. Illustration of the effects of parameters α and λ on reliability function R(t), hazard rate h(t), and mean residual life r(t).

Figure 6. The MRL of UG-EM with increasing values of C.

Figure 7. Growth of $E(T)$, $E(T^{2})$, and $Var(T)$ over time for the UG-EM model at different values of the model parameters.

Figure 8. Effect of both α and λ on the median and mode of the UG-EM.

Figure 9. Expectation and entropy evolution in UG-EM over time for varying α and λ.

Figure 10. The effect of parameters $α$ and $λ$ on E(T) and H(T) at a fixed time.


Table 1. The MRL of UG-EM at $t = 2$ and $t = 5$, for different values of $α$ and $λ$.

c	$α$	$λ$	MRL at $t = 2$	$α$	$λ$	MRL at $t = 5$
10	2	3.5	5.722408	3	2.5	4.149589
50			16.04833			17.88753
100			21.24397			24.7736
500			33.89718			41.51089
1000			39.44651			48.84542
5000			52.39552			65.95611
10,000			57.98268			73.33828
100,000			76.55305			97.87419
1,000,000			95.12686			122.4144

Table 2. Approximate values of $E(T)$, $E(T^{2})$, and $Var(T)$ for different values of α, λ, and cutoff c, based on numerical integration.

$α$	$λ$	$c$	$E(T)$	$E(T^{2})$	$Var(T)$
0.1	2	10	0.33113	1	1
		100	0.75607	18	18
		1000	1.21267	197	196
		10,000	1.67279	1996	1993
		100,000	2.13327	19,995	19,991
0.1	5	10	0.49071	2	2
		100	1.44794	42	40
		1000	2.57517	486	479
		10,000	3.724	4979	4965
		100,000	4.87504	49,973	49,949
0.5	2	10	1.26323	5	4
		100	3.32795	89	78
		1000	5.6039	982	951
		10,000	7.90379	9975	9913
		100,000	10.2061	99,968	99,864
0.5	5	10	1.6486	8	5
		100	6.13693	198	160
		1000	11.73022	2406	2268
		10,000	17.46986	24,863	24,558
		100,000	23.22464	249,819	249,280
4	2	10	2.38192	15	9
		100	15.40624	585	348
		1000	33.12996	7605	6507
		10,000	51.47888	79,421	76,771
		100,000	69.89236	799,227	794,342
4	5	10	1.23151	9	7
		100	22.90011	1073	548
		1000	64.79596	17,985	13,787
		10,000	110.4014	196,841	184,652
		100,000	156.4081	1,995,695	1,971,231

Table 3. Median and mode of UG-EM for different values of its parameters.

λ	α	Median	Mode
5.5	4	29.07	8.25
6	1	6.00	0
1	10	13.93	4.5
5.5	12	92.49	30.25
11.8	5.5	87.85	26.55
3	8	33.15	10.5
1	1.5	1.70	0.25
0.5	6	4.08	1.25
5.9	6.4	51.58	15.93
10	1.9	22.71	4.5
5	6	40.83	12.5
7	9	87.43	28
10	10	139.33	45

Table 4. Expectation and entropy evolution in UG-EM over time for varying α and λ.

		$α = 2$			$α = 3$			$α = 4$	
$t$	$λ$	$E(T)$	$H(T)$	$λ$	$E(T)$	$H(T)$	$λ$	$E(T)$	$H(T)$
20	3	4.74	2.29	3	5.13	2.16	3	5.13	1.98
40		7.80	2.85		9.28	2.88		10.13	2.81
60		9.83	3.11		12.16	3.22		13.74	3.22
80		11.35	3.26		14.34	3.42		16.53	3.47
90		11.99	3.31		15.26	3.50		17.72	3.56
100		12.56	3.35		16.10	3.56		18.80	3.64
120		13.57	3.42		17.57	3.66		20.71	3.76
140		14.44	3.48		18.84	3.73		22.36	3.86
200		16.46	3.58		21.83	3.87		26.28	4.04
20	4	4.89	2.22	4	5.02	2.00	4	4.76	1.76
40		8.60	2.87		9.90	2.81		10.47	2.67
60		11.17	3.18		13.45	3.22		14.85	3.16
80		13.11	3.37		16.21	3.47		18.32	3.46
90		13.93	3.44		17.38	3.56		19.82	3.57
100		14.67	3.49		18.46	3.63		21.19	3.66
120		15.98	3.58		20.35	3.76		23.63	3.81
140		17.11	3.65		21.99	3.85		25.74	3.93
200		19.77	3.77		25.88	4.03		30.81	4.16
20	5	4.89	2.13	5	4.8	1.8	5	4.33	1.56
40		9.13	2.86		10.2	2.7		10.46	2.53
60		12.16	3.21		14.3	3.2		15.44	3.06
80		14.49	3.43		17.6	3.5		19.50	3.40
90		15.48	3.51		19.0	3.6		21.27	3.53
100		16.39	3.57		20.3	3.7		22.90	3.64
120		17.98	3.68		22.5	3.8		25.82	3.82
140		19.36	3.76		24.5	3.9		28.37	3.95
200		22.62	3.91		29.3	4.1		34.52	4.23

Table 5. Estimation results for UG-EM parameters Θ = (α, λ): MLEs averages, biases, MSEs, and CIs for different sample sizes.

Parameters			$\hat{α}$					$\hat{λ}$
$α$	$λ$	Sample Size	$n = 50$	$n = 100$	$n = 200$	$n = 300$	$n = 500$	$n = 50$	$n = 100$	$n = 200$	$n = 300$	$n = 500$
2	3.5	Average	919.55	17.62	2.16	2.10	2.03	2570.92	47.30	3.90	3.75	3.57
		Bias	917.55	15.62	0.16	0.10	0.03	2567.42	43.80	0.40	0.25	0.07
		MSE	355,738,300.00	106,405.10	0.44	0.20	0.07	2,846,422,000.00	849,711.30	2.57	1.15	0.45
		95%CI-Low	0.99	1.57	1.17	1.31	1.56	0.99	2.39	1.01	1.92	2.68
		95%CI-UPP	28.94	6.67	2.50	2.82	2.81	81.30	16.50	4.04	5.79	6.22
		90%CI-Low	1.08	1.61	1.19	1.37	1.58	1.25	2.75	1.19	1.99	2.84
		90%CI-UPP	11.36	6.04	2.31	2.58	2.65	31.29	14.30	3.55	4.72	5.48
		Convergence Rate	100	100	100	100	100	100	100	100	100	100
			$\hat{α}$					$\hat{λ}$
3	2	Sample Size	$n = 50$	$n = 100$	$n = 200$	$n = 300$	$n = 500$	$n = 50$	$n = 100$	$n = 200$	$n = 300$	$n = 500$
		Average	1850.18	79.12	3.59	3.27	3.14	1490.08	67.89	2.50	2.22	2.12
		Bias	1847.18	76.12	0.59	0.27	0.14	1488.08	65.89	0.50	0.22	0.12
		MSE	300,570,900.00	693,313.80	8.77	1.19	0.45	176,678,100.00	499,674.70	6.59	0.86	0.34
		95%CI-Low	1.94	2.02	1.92	2.59	2.08	1.02	1.30	1.05	1.70	1.16
		95%CI-UPP	37,983.84	11,475.60	5.94	16.34	5.95	25,871.89	9537.34	5.32	12.84	4.70
		90%CI-Low	2.31	2.16	2.00	2.89	2.19	1.18	1.39	1.13	1.95	1.23
		90%CI-UPP	9112.01	64.66	4.68	10.29	4.66	8571.51	57.36	3.43	8.70	3.40
		Convergence Rate	99.8	100	100	100	100	99.8	100	100	100	100
			$\hat{α}$					$\hat{λ}$
4	3.5	Sample Size	$n = 50$	$n = 100$	$n = 200$	$n = 300$	$n = 500$	$n = 50$	$n = 100$	$n = 200$	$n = 300$	$n = 500$
		Average	4639.48	337.01	167.79	4.61	4.47	4918.09	338.92	169.25	4.15	4.01
		Bias	4635.48	333.01	163.79	0.61	0.47	4914.59	335.42	165.75	0.65	0.51
		MSE	1,705,671,000.00	3,772,530.00	8,918,910.00	3.82	3.66	1,987,658,000.00	3,801,269.00	9,171,170.00	4.30	3.89
		95%CI-Low	1.83	13.36	1.83	2.75	3.14	1.12	12.25	1.20	2.10	2.44
		95%CI-UPP	111,134.50	552,705.90	90.85	9.82	7.54	132,190.90	570,470.30	96.21	10.12	7.57
		90%CI-Low	2.49	22.65	1.94	2.82	3.28	1.63	21.50	1.29	2.18	2.52
		90%CI-UPP	80,731.49	306,188.90	25.57	8.24	6.58	102,487.00	316,410.70	27.77	7.62	6.20
		Convergence Rate	99.8	99.8	100	100	100	99.8	99.8	100	100	100

Table 6. Comparison of exponential, Weibull, Log-normal, and UG-EM models on Lung, Veteran, and Kidney data.

$D a t a$	$n$	$M o d e l$	$L o g L i k$	$d f$	$A I C$	$∆ A I C$	$B I C$	$∆ B I C$
lung	165	Exponential	−1096.5	1	2194.997	20.46648	2198.103	17.36054
		Weibull	−1085.27	2	2174.531	0	2180.743	0
		Log-normal	−1105.91	2	2215.816	41.28505	2222.028	41.28505
		UG-EM	−1125.96	2	2255.92	81.38909	2262.132	81.38909
Veteran	128	Exponential	−743.046	1	1488.092	5.48908	1490.944	2.63705
		Weibull	−739.301	2	1482.602	0	1488.307	0
		Log-normal	−739.722	2	1483.444	0.841807	1489.148	0.841807
		UG-EM	−743.882	2	1491.764	9.161759	1497.468	9.161759
Kidney	58	Exponential	−335.08	1	672.1596	4.581431	674.2201	2.520987
		Weibull	−333.398	2	670.7967	3.218488	674.9176	3.218488
		Log-normal	−331.789	2	667.5782	0	671.6991	0
		UG-EM	−333.712	2	671.4242	3.846022	675.5451	3.846022


© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
