1. Introduction
Without any particular exaggeration, one may say that nearly everything in lifetime data analysis revolves around the hazard rate function (HRF). Lifetime models (LTMs) are categorized according to the shapes of their HRFs. Special attention is paid to the LTMs of “flat-bottomed” HRFs that are commonly named bathtub HRFs. Unfortunately, over time, this name has been used to describe any HRF having a minimum and evidently not being flat-bottomed. The true, i.e., flat-bottomed bathtub hazard rate [
1], is specific to a non-homogeneous population. This population consists of subpopulations of “weak” and “strong” items.
Categorized roughly, LTMs may fall into monolithic or hybrid categories. The most representative monolithic LTMs to be recalled here seem to be the Weibull (W) [
2], Gamma (G) and Gamma Weibull (GW). Their failure density functions (FDFs) are
where
is the scale parameter,
are the shape parameters,
is the failure free time parameter and
is the step function defined as
Please note that LTM (
3) came into being by embedding (
1) into (
2). It makes (
3) more flexible than both (
1) and (
2) owing to the second shape parameter, namely
. However, none of the LTMs in question is sufficiently flexible to be applicable to non-homogeneous populations. It is because they cannot be bimodal. The reader may check it on their own.
The prime example of the hybrid LTM is the compound Weibull (CW) proposed by [
3]. Its FDF is given by
where
.
There is no doubt that (
5) can be regarded as an extension of (
1), however far-reaching.
As mentioned above, monolithic LTMs are unimodal. In contrast, hybrid LTMs may be bimodal. Of course, they can also be unimodal for (
1) or (
2), to the great surprise of the analyst, which turns out to be inapplicable to a homogeneous population. Struck by the superiority of (
5) over (
1)–(
3), one must not overlook the fact that (
5) has twice as many parameters as (
3). Therefore, employing (
5), one should have at its disposal much more input data than when employing (
3). It is essential to guarantee that the (
5) and (
3) sets of estimates are at the same level of accuracy. We face the problem of equilibrating flexibility and data consumption.
To familiarize ourselves with the problem, let us consider the results of the following simple, but very instructive, Monte Carlo experiment. A set of input data that comprises one hundred samples each of 30 items was drawn from the exponential population. The population scale parameter was set equal to one. Then the (
1)–(
3) LTMs were sequentially fitted to the dataset. Parameters were estimated with the maximum-likelihood (ML) method.
Table 1 shows standard deviations of scale parameter estimates.
LTM (
3) produced scale parameter estimates of standard deviation more than five times greater than LTM (
1) did. The explanation is simple, saying freely, “underfeeding” of the scale parameter took place because the shape parameters have “eaten” most of the input data for their estimation purposes.
In general, no one disputes the need for LTMs to be flexible. On the other hand, does the LTM need to have as many as eight parameters? The LTM below, called the Kumaraswamy transmuted exponentiated additive Weibull (KTEAW) [
4], satisfies the mentioned criterion. The cumulative failure function (CFF) of the KTEAW is defined as
where
In general, there are currently two techniques to increase flexibility of LTMs: In the formula of the failure density function, more parameters or the same parameter are embedded in more than one place. The reader is prompted to compare (
1) with (
2). The Weibull distribution turned out to be a little more flexible than the Gamma distribution in Monte Carlo experiments.
The shape parameter can be called static in the sense that it shapes the LTM identically at each time point. In this paper, we will be able to shape the LTM dynamically owing to the following modification: we replace the shape parameter with the shape function. This is an innovative idea. The subject of modification, of course, will be the Weibull LTM, further named the Weibull distribution with shape function (WDSF). The CFF and FDF take the following forms:
The above FDF is a sum of two components. This is a unique property of the LTM in question. Although born as a monolithic LTM, the WDSF turns out to be a hybrid-like LTM. Please note that WDSF is free of the fraction parameter
, which is a data guzzler in (
5). As it is easy to guess, we, further in this paper, consider the simplest version of WDSF that involves a linear shape function.
By the end of this section, let us return to the very origins of the reliability domain. Let us recall two books published by reliability pathfinders of that time. These are [
5,
6]. A device stops working properly, not because the Finger of Fate points it out and says “fail”. It does it because the physical failure process reached its critical level. In the mentioned books, commonly used LTMs have assigned a particular mathematical failure mechanism or process.
For instance, both books assign the Weibull LTM to the model of the weakest link of the chain of elements forming a very long series reliability structure. This point of view appears to be timeless. Therefore, in this paper we have the courage to extend the Weibull LTM assigning the following failure process: As time flows and the wear-out process advances, the series reliability structure steadily elongates, causing device reliability to decrease. Referential mathematical considerations will be presented in
Section 3.
The main goal of our work is to complement the literature on the theory of reliability models by introducing a new distribution with a linear shape function, which is a modification of the Weibull LTM. The additional goal of our paper is to define an estimation method that measures the absolute values of the differences between the empirical and theoretical reliability functions (RFs) (see
Section 4).
The rest of the paper is organized as follows.
Section 2 is devoted to the review of modified Weibull distributions. The properties of the two-version WDSF such as the CFF, RF, FDF, HRF, hazard rate average function (HRAF), quantile (Q) and pseudo-random number generator (PRNG) are described in
Section 3. The estimation methods used are described in
Section 4. Illustrative examples of the applicability and flexibility of the WDSF are presented in
Section 5. Concluding remarks are provided in
Section 7. CFF formulas of bimodal LTMs defined with five to eight parameters and having a bathtub HRF are provided in
Appendix A. As the popularity of the R environment has increased significantly recently, main properties of the new distribution have been implemented in R software, version 4.3.1 [
7]. Their full codes are in
Appendix B.
2. A Review of Modified Weibull Distributions
Generalized Weibull distributions can be constructed in many ways. The first, and in our opinion, the most important way is to define distributions with the Weibull distribution as their special case (including a mixture of two or more Weibull variables). Other ways are, i.e., adding a constant to the hazard rate of the Weibull model; transformations (linear, inverse or log) of the Weibull random variable; transformations of the CFF or survival function of the Weibull models in such a way that the new model remains a CFF or survival function. More details can be found in [
8].
By reviewing the statistical literature, we found 165 generalized Weibull distributions with 2–8 parameters. Among them, four distributions have a domain different from
. They are the reflected Weibull distribution [
9] defined for
as well as the Log-Weibull [
10,
11], modified odd Weibull normal [
12] and extended odd Weibull normal [
13] distributions defined for
.
In the rest of this section, we will focus on such generalized Weibull distributions for which the Swedish research’s distribution is their special case. In this case, the large family of generalized Weibull distributions reduces to 71 distributions with three to eight parameters, named by the authors as modified Weibull distributions.
Modified Weibull distributions are divided into six groups, according to the number of their parameters. The list of these models is presented below. Information on hazard rate function shapes is provided in the superscript (
1unimodal,
2increasing,
3decreasing,
4bathub). Pseudo-bimodal lifetime models with bathtub hazard rate function are in underline (22 models). Bimodal lifetime models with bathtub hazard rate function are in bold (seven models) and their CFFs are presented in
Appendix A.
Group I includes 19 models with three parameters. These are the following: generalized Gamma or Gamma Weibull
1–4 [
14], generalization of Gamma Weibull
1–4 [
15], exponentiated Weibull
1–4 [
16], generalized Weibull
1–4 [
17], exponentiated Weibull
1–4 [
18], power generalized Weibull
1–4 [
19], modified Weibull extension
2–4 [
20], modified Weibull
2–4 [
21], Marshall–Olkin Extended Weibull
1–4 [
22], extended Weibull type
1,4 [
23], generalized power Weibull
2–4 [
24], Extended Weibull
1–4 [
25], Sarhan and Zaindin’s Modified Weibull
2,3 [
26], Weibull Geometric
1–4 [
27], transmuted Weibull
1–4 [
28], complementary Weibull geometric
1–4 [
29],
Alpha power Weibull1–4 [
30],
MIT Weibull1–4 [
31] and Semi-Modified Alpha Power Weibull Distribution [
32].
Group II includes 18 models with four parameters. These are the following: four-parameter generalized Gamma
1–4 [
33], additive Weibull
2–4 [
34], Generalized Modified Weibull
1,2,4 [
35], Kumaraswamy Weibull
1–4 [
36], Exponentiated Generalized Gamma
1,4 [
37], exponentiated modified Weibull
1–4 [
38], Transmuted modified Weibull
2 [
39], Exponentiated transmuted Weibull
2–4 [
40], Generalized Weibull-Exponential
1–3 [
41], Weibull Lomax
2,3 [
42], generalized power generalized Weibull
1–4 [
43], Generalization of Generalized Gamma
1–4 [
44],
Weibull Lomax2–4 [
45],
additive Chen-Weibull2,4 [
46],
Poisson modified Weibull1–4 [
47], modified power generalized Weibull
1–4 [
48], Generalized New Extended Weibull [
49] and new modified exponentiated Weibull (NMEW)
2,4 [
50].
Group III includes 27 models with five parameters. These are the following:
Beta modified Weibull1–4 [
51], Kumaraswamy generalized Gamma
1–4 [
52], beta generalized Weibull
1–4 [
53], Transmuted Exponentiated Modified Weibull
2,3 [
54], Beta Generalized Gamma
1–4 [
55],
transmuted additive Weibull2,4 [
56], Exponentiated Kumaraswamy Weibull
1–4 [
57],
new modified Weibull2–4 [
58], Kumaraswamy modified Weibull
1–4 [
59], beta transmuted Weibull
1–3 [
60],
McDonald Weibull1–4 (McW) [
61], Exponentiated Transmuted Modified Weibull
1,2,4 [
62], exponentiated generalized modified Weibull
2,4 [
63],
Gamma generalized modified Weibull [
64],
generalized modified Weibull geometric4 [
65], additive modified Weibull
2–4 [
66], transmuted exponentiated Weibull geometric
1–4 [
67], transmuted new generalized Weibull
2–3 [
68],
Burr XII modified Weibull1–4 [
69],
log-logistic modified Weibull1,2,4 [
70], Kumaraswamy alpha power Weibull
1–4 [
71],
exponentiated additive Weibull1–4 (EAW) [
72,
73], generalized extended exponential Weibull
1–4 [
74], generalized Weibull generalized exponential
1–2 [
75],
new generalized modified Weibull1–4 [
76] and
improved modified Weibull3,4 [
77].
Group IV includes seven models with six parameters. These are the following:
the mixture of two Weibull4 [
3],
McDonald modified Weibull1–4 (McMW) [
78],
McDonald extended Weibull1–4 (McEW) [
79],
Additive Weibull log logistic1–4 [
80], underbarBeta exponentiated modified Weibull
1–4 [
81],
exponentiated power generalized Weibull binomial1–4 [
82], and
McDonald Generalized Power Weibull1–4 (McGPW) [
83].
Concluding the review of modified Weibull distributions, we would like to mention two distributions with seven and eight parameters. The
Kumaraswamy transmuted exponentiated modified Weibull1–4 (KTEMW) with 54 special cases forms group V [
84]. The
Kumaraswamy transmuted exponentiated additive Weibull1–4 (KTEAW) [
4] with 79 special cases (including KTEMW) forms group VI.
3. Properties of Weibull Distribution with Shape Function
The main goal of the paper is to present the two-version Weibull distribution with the shape function previously denoted as the WDSF. This section describes its properties such as the CFF, RF, FDF, HRF, hazard rate average function (HRAF), quantile (Q), and pseudo-random number generator (PRNG). Both versions of our model will be used in
Section 5.
We assume that the device of our interest has a long series reliability structure. It makes the real lifetime distribution close to the Weibull distribution. The formula below is the Weibull LTM with the elongation coefficient of the device’s reliability structure.
We impose two weak conditions on
with
and
. Owing to such weak conditions, a wide scope opens itself for defining
functions not only of reasonable forms but also of monstrous ones. One reasonable form is
because it introduces only one parameter into the Weibull LTM and therefore equilibrates flexibility and data consumption. Thus, we get
Definition 1. Let from (7) or (8) be a linear shape function with two parameters given by ; then the CFF of the Weibull distribution with linear shape function in the first version () is defined aswhere , is the scale parameter, are the shape parameters and . If , then we get the Weibull distribution. Definition 2. Let from (7) or (8) be a linear shape function with three parameters given by then the CFF of the Weibull distribution with linear shape function in the second version (WDSFII) is defined aswhere , is the failure free time parameter, is the step function (4) and . If , then we get the first version of our proposal. Proposition 1. The RFs of the WDSFI and WDSFII are defined, respectively, as Proof of Proposition 1. The proof based on the RF definition is trivial. □
It is interesting to compare the
and Weibull models on the Weibull probability paper. Taking the logarithm of both sides (
11) twice, we get
Let
then
Figure 1 presents the Weibull probability paper with transformed reliability functions plotted on it. The figure is intended to visualize similarity and dissimilarity between the WDSF
I and Weibull models. One may easily notice the following:
There may exist even relatively long-time intervals inside in which the functions are indistinguishable. It lasts as long as b is much greater as .
Both reliability functions cross a horizontal line in the same point.
As time passes, functions slowly but steadily part. This is the region of the second mode of the WDSFI lifetime model.
Proposition 2. The FDFs of the WDSFI and WDSFII are respectively given bywhere . Proof of Proposition 2. By computing the derivatives based on (
9) and (
10) regarding the lifetime
t, we easily obtain Formulas (
13) and (
14). Recall that
and the Formulas (
13) and (
14) are non-negative.
Let . has a maximum value of 0.135335 for .
Let
; then we have from (
13)
Let
, then we have from (
14)
□
Figure 2 shows the RF of the WDSF
I and WDSF
II. Pseudo-bimodality or bimodality is visible here. The R codes (R Core Team 2021) for calculating the RF values of the WDSF
I and WDSF
II are provided in
Appendix B.
Figure 3 shows the FDF of the WDSF
I and WDSF
II. We see the pseudo-bimodality (on the left) and bimodality (on the right). The R codes for calculating the FDF values of WDSF
I and WDSF
II are provided in
Appendix B.
The WDSF
II with FDF given by (
14) is identifiable in the parameter space
. This identifiability is apparent at the first glance. Particular parameters of the WDSF
II lifetime model have very strictly prescribed roles to impact failure density function. It causes that, for example, an impact of the scale parameter in no way can be compensated with shape and location parameters. In turn, it causes that it is impossible to obtain the same two values of the density function for two different sets of parameters. It is a meaningful fact that during over forty years of my activity in the reliability domain, I never encountered any validation of the Weibull lifetime model. Despite not being in any way validated, the model has for decades kept itself on the second place in the ranking of applicability.
Proposition 3. The HRFs of the WDSFI and WDSFII have, respectively, forms Proof of Proposition 3. The proof based on the HRF definition is trivial. □
Figure 4 shows the bathtub HRF of the WDSF
I and WDSF
II. The curves flatten as c decreases. The R codes for calculating the HRF values of WDSF
I and WDSF
II are provided in
Appendix B.
Proposition 4. The HRAFs of the WDSFI and WDSFII are, respectively, defined as [85] Proof of Proposition 4. The proof based on Formulas (
11) and (
12) and definition of the HRAF is trivial. □
Figure 5 shows the bathtub HRAF of the WDSF
I and WDSF
II. HRAF curves are flatter than HRF curves. The R codes for calculating the HARF values of WDSF
I and WDSF
II are provided in
Appendix B.
Figure 6 shows the logarithm of theoretical and empirical HRAFs for the WDSF
I and sample size
.
Proposition 5. Let . The Qs and of the WDSFI and WDSFII are, respectively, solutions of the equations Proof of Proposition 5. The proof based on (
9) and (
10) is easy. □
The R codes for calculating the
and
values of WDSF
I and WDSF
II are provided in
Appendix B.
Proposition 6. Let , and follow the WDSFI and WDSFII, respectively. We can obtain and in two ways.
Proof of Proposition 6. Based on (
9), we get
and after taking logarithm on both sides (
23), we have (
21). Similarly, based on (
10), we obtain (
22). Recursive formulas in step 3 of the algorithm are obtained based on (
9) and (
10) using the CFF inversion method. □
The R codes for the PRNG of WDSF
I and WDSF
II are provided in
Appendix B.
In this passage, we focus on
Figure 7 and
Figure 8 to answer such questions: what is a small sample used for and what is a large one used for? Conventionally, small samples are those of size
. Analysts of lifetime data as a rule deal with small samples. Thus,
Figure 7 presents the empirical and theoretical reliability functions for sample size
n = 30. This figure impresses on readers how far empirical points may depart from the actual reliability function.
Figure 8 presents the empirical and theoretical reliability functions for sample size
n = 1000. Empirical points practically do not depart from the actual reliability function. This figure impresses on readers that the Weibull-sf random number generator works satisfactorily.
4. Estimation Methods
An additional goal for this article is outlined in this section.
Looking through the literature searching for distributions that are modifications of the Weibull distribution, we find that the most dominant method of parameter estimation is the maximum-likelihood (ML) method. However, the question remains whether this choice is right. The younger the paper, the more often the parameters are estimated using other methods, e.g., the ordinary least-squares (LS) and weighted least-squares (WLS) ones; see, e.g., [
48,
86,
87,
88,
89].
What relates to the WDSFII is that the main fraction of the probability mass is concentrated on the “contractual” end of the distribution. Thus, for small and medium values of t, the Weibull and WDSFII are practically indistinguishable in terms of the reliability or density functions. Censoring appears to be inapplicable to the Weibull-sf except in some special cases. This is the reason why in the reviewed paper a problem of censoring has been omitted. Please notice that this is the same inconvenient situation as with the compound Weibull lifetime model. Finally, one should not marvel at this because the WDSFII has been designed as a “lighter” version of the compound Weibull lifetime model.
Let be parameter vectors and be a random sample of size n from the WDSFI and WDSFII. To estimate unknown values of parameters, we use estimation methods such as the ML, LS, WLS and least absolute values (LAWs). The LAW, which is the first additional goal of the work, measures the absolute values of the differences between the empirical and theoretical RFs.
The likelihood functions (LFs) of the WDSF
I and WDSF
II, based on (13) and (14), are given by, respectively,
where
.
The log-likelihood functions (LLFs) of the WDSF
I and WDSF
II, based on (
25) and (
26), are defined as, respectively,
Formulas have complex forms, so in practice we maximize the LF (27) and (28) or the LLFs (29) and (30) to obtain the ML estimates.
To obtain the OLS estimates of the WDSF
I and WDSF
II parameters, we minimize the following objective functions, respectively.
To obtain the WLS estimates of the WDSF
I and WDSF
II parameters, we minimize the following objective functions, respectively.
Let
be the empirical RF. To obtain the LAW estimates of the WDSF
I and WDSF
II parameters, we minimize the following objective functions, respectively.
The following three books with many real-world data examples are particularly interesting. The first one is the Lai and Xie’s book [
90]. The analysed examples are characterized by sample sizes with
. The second one is the Lee and Wang’s book [
91]. The analysed examples are characterized by sample sizes with
. The third one is the Nelson’s book [
92]. The analysed examples are characterized by sample sizes with
.
Based on the above sample size data, a simulation study was performed with
samples with a size of
. The samples were drawn from the WDSF
II with
, where
. To obtain highly accurate parameter estimates, the optimization procedure was run
times with random initial values
for
and
for
c and
for
. Our estimates for a given sample are the values that minimize (
29), (
31) and (
33) and maximize (
25) or (
27).
The biases (Bs) and the root mean squared errors (RMSEs) of the WDSF
II estimates are shown in
Table 2,
Table 3 and
Table 4. We observe that as the sample size increases, the estimates approach the true values, which means that the estimates are consistent. Bs and RMSE values are the lowest for
. The Bs are the largest for
(
and
(
. The RMSEs are the highest for
. The Bs increase with the value of
c for
. The RMSEs increase with the value of
c for
and decrease for
. The Bs and RMSEs are the lowest (highest) for the ML (LAW) method associated with
and
. The ML method is not suitable for estimating scale parameters. In this case, the lowest Bs and RMSE values are for the WLS method.
To examine the accuracy of the coverage probability of the asymptotic confidence intervals (CIs), another simulation study was performed with
samples using sample sizes of
. The study focused on the parameters
and samples drawn from the WDSF
II with
and
. The coverage probabilities of the obtained
CIs for
reported in
Table 5 are very close to the nominal level. The results suggested that the obtained standard errors and hence the asymptotic CIs are reliable.
6. Discussion
The results obtained in this study demonstrate that the proposed WDSFI and WDSFII models provide a flexible and accurate framework for modelling full-life-cycle data with a wide range of hazard ratio shapes, including monotonic, bathtub, and pseudobimodal patterns. Parameter estimation using OLS, WLS, and LAW methods yielded consistent results, with the LAW approach often demonstrating improved robustness in the presence of irregularities in the empirical reliability function.
From the perspective of previous research, our findings are consistent with, and even extend, previous work on generalizations of the Weibull distribution and its modified forms. Similar to these studies, we observed that introducing additional parameters can significantly improve model fit, allowing for more complex hazard ratio shapes. However, the WDSFI and WDSFII models differ in structure, offering better adaptation to datasets where existing models still exhibit systematic biases.
Regarding our working hypotheses, empirical analyses support the assumption that additional flexibility in scale structure and distribution shape leads to better data representation without sacrificing interpretability. The models’ performance on real-world datasets suggests they may be particularly well-suited for applications in engineering reliability, biomedical survival analysis, and materials degradation studies, where multiphase failure mechanisms are present.
In a broader context, these findings contribute to the growing literature on extended-life models and highlight the continuing need for statistical tools capable of capturing various hazardous behaviours. Such tools are crucial for risk assessment, preventive maintenance planning, and decision-making in safety-critical systems.
Future research directions include te following:
Extending the proposed models to more precisely process censored, truncated, and interval data.
Investigation of Bayesian inference methods to incorporate a priori information and quantify uncertainty.
Development of multivariate or concatenated life-cycle models to capture inter-component dependencies.
Incorporation of models into regression frameworks, such as proportional hazards or accelerated failure time models, to assess the impact of interdependent variables.
Overall, this study confirms the practical value of the WDSFI and WDSFII models and opens promising avenues for both theoretical development and applied research in reliability and survival analysis.
7. Conclusions
This article presents a three- and four-parameter flexible modified Weibull LTM called the Weibull distribution with a linear shape function. An innovative idea is to replace the Weibull shape parameter with a shape function. An estimation method based on theoretical and empirical reliability functions is proposed. An extensive literature review was performed, considering the modalities and shapes of the risk rate function. The simulation study is carried out using ML, LS, WLS and LAW methods. Furthermore, to check the suitability and flexibility, the new LTMs are validated with two real datasets and compared with other bimodal LTMs with a bathtub HRF.
The paper shows that even a three-parameter distribution can compete in data modelling with LTM distributions that have two or even almost three times more parameters.
Regarding further research, there is a lot to be accomplished. It appears that there is only one way to succeed—laborious numerical experiments. Research is to be carried out in two directions. The first is to strive to elongate and flatten the central segment of the hazard rate function. To do so, replace a linear shape function with a nonlinear one. Theoretically, any function that is strictly increasing and convex downward may be useful, but probably to different degrees. The second is to carry out Monte Carlo experiments to assess the accuracy of the parameter estimation.