1. Introduction
Reliability studies have gained paramount importance in diverse industrial sectors, including electronics, automotive systems, and automation technologies. Despite its significance, traditional reliability testing often incurs prohibitive costs due to the lengthy durations required to observe failure times under normal operating conditions. For high-reliability products, such as aerospace components or medical devices, failure events may manifest only after years or decades of service, which makes the traditional testing method unfeasible. To address this problem, the accelerated lifetime test (ALT) was introduced by Nelson [
1], revolutionizing reliability experimentation. By subjecting test units to elevated stress levels—such as increased temperature, voltage, or mechanical load—the ALT accelerates failure mechanisms while maintaining failure mode consistency with normal conditions. This approach enables the extrapolation of lifetime characteristics through mathematical modeling of stress-life relationships, significantly reducing both testing time and cost. Accelerated lifetime modeling has been systematically advanced by Meeker [
2] (statistical frameworks for ALT data), Limon [
3] (adaptive stress test design), and Bagdonavicius [
4] (parametric models for degradation processes). In this approach, we establish the test conditions at elevated levels, such as increased current, voltage, temperature, or humidity, to accelerate the lifetime testing of units. There are various methods for designing stress in accelerated lifetime test, including constant stress, step stress [
5], ramp stress [
6], and others. Furthermore, a combination of these methods can be utilized in experiments. However, the constant stress accelerated lifetime test (CSALT) is the most commonly used approach in studies, as it allows for the collection of more accurate and reliable data through a simpler testing process. We can learn more about the optimal constant-stress accelerated life test framework from Ling [
7]. Furthermore, many researchers have explored the CSALT using various distributions. For instance, Mazen Nassar [
8] investigated the exponentiated Rayleigh distribution under the CSALT, while Devendra Kumar [
9] studied the CSALT based on the generalized inverse Lindley distribution.
In this article, we study the Gompertz distribution, first proposed by Gompertz [
10]. It is widely used in survival analysis, actuarial science, and demography to model processes where failure rates increase with time. Many researchers have explored the properties of the Gompertz distribution, including Winsor [
11] and Pollard [
12].
This distribution is notable for its ability to model the increase in human mortality with age, as well as the aging-related failures of equipment or systems over time. These characteristics underpin its broad applicability in reliability studies. The Gompertz distribution has long been a cornerstone in reliability and survival analysis, prized for its ability to model failure rates that increase exponentially over time. However, real-world engineering systems and life data often exhibit more complex failure behaviors that cannot be adequately captured by the monotonic hazard rate of the classical Gompertz model. Recent statistical literature has focused on developing new families of distributions through generator functions. These generators transform a baseline distribution to create richer, more flexible models, often by introducing additional shape parameters. In this study, we adopt the recently proposed Sine-Modified Power-Generated family [
13] to extend the Gompertz distribution. This approach combines the desirable properties of trigonometric functions with a power modification, aiming to create a highly adaptable model for accelerated life testing scenarios. This extension allows for a more rigorous evaluation of our proposed models, effectively testing their robustness and generalization capability against a significantly more flexible distributional framework. The standard Gompertz distribution is inherently constrained by its monotonically increasing hazard rate function, rendering it inadequate for capturing complex failure phenomena such as the bathtub-shaped or unimodal hazard rates characteristic of certain real-world engineering systems. Through the inclusion of an additional parameter, the SMPoG distribution achieves superior flexibility. This enables it to accommodate a wider spectrum of hazard rate patterns—encompassing increasing, decreasing, bathtub, and unimodal trends—making it a more powerful and applicable tool for modeling complex data from accelerated life tests.
This article focuses on statistical inference and establishes several objectives. Firstly, we aim to select an appropriate relationship between stress levels and the shape parameter in the Gompertz distribution. To accomplish this, we employ four estimation methods: maximum likelihood estimation (MLE), least squares estimation (LSE), maximum product spacing estimation (MPSE), and Cramér-von-Mises estimation (CVM). The performance of two models under these estimation methods is compared using their mean squared errors (MSEs), as discussed by Allen [
14]. The foundations and methodologies of MLE, LSE, MPSE, and CVM have been explored extensively in works by Zhang and Wang [
15], Fan and Yu [
16], Abdel Ghaly [
17], and Dey and Nassar [
18,
19], among others. Additionally, we conduct Monte Carlo simulations across different sample sizes—small, medium, and large—to assess the models’ performance under varying conditions. At the same time, quantitative comparison of parameter estimation methods can be studied from the results of Monte Carlo simulations. Insights into Monte Carlo simulation methodology can be found in Harrison [
20]. Secondly, based on the estimations obtained in the first objective, we calculate the shape parameter and reliability function for the two models. This allows us to further evaluate the performance of the models and assess the accuracy of applying high-level estimators to standard conditions. Lastly, we use a real dataset to illustrate the advantages of the better-fitting Gompertz distribution model, highlighting its practical application and reliability in real scenarios.
The structure of this article is organized as follows. In
Section 2, we discuss the fundamental properties of the Gompertz distribution and propose assumptions regarding the relationship between its shape parameter and stress levels.
Section 3 focuses on deriving the objective functions for the four estimation methods—maximum likelihood estimation (MLE), least squares estimation (LSE), maximum product spacing estimation (MPSE), and Cramér-von-Mises estimation (CVM)—and we deduce the formulas for the shape parameter and the reliability function. Furthermore, we conduct three simulations focused on testing models’ robustness, by generating data from the Sine-Modified Power Gompertz (SMPoG) distribution.
Section 4 presents the simulation study for two models under varying sample sizes. Additionally, we evaluate the performance of the shape parameter and reliability function under different scenarios. In
Section 5, a real dataset is analyzed, where we calculate the Kolmogorov–Smirnov (K–S) distance and corresponding p-values, and additionally employ the Akaike Information Criterion (AIC) to evaluate the model’s performance on this dataset. Leave-one-out cross-validation (LOOCV) is further employed to rigorously evaluate the robustness and generalizability of the models. By iteratively holding out each sample as a validation set and training the model on the remaining data, this approach effectively leverages the limited dataset to assess how well the models perform on unseen data. To quantify predictive performance and intuitively distinguish the differences among estimation methods, we adopt three key metrics: mean absolute error (MAE) to measure average prediction bias, root mean squared error (RMSE) to emphasize larger deviations, and coefficient of variation (CV) to evaluate the stability of prediction errors. These metrics collectively provide a comprehensive view of both the accuracy and consistency of each method, reinforcing the reliability of our findings regarding model generalizability. Finally, in the concluding
Section 6, we summarize our findings and discuss model selection based on the Gompertz distribution.
3. Different Methods for Estimating Parameters in Two Models
In this section, four methods, including (1) the method of maximum likelihood, (2) the method of least squares, (3) the method of maximum product of spacing, and (4) the method of Cramér-von-Mises, are employed in parameter estimation under two different models. Sanku Dey’s study [
21] provides a comprehensive investigation of estimation methodologies for the Gompertz distribution, offering valuable insight into parameter estimation method.
3.1. Maximum Likelihood Method
Maximum likelihood method is widely used in parameters’ estimation. Based on (
1), we derive the joint maximum likelihood function as
where
d represents
The log-likelihood function, denoted by
, is the natural logarithm of the likelihood function
:
We can obtain the MLEs (results estimated by maximum likelihood method) of a, b, c in Model 1 by solving the following equations.
MLEs of a, b, c in Model 2 can be obtained by solving the following equations.
However, since the above equations do not have closed-form solutions, numerical techniques are required to address these problems.
3.2. Method of Least Squares
The goal of the least squares method is to minimize the sum of the squared differences between the model’s predicted values and the actual observed data points. Within the framework of accelerated life tests under the Gompertz distribution, the empirical distribution function is approximated using the estimator
, which represents the cumulative proportion of failures observed at the i-th ordered failure time under the j-th stress level. This approximation is widely adopted in statistical inference for right-censored lifetime data analysis, providing a consistent and asymptotically efficient estimator of the underlying distribution function. The estimations can be obtained by minimizing the following formulas:
The LSEs (results estimated by method of least squares) under Model 1 can be obtained by solving the following three nonlinear equations:
where
The LSEs under Model 2 can be obtained by solving the following three nonlinear equations:
where
All of the above equations do not have closed-form solutions. We need to learn a numerical technique from Gill [
22], who studied algorithms for solving linear least-squares problems.
3.3. Method of Maximum Product of Spacings
The maximum product of spacings estimates parameters by maximizing the geometric mean of uniform spacings, defined as the differences between consecutive cumulative distribution function (CDF) values of ordered data. For a random sample
, let
denote the order statistics. The spacings are computed as
where
is the theoretical CDF,
, and
. In the end, we need to maximize
. For the
j-th stress group (
), let
denote the order statistics of the observed lifetimes
. These ordered data facilitate the computation of spacings.
which is used in the method of maximum product of spacings to estimate parameters across stress levels.
The cumulative function for two models are as follows:
Furthermore, we derive
(
calculated in Model 1) and
(
calculated in Model 2):
The final objective formulas, which are expressed as
and
, are derived as follows:
The MPSEs (results estimated by method of maximum product of spacings) under Model 1 can be obtained by solving the following equations:
The MPSEs under Model 2 can be obtained by solving the following equations:
where
,
,
,
,
,
have been defined as (
23)–(
28).
3.4. Method of Cramér-Von Mises
The Cramér–von Mises criterion is a minimum-distance estimation approach, which minimizes the difference between the observed empirical distribution and the theoretical distribution. The feasibility of this method has been demonstrated by Macdonald [
23]. The Cramér–von Mises estimators (CVMs) are obtained by minimizing the following objective function:
where
is the empirical distribution function and
is the theoretical cumulative distribution function parameterized by
.
Based on the aforementioned theories, the objective function to be minimized can be expressed as follows:
CVMs (results estimated by method of Cramér-Von Mises) under Model 1 can be obtained by solving the following equations:
CVMs under Model 2 can be obtained by solving the following equations:
where
,
,
,
,
,
have been defined as (
23)–(
28).
3.5. Inference on Shape Parameter and Reliability Prediction
Finally, the estimators of
for the two models are derived from the MLEs, LSEs, MPSEs, and CVMs of parameters
a and
c, where
follows the stress-lifetime relationship in Gompertz:
where
and
represent the estimates of the parameters
a and
b, obtained through the various estimation methods discussed earlier.
Under normal stress level
, we can obtain the shape parameter from the following formulas:
Furthermore, the reliability under usual condition
at the lifetime
can be predicted by
3.6. Estimation for SMPoG Distribution via Grid Search
To enhance modeling flexibility, we now consider the case where the lifetime data follow the proposed SMPoG distribution. The full parameter vector is now . From the above theories of four estimation methods, we can conclude the objective functions as follows.
The total likelihood function is the product of the individual SMPoG pdfs:
where
is determined by the respective stress model. The corresponding log-likelihood function is:
The expression for
is highly non-linear, as defined in (
6). We now specify the log-likelihood for our two stress-life models.
For Model 1, the log-likelihood function
is:
For Model 2, the log-likelihood function
is
The LSEs are obtained by minimizing the sum of squared differences between the SMPoG theoretical CDF,
, and the empirical CDF. For the Model 1, the objective function of the least squares method is
For Model 2, the LS objective function is
The objective function
of MPSE must be maximized numerically. For Model 1, the objective function to be maximized is
For Model 2, the objective function is
In both cases,
is the CDF of the SMPo-Gompertz distribution as defined (
5), with the convention that
and
. Due to the complexity of the CDF, these objective functions can only be maximized using numerical algorithms.
Lastly, the CVM objective function is adapted by replacing the baseline Gompertz CDF with the SMPoG CDF,
. The objective function for the Model 1 is
The objective function for the Model 2 is
The CVMs are the parameter values that minimize these C functions. As with the other methods, finding these estimates requires numerical optimization.
As established in the preceding sections, the objective functions for all four estimation methods (MLE, LS, MPS, and CVM) are highly non-linear with respect to the full parameter vector . Direct multi-dimensional optimization of these functions can be computationally challenging and prone to convergence issues, particularly due to the complex, non-linear influence of the parameter . The likelihood surface may exhibit multiple local optima or large, flat regions, making it difficult for standard optimization algorithms to locate the global optimum reliably.
To address this challenge and enhance the stability of our estimation procedure, we employ a ‘hybrid approach combining grid search with numerical optimization’ [
16]. This strategy decouples the estimation of the most challenging parameter,
, from the others. The procedure is implemented as follows Algorithm 1:
Algorithm 1 Grid search for parameter estimation |
- 1:
Define a Grid for : Establish a discrete grid of plausible candidate values for as , where each . - 2:
Conditional Optimization: For each :
- 3:
Select the Global Optimum: Collect all candidate solutions and their corresponding minimized objective values . Determine the final estimate as the parameter set with the smallest :
|
This grid search approach systematically explores the parameter space for , significantly increasing the probability of finding the global optimum and improving the robustness and convergence of the overall estimation process. The results presented in our simulation study are all obtained using this technique.
4. Simulation Study
The entirety of the simulation study and associated computations were executed using the R (R version 4.3.2) statistical programming language. The primary method for parameter estimation is numerical optimization via the Nelder–Mead algorithm, as implemented in the optim function.
We carry out a simulation study to compare the fit of two models applied to the Gompertz distribution. Additionally, we evaluate the performance of four different parameter estimation including MLEs, LSEs, MPSEs, CVMs by analyzing the estimated values and their corresponding MSEs. For the study, we consider three levels of stress (, , ) and vary the initial values of parameters a, c, while initial value of b is 0.5.
The simulations are conducted by selecting sample sizes , , and , with N being equally allocated across each stress level. Each simulation is replicated 1000 times to compute the average estimation results and the MSEs for each estimator. The detailed simulation process is carried out in the following steps:
Set the initial values of a, b, c, stress level
, simulation scale
N, and the two different forms of
, which can be seen from (
39), (
40).
Generate N random samples from the Gompertz distribution under the CSALT, and assign N equally to each stress level.
Obtain the MLEs, LSEs, MPSEs, and CVMs under the two models.
Replicate steps 1 to 3 a total of 1000 times, and then calculate the mean squared errors for the MLEs, LSEs, MPSEs, and CVMs.
Under Model 1, MLE performances best with lowest MSEs regardless of sample size. At the same time, the four methods demonstrate comparable performance under Model 2.
Under Model 1, estimations of b perform better with lower MSEs, and the MSEs of are all low. In Model 2, MSEs of c are higher than MSEs of other parameters.
Comparing MSEs in two models, we find that MSEs of Model 2 are significantly smaller than Model 1, which can prove that Model 2 fits the Gompertz distribution better under the CSALT.
From
Table 1,
Table 2,
Table 3,
Table 4,
Table 5,
Table 6, we can find that
increases with increasing stress levels, from which we can explain the increasing product failure rate with rising stress levels.
The MSEs decrease as N increases from 30 to 90 across all estimation methods, accurately highlighting the improved accuracy and stability achieved with larger sample sizes.
For a normal stress level of
, we conduct six sets of simulations using different initial values for parameters
a and
c. From the results presented in
Figure 2 (results of Model 1), it is observed that, for the same value of
c, the predicted value of
decreases as
a decreases. Similarly, for the same value of
a, the predicted value of
also decreases with decreasing
c.
In
Figure 2 (results of Model 2), it is evident that, for the same value of
c, the predicted value of
increases with increasing
a. In contrast, for the same value of
a, the predicted value of
decreases with increasing
c due to
. Comparing the results from the two models, it is clear that Model 2 provides more stable results and reflects the variations in
with changes in
a and
c more accurately. From this perspective, we can conclude that Model 2 is a better fit for the Gompertz distribution under accelerated life testing (ALT).
Based on the estimators obtained for
, we further calculate the reliability at
, which can be seen in
Table 7, and we find that reliability decreases significantly with increasing
. However, under Model 2, reliability exhibits greater stability during longer test durations. This suggests that Model 2 enables more effective reliability analysis over extended time periods. Thus, further reliability analyses based on Model 2 could provide more information over a longer testing period.
To further test the robustness and generalization capability of our proposed stress-life models, we conducted an additional set of simulations where the lifetime data were generated from the SMPoG distribution. The SMPoG distribution includes an additional shape parameter, , which allows for more complex hazard rate shapes beyond the standard Gompertz model. This part of the study aims to see how well the simple linear model (Model 1) and the inverse power-law model (Model 2) can approximate the underlying stress-life relationship when the true data generating process is more complex.
We designed three distinct scenarios, each considering three stress levels (
,
,
) with 50 samples allocated to each stress level, to evaluate the models under varying conditions. The performance was assessed based on the mean squared error (MSE) of the parameter estimates, with lower MSEs indicating higher accuracy. The simulation results, presented in
Table 8,
Table 9,
Table 10 deliver a clear and consistent message. Across all three scenarios—spanning moderate complexity, higher baseline complexity, and high stress-acceleration effects—the inverse power-law model (Model 2) consistently and decisively outperformed the simple linear model (Model 1). In every case, the MSEs for the key parameters (a, b, c).
And the derived shape parameters
under the inverse power-law model were significantly lower than those of the linear model. The performance gap was particularly stark in
Table 10, where the linear model’s rigid structure failed to capture the strong non-linear stress effect, resulting in extremely high estimation errors. In contrast, the inverse power-law model remained robust and accurate throughout.
This systematic outperformance strongly suggests that the Power Law relationship is a more generalizable and reliable choice for modeling the connection between stress and lifetime parameters. Its inherent flexibility allows it to better approximate complex, real-world failure mechanisms, making it the recommended model for practical applications where the underlying stress-life relationship is unknown.
The effect of sample size on parameter estimation was examined from two perspectives: the shape of the sampling distribution and the normality of model residuals.
5. Real Data Analysis
We obtained a real dataset from the Reliasoft website (
http://www.reliasoft.com/newsletter/lq2002/qalt.htm, (accessed on 6 January 2025)), which has been previously analyzed in studies conducted by Abdel Ghaly [
17] and Mazen Nassar [
19]. This dataset records complete failure times, and therefore no censoring is present. We use liquid density as the stress levels (1 g/mL and 1.4 g/mL) and record the failure times (in years) of a pump’s motor. The collected data are presented in
Table 11. We analyze the data by calculating the Kolmogorov–Smirnov (K–S) distances and corresponding p-values for each stress level.
Table 12 demonstrates the strong overall performance of the model at two stress levels, 1 g/mL and 1.4 g/mL. For 1 g/mL, the K–S distance is 0.3113, accompanied by a
p-value of 0.2868, indicating a good fit between the distributions. At 1.4 g/mL, the model shows even greater fitness, with a smaller K–S distance of 0.1928 and a significantly higher
p-value of 0.7854, reflecting a closer match. While both stress levels exhibit satisfactory results, 1.4 g/mL exhibits a superior fit. In this study, we only use Model 2 to estimate the parameters.
We use the four previously mentioned methods to estimate the parameters
a,
b,
c,
, and
of the Gompertz distribution under Model 2. A summary of these estimation results is provided in
Table 13. The comparison between the theoretical cumulative distribution curves and empirical data points under varying stress conditions is visually apparent in
Figure 6, which directly demonstrates the fitness of the Gompertz model. To further assess the performance of different estimation methods for Model 2, we conducted AIC (Akaike Information Criterion) [
24] comparisons. As shown in
Table 14, the MLE method yields the highest Log−Likelihood value (27.44) and the smallest AIC value (−48.89) among the four methods, indicating a relatively better fit in terms of this criterion. Specifically, the AIC values for LSE, MPSE, and CVM are −45.12, −45.78, and −47.25, respectively, which are larger (less negative) than that of MLE. This AIC comparison provides additional evidence for evaluating the suitability of each estimation method when applied to Model 2.
To assess the generalization performance of models and the performance of different parameter estimation methods, we conducted leave- one-out cross-validation and analyzed metrics like MAE, MSE, RMSE, and CV, which can be learned from
Table 15. Regarding method performance, MLE excels in accuracy, boasting the lowest MSE (0.00500) and RMSE (0.0707), which means it minimizes both average and extreme prediction errors. MPSE shows the best stability with the smallest CV (0.770), yet it lags in accuracy as indicated by higher MAE, MSE, and RMSE. LSE has a slightly lower MAE (0.0609) than MLE but is less effective in reducing overall error as seen from MSE and RMSE. CVM falls behind MLE in both accuracy and stability. Overall, MLE achieves a favorable balance between accuracy and stability, making it the most suitable method for this context while the model itself proves to have good generalization capability.
Furthermore, we calculate the K–S distance along with the corresponding
p-value of four methods for each dataset unit. We show the results in
Table 16. At stress level 1.0, the CVM method performs best, with the lowest K–S distance (0.2871) and the highest
p-value (0.3180). The MLE and LSE methods show similar and relatively good performance, with slightly higher K–S distances and lower p-values. The MPSE method performs the worst, having the highest K–S distance (0.4567) and the lowest
p-value (0.0200). At stress level 1.4, the LSE method achieves smallest K–S distance (0.1705) and the highest
p-value (0.8880). The CVM method ranks second, with a low K–S distance (0.2196) and a high
p-value (0.6448). The MLE and MPSE methods show relatively weaker performance. Overall, the CVM method is the best fit at lower stress levels, while the LSE method performs best at higher stress levels. The MPSE method shows poor fitting results under both stress levels. In conclusion, a normal stress level of
is set to estimate the system parameters under normal conditions. The reliability of the system at
(in years) can be obtained in
Table 17, which has been shown in
Figure 7 more directly.