1. Introduction
In 1999, Reference [1] first formally introduced the generalized exponential distribution. It can also be regarded as a particular member of the three-parameter Exponentiated Weibull (EW) distribution, introduced by Reference [2]. Furthermore, its origins can be traced back to the early work of Gompertz and Verhulst, who introduced cumulative distribution functions for modeling human mortality and population growth. Interested readers can refer to References [3,4] for more details.
The probability density function (PDF) and cumulative distribution function (CDF) of the generalized exponential distribution, with shape parameter $\alpha > 0$ and rate parameter $\lambda > 0$, can be expressed as:

$$f(x; \alpha, \lambda) = \alpha \lambda e^{-\lambda x}\left(1 - e^{-\lambda x}\right)^{\alpha - 1}, \qquad x > 0,$$

$$F(x; \alpha, \lambda) = \left(1 - e^{-\lambda x}\right)^{\alpha}, \qquad x > 0.$$
The hazard rate and reliability functions can be expressed as:

$$h(x; \alpha, \lambda) = \frac{\alpha \lambda e^{-\lambda x}\left(1 - e^{-\lambda x}\right)^{\alpha - 1}}{1 - \left(1 - e^{-\lambda x}\right)^{\alpha}}, \qquad S(x; \alpha, \lambda) = 1 - \left(1 - e^{-\lambda x}\right)^{\alpha}, \qquad x > 0.$$
Figure 1 illustrates the PDF plots for the chosen parameter values. Based on Figure 1, the density function demonstrates different behaviors depending on $\alpha$. For $\alpha \le 1$, it is monotonically decreasing, whereas for $\alpha > 1$, it becomes unimodal and right-skewed, similar to the Weibull or Gamma distributions. The hazard rate plots with different parameter values are shown in Figure 2. They show that the hazard function decreases when $\alpha < 1$, remains constant when $\alpha = 1$, and increases when $\alpha > 1$. Notably, the hazard function eventually converges to the value of $\lambda$.
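As a quick numerical check of this limiting behavior, the following R snippet (with illustrative function names, not code from our implementation) evaluates the hazard at increasing arguments:

```r
## Numerical check that the GE hazard converges to lambda; dge, pge and
## hge are illustrative definitions of the density, CDF and hazard.
dge <- function(x, a, l) a * l * exp(-l * x) * (1 - exp(-l * x))^(a - 1)
pge <- function(x, a, l) (1 - exp(-l * x))^a
hge <- function(x, a, l) dge(x, a, l) / (1 - pge(x, a, l))
hge(c(1, 10, 50), a = 2, l = 0.5)   # increases toward lambda = 0.5
```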
There are some interesting physical interpretations of the generalized exponential distribution. When $\alpha = n$ is a positive integer greater than 1, it defines the distribution of the maximum of $n$ independent and identically exponentially distributed random variables. This distribution applies to situations such as the overall lifetime of a parallel system of electronic components, each with an exponential lifetime distribution. In contrast, for a series system, the overall lifetime corresponds to the minimum of the exponentially distributed lifetimes, which follows an exponential distribution with rate parameter $n\lambda$. Figure 3 presents a schematic representation of both series and parallel systems.
The generalized exponential distribution has gained significant scholarly interest due to its practical importance, particularly in situations involving censored data. This attention has led to a wealth of research on its statistical properties. Reference [1] conducted a comprehensive study of its properties, addressing maximum likelihood estimation (MLE) along with its asymptotic behavior and issues related to hypothesis testing. Reference [5] compared the generalized exponential distribution with the Weibull and Gamma distributions, demonstrating its advantages in modeling skewed life data. Further, Reference [6] investigated various parameter estimation methods and evaluated their performance. Reference [7] studied estimation issues for the generalized exponential distribution under hybrid censoring, utilizing the EM algorithm to compute MLEs and importance sampling to derive Bayesian estimates. The Bayesian estimation of the parameters using doubly censored samples was derived by Reference [8]. Reference [9] applied both classical and Bayesian approaches to the estimation of the parameters as well as the reliability and hazard functions based on adaptive Type-II progressive censored data. For more detailed results and examples related to the generalized exponential distribution, please refer to References [10,11,12,13].
However, research on Bayesian prediction problems remains relatively scarce for the generalized exponential distribution. Prediction is an important aspect of life-testing experiments, where data from an existing sample are used to make inferences about future observations. Several studies have explored prediction problems in various contexts. For instance, Reference [14] provided a comprehensive and detailed review of Bayesian prediction techniques and their diverse applications. Additionally, Reference [15] studied one-sample and two-sample Bayesian prediction problems for the inverse Weibull distribution under Type-II censoring. Similarly, Reference [16] investigated the Poisson-Exponential distribution by employing both classical and Bayesian approaches, providing point predictions and credible predictive intervals.
This paper focuses on both one-sample and two-sample prediction problems. In the one-sample case, let $X = (X_{(1)}, \ldots, X_{(r)})$ represent the observed informative sample and $X_{(r+1)}, \ldots, X_{(n)}$ denote the future sample; the aim is to predict and make inferences about the future order statistics $X_{(k)}$ for $r < k \le n$. In the two-sample case, the observed sample of size r is denoted by $X$, and the future ordered sample of size m is represented by $Y_{(1)}, \ldots, Y_{(m)}$. These two samples come from the same distribution and are assumed to be independent, enabling predictions and inferences for the order statistics $Y_{(k)}$, $1 \le k \le m$.
The primary objectives of this study are twofold. Firstly, we aim to obtain the MLEs and Bayesian estimates under the Type-II censoring scheme and to compare their performance. Secondly, we seek to derive the Bayesian posterior predictive density for future order statistics and, based on this posterior predictive density, compute both point predictions and predictive intervals.
This paper proceeds as follows: In Section 2, the EM algorithm is employed to approximate the MLEs of the model. For interval estimation, the Fisher information matrix is used to construct asymptotic confidence intervals (ACIs). Additionally, two bootstrap-based interval estimation methods are explored as comparative alternatives to evaluate their performance. Section 3 presents Bayesian parameter estimation under three different loss functions, utilizing Gibbs sampling and importance sampling techniques. HPD credible intervals are further obtained from the Gibbs sampling results. Section 4 focuses on Bayesian point predictions and the computation of predictive intervals for future order statistics. In Section 5, simulation studies are conducted to evaluate the performance of the proposed methodologies, followed by an analysis of a real-world dataset related to deep groove ball bearings, which is discussed in Section 6. Finally, we present conclusions in Section 7.
2. Maximum Likelihood Estimation
Let $X = (X_{(1)}, \ldots, X_{(r)})$ denote a sample containing the first r observations obtained from the generalized exponential distribution under a Type-II censoring scheme in which n units are placed on test and the experiment terminates at the r-th failure. The likelihood function associated with this sample is given by:

$$L(\alpha, \lambda \mid X) = \frac{n!}{(n-r)!}\left[\prod_{i=1}^{r} f\left(x_{(i)}; \alpha, \lambda\right)\right]\left[1 - F\left(x_{(r)}; \alpha, \lambda\right)\right]^{n-r}.$$

Neglecting the constant term, the log-likelihood function is:

$$\ell(\alpha, \lambda) = r\log\alpha + r\log\lambda - \lambda\sum_{i=1}^{r} x_{(i)} + (\alpha - 1)\sum_{i=1}^{r}\log\left(1 - e^{-\lambda x_{(i)}}\right) + (n-r)\log\left[1 - \left(1 - e^{-\lambda x_{(r)}}\right)^{\alpha}\right]. \tag{6}$$
To compute the MLEs, we differentiate (6) with respect to $\alpha$ and $\lambda$ and set the derivatives to zero. The partial derivatives are:

$$\frac{\partial \ell}{\partial \alpha} = \frac{r}{\alpha} + \sum_{i=1}^{r}\log\left(1 - e^{-\lambda x_{(i)}}\right) - (n-r)\,\frac{\left(1 - e^{-\lambda x_{(r)}}\right)^{\alpha}\log\left(1 - e^{-\lambda x_{(r)}}\right)}{1 - \left(1 - e^{-\lambda x_{(r)}}\right)^{\alpha}} = 0,$$

$$\frac{\partial \ell}{\partial \lambda} = \frac{r}{\lambda} - \sum_{i=1}^{r} x_{(i)} + (\alpha - 1)\sum_{i=1}^{r}\frac{x_{(i)} e^{-\lambda x_{(i)}}}{1 - e^{-\lambda x_{(i)}}} - (n-r)\,\frac{\alpha\, x_{(r)} e^{-\lambda x_{(r)}}\left(1 - e^{-\lambda x_{(r)}}\right)^{\alpha - 1}}{1 - \left(1 - e^{-\lambda x_{(r)}}\right)^{\alpha}} = 0.$$
However, solving these equations analytically is not feasible. Therefore, numerical methods, such as the EM algorithm, are recommended. Initially introduced in Reference [17], the EM algorithm is a suitable approach for this scenario.
Under the Type-II censoring framework, the observed data, represented as $X = (X_{(1)}, \ldots, X_{(r)})$, correspond to the first r observations. The remaining data points, which are censored and unobserved, are denoted as $Z = (Z_1, \ldots, Z_{n-r})$. Together, these two components form the complete sample W, where $W = (X, Z)$.
When constants are excluded, the log-likelihood function for the complete sample W, represented as $\ell_c(W; \alpha, \lambda)$, can be expressed as:

$$\ell_c(W; \alpha, \lambda) = n\log\alpha + n\log\lambda - \lambda\sum_{i=1}^{n} w_i + (\alpha - 1)\sum_{i=1}^{n}\log\left(1 - e^{-\lambda w_i}\right). \tag{7}$$
The EM algorithm iteratively computes the MLEs by alternating between the E-step and the M-step. In the E-step, the expectation of the complete log-likelihood function is calculated given the observed data X and the current parameter estimates; these estimates are then updated in the M-step by maximizing the expected complete log-likelihood function. Therefore, we first derive the expression for the pseudo-log-likelihood function as follows:

$$\ell_s(\alpha, \lambda) = n\log\alpha + n\log\lambda - \lambda\left[\sum_{i=1}^{r} x_{(i)} + (n-r)E_1\right] + (\alpha - 1)\left[\sum_{i=1}^{r}\log\left(1 - e^{-\lambda x_{(i)}}\right) + (n-r)E_2\right], \tag{11}$$

where

$$E_1 = E\left[Z \mid Z > x_{(r)}\right], \qquad E_2 = E\left[\log\left(1 - e^{-\lambda Z}\right) \mid Z > x_{(r)}\right],$$

with both expectations evaluated at the current parameter estimates.
In the M-step, suppose the estimates of $\alpha$ and $\lambda$ at the j-th iteration are denoted as $\alpha^{(j)}$ and $\lambda^{(j)}$, respectively. The new estimates can be calculated by maximizing (11), with $E_1$ and $E_2$ evaluated at $\left(\alpha^{(j)}, \lambda^{(j)}\right)$. Here, we introduce an application of the iteration method similar to that proposed by Reference [5]. In order to maximize (11), we first take the partial derivatives with respect to $\alpha$ and $\lambda$ and set them to zero:

$$\frac{\partial \ell_s}{\partial \alpha} = \frac{n}{\alpha} + \sum_{i=1}^{r}\log\left(1 - e^{-\lambda x_{(i)}}\right) + (n-r)E_2 = 0, \tag{12}$$

$$\frac{\partial \ell_s}{\partial \lambda} = \frac{n}{\lambda} - \sum_{i=1}^{r} x_{(i)} - (n-r)E_1 + (\alpha - 1)\sum_{i=1}^{r}\frac{x_{(i)} e^{-\lambda x_{(i)}}}{1 - e^{-\lambda x_{(i)}}} = 0. \tag{13}$$
From (12), the estimate of $\alpha$ at the $(j+1)$-th iteration can be determined as follows:

$$\alpha^{(j+1)} = \frac{-n}{\sum_{i=1}^{r}\log\left(1 - e^{-\lambda x_{(i)}}\right) + (n-r)E_2}. \tag{14}$$

Thus, the maximization of (11) can be achieved by solving the corresponding fixed-point equation:

$$\lambda = g(\lambda), \tag{15}$$

where

$$g(\lambda) = n\left[\sum_{i=1}^{r} x_{(i)} + (n-r)E_1 - \left(\alpha^{(j+1)} - 1\right)\sum_{i=1}^{r}\frac{x_{(i)} e^{-\lambda x_{(i)}}}{1 - e^{-\lambda x_{(i)}}}\right]^{-1}.$$

After computing $\lambda^{(j+1)}$, the corresponding value of $\alpha^{(j+1)}$ can be determined using the relation (14).
The iterative procedure described in Equations (14) and (15) is deemed to have converged when the following inequality holds:

$$\left|\alpha^{(j+1)} - \alpha^{(j)}\right| + \left|\lambda^{(j+1)} - \lambda^{(j)}\right| < \varepsilon,$$

where $\varepsilon$ denotes a small positive threshold to ensure accuracy and stability. The above process of the EM algorithm is summarized in Algorithm 1:
Algorithm 1 EM Algorithm for MLEs under Type-II Censoring
1: Input: Initial values $\alpha^{(0)}$, $\lambda^{(0)}$; threshold $\varepsilon$.
2: Initialization: Set $j = 0$.
3: repeat
4: Calculate the expectations $E_1$ and $E_2$ at $\left(\alpha^{(j)}, \lambda^{(j)}\right)$.
5: Solve the fixed-point equation to obtain $\lambda^{(j+1)}$ using Equation (15).
6: Update $\alpha^{(j+1)}$ using $\lambda^{(j+1)}$ from Equation (14).
7: Increment $j \leftarrow j + 1$.
8: until the convergence criterion is satisfied.
9: Output: the MLEs $\hat{\alpha}$ and $\hat{\lambda}$.
In Section 5, when applying the EM algorithm for MLEs under Type-II censoring, the MLEs obtained from complete samples are used as the initial values.
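For concreteness, the following R sketch outlines one possible implementation of Algorithm 1, assuming the E-step expectations $E_1$ and $E_2$ are evaluated by numerical integration and the M-step uses the updates (14) and (15); all names are illustrative rather than taken from our code.

```r
## A minimal EM sketch for GE(alpha, lambda) under Type-II censoring:
## `x` holds the r smallest of n lifetimes.
em_ge_typeII <- function(x, n, alpha0 = 1, lambda0 = 1,
                         eps = 1e-6, max_iter = 500) {
  r  <- length(x)
  xr <- max(x)                                  # censoring point x_(r)
  alpha <- alpha0; lambda <- lambda0
  for (j in seq_len(max_iter)) {
    ## E-step: conditional density of a censored lifetime Z given Z > x_(r)
    Sxr <- 1 - (1 - exp(-lambda * xr))^alpha
    fz  <- function(z) alpha * lambda * exp(-lambda * z) *
      (1 - exp(-lambda * z))^(alpha - 1) / Sxr
    E1 <- integrate(function(z) z * fz(z), xr, Inf)$value
    E2 <- integrate(function(z) log(1 - exp(-lambda * z)) * fz(z),
                    xr, Inf)$value
    ## M-step: closed-form update for alpha as in (14) ...
    alpha_new <- -n / (sum(log(1 - exp(-lambda * x))) + (n - r) * E2)
    ## ... and one fixed-point step for lambda as in (15)
    lambda_new <- n / (sum(x) + (n - r) * E1 -
                       (alpha_new - 1) *
                       sum(x * exp(-lambda * x) / (1 - exp(-lambda * x))))
    if (abs(alpha_new - alpha) + abs(lambda_new - lambda) < eps) break
    alpha <- alpha_new; lambda <- lambda_new
  }
  c(alpha = alpha_new, lambda = lambda_new)
}
```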
2.1. Fisher Information Matrix
Reference [18] introduces several core concepts closely related to the Fisher information matrix. Building on this foundation, this subsection describes the approach for obtaining the observed information matrix, following the missing-value principles outlined in Reference [19], which is then used to construct confidence intervals.
Specifically, let $\theta = (\alpha, \lambda)$, and let $I_X(\theta)$ represent the observed information matrix, $I_{W|X}(\theta)$ correspond to the missing information matrix, and $I_W(\theta)$ indicate the complete information matrix. These matrices are related as follows:

$$I_X(\theta) = I_W(\theta) - I_{W|X}(\theta).$$
The missing information matrix, representing the Fisher information associated with the censored data, is expressed as:

$$I_{W|X}(\theta) = (n-r)\, E_{Z \mid Z > x_{(r)}}\left[-\frac{\partial^2 \log f_Z\left(z \mid Z > x_{(r)}; \theta\right)}{\partial \theta\, \partial \theta^{\top}}\right],$$

where $f_Z\left(z \mid Z > x_{(r)}; \theta\right)$ denotes the conditional density of a censored lifetime given that it exceeds $x_{(r)}$.
The complete information matrix, based on the log-likelihood function (7), is then given by:

$$I_W(\theta) = E\left[-\frac{\partial^2 \ell_c(W; \alpha, \lambda)}{\partial \theta\, \partial \theta^{\top}}\right].$$
The details of the elements within these two matrices are elaborated in Appendix A. We can then easily calculate $I_X(\theta)$ from the derived components of $I_W(\theta)$ and $I_{W|X}(\theta)$. The variance-covariance matrix corresponding to the MLEs, $\hat{\alpha}$ and $\hat{\lambda}$, is then approximated by taking the inverse of $I_X(\hat{\theta})$, represented as $I_X^{-1}(\hat{\theta})$.
On the basis of this approximation, the $100(1-\gamma)\%$ asymptotic confidence intervals for $\alpha$ and $\lambda$ are given as:

$$\hat{\alpha} \pm z_{\gamma/2}\sqrt{\widehat{\operatorname{Var}}(\hat{\alpha})}, \qquad \hat{\lambda} \pm z_{\gamma/2}\sqrt{\widehat{\operatorname{Var}}(\hat{\lambda})},$$

where $\widehat{\operatorname{Var}}(\hat{\alpha})$ and $\widehat{\operatorname{Var}}(\hat{\lambda})$ represent the diagonal elements of the inverse observed information matrix $I_X^{-1}(\hat{\theta})$. The critical value $z_{\gamma/2}$ corresponds to the upper $\gamma/2$ quantile of the standard normal distribution.
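As a hedged illustration, the interval computation can be sketched in R by approximating the observed information directly with a numerical Hessian of the censored log-likelihood (via the numDeriv package), in place of the analytic components of Appendix A; the function names are ours.

```r
## Asymptotic CIs from a numerically approximated observed information.
library(numDeriv)

loglik_cens <- function(par, x, n) {
  a <- par[1]; l <- par[2]
  if (a <= 0 || l <= 0) return(-Inf)
  r <- length(x); xr <- max(x)
  sum(log(a) + log(l) - l * x + (a - 1) * log(1 - exp(-l * x))) +
    (n - r) * log(1 - (1 - exp(-l * xr))^a)
}

aci_ge <- function(mle, x, n, level = 0.95) {
  I_obs <- -hessian(loglik_cens, mle, x = x, n = n)  # observed information
  V     <- solve(I_obs)                              # asymptotic covariance
  z     <- qnorm(1 - (1 - level) / 2)
  cbind(lower = mle - z * sqrt(diag(V)),
        upper = mle + z * sqrt(diag(V)))
}
```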
2.2. Bootstrap Methods
The asymptotic confidence interval approach is based on the assumption that the estimators follow a normal distribution, an assumption that holds when the sample size is adequately large. Nevertheless, in practice, the sample size is often limited. To address this, we propose two bootstrap techniques. The first is the percentile bootstrap (Boot-p) method [20], which generates bootstrap samples by resampling the observed data. The second method, the bootstrap-t (Boot-t) approach [21], also generates bootstrap samples but computes a bootstrap-t statistic that adjusts the estimates using their standard deviation. Algorithms 2 and 3 present the procedures for implementing the Boot-p and Boot-t methods, respectively. One potential direction for future research involves exploring the methods proposed in References [22,23] for identifying extreme values, which may help improve the accuracy of the bootstrap method.
Algorithm 2 Percentile Bootstrap (Boot-p) Method
1: Input: the Type-II censored sample X; the number of bootstrap replications M; the censoring scheme R.
2: Compute the MLEs $(\hat{\alpha}, \hat{\lambda})$ for the generalized exponential distribution using the censored sample X.
3: for $i = 1$ to M do
4: Generate a bootstrap sample from the generalized exponential distribution parameterized by $(\hat{\alpha}, \hat{\lambda})$ under the censoring scheme R.
5: Calculate the MLEs $\hat{\theta}^{*(i)}$ using the bootstrap sample.
6: end for
7: Arrange the bootstrap estimates in ascending order to obtain the sorted set $\hat{\theta}^{*(1)} \le \cdots \le \hat{\theta}^{*(M)}$, where $\hat{\theta}$ is the MLE obtained from the original sample X and $\hat{\theta}^{*(i)}$ is the i-th bootstrap estimate of $\hat{\theta}$.
8: Determine the percentile values for the confidence interval: $\hat{\theta}^{*([M\gamma/2])}$ and $\hat{\theta}^{*([M(1-\gamma/2)])}$, where $[\cdot]$ denotes the greatest integer less than or equal to its argument.
9: Output: The Boot-p confidence interval: $\left(\hat{\theta}^{*([M\gamma/2])},\, \hat{\theta}^{*([M(1-\gamma/2)])}\right)$.
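A compact R sketch of Algorithm 2 is given below; it reuses the illustrative em_ge_typeII() from above together with a simple inverse-CDF generator for Type-II censored GE samples.

```r
## Percentile bootstrap (Boot-p) sketch for GE under Type-II censoring.
rge_typeII <- function(n, r, alpha, lambda) {
  u <- runif(n)
  x <- -log(1 - u^(1 / alpha)) / lambda   # GE quantile transform
  sort(x)[1:r]                            # keep the r smallest lifetimes
}

boot_p_ci <- function(x, n, M = 1000, level = 0.95) {
  mle <- em_ge_typeII(x, n)
  est <- t(replicate(M, {
    xb <- rge_typeII(n, length(x), mle["alpha"], mle["lambda"])
    em_ge_typeII(xb, n, mle["alpha"], mle["lambda"])
  }))
  ## order the bootstrap MLEs and read off the percentile bounds
  apply(est, 2, quantile, probs = c((1 - level) / 2, (1 + level) / 2))
}
```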
Algorithm 3 Bootstrap-t (Boot-t) Method
1: Input: the Type-II censored sample X; the number of bootstrap replications M; the censoring scheme R.
2: Compute the MLEs $(\hat{\alpha}, \hat{\lambda})$ for the generalized exponential distribution using the censored sample X.
3: for $i = 1$ to M do
4: Generate a bootstrap sample from the generalized exponential distribution parameterized by $(\hat{\alpha}, \hat{\lambda})$ under the censoring scheme R.
5: Calculate the MLEs $\hat{\theta}^{*(i)}$ using the bootstrap sample.
6: Calculate the bootstrap-t statistic for the parameter $\theta$: $T^{*(i)} = \left(\hat{\theta}^{*(i)} - \hat{\theta}\right) \big/ \sqrt{\widehat{\operatorname{Var}}\left(\hat{\theta}^{*(i)}\right)}$.
7: end for
8: Sort the bootstrap-t statistics in ascending order to get the ordered set $T^{*(1)} \le \cdots \le T^{*(M)}$.
9: Compute $T^{*([M\gamma/2])}$ and $T^{*([M(1-\gamma/2)])}$ and pick their corresponding MLEs.
10: Compute the bounds of the Boot-t confidence interval for $\theta$: $\hat{\theta} + T^{*([M\gamma/2])}\sqrt{\widehat{\operatorname{Var}}(\hat{\theta})}$ and $\hat{\theta} + T^{*([M(1-\gamma/2)])}\sqrt{\widehat{\operatorname{Var}}(\hat{\theta})}$.
11: Output: The Boot-t confidence interval.
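The Boot-t procedure can be sketched analogously; here the per-replicate variance estimates come from the numerical observed information used in aci_ge() above, and the helper functions are the illustrative ones defined earlier.

```r
## Bootstrap-t (Boot-t) sketch, following the convention of Algorithm 3.
boot_t_ci <- function(x, n, M = 1000, level = 0.95) {
  mle <- em_ge_typeII(x, n)
  se  <- sqrt(diag(solve(-hessian(loglik_cens, mle, x = x, n = n))))
  tstat <- replicate(M, {
    xb  <- rge_typeII(n, length(x), mle["alpha"], mle["lambda"])
    mb  <- em_ge_typeII(xb, n, mle["alpha"], mle["lambda"])
    seb <- sqrt(diag(solve(-hessian(loglik_cens, mb, x = xb, n = n))))
    (mb - mle) / seb                      # studentized statistic
  })
  q <- apply(tstat, 1, quantile, probs = c((1 - level) / 2, (1 + level) / 2))
  rbind(lower = mle + q[1, ] * se,        # invert the studentized quantiles
        upper = mle + q[2, ] * se)
}
```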
3. Bayesian Estimation
This section focuses on deriving the Bayes estimates for $\alpha$ and $\lambda$ using Type-II censored data within the framework of a specified loss function. Selecting an appropriate loss function is crucial, as it determines the penalty associated with estimation errors. In this study, we examine three types of loss functions: the general entropy loss function (GELF), the Linex loss function (LILF), and the squared error loss function (SELF). The SELF is widely used due to its symmetric nature and is most appropriate when the estimation error leads to symmetric consequences. For scenarios where the consequences of overestimation and underestimation differ, the Linex loss function, which introduces an asymmetry in the penalties, is more suitable. Meanwhile, the GELF accounts for entropy-based considerations, allowing greater flexibility in modeling uncertainty and estimation inaccuracies. Let $\hat{\theta}$ denote an estimate of the parameter $\theta$. The mathematical expressions of the above three loss functions are provided below:

$$L_G\left(\hat{\theta}, \theta\right) \propto \left(\frac{\hat{\theta}}{\theta}\right)^{q} - q\log\left(\frac{\hat{\theta}}{\theta}\right) - 1, \qquad L_L\left(\hat{\theta}, \theta\right) = e^{h(\hat{\theta} - \theta)} - h\left(\hat{\theta} - \theta\right) - 1, \qquad L_S\left(\hat{\theta}, \theta\right) = \left(\hat{\theta} - \theta\right)^2.$$

The Bayes estimates under each of these loss functions are defined as follows:

Bayes estimate for $\theta$ under GELF:

$$\hat{\theta}_G = \left(E\left[\theta^{-q} \mid X\right]\right)^{-1/q}. \tag{24}$$

Bayes estimate for $\theta$ under LILF:

$$\hat{\theta}_L = -\frac{1}{h}\log\left(E\left[e^{-h\theta} \mid X\right]\right). \tag{25}$$

Bayes estimate for $\theta$ under SELF:

$$\hat{\theta}_S = E\left[\theta \mid X\right]. \tag{26}$$
This setup allows for flexibility in choosing an appropriate loss function based on the characteristics of the estimation problem. Following Reference [24], let us assume that $\alpha$ and $\lambda$ follow independent gamma prior distributions, expressed as:

$$\pi(\alpha) \propto \alpha^{u-1} e^{-v\alpha}, \qquad \pi(\lambda) \propto \lambda^{s-1} e^{-b\lambda},$$

where all hyper-parameters u, v, s, and b are known and positive.
The joint posterior distribution of $\alpha$ and $\lambda$ given the Type-II censored data X is

$$\pi(\alpha, \lambda \mid X) = \frac{\pi(\alpha)\,\pi(\lambda)\,L(\alpha, \lambda \mid X)}{\int_0^{\infty}\!\int_0^{\infty} \pi(\alpha)\,\pi(\lambda)\,L(\alpha, \lambda \mid X)\, d\alpha\, d\lambda}.$$

Since the denominator is a normalizing constant, we can simplify to obtain

$$\pi(\alpha, \lambda \mid X) \propto \pi(\alpha)\,\pi(\lambda)\,L(\alpha, \lambda \mid X), \tag{30}$$

which factors into gamma-type conditional kernels for $\alpha$ and $\lambda$ together with an adjustment term used as the importance weight. It is evident that the shape and rate quantities appearing in these kernels are all positive.
Therefore, the posterior expectation of any function $g(\alpha, \lambda)$ can be given directly as:

$$E\left[g(\alpha, \lambda) \mid X\right] = \int_0^{\infty}\!\int_0^{\infty} g(\alpha, \lambda)\,\pi(\alpha, \lambda \mid X)\, d\alpha\, d\lambda. \tag{31}$$

Based on (30) and (31), Gibbs sampling is initially applied to draw Markov chain Monte Carlo (MCMC) samples from $\pi(\alpha, \lambda \mid X)$. These samples are subsequently utilized in the importance sampling algorithm outlined in Algorithm 4 to approximate the values in (24)–(26).
Algorithm 4 Importance Sampling for Bayesian Estimation
1: for $i = 1$ to N do
2: Sample $\lambda_i$ from its gamma-type marginal kernel.
3: Sample $\alpha_i$ from its conditional gamma density given $\lambda_i$.
4: end for
5: Compute the Bayesian estimate of $g(\alpha, \lambda)$ under the three loss functions, using the importance weights $w_i$ attached to the draws:
General Entropy Loss (GELF): $\hat{g}_G = \left(\dfrac{\sum_{i=1}^{N} w_i\, g(\alpha_i, \lambda_i)^{-q}}{\sum_{i=1}^{N} w_i}\right)^{-1/q}$;
Linex Loss (LILF): $\hat{g}_L = -\dfrac{1}{h}\log\left(\dfrac{\sum_{i=1}^{N} w_i\, e^{-h\, g(\alpha_i, \lambda_i)}}{\sum_{i=1}^{N} w_i}\right)$;
Squared Error Loss (SELF): $\hat{g}_S = \dfrac{\sum_{i=1}^{N} w_i\, g(\alpha_i, \lambda_i)}{\sum_{i=1}^{N} w_i}$.
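Step 5 of Algorithm 4 reduces to weighted averages of the posterior draws. The short R sketch below computes all three Bayes estimates from draws theta_i (for either parameter) with weights w_i; h and q are the LILF and GELF constants.

```r
## Bayes estimates under SELF, LILF and GELF from weighted posterior draws.
bayes_estimates <- function(theta, w, h = 0.5, q = 0.5) {
  w <- w / sum(w)                                 # normalize the weights
  c(SELF = sum(w * theta),                        # posterior mean
    LILF = -log(sum(w * exp(-h * theta))) / h,    # -(1/h) log E[e^{-h theta}]
    GELF = sum(w * theta^(-q))^(-1 / q))          # (E[theta^{-q}])^{-1/q}
}
```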
The credible interval for $\theta$ (standing for either $\alpha$ or $\lambda$) can be constructed using the methodology introduced by Reference [25]. Let $\pi(\theta \mid X)$ represent the posterior density function of $\theta$, while $\Pi(\theta \mid X)$ corresponds to its cumulative distribution function. The p-quantile of $\theta$, denoted as $\theta^{(p)}$, is expressed as:

$$\theta^{(p)} = \inf\left\{\theta : \Pi(\theta \mid X) \ge p\right\}, \qquad 0 < p < 1.$$
For any specific value $\theta^{*}$, the posterior cumulative distribution function is expressed as:

$$\Pi\left(\theta^{*} \mid X\right) = E\left[\mathbf{1}_{\{\theta \le \theta^{*}\}} \mid X\right],$$

where $\mathbf{1}_{\{\theta \le \theta^{*}\}}$ is an indicator function. A simulation-consistent approximation for $\Pi\left(\theta^{*} \mid X\right)$ can be written as:

$$\hat{\Pi}\left(\theta^{*} \mid X\right) = \frac{\sum_{i=1}^{N} w_i\, \mathbf{1}_{\{\theta_i \le \theta^{*}\}}}{\sum_{i=1}^{N} w_i}.$$
Let $\theta_{(1)} \le \theta_{(2)} \le \cdots \le \theta_{(N)}$ denote the ordered values of $\theta_1, \ldots, \theta_N$, and define $w_{(i)}$ as the normalized importance weight associated with $\theta_{(i)}$. Using this, $\Pi\left(\theta_{(i)} \mid X\right)$ can be estimated as:

$$\hat{\Pi}\left(\theta_{(i)} \mid X\right) = \sum_{j=1}^{i} w_{(j)}, \qquad i = 1, \ldots, N.$$

The quantile $\theta^{(p)}$ can subsequently be estimated using the following expression:

$$\hat{\theta}^{(p)} = \begin{cases} \theta_{(1)}, & p \le w_{(1)}, \\ \theta_{(i)}, & \sum_{j=1}^{i-1} w_{(j)} < p \le \sum_{j=1}^{i} w_{(j)}. \end{cases}$$
In order to construct the $100(1-\gamma)\%$ HPD interval for $\theta$, we first establish the following definition:

$$R_j = \left(\hat{\theta}^{(j/N)},\, \hat{\theta}^{\left(\left(j + [(1-\gamma)N]\right)/N\right)}\right),$$

where $j = 1, \ldots, N - [(1-\gamma)N]$. The HPD interval is then identified by selecting $R_{j^{*}}$, the interval with the shortest length.
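With equally weighted MCMC draws, the search over the candidate intervals $R_j$ amounts to sliding a window of fixed coverage along the sorted draws and keeping the narrowest one, as in the following sketch (weighted draws would use the weighted quantile construction above):

```r
## HPD interval from posterior draws: shortest window containing
## a fraction `level` of the sorted sample.
hpd_interval <- function(theta, level = 0.95) {
  theta <- sort(theta)
  N <- length(theta)
  k <- floor(level * N)                    # draws inside each candidate
  widths <- theta[(k + 1):N] - theta[1:(N - k)]
  j <- which.min(widths)                   # shortest candidate interval
  c(lower = theta[j], upper = theta[j + k])
}
```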
4. Bayesian Prediction
Bayesian prediction for future observations is explored in this section. In this study, we consider Bayesian prediction within the framework of both one-sample and two-sample cases. We provide the corresponding posterior distributions, along with methods for computing point predictions and predictive intervals.
4.1. One-Sample Prediction
Let $X = (X_{(1)}, \ldots, X_{(r)})$ represent the observed informative sample and $X_{(r+1)}, \ldots, X_{(n)}$ denote the future sample. We aim to predict and make inferences about the future order statistic $Y = X_{(k)}$ (where $r < k \le n$). Note that the conditional density function of $Y$ given $X$ can be written as

$$f_{Y \mid X}(y \mid \alpha, \lambda) = \frac{(n-r)!}{(k-r-1)!\,(n-k)!}\,\frac{\left[F(y) - F\left(x_{(r)}\right)\right]^{k-r-1}\left[1 - F(y)\right]^{n-k} f(y)}{\left[1 - F\left(x_{(r)}\right)\right]^{n-r}}, \qquad y > x_{(r)},$$

where $F$ and $f$ denote the CDF and PDF of the generalized exponential distribution. By applying the binomial expansion, the function mentioned above can be reformulated as:

$$f_{Y \mid X}(y \mid \alpha, \lambda) = \frac{(n-r)!}{(k-r-1)!\,(n-k)!}\sum_{j=0}^{k-r-1}\binom{k-r-1}{j}(-1)^{j}\,\frac{\left[1 - F(y)\right]^{\,n-k+j} f(y)}{\left[1 - F\left(x_{(r)}\right)\right]^{\,n-k+j+1}}.$$
Therefore, the corresponding survival function of $Y$ given $X$ is then expressed as:

$$S(y \mid X, \alpha, \lambda) = \frac{(n-r)!}{(k-r-1)!\,(n-k)!}\sum_{j=0}^{k-r-1}\binom{k-r-1}{j}\frac{(-1)^{j}}{n-k+j+1}\left[\frac{1 - F(y)}{1 - F\left(x_{(r)}\right)}\right]^{\,n-k+j+1}.$$
The posterior predictive density function of $Y$ given $X$ can subsequently be expressed as:

$$f^{*}(y \mid X) = \int_0^{\infty}\!\int_0^{\infty} f_{Y \mid X}(y \mid \alpha, \lambda)\,\pi(\alpha, \lambda \mid X)\, d\alpha\, d\lambda,$$

and the posterior predictive survival function is

$$S^{*}(y \mid X) = \int_0^{\infty}\!\int_0^{\infty} S(y \mid X, \alpha, \lambda)\,\pi(\alpha, \lambda \mid X)\, d\alpha\, d\lambda.$$
Let $\left\{(\alpha_i, \lambda_i),\ i = 1, \ldots, N\right\}$ represent the samples obtained through Algorithm 4. The corresponding simulation-consistent estimates for the posterior predictive density and survival functions can then be approximated by:

$$\hat{f}^{*}(y \mid X) = \frac{\sum_{i=1}^{N} w_i\, f_{Y \mid X}\left(y \mid \alpha_i, \lambda_i\right)}{\sum_{i=1}^{N} w_i},$$

and

$$\hat{S}^{*}(y \mid X) = \frac{\sum_{i=1}^{N} w_i\, S\left(y \mid X, \alpha_i, \lambda_i\right)}{\sum_{i=1}^{N} w_i},$$

where $w_i$ is the importance weight attached to $\left(\alpha_i, \lambda_i\right)$,

$$w_i = \frac{\pi\left(\alpha_i, \lambda_i \mid X\right)}{q\left(\alpha_i, \lambda_i\right)}, \tag{40}$$

and $q(\cdot, \cdot)$ denotes the proposal density used in Algorithm 4.
The two-sided $100(1-\gamma)\%$ symmetric predictive interval $(L, U)$ for $X_{(k)}$ is the solution to the following equations:

$$\hat{S}^{*}(L \mid X) = 1 - \frac{\gamma}{2}, \qquad \hat{S}^{*}(U \mid X) = \frac{\gamma}{2}.$$
The point prediction for $X_{(k)}$ can be determined by utilizing the corresponding predictive distribution. Specifically, it is computed as:

$$\hat{X}_{(k)} = E\left[Y \mid X\right] = \int_{x_{(r)}}^{\infty} y\, f^{*}(y \mid X)\, dy,$$

where $f^{*}(y \mid X)$ is the posterior predictive density given above. We cannot compute this expression analytically. Therefore, we approximate $\hat{X}_{(k)}$ using the previously drawn samples:

$$\hat{X}_{(k)} \approx \frac{\sum_{i=1}^{N} w_i \int_{x_{(r)}}^{\infty} y\, f_{Y \mid X}\left(y \mid \alpha_i, \lambda_i\right) dy}{\sum_{i=1}^{N} w_i}.$$
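An alternative, simulation-based route to the same quantities is to draw $X_{(k)}$ directly: conditional on $Z > x_{(r)}$, $F(Z)$ is uniform on $\left(F(x_{(r)}), 1\right)$, so each posterior draw $(\alpha_i, \lambda_i)$ yields one realization of the censored portion of the sample. The sketch below (with illustrative names) uses this device instead of the closed-form survival function:

```r
## One-sample prediction of X_(k) by direct simulation per posterior draw.
predict_one_sample <- function(alpha, lambda, xr, n, r, k, level = 0.95) {
  draws <- mapply(function(a, l) {
    u0 <- (1 - exp(-l * xr))^a          # CDF at the censoring point
    u  <- runif(n - r, min = u0)        # truncated-uniform draws
    z  <- -log(1 - u^(1 / a)) / l       # back-transform to GE lifetimes
    sort(z)[k - r]                      # the (k - r)-th censored failure
  }, alpha, lambda)
  c(point = mean(draws),
    quantile(draws, probs = c((1 - level) / 2, (1 + level) / 2)))
}
```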
4.2. Two-Sample Prediction
This section addresses the problem of dealing with two distinct sets of samples: the first, identified as the informative sample, and the second, known as the future sample. The future order statistics, represented as $Y_{(1)} \le Y_{(2)} \le \cdots \le Y_{(m)}$, are considered to be statistically independent of the informative sample $X$. The main objective is to derive the predictive density for the k-th order statistic $Y_{(k)}$, conditioned on the observed sample X. The density function of $Y_{(k)}$ can be expressed as follows:

$$f_{Y_{(k)}}(y \mid \alpha, \lambda) = \frac{m!}{(k-1)!\,(m-k)!}\left[F(y)\right]^{k-1}\left[1 - F(y)\right]^{m-k} f(y), \qquad y > 0,$$

where $F$ and $f$ are the CDF and PDF of the generalized exponential distribution. If we let $f^{*}(y \mid X)$ denote the posterior predictive density of $Y_{(k)}$, then it can be expressed as:

$$f^{*}(y \mid X) = \int_0^{\infty}\!\int_0^{\infty} f_{Y_{(k)}}(y \mid \alpha, \lambda)\,\pi(\alpha, \lambda \mid X)\, d\alpha\, d\lambda.$$
The function $S_{Y_{(k)}}(y \mid \alpha, \lambda)$ represents the survival function of $Y_{(k)}$, given by:

$$S_{Y_{(k)}}(y \mid \alpha, \lambda) = \sum_{j=0}^{k-1}\binom{m}{j}\left[F(y)\right]^{j}\left[1 - F(y)\right]^{m-j}.$$

The predictive survival function of $Y_{(k)}$, denoted by $S^{*}(y \mid X)$, is therefore given by:

$$S^{*}(y \mid X) = \int_0^{\infty}\!\int_0^{\infty} S_{Y_{(k)}}(y \mid \alpha, \lambda)\,\pi(\alpha, \lambda \mid X)\, d\alpha\, d\lambda.$$
In practical applications, $f^{*}(y \mid X)$ can be approximated based on the importance sampling method:

$$\hat{f}^{*}(y \mid X) = \frac{\sum_{i=1}^{N} w_i\, f_{Y_{(k)}}\left(y \mid \alpha_i, \lambda_i\right)}{\sum_{i=1}^{N} w_i},$$

and $S^{*}(y \mid X)$ can be approximated by:

$$\hat{S}^{*}(y \mid X) = \frac{\sum_{i=1}^{N} w_i\, S_{Y_{(k)}}\left(y \mid \alpha_i, \lambda_i\right)}{\sum_{i=1}^{N} w_i},$$

where $w_i$ is defined as in (40). The two-sided $100(1-\gamma)\%$ symmetric predictive interval $(L, U)$ for $Y_{(k)}$ is the solution to the following equations:

$$\hat{S}^{*}(L \mid X) = 1 - \frac{\gamma}{2}, \qquad \hat{S}^{*}(U \mid X) = \frac{\gamma}{2}.$$
To obtain the point prediction for $Y_{(k)}$, we calculate it based on the corresponding predictive distribution:

$$\hat{Y}_{(k)} = E\left[Y_{(k)} \mid X\right] = \int_0^{\infty} y\, f^{*}(y \mid X)\, dy \approx \frac{\sum_{i=1}^{N} w_i \int_0^{\infty} y\, f_{Y_{(k)}}\left(y \mid \alpha_i, \lambda_i\right) dy}{\sum_{i=1}^{N} w_i},$$

where the inner integrals are evaluated numerically.
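For the two-sample case, a simulation-based shortcut is also available: the k-th order statistic of m uniforms follows a Beta(k, m − k + 1) law, so $Y_{(k)}$ can be drawn by passing a Beta variate through the GE quantile function. A hedged sketch:

```r
## Two-sample prediction of Y_(k) via the Beta order-statistic trick.
predict_two_sample <- function(alpha, lambda, m, k, level = 0.95) {
  draws <- mapply(function(a, l) {
    u <- rbeta(1, k, m - k + 1)         # k-th uniform order statistic
    -log(1 - u^(1 / a)) / l             # GE quantile transform
  }, alpha, lambda)
  c(point = mean(draws),
    quantile(draws, probs = c((1 - level) / 2, (1 + level) / 2)))
}
```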
5. Simulation
Monte Carlo simulation results are provided to evaluate the performance of Bayesian estimation methods in contrast to classical estimation. The comparisons are conducted under various censoring schemes and a range of parameter settings. The analysis was performed on a system equipped with an 11th Gen Intel(R) Core(TM) i7-11800H @ 2.30 GHz processor. Here, we used R software (version 4.4.1) for all computations.
To begin with, Type-II censored samples are generated under various censoring schemes, assuming that they follow a generalized exponential distribution with specified true values of $\alpha$ and $\lambda$. The EM algorithm is subsequently applied to calculate the MLEs. Then, we assign fixed values to the hyper-parameters u, v, s, and b. Under these settings, Bayesian estimates are computed under the various loss functions through importance sampling techniques. The simulation is performed over 3000 iterations, and the mean estimates of $\alpha$ and $\lambda$ are computed. Additionally, a comparison is performed between the MLEs and Bayesian estimates, utilizing the mean squared error (MSE) as the primary criterion.
The definitions of the mean and mean squared error of the estimates $\hat{\alpha}$ and $\hat{\lambda}$ are as follows:

$$\overline{\hat{\theta}} = \frac{1}{M}\sum_{j=1}^{M}\hat{\theta}_j, \qquad \operatorname{MSE}\left(\hat{\theta}\right) = \frac{1}{M}\sum_{j=1}^{M}\left(\hat{\theta}_j - \theta\right)^2,$$

where $\hat{\theta}_j$ denotes the estimate of $\theta \in \{\alpha, \lambda\}$ derived from the j-th simulation, $j = 1, \ldots, M$, with M set to 3000 for this study.
Furthermore, we explore the effect of different values of h in the LILF and q in the GELF to analyze their influence on the Bayesian estimates.
Under various Type-II censoring schemes, Table 1 and Table 2 provide the average estimates and MSEs for the parameters $\alpha$ and $\lambda$. The results show that the Bayesian estimates outperform the ML method. Notably, when utilizing the GELF with a suitable choice of q, the Bayesian estimates stand out by exhibiting superior performance, achieving the smallest MSEs and closely approximating the actual parameter values.
Table 3 summarizes the interval estimates for the unknown parameters under various censoring schemes, along with the associated coverage probabilities (CPs) and average lengths (ALs).
The findings reveal that the HPD intervals consistently have the shortest ALs among all the interval estimation methods considered, followed by the asymptotic confidence intervals, while the bootstrap confidence intervals have the longest ALs, reflecting the extra width needed to accommodate the skewness of the estimators. Moreover, as n and r increase, the ALs of the different interval estimates tend to decrease. In terms of CPs, the asymptotic intervals achieve the highest CPs, with the HPD intervals coming next and the bootstrap intervals showing the lowest CPs. Therefore, considering both AL and CP, the HPD intervals demonstrate superior performance compared to the asymptotic and bootstrap intervals.
Next, we evaluate the feasibility and robustness of the model under various parameter combinations. For a fixed censoring scheme, seven different parameter combinations are specified. For each combination, the MLEs, Bayesian estimates, and the corresponding interval estimates are computed.
Table 4 presents the mean estimates along with the MSEs (in brackets), which are used to evaluate the accuracy of the estimates for different parameter combinations.
Table 5 provides the confidence and credible interval estimates corresponding to each parameter combination. Consistent with the previous discussion, the Bayesian estimates exhibit smaller MSEs compared to the MLEs. Additionally, the HPD intervals are characterized by the shortest average lengths.
Then, we explore the impact of prior distribution selection on Bayesian estimation. As in the previous discussion, the true values of $\alpha$ and $\lambda$ are fixed. We consider two types of prior distributions: the non-informative prior, where the hyper-parameters u, v, s, and b are set to 0, and the informative prior, where the hyper-parameters are chosen so that the expectation of the prior distribution equals the true parameter value. The results are provided in Table 6. It can be concluded from the table that Bayesian estimates with informative priors yield smaller MSEs compared to those with non-informative priors. This indicates that incorporating prior information in the Bayesian procedure enhances the precision of the estimates.
Subsequently, we consider the Bayesian prediction problem. First, we employ the inverse transformation method to generate random numbers from the generalized exponential distribution with fixed true values of $\alpha$ and $\lambda$. These generated values are presented in Table 7. Following this, we define several different censoring schemes. For one-sample prediction, we compute point predictions and interval predictions for the future order statistics $X_{(k)}$ with several values of k, including 28 and 30. The detailed results are displayed in Table 8. For two-sample prediction, we calculate both point predictions and predictive intervals; the results are summarized in Table 9.
6. Data Analysis
This section provides an illustrative example by analyzing a dataset reported by Lawless [26]. The detailed observed data are shown in Table 10. This dataset has been analyzed in previous studies, with Reference [5] highlighting the effectiveness of the generalized exponential distribution in modeling this data.
To analyze the real data, we begin by calculating the MLEs using the complete data, followed by computing the Kolmogorov–Smirnov (K-S) statistic, Akaike Information Criterion (AIC), and Bayesian Information Criterion (BIC). Additionally, in some studies, the Anderson–Darling (A-D) statistic has been found to be more effective than the K-S statistic; for further details on the A-D statistic and its associated probability calculations, please refer to Reference [27]. We also compare the goodness-of-fit of other life distributions, namely the Weibull, Log-Normal, and Gamma distributions, whose PDFs (in standard shape-scale parameterizations) are:

$$f_{\mathrm{W}}(x; k, \theta) = \frac{k}{\theta}\left(\frac{x}{\theta}\right)^{k-1} e^{-(x/\theta)^{k}}, \qquad f_{\mathrm{LN}}(x; \mu, \sigma) = \frac{1}{x\sigma\sqrt{2\pi}}\, e^{-\frac{(\log x - \mu)^2}{2\sigma^2}}, \qquad f_{\mathrm{G}}(x; a, \beta) = \frac{\beta^{a}}{\Gamma(a)}\, x^{a-1} e^{-\beta x}, \qquad x > 0.$$
Table 11 presents the test results for each distribution. Typically, better model fit is indicated by smaller values of K-S, A-D, AIC, and BIC, along with a higher log-likelihood value. As shown in the table, the generalized exponential distribution provides a good fit for this dataset. To further visualize the model performance, Figure 4 includes two plots based on the MLEs. The first plot compares the fitted CDFs of the four distributions mentioned above. The second plot displays the fitted PDFs of these four distributions alongside a histogram of the dataset.
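The comparison behind Table 11 can be reproduced along the following lines with the fitdistrplus package, where `dat` stands for the bearing lifetimes of Table 10 and the GE starting values are arbitrary; this is a sketch, not our exact script.

```r
## Goodness-of-fit comparison of four candidate lifetime models.
library(fitdistrplus)

dge <- function(x, alpha, lambda)       # GE density
  alpha * lambda * exp(-lambda * x) * (1 - exp(-lambda * x))^(alpha - 1)
pge <- function(q, alpha, lambda)       # GE distribution function
  (1 - exp(-lambda * q))^alpha

fits <- list(
  GE      = fitdist(dat, "ge", start = list(alpha = 2, lambda = 0.02)),
  Weibull = fitdist(dat, "weibull"),
  LogNorm = fitdist(dat, "lnorm"),
  Gamma   = fitdist(dat, "gamma")
)
gofstat(fits)   # K-S and A-D statistics, AIC and BIC for all four models
```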
Now, we consider the situation where the final three observations are censored, i.e., the sample is Type-II censored with n = 23 and r = 20. To estimate the unknown parameters, both Bayesian estimation and maximum likelihood estimation approaches are utilized. A summary of the estimation results is presented in Table 12.
The corresponding 95% asymptotic confidence intervals and 95% HPD credible intervals for $\alpha$ and $\lambda$ are also obtained. Notably, the HPD intervals are shorter than the asymptotic confidence intervals.
Now, we make predictive inference for the 21st, 22nd, and 23rd future order statistics, denoted as $X_{(21)}$, $X_{(22)}$, and $X_{(23)}$, using the point and interval prediction formulas of Section 4.1. We assume that the hyper-parameters are the same as in Section 5. From Table 13, it is evident that the predicted values closely resemble the true values, thereby corroborating the effectiveness of our theoretical model. The Bayesian point prediction for the 21st order statistic is 127.92, with a 95% predictive interval ranging from 106.1134 to 147.1283. This suggests that, based on the observed sample, the 21st deep groove ball bearing is expected to fail between 106.11 and 147.13 million revolutions.
Figure 5 illustrates the posterior predictive density functions for the censored observations. Additionally, the predictive survival functions for $X_{(21)}$, $X_{(22)}$, and $X_{(23)}$ are displayed in Figure 6. The plots clearly indicate that as k increases, the rate of decline in the predictive survival function decreases.
We now focus on the two-sample prediction problem discussed in Section 4.2. Assume 23 new deep groove ball bearings, denoted as $Y_1, \ldots, Y_{23}$, are subjected to the same test. The goal is to derive the predictive density and make predictive inference for the order statistics $Y_{(k)}$, $k = 1, \ldots, 23$.
Figure 7 illustrates the point predictions and predictive intervals for the future samples derived from the observed dataset. Detailed numerical results are provided in Table 14. The Bayesian point prediction for the median is 65.36, with a 95% predictive interval for the median ranging from 42.1386 to 91.5963. This suggests that, based on the observed sample, the 12th deep groove ball bearing is expected to fail between 42.14 and 91.60 million revolutions. The posterior predictive density functions for $Y_{(1)}$ to $Y_{(23)}$ are shown in Figure 8. It can be observed that as k increases, the expected values of $Y_{(k)}$ shift rightward and the variances increase. Additionally, Figure 9 presents the predictive survival functions for three selected order statistics.
The results above, including both one-sample and two-sample predictions, can guide maintenance decisions by using the predictive intervals to schedule timely replacement or servicing of bearings, helping to prevent unexpected failures, minimize downtime, and reduce maintenance costs.
We encourage further research utilizing larger sample sizes and real-world datasets to achieve more accurate point and interval predictions, as well as to explore the impact of prior distribution selection on predictive outcomes.
7. Conclusions
In this study, we explore the estimation and prediction problems for the generalized exponential distribution based on Type-II censored data. The MLE is performed using the EM algorithm. Additionally, Bayesian estimation methods are investigated under various loss functions. Using Gibbs sampling and importance sampling, Bayesian estimates and HPD credible intervals are constructed. Monte Carlo simulations are conducted to assess and compare the performance of classical and Bayesian estimation techniques. Notably, the results indicate that Bayesian methods consistently provide lower MSEs and more precise interval estimates. For prediction, the proposed methods are applied to the endurance test data for deep groove ball bearings, where the last three observations are censored. Both one-sample and two-sample prediction scenarios are analyzed, with posterior predictive distributions used to construct predictive intervals. The findings demonstrate that Bayesian approaches offer a robust and reliable framework for both estimation and prediction in the presence of Type-II censored data.
One limitation of the Type-II censoring scheme is that units can only be censored at the final termination time. Future studies are encouraged to investigate more adaptable censoring schemes, such as progressive type-II hybrid censoring and adaptive Type-II progressive censoring. Moreover, exploring different prior distributions, alternative loss functions, or applying the model to other lifetime distributions could further improve the versatility and reliability of the approach.