Different Methods for Estimating Default Parameters of Alpha Power-Transformed Power Distributions Using Record-Breaking Data

Rasha Abd El-Wahab Attwa; Taha Radwan

doi:10.3390/sym16010030

and

¹

Department of Mathematics, Faculty of Science, Zagazig University, Zagazig 44519, Egypt

²

Department of Management Information Systems, College of Business Administration, Qassim University, Buraydah 52571, Saudi Arabia

³

Department of Mathematics and Statistics, Faculty of Management Technology and Information Systems, Port Said University, Port Said 42521, Egypt

^*

Author to whom correspondence should be addressed.

Symmetry2024, 16(1), 30;https://doi.org/10.3390/sym16010030

This article belongs to the Special Issue Symmetry in Probability Theory and Statistics

Version Notes

Order Reprints

Abstract

The current study addresses the estimation of the default parameters of alpha power-transformed power (APTPO) distributions. For the location and scale parameters of the APTPO distributions, we provide coefficients for both the best linear unbiased estimators (BLUE) and the best linear invariant estimators (BLIE) methods. Furthermore, we establish a forecast for future records. The parameters of the APTPO distribution are estimated using the maximum likelihood estimation method (MLE). The goodness-of-fit test (using Akaike information criterion (AIC)) is computed using both the inter-record time sequence and the entire sample. Also, we utilize a simulation approach to demonstrate the practicality and benefits of our perspective. Finally, we demonstrate the accuracy of these parameters and the performance of estimators through a real-life example.

Keywords:

statistical model; maximum likelihood estimates; best linear unbiased estimators; best linear invariant estimators; Akaike information criterion

1. Introduction

When acquiring observations becomes challenging or when observational data is being lost during an experiment, records become significant. The concepts of record values, record times, and inter-record times for analyzing the breaking strength data of a specific material were initially introduced by Chandler [1]. He concluded that the predicted value of the inter-record time is infinite for every particular probability distribution function of a random variable. Feller [2] provided several examples of record values in the context of gambling issues.

Suppose that

X_{1}, X_{2}, \dots, X_{n}

are a sequence of independent and identically distributed random variables with the cumulative probability distribution function

F (x)

.

Let

m_{n} = m i n {X_{1}, X_{2}, \dots, X_{n}}

for

n \geq 1

. We say

X_{j}

is a lower record value of

{X_{n}, n \geq 1}

if

X_{j} < X_{j - 1}, j > 1

. When considering upper record values, a similar definition exists. By definition,

X_{1}

is a lower as well as upper record value. The record times reveal the indices at which the lower record values occur.

{L (r); r > 0}

, where

L (r) = m i n {j | j > L (r - 1), X_{j} < X_{L (r - 1)}; r > 1}

, and

L (1) = 1

. The probability density function of

X_{L (r)}

is given by the following:

f_{r} (x) = \frac{1}{Γ (r)} {(- ln (F (x)))}^{r - 1} f (x), x \in (- \infty, \infty) .

(1)

And the cumulative probability distribution function of

X_{L (r)}

is the following:

F_{r} (x) = \frac{1}{Γ (r)} \int_{- \infty}^{x} {(- ln (F (x)))}^{r - 1} f (x) d x, x \in (- \infty, \infty) .

The joint probability density function of two lower record values,

X_{L (r)}

and

X_{L (s)}

, is given by

f (x_{r}, x_{s}) = \frac{{(- ln (F (x_{r})))}^{r - 1} {[ln (F (x_{r})) - ln (F (x_{s}))]}^{s - r - 1}}{Γ (r) Γ (s - r)} \frac{f (x_{r}) f (x_{s})}{F (x_{r})}, - \infty < x_{s} < x_{r} < \infty

(2)

The concept of parametric inference for data breaking records was first introduced by Samaniego and Whitaker [3]. They explored the characteristics of estimates using the maximum likelihood method for the mean of a basic exponential probability distribution. Gulati and Padgett [4] expanded Samaniego and Whitaker’s technique to include the Weibull probability distribution. Raul et al. [5] studied the maximum likelihood and Bayesian estimation of parameters and prediction of future records for the Weibull distribution using

δ

-record data. Ahsanullah [6] examined data from an exponential distribution, focusing on predicting the

s^{t h}

record value based on the first m record values

(s > m)

. Nigm [7] was the first to present record values for the Inverse Weibull distribution (IW) along with explicit formulas for its means, variances, and covariances. Furthermore, certain concurrent inferences regarding the forecast of a future record value and the examination of the current record values for spuriousness were made.

Various random events, observed in specific survival, financial, or reliability studies, have been thoroughly modeled using asymmetrical models such as Gumbel, logistic, Weibull, and generalized extreme value distributions.

The Pareto distribution is well-known in various fields, including reliability analysis, actuarial science, survival analysis, life testing, economics, finance, hydrology, telecommunication, physics, and engineering. According to Johnson et al. [8], the cumulative density function defines the Pareto distribution of the first kind.

In a novel approach to generating distributions with an application to the exponential distribution, Mahadevi and Kundu [9] introduced the alpha power transformation (APT) distribution to incorporate skewness into the baseline distribution. The formula for the APT cumulative distribution function is

G_{ξ} (x; α) = \frac{α^{F (x)} - 1}{α - 1}, α > 0, α \neq 1 (3)

(3)

and the density function formula for APT is as follows:

g_{ξ} (x; α) = (\frac{log α}{α - 1}) f (x) α^{F (x)}, α > 0, α \neq 1

(4)

There are several research papers that have applied this distribution in various ways. For instance, Mazen et al. [10] focused on the finite sample characteristics of Monte Carlo simulation-based parameter estimates for the alpha power exponential distribution. They also examined a single real data set and estimated the distribution parameters under conflicting hazards using the maximum likelihood approach. Also, Refah et al. [11] addressed estimation issues relating to the alpha power exponential distribution and employed an adaptive progressive Type-II hybrid censoring strategy. Maximum likelihood and Bayesian approaches were used to estimate unknown parameters, reliability, and hazard rate functions. Furthermore, in the study conducted by Fatehi and Chhaya [12], the alpha power transformed extended power Lindley (APTEPL) distribution, which is a new generalization of the extended power Lindley distribution, was explored and introduced.

Now, let X be a complete random variable

(r v)

from the power function probability distribution, with CDF and PDF given by

H (x; λ, β, ν) = {((x - λ) / β)}^{ν}, λ \leq x \leq λ + β, β > 0, λ > 0

(5)

h (x; λ, β, ν) = \frac{ν}{β} {((x - λ) / β)}^{ν - 1}, λ \leq x \leq λ + β, β > 0, λ > 0

(6)

where v is the shape parameter,

λ

is the location parameter, and

β

is the scale parameter.

This work aims to propose a new generalization of the Power distribution, known as the alpha power transformed of Power APTPO distribution, according to Equations (3) and (4). Approximate methods, such as the best linear unbiased estimates (BLUE), are frequently practical. The BLUE, which considers both individual uncertainties and their correlations, is commonly used. If the true uncertainties and their correlations are known, the approach is inherently impartial (see Luca [13]). The Transformed Power Function distribution was expanded upon by Idika et al. [14] as the APTPO. Here are some characteristics of the APTPO distribution. Three approaches were used for parameter estimation: maximum likelihood, ordinary least-squares, and weighted least-squares. After comparing the outcomes of a simulation research, the authors opted for maximum likelihood. For the APTPO distribution, breaking data is used to determine the coefficients for the parameters of our proposed distribution for both the best linear unbiased estimators (BLUE) and the best linear invariant estimators (BLIE). Forecasting future observations is possible by utilizing the return level for the entire sample from the APTPO distribution.

The work is outlined as follows: Section 2 determines the construction of our proposed distribution and the probability density function (PDF) of the lower record values. The effect of the parameters is also illustrated graphically. Section 3 employs BLUE and BLIE methods to estimate the parameters of the APTPO distribution based on lower record values. This section covers future record prediction and simulation studies. In Section 4, based on the inter-record time sequence and the complete sample, the APTPO distribution’s parameters are estimated using the maximum likelihood estimation method. The remaining portions of this section compare the parameters in the inter-record times and the entire sample using a goodness-of-fit test. Section 5 provides an illustrative example to demonstrate the previous applications of the new distribution. Finally, in Section 6, conclusions are presented.

2. Construction of the Alpha Power Transformed of Power (APTPO) Distribution

The Alpha Power Transformed of Power (APTPO) distribution is a novel mathematical structure introduced using the APT method, as follows: Let X be a random variable

(r v)

following the complete power function. The cumulative distribution function (CDF) and probability density function (PDF) of the APTPO distribution are given by

G (H (x; λ, β, ν)) = \frac{α^{{((x - λ) / β)}^{ν}} - 1}{α - 1}, λ \leq x \leq λ + β^{1 - \frac{1}{ν}}, β > 0,

(7)

g (H (x; λ, β, ν)) = (\frac{ν log α}{β (α - 1)}) {((x - λ) / β)}^{ν - 1} α^{{((x - λ) / β)}^{ν}}, λ \leq x \leq λ + β^{1 - \frac{1}{ν}}, β > 0,

(8)

By setting

λ = 0

and

β = 1

using Equations (7) and (8), the CDF and PDF of the APTPO distribution can be expressed as follows:

G (H (x; λ, β, ν)) = α^{x^{ν}} - 1 / α - 1, 0 \leq x \leq 1, β > 0,

(9)

g (H (x; λ, β, ν)) = \frac{ν log α}{α - 1} α^{x^{ν}} x^{ν - 1}, 0 \leq x \leq 1

(10)

So, the probability density function of

X_{L (r)}

from the APTPO distribution is given by

g^{*} (H (x; λ, β, ν)) = \frac{ν log α}{Γ (r) (α - 1)} {[- ln (α^{x^{ν}} - 1 / α - 1)]}^{r - 1} α^{x^{ν}} x^{ν - 1}, 0 \leq x \leq 1

(11)

The effects of the parameter

α

on the shape of the distributions are illustrated graphically in Figure 1 and Figure 2. Plots of the PDF and CDF of the APTPO distribution of X are displayed in Figure 1 and Figure 2 for certain parameter values. Figure 3 presents a plot of the APTPO PDF of the lower record values for various parameters. These charts demonstrate the significant flexibility of the proposed model.

Figure 1. The CDF of APTPO distribution for different values of

α

and

ν

.

Figure 2. The PDF of APTPO distribution for different values of

α

and

ν

.

Figure 3. The APTPO PDF of the lower record values for different values of

α

and

ν

and r.

3. Estimating Parameters of the APTPO Probability Distribution Using Lower Record Values

In this section, we use lower record values to estimate the parameters of the APTPO probability distribution. Section 3.1 presents the derivation of the BLUE based on the r lower record values of the APTPO probability distribution. Section 3.2 details the derivation of the BLIE using the r-lower record values from the APTPO probability distribution. Section 3.3 focuses on the development of a prediction for future records.

3.1. Estimate Parameters of APTPO Distribution Using Best Linear Unbiased Estimates (BLUEs)

By applying Equation (11), the

n^{t h}

moment of

X_{L (r)}

from the APTPO probability distribution is given as

E {(X_{L (r)})}^{n} = \int_{0}^{1} \frac{ν log (α)}{Γ (r) (α - 1)} {[- ln (\frac{α^{x^{ν}} - 1}{α - 1})]}^{r - 1} α^{x^{ν}} x^{ν + n - 1} d x, n \geq 1

(12)

Let

e^{y} = α^{x^{ν}},

then,

E {(X_{L (r)})}^{n} = \frac{{(log (α))}^{\frac{- n}{ν}}}{(α - 1) Γ (r)} \int_{0}^{log (α)} e^{y} y^{\frac{n}{ν}} {[- \ln (\frac{e^{y} - 1}{α - 1})]}^{r - 1} d y

Let

z = e^{y}

, and we can obtain that

E {(X_{L (r)})}^{n} = \frac{{(log (α))}^{\frac{- n}{ν}}}{(α - 1) Γ (r)} \int_{1}^{α} {(log z)}^{\frac{n}{ν}} {[- \ln (\frac{z - 1}{α - 1})]}^{r - 1} d z

(13)

For

n = 1

, we have

E (X_{L (r)}) = θ 1_{r}

(14)

For

n = 2

, we can calculate

E {(X_{L (r)})}^{2}

, which helps in the calculation of

V a r (X_{L (r)}) = θ 1_{r} θ 2_{r}

(15)

Consequently, for

s > r, x_{s} < x_{r},

we can calculate the covariance of

X_{L (r)}

and

X_{L (s)}

as follows:

C o v (X_{L (r)}, X_{L (s)}) = E (X_{L (r)} X_{L (s)}) - E (X_{L (r)}) E (X_{L (s)}) = θ 1_{s} . θ 2_{r}

(16)

By applying the following theorem, one can estimate the parameters of the APTPO probability distribution using lower record values:

Theorem 1.

Let

x_{1}, x_{2}, \dots, x_{r}

be r record values from the APTPO probability distribution (Equation (8)). Then, the best linear unbiased estimates (BLUE), denoted as

\hat{λ}

and

\hat{β}

, for λ and β, respectively, are as follows:

\hat{λ} = \frac{α^{'} V^{- 1} (α 1^{'} - 1 α^{'}) V^{- 1} h}{Δ}

and

\hat{β} = \frac{1^{'} V^{- 1} (1 α^{'} - α 1^{'}) V^{- 1} h}{Δ}

where

h^{'} = (x_{1}, x_{2}, \dots, x_{r}),

α^{'} = (θ 1_{1}, θ 1_{2}, \dots, θ 1_{r}),

V = (υ_{i} j), υ_{i} j = θ 2_{i} θ 1_{j}, 1 \leq i, j \leq r,

Δ = (α^{'} V^{- 1} α) (1^{'} V^{- 1} 1) - {(α^{'} V^{- 1} 1)}^{2} .

Proof.

h^{'} = (x_{1}, x_{2}, \dots, x_{r}),

then

E (h^{'}) = μ 1 + β^{2} α,

V a r (h^{'}) = β^{2} V

where, from Equations (13) and (16),

1^{'} = (1, 1, \dots \dots, 1),

α^{'} = (θ 1_{1}, θ 1_{2}, \dots, θ 1_{r})

V = (υ_{i} j), υ_{i j} = θ 2_{i} b_{j}, 1 \leq i, j \leq r,

V^{- 1} = (V^{i j}), 1 \leq i < j \leq r .

Then, the entries of

V^{- 1}

are given by

V^{i i} = \frac{θ 2_{i + 1} θ 1_{i - 1} - θ 2_{i - 1} θ 1_{i + 1}}{(θ 2_{i + 1} θ 1_{i - 1} - θ 2_{i - 1} θ 1_{i}) (θ 2_{i + 1} θ 1_{i} - θ 2_{i} θ 1_{i + 1})}, i = 1, \dots, r - 1,

V^{i j} = V^{j i} = \frac{- 1}{θ 2_{i + 1} θ 1_{i} - θ 2_{i} θ 1_{i + 1}}, j = i + 1, i = 1, 2, \dots, r - 1

V^{i j} = 0 f o r | i - j | > 1

V^{r r} = \frac{θ 1_{r - 1}}{θ 1_{r} (θ 2_{r} θ 1_{r - 1} - θ 2_{r - 1} θ 1_{r})},

Δ = (α^{'} V^{- 1} α) (1^{'} V^{- 1} 1) - {(α^{'} V^{- 1} 1)}^{2} .

Applying the method introduced by Lioyd [15], the best linear unbiased estimates (BLUE), denoted as

\hat{λ}

and

\hat{β}

, for

λ

and

β

based on r lower record values from the APTPO distribution, are given by

\hat{λ} = \frac{α^{'} V^{- 1} (α 1^{'} - 1 α^{'}) V^{- 1} h}{Δ}

and

\hat{β} = \frac{1^{'} V^{- 1} (1 α^{'} - α 1^{'}) V^{- 1} h}{Δ}

□

The variance and covariance of

\hat{λ}, \hat{β}

are given by

V a r (\hat{λ}) = \frac{α^{'} V^{- 1} λ}{Δ} β^{2},

V a r (\hat{β}) = \frac{1^{'} V^{- 1} 1}{Δ} β^{2},

C o v (\hat{λ}, \hat{β}) = \frac{α^{'} V^{- 1} 1}{Δ} β^{2},

By using the Matlab program (version 2021), the coefficients of the BLUEs for

λ, β

and variance-covariance for

λ

and

β

are given in Table 1 and Table 2, respectively.

Table 1. Coefficients for the BLUE of

λ

and

β

(

ν = 0.5, α = 1.5

).

Table 2. Coefficient for variance-covariance of the BLUE of

λ

and

β

in terms of

β^{2}

(

ν = 0.5, α = 1.5

).

3.2. Best Linear Invariant Estimates (BLIEs)

The best linear invariant estimators (BLIE)

\tilde{λ}, \tilde{β}

of

λ

and

β

(in terms of minimum mean squared error and invariance with respect to the location parameter

λ

) are

\tilde{λ} = \hat{λ} - \hat{β} (\frac{E_{12}}{1 + E_{22}})

(17)

and

\tilde{β} = \frac{\hat{β}}{1 + E_{22}},

(18)

where

\hat{μ}

and

\hat{β}

are BLUE of

λ

and

β

, and

(\begin{matrix} V a r (\hat{λ}) & C o v (\hat{λ}, \hat{β}) \\ C o v (\hat{λ}, \hat{β}) & V a r (\hat{β}) \end{matrix}) = {\hat{β}}^{2} (\begin{matrix} E_{11} & E_{12} \\ E_{21} & E_{22} \end{matrix})

The mean square errors of these estimators are

M S E (\tilde{λ}) = {\hat{β}}^{2} [E_{11} - \frac{E_{12}^{2}}{1 + E_{22}}]

and

M S E (\tilde{β}) = {\hat{β}}^{2} [\frac{E_{22}^{2}}{1 + E_{22}}]

3.3. Prediction of the Future Record

Finally, the concept for a specific phenomena that is probabilistically defined by the APTPO function probability distribution has been introduced in this paper. We generated some lower record value distributional features and achieved certain attributes that are important to this distribution. To predict future observations, this can be accomplished by utilizing return levels.

F (x_{s}) = 1 / s, s > r

which gives

x_{s} = \hat{λ} + \hat{β} {(\frac{ln [α - 1 - s]}{ln (α)})}^{(1 / ν)}

(19)

4. The Maximum Likelihood Technique

Let

x_{1}, x_{2}, \dots, x_{n}

follow a completely random sampling from the APTPO distribution function (7). The records required for this investigation were obtained as follows: The first recording,

X_{L (1)}

, is

x_{1}

, so the first observation is

X_{L (1)} = x_{1}

. Observing the independently distributed random variables with the same distribution

X_{i}^{^{'} s}

sequentially from

x_{2}, \dots x_{n}

yields the second record value,

X_{L (2)}

. Let the next observation that is less than

X_{L (1)}

need a number of trials to acquire

X_{L (2)}

equal to

K_{1}

. For example, let the next observation that is less than

X_{L (1)}

be

X_{7}

, so the number of trials to obtain

X_{L (2)}

will be

K_{1} = 6

.

Let

X_{L (1)} = x_{1}, K_{1} = k_{1}, X_{L (2)} = x_{2}, K_{2} = k_{2} \dots, X_{L (r)} = x_{r}, K_{r} = k_{r}

, where

{X_{L (i)}, 1 \leq i \leq r}

is the record value sequence and

{K_{i}, i > 0}

and

k_{r} = 1

is the inter-record time sequence. Note that the number of records acquired

(r)

will be smaller than n, the size of the entire random data sample, when this approach is used. It’s important to emphasize that the lower record values are the record numbers that do not include the inter-record times.

The likelihood function can be stated as

L (x, μ, β) = \prod_{i = 1}^{r} f (x_{i}) {[1 - F (x_{i})]}^{(k_{i} - 1)}

For the record-breaking samples, let

X_{L (1)} = x_{1}, K_{1} = k_{1}, X_{L (2)} = x_{2}, K_{2} = k_{2} \dots, X_{L (r)} = x_{r}, K_{r} = k_{r}

, where

f (x_{i})

and

F (x_{i})

are the PDF and CDF, respectively, of the random variable from which the record observations are obtained.

Applying the likelihood function to the record observations obtained from the APTPO distribution, we obtain that

L_{1} (x, λ, β) = \prod_{i = 1}^{r} \frac{ν log (α)}{α - 1} {z_{i}}^{ν - 1} α^{β z_{i}^{ν}} {[1 - \frac{α β z_{i}^{ν} - 1}{α - 1}]}^{k_{i} - 1}

(20)

where,

z_{i} = \frac{x_{i} - λ}{β}

.

The log of the likelihood function is

\begin{matrix} log L_{1} (x, λ, β) = \sum_{i = 1}^{r} {log (\frac{ν log (α)}{α - 1}) + (ν - 1) \sum_{i = 1}^{r} log (z_{i}) + β log (α) \sum_{i = 1}^{r} z_{i}^{ν} \\ + (k_{i} - 1) \sum_{i = 1}^{r} log [\frac{α - α β z_{i}^{ν}}{α - 1}]} \end{matrix}

(21)

By taking the partial derivative of Equation (21) with respect to

λ

and

β

, we obtain the following equations:

\frac{\partial log L_{1} (x, λ, β)}{\partial λ} = \sum_{i = 1}^{r} \frac{ν - 1}{x_{i} - λ} - ν log (α) \sum_{i = 1}^{r} z_{i}^{ν - 1} + \sum_{i = 1}^{r} \frac{(k_{i} - 1) α ν z_{i}^{ν - 1}}{β (α - α z_{i}^{ν})},

(22)

\frac{\partial log L_{1} (x, λ, β)}{\partial β} = \sum_{i = 1}^{r} - \frac{ν - 1}{β} + log (α) \sum_{i = 1}^{r} z_{i}^{ν} + \frac{ν log (α)}{β} \sum_{i = 1}^{r} z_{i}^{ν - 1} + \frac{α}{β} \sum_{i = 1}^{r} \frac{(k_{i} - 1) z_{i}^{ν}}{α - α z_{i}^{ν}}

(23)

The maximum likelihood estimators for

μ

and

β

for the record samples are obtained by setting Equations (22) and (23) to zero.

The estimates of the parameters inherent in Equations (20) and (21) are obtained as follows for the complete sample

X_{1}, X_{2}, \dots, X_{n}

.

We can write the log-likelihood from the APTPO probability density function, given by Equation (8), as follows:

log (L_{2} (x, λ, β)) = \sum_{i = 1}^{r} {log (\frac{ν log (α)}{α - 1}) + (ν - 1) \sum_{i = 1}^{n} log (z_{i}) + β log (α) \sum_{i = 1}^{n} z_{i}^{ν}

(24)

By taking the partial derivative of (24) with respect to

λ

and

β

, we obtain the following equations:

\frac{\partial log L_{2} (x, λ, β)}{\partial λ} = \sum_{i = 1}^{r} \frac{ν - 1}{x_{i} - λ} - ν log (α) \sum_{i = 1}^{r} z_{i}^{ν - 1},

(25)

and

\frac{\partial log L_{2} (x, λ, β)}{\partial β} = \sum_{i = 1}^{r} - \frac{ν - 1}{β} + log (α) \sum_{i = 1}^{r} z_{i}^{ν} + \frac{ν log (α)}{β} \sum_{i = 1}^{r} z_{i}^{ν - 1}

(26)

The maximum likelihood estimators for

μ

and

β

for the complete samples are obtained by setting Equations (25) and (26) to zero.

To estimate the approximate confidence intervals for the parameters of the APTPO distribution, one needs the

2 \times 2

observed information matrices for the record-breaking samples and complete sample, which are denoted by

I (Θ_{1})

and

I (Θ_{2}),

where

Θ_{1} = (λ, β)

and

Θ_{2} = (λ, β)

. Then, the

2 \times 2

total observed information matrix associated with the APTPO distribution for the record-breaking samples is given by

I (Θ_{1})

, where their parameters are replaced by their

M L E ’ s

, where

I (Θ_{1}) = (\begin{matrix} I_{λ λ} & I_{λ β} \\ I_{β β} & I_{β β} \end{matrix})

with

\frac{\partial^{2} log L_{1} (x, λ, β)}{\partial λ^{2}} = \sum_{i = 1}^{r} \frac{- ν + 1}{{(x_{i} - λ)}^{2}} + \frac{ν (ν - 1) log (α)}{β} \sum_{i = 1}^{r} z_{i}^{ν - 2}

- \sum_{i = 1}^{r} \frac{(k_{i} - 1) ν^{2} z_{i}^{2 (ν - 1)}}{{(1 - β z_{i}^{ν})}^{2}} - \sum_{i = 1}^{r} \frac{(k_{i} - 1) ν (ν - 1) z_{i}^{ν - 2}}{(1 - β z_{i}^{ν})},

\frac{\partial^{2} log L_{1} (x, λ, β)}{\partial λ β} = \sum_{i = 1}^{r} - \frac{ν (ν - 1) log (α) z (ν - 1)}{β} - \sum_{i = 1}^{r} \frac{ν z_{i}^{ν} (k_{i} - 1) (ν - 1)}{(x - λ) (β^{2} z_{i}^{2 ν} - 2 β z_{i}^{ν} + 1)}

\frac{\partial^{2} log L_{1} (x, λ, β)}{\partial β^{2}} = \sum_{i = 1}^{r} \frac{ν - 1}{β^{2}} + \sum_{i = 1}^{r} - \frac{ν (ν - 1) log (α) z^{ν}}{β} - \sum_{i = 1}^{r} \frac{(k_{i} - 1) (ν - 1) (z_{i}^{ν} ν - β z_{i}^{2 ν})}{β (β^{2} z_{i}^{2 ν} - 2 β z_{i}^{ν} + 1)}

The

2 \times 2

total observed information matrix associated with the APTPO distribution is given by

I (Θ_{2}),

wherein the parameters are replaced by their

M L E ’ s

, where

I (Θ_{2}) = (\begin{matrix} I_{λ λ} & I_{λ β} \\ I_{β λ} & I_{β β} \end{matrix})

with

\frac{\partial^{2} log L_{2} (x, λ, β)}{\partial λ^{2}} = \sum_{i = 1}^{r} \frac{- ν + 1}{{(x_{i} - λ)}^{2}} + \frac{ν (ν - 1) log (α)}{β} \sum_{i = 1}^{r} z_{i}^{ν - 2}

\frac{\partial^{2} log L_{2} (x, λ, β)}{\partial λ β} = \sum_{i = 1}^{r} - \frac{ν (ν - 1) log (α) z (ν - 1)}{β}

\frac{\partial^{2} log L_{2} (x, λ, β)}{\partial β^{2}} = \sum_{i = 1}^{r} \frac{ν - 1}{β^{2}} + \sum_{i = 1}^{r} \frac{ν (ν - 1) log (α) z (ν - 2)}{β}

Under standard regularity conditions,

(Θ_{1} - \hat{Θ_{1}})

asymptotically follows the multivariate normal distribution

N_{3} (o, - I {(\hat{Θ_{1}})}^{- 1})

and the asymptotic distribution of

(Θ_{2} - \hat{Θ_{2}})

is

N_{2} (o, - I {(\hat{Θ_{2}})}^{- 1})

. These distributions can be utilized to construct approximate confidence intervals for the model parameters. Thus, denoting, for example, the total observed information matrix evaluated at

\hat{Θ_{1}}

, that is,

- I (\hat{Θ_{1}})

, by

- \hat{I}

, one would have the following approximate

100 (1 - α) %

confidence intervals for the parameters of the APTPO distribution:

\hat{λ} \pm z_{\frac{α}{2}} \sqrt{{(- {\hat{I}}^{- 1})}_{λ λ}} \hat{β} \pm z_{\frac{α}{2}} \sqrt{{(- {\hat{I}}^{- 1})}_{β β}}

where

z_{\frac{α}{2}}

denotes the

100 {(1 - \frac{α}{2})}^{t h}

percentile of the standard normal distribution.

Goodness of Fit Tests

The goodness of fit for a statistical model describes how well it matches a set of data. Goodness of fit measures are frequently used to characterize the discrepancy between actual values and values that would have been anticipated under the relevant model. When testing statistical hypotheses, such data can be used, for illustration, to check for residual normality and determine whether two samples were drawn from the same distribution. When applying an Akaike information criterion (AIC) (see Akaike [16]), this can be evaluated as follows:

A I C = - 2 log (L) + 2 K

(27)

where L is the likelihood of the function and K is the number of estimated parameters. Note that smaller values indicate a better model.

5. Simulation Study

The performance of the estimators established in the preceding section can be verified through the following simulation studies.

1. The APTPO distribution is used with

λ = 20, β = 1, ν = 0.5

, and

α = 1.5

from a small random sample of size

n = 20

to serve as a model:

19.94, 19.97, 19.99, 19.24, 19.3114, 19.916, 19.881, 19.84, 19.8, 19.38, 19.996, 19.755, 19.697

19.64, 19.58, 19.08, 19, 19.52, 19.16, 19.94

Four record values can be collected from the given random sample, such as

x_{i} = 19.94, 19.24, 19.08, 19

k_{i} = 3, 11, 1, 1

We determined the

λ

and

β

estimation parameters for

r = 1, 2, 3,

and 4 by applying the BLUE and BLIE methods. The standard error for each case is calculated. Applying Equation (19) in each situation yields the prediction for the fifth future observation. The results are listed in Table 3.

Table 3. The result of the simulation.

On the other hand, applying the MLE method to estimate the parameters of the APTPO distribution (see Section 4) for complete data and using inter-record time

k_{i}

, along with applying the AIC method (Equation (27)), provides the results shown in Table 4. This table indicates that the value of AIC in the case of inter-record times is smaller than the value in the case of the complete sample. This implies that the use of inter-record times is preferable.

Table 4. The maximum likelihood estimates and statistical model for goodness of fit.

2. The APTPO distribution is used with

λ = 50, β = 1.5, ν = 0.5

, and

α = 2

from a large random sample of size

n = 50

to serve as a model:

49.9815, 49.4179, 49.3826, 49.6761, 49.9952, 49.9894, 49.9599, 49.2360, 49.8097, 49.8765,

48.8804, 48.7138, 49.5201, 49.3467, 49.9988, 48.7558, 49.6465, 49.6161, 49.4526, 49.9312,

48.9214, 48.6715, 49.0818, 49.9145, 49.7851, 49.7594, 48.6289, 49.5849, 49.9464, 49.8555,

49.1209, 49.8332, 49.0023, 49.9716, 48.5862, 49.7048, 49.0422, 49.8962, 48.9620, 50,

49.1597, 48.7976, 48.8392, 49.3103, 49.2734, 49.1981, 48.5432, 48.5000, 49.7326, 49.48674

Eleven record values can be collected from the given random sample, such as

x_{i} = 49.9815, 49.4179, 49.3826, 49.2360, 48.8804, 48.7558, 48.6715,

48.6289, 48.5862, 48.5432, 48.5000

k_{i} = 1, 1, 1, 5, 3, 5, 6, 5, 8, 12, 1 .

We determined the

λ

and

β

estimation parameters for

r = 1, 2, 3,

and 4 by applying the BLUE and BLIE methods. The standard error for each case is calculated. Applying Equation (19) in each situation yields the prediction for the 12th future observation. The results are listed in Table 5.

Table 5. The result of the simulation.

On the other hand, applying the MLE method to estimate the parameters of the APTPO distribution (see Section 4) for complete data and using inter-record time

k_{i}

provides the results shown in Table 6.

Table 6. The maximum likelihood estimates and statistical model for simulation.

6. Real Life Example

The vinyl chloride data, obtained from clean upgrading monitoring wells in mg/L, were used by Bhaumik et al. [17]. The data set includes the following sample:

5.1, 1.2, 1.3, 0.6, 0.5, 2.4, 0.5, 1.1, 8.0, 0.8, 0.4, 0.6, 0.9, 0.4, 2.0, 0.5, 5.3, 3.2, 2.7, 2.9, 2.5,

2.3, 1.0, 0.2, 0.1, 0.1, 1.8, 0.9, 2.0, 4.0, 6.8, 1.2, 0.4, 0.2 .

Idika et al. [14] proved that the data follow the APTPO distribution, yielding the smallest K-S statistic values and the largest K-S p-value (0.9602) with

\hat{ν} = 2.021596

and

\hat{α} = 21.074041

. Seven record values can be collected from the given random sample, such as

x_{i} = 5.1, 1.2, 0.6, 0.5, 0.4, 0.2, 0.1

k_{i} = 1, 1, 2, 1, 6, 13, 1

We determined the

λ

and

β

estimation parameters for

r = 1, 2, 3, 4, 5, 6

, and 7 by applying the BLUE and BLIE methods. The standard error for each case is determined, and the results are listed in Table 7.

Table 7. The result of the estimation.

On the other hand, applying the MLE method to estimate the parameters of the APTPO distribution (see Section 4) for complete data and using inter-record time

k_{i}

, along with applying AIC method (Equation (27)), provides the results shown in Table 8. This table indicates that the value of AIC in the case of inter-record times is smaller than the value in the case of the complete sample. This means that using inter-record times is preferable.

Table 8. The maximum likelihood estimates and statistical model for goodness of fit.

7. Conclusions

In this study, we introduced the concept of records for a specific event described probabilistically by the APTPO PDF. We found the coefficients for the best linear unbiased estimates and the best linear invariant estimators. A method for forecasting future observations based on available data was also provided. The analytical framework for records and maximum probability estimates was established. Additionally, we calculated approximate confidence intervals for the parameters of the APTPO distribution. We used engaging classical applications to demonstrate the value of our analytical advancements. Furthermore, it was utilized to model a real-life scenario, serving as an explanatory tool to verify that our data fit the suggested distributions. Finally, the estimates from our study using inter-record times outperformed earlier findings.

Author Contributions

Conceptualization, R.A.E.-W.A. and T.R.; methodology, R.A.E.-W.A. and T.R.; software, R.A.E.-W.A. and T.R.; validation, R.A.E.-W.A. and T.R.; formal analysis, R.A.E.-W.A. and T.R.; investigation, R.A.E.-W.A. and T.R.; resources, R.A.E.-W.A. and T.R.; data curation, R.A.E.-W.A. and T.R.; writing—original draft preparation, R.A.E.-W.A. and T.R.; writing—review and editing, R.A.E.-W.A. and T.R.; visualization, R.A.E.-W.A. and T.R.; supervision, R.A.E.-W.A. and T.R.; project administration, R.A.E.-W.A. and T.R.; funding acquisition, T.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Deanship of Scientific Research, Qassim University, Saudi Arabia.

Data Availability Statement

The data presented in this study are available in this article.

Acknowledgments

The researchers would like to thank the Deanship of Scientific Research, Qassim University for funding publication of this project.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Chandler, K.M. The Distribution and Frequency of Record Values. J. R. Stat. Soc. Ser. B 1952, 14, 220–228. [Google Scholar] [CrossRef]
Feller, W. An Introduction to Probability Theory and Its Application; John Wiley & Sons, Inc.: New York, NY, USA, 1965. [Google Scholar]
Samaniego, F.J.; Whitaker, L.R. On Estimating Population Characteristics from Record-Breaking Observations. I. Parametric Results. Nav. Res. Logist. Q. 1986, 33, 531–543. [Google Scholar] [CrossRef]
Gulati, S.; Padgett, W.J. Parametric and Nonparametric Inference from Record-Breaking Data; Springer: New York, NY, USA, 2003. [Google Scholar]
Raul, G.; Javier, F.; Lina, M.; Gerardo, S. Statistical Inference for the Weibull Distribution Based on δ-Record Data. Symmetry 2019, 12, 20. [Google Scholar]
Ahsanullah, M. Linear Prediction of Record Values for the Two Parameter Exponential Distribution. Ann. Inst. Stat. Math. 1980, 32, 363–368. [Google Scholar] [CrossRef]
Nigm, E.M. Record Values from Inverse Weibull Distribution and Associated Inference. J. Appl. Stat. 2007, 16, 103–114. [Google Scholar]
Johnson, N.L.; Kotz, S.; Balakrishnan, N. Continuous Univariate Distributions-I; John Wiley: New York, NY, USA, 1994. [Google Scholar]
Mahadevi, A.; Kundu, D. A new method of generating distribution with an application to exponential distribution with an application to exponential distribution. Comm. Statist. Theory Methods 2017, 46, 6543–6557. [Google Scholar] [CrossRef]
Mazen, N.; Ahmed, Z.A.; Mohammed, K.S. Estimation Methods of Alpha Power Exponential Distribution with applications to Engineering and Medical Data. Pak. J. Stat. Oper. Res. 2020, 16, 149–166. [Google Scholar]
Refah, A.; Ahmed, E.; Hoda, R.; Mazen, N. Inferences for Alpha Power Exponential Distribution Using Adaptive Progressively Type-II Hybrid Censored Data with Applications. Symmetry 2022, 14, 651. [Google Scholar] [CrossRef]
Fatehi, Y.E.; Chhaya, D.S. Alpha Power Transformed Extended power Lindley Distribution. J. Stat. Theory Appl. 2022, 22, 1–18. [Google Scholar]
Luca, L. Combination of measurements and the BLUE method. In Proceedings of the 12th Quark Confinement, Thessaloniki, Greece, 28 August–4 September 2017; Volume 137, p. 106. [Google Scholar] [CrossRef]
Idika, E.; Johnson Ohakweb, O.; Osuc, B.O.; Chris, U. Onyemachid α-Power transformed transformed power function distribution with applications. Heliyon 2021, 7, e08047. [Google Scholar]
Lloyd, E.H. Least-squares estimation of location and scale parameters using order statistics. Biometrika 1952, 39, 88–95. [Google Scholar] [CrossRef]
Akaike, H. A New Look at the Statistical Model Identification. IEEE Trans. Automat. Contr. 1974, 19, 716–723. [Google Scholar] [CrossRef]
Bhaumik, D.K.; Kapur, K.; Gibbons, R.D. Testing Parameters of a Gamma Distribution for Small Samples. Technometrics 2009, 51, 326–334. [Google Scholar] [CrossRef]

Figure 1. The CDF of APTPO distribution for different values of

α

and

ν

.

Figure 2. The PDF of APTPO distribution for different values of

α

and

ν

.

Figure 3. The APTPO PDF of the lower record values for different values of

α

and

ν

and r.

Table 1. Coefficients for the BLUE of

λ

and

β

(

ν = 0.5, α = 1.5

).

Table 1. Coefficients for the BLUE of

λ

and

β

(

ν = 0.5, α = 1.5

).

n	r	The Coefficient for the BLUE of $λ$	The Coefficient for the BLUE of $β$
2	1	−0.561	4.247
2	2	1.561	−4.2479
3	1	−0.1014	2.997
3	2	−0.1654	0.45
3	3	1.2467	−3.4472
4	1	−0.0212	2.7789
4	2	−0.0346	0.0942
4	3	−0.158	0.43
4	4	1.2138	−3.3031
5	1	−0.0045	2.7335
5	2	−0.0074	0.0201
5	3	−0.0337	0.0916
5	4	−0.1557	0.4236
5	5	1.2012	−3.2688
6	1	−0.001	2.7238
6	2	−0.0016	0.0042
6	3	−0.0071	0.0193
6	4	−0.0327	0.089
6	5	−0.1584	0.04311
6	6	1.2007	−3.2679
7	1	−0.0002	2.7217
7	2	−0.0003	0.0009
7	3	−0.0015	0.0041
7	4	−0.0007	0.0188
7	5	−0.0334	0.0909
7	6	−0.1535	0.4177
7	7	1.1958	−3.2541
8	1	0	2.7213
8	2	−0.0001	0.0002
8	3	−0.0003	0.0008
8	4	−0.0014	0.0039
8	5	−0.007	0.019
8	6	−0.0321	0.0873
8	7	−0.1548	0.4212
8	8	1.1957	−3.2537
9	1	0	2.7212
9	2	0	0
9	3	−0.0001	0.0002
9	4	−0.0003	0.0008
9	5	−0.0014	0.0039
9	6	−0.0066	0.018
9	7	−0.0319	0.0869
9	8	−0.157	0.4273
9	9	1.1974	−3.2584
10	1	0	2.7212
10	2	0	0
10	3	0	0
10	4	−0.0001	0.002
10	5	−0.0003	0.0008
10	6	−0.0014	0.0037
10	7	−0.0066	0.0179
10	8	−0.0324	0.0881
10	9	−0.1552	0.4223
10	10	1.1959	−3.2543

Table 2. Coefficient for variance-covariance of the BLUE of

λ

and

β

in terms of

β^{2}

(

ν = 0.5, α = 1.5

).

Table 2. Coefficient for variance-covariance of the BLUE of

λ

and

β

in terms of

β^{2}

(

ν = 0.5, α = 1.5

).

	$r$ = 2	$r$ = 3	$r$ = 4	$r$ = 5	$r$ = 6	$r$ = 7	$r$ = 8	$r$ = 9	$r$ = 10
1	0.0522	0.0094	0.002	0.0004	0.000009	0.00009	0	0	0
2	0.8417	1.1585	0.2137	1.2252	1.2277	1.2282	1.2283	1.2284	1.228
3	−0.1421	−0.0257	−0.0054	−0.0011	−0.00024	−0.0005	−0.00001	0	0

Table 3. The result of the simulation.

	BLUE ( $\hat{θ}$ )	S.E ( $\hat{θ}$ )	BLIE ( $\tilde{θ}$ )	S.E ( $\tilde{θ}$ )
$λ$	18.959	0.1186	18.9697	0.3149
$β$	2.6692	2.9407	0.2767	2.5271
Prediction 5th observation	18.9797		19.041

Table 4. The maximum likelihood estimates and statistical model for goodness of fit.

	$\hat{λ}$	$\hat{β}$	AIC
Complete sample	18.959	0.0011	18.959
Inter record times	2.6692	2.958	0.674

Table 5. The result of the simulation.

	BLUE ( $\hat{θ}$ )	S.E ( $\hat{θ}$ )	BLIE ( $\tilde{θ}$ )	S.E ( $\tilde{θ}$ )
$λ$	48.4893	0	48.4893	0.0035
$β$	−3.8069	15.8646	−0.2257	3.6924
Prediction 12th observation	48.4386		48.4863

Table 6. The maximum likelihood estimates and statistical model for simulation.

	$\hat{λ}$	$\hat{β}$
Complete sample	48.5	2.7394
Inter record times	48.5755	0.4994

Table 7. The result of the estimation.

	BLUE ( $\hat{θ}$ )	S.E ( $\hat{θ}$ )	BLIE ( $\tilde{θ}$ )	S.E ( $\tilde{θ}$ )
$λ$	0.2265	0.1763	4.9421	−0.399
$β$	5.6408	0	8.2233	0

Table 8. The maximum likelihood estimates and statistical model for goodness of fit.

	$\hat{λ}$	$\hat{β}$	AIC
Complete sample	5.0849	10.1106	−10.4665
Inter record times	5.1	−1.0381	−45.3494

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Different Methods for Estimating Default Parameters of Alpha Power-Transformed Power Distributions Using Record-Breaking Data

Abstract

1. Introduction

2. Construction of the Alpha Power Transformed of Power (APTPO) Distribution

3. Estimating Parameters of the APTPO Probability Distribution Using Lower Record Values

3.1. Estimate Parameters of APTPO Distribution Using Best Linear Unbiased Estimates (BLUEs)

3.2. Best Linear Invariant Estimates (BLIEs)

3.3. Prediction of the Future Record

4. The Maximum Likelihood Technique

Goodness of Fit Tests

5. Simulation Study

6. Real Life Example

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics