Abstract
The current study addresses the estimation of the default parameters of alpha power-transformed power (APTPO) distributions. For the location and scale parameters of the APTPO distributions, we provide coefficients for both the best linear unbiased estimators (BLUE) and the best linear invariant estimators (BLIE) methods. Furthermore, we establish a forecast for future records. The parameters of the APTPO distribution are estimated using the maximum likelihood estimation method (MLE). The goodness-of-fit test (using Akaike information criterion (AIC)) is computed using both the inter-record time sequence and the entire sample. Also, we utilize a simulation approach to demonstrate the practicality and benefits of our perspective. Finally, we demonstrate the accuracy of these parameters and the performance of estimators through a real-life example.
1. Introduction
When acquiring observations becomes challenging or when observational data is being lost during an experiment, records become significant. The concepts of record values, record times, and inter-record times for analyzing the breaking strength data of a specific material were initially introduced by Chandler [1]. He concluded that the predicted value of the inter-record time is infinite for every particular probability distribution function of a random variable. Feller [2] provided several examples of record values in the context of gambling issues.
Suppose that are a sequence of independent and identically distributed random variables with the cumulative probability distribution function .
Let for . We say is a lower record value of if . When considering upper record values, a similar definition exists. By definition, is a lower as well as upper record value. The record times reveal the indices at which the lower record values occur. , where , and . The probability density function of is given by the following:
And the cumulative probability distribution function of is the following:
The joint probability density function of two lower record values, and , is given by
The concept of parametric inference for data breaking records was first introduced by Samaniego and Whitaker [3]. They explored the characteristics of estimates using the maximum likelihood method for the mean of a basic exponential probability distribution. Gulati and Padgett [4] expanded Samaniego and Whitaker’s technique to include the Weibull probability distribution. Raul et al. [5] studied the maximum likelihood and Bayesian estimation of parameters and prediction of future records for the Weibull distribution using -record data. Ahsanullah [6] examined data from an exponential distribution, focusing on predicting the record value based on the first m record values . Nigm [7] was the first to present record values for the Inverse Weibull distribution (IW) along with explicit formulas for its means, variances, and covariances. Furthermore, certain concurrent inferences regarding the forecast of a future record value and the examination of the current record values for spuriousness were made.
Various random events, observed in specific survival, financial, or reliability studies, have been thoroughly modeled using asymmetrical models such as Gumbel, logistic, Weibull, and generalized extreme value distributions.
The Pareto distribution is well-known in various fields, including reliability analysis, actuarial science, survival analysis, life testing, economics, finance, hydrology, telecommunication, physics, and engineering. According to Johnson et al. [8], the cumulative density function defines the Pareto distribution of the first kind.
In a novel approach to generating distributions with an application to the exponential distribution, Mahadevi and Kundu [9] introduced the alpha power transformation (APT) distribution to incorporate skewness into the baseline distribution. The formula for the APT cumulative distribution function is
and the density function formula for APT is as follows:
There are several research papers that have applied this distribution in various ways. For instance, Mazen et al. [10] focused on the finite sample characteristics of Monte Carlo simulation-based parameter estimates for the alpha power exponential distribution. They also examined a single real data set and estimated the distribution parameters under conflicting hazards using the maximum likelihood approach. Also, Refah et al. [11] addressed estimation issues relating to the alpha power exponential distribution and employed an adaptive progressive Type-II hybrid censoring strategy. Maximum likelihood and Bayesian approaches were used to estimate unknown parameters, reliability, and hazard rate functions. Furthermore, in the study conducted by Fatehi and Chhaya [12], the alpha power transformed extended power Lindley (APTEPL) distribution, which is a new generalization of the extended power Lindley distribution, was explored and introduced.
Now, let X be a complete random variable from the power function probability distribution, with CDF and PDF given by
where v is the shape parameter, is the location parameter, and is the scale parameter.
This work aims to propose a new generalization of the Power distribution, known as the alpha power transformed of Power APTPO distribution, according to Equations (3) and (4). Approximate methods, such as the best linear unbiased estimates (BLUE), are frequently practical. The BLUE, which considers both individual uncertainties and their correlations, is commonly used. If the true uncertainties and their correlations are known, the approach is inherently impartial (see Luca [13]). The Transformed Power Function distribution was expanded upon by Idika et al. [14] as the APTPO. Here are some characteristics of the APTPO distribution. Three approaches were used for parameter estimation: maximum likelihood, ordinary least-squares, and weighted least-squares. After comparing the outcomes of a simulation research, the authors opted for maximum likelihood. For the APTPO distribution, breaking data is used to determine the coefficients for the parameters of our proposed distribution for both the best linear unbiased estimators (BLUE) and the best linear invariant estimators (BLIE). Forecasting future observations is possible by utilizing the return level for the entire sample from the APTPO distribution.
The work is outlined as follows: Section 2 determines the construction of our proposed distribution and the probability density function (PDF) of the lower record values. The effect of the parameters is also illustrated graphically. Section 3 employs BLUE and BLIE methods to estimate the parameters of the APTPO distribution based on lower record values. This section covers future record prediction and simulation studies. In Section 4, based on the inter-record time sequence and the complete sample, the APTPO distribution’s parameters are estimated using the maximum likelihood estimation method. The remaining portions of this section compare the parameters in the inter-record times and the entire sample using a goodness-of-fit test. Section 5 provides an illustrative example to demonstrate the previous applications of the new distribution. Finally, in Section 6, conclusions are presented.
2. Construction of the Alpha Power Transformed of Power (APTPO) Distribution
The Alpha Power Transformed of Power (APTPO) distribution is a novel mathematical structure introduced using the APT method, as follows: Let X be a random variable following the complete power function. The cumulative distribution function (CDF) and probability density function (PDF) of the APTPO distribution are given by
By setting and using Equations (7) and (8), the CDF and PDF of the APTPO distribution can be expressed as follows:
So, the probability density function of from the APTPO distribution is given by
The effects of the parameter on the shape of the distributions are illustrated graphically in Figure 1 and Figure 2. Plots of the PDF and CDF of the APTPO distribution of X are displayed in Figure 1 and Figure 2 for certain parameter values. Figure 3 presents a plot of the APTPO PDF of the lower record values for various parameters. These charts demonstrate the significant flexibility of the proposed model.
Figure 1.
The CDF of APTPO distribution for different values of and .
Figure 2.
The PDF of APTPO distribution for different values of and .
Figure 3.
The APTPO PDF of the lower record values for different values of and and r.
3. Estimating Parameters of the APTPO Probability Distribution Using Lower Record Values
In this section, we use lower record values to estimate the parameters of the APTPO probability distribution. Section 3.1 presents the derivation of the BLUE based on the r lower record values of the APTPO probability distribution. Section 3.2 details the derivation of the BLIE using the r-lower record values from the APTPO probability distribution. Section 3.3 focuses on the development of a prediction for future records.
3.1. Estimate Parameters of APTPO Distribution Using Best Linear Unbiased Estimates (BLUEs)
By applying Equation (11), the moment of from the APTPO probability distribution is given as
Let then,
Let , and we can obtain that
For , we have
For , we can calculate , which helps in the calculation of
Consequently, for we can calculate the covariance of and as follows:
By applying the following theorem, one can estimate the parameters of the APTPO probability distribution using lower record values:
Theorem 1.
Let be r record values from the APTPO probability distribution (Equation (8)). Then, the best linear unbiased estimates (BLUE), denoted as and , for λ and β, respectively, are as follows:
and
where
Proof.
then
where, from Equations (13) and (16),
Then, the entries of are given by
Applying the method introduced by Lioyd [15], the best linear unbiased estimates (BLUE), denoted as and , for and based on r lower record values from the APTPO distribution, are given by
and
□
The variance and covariance of are given by
By using the Matlab program (version 2021), the coefficients of the BLUEs for and variance-covariance for and are given in Table 1 and Table 2, respectively.
Table 1.
Coefficients for the BLUE of and ().
Table 2.
Coefficient for variance-covariance of the BLUE of and in terms of ().
3.2. Best Linear Invariant Estimates (BLIEs)
The best linear invariant estimators (BLIE) of and (in terms of minimum mean squared error and invariance with respect to the location parameter ) are
and
where and are BLUE of and , and
The mean square errors of these estimators are
and
3.3. Prediction of the Future Record
Finally, the concept for a specific phenomena that is probabilistically defined by the APTPO function probability distribution has been introduced in this paper. We generated some lower record value distributional features and achieved certain attributes that are important to this distribution. To predict future observations, this can be accomplished by utilizing return levels.
which gives
4. The Maximum Likelihood Technique
Let follow a completely random sampling from the APTPO distribution function (7). The records required for this investigation were obtained as follows: The first recording, , is , so the first observation is . Observing the independently distributed random variables with the same distribution sequentially from yields the second record value, . Let the next observation that is less than need a number of trials to acquire equal to . For example, let the next observation that is less than be , so the number of trials to obtain will be .
Let , where is the record value sequence and and is the inter-record time sequence. Note that the number of records acquired will be smaller than n, the size of the entire random data sample, when this approach is used. It’s important to emphasize that the lower record values are the record numbers that do not include the inter-record times.
The likelihood function can be stated as
For the record-breaking samples, let , where and are the PDF and CDF, respectively, of the random variable from which the record observations are obtained.
Applying the likelihood function to the record observations obtained from the APTPO distribution, we obtain that
where, .
The log of the likelihood function is
By taking the partial derivative of Equation (21) with respect to and , we obtain the following equations:
The maximum likelihood estimators for and for the record samples are obtained by setting Equations (22) and (23) to zero.
The estimates of the parameters inherent in Equations (20) and (21) are obtained as follows for the complete sample .
We can write the log-likelihood from the APTPO probability density function, given by Equation (8), as follows:
By taking the partial derivative of (24) with respect to and , we obtain the following equations:
and
The maximum likelihood estimators for and for the complete samples are obtained by setting Equations (25) and (26) to zero.
To estimate the approximate confidence intervals for the parameters of the APTPO distribution, one needs the observed information matrices for the record-breaking samples and complete sample, which are denoted by and where and . Then, the total observed information matrix associated with the APTPO distribution for the record-breaking samples is given by , where their parameters are replaced by their , where
with
The total observed information matrix associated with the APTPO distribution is given by wherein the parameters are replaced by their , where
with
Under standard regularity conditions, asymptotically follows the multivariate normal distribution and the asymptotic distribution of is . These distributions can be utilized to construct approximate confidence intervals for the model parameters. Thus, denoting, for example, the total observed information matrix evaluated at , that is, , by , one would have the following approximate confidence intervals for the parameters of the APTPO distribution:
where denotes the percentile of the standard normal distribution.
Goodness of Fit Tests
The goodness of fit for a statistical model describes how well it matches a set of data. Goodness of fit measures are frequently used to characterize the discrepancy between actual values and values that would have been anticipated under the relevant model. When testing statistical hypotheses, such data can be used, for illustration, to check for residual normality and determine whether two samples were drawn from the same distribution. When applying an Akaike information criterion (AIC) (see Akaike [16]), this can be evaluated as follows:
where L is the likelihood of the function and K is the number of estimated parameters. Note that smaller values indicate a better model.
5. Simulation Study
The performance of the estimators established in the preceding section can be verified through the following simulation studies.
1. The APTPO distribution is used with , and from a small random sample of size to serve as a model:
Four record values can be collected from the given random sample, such as
We determined the and estimation parameters for and 4 by applying the BLUE and BLIE methods. The standard error for each case is calculated. Applying Equation (19) in each situation yields the prediction for the fifth future observation. The results are listed in Table 3.
Table 3.
The result of the simulation.
On the other hand, applying the MLE method to estimate the parameters of the APTPO distribution (see Section 4) for complete data and using inter-record time , along with applying the AIC method (Equation (27)), provides the results shown in Table 4. This table indicates that the value of AIC in the case of inter-record times is smaller than the value in the case of the complete sample. This implies that the use of inter-record times is preferable.
Table 4.
The maximum likelihood estimates and statistical model for goodness of fit.
2. The APTPO distribution is used with , and from a large random sample of size to serve as a model:
Eleven record values can be collected from the given random sample, such as
We determined the and estimation parameters for and 4 by applying the BLUE and BLIE methods. The standard error for each case is calculated. Applying Equation (19) in each situation yields the prediction for the 12th future observation. The results are listed in Table 5.
Table 5.
The result of the simulation.
On the other hand, applying the MLE method to estimate the parameters of the APTPO distribution (see Section 4) for complete data and using inter-record time provides the results shown in Table 6.
Table 6.
The maximum likelihood estimates and statistical model for simulation.
6. Real Life Example
The vinyl chloride data, obtained from clean upgrading monitoring wells in mg/L, were used by Bhaumik et al. [17]. The data set includes the following sample:
Idika et al. [14] proved that the data follow the APTPO distribution, yielding the smallest K-S statistic values and the largest K-S p-value (0.9602) with and . Seven record values can be collected from the given random sample, such as
We determined the and estimation parameters for , and 7 by applying the BLUE and BLIE methods. The standard error for each case is determined, and the results are listed in Table 7.
Table 7.
The result of the estimation.
On the other hand, applying the MLE method to estimate the parameters of the APTPO distribution (see Section 4) for complete data and using inter-record time , along with applying AIC method (Equation (27)), provides the results shown in Table 8. This table indicates that the value of AIC in the case of inter-record times is smaller than the value in the case of the complete sample. This means that using inter-record times is preferable.
Table 8.
The maximum likelihood estimates and statistical model for goodness of fit.
7. Conclusions
In this study, we introduced the concept of records for a specific event described probabilistically by the APTPO PDF. We found the coefficients for the best linear unbiased estimates and the best linear invariant estimators. A method for forecasting future observations based on available data was also provided. The analytical framework for records and maximum probability estimates was established. Additionally, we calculated approximate confidence intervals for the parameters of the APTPO distribution. We used engaging classical applications to demonstrate the value of our analytical advancements. Furthermore, it was utilized to model a real-life scenario, serving as an explanatory tool to verify that our data fit the suggested distributions. Finally, the estimates from our study using inter-record times outperformed earlier findings.
Author Contributions
Conceptualization, R.A.E.-W.A. and T.R.; methodology, R.A.E.-W.A. and T.R.; software, R.A.E.-W.A. and T.R.; validation, R.A.E.-W.A. and T.R.; formal analysis, R.A.E.-W.A. and T.R.; investigation, R.A.E.-W.A. and T.R.; resources, R.A.E.-W.A. and T.R.; data curation, R.A.E.-W.A. and T.R.; writing—original draft preparation, R.A.E.-W.A. and T.R.; writing—review and editing, R.A.E.-W.A. and T.R.; visualization, R.A.E.-W.A. and T.R.; supervision, R.A.E.-W.A. and T.R.; project administration, R.A.E.-W.A. and T.R.; funding acquisition, T.R. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by the Deanship of Scientific Research, Qassim University, Saudi Arabia.
Data Availability Statement
The data presented in this study are available in this article.
Acknowledgments
The researchers would like to thank the Deanship of Scientific Research, Qassim University for funding publication of this project.
Conflicts of Interest
The authors declare no conflicts of interest.
References
- Chandler, K.M. The Distribution and Frequency of Record Values. J. R. Stat. Soc. Ser. B 1952, 14, 220–228. [Google Scholar] [CrossRef]
- Feller, W. An Introduction to Probability Theory and Its Application; John Wiley & Sons, Inc.: New York, NY, USA, 1965. [Google Scholar]
- Samaniego, F.J.; Whitaker, L.R. On Estimating Population Characteristics from Record-Breaking Observations. I. Parametric Results. Nav. Res. Logist. Q. 1986, 33, 531–543. [Google Scholar] [CrossRef]
- Gulati, S.; Padgett, W.J. Parametric and Nonparametric Inference from Record-Breaking Data; Springer: New York, NY, USA, 2003. [Google Scholar]
- Raul, G.; Javier, F.; Lina, M.; Gerardo, S. Statistical Inference for the Weibull Distribution Based on δ-Record Data. Symmetry 2019, 12, 20. [Google Scholar]
- Ahsanullah, M. Linear Prediction of Record Values for the Two Parameter Exponential Distribution. Ann. Inst. Stat. Math. 1980, 32, 363–368. [Google Scholar] [CrossRef]
- Nigm, E.M. Record Values from Inverse Weibull Distribution and Associated Inference. J. Appl. Stat. 2007, 16, 103–114. [Google Scholar]
- Johnson, N.L.; Kotz, S.; Balakrishnan, N. Continuous Univariate Distributions-I; John Wiley: New York, NY, USA, 1994. [Google Scholar]
- Mahadevi, A.; Kundu, D. A new method of generating distribution with an application to exponential distribution with an application to exponential distribution. Comm. Statist. Theory Methods 2017, 46, 6543–6557. [Google Scholar] [CrossRef]
- Mazen, N.; Ahmed, Z.A.; Mohammed, K.S. Estimation Methods of Alpha Power Exponential Distribution with applications to Engineering and Medical Data. Pak. J. Stat. Oper. Res. 2020, 16, 149–166. [Google Scholar]
- Refah, A.; Ahmed, E.; Hoda, R.; Mazen, N. Inferences for Alpha Power Exponential Distribution Using Adaptive Progressively Type-II Hybrid Censored Data with Applications. Symmetry 2022, 14, 651. [Google Scholar] [CrossRef]
- Fatehi, Y.E.; Chhaya, D.S. Alpha Power Transformed Extended power Lindley Distribution. J. Stat. Theory Appl. 2022, 22, 1–18. [Google Scholar]
- Luca, L. Combination of measurements and the BLUE method. In Proceedings of the 12th Quark Confinement, Thessaloniki, Greece, 28 August–4 September 2017; Volume 137, p. 106. [Google Scholar] [CrossRef]
- Idika, E.; Johnson Ohakweb, O.; Osuc, B.O.; Chris, U. Onyemachid α-Power transformed transformed power function distribution with applications. Heliyon 2021, 7, e08047. [Google Scholar]
- Lloyd, E.H. Least-squares estimation of location and scale parameters using order statistics. Biometrika 1952, 39, 88–95. [Google Scholar] [CrossRef]
- Akaike, H. A New Look at the Statistical Model Identification. IEEE Trans. Automat. Contr. 1974, 19, 716–723. [Google Scholar] [CrossRef]
- Bhaumik, D.K.; Kapur, K.; Gibbons, R.D. Testing Parameters of a Gamma Distribution for Small Samples. Technometrics 2009, 51, 326–334. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).