You are currently viewing a new version of our website. To view the old version click .
Symmetry
  • Article
  • Open Access

13 August 2023

Asymmetric Right-Skewed Size-Biased Bilal Distribution with Mathematical Properties, Reliability Analysis, Inference and Applications

,
and
1
Department of Mathematics, Faculty of Science, Al al-Bayt University, Mafraq 25113, Jordan
2
Department of Mathematical Sciences, Umm Al-Qura University, Makkah 24382, Saudi Arabia
3
Department of Mathematical Sciences, College of Science, Princess Nourah bint Abdulrahman University, Riyadh 11671, Saudi Arabia
*
Author to whom correspondence should be addressed.
This article belongs to the Special Issue Symmetry in Probability Theory and Statistics

Abstract

Asymmetric distributions, as opposed to symmetric distributions, may be more resilient to extreme values or outliers. Furthermore, when data show substantial skewness, asymmetric distributions can shed light on the underlying processes or phenomena being investigated. In this direction, the size-biased Bilal distribution (SBBD) is suggested in this study as a generalization to the Bilal distribution. The length-biased and area-biased Bilal distributions are discussed in detail as two special cases. The main statistical properties of the distribution including the rth moment, coefficients of variation, skewness, kurtosis, moment generating function, incomplete moments, moments of residual life, harmonic mean, Fisher’s information, and the Rényi entropy as a measure of uncertainty are presented. Graphical representations of the cumulative distribution, probability density, odds, survival, hazard, reversed hazard rate, and cumulative hazard functions are presented for further explanation of the distribution behavior. In addition, the methods of moments and maximum likelihood estimates are taken into account for estimating the model parameters. A simulation study is carried out to see the efficiency of the maximum likelihood in terms of standard errors and bias. Real data sets of precipitation and myeloid leukemia patients are considered to show the practical significance of the suggested distributions as an alternative to some well-known distributions such as the Rama, Rani, Bilal, and exponential distributions. It is found that the size-biased Bilal distribution is right-skewed and has a superior fitting performance compared to the other distributions in this study.

1. Introduction

One of the most crucial objectives of a statistician is to describe real-world situations using probability distributions. In many real-world applications, including the fields of medicine, actuarial science, engineering, industry, and finance, modeling and interpreting lifetime data are crucial. This has led researchers to concentrate on creating families of probability distributions in recent decades. The size-biased distributions appear when the gathered data in actual life scenarios are assigned based on a weight function rather than randomly. Size-biased distribution refers to the distribution that results in sampling process selecting sample units with a probability that is proportional to some size measure of the unit. The concept of weighted distributions was first proposed by [1] to accurately model real data. Over the last 25 years, some fitting models for scanned data have been collected using the weighted distributions technique. Assume that the random variable X has a probability density function (pdf) f ( x ) ; then, the pdf of the weighted random variable X w is obtained using the following:
f w ( x ) = w ( x ) f ( x ) E [ w ( X ) ]
such that E [ w ( X ) ] exists and the normalization factor, w ( x ) , makes the overall probability equal to 1. The concept of length-biased sampling was first introduced by [2], and then [3] formulated and explored this idea broadly in the context of modeling statistical data when the conventional method of employing standard distributions was proven to be ineffective. Ref. [3] introduced a special case of the weighted distribution known as size-biased distribution of order t and if the weighted function has the form w ( x ) = x t , the pdf is obtained using the following:
f t ( x ) = x t f ( x ) E t ( X ) .
For t = 1 and t = 2 , the obtained distributions are called length-biased and area-biased distributions, respectively.
Furthermore, due to the importance of the weighted distributions, many researchers studied it in various cases. Ref. [4] talked about using weighted binomial distribution to model human families and determine the size of wildlife families. The relationship between initial reliability measures and size-biased distribution was outlined by [5]. Ref. [6] provided the theory of bivariate weighted distribution. Ref. [7] proposed the length-biased weighted Maxwell distribution. Ref. [8] suggested size-biased weighted transmuted Weibull distribution. Length and area-biased Maxwell distributions were introduced by [9]. Ref. [10] proposed size-biased Ishita distribution. Weighted power Lomax distribution and its length-biased form were proposed by [11]. For modeling skewed and heavy-tailed data, Ref. [12] presented the normal weighted inverse Gaussian distribution. Ref. [13] offered weighted generalized Quasi Lindley distribution. Ref. [14] suggested generalized weighted exponential distribution. Ref. [15] introduced weighted Gamma–Pareto distribution, and [16] investigated a modified weighted Pareto distribution of Type I.
Ref. [17] offered a new continuous lifetime distribution, called Bilal distribution (BD), which is demonstrated to belong with the class of novel renewal failure rates that are better than average. The Bilal distribution’s density function is always unimodal, with around 25% and 28% less skewness and kurtosis than the exponential distribution’s density function, respectively. The distribution function, qth quantiles, and failure rate function, on the other hand, are in compact form, and the distinct moments are expressed explicitly in terms of the exponential function. He defined the probability density function of BD as follows:
f B D ( x ; θ ) = 6 θ 1 e x θ e 2 x θ , x 0 , θ > 0
where E B D ( X ) = 5 6 θ , V B D ( X ) = 13 36 θ 2 , coefficient of variation C V = 0.72 , kurtosis K u = 6.443786982 , and the corresponding cumulative distribution function (CDF) is defined as follows:
F B D ( x ; θ ) = 1 3 2 e x θ e 2 x θ , x 0 , θ > 0
It is of interest to note here that base Bilal distribution has a one parameter and the suggested modifications each has one parameter with suitable weight functions which yields more flexibility in modeling a variety of real data sets for simple usage. The novelty of this research lies in combination the Bilal distribution with the weighted distribution to modify the Bilal distribution for modeling real data sets. To our knowledge, this is the first work on length-biased and area-biased Bilal distributions. The rest of this paper is organized as follows: The size-biased Bilal distribution and some of its properties are presented in Section 2. Section 3 addresses some certain distributional features of the suggested length-biased and area-biased Bilal distributions. In Section 4, the applicability of the proposed models is demonstrated based on two real data sets. Finally, the paper is concluded in Section 5.

2. Structure of the SBBD

This section introduces the size-biased Bilal distribution of the order t, expressed using Equation (2). Then, the length and area-biased Bilal distributions are derived as two special cases. The pdf of the size-biased Bilal random variable, X S B B D , of order t is expressed using
f S B B D ( x ; θ ) = x t f B D ( x ; θ ) E B D ( X t ) = 6 t + 1 θ t 1 Γ ( t + 1 ) 3 t + 1 2 t + 1 x t e 2 x θ 1 e x θ ,   x > 0 , θ > 0 ,
and the corresponding CDF is as follows:
F S B B D ( x ; θ ) = 2 t + 1 Γ ( t + 1 ) Γ t + 1 , 3 x θ + 3 t + 1 Γ t + 1 , 2 x θ t Γ ( t ) 2 t + 1 3 t + 1 Γ ( t + 1 ) , t = 1 , 2 , ; x , θ > 0 ,
where Γ ( t ) = 0 z t 1 e z d z is the gamma function and Γ ( a , t ) = t z a 1 e z d z is the incomplete gamma function. Figure 1 shows the pdf plots of SBBD for various schemes, where θ = 0.9 and 0.3 with t = 1, 2, 3, 4, 5, 6, 7. It is clear that SBBD can take various shapes depending on the value of the parameters θ and t. For example, when t = 1 , SBBD is skewed to the right for both values θ = 0.9 , 0.3 , increasing and then decreasing as in both the plots, and is semi-symmetric for t = 7 with θ = 0.3 . These different shapes give the distribution more flexibility in modeling some type of data.
Figure 1. The plots of SBBD for θ = 0.9 (left) and θ = 0.3 (right) with t = 1 ,   2 ,   3 , 4, 5, 6, 7.
Theorem 1.
The rth moment of the size-biased Bilal distributed random variable, XSBBD, is obtained using the following equation:
E X r = θ r 2 r + t + 1 3 r + t + 1 Γ ( r + t + 1 ) 6 r 2 t + 1 3 t + 1 Γ ( t + 1 ) , θ > 0 , r + t > 2
Proof. 
To prove the theorem, the rth moment is defined as follows:
E X r = 0 x r f S B B D ( x ; θ ) d x = 0 x r 6 t + 1 θ t 1 Γ ( t + 1 ) 3 t + 1 2 t + 1 x t e 2 x θ 1 e x θ d x = 6 t + 1 θ t 1 Γ ( t + 1 ) 3 t + 1 2 t + 1 0 x r + 1 e 2 x θ d x 0 x r + t e 3 x θ d x = 6 t + 1 θ t 1 Γ ( t + 1 ) 3 t + 1 2 t + 1 Γ ( r + t + 1 ) 2 θ r + t + 1 Γ ( r + t + 1 ) 3 θ r + t + 1 = 6 t + 1 θ t 1 Γ ( r + t + 1 ) θ r + t + 1 Γ ( t + 1 ) 3 t + 1 2 t + 1 3 r + t + 1 2 r + t + 1 6 r + t + 1 = θ r 2 r + t + 1 3 r + t + 1 ( r + t ) ! 6 r 2 t + 1 3 t + 1 t ! .
For r = 1 , 2 , 3 , 4 in Equation (7), we obtain the first four moments, respectively, for the origin of X:
E X = θ 2 t + 2 3 t + 2 ( t + 1 ) ! 6 2 t + 1 3 t + 1 t ! ,   E X 2 = θ 2 2 t + 3 3 t + 3 ( t + 2 ) ! 36 2 t + 1 3 t + 1 t ! ,
E X 3 = θ 3 2 t + 4 3 t + 4 ( t + 3 ) ! 216 2 t + 1 3 t + 1 t ! ,   E X 4 = θ 4 2 t + 5 3 t + 5 ( t + 4 ) ! 1296 2 t + 1 3 t + 1 t ! .
Theorem 2.
If X has a SBBD, then the moment generating functions of XSBBD is obtained using the following equation:
M X ( w ) = 6 t + 1 3 t + 1 2 t + 1 2 θ w ( t + 1 ) 3 θ w ( t + 1 ) , w < 3 θ , t > 2 .
Proof. 
The proof is similar to the proof of Theorem (1), considering the following:
M X ( w ) = 0 e w x f S B B D ( x ; θ ) d x .
Theorem 3.
The rth incomplete moment of the SBBD is defined as follows:
E X I M S B B D r = 3 r + t + 1 Γ r + t + 1 , 2 h θ Γ ( r + t + 1 ) + 2 r + t + 1 Γ ( r + t + 1 ) Γ r + t + 1 , 3 h θ 2 t + 1 3 t + 1 Γ ( t + 1 ) .
Proof. 
Recall the pdf in (5), the incomplete rth moment of the SBBD can be proved as follows:
E X I M L B B D r = 0 h x r f L B B D ( x ; θ ) d x = 6 t + 1 θ t 1 Γ ( t + 1 ) 3 t + 1 2 t + 1 0 h x r + t e 2 x θ e 3 x θ d x = 6 t + 1 θ t 1 Γ ( t + 1 ) 3 t + 1 2 t + 1 θ 2 r t 1 h r + t h θ r t Γ ( r + t + 1 ) Γ r + t + 1 , 2 h θ θ 3 r t 1 h r + t h θ r t Γ ( r + t + 1 ) Γ r + t + 1 , 3 h θ = 6 t + 1 Γ ( t + 1 ) 3 t + 1 2 t + 1 2 r t 1 Γ ( r + t + 1 ) Γ r + t + 1 , 2 h θ 3 r t 1 Γ ( r + t + 1 ) Γ r + t + 1 , 3 h θ = 3 r + t + 1 Γ r + t + 1 , 2 h θ Γ ( r + t + 1 ) + 2 r + t + 1 Γ ( r + t + 1 ) Γ r + t + 1 , 3 h θ 2 t + 1 3 t + 1 Γ ( t + 1 ) .
Rényi entropy is a statistical mechanics and information theory concept that gauges how random or uncertain a system is. The Rényi entropy of the random variable X as defined by [18] is expressed using H α ( X ) = 1 1 α log 0 f ( x ) α d x ,   α > 0 , α 1 .
Theorem 4.
The Rényi entropy of the SBBD is obtained using the following:
H α X S B B D = 1 1 α log 6 t + 1 θ t 1 Γ ( t + 1 ) 3 t + 1 2 t + 1 α j = 0 α ( 1 ) α j α j ( α t ) ! 3 α j θ ( α t + 1 ) .
Proof. 
The Rényi entropy of the size-biased distributed random variable X can be derived as follows:
H α X L B B D = 1 1 α log 0 6 t + 1 θ t 1 Γ ( t + 1 ) 3 t + 1 2 t + 1 x t e 2 x θ 1 e x θ α d x = 1 1 α log 6 t + 1 θ t 1 Γ ( t + 1 ) 3 t + 1 2 t + 1 α 0 x α t e 2 α x θ 1 e x θ α d x = 1 1 α log 6 t + 1 θ t 1 Γ ( t + 1 ) 3 t + 1 2 t + 1 α 0 x α t e 2 α x θ j = 0 α ( 1 ) α j α j e x θ α j d x = 1 1 α log 6 t + 1 θ t 1 Γ ( t + 1 ) 3 t + 1 2 t + 1 α j = 0 α ( 1 ) α j α j 0 x α t e 2 α x θ e x θ α j d x = 1 1 α log 6 t + 1 θ t 1 Γ ( t + 1 ) 3 t + 1 2 t + 1 α j = 0 α ( 1 ) α j α j ( α t ) ! 3 α j θ ( α t + 1 ) .

3. Special Cases of the SBBD

In this section, by setting  t = 1 , 2 in Equation (5), we obtain the pdfs of the suggested length-biased Bilal distribution (LBBD) and area-biased Bilal distribution (ABBD), respectively:
f L B B D ( x ; θ ) = 36 5 θ 2 x x e x θ e 2 x θ , x > 0 , θ > 0 ,
f A B B D ( x ; θ ) = 108 19 θ 3 x 2 x 2 e x θ e 2 x θ , x > 0 , θ > 0 .
For various distribution parameter values, the pdf plots of the LBBD and ABBD are shown in Figure 2. The pdfs of the recommended distributions are skewed to the right and become more flat as the values of θ increase. Furthermore, depending on the parameter values, the pdfs of the ABBD and LBBD can display a variety of behaviors.
Figure 2. The pdf plots of the LBBD and ABBD when θ = 1 , 2 , 3 , 4 , 5 , 6 , 7 .
The CDFs of the LBBD and ABBD, respectively, are expressed using the following:
F L B B D ( x ; θ ) = 1 9 e 2 x θ ( θ + 2 x ) 4 e 3 x θ ( θ + 3 x ) 5 θ ,
F A B B D ( x ; θ ) = 1 27 19 θ 2 e 2 x θ θ 2 + 2 x 2 + 2 θ x + 4 19 e 3 x θ 9 x 2 θ 2 + 6 x θ + 2 .
Some of the CDFs plots of the LBBD and ABBD when θ = 1 , 2 , 3 , 4 , 5 , 6 , 7 are presented in Figure 3.
Figure 3. The CDFs plots of the LBBD and ABBD when θ = 1, 2, 3, 4, 5, 6, 7.

3.1. Moments and Related Measure

The moments and related measures, coefficients of variation (Cv), skewness (Sk), and kurtosis (Ku), the incomplete moments and moment of the residual lifetime, moment generating function, reliability functions of the proposed distributions are all described in this section in closed expressions. Also, the parameters estimation, Fisher’s information and entropies, as well as the order statistics are presented.
The rth moments of the LBBD and ABBD by setting t = 1 , 2 in Equation (7), respectively, are obtained using the following:
E X L B B D r = 1 5 θ 6 r 3 r + 2 2 r + 2 Γ ( r + 2 )   and   E X A B B D r = 1 38 27 2 r 8 3 r θ r Γ ( r + 3 ) ,
for θ > 0 and r = 1 , 2 , .
An alternative formulation of a real-valued random variable’s probability distribution is the moment-generating function (mgf). The mgf of the LBBD and ABBD, respectively, are expressed using the following:
M X L B B D ( w ) = 36 ( 2 θ w 5 ) 5 θ 2 w 2 5 θ w + 6 2   and   M X A B B D ( w ) = 216 19 1 ( θ w 3 ) 3 1 ( θ w 2 ) 3 .
The Lorenz ([19]) and Benforroni [20] curves are used to measure inequality using the incomplete moments of a probability distribution. The rth incomplete moments of the LBBD and ABBD from Theorem (2) by setting t = 1 , 2 , respectively, are defined as follows:
E X I M L B B D r = 0 h x r f L B B D ( x ; θ ) d x = 1 5 θ 6 r 2 r + 2 Γ r + 2 , 3 h θ ( r + 1 ) ! + 3 r + 2 ( r + 1 ) ! Γ r + 2 , 2 h θ ,
and
E X I M A B B D r = 0 h x r f A B B D ( x ; θ ) d x = 4 19 θ 3 r 3 2 r + 3 ( r + 2 ) ! Γ r + 3 , 2 h θ + Γ r + 3 , 3 h θ ( r + 2 ) ! .
The residual life is the time after a component lives up to time ( t 0 ) until the time of failure, which is determined by the conditional random variable X t | X > t . Hence, the rth moment of the residual lifetime (MRL) of the LBBD and ABBD, respectively, are distinct:
E X M R L L B B D r = 1 S L B B D ( s ; θ ) s x s r f L B B D ( x ; θ ) d x = 1 S L B B D ( s ; θ ) θ r 1 Γ ( r + 1 ) 5 6 r e 3 s θ 3 r + 2 e s / θ ( θ + θ r + 2 s ) 2 r + 2 ( θ + θ r + 3 s ) ,
E X M R L A B B D r = 1 S A B B D ( s ; θ ) s x s r f A B B D ( x ; θ ) d x = 1 S A B B D ( s ; θ ) θ r 2 Γ ( r + 1 ) 19 3 r 2 r + 1 e 3 s θ 3 r + 3 e s / θ θ 2 ( r + 1 ) ( r + 2 ) + 4 θ ( r + 1 ) s + 4 s 2 2 r + 3 θ 2 ( r + 1 ) ( r + 2 ) + 6 θ ( r + 1 ) s + 9 s 2 .
The harmonic means of the LBBD and ABBD are defined as follows:
H L B B D = 5 θ 6   and   H A B B D = 19 θ 15 .
The rth quartiles of the LBBD and ABBD, respectively, are the solutions for Q r in the following equations:
e 3 Q r θ 5 θ 9 e Q r / θ θ + 2 Q r 4 θ + 3 Q r = r 4 ,
and
e 3 Q r θ 19 θ 2 27 e Q / θ θ 2 + 2 Q r 2 + 2 θ Q r 4 2 θ 2 + 9 Q r 2 + 6 θ Q r = r 4 ,   r = 1 , 2 , 3 ,
where Q 1 , Q 2 and Q 3 are respectively called the first, second and third quartiles.
The coefficient of skewness is used to determine the skewness of the distribution and is expressed as S k = E X 3 3 μ σ 2 μ 3 σ 3 . For the LBBD and ABBD, correspondingly, the coefficients of skewness are expressed as S k L B B D = 1.23386 and S k A B B D = 0.514944 .
The top of the distribution’s flatness is measured by the coefficient of kurtosis, which can be defined as K u = E X 4 4 μ E X 3 + 6 E X 2 σ 2 + 3 E X 4 σ 4 . The LBBD and ABBD has the coefficient of kurtosis K u L B B D = 5.35547 and K u A B B D = 1.07703 , respectively. The Cv is defined as C v = σ μ و and for the LBBD and ABBD, they are expressed as C v L B B D = 0.591959 and C v A B B D = 4.79273 , respectively.
Table 1 presents the values of the mean and standard deviation (SD) of the LBBD and ABBD for different selections of θ . According to Table 1, as the value of θ rises, the mean and standard deviation values of the LBBD and ABBD rise as well.
Table 1. The mean and standard deviation of the LBBD and ABBD for various values of θ.

3.2. Reliability Functions

The survival function is frequently used to predict the probability that an event will not occur after a specific amount of time or threshold in domains like survival analysis and reliability engineering, among others. The survival function is defined as S ( x ) = 1 F ( x ) , and for the LBBD and ABBD, respectively, it is characterized as
S L B B D ( x ; θ ) = 9 e x / θ ( θ + 2 x ) 4 ( θ + 3 x ) 5 θ e 3 x θ ,
and
S A B B D ( x ; θ ) = 27 e x / θ 2 x 2 + 2 x θ + θ 2 4 9 x 2 + 6 x θ + 2 θ 2 19 θ 2 e 3 x θ .
Figure 4 shows the survival functions plots of the LBBD and ABBD for θ = 0.1 , 0.2 , , 0.7 . The plots reveal that the survival functions curves decreases and are skewed to the right. The distances between each two curves depend on the parameters’ values.
Figure 4. The survival function plots of the LBBD and ABBD for parameter choices.
In survival analysis and reliability engineering, the hazard function is a basic idea. It is utilized to simulate the instantaneous probability of an event (such as death, failure, or the occurrence of a specific event) occurring at a given time, given that it has not yet happened at that time. It is the ratio between the pdf and survival function of a random variable X,  H ( x ) = f ( x ) 1 F ( x ) . For the LBBD and ABBD, the hazard functions, respectively, are expressed using
H L B B D ( x ; θ ) = 36 x e x / θ 1 9 θ e x / θ ( θ + 2 x ) 4 θ ( θ + 3 x ) ,
and
H A B B D ( x ; θ ) = 108 x 2 e x / θ 1 27 θ e x / θ θ 2 + 2 x 2 + 2 θ x 4 θ 2 θ 2 + 9 x 2 + 6 θ x .
Figure 5 presents the hazard functions of the LBBD and ABBD for some parameter choices, and it is clear that the hazard functions curves are increasing with different ranges depending on the parameter value.
Figure 5. The hazard functions plot of the LBBD and ABBD for some parameter choices.
The reversed hazard rate (RH) function is distinct as the ratio between the life probability density to its distribution function as R H ( x ) = f ( x ) F ( x ) . The RH functions for the LBBD and ABBD, respectively, are defined as
R H L B B D ( x ; θ ) = 36 x e x / θ 1 θ 5 θ e 3 x θ 9 e x / θ ( θ + 2 x ) + 4 ( θ + 3 x ) ,
and
R H A B B D ( x ; θ ) = 108 x 2 e x / θ 1 19 θ 3 e 3 x θ 27 θ e x / θ θ 2 + 2 x 2 + 2 θ x + 4 θ 2 θ 2 + 9 x 2 + 6 θ x .
Figure 6 displays some plots of the reversed hazard functions of the LBBD and ABBD for selected values of the parameters as θ = 0.1 , 0.2 , , 0.7 , where the reversed hazard functions are decreasing for these parameter values.
Figure 6. The LBBD and ABBD reversed hazard functions plots for various values of θ.
The odds function is obtained as the ratio between F ( x ) and S ( x ) as O ( x ) = F ( x ) S ( x ) . The LBBD and ABBD have the odds functions, respectively, defined by the following:
O L B B D ( x ; θ ) = 5 θ e 3 x θ 9 e x / θ ( θ + 2 x ) + 4 ( θ + 3 x ) 9 e x / θ ( θ + 2 x ) 4 ( θ + 3 x ) ,
O A B B D ( x ; θ ) = 4 2 θ 2 + 9 x 2 + 6 θ x 27 e x / θ θ 2 + 2 x 2 + 2 θ x + 19 θ 2 e 3 x θ 27 e x / θ θ 2 + 2 x 2 + 2 θ x 4 2 θ 2 + 9 x 2 + 6 θ x .
Some plots of the odds functions of the LBBD and ABBD for θ = 0.1 , 0.2 , , 0.7 are presented in Figure 7, which indicate that the curves are increasing for all parameter values.
Figure 7. Plots of the LBBD and ABBD odds functions at some values of θ.
The cumulative hazard (CH) function of a random variable X from a univariate continuous distribution is defined as C H ( x ) = l n 1 F ( x ) . The CH for the LBBD and ABBD, respectively, are expressed as
C H L B B D ( x ; θ ) = l n 9 e x / θ ( θ + 2 x ) 4 ( θ + 3 x ) 5 θ e 3 x θ ,
and
C H A B B D ( x ; θ ) = l n 27 e x / θ 2 x 2 + 2 x θ + θ 2 4 9 x 2 + 6 x θ + 2 θ 2 19 θ 2 e 3 x θ .
Figure 8 shows some plots of the LBBD and ABBD cumulative hazard functions at various values of θ = 0.1 , 0.2 , , 0.7 . It is obvious that the odds function of both distributions is increasing and its values are larger for small values of θ .
Figure 8. The plots of the LBBD and ABBD cumulative hazard functions for θ = 0.1 , 0.2 , , 0.7 .

3.3. Parameters Estimation

The method of moments (MOMs) and maximum likelihood technique are used in this section to estimate the LBBD and ABBD parameters. Let X 1 , X 2 , , X n be a random sample of size n selected from a pdf f ( x ) and CDF F ( x ) . The method of moments estimators of the distributions parameter are θ ˜ L B B D = 15 19 X ¯ and θ ˜ A B B D = 38 65 X ¯ , where X ¯ = i = 1 n x i n is the sample mean.
To find the maximum likelihood estimators of the distribution parameter, let x 1 , x 2 , , x n be the sample observations of X 1 , X 2 , , X n from the LBBD and ABBD. The log-likelihood functions are, respectively, obtained using the following:
ln ( L ) L B B D = n log 36 5 + n log 1 θ 2 2 i = 1 n x i θ + i = 1 n log x i + i = 1 n log 1 e x i θ ,
ln ( L ) A B B D = n log 108 19 + n log 1 θ 3 2 i = 1 n x i θ + i = 1 n log x i 2 + i = 1 n log 1 e x i θ .
The partial derivatives of these equations with respect to the parameter θ are obtained using the following:
ln ( L ) L B B D θ = i = 1 n 2 x i θ 2 i = 1 n x i e x i θ θ 2 1 e x i θ 2 n θ ,   l n ( L ) A B B D θ = i = 1 n 2 x i θ 2 i = 1 n x i e x i θ θ 2 1 e x i θ 3 n θ .
The exact solution of these nonlinear equations is not easy, so one may maximize them by using some optimization approaches as the Newton–Raphson method. Table 2 provides some values of the MLE θ ^ A B B D and θ ^ L B B D , and the corresponding standard errors (SE) for various distributions parameter with sample sizes n = 40, 80, 120, 160, 200, 240, and 500.
Table 2. The MLE θ ^ A B B D and θ ^ L B B D with SE for selected values of θ and n.
Table 2 demonstrates that the standard errors values for both MLE estimators θ ^ A B B D and θ ^ L B B D , which shows that distributions parameters are decreasing for larger sample sizes for fixed parameter values.

3.4. Fisher’s Information and Entropies

This subsection introduces the Fisher’s information (FI) as well as the Rényi entropy of the LBBD and ABBD.
Theorem 5.
The Fisher’s information of ABBD and LBBD, respectively, are as follows:
F I A B B D ( θ ) = 3.77457 θ 2   and   F I L B B D ( θ ) = 2.85636 θ 2 .
Proof. 
Take the log of the ABBD pdf as
log f L B B D ( x ; θ ) = log 36 5 θ 2 + log ( x ) + log 1 e x θ 2 x θ .
The first and second derivatives of log f L B B D ( x ; θ ) with respect to θ respectively are as follows:
θ log f L B B D ( x ; θ ) = x θ 2 1 1 e x / θ + 2 2 θ ,
2 θ 2 log f L B B D ( x ; θ ) = 2 θ 3 θ + x 1 e x / θ 1 2 x 2 e x / θ θ 4 e x / θ 1 2 .
Hence,
F I L B B D ( θ ) = E 2 θ 2 l o g f L B B D ( x ; θ ) = 36 5 θ 5 0 2 x θ + x 1 e x / θ 1 2 x 3 e x / θ θ e x / θ 1 2 1 e x θ e 2 x θ d x = 2195 24 π 4 50 θ 2 = 2.85636 θ 2 .
Similarly,
F I A B B D ( θ ) = E 2 θ 2 l o g f A B B D ( x ; θ ) = 24 109 108 Z e t a 5 19 θ 2 = 3.77457 θ 2 .
Some values of the Fisher’s information for both of LBBD and ABBD are presented in Table 3 for selected values of the parameter θ .
Table 3. Fisher’s information values of the LBBD and ABBD for some values of θ.
Table 3 reveals that the values decrease as the distribution’s parameter values rise, and they eventually approach zero for high parameter values.
According to information theory, a random variable’s entropy measures the average amount of uncertainty, information, or surprise that could result from its potential outcomes. From Equation (10), the Rényi entropies of the LBBD and ABBD, respectively, are obtained using the following:
H α X L B B D = 1 1 α log 36 5 θ 2 α j = 0 α α ! ( 1 ) α + j α j 3 α j θ α + 1 ,
H α X A B B D = 1 1 α log 108 19 θ 3 α j = 0 α ( 2 α ) ! ( 1 ) α + j α j 3 α j θ 2 α + 1 .
Table 4 presents the values of the suggested Rényi entropies of the distributions. It is evident that when the values of α for θ = 3.8 increase, the Rényi entropies values decrease. Also, for fixed α , the Rényi entropies values of θ = 3 are less than their counterparts when θ = 8 .
Table 4. H α X A B B D values with θ = 3.8 for some values of α .

3.5. Order Statistics

In statistical theory, order statistics are essential, especially in the idea of extreme value. Consider a random sample X 1 , X 2 ,   , X m of size m selected from f ( x ) with F ( x ) . Let X ( 1 : m ) ,   X ( 2 : m ) , , X ( m : m ) be the corresponding order statistics of the sample. The jth order statistics pdf is defined as follows:
f ( j : m ) ( x ) = m ! ( j 1 ) ! ( m j ) ! × F ( x ) j 1 1 F ( x ) m j f ( x ) ,   for j = 1 , 2 , , m .
For simplicity, we use 1 F ( x ) m j = a = 0 m j a ( 1 ) a [ F ( x ) ] a . Now, Equation (15) will be
f ( j : m ) ( x ) = m ! ( j 1 ) ! ( m j ) ! f ( x ) a = 0 m j a ( 1 ) a [ F ( x ) ] a + j 1 .
Therefore, the pdf for the LBBD and ABBD jth order statistics, respectively, are obtained using
f L B B D ( j : m ) ( x ; θ ) = m ! ( j 1 ) ! ( m j ) ! 36 5 θ 2 x 1 e x θ e 2 x θ a = 0 m j a ( 1 ) a × 1 9 5 θ e 2 x θ 2 x + θ 4 e 3 x θ 3 x + θ a + j 1 ,
and
f A B B D ( j : m ) ( x ; θ ) = 108 m ! 19 θ 3 ( j 1 ) ! ( m j ) ! x 2 e 2 x θ 1 e x θ a = 0 m j a ( 1 ) a × 1 27 19 θ 2 e 2 x θ θ 2 + 2 x 2 + 2 θ x + 4 19 e 3 x θ 9 x 2 θ 2 + 6 x θ + 2 a + j 1 .
For j = 1 , m in Equations (16) and (17), for both LBBD and ABBD, respectively, we can obtain the pdfs of the minimum (min) and maximum (max) order statistics. Figure 9 shows three-dimensional plots of the pdf of the min and max order statistics of the ABBD and LBBD when m = 7 .
Figure 9. Three-dimensional plots of the minimum and maximum order statistics pdfs of the ABBD and LBBD for m = 7.

4. Applications to Real Data

In this section, we demonstrate the applicability of the proposed ABBD and LBBD distributions using two real data sets.
The first data consists of thirty consecutive measurements of March precipitation (in inches) throughout a 30-year period in Minneapolis. The data set values are 0.32, 0.47, 0.52, 0.59, 0.77, 0.81, 0.81, 0.9, 0.96, 1.18, 1.20, 1.20, 1.31, 1.35, 1.43, 1.51, 1.62, 1.74, 1.87, 1.89, 1.95, 2.05, 2.10, 2.20, 2.48, 2.81, 3.0, 3.09, 3.37, and 4.75. The same data are considered by [21,22].
The second one consists the survival times (in months) of 20 acute myeloid leukemia patients discussed by [23], and the observations are 2.226, 2.113, 3.631, 2.473, 2.720, 2.050, 2.061, 3.915, 0.871, 1.548, 2.746, 1.972, 2.265, 1.200, 2.967, 2.808, 1.079, 2.353, 0.726, and 1.958.
The descriptive statistics for both data sets are presented in Table 5, and it is clear that both data sets are skewed to the right. The density, histogram, box and total time on test (TTT) plots for both data sets are shown in Figure 10. The P-P plots of the fitted distributions for the both data sets are displayed in Figure 11. The estimated pdfs of the suggested and competitive distributions are shown in Figure 12.
Table 5. Descriptive statistics of the data sets.
Figure 10. The density, TTT, box and histogram plots for first (green) and second (blue) data sets.
Figure 11. The P-P plots of the fitted distributions for the first (green) and second (blue) data sets.
Figure 12. The estimated pdfs of the competitive distributions for the first (green) and second (blue) data sets.
Since the proposed distributions in this study have just one parameter, we compared them to certain existing distributions that likewise have just one parameter for a fair comparison.
In order to explain how adaptable LBBD and ABBD are, we are going to evaluate them in contrast to a variety of well-recognized models, including Rani ([24]), Exponential, Rama ([25]) and Bilal distributions with pdfs, which are respectively expressed as follows:
o
f ( x ; θ ) = θ 5 θ 5 + 24 θ + x 4 e θ x , x > 0 , θ > 0 ,
o
f x ; θ = θ e θ x , x 0 , θ > 0 ,
o
f x ; θ = θ 4 θ 3 + 6 1 + x 3 e θ x , x > 0 , θ > 0 ,
o
f ( x ; θ ) = 6 θ 1 e x θ e 2 x θ , x 0 , θ > 0 .
We used the maximum likelihood method to estimate these distributions parameters and fit them to the data sets. To compare the results, we used the negative maximized log-likelihood values ( l ), the Hannan–Quinn information criterion (HQIC), the Bayesian information criterion (BIC), the Akaike information criterion (AIC), and the Consistent Akaike information criterion (CAIC) defined as AIC = 2 l + 2 κ , CAIC = 2 l + 2 κ n n κ 1 , BIC = 2 l + κ ln ( n ) , HQIC = 2 κ ln ln ( n ) 2 l , where n is the sample size and κ is the number of parameters. The results are displayed in Table 6 and the best model is indicated by lower AIC, BIC, CAIC, HQIC values for goodness of statistics. For the first data set, the LBBD fits the data better than other distributions and similarly, the ABBD fits the second data set due to the lowest values of these measures. Therefore, both suggested models are considered as best fitted models as compared to the competitors considered in this study. These results are supported by Figure 10, Figure 11 and Figure 12 where it is seen that the proposed models fit both data sets well.
Table 6. The AIC, CAIC, BIC, HQIC, and K-S values for the data sets.

5. Conclusions

In this study, size-biased Bilal distribution is suggested as a modification of the base Bilal distribution. Two special cases of the SBBD are derived, LBBD and ABBD. The survival function, hazard function, reversed hazard rate function, odds function, and cumulative hazard function of the LBBD and ABBD are obtained. Further statistical features include moments, incomplete moments, moment generating function, and the rth moment. Fisher’s information and entropies, order statistics and harmonic mean are presented. Also, the method of moments and the maximum likelihood estimation methods are used for estimating the unknown distributions parameters. Finally, using two real data sets, the suggested model’s applicability is demonstrated. According to the findings in this study, the ABBD and LBBD models may be deemed as competitive models with superior capacity than the other models studied in this paper. As expected, the new distributions are probable to garner interest from academics in other fields. In the future, the ranked set sampling techniques can be used to estimate the distribution parameters (see [26]).

Author Contributions

Conceptualization, A.I.A.-O., R.A. and G.A.; Methodology, A.I.A.-O. and R.A.; Software, A.I.A.-O.; Formal analysis, A.I.A.-O., R.A. and G.A.; Resources, G.A.; Writing—original draft, A.I.A.-O., R.A. and G.A.; Writing—review & editing, A.I.A.-O., G.A. and R.A.; Visualization, A.I.A.-O.; Funding acquisition, G.A. All authors have read and agreed to the published version of the manuscript.

Funding

Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2023R226), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

Data Availability Statement

The data used to support the findings of this study are included within the article.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Fisher, R.A. The effect of methods of ascertainment upon the estimation of frequencies. Ann. Eugen. 1934, 6, 13–25. [Google Scholar] [CrossRef]
  2. Cox, D.R. Renewal Theory; Metrheun’s Monograph; Barnes and Noble, Inc.: New York, NY, USA, 1962. [Google Scholar]
  3. Rao, C.R. On discrete distributions arising out of methods of ascertainment. Sankhya Indian J. Stat. Ser. A 1965, 27, 311–324. [Google Scholar]
  4. Patil, G.P.; Rao, C.R. Weighted distributions and size-biased sampling with applications to wildlife populations and human families. Biometrics 1978, 34, 179–189. [Google Scholar] [CrossRef]
  5. Gupta, R.C.; Keating, J.P. Relations for reliability measures under length biased sampling. Scand. J. Stat. 1986, 13, 49–56. [Google Scholar]
  6. Arnold, B.C.; Nagaraja, H.N. On some properties of bivariate weighted distributions. Commun. Stat.-Theory Methods 1991, 20, 1853–1860. [Google Scholar] [CrossRef]
  7. Modi, K.; Gill, V. Length-biased weighted Maxwell distribution. Pak. J. Stat. Oper. Res. 2015, 2015, 465–472. [Google Scholar] [CrossRef][Green Version]
  8. Mobarak, M.A.; Nofal, Z.; Mahdy, M. On size-biased weighted transmuted Weibull distribution. Int. J. Adv. Res. Comput. Sci. Softw. Eng. 2017, 7, 317–325. [Google Scholar] [CrossRef]
  9. Sharma, V.K.; Dey, S.; Singh, S.K.; Manzoor, U. On length and area-biased Maxwell distributions. Commun. Stat.-Simul. Comput. 2018, 47, 1506–1528. [Google Scholar] [CrossRef]
  10. Al-Omari, A.I.; Al-Nasser, A.D.; Ciavolino, E. A size-biased Ishita distribution and application to real data. Qual. Quant. 2019, 53, 493–512. [Google Scholar] [CrossRef]
  11. Hassan, A.S.; Almetwally, E.M.; Khaleel, M.A.; Nagy, H.F. Weighted power Lomax distribution and its length biased version: Properties and estimation based on censored samples. Pak. J. Stat. Oper. Res. 2021, 17, 343–356. [Google Scholar] [CrossRef]
  12. Maina, C.B.; Weke, P.G.; Ogutu, C.A.; Ottieno, J.A. A Normal weighted inverse Gaussian distribution for skewed and heavy-tailed data. Appl. Math. 2022, 13, 163–177. [Google Scholar] [CrossRef]
  13. Benchiha, S.; Al-Omari, A.I.; Alotaibi, N.; Shrahili, M. Weighted generalized Quasi Lindley distribution: Different methods of estimation, applications for COVID-19 and engineering data. AIMS Math 2021, 6, 11850–11878. [Google Scholar] [CrossRef]
  14. Abbas, S.; Zaniab, S.; Mehmood, O.; Ozal, G.; Shahbaz, M.Q. A new generalized weighted exponential distribution: Properties and applications. Thail. Stat. 2022, 20, 271–283. [Google Scholar]
  15. Dar, A.A.; Ahmed, A.; Reshi, J.A. Weighted Gamma-Pareto distribution and its application. Pak. J. Stat. 2020, 36, 287–304. [Google Scholar]
  16. Sahmran, M.A. Modified weighted Pareto distribution type I (MWPDTI). Baghdad Sci. J. 2020, 17, 0869. [Google Scholar] [CrossRef]
  17. Abd-Elrahman, A.M. Utilizing ordered statistics in lifetime distributions production: A new lifetime distribution and applications. J. Probab. Stat. Sci. 2013, 11, 153–164. [Google Scholar]
  18. Rényi, A. On measures of entropy and information. In Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, Volume 1: Contributions to the Theory of Statistics; The Regents of the University of California: Oakland, CA, USA, 1961. [Google Scholar]
  19. Lorenz, M. Methods of measuring the concentration of wealth. Publ. Am. Stat. Assoc. 1905, 9, 209–219. [Google Scholar]
  20. Bonferroni, C. Elementi di Statistica Generale; Libreria Seeber: Firenze, Italy, 1930. [Google Scholar]
  21. Hinkley, D. On quick choice of power transformation. J. R. Stat. Soc. Ser. C 1977, 26, 67–69. [Google Scholar] [CrossRef]
  22. Jamal, F.; Arslan Nasir, M.; Ozel, G.; Elgarhy, M.; Mamode Khan, N. Generalized inverted Kumaraswamy generated family of distributions: Theory and applications. J. Appl. Stat. 2019, 46, 2927–2944. [Google Scholar] [CrossRef]
  23. Afify, A.Z.; Nofal, Z.M.; Butt, N.S. Transmuted complementary Weibull geometric distribution. Pak. J. Stat. Oper. Res. 2014, 10, 435–454. [Google Scholar] [CrossRef]
  24. Shanker, R. Rani distribution and its application. Biom. Biostat. Int. J. 2017, 6, 1–14. [Google Scholar] [CrossRef][Green Version]
  25. Shanker, R. Rama distribution and its application. Int. J. Stat. Appl. 2017, 7, 26–35. [Google Scholar]
  26. Al-Omari, A.I.; Bouza, C.N. Review of ranked set sampling: Modifications and applications. Rev. Investig. Oper. 2014, 35, 215–240. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.