Next Article in Journal
Bearing Capacity Analysis of the Weak Basement, Progressive Destruction Analysis, and Evaluation of the Dump on an Inclined Strip Section Using the Upper-Limit Method: A Case Study in an Anonymous Open-Cast Coal Mine
Previous Article in Journal
Research on Sustainable Mining and Water Prevention in Large Open-Pit Water Deposits
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Novel Odd Beta Prime-Logistic Distribution: Desirable Mathematical Properties and Applications to Engineering and Environmental Data

by
Ahmad Abubakar Suleiman
1,2,*,
Hanita Daud
1,*,
Narinderjit Singh Sawaran Singh
3,
Mahmod Othman
4,
Aliyu Ismail Ishaq
5 and
Rajalingam Sokkalingam
1
1
Fundamental and Applied Sciences Department, Universiti Teknologi PETRONAS, Seri Iskandar 32610, Malaysia
2
Department of Statistics, Aliko Dangote University of Science and Technology, Wudil 713281, Nigeria
3
Faculty of Data Science and Information Technology, INTI International University, Persiaran Perdana BBN Putra Nilai, Nilai 71800, Malaysia
4
Department of Information System, Universitas Islam Indragiri, Tembilahan 29212, Indonesia
5
Department of Statistics, Ahmadu Bello University, Zaria 810107, Nigeria
*
Authors to whom correspondence should be addressed.
Sustainability 2023, 15(13), 10239; https://doi.org/10.3390/su151310239
Submission received: 20 April 2023 / Revised: 13 June 2023 / Accepted: 15 June 2023 / Published: 28 June 2023

Abstract

:
In parametric statistical modeling, it is important to construct new extensions of existing probability distributions (PDs) that can make modeling data more flexible and help stakeholders make better decisions. In the present study, a new family of probability distributions (FPDs) called the odd beta prime generalized (OBP-G) FPDs is proposed to improve the traditional PDs. A new PD called the odd beta prime-logistic (OBP-logistic) distribution has been developed based on the developed OBP-G FPDs. Some desirable mathematical properties of the proposed OBP-logistic distribution, including the moments, moment-generating function, information-generating function, quantile function, stress–strength, order statistics, and entropies, are studied and derived. The proposed OBP-logistic distribution’s parameters are determined by adopting the maximum likelihood estimation (MLE) method. The applicability of the new PD was demonstrated by employing three data sets and these were compared by the known extended logistic distributions, such as the gamma generalized logistic distribution, new modified exponential logistic distribution, gamma-logistic distribution, exponential modified Weibull logistic distribution, exponentiated Weibull logistic distribution, and transmuted Weibull logistic distribution. The findings reveal that the studied distribution provides better results than the competing PDs. The empirical results showed that the new OBP-logistic distribution performs better than the other PDs based on several statistical metrics. We hoped that the newly constructed OBP-logistic distribution would be an alternative to other well-known extended logistic distributions for the statistical modeling of symmetric and skewed data sets.

1. Introduction

The probability distributions (PDs) are vital statistical tools for modeling the underlying behavior of a given data set collected from surveys, observational studies, experiments, and more. In many practical scenarios, most of these PDs are not suitable to fit every characteristic of real phenomenon. However, statistical literature lacks a standard PD model that can adequately suit every kind of phenomenon [1,2,3]. Therefore, it is imperative to establish new PDs that can provide better flexibility, which can be achieved by adding one or more parameters to the well-established PDs [4]. In recent times, there has been a gradual emergence of new FPDs developed for data modelling in many practical domains, such as the environment, finance, engineering, and biological sciences. Some of the well-known methods for generating new PDs are included in [5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20]. Other methods to generate flexible PDs are the odd generalized NH-G [21], Marshall–Olkin odd Lindley-G [22], Maxwell-G [23], odd inverse power generalized Weibull-G [24], generalized Rayleigh-G [25], generalized odd linear exponential-G [26], generalized exponential extended exponentiated-G [27] and generalized alpha exponent power-G [28], among others.
In this paper, we introduced a novel FPD derived using the beta prime (BP) distribution, popularly known as the beta of the second kind, Feller–Pareto, the Pearson Type VI, or generalized F distribution [29,30,31]. However, few studies have looked into the application of BP distribution [29]. For instance, the distribution of taxed income in Finland was fitted by [32] using a four-parameter BP distribution. The findings indicated that their proposed distribution performed better than the two- and three-parameter log-normal distributions. The study by [33] estimated the BP distribution for family incomes in the United States, while [34] determined the parameters via the maximum likelihood estimation (MLE) method and studied its properties. Moreover, the application of BP distribution was utilized by [35,36] to study regression models for positive random variables (RVs). As defined in [29,37,38], the BP has the following cumulative distribution function (cdf):
W x ; a , b = I x 1 + x B a , b ,               x > 0 ,       a , b > 0 ,
where B x a , b = 0 x ω a 1 1 ω b 1 d ω is the incomplete beta function.
The associated probability density function (pdf) is defined as
w ( x ; a , b ) = 1 B a , b x a 1 1 + x a + b ,               x > 0 .
The logistic distribution (LD) is a univariate continuous PD for modeling data in diverse contexts, including life sciences, physical sciences, sports, finance, insurance, neural networks, logit models, logistic regression, and recently, machine learning. This distribution is more flexible to the underlying data involving extreme events and has a fatter tail than a normal distribution suitable for modeling returns in the stock market [39]. Due to its convenience and significance, the LD is regarded by numerous researchers as a growth curve. For example, studies by [40,41] used LD in human populations and some biological organisms. Another study by [42] applied LD to data associated with the agricultural population. Another important application of the LD is in the analysis of survival data [43]. More interesting practical applications of the LD can be found in [44] and the references therein. The cdf of the LD is expressed as
P ( x ; μ , s ) = 1 1 + exp x μ s ,               x , μ ,     s > 0 .
The pdf of the LD corresponding to (3) is defined as
P ( x ; μ , s ) = exp x μ s s 1 + exp x μ s 2 ,               x , μ ,     s > 0 ,
where μ is the location parameter and s is the scale parameter.
Several generalizations of the LD have been introduced in the literature to study its properties and examine its adaptability and flexibility in modeling skewed data [45]. These generalizations are essential to enhance the fit in the non-central probability regions and to improve the use of LD regarding the asymmetric probability curves [46]. For example, Ref. [47] introduced the LD of type IV and applied it to study binomial regression data. In [48], the logistic regression model was proposed. Ref. [49] proposed three generalizations of LD types. Comprehensive generalizations of the LD have been studied and summarized [44]. The skewed LD was proposed [50]. An extended version of the LD was developed and studied [51] by adding a new parameter. Two extended versions of skewed LD have been established [52] using the method proposed and defined by [53]. The two widely used approaches to derive different generalizations of PDs, including the LD [54], are the T-X framework studied by [55] and modified by [56]. The exponentiated exponential LD was recently established by [57], and several of its properties where defined and derived.
Recently, Ref. [9] introduced a technique for generalizing FPDs with the cdf defined by
F ( x ) = a W Q ( x ) v ( m ) d m = M W Q ( x ) ,
where M is an RV, v is the pdf of the RV M a , b , such that a < b and W Q ( x ) is a link function of any cdf of continuous PDs that takes different forms (see [55]). If we consider the odd function form, W Q ( x ) = Q ( x ) 1 Q ( x ) , then the cdf will be
F ( x ) = 0 Q ( x ) 1 Q ( x ) v ( m ) d m = M Q ( x ) 1 Q ( x ) .
Overall, the quest for high-tailed models is motivated by their capacity to offer more precise and realistic representations of extreme occurrences, enabling better risk management, financial modeling, hedging strategies, insurance evaluations, systemic risk analysis, and climate science. By developing models that can accurately represent extreme values, decision-makers can better understand and manage risks, make informed choices, and improve the overall robustness of their systems.
The justifications of this paper are as follows:
(i)
To define novel FPDs using the BP distribution.
(ii)
To develop new PD that can accommodate both monotonic and non-monotonic hazard rates.
(iii)
To establish heavy-tailed models for different data sets.
(iv)
To generate a PD that can provide suitable shapes to fit symmetric and skewed real data sets that are commonly found in practical disciplines, including environment, engineering, and finance.
(v)
To obtain a flexible PD that can consistently provide more realistic fits to given data sets when tested against known competing PDs.
The present paper is an extended version of our previously published conference paper [58]. However, this article expands upon the conference paper by providing more detailed derivations, analyses, and presenting a more comprehensive discussion and interpretation of the results. Moreover, the present article includes additional sections, such as an expanded introduction, literature review, validity test for the proposed family, new extension of LD and its properties, simulation studies and conclusion, which were not present in the conference paper.
The remaining parts of the paper are sectioned as follows: Section 2 highlights the related literature review. Section 3 introduces a novel odd beta prime FPD along with its cdf, pdf, validity test, and mixture representations. In Section 4, a special odd beta prime generalized LD is presented. Several statistical features of the new PD are studied and derived in Section 5. Section 6 describes the parameter estimation of the proposed PD by adopting the MLE approach. Section 7 contains Monte Carlo simulation to assess the estimators’ accuracy. Section 8 proves the usefulness of the constructed PD using three data sets. Discussion is presented in Section 9. Section 10 is reserved for conclusions.

2. Literature Review

Many efforts have been made to generate new FPD that expand the established classical PDs to increase the flexibility of modeling data in real-life applications. Several developed families have been adopted and applied to modeling diverse data types with varied characteristics. Many families of probability distributions were established using various approaches in the statistical literature. In this section, we have reviewed some recent families of distributions alongside their sub-models.
Bell generalized FPD was introduced by [59]. Bell Weibull (BellW), a unique model of this family, was first introduced. Some properties of this model were derived. The numerical demonstration was used to evaluate several actuarial measures for the BellW distribution. The significance of the proposed model was verified based on finance and medical data sets.
The unit exponentiated half logistic power series (UEHLPS) FPD was proposed by [60]. Several special models of this family were introduced, including the UEHL Poisson, UEHL Geometric, UEHL Binomial, and UEHL Logarithmic models. The general properties were derived. The parameter estimation for the UEHLP model was performed based on various classical estimation methods. A numerical simulation was utilized to examine the estimators’ accuracy. The usefulness of the UEHLP distribution was evaluated by analyzing the COVID-19, milk production, and failure data sets.
A new extended cosine generalized FPD was constructed by [61]. Some statistical features were obtained. Several sub-models of this family were introduced. The Bayesian and non-Bayesian approaches were adopted for parameter estimation. A simulation study was conducted to evaluate these approaches and demonstrate their practicality. The significance of the proposed models was assessed based on the three data sets, such as the lifetimes of devices, carbon fibers, and the single fibers.
The sine-exponentiated Weibull generalized FPD was pioneered by [62] based on the sine generalized and exponentiated Weibull generalized FPDs. Several properties of the developed distributions were obtained. Six different techniques of estimation were employed to estimate the parameters. The adequacy of these techniques was verified using Monte Carlo simulations. The significance of the family was analyzed based on five real data sets. The developed distribution was quite flexible compared to several well-established distributions.
The type II half-logistic odd Fréchet generalized FPD was constructed by [63] based on the classical Fréchet family. This family was used to produce symmetrical and asymmetrical special models. The new models are produced by this family. Some mathematical features of this family were derived. The estimate of the parameters was performed based on six distinct techniques. The Monte Carlo simulation study was employed to test these estimation techniques. The model was explored to analyze the real-word data sets involving biomedical, engineering, environment, and strength failure.
The odd Perks generalized FPD was studied by [64]. Four different special models of this family were examined, including the odd Perks uniform, odd Perks exponential, odd Perks Weibull, and odd Perks Lomax models. Some essential features of the proposed family were derived. The Bayesian and non-Bayesian methods were applied to estimate the model parameters. The odd Perks Weibull was explored to propose a new log-location-scale regression model. Several data sets were employed to verify the relevance and significance of the introduced distributions.
The truncated Cauchy power Weibull generalized FPD was generated by [65]. This family was used to establish three special models, such as the truncated Cauchy power Weibull Lomax, truncated Cauchy power Weibull exponential, and truncated Cauchy power Weibull Rayleigh models. Various properties of this family were discussed. The parameters were estimated based on the MLE and Bayesian techniques using censored data. The adequacy of the estimators was assessed using Monte Carlo simulation. The importance of the family was illustrated by fitting the three members of this family to three real datasets. The result revealed that the members of the family performed better compared to comparator models, suggesting that the members of the family provided a better fit.
The type I half logistic Burr X generalized FPD was introduced by [66]. Three sub-models of this family were introduced. Some important features of this family were obtained. The parameter estimators were produced using the MLE and Bayesian techniques using the type II censored data. The estimators were evaluated using Monte Carlo simulation. Three medical data sets from various countries were used to validate the goodness of the new models.

3. Development of Odd Beta Prime Generalized Family of Distributions

Here, we highlight the development of the novel generalized family of BP distributions using the method defined by [9] named the odd beta prime generalized (OBP-G) FPDs. The cdf of OBP-G FPDs will be developed by substituting the pdf in (6) with the pdf given in (2) as follows:
F x ; a , b , η = 1 B a , b 0 Q x ; η 1 Q x ; η t a 1 1 + t a + b d t , =   B Q x ; η 1 Q x ; η a , b B a , b ;           t > 0 ,       a > 0 , b > 0 ,   η ,
where Q . ; η is a cdf of a baseline distribution with the vector parameter η . The associated pdf of OBP-G FPDs is obtained by differentiating (7) as
f x ; a , b , η = d d x F x ; a , b , η = 1 B a , b       d d x B Q x ; η 1 Q x ; η a , b , = 1 B a , b   Q x ; η 1 Q x ; η a 1 1 + Q x ; η 1 Q x ; η a + b   d d x Q x ; η 1 Q x ; η ;         x > 0 ,     a > 0 ,   b > 0 ,     η .
After simplifications, the pdf of (8) is
f x ; a , b , η = q x ; η B a , b 1 Q x ; η 2 Q x ; η 1 Q x ; η a 1 1 + Q x ; η 1 Q x ; η a + b ,           x > 0 ,       a > 0 , b > 0 ,   η ,  
where q . ; η is the pdf of the baseline model Q . ; η .
Henceforth, the RV X with the pdf given in (9) will be presented by X O B P G . We omit the dependence on parameters a , b and η from (7) and (9) by denoting F x ; a , b , η = F x and f x ; a , b , η = f x , respectively.
To verify whether the new OBP-G family is a valid FPD, the pdf given in (9) is used to satisfy the f ( x ) d x = 1 , where f is the pdf of OBP-G FPDs. For the sake of validation, we provide clear proof as follows:
f ( x ) d x = 1 B a , b 0 q x ; η 1 Q x ; η 2 Q x ; η 1 Q x ; η a 1 1 + Q x ; η 1 Q x ; η a + b d x .
With the following equations, where
y = Q x ; η 1 Q x ; η   ,               d y d x = 1 Q x ; η · q x ; η   Q x ; η q x ; η   1 Q x ; η 2 ,
we can obtain,
f ( x ) d x = 1 B a , b 0   y a 1 1 + y a + b   d y ,
where (12) is the beta of the second kind; hence,
f ( x ) d x = 1 B a , b   B a , b = 1 .
This completes the proof and concludes that the OBP-G FPDs are indeed valid FPDs.

Mixture Representations of the pdf of OBP-G FPDs

This subsection presents an important expansion of the OBP-G family pdf defined in (9).
Now, we can consider the Binomial expansion given by [58] as
1 + z n = j = 0 n j   z j = j = 0 ( 1 ) j n + j 1 j   z j .
Applying (14) in (9) yields
f x = q x ; η B a , b   j , k = 0 ( 1 ) j + k a + b + j 1                       j   a + j + k               k   Q x ; η a + j + k 1 .
After simplifying (15), we can obtain the mixture representation of OBP-G FPDs as
f x = j , k = 0 Λ j , k   q x ; η   Q x ; η a + j + k 1 .
where Λ j , k = ( 1 ) j + k a + b + j 1 j   a + j + k k B a , b .

4. Development of Odd Beta Prime-Logistic Distribution

This section introduces a new PD called the odd beta prime-logistic (OBP-logistic) distribution, which is developed by adding two parameters of the LD defined in (4) to the OBP-G FPDs presented in (9). The cdf of OBP-logistic can be derived by inserting (3) in (7) as
F x = B 1 exp x μ s a , b B a , b ,           x .
The associated pdf of (17) is derived by inputting (3) and (4) into (9) to obtain
f x = 1 s B a , b 1 exp x μ s a 1 + 1 exp x μ s a + b ,                 x ,
where a , b > 0 are shape parameters, μ is the location parameter and s > 0 is the scale parameter. Several potential shapes of the new OBP-logistic distribution are depicted in Figure 1a–c. The different shapes of this PD include symmetric (a), positive-skewed (b), and negative-skewed (c).
The survival function of the OBP-logistic is expressed by
S x = 1 B 1 exp x μ s a , b B a , b ,           x ,           a , b > 0 .
The hazard rate of the OBP-logistic is defined as
h x = 1 exp x μ s a s 1 + 1 exp x μ s B a , b B 1 exp x μ s a , b ,                 x ,           a , b > 0 .
Figure 2a,b display some possible shapes of the hazard rate for the OBP-logistic model. As observed from the figure, the hazard rate of this PD can assume flexible shapes, for some chosen parameter values. In addition, it is observed from Figure 2a,b that the hazard rate of the OBP-logistic model indicates decreasing (a) and increasing (b) shapes.

5. Statistical Features of OBP-Logistic Distribution

This section covers derivations for various statistical features of the OBP-logistic model.

5.1. Moments

This subsection provides the rth moment for the OBP-logistic distribution.
If we suppose that X follows the OBP-logistic distribution, the rth moment of X is expressed by
E X r = x r f ( x ) d x ,
where f ( x ) is defined in (16), and using (16) and (21), we can obtain
E X r = j , k = 0 Λ j , k x r q x   Q x a + j + k 1 d x .
By substituting (3) and (4), the moments in (22), we have
E X r = j , k = 0 Λ j , k S x r   exp x μ s 1 + exp x μ s a + j + k + 1 d x .
We can set
m = 1 1 + exp x μ s ,   so   that   d x =     s m 1 m   d m .
By inserting (24) in (23), we obtain
E X r = j , k = 0 Λ j , k 0 1 μ s log 1 m m r 1 m m a + j + k d m .
By setting r = 1 , we obtain
E x = j , k = 0 Λ j , k μ 0 1 1 m 2 1 m a + j + k + 1 1 d m                               j , k = 0 Λ j , k   s 0 1 log 1 m 1 m m a + j + k d m 0 1 log m 1 m m a + j + k d m ,
where E X is the mean for the OBP-logistic distribution.
Therefore, log ( m ) = c = 1 ( 1 ) c + 1 w 1 c c for 0 < m 2 and log ( 1 m ) = c = 1 ( 1 ) c m c c for m < 1 . This form of expansion expressed in (25) has been studied in [67] in Theorem 3.5.
Thus, Equation (25) gives
E X = μ B 2   ,   a + j + k + 1 j , k = 0 Λ j , k s c = 1 ( 1 ) 2 c c B c + 1   ,   a + j + k + 1   ,   B 2   ,   a + j + k + c + 1 .
This is the mean of the OBP-logistic distribution.

5.2. Moment-Generating Function (mgf)

The derivation of the mgf of OBP-logistic distribution in terms of the mixture representation defined in (16) is provided in this subsection.
If we assume that RV X follows the OBP-logistic distribution, the mgf of X is given by
M x t = E exp t X = exp t x f ( x ) d x ,
provided that this expectation exists for t in some neighborhood of 0.
Using (16) and (27), we obtain
M x t = j , k = 0 Λ j , k exp t x   q x   Q x a + j + k 1 d x .
Substituting (3) and (4) in (28), we obtain
M x t = j , k = 0 Λ j , k s exp t x   exp x μ s 1 + exp x μ s a + j + k + 1 d x .
With the following equation,
m = 1 1 + exp x μ s ,   we   obtain   d x =     s 1 + exp x μ s 2 exp x μ s   d m .
Inserting (30) in (29), we have
M x t = exp m t j , k = 0 Λ j , k 0 1 m a + j + k + s t 1 1 m 1 s t 1 d m .
After simplifications, we obtained the mgf of the OBP-logistic distribution as
M x t = exp m t j , k = 0 Λ j , k   B a + j + k + s t   ,   1 s t .

5.3. Information-Generating Function (IGF)

In information theory and statistics, the IGF has been utilized to generate some important information quantities, such as Kullback–Leibler divergence and Shannon entropy. It has been widely applied in physics and chemistry to analyze the atomic structure of a given phenomenon or system. This subsection provides the derivation of the IGF of OBP-logistic distribution.
The IGF of the RV X is given as
I γ ( x ) = f γ ( x ) d x ,   for   γ > 0 ,
when the integral is finite.
f is defined in (18). Hereafter, the integrand f γ ( x ) in (33) can be given by
f γ x = 1 s B a , b γ 1 / exp x μ s a · γ 1 + 1 / exp x μ s ( a + b ) · γ .
Using (23), the integrand in (34) is
f γ x = s B a , b γ 1 w 1 a γ 1 w ( a + b ) γ ,
where w = 1 1 + 1 exp x μ s .
By   setting   d x = 1 w 2 1 s · w 1 w ,   we   can   obtain   d w     = s d w w ( 1 w ) .
By inputting (35) and (36) into (33), we have
I γ ( x ) = 1 s B a , b γ 0 1 1 w 1 a γ 1 w ( a + b ) γ s d w w ( 1 w ) = s s B a , b γ B γ b   ,   γ a .
This gives the igf of the OBP-logistic distribution.

5.4. Quantile Function

One can suppose the RV X follows the OBP-logistic distribution with cdf (17). Then, the quantile function (QF) of the OBP-logistic model is derived by inverting (17) as follows:
F x =   B Q x 1 Q x a , b B a , b = I Q x 1 Q x   ;   a , b .
The QF of the baseline cdf is computed by inverting (38) as
Q x 1 Q x = I 1 u     ;     a , b .
Then, we can obtain
Q x 1 + I 1 u     ;     a , b = I 1 u     ;     a , b .
Therefore,
Q x = I 1 u     ;     a , b 1 + I 1 u     ;     a , b ,
which is the QF of the baseline model, where u is the uniform random variable in the interval (0,1). The QF of the OBP-logistic model is obtained by inserting (3) in (39) as
1 1 + exp x μ s = K ,
where K = I 1 u     ;     a , b 1 + I 1 u     ;     a , b .
We can simplify (40) as
K 1 + exp x μ s = 1 ,
where x u = s log 1 K K .
Hence, the QF of OBP-logistic model is
x = u s log 1 K K .

5.5. Stress–Strength

Here, we present the formulation of the stress–strength function of the OBP-logistic distribution. The stress–strength function is used to assess the reliability of a product exposed to variant stress [68].
If we suppose that X1 and X2 are two independent RVs with the OBP-logistic model with a , b , μ , s parameters, then the stress–strength function of the OBP-logistic model is expressed by
R = P X 2 < X 1 = x 1 f x 1 ; a 1 , b , μ , s f x 2 ; a 2 , b , μ , s d x 1 d x 2 .
where R = P X 2 < X 1 is the failure that occurs when the applied stress exceeds the system’s strength; f x 1 ; a 1 , b , μ , s and F x 1 ; a 2 , b , μ , s are the pdf and cdf of the OBP-logistic distribution. The integrand in (41) can be expressed as
R = f x 1 ; a 1 , b , μ , s F x 1 ; a 2 , b , μ , s d x 1 .
As given in [69], the cdf of the Beta function can be expanded as
I z ( a , b ) = z a B a , b i = 0 1 b i a + i i !   z i .
Inserting (43) in (42), we obtain
f x 1 ; a 1 , b , μ , s F x 1 ; a 2 , b , μ , s = f x 1 ; a 1 , b , μ , s ×                                                                   1 exp x μ s a 2 B a 2 , b i = 0 1 b i a 2 + i i ! 1 exp x μ s i .
For simplicity, (44) can be expressed as
f x 1 ; a 1 , b , μ , s F x 1 ; a 2 , b , μ , s = Ψ × f x 1 ; a 1 , b , μ , s i = 0 1 exp x μ s a 2 + i ,
where Ψ = 1 b i B a 2 , b a 2 + i i ! .
By inserting (45) in (41), we obtain
R = Ψ   ×   Ω 1 exp x μ s a 1 + a 2 + i 1 + 1 exp x μ s a 1 + b d x 1 ,
where Ω = 1 s B ( a 1 , b ) .
When y = 1 exp x μ s , we can obtain the following equation:
d x 1 = s × exp x μ s .
Inserting (47) into (46), we have
R = Ψ   ×   Ω ×   s y a 1 + a 2 + i 1 1 + y a 1 + b   d y .
After simplifications, the stress–strength function of the OBP-logistic model is
R = 1 B ( a 1 , b ) B ( a 2 , b ) i = 0 1 b i a 2 + i i !   ×   B a 1 + a 2 + i   ,   b a 2 i .

5.6. Order Statistics

Here, we derive the order statistics of the OBP-logistic. If we assume that the RV X 1 : n , X 2 : n , , X n : n follows the OBP-logistic model and X 1 : n < X 2 : n < < X n : n is a set of RVs of n order, then the distribution of the i t h order statistics is
F i : n x = n ! i 1 ! n i ! F x i 1 1 F x n i .
F i : n x = n ! f x i 1 ! n i ! j = 0 ( n i ) 1 j n i j F x i + j 1 ,
where f and F are the pdf and cdf of the OBP-logistic model. Now, (51) can be expressed as
F i : n x = n ! f x i 1 ! n i ! j = 0 ( n i ) 1 j n i j × k , m = 0 1 k + m i + j 1 k k m F x m ,
I x ( a , b ) = x a B a , b w = 0 1 b w a + w w i !   x w .
Inserting (53) into (52), we have
F i : n x = f x j = 0 ( n i ) k , m = 0 Ψ j , k , m 1 exp x μ s a B a , b w = 0 1 b w a + w w i ! × 1 exp x μ s w m .
F i : n x = w 1 m = 0 1 b w 1 1 b w m × 1 exp x μ s a + a m + w 1 + + w m 1 B a , b m × a + w 1 a + w m w 1 ! w m ! × 1 s B a , b 1 + 1 exp x μ s ( a + b ) j = 0 ( n i ) k , m = 0 Ψ j , k , m .
Applying the Kampé de Fériet series [70], (54) can be expressed as
F i : n x = 1 exp x μ s a 1 + m 1 s B a , b 1 + m 1 + 1 exp x μ s a + b × a m j = 0 ( n i ) k , m = 0 Ψ j , k , m ×             F 1 : 1 1 : 2 a 1 + m : 1 b , a , , 1 b , a , b + a 1 + m : a + 1 , , a + 1 , 1 , , 1 .
This is the order statistics of the OBP-logistic model.

5.7. Entropies

In this subsection, two measures of variation in uncertainty that include the Rényi and q entropies are presented.
If we assume that X is a RV with pdf f , the Rényi entropy [71] is expressed by
R γ ( x ) = 1 1 γ log f γ ( x ) d x ,           γ > 0 ,       γ 1 ,       x ,
where f is the pdf of the OBP-logistic distribution.
Using (37) and (57), we have
R γ ( x ) = 1 1 γ log s s B a , b γ B γ b   ,   γ a .
The expression of the q -entropy is given by
Q γ ( x ) = 1 1 γ log 1 f γ ( x ) d x ,           γ > 0 ,       γ 1 ,       x .
The q -entropy is obtained by substituting (37) as
Q γ ( x ) = 1 1 γ log 1 s s B a , b γ B γ b   ,   γ a .

6. Maximum Likelihood Estimation

Here, we use the MLE method to develop estimators for estimating the parameters of the OBP-logistic model. If we assume that x 1 , x 2 , x n are the possible outcomes of an experimental sample of size ( n ) drawn from a OBP-logistic model with set of parameters Φ = a , b , μ , s , then the likelihood function for the parameter vector Φ = a , b , μ , s T is given by
L x ; a , b , μ , s = Γ ( a + b ) s Γ ( a ) Γ ( b ) n i = 1 n 1 exp x μ s a 1 + 1 exp x μ s a + b ,
where B a , b = Γ ( a ) Γ ( b ) Γ ( a + b ) (see [72]).
Then, the log-likelihood function of (61) is
l = log L x ; a , b , μ , s = n log Γ ( a + b ) n log s n log Γ ( a ) n log Γ ( b ) + a i = 1 n log 1 exp x i μ s a + b i = 1 n log 1 + 1 exp x i μ s .
The ML estimates (MLEs) Φ ^ of Φ are determined by maximizing (62). The components of the score vector with respect to a , b , μ and s are
l a = n ψ ( a + b ) n ψ ( a ) + i = 1 n log 1 exp x i μ s i = 1 n log 1 + 1 exp x i μ s ,
l b = n ψ ( a + b ) n ψ ( b ) i = 1 n log 1 + 1 exp x i μ s ,
l μ = n a s + ( a + b ) s i = 1 n 1 1 + exp x i μ s ,
l s = n s a s 2 i = 1 n x i μ + ( a + b ) s 2 i = 1 n x i μ 1 + exp x i μ s ,
where ψ is the digamma function.
The estimators for the parameters can be obtained by setting (63)–(66) to zero. Although it is laborious to find the system’s solutions analytically, the R programming language [73] provides the numerical solutions of the system using iterative Newton–Raphson methods.

7. Monte Carlo Simulation Study

Here, we test the accuracy of the MLEs based on n random samples. Simulations are performed to study the behavior of the MLE for the OBP-logistic distribution. It is employed for maximizing (62). The number of simulation replicates is observed for R = 1000 times on different random samples of various sizes of n = 50 ,   150 ,   300 , ,   2050 . The average MLEs, biases, and mean square errors (MSEs) are obtained at each sample size based on the quantile function defined in (40). The average bias and MSE can be expressed as
Bias = 1 R i = 1 R Φ ^ i Φ ,
MSE = 1 R i = 1 R Φ ^ i Φ 2 ,
where Φ = ( a , b , μ , s ) T .
Table 1 displays the simulation results for different parameter values. Accuracy is achieved as the estimates tend to be closer to their actual values when the n increases. At first, we generate 1000 random samples from the OBP-logistic distribution ( a , b , μ , s ) from different variations in the parameters ( a = 0.5 ,   b = 1.5 ,   μ = 0.75 ,   s = 0.5 ) for fixed sample sizes n = 50 ,   150 ,   300 , ,   2050 and secondly, for different variations in the parameters ( a = 0.35 ,   b = 1.7 ,   μ = 1.2 ,   s = 0.2 ) for the fixed sample sizes n = 50 ,   150 ,   300 , ,   2050 .
In addition, the values of the error measures of the parameter estimates decrease when the n increases. Therefore, it is evident that the MLE approach generates consistent estimates in assessing the parameters of the OBP-logistic distribution for all possible parameter value selections. Table 1 also demonstrated that as n increases, the MSE for the estimators decreases.

8. Applications

This section illustrates the performance of the new OBP-logistic model against several competitive models using data from gas fiber (GF), carbon fiber (CF) and magnesium concentrations (MC). The fitting flexibility of the OBP-logistic distribution is measured by comparing it to the gamma generalized logistic (GGL) distribution [74], new modified exponential logistic (NMEL) distribution [75], gamma- logistic (GL) distribution [45], exponential modified Weibull logistic (EMWL) distribution [76], exponentiated Weibull logistic (EWL) distribution [77], and transmuted Weibull logistic (TWL) distribution [78] using the statistical accuracy measures, such as the minimized ( l ^ ), the Akaike information criterion (AIC), Bayesian information criterion (BIC), Cramer–von Mises (CM), and Kolmogorov–Smirnov (KS) and Anderson– Darling (AD) statistics. Given the competitive models, the suitable model is the one that provides the lowest values of the aforementioned measures [79,80]. All computations in this study were performed using R studio software (version 4.2.1).

8.1. Data Set 1: Glass Fiber Data

These data were collected from the UK National Physical Laboratory and were utilized to illustrate the potential of the WAPIE PD as used previously by [11,23,81,82,83,84]. The observations are reported below (Table 2).

8.2. Data Set 2: Carbon Fiber Data

As used by [37], the 100 collected samples are reported below (Table 3).

8.3. Data Set 3: Magnesium Concentration Data

Ensuring clean water and sanitation is crucial for promoting the good health and well-being of an increasing population. These data were studied by [85] to assess the magnesium concentrations for the groundwater quality. The data set is given below (Table 4).
Table 5 depicts the statistical summary for the GF, CF and MC data sets alongside their empirical density and box plots, as shown in Figure 3a,b, Figure 4a,b and Figure 5a,b. The figures indicate that the distribution of the GF data set was skewed to the left, while the CF and MC data sets were skewed to the right, respectively. Therefore, the OBP-logistic distribution could provide a suitable fit for these types of data sets.
The MLEs with corresponding standard errors (SEs) for the OBP-logistic and competing models (GGL, NMEL, GL, EMWL, EWL, and TWL) fitted to the GF, CF and MC data are presented in Table 6, Table 7 and Table 8, respectively. The statistical metrics, l ^ , AIC, BIC, CM, and AD, are shown in Table 9, Table 10 and Table 11, respectively. From these tables, the OBP-logistic model revealed the lowest values of l ^ , AIC, BIC, CM, and AD statistics in comparison to other competitive models. Thus, the OBP-logistic model can be selected as the best-fitting model for the GF and CF data sets. Moreover, plots in Figure 6, Figure 7 and Figure 8 also confirm these findings.

9. Discussion

This work introduces the family of probability distributions using the approach proposed by [55] to generate a compound distribution with more flexibility. The proposed family was used to develop the extended version of the logistic distribution as a sub-model. The pdf of the extended odd beta prime-logistic distribution and hazard function provided decreasing and increasing shapes, which is suitable for modeling many practical data sets from biological, environmental, finance, and engineering fields. The performance of the parameter estimates of the developed model were validated using the Monte Carlo simulation. The results from the simulation study revealed that the estimates tend to be closer to their actual values when n increases. In addition, the values of the error measures of the parameter estimates decrease when n increases. These findings provide evidence that the MLE approach generates consistent and reliable estimates for the proposed distribution across a wide range of parameter value selections. The numerical results obtained by using the statistical measure of accuracy, such as the l ^ , AIC, BIC, CM, AD, and KS statistics, in comparison to other competitive models and the results are presented in Table 9, Table 10 and Table 11. According to these tables, the OBP-logistic model revealed the smallest values of l ^ , AIC, BIC, CM, AD, and KS statistics in comparison to other competitive models. The numerical results presented in Table 9, Table 10 and Table 11 demonstrate that the OBP-logistic model consistently outperforms the other competitive models, as indicated by the smallest values of the aforementioned statistical measures. This suggests that the developed model exhibits superior flexibility and accuracy in fitting the data compared to the existing distributions. It is worth noting that all mathematical formulations and computations were implemented using the R Studio software. This ensures that the results are reproducible and reliable, contributing to the robustness of the study.
Overall, this paper introduces the OBP-G family of distributions and focuses on the OBP-logistic distribution as a notable member of this family. We provide a comprehensive analysis of its statistical properties, applicability to real data sets, and performance compared to other established distributions. The findings suggest that the OBP-logistic distribution offers significant advantages and is a promising choice for modeling a wide range of data sets.

10. Conclusions

In this paper, we present a novel family of probability distributions called the odd beta prime generalized (OBP-G) family of distributions. We discuss the cumulative distribution function (CDF), probability density function (PDF), and mixture representations of this new family. Additionally, we propose a new distribution within this family, namely the OBP-logistic distribution. The OBP-logistic distribution is developed based on the OBP-G family and offers a flexible modeling approach for various types of data sets. The probability density and hazard rate functions of the OBP-logistic distribution demonstrate its capability to handle skewed, symmetric, monotonically decreasing, and decreasing lifetime data. To assess the statistical properties of the OBP-logistic distribution, we define and derive several desirable features. Furthermore, we apply the maximum likelihood estimation (MLE) method to obtain estimators for the distribution’s parameters. To demonstrate the practicality of the OBP-logistic distribution, we compare it with well-established extended logistic distributions using three real data sets. We employ various statistical accuracy measures to assess the performance of the proposed distribution. The results reveal that the OBP-logistic distribution outperforms the competing distributions in terms of statistical accuracy.
For future research, this study can be extended by employing alternative methods for parameter estimation, such as the Bayesian approach, which can be preferred to make inferences regarding the unknown parameters of the model. It is also possible to assess the applicability of the new model using other data sets from various fields, such as medical sciences and finance. The study can also be improved by obtaining the properties of the OBP-G FPDs that are not covered in this paper.

Author Contributions

Conceptualization, A.A.S., H.D., N.S.S.S. and M.O.; Methodology, A.A.S., H.D., M.O. and A.I.I.; Software, A.A.S. and A.I.I.; Validation, A.A.S., M.O., H.D. and R.S.; Supervision, H.D., R.S. and M.O.; Formal analysis, A.A.S., H.D., M.O., A.I.I., N.S.S.S. and R.S.; Writing—original draft, A.A.S., H.D., A.I.I. and R.S.; Data curation, A.A.S., H.D. and A.I.I.; Writing—review and editing, M.O., A.A.S., H.D., R.S., N.S.S.S. and A.I.I.; Visualization, A.A.S., M.O., H.D. and R.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Yayasan Universiti Teknologi PETRONAS (YUTP) with the cost center 015LC0-401 and INTI International University, Malaysia.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The simulated data sets and all other data sets used to support the findings of this study are available from the corresponding author upon request.

Acknowledgments

The authors would like to sincerely thank the Universiti Teknologi PETRONAS and Aliko Dangote University of Science and Technology for providing support to this project. The first author would also like to express his gratitude to the Universiti Teknologi PETRONAS for sponsoring his PhD studies and providing him with a position as a graduate assistant. The authors wish to extend their sincere thanks to the support of the Faculty of Data Science and Information Technology, INTI International University, Malaysia for providing state-of-the-art research support to carry on this work. Finally, the authors express their gratitude to the referees for their insightful comments.

Conflicts of Interest

The authors declare no conflict of interest.

Nomenclature

W x ; a , b Cumulative distribution function of beta of the second kind
I x a , b Incomplete beta function ratio
a , b Shape parameters of beta of the second kind
B a , b Beta function
B x a , b Incomplete beta function
w ( x ; a , b ) Probability density function of beta of the second kind
P ( x ; μ , s ) Cumulative distribution function of logistic distribution
μ Location parameter of logistic distribution
s Scale parameter of logistic distribution
F x ; a , b , η Cumulative distribution function of family of distributions
Q x ; η 1 Q x ; η Odds ration
η Vector parameter
f x ; a , b , η Probability density function of the baseline distribution
S x Survival function
h x Hazard function
E X r r t h moment
M x t Moment-generating function
I γ ( x ) Information-generating function
R Stress–Strength function
F i : n x Order statistics
f Probability density function of OBP-logistic distribution
F Cumulative density function of OBP-logistic distribution
R γ ( x ) Rényi entropy
Q γ ( x ) q-entropy
n Sample size
Φ , a , b , μ , s T Vector of parameters
L x ; a , b , μ , s Likelihood function
l Log-likelihood function
ψ Digamma function
I γ ( x ) Information-generating function
I 1 u     ;     a , b Inverted cumulative distribution function
u Uniform random variable on the interval (0,1)

References

  1. Eliwa, M.S.; Altun, E.; Alhussain, Z.A.; Ahmed, E.A.; Salah, M.M.; Ahmed, H.H.; El-Morshedy, M. A new one-parameter lifetime distribution and its regression model with applications. PLoS ONE 2021, 16, e0246969. [Google Scholar] [CrossRef] [PubMed]
  2. Nasiru, S. Extended Odd Fréchet-G Family of Distributions. J. Probab. Stat. 2018, 2018, 2931326. [Google Scholar] [CrossRef]
  3. El-Morshedy, M.; Alshammari, F.S.; Hamed, Y.S.; Eliwa, M.S.; Yousof, H.M. A New Family of Continuous Probability Distributions. Entropy 2021, 23, 194. [Google Scholar] [CrossRef]
  4. Yousof, H.M.; Altun, E.; Ramires, T.G.; Alizadeh, M.; Rasekhi, M. A new family of distributions with properties, regression models and applications. J. Stat. Manag. Syst. 2018, 21, 163–188. [Google Scholar] [CrossRef]
  5. Eugene, N.; Lee, C.; Famoye, F. Beta-normal distribution and its applications. Commun. Stat.-Theory Methods 2002, 31, 497–512. [Google Scholar] [CrossRef]
  6. Zografos, K.; Balakrishnan, N. On families of beta-and generalized gamma-generated distributions and associated inference. Stat. Methodol. 2009, 6, 344–362. [Google Scholar] [CrossRef]
  7. Nadarajah, S.; Cordeiro, G.M.; Ortega, E.M. General results for the Kumaraswamy-G distribution. J. Stat. Comput. Simul. 2012, 82, 951–979. [Google Scholar] [CrossRef]
  8. Cordeiro, G.M.; Ortega, E.M.; da Cunha, D.C. The exponentiated generalized class of distributions. J. Data Sci. 2013, 11, 1–27. [Google Scholar] [CrossRef]
  9. Lee, C.; Famoye, F.; Alzaatreh, A.Y. Methods for generating families of univariate continuous distributions in the recent decades. Wiley Interdiscip. Rev. Comput. Stat. 2013, 5, 219–238. [Google Scholar] [CrossRef]
  10. Cordeiro, G.M.; Alizadeh, M.; Ortega, E.M. The exponentiated half-logistic family of distributions: Properties and applications. J. Probab. Stat. 2014, 2014. [Google Scholar] [CrossRef]
  11. Bourguignon, M.; Silva, R.B.; Cordeiro, G.M. The Weibull-G family of probability distributions. J. Data Sci. 2014, 12, 53–68. [Google Scholar] [CrossRef]
  12. Tahir, M.H.; Cordeiro, G.M.; Alizadeh, M.; Mansoor, M.; Zubair, M.; Hamedani, G.G. The odd generalized exponential family of distributions with applications. J. Stat. Distrib. Appl. 2015, 2, 1–28. [Google Scholar] [CrossRef] [Green Version]
  13. Amal, S.H.; Elgarhy, M. A New Family of Exponentiated Weibull-Generated Distributions. Int. J. Math. Its Appl. 2016, 4, 135–148. [Google Scholar]
  14. Afify, A.Z.; Altun, E.; Alizadeh, M.; Ozel, G.; Hamedani, G. The odd exponentiated half-logistic-G family: Properties, characterizations and applications. Chil. J. Stat. 2017, 8, 65–91. [Google Scholar]
  15. Gomes-Silva, F.S.; Percontini, A.; de Brito, E.; Ramos, M.W.; Venâncio, R.; Cordeiro, G.M. The odd Lindley-G family of distributions. Austrian J. Stat. 2017, 46, 65–87. [Google Scholar] [CrossRef] [Green Version]
  16. Reyad, H.; Alizadeh, M.; Jamal, F.; Othman, S. The Topp Leone odd Lindley-G family of distributions: Properties and applications. J. Stat. Manag. Syst. 2018, 21, 1273–1297. [Google Scholar] [CrossRef]
  17. ul Haq, M.A.; Elgarhy, M. The odd Frèchet-G family of probability distributions. J. Stat. Appl. Probab. 2018, 7, 189–203. [Google Scholar] [CrossRef]
  18. Alizadeh, M.; Rasekhi, M.; Yousof, H.M.; Hamedani, G. The transmuted Weibull-G family of distributions. Hacet. J. Math. Stat. 2018, 47, 1671–1689. [Google Scholar] [CrossRef]
  19. Oluyede, B. The gamma-Weibull-G Family of distributions with applications. Austrian J. Stat. 2018, 47, 45–76. [Google Scholar] [CrossRef] [Green Version]
  20. Hassan, A.S.; Nassr, S.G. Power Lindley-G family of distributions. Ann. Data Sci. 2019, 6, 189–210. [Google Scholar] [CrossRef]
  21. Ahmad, Z.; Elgarhy, M.; Hamedani, G.; Butt, N.S. Odd generalized NH generated family of distributions with application to exponential model. Pak. J. Stat. Oper. Res. 2020, 16, 53–71. [Google Scholar] [CrossRef]
  22. Jamal, F.; Reyad, H.; Chesneau, C.; Nasir, M.A.; Othman, S. The Marshall-Olkin odd Lindley-G family of distributions: Theory and applications. Punjab Univ. J. Math. 2020, 51, 7. [Google Scholar]
  23. Ishaq, A.I.; Abiodun, A.A. The Maxwell–Weibull distribution in modeling lifetime datasets. Ann. Data Sci. 2020, 7, 639–662. [Google Scholar] [CrossRef]
  24. Al-Moisheer, A.S.; Elbatal, I.; Almutiry, W.; Elgarhy, M. Odd Inverse Power Generalized Weibull Generated Family of Distributions: Properties and Applications. Math. Probl. Eng. 2021, 2021, 5082192. [Google Scholar] [CrossRef]
  25. Barranco-Chamorro, I.; Iriarte, Y.A.; Gómez, Y.M.; Astorga, J.M.; Gómez, H.W. A Generalized Rayleigh Family of Distributions Based on the Modified Slash Model. Symmetry 2021, 13, 1226. [Google Scholar] [CrossRef]
  26. Jamal, F.; Handique, L.; Ahmed, A.H.N.; Khan, S.; Shafiq, S.; Marzouk, W. The Generalized Odd Linear Exponential Family of Distributions with Applications to Reliability Theory. Math. Comput. Appl. 2022, 27, 55. [Google Scholar] [CrossRef]
  27. Hussain, S.; Sajid Rashid, M.; Ul Hassan, M.; Ahmed, R. The Generalized Exponential Extended Exponentiated Family of Distributions: Theory, Properties, and Applications. Mathematics 2022, 10, 3419. [Google Scholar] [CrossRef]
  28. Hussain, S.; Rashid, M.S.; Ul Hassan, M.; Ahmed, R. The Generalized Alpha Exponent Power Family of Distributions: Properties and Applications. Mathematics 2022, 10, 1421. [Google Scholar] [CrossRef]
  29. Bourguignon, M.; Santos-Neto, M.; de Castro, M. A new regression model for positive random variables with skewed and long tail. Metron 2021, 79, 33–55. [Google Scholar] [CrossRef]
  30. Kalbfleisch, J.D.; Prentice, R.L. The Statistical Analysis of Failure Time Data; John Wiley & Sons: Hoboken, NJ, USA, 2011. [Google Scholar]
  31. Venter, G. Transformed beta and gamma distributions and aggregate losses. In Proceedings of Casualty Actuarial Society; Casualty Actuarial Society: Arlington, VA, USA, 1983; Volume 70, pp. 289–308. [Google Scholar]
  32. Vartia, P.; Vartia, Y.O. Description of the Income Distribution by the Scaled F Distribution Model; Elinkeinoelämän Tutkimuslaitos: Helsinki, Finland, 1981. [Google Scholar]
  33. McDonald, J.B.; Ransom, M.R. Functional forms, estimation techniques and the distribution of income. Econom. J. Econom. Soc. 1979, 47, 1513–1525. [Google Scholar] [CrossRef]
  34. McDonald, J.B. Model selection: Some generalized distributions. Commun. Stat.-Theory Methods 1987, 16, 1049–1074. [Google Scholar] [CrossRef]
  35. McDonald, J.B.; Butler, R.J. Regression models for positive random variables. J. Econom. 1990, 43, 227–251. [Google Scholar] [CrossRef]
  36. Tulupyev, A.; Suvorova, A.; Sousa, J.; Zelterman, D. Beta prime regression with application to risky behavior frequency screening. Stat. Med. 2013, 32, 4044–4056. [Google Scholar] [CrossRef] [PubMed]
  37. Ferreira, J.; Soares, C.G. Modelling the long-term distribution of significant wave height with the Beta and Gamma models. Ocean. Eng. 1999, 26, 713–725. [Google Scholar] [CrossRef]
  38. Dubey, S.D. Compound gamma, beta and F distributions. Metrika 1970, 16, 27–31. [Google Scholar] [CrossRef]
  39. Joshi, R.K.; Kumar, V. The Logistic Gompertz Distribution with Properties and Applications. Bull. Math. Stat. Res. 2020, 8, 81–94. [Google Scholar]
  40. Brown, R.Z. Social behavior, reproduction, and population changes in the house mouse (Mus musculus L.). Ecol. Monogr. 1953, 23, 218–240. [Google Scholar] [CrossRef]
  41. Schultz, H. The standard error of a forecast from a curve. J. Am. Stat. Assoc. 1930, 25, 139–185. [Google Scholar] [CrossRef]
  42. Oliver, F. Methods of estimating the logistic growth function. J. R. Stat. Soc. Ser. C (Appl. Stat.) 1964, 13, 57–66. [Google Scholar] [CrossRef]
  43. Ravikumar, K. Negative Binomial Logistic Distribution. Turk. J. Comput. Math. Educ. (TURCOMAT) 2021, 12, 5963–5976. [Google Scholar]
  44. Johnson, N.L.; Kotz, S.; Balakrishnan, N. Continuous Univariate Distributions, Volume 2; John Wiley & Sons: Hoboken, NJ, USA, 1995. [Google Scholar]
  45. Alzaatreh, A.; Ghosh, I.; Said, H. On the gamma-logistic distribution. J. Mod. Appl. Stat. Methods 2014, 13, 5. [Google Scholar] [CrossRef] [Green Version]
  46. Morais, A.L.; Cordeiro, G.M.; Cysneiros, A.H. The beta generalized logistic distribution. Braz. J. Probab. Stat. 2013, 27, 185–200. [Google Scholar] [CrossRef]
  47. Prentice, R.L. A generalization of the probit and logit methods for dose response curves. Biometrics 1976, 32, 761–768. [Google Scholar] [CrossRef] [PubMed]
  48. Stukel, T.A. Generalized logistic models. J. Am. Stat. Assoc. 1988, 83, 426–431. [Google Scholar] [CrossRef]
  49. Balakrishnan, N.; Leung, M. Order statistics from the type I generalized logistic distribution. Commun. Stat.-Simul. Comput. 1988, 17, 25–50. [Google Scholar] [CrossRef]
  50. Wahed, A.; Ali, M.M. The skew-logistic distribution. J. Statist. Res 2001, 35, 71–80. [Google Scholar]
  51. Nadarajah, S. The skew logistic distribution. AStA Adv. Stat. Anal. 2009, 93, 187–203. [Google Scholar] [CrossRef]
  52. Gupta, R.D.; Kundu, D. Generalized logistic distributions. J. Appl. Stat. Sci. 2010, 18, 51. [Google Scholar]
  53. Azzalini, A. A class of distributions which includes the normal ones. Scand. J. Stat. 1985, 12, 171–178. [Google Scholar]
  54. Aljarrah, M.A.; Famoye, F.; Lee, C. Generalized logistic distribution and its regression model. J. Stat. Distrib. Appl. 2020, 7, 1–21. [Google Scholar] [CrossRef]
  55. Alzaatreh, A.; Lee, C.; Famoye, F. A new method for generating families of continuous distributions. Metron 2013, 71, 63–79. [Google Scholar] [CrossRef] [Green Version]
  56. Aljarrah, M.A.; Lee, C.; Famoye, F. On generating TX family of distributions using quantile functions. J. Stat. Distrib. Appl. 2014, 1, 1–17. [Google Scholar] [CrossRef]
  57. Ghosh, I.; Alzaatreh, A. A new class of generalized logistic distribution. Commun. Stat.-Theory Methods 2018, 47, 2043–2055. [Google Scholar] [CrossRef]
  58. Suleiman, A.; Othman, M.; Ishaq, A.; Daud, H.; Indawati, R.; Abdullah, M.L.; Husin, A. The Odd Beta Prime-G Family of Probability Distributions: Properties and Applications. In Proceedings of the 1st International Online Conference on Mathematics and Applications, Online, 1–15 May 2023; MDPI: Basel, Switzerland, 2023. [Google Scholar] [CrossRef]
  59. Alsadat, N.; Ahmad, A.; Jallal, M.; Gemeay, A.M.; Meraou, M.A.; Hussam, E.; Elmetwally, E.M.; Hossain, M.M. The novel Kumaraswamy power Frechet distribution with data analysis related to diverse scientific areas. Alex. Eng. J. 2023, 70, 651–664. [Google Scholar] [CrossRef]
  60. Alghamdi, S.M.; Shrahili, M.; Hassan, A.S.; Mohamed, R.E.; Elbatal, I.; Elgarhy, M. Analysis of Milk Production and Failure Data: Using Unit Exponentiated Half Logistic Power Series Class of Distributions. Symmetry 2023, 15, 714. [Google Scholar] [CrossRef]
  61. Muhammad, M.; Bantan, R.A.R.; Liu, L.; Chesneau, C.; Tahir, M.H.; Jamal, F.; Elgarhy, M. A New Extended Cosine—G Distributions for Lifetime Studies. Mathematics 2021, 9, 2758. [Google Scholar] [CrossRef]
  62. Alyami, S.A.; Elbatal, I.; Alotaibi, N.; Almetwally, E.M.; Elgarhy, M. Modeling to Factor Productivity of the United Kingdom Food Chain: Using a New Lifetime-Generated Family of Distributions. Sustainability 2022, 14, 8942. [Google Scholar] [CrossRef]
  63. Alyami, S.A.; Babu, M.G.; Elbatal, I.; Alotaibi, N.; Elgarhy, M. Type II Half-Logistic Odd Fr&eacute;chet Class of Distributions: Statistical Theory and Applications. Symmetry 2022, 14, 1222. [Google Scholar]
  64. Elbatal, I.; Alotaibi, N.; Almetwally, E.M.; Alyami, S.A.; Elgarhy, M. On Odd Perks-G Class of Distributions: Properties, Regression Model, Discretization, Bayesian and Non-Bayesian Estimation, and Applications. Symmetry 2022, 14, 883. [Google Scholar] [CrossRef]
  65. Alotaibi, N.; Elbatal, I.; Almetwally, E.M.; Alyami, S.A.; Al-Moisheer, A.S.; Elgarhy, M. Truncated Cauchy Power Weibull-G Class of Distributions: Bayesian and Non-Bayesian Inference Modelling for COVID-19 and Carbon Fiber Data. Mathematics 2022, 10, 1565. [Google Scholar] [CrossRef]
  66. Algarni, A.; Almarashi, A.M.; Elbatal, I.; Hassan, A.S.; Almetwally, E.M.; Daghistani, A.M.; Elgarhy, M. Type I Half Logistic Burr X-G Family: Properties, Bayesian, and Non-Bayesian Estimation under Censored Samples and Applications to COVID-19 Data. Math. Probl. Eng. 2021, 2021, 5461130. [Google Scholar] [CrossRef]
  67. David Sam Jayakumar, G.S.; Sulthan, A.; Samuel, W. A new bivariate beta distribution of Kind-1 of Type-A. J. Stat. Manag. Syst. 2019, 22, 141–158. [Google Scholar] [CrossRef]
  68. Baro-Tijerina, M.; Pina-Monarrez, M.R.; Villa-Covarrubias, B. Stress-strength Weibull analysis with different shape parameter and probabilistic safety factor. DYNA 2020, 87, 28–33. [Google Scholar] [CrossRef]
  69. Gauss, M.C.; Saralees, N. Closed-form expressions for moments of a class of beta generalized distributions. Braz. J. Probab. Stat. 2011, 25, 14–33. [Google Scholar] [CrossRef]
  70. Verma, A. Finite summation formulas of generalized Kampé de Fériet series. arXiv 2020, arXiv:2003.07530. [Google Scholar]
  71. Rényi, A. On measures of entropy and information. In Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, Berkeley, CA, USA, 20 June–30 July 1960; University of California Press: Berkeley, CA, USA; Volume 1, pp. 547–561. [Google Scholar]
  72. Khammash, G.S.; Agarwal, P.; Choi, J. Extended k-Gamma and k-Beta Functions of Matrix Arguments. Mathematics 2020, 8, 1715. Available online: https://www.mdpi.com/2227-7390/8/10/1715 (accessed on 16 March 2023). [CrossRef]
  73. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2022; Available online: https://cir.nii.ac.jp/crid/1574231874043578752 (accessed on 5 April 2023).
  74. Kumar, C.S.; Manju, L. Gamma Generalized Logistic Distribution: Properties and Applications. J. Stat. Theory Appl. 2022, 21, 155–174. [Google Scholar] [CrossRef]
  75. Ahmad, Z.; Almaspoor, Z.; Khan, F.; Alhazmi, S.E.; El-Morshedy, M.; Ababneh, O.; Al-Omari, A.I. On fitting and forecasting the log-returns of cryptocurrency exchange rates using a new logistic model and machine learning algorithms. AIMS Math. 2022, 7, 18031–18049. [Google Scholar] [CrossRef]
  76. Nassar, M.M.; Radwan, S.S.; Elmasry, A.S. the Exponential Modified Weibull Logistic Distribution (EMWL). EPH-Int. J. Math. Stat. 2018, 4, 22–38. [Google Scholar] [CrossRef]
  77. MURAT, U.; Gamze, Ö. Exponentiated Weibull-logistic distribution. Bilge Int. J. Sci. Technol. Res. 2020, 4, 55–62. [Google Scholar]
  78. Nassar, M.; Radwan, S.; Elmasry, A. Transmuted Weibull logistic distribution. Int. J. Innov. Res. Dev. 2017, 6, 122–131. [Google Scholar] [CrossRef] [Green Version]
  79. Abdullahi, U.A.; Suleiman, A.A.; Ishaq, A.I.; Usman, A.; Suleiman, A. The Maxwell–Exponential Distribution: Theory and Application to Lifetime Data. J. Stat. Model. Anal. (JOSMA) 2021, 3, 2. [Google Scholar] [CrossRef]
  80. Singh, V.V.; Suleman, A.A.; Ibrahim, A.; Abdullahi, U.A.; Suleiman, S.A. Assessment of probability distributions of groundwater quality data in Gwale area, north-western Nigeria. Ann. Optim. Theory Pract. 2020, 3, 37–46. [Google Scholar]
  81. Eferhonore, E.-E.; Thomas, J.; Zelibe, S.C. Theoretical analysis of the Weibull alpha power inverted exponential distribution: Properties and applications. Gazi Univ. J. Sci. 2020, 33, 265–277. [Google Scholar]
  82. Ceren, Ü.; Cakmakyapan, S.; Gamze, Ö. Alpha power inverted exponential distribution: Properties and application. Gazi Univ. J. Sci. 2018, 31, 954–965. [Google Scholar]
  83. Merovci, F.; Khaleel, M.A.; Ibrahim, N.A.; Shitan, M. The beta Burr type X distribution properties with application. SpringerPlus 2016, 5, 697. [Google Scholar] [CrossRef] [Green Version]
  84. Maxwell, O.; Oyamakin, S.O.; Th, E.J. The Gompertz Length Biased Exponential Distribution and its application to Uncensored Data. Curr. Trends Biostat. Biom. 2018, 1, 52–57. [Google Scholar] [CrossRef]
  85. Suleiman, A.A.; Abdullahi, U.A.; Suleiman, A.; Yunus, R.B.; Suleiman, S.A. Assessment of Groundwater Quality Using Multivariate Statistical Techniques. In Intelligent Systems Modeling and Simulation II: Machine Learning, Neural Networks, Efficient Numerical Algorithm and Statistical Methods; Springer: Berlin/Heidelberg, Germany, 2022; pp. 567–579. [Google Scholar]
Figure 1. Plots of the pdf of the OBP-logistic distribution for different values of parameters. (a)–(c) show that the pdf of the OBP-logistic distribution can be symmetric, right-skewed or left-skewed density function.
Figure 1. Plots of the pdf of the OBP-logistic distribution for different values of parameters. (a)–(c) show that the pdf of the OBP-logistic distribution can be symmetric, right-skewed or left-skewed density function.
Sustainability 15 10239 g001
Figure 2. Plots of the hazard rate of the OBP-logistic distribution. (a,b) show that the hazard rate of the OBP-logistic distribution can be monotonically decreasing or monotonically increasing hazard function.
Figure 2. Plots of the hazard rate of the OBP-logistic distribution. (a,b) show that the hazard rate of the OBP-logistic distribution can be monotonically decreasing or monotonically increasing hazard function.
Sustainability 15 10239 g002
Figure 3. The density and box plots for the GF data. (a) show negative skewness of extreme values for the GF data (b) show statistical behavior of extreme values for the GF data.
Figure 3. The density and box plots for the GF data. (a) show negative skewness of extreme values for the GF data (b) show statistical behavior of extreme values for the GF data.
Sustainability 15 10239 g003
Figure 4. The density and box plots for the CF data. (a) show positive skewness of extreme values for the CF data (b) show statistical behavior of extreme values for the CF data.
Figure 4. The density and box plots for the CF data. (a) show positive skewness of extreme values for the CF data (b) show statistical behavior of extreme values for the CF data.
Sustainability 15 10239 g004
Figure 5. The density and box plots for the MC data. (a) show positive skewness for the MC data (b) show negative kurtosis for the MC data.
Figure 5. The density and box plots for the MC data. (a) show positive skewness for the MC data (b) show negative kurtosis for the MC data.
Sustainability 15 10239 g005
Figure 6. Plots of empirical and fitted densities; and empirical cdfs and fitted cdfs for GF data.
Figure 6. Plots of empirical and fitted densities; and empirical cdfs and fitted cdfs for GF data.
Sustainability 15 10239 g006
Figure 7. Plots of empirical and fitted densities; and empirical cdfs and fitted cdfs for CF data.
Figure 7. Plots of empirical and fitted densities; and empirical cdfs and fitted cdfs for CF data.
Sustainability 15 10239 g007
Figure 8. Plots of empirical and fitted densities; and empirical cdfs and fitted cdfs for MC data.
Figure 8. Plots of empirical and fitted densities; and empirical cdfs and fitted cdfs for MC data.
Sustainability 15 10239 g008
Table 1. Simulation results (MLE, bias, and MSE) for OBP-logistic distribution for several values.
Table 1. Simulation results (MLE, bias, and MSE) for OBP-logistic distribution for several values.
Set   1 :   a   =   0.5 ,   b   =   1.5 ,   μ   =   0.75 ,   s = 0.5 Set 2:  a   =   0.35 ,   b   =   1.7 ,   μ   =   1.2 ,   s   =   0.2
ParameternMeanBiasMSEMeanBiasMSE
a ^ 501.0216390.4216340.3234640.3768960.0699390.051017
1500.7832060.2832050.1591130.4153240.1542170.051843
3000.7056640.2056660.0884890.3878390.0978450.026851
7500.6322030.1322080.0535100.3775620.0708520.018937
10500.5798780.0798710.0248550.3698320.0591260.013711
15500.5345310.0645370.0167110.3605320.0472110.008374
20500.5112790.0612790.0153900.3568910.0368240.004921
b ^ 503.0711202.0511266.5010422.7579211.0579348.337832
1502.3359251.3159274.2818311.9030670.2039531.077834
3001.9168250.8968232.9273131.8153830.1158480.662473
7501.8534720.5334761.6300271.7259020.0259540.398832
10501.8011390.2651130.7272891.7249420.2907100.024529
15501.7957700.1787760.4103611.7172910.0049750.193952
20501.6287140.1567180.3636021.7047010.0025620.009541
μ ^ 500.526858−0.2731060.1044491.8754020.7808643.704327
1500.620049−0.1799510.0602711.4601740.2080160.877854
3000.662551−0.1374460.0394441.3864210.0727560.392853
7500.709550−0.0904410.0237091.2834190.0496430.270834
10500.715512−0.0744890.0170911.2472420.0159530.174934
15500.739470−0.0572080.0105841.2276410.0102610.118342
20500.757674−0.0423250.0071791.2078450.0078340.111962
s ^ 500.6574500.0896430.0105470.2346280.0129990.009617
1500.5797620.0198840.0059130.2389170.0389210.005998
3000.5460310.0095460.0017500.2336100.0236100.004892
7500.5365450.0065750.0011470.2298280.0197210.003671
10500.5245670.0045670.0007560.2226110.0169630.002930
15500.5345410.0034540.0004710.2189360.0129530.002538
20500.5465220.0026220.0002420.2040970.0096180.001273
Table 2. Glass fiber data.
Table 2. Glass fiber data.
0.550.740.770.810.841.240.931.041.111.131.30
1.251.271.281.291.481.361.391.421.481.511.49
1.491.501.501.551.521.531.541.551.611.581.59
1.601.611.631.611.611.621.621.671.641.661.66
1.661.701.681.691.701.781.731.761.761.771.89
1.811.821.841.842.002.012.24
Table 3. Carbon fiber data.
Table 3. Carbon fiber data.
3.702.742.732.503.603.113.272.871.473.11
2.413.193.221.693.283.091.873.154.903.75
2.952.973.392.962.532.672.933.223.392.81
3.332.553.313.312.852.563.563.152.352.55
2.382.812.772.172.831.921.413.682.971.36
2.764.913.681.841.593.191.570.815.561.73
2.001.221.121.712.171.175.082.481.183.51
1.691.254.381.840.393.682.480.851.612.79
2.031.801.571.082.031.612.121.892.882.82
2.052.434.202.590.981.592.174.704.423.65
Table 4. Magnesium concentration data.
Table 4. Magnesium concentration data.
0.740.150.370.070.120.030.290.110.110.37
0.120.090.610.130.150.190.110.150.100.60
0.090.710.120.400.550.110.140.130.460.22
Table 5. Statistical summary for the GF, CF, and MC data sets.
Table 5. Statistical summary for the GF, CF, and MC data sets.
DataMin.Q1MedianMeanQ3Max.VarianceSkewnessKurtosis
10.5501.3751.5901.5071.6852.2400.105−0.8790.800
20.3901.8402.7002.6213.2205.5601.0280.3630.043
30.0300.110 0.1450.2510.3700.7400.0431.077−0.269
Table 6. MLEs with corresponding standard errors (in parentheses) of competitive models for GF data.
Table 6. MLEs with corresponding standard errors (in parentheses) of competitive models for GF data.
Model a ^ b ^ μ ^ s ^ λ ^ t ^
OBP-logistic0.6345
(0.1279)
0.7346
(0.1671)
1.5415
(0.0368)
0.1708
(0.0184)
__
GGL0.3811
(0.0325)
0.2578
(0.0229)
_0.2854
(0.0425)
0.1764
(0.0124)
_
NMEL_7.9262
(0.8735)
1.5262
(0.0408)
0.5286
(0.0437)
0.8543
(0.0267)
_
GL13.1164
(2.3079)
18.4734
(3.3134)
_4.8783
(0.8954)
2.6508
(0.7354)
_
EMWL5.7806
(0.5761)
1.62813
(0.0371)
_1.2532
(0.0192)
0.3518
(0.0113)
_
EWL0.2791
(0.0274)
0.3215
(0.0286)
_0.4176
(0.0210)
1.5068
(0.0405)
_
TWL17.4410
(3.0783)
__8.3092
(1.7391)
11.5746
(2.0725)
2.6301
(0.6390)
Table 7. MLEs with corresponding standard errors (in parentheses) of competitive models for CF data.
Table 7. MLEs with corresponding standard errors (in parentheses) of competitive models for CF data.
Model a ^ b ^ μ ^ s ^ λ ^ t ^
OBP-logistic0.5876
(0.1134)
0.6753
(0.3452)
2.5975
(0.1001)
0.5732
(0.0475)
__
GGL0.8774
(0.0444)
0.4439
(0.0314)
_0.7354
(0.0649)
0.2763
(0.0873)
_
NMEL_4.1184
(0.3441)
2.4985
(0.1053)
1.8534
(0.0342)
0.5285
(0.0263)
_
GL4.4477
(0.6068)
9.5189
(1.3750)
_1.4567
(0.0326)
3.0653
(0.4375)
_
EMWL2.7929
(0.2141)
2.9438
(0.1110)
_0.7393
(0.0741)
0.2481
(0.0173)
_
EWL1.4502
(0.0807)
2.6214
(0.1008)
_1.3092
(0.0981)
1.0088
(0.0713)
_
TWL5.9529
(0.8194)
__3.7407
(0.3681)
2.2711
(0.3264)
0.7622
(0.0137)
Table 8. MLEs with corresponding standard errors (in parentheses) of competitive models for MC data.
Table 8. MLEs with corresponding standard errors (in parentheses) of competitive models for MC data.
Model a ^ b ^ μ ^ s ^ λ ^ t ^
OBP-logistic1.3290
(0.1827)
0.2756
(0.0401)
1.7834
(0.0854)
0.1964
(0.0543)
__
GGL0.2162
(0.0360)
0.1129
(0.017)
_1.9743
(0.6342)
0.3714
(0.0302)
_
NMEL3.5783
(1.0428)
1.7937
(0.4270)
7.1363
(1.957)
1.6453
(0.4943)
__
GL0.2513
(0.037)
0.2040
(0.0263)
_1.5462
(0.3648)
0.6328
(0.0843)
_
EMWL2.2019
(0.3317)
0.1765
(0.0258)
_1.7845
(0.1534)
0.5281
(0.0848)
_
EWL1.6847
(0.1418)
0.7768
(0.1002)
_2.4271
(0.5832)
1.3977
(0.1143)
_
TWL1.3290
(0.1827)
__0.2756
(0.0402)
3.8262
(1.9436)
1.6749
(0.5483)
Table 9. Statistical metrics for GF data.
Table 9. Statistical metrics for GF data.
Model l ^ AICBICKSCMADp-Value (KS)
OBP-logistic15.021234.041938.32810.125290.172471.214600.83122
GGL28.005560.009864.29610.231270.691823.773620.29976
NMEL23.789351.579956.86620.223650.505932.375840.36177
GL33.127370.254674.54090.248350.861354.638340.20137
EMWL17.206739.413644.69990.202210.275041.280610.71306
EWL16.911836.823640.10990.131270.245381.249880.76951
TWL22.951549.903053.18930.216360.365803.087000.53102
Table 10. Statistical metrics for CF data.
Table 10. Statistical metrics for CF data.
Model l ^ AICBICKSCMADp-Value (KS)
OBP-logistic141.310287.621291.8310.057530.061650.427920.90347
GGL148.419300.839306.0500.117730.275281.465020.60725
NMEL143.779293.559301.7690.090250.157500.737710.68425
GL158.737321.474326.6840.146730.519332.847140.54792
EMWL141.529288.058292.2680.060490.063310.437690.73061
EWL143.270290.540294.7510.063060.068060.468050.72370
TWL146.233296.467302.6770.093390.160021.075840.62563
Table 11. Statistical metrics for MC data.
Table 11. Statistical metrics for MC data.
Model l ^ AICBICKSCMADp-Value (KS)
OBP-logistic14.1082−26.2164−23.41400.158340.131640.838620.88436
GGL18.7392−6.38254−3.58010.242480.383902.395560.25342
NMEL14.8236−24.1073−21.20490.246780.296011.525160.46418
GL22.7643−6.21814−3.41570.290290.492372.655610.20345
EMWL14.8047−25.6695−22.86710.188780.189681.104710.75396
EWL15.5494−24.0988−22.99640.207690.203881.093610.69807
TWL16.2846−22.5692−19.76680.240510.287811.505970.37649
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Suleiman, A.A.; Daud, H.; Singh, N.S.S.; Othman, M.; Ishaq, A.I.; Sokkalingam, R. A Novel Odd Beta Prime-Logistic Distribution: Desirable Mathematical Properties and Applications to Engineering and Environmental Data. Sustainability 2023, 15, 10239. https://doi.org/10.3390/su151310239

AMA Style

Suleiman AA, Daud H, Singh NSS, Othman M, Ishaq AI, Sokkalingam R. A Novel Odd Beta Prime-Logistic Distribution: Desirable Mathematical Properties and Applications to Engineering and Environmental Data. Sustainability. 2023; 15(13):10239. https://doi.org/10.3390/su151310239

Chicago/Turabian Style

Suleiman, Ahmad Abubakar, Hanita Daud, Narinderjit Singh Sawaran Singh, Mahmod Othman, Aliyu Ismail Ishaq, and Rajalingam Sokkalingam. 2023. "A Novel Odd Beta Prime-Logistic Distribution: Desirable Mathematical Properties and Applications to Engineering and Environmental Data" Sustainability 15, no. 13: 10239. https://doi.org/10.3390/su151310239

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop