1. Introduction
Reliability measures of the type
, often represented as
R and referred to as stress–strength reliability, are important for evaluating the performance of different systems and processes. This measure indicates the probability that a random variable
X, which can represent a general performance metric or quality indicator, is less than another random variable
Y, which could signify a threshold or standard to be met. In this context,
X and
Y are not limited to engineering concepts like stress and strength but are applicable in any scenario where two quantities are compared. A higher
R value signifies a more reliable system, indicating a greater likelihood that the performance metric
X will be below the threshold
Y. Calculating
requires an understanding of the joint distribution of
X and
Y, which can be determined through various methods, including simulation, analytical solutions, or the use of copulas to model dependencies between variables. We refer the reader to [
1] for further details on this subject.
Let
Y and
X be independent continuous random variables from probability density function (PDF)
and cumulative distribution function (CDF)
, respectively. We can write the stress–strength reliability measure as:
Thus,
R is a measure of component reliability, and it may be interpreted as the probability of a system failure when the applied stress
Y is greater than its strength
X. It is often assumed that
X and
Y are independent random variables and that they belong to the same family of probability distributions. Rathie et al. [
2] present a recent survey on the subject.
Studying reliability measures such as
for asymmetric marginal distributions is crucial for understanding a variety of real-world scenarios that require data-driven solutions. In the case of finance applications, where risk assessment is critical, asymmetric distributions play a central role, especially heavy tailed ones [
3]. For example, in stock market analysis, knowing the likelihood that a particular stock will do better than another is crucial to making wise investment choices. Investors can efficiently manage risk and optimize their portfolios with the aid of reliability metrics, which assist in quantifying these probabilities [
4].
Furthermore, asymmetry in distributions is common in a wide range of social and ecological phenomena, including the spread of illnesses and the distribution of money [
5]. Asymmetric distributions can be used to describe the different scenarios of disease transmission within populations in epidemiology [
6]. Researchers can better understand the likelihood of particular outcomes and aid in the creation of tailored intervention methods by examining reliability measurements in such circumstances. In short, studying reliability measures of the type
for asymmetric distributions makes modeling and prediction more precise, which in turn helps one to make more informed decisions across a variety of domains.
In particular, reliability measures of the stress–strength type for classic extreme value distributions were studied by [
7], who derived expressions for
R in terms of special functions for
l-max stable laws (Fréchet, Weibull, and Gumbel). Several authors have worked on the estimation and application of stress–strength for the
l-max stable distributions (e.g., [
8,
9,
10,
11]). Some generalizations of
l-max stable distributions have been proposed to either allow better data fitting or provide more convenient mathematical properties. In the work of Aryal and Tsokos [
12], for example, the generalized extreme value distribution (GEV) was extended to a model named transmuted GEV (TGEV). Bivariate data were also considered like bimodal Weibull [
13], bimodal Gumbel [
14], and bimodal GEV (BGEV) [
15] distributions.
The
l-max stable distributions are derived as a limiting distribution of linearly normalized partial maxima. Another approach to generalize such distributions is by non-linearly normalizing partial maxima of independent identically distributed random variables (iid RVs). This way, for a given CDF
, suppose there exists sequences of real numbers
and
with
such that
weakly, where
is a non-degenerate CDF. The three-parameter
p-max stable laws can be obtained from CDF (
2) by the definition of the same
p-type. That means that we assume there exist positive constants
such that
where
for
. It was shown in [
16] that
H is of the same
p-type as one of the following distributions: log-Fréchet, log-Weibull, inverse log-Fréchet, inverse log-Weibull, standard Fréchet, and standard Weibull. Such limiting distributions are heavy tailed and asymmetric. Therefore, the convergence in (
2) is usually studied by assessing the approximation on the tails, as discussed, for example, by Feng and Chen [
17] and references therein.
To the best of our knowledge, the literature lacks previous in-depth studies on reliability inference for p-max stable distributions, and this work stands as a contribution by providing estimation methods for R based on stochastic optimization for this class of distributions. Thus, in this paper, we consider the problem of estimating the stress–strength parameter R when X and Y are independent three-parameter p-max stable random variables with the same CDF but different parameters. In order to validate our results, a robust framework was proposed and applied to model real and synthetic datasets, rigorously indicating the capacities of the p-max models and the usability of the analytical formulas hereby derived to calculate R.
Our main contributions are as follows:
to analytically derive R in terms of special functions, for each three-parameter p-max stable law with fewer parameter restrictions compared to previous results in the literature;
to propose an estimator for R;
to apply the results to the modeling of real datasets. In particular, two real scenarios are investigated, showing the versatility of stress–strength reliability (SSR) modeling approaches using p-max models. First, soccer pass completion proportions of two different championships (UEFA Champions League and 2022 FIFA World Cup) were compared, allowing scouting professionals to use the SSR results as a proxy for technical level comparison of teams that competed at those tournaments. Then, a second application involved the modeling and comparison of the strength of carbon fibers of different lengths when subjected to tension efforts. In both modeling scenarios, the best fitting p-max stable distribution (both qualitatively (by graphical methods) and quantitatively (by information criteria)) was taken as a starting point.
This paper is organized as follows:
Section 2 introduces preliminaries, especially the definition of the
-function, the
-function, and the three-parameter
p-max stable laws.
Section 3, on the other hand, deals with the derivation of
R when
X and
Y are independent
p-max stable random variables. The maximum likelihood estimation for
R is presented in
Section 4. In
Section 5, we deal with Monte Carlo simulations as well as with the modeling of two real situations involving football datasets and different-length carbon fibers. The
Section 6 deals with conclusions.
3. Reliability for Three-Parameter -Max Stable Laws
In this section, the reliability of two independent three-parameter p-max stable random variables is derived in terms of the -function. In addition, with suitable parameter restrictions, the -function and a simpler form in terms of standard functions are obtained. Firstly, we consider the case of two independents .
Theorem 1. Let Y and X be independent random variables, respectively, with CDF and , , and . Then,provided that . In particular, if , thenWhen , R can be written explicitly as Proof. Set
(
). Then,
where
. Substituting
and taking
, we can rewrite (
16) as
Hence, (
13) follows from (
17) and (
4). In addition, applying (
3) with
, we obtain (
14). In the case where
, we have the explicit form (
15). □
Secondly, we consider the case of two independents .
Theorem 2. Let Y and X be independent random variables, respectively, with CDF and , , and . Then,provided that . In particular, if , thenWhen , R can be written explicitly as Proof. Set
(
). Then,
where
. Substituting
and taking
, we can rewrite (
21) as
Hence, (
18) follows from (
22) and (
4). In addition, applying (
3) with
, we obtain (
19). In the case where
, we have the explicit form (
20). □
Thirdly, we consider the case of two independents . The proofs of Theorems 3 and 4 are similar to those of Theorems 1 and 2, respectively. The details are omitted.
Theorem 3. Let Y and X be independent random variables, respectively, with CDF and , , . Then,provided that . In particular, if , thenWhen , R can be written explicitly as Now, we consider the case of two independents .
Theorem 4. Let Y and X be independent random variables, respectively, with CDF and , , and . Then,provided that . In particular, if , thenWhen , R can be written explicitly as Lastly, we consider the cases of two independents .
Theorem 5. Let Y and X be independent random variables, respectively, with CDF , , and . Then,
- (a)
In particular, if , we have - (b)
In particular, if , we have
Proof. We prove case
, and case
follows analogously. We have
Substituting
in (
28), we obtain
Therefore, (
25) follows from (
29) and (
4) (alternatively, (
26) follows from (
29) and (
3)). In particular, taking
, (
27) follows from (
29). □
By combining all the Theorems from 1 to 4, it is possible to state the following Corollary:
Corollary 1. Let Y and X be independent random variables, respectively, with CDF and , , , and . Then,provided that . In particular, if , thenWhen , can be written explicitly as We finish this section by noting that Theorems 1–5 can be generalized to random samples of a given
F distribution that is in the domain of attraction of one of the
p-max stable laws (see [
20] for a complete characterization of the domains of attraction of the
p-max stable laws). We describe below these generalizations.
Let
be a sample from the CDF
F and assume that there exist sequences of real numbers
and
with
such that (
2) holds for some
. Set
and
. Equations (
1) and (
2) imply
where
. Using the corresponding Theorems (1–5), (
33) can be obtained in terms of the1
-function.
4. Estimation
This section deals with parameter estimation for the p-max stable laws via a random optimization method and bootstrap confidence intervals.
Several authors (e.g., [
8,
9,
11]) have estimated
R by maximum likelihood. However, they relied on strong parameter restrictions to obtain an explicit form for
R. Thus, the estimation of the parameters must be done jointly in the two samples. In our case, such restrictions were not necessary since we worked with expressions of
R in terms of functions
and
, releasing any requirements about similar parameters between different samples.
To the best of our knowledge, there are few studies concerning parameter estimation, although the literature suggests several theoretical studies of
p-max stable distributions (e.g., [
21]). Here, we present a different approach for parameter estimation for the
p-max stable laws.
We initially consider the PDF
. For the other
p-max stable laws, similar expressions are obtained using the PDFs presented in
Section 2.2. Take
as a sample of
n observations. The likelihood function for the PDF
is given by the following:
Note that if and only if for all . Then, we are not able to obtain the MLE explicitly, so an additional condition is required in the likelihood maximization process.
Remark 1. The MLE of R is obtained using the invariance property of MLE. This is due to the Theorems 1–5 that describe R in terms of the function (which is an integral, hence a continuous and measurable function).
4.1. A Random Optimization Method for Approximating the MLE
Now, we describe the optimization methodology to be implemented for parameter estimation. Let be a likelihood function for which the maximum is assumed to be finite.
Algorithm 1 can find the point of maximum for which . Particularly, unlike conventional algorithms, random points in space are generated according to a generic distribution G (not necessarily uniform) on the parameter space . This allows us to introduce weights in some regions of the parameter space, as a kind of prior information.
Algorithm 1. Let be independent and identically distributed random vectors with common distribution G on . Let be defined by
Step 1. and.
Step .
Having defined , let be defined as It was proved by [
22] that for given
and
that is, the
-region of attraction of
has been attained with probability
, provided that the stop rule consists of terminating the algorithm for
k such that
where
m denotes the Lebesgue measure on
. This means that with high probability, the algorithm reaches the desired maximum.
4.2. Bootstrap
The bootstrap method used in the next section to obtain bootstrap confidence intervals of R is described below.
Algorithm 2 describes the approach used in the next section to obtain bootstrap confidence intervals of R.
Generate independent bootstrap samples and of sizes and , respectively.
Compute the parameter estimation based on and .
Obtain .
Repeat steps times.
The approximate confidence interval of R is given by , where and are the cumulative distribution function of .
Algorithm 2. Let and be samples of sizes and , respectively, and a positive integer M.
- Step 1
Generate independent bootstrap samples and .
- Step 2
Compute the parameter estimation based on and .
- Step 3
Obtain .
- Step 4
Repeat steps to M times.
- Step 5
The approximate confidence interval of R is given by , where and is the cumulative distribution function of .
6. Conclusions
Our study aimed to investigate the estimation of the for independent marginals X and Y following p-max stable distributions. In order to do so, we obtained exact expressions for R. By using the new formulas proposed, direct and exact reliability applications are made possible for an important class of asymmetric distributions.
We discuss the application of a novel class of special functions, the so-called extreme value -function, which allows us to write the expressions of R explicitly and with minimal restrictions. In particular, by imposing additional parameter restrictions, R can be calculated in terms of -functions as well as even more compact expressions.
To the best of our knowledge, there are no previous works in the literature aiming to provide expressions and frameworks to perform reliability statistical inference for p-max stable distributions, and this work stands as a contribution by providing estimation methods based on stochastic optimization.
A restraint of our estimation method is the fact that it relies on compact search spaces for fixed N. However, we tested the performance of the proposed estimator by a Monte Carlo simulation study. Even though the search range N exponentially governs the computational effort required, the reported results reveal the correctness of the methodological approach hereby proposed.
Two applications to real datasets were carried out to show the performance of the p-max stable laws in reliability scenarios. Future work may explore other extreme value distributions and their reliability calculations, such as bimodal Weibull, bimodal Gumbel, bimodal GEV, and the extreme-value Birnbaum-Saunders distribution.
Overall, it is possible to summarize the strengths of the present paper as follows:
General expressions were analytically derived for when X and Y follow three-parameter p-max stable laws with fewer parameter restrictions compared to previous results in the literature;
A stochastic optimization procedure was proposed to build an estimator for R based on the novel expressions derived;
The validity of the expressions and of the general methodological framework developed were demonstrated by Monte Carlo simulations;
The suitability of the p-max distributions to model real datasets was attested by study cases.
On the other hand, the main weaknesses of the present paper are as follows:
The stochastic optimization procedure relies on compact search spaces for fixed N, whose impact is exponential on the computational effort required;
The amount of data used to illustrate the methodology and equations developed in the paper is limited; thus, the superiority of the p-max distributions over other possible models needs to be assessed in a case-by-case fashion.