Advanced Approach for Estimating Failure Rate Using Saddlepoint Approximation

: In the present study paper, a failure (hazard) rate function approximates the probability distribution for the linear combination of a random variable considered a highly complex model. The saddlepoint approximation approach is used to approximate the probability mass function and the cumulative distribution function to derive the approximation of the failure (hazard) rate with a high level of accuracy. The superior performance of this method is shown by numerical simulations and comparison with the performance of other approximation methods.


Introduction
Advanced statistical methods require models that calculate probabilities derived from intractable distributions, which causes complicated calculations. The saddlepoint approximation technique can solve some of these problems. This method of approximation was proposed by Daniels [1], and it is a particular application of the mathematical saddlepoint methods applied to statistical science. This technique provides an approximation formula for the cumulative distribution function (CDF) or the probability mass function (PMF) from its associated moment generating function (MGF). The saddlepoint approach can answer many scientific applied problems, such as bivariate symmetry tests for complete and competing risks data (Bell and Haller [2]). Kepner and Randies [3] introduced a comparison of tests for bivariate symmetry versus location and/or scale alternatives. Rao and Raghunath [4] proposed a simple nonparametric test for bivariate symmetry about a line. This method of approximation also provides us with a very accurate solution for estimating the hazard rate function (see Dissanayake and Trindade [5]), and Al Mutairi [6] discussed improved methods for calculating the hazard rate function for the stopped sum model using saddlepoint approximation. The distribution of a linear combination of random variables appears in many applied problems and has attracted the attention of many scholars; specifically, the linear combination of two independent random variables has been investigated by many researchers (see, e.g., Amari and Misra [7], Di Salvo [8], Almasi et al. [9], Al Mutairi and Low [10], and Al Mutairi [11] among others). Currently, the demand for these models has increased, due to their importance and wide range of applications.
This study improves the hazard rate function estimation methods for a linear combination of independent Poisson random variables by using the saddlepoint approximation.
It is well known that the Poisson distribution is stable; that is, the distribution of the sum of two independent Poisson random variables has the Poisson distribution. This means that if two random variables X 1 ∼ Poisson(λ 1 ) and X 2 ∼ Poisson(λ 2 ) are independent, the distribution of their sum X 1 + X 2 ∼ Poisson(λ 1 + λ 2 ). At the same time, rescaling a Poisson random variable is not Poisson anymore. Random variable Y = 0.5X, where X ∼ Poisson(λ), is not integer-valued and, hence, not Poisson. Thus, the exact distribution of a linear combination of independent Poisson random variables is not always known and very difficult to obtain. Therefore, approximation methods are essential and can provide a piece of helpful information about the distribution of these complicated models. The saddlepoint approximation method can replace the exact answer with high accuracy, and this adds to its many advantages in terms of saving time and effort.

Saddlepoint Approximation for the Linear Combination of Poisson Random Variables
The linear combination of Poisson random variables is given by linear combinations of convoluted random variables, which occur in a wide range of fields. In most cases, the exact distribution of these linear combinations is extremely difficult to determine, and the normal approximation usually performs very badly for these complicated distributions. A better method of approximating linear combination distributions involves the additional use of saddlepoint approximation. Saddlepoint approximation is able to provide accurate expressions for distribution functions that are unknown in their closed forms. This method not only yields an accurate approximation near the centre of the distribution but also controls the relative error in the far tail of the distribution.
The linear combination of Poisson random variables is given by where c i , i = 1, 2, . . . , n are real constants and X i ∼ Poisson(λ i ), i = 1, 2, . . . , n are independent Poisson random variables. Note that we do not require that X i , i = 1, 2, . . . , n be identically distributed.
Recall that the cumulant generating function (CGF) of a random variable X is defined as a logarithm of its moment generating function K X (s) = ln Ee sX .
The saddlepoint technique requires calculating the cumulant generating function for the linear combination L c of Poisson random variables as Note that in Butler [12] Sections 1.1.2 and 1.1.5 (formulae (1.13) and (1.14)), the saddlepoint approximation is defined only for continuous and integer-valued distributionŝ whereŝ is the unique solution for the saddlepoint equation The saddlepoint approximation formula for the discrete cumulative distribution function requires some modification of the continuous formula in order to derive the formula with a very high level of accuracy. Daniels [13] derived three such continuity corrections. Let X be a discrete random variable with CDF F (k) supporting the integers with mean µ. The right or left tail probability is used to solve many applied problems and the failure rate, as shown in this work.
All three saddlepoint continuity corrections are very accurate, and which one is more accurate than the others depends upon the specific applications or problems (Butler [12]).
The distribution of the linear combination of Poisson random variables L c is neither continuous nor integer-valued, and, hence, we need a continuity correction for the saddlepoint approximation. For a more detailed summary of these arguments, see Daniels [1] and Daniels [13] Section 2 for more details. The first continuity correction for the right tail is given asP where µ is the mean of the random variable X, Φ and φ are the standard normal distribution and density functions, respectively, and The second continuity correction is given aŝ The third continuity correction suggested in Butler [12] Section 1 (pp. 17-18) is the same as formulaP r2 but with u 2 replaced by u 3 = s K ( s).

An Example of the Saddlepoint Approximation for the Distribution Function
Suppose the linear combination of two independent Poisson random variables

The saddlepoint equation is
To show the performance of this method, we consider the sum of two independent Poisson random variables with means λ 1 and λ 2 , which is also a Poisson random variable with mean λ 1 + λ 2 . Let and, hence, the saddlepoint isŝ = −1.098612289 and the PMSf (x) from Equation (1) iŝ The exact f (x) is given by calculating Poisson(3) when X = 1.
Next, we find the first continuity correction given in Equation (2) because The exact right tail is with an absolute error of 0.007. We note that saddlepoint approximations give accurate results in calculating the mass function and distribution function and can replace the exact answers. We can compare this with other approximation methods, for example, the normal approximation for the right tail when X = 1 with an absolute error of 0.07.
To prove the performance of the saddlepoint approach, the present study compared it with another method of approximation, namely, the Haldane approximation. Pentikäinen [14] proved the accuracy of that method under certain circumstances. Suppose X is a random variable with mean µ x , variance σ 2 x , and coefficient of skewness γ x . The Haldane type A approximation for the right tail is given bŷ (see Borowiak [15]). We getĤ = 0.8849 with an absolute error of 0.06. From the calculations above, we conclude that the saddlepoint approximation for the distribution function is more accurate than the normal and Haldane approximations and, hence, the saddlepoint hazard (rate) function is more accurate than those of the associated normal and Haldane approximations.
The aim of this study is to compare the saddlepoint and normal approximations, which are common approximation methods.

The Hazard Rate Function Based on the Saddlepoint Approximation
The hazard rate function for a discrete distribution taking values in Z + = {0, 1, 2, 3, . . .} is defined as (see, e.g., Daly [16]). Consider the same example of the sum of two independent Poisson random variables with means 1 and 2 as in the previous section. Equations (3) and (5) lead to the saddlepoint approximation of the hazard rate function with the first continuity correctionĥ s1 (1) = 0.16197288996 0.9428487449 = 0.1717909589, while the exact hazard rate based on Equations (4) and (6) is Here, the absolute error for the saddlepoint and exact values is 0.01. For the normal approximation, based on Equation (7), the hazard rate approximation is with an absolute error of 0.06. The discussion above pertains to the special case of the linear combination of two independent, not identically Poisson random variables with constant c 1 = c 2 = 1. We consider this case, because we already know the exact solution for this sum, which gives us the possibility of comparing the exact solution with the methods of approximation. Now we consider other positive real values for c 1 and c 2 , which gives the linear combination of independent Poisson random variables. It has a complicated distribution, and the exact mass and density functions, in most cases, are unknown or difficult to obtain. Therefore, the methods of approximation are very important. Tables 1-3 show the exact values for the three saddlepoint continuity corrections and the normal approximations, respectively, of the hazard rate function for the linear combination L c 1 = 3X 1 + 5X 2 , where X 1 ∼ Poisson(2) and X 2 ∼ Poisson(3), for different values x. The exact value is found by generating 10 6 random samples from the linear combination using the R programme.  Table 2. Second continuity saddlepoint correctionĥ s2 (x) with its corresponding normal approximationĥ N2 (x) for L c 1 .
x From the above calculations in Tables 1-3, the performance of the saddlepoint approach is evident because of the relative error between the saddlepoint and the exact values. Thus, the relative error of the saddlepoint is smaller than the relative error of the normal approximation. Table 3. Third continuity saddlepoint correctionĥ s3 (x) with its corresponding normal approximationĥ N3 (x) for L c 1 . Next, to show the effectiveness of this method of approximation, we consider a four-component linear combination of independent Poisson random variables Applying the same technique as for two independent Poisson random variables, we obtain that the CGF is given by and the saddlepoint equation is obtained as For example, suppose that (5), and X 4 ∼ Poisson (3) with constants c 1 = c 2 = c 3 = c 4 = 1. Consider the value x = 1. This also gives us a good opportunity to investigate the accuracy of this method, because the exact solution is already known as Poisson (11). From the saddlepoint equation K L c 2 (ŝ) = x, we obtain that eŝ = 1 11 and the saddlepoint isŝ = −2.3978952.
Based on Equation (1), with its corresponding exact answer f (1) = 11e −11 = 0.0001837187. This leads to a high level of accuracy for calculating the mass function, with an absolute error of 0.00001556. The exact right tail probability when x = 1 is obtained as Calculating the first continuity correction by Equation (2), we obtain This provides the first continuity correction of the saddlepoint approximation for the right tail with x = 1 with an absolute error of 0.00002.
The corresponding normal approximation for x = 1 is with an absolute error of 0.184. This means that the saddlepoint approximation is closer to the exact value than the normal approximation. From the calculations above, we can derive the saddlepoint approximation for the hazard rate for the first continuity correction when x = 1 aŝ with an absolute error, for the exact hazard rate h e (1) = 0.0001837, of 0.00019. Tables 4-6 present the three saddlepoint approximations with their associated normal approximation, respectively. From Tables 4-6, we see that the saddlepoint method of approximation does well, with a higher level of accuracy than the normal approximation. Thus, the relative error for the saddlepoint is smaller than the relative error for the normal approximation.

Conclusions
The saddlepoint approximation technique is an efficient tool that can derive an accurate approximation of the hazard rate function for complicated models, such as a linear combination of a random variable, with the relative error still bounded and very small. All three saddlepoint continuity corrections work very well and can replace the exact solution.