Abstract
The (general) hypoexponential distribution is the distribution of a sum of independent exponential random variables. We consider the particular case when the involved exponential variables have distinct rate parameters. We prove that the following converse result is true. If for some , are independent copies of a random variable X with unknown distribution F and a specific linear combination of ’s has hypoexponential distribution, then F is exponential. Thus, we obtain new characterizations of the exponential distribution. As corollaries of the main results, we extend some previous characterizations established recently by Arnold and Villaseñor (2013) for a particular convolution of two random variables.
MSC:
62G30; 62E10
1. Introduction and Main Results
Sums of exponentially distributed random variables play a central role in many stochastic models of real-world phenomena. Hypoexponential distribution is the convolution of k exponential distributions each with their own rate , the rate of the exponential distribution. As an example, consider the distribution of the time to absorption of a finite state Markov process. If we have a state process, where the first k states are transient and the state is an absorbing state, then the time from the start of the process until the absorbing state is reached is phase-type distributed. This becomes the hypoexponential if we start in state 1 and move skip-free from state i to with rate until state k transitions with rate to the absorbing state .
We write for , if has density
The distribution of the sum , where for are not all identical, is called (general) hypoexponential distribution (see [1,2]). It is absolutely continuous and we denote by its density. It is called the hypoexponetial distribution as it has a coefficient of variation less than one, compared to the hyper-exponential distribution which has coefficient of variation greater than one and the exponential distribution which has coefficient of variation of one. In this paper, we deal with a particular case of the hypoexponential distribution when all are distinct, i.e., when . In this case, it is known ([3], p. 311; [4], Chapter 1, Problem 12) that
Here the weight is defined as
Please note that , where are identified (see [5]) as the Lagrange basis polynomials associated with the points . The convolution density in (1) is the weighted average of the values of the densities of , where the weights sum to 1 (see [5]). Notice, however, since the weights can be both positive or negative, is not a “usual” mixture of densities. If we place ’s in increasing or decreasing order, then the corresponding coefficients ’s alternate in sign.
Consider the Laplace transforms , , . They are well-defined and will play a key role in the proofs of the main results.
To begin with, let us look at the case when all ’s are identically distributed, i.e., for , so we can use for the common Laplace transform. The sum has Erlang distribution whose Laplace transform , because of the independence, is expressed as follows:
If we go in the opposite direction, assuming that has Erlang distribution with Laplace transform , then we conclude that for each , which in turn implies that . By words, if are independent and identically distributed random variables and their sum has Erlang distribution, then the common distribution is exponential.
Does a similar characterization hold when the rate parametersare all different? The answer to this question is not obvious. It is our goal in this paper to show that the answer is positive.
Let be positive real numbers, such that . Without loss of generality suppose that . Assume that , for fixed , are independent and identically distributed as a random variable X with density f, , . Then (1) is equivalent to the following:
Here the coefficients/weights are given as follows:
We use now the common Laplace transform . Please note that since for , relation (2) implies that
The idea now is to start with an arbitrary non-negative random variable X with unknown density f and Laplace transform . If the Laplace transform of the linear combination satisfies (4), we will derive that . Thus, the common distribution of , is exponential. More precisely, the following characterization result holds.
Theorem 1.
Suppose that,, are independent copies of a non-negative random variable X with density f. Assume further that X satisfies Cramér’s condition: there is a numbersuch thatfor all. If relation (2) is satisfied for fixedand fixed positive mutually different numbers, thenfor some.
The studies of characterization properties of exponential distributions are abundant. Comprehensive surveys can be found in [6,7,8,9]. More recently, Arnold and Villaseñor [10] obtained a series of exponential characterizations involving sums of two random variables and conjectured possible extensions for sums of more than two variables (see also [11]). Corollary 1 below extends the characterizations in [10,11] to sums of n variables, for any fixed .
Consider the special case of (2) when for . Under this choice of ’s, the formula for the weight simplifies to (see [4], Chapter 1, Problem 13)
Therefore, Theorem 1 reduces to the following corollary.
Corollary 1.
Suppose that,, are independent copies of a non-negative random variable X with density f. Assume further that X satisfies Cramér’s condition: there is a numbersuch thatfor all. If for fixed,
thenfor some.
The exponential distribution has the striking property that if (unit exponential), then the density f equals the survival function (the tail of the cumulative distribution function) . Therefore, in case of unit exponential distribution, (2) can be written as follows:
We will show that (6) is a sufficient condition for to be unit exponential.
Theorem 2.
Suppose that,, are independent copies of a non-negative random variable X with distribution function F. Assume also that X satisfies Cramér’s condition: there is a numbersuch thatfor all. If relation (6) is satisfied for fixed, then.
Setting for , we obtain the following corollary of Theorem 2.
Corollary 2.
Suppose that,, are independent copies of a non-negative random variable X with distribution function F. Assume also that X satisfies Cramér’s condition: there is a numbersuch thatfor all. If for fixed,
thenfor some.
2. Auxiliaries
We will need the Leibniz rule for differentiating a product of functions. Denote by the kth derivative of with . Let us define a multi-index set as an n-tuple of non-negative integers, and denote . Leibniz considered the problem of determining the kth derivative of the product of n smooth functions and obtained the formula (e.g., [12])
Here the summation is taken over all multi-index sets with . Formula (8) can easily be proved by induction.
Lemma 1.
Assume thatis a functional series, such that for some, theorder derivativeexists for all. Then for arbitrary positive real constants, we have
Proof.
Formula (9) is proved by applying Leibniz rule (8) to . □
In addition to (9), we will need some properties of Lagrange basis polynomials collected below.
Lemma 2
(see [13]). Let be positive real numbers, such that for . Denote
Then, for , we have the following:
- (i)
- .
- (ii)
- .
- (iii)
- , where the equality holds if and only if.
Proof.
Claim (i) follows by integrating (1) over . Claim (ii) is proved in Corollary 1 of [13]. To prove claim (iii) we involve , the multi-index set as in (8). For , we have , where
According to Proposition 5 in [13] we obtain, for and , the following chain of relations:
Clearly, the equality in (10) holds if and only if . The proof is complete. □
The properties in Lemma 2 can be easily verified, as an illustration, for , , and . Indeed,
3. Proofs of the Characterization Theorems
In the proofs of both theorems we follow the four-step scheme.
- Consider for to be independent copies of a non-negative random variable X with density f. Suppose are positive real numbers.
- Assume the characterization propertywhere is given in (3).
- For the Laplace transform , , obtain the equation
- Using Leibniz rule for differentiating product of functions and properties of Lagrange basis polynomials, show that (11) has a unique solution given by for some and conclude that
Proof of Theorem 1.
Recall that (see (4))
Dividing both sides of this equation by , we obtain
where . Consider the series
which, as a consequence of Cramér’s condition for , is convergent in a proper neighborhood of . To prove the theorem, it is sufficient to show that
We will prove that (12) implies (14) by showing that the coefficients in (13) satisfy , , and for . Notice first that
Denote
By (12) we have and therefore and for all . Equating ’s to the corresponding coefficients of the series in the right-hand side of (12), we will obtain equations for . As a first step, note that
Next, we apply Leibniz rule for differentiation. To fix the notation, let us define a multi-index set , as a set of -tuples of non-negative integer numbers, with . Applying Lemma 1 for fixed and fixed , we obtain
Introduce the set and partition it into three disjoint subsets as follows:
where for
For example, if , , and , then , , and . Referring to (16) and (17), we have for
For the term in the middle, since , we have when and for any
where the summation in is over all k-tuples (with th component dropped) , such that and . Using that by Lemma 2(ii) with , we obtain for any
Here the summation in is over all k-tuples , such that and . For the first term in the last expression of (18), we have for any
Furthermore, since by Lemma 2(i) with , we have for any
Lemma 2(iii) with implies that and for any . It follows from (18)–(20) that
where and for .
Let . Since and the sets and are empty, we obtain , where . Hence, there are no restrictions on the coefficient , other than , since X has positive mean. Therefore, there is a number such that
Let . Since the set is empty, Equation (21) yields where recall that . Thus, . Next, applying (21) and taking into account that for , we will show by induction that for any . Assuming for , we will show that . Indeed, by (21) we have
because at least one index , satisfies and hence , by assumption. Therefore, and, since , we have , which completes the induction. Hence,
The Equations (15) and (22)–(23) imply (14), which completes the proof of the theorem. □
Proof of Theorem 2.
Taking into account (6), similarly to (4) and using integration-by-parts, we obtain
Using the fact that (see Lemma 2(ii)), this simplifies to
Dividing both sides of (24) by , for , we obtain
where, as before, . Consider the series , which is convergent by assumption. To prove the theorem, it is sufficient to show that , , or, equivalently, that the coefficients of the above series satisfy , , and for . Clearly, . Recall that
By (25) we have and therefore and for all . We will express in terms of ’s. Proceeding as in the proof of Theorem 1, applying Leibniz rule for differentiating a product of functions, and using the same notation, we obtain for that
As with (19), applying Lemma 2(ii), we obtain
where the summation in is over all k-tuples , such that and . Furthermore, since and by Lemma 2, we have for any
It follows from (26) and (27) that for ,
Let . Since and the set is empty, we obtain , where by Lemma 2(iii). Therefore, . Let . Since is empty, Equation (28) yields where by Lemma 2(iii). Thus, . Assuming for , we will show that . Indeed,
because at least one index , satisfies , in which case , by assumption. Therefore, and, since , we have , which completes the induction proof. Hence, for any . Since and for , we obtain , which clearly completes the proof of the theorem. □
4. Concluding Remarks
Arnold and Villaseñor [10] proved that if and are two independent and non-negative random variables with common density f and , then
if and only if for some . Motivated by this result, we extended it in two directions considering: (i) arbitrary number of independent identically distributed non-negative random variables and (ii) linear combination of independent variables with arbitrary positive and distinct coefficients . Namely, our main result is that
where , if and only if for some .
In this paper, we dealt with the situation where the rate parameters are all distinct from each other. The other extreme case of equal ’s is trivial. The obtained characterization seems of interest on its own, but it can also serve as a basis for further investigations of intermediate cases of mixed type with some ties and at least two distinct parameters (see [2]). Of certain interest is also the case where not all weights ’s are positive (see [1]).
Funding
This research was funded in part by the National Scientific Foundation of Bulgaria at the Ministry of Education and Science, grant No. KP-6-H22/3.
Acknowledgments
I thank Jordan Stoyanov for his mentorship, useful suggestions and critical comments on previous versions of the paper. The author acknowledges the valuable suggestions from the anonymous reviewers.
Conflicts of Interest
The author declares no conflict of interest.
References
- Li, K.-H.; Li, C.T. Linear combination of independent exponential random variables. Methodol. Comput. Appl. Probab. 2019, 21, 253–277. [Google Scholar] [CrossRef]
- Smaili, K.H.; Kadri, T.H.; Kadry, S. Finding the PDF of the hypoexponential random variable using the Kad matrix similar to the general Vandermonde matrix. Commun. Statist. Theory Methods 2016, 45, 1542–1549. [Google Scholar] [CrossRef]
- Ross, S.M. Introduction to Probability Models, 12th ed.; Academic Press: New York, NY, USA, 2019. [Google Scholar]
- Feller, W. An Introduction to Probability Theory and Its Applications, 2nd ed.; Wiley: New York, NY, USA, 1971; Volume II. [Google Scholar]
- Sen, A.; Balakrishnan, N. Convolution of geometrics and a reliability problem. Stat. Probab. Lett. 1999, 43, 421–426. [Google Scholar] [CrossRef]
- Ahsanullah, M. Characterizations of Univariate Continuous Distributions; Atlantic Press: Amsterdam, The Netherlands, 2017. [Google Scholar]
- Arnold, B.C.; Huang, J.S. Characterizations. In The Exponential Distribution: Theory, Methods and Applications; Balakrishnan, N., Basu, A.P., Eds.; Gordon and Breach: Amsterdam, The Netherlands, 1995; pp. 79–95. [Google Scholar]
- Azlarov, T.; Volodin, N.A. Characterization Problems Associated with the Exponential Distribution; Springer: Berlin, Germany, 1986. [Google Scholar]
- Nagaraja, H.N. Characterizations of probability distributions. In Springer Handbook of Engineering Statistics; Pham, H., Ed.; Springer: Berlin, Germany, 2006; pp. 395–402. [Google Scholar]
- Arnold, B.C.; Villaseñor, J.A. Exponential characterizations motivated by the structure of order statistics in sample of size two. Stat. Probab. Lett. 2013, 83, 596–601. [Google Scholar] [CrossRef]
- Yanev, G.P. On Arnold-Villaseñor conjectures for characterizing exponential distribution based on sample of size three. REVSTAT 2020, 18, 177–188. [Google Scholar]
- Thaheem, A.B.; Laradji, A. Classroom note: A generalization of Leibniz rule for higher derivatives. Intern. J. Math. Educ. Sci. Technol. 2003, 34, 739–742. [Google Scholar] [CrossRef]
- Smaili, K.H.; Kadri, T.H.; Kadry, S. Hypoexpponential distribution with different parameters. Appl. Math. 2013, 4, 624–631. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2020 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).