Next Article in Journal
Regularity of Idempotent Reflexive GP-V’-Rings
Previous Article in Journal
S&SEM: A Secure and Speed-Up Election Mechanism for PoS-Based Blockchain Network
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Comparison of MLE for Some Index Distributions Based on Censored Samples

1
Glorious Sun School of Business and Management, Donghua University, Shanghai 200051, China
2
Department of Statistics, Shanghai University of Finance and Economics Zhejiang College, Jinhua 321000, China
3
Department of Mathematics, Shanghai University of Finance and Economics Zhejiang College, Jinhua 321000, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Mathematics 2024, 12(20), 3264; https://doi.org/10.3390/math12203264
Submission received: 19 September 2024 / Revised: 11 October 2024 / Accepted: 15 October 2024 / Published: 17 October 2024
(This article belongs to the Special Issue Decision Making under Uncertainty in Soft Computing)

Abstract

:
This paper elucidates the prerequisites for maximum likelihood estimation (MLE) of parameters within the exponential and scale parameter families. Estimation of these parameters is predicated on data derived from censored samples and seeks to adhere to stochastic ordering principles. The study establishes that for two independent normal distributions and a two-parameter exponential distribution discernible by the distinct parameter sets, the MLEs of the parameters evince a stochastically ordered relationship when evaluated using full datasets. Furthermore, this research is extended to corroborate the persistence of stochastic ordering in the MLEs of such parameters under conditions of fixed censoring of samples.

1. Introduction

In medical research, finance, service industries, and business analytics, the exponential distribution and MLE are pivotal. Among other things, they assess treatment outcomes amid censored survival data, gauge risks for insurance purposes, optimize service delivery in call centers, ensure stochastic order for decision-making, facilitate statistical inferences from incomplete datasets, and validate model accuracy. This approach is paramount for efficient resource management and informed strategic planning across sectors.
In a series of examples, such as life-test or reliability tests, there is prior information available about the parameters. An experimenter can use this prior information to judge that a product is more reliable than its competitive product. In this scenario, the expectation is that parameter estimation will also effectively capture the comparative magnitudes of the parameters. Numerous inferential methods for estimating global parameters under these ordered constraints have been explored. The literature provides comprehensive details on inferential techniques for order constraints, as presented by Barlow (1972) [1] and expanded upon by Nuesch (1999) [2], among others.
N. Balakrishnan and Jiemi (2001) [3] proposed the random order problem for maximum likelihood estimation of parameters under the full sample, and proposed three conditions for single parameter distribution based on the maximum likelihood estimate of the full sample. The random order theory is a set of theories about the “size” of random variables. In a certain sense, it can be used to compare one random variable (vector) in terms of its size or number of variables with respect to another random variable (vector) [4,5,6]. Random order is a theoretical tool and method for decision-making in random environments, and is also a typical representative of uncertain variable comparison [7,8,9,10]. The study of random order has a wide and profound practical background, making it of great significance in both theory and application [11,12,13,14,15].
The life test is divided into the complete life test and the censored life test according to the failure of the sample, with the latter being the most widely used. Bartholomew (1957) [16] mentioned some problems and solutions that may arise during life test. The exponential distribution is one of the more widely used distributions for modeling life in reliability theory and practical reliability engineering [17,18,19]. Many useful results on exponential distributions can be found in [20,21]. The problem of estimating the mean of the exponential distribution is very important, and different maximum likelihood estimates can exist depending on the censored sample.
Consider a population that has a probability density function f ( x , θ ) , where the parameter θ belongs to a subset Θ of ( , + ) . One important aspect of statistical inference is obtaining a point estimator of θ . Many different methods of estimation are known in the literature, for example moment estimation, maximum likelihood estimation, least-squares estimation, etc. [22]. Various properties of these approaches, such as unbiasedness, weak and strong consistency, and asymptotic normality, have been discussed; in this paper, our aim is as follows: based on censored samples, we discuss the conditions under which the maximum likelihood estimates of the parameters of the family of exponential distributions satisfy random ordering [23,24,25,26].
By “order preserving”, we refer to the following concept: suppose that X and Y are independent of each other from two exponentially distributed aggregates, with X 1 , X 2 , , X n a sample from the aggregate f ( x , θ 1 ) and Y 1 , Y 2 , , Y n a sample from population f ( x , θ 2 ) . Assume that θ 1 , θ 2 Θ , and further that θ 1 < θ 2 . We denote the point estimators of θ 1 and θ 2 obtained from applying the same estimation method based on X = X 1 , X 2 , , X n and Y = Y 1 , Y 2 , , Y n by θ 1 ^ θ 1 ^ ( X ) and θ 2 ^ θ 2 ^ ( Y ) , respectively. Let X 1 , X 2 , , X r and Y 1 , Y 2 , , Y s be Type-I censored samples, where r n , s n . Because the real parameter values θ 1 and θ 2 satisfy θ 1 < θ 2 , it is desirable that a certain order exist between their point estimators θ 1 ^ and θ 2 ^ . In general, it cannot be true that θ 1 ^ θ 2 ^ point-wise due to the randomness of samples. Then, it is of interest to investigate what kind of order may exist between θ 1 ^ and θ 2 ^ . Because of its popularity and importance, a natural candidate for possible order between θ 1 ^ and θ 2 ^ is the usual stochastic order ≤.
Definition 1. 
If the random variables X and Y have distribution functions F 1 and F 2 , we say that X is stochastically less than or equal to Y if P ( X > t ) P ( Y > t ) , t ( , + ) , which is denoted by X s t Y or F 1 s t F 2 .
In comparison to the various stochastic orders known in the literature, the order ≤ or ≥ is not a restrictive one; however, even this order does not always hold between θ 1 ^ and θ 2 ^ , as the following example indicates.
Studying the order preservation of maximum likelihood estimates obtained from censored samples can verify consistency, rank parameters, perform variable selection, and evaluate robustness, helping us to better understand and apply maximum likelihood estimation methods. Consistency means that the maximum likelihood estimates converge to the true parameters as the sample size approaches infinity. By studying the order preservation of maximum likelihood estimates under truncated samples, we can further verify the consistency of maximum likelihood estimates in finite samples. Order preservation studies allow for parameter ranking. Through the analysis of order preservation after removing samples, we can compare the order of parameter estimates under different models or hypotheses, identify important parameters or model features, and help to understand the influence and importance of parameters. In regression models with a large number of predictors, analyzing the order preservation of parameter estimates after sample removal can help in selecting important variables based on the order preservation of parameter estimates, which not only reduces model complexity but also enhances model interpretability and prediction accuracy. In the presence of outliers or extreme observations, analyzing the order preservation after removing samples can quantify the sensitivity of maximum likelihood estimates to outliers and help to evaluate the robustness and stability of parameter estimates.
The rest of the paper is organized as follows. We demonstrate in Section 2 that the maximum likelihood estimators (MLEs) of the single-parameter exponential distribution, double-parameter exponential distribution, and normal distribution, which are part of the exponential family, exhibit stochastic order relations under complete sampling. In Section 3, we present the general form of the exponential family distribution and the specific conditions required for the location and scale parameter families to satisfy stochastic order in censored samples, followed by a numerical analysis. Section 4 details the conditions under which the MLEs of the double-parameter exponential distribution meet the criteria for stochastic order in the context of censored samples. Section 5 provided illustrative examples to clarify these concepts.

2. Comparison of MLE under Complete Samples

2.1. One-Parameter Exponential Distribution

Let g ( x , θ ) 0 be an integrable function on ( a , b ) for each θ Θ ( , + ) and let h ( θ ) be defined by
1 h ( θ ) = a b g ( x , θ ) d x < .
In addition, we denote
f ( x , θ ) h ( θ ) g ( x , θ ) .
Here, f ( x , θ ) is a valid probability density function on ( a , b ) . Suppose that θ 1 < θ 2 Θ and that the random variables X and Y have density functions f ( x , θ 1 ) and f ( y , θ 2 ) , respectively. Furthermore, let X 1 , X 2 , , X r and Y 1 , Y 2 , , Y s , be censored samples under f ( x , θ 1 ) and f ( y , θ 2 ) , respectively. The following result provides the conditions under which X s t Y , and the maximum likelihood estimators θ 1 ^ and θ 2 ^ for θ 1 and θ 2 satisfy θ 1 ^ s t θ 2 ^ . Throughout this article, it is always assumed that the maximum likelihood estimator belonging to the parameter space Θ exists and is unique. The conditions that guarantee this are not provided explicitly.
Lemma 1. 
Let f ( x , θ ) = h ( θ ) g ( x , θ ) be defined by (1) and (2).
Suppose that:
1. 
h ( θ ) is log-concave, i.e., d 2 d θ 2 ln h ( θ ) 0 , θ Θ ;
2. 
g ( x , θ ) is log-concave in θ Θ , i.e., 𝜕 2 𝜕 θ 2 ln g ( x , θ ) 0 , θ Θ , x ( a , b ) ;
3. 
𝜕 2 𝜕 x 𝜕 θ ln g ( x , θ ) 0 , θ Θ , x ( a , b ) .
If θ 1 < θ 2 Θ and at least one of the inequalities in 1–3 is strict, then there is ( a ) X s t Y and ( b ) θ 1 ^ s t θ 2 ^ .
The above lemma provides a general condition for the MLE to have the property of preserving the usual stochastic orders. It is given that when conditions 1, 2, and 3 are satisfied, two mutually independent exponentially distributed random variables exist under the equivalent carve-outs under the usual random order.

2.2. Two-Parameter Exponential Distribution

Theorem 1. 
Let the samples X 1 , X 2 , , X n follow a two-parameter exponential distribution with probability density function
f x , μ 1 , θ 1 = 1 θ 1 exp x μ 1 θ 1 , x μ 1 ,
and let the samples Y 1 , Y 2 , , Y n follow a two-parameter exponential distribution with probability density function
f y , μ 2 , θ 2 = 1 θ 2 exp y μ 2 θ 2 , y μ 2 .
If
μ 1 μ 2 and θ 1 θ 2 ,
then the maximum likelihood estimators μ ^ 1 and θ ^ 1 obtained from X 1 , X 2 , , X n satisfy the stochastic order relationship with the maximum likelihood estimators μ ^ 2 and θ ^ 2 obtained from Y 1 , Y 2 , , Y n , denoted as
μ ^ 1 s t μ ^ 2 and θ ^ 1 s t θ ^ 2 .
Proof. 
The likelihood function of μ , θ is
L ( x 1 , x 2 , , x n , μ , θ ) = 1 θ n exp i = 1 n x i μ θ ,
and its log-likelihood function is
ln L x 1 , x 2 , , x n , μ , θ = n ln θ i = 1 n x i μ θ .
Observing the expression of ln L x 1 , x 2 , , x n , μ , θ , for any fixed θ , in order to maximize the likelihood function μ must be as large as possible with μ x ( 1 ) . The likelihood function L ( x 1 , x 2 , , x n , ν , θ ) reaches its maximum; hence, μ ^ = X 1 ,and then θ ^ = S / n , where S = X ( 1 ) X ( 1 ) i = 1 n . Thus, we have
μ ^ 1 = X ( 1 ) , θ ^ 1 = S 1 n , S 1 = i = 1 n X ( i ) X ( 1 ) ,
μ ^ 2 = X ( 2 ) , θ ^ 2 = S 2 n , S 2 = i = 1 n Y ( i ) Y ( 1 ) .
Knowing μ ^ = X ( 1 ) W 1 , θ n , μ , we aim to find μ ^ 1 s t μ ^ 2 , that is, t , p μ ^ 1 > t p μ ^ 2 > t ,
t n θ 1 exp n x μ 1 θ 1 d x t n θ 2 exp n y μ 2 θ 2 d y .
From
μ 1 μ 2 , θ 1 θ 2 ,
we know that the above formula holds, that is, μ 1 s t μ ^ 2 .
Given that S = i = 1 n X ( i ) X ( 1 ) Γ ( n 1 , θ ) , we want to obtain θ ^ 1 s t θ ^ 2 , that is,
t , p θ ^ 1 > t p θ ^ 2 > t p S 1 n > t p S 2 n > t .
Because
p S 1 n > t = t θ 1 n 1 x n n 2 n Γ ( n 1 ) exp θ 1 x n d x = θ 1 n 1 ( n 2 ) ! n n 1 t exp θ 1 x n x n 2 d x ,
p S 2 n > t = t θ 2 n 1 y n n 2 n Γ ( n 1 ) exp θ 2 y n d y = θ 2 n 1 ( n 2 ) ! n n 1 t exp θ 2 y n y n 2 d y ,
and θ 1 θ 2 , we have
p S 1 n > t p S 2 n > t ;
therefore, θ ^ 1 s t θ ^ 2 . This completes the proof of Theorem 1. □

2.3. Normal Distribution

Theorem 2. 
Let the samples X 1 , X 2 , , X n follow a normal distribution with probability density function
f x , μ 1 , σ 1 = 1 2 π σ 1 exp x μ 1 2 2 σ 1 2 , < x < ,
and let the samples Y 1 , Y 2 , , Y n follow a normal distribution with probability density function
f y , μ 2 , σ 2 = 1 2 π σ 2 exp y μ 2 2 2 σ 2 2 , < y < .
If μ 1 μ 2 and σ 1 2 σ 2 2 , then the maximum likelihood estimators μ ^ 1 and σ ^ 1 2 obtained from X 1 , X 2 , , X n satisfy the stochastic order relationship with the maximum likelihood estimators μ ^ 2 and σ ^ 2 2 obtained from Y 1 , Y 2 , , Y n , which we denote as μ ^ 1 s t μ ^ 2 and σ ^ 1 2 s t σ ^ 2 2 .
Proof. 
For all t, the probability p ( X ¯ > t ) p ( Y ¯ > t ) if and only if
p X ¯ μ 1 σ 1 n > t μ 1 σ 1 n p Y ¯ μ 2 σ 2 n > t μ 2 σ 2 n ,
where the random variable X ¯ μ σ n follows a standard normal distribution N ( 0 , 1 ) . Therefore,
p X ¯ μ 1 σ 1 n > t μ 1 σ 1 n = t μ 1 σ 1 n 1 2 π exp x 2 2 d x ,
p Y ¯ μ 2 σ 2 n > t μ 2 σ 2 n = t μ 2 σ 2 n 1 2 π exp y 2 2 d y .
Given that μ 1 μ 2 and σ 1 2 σ 2 2 , it follows that
t μ 1 σ 1 n t μ 2 σ 2 n ,
which implies
p X ¯ μ 1 σ 1 n > t μ 1 σ 1 n p Y ¯ μ 2 σ 2 n > t μ 2 σ 2 n ,
indicating that μ ^ 1 s t μ ^ 2 .
To achieve σ 1 2 s t σ 2 2 , it suffices to show that for all t, p ( σ ^ 1 2 > t ) p ( σ ^ 2 2 > t ) . To this end, we can consider
p n σ ^ 1 2 σ 1 2 > n t σ 1 2 p n σ ^ 2 2 σ 2 2 > n t σ 2 2 .
Because σ ^ 2 = 1 n i = 1 n X i X ¯ 2 , it follows that n σ ^ 2 σ 2 follows a chi-squared distribution with n degrees of freedom. Therefore,
p n σ ^ 1 2 σ 1 2 > n t σ 1 2 = n t σ 1 2 x n 2 1 e x 2 2 n 2 Γ n 2 d x ,
p n σ ^ 2 2 σ 2 2 > n t σ 2 2 = n t σ 2 2 y n 2 1 e y 2 2 n 2 Γ n 2 d y .
Given that σ 1 2 σ 2 2 , it follows that n t σ 1 2 n t σ 2 2 ; thus,
n t σ 1 2 x n 2 1 e x 2 2 n 2 Γ n 2 d x n t σ 2 2 y n 2 1 e y 2 2 n 2 Γ n 2 d y ,
which implies
p n σ ^ 1 2 σ 1 2 > n t σ 1 2 p n σ ^ 2 2 σ 2 2 > n t σ 2 2 ,
confirming that σ ^ 1 2 s t σ ^ 2 2 . □

3. Comparison of MLEs under Censored Samples

3.1. Generalization of the Exponential Distribution

Theorem 3. 
Assuming that the exponential distributions of populations X and Y are independent, the parameters are θ 1 and θ 2 , respectively. Suppose that X 1 , X 2 ,…, X n are lifetimes following the exponential distribution with parameter θ 1 , while Y 1 , Y 2 ,…, Y n are lifetimes following the exponential distribution with parameter θ 2 . Given ( 0 < t < + ) , let r = be the number of X i that satisfies X i t and s = be the number of Y i that satisfies Y i t , r n , s n . Moreover, θ 1 ^ and θ 2 ^ are maximum likelihood estimators of θ 1 and θ 2 . Suppose that:
(I) 
t b g ( x , θ ) d x increases with respect to θ, while g ( x , θ ) and its derivatives are continuous and t b 𝜕 𝜕 θ g ( x , θ ) d x decreases with respect to θ;
(II) 
r s , x = [ t b 𝜕 𝜕 θ g ( x , θ ) d x ] | θ = θ 1 ^ t b g ( x , θ 1 ^ ) d x , z = [ t b 𝜕 𝜕 θ g ( z , θ ) d z ] | θ = θ 2 ^ t b g ( z , θ 2 ^ ) d z , Δ z Δ x n r n s ;
If θ 1 < θ 2 Θ , then there is (a) X s t Y and (b) θ 1 ^ s t θ 2 ^ .
Proof. 
We shall first prove (a). For any t ( a , b ) , we have
f ( t , θ 2 ) f ( t , θ 1 ) = h ( θ 2 ) g ( t , θ 2 ) h ( θ 1 ) g ( t , θ 1 ) .
Note that condition (I) implies that
𝜕 𝜕 t ln g ( t , θ 2 ) 𝜕 𝜕 t ln g ( t , θ 1 ) , t ( a , b ) ,
which is equivalent to
𝜕 𝜕 t ln ( g ( t , θ 2 ) g ( t , θ 1 ) ) 0 .
From (1) and (2), we can see that f ( t , θ 2 ) / f ( t , θ 1 ) increases in t ( a , b ) . This means that X is smaller than Y in the likelihood ratio order denoted by X l r Y . This order further yields X s t Y ; hence, (a) is proved.
Because X 1 , X r and Y 1 , Y s are censored samples, they are independent and identically distributed; in addition, X i s t Y i is known from conclusion (a), and there is a random variable Z i ( i = 1 , 2 , , s ) known by the coupling method that makes X i Z i point by point, where Z i and Y i have the same distribution.
If n products are put into the timing truncation experiment, the truncation time is t and the timing truncation sample 0 x 1 x 2 x r t is obtained, that is, r products fail in the time interval [ 0 , t ] and n r do not fail at time x r . The probability of a product failure in the interval ( x i , x i + d x i ) is approximately f ( x i , θ ) d x i , ( i = 1 , 2 r ) . The probability of the remaining n r product life exceeding x r is t b f ( x , θ ) d x ) n r .
Therefore, the likelihood function of the sample is
L ( x , θ ) = i = 1 r f ( x i , θ ) ( t b f ( x , θ ) d x ) n r = [ h ( θ ) ] n i = 1 r g ( x i , θ ) ( t b g ( x , θ ) d x ) n r ,
i.e.,
𝜕 𝜕 θ ln L ( x , θ ) = n h ( θ ) h ( θ ) + i = 1 r g θ ( x i , θ ) g ( x i , θ ) + ( n r ) 𝜕 𝜕 θ t b g ( x , θ ) d x t b g ( x , θ ) d x .
Thus, we can know that θ 1 ^ ( X ) is determined by
n h ( θ 1 ) h ( θ 1 ) + i = 1 r g θ ( x i , θ 1 ) g ( x i , θ 1 ) + ( n r ) 𝜕 𝜕 θ t b g ( x , θ ) d x t b g ( x , θ ) d x , | θ = θ 1 = 0
while θ 2 ^ ( Z ) is determined by
n h ( θ 2 ) h ( θ 2 ) + i = 1 s g θ ( z i , θ 2 ) g ( z i , θ 2 ) + ( n s ) 𝜕 𝜕 θ t b g ( z , θ ) d z t b g ( z , θ ) d z . | θ = θ 2 = 0
To prove ( b ) , it is sufficient to show that θ 1 ^ ( X ) θ 2 ^ ( Z ) is pointwise, that is,
p ( θ 1 ^ ( X ) θ 2 ^ ( Z ) ) = 1 .
Using the coupling method, we obtain θ 1 ^ θ 1 ^ ( X ) s t θ 2 ^ ( Y ) θ 2 ^ .
Now, we shall prove that θ 1 ^ θ 2 ^ ( Z ) pointwise. Let us first assume the contrary, that is, that θ 1 ^ ( X ) > θ 2 ^ ( Z ) holds on a set of positive probabilities. From condition 1, the function h ( θ ) / h ( θ ) decreases in θ Θ ; hence, it follows that on the set θ 1 ^ > θ 2 ^ ( Z ) we have
h ( θ 1 ^ ) h ( θ 1 ^ ) h ( θ 2 ^ ) h ( θ 2 ^ ) .
Inequalities (19), (21), and (23) imply that
1 n i = 1 r g θ ( x i , θ 1 ^ ) g ( x i , θ 1 ^ ) + ( n r ) n [ t b 𝜕 𝜕 θ g ( x , θ ) d x ] | θ = θ 1 ^ t b g ( x , θ 1 ^ ) d x 1 n i = 1 s g θ ( z i , θ 2 ^ ) g ( z i , θ 2 ^ ) + ( n s ) n [ t b 𝜕 𝜕 θ g ( z , θ ) d z ] | θ = θ 2 ^ . t b g ( z , θ 2 ^ ) d z
From condition (I), it can be seen that
𝜕 𝜕 θ ln g ( z , θ ) 𝜕 𝜕 θ ln g ( x , θ ) , θ Θ , x < z ( a , b ) ,
and consequently that
g θ ´ ( x i , θ ) g ( x i , θ ) g θ ´ ( z i , θ ) g ( z i , θ ) , θ Θ , x < z ( a , b ) .
Because X i Z i pointwise ( 1 i n ) , from (25) it follows that
g θ ´ ( X i , θ 1 ^ ) g ( X i , θ 1 ^ ) g θ ´ ( Z i , θ 1 ^ ) g ( Z i , θ 1 ^ ) .
On the other hand, it is true that
g θ ´ ( Z i , θ 1 ^ ) g ( Z i , θ 1 ^ ) g θ ´ ( Z i , θ 2 ^ ( Z ) ) g ( Z i , θ 2 ^ ) .
Because Lemma 1 implies that g ´ θ ( t , θ ) / g ( t , θ ) decreases in θ Θ for any t ( a , b ) and θ 1 ^ > θ 2 ^ ( Z ) , by combining (26) and (27) we obtain
g θ ´ ( X i , θ 1 ^ ) g ( X i , θ 1 ^ ) g θ ´ ( Z i , θ 2 ^ ( Z ) ) g ( Z i , θ 2 ^ ) ,
and consequently,
i = 1 n g θ ´ ( X i , θ 1 ^ ) g ( X i , θ 1 ^ ) i = 1 n g θ ´ ( Z i , θ 2 ^ ( Z ) ) g ( Z i , θ 2 ^ ( Z ) ) .
Because of r s , we have
1 n i = 1 r g θ ´ ( x i , θ 1 ^ ) g ( x i , θ 1 ^ ) 1 n i = 1 s g θ ´ ( z i , θ 2 ^ ) g ( z i , θ 2 ^ ) .
According to condition (I),
x b g ( x , θ ) d x
is increased with respect to θ and θ 1 ^ > θ 2 ^ ; thus, we have
t b g ( x , θ 1 ^ ) d x t b g ( z , θ 2 ^ ) d z .
Also from condition (I),
x b 𝜕 𝜕 θ g ( x , θ ) d x
decreases with respect to θ and θ 1 ^ > θ 2 ^ ; thus, we have
[ t b 𝜕 𝜕 θ g ( x , θ ) d x ] | θ = θ 1 ^ t b 𝜕 𝜕 θ g ( z , θ ) d z ] | θ = θ 2 ^ .
From Condition (II), Δ z Δ x n r n s ; thus, it is apparent that
n r n [ t b 𝜕 𝜕 θ g ( x , θ ) d x ] | θ = θ 1 ^ t b g ( x , θ 1 ^ ) d x n s n t b 𝜕 𝜕 θ g ( z , θ ) d z ] | θ = θ 2 ^ t b g ( z , θ 2 ^ ) d z .
Because at least one of conditions (I) or (II) is strictly established, we can combine (29) and (30) to obtain
1 n i = 1 r g θ ( x i , θ 1 ^ ) g ( x i , θ 1 ^ ) + ( n r ) n [ t b 𝜕 𝜕 θ g ( x , θ ) d x ] | θ = θ 1 ^ t b g ( x , θ 1 ^ ) d x < 1 n i = 1 s g θ ( z i , θ 2 ^ ) g ( z i , θ 2 ^ ) + ( n s ) n [ t b 𝜕 𝜕 θ g ( z , θ ) d z ] | θ = θ 2 ^ t b g ( z , θ 2 ^ ) d z .
This contradicts (24), meaning that the hypothesis does not hold; thus, θ 1 ^ ( X ) θ 2 ^ ( Z ) is further obtained by θ 1 ^ s t θ 2 ^ , and ( b ) is proved. □
Example 1. 
For the gamma density function f ( x , α ) = λ α Γ ( α ) x α 1 e λ α , x > 0 , α > 0 , where λ > 0 is known, it can be verified that conditions (I) and (II) in Theorem 1 are satisfied. Let h ( α ) = λ α Γ ( α ) ; then,
d 2 d α 2 ln h ( α ) = d d α Γ ( α ) d α Γ ( α ) = j = 0 1 ( α + j ) 2 < 0 , α > 0 .
Therefore, the conditions in Theorem 3 are all satisfied, and if α 1 α 2 , then there is α 1 ^ s t α 2 ^ .
Example 2 
(Beta Distribution). Consider the beta density function
f ( x , α ) = Γ ( α + β ) Γ ( α ) Γ ( β ) x α 1 ( 1 x ) β 1 , 0 < x < 1 , α > 0
with known β > 0 . This is an exponential family density with h ( α ) = Γ ( α + β ) Γ ( α ) Γ ( β ) . It can be verified that all the conditions of Theorem 3 are satisfied. In particular, we have
d 2 d α 2 ln h ( α ) = d d α Γ ( α + β ) Γ ( α + β ) Γ ( α ) Γ ( α )
= j = 1 1 ( α + β + j ) 2 j = 1 1 ( α + j ) 2 < 0 .
Hence, 0 < α 1 < α 2 implies that α 1 ^ s t α 2 ^ . Similarly, assuming that α is known, it can be shown that 0 < β 1 < β 2 implies the maximum likelihood estimators β 1 ^ s t β 2 ^ .

3.2. Location and Scale Family

It can be seen that Theorem 1 cannot be applied to the exponential density function f ( x , θ ) = 1 θ e x θ , x > 0 , θ > 0 .
To better study the randomness of the maximum likelihood estimator under censored samples, Theorem 3 needs to be further improved.
Assuming that g ( x ) is a probability density function defined on the interval ( 0 , ) , then for any θ Θ ( 0 , + ) , f ( x , θ ) 1 θ g ( x θ ) , the density function set
{ f ( x , θ ) = 1 θ g ( x θ ) , θ Θ ( 0 , + ) }
is called the scale distribution family.
Theorem 4. 
Let { f ( x , θ ) 1 θ g ( x θ ) , θ Θ } :
(i) 
x g ( x ) g ( x ) is strictly increasing with respect to x > 0 ;
(ii) 
r s , x = 𝜕 𝜕 θ [ t b g ( x θ ) d x ] | θ = θ 1 ^ t b g ( x θ 1 ^ ) d x , y = 𝜕 𝜕 θ [ t b g ( y θ ) d y ] | θ = θ 2 ^ t b g ( y θ 2 ^ ) d y , Δ y Δ x ( n r ) θ 1 ^ ( n s ) θ 2 ^ ;
If θ 1 < θ 2 Θ , then (a) X s t Y ; further, if g ( x ) is either strictly log-concave or strictly log-convex, then (b) θ 1 ^ s t θ 2 ^ .
Proof. 
We shall first prove (a).
Let X 1 X r be an independent and identically distributed random variable with the common probability density f ( x , θ 1 ) . Furthermore, let Y 1 Y s be an independent and identically distributed random variable with common probability density f ( y , θ 2 ) .
Using the coupling method, it can be seen that there is an independent and identically distributed random variable Z i ( 1 i s ) such that X i Z i pointwise; moreover, Z i has the same distribution as Y i ( 1 i s ) . Then, X 1 X r and Z 1 Z s can be proved as in Theorem 1, providing X s t Y .
Without loss of generality, we simply assume that X i Y i pointwise; thus, we need to show that θ 1 ^ θ 2 ^ pointwise.
Using the contradiction method, suppose that θ 1 ^ > θ 2 ^ ; then, in the set { θ 1 ^ > θ 2 ^ } , we have X i θ 1 ^ < Y i θ 2 ^ .
From (32), under the condition of censored samples (assuming the censored number is r), θ 1 ^ is determined by
n + i = 1 r g ( ´ x i θ 1 ) g ( x i θ 1 ) x i θ 1 + ( n r ) 𝜕 𝜕 θ [ t b g ( x θ ) d x ] | θ = θ 1 t b g ( x θ 1 ) d x θ 1 = 0
and θ 2 ^ is determined by
n + i = 1 s g ( ´ y i θ 2 ) g ( y i θ 2 ) y i θ 2 + ( n s ) 𝜕 𝜕 θ [ t b g ( y θ ) d y ] | θ = θ 2 t b g ( y θ 2 ) d y θ 2 = 0 ,
that is, it always holds that
i = 1 r g ( ´ x i θ 1 ^ ) g ( x i θ 1 ^ ) x i θ 1 ^ + ( n r ) 𝜕 𝜕 θ [ t b g ( x θ ) d x ] | θ = θ 1 ^ t b g ( x θ 1 ^ ) d x θ 1 ^ = i = 1 s g ( ´ y i θ 2 ^ ) g ( y i θ 2 ^ ) y i θ 2 ^ + ( n s ) 𝜕 𝜕 θ [ t b g ( y θ ) d y ] | θ = θ 2 ^ t b g ( y θ 2 ^ ) d y θ 2 ^ .
Because x g ( x ) g ( x ) is strictly increasing and X i θ 1 ^ < Y i θ 2 ^ , we have
g ( x i θ 1 ^ ) g ( x i θ 1 ^ ) x i θ 1 ^ < g ( y i θ 2 ^ ) g ( y i θ 2 ^ ) y i θ 2 ^
and r s ; thus,
i = 1 r g ( x i θ 1 ^ ) g ( x i θ 1 ^ ) x i θ 1 ^ < i = 1 s g ( y i θ 2 ^ ) g ( y i θ 2 ^ ) y i θ 2 ^ .
From the condition Δ y Δ x ( n r ) θ 1 ^ ( n s ) θ 2 ^ we can obtain
( n r ) 𝜕 𝜕 θ [ t b g ( x θ ) d x ] | θ = θ 1 ^ t b g ( x θ 1 ^ ) d x θ 1 ^ ( n s ) 𝜕 𝜕 θ [ t b g ( y θ ) d y ] | θ = θ 2 ^ t b g ( y θ 2 ^ ) d y θ 2 ^ .
Inequalities (37) and (38) imply that
i = 1 r g ( ´ x i θ 1 ^ ) g ( x i θ 1 ^ ) x i θ 1 ^ + ( n r ) 𝜕 𝜕 θ [ t b g ( x θ ) d x ] | θ = θ 1 ^ t b g ( x θ 1 ^ ) d x θ 1 ^ < i = 1 s g ( ´ y i θ 2 ^ ) g ( y i θ 2 ^ ) y i θ 2 ^ + ( n s ) 𝜕 𝜕 θ [ t b g ( y θ ) d y ] | θ = θ 2 ^ t b g ( y θ 2 ^ ) d y θ 2 ^ .
As (35) and (39) are contradictory, the hypothesis does not hold; therefore, θ 1 ^ s t θ 2 ^ is proved. □
Example 3. 
Consider the function
f ( x , θ ) = x α Γ ( α ) θ α e x θ , x > 0 , θ > 0
with known α > 0 and f ( x , θ ) satisfying the conditions in Theorem 4. Therefore, if 0 < θ 1 θ 2 , then there is θ 1 ^ s t θ 2 ^ , In particular, when α = 1 , it is an exponential distribution, which is also true.
Example 4. 
The logistic distribution with density
f ( x , μ ) = exp ( ( x μ ) ) 1 + exp ( ( x μ ) ) 2
satisfies the conditions of Theorem 4; thus, if μ 1 < μ 2 , then η 1 η 2 .
Now, let us investigate the scale family. Suppose that g ( x ) is a probability density function on ( 0 , ) . For any θ > 0 , f ( x , θ ) = g ( x / θ ) θ forms a density on ( 0 , ) . We refer to the collection of density functions { f ( x , θ ) = g ( x / θ ) θ , θ ( 0 , ) } as a scale family of distribution.
The scale parameter can be reduced to a location parameter by logarithmic transform. Therefore, the following result follows immediately from Theorem 4.

3.3. Numerical Examples

As part of this study, we conducted a simulation analysis on two exponential populations with parameters θ = 2 and θ = 5 . We employed censored data with a predetermined number of failures r = 35 to obtain the maximum likelihood estimator θ ^ . For each of these two exponential distributions, 50 experimental units were tested and the required sample data were recorded to apply this method. This experiment was repeated 1000 times. Based on the 1000 values of θ ^ corresponding to each population under this method, the empirical cumulative distribution functions (ECDFs) of θ ^ were calculated.
The empirical cumulative distribution functions (ECDFs) of θ ^ associated with θ = 2 and θ = 5 obtained by applying censoring with r = 35 are plotted in Figure 1.

3.4. Data Analysis

We used numerical simulation to analyze a practical application as a case study. The Monte Carlo method was used to randomly generate samples from the exponential distributions of θ 1 = 2 and θ 2 = 5 with sample sizes of n = 10, 50, 100, 200, and 500. The maximum likelihood estimates of parameters θ 1 and θ 2 , denoted as θ ^ 1 and θ ^ 2 , were obtained from each set of simulated data. The entire program was repeated 5000 times, with the bias and mean square error (MSE) used to evaluate the maximum likelihood estimates. The calculation formulas are as follows:
B i a s ( θ j ) = 1 5000 i = 1 5000 ( θ ^ j θ j ) , j = 1 , 2 ,
and
M S E ( θ j ) = 1 5000 i = 1 5000 ( θ ^ j θ j ) 2 , j = 1 , 2 , .
Table 1 shows the maximum likelihood estimates, biases, and mean square errors for parameters θ 1 = 2 and θ 2 = 5 of the exponential distribution. It can be observed that the maximum likelihood estimates for both parameters approach the true values as the sample size increases. Furthermore, the biases and mean square errors decrease as the sample size increases, indicating that the maximum likelihood estimation exhibits good stability.

4. Two-Parameter Exponential Distribution

Theorem 5. 
In addition, we performed a tailing experiment on two sets of samples with a capacity of n. Here, the tail number is m, the tailed samples 0 < x 1 x 2 x m obey the two-parameter exponential distribution
f ( x , μ 1 , θ 1 ) = 1 θ 1 exp x μ 1 θ 1 , x μ 1 ,
and the tailed samples 0 < y 1 y 2 y m obey the two-parameter exponential distribution
f ( y , μ 2 , θ 2 ) = 1 θ 2 exp y μ 2 θ 2 , y μ 2 .
If
μ 1 μ 2 , θ 1 θ 2 ,
then there is
μ ^ 1 s t μ ^ 2 , θ ^ 1 s t θ ^ 2 ,
where μ ^ 1 , θ ^ 1 are the censored maximum likelihood estimators obtained from 0 < x 1 x 2 x m and μ ^ 2 , θ ^ 2 are the censored maximum likelihood estimators obtained from 0 < y 1 y 2 y m .
Proof. 
The likelihood function of μ 1 , θ 1 is as follows:
L x 1 , x 2 , , x m , μ 1 , θ 1 = i = 1 r f x i , μ 1 , θ 1 x m f x , μ 1 , θ 1 d x n r = 1 θ 1 r exp i = 1 r x i μ 1 θ 1 exp x r μ 1 θ 1 n r = 1 θ 1 r exp 1 θ 1 i = 1 r x i + ( n r ) x r n μ 1
and its log-likelihood function is
𝜕 ln L x 1 , x 2 , , x m , μ 1 , θ 1 𝜕 θ 1 = r θ 1 + 1 θ 1 2 i = 1 r x i + ( n r ) x r n μ 1 = 0 .
In order to make the likelihood function L ( x 1 , x 2 , , x m , μ 1 , θ 1 ) reach the maximum value, we have μ ^ 1 = X 1 according to the context, and
θ ^ 1 = 1 r i = 1 r X i + ( n r ) X r n X 1 ;
Similarly,
θ ^ 2 = 1 r i = 1 r Y i + ( n r ) Y r n Y 1 .
Because
X 1 f x , μ 1 , θ 1 = 1 θ 1 exp x μ 1 θ 1 Y 1 f y , μ 2 , θ 2 = 1 θ 2 exp y μ 2 θ 2
i = 1 r X i + ( n r ) X r n X 1 Γ r 1 , θ 1 i = 1 r Y i + ( n r ) Y r n Y 1 Γ r 1 , θ 2
are given, we can use a proof method similar to that in Theorem 4 to obtain
μ ^ 1 s t μ ^ 2 , θ ^ 1 s t θ ^ 2 .

5. Illustrative Example

Assume that we have two types of batteries and that their lifetimes follow exponential distributions with parameters λ 1 = 1 100 (i.e., the average lifetime is 100 h) and λ 2 = 1 120 (i.e., the average lifetime is 120 h). We want to test the stochastic order of the lifetimes of the two types of batteries.
We can draw 50 samples from each distribution, but only record the lifetime data exceeding 80 h (censoring point t = 80 h).
The following are the sample data (in hours) drawn from the two distributions, listing only the samples exceeding 80 h:
Battery Type 1: λ 1 = 1 100
  • Samples: 82, 85, 90, 84, 81, 83, 85, 89, 88, 87, …
Battery Type 2: λ 2 = 1 120
  • Samples: 81, 86, 92, 83, 85, 87, 90, 82, 84, 88, ….
For the exponential distribution, the MLE λ can be calculated using the following formula:
λ ^ = n i = 1 n x i
where n is the sample size and x i are the sample data.
λ ^ 1 = 50 82 + 85 + 90 +
λ ^ 2 = 50 81 + 86 + 92 +
Assuming the calculations yield
λ ^ 1 = 1 85 ,
λ ^ 2 = 1 90 ,
we then need to verify whether λ ^ 1 λ ^ 2 holds. Because 1 85 > 1 90 , it is implied that λ ^ 1 > λ ^ 2 , meaning that Battery Type 1 has a higher failure rate. As we are interested in the lifespan, we can conclude that Battery Type 2 has a longer lifespan.
Based on the maximum likelihood estimates, we can conclude that Battery Type 2 has a longer lifespan, satisfying the condition of stochastic order λ ^ 1 λ ^ 2 .
This practical example demonstrates how to compare the lifespan distributions of batteries with different parameters using the MLEs from censored samples and infer differences in battery performance through stochastic order.

6. Conclusions

In this study, we have successfully validated two crucial theorems which establish that maximum likelihood estimators (MLEs) for parameters associated with location and exponential distribution families maintain stochastic ordering when estimated from censored samples provided that certain conditions are met. Furthermore, we provide practical examples that demonstrate the applicability of these theorems in real-world scenarios.

Author Contributions

Conceptualization, Y.L.; methodology, J.R., X.L. and P.L.; software (R language 4.3): J.R.; formal analysis, C.G.; data curation, P.L.; writing—original draft, J.R. and X.L.; writing—editing, C.G. and J.R.; project administration, Y.L., C.G. and J.R.; funding acquisition, J.R. All authors have read and agreed to the published version of the manuscript.

Funding

This paper was supported by the Shanghai University of Finance and Economics, Zhejiang College Educational Committee (Grant No. 2020 GR007).

Data Availability Statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Brunk, H.D.; Barlow, R.E.; Bartholomew, D.J.; Bremner, J.M. International Statistical Review. In Statistical Inference under Order Restrictions; Wiley: Hoboken, NJ, USA, 1972; Volume 41, pp. 395–412. [Google Scholar]
  2. Nuesch, P.E. Order Restricted Statistical Inference. J. Appl. Econom. 1991, 6, 105–107. [Google Scholar]
  3. Viveros, R.; Balakrishnan, N. Interval estimation of parameters of life from progressively censored data. Technometrics 1994, 36, 84–91. [Google Scholar] [CrossRef]
  4. Balakrishnan, N.; Brain, C.; Mi, J. Stochastic order and MLE of the mean of the exponential distribution. Methodol. Comput. Appl. Probab. 2001, 4, 83–93. [Google Scholar] [CrossRef]
  5. Balakrishnan, N.; Mi, J. Order-preserving property of maximum likelihood estimator. J. Stat. Plan. Inference 2001, 98, 89–99. [Google Scholar] [CrossRef]
  6. Bain, L.J.; Engelhardt, M. Interval estimation for the two-parameter double exponential distribution. Technometrics 1973, 15, 875–887. [Google Scholar] [CrossRef]
  7. Kng, F.; Fei, H. Limit theorems for the maximum likelihood estimate under general multiply Type II censoring. Ann. Inst. Stat. Math. 1996, 48, 731–755. [Google Scholar]
  8. Cohen, C.A.; Whitten, B. Modified maximum likelihood and modified moment estimators for the three-parameter Weibull distribution. Commun. Stat.-Theory Methods 1982, 11, 2631–2656. [Google Scholar] [CrossRef]
  9. Prajapati, D.; Mitra, S.; Kundu, D. A new decision theoretic sampling plan for type-I and type-I hybrid censored samples from the exponential distribution. Sankhya B 2019, 81, 251–288. [Google Scholar] [CrossRef]
  10. Krishnamoorthy, K.; Xia, Y. Confidence intervals for a two-parameter exponential distribution: One-and two-sample problems. Commun. Stat.-Theory Methods. 2018, 47, 935–952. [Google Scholar] [CrossRef]
  11. Rényi, A. On the theory of order statistics. Acta Math. Acad. Sci. Hung. 1953, 4, 48–89. [Google Scholar] [CrossRef]
  12. Kundu, D.; Kannan, N.; Balakrishnan, N. Analysis of progressively censored competing risks data. Handb. Stat. 2003, 23, 331–348. [Google Scholar]
  13. Guo, M.-Y.; Zhang, J.; Yan, R. Stochastic comparisons of second largest order statistics with dependent heterogeneous random variables. Commun. Stat.-Theory Methods 2024, 1–19. [Google Scholar] [CrossRef]
  14. Qiu, G.; Raqab, M. On weighted extropy of ranked set sampling and its comparison with simple random sampling counterpart. Commun. Stat.-Theory Methods 2024, 53, 378–395. [Google Scholar] [CrossRef]
  15. Crescenzo, A.D.; Paolillo, L.; Suárez-Llorens, A. Stochastic comparisons, differential entropy and varentropy for distributions induced by probability density functions. Metrika 2024, 1–17. [Google Scholar] [CrossRef]
  16. Bartholomew, D.J. A problem in life testing. J. Am. Stat. Assoc. 1957, 52, 350–355. [Google Scholar] [CrossRef]
  17. Al-Athari, M.F.M. Estimation of the mean of truncated exponential distribution. J. Math. Stat. 2008, 4, 284. [Google Scholar] [CrossRef]
  18. Weißbach, R.; Wied, D. Truncating the exponential with a uniform distribution. Stat. Pap. 2022, 63, 1247–1270. [Google Scholar] [CrossRef]
  19. Hannon, P.M.; Dahiya, R.C. Estimation of parameters for the truncated exponential distribution. Commun. Stat.-Theory Methods. 1999, 28, 2591–2612. [Google Scholar] [CrossRef]
  20. Hu, Y.-H.; Emura, T. Maximum likelihood estimation for a special exponential family under random double-truncation. Comput. Stat. 2015, 30, 1199–1229. [Google Scholar] [CrossRef]
  21. Sabti, A.N.; Ansseif, A.A.l.; Shakir, A.M. Estimating the Reliability for the Sequential System of Two Truncated Exponential Distribution. Ind. Eng. Manag. Syst. 2021, 20, 455–463. [Google Scholar] [CrossRef]
  22. Raschke, M. Inference for the truncated exponential distribution. Stoch. Environ. Res. Risk Assess. 2012, 26, 127–138. [Google Scholar] [CrossRef]
  23. Blumenthal, S.; Dahiya, R.C. Estimating scale and truncation parameters for the truncated exponential distribution with type-I censored sampling. Commun. Stat.-Theory Methods 2005, 34, 1–21. [Google Scholar] [CrossRef]
  24. Suich, R.; Rutemiller, H.C. Point Estimation of the Parameter of a Truncated Exponential Distribution. IEEE Trans. Reliab. 1982, 31, 393–397. [Google Scholar] [CrossRef]
  25. Akahira, M. Statistical Estimation for Truncated Exponential Families; Springer: Berlin/Heidelberg, Germany, 2017. [Google Scholar]
  26. Kumar, D.; Dey, S.; Nadarajah, S. Extended exponential distribution based on order statistics. Commun. Stat.-Theory Methods 2017, 46, 9166–9184. [Google Scholar] [CrossRef]
Figure 1. Censoring with r = 35 .
Figure 1. Censoring with r = 35 .
Mathematics 12 03264 g001
Table 1. Monte Carlo simulation results of the exponential distribution.
Table 1. Monte Carlo simulation results of the exponential distribution.
Par.nMLEBiasMSE
θ 1 = 2 102.77590.11140.2273
502.20500.02510.0170
1002.08480.01170.0056
2002.02430.00420.0023
5001.9884−0.00020.0008
θ 2 = 5 105.93810.15500.7082
505.41960.15290.3287
1005.08410.01400.0325
2005.03560.00770.0130
5004.9924−0.00720.0052
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, Y.; Gao, C.; Liu, X.; Luo, P.; Ren, J. A Comparison of MLE for Some Index Distributions Based on Censored Samples. Mathematics 2024, 12, 3264. https://doi.org/10.3390/math12203264

AMA Style

Liu Y, Gao C, Liu X, Luo P, Ren J. A Comparison of MLE for Some Index Distributions Based on Censored Samples. Mathematics. 2024; 12(20):3264. https://doi.org/10.3390/math12203264

Chicago/Turabian Style

Liu, Yunhan, Changchun Gao, Xiaofeng Liu, Ping Luo, and Jianguo Ren. 2024. "A Comparison of MLE for Some Index Distributions Based on Censored Samples" Mathematics 12, no. 20: 3264. https://doi.org/10.3390/math12203264

APA Style

Liu, Y., Gao, C., Liu, X., Luo, P., & Ren, J. (2024). A Comparison of MLE for Some Index Distributions Based on Censored Samples. Mathematics, 12(20), 3264. https://doi.org/10.3390/math12203264

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop