Next Article in Journal
Correction: Chen et al. Scoring Individual Moral Inclination for the CNI Test. Stats 2024, 7, 894–905
Previous Article in Journal
Enhancing Diversity and Improving Prediction Performance of Subsampling-Based Ensemble Methods
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Energy Statistic-Based Goodness-of-Fit Test for the Lindley Distribution with Application to Lifetime Data

Department of Mathematics and Statistics, Coastal Carolina University, Conway, SC 29526, USA
*
Author to whom correspondence should be addressed.
Stats 2025, 8(4), 87; https://doi.org/10.3390/stats8040087
Submission received: 25 August 2025 / Revised: 20 September 2025 / Accepted: 24 September 2025 / Published: 26 September 2025

Abstract

In this article, we propose a goodness-of-fit test for a one-parameter Lindley distribution based on energy statistics. The Lindley distribution has been widely used in reliability studies and survival analysis, especially in applied sciences. The proposed test procedure is simple and more powerful against general alternatives. Under different settings, Monte Carlo simulations show that the proposed test is able to be well controlled for any given nominal levels. In terms of power, the proposed test outperforms other existing similar methods in different settings. We then apply the proposed test to real-life datasets to demonstrate its competitiveness and usefulness.

1. Introduction

Research on reliability studies cannot be complete without mentioning exponential distributions. This has led many researchers to overlook other more flexible distributions in the analysis of lifetime data such as Gamma, Weibull, Lindley, and Log-normal distributions. The Lindley distribution was proposed by Lindley [1,2] as part of the larger exponential family and as a more superior competitor distribution for models based on the exponential distribution. In fact, Ghitany et al. [3] found that the Lindley distribution is a better alternative distribution for exponential distributions and provided a wide range of properties of the Lindley distribution such as moments, failure rate function, mean residual life function, and maximum likelihood estimation, among many others. The Lindley distribution is an example of an exponential family and has been expressed as a mixture of two well-known distributions, Exponential and Gamma distributions such that its probability distribution function (pdf), f ( x ) in Equation (1) can be written as
f ( x ) = w f 1 ( x ) + ( 1 w ) f 2 ( x ) , x > 0 ,
where f 1 ( x ) Exp ( θ ) , f 2 ( x ) Γ ( 2 , θ ) , w = θ 1 + θ , and θ > 0 is the scale parameter.
In recent years, studies involving Lindley distribution have gained momentum and have shown great success in areas such as modeling and analyzing survival, reliability, and failure lifetime-related data. One advantage of the Lindley distribution over the exponential distribution is the fact that it possesses an increasing hazard rate function and a decreasing mean residual life function whereas in the latter both functions are constant; see more details in Ghitany et al. [3]. These studies are very important and common in applied sciences such as construction, engineering, medicine, insurance, banking, and many others. Several authors have studied possible applications of the Lindley distribution. Such studies stem from distribution properties (Ghitany et al. [3], Lindley [1]), bayesian estimation (Ali et al. [4], Ali et al. [4]), estimation of the stress–strength reliability parameter (Al-Mutairi et al. [5]) and load-sharing parallel system model and its application (Singh and Gupta [6]). Currently, there are several generalizations and extensions of the Lindley distribution such as the double Lindley distribution, the Kumaraswamy quasi Lindley distribution, the Log-Lindley distribution, the exponentiated power Lindley distribution, and the generalized weighted Lindley distribution, among others; see, for example, Satheesh Kumar and Jose [7], Ibrahim et al. [8], Nedjar and Zeghdoudi [9], Ramos and Louzada [10], Gómez-Déniz et al. [11], and Elbatal and Elgarhy [12].
Definition 1.
Let X L i n d l e y ( θ ) . The pdf and cdf of X are given by
f ( x ) = θ 1 + θ ( 1 + x ) e θ x , x > 0 , θ > 0
and
F X ( x ) = 1 θ + 1 + θ x 1 + θ e θ x , x > 0 , θ > 0 ,
respectively, and θ is the scale parameter.
Its mean and variance are given as
Mean : μ = θ + 2 θ ( θ + 1 )
and
Variance : σ 2 = θ 2 + 4 θ + 2 θ 2 ( θ + 1 ) 2
Figure 1 provides some examples of the forms that the density function in Equation (1) can take for selected values of the scale parameter θ .
We note that both method of moments (MoM) and maximum likelihood (ML) estimates of the scale parameter θ are the same and defined as follows.
Definition 2.
Let X 1 , X 2 , , X n be a random sample from the Lindley distribution, the maximum likelihood (ML) or method of moments (MoM) estimate of the scale parameter θ is given as
θ ^ = ( X ¯ 1 ) + ( X ¯ 1 ) 2 + 8 X ¯ 2 X ¯ , X ¯ > 0 ,
where X ¯ is the sample mean.
Ghitany et al. [3] proved that in this estimate, θ ^ is positively biased, that is, E θ ^ θ > 0 . In addition, the estimate is consistent and asymptotically normal with mean zero and standard deviation 1 σ , where σ 2 is given in Equation (4); for further details, see, for example, Ghitany et al. [3]. We will use this estimate of the scale parameter θ in the development of the energy goodness-of-fit statistic for the Lindley distribution. These well-established properties of the Lindley distribution to provide a desirable environment for the construction of goodness-of-fit tests. Although we have seen a significant amount of research on the Lindley distribution and its applications, there is limited research in the literature on the goodness-of-fit test for the distribution. Authors such as Noughabi and Noughabi [13], Noughabi and Noughabi [14], and Noughabi [15] have proposed a goodness-of-fit test for the Lindley distribution based on estimates of the Gini index, Empirical likelihood ratio (ELR) and Kullback–Leibler (KL) discrimination, respectively. In the literature, different goodness-of-fit tests for different statistical distributions have been proposed, see, for example, Best and Rayner [16], Shan et al. [17], Vexler et al. [18], and Rizzo [19], Noughabi and Noughabi [20], among many others. For example, Vexler et al. [18] and Vexler and Gurevich [21] applied the empirical likelihood ratio to test goodness-of-fit for inverse gaussian distribution and on sample entropy, respectively. Best and Rayner [16] proposed smooth tests of fit for the Lindley and Poisson–Lindley distributions. Ning and Ngunkeng [22] and Gupta and Chen [23] proposed goodness-of-fit tests for the Skew-normal distribution based on the empirical likelihood ratio and Pearson’s χ 2 tests, respectively. Other tests available in the literature are the well-known and common EDF tests, which are based on empirical distribution functions, see, for example, Stephens [24] and Stephens [25]. We thus introduce a more competitive testing procedure based on energy statistics proposed by Sźekely [26] and Rizzo [27]. Sźekely and Rizzo [28] have demonstrated extensively the superiority of energy-based tests over many other tests, such as EDF-based, Gini index and Kullback–Leibler discrimination tests.
In this article, we propose a one-sample (univariate) goodness-of-fit test based on energy statistics ( Sźekely [26] and Rizzo [27]). In a more recent work, Logan and Ning [29] proposed a goodness-of-fit test using energy statistics for the Skew-normal distribution and Ofosuhene [30] developed a goodness-of-fit test based on energy statistics and distance correlation for inverse-gaussian distribution. There are a few other research works in the literature involving the energy goodness-of-fit test for some known distributions, see, for example, Sźekely and Rizzo [28] and Rizzo [19]. For a given sequence of independent random variables of size n and with a cdf G ( x ) , the test statistic based on energy statistics will reject the null that F ( x ) = G ( x ) for large values of test statistics. If the null distribution F ( x ) and the given data come from the same underlying distribution, then values of the test statistic are expected to be small. In addition, there are several other studies involving energy distance statistics such as testing for multivariate normality (Sźekely and Rizzo [31]), testing for equality of distributions (Sźekely and Rizzo [32], Rizzo [33]), and change point analysis (Njuki and Ning [34], Njuki [35], Matterson and James [36], Kim et al. [37]), among many others.
The energy distance is defined as a statistical distance between the distributions of random vectors which characterizes equality of distributions, see, for example, Sźekely and Rizzo [28,38,39] and Sźekely [26]. The concept of energy statistics initially described by Sźekely [40] is based on the notion of Newton’s gravitational potential energy, which is a function of the distance between two bodies; for further details, see Sźekely and Rizzo [38]. The idea of energy statistics therefore is to consider statistical observations as heavenly bodies governed by a statistical potential energy, which is zero if and only if an underlying statistical null hypothesis is true, see, for example, Sźekely and Rizzo [28,38]. In this work, we propose a procedure that is more superior for the goodness-of-fit test for the Lindley distribution (Lindley [1])-based energy statistics. Unlike the proposed method based on energy statistics, most of the existing methods depend on the (empirical) distribution function of random variables. Energy-type tests have been shown to be typically more powerful against general alternatives than corresponding tests, especially those based on classical statistics (non-energy type) such as Kolmogorov–Smirnov, Anderson Darling (Anderson and Darling [41]), Watson statistic (Watson [42]), and Kuiper statistic (Kuiper [43]). Furthermore, energy statistic-based tests have an additional advantage such that these tests have an invariance property with respect to any distance-preserving transformation of the dataset, see, for example, Sźekely and Rizzo [28,38] and Matterson and James [36]. Tests based on energy statistics can be easily extended to multivariate and high dimension settings, see, for example, Sźekely and Rizzo [31], Rizzo [33] and Sźekely and Rizzo [32].
We organize this article as follows. In Section 2, we propose a testing procedure based on energy statistics for the goodness-of-fit test and establish its theoretical results on the Lindley distribution. We perform different simulations in Section 3 to compare the results of our energy goodness-of-fit test with other existing ones. In Section 4, we demonstrate the application of our method using three real-life datasets. The conclusion is provided in Section 5. The proofs to new results and other supplemental materials are given in the Appendix A.

2. The Energy Goodness-of-Fit Statistic for the Lindley Distribution

We propose a test procedure for the goodness-of-fit test based on the energy statistics for the Lindley distribution. Next, we define the energy goodness-of-fit test based on the characterization of equal distributions.
Definition 3.
(Energy distance). If X and Y are independent random vectors with E ( X ) < and E ( Y ) < , then the energy distance between X and Y is defined as follows:
E ( X , Y ) = 2 E | X     Y |     | X     X |     | Y     Y |     0 ,
X = d X , Y = d Y , and equality holds if and only if X and Y are identically distributed.
The energy distance between two independent distributions is thus given by Equation (6) and hence, under the null hypothesis, it is assumed that the data follow a null (Lindley) distribution against a general alternative hypothesis. Let x = { x 1 , , x n } be a sample of random observations from the distribution F and X F 0 , null distribution, then the empirical one-sample goodness-of-fit test based on energy statistics for testing H 0 : F = F 0 versus H 1 : F F 0 is defined as
n E n ( x , X ) = n 2 n i = 1 n E | x i     X |     E | X     X |     1 n 2 i = 1 n j = 1 n | x i x j | ,
where X and X are independently and identically distributed random variables from the null distribution, F 0 . The null hypothesis, F = F 0 , is rejected for large values of the test statistic n E n , where F 0 Lindley ( θ ) in our case. Under the null hypothesis, the limiting distribution of n E n ( x , X ) is a quadratic quantity of the form j = 1 λ j Z j 2 such that Z j , j = 1 , 2 , , are i.i.d. standard normal random variables and λ j are nonnegative constants that depend on the null distribution. Thus, the goodness-of-fit test can be implemented by finding the constants λ j . In practice, this could be difficult, and we therefore resort to the use of the Monte Carlo simulation approach to obtain empirical critical values of n E n so that P ( n E n > c α ) = α . This fact is guaranteed since the test based on n E n is a consistent goodness-of-fit test against all general alternatives, see, for example Sźekely and Rizzo [31] and Móri et al. [44].
The establishment of the energy statistic-based goodness-of-fit test for the Lindley distribution in Equation (7) depends on the expected distances E | x i X | and E | X X | , where X and X are i.i.d. copies from the Lindley distribution.
Proposition 1.
Let X Lindley ( θ ) , then for any fixed x R
E | x X | = x + 2 x e θ x 1 + θ + 2 e θ x 1 + θ + 4 e θ x θ ( 1 + θ ) + θ 2 θ ( 1 + θ ) 2 1 + θ
A full proof of Proposition 1 is provided in Appendix A.
Proposition 2.
Let X and X be independent and identically distributed random variables from a Lindley distribution with the shape parameter θ . Then,
E | X     X | = 2 θ 2 + 6 θ + 3 2 θ ( 1 + θ ) 2
The detailed proof of Proposition 2 involving integration by parts and expansion of the joint expectation is deferred to Appendix A.
Proposition 3.
Let X 1 , X 2 , , X n be a sequence of independent random variables from the null distribution F 0 and let X ( 1 ) , X ( 2 ) , , X ( n ) be the ordered sample. Then, the last term of the test statistic n E n in Equation (7) can be linearized as follows;
1 n 2 i = 1 n j = 1 n | x i x j | = 2 n 2 k = 1 n ( ( 2 k 1 ) n ) x ( k )
Proof. 
The proof of Proposition 3 is given by Proposition 3.3 of Ofosuhene [30] and Rizzo [27]. □
This linearization will reduce the computational complexity of the test from O ( n 2 ) to O ( n log n ) , which is useful during extensive operations and simulations. Thus, the one-sample (univariate case) energy goodness-of-fit test for the Lindley distribution based on Equation (7) together with Propositions 1, 2, and 3 is defined as
Q n = n E n ( x , X ) = n { 2 n i n x i + 2 x i e θ x i 1 + θ + 2 e θ x i 1 + θ + 4 e θ x i θ ( 1 + θ ) + θ 2 θ ( 1 + θ ) 2 1 + θ 2 θ 2 + 6 θ + 3 2 θ ( 1 + θ ) 2 2 n 2 k = 1 n ( ( 2 k 1 ) n ) x ( k ) } ,
where x = { x 1 , , x n } is the observed sample values and X Lindley ( θ ) , θ > 0 .

3. Simulations and Discussions

In this section, we perform Monte Carlo simulations to investigate the performance of our proposed test in terms of controlling the Type I error and power comparisons for varying values of the scale parameter θ . In practice, the goodness-of-fit tests based on the empirical distribution function (EDF) are most commonly used; we thus compare powers of the proposed test and the EDF tests under various combinations of alternative distributions and sample sizes. These popular and well-known tests are Kolmogorov–Smirnov ( K S ), Anderson–Darling ( A 2 ), Cramer-von-Mises ( W 2 ), Watson ( U 2 ), and Kuiper (V) statistics. Throughout this article, we consider the number of simulations B to be 10,000.

3.1. Simulated Type I Error (Size of the Test) and Empirical Critical Value

We find the empirical Type I error and critical values of the proposed test under the Lindley ( θ ) distribution assumption. For these simulations, we consider the nominal levels of α = 0.01 , 0.05 , 0.10 and random samples of sizes n = 10 , 25 , 50 , 100 , 150 , 200 , 300 , 500 , 1000 . The algorithm applied to find the empirical critical values and the simulated Type I error is as follows.
  • Generate lifetime dataset of size n from a Lindley ( θ ) distribution.
  • Compute the energy goodness-of-fit statistic for the data using the formula in Equation (11).
  • Repeat Steps 1 and 2 B times and obtain energy goodness-of-fit statistics Q n j , where j = 1 , 2 , , B .
  • Obtain the empirical critical values by computing a 95% quantile of energy statistics computed in Step 3 and denote it as Q n .
  • Repeat Steps 1 through 4 and compute the empirical Type I error as the number of times the energy goodness-of-fit statistic Q n j is greater than the critical value Q n .
Table 1 gives the simulated critical values, and they remain stable across a wide range of sample sizes and nominal levels, indicating consistency in the test’s threshold behavior. Empirical Type I errors are reported in Table 2 for selected values of the scale parameter θ . These simulated errors confirm that our proposed test is accurately able to control the Type I error rate near the nominal level α = 0.05 for θ = 0.5 , 1 , 2 , 5 and sample sizes n = 10 , 25 , 50 , 100 , 200 , 500 . We note that these findings suggest the test is reliable and robust under the null hypothesis across different distributional shapes and sample sizes.

3.2. Power Comparisons Under Different Alternative Distributions

In this section, we perform simulations for B times under different alternative distributions to investigate the proposed test in terms of power. We consider sample sizes n = 10 , 25 , 50 , 100 , 200 with the nominal level α = 0.05 . The single-parameter alternative distributions considered in this study are reported in Table 3 and their respective parameters specified in Table 4. These alternatives cover probability densities with decreasing failure rate, increasing failure rate, unimodal, and bathtub failure rate functions. To compare its performance, we consider the common and well-known goodness-of-fit tests based on distribution functions, namely, the Kolmogorov–Smirnov test ( K S ), the Kuiper test (V), the Cramer-von-Mises test ( W 2 ), the Watson’s test ( U 2 ), and the Anderson–Darling test ( A 2 ) statistics. The empirical power of our proposed test can be estimated using the following algorithm.
  • Obtain the critical value under the Lindley ( θ ) distribution assumption as explained in Section 3.1.
  • Generate a dataset x 1 , , x n from the desired alternative distribution.
  • Process the data as if they were from a Lindley ( θ ) distribution and use the Maximum Likelihood Estimation (MLE) formula in Equation (5) to find the estimator θ ^ of the scale parameter θ .
  • Compute the energy goodness-of-fit statistic for the dataset x 1 , , x n using Equation (11).
  • Based on the energy goodness-of-fit statistic and the critical value in Step 1, determine whether or not the null hypothesis is rejected.
  • Repeat Steps 1 through 5 for B times and calculate the empirical power as the fraction of rejections of the test in B times.
The empirical powers for the EDF (non-energy) tests considered in this study are also obtained in a similar manner. The results and comparisons are reported in Table 4. We notice that all considered methods are consistent and their powers increase with the increase in sample sizes. In Table 4, the proposed test has outperformed other existing tests against all choices of alternative distributions considered in this study. More importantly, we noticed that our proposed test has a larger power even for small sample sizes such as n = 10 and n = 25 , , and it is evident in the case of uniform distribution. Also, it is worth to note that for large samples, the Anderson–Darling and Cramer-von-Mises test statistics are competitive in most cases and their powers are more or less the same as that of our proposed test. For small sample sizes, there is a mixture of results for the EDF-based tests, and the Anderson–Darling test has the lowest power when n = 10 except for the exponential model.

4. Application to the Real-Life Datasets

In this section, the performance of the proposed test is investigated using three real datasets. We will apply the test to determine whether or not the underlying distribution is the Lindley one. We use the bootstrap algorithm process through simulations to find the approximate p-value associated with the proposed test as described below.
  • Fit the original data x 1 , , x n with a Lindley distribution, Lindley ( θ ) , and use the formula in Equation (5) to obtain the maximum likelihood estimate (MLE) of θ .
  • Use the formula in Equation (11) to calculate the energy goodness-of-fit statistic of the original data and denote it Q n 1 .
  • Simulate y 1 , , y n , a random sample of size n , from a Lindley ( θ ^ ) distribution, where θ ^ is obtained in step 1.
  • Compute the energy goodness-of-fit statistic using the formula in Equation (11) for the simulated data and denote it Q n 1 .
  • Repeat this simulation procedure for B times and obtain B energy goodness-of-fit statistics and denote them as Q n 1 , , Q n B .
  • The p-value will then be approximated as
    p ^ = 1 B b = 1 B I Q n b Q n 1
    where I ( · ) is an indicator function that takes the value of one when Q n b Q n 1 and zero otherwise.
In our applications, the first dataset is listed in Table A1, which contains waiting times (in minutes) before service for 100 bank customers analyzed in [3]. The maximum likelihood estimate (MLE) of the scale parameter is θ ^ = 0.1866 , and its corresponding value of our test statistic is n E n = 2.3598 . Since the simulated p-value is approximately 0.9243 , we do not reject the null that the data follow the Lindley distribution. Other tests except the Cramer-von-Mises statistic suggested that the data follow the Lindley distribution. Their estimated p-values and test statistics are reported in Table A4.
The second dataset provided in Table A2 contains failure times of the electronic components in an accelerated life test. The data are reported and analyzed by Lawless [45]. The MLE of our test statistic is θ ^ = 0.0702 , and its corresponding value of test statistic is n E n = 6.0811 . Since the simulated p-value is approximately 0.9357 , we do not reject the null that the data follow the Lindley distribution. We observe in Table A5 a similar conclusion for other tests except for the Cramer-von-Mises test statistic, which fails to support the underlying distribution assumption as the Lindley one.
Table A3 provides our last dataset, which contains the quality of the yarn by recording the number of stress cycles needed before the yarn breaks. This dataset was originally analyzed by [46], and Shanker and Fesshaye [47] used the same dataset as an illustrative example in their analysis. The MLE of our test statistic is θ ^ = 0.0090 , and the test statistic value is n E n = 152.3379 . The empirical p-value is approximately 0.3954 , and therefore we do not reject the null hypothesis that the data follow the Lindley distribution. In addition, the Kolmogorov–Smirnov test statistic, the Anderson–Darling test statistic, the Kuiper test statistic, and the Watson test statistic provide little to no evidence to reject the null hypothesis that the data do follow the Lindley distribution. The Cramer-von-Mises test statistic again fails to support the fact that the data can be modeled with the Lindley distribution. The approximate p-values and their corresponding test statistics are reported in Table A6.
Finally, Figure 2a, Figure 3a, and Figure 4a provide density estimates for each of our three datasets considered in this study. In addition, Figure 2b, Figure 3b, and Figure 4b compare empirical and theoretical distribution functions for these real datasets. In both situations, the Lindley distribution seems to fit the datasets adequately. Surprisingly, it is worthy to note that the Cramer-von-Mises test rejected the null hypothesis for all of our empirical applications.

5. Conclusions

We have proposed a goodness-of-fit test based on energy statistics for the Lindley distribution. Under the null assumption, the proposed test is simple, robust, and computationally efficient to implement for any size of the sample. Monte Carlo simulations are carried out to evaluate the finite sample performance of the proposed test and compare the testing method with other similar existing tests in terms of controlling Type I errors and powers. It is evident that our test outperformed other tests especially for smaller sample sizes.
We successfully applied our proposed procedure to three real-life datasets to demonstrate its applicability.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/stats8040087/s1, Proof of Proposition 1; Prooof of Proposition 2.

Author Contributions

J.N.: Conceptualization, Methodology, Validation, Formal analysis, Investigation, Software, Resources, Supervision, Project Administration, Visualization, Writing—original draft preparation, Writing—review and editing; R.A.: Methodology, Software, Validation, Investigation, Writing—original draft preparation, Visualization, Formal analysis, Data curation, Writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The source of the datasets are provided in the paper.

Acknowledgments

The authors are grateful to three anonymous reviewers for their helpful comments on improving the contents of this article.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Appendix A.1

We present proofs of results in the article and supporting materials from our simulations and data applications.
Proof of Proposition 1. 
Let X Lindley ( θ ) and x R . Then;
E | x X | = 0 | x y | f X ( y ) d y = 0 x ( x y ) f X ( y ) d y + x ( y x ) f X ( y ) d y = x 0 x f X ( y ) d y 0 x y f X ( y ) d y + x y f X ( y ) d y x x f X ( y ) d y = x F X ( x ) 0 x y f X ( y ) d y + 0 y f X ( y ) d y 0 x y f X ( y ) d y x ( 1 F X ( x ) ) = 2 x F ( x ) x + E ( X ) 2 0 x y f X ( y ) d y = 2 x 1 ( 1 θ θ x 1 + θ ) e θ x x + θ + 2 θ ( 1 + θ ) 2 θ 1 + θ ( x e θ x x 2 e θ x ) + ( e θ x + 1 2 x e θ x θ ) 2 x e θ x 2 θ 2 = x + 2 x e θ x 1 + θ + 2 e θ x 1 + θ + 4 e θ x θ ( 1 + θ ) + θ 2 θ ( 1 + θ ) 2 1 + θ ,
where the CDF F ( x ) and mean E ( X ) of the Lindley distribution are explicitly given in Equation (2) and Equation (3), respectively. □
Proof of Proposition 2. 
Let X , X i i d Lindley ( θ ) .
E | X X | = 0 0 | x y | f X ( x ) f X ( y ) d y d x = 2 0 0 x ( x y ) f X ( x ) f X ( y ) d y d x = 2 0 0 x ( x y ) θ 2 1 + θ ( 1 + x ) e θ x θ 2 1 + θ ( 1 + y ) e θ y d y d x = 2 m 2 0 0 x ( x + x 2 + x 2 y y y 2 x y 2 ) e θ x e θ y d y d x , m = θ 2 1 + θ . = 2 m 2 3 4 θ 3 + 7 4 θ 4 3 8 θ 5 1 4 θ 5 + 2 θ 5 1 4 θ 3 1 4 θ 4 5 8 θ 5 = 2 θ 2 1 + θ 2 2 4 θ 3 + 6 4 θ 4 + 6 8 θ 5 = 2 θ 2 + 6 θ + 3 2 θ ( 1 + θ ) 2 .
The complete proofs of Propositions 1 and 2 are provided in the attached supplementary file. Below are results and datasets used in our empirical applications presented.
Table A1. Waiting time (in minutes).
Table A1. Waiting time (in minutes).
0.80.81.31.51.81.91.92.12.62.7
2.93.13.23.33.53.64.04.14.24.2
4.34.34.44.44.64.74.74.84.94.9
5.05.35.55.75.76.16.26.26.26.3
6.76.97.17.17.17.17.47.67.78.0
8.28.68.68.68.88.88.98.99.59.6
9.79.810.710.911.011.011.111.211.211.5
11.912.412.512.913.013.113.313.613.713.9
14.115.415.417.317.318.118.218.418.919.0
19.920.621.321.421.923.027.031.633.138.5
Table A2. Failure times of electronic components.
Table A2. Failure times of electronic components.
1.45.16.310.812.1
18.519.722.223.030.6
37.346.353.959.866.2
Table A3. Number of stress cycles.
Table A3. Number of stress cycles.
8614625165398249400292131169
175176762641536419526288264
15722042321180198382061121
28222414918032525019690229166
38337651513414040135597246
2111809331535357112427981186
49718242318522940033829039871
24618518856855556124420284
393396203829239236286194277143
198264105203124137135350193188
Table A4. Computed test statistics of waiting time data for different tests.
Table A4. Computed test statistics of waiting time data for different tests.
Test StatisticStatistic Valuep Value
n E n 2.35980.9243
K S 0.68190.7304
V0.05540.9998
W 2 1.10850.0017
U 2 0.05850.6255
A 2 0.49010.7632
Table A5. Computed test statistics of electronic life test data for different tests.
Table A5. Computed test statistics of electronic life test data for different tests.
Test StatisticStatistic Valuep Value
n E n 6.08110.9357
K S 0.45030.9843
V0.03831.0000
W 2 0.90290.0065
U 2 0.03880.8588
A 2 0.33790.9210
Table A6. Computed test statistics of stress cycles data for different tests.
Table A6. Computed test statistics of stress cycles data for different tests.
Test StatisticStatistic Valuep Value
n E n 152.33790.3954
K S 0.99070.1744
V0.16151.0000
W 2 1.66300.0000
U 2 0.18620.0527
A 2 0.91200.3855

References

  1. Lindley, D.V. Fiducial distribution and Bayes’ theorem. J. R. Stat. Soc. Ser. B (Methodol.) 1958, 20, 102–107. [Google Scholar] [CrossRef]
  2. Lindley, D.V. Introduction Probability and Statistics from a Bayesian Viewpoint; Part II: Inference; Cambridge University Press: New York, NY, USA, 1965. [Google Scholar]
  3. Ghitany, M.E.; Atieh, B.; Nadarajah, S. Lindley distribution and its application. Math. Comput. Simul. 2008, 78, 493–506. [Google Scholar] [CrossRef]
  4. Ali, S.; Aslam, M.; Kazmi, S.M.A. A study of the effect of the loss function on Bayes estimate, posterior risk and hazard function for Lindley distribution. Appl. Math. Model. 2013, 37, 6068–6078. [Google Scholar] [CrossRef]
  5. Al-Mutairi, D.K.; Ghitany, M.E.; Kundu, D. Inferences on the stress-strength reliability from Lindley distributions. Commun. Stat. -Theory Methods 2013, 42, 1443–1463. [Google Scholar] [CrossRef]
  6. Singh, B.; Gupta, P.K. Load-sharing system model and its application to the real data set. Math. Comput. Simul. 2012, 82, 1615–1629. [Google Scholar] [CrossRef]
  7. Satheesh Kumar, C.; Jose, R. On Double Lindley Distribution and Some of its Properties. Am. J. Math. Manag. Sci. 2018, 38, 23–43. [Google Scholar] [CrossRef]
  8. Ibrahim, M.; Yadav, A.S.; Yousof, H.M.; Goual, H.; Hamedani, G. A new extension of Lindley distribution: Modified validation test, characterizations and different methods of estimation. Commun. Stat. Appl. Methods 2019, 26, 475–493. [Google Scholar] [CrossRef]
  9. Nedjar, S.; Zeghdoudi, H. On gamma Lindley distribution: Properties and simulations. J. Comput. Appl. Math. 2016, 298, 167–174. [Google Scholar] [CrossRef]
  10. Ramos, P.L.; Louzada, F. The generalized weighted Lindley distribution: Properties, estimation, and applications. Cogent Math. 2016, 3, 1256022. [Google Scholar] [CrossRef]
  11. Gómez-Déniz, E.; Sordo, M.A.; Calderín-Ojeda, E. The log-Lindley distribution as an alternative to the beta regression model with applications in insurance. Insur. Math. Econ. 2014, 54, 49–57. [Google Scholar] [CrossRef]
  12. Elbatal, I.; Elgarhy, M. Statistical properties of Kumaraswamy quasi Lindley distribution. Int. J. Math. Trends Technol. 2013, 4, 237–246. [Google Scholar]
  13. Noughabi, H.A.; Noughabi, M.S. Gini Index based goodness-of-fit test for Lindley distribution. Commun. Stat.-Simul. Comput. 2024, 54, 2065–2075. [Google Scholar] [CrossRef]
  14. Noughabi, H.A.; Noughabi, M.S. Empirical likelihood ratio-based goodness of fit test for Lindley distribution. Stat. Optim. Inf. Comput. 2024, 12, 869–881. [Google Scholar] [CrossRef]
  15. Noughabi, H.A. A powerful goodness-of-fit test for Lindley distribution with application to real data. Pak. J. Stat. Oper. Res. 2021, 17, 761–769. [Google Scholar] [CrossRef]
  16. Best, D.J.; Rayner, J.C.W. Smooth Tests of Fit for More Flexible Alternatives to the Exponential and Poisson: The Lindley and Poisson-Lindley Distributions; Working Paper; National Institute for Applied Statistics Research Australia, University of Wollongong: Wollongong, Australia, 2014; Volume 8, pp. 7–14. [Google Scholar]
  17. Shan, G.; Vexler, A.; Wilding, G.E.; Hutson, A.D. Simple and exact empirical likelihood ratio tests for normaluity based on moment relations. Commun. Stat.-Simul. Comput. 2010, 40, 141–158. [Google Scholar] [CrossRef]
  18. Vexler, A.; Shan, G.G.; Kim, S.G.; Tsai, W.M.; Tian, L.L.; Hutson, A.D. An empirical likelihood ratio based goodness-of-fit test for Inverse gausian distributions. J. Stat. Plan. Inference 2011, 141, 2128–2140. [Google Scholar] [CrossRef]
  19. Rizzo, M.L. New goodness-of-fit tests for Pareto distributions. ASTIN Bull. J. IAA 2009, 39, 691–715. [Google Scholar] [CrossRef]
  20. Noughabi, H.A.; Noughabi, M.S. On Testing the Adequacy of the Lindley Model and Power Study. Stat. Optim. Inf. Comput. 1970, 11, 719–729. [Google Scholar] [CrossRef]
  21. Vexler, A.; Gurevich, G. Empirical likelihood ratios applied to goodness-of-fit tests on sample entropy. Comput. Stat. Data Anal. 2010, 54, 531–545. [Google Scholar] [CrossRef]
  22. Ning, W.; Ngunkeng, G. An empirical likelihood ratio based goodness-of-fit test for skew normality. Stat. Methods Appl. 2013, 22, 209–226. [Google Scholar] [CrossRef]
  23. Gupta, A.K.; Chen, T. Goodness-of-fit tests for skew normal distribution. Commun. Stat.-Simul. Comput. 2001, 30, 907–930. [Google Scholar]
  24. Stephens, M.A. Tests Based on EDF Statistics. In Goodness-of-Fit Techniques; D’Agostino, R.B., Stephens, M.A., Eds.; Marcel Dekker: New York, NY, USA, 1986; pp. 97–193. [Google Scholar]
  25. Stephens, M.A. Edf statistics for goodness of fit and some comparisons. J. Am. Stat. Assoc. 1974, 69, 730–737. [Google Scholar] [CrossRef]
  26. Sźekely, G.J. E-Statistics: Energy of Statistical Samples; Technical Report No. 02–16; Bowling Green State University, Department of Mathematics and Statistics: Bowling Green, OH, USA, 2002. [Google Scholar]
  27. Rizzo, M.L. A New Rotation Invariant Goodness-of-Fit Test. Ph.D. Thesis, Bowling Green State University, Bowling Green, OH, USA, 2002. [Google Scholar]
  28. Sźekely, G.J.; Rizzo, M.L. The Energy of Data and Distance Correlation, 1st ed.; Chapman and Hall: London, UK, 2023. [Google Scholar]
  29. Logan, O.; Ning, W. Goodness-of-fit test for skew normality based on energy statistics. Random Oper. Stoch. Equ. 2020, 28, 227–236. [Google Scholar]
  30. Ofosuhene, P. The Energy Goodness-of-Fit Test for the Inverse Gaussian Distribution. Ph.D. Thesis, Bowling Green State University, Bowling Green, OH, USA, 2020. [Google Scholar]
  31. Sźekely, G.J.; Rizzo, M. A new test for multivariate normality. J. Multivar. Anal. 2005, 93, 58–80. [Google Scholar] [CrossRef]
  32. Sźekely, G.J.; Rizzo, M.L. Testing for Equal Distributions in high Dimension. InterStat 2004, 11. [Google Scholar]
  33. Rizzo, M.L. A Test of Homogeneity for Two Multivariate Populations, Physical and Engineering Sciences Section; American Statistical Association: Alexandria, VA, USA, 2003. [Google Scholar]
  34. Njuki, J.; Ning, W. Energy statistic-based modified information criterion for detecting the change in distribution. J. Appl. Stat. 2025, 1–23. [Google Scholar] [CrossRef]
  35. Njuki, J.M. Nonparametric Sequential Tests for Change Point Analysis Using Energy Statistics. Ph.D. Thesis, Bowling Green State University, Bowling Green, OH, USA, 2022. [Google Scholar]
  36. Matterson, D.S.; James, N.A. A nonparametric Approach for Multiple Change Point Analysis of Multivariate Data. J. Am. Stat. Assoc. 2014, 109, 334–345. [Google Scholar] [CrossRef]
  37. Kim, A.Y.; Marzban, C.; Percival, D.B.; Stuetzle, W. Using labeled data to evaluate change detectors in a multivariate streaming environment. Signal Process. 2009, 89, 2529–2536. [Google Scholar] [CrossRef]
  38. Sźekely, G.J.; Rizzo, M.L. The Energy of Data. Annu. Rev. Stat. Its Appl. 2017, 4, 447–479. [Google Scholar] [CrossRef]
  39. Sźekely, G.J.; Rizzo, M. A Class of Statistical Based on Distances. J. Stat. Plan. Inference 2013, 143, 1249–1272. [Google Scholar] [CrossRef]
  40. Sźekely, G.J. E-Statistics: Energy of Statistical Samples; Technical Report 03-05; BGSU, Department of Mathematics and Statistics: Bowling Green, OH, USA, 2000. [Google Scholar]
  41. Anderson, T.W.; Darling, D.A. A test of goodness of fit. J. Am. Stat. Assoc. 1954, 49, 765–769. [Google Scholar] [CrossRef]
  42. Watson, G.S. Goodness-of-fit tests on a circle. Biometrika 1961, 48, 109–114. [Google Scholar] [CrossRef]
  43. Kuiper, N.H. Tests concerning random points on a circle. Proc. Ned. Akad. Van Wet. Ser. A 1960, 63, 38–47. [Google Scholar] [CrossRef]
  44. Móri, F.T.; Sźekely, G.J.; Rizzo, M.L. On energy tests of normality. J. Stat. Plan. Inference 2021, 213, 1–15. [Google Scholar] [CrossRef]
  45. Lawless, J.F. Statistical Models and Methods for Lifetime Data; John Wiley & Sons: New York, NY, USA, 1982. [Google Scholar]
  46. Picciotto, R. Tensile Fatigue Characteristics of a Sized Polyester/Viscose Yarn and Their Effect on Weaving Performance. Master’s Thesis, Department of Textile Technology, North Carolina State University, Raleigh, NC, USA, 1970. [Google Scholar]
  47. Shanker, R.; Fesshaye, H. On modeling of lifetime data using Akash, Shanker, Lindley and exponential distributions. Biom. Biostat. Int. J. 2016, 3, 214–224. [Google Scholar] [CrossRef]
Figure 1. Lindley densities for selected values of θ .
Figure 1. Lindley densities for selected values of θ .
Stats 08 00087 g001
Figure 2. Banks’ waiting time (in minutes) for customers before service.
Figure 2. Banks’ waiting time (in minutes) for customers before service.
Stats 08 00087 g002
Figure 3. Failure times of the electronic components in an accelerated life test.
Figure 3. Failure times of the electronic components in an accelerated life test.
Stats 08 00087 g003
Figure 4. Number of stress cycles for yarns’ failure.
Figure 4. Number of stress cycles for yarns’ failure.
Stats 08 00087 g004
Table 1. Simulated critical values for the Lindley distribution.
Table 1. Simulated critical values for the Lindley distribution.
n α 0.010.050.10
105.88103.61022.7356
255.88333.61652.7349
505.85313.59782.7671
1005.71223.58322.7429
1505.79043.68322.7262
2005.76343.59602.7391
3005.52543.54172.7782
5005.73113.52072.7915
10005.83763.57652.6762
Table 2. Simulated Type I error control of the test for the nominal level α = 0.05 .
Table 2. Simulated Type I error control of the test for the nominal level α = 0.05 .
n θ = 0.5 θ = 1 θ = 2 θ = 5
100.05100.05000.05220.0516
250.04570.04500.04860.0504
500.05010.04930.04810.0502
1000.05330.04990.04930.0523
2000.05190.04610.05070.0514
5000.04960.05120.04840.0524
Table 3. Alternative distributions for the power comparison.
Table 3. Alternative distributions for the power comparison.
DistributionDensityNotation
Weibull f ( x ) = 1 θ exp ( x θ ) W ( θ )
Gamma f ( x ) = x θ 1 exp ( x ) Γ ( θ ) Γ ( θ )
Half-Normal 2 θ π exp ( x 2 θ 2 π ) H N ( θ )
Chen f ( x ) = θ x θ 1 exp ( x θ ) exp ( 1 e θ ) C H ( θ )
Uniform f ( x ) = 1 3 U ( 0 , 3 )
Modified Extreme Value f ( x ) = 1 exp 1 e x θ E V ( θ )
Exponential f ( x ) = exp ( x ) E x p ( θ )
Log-Normal f ( x ) = 1 x 2 π exp ( ( ln x θ ) 2 2 ) L N ( θ )
Dhillon f ( x ) = 1 + ( θ x ) λ e θ x λ x e θ x + 1 exp ( ln ( λ x e θ x + 1 ) ) D L ( θ )
Nakagami f ( x ) = 2 θ θ Γ ( θ ) x 2 θ 1 exp ( θ x 2 ) N K ( θ )
Linear Failure Rate f ( x ) = ( 1 + θ x ) exp ( x θ x 2 2 ) L F ( θ )
Table 4. Power comparisons at the level of significance α = 0.05 .
Table 4. Power comparisons at the level of significance α = 0.05 .
DistributionSample Size n n E n KS V W 2 U 2 A 2
100.16150.07620.07670.07700.08040.0505
250.25270.13180.13170.15600.14150.1330
W ( 1.3 ) 500.38760.21750.20060.26480.22320.2625
1000.64220.43280.38910.51820.43370.5497
2000.87950.72080.66910.80900.71130.8524
100.38200.05810.06870.06620.07270.0433
250.41660.07380.08650.08160.08710.0696
Γ ( 1.5 ) 500.50270.11280.12580.12970.13650.1375
1000.63810.20480.22270.23440.24240.2841
2000.80670.38030.38230.43370.42750.5515
100.12740.07410.07970.07830.07980.0534
250.21640.13440.12810.14710.12910.1231
H N ( 1 ) 500.37200.22240.20040.27200.21870.2429
1000.65050.43230.39000.52310.40800.4976
2000.91440.72410.69400.83040.69380.8214
100.16460.09550.10750.11120.10700.0787
250.32700.20490.19490.24700.19930.2065
C H ( 0.8 ) 500.56960.35910.35030.46420.36780.4327
1000.88360.67810.67310.80600.67280.7912
2000.99600.94850.95480.98660.94840.9855
100.67920.16570.24420.21350.22570.1572
250.94630.39890.60010.56310.52980.5253
U ( 0 , 3 ) 500.99880.74880.92910.90130.85450.9098
1001.00000.98520.99960.99880.99430.9990
2001.00001.00001.00001.00001.00001.0000
100.23570.13980.14120.15790.15040.1259
250.49130.28980.30450.37060.31590.3231
E V ( 1.5 ) 500.80730.56410.58300.70070.59500.6706
1000.98540.88850.90740.96510.90780.9583
2001.00000.99710.99900.99990.99841.0000
100.16520.05970.05010.05550.04850.0688
250.20190.06580.06010.07170.06200.0792
E x p ( 1 ) 500.24960.08660.07100.09940.07610.1158
1000.34240.14010.11130.16440.12020.1796
2000.48450.24650.18380.28670.20510.2998
100.41630.11660.13090.12650.13940.0979
250.62180.20980.25670.24060.28460.2510
L N ( 0.8 ) 500.85380.43560.55420.49110.59740.5962
1000.99010.80440.91170.86350.93400.9648
2001.00000.99500.99900.99800.99971.0000
100.62490.15460.17540.17970.19040.1270
250.86800.39880.39400.46560.43670.4611
D L ( 1.5 ) 500.97490.71800.71060.78640.77300.8309
1000.99940.97270.96600.98750.98290.9958
2001.00000.99990.99991.00001.00001.0000
100.12940.11290.11080.12080.12040.0825
250.27910.21590.19810.27240.21940.2193
L F ( 2 ) 500.50280.39420.35760.48040.38040.4493
1000.81700.68420.65280.79860.67660.7797
2000.99140.96170.94990.98700.95310.9848
100.19860.12060.12370.14120.13420.0971
250.41360.24780.23570.30300.25750.2695
N K ( 0.6 ) 500.70190.50040.46990.60910.49430.5902
1000.94310.81080.78780.90910.81350.9087
2000.99940.98760.98710.99760.98740.9985
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Njuki, J.; Avallone, R. Energy Statistic-Based Goodness-of-Fit Test for the Lindley Distribution with Application to Lifetime Data. Stats 2025, 8, 87. https://doi.org/10.3390/stats8040087

AMA Style

Njuki J, Avallone R. Energy Statistic-Based Goodness-of-Fit Test for the Lindley Distribution with Application to Lifetime Data. Stats. 2025; 8(4):87. https://doi.org/10.3390/stats8040087

Chicago/Turabian Style

Njuki, Joseph, and Ryan Avallone. 2025. "Energy Statistic-Based Goodness-of-Fit Test for the Lindley Distribution with Application to Lifetime Data" Stats 8, no. 4: 87. https://doi.org/10.3390/stats8040087

APA Style

Njuki, J., & Avallone, R. (2025). Energy Statistic-Based Goodness-of-Fit Test for the Lindley Distribution with Application to Lifetime Data. Stats, 8(4), 87. https://doi.org/10.3390/stats8040087

Article Metrics

Back to TopTop