Next Article in Journal / Special Issue
New Modified Burr III Distribution, Properties and Applications
Previous Article in Journal
The Local Antimagic Chromatic Numbers of Some Join Graphs
Previous Article in Special Issue
Discrete Pseudo Lindley Distribution: Properties, Estimation and Application on INAR(1) Process
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Sine Modified Lindley Distribution

1
Department of Statistics, Deva Matha College, Kuravilangad 686633, Kerala, India
2
Department of Statistics, St. Thomas College, Palai 686574, Kerala, India
3
Laboratoire de Mathématiques Nicolas Oresme (LMNO), Université de Caen Normandie, Campus II, Science 3, 14032 Caen, France
*
Author to whom correspondence should be addressed.
Math. Comput. Appl. 2021, 26(4), 81; https://doi.org/10.3390/mca26040081
Submission received: 30 September 2021 / Revised: 11 December 2021 / Accepted: 13 December 2021 / Published: 16 December 2021
(This article belongs to the Special Issue Computational Mathematics and Applied Statistics)

Abstract

:
The paper contributes majorly in the development of a flexible trigonometric extension of the well-known modified Lindley distribution. More precisely, we use features from the sine generalized family of distributions to create an original one-parameter survival distribution, called the sine modified Lindley distribution. As the main motivational fact, it provides an attractive alternative to the Lindley and modified Lindley distributions; it may be better able to model lifetime phenomena presenting data of leptokurtic nature. In the first part of the paper, we introduce it conceptually and discuss its key characteristics, such as functional, reliability, and moment analysis. Then, an applied study is conducted. The usefulness, applicability, and agility of the sine modified Lindley distribution are illustrated through a detailed study using simulation. Two real data sets from the engineering and climate sectors are analyzed. As a result, the sine modified Lindley model is proven to have a superior match to important models, such as the Lindley, modified Lindley, sine exponential, and sine Lindley models, based on goodness-of-fit criteria of importance.

1. Introduction

The last few years in applied sciences have been marked by the need and volume of data to be analyzed. To meet this need, new models have been proposed, and their improvement is a hot topic. These require, among other things, the underlying development of new (statistical or probabilistic) distributions. In this regard, one idea is to modify existing distributions in order to make the corresponding models more flexible and adaptable to several kinds of data. Hence, several modifications based on mathematical techniques have been proposed, generating distributions classified under “families of distributions”. The readers are referred to [1] for a bird’s-eye view. In recent times, the families described by ”trigonometric transformations" have gained a lot of interest because of their applicability and working capability in a variety of situations. Related to this topic, Refs. [2,3,4] were among the first to study the sinusoidal transformation that leads to the sine generated (S-G) family. For this family, the cumulative distribution function (cdf) and probability density function (pdf) are given by
F S ( x ; η ) = sin π 2 G x ; η , x R ,
and
f S ( x ; η ) = π 2 g x ; η cos π 2 G x ; η , x R ,
respectively, where G ( x ; η ) and g ( x ; η ) represent the cdf and pdf of a certain continuous distribution with a parameter vector denoted by η . Thus, the functions F S ( x ; η ) and f S ( x ; η ) are linked to a baseline or parent distribution determined beforehand, relying on the purpose of study. It is worth noting that the baseline cdf has not been supplemented with any additional parameters. The S-G family was developed as a viable substitute for the parent distribution; we can see it from the following first-order stochastic ordering (FOSO) property:
G ( x ; η ) F S ( x ; η )
for all x R , as well as the possibility of creating versatile statistical distributions that can accept a wide range of data. To make the statement clearer, the exponential distribution is used as a parent distribution by [2] to define the sine exponential distribution. The inverse Weibull (IW) distribution proposed by [5] is used as the reference distribution by [4], thus creating the sine IW (SIW) distribution. The sine power Lomax distribution investigated by [6] is one of the most recent works highlighting the importance of the S-G family. It enhances the parental power Lomax distribution on several functional aspects. Among the trigonometric families of distributions, a few of them, including the C-S family by [7], SKum-G family by [8], STL-G family by [9], and T-G family by [10], were influenced by these efforts.
In this research, we contribute to the developments of the S-G family by linking it to a particular one-parameter distribution introduced by [11]: the modified Lindley (ML) distribution. The sine ML (S-ML) distribution is thus introduced. In order to comprehend the outlined approach, a review of the ML distribution is essential. As a first comment, the ML distribution presented by [11] is achieved by implementing the tuning exponential function e θ x , with θ > 0 , to the Lindley distribution, with the motive of modifying its capabilities for new modeling perspectives. On the mathematical side, the cdf and pdf of the ML distribution are defined by
G M L ( x ; θ ) = 1 1 + e θ x θ x 1 + θ e θ x , if   x > 0 0 , if   x 0
and
g M L ( x ; θ ) = θ 1 + θ e 2 θ x 1 + θ e θ x + 2 θ x 1 , if   x > 0 0 , if   x 0 ,
respectively. Basically, the ML distribution satisfies the following FOSO property:
G L ( x ; θ ) G M L ( x ; θ ) G E ( x ; θ )
for all x R , where G L ( x ; θ ) and G E ( x ; θ ) represent the cdfs of the Lindley and exponential distributions, respectively. In this sense, the ML distribution constitutes a real alternative to these two classical distributions. The ML distribution is also identified as a linear combination of the exponential distribution with parameter θ and the gamma distribution with parameters ( 2 , 2 θ ) , and it has an “increasing-reverse bathtub-constant” hazard rate function (hrf). The real benefit is quite noteworthy; the ML model is superior to the Lindley and exponential models for the three data sets seen in [11]. A few inspired distributions enhancing or generalising the ML distribution were proposed for the purpose of optimality. These include the Poisson ML distribution by [12], wrapped ML distribution by [13], and discrete ML distribution by [14].
The immediate aim of the S-ML distribution is to use the S-G technique to enhance the effectiveness of the ML distribution on diverse data sets. In particular, thanks to the FOSO properties in Equations (3) and (6), it is a real and attractive alternative to the Lindley and ML distributions. Further exploration in the following research will reveal deeper motives. To summarise, the S-ML model’s utility and adaptability make it particularly appealing to fit data from various fields. Remarkably, the characterized pdf shows a variety of curve shapes, some of which have only one mode, are decreasing, and are asymmetrical to the right. In comparison to the pdf of the ML distribution, when it is unimodal, the pdf of the S-ML distribution has a more rounded peak, meaning that it is better adapted to fit a data histogram presenting a high kurtosis level. Furthermore, the S-ML distribution exhibits a non-monotonic hrf which is “increasing-reverse bathtub-constant” shaped. The hrf of the ML distribution has this feature as well. As with other competent models, the accuracy of the fits is persistent in the case of the S-ML model due to their characteristics. The claim is demonstrated by examining two published real-world data sets, primarily from engineering and climate data, against twelve competent models.
We prepare the rest of the paper in the following manner. The concept, quality, and key aspects of the S-ML distribution are covered in Section 2. A moment analysis is conducted in Section 3. The maximum likelihood estimation of the parameter θ is explained in Section 4. A simulation study is presented in Section 5. Section 6 assesses the proposed model’s applicability to real-world data. Finally, in Section 7, the conclusions are provided.

2. The S-ML Distribution

The mathematical foundation for the S-ML distribution is first presented.

2.1. Functional Analysis

To begin, we perform a functional analysis of the S-ML distribution. By substituting Equations (4) and (5) in Equations (1) and (2), respectively, we derive the major functions of the S-ML distribution; the cdf and pdf are given as follows
F S M L ( x ; θ ) = cos π 2 1 + e θ x x θ 1 + θ e θ x , if   x > 0 0 , if   x 0
and
f S M L ( x ; θ ) = π 2 θ 1 + θ e 2 θ x 1 + θ e θ x + 2 x θ 1 sin π 2 1 + e θ x x θ 1 + θ e θ x , if   x > 0 0 , if   x 0 ,
with θ > 0 . As a primary result mentioned in the introduction section, the following FOSO property holds: G M L ( x ; θ ) F S M L ( x ; θ ) for any x R , making an immediate difference between the ML and S-ML modeling from the cdf viewpoint. Differences can also be observed on the respective pdfs, as discussed below. Naturally, variant forms of f S M L ( x ; θ ) can be obtained by changing the value of θ . Due to the relative complexity of this function in the analytical sense, we propose a graphical study for shape analysis. The more representative shapes of this pdf are shown in Figure 1.
We can observe from Figure 1, that, for smaller values of θ , the plot of f S M L ( x ; θ ) is unimodal, and for larger values of θ , the plot of f S M L ( x ; θ ) is decreasing. As a result, the S-ML distribution is suitable for modeling a vast majority of lifetime phenomena. Compared to the parent ML distribution, the following observations are made: When it is unimodal, we observe that the pdf of the S-ML distribution has a more rounded peak, meaning that it is better adapted to fit a data histogram presenting a high kurtosis level. In other words, the S-ML model is more able to analyze data of a leptokurtic nature.

2.2. Reliability Analysis

We complete the previous functional analysis by studying the complementary reliability functions, such as the survival function (sf), hrf (for hazard rate function), reversed hrf (rhrf), second rate of failure (srf), and the cumulative hrf (chrf) of the S-ML distribution. In a broader sense, the sf measures the probability that the life of an item will survive beyond any specified time. Mathematically, the sf of the S-ML distribution is given by
S S M L ( x ; θ ) = 1 F S M L x ; θ = 1 cos π 2 1 + e θ x x θ 1 + θ e θ x , if   x > 0 1 , if   x 0 .
The hrf measures the likelihood of an item deteriorating or expiring depending on its lifetime. As a direct consequence, it is critical in the classification of survival distributions. The hrf of the S-ML distribution is specified by
h S M L ( x ; θ ) = f S M L ( x ; θ ) S S M L ( x ; θ ) = π 2 θ 1 + θ e 2 θ x 1 + θ e θ x + 2 x θ 1 cot π 4 1 + e θ x x θ 1 + θ e θ x , if   x > 0 0 , if   x 0 .
Further, Figure 2 displays the shapes of this hrf for various values of θ .
Figure 2 emphasizes that the hrf of the S-ML distribution has “increasing-reverse bathtub-constant” shapes, which is also possessed by the hrf of the ML distribution. This makes a solid difference between the Lindley and exponential distributions. It is also a desirable property for modelling purposes.
The rhrf is the ratio between the pdf to its cdf and it plays a role in analyzing censored data. Analytically, it corresponds to
r S M L ( x ; θ ) = π 2 θ 1 + θ e 2 θ x 1 + θ e θ x + 2 x θ 1 tan π 2 1 + e θ x x θ 1 + θ e θ x , if   x > 0 0 , if   x 0 .
The srf is the logarithmic ratio of the sf at time x and x + 1 , and it is given by
r S M L * ( x ; θ ) = ln 1 cos ( π / 2 ) 1 + e θ x x θ / ( 1 + θ ) e θ x 1 cos ( π / 2 ) 1 + e θ ( x + 1 ) ( x + 1 ) θ / ( 1 + θ ) e θ x + 1 , if   x > 0 0 , if   x 0 .
The chrf is the negative logarithm of sf and is given by
H S M L ( x ; θ ) = ln 1 cos π 2 1 + e θ x x θ 1 + θ e θ x , if   x > 0 0 , if   x 0 .
With these functions, we conclude different reliability analysis in regard with the S-ML distribution.

3. Moment Analysis

For any lifetime distribution, a moment analysis is necessary to handle numerically its modeling capacities, identifying the behavior of various central and dispersion moment parameters, as well as moment skewness and kurtosis coefficients.
As a first notion, for any positive integer r 1 , and a random variable X with the S-ML distribution, the r-th moment of X exists. It can be expressed as
mom ( r ) = E ( X r ) = 0 + x r f S M L ( x ; θ ) d x = π 2 θ 1 + θ 0 + x r e 2 θ x 1 + θ e θ x + 2 x θ 1 sin π 2 1 + e θ x x θ 1 + θ e θ x d x .
Integral developments in the classical sense are limited. Computer software, on the other hand, can be used to quantitatively evaluate it for a given θ .
We propose a series development of mom ( r ) in the next result, which can be used for computational purposes in a less opaq method than a “ready to use but black box” computer program.
Proposition 1.
The r-th moment of X can be expanded as
mom ( r ) = r θ r k = 1 + = 0 2 k 2 k ( 1 ) k + 1 ( 2 k ) ! π 2 2 k ( 1 + θ ) ( + r 1 ) ! ( + 2 k ) r + .
Proof. 
For the proof, we do not directly use the integral expression of mom ( r ) as described in (8). An integration by part gives
mom ( r ) = 0 + x r f S M L ( x ; θ ) d x = r 0 + x r 1 S S M L ( x ; θ ) d x .
Now, by utilizing the series expansion of the cosine function and the classical binomial formula, we obtain
S S M L ( x ; θ ) = 1 k = 0 + ( 1 ) k ( 2 k ) ! π 2 2 k 1 + e θ x x θ 1 + θ 2 k e 2 k θ x = k = 1 + ( 1 ) k + 1 ( 2 k ) ! π 2 2 k 1 + e θ x x θ 1 + θ 2 k e 2 k θ x = k = 1 + = 0 2 k 2 k ( 1 ) k + 1 ( 2 k ) ! π 2 2 k θ 1 + θ x e ( + 2 k ) θ x .
Hence, after some developments including the change of variable y = ( + 2 k ) θ x (so that d x = [ 1 / ( ( + 2 k ) θ ) ] d y ), and the calculus of gamma-type integral, we get
mom ( r ) = r 0 + x r 1 k = 1 + = 0 2 k 2 k ( 1 ) k + 1 ( 2 k ) ! π 2 2 k θ 1 + θ x e ( + 2 k ) θ x d x = r k = 1 + = 0 2 k 2 k ( 1 ) k + 1 ( 2 k ) ! π 2 2 k θ 1 + θ 0 + x r + 1 e ( + 2 k ) θ x d x = r θ r k = 1 + = 0 2 k 2 k ( 1 ) k + 1 ( 2 k ) ! π 2 2 k ( 1 + θ ) ( + r 1 ) ! ( + 2 k ) r + .
Proposition 1 is proved. □
Then, based on Proposition 1, the following finite sum approximation remains acceptable:
mom ( r ) r θ r k = 1 U = 0 2 k 2 k ( 1 ) k + 1 ( 2 k ) ! π 2 2 k ( 1 + θ ) ( + r 1 ) ! ( + 2 k ) r + ,
where U represents any large integer.
From the above moment formulas, we can easily derive the mean, variance, moment skewness coefficient and moment kurtosis coefficient; the mean is given by mom ( 1 ) , the variance is obtained as V ( X ) = E ( ( X mom ( 1 ) ) 2 ) , the moment skewness coefficient can be derived as MS = E ( ( X mom ( 1 ) ) 3 ) / V ( X ) 3 / 2 and the moment kurtosis coefficient can be derived as MK = E ( ( X mom ( 1 ) ) 4 ) / V ( X ) 2 .
Table 1 gives a glimpse of these values for different values of θ .
From Table 1, we can observe that, as the value of the parameter θ of the S-ML distribution increases, all the considered measures increase. Furthermore, since MS > 0 , it is clear that the S-ML distribution is mainly right-skewed, and since MK > 3 , it is mainly leptokurtic.
We can complete the previous moment results by investigating the incomplete moments. To begin, let r 1 be an integer, t 0 , and X be a random variable with the S-ML distribution. Based on this variable, we define its incomplete version by Y ( t ) = X if X t and Y ( t ) = 0 if X > t . Then, the r-th incomplete moment of X given at t exists, and it is defined by
mom ( r , t ) = E ( Y ( t ) r ) = 0 t x r f S M L ( x ; θ ) d x .
It is involved in developments of important probabilistic objects, such as mean deviations, income curves, etc. More basically, it can be viewed as a truncated version of the standard r-moment. We may refer to [15] in this regard.
In the next results, we present a series expansion of mom ( r , t ) , which can be used for approximation purposes.
Proposition 2.
The r-th incomplete moment of X given at t exists and can be expanded as
mom ( r , t ) = t r 1 cos π 2 1 + e θ t t θ 1 + θ e θ t + r θ r k = 1 + = 0 2 k 2 k ( 1 ) k + 1 ( 2 k ) ! π 2 2 k ( 1 + θ ) 1 ( + 2 k ) r + γ ( r + , ( + 2 k ) θ t ) ,
where γ ( a , t ) denotes the incomplete gamma function defined by γ ( a , t ) = 0 t x a 1 e x d x , where a > 0 and t 0 .
Proof. 
The proof follows the lines of the one of Proposition 1. An integration by part gives
mom ( r , t ) = 0 t x r f S M L ( x ; θ ) d x = t r S S M L ( t ; θ ) + r 0 t x r 1 S S M L ( x ; θ ) d x .
It follows from the series expansion in Equation (9) and the change of variable y = ( + 2 k ) θ x that
0 t x r 1 S S M L ( x ; θ ) d x = k = 1 + = 0 2 k 2 k ( 1 ) k + 1 ( 2 k ) ! π 2 2 k θ 1 + θ 0 t x r + 1 e ( + 2 k ) θ x d x = 1 θ r k = 1 + = 0 2 k 2 k ( 1 ) k + 1 ( 2 k ) ! π 2 2 k ( 1 + θ ) 1 ( + 2 k ) r + γ ( r + , ( + 2 k ) θ t ) .
Therefore
mom ( r , t ) = t r 1 cos π 2 1 + e θ t t θ 1 + θ e θ t + r θ r k = 1 + = 0 2 k 2 k ( 1 ) k + 1 ( 2 k ) ! π 2 2 k ( 1 + θ ) 1 ( + 2 k ) r + γ ( r + , ( + 2 k ) θ t ) .
This concludes the proof of Proposition 2. □
In some sense, Proposition 2 generalizes Proposition 1; by taking t + , Proposition 2 becomes Proposition 1.
The rest of the study is devoted to the applicability of the S-ML model, illustrated with concrete examples of data analysis.

4. Inferential Analysis

The inference of the S-ML model is covered in this section. The parameter θ is supposed to be unknown. In order to estimate it, the maximum likelihood estimation method is employed. We adopt the methodology as described in a broader context, as seen in [16].
Thus, the next is a mathematical representation of this methodology in the setting of the S-ML distribution. First, let n be a positive integer and x 1 , x 2 , , x n be observations drawn from a random variable X following the S-ML distribution. Then, the corresponding likelihood function and log-likelihood function are as follows
L = i = 1 n f S M L ( x i ; θ ) = π 2 n θ 1 + θ n e 2 θ i = 1 n x i i = 1 n 1 + θ e θ x i + 2 x i θ 1 × i = 1 n sin π 2 1 + e θ x i x i θ 1 + θ e θ x i ,
and
ln L = n ln π n ln 2 + n ln θ n ln 1 + θ 2 θ i = 1 n x i + i = 1 n ln 1 + θ e θ x i + 2 x i θ 1 + i = 1 n ln sin π 2 1 + e θ x i x i θ 1 + θ e θ x i ,
respectively. The maximum likelihood estimate (MLE) of θ can be defined via the following argmax definition:
θ ^ = argmax ln θ > 0 L .
This estimate can be formalized through the solution of the non-linear equations expressed as d ln L / d θ = 0 , where
d d θ ln L = n θ n 1 + θ 2 i = 1 n x i + i = 1 n e θ x i θ x i + x i + 1 + 2 x i ( 1 + θ ) e θ x i + 2 x i θ 1 + i = 1 n π 2 e θ x i θ x i 2 e θ x i θ + 1 + x i e θ x i θ + 1 θ x i e θ x i θ + 1 2 π 2 x i e θ x i 1 + θ x i e θ x i θ + 1 × cot π 2 1 + e θ x i x i θ 1 + θ e θ x i .
There is no analytical solution for this equation, but θ ^ can be determined at least numerically with any statistical software such as the R software (see [17]). Based on θ ^ , the estimated pdf (epdf) of the S-ML model is given by f S M L ( x ; θ ^ ) and the estimated cdf (ecdf) of the S-ML model is given by F S M L ( x ; θ ^ ) .
Let I ( θ ) = E [ d 2 ln [ f S M L ( X ; θ ) ] / d θ 2 ] be the expected Fisher information matrix. Then, the estimated standard error (SE) of θ is achieved by considering the value of the diagonal component of I ( θ ^ ) 1 raised to half.

5. Simulation Study

In the framework of the S-ML model, a simulation study is carried out to study the performance of θ ^ given as Equation (10) in terms of their bias (bias) and mean squared error (MSE). The simulated procedure can be described as follows:
We generate samples of sizes n = 20, 50, 100, 200, 500, 1000 from the S-ML distribution with θ = ( 1.25 , 1.50 , 2.00 , 2.50 ) . For each sample, the MLE θ ^ is calculated. Here, 1000 such repetitions are made to calculate the standard mean MLE (MMLE), bias and MSE of these estimates using the formula:
MMLE ( θ ^ ) = 1 1000 i = 1 1000 θ ^ i , Bias θ ( θ ^ ) = 1 1000 i = 1 1000 ( θ ^ i θ )
and
MSE θ ( θ ^ ) = 1 1000 i = 1 1000 ( θ ^ i θ ) 2 ,
respectively, where θ ^ i is the estimate of θ for each iteration in the simulation study; i is from 1 to 1000. The results of the study are reported in Table 2.
From Table 2, it is observed that as sample size n increases,
  • Bias decreases, which shows the accuracy of θ ^ ;
  • MSE decreases, which indicates the consistency (or preciseness) of θ ^ .

6. Applications of the S-ML Model

We use the S-ML model on two data sets based on the maximum likelihood method as introduced previously. The data differ in size, traits, and background, but they are all of current interest in their areas.

6.1. Method

We proceed as follows for each data set:
  • The data are presented briefly, accompanied with their reference;
  • A table that encapsulates the basic statistical measures of the data is provided;
  • The goodness-of-fit measures of the models under consideration are evaluated and arranged in order of model performance in a table;
  • The MLE(s) of the model parameters is(are) shown, as well as the relevant SEs, as supplementary work;
  • It is concluded with a visual concept by presenting the histogram of the data and the epdf, as well as the empirical cdf plots and ecdf for the S-ML model exclusively in another graph.
The adequacy measures that are used for model fitting are provided here. Suppose x 1 , x 2 , , x n represent the data and x ( 1 ) , x ( 2 ) , , x ( n ) be their ordered values. As an initial step, we consider the Cramér von-Mises, Anderson Darling, and Kolmogorov–Smirnov statistics defined by
A * = n i = 1 n 2 i 1 n ln ( F S M L ( x ( i ) ; θ ^ ) ) + ln ( S S M L ( x ( n + 1 i ) ; θ ^ ) ) ,
W * = 1 12 n + i = 1 n F S M L ( x ( i ) ; θ ^ ) 2 i 1 2 n 2
and
D n = max i = 1 , 2 , , n F S M L ( x ( i ) ; θ ^ ) i 1 n , i n F S M L ( x ( i ) ; θ ^ ) ,
respectively. The p-value of the Kolmogorov–Smirnov test linked to D n is also examined. Of course, the above definitions can be adapted to any other model than the S-ML model. The measures of adequacy are extensively employed to determine which model is best in terms of fitting the data set under study. The model having the least value for the W * or A * , and the highest p-value, is considered to give the best fit that is in correspondence with the data.
Furthermore, we consider the following goodness-of-fit measures: Akaike information criterion (AIC) and Bayesian information criterion (BIC), given as follows
AIC = 2 k 2 LL , BIC = 2 LL + k ln ( n ) ,
respectively, where LL is the value of the log-likelihood function taken at θ ^ and k, being the number of parameters of the model, here k = 1 for the S-ML model. As it is widely understood, the model with the lowest value for AIC or BIC is selected as the greatest player of models that fits the data set compared to the other models. For more information on the usage and the underlying meaning of the measures W * , A * , D n , AIC and BIC, we refer to [18].
In order to study the best fit of the S-ML model, we aim to compare it with some useful and competent models, which include the ML, Lindley, sine exponential and sine Lindley models listed in Table 3. It is worth noting that models with three parameters are also considered. The aim is to prove that our model can be efficient enough to outperform more complex models in the literature.

6.2. Precipitation Data Set

The data set has thirty consecutive values of precipitation (in inches) in the month of March in Minneapolis, as provided by [29] and recently used by [30]. The data are: (0.77, 1.74, 0.81, 1.2, 1.95, 1.2, 0.47, 1.43, 3.37, 2.2, 3, 3.09, 1.51, 2.1, 0.52, 1.62, 1.31, 0.32, 0.59, 0.81, 2.81, 1.87, 1.18, 1.35, 4.75, 2.48, 0.96, 1.89, 0.9, 2.05). The descriptive statistical measures of these data are presented in Table 4.
Based on the information in Table 4, the data are right-skewed and leptokurtic. The MLE, SE, and goodness-of-fit measures of the S-ML model and those of the other models for precipitation data set are given in Table 5 and Table 6.
We can observe from Table 5 that the S-ML model has the lowest statistics with the highest p-value, implying that it delivers a better fit than the other models studied. Comparing the models in Table 6, we can see that the lognormal model gives a better fit, while the S-ML model takes the second place, but with less modeling complexity in terms of the number of parameters. Figure 3 depicts the epdf and ecdf plots of the S-ML model for the precipitation data set.
From Figure 3, it is obvious that the S-ML model captures the histogram’s overall pattern and illustrates the comparison of the cdf with the empirical cdf of the S-ML model. The suitable behaviour of the S-ML model is further confirmed by these graphs. Apart from the lognormal model, the S-ML model clearly fits better than the Lindley, ML, S-Expo and S-Lindley, and other models.

6.3. Time between Failure Data Set

This data set refers to the time between failures for repairable items. It was obtained from [31]. The data are: (1.43, 0.11, 0.71, 0.77, 2.63, 1.49, 3.46, 2.46, 0.59, 0.74, 1.23, 0.94, 4.36, 0.40, 1.74, 4.73, 2.23, 0.45, 0.70, 1.06, 1.46, 0.30, 1.82, 2.37, 0.63, 1.23, 1.24, 1.97, 1.86, 1.17). The descriptive statistical measures of these data are presented in Table 7.
From Table 7, we can observe that the failure time data set is right-skewed and leptokurtic.
The MLE, SE, and goodness-of-fit measures of the S-ML model and those of the other models for the failure time data set are given in Table 8 and Table 9.
Table 8 and Table 9 show that, for the failure time data set, the S-ML model has the lowest statistics and the highest p-value, meaning that it provides a better match than the other models investigated.
Figure 4 depicts the epdf and ecdf plots of the S-ML model for the failure time data set.
From Figure 4, it is obvious that the S-ML model captures the histogram’s overall pattern and illustrates the comparison of the cdf with the empirical cdf of the S-ML model. The suitable behaviour of the S-ML model is further confirmed by these graphs.

7. Conclusions

The article’s major contribution is a flexible trigonometric extension of the well-known modified Lindley model that proposes a novel efficient statistical modelling technique. We employ the features of the sine generalized family of distributions in this regard, and develop the sine modified Lindley distribution. We have displayed a few of its more noteworthy attributes, with a focus on the shape properties of the corresponding probability density and hazard rate functions, as well as discussing moments. Simulation studies and applications demonstrate the utility of the model under consideration. In particular, we compared it to the primary current models derived from the Lindley, exponential and other models with one or more parameters, using two real-world data sets. As a result, the obtained findings are really satisfactory, demonstrating that the novel distribution has a wide range of applications that could be the subject of additional research in a variety of scientific fields.

Author Contributions

Conceptualization, L.T., V.G. and C.C.; methodology, L.T., V.G. and C.C.; software, L.T., V.G. and C.C.; validation, L.T., V.G. and C.C.; formal analysis, L.T., V.G. and C.C.; investigation, L.T., V.G. and C.C.; writing—original draft preparation, L.T., V.G. and C.C.; writing—review and editing, L.T., V.G. and C.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Acknowledgments

We would like to express our gratitude to the four reviewers for their useful comments on the work’s first draft.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Brito, C.R.; Rêgo, L.C.; Oliveira, W.R.; Gomes-Silva, F. Method for generating distributions and classes of probability distributions: The univariate case. Hacet. J. Math. Stat. 2019, 48, 897–930. [Google Scholar]
  2. Kumar, D.; Singh, U.; Singh, S.K. A new distribution using sine function: Its application to bladder cancer patients data. J. Stat. Appl. Probab. 2015, 4, 417–427. [Google Scholar]
  3. Souza, L. New Trigonometric Classes of Probabilistic Distributions. Ph.D. Thesis, Universidade Federal Rural de Pernambuco, Recife, Brazil, 2015. [Google Scholar]
  4. Souza, L.; Junior, W.R.O.; de Brito, C.C.R.; Chesneau, C.; Ferreira, T.A.E.; Soares, L. On the Sin-G class of distributions: Theory, model and application. J. Math. Model 2019, 7, 357–379. [Google Scholar] [CrossRef]
  5. Nelson, W. Applied Life Data Analysis; John Wiley & Sons: New York, NY, USA, 1982. [Google Scholar]
  6. Nagarjuna, V.B.V.; Vardhan, R.V.; Chesneau, C. On the Accuracy of the Sine Power Lomax Model for Data Fitting. Modelling 2021, 2, 5. [Google Scholar] [CrossRef]
  7. Souza, L.; Junior, W.R.O.; de Brito, C.C.R.; Chesneau, C.; Ferreira, T.A.E.; Soares, L. General properties for the Cos-G class of distributions with applications. Eurasian Bull. Math. 2019, 2, 63–79. [Google Scholar]
  8. Chesneau, C.; Jamal, F. The sine Kumaraswamy-G family of distributions. J. Math. Ext. 2021, 15, 1–33. [Google Scholar]
  9. Al-Babtain, A.A.; Elbatal, I.; Chesneau, C.; Elgarhy, M. Sine Topp-Leone-G family of distributions: Theory and applications. Open Phys. 2020, 18, 574–593. [Google Scholar] [CrossRef]
  10. Souza, L.; Junior, W.R.O.; de Brito, C.C.R.; Chesneau, C.; Ferreira, T.A.E.; Fernandes, L.R. Tan-G class of trigonometric distributions and its applications. Cubo 2021, 23, 1–20. [Google Scholar] [CrossRef]
  11. Chesneau, C.; Tomy, L.; Gillariose, J. A new modified Lindley distribution with properties and applications. J. Stat. Manag. Syst. 2021. [Google Scholar] [CrossRef]
  12. Chesneau, C.; Tomy, L.; G, V. The Poisson-modified Lindley distribution. Appl. Math. E Notes. in press.
  13. Chesneau, C.; Tomy, L.; Jose, M. Wrapped modified Lindley distribution. J. Stat. Manag. Syst. 2021. [Google Scholar] [CrossRef]
  14. Tomy, L.; G, V.; Chesneau, C. The discrete modified Lindley distribution. 2021; unpublished work. [Google Scholar]
  15. Cordeiro, G.M.; Silva, R.B.; Nascimento, A.D.C. Recent Advances in Lifetime and Reliability Models; Bentham Books: Sharjah, United Arab Emirates, 2020. [Google Scholar] [CrossRef]
  16. Casella, G.; Berger, R.L. Statistical Inference; Duxbury Advanced Series Thomson Learning: Pacific Grove, CA, USA, 2002. [Google Scholar]
  17. R Development Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2005. [Google Scholar]
  18. Konishi, S.; Kitagawa, G. Information Criteria and Statistical Modeling; Springer: New York, NY, USA, 2007. [Google Scholar]
  19. Lindley, D.V. Fiducial distributions and Bayes theorem. J. R. Stat. Soc. A 1958, 20, 102–107. [Google Scholar] [CrossRef]
  20. Kumar, D.; Singh, U.; Singh, S.K.; Chaurasia, P.K. Statistical properties and application of a Lifetime model using sine function. Int. J. Creat. Res. Thoughts 2018, 6, 993–1002. [Google Scholar]
  21. Chesneau, C.; Tomy, L.; Gillariose, J.; Jamal, F. The inverted modified Lindley distribution. J. Stat. Theory Pract. 2020, 14, 1–17. [Google Scholar] [CrossRef]
  22. Sharma, V.; Singh, S.; Singh, U.; Agiwal, V. The inverse Lindley distribution: A stress-strength reliability model with applications to head and neck cancer data. J. Ind. Prod. Eng. 2015, 32, 162–173. [Google Scholar] [CrossRef]
  23. Haq, M.A. Transmuted exponentiated inverse Rayleigh distribution. J. Stat. Appl. Prob. 2016, 5, 337–343. [Google Scholar] [CrossRef]
  24. Ahmad, A.; Ahmad, S.; Ahmed, A. Transmuted Inverse Rayleigh Distribution: A Generalization of the Inverse Rayleigh Distribution. Math. Theory Model. 2014, 4, 90–98. [Google Scholar]
  25. Voda, V.G. On the inverse Rayleigh distributed random variable. Rep. Statist. App. Res. 1972, 19, 13–21. [Google Scholar]
  26. Lomax, K.S. Business Failures, Another example of the analysis of failure data. J. Am. Stat. Assosciation 1954, 49, 847–852. [Google Scholar] [CrossRef]
  27. Aitchinson, J.; Brown, J.A.C. The Lognormal Distribution; Cambridge University Press: Cambridge, UK, 1957. [Google Scholar]
  28. Kalbfleisch, J.D.; Prentice, R.L. The Statistical Analysis of Failure Time Data; Wiley: New York, NY, USA, 1980. [Google Scholar]
  29. Hinkley, D. On quick choice of power transformation. Appl. Stat. 1977, 26, 67–69. [Google Scholar] [CrossRef]
  30. Yusuf, A.; Mikail, B.B.; Aliyu, A.I.; Sulaiman, A.L. The Inverse Burr Negative Binomial Distribution with Application to Real Data. Stat. J. Theor. Appl. Stat. 2016, 5, 53–65. [Google Scholar] [CrossRef]
  31. Murthy, D.P.; Xie, M.; Jiang, R. Weibull Models; Wiley: Hoboken, NJ, USA, 2004. [Google Scholar]
Figure 1. Plots of (a) unimodal shapes and (b) decreasing shapes for f S M L ( x ; θ ) .
Figure 1. Plots of (a) unimodal shapes and (b) decreasing shapes for f S M L ( x ; θ ) .
Mca 26 00081 g001
Figure 2. Plots of h S M L ( x ; θ ) with selected values of θ .
Figure 2. Plots of h S M L ( x ; θ ) with selected values of θ .
Mca 26 00081 g002
Figure 3. Plots of the (a) epdf and (b) ecdf of the S-ML model for the precipitation data set.
Figure 3. Plots of the (a) epdf and (b) ecdf of the S-ML model for the precipitation data set.
Mca 26 00081 g003
Figure 4. Plots of the (a) epdf and (b) ecdf of the S-ML model for the failure time data set.
Figure 4. Plots of the (a) epdf and (b) ecdf of the S-ML model for the failure time data set.
Mca 26 00081 g004
Table 1. Values of various moment measures of the S-ML distribution.
Table 1. Values of various moment measures of the S-ML distribution.
θ mom(1) V ( X ) MSMK
0.51.4484651.2103671.4495556.374828
1.00.68033290.30039461.5207296.633623
1.50.43638710.13223561.5732126.842306
2.00.3188330.073754451.6119877.005059
2.50.25028450.046870651.641487.133455
3.00.20560520.032359491.6645697.236649
Table 2. Outcome of the simulation study.
Table 2. Outcome of the simulation study.
θn θ ^
BiasMSE
1.25200.0405480.052057
500.0171120.0194051
1000.0078620.009083
2000.0050180.0045803
5000.0023770.001814
10000.0013180.000915
1.50200.515230.072304
500.0211530.028311
1000.0083430.0125480
2000.0051950.006043
5000.00273030.002816
10000.0016190.001239
2.00200.0602370.135020
500.0291700.052147
1000.0123380.024522
2000.0090040.012285
5000.0014640.004796
10000.0038240.002366
2.50200.1069340.246197
500.0334150.083405
1000.0232730.044903
2000.0074080.019908
5000.0078560.008118
10000.0014310.004158
Table 3. Competent models with the S-ML model.
Table 3. Competent models with the S-ML model.
ModelsAbbreviationsCdfsReferences
LindleyLindley 1 1 + x θ 1 + θ e x θ [19]
sine exponentialS-Expo cos π 2 e θ x [2]
sine LindleyS-Lindley cos π 2 1 + θ x 1 + θ e θ x [20]
modified LindleyML 1 1 + e θ x θ x 1 + θ e θ x [11]
inverted modified LindleyI-ML 1 + θ 1 + θ 1 x e θ / x e θ / x [21]
inverted LindleyIL 1 + θ 1 + θ 1 x e θ / x [22]
transmuted exponentiated inverse RayleighTEIR e θ α / x 2 1 + λ λ e θ α / x 2 [23]
transmuted inverse RayleighTIR e θ / x 2 1 + λ λ e θ / x 2 [24]
inverse RayleighIR e α / x 2 [25]
LomaxLomax 1 1 + x λ α [26]
log normalLNormal ϕ ln x μ σ [27]
generalized beta type IIGB2 | α | β α θ B ( θ , δ ) 0 x y α θ 1 ( 1 + ( y / β ) α ) θ + δ d y [28]
Table 4. Descriptive statistical measures for the precipitation data set.
Table 4. Descriptive statistical measures for the precipitation data set.
MeanMedianVarianceSkewnessKurtosisMinMax
1.681.4711.0866824.2068840.324.75
Table 5. MLEs, SEs, and goodness-of-fit measures for the precipitation data set with one parameter models.
Table 5. MLEs, SEs, and goodness-of-fit measures for the precipitation data set with one parameter models.
ModelMLE (SE)AICBICA*W*Dnp-Value
S-ML θ ^ = 0.44 (0.0551)82.910984.31210.67960.09720.12730.7153
ML θ ^ = 0.6644 (0.0974)85.889887.2911.12780.17230.15660.4532
Lindley θ ^ = 0.9096 (0.1247)88.287489.68861.59080.26180.18820.2383
S-Expo θ ^ = 0.3396 (0.0576)90.793292.19442.17710.38730.22020.1088
S-Lindley θ ^ = 0.6091 (0.0729)85.641487.04261.15660.18170.16370.3966
I-ML θ ^ = 1.247 (0.1906)89.736691.13781.39090.21700.19750.1925
IL θ ^ = 1.5833 (0.2268)92.44231.82660.30400.19040.22790.0887
IR α ^ = 0.8588 (0.1568)92.29292.6742.18220.430770.23960.06369
Table 6. MLEs, SEs, and goodness-of-fit measures for the precipitation data set with models having more than one parameter.
Table 6. MLEs, SEs, and goodness-of-fit measures for the precipitation data set with models having more than one parameter.
ModelMLE (SE)AICBICA*W*Dnp-Value
TEIR α ^ = 1.1878 (5.739)90.202294.40581.13595530.21175530.18175010.2749038
λ ^ = 0.67006 (0.266)
θ ^ = 0.6362 (4.778)
TIR λ ^ = 0.0001 (0.40171)91.07394.7591.1360.211770.18170.2748
θ ^ = 0.8588 (0.2136)
Lomax α ^ = 58619.76 (52.96)94.997.82.51390.45390.23520.0724
λ ^ = 98190 (96.69)
LNormal μ ^ = 0.33737 (0.11368)8183.80.198550.03110.09130.9640
σ ^ = 0.62263 (0.08038)
GB2 α ^ = 580.4141 (1593.1699)84.189.77.431.580.4450.0000138
β ^ = 0.8125 (0.6469)
θ ^ = 4.3731 (6.6885)
δ ^ = 520.2863 (584.0752)
Table 7. Descriptive statistical measures for the failure time data set.
Table 7. Descriptive statistical measures for the failure time data set.
MeanMedianVarianceSkewnessKurtosisMinMax
1.5426671.2350001.1271671.2954624.3191700.1100004.730000
Table 8. MLEs, SEs, and goodness-of-fit measures for the failure time data set with one parameter models.
Table 8. MLEs, SEs, and goodness-of-fit measures for the failure time data set with one parameter models.
ModelMLE (SE)AICBICA*W*Dnp-Value
S-ML θ ^ = 0.47420 ( 0.06039 ) 82.327683.72880.226561580.028972350.078051380.99313719
ML θ ^ = 0.7297 ( 0.1082 ) 83.505184.90630.44010.06500.11120.8514
Lindley θ ^ = 0.9762 ( 0.1345 ) 85.094686.49580.72650.11380.14060.5929
S-Expo θ ^ = 0.3662 ( 0.0625 ) 86.554787.95591.05820.179810.167220.3711
S-Lindley θ ^ = 0.64690 ( 0.0783 ) 83.738685.13980.45990.06540.11390.8310
I-ML θ ^ = 0.9222 ( 0.1361 ) 92.641694.04260.95820.14300.13940.6043
IL θ ^ = 1.1603 ( 0.1619 ) 95.865897.26701.2610.19040.14110.5879
IR α ^ = 0.237 ( 0.043 ) 135.289136.6901.4232.4100.4420.00001
Table 9. MLEs, SEs, and goodness-of-fit measures for the failure time data set with models having more than one parameter.
Table 9. MLEs, SEs, and goodness-of-fit measures for the failure time data set with models having more than one parameter.
ModelMLE (SE)AICBICA*W*Dnp-Value
TEIR α ^ = 0.022 ( 0.065 ) 122.99127.198.3731.470.370.000410
λ ^ = 0.880 ( 0.114 )
θ ^ = 8.211 ( 23.735 )
TIR λ ^ = 0.880 ( 0.114 ) 120.990123.7928.3691.4710.37610.0004
θ ^ = 0.185 ( 0.035 )
Lomax α ^ = 19793.12 ( 81.02 ) 9092.81.330.2320.1840.259
λ ^ = 305 ( 51.04 ) )
LNormal μ ^ = 0.1597 ( 0.1464 ) 85.588.30.25770.03690.09870.9322
σ ^ = 0.801 ( 0.1035 )
GB2 α ^ = 655.80 ( 2342.40 ) 87.292.87.531.640.0420.00003
β ^ = 0.907 ( 0.77 )
θ ^ = 2.351 ( 3.56 )
δ ^ = 582.259 ( 519.98 )
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Tomy, L.; G, V.; Chesneau, C. The Sine Modified Lindley Distribution. Math. Comput. Appl. 2021, 26, 81. https://doi.org/10.3390/mca26040081

AMA Style

Tomy L, G V, Chesneau C. The Sine Modified Lindley Distribution. Mathematical and Computational Applications. 2021; 26(4):81. https://doi.org/10.3390/mca26040081

Chicago/Turabian Style

Tomy, Lishamol, Veena G, and Christophe Chesneau. 2021. "The Sine Modified Lindley Distribution" Mathematical and Computational Applications 26, no. 4: 81. https://doi.org/10.3390/mca26040081

Article Metrics

Back to TopTop