1. Introduction
Autoregressive (AR) models are fundamental tools in time series analysis, widely used for modeling and forecasting data over a specified time period. The classical AR(1) process assumes Gaussian innovations due to its mathematical simplicity and tractability. However, in many real-world applications such as finance, hydrology, and energy studies, data exhibit asymmetry, heavy tails, or positivity constraints that violate the normality assumption. Therefore, researchers have developed autoregressive processes with non-Gaussian innovations to better capture such empirical behaviors.
One of the earliest non-Gaussian extensions of the AR(1) process was proposed by Andel [
1], who introduced an AR(1) model with exponential innovations. The work of [
1] opened a way for further exploration of AR processes with skewed and heavy-tailed innovations such as gamma innovation [
2], skew-normal (SN) innovation [
3], generalized hyperbolic (GH) innovations [
4], and gamma-Lindley innovations [
5]. Recently, increasing focus has been given to Lindley-based innovations, motivated by their positive support and analytical tractability. Ref. [
6] developed an AR(1) process with Lindley innovations, calling the process LER(1). Ref. [
7] introduced an AR(1) process with Lindley marginals and called the process LAR(1) and applied the Gaussian estimation method. The generalization of the LER(1) process, the AR(1)-weighted Lindley process, was introduced by [
8]. Ref. [
9] used the scale mixtures of SN distributions as an innovation process and carried out the parameter estimation using the Bayesian approach. The AR(1) process with epsilon-SN innovations was introduced by [
10]. Similarly to [
11], ref. [
12] used the scale mixtures of the normal innovations, and parameter estimation was carried out with the expectation-maximization (EM) algorithm. The missing value estimation in the AR(1) process with exponential innovations was investigated by [
13].
Ref. [
11] introduced the XLindley (XL) distribution as an extended and more flexible form of the classical Lindley distribution (see [
14] for more details on the Lindley distribution). The XL distribution is a mixture of exponential and Lindley distributions with
mixing proportion. Ref. [
11] compared the XL with exponential, xgamma [
15], and Lindley distributions and stated that the XL distribution demonstrates better results than its competitive models according to the model selection criteria and goodness-of-fit statistics. However, the flexibility of the XL distribution is limited since it has only one parameter that is a scale parameter. To gain flexibility, we propose a natural generalization of the XL distribution by adding a shape parameter. The proposed distribution is called generalized XL (GXL). The GXL distribution contains the XL distribution as its sub-model. The statistical properties of the GXL distribution are all obtained in explicit forms. Therefore, the GXL is a tractable distribution and can be adapted to many models, such as regression, time series and survival.
Thanks to the tractable properties of the GXL, the AR(1) process with GXL innovations (AR(1)-GXL) is introduced. The parameters of the AR(1)-GXL process are estimated using three different approaches. The theoretical properties of the AR(1)-GXL process are derived. Two data sets are analyzed with the AR(1)-GXL process and compared with three different innovation distributions: gamma, WL and normal. The ARGXL, the web application of the proposed model, is developed using the shiny package of the R software (version 4.3.1). The ARGXL is accessible via
https:gazistat.shinyapps.io/ARGXL (accessed on 2 December 2025). The results presented in this study can be reproduced using the ARGXL web application. Furthermore, researchers can upload their own data sets to obtain parameter estimates for the AR(1)-GXL process, perform residual analyses, and obtain the model fit measures.
Although several non-Gaussian autoregressive models based on Lindley-type distributions have been proposed in the literature, including the Lindley and weighted Lindley AR processes, the proposed AR(1)-GXL model introduces several fundamental novelties. First, unlike the classical Lindley distribution, which is governed by a single scale parameter, the GXL distribution incorporates an additional shape parameter, significantly enhancing its flexibility in modeling skewness and tail behavior. Second, in contrast to weighed Lindley-based AR model, the proposed innovation distribution provides the ability to model higher skewness and heavy-tailed structures in time series. These features distinguish the proposed model as a flexible and practically alternative to existing Lindley and weighted Lindley autoregressive models.
The remaining parts of the study are outlined as follows:
Section 2 discusses the GXL distribution and its properties. The AR(1)-GXL process is introduced in
Section 3.
Section 4 is devoted to the parameter estimation methods for the AR(1)-GXL process. Two applications are given in
Section 5. The web application, ARGXL, is introduced in
Section 6. The important outcomes of the presented study are summarized in
Section 7.
2. Generalized XLindley Distribution
The probability density function (pdf) of the XL distribution is
where
. The XL distribution is a mixture distribution of two independent random variables (rvs) such as
and
with
mixing proportion. Consider the function
where
. The corresponding pdf of (
2) is obtained by determining the appropriate normalizing constant. A natural generalization of the XL distribution, say GXL, is proposed with the following proposition.
Proposition 1. The pdf of the GXL distribution iswhereis the normalization constant. The resulting density is referred to as GXL distribution. Proof. The GXL distribution is defined as
where
is the shape parameter and
is a normalizing constant that is derived as follows:
Dividing this integral into two parts, we have
The first part of the integration is easy to derive. For the second part, we apply the partial integration setting
and use the gamma function properties,
So, the result of the integration in Equation (
6) is
The normalizing constant is
Inserting Equation (
11) into Equation (
5), we have
The proof is completed. □
The proposed extension of the XL distribution preserves analytical tractability, includes the XLindley distribution as a special case when
, and allows for greater control over skewness and tail behavior. The cumulative distribution function (cdf) of GXL is
where
and
are the lower incomplete and complete gamma functions, respectively. The shapes of the GXL are displayed in
Figure 1. It is observed that the GXL density has right-skewed and almost symmetric shapes.
2.1. Momnents and Related Measures
Proposition 2. The kth raw moment of the GXL distribution is Proof. Let
, we have
The
kth raw moment of the rv
X is
The integration in Equation (
16) can be divided in two parts, as follows:
From the gamma integration, it is known that
So, the integration in Equation (
17) is
Inserting Equation (
19) into Equation (
16), we have
Replacing
in Equation (
20), the
kth raw moment is
□
Simply, the variance of the GXL is
. The third and fourth central moments are required to calculate the skewness and kurtosis measures,
where
and
So, the skewness and excess kurtosis can be computed via
and
.
Figure 2 displays the skewness and kurtosis plots of the GXL. When
is constant, the skewness and kurtosis increase once
increases. When
is constant, the skewness and kurtosis decrease once
increases.
The skewness and kurtosis parameters play an important role in capturing empirical features of non-Gaussian data. The higher skewness reflects stronger asymmetry, which is often associated with accumulation effects or positive shocks in financial time series. Similarly, higher kurtosis indicates heavier tails and higher possibility of extreme observations, observed in volatile markets. The ability of the GXL distribution to control both skewness and kurtosis through its shape and scale parameters allows practitioners to model asymmetric behavior and tail risk more accurately than classical models.
Proposition 3. The moment generating function (mgf) of the GXL is Proof. The mgf of
X is
Substituting pdf of
X in Equation (
29), we have
Divide the integration into two parts,
As in Proposition 1, we use the gamma integration, as follows:
where
. Finally, we have
□
Proposition 4. The Laplace transformation (LT) of the GXL is Proof. The result follows immediately from the relation between the LT and mgf, . □
2.2. Mixture Representation and Data Generation
Proposition 5. The GXL is a mixture distribution of exponential and gamma distributions.
Proof. Let
. Then, the pdf of the GXL is
Hence, the pdf can be written as a mixture density of the exponential and gamma distributions, as follows:
where
and the weight function is defined by
□
Algorithm 1 is defined to generate random variables from the GXL distribution using the mixture representation.
| Algorithm 1 A new algorithm for generating random variables |
- 1:
Set the parameters and - 2:
- 3:
Generate - 4:
If generate otherwise - 5:
Return Step 3.
|
2.3. Interpretation of the Additional Parameter
The motivation of the proposed generalization is not only to increase flexibility but also to introduce an additional parameter that admits a structural and interpretable role within the innovation process. The proposed GXL distribution arises from a mixture representation as a convex combination of an exponential and a gamma distribution. Under this representation, the parameter does not behave as a shape parameter but controls the relative dominance of the gamma component over the exponential component. From a time series perspective, the parameter regulates the degree of the magnitude of positive shocks in the innovation process. Small values of correspond to frequent and small memoryless shocks, while larger values of reflect rare but more persistent innovations.
4. Estimation
Three different approaches are used to estimate the parameters of the AR(1)-GXL process. These are maximum likelihood, Gaussian and least squares estimation methods.
4.1. Maximum Likelihood
Assume that the innovation process
follows the GXL distribution, given in Equation (
3). The innovation process is defined as
. So, the log-likelihood function of the AR(1)-GXL process is
Taking partial derivatives of Equation (
56) with respect to the unknown parameters, we have the score vectors
The ML estimators of the AR(1)-GXL process can be obtained by equating these score vectors to zero and solving simultaneously. However, there are no closed-form expressions for the ML estimators of the AR(1)-GXL process. For this reason, we use the direct maximization method to obtain the ML estimations of the processes.
4.2. Gaussian
The Gaussian estimation (GE) method, proposed by [
17], is based on the likelihood function of the Gaussian distribution. The mean and variance of the Gaussian distribution are replaced by the conditional mean and variance of the AR-GXL(1) process. The conditional likelihood function is
The log-likelihood function of (
60) is
where
. The definitions of
, and
are in Equations (
43) and (
44), respectively. Inserting the conditional mean and variance of
into Equation (
61), we have
There are no explicit solutions for the parameters of the AR(1)-GXL distribution. Therefore, Equation (
62) should be maximized using iterative optimization algorithms. For this purpose, we use the Nelder–Mead algorithm defined in the optim function of R.
4.3. Conditional Least Squares
The conditional least squares (CLS) estimations of the AR(1)-GXL are obtained by minimizing
where
is defined in Equation (
43). Inserting Equation (
43) into Equation (
63), we have
Since it is not possible to obtain closed form expressions for the CLS estimators of the AR(1)-GXL process, as in the MLE approach, the equation in Equation (
64) should be minimized using the iterative optimization algorithms. Again, we use the Nelder–Mead algorithm defined in the optim function.
4.4. Simulation
The ML, GE and CLS methods are compared via simulation study. The simulation replication number is set to
. Two parameter vectors and four sample sizes are used. These are
,
, and
. The simulation results are reported in
Table 1.
The ML method exhibits the best overall performance for the autoregressive parameter . The ML estimates are nearly unbiased, with mean values consistently close to the true value of , and with mean square error (MSE) values smaller than those of GE and CLS. The MSE for under ML decreases rapidly with increasing n, approaching zero when , demonstrating the expected asymptotic efficiency of the ML estimator.
For the parameters and , ML maintains good performance but displays moderately higher variance compared to CLS. Although ML remains nearly unbiased for these parameters, its MSE values are consistently larger than those of CLS across all sample sizes. This pattern suggests that ML, while theoretically optimal in large samples, may exhibit less stable behavior for the parameters of the innovation distribution when n is relatively small.
The GE estimator performs consistently worse than CLS and ML for all parameters and across both scenarios. The most important deficiency appears in the estimation of the shape parameter , where GE exhibits substantial positive bias and extremely large MSEs. The important findings of the simulation study can be summarized as follows:
The ML estimator is the most accurate and stable method for estimating .
The CLS estimator consistently outperforms the alternatives for the parameters and , providing the smallest MSEs and near-unbiased estimates.
The GE estimator performs poorly for all parameters and especially inefficient in estimating .
Finally, we recommend the use of the CLS method instead of the GE and ML methods for small sample sizes. However, if the sample size is large enough, the ML and CLS methods can be preferred.