Next Article in Journal
A Novel Deep Hybrid Learning Framework for Structural Reliability Under Civil and Mechanical Constraints
Previous Article in Journal
A Transformer-Based Multi-Task Learning Model for Vehicle Traffic Surveillance
Previous Article in Special Issue
A Lambert-Type Lindley Distribution as an Alternative for Skewed Unimodal Positive Data
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A New Goodness-of-Fit Test for Azzalini’s Skew-t Distribution Based on the Energy Distance Framework with Applications

by
Joseph Njuki
1 and
Abeer M. Hasan
2,*
1
Department of Mathematics and Statistics, Coastal Carolina University, Conway, SC 29526, USA
2
Department of Mathematics and Statistics, North Carolina A&T State University, Greensboro, NC 27411, USA
*
Author to whom correspondence should be addressed.
Mathematics 2025, 13(23), 3833; https://doi.org/10.3390/math13233833 (registering DOI)
Submission received: 29 September 2025 / Revised: 21 November 2025 / Accepted: 26 November 2025 / Published: 29 November 2025

Abstract

In response to the growing need for flexible parametric models for skewed and heavy-tailed data, this paper introduces a novel goodness-of-fit test for the Skew-t distribution, a widely used flexible parametric probability distribution. Traditional methods often fail to capture the complex behavior of data in fields such as engineering, public health, and the social sciences. Our proposed test, based on energy statistics, provides practitioners with a robust and powerful tool for assessing the suitability of the Skew-t distribution for their data. We present a comprehensive methodological evaluation, including a comparative study that highlights the advantages of our approach over traditional tests. The results of our simulation studies demonstrate a significant improvement in power, leading to more reliable inference. To further showcase the practical utility of our method, we apply the proposed test to three real-world datasets, offering a valuable contribution to both the theoretical and applied aspects of statistical modeling for non-normal data.

1. Introduction

Skewed and heavy-tailed data are prevalent in various applied domains, including econometrics, environmental science, and risk analysis. Income distributions, housing prices, and insurance claims often display asymmetry and excess kurtosis; the tail behavior encodes meaningful extremes, such as financial losses or contaminant spikes, that should not be dismissed as outliers; see, for example, Ibragimov et al. [1], Guo [2], Cortés et al. [3], and Ahmad et al. [4].
A natural starting point for modeling asymmetry is the Skew-normal distribution introduced by Azzalini [5] (see also [6]). By augmenting the normal distribution with a skewness parameter, the Skew-normal preserves analytical tractability while allowing for controlled departures from symmetry. However, because the Skew-normal remains light-tailed, it is ill-suited to settings where leptokurtosis is intrinsic to the data-generating process.
To address heavy tails alongside asymmetry, the Skew-t family extends the t distribution by introducing a skewness parameter. An influential approach views Skew-t random variables as scale mixtures of Skew-normal and Chi-square variables [5,7], a perspective adopted and elaborated by several authors, including Hasan et al. [8]. Related extensions, such as extended Skew-t (EST) and alternative parameterizations, further enhance the modeling flexibility (see, for example, [9]). Empirically, Skew-t models have seen broad application, from financial time series and risk assessment to environmental monitoring and robust analysis under truncation or censoring [10,11]. Non-central variants expand the toolkit and have been studied in detail, with their properties and use cases documented in Hasan et al. [8] and Hasan [12].
Despite this progress, the assessment of the Skew-t model remains essential. In practice, misspecification of tail weight or asymmetry can distort inference on extremes, dependence, and risk. This motivates rigorous goodness-of-fit (GoF) procedures tailored to Skew-t families—methods that can diagnostically test statistical adequacy against alternatives that differ in tail behavior, asymmetry, or both.
In this paper, we develop an energy-based goodness-of-fit test for Azzalini’s standard Skew-t distribution proposed in Azzalini and Capitanio [6] and defined as a random variable with probability density function (PDF) that takes the following form:
f ν ,   α ( x ) = 2 g ν ( x ) G ν + 1 α x ν + 1 ν + x 2 ,
where g and G are the PDF and the cumulative CDF of the standard t distribution. The parameters α and ν are referred to as the skewness parameter and the degrees of freedom, respectively. We will denote this density by S t ( ν ,   α ) . The special case for α = 0 reduces to the standard Student t distribution with ν degrees of freedom. The CDF of this Skew-t distribution does not have a closed form.
A scale–location extension of this definition has been proposed by adding a location parameter ξ and a scale parameter ω by replacing x in Equation (1) by x ξ ω . The expected value of the scale–location Skew-t variable can be obtained by
E ( X ) = ξ + δ × Γ ( 1 2 ( ν 1 ) ) Γ ( ν 2 ) ν π ,
where ν > 1 , δ = α 1 + α 2 , and Γ ( z ) = 0 t z 1 e t d t . The reader is referred to Azzalini and Capitanio [6,13] for more details on the theoretical properties and alternative parameterizations of the Skew-t distribution. For simplicity, we will refer to Azzalini’s Skew-t distribution as the Skew-t distribution for the remainder of this paper unless otherwise stated.
Without loss of generality, we will focus on the standard Skew-t distribution in our construction of the goodness-of-fit test, since an arbitrary four-parameter Skew-t density can be converted to the standard one via a linear transformation. Figure 1 provides density curves of the standard Skew-t distribution for varying degrees of freedom ( ν ) when the skewness parameter α = 1 is held fixed. This illustrates the role of each parameter as the parameter α controls the skewness of the density, and the parameter ν controls the thickness of the tail.
There is limited research on the goodness-of-fit test for Azzalini’s Skew-t distributions, except those based on empirical distribution functions (EDFs), such as Kolmogorov–Sminorv, Cramer–von-Mises, etc., discussed and modified in [14,15]. Recently, Maghami and Bahrami [16] proposed a goodness-of-fit test for Azzalini’s Skew-t distribution based on the correlation coefficient (r). Most studies related to Skew-t distributions focus on properties, extensions, generalizations, change point analysis, and applications, for example, [12,17,18,19], among many others. To address this research gap, we develop an energy-based goodness-of-fit test for the Skew-t distribution. Our test leverages the well-established properties of energy-based testing to achieve higher power for any given sample size and parameter combination α and ν .
In this article, we propose a new one-sample (univariate) energy goodness-of-fit test based on energy statistics proposed by [20,21]. In the most recent work, Refs. [22,23,24] proposed goodness-of-fit tests based on energy statistics for Skew-normal, Inverse Gaussian, and Lindley distributions, respectively. For a given sequence of independent random variables of size n and with a cdf G, the test statistic based on energy statistics will reject the null hypothesis that F = G for large values of the test statistic. If the null distribution, F, and the given data come from the same underlying distribution G, then the values of the test statistic are expected to be smaller. Furthermore, there have been numerous studies involving energy statistics such as testing for multivariate normality [25,26], testing for equality of distributions [27,28], one-sample goodness-of-fit tests [22,24,29,30], and change point analysis [31,32,33,34], among many others.
The energy distance is a statistical distance between the distributions of random vectors that characterizes the equality of distributions; see, for example, [21,29,35]. The concept of energy statistics described by Sźekely [21] is based on the notion of Newton’s gravitational potential energy, which is a function of the distance between two bodies. The idea of energy statistics, therefore, is to consider statistical observations as heavenly bodies governed by a statistical potential energy, which is zero if and only if an underlying statistical null hypothesis is true, see, for example, [29,36].
Definition 1. 
Sźekely and Rizzo [35] defined the energy distance between distributions of two independent and univariate random samples X and Y with finite expectations as follows:
E ( X ,   Y ) = 2 E | X Y | E | X X | E | Y Y | 0 ,
X = d X , Y = d Y , and equality holds if and only if X and Y are identically distributed.

1.1. Existence and Uniqueness of the MLEs for Azzalini’s Skew-t Distribution

Azzalini and Genton [37] presented a detailed discussion of the existence and uniqueness of the maximum likelihood estimates (MLEs) for Azzalini’s Skew-t distribution. They concluded that the distribution is generally robust, with the profile log-likelihood for the skewness parameter α being unimodal and free of stationary points at α = 0 . The Fisher information matrix remains non-singular for all finite degrees of freedom, ensuring the existence and uniqueness of the MLE under regular conditions. Simulation studies confirm that the MLEs for skewness and tail parameters (degrees of freedom) are well-behaved for moderate to large samples, while likelihood-based adjustments such as the deviance approach can mitigate rare boundary issues in small samples (see Azzalini and Genton [37] and Azzalini and Capitanio [6] for more details).
Recent advances have addressed the instability and divergence of the MLE for α in small samples. Azzalini and Arellano-Valle [38] introduced a penalized log-likelihood framework that ensures robust, finite, and unique estimates for α , even in small or multivariate settings. In our computations, we adopted this penalized likelihood approach to enhance robustness and avoid instability in parameter estimation for the Skew-t family.

1.2. Motivation and Scientific Contribution

The Skew-t distribution stands out as a flexible and reliable tool for modeling asymmetric and heavy-tailed data. Its regular profile likelihood and robust MLE properties make it especially suitable for general-purpose robust inference in both univariate and multivariate settings. In this paper, we propose a procedure that is superior for the goodness-of-fit test for the Skew-t distribution based on energy distance statistics (Sźekely and Rizzo [35]) and the definition of Azzalini’s Skew-t distribution (Azzalini and Capitanio [13]). Unlike the proposed method based on energy statistics, many existing methods depend on the distribution function of random variables. Energy statistic-based tests have been shown to be typically more powerful against general alternatives than corresponding tests based on classical statistics (non-energy type), such as Kormogorov–Smirnov, correlation, Anderson–Darling, and Cramer–von-Mises. In addition, energy statistic-based tests have an invariance property with respect to any distance-preserving transformation of the dataset; see [29,33,36]. This enables studies that involve energy statistics to be extended to multivariate and high-dimensional settings.
In Section 2 of this article, we introduce a test procedure based on energy statistics for the goodness of fit of Azzalini’s Skew-t distribution and discuss its theoretical properties. We perform various simulations in Section 3 to compare the approach with other existing methods. In Section 4, we apply our method to three case studies. The conclusions are provided in Section 5. Limitations and future research are discussed in Section 6.

2. Proposed Energy-Based Goodness-of-Fit Test

We propose a one-sample univariate goodness-of-fit test based on the energy statistics proposed by [26,35] for the Skew-t distribution. The null hypothesis is that the data X follow the null distribution F 0 , which is a Skew-t distribution, against the alternative that the Skew-t distribution is a poor fit for the data.
Definition 2. 
Let X 1 , X n be a random sample from a univariate population with distribution F and let x 1 , x n be the observed values of the random variables in the sample. Then, the one-sample energy statistic goodness-of-fit test for testing the hypothesis H 0 : F = F 0 vs H a : F F 0 is defined as follows.
n E n = n 2 n i = 1 n E | x i X | E | X X | 1 n 2 i = 1 n j = 1 n | x i x j | ,
where X and X are independent and identically distributed variables with distribution F 0 , and the expectations are taken with respect to the null distribution F 0 .
The null hypothesis F = F 0 , is rejected for large values of the test statistic n E n . Under the null hypothesis, the limiting distribution of n E n is a quadratic quantity of the form j = 1 λ j Z j 2 such that Z j ,   j = 1 ,   2 , , are i.i.d. standard normal random variables and λ j are nonnegative constants that depend on the null distribution. Thus, the goodness-of-fit test can be implemented by finding the constants λ j . In practice, this could be difficult, and we therefore resort to the use of empirical critical values of n E n so that P ( n E n > C α ) = α . This fact is guaranteed since the test based on n E n is a consistent goodness-of-fit test, see, for example, Sźekely and Rizzo [26] and Móri et al. [25].
The goodness-of-fit test statistic based on energy statistics is dependent on the derivation of the expected values of | x X | and | X X | where X and X are independent and identically distributed random variables from the null distribution F 0 .
Proposition 1. 
Let X f ( x ) where f ( x ) is the standard Skew-t density defined in Equation (1). Then, for any fixed x R ,
E | x X | = R | x t | f ( t ) d t = 2 x F ( x ) x + Γ ( 1 2 ( ν 1 ) ) Γ ( ν 2 ) δ ν π 2 x t f ( t ) d t ,
where F ( . ) is the CDF of the standard Skew-t distribution.
Proof of Proposition 1. 
Let X f ( x ) . Then, for every fixed real number x, we have
E | x X | = R | x t | f X ( t ) d t = x ( x t ) f X ( t ) d t + x ( t x ) f X ( t ) d t = x x f X ( t ) d t x t f X ( t ) d t + x t f X ( t ) d t x x f X ( t ) d t = x F X ( x ) x t f X ( t ) d t + t f X ( t ) d t x t f X ( t ) d t x ( 1 F X ( x ) ) = 2 x F ( x ) x + E ( X ) 2 x t f X ( t ) d t = 2 x F ( x ) x + Γ ( 1 2 ( ν 1 ) ) Γ ( ν 2 ) δ ν π 2 x t f X ( t ) d t ,
where the integral term can be numerically evaluated in R (version 4.4.2) using the command dst() available in the Azzalini sn package. □
Sometimes the derivation of the second term in Equation (4) may not be analytically feasible and, therefore, we can use the following approximation as suggested in [22].
Proposition 2 
(Quantile-based Approximation). Let X and X be independent and identically distributed random variables with a well-defined cumulative distribution function, F ( x ) . Given that the quantile or inverse CDF function of X exists, we have the following.
E | X X | = 4 m i = 1 m y i F 1 ( y i ) 2 m i = 1 m F 1 ( y i ) ,
where m is the number of equally sized sub-intervals of [ 0 ,   1 ] and y i is chosen from the ith sub-interval. It is worth noting that Proposition 2 applies to all distribution functions.
Proof. 
This proof is provided by Opperman and Ning [22]. □

3. Simulations and Results

This section presents a simulation-based assessment of the proposed goodness-of-fit (GoF) test for Azzalini’s Skew-t distribution, built upon the energy distance framework. We evaluated the test performance on varying values of the skewness parameter ( α ) and degrees of freedom ( ν ). Simulations are conducted under both the null hypothesis and multiple alternative distributions, and type I error and empirical power are computed.
In this simulation study, we calculate two key quantities. The size of the test (type I error) and the empirical power (1-type II error). We used R (R version 4.4.2) and the sn package, https://cran.r-project.org/web/packages/sn/refman/sn.html (accessed on 21 November 2025), to carry out the simulations. Due to the complexity of repeatedly evaluating the Skew-t CDF and performing numerical integration, all simulations were parallelized using the foreach and doParallel R packages. The simulations were distributed across 11 processor cores, significantly accelerating the test’s evaluation across various scenarios. The programming code is too lengthy to include in the paper. However, it can be obtained by contacting the authors via email.

3.1. The Univariate Energy Test Statistic

The univariate energy test statistic is derived from the expected pairwise distances between observations and a reference Skew-t distribution. We used the standard Azzalini’s Skew-t distribution in our development of the test statistic and rely on standardization to convert an arbitrary Skew-t to the standard one before applying the test. Let F θ be the cumulative distribution function (CDF) of Azzalini’s standard Skew-t distribution with parameter vector ( α ,   ν ) . The test statistic for a sample { x 1 , , x n } is
T n = n { 2 n i = 1 n 2 x i F ( x i ) x i + Γ ( 1 2 ( ν 1 ) ) Γ ( ν 2 ) δ ν π 2 x i t f X ( t ) d t 4 m i = 1 m y i F 1 ( y i ) 2 m i = 1 m F 1 ( y i ) 1 n 2 i = 1 n j = 1 n | x i x j | } .
where δ = α 1 + α 2 , ν > 1 , and m is the number of equally sized sub-intervals of [ 0 ,   1 ] and y i is chosen from the ith sub-interval.
This expression is approximated using the following.
  • A term involving the integral of y · f S T ( y ; α ,   ν ) up to each data point.
  • A quantile-based approximation for the inter-distributional expectation E | Y Y | .
  • A linear-order statistic approximation for the within-sample expectation E | x i x j | .
According to Rizzo [20], the last term of the test statistic n E n in Equation (4) can be linearized in order to reduce the computational complexity of the test from O ( n 2 ) to O ( n log n ) , which is useful during extensive simulations and applications. Let x ( 1 ) ,   x ( 2 ) , ,   x ( n ) be the ordered sample of the random sample x 1 ,   x 2 , ,   x n . Then, the linearization of the double sum of the test n E n is given as
i = 1 n j = 1 n | x i x j | = 2 k = 1 n ( ( 2 k 1 ) n ) x ( k ) .

3.2. Critical Value Simulation Under Varying α and ν

In this section, we conduct a simulation study to estimate the critical values for the energy-based goodness-of-fit test under Azzalini’s Skew-t distribution. The critical value simulations are summarized in Table 1. We selected degrees of freedom d f = 5 ,   10 ,   30 to account for different levels of heavy tail behavior and skewness coefficient α = 1 ,   0 ,   1 to cover the three possible scenarios of right skew, symmetry, and left skew. We used 5000 replicates in this study, with sample sizes n = 50 ,   100 ,   150 ,   200 . Table 2 presents the results of the type I error simulations, illustrating that the empirical values are very close to the theoretical value of 0.05, regardless of the sample size and the parameter values chosen for each sample.

3.3. Type I Error Control

To explore the sensitivity of the test, we vary the skewness parameter α { 1 ,   0 ,   1 } and the degrees of freedom ν { 5 ,   10 ,   30 } . For each combination of ( α ,   ν ) , we
  • Generate B = 5000 samples of size n = 50 from the Skew-t distribution with parameters ( 0 ,   1 ,   α ,   ν ) ;
  • Estimate the parameters θ ^ = ( ξ ^ ,   ω ^ ,   α ^ ,   ν ^ ) using the maximum penalized likelihood method available in the sn package;
  • Standardize the data to the standard Skew-t random variables and compute the energy goodness-of-fit statistic using the formula in Equation (7);
  • Determine the empirical 95th percentile critical value under the null hypothesis.
The empirical type I error is then evaluated across different parameter settings to confirm robustness.
Table 2 shows that the proposed energy-based test maintained the nominal significance level in all the configurations examined. For n = 50 , the empirical type I error was close to 0.05, although slightly higher in extreme skewness or low degrees of freedom. As the sample size increased, the test stabilized and was consistently aligned with the theoretical level.
Figure 2a shows critical values with a varying skewness parameter, α , for ν = 5 and sample size n = 50 . For each selected level of significance ( L O S = 0.01 , 0.025 , 0.05 , 0.10 ), the critical values seem to stabilize as the skewness parameter α progresses further from 0. At α = 0 , we see a dramatic increase in the simulated critical value. Note that Azzalini’s Skew-t distribution reduces to the classic Student t distribution when α = 0 . In this special case, Azzalini’s skewness t might be an overfitting model for the data, as the skewness parameter is no longer needed and the Student t distribution is a better fit. If | α | > 2 , the critical value stabilizes. This implies that for a fixed predetermined confidence level, the actual value of the skewness parameter α has little to no effect on the critical value as long as | α | > 2 ; in other words, the data exhibit a noticeable skewness.
Figure 2b shows critical values with varying ν for α = 1.0 and sample size n = 50 . When the degrees of freedom ν = 1 , Azzalini’s Skew-t distribution reduces to Cauchy’s distribution, and this is why we see higher critical values for the goodness-of-fit test. As degrees of freedom increase ν > 10 , the critical values for a predetermined significance level stabilize as depicted by the nearly horizontal lines in Figure 2b.

3.4. Power Analysis Under Various Alternatives

The simulation findings demonstrate the superior statistical power of the proposed energy-based goodness-of-fit, n E n , test statistic compared to several established goodness-of-fit tests, including the Anderson–Darling (A-D), Cramér–von Mises (CvM), Watson, Kolmogorov–Smirnov (K-S), and Kuiper tests. The analysis was carried out on various alternative distributions with sample sizes (n) of 50, 100, 150, and 200.
For power studies, samples are drawn from alternative distributions that deviate from Azzalini’s Skew-t distribution family. These include the following:
  • Chi-square: This is asymmetric and heavy-tailed.
  • StudentorGosset’s t: This is symmetric and heavy-tailed.
  • Exponential: This has a lighter tail and positive skew.
  • SHASH: The SHASH (Sinh-Arcsinh) distribution discussed in [39] is a highly flexible statistical distribution defined by four parameters that separately control the location, scale, skewness, and kurtosis of a variable.
  • Generalized t (GT): This is to assess sensitivity to misspecification.
  • Log-normal: This is asymmetric and heavy-tailed.
  • KwCWG: This is the Kumaraswamy Complementary Weibull geometric probability distribution. This is a five-parameter density that is well-suited for modeling skewed data with heavy tails.
The empirical power of our proposed test can thus be obtained using the following algorithm.
  • Obtain the critical value under the Skew-t distribution assumption with parameters ( 0 ,   1 ,   α ,   ν ) as explained in Section 3.3.
  • Generate a dataset x 1 , ,   x n from the desired alternative distribution.
  • Process the data as if they were from a Skew-t distribution and estimate the parameters θ ^ = ( ξ ^ ,   ω ^ ,   α ^ ,   ν ^ ) using the maximum penalized likelihood method available in the sn package.
  • Standardize the data to the standard Skew-t random variables t 1 , ,   t n , then compute the energy goodness-of-fit statistic using Equation (7).
  • Based on the energy goodness-of-fit statistic and the critical value in Step 1, determine whether or not the null hypothesis is rejected.
  • Repeat Steps 1 through 5 for B times and the empirical power is calculated as the proportion of times the null is rejected across B = 5000 repetitions.
The empirical powers for the classical EDF (non-energy) tests considered in this study are also obtained in a similar manner. Table 3 summarizes the results of the power comparison simulations. For each case, the bold font indicates the highest power achieved at the combination of the sample size and alternative distribution displayed in the row heading, for each test shown in the column heading.

3.5. Superior Performance of the n E n Test

As illustrated in Table 3, across all scenarios, the energy-based test exhibited the highest power. This dominance is particularly evident in cases of heavy-tailed and skewed distributions.
  • For the Log-normal ( L N ( 0.5 , 1 ) ) distribution, the n E n test achieved a power of 0.9701 at a sample size of just 50, far exceeding the next best test, Watson’s test, which had a power of 0.3988.
  • Against the GT ( ν = 3 ,   τ = 1 ) distribution, the n E n test reached a power of 1.0000 for sample sizes of 150 and 200, indicating perfect detection in the simulation.
  • For the Exponential (Exp(1)) distribution, the power of n E n ranged from 0.8413 to 0.9848, consistently outperforming all other competitors.
  • For the KwCWG distribution, the power of the n E n test ranged from 0.0845 to 0.1170, with the energy test outperforming the other tests in three out of the four sample size settings. The overall small power in this comparison indicates that the tests are having difficulty distinguishing between the two distributions. This is expected as both distributions are suitable to model skewness and heavy tails. However, since the Skew-t distribution uses four parameters instead of five, it is more parsimonious.
The K−S and A−D tests performed poorly in almost all scenarios, especially for skewed and heavy-tailed data. The correlation-based test was competitive but slightly less sensitive to deviations in kurtosis. The energy test consistently ranked the best performing method in all conditions except a single case for the KwCWG distribution.

3.6. Effect of Sample Size and Parameters

As anticipated, the power for all tests increased with the sample size. For example, in the case of the SHASH distribution, the power of the n E n test rose from 0.5734 for n = 50 to 0.6749 for n = 200 . It should be noted that for the standard t 10 distribution, which is a special case of the Skew-t distribution when the skewness parameter α = 0 , all tests showed very low power. The highest power achieved was only 0.2790 by the n E n test at n = 200 . This suggests that all tests have difficulty distinguishing the two distributions, although the n E n test still has a relative advantage. The proposed energy-based test achieved the highest power under conditions of high skewness ( α = 5 ) and low degrees of freedom ( ν = 3 ). As ν increased, the power decreased slightly. This decrease in power is likely because the Skew-t distribution converges to the Skew-normal family as degrees of freedom, ν , making it harder to distinguish the Skew-normal from the Skew-t distribution as the degrees of freedom increase.

3.7. Summary

In summary, the simulation results provide strong evidence that the energy-based statistic, n E n , is a more powerful goodness-of-fit test than the other methods considered in the study, especially for non-normal distributions with skewness or heavy tails. In general, the energy-based GoF test for the Skew-t distribution demonstrates
  • Robust control of type I error;
  • Superior power in a wide range of alternatives;
  • Flexibility for skewed, heavy-tailed, and multi-modal data.
Its simplicity and effectiveness make it a promising tool for model validation in applied settings. Although the proposed method relies on a Monte Carlo approximation for expectation terms, the computational cost is manageable and can be parallelized. The use of linearized summation and pre-simulated reference distributions further improves efficiency.

4. Real Data Applications

To evaluate the practical utility of the proposed test, we present three case studies with real-world datasets. For each case study, we use the penalized MLE to fit the data, compare the fit using information criteria, visually assess goodness-of-fit using histograms of the original data and superimposed density curves, and compare the CDF of the fitted Skew-t model to the empirical density curve. We compute p-values for the likelihood ratio test to illustrate that the Skew-t distribution provides a better fit to the data than the nested models (normal, Skew-Cauchy, and Skew-normal). We finally present the test statistic and p-value for our proposed energy-based test, along with those for the selected comparator tests.
Since the underlying distributions of these datasets are not exactly known in advance, we use the bootstrap algorithm to determine whether they come from Azzalini’s Skew-t distribution. The bootstrap procedure is used to approximate the p-value of the proposed test as given below.
  • Fit the real data y 1 , ,   y n with an Azzalini’s Skew-t distribution in Equation (1) and obtain the maximum likelihood estimates (MLEs) of ξ ,   ω ,   α , and ν from the Azzalini’s sn package available in R (version 4.4.2).
  • Use the formula in Equation (7) to calculate the energy goodness-of-fit statistic of the standardized data and denote it T n 1 .
  • Simulate x 1 , ,   x n , a random sample of size n , from the Azzalini Skew-t distribution with parameters specified as ξ ^ ,   ω ^ ,   α ^ , and ν ^ that were obtained in Step 1.
  • Standardize the data with the standard Skew-t distribution and compute the energy goodness-of-fit statistic for the simulated data using the formula in Equation (7) and denote this value as T n 1 .
  • Repeat this process for B times and obtain B energy goodness-of-fit statistics and denote them by T n 1 , ,   T n B .
  • The bootstrap p-value is therefore approximated as
    p ^ = 1 B b = 1 B I T n b T n 1 ,
    where I ( · ) is an indicator function that takes the value of one when T n b T n 1 and zero otherwise.
A similar procedure was applied for classical (EDF) tests to approximate their respective bootstrap p-values.

4.1. Case Study 1

The first dataset represents the body mass index (BMI) of 102 male Australian athletes (Cook and Weisberg [40]). The dataset is provided in Table A1 and is available in the dr package in R. This dataset was previously analyzed by Maghami and Bahrami [16], who used the correlation coefficient as a measure of goodness of fit for the Skew-t distribution. The dataset exhibits moderate skewness and potential heavy tails, making it an ideal candidate for Skew-t modeling.

4.2. Case Study 2

The second dataset consists of Apple Inc.’s closing prices. This is the daily rate of returns for Apple stock Macrotrends [41]. This dataset was used in the analysis by [19] to detect structural changes in the distribution using the MIC-based method. The dataset is obtained from [41], which provides historical stock price data. We limited the data to the range from 31 January 2019 to 24 January 2020 to avoid the heterogeneity introduced by change points. The stock price shows a general upward trend with some noticeable fluctuations. Given that stock prices exhibit time dependence, we transform the raw closing prices, P t , into daily returns, defined as
R t = P t + 1 P t P t
This transformation allows us to analyze relative price changes rather than absolute levels. The resulting data consist of 248 daily return observations, which we suspect to follow a Skew-t distribution. The filtered and transformed data are available in Table A2 in Appendix A.

4.3. Case Study 3

The third case study is the daily returns of the Dow Jones Industrial Average stock over the period of one year, from 1 November 2024 to 31 October 2025. This data is available in the Tidyquant package in R (version 4.4.2). Stock prices have frequently fluctuated this year due to tariff-related volatility, so we expect the data to exhibit heavy tails and outliers, making it an ideal candidate for Skew-t distributions. The data are then transformed using Equation (9) to correct for the dependence between observations. The transformed data consists of 249 observations and is available in Table A3 in Appendix A.

4.4. Model Fitting

The histograms in Figure 3, Figure 4 and Figure 5 reveal skewness; therefore, distributions such as the Skew-t, Skew-normal, Skew-Cauchy, and normal distributions are good candidates for modeling these datasets. Parameter estimation is conducted using the penalized maximum likelihood method. The results of the penalized maximum likelihood estimates (PMLEs), log-likelihood (LogL), Akaike information criterion (AIC), Schwarz information criterion (SIC), and corresponding p-value for the likelihood ratio test (LRT) against the Skew-t distribution are reported in Table 4, Table 5 and Table 6.
In all datasets, the Skew-t distribution yielded the lowest AIC values, indicating a better overall fit. Surprisingly, the Skew-normal distribution gave a lower SIC value for the body mass index (BMI) dataset. In addition, the associated p-value (0.0502) for the likelihood ratio test (LRT) indicates some evidence that the data may follow Skew-t distribution. Other likelihood ratio tests (LRTs) yielded small p-values, suggesting strong evidence in favor of Skew-t distribution.
In the first dataset (BMI), our proposed test statistic is 1.7109, with a corresponding p-value of 0.5840, supporting the assertion that the BMI data follow the Skew-t distribution. Classical tests considered in the study also supported the null hypothesis that body mass index (BMI) data can be modeled using the Skew-t distribution. In the second dataset, the test statistic for the proposed procedure is 3.9635, and the corresponding p-value is 0.5473. These results support the fact that the data follow the Skew-t distribution. Similar results are observed for the classical (EDF) tests, which suggest that the Skew-t distribution is an adequate model for the data. For the last dataset, the proposed test statistic is 9.7168 and its corresponding p-value is 0.5012, indicating that the Dow Jones daily returns follow a Skew-t distribution. We observe a similar conclusion from the classical (EDF) tests.
The results for all datasets are summarized in Table 7, Table 8 and Table 9. Furthermore, density estimates and empirical cumulative functions suggest that the Skew-t distribution fits the datasets adequately, as shown in Figure 3a,b for the BMI dataset, Figure 4a,b for the Apple stock prices dataset, and Figure 5a,b for the Dow Jones daily returns data, respectively.

4.5. Discussion

In the three case studies we presented in this article, the Skew-t distribution fits the data adequately and has the lowest AIC value among the distributions. This assertion is also well confirmed by the density and empirical CDF estimates as shown in Figure 3, Figure 4 and Figure 5. Through simulations, 2000 bootstrap samples are drawn from the Skew-t distribution with the parameter estimates specified in Table 7, Table 8 and Table 9, and the approximate p-values of the proposed test for the three case studies are obtained as 0.5840, 0.5473, and 0.5012, respectively. We therefore fail to reject the null hypothesis and conclude that the three datasets can be modeled with the Skew-t distribution.
The three case studies highlight the energy-based GOF test’s ability to validate Skew-t modeling in real-world scenarios, particularly when skewness and heavy tails are present. In addition, they illustrate the utility of energy-based approaches in detecting misfits in a range of candidate models.

5. Conclusions

In this paper, we developed a new goodness-of-fit test for Azzalini’s Skew-t distribution based on the energy distance framework. This method offers a flexible and powerful alternative to traditional EDF-based and correlation-based GoF tests. Our simulations demonstrate strong control of type I error and superior power in a wide range of alternatives, particularly those involving skewness, heavy tails, or multi-modal distributions. The proposed test performs reliably even in small to moderate sample sizes and is computationally feasible through bootstrap calibration and efficient implementation strategies.
We used simulation studies to explore the power of this test relative to classical alternative testing approaches under Azzalini’s Skew-t distribution. Our simulations show that the energy-based test performs better than other tests, regardless of the alternative distribution. We also used three distinct datasets to illustrate two main ideas: the importance of the Skew-t distribution for modeling skewed data with heavy tails, and the reliability of our test for detecting goodness of fit in practice. Our case studies with real data analysis confirm the practical value in model validation contexts where accurate distributional assumptions are critical.

6. Limitations and Future Research

The primary limitation of our GOF test is that it can be computationally demanding, especially if applied to a very large dataset. In this paper, we propose multiple solutions to optimize computation time and complexity, including leveraging parallel processing and approximations. Additional approaches to simplify and reduce compute time could be explored in another study. Future research directions include developing similar tests for other distributions, especially variations in the Skew-t distribution, such as the non-central Skew-t distribution and the unified Skew-t distribution, and their multivariate extensions. From a theoretical perspective, the properties of the proposed test statistic, such as asymptotic behavior, variance, and bias, could be studied. Also, exploring analytic approximations to the energy statistic under complex parametric families is a potential topic for future research in this area. In general, the energy-based goodness-of-fit test provides a robust and versatile tool for statistical modeling in both the theoretical and applied domains.

Author Contributions

Conceptualization, J.N. and A.M.H.; methodology, J.N. and A.M.H.; software, J.N. and A.M.H.; validation, J.N. and A.M.H.; formal analysis, J.N. and A.M.H.; investigation, J.N. and A.M.H.; resources, J.N. and A.M.H.; data curation, J.N. and A.M.H.; writing—original draft preparation, J.N. and A.M.H.; writing—review and editing, J.N. and A.M.H.; visualization, J.N. and A.M.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The source of each dataset is provided in the article. Additionally, cleaned and filtered datasets are provided in Appendix A.

Acknowledgments

The authors have reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

We present the two datasets used in our applications.
Table A1. Body Mass Index for 102 Male Australian Athletes.
Table A1. Body Mass Index for 102 Male Australian Athletes.
22.4623.8823.6823.1522.3224.0223.2925.1122.8126.25
21.3822.5226.7323.5725.8424.0623.8525.0923.8425.31
19.6926.0725.5023.6926.7925.6125.0624.9322.9620.69
23.9724.6425.9323.6925.3822.6823.3622.4422.5719.81
21.1920.3921.1221.8929.9727.3923.1121.7520.8922.83
22.0220.0720.1521.2419.6323.5821.6525.1723.2532.52
22.5930.1834.4221.8623.9924.8121.6821.0423.1220.76
23.1322.3522.2823.5519.8526.5124.7833.7330.1823.31
24.5125.3723.6724.2825.8221.9323.3823.0725.2123.25
22.9326.8621.2625.4324.5427.7923.5827.5623.7622.01
22.3421.07
Table A2. AAPL Daily Returns (31 January 2019–24 January 2020).
Table A2. AAPL Daily Returns (31 January 2019–24 January 2020).
0.720120.048072.840501.710940.03445−1.89394−0.31005−0.575090.86172−0.41548
0.36433−0.222480.299260.64354−0.563861.116570.728450.057400.30976−0.98359
1.051120.50295−0.18198−0.57540−1.157460.237683.464221.123540.442211.11165
1.300821.02085−0.792470.873863.68303−2.07084−1.20911−1.033170.899410.13265
0.651760.679131.453670.685500.174040.669421.57361−0.299850.56140−0.83242
−0.040210.181020.010041.947300.359370.328661.44233−0.15423−0.90751−0.47739
0.15174−1.925614.90856−0.650781.24313−1.54428−2.695700.01971−1.07442−1.76365
−5.811941.583031.19792−0.43997−0.56818−3.126991.91710−2.04716−1.70697−0.38406
−0.41348−0.476910.51866−1.81155−1.011033.658391.614341.468182.661701.27794
1.15796−0.31825−0.02060−0.726240.596662.35185−0.292270.80356−0.34092−0.10061
−1.515762.16291−0.03003−0.911191.834080.585460.82869−0.08806−2.061400.60994
0.98887−0.728240.768280.93950−0.34599−0.562341.13597−1.492762.285410.78178
−0.08140−0.790720.347790.93385−0.429222.04042−2.16391−2.11581−5.234781.89304
1.035532.20559−1.19942−0.253754.23484−2.97650−0.498152.359471.864410.00475
1.08386−0.08465−4.622051.89992−1.128380.671041.69318−0.12918−1.456361.69665
1.95516−0.009380.426711.181303.17951−0.22362−1.945400.525710.363800.93793
−0.81250−1.461810.45469−0.475501.53896−0.51577−0.486602.353530.27682−2.50678
0.849472.803180.02203−1.171501.172021.347842.65983−0.14394−0.23317−0.40371
0.388280.480281.73427−0.228681.341880.164491.231631.00170−2.31279−0.01233
2.260962.838080.65671−0.143690.042780.851350.273690.79188−0.091540.95816
−0.691941.187930.50421−0.30326−1.16415−0.44834−0.087791.75338−0.780861.34322
−0.22028−1.15622−1.783010.882631.467101.93162−1.400010.584440.852940.25483
1.359321.711790.19653−0.238940.10009−0.207121.631830.095071.98403−0.03795
0.593510.730652.28163−0.972200.79682−0.470301.608632.124080.226072.13644
−1.35033−0.428551.252651.10710−0.677690.356950.48159−0.28820
Table A3. Dow Jones Daily Returns (1 November 2024–31 October 2025).
Table A3. Dow Jones Daily Returns (1 November 2024–31 October 2025).
−0.006130.010220.03572−0.000010.005940.00691−0.008630.00108−0.00472−0.00699
−0.00127−0.002780.003220.010640.009710.009930.00277−0.003080.00422−0.00286
−0.001710.00690−0.00552−0.00275−0.00539−0.00347−0.00224−0.00531−0.00196−0.00252
−0.00612−0.025850.000360.011760.001560.009090.00066−0.00770−0.00973−0.00069
−0.003570.00802−0.00060−0.004170.00251−0.016340.008550.005230.01654−0.00158
0.007760.012370.002970.00925−0.003160.006510.00306−0.003050.00377−0.00752
−0.002760.003020.00712−0.00280−0.009930.003770.00277−0.005050.00773−0.00370
0.000230.00160−0.01010−0.016950.000760.00368−0.00431−0.004460.01391−0.01482
−0.015520.01142−0.009940.00523−0.02079−0.01141−0.00199−0.013000.016530.00852
−0.006220.00922−0.000270.000760.014240.00010−0.00312−0.00365−0.016920.01005
−0.000280.00561−0.03977−0.05503−0.00912−0.008430.07870−0.024990.015640.00776
−0.00385−0.01733−0.01329−0.024830.026630.010710.012290.000500.002840.00746
0.003500.002060.01385−0.00239−0.009460.006980.00619−0.002880.02814−0.00636
−0.002120.006460.007840.00322−0.00268−0.01914−0.00003−0.006120.01780−0.00578
0.002780.001290.000840.00506−0.00216−0.002550.01047−0.000030.00246−0.00003
0.00238−0.017920.00752−0.00704−0.001050.000830.008880.01191−0.002470.00941
0.009970.006290.00908−0.000240.00774−0.00942−0.003730.004920.00433−0.00625
0.00199−0.009810.005260.00519−0.00320−0.000430.004050.01141−0.007030.00465
−0.00143−0.00456−0.00385−0.00743−0.012290.01342−0.001400.00184−0.005080.00471
−0.004540.011000.01043−0.000250.00078−0.000760.000230.00036−0.003400.01890
−0.007650.002990.003240.00157−0.00202−0.00547−0.000540.00773−0.004830.00251
0.00431−0.004820.01356−0.005940.00107−0.002740.005690.002700.003750.00143
−0.00191−0.00370−0.003770.006530.001490.001770.000930.001690.00513−0.00135
−0.00197−0.00003−0.00522−0.018960.012930.00440−0.00037−0.006510.005190.01117
0.00467−0.007120.003100.010110.007150.00340−0.00156−0.002310.00086

References

  1. Ibragimov, M.; Ibragimov, R.; Walden, J. Heavy-Tailed Distributions and Robustness in Economics and Finance; Springer: Cham, Switzerland, 2015; Volume 214. [Google Scholar]
  2. Guo, Z.Y. Heavy-tailed distributions and risk management of equity market tail events. J. Risk Control 2017, 4, 31–41. [Google Scholar] [CrossRef]
  3. Cortés, I.; Reyes, J.; Iriarte, Y.A. A Weighted Skew-Logistic Distribution with Applications to Environmental Data. Mathematics 2024, 12, 1287. [Google Scholar] [CrossRef]
  4. Ahmad, Z.; Mahmoudi, E.; Dey, S. A new family of heavy tailed distributions with an application to the heavy tailed insurance loss data. Commun. Stat.-Simul. Comput. 2022, 51, 4372–4395. [Google Scholar] [CrossRef]
  5. Azzalini, A. A class of distributions which includes the normal ones. Scand. J. Stat. 1985, 12, 171–178. [Google Scholar]
  6. Azzalini, A.; Capitanio, A. The Skew-Normal and Related Families; Cambridge University Press: New York, NY, USA, 2014. [Google Scholar]
  7. Henze, N. On a Skew-t Distribution. Scand. J. Stat. 1986, 13, 271–275. [Google Scholar]
  8. Hasan, A.; Ning, W.; Gupta, A. A New Approach for the Skew t Distribution with Applications to Environmental Data. Adv. Appl. Stat. 2016, 49, 117–136. [Google Scholar] [CrossRef]
  9. Arellano-Valle, R.B.; Azzalini, A. The centred parameterization and related quantities of the skew-t distribution. J. Multivar. Anal. 2013, 113, 73–90. [Google Scholar] [CrossRef]
  10. Tagle, F.; Castruccio, S.; Genton, M.G. A hierarchical bi-resolution spatial skew-t model. Spat. Stat. 2020, 35, 100398. [Google Scholar] [CrossRef]
  11. Galarza, C.E.; Matos, L.A.; Castro, L.M.; Lachos, V.H. Moments of the doubly truncated selection elliptical distributions with emphasis on the unified multivariate skew-t distribution. J. Multivar. Anal. 2022, 189, 104944. [Google Scholar] [CrossRef]
  12. Hasan, A. A Study of Non-Central Skew t Distributions and Their Applications in Data Analysis and Change Point Detection. Ph.D. Thesis, Bowling Green State University, Bowling Green, OH, USA, 2013. [Google Scholar]
  13. Azzalini, A.; Capitanio, A. Distributions generated by perturbation of symmetry with emphasis on a multivariate Skew t-distribution. J. R. Stat. Soc. Ser. B Stat. Methodol. 2003, 65, 367–389. [Google Scholar] [CrossRef]
  14. Stephens, M.A. EDF statistics for goodness of fit and some comparisons. J. Am. Stat. Assoc. 1974, 69, 730–737. [Google Scholar] [CrossRef]
  15. Stephens, M.A. Tests Based on EDF Statistics. In Goodness-of-Fit Techniques; D’Agostino, R.B., Stephens, M.A., Eds.; Marcel Dekker: New York, NY, USA, 1986; pp. 97–193. [Google Scholar]
  16. Maghami, M.; Bahrami, M. Goodness of Fit Test for the Skew-T Distribution. J. Math. Comput. Sci. 2015, 14, 274–283. [Google Scholar] [CrossRef]
  17. Hasan, A.; Ning, W.; Gupta, A.K. An information-based Approach to the Change-point Problem of the Non-central Skew t Distribution with Applications to Stock Market Data. Seq. Anal. 2014, 33, 458–474. [Google Scholar] [CrossRef]
  18. Kim, H.J. On a Skew-t Distribution. J. Korean Commun. Stat. 2001, 8, 867–873. [Google Scholar]
  19. Hasan, A.; Chen, Y. On the Modified Information-Based Approach to the Change Point Detection (CPD) Problem under the Non-Central Skew t Distribution. J. Stat. Theory Pract. 2025, 19, 54. [Google Scholar] [CrossRef]
  20. Rizzo, M.L. A New Rotation Invariant Goodness-of-Fit Test. Ph.D. Thesis, Bowling Green State University, Bowling Green, OH, USA, 2002. [Google Scholar]
  21. Sźekely, G.J. E-statistics: Energy of Statistical Samples; Technical Report 03-05; BGSU, Department of Mathematics and Statistics: Bowling Green, OH, USA, 2000. [Google Scholar]
  22. Opperman, L.; Ning, W. Goodness-of-fit test for skew normality based on energy statistics. Random Oper. Stoch. Equ. 2020, 28, 227–236. [Google Scholar] [CrossRef]
  23. Ofosuhene, P. The Energy Goodness-of-Fit Test for the Inverse Gaussian Distribution. Ph.D. Thesis, Bowling Green State University, Bowling Green, OH, USA, 2020. [Google Scholar]
  24. Njuki, J.; Avallone, R. Energy Statistic-Based Goodness-of-Fit Test for the Lindley Distribution with Application to Lifetime Data. Stats 2025, 8, 87. [Google Scholar] [CrossRef]
  25. Móri, F.T.; Sźekely, G.J.; Rizzo, M.L. On energy tests of normality. J. Stat. Plan. Inference 2021, 213, 1–15. [Google Scholar] [CrossRef]
  26. Sźekely, G.J.; Rizzo, M. A new test for multivariate normality. J. Multivar. Anal. 2005, 93, 58–80. [Google Scholar] [CrossRef]
  27. Rizzo, M.L. A Test of Homogeneity for Two Multivariate Populations; American Statistical Association: Alexandria, VA, USA, 2003. [Google Scholar]
  28. Sźekely, G.J.; Rizzo, M.L. Testing for Equal Distributions in high Dimension. InterStat 2004, 5, 1249–1272. [Google Scholar]
  29. Sźekely, G.J.; Rizzo, M.L. The Energy of Data and Distance Correlation, 1st ed.; Chapman and Hall: London, UK, 2023. [Google Scholar]
  30. Rizzo, M.L. New goodness-of-fit tests for Pareto distributions. ASTIN Bull. J. IAA 2009, 39, 691–715. [Google Scholar] [CrossRef]
  31. Njuki, J.; Ning, W. Energy statistic-based modified information criterion for detecting the change in distribution. J. Appl. Stat. 2025, 1–23. [Google Scholar] [CrossRef]
  32. Njuki, J.M. Nonparametric Sequential tests for Change Point Analysis Using Energy Statistics. Ph.D. Thesis, Bowling Green State University, Bowling Green, OH, USA, 2022. [Google Scholar]
  33. Matterson, D.S.; James, N.A. A nonparametric Approach for Multiple Change Point Analysis of Multivariate Data. J. Am. Stat. Assoc. 2014, 109, 334–345. [Google Scholar] [CrossRef]
  34. Kim, A.Y.; Marzban, C.; Percival, D.B.; Stuetzle, W. Using labeled data to evaluate change detectors in a multivariate streaming environment. Signal Process. 2009, 89, 2529–2536. [Google Scholar] [CrossRef]
  35. Sźekely, G.J.; Rizzo, M. A Class of Statistical Based on Distances. J. Stat. Plan. Inference 2013, 143, 1249–1272. [Google Scholar] [CrossRef]
  36. Sźekely, G.J.; Rizzo, M.L. The Energy of Data. Annu. Rev. Stat. Its Appl. 2017, 4, 447–479. [Google Scholar] [CrossRef]
  37. Azzalini, A.; Genton, M.G. Robust likelihood methods based on the skew-t and related distributions. Int. Stat. Rev. 2008, 76, 106–129. [Google Scholar] [CrossRef]
  38. Azzalini, A.; Arellano-Valle, R.B. Maximum penalized likelihood estimation for skew-normal and skew-t distributions. J. Stat. Plan. Inference 2013, 143, 419–433. [Google Scholar] [CrossRef]
  39. Jones, M.C. A family of distributions on the real line with four parameters. In Statistical Models and Methods for Financial Markets; Bali, T.G., Lim, E., Eds.; Springer: New York, NY, USA, 2006; pp. 75–93. [Google Scholar]
  40. Cook, R.D.; Weisberg, S. Bayesian Density Estimation Using Skew Student-t-Normal Mixtures: An Introduction to Regression Graphics; John Wiley and Sons: New York, NY, USA, 1994. [Google Scholar]
  41. Macrotrends. Apple—Stock Price History. 2025. Available online: https://www.macrotrends.net/stocks/charts/AAPL/apple/stock-price-history (accessed on 10 March 2025).
Figure 1. Azzalini’s standard Skew-t density curves when the skewness parameter, (a) fixed α = 1 , and the degrees of freedom, ν , varies over { 1 ,   5 ,   10 } and (b) fixed degrees of freedom, ν = 10 , varies the skewness parameter α over { 5 ,   0 ,   5 } .
Figure 1. Azzalini’s standard Skew-t density curves when the skewness parameter, (a) fixed α = 1 , and the degrees of freedom, ν , varies over { 1 ,   5 ,   10 } and (b) fixed degrees of freedom, ν = 10 , varies the skewness parameter α over { 5 ,   0 ,   5 } .
Mathematics 13 03833 g001
Figure 2. Critical values for various parameter values for sample size n = 50 at levels of significance L O S = 0.01 , 0.025 , 0.05 , 0.10 .
Figure 2. Critical values for various parameter values for sample size n = 50 at levels of significance L O S = 0.01 , 0.025 , 0.05 , 0.10 .
Mathematics 13 03833 g002
Figure 3. Histogram, density curves, and CDFs of the body mass index (BMI) of 102 male Australian athletes. (a) Histogram of BMI with density estimates of Skew-t, Skew-normal, Skew-Cauchy, and normal. (b) Empirical and theoretical Skew-t CDFs.
Figure 3. Histogram, density curves, and CDFs of the body mass index (BMI) of 102 male Australian athletes. (a) Histogram of BMI with density estimates of Skew-t, Skew-normal, Skew-Cauchy, and normal. (b) Empirical and theoretical Skew-t CDFs.
Mathematics 13 03833 g003
Figure 4. Histogram, density curves, and CDFs of Apple’s daily returns with 248 observations. (a) Histogram of Apple’s daily returns with density curves of the fitted Skew-t, Skew-normal, Skew-Cauchy, and normal distributions. (b) Empirical and theoretical Skew-t CDFs.
Figure 4. Histogram, density curves, and CDFs of Apple’s daily returns with 248 observations. (a) Histogram of Apple’s daily returns with density curves of the fitted Skew-t, Skew-normal, Skew-Cauchy, and normal distributions. (b) Empirical and theoretical Skew-t CDFs.
Mathematics 13 03833 g004
Figure 5. Histogram, density curves, and CDFs of the Dow Jones daily returns with 249 observations. (a) Histogram of Dow Jones daily returns with density estimates of Skew-t, Skew-normal, Skew-Cauchy, and normal. (b) Empirical and theoretical Skew-t CDFs.
Figure 5. Histogram, density curves, and CDFs of the Dow Jones daily returns with 249 observations. (a) Histogram of Dow Jones daily returns with density estimates of Skew-t, Skew-normal, Skew-Cauchy, and normal. (b) Empirical and theoretical Skew-t CDFs.
Mathematics 13 03833 g005
Table 1. Simulated critical values for energy statistic of Skew-t distribution.
Table 1. Simulated critical values for energy statistic of Skew-t distribution.
(α, ν)
n(1, 5)(0, 5)(−1, 5)(1, 10)(0, 10)(−1, 10)(1, 30)(0, 30)(−1, 30)
503.78563.86634.07812.04411.95961.91431.36241.33321.3421
1004.57424.66354.64012.34162.33142.34431.65841.58151.6448
1505.39135.69105.34852.82912.86422.82792.02392.02912.0270
2006.38936.69866.39463.34143.48783.37102.40032.41722.3981
Table 2. Simulated size of the test for Skew-t distribution.
Table 2. Simulated size of the test for Skew-t distribution.
(α, ν)
n(1, 5)(0, 5)(−1, 5)(1, 10)(0, 10)(−1, 10)(1, 30)(0, 30)(−1, 30)
500.05080.05320.05130.04760.05080.05130.05100.04840.0540
1000.04840.04760.04920.05110.04870.05030.05010.05580.0488
1500.04960.05060.05240.05130.05090.04910.04780.04920.0508
2000.05030.04940.05060.05050.04980.05010.04970.04960.0499
Table 3. Power comparison with n = 50 ,   100 ,   150 ,   200 .
Table 3. Power comparison with n = 50 ,   100 ,   150 ,   200 .
DistributionSample Size nEnergyK-SKuiperCvMWatsonA-D
χ 2 ( 2 ) 500.83690.22480.21340.28090.24600.3878
1000.89330.29240.28320.38500.32980.5061
1500.95590.40180.36380.51030.42880.6223
2000.97150.49730.44180.61840.50690.7270
t 10 500.10660.03420.03960.03320.03820.0372
1000.17360.03560.04180.03920.03800.0384
1500.22680.03600.03980.03340.03820.0378
2000.27900.03870.04100.04130.03930.0397
Exp ( 1 ) 500.84130.26400.24910.30690.26920.4092
1000.91520.50900.49240.55380.51400.6128
1500.96550.64960.62240.70120.64860.7702
2000.98480.74390.71690.80120.74210.8601
Shash ( 3 ,   1 ) 500.57340.11450.12920.13450.13020.1545
1000.60450.21310.21540.25460.23280.261
1500.64010.30320.30720.38440.34980.3958
2000.67490.41660.41960.51840.47420.5261
G T ( 3 ,   1 ) 500.95960.11280.11660.13060.11880.3350
1000.99160.15620.18600.20040.19660.4480
1501.00000.23060.27560.32280.32320.5816
2001.00000.30300.37980.42780.42920.6720
K w C W G
(0.1, 3, 1, 3, 2)
500.09300.07500.08750.07600.07700.0810
1000.10800.09050.08850.07750.07950.0855
1500.11450.10150.11700.08450.08500.0950
2000.12900.10800.11600.08950.09150.1045
L N ( 0.5 , 1 ) 500.97010.33310.19500.39880.23040.4611
1000.99310.336910.20920.39740.24410.4644
1500.99950.34490.21790.41020.25860.4775
2000.99980.36390.24090.44290.28180.5155
Table 4. MLE, LogL, AIC, and SIC values for BMI dataset.
Table 4. MLE, LogL, AIC, and SIC values for BMI dataset.
Parameter/ModelSkew-tNormalSkew-CauchySkew-Normal
μ 21.649023.903622.788920.8765
σ 2.65702.74031.37624.0610
α 1.6421-0.51263.2992
ν 4.5503---
LogL−236.0511−248.0613−247.5519−237.9670
AIC480.1022500.1227501.1037481.9341
SIC490.6020505.3726508.9787489.8090
LRT p-value-≪0.0001≪0.00010.0502
Table 5. MLE, LogL, AIC, and SIC values for Apple stock daily return dataset.
Table 5. MLE, LogL, AIC, and SIC values for Apple stock daily return dataset.
Parameter/ModelSkew-tNormalSkew-CauchySkew-Normal
μ 0.52540.27490.17981.3519
σ 1.15141.42590.76551.7858
α −0.2285-0.1191−1.1688
ν 5.3556---
LogL−431.3161−440.3975−461.9316−438.1367
AIC870.6321884.795929.8633882.2733
SIC884.6858891.8219940.4036892.8136
LRT p-value-0.0001≪0.00010.0002
Table 6. MLE, LogL, AIC, and SIC values for Dow Jones daily return data.
Table 6. MLE, LogL, AIC, and SIC values for Dow Jones daily return data.
Parameter/ModelSkew-tNormalSkew-CauchySkew-Normal
μ 0.00110.0006−0.0002−0.0063
σ 0.00630.01070.00460.0127
α −0.0913-0.13050.9420
ν 3.0501---
LogL822.8359776.2823801.3888778.9004
AIC−1637.6720−1548.5650−1596.778−1551.8010
SIC−1623.6020−1541.5300−1586.225−1541.2480
LRT p-value-≪0.0001≪0.0001≪0.0001
Table 7. Summary of test statistics and corresponding p-values of BMI data.
Table 7. Summary of test statistics and corresponding p-values of BMI data.
TestEnergyK-SKuiperCvMWatsonA-D
Statistic value1.71090.04540.87310.02950.02900.2321
p-value0.58400.98470.98470.98270.98400.9807
Table 8. Summary of test statistics and p-values of Apple daily return data.
Table 8. Summary of test statistics and p-values of Apple daily return data.
TestEnergyK-SKuiperCvMWatsonA-D
Statistic value3.96350.03571.05760.02910.02900.1737
p-value0.54730.82600.81670.93810.93800.9913
Table 9. Summary of test statistics and corresponding p-values of Dow Jones daily return data.
Table 9. Summary of test statistics and corresponding p-values of Dow Jones daily return data.
TestEnergyK-SKuiperCvMWatsonA-D
Statistic value9.71680.03400.98060.04040.04050.2878
p-value0.50120.81500.91300.89450.88750.8680
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Njuki, J.; Hasan, A.M. A New Goodness-of-Fit Test for Azzalini’s Skew-t Distribution Based on the Energy Distance Framework with Applications. Mathematics 2025, 13, 3833. https://doi.org/10.3390/math13233833

AMA Style

Njuki J, Hasan AM. A New Goodness-of-Fit Test for Azzalini’s Skew-t Distribution Based on the Energy Distance Framework with Applications. Mathematics. 2025; 13(23):3833. https://doi.org/10.3390/math13233833

Chicago/Turabian Style

Njuki, Joseph, and Abeer M. Hasan. 2025. "A New Goodness-of-Fit Test for Azzalini’s Skew-t Distribution Based on the Energy Distance Framework with Applications" Mathematics 13, no. 23: 3833. https://doi.org/10.3390/math13233833

APA Style

Njuki, J., & Hasan, A. M. (2025). A New Goodness-of-Fit Test for Azzalini’s Skew-t Distribution Based on the Energy Distance Framework with Applications. Mathematics, 13(23), 3833. https://doi.org/10.3390/math13233833

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop