A New Goodness-of-Fit Test for Azzalini’s Skew-t Distribution Based on the Energy Distance Framework with Applications

Njuki, Joseph; Hasan, Abeer M.

doi:10.3390/math13233833

Open AccessArticle

A New Goodness-of-Fit Test for Azzalini’s Skew-t Distribution Based on the Energy Distance Framework with Applications

by

Joseph Njuki

¹

and

Abeer M. Hasan

^2,*

¹

Department of Mathematics and Statistics, Coastal Carolina University, Conway, SC 29526, USA

²

Department of Mathematics and Statistics, North Carolina A&T State University, Greensboro, NC 27411, USA

^*

Author to whom correspondence should be addressed.

Mathematics 2025, 13(23), 3833; https://doi.org/10.3390/math13233833 (registering DOI)

Submission received: 29 September 2025 / Revised: 21 November 2025 / Accepted: 26 November 2025 / Published: 29 November 2025

(This article belongs to the Special Issue Advances in Flexible Parametric Distributions for Modeling Skewness and Kurtosis)

Download

Browse Figures

Versions Notes

Abstract

In response to the growing need for flexible parametric models for skewed and heavy-tailed data, this paper introduces a novel goodness-of-fit test for the Skew-t distribution, a widely used flexible parametric probability distribution. Traditional methods often fail to capture the complex behavior of data in fields such as engineering, public health, and the social sciences. Our proposed test, based on energy statistics, provides practitioners with a robust and powerful tool for assessing the suitability of the Skew-t distribution for their data. We present a comprehensive methodological evaluation, including a comparative study that highlights the advantages of our approach over traditional tests. The results of our simulation studies demonstrate a significant improvement in power, leading to more reliable inference. To further showcase the practical utility of our method, we apply the proposed test to three real-world datasets, offering a valuable contribution to both the theoretical and applied aspects of statistical modeling for non-normal data.

Keywords:

Skew-t distribution; Goodness-of-Fit (GOF); energy statistics; skewness; Empirical Distribution Function (EDF); Probability Density Function (PDF); Maximum Likelihood Estimation (MLE); Cumulative Density Function (CDF); Akaike information criterion (AIC); Schwarz information criterion (SIC)

MSC:

62F03; 62E20; 62E10; 62P05

1. Introduction

Skewed and heavy-tailed data are prevalent in various applied domains, including econometrics, environmental science, and risk analysis. Income distributions, housing prices, and insurance claims often display asymmetry and excess kurtosis; the tail behavior encodes meaningful extremes, such as financial losses or contaminant spikes, that should not be dismissed as outliers; see, for example, Ibragimov et al. [1], Guo [2], Cortés et al. [3], and Ahmad et al. [4].

A natural starting point for modeling asymmetry is the Skew-normal distribution introduced by Azzalini [5] (see also [6]). By augmenting the normal distribution with a skewness parameter, the Skew-normal preserves analytical tractability while allowing for controlled departures from symmetry. However, because the Skew-normal remains light-tailed, it is ill-suited to settings where leptokurtosis is intrinsic to the data-generating process.

To address heavy tails alongside asymmetry, the Skew-t family extends the t distribution by introducing a skewness parameter. An influential approach views Skew-t random variables as scale mixtures of Skew-normal and Chi-square variables [5,7], a perspective adopted and elaborated by several authors, including Hasan et al. [8]. Related extensions, such as extended Skew-t (EST) and alternative parameterizations, further enhance the modeling flexibility (see, for example, [9]). Empirically, Skew-t models have seen broad application, from financial time series and risk assessment to environmental monitoring and robust analysis under truncation or censoring [10,11]. Non-central variants expand the toolkit and have been studied in detail, with their properties and use cases documented in Hasan et al. [8] and Hasan [12].

Despite this progress, the assessment of the Skew-t model remains essential. In practice, misspecification of tail weight or asymmetry can distort inference on extremes, dependence, and risk. This motivates rigorous goodness-of-fit (GoF) procedures tailored to Skew-t families—methods that can diagnostically test statistical adequacy against alternatives that differ in tail behavior, asymmetry, or both.

In this paper, we develop an energy-based goodness-of-fit test for Azzalini’s standard Skew-t distribution proposed in Azzalini and Capitanio [6] and defined as a random variable with probability density function (PDF) that takes the following form:

f_{ν, α} (x) = 2 g_{ν} (x) G_{ν + 1} (α x \sqrt{\frac{ν + 1}{ν + x^{2}}}),

(1)

where g and G are the PDF and the cumulative CDF of the standard t distribution. The parameters

α

and

ν

are referred to as the skewness parameter and the degrees of freedom, respectively. We will denote this density by

S t (ν, α)

. The special case for

α = 0

reduces to the standard Student t distribution with

ν

degrees of freedom. The CDF of this Skew-t distribution does not have a closed form.

A scale–location extension of this definition has been proposed by adding a location parameter

ξ

and a scale parameter

ω

by replacing x in Equation (1) by

(\frac{x - ξ}{ω})

. The expected value of the scale–location Skew-t variable can be obtained by

E (X) = ξ + \frac{δ \times Γ (\frac{1}{2} (ν - 1))}{Γ (\frac{ν}{2})} \sqrt{\frac{ν}{π}},

(2)

where

ν > 1

,

δ = \frac{α}{\sqrt{1 + α^{2}}}

, and

Γ (z) = \int_{0}^{\infty} t^{z - 1} e^{- t} d t

. The reader is referred to Azzalini and Capitanio [6,13] for more details on the theoretical properties and alternative parameterizations of the Skew-t distribution. For simplicity, we will refer to Azzalini’s Skew-t distribution as the Skew-t distribution for the remainder of this paper unless otherwise stated.

Without loss of generality, we will focus on the standard Skew-t distribution in our construction of the goodness-of-fit test, since an arbitrary four-parameter Skew-t density can be converted to the standard one via a linear transformation. Figure 1 provides density curves of the standard Skew-t distribution for varying degrees of freedom (

ν

) when the skewness parameter

α = 1

is held fixed. This illustrates the role of each parameter as the parameter

α

controls the skewness of the density, and the parameter

ν

controls the thickness of the tail.

There is limited research on the goodness-of-fit test for Azzalini’s Skew-t distributions, except those based on empirical distribution functions (EDFs), such as Kolmogorov–Sminorv, Cramer–von-Mises, etc., discussed and modified in [14,15]. Recently, Maghami and Bahrami [16] proposed a goodness-of-fit test for Azzalini’s Skew-t distribution based on the correlation coefficient (r). Most studies related to Skew-t distributions focus on properties, extensions, generalizations, change point analysis, and applications, for example, [12,17,18,19], among many others. To address this research gap, we develop an energy-based goodness-of-fit test for the Skew-t distribution. Our test leverages the well-established properties of energy-based testing to achieve higher power for any given sample size and parameter combination

α

and

ν

.

In this article, we propose a new one-sample (univariate) energy goodness-of-fit test based on energy statistics proposed by [20,21]. In the most recent work, Refs. [22,23,24] proposed goodness-of-fit tests based on energy statistics for Skew-normal, Inverse Gaussian, and Lindley distributions, respectively. For a given sequence of independent random variables of size n and with a cdf G, the test statistic based on energy statistics will reject the null hypothesis that

F = G

for large values of the test statistic. If the null distribution, F, and the given data come from the same underlying distribution G, then the values of the test statistic are expected to be smaller. Furthermore, there have been numerous studies involving energy statistics such as testing for multivariate normality [25,26], testing for equality of distributions [27,28], one-sample goodness-of-fit tests [22,24,29,30], and change point analysis [31,32,33,34], among many others.

The energy distance is a statistical distance between the distributions of random vectors that characterizes the equality of distributions; see, for example, [21,29,35]. The concept of energy statistics described by Sźekely [21] is based on the notion of Newton’s gravitational potential energy, which is a function of the distance between two bodies. The idea of energy statistics, therefore, is to consider statistical observations as heavenly bodies governed by a statistical potential energy, which is zero if and only if an underlying statistical null hypothesis is true, see, for example, [29,36].

Definition 1.

Sźekely and Rizzo [35] defined the energy distance between distributions of two independent and univariate random samples X and Y with finite expectations as follows:

E (X, Y) = 2 E | X - Y | - E | X - X^{'} | - E | Y - Y^{'} | \geq 0,

(3)

X \overset{d}{=} X^{'}

,

Y \overset{d}{=} Y^{'}

, and equality holds if and only if X and Y are identically distributed.

1.1. Existence and Uniqueness of the MLEs for Azzalini’s Skew-t Distribution

Azzalini and Genton [37] presented a detailed discussion of the existence and uniqueness of the maximum likelihood estimates (MLEs) for Azzalini’s Skew-t distribution. They concluded that the distribution is generally robust, with the profile log-likelihood for the skewness parameter

α

being unimodal and free of stationary points at

α = 0

. The Fisher information matrix remains non-singular for all finite degrees of freedom, ensuring the existence and uniqueness of the MLE under regular conditions. Simulation studies confirm that the MLEs for skewness and tail parameters (degrees of freedom) are well-behaved for moderate to large samples, while likelihood-based adjustments such as the deviance approach can mitigate rare boundary issues in small samples (see Azzalini and Genton [37] and Azzalini and Capitanio [6] for more details).

Recent advances have addressed the instability and divergence of the MLE for

α

in small samples. Azzalini and Arellano-Valle [38] introduced a penalized log-likelihood framework that ensures robust, finite, and unique estimates for

α

, even in small or multivariate settings. In our computations, we adopted this penalized likelihood approach to enhance robustness and avoid instability in parameter estimation for the Skew-t family.

1.2. Motivation and Scientific Contribution

The Skew-t distribution stands out as a flexible and reliable tool for modeling asymmetric and heavy-tailed data. Its regular profile likelihood and robust MLE properties make it especially suitable for general-purpose robust inference in both univariate and multivariate settings. In this paper, we propose a procedure that is superior for the goodness-of-fit test for the Skew-t distribution based on energy distance statistics (Sźekely and Rizzo [35]) and the definition of Azzalini’s Skew-t distribution (Azzalini and Capitanio [13]). Unlike the proposed method based on energy statistics, many existing methods depend on the distribution function of random variables. Energy statistic-based tests have been shown to be typically more powerful against general alternatives than corresponding tests based on classical statistics (non-energy type), such as Kormogorov–Smirnov, correlation, Anderson–Darling, and Cramer–von-Mises. In addition, energy statistic-based tests have an invariance property with respect to any distance-preserving transformation of the dataset; see [29,33,36]. This enables studies that involve energy statistics to be extended to multivariate and high-dimensional settings.

In Section 2 of this article, we introduce a test procedure based on energy statistics for the goodness of fit of Azzalini’s Skew-t distribution and discuss its theoretical properties. We perform various simulations in Section 3 to compare the approach with other existing methods. In Section 4, we apply our method to three case studies. The conclusions are provided in Section 5. Limitations and future research are discussed in Section 6.

2. Proposed Energy-Based Goodness-of-Fit Test

We propose a one-sample univariate goodness-of-fit test based on the energy statistics proposed by [26,35] for the Skew-t distribution. The null hypothesis is that the data X follow the null distribution

F_{0},

which is a Skew-t distribution, against the alternative that the Skew-t distribution is a poor fit for the data.

Definition 2.

Let

X_{1}, \dots X_{n}

be a random sample from a univariate population with distribution F and let

x_{1}, \dots x_{n}

be the observed values of the random variables in the sample. Then, the one-sample energy statistic goodness-of-fit test for testing the hypothesis

H_{0} : F = F_{0}

vs

H_{a} : F \neq F_{0}

is defined as follows.

n E_{n} = n \{\frac{2}{n} \sum_{i = 1}^{n} E | x_{i} - X | - E | X - X^{'} | - \frac{1}{n^{2}} \sum_{i = 1}^{n} \sum_{j = 1}^{n} | x_{i} - x_{j} |\},

(4)

where X and

X^{'}

are independent and identically distributed variables with distribution

F_{0}

, and the expectations are taken with respect to the null distribution

F_{0}

.

The null hypothesis

F = F_{0},

is rejected for large values of the test statistic

n E_{n}

. Under the null hypothesis, the limiting distribution of

n E_{n}

is a quadratic quantity of the form

\sum_{j = 1}^{\infty} λ_{j} Z_{j}^{2}

such that

Z_{j}, j = 1, 2, \dots,

are i.i.d. standard normal random variables and

λ_{j}

are nonnegative constants that depend on the null distribution. Thus, the goodness-of-fit test can be implemented by finding the constants

λ_{j}

. In practice, this could be difficult, and we therefore resort to the use of empirical critical values of

n E_{n}

so that

P (n E_{n} > C_{α}) = α .

This fact is guaranteed since the test based on

n E_{n}

is a consistent goodness-of-fit test, see, for example, Sźekely and Rizzo [26] and Móri et al. [25].

The goodness-of-fit test statistic based on energy statistics is dependent on the derivation of the expected values of

| x - X |

and

| X - X^{'} |

where X and

X^{'}

are independent and identically distributed random variables from the null distribution

F_{0} .

Proposition 1.

Let

X \sim f (x)

where

f (x)

is the standard Skew-t density defined in Equation (1). Then, for any fixed

x \in R,

E | x - X | = \int_{R} | x - t | f (t) d t = 2 x F (x) - x + \frac{Γ (\frac{1}{2} (ν - 1))}{Γ (\frac{ν}{2})} δ \sqrt{\frac{ν}{π}} - 2 \int_{- \infty}^{x} t f (t) d t,

(5)

where

F (.)

is the CDF of the standard Skew-t distribution.

Proof of Proposition 1.

Let

X \sim f (x)

. Then, for every fixed real number x, we have

\begin{matrix} E | x - X | & = \int_{R} | x - t | f_{X} (t) d t \\ = \int_{- \infty}^{x} (x - t) f_{X} (t) d t + \int_{x}^{\infty} (t - x) f_{X} (t) d t \\ = x \int_{- \infty}^{x} f_{X} (t) d t - \int_{- \infty}^{x} t f_{X} (t) d t + \int_{x}^{\infty} t f_{X} (t) d t - x \int_{x}^{\infty} f_{X} (t) d t \\ = x F_{X} (x) - \int_{- \infty}^{x} t f_{X} (t) d t + (\int_{- \infty}^{\infty} t f_{X} (t) d t - \int_{- \infty}^{x} t f_{X} (t) d t) - x (1 - F_{X} (x)) \\ = 2 x F (x) - x + E (X) - 2 \int_{- \infty}^{x} t f_{X} (t) d t \\ = 2 x F (x) - x + \frac{Γ (\frac{1}{2} (ν - 1))}{Γ (\frac{ν}{2})} δ \sqrt{\frac{ν}{π}} - 2 \int_{- \infty}^{x} t f_{X} (t) d t, \end{matrix}

where the integral term can be numerically evaluated in R (version 4.4.2) using the command dst() available in the Azzalini sn package. □

Sometimes the derivation of the second term in Equation (4) may not be analytically feasible and, therefore, we can use the following approximation as suggested in [22].

Proposition 2

(Quantile-based Approximation). Let X and

X^{'}

be independent and identically distributed random variables with a well-defined cumulative distribution function,

F (x)

. Given that the quantile or inverse CDF function of X exists, we have the following.

E | X - X^{'} | = \frac{4}{m} \sum_{i = 1}^{m} y_{i} F^{- 1} (y_{i}) - \frac{2}{m} \sum_{i = 1}^{m} F^{- 1} (y_{i}),

(6)

where m is the number of equally sized sub-intervals of

[0, 1]

and

y_{i}

is chosen from the ith sub-interval. It is worth noting that Proposition 2 applies to all distribution functions.

Proof.

This proof is provided by Opperman and Ning [22]. □

3. Simulations and Results

This section presents a simulation-based assessment of the proposed goodness-of-fit (GoF) test for Azzalini’s Skew-t distribution, built upon the energy distance framework. We evaluated the test performance on varying values of the skewness parameter (

α

) and degrees of freedom (

ν

). Simulations are conducted under both the null hypothesis and multiple alternative distributions, and type I error and empirical power are computed.

In this simulation study, we calculate two key quantities. The size of the test (type I error) and the empirical power (1-type II error). We used R (R version 4.4.2) and the sn package, https://cran.r-project.org/web/packages/sn/refman/sn.html (accessed on 21 November 2025), to carry out the simulations. Due to the complexity of repeatedly evaluating the Skew-t CDF and performing numerical integration, all simulations were parallelized using the foreach and doParallel R packages. The simulations were distributed across 11 processor cores, significantly accelerating the test’s evaluation across various scenarios. The programming code is too lengthy to include in the paper. However, it can be obtained by contacting the authors via email.

3.1. The Univariate Energy Test Statistic

The univariate energy test statistic is derived from the expected pairwise distances between observations and a reference Skew-t distribution. We used the standard Azzalini’s Skew-t distribution in our development of the test statistic and rely on standardization to convert an arbitrary Skew-t to the standard one before applying the test. Let

F_{θ}

be the cumulative distribution function (CDF) of Azzalini’s standard Skew-t distribution with parameter vector

(α, ν)

. The test statistic for a sample

{x_{1}, \dots, x_{n}}

is

\begin{matrix} T_{n} & = n {\frac{2}{n} \sum_{i = 1}^{n} [2 x_{i} F (x_{i}) - x_{i} + \frac{Γ (\frac{1}{2} (ν - 1))}{Γ (\frac{ν}{2})} δ \sqrt{\frac{ν}{π}} - 2 \int_{- \infty}^{x_{i}} t f_{X} (t) d t] \\ - \frac{4}{m} \sum_{i = 1}^{m} y_{i} F^{- 1} (y_{i}) - \frac{2}{m} \sum_{i = 1}^{m} F^{- 1} (y_{i}) - \frac{1}{n^{2}} \sum_{i = 1}^{n} \sum_{j = 1}^{n} | x_{i} - x_{j} |} . \end{matrix}

(7)

where

δ = \frac{α}{\sqrt{1 + α^{2}}}

,

ν > 1

, and m is the number of equally sized sub-intervals of

[0, 1]

and

y_{i}

is chosen from the ith sub-interval.

This expression is approximated using the following.

A term involving the integral of $y \cdot f_{S T} (y; α, ν)$ up to each data point.
A quantile-based approximation for the inter-distributional expectation $E | Y - Y^{'} |$ .
A linear-order statistic approximation for the within-sample expectation $E | x_{i} - x_{j} |$ .

According to Rizzo [20], the last term of the test statistic

n E_{n}

in Equation (4) can be linearized in order to reduce the computational complexity of the test from

O (n^{2})

to

O (n log n)

, which is useful during extensive simulations and applications. Let

x_{(1)}, x_{(2)}, \dots, x_{(n)}

be the ordered sample of the random sample

x_{1}, x_{2}, \dots, x_{n}

. Then, the linearization of the double sum of the test

n E_{n}

is given as

\sum_{i = 1}^{n} \sum_{j = 1}^{n} | x_{i} - x_{j} | = 2 \sum_{k = 1}^{n} ((2 k - 1) - n) x_{(k)} .

(8)

3.2. Critical Value Simulation Under Varying $α$ and $ν$

In this section, we conduct a simulation study to estimate the critical values for the energy-based goodness-of-fit test under Azzalini’s Skew-t distribution. The critical value simulations are summarized in Table 1. We selected degrees of freedom

d f = 5, 10, 30

to account for different levels of heavy tail behavior and skewness coefficient

α = 1, 0, - 1

to cover the three possible scenarios of right skew, symmetry, and left skew. We used 5000 replicates in this study, with sample sizes

n = 50, 100, 150, 200

. Table 2 presents the results of the type I error simulations, illustrating that the empirical values are very close to the theoretical value of 0.05, regardless of the sample size and the parameter values chosen for each sample.

3.3. Type I Error Control

To explore the sensitivity of the test, we vary the skewness parameter

α \in {- 1, 0, 1}

and the degrees of freedom

ν \in {5, 10, 30}

. For each combination of

(α, ν)

, we

Generate $B = 5000$ samples of size $n = 50$ from the Skew-t distribution with parameters $(0, 1, α, ν)$ ;
Estimate the parameters $\hat{θ} = (\hat{ξ}, \hat{ω}, \hat{α}, \hat{ν})$ using the maximum penalized likelihood method available in the sn package;
Standardize the data to the standard Skew-t random variables and compute the energy goodness-of-fit statistic using the formula in Equation (7);
Determine the empirical 95th percentile critical value under the null hypothesis.

The empirical type I error is then evaluated across different parameter settings to confirm robustness.

Table 2 shows that the proposed energy-based test maintained the nominal significance level in all the configurations examined. For

n = 50

, the empirical type I error was close to 0.05, although slightly higher in extreme skewness or low degrees of freedom. As the sample size increased, the test stabilized and was consistently aligned with the theoretical level.

Figure 2a shows critical values with a varying skewness parameter,

α

, for

ν = 5

and sample size

n = 50 .

For each selected level of significance (

L O S = 0.01, 0.025, 0.05, 0.10

), the critical values seem to stabilize as the skewness parameter

α

progresses further from 0. At

α = 0

, we see a dramatic increase in the simulated critical value. Note that Azzalini’s Skew-t distribution reduces to the classic Student t distribution when

α = 0

. In this special case, Azzalini’s skewness t might be an overfitting model for the data, as the skewness parameter is no longer needed and the Student t distribution is a better fit. If

| α | > 2

, the critical value stabilizes. This implies that for a fixed predetermined confidence level, the actual value of the skewness parameter

α

has little to no effect on the critical value as long as

| α | > 2

; in other words, the data exhibit a noticeable skewness.

Figure 2b shows critical values with varying

ν

for

α = 1.0

and sample size

n = 50 .

When the degrees of freedom

ν = 1

, Azzalini’s Skew-t distribution reduces to Cauchy’s distribution, and this is why we see higher critical values for the goodness-of-fit test. As degrees of freedom increase

ν > 10

, the critical values for a predetermined significance level stabilize as depicted by the nearly horizontal lines in Figure 2b.

3.4. Power Analysis Under Various Alternatives

The simulation findings demonstrate the superior statistical power of the proposed energy-based goodness-of-fit,

n E_{n}

, test statistic compared to several established goodness-of-fit tests, including the Anderson–Darling (A-D), Cramér–von Mises (CvM), Watson, Kolmogorov–Smirnov (K-S), and Kuiper tests. The analysis was carried out on various alternative distributions with sample sizes (n) of 50, 100, 150, and 200.

For power studies, samples are drawn from alternative distributions that deviate from Azzalini’s Skew-t distribution family. These include the following:

Chi-square: This is asymmetric and heavy-tailed.
StudentorGosset’s t: This is symmetric and heavy-tailed.
Exponential: This has a lighter tail and positive skew.
SHASH: The SHASH (Sinh-Arcsinh) distribution discussed in [39] is a highly flexible statistical distribution defined by four parameters that separately control the location, scale, skewness, and kurtosis of a variable.
Generalized t (GT): This is to assess sensitivity to misspecification.
Log-normal: This is asymmetric and heavy-tailed.
KwCWG: This is the Kumaraswamy Complementary Weibull geometric probability distribution. This is a five-parameter density that is well-suited for modeling skewed data with heavy tails.

The empirical power of our proposed test can thus be obtained using the following algorithm.

Obtain the critical value under the Skew-t distribution assumption with parameters $(0, 1, α, ν)$ as explained in Section 3.3.
Generate a dataset $x_{1}, \dots, x_{n}$ from the desired alternative distribution.
Process the data as if they were from a Skew-t distribution and estimate the parameters $\hat{θ} = (\hat{ξ}, \hat{ω}, \hat{α}, \hat{ν})$ using the maximum penalized likelihood method available in the sn package.
Standardize the data to the standard Skew-t random variables $t_{1}, \dots, t_{n}$ , then compute the energy goodness-of-fit statistic using Equation (7).
Based on the energy goodness-of-fit statistic and the critical value in Step 1, determine whether or not the null hypothesis is rejected.
Repeat Steps 1 through 5 for B times and the empirical power is calculated as the proportion of times the null is rejected across $B = 5000$ repetitions.

The empirical powers for the classical EDF (non-energy) tests considered in this study are also obtained in a similar manner. Table 3 summarizes the results of the power comparison simulations. For each case, the bold font indicates the highest power achieved at the combination of the sample size and alternative distribution displayed in the row heading, for each test shown in the column heading.

3.5. Superior Performance of the $n E_{n}$ Test

As illustrated in Table 3, across all scenarios, the energy-based test exhibited the highest power. This dominance is particularly evident in cases of heavy-tailed and skewed distributions.

For the Log-normal ( $L N (0.5, 1)$ ) distribution, the $n E_{n}$ test achieved a power of 0.9701 at a sample size of just 50, far exceeding the next best test, Watson’s test, which had a power of 0.3988.
Against the GT ( $ν = 3, τ = 1$ ) distribution, the $n E_{n}$ test reached a power of 1.0000 for sample sizes of 150 and 200, indicating perfect detection in the simulation.
For the Exponential (Exp(1)) distribution, the power of $n E_{n}$ ranged from 0.8413 to 0.9848, consistently outperforming all other competitors.
For the KwCWG distribution, the power of the $n E_{n}$ test ranged from 0.0845 to 0.1170, with the energy test outperforming the other tests in three out of the four sample size settings. The overall small power in this comparison indicates that the tests are having difficulty distinguishing between the two distributions. This is expected as both distributions are suitable to model skewness and heavy tails. However, since the Skew-t distribution uses four parameters instead of five, it is more parsimonious.

The K−S and A−D tests performed poorly in almost all scenarios, especially for skewed and heavy-tailed data. The correlation-based test was competitive but slightly less sensitive to deviations in kurtosis. The energy test consistently ranked the best performing method in all conditions except a single case for the KwCWG distribution.

3.6. Effect of Sample Size and Parameters

As anticipated, the power for all tests increased with the sample size. For example, in the case of the SHASH distribution, the power of the

n E_{n}

test rose from 0.5734 for

n = 50

to 0.6749 for

n = 200

. It should be noted that for the standard

t_{10}

distribution, which is a special case of the Skew-t distribution when the skewness parameter

α = 0

, all tests showed very low power. The highest power achieved was only 0.2790 by the

n E_{n}

test at

n = 200

. This suggests that all tests have difficulty distinguishing the two distributions, although the

n E_{n}

test still has a relative advantage. The proposed energy-based test achieved the highest power under conditions of high skewness (

α = 5

) and low degrees of freedom (

ν = 3

). As

ν

increased, the power decreased slightly. This decrease in power is likely because the Skew-t distribution converges to the Skew-normal family as degrees of freedom,

ν \to \infty

, making it harder to distinguish the Skew-normal from the Skew-t distribution as the degrees of freedom increase.

3.7. Summary

In summary, the simulation results provide strong evidence that the energy-based statistic, $n E_{n}$ , is a more powerful goodness-of-fit test than the other methods considered in the study, especially for non-normal distributions with skewness or heavy tails. In general, the energy-based GoF test for the Skew-t distribution demonstrates

Robust control of type I error;
Superior power in a wide range of alternatives;
Flexibility for skewed, heavy-tailed, and multi-modal data.

Its simplicity and effectiveness make it a promising tool for model validation in applied settings. Although the proposed method relies on a Monte Carlo approximation for expectation terms, the computational cost is manageable and can be parallelized. The use of linearized summation and pre-simulated reference distributions further improves efficiency.

4. Real Data Applications

To evaluate the practical utility of the proposed test, we present three case studies with real-world datasets. For each case study, we use the penalized MLE to fit the data, compare the fit using information criteria, visually assess goodness-of-fit using histograms of the original data and superimposed density curves, and compare the CDF of the fitted Skew-t model to the empirical density curve. We compute p-values for the likelihood ratio test to illustrate that the Skew-t distribution provides a better fit to the data than the nested models (normal, Skew-Cauchy, and Skew-normal). We finally present the test statistic and p-value for our proposed energy-based test, along with those for the selected comparator tests.

Since the underlying distributions of these datasets are not exactly known in advance, we use the bootstrap algorithm to determine whether they come from Azzalini’s Skew-t distribution. The bootstrap procedure is used to approximate the p-value of the proposed test as given below.

Fit the real data $y_{1}, \dots, y_{n}$ with an Azzalini’s Skew-t distribution in Equation (1) and obtain the maximum likelihood estimates (MLEs) of $ξ, ω, α$ , and $ν$ from the Azzalini’s sn package available in R (version 4.4.2).
Use the formula in Equation (7) to calculate the energy goodness-of-fit statistic of the standardized data and denote it $T_{n}^{1}$ .
Simulate $x_{1}, \dots, x_{n}$ , a random sample of size $n,$ from the Azzalini Skew-t distribution with parameters specified as $\hat{ξ}, \hat{ω}, \hat{α}$ , and $\hat{ν}$ that were obtained in Step 1.
Standardize the data with the standard Skew-t distribution and compute the energy goodness-of-fit statistic for the simulated data using the formula in Equation (7) and denote this value as $T_{n}^{* 1}$ .
Repeat this process for B times and obtain B energy goodness-of-fit statistics and denote them by $T_{n}^{* 1}, \dots, T_{n}^{* B}$ .
The bootstrap p-value is therefore approximated as

$\hat{p} = \frac{1}{B} \sum_{b = 1}^{B} I (T_{n}^{* b} \geq T_{n}^{1}),$

where $I (\cdot)$ is an indicator function that takes the value of one when $T_{n}^{b *} \geq T_{n}^{1}$ and zero otherwise.

A similar procedure was applied for classical (EDF) tests to approximate their respective bootstrap p-values.

4.1. Case Study 1

The first dataset represents the body mass index (BMI) of 102 male Australian athletes (Cook and Weisberg [40]). The dataset is provided in Table A1 and is available in the dr package in R. This dataset was previously analyzed by Maghami and Bahrami [16], who used the correlation coefficient as a measure of goodness of fit for the Skew-t distribution. The dataset exhibits moderate skewness and potential heavy tails, making it an ideal candidate for Skew-t modeling.

4.2. Case Study 2

The second dataset consists of Apple Inc.’s closing prices. This is the daily rate of returns for Apple stock Macrotrends [41]. This dataset was used in the analysis by [19] to detect structural changes in the distribution using the MIC-based method. The dataset is obtained from [41], which provides historical stock price data. We limited the data to the range from 31 January 2019 to 24 January 2020 to avoid the heterogeneity introduced by change points. The stock price shows a general upward trend with some noticeable fluctuations. Given that stock prices exhibit time dependence, we transform the raw closing prices,

P_{t}

, into daily returns, defined as

R_{t} = \frac{P_{t + 1} - P_{t}}{P_{t}}

(9)

This transformation allows us to analyze relative price changes rather than absolute levels. The resulting data consist of 248 daily return observations, which we suspect to follow a Skew-t distribution. The filtered and transformed data are available in Table A2 in Appendix A.

4.3. Case Study 3

The third case study is the daily returns of the Dow Jones Industrial Average stock over the period of one year, from 1 November 2024 to 31 October 2025. This data is available in the Tidyquant package in R (version 4.4.2). Stock prices have frequently fluctuated this year due to tariff-related volatility, so we expect the data to exhibit heavy tails and outliers, making it an ideal candidate for Skew-t distributions. The data are then transformed using Equation (9) to correct for the dependence between observations. The transformed data consists of 249 observations and is available in Table A3 in Appendix A.

4.4. Model Fitting

The histograms in Figure 3, Figure 4 and Figure 5 reveal skewness; therefore, distributions such as the Skew-t, Skew-normal, Skew-Cauchy, and normal distributions are good candidates for modeling these datasets. Parameter estimation is conducted using the penalized maximum likelihood method. The results of the penalized maximum likelihood estimates (PMLEs), log-likelihood (LogL), Akaike information criterion (AIC), Schwarz information criterion (SIC), and corresponding p-value for the likelihood ratio test (LRT) against the Skew-t distribution are reported in Table 4, Table 5 and Table 6.

In all datasets, the Skew-t distribution yielded the lowest AIC values, indicating a better overall fit. Surprisingly, the Skew-normal distribution gave a lower SIC value for the body mass index (BMI) dataset. In addition, the associated p-value (0.0502) for the likelihood ratio test (LRT) indicates some evidence that the data may follow Skew-t distribution. Other likelihood ratio tests (LRTs) yielded small p-values, suggesting strong evidence in favor of Skew-t distribution.

In the first dataset (BMI), our proposed test statistic is 1.7109, with a corresponding p-value of 0.5840, supporting the assertion that the BMI data follow the Skew-t distribution. Classical tests considered in the study also supported the null hypothesis that body mass index (BMI) data can be modeled using the Skew-t distribution. In the second dataset, the test statistic for the proposed procedure is 3.9635, and the corresponding p-value is 0.5473. These results support the fact that the data follow the Skew-t distribution. Similar results are observed for the classical (EDF) tests, which suggest that the Skew-t distribution is an adequate model for the data. For the last dataset, the proposed test statistic is

9.7168

and its corresponding p-value is 0.5012, indicating that the Dow Jones daily returns follow a Skew-t distribution. We observe a similar conclusion from the classical (EDF) tests.

The results for all datasets are summarized in Table 7, Table 8 and Table 9. Furthermore, density estimates and empirical cumulative functions suggest that the Skew-t distribution fits the datasets adequately, as shown in Figure 3a,b for the BMI dataset, Figure 4a,b for the Apple stock prices dataset, and Figure 5a,b for the Dow Jones daily returns data, respectively.

4.5. Discussion

In the three case studies we presented in this article, the Skew-t distribution fits the data adequately and has the lowest AIC value among the distributions. This assertion is also well confirmed by the density and empirical CDF estimates as shown in Figure 3, Figure 4 and Figure 5. Through simulations, 2000 bootstrap samples are drawn from the Skew-t distribution with the parameter estimates specified in Table 7, Table 8 and Table 9, and the approximate p-values of the proposed test for the three case studies are obtained as 0.5840, 0.5473, and 0.5012, respectively. We therefore fail to reject the null hypothesis and conclude that the three datasets can be modeled with the Skew-t distribution.

The three case studies highlight the energy-based GOF test’s ability to validate Skew-t modeling in real-world scenarios, particularly when skewness and heavy tails are present. In addition, they illustrate the utility of energy-based approaches in detecting misfits in a range of candidate models.

5. Conclusions

In this paper, we developed a new goodness-of-fit test for Azzalini’s Skew-t distribution based on the energy distance framework. This method offers a flexible and powerful alternative to traditional EDF-based and correlation-based GoF tests. Our simulations demonstrate strong control of type I error and superior power in a wide range of alternatives, particularly those involving skewness, heavy tails, or multi-modal distributions. The proposed test performs reliably even in small to moderate sample sizes and is computationally feasible through bootstrap calibration and efficient implementation strategies.

We used simulation studies to explore the power of this test relative to classical alternative testing approaches under Azzalini’s Skew-t distribution. Our simulations show that the energy-based test performs better than other tests, regardless of the alternative distribution. We also used three distinct datasets to illustrate two main ideas: the importance of the Skew-t distribution for modeling skewed data with heavy tails, and the reliability of our test for detecting goodness of fit in practice. Our case studies with real data analysis confirm the practical value in model validation contexts where accurate distributional assumptions are critical.

6. Limitations and Future Research

The primary limitation of our GOF test is that it can be computationally demanding, especially if applied to a very large dataset. In this paper, we propose multiple solutions to optimize computation time and complexity, including leveraging parallel processing and approximations. Additional approaches to simplify and reduce compute time could be explored in another study. Future research directions include developing similar tests for other distributions, especially variations in the Skew-t distribution, such as the non-central Skew-t distribution and the unified Skew-t distribution, and their multivariate extensions. From a theoretical perspective, the properties of the proposed test statistic, such as asymptotic behavior, variance, and bias, could be studied. Also, exploring analytic approximations to the energy statistic under complex parametric families is a potential topic for future research in this area. In general, the energy-based goodness-of-fit test provides a robust and versatile tool for statistical modeling in both the theoretical and applied domains.

Author Contributions

Conceptualization, J.N. and A.M.H.; methodology, J.N. and A.M.H.; software, J.N. and A.M.H.; validation, J.N. and A.M.H.; formal analysis, J.N. and A.M.H.; investigation, J.N. and A.M.H.; resources, J.N. and A.M.H.; data curation, J.N. and A.M.H.; writing—original draft preparation, J.N. and A.M.H.; writing—review and editing, J.N. and A.M.H.; visualization, J.N. and A.M.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The source of each dataset is provided in the article. Additionally, cleaned and filtered datasets are provided in Appendix A.

Acknowledgments

The authors have reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

We present the two datasets used in our applications.

Table A1. Body Mass Index for 102 Male Australian Athletes.

22.46	23.88	23.68	23.15	22.32	24.02	23.29	25.11	22.81	26.25
21.38	22.52	26.73	23.57	25.84	24.06	23.85	25.09	23.84	25.31
19.69	26.07	25.50	23.69	26.79	25.61	25.06	24.93	22.96	20.69
23.97	24.64	25.93	23.69	25.38	22.68	23.36	22.44	22.57	19.81
21.19	20.39	21.12	21.89	29.97	27.39	23.11	21.75	20.89	22.83
22.02	20.07	20.15	21.24	19.63	23.58	21.65	25.17	23.25	32.52
22.59	30.18	34.42	21.86	23.99	24.81	21.68	21.04	23.12	20.76
23.13	22.35	22.28	23.55	19.85	26.51	24.78	33.73	30.18	23.31
24.51	25.37	23.67	24.28	25.82	21.93	23.38	23.07	25.21	23.25
22.93	26.86	21.26	25.43	24.54	27.79	23.58	27.56	23.76	22.01
22.34	21.07

Table A2. AAPL Daily Returns (31 January 2019–24 January 2020).

0.72012	0.04807	2.84050	1.71094	0.03445	−1.89394	−0.31005	−0.57509	0.86172	−0.41548
0.36433	−0.22248	0.29926	0.64354	−0.56386	1.11657	0.72845	0.05740	0.30976	−0.98359
1.05112	0.50295	−0.18198	−0.57540	−1.15746	0.23768	3.46422	1.12354	0.44221	1.11165
1.30082	1.02085	−0.79247	0.87386	3.68303	−2.07084	−1.20911	−1.03317	0.89941	0.13265
0.65176	0.67913	1.45367	0.68550	0.17404	0.66942	1.57361	−0.29985	0.56140	−0.83242
−0.04021	0.18102	0.01004	1.94730	0.35937	0.32866	1.44233	−0.15423	−0.90751	−0.47739
0.15174	−1.92561	4.90856	−0.65078	1.24313	−1.54428	−2.69570	0.01971	−1.07442	−1.76365
−5.81194	1.58303	1.19792	−0.43997	−0.56818	−3.12699	1.91710	−2.04716	−1.70697	−0.38406
−0.41348	−0.47691	0.51866	−1.81155	−1.01103	3.65839	1.61434	1.46818	2.66170	1.27794
1.15796	−0.31825	−0.02060	−0.72624	0.59666	2.35185	−0.29227	0.80356	−0.34092	−0.10061
−1.51576	2.16291	−0.03003	−0.91119	1.83408	0.58546	0.82869	−0.08806	−2.06140	0.60994
0.98887	−0.72824	0.76828	0.93950	−0.34599	−0.56234	1.13597	−1.49276	2.28541	0.78178
−0.08140	−0.79072	0.34779	0.93385	−0.42922	2.04042	−2.16391	−2.11581	−5.23478	1.89304
1.03553	2.20559	−1.19942	−0.25375	4.23484	−2.97650	−0.49815	2.35947	1.86441	0.00475
1.08386	−0.08465	−4.62205	1.89992	−1.12838	0.67104	1.69318	−0.12918	−1.45636	1.69665
1.95516	−0.00938	0.42671	1.18130	3.17951	−0.22362	−1.94540	0.52571	0.36380	0.93793
−0.81250	−1.46181	0.45469	−0.47550	1.53896	−0.51577	−0.48660	2.35353	0.27682	−2.50678
0.84947	2.80318	0.02203	−1.17150	1.17202	1.34784	2.65983	−0.14394	−0.23317	−0.40371
0.38828	0.48028	1.73427	−0.22868	1.34188	0.16449	1.23163	1.00170	−2.31279	−0.01233
2.26096	2.83808	0.65671	−0.14369	0.04278	0.85135	0.27369	0.79188	−0.09154	0.95816
−0.69194	1.18793	0.50421	−0.30326	−1.16415	−0.44834	−0.08779	1.75338	−0.78086	1.34322
−0.22028	−1.15622	−1.78301	0.88263	1.46710	1.93162	−1.40001	0.58444	0.85294	0.25483
1.35932	1.71179	0.19653	−0.23894	0.10009	−0.20712	1.63183	0.09507	1.98403	−0.03795
0.59351	0.73065	2.28163	−0.97220	0.79682	−0.47030	1.60863	2.12408	0.22607	2.13644
−1.35033	−0.42855	1.25265	1.10710	−0.67769	0.35695	0.48159	−0.28820

Table A3. Dow Jones Daily Returns (1 November 2024–31 October 2025).

−0.00613	0.01022	0.03572	−0.00001	0.00594	0.00691	−0.00863	0.00108	−0.00472	−0.00699
−0.00127	−0.00278	0.00322	0.01064	0.00971	0.00993	0.00277	−0.00308	0.00422	−0.00286
−0.00171	0.00690	−0.00552	−0.00275	−0.00539	−0.00347	−0.00224	−0.00531	−0.00196	−0.00252
−0.00612	−0.02585	0.00036	0.01176	0.00156	0.00909	0.00066	−0.00770	−0.00973	−0.00069
−0.00357	0.00802	−0.00060	−0.00417	0.00251	−0.01634	0.00855	0.00523	0.01654	−0.00158
0.00776	0.01237	0.00297	0.00925	−0.00316	0.00651	0.00306	−0.00305	0.00377	−0.00752
−0.00276	0.00302	0.00712	−0.00280	−0.00993	0.00377	0.00277	−0.00505	0.00773	−0.00370
0.00023	0.00160	−0.01010	−0.01695	0.00076	0.00368	−0.00431	−0.00446	0.01391	−0.01482
−0.01552	0.01142	−0.00994	0.00523	−0.02079	−0.01141	−0.00199	−0.01300	0.01653	0.00852
−0.00622	0.00922	−0.00027	0.00076	0.01424	0.00010	−0.00312	−0.00365	−0.01692	0.01005
−0.00028	0.00561	−0.03977	−0.05503	−0.00912	−0.00843	0.07870	−0.02499	0.01564	0.00776
−0.00385	−0.01733	−0.01329	−0.02483	0.02663	0.01071	0.01229	0.00050	0.00284	0.00746
0.00350	0.00206	0.01385	−0.00239	−0.00946	0.00698	0.00619	−0.00288	0.02814	−0.00636
−0.00212	0.00646	0.00784	0.00322	−0.00268	−0.01914	−0.00003	−0.00612	0.01780	−0.00578
0.00278	0.00129	0.00084	0.00506	−0.00216	−0.00255	0.01047	−0.00003	0.00246	−0.00003
0.00238	−0.01792	0.00752	−0.00704	−0.00105	0.00083	0.00888	0.01191	−0.00247	0.00941
0.00997	0.00629	0.00908	−0.00024	0.00774	−0.00942	−0.00373	0.00492	0.00433	−0.00625
0.00199	−0.00981	0.00526	0.00519	−0.00320	−0.00043	0.00405	0.01141	−0.00703	0.00465
−0.00143	−0.00456	−0.00385	−0.00743	−0.01229	0.01342	−0.00140	0.00184	−0.00508	0.00471
−0.00454	0.01100	0.01043	−0.00025	0.00078	−0.00076	0.00023	0.00036	−0.00340	0.01890
−0.00765	0.00299	0.00324	0.00157	−0.00202	−0.00547	−0.00054	0.00773	−0.00483	0.00251
0.00431	−0.00482	0.01356	−0.00594	0.00107	−0.00274	0.00569	0.00270	0.00375	0.00143
−0.00191	−0.00370	−0.00377	0.00653	0.00149	0.00177	0.00093	0.00169	0.00513	−0.00135
−0.00197	−0.00003	−0.00522	−0.01896	0.01293	0.00440	−0.00037	−0.00651	0.00519	0.01117
0.00467	−0.00712	0.00310	0.01011	0.00715	0.00340	−0.00156	−0.00231	0.00086

References

Ibragimov, M.; Ibragimov, R.; Walden, J. Heavy-Tailed Distributions and Robustness in Economics and Finance; Springer: Cham, Switzerland, 2015; Volume 214. [Google Scholar]
Guo, Z.Y. Heavy-tailed distributions and risk management of equity market tail events. J. Risk Control 2017, 4, 31–41. [Google Scholar] [CrossRef]
Cortés, I.; Reyes, J.; Iriarte, Y.A. A Weighted Skew-Logistic Distribution with Applications to Environmental Data. Mathematics 2024, 12, 1287. [Google Scholar] [CrossRef]
Ahmad, Z.; Mahmoudi, E.; Dey, S. A new family of heavy tailed distributions with an application to the heavy tailed insurance loss data. Commun. Stat.-Simul. Comput. 2022, 51, 4372–4395. [Google Scholar] [CrossRef]
Azzalini, A. A class of distributions which includes the normal ones. Scand. J. Stat. 1985, 12, 171–178. [Google Scholar]
Azzalini, A.; Capitanio, A. The Skew-Normal and Related Families; Cambridge University Press: New York, NY, USA, 2014. [Google Scholar]
Henze, N. On a Skew-t Distribution. Scand. J. Stat. 1986, 13, 271–275. [Google Scholar]
Hasan, A.; Ning, W.; Gupta, A. A New Approach for the Skew t Distribution with Applications to Environmental Data. Adv. Appl. Stat. 2016, 49, 117–136. [Google Scholar] [CrossRef]
Arellano-Valle, R.B.; Azzalini, A. The centred parameterization and related quantities of the skew-t distribution. J. Multivar. Anal. 2013, 113, 73–90. [Google Scholar] [CrossRef]
Tagle, F.; Castruccio, S.; Genton, M.G. A hierarchical bi-resolution spatial skew-t model. Spat. Stat. 2020, 35, 100398. [Google Scholar] [CrossRef]
Galarza, C.E.; Matos, L.A.; Castro, L.M.; Lachos, V.H. Moments of the doubly truncated selection elliptical distributions with emphasis on the unified multivariate skew-t distribution. J. Multivar. Anal. 2022, 189, 104944. [Google Scholar] [CrossRef]
Hasan, A. A Study of Non-Central Skew t Distributions and Their Applications in Data Analysis and Change Point Detection. Ph.D. Thesis, Bowling Green State University, Bowling Green, OH, USA, 2013. [Google Scholar]
Azzalini, A.; Capitanio, A. Distributions generated by perturbation of symmetry with emphasis on a multivariate Skew t-distribution. J. R. Stat. Soc. Ser. B Stat. Methodol. 2003, 65, 367–389. [Google Scholar] [CrossRef]
Stephens, M.A. EDF statistics for goodness of fit and some comparisons. J. Am. Stat. Assoc. 1974, 69, 730–737. [Google Scholar] [CrossRef]
Stephens, M.A. Tests Based on EDF Statistics. In Goodness-of-Fit Techniques; D’Agostino, R.B., Stephens, M.A., Eds.; Marcel Dekker: New York, NY, USA, 1986; pp. 97–193. [Google Scholar]
Maghami, M.; Bahrami, M. Goodness of Fit Test for the Skew-T Distribution. J. Math. Comput. Sci. 2015, 14, 274–283. [Google Scholar] [CrossRef]
Hasan, A.; Ning, W.; Gupta, A.K. An information-based Approach to the Change-point Problem of the Non-central Skew t Distribution with Applications to Stock Market Data. Seq. Anal. 2014, 33, 458–474. [Google Scholar] [CrossRef]
Kim, H.J. On a Skew-t Distribution. J. Korean Commun. Stat. 2001, 8, 867–873. [Google Scholar]
Hasan, A.; Chen, Y. On the Modified Information-Based Approach to the Change Point Detection (CPD) Problem under the Non-Central Skew t Distribution. J. Stat. Theory Pract. 2025, 19, 54. [Google Scholar] [CrossRef]
Rizzo, M.L. A New Rotation Invariant Goodness-of-Fit Test. Ph.D. Thesis, Bowling Green State University, Bowling Green, OH, USA, 2002. [Google Scholar]
Sźekely, G.J. E-statistics: Energy of Statistical Samples; Technical Report 03-05; BGSU, Department of Mathematics and Statistics: Bowling Green, OH, USA, 2000. [Google Scholar]
Opperman, L.; Ning, W. Goodness-of-fit test for skew normality based on energy statistics. Random Oper. Stoch. Equ. 2020, 28, 227–236. [Google Scholar] [CrossRef]
Ofosuhene, P. The Energy Goodness-of-Fit Test for the Inverse Gaussian Distribution. Ph.D. Thesis, Bowling Green State University, Bowling Green, OH, USA, 2020. [Google Scholar]
Njuki, J.; Avallone, R. Energy Statistic-Based Goodness-of-Fit Test for the Lindley Distribution with Application to Lifetime Data. Stats 2025, 8, 87. [Google Scholar] [CrossRef]
Móri, F.T.; Sźekely, G.J.; Rizzo, M.L. On energy tests of normality. J. Stat. Plan. Inference 2021, 213, 1–15. [Google Scholar] [CrossRef]
Sźekely, G.J.; Rizzo, M. A new test for multivariate normality. J. Multivar. Anal. 2005, 93, 58–80. [Google Scholar] [CrossRef]
Rizzo, M.L. A Test of Homogeneity for Two Multivariate Populations; American Statistical Association: Alexandria, VA, USA, 2003. [Google Scholar]
Sźekely, G.J.; Rizzo, M.L. Testing for Equal Distributions in high Dimension. InterStat 2004, 5, 1249–1272. [Google Scholar]
Sźekely, G.J.; Rizzo, M.L. The Energy of Data and Distance Correlation, 1st ed.; Chapman and Hall: London, UK, 2023. [Google Scholar]
Rizzo, M.L. New goodness-of-fit tests for Pareto distributions. ASTIN Bull. J. IAA 2009, 39, 691–715. [Google Scholar] [CrossRef]
Njuki, J.; Ning, W. Energy statistic-based modified information criterion for detecting the change in distribution. J. Appl. Stat. 2025, 1–23. [Google Scholar] [CrossRef]
Njuki, J.M. Nonparametric Sequential tests for Change Point Analysis Using Energy Statistics. Ph.D. Thesis, Bowling Green State University, Bowling Green, OH, USA, 2022. [Google Scholar]
Matterson, D.S.; James, N.A. A nonparametric Approach for Multiple Change Point Analysis of Multivariate Data. J. Am. Stat. Assoc. 2014, 109, 334–345. [Google Scholar] [CrossRef]
Kim, A.Y.; Marzban, C.; Percival, D.B.; Stuetzle, W. Using labeled data to evaluate change detectors in a multivariate streaming environment. Signal Process. 2009, 89, 2529–2536. [Google Scholar] [CrossRef]
Sźekely, G.J.; Rizzo, M. A Class of Statistical Based on Distances. J. Stat. Plan. Inference 2013, 143, 1249–1272. [Google Scholar] [CrossRef]
Sźekely, G.J.; Rizzo, M.L. The Energy of Data. Annu. Rev. Stat. Its Appl. 2017, 4, 447–479. [Google Scholar] [CrossRef]
Azzalini, A.; Genton, M.G. Robust likelihood methods based on the skew-t and related distributions. Int. Stat. Rev. 2008, 76, 106–129. [Google Scholar] [CrossRef]
Azzalini, A.; Arellano-Valle, R.B. Maximum penalized likelihood estimation for skew-normal and skew-t distributions. J. Stat. Plan. Inference 2013, 143, 419–433. [Google Scholar] [CrossRef]
Jones, M.C. A family of distributions on the real line with four parameters. In Statistical Models and Methods for Financial Markets; Bali, T.G., Lim, E., Eds.; Springer: New York, NY, USA, 2006; pp. 75–93. [Google Scholar]
Cook, R.D.; Weisberg, S. Bayesian Density Estimation Using Skew Student-t-Normal Mixtures: An Introduction to Regression Graphics; John Wiley and Sons: New York, NY, USA, 1994. [Google Scholar]
Macrotrends. Apple—Stock Price History. 2025. Available online: https://www.macrotrends.net/stocks/charts/AAPL/apple/stock-price-history (accessed on 10 March 2025).

Figure 1. Azzalini’s standard Skew-t density curves when the skewness parameter, (a) fixed

α = 1

, and the degrees of freedom,

ν

, varies over

{1, 5, 10}

and (b) fixed degrees of freedom,

ν = 10

, varies the skewness parameter

α

over

{- 5, 0, 5}

.

Figure 1. Azzalini’s standard Skew-t density curves when the skewness parameter, (a) fixed

α = 1

, and the degrees of freedom,

ν

, varies over

{1, 5, 10}

and (b) fixed degrees of freedom,

ν = 10

, varies the skewness parameter

α

over

{- 5, 0, 5}

.

Figure 2. Critical values for various parameter values for sample size

n = 50

at levels of significance

L O S = 0.01, 0.025, 0.05, 0.10

.

Figure 2. Critical values for various parameter values for sample size

n = 50

at levels of significance

L O S = 0.01, 0.025, 0.05, 0.10

.

Figure 3. Histogram, density curves, and CDFs of the body mass index (BMI) of 102 male Australian athletes. (a) Histogram of BMI with density estimates of Skew-t, Skew-normal, Skew-Cauchy, and normal. (b) Empirical and theoretical Skew-t CDFs.

Figure 4. Histogram, density curves, and CDFs of Apple’s daily returns with 248 observations. (a) Histogram of Apple’s daily returns with density curves of the fitted Skew-t, Skew-normal, Skew-Cauchy, and normal distributions. (b) Empirical and theoretical Skew-t CDFs.

Figure 5. Histogram, density curves, and CDFs of the Dow Jones daily returns with 249 observations. (a) Histogram of Dow Jones daily returns with density estimates of Skew-t, Skew-normal, Skew-Cauchy, and normal. (b) Empirical and theoretical Skew-t CDFs.

Table 1. Simulated critical values for energy statistic of Skew-t distribution.

(α, ν)
n	(1, 5)	(0, 5)	(−1, 5)	(1, 10)	(0, 10)	(−1, 10)	(1, 30)	(0, 30)	(−1, 30)
50	3.7856	3.8663	4.0781	2.0441	1.9596	1.9143	1.3624	1.3332	1.3421
100	4.5742	4.6635	4.6401	2.3416	2.3314	2.3443	1.6584	1.5815	1.6448
150	5.3913	5.6910	5.3485	2.8291	2.8642	2.8279	2.0239	2.0291	2.0270
200	6.3893	6.6986	6.3946	3.3414	3.4878	3.3710	2.4003	2.4172	2.3981

Table 2. Simulated size of the test for Skew-t distribution.

(α, ν)
n	(1, 5)	(0, 5)	(−1, 5)	(1, 10)	(0, 10)	(−1, 10)	(1, 30)	(0, 30)	(−1, 30)
50	0.0508	0.0532	0.0513	0.0476	0.0508	0.0513	0.0510	0.0484	0.0540
100	0.0484	0.0476	0.0492	0.0511	0.0487	0.0503	0.0501	0.0558	0.0488
150	0.0496	0.0506	0.0524	0.0513	0.0509	0.0491	0.0478	0.0492	0.0508
200	0.0503	0.0494	0.0506	0.0505	0.0498	0.0501	0.0497	0.0496	0.0499

Table 3. Power comparison with

n = 50, 100, 150, 200

.

Table 3. Power comparison with

n = 50, 100, 150, 200

.

Distribution	Sample Size n	Energy	K-S	Kuiper	CvM	Watson	A-D
$χ^{2} (2)$	50	0.8369	0.2248	0.2134	0.2809	0.2460	0.3878
	100	0.8933	0.2924	0.2832	0.3850	0.3298	0.5061
	150	0.9559	0.4018	0.3638	0.5103	0.4288	0.6223
	200	0.9715	0.4973	0.4418	0.6184	0.5069	0.7270
$t_{10}$	50	0.1066	0.0342	0.0396	0.0332	0.0382	0.0372
	100	0.1736	0.0356	0.0418	0.0392	0.0380	0.0384
	150	0.2268	0.0360	0.0398	0.0334	0.0382	0.0378
	200	0.2790	0.0387	0.0410	0.0413	0.0393	0.0397
$Exp (1)$	50	0.8413	0.2640	0.2491	0.3069	0.2692	0.4092
	100	0.9152	0.5090	0.4924	0.5538	0.5140	0.6128
	150	0.9655	0.6496	0.6224	0.7012	0.6486	0.7702
	200	0.9848	0.7439	0.7169	0.8012	0.7421	0.8601
$Shash (3, 1)$	50	0.5734	0.1145	0.1292	0.1345	0.1302	0.1545
	100	0.6045	0.2131	0.2154	0.2546	0.2328	0.261
	150	0.6401	0.3032	0.3072	0.3844	0.3498	0.3958
	200	0.6749	0.4166	0.4196	0.5184	0.4742	0.5261
$G T (3, 1)$	50	0.9596	0.1128	0.1166	0.1306	0.1188	0.3350
	100	0.9916	0.1562	0.1860	0.2004	0.1966	0.4480
	150	1.0000	0.2306	0.2756	0.3228	0.3232	0.5816
	200	1.0000	0.3030	0.3798	0.4278	0.4292	0.6720
$K w C W G$ (0.1, 3, 1, 3, 2)	50	0.0930	0.0750	0.0875	0.0760	0.0770	0.0810
	100	0.1080	0.0905	0.0885	0.0775	0.0795	0.0855
	150	0.1145	0.1015	0.1170	0.0845	0.0850	0.0950
	200	0.1290	0.1080	0.1160	0.0895	0.0915	0.1045
$L N (0.5, 1)$	50	0.9701	0.3331	0.1950	0.3988	0.2304	0.4611
	100	0.9931	0.33691	0.2092	0.3974	0.2441	0.4644
	150	0.9995	0.3449	0.2179	0.4102	0.2586	0.4775
	200	0.9998	0.3639	0.2409	0.4429	0.2818	0.5155

Table 4. MLE, LogL, AIC, and SIC values for BMI dataset.

Parameter/Model	Skew-t	Normal	Skew-Cauchy	Skew-Normal
$μ$	21.6490	23.9036	22.7889	20.8765
$σ$	2.6570	2.7403	1.3762	4.0610
$α$	1.6421	-	0.5126	3.2992
$ν$	4.5503	-	-	-
LogL	−236.0511	−248.0613	−247.5519	−237.9670
AIC	480.1022	500.1227	501.1037	481.9341
SIC	490.6020	505.3726	508.9787	489.8090
LRT p-value	-	≪0.0001	≪0.0001	0.0502

Table 5. MLE, LogL, AIC, and SIC values for Apple stock daily return dataset.

Parameter/Model	Skew-t	Normal	Skew-Cauchy	Skew-Normal
$μ$	0.5254	0.2749	0.1798	1.3519
$σ$	1.1514	1.4259	0.7655	1.7858
$α$	−0.2285	-	0.1191	−1.1688
$ν$	5.3556	-	-	-
LogL	−431.3161	−440.3975	−461.9316	−438.1367
AIC	870.6321	884.795	929.8633	882.2733
SIC	884.6858	891.8219	940.4036	892.8136
LRT p-value	-	0.0001	≪0.0001	0.0002

Table 6. MLE, LogL, AIC, and SIC values for Dow Jones daily return data.

Parameter/Model	Skew-t	Normal	Skew-Cauchy	Skew-Normal
$μ$	0.0011	0.0006	−0.0002	−0.0063
$σ$	0.0063	0.0107	0.0046	0.0127
$α$	−0.0913	-	0.1305	0.9420
$ν$	3.0501	-	-	-
LogL	822.8359	776.2823	801.3888	778.9004
AIC	−1637.6720	−1548.5650	−1596.778	−1551.8010
SIC	−1623.6020	−1541.5300	−1586.225	−1541.2480
LRT p-value	-	≪0.0001	≪0.0001	≪0.0001

Table 7. Summary of test statistics and corresponding p-values of BMI data.

Test	Energy	K-S	Kuiper	CvM	Watson	A-D
Statistic value	1.7109	0.0454	0.8731	0.0295	0.0290	0.2321
p-value	0.5840	0.9847	0.9847	0.9827	0.9840	0.9807

Table 8. Summary of test statistics and p-values of Apple daily return data.

Test	Energy	K-S	Kuiper	CvM	Watson	A-D
Statistic value	3.9635	0.0357	1.0576	0.0291	0.0290	0.1737
p-value	0.5473	0.8260	0.8167	0.9381	0.9380	0.9913

Table 9. Summary of test statistics and corresponding p-values of Dow Jones daily return data.

Test	Energy	K-S	Kuiper	CvM	Watson	A-D
Statistic value	9.7168	0.0340	0.9806	0.0404	0.0405	0.2878
p-value	0.5012	0.8150	0.9130	0.8945	0.8875	0.8680

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Njuki, J.; Hasan, A.M. A New Goodness-of-Fit Test for Azzalini’s Skew-t Distribution Based on the Energy Distance Framework with Applications. Mathematics 2025, 13, 3833. https://doi.org/10.3390/math13233833

AMA Style

Njuki J, Hasan AM. A New Goodness-of-Fit Test for Azzalini’s Skew-t Distribution Based on the Energy Distance Framework with Applications. Mathematics. 2025; 13(23):3833. https://doi.org/10.3390/math13233833

Chicago/Turabian Style

Njuki, Joseph, and Abeer M. Hasan. 2025. "A New Goodness-of-Fit Test for Azzalini’s Skew-t Distribution Based on the Energy Distance Framework with Applications" Mathematics 13, no. 23: 3833. https://doi.org/10.3390/math13233833

APA Style

Njuki, J., & Hasan, A. M. (2025). A New Goodness-of-Fit Test for Azzalini’s Skew-t Distribution Based on the Energy Distance Framework with Applications. Mathematics, 13(23), 3833. https://doi.org/10.3390/math13233833

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A New Goodness-of-Fit Test for Azzalini’s Skew-t Distribution Based on the Energy Distance Framework with Applications

Abstract

1. Introduction

1.1. Existence and Uniqueness of the MLEs for Azzalini’s Skew-t Distribution

1.2. Motivation and Scientific Contribution

2. Proposed Energy-Based Goodness-of-Fit Test

3. Simulations and Results

3.1. The Univariate Energy Test Statistic

3.2. Critical Value Simulation Under Varying $α$ and $ν$

3.3. Type I Error Control

3.4. Power Analysis Under Various Alternatives

3.5. Superior Performance of the $n E_{n}$ Test

3.6. Effect of Sample Size and Parameters

3.7. Summary

4. Real Data Applications

4.1. Case Study 1

4.2. Case Study 2

4.3. Case Study 3

4.4. Model Fitting

4.5. Discussion

5. Conclusions

6. Limitations and Future Research

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

A New Goodness-of-Fit Test for Azzalini’s Skew-t Distribution Based on the Energy Distance Framework with Applications

Abstract

1. Introduction

1.1. Existence and Uniqueness of the MLEs for Azzalini’s Skew-t Distribution

1.2. Motivation and Scientific Contribution

2. Proposed Energy-Based Goodness-of-Fit Test

3. Simulations and Results

3.1. The Univariate Energy Test Statistic

3.2. Critical Value Simulation Under Varying α and ν

3.3. Type I Error Control

3.4. Power Analysis Under Various Alternatives

3.5. Superior Performance of the n E n Test

3.6. Effect of Sample Size and Parameters

3.7. Summary

4. Real Data Applications

4.1. Case Study 1

4.2. Case Study 2

4.3. Case Study 3

4.4. Model Fitting

4.5. Discussion

5. Conclusions

6. Limitations and Future Research

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

3.2. Critical Value Simulation Under Varying $α$ and $ν$

3.5. Superior Performance of the $n E_{n}$ Test