Analytical Bounds for Mixture Models in Cauchy–Stieltjes Kernel Families

Alsharari, Fahad; Fakhfakh, Raouf; Alshahrani, Fatimah

doi:10.3390/math13030381

Open AccessArticle

Analytical Bounds for Mixture Models in Cauchy–Stieltjes Kernel Families

by

Fahad Alsharari

¹

,

Raouf Fakhfakh

^1,*

and

Fatimah Alshahrani

²

¹

Department of Mathematics, College of Science, Jouf University, P.O. Box 2014, Sakaka 72311, Saudi Arabia

²

Department of Mathematical Sciences, College of Science, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia

^*

Author to whom correspondence should be addressed.

Mathematics 2025, 13(3), 381; https://doi.org/10.3390/math13030381

Submission received: 15 December 2024 / Revised: 20 January 2025 / Accepted: 22 January 2025 / Published: 24 January 2025

(This article belongs to the Section D1: Probability and Statistics)

Download Versions Notes

Abstract

Mixture models are widely used in mathematical statistics and theoretical probability. However, the mixture probability distribution is rarely explicit in its formula. One must then decide whether to keep the parent probability distribution or to obtain an approximation of the mixture probability distribution. In such cases, it is essential to estimate or evaluate the distance between a mixture probability distribution and its parent probability distribution. On the other hand, orthogonal polynomials offer a versatile mathematical tool for approximating, fitting, and analyzing mixture models, facilitating more accurate and efficient modeling in statistics and data science. This article considers mixture models in Cauchy–Stieltjes Kernel (CSK) families. Using a suitable basis of polynomials, we obtain an expression for the distance in the

L^{2}

-norm between the mixed probability distribution and its parent probability distribution which belongs to a given CSK family. For the distance between the corresponding distribution functions, bounds are derived in

L^{1}

-norm. The results are illustrated by some examples from quadratic CSK families.

Keywords:

variance function; orthogonal polynomials; mixture models

MSC:

60E10; 46L54

1. Introduction

In statistical modeling and data analysis, mixture models play a pivotal role in capturing complex data structures where the underlying population is assumed to be heterogeneous. However, the mixture probability distribution rarely has an explicit formula. Then, we must choose either to keep a parent probability distribution (i.e., the underlying distribution from which the components or sub-distributions of the mixture are drawn) or to obtain an approximation of the mixing probability distribution. In such cases, it is very important to approximate or evaluate the distance between a mixture probability distribution and its parent probability distribution. Therefore, the literature focuses on establishing bounds concerning the distance between a mixture probability distribution and its parent probability distribution. In this context, bounds were evaluated for different distances: for uniform distance in [1], for

L^{1}

-norm in [2] and for difference between distribution functions in [3,4]. However, orthogonal polynomials offer a versatile mathematical tool for approximating, fitting, and analyzing mixture models, facilitating more accurate and efficient modeling in statistics and data science. They help in simplifying computations in mixture models. By using the property of orthogonality, the polynomial terms can be efficiently computed, reducing the complexity of estimating parameters in the mixture model.

On the other hand, the study of mixture models in a family of probability measures is significant because it enables the flexible modeling of complicated data derived from numerous underlying distributions. Many real-world circumstances create data from a variety of multiple sources or latent groups rather than a single process or distribution. Mixture models capture this heterogeneity by merging many probability distributions, each capturing a distinct portion of the data. In this paper, based on the concept of orthogonal polynomials, we are interested in providing analytical bounds for mixture models in Cauchy–Stieltjes Kernel (CSK) families. For a better presentation of the purpose of this article, we need first to introduce some basic concepts about CSK families and their associated orthogonal polynomials.

The setting of CSK families in free probability is recently introduced. It concerns families of probabilities defined similarly to natural exponential families by exploring the Cauchy–Stieltjes kernel

{(1 - ζ θ)}^{- 1}

replacing the exponential kernel exp

(ζ θ)

. Denote by

P_{c}

the set of (non-degenerate) compactly supported probabilities on the real line. Let

ρ \in P_{c}

, then

M_{ρ} (θ) = \int \frac{1}{1 - θ ζ} ρ (d ζ)

(1)

is defined ∀

θ \in (θ_{-}^{ρ}, θ_{+}^{ρ})

with

1 / θ_{+}^{ρ} = \max {\sup supp (ρ), 0}

and

1 / θ_{-}^{ρ} = \min {\inf supp (ρ), 0}

.

The family of probabilities

K (ρ) = {P_{(θ, ρ)} (d ζ) = \frac{1}{M_{ρ} (θ) (1 - θ ζ)} ρ (d ζ) : θ \in (θ_{-}^{ρ}, θ_{+}^{ρ})}

is called the CSK family induced by

ρ

.

Following [5], the mean function

θ \mapsto K_{ρ} (θ)

is strictly increasing on

(θ_{-}^{ρ}, θ_{+}^{ρ})

. The image of

(θ_{-}^{ρ}, θ_{+}^{ρ})

by

K_{ρ} (\cdot)

is the mean domain of

K (ρ)

and is denoted as

(m_{-}^{ρ}, m_{+}^{ρ})

. Denoting by

ψ_{ρ}

the inverse function of

K_{ρ} (\cdot)

. Writing for

m \in (m_{-}^{ρ}, m_{+}^{ρ})

,

Q_{(m, ρ)} (d ζ) = P_{(ψ_{ρ} (m), ρ)} (d ζ)

, we obtain the mean re-parametrization of

K (ρ)

as

K (ρ) = {Q_{(m, ρ)} (d ζ) : m \in (m_{-}^{ρ}, m_{+}^{ρ})} .

Define

B = B (ρ) = \max {\sup supp (ρ), 0} = 1 / θ_{+}^{ρ} \in [0, \infty),

(2)

and

A = A (ρ) = \min {\inf supp (ρ), 0} = 1 / θ_{-}^{ρ} \in (- \infty, 0] .

(3)

It is shown in [6] that

m_{-}^{ρ} = A - \lim_{z \to A^{-}} \frac{1}{G_{ρ} (z)} and m_{+}^{ρ} = B - \lim_{z \to B^{+}} \frac{1}{G_{ρ} (z)},

where

G_{ρ} (z) = \int \frac{1}{z - ζ} ρ (d ζ), z \in C ∖ supp (ρ)

(4)

is the Cauchy transform of

ρ

.

The map

m \mapsto V_{ρ} (m) = \int {(ζ - m)}^{2} Q_{(m, ρ)} (d ζ)

(5)

is called the variance function (VF) of

K (ρ)

, see [5]. An interesting fact is that the governing measure

ρ

is characterized by

V_{ρ} (\cdot)

and the first moment of

ρ

(denoted

m_{1}^{ρ}

): If we set

ϖ = ϖ (m) = m + \frac{V_{ρ} (m)}{m - m_{1}^{ρ}},

(6)

then the Cauchy transform satisfies

G_{ρ} (ϖ) = \frac{m - m_{1}^{ρ}}{V_{ρ} (m)} .

(7)

In addition,

Q_{(m, ρ)} (d ζ) = h_{ρ} (ζ, m) ρ (d ζ)

with

h_{ρ} (ζ, m) : = \frac{V_{ρ} (m)}{V_{ρ} (m) + (m - m_{1}^{ρ}) (m - ζ)} .

(8)

Now we come to the concept of polynomials associated with CSK families. Bryc [5] characterized the class of quadratic CSK families such that the VF is a polynomial in the mean m of degree

\leq 2

. This class consists of the free Meixner laws. Different findings involving orthogonal polynomials have been proved for the quadratic class of CSK families. Some results are stated in [7] for the sequence of polynomials associated with a CSK family and new versions are provided of the Feinsilver and Meixener characterizations based on the orthogonality of polynomials. These versions encompass the quadratic class of CSK families. For completeness, we recall the CSK -version of Feinsilver characteristic property, see ([7], Theorem 3.2).

Theorem 1.

Let

K (ρ) = {Q_{(m, ρ)} (d ζ) : m \in (m_{-}^{ρ}, m_{+}^{ρ})}

be the CSK family induced by

ρ \in P_{c}

with mean

m_{1}^{ρ} = 0

. Assume that

V_{ρ} (\cdot)

is analytic near 0 and

V_{ρ} (0) > 0

. Define the polynomials

P_{n} (.)

,

n = 0, 1, 2, \dots,

as

{P_{n} (ζ) = \frac{1}{n!} \frac{\partial^{n}}{\partial m^{n}} h_{ρ} (ζ, m)|}_{m = 0} .

(9)

Then, the following assertions are equivalent.

(i): Polynomials ${(P_{n})}_{n = 0, 1, 2, \dots,}$ are ρ-orthogonal.
(ii): $K (ρ)$ is a quadratic CSK family.
(iii): $a_{0} > 0$ , $a_{1} \in R$ , $a_{2} > - 1$ exists so that

$x P_{n} (ζ) = (1 + a_{2}) P_{n - 1} (ζ) + a_{1} P_{n} (ζ) + a_{0} P_{n + 1} (ζ) .$

In addition,

V_{ρ} (m) = a_{0} + a_{1} m + a_{2} m^{2} .

Now, we present the purpose of this article in more detail. We study mixtures of laws from the perspective of compactly supported CSK families. We provide the distance of a mixing law from its parent law in a CSK family. Mixing laws of the form

M_{(μ, σ)} (ζ) = \int h_{μ} (ζ, m) σ (d m)

are considered, where

σ

is a given probability measure and

h_{μ} (\cdot, m)

represent a parent probability measure with mean m, that belongs to a CSK family governed by a (non-degenerate) compactly supported probability measure

μ

. The objective is to find bounds for the distance between

M_{(μ, σ)} (\cdot)

and

h_{μ} (\cdot, m)

, for some m fixed in the mean domain of the corresponding CSK family. We investigate the polynomial expansion of the probability

h_{μ} (ζ, m)

and we deduce expansions of the mixing density

M_{(μ, σ)} (\cdot)

. For the quadratic CSK family, the difference between

M_{(μ, σ)} (\cdot)

and a parent density function

h_{μ} (\cdot, m)

is provided by means of orthogonal polynomials in

L^{1}

and

L^{2}

-norm. We also give bounds for the distance between the mixing distribution function and its parent distribution function.

2. Main Results

Consider

K (μ) = {Q_{(m, μ)} (d ζ) : m \in (m_{-}^{μ}, m_{+}^{μ})}

the CSK family induced by

μ \in P_{c}

, with

m_{1}^{μ} = 0

. According to [7],

r > 0

exists so that ∀

m \in (- r, r)

h_{μ} (ζ, m) = \sum_{n = 0}^{+ \infty} m^{n} P_{n} (ζ),

(10)

where

P_{n} (.)

,

n = 0, 1, 2, \dots,

are polynomials introduced as (9).

Throughout the paper, for some

μ \in P_{c}

with

m_{1}^{μ} = 0

, a mixture of the form

M_{(μ, σ)} (ζ) = \int h_{μ} (ζ, m) σ (d m)

(11)

is considered, where

σ

is a real probability measure.

E_{σ} (\cdot)

will denote the expectation with respect to

σ

. It is obligatory that all moments of

σ

exist finitely: that is, for all integers p,

E_{σ} (m^{p}) < \infty

. Let us first discuss a significant outcome of (10).

Lemma 1.

Let

M_{(μ, σ)} (\cdot)

be a mixture density defined by (11). Suppose that

s u p p (σ) \subset (- r, r)

. If the series

\sum_{n = 0}^{+ \infty} E_{σ} {(| m |}^{n}) | P_{n} (ζ) |

converge then we have

M_{(μ, σ)} (ζ) = \sum_{n = 0}^{+ \infty} E_{σ} (m^{n}) P_{n} (ζ) .

(12)

Proof.

Combining (11) with (10) we obtain

M_{(μ, σ)} (ζ) = \int h_{μ} (ζ, m) σ (d m) = \int (\sum_{n = 0}^{+ \infty} m^{n} P_{n} (ζ)) σ (d m) = \sum_{n = 0}^{+ \infty} E_{σ} (m^{n}) P_{n} (ζ) .

□

For the permutation between series and integrals, see ([8] Proposition 3.1).

Consequently, we obtain the following expansion of the difference between the parent and the mixture density.

Proposition 1.

Let

M_{(μ, σ)} (\cdot)

be a mixture density defined by (11) and let

P_{n} (\cdot)

,

n = 0, 1, 2, \dots,

defined by (9). If

s u p p (σ) \subset (- r, r)

, then ∀

t \in (- r, r)

, we have

M_{(μ, σ)} (ζ) - h_{μ} (ζ, t) = \sum_{n = 1}^{+ \infty} {E_{σ} (m^{n}) - t^{n}} P_{n} (ζ) .

(13)

Proof.

Combining (10) and (12), we obtain

M_{(μ, σ)} (ζ) - h_{μ} (ζ, t) = \sum_{n = 0}^{+ \infty} E_{σ} (m^{n}) P_{n} (ζ) - \sum_{n = 0}^{+ \infty} t^{n} P_{n} (ζ) = \sum_{n = 1}^{+ \infty} {E_{σ} (m^{n}) - t^{n}} P_{n} (ζ) .

□

Remark 1.

If we take

t = 0

, then we have

h_{μ} (ζ, 0) = 1

and Proposition 1 gives the following:

M_{(μ, σ)} (ζ) - 1 = \sum_{n = 1}^{+ \infty} E_{σ} (m^{n}) P_{n} (ζ) .

Furthermore, for the choice

t = 0 = E_{σ} (m)

, we obtain

M_{(μ, σ)} (x) - 1 = V a r_{σ} (m) P_{2} (ζ) + \sum_{n = 3}^{+ \infty} E_{σ} (m^{n}) P_{n} (ζ) .

where

V a r_{σ} (m) = E_{σ} (m^{2}) - {(E_{σ} (m))}^{2}

denotes the variance of σ.

Denote by

H_{(μ, σ)} (\cdot)

and

F_{μ} (\cdot, m)

the distribution functions associated to

M_{(μ, σ)} (\cdot)

and

h_{μ} (\cdot, m)

, respectively, that is,

H_{(μ, σ)} (x) = \int_{- \infty}^{x} M_{(μ, σ)} (ζ) μ (d ζ) and F_{μ} (x, m) = \int_{- \infty}^{x} h_{μ} (ζ, m) μ (d ζ) .

We first provide a general outcome for all distribution functions in a CSK family.

Proposition 2.

Let

M_{(μ, σ)} (\cdot)

be a mixture density defined by (11) and let

{(P_{n} (\cdot))}_{n \in N}

be defined by (9). If

s u p p (σ) \subset (- r, r)

, then ∀

t \in (- r, r)

, we have

| H_{(μ, σ)} (x) - F_{μ} (x, t) | \leq \sum_{n = 1}^{+ \infty} |E_{σ} (m^{n}) - t^{n}| \int_{- \infty}^{x} |P_{n} (ζ)| μ (d ζ) .

(14)

Proof.

From Proposition 1, one sees that

\begin{matrix} | H_{(μ, σ)} (x) - F_{μ} (x, t) | & = & |\int_{- \infty}^{x} (M_{(μ, σ)} (ζ) - h_{μ} (ζ, t)) ν (d ζ)| \\ = & |\int_{- \infty}^{x} \sum_{n = 1}^{+ \infty} {E_{σ} (m^{n}) - t^{n}} P_{n} (ζ) μ (d ζ)| \\ \leq & \sum_{n = 1}^{+ \infty} |E_{σ} (m^{n}) - t^{n}| \int_{- \infty}^{x} |P_{n} (ζ)| μ (d ζ) . \end{matrix}

□

We now provide some results related to quadratic CSK families.

Theorem 2.

Assume that

K (μ) = {Q_{(m, μ)} (d x) : m \in (m_{-}^{μ}, m_{+}^{μ})}

is a quadratic CSK family. Under the hypothesis of Proposition 1, if

\sum_{n = 0}^{+ \infty} | E_{σ} (m^{n}) | ∥ P_{n} (\cdot) ∥

(15)

converges, then we have

∥ M_{(μ, σ)} (\cdot) - h_{μ} {(\cdot, t) ∥}^{2} = \sum_{n = 1}^{+ \infty} {(E_{σ} (m^{n}) - t^{n})}^{2} {∥ P_{n} (\cdot) ∥}^{2} .

(16)

Moreover, if

t = 0,

we obtain

∥ M_{(μ, σ)} {(\cdot) - 1 ∥}^{2} = \sum_{n = 1}^{+ \infty} {(E_{σ} (m^{n}))}^{2} {∥ P_{n} (\cdot) ∥}^{2} .

(17)

Proof.

From Proposition 1, we have that

∥ M_{(μ, σ)} (\cdot) - h_{μ} {(\cdot, t) ∥}^{2} = \int {(\sum_{n = 1}^{+ \infty} {E_{σ} (m^{n}) - t^{n}} P_{n} (ζ))}^{2} μ (d ζ) .

(18)

The existence of this series is guaranteed by (15). Relation (18) is

∥ M_{(μ, σ)} (\cdot) - h_{μ} {(\cdot, t) ∥}^{2} = \int (\sum_{n = 1}^{+ \infty} {E_{σ} (m^{n}) - t^{n}} P_{n} (ζ)) (\sum_{k = 1}^{+ \infty} {E_{σ} (m^{k}) - t^{k}} P_{k} (ζ)) μ (d ζ) .

(19)

Since we deal with quadratic CSK families, recall Theorem 1, the polynomials

P_{n} (ζ)

,

n = 0, 1, 2, \dots

, are

μ

-orthogonal. Then, Equation (19) reduces to (16). □

Proposition 3.

Assume that

K (μ) = {Q_{(m, μ)} (d ζ) : m \in (m_{-}^{μ}, m_{+}^{μ})}

is a quadratic CSK family. Under the hypothesis of Proposition 1, we have

\int | M_{(μ, σ)} (ζ) - h_{μ} (ζ, t) | μ (d ζ) \leq \sum_{n = 1}^{+ \infty} | E_{σ} (m^{n}) - t^{n} | \int | P_{n} (ζ) | μ (d ζ) .

Moreover, if

t = 0,

we obtain

\int | M_{(μ, σ)} (ζ) - 1 | μ (d ζ) \leq \sum_{n = 1}^{+ \infty} | E_{σ} (m^{n}) | \int | P_{n} (ζ) | μ (d ζ) .

Proof.

\begin{matrix} \int | M_{(μ, σ)} (ζ) - h_{μ} (ζ, t) | μ (d ζ) & = & \int |\sum_{n = 1}^{+ \infty} (E_{σ} (m^{n}) - t^{n}) P_{n} (ζ)| μ (d ζ) \\ \leq & \sum_{n = 1}^{+ \infty} | E_{σ} (m^{n}) - t^{n} | \int | P_{n} (ζ) | μ (d ζ) . \end{matrix}

□

In prior studies, the distance between mixture and parent laws may have been explored qualitatively or under specific conditions, but not always with concrete bounds. The new contribution here is the establishment of quantitative bounds that allow for a more precise understanding of how far a mixture law can be from its parent law. Traditionally, the distance between a mixture distribution and its parent law might be analyzed using moment-based methods [9,10] or using distances like total variation [11,12] or Kullback–Leibler divergence [13,14]. However, using orthogonal polynomials introduces a new layer of precision by representing both the mixture and parent distributions in terms of their polynomial expansions. This allows for a more detailed study of how the mixture deviates from the parent distribution across different orders of moments. Orthogonal polynomials can help sharpen the bounds on the distance between the mixture law and the parent law. In many cases, they offer a more refined approach compared to traditional methods, allowing for exact or tighter bounds in the analysis of distances. This results in stronger mathematical guarantees for approximating or bounding the behavior of mixture distributions, especially when the parent distribution belongs to a CSK family.

3. Examples

In this section, some illustrations of the previous results are given for semicircle and free Poisson mixtures. In free probability, the semicircle law represents the free analog of the Gaussian law in classical probability. It is a result of random matrix theory that describes the distribution of eigenvalues of certain types of random matrices. The free Poisson law provides a distribution similar to the classical Poisson distribution, but in the context of free random variables. It describes the asymptotic behavior of singular values of large rectangular random matrices and it is important for understanding complex interactions in systems like quantum mechanics and random matrices.

We recall from [15] a technical result which is useful for the following examples:

Lemma 2.

(i): For $n = 0, 1, 2, \dots,$ the Tchebychev polynomials of the first kind satisfy

$| T_{n} (x) | \leq 1, \forall x \in [- 2, 2] .$

(20)
(ii): For $n = 0, 1, 2, \dots,$ the Tchebychev polynomials of the second kind satisfy

$| S_{n} (x) | \leq n + 1, \forall x \in [- 2, 2] .$

(21)

Example 1.

If μ is the semicircle law

m_{1}^{μ} = 0

and variance 1. The associated orthogonal polynomials

P_{n} (\cdot)

,

n = 0, 1, 2, \dots,

are derived from Tchebychev polynomials of the second kind. Then, we have

(i): $\int | M_{(μ, σ)} (ζ) - 1 | μ (d ζ) \leq \sum_{n = 1}^{+ \infty} (n + 1) | E_{σ} (m^{n}) | .$
(ii): $| H_{(μ, σ)} (x) - F_{μ} (x, 0) | \leq F_{ν} (x, 0) \sum_{n = 1}^{+ \infty} (n + 1) | E_{σ} (m^{n}) | .$

where

F_{ν} (x, 0)

is the distribution function of the standard Semicircle law.

If σ is the uniform distribution on the interval

[0, \frac{1}{2}]

, then we have

E_{σ} (m^{n}) = \frac{{(\frac{1}{2})}^{n}}{n + 1} .

In this case, we obtain

\int | M_{(μ, σ)} (ζ) - 1 | μ (d x) \leq \sum_{n = 1}^{+ \infty} {(\frac{1}{2})}^{n} = 1,

and

| H_{(μ, σ)} (x) - F_{μ} (x, 0) | \leq F_{μ} (x, 0) .

Example 2.

If μ is the free Poisson law with

m_{1}^{μ} = 0

and variance 1. The associated orthogonal polynomials

P_{n} (\cdot)

,

n = 0, 1, 2, \dots,

are derived from Tchebychev polynomials of the first kind. Then, we have

(i): $\int | M_{(μ, σ)} (ζ) - 1 | μ (d ζ) \leq \sum_{n = 1}^{+ \infty} | E_{σ} (m^{n}) | .$
(ii): $| H_{(μ, σ)} (x) - F_{μ} (x, 0) | \leq F_{μ} (x, 0) \sum_{n = 1}^{+ \infty} | E_{σ} (m^{n}) | .$

where

F_{μ} (x, 0)

is the distribution function of the free Poisson law.

If σ is the uniform distribution on the interval

[0, \frac{1}{2}]

, then we have

\int | M_{(μ, σ)} (ζ) - 1 | μ (d x) \leq \sum_{n = 1}^{+ \infty} \frac{{(\frac{1}{2})}^{n}}{n + 1} \leq \sum_{n = 1}^{+ \infty} {(\frac{1}{2})}^{n + 1} = \frac{1}{2},

and

| H_{(μ, σ)} (x) - F_{μ} (x, 0) | \leq \frac{1}{2} F_{μ} (x, 0) .

4. Conclusions

We have examined a mixture of probability distributions from a CSK family in this study. A formula is derived for the difference between the parent probability distribution from a CSK family and the mixed probability distribution using a suitable base of polynomials. We have also evaluated the distance of the mixture from the parent probability distribution in the

L^{2}

-norm. Additionally, the

L^{1}

-norm bounds are determined for the difference between distribution functions. A few instances are used to demonstrate the findings via quadratic CSK families. However, the results of this paper can be extended to cover families of probability measures having polynomial variance functions as the mean of arbitrary degree based on a new notion of generalized orthogonality of polynomials introduced in [8]. Furthermore, other alternative methods such as stochastic representation, as presented in [16], can offer a powerful approach to capture the complexity of families of probability measures. Instead of relying solely on deterministic formulations, stochastic representation allows for the incorporation of random processes and latent variables, providing a more flexible framework for modeling diverse distributions. By modeling the mixture components using stochastic processes, such as random measures or Markov chains, one can account for the underlying uncertainty, dependencies, and variability within the data. This approach can be particularly useful when dealing with complex or heterogeneous families of probability measures, providing a more robust and adaptable way to represent mixture models.

The motivation for investigating analytical bounds for mixture models in the CSK families of probability measures derives from the need to better comprehend and quantify uncertainty in real-world systems with complicated, multimodal distributions. In various domains, such as finance, signal processing, and machine learning, data are frequently derived from a combination of underlying processes or populations. Mixture models provide a versatile framework for capturing heterogeneity. By focusing on CSK families, which are linked to distributions with heavy tails or singularities, this work aims to provide sharper, more reliable bounds that can enhance the accuracy of statistical inference and prediction. Such advances can improve model robustness, optimize decision-making, and provide better uncertainty quantification in applications like risk management, anomaly detection, and complex data analysis. In summary, the present work not only aims to advance statistical theory but also aims to provide practical solutions to pressing challenges in applied domains. By harnessing the power of the Cauchy–Stieltjes kernel and orthogonal polynomials, we can improve the accuracy and reliability of statistical models, leading to better decision-making and risk management in complex, multimodal environments. This research represents an important step towards bridging the gap between advanced theoretical frameworks and their concrete applications, ultimately contributing to more robust, efficient, and interpretable models for a wide range of real-world problems.

Author Contributions

Conceptualization, F.A. (Fahad Alsharari); Methodology, R.F.; Validation, F.A. (Fahad Alsharari) and F.A. (Fatimah Alshahrani); Formal analysis, F.A. (Fatimah Alshahrani); Investigation, F.A. (Fahad Alsharari); Resources, F.A. (Fahad Alsharari); Data curation, R.F.; Writing—original draft, R.F.; Writing–review & editing, R.F.; Project administration, F.A. (Fatimah Alshahrani); Funding acquisition, F.A. (Fatimah Alshahrani). All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2025R358), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

Data Availability Statement

No new data were created or analyzed in this study.

Conflicts of Interest

The authors declare no conflict of interest.

References

Shaked, M. Bounds on the distance of a mixture from its parent distribution. J. Appl. Probab. 1981, 18, 853–863. [Google Scholar] [CrossRef]
Shimizu, R. Expansion of the scale mixture of the multivariate normal distributions with error bound evaluated in the L¹-Norm. J. Multivar. Anal. 1995, 53, 126–138. [Google Scholar] [CrossRef]
Pommeret, D. Distance of a mixture from its parent distribution. Sankhya Indian J. Stat. 2005, 67, 699–714. [Google Scholar]
Hall, P. Polynomial expansion of density and distribution functions of scale mixtures. J. Multivar. Anal. 1981, 11, 173–184. [Google Scholar] [CrossRef]
Bryc, W. Free exponential families as kernel families. Demonstr. Math. 2009, XLII, 657–672. [Google Scholar] [CrossRef]
Bryc, W.; Hassairi, A. One-sided Cauchy-Stieltjes kernel families. Journ. Theoret. Probab. 2011, 24, 577–594. [Google Scholar] [CrossRef]
Fakhfakh, R. Characterization of quadratic Cauchy-Stieltjes Kernel families based on the orthogonality of polynomials. J. Math. Anal. Appl. 2018, 459, 577–589. [Google Scholar] [CrossRef]
Bryc, W.; Fakhfakh, R.; Mlotkowski, W. Cauchy-Stieltjes families with polynomial variance funtions and generalized orthogonality. Probab. Math. Stat. 2019, 39, 237–258. [Google Scholar] [CrossRef]
Lindsay, B.; Roeder, K. Moment-based oscillation properties of mixture models. Ann. Statist. 1997, 25, 378–386. [Google Scholar] [CrossRef]
Lindsay, B.G.; Pilla, R.S.; Basak, P. Moment-Based Approximations of Distributions Using Mixtures: Theory and Applications. Ann. Inst. Stat. Math. 2000, 52, 215–230. [Google Scholar] [CrossRef]
Nielsen, F.; Sun, K. Guaranteed deterministic bounds on the total variation distance between univariate mixtures. In Proceedings of the 2018 IEEE 28th International Workshop on Machine Learning for Signal Processing (MLSP), Aalborg, Denmark, 17–20 September 2018; pp. 1–6. [Google Scholar] [CrossRef]
Davies, S.; Mazumdar, A.; Pal, S.; Rashtchian, C. Lower Bounds on the Total Variation Distance Between Mixtures of Two Gaussians. In Proceedings of the 33rd International Conference on Algorithmic Learning Theory, Paris, France, 29 March–1 April 2022; Volume 167, pp. 319–341. [Google Scholar]
Hershey, J.R.; Olsen, P.A. Approximating the Kullback Leibler Divergence Between Gaussian Mixture Models. In Proceedings of the 2007 IEEE International Conference on Acoustics, Speech and Signal Processing—ICASSP ’07, Honolulu, HI, USA, 16–20 April 2007; pp. IV-317–IV-320. [Google Scholar] [CrossRef]
Van Hulle, M.M. Mixture density modeling, Kullback–Leibler divergence, and differential log-likelihood. Signal Process. 2005, 85, 951–963. [Google Scholar] [CrossRef]
Abramowitz, M.; Stegun, I.A. Handbook of Mathematical Functions; Dover: New York, NY, USA, 1972. [Google Scholar]
Fang, K.T.; Kotz, S.; Ng, K.W. Symmetric Multivariate and Related Distributions; Chapman and Hall: London, UK; New York, NY, USA, 1990. [Google Scholar] [CrossRef]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Alsharari, F.; Fakhfakh, R.; Alshahrani, F. Analytical Bounds for Mixture Models in Cauchy–Stieltjes Kernel Families. Mathematics 2025, 13, 381. https://doi.org/10.3390/math13030381

AMA Style

Alsharari F, Fakhfakh R, Alshahrani F. Analytical Bounds for Mixture Models in Cauchy–Stieltjes Kernel Families. Mathematics. 2025; 13(3):381. https://doi.org/10.3390/math13030381

Chicago/Turabian Style

Alsharari, Fahad, Raouf Fakhfakh, and Fatimah Alshahrani. 2025. "Analytical Bounds for Mixture Models in Cauchy–Stieltjes Kernel Families" Mathematics 13, no. 3: 381. https://doi.org/10.3390/math13030381

APA Style

Alsharari, F., Fakhfakh, R., & Alshahrani, F. (2025). Analytical Bounds for Mixture Models in Cauchy–Stieltjes Kernel Families. Mathematics, 13(3), 381. https://doi.org/10.3390/math13030381

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Analytical Bounds for Mixture Models in Cauchy–Stieltjes Kernel Families

Abstract

1. Introduction

2. Main Results

3. Examples

4. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI