1. Introduction
The motivation for this study comes from an online query asking whether natural exponential families (NEFs) with unbounded support can have bounded variance functions (VFs) beyond the classical normal NEF. In response, we establish explicit sufficient conditions for the existence of such NEFs. These conditions reveal a broad class of previously undocumented non-elementary NEFs whose Laplace transforms (LTs) or VFs cannot be expressed in elementary (algebraic) form. Thanks to modern mathematical software, such as Maple and Mathematica, as well as open-source libraries in Python and R, this obstacle is no longer a practical limitation; symbolic or numerical evaluation of the relevant transforms has become routine. The normal NEF serves as a benchmark within the subclass studied in this context. Our partial solution deepens understanding of the interplay between support structure and variance behavior in NEFs, thereby addressing a gap in the current literature. Although our focus is theoretical, we expect the subclasses introduced in this paper to assist in modeling real-world data across diverse fields.
The VF is central to the theory of NEFs. Although the normal NEF is the only known example with a constant VF, it is natural to ask whether other NEFs—particularly those with unbounded support on $\mathbb{R}$—can also have bounded VFs. This issue is important both theoretically and practically, as bounded VFs often lead to more stable modeling.
This paper establishes three complementary results—Theorem 1 and Propositions 1 and 2—that provide sufficient conditions under which an NEF supported on the entire real line can still have a bounded VF. Theorem 1 imposes a polynomial-growth constraint on the cumulant generating function, ensuring sufficient tail decay of the density. Proposition 1 relies on the Legendre duality: a uniform positive lower bound on the second derivative of the generating function guarantees global convexity and, consequently, a bounded VF. Proposition 2 builds on the standard normal distribution and constructs an explicit sequence of NEFs whose VFs remain bounded; for this sequence, both the LTs and the VFs can be written in closed form. Together, these results introduce a broad new class of NEF models, greatly expanding the range of distributions available for real-data analysis and providing strong alternatives to traditional families.
This paper is structured as follows.
Section 2 provides an overview of the essential background on NEFs.
Section 3 presents Theorem 1, along with Propositions 1 and 2, and explains how these results are related.
Section 4 provides several illustrative examples, while
Section 5 offers concluding remarks.
2. Preliminaries on NEFs
The preliminaries are mainly drawn from [1,2]. We provide only the brief background necessary for the developments that follow. Let $\mu$ be a non-Dirac positive Radon measure on $\mathbb{R}$, $S$ the support of $\mu$, and $C$ the convex hull of $S$. The LT of $\mu$ is the mapping $L_\mu$ defined by
$$L_\mu(\theta) = \int_{\mathbb{R}} e^{\theta x}\,\mu(dx).$$
Let $D$ denote the effective domain of $L_\mu$, i.e., $D = \{\theta \in \mathbb{R} : L_\mu(\theta) < \infty\}$, and assume that $\Theta = \operatorname{int} D \neq \emptyset$. Let $k_\mu = \log L_\mu$ be the cumulant transform of $\mu$. Then the NEF $F$ generated by $\mu$ is defined by the family of probabilities
$$F = F(\mu) = \{P_\theta(dx) = e^{\theta x - k_\mu(\theta)}\,\mu(dx) : \theta \in \Theta\}. \tag{1}$$
The measure $\mu$ is called a generating measure of the NEF. It is not unique, as any other measure of the form $e^{ax+b}\,\mu(dx)$, with real constants $a$ and $b$, generates the same NEF (cf. [2]). The function $k_\mu$ defined on $\Theta$ is real analytic there, and its successive derivatives are the successive cumulants of $F$. In particular, $k'_\mu(\theta)$ and $k''_\mu(\theta)$ are the mean and variance of $P_\theta \in F$.
As we consider only cases where $\mu$ is absolutely continuous with respect to the Lebesgue measure on $\mathbb{R}$, we henceforth restrict our attention to generating measures of the form $\mu(dx) = f(x)\,dx$, where $f$ is the respective Radon–Nikodym derivative.
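Because everything that follows is driven by the cumulant transform of a generating density $f$ and its derivatives, a small numerical sketch may be useful. This is our own illustration, not part of the paper: NumPy is assumed, the helper name `make_cumulant` and the grid choices are hypothetical, and a log-sum-exp trick guards against overflow.

```python
# Our own sketch: numerically realize k_mu(theta) = log integral exp(theta*x) f(x) dx
# for a generating measure mu(dx) = f(x) dx, via a trapezoid rule on a fixed grid.
import numpy as np

def make_cumulant(f, lo=-30.0, hi=30.0, n=200001):
    """Return a callable k(theta) approximating the cumulant transform of f dx."""
    x = np.linspace(lo, hi, n)
    logfx = np.log(f(x) + 1e-300)   # guard against log(0)
    dx = x[1] - x[0]
    def k(theta):
        w = theta * x + logfx
        m = w.max()                  # log-sum-exp: prevents overflow in exp
        e = np.exp(w - m)
        return m + np.log((e.sum() - 0.5 * (e[0] + e[-1])) * dx)  # trapezoid rule
    return k

# Example: f = standard normal density, for which k(theta) = theta^2 / 2 exactly.
f = lambda x: np.exp(-x**2 / 2) / np.sqrt(2 * np.pi)
k = make_cumulant(f)
print(round(k(1.3), 6))  # 1.3**2/2 = 0.845
```

For the standard normal generating density the printed value agrees with $\theta^2/2$ at $\theta = 1.3$, which is a quick consistency check on the quadrature.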
The following are some notions and properties related to NEFs that will be used in the sequel.
Regular NEFs. If $D$ is an open interval, $F$ is called regular. In particular, if $D = \mathbb{R}$ or $D = \Theta$, $F$ is regular.
Mean value parameterization: The cumulant transform $k_\mu$ is strictly convex and real analytic on $\Theta$, and $m = k'_\mu(\theta)$ is the mean function of $F$. The open interval $M = k'_\mu(\Theta)$ is called the mean domain of $F$. Since the map $\theta \mapsto k'_\mu(\theta)$ is one-to-one, its inverse function $\psi_\mu = (k'_\mu)^{-1} : M \to \Theta$ is well defined. Hence, the map $m \mapsto P_{\psi_\mu(m)}$ is one-to-one from $M$ onto $F$ and is called the mean value parameterization of $F$.
Variance Function (VF). The variance corresponding to the NEF (1) is $V_F(m) = k''_\mu(\psi_\mu(m))$, $m \in M$. The map $m \mapsto V_F(m)$ from $M$ into $(0,\infty)$ is called the variance function (VF) of $F$. In fact, a VF of an NEF $F$ is a pair $(V_F, M)$. It uniquely determines an NEF within the class of NEFs. For instance, the VF $(\sigma^2, \mathbb{R})$ with fixed $\sigma^2 > 0$ characterizes the normal NEF within the class of NEFs.
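The mean value parameterization and the VF can be realized numerically by inverting $m = k'(\theta)$ with a root finder. The sketch below is our own illustration (SciPy assumed, helper names hypothetical); for the normal generating density it recovers the constant VF.

```python
# Our own sketch: V_F(m) = k''(psi(m)) with psi the numerical inverse of k'.
import numpy as np
from scipy.optimize import brentq

def nef_vf(f, lo=-30.0, hi=30.0, n=400001, h=1e-4):
    x = np.linspace(lo, hi, n)
    logf = np.log(f(x) + 1e-300)
    def k(t):
        w = t * x + logf
        m = w.max()
        e = np.exp(w - m)
        return m + np.log((e.sum() - 0.5 * (e[0] + e[-1])) * (x[1] - x[0]))
    kp = lambda t: (k(t + h) - k(t - h)) / (2 * h)            # k'  (central difference)
    kpp = lambda t: (k(t + h) - 2 * k(t) + k(t - h)) / h**2   # k'' (second difference)
    psi = lambda m: brentq(lambda t: kp(t) - m, -20.0, 20.0)  # invert the mean map
    return lambda m: kpp(psi(m))                              # V_F(m)

V = nef_vf(lambda x: np.exp(-x**2 / 2))   # normal: V_F(m) = 1 for every m
print([round(V(m), 3) for m in (-2.0, 0.0, 2.0)])  # [1.0, 1.0, 1.0]
```

Note that the generating density need not be normalized: multiplying $f$ by a constant shifts $k$ additively and leaves $k''$, hence the VF, unchanged.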
Steep NEFs. An NEF $F$ is called steep iff $M = \operatorname{int} C$. Also, $F$ is steep if its effective domain $D$ is an open interval (see [3], Theorem 8.2); in particular, $F$ is steep if $D = \mathbb{R}$ or $D = \Theta$. Steepness ensures, with probability one, the existence of the MLE of $m$ as the stationary point of the first derivative of the likelihood function.
3. Sufficient Conditions on the Boundedness of VFs
Finding necessary and sufficient conditions for this problem is extremely challenging. Even identifying necessary conditions that closely approach sufficiency is highly non-trivial. Undoubtedly, achieving such results would constitute a significant theoretical advance, but we leave this task for future research.
Nevertheless, sufficient conditions can be established, and this paper provides them in Theorem 1 and Propositions 1 and 2. Theorem 1 imposes polynomial tail-growth bounds on the function $g$ defining the generating density $f = e^{-g}$, ensuring rapid tail decay and a uniformly bounded VF. Proposition 1 is based on Legendre duality and requires a uniform positive lower bound on the second derivative of the generating function $g$, which likewise ensures a bounded VF. Proposition 2 begins with the standard normal distribution and constructs an explicit sequence of NEFs whose VFs remain bounded and whose LTs and VFs admit closed forms.
Most examples stemming from Theorem 1 or Proposition 1 lack closed-form expressions for their LTs or VFs, but modern mathematical software can handle them efficiently in practice. By contrast, every example associated with Proposition 2 admits explicit LTs and VFs.
Collectively, these results uncover an exceptionally broad class of NEFs. To our knowledge, none of the NEFs produced by these three results have appeared previously in the literature, thereby providing a rich collection of practical models that rival the normal and other classical distributions for real-data analysis. In this paper, we do not address applications of these families to real data; this will be pursued in future research.
Theorem 1. Let $g : \mathbb{R} \to \mathbb{R}$ be a continuous function satisfying the following tail growth conditions: $g(x) \ge c_1 x^{p}$ for all sufficiently large $x > 0$ and $g(x) \ge c_2 |x|^{q}$ for all sufficiently large $-x > 0$, for some constants $c_1, c_2 > 0$ and $p, q > 1$. Define $f = e^{-g}$ and let $F$ be the NEF generated by $f$. Then,
1. $\int_{\mathbb{R}} f(x)\,dx < \infty$, implying that the NEF generated by $f$ is supported on $\mathbb{R}$.
2. The LT of $f$ is finite for all $\theta \in \mathbb{R}$, i.e., the respective canonical parameter space is $\Theta = \mathbb{R}$. Thus, $F$ is steep and $M = \mathbb{R}$.
3. For $p, q > 2$, the VF of $F$ is uniformly bounded by 1. (For $p, q < 2$, the VF is not uniformly bounded, as it grows without limit as $\theta$ or $m$ tends to $\infty$ or $-\infty$.)
Proof. 1. Let $x_0 > 0$ be sufficiently large. Split the integral $\int_{\mathbb{R}} f(x)\,dx$ into three intervals of integration: $(-\infty, -x_0)$, $[-x_0, x_0]$, and $(x_0, \infty)$. Since $f$ is continuous on $[-x_0, x_0]$, then by the extreme value theorem, the central integral is bounded. On $(x_0, \infty)$, $f(x) = e^{-g(x)} \le e^{-c_1 x^{p}}$, which is integrable for $p > 1$. The left tail is similar, with $e^{-c_2|x|^{q}}$. Thus, $\int_{\mathbb{R}} f(x)\,dx < \infty$. Hence, the NEF generated by $f$ is supported on $\mathbb{R}$.
2. For this part, we use Young's inequality ([4], p. 27), which states the following: Let $r, s > 1$ with $1/r + 1/s = 1$. Then, for all real $u, v$,
$$|uv| \le \frac{|u|^{r}}{r} + \frac{|v|^{s}}{s}.$$
For this part, we split the implementation of Young's inequality into the two parts of the tails:
(a) On the right tail ($x \ge x_0$, $\theta$ fixed) we take $u = \theta/\lambda$ and $v = \lambda x$, with $s = p$ and $r$ the conjugate exponent of $p$. Set $\lambda = (pc_1/2)^{1/p}$. Then $|\lambda x|^{p}/p = (c_1/2)x^{p}$ and
$$\theta x \le \frac{|\theta/\lambda|^{r}}{r} + \frac{c_1}{2}x^{p}.$$
(b) On the left tail ($x \le -x_0$, $\theta$ fixed) we take $u = \theta/\lambda'$, $v = \lambda' x$ and $\lambda' = (qc_2/2)^{1/q}$, giving the analogous bound with $c_2$ and $q$.
Fix $\theta > 0$ and use the bounds derived from Young's inequality to obtain
$$\theta x - g(x) \le \frac{|\theta/\lambda|^{r}}{r} - \frac{c_1}{2}x^{p}, \qquad x \ge x_0.$$
Exponentiating and integrating yield
$$\int_{x_0}^{\infty} e^{\theta x - g(x)}\,dx \le e^{|\theta/\lambda|^{r}/r}\int_{x_0}^{\infty} e^{-(c_1/2)x^{p}}\,dx.$$
The integral converges because $c_1 > 0$ and $p > 1$. The left tail for $\theta > 0$ already decays exponentially; the compact part $[-x_0, x_0]$ is finite. For $\theta < 0$, a symmetric bound with $q$ gives convergence. Hence, $L(\theta) < \infty$ for all $\theta \in \mathbb{R}$. Thus, $F$ is regular and therefore steep with $M = \mathbb{R}$.
3. For $p, q > 2$, we shall find a quadratic bound on the cumulant transform $k = \log L$. For this, we first show that
$$k(\theta) \le \frac{\theta^2}{2} + c_0 \quad \text{for some constant } c_0. \tag{3}$$
Let $x_0 > 0$ and split
$$L(\theta) = \left(\int_{-\infty}^{-x_0} + \int_{-x_0}^{x_0} + \int_{x_0}^{\infty}\right) e^{\theta x - g(x)}\,dx. \tag{4}$$
Consider the central interval $[-x_0, x_0]$ and use the elementary inequality $uv \le (u^2 + v^2)/2$ with $u = \theta$ and $v = x$; then $\theta x \le \theta^2/2 + x^2/2$. Insert this into the central integral in (4); then it is bounded by $A_0 e^{\theta^2/2}$ with $A_0 = \int_{-x_0}^{x_0} e^{x^2/2 - g(x)}\,dx$. For the right and left tails, there exist constants $A$ and $B$ such that $g(x) \ge x^2/2 + x + A$ for $x \ge x_0$ and $g(x) \ge x^2/2 + |x| + B$ for $x \le -x_0$; this is where $p, q > 2$ is used. Similarly, for $x \ge x_0$ and $x \le -x_0$, we get $\theta x - g(x) \le \theta^2/2 - x - A$ and $\theta x - g(x) \le \theta^2/2 - |x| - B$, respectively. Thus, for the right tail, we have
$$\int_{x_0}^{\infty} e^{\theta x - g(x)}\,dx \le C_1 e^{\theta^2/2},$$
where $C_1 = e^{-A}\int_{x_0}^{\infty} e^{-x}\,dx$. The left tail is similar with a constant $C_2$. Let $c_0 = \log(A_0 + C_1 + C_2)$; then (3) holds and thus $k(\theta) \le \theta^2/2 + c_0$, where $c_0$ does not depend on $\theta$.
We now show that the VF is bounded. Consider the second finite difference
$$\Delta_h^2 k(\theta) = k(\theta + h) - 2k(\theta) + k(\theta - h), \qquad h > 0.$$
The strict convexity of $k$ yields a lower bound: $\Delta_h^2 k(\theta) > 0$. An upper bound of $k(\theta \pm h)$ by (3) gives
$$\Delta_h^2 k(\theta) \le h^2 + \theta^2 + 2c_0 - 2k(\theta).$$
Hence, $\Delta_h^2 k(\theta)/h^2 \le 1 + (\theta^2 + 2c_0 - 2k(\theta))/h^2$. Because $k$ is real analytic, $k''$ exists everywhere and $k''(\theta) \le 1$ for all $\theta$. This implies that $V_F(m) = k''(\psi(m)) \le 1$ for all $m \in M$. □
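As a numerical sanity check of Theorem 1 (our own sketch, not the paper's code; NumPy assumed, grid and step sizes ours), take $g(x) = x^4$, whose tail exponents $p = q = 4$ exceed 2, and confirm that the numerically computed variance $k''(\theta)$ stays below 1 over a wide range of $\theta$.

```python
# Our own check: k''(theta) for the NEF generated by f = exp(-x^4) stays below 1.
import numpy as np

def kpp(theta, g, lo=-15.0, hi=15.0, n=600001, h=1e-3):
    x = np.linspace(lo, hi, n)
    dx = x[1] - x[0]
    def k(t):
        w = t * x - g(x)
        m = w.max()                 # log-sum-exp against overflow
        e = np.exp(w - m)
        return m + np.log((e.sum() - 0.5 * (e[0] + e[-1])) * dx)
    return (k(theta + h) - 2 * k(theta) + k(theta - h)) / h**2

g = lambda x: x**4
vmax = max(kpp(t, g) for t in np.linspace(-50, 50, 21))
print(vmax < 1.0)   # the variance stays below 1 on this grid
```

The largest value occurs near $\theta = 0$, where the variance of the $e^{-x^4}$ density is well below 1; tilting only sharpens the density further.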
Remark 1. By Theorem 1 and all of the propositions below, $f$ is not necessarily a probability density function. To make it such, we need to divide it by $c = \int_{\mathbb{R}} f(x)\,dx$, where $c$ is the normalizing constant, which may depend on the various parameters involved. In most of the following examples, these normalizing constants can be computed numerically or expressed in terms of certain transcendental functions.
In the following proposition, the tail condition required by Theorem 1 is satisfied. However, we prefer to present it separately as Proposition 1, since it assumes that $g''$ is bounded below by some constant $\alpha > 0$, which provides a stronger and more self-contained control over the variance function. The proof, based on Legendre duality, is both elegant and innovative, and we believe that its presentation is valuable in its own right. Moreover, the condition on $g''$ is relatively easy to verify in concrete examples and facilitates the construction of NEFs that satisfy it. This property of $g''$ will be illustrated in Examples 3–5 below.
The Legendre duality presents the cumulant transform $k$ and the convex generating function $g$ as Legendre conjugates (see [5], Chapter 26; [3]). Concretely,
$$k(\theta) = \log \int_{\mathbb{R}} e^{\theta x - g(x)}\,dx \quad \text{and} \quad g^*(\theta) = \sup_{x \in \mathbb{R}}\{\theta x - g(x)\},$$
and, for each $\theta$, the maximizer $x_\theta$ of the second expression satisfies $g'(x_\theta) = \theta$ and $(g^*)'(\theta) = x_\theta$.
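The Legendre machinery is easy to probe numerically. The sketch below is our own illustration (SciPy assumed): for $g = \cosh$, the conjugate has the closed form $g^*(\theta) = \theta\,\operatorname{arcsinh}(\theta) - \sqrt{1+\theta^2}$, so both the maximizer condition $g'(x_\theta) = \theta$ and the value of $g^*$ can be checked.

```python
# Our own check of Legendre duality: g*(t) = sup_x { t*x - g(x) } for g = cosh.
import numpy as np
from scipy.optimize import minimize_scalar

def conjugate(g, t):
    """Return (g*(t), maximizer x_t), found by minimizing g(x) - t*x."""
    res = minimize_scalar(lambda x: g(x) - t * x)
    return t * res.x - g(res.x), res.x

t = 2.0
gstar, x_t = conjugate(np.cosh, t)
print(round(np.sinh(x_t), 4))   # maximizer condition g'(x_t) = t  ->  2.0
closed_form = t * np.arcsinh(t) - np.sqrt(1 + t**2)
print(round(abs(gstar - closed_form), 6))   # 0.0
```

Here $g'' = \cosh \ge 1$, so this $g$ also satisfies the hypothesis of the proposition below with $\alpha = 1$.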
Proposition 1. Let $g : \mathbb{R} \to \mathbb{R}$ be a twice continuously differentiable and convex function satisfying the following condition: There exists a constant $\alpha > 0$ such that $g''(x) \ge \alpha$ for all $x \in \mathbb{R}$. Define $f = e^{-g}$ and $F$ the NEF generated by $f$. Then,
1. $\int_{\mathbb{R}} f(x)\,dx < \infty$, i.e., the support of $F$ is $\mathbb{R}$.
2. The LT of $f$ is finite for all $\theta \in \mathbb{R}$, and thus $F$ is regular and steep with $\Theta = \mathbb{R}$.
3. $V_F(m) \le 1/\alpha$ for all $m \in M$, where $x_\theta$ is the unique minimizer of $g(x) - \theta x$.
4. $g$ satisfies Theorem 1.
Proof. 1. Since $g''(x) \ge \alpha$, we can integrate this inequality: for $x \ge 0$, $g'(x) \ge g'(0) + \alpha x$, and for $x \le 0$, $g'(x) \le g'(0) + \alpha x$. Hence, $g'(x) \to \infty$ as $x \to \infty$, and $g'(x) \to -\infty$ as $x \to -\infty$. So for large $|x|$, $g(x) \ge \alpha x^2/2 - C$ for some constant $C$. Therefore, $f(x) = e^{-g(x)} \le e^{C} e^{-\alpha x^2/2}$, and this is integrable over $\mathbb{R}$. Thus, $\int_{\mathbb{R}} f(x)\,dx < \infty$.
2. For showing $\Theta = \mathbb{R}$, we analyze
$$L(\theta) = \int_{\mathbb{R}} e^{\theta x - g(x)}\,dx.$$
From above, $g(x) \ge \alpha x^2/2 - C$, so
$$e^{\theta x - g(x)} \le e^{C} e^{\theta x - \alpha x^2/2}.$$
This is integrable over $\mathbb{R}$ for all $\theta$ since the exponent is quadratic in $x$, and the quadratic term dominates. Therefore, $L(\theta) < \infty$ for all $\theta \in \mathbb{R}$, and the canonical parameter space is $\Theta = \mathbb{R}$.
3. We prove this part by using the Legendre duality. Let $h_\theta(x) = g(x) - \theta x$. The unique minimizer $x_\theta$ of $h_\theta$ satisfies $g'(x_\theta) = \theta$, due to the strict convexity of $g$. Since $g$ is convex and differentiable, the Legendre transform of $g$ is
$$g^*(\theta) = \theta x_\theta - g(x_\theta).$$
So, $(g^*)'(\theta) = x_\theta$. Hence, $k$ and $g^*$ are linked as above, and the variance behavior of $F$ is governed by $(g^*)''$. Differentiate both sides of $g'(x_\theta) = \theta$ using the chain rule:
$$g''(x_\theta)\,\frac{dx_\theta}{d\theta} = 1, \quad \text{so} \quad (g^*)''(\theta) = \frac{dx_\theta}{d\theta} = \frac{1}{g''(x_\theta)} \le \frac{1}{\alpha}.$$
4. We show that there exist constants $c > 0$ and $x_1$ such that $g(x) \ge cx^2$ for all $|x| \ge x_1$. For this, fix any point $x_0 \in \mathbb{R}$. We can integrate the inequality $g'' \ge \alpha$. By integrating from $x_0$ to $x > x_0$, we get
$$g'(x) \ge g'(x_0) + \alpha(x - x_0).$$
Now, integrating $g'$ from $x_0$ to $x$, then
$$g(x) \ge g(x_0) + g'(x_0)(x - x_0) + \frac{\alpha}{2}(x - x_0)^2.$$
Using the lower bound for $g'$ implies that the quadratic term eventually dominates the linear one. Hence, as $x \to \infty$, this inequality shows $g(x) \ge (\alpha/4)x^2$ for some threshold onward (since $g(x_0)$, $g'(x_0)$, and $\alpha$ are fixed). So, for sufficiently large $x$, $g(x) \ge cx^2$ for some $c > 0$. A symmetric argument applies for $x \to -\infty$, using integration from $x$ to $x_0$. Therefore, $g$ satisfies the tail lower bounds required by Theorem 1, with $p = q = 2$. □
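To see Proposition 1 in action numerically, consider $g(x) = x^2 + \cos x$, for which $g''(x) = 2 - \cos x \ge \alpha = 1$; the proposition then predicts $V_F \le 1$. The code is our own sketch (NumPy assumed; grid, step sizes, and the test function are ours, not the paper's).

```python
# Our own check: for g(x) = x^2 + cos(x), g'' >= 1, so the VF should not exceed 1.
import numpy as np

def kpp(theta, g, lo=-20.0, hi=20.0, n=400001, h=1e-3):
    x = np.linspace(lo, hi, n)
    dx = x[1] - x[0]
    def k(t):
        w = t * x - g(x)
        m = w.max()
        e = np.exp(w - m)
        return m + np.log((e.sum() - 0.5 * (e[0] + e[-1])) * dx)
    return (k(theta + h) - 2 * k(theta) + k(theta - h)) / h**2

g = lambda x: x**2 + np.cos(x)
vals = [kpp(t, g) for t in np.linspace(-10, 10, 11)]
print(max(vals) < 1.0)   # variance bounded by 1/alpha = 1
```

Note that $g'' \le 3$ here as well, so the variance also stays above a positive floor; only the upper bound is asserted by the proposition.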
The following proposition is based on the standard normal distribution and generates many examples that, in some cases, satisfy Theorem 1 while violating Proposition 1. The LTs and VFs for these examples can be expressed explicitly, and the corresponding NEFs may have broad practical applicability.
Proposition 2. Let
$$f_n(x) = x^{2n}\varphi(x), \qquad n \in \mathbb{N}, \tag{6}$$
where $\varphi$ is the standard normal density, and let $F_n$ be the NEF generated by $f_n$. Denote by $L_n$ and $V_n$, respectively, the LT and VF corresponding to $F_n$. Then,
1. The support and the canonical parameter space of $F_n$ is $\mathbb{R}$.
2. The LT of $f_n$ has the form $L_n(\theta) = P_n(\theta)\,e^{\theta^2/2}$, where
$$P_n(\theta) = \sum_{k=0}^{n}\binom{2n}{2k}(2k-1)!!\,\theta^{2(n-k)}. \tag{7}$$
Here, $(2k-1)!!$ is the double factorial, and $P_n$ is an even polynomial of degree $2n$ whose coefficients are all strictly positive.
3. Fix $n$; then the VF satisfies
$$V_n(m) \le 2n + 1, \qquad m \in \mathbb{R}, \tag{8}$$
with equality only at $m = 0$.
4. $f_n$ does satisfy the premises of Theorem 1 but not of Proposition 1.
5. The LT and VF of $f_n$ in (6) are given, respectively, by $L_n(\theta) = P_n(\theta)e^{\theta^2/2}$ and $V_n = 1 + (\log P_n)''$, where $P_n$ is given by (7). The special cases $n = 1, 2, 3$ are given by (9), (10), and (11), respectively.
Proof. 1. This part is trivial.
2. Clearly,
$$L_n(\theta) = \int_{\mathbb{R}} x^{2n} e^{\theta x}\varphi(x)\,dx = e^{\theta^2/2}\,\mathbb{E}\big[(Z + \theta)^{2n}\big],$$
where $Z \sim N(0,1)$. To get the explicit form of $P_n$, write
$$(Z + \theta)^{2n} = \sum_{j=0}^{2n}\binom{2n}{j} Z^{j}\theta^{2n-j}.$$
Taking expectations, only even powers contribute due to the symmetry of $Z$, and thus, $\mathbb{E}[Z^{2k}] = (2k-1)!!$, so that
$$P_n(\theta) = \mathbb{E}\big[(Z+\theta)^{2n}\big] = \sum_{k=0}^{n}\binom{2n}{2k}(2k-1)!!\,\theta^{2(n-k)}$$
is an even polynomial of degree $2n$ with strictly positive coefficients. Note, however, that $P_n(\theta) = (-1)^{n}He_{2n}(i\theta)$, where $He_{2n}$ denotes the Hermite polynomial.
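The double-factorial moment formula and the polynomial $P_n$ can be checked directly; the sketch below is our own (NumPy assumed, helper names hypothetical), comparing $P_n(\theta)e^{\theta^2/2}$ against a numerical evaluation of the LT of $x^{2n}\varphi(x)$.

```python
# Our own check: L_n(theta) = P_n(theta) * exp(theta^2/2) with
# P_n(theta) = sum_k C(2n,2k) (2k-1)!! theta^(2(n-k)).
import numpy as np
from math import comb

def dfact(j):                      # odd double factorial, with (-1)!! = 1
    return 1 if j <= 0 else j * dfact(j - 2)

def P(n, theta):
    return sum(comb(2 * n, 2 * k) * dfact(2 * k - 1) * theta ** (2 * (n - k))
               for k in range(n + 1))

def L_num(n, theta, lo=-30.0, hi=30.0, npts=400001):
    x = np.linspace(lo, hi, npts)
    dx = x[1] - x[0]
    w = x ** (2 * n) * np.exp(theta * x - x**2 / 2) / np.sqrt(2 * np.pi)
    return (w.sum() - 0.5 * (w[0] + w[-1])) * dx   # trapezoid rule

n, theta = 2, 1.5
ratio = L_num(n, theta) / (np.exp(theta**2 / 2) * P(n, theta))
print(round(ratio, 6))   # 1.0
```

For $n = 2$, $\theta = 1.5$, the closed form gives $P_2(1.5) = 1.5^4 + 6\cdot 1.5^2 + 3 = 21.5625$, matching the quadrature.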
3. To prove (8), note that $V_n(m) = k_n''(\theta)$ with $m = k_n'(\theta)$. The following steps carry out the proof of this part: (a) computing $k_n''$ at 0; (b) defining the tilted measure; (c) showing that $k_n''(\theta) < k_n''(0)$ for $\theta \neq 0$; (d) showing the symmetry of $k_n''$, where the relevant quantity is defined in (12); (e) justifying the log-convexity of the even-moment sequence in $k$; (f) showing the monotonicity of $k_n''$.
(a) Evaluation at $\theta = 0$: Compute $k_n''(0) = 1 + (\log P_n)''(0)$. So, $k_n''(0) = 2n + 1$.
(d) Since $f_n$ is symmetric, it follows that $k_n''(-\theta) = k_n''(\theta)$. So, it suffices to prove that $k_n''(\theta) < k_n''(0)$ for $\theta > 0$.
(e) Let $a_k$ denote the corresponding even-moment sequence and define $b_k = \log a_k$; then the required inequality reduces to a statement about $(a_k)$. We now analyze the sequence $(a_k)$ to show that, for each fixed $\theta$, the desired monotonicity holds. To this end, we establish the following: strict log-convexity of the sequence $(a_k)$; convexity of $b_k$ in $k$; monotonicity of successive ratios, followed by the application of (13). Recall that a sequence $(a_k)$ of positive numbers is log-convex if
$$a_k^2 \le a_{k-1}a_{k+1}, \qquad k \ge 1. \tag{14}$$
For the normal moments, this strict inequality follows from Turán's inequality for Hermite polynomials (cf. [6,7,8]); hence, the sequence $(a_k)$ is strictly log-convex. Setting $b_k = \log a_k$ and taking logarithms in (14) yields
$$b_{k+1} - 2b_k + b_{k-1} > 0.$$
That is, the discrete second difference of the function $k \mapsto b_k$ is positive. In discrete calculus, this is precisely the definition of strict convexity. Moreover, for any positive sequence, strict log-convexity is equivalent to the property that the ratios $a_{k+1}/a_k$ are strictly increasing in $k$. Applying this to $(a_k)$ yields (13), for every fixed $\theta$.
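The strict log-convexity of the even normal moments $a_k = \mathbb{E}[Z^{2k}] = (2k-1)!!$ used above is easy to verify directly (our own check):

```python
# Our own check: a_k = (2k-1)!! is strictly log-convex, equivalently the
# ratios a_(k+1)/a_k = 2k+1 are strictly increasing.
def dfact(j):
    return 1 if j <= 0 else j * dfact(j - 2)

a = [dfact(2 * k - 1) for k in range(12)]        # 1, 1, 3, 15, 105, ...
logconvex = all(a[k] ** 2 < a[k - 1] * a[k + 1] for k in range(1, 11))
ratios = [a[k + 1] / a[k] for k in range(11)]    # 1, 3, 5, 7, ... increasing
print(logconvex, ratios[:4])
```

The ratio $a_{k+1}/a_k = 2k+1$ is exactly the odd-number sequence, so both formulations of log-convexity are visible at once.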
(f) Differentiate $k_n''$ to obtain $k_n'''$. Hence, from the final inequality in (e), it follows that $k_n'''(\theta) < 0$ for $\theta > 0$; that is, $k_n''$ is strictly decreasing on $(0, \infty)$. All the preceding steps demonstrate that $k_n''$ attains its maximum value $2n + 1$ at $\theta = 0$ and that (8) holds.
For part 4, write $f_n(x) = e^{-g_n(x)}$ with $g_n(x) = x^2/2 - 2n\log|x| + \log\sqrt{2\pi}$. For the tails, we use the inequality $\log x \le x$ and then note that $2n\log x \le x^2/4$ once $x \ge 8n$. Hence, $g_n(x) \ge x^2/4$ for all sufficiently large $|x|$. Thus, the quadratic tail condition of Theorem 1 (with $p = q = 2$) is indeed satisfied. However, $f_n$ violates Proposition 1, as $g_n(x) \to \infty$ when $x \to 0$. Since $g_n''$ is not defined everywhere, we cannot even begin to verify a uniform bound for all $x$.
5. A straightforward calculation yields these results. □
To illustrate the functional relationship between the variance and the canonical parameter $\theta$, as well as the upper bound of the variance function, Figure 1 displays $V_n$ for $n = 1, 2$, and 3, corresponding to Equations (9), (10), and (11), respectively.
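Consistent with the bound in (8), a finite-difference evaluation of $k_n''(\theta) = 1 + (\log P_n)''(\theta)$ at $\theta = 0$ returns $2n + 1$. The code below is our own sketch, not the paper's.

```python
# Our own check: for k_n(theta) = theta^2/2 + log P_n(theta), the second
# derivative at 0 equals 2n + 1, the maximum of the VF.
import numpy as np
from math import comb

def dfact(j):
    return 1 if j <= 0 else j * dfact(j - 2)

def kpp(n, theta, h=1e-4):
    k = lambda t: t**2 / 2 + np.log(sum(
        comb(2 * n, 2 * j) * dfact(2 * j - 1) * t ** (2 * (n - j))
        for j in range(n + 1)))
    return (k(theta + h) - 2 * k(theta) + k(theta - h)) / h**2

vals = [round(kpp(n, 0.0), 4) for n in (1, 2, 3)]
print(vals)   # [3.0, 5.0, 7.0]
```

For $n = 1$, $P_1(\theta) = \theta^2 + 1$, so $k_1''(\theta) = 1 + 2(1-\theta^2)/(1+\theta^2)^2$, which equals 3 at the origin and tends to 1 as $|\theta| \to \infty$.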
4. Examples
None of the following examples has appeared in the literature in NEF form, likely because their Laplace transforms lack closed-form expressions (except for those related to Proposition 2). As noted in the Introduction, modern mathematical software can easily handle such cases numerically.
Collectively, these examples provide a rich class of NEFs with unbounded support and bounded variance functions, offering valuable alternatives to the normal NEF and other standard distributions in statistical modeling. We present only a small selection from a much larger set, omitting additional cases for brevity.
Example 1. The generalized normal distributions (GNDs). Let $p \ge 1$ and
$$f(x) = e^{-|x|^{p}/p}, \tag{15}$$
where the appropriate normalizing constant making $f$ a density is $c_p = 2p^{1/p}\Gamma(1 + 1/p)$. The class of generalized normal distributions (GNDs) has a long history, beginning with Russian researchers (see [9], and the references therein). A GND with $p < 2$ provides a robust, flexible alternative to the normal distribution, making it suitable for data that exhibit sharp peaks and moderate outliers. It is especially useful in signal processing, robust regression, and sparsity-based inference. The density given in (15) is symmetric and unimodal; for $p < 2$ it has heavier tails and a sharper peak than the normal distribution, features that make it leptokurtic and well suited to settings where moderate outliers or pronounced central tendencies are expected. GNDs with $p < 2$ appear as error distributions in robust estimation ([10]), Bayesian regression ([11]), signal and audio processing ([12]), and independent component analysis and sparse coding ([13]). These examples satisfy Theorem 1 but not Proposition 1—except when $p = 2$, which corresponds to the normal model. Specifically, Theorem 1 requires $g(x) \ge c_1 x^{p}$ for large $x > 0$, and $g(x) \ge c_2|x|^{q}$ for large $-x > 0$, with constants $c_1, c_2 > 0$ and exponents $p, q > 1$. This condition yields a bounded variance for the associated NEF. For this case,
$$g(x) = \frac{|x|^{p}}{p},$$
which meets the first two conditions of Theorem 1 with $c_1 = c_2 = 1/p$ and equal exponents. For $p < 2$, Part 3 of Theorem 1 only hints that the VF is not uniformly bounded; indeed, it diverges as the canonical parameter $\theta$, or $m$, tends to $\infty$ or $-\infty$. Uniform boundedness of $V$ requires $p \ge 2$, so Proposition 1 applies only to the Gaussian case $p = 2$. Specifically,
$$g''(x) = (p-1)|x|^{p-2}.$$
For $p > 2$, this second derivative vanishes at the origin, whereas for $p < 2$, it diverges there. Consequently, Proposition 1 fails whenever $p \neq 2$. (Details are omitted for brevity.)
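The dichotomy described in this example can be illustrated numerically (our own sketch; NumPy assumed, grids and $\theta$ values ours): for the GND generator $g(x) = |x|^p/p$, the computed variance $k''(\theta)$ stays bounded when $p = 4$ but grows with $\theta$ when $p = 1.5$.

```python
# Our own check of the GND dichotomy: variance k''(theta) for g(x) = |x|^p / p.
import numpy as np

def kpp(theta, p, lo=-200.0, hi=200.0, n=800001, h=1e-3):
    x = np.linspace(lo, hi, n)
    dx = x[1] - x[0]
    def k(t):
        w = t * x - np.abs(x) ** p / p
        m = w.max()
        e = np.exp(w - m)
        return m + np.log((e.sum() - 0.5 * (e[0] + e[-1])) * dx)
    return (k(theta + h) - 2 * k(theta) + k(theta - h)) / h**2

thetas = (0.0, 3.0, 8.0)
v4 = [kpp(t, 4.0) for t in thetas]    # p = 4: stays below 1
v15 = [kpp(t, 1.5) for t in thetas]   # p = 1.5: grows with theta
print(max(v4) < 1.0, v15[0] < v15[1] < v15[2])
```

For $p = 1.5$ the tilted mode sits near $\theta^2$ and the local curvature flattens there, so the variance grows without bound; for $p = 4$ tilting only moves the mass into steeper regions of $g$.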
Example 2. The extended generalized normal distributions (EGNDs). Let
$$g(x) = \frac{x^{p}}{p} \ \text{ for } x \ge 0, \qquad g(x) = \frac{|x|^{q}}{q} \ \text{ for } x < 0.$$
We refer to the NEF generated by $f = e^{-g}$ as the EGND NEF, as we intend to explore it further and apply it to real data sets. As in Example 1, all the assumptions of Theorem 1 are satisfied. Proposition 1 holds only when either $p$ or $q$ equals 2.

Example 3. Let $g$ be as given. This $g$ satisfies both Theorem 1 and Proposition 1. Theorem 1 requires $g(x) \ge c_1 x^{p}$ as $x \to \infty$ and $g(x) \ge c_2|x|^{q}$ as $x \to -\infty$, for some $c_1, c_2 > 0$ and $p, q > 1$. Here, $g''$ is bounded for all $x$. As $|x| \to \infty$, the quadratic term dominates, so for sufficiently large $|x|$, $g(x) \ge cx^2$ with $c > 0$. Hence, both Theorem 1 and Proposition 1 are satisfied. The choice of parameters is essential, as the function loses strict convexity otherwise.

Example 4. Let $g$ be as given. First, we show that $f = e^{-g}$ is integrable over $\mathbb{R}$; that is, $\int_{\mathbb{R}} e^{-g(x)}\,dx < \infty$. For the left tail, a change of variables yields convergence; a similar change of variables handles the right tail. Hence, $f$ is integrable. Next, $g$ is strictly convex with a single minimum, so $g''$ is bounded below by a positive constant. Consequently, both Theorem 1 and Proposition 1 hold.
Example 5. Differentiating twice gives $g''$, where the constants are as defined above. Since $g''$ is bounded below by a positive constant, it follows that Proposition 1 is satisfied with that bound as $\alpha$. Theorem 1 also holds: for large positive $x$ the required lower bound holds, and likewise for large negative $x$. Hence, the required growth conditions are met, and since $g'' \ge \alpha$ everywhere, Proposition 1 is confirmed.

5. Concluding Remarks
NEFs supported on the entire real line and possessing bounded VFs are uncommon; historically, the normal family has been the primary example. This paper broadens that landscape by presenting three complementary sets of sufficient conditions that ensure unbounded support and bounded VFs.
Tail-growth control (Theorem 1). If the generating function $g$ dominates two even polynomials of degree greater than 2 in the tails, then $e^{-g}$ is integrable, its Laplace transform is finite on $\mathbb{R}$, and the corresponding VF is bounded by 1.
Uniform strong convexity (Proposition 1). Whenever $g''(x) \ge \alpha > 0$ for all $x \in \mathbb{R}$, Legendre duality yields $V_F(m) \le 1/\alpha$ for every mean $m$.
A closed-form Gaussian sequence (Proposition 2). Weighting the standard normal density by even powers of $x$ produces a family of NEFs whose VFs satisfy $V_n(m) \le 2n + 1$ and whose Laplace transforms remain elementary.
A set of twelve fully worked examples demonstrates how each route succeeds—or fails—in practice, highlighting the logical independence of the two analytic criteria. Although most examples lack closed-form Laplace transforms, modern symbolic and numerical software makes them readily applicable. These results can generate an essentially unlimited number of NEFs of this type, each satisfying at least one of the three statements in Theorem 1 or Propositions 1 and 2.
For an empirical study, it is often advantageous to embed a given
g in a richer parametric family. Simple rescaling
or decompositions
preserve the hypotheses of Theorem 1 and Propositions 1 and 2, while endowing the resulting NEFs with additional shape control (see Example 2 for an illustration). These extra degrees of freedom, in conjunction with the canonical parameter, make the families more adaptable to real data and will be further explored in forthcoming applied studies.
The purpose of presenting so many examples is to underscore the virtually endless supply of exponential families supported on the real line that can be constructed—none of which were previously known. Moreover, each of these families can be extended to a multi-parameter family, as shown in Equations (
17) and (
18).
Looking ahead, two applied research projects are currently underway with several collaborators, aiming to apply the theoretical results of this paper to modeling real data in the various fields indicated below.
1. Model competition. The proposed NEFs, all defined on $\mathbb{R}$ with bounded VFs, offer natural competitors to the normal, Laplace, logistic, hyperbolic, and hyperbolic secant distributions—as well as other classical distributions—in contexts where tail behavior or regularization is essential.
2. Statistical methodology. Investigating maximum likelihood and Bayesian inference for these families—particularly those lacking closed-form transforms—promises new tools for robust regression, signal processing, and econometrics.
3. Focus. In particular, we will focus on the NEFs generated by the generalized normal distributions (Example 1) and the extended generalized normal distributions (Example 2). We will explore their probabilistic properties and apply them to real data arising from signal and audio processing, independent component analysis, and sparse coding.