A Deformed Exponential Statistical Manifold

Josué Vieira, Francisca Leidmar; Félix de Andrade, Luiza Helena; Facundo Vigelis, Rui; Casimiro Cavalcante, Charles

doi:10.3390/e21050496

Open AccessArticle

A Deformed Exponential Statistical Manifold

by

Francisca Leidmar Josué Vieira

^1,*

,

Luiza Helena Félix de Andrade

²

,

Rui Facundo Vigelis

³ and

Charles Casimiro Cavalcante

⁴

¹

Departamento de Matemática, Universidade Regional do Cariri, Juazeiro do Norte-CE 63041-145, Brazil

²

Departamento de Ciências Naturais, Matemática e Estatística, Universidade Federal Rural do Semi-Árido, Mossoró-RN 59625-900, Brazil

³

Curso de Engenharia de Computação, Campus Sobral, Universidade Federal do Ceará, Sobral-CE 62042-280, Brazil

⁴

Departamento de Engenharia de Teleinformática, Universidade Federal do Ceará, Fortaleza-CE 60020-181, Brazil

^*

Author to whom correspondence should be addressed.

Entropy 2019, 21(5), 496; https://doi.org/10.3390/e21050496

Submission received: 19 April 2019 / Revised: 12 May 2019 / Accepted: 13 May 2019 / Published: 15 May 2019

(This article belongs to the Section Information Theory, Probability and Statistics)

Download Versions Notes

Abstract

Consider

μ

a probability measure and

P_{μ}

the set of

μ

-equivalent strictly positive probability densities. To endow

P_{μ}

with a structure of a

C^{\infty}

-Banach manifold we use the

φ

-connection by an open arc, where

φ

is a deformed exponential function which assumes zero until a certain point and from then on is strictly increasing. This deformed exponential function has as particular cases the q-deformed exponential and

κ

-exponential functions. Moreover, we find the tangent space of

P_{μ}

at a point p, and as a consequence the tangent bundle of

P_{μ}

. We define a divergence using the q-exponential function and we prove that this divergence is related to the q-divergence already known from the literature. We also show that q-exponential and

κ

-exponential functions can be used to generalize of Rényi divergence.

Keywords:

deformed exponential manifold; statistical manifold; φ-family; information geometry; exponential arcs

1. Introduction

Let

P_{μ}

be the set of

μ

-equivalent strictly positive probability densities, where

μ

is a given probability measure. In order to build a structure to

P_{μ}

, Amari considered the parametric case, where the construction depends on a parameter belonging to the Euclidean space [1,2]. The case of non-parametric statistical models was initially studied by Pistone and Sempi [3]. In this case,

P_{μ}

was equipped with a structure of a

C^{\infty}

-Banach manifold using the Orlicz space associated to an Orlicz function. In a later work [4], Pistone and Cena proved that the probability distribution z belongs to the maximal exponential model to the probability distribution p, if and only if, z is connected to p by an open exponential arc. Moreover, the new manifold structure obtained from the connection by an open exponential arc is equivalent to the one defined in [3,5]. Results involving conditions connecting two probability densities by an open exponential arc were recently studied in [6].

The deformed exponential function was first introduced by Naudts in [7] and studied in more details later in [8,9]. In [10], the authors propose a generalization for the exponential family

E_{p}

, based in the replacement of the exponential function exp by a deformed exponential function

φ

. It is then proposed a

φ

-family of probability distributions denoted by

F_{c}^{φ}

, with

p = φ (c)

. The described family was modeled on Musielak–Orlicz spaces and a Banach manifold structure to

P_{μ}

is obtained. As a consequence of such model, a more general form of the Kullback–Leibler divergence was obtained and called

φ

-divergence. Furthermore, the arcs for the deformed exponential function were investigated and it was provided the necessary and sufficient conditions to connect by a

φ

-arc any two probability distributions [11]. This result was generalized later by [12,13]. A generalization to exponential arcs was defined in [14] and it also proved that the probability distribution z belongs to the

φ

-family

F_{c}^{φ}

if, and only if, z is connected to p by an open

φ

-arc.

An example of deformed exponential function is the q-exponential one that it was used by Loaiza and Quiceno [15] to define an atlas modeled on essentially bounded function spaces. The charts for the given atlas are defined in terms of connections by an one-dimensional q-exponential model and of the q-deformations of cumulant maps [4]. Moreover, using equivalence class it was constructed the tangent space and the tangent bundle.

In this paper we endow

P_{μ}

with a structure of a

C^{\infty}

-Banach manifold using a deformed exponential function. This deformed exponential function has zero value until a certain point and from then on has the behaviour similar to the “classical” exponential function, which is strictly increasing. Particular cases of that function are: q-deformed exponential and

κ

-exponential. In order to build this structure, as in [15], we divide

P_{μ}

into equivalence classes using the connection provided by generalized exponential arcs as defined in [14]. Also, we define a set

A_{c}^{φ}

, that is the connected component of

P_{μ}

and will be the generalized

φ

-family of probability distributions. Moreover, by means of the derivative of the transition map, we find the tangent space and, consequently, the tangent bundle. In addition, we define a divergence using the q-exponential function which is related with the q-divergence defined in [15]. Finally, we show that the

κ

-exponential and q-exponential functions can be used in the generalization of Rényi’s divergence.

The rest of the paper is organized as follows. In Section 2 we revisit some important results about the q-exponential statistical manifold and provide a brief introduction about Musielak–Orlicz spaces. In Section 3, we have our main results. We discuss generalized open exponential arcs and build generalized

φ

-families of probability distributions. Alterwards, in Section 4, we find the derivative of the transition map and, as a consequence, the tangent space and tangent bundle. Moreover, in Section 5 we define a divergence using the q-exponential function and we use those results to prove that the q-exponential and

κ

-exponential functions can be used to generalize Rényi’s divergence. Finally, in Section 6 our conclusions and future perspectives are stated.

2. Background and Preliminary Results

The deformed exponential function that we will use to equip

P_{μ}

with a structure of a

C^{\infty}

-Banach manifold has as a particular case the q-exponential function and the parametrization domain is obtained from a Musielak–Orlicz space. For this reason, the purpose of this section is to make a brief presentation of the results involving the q-exponential manifold and the Musielak–Orlicz spaces.

2.1. A q-Exponential Statistical Banach Manifold

In the same way as in [15], we consider

(T, Σ, μ)

a probability space and

q \in (0, 1)

. The q-deformed exponential function is given by [16]

e_{q}^{x} = {(1 + (1 - q) x)}^{1 / (1 - q)}, where \frac{- 1}{1 - q} \leq x .

(1)

Definition 1.

We say that

p, z \in P_{μ}

are connected by an one-dimensional q-exponential model if there exists

r \in P_{μ}

,

u \in L^{\infty} (r . μ)

, a real function of a real variable ψ and

δ > 0

, such that for all

t \in (- δ, δ)

the function f defined by

f (t) = e_{q}^{t u ⊖_{q} ψ (t)} r,

(2)

satisfies that there are

t_{0}, t_{1} \in (- δ, δ)

, with

f (t_{0}) = p

,

f (t_{1}) = z

and

t u ⊝_{q} ψ (t) : = \frac{t u - ψ (t)}{t u + (1 - q) ψ (t)}

, for

ψ (t) \neq {(q - 1)}^{- 1}

.

Consider the following partition of

P_{μ}

into equivalence classes:

p, z \in P_{μ}

are related (

p \sim_{q} z

) if and only if there exists an one-dimensional q-exponential model connecting p and z, according to Equation (2). As a consequence, the measures

p . μ

and

z . μ

are equivalent and the essentially bounded function spaces

L^{\infty} (p . μ)

and

L^{\infty} (z . μ)

are equal.

We need to define a family of q-deformations of the moment-generating functional denoted by

M_{p}^{q}

, it means,

M_{p}^{q} : D_{M_{p}^{q}} \to [0, \infty],

M_{p}^{q} (u) = \int_{T} e_{q}^{(u)} d μ,

where

D_{M_{p}^{q}} = \{u \in L^{\infty} (p . μ); \frac{- 1}{1 - q} < u, \int_{T} e_{q}^{(u)} d μ < \infty\} .

Also, we define a family of cumulant generating functional

K_{p}^{q} : B_{p, \infty} (0, 1) \to [0, \infty]

where

K_{p}^{q} (u) = {ln}_{q} [M_{p}^{q}] .

Notice that

B_{p, \infty} (0, 1) \subset D_{M_{p}^{q}}

, where

B_{p, \infty} (0, 1)

is the open unit ball in

L^{\infty} (p . μ)

. Some properties of the functional

K_{p}^{q}

are described in the theorem below.

Theorem 1

([15], Theorem 9). The cumulant generating function

K_{p}^{q}

satisfies:

(1): The function $z = e_{q}^{u ⊖_{q} K_{p}^{q} (u)} p$ is a probability density on $P_{μ}$ , since $u \in B_{p, \infty} (0, 1)$ ;
(2): $K_{p}^{q}$ is infinitely Fréchet differentiable and its n-th derivative evaluated at the directions $(v_{1}, \dots, v_{n}) \in B_{p, \infty} (0, 1) \times \dots \times B_{p, \infty} (0, 1)$ , is of the form

$D^{n} K_{p}^{q} (u) . (v_{1} \dots v_{n}) = {[M_{p}^{q} (u)]}^{1 - q} Q_{n} (q) \int_{T} (v_{1} \dots v_{n}) d μ;$
(3): The functional $K_{p}^{q}$ is analytic in $B_{p, \infty} (0, 1)$ .

The function

K_{p}^{q}

is used to define the q-exponential models

e_{q, p} : V_{p} \to P_{μ},

where

e_{q, p} (u) = e_{q}^{(u ⊖_{q} K_{p}^{q} (u))} p .

(3)

Moreover, the set

B_{p} = \{u \in L^{\infty} (p . μ); \int_{T} u p d μ = 0\}

(4)

is a Banach space and

V_{p} = {u \in B_{p} {; | | u | |}_{p, \infty} < 1}

(5)

is the open unit ball of

B_{p}

. Since

{| | u | |}_{p, \infty} < 1,

we obtain

\frac{- 1}{1 - q} < u

. Therefore

\frac{- 1}{1 - q} < \frac{u - K_{p}^{q} (u)}{1 + (1 - q) K_{p}^{q} (u)} = u ⊖_{q} K_{p}^{q} (u)

and consequently

e_{q, p} (u) = e_{q}^{(u ⊖_{q} K_{p}^{q} (u))} p

is well defined.

The inverse of

e_{q, p}

is given by [15]

e_{q, p}^{- 1} (z) = \frac{{ln}_{q} (\frac{z}{p}) - \int_{T} {ln}_{q} (\frac{z}{p}) p d μ}{1 + (1 - q) \int_{T} {ln}_{q} (\frac{z}{p}) p d μ} .

The transition map

e_{q, p_{2}}^{- 1} \circ e_{q, p_{1}} : e_{q, p_{1}}^{- 1} (U_{p_{1}} \cap U_{p_{2}}) \to e_{q, p_{2}}^{- 1} (U_{p_{1}} \cap U_{p_{2}})

, where

U_{p}

is the range of

e_{q, p}

, is expressed as [15]

e_{q, p_{2}}^{- 1} (e_{q, p_{1}} (u)) = \frac{u + [1 + (1 - q) u] {ln}_{q} (\frac{p_{1}}{p_{2}}) - \int_{T} u + [1 + (1 - q) u] {ln}_{q} (\frac{p_{1}}{p_{2}}) p_{2} d μ}{1 + (1 - q) \int_{T} u + [1 + (1 - q) u] {ln}_{q} (\frac{p_{1}}{p_{2}}) p_{2} d μ},

where

p_{1}, p_{2} \in P_{μ}

with

U_{p_{1}} \cap U_{p_{2}} \neq \emptyset

and

u \in e_{q, p_{1}}^{- 1} (U_{p_{1}} \cap U_{p_{2}})

.

The map

e_{q, p}

is injective and the set

e_{q, p}^{- 1} (U_{p_{1}} \cap U_{p_{2}})

is open in the

B_{p_{1}}

-topology, where

p_{1}, p_{2} \in P_{μ}

. Hence, the transition map

e_{q, p_{2}}^{- 1} \circ e_{q, p_{1}}

is a topological homeomorphism and consequently the collection of pairs

{\{(U_{p}, e_{q, p}^{- 1})\}}_{p \in P_{μ}}

is a

C^{\infty}

-atlas modeled on

B_{p}

. Then,

P_{μ}

is a

C^{\infty}

-Banach manifold, since

e_{q, p}

is a parametrization.

There exists a relation between the constructed manifold and the Tsallis relative entropy. In fact, let us consider, for

t \neq 0

and

0 < q < 1

, the following function

f (t) = - t {ln}_{q} (\frac{1}{t}),

where

{ln}_{q} (x) = \frac{x^{1 - q} - 1}{1 - q}, if x > 0

. Given p and z in

P_{μ}

, the Tsallis divergence, also called q-divergence of z with relation to p, is expressed by

I^{(q)} (z | | p) = \int_{T} p f (\frac{z}{p}) d μ .

(6)

Proposition 1

([15], Proposition 16). Taking p, z in

P_{μ}

, we obtain

(1): $I^{(q)} (z | | p) \geq 0$ , with equality iff $p = z$ .
(2): $I^{(q)} (z | | p) \leq \int_{T} (z - p) f^{'} (\frac{z}{p}) d μ$ .

2.2. Musielak–Orlicz Spaces and $φ$ -Families of Probability Distributions

Consider

(T, Σ, μ)

a

σ

-finite, non-atomic measure space. Let

P_{μ} = {p \in L^{0}; p > 0 and \int_{T} p d μ = 1}

, where

L^{0}

is the linear space of all real-valued, measurable functions on T, with equality

μ

-a.e.

t \in T

. The map

Φ : T \times [0, \infty) \to [0, \infty]

is a Musielak–Orlicz function if, for

μ

-a.e. (almost everywhere)

t \in T

, the following conditions hold [17]:

(1): $Φ (t, \cdot)$ is convex and lower semi-continuous;
(2): $Φ (t, 0) = {lim}_{u ↓ 0} Φ (t, u) = 0$ and $Φ (t, \infty) = \infty$ ;
(3): $Φ (\cdot, u)$ is measurable for each $u \geq 0$ .

Since the items (1) and (2) occur, it follows that

Φ (t, .)

is not equal to 0 or ∞ in the interval

(0, \infty)

.

Consider the functional

I_{Φ} (u) = \int_{T} Φ (t, | u (t) |) d μ

, for any

u \in L^{0}

. The Musielak–Orlicz space, Musielak–Orlicz class, Morse–Transue space associated the a Musielak–Orlicz function

Φ

are defined, respectively, by

L^{Φ} = {u \in L^{0}; I_{Φ} (λ u) < \infty for each λ \in (- ε, ε), there exists ε > 0},

{\tilde{L}}^{Φ} = {u \in L^{0}; I_{Φ} (u) < \infty}

and

E^{Φ} = {u \in L^{0}; I_{Φ} (λ u) < \infty for all λ > 0} .

Consider the Luxemburg norm

{∥ u ∥}_{Φ} = \inf \{λ > 0; I_{Φ} (\frac{u}{λ}) \leq 1\},

and the Orlicz norm

{∥ u ∥}_{Φ, 0} = sup \{|\int_{T} u v d μ|; v \in {\tilde{L}}^{Φ^{*}} and I_{Φ^{*}} (v) \leq 1\},

where

Φ^{*} (t, v) = {sup}_{u \geq 0} (u v - Φ (t, u))

is the Fenchel conjugate of

Φ (t, \cdot) .

The Musielak–Orlicz space

L^{Φ}

equipped with one of these two norms is a Banach space. The norms above are equivalent and the inequalities

{∥ u ∥}_{Φ} \leq {∥ u ∥}_{Φ, 0} \leq 2 {∥ u ∥}_{Φ}

hold for all

u \in L^{Φ}

. For more details see [18,19].

Define the Musielak–Orlicz function as

Φ_{c} (t, u) = φ (t, c (t) + u) - φ (t, c (t)),

(7)

where

c : T \to R

is a measurable function such that

φ (t, c (t))

is

μ

-integrable and we write

L_{c}^{φ}

,

{\tilde{L}}_{c}^{φ}

and

E_{c}^{φ}

, in the place of

L^{Φ_{c}}

,

{\tilde{L}}^{Φ_{c}}

and

E^{Φ_{c}}

respectively. In [10] it was defined the parametrization

φ_{c} : B_{c}^{φ} \to F_{c}^{φ},

where

φ_{c} (u) = φ (c + u - ψ (u) u_{0}),

(8)

for each

u \in B_{c}^{φ} = B_{c}^{φ} \cap K_{c}^{φ}

, and

B_{c}^{φ} = \{u \in L_{c}^{φ}; \int_{T} u φ_{+}^{'} (c) d μ = 0\},

(9)

K_{c}^{φ} = \{u \in L_{c}^{φ}; \int_{T} φ (c + λ u) < \infty for each λ \in (- ε, 1 + ε), there exists ε > 0\} .

(10)

The application

ψ : B_{c}^{φ} \to [0, \infty)

is called the normalizing function and it is defined in such a way that

φ_{c} (u) = φ (c + u - ψ (u) u_{0})

is in

P_{μ}

. We have that

⋃ {F_{c}^{φ}; φ (c) \in P_{μ}} = P_{μ}

,

φ_{c_{1}}^{- 1} (F_{c_{1}}^{φ} \cap F_{c_{2}}^{φ})

and

φ_{c_{2}}^{- 1} (F_{c_{1}}^{φ} \cap F_{c_{2}}^{φ})

are open for any

c_{1}, c_{2} : T \to R

measurable such that

φ (c_{1})

and

φ (c_{2})

are in

P_{μ}

. The transition map is a

C^{\infty}

-isomorphism and consequently

φ_{c}

is a parametrization.

In the next section, we will use the generalized open exponential arcs to build a parametrization to

P_{μ}

.

3. Construction of Generalized $φ$ -Families of Probability Distributions

Let

(T, Σ, μ)

, be a

σ

-finite, non-atomic measure space and consider a deformed exponential function

φ : T \times R \to [0, \infty)

. In other words,

φ (t, \cdot)

is convex for

μ

-a.e.

t \in T

and the limits

{lim}_{u \to - \infty} φ (t, u) = 0

,

{lim}_{u \to \infty} φ (t, u) = \infty

for

μ

-a.e.

t \in T

hold. In this work we consider two additional conditions on the deformed exponential

φ

:

(a1): $φ (t, x) = 0,$ for all $x < a_{φ},$ where $a_{φ} = inf \{x \in R; φ (x) > 0\};$
(a2): given a measurable function $c : T \to R$ such that $\int_{T} φ (t, c (t)) d μ = 1$ , we have

$\int_{T} φ (t, c (t) + λ) d μ < \infty, for all λ > 0 .$

(11)

For a measurable function

q : T \to (0, 1)

, we define the q-deformed exponential function

{exp}_{q} : T \times R \to [0, \infty)

as

{exp}_{q} (t, u) = {exp}_{q (t)} (u)

, where

{exp}_{q} (u) = {[1 + (1 - q) u]}_{+}^{1 / (1 - q)},

and

{[1 + (1 - q) u]}_{+} = max {1 + (1 - q) u, 0}

. In this case, the q-deformed exponential function satisfies the condition (a1) with

a_{φ} = \frac{- 1}{1 - q}

. In the next example, we prove that the q-deformed exponential function satisfies the condition (a2) for

0 < q < 1

.

Example 1.

Given

α \geq 1

, we consider two cases:

If

u \leq 0

, we have that

α u \leq u

. Then,

\begin{array}{l} {exp}_{q} (α u) & \leq {exp}_{q} (u) \\ \leq α^{\frac{1}{1 - q}} {exp}_{q} (u) . \end{array}

If

u > 0

, we obtain

\begin{array}{l} {exp}_{q} (α u) & = {(1 + (1 - q) α u)}^{\frac{1}{1 - q}} \\ = {(α α^{- 1} + (1 - q) α u)}^{\frac{1}{1 - q}} \\ = α^{\frac{1}{1 - q}} (α^{- 1} + (1 - q) u))^{\frac{1}{1 - q}} \\ \leq α^{\frac{1}{1 - q}} {(1 + (1 - q) u))}^{\frac{1}{1 - q}} \\ = α^{\frac{1}{1 - q}} {exp}_{q} (u) . \end{array}

By the convexity property of

{exp}_{q} (t, .)

, we obtain for any

λ \in (0, 1)

that

\begin{array}{l} {exp}_{q} (c + u) & \leq λ {exp}_{q} (λ^{- 1} c) + (1 - λ) {exp}_{q} ({(1 - λ)}^{- 1} u) \\ \leq λ^{1 - 1 / (1 - q)} {exp}_{q} (c) + {(1 - λ)}^{1 - 1 / (1 - q)} {exp}_{q} (u) . \end{array}

Then, any positive function

u_{0} : T \to (0, \infty)

such that

\int_{T} {exp}_{q} (u_{0}) d μ < \infty

satisfies

\int_{T} {exp}_{q} (c + λ u_{0}) d μ < \infty

for all

λ > 0

.

Now, we provide an example of a deformed exponential function that satisfies condition (a1), but does not satisfy condition (a2).

Example 2.

Consider the function

φ (u) = \{\begin{matrix} e^{{(u + 1)}^{2} / 2}, & u \geq 0 \\ e^{1 / 2} (u + 1), & - 1 \leq u \leq 0, \\ 0, & u \leq - 1 \end{matrix}

where the measure μ is σ-finite and non atomic. Note that φ is convex, and satisfies

φ (x) = 0,

for all

x < a_{φ},

where

a_{φ} = inf {x \in R; φ (x) > 0}

and

{lim}_{u \to \infty} φ (u) = \infty .

We will find a measurable function

c : T \to R

with

\int_{T} φ (c) d μ < \infty,

but

\int_{T} φ (c + λ) d μ = \infty,

for some

λ > 0

. For each

m \geq 1,

we consider

v_{m} (t) : = (m log (2) - \frac{3}{2}) 1_{E_{m}} (t),

where

E_{m} = \{t \in T; m log (2) - \frac{3}{2} > 0\}

and

1_{E_{m}} (t) = \{\begin{matrix} 1, & t \in E_{m} (t) \\ 0, & t \notin E_{m} (t) \end{matrix}

. Since

v_{m} ↑ \infty,

we can find a subsequence

{v_{m_{n}}}

such that

\int_{E_{m_{n}}} e^{{(v_{m_{n}} + 2)}^{2} / 2} d μ \geq 2^{n} .

According to [17], there exists a subsequence

w_{k} = v_{m_{n_{k}}}

and pairwise disjoint sets

A_{k} \subseteq E_{m_{n_{k}}}

for which

\int_{A_{k}} e^{{(v_{m_{n}} + 2)}^{2} / 2} d μ = 1 .

Let us define

c = \bar{c} 1_{T ∖ A} + \sum_{k = 1}^{\infty} w_{k} 1_{A_{k}}

where

A = ⋃_{k = 1}^{\infty} A_{k}

and

\bar{c}

is any measurable function such that

φ (\bar{c} (t)) > 0

for

t \in T ∖ A

and

\int_{T ∖ A} φ (\bar{c}) d μ < \infty .

Observing that

e^{{(w_{k} (t) + 2)}^{2} / 2} = 2^{m_{n_{k}}} e^{{(w_{k} (t) + 1)}^{2} / 2}, f o r t \in A_{k},

we obtain

\int_{A_{k}} e^{({w_{k} (t) + 1)}^{2} / 2} d μ = \frac{1}{2^{m_{n_{k}}}}, f o r e v e r y m \geq 1 .

Hence, we can write

\begin{matrix} \int_{T} φ (c) d μ = & \int_{T ∖ A} φ (\bar{c}) d μ + \sum_{k = 1}^{\infty} \int_{A_{k}} e^{({w_{k} (t) + 1)}^{2} / 2} d μ \\ = & \int_{T ∖ A} φ (\bar{c}) d μ + \sum_{k = 1}^{\infty} \frac{1}{2^{m_{n_{k}}}} < \infty . \end{matrix}

On the other hand, we also have

\begin{matrix} \int_{T} φ (c + 1) d μ = & \int_{T ∖ A} φ (\bar{c}) d μ + \sum_{k = 1}^{\infty} \int_{A_{k}} e^{({w_{k} (t) + 2)}^{2} / 2} d μ \\ = & \int_{T ∖ A} φ (\bar{c}) d μ + \sum_{k = 1}^{\infty} 1 \\ = & \infty, \end{matrix}

which shows that (a2) is not satisfied.

Definition 2.

We say that p and z in

P_{μ}

are φ-connected by an open arc, if there exists an open interval

I \supset [0, 1]

and a constant

κ (α)

, such that

φ ((1 - α) φ^{- 1} (p) + α φ^{- 1} (z) - κ (α)) \in P_{μ},

(12)

for each

α \in I

, where

κ (α)

depends of α, p and z.

According to the proof proved in [11], we have that

κ (α) \leq 0

for each

α \in [0, 1]

. Indeed,

for $α = 0, 1$ , we have clearly that $κ (α) = 0$ ;
for $α \in (0, 1),$ the convexity of the function of the $φ$ ensures that $0 \leq φ ((1 - α) φ^{- 1} (p) + α φ^{- 1} (z)) < (1 - α) p + α z$ . Integrating the inequality we obtain

$0 \leq \int_{T} φ ((1 - α) φ^{- 1} (p) + α φ^{- 1} (z)) d μ \leq 1 .$

Since $κ (α)$ satisfies

$\int_{T} φ ((1 - α) φ^{- 1} (p) + α φ^{- 1} (z) - κ (α)) d μ = 1,$

then $κ (α) \leq 0$ , for $α \in [0, 1]$ .

Now we will define, by using generalized exponential arcs, important sets for the construction of generalized

φ

-family of probability distributions. Let us define

\tilde{κ} (α) = sup \{λ > 0; \exists ε > 0 where (1 - α) φ^{- 1} (p) + α φ^{- 1} (z) - λ > a_{φ}, μ - a . e . t \in T, for each α \in (- ε, 1 + ε)\} .

as p and z are

φ

-connected by an open arc, we have that

(1 - α) φ^{- 1} (p) + α φ^{- 1} (z) - κ (α) > a_{φ}

, for each

α \in I

. Hence,

κ (α) < \tilde{κ} (α)

, i.e.,

κ (α) \in [- \infty, \tilde{κ} (α)) .

For

p \in P_{μ}

, where

p = φ (c)

, consider the set

R_{c}^{φ} = \{q \in P_{μ}; \exists ε > 0 where (1 - α) φ^{- 1} (p) + α φ^{- 1} (z) - \tilde{κ} (α) \geq a_{φ}, μ - a . e . t \in T, for each α \in (- ε, 1 + ε)\} .

We will show that the set

A_{c}^{φ} = \{q \in R_{c}^{φ}; \exists ε > 0 where \int_{T} φ ((1 - α) φ^{- 1} (p) + α φ^{- 1} (z) - \tilde{κ} (α)) d μ < 1, for each α \in (- ε, 1 + ε)\}

(13)

is a generalized

φ

-family of probability distributions.

Consider the partition of

P_{μ}

into equivalence classes using the following relation: given p, z ∈

P_{μ}

we say that

p \sim z

if and only if p and z are

φ

-connected by an open arc. This equivalence relation is necessary to define an atlas modeled on Banach spaces.

Consider then

L_{c}^{φ}

be the Musielak–Orlicz space, given as

L_{c}^{φ} = \{u \in L^{0}; \exists ε > 0 where \int_{T} φ (c + λ u) d μ < \infty, for each λ \in (- ε, ε)\}

and the set

N_{c}^{φ} = \{u \in L_{c}^{φ}; \exists ε \in (0, 1) where c + λ u \geq a_{φ}, for each λ \in [- ε, ε],\} .

Lemma 1.

The set

N_{c}^{φ}

is a closed subspace.

Proof.

Clearly

0 \in N_{c}^{φ} .

Given

u, v \in N_{c}^{φ},

there exist

ε_{1,} ε_{2} \in (0, 1),

such that

c + λ u \geq a_{φ}, μ - a . e ., for each λ \in [- ε_{1,} ε_{1}]

and

c + λ v \geq a_{φ}, μ - a . e ., for each λ \in [- ε_{2,} ε_{2}] .

Considering

ε = m i n {ε_{1,} ε_{2}},

we have that

u + v \in N_{c}^{φ}

. Finally, given

α \in R

we obtain

α u \in N_{c}^{φ}

, since

c + λ (α u) \geq a_{φ}, μ - a . e ., for each λ \in [- \frac{ε_{1}}{α}, \frac{ε_{1}}{α}]

.

The fact that remains to show is that

N_{c}^{φ}

is closed. For this, let

(u_{n}) \in N_{c}^{φ},

convergent

μ

-a.e. for

u \in L_{c}^{φ}

. This implies that there exists a subsequence

(u_{n})

, such that

c + λ u_{n} \to c + λ u, μ -

a.e.

t \in T

.

Then, for each

n \in N

we can find

ε_{n} \in (0, 1),

with

c + λ u_{n} \geq a_{φ}, μ -

a.e.

t \in T

, for each

λ \in [- ε_{n,} ε_{n}] .

The compactness of

[- ε_{n,} ε_{n}]

ensures that the coverage

\{(- {\bar{ε}}_{n} - δ, {\bar{ε}}_{n} + δ); n \in N\}

admits a finite undercoverage. Let

{- {\bar{ε}}_{1} - δ, ε_{1} + δ, \dots, - {\bar{ε}}_{n_{0}} - δ, ε_{n_{0}} + δ}

the set of the elements that constitute the finite undercoverage. Taking

\bar{ε} = min {- {\bar{ε}}_{1} - δ, {\bar{ε}}_{1} + δ, \dots, - {\bar{ε}}_{n_{0}} - δ, {\bar{ε}}_{n_{0}} + δ},

it follows that

c + λ u_{n} \geq a_{φ},

μ -

a.e.

t \in T

, for each

λ \in [- {\bar{ε}}_{,} \bar{ε}] .

Passing to the limit, we obtain

c + λ u \geq a_{φ}, μ -

a.e.

t \in T

, for each

λ \in [- \bar{ε}, \bar{ε}] .

Therefore,

u \in N_{c}^{φ}

and consequently

N_{c}^{φ}

is closed. □

Define the set

{\tilde{K}}_{c}^{φ} = \{u \in N_{c}^{φ}; \exists ε \in (0, 1), such that \int_{T} φ (c + λ u) d μ < \infty, for each λ \in (- ε, 1 + ε)\} .

(14)

Lemma 2.

The set

{\tilde{K}}_{c}^{φ}

is open in

N_{c}^{φ}

.

Proof.

Let

u \in {\tilde{K}}_{c}^{φ}

. Then, there exists

ε \in (0, 1)

, such that

\int_{T} φ (c + α u) d μ < \infty

for each

α \in [- ε, 1 + ε]

and

u \in N_{c}^{φ}

. Considering

δ = {[\frac{2}{ε} (1 + ε) (1 + \frac{ε}{2})]}^{- 1}

, we have that for any

v \in B_{δ} = \{w \in N_{c}^{φ} {; | | w | |}_{Φ_{c}} < δ\}

it occurs

I_{Φ_{c}} (\frac{v}{δ}) \leq 1

and consequently

\int_{T} φ (c + \frac{1}{δ} | v |) d μ \leq 2

. Given

α \in (0, 1 + \frac{ε}{2})

we denote

λ = \frac{α}{1 + ε}

. The inequality

\frac{α}{1 - λ} = \frac{α}{1 - \frac{α}{1 + ε}} \leq \frac{1 + \frac{ε}{2}}{1 - \frac{1 + \frac{ε}{2}}{1 + ε}} = \frac{2}{ε} (1 + ε) (1 + \frac{ε}{2}) = \frac{1}{δ},

implies

\begin{array}{l} φ (c + α (u + v)) & \leq φ (λ φ (c + \frac{α}{λ}) + (1 - λ) φ (c + \frac{α}{1 - λ} v)) \\ \leq λ φ (c + \frac{α}{λ}) + (1 - λ) φ (c + \frac{α}{1 - λ} v) \\ \leq λ φ (c + (1 + ϵ) u) + (1 - λ) φ (c + \frac{1}{δ} | v |) . \end{array}

For

α \in (- \frac{ε}{2}, 0),

we can write

\begin{array}{l} φ (c + α (u + v)) & \leq \frac{1}{2} φ (c + 2 α u) + \frac{1}{2} φ (c + 2 α v) \\ \leq \frac{1}{2} φ (c + 2 α u) + \frac{1}{2} φ (c + | v |) . \end{array}

Then, we have

\int_{T} φ (c + α (u + v)) d μ < \infty,

for any

α \in (- \frac{ε}{2}, 1 + \frac{ε}{2})

. Hence,

u + v \in K_{c}^{φ}

and since

N_{c}^{φ}

is a subspace, we obtain

u + v \in {\tilde{K}}_{c}^{φ}

. As a consequence,

B_{δ} (u)

is contained in

{\tilde{K}}_{c}^{φ}

and therefore the set

{\tilde{K}}_{c}^{φ}

is open. □

The set

{\tilde{K}}_{c}^{φ}

defined in (14) is important to guarantee that

φ (c + α u)

may be in

P_{μ}

. Now, we establish a relationship between the connection by an open arc and

{\tilde{K}}_{c}^{φ}

similar to that was proved in [14].

Proposition 2.

Fix

p \in P_{μ}

. We say that

z \in P_{μ}

is φ-connected to p by an open arc, if and only if, there exists an open interval

I \supset [0, 1]

and a random variable

u \in L_{c}^{φ}

, such that

p (α) \propto φ (c + α u) \in P_{μ}

, for each

α \in I

and

p (0) = p

and

p (1) = z

.

Proof.

Since that z is

φ

-connected to p by an open arc, there exists an interval

I \supset [0, 1]

, such that

\int_{T} φ ((1 - α) φ^{- 1} (p) + α φ^{- 1} (z)) d μ < \infty

, for each

α \in I

. Considering

u = φ^{- 1} (z) - φ^{- 1} (p)

, we have

\begin{array}{l} \int_{T} φ (c + α u) d μ & = \int_{T} φ (φ^{- 1} (p) + α (φ^{- 1} (z) - φ^{- 1} (p))) d μ \\ = \int_{T} φ ((1 - α) φ^{- 1} (p) + α φ^{- 1} (z)) d μ, \end{array}

where

u = φ^{- 1} (z) - φ^{- 1} (p)

and

φ (c) = p

. Therefore

u \in L_{c}^{φ}

. Another conclusion that arises from the fact of q is

φ

-connected to p by a open arc is that

(1 - α) φ^{- 1} (p) + α φ^{- 1} (z) - κ (α) > a_{φ}

. Hence,

p (α) \propto φ (c + α u) \in P_{μ}

, for each

α \in I

and

p (0) = p

and

p (1) = z

.

Reciprocally, taking

p (1) = q

, we get

φ (c + u) = z

, and consequently

u = φ^{- 1} (z) - φ^{- 1} (p)

with

φ (c) = p = p (0)

. □

One should notice that as a consequence of Proposition 2, given

p, z \in P_{μ}

φ

-connected by an open arc, the random variable

u \in {\tilde{K}}_{c}^{φ} = K_{c}^{φ} \cap N_{c}^{φ}

. In fact, this follows from two reasons: as

p, z \in P_{μ}

it follows that

φ^{- 1} (p), φ^{- 1} (z) > a_{φ}

and as z is

φ

-connected the p by an open arc we have

\int_{T} φ (c + α u) d μ < \infty

for each

α \in (- ε, 1 + ε)

.

Remark 1.

Since the function φ-arc is injective, in the Proposition 2 only the case

z \neq p

is considered. Therefore, there exists

z \in A_{c}^{φ}

such that

z \neq p

.

Lemma 3.

Let

z \in A_{c}^{φ}

φ-connected to p by an open arc. The map

V (λ) = \int_{T} φ ((1 - α) φ^{- 1} (p) + α φ^{- 1} (z) - λ) d μ

is then well defined. Moreover,

V (λ)

is strictly increasing.

Proof.

Proposition 2 ensures that

p (α) \propto φ (c + α u) \in P_{μ}

, where

u = φ^{- 1} (z) - φ^{- 1} (p) \in {\tilde{K}}_{c}^{φ}

and

φ (c + u) = z

. Then, we can find

ε > 0

such that

φ (c + (1 + ε) (φ^{- 1} (z) - φ^{- 1} (p)))

is

μ

-integrable. Given

α \in (- ε, 1 + ε)

, taking

\bar{λ} = \frac{α}{1 + ε}

, we obtain

\begin{array}{l} φ ((1 - α) φ^{- 1} (p) + α φ^{- 1} (z) - λ) & = φ (\bar{λ} (c + \frac{α}{\bar{λ}} (φ^{- 1} (z) - φ^{- 1} (p))) + (1 - \bar{λ}) (c + \frac{α}{1 - \bar{λ}} λ)) \\ \leq \bar{λ} φ (c + \frac{α}{\bar{λ}} (φ^{- 1} (z) - φ^{- 1} (p))) + (1 - \bar{λ}) φ (c + \frac{α}{1 - \bar{λ}} λ), \end{array}

and consequently

φ ((1 - α) φ^{- 1} (p) + α φ^{- 1} (z) - λ)

is

μ

-integrable, for every

λ \in R

and for each

α \in (- ε, 1 + ε)

. This proves that

V (λ)

is well defined. By the dominated convergence theorem, the map

λ \mapsto V (λ) = \int_{T} φ ((1 - α) φ^{- 1} (p) + α φ^{- 1} (z) - λ) d μ

is continuous,

{lim}_{λ \to \infty} V (λ) = 0

and

{lim}_{λ \to - \infty} V (λ) = \infty

. Hence, given

λ \in {λ \in R; (1 - α) φ^{- 1} (p) + α φ^{- 1} (z) - λ > a_{φ}, μ - a . e . t \in T,

for each α \in (- ε, 1 + ε)}

, we have that

V (λ)

is strictly increasing. □

Proposition 3.

Fix

p = φ (c) \in P_{μ}

and

z \in R_{c}^{φ}

. Then,

z \in A_{c}^{φ}

if, and only if z is φ-connected the p by a open arc.

Proof.

Given

z \in A_{c}^{φ}

there exists

ε > 0

, such that

\int_{T} φ ((1 - α) φ^{- 1} (p) + α φ^{- 1} (z) - \tilde{κ} (α)) d μ < 1

and

(1 - α) φ^{- 1} (p) + α φ^{- 1} (z) - \tilde{κ} (α) \geq a_{φ}

,

μ

-a.e.

t \in T

for each

α \in (- ε, 1 + ε)

. Then,

\int_{T} φ ((1 - α) φ^{- 1} (p) + α φ^{- 1} (z)) d μ < \infty

for each

α \in (- ε, 1 + ε)

which ensures that q is

φ

-connected to p by an open arc.

Reciprocally, take

z \in R_{c}^{φ}

φ

-connected to p by an open arc. In this way there exists

ε > 0

, where

\int_{T} φ ((1 - α) φ^{- 1} (p) + α φ^{- 1} (z) - κ (α)) d μ = 1

and

(1 - α) φ^{- 1} (p) + α φ^{- 1} (z) - κ (α) > a_{φ}

, for each

α \in (- ε, 1 + ε)

. Note that

\begin{array}{l} \int_{T} φ ((1 - α) φ^{- 1} (p) + α φ^{- 1} (z) - \tilde{κ} (α)) d μ & \leq \int_{T} φ ((1 - α) φ^{- 1} (p) + α φ^{- 1} (z) - κ (α)) d μ \\ = 1, \end{array}

(15)

because

κ (α) < \tilde{κ} (α)

, for each

α \in (- ε, 1 + ε)

and

φ

is non-decreasing. Suppose that

z \notin A_{c}^{φ}

, there exists

α \in (- ε, 1 + ε)

such that

\int_{T} φ ((1 - α) φ^{- 1} (p) + α φ^{- 1} (z) - \tilde{κ} (α)) d μ \geq 1 .

(16)

The Equations (15) and (16) ensure that

\int_{T} φ ((1 - α) φ^{- 1} (p) + α φ^{- 1} (z) - \tilde{κ} (α)) d μ = 1

for each

α \in (- ε, 1 + ε)

. Therefore, by Lemma 3 it exists a unique

λ_{0}

satisfying

(1 - α) φ^{- 1} (p) + α φ^{- 1} (z) - λ_{0} > a_{φ},

μ - a . e . t \in T

, such that

V (λ_{0}) = 1

. Since

κ (α)

is such that

(1 - α) φ^{- 1} (p) + α φ^{- 1} (z) - κ (α) > a_{φ}, μ - a . e . t \in T, for each α \in (- ε, 1 + ε)

, it follows that

λ_{0} = κ (α)

and consequently

κ (α)

is unique. Hence,

κ (α) = \tilde{κ} (α)

for each

α \in (- ε, 1 + ε)

, that is an absurd. □

By Corollary 3 the sets

A_{c}^{φ}

are the connected components of

P_{μ}

. Then, we need to find a domain for the parametrization in such a way that the image is

A_{c}^{φ}

.

We will make some similar considerations to the ones present in [10].

Remark that, for

u \in {\tilde{K}}_{c}^{φ}

,

φ (c + u)

is not necessarily in

P_{μ}

. Define

ψ : {\tilde{K}}_{c}^{φ} \to R

, such that the density

φ_{c} (u) = φ (c + u - ψ (u))

(17)

is contained in

P_{μ}

. We have that the open domain maximal of

ψ

is contained in

{\tilde{K}}_{c}^{φ}

. Note that

ψ

is well defined, since

c + u - ψ (u) > a_{φ}, μ - a . e . t \in T

. It can be then proved that

ψ : {\tilde{K}}_{c}^{φ} \to R

is convex, and as a consequence

ψ : {\tilde{K}}_{c}^{φ} \to R

is continuous, since

{\tilde{K}}_{c}^{φ}

is open by Lemma 2.

Let

φ_{+}^{'}

be the operator acting on the set of real-valued functions

u : T \to R

given by

φ_{+}^{'} (u) (t) = φ_{+}^{'} (t, u (t))

, where

φ_{+}^{'} (t, .)

is the right-derivative of

φ (t, .) .

Also, notice that the function

ψ : {\tilde{K}}_{c}^{φ} \to R

can assume both positive and negative values. Consider the closed subspace

{\tilde{B}}_{c}^{φ} = \{u \in N_{c}^{φ}; \int_{T} u φ_{+}^{'} (c) d μ = 0\} .

Observe that the image of

ψ

will be contained in

[0, \infty)

, since the domain of

ψ

is restricted to a

{\tilde{B}}_{c}^{φ}

. By the convexity property of

φ (t, .),

we have

u φ_{+}^{'} (t, c (t)) \leq φ (t, c (t) + u) - φ (t, c (t)) for all u \in R .

Hence, we have that

1 = \int u φ_{+}^{'} (c) d μ + \int φ (c) d μ \leq \int φ (c + u) d μ < \infty for any u \in {\tilde{K}}_{c}^{φ} \cap {\tilde{B}}_{c}^{φ} = {\tilde{B}}_{c}^{φ} .

Thus, it follows that

ψ (u) \geq 0

in order to

φ (c + u - ψ (u))

be in

P_{μ}

.

Given a measurable function

c : T \to R

such that

p = φ (c)

is a probability density in

P_{μ}

. Consider the set

M_{c}^{φ} = {(M_{c}^{φ})}_{1} \cap {(M_{c}^{φ})}_{2},

where

{(M_{c}^{φ})}_{1} = {u \in {\tilde{B}}_{c}^{φ}; c + α (u - ψ (u)) - \tilde{κ} (α)) > a_{φ} μ - a . e . for each α \in I \supset [0, 1]}

and

{(M_{c}^{φ})}_{2} = \{u \in {\tilde{B}}_{c}^{φ}; \int_{T} φ (c + α (u - ψ (u)) - \tilde{κ} (α))) d μ < 1, for each α \in I \supset [0, 1]\} .

Proposition 4.

Given

u \in M_{c}^{φ}

, we have that

φ (c + u - ψ (u)) \in A_{c}^{φ}

.

Proof.

Given

u \in M_{c}^{φ}

, we have

c + α (u - ψ (u)) + \tilde{κ} (α)) > a_{φ}

and

\int_{T} φ (c + α u - (α ψ (u) + \tilde{κ} (α))) d μ < 1, μ - a . e . t \in T, for each α \in I \supset [0, 1] .

Hence,

(1 - α) φ^{- 1} (p) + α φ^{- 1} (φ (c + u - ψ (u))) - \tilde{κ} (α) = (1 - α) c + α (c + u - ψ (u)) - \tilde{κ} (α) = c + α (u - ψ (u)) - \tilde{κ} (α) > a_{φ},

for each

α \in I \supset [0, 1]

, which implies in

φ (c + u - ψ (u)) \in R_{c}^{φ}

. In addition,

\int_{T} φ ((1 - α) φ^{- 1} (p) + α φ^{- 1} (φ (c + u - ψ (u)) - \tilde{κ} (α)) d μ = \int_{T} φ (c + α (u - ψ (u)) - \tilde{κ} (α)) d μ < 1,

for each

α \in I \supset [0, 1]

and therefore,

φ (c + u - ψ (u)) \in A_{c}^{φ}

. □

Proposition 5.

The set

M_{c}^{φ}

is open in

B_{c}^{φ}

.

Proof.

Consider the sets

{(M_{c}^{φ})}_{1} = {u \in {\tilde{B}}_{c}^{φ}; c + α (u - ψ (u)) - \tilde{κ} (α) > a_{φ} μ - a . e . for each α \in I \supset [0, 1]}

and

{(M_{c}^{φ})}_{2} = \{u \in {\tilde{B}}_{c}^{φ}; \int_{T} φ (c + α (u - ψ (u)) - \tilde{κ} (α)) d μ < 1, for each α \in I \supset [0, 1]\} .

Define the functions

f (α, u) = c + α u - α ψ (u) - \tilde{κ} (α) a n d g (α, u) = \int_{T} φ (c + α u - α ψ (u) - \tilde{κ} (α)) d μ .

The function f is well defined and continuous, since $ψ : {\tilde{K}}_{c}^{φ} \to R$ is continuous;
The map g is well defined in ${(M_{c}^{φ})}_{2}$ and continuous, since $φ$ and $ψ$ are continuous.

Moreover, given

u \in M_{c}^{φ}

, in particular

u \in {(M_{c}^{φ})}_{1}

and

u \in {(M_{c}^{φ})}_{2}

. By the continuity of f and g respectively, exist

ε_{1}, ε_{2} \in (0, 1)

, such that for each

v_{1} \in B_{ε_{1}} (u) \subset B_{c}^{φ}

, we have

f (v_{1}) > a_{φ}

and for each

v_{2} \in B_{ε_{2}} (u) \subset B_{c}^{φ}

, we have

g (v_{2}) < 1

. Taking,

ε = min \{ε_{1}, ε_{2}\}

, we obtain that

B_{ε} (u) \subset M_{c}^{φ}

and consequently

M_{c}^{φ}

is open in

B_{c}^{φ}

. □

Clearly

P_{μ} = ⋃ {A_{c}^{φ}; φ (c) \in P_{μ}}

. Consider the measurable functions

c_{1}, c_{2} : T \to R

, where

p_{1} = φ (c_{1})

and

p_{2} = φ (c_{2})

belong to

P_{μ}

. The parametrization

φ_{c_{1}} : M_{c_{1}}^{φ} \to A_{c_{1}}^{φ}

and

φ_{c_{2}} : M_{c_{2}}^{φ} \to A_{c_{2}}^{φ}

have a transition map given as

φ_{c_{2}}^{- 1} \circ φ_{c_{1}} : φ_{c_{1}}^{- 1} (A_{c_{1}}^{φ} \cap A_{c_{2}}^{φ}) \to φ_{c_{2}}^{- 1} (A_{c_{1}}^{φ} \cap A_{c_{2}}^{φ}) .

Given

ψ_{1} : M_{c_{1}}^{φ} \to [0, \infty)

and

ψ_{2} : M_{c_{2}}^{φ} \to [0, \infty)

being the normalizing functions associated to

c_{1}

and

c_{2},

respectively, and the functions

u \in M_{c_{1}}^{φ}

and

v \in M_{c_{2}}^{φ}

are such that

φ_{c_{1}} (u) = φ_{c_{2}} (v) \in A_{c_{1}}^{φ} \cap A_{c_{2}}^{φ} .

So, we have

v = c_{1} - c_{2} + u - ψ_{1} (u) + ψ_{2} (v) .

(18)

Multiplying the Equation (18) by

{(φ)}_{+}^{'} (c_{2})

and integrating with respect to the measure

μ,

once the function v is in

M_{c_{2}}^{φ}

, we obtain

0 = \int_{T} (c_{1} - c_{2} + u) {(φ)}_{+}^{'} (c_{2}) d μ - ψ_{1} (u) \int_{T} {(φ)}_{+}^{'} (c_{2}) d μ + ψ_{2} (v) \int_{T} {(φ)}_{+}^{'} (c_{2}) d μ,

and we can write

ψ_{2} (v) = \frac{- \int_{T} (c_{1} - c_{2} + u) {(φ)}_{+}^{'} (c_{2}) d μ}{\int_{T} {(φ)}_{+}^{'} (c_{2}) d μ} + ψ_{1} (u) \frac{\int_{T} {(φ)}_{+}^{'} (c_{2}) d μ}{\int_{T} {(φ)}_{+}^{'} (c_{2}) d μ} .

Therefore

v = c_{1} - c_{2} + u - ψ_{1} (u) - \frac{\int_{T} (c_{1} - c_{2} + u) {(φ)}_{+}^{'} (c_{2}) d μ}{\int_{T} {(φ)}_{+}^{'} (c_{2}) d μ} + ψ_{1} (u) \frac{\int_{T} {(φ_{q})}_{+}^{'} (c_{2}) d μ}{\int_{T} {(φ_{q})}_{+}^{'} (c_{2}) d μ} .

Hence, the transition map

φ_{c_{2}}^{- 1} \circ φ_{c_{1}}

can be expressed as

φ_{c_{2}}^{- 1} \circ φ_{c_{1}} (w) = c_{1} - c_{2} + w - \frac{\int_{T} (c_{1} - c_{2} + w) {(φ)}_{+}^{'} (c_{2}) d μ}{\int_{T} {(φ)}_{+}^{^{'}} (c_{2}) d μ},

(19)

for every

w \in φ_{c_{1}}^{- 1} (A_{c_{1}}^{φ} \cap A_{c_{2}}^{φ}) .

Showing that w and

c_{1} - c_{2}

are in

L_{c_{2}}^{φ}

and the spaces

L_{c_{1}}^{φ}

and

L_{c_{2}}^{φ}

have equivalent norms we obtain that this transition map will be of class

C^{\infty}

.

In the next corollary we have that Musielak–Orlicz spaces are equal. The proof follows as the one provided in [14].

Corollary 1.

Let

p, z \in P_{μ}

φ-connected by an open arc, where

p = φ (c)

and

z = φ (\tilde{c})

. Then,

L_{c}^{φ} = L_{\tilde{c}}^{φ}

.

Proof.

We have that z is

φ

-connected to p by a open arc. Then, by Corollary 3, we have that

\tilde{c} = c + u - ψ (u)

. The result follows immediately from [10]. □

It follows from Corollary 1 that

φ_{c_{2}}^{- 1} \circ φ_{c_{1}}

is of class

C^{\infty}

, and consequently, the set

φ_{c_{1}}^{- 1} (A_{c_{1}}^{φ} \cap A_{c_{2}}^{φ})

is open in $B_{c}^{φ}$ .

Proposition 6

([14], Proposition 8). The relation given in the Definition 2 is an equivalence relation.

Proof.

Since reflexivity and symmetry properties immediately follow from the definition, we will only prove transitivity. Let be

p, z, s \in P_{μ}

, such that,

p (t) \propto φ (c + t u), s (t) \propto φ (c + t v), t \in (- ε, 1 + ε)

with

p (0) = φ (c) = p

,

p (1) = φ (c + u) = z,

s (0) = φ (c) = p

,

s (1) = φ (c + v) = s

and

u, v \in N_{c}^{φ}

. Consider

z (t) \propto φ (c + (1 - t) u + t v) \propto φ (c + u + t (v - u))

is defined with

c + u = \tilde{c}, p (t) \propto φ (\tilde{c} + t (v - u))

, where

z (0) = φ (\tilde{c}) = φ (c + u) = z

,

z (1) = φ (\tilde{c} + (v - u)) = φ (c + v) = s

. Therefore z and s are

φ

-connected. □

As a consequence of the Corollary 3 and of the Proposition 6 we have that the

φ

-families

A_{c}^{φ}

are maximal, in the sense that

A_{c}^{φ} \cap A_{\tilde{c}}^{φ} = \emptyset

or if

A_{c}^{φ} \cap A_{\tilde{c}}^{φ} \neq \emptyset

, then

A_{c}^{φ} = A_{\tilde{c}}^{φ}

.

Hence, we can write the following proposition.

Proposition 7.

The collection

{\{(M_{c}^{φ}, φ_{c})\}}_{φ (c) \in P_{μ}}

equip

P_{μ}

with a

C^{\infty}

-differentiable structure.

4. The Tangent Bundle

In the previous section, the expression of the transition application

φ_{c_{2}}^{- 1} \circ φ_{c_{1}}

was important to garantee that

P_{μ}

could be equipped with a

C^{\infty}

-Banach structure. Now, we will use the transition application to find the tangent space of

P_{μ}

at the point

p = φ (c)

and the tangent bundle.

Given

p \in P_{μ}

, we consider the triple

(A_{c}^{φ}; φ_{c}^{- 1}; v),

where

A_{c}^{φ}

is the

φ

-family,

φ_{c}

is the parametrization and v is a vector in

φ_{c}^{- 1} (A_{c}^{φ})

which is contained in the vector space

L^{Φ_{c}}

.

Let us define the following equivalence relation:

(A_{c}^{φ}; φ_{c}^{- 1}; v) \sim (A_{\tilde{c}}^{φ}; φ_{\tilde{c}}^{- 1}; w) \Leftrightarrow {(φ_{\tilde{c}}^{- 1} \circ φ_{c})}^{'} (φ_{c} (p)) (v) = w .

The class

[A_{c}^{φ}; φ_{c}^{- 1}; v]

is called the tangent vector of

P_{μ}

in p and the set of all classes is called the tangent space and is denoted by

T_{p} (P_{μ})

. For more details we refer the reader to [20].

The vector

v \in φ_{c}^{- 1} (A_{c}^{φ})

is the velocity vector of a curve in the parametrization domain. In fact, consider

(A_{c_{1}}^{φ}, φ_{c_{1}}^{- 1})

and

(A_{c_{2}}^{φ}, φ_{c_{2}}^{- 1})

be charts about

p \in P_{μ}

and

g : I \subset T \to P_{μ}

a curve such that

g (t_{0}) = p

, for some

t_{0} \in T

. Taking

g (t) = φ_{c_{1}} (u_{1}) = φ (c_{1} + u_{1} - ψ (u_{1}))

, we have that

u_{1} (t) = φ_{c_{1}}^{- 1} (g (t))

. Moreover,

g (t) = φ_{c_{1}} (u_{1})

and

u_{2} (t) = φ_{c_{2}}^{- 1} (g (t))

. Using random variables we have that

u_{2} (t_{0}) = φ_{c_{2}}^{- 1} (g (t_{0})) = φ_{c_{2}}^{- 1} \circ φ_{c_{1}} (u_{1} (t_{0}))

. Hence, by the chain rule we can write

u_{2}^{'} (t_{0}) = {(φ_{c_{2}}^{- 1} \circ φ_{c_{1}})}^{'} (u_{1} (t_{0})) u_{1}^{'} (t_{0}) = {(φ_{c_{2}}^{- 1} \circ φ_{c_{1}})}^{'} (φ_{c_{1}}^{- 1} (p)) u_{1}^{'} (t_{0}) .

We will denote

τ (P_{μ})

as the tangent bundle, which is defined as the disjointed unity of

T_{p} (P_{μ})

, that is,

τ (P_{μ}) = ⨆_{p \in P_{μ}} T_{p} (P_{μ}) .

Proposition 8.

The local representation of the tangent bundle

τ (P_{μ})

is of the form

(u_{1,} v_{1}) \mapsto (φ_{c_{2}}^{- 1} \circ φ_{c_{1}} (u_{1}), v_{1} - \frac{\int_{T} v_{1} {(φ)}_{+}^{'} (c_{2}) d μ}{\int_{T} {(φ)}_{+}^{'} (c_{2}) d μ}) \in {\tilde{K}}_{c_{2}}^{φ} \times L_{c_{2}}^{φ} .

(20)

Proof.

Given

w \in φ_{c_{1}}^{- 1} (A_{c_{1}}^{φ} \cap A_{c_{2}}^{φ}),

we have that the derivative of the map

φ_{c_{2}}^{- 1} \circ φ_{c_{1}}

evaluated at w in the direction of

v \in L_{c}^{φ}

is of the form

{(φ_{c_{2}}^{- 1} \circ φ_{c_{1}})}^{'} (w) v = v - \frac{\int_{T} v {(φ)}_{+}^{'} (c_{2}) d μ}{\int_{T} {(φ)}_{+}^{'} (c_{2}) d μ} .

(21)

In fact, by the convexity of

φ

, we have that

\int_{T} (c_{1} - c_{2} + w) {(φ)}_{+}^{'} (c_{2}) d μ \leq \int_{T} [φ (c_{1} + w) + φ (c_{2})] d μ .

Since

w \in φ_{c_{1}}^{- 1} (A_{c_{1}}^{φ} \cap A_{c_{2}}^{φ}) \subset {\tilde{K}}_{c_{1}}^{φ},

we have that

φ (c_{1} + w)

is

μ

-integrable, and consequently,

\int_{T} (c_{1} - c_{2} + w) {(φ)}_{+}^{'} (c_{2}) d μ

is

μ

-integrable. Then, from the dominated convergence theorem follows that (21) occurs.

The tangent bundle is then denoted by

τ (P_{μ}) = {(φ_{c} (u), v); φ_{c} (u) \in A_{c}^{φ} \subset P_{μ} and v is a tangent vector to φ_{c} (u)} .

Its charts are expressed as

(v, u) \in τ (A_{c}^{φ}) \mapsto (φ_{c_{2}}^{- 1} (v), v - \frac{\int_{T} v {(φ)}_{+}^{'} (c_{2}) d μ}{\int_{T} {(φ)}_{+}^{'} (c_{2}) d μ}),

which was defined in the collection of open subsets

A_{c_{1}}^{φ} \times {\tilde{K}}_{c_{1}}^{φ}

of

P_{μ} \times L_{c_{1}}^{φ}

. Then, since Equation (21) occurs, the transition mappings are given for

(u_{1,} v_{1}) \in {\tilde{K}}_{c_{1}}^{φ} \times L_{c_{1}}^{φ}

by

(u_{1,} v_{1}) \mapsto (φ_{c_{2}}^{- 1} \circ φ_{c_{1}} (u_{1}), v_{1} - \frac{\int_{T} v_{1} {(φ)}_{+}^{'} (c_{2}) d μ}{\int_{T} {(φ)}_{+}^{'} (c_{2}) d μ}) \in {\tilde{K}}_{c_{2}}^{φ} \times L_{c_{2}}^{φ} .

(22)

□

5. Divergence in Statistical Manifolds

This section will be divided into two parts. The first one is responsible by the definition of the

φ

-divergence for the case where

φ

is the deformed exponential defined in Section 3 and to define a divergence using the q-exponential. In the second part, we prove that the q-exponential and

κ

-exponential functions can be used to generalize the divergence of Rényi [13,21].

5.1. The $φ$ -Divergence and q-Divergence

To define the divergence associated to the normalization function

ψ : \tilde{K}_{c}^{φ} \to R

is necessary the convexity of

ψ

. This is guaranteed by the fact that

N_{c}^{φ}

is a subspace and

ψ : K_{c}^{φ} \to R

is convex [10]. In this way, the Bregman’s divergence

B_{ψ} : {\tilde{B}}_{c}^{φ} \times {\tilde{B}}_{c}^{φ} \to [0, \infty)

associated the

ψ : {\tilde{B}}_{c}^{φ} \to [0, \infty)

is given by [22,23,24]

B_{ψ} (v, u) = ψ (v) - ψ (u) - \partial_{+} ψ (u) (v - u) .

(23)

Then, we can define the divergence

D_{ψ} : {\tilde{B}}_{c}^{φ} \times {\tilde{B}}_{c}^{φ} \to [0, \infty)

related the generalized

φ

-family

A_{c}^{φ}

as

D_{ψ} (u, v) = B_{ψ} (v, u)

.

Given

u, v \in {\tilde{B}}_{c}^{φ}

, we have that

φ (c + u - ψ (u)), φ (c + v - ψ (v)) \in P_{μ}

and as a consequence

c + u - ψ (u), c + v - ψ (v) > a_{φ}

. Supposing

φ

is continuously differentiable, it follows that the divergence

D_{ψ}

does not depend on the parametrization of

A_{c}^{φ}

. This allows us to define the divergence between the probability densities

p = φ_{c} (u)

and

z = φ_{c} (v)

, for

u, v \in {\tilde{B}}_{c}^{φ}

as

D (p ‖ z) = D_{ψ} (u, v) = \frac{\int_{T} \frac{φ_{c}^{- 1} (p) - φ_{c}^{- 1} (z)}{{(φ_{c}^{- 1})}^{'} (p)} d μ}{\int_{T} \frac{1}{{(φ_{c}^{- 1})}^{'} (p)} d μ} .

(24)

Note that the divergence is well defined inside the same

φ

-family. The condition

D (p ‖ z) = \infty

if p and z are not in the same

φ

-family extends the divergence for

P_{μ}

. We will denote those divergence by

D_{φ}

and called it

φ

-divergence [10].

Given

u, v \in {\tilde{B}}_{c}^{φ}

, we have that

u, v > a_{φ}

, then

φ (t, .)

is strictly convex in

{\tilde{B}}_{c}^{φ}

, and therefore

D_{φ}

is always non-negative and

D_{φ} (p ‖ z)

is equal to zero if and only if

p = z

. In the following example, we find the

φ

-divergence for the case in which the deformed exponential function

φ

is the q-deformed exponential function.

Example 3.

Consider the q-exponential

{exp}_{q} (t, u) = {exp}_{q (t)} (u)

instead of

φ (t, u),

whose inverse

φ^{- 1} (t, u)

is the q-logarithm

{ln}_{q} (t, u) = {ln}_{q (t)} (u) .

Then, we have

D (p ‖ z) = \frac{\int_{T} \frac{{ln}_{q} (p) - {ln}_{q} (z)}{{ln}_{q}^{'} (p)} d μ}{\int_{T} \frac{1}{{ln}_{q}^{'} (p)} d μ},

where

{ln}_{q} (p)

denotes

{ln}_{q (t)} (p (t)) .

Since the q-logarithm

{ln}_{q} (u) = \frac{u^{1 - q} - 1}{1 - q},

has as derivative

{ln}_{q}^{'} (u) = \frac{1}{u^{q}},

we have that

\int_{T} \frac{{ln}_{q} (p) - {ln}_{q} (z)}{{ln}_{q}^{'} (p)} d μ = \int_{T} \frac{\frac{p^{1 - q} - 1}{1 - q} - \frac{z^{1 - q} - 1}{1 - q}}{\frac{1}{p^{q}}} d μ = \int_{T} \frac{p^{q} (p^{1 - q} - z^{1 - q})}{1 - q} d μ

and

\int_{T} \frac{1}{{ln}_{q}^{'} (p)} d μ = \int_{T} \frac{1}{\frac{1}{p^{q}}} d μ = \int_{T} p^{q} d μ .

Therefore

D (p ‖ z) = \frac{\int_{T} \frac{p^{q} (p^{1 - q} - z^{1 - q})}{1 - q} d μ}{\int_{T} p^{q} d μ} .

(25)

The divergence

D (p ‖ z)

in (25) is related with the q-divergence defined in (6). In fact,

\begin{array}{l} I^{(q)} (p ‖ z) & = \int_{T} z f (\frac{p}{z}) d μ \\ = \int_{T} z (\frac{p}{z} {ln}_{q} (\frac{z}{p})) d μ \\ = - \int_{T} p (\frac{{ln}_{q} (z) - {ln}_{q} (p)}{1 + (1 - q) {ln}_{q} (p)}) d μ \\ = - \int_{T} p (\frac{(z^{1 - q} - p^{1 - q}) / (1 - q)}{1 + (1 - q) \frac{p^{1 - q} - 1}{(1 - q)}}) d μ \\ = \int_{T} p^{q} (\frac{(p^{1 - q} - z^{1 - q})}{(1 - q)}) d μ . \end{array}

Then

D (p ‖ z) = \frac{I^{(q)} (p ‖ z)}{\int_{T} p^{q} d μ}

and we can define the metric

g : Σ (P_{μ}) \times Σ (P_{μ}) \to F (P_{μ})

as

g (u, v) = \frac{q \int_{T} \frac{u v}{z} d μ}{\int_{T} z^{q} d μ},

(26)

where

Σ (P_{μ})

is the set of vector fields

u : A_{c}^{φ} \to T_{p} (A_{c}^{φ})

and

F (P_{μ})

the set of

C^{\infty}

functions

f : A_{c}^{φ} \to R

. This map is well defined, since

{(\frac{\partial}{\partial u})}_{p} {D (p ‖ z) |}_{p = z} = 0

and

{(\frac{\partial}{\partial v})}_{p} {D (p ‖ z) |}_{p = z} = 0 .

Notice that considering

\int_{T} p^{q} d μ = 1

we will have that divergence in (25) coincides with the q-divergence defined in [15], the metric in (26) coincides with the metric given in [25] and the family of covariant derivatives (connections) given by

\nabla_{w}^{q} u = \frac{\partial}{\partial w} u - \frac{(1 - q)}{r} u w + \frac{u}{A^{2}} B - w \frac{C}{A^{2}},

where

A = \int_{T} z^{q} d μ, B = {(\frac{\partial}{\partial w})}_{p} {A |}_{p = z}

and

C = {(\frac{\partial}{\partial u})}_{p} {A |}_{p = z}

coincides with the family of covariant derivatives (connections) given in [25]. The notation

{(\frac{\partial}{\partial w})}_{p} {A |}_{p = z}

means the derivative of A in the direction of w in the point z when

p = z

.

5.2. Generalization of Divergence of Rényi and ${exp}_{κ}$

Now, we will recall that the Rényi divergence is related with the

φ

-divergence and we will see that a necessary and sufficient condition for the existence of generalization of Rényi divergence is the condition (a2). Consequently, we prove that the q-deformed exponential and

κ

-exponential functions can be used in the generalization of Rényi divergence.

In [12] was defined a generalization of the Rényi divergence of order

α \in (0, 1)

as

D_{R, φ}^{(α)} (p ‖ z) = \frac{κ (α)}{α (1 - α)},

(27)

where

κ (α)

satisfies the Equation (12). This generalization in the case

α \in {0, 1}

is defined as the limit

D_{R, φ}^{(0)} (p ‖ z) = lim_{α \to 0} D_{R, φ}^{(α)} (p ‖ z)

(28)

and

D_{R, φ}^{(1)} (p ‖ z) = lim_{α \to 1} D_{R, φ}^{(α)} (p ‖ z) .

(29)

The limits in (28) and (29), under some conditions, are finite-valued and converges to the

φ

-divergence:

D_{R, φ}^{(0)} (z ‖ p) = D_{R, φ}^{(1)} (p ‖ z) = D_{φ} (p ‖ z) < \infty .

In the next proposition we have that a necessary and sufficient condition to connect two probability densities of

P_{μ}

by an open arc is the condition (a2).

Proposition 9

([12], Proposition 1). Let μ be a non-atomic measure. Consider

φ : R \to [0, \infty)

be a positive, deformed exponential function. Fix any

α \in (0, 1)

. The condition (a2) is satisfied if, and only if, given p and z in

P_{μ}

, there exists a constant

κ (α) : = κ (α; p, z)

such that

\int_{T} φ ((1 - α) φ^{- 1} (p) + α φ^{- 1} (z) - κ (α)) d μ = 1 .

(30)

In the Example 1, where the measure

μ

was assumed to be non-atomic, we have that the q-exponential function satisfies the condition (a2). Then, by Proposition 9 and Equation (27), we conclude that this function can be used in the generalization of Rényi divergence. Analogously, the function given in the Example 2 cannot be used in the generalization of Rényi divergence.

Supposing that

μ

is non-atomic, it is presented on the next proposition an equivalent criterion for a deformed exponential function

φ

to satisfy condition (a2).

Proposition 10

([12], Proposition 3). Let

φ : R \to [0, \infty)

be a deformed exponential function. Then (a2) is satisfied if, and only if,

\underset{u \to \infty}{lim sup} \frac{φ (u)}{φ (u - λ_{0})} < \infty, f o r s o m e λ_{0} > 0 .

In the next example, we will show a class of deformed exponential functions that can be used in the generalization of Rényi divergence.

Example 4.

We will show that the Kaniadakis κ-exponential

{exp}_{κ} (.)

satisfies the condition (a3). The κ-exponential

{exp}_{κ} : R \to (0, \infty)

for

κ \in [- 1, 1]

is defined as [26,27]

{exp}_{κ} (u) = \{\begin{matrix} {(κ u + \sqrt{1 + κ^{2} u^{2}})}^{\frac{1}{κ}}, & i f κ \neq 0, \\ exp (u) & i f κ = 0 . \end{matrix}

Its inverse, the so called κ-logarithm

{log}_{κ} : (0, \infty) \to R

, is given by

{log}_{κ} (u) = \{\begin{matrix} \frac{v^{κ} - v^{- κ}}{2 κ}, & i f κ \neq 0, \\ ln (v) & i f κ = 0 . \end{matrix}

We will verify that there exists

α \in (0, 1)

and

λ > 0

for which

λ \leq {log}_{κ} (v) - {log}_{κ} (κ v), f o r a l l v > 0 .

(31)

Some manipulations imply that the derivative of

{log}_{κ} (v) - {log}_{κ} (α v)

is negative for

0 < v \leq v_{0}

and positive for

v \geq v_{0}

, where

v_{0} = {(\frac{α^{- κ} - 1}{1 - α^{κ}})}^{\frac{1}{2 κ}} > 0 .

Consequently, the difference

{log}_{κ} (v) - {log}_{κ} (α v)

attains a minimum at

v_{0}

. Given

α \in (0, 1)

, inequality (31) is satisfied for some

λ > 0

. Inserting

v = {exp}_{κ} (u)

into (31), we can write

α {exp}_{κ} (u) \leq {exp}_{κ} (u - λ), f o r a l l u \in R .

(32)

If

n \in N

is such that

n λ \geq 1

, then a repeated application of (32) yields

α^{n} {exp}_{κ} (u) \leq {exp}_{κ} (u - n λ) \leq {exp}_{κ} (u - 1), f o r a l l u \in R .

Then,

\underset{u \to \infty}{lim sup} \frac{φ (u)}{φ (u - λ_{0})} = \underset{u \to \infty}{lim sup} \frac{{exp}_{κ} (u)}{{exp}_{κ} (u - 1)} \leq \underset{u \to \infty}{lim sup} \frac{1}{α^{n}} < \infty .

Therefore, by Proposition 10 Kaniadakis

κ

-exponential

{exp}_{κ} (.)

satisfies the condition (a2).

As consequence of the Example 4 and Proposition 9, we have that

{exp}_{κ} (u)

can be used in the generalization of Rényi divergence.

6. Conclusions

In this paper we constructed a parametrization of the statistical Banach manifold using a deformed exponential function. We have found the tangent space of

P_{μ}

in p and we also constructed the tangent bundle of

P_{μ}

. We defined the

φ

-divergence where

φ

is the q-exponential function and we establish a relation between this divergence and the q-divergence defined in [15]. Another important contribution is that the q-exponential and

κ

-exponential functions can be used to generalize the divergence of Rényi. The perspective for future works is to define the parallel transport, once we find the tangent plane. We also intend to construct a parametrization for

P_{μ}

using a deformed exponential function satisfying (a1) in the case where for each measurable function

c : T \to R

, with

\int_{T} φ (c) d μ = 1

, there exists a measurable function

u_{0 c} : T \to R

, such that

\int_{T} φ (c + λ u_{0 c}) d μ < \infty

, for each

λ > 0

.

Author Contributions

Conceptualization, F.L.J.V., R.F.V. and C.C.C.; writing—original draft, F.L.J.V.; writing—review and editing, F.L.J.V., L.H.F.d.A., R.F.V. and C.C.C.

Funding

The authors would like to thank Coordenação de Aperfeiçoamento de Pessoal de Nível Superior-Brasil (CAPES)-Finance Code 001, Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) (Procs. 309472/2017-2 and 408609/2016-8) and FUNCAP (Proc. IR7-00126-00037.01.00/17).

Conflicts of Interest

The authors declare no conflict of interest.

References

Amari, S.I. Differential Geometry of Curved Exponential Families-Curvatures and Information Loss. Ann. Stat. 1982, 10, 357–385. [Google Scholar] [CrossRef]
Amari, S.-I. Differential-Geometrical Methods in Statistics; Springer Science & Business: Berlin/Heidelberg, Germany, 2012; Volume 28. [Google Scholar]
Pistone, G.; Sempi, C. An infinite-dimensional geometric structure on the space of all the probability measures equivalent to a given one. Ann. Stat. 1995, 23, 1543–1561. [Google Scholar] [CrossRef]
Cena, A.; Pistone, G. Exponential statistical manifold. Ann. Inst. Stat. Math. 2007, 59, 27–56. [Google Scholar] [CrossRef]
Pistone, G.; Rogantin, M.P. The exponential statistical manifold: mean parameters, orthogonality and space transformations. Bernoulli 1999, 5, 721–760. [Google Scholar] [CrossRef]
Santacroce, M.; Siri, P.; Trivellato, B. New results on mixture and exponential models by Orlicz spaces. Bernoulli 2016, 22, 1431–1447. [Google Scholar] [CrossRef]
Naudts, J. Estimators, escort probabilities, and-exponential families in statistical physics. J. Ineq. Pure Appl. Math. 2004, 5, 102. [Google Scholar]
Matsuzoe, H.; Wada, T. Deformed algebras and generalizations of independence on deformed exponential families. Entropy 2015, 17, 5729–5751. [Google Scholar] [CrossRef]
Naudts, J. Generalised Thermostatistics; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2011. [Google Scholar]
Vigelis, R.F.; Cavalcante, C.C. On ϕ-families of probability distributions. J. Theor. Probab. 2013, 26, 870–884. [Google Scholar] [CrossRef]
Eguchi, S.; Komori, O. Path connectedness on a space of probability density functions. In Proceedings of the International Conference on Geometric Science of Information; Springer: Cham, Switzerland, 2015; pp. 615–624. [Google Scholar]
Vigelis, R.F.; de Andrade, L.H.F.; Cavalcante, C.C. On the Existence of Paths Connecting Probability Distributions. In Proceedings of the International Conference on Geometric Science of Information; Springer: Cham, Switzerland, 2017; pp. 801–808. [Google Scholar]
de Souza, D.C.; Vigelis, R.F.; Cavalcante, C.C. Geometry induced by a generalization of Rényi divergence. Entropy 2016, 18, 407. [Google Scholar] [CrossRef]
de Andrade, L.H.F.; Vieira, F.L.J.; Vigelis, R.F.; Cavalcante, C.C. Mixture and exponential arcs on generalized statistical manifold. Entropy 2018, 20, 147. [Google Scholar] [CrossRef]
Loaiza, G.; Quiceno, H. A q-exponential statistical Banach manifold. J. Math. Anal. Appl. 2013, 398, 466–476. [Google Scholar] [CrossRef]
Tsallis, C. What are the numbers that experiments provide. Quim. Nov. 1994, 17, 468–471. [Google Scholar]
Musielak, J. Orlicz Spaces and Modular Spaces; Springer: Berlin/Heidelberg, Germany, 2006; Volume 1034. [Google Scholar]
Rao, M.M.; Zhong, D.R. Theory of Orlicz Spaces; M. Dekker: New York, NY, USA, 1991. [Google Scholar]
Krasnosel’skii, M.A.; Rutitskii, Y.B. Convex Function and Orlicz Spaces; Noordhoff: Groningen, The Netherlands, 1961; Translated from Russian. [Google Scholar]
Lang, S. Introduction to Differentiable Manifolds; Springer Science and Business Media: Berlin/Heidelberg, Germany, 2002. [Google Scholar]
Van Erven, T.; Harremoës, P. Rényi divergence and Kullback-Leibler divergence. IEEE Trans. Inform. Theoy 2014, 60, 3797–3820. [Google Scholar] [CrossRef]
Bregman, L.M. The relaxation method of finding the common point of convex sets and its application to the solution of problems in convex programming. USSR Comput. Math. Math. Phys. 1967, 7, 200–217. [Google Scholar] [CrossRef]
Zhang, J. Divergence function, duality, and convex analysis. Neural Comput. 2004, 16, 159–195. [Google Scholar] [CrossRef] [PubMed]
Korbel, J.; Hänel, R.; Thurner, S. Information geometric duality of ϕ-deformed exponential families. Entropy 2019, 21, 112. [Google Scholar] [CrossRef]
Loaiza, G.; Quiceno, H. A Riemannian geometry in the q-exponential Banach manifold induced by q-divergences. In Proceedings of the International Conference on Geometric Science of Information; Springer: Cham, Switzerland, 2013; pp. 737–742. [Google Scholar]
Kaniadakis, G. Non-linear kinetics underlying generalized statistics. Physics A 2001, 296, 405–425. [Google Scholar] [CrossRef]
Pistone, G. Kappa-exponential models from the geometrical viewpoint. Eur. Phys. J. B 2009, 70, 29–37. [Google Scholar] [CrossRef]

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Josué Vieira, F.L.; Félix de Andrade, L.H.; Facundo Vigelis, R.; Casimiro Cavalcante, C. A Deformed Exponential Statistical Manifold. Entropy 2019, 21, 496. https://doi.org/10.3390/e21050496

AMA Style

Josué Vieira FL, Félix de Andrade LH, Facundo Vigelis R, Casimiro Cavalcante C. A Deformed Exponential Statistical Manifold. Entropy. 2019; 21(5):496. https://doi.org/10.3390/e21050496

Chicago/Turabian Style

Josué Vieira, Francisca Leidmar, Luiza Helena Félix de Andrade, Rui Facundo Vigelis, and Charles Casimiro Cavalcante. 2019. "A Deformed Exponential Statistical Manifold" Entropy 21, no. 5: 496. https://doi.org/10.3390/e21050496

APA Style

Josué Vieira, F. L., Félix de Andrade, L. H., Facundo Vigelis, R., & Casimiro Cavalcante, C. (2019). A Deformed Exponential Statistical Manifold. Entropy, 21(5), 496. https://doi.org/10.3390/e21050496

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Deformed Exponential Statistical Manifold

Abstract

1. Introduction

2. Background and Preliminary Results

2.1. A q-Exponential Statistical Banach Manifold

2.2. Musielak–Orlicz Spaces and $φ$ -Families of Probability Distributions

3. Construction of Generalized $φ$ -Families of Probability Distributions

4. The Tangent Bundle

5. Divergence in Statistical Manifolds

5.1. The $φ$ -Divergence and q-Divergence

5.2. Generalization of Divergence of Rényi and ${exp}_{κ}$

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

A Deformed Exponential Statistical Manifold

Abstract

1. Introduction

2. Background and Preliminary Results

2.1. A q-Exponential Statistical Banach Manifold

2.2. Musielak–Orlicz Spaces and φ -Families of Probability Distributions

3. Construction of Generalized φ -Families of Probability Distributions

4. The Tangent Bundle

5. Divergence in Statistical Manifolds

5.1. The φ -Divergence and q-Divergence

5.2. Generalization of Divergence of Rényi and exp κ

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

2.2. Musielak–Orlicz Spaces and $φ$ -Families of Probability Distributions

3. Construction of Generalized $φ$ -Families of Probability Distributions

5.1. The $φ$ -Divergence and q-Divergence

5.2. Generalization of Divergence of Rényi and ${exp}_{κ}$