Article

On Some Properties of Tsallis Hypoentropies and Hypodivergences

by Shigeru Furuichi 1,*, Flavia-Corina Mitroi-Symeonidis 2 and Eleutherius Symeonidis 3
1 Department of Information Science, College of Humanities and Sciences, Nihon University, 3-25-40, Sakurajyousui, Setagaya-ku, Tokyo, 156-8550, Japan
2 Faculty of Engineering Sciences, LUMINA - University of South-East Europe, Şos. Colentina 64b, Bucharest, RO-021187, Romania
3 Mathematisch-Geographische Fakultät, Katholische Universität Eichstätt-Ingolstadt, 85071 Eichstätt, Germany
* Author to whom correspondence should be addressed.
Entropy 2014, 16(10), 5377-5399; https://doi.org/10.3390/e16105377
Submission received: 15 September 2014 / Accepted: 8 October 2014 / Published: 15 October 2014
(This article belongs to the Section Statistical Physics)

Abstract: Both the Kullback–Leibler and the Tsallis divergence have a strong limitation: if the value zero appears in the probability distributions $(p_1,\dots,p_n)$ and $(q_1,\dots,q_n)$, it must appear in the same positions for the sake of significance. In order to avoid this limitation in the framework of Shannon statistics, Ferreri introduced the hypoentropy in 1980, remarking that “such conditions rarely occur in practice”. The aim of the present paper is to extend Ferreri’s hypoentropy to the Tsallis statistics. We introduce the Tsallis hypoentropy and the Tsallis hypodivergence and describe their mathematical behavior. Fundamental properties, like nonnegativity, monotonicity, the chain rule and subadditivity, are established.
MSC classifications: 26D15; 94A17

1. Preliminaries

Throughout this paper, X, Y and Z denote discrete random variables taking on the values {x1, · · ·, x|X|}, {y1, · · ·, y|Y|} and {z1, · · ·, z|Z|}, respectively, where |A| denotes the number of values of the discrete random variable A. We denote the discrete random variable following a uniform distribution by U. We set the probabilities as p(xi) ≡ Pr(X = xi), p(yj) ≡ Pr(Y = yj) and p(zk) ≡ Pr(Z = zk). If |U| = n, then $p(u_k) = \frac{1}{n}$ for all k = 1, · · ·, n. In addition, we denote by p(xi, yj) = Pr(X = xi, Y = yj), p(xi, yj, zk) = Pr(X = xi, Y = yj, Z = zk) the joint probabilities, by p(xi|yj) = Pr(X = xi|Y = yj), p(xi|yj, zk) = Pr(X = xi|Y = yj, Z = zk) the conditional probabilities, and so on.
The notion of entropy was used in statistical thermodynamics by Boltzmann [1] in 1871 and Gibbs [2] in 1902, in order to quantify the diversity, uncertainty and randomness of isolated systems. Later, it was seen as a measure of “information, choice and uncertainty” in the theory of communication, when Shannon [3] defined it by:
$$H(X) \equiv -\sum_{i=1}^{|X|} p(x_i)\log p(x_i).$$
In what follows, we consider |X| = |Y| = |U| = n, unless otherwise specified.
Making use of the concavity of the logarithmic function, one can easily check that the entropy is maximized by the equiprobable distribution, that is:
$$H(X) \le H(U) = \log n.$$
The right-hand side of this inequality has been known since 1928 as the Hartley entropy [4].
For two random variables X and Y following distributions {p(xi)} and {p(yi)}, the Kullback–Leibler [5] discrimination function (divergence or relative entropy) is defined by:
$$D(X\|Y) \equiv \sum_{i=1}^{n} p(x_i)\left(\log p(x_i) - \log p(y_i)\right) = -\sum_{i=1}^{n} p(x_i)\log\frac{p(y_i)}{p(x_i)}.$$
(We note that the relative entropy is usually defined for two probability distributions P = {pi} and Q = {qi} as $D(P\|Q) \equiv -\sum_{i=1}^{n} p_i\log\frac{q_i}{p_i}$ in the standard notation of information theory. D(P||Q) is often rewritten as D(X||Y) for random variables X and Y following the distributions P and Q. Throughout this paper, we use the style of Equation (3) for relative entropies to unify the notation with simple descriptions.) Here, the conventions $a\cdot\log\frac{0}{a} = -\infty$ (a > 0) and $0\cdot\log\frac{b}{0} = 0$ (b ≥ 0) are used. (We also note that the convention is often given in the following way with the definition of D(X||Y). If there exists i such that p(xi) ≠ 0 = p(yi), then we define D(X||Y) ≡ +∞ (in this case, D(X||Y) is no longer significant as an information measure). Otherwise, D(X||Y) is defined by Equation (3) with the convention $0\cdot\log\frac{0}{0} = 0$. This fact has been mentioned in the abstract of the paper.) In what follows, we use such conventions in the definitions of the entropies and divergences. However, we do not state them repeatedly.
It holds that:
$$H(U) - H(X) = D(X\|U).$$
Moreover, the cross-entropy (or inaccuracy):
$$H^{(cross)}(X,Y) \equiv -\sum_{i=1}^{n} p(x_i)\log p(y_i)$$
satisfies the identity:
$$D(X\|Y) = H^{(cross)}(X,Y) - H(X).$$
Many extensions of Shannon entropy have been studied. The Rényi entropy [6] and the α-entropy [7] are well known. The mathematical results obtained up to the 1970s are presented in detail in the book [8]. In the present paper, we focus on the hypoentropy introduced by Carlo Ferreri and the Tsallis entropy introduced by Constantino Tsallis.
The hypoentropy at the level λ (λ-entropy) was introduced in 1980 by Ferreri [9] as an alternative measure of information in the following form:
$$F_\lambda(X) \equiv \frac{1}{\lambda}(\lambda+1)\log(\lambda+1) - \frac{1}{\lambda}\sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)\log\left(1+\lambda p(x_i)\right)$$
for λ > 0. According to Ferreri [9], the parameter λ can be interpreted as a measure of the information inaccuracy of an economic forecast. As we will show in Section 2 that Fλ(X) ≤ H(X), the name hypoentropy comes from this property.
On the other hand, Tsallis introduced a one-parameter extension of the entropy in 1988 in [10], for handling systems that appear to deviate from standard statistical distributions. It plays an important role in the nonextensive statistical mechanics of complex systems, being defined as:
$$T_q(X) \equiv -\sum_{i=1}^{n} p(x_i)^q \ln_q p(x_i) = \sum_{i=1}^{n} p(x_i)\ln_q\frac{1}{p(x_i)}\qquad (q\ge 0,\ q\ne 1).$$
Here, the q-logarithmic function for x > 0 is defined by $\ln_q(x) \equiv \frac{x^{1-q}-1}{1-q}$, which converges to the usual logarithmic function log(x) in the limit q → 1. The Tsallis divergence (relative entropy) [11] is given by:
$$S_q(X\|Y) \equiv \sum_{i=1}^{n} p(x_i)^q\left(\ln_q p(x_i) - \ln_q p(y_i)\right) = -\sum_{i=1}^{n} p(x_i)\ln_q\frac{p(y_i)}{p(x_i)}.$$
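To make these definitions concrete, here is a minimal numerical sketch in Python (our illustration, not part of the original paper); the probability vectors p and r are arbitrary examples, and the prints simply confirm that the q-logarithm, the Tsallis entropy and the Tsallis divergence recover the Shannon entropy and the Kullback–Leibler divergence as q → 1.

```python
# Minimal sketch of the q-logarithm, Tsallis entropy T_q and Tsallis divergence S_q.
import numpy as np

def ln_q(x, q):
    """q-logarithm; reduces to log(x) as q -> 1."""
    return np.log(x) if q == 1 else (x**(1 - q) - 1) / (1 - q)

def tsallis_entropy(p, q):
    p = np.asarray(p, dtype=float)
    return float(np.sum(p * ln_q(1 / p, q)))

def tsallis_divergence(p, r, q):
    p, r = np.asarray(p, dtype=float), np.asarray(r, dtype=float)
    return float(-np.sum(p * ln_q(r / p, q)))

p = np.array([0.5, 0.3, 0.2])
r = np.array([0.4, 0.4, 0.2])
shannon = -np.sum(p * np.log(p))
kl = np.sum(p * np.log(p / r))
print(tsallis_entropy(p, 1 + 1e-6), shannon)    # nearly equal as q -> 1
print(tsallis_divergence(p, r, 1 + 1e-6), kl)   # nearly equal as q -> 1
```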
Note that some important properties of the Tsallis relative entropy were given in the papers [12–14].

2. Hypoentropy and Hypodivergence

For nonnegative real numbers, ai and bi (i = 1, · · ·, n), we define the generalized relative entropy (for incomplete probability distributions):
$$D^{(gen)}(a_1,\dots,a_n\|b_1,\dots,b_n) \equiv \sum_{i=1}^{n} a_i\log\frac{a_i}{b_i}.$$
Then, we have the so-called “log-sum” inequality:
$$\sum_{i=1}^{n} a_i\log\frac{a_i}{b_i} \ge \left(\sum_{i=1}^{n} a_i\right)\log\frac{\sum_{i=1}^{n} a_i}{\sum_{i=1}^{n} b_i},$$
with equality if and only if $\frac{a_i}{b_i} = \mathrm{const.}$ for all i = 1, · · ·, n.
If we impose the condition:
$$\sum_{i=1}^{n} a_i = \sum_{i=1}^{n} b_i = 1,$$
then D(gen)(a1, · · ·, an||b1, · · ·, bn) is just the relative entropy,
$$D(a_1,\dots,a_n\|b_1,\dots,b_n) \equiv \sum_{i=1}^{n} a_i\log\frac{a_i}{b_i}.$$
We put $a_i = \frac{1}{\lambda}+p(x_i)$ and $b_i = \frac{1}{\lambda}+p(y_i)$ with λ > 0 and $\sum_{i=1}^{n} p(x_i) = \sum_{i=1}^{n} p(y_i) = 1$, p(xi) ≥ 0, p(yi) ≥ 0. Then, we find that it is equal to the hypodivergence (λ-divergence) introduced by Ferreri in [9],
$$K_\lambda(X\|Y) \equiv \frac{1}{\lambda}\sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)\log\frac{1+\lambda p(x_i)}{1+\lambda p(y_i)}.$$
Clearly, we have:
$$\lim_{\lambda\to\infty} K_\lambda(X\|Y) = D(X\|Y).$$
Using the “log-sum” inequality, we have the nonnegativity:
$$K_\lambda(X\|Y) \ge 0,$$
with equality if and only if p(xi) = p(yi) for all i = 1, · · ·, n.
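The following short Python sketch (ours, with arbitrarily chosen distributions) illustrates the hypoentropy and the hypodivergence numerically: the hypodivergence is nonnegative, the hypoentropy stays below the Shannon entropy, and both quantities tend to their Shannon-type counterparts as λ → ∞.

```python
# Numerical illustration of Ferreri's hypoentropy F_lambda and hypodivergence K_lambda.
import numpy as np

def hypoentropy(p, lam):
    p = np.asarray(p, dtype=float)
    return ((lam + 1) * np.log(lam + 1)
            - np.sum((1 + lam * p) * np.log(1 + lam * p))) / lam

def hypodivergence(p, r, lam):
    p, r = np.asarray(p, dtype=float), np.asarray(r, dtype=float)
    return np.sum((1 + lam * p) * np.log((1 + lam * p) / (1 + lam * r))) / lam

p, r = np.array([0.6, 0.3, 0.1]), np.array([0.2, 0.5, 0.3])
H = -np.sum(p * np.log(p))
D = np.sum(p * np.log(p / r))
for lam in (0.1, 1.0, 10.0, 1e4):
    print(lam, hypoentropy(p, lam) <= H, hypodivergence(p, r, lam) >= 0)
print(hypoentropy(p, 1e6), H)        # F_lambda approaches H(X)
print(hypodivergence(p, r, 1e6), D)  # K_lambda approaches D(X||Y)
```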
For the hypoentropy Fλ(X) defined in Equation (7), we first show the fundamental relations. To do so, we prepare the following lemma.

Lemma 1

For any a > 0 and 0 ≤ x ≤ 1, we have
$$x(1+a)\log(1+a) \ge (1+ax)\log(1+ax).$$

Proof

We set $f(x) \equiv x(1+a)\log(1+a) - (1+ax)\log(1+ax)$. For any a > 0, we then have $\frac{d^2 f(x)}{dx^2} = -\frac{a^2}{1+ax} < 0$ and f(0) = f(1) = 0. Thus, we have the inequality.

Proposition 1

For λ > 0, we have the following inequalities:
$$0 \le F_\lambda(X) \le F_\lambda(U).$$
The equality in the first inequality holds if and only if p(xj) = 1 for some j (then p(xi) = 0 for all i ≠ j). The equality in the second inequality holds if and only if p(xi) = 1/n for all i = 1, · · ·, n.

Proof

From the nonnegativity of the hypodivergence Equation (15), we get:
$$0 \le K_\lambda(X\|U) = \frac{1}{\lambda}\sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)\log\left(1+\lambda p(x_i)\right) - \frac{1}{\lambda}(n+\lambda)\log\left(1+\frac{\lambda}{n}\right).$$
Thus, we have:
$$-\frac{1}{\lambda}\sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)\log\left(1+\lambda p(x_i)\right) \le -\frac{1}{\lambda}(n+\lambda)\log\left(1+\frac{\lambda}{n}\right).$$
Adding $\frac{1}{\lambda}(\lambda+1)\log(\lambda+1)$ to both sides, we have:
$$F_\lambda(X) \le F_\lambda(U),$$
with equality if and only if p(xi) = 1/n for all i = 1, · · ·, n.
For the first inequality, it is sufficient to prove:
$$(1+\lambda)\log(1+\lambda) - \sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)\log\left(1+\lambda p(x_i)\right) \ge 0.$$
Since $\sum_{i=1}^{n} p(x_i) = 1$, the above inequality is written as:
$$\sum_{i=1}^{n}\left\{p(x_i)(1+\lambda)\log(1+\lambda) - \left(1+\lambda p(x_i)\right)\log\left(1+\lambda p(x_i)\right)\right\} \ge 0,$$
so that we have only to prove:
$$p(x_i)(1+\lambda)\log(1+\lambda) - \left(1+\lambda p(x_i)\right)\log\left(1+\lambda p(x_i)\right) \ge 0,$$
for any λ > 0 and 0 ≤ p(xi) ≤ 1. Lemma 1 shows this inequality and the equality condition.
It is a known fact [9] that Fλ(X) is monotonically increasing as a function of λ and:
$$\lim_{\lambda\to\infty} F_\lambda(X) = H(X),$$
whence its name, as we noted in the Introduction. Thus, the hypoentropy appears as a generalization of Shannon's entropy. One can also see that the hypoentropy, like the entropy, equals zero in the case of certainty (i.e., for a so-called pure state, when all probabilities but one vanish).
It also holds that:
$$F_\lambda(U) - F_\lambda(X) = K_\lambda(X\|U).$$
It is of some interest for the reader to look at the hypoentropy that arises for equiprobable states,
$$F_\lambda(U) = \left(1+\frac{1}{\lambda}\right)\log(1+\lambda) - \left(1+\frac{n}{\lambda}\right)\log\left(1+\frac{\lambda}{n}\right).$$
Seen as a function of two variables, n and λ, it increases in each variable [9]. Since:
$$\lim_{\lambda\to\infty} F_\lambda(U) = \log n,$$
we shall call it the Hartley hypoentropy. (Throughout the paper, we add the name Hartley to the name of mathematical objects whenever they are considered for the uniform distribution. In the same way, we proceed with the name Tsallis, which we add to the name of some mathematical objects that we define, to emphasize that they are used in the framework of Tsallis statistics. This means that we will have Tsallis hypoentropies, Tsallis hypodivergences, and so on.) We have the cross-hypoentropy:
$$F_\lambda^{(cross)}(X,Y) \equiv \left(1+\frac{1}{\lambda}\right)\log(1+\lambda) - \frac{1}{\lambda}\sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)\log\left(1+\lambda p(y_i)\right).$$
It holds:
$$K_\lambda(X\|Y) = F_\lambda^{(cross)}(X,Y) - F_\lambda(X) \ge 0,$$
therefore, we have $F_\lambda^{(cross)}(X,Y) \ge F_\lambda(X)$.
We can show an upper bound for Fλ(X) as a direct consequence.

Proposition 2

The following inequality holds.
$$F_\lambda(X) \le (1-p_{max})\log(1+\lambda),$$
for all λ > 0, where $p_{max} \equiv \max\{p(x_1),\dots,p(x_n)\}$.

Proof

In the inequality (30), if for a fixed k, one takes the probability of the k-th component of Y to be p(yk) = 1, then:
$$-\sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)\log\left(1+\lambda p(x_i)\right) \le -\left(1+\lambda p(x_k)\right)\log(1+\lambda).$$
This implies that:
$$F_\lambda(X) \le \left(1+\frac{1}{\lambda}\right)\log(1+\lambda) - \frac{1}{\lambda}\left(1+\lambda p(x_k)\right)\log(1+\lambda) = \left(1-p(x_k)\right)\log(1+\lambda).$$
Since k is arbitrarily fixed, the conclusion follows.

Remark 1

It is of interest to notice now that, for the particular case X = U, we have:
$$F_\lambda(U) \le \left(1-\frac{1}{n}\right)\log(1+\lambda).$$
We add here one more detail: the inequality (34) can be verified using Bernoulli’s inequality.

3. Tsallis Hypoentropy and Hypodivergence

Now, we turn our attention to the Tsallis statistics. We extend the definition of hypodivergences as follows:

Definition 1

The Tsallis hypodivergence (q-hypodivergence, Tsallis relative hypoentropy) is defined by:
$$D_{\lambda,q}(X\|Y) \equiv -\frac{1}{\lambda}\sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)\ln_q\frac{1+\lambda p(y_i)}{1+\lambda p(x_i)}$$
for λ > 0 and q ≥ 0.
Then, we have the relation:
$$\lim_{\lambda\to\infty} D_{\lambda,q}(X\|Y) = S_q(X\|Y),$$
which is the Tsallis divergence, and:
$$\lim_{q\to 1} D_{\lambda,q}(X\|Y) = K_\lambda(X\|Y),$$
which is the hypodivergence.
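A small numerical sketch (ours, not from the paper) of Definition 1 is given below; the distributions and parameters are arbitrary, and the two prints check the limits just stated: λ → ∞ gives the Tsallis divergence and q → 1 gives Ferreri's hypodivergence.

```python
# Numerical check of the two limits of the Tsallis hypodivergence D_{lambda,q}.
import numpy as np

def ln_q(x, q):
    return np.log(x) if q == 1 else (x**(1 - q) - 1) / (1 - q)

def tsallis_hypodivergence(p, r, lam, q):
    p, r = np.asarray(p, dtype=float), np.asarray(r, dtype=float)
    return -np.sum((1 + lam * p) * ln_q((1 + lam * r) / (1 + lam * p), q)) / lam

def tsallis_divergence(p, r, q):
    p, r = np.asarray(p, dtype=float), np.asarray(r, dtype=float)
    return -np.sum(p * ln_q(r / p, q))

def hypodivergence(p, r, lam):            # Ferreri's K_lambda
    return tsallis_hypodivergence(p, r, lam, 1)

p, r, q, lam = [0.6, 0.3, 0.1], [0.2, 0.5, 0.3], 1.5, 2.0
print(tsallis_hypodivergence(p, r, 1e6, q), tsallis_divergence(p, r, q))        # lambda -> infinity
print(tsallis_hypodivergence(p, r, lam, 1 + 1e-8), hypodivergence(p, r, lam))   # q -> 1
```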

Remark 2

This definition can be also obtained from the generalized Tsallis relative entropy (for incomplete probability distributions {a1, · · ·, an} and {b1, · · ·, bn}):
$$D_q^{(gen)}(a_1,\dots,a_n\|b_1,\dots,b_n) \equiv -\sum_{i=1}^{n} a_i\ln_q\frac{b_i}{a_i},$$
by putting $a_i = \frac{1}{\lambda}+p(x_i)$ and $b_i = \frac{1}{\lambda}+p(y_i)$ for λ > 0.
The generalized relative entropy (10) and the generalized Tsallis relative entropy (38) can be written as the generalized f-divergence (for incomplete probability distributions):
$$D_f^{(gen)}(a_1,\dots,a_n\|b_1,\dots,b_n) \equiv \sum_{i=1}^{n} a_i f\!\left(\frac{b_i}{a_i}\right)$$
for a convex function f on (0, ∞) and ai ≥ 0, bi ≥ 0 (i = 1, · · ·, n). See [15] and [16] for details.
By the concavity of the q-logarithmic function, we have the following “lnq-sum” inequality:
$$-\sum_{i=1}^{n} a_i\ln_q\frac{b_i}{a_i} \ge -\left(\sum_{i=1}^{n} a_i\right)\ln_q\left(\frac{\sum_{i=1}^{n} b_i}{\sum_{i=1}^{n} a_i}\right),$$
with equality if and only if $\frac{a_i}{b_i} = \mathrm{const.}$ for all i = 1, · · ·, n. Using the “lnq-sum” inequality, we have the nonnegativity of the Tsallis hypodivergence:
$$D_{\lambda,q}(X\|Y) \ge 0,$$
with equality if and only if p(xi) = p(yi) for all i = 1, · · ·, n (the equality condition comes from the equality condition of the “lnq-sum” inequality and the condition $\sum_{i=1}^{n} p(x_i) = \sum_{i=1}^{n} p(y_i) = 1$).

Definition 2

For λ > 0 and q ≥ 0, the Tsallis hypoentropy (q-hypoentropy) is defined by:
$$H_{\lambda,q}(X) \equiv \frac{h(\lambda,q)}{\lambda}\left\{-(1+\lambda)\ln_q\frac{1}{1+\lambda} + \sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)\ln_q\frac{1}{1+\lambda p(x_i)}\right\},$$
where the function h(λ, q) > 0 satisfies two conditions,
$$\lim_{q\to 1} h(\lambda,q) = 1$$
and:
$$\lim_{\lambda\to\infty}\frac{h(\lambda,q)}{\lambda^{1-q}} = 1.$$
These conditions are equivalent to:
$$\lim_{q\to 1} H_{\lambda,q}(X) = F_\lambda(X) \quad (\text{the hypoentropy})$$
and, respectively,
$$\lim_{\lambda\to\infty} H_{\lambda,q}(X) = T_q(X) \quad (\text{the Tsallis entropy}).$$
Some interesting examples are h(λ, q) = λ1−q and h(λ, q) = (1+λ)1−q.
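The sketch below (our illustration) evaluates the Tsallis hypoentropy for the first example h(λ, q) = λ^(1−q) and checks the two defining limits numerically; the distribution and the parameter values are arbitrary.

```python
# Numerical check of Definition 2 with h(lambda, q) = lambda**(1-q).
import numpy as np

def ln_q(x, q):
    return np.log(x) if q == 1 else (x**(1 - q) - 1) / (1 - q)

def tsallis_hypoentropy(p, lam, q, h=lambda lam, q: lam**(1 - q)):
    p = np.asarray(p, dtype=float)
    return h(lam, q) / lam * (-(1 + lam) * ln_q(1 / (1 + lam), q)
                              + np.sum((1 + lam * p) * ln_q(1 / (1 + lam * p), q)))

def hypoentropy(p, lam):                   # Ferreri's F_lambda
    return tsallis_hypoentropy(p, lam, 1, h=lambda lam, q: 1.0)

def tsallis_entropy(p, q):
    p = np.asarray(p, dtype=float)
    return np.sum(p * ln_q(1 / p, q))

p, lam, q = [0.5, 0.3, 0.2], 2.0, 1.7
print(tsallis_hypoentropy(p, lam, 1 + 1e-8), hypoentropy(p, lam))   # q -> 1 gives F_lambda
print(tsallis_hypoentropy(p, 1e7, q), tsallis_entropy(p, q))        # lambda -> infinity gives T_q
```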

Remark 3

It is worthwhile to discuss the Tsallis cross-hypoentropy. The first candidate for the definition of the Tsallis cross-hypoentropy is:
$$H_{\lambda,q}^{(cross)}(X,Y) \equiv \frac{h(\lambda,q)}{\lambda}\left\{-(1+\lambda)\ln_q\frac{1}{1+\lambda} - \sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)^q\ln_q\left(1+\lambda p(y_i)\right)\right\},$$
which recovers the cross-hypoentropy defined in Equation (29) in the limit q → 1. Then, we have:
$$H_{\lambda,q}^{(cross)}(X,Y) - H_{\lambda,q}(X) = \frac{h(\lambda,q)}{\lambda}\sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)^q\left\{\ln_q\left(1+\lambda p(x_i)\right) - \ln_q\left(1+\lambda p(y_i)\right)\right\} = -\frac{h(\lambda,q)}{\lambda}\sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)\ln_q\frac{1+\lambda p(y_i)}{1+\lambda p(x_i)} = h(\lambda,q)\,D_{\lambda,q}(X\|Y) \ge 0.$$
The last inequality is due to the nonnegativity given in Equation (41). Since limq→1 h(λ, q) = 1 by the definition of the Tsallis hypoentropy (see Equation (43)), the above relation recovers the inequality (30) in the limit q → 1.
The second candidate for the definition of the Tsallis cross-hypoentropy is:
$$\tilde{H}_{\lambda,q}^{(cross)}(X,Y) \equiv \frac{h(\lambda,q)}{\lambda}\left\{-(1+\lambda)\ln_q\frac{1}{1+\lambda} + \sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)\ln_q\frac{1}{1+\lambda p(y_i)}\right\},$$
which also recovers the cross-hypoentropy defined in Equation (29) in the limit q → 1. Then, we have:
$$\tilde{H}_{\lambda,q}^{(cross)}(X,Y) - H_{\lambda,q}(X) = -\frac{h(\lambda,q)}{\lambda}\sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)\left\{\ln_q\frac{1}{1+\lambda p(x_i)} - \ln_q\frac{1}{1+\lambda p(y_i)}\right\} = h(\lambda,q)\,\tilde{D}_{\lambda,q}(X\|Y),$$
where the alternative Tsallis hypodivergence has to be defined by:
$$\tilde{D}_{\lambda,q}(X\|Y) \equiv -\frac{1}{\lambda}\sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)\left\{\ln_q\frac{1}{1+\lambda p(x_i)} - \ln_q\frac{1}{1+\lambda p(y_i)}\right\}.$$
We have D̃λ,q(X||Y) ≠ Dλ,q(X||Y) and limq→1 D̃λ,q(X||Y) = Kλ(X||Y). However, the nonnegativity of D̃λ,q(X||Y) (q ≥ 0) does not hold in general, as the following counterexamples show. Take λ = 1, n = 2, p(x1) = 0.9, p(y1) = 0.8, q = 0.5; then D̃λ,q(X||Y) ≃ −0.0137586. In addition, take λ = 1, n = 3, p(x1) = 0.3, p(x2) = 0.4, p(y1) = 0.2, p(y2) = 0.7 and q = 1.9; then D̃λ,q(X||Y) ≃ −0.0195899. Therefore, we may conclude that Equation (47) is to be preferred over Equation (48).
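The two counterexamples can be reproduced with a few lines of Python (our code; the last components of the distributions are fixed by normalization):

```python
# Reproducing the counterexamples showing that D-tilde of Equation (48) can be negative.
import numpy as np

def ln_q(x, q):
    return np.log(x) if q == 1 else (x**(1 - q) - 1) / (1 - q)

def d_tilde(p, r, lam, q):
    p, r = np.asarray(p, dtype=float), np.asarray(r, dtype=float)
    return -np.sum((1 + lam * p) * (ln_q(1 / (1 + lam * p), q)
                                    - ln_q(1 / (1 + lam * r), q))) / lam

print(d_tilde([0.9, 0.1], [0.8, 0.2], lam=1, q=0.5))            # approx -0.0137586
print(d_tilde([0.3, 0.4, 0.3], [0.2, 0.7, 0.1], lam=1, q=1.9))  # approx -0.0195899
```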
We turn to show the nonnegativity and maximality for the Tsallis hypoentropy.

Lemma 2

For any a > 0, q ≥ 0 and 0 ≤ x ≤ 1, we have:
$$x(1+a)\ln_q\frac{1}{1+a} \le (1+ax)\ln_q\frac{1}{1+ax}.$$

Proof

We set $g(x) \equiv x(1+a)\ln_q\frac{1}{1+a} - (1+ax)\ln_q\frac{1}{1+ax}$. For any a > 0 and q ≥ 0, we then have $\frac{d^2 g(x)}{dx^2} = qa^2\left(\frac{1}{1+ax}\right)^{2-q} \ge 0$ and g(0) = g(1) = 0. Thus, we have the inequality.

Proposition 3

For λ > 0, q ≥ 0 and h(λ, q) > 0 satisfying (43) and (44), we have the following inequalities:
$$0 \le H_{\lambda,q}(X) \le H_{\lambda,q}(U).$$
The equality in the first inequality holds if and only if p(xj) = 1 for some j (then p(xi) = 0 for all ij). The equality in the second inequality holds if and only if p(xi) = 1/n for all i = 1, · · ·, n.

Proof

In a similar way to the proof of Proposition 1, for the first inequality, it is sufficient to prove:
$$-\sum_{i=1}^{n}\left\{p(x_i)(1+\lambda)\ln_q\frac{1}{1+\lambda} - \left(1+\lambda p(x_i)\right)\ln_q\frac{1}{1+\lambda p(x_i)}\right\} \ge 0,$$
so that we have only to prove:
$$p(x_i)(1+\lambda)\ln_q\frac{1}{1+\lambda} \le \left(1+\lambda p(x_i)\right)\ln_q\frac{1}{1+\lambda p(x_i)}$$
for any λ > 0, q ≥ 0 and 0 ≤ p(xi) ≤ 1. Lemma 2 shows this inequality with the equality condition.
The second inequality is proven by the use of the nonnegativity of the Tsallis hypodivergence in the following way:
$$0 \le D_{\lambda,q}(X\|U) = -\frac{1}{\lambda}\sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)\ln_q\frac{1+\frac{\lambda}{n}}{1+\lambda p(x_i)},$$
which implies (by the use of the formula $\ln_q\frac{b}{a} = b^{1-q}\ln_q\frac{1}{a} + \ln_q b$):
$$\frac{1}{\lambda}\sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)\ln_q\frac{1}{1+\lambda p(x_i)} \le \frac{n+\lambda}{\lambda}\ln_q\frac{n}{n+\lambda}.$$
The equality condition of the second inequality follows from the equality condition of the nonnegativity of the Tsallis hypodivergence (41).
We may call:
$$H_{\lambda,q}(U) = \frac{h(\lambda,q)}{\lambda}\left\{-(1+\lambda)\ln_q\frac{1}{1+\lambda} + (n+\lambda)\ln_q\frac{1}{1+\frac{\lambda}{n}}\right\}$$
the Hartley–Tsallis hypoentropy. We study the monotonicity in n or λ of the Hartley–Tsallis hypoentropy Hλ,q(U) and of the Tsallis hypoentropy Hλ,q(X). (Throughout the present paper, the term “monotonicity” means monotone increase/decrease as a function of the parameter λ. We emphasize that it does not mean monotonicity under stochastic maps.)

Lemma 3

The function:
$$f(x) = (x+1)\ln_q\frac{x}{x+1}\qquad (x>0)$$
is monotonically increasing in x, for any q ≥ 0.

Proof

By direct calculations, we have:
$$\frac{df(x)}{dx} = \frac{1}{1-q}\left\{\left(1+\frac{1}{x}\right)^{q-1}\left(1+\frac{1-q}{x}\right) - 1\right\}$$
and:
$$\frac{d^2 f(x)}{dx^2} = -q x^{-3}\left(1+\frac{1}{x}\right)^{q-2} \le 0.$$
Since $\lim_{x\to\infty}\frac{df(x)}{dx} = 0$, we have $\frac{df(x)}{dx} \ge 0$.

Proposition 4

The Hartley–Tsallis hypoentropy:
$$H_{\lambda,q}(U) = \frac{h(\lambda,q)}{\lambda}\left\{-(1+\lambda)\ln_q\frac{1}{1+\lambda} + (n+\lambda)\ln_q\frac{1}{1+\frac{\lambda}{n}}\right\}$$
is a monotonically increasing function of n, for any λ > 0 and q ≥ 0.

Proof

Note that:
$$H_{\lambda,q}(U) = h(\lambda,q)\left\{-\left(1+\frac{1}{\lambda}\right)\ln_q\frac{1}{1+\lambda} + \left(1+\frac{n}{\lambda}\right)\ln_q\frac{1}{1+\frac{\lambda}{n}}\right\}.$$
Putting $x = \frac{n}{\lambda} > 0$ for fixed λ > 0 in Lemma 3, we get the function:
$$g(n) = \left(1+\frac{n}{\lambda}\right)\ln_q\frac{1}{1+\frac{\lambda}{n}},$$
which is a monotonically increasing function of n. Thus, we have the present proposition.

Remark 4

We have the relation:
$$\lim_{n\to\infty} H_{\lambda,q}(U) = h(\lambda,q)\left\{-\left(1+\frac{1}{\lambda}\right)\ln_q\frac{1}{1+\lambda} - 1\right\}.$$
We notice from the condition (44) that:
$$\lim_{\lambda\to\infty}\left(\lim_{n\to\infty} H_{\lambda,q}(U)\right) = \lim_{\lambda\to\infty}\frac{h(\lambda,q)}{\lambda^{1-q}}\cdot\lambda^{1-q}\left\{-1-\left(1+\frac{1}{\lambda}\right)\ln_q\frac{1}{1+\lambda}\right\} = \frac{1}{1-q}\lim_{\lambda\to\infty}\frac{1+q\lambda-(1+\lambda)^q}{\lambda^q} = \begin{cases} 0 & (q=0)\\ \infty & (0<q<1)\\ \frac{1}{q-1} & (q>1),\end{cases}$$
and conclude that the result is independent of the choice of h(λ, q).
For the limit λ → 0, we consider two cases.
(1)
In the case of h(λ, q) = λ1−q, we have:
$$\lim_{\lambda\to 0}\left(\lim_{n\to\infty} H_{\lambda,q}(U)\right) = \lim_{\lambda\to 0}\lambda^{1-q}\left\{-1-\left(1+\frac{1}{\lambda}\right)\ln_q\frac{1}{1+\lambda}\right\} = \frac{1}{1-q}\lim_{\lambda\to 0}\frac{1+q\lambda-(1+\lambda)^q}{\lambda^q} = \begin{cases} \infty & (q>2)\\ 1 & (q=2)\\ 0 & (0\le q<2),\end{cases}$$
as one obtains using l’Hôpital’s rule.
(2)
In the case of h(λ, q) = (1 + λ)1−q, we have for all q ≥ 0:
$$\lim_{\lambda\to 0}\left(\lim_{n\to\infty} H_{\lambda,q}(U)\right) = \lim_{\lambda\to 0}(1+\lambda)^{1-q}\left\{-1-\left(1+\frac{1}{\lambda}\right)\ln_q\frac{1}{1+\lambda}\right\} = \frac{1}{1-q}\lim_{\lambda\to 0}\frac{1+q\lambda-(1+\lambda)^q}{\lambda(1+\lambda)^{q-1}} = \frac{q}{1-q}\lim_{\lambda\to 0}\frac{1-(1+\lambda)^{q-1}}{(1+\lambda)^{q-1}+(q-1)\lambda(1+\lambda)^{q-2}} = 0.$$
These results mean that our Hartley–Tsallis hypoentropy with h(λ, q) = λ1−q or (1 + λ)1−q has the same limits as the Hartley hypoentropy, Fλ(U) (see also [9]), in the case 0 < q < 1.
We study here the monotonicity of Hλ,q(X) for h(λ, q) = (1 + λ)1−q. The other case h(λ, q) = λ1−q is studied in the next section; see Lemma 5.

Proposition 5

We assume h(λ, q) = (1 + λ)1−q. Then, Hλ,q(X) is a monotone increasing function of λ > 0 when 0 ≤ q ≤ 2.

Proof

Note that:
$$H_{\lambda,q}(X) = \sum_{i=1}^{n} Sn_{\lambda,q}(p(x_i)),$$
where:
$$Sn_{\lambda,q}(x) \equiv \frac{(1+\lambda)^{1-q}}{\lambda(1-q)}\left\{(1+\lambda x)^q - (1+\lambda)^q x + x - 1\right\}$$
is defined on 0 ≤ x ≤ 1, 0 ≤ q ≤ 2 and λ > 0. Then, we have:
$$\frac{dH_{\lambda,q}(X)}{d\lambda} = \sum_{i=1}^{n}\frac{dSn_{\lambda,q}(p(x_i))}{d\lambda} = \sum_{i=1}^{n} s_{\lambda,q}(p(x_i)),$$
where:
$$s_{\lambda,q}(x) \equiv \frac{q\lambda(1-x)\left\{1-(1+\lambda x)^{q-1}\right\} + 1 - x + (1+\lambda)^q x - (1+\lambda x)^q}{(1-q)\lambda^2(1+\lambda)^q}$$
is defined on 0 ≤ x ≤ 1, 0 ≤ q ≤ 2 and λ > 0. By some computations, we have:
$$\frac{d^2 s_{\lambda,q}(x)}{dx^2} = -\frac{q(1+\lambda x)^{q-3}\left[1+\lambda\left\{(x-1)(q-1)+1\right\}\right]}{(1+\lambda)^q} \le 0,$$
since (x − 1)(q − 1) + 1 ≥ 0 for 0 ≤ x ≤ 1 and 0 ≤ q ≤ 2. We easily find sλ,q(0) = sλ,q(1) = 0. Thus, we have sλ,q(x) ≥ 0 for 0 ≤ x ≤ 1, 0 ≤ q ≤ 2 and λ > 0. Therefore, we have d H λ , q ( X ) d λ 0 for 0 ≤ q ≤ 2 and λ > 0.
This result agrees with the known fact that the usual (Ferreri) hypoentropy is increasing as a function of λ.
Closing this subsection, we give a q-extended version for Proposition 2.

Proposition 6

Let pmax ≡ max{p(x1), ···, p(xn)}. Then, we have the following inequality.
$$H_{\lambda,q}(X) \le \frac{h(\lambda,q)}{\lambda}\left\{(1+\lambda)^q - (1+\lambda p_{max})^q\right\}\ln_q(1+\lambda)$$
for all λ > 0 and q ≥ 0.

Proof

From the “lnq-sum” inequality, we have Dλ,q(X||Y ) ≥ 0. Since λ > 0, we have:
$$-\sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)\ln_q\frac{1+\lambda p(y_i)}{1+\lambda p(x_i)} \ge 0,$$
which is equivalent to:
$$\sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)^q\left\{\ln_q\left(1+\lambda p(x_i)\right) - \ln_q\left(1+\lambda p(y_i)\right)\right\} \ge 0.$$
Thus, we have:
$$-\sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)^q\ln_q\left(1+\lambda p(x_i)\right) \le -\sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)^q\ln_q\left(1+\lambda p(y_i)\right),$$
which extends the result given from the inequality (30). For arbitrarily fixed k, we set p(yk) = 1 (and p(yi) = 0 for i ≠ k) in the above inequality; then, we have:
$$-\sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)^q\ln_q\left(1+\lambda p(x_i)\right) \le -\left(1+\lambda p(x_k)\right)^q\ln_q(1+\lambda).$$
Since $x^q\ln_q x = -x\ln_q\frac{1}{x}$, we have:
$$\sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)\ln_q\frac{1}{1+\lambda p(x_i)} \le -\left(1+\lambda p(x_k)\right)^q\ln_q(1+\lambda).$$
Multiplying both sides by $\frac{h(\lambda,q)}{\lambda} > 0$ and then adding
$$-\frac{h(\lambda,q)}{\lambda}(1+\lambda)\ln_q\frac{1}{1+\lambda} = \frac{h(\lambda,q)}{\lambda}(1+\lambda)^q\ln_q(1+\lambda)$$
to both sides, we have:
$$H_{\lambda,q}(X) \le \frac{h(\lambda,q)}{\lambda}\left\{(1+\lambda)^q - \left(1+\lambda p(x_k)\right)^q\right\}\ln_q(1+\lambda).$$
Since k is arbitrary, we have this proposition.
Letting q → 1 in the above proposition, we recover Proposition 2.

4. The Subadditivities of the Tsallis Hypoentropies

Throughout this section, we assume |X| = n, |Y | = m, |Z| = l. We define the joint Tsallis hypoentropy at the level λ by:
$$H_{\lambda,q}(X,Y) \equiv \frac{h(\lambda,q)}{\lambda}\left\{-(1+\lambda)\ln_q\frac{1}{1+\lambda} + \sum_{i=1}^{n}\sum_{j=1}^{m}\left(1+\lambda p(x_i,y_j)\right)\ln_q\frac{1}{1+\lambda p(x_i,y_j)}\right\}.$$
Note that Hλ,q(X, Y ) = Hλ,q(Y, X).
For all i = 1, ···, n for which p(xi) ≠ 0, we define the Tsallis hypoentropy of Y given X = xi, at the level λp(xi), by:
$$H_{\lambda p(x_i),q}(Y|x_i) \equiv \frac{h(\lambda p(x_i),q)}{\lambda p(x_i)}\left\{-\left(1+\lambda p(x_i)\right)\ln_q\frac{1}{1+\lambda p(x_i)} + \sum_{j=1}^{m}\left(1+\lambda p(x_i)p(y_j|x_i)\right)\ln_q\frac{1}{1+\lambda p(x_i)p(y_j|x_i)}\right\} = \frac{h(\lambda p(x_i),q)}{\lambda p(x_i)}\left\{-\left(1+\lambda p(x_i)\right)\ln_q\frac{1}{1+\lambda p(x_i)} + \sum_{j=1}^{m}\left(1+\lambda p(x_i,y_j)\right)\ln_q\frac{1}{1+\lambda p(x_i,y_j)}\right\}.$$
For n = 1, this coincides with the hypoentropy Hλ,q(Y ). As for the particular case m = 1, we get Hλp(xi),q(Y |xi) = 0.

Definition 3

The Tsallis conditional hypoentropy at the level λ is defined by:
$$H_{\lambda,q}(Y|X) \equiv \sum_{i=1}^{n} p(x_i)^q H_{\lambda p(x_i),q}(Y|x_i).$$
(As a usual convention, the corresponding summand is defined as zero, if p(xi) = 0.)
Throughout this section, we consider the particular function h(λ, q) = λ1−q for λ > 0, q ≥ 0.

Lemma 4

We assume h(λ, q) = λ1−q. The chain rule for the Tsallis hypoentropy holds:
$$H_{\lambda,q}(X,Y) = H_{\lambda,q}(X) + H_{\lambda,q}(Y|X).$$

Proof

The proof is done by straightforward computation as follows.
$$\begin{aligned} H_{\lambda,q}(X) + H_{\lambda,q}(Y|X) &= \frac{\lambda^{1-q}}{\lambda}\left\{-(1+\lambda)\ln_q\frac{1}{1+\lambda} + \sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)\ln_q\frac{1}{1+\lambda p(x_i)}\right\}\\ &\quad + \sum_{i=1}^{n}\frac{(\lambda p(x_i))^{1-q}}{\lambda p(x_i)}\,p(x_i)^q\left\{-\left(1+\lambda p(x_i)\right)\ln_q\frac{1}{1+\lambda p(x_i)} + \sum_{j=1}^{m}\left(1+\lambda p(x_i,y_j)\right)\ln_q\frac{1}{1+\lambda p(x_i,y_j)}\right\}\\ &= \frac{\lambda^{1-q}}{\lambda}\left\{-(1+\lambda)\ln_q\frac{1}{1+\lambda} + \sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)\ln_q\frac{1}{1+\lambda p(x_i)}\right\}\\ &\quad + \frac{\lambda^{1-q}}{\lambda}\left\{-\sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)\ln_q\frac{1}{1+\lambda p(x_i)} + \sum_{i=1}^{n}\sum_{j=1}^{m}\left(1+\lambda p(x_i,y_j)\right)\ln_q\frac{1}{1+\lambda p(x_i,y_j)}\right\}\\ &= \frac{\lambda^{1-q}}{\lambda}\left\{-(1+\lambda)\ln_q\frac{1}{1+\lambda} + \sum_{i=1}^{n}\sum_{j=1}^{m}\left(1+\lambda p(x_i,y_j)\right)\ln_q\frac{1}{1+\lambda p(x_i,y_j)}\right\} = H_{\lambda,q}(X,Y). \end{aligned}$$
In the limit λ → ∞, the identity (66) becomes Tq(X, Y) = Tq(X) + Tq(Y|X), where $T_q(Y|X) \equiv \sum_{i=1}^{n} p(x_i)^q T_q(Y|x_i) = -\sum_{i=1}^{n}\sum_{j=1}^{m} p(x_i,y_j)^q\ln_q p(y_j|x_i)$ is the Tsallis conditional entropy and $T_q(X,Y) \equiv \sum_{i=1}^{n}\sum_{j=1}^{m} p(x_i,y_j)\ln_q\frac{1}{p(x_i,y_j)}$ is the Tsallis joint entropy (see also p. 3 in [17]).
In the limit q → 1 in Lemma 4, we also obtain the identity Fλ(X, Y ) = Fλ(X) + Fλ(Y |X), which naturally leads to the definition of Fλ(Y |X) as conditional hypoentropy.
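The chain rule of Lemma 4 can also be checked numerically; the sketch below (ours) uses an arbitrary joint distribution and h(λ, q) = λ^(1−q), evaluating both sides of Equation (66).

```python
# Numerical verification of the chain rule H_{lam,q}(X,Y) = H_{lam,q}(X) + H_{lam,q}(Y|X).
import numpy as np

def ln_q(x, q):
    return np.log(x) if q == 1 else (x**(1 - q) - 1) / (1 - q)

def H(p, lam, q):
    """Tsallis hypoentropy of a (possibly joint, flattened) distribution, h = lam**(1-q)."""
    p = np.ravel(np.asarray(p, dtype=float))
    return lam**(1 - q) / lam * (-(1 + lam) * ln_q(1 / (1 + lam), q)
                                 + np.sum((1 + lam * p) * ln_q(1 / (1 + lam * p), q)))

pxy = np.array([[0.10, 0.25, 0.05],
                [0.20, 0.10, 0.30]])      # arbitrary joint distribution of (X, Y)
lam, q = 1.5, 1.3
px = pxy.sum(axis=1)                      # marginal distribution of X
# Conditional hypoentropy H_{lam,q}(Y|X) = sum_i p(x_i)^q H_{lam p(x_i), q}(Y|x_i)
H_Y_given_X = sum(px[i]**q * H(pxy[i] / px[i], lam * px[i], q) for i in range(len(px)))
print(H(pxy, lam, q), H(px, lam, q) + H_Y_given_X)   # the two sides agree
```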
In order to obtain the subadditivity for the Tsallis hypoentropy, we prove the monotonicity in λ of the Tsallis hypoentropy.

Lemma 5

We assume h(λ, q) = λ1−q. The Tsallis hypoentropy Hλ,q(X) is a monotonically increasing function of λ > 0 when 0 ≤ q ≤ 2 and a monotonically decreasing function of λ > 0 when q ≥ 2 (or q ≤ 0).

Proof

Note that:
$$H_{\lambda,q}(X) = \sum_{i=1}^{n} Ln_{\lambda,q}(p(x_i)),$$
where:
$$Ln_{\lambda,q}(x) \equiv \frac{(1+\lambda x)^q - (1+\lambda)^q x + x - 1}{\lambda^q(1-q)}$$
is defined on 0 ≤ x ≤ 1 and λ > 0. Then, we have:
$$\frac{dH_{\lambda,q}(X)}{d\lambda} = \sum_{i=1}^{n}\frac{dLn_{\lambda,q}(p(x_i))}{d\lambda} = \sum_{i=1}^{n} l_{\lambda,q}(p(x_i)),$$
where:
$$l_{\lambda,q}(x) \equiv \frac{q}{\lambda^2(1-q)}\left\{\left(\frac{1}{\lambda}+1\right)^{q-1}x - \left(\frac{1}{\lambda}+x\right)^{q-1} - (x-1)\lambda^{1-q}\right\}$$
is defined on 0 ≤ x ≤ 1 and λ > 0. By elementary computations, we obtain:
$$\frac{d^2 l_{\lambda,q}(x)}{dx^2} = q(q-2)\lambda^{1-q}(1+\lambda x)^{q-3}.$$
Since this second derivative is nonpositive for 0 ≤ q ≤ 2 and nonnegative for q ≥ 2 (or q ≤ 0), and since we have lλ,q(0) = lλ,q(1) = 0, we find that lλ,q(x) ≥ 0 for 0 ≤ q ≤ 2 and any λ > 0, and that lλ,q(x) ≤ 0 for q ≥ 2 (or q ≤ 0) and any λ > 0. Therefore, we have $\frac{dH_{\lambda,q}(X)}{d\lambda} \ge 0$ when 0 ≤ q ≤ 2, and $\frac{dH_{\lambda,q}(X)}{d\lambda} \le 0$ when q ≥ 2 (or q ≤ 0).
This result also agrees with the known fact that the usual (Ferreri) hypoentropy is increasing as a function of λ.
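A quick numerical illustration of Lemma 5 (our sketch, with an arbitrary distribution): sampling H_{λ,q}(X) on a grid of λ values shows the monotone increase for 0 ≤ q ≤ 2 and the monotone decrease for q > 2 when h(λ, q) = λ^(1−q).

```python
# Monotonicity of the Tsallis hypoentropy in lambda for h(lambda, q) = lambda**(1-q).
import numpy as np

def ln_q(x, q):
    return np.log(x) if q == 1 else (x**(1 - q) - 1) / (1 - q)

def H(p, lam, q):
    p = np.asarray(p, dtype=float)
    return lam**(1 - q) / lam * (-(1 + lam) * ln_q(1 / (1 + lam), q)
                                 + np.sum((1 + lam * p) * ln_q(1 / (1 + lam * p), q)))

p = [0.5, 0.3, 0.2]
lams = np.linspace(0.1, 20, 200)
for q in (0.5, 1.5, 2.5):
    values = np.array([H(p, lam, q) for lam in lams])
    diffs = np.diff(values)
    print(q, "nondecreasing:", bool(np.all(diffs >= 0)), "nonincreasing:", bool(np.all(diffs <= 0)))
```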

Theorem 1

We assume h(λ, q) = λ1−q. It holds Hλ,q(Y |X) ≤ Hλ,q(Y ) for 1 ≤ q ≤ 2.

Proof

We prove this theorem by the method used in [18] with Jensen's inequality. We note that Lnλ,q(x) is a nonnegative and concave function in x, when 0 ≤ x ≤ 1, λ > 0 and q ≥ 0. Here, we use the notation for the conditional probability $p(y_j|x_i) = \frac{p(x_i,y_j)}{p(x_i)}$ when p(xi) ≠ 0. By the concavity of Lnλ,q(x), we have:
$$\sum_{i=1}^{n} p(x_i)\,Ln_{\lambda,q}(p(y_j|x_i)) \le Ln_{\lambda,q}\left(\sum_{i=1}^{n} p(x_i)p(y_j|x_i)\right) = Ln_{\lambda,q}\left(\sum_{i=1}^{n} p(x_i,y_j)\right) = Ln_{\lambda,q}(p(y_j)).$$
Summing both sides of the above inequality over j, we have:
$$\sum_{i=1}^{n} p(x_i)\sum_{j=1}^{m} Ln_{\lambda,q}(p(y_j|x_i)) \le \sum_{j=1}^{m} Ln_{\lambda,q}(p(y_j)).$$
Since p(xi)^q ≤ p(xi) for 1 ≤ q ≤ 2 and Lnλ,q(x) ≥ 0 for 0 ≤ x ≤ 1, λ > 0 and q ≥ 0, we have:
$$p(x_i)^q\sum_{j=1}^{m} Ln_{\lambda,q}(p(y_j|x_i)) \le p(x_i)\sum_{j=1}^{m} Ln_{\lambda,q}(p(y_j|x_i)).$$
Summing both sides of the above inequality over i, we have:
$$\sum_{i=1}^{n} p(x_i)^q\sum_{j=1}^{m} Ln_{\lambda,q}(p(y_j|x_i)) \le \sum_{i=1}^{n} p(x_i)\sum_{j=1}^{m} Ln_{\lambda,q}(p(y_j|x_i)).$$
By the two inequalities (74) and (76), we have:
$$\sum_{i=1}^{n} p(x_i)^q\sum_{j=1}^{m} Ln_{\lambda,q}(p(y_j|x_i)) \le \sum_{j=1}^{m} Ln_{\lambda,q}(p(y_j)).$$
Here, we can see that $\sum_{j=1}^{m} Ln_{\lambda,q}(p(y_j|x_i))$ is the Tsallis hypoentropy for fixed xi, and the Tsallis hypoentropy is a monotonically increasing function of λ in the case 1 ≤ q ≤ 2, due to Lemma 5. Thus, we have:
$$\sum_{j=1}^{m} Ln_{\lambda p(x_i),q}(p(y_j|x_i)) \le \sum_{j=1}^{m} Ln_{\lambda,q}(p(y_j|x_i)).$$
By the two inequalities (77) and (78), we finally have:
$$\sum_{i=1}^{n} p(x_i)^q\sum_{j=1}^{m} Ln_{\lambda p(x_i),q}(p(y_j|x_i)) \le \sum_{j=1}^{m} Ln_{\lambda,q}(p(y_j)),$$
which implies (since $p(y_j|x_i) = \frac{p(x_i,y_j)}{p(x_i)}$):
$$\sum_{i=1}^{n} p(x_i)^q H_{\lambda p(x_i),q}(Y|x_i) \le \sum_{j=1}^{m} Ln_{\lambda,q}(p(y_j)),$$
since we have for all fixed xi,
$$H_{\lambda p(x_i),q}(Y|x_i) = \frac{1}{\lambda^q p(x_i)^q}\sum_{j=1}^{m}\left\{-p(y_j|x_i)\left(1+\lambda p(x_i)\right)\ln_q\frac{1}{1+\lambda p(x_i)} + \left(1+\lambda p(x_i)p(y_j|x_i)\right)\ln_q\frac{1}{1+\lambda p(x_i)p(y_j|x_i)}\right\} = \sum_{j=1}^{m} Ln_{\lambda p(x_i),q}(p(y_j|x_i)).$$
Therefore, we have Hλ,q(Y |X) ≤ Hλ,q(Y ).

Corollary 1

We have the following subadditivity for the Tsallis hypoentropies:
$$H_{\lambda,q}(X,Y) \le H_{\lambda,q}(X) + H_{\lambda,q}(Y)$$
in the case h(λ, q) = λ1−q for 1 ≤ q ≤ 2.

Proof

The proof is easily done by Lemma 4 and Theorem 1.
We are now in a position to prove the strong subadditivity for the Tsallis hypoentropies. The strong subadditivity for entropy is one of the interesting subjects in entropy theory [19]. For this purpose, we first give a chain rule for three random variables X, Y and Z.

Lemma 6

We assume h(λ, q) = λ1−q. The following chain rule holds:
$$H_{\lambda,q}(X,Y,Z) = H_{\lambda,q}(X|Y,Z) + H_{\lambda,q}(Y,Z).$$

Proof

The proof can be done following the recipe used in Lemma 4.
$$\begin{aligned} H_{\lambda,q}(X|Y,Z) + H_{\lambda,q}(Y,Z) &= \sum_{j=1}^{m}\sum_{k=1}^{l} p(y_j,z_k)^q\,\frac{1}{(\lambda p(y_j,z_k))^q}\Biggl\{-\left(1+\lambda p(y_j,z_k)\right)\ln_q\frac{1}{1+\lambda p(y_j,z_k)}\\ &\qquad\qquad + \sum_{i=1}^{n}\left(1+\lambda p(y_j,z_k)\frac{p(x_i,y_j,z_k)}{p(y_j,z_k)}\right)\ln_q\frac{1}{1+\lambda p(y_j,z_k)\frac{p(x_i,y_j,z_k)}{p(y_j,z_k)}}\Biggr\}\\ &\quad + \frac{1}{\lambda^q}\left\{-(1+\lambda)\ln_q\frac{1}{1+\lambda} + \sum_{j=1}^{m}\sum_{k=1}^{l}\left(1+\lambda p(y_j,z_k)\right)\ln_q\frac{1}{1+\lambda p(y_j,z_k)}\right\}\\ &= \frac{1}{\lambda^q}\left\{-(1+\lambda)\ln_q\frac{1}{1+\lambda} + \sum_{i=1}^{n}\sum_{j=1}^{m}\sum_{k=1}^{l}\left(1+\lambda p(x_i,y_j,z_k)\right)\ln_q\frac{1}{1+\lambda p(x_i,y_j,z_k)}\right\} = H_{\lambda,q}(X,Y,Z). \end{aligned}$$

Theorem 2

We assume h(λ, q) = λ1−q. The strong subadditivity for the Tsallis hypoentropies,
$$H_{\lambda,q}(X,Y,Z) + H_{\lambda,q}(Z) \le H_{\lambda,q}(X,Z) + H_{\lambda,q}(Y,Z),$$
holds for 1 ≤ q ≤ 2.

Proof

This theorem is proven in a similar way as Theorem 1. By the concavity of the function Lnλp(zk),q(x) in x and by using Jensen’s inequality, we have:
$$\sum_{j=1}^{m} p(y_j|z_k)\,Ln_{\lambda p(z_k),q}(p(x_i|y_j,z_k)) \le Ln_{\lambda p(z_k),q}\left(\sum_{j=1}^{m} p(y_j|z_k)p(x_i|y_j,z_k)\right).$$
Multiplying both sides by p(zk)^q and summing over i and k, we have:
$$\sum_{j=1}^{m}\sum_{k=1}^{l} p(z_k)^q\,p(y_j|z_k)\sum_{i=1}^{n} Ln_{\lambda p(z_k),q}(p(x_i|y_j,z_k)) \le \sum_{k=1}^{l} p(z_k)^q\sum_{i=1}^{n} Ln_{\lambda p(z_k),q}(p(x_i|z_k)),$$
since $\sum_{j=1}^{m} p(y_j|z_k)p(x_i|y_j,z_k) = p(x_i|z_k)$. By p(yj|zk)^q ≤ p(yj|zk) for all j, k and 1 ≤ q ≤ 2, and by the nonnegativity of the function Lnλp(zk),q, we have:
$$p(y_j|z_k)^q\sum_{i=1}^{n} Ln_{\lambda p(z_k),q}(p(x_i|y_j,z_k)) \le p(y_j|z_k)\sum_{i=1}^{n} Ln_{\lambda p(z_k),q}(p(x_i|y_j,z_k)).$$
Multiplying both sides by p(zk)^q and summing over j and k in the above inequality, we have:
$$\sum_{j=1}^{m}\sum_{k=1}^{l} p(z_k)^q\,p(y_j|z_k)^q\sum_{i=1}^{n} Ln_{\lambda p(z_k),q}(p(x_i|y_j,z_k)) \le \sum_{j=1}^{m}\sum_{k=1}^{l} p(z_k)^q\,p(y_j|z_k)\sum_{i=1}^{n} Ln_{\lambda p(z_k),q}(p(x_i|y_j,z_k)).$$
From the two inequalities (84) and (85), we have:
$$\sum_{j=1}^{m}\sum_{k=1}^{l} p(z_k)^q\,p(y_j|z_k)^q\sum_{i=1}^{n} Ln_{\lambda p(z_k),q}(p(x_i|y_j,z_k)) \le \sum_{k=1}^{l} p(z_k)^q\sum_{i=1}^{n} Ln_{\lambda p(z_k),q}(p(x_i|z_k)),$$
which implies:
$$\sum_{j=1}^{m}\sum_{k=1}^{l} p(y_j,z_k)^q\sum_{i=1}^{n} Ln_{\lambda p(y_j,z_k),q}(p(x_i|y_j,z_k)) \le \sum_{k=1}^{l} p(z_k)^q\sum_{i=1}^{n} Ln_{\lambda p(z_k),q}(p(x_i|z_k)),$$
since p(yj, zk) ≤ p(zk) for all j and k (because $\sum_{j=1}^{m} p(y_j,z_k) = p(z_k)$) and the function Lnλp(zk),q is monotonically increasing in λp(zk) > 0, when 1 ≤ q ≤ 2. Thus, we have Hλ,q(X|Y, Z) ≤ Hλ,q(X|Z), which is equivalent to the inequality:
$$H_{\lambda,q}(X,Y,Z) - H_{\lambda,q}(Y,Z) \le H_{\lambda,q}(X,Z) - H_{\lambda,q}(Z)$$
by Lemmas 4 and 6.
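As a sanity check (ours, not part of the paper), the strong subadditivity of Theorem 2 can be tested numerically on randomly generated joint distributions with h(λ, q) = λ^(1−q) and an arbitrary q in [1, 2]:

```python
# Randomized spot check of the strong subadditivity H(X,Y,Z) + H(Z) <= H(X,Z) + H(Y,Z).
import numpy as np

def ln_q(x, q):
    return np.log(x) if q == 1 else (x**(1 - q) - 1) / (1 - q)

def H(p, lam, q):
    p = np.ravel(np.asarray(p, dtype=float))
    return lam**(1 - q) / lam * (-(1 + lam) * ln_q(1 / (1 + lam), q)
                                 + np.sum((1 + lam * p) * ln_q(1 / (1 + lam * p), q)))

rng = np.random.default_rng(0)
lam, q = 2.0, 1.5
for _ in range(1000):
    pxyz = rng.random((2, 3, 2))
    pxyz /= pxyz.sum()                       # random joint distribution of (X, Y, Z)
    lhs = H(pxyz, lam, q) + H(pxyz.sum(axis=(0, 1)), lam, q)           # H(X,Y,Z) + H(Z)
    rhs = H(pxyz.sum(axis=1), lam, q) + H(pxyz.sum(axis=0), lam, q)    # H(X,Z) + H(Y,Z)
    assert lhs <= rhs + 1e-12
print("strong subadditivity held on all sampled distributions")
```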

Remark 5

Passing to the limit λ → ∞ in Corollary 1 and Theorem 2, we recover the subadditivity and the strong subadditivity [20] for the Tsallis entropy:
$$T_q(X,Y) \le T_q(X) + T_q(Y)\qquad (q\ge 1)$$
and:
$$T_q(X,Y,Z) + T_q(Z) \le T_q(X,Z) + T_q(Y,Z)\qquad (q\ge 1).$$
Thanks to the subadditivities, we may define the Tsallis mutual hypoentropies for 1 ≤ q ≤ 2 and λ > 0.

Definition 4

Let 1 ≤ q ≤ 2 and λ > 0. The Tsallis mutual hypoentropy is defined by:
$$I_{\lambda,q}(X;Y) \equiv H_{\lambda,q}(X) - H_{\lambda,q}(X|Y)$$
and the Tsallis conditional mutual hypoentropy is defined by:
$$I_{\lambda,q}(X;Y|Z) \equiv H_{\lambda,q}(X|Z) - H_{\lambda,q}(X|Y,Z).$$
From the chain rule given in Lemma 4, we find that the Tsallis mutual hypoentropy is symmetric, that is,
$$I_{\lambda,q}(X;Y) \equiv H_{\lambda,q}(X) - H_{\lambda,q}(X|Y) = H_{\lambda,q}(X) + H_{\lambda,q}(Y) - H_{\lambda,q}(X,Y) = H_{\lambda,q}(Y) - H_{\lambda,q}(Y|X) = I_{\lambda,q}(Y;X).$$
In addition, we have:
$$0 \le I_{\lambda,q}(X;Y) \le \min\left\{H_{\lambda,q}(X),\,H_{\lambda,q}(Y)\right\}$$
from the subadditivity given in Theorem 1 and the nonnegativity of the Tsallis conditional hypoentropy. We also find Iλ,q(X; Y|Z) ≥ 0 from the strong subadditivity given in Theorem 2.
Moreover, we have the chain rule for the Tsallis mutual hypoentropies in the following.
$$I_{\lambda,q}(X;Y|Z) = H_{\lambda,q}(X|Z) - H_{\lambda,q}(X|Y,Z) = H_{\lambda,q}(X|Z) - H_{\lambda,q}(X) + H_{\lambda,q}(X) - H_{\lambda,q}(X|Y,Z) = -I_{\lambda,q}(X;Z) + I_{\lambda,q}(X;Y,Z).$$
From the strong subadditivity, we have Hλ,q(X|Y, Z) ≤ Hλ,q(X|Z); thus, we have:
$$I_{\lambda,q}(X;Z) \le I_{\lambda,q}(X;Y,Z)$$
for 1 ≤ q ≤ 2 and λ > 0.

5. Jeffreys and Jensen–Shannon Hypodivergences

In what follows, we indicate extensions of two known information measures.

Definition 5 ([21,22])

The Jeffreys divergence is defined by:
$$J(X\|Y) \equiv D(X\|Y) + D(Y\|X)$$
and the Jensen–Shannon divergence is defined as:
$$JS(X\|Y) \equiv \frac{1}{2}\left\{D\!\left(X\Big\|\frac{X+Y}{2}\right) + D\!\left(Y\Big\|\frac{X+Y}{2}\right)\right\} = H\!\left(\frac{X+Y}{2}\right) - \frac{1}{2}\left(H(X)+H(Y)\right).$$
The Jensen–Shannon divergence was introduced in 1991 in [23], but its roots may be older, since analogous formulae were used in thermodynamics under the name entropy of mixing (p. 598 in [24]) for the study of gaseous, liquid or crystalline mixtures.
Jeffreys and Jensen–Shannon divergences have been extended to the context of Tsallis theory in [25]:

Definition 6

The Jeffreys–Tsallis divergence is:
$$J_q(X\|Y) \equiv S_q(X\|Y) + S_q(Y\|X)$$
and the Jensen–Shannon–Tsallis divergence is:
$$JS_q(X\|Y) \equiv \frac{1}{2}\left\{S_q\!\left(X\Big\|\frac{X+Y}{2}\right) + S_q\!\left(Y\Big\|\frac{X+Y}{2}\right)\right\}.$$
Note that:
$$JS_q(X\|Y) \ne T_q\!\left(\frac{X+Y}{2}\right) - \frac{1}{2}\left(T_q(X)+T_q(Y)\right).$$
This expression was used in [26] as Jensen–Tsallis divergence.
In accordance with the above definition, we define the directed Jeffreys and Jensen–Shannon q-hypodivergence measures between two distributions and emphasize the mathematical significance of our definitions.

Definition 7

The Jeffreys–Tsallis hypodivergence is:
$$J_{\lambda,q}(X\|Y) \equiv D_{\lambda,q}(X\|Y) + D_{\lambda,q}(Y\|X)$$
and the Jensen–Shannon–Tsallis hypodivergence is:
$$JS_{\lambda,q}(X\|Y) \equiv \frac{1}{2}\left\{D_{\lambda,q}\!\left(X\Big\|\frac{X+Y}{2}\right) + D_{\lambda,q}\!\left(Y\Big\|\frac{X+Y}{2}\right)\right\}.$$
Here, we point out that again, one has:
$$JS_\lambda(X\|Y) = \frac{1}{2}K_\lambda\!\left(X\Big\|\frac{X+Y}{2}\right) + \frac{1}{2}K_\lambda\!\left(Y\Big\|\frac{X+Y}{2}\right) = F_\lambda\!\left(\frac{X+Y}{2}\right) - \frac{1}{2}\left(F_\lambda(X)+F_\lambda(Y)\right),$$
where:
$$JS_\lambda(X\|Y) \equiv \lim_{q\to 1} JS_{\lambda,q}(X\|Y).$$

Lemma 7

The following inequality holds:
$$D_{\lambda,q}\!\left(X\Big\|\frac{X+Y}{2}\right) \le \frac{1}{2}D_{\lambda,\frac{1+q}{2}}(X\|Y)$$
for q ≥ 0 and λ > 0.

Proof

Using the inequality between the arithmetic and geometric mean, one has:
$$\begin{aligned} D_{\lambda,q}\!\left(X\Big\|\frac{X+Y}{2}\right) &= -\frac{1}{\lambda}\sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)\ln_q\frac{\frac{\left(1+\lambda p(x_i)\right)+\left(1+\lambda p(y_i)\right)}{2}}{1+\lambda p(x_i)}\\ &\le -\frac{1}{\lambda}\sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)\ln_q\sqrt{\frac{1+\lambda p(y_i)}{1+\lambda p(x_i)}}\\ &= -\frac{1}{2\lambda}\sum_{i=1}^{n}\left(1+\lambda p(x_i)\right)\frac{\left(\frac{1+\lambda p(y_i)}{1+\lambda p(x_i)}\right)^{1-\frac{1+q}{2}}-1}{1-\frac{1+q}{2}}\\ &= \frac{1}{2}D_{\lambda,\frac{1+q}{2}}(X\|Y). \end{aligned}$$
Thus, the proof is completed.
In the limit λ → ∞, Lemma 7 recovers Lemma 3.4 in [25].

Lemma 8 ([25])

The function:
$$f(x) = -\ln_r\frac{1+\exp_q x}{2}$$
is concave for 0 ≤ r ≤ q.
The next two results of the present paper are stated in order to establish the counterpart of Theorem 3.5 in [25] for hypodivergences.

Proposition 7

It holds:
$$JS_{\lambda,q}(X\|Y) \le \frac{1}{4}J_{\lambda,\frac{1+q}{2}}(X\|Y)$$
for q ≥ 0 and λ > 0.

Proof

By the use of Lemma 7, one has:
$$2\,JS_{\lambda,q}(X\|Y) = D_{\lambda,q}\!\left(X\Big\|\frac{X+Y}{2}\right) + D_{\lambda,q}\!\left(Y\Big\|\frac{X+Y}{2}\right) \le \frac{1}{2}D_{\lambda,\frac{1+q}{2}}(X\|Y) + \frac{1}{2}D_{\lambda,\frac{1+q}{2}}(Y\|X) = \frac{1}{2}J_{\lambda,\frac{1+q}{2}}(X\|Y).$$
This completes the proof.
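Proposition 7 can be spot-checked numerically; the sketch below (ours) samples random pairs of distributions and verifies JS_{λ,q}(X||Y) ≤ (1/4)J_{λ,(1+q)/2}(X||Y) for an arbitrary choice of λ and q.

```python
# Randomized check of Proposition 7.
import numpy as np

def ln_q(x, q):
    return np.log(x) if q == 1 else (x**(1 - q) - 1) / (1 - q)

def D(p, r, lam, q):        # Tsallis hypodivergence of Definition 1
    p, r = np.asarray(p, dtype=float), np.asarray(r, dtype=float)
    return -np.sum((1 + lam * p) * ln_q((1 + lam * r) / (1 + lam * p), q)) / lam

def JS(p, r, lam, q):       # Jensen-Shannon-Tsallis hypodivergence
    m = (np.asarray(p, dtype=float) + np.asarray(r, dtype=float)) / 2
    return 0.5 * (D(p, m, lam, q) + D(r, m, lam, q))

def J(p, r, lam, q):        # Jeffreys-Tsallis hypodivergence
    return D(p, r, lam, q) + D(r, p, lam, q)

rng = np.random.default_rng(1)
lam, q = 3.0, 0.7
for _ in range(1000):
    p, r = rng.random(4), rng.random(4)
    p, r = p / p.sum(), r / r.sum()
    assert JS(p, r, lam, q) <= 0.25 * J(p, r, lam, (1 + q) / 2) + 1e-12
print("Proposition 7 held on all sampled pairs")
```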

Proposition 8

It holds that:
$$JS_{\lambda,r}(X\|Y) \le -\frac{n+\lambda}{\lambda}\ln_r\frac{1+\exp_q\!\left(-\frac{1}{2}\cdot\frac{\lambda}{n+\lambda}\cdot J_{\lambda,q}(X\|Y)\right)}{2}$$
for 0 ≤ r ≤ q and λ > 0.

Proof

According to Lemma 8,
$$\begin{aligned} JS_{\lambda,r}(X\|Y) &= -\frac{n+\lambda}{2\lambda}\left\{\sum_{i=1}^{n}\frac{1+\lambda p(x_i)}{n+\lambda}\ln_r\frac{1+\exp_q\ln_q\!\left(\frac{1+\lambda p(y_i)}{1+\lambda p(x_i)}\right)}{2} + \sum_{i=1}^{n}\frac{1+\lambda p(y_i)}{n+\lambda}\ln_r\frac{1+\exp_q\ln_q\!\left(\frac{1+\lambda p(x_i)}{1+\lambda p(y_i)}\right)}{2}\right\}\\ &\le -\frac{n+\lambda}{2\lambda}\left\{\ln_r\frac{1+\exp_q\!\left(\sum_{i=1}^{n}\frac{1+\lambda p(x_i)}{n+\lambda}\ln_q\frac{1+\lambda p(y_i)}{1+\lambda p(x_i)}\right)}{2} + \ln_r\frac{1+\exp_q\!\left(\sum_{i=1}^{n}\frac{1+\lambda p(y_i)}{n+\lambda}\ln_q\frac{1+\lambda p(x_i)}{1+\lambda p(y_i)}\right)}{2}\right\}\\ &= -\frac{n+\lambda}{2\lambda}\left\{\ln_r\frac{1+\exp_q\!\left(-\frac{\lambda}{n+\lambda}D_{\lambda,q}(X\|Y)\right)}{2} + \ln_r\frac{1+\exp_q\!\left(-\frac{\lambda}{n+\lambda}D_{\lambda,q}(Y\|X)\right)}{2}\right\}. \end{aligned}$$
Then:
$$JS_{\lambda,r}(X\|Y) \le -\frac{n+\lambda}{\lambda}\ln_r\frac{1+\exp_q\!\left(-\frac{\lambda}{n+\lambda}\cdot\frac{D_{\lambda,q}(X\|Y)+D_{\lambda,q}(Y\|X)}{2}\right)}{2} = -\frac{n+\lambda}{\lambda}\ln_r\frac{1+\exp_q\!\left(-\frac{1}{2}\cdot\frac{\lambda}{n+\lambda}\cdot J_{\lambda,q}(X\|Y)\right)}{2}.$$
Thus, the proof is completed.
We further define the dual symmetric hypodivergences.

Definition 8

The dual symmetric Jeffreys–Tsallis hypodivergence is defined by:
$$J_{\lambda,q}^{(ds)}(X\|Y) \equiv D_{\lambda,q}(X\|Y) + D_{\lambda,2-q}(Y\|X)$$
and the dual symmetric Jensen–Shannon–Tsallis hypodivergence is defined by:
$$JS_{\lambda,q}^{(ds)}(X\|Y) \equiv \frac{1}{2}\left\{D_{\lambda,q}\!\left(X\Big\|\frac{X+Y}{2}\right) + D_{\lambda,2-q}\!\left(Y\Big\|\frac{X+Y}{2}\right)\right\}.$$
Using Lemma 7, we have the following inequality.

Proposition 9

It holds:
$$JS_{\lambda,q}^{(ds)}(X\|Y) \le \frac{1}{4}J_{\lambda,\frac{1+q}{2}}^{(ds)}(X\|Y)$$
for 0 ≤ q ≤ 2 and λ > 0.
In addition, we have the following inequality.

Proposition 10

It holds:
$$JS_{\lambda,q}^{(ds)}(X\|Y) \le -\frac{n+\lambda}{\lambda}\ln_r\frac{1+\exp_q\!\left(-\frac{\lambda}{2(n+\lambda)}J_{\lambda,q}(X\|Y)\right)}{2}$$
for 1 < r ≤ 2, r ≤ q and λ > 0.

Proof

The proof can be done by calculations similar to those of Proposition 8, applying the facts (see Lemmas 3.9 and 3.10 in [25]) that expq(x) is a monotonically increasing function of q for x ≥ 0 and that the inequality −ln2−r x ≤ −lnr x holds for 1 < r ≤ 2 and x > 0.

6. Concluding Remarks

In this paper, we introduced the Tsallis hypoentropy Hλ,q(X) and studied some properties of Hλ,q(X). We named Hλ,q(X) Tsallis hypoentropy because of the relation Hλ,q(X) ≤ Tq(X), which follows from the monotonicity in λ given in Proposition 5 and Lemma 5 for the case h(λ, q) = (1 + λ)1−q and the case h(λ, q) = λ1−q, respectively (this relation can be also proven directly). In this naming, we follow Ferreri, as he has termed Fλ(X) hypoentropy due to the relation Fλ(X) ≤ H(X).
The monotonicity of the hypoentropy and of the Tsallis hypoentropy in λ > 0 is indeed an interesting feature. It is also worth examining the monotonicity of the Tsallis entropy with respect to the parameter q ≥ 0. We find that the Tsallis entropy Tq(X) is monotonically decreasing with respect to q ≥ 0. Indeed, we find $\frac{dT_q(X)}{dq} = \frac{\sum_{j=1}^{n} p_j^q v_q(p_j)}{(1-q)^2}$, where $v_q(x) \equiv 1 - x^{1-q} + (1-q)\log x$ (0 ≤ x ≤ 1). Since $x^q v_q(x) = 0$ for x = 0 and q > 0, we prove vq(x) ≤ 0 for 0 < x ≤ 1. We find $\frac{dv_q(x)}{dx} = \frac{(1-q)(1-x^{1-q})}{x} \ge 0$ when 0 < x ≤ 1; thus, we have vq(x) ≤ vq(1) = 0, which implies $\frac{dT_q(X)}{dq} \le 0$. This monotonicity implies the relations H(X) ≤ Tq(X) for 0 ≤ q < 1 and Tq(X) ≤ H(X) for q > 1 (these relations also follow from the inequalities $\log\frac{1}{x} \le \ln_q\frac{1}{x}$ for 0 ≤ q < 1, x > 0 and $\log\frac{1}{x} \ge \ln_q\frac{1}{x}$ for q > 1, x > 0).
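This monotone decrease in q, and the resulting position of the Shannon entropy between the Tsallis entropies with q < 1 and q > 1, can be illustrated numerically (our sketch, with an arbitrary distribution):

```python
# Numerical illustration of the monotone decrease of T_q(X) in q.
import numpy as np

def ln_q(x, q):
    return np.log(x) if q == 1 else (x**(1 - q) - 1) / (1 - q)

def tsallis_entropy(p, q):
    p = np.asarray(p, dtype=float)
    return np.sum(p * ln_q(1 / p, q))

p = [0.5, 0.3, 0.2]
H_shannon = tsallis_entropy(p, 1)
qs = np.linspace(0.0, 3.0, 300)            # this grid avoids q = 1 exactly
Tq = np.array([tsallis_entropy(p, q) for q in qs])
print("monotonically decreasing in q:", bool(np.all(np.diff(Tq) <= 0)))
print("H <= T_q for q < 1:", bool(np.all(Tq[qs < 1] >= H_shannon)))
print("T_q <= H for q > 1:", bool(np.all(Tq[qs > 1] <= H_shannon)))
```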
As other important results, we also gave the chain rules, the subadditivity and the strong subadditivity of the Tsallis hypoentropies in the case of h(λ, q) = λ1−q. For the case of h(λ, q) = (1+λ)1−q, we can prove Hλ,q(Y|X) ≤ Hλ,q(Y) and Hλ,q(X|Y, Z) ≤ Hλ,q(X|Z) for 1 ≤ q ≤ 2 in a similar way to the proofs of Theorems 1 and 2, since the function Snλ,q(x) defined in the proof of Proposition 5 is also nonnegative and concave in x ∈ [0, 1] and monotonically increasing in λ, and we have $H_{\lambda p(x_i),q}(Y|x_i) = \sum_{j=1}^{m} Sn_{\lambda p(x_i),q}(p(y_j|x_i))$ for all fixed xi. However, we cannot obtain the inequalities:
$$H_{\lambda,q}(X,Y) \le H_{\lambda,q}(X) + H_{\lambda,q}(Y)\qquad (1\le q\le 2),$$
$$H_{\lambda,q}(X,Y,Z) + H_{\lambda,q}(Z) \le H_{\lambda,q}(X,Z) + H_{\lambda,q}(Y,Z)\qquad (1\le q\le 2)$$
for h(λ, q) = (1 + λ)1−q, because a similar proof of the chain rules does not work in this case.

Acknowledgments

The first author was partially supported by JSPS KAKENHI Grant Number 24540146.

Author Contributions

The work presented here was carried out in collaboration between all authors. The study was initiated by the second author. The first author played also the role of the corresponding author. All authors contributed equally and significantly in writing this article. All authors have read and approved the final manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Boltzmann, L.E. Einige allgemeine Sätze über Wärmegleichgewicht. Wiener Berichte 1871, 63, 679–711.
  2. Gibbs, J.W. Elementary Principles in Statistical Mechanics—Developed with Especial Reference to the Rational Foundation of Thermodynamics; Charles Scribner’s Sons: New York, NY, USA, 1902.
  3. Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J 1948, 27.
  4. Hartley, R.V.L. Transmission of Information. Bell Syst. Tech. J 1928, 7, 535–563.
  5. Kullback, S.; Leibler, R.A. On information and sufficiency. Ann. Math. Stat 1951, 22, 79–86.
  6. Rényi, A. On measures of information and entropy. Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, Berkeley, CA, USA, 20 June–30 July 1960; University of California Press: Berkeley, CA, USA, 1961; pp. 547–561.
  7. Havrda, J.; Charvat, F. Quantification methods of classification processes: Concept of structural alpha-entropy. Kybernetika 1967, 3, 30–35.
  8. Aczél, J.; Daróczy, Z. On Measures of Information and Their Characterizations; Academic Press: New York, NY, USA, 1975.
  9. Ferreri, C. Hypoentropy and related heterogeneity, divergence and information measures. Statistica 1980, 2, 155–167.
  10. Tsallis, C. Possible generalization of Boltzmann–Gibbs statistics. J. Stat. Phys 1988, 52, 479–487.
  11. Tsallis, C. Generalized entropy-based criterion for consistent testing. Phys. Rev. E 1998, 58, 1442–1445.
  12. Borland, L.; Plastino, A.R.; Tsallis, C. Information gain within nonextensive thermostatistics. J. Math. Phys 1998, 39, 6490–6501.
  13. Gilardoni, G. On Pinsker’s and Vajda’s type inequalities for Csiszár’s f-divergence. IEEE Trans. Inf. Theory 2010, 56, 5377–5386.
  14. Rastegin, A.E. Bounds of the Pinsker and Fannes types on the Tsallis relative entropy. Math. Phys. Anal. Geom 2013, 16, 213–228.
  15. Csiszár, I. Information-type measures of difference of probability distributions and indirect observations. Stud. Sci. Math. Hung 1967, 2, 299–318.
  16. Csiszár, I. Axiomatic characterizations of information measures. Entropy 2008, 10, 261–273.
  17. Furuichi, S. On uniqueness theorems for Tsallis entropy and Tsallis relative entropy. IEEE Trans. Inf. Theory 2005, 47, 3638–3645.
  18. Daroczy, Z. Generalized information functions. Inf. Control 1970, 16, 36–51.
  19. Petz, D.; Virosztek, D. Some inequalities for quantum Tsallis entropy related to the strong subadditivity. Math. Inequal. Appl 2014, in press.
  20. Furuichi, S. Information theoretical properties of Tsallis entropies. J. Math. Phys 2006, 47.
  21. Dragomir, S.S.; Šunde, J.; Buşe, C. New inequalities for Jeffreys divergence measure. Tamsui Oxf. J. Math. Sci 2000, 16, 295–309.
  22. Jeffreys, H. An invariant form for the prior probability in estimation problems. Proc. R. Soc. Lond. A 1946, 186, 453–461.
  23. Lin, J. Divergence measures based on the Shannon entropy. IEEE Trans. Inf. Theory 1991, 37, 145–151.
  24. Tolman, R.C. The Principles of Statistical Mechanics; Clarendon Press: London, UK, 1938.
  25. Furuichi, S.; Mitroi, F.-C. Mathematical inequalities for some divergences. Physica A 2012, 391, 388–400.
  26. Hamza, A.B. A nonextensive information-theoretic measure for image edge detection. J. Electron. Imaging 2006, 15, 13011.1–13011.8.
