
Equivalence of Partition Functions Leads to Classification of Entropies and Means

by Michel S. Elnaggar 1,† and Achim Kempf 2,*
1 Department of Applied Mathematics, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
2 Department of Applied Mathematics, University of Waterloo, 200 University Avenue West, Waterloo, Ontario N2L 3G1, Canada
* Author to whom correspondence should be addressed.
† Present address: Bell Mobility Inc., Mississauga, Ontario L4W 5N2, Canada.
Entropy 2012, 14(8), 1317-1342; https://doi.org/10.3390/e14081317
Submission received: 24 April 2012 / Revised: 26 June 2012 / Accepted: 9 July 2012 / Published: 27 July 2012
(This article belongs to the Special Issue Advances in Applied Thermodynamics)

Abstract: We derive a two-parameter family of generalized entropies, S_pq, and means m_pq. To this end, assume that we want to calculate an entropy and a mean for n non-negative real numbers {x_1, …, x_n}. For comparison, we consider {m_1, …, m_k}, where m_i = m for all i = 1, …, k and where m and k are chosen such that the l_p and l_q norms of {x_1, …, x_n} and {m_1, …, m_k} coincide. We formally allow k to be real. Then, we define k, log k, and m to be a generalized cardinality k_pq, a generalized entropy S_pq, and a generalized mean m_pq, respectively. We show that this family of entropies includes the Shannon and Rényi entropies, and that the family of generalized means includes the power means (such as the arithmetic, harmonic, geometric, root-mean-square, maximum, and minimum) as well as novel means of Shannon-like and Rényi-like forms. A thermodynamic interpretation arises from the fact that the l_p norm is closely related to the partition function at inverse temperature β = p. Namely, two systems possess the same generalized entropy and generalized mean energy if and only if their partition functions agree at two temperatures, which is also equivalent to the condition that their Helmholtz free energies agree at these two temperatures.

1. Introduction

Two of the most basic concepts of thermodynamics are: (a) the average of measurement outcomes and (b) the uncertainty, or entropy, about measurement outcomes. Consider, for example, a physical system, A, that is in contact with a heat bath at some fixed temperature, i.e., a canonical ensemble. A measurement of the system's energy can return any one of its energy eigenvalues. What, then, is (a) the mean energy to expect, and (b) how uncertain is the prediction of the measured energy?
We notice that, in principle, many different notions of energy mean and many different measures of entropy could be employed here. Of course, in thermodynamics, the Boltzmann-factor-weighted mean as well as the Shannon/von Neumann entropy are of foremost importance. In this paper, we show that other important notions of average, such as the harmonic mean, the geometric mean and the arithmetic mean, also arise naturally, along with generalized notions of entropy, including the Rényi entropies [1], all unified in a two-parameter family of notions of means and entropies.
To this end, consider systems (canonical ensembles) in a heat bath. We begin by considering the simplest kind of system, namely the type of system which possesses only one energy level, E. Let us denote its degeneracy by k. Unambiguously, we should assign that system the mean m := E and the entropy S := log k. Let us refer to these simple one-level systems as reference systems.
Now, let X be a system with arbitrary discrete energy levels. Our aim is to assign X a mean and an entropy by finding that reference system M which is in some sense equivalent to X. Then we assign X the same value for the mean and entropy as the reference system M.
But how do we decide whether a reference system is in some sense equivalent to system X? Given that we want the reference system M and system X to share two properties, namely a mean and an entropy, we expect any such condition for the equivalence of two systems to require two equations to be fulfilled. Further, since the properties of systems are encoded in their partition functions Z(X), we expect that these two equations can be expressed in terms of the partition functions of the two systems in question.
To this end, let us adopt what may be considered the simplest definition. We choose two temperatures, T1 and T2, and we define that a reference system is (T1, T2)-equivalent to system X if the partition functions of the two systems coincide at these two temperatures. Since the Helmholtz free energy A_H obeys A_H = −K_B T log(Z), where K_B is the Boltzmann constant, this is the same as saying that two systems are put in the same equivalence class if their Helmholtz free energies coincide at these two temperatures.
This now allows us to assign any system X a mean and an entropy. We simply find its unique (T1, T2)-equivalent reference system M. Then the mean and entropy of X are defined to be the mean and the entropy of the reference system M.
Clearly, the so-defined mean and entropy of a system X now actually depend on two temperatures, namely (T1, T2). As we will show below, in the limit when we let the two temperatures become the same temperature, we recover the usual Boltzmann factor-weighted mean, i.e., the usual mean energy, along with the usual Shannon/von Neumann entropy.
For general (T1, T2), however, we obtain more. Namely, we naturally obtain a unifying two-parameter family of notions of mean that includes, for example, the geometric, the harmonic, the arithmetic and the root-mean-square (RMS) means. And we obtain a unifying two-parameter family of notions of entropy that includes, for example, the Rényi family of entropies.
To be precise, let us assume that a system X has only discrete energy levels, {E_i}, where i enumerates all the energy levels, counting also possible degeneracies. Notice that {E_i} is formally what is called a multiset, because its members are allowed to occur more than once. Similarly, let us also collect the exponentials of the negative energies, x_i := exp(−E_i), in the multiset {x_i}. Either multiset can be used to describe the same thermodynamic system X. Let β := 1/(K_B T) denote the inverse temperature, where K_B is the Boltzmann constant. The partition function of system X, i.e., the sum of its Boltzmann factors, then reads:
$$ Z_\beta(X) = \sum_i e^{-\beta E_i} = \sum_i x_i^\beta \qquad (1) $$
For later reference, note that the partition function is therefore related to the l_p norm of X = {x_1, …, x_n} for p = β through $Z_\beta(X) = \|X\|_\beta^\beta$.
Now the key definition is that we call two physical systems (β1, β2)-equivalent if their partition functions coincide at the two inverse temperatures (β1, β2), i.e., systems X and M are (β1, β2)-equivalent if Z_β1(X) = Z_β1(M) and Z_β2(X) = Z_β2(M). To be more explicit, one may also call such systems (β1, β2)-partition-function equivalent, or (β1, β2)-Helmholtz-free-energy equivalent, but we will here use the term (β1, β2)-equivalent for short.
In particular, for any given system X, let us consider the (β1, β2)-equivalent reference system M which possesses just one energy level, with energy E0 and degeneracy k, where we formally allow k to be any positive number. E0 and k are then determined by the two conditions that the partition function of M is to coincide with that of X at the two inverse temperatures β1 and β2. Then, we define S_β1,β2(X) := log k to be the generalized entropy, and m_β1,β2(X) := E0 to be the generalized mean energy of system X with respect to the temperatures (β1, β2); a short numerical sketch follows.
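As an illustration, the following minimal Python sketch (ours, not from the paper; numpy assumed, and the function name reference_system is an illustrative choice) solves the two matching conditions for the reference system of a toy spectrum:

```python
import numpy as np

def reference_system(energies, beta1, beta2):
    """Match Z_beta(M) = k * exp(-beta*E0) to Z_beta(X) at beta1 and beta2."""
    logZ1 = np.log(np.sum(np.exp(-beta1 * energies)))
    logZ2 = np.log(np.sum(np.exp(-beta2 * energies)))
    # The two conditions read: log k - beta*E0 = log Z_beta, at beta1 and beta2.
    E0 = (logZ1 - logZ2) / (beta2 - beta1)   # generalized mean energy
    S = logZ1 + beta1 * E0                   # generalized entropy S = log k
    return E0, S

E = np.array([0.0, 1.0, 1.0, 2.5])           # toy spectrum with one degeneracy
print(reference_system(E, beta1=0.8, beta2=1.2))
# As beta1 -> beta2, this approaches the Boltzmann mean energy and the
# Shannon/von Neumann entropy of the canonical ensemble at that beta.
print(reference_system(E, beta1=0.999, beta2=1.001))
```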
We will explore the properties of these families of generalized entropies and means in the subsequent sections. First, however, let us consider the special limiting case when the two temperatures coincide (i.e., β1 = β2 = β). As will be detailed in the subsequent sections of the manuscript, in this limiting case the two equivalence conditions on the partition functions can be shown to reduce to:
$$ \left(Z_\beta(X)\right)^{1/\beta} = \left(Z_\beta(M)\right)^{1/\beta} \qquad (2a) $$
$$ \frac{\partial}{\partial\beta}\left(Z_\beta(X)\right)^{1/\beta} = \frac{\partial}{\partial\beta}\left(Z_\beta(M)\right)^{1/\beta} \qquad (2b) $$
which can be shown to be equivalent to:
$$ Z_\beta(X) = Z_\beta(M) \qquad (3a) $$
$$ \frac{\sum_i E_i\, e^{-\beta E_i}}{\sum_j e^{-\beta E_j}} = E_0 \qquad (3b) $$
The conditions (3a) and (3b) physically mean that systems X and M have the same partition function and the same average energy, respectively, at the inverse temperature β. Notice that this is also the same as saying that the two systems have the same average energy and the same Helmholtz free energy at the inverse temperature β. By employing either pair of conditions, (2) or (3), we then indeed recover the usual thermodynamic entropy of the system X, which is given in the Shannon form at the inverse temperature β by:
$$ S_{\beta\beta}(X) = -\sum_i \frac{e^{-\beta E_i}}{\sum_j e^{-\beta E_j}}\,\log\!\left(\frac{e^{-\beta E_i}}{\sum_j e^{-\beta E_j}}\right) \qquad (4) $$
The proofs of Equations (2)–(4) are straightforward. We will spell them out in detail in the subsequent sections, where the setting is abstractly mathematical.
Before we begin the mathematical treatment, let us remark that entropy is not only a cornerstone of thermodynamics but is also crucial in information theory. Due to its universal significance, measures of uncertainty in the form of an entropy have been proposed by physicists and mathematicians for over a century [2]. Our approach here for deriving a generalized family of entropies was originally motivated by basic questions regarding the effective dimensionality of multi-antenna systems (e.g., [3,4,5]). After initial attempts in [6] and later in [5,7], we here give for the first time a comprehensive derivation with proofs, and we also include the family of generalized means.
The manuscript is organized as follows. In Section 2, we introduce the proposed family of entropies and means mathematically, and also show some special cases thereof. The axiomatic formulation is presented in Section 3, followed by the study of resulting properties in Section 4. Proofs are provided in the appendices.

2. Mathematical Definition of Generalized Entropies and Means

Let X = {x_i : i = 1, 2, …, K_X} be a multiset of non-negative real numbers, where K_X denotes the cardinality of X. We assume that X possesses at least one non-zero element. Further, let p, q be arbitrary fixed real numbers obeying pq ≥ 0. Let M = {m_i : m_i = m, m > 0, i = 1, 2, …, k} be a reference multiset possessing exactly one real positive element m, of multiplicity k ≥ 1. We introduce the following definitions:
• N_X is the number of non-zero elements of X; therefore N_X ≤ K_X. To simplify notation, the subscript X is omitted when dealing with a single multiset.
• K_X^max is the multiplicity of the maximum element of X.
• K_X^min is the multiplicity of the minimum element of X.
Our objective is to determine suitable values for m and k, possibly non-integer, that can serve as the mean and the effective cardinality of X, respectively, namely by imposing a suitable criterion for the equivalence of X to a reference multiset M. Having two unknowns (m and k) in M, we need two equivalence conditions. We choose to impose the equivalence of the p-norms and the q-norms:
$$ \|X\|_p = \|M\|_p = m\,k^{1/p}, \qquad \|X\|_q = \|M\|_q = m\,k^{1/q} \qquad (5) $$
Here, the p-norm ‖·‖_p is defined as usual through:
$$ \|X\|_p := \left(\sum_{i=1}^{K_X} (x_i)^p\right)^{1/p} \qquad (6) $$
with the proviso that (x_i)^p is replaced by 0 if x_i = 0 and p ≤ 0. We remark that, for p < 1, (6) is merely a quasi-norm, since the triangle inequality does not hold. Note the singularity $\lim_{p\to 0^+} \|X\|_p = \infty$.
Solving for k and m in (5), we obtain:
$$ k_{pq}(X) = \left[\frac{\|X\|_p}{\|X\|_q}\right]^{1/\left(\frac{1}{p}-\frac{1}{q}\right)}, \qquad m_{pq}(X) = \left[\frac{\|X\|_p^p}{\|X\|_q^q}\right]^{\frac{1}{p-q}}, \qquad p \ne q \qquad (7) $$
We call k_pq(X) and m_pq(X) the norm-induced effective cardinality and the generic mean of order p, q of the multiset X, respectively. Let us now express (7) in logarithmic form and define the entropy S_pq(X) as follows:
$$ S_{pq}(X) := \log k_{pq}(X) = \frac{\log\|X\|_p - \log\|X\|_q}{\frac{1}{p} - \frac{1}{q}}, \qquad p \ne q \qquad (8) $$
$$ \log m_{pq}(X) = \frac{p\,\log\|X\|_p - q\,\log\|X\|_q}{p - q}, \qquad p \ne q $$
Notice that S_pq(X) = S_qp(X) and m_pq(X) = m_qp(X), i.e., both the entropy and the mean are symmetric with respect to the orders p, q, as the sketch below illustrates.
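The following minimal Python sketch (ours; the names pnorm, S_pq and m_pq are illustrative, not from the paper) implements (7) and (8) directly from the norms and checks the symmetry numerically, using the example multiset that appears later in Figure 2:

```python
import numpy as np

def pnorm(x, p):
    """p-norm (6); zero elements are discarded, per the proviso after (6)."""
    x = np.asarray(x, dtype=float)
    x = x[x > 0]
    return np.sum(x**p) ** (1.0 / p)

def S_pq(x, p, q):
    """Generalized entropy (8); requires p != q and p, q != 0."""
    return (np.log(pnorm(x, p)) - np.log(pnorm(x, q))) / (1.0/p - 1.0/q)

def m_pq(x, p, q):
    """Generic mean (7); requires p != q and p, q != 0."""
    return (pnorm(x, p)**p / pnorm(x, q)**q) ** (1.0 / (p - q))

X = [10, 9, 8, 7, 6, 0.5, 0.4, 0.3, 0.2, 0.1]   # the multiset of Figure 2
print(S_pq(X, 1, 2), S_pq(X, 2, 1))              # equal: S_pq = S_qp
print(m_pq(X, 1, 2), m_pq(X, 2, 1))              # equal: m_pq = m_qp
```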
Next, we express S_pq(X) and m_pq(X) in the limiting case when p → q. For S_qq(X) we find:
$$ S_{qq}(X) := \log k_{qq}(X) = \lim_{p\to q} S_{pq}(X) = \lim_{p\to q} \frac{\log\|X\|_p - \log\|X\|_q}{\frac{1}{p} - \frac{1}{q}} \overset{\text{l'H\^opital}}{=} -q^2\,\frac{\partial_q\|X\|_q}{\|X\|_q} = -\sum_{i=1}^{K} \frac{x_i^q}{\sum_{j=1}^{K} x_j^q}\,\log\!\left(\frac{x_i^q}{\sum_{j=1}^{K} x_j^q}\right) \qquad (9) $$
where the last step is obtained by straightforward manipulations. Similarly, we find for m_qq(X):
$$ m_{qq}(X) = \lim_{p\to q} m_{pq}(X) = \frac{\|X\|_q}{k_{qq}^{1/q}}, \qquad \log m_{qq}(X) = \lim_{p\to q} \frac{p\,\log\|X\|_p - q\,\log\|X\|_q}{p - q} \overset{\text{l'H\^opital}}{=} \log\|X\|_q + q\,\frac{\partial_q\|X\|_q}{\|X\|_q} = \frac{\partial_q\|X\|_q^q}{\|X\|_q^q} = \sum_{i=1}^{K} \frac{x_i^q}{\sum_{j=1}^{K} x_j^q}\,\log x_i \qquad (10) $$
where in the second-to-last step we used the fact that $\frac{d}{dx}\,g(x)^{f(x)} = g^{f}\left\{\frac{df}{dx}\log g + \frac{f}{g}\frac{dg}{dx}\right\}$, and the last step is obtained by straightforward manipulations.
It is worthwhile to mention the following useful relation linking S_qq(X), ‖X‖_q and m_qq(X), which is readily deduced from (9) and (10) and is verified numerically below:
$$ \log m_{qq}(X) = \log\|X\|_q - S_{qq}(X)/q \qquad (11) $$
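A quick numerical sketch (ours, in Python) of the degenerate case: it evaluates (9) and (10) in their escort-distribution form, checks them against the non-degenerate formulas at p = q + ε, and verifies relation (11):

```python
import numpy as np

def S_qq(x, q):
    """Entropy (9): Shannon entropy of the order-q escort distribution."""
    x = np.asarray(x, dtype=float); x = x[x > 0]
    w = x**q / np.sum(x**q)
    return -np.sum(w * np.log(w))

def m_qq(x, q):
    """Mean (10): exp of the escort-weighted average of log x_i."""
    x = np.asarray(x, dtype=float); x = x[x > 0]
    w = x**q / np.sum(x**q)
    return np.exp(np.sum(w * np.log(x)))

X = np.array([3.0, 1.0, 1.0, 0.5]); q = 2.0; eps = 1e-6
lognorm = lambda r: np.log(np.sum(X**r)) / r              # log ||X||_r
S_limit = (lognorm(q + eps) - lognorm(q)) / (1/(q + eps) - 1/q)   # (8) at p ~ q
print(S_qq(X, q), S_limit)                                # agree to ~1e-6
print(np.log(m_qq(X, q)), lognorm(q) - S_qq(X, q) / q)    # relation (11)
```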
We remark that, in the early phase of this work [5], each author independently suggested either k_pq(X) or k_qq(X) as two possible distinct notions of effective cardinality. In [7], it was reported that the average energy and the Shannon entropy of a thermodynamic system are obtained by starting from the equivalence of the partition functions of two systems at two temperatures and letting the two temperatures coincide, as mentioned in the Introduction. Clearly, the limiting operation in (9) makes the connection and establishes (7) as the general definition of this norm-induced family of entropies and means.
In fact, for the case of degenerate order (p → q), the quantities k_qq(X), S_qq(X) and m_qq(X) could equally have been obtained through a differential equivalence of the q-norm. To see this, we impose the following two conditions:
$$ \|X\|_q = \|M\|_q, \qquad \frac{\partial}{\partial q}\|X\|_q = \frac{\partial}{\partial q}\|M\|_q \qquad (12) $$
After employing the derivative identity $\frac{d}{dx}\,g(x)^{f(x)} = g^{f}\left\{\frac{df}{dx}\log g + \frac{f}{g}\frac{dg}{dx}\right\}$ and solving for k and m, we ultimately obtain k_qq(X) and m_qq(X) as given by (9) and (10). The condition (12) is the mathematical equivalent of the aforementioned physical condition (2) imposed on the two thermodynamic systems, which yielded the Shannon entropy form (4).
From (9), it is obvious that S_qq(X) is the Shannon entropy of the distribution {x_i^q / Σ_j x_j^q}, which is called the escort distribution of order q [8] of {x_i / Σ_j x_j}. On the other hand, S_pq(X) is a more general expression of the Rényi entropy of order α. For a probability distribution P = {p_i}, the Rényi entropy of order α is given by [9]:
$$ S_\alpha^{(R)}(P) := \frac{1}{1-\alpha}\,\log\frac{\sum_i p_i^\alpha}{\sum_i p_i}, \qquad \alpha > 0,\ \alpha \ne 1 \qquad (13) $$
By setting the order p = 1 in S_pq, we obtain from (8):
$$ S_{1q}(X) = \frac{\log\sum_{i=1}^{K} x_i^q - q\,\log\sum_{j=1}^{K} x_j}{1 - q} = \frac{1}{1-q}\,\log\sum_{i=1}^{K}\left(\frac{x_i}{\sum_{j=1}^{K} x_j}\right)^q, \qquad q > 0,\ q \ne 1 \qquad (14) $$
By comparing (13) and (14), we readily identify S_1q(X) as the Rényi entropy of order q for the complete statistical distribution {x_i / Σ_i x_i}, whose elements add to 1. Formally:
$$ S_{1q}(X) = S_q^{(R)}\!\left(\left\{x_i \Big/ \textstyle\sum_{i=1}^{K} x_i\right\}\right) \qquad (15) $$
In the degenerate case (q → 1), S_11(X) is the Shannon entropy of the latter distribution. For p ≠ 1, S_pq(X) from (8) can be rearranged as a generalization of (13):
$$ S_{pq}(X) = \frac{p}{p-q}\,\log\sum_{i=1}^{K}\left(\frac{x_i^p}{\sum_{j=1}^{K} x_j^p}\right)^{q/p} = \frac{1}{1 - q/p}\,\log\sum_{i=1}^{K}\left(\frac{x_i^p}{\sum_{j=1}^{K} x_j^p}\right)^{q/p} = S_{q/p}^{(R)}\!\left(\left\{x_i^p \Big/ \textstyle\sum_i x_i^p\right\}\right) \qquad (16) $$
which can be viewed as the Rényi entropy of order q/p of the pth-order escort distribution {x_i^p / Σ_i x_i^p}; a numerical check follows.
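The following sketch (ours; numpy assumed) verifies the identifications (15) and (16) numerically by computing the Rényi entropy of the normalized and escort distributions and comparing with S_pq:

```python
import numpy as np

def renyi(P, alpha):
    """Rényi entropy (13) of a probability distribution P."""
    P = np.asarray(P, dtype=float)
    return np.log(np.sum(P**alpha) / np.sum(P)) / (1.0 - alpha)

def S_pq(x, p, q):
    x = np.asarray(x, dtype=float)
    lognorm = lambda r: np.log(np.sum(x**r)) / r
    return (lognorm(p) - lognorm(q)) / (1.0/p - 1.0/q)

X = np.array([4.0, 2.0, 1.0, 1.0]); q = 3.0
print(S_pq(X, 1.0, q), renyi(X / X.sum(), q))   # (15): identical values
p = 2.0
escort = X**p / np.sum(X**p)                     # pth-order escort distribution
print(S_pq(X, p, q), renyi(escort, q / p))       # (16): identical values
```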
Rényi defined his entropy for p, q > 0. We relax this condition further and allow S_pq(X) and m_pq(X) to be defined for any real indices p, q such that pq ≥ 0. Accordingly, we obtain the following properties:
$$ S_{-p,-q}(\{x_i\}) = S_{p,q}(\{x_i^{-1} : x_i \ne 0\}), \qquad m_{-p,-q}(\{x_i\}) = m_{p,q}^{-1}(\{x_i^{-1} : x_i \ne 0\}) \qquad (17) $$
When at least one of the orders p, q is zero, we find the interesting results:
$$ k_{0q}(X) = N_X \qquad (18) $$
$$ S_{0q}(X) = \log N_X \qquad (19) $$
$$ m_{0q}(X) = \begin{cases} \left(\dfrac{\sum_{i=1}^{N_X} x_i^q}{N_X}\right)^{1/q}, & x_i \ne 0,\ q \ne 0 \\[2.5ex] \left(\displaystyle\prod_{i=1}^{N_X} x_i\right)^{1/N_X}, & x_i \ne 0,\ q \to 0 \end{cases} \qquad (20) $$
We recognize S_0q(X) in (19) as the Hartley entropy [10], which will be shown later to be the maximum value of any entropy. From (20), we obtain the famous family of power means of the non-zero elements of X: in particular, m_{0,−∞}, m_{0,−1}, m_{0,1}, m_{0,2} and m_{0,∞} are the minimum, the harmonic mean, the arithmetic mean, the root-mean-square mean, and the maximum, respectively. In the limiting case q → 0, we obtain m_{0,0}, which is the geometric mean. In Table 1, we summarize these and other particular cases of means and entropies at specific p, q; a short numerical sketch follows the table. The key point is that each p, q uniquely defines an entropy S_pq with a corresponding mean m_pq, so that each pair S_pq, m_pq is coupled in this sense.
Table 1. Special cases of S_pq and m_pq. Note that S_pq = S_qp and m_pq = m_qp (Property 4.3).

| Order p, q | S_pq(X) | S_pq name | m_pq(X) | m_pq name |
|---|---|---|---|---|
| 0, q (q ≠ 0) | log N_X | Boltzmann-Hartley entropy | (Σ_{i=1}^{N_X} x_i^q / N_X)^{1/q} | Generic q-mean; specific q values give the harmonic (−1), arithmetic (1), root-mean-square (2), maximum (∞) and minimum (−∞) means |
| 0, 0 | log N_X | Boltzmann-Hartley entropy | (Π_{i=1}^{N_X} x_i)^{1/N_X} | Geometric mean |
| ∞, ∞ | log K_X^max | | max(X) | Maximum |
| ∞, q (q > 0) | log(K_X^max + Σ_{x_i ≠ max(x_i)} (x_i / max(x_i))^q) | | max(X) | Maximum |
| −∞, −∞ | log K_X^min | | min(X) | Minimum |
| −∞, q (q < 0) | log(K_X^min + Σ_{x_i ≠ min(x_i)} (x_i / min(x_i))^q) | | min(X) | Minimum |
| 1, q (q > 0) | (1/(1−q)) log Σ_{i=1}^{K} (x_i / Σ_{j=1}^{K} x_j)^q | Rényi entropy of order q of the complete distribution {x_i / Σ_i x_i} | (Σ_{i=1}^{K} x_i) exp(−S_1q(X)) | "Rényi-like" mean |
| 1, 1 | −Σ_{i=1}^{K} (x_i / Σ_j x_j) log(x_i / Σ_j x_j) | Gibbs-Shannon entropy | (Σ_{i=1}^{K} x_i) exp(−S_11(X)) | "Shannon-like" mean |
A typical plot of ‖X‖_p, S_pp and m_pp is shown on a log scale in Figure 1, illustrating some of the properties to be discussed hereafter. In particular, we notice:
• log ‖X‖_p has a two-sided singularity at p = 0.
• S_pp is non-decreasing for negative p and non-increasing for positive p, and is guaranteed to be maximized at p = 0. This is discussed more generally in Property 4.6.
• m_pp ranges from min(X) to max(X) and is always non-decreasing with respect to p. This is discussed more generally in Property 4.6.
• p = 1 has the specific property that m_11 k_11 = ‖X‖_1.
Figure 1. Typical plot of ‖X‖_p, S_pp and m_pp on a log scale.

3. An Axiomatic Approach to the Generalized Entropies and Means

In order to simplify the conceptual underpinnings, let us now describe the generalized entropies S_pq and means m_pq through two simple axioms.

3.1. Axioms for the Generalized Entropy

Let p, q be fixed real numbers obeying pq ≥ 0. Consider a map, S_pq, which maps multisets of positive real numbers into the real numbers. We call S_pq a generalized entropy of order p, q if it obeys the following two axioms:
Entropy Axiom 1: 
S_pq(M) of a uniform multiset M = {m_0, m_0, …, m_0}, m_0 > 0, with multiplicity k equals log k (where the base of the logarithm is arbitrarily chosen), i.e.,
$$ M = \{m_i : m_i = m_0,\ m_0 > 0,\ i = 1, 2, \ldots, k\} \;\Rightarrow\; S_{pq}(M) = \log k \qquad (21) $$
Entropy Axiom 2: 
If p ≠ q, the map S_pq depends only on the ratio of the multiset's p and q norms, i.e., S_pq is some function, f_pq, of this ratio:
$$ S_{pq}(X) := f_{pq}\!\left(\frac{\|X\|_p}{\|X\|_q}\right), \qquad p \ne q \qquad (22a) $$
If p = q, the map S_qq depends only on the ratio of the derivative of the multiset's q-norm to the norm itself, i.e., S_qq is some function, f_q, of this ratio:
$$ S_{qq}(X) := f_q\!\left(\frac{\partial_q \|X\|_q}{\|X\|_q}\right), \qquad p = q \qquad (22b) $$
To see that (22b) arises in the limit from (22a), we notice that, since the logarithm is strictly monotone, (22a) is equivalent to saying that S_pq is some function h_pq of a finite number times the logarithm of the ratio of norms:
$$ S_{pq}(X) = h_{pq}\!\left(\frac{1}{p-q}\,\log\frac{\|X\|_p}{\|X\|_q}\right) = h_{pq}\!\left(\frac{\log\|X\|_p - \log\|X\|_q}{p-q}\right) \qquad (23) $$
Choosing p = q + ε and taking the limit ε → 0, we obtain (22b) with f_q = h_qq.
Proposition: The two entropy axioms (21) and (22) uniquely define S_pq(X) and S_qq(X), namely as given in Section 2 by (8) and (9).
Proof: 
Entropy Axiom 2 implies that for any two multisets X and Y:
$$ \frac{\|X\|_p}{\|X\|_q} = \frac{\|Y\|_p}{\|Y\|_q} \;\Rightarrow\; S_{pq}(X) = S_{pq}(Y), \qquad p \ne q \qquad (24a) $$
$$ \frac{\partial_q \|X\|_q}{\|X\|_q} = \frac{\partial_q \|Y\|_q}{\|Y\|_q} \;\Rightarrow\; S_{qq}(X) = S_{qq}(Y), \qquad p = q \qquad (24b) $$
Choosing for Y the uniform multiset M of Axiom 1, and taking the logarithm of both sides yields:
$$ \log\frac{\|X\|_p}{\|X\|_q} = \left(\frac{1}{p} - \frac{1}{q}\right)\log k, \qquad p \ne q; \qquad\qquad \frac{\partial_q \|X\|_q}{\|X\|_q} = -\frac{\log k}{q^2}, \qquad p = q \qquad (25) $$
Using that S_pq(X) = S_pq(Y) = S_pq(M) = log k, we now uniquely obtain the formulas for S_pq(X) and S_qq(X) given in (8) and (9), respectively. We note that the functions f_pq and f_q are therefore:
$$ f_{pq}(\cdot) = \frac{\log(\cdot)}{\frac{1}{p} - \frac{1}{q}}, \qquad p \ne q; \qquad\qquad f_q(\cdot) = -q^2\,(\cdot), \qquad p = q \qquad (26) $$
Remarks:
  • Even though k is treated as an integer multiplicity in Axiom 1, this condition is tacitly relaxed in Axiom 2 to include non-integer values, which we may call the effective cardinality (or effective dimensionality) of order p, q.
  • The logarithmic measure of Axiom 1 is directly connected to the celebrated Boltzmann entropy formula S^(B) = K_B log W, where K_B is the Boltzmann constant and W is the number of microstates in the system. The logarithmic measure is also connected to the so-called "Hartley measure" [11,12], which indicates non-specificity [13] and does not require the assumption of a probability distribution. In Axiom 1, a multiset of equal positive numbers is all that is required. In fact, Axiom 1 encompasses the additivity and monotonicity axioms [12,13], which are equivalent to the earlier Khinchin axioms of additivity, maximality and expansibility [14].
  • In Axiom 2, note that the p-norm definition is relaxed to include values p < 1, for which the triangle inequality would be violated if the multisets were treated as vectors.

3.2. Axioms for the Generalized Mean

We define the pth moment of the multiset X = {x_i} as:
$$ p\text{th moment}(X) := \|X\|_p^p = \sum_i x_i^p \qquad (27) $$
The nomenclature "pth moment" is motivated by the fact that for the density function r(x) := Σ_i δ(x − x_i), where δ(x) is the Dirac delta function, the pth moment is indeed:
$$ \int x^p\, r(x)\, dx = \int x^p \sum_i \delta(x - x_i)\, dx = \sum_i x_i^p = \|X\|_p^p \qquad (28) $$
Let p, q be fixed real numbers obeying pq ≥ 0. Consider a map, m_pq, which maps multisets of positive real numbers into the real numbers. We call m_pq a generalized mean of order p, q if it obeys the following two axioms:
Mean Axiom 1: 
m_pq(M) of a uniform multiset M = {m_0, m_0, …, m_0}, m_0 > 0, is m_0. Formally:
$$ M = \{m_i : m_i = m_0,\ m_0 > 0,\ i = 1, 2, \ldots, k\} \;\Rightarrow\; m(M) := m_0 \qquad (29) $$
Mean Axiom 2: 
If p ≠ q, the map m_pq depends, for any multiset X, only on the ratio of the multiset's pth and qth moments, ‖X‖_p^p and ‖X‖_q^q, i.e., m_pq is some function, g_pq, of their ratio:
$$ m_{pq}(X) := g_{pq}\!\left(\frac{\|X\|_p^p}{\|X\|_q^q}\right), \qquad p \ne q \qquad (30a) $$
If p = q, the map m_qq is a function only of the ratio:
$$ m_{qq}(X) := g\!\left(\frac{\partial_q \|X\|_q^q}{\|X\|_q^q}\right), \qquad p = q \qquad (30b) $$
The fact that (30b) is the limit of (30a) follows by the same reasoning as in (23).
Proposition: 
The two mean axioms (29) and (30) uniquely define m_pq, namely as given in (7) and (10).
Proof: 
Axiom 2 implies for any two multisets X and Y that:
$$ \frac{\|X\|_p^p}{\|X\|_q^q} = \frac{\|Y\|_p^p}{\|Y\|_q^q} \;\Rightarrow\; m_{pq}(X) = m_{pq}(Y), \quad p \ne q; \qquad \frac{\partial_q\|X\|_q^q}{\|X\|_q^q} = \frac{\partial_q\|Y\|_q^q}{\|Y\|_q^q} \;\Rightarrow\; m_{qq}(X) = m_{qq}(Y), \quad p = q \qquad (31) $$
Choosing the multiset Y to be the uniform multiset M from (29), we obtain:
$$ \frac{\|X\|_p^p}{\|X\|_q^q} = m_0^{p-q}, \qquad p \ne q; \qquad\qquad \frac{\partial_q\|X\|_q^q}{\|X\|_q^q} = \log m_0, \qquad p = q \qquad (32) $$
We can now use that m(X) = m(M) = m_0 to obtain m_pq(X) and m_qq(X) as given in (7) and (10), respectively. Accordingly, the functions g_pq and g are found to be:
$$ g_{pq}(\cdot) = (\cdot)^{\frac{1}{p-q}}, \qquad p \ne q; \qquad\qquad g(\cdot) = \exp(\cdot), \qquad p = q \qquad (33) $$
We have obtained axiomatizations of the generalized entropies and means which revealed, in particular, that the generalized entropies can be characterized as those entropies that cover the reference-multiset case (the multiset of equal elements) and that are functions only of the ratio of the multisets' l_p and l_q norms. Similarly, the axiomatization also revealed that the generalized means can be characterized as those means which cover the reference-multiset case and which are functions only of the ratio of the multisets' pth and qth moments, ‖X‖_p^p and ‖X‖_q^q. We will now develop an axiomatization that links up with Section 2, yielding simultaneously a unique family of generalized entropies and means.

3.3. Unifying Axioms for Generalized Entropies and Means

We notice that, as is straightforward to verify:
$$ \frac{\|X\|_p}{\|X\|_q} = \frac{\|Y\|_p}{\|Y\|_q} \ \text{ and } \ \frac{\|X\|_p^p}{\|X\|_q^q} = \frac{\|Y\|_p^p}{\|Y\|_q^q} \quad\Longleftrightarrow\quad \|X\|_p = \|Y\|_p \ \text{ and } \ \|X\|_q = \|Y\|_q, \qquad p \ne q \qquad (34) $$
This means that we can describe the generalized entropies and means through a unifying set of axioms. To this end, let p, q be fixed real numbers obeying pq ≥ 0. Consider maps S_pq and m_pq, which map multisets of positive real numbers into the real numbers. We call S_pq and m_pq generalized entropies and means of order p, q, respectively, if they obey the following two axioms:
Unifying Axiom 1: 
S_pq and m_pq applied to a multiset of k equal elements M = {m_0, m_0, …, m_0}, m_0 > 0, yield the values log k and m_0, respectively.
Unifying Axiom 2: 
$$ \|X\|_p = \|Y\|_p \ \text{and} \ \|X\|_q = \|Y\|_q \;\Rightarrow\; S_{pq}(X) = S_{pq}(Y) \ \text{and} \ m_{pq}(X) = m_{pq}(Y), \qquad p \ne q $$
$$ \|X\|_q = \|Y\|_q \ \text{and} \ \partial_q\|X\|_q = \partial_q\|Y\|_q \;\Rightarrow\; S_{qq}(X) = S_{qq}(Y) \ \text{and} \ m_{qq}(X) = m_{qq}(Y), \qquad p = q \qquad (35) $$
Proposition: 
The maps S_pq and m_pq are unique and given by Equations (7)–(10).
Proof: 
The proofs are straightforward and proceed similarly to the proofs of the propositions related to the entropy and mean axioms.

4. Properties of S_pq and m_pq

We list in this section useful properties of m_pq, S_pq and k_pq, with proofs in the appendices. The definitions of m_pq, S_pq and k_pq are given by (7)–(10). We also add two plots for an example multiset X = {10, 9, 8, 7, 6, 0.5, 0.4, 0.3, 0.2, 0.1} in Figure 2 and Figure 3, in order to provide some numerical illustration of the properties hereunder.
Figure 2. Numerical example showing the multiset elements {x_i} versus their index i, with their corresponding mean m_pq and effective cardinality k_pq for different values of p, q.
Figure 3. Numerical example for the multiset X showing its mean m_pq and effective cardinality k_pq plotted versus q for fixed values of p.

4.1. Scaling

Given a multiset X = {x_i : i = 1, 2, …, K} and a constant γ > 0, we have:
$$ S_{pq}(\{\gamma x_i : i = 1, 2, \ldots, K\}) = S_{pq}(X), \qquad m_{pq}(\{\gamma x_i : i = 1, 2, \ldots, K\}) = \gamma\, m_{pq}(X) \qquad (36) $$
That is, the entropy is invariant under scaling, whereas the mean scales linearly. The proof is straightforward, based on (7).

4.2. Symmetry with Respect to the Elements of X

S_pq(X) does not depend on the order of the elements of X.

4.3. Symmetry with Respect to the Orders p, q

By exchanging p and q in (8), we readily find that:
$$ S_{pq}(X) = S_{qp}(X), \qquad m_{pq}(X) = m_{qp}(X) \qquad (37) $$

4.4. Sign Change of the Orders p, q

From (17), for pq ≥ 0, we obtain:
$$ S_{-p,-q}(\{x_i\}) = S_{p,q}(\{x_i^{-1} : x_i \ne 0\}), \qquad m_{-p,-q}(\{x_i\}) = m_{p,q}^{-1}(\{x_i^{-1} : x_i \ne 0\}) \qquad (38) $$

4.5. Range of S_pq(X)

Let N_X ≥ 1 be the number of non-zero elements in X. Then:
$$ 0 \le S_{pq}(X) \le \log N_X, \qquad pq \ge 0 \qquad (39) $$
The minimum value occurs when X has exactly one non-zero element. For p, q ≠ 0, the maximum value occurs when all the non-zero elements of X are equal. When either p or q is zero, S_pq(X) attains the maximum value, log N_X, for any distribution of X. This is physically intuitive, since the zeroth order renders all the non-zero multiset elements equal, so that we reach the equiprobable case, which maximizes the entropy. The proof of this property is in Appendix A.3; a quick numerical check is sketched below.
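A randomized sanity check of (39) (our sketch, assuming strictly positive multisets and positive orders p, q):

```python
import numpy as np

def S_pq(x, p, q):
    x = np.asarray(x, dtype=float); x = x[x > 0]
    lognorm = lambda r: np.log(np.sum(x**r)) / r
    return (lognorm(p) - lognorm(q)) / (1.0/p - 1.0/q)

rng = np.random.default_rng(0)
N = 8
for _ in range(1000):
    X = rng.uniform(0.01, 10.0, size=N)     # random positive multisets
    s = S_pq(X, p=0.5, q=3.0)
    assert 0.0 <= s <= np.log(N) + 1e-12
print("0 <= S_pq <= log N held in all trials")
```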

4.6. Monotonicity of S_pq(X) and m_pq(X) with Respect to p, q

We have the following results for the monotonicity of S_pq(X) and m_pq(X) with respect to p, q. The proofs are in Appendix B and Appendix C, respectively:
$$ q\,\frac{\partial}{\partial p} S_{pq} \le 0, \qquad p \ne q,\ pq > 0; \qquad\qquad p\,\frac{\partial}{\partial p} S_{pp} \le 0, \qquad p \to q \qquad (40) $$
$$ \frac{\partial}{\partial p} m_{pq} \ge 0, \qquad p \ne q,\ pq > 0; \qquad\qquad \frac{\partial}{\partial p} m_{pp} \ge 0, \qquad p \to q \qquad (41) $$
where equality holds when all the non-zero elements x_i are equal. Accordingly, for p ≠ q and pq > 0, fixing one order (say q), m_pq is always non-decreasing with respect to the other order (p), whereas S_pq is non-decreasing for p < 0 and non-increasing for p > 0, with a maximum value at p = 0. The same statements hold with p and q interchanged, by the symmetry Property 4.3.
Similarly, for the degenerate case p → q, m_pp is always non-decreasing with respect to p, whereas S_pp is non-decreasing for p < 0 and non-increasing for p > 0, with a maximum value at p = 0. In all cases, both the mean and the entropy are invariant with respect to the orders p, q if and only if all the non-zero elements x_i are equal.
This property explains the monotonicity of the curves in Figure 1 and Figure 3; a numerical check is sketched below.
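The monotonicity statements can be checked numerically; the following sketch (ours) fixes q and sweeps positive p for the example multiset of Figure 2:

```python
import numpy as np

def S_pq(x, p, q):
    x = np.asarray(x, dtype=float); x = x[x > 0]
    lognorm = lambda r: np.log(np.sum(x**r)) / r
    return (lognorm(p) - lognorm(q)) / (1.0/p - 1.0/q)

def m_pq(x, p, q):
    x = np.asarray(x, dtype=float); x = x[x > 0]
    return (np.sum(x**p) / np.sum(x**q)) ** (1.0 / (p - q))

X = [10, 9, 8, 7, 6, 0.5, 0.4, 0.3, 0.2, 0.1]
ps = np.linspace(0.5, 5.0, 40)                 # positive p, fixed q = 6
S_vals = [S_pq(X, p, 6.0) for p in ps]
m_vals = [m_pq(X, p, 6.0) for p in ps]
print(np.all(np.diff(S_vals) <= 1e-12))        # S_pq non-increasing for p > 0
print(np.all(np.diff(m_vals) >= -1e-12))       # m_pq non-decreasing in p
```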

4.7. Range of m_pq(X)

From Property 4.6, m_pq(X) is non-decreasing with respect to p, q. From Table 1, we know that m_{p,∞}(X) = max(X) and m_{p,−∞}(X) = min(X). Therefore, min(X) ≤ m_pq(X) ≤ max(X), which is an intuitive range for a mean. This also holds in the degenerate case p → q, i.e.,
$$ \min(X) \le m_{pp}(X) \le \max(X) \qquad (42) $$
where, as usual, equality holds when all the non-zero elements x_i are equal.

4.8. Additivity of the Joint Multiset Entropy

For two probability-distribution multisets X = {x_i : i = 1, 2, …, K_X} and Y = {y_j : j = 1, 2, …, K_Y}, we define the joint multiset X ⊗ Y := {x_i y_j : i = 1, 2, …, K_X; j = 1, 2, …, K_Y}. Then we have:
$$ S(X \otimes Y) = S(X) + S(Y), \qquad m(X \otimes Y) = m(X)\, m(Y) \qquad (43) $$
The proof is straightforward, using the fact that ‖X ⊗ Y‖_p = ‖X‖_p ‖Y‖_p along with (7)–(10). Property 4.8 holds both for p ≠ q and for the degenerate case p → q; a numerical check is sketched below.
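A numerical check of the additivity (43) (our sketch; the joint multiset is built with an outer product):

```python
import numpy as np

def S_pq(x, p, q):
    x = np.asarray(x, dtype=float); x = x[x > 0]
    lognorm = lambda r: np.log(np.sum(x**r)) / r
    return (lognorm(p) - lognorm(q)) / (1.0/p - 1.0/q)

X = np.array([0.5, 0.3, 0.2])
Y = np.array([0.6, 0.4])
XY = np.outer(X, Y).ravel()            # joint multiset {x_i * y_j}
p, q = 0.5, 2.0
print(S_pq(XY, p, q), S_pq(X, p, q) + S_pq(Y, p, q))   # equal, per (43)
```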

4.9. Sub-Additivity of the Effective Cardinality Subject to the Multiset Additive Union Operation

Let ⊎ denote the multiset additive-union operation [15] (page 50), e.g., {2, 2} ⊎ {1, 1, 2} = {1, 1, 2, 2, 2}. Let X = {x_i : i = 1, 2, …, K_X} and Y = {y_j : j = 1, 2, …, K_Y} be two multisets of non-negative real numbers. Moreover, let ξ and η be two positive real scaling factors for the elements of X and Y, respectively. Then:
$$ k_{pq}(\{\xi x_i\} \uplus \{\eta y_j\}) \le k_{pq}(X) + k_{pq}(Y) \qquad (44) $$
where equality holds under the following condition on the ratio ξ/η:
$$ \frac{\xi_0}{\eta_0} = \begin{cases} \left(\dfrac{\sum_j y_j^q / \sum_j y_j^p}{\sum_i x_i^q / \sum_i x_i^p}\right)^{\frac{1}{q-p}}, & p \ne q \\[3ex] \dfrac{\|Y\|_q}{\|X\|_q}\left(\dfrac{k_{qq}(X)}{k_{qq}(Y)}\right)^{1/q}, & p \to q \end{cases} \qquad (45) $$
This property is a generalization of the effective alphabet size of a mixture of two disjoint alphabets, as discussed in [16] (Problem 2.10). To see this, set p = q = 1 and note that ‖X‖_1 = ‖Y‖_1 = 1 for a complete probability distribution. Accordingly, k_pq(X) represents the effective alphabet size of X corresponding to the entropy S_pq(X). The proof is in Appendix D; a numerical check is sketched below.
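The following sketch (ours) checks (44) and the equality-achieving ratio of (45), non-degenerate branch, for two small multisets:

```python
import numpy as np

def k_pq(x, p, q):
    """Effective cardinality (7)."""
    x = np.asarray(x, dtype=float); x = x[x > 0]
    norm = lambda r: np.sum(x**r) ** (1.0 / r)
    return (norm(p) / norm(q)) ** (1.0 / (1.0/p - 1.0/q))

X = np.array([3.0, 2.0, 1.0]); Y = np.array([5.0, 0.5])
p, q = 1.0, 2.0
# equality-achieving ratio xi0/eta0 from (45), case p != q
ratio = ((np.sum(Y**q)/np.sum(Y**p)) / (np.sum(X**q)/np.sum(X**p))) ** (1/(q - p))
union = lambda xi, eta: np.concatenate([xi * X, eta * Y])   # additive union
print(k_pq(union(ratio, 1.0), p, q), k_pq(X, p, q) + k_pq(Y, p, q))  # equal
print(k_pq(union(1.0, 1.0), p, q) <= k_pq(X, p, q) + k_pq(Y, p, q))  # True
```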

4.10. Effective Rank of a Matrix

Let X be the multiset of the singular values of a matrix M. Then, k_0q(X) = rank(M). Accordingly, for general p, q, k_pq(X) can be viewed as a biased effective rank of M corresponding to the order p, q. From Property 4.5, we have 1 ≤ k_pq(X) ≤ rank(M). The minimum value occurs when M has exactly one non-zero singular value (the rank of M is 1). The maximum value, rank(M), is reached for any pq > 0 when all the non-zero singular values are equal. The effective rank can help to determine, in a well-defined p, q sense, how to view an ill-conditioned matrix, which is full-rank from a mathematical perspective but effectively behaves as if it possessed a lower rank. Such ill-conditioned matrices often arise in problems of oversampling or of determining degrees of freedom, where the singular values, ordered in non-increasing order by definition, exhibit some sort of "knee cut-off", similar to {x_i} in Figure 2. A biased effective rank can help to compare matrices when the knee cut-off is not sharp, giving more weight to the small or large singular values according to the order p, q, in a manner that is consistent across different matrices. Examples are the evaluation of the degrees of freedom in applications such as multi-antenna systems [3,4,5,6] and optical imaging systems [20], or, in general, any setting similarly limited by the space-bandwidth product [21]. A small numerical sketch follows.
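As a sketch (ours; the tolerance 1e-12 and the name effective_rank are illustrative choices), the biased effective rank of an ill-conditioned matrix can be computed from its singular values:

```python
import numpy as np

def effective_rank(M, p, q):
    """Biased effective rank: k_pq of the singular-value multiset (p != q)."""
    s = np.linalg.svd(M, compute_uv=False)
    s = s[s > 1e-12]                          # drop numerically zero values
    norm = lambda r: np.sum(s**r) ** (1.0 / r)
    return (norm(p) / norm(q)) ** (1.0 / (1.0/p - 1.0/q))

# Full-rank but ill-conditioned: a sharp "knee" in the singular values
M = np.diag([1.0, 0.9, 0.8, 1e-3, 1e-4])
print(np.linalg.matrix_rank(M))              # 5, the mathematical rank
print(effective_rank(M, p=1.0, q=2.0))       # ~ 3, the effective behavior
```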

4.11. Geometrical Interpretation of S_pq and m_pq on a Log Scale, with a Thermodynamics Analogy

For the multiset X = {x_i}, after taking the logarithm of both sides of (5), we obtain a simple relation between ‖X‖_q, S_pq and m_pq, as in (11):
$$ \log\|X\|_q^q = \log\|M\|_q^q = S_{pq} + q\,\log m_{pq}, \qquad \log\|X\|_p^p = \log\|M\|_p^p = S_{pq} + p\,\log m_{pq} \qquad (46) $$
Accordingly, a secant cutting the function f(q) = log ‖X‖_q^q at q = p_0 and q = q_0 has slope log m_{p0,q0} and intercept S_{p0,q0}, respectively, as shown in Figure 4. Based on the discussion of Section 1, (46) readily yields the following analogous expressions for a thermodynamic system described by the Boltzmann factors {e^{−E_i}}:
$$ \log Z_{\beta_1}(X) = S_{\beta_1,\beta_2} - \beta_1 E_0, \qquad \log Z_{\beta_2}(X) = S_{\beta_1,\beta_2} - \beta_2 E_0 \qquad (47) $$
In the limiting case, when p → q and β1 → β2, the secant in Figure 4 becomes a tangent and we obtain the Gibbs-Shannon entropy at the inverse temperature β. Consequently, (47) can be rewritten, after introducing the Boltzmann constant K_B and using the absolute temperature T = 1/(K_B β), as:
$$ -K_B T \log Z_\beta =: A_H = E_0 - T S_\beta \qquad (48) $$
where A_H is the Helmholtz free energy of the system. The secant relation (46) is checked numerically below.
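The secant interpretation (46) can be verified numerically; in the sketch below (ours), the chord of f(q) = log ‖X‖_q^q between p_0 and q_0 has slope log m_pq, which follows from (7), and its intercept is compared against S_pq computed independently from (8):

```python
import numpy as np

X = np.array([10, 9, 8, 7, 6, 0.5, 0.4, 0.3, 0.2, 0.1])
f = lambda r: np.log(np.sum(X**r))               # f(r) = log ||X||_r^r
p0, q0 = 1.0, 3.0

slope = (f(q0) - f(p0)) / (q0 - p0)              # secant slope
intercept = f(p0) - slope * p0                   # secant intercept at q = 0

log_m = (f(p0) - f(q0)) / (p0 - q0)              # log m_pq, from (7)
S = (f(p0)/p0 - f(q0)/q0) / (1/p0 - 1/q0)        # S_pq, from (8)
print(np.isclose(slope, log_m), np.isclose(intercept, S))   # True True
```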
Figure 4. A secant of log ‖X‖_q^q versus q.

5. Discussion and Conclusions

A two-parameter family of cardinalities, entropies and means has been derived for multisets of non-negative elements. Rather than starting from thermodynamic or information-theoretic considerations to derive entropies and means (see, e.g., [9,17,18,19]), we here defined the generalized entropies and means through simple abstract axioms. There are other families of entropies in the literature (e.g., [23]) that generalize the Shannon entropy. The generalized entropy in this manuscript is shown to preserve additivity (Property 4.8), which is not the case for the generalized entropies based on the non-additive Tsallis entropy, as in [23].
Our first two axiomatizations treat the generalized entropies and means separately. They revealed that the generalized entropies are exactly those entropies that are functions only of the ratio of the multisets' l_p and l_q norms, and that the generalized means are exactly those means that are functions only of the ratio of the multisets' pth and qth moments, ‖X‖_p^p and ‖X‖_q^q. Subsequently, our unifying axiomatization characterized the generalized entropies and means together. This showed that if two multisets have exactly the same l_p and l_q norms, then they share the same generalized entropy and mean.
We presented several key features of the new families of generalized entropies and means: for example, the family of generalized entropies contains and generalizes the Rényi family of entropies, of which the Shannon entropy is a special case, thus satisfying some of the desiderata for entropies [22]. We also showed the monotonicity with respect to p, q, the extreme values, the symmetry with respect to p, q, and the preservation of additivity. The effective cardinality k_pq measures the distribution uniformity of the multiset elements, in the sense of p- and q-norm equivalence to a reference flat multiset. From an information-theory perspective, S_pq and k_pq represent a two-parameter entropy of order p, q and its corresponding effective alphabet size, respectively, when a probability distribution is constructed after proper normalization of the multiset elements. Furthermore, we recall that knowing the p and q norms of a multiset is to know the multiset's pth and qth moments. Our findings here therefore imply that knowledge of a multiset's pth and qth moments is exactly enough information to deduce the multiset's (p,q)-entropy and (p,q)-mean. Further, knowledge of sufficiently many moments of a multiset can be sufficient to reconstruct the multiset. Conversely, it should be interesting to examine how many (p,q)-entropies and/or (p,q)-means are required to completely determine the multiset.
Regarding the thermodynamic interpretation, we noticed that requiring the p and q norms of two multisets to coincide is mathematically equivalent to requiring that the partition functions of two thermodynamic systems coincide at two temperatures. This, in turn, is equivalent to requiring that the Helmholtz free energies of the two thermodynamic systems coincide at two temperatures. The Helmholtz free energy represents the maximum mechanical work that can be extracted from a thermodynamic system under certain idealized circumstances. This suggests that there perhaps exists a thermodynamic interpretation of the generalized entropies and means in terms of the extractability of mechanical work. In this case, the fact that the generalized entropies and means depend on two rather than one temperature could be related to the fact that the maximum efficiency of a heat engine, obtained in Carnot cycles, is a function of two temperatures. We did show that in the limiting case, when the two temperatures become the same, one recovers the usual Boltzmann-factor-weighted mean energy as well as the usual Shannon/von Neumann entropy.

Acknowledgments

The authors gratefully acknowledge the research support of the Canada Research Chair, Discovery and Postdoctoral Fellowship programs of the Natural Sciences and Engineering Research Council of Canada (NSERC). The first author is grateful for the kind hospitality of the Department of Applied Mathematics at the University of Waterloo during the course of this research.

References

  1. Rényi, A. On the foundations of information theory. Rev. Int. Stat. Inst. 1965, 33, 1–14.
  2. Beck, C. Generalized information and entropy measures in physics. Contemp. Phys. 2009, 50, 495–510.
  3. Poon, A.S.Y.; Brodersen, R.W.; Tse, D.N.C. Degrees of freedom in multiple-antenna channels: A signal space approach. IEEE Trans. Inform. Theor. 2005, 51, 523–536.
  4. Migliore, M.D. On the role of the number of degrees of freedom of the field in MIMO channels. IEEE Trans. Antenn. Propag. 2006, 54, 620–628.
  5. Elnaggar, M.S. Electromagnetic Dimensionality of Deterministic Multi-Polarization MIMO Systems. Ph.D. Thesis; University of Waterloo: Waterloo, ON, Canada, 2007. Available online: http://uwspace.uwaterloo.ca/handle/10012/3434 (accessed on 18 April 2012).
  6. Elnaggar, M.S.; Safavi-Naeini, S.; Chaudhuri, S.K. A novel dimensionality metric for multi-antenna systems. In Proceedings of the Asia-Pacific Microwave Conference (APMC 2006), Yokohama, Japan, December 2006; pp. 242–245.
  7. Elnaggar, M.S.; Kempf, A. On a Generic Entropy Measure in Physics and Information. In Proceedings of the 7th International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOPT 2009), Seoul, Korea, June 2009.
  8. Beck, C.; Schlögl, F. Thermodynamics of Chaotic Systems: An Introduction; Cambridge University Press: Cambridge, UK, 1995.
  9. Rényi, A. On measures of entropy and information. In Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, University of California, Berkeley, CA, USA, 1961; Volume 1, pp. 541–561.
  10. Aczél, J.; Forte, B.; Ng, C.T. Why the Shannon and Hartley entropies are 'natural'. Adv. Appl. Probab. 1974, 6, 131–146.
  11. Hartley, R.V.L. Transmission of information. Bell Syst. Tech. J. 1928, 7, 535–564.
  12. Rényi, A. Probability Theory; North-Holland Pub. Co.: Amsterdam, The Netherlands, 1970.
  13. Klir, G.J.; Folger, T.A. Fuzzy Sets, Uncertainty and Information; Prentice Hall: Upper Saddle River, NJ, USA, 1988.
  14. Khinchin, A.I. Mathematical Foundations of Information Theory; Dover: New York, NY, USA, 1957.
  15. Blizard, W.D. Multiset theory. Notre Dame J. Formal Logic 1989, 30, 36–66.
  16. Cover, T.M.; Thomas, J.A. Elements of Information Theory; John Wiley & Sons: Hoboken, NJ, USA, 1991.
  17. Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423, 623–656.
  18. Ash, R.B. Information Theory; Interscience Publishers: New York, NY, USA, 1965.
  19. Reza, F.M. An Introduction to Information Theory; Dover: New York, NY, USA, 1994.
  20. Gori, F.; Guattari, G. Shannon number and degrees of freedom of an image. Opt. Commun. 1973, 7, 163–165.
  21. Landau, H.; Pollak, H. Prolate spheroidal wave functions, Fourier analysis and uncertainty: Part III: The dimension of the space of essentially time- and band-limited signals. Bell Syst. Tech. J. 1962, 41, 1295–1336.
  22. Aczél, J.; Daróczy, Z. On Measures of Information and Their Characterizations; Academic Press: New York, NY, USA, 1975.
  23. Furuichi, S. An axiomatic characterization of a two-parameter extended relative entropy. J. Math. Phys. 2010, 51, 123302.

Appendix A. Proof of Property 4.5: Range of S_pq

A.1. Non-Increasing ‖X‖_p with Respect to p

We want to show that ‖X‖_p is non-increasing with respect to p. This is equivalent to showing that $\frac{\partial}{\partial p}\log\|X\|_p = \frac{\partial}{\partial p}\!\left(\frac{\log\sum_{i=1}^{N} x_i^p}{p}\right) \le 0$:
$$ p^2\,\frac{\partial}{\partial p}\log\|X\|_p = p\,\frac{\sum_i x_i^p \log x_i}{\sum_j x_j^p} - \log\sum_j x_j^p = \sum_i \frac{x_i^p}{\sum_j x_j^p}\,\log x_i^p - \sum_i \frac{x_i^p}{\sum_j x_j^p}\,\log\sum_j x_j^p = \sum_i \frac{x_i^p}{\sum_j x_j^p}\,\log\frac{x_i^p}{\sum_j x_j^p} \le 0 $$
Therefore, ‖X‖_p ≥ ‖X‖_q for p < q. Equality occurs only when x_i^p / Σ_j x_j^p = 1, i.e., when {x_i} includes exactly one non-zero element. Note that there is a singularity of log ‖X‖_p at p = 0, with $\lim_{p\to 0^+} \log\|X\|_p = \infty$ and $\lim_{p\to 0^-} \log\|X\|_p = -\infty$. The property herein applies over the intervals −∞ < p < 0 and 0 < p < ∞.

A.2. Non-Decreasing Generalized p-Mean with Respect to p

Let X be a multiset of N strictly positive numbers. For p < q, we want to show that $\left(\frac{\sum_{i=1}^{N} x_i^p}{N}\right)^{1/p} \le \left(\frac{\sum_{i=1}^{N} x_i^q}{N}\right)^{1/q}$. To this end, we define f(x) = x^{q/p}, where x > 0 and q/p > 0, i.e., p, q ≠ 0 have the same sign. Clearly, d²f/dx² > 0 and thus f(x) is convex. From Jensen's inequality, $f\!\left(\sum w_i x_i\right) \le \sum w_i f(x_i)$, where $\sum_{i=1}^{n} w_i = 1$. Accordingly, $f\!\left(\sum w_i x_i^p\right) \le \sum w_i f(x_i^p)$, hence:
$$ \left(\sum_i w_i x_i^p\right)^{q/p} \le \sum_i w_i \left(x_i^p\right)^{q/p} = \sum_i w_i x_i^q $$
$$ \left(\sum_i w_i x_i^p\right)^{1/p} \le \left(\sum_i w_i x_i^q\right)^{1/q} $$
By setting w_i = 1/N, the proof is complete. Equality holds when all the x_i are equal.

A.3. Proof of Property 4.5: 0 ≤ S_pq ≤ log N

We start by proving that S_pq ≥ 0. From Lemma A.1, for p < q, we have log ‖X‖_p ≥ log ‖X‖_q. Therefore, log ‖X‖_p − log ‖X‖_q ≥ 0 for p, q ≠ 0 of the same sign. Since 1/p − 1/q > 0, we get $S_{pq} = \frac{\log\|X\|_p - \log\|X\|_q}{1/p - 1/q} \ge 0$. Following the same steps for p > q yields the same result. Finally, when p → q, we have $S_{qq} = -\sum_i \frac{x_i^q}{\sum_j x_j^q}\log\!\left(\frac{x_i^q}{\sum_j x_j^q}\right) \ge 0$. In all cases, equality holds when {x_i} includes exactly one non-zero element.
Next, we show that S_pq ≤ log N. From Lemma A.2, assuming both p, q ≠ 0 have the same sign, for p < q we have $\left(\frac{\sum_{i=1}^{N} x_i^p}{N}\right)^{1/p} \le \left(\frac{\sum_{i=1}^{N} x_i^q}{N}\right)^{1/q}$. Taking the logarithm yields $S_{pq} = \frac{\log\|X\|_p - \log\|X\|_q}{1/p - 1/q} \le \log N$. For p > q, we obtain the same result.
When p → q, we have:
$$ S_{qq} - \log N = -\sum_{i=1}^{N}\frac{x_i^q}{\sum_{j=1}^{N} x_j^q}\,\log\!\left(\frac{x_i^q}{\sum_{j=1}^{N} x_j^q}\right) - \sum_{i=1}^{N}\frac{x_i^q}{\sum_{j=1}^{N} x_j^q}\,\log N = \sum_{i=1}^{N}\frac{x_i^q}{\sum_{j=1}^{N} x_j^q}\,\log\!\left(\frac{\sum_{j=1}^{N} x_j^q}{N x_i^q}\right) \le \sum_{i=1}^{N}\frac{x_i^q}{\sum_{j=1}^{N} x_j^q}\left(\frac{\sum_{j=1}^{N} x_j^q}{N x_i^q} - 1\right) = \sum_{i=1}^{N}\left(\frac{1}{N} - \frac{x_i^q}{\sum_{j=1}^{N} x_j^q}\right) = 0 $$
where log x ≤ (x − 1) was used. In all cases, equality holds when all the non-zero elements of X are equal.

Appendix B. Proof of Property 4.6: Monotonicity of S_pq

B.1. Non-Degenerate Case p ≠ q

We have $S_{pq} = \log k_{pq} = \frac{q\,\log\sum_i x_i^p - p\,\log\sum_i x_i^q}{q - p}$. Accordingly:
$$ (q-p)^2\,\frac{\partial}{\partial p} S_{pq} = q(q-p)\,\frac{\sum_i x_i^p \log x_i}{\sum_j x_j^p} - (q-p)\log\sum_i x_i^q + q\log\sum_i x_i^p - p\log\sum_i x_i^q = q(q-p)\,\frac{\sum_i x_i^p \log x_i}{\sum_j x_j^p} - q\log\sum_j x_j^q + q\log\sum_j x_j^p $$
where we relabeled the summation index in the last two terms. Therefore:
$$ \frac{(q-p)^2}{q}\,\frac{\partial}{\partial p} S_{pq} = (q-p)\,\frac{\sum_i x_i^p \log x_i}{\sum_j x_j^p} - \log\sum_j x_j^q + \log\sum_j x_j^p = \sum_i \frac{x_i^p}{\sum_j x_j^p}\,\log x_i^{q-p} - \sum_i \frac{x_i^p}{\sum_j x_j^p}\,\log\frac{\sum_j x_j^q}{\sum_j x_j^p} = \sum_i \frac{x_i^p}{\sum_j x_j^p}\,\log\!\left(\frac{x_i^q \sum_j x_j^p}{x_i^p \sum_j x_j^q}\right) $$
where, in the second step, we multiplied the last two terms by $\sum_i x_i^p / \sum_j x_j^p = 1$. Consequently:
$$ \frac{(q-p)^2}{q}\,\frac{\partial}{\partial p} S_{pq} \le \sum_i \frac{x_i^p}{\sum_j x_j^p}\left(\frac{x_i^q \sum_j x_j^p}{x_i^p \sum_j x_j^q} - 1\right) = 0 $$
where we have used the fact that log x ≤ (x − 1). Equality occurs only when $\frac{x_i^q}{\sum_j x_j^q} = \frac{x_i^p}{\sum_j x_j^p}$ for every x_i, implying that each x_i is either zero or some fixed value x_0.
Accordingly, after multiplying the last inequality by the positive factor $q^2/(q-p)^2$, we obtain:
$$ q\,\frac{\partial}{\partial p} S_{pq} \le 0, \qquad p \ne q $$
where the equality holds when all the non-zero elements x_i are equal.
Since p and q are assumed to have the same sign (from the condition pq > 0), we deduce that, with respect to either p or q (while fixing the other order), S_pq is non-decreasing for negative orders p, q and non-increasing for positive orders p, q, with a maximum value at p, q = 0 (note that $\partial_p S_{pq}\big|_{q=0} = 0$). S_pq is invariant with respect to p, q if and only if all the non-zero elements x_i are equal.

B.2. Degenerate Case p → q

From Appendix B.1, if 0 ≤ a ≤ b ≤ c ≤ d, then S_cd ≤ S_bd ≤ S_bc ≤ S_ac ≤ S_ab. In the limiting case, when c → d⁻ and b → a⁺, and knowing from (9) that this limit exists, we readily obtain S_dd ≤ S_aa. In a similar fashion, for negative index values d ≤ c ≤ b ≤ a ≤ 0, we get S_aa ≤ S_dd. In either case, equality occurs when all the non-zero elements x_i are equal. S_pp has a maximum value at p = 0. Accordingly:
$$ p\,\frac{\partial}{\partial p} S_{pp} \le 0 $$

Appendix C. Proof of Property 4.6: Monotonicity of m_pq

C.1. Non-Degenerate Case p ≠ q

We have from (5) that log ‖X‖_q = log m_pq + S_pq/q. Differentiating with respect to p, we get:
$$ \frac{\partial}{\partial p} m_{pq} = -\,m_{pq}\,\frac{\partial_p S_{pq}}{q} $$
Since k_pq > 0 (Property 4.5), (5) confirms that m_pq > 0 as well, because the norm is defined to discard any zero elements (6). From Appendix B, $\partial_p S_{pq}/q \le 0$. Therefore:
$$ \frac{\partial}{\partial p} m_{pq} \ge 0, \qquad p, q \ne 0,\ p \ne q $$
with equality when all the non-zero elements x_i are equal.

C.2. Degenerate Case p → q

From (11), log m_pp(X) = log ‖X‖_p − S_pp(X)/p; accordingly:
$$ \frac{\partial_p m_{pp}}{m_{pp}} = \frac{\partial_p \|X\|_p}{\|X\|_p} - \frac{\partial_p S_{pp}}{p} + \frac{S_{pp}}{p^2} $$
Moreover, from (9), $S_{qq}(X) = -q^2\,\frac{\partial_q \|X\|_q}{\|X\|_q}$. Consequently, $p^2\,\frac{\partial_p m_{pp}}{m_{pp}} = -p\,\partial_p S_{pp} \ge 0$, where we used the result of Appendix B.2. Therefore, since m_pp > 0, we get:
$$ \frac{\partial}{\partial p} m_{pp} \ge 0 $$

Appendix D. Proof of Property 4.9: Sub-Additivity of the Effective Cardinality

For p ≠ q, we want to show that k_pq({ξ x_i} ⊎ {η y_j}) ≤ k_pq(X) + k_pq(Y), where ξ, η > 0. Our objective is to find a condition on ξ and η that maximizes S_pq({ξ x_i} ⊎ {η y_j}). We have:
$$ (p-q)\,S_{pq}(\{\xi x_i\} \uplus \{\eta y_j\}) = p\,\log\!\left(\sum_i \xi^q x_i^q + \sum_j \eta^q y_j^q\right) - q\,\log\!\left(\sum_i \xi^p x_i^p + \sum_j \eta^p y_j^p\right) \qquad (49) $$
Setting ∂S_pq/∂ξ = 0 and ∂S_pq/∂η = 0, we obtain the following two conditions:
$$ \frac{\sum_i \xi_0^q x_i^q}{\sum_i \xi_0^q x_i^q + \sum_j \eta_0^q y_j^q} = \frac{\sum_i \xi_0^p x_i^p}{\sum_i \xi_0^p x_i^p + \sum_j \eta_0^p y_j^p} \qquad (50) $$
$$ \frac{\sum_j \eta_0^q y_j^q}{\sum_i \xi_0^q x_i^q + \sum_j \eta_0^q y_j^q} = \frac{\sum_j \eta_0^p y_j^p}{\sum_i \xi_0^p x_i^p + \sum_j \eta_0^p y_j^p} \qquad (51) $$
From (50) and (51), we have:
$$ \frac{\sum_i \xi_0^q x_i^q}{\sum_i \xi_0^p x_i^p} = \frac{\sum_j \eta_0^q y_j^q}{\sum_j \eta_0^p y_j^p} \qquad (52) $$
Accordingly, the required proportionality condition is:
$$ \frac{\xi_0}{\eta_0} = \left(\frac{\sum_j y_j^q / \sum_j y_j^p}{\sum_i x_i^q / \sum_i x_i^p}\right)^{\frac{1}{q-p}} \qquad (53) $$
From (49), we have:
$$ k_{pq}(\{\xi_0 x_i\} \uplus \{\eta_0 y_j\}) = \left(\frac{\left(\sum_i \xi_0^q x_i^q + \sum_j \eta_0^q y_j^q\right)^p}{\left(\sum_i \xi_0^p x_i^p + \sum_j \eta_0^p y_j^p\right)^q}\right)^{\frac{1}{p-q}} \qquad (54) $$
Substituting $\sum_i \xi_0^p x_i^p + \sum_j \eta_0^p y_j^p$ from (50) into (54) yields:
$$ \begin{aligned} k_{pq}(\{\xi_0 x_i\} \uplus \{\eta_0 y_j\}) &= \left(\frac{\left(\sum_i \xi_0^q x_i^q\right)^q}{\left(\sum_i \xi_0^p x_i^p\right)^q}\left(\sum_i \xi_0^q x_i^q + \sum_j \eta_0^q y_j^q\right)^{p-q}\right)^{\frac{1}{p-q}} \\ &= \left(\sum_i \xi_0^q x_i^q\right)^{\frac{q}{p-q}}\left(\sum_i \xi_0^p x_i^p\right)^{-\frac{q}{p-q}}\left(\sum_i \xi_0^q x_i^q + \sum_j \eta_0^q y_j^q\right) \\ &= \left(\sum_i \xi_0^q x_i^q\right)^{1+\frac{q}{p-q}}\left(\sum_i \xi_0^p x_i^p\right)^{-\frac{q}{p-q}} + \left(\sum_j \eta_0^q y_j^q\right)\left(\frac{\sum_i \xi_0^q x_i^q}{\sum_i \xi_0^p x_i^p}\right)^{\frac{q}{p-q}} \\ &= \frac{\left(\sum_i \xi_0^q x_i^q\right)^{\frac{p}{p-q}}}{\left(\sum_i \xi_0^p x_i^p\right)^{\frac{q}{p-q}}} + \left(\sum_j \eta_0^q y_j^q\right)\left(\frac{\sum_j \eta_0^q y_j^q}{\sum_j \eta_0^p y_j^p}\right)^{\frac{q}{p-q}} \\ &= \frac{\left(\sum_i \xi_0^q x_i^q\right)^{\frac{p}{p-q}}}{\left(\sum_i \xi_0^p x_i^p\right)^{\frac{q}{p-q}}} + \frac{\left(\sum_j \eta_0^q y_j^q\right)^{\frac{p}{p-q}}}{\left(\sum_j \eta_0^p y_j^p\right)^{\frac{q}{p-q}}} \end{aligned} \qquad (55) $$
where we employed (52) to obtain the second-to-last step. Consequently, since ξ0 and η0 cancel out in the last step, we get:
$$ k_{pq}(\xi_0 X \uplus \eta_0 Y) = \left(\frac{\left(\sum_i x_i^q\right)^p}{\left(\sum_i x_i^p\right)^q}\right)^{\frac{1}{p-q}} + \left(\frac{\left(\sum_j y_j^q\right)^p}{\left(\sum_j y_j^p\right)^q}\right)^{\frac{1}{p-q}} = k_{pq}(X) + k_{pq}(Y) \qquad (56) $$
To confirm that (53) is indeed a maximizing condition for k_pq(ξ0 X ⊎ η0 Y), consider the following example. Let X = {x_0} and Y = {y_0}, with multiplicities K_X and K_Y, respectively. From the range of k_pq (Property 4.5), we know that max(k_pq(ξ0 X ⊎ η0 Y)) = K_X + K_Y only when all the elements are equal, i.e., when ξ0 x_0 = η0 y_0. Indeed, (53) confirms this maximizing condition, and thus k_pq(ξX ⊎ ηY) ≤ k_pq(X) + k_pq(Y).
By taking the limit when p → q, (53) becomes:
$$ \frac{\xi_0}{\eta_0} = \frac{\|Y\|_q}{\|X\|_q}\left(\frac{k_{qq}(X)}{k_{qq}(Y)}\right)^{1/q}, \qquad p \to q \qquad (57) $$
and k_qq(ξX ⊎ ηY) ≤ k_qq(X) + k_qq(Y).
