1. Introduction
Probability measures on $\mathbb{R}^n$ treated in this paper are absolutely continuous with respect to the standard Lebesgue measure, and we shall identify them with their densities. For a probability measure $f$, the entropy $h(f)$ and the Fisher information $I(f)$ can be introduced, which play important roles in information theory, probability, and statistics. For more details on these subjects, see the famous book [1].
Hereafter, for a function $\phi$ of $n$ variables on $\mathbb{R}^n$, the integral of $\phi$ over the whole of $\mathbb{R}^n$ with respect to the standard Lebesgue measure $dx$ is abbreviated as
$$\int_{\mathbb{R}^n} \phi(x)\,dx = \int \phi,$$
that is, we shall leave out the variable $x$ in the integrand in order to simplify the expressions.
Definition 1.1. Let $f$ be a probability measure on $\mathbb{R}^n$. Then the (differential) entropy of $f$ is defined by
$$h(f) = -\int f \log f.$$
For a random variable $X$ on $\mathbb{R}^n$ with the density $f$, we write the entropy of $X$ as $h(X) = h(f)$. The Fisher information of a differentiable density $f$ is defined by
$$I(f) = \int \frac{|\nabla f|^2}{f} = \int f\,\bigl|\nabla \log f\bigr|^2.$$
When the random variable $X$ on $\mathbb{R}^n$ has the differentiable density $f$, we also write $I(X) = I(f)$.
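For instance, a standard calculation for the $n$-dimensional Gaussian density $f$ with covariance matrix $\sigma^2 I_n$ gives
$$h(f) = \frac{n}{2}\log\bigl(2\pi e\,\sigma^2\bigr), \qquad I(f) = \int f\,|\nabla\log f|^2 = \frac{n}{\sigma^2},$$
values which will reappear several times below.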
An important result on the behavior of the Fisher information under convolution (that is, under the sum of independent random variables) is the Stam inequality, which was first stated by Stam in [2] and subsequently proved by Blachman [3]:
$$\frac{1}{I(f * g)} \;\ge\; \frac{1}{I(f)} + \frac{1}{I(g)}, \qquad (1)$$
where we have equality if and only if $f$ and $g$ are Gaussian.
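As a quick check of the equality case, let $f$ and $g$ be $n$-dimensional Gaussian densities with covariance matrices $\sigma_1^2 I_n$ and $\sigma_2^2 I_n$; then $f * g$ is the Gaussian density with covariance $(\sigma_1^2+\sigma_2^2) I_n$, and
$$\frac{1}{I(f * g)} = \frac{\sigma_1^2+\sigma_2^2}{n} = \frac{\sigma_1^2}{n} + \frac{\sigma_2^2}{n} = \frac{1}{I(f)} + \frac{1}{I(g)},$$
so that (1) holds with equality.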
The importance of the Stam inequality can be found in its applications, for instance, the entropy power inequality [2]; the logarithmic Sobolev inequality [4]; the Cercignani conjecture [5]; and the Shannon conjecture on entropy and the central limit theorem [6,7].
For $t > 0$, we denote by $f * \gamma_t$ the convolution of $f$ with the $n$-dimensional Gaussian density $\gamma_t$ with mean vector $\mathbf{0}$ and covariance matrix $t I_n$, where $I_n$ is the identity matrix. Namely, $f \mapsto f * \gamma_t$ is the heat semigroup acting on $f$ and satisfies the partial differential equation
$$\frac{\partial}{\partial t}\,\bigl(f * \gamma_t\bigr) = \frac{1}{2}\,\Delta\bigl(f * \gamma_t\bigr), \qquad (2)$$
which is called the heat equation. In this paper, we simply denote $f * \gamma_t$ by $f_t$ and call it the Gaussian perturbation of $f$. Namely, letting $X$ be the random variable on $\mathbb{R}^n$ with the density $f$ and $Z$ be an $n$-dimensional Gaussian random variable independent of $X$ with mean vector $\mathbf{0}$ and covariance matrix $t I_n$, the Gaussian perturbation $f_t$ stands for the density function of the independent sum $X + Z$.
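As a concrete illustration, the following Python sketch (the initial density, the grid, and the step sizes are illustrative choices, not part of the paper) builds the Gaussian perturbation $f_t$ on a grid by discrete convolution and checks the heat equation (2) by finite differences.

```python
import numpy as np

# Numerical sketch (n = 1): build the Gaussian perturbation f_t = f * N(0, t)
# on a grid and check the heat equation  (d/dt) f_t = (1/2) (d^2/dx^2) f_t.
# The initial density (Exp(1)) and the grid parameters are illustrative choices.

x = np.linspace(-20.0, 20.0, 4001)
dx = x[1] - x[0]

f = np.where(x > 0.0, np.exp(-x), 0.0)   # density of the exponential law Exp(1)
f /= f.sum() * dx                        # renormalise on the grid

def perturb(p, t):
    """Convolve the grid density p with the centred Gaussian of variance t."""
    kernel = np.exp(-x**2 / (2.0 * t)) / np.sqrt(2.0 * np.pi * t)
    return np.convolve(p, kernel, mode="same") * dx

t, dt = 1.0, 1e-3
f_t = perturb(f, t)

# left-hand side of the heat equation: time derivative by a centred difference
dfdt = (perturb(f, t + dt) - perturb(f, t - dt)) / (2.0 * dt)
# right-hand side: (1/2) * Laplacian, by repeated finite differences
laplacian = np.gradient(np.gradient(f_t, dx), dx)

print(np.max(np.abs(dfdt - 0.5 * laplacian)))   # small, up to discretisation error
```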
The remarkable relation between the entropy and the Fisher information can be established by the Gaussian perturbation (see, for instance, [1], [2] or [8]):
$$\frac{d}{dt}\, h(f_t) = \frac{1}{2}\, I(f_t), \qquad (3)$$
which is known as the de Bruijn identity.
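For example, when $f$ is the $n$-dimensional Gaussian density with covariance $\sigma^2 I_n$, the perturbation $f_t$ is Gaussian with covariance $(\sigma^2+t) I_n$, and both sides of (3) can be computed explicitly:
$$\frac{d}{dt}\, h(f_t) = \frac{d}{dt}\,\frac{n}{2}\log\bigl(2\pi e\,(\sigma^2+t)\bigr) = \frac{n}{2(\sigma^2+t)} = \frac{1}{2}\, I(f_t).$$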
Let $f$ and $g$ be probability measures on $\mathbb{R}^n$ such that $f \ll g$ ($f$ is absolutely continuous with respect to $g$). Setting the probability measure $g$ as a reference, the relative entropy and the relative Fisher information can be introduced as follows:
Definition 1.2. The relative entropy of $f$ with respect to $g$, $D(f\,\|\,g)$, is defined by
$$D(f\,\|\,g) = \int f \log\frac{f}{g},$$
which always takes a non-negative value. We also define the relative Fisher information of $f$ with respect to $g$ by
$$I(f\,\|\,g) = \int f\,\Bigl|\nabla \log\frac{f}{g}\Bigr|^2,$$
which is also non-negative. When random variables $X$ and $Y$ have the densities $f$ and $g$, respectively, the relative entropy and the relative Fisher information of $X$ with respect to $Y$ are defined by $D(X\,\|\,Y) = D(f\,\|\,g)$ and $I(X\,\|\,Y) = I(f\,\|\,g)$, respectively.
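As a simple illustration of these quantities, let $f$ and $g$ be the centered Gaussian densities on $\mathbb{R}$ with variances $a$ and $b$, respectively; then
$$D(f\,\|\,g) = \frac{1}{2}\Bigl(\log\frac{b}{a} + \frac{a}{b} - 1\Bigr), \qquad I(f\,\|\,g) = \int f(x)\,\Bigl(\frac{x}{b} - \frac{x}{a}\Bigr)^2 dx = \frac{(a-b)^2}{a\,b^2},$$
and both quantities vanish exactly when $a = b$.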
In view of the de Bruijn identity, one might expect that there is a similar connection between the relative entropy and the relative Fisher information. Indeed, gradient formulas for relative entropy functionals were obtained in [9,10,11], although in those cases the reference measure is not perturbed.
Recently, however, Verdú in [12] investigated the derivative in $t$ of $D(f_t\,\|\,g_t)$ for two Gaussian perturbations $f_t$ and $g_t$. Here we should note that in this case the reference measure moves by the same time parameter. The following identity of de Bruijn type,
$$\frac{d}{dt}\, D(f_t\,\|\,g_t) = -\frac{1}{2}\, I(f_t\,\|\,g_t),$$
has been derived via the MMSE in estimation theory (see also [13] for general perturbations).
The main aim of this paper is to give an alternative proof of this identity by a direct calculation with integration by parts, the method being similar to the ones in [11,14]. Moreover, it will easily be found that the above identity yields an integral representation of the relative entropy. We shall also give a simple proof of the logarithmic Sobolev inequality for the centered Gaussian measure in the univariate ($n = 1$) case as an application of the integral representation.
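Before proceeding, the identity of de Bruijn type stated above can also be observed numerically. The following Python sketch (the particular pair $f$, $g$, the grid, and the step sizes are illustrative choices, with $f \ll g$ guaranteed by taking $g$ Gaussian) compares a finite-difference derivative of $D(f_t\,\|\,g_t)$ in $t$ with $-\tfrac12 I(f_t\,\|\,g_t)$ on a grid.

```python
import numpy as np

# Numerical sketch (n = 1): compare (d/dt) D(f_t || g_t) with -(1/2) I(f_t || g_t).
# Here f = Exp(1) and g = N(0, 2) are illustrative choices with f << g.

x = np.linspace(-25.0, 25.0, 5001)
dx = x[1] - x[0]

f = np.where(x > 0.0, np.exp(-x), 0.0)
f /= f.sum() * dx
g = np.exp(-x**2 / 4.0) / np.sqrt(4.0 * np.pi)   # centred Gaussian, variance 2

def perturb(p, t):
    """Gaussian perturbation p_t = p * N(0, t) on the grid."""
    kernel = np.exp(-x**2 / (2.0 * t)) / np.sqrt(2.0 * np.pi * t)
    return np.convolve(p, kernel, mode="same") * dx

def rel_entropy(p, q):
    return np.sum(p * np.log(p / q)) * dx

def rel_fisher(p, q):
    score_diff = np.gradient(np.log(p / q), dx)
    return np.sum(p * score_diff**2) * dx

t, dt = 1.0, 1e-3
lhs = (rel_entropy(perturb(f, t + dt), perturb(g, t + dt))
       - rel_entropy(perturb(f, t - dt), perturb(g, t - dt))) / (2.0 * dt)
rhs = -0.5 * rel_fisher(perturb(f, t), perturb(g, t))
print(lhs, rhs)   # the two values agree up to discretisation error
```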
  2. An Integral Representation of the Relative Entropy
We shall make the Gaussian perturbations $f_t$ and $g_t$ of $f$ and $g$, respectively, and consider the relative entropy $D(f_t\,\|\,g_t)$, where the absolute continuity $f_t \ll g_t$ remains true for $t > 0$.
Here, we regard $D(f_t\,\|\,g_t)$ as a function of $t$ and calculate the derivative
$$\frac{d}{dt}\, D(f_t\,\|\,g_t) = \frac{d}{dt}\int f_t \log\frac{f_t}{g_t} = \frac{d}{dt}\int f_t \log f_t - \frac{d}{dt}\int f_t \log g_t \qquad (4)$$
by integration by parts with the help of the heat equation.
Proposition 2.1. Let $f$ and $g$ be probability measures on $\mathbb{R}^n$ with finite Fisher informations $I(f)$ and $I(g)$, and finite relative entropy $D(f\,\|\,g)$. Then we obtain
$$\frac{d}{dt}\, D(f_t\,\|\,g_t) = -\frac{1}{2}\, I(f_t\,\|\,g_t), \qquad t > 0.$$
Proof. First we should notice that the Fisher informations $I(f_t)$ and $I(g_t)$ are finite at any $t > 0$. Indeed, if, for instance, an $n$-dimensional random variable $X$ has the density $f$ and $Z$ is an $n$-dimensional Gaussian random variable independent of $X$ with mean vector $\mathbf{0}$ and covariance matrix $I_n$, then by applying the Stam inequality (1) to the independent random variables $X$ and $\sqrt{t}\,Z$, we have that
$$I(f_t) = I\bigl(X + \sqrt{t}\,Z\bigr) \le \Bigl(\frac{1}{I(X)} + \frac{1}{I(\sqrt{t}\,Z)}\Bigr)^{-1} = \frac{1}{\dfrac{1}{I(f)} + \dfrac{t}{n}} \le \frac{n}{t}, \qquad (5)$$
where $I(\sqrt{t}\,Z) = n/t$ is obtained by a simple calculation. We shall also notice that the function $I(f_t)$ is non-increasing in $t$, that is, for $0 \le s \le t$,
$$I(f_t) \le I(f_s),$$
which can be found in [15] (p. 101). Therefore, the Fisher informations $I(f_t)$ and $I(g_t)$ are bounded by $I(f)$ and $I(g)$, respectively, and hence finite for every $t \ge 0$. Moreover, by a nonlinear approximation argument in [11], we can impose a stronger assumption without loss of generality that
      
Concerning the first term on the right-most side of (4), it follows immediately that
$$\frac{d}{dt}\int f_t \log f_t = -\frac{d}{dt}\, h(f_t) = -\frac{1}{2}\, I(f_t) \qquad (7)$$
by the de Bruijn identity (3); hence, we shall concentrate our attention upon the second term.
Since the densities $f_t$ and $g_t$ satisfy the heat equation (2), the second term can be reformulated as follows:
$$\frac{d}{dt}\int f_t \log g_t = \int \frac{\partial f_t}{\partial t}\,\log g_t + \int \frac{f_t}{g_t}\,\frac{\partial g_t}{\partial t} = \frac{1}{2}\int (\Delta f_t)\,\log g_t + \frac{1}{2}\int \frac{f_t}{g_t}\,\Delta g_t. \qquad (8)$$
In this reformulation, we have interchanged integration and differentiation at the first equality, which is justified by a routine argument with the bounded convergence theorem (see, for instance, [16]).
Applying integration by parts to the first term in the last expression of (8), it becomes
$$\frac{1}{2}\int (\Delta f_t)\,\log g_t = -\frac{1}{2}\int \nabla f_t \cdot \nabla \log g_t, \qquad (9)$$
which can be asserted by the observation below. As $f_t$ has finite Fisher information $I(f_t)$, the factor $\sqrt{f_t}\,\nabla\log f_t$ has finite 2-norm in $L^2(\mathbb{R}^n)$ and must be bounded at infinity. Furthermore, from our technical assumption (6), the factor $\sqrt{f_t}\,\log g_t$ is also bounded. Hence if we factorize as
$$\nabla f_t\,\log g_t = \bigl(\sqrt{f_t}\,\nabla\log f_t\bigr)\cdot\bigl(\sqrt{f_t}\,\log g_t\bigr),$$
then it can be found that this boundary term will vanish at infinity.
Applying integration by parts to the second term in the last expression of (8), it becomes
$$\frac{1}{2}\int \frac{f_t}{g_t}\,\Delta g_t = -\frac{1}{2}\int \nabla\Bigl(\frac{f_t}{g_t}\Bigr)\cdot\nabla g_t. \qquad (10)$$
Here it should be noted that the boundary term $\frac{f_t}{g_t}\,\nabla g_t = f_t\,\nabla\log g_t$ will vanish at infinity by the following observation. Similarly, we factorize it as
$$f_t\,\nabla\log g_t = \sqrt{f_t}\cdot\sqrt{\frac{f_t}{g_t}}\cdot\bigl(\sqrt{g_t}\,\nabla\log g_t\bigr).$$
Then the boundedness of $\sqrt{g_t}\,\nabla\log g_t$ comes from the finiteness of the Fisher information $I(g_t)$, and that of $\sqrt{f_t/g_t}$ is by the assumption (6), as before. Furthermore, the limit formula $\lim_{|x|\to\infty}\sqrt{f_t(x)} = 0$ ensures that this boundary term will vanish at infinity.
Substituting Equations (9) and (10) into (8), it follows that
$$\frac{d}{dt}\int f_t \log g_t = -\frac{1}{2}\int \nabla f_t\cdot\nabla\log g_t - \frac{1}{2}\int \nabla\Bigl(\frac{f_t}{g_t}\Bigr)\cdot\nabla g_t. \qquad (11)$$
Combining Equations (7) and (11), we have that
$$\frac{d}{dt}\, D(f_t\,\|\,g_t) = -\frac{1}{2}\int f_t\,|\nabla\log f_t|^2 + \frac{1}{2}\int \nabla f_t\cdot\nabla\log g_t + \frac{1}{2}\int \nabla\Bigl(\frac{f_t}{g_t}\Bigr)\cdot\nabla g_t = -\frac{1}{2}\int f_t\,\Bigl|\nabla\log\frac{f_t}{g_t}\Bigr|^2,$$
where at the last equality we have used $\nabla\bigl(\tfrac{f_t}{g_t}\bigr)\cdot\nabla g_t = \nabla f_t\cdot\nabla\log g_t - f_t\,|\nabla\log g_t|^2$. This means
$$\frac{d}{dt}\, D(f_t\,\|\,g_t) = -\frac{1}{2}\, I(f_t\,\|\,g_t).$$
Let $X$ and $Y$ be $n$-dimensional random variables with the densities $f$ and $g$, respectively, and let $Z$ be an $n$-dimensional Gaussian random variable independent of $X$ and $Y$ with mean vector $\mathbf{0}$ and covariance matrix $I_n$.
Since the relative entropy is scale invariant, it follows that
$$D(f_t\,\|\,g_t) = D\bigl(X + \sqrt{t}\,Z \,\big\|\, Y + \sqrt{t}\,Z\bigr) = D\Bigl(\tfrac{1}{\sqrt{t}}\,X + Z \,\Big\|\, \tfrac{1}{\sqrt{t}}\,Y + Z\Bigr).$$
We know that both $\tfrac{1}{\sqrt{t}}\,X + Z$ and $\tfrac{1}{\sqrt{t}}\,Y + Z$ converge to $Z$ in distribution as $t \to \infty$. Thus, we have
$$\lim_{t\to\infty} D(f_t\,\|\,g_t) = 0,$$
and the following integral representation for the relative entropy can be obtained:
Theorem 2.2. Let $f$ and $g$ be probability measures with finite Fisher informations and finite relative entropy $D(f\,\|\,g)$. Then we have the integral representation
$$D(f\,\|\,g) = \frac{1}{2}\int_0^\infty I(f_t\,\|\,g_t)\,dt.$$

3. An Application to the Logarithmic Sobolev Inequality
In this section, we shall give a proof of the logarithmic Sobolev inequality for a centered Gaussian measure in the case $n = 1$. Although several proofs of the logarithmic Sobolev inequality have already been given in the literature (see, for instance, [10,17]), we shall give one here again as an application of the integral representation in Theorem 2.2.
Theorem 3.1. Let $g$ be the centered Gaussian measure of variance $\sigma^2$. Then for any probability measure $f$ on $\mathbb{R}$ with finite moment of order 2 and finite Fisher information $I(f)$, the following inequality holds:
$$D(f\,\|\,g) \le \frac{\sigma^2}{2}\, I(f\,\|\,g).$$
Proof. It is clear that the perturbed measure $g_t$ is the centered Gaussian of variance $\sigma^2 + t$, the score of which is given by
$$\frac{d}{dx}\log g_t(x) = -\frac{x}{\sigma^2 + t}.$$
Then using the Stein relation (see, for instance, [15]), the relative Fisher information $I(f_t\,\|\,g_t)$ can be expanded as follows:
$$I(f_t\,\|\,g_t) = \int f_t(x)\Bigl(\frac{f_t'(x)}{f_t(x)} + \frac{x}{\sigma^2+t}\Bigr)^2 dx = I(f_t) - \frac{2}{\sigma^2+t} + \frac{1}{(\sigma^2+t)^2}\int x^2 f_t(x)\,dx. \qquad (12)$$
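The cross term in (12) is where the Stein-type relation enters; assuming, as the integrability here permits, that the boundary term $x f_t(x)$ vanishes at infinity, a one-line verification reads
$$2\int f_t(x)\,\frac{f_t'(x)}{f_t(x)}\cdot\frac{x}{\sigma^2+t}\,dx = \frac{2}{\sigma^2+t}\int x\, f_t'(x)\,dx = -\frac{2}{\sigma^2+t}\int f_t(x)\,dx = -\frac{2}{\sigma^2+t},$$
the middle equality being integration by parts.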
As was seen in (5), by the Stam inequality we have that
$$I(f_t) \le \frac{1}{\tau + t}, \qquad (13)$$
where we put $\tau = I(f)^{-1}$.
Since $f$ has finite moment of order 2, if we write the second moment of $f$ as $m_2$, then it is easy to see that the second moment of $f_t$ is given by
$$\int x^2 f_t(x)\,dx = m_2 + t. \qquad (14)$$
Substituting (13) and (14) into (12), we obtain that
$$I(f_t\,\|\,g_t) \le \frac{1}{\tau+t} - \frac{2}{\sigma^2+t} + \frac{m_2+t}{(\sigma^2+t)^2}.$$
Integrating over $0 \le t \le u$ (writing $m_2 + t = (\sigma^2+t) + (m_2-\sigma^2)$ in the last term), we have
$$\int_0^u I(f_t\,\|\,g_t)\,dt \le \log\frac{\tau+u}{\tau} - \log\frac{\sigma^2+u}{\sigma^2} + \bigl(m_2-\sigma^2\bigr)\Bigl(\frac{1}{\sigma^2} - \frac{1}{\sigma^2+u}\Bigr).$$
Since the logarithm is dominated as $\log x \le x - 1$ for $x > 0$, letting $u \to \infty$ it follows that
$$\int_0^\infty I(f_t\,\|\,g_t)\,dt \le \log\frac{\sigma^2}{\tau} + \frac{m_2}{\sigma^2} - 1 \le \frac{\sigma^2}{\tau} + \frac{m_2}{\sigma^2} - 2 = \sigma^2 I(f) - 2 + \frac{m_2}{\sigma^2}. \qquad (15)$$
On the other hand, the relative Fisher information $I(f\,\|\,g)$ can be given as
$$I(f\,\|\,g) = \int f(x)\Bigl(\frac{f'(x)}{f(x)} + \frac{x}{\sigma^2}\Bigr)^2 dx = I(f) - \frac{2}{\sigma^2} + \frac{m_2}{\sigma^4}. \qquad (16)$$
Combining (15) and (16), we have
$$\int_0^\infty I(f_t\,\|\,g_t)\,dt \le \sigma^2 I(f) - 2 + \frac{m_2}{\sigma^2} = \sigma^2\Bigl(I(f) - \frac{2}{\sigma^2} + \frac{m_2}{\sigma^4}\Bigr) = \sigma^2\, I(f\,\|\,g),$$
which yields the desired inequality by Theorem 2.2.
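For example, the inequality of Theorem 3.1 is saturated by a translated Gaussian: if $f$ is the Gaussian density with mean $m$ and variance $\sigma^2$, a direct calculation gives
$$D(f\,\|\,g) = \frac{m^2}{2\sigma^2}, \qquad I(f\,\|\,g) = \int f(x)\Bigl(\frac{x}{\sigma^2} - \frac{x-m}{\sigma^2}\Bigr)^2 dx = \frac{m^2}{\sigma^4},$$
so that $D(f\,\|\,g) = \tfrac{\sigma^2}{2}\, I(f\,\|\,g)$.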
Remark 3.2. An approach similar to the proof of Theorem 3.1 can be found in the paper by Stam [2], although there it is not for the relative case. Namely, based on convolution inequalities and the de Bruijn identity, the isoperimetric inequality on entropy for a standardized random variable $X$ on $\mathbb{R}$,
$$2\pi e\, e^{-2 h(X)} \le I(X), \qquad (17)$$
was shown. This inequality is essentially the same as the logarithmic Sobolev inequality for the standard Gaussian measure, where the left-hand side in (17) is the reciprocal of the entropy power.
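A sketch of one direction of that equivalence, for a standardized $X$ with density $f$ and with $\gamma$ denoting the standard Gaussian density: since $D(f\,\|\,\gamma) = \tfrac12\log(2\pi e) - h(X)$ and $I(f\,\|\,\gamma) = I(X) - 1$ under this normalization, the inequality (17) can be rewritten and relaxed as
$$I(f\,\|\,\gamma) + 1 = I(X) \;\ge\; 2\pi e\, e^{-2h(X)} = e^{2 D(f\,\|\,\gamma)} \;\ge\; 1 + 2\,D(f\,\|\,\gamma),$$
which is exactly the inequality of Theorem 3.1 with $\sigma^2 = 1$.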