Square Root Convexity of Fisher Information along Heat Flow in Dimension Two

is a random variable with density function satisfying the heat equation. In this paper, we consider the high dimensional case and prove that the Fisher information is square root convex in dimension two, that is

\frac{d^{2}}{d t^{2}} \sqrt{I_{X}} \geq 0

for

n = 2

. The proof is based on the semidefinite programming approach.

Keywords:

Fisher information; heat equation; semidefinite programming; log-convex; square root convex; sum of squares

1. Introduction

Let X be a random variable defined on

R^{n}

with density function

f (x)

, which is assumed to be differentiable. The differential entropy

H (X)

and the Fisher information

I (X)

of X are, respectively, defined to be

\begin{matrix} H (X) : = - \int_{R^{n}} f (x) log f (x) d x and I (X) : = \int_{R^{n}} \frac{{| \nabla f (x) |}^{2}}{f (x)} d x . \end{matrix}

In 1948, Shannon [1] proposed the entropy power inequality (EPI)

N_{X + Y} \geq N_{X} + N_{Y}

, where X and Y are independent random variables defined by

R^{n}

and

N (X) : = exp (\frac{2}{n} H (X)) /

(2 π e) .

As one of the most important inequalities in information theory, Shannon’s EPI has many proofs and applications [2,3,4,5,6].

In 1985, Costa [7] proved a generalization of Shannon’s EPI, that is, the entropy power

N (X_{t})

of

X_{t} = X + \sqrt{t} Z

is concave in t, where X is a random variable and

Z = N (0, I_{n})

is the n-dimensional standard normal distribution, independent of X. This inequality also has many proofs and applications [8,9,10,11].

Costa also proved that

\frac{d}{d t} H (X_{t}) \geq 0

and

\frac{d^{2}}{d t^{2}} H (X_{t}) \leq 0

[7] (Corollary 1). Along this line, Cheng and Geng [12] proposed the completely monotone conjecture (CMC)

{(- 1)}^{m + 1} \frac{d^{m}}{d t^{m}} H (X_{t}) \geq 0, m \in N_{+}

and proved the conjecture for

m = 3, 4

and

n = 1

. Guo, Yuan, and Gao [13] proved the conjecture in the cases

m = 3, n = 2, 3, 4

and the case

m = 4, n = 2

, using semidefinite programming (SDP) software programs. Other related results were also obtained based on the SDP approach [14,15].

The CMC was implicitly considered by Mckean [16] in studying the entropy for solutions of the heat equation

u_{t} = ▵ u

. The density function of

X_{t}

is a solution of the heat equation

u_{t} = \frac{1}{2} ▵ u

[2]. Interestingly, the converse is also true; that is, if the density function of a random variable

Y_{t}

is a solution of the heat equation, then

Y_{t}

has the form of

X_{t}

[11]. Thus, studying properties of

H (X_{t})

and

I (X_{t})

are equivalent to studying that of a probability measure satisfying the heat equation.

Cheng and Geng [12] also proposed the log-convexity conjecture: the Fisher information along the heat flow is log-convex, which can be deduced from CMC. In 2021, Ledoux, Nair, and Wang [17] proved the log-convexity conjecture for

n = 1

.

In this paper, we consider the two-dimensional case as suggested in [17]. We prove the square root convexity (abbr. sqrt-convexity) of Fisher information along heat flow in dimension two. Precisely, we prove the following result.

Theorem 1.

Let X be a random variable defined on

R^{2}

,

Z = N (0, I_{2})

a Gaussian variable independent of X, and

X_{t} = X + \sqrt{t} Z

. Then we have

\frac{d^{2}}{d t^{2}} \sqrt{I (X_{t})} \geq 0 .

(1)

The main idea of the proof is that proof for inequality (1) can be reduced to the proof of whether a quadratic polynomial is a sum of squares (SOS) [18] of linear forms, which can be solved with SDP [19]. The SOS is explicitly given, which provides a rigourous proof for the theorem. The SDP problem related with Theorem 1 has 71 variables, which is difficult to solve by manual calculation.

We also show that log-convexity of the Fisher information along heat flow in dimension two cannot be proven with the SDP approach. More precisely, the SDP software program terminates, but fails to give a solution to prove the log-convexity. This does not imply that the log-convexity in dimension two is not correct, because the SOS problem to be solved with the SDP program is only a sufficient condition but not a necessary for the log-convexity. Theorem 1 is proven as a weaker form of the log-convexity conjecture for

n = 2

. We also show that Theorem 1 implies the CMC for the third-order derivative in dimension two without assuming the log-concavity of p(x). Refer to Corollary 1 for details.

In Theorem 1, we do not assume that X is a log-concave variable. If adding the log-concave condition, then from Toscani [20],

\frac{1}{I (X_{t})}

is concave, which implies inequality (1) and the proof can be found in Lemma 2.

A drawback of the approach based on SDP is that the proof is difficult for people to check. Although the SOS gives an explicit proof for the theorem, it is quite large to be computed manually. To alleviate this problem, we give the programs and data in github.com, so that interested readers may check the proof using software systems. Refer to Remark 2 for details on how to do this. We also give an illustration for the method by proving Theorem 1 for the case n = 1 in Section 3.1. On the other hand, in the proof of information inequalities, it often happens that the computation is too large to be performed manually, and using computer programs becomes one of the major approaches in proving information inequalities [14,21,22,23,24,25]. To show our result more intuitively, we give the figures of

\sqrt{I (X_{t})}

and

log I (X_{t})

in Figure 1, where

p (y_{1}, y_{2})

in Equation (2) is

\frac{y_{1}^{2} y_{2}^{2}}{2 π} exp (- \frac{y_{1}^{2} + y_{2}^{2}}{2})

. In this case, both

\sqrt{I (X_{t})}

and

log I (X_{t})

are convex in t.

Figure 1. Figures for

\sqrt{I (X_{t})}

and

log I (X_{t})

which are convex in t.

2. Preliminaries

2.1. Notations and Preliminary Results

Let X be a random variable defined by

R^{n}

with density function

p (x)

, which is assumed to be differentiable and

Z = N (0, I_{n})

the n-dimensional standard normal distribution, independent of X. Then

X_{t} = X + \sqrt{t} Z

is also a random variable defined on

R^{n}

with density function

f (x, t) : = \frac{1}{{(2 π t)}^{n / 2}} \int_{R^{n}} p (y) exp (- \frac{{∥ x - y ∥}^{2}}{2 t}) d y,

(2)

which is differentiable since

p (x)

is. It is known that

f (x, t)

satisfies the heat Equation (2)

\frac{\partial}{\partial t} f (x, t) = \frac{1}{2} ▵ f (x, t) .

The differential entropy

H (X_{t})

and Fisher information

I (X_{t})

of

X_{t}

are, respectively, defined as

\begin{matrix} H (X_{t}) : = - \int_{R^{n}} f (x, t) log f (x, t) d x and \\ I (X_{t}) : = \int_{R^{n}} \frac{{| \nabla f (x, t) |}^{2}}{f (x, t)} d x . \end{matrix}

For convenience, we use

H (t)

and

I (t)

to denote

H (X_{t})

and

I (X_{t})

in the rest of the paper.

We can easily obtain the following relation between

H (t)

and

I (t)

by de Bruijn’s identity [2]:

\frac{d}{d t} H (t) = \frac{1}{2} I (t) .

(3)

By the definition of

I (t)

, the Fisher information is always positive, so we can take the square root of it. By Equation (3) and the fact

\frac{\partial^{2}}{\partial t^{2}} H (t) \leq 0

[7], the first derivative of the Fisher information is always negative:

\frac{d}{d t} \sqrt{I (t)} = \frac{1}{2 \sqrt{I (t)}} \frac{d}{d t} I (t) = \frac{1}{\sqrt{I (t)}} \frac{\partial^{2}}{\partial t^{2}} H (t) \leq 0 .

(4)

A function

f (t)

is called sqrt-convex in t if the square root of

f (t)

is convex in t. The following lemma gives an equivalent form of sqrt-convexity, which will be used in the proof of Lemma 10.

Lemma 1.

Theorem 1 is valid, that is,

I (t)

is sqrt-convex in t, if and only if

2 I (t) \frac{d^{2}}{d t^{2}} I (t) - {(\frac{d}{d t} I (t))}^{2} \geq 0 .

(5)

Proof.

The convexity of

\sqrt{I (t)}

is equivalent to the fact that second-order derivative of

\sqrt{I (t)}

is positive. From Equation (4), we have

\begin{matrix} \frac{d^{2}}{d t^{2}} \sqrt{I (t)} & = - \frac{1}{4 I (t) \sqrt{I (t)}} {(\frac{d}{d t} I (t))}^{2} + \frac{1}{2 \sqrt{I (t)}} \frac{d^{2}}{d t^{2}} I (t) \\ = \frac{1}{4 I (t) \sqrt{I (t)}} (2 I (t) \frac{d^{2}}{d t^{2}} I (t) - {(\frac{d}{d t} I (t))}^{2}) . \end{matrix}

Since

I (t) > 0

, the lemma is proven. □

Corollary 1.

If

I (t)

is sqrt-convex in t for

n = 2

, then the CMC for the third-order with dimension two is correct.

Proof.

Since

\frac{d}{d t} H (t) = \frac{1}{2} I (t)

, it suffices to prove

\frac{d^{2}}{d t^{2}} I (t) \geq 0

. Using Lemma 1, if

I (t)

is sqrt-convex in t for

n = 2

, then we have

2 I (t) \frac{d^{2}}{d t^{2}} I (t) \geq {(\frac{d}{d t} I (t))}^{2} \geq 0

. Because

I (t) > 0

, then

\frac{d^{2}}{d t^{2}} I (t) \geq 0

. □

Lemma 2 gives the relationship among sqrt-convexity, log-convexity, and concavity of

\frac{1}{I (t)}

.

Lemma 2.

If

\frac{1}{I (t)}

is concave in t, then

log (I (t))

is convex in t. If

log (I (t))

is convex in t, then

I (t)

is sqrt-convex in t.

Proof.

Since

\frac{d^{2}}{d t^{2}} \frac{1}{I (t)} = \frac{1}{I {(t)}^{3}} (2 {(\frac{d}{d t} I (t))}^{2} - I (t) \frac{d^{2}}{d t^{2}} I (t)) \leq 0

, we have

I (t) \frac{d^{2}}{d t^{2}} I (t) \geq 2 {(\frac{d}{d t} I (t))}^{2} \geq {(\frac{d}{d t} I (t))}^{2}

. Then,

\frac{d^{2}}{d t^{2}} log (I (t)) = \frac{1}{I {(t)}^{2}} (I (t) \frac{d^{2}}{d t^{2}} I (t) - {(\frac{d}{d t} I (t))}^{2}) \geq 0

, which means that

log (I (t))

is convex. Similarly, convexity of

log (I (t))

means that

I (t) \frac{d^{2}}{d t^{2}} I (t) \geq {(\frac{d}{d t} I (t))}^{2}

. Then we can obtain

2 I (t) \frac{d^{2}}{d t^{2}} I (t) \geq {(\frac{d}{d t} I (t))}^{2}

. By Lemma 1,

I (t)

is sqrt-convex in t. □

We consider the two-dimensional case and suppose that the two variables are

x = {x_{1}, x_{2}}

. For convenience, we use f instead of

f (x, t)

and

f_{a, b}

instead of

\frac{\partial^{a + b} f (x, t)}{\partial x_{1}^{a} \partial x_{2}^{b}}

. Then we can rewrite the Fisher information as

I (t) = \int_{R^{2}} \frac{f_{1, 0}^{2} + f_{0, 1}^{2}}{f} d x_{1} d x_{2}

(6)

and the heat equation as

\frac{\partial f}{\partial t} = \frac{f_{2, 0} + f_{0, 2}}{2} .

(7)

By Equation (7), it is easy to see that for each

f_{a, b} = \frac{\partial^{a + b} f}{\partial x_{1}^{a} \partial x_{2}^{b}}

, we have

\frac{\partial f_{a, b}}{\partial t} = \frac{\partial^{a + b}}{\partial x_{1}^{a} \partial x_{2}^{b}} \frac{\partial f}{\partial t} = \frac{\partial^{a + b}}{\partial x_{1}^{a} \partial x_{2}^{b}} \frac{f_{2, 0} + f_{0, 2}}{2} = \frac{f_{2 + a, b} + f_{a, b + 2}}{2} .

(8)

In the following, we formally define the concept of differential forms, which are used to reduce the size of the SDP problems to be solved. Refer to Remark 1 for details.

A differential monomial is of the form

M = \prod_{i = 1}^{k} v_{i}^{n_{i}}

, where

v_{i} = f_{a_{i}, b_{i}}

,

n_{i} \in N_{+}

, and

a_{i}, b_{i} \in N

. We define the order of

v_{i}

to be

ord (v_{i}) = a_{i} + b_{i}

, the total order of M to be

ord (M) = \sum_{i = 1}^{k} n_{i} \cdot ord (v_{i})

. The total degree of M is

\deg (M) = \sum_{i = 1}^{k} n_{i}

. A differential polynomial is a finite linear combination of differential monomials over

Q

. A differential polynomial P is called the k-th order differentially homogenous polynomial, or simply a k-th order differential form, if each of its differential monomial is of total degree k and total order k.

In Lemma 3, we compute the expression of

I (t), \frac{d}{d t} I (t), \frac{d^{2}}{d t^{2}} I (t)

.

Lemma 3.

We have

\begin{matrix} I (t) = \int_{R^{2}} \frac{I_{1}}{f} d x_{1} d x_{2}, \frac{d}{d t} I (t) = \int_{R^{2}} \frac{I_{2}}{f^{3}} d x_{1} d x_{2}, \frac{d^{2}}{d t^{2}} I (t) = \int_{R^{2}} \frac{I_{3}}{f^{5}} d x_{1} d x_{2}, \end{matrix}

(9)

where each

I_{i}

is a

2 i

-th order differential form for

i = 1, 2, 3

.

Proof.

By Equation (6),

I_{1} = f_{1, 0}^{2} + f_{0, 1}^{2}

(10)

is a second-order differential form, so the lemma is correct for

I_{1}

. For

I_{2}

,

\frac{d}{d t} I (t) = \int_{R^{2}} \frac{1}{f} \frac{\partial I_{1}}{\partial t} - \frac{I_{1}}{f^{2}} \frac{\partial f}{\partial t} = \int_{R^{2}} \frac{1}{f^{3}} (f^{2} \frac{\partial I_{1}}{\partial t} - f I_{1} \frac{f_{2, 0} + f_{0, 2}}{2}) .

Then,

\begin{matrix} I_{2} & = f^{2} \frac{\partial I_{1}}{\partial t} - f I_{1} \frac{f_{2, 0} + f_{0, 2}}{2} \\ = f^{2} f_{1, 0} f_{3, 0} + f^{2} f_{1, 0} f_{1, 2} + f^{2} f_{0, 1} f_{2, 1} + f^{2} f_{0, 1} f_{0, 3} \\ - \frac{f f_{1, 0}^{2} f_{2, 0}}{2} - \frac{f f_{0, 1}^{2} f_{2, 0}}{2} - \frac{f f_{1, 0}^{2} f_{0, 2}}{2} - \frac{f f_{0, 1}^{2} f_{0, 2}}{2} \\ = f_{1, 0} (f^{2} f_{3, 0} + f^{2} f_{1, 2} - \frac{f f_{1, 0} f_{2, 0}}{2} - \frac{f f_{1, 0} f_{0, 2}}{2}) \\ + f_{0, 1} (f^{2} f_{2, 1} + f^{2} f_{0, 3} - \frac{f f_{0, 1} f_{2, 0}}{2} - \frac{f f_{0, 1} f_{0, 2}}{2}) \\ = f_{1, 0} F_{1, 0} + f_{0, 1} F_{0, 1}, \end{matrix}

(11)

where

\begin{matrix} F_{1, 0} : = f^{2} f_{3, 0} + f^{2} f_{1, 2} - \frac{f f_{1, 0} f_{2, 0}}{2} - \frac{f f_{1, 0} f_{0, 2}}{2}, \\ F_{0, 1} : = f^{2} f_{2, 1} + f^{2} f_{0, 3} - \frac{f f_{0, 1} f_{2, 0}}{2} - \frac{f f_{0, 1} f_{0, 2}}{2} \end{matrix}

(12)

are third-order differential forms.

Thus,

I_{2}

is a fourth-order differential form. Similarly, we can show that

I_{3}

is a sixth-order differential form:

\begin{matrix} I_{3} & = f^{4} f_{3, 0} f_{1, 2} + f^{4} f_{2, 1} f_{0, 3} + f^{2} f_{2, 0} f_{1, 0}^{2} f_{0, 2} + f^{2} f_{2, 0} f_{0, 1}^{2} f_{0, 2} - f^{3} f_{3, 0} f_{2, 0} f_{1, 0} \\ - f^{3} f_{3, 0} f_{0, 2} f_{1, 0} - f^{3} f_{2, 1} f_{2, 0} f_{0, 1} - f^{3} f_{2, 1} f_{0, 2} f_{0, 1} - f^{3} f_{1, 2} f_{2, 0} f_{1, 0} - f^{3} f_{1, 2} f_{0, 2} f_{1, 0} \\ - f^{3} f_{0, 3} f_{2, 0} f_{0, 1} - f^{3} f_{0, 3} f_{0, 2} f_{0, 1} + 1 / 2 f^{2} f_{0, 2}^{2} f_{0, 1}^{2} + 1 / 2 f^{2} f_{2, 0}^{2} f_{1, 0}^{2} + 1 / 2 f^{2} f_{2, 0}^{2} f_{0, 1}^{2} \\ + 1 / 2 f^{2} f_{0, 2}^{2} f_{1, 0}^{2} + 1 / 2 f^{4} f_{1, 2}^{2} + 1 / 2 f^{4} f_{2, 1}^{2} + 1 / 2 f^{4} f_{0, 3}^{2} + 1 / 2 f^{4} f_{3, 0}^{2} \\ - 1 / 4 f^{3} f_{0, 1}^{2} f_{4, 0} + 1 / 2 f^{4} f_{0, 1} f_{4, 1} - 1 / 4 f^{3} f_{0, 1}^{2} f_{0, 4} - 1 / 2 f^{3} f_{0, 1}^{2} f_{2, 2} + f^{4} f_{0, 1} f_{2, 3} \\ + f^{4} f_{1, 0} f_{3, 2} - 1 / 2 f^{3} f_{1, 0}^{2} f_{2, 2} - 1 / 4 f^{3} f_{1, 0}^{2} f_{4, 0} + 1 / 2 f^{4} f_{0, 1} f_{0, 5} - 1 / 4 f^{3} f_{1, 0}^{2} f_{0, 4} \\ + 1 / 2 f^{4} f_{1, 0} f_{5, 0} + 1 / 2 f^{4} f_{1, 0} f_{1, 4} . \end{matrix}

(13)

The lemma is proven. □

Inspired by Cauchy–Schwarz inequality, we obtain the following inequality which is used in the proof of Lemma 9.

Lemma 4.

For functions

f_{1}, f_{2}, g_{1}, g_{2}

in

x = {x_{1}, x_{2}}

, we have

{(\int_{R^{2}} (f_{1} g_{1} + f_{2} g_{2}) d x_{1} d x_{2})}^{2} \leq \int_{R^{2}} (f_{1}^{2} + f_{2}^{2}) d x_{1} d x_{2} \int_{R^{2}} (g_{1}^{2} + g_{2}^{2}) d x_{1} d x_{2} .

(14)

Proof.

Using the Cauchy–Schwarz inequality, we have

| f_{1} g_{1} + f_{2} g_{2} | \leq \sqrt{(f_{1}^{2} + f_{2}^{2}) (g_{1}^{2} + g_{2}^{2})}

. Using the Cauchy–Schwarz inequality of integral form, we have

\begin{matrix} {(\int_{R^{2}} \sqrt{(f_{1}^{2} + f_{2}^{2}) (g_{1}^{2} + g_{2}^{2})} d x_{1} d x_{2})}^{2} \leq \int_{R^{2}} (f_{1}^{2} + f_{2}^{2}) d x_{1} d x_{2} \int_{R^{2}} (g_{1}^{2} + g_{2}^{2}) d x_{1} d x_{2} . \end{matrix}

Combining the above two inequalities, we prove the lemma. □

2.2. Constraints

The density function f and its derivatives satisfy certain integral equations, from which the constraints of the SDP problems to be solved are obtained. Due to these reasons, these integral equations are called constraints. Precisely, a

2 m

-th order differential form R is called a

2 m

-th order constraint, if

\begin{matrix} \int_{R^{2}} \frac{R}{f^{2 m - 1}} d x_{1} d x_{2} = 0 . \end{matrix}

It is easy to see that the equations in (9) are still valid if

I_{k}

is replaced by

I_{k} + C_{k}

, when

C_{k}

is a

2 k

-th order constraint. Guo, Yuan, and Gao [13] proposed a method to compute the constraints, which will be used here to compute the constraints in dimension two. In the following, we show how to compute the

2 m

-order constraints.

Lemma 5

([13]). Let

k, m_{i}, n_{i} \in N_{+}

and

f^{(m_{i})}

be the

m_{i}

-th order derivative of f in Equation (2). Then

\int_{- \infty}^{\infty} f [\prod_{i = 1}^{k} \frac{{[f^{(m_{i})}]}^{k_{i}}}{f^{k_{i}}}] |_{x_{a} = - \infty}^{\infty} d x_{b} = 0,

(15)

where

a = 1, b = 2

or

a = 2, b = 1

.

This lemma guarantees that when using the integration by parts, the integral term of lower dimensions vanishes. The following lemma shows how to generate constraints. We repeat the proof here, because the proof procedure will be used in the proof of Lemma 7.

Lemma 6.

Let M be a differential monomial with total order

2 m - 1

. Then we can use integration by parts to obtain a

2 m

-th order constraint from M.

Proof.

Let

x_{a}

be one of the variables

x_{1}, x_{2}

, and

x_{b}

be another variable. Then we have

\int_{R^{2}} \frac{\partial}{\partial x_{a}} \frac{M}{f^{2 m - 2}} d x_{a} d x_{b} = \int_{- \infty}^{\infty} f \frac{M}{f^{2 m - 1}} |_{x_{a} = - \infty}^{\infty} d x_{b} \overset{(15)}{=} 0 .

Then using integration by parts, we have

\begin{matrix} \int_{R^{2}} \frac{\partial}{\partial x_{a}} \frac{M}{f^{2 m - 2}} d x_{a} d x_{b} & = \int_{R^{2}} \frac{1}{f^{2 m - 2}} \frac{\partial M}{\partial x_{a}} - (2 m - 2) M \frac{\partial f}{\partial x_{a}} \frac{1}{f^{2 m - 1}} d x_{a} d x_{b} \\ = \int_{R^{2}} \frac{1}{f^{2 m - 1}} (f \frac{\partial M}{\partial x_{a}} - (2 m - 2) M \frac{\partial f}{\partial x_{a}}) d x_{a} d x_{b} = 0 . \end{matrix}

Thus,

M^{'} : = f \frac{\partial M}{\partial x_{a}} - (2 m - 2) M \frac{\partial f}{\partial x_{a}}

is a

2 m

-th order constraint and the lemma is proven. □

3. Proof of Theorem 1

The proof of Theorem 1 mainly consists of two steps. The first step, summarized in Lemma 10, is used to reduce the proof of Theorem 1 to the proof of the non-negativeness for a quadratic form with undetermined coefficients. This step is given in Section 3.2, Section 3.3 and Section 3.4. The reduction has three main ingredients: (1) Constraints given in Lemma 8 are used to form the SOS and Lemmas 5 and 6 show how to compute the constraints. (2) Lemma 7 is used to reduce all involved quantities into quadratic forms in certain variables. (3) By introducing

{\tilde{J}}_{3}

in Lemma 9 and using the Cauchy–Schwarz inequality in Lemma 4, the quantity

{(\int_{R^{2}} \frac{I_{2}}{f^{3}} d x_{1} d x_{2})}^{2}

is relaxed to a simple form.

The second step, given in Section 3.5, is to compute the undetermined coefficients of the quadratic form using SDP, which is summarized as Problem 1. This step has two sub-steps: (1) In Problem 2, the undetermined coefficients

α_{i}

and

β_{j}

are computed by omitting the second degree terms. (2) In Problem 5, the undetermined coefficients

λ_{k}

are computed using the values of

α_{i}

and

β_{j}

obtained in the first sub-step. In these two sub-steps, the quadratic forms are linear in the undetermined coefficients which can be computed with SDP and the computation procedure is given in Problems 3 and 4.

3.1. An Illustrative Example

In this subsection, we will prove Theorem 1 for n =1 and use this as an illustration of our proving method.

By Lemma 1, it suffices to prove (5). For convenience, we write

f (x, t)

as f and

\frac{\partial^{a}}{\partial x^{a}} f (x, t)

as

f_{a}

. Using Lemma 6, we can obtain the constraints

{\hat{E}}_{i}, i = 1, \dots, 6

:

\begin{matrix} \int_{R} \frac{3 f f_{1}^{2} f_{2} - 2 f_{1}^{4}}{f^{3}} d x = 0 . \\ \int_{R} \frac{{\hat{E}}_{i}}{f^{5}} d x = 0 (i = 1 \dots 6), \\ {\hat{E}}_{1} = f^{4} f_{1} f_{5} + f^{4} f_{2} f_{4} - f^{3} f_{1}^{2} f_{4}, & {\hat{E}}_{2} = f^{4} f_{2} f_{4} + f^{4} f_{3}^{2} - f^{3} f_{1} f_{2} f_{3}, \\ {\hat{E}}_{3} = f^{3} f_{1}^{2} f_{4} + 2 f^{3} f_{1} f_{2} f_{3} - 2 f^{2} f_{1}^{3} f_{3}, & {\hat{E}}_{4} = 2 f^{3} f_{1} f_{2} f_{3} + f^{3} f_{2}^{3} - 2 f^{2} f_{1}^{2} f_{2}^{2}, \\ {\hat{E}}_{5} = f^{2} f_{1}^{3} f_{3} + 3 f^{2} f_{1}^{2} f_{2}^{2} - 3 f f_{1}^{4} f_{2}, & {\hat{E}}_{6} = 5 f f_{1}^{4} f_{2} - 4 f_{1}^{6} . \end{matrix}

(16)

By Lemma 3, we have

\begin{matrix} I (t) & = \int_{R} \frac{f_{1}^{2}}{f} d x, \\ \frac{d}{d t} I (t) & = \int_{R} \frac{2 f^{2} f_{1} f_{3} - f f_{1}^{2} f_{2}}{2 f^{3}} d x = \int_{R} \frac{f_{1} (2 f^{2} f_{3} - f f_{1} f_{2})}{2 f^{3}} d x, \\ = \int_{R} \frac{f_{1} (2 f^{2} f_{3} - f f_{1} f_{2} + α (3 f f_{1} f_{2} - 2 f_{1}^{3}))}{2 f^{3}} d x, \\ \frac{d^{2}}{d t^{2}} I (t) & = \int_{R} \frac{2 f^{4} f_{3}^{2} - 4 f^{3} f_{1} f_{2} f_{3} + 2 f^{4} f_{1} f_{5} + 2 f^{2} f_{1}^{2} f_{2}^{2} - f^{3} f_{1}^{2} f_{4}}{4 f^{5}} d x \\ = \int_{R} \frac{E_{2}}{4 f^{5}} d x, \end{matrix}

where

E_{2} : = 2 f^{4} f_{3}^{2} - 4 f^{3} f_{1} f_{2} f_{3} + 2 f^{4} f_{1} f_{5} + 2 f^{2} f_{1}^{2} f_{2}^{2} - f^{3} f_{1}^{2} f_{4}

. By Lemma 4,

\begin{matrix} {(\frac{d}{d t} I (t))}^{2} & \leq \int_{R} \frac{f_{1}^{2}}{f} d x \int_{R} \frac{{(2 f^{2} f_{3} - f f_{1} f_{2} + α (3 f f_{1} f_{2} - 2 f_{1}^{3}))}^{2}}{4 f^{5}} d x, \\ = I (t) \int_{R} \frac{E_{1} {(α)}^{2}}{4 f^{5}} d x, \end{matrix}

where

E_{1} (α) : = 2 f^{2} f_{3} - f f_{1} f_{2} + α (3 f f_{1} f_{2} - 2 f_{1}^{3})

.

By Lemma 1, it suffices to find an

α

such that

2 E_{2} - E_{1} {(α)}^{2} \geq 0

is true under the constraints

{\hat{E}}_{i}, i = 1, \dots, 6

, which is a consequence of the following SOS:

2 E_{2} - E_{1} {(- \frac{1}{3})}^{2} - 4 {\hat{E}}_{1} + 4 {\hat{E}}_{2} - 2 {\hat{E}}_{3} + \frac{4}{3} {\hat{E}}_{5} - \frac{4}{15} {\hat{E}}_{6} = {(2 f^{2} f_{3} - 2 f f_{1} f_{2} + \frac{2}{3} f_{1}^{3})}^{2} + \frac{8}{45} f_{1}^{6} \geq 0 .

(17)

By (16) and (17),

\begin{matrix} 2 I (t) \frac{d^{2}}{d t^{2}} I (t) - {(\frac{d}{d t} I (t))}^{2} & \geq I (t) \int_{R} \frac{2 E_{2} - E_{1} {(α)}^{2}}{4 f^{5}} d x \\ = I (t) \int_{R} \frac{2 E_{2} - E_{1} {(- \frac{1}{3})}^{2} - 4 {\hat{E}}_{1} + 4 {\hat{E}}_{2} - 2 {\hat{E}}_{3} + \frac{4}{3} {\hat{E}}_{5} - \frac{4}{15} {\hat{E}}_{6}}{4 f^{5}} d x \\ \geq 0 . \end{matrix}

Theorem 1 of case n = 1 is proven.

Equation (17) can be obtained in two steps. In the first step, we compute

α

. Instead of

2 E_{2} - E_{1} {(α)}^{2} \geq 0

, we consider

2 E_{2} - ({(2 f^{2} f_{3} - f f_{1} f_{2})}^{2} + 2 α (2 f^{2} f_{3} - f f_{1} f_{2}) (3 f f_{1} f_{2} - 2 f_{1}^{3})) \geq 0

under the constraints, which can be solved by SDP since

α

is linear in the expression. Suppose that the solution for

α

is

α_{0}

.

In the second step, we check whether

2 E_{2} - E_{1} {(α_{0})}^{2} \geq 0

is valid under the constraints using SDP, and the SOS in (17) can be found. Details of the proof procedure are given in the rest of this section.

3.2. Compute Constraints

In this section, we compute the fourth-order and sixth-order constraints using Lemma 6. For instance, from the differential monomial

M = f f_{0, 1} f_{2, 0}

with total order 3, we obtain two fourth-order constraints:

\begin{matrix} C_{1} = f \frac{\partial M}{\partial x_{1}} - 2 M \frac{\partial f}{\partial x_{1}} = f^{2} f_{3, 0} f_{0, 1} + f^{2} f_{2, 0} f_{1, 1} - f f_{2, 0} f_{1, 0} f_{0, 1}, \\ C_{2} = f \frac{\partial M}{\partial x_{2}} - 2 M \frac{\partial f}{\partial x_{2}} = f^{2} f_{2, 1} f_{0, 1} + f^{2} f_{2, 0} f_{0, 2} - f f_{2, 0} f_{0, 1}^{2} . \end{matrix}

By considering all differential monomials with total order 3 and total degree 3, we obtain 20 constraints. Some of the constraints cannot be divided by

f_{0, 1}

or

f_{1, 0}

, which are not needed in the proof due to the form of

I_{2}

in Equation (11). Finally, we obtain eight fourth-order constraints

f_{1, 0} P_{i} (1 \leq i \leq 4)

and

f_{0, 1} Q_{i} (1 \leq i \leq 4)

, where

\begin{matrix} P_{1} = 3 f f_{1, 0} f_{2, 0} - 2 f_{1, 0}^{3}, & P_{2} = 3 f f_{1, 0} f_{1, 1} - 2 f_{0, 1} f_{1, 0}^{2}, \\ P_{3} = 2 f f_{0, 1} f_{1, 1} + f f_{0, 2} f_{1, 0} - 2 f_{0, 1}^{2} f_{1, 0}, & P_{4} = 2 f f_{0, 1} f_{2, 0} + f f_{1, 0} f_{1, 1} - 2 f_{0, 1} f_{1, 0}^{2}, \\ Q_{1} = 3 f f_{0, 1} f_{0, 2} - 2 f_{0, 1}^{3}, & Q_{2} = 3 f f_{0, 1} f_{1, 1} - 2 f_{0, 1}^{2} f_{1, 0}, \\ Q_{3} = f f_{0, 1} f_{1, 1} + 2 f f_{0, 2} f_{1, 0} - 2 f_{0, 1}^{2} f_{1, 0}, & Q_{4} = f f_{0, 1} f_{2, 0} + 2 f f_{1, 0} f_{1, 1} - 2 f_{0, 1} f_{1, 0}^{2} . \end{matrix}

(18)

Similarly, we obtain 136 sixth-order constraints

R_{j} (1 \leq j \leq 136)

. In summary, we obtain constraints

f_{1, 0} P_{i} (1 \leq i \leq 4), f_{0, 1} Q_{i} (1 \leq i \leq 4)

, and

R_{j} (1 \leq j \leq 136)

, which satisfy

\begin{matrix} \begin{matrix} \int_{R^{2}} \frac{f_{1, 0} P_{i}}{f^{3}} d x_{1} d x_{2} = \int_{R^{2}} \frac{f_{0, 1} Q_{i}}{f^{3}} d x_{1} d x_{2} = 0, i = 1, \dots, 4, \\ \int_{R^{2}} \frac{R_{j}}{f^{5}} d x_{1} d x_{2} = 0, j = 1, \dots, 136 . \end{matrix} \end{matrix}

(19)

3.3. Reduce to Quadratic Form

In order to obtain an SDP problem with a smaller size, we will reduce all differential polynomials in the proof into quadratic forms in a set of new variables

M = {M_{i} : 1 \leq i \leq 14}

which are all the differential monomials with total order 3 and total degree 3:

M = \{\begin{matrix} M_{1} = f^{2} f_{3, 0}, & M_{2} = f^{2} f_{2, 1}, & M_{3} = f^{2} f_{1, 2}, & M_{4} = f^{2} f_{0, 3}, \\ M_{5} = f f_{2, 0} f_{1, 0}, & M_{6} = f f_{2, 0} f_{0, 1}, & M_{7} = f f_{1, 1} f_{1, 0} & M_{8} = f f_{1, 1} f_{0, 1}, \\ M_{9} = f f_{0, 2} f_{1, 0}, & M_{10} = f f_{0, 2} f_{0, 1} & M_{11} = f_{1, 0}^{3}, & M_{12} = f_{1, 0}^{2} f_{0, 1}, \\ M_{13} = f_{1, 0} f_{0, 1}^{2}, & M_{14} = f_{0, 1}^{3} . \end{matrix}\}

(20)

We rewrite

F_{1, 0}, F_{0, 1}

in Equation (12) and

P_{i} (1 \leq i \leq 4), Q_{i} (1 \leq i \leq 4)

in Equation (18) as linear forms in

M

:

\begin{matrix} {\tilde{F}}_{1, 0} = M_{1} + M_{3} - \frac{1}{2} M_{5} - \frac{1}{2} M_{9}, & {\tilde{F}}_{0, 1} = M_{2} + M_{4} - \frac{1}{2} M_{6} - \frac{1}{2} M_{10}, \\ {\tilde{P}}_{1} = 3 M_{5} - 2 M_{11}, & {\tilde{P}}_{2} = 3 M_{7} - 2 M_{12}, \\ {\tilde{P}}_{3} = 2 M_{8} + M_{9} - 2 M_{13}, & {\tilde{P}}_{4} = 2 M_{6} + M_{7} - 2 M_{12}, \\ {\tilde{Q}}_{1} = 3 M_{10} - 2 M_{14}, & {\tilde{Q}}_{2} = 3 M_{8} - 2 M_{13}, \\ {\tilde{Q}}_{3} = M_{8} + 2 M_{9} - 2 M_{13}, & {\tilde{Q}}_{4} = M_{10} + 2 M_{7} - 2 M_{12} . \end{matrix}

(21)

The following lemma shows that any sixth-order constraint can be reduced to another sixth-order constraint which can be written as a quadratic form in

M

.

Lemma 7.

For any differential monomial M with total order 6 and total degree 6, we can compute a sixth-order differential form P such that

\int_{R^{2}} \frac{M}{f^{5}} d x_{1} d x_{2} = \int_{R^{2}} \frac{P}{f^{5}} d x_{1} d x_{2}

and P is a quadratic form in

M

in Equation (20).

Proof.

Since M is a differential monomial with total degree 6 and total order 6, let

M = \prod_{i = 1}^{6} v_{i}

with

v_{i} = f_{a_{i}, b_{i}} = \frac{\partial^{c_{i}} f}{\partial x_{1}^{a_{i}} \partial x_{2}^{b_{i}}}

satisfying

c_{i} = a_{i} + b_{i}

,

\sum_{i = 1}^{6} c_{i} = 6

, and

c_{s} \geq c_{k}

for

s \leq k

. We call

(c_{1}, \dots, c_{6})

the order type and

c_{1}

the leading order of M.

If

c_{1} \geq 4

, similar to the proof of Lemma 6, we can use integration by parts to obtain a new polynomial

P_{1}

with leading order

c_{1} - 1

.

\begin{matrix} \int_{R^{2}} \frac{M}{f^{5}} d x_{1} d x_{2} & = & \int_{R^{2}} \frac{1}{f^{5}} \frac{\partial^{c_{1}} f}{\partial x_{1}^{a_{1}} \partial x_{2}^{b_{1}}} \prod_{i = 2}^{6} v_{i} d x_{1} d x_{2} \\ = & - \int_{R^{2}} \frac{\partial^{c_{1} - 1} f}{\partial x_{1}^{a_{1} - 1} \partial x_{2}^{b_{1}}} \frac{\partial}{\partial x_{1}} (\frac{1}{f^{5}} \prod_{i = 2}^{6} v_{i}) d x_{1} d x_{2}, \end{matrix}

(22)

where we assume

a_{1} \geq 1

, without loss of generality. Let

P_{1} = f^{5} (\frac{\partial^{c_{1} - 1} f}{\partial x_{1}^{a_{1} - 1} \partial x_{2}^{b_{1}}} \frac{\partial}{\partial x_{1}} (\frac{1}{f^{5}} \prod_{i = 2}^{6} v_{i}))

. It is easy to see that

P_{1}

is a sixth-order differential form. Since

c_{1} \geq 4

, we have

c_{i} \leq 2

for

i = 2, \dots, 6

, and hence the leading orders of all monomials of

P_{1}

are equal to or less than

c_{1} - 1

. If the leading order of a monomial

\tilde{M}

of

P_{1}

is still equal to or more than 4, we can repeat procedure (22) for

\tilde{M}

until the leading orders of all monomials of

P_{1}

are equal to or less than 3.

After the above procedure, we obtain a sixth-order differential form

P_{1}

such that the leading orders of all monomials of

P_{1}

are equal to or less than 3. If the order type of a monomial

\tilde{M}

of

P_{1}

is

(2, 2, 2, 0, 0, 0)

, then we use procedure (22) to change

\tilde{M}

to a differential polynomial

P_{2}

. It is clear that the leading orders of all monomials of

P_{2}

are equal to or less than 3 and the order types of all monomials of

P_{2}

are not

(2, 2, 2, 0, 0, 0)

. Using the above procedure, we may eliminate all monomials with order type

(2, 2, 2, 0, 0, 0)

. For instance, for the monomial

f^{3} f_{2, 0} f_{1, 1} f_{0, 2}

with order type

(2, 2, 2, 0, 0, 0)

, we can obtain a sixth-order differential form

f^{3} f_{2, 1} f_{0, 2} f_{1, 0} + f^{3} f_{1, 2} f_{1, 1} f_{1, 0} - 2 f^{2} f_{1, 1} f_{0, 2} f_{1, 0}^{2}

.

After the above two reduction procedures, we obtain a differential polynomial P such that the leading orders of all monomials of P are equal to or less than 3 and the order types of all monomials of P are not

(2, 2, 2, 0, 0, 0)

. Then the order types of the monomials of P are

\begin{matrix} (3, 3, 0, 0, 0, 0), (3, 2, 1, 0, 0, 0), (3, 1, 1, 1, 0, 0), (2, 2, 1, 1, 0, 0), (2, 1, 1, 1, 1, 0) . \end{matrix}

All monomials with the above order types can be written as

M_{i} M_{j}

for certain

M_{i}, M_{j}

in Equation (20). For instance, the monomial

f^{4} f_{3, 0} f_{2, 1}

has order type

(3, 3, 0, 0, 0, 0)

, which can be written as

M_{1} M_{2}

. Thus, P is a quadratic form in variables

M

. The lemma is proven. □

Using Lemma 7 to each monomial of

I_{3}

in Equation (13), we obtain a quadratic form

{\tilde{I}}_{3}

in

M

\begin{matrix} {\tilde{I}}_{3} & = 1 / 2 M_{1}^{2} - M_{1} M_{5} + 3 / 2 M_{2}^{2} - 3 M_{2} M_{6} + 3 / 2 M_{3}^{2} + 1 / 2 M_{4}^{2} - 2 M_{4} M_{6} - M_{4} M_{7} \\ - M_{4} M_{10} - 1 / 2 M_{5}^{2} + 3 / 2 M_{6}^{2} - 3 M_{7}^{2} - 2 M_{7} M_{10} + 3 M_{8}^{2} - 5 / 2 M_{9}^{2} - 3 / 2 M_{9} M_{11} \\ + 21 M_{9} M_{13} - 1 / 2 M_{10}^{2} + 3 / 5 M_{11}^{2} + 3 M_{12}^{2} - 15 M_{13}^{2} + 3 / 5 M_{14}^{2} \end{matrix}

(23)

which satisfies

\begin{matrix} \int_{R^{2}} \frac{I_{3}}{f^{5}} d x_{1} d x_{2} = \int_{R^{2}} \frac{{\tilde{I}}_{3}}{f^{5}} d x_{1} d x_{2} . \end{matrix}

(24)

Using Lemma 7 to all monomials of

R_{j} (1 \leq j \leq 136)

, we obtain

{\bar{R}}_{j}

which are quadratic forms in

M

. Doing Gaussian elimination to

{\bar{R}}_{j} (1 \leq j \leq 136)

to eliminate the linearly dependent ones, we obtain 48 constraints

{\tilde{R}}_{j} (1 \leq j \leq 48)

which are given in Appendix B.

The variables in

M

satisfy certain relations, such as

M_{5} M_{8} = f^{2} f_{2, 0} f_{1, 1} f_{1, 0} f_{0, 1} = M_{6} M_{7}

, which are called intrinsic constraints. We have 15 intrinsic constraints

{\tilde{R}}_{i} (49 \leq i \leq 63)

. In total, we have 63 sixth-order constraints which are quadratic forms in

M

:

\begin{matrix} {\tilde{R}}_{i} (i = 1, \dots, 48) \\ {\tilde{R}}_{49} = M_{5} M_{8} - M_{6} M_{7}, & {\tilde{R}}_{50} = M_{5} M_{10} - M_{6} M_{9}, \\ {\tilde{R}}_{51} = M_{5} M_{12} - M_{6} M_{11}, & {\tilde{R}}_{52} = M_{5} M_{13} - M_{6} M_{12}, \\ {\tilde{R}}_{53} = M_{5} M_{14} - M_{6} M_{13}, & {\tilde{R}}_{54} = M_{7} M_{10} - M_{8} M_{9}, \\ {\tilde{R}}_{55} = M_{7} M_{12} - M_{8} M_{11}, & {\tilde{R}}_{56} = M_{7} M_{13} - M_{8} M_{12}, \\ {\tilde{R}}_{57} = M_{7} M_{14} - M_{8} M_{13}, & {\tilde{R}}_{58} = M_{9} M_{12} - M_{10} M_{11}, \\ {\tilde{R}}_{59} = M_{9} M_{13} - M_{10} M_{12}, & {\tilde{R}}_{60} = M_{9} M_{14} - M_{10} M_{13}, \\ {\tilde{R}}_{61} = M_{11} M_{13} - M_{12}^{2}, & {\tilde{R}}_{62} = M_{11} M_{14} - M_{12} M_{13}, \\ {\tilde{R}}_{63} = M_{12} M_{14} - M_{13}^{2} . \end{matrix}

(25)

where

{\tilde{R}}_{i} (i = 1, \dots, 48)

are given in Appendix B.

The following lemma summarizes all the constraints needed in the proof.

Lemma 8.

From Equations (19), (21) and (25), we obtain the following fourth-order constraints and sixth-order constraints

\begin{matrix} \begin{matrix} \int_{R^{2}} \frac{f_{1, 0} {\tilde{P}}_{i}}{f^{3}} d x_{1} d x_{2} = \int_{R^{2}} \frac{f_{0, 1} {\tilde{Q}}_{i}}{f^{3}} d x_{1} d x_{2} = 0, i = 1, \dots, 4, \\ \int_{R^{2}} \frac{{\tilde{R}}_{j}}{f^{5}} d x_{1} d x_{2} = 0, j = 1, \dots, 63, \end{matrix} \end{matrix}

(26)

where

{\tilde{R}}_{j}

are quadratic forms in

M

and

{\tilde{P}}_{i}, {\tilde{Q}}_{i}

are linear forms in

M

.

Proof.

We need only to consider the equalities for

{\tilde{R}}_{j} (1 \leq j \leq 48)

.

{\bar{R}}_{i}

is obtained from

R_{i}

by applying Lemma 7 to each monomial of

R_{i}

. Then by Equation (19) and Lemma 7, we have

\int_{R^{2}} \frac{R_{j}}{f^{5}} d x_{1} d x_{2} = \int_{R^{2}} \frac{{\bar{R}}_{j}}{f^{5}} d x_{1} d x_{2} = 0, j = 1, \dots, 136

.

{\tilde{R}}_{j}

are obtained from

{\bar{R}}_{j} (1 \leq j \leq 136)

by doing Gaussian elimination, so the

{\tilde{R}}_{j}

are linear combinations of

{\bar{R}}_{j}

over

Q

. Thus

\int_{R^{2}} \frac{{\tilde{R}}_{j}}{f^{5}} d x_{1} d x_{2} = 0, j = 1, \dots, 48

. The lemma is proven. □

3.4. Reduction to Semidefinite Positiveness of a Quadratic Form

In this section, we give an

Θ

, which is a quadratic form in

M

, such that Theorem 1 is true if

Θ

\geq 0

, that is,

Θ

is a semidefinite positive polynomial when

f_{a, b}

are treated as independent variables.

In the following key lemma, we introduce

{\tilde{J}}_{3}

in order to generate a common factor

I = \int_{R^{2}} \frac{I_{1}}{f} d x_{1} d x_{2}

in the proof of Lemma 10.

Lemma 9.

Let

\begin{matrix} \tilde{J_{3}} : = {({\tilde{F}}_{1, 0} + \tilde{P})}^{2} + {({\tilde{F}}_{0, 1} + \tilde{Q})}^{2} + \tilde{R}, \\ \tilde{P} : = \sum_{i = 1}^{4} α_{i} \tilde{P_{i}}, \tilde{Q} : = \sum_{i = 1}^{4} β_{i} \tilde{Q_{i}}, \tilde{R} : = \sum_{j = 1}^{63} γ_{j} \tilde{R_{j}}, α_{i}, β_{i}, γ_{j} \in R, \end{matrix}

(27)

where

{\tilde{F}}_{1, 0}, {\tilde{F}}_{0, 1}, {\tilde{P}}_{i}, {\tilde{Q}}_{i}

are defined in Equation (21) and

{\tilde{R}}_{j}

are defined in Equation (25). Then,

\tilde{J_{3}}

is a quadratic form in

M

and satisfies

{(\int_{R^{2}} \frac{I_{2}}{f^{3}} d x_{1} d x_{2})}^{2} \leq \int_{R^{2}} \frac{I_{1}}{f} d x_{1} d x_{2} \int_{R^{2}} \frac{\tilde{J_{3}}}{f^{5}} d x_{1} d x_{2},

(28)

where

I_{1}

and

I_{2}

are defined in Equation (10) and Equation (11), respectively.

Proof.

\tilde{J_{3}}

is clearly a quadratic form in

M

. From Equations (10) and (11),

I_{1} = f_{1, 0}^{2} + f_{0, 1}^{2}

and

I_{2} = f_{1, 0} {\tilde{F}}_{1, 0} + f_{0, 1} {\tilde{F}}_{0, 1}

(F_{1, 0} = {\tilde{F}}_{1, 0}, F_{0, 1} = {\tilde{F}}_{0, 1})

. Using the inequality (14) with

f_{1} = \frac{f_{1, 0}}{\sqrt{f}}

,

f_{2} = \frac{f_{0, 1}}{\sqrt{f}}

,

g_{1} = \frac{{\tilde{F}}_{0, 1}}{f^{2} \sqrt{f}}

,

g_{2} = \frac{{\tilde{F}}_{1, 0}}{f^{2} \sqrt{f}}

, we have

\begin{matrix} {(\int_{R^{2}} \frac{I_{2}}{f^{3}} d x_{1} d x_{2})}^{2} \\ \overset{(27), (26)}{=} & {(\int_{R^{2}} \frac{I_{2} + f_{1, 0} \tilde{P} + f_{0, 1} \tilde{Q}}{f^{3}} d x_{1} d x_{2})}^{2} \\ \overset{(11)}{=} & {(\int_{R^{2}} \frac{f_{1, 0} ({\tilde{F}}_{1, 0} + \tilde{P}) + f_{0, 1} ({\tilde{F}}_{0, 1} + \tilde{Q})}{f^{3}} d x_{1} d x_{2})}^{2} \\ \overset{(14)}{\leq} & \int_{R^{2}} \frac{f_{1, 0}^{2} + f_{0, 1}^{2}}{f} d x_{1} d x_{2} \int_{R^{2}} \frac{{({\tilde{F}}_{1, 0} + \tilde{P})}^{2} + {({\tilde{F}}_{0, 1} + \tilde{Q})}^{2}}{f^{5}} d x_{1} d x_{2} \\ \overset{(26)}{=} & \int_{R^{2}} \frac{f_{1, 0}^{2} + f_{0, 1}^{2}}{f} d x_{1} d x_{2} \int_{R^{2}} \frac{{({\tilde{F}}_{1, 0} + \tilde{P})}^{2} + {({\tilde{F}}_{0, 1} + \tilde{Q})}^{2} + \tilde{R}}{f^{5}} d x_{1} d x_{2} \\ \overset{(10), (27)}{=} & \int_{R^{2}} \frac{I_{1}}{f} d x_{1} d x_{2} \int_{R^{2}} \frac{{\tilde{J}}_{3}}{f^{5}} d x_{1} d x_{2} . \end{matrix}

The lemma is proven. □

In Lemma 10, proof of Theorem 1 is finally reduced to the proof of an inequality for a quadratic form with undetermined coefficients.

Lemma 10.

Let

{\tilde{I}}_{3}

be defined in Equation (23) and

{\tilde{J}}_{3}

be defined in Equation (27). Then Theorem 1 is true if there exist

α_{i}, β_{i}, γ_{j} \in R

such that

Θ : = 2 {\tilde{I}}_{3} - {\tilde{J}}_{3} \geq 0,

(29)

where Θ is a quadratic form in

M

.

Proof.

Θ

is clearly a quadratic form in

M

, since

{\tilde{I}}_{3}

and

{\tilde{J}}_{3}

are. By Lemma 3, we have

\begin{matrix} 2 I (t) \frac{d^{2}}{d t^{2}} I (t) - {(\frac{d}{d t} I (t))}^{2} \\ = & 2 (\int_{R^{2}} \frac{I_{1}}{f} d x_{1} d x_{2}) (\int_{R^{2}} \frac{I_{3}}{f^{5}} d x_{1} d x_{2}) - {(\int_{R^{2}} \frac{I_{2}}{f^{3}} d x_{1} d x_{2})}^{2} \\ \overset{(28)}{\geq} & 2 (\int_{R^{2}} \frac{I_{1}}{f} d x_{1} d x_{2}) (\int_{R^{2}} \frac{I_{3}}{f^{5}} d x_{1} d x_{2}) - (\int_{R^{2}} \frac{I_{1}}{f} d x_{1} d x_{2}) (\int_{R^{2}} \frac{{\tilde{J}}_{3}}{f^{5}} d x_{1} d x_{2}) \\ \overset{(24)}{=} & (\int_{R^{2}} \frac{I_{1}}{f} d x_{1} d x_{2}) (\int_{R^{2}} \frac{2 {\tilde{I}}_{3} - {\tilde{J}}_{3}}{f^{5}} d x_{1} d x_{2}) \\ \overset{(29)}{\geq} & 0 . \end{matrix}

Since

f > 0

and

I_{1} > 0

, by Lemma 1, Theorem 1 is true if

Θ

\geq 0

. □

3.5. Prove Theorem 1 by Solving an SDP Problem

In this section, we will give an

Θ

in Equation (29) satisfying

Θ

\geq 0

and hence proving Theorem 1. By Lemma 10, in order to prove Theorem 1, it suffices to solve the following problem.

Problem 1.

Find

α_{i}, β_{i}, γ_{j} \in R

such that

Θ = 2 {\tilde{I}}_{3} - {\tilde{J}}_{3} = 2 {\tilde{I}}_{3} - \sum_{j = 1}^{63} γ_{j} {\tilde{R}}_{j} - {({\tilde{F}}_{1, 0} + \sum_{i = 1}^{4} α_{i} {\tilde{P}}_{i})}^{2} - {({\tilde{F}}_{0, 1} + \sum_{i = 1}^{4} β_{i} {\tilde{Q}}_{i})}^{2} \geq 0,

(30)

where

{\tilde{I}}_{3}

is defined in Equation (23);

{\tilde{R}}_{j}

are defined in Equation (25); and

{\tilde{F}}_{1, 0}, {\tilde{F}}_{0, 1}, {\tilde{P}}_{i}, {\tilde{Q}}_{i}

are defined in Equation (21).

It is impossible to compute

α_{i}, β_{i}, γ_{j}

in Problem 1 with SDP directly, since

Θ

is not linear in

α_{i}, β_{i}

. We use the following strategy to solve Problem 1:

S1: Expanding the squares ${({\tilde{F}}_{1, 0} + \sum_{i = 1}^{4} α_{i} {\tilde{P}}_{i})}^{2}$ and ${({\tilde{F}}_{0, 1} + \sum_{i = 1}^{4} β_{i} {\tilde{Q}}_{i})}^{2}$ and deleting the terms $- {(\sum_{i = 1}^{4} α_{i} {\tilde{P}}_{i})}^{2}$ and $- {(\sum_{i = 1}^{4} β_{i} {\tilde{Q}}_{i})}^{2}$ , we obtain Problem 2 which is weaker than Problem 1.
S2: Since $\tilde{Θ}$ in Problem 2 is linear in $α_{i}, β_{i}, γ_{j}$ , we can use SDP to solve Problem 2 and let ${\tilde{α}}_{i}, {\tilde{β}}_{i}, {\tilde{γ}}_{j}$ be the solutions.
S3: Let $Θ_{1}$ be obtained from $Θ$ by substituting $α_{i}, β_{i}$ with ${\tilde{α}}_{i}, {\tilde{β}}_{i}$ . Then, $Θ_{1}$ is linear in $γ_{j}$ and we can use SDP to compute $γ_{j}$ such that $Θ_{1}$ $\geq 0$ is true. Under this condition, Problem 1 becomes Problem 5, and it suffices to solve Problem 5 in order to prove Theorem 1.

Problem 2.

Find

α_{i}, β_{i}, γ_{j} \in R

such that

\tilde{Θ} : = (2 {\tilde{I}}_{3} - {\tilde{F}}_{1, 0}^{2} - {\tilde{F}}_{0, 1}^{2}) - \sum_{j = 1}^{63} γ_{j} {\tilde{R}}_{j} - 2 \sum_{i = 1}^{4} α_{i} {\tilde{F}}_{1, 0} {\tilde{P}}_{i} - 2 \sum_{i = 1}^{4} β_{i} {\tilde{F}}_{0, 1} {\tilde{Q}}_{i} \geq 0,

where

{\tilde{I}}_{3}

is defined in Equation (23);

{\tilde{R}}_{j}

are defined in Equation (25); and

{\tilde{F}}_{1, 0}, {\tilde{F}}_{0, 1}, {\tilde{P}}_{i}, {\tilde{Q}}_{i}

are defined in Equation (21).

Since

\tilde{Θ}

is a quadratic form in

M

, it is well known that

\tilde{Θ}

\geq 0

is equivalent to the fact that the symmetric matrix

\hat{Θ}

\in R^{14 \times 14}

of

\tilde{Θ}

is positive semidefinite, that is,

\hat{Θ}

⪰ 0

[19]. In other words, Problem 2 is equivalent to the following SDP problem [19].

Problem 3.

\begin{matrix} min_{α_{i}, β_{i}, γ_{j} \in R} 1 s . t . \\ \hat{Θ} : = (2 {\hat{I}}_{3} - \hat{F_{1, 0}^{2}} - \hat{F_{0, 1}^{2}}) - \sum_{j = 1}^{63} γ_{j} {\hat{R}}_{j} - \sum_{i = 1}^{4} 2 α_{i} \hat{F_{1, 0} P_{i}} - \sum_{i = 1}^{4} 2 β_{i} \hat{F_{0, 1} Q_{i}} ⪰ 0, \end{matrix}

where

\hat{Q} \in R^{n \times n}

is the corresponding symmetric matrix for any quadratic form Q in

M

and

n = | M | = 14

.

We set the objective function to be 1, which means that it suffices to satisfy the constraints.

We actually solve the following dual problem [19] of Problem 3:

Problem 4.

\begin{matrix} max_{X} & - trace (X^{T} \hat{I}) \\ s . t . & trace (X^{T} {\hat{R}}_{j}) = 0, j = 1, \dots, 63 \\ trace (X^{T} 2 \hat{F_{1, 0} P_{i}}) = 0, i = 1, \dots, 4 \\ trace (X^{T} 2 \hat{F_{0, 1} Q_{i}}) = 0, i = 1, \dots, 4 \\ X ⪰ 0 \end{matrix}

where

\hat{I} : = 2 {\hat{I}}_{3} - \hat{F_{1, 0}^{2}} - \hat{F_{0, 1}^{2}}, X \in R^{n \times n}

, and

n = | M | = 14

.

Remark 1.

If not using differential forms to reduce the polynomials into quadratic forms in

M

, then we need to consider all differential monomials with total degree 3 and total order

\leq 6

as the bases for the SDP Problem 4. In such a case,

n = 100

instead of

n = 14

, and we need to solve a much larger SDP problem for

X \in R^{n \times n}

.

We use the CVX package in Matlab [26] to solve Problem 4. The program is given in Appendix A. Our complete code and data are available (accessed on 30 November 2022) at https://github.com/liujunliang19/sqrt-convex.

With CVX, we obtain a set of solutions for

γ_{j}, α_{i}, β_{i}

, which are given in Appendix C. From the above discussions, we see that these values are also solutions to Problem 2.

Finally, according to step S3 just above Problem 2, we put the solutions for

α_{i}, β_{i}

back into

Θ

in Problem 1 and obtain the following problem.

Problem 5.

Find

λ_{j} \in R

such that

Θ_{1} : = 2 {\tilde{I}}_{3} - \sum_{j = 1}^{63} λ_{j} {\tilde{R}}_{j} - {({\tilde{F}}_{1, 0} - \frac{29}{110} {\tilde{P}}_{1} - \frac{32}{139} {\tilde{P}}_{4})}^{2} - {({\tilde{F}}_{0, 1} - \frac{29}{110} {\tilde{Q}}_{2} - \frac{23}{100} {\tilde{Q}}_{3})}^{2} \geq 0,

(31)

where

{\tilde{I}}_{3}

is defined in Equation (23);

{\tilde{R}}_{j}

are defined in Equation (25); and

{\tilde{F}}_{1, 0}, {\tilde{F}}_{0, 1}, {\tilde{P}}_{i}, {\tilde{Q}}_{i}

are defined in Equation (21).

Similar to Problems 3 and 4, we obtain a set of solutions for

λ_{j}

, which are given in Appendix D. Now

Θ_{1}

is a semi-positive quadratic form and it is well known that

Θ_{1}

can be written as an SOS. The value of

Θ_{1}

as well as its SOS representation are given in Appendix E. Hence, we solve Problem 1 and therefore prove Theorem 1.

Remark 2.

Note that the SOS given in Appendix E provides an explicit and direct proof for Theorem 1 and the solution procedure for the SDP is not needed, similar to Equation (17) for the case of

n = 1

. Of course, the SOS in Appendix E is quite large and difficult to check manually. In order for interested readers to check the proof with a mathematical software system, we also give the complete code and data in https://github.com/liujunliang19/sqrt-convex (accessed on 30 November 2022). The SOS expression for

H_{1}

is in the bottom of our Maple code named sqrt-convex2.mw, which can be run directly.

Remark 3.

We also try to use the above approach to prove the log-convexity of the Fisher information along heat flow for

n = 2

. The CVX program returns failed. Thus, we cannot prove the log-convexity with the above approach. We also cannot say that the log-convexity is not correct, since the log-convexity is not equivalent to Problem 3.

Remark 4.

Theorem 1 is stronger than the CMC for the third-order derivative with dimension two. In other words, given Theorem 1, we can obtain

\frac{d^{3}}{d t^{3}} H (X_{t}) \geq 0 (n = 2)

. Using Lemma 1, we obtain

2 I (t) \frac{d^{2}}{d t^{2}} I (t) \geq {(\frac{d}{d t} I (t))}^{2} \geq 0 (n = 2)

. Since

I (t) \geq 0

, we have

\frac{d^{2}}{d t^{2}} I (t) \geq 0 (n = 2)

. Using Equation (3), we have

\frac{d^{3}}{d t^{3}} H (X_{t}) = \frac{1}{2} \frac{d^{2}}{d t^{2}} I (t) \geq 0 (n = 2)

.

4. Conclusions

In this paper, we prove the sqrt-convexity of Fisher Information along heat flow in dimension two. It is easy to find that this conclusion is weaker than the log-convexity conjecture. However, it is stronger than the CMC for the third-order derivative with dimension two.

The proof is based on the SDP method. In order to reduce the size of the SDP problem, we prove that any sixth-order differential form can be reduced to an “equivalent” differential polynomial which is a quadratic form in certain new variables. Based on this fact, we reduce the sixth-order differential forms into quadratic forms in a set of new variables, which reduces the size of the SDP problem significantly.

For possible future research directions, it is interesting to prove the sqrt-convexity for higher dimensions

(n \geq 3)

using the method given in this paper. In this case, the main difficulty is to establish inequality (27) in higher dimensions. Another question is to prove the log-convexity by introducing more constraints or new methods to solve Problem 1 without using the relaxation method used in Problem 2. The methods introduced in this paper may be used to prove other EPI inequalities related with the heat equations.

Author Contributions

Conceptualization, J.L.; methodology, J.L. and X.G.; software, J.L.; validation, J.L. and X.G.; formal analysis, J.L. and X.G.; investigation, J.L.;data curation, J.L.; writing—original draft preparation, J.L.; writing—review and editing, X.G.; supervision, X.G.; project administration, X.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by NSFC grant number 11688101 and NKRDP 2018YFA0704705.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The code for the SDP solver and data are available (accessed on 30 November 2022) at https://github.com/liujunliang19/sqrt-convex.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Matlab Codes for SDP

The following Matlab codes are used to solve Problem 4.

cvx_begin
variable Z(n,n) symmetric
dual variable y
maximize(trace(C∗X))
subject to
[trace(A_1∗Z),trace(A_2∗Z),...,trace(A_m∗Z)]’==
zeros(m,1):y;
Z == semidefinite(n);
cvx_end

Appendix B

We give

{\tilde{R}}_{i} (1 \leq i \leq 48)

in Equation (25).

\begin{matrix} {\tilde{R}}_{1} = 4 M_{5} M_{12} - \frac{16}{5} M_{11} M_{12} \\ {\tilde{R}}_{2} = M_{8} M_{14} - \frac{4}{5} M_{13} M_{14} \\ {\tilde{R}}_{3} = M_{1} M_{11} + 3 M_{5}^{2} - \frac{12}{5} M_{11}^{2} \\ {\tilde{R}}_{4} = M_{2} M_{11} + 3 M_{5} M_{7} - \frac{12}{5} M_{11} M_{12} \\ {\tilde{R}}_{5} = M_{3} M_{14} + 3 M_{8} M_{10} - \frac{12}{5} M_{13} M_{14} \\ {\tilde{R}}_{6} = M_{4} M_{14} + 3 M_{10}^{2} - \frac{12}{5} M_{14}^{2} \\ {\tilde{R}}_{7} = 3 M_{5} M_{13} - \frac{1}{2} M_{9} M_{11} - 2 M_{12}^{2} \\ {\tilde{R}}_{8} = M_{1} M_{12} + 2 M_{5} M_{6} + M_{5} M_{7} - \frac{12}{5} M_{11} M_{12} \\ {\tilde{R}}_{9} = M_{2} M_{14} + 3 M_{8}^{2} + \frac{9}{2} M_{9} M_{13} - 6 M_{13}^{2} \\ {\tilde{R}}_{10} = M_{3} M_{11} + 3 M_{7}^{2} + \frac{3}{4} M_{9} M_{11} - 3 M_{12}^{2} \\ {\tilde{R}}_{11} = M_{4} M_{13} + M_{8} M_{10} + 2 M_{9} M_{10} - \frac{12}{5} M_{13} M_{14} \\ {\tilde{R}}_{12} = - M_{5} M_{9} + M_{7}^{2} + \frac{5}{4} M_{9} M_{11} - M_{12}^{2} \\ {\tilde{R}}_{13} = - 3 M_{6} M_{10} + 3 M_{8}^{2} + \frac{45}{2} M_{9} M_{13} - 18 M_{13}^{2} \\ {\tilde{R}}_{14} = M_{1} M_{13} + 2 M_{5} M_{8} + M_{6}^{2} - \frac{1}{2} M_{9} M_{11} - 2 M_{12}^{2} \\ {\tilde{R}}_{15} = M_{2} M_{12} + 2 M_{5} M_{8} + M_{7}^{2} + \frac{3}{4} M_{9} M_{11} - 3 M_{12}^{2} \\ {\tilde{R}}_{16} = M_{3} M_{13} + 2 M_{7} M_{10} + M_{8}^{2} + \frac{9}{2} M_{9} M_{13} - 6 M_{13}^{2} \\ {\tilde{R}}_{17} = 2 M_{2} M_{7} - 2 M_{3} M_{5} + 4 M_{5} M_{8} - 6 M_{5} M_{9} + 2 M_{7}^{2} + \frac{15}{2} M_{9} M_{11} - 6 M_{12}^{2} \\ {\tilde{R}}_{18} = \frac{2}{3} M_{3} M_{7} - \frac{2}{3} M_{4} M_{5} - \frac{8}{3} M_{5} M_{10} + 4 M_{7} M_{8} - \frac{4}{3} M_{7} M_{9} + 10 M_{9} M_{12} - 8 M_{11} M_{14} \\ {\tilde{R}}_{19} = M_{1} M_{8} + M_{1} M_{9} - M_{2} M_{6} - M_{2} M_{7} + 2 M_{6}^{2} - 2 M_{7}^{2} - \frac{5}{2} M_{9} M_{11} + 2 M_{12}^{2} \\ {\tilde{R}}_{20} = - M_{2} M_{8} + \frac{1}{3} M_{3} M_{7} + \frac{2}{3} M_{4} M_{5} + \frac{8}{3} M_{5} M_{10} - 4 M_{7} M_{8} + \frac{4}{3} M_{7} M_{9} - 10 M_{9} M_{12} + 8 M_{11} M_{14} \\ {\tilde{R}}_{21} = \frac{12}{5} M_{3} M_{9} - \frac{8}{5} M_{4} M_{6} - \frac{4}{5} M_{4} M_{7} + \frac{12}{5} M_{5} M_{9} + \frac{24}{5} M_{6} M_{10} - \frac{12}{5} M_{7}^{2} - \frac{8}{5} M_{7} M_{10} - \frac{16}{5} M_{9}^{2} \\ - 3 M_{9} M_{11} - 12 M_{9} M_{13} + \frac{12}{5} M_{12}^{2} + \frac{48}{5} M_{13}^{2} \\ {\tilde{R}}_{22} = - \frac{5}{3} M_{3} M_{8} + \frac{4}{3} M_{3} M_{9} - \frac{1}{3} M_{4} M_{6} + \frac{2}{3} M_{4} M_{7} + \frac{4}{3} M_{5} M_{9} + \frac{8}{3} M_{6} M_{10} - \frac{4}{3} M_{7}^{2} - 2 M_{7} M_{10} \\ - \frac{2}{3} M_{9}^{2} - \frac{5}{3} M_{9} M_{11} - 15 M_{9} M_{13} + \frac{4}{3} M_{12}^{2} + 12 M_{13}^{2} \\ {\tilde{R}}_{23} = 3 M_{1} M_{9} - 6 M_{2} M_{6} + 3 M_{3} M_{5} + 5 M_{3} M_{8} - 4 M_{3} M_{9} + M_{4} M_{6} - 2 M_{4} M_{7} - 4 M_{5} M_{9} + 6 M_{6}^{2} \\ - 8 M_{6} M_{10} - 2 M_{7}^{2} + 6 M_{7} M_{10} + 2 M_{9}^{2} - \frac{5}{2} M_{9} M_{11} + 45 M_{9} M_{13} + 2 M_{12}^{2} - 36 M_{13}^{2} \\ {\tilde{R}}_{24} = 5 M_{5} M_{11} - 4 M_{11}^{2} \\ {\tilde{R}}_{25} = 5 M_{10} M_{14} - 4 M_{14}^{2} \\ {\tilde{R}}_{26} = 5 M_{7} M_{11} - 4 M_{11} M_{12} \\ {\tilde{R}}_{27} = - 20 M_{9} M_{14} + 16 M_{13} M_{14} \\ {\tilde{R}}_{28} = M_{1} M_{6} - M_{2} M_{5} \\ {\tilde{R}}_{29} = 2 M_{5} M_{14} - 2 M_{9} M_{12} \\ {\tilde{R}}_{30} = M_{6} M_{14} - 6 M_{9} M_{13} + 4 M_{13}^{2} \\ {\tilde{R}}_{31} = 4 M_{7} M_{12} + M_{9} M_{11} - 4 M_{12}^{2} \\ {\tilde{R}}_{32} = 2 M_{7} M_{14} + 3 M_{9} M_{13} - 4 M_{13}^{2} \\ {\tilde{R}}_{33} = M_{4} M_{11} + 3 M_{7} M_{9} - 3 M_{9} M_{12} \\ {\tilde{R}}_{34} = M_{1} M_{14} + 3 M_{6} M_{8} - 3 M_{9} M_{12} \\ {\tilde{R}}_{35} = 3 M_{7} M_{13} + 2 M_{9} M_{12} - 4 M_{11} M_{14} \\ {\tilde{R}}_{36} = - M_{4} M_{8} + M_{4} M_{9} + 2 M_{8} M_{10} - 2 M_{9} M_{10} \\ {\tilde{R}}_{37} = - M_{1} M_{3} + M_{1} M_{8} + M_{2}^{2} - M_{2} M_{7} \\ {\tilde{R}}_{38} = 2 M_{1} M_{7} - 2 M_{2} M_{5} + 4 M_{5} M_{6} - 4 M_{5} M_{7} \\ {\tilde{R}}_{39} = 2 M_{3} M_{10} - 2 M_{4} M_{8} + 4 M_{8} M_{10} - 4 M_{9} M_{10} \\ {\tilde{R}}_{40} = M_{4} M_{12} + 2 M_{7} M_{10} + M_{9}^{2} - 3 M_{9} M_{13} \\ {\tilde{R}}_{41} = 12 M_{5} M_{10} - 12 M_{7} M_{8} - 30 M_{9} M_{12} + 24 M_{11} M_{14} \\ {\tilde{R}}_{42} = M_{3} M_{12} + 2 M_{7} M_{8} + M_{7} M_{9} + 2 M_{9} M_{12} - 4 M_{11} M_{14} \\ {\tilde{R}}_{43} = M_{2} M_{13} + M_{6} M_{8} + 2 M_{7} M_{8} + 2 M_{9} M_{12} - 4 M_{11} M_{14} \\ {\tilde{R}}_{44} = 2 M_{2} M_{10} - 2 M_{3} M_{8} + 4 M_{6} M_{10} - 4 M_{7} M_{10} - 30 M_{9} M_{13} + 24 M_{13}^{2} \\ {\tilde{R}}_{45} = 3 M_{1} M_{10} - 2 M_{3} M_{7} - M_{4} M_{5} - 4 M_{5} M_{10} + 6 M_{6} M_{8} - 2 M_{7} M_{9} \\ {\tilde{R}}_{46} = M_{2} M_{4} - M_{3}^{2} + 2 M_{3} M_{8} + M_{3} M_{9} - M_{4} M_{6} - 2 M_{4} M_{7} + 2 M_{6} M_{10} - 2 M_{9}^{2} \\ {\tilde{R}}_{47} = - M_{2} M_{9} + M_{3} M_{6} + 4 M_{5} M_{10} - 2 M_{6} M_{8} - 4 M_{7} M_{8} + 2 M_{7} M_{9} - 10 M_{9} M_{12} + 8 M_{11} M_{14} \\ {\tilde{R}}_{48} = - M_{1} M_{4} + 3 M_{1} M_{10} + M_{2} M_{3} - M_{2} M_{8} - M_{2} M_{9} - M_{3} M_{7} + 4 M_{6} M_{8} - 4 M_{7} M_{8} \\ - 10 M_{9} M_{12} + 8 M_{11} M_{14} \end{matrix}

Appendix C. Solutions to Problem 2

We give the solutions

α_{i} (1 \leq i \leq 4)

,

β_{i} (1 \leq i \leq 4)

,

γ_{j} (1 \leq j \leq 63)

to Problem 4, which are also solutions to Problems 2 and 3.

\begin{matrix} α_{1} = - 29 / 110 & α_{2} = 0 & α_{3} = 0 & α_{4} = - 32 / 139 \\ β_{1} = 0 & β_{2} = - 29 / 110 & β_{3} = - 23 / 100 & β_{4} = 0 \\ γ_{1} = 0 & γ_{2} = 0 & γ_{3} = - 283 / 207 & γ_{4} = 0 & γ_{5} = 0 \\ γ_{6} = - 175 / 128 & γ_{7} = 93 / 130 & γ_{8} = 0 & γ_{9} = - 85 / 92 & γ_{10} = - 110 / 119 \\ γ_{11} = 0 & γ_{12} = - 114 / 83 & γ_{13} = 779 / 161 & γ_{14} = - 76 / 83 & γ_{15} = - 493 / 219 \\ γ_{16} = - 232 / 103 & γ_{17} = 270 / 173 & γ_{18} = 0 & γ_{19} = - 167 / 68 & γ_{20} = 0 \\ γ_{21} = 297 / 52 & γ_{22} = - 107 / 188 & γ_{23} = 409 / 256 & γ_{24} = 37 / 113 & γ_{25} = 37 / 113 \\ γ_{26} = 0 & γ_{27} = 0 & γ_{28} = 0 & γ_{29} = 0 & γ_{30} = 24 / 85 \\ γ_{31} = 69 / 112 & γ_{32} = 85 / 69 & γ_{33} = 0 & γ_{34} = 0 & γ_{35} = 0 \\ γ_{36} = 0 & γ_{37} = 481 / 285 & γ_{38} = 0 & γ_{39} = 0 & γ_{40} = - 118 / 129 \\ γ_{41} = 0 & γ_{42} = 0 & γ_{43} = 0 & γ_{44} = 127 / 152 & γ_{45} = 0 \\ γ_{46} = - 27 / 16 & γ_{47} = 0 & γ_{48} = 0 & γ_{49} = 118 / 101 & γ_{50} = 0 \\ γ_{51} = 0 & γ_{52} = - 555 / 247 & γ_{53} = 0 & γ_{54} = 256 / 219 & γ_{55} = 14 / 51 \\ γ_{56} = 0 & γ_{57} = - 241 / 88 & γ_{58} = 0 & γ_{59} = 8 / 79 & γ_{60} = 0 \\ γ_{61} = 13 / 29 & γ_{62} = 0 & γ_{63} = 204 / 455 \end{matrix}

Appendix D. Solutions to Problem 5

We give the solutions

λ_{j} (1 \leq j \leq 63)

to Problem 5.

\begin{matrix} λ_{1} = 0 & λ_{2} = 0 & λ_{3} = - 363 / 248 & λ_{4} = 0 & λ_{5} = 0 \\ λ_{6} = - 157 / 109 & λ_{7} = 47 / 63 & λ_{8} = 0 & λ_{9} = - 355 / 317 & λ_{10} = - 208 / 307 \\ λ_{11} = 0 & λ_{12} = - 208 / 99 & λ_{13} = 4372 / 845 & λ_{14} = - 111 / 92 & λ_{15} = - 255 / 104 \\ λ_{16} = - 529 / 241 & λ_{17} = 233 / 132 & λ_{18} = 0 & λ_{19} = - 645 / 253 & λ_{20} = 0 \\ λ_{21} = 821 / 142 & λ_{22} = - 21 / 68 & λ_{23} = 263 / 151 & λ_{24} = 43 / 108 & λ_{25} = 107 / 283 \\ λ_{26} = 0 & λ_{27} = 0 & λ_{28} = 0 & λ_{29} = 0 & λ_{30} = 227 / 328 \\ λ_{31} = 61 / 76 & λ_{32} = 97 / 75 & λ_{33} = 0 & λ_{34} = 0 & λ_{35} = 0 \\ λ_{36} = 0 & λ_{37} = 342 / 157 & λ_{38} = 0 & λ_{39} = 0 & λ_{40} = - 191 / 186 \\ λ_{41} = 0 & λ_{42} = 0 & λ_{43} = 0 & λ_{44} = 188 / 181 & λ_{45} = 0 \\ λ_{46} = - 243 / 128 & λ_{47} = 0 & λ_{48} = 0 & λ_{49} = 281 / 522 & λ_{50} = 0 \\ λ_{51} = 0 & λ_{52} = - 269 / 187 & λ_{53} = 0 & λ_{54} = 97 / 114 & λ_{55} = 1 / 36 \\ λ_{56} = 0 & λ_{57} = - 307 / 137 & λ_{58} = 0 & λ_{59} = - 263 / 296 & λ_{60} = 0 \\ λ_{61} = - 10 / 181 & λ_{62} = 0 & λ_{63} = - 25 / 84 \end{matrix}

Appendix E. SOS Expression of Θ₁

The value of

Θ_{1}

is:

\begin{array}{l} Θ_{1} = M_{1}^{2} + \frac{28}{157} M_{1} M_{3} - \frac{78}{55} M_{1} M_{5} + \frac{7133009}{5521219} M_{1} M_{8} - \frac{6453649}{5310217} M_{1} M_{9} + \frac{5581}{13640} M_{1} M_{11} + \frac{3653}{12788} M_{1} M_{13} \\ + \frac{443}{157} M_{2}^{2} - \frac{13}{128} M_{2} M_{4} - \frac{5041031}{1910150} M_{2} M_{6} - \frac{1614857}{541650} M_{2} M_{7} + \frac{5022}{9955} M_{2} M_{10} + \frac{3983}{2600} M_{2} M_{12} \\ + \frac{1139}{17435} M_{2} M_{14} + \frac{397}{128} M_{3}^{2} + \frac{44197}{49830} M_{3} M_{5} - \frac{10036650925}{4133321792} M_{3} M_{8} - \frac{50886897819}{16213582720} M_{3} M_{9} - \frac{6366}{16885} M_{3} M_{11} \\ + \frac{42683}{33499} M_{3} M_{13} + M_{4}^{2} - \frac{602116651}{583222400} M_{4} M_{6} + \frac{419279509}{291611200} M_{4} M_{7} - \frac{78}{55} M_{4} M_{10} + \frac{497}{4650} M_{4} M_{12} \\ + \frac{2313}{5995} M_{4} M_{14} + \frac{543657}{750200} M_{5}^{2} - \frac{3509920763}{2386432620} M_{5} M_{8} + \frac{2688875063}{25080385770} M_{5} M_{9} - \frac{205631}{326700} M_{5} M_{11} \\ + \frac{96556}{248115} M_{5} M_{13} + \frac{505083713}{382030000} M_{6}^{2} - \frac{86969}{652500} M_{6} M_{7} + \frac{139595170060937}{490605224824000} M_{6} M_{10} - \frac{358527}{467500} M_{6} M_{12} \\ + \frac{35063}{451000} M_{6} M_{14} + \frac{22297627286581783}{8281308816495000} M_{7}^{2} - \frac{8095283498600389}{6438694624495375} M_{7} M_{10} - \frac{1203457}{427500} M_{7} M_{12} \\ + \frac{78722}{565125} M_{7} M_{14} + \frac{2270979774558}{1247276139265} M_{8}^{2} + \frac{393049}{2202594} M_{8} M_{9} + \frac{141277}{275220} M_{8} M_{11} - \frac{4809243}{2646977} M_{8} M_{13} \\ + \frac{158124697228757}{104796491910720} M_{9}^{2} + \frac{1386601574744315513}{4899089794897378080} M_{9} M_{11} - \frac{93944237793653606351280127}{77398522886148846455005400} M_{9} M_{13} \\ + \frac{215856}{329725} M_{10}^{2} + \frac{121743}{407000} M_{10} M_{12} - \frac{452981}{856075} M_{10} M_{14} + \frac{1021241}{5063850} M_{11}^{2} - \frac{595422}{1383745} M_{11} M_{13} \\ + \frac{254245231794897253159}{199355947139484135000} M_{12}^{2} - \frac{21653}{115500} M_{12} M_{14} + \frac{262837614282547093857383}{236176835310829086828700} M_{13}^{2} + \frac{16560133}{93312175} M_{14}^{2} \end{array}

Next, we give the SOS expression of

Θ_{1} = \sum_{i = 1}^{14} κ_{i} {(\sum_{j = i}^{14} μ_{i, j} M_{i} M_{j})}^{2}

. The parameters not mentioned above are 0.

\begin{array}{l} κ_{1} = 1, κ_{2} = 443 / 157, κ_{3} = 9760565 / 3155072, κ_{4} = 29005915 / 29032448, \\ κ_{5} = 208680430142632541 / 1502617428470367000, \\ κ_{6} = 254845206367035693279770017 / 616731381005248623385150000, \\ κ_{7} = 3205317995201138587458416275628086155489003267179 / \\ 3029566856759437193752730379916263883723707019560, \\ κ_{8} = 129346265420106916568678950322309264192296788391 / \\ 152823044461521957391224862681910484518198052480, \\ κ_{9} = 27809943919324934445244380001625250511737806865364775965142311767209 / \\ 122092260292579205097028684025778705457781727828878414585029349764800, \\ κ_{10} = 424832608852609712256798704857924428777343282773304441738461658519256056247379 / \\ 4925558197128944002809370841764515187045912505370123534195449522984520918307840, \\ κ_{11} = 138872556785505338170420018992660029962225023247254592880803918765702192997559309557 / \\ 17445046689968868434326878732657870806791020754286779471873299119392675650009247955712, \\ κ_{12} = 140278968440223253899895444891338549285051104801125327426403925721337636757988731331 \\ 8704835141306089 / 3479869695739952558003981016858935978619719181780257106754389615137471 \\ 5789547779914385019009658472680, \\ κ_{13} = 604447646634282248350351358416617870250513088153272472223118344345175656211357785467 \\ 5650055082115013879911559085796418320811 / 1930330919766627607839400790038364046255969055 \\ 75207876468599876531144568637730192598055535034357490574533326935257744858897200, \\ κ_{14} = 573376402650759464814193517607183020227696371005835999431768228660749434037209496944 \\ 3798520188034237831888760036314143950893 / 1096088362600344822912522877798221456332750932 \\ 362094411886308854783332104852973001677722701445651310610269715521038140265139000, \\ μ_{i, i} = 1 (1 \leq i \leq 14), \\ μ_{1, 3} = 14 / 157, \\ μ_{1, 5} = - 39 / 55, μ_{1, 8} = 7133009 / 11042438, μ_{1, 9} = - 6453649 / 10620434, \\ μ_{1, 11} = 5581 / 27280, μ_{1, 13} = 3653 / 25576, \\ μ_{2, 4} = - 2041 / 113408, μ_{2, 6} = - 791441867 / 1692392900, μ_{2, 7} = - 1614857 / 3056700, \\ μ_{2, 10} = 394227 / 4410065, μ_{2, 12} = 625331 / 2303600, μ_{2, 14} = 178823 / 15447410, \\ μ_{3, 5} = 39831683744 / 243184476975, μ_{3, 8} = - 65560045349620353 / 159483119878645585, \\ μ_{3, 9} = - 306382997632769383 / 625596768584742350, μ_{3, 11} = - 31040751344 / 464456485525, \\ μ_{3, 13} = 303544066272 / 1504058167901, \\ μ_{4, 6} = - 18070850742068032 / 33437308892230375, μ_{4, 7} = 6316978375269568 / 9119266061517375, \\ μ_{4, 10} = - 203628062976 / 288753883825, μ_{4, 12} = 4537774192 / 67438752375, \\ μ_{4, 14} = 10676034477952 / 55123275954725), \\ μ_{5, 8} = - 384773385951616686292970285 / 773914896181275444840093733, \\ μ_{5, 9} = 748702786947138499895423 / 805251661545357453927439, \\ μ_{5, 11} = - 375807353452502501303 / 384389352322729140522, \\ μ_{5, 13} = 26671705865381497872323625 / 19133233141225605982754783, \\ μ_{6, 7} = - 62733895020493663769897851306 / 66514598861796315946019974437, \\ μ_{6, 10} = - 18192447632086750779448425576715 / 62363680140490038573879080400104, \\ μ_{6, 12} = 8261208607905215485888011779 / 308107854497746153175241950553, \\ μ_{6, 14} = 138676599045243082750813383316315 / 361032323039607556705731629293441, \\ μ_{7, 10} = - 979758613168820041603217521125493464258092802119268806781 / \\ 8564763461513580075513760193457485740528012470323034179704, \\ μ_{7, 12} = 1854028485320471742919224371249503332742540456277079 / \\ 1887932299173470628013007186344942745583022924368431, \\ μ_{7, 14} = 9727374195894641621834927966046970684551801110074467927 / \\ 100338964846699164038247636061398850749555639257552538545, \\ μ_{8, 9} = - 12779618730487291463191846129496757762906461696978267 / \\ 52695409839620717596246667003408149613413326996916618, \\ μ_{8, 11} = - 122118378516557819068765438108425804752485918809740 / \\ 3692965224009472574952352710652251801954265605351441, \\ μ_{8, 13} = - 94939523121681601513656137577561549398850896305080 / \\ 124043068537882532989363113359094584360412620066969, \\ μ_{9, 11} = 3550893523515801884306409421029786766431528385700436202236083804607990380 / \\ 25638571308047962888694144055478352577031830643123959622559965206737088489, \\ μ_{9, 13} = - 7945848812909987868058927152552583529722064868069915968379981242142702296181690979 \\ 8 / 94546593373096755745578683571472373707556639624258871883954420295939113462405459891, \\ μ_{10, 12} = 63405502958698345886668393872792270425893974774905390610957167604969534529672 / \\ 424832608852609712256798704857924428777343282773304441738461658519256056247379, \\ μ_{10, 14} = - 905925800492569322388118625829368447231159919975109986163690065715646157190607027 \\ 267176 / 10703799622824271570367336429542375230077925714200000663940636741048182250415120216 \\ 16865, \\ μ_{11, 13} = - 105476225900697044986144688198887230178243505011443868627920704994758066688450160 \\ 26062203618333419498 / 101430268009464405995394306109144217152680242592189500043254326952447 \\ 19795065794880212493780307453215, \\ μ_{12, 14} = - 789055628026834299501182987960627756240418386432379047235785969484045323353728731 \\ 20303046433951024275381808209 / 385246930821572992901155980884640884342608948610835474003000 \\ 440010481668921844997018828943090780516847699842935 . \end{array}

References

Shannon, C.E. A mathematical theory of communications. Bell Syst. Technol. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
Stam, A.J. Some inequalities satisfied by the quantities of information of Fisher and Shannon. Inf. Control 1959, 2, 101–112. [Google Scholar] [CrossRef]
Blachman, N.M. The convolution inequality for entropy powers. IEEE Trans. Inf. Theory 1965, 11, 267–271. [Google Scholar] [CrossRef]
Lieb, E.H. Proof of an entropy conjecture of Wehrl. Commun. Math. Phys. 1978, 62, 35–41. [Google Scholar] [CrossRef]
Verdú, S.; Guo, D. A simple proof of the entropy-power inequality. IEEE Trans. Inf. Theory 2006, 52, 2165–2166. [Google Scholar] [CrossRef]
Rioul, O. Information theoretic proofs of entropy power inequalities. IEEE Trans. Inf. Theory 2011, 57, 33–55. [Google Scholar] [CrossRef]
Costa, M.H.M. A new entropy power ineqaulity. IEEE Trans. Inf. Theory 1985, 31, 751–760. [Google Scholar] [CrossRef]
Costa, M.H.M. On the Gaussian interference channel. IEEE Trans. Inf. Theory 1985, 31, 607–615. [Google Scholar] [CrossRef]
Bergmans, P.P. A simple converse for broadcast channels with additive white Gaussian noise. IEEE Trans. Inf. Theory 1974, 20, 279–280. [Google Scholar] [CrossRef]
Dembo, A. Simple proof of the concavity of the entropy power with respect to added Gaussian noise. IEEE Trans. Inf. Theory 1989, 35, 887–888. [Google Scholar] [CrossRef]
Villani, C. A short proof of the ‘concavity of entropy power’. IEEE Trans. Inf. Theory 2000, 46, 1695–1696. [Google Scholar] [CrossRef]
Cheng, F.; Geng, Y. Higher order derivatives in Costa’s entropy power inequality. IEEE Trans. Inf. Theory 2015, 61, 5892–5905. [Google Scholar] [CrossRef]
Guo, L.; Yuan, C.M.; Gao, X.S. Lower bounds on multivariate higher order derivatives of differential entropy. Entropy 2022, 24, 1155. [Google Scholar] [CrossRef] [PubMed]
Zhang, X.; Anantharam, V.; Geng, Y. Gaussian optimality for derivatives of differential entropy using linear matrix inequalities. Entropy 2018, 20, 182. [Google Scholar] [CrossRef] [PubMed]
Guo, L.; Yuan, C.M.; Gao, X.S. A generalization of the concavity of Renyi entropy power. Entropy 2021, 23, 1593. [Google Scholar] [CrossRef] [PubMed]
McKean, H.P., Jr. Speed of approach to equilibrium for Kacs caricature of a Maxwellian gas. Arch. Rational Mech. Anal. 1966, 21, 343–367. [Google Scholar] [CrossRef]
Ledoux, M.; Nair, C.; Wang, Y.N. Log-Convexity of Fisher Information along Heat Flow; University of Toulouse: Toulouse, France, 2021. [Google Scholar]
Powers, V. Hilbert’s 17th problem and the champagne problem. Am. Math. Mon. 1996, 103, 879–887. [Google Scholar] [CrossRef]
Vandenberghet, L.; Boyd, S. Semidefinite programming. SIAM Rev. 1996, 38, 49–95. [Google Scholar] [CrossRef]
Toscani, G. A concavity property for the reciprocal of Fisher information and its consequences on Costa’s EPI. Phys. A Stat. Mech. Its Appl. 2015, 432, 15. [Google Scholar] [CrossRef]
Yeung, R.W.; Li, C.T. Machine-Proving of Entropy Inequalities. IEEE BITS Inf. Theory Mag. 2021, 1, 12–22. [Google Scholar] [CrossRef]
Yeung, R.W.; Yan, Y.-O. Information Theoretic Inequality Prover (ITIP), MATLAB Program Software Package. 1996. Available online: http://home.ie.cuhk.edu.hk/~ITIP (accessed on 22 March 2023).
Pulikkoonattu, R.; Diggavi, S. Xitip, ITIP-Based C Program Software Package. 2006. Available online: http://xitip.epfl.ch (accessed on 22 March 2023).
Csirmaz, L. A Minimal Information Theoretic Inequality Prover (Minitip). 2016. Available online: https://github.com/lcsirmaz/minitip (accessed on 22 March 2023).
Li, C.T. Python Symbolic Information Theoretic Inequality Prover (psitip). 2020. Available online: https://github.com/cheuktingli/ (accessed on 22 March 2023).
Grant, M.; Boyd, S.; Ye, Y. CVX: Matlab Software for Disciplined Convex Programming, Version 2.0 Beta. Available online: http://cvxr.com/cvx (accessed on 22 March 2023).

Figure 1. Figures for

\sqrt{I (X_{t})}

and

log I (X_{t})

which are convex in t.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Square Root Convexity of Fisher Information along Heat Flow in Dimension Two

Abstract

1. Introduction

2. Preliminaries

2.1. Notations and Preliminary Results

2.2. Constraints

3. Proof of Theorem 1

3.1. An Illustrative Example

3.2. Compute Constraints

3.3. Reduce to Quadratic Form

3.4. Reduction to Semidefinite Positiveness of a Quadratic Form

3.5. Prove Theorem 1 by Solving an SDP Problem

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Data Availability Statement

Conflicts of Interest

Appendix A. Matlab Codes for SDP

Appendix B

Appendix C. Solutions to Problem 2

Appendix D. Solutions to Problem 5

Appendix E. SOS Expression of Θ₁

References

Article Metrics

Citations

Article Access Statistics

Square Root Convexity of Fisher Information along Heat Flow in Dimension Two

Abstract

1. Introduction

2. Preliminaries

2.1. Notations and Preliminary Results

2.2. Constraints

3. Proof of Theorem 1

3.1. An Illustrative Example

3.2. Compute Constraints

3.3. Reduce to Quadratic Form

3.4. Reduction to Semidefinite Positiveness of a Quadratic Form

3.5. Prove Theorem 1 by Solving an SDP Problem

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Data Availability Statement

Conflicts of Interest

Appendix A. Matlab Codes for SDP

Appendix B

Appendix C. Solutions to Problem 2

Appendix D. Solutions to Problem 5

Appendix E. SOS Expression of Θ1

References

Article Metrics

Citations

Article Access Statistics

Appendix E. SOS Expression of Θ₁