1. Introduction
How can classic inequalities and results be extended, refined, or estimated if we extend more intrinsic theorems from the perspective of more general functions? This paper presents one approach. We first recall some classic inequalities and existing theorems.
We denote by
$$A_n = \frac{1}{n}\sum_{i=1}^{n} x_i, \qquad G_n = \Big(\prod_{i=1}^{n} x_i\Big)^{1/n}, \qquad H_n = \frac{n}{\sum_{i=1}^{n} 1/x_i}$$
the unweighted arithmetic, geometric, and harmonic means of positive real numbers $x_1, \dots, x_n$ with $x_i > 0$. The arithmetic mean is the most commonly used average in daily life. The geometric mean is applied to data with an exponential growth property, such as investment growth [1], production growth, and population growth. The compound annual growth rate (CAGR) $r$ over $n$ years is calculated from the growth rate $r_i$ in the $i$-th year as follows:
$$1 + r = \Big(\prod_{i=1}^{n} (1 + r_i)\Big)^{1/n}.$$
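As a numerical illustration of the CAGR relation, the sketch below uses hypothetical yearly rates (the figures are invented for illustration, not taken from the paper):

```python
import math

def cagr(rates):
    """Compound annual growth rate: 1 + r = (prod(1 + r_i)) ** (1/n)."""
    growth = math.prod(1 + r for r in rates)
    return growth ** (1 / len(rates)) - 1

# Hypothetical yearly growth rates: +10%, +20%, -5%.
rates = [0.10, 0.20, -0.05]
r = cagr(rates)
# Compounding at the constant rate r reproduces the total growth.
assert abs((1 + r) ** 3 - math.prod(1 + x for x in rates)) < 1e-12
```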
The equivalent resistance of two resistors connected in parallel is half the harmonic mean of the two resistance values. The overall velocity $v$ over a distance $2l$, with velocity $v_1$ in the first half distance $l$ and velocity $v_2$ in the second half distance $l$, is the harmonic mean of $v_1$ and $v_2$, calculated as follows:
$$v = \frac{2}{1/v_1 + 1/v_2} = \frac{2 v_1 v_2}{v_1 + v_2}.$$
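These two harmonic-mean facts can be checked in a few lines; the resistor and speed values below are illustrative, and the parallel-resistance formula R1*R2/(R1+R2) used here is the standard one (it equals half the harmonic mean of the two resistances):

```python
def harmonic_mean(xs):
    """Unweighted harmonic mean H_n = n / sum(1/x_i) of positive numbers."""
    return len(xs) / sum(1 / x for x in xs)

# Two resistors in parallel: R = R1*R2/(R1+R2), which is half of H_2(R1, R2).
R1, R2 = 4.0, 12.0
assert abs(R1 * R2 / (R1 + R2) - harmonic_mean([R1, R2]) / 2) < 1e-12

# Average speed over two equal half-distances at speeds v1 and v2
# is the harmonic mean 2*v1*v2/(v1 + v2).
v1, v2 = 60.0, 40.0
assert abs(harmonic_mean([v1, v2]) - 2 * v1 * v2 / (v1 + v2)) < 1e-12
```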
The fundamental inequalities are written as follows:
$$H_n \le G_n \le A_n.$$
These formulas have Ky Fan-type refinements. Denote by $A_n'$, $G_n'$, and $H_n'$ the corresponding means of $1 - x_1, \dots, 1 - x_n$. The second inequality on the right side of (1) is due to Ky Fan [2] (p. 5) and is known as the Ky Fan inequality in the literature, while the first inequality on the left side, known as the Wang–Wang inequality, was proposed in [3]. The inequality is written as follows:
$$\frac{H_n}{H_n'} \le \frac{G_n}{G_n'} \le \frac{A_n}{A_n'}, \tag{1}$$
which holds for $x_i \in (0, 1/2]$.
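The Ky Fan-type chain can be verified numerically for sample data; the sketch below uses the standard statement $H_n/H_n' \le G_n/G_n' \le A_n/A_n'$ for $x_i \in (0, 1/2]$, where the primed means are taken over $1 - x_i$:

```python
import math
import random

def a_g_h(xs):
    """Arithmetic, geometric, and harmonic means of positive numbers."""
    n = len(xs)
    return (sum(xs) / n,
            math.prod(xs) ** (1 / n),
            n / sum(1 / x for x in xs))

random.seed(0)
x = [random.uniform(0.01, 0.5) for _ in range(10)]  # x_i in (0, 1/2]
A, G, H = a_g_h(x)
A1, G1, H1 = a_g_h([1 - xi for xi in x])            # means of 1 - x_i

# Wang-Wang and Ky Fan refinements of H <= G <= A:
assert H / H1 <= G / G1 <= A / A1 <= 1
```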
Part of (2) was proved in [4], and the whole chain (2) was proved in [5]. The inequality is written as follows:
which holds for .
These results were followed by further refinements and generalizations in [5,6,7,8,9,10,11,12,13,14,15].
In [16], the following Conclusion 1 was established for convex functions on three intervals, in order to generalize the Levinson inequality and unify the right sides of the Ky Fan-type inequalities (1) and (2). As pointed out in the same paper, it also has a connection with the Lah–Ribarič inequality [17], and its extension can be used to prove the Hermite–Hadamard inequality [18].
Conclusion 1. Let and be a given function. Let and be two nonnegative functions. If is increasing in and then the following: holds for every ϕ that is convex such that the integrals exist.
In [19], Conclusion 1 was further extended and was also used to establish an inequality for the Csiszár ϕ-divergence. Note that this divergence can quantify the difference between two probability distributions, making it a useful tool for hypothesis testing and statistical inference; it can quantify the difference between the true distribution and an estimated distribution, making it a useful tool for data compression; and it can quantify the difference between the true distribution and a predicted distribution, making it a useful tool for classification problems and machine learning.
Recall the notion of the Csiszár ϕ-divergence [20,21,22]. Given a convex function $\phi$, the ϕ-divergence functional, written as follows:
$$I_\phi(\mathbf{p}, \mathbf{q}) = \sum_{i=1}^{n} q_i\, \phi\!\left(\frac{p_i}{q_i}\right),$$
is a generalized measure of information, a “distance function” on the set of probability distributions. By appropriately defining the convex function $\phi$, various divergences are derived; see Chapter 1 in [22] and Chapter 9.2 in [20], as well as the related references. In [19], the following comparison between two different ϕ-divergences is established.
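The ϕ-divergence functional can be sketched in a few lines. The instances of ϕ chosen below (Kullback–Leibler, Hellinger, chi-square) are the standard textbook choices, and the name `phi_divergence` and the sample distributions are assumptions for illustration:

```python
import math

def phi_divergence(phi, p, q):
    """Csiszar phi-divergence I_phi(p, q) = sum_i q_i * phi(p_i / q_i),
    assuming strictly positive distributions p and q."""
    return sum(qi * phi(pi / qi) for pi, qi in zip(p, q))

p = [0.2, 0.5, 0.3]
q = [0.4, 0.4, 0.2]

# Standard instances, obtained by choosing the convex function phi:
kl   = phi_divergence(lambda t: t * math.log(t), p, q)          # Kullback-Leibler
hel  = phi_divergence(lambda t: (math.sqrt(t) - 1) ** 2, p, q)  # Hellinger
chi2 = phi_divergence(lambda t: (t - 1) ** 2, p, q)             # chi-square

# Each is nonnegative and vanishes when p == q (by Jensen's inequality).
assert min(kl, hel, chi2) >= 0
assert phi_divergence(lambda t: t * math.log(t), p, p) == 0
```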
Theorem 1. Let be a convex function. If and the following: for some , then we have the following: This theorem also holds for probability distribution functions (see [19]). The inequality looks similar to the data processing inequality in the discrete case, but we do not impose an assumption for some , instead supposing a condition on the ratio of .
In this paper, we give identity expressions for the two inequalities (5) and (6); by letting the function be convex, we obtain the inequalities of previous papers as special cases. The identity can also establish the known trapezoid-type inequality (7) (which can be seen as an estimate for the Hermite–Hadamard-type inequality). As the first application, we use these identities to extend the Levinson inequality and give new refinements and reverses of the Ky Fan-type inequalities (1) and (2). As the second application, we use the identity to obtain new inequalities for special examples of distance functions. Throughout this paper, we use the following notation:
2. Identity Extension
In this section, we give identity expressions for Conclusion 1 and Theorem 1. Some special cases and a corollary are also mentioned.
Theorem 2. Let and be a given function. Let and be two nonnegative functions. If is increasing in and (3), (4) are satisfied, then for such that is absolutely continuous, we have the following: where the following is obtained: Furthermore, we have .
Proof. In (14) of Theorem 14, set and the following: in Theorem 2; taking (3) and (4) into consideration, we obtain the identity.
As for the nonnegativity of , this can be divided into three situations.
1. If , then we obtain the following:
2. If , then we obtain the following: for a certain .
2.1. If , then we obtain the following:
2.2. If , then according to (3) and (4) we have the following:
3. If , then we obtain the following: for a certain , and according to (3) and (4) we have the following:
Hence, in all situations we affirm . □
From Theorem 2 we can prove the following theorem, which gives the upper and lower bounds.
Theorem 3. Let and be a given function. Let and be two nonnegative functions. If is increasing in and (3), (4) are satisfied, then for such that , we have the following:
Proof. Utilize Theorem 2 to obtain the following expression: then from in Theorem 2 and we have
Use Fubini’s theorem to obtain the following: where the last step is due to (3) and (4). □
Remark 1. It is easy to observe the following:as it transforms from . By putting in Theorem 3, we obtain Conclusion 1 for twice differentiable convex functions ϕ.
From Theorem 2, another upper bound can also be given as follows.
Theorem 4. Let and be a given function. Let and be two nonnegative functions. If is increasing in and the following: then for such that , we have the following: for , where we obtain the following:
Proof. Take absolute values and apply the Hölder inequality to Theorem 2. □
From Theorem 4 we can deduce the following extension of the trapezoid inequality; see Chapter 2 in [18].
Corollary 1. Let be twice differentiable such that ; then we have the following: for , where B is Euler’s Beta function.
Proof. In Theorem 4, set , ; it is clear that all the conditions are satisfied. Then, taking , we have the following:
With some calculation, we obtain the following:
from which we obtain the desired result. □
Taking , we recover the original trapezoid inequality, written as follows:
It is very useful in numerical integration: if we cut the interval into n smaller subintervals of equal length, the remainder term (error) becomes much smaller, because of the following:
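The decay of the remainder term can be observed numerically. The sketch below uses the standard composite trapezoid rule (not the paper's generalized estimate), with exp on [0, 1] as an illustrative integrand; doubling the number of subintervals shrinks the error by roughly a factor of four, matching the O(1/n^2) behavior:

```python
import math

def composite_trapezoid(f, a, b, n):
    """Composite trapezoid rule with n equal subintervals."""
    h = (b - a) / n
    return h * (0.5 * (f(a) + f(b)) + sum(f(a + i * h) for i in range(1, n)))

# f(x) = exp(x) on [0, 1]; the exact integral is e - 1.
exact = math.e - 1
err_n = abs(composite_trapezoid(math.exp, 0.0, 1.0, 10) - exact)
err_2n = abs(composite_trapezoid(math.exp, 0.0, 1.0, 20) - exact)

# Doubling n shrinks the error by a factor of about 4, i.e. O(1/n^2).
assert 3.5 < err_n / err_2n < 4.5
```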
Remark 2. Another corollary is obtained by setting , , and taking ; we can obtain an estimate for which is more difficult to calculate directly.
We now give the identity expression for Theorem 1, without first assuming to be convex.
Theorem 5. If , then for such that is absolutely continuous, we have the following:
Proof. In (13) of Theorem 14, set in Theorem 5; we obtain the expression for . Similarly, in (13) of Theorem 14, set in Theorem 5; we obtain the expression for . Taking into consideration, we complete the proof. □
We then state two lemmas below in which, if satisfy some further conditions, we have the following:
The proofs are given in the Some Lemmas section.
Lemma 1. In Theorem 5, if we further suppose the following: for some , then we obtain the following: for all . Lemma 2. In Theorem 5, if we further suppose , which means , and (here we use q to refer to either or ), then we obtain the following: (the sequence majorizes the sequence with the weight q), then we obtain the following: for all . Combining Theorem 5, Lemma 1, and Lemma 2, we have the following estimate for .
Theorem 6. Under the assumptions of Theorem 5, if the condition in Lemma 1 or Lemma 2 is further satisfied, then we obtain the following: for such that .
Proof. Under the assumptions of Lemma 1 or Lemma 2, we have the following: and from Theorem 5 we prove the conclusion. □
Remark 3. Setting , we obtain Theorem 1 for twice differentiable convex functions ϕ.
Remark 4. If , then we have a more direct expression, written as follows: for such that .
3. Application
In this section, we first use Theorem 3 to establish Levinson and Ky Fan-type inequalities, and then use Theorem 6 and Remark 4 to give several special examples of distance functions in mathematical statistics and information theory.
In [16], the following general inequality for 3-convex functions was established, which can be used to prove the Jensen inequality and the Levinson inequality, refine the power means inequality, and unify two Ky Fan-type inequalities.
Conclusion 2. Suppose that is a continuous 3-convex function; for , the inequality written as follows:
holds. In this section, based on Theorem 3, we further extend this conclusion and use the extension to refine Ky Fan-type inequalities. These results are generalizations of the Levinson, Jensen, and Ky Fan inequalities. The condition is more general than , as may be .
Theorem 7. Let be a continuous function such that . For , we have the following: And we also have the following:
Proof. The proof is divided into two parts.
1. Term (9) satisfies all the corresponding conditions in Theorem 3. It is easy to observe the following: where and , if t belongs to exactly k interval(s) of all , written as follows:
Then, in Theorem 3, we can set in Theorem 7. This is feasible, as we only need to check that conditions (3) and (4) hold.
First, we obtain the following: then the following: which satisfies condition (4).
Second, we obtain the following: then the following: which satisfies condition (3).
Thus, we can use Theorem 3 to obtain the following: and
2. Calculate terms (8) and (10), and verify nonnegativity. which is equivalent to the desired conclusion. □
Theorem 7 is powerful enough to deduce several old and new inequalities below.
Remark 5. Taking in Theorem 7, we obtain Conclusion 2; if we further set , we obtain the Levinson inequality.
We can also deduce a known estimate for the Jensen inequality below.
Corollary 2. Let be a continuous function such that . For , we have the following:
Proof. In Theorem 7, set , and multiply the inequality by . Letting , we obtain the following: and letting , we obtain the corollary. □
We then use Theorem 7 to establish refinements and reverses of the Ky Fan-type inequalities (1) and (2) mentioned in the Introduction; these special cases are new.
Theorem 8. Denoting by the arithmetic and geometric means of positive real numbers with , we have the following: for , where , and the following:
Proof. Set , defined on , in Theorem 7, for and . As , we know the following:
Thus we can use Theorem 7. With some calculation and simplification, then taking the exponential, we obtain the desired results. □
Remark 6. The Ky Fan inequality is equivalent to the following: for . The readers may also establish a similar refinement and reverse as in the proof above. For the following discussion, consider two investment products that have inverse yields in each period i, for (e.g., an ETF and its inverse ETF).
If is nonnegative in each period of time, it is clear that the overall investment yield of X is higher than that of Y after n periods, which is written as follows: With (11) we obtain a more accurate estimate, written as follows: where is the arithmetic average of X’s yields over n periods. Applying the refinement and reverse of the Ky Fan inequality, we can obtain even more detailed estimates, which are left to the readers.
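The inverse-ETF observation can be illustrated with hypothetical yields (the figures below are invented for illustration and are not from the paper):

```python
import math

# Hypothetical yearly yields of product X; the inverse product Y yields -r_i.
r = [0.05, 0.12, 0.03, 0.08]
x_growth = math.prod(1 + ri for ri in r)
y_growth = math.prod(1 - ri for ri in r)

# With nonnegative yields, X beats Y overall ...
assert x_growth >= y_growth
# ... while holding both together loses money, because
# (1 + r_i) * (1 - r_i) = 1 - r_i**2 <= 1 in every period.
assert x_growth * y_growth <= 1
```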
Theorem 9. Denoting by the arithmetic and geometric means of positive real numbers with , we have the following: for , where the following is formulated:
Proof. Set , defined on , in Theorem 7, for and . As , we know the following:
Thus we can use Theorem 7. With some calculation and simplification, taking the exponential, we obtain the desired results. □
Remark 7. If , then we obtain the following: for . Readers may also establish a similar refinement and reverse as in the proof above, replacing 1 with λ. For the following discussion: funds with better annual yield performance often have higher management fee rates, so it is important to consider the actual investment yields after paying fees. Here we consider a simplified model.
Suppose the “better” fund X has the annual yield for each year i, with a fixed annual management fee rate , and the “worse” fund Y has the annual yield for each year i, without a management fee. Define .
From (12) we have the following: thus, we obtain the following: This means that if the annual fee rate , then the actual yield of fund X must be higher than that of Y after n years. Sometimes we do not even need to know the actual value of each . For example, if we know the worse fund Y has a negative yield each year, then we affirm ; if we further know that the annual fee rate of X satisfies , then we predict that the actual yield of fund X must be higher than that of Y.
Applying the refinement and reverse of the Ky Fan-type inequality, we obtain a more detailed prediction of when X has an actually higher or lower yield than Y, depending on the annual fee rate θ. This is left to the readers.
For similar converses of the Ky Fan inequality with the following form: for some C depending on , see [9,23].
We now list some examples of distance functions and use Theorem 6 and Remark 4 to give inequalities for these distance functions.
Definition 1. For the following: the ϕ-divergence is calculated as follows: the Kullback–Leibler distance.
We have the following estimation for two different pairs of Kullback–Leibler distances.
Theorem 10. For with , if the condition in Lemma 1 or Lemma 2 is further satisfied, then the following is calculated:
Proof. As , we have the following: And under the assumption of Lemma 1 or Lemma 2, we have the following: Then we can use Theorem 6 and Remark 4 to obtain the result. □
Definition 2. Let the following be true:the ϕ-divergence is as follows:the Hellinger distance.
We have the following estimation for two different pairs of Hellinger distances.
Theorem 11. For with , if the condition in Lemma 1 or Lemma 2 is further satisfied, then the following is calculated: Proof. As , by a discussion similar to that in the theorem above, we obtain the result. □
Definition 3. For , let the following be true: the ϕ-divergence is written as follows: the α-order entropy. And the Rényi divergence of order α is defined by the following: We have the following estimation for two different pairs of α-order entropies.
Theorem 12. For with , if the condition in Lemma 1 or Lemma 2 is further satisfied, then the following is calculated: Proof. As , by a discussion similar to that in the theorem above, we obtain the result. □
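Under the standard normalization of the Rényi divergence (the paper's exact convention is not reproduced here), a short numerical sketch, including the well-known limit toward the Kullback–Leibler distance as the order tends to 1:

```python
import math

def renyi_divergence(p, q, alpha):
    """Standard Renyi divergence of order alpha (alpha > 0, alpha != 1):
    D_alpha(p || q) = log(sum_i p_i**alpha * q_i**(1 - alpha)) / (alpha - 1)."""
    s = sum(pi ** alpha * qi ** (1 - alpha) for pi, qi in zip(p, q))
    return math.log(s) / (alpha - 1)

p = [0.2, 0.5, 0.3]
q = [0.4, 0.4, 0.2]
kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))

# As alpha -> 1, the Renyi divergence tends to the Kullback-Leibler distance.
assert abs(renyi_divergence(p, q, 1.0001) - kl) < 1e-3
```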
Definition 4. Let the following be true: the ϕ-divergence is calculated as follows: the -distance.
We have the following estimation for two different pairs of -distances. This is a special case of the identity.
Theorem 13. For with , if the condition in Lemma 1 or Lemma 2 is further satisfied, then the following is calculated: Proof. As , the inequality in Theorem 6 becomes an identity. □
Remark 8. If we let in the four theorems above, we obtain estimates for these four types of divergence .