1. Introduction
The study of matrix means plays a crucial role in various mathematical and applied fields, particularly in matrix analysis, optimization, and quantum information theory. Among these means, the geometric mean [
1] is one of the most extensively studied. It is well known [
2] that the geometric mean serves as the barycenter of two given positive definite matrices under the Riemannian metric. The weighted matrix geometric mean, which serves as the geodesic curve connecting two given matrices in quantum information in the form of quantum entropic [
3,
4] and fidelity [
5,
6,
7,
8] measures. In 1980, Kubo and Ando [
9] introduced an axiomatic framework for matrix means on the set of positive definite matrices. These matrix means play a central role in defining various quantum divergences in quantum information theory ([
10,
11,
12,
13,
14,
15]).
The spectral geometric mean of positive definite matrices was introduced by Fiedler and Pták in 1997 [
16]. Later, in 2015, Kim and Lee [
17] defined the weighted spectral mean and studied several of its properties. Although the spectral geometric mean has received less attention compared with the geometric mean, it remains an object of interest among researchers.
In [
18], Bhatia, Jain, and Lim investigated the Bures–Wasserstein distance between positive definite matrices and solved the least squares problem to determine the barycenter of two positive definite matrices, known as the weighted Wasserstein mean. Notably, the weighted spectral geometric mean and the weighted Wasserstein mean are, in some sense, generated by the functions
and
, respectively. This suggests that the relationship between these means may exhibit similarities to that between the weighted geometric mean and the weighted arithmetic mean.
Recently, there has been a growing body of research on these means. Franco and Dumitru [
19] introduced a near-order relation. Building on this, researchers such as J. Kim, H. Huang, and L. Gan further explored various inequalities associated with different order relations [
20,
21]. More recently, Furuichi discovered connections between the spectral geometric mean and Tsallis entropy in quantum information theory [
22].
In this paper, we introduce the concept of the 2-geometric mean and investigate its relationships with the spectral geometric mean and the Wasserstein mean. These formulations provide a foundation for revisiting and further exploring their properties. Additionally, we establish several inequalities satisfied by these means within the context of the near-order relation.
The paper is organized as follows: In
Section 2, we recall fundamental definitions and properties of matrix means. In
Section 3, we introduce new representations of the spectral geometric mean and the Wasserstein mean, along with concise proofs of their fundamental properties. In
Section 4, we reexamine the near-order relation and derive general inequalities for these means. Furthermore, we establish norm inequalities and identify conditions under which equality holds.
2. Preliminaries
Let
be the set of all
positive definite matrices,
be the space of all
Hermitian matrices, and
be the group of
unitary matrices. For
, the geometric mean
was firstly defined by Puzs and Woronowicz [
1] in 1975. They showed that it is the unique positive definite solution to the Riccati equation
It is well-known [
2] that the geometric mean
is the midpoint of the geodesic
joining
A and
B under the Riemannian metric
, where
denotes the Frobenius norm.
The spectral geometric mean of
was introduced by Fiedler and Pták in 1997 [
16], and one of its formulations is
It is called
the spectral geometric mean because
is similar to
, and the eigenvalues of their spectral mean are the positive square roots of the corresponding eigenvalues of
([Theorem 3.2] in [
16]).
In 2015, Kim and Lee [
17] defined the weighted spectral mean:
It is obvious that
is a curve joining
A and
B. They studied the relative operator entropy associated with the spectral geometric mean, as well as several properties similar to those of the Tsallis relative operator entropy defined via the matrix geometric mean.
Note that in (
2), the geometric mean
is a main component of the weighted spectral mean
, while the middle term is
A, independent of
t. We define a new weighted mean below.
It is obvious that
and
; hence,
is a curve joining
A and
B. For
,
is the spectral geometric mean (
1). We call
the weighted
F-mean and it is different from (
2).
Although the spectral geometric mean appears to be less studied and less prominent than the geometric mean, it remains of interest to several researchers. One of the motivations for studying the spectral geometric mean is that its trace corresponds to the quantum fidelity:
In 1980, Kubo and Ando developed a general theory of matrix means for positive definite matrices [
9]. Let
f be a positive real-valued function on
. For any positive integer
n and positive definite matrices
in
, we define a binary operation
on
as follows:
This binary operation is called an
operator connection if the following requirements are fulfilled:
- (1)
and imply ;
- (2)
;
- (3)
If and , then .
An operator connection is called an operator mean or a matrix mean if , and is called symmetric if for all in . Kubo and Ando showed that there exists an affine order-isomorphism from the class of connections to the class of positive operator monotone functions on , which is given by .
The weighted Wasserstein mean
[
18] is the geodesic curve joining
A and
B in
with respect to the Wasserstein distance:
If we denote
then from the Riccati equation,
And, the weighted Wasserstein mean adopts the following form
Recently, Franco and Dumitru introduced a near-order relation defined by
. Then, T.Y. Tam, J. Kim, H. Huang, F. Furuichi, Y. Seo, and L. Gan, among others, have continued to investigate these objects and explore various inequalities within different order relations [
20,
21] and applications in quantum information.
3. The 2-Geometric Mean
In this section, we provide a new form of the spectral geometric mean and establish new simple proofs for its basic properties. The formula of the weighted geometric mean
can be extended to any real number
t. For
, we notice that
Also,
Indeed,
, if and only if, according to the Riccati equation,
From (
1) and (
6), we have the following new form of the spectral geometric mean.
Definition 1. For positive definite matrices A and B in , the spectral geometric mean A is represented as Similarly, we have the following representations:
The new weighted spectral geometric mean
[
23] is represented as
- ()
The weighted spectral geometric mean
is defined as
- ()
The Wasserstein mean [
18] is represented as
where
From this point of view, one can see that the difference between the spectral geometric mean and the Wasserstein mean lies in how they depend on the function of . This distinction is particularly evident in the form of the 2-geometric mean. Therefore, employing this notation could lead to broader generalizations.
Recall also that Kraus’ theorem (see, for example, [Theorems 8.1] in [
24]) characterizes completely positive maps, which model quantum operations between quantum states. Informally, the theorem guarantees that the action of any such quantum operation
on a quantum state
can always be written as
for some set of matrices
satisfying the inequality
where
I denotes the identity matrix. Therefore, if
are positive definite matrices, then
Notice that in the definition of the spectral geometric mean, Fiedler and Pták introduced a set of properties that this mean must satisfy, but they did not explain why
is included in the definition. However, let us provide some reasons for its existence. The following result can be found in ([Theorem 3.19] in [
25]), but for the convenience of readers, we provide a short proof here.
Proposition 1 (Theorem 3.19 in [
25])
. For given positive definite matrices A and B, the geometric mean is the unique minimizer of the loss function Proof. Recall that the Fréchet derivative of a function
at
X in the direction of a perturbation
H is defined as follows:
Applying (
11) to
, for any Hermitian matrix
H, we have
Setting
for any Hermitian
H, we obtain
or,
From the Riccati equation, it follows that
, since the geometric mean is symmetric. □
In the case where
A and
B are two density matrices—which represent quantum states in quantum information theory—the function value
evaluate at the minimizer
corresponds precicely to the quantum fidelity ([Theorem 3.19] in [
25]):
We collect some properties of that we often use in the paper.
Lemma 1. For positive definite matrices , and C,
- (A1)
- (A2)
; hence,
- (A3)
for some given function f.
- (A4)
If , then for any positive definite matrix X,
Proof. Properties (A1) and (A2) follow from (
6). For (A3), we have
The identity (
12) can be rewritten as
Multiplying both sides by
, we obtain
□
Let us use the new forms to re-establish some basic properties of the spectral geometric means mentioned above.
Proposition 2. For positive definite matrices A and B in , the following are satisfied:
- (i)
- (ii)
- (iii)
When , we have
- (iv)
- (v)
, if and only if ;
- (vi)
- (vii)
Proof. (i) Using (A3) and the fact that
, we have
(iii) Firstly, using (A1), we have the following
On account of (i), from (
13), we have
As a consequence of (
13) and (
14), we have
(iv) Using the representation (
) and (A3), we have
(v) We have
if and only if
Since
it is easy to check that the identity
, if and only if
(vi) The proof of this identity is similar to the proof of (iii), considering that
where
and the second identity follows from (A1).
(vii) For any
, the inequality is equivalent to the following
which is equivalent to
where
B is replaced with
. The last inequality is obvious since
□
Recall that the arithmetic mean and the harmonic mean are dual, which means , and the functions satisfy the relation . Between the arithmetic mean, the geometric mean, and the harmonic mean, there are two main relations:
- (R1)
for any ;
- (R2)
Dinh, Le, and Vo [
26] proved that for an arbitrary Kubo–Ando matrix mean
, such that
, for positive definite matrices
X and
Y,
or, equivalently,
This identity (
16) has an interesting geometrical interpretation: the barycenter of
and
coincides with the barycenter of
X and
Y with respect to the Riemannian distance.
Recently, Franco [
27] obtained a similar identity for the spectral geometric mean and the Wasserstein mean, as
In the following theorem, we establish a more general result of (
17).
Proof. Let
and
. Then, on account of (A4), we have
Consequently,
where the last identity follows from the fact that
. □
4. Near Order Relation and Means
Recently, Franco and Dumitru introduced the near order relation ⪯ on
as follows:
This order is weaker than the traditional Loewner order, which states that for Hermitian matrices
A and
B,
if and only if
From Proposition 1,
if the inverse of the minimizer of
is a contraction.
In this section, we review some inequalities in this the near order relation. Fortunately, many results can be obtained easily with the 2-geometric mean form for the spectral geometric mean and the Wasserstein mean.
The following lemma is crucial when we consider inequalities with respect to the near order. During the preparation of this note, we were informed that there is a similar result in Franco’s paper [
27], which is in a different form.
Lemma 2. Let A be a positive definite matrix, and let for any positive Then, for any positive definite matrix X,
- (i)
- (ii)
where is the operator norm.
Proof. On account of Lemma 1, inequality (i) is equivalent to the following
which is true because of the assumption.
(ii) Suppose that , we have . This means, □
The following result was obtained by Huang and Gan [
28] which is an obvious consequence of Lemma 2.
Theorem 2. The following are equivalent for , and for
- (i)
;
- (ii)
whenever ;
- (iii)
whenever .
Proof. Observe that the condition is equivalent to that Also, the representing function of is This function is increasing in t, if and only if . Similarly, the representing function of is . This function is increasing for any x. □
In the following theorem, we prove a similar relation to (R2), which was obtained in [
27].
Theorem 3. Let , and . Then, Proof. The inequalities in (
18) are equivalent to
According to Lemma 2, it is enough to show that
for any
and for any
We show, for example, the second inequality. It is not difficult to see that the function
attains a minimum at
for any
. This means,
. □
Remark 1. Notice that the second inequality in (18) was proven by Huang and Gan in [28]. Our proof is essentially shorter. Also, Huang and Gan showed that if then , and if , then for any , Again, these statements can be proved by analyzing the sign of the function , in term of x or t on . In the following, we establish some norm inequalities for the spectral geometric mean and the Wasserstein mean.
Theorem 4. Let , and . Then
- (i)
;
- (ii)
where is the operator norm.
Proof. (i) can be found in [
29]. Here, we provide a short direct proof. Suppose that
or,
, we have
and
. By monotonicity of the geometric mean, we have
From here,
The second inequality in (ii) was proved in [
30]. However, all inequalities in (ii) are direct consequences of Lemma 2. □
To conclude this note, we establish new characterizations of central elements in the algebra
. We begin with the following fact, which is a special case of the main result in [
31]. We also note that some global characterizations of the trace and commutativity were previously studied by the fourth author and his co-authors in [
32,
33].
Lemma 3.
Let . Suppose that for any Hermitian matrix X:Then, A is a scalar multiple of the identity. Theorem 5. Let A be a positive definite matrix. Then
- (i)
for any positive definite matrix X, if and only if A is a scalar multiple of the identity matrix;
- (ii)
if and only if ;
- (iii)
if and only if .
Proof. (i) was obtained in [
29]. Here, we provide a direct proof. Let
. It follows that
, if and only if
This means,
, if and only if
where
Since
we have that
This means,
, if and only if
. From here, it follows that
A and
are commuting; hence,
and
Y are commuting. As a consequence, we can see that
B and
are commuting. By Lemma 3,
A commutes with every matrix
Thus,
A must be a scalar multiple of the identity matrix.
(ii) From the assumption, it implies that , if and only if The first inequality is equivalent to This occurs if and only if , which implies that Consequently, From here, one can see that
(iii) can be proved using similar arguments. □
Remark 2. We would like to obtain a similar result for the equality . Unfortunately, our proof could not be transferred to the general case.