1. Introduction: The Chentsov Uniqueness Theorem and the Chentsov–Morozova Problem
The basic theorems of classical and quantum information geometry are categorical in character: one is the Chentsov theorem and the other one is its quantum counterpart, the Petz–Kubo–Ando (PKA) theorem.
Let us first describe the structure of the Chentsov theorem. The idea is as follows: Imagine that we want to use a family of Riemannian metrics on the family of the simplexes of probability vectors to distinguish the states (namely the probability vectors themselves). It would be natural that these metrics should contract under the effect of noise, namely under the effect of the morphisms of such structures, which are the stochastic maps; the distance between states could shrink if we muddy the waters. Let us translate this into a formal mathematical structure.
If
is a differential manifold, let us denote by
the tangent space to
in the point
. In the present commutative case, we define a
Markov morphism as a stochastic map,
. Let
The tangent space of
can be naturally represented as
A
monotone metric will be any family of Riemannian metrics
on
,
, such that
holds for every Markov morphism
, for every
and for every
.
Let us remember that Fisher Information is the Riemannian metric on
, defined as
Rao was the first to realize that Fisher Information was indeed a Riemann metric on statistical models. The surprising result, as proven by Chentsov (see [
1]), is as follows:
Theorem 1. There exists a unique monotone metric on (up to scalars) given by the Fisher information.
How do we generalize this in the quantum setting? Chentsov himself and Morozova were the ones to correctly formalize this new categorical problem (see [
2]).
We denote by
(resp.
), the space of complex (resp. self-adjoint)
matrices and define
as the space of the faithful states. This means
Due to the needs of quantum dynamics in the non-commutative case, a
Markov morphism should be defined as a completely positive and trace-preserving operator
. There exists a straightforward identification of
with the space of self-adjoint traceless matrices; namely, for any
,
Emphasizing the perfect analogy with the classical case, a
monotone metric or
Quantum Fisher Information in the non-commutative case is defined as a family of Riemannian metrics
on
,
, such that
holds for every Markov morphism
, for every
, and for every
.
Again, we see that distances becomes shorter under noise effect. It is now time to see if non-commutative monotone metrics exist and how we can classify them. The Fisher metric comes from division by ρ, but there is no natural division by ρ in the quantum setting.
To solve this problem, we need more complex mathematical instruments. This is the argument contained in the following sections.
2. Means for Positive Numbers and the Function
A basic ingredient to answer the Chentsov–Morozova problem is the notion of operator means. To introduce this, we first need to understand the notion of numerical means.
Definition 1. Let . A mean for pairs of positive numbers is a function such that (see [3]) (1) ;
(2) ;
(3) ;
(4) ;
(5) is continuous;
(6) for one has .
Definition 3. is the class of functions such that
(i) f is continuous;
(ii) f is monotone increasing;
(iii) ;
(iv) .
Proposition 1. There is a bijection between and given by the formulas Here we have some examples of means and of the associated representing function (
Table 1).
Remark 1. It is possible to prove that, in the above table, the representing functions are concave and more: they are all operator concave. However, in we also have convex functions, such as this piecewise affine function (see [4]): Setting , it is straightforward to verify that each mean has a continuous extension to , provided by We call the functions with regular and all the others non-regular. Indeed, the associated regular means have an additive character; namely, if then and . On the contrary, if the mean appears multiplicative, that is, and .
Note that no negative connotation should be associated with the denomination non-regular.
Definition 4. For such that , we set It is not difficult to see that this definition associates a non-regular element, , to a regular one, f.
3. Operator Means, Operator Monotone Functions, Quantum Fisher Information: The Petz–Kubo–Ando Theorem
We are now ready to introduce operator means. As previously, let be the set of all complex matrices and be the set of all the self-adjoint elements of . We shall denote general matrices by , while letters will be used for self-adjoint matrices. The Hilbert–Schmidt scalar product will be denoted by , where the adjoint of matrix X is denoted by . Let be the set of strictly positive elements in and let be the set of strictly positive density matrices previously introduced, namely . If not otherwise specified, we shall (from now on) only consider faithful states .
A function is said to be operator monotone (increasing) if, for any and A, such that , the inequality holds. A positive operator monotone function, f, is said to be symmetric if and normalized if .
Definition 5. is the class of functions such that
- (i)
,
- (ii)
for
- (iii)
f is operator monotone.
Note that all the functions in
Section 2 (except for the counterexample in Remark 1) belong to
.
Proposition 2. All the functions in are operator concave.
The Kubo–Ando theory of operator means [
3,
5,
6] can be seen as the matrix version of
Section 2.
Definition 6. A bivariate mean for pairs of positive operators is a functiondefined in and with values in positive definite operators on a Hilbert space and satisfying mutatis mutandis conditions to in Definition 1. In addition, the transformer inequality (see [6])holds for positive definite A,B, and arbitrary Notice that the transformer inequality replaces in Definition 1. We denote the set of matrix means as and let be the set of the Quantum Fisher Information(s).
The fundamental result is as follows.
Theorem 2 (Petz, Kubo, Ando in [
6,
7])
. There are two bijections linking , , and , provided by the following formulas: Let us rephrase the Petz–Kubo–Ando classification theorem: any operator monotone function generates an operator mean , which in turn produces Quantum Fisher Information using the above formulas. There are no other operator means or QFIs; they all come from an , according to the above described procedure.
This explains why it is so interesting to study the structure of : any understanding in this field necessarily provides us with more insight into operator means and Quantum Fisher Information(s).
This is exactly what the correspondence will produce.
4. The Bijection for Operator Monotone Functions
As in
Section 2, we divide the representing functions for operator means into two parts.
Definition 7. For , we define We say that a function is regular if and non-regular if cf. [8,9]. We introduce the sets of regular and non-regular functions,
and notice that, trivially,
is the disjoint union of
and
.
Definition 8. Set cf. [
5].
Theorem 3. The correspondence is a bijection between and .
5. The Inversion Formula and Wigner–Yanase–Dyson Information
Definition 9. Define .
Proposition 3. If g is non-regular then is regular; namely, . Moreover, if and , then The correspondence between the WYD information (see [
10]),
and the Quantum Fisher Information depends on the operator monotonicity of the functions
See [
8,
10,
11] for the existing proofs. Indeed, Proposition 3 provides a new approach to the above result.
The function
is operator monotone and, moreover,
and
is non-regular. The calculations show that
. Therefore, the function
for
Here we provide the first examples of the correspondence (
Table 2).
6. Regular QFI in Terms of Covariance
Quantum covariance is usually defined as
where
and
. The above formula can be written using the arithmetic mean of the left and right multiplication operator as
where
. This simple remark led Petz to the following definition (see [
12]):
Definition 10. For any , define the Quantum f-Covariance as As usual,
. If
, then
which is the above given standard definition for the quantum covariance.
With this generalized notion of Petz covariance, we show that there is an unexpected relation between QFI and the covariance itself.
We stated previously that there exists a natural identification of
with the space of self-adjoint traceless matrices; namely, for any
Moreover, the PKA theorem states that the Quantum Fisher Information(s) are given by the formula
for positive matrices
, where
.
Monotone metrics are usually normalized in such a way that implies .
Remark 2. Let us remember that ; the tangent space in ρ to the state space has a natural orthogonal decomposition in terms of “commuting” and “noncommuting” parts as Due to the Chentsov uniqueness theorem, the different QFI(s) are characterized from what they do on the noncommuting part of the tangent space; namely, on that is on tangent vectors of the form .
We are now ready to state the QFI(s) in terms of covariances.
Theorem 4. Gibilisco, Imparato, and Isola (Proposition 6.3, page 11 in [13]). The above formula has many important consequences.
7. A Look at the Petz–Sudar Theorem
In the PKA classification theorem (Theorem 2 in
Section 3), we see that the QFI is defined only for faithful states (
). It is Petz himself, in collaboration with Sudár, who understood how to define a
radial extension of a QFI to pure states and how to prove that only regular QFIs possess such an extension (for all details, the reader can refer to [
9] or to [
13]). The statement is as follows:
Theorem 5 (Petz and Sudár in [
9])
. A QFI admits a radial extension iff it is regular (). In such a casewhere is the Fubini-study metric on the space of pure states. The fact that the radial limit of
does not depend on
f is an immediate consequence of Theorem 4 in
Section 6.
It is natural to ask, can the Petz–Sudár theorem be generalized and proven using Formula (22)? Here, generalization means using Formula (22) for states that are neither faithful nor pure.
8. Extension of Regular QFI and MASI for Non-Faithful States
A far-reaching generalization of the Wigner–Yanase Skew Information has been proposed by Hansen in [
8].
Definition 11. Metric Adjusted Skew Information (MASI).
For and , set In the case where
, we can see that the MASI coincides with the Wigner–Yanase Skew Information:
Note that recently, using MASI, it has been proven that the Local Quantum Uncertainty (LQU) and the Interferometric Power (IP), which are two important measures of quantum discord, are instances of a family of quantum discords parametrized by the function
. This allows a unified study of the properties of LQU and IP (see [
14]). Due to Theorem 4, we have the following:
It is important to note that the two sides of Equation (
16) are somehow different in nature. The MASI on the left side is defined only for faithful states (
), while the right-hand side always makes good sense since quantum covariance is defined for any state. Therefore, one can look to Equation (
16) as a “definition” of the LHS, which solves the problem of extending the MASI with an approach that is different from the one proposed by Hansen in Theorem 3.8 in [
8]. Motivated by the above consideration, it is natural to introduce the following sesquilinear form, which is the natural extension of MASI for two observables.
Another important remark is that, using the
correspondence, it is possible to establish a relation between MASI and the quasi-entropy
introduced by Petz in [
15];
can be seen as a quantum version of Csiszar
F-entropy in classical statistics and information theory (see [
16]). Indeed, if
, Theorem 3.1 in [
17] proves that
9. Inequalities for the MASI and the Bloch Sphere Case
In this section, we discuss some basic properties of MASI and we will see how the
function appears, for example, as a calculation tool. What follows is the generalization of the work in [
18] that appears in [
19].
(a) If a quantum evolution is given by a Hamiltonian H that commutes with the observable A, then the MASI is a constant of motion. Namely, if we set and , then the function is constant. Since the Quantum Fisher Information contracts under coarse graining, we can see that QFI is a unitary covariant, and this is the crucial ingredient of the proof.
(b) For any MASI, we have:
(c) The constant
is optimal in inequality (
17). Namely, if
, the inequality
is false and a counterexample can be found in the elementary
case, namely on the Bloch sphere.
Let us see how this can be proven by means of the function.
Let be a complete orthonormal base composed of eigenvectors of , and the corresponding eigenvalues. Set , where . Note that the entry of A.
Recall that the Pauli matrices are as follows:
A generic
density matrix in the Stokes parameterization is written as
where
and
. Let
. The eigenvalues of
are
and
.
From this last result, the optimality of the constant follows.
10. The Dynamical Uncertainty Principle
From Equation (
16), one has
This is the case
of the Dynamical Uncertainty Principle, which reads
or equivalency
where
. On the left-hand side, we have the
Generalized Variance of the random vector
. Please note that, in this case, the right-hand side depends on the state-observables’ non-commutativity, and this is strictly related to a non-trivial dynamic induced by the observables according to the Schrödinger equation.
To understand the terminology, recall that the Standard Uncertainty Principle (SUP) in the Robertson version reads
where
is an arbitrary number of observables (self-adjoint matrices) and
is a state. For
, one obtains the Schrödinger uncertainty principle from which the Heisenberg uncertainty principle follows. The bound in the right-hand side depends on the non-commutativity among the observables (see [
20,
21]).
Now, let be odd; in this case the right-hand side is the determinant of an antisymmetric matrix and therefore is zero; for an odd number of observables the SUP does not say anything “quantum”.
Therefore, using the QFI and the
correspondence, a new uncertainty principle has been proven, which is also not trivial for an odd number of observables. Moreover, SUP and DUP have been generalized for an arbitrary
g-covariance; see [
22,
23,
24].
If we set
one can see that (Theorem 4.4 in [
22])
This implies, for example, that we have the biggest bound in the DUP for . Indeed, in this case, for any regular g, and this provides the conclusion.
11. Semplification of Kosaki’s Work
To see how the
correspondence sheds light on certain subjects, consider the paper by Kosaki [
25]. In this paper, the author’s aim is to study how the RHS of the DUP (for
) depends on the function
f. The main result by Kosaki is as follows. Remember that
Let
be fixed and set
The main result in [
25] is Theorem 5, which reads as follows:
is decreasing in (0,1/2),
so that
. The result was the final output of a rather complicated
tour de force of calculations.
Look how simple the approach is using the
correspondence. First of all, it is straightforward that
For
x fixed, the function
is decreasing in (0,1/2) so that
and the Kosaki’s conclusion follows. One should read the complicated proof in [
25] to fully appreciate the efficiency and clarity of the
correspondence.
12. Refinements of Heisenberg Uncertainty Relations
In the literature, several quantities appear with the same aim: to measure
quantum uncertainty. We will discuss some examples in this paper. For example, to quantify such uncertainty Luo introduced the following state-observable quantity,
where
. Furthermore, he was able to prove the following inequality:
Clearly, this can be seen as a refinement of the Heisenberg uncertainty principle because .
After some failed attempt to generalize this result (see Kosaki [
25], Remarks 3.2 and 3.3), Yanagi (see [
26]) was able to prove a generalization that makes sense for the WYD information. He introduced the following quantity,
and was able to prove this inequality:
Note that
where
which is the function associated with the WYD information. It is straightforward to propose an
f-depending quantity,
as a measure of quantum uncertainty and try to prove the following inequality:
Unfortunately, this inequality, in general, is false. Yanagi proved that the theorem holds true under a condition involving ; namely, we have the following:
Proposition 7. For ifthen it holds On the other hand, one can prove the following:
Proposition 8. For any and This has the following as a consequence:
From this, an unconditional inequality follows: if we switch from the constant
to the constant
(see [
27]), we obtain the following:
Proposition 9. For and , it holds that 13. State Quantum Uncertainty Based on MASI
Luo proposed a notion of quantum uncertainty depending only on the state
. In the paper [
28], starting from the Wigner–Yanase information and from an orthonormal basis
, he introduced the quantity
as a measure of such uncertainty. First, Luo proved that
is basis independent, and after that
where
is the spectrum of
. Applications of the function
also appear in paper [
29].
If we remember that the WY information is the QFI associated with the functions
we obtain
The above considerations lead naturally to the following questions:
(i) For a regular
, does the definition
produce a basis-independent function of the state ?
(ii) Imagine that we obtain a positive answer for (i). We may also ask if
These questions both received a positive answer from Cai in their paper [
30]. Once again, the
function shows up when one has to look at a general scheme for Quantum Fisher Information.
14. The Average Coherence of a Quantum State
In a recent paper [
31], Fan, Li, and Luo attempted to study quantum coherence (an important feature of a quantum system) by eliminating the influence of a reference basis. They introduced the average quantum coherence using three procedures: (1) average over all orthonormal basis; (2) average over all elements of operator orthonormal basis; (3) average over a complete family of MUBs (Mutually Unbiased Bases). The result of the paper was that these three different procedures produce the same quantity. The basic ingredient of the proof is the
correspondence.
Indeed, if
is a quantum channel and
are the Kraus operators of
, the authors define a channel-depending coherence as
In the first case, they consider the channel as induced by a von Neumann measurement or equivalently by an orthonormal basis. Averaging on this reference basis is equivalent to integrating over the unitary orbit of a fixed basis, which amounts to using the normalized Haar measure over the unitary group,
, of the system Hilbert space. They set
where
. In the second case, they defined
where
is a family of
operators that constitute an operator orthonormal basis for
, the space of all bounded linear operators on
H. This can be proven to be independent of the chosen basis.
In the third case, they define the coherence averaging on MUBs, which surely exist if d is a power of a prime number.
Finally, they proved the following result:
Theorem 6. For any state ρ of any prime power dimensional system and for any regular operator monotone function f, one has that Note that if are the eigenvalues of the state , then
15. Conclusions
The notion of means has its roots deeply situated in the history of Western mathematics; the Greeks themselves knew eleven different types of means. Still, the subject is currently undergoing strong developments. As an example, starting from the work of Rao [
32,
33] and Prakasa–Rao [
34,
35], the Jensen inequality for numerical and operator means has been proven, and more generalizations seem to be on their way [
4,
36,
37].
The correspondence is indeed a correspondence between means. From the work by Petz, we know that two of the basic objects of Quantum Probability and Quantum Statistics, namely Quantum Covariance and Quantum Fisher Information, are indeed necessarily built on the notion of operator mean, and this explains why we find the manifold of different applications of the correspondence described in this paper.
It is rational to expect that this is not the end of the story and that many other applications will appear in the coming years.
At the moment, the most promising area is that of the extension of the Petz–Kubo–Ando theorem for states that are neither faithful nor pure. We expect this could be performed using the formula
for regular
f. The right-hand side makes sense for any state and, on the other hand, by a continuity-approximation argument the formula is, somehow, forced to be unique; indeed, we can approximate any state by faithful states. A fully satisfying theorem would certainly deduce the scalar product
from first principles, as in the Petz proof of the PKA theorem. At the moment, a similar theorem has not yet been proven.