Abstract
A new canonical divergence is put forward for generalizing an information-geometric measure of complexity for both classical and quantum systems. On the simplex of probability measures, it is proved that the new divergence coincides with the Kullback–Leibler divergence, which is used to quantify how much a probability measure deviates from the non-interacting states that are modeled by exponential families of probabilities. On the space of positive density operators, we prove that the same divergence reduces to the quantum relative entropy, which quantifies many-party correlations of a quantum state from a Gibbs family.
1. Introduction
The many fields of applicability of methods of information geometry to the science of complexity encompass both classical and quantum systems [1]. Among them, an information-geometric approach to the complexity as the extent to which an object, as a whole, is more than its parts was established in [2] and then developed to relate various known measures of complexity to a general class of information-geometric complexity measures (see [3] for a comprehensive overview on this topic). The general idea for quantifying the extent to which the system is more than the sum of its parts is the following. Let be a set of systems; for any system , we assign the collection of system parts which may be an element of a set that formally differs from . The corresponding assignment can be interpreted as a reduced description of the system in terms of its parts. Having the parts , we have to reconstruct by taking the sum of the parts in order to obtain a system that can be compared with the original system. The corresponding construction map is denoted by . The composition
then corresponds to the sum of parts of the system , and we can compare with . It turns out that , under natural conditions, is the projection to the set of non-complex systems [4]. Therefore, the quantification of how much the system differs from is established by a divergence function such that
Finally, the complexity of a system is defined by
Clearly, there are many choices for the divergence , thus such a complexity measure is far from being unique. However, to ensure compatibility with , one has to further assume that satisfies
Here comes the role of a canonical divergence for providing an information-geometric measure of complexity which can be interpreted as unique.
In the framework of information geometry, a dual structure on a smooth manifold is given in terms of a metric tensor and two affine connections, which are dual in the following sense [5]:
where denotes the space of sections on . Eguchi named a function satisfying the property in Equation (1) as a contrast (or divergence) function whenever allows recovering the dual structure on in the following way [6]:
where
and and are local coordinate systems of and , respectively. Here, and are the connection symbols of and , respectively. The investigation on a divergence function allowing to recover the dualistic structure on a smooth manifold is usually referred to as the inverse problem in information geometry. Matumoto [7] showed that such divergence exists for any statistical manifold. However, it is not unique and there are infinitely many divergences that give the same dual structure. Hence, the search for a divergence that can be somehow considered as the most natural is of upmost importance. When a manifold is dually flat, Amari and Nagaoka [5] introduced a Bregman type divergence to this end, with relevant properties concerning the generalized Pythagorean theorem and the geodesic projection theorem. This is referred to as canonical divergence and it is commonly assessed as the natural solution of the inverse problem in information geometry for dually flat manifolds. However, the need for a general canonical divergence, which applies to any dualistic structure, is a very crucial issue, as pointed out in [8]. In any case, such a divergence should recover the canonical divergence of Bregman type if applied to a dually flat structure. In addition, in the self-dual case where coincides with the Levi–Civita connection of , the divergence should be one half of the squared Riemannian distance: [3]. In the context of the information-geometric approach to complexity, a further requirement is needed to ensure the compatibility in Equation (3). This is the geodesic projection property, which, in the present context, states that every minimizer of is achieved by the geodesic projection of onto the set of non-complex systems. In [9], Ay and Amari recently introduced a canonical divergence that satisfies all these requirements. Such a divergence is defined in terms of geodesic integration of the inverse exponential map. More precisely, given and the -geodesic connecting with , the canonical divergence introduced in [9] is given by
Here, denotes the exponential map of , which is defined by whenever the -geodesic , satisfying , exists on an interval of containing . Therefore, if is the -geodesic such that and , then . According to this definition, we have that , where is the -parallel transport from to . This implies that the divergence assumes the following useful expression:
Analogously, the dual function of is defined as the -geodesic integration of the inverse of the -exponential map [9]. Therefore, we have for the dual divergence a similar expression as Equation (7) for the canonical divergence :
where is the -geodesic connecting with . Therefore, the compatibility in Equation (3) of with suggests that the projection of a system onto the space of non-complex systems can be achieved along the geodesic connection with . Actually, it has recently been proved that the -geodesic minimizes the action integral of a suitably chosen kinetic energy [10]. An analogous result holds about the -geodesic. In this way, both divergences, and , turn out to solve the Hamilton–Jacobi problem in information geometry, as put forward in [11].
The search for a general canonical divergence is still an open problem and it turns out to be of upmost importance in the context of the information-geometric approach to complexity (see progresses along this avenue put forward in [9,12]).
In this article, we aim to propose the canonical divergence in Equation (7) as an efficient tool for providing a unified definition of complexity measures. For this reason, we firstly consider on the simplex of probability distributions where a measure of complexity as one instance of Equation (2) is supplied in terms of the Kullback–Leibler (KL)-divergence [4].
The general methods described for defining the complexity measure in Equation (2) can be particularized to the systems consisting of a finite node set and each node can be in finitely many states . Then, we model the whole system as a probability measure on the corresponding product configuration set . The parts are given by marginals where is taken from a set of subsets of , denoted by . Therefore, the decomposition map reads in this case as , whereas the reconstruction map is defined by the maximum entropy estimate of , leading to the projection . The image of turns out to be the closure of an exponential family , which plays the role of the set of non-complex systems. A deviation measure, which is compatible with the maximum entropy projection is then the (KL)-divergence, which is defined by
on the -simplex [6]. Finally, the measure of complexity as one instance of Equation (2) is obtained by
We may notice that, if consists of all subsets of of cardinality , elements of the set of non-complex systems are totally uncorrelated in the sense that has the product form [2]. Consider random variables with joint probability distribution and marginal probability distributions . Then, we have
where is the Shannon entropy. This quantity is referred to as multi information and denoted by . In particular, when , this is nothing but the mutual information. Very remarkably, the minimizer in the closure of of the (KL)-divergence, namely , is obtained by projecting onto the closure of along a mixture -geodesic [13]. This is usually referred to as the geodesic projection property of the (KL)-divergence. The geometric structure given by the Fisher metric, the mixture and exponential affine connections was introduced by Amari and Nagaoka on the space of probability densities for studying statistical estimation problems [5].
In this article, we then consider both divergences, and , on with the endowed dualistic structure given by the classic Fisher metric and the mixture and the exponential connections. Here, we show that . Actually, this result has already been shown in [9]. However, we prove it differently by relying on the nice representations of and given by Equations (7) and (8), respectively. This proves that can be interpreted as a generalization of the (KL)-divergence.
A further step for proving the effectiveness of is to consider it (and its dual function) on the manifold of quantum states where the general idea for defining a complexity measure of a classic system expressed by Equation (2) has been extended to the quantum setting in terms of the quantum relative entropy [14]. More precisely, by considering a composite set of units (or parties, or particles), , the composite system is described by the product algebra . Here, is the -subalgebra of complex matrices such that the identity . The many-party correlations are quantified in the state of a composite quantum system which can not be observed in subsystems composed of less than a given number of parties. In this context, the exponential families, which amount to the non-complex system in the classical case, are replaced by states that are fully described by their restriction to selected subsystems. These correspond to the family of Gibbs states of the -local Hamiltonians . Here, a -local Hamiltonian is defined as a sum of product terms with at most -non-scalar factors , where denotes a real self-adjoint operator. Therefore, the many-party correlations of a composite quantum state which captures all correlations in that cannot be observed in any -party subsystem is the divergence
from the Gibbs family [14]. Here, the divergence is the quantum relative entropy defined by
where denotes the trace operator on the finite-dimensional Hilbert space of density matrices. Similar to the classical case, we can consider the family of Gibbs states whose closure corresponds to the set of product states . Consider then a composite quantum state such that
where and is the identity operator on the product where is missing. In this case, the many-party correlations of is the quantum multi information:
where is the von Neumann entropy of . In particular, when , this corresponds to the quantum mutual information. Algorithms for the evaluation of as a complexity measure for quantum states are studied in [15]. In that context, the many-party correlations is related to the entanglement of quantum systems as defined in [16].
The scope of the present article is mainly to present the canonical divergence defined in Equation (7) as an important tool for generalizing the concept of complexity measure claimed by Equation (10) for classical systems as well as the concept of many-party correlation given by Equation (11) for quantum systems. To this end, we consider the space of density matrices endowed with the quantum analog of the Fisher metric and the mixture and exponential affine connections. This structure turns out to be induced on the manifold of positive density operators by the Bogoliubov inner product [17]. In this setting, we prove that the divergence introduced in [9] reduces to the quantum relative entropy. In addition, we also show that .
The layout of the paper is as follows. Section 2 is devoted to the calculation of the canonical divergence and its dual function on the simplex of probability distributions. In Section 3, we describe the differential geometrical framework for finite quantum systems induced by the Bogoliubov inner product. In this particular framework, we then prove that the divergence given by Equation (7) reduces to the quantum relative entropy. Finally, we draw some conclusions in Section 4 by outlining the results obtained in this work and discussing possible extensions.
2. Canonical Divergence on the Simplex of Probability Measures
A dualistic structure on the simplex of probability measures was introduced by Amari in terms of the Fisher metric, the mixture and exponential connections [18]. Given a finite set , we can represent probability measures on the set as elements of . In this representation, the Dirac measures form the canonical basis of . Then, the -dimensional simplex of probability measure is given by
In this section, we show that the canonical divergence coincides with the Kullback–Leibler divergence whenever . In addition, we prove that, for the dual canonical divergence, the following relation holds true. According to Equations (7) and (8), we need the Fisher metric defined on the tangent bundle , the mixture -geodesic and the exponential -geodesic both connecting with . On the tangent space , the Fisher metric results in
The dualistic structure on , given by the Fisher metric, the -connection and the -connection , is dually flat, and the - and -geodesics connecting with are [3]:
We are now ready to compute the canonical divergence for arbitrary . From Equations (7), (14) and (15), we have that
where we use because . Analogously, we can compute the dual canonical divergence by means of Equation (8). Therefore, by using Equations (14) and (16), we obtain that
To develop further the calculation, let us analyze the derivative . Recall that
Therefore, by taking the derivative of with respect to , we obtain
By stepping back to Equation (18), we start by performing an integration by parts:
where the last term is obtained by noticing that
Since we know that
we can observe that . Hence, we obtain from Equation (19)
because . This proves that
3. Geometric Structure of a Manifold of Quantum States
We start this section by showing that natural analogs of the Fisher metric and the exponential and mixture connections are defined on a manifold of quantum states [17]. To this end, we need to specify an inner product on the space of density operators. Since the divergence of Equation (7) is defined on a statistical manifold with symmetric connections, we choose the Bogoliubov inner product. This is because of a well-known result that claims the -connection induced by a generalized covariance is symmetric if and only if such a covariance is the Bogoliubov inner product [5]. At the end of this section, we motivate this choice in more detail.
Let be a finite-dimensional Hilbert space, be the space of all the Hermitian operators on and be the space of positive density operators on . Since is an open subset of , then it can be naturally seen as a smooth manifold of dimension [17]. Let be a tangent vector at to ; we call its -representation and symbolically write
It is worth noticing that, as an element of the tangent space, can be naturally interpreted as a derivative. As an example, when a coordinate system is given on so that each state is parameterized as , the -representation of the natural basis vector is written as , where . This allows us to introduce the -connection on the manifold of the quantum states in terms of the covariant derivative , which is defined by the following relation:
where the right hand side means the derivative by of and denotes the space of sections on .
To introduce the -connection on , we need to specify a family of inner products on usually named as generalized covariance. For the reason mentioned above, we consider the Bogoliubov inner product, which is given by
Given , we then define the -representation of as the Hermitian operator satisfying the following relation:
For all , we assume ( denotes the identity operator). Thus, we can see that the derivative of the function by is written as
This implies that we can consider the -representation of a given as
Therefore, it turns out that is the derivative of the map from to , which may be written as follows:
By considering , we can immediately observe that
This proves that, although the -representation depends on the choice of the generalized covariance, the space can be simply written as follows
This fact supplies the manifold of quantum states with the -connection. To see this, let us consider the linear isomorphism from to defined by . By writing this correspondence as , , the -connection is then defined by
where the right hand side means the derivative by of .
Finally, we define the inner product on by
which is usually called the quantum Fisher metric. The procedure thus far described endows the manifold of quantum states with a geometric structure given by the quantum Fisher metric, and two torsion-free connections, namely the -connection and the -connection , which are dual with respect to in the following sense:
In addition, the dual structure is dually flat, meaning that the curvature tensors of and are both null.
Suppose that a coordinate system is given and that each element is specified by the coordinate as . According to Equation (20), we have that the mixture representation of is given by , whereas, by Equation (23), we have that the exponential representation of is written as . Therefore, the dual structure with respect to an arbitrary coordinate system reads as follows
A generalized covariance is a family of inner products on the space of Hermitian operators on the Hilbert space , where depends smoothly on for all and that satisfies the following properties:
- For every unitary matrix on the Hilbert space , it is
- If the Lie bracket , then
This can be viewed as a quantum version of the -product
of random variables and with respect to a probability measure . Since is the covariance of and when their expectations vanish, we can call the family satisfying the above conditions a generalized covariance.
According to the theory by Eguchi, a divergence function induces a dual structure on in the way expressed by Equations (4) and (5). It turns out that the connections and obtained in such a way are torsion-free (or symmetric) [13]. To use the canonical divergence in Equation (7) in the quantum setting, we are then forced to select the Bogoliubov inner product for providing the quantum analog of the Fisher metric, the -connection and -connection on the manifold of positive density operators. Indeed, while the -connection is always torsion-free, it turns out that the -connection induced on from a generalized covariance is symmetric if and only if such a covariance is the Bogoliubov inner product.
Canonical Divergence on the Manifold of Quantum States
In this section we show that the divergence function of Equation (7) reduces to the quantum relative entropy whenever the dual structure on is given by the Fisher metric (Equation (28)), the mixture connection (Equation (21)) and the exponential connection (Equation (27)).
Let be two density matrices. To implement the computation of the divergence for quantum states, we consider the -geodesic [19]. Then, the and representations of the tangent vector are easily computed by means of Equations (20) and (25), respectively:
Let us recall that is a curve in the space of density matrices and the logarithm of a positive matrix is a well-defined matrix. Therefore, the derivative with respect to of is viewed as the matrix of the derivatives of the entries of with respect to . Moreover, the same holds for the integration of a matrix: this is the matrix of the integration of the entries. Finally, since the trace is a linear operator it commutes with the integration. Hence, with the abuse of notation where we keep instead of the entry , the computation in Equation (33) is performed as follows by integration by parts:
This proves that , which is the quantum relative entropy given by Equation (12).
The dual divergence of is computed by considering the -geodesic connecting and . Let , where is a self-adjoint Hamiltonian. Then, the -geodesic from to is given by
where and denotes the exponential matrix [19]. Since the trace operator is linear in its argument, it commutes with the derivative operator. Therefore, according to Equations (20) and (25), we obtain that the and representations of are given by
The dual divergence of is written as follows:
To perform the computation in Equation (37), we use the expressions of and given by Equations (35) and (36):
At this point, we can use the linearity of the trace operator and then the latter expression reduces to:
Carrying the integration by parts out, we obtain
where we use . This proves that
4. Conclusions
As we have demonstrated, for a geometric definition of a general complexity measure, it is important to have a canonical divergence. This paper is based on recent progresses in defining a general canonical divergence within Information Geometry [9,12]. This divergence is defined in terms of geodesic integration of the inverse exponential map and holds the geodesic projection property when the structure is dually flat [3]. Let and be a submanifold of , the search for that minimizes the divergence , supplies the solution for defining an information-geometric complexity measure as in Equation (2). When every minimizer of the divergence is given by the geodesic projection of onto , we say that holds the geodesic projection property. In this regard, the canonical divergence in Equation (7) would provide a measure of complexity as Equation (2) for a quite wide range of systems. A further step for defining Equation (2) for general systems has been put forward in [12], where a new divergence is introduced that turns out to be a generalization of the canonical divergence in Equation (7). As an example of Equation (2), we have considered the measure of complexity given by Equation (10), which quantifies how much a probability measure on the product configuration set of the finitely many states on a discrete set deviates from a family of exponential probabilities that amounts to the non-complex set of system states, as it is given by non-interacting states [2]. In this case, the Kullback–Leibler divergence turns out to be suitable for providing the measure of complexity in Equation (2) for classic states on discrete sets [4]. To put the theory of Ay [2] in perspective and propose the canonical divergence in Equation (7) as suitable for supplying the complexity in Equation (2) on general systems, we have then proved that coincides with the (KL)-divergence on the simplex of probability measures endowed with the dual structure given by the Fisher metric and the mixture and exponential connections.
The quantum counterpart of the general theory yielding the measure of complexity in Equation (2) does not yet exist. However, a quantum analog of Equation (10) has been established on the manifold of positive density operators [14]. Here, the family of non-interacting states is replaced by states that are fully described by their restriction to selected subsystems that turn out to be a family of Gibbs states. Therefore, many-party correlations are quantified in the state of composite quantum system, which cannot be observed in subsystems composed of fewer than a given number of parties. The suitable tool for providing such a quantification is established by the quantum relative entropy. This is because the maximum-entropy principle solves the inverse problem to reconstruct a global state from subsystem states and it also gives a natural scale of many-party correlation in terms of the gap to the maximal entropy value. Hence, the many-party correlation of a quantum state is quantified by the divergence from a family of Gibbs state. The many-party correlation in Equation (11) has been implemented in algorithms [15] proving to be related to the entanglement of quantum systems as defined in [16]. To consider the canonical divergence in Equation (7) as an efficient tool for extending the general theory leading to Equation (2), we have considered on the manifold of positive density operators with the quantum analog of the Fisher metric and , connections induced by the Bogoliubov inner product. We have finally proved that the canonical divergence coincides with the quantum relative entropy.
Author Contributions
The authors have equally contributed to the manuscript. They have all read and approved its final version.
Funding
This research received no external funding.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Felice, D.; Cafaro, C.; Mancini, S. Information geometric methods for complexity. Chaos 2018, 28, 032101. [Google Scholar] [CrossRef] [PubMed]
- Ay, N. Information geometry on complexity and stochastic interaction. Entropy 2015, 17, 2432–2458. [Google Scholar] [CrossRef]
- Ay, N.; Jost, J.; Van Le, H.; Schwachhöfer, L. Information Geometry, 1st ed.; Springer International Publishing: Cham, Switzerland, 2017. [Google Scholar]
- Ay, N.; Olbrich, E.; Bertschinger, N.; Jost, J. A geometric approach to complexity. Chaos 2011, 21, 037103. [Google Scholar] [CrossRef] [PubMed]
- Amari, S.-I.; Nagaoka, H. Methods of Information Geometry; Oxford University Press: Oxford, UK, 2000. [Google Scholar]
- Eguchi, S. A differential geometric approach to statistical inference on the basis of contrast functions. Hiroshima Math. J. 1985, 15, 341–391. [Google Scholar] [CrossRef]
- Matumoto, T. Any statistical manifold has a contrast function—On the C3-functions taking the minimum at the diagonal of the product manifold. Hiroshima Math. J. 1993, 23, 327–337. [Google Scholar] [CrossRef]
- Ay, N.; Tuschmann, W. Duality versus dual flatness in quantum information geometry. J. Math. Phys. 2003, 44, 1512–1518. [Google Scholar] [CrossRef]
- Ay, N.; Amari, S.-I. A Novel Approach to Canonical Divergences within Information Geometry. Entropy 2015, 17, 8111–8129. [Google Scholar] [CrossRef]
- Felice, D.; Ay, N. Dynamical Systems induced by Canonical Divergence in dually flat manifolds. arXiv 2018, arXiv:1812.04461. [Google Scholar]
- Ciaglia, F.; Di Cosmo, F.; Felice, D.; Mancini, S.; Marmo, G.; Pérez-Pardo, J.M. Hamilton-Jacobi approach to potential functions in information geometry. J. Math. Phys. 2017, 58, 063506. [Google Scholar] [CrossRef]
- Felice, D.; Ay, N. Towards a canonical divergence within Information Geometry. arXiv 2018, arXiv:1806.11363. [Google Scholar]
- Eguchi, S. Geometry of minimum contrast. Hiroshima Math. J. 1992, 22, 631–647. [Google Scholar] [CrossRef]
- Weis, S.; Knauf, A.; Ay, N.; Zhao, M.J. Maximizing the divergence from a hierarchical model of quantum states. Open Syst. Inf. Dyn. 2015, 22, 1550006. [Google Scholar] [CrossRef]
- Niekamp, S.; Galla, T.; Kleinmann, M.; Gühne, O. Computing complexity measures for quantum states based on exponential families. J. Phys. A Math. Theor. 2013, 46, 125301. [Google Scholar] [CrossRef]
- Vedral, V.; Plenio, M.B.; Rippin, M.A.; Knight, P.L. Quantifying entanglement. Phys. Rev. Lett. 1997, 78, 2275–2279. [Google Scholar] [CrossRef]
- Nagaoka, H. Differential Geometrical Aspects of Quantum State Estimation and Relative Entropy. In Quantum Communications and Measurement; Belavkin, V.P., Hirota, O., Hudson, R.L., Eds.; Springer: Boston, MA, USA, 1995. [Google Scholar]
- Amari, S. Differential geometry of curved exponential families-curvatures and information loss. Ann. Stat. 1985, 10, 357–387. [Google Scholar] [CrossRef]
- Petz, D. Quantum Information Theory and Quantum Statistics; Springer: Berlin/Heidelberg, Germany, 2008. [Google Scholar]
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).