1. Introduction
The many fields of applicability of methods of information geometry to the science of complexity encompass both classical and quantum systems [
1]. Among them, an information-geometric approach to the complexity as the extent to which an object, as a whole, is more than its parts was established in [
2] and then developed to relate various known measures of complexity to a general class of information-geometric complexity measures (see [
3] for a comprehensive overview on this topic). The general idea for quantifying the extent to which the system is more than the sum of its parts is the following. Let
be a set of systems; for any system
, we assign the collection of system parts which may be an element of a set
that formally differs from
. The corresponding assignment
can be interpreted as a reduced description of the system
in terms of its parts. Having the parts
, we have to reconstruct
by taking the sum of the parts in order to obtain a system that can be compared with the original system. The corresponding construction map is denoted by
. The composition
then corresponds to the sum of parts of the system
, and we can compare
with
. It turns out that
, under natural conditions, is the projection
to the set of non-complex systems
[
4]. Therefore, the quantification of how much the system
differs from
is established by a divergence function
such that
Finally, the complexity of a system
is defined by
Clearly, there are many choices for the divergence
, thus such a complexity measure is far from being unique. However, to ensure compatibility with
, one has to further assume that
satisfies
Here comes the role of a canonical divergence for providing an information-geometric measure of complexity which can be interpreted as unique.
In the framework of information geometry, a dual structure
on a smooth manifold
is given in terms of a metric tensor and two affine connections, which are dual in the following sense [
5]:
where
denotes the space of sections on
. Eguchi named a function
satisfying the property in Equation (
1) as a contrast (or divergence) function whenever
allows recovering the dual structure
on
in the following way [
6]:
where
and
and
are local coordinate systems of
and
, respectively. Here,
and
are the connection symbols of
and
, respectively. The investigation on a divergence function allowing to recover the dualistic structure on a smooth manifold is usually referred to as the inverse problem in information geometry. Matumoto [
7] showed that such divergence exists for any statistical manifold. However, it is not unique and there are infinitely many divergences that give the same dual structure. Hence, the search for a divergence that can be somehow considered as
the most natural is of upmost importance. When a manifold is dually flat, Amari and Nagaoka [
5] introduced a Bregman type divergence to this end, with relevant properties concerning the generalized Pythagorean theorem and the geodesic projection theorem. This is referred to as
canonical divergence and it is commonly assessed as the natural solution of the inverse problem in information geometry for dually flat manifolds. However, the need for a general canonical divergence, which applies to any dualistic structure, is a very crucial issue, as pointed out in [
8]. In any case, such a divergence should recover the canonical divergence of Bregman type if applied to a dually flat structure. In addition, in the self-dual case where
coincides with the Levi–Civita connection of
, the divergence
should be one half of the squared Riemannian distance:
[
3]. In the context of the information-geometric approach to complexity, a further requirement is needed to ensure the compatibility in Equation (
3). This is the geodesic projection property, which, in the present context, states that every minimizer
of
is achieved by the geodesic projection of
onto the set of non-complex systems. In [
9], Ay and Amari recently introduced a canonical divergence that satisfies all these requirements. Such a divergence is defined in terms of geodesic integration of the inverse exponential map. More precisely, given
and the
-geodesic
connecting
with
, the canonical divergence introduced in [
9] is given by
Here,
denotes the exponential map of
, which is defined by
whenever the
-geodesic
, satisfying
, exists on an interval of
containing
. Therefore, if
is the
-geodesic such that
and
, then
. According to this definition, we have that
, where
is the
-parallel transport from
to
. This implies that the divergence
assumes the following useful expression:
Analogously, the dual function of
is defined as the
-geodesic integration of the inverse of the
-exponential map [
9]. Therefore, we have for the dual divergence
a similar expression as Equation (
7) for the canonical divergence
:
where
is the
-geodesic connecting
with
. Therefore, the compatibility in Equation (
3) of
with
suggests that the projection
of a system
onto the space of non-complex systems can be achieved along the geodesic connection
with
. Actually, it has recently been proved that the
-geodesic minimizes the action integral of a suitably chosen kinetic energy [
10]. An analogous result holds about the
-geodesic. In this way, both divergences,
and
, turn out to solve the Hamilton–Jacobi problem in information geometry, as put forward in [
11].
The search for a general canonical divergence is still an open problem and it turns out to be of upmost importance in the context of the information-geometric approach to complexity (see progresses along this avenue put forward in [
9,
12]).
In this article, we aim to propose the canonical divergence in Equation (
7) as an efficient tool for providing a unified definition of complexity measures. For this reason, we firstly consider
on the simplex of probability distributions where a measure of complexity as one instance of Equation (
2) is supplied in terms of the Kullback–Leibler (KL)-divergence [
4].
The general methods described for defining the complexity measure in Equation (
2) can be particularized to the systems consisting of a finite node set
and each node
can be in finitely many states
. Then, we model the whole system as a probability measure
on the corresponding product configuration set
. The parts are given by marginals
where
is taken from a set of subsets of
, denoted by
. Therefore, the decomposition map
reads in this case as
, whereas the reconstruction map
is defined by the maximum entropy estimate
of
, leading to the projection
. The image of
turns out to be the closure of an exponential family
, which plays the role of the set
of non-complex systems. A deviation measure, which is compatible with the maximum entropy projection
is then the (KL)-divergence, which is defined by
on the
-simplex
[
6]. Finally, the measure of complexity as one instance of Equation (
2) is obtained by
We may notice that, if
consists of all subsets of
of cardinality
, elements of the set
of non-complex systems are totally uncorrelated in the sense that
has the product form
[
2]. Consider random variables
with joint probability distribution
and marginal probability distributions
. Then, we have
where
is the Shannon entropy. This quantity is referred to as
multi information and denoted by
. In particular, when
, this is nothing but the
mutual information. Very remarkably, the minimizer
in the closure of
of the (KL)-divergence, namely
, is obtained by projecting
onto the closure of
along a mixture
-geodesic [
13]. This is usually referred to as the geodesic projection property of the (KL)-divergence. The geometric structure given by the Fisher metric, the mixture
and exponential
affine connections was introduced by Amari and Nagaoka on the space of probability densities for studying statistical estimation problems [
5].
In this article, we then consider both divergences,
and
, on
with the endowed dualistic structure given by the classic Fisher metric and the mixture
and the exponential
connections. Here, we show that
. Actually, this result has already been shown in [
9]. However, we prove it differently by relying on the nice representations of
and
given by Equations (
7) and (
8), respectively. This proves that
can be interpreted as a generalization of the (KL)-divergence.
A further step for proving the effectiveness of
is to consider it (and its dual function) on the manifold of quantum states where the general idea for defining a complexity measure of a classic system expressed by Equation (
2) has been extended to the quantum setting in terms of the quantum relative entropy [
14]. More precisely, by considering a composite set of
units (or parties, or particles),
, the composite system is described by the product algebra
. Here,
is the
-subalgebra of complex
matrices such that the identity
. The many-party correlations are quantified in the state of a composite quantum system which can not be observed in subsystems composed of less than a given number of parties. In this context, the exponential families, which amount to the non-complex system in the classical case, are replaced by states that are fully described by their restriction to selected subsystems. These correspond to the family of Gibbs states
of the
-local Hamiltonians
. Here, a
-local Hamiltonian is defined as a sum of product terms
with at most
-non-scalar factors
, where
denotes a real self-adjoint operator. Therefore, the many-party correlations of a composite quantum state
which captures all correlations in
that cannot be observed in any
-party subsystem is the divergence
from the Gibbs family
[
14]. Here, the divergence
is the quantum relative entropy defined by
where
denotes the trace operator on the finite-dimensional Hilbert space of density matrices. Similar to the classical case, we can consider the family
of Gibbs states whose closure corresponds to the set of product states
. Consider then a composite quantum state
such that
where
and
is the identity operator on the product
where
is missing. In this case, the many-party correlations of
is the
quantum multi information:
where
is the von Neumann entropy of
. In particular, when
, this corresponds to the
quantum mutual information. Algorithms for the evaluation of
as a complexity measure for quantum states are studied in [
15]. In that context, the many-party correlations is related to the entanglement of quantum systems as defined in [
16].
The scope of the present article is mainly to present the canonical divergence
defined in Equation (
7) as an important tool for generalizing the concept of complexity measure claimed by Equation (
10) for classical systems as well as the concept of many-party correlation given by Equation (
11) for quantum systems. To this end, we consider the space of density matrices endowed with the quantum analog of the Fisher metric and the mixture
and exponential
affine connections. This structure turns out to be induced on the manifold of positive density operators by the Bogoliubov inner product [
17]. In this setting, we prove that the divergence introduced in [
9] reduces to the quantum relative entropy. In addition, we also show that
.
The layout of the paper is as follows.
Section 2 is devoted to the calculation of the canonical divergence and its dual function on the simplex of probability distributions. In
Section 3, we describe the differential geometrical framework for finite quantum systems induced by the Bogoliubov inner product. In this particular framework, we then prove that the divergence given by Equation (
7) reduces to the quantum relative entropy. Finally, we draw some conclusions in
Section 4 by outlining the results obtained in this work and discussing possible extensions.
3. Geometric Structure of a Manifold of Quantum States
We start this section by showing that natural analogs of the Fisher metric and the exponential and mixture connections are defined on a manifold of quantum states [
17]. To this end, we need to specify an inner product on the space of density operators. Since the divergence
of Equation (
7) is defined on a statistical manifold
with symmetric connections, we choose the Bogoliubov inner product. This is because of a well-known result that claims the
-connection induced by a generalized covariance is symmetric if and only if such a covariance is the Bogoliubov inner product [
5]. At the end of this section, we motivate this choice in more detail.
Let
be a finite-dimensional Hilbert space,
be the space of all the Hermitian operators on
and
be the space of positive density operators on
. Since
is an open subset of
, then it can be naturally seen as a smooth manifold of dimension
[
17]. Let
be a tangent vector at
to
; we call
its
-representation and symbolically write
It is worth noticing that, as an element of the tangent space,
can be naturally interpreted as a derivative. As an example, when a coordinate system
is given on
so that each state is parameterized as
, the
-representation of the natural basis vector is written as
, where
. This allows us to introduce the
-connection on the manifold
of the quantum states in terms of the covariant derivative
, which is defined by the following relation:
where the right hand side means the derivative by
of
and
denotes the space of sections on
.
To introduce the
-connection on
, we need to specify a family
of inner products on
usually named as
generalized covariance. For the reason mentioned above, we consider the Bogoliubov inner product, which is given by
Given
, we then define the
-representation of
as the Hermitian operator
satisfying the following relation:
For all
, we assume
(
denotes the identity operator). Thus, we can see that the derivative of the function
by
is written as
This implies that we can consider the
-representation
of a given
as
Therefore, it turns out that
is the derivative of the map
from
to
, which may be written as follows:
By considering
, we can immediately observe that
This proves that, although the
-representation depends on the choice of the generalized covariance, the space
can be simply written as follows
This fact supplies the manifold
of quantum states with the
-connection. To see this, let us consider the linear isomorphism
from
to
defined by
. By writing this correspondence as
,
, the
-connection
is then defined by
where the right hand side means the derivative by
of
.
Finally, we define the inner product
on
by
which is usually called the
quantum Fisher metric. The procedure thus far described endows the manifold
of quantum states with a geometric structure
given by the quantum Fisher metric, and two torsion-free connections, namely the
-connection
and the
-connection
, which are dual with respect to
in the following sense:
In addition, the dual structure is dually flat, meaning that the curvature tensors of and are both null.
Suppose that a coordinate system
is given and that each element
is specified by the coordinate
as
. According to Equation (
20), we have that the mixture representation
of
is given by
, whereas, by Equation (
23), we have that the exponential representation
of
is written as
. Therefore, the dual structure
with respect to an arbitrary coordinate system
reads as follows
A generalized covariance is a family of inner products on the space of Hermitian operators on the Hilbert space , where depends smoothly on for all and that satisfies the following properties:
For every
unitary matrix on the Hilbert space
, it is
If the Lie bracket
, then
This can be viewed as a quantum version of the
-product
of random variables
and
with respect to a probability measure
. Since
is the covariance of
and
when their expectations vanish, we can call the family
satisfying the above conditions a
generalized covariance.
According to the theory by Eguchi, a divergence function
induces a dual structure
on
in the way expressed by Equations (4) and (
5). It turns out that the connections
and
obtained in such a way are torsion-free (or symmetric) [
13]. To use the canonical divergence in Equation (
7) in the quantum setting, we are then forced to select the Bogoliubov inner product for providing the quantum analog of the Fisher metric, the
-connection and
-connection on the manifold of positive density operators. Indeed, while the
-connection is always torsion-free, it turns out that the
-connection induced on
from a generalized covariance is symmetric if and only if such a covariance is the Bogoliubov inner product.
Canonical Divergence on the Manifold of Quantum States
In this section we show that the divergence function of Equation (
7) reduces to the quantum relative entropy whenever the dual structure
on
is given by the Fisher metric (Equation (
28)), the mixture connection (Equation (
21)) and the exponential connection (Equation (
27)).
Let
be two density matrices. To implement the computation of the divergence
for quantum states, we consider the
-geodesic
[
19]. Then, the
and
representations of the tangent vector
are easily computed by means of Equations (
20) and (
25), respectively:
From Equations (
7) and (
28), we have then
Let us recall that
is a curve in the space of density matrices and the logarithm of a positive matrix is a well-defined matrix. Therefore, the derivative with respect to
of
is viewed as the matrix of the derivatives of the entries of
with respect to
. Moreover, the same holds for the integration of a matrix: this is the matrix of the integration of the entries. Finally, since the trace is a linear operator it commutes with the integration. Hence, with the abuse of notation where we keep
instead of the entry
, the computation in Equation (
33) is performed as follows by integration by parts:
This proves that
, which is the quantum relative entropy given by Equation (
12).
The dual divergence of
is computed by considering the
-geodesic connecting
and
. Let
, where
is a self-adjoint Hamiltonian. Then, the
-geodesic from
to
is given by
where
and
denotes the exponential matrix [
19]. Since the trace operator is linear in its argument, it commutes with the derivative operator. Therefore, according to Equations (
20) and (
25), we obtain that the
and
representations of
are given by
The dual divergence of
is written as follows:
To perform the computation in Equation (
37), we use the expressions of
and
given by Equations (
35) and (
36):
At this point, we can use the linearity of the trace operator and then the latter expression reduces to:
Carrying the integration by parts out, we obtain
where we use
. This proves that
4. Conclusions
As we have demonstrated, for a geometric definition of a general complexity measure, it is important to have a canonical divergence. This paper is based on recent progresses in defining a general canonical divergence within Information Geometry [
9,
12]. This divergence is defined in terms of geodesic integration of the inverse exponential map and holds the geodesic projection property when the structure
is dually flat [
3]. Let
and
be a submanifold of
, the search for
that minimizes the divergence
, supplies the solution for defining an information-geometric complexity measure as in Equation (
2). When every minimizer
of the divergence
is given by the geodesic projection of
onto
, we say that
holds the geodesic projection property. In this regard, the canonical divergence in Equation (
7) would provide a measure of complexity as Equation (
2) for a quite wide range of systems. A further step for defining Equation (
2) for general systems has been put forward in [
12], where a new divergence is introduced that turns out to be a generalization of the canonical divergence in Equation (
7). As an example of Equation (
2), we have considered the measure of complexity given by Equation (
10), which quantifies how much a probability measure on the product configuration set of the finitely many states on a discrete set
deviates from a family of exponential probabilities that amounts to the non-complex set of system states, as it is given by non-interacting states [
2]. In this case, the Kullback–Leibler divergence turns out to be suitable for providing the measure of complexity in Equation (
2) for classic states on discrete sets [
4]. To put the theory of Ay [
2] in perspective and propose the canonical divergence in Equation (
7) as suitable for supplying the complexity in Equation (
2) on general systems, we have then proved that
coincides with the (KL)-divergence on the simplex of probability measures endowed with the dual structure given by the Fisher metric and the mixture and exponential connections.
The quantum counterpart of the general theory yielding the measure of complexity in Equation (
2) does not yet exist. However, a quantum analog of Equation (
10) has been established on the manifold of positive density operators [
14]. Here, the family of non-interacting states is replaced by states that are fully described by their restriction to selected subsystems that turn out to be a family of Gibbs states. Therefore, many-party correlations are quantified in the state of composite quantum system, which cannot be observed in subsystems composed of fewer than a given number of parties. The suitable tool for providing such a quantification is established by the quantum relative entropy. This is because the maximum-entropy principle solves the inverse problem to reconstruct a global state from subsystem states and it also gives a natural scale of many-party correlation in terms of the gap to the maximal entropy value. Hence, the many-party correlation of a quantum state is quantified by the divergence from a family of Gibbs state. The many-party correlation in Equation (
11) has been implemented in algorithms [
15] proving to be related to the entanglement of quantum systems as defined in [
16]. To consider the canonical divergence in Equation (
7) as an efficient tool for extending the general theory leading to Equation (
2), we have considered
on the manifold of positive density operators with the quantum analog of the Fisher metric and
,
connections induced by the Bogoliubov inner product. We have finally proved that the canonical divergence coincides with the quantum relative entropy.