1. Introduction
Information geometry originally dealt with parameter spaces of families of probability densities viewed as differentiable manifolds [
1,
2]. More specifically, let
E be a measure space and let
be a parameterized family of densities on
E satisfying:
M is a topological manifold (in most cases, it is simply an open subset of ).
The topology of S induced by the norm is compatible with the topology of M.
It exists a probability measure on E such that for any , .
is smooth uniformly in x.
.
The moments up to order 3 of exist and are smooth.
The matrix F with entries is positive definite.
The last assumption allows to endow
M with the structure of a Riemannian manifold with metric:
Parameterized families of the so-called exponential type, the densities of which can be written as:
play a special role in statistics and have a well behaved Riemannian structure. When
, the family is said to be natural and is defined entirely by
. In such a case, the Fisher information matrix takes the form:
so that the Riemannian metric is Hessian. The structure of such manifolds has been thoroughly studied in [
3]. Finally, from considerations arising in statistical estimation, a pair of dual connections
with respect to the Fisher metric can be constructed [
4]. They possess vanishing torsion and are related by the skewness tensor:
with:
As a generalization, a smooth Riemannian manifold
equipped with a pair
of torsionless dual connections is called a statistical manifold. It can be defined equivalently by
where
T is a fully symmetric
-tensor. It turns out [
5] that any statistical manifold can be embedded as a statistical model, i.e., one related to a parameterized family of densities.
For a Riemannian manifold
, lifting geometric objects to the tangent bundle
(resp. cotangent bundle
) is a classical problem [
6,
7,
8] that relies most of the time on the Whitney sum
(resp.
) with
the vertical bundled obtained from the kernel of the canonical projection
(resp.
and
the horizontal subspace arising from a fixed affine connection ∇. In the tangent bundle, Reference [
8] introduces a lift based on horizontal and vertical lifts of vector fields and relies on a quasi-complex structure on
. For
, the preferred method involves complete lifts [
9] and Riemann extensions [
10], which are pseudo-Riemannian metrics of neutral signature defined on the cotangent bundle and associated in a canonical way to affine connections with vanishing torsion. The complete lift of the connection is defined to be the Levi-Civita one with respect to its Riemann extension. Complete and vertical lifts of different kinds of tensors are also presented in [
6]. Finally, horizontal lifts of connections are presented in [
7].
In this paper, the complete lift of dual connections is defined and yields a pair of dual connections, which have vanishing torsion if the original connections do. The strategy adopted is to lift the skewness tensor, here defined in a more general setting as a
-tensor with cyclic symmetry. The procedure described in [
6] is adapted to this case, effectively allowing to get a skewness tensor on
. The first step is the symmetric lift presented in Definition 6, which yields a
-tensor on
. Applied to the skewness tensor of a pair of dual connections, it yields the mutual torsion of the lifted dual connections. The action of this tensor on vertical and complete lifts of vector fields is given in Proposition 14 and Equation (
38). Finally, lifted connections are studied with respect to vertical and complete lifts of forms and vector fields, showing their relationship with mutual curvatures of original dual connections. For a couple of fields
solutions of the equation
, the action of a lifted connection on the complete lifts
can be simplified, allowing in a future work a projective limit to be defined.
2. Statistical Structures
In information geometry, dual connections are the basic objects defining the so-called statistical manifold structure [
4]. The purpose of this section is to recap some properties of statistical structures and to introduce the
-skewness tensor in the context of dual connections with or without torsion. It has the cyclic symmetry property, which is exactly what is needed to lift it as a
-tensor on the cotangent bundle.
In the sequel, M is a smooth n-dimensional manifold endowed with a Riemann metric g.
Definition 1. Let be affine connections on . They are said to be dual if for any triple of vector fields: The torsion of a connection ∇ is the tensor T defined as: . If the torsion of T vanishes, so does the torsion of .
Proposition 1. Let be dual connections. Then for any triple : Proof. The covariant derivative of the tensor
g is given, for any triple
of vector fields by:
Exchanging ∇ and
, one also has:
Proposition 2. If are without torsion then the tensor:is fully symmetric. Proof. Symmetry in
is obvious from the symmetry of
g, while symmetry in Z is obtained from the relations:
□
Definition 2. Let be affine connections on . Their mutual torsion is the tensor: Remark 1. The divergence tensor is defined for dual connections as . For torsion-less connections, the two notions agree, i.e., .
In the case of dual connections with vanishing torsion, the commutation defect of the divergence is related to the mutual curvature of the connections.
Definition 3. Let be a pair of connections. Their mutual curvature is the tensor -tensor: As in the case of the curvature, it is often useful to introduce the
-tensor:
The curvature and the mutual curvature of dual connections enjoy symmetry properties.
Proposition 3. Let be a pair of dual connections. Then, for any vector fields ; Proof. The proof of the first property is found in, for example, [
4]. For the second, the definition of
is written as:
Using the duality property:
Using duality once again:
□
In the case of dual connections without torsion, the definition of simplifies to . Letting , the next proposition relates the commutation defect to the curvatures.
Proposition 4. For any vector fields : Proof. By simple computation:
and the claims follows by identification of the terms. □
Proposition 5. Let be dual affine connections on . Then, for any triple of vector fields:where is the Levi-Civita connection. Proof. Since the two connections are dual:
Using the definition of
it becomes:
Then, using an alternating sum over the cyclic permutations of
and the Koszul formula:
yields the result. □
Remark 2. Proposition 5 is the analogue of the Kozsul formula for dual connections. It is a defining property given .
Notation 1. The -tensor:is the skewness tensor associated to the connections . When no confusion is possible in the case of dual connections, the subscripts will be dropped so that stands for Remark 3. The formula of Proposition 5 can be rewritten to give the expression of : Proposition 6. For any triple :where T is the torsion of ∇. Proof. Using the definition:
and the fact that the Levi-Civita has vanishing torsion:
Proposition 7. The tensor U has the cyclic symmetry propery, that is for any triple of vector fields: Proof. Using the symmetry of the Riemann metric, the same derivation as in Proposition 5 but applied to the terms
yields:
By identification it becomes:
□
Proposition 8. Let U be a tensor with cyclic symmetry, then the connections defined by:are dual Proof. For any triple
of vector fields:
Under the assumption of Equation (
10), it becomes:
and since
U has cyclic symmetry:
□
Proposition 9. Let be a pair of affine connections. For any triple of vector fields: Proof. Direct computation from the definition of U. □
Remark 4. Proposition 9 shows that the mutual torsion of a pair of dual connections is uniquely defined by a cyclic symmetric tensor. Conversely, for a pair of connections, the cyclic symmetry defect of the tensor , namely is the obstruction of being dual. Please note also that the torsion for a pair of dual connections can be seen as the obstruction for the tensor U to be totally symmetric.
Remark 5. A statistical manifold may be defined as a quadruple with M a smooth manifold, g a Riemannian metric, ∇ an affine connection and U a tensor with cyclic symmetry. It slightly more general than the usual definition since U is not required to be totally symmetric, thus allowing connections with torsion.
3. Dual Connections Lifts
Let U be a coordinate neighborhood in M and let be the canonical projection. is a coordinate neighborhood in with coordinates denoted as .
The lift of connections on the cotangent bundle has been studied in [
6,
7] using the Riemann extension defined in [
10]. Another kind of lift is introduced in [
11] along with a metric on
. Let
be a smooth Riemannian manifold and let ∇ be an affine connection. The kernel of
defines an integrable distribution, called the vertical distribution, hereafter denoted by
. It is spanned by the vectors:
Complementary to it, there is a horizontal distribution spanned by the vectors:
with:
These basis vectors are conveniently put into a matrix form, following the convention of [
11]:
where
is the matrix with entries:
Definition 4. The Riemannian extension of a torsion-free affine connection ∇ on is the symmetric -tensor with a component matrix:where Γ is the matrix defined in (16). Proposition 10. Let ∇
be a torsion-free affine connection on M and let be its adapted frame in . With respect to it, the component matrix of the Riemannian extension is: Proof. In the adapted frame, the expression of the component matrix of the Riemannian extension is:
which is equal to:
using the assumption that ∇ is torsion-free,
and the claim follows. □
Definition 5. The Levi-Civita connection with respect to the Riemannian extension, denoted by , is called the complete lift of the connection ∇.
Proposition 11. The Christoffel symbols of the complete lift are given by: When , the torsion-free assumption is automatically satisfied, so that in an adapted frame the Riemannian extension reduces to the one of Proposition 10.
Proposition 12. Let be a pair of dual affine connections on . Then, with respect to the Riemannian extension of , the following relations hold:where is the matrix with entries:and L (resp. ) is the component matrix of the adapted frame to ∇
(resp. ). Proof. In the case of dual connections, Equation (
12) yields:
and so:
where
. From (
20) (resp. (21)), it comes:
The other equations are proved the same way. □
The above relations show that the horizontal subspaces of ∇ and
are related by the Riemannian extension in a very simple way. Let
be a vector in
with decomposition
(resp.
) according to the horizontal subspace of ∇ (resp.
), then:
with
the Euclidean inner product.
Another interesting fact is that with respect to the adapted frames of ∇ (resp. ∇), the Riemannian extension becomes a modified Riemannian extension in the sense of [
12]. To a given modified Riemannian extension, it is thus possible to associate a pair of dual connections with a given torsion (this last restriction comes from the fact that only the symmetric part of the tensor
D enters the expression).
Since duality is related to metric, it is not so obvious how to lift a pair of mutually dual connections in a canonical way since the complete lifts of ∇ and
involve different Riemannian extensions. The preferred approach will be thus to lift the mutual torsion
D to a
-tensor, which can be done by extending the approach of [
6], and to exploit the fact that it has a cyclic symmetry property.
In the sequel, the symmetric (resp. anti-symmetric) part with respect to the contravariant indices of the
-tensor
D will be denoted by
(resp.
), i.e.,:
Proposition 13. The expression:defines a 2-form on . Its exterior derivative is given by: Rearranging the terms, the form
can be rewritten as:
It turns out that the above tensor has cyclic symmetry since it is and skew-symmetric. This can be made more explicit by first noticing that the first line on the right hand side obviously has this property. In the second line, considering as an example the first term , a cyclic permutation of the arguments yields . Now, the indices change gives , which is exactly the original second term. The remaining terms can be worked the same way.
Considering now the symmetric part of
D, a similar procedure can applied to obtain a fully symmetric
-tensor. Let us denote by ⊙ the symmetric tensor product, that is:
From
, a symmetric tensor on
can be defined as:
Following the construction of Proposition 13 and the formula of [
13], a fully symmetric lift can be defined.
Definition 6. The symmetric lift of is the -tensor with components: Gathering things together, both the symmetric and the anti-symmetric part of
D can be lifted to a cyclic symmetric
-tensor. In the sequel, the notation of [
6] is adopted: Latin letters
refer to
x components, overlined letters
refers to
p components and capital letters can be used for both. As an example,
.
Definition 7. The cyclic symmetric complete lift of the -tensor D, denoted , is the -tensor with components : Let
be the Riemann extension of the Levi-Civita connection. From
, the complete lift of
D can be defined as the
-tensor
such that for any triple of vector fields:
Given the matrix form of the Riemannian extension for the Levi-Civita connection:
its inverse is readily obtained as:
The components of
in coordinates can be obtained by composing the matrix
A, yielding:
with the notation
. Please note that the above relations are different from the one given in [
6] for the complete lift of a skew-symmetric
-tensor since here the Riemann extension is used in place of the canonical
-tensor
and only the cyclic symmetry is assumed. This last fact can be noticed in the third and fourth lines of Equation (
28).
The next definitions are recalled for the sake of completeness.
Definition 8. Let be a degree 1 differential form. Its vertical lift to is the vector field: The vector fields admit both a vertical and a complete lift. Only the later will be used here.
Definition 9. Let be a vector field on M. Its complete lift to is the vector field: Finally -tensors can be lifted in a quite obvious way:
Definition 10. Let F be a -tensor field. Its vertical lift to is the vector field: The action of on vertical and complete lift can now be obtained.
Proposition 14. Let X be a vector field and be 1-forms. Then:where (resp. ) is the -tensor defined by: (resp. ). Proof. Let
. Then
. Let
X be vector field and
its complete lift. By linearity:
Since
, the second term in the right hand side vanishes. For the fist one, only
is non-zero, so that:
The tensor has expression , so that is the form , of which the vertical lift is . □
Please note while the expression obtained is similar to the one of [
6], the sign is opposite.
The case of the action on two complete lifts is a little bit more complicated. First of all, given two vector fields
, a simple computation yields:
After rewriting, Equation (
30) becomes:
Let us consider, for
fixed vector fields, the
-tensor
:
Its vertical lift is then:
On the other hand, the complete lift of the vector field
is:
Combining Equations (
32) and (
33) yields:
Putting the expression in Equation (
31) yields:
Let
K be a
-tensor
K. Its Lie derivative can be written [
14] (p. 32, Proposition 35):
It thus becomes:
which can be written in coordinates:
Plugging it into Equation (
35) finally gives the reduced expression:
where
are the tensors defined in coordinates by:
The Equation (
38) completely defines the tensor
.
From the complete lift
, dual connections with respect to the Riemannian extension can be obtained:
The pair
defines the complete lift of the original statistical structure to the pseudo-Riemannian manifold
. When ∇ is without torsion, then
D is symmetric. Using Equation (
38) and the fact that in such a case
show that
is itself symmetric, proving that
has vanishing torsion.