1. Introduction
The theory of integrable systems begins with W.R. Hamilton who in 1835 pronounced that the equations of motion of an
n body system conform to the principle of least action, and consequently can be represented as
under the transformation
where
is the total energy, with
T the kinetic and
V the potential energy of the system. He then observed that
H is conserved along the solutions of the system. Hamilton’s discovery gave rise to a new class of differential equations of the form
associated with any function
H of
variables
and
. Such equations became known as the
canonical equations. Then the transformations
that preserved the canonical form of these equations were also called canonical, and the functions whose values were conserved by canonical systems became known as
integrals.
Hamilton’s discovery had an immediate impact on the scientific community of the nineteenth century. Canonical equations became the central object of study in the mathematics of that period with the contributions of J. Liouville, S.D. Poisson, C.G. Jacobi and H. Poincaré leading the way towards a new branch in mathematics known today as the theory of integrable systems. This theory was principally driven by a lasting interest in the existence of extra integrals of motion and the symmetries that are accountable for the existence of these integrals. Its defining moment may be attributed to S.D. Poisson who in 1809 [
1] introduced his bracket (known since as the Poisson bracket)
for functions
f and
g in the canonical variables
.
The introduction of the Poisson bracket greatly facilitated the emerging theory of that period. It provided an alternative definition of canonical systems as differential systems that satisfy
and it also redefined integrals of motion associated with
H as functions
F that satisfy
. It was Jacobi, however, who noticed the fundamental property of the Poisson bracket
that has been known ever since as the Jacobi’s identity. It is then an easy consequence of Jacobi’s identity that
is a third integral of motion for
H for any two integrals
and
(known as Poisson’s theorem [
2]). Alternatively integrals of motion were detected through a suitable change of canonical coordinates. Jacobi characterized such changes of coordinates through a generating function
. According to Jacobi
is canonical if and only if
Poincaré characterized canonical change of coordinates in terms of differential forms:
is canonical if and only if
for some function
S.
From contemporary perspectives the theory of integrable systems begins with C.G. Jacobi and his seminal book
Lectures in Dynamics [
3]. Jacobi demostrated that the canonical Equation (
3) can be integrated with the aid of a partial differential equation
in terms of an unknown function
S. He showed that if a particular solution of (
7) can be found in terms of
n arbitrary constants of motion
then
for some function
, and the transformation
transforms the canonical coordinates
into new canonical coordinates
relative to which the canonical Equation (
3) are transformed into the equations
whose solutions are given by
Canonical coordinates whose solutions are given by (
9) are called
action-angle coordinates [
4].
Equation
is known as
Jacobi’s equation. Poincaré referred to the above result as the first theorem of Jacobi in his treatise of celestial mechanics [
2]. Jacobi’s solution of the above partial differential equation in terms of the elliptic coordinates stands out as the most original and, perhaps, the most enigmatic contribution to the theory of canonical systems. Jacobi’s use of elliptic coordinates suggested the existence of a special class of variational problems whose solutions can be described by Abelian integrals in some privileged system of coordinates, exemplified by the geodesic problem on the ellipsoid. In the absence of any apparent symmetries on the ellipsoid that account for the integrability of the geodesic problem, this result of Jacobi seemed particularly mysterious.
In Jacoby summary, the main problem in the integration of differential equations appears in the choice of right variables. Given no general rule for finding the right choice, it is better to introduce special variables first, and then investigate the problems that naturally lend themselves to these variables [
3]. Jacobi, however, does not comment on another exceptional aspect of his discovery, namely the mysterious presence of partial differential equations for the problems of variational calculus, an issue that remained open for a long time.
Almost a hundred years later, C. Carathéodory in the introduction to his famous book on the calculus of variations [
5] remarks that “ neither Jacobi, nor his students, nor the many other prominent men who so brilliantly represented and advanced this discipline during the nineteenth century, thought in any way of the relationship between the calculus of variations and partial differential equation”. H. Poincaré also sidestepped this issue by treating canonical systems as the solutions of a dynamical system
where
denote the constants
. Since
the above differential equation can be reformulated as
which shows that Equations (
3) and (
10) have the same solutions. Poincaré equation used Equation (
10) to show that a transformation
is canonical if and only if the differential form
satisfies
for some function
.
Among many other stellar advancements of that epoch, the following result of J. Liouville, reported in 1855 [
6], seemed particularly influential for the present mathematics [
4]. Liouville considered a differential system
associated with a function
. He then assumed the existence of
n integrals of motion
such that the system of equations
can be solved for
in the variables
. He also imposed the condition that
are in involution, that is,
Liouville interpreted
as the exactness condition for the differential form
and concluded that there is a function
such that
that is,
. But then
S can be used as the generating function for the canonical transformation
where
. Liouville refers to
as a complete system. Indeed, in the new coordinates
remain constants of motion and therefore
. Since
,
F is independent of
, that is,
F is a function of
t and
h. But then
is a given function of time, and
is given by its integral. When
F is a function of
x and
y, and not explicitly dependent on time, then
is only a function of
h. Therefore, the general solution is given by
This heritage from 19-th century mathematics forms a core of knowledge indispensable for problems of mathematical physics, symplectic geometry, calculus of variations and optimal control theory, and its unanswered questions still motivate much of the current research in integrable systems.
This paper will address the “hidden” symmetries that account for the existence of extra integrals of motion. We will show that the canonical integrable systems, such as Jacobi’s geodesic problem on the ellipsoid, Neumann’s mechanical problem on the sphere, Euler’s top, and the associated heavy tops, all derive their constants of motion from certain “meta” systems on Lie groups that admit isospectral representations of the form
on the Lie algebra
of
G.
We will confine our attention to semi-simple Lie groups
G having a compact subgroup
K, for then the Lie algebra
admits a decomposition
, where
the Lie algebra of
K and
is the orthogonal complement of
relative to the Killing form
. But then
and therefore
as a vector space also carries the semi-direct Lie algebra
associated with the semi-direct product
. We will then single out a class of left-invariant variational problems on
G that admit an isospectral representation with
where
in the semi-direct case and
in the semi-simple case,
,
, and where
A is a fixed element in
. It is then known that the spectral invariants
are in involution relative to the canonical Poisson bracket on
, respectively on
. We will show that these invariants shed light on the hidden symmetries that surround many of the aforementioned integrable systems. In the process we will be able to demonstrate that the quest for the geometric origins behind the “mysterious” integrals of motions also leads to new and unexpected encounters with problems of Riemannian and sub-Riemannian geometry in which geometric control theory plays a major role.
2. Symplectic Background, Hamiltonian Systems
The theoretic framework upon which above claims are made is rooted in symplectic geometry. Below is a brief summary of the theoretical ingredients required for our main results.
Recall that a manifold
M together with a non-degenerate and closed 2-form
is called
symplectic. The symplectic form yields a correspondence between functions and vector fields: to every function
f there is a vector field
defined by
for all vector fields
X on
M. Then
is called the Hamiltonian vector field generated by
f. Every symplectic manifold is even dimensional, and at each point of
M there is a neighbourhood with coordinates
on which Hamiltonian vector fields are given by
This choice of coordinates in which
is given by (
17) is called
symplectic, or canonical in the terminology of the 19-th century.
Every cotangent bundle
is a symplectic manifold with its canonical symplectic form,
in terms of the symplectic coordinates
. As a symplectic manifold the cotangent bundle is special, in the sense that it is also a vector bundle. Hence every vector field
X on
M can be lifted to a unique Hamiltonian vector field
in
via the function
,
. Vector field
is called the
Hamiltonian lift of X. The same procedure is applicable to any time varying vector field, and by extension to any differential system on
M. Thus any differential system in
M can be lifted to a Hamiltonian system in
. This fact is also important for problems of optimal control where the Maximum Principle singles out the appropriate Hamiltonian lifts that govern the optimal solutions [
7].
When the base manifold is a Lie group
G, and when the underlying differential system is either left or right invariant, then there is a special system of coordinates based on the representation of
as
, with
the dual of
. This coordinate system preserves the left invariant symmetries and elucidates the conserved quantities of the associated Hamiltonian systems. The passage to these coordinates and the associated formalism was amply documented in my earlier publications [
7,
8,
9]. Below we will highlight the main points in this theory required for our results.
2.1. Left-Invariant Trivializations and the Symplectic Form
Having in mind applications that involve left-invariant variational systems the cotangent bundle and the tangent bundle will be viewed as the products and via the left-translations. More explicitly, tangent vectors will be identified with pairs via the relation , where denotes the tangent map associated with the left translation . Similarly, points will be identified with pairs via . Then , the tangent bundle of the cotangent bundle , will be identified with , with the understanding that an element denotes the tangent vector at the base point .
We will make use of the fact that
is a Lie group in its own right since
, as a vector space, is an abelian Lie group. Then left-invariant vector fields
V in
will be denoted by
,
in
. In this setting the canonical symplectic form on
is given by
for any left-invariant vector fields
and
[
7]. The above form is invariant under the left-translations in
, and is especially revealing for the Hamiltonian vector fields generated by left-invariant functions on
.
A function
H on
is left-invariant if
for all
and all
. That is, left-invariant functions coincide with functions of
. Each left-invariant vector field
on
G lifts to a linear function
on
because
Functions
H on
generate Hamiltonian vector fields
on
whose integral curves are the solutions of
In a more general case, where
H depends on both
and
, the integral curves of
are the solutions of
that can be easily shown through the relations
This situation occurs in problems of mechanics in the presence of potential functions. For example, the movements of a three-dimensional rigid body with a potential function
are described by the Hamiltonian
on the cotangent bundle of
, where
denote the columns of the matrix transpose of the rotation
R in
. For then the directional derivative of
V in the direction
is given by
where
denotes the standard inner product
in
. Thus
and the equations of motion for
H are given by
These equations extend to an “
n-dimensional rigid body” with the Hamiltonian
where
In this context, is the generalization of the angular momentum, is the generalization of the angular velocity, is the generalized inertia tensor, and is the external torque.
2.2. Poisson Manifolds, Coadjoint Orbits
Equation (
19) lend themselves to an insightful description in terms of the Poisson structure on
inherited from the symplectic form
. Recall that a manifold
M together with a bilinear, skew-symmetric form
that satisfies
for all functions
on
M, is called a Poisson manifold.
Every symplectic manifold is also a Poisson manifold with the Poisson bracket given by
. However, the converse may not be true due to the fact that the Poisson bracket may be degenerate at some points of
M. Nevertheless, each function
f on
M induces a Poisson vector field
through the formula
as in the symplectic case. Poisson vector fields clarify the relation with symplectic manifolds through the following fundamental fact: every Poisson manifold is foliated by the orbits of its family of Poisson vector fields and each orbit is a symplectic submanifold of
M with its symplectic form
[
7].
The dual
of a Lie algebra
is a Poisson manifold with the Poisson bracket
for any functions
f and
h on
. In the literature on integrable systems the bracket
is known as the Lie-Poisson bracket [
10]. We have taken its negative to be compatible with the projections of left-invariant Hamiltonian vector fields on
(and also to agree with the sign conventions in [
7]).
It follows that each function
H on
defines a Poisson vector field
on
via the formula
in which case the integral curves of
are the solutions of
Thus, as we already mentioned above, each function H on may be simultaneously viewed as a Hamiltonian on , and a function on the Poisson space . Of course, Poisson equations coincide with the projections of the Hamiltonian equations on .
Solutions of Equation (
24) are intimately linked with the coadjoint orbits of
G through the following proposition. due to of A.A. Kirillov [
11] (the proof is also given in [
7]).
Proposition 1. Let denote the family of Poisson vector fields on and let denote the orbit of through a point . Then M is equal to the connected component of the coadjoint orbit of G that contains . Consequently each coadjoint orbit is a symplectic submanifold of .
Recall that the coadjoint orbit of G through a point is given by
The fact that the Poisson equations can be naturally restricted to coadjoint orbits implies useful reductions in the theory of Hamiltonian systems.
2.3. Representation of Coadjoint Orbits on Lie Algebras
On semi-simple Lie groups Poisson Equation (
24) can be expressed on
as
because the Killing form, or any scalar multiple of it
is non-degenerate, and invariant, in the sense that,
, and can be used to identify
with
via the formula
Then coadjoint orbits are identified with the adjoint orbits and the Poisson vector fields are identified with vector fields . Each vector field is tangent to an orbit at L, and , in is the symplectic form on each orbit .
In a reductive semi-simple Lie group
G there is also the semi-direct product
described earlier which generates its own coadjoint orbits on the dual of the Lie algebra
of
. Recall that the Lie algebra
of
consists of pairs
together with the Lie bracket
When the elements
are identified with the sums the sums
in
,
as a vector space, carries a double Lie algebra; the semi-direct product Lie algebras
, and the semi-simple Lie algebra
. We then have
for any
in
and any
in
, with
in the semi-direct case, and
in the semi-simple case.
Since both
and
Lie algebras over the same vector space, the Poisson equations on
can be also represented on
via the quadratic form
, but the resulting expression takes a slightly different form. To see the difference, let
and
denote the decompositions of
and
L onto the factors
and
. On the semi-direct product Poisson equations reduce to
This equation can be combined with the equations for the semi-simple case in terms of the parameter
s as
One can show that
is the coadjoint orbit through
under the action of
when
is identified with
in
, and when
is identified with
[
7].
The adjoint orbits of a non-compact semi-simple Lie groups
G can be realized as the cotangent bundles of flag manifolds [
12], and the same has been shown recently for the coadjoint orbits under the action of the semi-direct products [
13,
14]. We will make use of that fact later on in the paper.
3. Affine-Quadratic Problems
As stated earlier, we will restrict our attention to semi-simple Lie groups
G and compact subgroups
K with zero centre. We refer to
as a reductive pair. Then
and
will denote their Lie algebras, and
will denote the orthogonal complement of
in
relative to the Killing
in
. Recall that
is non-degenerate and satisfies
Hence
is well defined and satisfies
( in fact,
because
is semi-simple). We will also assume that
. Note that the Killing form is negative-definite on
because
K has zero centre [
15], hence any negative scalar multiple
of it is positive definite on
. We shall assume that such a scalar product is fixed.
An affine quadratic problem is defined through a positive definite quadratic form
Q on
, and a regular element
A in the Cartan space
. An element
A in
is called regular if
is an abelian subalgebra in
. The corresponding affine-quadratic problem consists of finding the solutions
in
G of the affine control system
generated by a square-integrable control
in
that transfers a given state
in
G to a given terminal state
in time
T with a minimal energy
. Any positive definite quadratic form
Q is of the form
for some self-adjoint and positive linear operator
on
. Then there exists an orthonormal basis
in
such that
is diagonal relative to it. That is, if
then
for some constants
. Then (
29) can be rewritten as as
where
are the left-invariant vector fields with
and
, with
the energy associated with each solution. The most natural case occurs when
, that is, when
. We will refer to this case as the canonical affine-quadratic problem.
When
A is regular, then (
29) is controllable, a consequence of our assumption
, that is, any terminal state
can be reached in some finite time
from any initial state
. But then there is an optimal solution
on the interval
for which the energy of transfer
is minimal (see [
7] for the proof). Therefore the above optimal control problem is well-posed.
To each affine-quadratic problem there is an analogous “shadow problem” defined on the semi-direct product defined by the same data as in the original problem. It follows that every affine space that defines an affine left-invariant system on G also defines a corresponding left-invariant affine system on the semi-direct product . Thus behind every affine quadratic optimal problem on G there is a corresponding affine-quadratic “shadow” problem on the semi-direct product . The shadow problem is also well defined in the sense that optimal solutions exist on some interval for each pair of boundary points and .
According to Pontriyagin’s Maximum Principle every optimal trajectory generated by a bounded and measurable control is the projection of an extremal curve, and each extremal curve is an integral curve of a suitable Hamiltonian system on the cotangent bundle of the ambient space. The Maximum Principle is also valid for optimal problems with
controls over affine systems with quadratic costs ([
16]).
Let now
be an optimal trajectory generated by a control
. According to the Maximum Principle,
is the projection of an extremal curve
in
along which the cost extended Hamiltonian
is maximal at
relative to all competing controls
. In this notation, each
is the Hamiltonian lift of
, i.e.,
. In the abnormal case, which we will not treat here,
, and the Maximum principle results in the constraints
In the normal case,
, the maximality condition implies that the optimal controls are of the form
. Consequently, optimal solutions are the projections of solution curves of a single Hamiltonian vector field
generated by the Hamiltonian
Recall that each lift
is a linear function on
given by
with
. Thus
H is left-invariant, hence its Hamiltonian equations are given by
The associated Poisson equations can be now written in
as
after the identification of
with
via the scalar product
, and the decomposition
(Equation (
27)). In the canonical case (
) the preceding equations reduce to
Note that
is an integral for (
32). This integral is a universal integral of motion in the sense that it remains constant for any left-invariant Hamiltonian on
.
3.1. Isospectral Representations
We now single out a remarkable class of affine-quadratic Hamiltonians that plays a prominent role in the theory of integrable systems. It consists of Hamiltonians
that admit a spectral representation of the form
for some element
that comutes with
A, where
and
are the solutions of the Poisson Equation (
32). Such a class is called
isospectral and
is called the associated spectral curve. This terminology has origins in J. Zimmerman’s PhD thesis in 2002, in which he showed that the rolling sphere problem is isospectral [
17]. We will return to Zimmerman’s problem and relate its results to the canonical affine-quadratic problem [
18].
For Hamiltonian systems that admit an isospectral representation, the discrete spectral invariants of
L are replaced by the functional invariants
. Remarkably, the functional invariants
are in involution with each other, both with respect to the semi-simple and the semi-direct product Lie bracket, and in some instances generate a sufficient number of integrals of motion to ensure complete integrability ([
7], 9.2). For instance, the family of functions
is completely integrable on each coadjoint orbit in
[
19]. This means that
H is completely integrable on each coadjoint orbit in
whenever
H is in involution with the Hamiltonian lifts
. This implies that the canonical affine Hamiltonian is completely integrable on coadjoint orbits since each left-invariant vector field with values in the isotropy group of
A is a symmetry for the canonical system. It is reasonable to expect that the analogous family of functions is also completely integrable on coadjoint orbits of
G, but, to the best of my knowledge, the proofs have not yet appeared in the literature.
The focus on the affine-quadratic problem and the associated Hamiltonians allows for the following characterization of isospectral Hamiltonians (proved in [
7]).
Theorem 1. An affine Hamiltonian is isospectral if and only for some element that commutes with A. In the isospectral case, is an invariant set for equations (32). On this set (32) are given byand admit the reduced spectral representation This theorem shows that the fundamental results A.T. Fomenko, A. S. Mischenko, and V.V. Trofimov on integrable left-invariant Riemannian metrics on compact Lie groups [
20,
21] based on Manakov’s seminal work on the
n-dimensional Euler’s top [
22] are subordinate to the isospectral properties of the affine Hamiltonian system, in the sense that the spectral invariants of
on
are always in involution with a larger family of functions generated by the spectral invariants of
on
associated with an affine Hamiltonian
H.
3.2. Affine Hamiltonians and Mechanical Tops
Let us now draw comparisons between the semi-direct Poisson equations
and the “top-like” equations:
associated with the Hamiltonian
. We will consider two cases- tops with linear potentials and tops with quadratic potentials.
Linear potentials. Equation (
38) will be referred to
heavy top-like equations when the potential energy
V is generated by a linear Newtonian field, that is, when
, where
a is a vector in
, and
are constants. When
, the external torque
is equal to zero, and Equation (
38) reduces to the Hamiltonian equation associated with a left-invariant Riemannian metric induced by the operator
(called the
n-dimensional Euler’s top in some Russian literature [
20]).
Heavy top-like equations can be written more compactly as
where
, and
. Since
,
is a solution of
. Hence each solution resides on the sphere
.
Our theorems below relate Equation (
39) to the Poisson Equation (
37) on the reductive Lie algebras
and
associated with reductive pairs
where
is
when
and
when
, and
.
We will tackle both cases simultaneously but first we will need to introduce additional notation and terminology. We will use to denote the Lie algebra of endowed with the trace form . Relative to we define its invariant bilinear form in the ambient space .
Then
, will denote the matrix defined by
and
denotes the matrix
. Since
belongs to
for any
in
. We then have
Theorem 2. Heavy top-like Equation (39) are isomorphic to the Poisson Equation (37) on the coadjoint orbit through under the coadjoint action of . The passage to the affine Hamiltonian is via the following correspondences For a proof see [
14]. The preceding theorem clarifies the presence of heavy tops in the Hamiltonian equations on Lie algebras [
10]. It also clarifies the relation between the tops and elastic rods initiated by G. Kirchhoff known as the “kinetic analogues” [
23,
24]. It also proves that the classification of completely integrable elastic rods in [
7,
8] carries over to the heavy tops.
Quadratic potentials. We will now show that the tops with quadratic potential
V are also present in the equations of affine Hamiltonians, but this time on the tangent bundle of
, or more precisely on the tangent bundle of the semi-direct product
where
denotes the space of symmetric
matrices with zero trace. For that purpose let
with
,
, and
S a symmetric
. In accordance with (
38) the Hamiltonian equations of
are given by
where
.
Theorem 3. Top-like Equation (41) are isomorphic with the Poisson equations generated by the affine Hamiltonian on the coadjoint orbit through and under the action of the semi-direct product . Proof. Every solution
of (
41) generates symmetric matrices
and
given by
with
. Then,
Additionally,
which in turn implies that (
41) can be written as
Let now
. Note first that
Therefore and satisfy the same differential equation. Hence whenever . If we now rename as we get the Poisson equations for the shadow Hamiltonian □
The preceding theorem links isospectral Hamiltonians to the equations of the top under quadratic potentials and paves a way to the
n-dimensional generalization of O. Bogoyavlensky’s famous result on integrability of three-dimensional mechanical tops in the presence of a quadratic potential [
25]. The path to isospectral Hamiltonians is provided by Manakov’s observation that the inertia tensor
for a rigid body is confined to the transformations
, for some positive definite matrix
S. For then,
. Indeed, in this situation
, and
Hence the corresponding affine Hamiltonian
is isospectral on
(Theorem 1). Since the equations of the Hamiltonian
corresponding to the top with quadratic potential
can be identified with the Poisson equations of
on the coadjoint orbit through
, the isospectral invariants of
are integrals of motion for the top. (Theorem 3). Since
for each non-singular symmetric matrix
S, the spectral invariants of
form a completely integrable family of functions on each coadjoint orbit in
(semi-simple and semi-direct). the top with a quadratic potential is completely integrable in all dimensions.
3.3. Three-Dimensional Tops- Kirchhoff-Kowalewski Type
We will now turn our attention to the class of affine-quadratic systems of Kirchhoff-Kowalewski type on complex Lie algebras with a particular interest on the symmetries that account for the existence of Kowalewski’s integral reported in her seminal paper on the motions of a rigid body around a fixed point under the influence of gravity [
26]. We will follow our recent paper [
27] and show that there is a natural Hamiltonian on
that answers the fundamental questions raised by Kowalewki’s paper, namely, what is the geometric rational behind her approach in which all the variables were treated as complex quantities, and secondly. what are the symmetries that account for the existence of not only her integral of motion, but also of similar integrals, known as Kowalewski type integrals, that subsequently appeared in the literature on integrable systems [
8,
28,
29,
30,
31].
Theorem 2 suggests that the search for the answers to the above questions should begin with the Poisson equations associated with an affine-quadratic Hamiltonian on
since both
and
are real forms for
(see also [
24]). We will show that Kowalewski’s “mysterious” change of variables appear naturally in the passage from
to
an important intermediate step towards the right Hamiltonian on
. The journey from
to
to this remarkable Hamiltonian begins with
where
is the coordinate representation of a point
L in
relative to an orthonormal basis
that conforms to the following Lie bracket
Table 1:
Then
are the Poisson equations generated by
H, where
denote the drift element in
,
,
, and
. The same equations can be also expressed as
When
the above equations formally coincide with the equations of the top:
(Equation (
21)).
On
there are two Casimirs:
Hence generic coadjoint orbits in are four-dimensional. Since each coadjoint orbit is symplectic, integrable cases occur whenever there is an extra integral of motion functionally independent of H, , and . Since the motion of the top is subordinate to the Poisson system of H on , the search for integrable tops reduces to the search for an additional integral of motion functionally independent from , and H.
Let us now come to the conditions of Kowalewski
and her “mysterious” variables
After the substitutions, Equation (
46) become
from which it can be easily extracted that
is an integral of motion. Following the terminology in [
7] we will refer refer to this integral as the Kirchhoff-Kowalewski integral. It is only in the special case
and
that this integral coincides with the integral of motion found by Kowalewski. The real versions of the Kirchhoff-Kowalewski integral were originally discovered by V. Kuznetsov and I.V. Komarov in their studies of the hydrogen atom [
28,
29].
Let us reveal the geometric rational behind Kowalewski’s change of variables. The explanations are most naturally articulated through the root system in
. Recall that any maximal commutative sub-algebra
of a Lie algebra
is called a Cartan subalgebra. All Cartan subalgebras in a semi-simple Lie algebra are conjugate, and hence all have the same dimension. The dimension of any Cartan algebra is the rank of
. The rank of
is two. Evidently each pair
, in
Table 1 generates a Cartan algebra in
. Since these algebras are conjugate, there is no preferential choice. However, in regard to the equations of the top, there is a preferential choice when two moments of inertia are equal. In the case that
the natural choice is the Cartan algebra generated by the pair
.
An element
in the dual
of a Cartan algebra
is called a root if for some
,
for all
. An easy calculation shows that there are four roots
,
given by
The corresponding root spaces are one dimensional, and are generated by
Together with
and
these matrices form a basis for
. A simple calculation shows that
Furthermore, , , and , for all i and j.
The Lie algebras
and
spanned by
, and
satisfy
. and each is isomorphic to
under the identification
An easy calculation shows that the coordinates
of an arbitrary point
relative to the basis
are transformed to the coordinates
relative to the basis
according to the following formulas:
Let us now
be given by
where
denotes the Kronecker product of matrices
A and
B.
To see that recall first that consists of matrices M that satisfy , where J is the matrix that defines the symplectic form on , i.e., . It is easy to check that both and satisfy with . Since , our claim follows.
We will identify
with pure complex quaternions
via the correspondence
Then the standard basis
in
is identified with the matrices
and any element
in
is represented by the quaternion
,
,
,
.
Let now , , so that and for each i, , and let . Matrices form an orthonormal basis in relative to the inner product on .
It is easy to verify that
is a Poisson map. Therefore
for any function
H on
, where
denotes the dual map of
. After the identification of
with
via the trace form, the Poisson equations of
associated with
H in (
44) become
or, in simpler form,
where
,
, and
Now we see Kowalewski variables
as the natural coordinates in this Poisson representation. Under Kowalewski’s conditions
,
, Equation (
55) reduce to Equation (
49). The passage from
H to
reveals the geometric rational behind the ad-hoc change of variables in (
48) and serves as a natural segue to our ultimate Hamiltonian on
.
3.4. Kowalewski’s Conditions and Isospectral Representations
We now address the origins of the “enigmatic” conditions (
47) through an extended affine-quadratic Hamiltonian
on
defined by complex numbers
,
, and
and an extended basis
,
and
, where
and
The reader can easily verify that
has the following decomposition
where
is the Lie algebra spanned by
and where
and
are respectively the linear spans of
and
. These spaces conform to the following Lie algebraic relations:
After
is identified with
via the scalar product
the above Hamiltonian can be written as
where
and
,
,
(note that
is negative on
which accounts for the negative signs in the expression for
).
Since
is semi-simple the Poisson equations for
are given by
These equations can be written in a more succinct form as
in terms of the following notations:
, as in the previous section, and
with
.
We now come to the crux of the matter, the existence of integrals of motion for the above system. The intermediate question is the existence of an integral I of the form for some constants and .
Proposition 2. is an integral of motion for in exactly two cases: when and , then , and in the second case, when , then . (for the proof see [27]). This first condition singles out the top of Lagrange, while the second condition is a precursor to Kowalewski’s top as will be demonstrated below. Note that the second condition and can be also written as , and where , , and . Then , and since it is orthogonal to , it will be denoted by .
We will say that (
59) satisfies the preliminary condition of Kowalewski whenever
and
. It follows that the preliminary condition of Kowalewski is synonymous with the integral of motion
. (This integral of motion was also discovered earlier by A.M Savu in [
32]).
We will now assume that the preliminary condition holds and we will pursue conditions on the ratio
, where
, that guarantee extra integrals of motion for system (
59). Note that in this situation
. Systems that satisfy the preliminary condition of Kowalewski and also satisfy
will be said to satisfy the Kowalewsky conditions. The following proposition provides an important characterization of Kowalewski’s conditions for both
and
.
Proposition 3. Assume that (59) satisfies the preliminary condition of Kowalewski and is restricted to the manifold . Then satisfies the isospectrality condition (as in Theorem 1) for some matrix , with if only if (59) satisfies the Kowalewski conditions. In fact, [27]. Indeed, under Kowalewski’s conditions
Since , .
It follows from Theorem 1 that Kowalewski’s condition is necessary and sufficient for the existence of isospectral representation
on the invariant manifold
. Consequently,
are integrals of motion for (
59), in involution with each other for each
, or
. Remarkably, the prototype of Kowalewski’s integrals of motion is found among the above spectral invariants. (see also [
33,
34] for other spectral representations).
We will show the existence of Kowalewki’s integral of motion directly from the equations
obtained from (
59) under the change of variables
,
Equation (
60) may be seen as a semisimple extension of the Kowalewski-type gyrostat in two constant fields introduced in [
35].
Equations (
60) may be also expressed in terms of the coordinates as
One readily obtains the following fundamental equalities
Hence,
is an integral of motion for (
61) since
An interested reader may want to show that the following are also integrals of motion
The preceding calculation also draws attention to the following general fact:
Proposition 4. is a constant of motion for any differential system in the variables that satisfyindependently of the equations that govern the evolution of and . To come back to the top of Kowalewski, note that
is an invariant subsystem for (
60). On this set,
, and
c reduces to
and remains an integral of motion for the reduced system
with its fundamental relations (
63)
This reduced system coincides the Kirchhoff-Kowalewski system on
(Equation (
49),
, after
u is replaced by
w). Then
coincides with the integral of motion discovered by Kowalewski. The remaining isospectral integrals of motion
and
coincide with the Casimirs on
.
To recover the semi-simple form of the Kirchhoff-Kowalewski integral, let
. In terms of
and
the preceding system becomes
This system satisfies the same equations as the Kirchhoff-Kowalewski system except for
. Indeed,
The remaining equations given by
are the same as (
66), and consequently yield
as an integral of motion for the preceding system, as well as for the Kirchhoff-Kowalewski system (Equation (
49)) when
is replaced by
.
The papers of V. Dragovi
and K. Kuki
[
30] and V. V. Sokolov [
31] produce differential systems which admit Kowalewski type integrals different from the ones in this paper and yet follow the same integration procedure used by S. Kowalewski in her original paper. Remarkably, all these systems satisfy the fundamental relations (
63) from which the existence of their extra integrals of motion could be easily ascertained.
4. Kepler, Jacobi, Neumann and Moser
Let us now return to and its Lie algebra endowed with the trace form . As a vector space V, the set of matrices with zero trace admits several kinds of Lie algebras and each of these Lie algebras induces its own Poisson structure on V. The most common Lie algebra is itself. Then induces the orthogonal decomposition where denotes the vector space of symmetric matrices in V. But then V also carries the semi-direct product structure .
However,
is also a closed subgroup of
G and hence the pair
induces its own Cartan decomposition
, where
is the orthogonal complement to
. In fact
K is the set of points in
G fixed by the automorphism
where
D denotes diagonal matrix with its first
p diagonal entries equal to 1 and the remaining
q diagonal entries equal to
The set of points
such that
satisfies
, that is,
. It follows that its tangent map
induces the above decomposition with
Consequently, matrices in are symmetric relative to the scalar product , in .
We will now return to the canonical Hamiltonians
and their Poisson Equation (
33) restricted to the coadjoint orbits through rank one matrices
in
. We will consider two cases: the coadjoint orbit through a symmetric rank-one matrix
of unit length under the action of
, and the second case, the coadjoint orbit through rank-one matrix
of unit length, symmetric relative to the Lorentzian inner product in
under the action of
. The above matrices can be naturally expressed in terms of the notations introduced earlier, the scalar product
in the ambient space
, and matrices
and
. For then
If then let . It follows that is the Euclidean sphere of radius when and a hyperboloid of two sheets when . We have chosen to be the sheet defined by .
Proposition 5. The coadjoint orbit through is symplectomorphic to the cotangent bundle of the real projective space in the semi-simple case, and it is symplectomorphic to the cotangent bundle of in the semi-direct case.
For the proof see [
36]. Here it is implicitly understood that the cotangent bundles are identified with the tangent bundles via the ambient inner product
. Then each tangent vector
is identified with
in
and
in
.
On the orbit through
,
, and the associated Poisson equations are of the form
A simple calculation show that
On the unit sphere, Equation (
71) after
A is replaced by
coincide with the equations for the mechanical problem of C. Neumann for a particle on the sphere moving under a quadratic potential [
37]. The preceding equations for
could be analogously interpreted as the equations on the hyperboloid for a particle moving under quadratic potential [
7].
The canonical affine-quadratic problem illuminates deep and beautiful connections between Kepler’s gravitational problem, Jacobi’s geodesic problem on the ellipsoid, and Neumann’s mechanical problems.
Let us first examine the isospectral integrals associated with the spectral curve
on the coadjoint orbit through rank-one matrices. The zero trace requirement is inessential for the calculations below and will be disregarded. Additionally,
A will be replaced by
and
will be rescaled by dividing by
to read
The spectrum of
is then given by
Matrix
is of the form.
where
We then have the following proposition
Lemma 1. For the proof see [
7] (p. 200).
Corollary 1. Function is an integral of motion for H.
Function F is a rational function with poles at the eigenvalues of the matrix Hence, is an integral of motion for H if and only if the residues of F are constants of motion for H.
In the Euclidean case the eigenvalues of
are real and distinct since
is symmetric and regular. Hence there is no loss in generality in assuming that
A is diagonal. Let
denote its diagonal entries Then
where
denote the residues of
It follows that
The preceding calculation yields the following proposition.
Proposition 6. Each residue is an integral of motion for Newmann’s spherical system, and functions are in involution.
These results coincide with the ones reported in [
38,
39,
40], but the connection with the affine-quadratic problem shows that similar integrals of motion exist for the hyperbolic Neumann problem [
7] (p. 191).
In the literature on integrable systems the integrals of motion for Neumann’s problem are related to the integrals of motion for Jacobi’s problem on the ellipsoid through the transformation of H. Knörrer that transforms the Neumann’s equations on energy level
onto the equations of Jacobi on the ellipsoid [
39,
41]. Our exposition takes another route: we will instead show that an “elliptic” problem on the sphere is completely integrable with its integrals of motion as in Neumann’s problem, and then we will show that the Hamiltonian equations for the elliptic problem on the sphere and Jacobi’s problem on the ellipsoid are symplectomorphic. We will then use this symplectomorphism to show the existence of Jacobi’s integrals of motion on the ellipsoid.
Let now
denote an affine-quadratic Hamiltonian on
defined by a diagonal matrix
D with positive diagonal entries. As before, we will dispense with zero-trace requirements since they are inessential. The above Hamiltonian is generated by a positive definite operator
and the drift
. We will call this Hamiltonian elliptic for reasons that will be made clear later on. Let
. Then
Therefore
is isospectral (Proposition 1) and its Hamiltonian equations admit a representation
Since is a spectral curve for the canonical affine Hamiltonian we have the following corollary.
Corollary 2. The spectral invariants of are common integrals of motion for both the canonical Hamiltonian and the elliptic Hamiltonian .
As before, on coadjoint orbit through
, the Poisson equations of
( the semi-direct version) are given by
We then have
which shows that the Hamiltonian
is given by
The correspondence defines a symplectomorphism between the cotangent bundle of with its canonical Poisson bracket and the coadjoint orbit through (Proposition 5).
Proposition 7. On energy level Equation (72) correspond tounder the correspondence . These equations can be reparametrized by a parameter
to read
We will presently show that Equation (
73) are Hamiltonian equations that correspond to the geodesic problem on the sphere relative to the elliptic metric
.
As an intermediate step we will now derive the Hamiltonian equations associated with the geodesic problem on the quadric surface
induced by the scalar product
. We will follow the procedure based on the version of the Maximum Principle for variational problems with constraints outlined in [
7] (p. 218) and identify the quadric surface with the submanifold
. Then its cotangent bundle will be defined in terms of the constraints
The Hamiltonian lift of a curve
that belongs to
is given by
for the multipliers
and
that satisfy
. If
, then
According to the Maximum Principle an extremal control
must optimize
on
over all controls that satisfy
. Hence extremal controls are the critical points of
for some multiplier
, that is, they are solutions of
. It follows that the extremal controls are of the form
, But then
implies that
For this choice of controls
where
. An easy calculation shows that
Then the extremal curves are the solutions of the following differential equation:
which emanate from
, that is, satisfy
We will now single out the cases relevant for our earlier claims.
The geodesic problem on the ellipsoid. In this classic case initiated by C. Jacobi
and
. Hence
Then Equation (
74) reduce to
The preceding equation agree with the equations in J. Moser [
39].
The elliptic problem on the sphere. Here the ambient metric is defined by a positive-definite matrix
D and
. In such a case Equation (
74) are given by
The Hamiltonian equations are then given by
which agrees with Equation (
73) when
.
Proposition 8. The Hamiltonian systems that correspond to the elliptic problem on the sphere and the geodesic problem on the ellipsoid are symplectomorphic.
Proof. Let
denote the coordinates on the tangent bundle of the sphere and let
denote the coordinates on the tangent bundle of the ellipsoid
. In these coordinates the systems in question are given by
where
and
.
Let
denote the mapping from the cotangent bundle of the sphere to the cotangent bundle of
E defined by
Let
denote the Liouville-Poincaré canonical form on
. Then
because
. Since
takes the Liouville form on
to the Liouville form on
, it also takes the canonical symplectic form on
to the canonical symplectic for on
and hence is a symplectomorphism.
It now follows from (
78) that
and that
. Then,
and thus
takes the Hamiltonian flow on the sphere onto the Hamiltonian flow on the ellipsoid. □
Proposition 9. Jacobi’s problem on the ellipsoid is completely integrable. Functionsare constants of motion, all in involution with each other, for the Hamiltonian system on the cotangent bundle of the ellipsoid. Proof. We have shown that
are an involutive family of integrals of motion for the elliptic-geodesic problem on the sphere. We have also shown that the above integrals of motion are the residues of the function
We will now show that functions (
79) are the residues of the pull-back of
F under the symplectomorphism
. First note that
F remains unchanged if the variable
y is replaced by
with
an arbitrary number. Since
, we may replace
y by
and
x by
. Also note that
(use Equation (
75)). Then,
It follows that
is constant along the solutions of Jacobi’s equations.. A calculation identical to the one used for Neumann’s system shows that
are the residues of
F, and hence are integrals of motion for Jacobi’s equations. □
Degenerate Case and Kepler’s Problem
Let us now return to the Hamiltonian equations generated by the canonical affine Hamiltonian
on the coadjoint orbit through
for some
with
and their equivalent formulation on the tangent bundle of
:
When
the Hamiltonian
H reduces to
and the corresponding equations reduce to
Then Equation (
82) yield an integral of motion
, and Equation (
81) reduce to
Upon differentiating we get
We will now assume that
so that
is the hyperboloid
. On energy level
,
the solutions of (
83) are given by
where
and
are constant vectors (complex when
) that satisfy
.
For the above curves trace great circles on the sphere and for the solutions trace great hyperbolas on the hyperboloid (an immediate consequence of the fact that ). That is, solutions are the geodesics on spaces of constant non-zero curvature. The zero curvature case may be obtained by considering as a continuous parameter and then letting it tend to zero (as will be explained below).
We will now show that there exists a canonical change of coordinates
in which
p is the stereographic projection through the point
given by
such that in the new coordinates the preceding geodesic differential system is transformed into the
n-dimensional Kepler’s system, an
n-dimensional generalization of the Hamiltonian equations that describe the motion of a planet around an immovable planet in the presence of the gravitational force.
Equation (
84) yields
, where
. Therefore the inverse map
is given by
Assume that the cotangent bundle of
is identified with its tangent bundle
via the Euclidean inner product
, and let
denote the points of
. We will next find
such that
for all
with
and
. For then the transformation
is a symplectomorphism since it pulls back the Liouville form
on
onto the Liouville form
in
(The symplectic form is the exterior derivative of the Liouville-Poincaré form). It follows that
After the appropriate differentiations in (
85) we get
Since
y is orthogonal to
x,
. Therefore,
After the substitutions
into the preceding equation we get
To pass to the problem of Kepler, write the Hamiltonian
in the variables
. An easy calculation in (
86) yields
. Therefore,
The corresponding flow is given by
On energy level
and the preceding equations reduce to
After the reparametrization
Equation (
23) become
On
,
and
So in the spherical case and in the hyperbolic case.
The Euclidean case can be obtained by a limiting argument in which is regarded as a continuous parameter which tends to zero.
To explain in more detail, let
The transformation with is the inversion about the circle in the affine hyperplane , and is the corresponding transformation of the Euclidean metric . The Hamiltonian associated with this metric is equal to This Hamiltonian can be also obtained as the limit of when . On energy level and therefore, . Of course, the solutions of (12) tend to the Euclidean geodesics as tends to zero. Consequently, is a solution of and hence, is a geodesic corresponding to the standard Euclidean metric.
Let us also note that the angular momentum and the Laplace-Runge-Lenz vector for Kepler’s problem have simple geometric interpretation on the coadjoint orbits according to the following proposition.
Proposition 10. Let and On energy level , This remarkable discovery that the solutions of Kepler’s problem are intimately related to the geometry of spaces of constant curvature goes back to A.V. Fock’s paper of 1935 [
42] in which he reported that the symmetry group for the motions of the hydrogen atom is
for negative energy,
for zero energy and
for positive energy. It is then not altogether surprising that similar results apply to the problem of Kepler since the energy function for Kepler’s problem is formally the same as the energy function for the hydrogen atom.
This connection between the problem of Kepler and the geodesics on the sphere was reported by J. Moser in 1970 [
43], while Y. Osipov [
44] reported similar results later for geodesics on spaces of negative constant curvature. In spite of their brilliance, these papers did not attempt any explanations in regard to this enigmatic connection between planetary motions and geodesics on space forms. This issue later inspired V. Guillemin and S. Sternberg to take up the problem of Kepler in a larger geometric context, with Moser’s observation as the background, in a paper titled
Variations on a theme by Kepler [
45]. The introduction of Kepler’s problem through the canonical affine-quadratic problem exemplifies, once again, this fascinating and recurrent interplay between mathematical physics, geometry and integrable systems.
5. Homogeneous Riemannian Manifolds and Rolling Geodesics
Our overview of integrable systems raises a natural question: what is the geometric origin behind the affine-quadratic problem that accounts for its ubiquitous presence in the theory of integrable systems? A partial answer to this question comes, somewhat unexpectedly, from a new class of variational problems, called rolling problems. We will take up this issue next. Since the underlying variational problems require new concepts and terminology, we will be obliged to make a slight detour into an earlier paper [
9] in order to introduce the necessary ingredients.
The general setting is defined by a reductive pair
with
G semi-simple and
K compact. We assume that the Lie algebra decomposition
, with
the orthogonal complement of
relative to the Killing form on
satisfies the strong Cartan conditions
We will also assume the Killing form is of definite sign on
in which case
will denote a scalar multiple of the Killing form that is positive on
. We recall that the Killing form is invariant under any linear automorphism of
and hence the quadratic form
is
invariant [
15].
We consider
G a semi-Riemannian manifold relative to the left-invariant metric
induced by
(the Killing form is not necessarily positive on
, hence the metric is in general of indefinite sign, i.e., it is semi-Riemannian [
46]). The left-invariant distributions
and
are called horizontal and vertical respectively. Then curves that are tangent to
, i.e., satisfy
are called horizontal. Likewise curves that are tangent to
are called vertical. It follows that
We will assume that
consisting of the left coset
is endowed with a manifold structure so that the natural projection
is a smooth surjection [
46]. A curve
in
G is called a lift of a curve
if
. A lift is called horizontal when
is a horizontal curve. Every curve
in
M is the projection of a horizontal curve
. If a curve
is a solution of
for some curve
then
The correspondence
given by
is an isomorphism and induces a metric on
MLet now
denote the group of diffeomorphisms defined by the left action
We then have
Proposition 11. The metric (89) is invariant under , that is,for any and any tangent vectors and in . It follows that each is an isometry. Since G acts transitively on M, M can be represented by the orbit where and e is the group identity in G. It follows that for any . Note that is the flow generated by a right-invariant vector field . Therefore the flow of is -related to the flow in M. We will let denote the infinitesimal generator of the flow .
It follows that each
is a Killing vector field on
M. A vector field whose flow acts on
M by isometries is called a Killing vector field (see [
46] for additional details). The correspondence
is one to one and onto
. Since the Lie brackets of vector fields related by a mapping
F are also
F-related, the Lie brackets
are
-related to
. Therefore the correspondence
is a Lie algebra homomorphism, and hence
is a finite dimensional Lie algebra of Killing vector fields that satisfies
for each
.
Note that
. So if
then
and therefore
. It then follows that
is an isometric isomorphism from
onto
. More generally if
is any horizontal curve then
implies that
and
Therefore is an isometry that maps onto .
A homogeneous manifold
with a
G-invariant metric defined by a reductive pair (G,K) with
G semi-simple and
K compact, will be referred to as semi-simple (it is defined by a semi-simple Lie group
G, a compact subgroup
K, and the metric induced by the Killing form). It can be shown that any symmetric Riemannian space with no Euclidean factors can be reduced to a semi-simple manifold (so that
holds). Conversely, every semi-simple manifold is locally symmetric. It is symmetric when
G is simply connected (see [
48], Proposition 6.27). We will not pursue further proximities with symmetric spaces since the present exposition makes no use of geodesic symmetries.
We now come to the main topic of this section, rolling of semi-simple manifolds on their tangent spaces. We begin by recalling the basic definition.
Definition 1. A curve on a Riemannian manifold M rolls on a curve on another Riemannian manifold if there exists an isometry that satisfies:and also satisfies the condition that is a parallel vector field in along for each parallel vector field along in M. This intrinsic definition of rolling was introduced in [
49], and later used in [
50,
51]. In this context the triple
is called a rolling curve. It is clear that rolling is reflexive in the sense that if
is rolled on
by an isometry
then
is rolled on
by the isometry
, and therefore
is also a rolling curve.
We will now examine rollings of semi-simple manifolds on their tangent planes. It comes as a pleasant surprise that such rollings are essentially described by Equation (
91) reinterpreted in terms of rolling. So the passage to rolling becomes largely a question of semantics, as demonstrated in the text below.
We will consider rollings of
M on
with its metric
defined by (
89). The rollings on other tangent spaces are conjugate to the rollings on
[
47]. Let
be an arbitrary curve in
M and let
be a curve in
that
is rolled on. It follows that
for some horizontal curve
. If
is a solution of
then according to (
91)
If we now let
be any solution in
of
then
is an isometry that rolls
on
since the parallel transport condition is satisfied (for proofs see [
47]). Of course, then
rolls
on
.
It follows that each horizontal curve
in
G defines a family of curves
in
, each a solution of
, with
induced by
, that roll on
. The converse is also true: every solution
of the differential system
defines a curve
in
M on which
in
is rolled by the isometry
.
We will regard (
93) as the fundamental object in rolling defined on
, a Lie group with its group operation
Then will denote the Lie algebra of with the Lie bracket .
Let now
. We will view
as a left-invariant distribution on
defined by the left-translates of vector space
in
. The distribution
is called the rolling distribution and its integral curves are called rolling motions. Any rolling motion
is a solution of
and can be associated with the rolling curve
, where
and
.
Since
and
satisfy strong Cartan conditions
and
,
satisfies
and
. Therefore,
Hence the Lie algebra generated by the left-invariant vector fields tangent to
is equal to
, therefore any two points in
can be connected by a rolling motion, and each rolling motion inherits a natural length
from
G. It is then known that any pair of points in
can be connected by an integral curve of
of minimal length because vector fields in
are complete [
50]. The above shows that
with the above metric is a sub-Riemannian manifold. We will refer to the associated sub-Riemannian geodesics as the rolling geodesics.
We will now turn to the Maximum principle to find the necessary conditions that the rolling geodesics must satisfy. To put the matter in the control theoretic context, let
be an orthonormal basis in
so that
becomes an orthonormal basis in
. Then an absolutely continuous curve
is a rolling motion if and only if
for some bounded and measurable control functions
, in which case the length of
is given by
. The rolling problem is an optimal control problem and consists of finding the solutions
on a fixed time interval
that satisfy the given boundary conditions
and
along which the energy of transfer
is minimal. It is known that each rolling geodesic is locally optimal and hence is a solution to the above control problem [
7,
50].
5.1. Rolling Hamiltonians
To emphasize the invariant properties of the problem we will rewrite (
96) as
where each
a left-invariant vector field
,
. If
is an optimal trajectory then, according to the Maximum Principle,
is the projection of an extremal curve
in
along which the cost extended Hamiltonian
is maximal relative to all other control functions. Here
is the Hamiltonian lift of
, i.e.,
.
There are two kinds of extremal curves depending whether
(abnormal case) or
(normal case). In the abnormal case the Maximum principle results in the constraints
and beyond that gives no further information about the optimal control in question. In the normal case, however, the above maximum yields
, where
is a solution curve of a single Hamiltonian vector field corresponding to the Hamiltonian
Each optimal solution is either the projection of an abnormal or a normal extremal curve. If is the projection of a normal extremal curve then is an integral curve of and the control that generates is of the form .
We will not concern ourselves with the abnormal extremals. It is very likely that every optimal trajectory is the projection of a normal extremal curve, as in [
52], in which case the abnormal extremals could be ignored. Instead we will turn to the normal Hamiltonian
H and its Hamiltonian equations
Let us first consider the solutions of the associated Poisson equation
and the structure of the coadjoint orbits.
Since
is a Euclidean vector space, its tangent space at the origin can be identified with
. Then the Lie algebra
will be identified with
, and its dual with
, where
It then follows that every
can be written as
with
and
. Since
is an abelian algebra the projection
on
is constant on each coadjoint orbit of
. The argument is straightforward:
for any
. It follows that the coadjoint orbits in
are of the form
This fact can be also verified directly from Equation (
100): we have
where
and
. Therefore,
from which follows that
Since is arbitrary .
To uncover other constants of motion identify
with
via the natural quadratic forms on each of the factors, and then recast the preceding equations on
. More precisely, identify each
in
with a tangent vector
via the formula
. Similarly, identify
with
via the formula
. Then decompose
into the sum
,
and
. Relative to the basis
in
,
where
. It follows that
and
Since
X and
are arbitrary,
Coupled with
Equation (
101) constitute the Hamiltonian equations on
generated by the Hamiltonian
.
Each extremal curve projects onto a geodesic , and each geodesic further projects onto the pair of curves in M and in that are rolled upon each other by . Note that in this identification of the Lie algebras with their duals, coadjoint orbits are identified with the affine sets .
Recall now the Hamiltonian equations associated with the canonical affine-quadratic problem (Equation (
33)):
The propositions below reveal a remarkable fact that the Poisson equations of a canonical affine-quadratic Hamiltonian are subordinate to the Poisson equations associated with a rolling Hamiltonian. This connection identifies the drift term in the affine-quadratic system with a coadjoint invariant of the rolling Poisson system. To keep the systems apart we will use bold letters when referring to the variables in the rolling Hamiltonian in contrast to the variables in the affine-quadratic Hamiltonian which will remain the same.
Proposition 12. Let be an integral curve of the rolling Hamiltonian , that is, Thenis an integral curve of the affine Hamiltonian , where , and is the solution of with .
Moreover, if a solution of then in is the projection of an extremal curveassociated with the shadow Hamiltonian . The converse also holds according to the following proposition.
Proposition 13. Suppose that is an extremal curve of the affine Hamiltonian . Letwhere is a solution of and let be a solution of Then together withis an extremal curve of the rolling Hamiltonian . However, if , is an extremal curve of the shadow Hamiltonian H, then , solutions oftogether withdefine an extremal curve of the rolling Hamiltonian . The proofs follow by straightforward calculations (also done in [
9]).
Let us now come back to isospectral representations and Zimmerman’s method [
17,
52]. For that purpose let
. Then Poisson’s equations for the rolling problem can be written as
These equations are invariant under a dilational change of variables
. It then follows that
satisfies the equation
Therefore
is the spectral curve for
. But then the Poisson system associated with the affine-quadratic Hamiltonian also admits an isospectral representation after the substitutions
For then
are the extremal curves for the Poisson system associated with the affine-quadratic system (Proposition 13) and further satisfy
To be consistent with my earlier publications, replace
by
to get
where
, and
. Equation (
107) agrees with the isospectral representation (
34).
To get the spectral curve
for the shadow Hamiltonian, use relations
,
and
from Proposition 13. Then
Then a calculation analogous to the one above gives
. After the rescaling
we get a modified Lax pair
5.2. Rolling Problem on Spaces of Constant Curvature
We will now introduce another optimal problem intertwined with the rolling problem. It consists of finding a continuously differentiable curve
in
M in an interval
, with its tangent vector
of unit length and its covariant derivative bounded and measurable in
that satisfies fixed tangential directions
and
along which the integral
minimal among all other curves that satisfy the same boundary conditions. Here
, where
denotes the covariant derivative along
. The integral
is known as the elastic energy of the curve
[
24]. Curves
defined on some interval
are called elastic if for each
there exits an interval
over which the elastic energy of
is minimal relative to the boundary conditions
and
[
7].
On semi-simple manifolds the curvature problem can be lifted to the unit tangent bundle of
G, and it is this lifted version of the problem that will be of interest for this paper. In this formulation of the problem the tangent bundle of
G is realized as the product
with
identified with
. Then each tangent vector
is the projection of a manifold
in
where
and
. The lifted curvature problem consists of finding a curve
in
,
, a solution of
that originates in the manifold
at
and terminates at the manifold
at
for which the energy of transfer
is minimal. If
is the projected curve, then
is the solution of
that satisfies
and the boundary conditions
It is a simple exercise to show that a curve
is elastic if and only if it is the projection of a solution of the lifted curvature problem on a fixed interval
.
The Hamiltonian system for the curvature problem (Equation (
109)) can also be obtained through the Maximum principle properly modified to account for the constraints, as outlined in ([
7], Chapter 11). To go into these details would take us away from the central theme of the paper, so instead, we will just quote the relevant equations from [
7] (pp. 354–355).
The curvature Hamiltonian
H is given by
together with the associated Hamiltonian equations
subject to the transversality condition
. The transversality condition can be incorporated into the above equations to yield an equivalent system
We will now confine our attention to spaces of constant curvature, with a particular interest on the connections between the rolling problems and the elastic curves reported in [
52]. For those reasons let us return to the “spheres”
and their rollings on the isometry groups groups
,
endowed with the quadratic form
The rolling equations associated with the rollings of
on the tangent plane
are given by
In what follows we will make use of the following isospectral integrals of motion associated with the preceding rolling problem extracted from the functions
and
These integrals of motion are rescaled variants of the integrals of motion in [
52] after the metric is replaced by
(the metric in this paper is a scalar multiple of the metric used in [
52]).
Recall that on spaces of constant Riemannian curvature the curvature
k is defined by
for any
V and
X in
that satisfy
and
. In particular
k is equal to
on
. Note that
.
Proposition 14. Rolling geodesics that are the projections of the extremal curves on and project on the elastic curves in . Conversely each elastic curve in is the projection of such an extremal curve.
Proof. Each elastic curve on
is the projection of an extremal curve corresponding to the curvature problem (Equation (
110)). On spaces of constant Riemannian curvature
Therefore, Equation (
110) can be written as
It follows that
, and therefore,
for some constant element
A in
. The transversality condition
can be recast as
. These observations can be incorporated in the preceding equations to get
If we now identify
with
, and
with
, then the preceding equations reduce to the rolling Hamiltonian system. Moreover,
Hence
so the first constraint is satisfied. To verify the second constraint note that
, and therefore
and
. Therefore,
To prove the converse assume that is a rolling extremal curve on . As a geodesic it satisfies , or . We need to show that for some such that .
Then for some vector . It then follows that , and .
Hence, dim dim(. The mapping satisfies because . On spaces of non-zero constant curvature, the kernel of this mapping is zero because implies that . Since and have the same dimension, F maps onto . So every curve is of the form for some perpendicular to .
It remains to show that
belongs to
when the rolling geodesic is on
, that is, when
. Now assume that
,
and
. It follows from above that
, and therefore
But , and therefore . □
The following proposition characterizes elastic curves [
7].
Proposition 15. Let and denote the geodesic curvature and the torsion of the projection curve associated with an extremal curve of the curvature problem. Then is the solution of the following equationand . All other curvatures in the Serret-Frenet frame along are zero. I believe that the proof given below is more to the point than similar proofs given elsewhere [
7,
52].
Proof. We leave it to the reader to verify that
when
. Let
and let
. Then
Since
,
. Therefore,
As to the second part, let
. Since
,
is a unit vector that projects onto the tangent vector
. Then
Therefore
where
is a unit vector in
that projects onto the unit normal
along
. Continuing,
where
Since
is is orthogonal to
and
X, it is in the direction of the binormal vector
. So if we define
and
then
and
projects onto the binormal vector
along
. Hence,
or
Evidently is in the linear span of , hence the Serret-Frenet frame along terminates. □
Corollary 3. Elastic curves in are rolled on the elastic curves in the tangent space .
Proof. Since the geodesic curvature is preserved under rolling, the elastic curves in are rolled on the elastic curves in relative to the Euclidean metric inherited from the metric on . So the statement follows from the rolling definition. □
This remarkable relation between the elastic curves and the rolling geodesics breaks down on spaces of non-constant Riemannian curvature, as it becomes evident when one compares Equation (
110) for the curvature problem to the Equation (
101) for the rolling problem. It is interesting to note that the solutions of either of these two Equations (
101) and (
110) are not known beyond the spaces of constant curvature. While the curvature equation seems particularly challenging beyond the spaces of constant curvature, the rolling geodesic equations remain integrable on all semi-simple spaces and should be “solvable” according to the general theory of integrable systems.
Apart from the above remarks, there is another spectacular property of elastic curves that makes them special: elastic curves appear as soliton solutions in the non-linear Schroedinger equation [
53]. More generally it was shown in [
53] that the space of periodic horizontal curves of fixed length
L in the isometry group
G over a three dimensional space of constant curvature can be given a structure of an infinite dimensional Poisson manifold relative to which some famous equations of mathematical physics appear as Poisson equations associated with geometric invariants of curves on the base space. In particular, Heisenberg’s magnetic equation and Schroedinger’s non-linear equation appear as Poisson equation associated with
where
,
. Since this function can be also expressed as
where
is the geodesic curvature of the projected curve in the underlying symmetric space, elastic curves appear naturally in this setting (see also [
54,
55,
56,
57] for related results). This leap to infinite dimensional Hamiltonians and related hierarchies of commuting Hamiltonians further illustrates the relevance of Lie algebraic methods in the theory of integrable systems.