1. Introduction
Stabilizing the orientation or attitude of a rigid body is significantly desirable for controlling the kinematics or dynamics of aircraft, space crafts, satellites, helicopters, unmanned aerial vehicles (UAVs) and submarines [
1]. Before justifying stabilization algorithms, we first need to turn the attitudes into mathematical notations. The minimal representations, like Euler angles or modified Rodriguez parameters, suffer from singularities. Early results were established on quaternions with four parameters, attempting to obtain globally effective control laws by resolving the problems caused by singularities [
1,
2]. However, the quaternion-based system faces ambiguity in representing one attitude with two antipodal points, which may exhibit an unwinding phenomenon if not carefully resolved. The Lie group structure, however, allows for a global and unique representation of rigid-body attitudes in terms of a rotation matrix; see [
3] for more detailed discussions. In this line, many well-known techniques for control and estimation, such as PID control [
4,
5,
6], state observers [
7,
8] and output regulation [
9], are extended to systems whose configuration space involves a Lie group. By specializing the abstract notation of a Lie group into a concrete one
, i.e., a special orthogonal group of three dimensions, it is straightforward to come up with engineering algorithms for practical applications, including stabilization [
6], coordination [
10], synchronization [
11,
12], observers [
7,
13] and tracking [
4,
14] of rigid-body attitudes. Basically, this paper is focused on attitude stabilization on
, as we believe that it is possible to establish the other algorithms based on this essential framework of stabilization without any significant adaption. As pointed out in [
15], achieving global stability by solely using continuous control law is not possible. Therefore, we set our goal as establishing almost global stability based on a continuous strategy of PID control.
There are various methods for controller design, allowing us to achieve the desired attitude stabilization. Roughly, we categorize the control algorithms into model-based and model-free classes by investigating if their computation of control inputs requires prior knowledge of the system model or not. When exact information about physical systems is available, the model-based strategies, such as state observers [
7,
8,
16], sliding model control [
17] and output regulation [
9], are beneficial in obtaining more accurate control by simulating the physical models. Nevertheless, the simulation relies heavily on computation power, and thus has limitations in practical applications with restricted computing resources. As a typical model-free method, PID control requires the least knowledge about the system’s model. Now, more than
of industrial feedback control systems employ PID or PID-type control due to their simple structure and easiness of parameter tuning. The input commands of PID control consist of three terms, i.e., the proportion, time derivative, and time integral of errors. It is convenient to compute those terms for systems defined in Euclidean spaces. For systems evolving in non-Euclidean spaces, however, the classical definition of PID inputs in vector spaces makes no mathematical sense. Geometric extension of PID control to systems defined on nonlinear spaces, such as on Lie groups or, more generally, on Riemannian manifolds, has attracted significant attention in the last two decades. By defining the proportional control as a proportion of the gradient of an error function, PD control was generalized to mechanical systems on Riemannian manifolds [
18]. Following this framework, we further defined the integral action in Lie algebra as the time integral of PD commands and came up with a left-invariant PID controller for a left-invariant system on Lie groups [
6] with body-fixed biases. Parallel work in [
5] is focused on providing an intrinsic PID controller with the integral action defined in a manner of covariant derivative. The effectiveness of geometric PID control proposed in those published works has been verified by examples of attitude stabilization on
. Results in [
19] also extended the classical PID to that for both first-order and second-order systems on smooth manifolds.
For systems evolving on nonlinear spaces, defining integral control requires the transformation of velocities among the tangent spaces of different configuration points. The structure of Lie group provides two canonical ways to transport the velocities in tangent space of an arbitrary point to that of the group identity, i.e., the Lie algebra, resulting in the concepts of left-invariant and right-invariant velocity. In robotic applications, we usually but not rigorously use the left-invariant and right-invariant terms to describe velocities in body and inertial frames, respectively. The work in [
6] is focused on the left-invariant design of PID controllers for systems admitting body-fixed (left-invariant) biases. They also devised a right-invariant design for systems with inertially fixed (right-invariant) disturbance. In order to maintain the right invariance of an integral action, its design expressed in body frames needs to compensate for the variations due to the influence of frame change. This paper is built upon our previous work in [
6]. However, we believe that those compensations are not necessarily required when the integral control is confirmed with the ability to deal with state-dependent biases. To our best knowledge, using a left-invariant design of PID controllers to robustly stabilize the attitudes of a rigid body with inertially fixed biases has not been studied. In addition, tangent and cotangent spaces are identified, and the influence of an inertia matrix is ignored in [
6]. Instead, we view the attitude dynamics of a rigid body as a mechanical system and consider the case with control inputs defined in the dual space of a Lie algebra, i.e.,
.
Although traditional PID control has been generalized to more geometric settings, analyzing the convergence and robustness of the resulting systems needs further studies. The existing results in [
6,
20] only counter constant biases. In practice, however, most biases vary as the system’s states evolve. For instance, inertially fixed bias in the above setting turns out to be a state-dependent bias in the body frames of a rigid body. Although rejecting state-dependent [
21] or time-varying disturbance [
5] has been considered, their analysis does not pay attention to the speed of convergence. The result in [
6,
20,
22] reduces the derivative of the Lyapunov function into a binary quadratic form, and the proof has to be produced by recalling the associated LaSalle’s invariance principle; it is difficult to judge the convergent speed in the third dimension. The work in [
5] has a representation of the ternary quadratic form for the derivative of the Lyapunov function, which allows for decreasing the Lyapunov function in a gradient-descent manner. However, its construction of the Lyapunov function does not include the biased term, and thus, the system is only ensured to converge to a neighborhood of the desired equilibrium point. We attempt to explicitly construct a Lyapunov function whose derivative is a ternary quadratic form and prove the gradient-descent convergence of a left-invariant PID-controlled system (with the controller defined in body frames) to the exact point of equilibrium despite the influence of state-dependent biases (the expression of inertially fixed biases in body frames).
The contributions of this paper are in the following aspects:
A geometric PID controller is defined on for left-invariant dynamical systems evolving on . The time-varying effects in body frames caused by inertially fixed biases are suppressed by integral actions.
A gradient-descent Lyapunov function is established by applying two stages of variable change. A criterion for parameter tuning of the geometric PID controller is justified by ensuring the decline of the suggested Lyapunov function.
AGAS stability of the stable equilibrium point is proved for the resulting close system by decreasing the Lyapunov function without involving LaSalle’s invariance principle.
The contents of the present paper are organized as follows. In
Section 2, we establish the preliminary concepts and provide the necessary mathematical notations. The system model and geometric PID controller are clearly defined in
Section 3. The main efforts of this paper are devoted to the convergence analysis of the resulting close system; see
Section 4 for details. Simulation results of a numerical example are reported in
Section 5, and conclusions are reached in
Section 6.
2. Preliminaries and Mathematical Notations
We denote the set of three-dimensional special orthogonal matrices by
, which is a Lie group as it is a continuous manifold and simultaneously satisfies the group structure. For a matrix group, the group operation, group identity, and inverse of a group element are just matrix multiplication, the identity matrix
and the inverse of a matrix. Associated with each
Q is a tangent space
. The special tangent space at the group identity
, i.e., a set of all skew-symmetric matrices, is indeed a Lie algebra
with Lie bracket operation defined as
. The matrix
, in fact, only has three independent variables, and we are inspired by this fact to represent the velocity with a three-dimensional vector. The Lie algebra is then defined as
with the associated Lie bracket justified as a cross product in three-dimensional space. The transformations between vector notation
and matrix notation
follow
There are two canonical ways allowing for translation of the velocity into that in . Left multiplication by results in the definition of left-invariant velocity , while right multiplication by gives the right-invariant velocity . The terms and are not equal in most cases, although they belong to the same space . In practice, usually but not rigorously, we use and to model velocities in body frames and in the inertial frame, respectively. The transformations in matrix notation, or in vector notation, follow the adjoint representation , which indeed is an association of left multiplication by Q and right multiplication by . We denote by the dual space of and define the dual of as such that the inner product is preserved under adjoint transformations, i.e., .
In order to characterize the distance to the element of group identity, an error function is defined as a scalar function
with
Its gradient
is then identified by computing the time derivative of the error function
. The resulting explicit expression for the gradient writes in terms of the skew-symmetric part of
Q:
By letting
, we obtain four critical points, i.e., one minimum point
with
and three maximum points
with
. The minimum point is a stable equilibrium, while the other three are unstable equilibrium points corresponding to the attitudes with a
rotation around three principle axes.
In what follows, we use the abbreviated notation , instead of , to describe left-invariant vectors, while the notations and are only used when emphasizing their difference is necessary. The norm of a matrix A is defined as and the norm of vector v follows . The operation “·” in and represents algebraic multiplications of matrices or vectors. When the dimension of is clear from the context, we always simplify the notation of into k.
4. Convergence Analysis
Before coming up with the construction of a Lyapunov function, we first perform two steps of variable change for the resulting close system. In the first stage, we show that there exists a negative proportion of in , which ensures the gradient decline of the error function. The second stage of variable change allows us to find a suitable metric function whose exponential decrease proves that and approach the opposite of and , respectively, in a gradient-descent manner.
4.1. First-Stage Variable Change
It is not convenient to analyze the stability of a geometric PID-controlled system in the coordinates
. We thus, at this first stage, take a new coordinate system
with
The inverse of this coordinate change allows us to express
,
and
in terms of
X and
Y:
with
,
. Performing this first-stage variable change leads to a transformation of the original system into that expressed in the renewed coordinates:
where
H is a Hessian matrix satisfying
and
represents the gradient of
with regard to the attitude such that
. It is easy to check that
is upper-bounded by
with a positive
such that
; see [
4,
20].
By defining vectors
and
, we obtain a representation of the resulting close system in a more compact form:
with matrix
A and vectors
,
:
The process of first-stage variable change and the property of resulting representation can be summarized as follows.
Proposition 1. Suppose that the moments of inertia and the Hessian matrix of a rigid-body system are upper-bounded by and , and the gradient of bias has an upper bound . Then, the closed system (10)–(12) has a representation of the form (15) and (16) under the coordinate change (13) and (14). The norms of vector and are upper-bounded by and with , and . Proof of Proposition 1. It is straightforward to obtain the renewed representation of the system by following the above computation. The remaining task is to identify the upper bounds for
and
. From their expressions in (
18), we know that
and obtain the result in (
19) for the upper bound of
with
. The vector
can be considered as a combination of two terms
and
. By the assumptions of
,
and
, we then have
which allows us to reach the conclusion of (20) for the upper bound of
with
and
justified accordingly. □
After applying the first-stage variable change, we have established a result with having a term , by which gradient-descent decrease in is possible. However, is not always negative for arbitrary nonzero vectors. Therefore, we need to further perform a second-stage variable change, which actually is a similar transformation and , such that the term is definitely negative for all vectors .
4.2. Second-Stage Variable Change
The second-stage variable change is mainly about the diagonalization of matrix
A. By letting
, we turn the matrix
A in (
17) into
This matrix has two eigenvalues:
with
and
having explicit expressions
which indeed are two solutions of the equation
We then define the second-stage variable change as
or in matrix form, as
for
. The matrix
S is then justified as
By defining
and computing its derivative, we further obtain the transformation of systems (
15) and (16) into that expressed in terms of
,
,
and
:
with the matrix
and vectors
,
:
The process of second-stage variable change and the property of the resulting representation can be summarized as next proposition.
Proposition 2. Suppose that the Proposition 1 is satisfied and . Then, there exists a coordinate change (22) and (23), such that the closed system (15) and (16) has a representation of the form (24) and (25). The quadratic form is then upper-bounded by :with . The norm of vectors and is upper-bounded by and with , and . Proof of Proposition 2. It is straightforward to obtain the renewed representation of the system by following the above computation. The remaining task is to identify the upper bounds of
,
and
. From (
26), we know that
by which
is identified accordingly. From the solutions for
in (
21), we know the fact that
and
. Following the definition of matrix norm and applying the Gershgorin circle theorem allows us to estimate the upper bound of
and
,
From (
19) and (
27), we know that
The term
in (
27) can be considered as a combination of two terms
and
. With the results in (20) for
and
, we then have
The above results allow us to reach the conclusion by (
29) and (30) for the upper bound of
and
with
and
justified accordingly. □
4.3. Almost Global Asymptotic Stability
Now, we are ready to present the main result of this paper.
Theorem 1. Suppose that the moments of inertia and the Hessian matrix of a rigid-body system are bounded by and . Assume that the bias and its gradient are upper-bounded with and . If the control parameters are identified as taking and sufficiently large , with , then the stable equilibrium point of system (10)–(12) is almost globally asymptotically stable (AGAS). Starting from the initial point with , and by decreasing the Lyapunov functionthe system’s state is stabilized to the desired equilibrium , while the bias is rejected by integral action . Sufficiently large control parameters allow for the extension of attraction region almost globally to . Proof of Theorem 1. Submitting (
24) and (25) into the time derivative of the Lyapunov function (
31) results into
Exploring the results (
28)–(
30) established in Proposition 2, we obtain
By defining
, we further have
with the matrix
By the Gershgorin circle theorem,
requires
We replace
with explicit expressions and rewrite the conditions as
with
. The resulting requirements are possible to be fulfilled by choosing the parameters for controller design appropriately and taking right value of the weight in Lyapunov function. Firstly, we take a large
such that
. Once the value of
is justified, the condition (
32) can be satisfied by enlarging
. In next, for fixed
and
, we have the chance to reach the condition (33) by taking a sufficiently large value for
, as it makes
and
sufficiently small.
In order to prove almost global stability, we need to ensure that Q does not reach the unstable equilibria, i.e., is always satisfied for all t. Let and ; then, we need to prove that and implies .
Firstly, we show that
is upper-bounded by
. We know the fact that
at the stable equilibrium point
, and the values of
and
for arbitrary
Q can be represented as the path integral along a curve parameterized by
, starting from an initial point
I at
and ending at the point
Q at
. Without loss of generality, we take the special curve whose tangent vector equals
as the candidate path for computing integration. Therefore, we have
As the norm of the Hessian matrix is upper-bounded by
, we in further obtain
Next, we prove that
is upper-bounded. As
for all
, we thus obtain the estimate of the upper bound for
:
with
We further obtain the value of the upper bound for
as follows.
Finally, we prove that the value of
does not reach its maximum. The nominal part of the system’s dynamics
actually represents a damped oscillator whose frequency is proportional to
. Enlarging
to infinity allows us to reduce the period
T of this oscillator to zero. Therefore, we have the chance to restrict the value of
over a period
sufficiently close to that of
, i.e.,
This means that, given an arbitrary value of
, we can figure out a real value of
a satisfying
by making
sufficiently large, such that
implies
for all
. Over a period, the action by
in
is averaged out, and the action by
decreases the value of
, i.e.,
. Thus, the condition
remains valid for all
, and the attitudes are ensured to be prevented from reaching the unstable equilibrium points.
Now we complete the proof by reaching a conclusion on almost global asymptotic stability of the desired equilibrium point , i.e., sufficiently enlarging control parameters allows us to extend the attraction region almost globally to . □
For a rigid body system with a right-invariant bias
upper-bounded by
F, we have
and
by following the explicit expressions in (
4) and (
5). Thus, the condition
is satisfied by taking
. With
and
, and by the results in the above theorem, we can claim that left-invariant PID controller almost globally stabilizes attitudes of a rigid body with right-invariant biases.
5. Simulations
The effectiveness of the proposed geometric PID controller and our framework of stability analysis is verified by the following numerical simulations. The rigid body to be controlled allows for rotation around three perpendicular axes with different values for moments of inertia, i.e., . The rigid body is assumed to admit an inertially fixed bias .
In order to satisfy the conditions required in the stability analysis, the parameters of the controller are carefully identified as
,
and
such that
and
. The weight coefficient in the Lyapunov function is taken as
with
. We let the controlled system start to move with zero velocity and without integral action at initial time and from an arbitrarily specified value of initial attitude, which is an association of
clockwise rotation around
z axis after
clockwise rotation around
x axis.
Figure 1 shows the evolution of proportional (dashed on the figures) and derivative (solid on the figures) parts of input commands. After a very short initial transient, the derivative input
does converge to the opposite of proportional input
; see
Figure 2. This behavior implies that
is forced to approach
rapidly, which confirms the ability of derivative control in tracking the gradient of error function. The benefit of performing PD control is then reasonably straightforward: by letting
, the error function declines in a gradient-descent manner, i.e.,
. Unfortunately, this is also a limitation for this design of PID controller. For a fixed
, enlarging
will accelerate the tracking speed of
towards
, but at the same time will decrease the value of
and thus will slow down the convergence speed of error function. Balancing the value of
and
is worth further studies.
Figure 3 shows the evolution of integral action (solid on the figures) and bias expressed in body frames (dashed on the figures). Similar to that in
Figure 1, the integral action
rapidly converges to the opposite of biased term
after a short initial transient (see
Figure 4), which illustrates the advantages of integral action in dynamically countering state-dependent biases, rather than just compensating for the effect of steady-state errors.
Figure 5 explains the evolution of Lyapunov function. As the Lyapunov function converges to zero, the values of
and
(or equivalently
) are reduced to zero. Furthermore, this fact allows us to conclude that
is robustly stabilized to the exact value of targeted equilibrium point
while the influence of biases
is completely attenuated by the integral action
.