1. Introduction
The zero moment point (
) and linearized inverted pendulum have continued to play a fundamental role in the control of postural equilibrium of biped robots and lower-limb exoskeletons since their introduction by Vukobratovic [
1]. Vukobratovic showed that the center of pressure (
) of reaction forces under the feet soles on a flat horizontal surface coincides with a point he called the zero moment point (
) and that postural equilibrium can be guaranteed if the
(alias
) is maintained inside the convex hull of the surface encompassing the supporting foot (or feet in double stance). Moreover, a very simple relationship, based on the linearized inverted pendulum, links the
and the center of gravity (
) projection on the ground of the mechanical chain. The goal is to control the
acting on the joint angles as both are algebraically linked by the kinematics of the chain, the target objective being the
.
If motion maintaining postural equilibrium is desired (e.g., during a step in gait), to control the
requires a certain degree of anticipation. Hence, in the so-called preview control framework [
2], transition of
,
velocity, and
were precomputed in advance and applied in open loop as reference to the biped control. However, this approach has not been able to model a closed-loop system and does not face the problems of disturbance rejection and stability.
A solution to track the preview trajectories in closed loop was successively proposed by Choi [
3]. The preview
trajectories during rectilinear gait were reviewed, and a closed-loop strategy was devised and proven, using Lyapunov techniques, to guarantee closed-loop stability and a bounded error tracking of the
and
preview references. The loop was closed from the actual values of
and
, generating a feedback signal to control the
velocity. Then, a speed control for the joints from the
velocity was designed using a
Jacobian with specified embedded motion. The output measures were not detailed, but presumably,
was assessed by measuring the joint angles from the weight distribution on the kinematic chain, and
was measured by pressure sensors under the feet of the biped. The strategy, according to theory, guarantees closed-loop stability. However, when the authors tested it in simulations and practical examples, it showed a lack of robustness to disturbances or poor damping of the closed-loop dynamics. Lyapunov theory guarantees stability but does not say how much the resulting closed-loop poles will be damped.
If real-time measures (let us call them and ) of and are independently available, it is reasonable to assume that before closing the loop, a filtering is performed, fusing both data. However, if these are generated, as stated before, the latter by direct measures and the former indirectly from joint measures and a priori information, they are not always consistent with the relationship stated by the linearized inverted pendulum. The main reasons are: uncertainties in the model parameters (especially in the weight distribution when dealing with an exoskeleton interacting with a patient), external forces acting on the biped (crutches or a chair in a sit-to-stand exercise), centrifugal forces in the frontal plane when motion is not rectilinear.
This study was motivated by the intention to improve postural equilibrium in lower-limb exoskeletons for rehabilitation. The same approach is at the basis of all applications needing balance control of biped robots, such as biped walking in rectilinear [
4,
5] and curved trajectories [
6], in haptic lower-limb exoskeletons [
7], and in performing sit-to-stand exercises [
8], described by the authors in other papers.
1.1. Paper Contributions
The main contribution of the paper is the development of a feedback control more robust than the one offered by Choi. In order to achieve this result, a detailed understanding of the closed-loop dynamics generated by controlling an inverted pendulum is presented, with particular attention devoted to the control design techniques and the engineering problems in closing the loop. Then, in order to make the filtering effective and close the loop from and , ensuring compatibility, the proposed approach operates at two levels: a nonlinear algebraic function and a a linear dynamic model.
The nonlinear function is a simplified mapping from joint angles to
, called statistically equivalent serial chain (SESC) [
9], to be identified in a priori experiments. As this identification is based on the same force sensors under the feet used for measuring the
, it also resolves any calibration mismatch.
The linear model is an extended system based on the inverted pendulum from input u, the reference velocity, and output and , used to estimate, along with the , , and external disturbances affecting the , the model states. Then, using the estimated states, the loop from and can be closed applying robust control theory. The reasons for estimating the unknown external force disturbances are twofold: (1) to take into account real external disturbances, especially in exoskeletons but also in biped robots (e.g., centrifugal forces in turning while walking); (2) robustness in the joint estimation, accommodating modeling errors, parameter uncertainties, and the simplifications introduced by the linearized inverted pendulum.
Still recently, the linearized inverted pendulum has continued to be at the basis of the models for balance control ([
10,
11] and references therein). However, to the authors’ knowledge, there are no works introducing, for robustness, an extended system to estimate disturbances, or the need for a SESC identification.
The proposed control is a non-conventional tracking problem, as two separate model states are tracked. Two different control design techniques are proposed and tested to control the extended system: (1) computing a robust estimator and solving the output feedback problem from the estimated states using a numerical approach based on the Levenberg–Marquardt algorithm [
12,
13]; (2) solving the standard robust regulator, adapted to deal with the preview signal tracking.
In order to test the approaches, three different experiments were performed. First, both observer and state feedbacks were implemented and compared with Choi’s original feedback through simulation of the 2D linearized inverted pendulum. In a standing position, a preview shift of the
from the heels to the tips of the feet was imposed, while in the meantime, an external force disturbance was applied. Then, a real lower-size mechanical mock-up was considered, composed of foot, leg, thigh, and trunk, with three degrees of freedom (DOF) in the sagittal plane to represent the real exoskeleton for implementing the sit-to-stand exercise. The SESC model of this simplified kinematics was identified with a priori experiments and used in the proposed feedback control through the
Jacobian of the chain. Finally, a non-linear simulation of the full-scale exoskeleton with patient was run on the same exercise executed by the linearized inverted pendulum, emulating the first phase of a stand-to-sit exercise. A complete sit-to-stand exercise with the presence of a chair and switching dynamics exploiting the same control technique can be found in [
8].
The paper is organized as follows.
Section 2 reviews Choi’s results.
Section 3 introduces the main contribution of the paper: a
model of the linearized inverted pendulum and an extended system, embedding in the model external disturbances, for applying robust estimation and robust control. This model is also used in the Appendix to show the limitations of Choi’s feedback.
Section 3.1 and
Section 3.2 present the robust estimator–estimate state feedback and the standard robust regulator.
Section 3.3 and
Section 4 contain simulated and real control experiments. In particular,
Section 4.1 approaches the identification of the SESC model, and
Section 4.2 presents the simulation of a stand-to-sit exercise.
Section 5 concludes the paper. The appendix discusses the limitations of Choi’s original method.
2. Choi’s Approach
As in [
3], the 3D linearized inverted pendulum is split into two separate, independent 2D models for the sagittal and the frontal planes. However, in this paper only the sagittal plane will be considered, with axes
x (horizontal) and
z (vertical). The equation linking
and
, adopting Choi’s notation, is
where
p is the coordinate of the
,
c is the projection of the
on the ground,
is the height of the
, and
g is the acceleration of gravity.
is the only parameter of the model of the simplified biped walking system.
Let
,
, and
indicate the desired preview trajectories of the
, of the
, and of its derivative during a postural exercise, and assume that the pendulum joint servo is controlled in speed by an input signal
u according to
where
accounts for the speed-tracking error and process disturbances and
u is given by the following feedback law:
Then, Choi’s results prove, with Lyapunov theory ([
3], Theorem 1), that if
and
, the closed-loop system is bounded disturbance (
) - bounded errors (
) stable.
Anyway, in spite of stability, a feedback implemented using Choi’s equations has a poor damping of the closed-loop dynamics in practical operating conditions. A proof and discussion about this topic can be found in
Appendix A.
3. An Extended System for COG–ZMP Robust Estimation and Control
Choi does not introduce any input–output dynamic model to prove his results, but only a Lyapunov function directly based on Equations (
1)–(
4). Here, vice versa, the essence of the feedback control problem involving measures of
and
with the reference velocity as input is captured by the simple model of the block diagram of
Figure 1: a third-order model with states
, where the jerk of the
(in the following with an excess of notation,
indicates its projection on the ground) is controlled by a reference velocity signal
u in an internal partial speed loop with velocity gain
and output
and
. The third-order model is needed to guarantee a realistic strictly proper system for the design of the state estimator and feedback as position and acceleration are both present in the output, and in the meantime, representing the internal speed loop with the servo dynamics. If the gain of the local speed loop is taken relatively high (
) the
speed will closely track the reference
u, as desired. This model does not take into account external forces acting on the system or internal disturbances, as in the case of a lower-limb exoskeleton with the presence of a patient. External forces are introduced when crutches are used or when the patient is sitting on a chair in a sit-to-stand exercise, or simply to accommodate discrepancies between
and
measures. Internal disturbances are generated by the involuntary motion of the patient in the small freedom offered by the exoskeleton, independent of the joint motion, and obviously, by modeling errors.
Then the model can be completed, generating the extended system of
Figure 2, as defined in robust control theory (for the definition of the extended system and its role in robust control and
theory, see [
14,
15]).
F represents low-frequency external forces influencing the
. In the model
(i.e.,
), the value measured and
, the one linked to the
by the linearized inverted pendulum relationship, are defined separately, where
is the difference between the two, the effect of disturbance
F to be estimated.
and
are measured as before, taking into account measurement noise represented by two high pass filters
. Output objectives are set on the
and on the
for sensitivity requirements with respect to process noise
(in a different context, here
has the same interpretation and scope as in Equation (
3)) and the effect of the unknown external force
F. The weighting functions
and
are chosen to guarantee steady-state gain (i.e., tracking error with respect to disturbances
and
F) and the frequency band of the loop transfer function in the designed feedback. In order to have a balanced design, the control activity
(with a weighting function
) is added as an objective against measurement noises
and
, to set the control activity. This extended system is used to design robust estimators of
,
,
, and
, as well as robust controls.
Let
be the estimates of the
projection on the ground, its derivative, the
, and
. Then the control strategy of Equation (
4) is modified as follows:
with the control scheme represented in
Figure 3. This means that
c (and in the steady state,
p) must track a perturbed reference in order to guarantee that the
, and not the
linked to the
, follows the desired preview signal, despite
and
converging to the same value in the steady state, independently of the presence of disturbances. A feedback from
is also introduced as it has a critical influence on the closed loop damping.
In the next subsections, two different approaches, based on robust control theory, to compute the state observer and the state feedback coefficients, are introduced and tested. The estimates and the coefficients in Equation (
5) result explicitly from standard
techniques by operating a state-space transformation in the extended system of
Figure 2, choosing as states
, augmented (for the whole extended system) by the unobservable or uncontrollable states introduced by the dynamics of the weighting functions.
3.1. Separate Estimator and Feedback
In this first approach, the extended system with the new state representation is used to design a state estimator.
Then, a static output feedback problem for constrained pole placement is solved on the cascade of the extended system and observer to derive the gains
,
, and
in (
5).
It is known that static output feedback has no analytical solution. Hence, a numerical technique based on the Levenberg–Marquardt algorithms was implemented. No algorithm details are presented here. Just note that by minimizing the sum of the squares of a certain number of penalty functions, the closed-loop poles are brought into a stability region with desired damping, constraining to be greater than a lower bound, and the control activity (measured by the norm of the closed-loop operator from measuring noises to control objectives) to be smaller than an upper bound.
3.2. Robust Control
In the second approach, estimation and feedback are jointly computed solving a
robust control regulator. Even if the classical separation of
does not apply in
controls [
14,
15], a weakly coupled state observer and estimated state feedback can still be recognized. Then, maintaining the feedback coefficients of the estimated states of interest (
) and setting to zero the remaining ones relative to unobservable or uncontrollable modes (it can be verified that this has very little influence on the closed-loop poles), the strategy of Equation (
5) can be implemented. It is interesting to note the similarity of the performances obtained between the two approaches, as shown in the next section.
3.3. Simulation Results
The model of an exoskeleton with a patient used in [
5,
7] was considered, with parameter
and choosing a speed gain for the velocity loop of
. In this experiment, the linear model of the block diagram of
Figure 2 was simulated. In an erect posture, with a preview reference computed as suggested by [
2], a step transition on the sagittal plane of
p of
m moving the
c from the heels to the tips of the feet at time
s was imposed. Then at
s, an external horizontal force disturbance acting in the sagittal plane on the
, tries to move the
outside the feet support by an additional
m, causing—if not compensated—a loss of balance.
The experiments are compared using the identical tracking gain
of the
loop and the best (for damping) value of
with Choi’s control and feedback from a robust estimator obtained from the extended system of
Figure 2. The gain parameters adopted in the case of Choi’s feedback were
,
, with a resulting damping ratio of the dominant poles of
; and in the case of observer feedback,
,
,
, with a resulting damping ratio of the dominant poles of
.
The figures represent reference (dashed) and actual (blue), (red), and (cyan), speed estimate (green), and the estimate of the disturbance effect (violet).
Figure 4 shows the results adopting the Choi control. The low damping of the closed-loop poles is clearly visible. Note that, ignoring the disturbances, the
does not follow the reference path and exits from the tip of the feet.
Vice versa, when disturbances are also estimated, after a short interval of time depending on the filtering bandwidth of the estimator, the
returns to the desired value.
Figure 5 and
Figure 6 show the feedback from the extended estimator with compensation of disturbances, obtained with the approaches of
Section 3.1 and
Section 3.2, respectively.
The transition of the force disturbance was chosen to be unrealistically steep to evidence that, because of the estimator bandwidth, the compensation of the disturbance can’t be perfect, depending on the values assigned to the weighting functions in the extended system.
The delay in the estimation of
, as a consequence of the disturbance, is shown in
Figure 7, where
is in blue and its estimate is in green.
4. Control of a 3 DOF Biped
The approach was tested on the real three-DOF small-scale mock-up of an exoskeleton (
Figure 8) and on the simulation of the full-scale exoskeleton. The chains, in both cases, are composed of two joint feet, legs, thighs, and one trunk. The 3 × 3 Jacobian matrix relating
to joints embedding knee motion and trunk attitude is the following:
where
are the angles of the ankle, knee, and hip,
is the Jacobian of
c, and
is the attitude of the trunk. Joints are controlled by velocity servos, with their references being obtained through the inverse of the Jacobian matrix (
6) driven by speed feedback signals. The
speed feedback is similar to the one used for the linearized inverted pendulum (
5), where the measures of the
were obtained from the
and that of the
from the joint-angle measures
. The remaining two feedbacks, from the knee angle and trunk attitude measures
, are simply proportional feedbacks, the last measure being obtained from an inertial sensor:
where
and
are the references chosen according to the desired postural exercise.
4.1. Joint Angles– Map Fitting and Control
In order to apply the results of the previous section to a biped device, it is necessary to have a reliable evaluation of the
from the joint angles, consistent with the
. Its value depends on the position of the center of masses and weights of each link of the chain. Those data are scarcely known in advance but can be identified with a series of a priori experiments. This approach is called statically equivalent serial chain (SESC) modeling [
9] (see also [
16,
17] for applications to rehabilitation).
Espiau et al., in [
9], showed from experiments measuring the projection of the
on a force table that the physical parameters of a kinematic chain cannot be identified uniquely. What can be identified is only a set of expressions of them, representing classes of equivalent (with respect to the
) chains. These expressions appear in linear form in the SESC model. Considering a three-joint kinematic model composed of feet, leg, thigh, and HAT (head, arms, trunk) for motions in the sagittal plane, the parameters of the SESC model can be identified using least squares with two equations and two sets of experiments: collecting and recording a series of joint-angle positions with the corresponding measures of the
in the steady state and a set of samples of joint angles and
trajectories in motion spanning the operating area at random. A slight modification of the model presented in [
16] is proposed here, where the first equation, expressing explicitly
, refers to steady-state experiments, while the second equation, expressing
indirectly (Equations (
1) and (
2) can be rewritten as
), refers to dynamical ones.
The equations are:
with
where
are the length of legs and thighs,
are the masses of feet, legs, thighs, and trunk (HAT), (
),
is the center of mass of the feet,
are the coordinates of the ankle, and
are the distances from the center of mass to the distal joints for the leg, thigh, and proximal joint for HAT. Coefficient
b accounts for the difference (the feet do not move during the dynamical experiments) in sensing the
statically and dynamically. From this model, the six parameters of (
9) are identified recursively with a non-linear least squares technique such as Levenberg–Marquardt, where
is obtained approximately from numerical differentiation of
. The actual small-scale leg was first identified, with results (statical and dynamical) contained in
Figure 9 and
Figure 10.
Then a control exercise was carried out, maintaining the
position fixed and the posture erect while performing an up-and-down motion (such as sit-to-stand) of the body. The results (
and
), based on the identified model and the proposed control scheme, when a disturbing force is applied in the sagittal plane are shown in
Figure 11. The action of the feedback on the
to compensate the disturbance is clear.
4.2. Simulation of a Stand-to-Sit Exercise
In order to validate the results on the linearized inverted pendulum of
Section 3.3, the proposed control (with the same estimator and feedback parameters) was applied to the non-linear simulation of a multi-chain with 3 DOF, having as average the same
. It represents a biped in the sagittal plane emulating the first phase of a stand-to-sit exercise to test the balance control of a future full-scale exoskeleton with a patient. While the pelvis is lowered from a standing posture to reach the chair and the trunk attitude assumes a natural bending forward, the
is shifted from heels to tips and a disturbance force is applied, as in the previous experiment of
Figure 5 and
Figure 6. The animation of the exercise can be seen in
Figure 12. The resulting response of the
in
Figure 13 is very similar to that of the linearized inverted pendulum. Particularly, in the final phase of the exercise, the reaction of the exoskeleton to preserve equilibrium against the push forward of the disturbing force is particularly visible.
Vice versa, Choi’s original feedback with identical gain, applied to the nonlinear simulator, has not been able to guarantee stability.
5. Conclusions
In this paper, Choi’s feedback for postural control of a biped robot, based on a linearized inverted pendulum model, has been revised. In practical situations, this feedback generates very undamped closed-loop dynamics. Then, the design problem was reformulated using state estimation and state feedback control. In fact, closing the loop with a state observer of the and exploiting velocity along with position and acceleration estimates guarantees a greater damping of closed-loop poles, with identical steady-state gain. However, to be effective in fusing and measures, this observer needs to be extended to also estimate external disturbance forces, and the kinematic model of the needs to be tuned to the actual mass distribution. The former problem was tackled by a robust estimator based on an extended system embedding into the model unknown force disturbances, the latter by identifying a priori the SESC model of the mapping between joint angles and the . This a priori identification can also be repeated to maintain the mapping up to date in cases of changes in the weight distribution of the biped.
Two approaches to design the feedback were pursued: one is numerical, computing the state feedback for a given observer with a Levenberg–Marquardt algorithm. The second exploits integrally estimator and feedback obtained from a robust control regulator and adapts it to the tracking of the preview signals. The results show similar performances, with good disturbance compensation. It must be emphasized that the adoption of an extended system with its weighting functions offers a formal technique to set the observer and feedback characteristics, guaranteeing the desired loop gain and bandwidth.
Robustness was shown by applying the control designed for a linearized inverted pendulum to two non-linear systems: a three-DOF kinematic chain of an actual mechanical small-scale leg and the simulation of an exoskeleton constraining a patient to perform a joint-legged stand-to-sit exercise in the sagittal plane. The proposed control correctly integrates the information, which is poorly reconstructed from joint measures and kinematics of the chain, with actual measures, accommodating uncertainties in the model and unknown external force disturbances. Moreover, an identification procedure of the SESC model was proposed and tested.
COG–ZMP and linearized inverted pendulum models continue to be at the basis of balance control of bipeds. However, extended systems and SESC models have not yet been proposed in order to offer robustness to the approach.
The proposed
control was successfully used by the authors for the balance of turning during walking of biped robots and for a more detailed and complete sit-to-stand exercise described in the companion paper [
8]. In particular, future developments will consider haptic exoskeletons, where the action of the patient on some joints, through electromiographical signals, controls the motion of part of the degrees of freedom, while the automatic control discussed here guarantees balance acting on the ankles or on the hips.
Computing robust estimation and robust control, as well as the block diagrams present in the paper, were made with the design environment G++ developed by the authors described in [
18] and that can be downloaded from [
19]. However, the used technique is fairly standard in the robust control field and can be found in classical textbooks such as [
14].