1. Introduction
In recent years, with the shortage of land resources, human exploration of the ocean has intensified. In order to exploit and utilize marine resources more efficiently, many bio-inspired underwater robots have been developed in countries around the world. For example, Wang et al. developed a fish-shaped robot called Robcutt II [
1]. Alessandro Crespi put forward a Salamandra Robotica II amphibious robot that is able to swim and walk [
2]. A soft robotic fish was designed by Katzschmann [
3]. Daniele Costa et al. designed a bionic autonomous underwater robot based on the fluid shape of fish bodies [
4]. Our team has researched a series of amphibious spherical robots [
5,
6].
Generally, research concerning bionic underwater robots emphasizes control in terms of not only external appearance but also internal motion; control is a basic necessity for completing tasks. Generally, basic motion control of underwater vehicles requires tracking a predetermined underwater trajectory. At present, a large number of studies in the literature focus on the design of trajectory tracking controllers for underwater robots.
With respect to model uncertainty, which is a common problem, a controller independent of the model is the best choice; examples include the proportional integral derivative (PID) controller [
7,
8], the sliding-mode controller (SMC) [
9,
10], the neural network controller, and so on [
11,
12]. PID controllers perform well in tracking trajectory but lack the ability to reject disturbances. Sliding-mode controllers offer an algorithm for trajectory tracking that is very popular due not only to its model independence but also its good anti-interference performance. Elmokadem et al. proposed a terminal sliding-mode controller for AUVs that considered system uncertainty and environmental disturbances [
13]. The simulation results showed that the controller was robust under bounded disturbances. Cui adopted the Lyapunov method to design an integral sliding-mode controller that successfully solved the problems involved in controlling robots with unknown velocity, unknown disturbances, and unknown hydrodynamic model parameters [
10]. Although the sliding-mode controller is not sensitive to model parameters, its control quantity exhibits a high-frequency chattering phenomenon, which is not conducive to the execution of actuators. The neural network algorithm is another commonly utilized algorithm in underwater trajectory tracking. In [
11], the authors proposed an efficient neural network approach with a single-layer structure for tracking control. Miao et al. designed a novel adaptive neural network tracking controller by combining dynamic surface control (DSC) and a minimal learning parameter (MLP) [
12]. In addition, a radial basis function neural network was employed to account for tracking errors. Neural-network-based tracking controllers are insensitive to the dynamic model of the robot and possess strong adaptability and learning capabilities. However, due to the changeable nature of the underwater environment, the training and learning processes are difficult, which leads to poor real-time performance of the tracking algorithm.
If the kinematic model and hydrodynamic model of an underwater vehicle can be acquired, we can design a more suitable controller according to the approximate model parameters to achieve better control effectiveness.
Some algorithms based on models have been researched for trajectory tracking, such as model predictive control (MPC), backstepping control (BSC), and so on. Zhang et al. applied MPC to achieve three-dimensional trajectory tracking of an autonomous underwater vehicle (AUV). In [
14], the conventional MPC algorithm was improved upon, and an event-trigger-based MPC algorithm was applied to the straight-line trajectory tracking task of an underactuated underwater vehicle (UUV). Shen et al. developed a Lyapunov-based model predictive control (LMPC) [
15]. They considered practical constraints, for example, the actuator saturation. The algorithm was simulated and tested on the Saab SeaEye Falcon model AUV and achieved good results. They also proposed a nonlinear model predictive control method that realized a different type of underwater trajectory tracking for the AUV [
16]. However, in order to achieve a suitable control quantity, the MPC method needs to predict many steps in advance, which results in the dimension of the weight matrix being very large. Therefore, the computational efficiency of the MPC is reduced, and real-time performance is somewhat poor.
Lyapunov-based backstepping controls represent the mainstream method for AUV tracking controls because their control law exploits ’good’ nonlinearities in the system dynamics. Xu et al. adopted the backstepping method to determine virtual control quantity, then designed a controller that allowed a weight-bearing autonomous underwater vehicle (WAUV) to track its underwater trajectory [
17]. In order to solve the differential explosion problem caused by using the backstepping method for the 3D path tracking of an underactuated underwater vehicle, Zhou et al. utilized the derivative output value of a biologically inspired model to replace the repeated calculation of multiple derivatives, which greatly reduced the complexity of the controller [
18]. In [
19], the backstepping method and the disturbance state observer were combined to design a controller for trajectory tracking. Although backstepping-method-based controllers can be designed quickly and efficiently, they are not suitable for controlling vehicles with large dimensions; this is especially true for backstepping controllers with more than three levels. High-order derivative terms can amplify high-frequency micro-noise, which is not ideal for the controller.
From this analysis, it can be seen that each control algorithm possesses both advantages and disadvantages. In addition, the research focus of this paper, namely the turtle-inspired spherical robot (TASR), possesses characteristics such as unmeasured velocity, uncertain model parameters, and a nonlinear system; these factors present challenges for the design of a trajectory-tracking controller. Therefore, the design of a combined controller based on multiple algorithms is needed for the underwater trajectory tracking of TASRs. In this paper, the Lyapunov-based backstepping algorithm is adopted as the main controller, making full use of its good performance with respect to nonlinear systems. TDs and LESO are added to the main controller as auxiliary algorithms. ESOs are popular due to their ability to estimate uncertain information, including disturbances and state information. Xie et al. designed a reduced-order LESO to estimate disturbances that needs velocity information as input [
20]. In [
21], a nonlinear ESO was constructed to estimate the unmeasured velocity and disturbances. Inspired by [
21], an LESO is put forward to avoid inaccurate estimates inherent in the use of nonlinear ESOs.
The primary contributions of this paper are summarized in three parts:
(1) A backstepping-based double closed-loop controller is designed. Beginning with an outer-loop backstepping controller, a virtual feedback variable is introduced to simplify the process.
(2) TDs are shown to be capable of tracking the differential signal of the desired velocity provided by the outer-loop controller, which solves the differential explosion problem.
(3) An LESO is designed to estimate the velocity states and lumped disturbances; it only requires position and orientation information. In addition, we introduce a rotating inertial coordinate to linearize the design of an ESO in terms of surge direction.
The rest of this paper is organized as follows. The modeling of a turtle-inspired amphibious spherical robot is introduced in
Section 2. In
Section 3, a double closed-loop backstepping controller with TDs and an LESO is designed, and the stability proof is presented. The results of the simulation are discussed in
Section 4. Finally, the conclusions and directions for future work are summarized in
Section 5.
3. Design of the Double Closed-Loop Backstepping Controller with an LESO and TDs
In order to achieve trajectory tracking for the TASR, a double closed-loop backstepping control with an LESO and TDs algorithm is proposed, as shown in
Figure 7. In addition, the Lyapunov stability proof is provided. An outer-loop kinematic-model-based backstepping controller is designed to export the desired intermediate control variables according to the reference trajectory. An inner-loop backstepping controller is based on the dynamic model of the TASR. Furthermore, an LESO is added into the inner-loop controller to solve two primary problems. First, the design of the LESO is utilized to estimate the model uncertainty and the environmental disturbances. Second, the unmeasured velocity information is provided by an LESO estimation. Furthermore, to avoid the differential explosion of the inner-loop backstepping controller, TDs are introduced to track the derivative of the desired intermediate control variables from the outer-loop controller.
3.1. Outer-Loop Backstepping-Based Controller
For the sake of convenience, the dynamics mode of the TASR in Equation (
13) is unfolded as the following:
The reference pose of the robot is written as
. The derivative of the reference trajectory is denoted as
. Note that
and
represent velocity in the inertial frame. Further, the pose error in the TASR body coordinate system is defined as
. The subscript
represents the error in the frame
.
Figure 8 depicts the pose error in the horizontal plane. The pose error in the inertial frame can be transformed into that of the body frame by Equation (
15).
where
is the matrix of rotation transformation, and
can be regarded as the error between the target trajectory and the robot. If
, the TASR is considered as having tracked the reference trajectory.
After deriving the two sides of Equation (
15), the following state equation of pose error is obtained:
It can be seen from Equation (
16) that
,
, and
are coupled.
is regulated by forward speed
u in the robot coordinate system.
is regulated by heading angular velocity
r. In addition,
is regulated by
and
together. So, we can adjust
and
to make
,
, and
.
According to the above analysis, a virtual feedback variable is constructed:
where
is a positive constant and
is a common nonlinear function. A specific expression of
is expressed as follows:
It can be found that
, if and only if
or
,
. The derivative of
is:
The virtual feedback variables corresponding to Equation (
17) are analyzed below. The first-order derivative of
, with respect to intermediate control variable
r, is continuous and bounded. If the external control makes
, then
. According to
in Equation (
16), if
, then
. Next, let us take part of the Lyapunov function
. It can be seen that
is positive-definite.The derivative of the function over time
t is
. The derivative of
is positive-definite, and
. This shows that
is an indirectly controlled quantity and is dominated by
and
once again.
The candidate function of Lyapunov is chosen:
where
is a positive constant. It is obvious that Equation (
20) consists of four parts, and each of them is positive-definite. When
,
,
, and
are equal to 0, then
. The output angle range of IMU is
,
, the yaw angle range of the reference trajectory is
,
, and the angle error range is
,
. The derivative of Equation (
20) is written as follows:
By introducing Equations (
16) and (
17) into Equation (
21), we obtain:
If we assign
into Equation (
22), then we obtain
We assume that when
,
,
,
,
,
,
, and
of the reference trajectory are all bounded, and
,
, and
do not converge to 0 at the same time. Then, the controller law of the outer-loop backstepping controller is presented as follows:
where
,
, and
represent the desired values of the intermediate control quantities
u,
r, and
w, respectively.
When the three intermediate control variables reach their corresponding expected values, Equation (
25) can be obtained.
where
and
are positive scalar. If and only if
,
,
, and
are 0,
, then
is negative-definite.
appears in
, and
is as follows:
where
and
appear in
and are computed according to Equations (
27) and (
28):
The stability analysis of the outer-loop backstepping controller is carried out as follows. Due to the fact that
,
,
,
,
, and
of the reference trajectory are all bounded, and the fact that every part of Equation (
25) is negative-definite, then
,
,
, and
all converge to 0. Further, according to
,
. Due to
,
. Because preconditions
and
do not converge to 0 at the same time,
r does not converge to 0 in the process of
, so
leads to
. According to
,
. To sum up,
, and the TASR accomplishes the trajectory tracking task.
3.2. The Inner Loop of the Backstepping Controller
According to the previous section, the main problem is making the TASR track the desired trajectory by adjusting the intermediate control variables u, , and r to their corresponding desired values. However, these intermediate control quantities cannot be set directly. The TASR system is a typical nonlinear second-order system model. Generally speaking, the actual control quantities , , and should be adjusted to make , , and , and then the robot can track the reference trajectory.
Errors between the expected speed and the actual speed of the robot are constructed in Equation (
29).
In order to make the error converge to zero, a candidate function of Lyapunov is chosen as follows:
If
is positive-definite and the last three terms are all positive-definite, then
is positive-definite. By computing the derivation of Equation (
30), the following equation is obtained:
Equation (
29) is written as:
Substituting Equations (
23), (24a)–(24c), and (
32) into (
31), the following Equation is obtained:
Assuming that
,
,
,
,
,
,
,
,
,
, and
of the reference trajectory are all continuous and bounded. So, we combine the dynamic model given by Equation (
14) and obtain the following control law:
For the stability analysis, the control law provided by Equations (34a)–(34c) is introduced into the derivative of the Lyapunov candidate function, i.e., Equation (
33), and the following equation is obtained:
The first four terms constitute . According to the previous section, is negative-definite. If , , and remain positive, then the last three terms are negative-definite. To sum up, is negative-definite, which means that , , and . In other words, when , then ,, , and finally , , , , and the TASR completes the task of tracking the reference trajectory.
3.3. Design of an Inner-Loop Backstepping Controller with TDs
It can be found that the control law described in Equations (34a)–(34c) contains differentiation terms
and
. As outputs of the outer-loop controller,
and
are described in Equations (24a)–(24c), which also contain differentiation terms. If the differentiation of Equations (24a)–(24c) is taken directly as with the traditional backstepping technique, the problem of “explosion of terms” may result. In order to avoid this problem, TDs referring to the ADRC methods described in the literature [
23] are adopted to track the differentiation of
and
. The TDs are designed as follows:
where
is the tracking signal of
and
is the tracking signal of the differentiation of
. The same goes for
and
. The definition of the function fhan refers to the literature [
23]. The discrete forms of the above equations are presented as Equations (
38) and (
39):
h represents the sampling time. For consistency of symbols,
is marked as
and
is marked as
. Using the
produced by the TDs, the controller can be written as:
3.4. Design of an LESO for an Inner-Loop Backstepping Controller
We must consider that the TASR is affected by ocean currents, propeller thrust fluctuations, and other factors when it is performing operations in a lake or sea. In addition, different speeds make the hydrodynamic parameters
,
, and
,
change to some extent (we assume that
and
change only slightly). These can be considered lumped disturbances. The dynamic equation in the actual environment with disturbances can be written as follows:
where,
,
, and
are the total disturbances in
u,
r, and
w directions from the ocean current, propeller thrust fluctuation, and so on.
However, the velocity of the TASR is unmeasured directly. It is necessary to combine the kinematic and dynamic models together to form a second-order system to design an extended state observer.
A typical nonlinear second-order system with disturbances is expressed as follows:
where
is the lumped disturbance, including a nonlinear uncertain term and an external environment disturbance,
b is the model parameter, and
is the controller input. Taking
,
,
, the state-space equation is expressed as follows:
where
h is the derivative of
and
is bounded. Referring to [
24], the following observer can be constructed:
where
, and
and
are the estimates of
and
.
and
are gain parameters. According to the literature [
25], gain parameters are adjusted based on a bandwidth
. Generally,
and
.
According to the kinematic model in Equation (
6) and the dynamic model with disturbances in Equation (
41), it can be seen that the models for yaw and heave are easily transformed into the typical nonlinear second-order system expressed in Equation (
42). Hence, for the heave and yaw directions, the estimates of lumped disturbances are presented as
and
, and the estimates of velocity are marked as
and
, respectively.
In contrast, for the surge direction, an expression as in Equation (
42) cannot be directly employed. To solve this problem, a rotation inertia coordinate, as shown in
Figure 9, is established.
In the frame
, the kinematic and dynamic models of the TASR with respect to surge direction are expressed in Equation (
45). Note that Equation (
45) is easily transformed into the typical nonlinear system expressed in Equation (
42). According to Equation (
44), the lumped disturbance and velocity for surge direction can be obtained and marked as
and
, respectively.
Following from the above analysis, the control law of the double closed-loop backstepping controller with TDs and an LESO is concluded as follows:
4. Simulation Results
To verify the feasibility of the proposed control scheme, which we refer to as TDs-LESO-BSC below, we conduct some simulation analyses using the professional software MATLAB2015a on a system running Windows 7, equipped with an Intel core i7 CPU at 3.6 GHz and 8 GB of RAM. For the simulation, some parameters of the proposed controller need to be set. The model parameters are acquired by online recursive least-squares-based model identification [
22]:
and
. The gains in Equation (46a)–(46c) are selected as follows:
and
. The parameters of the TDs in Equations (
38) and (
39) are set as follows:
and
. The bandwidth of the LESO in Equation (
44) for the directions of surge, yaw, and heave are set as
and 30, respectively. Meanwhile, the actuated forces in the surge and heave directions are bounded between −4 N and 4 N, and the actuated torque in the yaw direction is bounded between −0.5 Nm and 0.5 Nm. The sampling time is 0.02 s.
4.1. 3D Trajectory Tracking Performance
To verify the 3D trajectory tracking performance, the reference 3D trajectory with respect to time t is described by
whose shape is a standard spiral line with radius of 2 m. The original state of the TASR is set as
and
. In addition, the following disturbances in surge, yaw, and heave degrees are introduced to the TASR. Note that gains in the sinusoidal signal are related to different values of the positive constant
. Because the max torque of the TASR with respect to the yaw is
, the disturbance introduced for the yaw is a little smaller than that for the other two directions.
With
, the curves of the tracking trajectory are shown in
Figure 10. The proposed tracking controller successfully drives the TASR toward the desired trajectory, which verifies the stability and feasibility of the control method. The tracking errors of the TASR states are plotted in
Figure 11. All errors converge almost to zero. The required control forces are shown in
Figure 12. The control inputs in the surge and heave directions converge to a sinusoidal signal, which demonstrates the anti-interference ability of the TDs-LESO-BSC. On account of the fact that the disturbance in yaw is slight, the control input converges to a steady value.
Furthermore, the estimated performance of the LESO is evaluated in
Figure 13. It can be seen that the estimates of velocity and lumped disturbances are consistent with the actual values after 10 s. Accurately observed lumped disturbances improve the anti-interference properties of the controller. Further, accurate estimates of velocity are able to solve the problem that miniature bio-underwater robots cannot perceive their velocity.
For a more intuitive perspective on the tracking accuracy of the TDs, the contrasts between the actual and estimated values are described in
Figure 14. Derivations of the surge and yaw velocities can be tracked accurately. Moreover, the derivation tracking performance is evaluated by the RMS errors, where the RMS derivation tracking errors for
and
are 0.0015 m/s
2 and 0.00001 rad/s
2, respectively. Accurate tracking of the derivation for surge and yaw velocities improves the trajectory tracking effect. In the simulation, the running time of the controller is recorded. The running times with TDs and without TDs are 0.701 s and 0.924 s, respectively. The design that includes TDs improves the efficiency of the proposed controller.
4.2. Robust Test
In order to investigate the robustness of the TDs-LESO-BSC further, we simulate TASR tracking control under different conditions with respect to lumped disturbances, with values of
varying in
. The RMS tracking errors under different lumped disturbance conditions are computed in
Table 2. It can be seen that tracking quality is insensible to enhanced disturbances. The simulation results demonstrate the robustness of the TDs-LESO-BSC. However, the insensitivity of the tracking quality only holds for limited interference. Once the control input bounds restrict the disturbance-resisting capabilities of the TASR, the tracking quality worsens, even though the disturbances can be estimated accurately.
4.3. Comparative Simulations
To reveal the superiority of the proposed control scheme, a simulation comparing it to a traditional backstepping controller (BSC) is conducted in this paper. The parameters of the BSC are consistent with the proposed controller. With
, the curves of the tracking trajectory are shown in
Figure 15. It is found that the TASR driven by the TDs-LESO-BSC controller is able to track the desired trajectory accurately. However, on account of a serious disturbance, the trajectory based on BSC fails to converge to the desired trajectory. The tracking error of the TASR can be seen in
Figure 16. The position and yaw angle errors converge to 0 when the TASR is driven by the TDs-LESO-BSC controller. However, the errors seriously fluctuate during the adjustment of the BSC controller. Further, to analyze the tracking error results quantitatively, the root mean square (RMS) error is employed to evaluate tracking accuracy. As can be seen from
Table 3, the RMS errors of the TDs-LESO-BSC are distinctly smaller than those of the BSC, which highlights the superior 3D tracking performance of TDs-LESO-BSC.
In order to further verify the superiority of the TDs-LESO-BSC, another comparison is investigated. In this case, the desired trajectory for a raster scan is described by Equation (
49). The initial state of the robot is
. The parameters for the controller are the same as in the above simulation scenarios. Disturbances as defined in Equation (
48) with
are exerted. The curves of the tracking trajectory are depicted in
Figure 17. It is obvious that the TDs-LESO-BSC controller is able to resist the serious disturbances and drive the TASR to track the desired trajectory. In contrast, the trajectory based on the traditional BSC exhibits significant oscillation.
Figure 18 shows the contrast of the state errors. The state errors based on TDs-LESO-BSC mostly converge to zero. The position errors based on the BSC are larger, especially in the X-axial and Z-axial directions. For a more intuitive observation, the RMS values for the state errors are listed in
Table 4. It can be seen that the RMS errors in the X, Y, and Z axial directions are smaller for TDs-ESO-BSC than for the BSC. The above analysis verifies the anti-interference ability and accurate trajectory tracking ability of the proposed controller.
4.4. Discussion
On the basis of the above comprehensive simulation analysis, the proposed double closed-loop backstepping controller with TDs and LESO is demonstrated to be a feasible and robust trajectory tracking control method for the TASR. In addition, a comparison of simulation times verifies that the TDs not only avoid the “explosion of term” problem but also improve computing efficiency. Further, the robust test and comparative simulations indicate that the proposed control scheme, enhanced by the LESO, has a satisfactory capability to resist disturbances and model uncertainties, thus ensuring accurate trajectory tracking performance. However, the algorithm cannot reject all environmental disturbances due to the limited thrust capabilities of the robot’s systems.
In addition to this, there are notable advantages to the proposed control scheme. The proposed LESO guarantees that the controller is able to achieve trajectory tracking by relying only on position and orientation information in an unmeasured velocity situation. This scheme provides an alternative method of trajectory tracking for miniature bio-inspired underwater robots that cannot implement a DVL due to limited space.
5. Conclusions and Future Work
In this paper, we implemented a TASR control scheme, consisting of a backstepping controller, TDs, and an LESO, to handle 3D trajectory tracking tasks while considering the challenges presented by a nonlinear system, unmeasured velocity, and model parameter uncertainties; we also considered ambient interference. The double closed-loop backstepping method is known for its ability to process nonlinear systems. However, the defects of the backstepping method are obvious. For example, with an increase in system order, there are too many unknown parameters and the problem of differential explosion. Therefore, we introduced a virtual feedback variable into the outer-loop backstepping control that decreased the confined adjusted parameters. Furthermore, to avoid the problem of “explosion of terms”, TDs were designed to track the derivatives of desired velocities. For the problems of unmeasured velocity, model parameter uncertainties, and ambient interference, an LESO was designed to reject lumped disturbances and control the TASR by relying only on position and orientation information. Theoretical analysis and simulation results verified the feasibility of the proposed control scheme.
In the future, this control scheme will be implemented in a TASR to verify the robustness of the controller in a real environment.