Hierarchical Sliding Mode Control Combined with Nonlinear Disturbance Observer for Wheeled Inverted Pendulum Robot Trajectory Tracking

Hou, Ming; Zhang, Xuedong; Chen, Du; Xu, Zheng

doi:10.3390/app13074350


by Ming Hou *, Xuedong Zhang, Du Chen and Zheng Xu

School of Automation, Beijing Information Science & Technology University, Beijing 100192, China

* Author to whom correspondence should be addressed.

Appl. Sci. 2023, 13(7), 4350; https://doi.org/10.3390/app13074350

Submission received: 28 February 2023 / Revised: 27 March 2023 / Accepted: 27 March 2023 / Published: 29 March 2023

(This article belongs to the Special Issue Advances in Robot Path Planning, Volume II)


Featured Application

The research presented herein could be used mostly in warehouse logistics transport activities in smart manufacturing.

Abstract

This study presents an optimized model for the trajectory tracking control of a wheeled inverted pendulum robot (WIPR) system. It addresses the poor trajectory tracking performance that arises in the presence of unknown disturbances because the system is nonlinear and underactuated. First, a kinematic controller was used to track a reference trajectory and generate a control law that specifies the desired forward and rotation speeds of the system. Next, a nonlinear disturbance observer (NDO) was designed to enhance the system’s robustness to external disturbances and improve its tracking performance. Then, the coupled system state variables were decoupled into two subsystems: a forward rotation subsystem and a tilt angle velocity subsystem. An improved hierarchical sliding mode controller was designed to control these subsystems separately. Finally, simulation experiments were conducted to compare the proposed method with a common sliding mode control approach. The simulation results demonstrate that the proposed method achieves better tracking performance in the presence of unknown disturbances.

Keywords:

wheeled inverted pendulum robot; underactuated; nonlinear disturbance observer; hierarchical sliding mode control

1. Introduction

With the rapid development of technology, human society has achieved increased convenience and comfort [1]. In today’s factory warehouses and production lines, a variety of robots add to the possibilities of Industry 4.0. In the warehouses of more advanced companies, transport robots are widely deployed, replacing traditional manpower and eliminating the need for workers to carry out repetitive lifting and carrying. These “smart” robots can accomplish a task as long as they can follow a given transport trajectory. This paper studies trajectory tracking of a mobile wheeled inverted pendulum along a given reference trajectory, with the aim of accurately tracking the ideal motion trajectory, meeting the requirements of factory transport robots, and supporting the realization of smart factories [2,3,4,5].

Mobile wheeled inverted pendulum robots (WIPRs) have attracted much attention because of their special advantages, such as compactness, mobility, and human-like functions. WIPRs are widely used to verify the effectiveness of nonlinear underactuated control methods. Compared with the traditional inverted pendulum, WIPRs have more applications, especially in unknown, dynamic, and nonlinear environments, and are commonly used in logistics transportation, commuting, and navigation, as well as in the factory transportation setting mentioned above. However, a WIPR is a typical nonlinear underactuated system, with two input torques driving two wheels and three degrees of freedom (forward motion, rotation, and tilt angle of the pendulum), and achieving high-performance motion control is still a challenging task for the control community [6,7,8,9].

First, when a WIPR moves, it is always assumed that the ground provides enough friction to prevent the robot from side-slipping and wheel-sliding (i.e., the wheels roll purely without skidding), which imposes a nonholonomic (incomplete) constraint. Second, the underactuated inverted pendulum body must use the input torques of two driving wheels to control three degrees of freedom: forward movement, rotation, and the tilt angle of the inverted pendulum. To make a WIPR track a trajectory, the three state variables must be controlled in real time with two input variables, which is a typical underactuated problem. Finally, in the real world, WIPRs operate in factories or similar environments and always encounter various unknown disturbances. Therefore, the three problems of nonholonomic constraints, underactuation, and unknown disturbances are the main challenges in trajectory tracking control for this kind of mobile robot [10,11,12,13].

These three issues are important for the following reasons. First, the nonholonomic constraint means that the WIPR cannot follow an arbitrary trajectory, especially at high speed under heavy load; if the constraint is not considered in motion planning, obstacle avoidance may come too late and the planned trajectory may be unreachable. Second, underactuated robots often offer excellent dynamic performance or a lower drive cost, but they place higher demands on controller design. Finally, unknown disturbances degrade the control accuracy of the system and, more seriously, can affect the stability of the control system [14,15].

Many researchers and practitioners have proposed several control algorithms to overcome the difficulties faced by the problems associated with WIPR systems. One of the widely used methods is fuzzy control, which is an empirical, rule-based control technique that can effectively control nonlinear systems. By establishing a dynamic model of the WIPR, a fuzzy-logic-based controller can be designed to take the position and angle information of the WIPR as the input and output control signals to control its motion state. For instance, Jian Huang, in [16], proposed an Integral Interval Type 2 Fuzzy Logic (IT2FL) method that can maintain the MTWIP equilibrium while obtaining the desired position and orientation to make it work in an uncertain environment. However, the disadvantages of fuzzy control include low control accuracy, strong dependence on control rules, and difficulty in designing control rules.

The second control algorithm type is neural network control. Chenguang Yang [17] decomposed the underdriven WIPR model into two subsystems. The approximation characteristics of the neural network were used for motion control of the fully driven subsystem, and the sub-fully driven system was used to indirectly control the tilt angle motion of the pendulum. However, the method requires a large number of wavelet coefficient vectors, making the neural network computationally intensive.

Finally, sliding mode control, a typical robust control method, shows good tracking performance and strong robustness, which support its wide use in linear and nonlinear systems. For underactuated systems, researchers have proposed various sliding mode control methods to achieve different control effects, such as integral sliding mode control, terminal sliding mode control, and hierarchical sliding mode control [18,19,20]. Among them, hierarchical sliding mode control is receiving increasing attention in practical underactuated systems, such as balancing control of a double-inverted pendulum and trajectory tracking control of a wheeled inverted pendulum. Nabanita Adhikary, in [21], proposed an integral backstepping sliding mode controller for underactuated system control. A feedback control law was designed based on the backstepping method, and a sliding surface was introduced in the final stage of the algorithm. Jian Huang, in [22], designed two terminal sliding mode controllers to control the speed and braking of a UW-Car based on its dynamic model. He Ping [23] proposed a hierarchical sliding mode controller (HSMC) to simultaneously perform speed control and balance control of a two-wheeled self-balancing vehicle (TWSBV).

Hierarchical sliding mode control is a control strategy based on sliding mode control that divides the sliding surface into two layers. In the first layer, a high-speed sliding surface is introduced, and the control system approaches the desired state quickly. In the second layer, a low-speed sliding surface is introduced, and the control system stabilizes near the desired state. Hierarchical sliding mode control can improve control accuracy and stability and also benefits the response speed and robustness of the system. However, it shares the drawback of conventional sliding mode control: the discontinuous switching term easily causes the “jitter” (chattering) phenomenon. To address this shortcoming, this paper proposes an improved hierarchical sliding mode control method with an adaptive exponential convergence law, which adaptively adjusts the convergence law according to the control state and smooths the sign function, thus effectively reducing the strong jitter of traditional sliding mode control. The method is combined with a nonlinear disturbance observer (NDO), which is widely used together with sliding mode control. The NDO compensates for the negative impact of unknown disturbances, making the system more robust and improving the trajectory tracking ability of the WIPR system [24,25,26,27,28,29,30].

Overall, this paper includes the following four aspects: the first part constructs the dynamic model of the WIPR system, decouples the multi-coupled state variables, and facilitates the subsequent controller design; the second part establishes the kinematic trajectory tracking controller of the system and solves to obtain the desired speed of the dynamic control system. In the third part, an optimization model of the WIPR system combining nonlinear disturbance observer and hierarchical sliding mode control is designed, and the convergence of the nonlinear disturbance observer and the stability of the improved hierarchical sliding mode controller is demonstrated. The fourth part constructs the simulation model using the MATLAB/Simulink platform and conducts numerical simulation comparison experiments.

The contributions of this paper are as follows:

(1): A wheeled inverted pendulum robot with a transport platform is envisioned for use in warehouses or other application scenarios to move goods.
(2): The convergence law of hierarchical sliding mode control is improved to mitigate the jitter phenomenon of the sliding mode control system, and an adaptive function is introduced to minimize the system jitter.
(3): By combining a nonlinear disturbance observer and hierarchical sliding mode control to estimate unknown external disturbances as input compensation, the system is made to control more accurately.

2. Materials and Methods

2.1. WIPR Model

A WIPR is a wheeled inverted pendulum transport robot with a placement table, as illustrated in Figure 1. Its left and right wheels are independently driven; using the principle of differential drive, they control the robot’s movement speed, rotation direction, and pendulum tilt angle, and thereby the position and posture of the WIPR. The generalized world coordinate system is denoted Σ O X Y Z, and (x, y) is the center coordinate of the robot wheels. The robot’s forward velocity and rotational angular velocity are denoted v and w, respectively. The angle of the robot’s direction of motion with respect to the X-axis is θ, while α is the tilt angle of the pendulum with respect to the Z-axis. M is the total mass of the transport platform plus the pendulum, whereas m is the mass of each drive wheel. The distance between the two wheels is d, while τ_{r} and τ_{l} are the torques of the right wheel and the left wheel, respectively. The rotational inertia of each driven wheel is denoted by I_{w}, and I_{M} represents the rotational inertia of the transport platform and the pendulum together. The length of the pendulum is L. The robot parameters are listed in detail in Table 1.

Remark 1.

The forward velocity of the WIPR is x_{v}, and x_{v} = \dot{x} \cos θ + \dot{y} \sin θ.

Assumption 1.

The tires of the WIPR do not experience any skidding, and there is no potential for lateral deflection during its motion.

According to Assumption 1, the nonholonomic (incomplete) constraint of the WIPR can be written as Equation (1):

\dot{x} \sin θ - \dot{y} \cos θ = 0,

(1)

The position and posture of the WIPR in the world coordinate system are represented by q = {[x, y, θ, α]}^{T}. Because the Lagrangian modeling method does not require internal forces to be included, it is a quick and straightforward way to build a model, which makes it well suited to constructing the multivariable, nonlinear dynamic model of the WIPR. The vector q is divided into q_{m} = {[x, y, θ]}^{T}, the position and heading of the robot in the coordinate system, and α, the angle of the pendulum. The Lagrangian method [31] is then employed to establish the dynamic model of the WIPR, and the resulting mathematical model is presented below as Equation (2).

[\begin{matrix} M_{m} (q) & M_{m α} (q) \\ M_{α m} (q) & M_{α} (q) \end{matrix}] [\begin{matrix} {\ddot{q}}_{m} \\ \ddot{α} \end{matrix}] + [\begin{matrix} C_{m} (q) & C_{m α} (q) \\ C_{α m} (q) & C_{α} (q) \end{matrix}] [\begin{matrix} {\dot{q}}_{m} \\ \dot{α} \end{matrix}] + [\begin{matrix} G_{m} \\ G_{α} \end{matrix}] = [\begin{matrix} B_{m} (q_{m}) τ_{m} \\ 0 \end{matrix}] + [\begin{matrix} A^{T} (q_{m}) λ \\ 0 \end{matrix}] + [\begin{matrix} d_{m} \\ d_{α} \end{matrix}],

(2)

By defining A (q_{m}) = [\sin θ, - \cos θ, 0], the nonholonomic constraint of Equation (1) can be written as follows:

A (q_{m}) {\dot{q}}_{m} = 0,

(3)

The nonholonomic constraint force of the WIPR system is A^{T} (q_{m}) λ, where λ is the Lagrange multiplier. To eliminate the constraint force from the system, we seek a matrix S (q_{m}) \in ℝ^{3 \times 2} that satisfies S^{T} (q_{m}) A^{T} (q_{m}) = 0.

By defining ν = {[v, w]}^{T}, Equation (4) can be deduced.

{\dot{q}}_{m} = S (q_{m}) ν,

(4)
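The text does not give S (q_{m}) explicitly. As a minimal sketch, assuming the standard differential-drive choice with columns [\cos θ, \sin θ, 0]^{T} and [0, 0, 1]^{T} (an assumption, not taken from the paper), the following Python snippet checks numerically that this choice annihilates the constraint row and that Equation (4) then satisfies Equation (3). The function names and the test values of θ and ν are hypothetical.

```python
import numpy as np

def constraint_row(theta):
    """A(q_m) from the text: the no-side-slip constraint row."""
    return np.array([[np.sin(theta), -np.cos(theta), 0.0]])

def velocity_map(theta):
    """Assumed S(q_m): maps nu = [v, w] to [x_dot, y_dot, theta_dot]."""
    return np.array([
        [np.cos(theta), 0.0],
        [np.sin(theta), 0.0],
        [0.0, 1.0],
    ])

theta = 0.7                    # arbitrary heading in radians
nu = np.array([1.2, -0.4])     # arbitrary forward and angular velocity
A = constraint_row(theta)
S = velocity_map(theta)
q_m_dot = S @ nu               # Equation (4)

# S^T A^T = 0 is the transpose of A S = 0, so checking A S is enough.
print("A S       =", A @ S)
print("A q_m_dot =", A @ q_m_dot)   # Equation (3) holds for any nu
```

With this choice, Equation (4) reduces to the familiar unicycle kinematics: \dot{x} = v \cos θ, \dot{y} = v \sin θ, \dot{θ} = w.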

To eliminate the nonholonomic constraint force, a new vector \dot{ζ} = {[{\dot{ζ}}_{1}, {\dot{ζ}}_{2}, {\dot{ζ}}_{3}]}^{T} = {[v, w, \dot{α}]}^{T} is defined. Substituting Equation (4) into Equation (2) and left-multiplying the first block row by the matrix S^{T} (q_{m}) results in Equation (5):

M (q) \ddot{ζ} + C (q, \dot{q}) \dot{ζ} + G (q) = τ + τ_{d},

(5)

In Equation (5), M (q) \in ℝ^{3 \times 3} is the inertia matrix, C (q, \dot{q}) \in ℝ^{3 \times 3} is the Coriolis force matrix, G (q) \in ℝ^{3 \times 1} is the gravity vector, τ is the control input vector, and τ_{d} is the total unknown disturbance. The detailed expressions of each vector or matrix are presented below.

M (q) = [\begin{matrix} S^{T} M_{m} S & S^{T} M_{m α} \\ M_{α m} S & M_{α} \end{matrix}] = [\begin{matrix} m_{11} & 0 & m_{13} \\ 0 & m_{22} & 0 \\ m_{31} & 0 & m_{33} \end{matrix}],

C (q, \dot{q}) = [\begin{matrix} S^{T} C_{m} S + S^{T} M_{m} \dot{S} & C_{m α} \\ C_{α m} S + M_{α m} \dot{S} & C_{α} \end{matrix}] = [\begin{matrix} 0 & 0 & c_{13} \\ 0 & c_{22} & c_{23} \\ 0 & c_{32} & 0 \end{matrix}],

G (q) = [\begin{matrix} S^{T} G_{m} \\ G_{α} \end{matrix}] = [\begin{matrix} 0 \\ 0 \\ g_{3} \end{matrix}] = [\begin{matrix} 0 \\ 0 \\ - M g L \sin α \end{matrix}],

τ = [\begin{matrix} S^{T} B_{m} (q_{m}) τ_{m} \\ 0 \end{matrix}] = [\begin{matrix} τ_{1} \\ τ_{2} \\ 0 \end{matrix}],

τ_{d} = [\begin{matrix} S^{T} d_{m} \\ d_{α} \end{matrix}] = [\begin{matrix} d_{1} \\ d_{2} \\ d_{3} \end{matrix}] .

The entries in these expressions are m_{11} = 2 m + 2 I_{M} / r^{2} + M, m_{13} = m_{31} = M L \cos α, m_{22} = d^{2} m / 2 + I_{M} d^{2} / (2 r^{2}) + I_{ω} + M L^{2} \sin^{2} α, m_{33} = M L^{2} + I_{M}, c_{22} = (1 / 2) M L^{2} \dot{α} \sin^{2} (2 α), c_{23} = - (1 / 2) ω M L^{2} \sin (2 α), c_{13} = - M L \dot{α} \sin α, and c_{32} = - (1 / 2) ω M L^{2} \sin (2 α).

Due to the coupling of the state variables in the system, Equation (5) is decoupled into Equation (6).

\{\begin{array}{l} (m_{11} m_{33} - m_{13} m_{31}) {\ddot{ζ}}_{1} - m_{13} c_{32} {\dot{ζ}}_{2} + m_{33} c_{13} {\dot{ζ}}_{3} - m_{13} g_{3} \\ = m_{33} (τ_{1} + d_{1}) - m_{13} d_{3} \\ m_{22} {\ddot{ζ}}_{2} + c_{22} {\dot{ζ}}_{2} + c_{23} {\dot{ζ}}_{3} = τ_{2} + d_{2} \\ (m_{11} m_{33} - m_{13} m_{31}) {\ddot{ζ}}_{3} + m_{11} c_{32} {\dot{ζ}}_{2} - m_{31} c_{13} {\dot{ζ}}_{3} + m_{11} g_{3} \\ = m_{11} d_{3} - m_{31} (τ_{1} + d_{1}) \end{array},

(6)
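As a quick consistency check on the decoupling step, the sketch below solves Equation (5) numerically for the accelerations and compares the result with the closed-form expressions obtained by isolating each acceleration in Equation (6). The numeric matrix entries, velocities, torques, and disturbances are hypothetical placeholders chosen only for illustration; they are not the robot parameters of Table 1.

```python
import numpy as np

def accel_from_eq5(e, zeta_dot, tau, d):
    """Solve M zeta_ddot = tau + d - C zeta_dot - G (Equation (5))."""
    M = np.array([[e["m11"], 0.0, e["m13"]],
                  [0.0, e["m22"], 0.0],
                  [e["m13"], 0.0, e["m33"]]])     # m31 = m13
    C = np.array([[0.0, 0.0, e["c13"]],
                  [0.0, e["c22"], e["c23"]],
                  [0.0, e["c32"], 0.0]])
    G = np.array([0.0, 0.0, e["g3"]])
    rhs = np.array([tau[0], tau[1], 0.0]) + d - C @ zeta_dot - G
    return np.linalg.solve(M, rhs)

def accel_from_eq6(e, zeta_dot, tau, d):
    """Isolate each acceleration in the decoupled form, Equation (6)."""
    omega = e["m11"] * e["m33"] - e["m13"] ** 2
    z2, z3 = zeta_dot[1], zeta_dot[2]
    a1 = (e["m13"] * e["c32"] * z2 - e["m33"] * e["c13"] * z3
          + e["m13"] * e["g3"]
          + e["m33"] * (tau[0] + d[0]) - e["m13"] * d[2]) / omega
    a2 = (tau[1] + d[1] - e["c22"] * z2 - e["c23"] * z3) / e["m22"]
    a3 = (-e["m11"] * e["c32"] * z2 + e["m13"] * e["c13"] * z3
          - e["m11"] * e["g3"]
          + e["m11"] * d[2] - e["m13"] * (tau[0] + d[0])) / omega
    return np.array([a1, a2, a3])

# Hypothetical numeric entries, evaluated at some fixed state.
entries = {"m11": 12.0, "m13": 1.5, "m22": 0.8, "m33": 2.0,
           "c13": -0.3, "c22": 0.05, "c23": -0.1, "c32": -0.1, "g3": -4.0}
zeta_dot = np.array([0.5, 0.2, -0.1])   # [v, w, alpha_dot]
tau = np.array([1.0, 0.3])              # [tau_1, tau_2]
d = np.array([0.1, -0.05, 0.02])        # [d_1, d_2, d_3]

print(accel_from_eq5(entries, zeta_dot, tau, d))
print(accel_from_eq6(entries, zeta_dot, tau, d))
```

Solving the linear system directly avoids hand-expanding the inverse of M (q), which is a convenient cross-check whenever the closed-form expressions are transcribed into simulation code.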

By defining

ξ_{1} = {[ζ_{1}, ζ_{2}, ζ_{3}]}^{T}, ξ_{2} = {[{\dot{ζ}}_{1}, {\dot{ζ}}_{2}, {\dot{ζ}}_{3}]}^{T}

, Equation (6) is converted to Equation (7).

\{\begin{array}{l} {\dot{ξ}}_{1} = ξ_{2} \\ {\dot{ξ}}_{2} = f (ξ) + g (ξ) τ + D \end{array},

(7)

where

f (ξ) = [\begin{matrix} f_{1} \\ f_{2} \\ f_{3} \end{matrix}] = [\begin{matrix} \frac{1}{Ω} (m_{13} c_{32} {\dot{ζ}}_{2} - m_{33} c_{13} {\dot{ζ}}_{3} - m_{13} g_{3}) \\ \frac{1}{m_{22}} (- c_{22} {\dot{ζ}}_{2} - c_{23} {\dot{ζ}}_{3}) \\ \frac{1}{Ω} (m_{11} c_{32} {\dot{ζ}}_{2} - m_{31} c_{13} {\dot{ζ}}_{3} + m_{11} g_{3}) \end{matrix}], g (ξ) = [\begin{matrix} \frac{m_{33}}{Ω} \\ \frac{1}{m_{22}} \\ \frac{- m_{31}}{Ω} \end{matrix}], D = [\begin{matrix} D_{1} \\ D_{2} \\ D_{3} \end{matrix}] = [\begin{matrix} \frac{m_{33} d_{1} - m_{13} d_{3}}{Ω} \\ \frac{d_{2}}{m_{22}} \\ \frac{m_{11} d_{3} - m_{31} d_{1}}{Ω} \end{matrix}] .

where

Ω = m_{11} m_{33} - m_{13} m_{31}

.

2.2. The Design of the Kinematic Control Law

In kinematic trajectory tracking control, the WIPR can be simplified to a general two-wheeled nonholonomic mobile robot. The design uses a reference trajectory state vector q_{m r} = {[x_{r}, y_{r}, θ_{r}]}^{T}, an actual state vector q_{m} = {[x, y, θ]}^{T}, and control inputs given by the linear and angular velocities. The objective is to make the actual robot trajectory follow the reference trajectory, i.e., to drive the trajectory error q_{m e} = {[e_{x}, e_{y}, e_{θ}]}^{T} = {[x_{r} - x, y_{r} - y, θ_{r} - θ]}^{T} to zero. The control laws for v_{d} and w_{d} must therefore satisfy Equation (8).

\lim_{t \to \infty} ‖ q_{m e} ‖ = 0,

(8)

The Lyapunov function is selected as Equation (9).

V_{1} = \frac{1}{2} e_{x}^{2} + \frac{1}{2} e_{y}^{2} + 1 - \cos e_{θ},

(9)

Whenever the error is nonzero, V_{1} is greater than zero, so the function is positive definite. Equation (10) gives the derivative of V_{1}.

{\dot{V}}_{1} = e_{x} {\dot{e}}_{x} + e_{y} {\dot{e}}_{y} + {\dot{e}}_{θ} \sin e_{θ} = e_{x} (w e_{y} - v + v_{r} \cos e_{θ}) + e_{y} (- w e_{x} + v_{r} \sin e_{θ}) + (w_{r} - w) \sin e_{θ} = - v e_{x} - w \sin e_{θ} + e_{x} v_{r} \cos e_{θ} + e_{y} v_{r} \sin e_{θ} + w_{r} \sin e_{θ} \leq 0,

(10)

By Lyapunov’s stability theorem [32], the tracking error dynamics are stable if the derivative of the Lyapunov function is non-positive, i.e., if {\dot{V}}_{1} \leq 0. Accordingly, the control law is chosen as Equation (11).

\{\begin{array}{l} v_{d} = v_{r} \cos e_{θ} + k_{1} e_{x} \\ w_{d} = w_{r} + v_{r} e_{y} + k_{2} \sin e_{θ} \end{array},

(11)

Both k_{1} and k_{2} are positive constants.

Substituting the control law into {\dot{V}}_{1} gives Equation (12).

\begin{array}{l} {\dot{V}}_{1} = - e_{x} (v_{r} \cos e_{θ} + k_{1} e_{x}) - (w_{r} + v_{r} e_{y} + k_{2} \sin e_{θ}) \sin e_{θ} + e_{x} v_{r} \cos e_{θ} + e_{y} v_{r} \sin e_{θ} + w_{r} \sin e_{θ} \\ = - e_{x} v_{r} \cos e_{θ} + e_{x} v_{r} \cos e_{θ} - k_{1} e_{x}^{2} - w_{r} \sin e_{θ} + w_{r} \sin e_{θ} - v_{r} e_{y} \sin e_{θ} + e_{y} v_{r} \sin e_{θ} - k_{2} \sin^{2} e_{θ} \\ = - k_{1} e_{x}^{2} - k_{2} \sin^{2} e_{θ} \leq 0 . \end{array}

(12)

Therefore, {\dot{V}}_{1} \leq 0 is proven.

Equation (11) thus provides the desired velocities required for the design of the dynamic controller. The velocity tracking problem of the dynamic system and the angle tracking problem of the pendulum are solved next.
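The kinematic law of Equation (11) can be exercised with a small simulation. The sketch below is illustrative only: it assumes an ideal unicycle that follows the commanded velocities exactly, forward-Euler integration, and tracking errors expressed in the robot frame, which is the convention that the error derivatives in Equation (10) presuppose. The gains, reference velocities, and initial pose are hypothetical values.

```python
import numpy as np

def body_frame_error(pose, ref):
    """Tracking error [e_x, e_y, e_theta] expressed in the robot frame."""
    x, y, th = pose
    xr, yr, thr = ref
    dx, dy = xr - x, yr - y
    e_x = np.cos(th) * dx + np.sin(th) * dy
    e_y = -np.sin(th) * dx + np.cos(th) * dy
    return np.array([e_x, e_y, thr - th])

def kinematic_law(e, v_r, w_r, k1, k2):
    """Desired velocities v_d, w_d from Equation (11)."""
    e_x, e_y, e_th = e
    v_d = v_r * np.cos(e_th) + k1 * e_x
    w_d = w_r + v_r * e_y + k2 * np.sin(e_th)
    return v_d, w_d

def lyapunov_v1(e):
    """Lyapunov function of Equation (9)."""
    return 0.5 * e[0] ** 2 + 0.5 * e[1] ** 2 + 1.0 - np.cos(e[2])

def step(pose, v, w, dt):
    """One forward-Euler step of the unicycle kinematics."""
    x, y, th = pose
    return np.array([x + dt * v * np.cos(th),
                     y + dt * v * np.sin(th),
                     th + dt * w])

def simulate(k1=2.0, k2=2.0, v_r=1.0, w_r=0.5, dt=0.01, t_end=20.0):
    """Track a circular reference and record V_1 at every step."""
    ref = np.array([0.0, 0.0, 0.0])
    pose = np.array([-0.5, 0.5, -0.3])   # hypothetical initial offset
    history = []
    for _ in range(int(round(t_end / dt))):
        e = body_frame_error(pose, ref)
        history.append(lyapunov_v1(e))
        v, w = kinematic_law(e, v_r, w_r, k1, k2)
        pose = step(pose, v, w, dt)
        ref = step(ref, v_r, w_r, dt)
    history.append(lyapunov_v1(body_frame_error(pose, ref)))
    return np.array(history)

v1 = simulate()
print(f"V1 at start: {v1[0]:.4f}, V1 at end: {v1[-1]:.2e}")
```

In the full scheme the robot does not follow v_{d} and w_{d} exactly; closing that gap is the job of the dynamic controller designed in the following sections.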

2.3. The Design of NDO

A nonlinear disturbance observer is developed to estimate the actual disturbance in the system for an unknown disturbance D, thereby strengthening the system’s robustness. To address practical considerations, it is assumed that any disturbance is bounded as follows [33,34,35].

Lemma 1.

For bounded initial conditions, the state x (t) is uniformly bounded if there exists a continuous positive definite Lyapunov function V (x) satisfying the following conditions:

δ_{1} (‖ x ‖) \leq V (x) \leq δ_{2} (‖ x ‖), \dot{V} (x) \leq - κ V (x) + c,

(13)

where δ_{1}, δ_{2} : ℝ^{n} \to ℝ are class K functions, and κ and c are positive constants.

Assumption 2.

Since no disturbance can be infinite in the real world, we assume that the perturbations in the WIPR system studied in this paper are all bounded, and their first-order derivatives and second-order derivatives are assumed to be bounded; thus, the following equations can be obtained.

‖ \dot{D} ‖ \leq ε_{1}, ‖ \ddot{D} ‖ \leq ε_{2}, ε_{1} > 0, ε_{2} > 0,

where ‖ \cdot ‖ represents the Euclidean norm of a vector. The NDO is designed as in Equation (14).

\{\begin{array}{l} \hat{D} = Z_{1} + P_{1} (ξ_{2}) \\ {\dot{Z}}_{1} = - L_{1} (ξ_{2}) [f (ξ) + g (ξ) τ + \hat{D}] + \hat{\dot{D}} \\ \hat{\dot{D}} = Z_{2} + P_{2} (ξ_{2}) \\ {\dot{Z}}_{2} = - L_{2} (ξ_{2}) [f (ξ) + g (ξ) τ + \hat{D}] \end{array},

(14)

Here, \hat{D} and \hat{\dot{D}} are the estimates of the total perturbation and its derivative, respectively, while Z_{1} and Z_{2} are intermediate variables of the observer. P_{1} (ξ_{2}) and P_{2} (ξ_{2}) are nonlinear functions to be designed, and the observer gains are their derivatives, L_{1} (ξ_{2}) = \frac{\partial P_{1} (ξ_{2})}{\partial ξ_{2}} and L_{2} (ξ_{2}) = \frac{\partial P_{2} (ξ_{2})}{\partial ξ_{2}}.

Property 1.

The error between the actual and estimated values of the disturbance is \tilde{D} = D - \hat{D}, while \tilde{\dot{D}} = \dot{D} - \hat{\dot{D}} is the error between the derivative of the actual disturbance and its estimate.

Differentiating \tilde{D} and \tilde{\dot{D}} with respect to time and substituting Equations (7) and (14) yields \dot{\tilde{D}} and \dot{\tilde{\dot{D}}}, the error dynamics of the NDO, given in Equations (15) and (16), respectively.

\begin{array}{l} \dot{\tilde{D}} = \dot{D} - \dot{\hat{D}} = \dot{D} - {\dot{Z}}_{1} - \frac{\partial P_{1} (ξ_{2})}{\partial ξ_{2}} \cdot \frac{d ξ_{2}}{d t} = \dot{D} - {\dot{Z}}_{1} - L_{1} (ξ_{2}) {\dot{ξ}}_{2} \\ = \dot{D} + L_{1} (ξ_{2}) [f (ξ) + g (ξ) τ + \hat{D}] - \hat{\dot{D}} - L_{1} (ξ_{2}) {\dot{ξ}}_{2} \\ = \dot{D} + L_{1} (ξ_{2}) [f (ξ) + g (ξ) τ + \hat{D}] - \hat{\dot{D}} - L_{1} (ξ_{2}) [f (ξ) + g (ξ) τ + D] \\ = - L_{1} (ξ_{2}) \tilde{D} + \tilde{\dot{D}}, \end{array}

(15)

\begin{array}{l} \dot{\tilde{\dot{D}}} = \ddot{D} - \dot{\hat{\dot{D}}} = \ddot{D} - {\dot{Z}}_{2} - \frac{\partial P_{2} (ξ_{2})}{\partial ξ_{2}} \cdot \frac{d ξ_{2}}{d t} = \ddot{D} - {\dot{Z}}_{2} - L_{2} (ξ_{2}) {\dot{ξ}}_{2} \\ = \ddot{D} + L_{2} (ξ_{2}) [f (ξ) + g (ξ) τ + \hat{D}] - L_{2} (ξ_{2}) [f (ξ) + g (ξ) τ + D] \\ = - L_{2} (ξ_{2}) \tilde{D} + \ddot{D}, \end{array}

(16)

Letting E = {[{\tilde{D}}^{T}, {\tilde{\dot{D}}}^{T}]}^{T} and combining Equations (15) and (16), Equation (17) can be obtained.

\dot{E} = L E + δ \ddot{D},

(17)

where L = [\begin{matrix} - L_{1} (ξ_{2}) & I_{3} \\ - L_{2} (ξ_{2}) & 0 \end{matrix}] and δ = [\begin{matrix} 0 \\ I_{3} \end{matrix}]. To verify that the observer estimates the disturbance reliably, its stability is examined with a Lyapunov function, which is selected as Equation (18).

Property 2.

L

is a skew-symmetric matrix.

V_{2} = \frac{1}{2} E^{T} E,

(18)

The derivative of V_{2} satisfies the following inequality:

{\dot{V}}_{2} = E^{T} \dot{E} = E^{T} (L E + δ \ddot{D}) \leq E^{T} (L + 0.5 ‖ δ ‖^{2} I_{6}) E + 0.5 ε_{2}^{2},

(19)

The above design of an NDO for a WIPR can be summarized in the following theorem.

Theorem 1.

For a WIPR system subject to an unknown disturbance, the disturbance estimation error of the observer designed according to Equation (14) is bounded.

Proof.

If the design parameters L_{1} (ξ_{2}) = \frac{\partial P_{1} (ξ_{2})}{\partial ξ_{2}} and L_{2} (ξ_{2}) = \frac{\partial P_{2} (ξ_{2})}{\partial ξ_{2}} are chosen such that L + 0.5 ‖ δ ‖^{2} I_{6} is a negative definite matrix, then, according to Lemma 1, E is bounded. □
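The structure of the observer in Equation (14) can be illustrated on a scalar toy system. The following sketch is not the WIPR observer: it assumes a one-dimensional plant with hypothetical f = - ξ_{2}, g = 1, and τ = 0, a hypothetical disturbance D (t) = \sin t, and the simplest linear design P_{1} = l_{1} ξ_{2}, P_{2} = l_{2} ξ_{2}, so that L_{1} = l_{1} and L_{2} = l_{2} are constants. Under these assumptions the error dynamics of Equations (15) and (16) have the characteristic polynomial λ^{2} + l_{1} λ + l_{2}.

```python
import numpy as np

def simulate_ndo(l1=20.0, l2=100.0, dt=1e-3, t_end=10.0):
    """Scalar instance of Equation (14); returns D - D_hat over time."""
    n = int(round(t_end / dt))
    x2 = 0.0              # measured state, playing the role of xi_2
    z1, z2 = 0.0, 0.0     # observer internal states Z_1, Z_2
    err = np.empty(n)
    for k in range(n):
        t = k * dt
        d_true = np.sin(t)             # hypothetical disturbance D(t)
        f, g, tau = -x2, 1.0, 0.0      # hypothetical plant terms and input
        d_hat = z1 + l1 * x2           # D_hat = Z_1 + P_1(xi_2)
        d_dot_hat = z2 + l2 * x2       # estimate of D_dot = Z_2 + P_2(xi_2)
        err[k] = d_true - d_hat
        model = f + g * tau + d_hat    # bracketed term in Equation (14)
        # Forward-Euler updates, all based on the values at step k.
        z1 += dt * (-l1 * model + d_dot_hat)
        z2 += dt * (-l2 * model)
        x2 += dt * (f + g * tau + d_true)
    return err

err = simulate_ndo()
print("max |D - D_hat| over the last 2 s:", np.max(np.abs(err[-2000:])))
```

The residual error does not vanish because the second derivative of the disturbance keeps driving Equation (16); it stays bounded, which is the behavior that Theorem 1 describes. Larger gains shrink the bound at the cost of greater sensitivity to measurement noise.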

2.4. The Design of Improved Sliding Mode Control

The sliding mode control algorithm consists of two key elements: (i) the design of the sliding mode surface; and (ii) the design of the convergence law (often called the reaching law). The design of the sliding mode surface is mainly based on the system structure and the control objective. For the convergence law, there are four common choices: the isokinetic (constant-rate) convergence law, the exponential convergence law, the power convergence law, and the general convergence law. In this paper, to cope with the nonlinearity and underactuation of the WIPR as well as unknown disturbances, the traditional exponential convergence law is improved by introducing an adaptive function. The resulting sliding mode control, based on the adaptive exponential convergence law, weakens the system jitter (chattering) while speeding up the system response, making it more suitable for tracking the reference trajectory of the WIPR system under unknown disturbances [36]. The traditional exponential convergence law is shown in Equation (20).

\dot{s} = - η sgn (s) - μ s,

(20)

where s denotes the sliding surface function, the parameters η and μ denote the convergence coefficients, and sgn (s) denotes the sign function.

In the traditional exponential convergence law, the isokinetic term is - η sgn (s) and the exponential term is - μ s. When the state of the system is far from the sliding surface, the exponential and isokinetic terms act together to drive the system toward the sliding surface, with magnitudes determined mainly by the parameters η and μ. When the system is close to the surface, the exponential term is small and the isokinetic term dominates.

This paper improves the traditional exponential convergence law by introducing the adaptive function o (s), which adjusts the convergence law according to the control state of the system, as shown in Figure 2. This accelerates the convergence of the sliding mode, weakens overshoot, and reduces the jitter of the system. With the adaptive function o (s) included, the new exponential convergence law is as follows:

\{\begin{array}{l} \dot{s} = - η o (s) sgn (s) - μ s \\ o (s) = \frac{(| s | + 1) [\log_{a} (| s | + a)]}{| s | + \log_{a} (| s | + a) + b} \end{array},

(21)

where a > 0, b > 0, η > 0, and μ > 0.

Compared with the case without the adaptive function (i.e., o(s) = 1), the analysis shows the following. When the system motion point is far from the sliding surface (namely, when s is far from the origin), the adaptive function o (s) increases the convergence rate, which shortens the time for the system state to reach the target state. When the system motion point is close to the sliding surface, |s| converges to 0 and o (s) becomes less than 1. Here o (s) suppresses the jitter amplitude and weakens the fluctuation of the state variables after the system has stabilized, and the suppression becomes more pronounced as the parameter b increases. To further weaken the jitter of the stabilized system, the sign function sgn (s) is smoothed in this paper. The traditional sign function [37] is shown in the following equation.

sgn (s) = \{\begin{array}{ll} 1, & s > 0 \\ 0, & s = 0 \\ - 1, & s < 0 \end{array},

(22)

The sign function after the smoothing process is shown below.

sgn (s) = \frac{s}{|s| + β}, β = 0.01,

(23)
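The behavior of the adaptive function and the smoothed sign function can be checked numerically. The sketch below evaluates Equations (21) and (23) at a few points; the parameter values a = e, b = 1, η = 1, and μ = 2 are hypothetical, and a > 1 is assumed here so that \log_{a} is increasing. With these values o (0) = 1 / (1 + b) is below 1, while o (s) exceeds 1 once |s| is large.

```python
import math

def adaptive_gain(s, a=math.e, b=1.0):
    """o(s) from Equation (21); a > 1 is assumed in this sketch."""
    mag = abs(s)
    log_term = math.log(mag + a, a)   # log base a of (|s| + a)
    return (mag + 1.0) * log_term / (mag + log_term + b)

def smooth_sign(s, beta=0.01):
    """Smoothed sign function from Equation (23)."""
    return s / (abs(s) + beta)

def reaching_rate(s, eta=1.0, mu=2.0, a=math.e, b=1.0, beta=0.01):
    """s_dot from Equation (21), using the smoothed sign function."""
    return -eta * adaptive_gain(s, a, b) * smooth_sign(s, beta) - mu * s

for s in (0.0, 0.1, 1.0, 10.0):
    print(f"s={s:5.1f}  o(s)={adaptive_gain(s):.3f}  "
          f"sgn~={smooth_sign(s):.3f}  s_dot={reaching_rate(s):.3f}")
```

Because the smoothed sign function is continuous at s = 0, the switching term no longer jumps between - η and η, which is what reduces the jitter near the sliding surface.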

2.5. The Design of the Forward-Rotation Subsystem

For convenience, the system has been reorganized into the following form (24).

\{\begin{array}{l} {\ddot{ζ}}_{1} = \frac{1}{Ω} (m_{13} c_{32} {\dot{ζ}}_{2} - m_{33} c_{13} {\dot{ζ}}_{3} - m_{13} g_{3}) + \frac{m_{33}}{Ω} τ_{1} + D_{1} \\ {\ddot{ζ}}_{2} = \frac{1}{m_{22}} (- c_{22} {\dot{ζ}}_{2} - c_{23} {\dot{ζ}}_{3}) + \frac{1}{m_{22}} τ_{2} + D_{2} \end{array},

(24)

The state variables in the system described by Equation (24) are highly coupled. To address this issue and to expand the system’s asymptotic stability domain, a hierarchical sliding mode controller was designed. The controller’s primary objective is to use an input control law that simultaneously controls both system variables ζ_{1} and ζ_{2}, thereby mitigating the problem of system coupling [38].

Having obtained the expected forward velocity (v_{d}) and angular velocity (w_{d}) from Equation (11), the errors between the expected and actual values can be defined as follows:

\{\begin{array}{l} e_{x_{v}} = ζ_{1 d} - ζ_{1} = x_{r} \cos θ_{r} + y_{r} \sin θ_{r} - x_{v} \\ e_{v} = {\dot{ζ}}_{1 d} - {\dot{ζ}}_{1} = v_{d} - v \\ e_{θ} = ζ_{2 d} - ζ_{2} = θ_{r} - θ \\ e_{w} = {\dot{ζ}}_{2 d} - {\dot{ζ}}_{2} = w_{d} - w \end{array},

(25)

To design the sliding mode error tracking scheme for the v-w subsystem, two mutually independent first-layer sliding mode surfaces were first constructed. These sliding surfaces are defined as follows:

s_{1} = γ_{1} e_{x_{v}} + e_{v}, γ_{1} > 0,

(26)

s_{2} = γ_{2} e_{θ} + e_{w}, γ_{2} > 0,

(27)

Differentiating Equations (26) and (27) with respect to time and setting the derivatives to zero gives the following.

\{\begin{array}{l} {\dot{s}}_{1} = γ_{1} {\dot{e}}_{x_{v}} + {\dot{e}}_{v} = γ_{1} {\dot{e}}_{x_{v}} + {\ddot{ζ}}_{1 d} - {\ddot{ζ}}_{1} = 0 \\ {\dot{s}}_{2} = γ_{2} {\dot{e}}_{θ} + {\dot{e}}_{w} = γ_{2} {\dot{e}}_{θ} + {\ddot{ζ}}_{2 d} - {\ddot{ζ}}_{2} = 0 \end{array},

(28)

According to Filippov’s equivalent control theory, the equivalent control laws for ζ_{1} and ζ_{2} are as follows:

\{\begin{array}{l} τ_{e q 1} = - \frac{Ω}{m_{33}} (γ_{1} {\dot{e}}_{x_{v}} + {\ddot{ζ}}_{1 d} + {\hat{D}}_{1}) + \frac{1}{m_{33}} (m_{13} c_{32} {\dot{ζ}}_{2} - m_{33} c_{13} {\dot{ζ}}_{3} - m_{13} g_{3}) \\ τ_{e q 2} = m_{22} (γ_{2} {\dot{e}}_{θ} + {\ddot{ζ}}_{2 d}) + c_{22} {\dot{ζ}}_{2} + c_{23} {\dot{ζ}}_{3} - {\hat{D}}_{2} \end{array}

(29)

The second-layer sliding surface is defined as a linear combination of the two first-layer sliding surfaces.

s_{3} = γ_{3} s_{1} + γ_{4} s_{2}, γ_{3}, γ_{4} > 0,

(30)
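The two-layer structure of Equations (26), (27) and (30) can be sketched in a few lines. This is a minimal illustration: the function names and the numerical gains are hypothetical, and only the algebraic form of the surfaces comes from the text.

```python
def first_layer_surface(gamma, e, e_rate):
    """First-layer surface s = gamma * e + e_rate, as in Equations (26) and (27)."""
    if gamma <= 0:
        raise ValueError("gamma must be positive")
    return gamma * e + e_rate


def second_layer_surface(gamma3, gamma4, s1, s2):
    """Second-layer surface s3 = gamma3 * s1 + gamma4 * s2, as in Equation (30)."""
    if gamma3 <= 0 or gamma4 <= 0:
        raise ValueError("gamma3 and gamma4 must be positive")
    return gamma3 * s1 + gamma4 * s2


if __name__ == "__main__":
    # Made-up errors and gains, used only to show the call pattern.
    s1 = first_layer_surface(2.0, e=0.6, e_rate=0.2)    # forward channel
    s2 = first_layer_surface(3.0, e=-0.1, e_rate=0.1)   # rotation channel
    s3 = second_layer_surface(1.0, 0.5, s1, s2)
    print(s1, s2, s3)
```

Because the second layer is a weighted sum, the weights $γ_{3}$ and $γ_{4}$ set how strongly each channel's surface contributes to the shared control input.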

To drive both $ζ_{1}$ and $ζ_{2}$ onto their respective sliding surfaces, the control input must include both equivalent control laws at the same time. Therefore, the total control law is given by the following equation.

τ_{u} = τ_{e q 1} + τ_{e q 2} + τ_{s w},

(31)

Here, $τ_{s w}$ is the switching control law for the reaching phase, which is expressed as follows.

τ_{s w} = \frac{- (γ_{3} \frac{m_{33}}{Ω} τ_{e q 2} + γ_{4} \frac{1}{m_{22}} τ_{e q 1} + η_{1} o (s_{3}) sgn (s_{3}) + μ_{1} s_{3})}{γ_{3} \frac{m_{33}}{Ω} + γ_{4} \frac{1}{m_{22}}},

(32)

To mitigate chattering in the system, the constant-rate and exponential terms of the sliding mode reaching law are improved. Here, $η_{1}$ and $μ_{1}$ are the gains of the constant-rate and exponential terms of the reaching law designed above, and both are positive constants; $o (s_{3})$ and $sgn (s_{3})$ are the adaptive function and the sign function defined above. To prove that the designed controller is stable, the Lyapunov function is chosen as follows.

V_{3} = \frac{1}{2} s_{3}^{2} > 0

(33)

The time derivative of $V_{3}$ is given by the following expression.

\begin{array}{l}
{\dot{V}}_{3} = s_{3} {\dot{s}}_{3} = s_{3} (γ_{3} {\dot{s}}_{1} + γ_{4} {\dot{s}}_{2}) \\
= s_{3} [γ_{3} (γ_{1} {\dot{e}}_{x_{v}} + {\dot{e}}_{v}) + γ_{4} (γ_{2} {\dot{e}}_{θ} + {\dot{e}}_{w})] \\
= s_{3} [γ_{3} (γ_{1} {\dot{e}}_{x_{v}} + {\dot{v}}_{d} - \dot{v}) + γ_{4} (γ_{2} {\dot{e}}_{θ} + {\dot{w}}_{d} - \dot{w})] \\
= s_{3} \{γ_{3} [γ_{1} {\dot{e}}_{x_{v}} + {\dot{v}}_{d} - f_{1} - \frac{m_{33}}{Ω} (τ_{e q 1} + τ_{e q 2} + τ_{s w}) - D_{1}] + γ_{4} [γ_{2} {\dot{e}}_{θ} + {\dot{w}}_{d} - f_{2} - \frac{1}{m_{22}} (τ_{e q 1} + τ_{e q 2} + τ_{s w}) - D_{2}]\} \\
= s_{3} [- η_{1} o (s_{3}) sgn (s_{3}) - μ_{1} s_{3} + γ_{1} ({\hat{D}}_{1} - D_{1}) + γ_{2} ({\hat{D}}_{2} - D_{2})] \\
\leq - η_{1} o (s_{3}) |s_{3}| - μ_{1} s_{3}^{2} + |s_{3}| (γ_{1} |{\tilde{D}}_{1}| + γ_{2} |{\tilde{D}}_{2}|)
\end{array}

(34)

By designing $η_{1} o (s_{3}) > γ_{1} |{\tilde{D}}_{1}| + γ_{2} |{\tilde{D}}_{2}|$, the following result can be obtained.

{\dot{V}}_{3} \leq - η_{1} s_{3}^{2} \leq - \frac{η_{1}}{2} V_{3}

(35)

From Lemma 1 in [38], the following equation can be obtained.

V_{3} (t) \leq e^{- \frac{η_{1}}{2} (t - t_{0})} V_{3} (t_{0})

(36)

It can be seen that $V_{3} (t)$ converges exponentially to 0, and the rate of convergence depends on $η_{1}$.
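The exponential bound of Equation (36) can be checked numerically on a simple case. The sketch below assumes surface dynamics $\dot{s}_{3} = - η_{1} s_{3}$, which gives $\dot{V}_{3} = - η_{1} s_{3}^{2}$ and therefore meets the first inequality of Equation (35) with equality; this choice, the forward-Euler integration, the function names, and the numerical values are assumptions made for illustration, not the closed loop of the paper.

```python
import math


def lyapunov_bound(v0, eta1, t, t0=0.0):
    """Right-hand side of Equation (36): exp(-eta1/2 * (t - t0)) * V3(t0)."""
    return math.exp(-0.5 * eta1 * (t - t0)) * v0


def simulate_v3(s0, eta1, dt=1e-3, t_end=5.0):
    """Forward-Euler integration of s_dot = -eta1 * s; returns a list of (t, V3)."""
    steps = int(round(t_end / dt))
    s = s0
    history = [(0.0, 0.5 * s * s)]
    for k in range(1, steps + 1):
        s += dt * (-eta1 * s)
        history.append((k * dt, 0.5 * s * s))
    return history


if __name__ == "__main__":
    eta1 = 2.0                      # hypothetical gain
    hist = simulate_v3(s0=1.0, eta1=eta1)
    v0 = hist[0][1]
    inside = all(v <= lyapunov_bound(v0, eta1, t) + 1e-12 for t, v in hist)
    print("V3 stays below the bound:", inside)
```

In this simplified case the simulated $V_{3}$ decays faster than the bound, which is consistent with Equation (36) being an upper estimate rather than the exact decay.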

As the preceding equation shows, the error state can reach the sliding surface in a finite amount of time. Subsequently, the first-layer sliding surfaces $s_{1}$ and $s_{2}$ converge asymptotically to zero, so that both the rotational and forward velocities of the WIPR converge to their desired values.
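The structure of the hierarchical control law in Equations (29) to (32) can be sketched for a generic pair of channels $\ddot{ζ}_{i} = f_{i} + b_{i} τ + D_{i}$ that share one input. Everything below is a simplified stand-in under stated assumptions: the class and function names are hypothetical, the adaptive function `adaptive_gain` is a placeholder rather than the $o (s)$ of the paper, and the signs of the reaching terms are derived for the error convention (desired minus actual) used in this sketch.

```python
from dataclasses import dataclass


@dataclass
class Channel:
    """One channel zeta_ddot = f + b * tau + D (all numbers are hypothetical)."""
    gamma: float    # first-layer slope (gamma_1 or gamma_2)
    e: float        # position-like error (e_xv or e_theta)
    e_vel: float    # velocity error (e_v or e_w)
    e_dot: float    # time derivative of e
    zd_ddot: float  # desired acceleration
    f: float        # known drift term
    b: float        # input gain, assumed positive
    d_hat: float    # disturbance estimate supplied by an observer

    def surface(self):
        return self.gamma * self.e + self.e_vel

    def tau_eq(self):
        # Input that makes this channel's surface rate zero when d_hat = D.
        return (self.gamma * self.e_dot + self.zd_ddot - self.f - self.d_hat) / self.b

    def surface_rate(self, tau, d):
        return self.gamma * self.e_dot + self.zd_ddot - self.f - self.b * tau - d


def sign(s):
    return (s > 0) - (s < 0)


def adaptive_gain(s, delta=0.1):
    # Placeholder for o(s): NOT the paper's function, only a smooth gain in [0, 1).
    return abs(s) / (abs(s) + delta)


def hsmc_torque(ch1, ch2, gamma3, gamma4, eta, mu):
    """Total input tau = tau_eq1 + tau_eq2 + tau_sw and the second-layer surface s3."""
    s3 = gamma3 * ch1.surface() + gamma4 * ch2.surface()
    tau_eq1, tau_eq2 = ch1.tau_eq(), ch2.tau_eq()
    den = gamma3 * ch1.b + gamma4 * ch2.b
    reach = eta * adaptive_gain(s3) * sign(s3) + mu * s3
    tau_sw = (-(gamma3 * ch1.b * tau_eq2 + gamma4 * ch2.b * tau_eq1) + reach) / den
    return tau_eq1 + tau_eq2 + tau_sw, s3


if __name__ == "__main__":
    ch1 = Channel(2.0, 0.6, 0.2, 0.2, 0.0, 0.3, 1.5, 0.1)
    ch2 = Channel(3.0, -0.1, 0.1, 0.1, 0.0, -0.2, 0.8, 0.05)
    tau, s3 = hsmc_torque(ch1, ch2, gamma3=1.0, gamma4=0.5, eta=1.0, mu=2.0)
    s3_dot = 1.0 * ch1.surface_rate(tau, 0.1) + 0.5 * ch2.surface_rate(tau, 0.05)
    print(tau, s3, s3_dot)
```

With a perfect disturbance estimate, the switching term cancels the cross-coupling of the two equivalent controls, so the second-layer surface follows the reaching law $\dot{s}_{3} = - η\, o (s_{3})\, sgn (s_{3}) - μ s_{3}$ in this sketch.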

2.6. The Design of the Tilt-Angle Subsystem

The system discussed in the previous section can achieve complete tracking of $ζ_{1}$ and $ζ_{2}$ within a finite time, which allows the tilt-angle dynamics to be written in the following form:

{\ddot{ζ}}_{3} = \frac{1}{Ω} (m_{11} c_{32} {\dot{ζ}}_{2 d} - m_{31} c_{13} {\dot{ζ}}_{3} + m_{11} g_{3}) - \frac{m_{31}}{Ω} τ_{3} + D_{3},

(37)

As the WIPR aims to keep the pendulum upright and stable during its motion, the desired values ($α_{d}, {\dot{α}}_{d}, {\ddot{α}}_{d}$) can all be set to zero. The tracking errors are then defined as follows:

\{\begin{array}{l} e_{α} = α_{d} - α = - α \\ e_{\dot{α}} = {\dot{α}}_{d} - \dot{α} = - \dot{α} \end{array},

(38)

Let the sliding mode surface be defined as Equation (39), with its derivative expressed as Equation (40).

s_{4} = γ_{5} e_{α} + e_{\dot{α}},

(39)

{\dot{s}}_{4} = γ_{5} {\dot{e}}_{α} + {\dot{e}}_{\dot{α}} = - γ_{5} \dot{α} - \ddot{α} = - η_{2} o (s_{4}) sgn (s_{4}) - μ_{2} s_{4},

(40)

After substituting Equation (37), the control law for the tilt angle subsystem can be derived as presented in Equation (41).

τ_{α} = - \frac{Ω}{m_{31}} (γ_{5} \dot{α} + {\hat{D}}_{3}) - \frac{1}{m_{31}} (m_{11} c_{12} {\dot{ζ}}_{2} - m_{31} c_{13} {\dot{ζ}}_{3} + m_{11} g_{3} - η_{2} o (s_{4}) sgn (s_{4})) - μ_{2} s_{4}

(41)

To prove the stability of the designed system, the Lyapunov function is chosen as follows.

V_{4} = \frac{1}{2} s_{4}^{2}

(42)

The time derivative of $V_{4}$ is given by the following expression.

\begin{array}{l}
{\dot{V}}_{4} = s_{4} {\dot{s}}_{4} = s_{4} (- γ_{5} \dot{α} - \ddot{α}) \\
= s_{4} (- γ_{5} \dot{α} - f_{3} - \frac{1}{m_{22}} τ_{α} - D_{3}) \\
= s_{4} [- η_{2} o (s_{4}) sgn (s_{4}) - μ_{2} s_{4} + γ_{5} ({\hat{D}}_{3} - D_{3})] \\
\leq - η_{2} o (s_{4}) |s_{4}| - μ_{2} s_{4}^{2} + γ_{5} |s_{4}| |{\tilde{D}}_{3}|
\end{array}

(43)

Choosing $η_{2} o (s_{4}) > γ_{5} |{\tilde{D}}_{3}|$ ensures that ${\dot{V}}_{4} < 0$ holds, indicating that the system achieves asymptotic stability.
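The behavior of the tilt-angle loop can be illustrated on a deliberately simplified model. The sketch below assumes a double integrator $\ddot{α} = b τ + D_{3}$ with a hypothetical unit gain instead of the full dynamics of Equation (37), uses no observer (the estimate of $D_{3}$ is taken as zero), and replaces the product $o (s_{4})\, sgn (s_{4})$ with the boundary-layer function $\tanh (s_{4} / ε)$, which is a common smoothing and not the adaptive function of the paper. The disturbance is $D_{3}$ from Equation (44); all gains are made up.

```python
import math


def simulate_tilt(alpha0=0.2, gamma5=5.0, eta2=1.0, mu2=5.0, b=1.0,
                  eps=0.01, dt=1e-3, t_end=10.0):
    """Euler simulation of alpha_ddot = b * tau + D3 under a sliding mode law."""
    alpha, alpha_dot = alpha0, 0.0
    steps = int(round(t_end / dt))
    for k in range(steps):
        t = k * dt
        d3 = 0.6 * math.sin(0.6 * t)        # disturbance D3, Equation (44)
        s4 = -gamma5 * alpha - alpha_dot    # Equation (39) with alpha_d = 0
        # Reaching law s4_dot = -eta2 * tanh(s4 / eps) - mu2 * s4 (smoothed sign).
        reach = eta2 * math.tanh(s4 / eps) + mu2 * s4
        tau = (-gamma5 * alpha_dot + reach) / b   # disturbance estimate set to 0
        alpha_ddot = b * tau + d3
        alpha_dot += dt * alpha_ddot
        alpha += dt * alpha_dot
    return alpha, alpha_dot


if __name__ == "__main__":
    alpha_end, alpha_dot_end = simulate_tilt()
    print("tilt angle after 10 s:", alpha_end)
    print("tilt rate after 10 s:", alpha_dot_end)
```

Here the switching gain (1.0) exceeds the disturbance amplitude (0.6), mirroring the gain condition stated above; with the smoothed sign function the angle settles in a small neighborhood of zero rather than exactly at zero.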

Figure 3 displays the schematic block diagram of the control system.

3. Simulation

This section discusses the trajectory-tracking performance of the system in a simulation environment, verifies the feasibility of the proposed control scheme, and examines its advantages over other control schemes. The simulation results of the different control systems under the same disturbance are then compared to assess the control effectiveness of each. The system parameters are listed in Table 2.

The simulation experiments in Matlab/Simulink verified the high-precision trajectory tracking capability of the system and the stability of the pendulum during robot motion. In the simulation study, the initial position of the robot was set as $q_{m} = {[0, 0, 0]}^{T}$, and the initial position of the reference trajectory was set as $q_{mr} = {[0, 0, π / 2]}^{T}$, with a desired forward velocity of 1 m/s and a desired angular velocity of 1 rad/s. The trajectory of the robot should therefore be a circle with a radius of 1 m centered at $(0, 0)$, and the time function of the reference trajectory was chosen as $\{x = \sin t, y = \cos t\}$.
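The reference trajectory can be reproduced with a short script, given here as a minimal sketch. The function names are hypothetical; only the time functions $x = \sin t$, $y = \cos t$ come from the text, and the speed is obtained by differentiating them analytically.

```python
import math


def reference_position(t):
    """Reference point on the circle: x = sin t, y = cos t."""
    return math.sin(t), math.cos(t)


def reference_speed(t):
    """Magnitude of the velocity (x_dot, y_dot) = (cos t, -sin t)."""
    return math.hypot(math.cos(t), -math.sin(t))


if __name__ == "__main__":
    for t in (0.0, 1.0, 2.0, 3.0):
        x, y = reference_position(t)
        radius = math.hypot(x, y)
        print(f"t={t:.1f}  x={x:+.3f}  y={y:+.3f}  "
              f"radius={radius:.3f}  speed={reference_speed(t):.3f}")
```

The printed radius and speed stay at 1 for every sample, matching the 1 m circle and the 1 m/s forward velocity stated above.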

An external perturbation was added, as shown in the following equation.

\{\begin{array}{l} D_{1} = 0.4 \sin (0.4 t) N m \\ D_{2} = 0.5 \cos (0.5 t) N m \\ D_{3} = 0.6 \sin (0.6 t) N m \end{array},

(44)
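For reference, the perturbation of Equation (44) can be written as a plain function. This is only a sketch: the function names are hypothetical, and the amplitudes and frequencies are copied from Equation (44).

```python
import math


def disturbances(t):
    """Return (D1, D2, D3) at time t in seconds, following Equation (44)."""
    d1 = 0.4 * math.sin(0.4 * t)
    d2 = 0.5 * math.cos(0.5 * t)
    d3 = 0.6 * math.sin(0.6 * t)
    return d1, d2, d3


def disturbance_bounds():
    """Peak magnitudes of D1, D2 and D3, read off the amplitudes in Equation (44)."""
    return 0.4, 0.5, 0.6


if __name__ == "__main__":
    for t in (0.0, 1.0, 5.0, 10.0):
        print(t, disturbances(t))
```

The peak magnitudes are the quantities a switching gain has to dominate when no disturbance estimate is available; with an observer, only the estimation error needs to be dominated.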

To demonstrate the advantages of the proposed method, three comparative simulations were conducted under the given disturbance conditions: the first used the unimproved HSMC method; the second used the improved method (IHSMC), with the adaptive law but without the nonlinear disturbance observer; and the third used the method proposed in this paper (referred to as PC).

Figure 4, Figure 5 and Figure 6 show that the proposed method outperforms the other two control methods in both the speed at which the error converges to the steady state and the magnitude of the error fluctuation after the steady state is reached. This reflects the effectiveness of the proposed method, which tracks the given reference trajectory accurately.

Figure 7 and Figure 8 show how the WIPR system tracks the desired velocities under the three control methods. The proposed method tracks the desired forward velocity more rapidly, reaching it at 0.7 s, whereas the other two methods take more than 1 s and fluctuate at a high frequency, which may affect the stability of the WIPR. As can be seen from Figure 8, the proposed method also outperforms the other two methods in tracking the rotational velocity. The convergence is fast and, more importantly, the response is very stable once the velocity tracking reaches the steady state, showing essentially no fluctuation compared with the other two methods.

As can be seen from the pendulum angle of the WIPR shown in Figure 9, the HSMC method with the general exponential reaching law produces more frequent angle oscillations and a less stable system, whereas under the IHSMC method with the improved reaching law the pendulum oscillates more smoothly after stabilizing, and the control performance is clearly better than that of HSMC with the general exponential reaching law. In terms of response time and maximum overshoot, the proposed method outperforms the other two methods, and it continues to operate smoothly and without oscillations once it has reached the stabilization point.

The time histories of the input torque for the three control methods are presented in Figure 10, Figure 11 and Figure 12. The results indicate that when HSMC uses the general exponential reaching law, the chattering of the input torque for the left and right wheels of the WIPR is evident, which adversely affects the actuators (i.e., the drive motors of the left and right wheels). In contrast, Figure 11 illustrates that the improved reaching law significantly reduces chattering, which benefits the actuators.

Figure 12 presents the variation of the input torque under the proposed control method. The input torque obtained by this method is smoother than that of IHSMC, and the chattering suppression is more effective. Among the three methods, the proposed control method therefore yields the smoothest torque input and is the most favorable for the actuators.

Figure 13 shows the trajectory tracking results of the different control systems. Visually, the proposed scheme also follows the reference trajectory more closely than the other two schemes. The comparative simulations thus indicate that the method proposed in this paper is feasible and effective.

4. Conclusions

This paper studied the trajectory-tracking problem for WIPRs and proposed a hierarchical sliding mode controller combined with a nonlinear disturbance observer to achieve accurate tracking of the reference trajectory while maintaining pendulum stability during motion. A nonlinear disturbance observer was designed to make the system more robust to unknown external disturbances. The underactuated, coupled nature of WIPRs was addressed by decoupling the control state variables and dividing the system into two subsystems. The hierarchical sliding mode control method with an improved reaching law was then applied to control the system and suppress chattering. Finally, Lyapunov functions were chosen to verify the stability of the system mathematically.

The feasibility of the control system was verified in simulation. However, considering the complexity of real-world environments and external uncertainties, future work will focus on building a hardware platform for the robot to study the performance of the control method on a physical WIPR.

Author Contributions

All of the nominated authors initially made significant contributions to the paper. Conceptualization, M.H. and X.Z.; methodology, X.Z.; software, X.Z.; validation, X.Z., D.C. and M.H.; formal analysis, M.H.; investigation, M.H.; resources, X.Z.; data curation, X.Z., D.C. and Z.X; writing—original draft preparation, M.H.; writing—review and editing, X.Z., D.C. and Z.X.; visualization, D.C.; supervision, M.H.; project administration, M.H.; funding acquisition, M.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (61971048) and the Key Incubation Project of Beijing University of Information Science and Technology (5212110927).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to internal laboratory confidentiality agreements.

Conflicts of Interest

The authors declare no conflict of interest.

References

Kim, S.; Kwon, S. Nonlinear Optimal Control Design for Underactuated Two-Wheeled Inverted Pendulum Mobile Platform. IEEE/ASME Trans. Mechatron. 2017, 22, 2803–2808.
Chen, M. Robust tracking control for self-balancing mobile robots using disturbance observer. IEEE/CAA J. Autom. Sin. 2017, 4, 458–465.
Kim, Y.; Kwon, S. Robust Stabilization of Underactuated Two-Wheeled Balancing Vehicles on Uncertain Terrains with Nonlinear-Model-Based Disturbance Compensation. Actuators 2022, 11, 339.
Liu, J.; Vazquez, S.; Wu, L.; Marquez, A.; Gao, H.; Franquelo, L.G. Extended State Observer-Based Sliding-Mode Control for Three-Phase Power Converters. IEEE Trans. Ind. Electron. 2017, 64, 22–31.
Yang, C.; Li, Z.; Li, J. Trajectory Planning and Optimized Adaptive Control for a Class of Wheeled Inverted Pendulum Vehicle Models. IEEE Trans. Cybern. 2013, 43, 24–36.
Mirzaeinejad, H. Optimization-based nonlinear control laws with increased robustness for trajectory tracking of non-holonomic wheeled mobile robots. Transp. Res. Part C Emerg. Technol. 2019, 101, 1–17.
Wang, F.-C.; Chen, Y.-H.; Wang, Z.-J.; Liu, C.-H.; Lin, P.-C.; Yen, J.-Y. Decoupled Multi-Loop Robust Control for a Walk-Assistance Robot Employing a Two-Wheeled Inverted Pendulum. Machines 2021, 9, 205.
Yue, M.; Ning, Y.; Yu, S.; Zhang, Y. Composite following control for wheeled inverted pendulum vehicles based on human-robot interaction. Sci. China Inf. Sci. 2019, 62, 50206.
Watson, M.T.; Gladwin, D.T.; Prescott, T.J.; Conran, S.O. Dual-Mode Model Predictive Control of an Omnidirectional Wheeled Inverted Pendulum. IEEE/ASME Trans. Mechatron. 2019, 24, 2964–2975.
Albert, K.; Phogat, K.S.; Anhalt, F.; Banavar, R.N.; Chatterjee, D.; Lohmann, B. Structure-Preserving Constrained Optimal Trajectory Planning of a Wheeled Inverted Pendulum. IEEE Trans. Robot. 2020, 36, 910–923.
Ginoya, D.; Shendge, P.D.; Phadke, S.B. Sliding Mode Control for Mismatched Uncertain Systems Using an Extended Disturbance Observer. IEEE Trans. Ind. Electron. 2014, 61, 1983–1992.
Pathak, K.; Franch, J.; Agrawal, S. Velocity and position control of a wheeled inverted pendulum by partial feedback linearization. IEEE Trans. Robot. 2005, 21, 505–513.
Takei, T.; Imamura, R.; Yuta, S. Baggage Transportation and Navigation by a Wheeled Inverted Pendulum Mobile Robot. IEEE Trans. Ind. Electron. 2009, 56, 3985–3994.
Gong, S.; Zhang, A.; She, J.; Zhang, X.; Liu, Y. Trajectory Design and Tracking Control for Nonlinear Underactuated Wheeled Inverted Pendulum. Math. Probl. Eng. 2018, 2018, e6134764.
Huang, C.-F.; Yeh, T.-J. Anti Slip Balancing Control for Wheeled Inverted Pendulum Vehicles. IEEE Trans. Control Syst. Technol. 2019, 28, 1042–1049.
Yang, C.; Li, Z.; Cui, R.; Xu, B. Neural Network-Based Motion Control of an Underactuated Wheeled Inverted Pendulum Model. IEEE Trans. Neural Netw. Learn. Syst. 2014, 25, 2004–2016.
Ren, C.; Li, X.; Yang, X.; Ma, S. Extended State Observer-Based Sliding Mode Control of an Omnidirectional Mobile Robot With Friction Compensation. IEEE Trans. Ind. Electron. 2019, 66, 9480–9489.
Chen, L.; Wang, H.; Huang, Y.; Ping, Z.; Yu, M.; Zheng, X.; Ye, M.; Hu, Y. Robust hierarchical sliding mode control of a two-wheeled self-balancing vehicle using perturbation estimation. Mech. Syst. Signal Process. 2020, 139, 106584.
Zhou, Y.; Wang, Z. Robust motion control of a two-wheeled inverted pendulum with an input delay based on optimal integral sliding mode manifold. Nonlinear Dyn. 2016, 85, 2065–2074.
Adhikary, N.; Mahanta, C. Integral backstepping sliding mode control for underactuated systems: Swing-up and stabilization of the Cart–Pendulum System. ISA Trans. 2013, 52, 870–880.
Ri, S.; Huang, J.; Wang, Y.; Kim, M.; An, S. Terminal Sliding Mode Control of Mobile Wheeled Inverted Pendulum System with Nonlinear Disturbance Observer. Math. Probl. Eng. 2014, 2014, e284216.
Ping, H.; Hai, W.; Linfeng, L.; Huifang, K.; Ming, Y.; Canghua, J.; Zhihong, M. A novel hierarchical sliding mode control strategy for a two-wheeled self-balancing vehicle. In Proceedings of the 2017 36th Chinese Control Conference (CCC), Dalian, China, 26–28 July 2017; pp. 3731–3736.
Moness, M.; Mahmoud, D.; Hussein, A. Real-time Mamdani-like fuzzy and fusion-based fuzzy controllers for balancing two-wheeled inverted pendulum. J. Ambient. Intell. Humaniz. Comput. 2020, 13, 3577–3593.
Sun, W.; Su, S.-F.; Xia, J.; Wu, Y. Adaptive Tracking Control of Wheeled Inverted Pendulums with Periodic Disturbances. IEEE Trans. Cybern. 2020, 50, 1867–1876.
Palm, R. Sliding mode fuzzy control. In Proceedings of the 1992 IEEE International Conference on Fuzzy Systems, San Diego, CA, USA, 8–12 March 1992; pp. 519–526.
Jmel, I.; Dimassi, H.; Hadj-Said, S.; M’Sahli, F. An adaptive sliding mode observer for inverted pendulum under mass variation and disturbances with experimental validation. ISA Trans. 2020, 102, 264–279.
Huang, J.; Guan, Z.; Matsuno, T.; Fukuda, T.; Sekiyama, K. Sliding-Mode Velocity Control of Mobile-Wheeled Inverted-Pendulum Systems. IEEE Trans. Robot. 2010, 26, 750–758.
Fukushima, H.; Muro, K.; Matsuno, F. Sliding-Mode Control for Transformation to an Inverted Pendulum Mode of a Mobile Robot with Wheel-Arms. IEEE Trans. Ind. Electron. 2015, 62, 4257–4266.
Chen, X.; Komada, S.; Fukuda, T. Design of a nonlinear disturbance observer. IEEE Trans. Ind. Electron. 2000, 47, 429–437.
Saradagi, A.; Muralidharan, V.; Krishnan, V.; Menta, S.; Mahindrakar, A.D. Formation Control and Trajectory Tracking of Nonholonomic Mobile Robots. IEEE Trans. Control Syst. Technol. 2017, 26, 2250–2258.
Yoshida, K.; Sekikawa, M.; Hosomi, K. Nonlinear analysis on purely mechanical stabilization of a wheeled inverted pendulum on a slope. Nonlinear Dyn. 2016, 83, 905–917.
Chen, W.-H.; Yang, J.; Guo, L.; Li, S. Disturbance-Observer-Based Control and Related Methods—An Overview. IEEE Trans. Ind. Electron. 2016, 63, 1083–1095.
Huang, J.; Ri, S.; Liu, L.; Wang, Y.; Kim, J.; Pak, G. Nonlinear Disturbance Observer-Based Dynamic Surface Control of Mobile Wheeled Inverted Pendulum. IEEE Trans. Control Syst. Technol. 2015, 23, 2400–2407.
Xie, L.; Yu, X. State Observer Based Robust Backstepping Fault-Tolerant Control of the Free-Floating Flexible-Joint Space Manipulator. Appl. Sci. 2023, 13, 2634.
Yue, M.; Wei, X.; Li, Z. Adaptive sliding-mode control for two-wheeled inverted pendulum vehicle based on zero-dynamics theory. Nonlinear Dyn. 2014, 76, 459–471.
Chang, W.-J.; Hsu, F.-L. Sliding mode fuzzy control for Takagi–Sugeno fuzzy systems with bilinear consequent part subject to multiple constraints. Inf. Sci. 2016, 327, 258–271.
Xu, J.-X.; Guo, Z.-Q.; Lee, T.H. Design and Implementation of Integral Sliding-Mode Control on an Underactuated Two-Wheeled Mobile Robot. IEEE Trans. Ind. Electron. 2014, 61, 3671–3681.
Liu, Y.; Jing, Y.W.; Liu, X.P.; Li, X.H. Survey on finite-time control for nonlinear systems. Control Theory Appl. 2020, 37, 1–12.

Figure 1. WIPR system.

Figure 2. The adaptive function $o (s)$.

Figure 3. The control system.

Figure 4. The error of x.

Figure 5. The error of y.

Figure 6. The error of θ.

Figure 7. WIPR forward velocity $v$.

Figure 8. WIPR rotation velocity $w$.

Figure 9. The angle of the WIPR pendulum $α$.

Figure 10. Input torque under HSMC.

Figure 11. Input torque under IHSMC.

Figure 12. Input torque under PC.

Figure 13. Tracking of circular trajectories.

Table 1. Parameter descriptions.

Parameter	Description
$m_{w}$	Mass of each wheel
$M$	The total weight of the transport platform plus the pendulum
$I_{w}$	The rotational inertia of each driven wheel
$I_{M}$	The rotational inertia of the transport platform and the pendulum
$d$	The distance between the two wheels
$L$	The length of the pendulum
$τ_{l}$	The torque of the left wheel
$τ_{r}$	The torque of the right wheel
$v$	WIPR forward velocity
$w$	WIPR rotation velocity
$θ$	WIPR yaw angle
$α$	The tilt angle of the pendulum

Table 2. The value of each parameter variable in the system.

Parameter (Unit)	Value
$M$ (kg)	8
$m$ (kg)	0.5
$I_{M}$ ($kg \cdot m^{2}$)	5
$I_{w}$ ($kg \cdot m^{2}$)	0.3
$d$ (m)	0.5
$L$ (m)	0.5
$r$ (m)	0.1

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
