Adaptive PID Control and Its Application Based on a Double-Layer BP Neural Network

Zhang, Ming-Li; Zhang, Yi-Jie; He, Xiao-Long; Gao, Zheng-Jie

doi:10.3390/pr9081475


¹ School of Economics and Management, Yanshan University, Qinhuangdao 066000, China

² School of Mechanical Engineering, Hebei University of Chinese Medicine, Shijiazhuang 050200, China

³ Key Laboratory for Health Care with Chinese Medicine of Hebei Province, Shijiazhuang 050200, China

* Author to whom correspondence should be addressed.

Processes 2021, 9(8), 1475; https://doi.org/10.3390/pr9081475

Submission received: 24 July 2021 / Revised: 15 August 2021 / Accepted: 17 August 2021 / Published: 23 August 2021

(This article belongs to the Special Issue Neural Networks, Fuzzy Systems and Other Computational Intelligence Techniques for Advanced Process Control)


Abstract:

In this paper, focusing on the inconvenience of variable value PID based on manual parameter adjustment for the hydraulic drive unit (HDU) of a legged robot, a method employing double-layer back propagation (BP) neural networks for learning the law of PID control parameters is proposed. The first layer is used to learn the relationship between different control parameters and the control performance of the system under various working conditions. The second layer is used to study the relationship between the parameters of the working conditions and the optimizing control parameters under various working conditions. The effectiveness of the proposed control method was verified by simulation and experiment. The results showed that the proposed method can provide a theoretical and experimental basis for the selection of control parameters, and can be extended to similar controllers, therefore possessing engineering application value.

Keywords:

legged robot; hydraulic drive unit (HDU); BP neural network; PID control

1. Introduction

Robots can move in a variety of ways. At present, the forms of locomotion can be roughly divided into wheeled [1], tracked [2], wheel-foot compound [3], snake-like [4], bionic legged [5], and so on. Compared with other types of robots, bionic-legged robots are characterized by discontinuous support because their leg structure is similar to that of tetrapods. Especially when combined with hydraulic drive, which has a high power-to-weight ratio, such a robot not only adapts well to unknown and unstructured environments but can also cross obstacles. Therefore, this type of robot is particularly suitable for use in complex field environments.

The leg controller serves as the bottom-level controller of this kind of robot, and each leg of the robot has several degrees of freedom controlled by highly integrated valve cylinders, also known as the hydraulic drive unit (HDU) [6,7]. While the HDU serves as the bottom-level controller of each leg, its control performance directly affects the control strategy and performance of the robot. Commonly, HDU bottom-level control methods can be divided into position control and force control. Based on bottom-level control, control methods of the leg can be extended to compliance control, contact force control, and so on. The above methods are not only applied in electrically driven robots such as Scara [8] and Stewart [9], but they can also be applied to robots such as Bigdog [10], Hydraulic quadrupedal (HyQ) [11], Light Weight Robot (LWR) [12], and Atlas [13].

This paper mainly investigates the performance of the HDU in position control. The HDU position control system is a high-order nonlinear system. Designing a superior control method requires a very detailed understanding of the characteristics of the controlled system. Establishing a mathematical model involves analysis of the controlled system, and an accurate mathematical model can faithfully reflect the dynamic characteristics of the system, closely reproduce the actual system in simulation research, and shorten the design cycle of the control method. High-performance intelligent control methods suitable for low-order nonlinear systems can also be applied to this system. However, in order to ensure the control stability and reliability of the whole machine, such control methods are not often used in engineering practice. The traditional control method is simple to implement and its effect is obvious. Furthermore, changes in the control parameters directly reflect the system characteristics, which can be used for a preliminary analysis of the system performance. Thus, the HDU position control system is still based on traditional PID control.

A neural network is a computational model that comprehensively simulates the neural network of the human brain in terms of structure, mechanism, and function [14,15,16]. By virtue of its complex nonlinear network structure and efficient iterative learning performance, it has obvious advantages over other nonlinear optimization methods. Some research works have shown that neural networks can fit arbitrary nonlinear functions. Swic presented an original machine learning-based automated approach for controlling the machining of low-rigidity shafts using artificial intelligence methods, in which three models of hybrid controllers based on different types of neural networks and genetic algorithms were developed [17]. Rego addressed the problem of finding a control Lyapunov function that keeps the system stable, proposing the use of reinforcement learning with two neural networks based on Lyapunov stability theory [18]. Nobahari focused on developing a nonlinear controller based on convolutional neural networks to control different plants, assuming that prior knowledge of the plants is very limited and that only a history of sensory input–output data is available [19]. Wang studied the hysteresis nonlinear characteristics of piezoelectric actuators and proposed a novel hybrid modeling method based on long short-term memory (LSTM) and nonlinear autoregressive with external input (NARX) neural networks [20].

In this paper, a neural network is used to learn the relationship between the parameters and the control performance under different working conditions and to find the optimal control parameters for the current working conditions, which can improve the control accuracy of the system under various working conditions and eliminate manual parameter adjustment. Compared with variable value PID based on manual parameter adjustment, the method based on neural networks can output continuously varying parameters according to the working conditions, thereby improving the control accuracy. In addition, the neural network method is not restricted to the specific number of conditions in an expert table. Extending the applicable scope of the expert table in this way is therefore of great significance for engineering applications.

The structure and contributions of this paper are as follows. In Section 2, a mathematical model is established for the HDU position control system. In the model, many factors are carefully considered, such as servo valve nonlinearity, flow-pressure nonlinearity, and load characteristics. In Section 3, aiming at the inconvenience of variable value PID based on manual parameter adjustment in engineering practice, a method employing double-layer back propagation (BP) neural networks for learning the law of PID control parameters is proposed, and the simulation results are shown; this is the main contribution of the paper. In Section 4, experimental research is carried out on the HDU performance test platform.

2. Introduction to the Sampling System

The HDU is a highly integrated system that includes a servo valve-controlled cylinder, which is the legged robot joint actuator. Figure 1 shows the quadruped robot prototype, the single leg hydraulic drive system, and the HDU.

Figure 2 shows a block diagram of the closed-loop position control system for the HDU in Figure 1c. The block diagram takes into account such factors as the flow-pressure nonlinearity of the servo valve, the asymmetry of the servo cylinder, the complex variability of the load, and so on. The detailed derivation process is shown in Appendix A.1.

The parameter definitions and simulation values of the above system are shown in Table 1. The purpose of this paper is to present a new PID controller based on neural networks that replaces the PID control parameters in Figure 2.

3. Adaptive PID Parameter Control Method Based on a Double-Layer BP Neural Network

3.1. Learning Strategy Design

Neurons are the basic units of neural networks, and their main function is to simulate the functional characteristics of biological neurons [21,22,23]. Considering that the input of the neural network in this paper comes from the sensor data of the control system, the Tanh function, one of the Sigmoid-type activation functions (referred to simply as the Sigmoid activation function in the following), was selected as the activation function of the neurons.

In order to make the system automatically output the optimal control parameters according to the working conditions, it is necessary to design an appropriate neural network structure first. If the neural network is too simple, the fitting accuracy will be reduced; if it is too complex, convergence will be slow and the generalization ability of the neural network may even be reduced. Therefore, it is very important to design a neural network with an appropriate structure. Learning strategies that enable the neural network to learn effectively must then be designed, including the learning objects of the neural network, the selection of samples, the initial processing of samples, and the iterative learning methods. In this section, a parameter learner based on a double-layer BP neural network is designed, which can realize automatic parameter learning. The overall learning strategy is shown in Figure 3, and the details are explained in the following sections.

3.2. Generation of Learning Samples

Samples are a very important part of neural network learning problems and are the source of the information to be learned. The sample data in this paper were obtained from simulation of the HDU position control system or from experimental collection. The data contained random interference generated by the system itself, and the range of each variable was different, so it was necessary to process the data before they were used for learning. The sample data used in this section had to meet the following conditions:

(1): The samples should cover as wide a range of working conditions and control parameters as possible, and the performance indexes under the corresponding working conditions should be obtained through experiment or simulation, so that the neural network can learn the characteristics of the control system and improve the adaptive ability of the control method.
(2): The sample should be universal. The hydraulic system is a highly nonlinear time-varying system, and the dynamic characteristics of the system change with the different external conditions. Collection of data should be carried out after the hydraulic system has been started up and run stably under good heat dissipation conditions.
(3): The data interval of each variable in the sample should be as consistent as possible, which is beneficial for improving the convergence speed and stability of neural networks.

According to the above conditions and principles, a plan of learning data for the PID position control system of the HDU was designed in this section. By generating the input signals and change signals of the control parameters, then importing them into the control model, automatic data acquisition was realized.

In order to verify the effectiveness of the proposed learning strategy while avoiding unnecessary work, a subset of the overall working conditions of the HDU was selected, and the control parameter range was then narrowed based on the simulation results of the PID control system described in Section 2. The working conditions and control parameters finally determined in this section are shown in Table 2.

The final working conditions are generated by the permutation and combination of sinusoidal frequency, sinusoidal amplitude, and P gain in Table 2, and there are eight groups of sinusoidal frequency, 15 groups of P gain, 10 groups of sinusoidal amplitude, and 1200 working conditions in total. In order to avoid the mutual influence between two adjacent working conditions, each working condition runs for two cycles, with an overall sampling time of approximately 1632 s. Moreover, the mean of the control deviation absolute value at each moment of the last cycle is taken as the basis for evaluating the control performance.
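The enumeration of working conditions and the performance index described above can be sketched as follows. This is a minimal illustration only: the grid values are placeholders chosen to reproduce the stated counts (8 frequencies, 10 amplitudes, 15 P gains), the actual values are those of Table 2, and the function name is hypothetical.

```python
import itertools
import numpy as np

# Placeholder grids: only the counts (8, 10, 15) come from the text;
# the actual values are listed in Table 2.
freqs_hz = np.linspace(0.4, 2.0, 8)
amps_mm = np.linspace(1.0, 5.0, 10)
p_gains = np.linspace(5.0, 20.0, 15)

# Every combination of frequency, amplitude, and P gain is one working condition.
conditions = list(itertools.product(freqs_hz, amps_mm, p_gains))


def mean_abs_deviation_last_cycle(t, e, freq_hz, n_cycles=2):
    """Mean of |e| over the last of n_cycles sinusoidal cycles.

    t is the time measured from the start of this working condition (s),
    e is the control deviation sampled at the same instants.
    """
    t = np.asarray(t, dtype=float)
    e = np.asarray(e, dtype=float)
    period = 1.0 / freq_hz
    last_cycle = t >= (n_cycles - 1) * period
    return float(np.mean(np.abs(e[last_cycle])))
```

Discarding the first cycle keeps the transient left over from the previous working condition out of the index, which is the purpose stated in the text.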

The desired input signals in the simulation are shown in Figure 4. Because of the long sampling time, the sinusoidal curves at different frequencies appear relatively dense in the figure.

The P gain of the controller in the simulation is shown in Figure 5.

The working condition parameters are the sinusoidal frequency and amplitude of the input signal, the control parameter is the P gain of the PID control method, and the performance index of the system is the mean of the absolute value of the control deviation. As can be seen from Table 2, the three variables differ by an order of magnitude in size, which is not beneficial to the learning of the neural network. Therefore, the variables should be appropriately transformed so that their intervals lie roughly between 0 and 1. Accordingly, “data after processing” in the following sections refers to the data after normalization.

3.3. Performance Fitting of Control System

In Section 3.2, the mean of control deviation e under different working conditions and control parameters is obtained through simulation. In this section, neural network 1 is used to fit the relationship among the working condition parameters, the control parameters, and the mean of control deviation e. Neural network 1 can then be used to calculate the mean of control deviation e for different control parameters under each working condition. The parameters with the minimum mean of control deviation e under each working condition are selected, which completes the optimization of the control parameters.

(1): Input and output of the neural network

Neural network 1 was designed. The input of the neural network is a three-dimensional vector whose elements represent the sinusoidal amplitude and frequency of the input signal and the P gain, respectively, and the output is the mean of control deviation e for the corresponding set of parameters.

u_{1} = [\frac{iam}{5}; \frac{freq}{2}; \frac{pgain - 5}{15}]

(1)
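The scaling in Equation (1) can be written as a small helper. This sketch assumes that iam denotes the sinusoidal amplitude in mm, freq the sinusoidal frequency in Hz, and pgain the P gain; the function name is hypothetical.

```python
import numpy as np


def normalize_input(amp_mm, freq_hz, p_gain):
    """Scale one sample as in Equation (1): [iam/5; freq/2; (pgain - 5)/15]."""
    return np.array([amp_mm / 5.0, freq_hz / 2.0, (p_gain - 5.0) / 15.0])
```

With these divisors, an amplitude of 5 mm, a frequency of 2 Hz, and a P gain of 20 all map to 1, so each element stays roughly within the interval from 0 to 1.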

(2): Selection of the loss function

The loss function is the index used to evaluate the model fitting effect, and the goal of neural network learning is to make the loss function as small as possible. The input and output variables of the neural network are continuous values, and the mean square error function is adopted. Its expression is as follows:

J(θ) = \sum_{i = 1}^{m} (h_{θ}(x_{i}) - y_{i})^{2}

(2)

(3): Determination of the neural network structural parameters

The neural network has three layers in total: the input layer, one hidden layer, and the output layer. The hidden layer has 13 neurons, and the activation function is Sigmoid. The overall structure of neural network 1 is shown in Figure 6. The sinusoidal input signals and control parameters are shown in Table 2. The output of the neural network (mean of control deviation e) is the mean of the control deviation e between the input signals and output signals of the HDU position control system.

(4): Training of neural network 1

The input of neural network 1 after data processing is shown in Figure 7.

The output of neural network 1 after data processing is shown in Figure 8.

The processed data are fed into the neural network for learning until the gradient is less than 10⁻⁶ or the mean square deviation is less than 10⁻⁴.
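A minimal NumPy sketch of such a training loop is given below, assuming a 3-13-1 network with a Tanh hidden layer. The linear output neuron, plain batch gradient descent, the learning rate, the weight initialization, and the epoch limit are all assumptions made for illustration; the source does not state which training algorithm was used. Only the two stopping thresholds come from the text.

```python
import numpy as np


def train_bp(X, y, n_hidden=13, lr=0.02, max_epochs=20000,
             grad_tol=1e-6, mse_tol=1e-4, seed=0):
    """Train a one-hidden-layer network by back propagation.

    X: (m, n_in) normalized inputs, y: (m,) normalized targets.
    Returns the weights and the last evaluated mean square error.
    """
    rng = np.random.default_rng(seed)
    m, n_in = X.shape
    W1 = rng.normal(0.0, 0.5, (n_in, n_hidden))
    b1 = np.zeros(n_hidden)
    W2 = rng.normal(0.0, 0.5, n_hidden)
    b2 = 0.0
    mse = np.inf
    for _ in range(max_epochs):
        # Forward pass: tanh hidden layer, linear output (assumed).
        H = np.tanh(X @ W1 + b1)
        err = H @ W2 + b2 - y
        mse = float(np.mean(err ** 2))
        if mse < mse_tol:
            break
        # Backward pass: gradients of the mean square error.
        d_out = 2.0 * err / m
        gW2 = H.T @ d_out
        gb2 = d_out.sum()
        d_hid = np.outer(d_out, W2) * (1.0 - H ** 2)
        gW1 = X.T @ d_hid
        gb1 = d_hid.sum(axis=0)
        grad_norm = np.sqrt((gW1 ** 2).sum() + (gb1 ** 2).sum()
                            + (gW2 ** 2).sum() + gb2 ** 2)
        if grad_norm < grad_tol:
            break
        # Gradient descent update.
        W1 -= lr * gW1
        b1 -= lr * gb1
        W2 -= lr * gW2
        b2 -= lr * gb2
    return (W1, b1, W2, b2), mse


def predict(params, X):
    """Evaluate the trained network on inputs X of shape (m, n_in)."""
    W1, b1, W2, b2 = params
    return np.tanh(X @ W1 + b1) @ W2 + b2
```

Either threshold ends training early; the epoch limit is only a safeguard for the case in which neither threshold is reached.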

3.4. Optimization of the Control Parameters

For a specific working condition, the sinusoidal frequency and amplitude of the input signals are fixed. Taking the control parameters as the independent variables, the mapping established by neural network 1 as the function, and the mean of control deviation e as the dependent variable, the relationship between the control performance and the control parameters under this working condition can be obtained. There is an obvious regularity between the control performance and the control parameters, so the control parameters with better control performance can be read from the resulting curves. The optimal control parameters under each working condition are selected according to certain rules, and neural network 2 is used to learn the relationship between the working condition parameters and the selected control parameters. After learning, neural network 2 is used to change the control parameters adaptively according to the working conditions, so as to realize adaptive control. The specific learning model was designed as follows:

(1): Selection of the neural network input and output

The purpose of neural network 2 is to calculate the control parameters that meet the rules under different working conditions. Therefore, the inputs of the neural network are the sinusoidal frequency and amplitude of the input signals, which are generated by permutation and combination of sinusoidal frequencies of 0.4~2 Hz and sinusoidal amplitudes of 1~5 mm at intervals of 0.01 Hz and 0.05 mm, respectively. The outputs of the neural network are the selected control parameters, which replace the PID control parameters of the model in Figure 2. The overall structure of neural network 2 is shown in Figure 9.

(2): Rules of parameter selection

The control parameters with the minimum of control deviation e are selected to form the output sample of the neural network.
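The sample generation for neural network 2 can be sketched as a grid search over candidate P gains. In this sketch, predict_deviation is a hypothetical stand-in for the trained neural network 1, and the set of candidate gains passed in as p_gains is an assumption, since the text does not state how finely the P gain is searched.

```python
import numpy as np


def build_nn2_samples(predict_deviation, p_gains):
    """Return (inputs, targets) for neural network 2.

    predict_deviation(amp_mm, freq_hz, p_gains) stands in for the trained
    neural network 1 and must accept a NumPy array of candidate P gains.
    """
    freqs_hz = np.round(np.arange(0.4, 2.0 + 1e-9, 0.01), 2)  # 0.4~2 Hz
    amps_mm = np.round(np.arange(1.0, 5.0 + 1e-9, 0.05), 2)   # 1~5 mm
    inputs, targets = [], []
    for freq in freqs_hz:
        for amp in amps_mm:
            deviations = predict_deviation(amp, freq, p_gains)
            # Selection rule: the gain with the minimum predicted deviation.
            best = p_gains[np.argmin(deviations)]
            inputs.append((freq, amp))
            targets.append(best)
    return np.array(inputs), np.array(targets)
```

With the stated intervals, the grid contains 161 frequencies and 81 amplitudes, that is, 13,041 working conditions.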

(3): Training of neural network 2

Neural network 2 consists of three layers, including one hidden layer with 10 neurons. The activation function is Sigmoid, and the loss function is the mean square error.

3.5. Simulation

After training, neural network 2 was applied to the HDU position control system. The updated schematic diagram of the HDU position control system is shown in Figure 10.

When the working condition parameters change, neural network 2 automatically adjusts the control parameters accordingly to realize adaptive control. Based on the MATLAB/Simulink model of the system established in Section 2, a MATLAB function module was introduced to carry out the neural network 2 calculation, and its results were output to the PID control model.

In the simulation, the initial position of the hydraulic cylinder piston was 25 mm, the P gain was the output of neural network 2, the I gain was 2, and the D gain was 0. The simulation working conditions are shown in Table 3.
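The controller configuration described here can be illustrated with a discrete PID whose P gain is looked up at every step. This is a sketch under stated assumptions, not the authors' Simulink model: the parallel PID form, the rectangular integration, the 1 ms step, and the names ScheduledPID and p_gain_fn are all hypothetical. Only the I gain of 2 and the D gain of 0 come from the text.

```python
class ScheduledPID:
    """Discrete PID whose P gain is supplied by a scheduling function."""

    def __init__(self, p_gain_fn, ki=2.0, kd=0.0, dt=1e-3):
        self.p_gain_fn = p_gain_fn  # e.g. a wrapper around neural network 2
        self.ki = ki                # I gain (2 in the simulation)
        self.kd = kd                # D gain (0 in the simulation)
        self.dt = dt                # assumed step size in seconds
        self.integral = 0.0
        self.prev_error = 0.0

    def step(self, reference, measurement, freq_hz, amp_mm):
        """Return the control output for one sample."""
        error = reference - measurement
        self.integral += error * self.dt
        derivative = (error - self.prev_error) / self.dt
        self.prev_error = error
        # The P gain depends on the current working condition.
        kp = self.p_gain_fn(freq_hz, amp_mm)
        return kp * error + self.ki * self.integral + self.kd * derivative
```

Passing the gain as a function keeps the controller independent of how the gain is produced, so a constant, a lookup table, or a trained network can be swapped in for comparison.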

The ideal control deviation (reference signal) was 0 which means that there is no control deviation between the input and the output. The comparison curves with constant and variable value PID are shown in Figure 11 (adaptive PID control based on a neural network is neural network PID for short, control deviation e is deviation e for short).

The control deviation of the adaptive PID control system based on the neural network (the blue curves in Figure 11) is shown in Table 4 (maximal relative deviation is equal to the ratio of the maximum deviation to the sinusoidal amplitude).
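The maximal relative deviation defined above amounts to a one-line computation. The sketch below assumes that the deviation samples and the amplitude share the same unit (mm); the function name is hypothetical.

```python
import numpy as np


def max_relative_deviation(e, amp_mm):
    """Maximum absolute deviation divided by the sinusoidal amplitude."""
    return float(np.max(np.abs(e)) / amp_mm)
```

For example, a peak deviation of 0.25 mm on a 5 mm sinusoid gives a maximal relative deviation of 0.05, that is, 5%.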

According to the simulation results, under the three working conditions, the maximum relative deviation of the adaptive PID method based on a neural network decreased by an average of 31.3% compared with the maximum relative deviation of the constant value PID and increased by 7.87% compared with the maximum relative deviation of the variable value PID. The deviation of the adaptive PID method based on a neural network was greatly reduced compared with the constant value PID, which approached the effect of the manually adjusted PID control parameters and maintained a good control performance under multiple working conditions. Due to space limitations, additional simulation results are not included in this paper.

4. Experiments

4.1. Introduction to the Experimental System

The experiments in this study were carried out on the HDU performance test platform. The platform is mainly composed of two HDUs, which are installed at the top. The HDU on the left adopts position closed-loop control, while the HDU on the right adopts force closed-loop control. In the experiment, the HDU on the left carried out the performance test of the relevant control algorithm, and the HDU on the right carried out zero-force servo control. In each experiment, the working conditions of the left and right HDUs were the same. A photo of the experimental platform is shown in Figure 12a.

The controller used in the experiments is the dSPACE MicroLabBox semi-physical simulation platform shown in Figure 12b. MicroLabBox is supported by a comprehensive dSPACE software package, including the Real-Time Interface (RTI) for Simulink (MathWorks, Natick, MA, USA) for model-based I/O integration and the experiment software ControlDesk, which provides access to the real-time application during run time by means of graphical instruments.

After the control algorithm was built in MATLAB/Simulink, automatic code generation was used to produce target C code that could be run by the controller. Compared with manual C coding, combining MATLAB/Simulink with the code generator makes it possible to design and test control algorithms quickly, avoids the complexity of writing the underlying C code, and speeds up the controller implementation stage. In the experiment, the data sampling frequency was 1 kHz. Figure 13 is the schematic diagram of the experimental signal input and data acquisition.

4.2. Collection of Learning Samples

As a joint actuator of robots, the HDU is the key to determining the motion performance of robots. According to the movement of the robot during trotting, pacing, and other gaits, the proposed sampling range of experimental learning samples is shown in Table 5.

The final working conditions were obtained by permutation and combination of the values in Table 5, giving a total of 324 groups of working conditions, and each group ran for three cycles. In order to avoid mutual influence between adjacent conditions, the mean of the control deviation over the last two cycles was taken as the performance index. The generated system input signal sequences are shown in Figures 14 and 15, and the signal acquisition interface is shown in Figure 16.

4.3. Optimization of the Control Parameters

The samples obtained in Section 4.2 were used to learn the relationship among the working conditions parameters, the control parameters, and the control performance, and the neural network structure and data processing methods used were the same as those in Section 3.3. The training performance of the neural network is shown in Figure 17.

It can be seen that after the completion of neural network learning, the value of the mean square error reached the magnitude 10⁻⁴, which well estimated the control performance and laid a foundation for the next calculation of control parameters.

The control performance index of the HDU was set as follows: the maximum of control deviation e should not exceed 5% of the sinusoidal amplitude. Based on the obtained neural network, the corresponding system performance under different working conditions and the control parameters were calculated, and the control parameters required to meet the control performance requirements were selected. The working condition parameters were taken as the input of neural network 2, and the selected control parameters were taken as the desired output of neural network 2. The sinusoidal frequency of the input signal was 0.5–2 Hz and the amplitude was 5–15 mm, and the input signals were generated by permutation and combination at intervals of 0.01 Hz and 0.05 mm, respectively.
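One possible way to apply the 5% requirement when choosing a P gain for a single working condition is sketched below. The text does not specify which admissible gain is chosen or how the deviation predicted by neural network 1 relates to the maximum deviation, so the tie-breaking (smallest admissible gain), the fallback when no gain is admissible, and the function name are assumptions made purely for illustration.

```python
import numpy as np


def select_gain_meeting_spec(predicted_dev, p_gains, amp_mm, limit=0.05):
    """Pick a P gain whose predicted deviation is within limit * amplitude.

    Assumption: the smallest admissible gain is returned; if no gain is
    admissible, the gain with the smallest predicted deviation is used.
    """
    predicted_dev = np.asarray(predicted_dev, dtype=float)
    p_gains = np.asarray(p_gains, dtype=float)
    admissible = predicted_dev <= limit * amp_mm
    if admissible.any():
        return float(p_gains[admissible].min())
    return float(p_gains[np.argmin(predicted_dev)])
```

Under this assumed rule, the selected gain is not necessarily the one with the smallest deviation, only one that satisfies the requirement.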

The neural network structure and data processing methods used were the same as those used in Section 3.4. The learning performance of neural network 2 is shown in Figure 18.

It can be seen that the neural network converged rapidly, and the value of the mean square error reached an order of magnitude 10⁻¹ after learning, which meets the requirements of controlling parameter adjustment accuracy.

4.4. Experiment of Adaptive PID Control Based on a Neural Network

In order to verify the performance of the adaptive PID control based on a neural network, an experiment was carried out on the performance test platform of the HDU under the working conditions shown in Table 3, and the control performance of the system under different working conditions was tested.

The initial position of the piston of the HDU was 25 mm, and the oil source pressure of the system was 5 MPa. The working conditions were input into the adaptive PID control system based on the neural network, and a deviation curve was obtained, which was compared with the deviation curve of the PID control with constant and variable values, as shown in Figure 19.

The control deviation of the adaptive PID method based on the neural network (the blue curves in Figure 19) is shown in Table 6.

As shown in Figure 19 and Table 6, owing to the parameter selection rules, the control deviation was slightly larger than that of the constant value PID under working condition 1, but it improved greatly over that of the constant value PID method under the other two working conditions. The maximum relative deviation over the three working conditions was reduced by 22.13% on average compared with that of the constant value PID method, which is close to the deviation level of the variable value PID method. On the whole, the control accuracy of the adaptive PID method based on a neural network lay between those of the constant value PID method and the variable value PID method: slightly worse than the latter but better than the former. The method therefore has good adaptability and can maintain good control accuracy under various working conditions.

According to the proposed method in this paper, more parameter information corresponding to working conditions can be learned, and the same research idea can be extended to other control systems with similar structures. Moreover, based on this double-layer BP neural network, other “machine learning” methods such as deep deterministic policy gradient (DDPG) could be researched.

5. Conclusions

In this paper, an adaptive PID control method using a double-layer BP neural network was designed. Neural network 1 is used to fit the relationship among the working condition parameters, the control parameters, and the control performance. Neural network 2 is used to fit the relationship between the working condition parameters and the selected control parameters, and to realize the adaptive adjustment of the PID control parameters according to the working condition parameters. The results showed that the designed method can automatically adjust the control parameters within the learning range and under working conditions near it, and that it has a certain adaptability. It basically achieved the desired control precision. Compared with the constant value PID method, the deviation was reduced by 31.3% in simulation, and the performance was close to that of the variable value PID. Because it avoids the repeated manual adjustment of parameters required by the variable value PID, the method has practical value in engineering.

Author Contributions

Conceptualization, M.-L.Z., Y.-J.Z., X.-L.H. and Z.-J.G.; methodology, M.-L.Z. and Y.-J.Z.; software, X.-L.H. and Z.-J.G.; validation, M.-L.Z. and Y.-J.Z.; formal analysis, X.-L.H. and Z.-J.G.; investigation, M.-L.Z. and Y.-J.Z.; resources, M.-L.Z.; data curation, X.-L.H. and Z.-J.G.; writing (original draft preparation), Z.-J.G.; writing (review and editing), M.-L.Z. and Y.-J.Z.; visualization, X.-L.H.; supervision, M.-L.Z.; project administration, M.-L.Z.; funding acquisition, M.-L.Z. and Y.-J.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Soft Science Research Project of the Innovation Competence Enhancement Plan of Hebei Province (no. 21556105D and no. 21552501D) and the Graduate Innovation Funding Project of Hebei Province (no. CXZZB2021120).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Appendix A.1. Mathematical Model of HDU Position Closed-Loop Control System

Whether the electro-hydraulic servo valve can output the corresponding flow and pressure under the control of the electrical analog signals is the core of the electro-hydraulic servo control system, and whether the model is accurate or not has a great influence on the overall modeling accuracy. The general modeling method for the electro-hydraulic servo valve is linearization at a specific working point (usually at zero position of the servo valve). However, this method cannot accurately reconstruct the characteristics of the servo valve in all working areas. In order to improve the accuracy of the model, the nonlinear factors of pressure and flow for the electro-hydraulic servo valve are considered in this paper, and the flow equations of the electro-hydraulic servo valve were obtained as follows:

The inlet oil flow of the servo valve is:

q_{1} = \begin{cases} K_{d} x_{v} \sqrt{p_{s} - p_{1}}, & x_{v} \geq 0 \\ K_{d} x_{v} \sqrt{p_{1} - p_{0}}, & x_{v} < 0 \end{cases}

(A1)

The return oil flow of the servo valve is:

q_{2} = \begin{cases} K_{d} x_{v} \sqrt{p_{2} - p_{0}}, & x_{v} \geq 0 \\ K_{d} x_{v} \sqrt{p_{s} - p_{2}}, & x_{v} < 0 \end{cases}

(A2)

The equivalent flow coefficient K_{d} is expressed as:

K_{d} = C_{d} W \sqrt{\frac{2}{ρ}}

(A3)

For the convenience of expression and research, let:

K_{1} = \begin{cases} K_{d} \sqrt{p_{s} - p_{1}}, & x_{v} \geq 0 \\ K_{d} \sqrt{p_{1} - p_{0}}, & x_{v} < 0 \end{cases}

(A4)

K_{2} = \begin{cases} K_{d} \sqrt{p_{2} - p_{0}}, & x_{v} \geq 0 \\ K_{d} \sqrt{p_{s} - p_{2}}, & x_{v} < 0 \end{cases}

(A5)

According to Equations (A1), (A2), (A4), and (A5), the flow equations of the servo valve can be written.

The inlet oil flow of the servo valve is:

q_{1} = K_{1} \cdot x_{v}

(A6)

The return oil flow of the servo valve is:

q_{2} = K_{2} \cdot x_{v}

(A7)
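Equations (A4)–(A7) translate directly into a small function. The sketch below assumes that p_s is the supply pressure and p_0 the return pressure, and that p_0 ≤ p_1, p_2 ≤ p_s so that the square roots are real; the function names are hypothetical.

```python
import math


def valve_flow_gains(x_v, p_1, p_2, p_s, p_0, k_d):
    """Return (K_1, K_2) from Equations (A4) and (A5)."""
    if x_v >= 0:
        k_1 = k_d * math.sqrt(p_s - p_1)
        k_2 = k_d * math.sqrt(p_2 - p_0)
    else:
        k_1 = k_d * math.sqrt(p_1 - p_0)
        k_2 = k_d * math.sqrt(p_s - p_2)
    return k_1, k_2


def valve_flows(x_v, p_1, p_2, p_s, p_0, k_d):
    """Return (q_1, q_2) from Equations (A6) and (A7)."""
    k_1, k_2 = valve_flow_gains(x_v, p_1, p_2, p_s, p_0, k_d)
    return k_1 * x_v, k_2 * x_v
```

Because K_1 and K_2 depend on the instantaneous pressures and on the sign of x_v, they have to be re-evaluated at every simulation step rather than being fixed at one operating point.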

The response of the servo valve is often much faster than that of the hydraulic power components. In order to simplify the analysis and design of the dynamic characteristics of the system, the electro-hydraulic servo valve is treated as a second-order oscillation element, and the transfer function from the input voltage of the servo valve to the spool position is given in Equation (A8).

\frac{X_{v}}{U_{g}} = \frac{K_{a} K_{x v}}{\frac{s^{2}}{ω^{2}} + \frac{2 ζ}{ω} s + 1}

(A8)
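As a worked step, and assuming zero initial conditions so that s corresponds to d/dt, Equation (A8) can be written in the time domain, where u_{g} and x_{v} denote the time-domain signals of U_{g} and X_{v}:

\frac{1}{ω^{2}} \frac{d^{2} x_{v}}{d t^{2}} + \frac{2 ζ}{ω} \frac{d x_{v}}{d t} + x_{v} = K_{a} K_{x v} u_{g}

This form can be convenient when the valve dynamics are integrated numerically together with the cylinder equations.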

The hydraulic cylinder is the actuator of the hydraulic system, the final carrier of its output power, and the control object of the electro-hydraulic servo valve. Its dynamic characteristics largely determine the performance of the system. The following assumptions are made: the diameter of the connecting pipe between the servo valve and the hydraulic cylinder is large enough that the pressure loss, the influence of the fluid mass, and the pipeline dynamic characteristics can all be ignored; the pressure within each working cavity of the hydraulic cylinder is uniform; the oil bulk modulus and the oil temperature are constant; and the internal and external leakage of the hydraulic cylinder are laminar. The flow equations of the two working cavities of the asymmetric hydraulic cylinder can then be obtained.

The flow into the rodless cavity of the asymmetric hydraulic cylinder and the oil volume from the servo valve to the rodless cavity are:

\begin{cases} q_{1} = A_{1} \frac{d x_{p}}{d t} + C_{i p} (p_{1} - p_{2}) + \frac{V_{1}}{β_{e}} \frac{d p_{1}}{d t} \\ V_{1} = V_{01} + A_{1} x_{p} \end{cases}

(A9)

The flow out of the rod cavity of the asymmetric hydraulic cylinder and the oil volume from the servo valve to the rod cavity are:

\begin{cases} q_{2} = A_{2} \frac{d x_{p}}{d t} + C_{i p} (p_{1} - p_{2}) - C_{e p} p_{2} - \frac{V_{2}}{β_{e}} \frac{d p_{2}}{d t} \\ V_{2} = V_{02} - A_{2} x_{p} \end{cases}

(A10)

The inlet and return oil cavities of the HDU are all located inside the servo cylinder body. Taking the initial position of the piston in the servo cylinder into account, the initial volumes of the rodless and rod cavities are:

\begin{cases} V_{01} = V_{g 1} + A_{1} L_{0} \\ V_{02} = V_{g 2} + A_{2} (L - L_{0}) \end{cases}

(A11)
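For a quick numerical check, the initial volumes in Equation (A11) and the position-dependent volumes in Equations (A9) and (A10) can be computed from the Table 1 values. This is only a sketch; the function names `initial_volumes` and `chamber_volumes` are hypothetical, and x_p is taken as the piston displacement measured from the initial position L_0.

```python
# Simulation values taken from Table 1.
A_1 = 5.98e-4    # area of the cavity without rod, m^2
A_2 = 3.97e-4    # area of the cavity with rod, m^2
V_G1 = 1.1e-6    # volume of inlet oil cavity pipe, m^3
V_G2 = 2.0e-7    # volume of return oil cavity pipe, m^3
L_TOTAL = 0.07   # total stroke of the servo cylinder piston, m
L_0 = 0.035      # initial position of the servo cylinder piston, m


def initial_volumes(l_0=L_0):
    """Return (V_01, V_02) from Equation (A11)."""
    v_01 = V_G1 + A_1 * l_0
    v_02 = V_G2 + A_2 * (L_TOTAL - l_0)
    return v_01, v_02


def chamber_volumes(x_p, l_0=L_0):
    """Return (V_1, V_2) from Equations (A9) and (A10) at displacement x_p."""
    v_01, v_02 = initial_volumes(l_0)
    return v_01 + A_1 * x_p, v_02 - A_2 * x_p


print(initial_volumes())
print(chamber_volumes(0.01))
```

With the piston at mid-stroke, the rodless cavity holds roughly 2.2 × 10⁻⁵ m³ and the rod cavity roughly 1.4 × 10⁻⁵ m³, so the pipe volumes contribute only a small fraction of each total.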

Because the Coulomb friction force of the hydraulic cylinder is very small relative to the load force, it is included in the load force rather than considered separately. According to Newton’s second law, the force balance equation of the piston is:

A_{1} p_{1} - A_{2} p_{2} = m_{t} \frac{d^{2} x_{p}}{d t^{2}} + B_{p} \frac{d x_{p}}{d t} + K x_{p} + F_{L}

(A12)
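Rearranging Equation (A12) for the piston acceleration gives a form that is convenient for time-domain simulation. The sketch below uses the Table 1 values; the function name `piston_acceleration` and the cavity pressures in the usage lines are hypothetical and serve only as an illustration.

```python
# Simulation values taken from Table 1.
A_1 = 5.98e-4   # area of the cavity without rod, m^2
A_2 = 3.97e-4   # area of the cavity with rod, m^2
M_T = 1.1315    # conversion mass m_t, kg
B_P = 2000.0    # load damping B_p, N*s/m
K_LOAD = 0.0    # load stiffness K, N/m
F_L = 0.0       # load force F_L, N


def piston_acceleration(p_1, p_2, x_p, v_p):
    """Solve Equation (A12) for the piston acceleration in m/s^2.

    p_1, p_2: cavity pressures in Pa; x_p: piston position in m;
    v_p: piston velocity in m/s.
    """
    net_force = A_1 * p_1 - A_2 * p_2 - B_P * v_p - K_LOAD * x_p - F_L
    return net_force / M_T


# Hypothetical pressures: p_1 = 3 MPa, p_2 = 1.5 MPa, piston at rest.
print(piston_acceleration(3.0e6, 1.5e6, 0.0, 0.0))
```

With zero load stiffness and zero load force, as in Table 1, the acceleration vanishes once the damping force B_p times the velocity balances the pressure force, which gives the steady velocity for a fixed pair of cavity pressures.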

The transfer function between the feedback voltage of the position sensor and the position of the piston rod in the servo cylinder is:

\frac{U_{p}}{x_{p}} = K_{X}

(A13)

Combining Equations (A1)–(A13) yields the block diagram of the HDU position closed-loop control system, which is shown in Figure 2 in Section 2.

Appendix A.2. Neuron Model

Let a neuron receive n inputs,

u_{1}, u_{2}, \cdots, u_{n}

, and let the vector

u = [u_{1}; u_{2}; \cdots; u_{n}]

represent these inputs. The neuron assigns a weight to each input element, and the weighted sum plus an offset is called the net input:

z = \sum_{i = 1}^{n} w_{i} u_{i} + b = w^{T} u + b

(A14)

where

w = [w_{1}; w_{2}; \cdots; w_{n}]

is the n-dimensional weight vector and

b

is the offset value.

In the human brain, different input signals cause neurons to produce different electrical signals. Artificial neurons use a nonlinear function to simulate this behavior and obtain the output value of the neuron

x

:

x = f (z)

(A15)

where

f (\cdot)

is referred to as an activation function.
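Equations (A14) and (A15) together describe a single neuron, which can be sketched in a few lines. The function name `neuron_output`, the choice of Tanh as the default activation, and the numbers in the usage lines are assumptions made for illustration only.

```python
import math


def neuron_output(weights, inputs, bias, activation=math.tanh):
    """Compute x = f(w^T u + b) for a single neuron.

    weights and inputs are equal-length sequences of floats; bias is the
    offset value b; activation is the function f in Equation (A15).
    """
    if len(weights) != len(inputs):
        raise ValueError("weights and inputs must have the same length")
    # Net input, Equation (A14).
    z = sum(w * u for w, u in zip(weights, inputs)) + bias
    # Neuron output, Equation (A15).
    return activation(z)


# Hypothetical three-input neuron.
print(neuron_output([0.5, -1.0, 2.0], [1.0, 2.0, 0.5], 1.0))
```

Passing a different callable as `activation` changes only the final step, so the same net input can be reused with any activation function.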

The activation function improves the expressive and learning ability of neural networks. Differentiable activation functions allow the network parameters to be updated by numerical optimization methods, and custom activation functions can limit the input and output ranges of the network, keeping the overall computation within a reasonable range and thereby improving the stability of learning.

Sigmoid activation functions are S-shaped overall: they are close to linear near 0 and tend to saturate at both ends [24,25]. The commonly used Sigmoid activation functions are the logistic activation function and the Tanh activation function.

Logistic activation functions are expressed as:

σ (x) = \frac{1}{1 + e^{- x}}

(A16)

It can be seen that the standard logistic activation function maps data from the real line to the interval (0, 1). After a suitable transformation, the input can cover the whole data range of the sensors in the control system, and the output can be limited to a chosen effective interval while remaining continuously differentiable.

Tanh activation functions are expressed as:

Tanh (x) = \frac{e^{x} - e^{- x}}{e^{x} + e^{- x}}

(A17)

The standard Tanh activation function maps data from the real line to the interval (−1, 1), which can be used to bound the control output value in the system.
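The two activation functions in Equations (A16) and (A17), together with one possible way of limiting the output to an effective interval, can be sketched as follows. The affine rescaling in `scaled_logistic` is an assumption about what such a transformation might look like, not a transformation stated in the text, and the interval 7 to 14 in the usage lines merely borrows the P gain range of Table 2 as an example.

```python
import math


def logistic(x):
    """Logistic activation, Equation (A16), written to avoid overflow."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def logistic_derivative(x):
    """Derivative of the logistic function: sigma(x) * (1 - sigma(x))."""
    s = logistic(x)
    return s * (1.0 - s)


def tanh(x):
    """Tanh activation, Equation (A17)."""
    return math.tanh(x)


def scaled_logistic(x, low, high):
    """Map the logistic output from (0, 1) to (low, high) (assumed form)."""
    return low + (high - low) * logistic(x)


# Hypothetical example: bound an output to the P gain range of Table 2.
for z in (-5.0, 0.0, 5.0):
    print(z, scaled_logistic(z, 7.0, 14.0), tanh(z))
```

Because the rescaling is affine, the scaled function stays continuously differentiable, and its derivative is the logistic derivative multiplied by the interval width.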

References

1. Savaee, E.; Hanzaki, A.R. A new algorithm for calibration of an omni-directional wheeled mobile robot based on effective kinematic parameters estimation. J. Intell. Robot. Syst. 2021, 101, 28.
2. Li, Z.Q.; Chen, L.Q.; Zheng, Q.; Dou, X.Y.; Yang, L. Control of a path following caterpillar robot based on a sliding mode variable structure algorithm. Biosyst. Eng. 2019, 186, 293–306.
3. Chen, Z.H.; Wang, S.K.; Wang, J.Z.; Xu, K.; Lei, T.; Zhang, H.; Wang, X.W.; Liu, D.H.; Si, J.G. Control strategy of stable walking for a hexapod wheel-legged robot. ISA Trans. 2020, 108, 367–380.
4. Luo, M.; Wan, Z.Y.; Sun, Y.N.; Skorina, E.H.; Tao, W.J.; Chen, F.C.; Gopalka, L.; Yang, H.; Onal, C.D. Motion planning and iterative learning control of a modular soft robotic snake. Front. Robot. AI 2020, 7, 299242.
5. Rodino, S.; Curcio, E.M.; Bella, A.D.; Persampieri, M.; Funaro, M.; Carbone, G. Design, simulation, and preliminary validation of a four-legged robot. Machines 2020, 8, 82.
6. Ba, K.X.; Song, Y.H.; Yu, B.; He, X.L.; Huang, Z.P.; Li, C.H.; Yuan, L.P.; Kong, X.D. Dynamics compensation of impedance-based motion control for LHDS of legged robot. Robot. Auton. Syst. 2021, 139, 103704.
7. Li, M.T.; Jiang, Z.Y.; Wang, P.F.; Sun, L.N.; Ge, S.S. Control of a quadruped robot with bionic springy legs in trotting gait. J. Bionic Eng. 2014, 11, 188–198.
8. Souzanchi, K.M.; Arab, A.; Akbarzadeh, T.M.R.; Fateh, M.M. Robust impedance control of uncertain mobile manipulators using time-delay compensation. IEEE Trans. Control. Syst. Technol. 2017, 26, 1942–1953.
9. Chen, Y.H.; Zhao, J.B.; Wang, J.Z.; Li, D.Y. Fractional-order impedance control for a wheel-legged robot. In Proceedings of the 2017 29th Chinese Control and Decision Conference (CCDC), Chongqing, China, 28–30 May 2017.
10. Playter, R.; Buehler, M.; Raibert, M. BigDog. In Proceedings of the Conference on Unmanned Systems Technology VIII, Kissimmee, FL, USA, 17–20 April 2006.
11. Semini, C.; Barasuol, V.; Goldsmith, J.; Frigerio, M.; Focchi, M.; Gao, Y.F.; Caldwell, D.G. Design of the hydraulically actuated, torque-controlled quadruped robot HyQ2Max. IEEE/ASME Trans. Mechatron. 2016, 22, 635–646.
12. Focchi, M.; Barasuol, V.; Havoutis, I.; Buchli, J.; Semini, C.; Caldwell, D.G. Local reflex generation for obstacle negotiation in quadrupedal locomotion. In Proceedings of the Conference on Climbing and Walking Robots (CLAWAR), Sydney, Australia, 14–17 July 2015.
13. Wiedebach, G.; Bertrand, S.; Wu, T.F.; Fiorio, L.; McCrory, S.; Griffin, R.; Nori, F.; Pratt, J. Walking on partial footholds including line contacts with the humanoid robot Atlas. In Proceedings of the 16th IEEE-RAS International Conference on Humanoid Robots (Humanoids), Cancun, Mexico, 15–17 November 2016.
14. Chen, Y.F.; Shen, L.G.; Li, R.J.; Xu, X.C.; Hong, H.C.; Lin, H.J.; Chen, J.R. Quantification of interfacial energies associated with membrane fouling in a membrane bioreactor by using BP and GRNN artificial neural networks. J. Colloid Interface Sci. 2020, 565, 1–10.
15. Tang, M.C.; Zhou, C.C.; Zhang, N.N.; Liu, C.; Pan, J.H.; Cao, S.S. Prediction of the ash content of flotation concentrate based on froth image processing and BP neural network modeling. Int. J. Coal Prep. Util. 2021, 3, 191–202.
16. Lee, C.Y.; Chen, Y.H. Motor fault detection using wavelet transform and improved PSO-BP neural network. Processes 2020, 8, 1322.
17. Swic, A.; Wolos, D.; Klosowski, G. The use of neural networks and genetic algorithms to control low rigidity shafts machining. Sensors 2020, 20, 4683.
18. Rego, R.C.B.; de Araujo, F.M.U. Nonlinear control system with reinforcement learning and neural networks based Lyapunov functions. IEEE Lat. Am. Trans. 2021, 19, 1253–1260.
19. Nobahari, H.; Seifouripour, Y. A nonlinear controller based on the convolutional neural networks. In Proceedings of the 7th International Conference on Robotics and Mechatronics (ICRoM), Tehran, Iran, 20–21 November 2019.
20. Wang, G.; Yao, X.M.; Zhao, W. A novel piezoelectric hysteresis modeling method combining LSTM and NARX neural networks. Mod. Phys. Lett. B 2020, 34, 2050306.
21. Nair, V.; Hinton, G.E. Rectified linear units improve restricted Boltzmann machines. In Proceedings of the 27th International Conference on Machine Learning, Haifa, Israel, 21–24 June 2010.
22. Jinsakul, N.; Tsai, C.F.; Tsai, C.E.; Wu, P. Enhancement of deep learning in image classification performance using Xception with the Swish activation function for colorectal polyp preliminary screening. Mathematics 2019, 7, 1170.
23. Goodfellow, I.; Warde-Farley, D.; Mirza, M.; Courville, A.; Bengio, Y. Maxout networks. In Proceedings of the 30th International Conference on Machine Learning, Atlanta, GA, USA, 16–21 June 2013.
24. Langer, S. Approximating smooth functions by deep neural networks with sigmoid activation function. J. Multivar. Anal. 2021, 182, 104696.
25. Uteuliyeva, M.; Zhumekenov, A.; Kabdolov, O. Fourier neural networks: A comparative study. Intell. Data Anal. 2020, 24, 1107–1120.

Figure 1. Photos of the quadruped robot prototype, the single leg, and the HDU.

Figure 2. Block diagram of the HDU position closed-loop control system.

Figure 3. Learning steps of the adaptive parameters in the neural network.

Figure 4. Desired input position signals in the simulation.

Figure 5. P gain of controller in the simulation.

Figure 6. The overall structure of neural network 1.

Figure 7. Input of neural network 1 after data processing.

Figure 8. Output of the neural network after data processing.

Figure 9. Overall structure of neural network 2.

Figure 10. Updated schematic diagram of the HDU position control system.

Figure 11. Comparative deviation curves of different PID control methods.

Figure 12. Photos of the experimental platform.

Figure 13. Schematic diagram of the experimental signal input and data acquisition.

Figure 14. Input position signals in the experiment.

Figure 15. Control parameter signals in the experiment.

Figure 16. Signal acquisition interface.

Figure 17. Training performance of the neural network.

Figure 18. Training performance of the neural network.

Figure 19. Comparative deviation curves of the different PID control methods.

Table 1. Parameter definitions and simulation values of the HDU position control system.

Parameter/Input	Value	Unit	Parameter/Input	Value	Unit
Servo valve gain $K_{axv}$	4.5 × 10⁻⁴	m/V	Density of hydraulic oil $ρ$	0.867 × 10³	kg/m³
Natural frequency of servo valve $ω_{sv}$	628	rad/s	Position sensor gain $K_{X}$	200	V/m
Damping ratio of servo valve $ζ_{sv}$	0.82	-	Internal leakage coefficient of servo cylinder $C_{i p}$	2.38 × 10⁻¹³	m³/(s·Pa)
Area of the cavity without rod $A_{1}$	5.98 × 10⁻⁴	m²	Conversion mass $m_{t}$	1.1315	kg
Area of the cavity with rod $A_{2}$	3.97 × 10⁻⁴	m²	Effective bulk modulus $β_{e}$	8 × 10⁸	Pa
Volume of inlet oil cavity pipe $V_{g 1}$	1.1 × 10⁻⁶	m³	Load stiffness $K$	0	N/m
Volume of return oil cavity pipe $V_{g 2}$	2.0 × 10⁻⁷	m³	Load damping $B_{p}$	2000	N·s/m
Total stroke of the servo cylinder piston $L$	0.07	m	Conversion coefficient $K_{d}$	1.248 × 10⁻⁴	m²/s
Initial position of the servo cylinder piston $L_{0}$	0.035	m	Input position $X_{r}$	-	m
System supply oil pressure $p_{s}$	5 × 10⁶	Pa	Output position $X_{p}$	-	m
System return oil pressure $p_{0}$	0.5 × 10⁶	Pa	Control deviation $e$	-	m
External leakage coefficient of servo cylinder $C_{e p}$	0	m³/(s·Pa)	Load force $F_{L}$	0	N

Table 2. Collection range of simulation learning samples.

Category	Parameter	Range of Value
Working conditions	Sinusoidal frequency	0.5~4 Hz, with a step of 0.5 Hz
Working conditions	Sinusoidal amplitude	1~10 mm, with a step of 1 mm
Control parameters	P gain	7~14, with a step of 0.5
Control parameters	I gain	2
Control parameters	D gain	0

Table 3. Working conditions in simulation.

Amplitude \ Frequency	0.5 Hz	1 Hz	2 Hz
2 mm	Working condition 1	-	-
4 mm	-	Working condition 2	-
6 mm	-	-	Working condition 3

Table 4. Control deviation of the adaptive PID control system based on a neural network.

Working Conditions	Working Condition 1	Working Condition 2	Working Condition 3
Maximal deviation/mm	0.026	0.096	0.28
Maximal relative deviation	1.3%	2.4%	4.67%

Table 5. Collection range of the learning samples.

Category	Parameter	Range of Value
Working conditions	Sinusoidal frequency	0.5~2 Hz, with a step of 0.3 Hz
Working conditions	Sinusoidal amplitude	2~5 mm, with a step of 1 mm
Control parameters	P gain	10~50, with a step of 5
Control parameters	I gain	2
Control parameters	D gain	0

Table 6. Control deviation of the adaptive PID control system based on a neural network.

Working Conditions	Working Condition 1	Working Condition 2	Working Condition 3
Maximal deviation/mm	0.1	0.21	0.3
Maximal relative deviation	5%	5.25%	5%

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
