1. Introduction
Pneumatic actuators have a wide range of applications in industrial automation and robotics, such as in load positioning [
1], vehicles’ active suspension [
2], air-brake systems [
3], conveyor belt systems [
4], pneumatic muscle actuators (PMA) [
5,
6] and soft robotics [
7,
8]. Nevertheless, the precise control of pneumatic systems requires further consideration as it cannot be satisfactorily achieved through many model-based control schemes. This emanates from the nonlinear and uncertain characteristics of pneumatic systems, which are caused by various factors such as dead-zone, air compressibility, frictional forces effects, and changes in airflow rate parameters, making obtaining an accurate model for such systems challenging [
9,
10,
11,
12].
Among the early attempts to control pneumatic systems, proportional-integral-derivative (PID) controllers [
13,
14] and their combination with acceleration feedback and nonlinear compensators [
15,
16] received more attention. Despite PID controllers’ popularity in industrial applications, they could not obtain the desired level of robustness, accuracy, and speed in controlling pneumatic systems. To improve robustness and overcome uncertainties, different robust control methods, including the H
∞ [
17,
18,
19], quantitative feedback theory (QFT) [
20,
21] and sliding mode control (SMC) theory [
22,
23,
24], have been proposed. The SMC controllers have demonstrated satisfactory performance in overcoming uncertainties in pneumatic systems, provided that the uncertainties’ boundaries are available to the controller. Adaptive controllers have also shown promising performance in controlling pneumatic actuators [
25,
26,
27,
28], and although they do not need any prior information on the uncertainties’ boundaries, an accurate model of each plant variation must be available to the controller for adaptation. However, obtaining accurate uncertainty boundaries or system models for most pneumatic actuators is challenging, and therefore, model-free control approaches, such as intelligent control methods, have attracted much interest. This includes fuzzy logic [
6,
29,
30,
31,
32] and neural network-based controllers [
33,
34]. The intelligent control methods have also been combined with other controllers, such as the adaptive method, to achieve better performance in controlling pneumatic systems [
35,
36]. Nevertheless, employing a fuzzy logic controller requires expert knowledge or field test results, and using neural network controllers needs a training phase.
The iterative learning control (ILC) algorithm is an alternative intelligent method usually used for repetitive processes [
37]. In this method, information from previous repetitions is used to learn about a system’s dynamics for generating a more suitable control signal. This learning process is performed in an iterative manner to improve the controller’s performance from one iteration to the other for achieving a zero-error convergence. ILC algorithms are particularly useful in real-time control systems, given their relatively quick response to the changes of the input signal. Many industrial processes are repetitive, which means the same control action should be performed repeatedly. Therefore, it is reasonable to make use of previously acquired data for improving a controller’s convergence and robustness in such processes. The difference between ILC and other learning-type control methods, such as the neural network and adaptive control, is that the ILC only modifies the control signal according to predefined control law. In contrast, other learning-type controllers monitor the system’s performance and update their control law during the process, accordingly [
37]. Unlike an adaptive controller, ILC methods do not need any information on the system’s model and only operate based on the historical input and output. Moreover, contrasting to other intelligent controllers, no training is required, and a well-selected ILC method should be able to converge to the expected state within a few iterations [
38]. The ILC has been used for controlling a pneumatic actuated X-Y table [
39] and was combined with PID to control a simplified model of a pneumatic servo system [
40]. Recently, the ILC method was proposed for accurate tracking of PMA [
41,
42] and controlling pneumatic valves [
43].
This paper suggests an ILC method to overcome the nonlinearities and uncertainties resulting from air characteristics, pressure loss, leakage and load variations in a pneumatic system consisting of a valve, a mechanical actuator and the connecting pipes. The considered pneumatic system for this study is a cylinder-piston actuator, which is the most commonly used type of pneumatic system in the industry. Although ILC is a model-free control method, in
Section 2, the mathematical model and physical properties of the considered pneumatic actuator have been discussed to demonstrate the system’s nonlinearities and uncertainties. MATLAB SimScape blocks have been used for simulating the system to provide a more detailed model than those used in previous studies on the ILC-controlled pneumatic systems. The asymptotic stability, monotonic convergent and zero steady-state error are the three aspects that should be considered in designing an ILC algorithm, which originally limits the ILC application to linear, repetitive systems. In this paper, the application of ILC has been further expanded to control the considered pneumatic system containing nonlinear behavior and responding to non-repetitive signals. The design of the proposed ILC method is done through a detailed mathematical analysis based on the physical properties of the considered actuator that is provided in
Section 3. The simulation results, provided in
Section 4, show that the designed ILC controller is capable of tracking a non-repetitive reference signal and can overcome the internal and payload uncertainties.
2. Mathematical Modeling and Simulation of the System
The considered pneumatic actuator in this paper follows the system given in [
44], which is a double-acting pneumatic cylinder controlled by a 4-port-3-position (4/3) electro-pneumatic valve, as is depicted in
Figure 1. According to the ideal gas law
and are the gas’s pressure, density, compressibility factor, and temperature, respectively, and is the gas constant. The ideal gas model is sufficiently accurate for modeling dry air under standard conditions.
In the control volume approach, each pneumatic component is considered as an internal node enclosed by a control surface. The mass flow rate in the control surface can be expressed as
and
are the mass flow rates of gas entering and leaving the control surface. The gas volume properties are denoted by subscript
, such as in
and
, representing the pressure and temperature of the gas volume in the internal node, respectively.
is the mass flow rate of the gas volume with respect to pressure at constant temperature and volume; and
is the mass flow rate of the gas volume with respect to temperature at constant pressure and volume. For an ideal gas,
Similarly, the control surface heat flow rate can be expressed as
where
and
are the energy flow rates due to the gas entering and leaving the control surface.
is the heat flow rate as a result of heat transferring between the control system and its surrounding, and
is the internal energy of the gas volume in the internal node. For an ideal gas,
where
and
are the specific enthalpy and specific heat capacity of the gas volume in the internal node, respectively.
For simulating the gas behavior in SimScape, a supply unit, as shown in
Figure 2, has been set up. The gas properties, together with the thermal conductivity and dynamic viscosity, which are used in modeling the gas transport behavior, are defined in the SimScape Gas properties block. The system’s reference temperature and pressure, which are taken as atmospheric temperature and pressure in this model, are defined by the SimScape Reservoir block, and the air compressor is modeled by a Pressure source block. The SimScape Pressure source block has two ports and is capable of maintaining a constant pressure difference between its ports. Therefore, by connecting the input port to the reservoir, a set pressure gas can be injected into the system.
The pipes in the system can be modeled using SimScape’s Pipe (G), thermal mass, convective heat transfer and temperature source blocks, as shown in
Figure 3.
The parameters used in calculating the gas pressure drop and convective heat transfer between the gas flowing in the pipe and the pipe’s wall are set in the Pipe (G) block. The pressure drop at each end of the pipe is given as
where
is the pipe’s cross-sectional area, and
is the pressure losses due to viscous friction,
. The
notation is used to abbreviate the equations and can be replaced by
or
. This notation will be followed in the rest of this section. The pressure loss in a pipe depends on Reynolds numbers. The Reynolds number at each end of the pipe is equal to
where
denotes the pipe’s diameter. If the calculated Reynolds number is less than 2000, the gas flow follows the laminar regime, and the pressure drop is equal to
For Reynolds numbers greater than 4000, the gas flow follows the turbulent flow regime, with a pressure drop of
denotes the pipe’s length and
is the aggregate equivalent length of local resistances, which has been taken as 10% of the pipe’s length. The shape factor,
, is considered as a constant number equal to 2.59, but the Darcy friction factor,
, is a function of the Reynolds number and is equal to
where
is the internal surface absolute roughness. For the Reynolds numbers between 2000 and 4000, a transition between laminar and turbulence regimes has been assumed.
The convective heat transfer between the gas flowing in the pipe and the pipe’s wall are modeled as
where
and
are the specific heat capacity and thermal conductivity calculated at the average temperature.
is the Nusselt number,
is the gas volume thermal conductivity, and
and
are the inlet and pipe internal wall temperatures, respectively. The value of the Nusselt number depends on the flow regime. For the laminar flow, the Nusselt number is taken as a constant, 3.66, and for turbulence flow is equal to
and are the Reynolds and Prandtl numbers, which are obtained at the average temperature.
The pipe properties for modeling the conduction heat transfer in the pipe’s wall can be adjusted in the thermal mass block, and the parameters for modeling the convective heat transfer between the pipe’s wall and the surrounding environment can be set in the convective heat transfer and temperature source blocks. The temperature source can maintain the atmospheric temperature regardless of the amount of heat flow into the system. The conduction heat transfer in the pipe’s wall can be mathematically modeled by
where
,
and
are the pipe’s wall density, specific heat capacity, and temperature. Finally, the convective heat transfer between the pipe’s wall and the surrounding environment can be expressed as
where
and
are the surrounding air thermal conductivity and temperature.
The next component of the pneumatic actuator is the valve, which can be modeled as a set of restrictions capable of controlling the gas flow according to an input control signal. The restriction causes contraction of the gas at its input port, followed by the gas expansion at its output port, which results in a pressure drop across the ports. Considering the process as adiabatic, this pressure difference can be modeled as
where
is the restriction’s cross-sectional area, and
is the gas volume density at it.
denotes the discharge coefficient, and
is the ratio of the restriction cross-sectional area to the ports cross-sectional area. It is assumed that both ports have the same dimensions. As is seen from Equation (15), the pressure difference is proportional to the square of the gas flow rate,
, which is typical in the turbulence regime. However, in the laminar regime, the pressure difference is linearly proportional to the gas flow rate, and thus
can be approximated as
where
and
are the pressures of the inflow and outflow gases into the restriction, and
is the laminar flow pressure ratio, which is taken as a constant, 0.999. The amount of pressure at the restriction is equal to
For the laminar regime, this can be approximated as
The local restriction (G) block has been used for modeling the valve in the SimScape environment. Each active state of the valve can be modeled by two local restriction blocks connecting ports P and T to ports A and B in
Figure 1. The block allows modeling the valve leakage by defining a non-zero minimum restriction area,
. Moreover, the valve spool displacement,
, has been used to adjust the orifice area. The orifice cross-sectional area is linearly proportional to the spool displacement as
where
is the maximum spool displacement causing the maximum cross-sectional area in the restriction,
, and
is the minimum value for the spool displacement.
Figure 4 shows the model of the valve using SimScape components.
The complete model for the cylinder using SimScape blocks is depicted in
Figure 5. The cylinder consists of two chambers, where each can be modeled using the translational mechanical converter (G) block in the SimScape to model the relation between the gas pressure inside a chamber and the applied mechanical force to the interface. Each cylinder chamber can be considered as an internal node, with a mass flow rate of
A heat flow rate of
where
is the energy flow rate as a result of the gas transportation into/out of the chamber, and
is due to the convective heat transfer between the gas in the chamber and the cylinder’s body. The partial derivative terms in Equations (20) and (21) can be obtained from Equations (3) and (5), respectively. Equations (11)–(14) are also applicable here to model the heat transfer between the gas in the chamber and the surrounding environment.
The gas volume in the chamber depends on the displacement of the moving interface, and is equal to
where
denotes the dead volume, and
and
represent the interface cross-sectional area and displacement. The sign of the displacement value depends on the movement direction of the interface. The applied force to the interface can be expressed as
where
is the atmospheric pressure. Therefore, the total force caused by the gas pressure in chambers 1 and 2, considering the forces’ directions, is equal to
Moreover, a translational hard stop block is used to restrict the cylinder interface movement within the length of the cylinder. The viscous friction coefficient for the piston movement can be modelled by a translational damper block. Two mass blocks are used for modelling the load and piston masses. The convective heat transfer, thermal mass, and temperature source blocks can model the heat transfer between the chamber gases and the surrounding environment. The motion equation for the load connected to the piston rod is equal to
where
denotes the rod’s displacement,
is the piston mass, and
is the load mass, which causes the payload force of
on the actuator.
represents the viscous friction coefficient for the piston movement.
3. Controller Design
This section covers the procedure for designing an ILC controller for the considered pneumatic actuator covered in
Section 2. The ILC method can generally be applied to repetitive processes. In such processes, every iteration starts and ends at the same conditions and lasts for a fixed period of time,
. The state-space equation for a repetitive linear time-invariant system (with no feedthrough) can be represented as
denotes the trial number, and
,
and
respectively represent the state variable vector, output, and input of the system at the
iteration.
are the state space system matrices. Suppose the system’s desired output for every iteration is
, which makes the error at the
iteration equal to
The idea of the ILC is to define a control law using previous trials’ information such that the error monotonically decreases in every new iteration until it reaches zero (
). The control signal is calculated according to a recursive law as
If is designed in a way that , then the learning law is known as noncausal. Moreover, the control signal generated at the iteration can be calculated based on the information collected from all previous iterations. This method is called the high-order ILC (HOILC). However, a simplified law is preferred as long as it can attain the convergence with satisfactory speed.
The ILC algorithm can be presented in discrete-time, which is more suitable for being implemented by a microcontroller. For a sampling time of
, where
, the system can be considered as
,
and
. The system’s output can then be calculated as
where
denotes the forward time-shift operator as
. For a rational LTI system,
can be expanded into
The error at the
iteration is equal to
The ILC control law in the discrete-time domain can then be presented as
where
represents any sample in the range of
In such cases, it is common to implement the ILC method in the form of digital filters.
We begin our design by considering a general form for the ILC law as
To represent (34) in matrix format, let us consider
sample sequences for the input, output and desired signals as
And
,
,
and
as matrices equal to
Therefore, (34) can be represented as
An ILC method is regarded to be asymptotically stable (AS) if
The converged control signal can be defined as
, and in order for (37) to be AS,
where
and
is the
eigenvalue of matrix
.
Theorem 1. The asymptotic error of the controlled system is equal towhereis anidentity matrix. Proof of Theorem 1. Based on (32),
and following (30)
, which makes
. Therefore, the asymptotic error,
can be obtained as
and by using (37) when
□
Theorem 2. In order to have , , and should be selected as Proof of Theorem 2. From (40), for
, we should have
□
Although by selecting matrices as (41), a zero asymptotic error can be achieved, and the transient error also needs to be analyzed to prevent having significant transient errors in the system response. The controlled system is called monotonically convergent if
where
is the Euclidean norm, and
is the convergence rate.
Theorem 3. For the ILC law given in (34) we have,wheredenotes the maximum singular value operator. Using values obtained in (41)
and
Considering that in ILC
, we have
where
denotes an all one
vector. Therefore,
Since for the matrices
and
given in (36) it can be proved that
, we have
□
Using the results obtained from (39), (41), and (43), the ILC law given in (37) can achieve asymptotic stability as well as monotonic convergent and zero steady-state error when it is in the form of
However, further consideration has to be taken into account before using (44) to design a controller for the pneumatic actuator. First, the considered pneumatic actuator is a nonlinear system with the state space equation of
We assume that
and
are global Lipschitz continuous (GLC) functions. Therefore,
By taking into account that the considered pneumatic system is asymptotically stable and for a sampling time of
, where
, the system can be considered as
where
and
. This follows the Lyapunov stability analysis, where it is assumed that for a physically stable system all the linear terms are in
and
and higher-order terms are in
and
, in the sense that when
gets small,
and
get small faster. This means that the nonlinearities’ effects in the system’s dynamic would eventually vanish, and that the system has a dominating linear characteristic at its steady-state condition. The remaining nonlinearities is as a result of input effect,
, which can be denoted as
. Using (47), the system’s output can be presented as
where
term is as a result of the nonlinearities’ residue in the model. By applying the proposed ILC law given in (37), we have
In order to make sure that the ILC method is AS, an additional term has been added to the ILC law as
where
. The existence of
is guaranteed as
and
are assumed to be GLC functions. As the nonlinearities in the system can be controlled (removed) by adding the proportional control term,
, the rest of the system can be considered linear, and (39), (41), and (43) will be held.
The ILC, in theory, is effective for repetitive processes. However, we would like to expand the ILC application to control the pneumatic system responding to non-repetitive inputs and disturbances. For this purpose, the proof given for Theorem 3 has to be revisited. As part of this proof, we had
However, as the process is not repetitive, we cannot consider
. Instead, we should select the
value as its maximum, since this choice can help the condition to be asymptotically held as
. Therefore, the ILC law for controlling the pneumatic actuator can be summarized as:
4. Results and Discussion
Table 1 presents the parameters’ values used in simulating the piston-cylinder pneumatic actuator.
The effect of applied voltage on the valve’s cross-sectional area, where the valve supplies port B for the negative voltage values and connects the supply port P to port A for the positive values, is shown in
Figure 6. As is seen, a dead-band behavior appears in the vicinity of 0 V. This follows the pattern of the same in the physical actuator studied in [
44], shown in
Figure 6 of this reference.
The gas flow rate,
, in the valve with respect to the applied voltage values is presented in
Figure 7. As a result of leakage, the gas flow rate for the applied voltage less than ~0.5 V remains at zero. By applying more voltage, the gas flow rate increases by following a laminar regime until it reaches a point that the flow rate enters into a turbulence regime, in which a decrease in the flow rate slope can be observed. This follows the pattern of the same in the physical actuator studied in [
44] shown in the
Figure 7 of this reference.
We then apply an input voltage to the valve and measure the rod’s displacement. This is depicted in
Figure 8. The input signal is selected in such a way as to take the valve’s spool to its extreme positions. An applied 12 V signal charges chamber 1 and allows gas to be exhausted from chamber 2, which causes the rod to extend to its maximum displacement, 0.2 m. The signal has been applied long enough so that the system can be settled. The valve is then moved to its neutral state by applying 0 V, and as is shown, the rod’s displacement remains unchanged. By applying −12 V to the valve, the gas in chamber 1 exhausts, and chamber 2 fills with gas, moving the piston inward until it reaches zero. Again, a 0 V signal has been applied to move the valve to its neutral state, where holding the rod’s position unchanged. The response transition time is around 2 s.
The frequency response of the system has also been studied. As was discussed, the air is compressible and has a low damping characteristic, which causes a nonlinear response and increases the pneumatic system’s dynamic order. Moreover, before the system can apply any force to a load, the pipes and cylinders have to be filled with air. This results in further nonlinearities in the form of dead-band and transmission attenuations. Moreover, a pneumatic system is affected by frictional forces caused by the mechanical parts’ movements. All these uncertainties and nonlinearities make linearizing a pneumatic system a complicated practice that generates inaccurate results. Therefore, a system identification approach has been implemented to estimate the frequency response of the pneumatic actuator. In this approach, a set of sinusoidal signals with different frequencies are applied to the system and the piston’s rod displacement is measured. The collected results are then used to draw the system’s bode plot. The estimated bode plot for the pneumatic actuator obtained by applying sinusoidal voltage signals in the range of 0.5 to 100 Hz is given in
Figure 9, showing around 1.6 rad/s frequency bandwidth for the considered pneumatic actuator.
According to [
45], servo pneumatic actuators in approximately 70% of industrial applications should move 1–10 kg payloads with ±2 to ±0.02 mm precision. Therefore, in this paper, we consider the same criteria in studying the performance of the designed controllers. One major advantage of ILC is that it does not need accurate knowledge of the system model, and instead, it can learn from the system’s historical input and output. However, some degree of estimation can help to obtain a better design with a faster convergence. For this purpose, the estimated bode plot, given in
Figure 9, will be used. As is seen, the system is a lowpass with the bandwidth of 1.6 rad/s. Therefore, we use
to estimate the linear part of the system. Since the system’s bandwidth is 1.6 rad/s, it can be assumed that the system does not have much of fluctuations over a period of one second. Therefore, by taking
, we can consider that the system would be seen as a repetitive process from the controller’s perspective. The
for the system given by (53) based on
is equal to
and
should be selected such that
and
. Theoretically,
will perfectly satisfy both conditions. However, as
this design would take the system into saturation due to its high gain value of
. Considering the maximum rod’s displacement as 0.2 m and the maximum input voltage as 12 V, the maximum gain value should be limited to
Another observation from (56) is that the maximum number of non-zero elements in each column is two, which means that regardless of the selected value for
, only two consecutive samples are used in the control law. As a result,
would be sufficient for implementing the ILC to control this pneumatic actuator. Therefore, we consider
as a
lower triangular matrix and calculate its arrays’ values.
Although from the theoretical perspective, a larger value of
improves the AS condition of the ILC method, from the practical aspect, it should be limited to 60 to prevent the system from saturation.
Figure 10 shows the relation between
and
to avoid saturation. From a repetitive process perspective, the best value would be
to minimize the transient error as is given by (43). However, the inputs to the considered pneumatic system are not repetitive, and following (52), the values in (57) are selected as
, which makes
and
, satisfying the conditions in (52). The value for
should be in the range of
to satisfy the AS condition of the ILC method, which we adjusted to 0.25 in this design. However, choosing the optimal value for
requires the system’s knowledge, which is not available.
Based on the above discussions, the designed ILC method for the pneumatic actuator system is implemented as
We further used the obtained frequency response of the system, given in
Figure 9, to design a PID controller in order to compare its performance to that of the proposed ILC method. The frequency domain techniques, namely the Nichols chart and Inverse Nichols chart, have been used to achieve the required performance in reference tracking and disturbance rejection. To obtain the precision of ±0.002 m, the tracking boundaries are designed to be in the range of
and the sensitivity bound is designed to be less than 3 dB for the lower frequencies (
) and less than 6 dB for all frequencies. The obtained PID controller is calculated as
Moreover, two recently proposed intelligent model-free control approaches have been selected to compare the performance of the designed ILC method with theirs. The first approach, proposed in [
46], uses type-2 Takagi–Sugeno (T-S) fuzzy systems with the memory state feedback control. The controller is expressed using the state variable
as
where
denotes the interval type-2 (IT2) fuzzy set of the premise variable,
, and
and
are the number of fuzzy IF-THEN rules and fuzzy sets of controller, respectively.
are nonlinear weight coefficient functions that are calculated based on upper and lower membership functions. The footprint of uncertainty (FOU) for the employed T-S fuzzy system is given in
Figure 11. Two fuzzy rules are defined taking the system error as the state variable and the coefficients are selected according to a procedure explained in the reference as
and
. The feedback control gains are then calculated as
and
.
In the other approach proposed in [
47], a type-1 T–S fuzzy observer is used to develop a piecewise control strategy for dealing with the nonlinear terms in the system. The membership functions employed for constructing the observer is depicted in
Figure 12. The controller gain is calculated as [5.37, −5.37].
The performance of the controllers in tracking a reference signal under a constant load of 10 kg is shown in
Figure 13. As is seen, the designed ILC is capable of tracking a non-repetitive signal. Next, we compare the performance of the ILC with the other controllers, which is given in
Table 2. It can be seen that the ILC demonstrates a faster response, however, it demonstrates overshoots in the transient response. The maximum overshoot percentage happens during the forward motion when the actuator moves from the resting to reach 0.1 m, which equals 13.5%. This can be considered as the major drawback of the ILC, which is caused by the way that ILC calculates the control signal. The ILC only modifies the control signal according to predefined control law and based on the historical input and output. Therefore, a sudden change in the input signal will cause overshoot in the control signal. However, in a well-designed ILC that holds monotonic convergent, the gap between the input and output is reduced after sufficient iterations. The ILC can obtain ±0.002 m precision (2% settling time) in 0.52 s compared to 0.46 s for the PID and 0.35 s for the IT2 T_S fuzzy controller. The controller based on T-S fuzzy observer could not achieve the required precision. The difference between the reference and output of the system controlled by each of these controllers is shown in
Figure 14, and the summation of the square of the error (
) is given in
Table 2. The ILC demonstrates superior performance than other controllers in terms of both speed (rise time) and tracking accuracy.
Next, we compare the performance of the controllers in overcoming the payload uncertainty, as is shown in
Figure 15. The top plot in this figure shows the payload variation applied to the piston while the reference signal remains constant at 0.1 m. As is seen, the ILC-controlled system can maintain the required precession (±0.002 m) regardless of changes in the load. However, this cannot be adequately achieved with the other controllers. The plot of the disturbance rejection error is given in
Figure 16. The summation of the square of the error for the controllers is given in
Table 3. Again, the ILC shows a better performance than the other controllers in overcoming the uncertainties.