1. Introduction
Iterative learning control (ILC) has been recognized as one of the most effective intelligent control strategies because it needs less prior knowledge of the system parameter requirements and because of its significant performance (e.g., [
1,
2,
3,
4]). The core mission of the ILC mechanism is to design an adequate control input law for achieving perfect repetitive tracking throughout the whole operation duration as the iterations increase. The conventional control laws can be based on proportional-type, derivative-type, and/or integral-type tracking errors, which are exploited to supply sufficient conditions for guaranteeing either an asymptotical or a monotonic convergence (e.g., [
5,
6,
7,
8,
9]). Nevertheless, the conventional control input law is usually embedded with a constant learning gain, with no evaluation of the tracking performance, and its convergence condition is dependent upon the system parameter information.
For improving the tracking performance of conventional ILC schemes, an optimized control input law has been explored by minimizing the performance index function on the basis of the norm-optimal or parameter-optimal framework. Regarding this topic, norm-optimal ILC (NOILC) algorithms have been comprehensively analyzed by solving optimization problems in the form of the principle of the minimum, gradient or Newton or quasi-Newton algorithm (e.g., [
10,
11,
12,
13]). Moreover, parameter-optimal ILC (POILC) has probed the learning gain by minimizing the quadratic norms of the tracking error and the learning gain (e.g., [
14,
15,
16]). It is clear that optimized ILC contributes significantly to the development of ILC.
However, the above-mentioned optimized ILCs are heavily dependent upon the systematic Markov parameter matrix. This makes those optimized ILCs hardly implementable if the system Markov parameters are not available. A feasible option, in a trade-off sense, may be to design a suboptimal ILC scheme to identify the unknown Markov parameters and incorporate the identification of a Markov parameter matrix into the learning control process. In this respect, a novel data-driven ILC has been presented to determine the learning gain matrices with sequential estimations of the Markov parameter matrix, which are acquired by the lower-triangular Toeplitz-spanned matrices of a linear combination of the multi-batch outputs and inputs. However, theoretical analysis of the convergence of either the estimation error or the tracking error has not yet been explored [
17]. Another type of data-driven ILC scheme is a gradient-based learning gain matrix that is constructed in an interactive form with the estimation of the system parameter matrix updated by multi-batch inputs and outputs [
18]. Further, the relevant works are extended to different types of discrete-time systems (e.g., [
19,
20,
21]). It is obvious that these works achieved impressive results. However, in order to avoid the matrix inversion and guarantee the convergence of a chi-type data-driven ILC, the gain matrix of the control law is compromised that replaces the matrix
with
. However, it cannot achieve optimal control in the practical sense. Recently, a newly input–output-driven gain-adaptive ILC has been proposed to abolish the harsh condition [
22].
On the other side, the multi-phase batch process is a common process in which the dynamics of each batch can be described as a switched system whose dynamics switches among a finite number of subsystems with different phases (or time intervals) [
23,
24,
25]. Although the achievement of ILC schemes for single-phase batch processes is remarkable (e.g., [
26,
27,
28,
29]), to date, there have been few investigations of the ILC mechanism for the multi-phase batch process. It was not until 2007 that a formulation of control was presented for the first time for multi-phase batch processes in [
24]. The work that followed first proposed an iterative learning two-dimensional (2D) predictive control model by solving a quadratic programming problem for multi-phase batch processes [
25]. Other studies have explored system stability, shortest running time, interval time-varying delays, and robustness [
30,
31,
32]. Another class of ILC investigations for the multi-phase batch process is conventional ILC schemes that are employed in repetitive switched systems, such as a D-type ILC scheme for a class of switched continuous-time nonlinear systems [
33], a hybrid ILC scheme for a class of discrete-time linear switched systems [
34], and a PD-type ILC for a class of linear continuous-time switched systems with measurement noise [
35]. However, in multi-phase batch processes, neither 2D switched-system ILCs nor conventional ILCs consider the tracking performance. For an ILC scheme, the more learning information is used, the the better control performance that may be obtained. Hence, it is very necessary to exploit an effective ILC strategy with multifarious learning information.
The purpose of this paper is to study a data-driven optimal ILC (DDOILC) for linear discrete-time multi-phase batch processes by incorporating multi-batch learning inputs and outputs to establish the tracking performance index and the parameter evaluation index on the basis of the Owens-type norm-optimal or parameter-optimal mechanism. The main idea is to update the current estimated systematic Markov parameter vector using the residual from the real outputs to the predicted outputs and the lower-triangular Toeplitz matrix of the inputs at each phase so that the parameter evaluation index is minimal. Then, the DDOILC strategy is constructed in an interactive form with Markov parameter vector estimation.
The main contributions are listed as follows. Firstly, we exploit a data-driven ILC for a linear multi-phase batch process by assessing the tracking performances and estimation mechanism, where the optimal input law is independent of the systematic Markov parameter matrix. Secondly, the system states can be different in different phases, which means the end state of the former phase may not be the same as that of the initial state of the next phase at the switched point. Thirdly, by taking advantage of the algebra method, the monotone convergence of the tracking error is derived if uncertainty falls within an appropriate range, and the convergence condition does not depend on the precise system parameter information.
The remainder of this paper is as follows. In
Section 2, the problem description and preliminaries are elaborated for the multi-phase batch process.
Section 3 presents an iterative learning identification algorithm for the system Markov parameters, and
Section 4 exploits a data-driven optimal ILC. The numerical experiment in
Section 5 demonstrates the method’s validity, and
Section 6 concludes the paper.
2. Problem Description and Preliminaries
Consider a multi-phase batch process whose system dynamics in each batch can be depicted by a class of repetitive single-input–single-output (SISO) switched linear systems with the following form:
where
and
represent the sampling time and
is the batch index.
,
and
are the state vector, scalar input, and scalar output, respectively. The function
represents a time- varying switching signal defined as
, and the switching signal shows that each batch is divided into
phases. The matrix pair
switches among the following finite set:
where
,
,
, and
are unknown constant matrices with appropriate dimensions.
Generally, it can be assumed that the switching signal is expressed as follows:
where
.
Since all the states in batch
can be divided into
groups according to the corresponding phases,
can be used to indicate the states of phase
. Then, for phase
, the switched system (1) can be formulated as
where
and
,
. Without loss of generality, set
, where
represents the zero vector and its dimension is the same as the state.
Some super-vectors are denoted as
Then, system (3) is reformulated in an input-output form as
where
is a Markov parameter matrix with a lower triangular Toeplitz structure expressed as
Denote
and
; then, the input-output description (4) is represented by the Markov vector
in
where
is constructed as
Let
be a given desired trajectory of system (3). The objective of ILC is to design an updating control input law
that enables the output
of system (3) to follow the desired trajectory
as precisely as possible as the batch index approaches infinity, represented mathematically as
where
is the tracking error of system (3) and is defined as
. Further, for the whole multi-phase batch process, the desired trajectory
, the output
, and the tracking error
are defined as
It is fairly obvious that the tracking error of the multi-phase batch process is convergent to zero if (), that is, .
Remark 1. It can be seen from the above description that the multi-phase batch process (1) with switching signal (2) is equivalent to system (3). Therefore, the construction of a data-driven optimal ILC for a multi-phase batch process (1) is no different than the construction of the ILC for system (3). Further, in the existing ILC strategies for multi-phase batch process control [30,31,32], the end state of the former phase should be the same as the initial state of the next phase at the switched point. However, in many cases, the dimensions of the states in the different phases may not be the same. In this paper, the dimensions of the states in different phases can be different. 3. Iterative Learning Identification
The main idea of iterative learning identification is that the updating law is designed by minimizing the sum of residual error between the system output and the predicted output and two iteration-adjacent estimations difference.
Definition 1. For the controller, ifand, then the order of the controlleris said to be, noted as.
Definition 2. For the Markov vector, ifand, then the relative degree of system (1) represented by Markov vectoris said to be, denoted as.
Let
be the
-th estimation of the Markov vector
. Then
is the
-th estimation of the Markov matrix
. Further, the
-th output is estimated by
Subtracting (6) from Equation (5) yields the
-th output error
as
Furthermore, the residual error is defined as the difference between the real output and the predicted output:
where
is seen as the
-th predicted output.
For the purpose of generating an updating law to modify the estimation
, the following minimization problem is constructed using the residual of the real output to the predicted output and the two adjacent identifications
and
.
where
is a positive weighting factor to weight the importance of the cost function
to the residual error energy
.
If
, then it is obvious that the matrix
is nonsingular. Further,
By letting the gradient of the objective function of (10) with respect to the argument
equal to zero, it is easy to reach the following learning identification algorithm:
If
, then it is evident that the matrix
is singular, and some super vectors are denoted as
From Definition 1, it is confirmed that the segment
is a null vector and
with the block matrix
is nonsingular. Here,
and
are zero matrices with appropriate dimensions. Moreover, some super vectors are denoted as follows:
Thus, it follows from the above denotations that
Substituting (12) and (13) into (9) yields
Letting the partial derivatives of the objective function of (14) with respect to the argument
and
equal zero yields
Therefore, it is concluded that the iterative learning identification algorithm for Markov vector
is
.
Remark 2. It is worth noting from (9)–(15) that the solutionof the minimization problem (9) is existent and unique. It can be derived from (16) thatwhich implies that the time order of the compensatoris adaptive to the time order of the controller.
For further theoretical analysis, the following lemmas are indispensable and easily stated.
Lemma 1. The equationis identical and holds the compatible nonsingular matricesand.
Lemma 2. The equalityis true for any positive semidefinite matrix.
Lemma 3. The functionstrictly monotonically increases on interval.
Theorem 1. For the iterative learning identification algorithm (16), the estimation error is strictly monotonically declining, that is,.
Proof of Theorem 1. Case 1. Matrix is nonsingular or .
By virtue of Lemma 1 and the iterative learning identification algorithm (16), the estimation error between the real Markov vector
and its
-th estimation can be derived as
Calculating the inner products of both sides of (17) and considering Lemma 2 obtain
Case 2. Matrix is singular or .
By means of the iterative learning identification algorithm (16), the estimation error between the real Markov vector
and its
-th estimation can be calculated as
By utilizing Lemma 1, the first partition of
is derived as
Taking the inner products of both sides of (20) and employing Lemma 2, we obtain
This completes the proof. □
Remark 3. In Theorem 1, the estimation error is strictly monotonically declining, which does not restrict the weighting factor or rely on any reset condition; it is hoped that the iterative learning identification (16) may improve the estimation performance as the iteration increases. Although the iterative learning identification algorithm (16) for a linear discrete-time-invariant system with no perturbation is feasible, whether it is available for a linear time-varying system is not yet verified.
4. Data-Driven Optimal ILC
For the multi-phase batch process, a data-driven optimal ILC scheme is used to construct an updating law of the control command
in a recursive form in order to optimize a criterion listed as
where
is a positive tuning factor to adjust the weight of the cost
to the energy
.
By virtue of the definition of the system tracking error, we have
Substituting (24) into (23) generates
According to the first-order necessary condition of the optimization, solving the optimal problem (25) is equivalent to the gradient of the objective function
with respect to the argument vector
being zero, that is,
It follows from (26) that
Furthermore, the Markov matrix
is replaced by the
-th estimation
, then for system (3), the DDOILC is constructed as
Remark 4. In the case when the system Markov parameters are unavailable, iterative learning identification (16) is embedded into the norm-optimal ILC (27), which is formed in the DDOILC scheme. In (28), the control law is equipped with an inversion matrix, which is prominently different from the existing strategies [19,21,22]; for avoiding matrix inversion and guaranteeing the convergence of the tracking error, the gain matrix of the control law is transigent and replaces the matrixwith.
Theorem 2. Assume that the proposed DDOILC (28) is applied to system (3). Then, the tracking error is monotonically convergent if the uncertaintyfalls within an appropriate domain.
Proof of Theorem 2. Let the relative degree of system (3) be , that is, . Since is a lower-triangular Toeplitz matrix formed by the Markov vector , . For the case when , is full rank and nonsingular. Meanwhile, for the case when , it is obvious that is singular and can be partitioned in blocks as , where is a nonsingular lower-triangular Toeplitz matrix.
Substituting (28) into (24) generates
By denoting
as estimation error of lower-triangular Toeplitz matrix
, we obtain
Case 1. is nonsingular or .
Further, by (30), we obtain
According to (31), we obtain
Considering Lemma 1 and utilizing (32) yields
By means of Lemma 2, Lemma 3, and (33), we achieve
It follows from the above derivation that the eigenvalue function is associated with the elements of the uncertainty and that the inequality is always true . Therefore, it is concluded that is guaranteed if the uncertainty falls into an appropriately small domain. This reveals that the tracking error is monotonically convergent. In particular, it is understood that if .
Case 2. is nonsingular or .
Substituting
and
into
yields
It follows from (36) that
Utilizing Lemma 2 and Equation (37) generates
According to Lemmas 2 and 3, it is easy to obtain
As discussed in Case 1, if the uncertainty falls into an appropriately small domain, then holds. Further, it can be understood from (29) that for all if we only consider the segment in the learning process. According to the expressions and , it is derived that the segment of the output of (4), , is out of control and thus the segment tracking is invalid. For this case, it is feasible that setting as the initial output for compatibility with the zero initial output resetting, leads to . □
5. Numerical Simulation
To demonstrate the effectiveness of the proposed DDOILC (28), a numerical example was considered in the simulation study that offers a comparison between the proposed DDOILC algorithm and the traditional D-type ILC. It should be emphasized that in the simulation, all system parameters are supposed to be unavailable and to just serve as the input and output data generator for the plants to be controlled. No parameter information of the systems will be included in the proposed DDOILC scheme design. The results of the simulation show that the proposed DDOILC has good convergence although the system parameters are unavailable.
Let us consider the following linear discrete-time switched system:
where
,
,
and the switching sequence is assumed to be
Suppose that system (40) contains the following two subsystems,
where
,
,
and
.
It can be noticed that the states of subsystems (1) and (2) are different; in other words, the system states are not identical in different phases.
The desired trajectory is given as
,
. Set the initial state
,
. Then, subsystems (42) and (43) are equivalent to
and
, respectively, where
and
can be seen as unknown lower-triangular Toeplitz matrices. For subsystems (42) and (43), the learning identification algorithm (44) and DDOILC scheme (45) are listed as
where
.
The initial inputs are
and
, where
is an
-dimensional random matrix whose elements are within
. The initial estimation vectors are chosen as
The system output and tracking error of (40) are expressed as
In this section, the proposed DDOILC (44) is applied to system (40), and the tracking performance of the DDOILC is simulated as follows.
Figure 1 exhibits the switching rule produced by (41) with the values 1 and 2.
Figure 2 manifests the convergences of learning identification algorithm (44) with the different parameters selected as
and
under the given parameters
of DDOILC (45), respectively. It is shown that the estimation error of the Markov vector
from the estimated Markov vector
decreases as the batch number increases, that is,
. It is also shown that the rate of convergence of learning identification algorithm (44) accelerates as the parameters
and
decrease.
Figure 3 demonstrates the convergences of the tracking errors of the DDOILC with the parameters chosen as
,
,
, and
under the given parameters
of learning identification algorithm (44), where the tracking errors are measured in the form of 2-norm. It is shown that not only are the tracking errors of the DDOILC monotonically convergent to zero but also the tracking performance is better as the parameters
and
decrease.
Figure 4 meshes the tracking error of the DDOILC with the parameters chosen as
and
;
is over the time-batch plane, which indicates that the tracking error is convergent to zero over the time-batch plane. Thus, the proposed DDOILC is effective.
For the system (40), the outputs of DDOILC with the parameters
and
at the 3rd, 4th, and 15th batches are exhibited in
Figure 5, where the dashed curve refers the desired trajectory, the dotted curve represents the 3rd output, the solid curve shows the output at the 4th batch and the dashed–dotted curve represents the 15th output.
Further, a comparison of the proposed DDOILC with the following D-type ILC was made:
where
is the constant learning gain.
In
Figure 6, a comparison of the tracking errors of the DDOILC and D-type ILC is depicted, where the weight factors of the DDOILC are selected as
and
, and learning gains of the D-type ILC are chosen as
,
, and
. It is shown that the 2-norm of the tracking error of the DDOILC is monotonically convergent to zero, and that the D-type ILC is also effective, but the proposed DDOILC performs better than the D-type ILC. Further,
Figure 7 illustrates a more accurate comparison of tracking errors of the DDOILC and D-type ILC measured by the natural logarithm of the 2-norm based on
Figure 6.
Based on the parameters shown in
Figure 6, the outputs of the DDOILC and D-type ILC for system (40) at 4th batch are exhibited in
Figure 8, where the solid curve represents the desired trajectory, the dotted curve refers the 4th output of DDOILC, and the dashed curve shows the 4th output of the D-type ILC.
Figure 8 reveals that the proposed DDOILC performs better than the D-type ILC.