1. Introduction
Linear quadratic (LQ) optimal control problems with indeterminacy factors have been widely studied in theoretical and practical perspectives. In most cases, these indeterminacy factors are modeled as stochastic processes. Many achievements in the study of stochastic control have been made, such as minimum variance control [
1], Linear-quadratic-Gaussian (LQG) [
2,
3,
4] and probability density function (PDF) control [
5]. Actually, due to lack of sufficient information and data, the statistical properties of the uncertainties in some special systems are very difficult to obtain. Taking the mobile system as an example, because the terrain conditions cannot be reused, the stochastic model can not be modeled with the insufficient data [
6]. Moreover, there are several uncertainties which are not essentially stochastic.
Non-stochastic uncertainties as an alternative which only consider the range of uncertain information have attracted the interests of scholars. Terra proposed a recursive robust linear quadratic regulator based on the robust weighted least-squares method for discrete-time systems with norm-bounded parametric uncertainties [
7]. Huang investigated the robust quadratic optimal control based on robust optimization for the systems in the presence of ellipsoidal bounded disturbance [
8]. Zhang studied the robust optimal control with adjustable uncertainty sets which treated the uncertainty sets as additional decision variables [
9]. However, the methods above were with the assumption that the system states were observed completely.
Actually, the states in the real systems are usually not measurable completely and accurately. Set-membership filtering can provide an estimated feasible set of true states for systems with non-stochastic noises, and so far, many approaches have been proposed to obtain the solutions [
10,
11,
12,
13]. Based on the set-membership estimation theory, Qian studied the output tracking control problem for systems with ellipsoidal uncertainties in system parameters and noise, and an adaptive robust tracking control algorithm was proposed [
14]. However, the performance index was given with
norm of the output deviation over the finite time horizon, whose transient performance is poorer than the linear quadratic optimal control. Bertsimas studied the linear quadratic control problems with norm-bounded uncertainties [
15]. However, for the imperfect state information case, the state estimation error set was assumed in fixed bounded set which neglected the learning feature of the state estimation. Ping investigated the output feedback robust predictive control with ellipsoidal estimation error sets [
16,
17]. However, the methods updated the estimation state error set by solving one convex optimal problem each time.
Motivated by the above discussions, the LQ control problem with non-stochastic noises is considered in this paper. Specifically, the non-stochastic process and measurement noises are confined to ellipsoidal bounded sets. Ellipsoidal bounded sets as more realistic and flexible non-stochastic uncertain sets, on the one hand, are of good tractability and well-motivated from a stochastic setting, where they naturally occur when a normal distribution is cut off at a certain level of probability. On the one hand, they can describe kinds of bounded uncertainties as well, such as 2-norm bounded (by selecting the identity matrix as the size matrix for the ellipsoid set) [
18]. In addition, compared with the traditional stochastic assumption, the bound of the noises can be obtained more easily. As the noises are bounded, the traditional Kalman filtering and Dynamic Bellman Equation are not applicable the proposed control problem. For the LQ control methods with ellipsoidal bounded noises, most of them considered the states to be completely observed and for the output feedback control methods, most of them need to solve one complex optimization problem at each time to update the estimation state error set. In this paper, we converted the multi-step quadratic global optimal control problem to multiple one-step quadratic local approximate optimal control problems. Then set-membership filtering and robust optimization are adopted to design the state feedback control strategy which ensures the optimality of control performance (the quadratic cost function about the system states and the control law).
The remaining part of this paper is organized as follows: In
Section 2, the problem formulation is presented. Then, in
Section 3, we present the robust control algorithm based on the set-membership filtering and robust optimization. To show the effectiveness of our proposed method, simulation results are presented in
Section 4. Finally, conclusions and future research directions are presented in
Section 5.
Notations. Throughout the paper,
means that the real symmetric matrix
S is non-negative definite (positive definite).
I is the identity matrix with appropriate dimensions. The ellipsoid
is denoted by
where the vector
s is the center of the ellipsoid, the matrix
characterizes its shape.
2. Problem Formulation
Consider the following uncertain discrete-time linear dynamic system,
where
is the state vector,
is the control vector,
is the measure output vector.
and
are the process noise and observation noise respectively, which are confined to specified ellipsoidal sets,
where
and
represent the bounds of noises
and
respectively. The initial state
takes value in the following ellipsoidal set,
Considering the quadratic cost function,
where matrices
and
are non-negative definite and positive definite symmetric matrices of appropriate dimensions, respectively.
Our target of this work is to find a feedback control series
such that the quadratic cost function
is minimized when the noises
and
belong to bounded ellipsoidal uncertain sets. Namely,
Due to the noises and being bounded in this paper, the dynamics Bellman equation cannot be applied to solve the multi-step quadratic optimal control problem. In order to effectively address the proposed optimal control problem, we attempt to find a feasible approximate optimal solution.
Because the value of
is not influenced by the control law
, so we simplified the cost function to
. To ensure the optimality of
J for all the possible values in the noises sets
and
, the minimization of the worst case of
J is considered. To realize the approximate optimality of the multi-step quadratic cost function
J, we optimize each one-step quadratic cost function
(
) which is defined as follows,
Then, the multi-step quadratic global optimal control problem is converted to multiple one-step quadratic local optimal control problems which can provide a feasible approximate optimal solution. Each one-step quadratic optimal control problem is as follows,
In optimization problem (
6), the performance index
is a function with respect to the system states which are not measurable completely. As a result, the state estimation error directly affects the feedback control performance. If we can yield more accurate true state feasible set, the maximum value of
can be more accurate. Then by minimizing the maximum value of
, optimal control law
can be obtained as well.
The following lemmas are used in the designing of the optimal control law.
Lemma 1. S-procedure [19] Let be quadratic functions of the variable ,where , and are with appropriate dimensions. The following condition on , ⋯, ,for all ζ such thatholds if there existsuch that When , the converse holds, provided that there is some such that .
Lemma 2. Schur complement [15] Given the matrices , and U with appropriate dimensions. Letwhere . Then H is positive (semi) definite if and only if the matrix is positive (semi) definite. 3. Robust Optimal Control Strategy
Considering that the states are not measured fully, we firstly adopted set-membership filtering to realize the states estimation. Then, based on the states estimation, taking the negative impact of the estimation error and noises into consideration, we designed the robust optimal control strategy in this paper.
3.1. Estimation of True State Feasible Set
For the unknown-but-bounded noises, set-membership filtering can provide a true state feasible set which contains the true states all the time. In addition, the filtering gain can be obtained off-line with small computation burden.
According to set-membership filtering theory, assuming that at instant the true state feasible set is , where is the center of state feasible set, is used to define its shape.
At instant
k, the true states
belong to the optimal ellipsoidal feasible set
, the center
and the matrix
can be updated as follows,
The scalar parameters
and
are selected to make the sizes of ellipsoidal sets
and
to be minimized respectively. There are two common criteria to optimize size of the ellipsoid. One is Minimum-trace criterion, and the other is Minimum-volume criterion [
13]. In this paper, we adopt the latter one and
satisfies
where
denote the eigenvalues of matrix
and
n is the dimension of the state.
satisfies
where
denote the eigenvalues of matrix
,
is given by (12) and
is the derivative of
with respect to
.
3.2. Analysis of State Estimation Error
The center
of true state feasible ellipsoid set is adopted as the estimated state in this paper, then state estimation error can be defined as follows,
Based on the set-membership filtering results, it yields that at instant
k the estimation error
belongs to the following ellipsoid set,
namely,
the shape matrix
of error ellipsoid set can be refreshed according to Equations (9)–(13), which avoids solving complex convex optimization problem.
3.3. Robust Control Strategy
According to the definition
and the dynamics system state Equation (
1), we have
Here, let
then Equation (
14) can be rewritten as follows,
Substituting Equation (
15) into Equation (
5), it has
where
It can be seen from (
16) that
contains two uncertainties, one is the process noise and the other is the estimation state error. To realize the control optimality, how to deal with the uncertainties effectively is important. In this subsection, we introduce two uncertainties into the constrains of optimization problem, based on the lemmas outlined in
Section 2, the uncertain problem (
6) can be converted into one certain semidefinite programming (SDP) problem by using the robust optimization technique.
The main result is given by the following theorem.
Theorem 1. Under the bounded ellipsoid uncertain noises, the problem (6) can be solved by the following certain semi-definite programming (SDP) problem,where λ, , and z are the decision variables, andthe control law is Proof of Theorem 1. For the system performance index in Equation (
16), because
and
, therefore
and
, respectively. Meanwhile, because
and
are symmetric matrices, so
is symmetric and positive definite matrix. Therefore, it can yield that
.
Let
, then substituting it into Equation (
16), it has
At stage
k, the first two parts of
can be considered as a known constant. Let
the optimal problem of
can be converted to be that of
,
Therefore, the problem (
6) is equivalent to the following problem,
By introducing an auxiliary variable
, problem (
18) can be written as follows,
the constrain in (
19)
can be rewritten in the following form
Let
it yields
namely,
the constrain in (
19)
can be rewritten in the following form
And the constrain in (
19)
namely,
can be rewritten as follows
According to Lemma 1 (S-procedure), for all
that satisfies (
20) and (
21), the Equation (
22) holds if there exist
such that
namely,
According to Lemma 2 (Schur complement), it yields
This completes the proof. □
The proposed control law
, let
, then the control law can be rewiritten as
, which contains two parts: the estimated states feedback and uncertainties compensation. For the first part, the feedback gain
can be computed off-line. For the second part
, it denotes the compensation for the estimated states error and noises which can be obtained by solving the semi-definite programming problem (
17).
4. Simulation Analysis
This section illustrates the features of the non-stochastic robust control law proposed in this paper by the following two examples compared with the output feedback robust model predictive control(OFRMPC) in [
16].
The proposed robust optimal control algorithm can be run as follows.
Step 1. Set , input and .
Step 2. Update true state feasible set
according to Equations (
7)–(13).
Step 3. Calculate control law
by solving optimal problem (
17).
Step 4. Apply to the system. If , stop; Otherwise, set and go back to Step 2.
Example 1. Consider the following two-dimension uncertain linear system,where , , , , and are the state, the control input, the measured output, the process noise and the observation noise, respectively. The noises are independent mutually and belong to the ellipsoid sets,
where
The original state
belongs to the following ellipsoid set,
where
The matrices in performance index are with
and they are selected as non-negative definite and positive definite symmetric matrices of appropriate dimensions, respectively. By using Sedumi toolbox, LMI toolbox and Optimization Toolbox in MATLAB, the simulation results are as follows.
Figure 1 and
Figure 2 show the comparison of the system state and the estimated states between the proposed method and the OFRMPC method.
Figure 3 and
Figure 4 display the estimated errors of system states for the proposed method and OFRMPC method. Here, the estimated states are the centers of the estimated state feasible sets. It can be seen that the estimated states with the proposed method are closer to the true state.
Figure 5 displays the evolution of true state feasible sets for the proposed method. The blue line means the bound of estimated state feasible ellipsoidal sets. It shows that the feasible states sets from instant 2 become smaller and smaller. Meanwhile, they contain the true states all the time and this can be seen directly from the partial enlarged detail in the figure.
Figure 6 shows the curve of the control law for the proposed method and the OFRMPC method. It is observed that the control law proposed in this paper tends to zero and the control law for OFRMPC fluctuates around zero. The upper bound
of the cost function at each time is shown in
Figure 7 and they become small gradually. And the quadratic cost function of the proposed method is 49.753, while the quadratic cost function for OFRMPC method is 55.324. This implies that the proposed method has better control performance.
Example 2. For the position control problem of the industrial robot with six degrees of freedom, because most of the motion in the robot processing is at low speed except the singularity, the coupling of the motion of the robot is weak. In addition, a reasonable path can be planned for the singularity to keep the robot away from its own singularity and avoid strong coupling between movements in different directions. In reference [20], the position control model of industrial robot is simplified as an independent but identical second-order system with disturbance in each direction,where , , are the displacement and velocity for the center point of the end of the manipulator along X, Y, Z direction respectively. The system matrices and input matrices in each direction are as follows,where t is time. The disturbance matrices in each direction are as follows,which contain the system noises, the un-modeled dynamics and the coupling effect of the motion of the robot in X, Y, Z directions and are bounded. Because the dynamics system in each direction has the same structure, here we take the motion control in
X direction as an example to show the effectiveness of the proposed method. The laser tracker is used to measure the position for the center point of the end of the manipulator, the measurement accuracy is
m/m. By discretizing the dynamics system in
X direction with the discrete sampling time
, it yields
The noises are independent mutually and belong to the ellipsoid sets,
where
The original state
belongs to the following ellipsoid set,
where
and the matrices in performance index are with
The simulation results are as follows.
Figure 8 and
Figure 9 display the estimation errors of displacement and velocity for the center point of the end of the manipulator in
X direction for the proposed method and OFRMPC method. It can be seen that the estimation errors of the proposed method are smaller than those of OFRMPC method and tend to zeros after 15 s.
Figure 10 displays the evolution of true displacement and velocity feasible sets for the proposed method. It shows that the feasible sets become smaller and smaller. Meanwhile, they contain the true states all the time.
Figure 11 shows the curve of the control law for the proposed method and the OFRMPC method. The upper bound
of the cost function at each time is shown in
Figure 12 and they become small gradually. The quadratic cost function of the proposed method is 224.3114, while the quadratic cost function for OFRMPC method is 281.3348. This implies that the proposed method has better control performance.