Next Article in Journal
RBFNN-Based Anti-Input Saturation Control for Hypersonic Vehicles
Previous Article in Journal
Round-Robin Study for Ice Adhesion Tests
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Asynchronously H Tracking Control and Optimization for Switched Flight Vehicles with Time-Varying Delay

1
Unmanned System Reasearch Institute, Northwestern Polytechnical University, Xi’an 710072, China
2
Institute of Acoustics, Chinese Academy of Sciences, Beijing 100190, China
*
Author to whom correspondence should be addressed.
Aerospace 2024, 11(2), 107; https://doi.org/10.3390/aerospace11020107
Submission received: 1 December 2023 / Revised: 10 January 2024 / Accepted: 19 January 2024 / Published: 24 January 2024

Abstract

:
The current paper verifies the asynchronous H control and optimization problem for flight vehicles with a time-varying delay. The nonlinear dynamic model and Jacobian linearization establish the flight vehicle’s switched model. An asynchronous H tracking controller is designed, considering the existing asynchronous switching between the controllers and corresponding subsystems. In order to promote transient efficiency, the tracking controller comprises the model-based part and the learning-based part. The model-based part guarantees stability and prescribed efficiency, and the learning-based part compensates for undesirable uncertainties. The multiple Lyapunov function (MLF) and mode-dependent average dwell time (MDADT) methods are utilized to guarantee stability and the specified attenuation efficiency. The existing conditions and the solutions of model-based sub-controllers are represented by linear matrix inequalities (LMIs). The deep Q learning (DQL) algorithm provides the learning-based part. Different from the conventional method, the controller parameters are scheduled online. Therefore, robustness, stability, and dynamic efficiency can be met simultaneously. A numerical example illustrates the efficiency and advantage of the presented approach.

1. Introduction

As an efficient way to access space, flight vehicles have attracted considerable attention owing to their high civilian and military value [1,2]. The rapid development of various flight vehicles provides efficient and convenient tools for multiple missions, such as remote attacks, autonomous detection, and material transportation. Due to their potential applications in industry, agricultural military, and other fields, fruitful research results have emerged in relation to flight vehicles. The problems of stability analysis, control strategy design, and the formation control of flight vehicles have been studied by scholars at home and abroad [3,4]. For instance, tracking control in the event-triggered case was proposed in [5]; the observer-based backstepping control strategy was presented in [6] and an extended state observer was proposed in [7]. However, the characteristics of high nonlinearity, complex dynamics, strong couplings, and model uncertainties seriously influence the performance of flight vehicles, putting forward higher requirements for reliability and control performance.
Nonlinear systems have created many concerns due to the nonlinear nature of most practical systems [8,9]. The most popular approaches to analyzing nonlinear systems can be composed of two parts: the nonlinear design method and the linear design method [10]. The nonlinear properties of nonlinear systems are considered in the nonlinear design method, such as the backstepping and adaptive control methods. The linear design method approximates the nonlinear systems with linear systems around their operation points based on Jacobian linearization. Therefore, the controller design and analysis for complex nonlinear systems can be converted into designing controllers for linear systems, such as the gain-scheduled method, the proportional-integral-derivative (PID) method, and robust control. The linear design method can provide solvable results and stability guarantees for complex nonlinear systems. As an effective tool with which to address the design of complex nonlinear systems, switched systems theory has been investigated by researchers, and there is much literature on modeling, stability analysis, switching strategy design, fault detection, and fault-tolerance [11,12]. The switched systems establish the connection between complicated nonlinear and simplified linear systems [13], attracting a great deal of attention. In recent literature, various interesting results have been presented for various problems of switched systems. As a fundamental controller design problem, stability analysis has been fully studied, and several methods have been proposed. For example, the common Lyapunov function (CLF) method was presented for the switched systems with arbitrary switching, meaning a CLF was shared for all the subsystems. Therefore, the switching among subsystems cannot increase the energy. However, in most practical situations, finding a CLF for all the subsystems is a challenge, limiting the CLF method’s applications. Moreover, the switching logic depends on the time, states, or their combination in many practical applications, motivating the investigation of average dwell time (ADT) and MDADT methods. This means that the subsystems should dwell long enough in the subsystems with poor performance, which is mainly applied to restricted switching. Compared with the existing CLF approach, the ADT and MDADT methods lead to less conservative results. In [14], the quantized H filtering problem was investigated for switched T-S fuzzy systems, and the ADT method was utilized to guarantee the exponential stability of the error system with a given H performance. The quantization phenomenon and parameter perturbations were considered, and the fuzzy-based Lyapunov function approach was provided. In [1], the stability of highly nonlinear switched stochastic systems containing the time-varying delay was analyzed. Lyapunov function and ADT were employed to derive sufficient conditions to ensure the H stability to avoid the inappropriate response induced by the time delay. The proposed method extends the stability results to the environment with a time delay, which is more applicable in a practical environment. Moreover, it can be deduced that the common dwell time can be employed for all the subsystems, indicating the worst situation, which results in conservativeness. The MDADT approach was presented in [15] to derive narrower bounds on dwell time. The features of every subsystem are considered, which has its own dwell time. This means that the dwell time depends on the system mods, releasing the ADT method restrictions [16]. Therefore, the MDADT method has been widely applied for stability analysis and stabilization. In [17], the fault estimation observer was proposed. The MDADT method was presented to realize the augmented system’s stability and H performance. Compared with the conventional ADT approach, fewer conservative results were realized. In [18], the stability and robust control for switched systems were verified. The multiple discontinuous Lyapunov function and MDADT were integrated to guarantee the stability and prescribed weighted performance. It turns out that the proposed method realizes small bounds on dwell time. In [19], the event-triggered exponential H filter design was investigated, and the MDADT approach was adopted to derive the exponential stability conditions. In this study, the event-triggered communication scheme is applied to promote the resource limitation through the network. The time-varying delay and bounded disturbance are considered, and the LMI technique is employed to derive sufficient conditions to attain the desired efficiency. Compared with the conventional results, the presented approach is more applicable and less conservative. One of the primary purposes of the switched systems is to obtain fewer conservative results and narrower dwell time bounds, which has motivated researchers in recent years.
In most practical systems, a time-varying delay is inevitable due to the transmission limitation in the flight vehicles network, degrading the performance and causing asynchronous switching. The asynchronous switching indicates that the controller switching lags behind the system mode switching. Therefore, we can see unmatched and matched periods in all subsystems, which will increase the Lyapunov energy in the unmatched periods. Thus, the energy function increases in the unmatched periods and decreases in the matched ones. There are many results of time delay and asynchronous switching. In [12], the stability and stabilization problems were studied for switched systems with impulsive switching signals under asynchronous switching. A novel Lyapunov-like function was established, and the conditions to ensure the system’s exponential stability were given by the edge-dependent switching signals. In [20], finite-time stabilization and finite-time bounded stabilization were investigated for switched systems. The asynchronous environmental switching was considered, and the sufficient stabilization conditions were presented as nonlinear differential matrix inequalities. The results in the paper validate that asynchronous switching can affect the system’s dynamic performance, and it is essential to avoid the undesirable response induced by asynchronous switching. Moreover, the multiple event-triggered strategies for switched systems with asynchronous switching were investigated in [21]. The controller-mode-dependent Lyapunov function was established using an asynchronous switching strategy. The multiple event-triggered schemes were applied, and the stability criteria were provided based on the ADT method. The state feedback controller was proposed to avoid the Zeno behavior. Additionally, the conventional method for time-delay systems lies in the Lyapunov–Krasovskii function. The time-varying delay bounds were considered, and the robust stability was realized for the worst case of unknown delay. In [22], the formation-containment control problem was verified for multi-agent systems (MAS) containing time-varying delays and switching topologies. It is assumed that the leaders can communicate through switching topologies with time-varying delays. The Lyapunov–Krasovskii function method was applied to ensure the convergence of the formation-containment error. An edge-based state observer was developed to evaluate the MAS states involving the time delay. The results demonstrate that the Lyapunov–Krasovskii function method can successfully solve the stabilization problem for time-delay systems. In [8], the adaptive fuzzy control problem was investigated for switched nonlinear systems containing the input delay. A nonlinear disturbance observer for arbitrary switching systems was presented, and a piecewise switched adaptive law was developed. Padé approximation and dynamic surface control methods were proposed to address the input delay problem. Unlike the traditional method, the proposed method realizes fewer conservative results and more relevant results. From the statement mentioned in the paper, it is evident that developing more applicable and less conservative methods for switched systems is interesting and necessary.
It is noticed that the model-dependent methods are proposed assuming that the structure or bounds of model uncertainties are known in prior [23,24]. However, in many situations in a realistic environment, the information of uncertainties cannot be obtained, which motivates the studies on model-free methods. With the development of computational ability, intelligent methods have been applied to the control issues of flight vehicles [25]. Among the artificial methods, deep learning and reinforcement learning have been extensively employed in many military and economic fields. The deep neural networks and reinforcement learning were utilized in deep learning to realize online fitting and achieve better performance by trial and error. Therefore, we can see that data-based algorithms like deep learning and reinforcement learning can improve the controller efficiency in the presence of system uncertainties. Deep reinforcement learning (DRL) integrates deep learning and reinforcement learning benefits. This was widely studied, and fruitful results emerged. In [26], the DRL algorithm was employed to design the missile’s guidance law, formulating a Markovian decision process in which the reward function was designed to realize the trade-off between accuracy, energy consumption, and interception time. The deep deterministic policy gradient algorithm was adopted, and the guidance gain was scheduled online. In [27], the PID controller was combined with the DRL algorithm to improve the control performance. The DRL algorithm was adopted to compensate for the system uncertainties. However, in the existing literature on DRL, we can see that ensuring the algorithm’s convergence is a challenge. Accordingly, an algorithm should be developed to simultaneously ensure stability and dynamic performance.
According to the above discussion, it can be deduced that the performance enhancement and stability preservation problems should be verified simultaneously. The controller design problem for switched systems in more applicable environments has not entirely been verified. The design flexibility and control performance can be enhanced by deriving the narrower dwell time bounds. Moreover, it is essential to incorporate the benefits of conventional robust control with an intelligent algorithm. Therefore, the current study investigates the asynchronous H tracking control problem and optimization for switched flight vehicles with time-varying delays. The nonlinear dynamic model and Jacobian linearization can establish the flight vehicles’ switched model. The proposed controller comprises a dynamic-based sub-controller and a learning-based sub-controller. The nominal tracking controller is proposed considering the asynchronous switching induced by the time-varying delay. The MLF and MDADT approaches are integrated to ensure stability and attenuation performance. In order to ensure stability and transient performance simultaneously, a learning-based sub-controller is proposed to compensate for the system uncertainties. The DQL algorithm is provided to achieve better convergence. Therefore, the essential novelties of the current study are summarized as follows: (1) a more applicable and less conservative asynchronous H tracking controller is proposed for switched flight vehicles with time-varying delays by utilizing MLF and MDADT approaches. The LMI approach is adopted to extract sufficient conditions ensuring the stability and prescribed attenuation index; (2) the presented tracking controller consists of a dynamic-based sub-controller and a learning-based sub-controller. The former is designed for the nominal case, and the latter is provided to compensate for the model uncertainties. The advantages of conventional H tracking control and an intelligent algorithm are combined; (3) the DQL is adopted, and the online scheduling is described with a Markovian process. The controller parameters are defined as the output action to simultaneously realize stability, robustness, and dynamic performance.
The remainder of the current paper is arranged as follows. The problem formulation is presented in Section 2. In Section 3, the intelligent H tracking controller is given. The simulation results are given in Section 4 to evaluate the efficiency of the presented approach. Finally, the paper concludes in Section 5.

2. Preliminaries and Problem Formulation

The current study considers the discrete-time switched systems with the time-varying delay as:
x ( k + 1 ) = A i x ( k ) + A h i x ( k h ( k ) ) + B i u ( k ) + D i ω ( k ) y ( k ) = C i x ( k ) ,
where x ( k ) R x stands for the state vector; u ( k ) R u describes the input signal; y ( k ) R y stands for the output signal; ω ( k ) R ω describes the external disturbance with ω ( k ) L 2 [ 0 , ) ; h ( k ) [ h m , h M ] describes the time-varying delay in the systems caused by the network’s transformation limitation. We define A i , A h i , B i , C i and D i as the state-space matrices with proper dimensions; i = σ ( k ) : [ 0 , ) Ω = { 1 , 2 , . . . , n } represents the piecewise continuous switching signal.
We define the command signal as r ( k ) . Therefore, the tracking error is described as:
e ( k ) = r ( k ) y ( k ) .
Remark 1.
The command signal is bounded with 30 d e g r ( k ) 30 d e g .
The tracking control problem of switched systems in (1) can be described as the controller design problem, such that
lim k e ( k ) = 0 .
The tracking error integral is defined in (4).
g ( k ) = s = 0 k 1 e ( s ) = s = 0 k 1 ( r ( s ) y ( s ) ) .
Then, the tracking controller for (1) can be proposed as:
u ( k ) = K 1 i x ( k ) + K 2 i g ( k ) ,
where K 1 i and K 2 i are unknown parameter matrices to be determined.
Due to the time-varying delay in the system, the tracking controller switching always lags behind the system mode switching. All subsystems have unmatched and matched periods. The activated time instant of the ith subsystem is defined as k i , and the activated time instant of the corresponding controller is defined as k i + Δ i , where Δ i denotes the unmatched period length in the ith subsystem.
Suppose
x ˜ ( k ) = [ x T k g T ( k ) ] T , d ( k ) = [ ω T ( k ) , r T ( k ) ] T .
Now, the following closed-loop switched systems can be derived:
x ˜ ( k + 1 ) = A ˜ i i x ˜ ( k ) + A ˜ h i x ˜ ( k h ( k ) ) + D ˜ i d ( k ) e k = C ˜ i x ˜ ( k ) + E ˜ i d ( k ) , k k i + Δ i , k i + 1
x ˜ k + 1 = A ˜ i j x ˜ k + A ˜ h i x ˜ k h k + D ˜ i d k e k = C ˜ i x ˜ k + E ˜ i d k , k k i , k i + Δ i ,
where
A ˜ i i = A i + B i K 1 i B i K 2 i C i I , A ˜ i j = A i + B i K 1 j B i K 2 j C i I
A ˜ h i = A h i 0 0 0 , D ˜ i = D i 0 0 I , C ˜ i = C i 0 , E ˜ i = 0 I .

3. Main Results

3.1. Nominal Tracking Controller Design

The following definitions are provided for the convenience of the tracking controller design.
Definition 1
([15]). Consider the switching signal σ ( k ) and k 1 > k 0 0 , let N σ i ( k 0 , k 1 ) denote the switching number of the i-th subsystem occurring on the [ k 0 , k 1 ] ; T i ( k 0 , k 1 ) describes the running time during time interval [ k 0 , k 1 ] . The positive constant τ a i is defined as the MDADT of the switching signal σ ( k ) such that
N σ i k 0 , k 1 N 0 i + T i k 0 , k 1 τ a i ,
where N 0 i 0 are mode-dependent chatter bounds.
Definition 2.
If there are constants κ > 0 and ε > 0 for a given switching signal σ ( k ) , the switched systems in (6) and (7) are globally uniformly exponentially stable (GUES) when d ( k ) = 0 such that
e k κ e ε k k 0 e k 0 .
Definition 3.
For a given switching signal σ ( k ) , and constants γ > 0 and 0 < λ < 1 , if the switched systems in (6) and (7) are GUES with the prescribed H attenuation performance, such that
s = k 0 1 λ s e T s e s γ 2 s = k 0 d T s d s ;
Accordingly, the tracking controller design problem can be divided into two parts: (1) the systems (6) and (7) are GUES when d ( k ) = 0 ; (2) the closed-loop systems satisfy the prescribed attenuation index in (10).
Theorem 1 gives sufficient conditions to guarantee the systems (6) and (7) GUES.
Theorem 1.
For given constants μ 1 i > 1 , μ 2 i > 1 , 0 < α i < 1 , and β i > 0 , if there are positive-definite matrices P i , Q 1 i , Q 2 i , Q 3 i , P i j , Q 1 i j , Q 2 i j , and Q 3 i j , the closed-loop systems in (6) and (7), with MDADT satisfying (15), are GUES if (11)–(14) hold.
P i μ 1 i P i j Q 1 i μ 1 i Q 1 i j Q 2 i μ 1 i Q 2 i j Q 3 i μ 1 i Q 3 i j
P i j μ 2 i P j Q 1 i j μ 2 i Q 1 j Q 2 i j μ 2 i Q 2 j Q 3 i j μ 2 i Q 3 j
Φ i = P i 1 Φ 12 i Φ 22 i < 0
Φ i j = P i j 1 Φ 12 i j Φ 22 i j < 0
τ a i τ a i = ln μ 1 i μ 2 i + Δ i ln θ i ln τ a i ,
where
α ˜ i = 1 α i , β ˜ i = 1 + β i , θ i = β ˜ i / α ˜ i ,
Φ 12 i = A ˜ i i A ˜ h i 0 0 , Φ 12 i j = A ˜ i j A ˜ h i 0 0 ,
Φ 22 i = diag 1 α i P i + Q 1 i + h M h m + 1 Q 2 i + Q 3 i , 1 α i h m Q 2 i , 1 α i h m Q 1 i , 1 α i h M Q 3 i ,
Φ 22 i j = diag 1 + β i P i j + Q 1 i j + h M h m + 1 Q 2 i j + Q 3 i j , 1 + β i h m Q 2 i j , 1 + β i h m Q 1 i j , 1 + β i h M Q 3 i j .
Proof. 
The Lyapunov-like function of the ith subsystem is described as:
V i k = l = 1 3 V l i k
where
V 1 i ( k ) = x ˜ T ( k ) P i x ˜ k
V 2 i k = s = k h m k 1 ( 1 α ) k s 1 x ˜ T ( s ) Q 1 i x ˜ ( s ) + s = k h ( k ) k 1 ( 1 α ) k s 1 x ˜ T ( s ) Q 2 i x ˜ s + s = k h M k 1 1 α k s 1 x ˜ T s Q 3 i x ˜ s
V 3 i k = j = k h M + 1 k h m s = j k 1 1 α k s 1 x ˜ T s Q 2 i x ˜ s .
The weighted difference of V i ( k ) is defined as:
Δ V 1 i k + α V 1 i k = V 1 i k + 1 1 α V 1 i k = x ˜ T ( k + 1 ) P i x ˜ ( k + 1 ) 1 α x ˜ T ( k ) P i x ˜ ( k )
Δ V 2 i k + α V 2 i k = V 2 i k + 1 1 α V 2 i k = s = k h m + 1 k 1 α k s x ˜ T s Q 1 i x ˜ s + s = k h ( k ) + 1 k 1 α k s x ˜ T s Q 2 i x ˜ s + s = k h M + 1 k 1 α k s x ˜ T s Q 3 i x ˜ s s = k h m k 1 1 α k s x ˜ T s Q 1 i x ˜ s s = k h ( k ) k 1 1 α k s x ˜ T s Q 2 i x ˜ s s = k h M k 1 1 α k s x ˜ T s Q 3 i x ˜ s x ˜ T k Q 1 i x ˜ k 1 α h m x ˜ T k h m Q 1 i x ˜ k h m + x ˜ T k Q 2 i x ˜ k 1 α h m x ˜ T k h ( k ) Q 2 i x ˜ k h ( k ) + x ˜ T k Q 3 i x ˜ k 1 α h M x ˜ T k h M Q 3 i x ˜ ( k h M ) + s = k h M + 1 k h m 1 α k s x ˜ T s Q 2 i x ˜ s
Δ V 3 i k + α V 3 i k = V 3 i k + 1 1 α V 3 i k = j = k h M + 2 k h m + 1 s = j k 1 α k s x ˜ T s Q 2 i x ˜ s j = k h M + 1 k h m s = j k 1 1 α k s x ˜ T s Q 2 i x ˜ s = h M h m x ˜ T k Q 2 i x ˜ k s = k h M + 1 k h m 1 α k s x ˜ T s Q 2 i x ˜ s .
We define the augmented vector as:
ξ k = x ˜ T k x ˜ T k h ( k ) x ˜ T k h m x ˜ T k h M T .
The following equations can be derived under zero-initial conditions when d ( k ) = 0 .
Δ V i k + α V i k x ˜ T ( k ) A ˜ i T P i A ˜ i x ˜ ( k ) + x ˜ T k h ( k ) A ˜ h i T P i A ˜ i x ˜ k h ( k ) + x ˜ T ( k ) 1 α P i + Q 1 i + h M h m + 1 Q 2 i + Q 3 i x ˜ ( k ) + x ˜ T k h ( k ) 1 α h m Q 2 i x ˜ k h ( k ) + x ˜ T k h m 1 α h m Q 1 i x ˜ k h m + x ˜ T k h M 1 α h M Q 3 i x ˜ k h M = ξ T ( k ) Φ 12 i T P i Φ 12 i + Φ 22 i ξ ( k ) .
Together with the Schur complement, we have
Δ V i ( k ) + α V i ( k ) ξ T ( k ) Φ 12 i T P i Φ 12 i + Φ 22 i ξ ( k ) ξ T ( k ) Φ i ξ ( k ) 0 .
Similarly, the Lyapunov-like function in unmatched periods is defined as:
V i j k = l = 1 3 V l i j k ,
where
V 1 i j k = x ˜ T k P i j x ˜ k
V 2 i j k = s = k h m k 1 1 + β k s 1 x ˜ T s Q 1 i j x ˜ s + s = k h ( k ) k 1 1 + β k s 1 x ˜ T s Q 2 i j x ˜ s + s = k h M k 1 1 + β k s 1 x ˜ T s Q 3 i j x ˜ s
V 3 i j k = j = k h M + 1 k h m s = j k 1 1 + β k s 1 x ˜ T s Q 2 i j x ˜ s .
The following equation can be obtained by the difference method:
Δ V i k β V i k ξ T ( k ) Φ 12 i j T P i j Φ 12 i j + Φ 22 i j ξ ( k ) ξ T ( k ) Φ i j ξ ( k ) 0 .
Combining (22) with (27) gives:
V σ k p k p + 1 α ˜ σ k p k p + 1 k p Δ p V σ k p + Δ p k p + Δ p μ 1 σ k p α ˜ σ k p k p + 1 k p Δ p V σ k p k p + Δ p μ 1 σ k p α ˜ σ k p k p + 1 k p Δ p β ˜ σ k p Δ p V σ k p k p μ 1 σ k p μ 2 σ k p α ˜ σ k p k p + 1 k p Δ p β ˜ σ k p Δ p V σ k p 1 k p = μ 1 σ k p μ 2 σ k p α ˜ σ k p k p + 1 k p θ σ k p Δ p V σ k p 1 k p .
The switching instants in the interval [ k 0 , k ] are k 1 , k 2 , ⋯, and k N σ . Accordingly, the following equation can be derived by iteration:
V σ k N σ k μ 1 σ k N σ μ 2 σ k N σ α ˜ σ k N σ k k N σ θ σ k N σ Δ N σ V σ k N σ 1 k N σ μ 1 σ k N σ μ 1 σ k N σ 1 μ 2 σ k N σ μ 2 σ k N σ 1 α ˜ σ k N σ k k N σ α ˜ σ k N σ 1 k N σ k N σ 1 θ σ k N σ Δ N σ θ σ k N σ 1 Δ N σ 1 V σ k p 2 k N σ 1 · · · s = 0 N σ μ 1 σ k s μ 2 σ k s α ˜ σ k s k s + 1 k s θ σ k s Δ s V 0 k 0 = exp s = 0 N σ ln μ 1 σ k s μ 2 σ k s + k s + 1 k s ln α ˜ σ k s + Δ s ln θ σ k s V 0 k 0 = exp i = 1 n N σ i k 0 , k 1 ln μ 1 i μ 2 i + Δ i ln θ i + T i k 0 , k 1 ln α ˜ i V 0 k 0 .
Together with (8), (22), (24), and (28), we can obtain that:
V σ ( k N σ ) k exp i = 1 n T i k 0 , k 1 τ a i ln μ 1 i μ 2 i + Δ i ln θ i + T i k 0 , k 1 ln α ˜ i V 0 k 0 exp i = 1 n T i k 0 , k 1 × ln μ 1 i μ 2 i + Δ i ln θ i τ a i + ln α ˜ i V 0 k 0 .
According to (15), we can obtain
ln μ 1 i μ 2 i + Δ i ln θ i τ a i + ln α ˜ i < 0 .
Therefore, the switched systems in (6) and (7) are GUES if the MDADT satisfies (15). Now, the proof is finished.    □
Remark 2.
The (22) gives the variation of the Lyapunov function in the matched periods and (27) provides the variation of the Lyapunov function in the unmatched periods. According to (22) and (27), we can obtain the relationship between the Lyapunov function at time instant k and the initial state by iteration.
Theorem 1 provides a stability analysis. Accordingly, sufficient conditions to guarantee the prescribed attenuation index can be derived.
Theorem 2.
For given constants μ 1 i > 1 , μ 2 i > 1 , 0 < α i < 1 , β i > 0 , if there are positive-definite matrices P i , Q 1 i , Q 2 i , Q 3 i , P i j , Q 1 i j , Q 2 i j , and Q 3 i j , and (31) and (32) hold, i , j N , i j , the closed-loop systems in (6) and (7), with MDADT meeting (15), are GUES with the prescribed attenuation index γ.
Φ ˜ i = P i 1 0 Φ ˜ 13 i I Φ ˜ 23 i Φ ˜ 33 i < 0
Φ ˜ i j = P i j 1 0 Φ ˜ 13 i j I Φ ˜ 23 i Φ ˜ 33 i j < 0 ,
where
Φ ˜ 13 i = A ˜ i i A ˜ h i 0 0 D ˜ i , Φ ˜ 13 i j = A ˜ i j A ˜ h i 0 0 D ˜ i , Φ ˜ 23 i = C ˜ i 0 0 0 E ˜ i ,
Φ ˜ 33 i = diag 1 α i P i + Q 1 i + h M h m + 1 Q 2 i + Q 3 i , 1 α i h m Q 2 i , 1 α i h m Q 1 i , 1 α i h M Q 3 i , γ 2 I ,
Φ ˜ 33 i j = diag ( 1 + β i P i j + Q 1 i j + h M h m + 1 Q 2 i j + Q 3 i j , 1 + β i h m Q 2 i j , 1 + β i h m Q 1 i j , 1 + β i h M Q 3 i j , γ 2 I ) .
Proof. 
By combining Theorem 1 and (31) and (32), it is evident that the closed-loop systems in (6) and (7) are GUES. Suppose
ξ ˜ ( k ) = [ x ˜ T ( k ) x ˜ T ( k h ( k ) ) x ˜ T ( k h m ) x ˜ T ( k h M ) d T ( k ) ; ] T ,
according to the Schur complement, we have
Δ V i k + α i V i k + W k ξ ˜ T ( k ) Φ ˜ 13 i T P i Φ ˜ 13 i + Φ ˜ 23 i T Φ ˜ 23 i + Φ ˜ 33 i ξ ˜ ( k ) ξ ˜ T ( k ) Φ ˜ i ξ ˜ ( k ) < 0
Δ V i j k β i V i j k + W k ξ ˜ T ( k ) Φ ˜ 13 i j T P i j Φ ˜ 13 i j + Φ ˜ 23 i j T Φ ˜ 23 i j + Φ ˜ 33 i j ξ ˜ ( k ) ξ ˜ T ( k ) Φ ˜ i j ξ ˜ ( k ) < 0
where W k = e T ( k ) e ( k ) γ 2 d T ( k ) d ( k ) .
V σ k p k p + 1 μ 1 σ k p μ 2 σ k p α ˜ σ k p k p + 1 k p Δ p β ˜ σ k p Δ p V σ k p 1 k p s = k p + Δ p k p + 1 1 α ˜ σ k p k p + 1 1 s W s s = k p k p + Δ p 1 α ˜ σ k p k p + 1 k p Δ p 1 β ˜ σ k p k p + Δ p s W s = μ 1 σ k p μ 2 σ k p α ˜ σ k p k p + 1 k p θ σ k p Δ p V σ k p 1 k p s = k p + Δ p k p + 1 1 α ˜ σ k p k p + 1 1 s W s s = k p k p + Δ p 1 α ˜ σ k p k p + 1 s 1 θ σ k p k p + Δ p s W s .
The following equation can be obtained by iteration on the interval [ k 0 , k 1 ] .
V σ k N σ k μ 1 σ k N σ μ 2 σ k N σ α ˜ σ k N σ k k N σ θ σ k N σ Δ N σ V σ k N σ 1 k N σ s = k N σ + Δ N σ k 1 α ˜ σ k N σ k 1 s W s s = k N σ k N σ + Δ N σ 1 α ˜ σ k N σ k s 1 θ σ k N σ k N σ + Δ N σ s W s μ 1 σ k N σ μ 2 σ k N σ α ˜ σ k N σ k k N σ θ σ k N σ Δ N σ ( μ 1 σ k N σ 1 μ 2 σ k N σ 1 α ˜ σ k N σ 1 k N σ k N σ 1 θ σ k N σ 1 Δ N σ 1 V σ k N σ 2 k N σ 1 s = k N σ 1 + Δ N σ 1 k N σ 1 α ˜ σ k N σ 1 k N σ 1 s W s s = k N σ 1 k N σ 1 + Δ N σ 1 1 α ˜ σ k N σ 1 k N σ s 1 θ σ ( k N σ 1 ) k N σ 1 + Δ N σ 1 s W ( s ) ) s = k N σ + Δ N σ k 1 α ˜ σ k N σ k 1 s W s s = k N σ k N σ + Δ N σ 1 α ˜ σ k N σ k s 1 θ σ k N σ k N σ + Δ N σ s W s s = 0 N σ μ 1 σ k s μ 2 σ k s α ˜ σ k s k s + 1 k s θ σ k s Δ s V 0 k 0 r = 0 N σ μ 1 σ k r μ 2 σ k r α ˜ σ k r k r + 1 k r θ σ k r Δ r s = k 0 + Δ 0 k 1 1 α ˜ σ k 0 k 1 1 s W s + s = k 0 k 0 + Δ 0 1 α ˜ σ k 0 k 1 s 1 θ σ k 0 k 0 + Δ 0 s W s
s = k N σ + Δ N σ k 1 α ˜ σ k N σ k 1 s W s s = k N σ k N σ + Δ N σ 1 α ˜ σ k N σ k s 1 θ σ k N σ k N σ + Δ N σ s W s i = 1 n μ 1 i N σ i k 0 , k μ 2 i N σ i k 0 , k α ˜ i T i k 0 , k 1 θ i Δ i N σ i k 0 , k V 0 k 0 s = k 0 k 1 ( μ 1 j m = 1 m j n μ 1 m N σ m s , k μ 2 m N σ m s , k μ 1 j N σ j s , k μ 2 j N σ j s , k × α ˜ m T m s , k θ m Δ m N σ m s , k α ˜ j T j s , k 1 θ j Δ j N σ j s , k ) W s s = k 0 k 1 ( μ 1 j m = 1 m j n μ 1 m N σ m s , k μ 2 m N σ m s , k μ 1 j N σ j s , k μ 2 j N σ j s , k × α ˜ m T m s , k θ m Δ m N σ m s , k α ˜ j T j s , k 1 θ j Δ j N σ j s , k + 1 1 ) W s ,
where m and j are the number of inactivated and activated subsystems in the s-th step, respectively.
Considering the initial condition and V σ ( k ) ( k ) 0 , we have
s = k 0 k 1 ( μ 1 j m = 1 m j n μ 1 m N σ m s , k μ 2 m N σ m s , k μ 1 j N σ j s , k μ 2 j N σ j s , k × α ˜ m T m s , k θ m Δ m N σ m s , k α ˜ j T j s , k 1 θ j Δ j N σ j s , k ) W s + s = k 0 k 1 ( μ 1 j m = 1 m j n μ 1 m N σ m s , k μ 2 m N σ m s , k μ 1 j N σ j s , k μ 2 j N σ j s , k × α ˜ m T m s , k θ m Δ m N σ m s , k α ˜ j T j s , k 1 θ j Δ j N σ j s , k + 1 1 ) W s 0 .
Multiplying both sides of (36) by
m = 1 n μ 1 m N σ m k 0 , k μ 2 m N σ m k 0 , s μ 1 j N σ j k 0 , k θ m Δ m N σ m k 0 , k ,
we have
s = k 0 k 1 ( m = 1 m j n μ 1 m N σ m k 0 , s μ 2 m N σ m k 0 , s μ 1 j N σ j k 0 , s μ 2 j N σ j k 0 , s × α ˜ m T m s , k θ m Δ m N σ m k 0 , s α ˜ j T j s , k 1 θ j Δ j N σ j k 0 , s ) e T s e s + s = k 0 k 1 ( μ 1 j m = 1 m j n μ 1 m N σ m k 0 , s μ 2 m N σ m k 0 , s μ 1 j N σ j k 0 , s × μ 2 j N σ j k 0 , s α ˜ m T m s , k θ m Δ m N σ m k 0 , s α ˜ j T j s , k 1 θ j Δ j N σ j k 0 , s + 1 1 ) e T s e s s = k 0 k 1 ( m = 1 m j n μ 1 m N σ m k 0 , s μ 2 m N σ m k 0 , s μ 1 j N σ j k 0 , s × μ 2 j N σ j k 0 , s α ˜ m T m s , k θ m Δ m N σ m k 0 , s α ˜ j T j s , k 1 θ j Δ j N σ j k 0 , s ) γ 2 d T s d s + s = k 0 k 1 ( μ 1 m m = 1 m j n μ 1 m N σ m k 0 , s μ 2 m N σ m k 0 , s μ 1 j N σ j k 0 , s × μ 2 j N σ j k 0 , s α ˜ m T m s , k θ m Δ m N σ m k 0 , s α ˜ j T j s , k 1 θ j Δ j N σ j k 0 , s + 1 1 ) γ 2 d T s d s .
Together with (8) and (15), we can obtain that
0 N σ m ( k 0 , s ) T m ( k 0 , s ) τ a m T m ( k 0 , s ) l n α ˜ m l n μ 1 m μ 2 m + Δ m l n θ m .
Combining (36) and (37) with μ 1 m > 1 , μ 2 m > 1 , 0 < α ˜ m < 1 , θ m > 1 , and Δ m > 1 , it can be inferred that:
s = k 0 k 1 m = 1 n μ 1 m μ 2 m θ m Δ m T m k 0 , s ln α ˜ m ln μ 1 m μ 2 m + Δ m ln θ m α ˜ m T m s , k 1 e T s e s + s = k 0 k 1 μ 1 j θ j Δ j 1 m = 1 m j n μ 1 m μ 2 m θ m Δ m T m k 0 , s ln α ˜ m ln μ 1 m μ 2 m + Δ m ln θ m α ˜ m T m s , k 1 e T s e s s = k 0 k 1 m = 1 n α ˜ m T m s , k 1 γ 2 d T s d s + s = k 0 k 1 μ 1 j θ j Δ j 1 m = 1 n α ˜ m T m s , k 1 γ 2 d T s d s .
Thus, we have
s = k 0 k 1 m = 1 n α ˜ m T m ( k 0 , k ) 1 e T ( s ) e ( s ) + s = k 0 k 1 μ 1 j θ j Δ j 1 m = 1 n α ˜ m T m k 0 , k 1 e T s e s s = k 0 k 1 m = 1 n α ˜ m T m s , k 1 γ 2 d T s d s + s = k 0 k 1 μ 1 j θ j Δ j 1 m = 1 n α ˜ m T m s , k 1 γ 2 d T s d s .
When k , we have
s = k 0 1 α max s k 0 e T s e s γ 2 s = k 0 d T s d s ,
where α m a x = max { ff m } .
According to Definition 3, the switched systems in (6) and (7) are GUES with the prescribed attenuation index γ . This completes the proof.    □
Accordingly, the LMI problem in the theorem can be solved. Theorem 3 is given to obtain the solution of the tracking controller.
Theorem 3.
For given constants μ 1 i > 1 , μ 2 i > 1 , 0 < α i < 1 , β i > 0 , if there are positive-definite matrices P i , S i , Q 1 i , Q 2 i , Q 3 i , P i j , Q 1 i j , Q 2 i j , and Q 3 i j , i , j N , i j , the solution of the nominal tracking controller can be realized by the following equations:
min tr ( S i , P i ) s . t .
S i I I P i 0
Φ ˜ i = S i 0 Φ ˜ 13 i I Φ ˜ 23 i Φ ˜ 33 i < 0
Φ ˜ i j = S i j 0 Φ ˜ 13 i j I Φ ˜ 23 i Φ ˜ 33 i j < 0
S i P i = I .
Then, the closed systems in (6) and (7) are GUES with the prescribed attenuation performance γ with the MDADT satisfying (15).

3.2. Learning-Based Tracking Controller Design

The nominal tracking controller is proposed according to the H control theory. The stability and prescribed attenuation performance of switched systems are guaranteed. However, as we all know, attaining the optimal compromise of robustness and transient efficiency is challenging. Therefore, the online scheduling algorithm is presented using the DQL approach. The learning-based tracking controller compensates for the system uncertainties to realize performance improvement.
The DQL is an important DRL algorithm. The agent learns an action from an unknown environment according to the reward function in the reinforcement learning framework. In this paper, the controller parameters’ online scheduling is viewed as a Markovian decision process. The action is described with the learning-based tracking controller parameters, which are given to compensate for the undesirable response induced by system uncertainties. The action is provided to maximize the expected discounted reward function during the predefined interval. The action and the reward function are given as follows:
A k = [ Δ K 1 i Δ K 2 i ]
R k = i = k K f γ d i k r c , i = r c , k + γ d r c , k + 1 + γ d 2 r c , k + 2 + + γ d K f k r c , K f = r c , k + γ d R k + 1 ,
where γ d [ 0 , 1 ] denotes the discount factor; K f is the terminal step.
The state vector of the DRL algorithm is defined as:
S k = x k e k g k .
The reward function is defined as:
r c , k = ε 1 e k 2 + ε 2 r e m + ε 3 r s
r e m = k e m , e e m 0 , e < e m
r s = k u m , u u m 0 , u < u m ,
where ε 1 , ε 2 , and ε 3 denote the weights of the reward function, and k e m and k u m are given constants.
In the DQL algorithm, a deep neural network is introduced to approximate the action–state value function Q e S k , A k , and the action is developed according to the maximum Q value:
a = arg max a Q e S k , A k .
It is supposed that the algorithm has two neural networks, including the critic and target neural networks. The weights of the critic neural network are changed according to the TD error, and the weights of the target critic neural network are updated according to (52).
Q e S k , A k , ω = L r R + γ d max a ( Q e S , A , ω ) + ( 1 L r ) Q e ( S k , A k , ω ) ,
where Q e s k , a k , ω and Q e s k , a k , ω denote the outputs of the critic and target critic neural networks, respectively; L r denotes the learning rate; R is the value of the reward function from the state S k to S ; max a Q s , a , ω represents the target critic network’s maximum Q value. Therefore, the design algorithm of the learning-based tracking controller can be given as shown in Algorithm 1.
Algorithm 1 Learning-based tracking controller design.
1:
Calculate the nominal tracking controller based on Theorem 3.
2:
Initialize the parameters of the Q value network and the target Q value network.
3:
Initialize the Replay buffer R, e p i s o d e = 0.
4:
for  e p i s o d e = 1 to M do
5:
   Initialize the state S1 with a random value and take the initial observation.
6:
   for t = 1 to K do
7:
     Calculate the action A k according to the state and the reward function.
8:
     Calculate the reward function and the state of the next time instant.
9:
     Store the transition pair ( S k , A k , R k , S k + 1 ) in the Replay
10:
     Sample a random minibatch transition pair from the Replay buffer.
11:
     Update the weights of the Q value function.
12:
     Update the weights of the target Q value network.
13:
   end for
14:
end for
Remark 3.
The model-based method and learning-based method are combined in this paper. The model-based method is provided based on the robust control theory and the switched control theory. To improve the transient performance, the deep Q learning algorithm is applied in the learning-based method. The output of action is supposed to be the compensation of controllers, and the reward function is defined as the weighted function of tracking error and constraints. By using this method, the parameters of controllers can be trained, and the stability, robustness, and transient performance can be guaranteed simultaneously.

4. Numerical Example

The flight vehicles studied in this paper are the HiMAT vehicles, which can be modeled as switched systems in the flight envelope. This paper considers the switching signals between subsystems 1, 2, 8, 12, and 18. The flight conditions of the operating points are given in Table 1:
Therefore, the sampling time is chosen as T s = 0.02 . The switched model of longitudinal motion dynamics can be described as:
A 1 = 0.9804 0.0188 0.1768 0.9720 , B 1 = 0.0049 0.0034 0.0007 0.1579 0.0979 0.0993
A 2 = 0.9728 0.0188 0.3773 0.9622 , B 2 = 0.0075 0.0050 0.0014 0.2941 0.1765 0.1831
A 8 = 0.9766 0.0190 0.3312 0.9668 , B 8 = 0.0077 0.0054 0.0018 0.3759 0.2798 0.2113
A 12 = 0.9649 0.0188 0.2242 0.9509 , B 12 = 0.0136 0.0094 0.0042 0.9015 0.6166 0.4367
A 18 = 0.9657 0.0191 0.9772 0.9523 , B 18 = 0.0061 0.0033 0.0023 0.4595 0.2426 0.2576
v k + 1 = 0.9922 0.1247 0.1247 0.9922 v k d k = 1 0 v k ,
where v ( k ) stands for the state vector, and the initial value of v ( k ) is chosen as [ 0.01 , 0 ] T .
The current paper considers the switched systems parameters as: a 1 = 0.25 , a 2 = 0.23 , a 8 = 0.26 , a 12 = 0.23 , a 18 = 0.27 , b 1 = 0.03 , b 2 = 0.04 , b 8 = 0.02 , b 12 = 0.05 , b 18 = 0.04 , Δ 1 = 0.1 , Δ 2 = 0.12 , Δ 8 = 0.16 , Δ 12 = 0.14 , Δ 18 = 0.12 , μ 11 = 1.22 , μ 12 = 1.15 , μ 18 = 1.16 , μ 112 = 1.18 , μ 118 = 1.17 , μ 21 = 1.21 , μ 22 = 1.17 , μ 28 = 1.18 , μ 212 = 1.16 , and μ 218 = 1.15 . Therefore, the MDADT in (16) can be calculated as follows:
τ a 1 = 1.4641 , τ a 2 = 1.2735 , τ a 3 = 1.2131 , τ a 4 = 1.3673 , τ a 5 = 1.0779 .
The ADT can be obtained as τ a = 1.6800 .
It is evident that the dwell time obtained by the MDADT approach is less than that obtained by the ADT method. Therefore, it leaves more room for controller design. The system can stay in the modes with a better performance for a longer time.
We set the attenuation performance γ as 0.7. According to the conditions mentioned above, the nominal tracking controller can be derived using Theorem 3. The time delay and switching logic are presented in Figure 1 and Figure 2, respectively.
Firstly, we give a comparison between the ADT and MDADT methods, as described in Figure 3, Figure 4, Figure 5 and Figure 6. It is evident that the MDADT method can realize a better performance compared with the conventional ADT method. The attack angle response based on the MDADT method can avoid the undesirable response induced by the external disturbance. The generated controller can improve the robustness to the environment. Moreover, the actuator response is admissible.
Figure 7, Figure 8, Figure 9, Figure 10 and Figure 11 compare the MDADT approach with the presented method. It can be seen that the transient performance and robustness are enhanced using the DQL algorithm. The controller can compensate for the adverse effect induced by the system uncertainties. A better response of attack angle can be obtained based on the compensation of the uncertainties. Moreover, the actuator response of the proposed controller is admissible.
Accordingly, the proposed method can realize a better transient performance and robustness than the conventional method. The fewer conservative results can be derived through the MDADT approach. The presented method can eliminate the undesirable response induced by time-varying delays and asynchronous switching. The learning-based tracking controller compensates for the system uncertainties. Therefore, we can see that the presented method can simultaneously ensure stability, robustness, and transient performance.

5. Conclusions

This paper presents a novel asynchronously H tracking controller design and an online scheduling method for flight vehicles containing time-varying delays. The feedback controller is constructed using the flight vehicle’s switched model, derived based on Jacobian linearization. Due to the limitation of network transmission, the asynchronous switching induced by the time-varying delay is considered. The nominal tracking controller is proposed to guarantee robustness and stability. The learning-based tracking controller is established to avoid the undesirable response generated by system uncertainties. A combination of the MLF and MDADT methods is considered to verify the stability and attenuation efficiency. The sufficient existing conditions are derived based on the LMI technique. Then, the learning-based tracking controller is designed using the DQL to enhance the tracking controller’s transient efficiency. The proposed approach ensures stability, robustness, and transient efficiency, simultaneously.

Author Contributions

Conceptualization, X.Y.; Methodology, X.Y., B.F. and X.M.; Software, X.Y.; Validation, X.Y., Y.L., D.Y. and X.W.; Writing—Original Draft Preparation, X.Y. and B.F.; Writing—Review & Editing, X.Y., B.F., X.M., Y.L., D.Y. and X.W.; Supervision, X.M.; funding acquisition, B.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China, grant number 62103338.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Zhao, Y.; Zhu, Q. Stabilization by delay feedback control for highly nonlinear switched stochastic systems with time delay. Int. J. Robust Nonlinear Control 2021, 31, 3070–3089. [Google Scholar] [CrossRef]
  2. Zeng, D.; Liu, Z.; Chen, C.L.P.; Zhang, Y. Adaptive neural tracking control for switched nonlinear systems with state quantization. Neurocomputing 2021, 454, 392–404. [Google Scholar] [CrossRef]
  3. Zhang, D.; Feng, G. A new switched system approach to leader-follower consensus of heterogeneous linear multiagent systems with DoS attack. IEEE Trans. Syst. Man, Cybern. Syst. 2021, 52, 392–404. [Google Scholar] [CrossRef]
  4. Zhao, Y.M.; Pang, S.Y.; Li, L. H finite-time composite anti-disturbance switching control for switched systems. ISA Trans. 2021, 115, 71–78. [Google Scholar] [PubMed]
  5. Shao, X.; Shi, Y.; Zhang, W.; Zhao, J. Prescribed fast tracking control for flexible air-breathing hypersonic vehicles: An event-triggered case. Chin. J. Aeronaut. 2021, 34, 200–215. [Google Scholar] [CrossRef]
  6. Lu, Y. Disturbance observer-based backstepping control for hypersonic flight vehicles without use of measured flight path angle. Chin. J. Aeronaut. 2021, 34, 396–406. [Google Scholar] [CrossRef]
  7. Zhang, S.; Wang, Q.; Yang, G.; Zhang, M. Anti-disturbance backstepping control for air-breathing hypersonic vehicles based on extend state observer. ISA Trans. 2019, 92, 84–93. [Google Scholar] [CrossRef] [PubMed]
  8. Zhang, Q.; He, D. Disturbance-observer-based adaptive fuzzy control for strict-feedback switched nonlinear systems with input delay. IEEE Trans. Fuzzy Syst. 2021, 29, 1942–1952. [Google Scholar] [CrossRef]
  9. Zhao, M.P.Y.; Ma, D.; Yu, S. Robust guaranteed cost rate anti-bump switching control for switched systems. Int. J. Robust Nonlinear Control 2021, 31, 6334–6348. [Google Scholar] [CrossRef]
  10. Xie, J.; Yang, D. Global stabilization of switched nonlinear systems with vanishing control vector fields and its application. Int. J. Robust Nonlinear Control 2021, 31, 5149–5164. [Google Scholar] [CrossRef]
  11. Wang, Y.; Xu, N.; Liu, Y.; Zhao, X. Adaptive fault-tolerant control for switched nonlinear systems based on command filter technique. Appl. Math. Comput. 2021, 392, 125725. [Google Scholar] [CrossRef]
  12. Wang, Z.; Gao, L.; Liu, H. Stability and stabilization of impulsive switched system with inappropriate impulsive switching signals under asynchronous switching. Nonlinear Anal. Hybrid Syst. 2021, 39, 100976. [Google Scholar] [CrossRef]
  13. Wang, Z.; Yuan, J. Dissipativity-based composite antidisturbance control for T-S fuzzy switched stochastic nonlinear systems subjected to multisource disturbances. IEEE Trans. Fuzzy Syst. 2021, 29, 1226–1237. [Google Scholar] [CrossRef]
  14. Zheng, Q.; Xu, S.; Zhang, Z. Nonfragile quantized H filtering for discrete-time switched T-S fuzzy systems with local nonlinear models. IEEE Trans. Fuzzy Syst. 2021, 29, 1507–1517. [Google Scholar] [CrossRef]
  15. Zhao, X.; Zhang, L.; Shi, P.; Liu, M. Stability and stabilization of switched linear systems with mode-dependent average dwell time. IEEE Trans. Autom. Control 2012, 57, 1809–1815. [Google Scholar] [CrossRef]
  16. Sun, T.; Liu, T.; Sun, X. Stability analysis of cyclic switched linear systems: An average cycle dwell time approach. Inf. Sci. 2021, 544, 227–237. [Google Scholar] [CrossRef]
  17. Hu, H.; Li, Y.; Liu, J.; Tian, E.; Xie, X. Fault estimation for delta operator switched systems with mode-dependent average dwell-time. J. Frankl. Inst. 2021, 358, 5971–5984. [Google Scholar] [CrossRef]
  18. Liu, L.; Zhao, X.; Sun, X.; Zong, G. Stability and l2-gain analysis of discrete-time switched systems with mode-dependent average dwell time. IEEE Trans. Syst. Man, Cybern. Syst. 2020, 50, 2305–2314. [Google Scholar] [CrossRef]
  19. Zong, G.; Duan, D.; Yang, D. Exponential H filtering of networked linear switched systems with mode-dependent average dwell time: An event-triggered scheme. Int. J. Syst. Sci. 2019, 50, 1450–1464. [Google Scholar] [CrossRef]
  20. Wang, R.; Xing, J.; Xiang, Z. Finite-time asynchronous control of linear time-varying switched systems. Int. J. Adapt. Control Signal Process 2021, 35, 1824–1841. [Google Scholar] [CrossRef]
  21. Shi, J.; Zhao, J. Multiple event-triggered schemes for a class of switched descriptor systems with asynchronous switching. Int. J. Robust Nonlinear Control 2021, 31, 9040–9054. [Google Scholar] [CrossRef]
  22. Zhou, S.; Dong, X.; Hua, Y.; Yu, J.; Ren, Z. Predefined formation-containment control of high-order multi-agent systems under communication delays and switching topologies. IET Control Theory Appl. 2021, 15, 1661–1672. [Google Scholar] [CrossRef]
  23. Shu, F.; Zhai, J. Adaptive output feedback stabilization for switched p-normal nonlinear systems with unknown homogeneous growth rate. Nonlinear Anal. Hybrid Syst. 2021, 39, 100985. [Google Scholar] [CrossRef]
  24. Wang, L.; Yang, R.; Zhang, H. Improved event-triggered sliding mode control of switched systems with disturbances. Asian J. Control 2021, 23, 2214–2226. [Google Scholar] [CrossRef]
  25. Liu, M.; Hu, T. Deep learning enabled localization for UAV autolanding, Chinese Journal of Aeronautics. Chin. J. Aeronaut. 2021, 34, 585–600. [Google Scholar] [CrossRef]
  26. Shaoming, H.; Hyo-Sang, S.; Antonios, T. Computational missile guidance: A deep reinforcement learning approach. J. Aerosp. Inf. Syst. 2021, 18, 571–582. [Google Scholar]
  27. Gheisarnejad, M.; Khooban, M.H. An intelligent non-integer PID controller-based deep reinforcement learning: Implementation and experimental results. IEEE Trans. Ind. Electron. 2020, 68, 3609–3618. [Google Scholar] [CrossRef]
Figure 1. The time delay response.
Figure 1. The time delay response.
Aerospace 11 00107 g001
Figure 2. The switching logic response.
Figure 2. The switching logic response.
Aerospace 11 00107 g002
Figure 3. The attack angle response.
Figure 3. The attack angle response.
Aerospace 11 00107 g003
Figure 4. The elevator deflection response.
Figure 4. The elevator deflection response.
Aerospace 11 00107 g004
Figure 5. The elevon deflection response.
Figure 5. The elevon deflection response.
Aerospace 11 00107 g005
Figure 6. The canard deflection response.
Figure 6. The canard deflection response.
Aerospace 11 00107 g006
Figure 7. The attack angle response.
Figure 7. The attack angle response.
Aerospace 11 00107 g007
Figure 8. The elevator deflection response.
Figure 8. The elevator deflection response.
Aerospace 11 00107 g008
Figure 9. The elevon deflection response.
Figure 9. The elevon deflection response.
Aerospace 11 00107 g009
Figure 10. The canard deflection response.
Figure 10. The canard deflection response.
Aerospace 11 00107 g010
Figure 11. The episodes reward response.
Figure 11. The episodes reward response.
Aerospace 11 00107 g011
Table 1. Flight conditions of the vehicle.
Table 1. Flight conditions of the vehicle.
Point NumberMaHeight/mAttack Angle
10.29762.53.18
20.4762.51.49
80.661001.48
120.976251.19
181.212,2002.23
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yang, X.; Fu, B.; Ma, X.; Liu, Y.; Yuan, D.; Wu, X. Asynchronously H Tracking Control and Optimization for Switched Flight Vehicles with Time-Varying Delay. Aerospace 2024, 11, 107. https://doi.org/10.3390/aerospace11020107

AMA Style

Yang X, Fu B, Ma X, Liu Y, Yuan D, Wu X. Asynchronously H Tracking Control and Optimization for Switched Flight Vehicles with Time-Varying Delay. Aerospace. 2024; 11(2):107. https://doi.org/10.3390/aerospace11020107

Chicago/Turabian Style

Yang, Xing, Bin Fu, Xiaochuan Ma, Yu Liu, Dongyu Yuan, and Xintong Wu. 2024. "Asynchronously H Tracking Control and Optimization for Switched Flight Vehicles with Time-Varying Delay" Aerospace 11, no. 2: 107. https://doi.org/10.3390/aerospace11020107

APA Style

Yang, X., Fu, B., Ma, X., Liu, Y., Yuan, D., & Wu, X. (2024). Asynchronously H Tracking Control and Optimization for Switched Flight Vehicles with Time-Varying Delay. Aerospace, 11(2), 107. https://doi.org/10.3390/aerospace11020107

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop