Next Article in Journal
Correlations, Information Backflow, and Objectivity in a Class of Pure Dephasing Models
Previous Article in Journal
Linear and Nonlinear Effects in Connectedness Structure: Comparison between European Stock Markets
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Optimal UAV Formation Tracking Control with Dynamic Leading Velocity and Network-Induced Delays

1
Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
2
School of Information, Beijing Wuzi University, Beijing 101149, China
3
Purple Mountain Laboratory: Networking, Communications and Security, Nanjing 210096, China
*
Author to whom correspondence should be addressed.
Entropy 2022, 24(2), 305; https://doi.org/10.3390/e24020305
Submission received: 21 December 2021 / Revised: 16 February 2022 / Accepted: 17 February 2022 / Published: 21 February 2022
(This article belongs to the Section Signal and Data Analysis)

Abstract

:
With the rapid development of UAV technology, the research of optimal UAV formation tracking has been extensively studied. However, the high maneuverability and dynamic network topology of UAVs make formation tracking control much more difficult. In this paper, considering the highly dynamic features of uncertain time-varying leader velocity and network-induced delays, the optimal formation control algorithms for both near-equilibrium and general dynamic control cases are developed. First, the discrete-time error dynamics of UAV leader–follower models are analyzed. Next, a linear quadratic optimization problem is formulated with the objective of minimizing the errors between the desired and actual states consisting of velocity and position information of the follower. The optimal formation tracking problem of near-equilibrium cases is addressed by using a backward recursion method, and then the results are further extended to the general dynamic case where the leader moves at an uncertain time-varying velocity. Additionally, angle deviations are investigated, and it is proved that the similar state dynamics to the general case can be derived and the principle of control strategy design can be maintained. By using actual real-world data, numerical experiments verify the effectiveness of the proposed optimal UAV formation-tracking algorithm in both near-equilibrium and dynamic control cases in the presence of network-induced delays.

1. Introduction

UAVs have been extensively studied and have attracted more and more attention due to their high maneuverability and versatility [1,2,3]. For example, UAVs can be used as an air communication platform to provide or improve communication services for ground equipment and can also be used as air relays to address issues such as information transmission, monitoring, and control. Currently, UAVs with communication capabilities have been widely used in many scenarios, such as communication assistance, intelligent transportation, disaster rescue, and low-altitude monitoring [4,5,6,7,8].
However, a single UAV is limited by both battery energy and action scale constraints, thus causing challenges in mission implementation capabilities. Compared with a single UAV, the coordination of multiple UAVs brings advantages such as high efficiency, robustness, and flexibility. UAV swarms have a wide range of potential application, especially in highly reconfigurable and distributed intelligence autonomous systems [9,10,11]. Leader–follower formation control, as the most classic formation method to ensure successful mission execution, has the simplest formation control structure and requires the fewest communication connections between UAVs [12,13,14]. Although extensive studies have been performed on leader–follower formation control, it still faces unique challenges relating to the requirements of real-time application [15,16]. First, how to effectively address the influence of network-induced delays on the multi-dimensional formation control of UAVs is a new challenge. Network-induced delays degrade the performance of UAV systems and bring potential drawbacks to the systems’ stability [17,18]. Most existing works focus on stability analysis considering network-induced delays instead of effective control strategy design. For example, in some works, Lyapunov–Krasovskii functionals and Young’s inequalities are normally used in UAV control design to achieve stability for the whole formation. Making matters worse, the high mobility and dynamic nature of UAV application scenarios make the formation control of UAVs much more complicated and difficult. Considering the dynamic leader velocity, the measurement information may not be consistent with the current state of the leader, thus increasing the difficulty of designing a real-time control strategy.
Unfortunately, there is little research that focuses on the scenario which considers both time-varying leader velocity and network-induced delays, which are the fundamental dynamic natures introduced by the high mobility of UAVs [19]. In this paper, the optimal UAV formation tracking control design is comprehensively investigated for the leader–follower model in the discrete-time domain with dynamic leader velocity and network-induced delays. First, a linear discrete-time model of UAVs’ error dynamics is formulated. Subsequently, the optimal formation tracking control problem of near-equilibrium cases is addressed with stable leader velocity. Finally, the results are further extended to the general time-varying velocity case, and the effect of angle deviation is also analyzed. The main contributions are summarized as follows.
  • Based on the analysis of UAVs’ error dynamics, considering the high dynamic characteristics including both uncertain time-varying leader velocity and network-induced delays, the discrete-time UAV system model is presented. Then, the formation tracking optimization problem is formulated as a linear quadratic cost function.
  • To alleviate the influence of dynamic features, a two-step optimal formation tracking control algorithm is proposed in near-equilibrium control cases. That is, the optimal control strategy determined by the current states of the UAVs and previous control signals can be obtained during the online step, while the corresponding control gain is derived during the offline step by using backward recursion.
  • Additionally, it is found that the proposed optimal control algorithm can be extended to the general dynamic case when the leader has dynamic time-varying velocity. Finally, the angle deviations are investigated, and it is proved that the similar state dynamics as the general case can be derived, thus the principle of the proposed control strategy for the general dynamic case can be maintained.
  • Numerical experiment results based on real UAV flight data demonstrate that the proposed optimal UAV formation-tracking algorithm is applicable to general dynamic control cases in the presence of network-induced delays. In addition, compared with existing algorithms, faster convergence speed and better system stability are achieved.
The rest of this paper is organized as follows. In Section 2, some related works are reviewed. In Section 3, the system model and problem formulation are presented. Afterward, the formation tracking control algorithm is proposed in Section 4. Section 5 is the simulation results and discussion, and the conclusion is presented in Section 6.

2. Related Works

Network-induced delays significantly degrade the system performance and control stability of UAV formation tracking [20]. At present, significant efforts have been made to compensate for the influence of network-induced delays on UAV control. Lin et al. [21] study the consensus problem of a continuous-time multi-agent system with delay and jointly connected topology, and then a linear delay-based protocol is proposed and the sufficient condition for average consensus is achieved. When the input delay is considered, Zhu et al. [22] propose an event-based leader–follower consensus algorithm, and the necessary and sufficient conditions are presented. Considering both time delays and switching topology, the necessary and sufficient conditions for finite-field consensus of networks are derived by Li [23], subject to limited computation, memory, and communication capabilities. Then, the authors further extend the results to the multi-agent system in [24]. Currently, Chen et al. [25] use the Lyapunov–Krasovskii functionals and Young’s inequalities in the design process for the leader–follower consensus problem to eliminate the effects of network-induced delays.
In addition, due to the challenge of high mobility UAV applications, the dynamic formation control problem with system uncertainty has been considerable interest [26,27,28,29]. Olfati-Saber et al. [30] propose a distributed algorithm for flocking control that allows all groups of UAVs to ultimately achieve the same velocity, and the result is further expanded in [31] to show that the velocity and position of the center of the UAV swarm can exponentially converge on the virtual leader. In order to alleviate the velocity information requirements of the leader, Ghommam et al. [32] propose an adaptive feedback control algorithm restricted to parametric uncertainties of LOS-based leader–follower formations. Necessary and sufficient conditions for UAV swarm systems to achieve time-varying formations are developed in [33], but the time-delay is ignored. Currently, Yazdani et al. [34] design a continuous adaptive controller to solve the flocking problem of a multi-agent system with a dynamic virtual leader, which is also affected by both time-varying uncertainty and external interference. Considering both constant and time-varying velocities of the leader, a distributed coordinated tracking control scheme with network-induced delay and external interference is proposed in [35].
The comparison with relevant existing works can be summarized as in Table 1, in which “Yes” means that the related item is studied in the existing work, while “No” means that it is not. Different from these existing works, both dynamic leading velocity and network-induced delays are investigated in this paper. Additionally, an optimal control strategy for UAV formation tracking control under these conditions is derived.

3. System Model and Problem Formulation

Figure 1 shows a typical 3D-based formation tracking control model. The leader UAV is controlled by a control station with dynamic velocity, and follower UAVs keep the desired formation while tracking with the leader. The state information of the leader is measured through sensors and transmitted by wireless sensor agent networks. Then, the control signal is generated depending on the measured information [1,3]. In this scenario, each UAV takes the nearest UAV as the tracking target and follows it, so the whole formation can be divided into several basic leader–follower units. Nevertheless, due to the high mobility of UAVs, the leader always has dynamic velocity and acceleration. Thereby, two fundamental challenges arise. One is how to guarantee that the follower keeps up with the leader smoothly under uncertain time-varying leader velocity. The other is how to reduce the effect of network-induced delays. The features of the dynamic leader velocity and network-induced delays are depicted in Figure 2, which significantly affects the design of the controller.
The dynamics of a follower are given by
l ˙ ( t ) = p ( t ) , p ˙ ( t ) = u ( t Δ T ( t ) ) ,
where l ( t ) = [ l x ( t ) ,   l y ( t ) ,   l z ( t ) ] T and p ( t ) = [ p x ( t ) ,   p y ( t ) ,   p z ( t ) ] T are the position and velocity of the follower, respectively; u ( t ) = [ u x ( t ) ,   u y ( t ) ,   u z ( t ) ] T represents the follower’s acceleration, namely the formation tracking control strategy; Δ T ( t ) is the network-induced delay, which mainly includes the leader-to-controller delay, controller-to-follower delay, and signal processing delay.
The dynamics of desired states can be described by two integrators [31]
l ˙ r ( t ) = p r ( t ) , p ˙ r ( t ) = g r ( l r ( t ) , p r ( t ) ) ,
where p r ( t ) = [ p r x ( t ) ,   p r y ( t ) ,   p r z ( t ) ] T and l r ( t ) = [ l r x ( t ) ,   l r y ( t ) ,   l r z ( t ) ] T are the desired time-varying velocity and position information of leader, respectively, which are wholly determined by the leader’s state; g r ( l r ( t ) , p r ( t ) ) represents the leader’s acceleration.
The leader–follower model for UAV formation is shown in Figure 3. The objective for the follower is to track the reference trajectory generated by the leader and keep a desired time-varying formation. Define the velocity and position errors as
Δ p ( t ) = p ( t ) p r ( t ) , Δ l ( t ) = l ( t ) l r ( t ) .
Then, the error dynamics of follower formation tracking can be derived as
Δ l ˙ ( t ) = Δ p ( t ) , Δ p ˙ ( t ) = u ( t Δ T ( t ) ) g r ( l r ( t ) , p r ( t ) ) .
Define the state vector as
s ( t ) = [ Δ l x ( t ) ,   Δ p x ( t ) ,   Δ l y ( t ) ,   Δ p y ( t ) ,   Δ l z ( t ) ,   Δ p z ( t ) ] T .
Based on the error dynamics, the formation tracking model can be expressed as
s ˙ ( t ) = A s ( t ) + B [ u ( t Δ T ( t ) ) g r ( l r ( t ) , p r ( t ) ) ] ,
where
A = [ A ¯ 0 2 × 2 0 2 × 2 0 2 × 2 A ¯ 0 2 × 2 0 2 × 2 0 2 × 2 A ¯ ] , B = [ B ¯ 0 2 × 1 0 2 × 1 0 2 × 1 B ¯ 0 2 × 1 0 2 × 1 0 2 × 1 B ¯ ] ,
0 i × j denotes the i × j zero matrix, and the block matrices are given by
A ¯ = [ 0 1 0 0 ] ,   B ¯ = [ 0 1 ] .
Figure 4 shows the timing diagram for the UAV formation tracking control. The continuous-domain UAV states are first sampled, and then transmitted to the controller. Based on the received state information, the generated control signal is finally transmitted to the actuator to improve the formation tracking of the follower. In the control procession, it can be seen that network-induced delays from the sensor to the controller and from the controller to the actuator are caused, which are typically assumed to be stochastic.
Then, the corresponding discrete-time dynamics in the j - t h sampling interval [ j T ,   ( j + 1 ) T ) can be derived as (see derivation details in Appendix A)
s j + 1 = E s j + D j 1 u j + D j 2 u j 1 + δ j ,
where
s j = s ( j T ) ,   E = e A T , D j 1 = 0 T Δ T j e A t d t B ,   D j 2 = T Δ T j T e A t d t B , δ j = j T ( j + 1 ) T e A [ ( j + 1 ) T t ] g r ( l r ( t ) , p r ( t ) ) d t B ,
u j is the control signal generated based on system state s j , and Δ T j is the network-induced delay in j - t h sampling interval, which is typically assumed to be stochastic.
Note that due to the inherent high mobility of the leader, the network-induced delay Δ T j causes the time-varying feature of D j 1 and D j 2 . On the other hand, the dynamic leader’s acceleration introduces the uncertain item δ j . The objective of the formation tracking control is to minimize the errors between desired and actual states of the follower and to maintain smooth control of the follower. Therefore, using the typical quadratic cost function, the optimization formation tracking problem in the dynamic leader–follower system can be formulated as
min { u j }   E [ s N T Q s N + j = 0 N 1 ( s j T Q s j + u j T R u j ) ] s . t .   s j + 1 = E s j + D j 1 u j + D j 2 u j 1 + δ j ,
where N is the finite time horizon, Q and R are determined system parameters, E is the expectation operator due to the stochastic nature of the leader’s velocity and network-induced delays, s j T Q s j represents the tracking errors, and u j T R u j denotes the effect of follower acceleration on preventing harsh control reactions.

4. Formation Tracking Control Algorithm

In this section, in order to solve the formation tracking optimization problem in (11), a near-equilibrium case in which the leader moves along a straight line at a near-constant velocity is considered at first. The optimal control strategy is derived by a two-step algorithm through backward iteration. Then, the results are extended to the general dynamic case in which the leader moves with an uncertain time-varying velocity. Lastly, the angle deviation is analyzed.

4.1. Near-Equilibrium Control Strategy Design

In most of the cases, the formation tracking system remains steady so that the leader flies smoothly at a near-constant velocity and the follower attempts to maintain its stable formation throughout tracking the reference trajectory. That is to say, the leader velocity keeps a near-constant value of p r ( t ) = p * + Δ , where Δ 0 . In this near-equilibrium case, the acceleration is g r ( l r ( t ) , p r ( t ) ) 0 and the network-induced delay is Δ T j Δ T . That is, the uncertain item δ j is approximately equal to zero.
Then, the optimization problem (11) can be simplified as
min { u j }   s N T Q s N + j = 0 N 1 ( s j T Q s j + u j T R u j ) s . t .   s j + 1 = E s j + D 1 u j + D 2 u j 1 ,
where D 1 and D 2 become the determined parameters.
Let
x j = [ s j u j 1 ] .
Then, the discrete-time dynamics can be rewritten as
x j + 1 = F x j + H u j ,
where
F = [ E D 2 0 3 × 6 0 3 × 3 ] ,   H = [ D 1 I 3 × 3 ] ,
and I i × i denotes the i × i identity matrix.
Subsequently, the optimization problem (12) is equivalent to
min { u j }   x N T Q ¯ x N + j = 0 N 1 ( x j T Q ¯ x j + u j T R u j ) s . t .   x j + 1 = F x j + H u j ,
where
Q ¯ = [ Q 0 6 × 3 0 3 × 6 0 3 × 3 ] .  
Theorem 1.
The optimal control strategy for the near-equilibrium formation tracking problem (12) is given by
u j * = L j x j ,   j = 0 ,   1 , ,   N 1 ,
where  L j is iteratively calculated by
L j = [ H T S j + 1 H + R ] 1 H T S j + 1 F , S j = F T S j + 1 F + Q ¯ L j T H T S j + 1 F , S N = Q ¯ .
The proof can be achieved similarly to the derivation process of optimal control strategy in [6].

4.2. General Dynamic Control Strategy Design

In general, the leader flies with a time-varying velocity with a highly dynamic state. Then, the uncertain time-varying term δ j , typically assumed to be a stochastic variable with zero mean value and variance matrix σ δ , should be considered together with the time-varying terms D j 1 and D j 2 in the optimal formation tracking control design.
Based on the definition x j = [ s j T ,   u j 1 T ] T in (13), the dynamic formation tracking control problem (11) can be rewritten as
min { u j }   E [ x N T Q ¯ x N + j = 0 N 1 ( x j T Q ¯ x j + u j T R u j ) ] s . t .   x j + 1 = F ˜ j x j + H ˜ j u j + [ δ j T , 0 3 × 1 ] T ,
where F ˜ j , H ˜ j , and δ j are time-varying items that
F ˜ j = [ E D j 2 0 3 × 6 0 3 × 3 ] ,   H ˜ j = [ D j 1 I 3 × 3 ] .
Define the residual cost function as
V j = min { u i } E [ x N T Q ¯ x N + i = j N 1 ( x i T Q ¯ x i + u i T R u i ) ] .
Theorem 2.
The optimal control design for the general dynamic formation tracking problem (20) can be similarly derived as
u ˜ j * = L ˜ j x j ,   j = 0 ,   1 , ,   N 1 ,  
where L ˜ j can be iteratively calculated by
L ˜ j = E [ H ˜ j T S ˜ j + 1 H ˜ j + R ] 1 E [ H ˜ j T S ˜ j + 1 F ˜ j ] , S ˜ j = E [ F ˜ j T S ˜ j + 1 F ˜ j ] + Q ¯ L ˜ j T E [ H ˜ j T S ˜ j + 1 F ˜ j ] , S ˜ N = Q ¯ ,
and the corresponding residual cost function is given by
V ˜ j = E [ x j T S ˜ j x j ] + i = j + 1 N t r ( S ˜ i 1 , 1 σ δ ) .
where S ˜ i 1 , 1 denotes the ( 1 , 1 ) t h block of S ˜ i with the same size as δ i , and t r ( ) is the trace of the matrix.
Proof of Theorem 2.
When j = N 1 : V ˜ N 1 can be deduced by
V ˜ N 1 = min u ˜ N 1 E [ [ x N 1 u N 1 ] T ˜ [ x N 1 u N 1 ] ] + E [ δ N 1 T S ˜ N 1 , 1 δ N 1 ] = E min u ˜ N 1 [ [ x N 1 u N 1 ] T ˜ [ x N 1 u N 1 ] ] + t r ( S ˜ N 1 , 1 σ δ ) ,
where
˜ = [ b 1 , 1 b 1 , 2 T b 2 , 1 b 2 , 2 ] ,
that
b 1 , 1 = E [ F ˜ N 1 T S ˜ N F ˜ N 1 ] + Q ¯ , b 2 , 2 = E [ H ˜ N 1 T S ˜ N H ˜ N 1 ] + R , b 2 , 1 = E [ H ˜ N 1 T S ˜ N F ˜ N 1 ] .
Accordingly, the optimal control strategy can be derived as
u ˜ N 1 * = L ˜ N 1 x N 1 ,
where
L ˜ N 1 = E [ H ˜ N 1 T S ˜ N H ˜ N 1 + R ] 1 E [ H ˜ N 1 T S ˜ N F ˜ N 1 ] ,
and the corresponding residual cost function has the quadratic form
V ˜ N 1 = E [ x N 1 T S ˜ N 1 x N 1 ] + t r ( S ˜ N 1 , 1 σ δ ) .
When j = N 2 ,   ,   1 ,   0 : assuming V ˜ i , i j + 1 also has the quadratic form
V ˜ j = E [ x j T S ˜ j x j ] + i = j + 1 N t r ( S ˜ i 1 , 1 σ δ ) .
Similar to the derivation from (16) to (24) in [6], the optimal formation tracking control strategy for general dynamic cases can be given by
u ˜ j * = L ˜ j x j ,
where L ˜ j can be deduced iteratively on the basis of S ˜ j + 1 as in (24), and
V ˜ j = E [ x j T S ˜ j x j ] + i = j + 1 N 1 t r ( S ˜ i 1 , 1 σ δ ) .
Thus, when the uncertain item δ k is considered, an extra cost item i = j + 1 N t r ( S ˜ i 1 , 1 σ δ ) is introduced in the residual cost function. Although time-varying items F ˜ j , H ˜ j and δ j introduced by the highly dynamic features of the leader exist, the same form of control strategy can be obtained in two steps in a backward recursion manner, which is summarized as Algorithm 1.
Algorithm 1: Formation Tracking Control Design.
1   Step 1: off-line
2   Initialize S ˜ N = Q ¯ .
3   for  j = N 1 : 1 : 0 do
4   Calculate L ˜ j by using
5    L ˜ j = E [ H ˜ j T S ˜ j + 1 H ˜ j + R ] 1 E [ H ˜ j T S ˜ j + 1 F ˜ j ] .
6   Calculate S ˜ j by using
7    S ˜ j = E [ F ˜ j T S ˜ j + 1 F ˜ j ] + Q ¯ L ˜ j T E [ H ˜ j T S ˜ j + 1 F ˜ j ] .
8   end
9   Step 2: On-line
10  Initialize s 0 ,   u ˜ j * = 0 ,   j 0 .
11  for  j = 0 : 1 : N 1 do
12   Update s j and x j = [ z j T , ( u ˜ j 1 * ) T ] T .
13   Calculate u ˜ j * by using u ˜ j * = L ˜ j x j .
14  end

4.3. Angle Deviation Analysis

UAV formation tracking control usually consists of an outer-loop control, including position and velocity control, and an inner-loop control, including pitch/yaw/roll angle control. Many works simply focus on the outer-loop control strategy design. However, the angle deviation is of great importance to UAVs’ motion.
Below, taking an electric quad rotor aircraft as an example, the angle deviation is investigated. In an electric quad rotor aircraft, each motor is attached to a rigid cross frame. Vertical motion and forward/backward motion are controlled by the collective throttle input, i.e., the sum of the thrusts of each motor, and controlling the differential speed of the front and rear motors. The left/right motion of the vehicle is achieved by controlling the differential speed of the right and left motors. The quad-rotor dynamics, evolving in 3D and subject to one force and 3 moments, are modeled by the following equations [13,36].
c = f 1 + f 2 + f 3 + f 4 , f i = k i w i 2 , i = 1...4 , m l ¨ x = c sin θ , m l ¨ y = c cos θ sin ϕ , m l ¨ z = c cos θ cos ϕ m g , ψ ¨ = ζ ψ , θ ¨ = ζ θ , ϕ ¨ = ζ ϕ ,
where c is the total thrust produced by four rotors, f i is the force generated by rotor i , k i 0 is a constant, w i is the angular speed of motor i , m is the quadcopter’s mass, and g is the gravitational constant. ψ , θ and ϕ represent yaw, pitch, and roll angles, respectively, and ζ ψ ,   ζ θ , and ζ ϕ are the control inputs for yawing, pitching, and rolling moments, respectively.
Define the state vector as
s = [ l x , p x , l y , p y , l z , p z , ψ , ψ ˙ , θ , θ ˙ , ϕ , ϕ ˙ ] T .
The control vector is defined as
u = [ c 1 , c 2 , c 3 , c 4 ] T = [ c m g , ζ ψ , ζ θ , ζ ϕ ] .
Then, the dynamics equation can be derived as
[ l ˙ x p ˙ x l ˙ y p ˙ y l ˙ z p ˙ z ψ ˙ ψ ˙ θ ˙ θ ˙ ϕ ˙ ϕ ˙ ] = [ p x c 1 sin θ / m g sin θ p y c 1 cos θ sin ϕ / m + g cos θ sin ϕ p z c 1 cos θ cos ϕ / m + g cos θ cos ϕ g ψ ˙ c 2 θ ˙ c 3 ϕ ˙ c 4 ] .
Based on [13,36], the above dynamics equation can be linearized by a Taylor series as
s ˙ ( t ) = A s ( t ) + B u ( t ) ,
where A and B respectively represent the state matrix and input matrix that
A = [ 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 g 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 g 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 ] , B = [ 0 0 0 0 0 1 m 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 1 ] .
When the network-induced delay Δ T ( t ) and angle deviations Δ ψ , Δ θ and Δ ϕ to the yaw, pitch, and roll angles are considered. Assuming the angle deviations are small, the dynamics equation can be further derived based on (38) and (39) as
s ˙ ( t ) = A s ( t ) + B u ( t Δ T ( t ) ) + ω ( t ) ,
where the disturbance term is given by
ω ( t ) = [ 0 Δ θ g cos θ 0 Δ ϕ g cos θ cos ϕ + Δ θ g sin θ sin ϕ 0 Δ ϕ g cos θ sin ϕ + Δ θ g sin θ cos ϕ 0 6 × 1 ] ,
and here 0 i × 1 denotes the zero matrix with i × 1 size.
Similarly, the corresponding discrete-time dynamics in the j - t h sampling interval [ j T ,   ( j + 1 ) T ) can be derived as
s j + 1 = E s j + D j 1 u j + D j 2 u j 1 + δ ˜ j ,
where δ ˜ j = j T ( j + 1 ) T e A [ ( j + 1 ) T t ] ω ( t ) d t B and other parameters are given as the same as in (10).
It is found that the discrete-time dynamics considering angle deviation can be integrated as the same form of (9). Therefore, the proposed control strategy in (23) for the general dynamic control case can still be used to address the problem of angle deviation.

5. Simulations and Discussion

In this section, numerical simulations in accordance with real UAV flight data [8,9,32] are given to demonstrate the effectiveness of the proposed UAV formation tracking control strategy in a leader–follower model. In the near-equilibrium case, the quasi-static velocity and fixed desired distance between leader and follower are set. While in the general dynamic case, the desired velocity p and distance l r * are dynamic. The simulation parameters of both the near-equilibrium case and the general dynamic case are summarized in the Table 2.
First, the position, velocity, and 3D trajectories of followers in the near-equilibrium case are shown in Figure 5, Figure 6 and Figure 7. Afterward, the results are extended to the general dynamic case. In the near-equilibrium case, it can be seen that the velocity and position errors approach zero and that the proposed optimal control algorithm can ensure system stability under various network-induced delays. In addition, we observe that the network-induced delay causes performance degradation of UAV formation tracking because it requires more time for the follower to catch up with the leader when the network-induced delay becomes larger.
To be more specific, in the x dimension, the velocity of the follower gradually changes from the initial value of 5 m/s to the desired value of 10 m/s, and the relevant distance error between the leader and the follower reaches 0 step by step as the speed of the follower increases. In the y dimension, the velocity of the follower first increases from 5 m/s to the desired value of 8 m/s, while the distance error between the leader and the follower increases to around 8 m because the velocity of the follower is lower than the desired velocity. Then, the velocity of the follower continues to increase to around 13 m/s in order to reduce the distance between the leader and the follower. Finally, the velocity of the follower converges with the desired velocity so that the distance error between the leader and the follower is further reduced to the desired value of 0. Similarly, in the z dimension, as the velocity of the follower first decreases and then increases to the desired velocity of 3 m/s, the distance error between the leader and the follower gradually decreases to the desired value of 0. In Figure 6, the same results are shown in the 3D trajectory. The follower can quickly catch up with the leader when there is a sudden velocity and distance error between the leader and the follower, and the follower can maintain the formation’s stability once it successfully follows the leader.
The position, velocity, and 3-D trajectories of the follower in the general dynamic case are shown in Figure 8, Figure 9 and Figure 10. It can be seen that the follower can catch up with the leader even if the leader moves with an uncertain time-varying velocity, which indicates that the proposed optimal control algorithm can achieve excellent formation tracking performances in both near-equilibrium and general dynamic cases. However, the dynamic velocity causes serious disturbance to control stability. In the x and y dimensions, at first, the state of the follower remains stable until the velocity of the leader changes. The unstable item δ j is introduced to the formation system, and the network-induced delay is time-varying as well. Fortunately, the follower still can catch up with the leader in the presence of both the dynamic leader velocity and network-induced delays. Similarly, the performance degradation is introduced by network-induced delay (i.e., the smaller the network-induced delay, the faster the follower reacts).
The performance comparisons between the proposed algorithm and the existing algorithm 1 [34] that ignores network-induced delays and the existing algorithm 2 in [21] that ignores the dynamic feature of the UAV leader are shown in Figure 11 and Figure 12. Compared with the existing algorithms, our proposed algorithm reacts more quickly when the velocity of the leader is changing. This is due to our inclusion of the last control strategy and the uncertain time-varying term in the current control strategy, thus effectively compensating for the influence of dynamic leader velocity and network-induced delays. The algorithm in [21] did not include the time-varying term in the system model and ignored the dynamic features of the system. The algorithm in [34] addressed this problem using a novel approach, but the effect of network-induced delays is still not accounted for. It is obvious that our proposed algorithm provides better stability and faster convergence.

6. Conclusions

In this paper, considering the highly dynamic features, including the uncertain time-varying acceleration of the leader and network-induced delays, the optimal control strategy for UAV formation tracking systems is studied. In order to control the follower to achieve the desired state, a linear quadratic optimization problem is proposed with the objective of minimizing the errors between the actual and desired states of the followers. Then, a two-step backward recursion algorithm is developed to address this challenge in a near-equilibrium case, and the results are extended to a general dynamic case. Lastly, the angle deviation is analyzed, and the proposed control strategy algorithm can also be applied to solve such motion problems. Simulated experiments demonstrate the effectiveness of the proposed algorithm. It can be concluded that even if the time-varying acceleration of the leader is uncertain, the follower can still quickly reach the desired state. In addition, the network-induced delay will result in slower reactions by the follower.

Author Contributions

Conceptualization, L.L. and Z.W.; methodology, L.L., M.X. and Z.W.; software, M.X., H.C. and Y.S.; validation, M.X., Z.W., C.F. and H.C.; formal analysis, L.L., M.X. and C.F.; investigation, M.X., H.C. and Y.S.; data curation, M.X. and Y.S.; writing—original draft preparation, L.L., M.X. and Z.W.; writing—review and editing, L.L., M.X. and Z.W.; project administration, L.L. and Z.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Beijing Natural Science Foundation 4222002 and L202016, the National Natural Science Foundation of China under Grants 61901011 and 62001011, the Foundation of Beijing Municipal Commission of Education KM201910005026, and the Beijing Nova Program of Science and Technology Z191100001119094.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Appendix A

Below, the derivation from (6) to (9) is shown. First, based on (6), multiplying both sides of the equation by an exponential term as
e A t s ˙ ( t ) = e A t A s ( t ) + e A t B [ u ( t Δ T ( t ) ) g r ( l r ( t ) , p r ( t ) ) ] .
Then, we have
d e A t s ( t ) d t = e A t B [ u ( t Δ T ( t ) ) g r ( l r ( t ) , p r ( t ) ) ] .
Integrating both sides of (A2) as
j T ( j + 1 ) T d e A t s ( t ) d t = j T ( j + 1 ) T e A t B [ u ( t Δ T ( t ) ) g r ( l r ( t ) , p r ( t ) ) ] .
Then, we have
s j + 1 = e A T s j + j T ( j + 1 ) T e A [ ( j + 1 ) T t ] B [ u ( t Δ T ( t ) ) g r ( l r ( t ) , p r ( t ) ) ] d t .
In a sampling interval, the control strategy is piecewise and includes two parts due to network-induced delays. During [ j T , j T + Δ T j ) and ( j T + Δ T j ,   ( j + 1 ) T ] , the control strategy is u j 1 and u j , respectively. Then, the formula (A4) can be written as
s j + 1 = e A T s j + j T j T + Δ T j e A [ ( j + 1 ) T t ] B d t u j 1 + j T + Δ T j ( j + 1 ) T e A [ ( j + 1 ) T t ] B d t u j j T ( j + 1 ) T e A [ ( j + 1 ) T t ] g r ( l r ( t ) , p r ( t ) ) d t B .
The discrete-time dynamics can be further obtained as
s j + 1 = e A T s j + 0 T Δ T j e A t d t B u j + T Δ T j T e A t d t B u j 1 j T ( j + 1 ) T e A [ ( j + 1 ) T t ] g r ( l r ( t ) , p r ( t ) ) d t B ,
which is equivalent to the discrete-time dynamics in (9).

References

  1. Gupta, L.; Jain, R.; Vaszkun, G. Survey of important issues in UAV communication networks. IEEE Commun. Surv. Tutor. 2015, 18, 1123–1152. [Google Scholar] [CrossRef] [Green Version]
  2. Mozaffari, M.; Saad, W.; Bennis, M.; Nam, Y.H.; Debbah, M. A tutorial on UAVs for wireless networks: Applications, challenges, and open problems. IEEE Commun. Surv. Tutor. 2019, 21, 2334–2360. [Google Scholar] [CrossRef] [Green Version]
  3. Erdelj, M.; Natalizio, E.; Chowdhury, K.R.; Akyildiz, I.F. Help from the sky: Leveraging UAVs for disaster management. IEEE Pervasive Comput. 2017, 16, 24–32. [Google Scholar] [CrossRef]
  4. Wallace, L.; Lucieer, A.; Watson, C.; Turner, D. Development of a UAV-LiDAR system with application to forest inventory. Remote Sens. 2012, 4, 1519–1543. [Google Scholar] [CrossRef] [Green Version]
  5. Hayat, S.; Yanmaz, E.; Muzaffar, R. Survey on unmanned aerial vehicle networks for civil applications: A communications viewpoint. IEEE Commun. Surv. Tutor. 2016, 18, 2624–2661. [Google Scholar] [CrossRef]
  6. Wang, Z.; Gao, Y.; Fang, C.; Liu, L.; Zhou, H.; Zhang, H. Optimal control design for connected cruise control with stochastic communication delays. IEEE Trans. Veh. Technol. 2020, 69, 15357–15369. [Google Scholar] [CrossRef]
  7. Gu, J.; Su, T.; Wang, Q.; Du, X.; Guizani, M. Multiple moving targets surveillance based on a cooperative network for multi-UAV. IEEE Commun. Mag. 2018, 56, 82–89. [Google Scholar] [CrossRef]
  8. Pack, D.J.; DeLima, P.; Toussaint, G.J.; York, G. Cooperative control of UAVs for localization of intermittently emitting mobile targets. IEEE Trans. Syst. Man Cybern. Part B Cybern. 2009, 39, 959–970. [Google Scholar] [CrossRef]
  9. Liu, X.; Liu, Y.; Chen, Y. Reinforcement learning in multiple-UAV networks: Deployment and movement design. IEEE Trans. Veh. Technol. 2019, 68, 8036–8049. [Google Scholar] [CrossRef] [Green Version]
  10. Skorobogatov, G.; Barrado, C.; Salamí, E. Multiple UAV systems: A survey. Unmanned Syst. 2020, 8, 149–169. [Google Scholar] [CrossRef]
  11. Lin, Z.; Ding, W.; Yan, G.; Yu, C.; Giua, A. Leader–follower formation via complex Laplacian. Automatica 2013, 49, 1900–1906. [Google Scholar] [CrossRef]
  12. Wu, F.; Chen, J.; Liang, Y. Leader-follower formation control for quadrotors. IOP Conf. Ser. Mater. Sci. Eng. 2017, 187, 012016. [Google Scholar] [CrossRef] [Green Version]
  13. Ali, Q.; Montenegro, S. Explicit model following distributed control scheme for formation flying of mini-UAVs. IEEE Access 2016, 4, 397–406. [Google Scholar] [CrossRef]
  14. Motlagh, N.H.; Taleb, T.; Arouk, O. Low-altitude unmanned aerial vehicles-based internet of things services: Comprehensive survey and future perspectives. IEEE Internet Things J. 2016, 3, 899–922. [Google Scholar] [CrossRef]
  15. Wang, Z.; Wang, X.; Liu, L.; Huang, M.; Zhang, Y. Decentralized feedback control for wireless sensor and actuator networks with multiple controllers. Int. J. Mach. Learn. Cybern. 2017, 8, 1471–1483. [Google Scholar] [CrossRef]
  16. Khuwaja, A.A.; Chen, Y.; Zhao, N.; Alouini, M.-S.; Dobbins, P. A survey of channel modeling for UAV communications. IEEE Commun. Surv. Tutor. 2018, 20, 2804–2821. [Google Scholar] [CrossRef] [Green Version]
  17. Wang, Z.; Gao, Y.; Fang, C.; Liu, L.; Guo, S.; Li, P. Optimal connected cruise control with arbitrary communication delays. IEEE Syst. J. 2019, 14, 2913–2924. [Google Scholar] [CrossRef]
  18. Zhang, Q.; Jiang, M.; Feng, Z.; Li, W.; Zhang, W.; Pan, M. IoT enabled UAV: Network architecture and routing algorithm. IEEE Internet Things J. 2019, 6, 3727–3742. [Google Scholar] [CrossRef]
  19. Motlagh, N.H.; Bagaa, M.; Taleb, T. Energy and delay aware task assignment mechanism for UAV-based IoT platform. IEEE Internet Things J. 2019, 6, 6523–6536. [Google Scholar] [CrossRef] [Green Version]
  20. Khawaja, W.; Guvenc, I.; Matolak, D.W.; Fiebig, U.C.; Schneckenburger, N. A survey of air-to-ground propagation channel modeling for unmanned aerial vehicles. IEEE Commun. Surv. Tutor. 2019, 21, 2361–2391. [Google Scholar] [CrossRef] [Green Version]
  21. Lin, P.; Jia, Y. Consensus of a class of second-order multi-agent systems with time-delay and jointly-connected topologies. IEEE Trans. Autom. Control 2010, 55, 778–784. [Google Scholar]
  22. Zhu, W.; Jiang, Z.P. Event-based leader-following consensus of multi-agent systems with input time delay. IEEE Trans. Autom. Control 2014, 60, 1362–1367. [Google Scholar] [CrossRef]
  23. Li, X.; Chen, M.Z.Q.; Su, H.; Li, C. Consensus networks with switching topology and time-delays over finite fields. Automatica 2016, 68, 39–43. [Google Scholar] [CrossRef]
  24. Li, Y.; Li, H.; Ding, X.; Zhao, G. Leader–follower consensus of multiagent systems with time delays over finite fields. IEEE Trans. Cybern. 2018, 49, 3203–3208. [Google Scholar] [CrossRef]
  25. Chen, K.; Wang, J.; Zhang, Y.; Liu, Z. Leader-following consensus for a class of nonlinear strick-feedback multiagent systems with state time-delays. IEEE Trans. Syst. Man Cybern. Syst. 2018, 50, 2351–2361. [Google Scholar] [CrossRef]
  26. Mei, J.; Ren, W.; Ma, G. Distributed coordinated tracking with a dynamic leader for multiple Euler-Lagrange systems. IEEE Trans. Autom. Control 2011, 56, 1415–1421. [Google Scholar] [CrossRef]
  27. Wang, J.; Xin, M. Integrated optimal formation control of multiple unmanned aerial vehicles. IEEE Trans. Control. Syst. Technol. 2012, 21, 1731–1744. [Google Scholar] [CrossRef]
  28. Shi, J.; Yue, D.; Xie, X. Optimal leader-follower consensus for constrained-input multiagent systems with completely unknown dynamics. IEEE Trans. Syst. Man Cybern. Syst. 2020, 52, 1182–1191. [Google Scholar] [CrossRef]
  29. Abdessameud, A.; Tayebi, A.; Polushin, I.G. Leader-follower synchronization of Euler-Lagrange systems with time-varying leader trajectory and constrained discrete-time communication. IEEE Trans. Autom. Control 2016, 62, 2539–2545. [Google Scholar] [CrossRef]
  30. Olfati-Saber, R. Flocking for multi-agent dynamic systems: Algorithms and theory. IEEE Trans. Autom. Control 2006, 51, 401–420. [Google Scholar] [CrossRef] [Green Version]
  31. Su, H.; Wang, X.; Lin, Z. Flocking of multi-agents with a virtual leader. IEEE Trans. Autom. Control 2009, 54, 293–307. [Google Scholar] [CrossRef]
  32. Ghommam, J.; Saad, M. Adaptive leader–follower formation control of underactuated surface vessels under asymmetric range and bearing constraints. IEEE Trans. Veh. Technol. 2017, 67, 852–865. [Google Scholar] [CrossRef]
  33. Dong, X.; Yu, B.; Shi, Z.; Zhong, Y. Time-varying formation control for unmanned aerial vehicles: Theories and applications. IEEE Trans. Control. Syst. Technol. 2014, 23, 340–348. [Google Scholar] [CrossRef]
  34. Yazdani, S.; Su, H. A fully distributed protocol for flocking of time-varying linear systems with dynamic leader and external disturbance. IEEE Trans. Syst. Man Cybern. Syst. 2020, 52, 1234–1242. [Google Scholar] [CrossRef]
  35. Sun, Y.; Dong, D.; Qin, H.; Wang, N.; Li, X. Distributed coordinated tracking control for multiple uncertain Euler–Lagrange systems with time-varying communication delays. IEEE Access 2019, 7, 12598–12609. [Google Scholar] [CrossRef]
  36. Lara, D.; Sanchez, A.; Lozano, R.; Castillo, P. Real-time embedded control system for VTOL aircrafts: Application to stabilize a quad-rotor helicopter. In Proceedings of the IEEE International Symposium on Intelligent Control, Munich, Germany, 4–6 October 2006; pp. 2553–2558. [Google Scholar]
Figure 1. UAV formation tracking model.
Figure 1. UAV formation tracking model.
Entropy 24 00305 g001
Figure 2. Dynamic features.
Figure 2. Dynamic features.
Entropy 24 00305 g002
Figure 3. Leader–follower model for UAV formation.
Figure 3. Leader–follower model for UAV formation.
Entropy 24 00305 g003
Figure 4. Timing diagram for UAV formation tracking through a wireless communication network.
Figure 4. Timing diagram for UAV formation tracking through a wireless communication network.
Entropy 24 00305 g004
Figure 5. Scenario 1: velocity trajectory comparisons of followers under different delays in near-equilibrium case.
Figure 5. Scenario 1: velocity trajectory comparisons of followers under different delays in near-equilibrium case.
Entropy 24 00305 g005
Figure 6. Scenario 1: position trajectory error comparisons of followers under different delays in near-equilibrium case.
Figure 6. Scenario 1: position trajectory error comparisons of followers under different delays in near-equilibrium case.
Entropy 24 00305 g006
Figure 7. Scenario 1: 3-D trajectory of follower in near-equilibrium case.
Figure 7. Scenario 1: 3-D trajectory of follower in near-equilibrium case.
Entropy 24 00305 g007
Figure 8. Scenario 2: velocity trajectory comparisons of followers under different delays in general dynamic case.
Figure 8. Scenario 2: velocity trajectory comparisons of followers under different delays in general dynamic case.
Entropy 24 00305 g008
Figure 9. Scenario 2: position trajectory errors comparisons of followers under different delays in general dynamic case.
Figure 9. Scenario 2: position trajectory errors comparisons of followers under different delays in general dynamic case.
Entropy 24 00305 g009
Figure 10. Scenario 2: 3D trajectory of follower in general dynamic case.
Figure 10. Scenario 2: 3D trajectory of follower in general dynamic case.
Entropy 24 00305 g010
Figure 11. Scenario 2: velocity trajectory comparisons with the existing algorithms.
Figure 11. Scenario 2: velocity trajectory comparisons with the existing algorithms.
Entropy 24 00305 g011
Figure 12. Scenario 2: Position trajectory error comparisons with the existing algorithms.
Figure 12. Scenario 2: Position trajectory error comparisons with the existing algorithms.
Entropy 24 00305 g012
Table 1. Comparison with existing works.
Table 1. Comparison with existing works.
Ref.Dynamic
Leader Velocity
DelayApproachResult
[21]NoYesNeighbor-based linear protocol with time-delay.A sufficient condition is derived and time-delay cannot be arbitrarily large.
[22]NoYesA piecewise constant and neighbor-based feedback control rule.A necessary condition is presented and continuous communication between neighboring agents is avoided.
[24]NoYesFinite-field leader–follower consensus protocol with time delays and switching topology.Two criteria for the finite-field leader–follower consensus with time delays and switching topology are presented.
[25]NoYesAn adaptive leader–follower consensus control protocol with unknown nonlinearities and state time-delays.The consensus tracking error will converge to an adjustable neighborhood of the origin.
[30]NoNoThree flocking algorithms: two for free flocking and one for constrained flocking.Migration of flocks can be performed using a peer-to-peer network of agents, i.e., “flocks need no leaders.”
[31]YesNoFlocking of multi-agent protocol with a virtual leader.Modification to the Olfati-Saber algorithm in [30].
[33]NoYesConsensus-based approaches are applied to achieve time-varying formation.Necessary and sufficient conditions for UAV swarm systems to achieve time-varying formations are proposed.
[34]YesNoA continuous adaptive controller is designed.An adaptive estimator for each uninformed agent can estimate the velocity of the leader.
[35]YesYesAn adaptive leader–follower formation control protocol is proposed.The overall closed-loop system is proved to be semi-globally, uniformly, and ultimately bounded by Lyapunov stability theory.
Table 2. Simulation parameters setting.
Table 2. Simulation parameters setting.
ParameterScenario 1
Near-Equilibrium Case
Scenario 2
General Dynamic Case
Sampling period0.4 s0.4 s
Network-induced delays τ [ 0.1   s , 0.2   s ] τ [ 0.1   s , 0.2   s ]
Desired velocityFixed
( p x , p y , p z ) = ( 10 , 8 , 3 ) m/s
Dynamic,
average velocity: 15 m/s
Desired distanceDepend on velocityDepend on velocity
UncertaintyNone Disturbance distribution N ( 0 , 1 )
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Wang, Z.; Xu, M.; Liu, L.; Fang, C.; Sun, Y.; Chen, H. Optimal UAV Formation Tracking Control with Dynamic Leading Velocity and Network-Induced Delays. Entropy 2022, 24, 305. https://doi.org/10.3390/e24020305

AMA Style

Wang Z, Xu M, Liu L, Fang C, Sun Y, Chen H. Optimal UAV Formation Tracking Control with Dynamic Leading Velocity and Network-Induced Delays. Entropy. 2022; 24(2):305. https://doi.org/10.3390/e24020305

Chicago/Turabian Style

Wang, Zhuwei, Mengjiao Xu, Lihan Liu, Chao Fang, Yang Sun, and Huamin Chen. 2022. "Optimal UAV Formation Tracking Control with Dynamic Leading Velocity and Network-Induced Delays" Entropy 24, no. 2: 305. https://doi.org/10.3390/e24020305

APA Style

Wang, Z., Xu, M., Liu, L., Fang, C., Sun, Y., & Chen, H. (2022). Optimal UAV Formation Tracking Control with Dynamic Leading Velocity and Network-Induced Delays. Entropy, 24(2), 305. https://doi.org/10.3390/e24020305

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop