1. Introduction
With the rapid development of science and technology, the world’s military powers attach great importance to the cooperation capability of an unmanned combat system. In recent decades, air–ground heterogeneous unmanned combat systems, which consist of unmanned aerial vehicles (UAVs) and unmanned ground vehicles (UGVs), have been favored by military powers due to their fast response speed, strong communication capability, strong payload capacity, and high target reconnaissance accuracy [
1,
2,
3].
In the field of multi-agent systems, cooperative control has received extensive attention and research. Examples of such applications include the field of robot collaboration, UAV formation, cooperative transport, and combat reconnaissance [
4]. Formation control is an application hotspot in the field of distributed cooperation. In general, formation control can be divided into two categories according to the presence or absence of leader agents: leader and leaderless [
5,
6,
7,
8]. Reference [
9] used the leader–follower method to complete the trajectory tracking task of the UAV–UGV system but did not cooperate in the process of completing the formation. Furthermore, Reference [
10] studied the cooperation problem of the UAV–UGV system by improving the artificial physics approach but did not study the formation problem. Reference [
11] studied the time-varying formation control of cooperative heterogeneous multi-agent systems and combined formation with cooperation.
In the cooperative reconnaissance and cooperative strike of the unmanned combat system, based on forming an established formation, the unmanned combat system must also identify the complex and volatile battlefield environments and quickly cross barriers, such as fences and fortifications. Therefore, quickly making the whole formation reach the desired state is also an important concern of the formation problem. Reference [
12] used a virtual-structure-based approach and multiple-impedance control to achieve the optimal formation of three mobile robots, and the mobile robots carried out the cooperative formation. However, this study was based on the study of homogeneous multi-agents. In Reference [
13], the leader–follower strategy and the virtual leader strategy were integrated into an optimal control framework to study the optimal formation of multiple UAVs. However, this study did not investigate the cooperation of multiple UAVs. For the optimal formation of heterogeneous multi-agent, there are few related research results. Reference [
14] is based on using reinforcement learning methods to achieve the optimal formation of heterogeneous multi-agent systems but does not study cooperative control. It should be noted that optimal formation control alone can only solve a relatively limited number of problems, and cooperative optimal formation control is still an open problem.
In addition, most of the existing research results are based on the same dynamic model, namely the homogeneous agent model [
15,
16,
17,
18]. Compared with homogeneous multi-agent systems, multi-agent systems composed of heterogeneous dynamic models are more flexible in practical applications. Therefore, it is of great significance to study heterogeneous multi-agent systems. A large number of valuable research results have been obtained for the heterogeneous cooperative problem [
19,
20,
21].
In this paper, the cooperative optimal formation problem of heterogeneous multi-agent systems is studied on the unmanned aerial vehicle and unmanned ground vehicle model. There are threefold main innovations:
Firstly, a heterogeneous modeled UAV–UGV system is proposed, a cooperative architecture of heterogeneous multi-agent systems with equal number is designed, and a Laplacian matrix of communication topology is designed. In addition, a novel block Kronecker product is used to describe the UAV–UGV system. Based on this, distributed formation control is proposed.
The second contribution is to introduce the optimal control method into the formation control protocol, design the distributed formation optimal control protocol, and prove the stability using the method of graph theory.
The third contribution is to design a cooperative formation control protocol for the air–ground system based on the heterogeneous system model so that the UAV–UGV system can achieve the cooperative formation effect. Then, the optimal control is introduced into the cooperative formation control protocol, and the cooperative optimal formation control protocol is designed, which enables the UAV–UGV system formation to quickly achieve the expected effect.
3. Design of Control Protocol
To realize the formation of heterogeneous multi-agent systems of UAVs and UGVs, this section is based on the formation control protocol. Firstly, the optimal control law is applied to the single agent. Then, according to the combination of the optimal control law and the formation control protocol, a heterogeneous multi-agent system with distributed optimal formation control is realized. Finally, according to the motion equation of the heterogeneous multi-agent system, the cooperative formation control and cooperative optimal formation control of the heterogeneous multi-agent system are realized.
Lemma 1. [
22].
For an Laplacian matrix is a random matrix with positive diagonal elements. If has a unique zero eigenvalue, Rank , then its left eigenvector has and , where, ,.
3.1. Formation Control
Formation control protocol for the UAVs:
Formation control protocol for the UGVs:
where
represent the positive gain coefficients,
Protocols (9) and (10) shall be unified into the same type:
Define the state-space form of the multi-agent system formation:
where
is the identity matrix, and is the Kronecker product.
Theorem 1. If Protocols (9) and (10) are satisfied,, Systems (2) and (5) can implement the formation defined in (3) and (6).
Proof of Theorem 1. Substitute Formula (11) into Formula (12) to obtain: where
According to the linear stability theorem, the parameters , and need to be selected so that has a zero eigenvalue and other eigenvalues have genuine negative parts. The parameters and need to meet the stability of UGV consistency, and the parameters and need to meet the stability of UAV consistency. After selecting parameters, can be converted to a Jordan standard type: . Let be the first row of and the left eigenvector have eigenvalue 0. Let the first column of and the right eigenvector have eigenvalue Therefore, ; as time approaches infinity, the system’s state becomes:. According to Lemma 1, as time approaches infinity, Systems (2) and (5) asymptotically agree and the systems complete formation. □
3.2. Optimal Control
The solution of optimal control requires the states of all multi-agents. Before providing performance indicators, and are defined:
Then, define the performance indicator function as:
As the UAV and the UGV are independent in different coordinate systems, the UAV weight must be set to where, . The UGV weight must be set to , where .
According to the optimal control theory, the optimal control law of a single agent UAV is:
, where
is the solution of Riccati’s in Equation (15)
The optimal control law for a single agent UGV is:
, where
is the solution of Riccati’s in Equation (16)
Through the above calculation, the optimal control law can be obtained. Let , the dimension of matrix is expressed as Similarly, the optimal control law can also be solved, let the dimension of matrix is expressed as .
3.3. Distributed Optimal Formation Control
All UAVs have the same dynamics model, so all UAVs are homogeneous multi-agents. Similarly, all UGVs are homogeneous multi-agents. Therefore, optimal control laws can be extended to the formation control of UAV and UGV multi-agent systems.
Define the multi-agent system to be optimized:
where
,
and
are derived from the matrix
and
are derived from the matrix
.
Theorem 2. If the unmanned ground vehicle system in (2) and the unmanned aerial vehicle system in (5) use Protocols (17) and (18), respectively, the formation can be completed, and Performance Functions (13) and (14) can be optimized.
Proof of Theorem 2. Protocols (17) and (18) shall be unified into the same type:
where
is the identity matrix, and
is the Kronecker product.
Let =
Substituting into Equation (19):
Elementary row and column transformation can be taken on
:
If , it can be seen that:.
where has only zero eigenvalues. Therefore, we must select the parameters , so that has zero eigenvalue, and all other eigenvalues have negative real parts. The parameters , need to meet the stability of UAV consistency, and the parameters and need to meet the stability of UGV consistency. After determining the parameters, can be converted to a Jordan standard type: . Let be the first row of and the left eigenvector have eigenvalue 0. Let be the first column of and the right eigenvector with eigenvalue Therefore, ; when the time approaches infinity, the system’s state becomes: , . According to Lemma 1 it is then seen that the system can reach asymptotic consensus in cases where time tends toward infinity. □
3.4. Heterogeneous Cooperative Formation Control
For UAV:
For UGV:
In combination with the Laplace matrix, Protocols (21) and (22) are rewritten as:
Define the state-space form of the heterogeneous multi-agent formation:
where
Theorem 3. if Protocols (21) and (22) meet,, the heterogeneous system in (7) can be achieved, and (8) is defined in the formation, then formation control is realized.
Proof of Theorem 3. Substitute Equation (23) into Equation (24) and obtain:
where is the identity matrix and is the Kronecker product.
Internal stability parameters and should guarantee the stability of a single UAV agent. Therefore, the UAV is written as:
, where .
The characteristic polynomial is:
where
is the identity matrix. For a single UAV:
According to the Routh–Hurwitz stability criterion:
According to the linear stability theorem, the parameter , and should be selected so that there is a zero eigenvalue and other eigenvalues have negative genuine part parameters. The parameters and must meet the stability of UGV consistency, and the parameters and must meet the stability of UAV consistency. After selecting the parameters, can be converted to a Jordan standard type: . Let be the first row of and the left eigenvector have eigenvalue 0. Let the first column of and the right eigenvector have eigenvalue Therefore, ; when the time approaches infinity, the system’s state becomes: , . According to Lemma 1, when the time approaches infinity, the system in (7) asymptotically agrees; that is, the system achieves cooperative formation.□
3.5. Heterogeneous Cooperative Optimal Formation Control
UAV systems and UGV systems are heterogeneous systems. Applying the optimal control law to the heterogeneous system can be expressed as:
where
,and
are derived from the matrix
and
are derived from the matrix
.
In combination with the Laplace matrix, Protocols
and
are rewritten as:
Define the state space of heterogeneous multi-agent formation:
where
Theorem 4. Heterogeneous systems can complete formation and optimize Performance Functions (13) and (14) if they use Protocols (25) and (26), respectively.
Proof of Theorem 4. Each UAV and UGV is defined as a group of formation units, so their number is made the same, that is, L = K; the block Laplacian matrix of the system has the same number of rows and columns.
Then, the heterogeneous system in (7) becomes:
where
,
.
According to State Equation (29):
It is easy to discover the eigenvalues of the matrix : .
Verify the rank of the matrix with the above eigenvalues. Let :
Rank = Rank = 6, according to the rank criterion of PBH, the system state space is controllable.
Let =
Substituting into Equation (28):
The
basic determinant change:
where
Now
, so matrix
is similar to Λ. Therefore, there is a nonsingular transformation matrix Q, making
,
is the matrix for which the sum of each row is zero. Therefore, there is at least one zero eigenvalue. Elementary row and column transformation can be taken on
:
If
,
, it can be seen that:
where the
and
have only zero eigenvalues. Therefore, we must select the parameters
,
so that
has zero eigenvalue and all other eigenvalues have negative real parts. The parameters
,
must meet the stability of UAV consistency, and the parameters
and
must meet the stability of UGV consistency. After determining the parameters,
can be converted to a Jordan standard type:
. Let
be the first row of
and the left eigenvector have eigenvalue 0. Let
the first column of
and the right eigenvector have eigenvalue
Therefore,
; when the time approaches infinity, the system’s state becomes:
,
. According to Lemma 1, it is then seen that the system can reach asymptotic consensus in cases where time tends toward infinity.□
4. Simulations
The formation protocol, distributed optimal formation protocol, cooperative formation protocol, and cooperative optimal formation protocol individually designed in this paper are simulated and analyzed using Matlab2016a. The effectiveness of the designed control protocol is verified via simulation. To achieve a better cooperative formation task effect in the system, the speed values set by the UAV system and the UGV system are similar.
The initial state is as follows: the position of UAV1 is , the position of UAV2 is , and the position of UAV3 is . The speed of UAV1 is , the speed of UAV2 is , and the speed of UAV3 is . The attitude angle of UAV1 is , the attitude angle of UAV2 is , and the attitude angle of UAV3 is
The rate of the attitude angle of UAV1 is , the rate of the attitude angle of UAV2 is , and the rate of the attitude angle of UAV3 is . The setting position of UAV1 is , the setting position of UAV2 is , and the setting position of UAV3 is . The UAV and UGV systems have a set speed of . The position of UGV1 is , the position of UGV2 is , and the position of UGV3 is . The speed of UGV1 is , the speed of UGV2 is , and the speed of UGV3 is . The setting position of UGV1 is , the setting position of UGV2 is , and the setting position of UGV3 is .
The parameters are .
Simulations of the UAV system and the UGV system using formation control and distributed optimal formation control are shown in
Figure 1 and
Figure 2. By comparing the two figures in
Figure 1 and
Figure 2, it can be found that both protocols can be used to complete triangular formation at the same time, but when distributed optimal formation control is used, it can be observed that the error between the actual position and the set value is significantly reduced in a short period of time.
When using the formation control protocol, with parameters
.
Figure 1 shows the change of position coordinates of the multi-agent system and shows the actual formation position of the UAV and UGV when
. It can be seen that there is a significant error with the set value.
When using the distributed optimal formation control protocol, with parameters
.
Figure 2 shows the position coordinate changes of the system and shows the actual formation status of the UAV and UGV when
s. It can be observed that the system can quickly complete the triangular formation and that the error between the actual position and the set value is small.
It can be seen from
Figure 3 and
Figure 4 that the state of each system variable changes with time when the formation control protocol is used. It can be seen from the figures that the state of each variable in the system is stable from 40 s to 50 s.
From
Figure 5 and
Figure 6, it can be seen that the system uses the distributed optimal formation control protocol to change the state of each variable with time. It is observed that the system reaches stability between 30 and 40 s.
The simulation of cooperative formation protocol and cooperative optimal formation protocol of heterogeneous multi-agents is shown in
Figure 7 and
Figure 8.
When using the cooperative formation control protocol, with parameters
.
Figure 7 shows the change of position coordinates of the system and the expected formation state of the system at the 10th second. It can be seen that the actual formation position of the system has a significant error with the set value.
When using the cooperative optimal formation control protocol, with parameter .
Figure 8 shows the change in position coordinates of the system and the expected formation state of the system at t=10 s. It can be observed that the system can quickly complete the triangular formation and that the error between the actual position and the set value is small. In addition, the cooperative optimal formation protocol speeds up the convergence rate of the system, which is of great help to the formation time of the system.
When using the formation control and the distributed optimal formation control, the formation states of the UAV system at different times are shown in
Figure 9 and
Figure 10.
Figure 9 shows the formation status of the UAV system at different times under the use of the formation control protocol. It can be observed that the system can complete the triangle formation but also that there is a relative reach error with the set position in the formation completion process and that there is still an error at the fiftieth second.
Figure 10 shows the formation status of the UAV system at different times under the use of the distributed optimal formation control protocol. It can be observed that the system can quickly complete the triangle formation and that the error between the actual position and the set value is very small at the thirtieth second.
When using the formation control and the distributed optimal formation control, the formation states of the UGV system at different times are shown in
Figure 11 and
Figure 12.
Figure 11 shows the formation status of the UGV system at different moments when the formation control protocol is used. It can be observed that the UGV system can complete the triangle formation, but there is still a certain error with the set value in the formation completion process. At the thirtieth second, the difference between the actual position and the set value gradually decreases.
Figure 12 shows the change in the formation shape of the UGV system at different moments when the distributed optimal formation control protocol is used; it can be observed that the UGV system can quickly complete the triangle formation and that the error between the actual position and the set value is small.
When using the heterogeneous cooperative formation control and the heterogeneous cooperative optimal formation control, the formation states of the UAV system at different times are shown in
Figure 13 and
Figure 14.
Figure 13 shows the formation status of the UAV system at different times under the use of the heterogeneous cooperative formation control protocol. It is observed that the system can complete the triangle formation. At the thirtieth second, the difference between the actual position and the set value gradually decreases.
Figure 14 shows the formation status of the UAV system at different times under the use of the heterogeneous cooperative optimal formation control. It can be observed that the system can quickly complete the triangle formation and that the error between the actual position and the set value is very small at the tenth second.
When using the heterogeneous cooperative formation control and the heterogeneous cooperative optimal formation control, the formation states of the UGV system at different times are shown in
Figure 15 and
Figure 16.
Figure 15 shows the formation status of the UGV system at different times under the use of the heterogeneous cooperative formation control protocol. It can be observed that the system can complete the triangle formation. At the thirtieth second, the difference between the actual position and the set value gradually decreases.
Figure 16 shows the formation status of the UGV system at different times under the use of the heterogeneous cooperative optimal formation control. It can be observed that the system can quickly complete the triangle formation and that the error between the actual position and the set value is very small at the tenth second.
In order to fully verify the theoretical results, the structure is complicated, and the communication Laplacian matrix is as follows:
The added set of UAV states is as follows: the position of UAV4 is
, the position of UAV5 is
, and the position of UAV
. The speed of UAV4
, the speed of UAV5
, and the speed of UAV6 is
.The attitude angle of UAV4 is
, the attitude angle of UAV5
, and the attitude angle of UAV6 is
. The setting position of UAV4 is
, the setting position of UAV5 is
, and the setting position of UAV6 is
. The UAV systems have a set speed of
. The simulation results are shown in
Figure 17,
Figure 18,
Figure 19 and
Figure 20.
The experiments have verified the formation control and distributed optimization formation control, as shown in
Figure 17 and
Figure 18, respectively. By comparing
Figure 17 and
Figure 18, it can be found that both protocols can be used to complete triangular formation at the same time, but when distributed optimal formation control is used, it can be observed that the error between the actual position and the set value is significantly reduced in a short period of time.
When using the formation control protocol, with parameters
,
Figure 17 shows the change of position coordinates of the complex system and the expected formation state of the complex system. However, it can be seen that there is a significant error with the set value.
When using the distributed optimal formation control protocol, with parameters
.
Figure 18 shows the change of position coordinates of the complex system and the expected formation state of the complex system. It can be observed that the complex system can quickly complete the triangular formation and that the error between the actual position and the set value is small.
The experiments have verified the cooperative formation control and the cooperative optimal formation control, as shown in
Figure 19 and
Figure 20, respectively.
When using the cooperative formation control protocol, with parameters
,
Figure 19 shows the change in position coordinates of the complex system and the expected formation state of the complex system. However, it can be seen that there is a significant error with the set value.
When using the cooperative optimal formation control protocol, with parameters
,
Figure 20 shows the change in position coordinates of the complex system and the expected formation state of the complex system. It can be observed that the complex system can quickly complete the triangular formation and that the error between the actual position and the set value is small. In addition, the cooperative optimal formation protocol speeds up the convergence rate of the complex system, which is of great help to the formation time of the complex system.