1. Introduction
Multi-agent systems (MASs) are widely used in rescue [
1,
2,
3], reconnaissance [
4,
5,
6], exploration [
7,
8,
9], and other fields. They can cooperate to complete complex tasks that a single agent cannot complete.
In recent years, formation control of multi-agent systems has become a scorching research field, and formation control has also been widely applied in many engineering fields, such as robot field [
10,
11,
12], attitude control of multi-satellite systems [
13,
14], and flight control of UAVs [
15,
16]. By using an appropriate formation control method, the formation control challenge is to accomplish some difficult global activities. Many formation control methods and techniques have been studied. For example, paper [
17] uses A* algorithm combined with an optimization algorithm for formation to realize a collision-free path. In the literature [
18], multiple mobile robots adopt the leader-following method for formation control. This paper [
19] uses the method of minimizing the Kullback–Leibler divergence to achieve the desired formation of multiple UAVs. To reduce the communication burden of multiple UAVs, this paper [
20] designs a distributed frame structure, the control protocol includes event triggering, and the system completes the distributed formation. However, the above results for formation control are applied to isomorphic multi-agent systems. In practical applications, there are many heterogeneous multi-agent systems (HMASs) with different structures, dynamic models, and even information perception and decision-making abilities. Article [
21] studies the formation problem of a class of general first/second-order discrete-time heterogeneous multi-agent systems. In article [
22], agents from the same group build a time-varying formation in their own dimension, while agents from different groups move cooperatively in different dimensions to accomplish the cross-dimensional formation of heterogeneous multi-agent systems.
When applied in practice, how can the system quickly complete the task? This will consider optimization problems such as the following: In paper [
23], distributed optimal control is proposed to optimize the trajectory of many unicycle robots. The optimal collaborative control of linear multi-agent systems is investigated in this paper [
24], and the proposed controller can minimize the quadratic global cost function to a specific optimal value, with the optimal solution being independent of uncertainty. In the paper [
25], multi-agent systems’ leader–follower formation control problem adopts a stress matrix, which has better formation flexibility. The inverse optimal control theory is used to demonstrate the effectiveness of distributed control protocols in paper [
26].
Due to the collaboration of multi-agent systems that should be are autonomous, fault-tolerant, coordinated, adaptable, and scalable, they may also involve a variety of other fields of research, including resource exploration [
27,
28], target positioning [
29,
30], environmental monitoring [
31,
32], and military exercise [
33,
34]. So far, the collaborative control problem of MASs has attracted wide attention from researchers in physics, autonomous vehicles, industrial engineering, biology, machine intelligence, and other fields. However, there are challenges in collaborative control between multiple agents with different dynamics.
Based on the above results, the literature [
21,
22] only studies the formation problem of heterogeneous systems, without considering the optimization of system formation. Literature [
23,
24,
25,
26] considers the flexibility and optimization of formation based on the same dynamic model. In this paper, we discuss the formation problem of mixed-order multi-agent systems by combining the optimal control theory and cooperative control and solve the formation optimization problem of systems with different dynamics models. At the same time, the system can achieve coordination. The main contributions are as follows:
Firstly, the dynamic model of each agent system is introduced, and the different dynamic model systems are written into a state space by using a block Kronecker product. Secondly, static formation control protocols and dynamic formation control protocols are designed, respectively, and graph theory is used to prove that the control protocols can complete formation. Furthermore, the optimal control law of each agent system is designed by using the optimal control theory, and the problem of dimensionality inconsistency is solved by using cooperative control. The optimal control theory and cooperative control are introduced into the static and dynamic formation control protocols, and the static and dynamic cooperative optimal formation control protocols are designed. Finally, the collaborative optimal formation control protocol and the formation control protocol are compared by simulation, and the system state variables can rapidly converge and realize the cooperative formation, which verifies the effectiveness of the optimal control and the cooperative control.
The rest of this paper is organized as the following. The preparatory knowledge and system model are introduced in
Section 2.
Section 3 introduces the design of the control protocol. Then in
Section 4, simulation experiments verify the effectiveness of the proposed control protocol.
Section 5 summarizes the main contents of this article.
2. Preliminaries
2.1. Graph Theory
A graph is used to represent the topology of the information exchange between agents, where is the set of agent nodes, and each node represents an agent. is the set of edges about , indicating that there is information exchange between agent and , and the information is from to . A weighted adjacency matrix with nonnegative adjacency elements , where is the weight of the edge . For , if the agents and can receive information from each other, then the elements in the adjacency matrix are ; otherwise, the element in the adjacency matrix is To simplify the calculation, let . Note that in the undirected graph, and in the directed graph.
The set of neighbor node , which represents the set composed of all agents that have information exchange with agent 𝑖. In the graph, the degree represents the number of neighbors of a node, that is, the number of edges per node. The node degree is defined as , and the degree matrix of the graph is diagonal, as follows: . The Laplace matrix L of a multi-agent system is defined as: .
2.2. Formation Definition
The problem of formation control is to find a control protocol that allows multiple intelligences to form formations or configurations. The desired vector of the system formation is:
The formation error vector can be expressed as:
By introducing the error vector, the formation problem is transformed into the error consistency problem about the state variables. When the system error vector reaches consistency, it means that the formation is realized.
Definition 1. When all the states in the system meet the definition of Formula (1), it indicates that the system realizes static formation control. Definition 2. When all the states in the system meet the definition of Formula (2), it indicates that the system realizes dynamic formation control. 2.3. High-Order UAV Dynamics Model
For the dynamics model of the UAV, refer to the literature [
35]. Referring to
Figure 1, unmanned aerial vehicle attitude angle includes: roll angle
, pitching angle
and yaw angle
. The roll angle refers to the rotation angle along the
axis, the pitch angle refers to the rotation angle along the
axis, the yaw angle refers to the rotation angle along the
axis.
and
are the torques of the four propellers due to rotation,
and
are the lift forces generated by the four propellers.
, and
denote the rotational inertia along the
,
and
axes, respectively. Combined with the literature [
36], the drag coefficients are neglected in this paper, the roll angle and pitch angle only have small variations, and the yaw angle has no variation, that is,
,
,
,
,
,
. The dynamics model of the UAV is simplified and the dynamics model is represented as follows:
where
,
,
represent position state,
and
represent the attitude state,
is the lift force in the height direction,
and
represent moments of the quadrotor,
, and
represent inertial moments of the quadrotor.
According to Equation (3), the Equation of state is:
The subscript a represents the UAV state variables, where
, ,
,
,
,
2.4. Second-Order USV Dynamics Model
The details dynamic equations of USV can be found in [
37]. In this article, the USV is limited to the water surface for cross-media relay communication, and the agents taken into consideration is the moving in a planar environment; systems are characterized by a second-order model.
where
represents the position state,
represents the velocity in the direction of
, and
represents the input of agent
.
According to Equation (5), the Equation of state is:
The subscript s represents the USV state variables, where
,
,
, , .
2.5. Second-Order UUV Dynamics Model
The details kinematic and dynamic equations of UUV can be found in [
38]. According to the structure of the UUV studied in the paper, there is no thruster to control the angular velocity in roll, the rolling has little influence on the translational motion, and the dynamics of the actuators and thrusters are reasonably neglected in this paper. It is assumed that the UUV is torpedo-type, such that the system is characterized by a second-order model [
39].
To simplify the problem, we present dynamic systems as follows:
Refer to
Figure 2, where
represents the position state,
represents the velocity in the direction of
, and
represents the input of agent
.
According to Equation (7), the Equation of state is:
The subscript u represents the UUV state variables, where
, ,
,
, , .
2.6. Heterogeneous Multi-Agent System
Write the UAV system, UUV system, and USV system as a state-space model:
where
, ,
, ,
, ,
, ,
, ,
, ,
, ,
, , .
The UAV, UUV, and USV are grouped into a group, and the formation can be extended to multiple pairs, expressed as: .
represent an m-dimensional identity matrix, is Kronecker product.
3. Design of Control Protocol
3.1. Formation Control Protocol
When the system formation reaches the desired state, the expected control input is 0, and the formation state equation of system (9) can be expressed as:
Based on literature [
40], a static formation control protocol is proposed:
Dynamic formation control protocol is proposed:
Lemma 1 [
41]
. For an Laplacian matrix is a random matrix with positive diagonal elements. If has a unique zero eigenvalue, Rank , then its left eigenvector has and , where, , . Theorem 1. For the formation equation of state (8), when the communication topology 𝐺 = (𝑉 𝐸, 𝐴) is connected undirected graph or directed graph containing spanning tree, if the gain parameter satisfies the conditions:, , , , , , , , using static formation control protocol can achieve formation and the system state variables to achieve convergence.
Proof of Theorem 1. To explain the system (9) communication relationship, Laplacian matrix relations are described as follows: .
Where, represents the Laplacian matrix relationship between isomorphic agents, and the Laplacian matrix represents the ith agent system pointing at the jth agent system, namely, the communication connection of heterogeneous agent system.
The static formation control protocol is written in a unified form:
where
Substituting into Equation (10) yields:
The parameters
,
,
,
must be chosen so that
has a zero eigenvalue and all the other eigenvalues have negative genuine parts. The gain parameter can be determined using the Routh–Hurwitz stability criterion [
42] as follows:
After selecting parameters to stabilize the system,
can be transformed into Jordan standard type as follows:
Let be the first row of and the left eigenvector of 0 eigenvalue, and be the first column of and the right eigenvector of 0 eigenvalue. Therefore, , as time approaches infinity, the system’s state becomes: , . Lemma 1 states that systems are asymptotically convergent and complete formation as the time approaches infinity. □
Theorem 2. For the formation equation of state (10), when the communication topology G = (V E, A) is connected undirected graph or directed graph containing a spanning tree, dynamic formation control protocol can achieve formation and the system state variables to achieve convergence.
Proof of Theorem 2. Refer to the proof of Theorem 1. □
3.2. Optimal Control
The main problem of optimal control research is to determine an optimal control law in the allowable control domain according to the mathematical model of the controlled object, so that the performance index of the system reaches the extreme value, that is, the optimal control law is determined when the performance index reaches the extreme value. The controlled objects in this paper are UAVs, USVs, and UUVs. In practical control problems, most of the control quantity is limited by objective conditions and can only be taken within a certain range, which is called admissible control. The performance index is a measure of system performance, and its content and form depend on the task to be completed by the optimal control problem. Based on the literature [
43,
44], the optimal control law is solved as follows:
Consider the equation of state of the system as:
The control vector
can minimize the following performance index:
Let the Hamilton function is:
If the Hamilton function satisfies the following conditions:
When the above conditions are satisfied, the optimal control law is obtained.
For the UAV system, let
, an integral performance index composed of error variables and control variables is constructed as follows:
where
is the symmetric non-negative definite matrix of the appropriate dimension, and
is the symmetric positive definite matrix of suitable dimension.
In order to facilitate engineering application, and in the performance index are taken as a diagonal linear matrix. When is taken, the first part of the performance index can be expressed as , which is the total measurement of tracking error in the process of movement of the system. When is taken, the second part of the performance index can be expressed as , which is the full measure of the system’s energy consumption.
Through the aforementioned analysis, the physical meaning of the quadratic performance index is to make the system’s dynamic error and energy consumption in the control process optimal. The optimal control law is as follows:
, where
is the solution of the Riccati equation:
The control parameter equation of UAV is: , the dimension of is 3 × 12, and it also has the following form .
For the USV system, let
, an integral performance index composed of error variables and control variables is constructed as follows:
where
is the symmetric non-negative definite matrix of appropriate dimension, and
is the symmetric positive definite matrix of suitable dimension.
In order to facilitate engineering application, and in the performance index are taken as a diagonal linear matrix. When is taken, the first part of the performance index can be expressed as , which is the total measurement of tracking error in the process of movement of the system. When is taken, the second part of the performance index can be expressed as , which is the full measure of the system’s energy consumption.
Through the aforementioned analysis, the physical meaning of the quadratic performance index is to make the system’s dynamic error and energy consumption in the control process optimal. The optimal control law is as follows:
, where
is the solution of the Riccati equation:
The control parameter equation of USV is: , the dimension of is 2 × 4, and it also has the following form .
For the UUV system, let
, an integral performance index composed of error variables and control variables is constructed as follows:
where
is the symmetric non-negative definite matrix of the appropriate dimension, and
is the symmetric positive definite matrix of the appropriate dimension.
In order to facilitate engineering application, and in the performance index are taken as a diagonal linear matrix. When is taken, the first part of the performance index can be expressed as , which is the total measurement of tracking error in the process of movement of the system. When is taken, the second part of the performance index can be expressed as , which is the full measure of the system’s energy consumption.
Through the aforementioned analysis, the physical meaning of the quadratic performance index is to make the system’s dynamic error and energy consumption in the control process optimal. The optimal control law is as follows:
, where
is the solution of the Riccati equation:
The control parameter equation of UUV is: , the dimension of is 3 × 6, and it also has the following form .
Theorem 3. For UAV state Equation (2), the optimal control law corresponding to performance indicator Equation (13) is .
Proof of Theorem 3. Suppose
is the optimal control satisfying the performance index Equation (13), then the minimum principle must be satisfied, and the Hamilton function is constructed according to Equation (13):
where
is a covariate, and since
is unconstrained, the minimum condition is Hamilton Function (19) with the control input
taking an unconditional minimum. Compute the derivative of the Hamilton function for the control input
:
, let
:
Due to ; therefore, Formula (20) uses Hamilton Formula (19) to obtain the minimum control, that is, the optimal control.
According to the regular Equation:
Let
, the matrix
is undetermined, so,
is substituted into Equation (21):
Substituting regular Equation (21) into Equation (22), we can achieve:
Equation (20) is substituted into the Riccati equation in (23):
Let , , then the solution of is positive definite, select Lyapunov functions:.
On the derivation: . Because , , there must be: , so there are: . According to Lyapunov stability theorem, the system with optimal control is asymptotically stable. □
3.3. Cooperative Control
Since the models of agents in heterogeneous systems are not the same, the key to collaboration is to find the common part between the models. The heterogeneous system studied consists of unmanned aerial vehicles (UAVs), unmanned surface vehicles (USVs), and unmanned underwater vehicles (UUVs). UAVs contain position state, velocity state, attitude angle, and attitude angle change rate state; USVs contain position state and velocity state; UUVs contain position state and velocity state. Therefore, position state and velocity state are the common domain of the three, unmanned aerial vehicles (UAVs) and unmanned underwater vehicles (UUVs) are three-dimensional space, and unmanned surface vehicles (USVs) are two-dimensional space. Transformation matrix , , , are proposed.
3.4. Collaborative Optimal Control for Mixed-Order Heterogeneous Systems
By changing only the gain parameters, the collaborative optimal control of the system is created without altering the formation control protocol’s structure, which preserves the distribution of the formation control protocol.
represents the number of neighbors of agent
in the UAVs,
represents the number of neighbors of agent
in the USVs, and
represents the number of neighbors of agent
in the UUVs. Therefore, the optimal control theory and cooperative control are added into the static formation control protocol can be obtained:
Cooperative optimal dynamic formation control protocol:
By adopting the above cooperative optimal control protocol, the system’s communication topology is not required to be a complete graph but only a connected graph. The above cooperative optimal control protocol takes into account the advantages of optimal control and cooperative control, which do not affect the cooperative control of the multi-agent system and can realize different kinds of task requirements according to the set performance indicators.
Theorem 4. For the formation equation of state (10), when the communication topology 𝐺 = (𝑉 𝐸, 𝐴) is connected undirected graph or directed graph containing a spanning tree, using the optimal control theory and cooperative control is added into the static formation control protocol can achieve optimal cooperative formation and the system state variables to achieve convergence.
Proof of Theorem 4. As mentioned above, the UAV, USV, and UUV are grouped into a group, the system matrix has the same number of rows and columns, and it can take the following forms:
where
The matrix is constructed according to the PBH criterion as follows:
Through matrix , its eigenvalue can be obtained as: . Let , the rank of the matrix can be obtained as , and according to the PBH criterion, the heterogeneous system is controllable.
The equation of a state for the cooperative optimal static formation control protocol is expressed as follows:
where
Substituting
into Equation (27), we can get:
where
By applying elementary row and row transformations, this matrix becomes:
where
From the matrix transformation described above, it can be inferred that , and the matrix is similar to the matrix . A non-singular transformation matrix exists, resulting in , , and is the matrix where the sum of each row is zero.
As a result, at least one eigenvalue is zero. The primary column and row transformation can be performed in
:
If
,
, then:
. Combined with the proof of Theorem 1,
can be converted to Jordan standard form:
Let
be the first row of
and the left eigenvector of zero eigenvalue, and
be the first column of
and the right eigenvector of zero eigenvalue. Therefore,
. The system state as time reaches infinity is:
Lemma 1 states that the system (27) can achieve convergence and the error vector is zero as the time tends to infinity, completing the cooperatively optimal formation. □
5. Conclusions
This paper proposes a cooperative optimal formation control strategy for mixed-order heterogeneous multi-agent systems based on optimal control theory and cooperative control.
Firstly, for heterogeneous multi-agent systems with different dimensions and models written as a state space, the block Kronecker product is used to write the system in space. Secondly, the graph theory matrix proves the effectiveness of the proposed dynamic and static formation control protocols. Further, the optimal control theory and cooperative control are proposed, and the dimensional inconsistency problem is solved by using the cooperative control, the optimal gain parameters are obtained by the optimal control, then the cooperative formation control is presented. Finally, the effectiveness of the cooperative optimal formation control protocol is verified by simulation, and it can be verified that the incorporation of optimal control theory and cooperative control can hasten the system’s convergence and complete the cooperative formation. In future work, the system will be affected by the environment, the USV and UUV will be affected by the ocean, the UAV will be affected by the wind speed, and the environmental effects will be the system obstacles, such obstacles will be taken into consideration in the system formation.