Collaborative Optimal Formation Control for Heterogeneous Multi-Agent Systems

Li, Yandong; Liu, Meichen; Lian, Jiya; Guo, Yuan

doi:10.3390/e24101440


by Yandong Li, Meichen Liu *, Jiya Lian and Yuan Guo

College of Computer and Control Engineering, Qiqihar University, Qiqihar 161000, China

* Author to whom correspondence should be addressed.

Entropy 2022, 24(10), 1440; https://doi.org/10.3390/e24101440

Submission received: 14 August 2022 / Revised: 25 September 2022 / Accepted: 28 September 2022 / Published: 10 October 2022


Abstract

In this paper, a distributed optimal control method is used to study the cooperative formation of heterogeneous multi-agent systems in an air–ground environment. The considered system consists of unmanned aerial vehicles (UAVs) and unmanned ground vehicles (UGVs). Optimal control theory is introduced into the formation control protocol, a distributed optimal formation control protocol is designed, and its stability is verified using graph theory. Furthermore, a cooperative optimal formation control protocol is designed, and its stability is analyzed using a block Kronecker product and matrix transformation theory. A comparison of simulation results shows that introducing optimal control theory shortens the formation time and accelerates the convergence of the system.

Keywords:

heterogeneous multi-agent system; unmanned aerial vehicle; unmanned ground vehicle; optimal control; cooperative formation control

1. Introduction

With the rapid development of science and technology, the world’s military powers attach great importance to the cooperation capability of an unmanned combat system. In recent decades, air–ground heterogeneous unmanned combat systems, which consist of unmanned aerial vehicles (UAVs) and unmanned ground vehicles (UGVs), have been favored by military powers due to their fast response speed, strong communication capability, strong payload capacity, and high target reconnaissance accuracy [1,2,3].

In the field of multi-agent systems, cooperative control has received extensive research attention. Applications include robot collaboration, UAV formation, cooperative transport, and combat reconnaissance [4]. Formation control is an application hotspot in the field of distributed cooperation. In general, formation control can be divided into two categories according to the presence or absence of leader agents: leader-based and leaderless [5,6,7,8]. Reference [9] used the leader–follower method to complete the trajectory tracking task of the UAV–UGV system, but the agents did not cooperate while completing the formation. Furthermore, Reference [10] studied the cooperation problem of the UAV–UGV system by improving the artificial physics approach but did not study the formation problem. Reference [11] studied the time-varying formation control of cooperative heterogeneous multi-agent systems and combined formation with cooperation.

In cooperative reconnaissance and cooperative strike missions, an unmanned combat system must not only form an established formation but also identify complex and volatile battlefield environments and quickly cross barriers such as fences and fortifications. Therefore, making the whole formation reach the desired state quickly is also an important concern of the formation problem. Reference [12] used a virtual-structure-based approach and multiple-impedance control to achieve the optimal formation of three mobile robots, and the mobile robots carried out the cooperative formation. However, that study considered only homogeneous multi-agent systems. In Reference [13], the leader–follower strategy and the virtual leader strategy were integrated into an optimal control framework to study the optimal formation of multiple UAVs. However, that study did not investigate the cooperation of multiple UAVs. For the optimal formation of heterogeneous multi-agent systems, there are few related research results. Reference [14] used reinforcement learning methods to achieve the optimal formation of heterogeneous multi-agent systems but did not study cooperative control. It should be noted that optimal formation control alone can only solve a relatively limited number of problems, and cooperative optimal formation control is still an open problem.

In addition, most of the existing research results are based on the same dynamic model, namely the homogeneous agent model [15,16,17,18]. Compared with homogeneous multi-agent systems, multi-agent systems composed of heterogeneous dynamic models are more flexible in practical applications. Therefore, it is of great significance to study heterogeneous multi-agent systems. A large number of valuable research results have been obtained for the heterogeneous cooperative problem [19,20,21].

In this paper, the cooperative optimal formation problem of heterogeneous multi-agent systems is studied using unmanned aerial vehicle and unmanned ground vehicle models. The main contributions are threefold:

Firstly, a heterogeneous UAV–UGV system model is proposed, a cooperative architecture with equal numbers of UAVs and UGVs is designed, and the Laplacian matrix of the communication topology is designed. In addition, a novel block Kronecker product is used to describe the UAV–UGV system. Based on this, distributed formation control is proposed.

The second contribution is to introduce the optimal control method into the formation control protocol, design the distributed formation optimal control protocol, and prove the stability using the method of graph theory.

The third contribution is to design a cooperative formation control protocol for the air–ground system based on the heterogeneous system model so that the UAV–UGV system can achieve the cooperative formation effect. Then, the optimal control is introduced into the cooperative formation control protocol, and the cooperative optimal formation control protocol is designed, which enables the UAV–UGV system formation to quickly achieve the expected effect.

2. Preliminaries

This section mainly introduces the preliminary knowledge of unmanned aerial vehicles and unmanned ground vehicles, including the use of graph theory to describe the internal relationship of the system and the state-space equation of the UAV system and the UGV system.

2.1. Graph Theory

A weighted undirected graph $G = (V, E, A)$ consists of $n$ vertices, where $V = \{v_{1}, v_{2}, \dots, v_{n}\}$ is the set of all vertices in the graph, and each vertex represents an agent. $E = \{e_{ij} = (v_{i}, v_{j})\} \subseteq V \times V$ is the edge set, and $e_{ij} = (v_{i}, v_{j})$ is the edge from vertex $v_{i}$ to vertex $v_{j}$; an edge between two vertices indicates that the two vertices exchange information. The graph is undirected if every edge allows two-way communication; otherwise, it is directed.

$A = [a_{ij}]_{n \times n}$ is the adjacency matrix describing the relationship between agents, where $a_{ij}$ is the weight of the edge $e_{ij} = (v_{i}, v_{j})$ and the diagonal elements of $A$ are all 0. For $i, j = 1, 2, 3, \dots, n$ ($i \neq j$), if the agents $v_{i}$ and $v_{j}$ can receive information from each other, then $a_{ij} = a_{ji} > 0$; otherwise, $a_{ij} = 0$.

In an undirected graph, the degree of a node is the number of its neighbors, that is, the number of edges incident to the node. The degree matrix $D = \mathrm{diag}\{d_{1}, \dots, d_{n}\}$ of the undirected graph $G$ is a diagonal matrix with $d_{i} = \sum_{j = 1}^{n} a_{ij}$. Then, the Laplacian matrix of $G$ is defined as $L = D - A$, which has at least one zero eigenvalue with $1 = [1, 1, \dots, 1]^{T}$ as its corresponding right eigenvector. In addition, $L$ has exactly one zero eigenvalue if and only if the graph $G$ contains a spanning tree (a directed spanning tree in the directed case).
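The definitions above can be sketched in a few lines of Python. This is only an illustration: the three-agent path graph and the helper name `laplacian` are assumptions made for the example, not part of the paper.

```python
def laplacian(adjacency):
    """Return L = D - A for a weighted adjacency matrix (nested lists)."""
    n = len(adjacency)
    lap = [[0] * n for _ in range(n)]
    for i in range(n):
        # d_i is the sum of the weights of the edges at vertex i
        degree = sum(adjacency[i][j] for j in range(n) if j != i)
        for j in range(n):
            lap[i][j] = degree if i == j else -adjacency[i][j]
    return lap


# Hypothetical undirected path graph 1 -- 2 -- 3 with unit weights
A = [[0, 1, 0],
     [1, 0, 1],
     [0, 1, 0]]
L = laplacian(A)

# Every row of L sums to zero, so L * 1 = 0 and the all-ones vector
# is a right eigenvector for the eigenvalue 0.
row_sums = [sum(row) for row in L]
print(L)
print(row_sums)
```

Because the row sums vanish by construction, the zero eigenvalue exists for any choice of weights; whether it is the only zero eigenvalue depends on the connectivity of the graph.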

2.2. UGV Dynamics Model

The motion model of a single UGV is:

\left\{\begin{array}{l} {\dot{p}}_{g i} = v_{g i} \\ {\dot{v}}_{g i} = u_{g i} \end{array}\right.

(1)

where $p_{g i} = [p_{g i}^{x}, p_{g i}^{y}, p_{g i}^{z}]^{T}$ is the position in ground space, $v_{g i} = [v_{g i}^{x}, v_{g i}^{y}, v_{g i}^{z}]^{T}$ is the velocity along the coordinates of $p_{g i}$, and $u_{g i} = [u_{g i}^{x}, u_{g i}^{y}, u_{g i}^{z}]^{T}$ is the input of agent $i$. For $k$ UGVs, the model is written in state-space form:

{\dot{X}}_{G} = A_{G} X_{G} + B_{G} U_{G}

(2)

where $X_{G} = (P_{G}, V_{G})^{T}$, $P_{G} = (p_{1}, p_{2}, p_{3}, \dots, p_{k})$, $p_{i} = (x_{i}, y_{i}, z_{i})$; $V_{G} = (v_{1}, v_{2}, v_{3}, \dots, v_{k})$, $v_{i} = (v_{i}^{x}, v_{i}^{y}, v_{i}^{z})$; $U_{G} = (u_{1}, u_{2}, u_{3}, \dots, u_{k})$, $u_{i} = (u_{i}^{x}, u_{i}^{y}, u_{i}^{z})$, for $i = 1, 2, \dots, k$; and

A_{G} = \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix} \otimes I_{k}, \quad B_{G} = \begin{bmatrix} 0 \\ 1 \end{bmatrix} \otimes I_{k} .

The subscript G denotes the state variables of the unmanned ground vehicles.
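As a rough illustration of how the Kronecker product assembles the stacked model (2), the sketch below builds $A_{G}$ and $B_{G}$ for $k = 3$ and evaluates one derivative. The helper names (`identity`, `kron`, `mat_vec`) and the numerical state are assumptions for the example, and a single coordinate axis is used so that the identity block is $I_{k}$ exactly as written in the text.

```python
def identity(n):
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]


def kron(a, b):
    """Kronecker product of two matrices given as nested lists."""
    return [[a_ij * b_kl for a_ij in a_row for b_kl in b_row]
            for a_row in a for b_row in b]


def mat_vec(m, x):
    return [sum(m_ij * x_j for m_ij, x_j in zip(row, x)) for row in m]


k = 3  # number of UGVs
A_G = kron([[0, 1], [0, 0]], identity(k))  # 2k x 2k
B_G = kron([[0], [1]], identity(k))        # 2k x k

# Hypothetical stacked state (positions first, then velocities) and input
X_G = [0.0, 1.0, 2.0, 0.5, -0.5, 0.0]
U_G = [1.0, 0.0, -1.0]

# X_dot = A_G X_G + B_G U_G: position rates equal the velocities,
# velocity rates equal the inputs.
X_dot = [ax + bu for ax, bu in zip(mat_vec(A_G, X_G), mat_vec(B_G, U_G))]
print(X_dot)
```

The first $k$ entries of the result reproduce the velocities and the last $k$ entries reproduce the inputs, which is the double-integrator model (1) written for all agents at once.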

The expected formation state is $h_{g} = (h_{g}^{x}, h_{g}^{y}, h_{g}^{z})^{T}$. Expressing the formation state in terms of the position state gives the new position error state:

δ_{G} = {(δ_{G}^{x}, δ_{G}^{y}, δ_{G}^{z})}^{T} = {(P_{g i}^{x} - h_{g}^{x}, P_{g i}^{y} - h_{g}^{y}, P_{g i}^{z} - h_{g}^{z})}^{T} .

Therefore, the problem of formation control becomes finding a protocol $U_{G}$ that drives the error vector $δ_{G}$ to zero, which means that

\lim_{t \to \infty} ∥ δ_{g i} - δ_{g j} ∥ = 0, \quad \lim_{t \to \infty} ∥ v_{g i} ∥ = 0 .

(3)

2.3. UAV Dynamics Model

The motion model of a single UAV is:

\left\{\begin{array}{l} \ddot{x} = g θ \\ \ddot{y} = - g ϕ \\ \ddot{z} = f_{z} / m - g \\ \ddot{ϕ} = M_{ϕ} / I_{x} \\ \ddot{θ} = M_{θ} / I_{y} \\ \ddot{φ} = M_{φ} / I_{z} \end{array}\right.

(4)

where $g$ is the acceleration of gravity; $m$ is the mass of the UAV; $x$, $y$, and $z$ are the positions of the UAV along the three coordinate axes; $ϕ$, $θ$, and $φ$ are the roll angle, the pitch angle, and the yaw angle of the UAV, respectively; $f_{z}$ is the lift force in the direction of height; $M_{ϕ}$, $M_{θ}$, and $M_{φ}$ are the torques about the three axes of the body coordinate system; and $I_{x}$, $I_{y}$, and $I_{z}$ are the moments of inertia in the body coordinate system. For $l$ UAVs, the above equations are converted to the state-space form as follows:

{\dot{X}}_{A} = A_{A} X_{A} + B_{A} U_{A}

(5)

where $X_{A} = (P_{A}, V_{A}, Ω_{A}, {\dot{Ω}}_{A})^{T}$, $P_{A} = (p_{1}, p_{2}, p_{3}, \dots, p_{l})$, $p_{i} = (x_{i}, y_{i}, z_{i})$; $V_{A} = (v_{1}, v_{2}, v_{3}, \dots, v_{l})$, $v_{i} = (v_{i}^{x}, v_{i}^{y}, v_{i}^{z})$; $Ω_{A} = (Ω_{1}, Ω_{2}, Ω_{3}, \dots, Ω_{l})$, $Ω_{i} = (g θ_{i}, - g ϕ_{i}, 0)$; ${\dot{Ω}}_{A} = ({\dot{Ω}}_{1}, {\dot{Ω}}_{2}, {\dot{Ω}}_{3}, \dots, {\dot{Ω}}_{l})$, ${\dot{Ω}}_{i} = (g {\dot{θ}}_{i}, - g {\dot{ϕ}}_{i}, 0)$; $U_{A} = (u_{1}, u_{2}, u_{3}, \dots, u_{l})^{T}$, $u_{i} = (u_{i}^{x}, u_{i}^{y}, u_{i}^{z})$, for $i = 1, 2, \dots, l$; and

A_{A} = \begin{pmatrix} 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 0 \end{pmatrix} \otimes I_{l}, \quad B_{A} = \begin{pmatrix} 0 \\ 0 \\ 0 \\ 1 \end{pmatrix} \otimes I_{l} .

The subscript A denotes the state variables of the unmanned aerial vehicles.

The expected formation state is $h_{a} = (h_{a}^{x}, h_{a}^{y}, h_{a}^{z})^{T}$. Expressing the formation state in terms of the position state gives the new position error state:

δ_{A} = {(δ_{A}^{x}, δ_{A}^{y}, δ_{A}^{z})}^{T} = {(P_{a i}^{x} - h_{a}^{x}, P_{a i}^{y} - h_{a}^{y}, P_{a i}^{z} - h_{a}^{z})}^{T} .

Therefore, the problem of formation control becomes finding a protocol $U_{A}$ that drives the error vector $δ_{A}$ to zero, which means that

\lim_{t \to \infty} ∥ δ_{a i} - δ_{a j} ∥ = 0, \quad \lim_{t \to \infty} ∥ v_{a i} ∥ = 0, \quad \lim_{t \to \infty} ∥ Ω_{i} ∥ = 0, \quad \lim_{t \to \infty} ∥ {\dot{Ω}}_{i} ∥ = 0 .

(6)

2.4. Heterogeneous Multi-Agent System

To analyze the heterogeneous multi-agent system more conveniently, the UAV system and the UGV system are written in the same state space. Combining the state-space models of the single agents above, the heterogeneous multi-agent state-space model is defined as:

\dot{X} = A X + B U

(7)

where $X = (X_{G}^{T}, X_{A}^{T})^{T}$, $A = \begin{pmatrix} A_{G} & 0 \\ 0 & A_{A} \end{pmatrix}$, $B = \begin{pmatrix} B_{G} & 0 \\ 0 & B_{A} \end{pmatrix}$, and $U = (U_{G}^{T}, U_{A}^{T})^{T}$. The Laplacian matrix is $L = \begin{pmatrix} L_{A A} & L_{A G} \\ L_{G A} & L_{G G} \end{pmatrix}$, where $L_{A G}$ and $L_{G A}$ describe the information exchanged between the heterogeneous agent systems. This paper takes the heterogeneous multi-agent system composed of three UGVs and three UAVs as the research object, and its Laplacian matrix is as follows:

L = \left(\begin{array}{ccc|ccc} - 3 & 1 & 1 & 0 & 0 & 1 \\ 1 & - 3 & 1 & 0 & 1 & 0 \\ 1 & 1 & - 3 & 1 & 0 & 0 \\ \hline 0 & 0 & 1 & - 3 & 1 & 1 \\ 0 & 1 & 0 & 1 & - 3 & 1 \\ 1 & 0 & 0 & 1 & 1 & - 3 \end{array}\right)

where the upper-left, upper-right, lower-left, and lower-right $3 \times 3$ blocks are $L_{A A}$, $L_{A G}$, $L_{G A}$, and $L_{G G}$, respectively.
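A quick sanity check of the six-agent matrix given above can be sketched as follows. The entries are copied from the matrix in the text (agents 1 to 3 are the UAVs, agents 4 to 6 the UGVs); the function name `is_connected` is an assumption for the example. Note that the matrix as printed has a negative diagonal, so it corresponds to $-(D - A)$ in the notation of Section 2.1; the checks below do not depend on that sign.

```python
from collections import deque

# Laplacian of the three-UAV / three-UGV system, as given in the text
L = [
    [-3, 1, 1, 0, 0, 1],
    [1, -3, 1, 0, 1, 0],
    [1, 1, -3, 1, 0, 0],
    [0, 0, 1, -3, 1, 1],
    [0, 1, 0, 1, -3, 1],
    [1, 0, 0, 1, 1, -3],
]
n = len(L)

# Zero row sums: the all-ones vector is an eigenvector for eigenvalue 0
row_sums = [sum(row) for row in L]

# Symmetry: every communication link is bidirectional
symmetric = all(L[i][j] == L[j][i] for i in range(n) for j in range(n))


def is_connected(mat):
    """Breadth-first search over the nonzero off-diagonal entries."""
    seen = {0}
    queue = deque([0])
    while queue:
        i = queue.popleft()
        for j in range(len(mat)):
            if j != i and mat[i][j] != 0 and j not in seen:
                seen.add(j)
                queue.append(j)
    return len(seen) == len(mat)


print(row_sums, symmetric, is_connected(L))
```

A connected undirected topology contains a spanning tree, which is the condition under which the zero eigenvalue is simple.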

The expected formation state is $h = (h_{a}^{T}, h_{g}^{T})^{T}$. Expressing the formation state in terms of the position state gives the new position error state:

δ = {(δ_{A}^{x}, δ_{A}^{y}, δ_{A}^{z}, δ_{G}^{x}, δ_{G}^{y}, δ_{G}^{z})}^{T} = {(P_{a i}^{x} - h_{a}^{x}, P_{a i}^{y} - h_{a}^{y}, P_{a i}^{z} - h_{a}^{z}, P_{g i}^{x} - h_{g}^{x}, P_{g i}^{y} - h_{g}^{y}, P_{g i}^{z} - h_{g}^{z})}^{T} .

Therefore, the problem of formation control becomes finding a protocol $U$ that drives the error vector $δ$ to zero, which means that

\lim_{t \to \infty} ∥ δ_{i} - δ_{j} ∥ = 0, \quad \lim_{t \to \infty} ∥ v_{i} ∥ = 0, \quad \lim_{t \to \infty} ∥ Ω_{i} ∥ = 0, \quad \lim_{t \to \infty} ∥ {\dot{Ω}}_{i} ∥ = 0

(8)

3. Design of Control Protocol

To realize the formation of the heterogeneous multi-agent system of UAVs and UGVs, this section starts from the formation control protocol. Firstly, the optimal control law is derived for a single agent. Then, the optimal control law is combined with the formation control protocol to obtain distributed optimal formation control of the heterogeneous multi-agent system. Finally, according to the motion equation of the heterogeneous multi-agent system, the cooperative formation control and the cooperative optimal formation control of the heterogeneous multi-agent system are realized.

Lemma 1 [22].

For an $N \times N$ Laplacian matrix $L$, the matrix $e^{- L t}$, $t > 0$, is a stochastic matrix with positive diagonal elements. If $L$ has a unique zero eigenvalue, that is, $\mathrm{Rank}(L) = N - 1$, then its left eigenvector $v = [v_{1}, v_{2}, \dots, v_{N}]^{T} \geq 0$ satisfies $1_{N}^{T} v = 1$ and $L^{T} v = 0$, and $e^{- L t} \to 1_{N} v^{T}$ as $t \to \infty$.
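The limit in Lemma 1 can be illustrated numerically. The sketch below integrates $\dot{x} = - L x$, whose solution is $x(t) = e^{- L t} x(0)$, with a simple forward-Euler scheme. The three-agent path graph, the initial state, and the step size are assumptions chosen for the example. For a symmetric Laplacian the normalized left eigenvector is $v = 1_{N} / N$, so $1_{N} v^{T} x(0)$ is the average of the initial values.

```python
def consensus_step(x, lap, dt):
    """One forward-Euler step of x_dot = -L x."""
    n = len(x)
    return [x[i] - dt * sum(lap[i][j] * x[j] for j in range(n))
            for i in range(n)]


# Hypothetical path graph 1 -- 2 -- 3 (L = D - A, symmetric)
L = [[1, -1, 0],
     [-1, 2, -1],
     [0, -1, 1]]

x = [4.0, 0.0, -1.0]          # assumed initial state
average = sum(x) / len(x)     # predicted limit: 1_N v^T x(0) with v = 1/N

for _ in range(5000):         # integrate up to t = 50 with dt = 0.01
    x = consensus_step(x, L, 0.01)

print(x, average)
```

All components approach the same value, which matches the statement that $e^{- L t}$ converges to the rank-one matrix $1_{N} v^{T}$.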

3.1. Formation Control

Formation control protocol for the UAVs:

u_{i a} = α \sum_{j \in N_{i}}^{} (δ_{a j} - δ_{a i}) - β v_{a i} - γ_{1} Ω_{i} - γ_{2} {\dot{Ω}}_{i} .

(9)

Formation control protocol for the UGVs:

u_{i g} = α \sum_{j \in N_{i}}^{} (δ_{g j} - δ_{g i}) - β v_{g i}

(10)

where $α$, $β$, $γ_{1}$, and $γ_{2}$ are positive gain coefficients, and

δ_{a i} = {(δ_{a i}^{x}, δ_{a i}^{y}, δ_{a i}^{z})}^{T}, \quad δ_{g i} = {(δ_{g i}^{x}, δ_{g i}^{y}, δ_{g i}^{z})}^{T}, \quad v_{g i} = {({\dot{P}}_{g i}^{x}, {\dot{P}}_{g i}^{y}, {\dot{P}}_{g i}^{z})}^{T}, \quad v_{a i} = {({\dot{P}}_{a i}^{x}, {\dot{P}}_{a i}^{y}, {\dot{P}}_{a i}^{z})}^{T},

Ω_{i} = {(g θ, - g ϕ, 0)}^{T}, \quad {\dot{Ω}}_{i} = {(g \dot{θ}, - g \dot{ϕ}, 0)}^{T} .

Protocols (9) and (10) are unified into the same form:

U = - L_{d} \cdot \tilde{X}

(11)

Define the state-space form of the multi-agent system formation:

\dot{\tilde{X}} = A \tilde{X} + B U

(12)

where

L_{d} = \begin{pmatrix} α L_{s} \otimes I_{3} & - β I_{l} \otimes I_{3} & - γ_{1} I_{l} \otimes I_{3} & - γ_{2} I_{l} \otimes I_{3} & 0 & 0 \\ 0 & 0 & 0 & 0 & α L_{s} \otimes I_{3} & - β I_{k} \otimes I_{3} \end{pmatrix},

L_{s} = \begin{pmatrix} - 1 & 1 & 0 \\ 0 & - 1 & 0 \\ 1 & 0 & - 1 \end{pmatrix}, \quad \tilde{X} = {(δ_{a i}^{T}, v_{a i}^{T}, Ω^{T}, {\dot{Ω}}^{T}, δ_{g i}^{T}, v_{g i}^{T})}^{T}, \quad A = \begin{pmatrix} A_{A} & 0 \\ 0 & A_{G} \end{pmatrix}, \quad B = \begin{pmatrix} B_{A} & 0 \\ 0 & B_{G} \end{pmatrix},

$I$ is the identity matrix, and $\otimes$ is the Kronecker product.
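To give a feel for how the UGV protocol (10) behaves, the following sketch simulates three double-integrator agents along a single axis with forward-Euler integration. Everything numerical here is an assumption for illustration: the all-to-all neighbor sets, the gains `ALPHA` and `BETA`, the formation offsets `h`, the initial conditions, and the step size are not taken from the paper.

```python
ALPHA, BETA = 1.0, 2.0                          # assumed positive gains
NEIGHBORS = {0: [1, 2], 1: [0, 2], 2: [0, 1]}   # assumed topology
h = [0.0, 2.0, 4.0]      # assumed desired formation offsets (one axis)
p = [5.0, -1.0, 0.5]     # assumed initial positions
v = [0.0, 1.0, -0.5]     # assumed initial velocities
DT, STEPS = 0.005, 6000  # integrate up to t = 30

for _ in range(STEPS):
    # Position error of each agent relative to its formation offset
    delta = [p[i] - h[i] for i in range(3)]
    # Protocol (10): u_i = alpha * sum_j (delta_j - delta_i) - beta * v_i
    u = [ALPHA * sum(delta[j] - delta[i] for j in NEIGHBORS[i]) - BETA * v[i]
         for i in range(3)]
    # Double-integrator dynamics (1), forward Euler
    p = [p[i] + DT * v[i] for i in range(3)]
    v = [v[i] + DT * u[i] for i in range(3)]

delta = [p[i] - h[i] for i in range(3)]
spread = max(delta) - min(delta)        # -> 0 when the formation is reached
max_speed = max(abs(vi) for vi in v)    # -> 0 when the agents come to rest
print(spread, max_speed)
```

When `spread` and `max_speed` are both close to zero, the errors $δ_{g i}$ agree and the velocities vanish, which is the formation objective (3) restricted to one axis.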

Theorem 1.

If the gains in Protocols (9) and (10) satisfy $α > 0$, $β > 0$, $γ_{1} > 0$, $γ_{2} > 0$, $β ≫ α$, $γ_{1} > β$, $γ_{2} > β$, and $β γ_{1} γ_{2} > β^{2} γ_{2} α$, then Systems (2) and (5) can achieve the formation defined in (3) and (6).

Proof of Theorem 1.

Substituting Formula (11) into Formula (12) gives:

\dot{\tilde{X}} = - T_{d} \cdot \tilde{X}

where

T_{d} = \begin{bmatrix} 0 & 1 & 0 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 & 0 \\ α L_{s} \otimes I_{3} & - β I_{l} \otimes I_{3} & - γ_{1} I_{l} \otimes I_{3} & - γ_{2} I_{l} \otimes I_{3} & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 0 & α L_{s} \otimes I_{3} & - β I_{k} \otimes I_{3} \end{bmatrix}

According to linear stability theory, the parameters $α$, $β$, $γ_{1}$, and $γ_{2}$ need to be selected so that $T_{d}$ has a zero eigenvalue and all other eigenvalues have negative real parts. The parameters $α$ and $β$ must ensure the stability of UGV consensus, and the parameters $γ_{1}$ and $γ_{2}$ must ensure the stability of UAV consensus. After selecting the parameters, $T_{d}$ can be converted to Jordan canonical form: $T_{d} = P J P^{- 1}$. Let $v_{1}^{T}$ be the first row of $P^{- 1}$, which is the left eigenvector associated with eigenvalue 0, and let $w_{1}$ be the first column of $P$, which is the right eigenvector associated with eigenvalue 0.

Therefore, $v_{1}^{T} w_{1} = 1$; as time approaches infinity, the system's state becomes:

\lim_{t \to \infty} \tilde{X} = \lim_{t \to \infty} e^{T_{d} t} \tilde{X} (0), \quad e^{T_{d} t} \tilde{X} (0) \to (w_{1} v_{1}^{T}) \tilde{X} (0) \ (t \to \infty) .

According to Lemma 1, as time approaches infinity, Systems (2) and (5) asymptotically reach consensus and the systems complete the formation. □

3.2. Optimal Control

Solving the optimal control problem requires the states of all agents. Before the performance indices are given, $\bar{X}$ and $\bar{W}$ are defined:

\bar{X} = {(δ_{a i}^{x}, δ_{a i}^{y}, δ_{a i}^{z}, {\dot{P}}_{a i}^{x}, {\dot{P}}_{a i}^{y}, {\dot{P}}_{a i}^{z}, g θ, - g ϕ, 0, g \dot{θ}, - g \dot{ϕ}, 0)}^{T}, \quad \bar{W} = {(δ_{g i}^{x}, δ_{g i}^{y}, δ_{g i}^{z}, {\dot{P}}_{g i}^{x}, {\dot{P}}_{g i}^{y}, {\dot{P}}_{g i}^{z})}^{T}

Then, the performance index functions are defined as:

J_{i} = \int_{0}^{\infty} [{\bar{X}}_{i}^{T} Q {\bar{X}}_{i} + u_{i a}^{T} R u_{i a}] d t

(13)

ω_{i} = \int_{0}^{\infty} [{\bar{W}}_{i}^{T} T {\bar{W}}_{i} + u_{i g}^{T} γ u_{i g}] d t

(14)

As the UAV and the UGV are described independently in different coordinate systems, the UAV weights are set to $Q = q I_{12}$ and $R = r I_{3}$, where $q > 0$ and $r > 0$. The UGV weights are set to $T = λ I_{6}$ and $γ = μ I_{3}$, where $λ > 0$ and $μ > 0$.

According to optimal control theory, the optimal control law of a single UAV agent is $u_{a}^{*} = - R^{- 1} B_{A}^{T} P_{A} \bar{X}$, where $P_{A}$ is the solution of the Riccati equation (15):

A_{A}^{T} P_{A} + P_{A} A_{A} - P_{A} B_{A} R^{- 1} B_{A}^{T} P_{A} + Q = 0

(15)

The optimal control law of a single UGV agent is $u_{g}^{*} = - γ^{- 1} B_{G}^{T} P_{G} \bar{W}$, where $P_{G}$ is the solution of the Riccati equation (16):

A_{G}^{T} P_{G} + P_{G} A_{G} - P_{G} B_{G} γ^{- 1} B_{G}^{T} P_{G} + T = 0

(16)

From the above calculation, the optimal control law $u_{a}^{*}$ can be obtained. Let $K = R^{- 1} B_{A}^{T} P_{A}$; the dimension of the matrix $K$ is $3 \times 12$, and $K$ can be expressed as $K = [k_{1}, k_{2}, k_{3}, k_{4}] \otimes I_{3}$.

Similarly, the optimal control law $u_{g}^{*}$ can also be solved. Let $G = γ^{- 1} B_{G}^{T} P_{G}$; the dimension of the matrix $G$ is $3 \times 6$, and $G$ can be expressed as $G = [g_{1}, g_{2}] \otimes I_{3}$.
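For the UGV, the Riccati equation (16) can be worked out by hand along one axis. Since $G = [g_{1}, g_{2}] \otimes I_{3}$, each axis reduces to a scalar double integrator with $A = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}$, $B = \begin{pmatrix} 0 \\ 1 \end{pmatrix}$, state weight $λ I_{2}$, and input weight $μ$. Writing $P = \begin{pmatrix} p_{11} & p_{12} \\ p_{12} & p_{22} \end{pmatrix}$ and equating the entries of (16) to zero gives $p_{12} = \sqrt{λ μ}$, $p_{22} = \sqrt{μ (λ + 2 p_{12})}$, and $p_{11} = p_{12} p_{22} / μ$. The sketch below evaluates these expressions and checks the residual; the function names and the sample weights are assumptions for the example.

```python
import math


def double_integrator_lqr(lam, mu):
    """Closed-form solution of (16) for one axis of the double integrator."""
    p12 = math.sqrt(lam * mu)
    p22 = math.sqrt(mu * (lam + 2.0 * p12))
    p11 = p12 * p22 / mu
    P = [[p11, p12], [p12, p22]]
    # G = mu^{-1} B^T P = [p12 / mu, p22 / mu] = [g1, g2]
    return P, (p12 / mu, p22 / mu)


def riccati_residual(P, lam, mu):
    """Entries of A^T P + P A - P B mu^{-1} B^T P + lam * I for this A, B."""
    (p11, p12), (_, p22) = P
    r11 = lam - p12 * p12 / mu
    r12 = p11 - p12 * p22 / mu
    r22 = 2.0 * p12 + lam - p22 * p22 / mu
    return [[r11, r12], [r12, r22]]


# Hypothetical weights lam = mu = 1
P, (g1, g2) = double_integrator_lqr(1.0, 1.0)
residual = riccati_residual(P, 1.0, 1.0)
print(g1, g2, residual)
```

With these sample weights the gains come out as $g_{1} = 1$ and $g_{2} = \sqrt{3}$. The UAV gains $k_{1}, \dots, k_{4}$ follow from (15) in the same way, but the fourth-order chain generally calls for a numerical Riccati solver rather than a closed form.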

3.3. Distributed Optimal Formation Control

All UAVs have the same dynamics model, so all UAVs are homogeneous multi-agents. Similarly, all UGVs are homogeneous multi-agents. Therefore, optimal control laws can be extended to the formation control of UAV and UGV multi-agent systems.

u_{i a}^{*} = k_{1} \sum_{j \in N_{i}}^{} (δ_{a j} - δ_{a i}) - k_{2} v_{a i} - k_{3} Ω_{i} - k_{4} {\dot{Ω}}_{i}

(17)

u_{i g}^{*} = g_{1} \sum_{j \in N_{i}}^{} (δ_{g j} - δ_{g i}) - g_{2} v_{g i}

(18)

Define the multi-agent system to be optimized:

\dot{\tilde{X}} = A \tilde{X} + B U

(19)

where $\tilde{X} = (δ_{a i}^{T}, v_{a i}^{T}, Ω^{T}, {\dot{Ω}}^{T}, δ_{g i}^{T}, v_{g i}^{T})^{T}$, $A = \begin{pmatrix} A_{A} & 0 \\ 0 & A_{G} \end{pmatrix}$, and $B = \begin{pmatrix} B_{A} & 0 \\ 0 & B_{G} \end{pmatrix}$; $k_{1}$, $k_{2}$, $k_{3}$, and $k_{4}$ are derived from the matrix $K$, and $g_{1}$ and $g_{2}$ are derived from the matrix $G$.

Theorem 2.

If the unmanned aerial vehicle system in (5) and the unmanned ground vehicle system in (2) use Protocols (17) and (18), respectively, the formation can be completed, and Performance Functions (13) and (14) can be optimized.

Proof of Theorem 2.

Protocols (17) and (18) shall be unified into the same type:

U = - L_{l} \cdot \tilde{X}

(20)

where

L_{l} = \begin{pmatrix} k_{1} L_{s} \otimes I_{3} & - k_{2} I_{l} \otimes I_{3} & - k_{3} I_{l} \otimes I_{3} & - k_{4} I_{l} \otimes I_{3} & 0 & 0 \\ 0 & 0 & 0 & 0 & g_{1} L_{s} \otimes I_{3} & - g_{2} I_{k} \otimes I_{3} \end{pmatrix},

L_{s} = \begin{pmatrix} - 1 & 1 & 0 \\ 0 & - 1 & 0 \\ 1 & 0 & - 1 \end{pmatrix}, \quad \tilde{X} = {(δ_{a i}^{T}, v_{a i}^{T}, Ω^{T}, {\dot{Ω}}^{T}, δ_{g i}^{T}, v_{g i}^{T})}^{T},

$I$ is the identity matrix, and $\otimes$ is the Kronecker product.

Let

U = \begin{pmatrix} U_{11} & U_{12} \\ U_{21} & U_{22} \end{pmatrix} = \begin{pmatrix} k_{1} L_{s} & - k_{2} I_{l} & - k_{3} I_{l} & - k_{4} I_{l} & 0 & 0 \\ 0 & 0 & 0 & 0 & g_{1} L_{s} & - g_{2} I_{k} \end{pmatrix}

Substituting $U = \begin{pmatrix} U_{11} & U_{12} \\ U_{21} & U_{22} \end{pmatrix} \cdot \tilde{X}$ into Equation (19) gives:

\dot{\tilde{X}} = - T_{l} \cdot \tilde{X}

where

T_{l} = \begin{bmatrix} 0 & I & 0 & 0 & 0 & 0 \\ 0 & 0 & I & 0 & 0 & 0 \\ 0 & 0 & 0 & I & 0 & 0 \\ k_{1} L_{s} & - k_{2} I & - k_{3} I & - k_{4} I & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & I \\ 0 & 0 & 0 & 0 & g_{1} L_{s} & - g_{2} I \end{bmatrix}

Elementary row and column transformations can be applied to $T_{l}$:

T_{l} = \begin{bmatrix} I & 0 & 0 & 0 & 0 & 0 \\ 0 & I & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & I & 0 & 0 \\ 0 & 0 & 0 & 0 & I & 0 \\ 0 & 0 & k_{1} L_{s} & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & g_{1} L_{s} \end{bmatrix} = \begin{pmatrix} I_{1} & 0 & 0 \\ 0 & 0 & I_{2} \\ 0 & k_{1} L & g_{1} L \end{pmatrix} = E

If $\mathrm{Rank}(L) = N - 1$ and $\mathrm{Rank}\left(\begin{bmatrix} I_{1} \\ I_{2} \end{bmatrix}\right) = r$, it can be seen that $\mathrm{Rank}(T_{l}) = \mathrm{Rank}(E) = r + N - 1$.

Hence, $T_{l}$ has a zero eigenvalue. Therefore, the parameters $k_{1}$, $k_{2}$, $k_{3}$, $k_{4}$, $g_{1}$, and $g_{2}$ must be selected so that $T_{l}$ has a single zero eigenvalue and all other eigenvalues have negative real parts. The parameters $k_{1}$, $k_{2}$, $k_{3}$, and $k_{4}$ must ensure the stability of UAV consensus, and the parameters $g_{1}$ and $g_{2}$ must ensure the stability of UGV consensus. After determining the parameters, $T_{l}$ can be converted to Jordan canonical form: $T_{l} = P J P^{- 1}$. Let $v_{1}^{T}$ be the first row of $P^{- 1}$, which is the left eigenvector associated with eigenvalue 0, and let $w_{1}$ be the first column of $P$, which is the right eigenvector associated with eigenvalue 0.

Therefore, $v_{1}^{T} w_{1} = 1$; as time approaches infinity, the system's state becomes:

\lim_{t \to \infty} \tilde{X} = \lim_{t \to \infty} e^{T_{l} t} \tilde{X} (0), \quad e^{T_{l} t} \tilde{X} (0) \to (w_{1} v_{1}^{T}) \tilde{X} (0) \ (t \to \infty) .

According to Lemma 1, the system therefore reaches asymptotic consensus as time tends toward infinity. □

3.4. Heterogeneous Cooperative Formation Control

For UAV:

{\tilde{u}}_{i a} = α \sum_{j \in N_{i}}^{} a_{i j} (δ_{j} - δ_{i}) - β a_{i j} (v_{j} - v_{i}) - γ_{1} Ω_{i} - γ_{2} {\dot{Ω}}_{i}

(21)

For UGV:

{\tilde{u}}_{i g} = α \sum_{j \in N_{i}}^{} a_{i j} (δ_{j} - δ_{i}) - β a_{i j} (v_{j} - v_{i})

(22)

δ_{i} = {(δ_{i a}^{T}, δ_{i g}^{T})}^{T}, v_{i} = {(v_{i a}^{T}, v_{i g}^{T})}^{T}, Ω_{i} = {(g θ, - g ϕ, 0)}^{T}, \dot{Ω} = {(g \dot{θ}, - g \dot{ϕ}, 0)}^{T}

In combination with the Laplace matrix, Protocols (21) and (22) are rewritten as:

\tilde{U} = H \cdot \tilde{X}

(23)

H = \begin{pmatrix} α L_{A A} \otimes I_{3} & - β L_{A A} \otimes I_{3} & - γ_{1} I_{l} \otimes I_{3} & - γ_{2} I_{l} \otimes I_{3} & α L_{A G} \otimes I_{3} & - β L_{A G} \otimes I_{3} \\ α L_{G A} \otimes I_{3} & - β L_{G A} \otimes I_{3} & 0 & 0 & α L_{G G} \otimes I_{3} & - β L_{G G} \otimes I_{3} \end{pmatrix}

Define the state-space form of the heterogeneous multi-agent formation:

\dot{\tilde{X}} = A \tilde{X} + B \tilde{U}

(24)

where

A = (\begin{array}{l} A_{A} & 0 \\ 0 & A_{G} \end{array}), B = (\begin{matrix} B_{A} & 0 \\ 0 & B_{G} \end{matrix}), \tilde{X} = {({\bar{X}}^{T}, {\bar{W}}^{T})}^{T} .

Theorem 3.

If the gains in Protocols (21) and (22) satisfy $α > 0$, $β > 0$, $γ_{1} > 0$, $γ_{2} > 0$, $β ≫ α$, $γ_{1} > β$, $γ_{2} > β$, and $β γ_{1} γ_{2} > β^{2} γ_{2} α$, then the heterogeneous system in (7) can achieve the formation defined in (8), and cooperative formation control is realized.

Proof of Theorem 3.

Substitute Equation (23) into Equation (24) and obtain:

\dot{\tilde{X}} = - T_{S} \cdot \tilde{X}

where

T_{s} = [\begin{matrix} 0 & 1 & 0 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 & 0 \\ α L_{A A} \otimes I_{3} & - β L_{A A} \otimes I_{3} & - γ_{1} I_{l} \otimes I_{3} & - γ_{2} I_{l} \otimes I_{3} & α L_{A G} \otimes I_{3} & - β L_{A G} \otimes I_{3} \\ 0 & 0 & 0 & 0 & 0 & 1 \\ α L_{G A} \otimes I_{3} & - β L_{G A} \otimes I_{3} & 0 & 0 & α L_{G G} \otimes I_{3} & - β L_{G G} \otimes I_{3} \end{matrix}],

I

is the identity matrix and

\otimes

is the Kronecker product.

The internal stability parameters $γ_{1}$ and $γ_{2}$ should guarantee the stability of a single UAV agent. Therefore, the UAV dynamics are written as:

\begin{pmatrix} {\dot{P}}_{A} \\ {\dot{V}}_{A} \\ {\dot{Ω}}_{A} \\ {\ddot{Ω}}_{A} \end{pmatrix} = Γ \begin{pmatrix} P_{A} \\ V_{A} \\ Ω_{A} \\ {\dot{Ω}}_{A} \end{pmatrix}, \quad \text{where} \quad Γ = \begin{pmatrix} 0 & I & 0 & 0 \\ 0 & 0 & I & 0 \\ 0 & 0 & 0 & I \\ - α I & - β I & - γ_{1} I & - γ_{2} I \end{pmatrix} .

The characteristic polynomial is:

\det (s I - Γ) = \begin{vmatrix} s I & - I & 0 & 0 \\ 0 & s I & - I & 0 \\ 0 & 0 & s I & - I \\ α I & β I & γ_{1} I & s I + γ_{2} I \end{vmatrix}

where $I$ is the identity matrix. For a single UAV:

| s I - Γ | = s^{4} + γ_{2} s^{3} + γ_{1} s^{2} + β s + α

According to the Routh–Hurwitz stability criterion, all roots of this polynomial have negative real parts if and only if:

α > 0, \quad β > 0, \quad γ_{1} > 0, \quad γ_{2} > 0, \quad γ_{1} γ_{2} > β, \quad γ_{1} γ_{2} β > β^{2} + γ_{2}^{2} α
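The Routh–Hurwitz test for the quartic $s^{4} + γ_{2} s^{3} + γ_{1} s^{2} + β s + α$ is easy to automate. The sketch below forms the first column of the Routh array, $(1, γ_{2}, b_{1}, c_{1}, α)$, and requires every entry to be positive, which is the standard criterion for a monic fourth-order polynomial. The function name and the sample coefficient sets are assumptions for the example.

```python
def quartic_is_hurwitz(gamma2, gamma1, beta, alpha):
    """True if s^4 + gamma2 s^3 + gamma1 s^2 + beta s + alpha is Hurwitz."""
    # Necessary condition: all coefficients strictly positive
    if min(gamma2, gamma1, beta, alpha) <= 0:
        return False
    # Third entry of the first Routh column
    b1 = (gamma2 * gamma1 - beta) / gamma2
    if b1 <= 0:
        return False
    # Fourth entry of the first Routh column
    c1 = (b1 * beta - gamma2 * alpha) / b1
    return c1 > 0


# (s + 1)^4 = s^4 + 4 s^3 + 6 s^2 + 4 s + 1: all roots at s = -1
stable = quartic_is_hurwitz(4.0, 6.0, 4.0, 1.0)

# s^4 + s^3 + s^2 + s + 1: roots on the unit circle, two in the right half-plane
unstable = quartic_is_hurwitz(1.0, 1.0, 1.0, 1.0)

# (s^2 + 1)(s^2 + 2 s + 2): a pair of roots on the imaginary axis
marginal = quartic_is_hurwitz(2.0, 3.0, 2.0, 2.0)

print(stable, unstable, marginal)
```

Requiring $b_{1} > 0$ is equivalent to $γ_{1} γ_{2} > β$, and requiring $c_{1} > 0$ is equivalent to $γ_{1} γ_{2} β > β^{2} + γ_{2}^{2} α$, so a candidate gain set can be screened before any simulation is run.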

According to linear stability theory, the parameters $α$, $β$, $γ_{1}$, and $γ_{2}$ should be selected so that $T_{s}$ has a zero eigenvalue and all other eigenvalues have negative real parts. The parameters $α$ and $β$ must ensure the stability of UGV consensus, and the parameters $γ_{1}$ and $γ_{2}$ must ensure the stability of UAV consensus. After selecting the parameters, $T_{s}$ can be converted to Jordan canonical form: $T_{s} = P J P^{- 1}$. Let $v_{1}^{T}$ be the first row of $P^{- 1}$, which is the left eigenvector associated with eigenvalue 0, and let $w_{1}$ be the first column of $P$, which is the right eigenvector associated with eigenvalue 0.

Therefore, $v_{1}^{T} w_{1} = 1$; as time approaches infinity, the system's state becomes:

\lim_{t \to \infty} \tilde{X} = \lim_{t \to \infty} e^{T_{s} t} \tilde{X} (0), \quad e^{T_{s} t} \tilde{X} (0) \to (w_{1} v_{1}^{T}) \tilde{X} (0) \ (t \to \infty) .

According to Lemma 1, as time approaches infinity, the system in (7) asymptotically reaches consensus; that is, the system achieves cooperative formation. □

3.5. Heterogeneous Cooperative Optimal Formation Control

UAV systems and UGV systems are heterogeneous systems. Applying the optimal control law to the heterogeneous system can be expressed as:

{\tilde{u}}_{i a} = k_{1} \sum_{j \in N_{i}}^{} a_{i j} (δ_{j} - δ_{i}) - k_{2} a_{i j} (v_{j} - v_{i}) - k_{3} Ω_{i} - k_{4} {\dot{Ω}}_{i}

(25)

{\tilde{u}}_{i g} = g_{1} \sum_{j \in N_{i}}^{} a_{i j} (δ_{j} - δ_{i}) - g_{2} a_{i j} (v_{j} - v_{i})

(26)

where $k_{1}$, $k_{2}$, $k_{3}$, and $k_{4}$ are derived from the matrix $K$, and $g_{1}$ and $g_{2}$ are derived from the matrix $G$.

In combination with the Laplacian matrix, Protocols (25) and (26) are rewritten as:

\tilde{U} = S \cdot \tilde{X}

(27)

S = \begin{pmatrix} k_{1} L_{A A} \otimes I_{3} & - k_{2} L_{A A} \otimes I_{3} & - k_{3} I_{l} \otimes I_{3} & - k_{4} I_{l} \otimes I_{3} & k_{1} L_{A G} \otimes I_{3} & - k_{2} L_{A G} \otimes I_{3} \\ g_{1} L_{G A} \otimes I_{3} & - g_{2} L_{G A} \otimes I_{3} & 0 & 0 & g_{1} L_{G G} \otimes I_{3} & - g_{2} L_{G G} \otimes I_{3} \end{pmatrix}

Define the state space of heterogeneous multi-agent formation:

\dot{\tilde{X}} = A \tilde{X} + B \tilde{U}

(28)

where

A = (\begin{matrix} A_{A} & 0 \\ 0 & A_{G} \end{matrix}), B = (\begin{matrix} B_{A} & 0 \\ 0 & B_{G} \end{matrix}), \tilde{X} = {({\bar{X}}^{T}, {\bar{W}}^{T})}^{T}

Theorem 4.

Heterogeneous systems can complete formation and optimize Performance Functions (13) and (14) if they use Protocols (25) and (26), respectively.

Proof of Theorem 4.

Each UAV and UGV pair is defined as a formation unit, so the numbers of UAVs and UGVs are made equal, that is, $l = k$; each block of the system's block Laplacian matrix then has the same number of rows and columns.

Then, the heterogeneous system in (7) becomes:

\dot{X} = \hat{A} X + \hat{B} U

(29)

where

\hat{A} = (\begin{matrix} 0 & I & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & I & 0 & 0 \\ 0 & 0 & 0 & 0 & I & 0 \\ 0 & 0 & 0 & 0 & 0 & I \\ 0 & 0 & 0 & 0 & 0 & 0 \end{matrix})

,

\hat{B} = (\begin{matrix} 0 - - - - 0 \\ I - - - - 0 \\ 0 - - - - 0 \\ 0 - - - - 0 \\ 0 - - - - 0 \\ 0 - - - - I \end{matrix})

.

According to State Equation (29):

(\begin{matrix} S I - \hat{A} & \hat{B} \end{matrix}) = (\begin{matrix} S & - I & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & S & 0 & 0 & 0 & 0 & I & 0 \\ 0 & 0 & S & - I & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & S & - I & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & S & - I & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & S & 0 & I \end{matrix})

It is easy to see that the eigenvalues of the matrix \hat{A} are λ_{1} = λ_{2} = \dots = λ_{6} = 0.

The rank of the matrix (\begin{matrix} S I - \hat{A} & \hat{B} \end{matrix}) is verified at these eigenvalues. Let S = λ_{1} = λ_{2} = \dots = λ_{6} = 0:

Rank (\begin{matrix} S I - \hat{A} & \hat{B} \end{matrix}) = Rank (\begin{matrix} 0 & - I & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & I & 0 \\ 0 & 0 & 0 & - I & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & - I & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & - I & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & I \end{matrix}) = 6

According to the PBH rank criterion, the system is controllable.
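The PBH rank test above can be reproduced numerically. The following Python sketch is an illustration only: it replaces every identity block I in \hat{A} and \hat{B} by the scalar 1 (an assumption made to keep the example small), and `rank` and `pbh_matrix` are helper names introduced here rather than anything defined in the paper.

```python
from fractions import Fraction


def rank(matrix):
    """Rank of a rational matrix via Gaussian elimination in exact arithmetic."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    n_cols = len(rows[0]) if rows else 0
    r = 0  # number of pivots found so far
    for col in range(n_cols):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][col] / rows[r][col]
            rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
        if r == len(rows):
            break
    return r


def pbh_matrix(s, a, b):
    """Build the PBH matrix [s*I - A, B] row by row."""
    n = len(a)
    return [
        [(s if i == j else 0) - a[i][j] for j in range(n)] + list(b[i])
        for i in range(n)
    ]


# Scalar stand-ins for the block matrices of Equation (29) (I -> 1).
A_hat = [
    [0, 1, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
    [0, 0, 0, 1, 0, 0],
    [0, 0, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 1],
    [0, 0, 0, 0, 0, 0],
]
B_hat = [[0, 0], [1, 0], [0, 0], [0, 0], [0, 0], [0, 1]]

# All eigenvalues of A_hat are zero, so the test is evaluated at s = 0 only.
pbh_rank = rank(pbh_matrix(0, A_hat, B_hat))
print(pbh_rank)  # 6
```

Because \hat{A} is strictly upper triangular, zero is its only eigenvalue, so a full row rank of 6 at that single point is sufficient for the PBH criterion.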

Let

\tilde{U} = (\begin{matrix} U_{11} & U_{12} \\ U_{21} & U_{22} \end{matrix})

=

(\begin{matrix} k_{1} L_{A A} & - k_{2} L_{A A} & - k_{3} I & - k_{4} I & k_{1} L_{A G} & - k_{2} L_{A G} \\ g_{1} L_{G A} & - g_{2} L_{G A} & 0 & 0 & g_{1} L_{G G} & - g_{2} L_{G G} \end{matrix})

Substituting \tilde{U} = (\begin{matrix} U_{11} & U_{12} \\ U_{21} & U_{22} \end{matrix}) \cdot \tilde{X} into Equation (28) gives:

\dot{\tilde{X}} = - T^{*} \cdot \tilde{X}

where

T^{*} = [\begin{matrix} 0 & I & 0 & 0 & 0 & 0 \\ 0 & 0 & I & 0 & 0 & 0 \\ 0 & 0 & 0 & I & 0 & 0 \\ k_{1} L_{A A} & - k_{2} L_{A A} & - k_{3} I & - k_{4} I & g_{1} L_{A G} & - g_{2} L_{A G} \\ 0 & 0 & 0 & 0 & 0 & I \\ k_{1} L_{G A} & - k_{2} L_{G A} & 0 & 0 & g_{1} L_{G G} & - g_{2} L_{G G} \end{matrix}]

Applying elementary transformations to the characteristic matrix λ I - T^{*} gives:

(\begin{matrix} λ + I & - I & 0 & 0 & 0 & 0 \\ I & λ & - I & 0 & 0 & 0 \\ 2 I & 0 & λ - I & - I & 0 & 0 \\ C_{1} & C_{2} & k_{3} I & λ + k_{4} I & - g_{1} L_{A G} & g_{2} L_{A G} \\ I & 0 & 0 & 0 & λ & - I \\ C_{3} & k_{2} L_{G A} - I & 0 & 0 & - g_{1} L_{G G} & λ + g_{2} L_{G G} \end{matrix}) = λ I - Λ,

where

Λ = (\begin{matrix} - I & I & 0 & 0 & 0 & 0 \\ - I & 0 & I & 0 & 0 & 0 \\ - 2 I & 0 & I & I & 0 & 0 \\ - C_{1} & - C_{2} & - k_{3} I & - k_{4} I & g_{1} L_{A G} & - g_{2} L_{A G} \\ - I & 0 & 0 & 0 & 0 & I \\ - C_{3} & I - k_{2} L_{G A} & 0 & 0 & g_{1} L_{G G} & - g_{2} L_{G G} \end{matrix})

C_{1} = I - k_{1} L_{A A} - g_{2} L_{A G} - k_{2} L_{A A} - k_{4} I

C_{2} = k_{2} L_{A A} - k_{3} I - I

C_{3} = I - k_{1} L_{G A} - g_{2} L_{G G}

Now λ I - T^{*} ≅ λ I - Λ, so the matrix T^{*} is similar to Λ. Therefore, there is a nonsingular transformation matrix Q such that Λ = Q T^{*} Q^{- 1} and \dot{\tilde{X}} = Λ \cdot \tilde{X}. Each row of Λ sums to zero, so Λ has at least one zero eigenvalue. Elementary row and column transformations can be applied to T^{*}:

T^{*} ≅ (\begin{matrix} I & 0 & 0 & 0 & 0 & 0 \\ 0 & I & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & I & 0 \\ 0 & 0 & 0 & 0 & 0 & I \\ 0 & 0 & k_{1} L_{A A} & - k_{2} L_{A A} & g_{1} L_{A G} & - g_{2} L_{A G} \\ 0 & 0 & k_{1} L_{G A} & - k_{2} L_{G A} & g_{1} L_{G G} & - g_{2} L_{G G} \end{matrix}) = (\begin{matrix} I_{1} & 0 & 0 \\ 0 & 0 & I_{2} \\ 0 & k L & g L \end{matrix}) = E

If Rank (L) = N - 1 and Rank ([\begin{matrix} I_{1} \\ I_{2} \end{matrix}]) = r, it can be seen that:

Rank (T^{*}) = Rank (Λ) = Rank (E) = r + N - 1

so Λ and T^{*} have only one zero eigenvalue. Therefore, the parameters k_{1}, k_{2}, k_{3}, k_{4}, g_{1}, and g_{2} must be selected so that T^{*} has a zero eigenvalue and all other eigenvalues have negative real parts. The parameters k_{1}, k_{2}, k_{3}, and k_{4} must ensure the stability of UAV consensus, and the parameters g_{1} and g_{2} must ensure the stability of UGV consensus. After the parameters are determined, T^{*} can be converted to the Jordan canonical form T^{*} = P J P^{- 1}. Let v_{1}^{T} be the first row of P^{- 1}, which is a left eigenvector of T^{*} with eigenvalue 0, and let w_{1} be the first column of P, which is a right eigenvector of T^{*} with eigenvalue 0. Therefore, v_{1}^{T} w_{1} = 1; as time approaches infinity, the state of the system becomes:

\lim_{t \to \infty} \tilde{X} = \lim_{t \to \infty} e^{T^{*} t} \tilde{X} (0), e^{T^{*} t} \tilde{X} (0) \to (w_{1} v_{1}^{T}) \tilde{X} (0) (t \to \infty)

According to Lemma 1, the system therefore reaches asymptotic consensus as time tends to infinity.□
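The limit e^{T^{*} t} \tilde{X} (0) \to (w_{1} v_{1}^{T}) \tilde{X} (0) can be illustrated on a much simpler system. The Python sketch below is not the sixth-order model of the paper: it assumes first-order dynamics \dot{x} = - L x on a hypothetical, fully connected three-agent graph with the standard Laplacian L = D - A, so that - L plays the role of T^{*}. For this symmetric case, w_{1} is the all-ones vector and v_{1} = w_{1} / 3; the initial values are the x-coordinates of UAV1 to UAV3 from Section 4, used purely as sample numbers.

```python
def simulate_consensus(laplacian, x0, dt=0.01, steps=2000):
    """Forward-Euler integration of x' = -L x, returning the final state."""
    x = list(x0)
    n = len(x)
    for _ in range(steps):
        lx = [sum(laplacian[i][j] * x[j] for j in range(n)) for i in range(n)]
        x = [x[i] - dt * lx[i] for i in range(n)]
    return x


# Hypothetical undirected, fully connected graph of three agents (assumption).
L = [
    [2, -1, -1],
    [-1, 2, -1],
    [-1, -1, 2],
]
x0 = [30.0, 90.0, 60.0]

# Right and left eigenvectors of -L for eigenvalue 0, scaled so v1^T w1 = 1.
w1 = [1.0, 1.0, 1.0]
v1 = [1.0 / 3.0, 1.0 / 3.0, 1.0 / 3.0]

# Predicted limit (w1 v1^T) x(0): every agent converges to the weighted average.
weighted_average = sum(v * x for v, x in zip(v1, x0))
predicted = [w * weighted_average for w in w1]

final = simulate_consensus(L, x0)
print(predicted)
print(final)
```

The simulated final state agrees with the prediction w_{1} v_{1}^{T} x (0) to numerical precision, since the two non-zero eigenvalues of - L are negative and their modes decay.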

4. Simulations

The formation protocol, distributed optimal formation protocol, cooperative formation protocol, and cooperative optimal formation protocol designed in this paper are each simulated and analyzed using MATLAB R2016a. The effectiveness of the designed control protocols is verified via simulation. To achieve a better cooperative formation effect, the set speeds of the UAV system and the UGV system are chosen to be similar.

The initial state is as follows. The positions of UAV1, UAV2, and UAV3 are (30, 50, 50) m, (90, 30, 50) m, and (60, 30, 15) m, respectively, and their speeds are (1, 1, -2) m/s, (1, -2, 1) m/s, and (-2, 1, 1) m/s, respectively. The attitude angle of each UAV is {(0, 0, 0)}^{\circ}, and the rate of the attitude angle of each UAV is {(0, 0, 0)}^{\circ}/s. The setting positions of UAV1, UAV2, and UAV3 are (10, 10, 30) m, (0, 5, 30) m, and (0, 15, 30) m, respectively. The UAV and UGV systems have a set speed of (1, 0, 0) m/s. The positions of UGV1, UGV2, and UGV3 are (90, 50, 0) m, (65, 10, 0) m, and (30, 20, 0) m, respectively, and their speeds are (1, 1, 0) m/s, (2, -2, 0) m/s, and (-1, 1, 0) m/s, respectively. The setting positions of UGV1, UGV2, and UGV3 are (10, 10, 0) m, (0, 5, 0) m, and (0, 15, 0) m, respectively.

The parameters are

α = 0.2, β = 1.5, γ_{1} = 5, γ_{2} = 2, k_{1} = 2.3452, k_{2} = 6.4707, k_{3} = 7.7541, k_{4} = 4.5835, g_{1} = 0.4472, and g_{2} = 1.0461

.
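As a quick sanity check on the formation geometry, the setting positions listed above can be stored in a small data structure and inspected. The following Python sketch is an illustration only; the dictionary names and the helper `side_lengths` are introduced here and do not come from the authors' simulation code.

```python
from itertools import combinations
from math import dist

# Setting (desired formation) positions in metres, as listed in the text.
UAV_SET = {"UAV1": (10, 10, 30), "UAV2": (0, 5, 30), "UAV3": (0, 15, 30)}
UGV_SET = {"UGV1": (10, 10, 0), "UGV2": (0, 5, 0), "UGV3": (0, 15, 0)}


def side_lengths(points):
    """Pairwise Euclidean distances between the named points."""
    return {
        (a, b): dist(points[a], points[b])
        for a, b in combinations(sorted(points), 2)
    }


uav_sides = side_lengths(UAV_SET)
ugv_sides = side_lengths(UGV_SET)

# True when each UGV set point lies directly below the matching UAV set point.
aligned = all(
    UAV_SET[f"UAV{i}"][:2] == UGV_SET[f"UGV{i}"][:2] for i in (1, 2, 3)
)

for pair, length in uav_sides.items():
    print(pair, round(length, 3))
print(aligned)
```

With these values, both desired formations are isosceles triangles with two sides of about 11.18 m and one side of 10 m, and the UGV triangle is the ground projection of the UAV triangle 30 m above it.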

Simulations of the UAV system and the UGV system using formation control and distributed optimal formation control are shown in Figure 1 and Figure 2, respectively. Comparing Figure 1 with Figure 2 shows that both protocols can complete the triangular formation at the same time; with distributed optimal formation control, however, the error between the actual position and the set value is significantly reduced in a short period of time.

When the formation control protocol is used with parameters α = 0.2, β = 1.5, γ_{1} = 5, and γ_{2} = 2, Figure 1 shows the change in the position coordinates of the multi-agent system and the actual formation positions of the UAVs and UGVs at t = 10 s. There is a significant error relative to the set value.

When the distributed optimal formation control protocol is used with parameters k_{1} = 2.3452, k_{2} = 6.4707, k_{3} = 7.7541, k_{4} = 4.5835, g_{1} = 0.4472, and g_{2} = 1.0461, Figure 2 shows the change in the position coordinates of the system and the actual formation status of the UAVs and UGVs at t = 10 s. The system can quickly complete the triangular formation, and the error between the actual position and the set value is small.

Figure 3 and Figure 4 show how the state of each system variable changes with time when the formation control protocol is used. The state of each variable in the system becomes stable between 40 s and 50 s.

Figure 5 and Figure 6 show how the state of each variable changes with time when the system uses the distributed optimal formation control protocol. The system reaches stability between 30 s and 40 s.

The simulations of the cooperative formation protocol and the cooperative optimal formation protocol for the heterogeneous multi-agent system are shown in Figure 7 and Figure 8, respectively.

When the cooperative formation control protocol is used with parameters α = 0.2, β = 1.5, γ_{1} = 5, and γ_{2} = 2, Figure 7 shows the change in the position coordinates of the system and the expected formation state of the system at t = 10 s. The actual formation position of the system has a significant error relative to the set value.

When the cooperative optimal formation control protocol is used with parameters k_{1} = 2.3452, k_{2} = 6.4707, k_{3} = 7.7541, k_{4} = 4.5835, g_{1} = 0.4472, and g_{2} = 1.0461, Figure 8 shows the change in the position coordinates of the system and the expected formation state of the system at t = 10 s. The system can quickly complete the triangular formation, and the error between the actual position and the set value is small. In addition, the cooperative optimal formation protocol speeds up the convergence of the system, which shortens its formation time.

When using the formation control and the distributed optimal formation control, the formation states of the UAV system at different times are shown in Figure 9 and Figure 10.

Figure 9 shows the formation status of the UAV system at different times under the formation control protocol. The system can complete the triangle formation, but there is an error relative to the set position during the formation process, and an error still remains at the fiftieth second.

Figure 10 shows the formation status of the UAV system at different times under the use of the distributed optimal formation control protocol. It can be observed that the system can quickly complete the triangle formation and that the error between the actual position and the set value is very small at the thirtieth second.

When using the formation control and the distributed optimal formation control, the formation states of the UGV system at different times are shown in Figure 11 and Figure 12.

Figure 11 shows the formation status of the UGV system at different moments when the formation control protocol is used. It can be observed that the UGV system can complete the triangle formation, but there is still a certain error with the set value in the formation completion process. At the thirtieth second, the difference between the actual position and the set value gradually decreases.

Figure 12 shows the change in the formation shape of the UGV system at different moments when the distributed optimal formation control protocol is used; it can be observed that the UGV system can quickly complete the triangle formation and that the error between the actual position and the set value is small.

When using the heterogeneous cooperative formation control and the heterogeneous cooperative optimal formation control, the formation states of the UAV system at different times are shown in Figure 13 and Figure 14.

Figure 13 shows the formation status of the UAV system at different times under the use of the heterogeneous cooperative formation control protocol. It is observed that the system can complete the triangle formation. At the thirtieth second, the difference between the actual position and the set value gradually decreases.

Figure 14 shows the formation status of the UAV system at different times under the use of the heterogeneous cooperative optimal formation control. It can be observed that the system can quickly complete the triangle formation and that the error between the actual position and the set value is very small at the tenth second.

When using the heterogeneous cooperative formation control and the heterogeneous cooperative optimal formation control, the formation states of the UGV system at different times are shown in Figure 15 and Figure 16.

Figure 15 shows the formation status of the UGV system at different times under the use of the heterogeneous cooperative formation control protocol. It can be observed that the system can complete the triangle formation. At the thirtieth second, the difference between the actual position and the set value gradually decreases.

Figure 16 shows the formation status of the UGV system at different times under the use of the heterogeneous cooperative optimal formation control. It can be observed that the system can quickly complete the triangle formation and that the error between the actual position and the set value is very small at the tenth second.

To verify the theoretical results more fully, a more complex structure is considered, with the following communication Laplacian matrix:

L = (\begin{matrix} - 3 & 1 & 1 & 0 & 0 & 1 & 0 & 0 & 0 \\ 1 & - 3 & 1 & 0 & 1 & 0 & 0 & 0 & 0 \\ 1 & 1 & - 3 & 0 & 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & - 3 & 1 & 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 1 & - 3 & 1 & 0 & 0 & 0 \\ 1 & 0 & 0 & 1 & 1 & - 3 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 & - 3 & 1 & 1 \\ 0 & 0 & 0 & 0 & 1 & 0 & 1 & - 3 & 1 \\ 0 & 0 & 0 & 1 & 0 & 0 & 1 & 1 & - 3 \end{matrix})
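Two structural properties of this matrix can be checked directly: every row sums to zero, and information starting from agent 1 can reach all nine agents. The following Python sketch is an illustration under one stated assumption, namely that an off-diagonal entry of 1 in row i and column j means agent i receives information from agent j; `row_sums` and `reachable_from` are helper names introduced here. The matrix is transcribed as printed, with -3 on the diagonal.

```python
from collections import deque

# Communication matrix of the nine-agent system, transcribed from the text.
L = [
    [-3, 1, 1, 0, 0, 1, 0, 0, 0],
    [1, -3, 1, 0, 1, 0, 0, 0, 0],
    [1, 1, -3, 0, 0, 1, 0, 0, 0],
    [0, 0, 1, -3, 1, 1, 0, 0, 0],
    [0, 1, 0, 1, -3, 1, 0, 0, 0],
    [1, 0, 0, 1, 1, -3, 0, 0, 0],
    [0, 0, 0, 0, 0, 1, -3, 1, 1],
    [0, 0, 0, 0, 1, 0, 1, -3, 1],
    [0, 0, 0, 1, 0, 0, 1, 1, -3],
]


def row_sums(matrix):
    """Sum of the entries in each row."""
    return [sum(row) for row in matrix]


def reachable_from(matrix, root):
    """Indices of agents that information from `root` can reach (BFS).

    Assumes matrix[i][j] != 0 with i != j means agent i listens to agent j.
    """
    seen = {root}
    queue = deque([root])
    while queue:
        j = queue.popleft()
        for i in range(len(matrix)):
            if i != j and matrix[i][j] != 0 and i not in seen:
                seen.add(i)
                queue.append(i)
    return seen


is_symmetric = all(L[i][j] == L[j][i] for i in range(9) for j in range(9))

print(row_sums(L))                     # nine zeros
print(sorted(reachable_from(L, 0)))    # indices 0..8
print(is_symmetric)                    # False
```

Under the stated reading of the entries, the topology is directed (the matrix is not symmetric) and contains a spanning tree rooted at agent 1, while information starting at agent 7 stays within agents 7 to 9.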

The states of the added group of UAVs are as follows. The positions of UAV4, UAV5, and UAV6 are (40, 50, 10) m, (60, 30, 20) m, and (30, 20, 10) m, respectively, and their speeds are (1, 1, 1) m/s, (2, 2, 1) m/s, and (3, 1, 1) m/s, respectively. The attitude angle of each of these UAVs is {(0, 0, 0)}^{\circ}. The setting positions of UAV4, UAV5, and UAV6 are (10, 10, 10) m, (0, 5, 10) m, and (0, 15, 10) m, respectively. The UAV systems have a set speed of (1, 0, 0) m/s. The simulation results are shown in Figure 17, Figure 18, Figure 19 and Figure 20.

The experiments have verified the formation control and the distributed optimal formation control, as shown in Figure 17 and Figure 18, respectively. Comparing Figure 17 with Figure 18 shows that both protocols can complete the triangular formation at the same time; with distributed optimal formation control, however, the error between the actual position and the set value is significantly reduced in a short period of time.

When the formation control protocol is used with parameters α = 0.2, β = 1.5, γ_{1} = 5, and γ_{2} = 2, Figure 17 shows the change in the position coordinates of the complex system and the expected formation state of the complex system. However, there is a significant error relative to the set value.

When the distributed optimal formation control protocol is used with parameters k_{1} = 2.3452, k_{2} = 6.4707, k_{3} = 7.7541, k_{4} = 4.5835, g_{1} = 0.4472, and g_{2} = 1.0461, Figure 18 shows the change in the position coordinates of the complex system and the expected formation state of the complex system. The complex system can quickly complete the triangular formation, and the error between the actual position and the set value is small.

The experiments have verified the cooperative formation control and the cooperative optimal formation control, as shown in Figure 19 and Figure 20, respectively.

When the cooperative formation control protocol is used with parameters α = 0.2, β = 1.5, γ_{1} = 5, and γ_{2} = 2, Figure 19 shows the change in the position coordinates of the complex system and the expected formation state of the complex system. However, there is a significant error relative to the set value.

When the cooperative optimal formation control protocol is used with parameters k_{1} = 2.3452, k_{2} = 6.4707, k_{3} = 7.7541, k_{4} = 4.5835, g_{1} = 0.4472, and g_{2} = 1.0461, Figure 20 shows the change in the position coordinates of the complex system and the expected formation state of the complex system. The complex system can quickly complete the triangular formation, and the error between the actual position and the set value is small. In addition, the cooperative optimal formation protocol speeds up the convergence of the complex system, which shortens its formation time.

5. Conclusions

In this paper, a heterogeneous multi-agent system has been established by analyzing the dynamics model of the unmanned ground vehicle and the unmanned aerial vehicle. Firstly, the formation control protocol is proposed based on the communication topology of a multi-agent system. Then, according to the internal state of a single agent, the optimal control law of a single agent system is designed using the optimal control theory, and the optimal control law is introduced into the system to achieve the distributed optimal formation. Finally, based on the cooperative architecture of the heterogeneous multi-agent system, the cooperative formation design of the heterogeneous multi-agent system is carried out, and the optimal control theory is introduced into the heterogeneous multi-agent system to realize the optimal cooperative formation of the heterogeneous system. The stability of the system is further analyzed by graph theory. The communication topology of the multi-agent system does not interfere with the protocol and the protocol can optimize the performance function while the system completes the formation task. The simulation results show that the optimal control can accelerate the convergence speed of the system and greatly help the system to quickly reach the desired formation state. In the next step, we plan to investigate the anomaly detection and recognition problems under heterogeneous multi-agent cooperation architecture, and we plan to apply the theoretical research results in practice to engineering applications.

Author Contributions

Conceptualization, Y.L. and M.L.; methodology, Y.L.; software, M.L. and J.L.; validation, Y.L., M.L. and J.L.; writing (original draft preparation), Y.L.; resources, Y.G.; writing (review and editing), M.L.; supervision, Y.G.; funding acquisition, Y.G. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (61872204) and the Scientific Research Project of Heilongjiang Provincial Universities, China (Grant No. 145109143).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data that support the findings of this study are available from the corresponding author, Meichen Liu, upon reasonable request.

Acknowledgments

We appreciate all the authors for their contributions and the support of the foundation.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Cheung, C.; Grocholsky, B. UAV-UGV Collaboration with a PackBot UGV and Raven SUAV for Pursuit and Tracking of a Dynamic Target. In Unmanned Systems Technology X; SPIE: Bellingham, WA, USA, 2008; Volume 6962, pp. 381–390.
2. Moseley, M.B.; Grocholsky, B.P.; Cheung, C.; Singh, S. Integrated long-range UAV/UGV collaborative target tracking. In Unmanned Systems Technology XI; SPIE: Bellingham, WA, USA, 2009; Volume 7332, pp. 30–40.
3. Schulteis, T.M.; Price, J.G. Project stork UAV/UGV collaborative initiative. In Unmanned Ground Vehicle Technology VI; SPIE: Bellingham, WA, USA, 2004; Volume 5422, pp. 414–425.
4. Cao, Y.; Yu, W.; Ren, W.; Chen, G. An overview of recent progress in the study of distributed multi-agent coordination. IEEE Trans. Ind. Inform. 2013, 9, 427–438.
5. Chen, F.; Dimarogonas, D.V. Leader–follower formation control with prescribed performance guarantees. IEEE Trans. Control. Netw. Syst. 2021, 8, 450–461.
6. Han, N.; Luo, X. Tracking and distributed formation control for leader-following heterogeneous multi-agent systems. In Proceedings of the 2016 35th Chinese Control Conference (CCC), Chengdu, China, 27–29 July 2016; pp. 7897–7901.
7. Li, S.; Zhang, J.; Li, X.; Wang, F.; Luo, X.; Guan, X. Formation control of heterogeneous discrete-time nonlinear multi-agent systems with uncertainties. IEEE Trans. Ind. Electron. 2017, 64, 4730–4740.
8. Liang, S.; Wang, F.; Chen, Z.; Liu, Z. Formation Control for Discrete-Time Heterogeneous Multi-Agent Systems; John Wiley & Sons: Hoboken, NJ, USA, 2022.
9. Brandão, A.S.; Sarcinelli-Filho, M.; Carelli, R. Leader-following control of a UAV-UGV formation. In Proceedings of the 2013 16th International Conference on Advanced Robotics (ICAR), Montevideo, Uruguay, 25–29 November 2013; pp. 1–6.
10. Luo, Q.; Duan, H. An improved artificial physics approach to multiple UAVs/UGVs heterogeneous coordination. Sci. China Technol. Sci. 2013, 56, 2473–2479.
11. Rahimi, R.; Abdollahi, F.; Naqshi, K. Time-varying formation control of a collaborative heterogeneous multi agent system. Robot. Auton. Syst. 2014, 62, 1799–1805.
12. Abbaspour, A.; Alipour, K.; Jafari, H.Z.; Moosavian, S.A.A. Optimal formation and control of cooperative wheeled mobile robots. Comptes Rendus Mécanique 2015, 343, 307–321.
13. Ai, X.L.; Yu, J.Q.; Chen, Y.B.; Chen, F.Z.; Shen, Y.C. Optimal formation control with limited communication for multi-unmanned aerial vehicle in an obstacle-laden environment. Part G J. Aerosp. Eng. 2017, 231, 979–997.
14. Lin, W.; Zhao, W.; Liu, H. Robust optimal formation control of heterogeneous multi-agent system via reinforcement learning. IEEE Access 2020, 8, 218424–218432.
15. Huang, R.; Zhou, J.; Huang, H. Flexible formation control through time-varying weighting multi-agent flocking. In Proceedings of the 2019 Chinese Control And Decision Conference (CCDC), Nanchang, China, 3–5 June 2019; pp. 3424–3429.
16. Yan, Z.-P.; Liu, Y.-B.; Yu, C.-B.; Zhou, J.-J. Leader-following coordination of multiple UUVs formation under two independent topologies and time-varying delays. J. Cent. South Univ. 2017, 24, 382–393.
17. Zhao, J.; Dai, F.; Song, Y. A Distributed Optimal Formation Control for Multi-Agent System of UAVs. In Proceedings of International Conference on Artificial Life and Robotics; ALife Robotics Corporation Ltd.: Oita, Japan, 2022.
18. Zhou, J.; Zeng, D.; Lu, X. Multi-agent trajectory-tracking flexible formation via generalized flocking and leader-average sliding mode control. IEEE Access 2020, 8, 36089–36099.
19. Cui, Y.; Fei, M.; Du, D. Event-triggered cooperative compensation control for consensus of heterogeneous multi-agent systems. IET Control. Theory Appl. 2016, 10, 1573–1582.
20. Li, S.; Feng, G.; Wang, J.; Luo, X.; Guan, X. Adaptive control for cooperative linear output regulation of heterogeneous multi-agent systems with periodic switching topology. IET Control. Theory Appl. 2015, 9, 34–41.
21. Yuan, C.; He, H. Cooperative output regulation of heterogeneous multi-agent systems with a leader of bounded inputs. IET Control. Theory Appl. 2018, 12, 233–242.
22. Ren, W.; Beard, R.W. Distributed Consensus in Multi-Vehicle Cooperative Control; Springer: London, UK, 2008.

Figure 1. Formation control protocol for heterogeneous multi-agent systems.

Figure 2. Distributed optimal formation control protocol for heterogeneous multi-agent systems.

Figure 3. Position and velocity states change with time using the formation control protocol.

Figure 4. Attitude angle changes with time using a formation control protocol.

Figure 5. Changes in position and velocity states with time using the distributed optimal formation control protocol.

Figure 6. Attitude angle changes with time using the distributed optimal formation control protocol.

Figure 7. Distributed cooperative formation control for heterogeneous multi-agent systems.

Figure 8. Distributed cooperative optimal formation control for heterogeneous multi-agent systems.

Figure 9. UAV system state formation based on formation control.

Figure 10. UAV system state formation based on distributed optimal formation control.

Figure 11. UGV system state formation based on formation control.

Figure 12. UGV system state formation based on distributed optimal formation control.

Figure 13. UAV system state formation based on heterogeneous cooperative formation control.

Figure 14. UAV system state formation based on heterogeneous cooperative optimal formation control.

Figure 15. UGV system state formation based on heterogeneous cooperative formation control.

Figure 16. UGV system state formation based on heterogeneous cooperative optimal formation control.

Figure 17. Formation control protocol for complex heterogeneous multi-agent systems.

Figure 18. Distributed optimal formation control protocol for complex heterogeneous multi-agent systems.

Figure 19. Distributed cooperative formation control for complex heterogeneous multi-agent systems.

Figure 20. Distributed cooperative optimal formation control for complex heterogeneous multi-agent systems.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

