Formation Control for Mixed-Order UAVs–USVs–UUVs Systems under Cooperative and Optimal Control

Liu, Meichen; Li, Yandong; Zhu, Ling; Guo, Yuan; Liu, Bohao

doi:10.3390/jmse11040704

by Meichen Liu ¹,², Yandong Li ¹,²,*, Ling Zhu ³, Yuan Guo ⁴ and Bohao Liu ¹

¹ College of Computer and Control Engineering, Qiqihar University, Qiqihar 161000, China

² Heilongjiang Key Laboratory of Big Data Network Security Detection and Analysis, Qiqihar University, Qiqihar 161000, China

³ School of Mechanical and Electronic Engineering, Qiqihar University, Qiqihar 161000, China

⁴ School of Computer Science and Technology, Harbin University of Science and Technology, Harbin 150080, China

* Author to whom correspondence should be addressed.

J. Mar. Sci. Eng. 2023, 11(4), 704; https://doi.org/10.3390/jmse11040704

Submission received: 22 February 2023 / Revised: 17 March 2023 / Accepted: 22 March 2023 / Published: 24 March 2023

(This article belongs to the Section Ocean Engineering)

Abstract:

In this paper, cooperative control and optimal control methods are applied to the formation control of mixed-order heterogeneous multi-agent systems consisting of unmanned aerial vehicles (UAVs), unmanned surface vehicles (USVs), and unmanned underwater vehicles (UUVs). The system is represented in state space using a block Kronecker product. Static and dynamic formation control protocols are proposed, and graph theory is used to prove that these protocols achieve formation. Optimal control and cooperative control are then introduced into both protocols to design a static cooperative optimal formation control protocol and a dynamic cooperative optimal formation control protocol. In MATLAB simulations, each cooperative optimal protocol is compared with the corresponding formation control protocol. The comparison shows that the state variables of the system converge quickly and that the system completes the formation in a short time, which verifies the effectiveness of the optimal control theory and cooperative control.

Keywords:

mixed-order multi-agent system; optimal control; cooperative control; cooperative optimal formation

1. Introduction

Multi-agent systems (MASs) are widely used in rescue [1,2,3], reconnaissance [4,5,6], exploration [7,8,9], and other fields. They can cooperate to complete complex tasks that a single agent cannot complete.

In recent years, formation control of multi-agent systems has become a very active research field and has been widely applied in engineering, for example in robotics [10,11,12], attitude control of multi-satellite systems [13,14], and flight control of UAVs [15,16]. The challenge of formation control is to accomplish difficult global tasks by means of an appropriate formation control method. Many formation control methods and techniques have been studied. For example, Ref. [17] combines the A* algorithm with an optimization algorithm to obtain a collision-free formation path. In [18], multiple mobile robots adopt the leader-following method for formation control. Ref. [19] minimizes the Kullback–Leibler divergence to achieve the desired formation of multiple UAVs. To reduce the communication burden of multiple UAVs, Ref. [20] designs a distributed framework whose control protocol includes event triggering, with which the system completes the distributed formation. However, the above formation control results apply to homogeneous multi-agent systems. In practical applications, there are many heterogeneous multi-agent systems (HMASs) whose agents differ in structure, dynamic model, and even information perception and decision-making abilities. Ref. [21] studies the formation problem of a class of general first/second-order discrete-time heterogeneous multi-agent systems. In [22], agents from the same group build a time-varying formation in their own dimension, while agents from different groups move cooperatively in different dimensions to accomplish the cross-dimensional formation of heterogeneous multi-agent systems.

In practice, how can the system complete its task quickly? Answering this question leads to optimization problems such as the following. In [23], distributed optimal control is proposed to optimize the trajectories of multiple unicycle robots. The optimal cooperative control of linear multi-agent systems is investigated in [24], where the proposed controller minimizes a quadratic global cost function to a specific optimal value, with the optimal solution being independent of uncertainty. In [25], the leader–follower formation control problem of multi-agent systems is addressed using a stress matrix, which gives better formation flexibility. In [26], inverse optimal control theory is used to demonstrate the effectiveness of distributed control protocols.

Because cooperating multi-agent systems should be autonomous, fault-tolerant, coordinated, adaptable, and scalable, they also involve a variety of other research fields, including resource exploration [27,28], target positioning [29,30], environmental monitoring [31,32], and military exercises [33,34]. So far, the cooperative control problem of MASs has attracted wide attention from researchers in physics, autonomous vehicles, industrial engineering, biology, machine intelligence, and other fields. However, cooperative control between multiple agents with different dynamics remains challenging.

Among the above results, Refs. [21,22] study only the formation problem of heterogeneous systems, without considering the optimization of the formation, while Refs. [23,24,25,26] consider the flexibility and optimization of formation only for agents with the same dynamic model. In this paper, we address the formation problem of mixed-order multi-agent systems by combining optimal control theory with cooperative control, and solve the formation optimization problem for systems with different dynamic models while ensuring that the system achieves coordination. The main contributions are as follows:

Firstly, the dynamic model of each agent system is introduced, and the different dynamic model systems are written into a state space by using a block Kronecker product. Secondly, static formation control protocols and dynamic formation control protocols are designed, respectively, and graph theory is used to prove that the control protocols can complete formation. Furthermore, the optimal control law of each agent system is designed by using the optimal control theory, and the problem of dimensionality inconsistency is solved by using cooperative control. The optimal control theory and cooperative control are introduced into the static and dynamic formation control protocols, and the static and dynamic cooperative optimal formation control protocols are designed. Finally, the collaborative optimal formation control protocol and the formation control protocol are compared by simulation, and the system state variables can rapidly converge and realize the cooperative formation, which verifies the effectiveness of the optimal control and the cooperative control.

The rest of this paper is organized as follows. The preliminaries and the system model are introduced in Section 2. Section 3 presents the design of the control protocols. In Section 4, simulation experiments verify the effectiveness of the proposed control protocols. Section 5 summarizes the main contents of this article.

2. Preliminaries

2.1. Graph Theory

A graph $G = (V, E, A)$ is used to represent the topology of the information exchange between agents, where $V = \{v_{1}, v_{2}, \dots, v_{n}\}$ is the set of agent nodes, and each node represents an agent. $E \subseteq \{(i, j) : i, j \in V\}$ is the set of edges $e_{ij}$; an edge $e_{ij}$ indicates that there is information exchange between agents $i$ and $j$, with the information flowing from $i$ to $j$. $A = [a_{ij}]_{n \times n}$ is a weighted adjacency matrix with nonnegative elements $a_{ij}$, where $a_{ij}$ is the weight of the edge $e_{ij} = (v_{i}, v_{j})$. For $i, j = 1, 2, 3, \dots, n$ ($i \neq j$), if the agents $v_{i}$ and $v_{j}$ can receive information from each other, then $a_{ij} > 0$; otherwise, $a_{ij} = 0$.

To simplify the calculation, let $a_{ij} \in \{0, 1\}$. Note that $a_{ij} = a_{ji}$ in an undirected graph, whereas $a_{ij}$ and $a_{ji}$ may differ in a directed graph.

The neighbor set of node $i$ is $N_{i} = \{j \mid j \in V : e_{ij} \in E\}$, which is the set of all agents that exchange information with agent $i$. In the graph, the degree of a node is its number of neighbors, that is, the number of edges at that node. The node degree is defined as $d_{i} = \sum_{j = 1}^{n} a_{ij}$, and the degree matrix $D$ of the graph is the diagonal matrix $D = \mathrm{diag}(d_{1}, d_{2}, \dots, d_{n})$. The Laplacian matrix $L$ of a multi-agent system is defined as $L = D - A$.
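As a small illustration of these definitions, the following sketch builds the degree matrix and the Laplacian matrix $L = D - A$ from a 0/1 adjacency matrix. The three-agent path topology and the function names are hypothetical and chosen only for the example; they do not come from the paper.

```python
def degree_matrix(adj):
    """Diagonal matrix D with d_i = sum_j a_ij on the diagonal."""
    n = len(adj)
    return [[sum(adj[i]) if i == j else 0 for j in range(n)] for i in range(n)]


def laplacian(adj):
    """Graph Laplacian L = D - A for an adjacency matrix given as a list of rows."""
    n = len(adj)
    deg = degree_matrix(adj)
    return [[deg[i][j] - adj[i][j] for j in range(n)] for i in range(n)]


# Hypothetical undirected path graph 1 - 2 - 3 (a_ij = a_ji, entries in {0, 1}).
ADJ = [
    [0, 1, 0],
    [1, 0, 1],
    [0, 1, 0],
]
L = laplacian(ADJ)
for row in L:
    print(row)
```

Every row of $L$ sums to zero, which is why the all-ones vector is always a right eigenvector of $L$ for the eigenvalue 0.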

2.2. Formation Definition

The problem of formation control is to find a control protocol that allows multiple agents to form a desired formation or configuration. The desired vector of the system formation is:

$$X_{d} = [d(P_{A}), d(V_{A}), d(\Omega_{A}), d(\dot{\Omega}_{A}), d(P_{S}), d(V_{S}), d(P_{U}), d(V_{U})]^{T}.$$

The formation error vector can be expressed as:

$$\tilde{X} = X - X_{d} = [\tilde{P}_{A}, \tilde{V}_{A}, \tilde{\Omega}_{A}, \tilde{\dot{\Omega}}_{A}, \tilde{P}_{S}, \tilde{V}_{S}, \tilde{P}_{U}, \tilde{V}_{U}]^{T}.$$

By introducing the error vector, the formation problem is transformed into a consensus problem on the errors of the state variables. When the agents' error vectors reach consensus, the formation is realized.

Definition 1.

The system realizes static formation control when all of its states satisfy Formula (1):

$$\left\{\begin{array}{ll} \lim_{t \to \infty} \|(P_{j} - P_{i}) - (P_{dj} - P_{di})\| = 0, & i, j = 1, 2, 3, \dots, l \\ \lim_{t \to \infty} \|v_{i}\| = 0, & i = 1, 2, 3, \dots, l \\ \lim_{t \to \infty} \|\Omega_{i}\| = 0, & i = 1, 2, 3, \dots, m \\ \lim_{t \to \infty} \|\dot{\Omega}_{i}\| = 0, & i = 1, 2, 3, \dots, m \end{array}\right. \tag{1}$$

Definition 2.

The system realizes dynamic formation control when all of its states satisfy Formula (2):

$$\left\{\begin{array}{ll} \lim_{t \to \infty} \|(P_{j} - P_{i}) - (P_{dj} - P_{di})\| = 0, & i, j = 1, 2, 3, \dots, l \\ \lim_{t \to \infty} \|v_{j} - v_{i}\| = 0, & i, j = 1, 2, 3, \dots, l \\ \lim_{t \to \infty} \|\Omega_{i}\| = 0, & i = 1, 2, 3, \dots, m \\ \lim_{t \to \infty} \|\dot{\Omega}_{i}\| = 0, & i = 1, 2, 3, \dots, m \end{array}\right. \tag{2}$$
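The position condition shared by both definitions can be checked numerically. The sketch below is an illustration only: the function name and the triangle formation are hypothetical. It computes the largest mismatch between the actual relative positions $P_{j} - P_{i}$ and the desired offsets $P_{dj} - P_{di}$ over all agent pairs; the mismatch is zero exactly when the agents hold the desired shape, regardless of where the formation sits in space.

```python
import math


def formation_residual(positions, desired):
    """Largest ||(p_j - p_i) - (d_j - d_i)|| over all ordered agent pairs."""
    worst = 0.0
    n = len(positions)
    for i in range(n):
        for j in range(n):
            diff = [
                (pj - pi) - (dj - di)
                for pi, pj, di, dj in zip(
                    positions[i], positions[j], desired[i], desired[j]
                )
            ]
            worst = max(worst, math.sqrt(sum(c * c for c in diff)))
    return worst


# Hypothetical planar triangle formation (desired positions P_d1, P_d2, P_d3).
DESIRED = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
# The same shape translated by (5, -2): the formation is achieved.
SHIFTED = [(x + 5.0, y - 2.0) for x, y in DESIRED]
# Agent 2 displaced by one unit along x: the formation is not achieved.
DISTORTED = [(0.0, 0.0), (2.0, 0.0), (0.0, 1.0)]

print(formation_residual(SHIFTED, DESIRED))
print(formation_residual(DISTORTED, DESIRED))
```

A translated copy of the desired shape gives a zero residual because every agent has the same position error, which is the error-consensus view of formation used in this section.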

2.3. High-Order UAV Dynamics Model

For the dynamics model of the UAV, refer to [35]. As shown in Figure 1, the attitude angles of the UAV are the roll angle $\phi$, the pitch angle $\theta$, and the yaw angle $\varphi$, which are the rotation angles about the $X_{b}$, $Y_{b}$, and $Z_{b}$ axes, respectively. $M_{1}, M_{2}, M_{3}$, and $M_{4}$ are the torques of the four propellers due to rotation, and $F_{1}, F_{2}, F_{3}$, and $F_{4}$ are the lift forces generated by the four propellers. $I_{x}$, $I_{y}$, and $I_{z}$ denote the rotational inertia about the $X_{b}$, $Y_{b}$, and $Z_{b}$ axes, respectively. Following [36], the drag coefficients are neglected in this paper, the roll and pitch angles are assumed to have only small variations, and the yaw angle is assumed to have no variation, that is, $\sin\phi \approx \phi$, $\sin\theta \approx \theta$, $\cos\phi \approx 1$, $\cos\theta \approx 1$, $\varphi = 0$, $\sin\varphi = 0$, $\cos\varphi = 1$. The simplified dynamics model of the UAV is as follows:

$$\left\{\begin{array}{l} \ddot{p}_{a}^{x} = g\theta \\ \ddot{p}_{a}^{y} = -g\phi \\ \ddot{p}_{a}^{z} = f_{z}/m - g \\ \ddot{\phi} = M_{\phi}/I_{x} \\ \ddot{\theta} = M_{\theta}/I_{y} \\ \ddot{\varphi} = M_{\varphi}/I_{z} \end{array}\right. \tag{3}$$

where $p_{a}^{x}$, $p_{a}^{y}$, and $p_{a}^{z}$ represent the position states; $\phi$, $\theta$, and $\varphi$ represent the attitude states; $f_{z}$ is the lift force in the height direction; $M_{\phi}$, $M_{\theta}$, and $M_{\varphi}$ represent the moments of the quadrotor; and $I_{x}$, $I_{y}$, and $I_{z}$ represent the moments of inertia of the quadrotor.

According to Equation (3), the state equation is:

$$\dot{X}_{A1} = A_{A} X_{A1} + B_{A} U_{A1} \tag{4}$$

The subscript $a$ denotes the UAV state variables, where $X_{A1} = (P_{A1}, V_{A1}, \Omega_{A1}, \dot{\Omega}_{A1})$, $P_{A1} = (p_{ai}^{x}, p_{ai}^{y}, p_{ai}^{z})^{T}$, $V_{A1} = (v_{ai}^{x}, v_{ai}^{y}, v_{ai}^{z})^{T}$, $\Omega_{A1} = (g\theta_{ai}, -g\phi_{ai}, 0)^{T}$, $\dot{\Omega}_{A1} = (g\dot{\theta}_{ai}, -g\dot{\phi}_{ai}, 0)^{T}$, $U_{A1} = (u_{ai}^{x}, u_{ai}^{y}, u_{ai}^{z})^{T}$, and

$$A_{A} = \begin{pmatrix} 0_{3 \times 3} & I_{3} & 0_{3 \times 3} & 0_{3 \times 3} \\ 0_{3 \times 3} & 0_{3 \times 3} & I_{3} & 0_{3 \times 3} \\ 0_{3 \times 3} & 0_{3 \times 3} & 0_{3 \times 3} & I_{3} \\ 0_{3 \times 3} & 0_{3 \times 3} & 0_{3 \times 3} & 0_{3 \times 3} \end{pmatrix}, \quad B_{A} = \begin{pmatrix} 0_{3 \times 3} \\ 0_{3 \times 3} \\ 0_{3 \times 3} \\ I_{3} \end{pmatrix}, \quad I_{3} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}.$$

2.4. Second-Order USV Dynamics Model

The detailed dynamic equations of the USV can be found in [37]. In this article, the USV is restricted to the water surface, where it serves as a cross-media communication relay. It is therefore treated as an agent moving in a planar environment and is characterized by a second-order model:

$$\left\{\begin{array}{l} \dot{p}_{si} = v_{si} \\ \dot{v}_{si} = u_{si} \end{array}\right. \tag{5}$$

where $p_{si} = [p_{si}^{x}, p_{si}^{y}]^{T}$ represents the position state, $v_{si} = [v_{si}^{x}, v_{si}^{y}]^{T}$ represents the velocity in the direction of $p_{si}$, and $u_{si}$ represents the input of agent $i$.

According to Equation (5), the state equation is:

$$\dot{X}_{S1} = A_{S} X_{S1} + B_{S} U_{S1} \tag{6}$$

The subscript $s$ denotes the USV state variables, where $X_{S1} = (P_{S1}, V_{S1})$, $P_{S1} = (p_{si}^{x}, p_{si}^{y})^{T}$, $V_{S1} = (v_{si}^{x}, v_{si}^{y})^{T}$, $U_{S1} = (u_{si}^{x}, u_{si}^{y})^{T}$, and

$$A_{S} = \begin{pmatrix} 0_{2 \times 2} & I_{2} \\ 0_{2 \times 2} & 0_{2 \times 2} \end{pmatrix}, \quad B_{S} = \begin{pmatrix} 0_{2 \times 2} \\ I_{2} \end{pmatrix}, \quad I_{2} = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}.$$

2.5. Second-Order UUV Dynamics Model

The detailed kinematic and dynamic equations of the UUV can be found in [38]. For the UUV structure studied in this paper, there is no thruster to control the angular velocity in roll, rolling has little influence on the translational motion, and the dynamics of the actuators and thrusters are neglected. The UUV is assumed to be torpedo-type, so that the system is characterized by a second-order model [39].

To simplify the problem, the dynamics are written as follows:

$$\left\{\begin{array}{l} \dot{p}_{ui} = v_{ui} \\ \dot{v}_{ui} = u_{ui} \end{array}\right. \tag{7}$$

With reference to Figure 2, $p_{ui} = [p_{ui}^{x}, p_{ui}^{y}, p_{ui}^{z}]^{T}$ represents the position state, $v_{ui} = [v_{ui}^{x}, v_{ui}^{y}, v_{ui}^{z}]^{T}$ represents the velocity in the direction of $p_{ui}$, and $u_{ui}$ represents the input of agent $i$.

According to Equation (7), the state equation is:

$$\dot{X}_{U1} = A_{U} X_{U1} + B_{U} U_{U1} \tag{8}$$

The subscript $u$ denotes the UUV state variables, where $X_{U1} = (P_{U1}, V_{U1})$, $P_{U1} = (p_{ui}^{x}, p_{ui}^{y}, p_{ui}^{z})^{T}$, $V_{U1} = (v_{ui}^{x}, v_{ui}^{y}, v_{ui}^{z})^{T}$, $U_{U1} = (u_{ui}^{x}, u_{ui}^{y}, u_{ui}^{z})^{T}$, and

$$A_{U} = \begin{pmatrix} 0_{3 \times 3} & I_{3} \\ 0_{3 \times 3} & 0_{3 \times 3} \end{pmatrix}, \quad B_{U} = \begin{pmatrix} 0_{3 \times 3} \\ I_{3} \end{pmatrix}, \quad I_{3} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}.$$
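The three state matrices share one pattern: a chain of integrators replicated along each spatial axis. As an illustrative sketch (the helper names `kron`, `shift`, and `last_unit_column` are hypothetical and not from the paper), each pair $(A, B)$ can be generated as a Kronecker product of a small shift matrix, or a unit column, with an identity matrix.

```python
def kron(a, b):
    """Kronecker product of two matrices given as lists of rows."""
    return [
        [a_ij * b_kl for a_ij in a_row for b_kl in b_row]
        for a_row in a
        for b_row in b
    ]


def identity(n):
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]


def shift(order):
    """order x order matrix with ones on the first superdiagonal (integrator chain)."""
    return [[1 if j == i + 1 else 0 for j in range(order)] for i in range(order)]


def last_unit_column(order):
    """order x 1 column that routes the input into the last integrator."""
    return [[1 if i == order - 1 else 0] for i in range(order)]


# UAV: fourth-order chain in 3-D; USV: second-order in 2-D; UUV: second-order in 3-D.
A_A, B_A = kron(shift(4), identity(3)), kron(last_unit_column(4), identity(3))
A_S, B_S = kron(shift(2), identity(2)), kron(last_unit_column(2), identity(2))
A_U, B_U = kron(shift(2), identity(3)), kron(last_unit_column(2), identity(3))

print(len(A_A), "x", len(A_A[0]))  # dimensions of A_A
print(len(B_S), "x", len(B_S[0]))  # dimensions of B_S
```

Writing the matrices this way makes the mixed-order structure explicit: only the chain length (4 or 2) and the spatial dimension (3 or 2) differ between the vehicle types.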

2.6. Heterogeneous Multi-Agent System

Write the UAV, USV, and UUV systems as a single state-space model:

$$\dot{X} = A X + B U \tag{9}$$

where $X = [P_{A}, V_{A}, \Omega_{A}, \dot{\Omega}_{A}, P_{S}, V_{S}, P_{U}, V_{U}]$ and

$P_{A} = [p_{1}, p_{2}, p_{3}, \dots, p_{m}]$, $p_{i} = [x_{i}, y_{i}, z_{i}]^{T}$, $i = 1, 2, \dots, m$,

$V_{A} = [v_{1}, v_{2}, v_{3}, \dots, v_{m}]$, $v_{i} = [v_{i}^{x}, v_{i}^{y}, v_{i}^{z}]^{T}$, $i = 1, 2, \dots, m$,

$\Omega_{A} = [\Omega_{1}, \Omega_{2}, \Omega_{3}, \dots, \Omega_{m}]$, $\Omega_{i} = [g\theta_{i}, -g\phi_{i}, 0]^{T}$, $i = 1, 2, \dots, m$,

$\dot{\Omega}_{A} = [\dot{\Omega}_{1}, \dot{\Omega}_{2}, \dot{\Omega}_{3}, \dots, \dot{\Omega}_{m}]$, $\dot{\Omega}_{i} = [g\dot{\theta}_{i}, -g\dot{\phi}_{i}, 0]^{T}$, $i = 1, 2, \dots, m$,

$P_{S} = [p_{m+1}, \dots, p_{k}]$, $p_{i} = [x_{i}, y_{i}]^{T}$, $i = m+1, \dots, k$,

$V_{S} = [v_{m+1}, \dots, v_{k}]$, $v_{i} = [v_{i}^{x}, v_{i}^{y}]^{T}$, $i = m+1, \dots, k$,

$P_{U} = [p_{k+1}, \dots, p_{l}]$, $p_{i} = [x_{i}, y_{i}, z_{i}]^{T}$, $i = k+1, \dots, l$,

$V_{U} = [v_{k+1}, \dots, v_{l}]$, $v_{i} = [v_{i}^{x}, v_{i}^{y}, v_{i}^{z}]^{T}$, $i = k+1, \dots, l$.

One UAV, one USV, and one UUV form a group, and the formation can be extended to multiple groups, so that $k = 2m$ and $l = 3m$.

The matrix $A$ is:

$$A = \begin{pmatrix} 0_{3 \times 3} & I_{3} & 0_{3 \times 3} & 0_{3 \times 3} & 0_{3 \times 2} & 0_{3 \times 2} & 0_{3 \times 3} & 0_{3 \times 3} \\ 0_{3 \times 3} & 0_{3 \times 3} & I_{3} & 0_{3 \times 3} & 0_{3 \times 2} & 0_{3 \times 2} & 0_{3 \times 3} & 0_{3 \times 3} \\ 0_{3 \times 3} & 0_{3 \times 3} & 0_{3 \times 3} & I_{3} & 0_{3 \times 2} & 0_{3 \times 2} & 0_{3 \times 3} & 0_{3 \times 3} \\ 0_{3 \times 3} & 0_{3 \times 3} & 0_{3 \times 3} & 0_{3 \times 3} & 0_{3 \times 2} & 0_{3 \times 2} & 0_{3 \times 3} & 0_{3 \times 3} \\ 0_{2 \times 3} & 0_{2 \times 3} & 0_{2 \times 3} & 0_{2 \times 3} & 0_{2 \times 2} & I_{2} & 0_{2 \times 3} & 0_{2 \times 3} \\ 0_{2 \times 3} & 0_{2 \times 3} & 0_{2 \times 3} & 0_{2 \times 3} & 0_{2 \times 2} & 0_{2 \times 2} & 0_{2 \times 3} & 0_{2 \times 3} \\ 0_{3 \times 3} & 0_{3 \times 3} & 0_{3 \times 3} & 0_{3 \times 3} & 0_{3 \times 2} & 0_{3 \times 2} & 0_{3 \times 3} & I_{3} \\ 0_{3 \times 3} & 0_{3 \times 3} & 0_{3 \times 3} & 0_{3 \times 3} & 0_{3 \times 2} & 0_{3 \times 2} & 0_{3 \times 3} & 0_{3 \times 3} \end{pmatrix} \otimes I_{m}$$

The matrix $B$ is:

$$B = \begin{pmatrix} 0_{3 \times 3} & 0_{3 \times 2} & 0_{3 \times 3} \\ 0_{3 \times 3} & 0_{3 \times 2} & 0_{3 \times 3} \\ 0_{3 \times 3} & 0_{3 \times 2} & 0_{3 \times 3} \\ I_{3} & 0_{3 \times 2} & 0_{3 \times 3} \\ 0_{2 \times 3} & 0_{2 \times 2} & 0_{2 \times 3} \\ 0_{2 \times 3} & I_{2} & 0_{2 \times 3} \\ 0_{3 \times 3} & 0_{3 \times 2} & 0_{3 \times 3} \\ 0_{3 \times 3} & 0_{3 \times 2} & I_{3} \end{pmatrix} \otimes I_{m}$$

The input $U$ is:

$$U = \begin{pmatrix} U_{A1} \\ U_{S1} \\ U_{U1} \end{pmatrix}$$

Here $I_{m}$ denotes the $m$-dimensional identity matrix and $\otimes$ is the Kronecker product.

3. Design of Control Protocol

3.1. Formation Control Protocol

When the system formation reaches the desired state, the expected control input is 0, and the formation state equation of system (9) can be expressed as:

$$\dot{\tilde{X}} = A \tilde{X} + B U \tag{10}$$

Based on [40], a static formation control protocol is proposed:

$$\left\{\begin{array}{ll} u_{ia} = \alpha \sum_{j = 1}^{l} a_{ij} (\tilde{p}_{j} - \tilde{p}_{i}) - \beta \tilde{v}_{i} - \gamma_{1} \Omega_{i} - \gamma_{2} \dot{\Omega}_{i}, & i = 1, 2, 3, \dots, m \\ u_{is} = \alpha \sum_{j = 1}^{l} a_{ij} (\tilde{p}_{j} - \tilde{p}_{i}) - \beta \tilde{v}_{i}, & i = m + 1, \dots, k \\ u_{iu} = \alpha \sum_{j = 1}^{l} a_{ij} (\tilde{p}_{j} - \tilde{p}_{i}) - \beta \tilde{v}_{i}, & i = k + 1, \dots, l \end{array}\right. \tag{11}$$

A dynamic formation control protocol is also proposed:

$$\left\{\begin{array}{ll} u_{ia} = \alpha \sum_{j = 1}^{l} a_{ij} (\tilde{p}_{j} - \tilde{p}_{i}) + \beta \sum_{j = 1}^{l} a_{ij} (\tilde{v}_{j} - \tilde{v}_{i}) - \gamma_{1} \Omega_{i} - \gamma_{2} \dot{\Omega}_{i}, & i = 1, 2, 3, \dots, m \\ u_{is} = \alpha \sum_{j = 1}^{l} a_{ij} (\tilde{p}_{j} - \tilde{p}_{i}) + \beta \sum_{j = 1}^{l} a_{ij} (\tilde{v}_{j} - \tilde{v}_{i}), & i = m + 1, \dots, k \\ u_{iu} = \alpha \sum_{j = 1}^{l} a_{ij} (\tilde{p}_{j} - \tilde{p}_{i}) + \beta \sum_{j = 1}^{l} a_{ij} (\tilde{v}_{j} - \tilde{v}_{i}), & i = k + 1, \dots, l \end{array}\right. \tag{12}$$
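To make the static protocol concrete, the following sketch simulates only its second-order form (the $u_{is}$ and $u_{iu}$ rows of (11)) for three agents moving along a single axis, using forward-Euler integration. It is a simplified illustration under stated assumptions, not the paper's MATLAB simulation: the topology, gains, initial states, step size, and function name are all hypothetical, and the fourth-order UAV channel is omitted.

```python
def simulate_static_protocol(adj, p0, v0, desired, alpha=1.0, beta=2.0,
                             dt=0.01, steps=4000):
    """Forward-Euler simulation of double integrators under
    u_i = alpha * sum_j a_ij (e_j - e_i) - beta * v_i, with e_i = p_i - d_i.
    The desired velocity is zero (static formation), so the velocity error is v_i."""
    n = len(p0)
    p, v = list(p0), list(v0)
    for _ in range(steps):
        err = [p[i] - desired[i] for i in range(n)]
        u = [
            alpha * sum(adj[i][j] * (err[j] - err[i]) for j in range(n))
            - beta * v[i]
            for i in range(n)
        ]
        # Update positions with the old velocities, then velocities with u.
        p = [p[i] + dt * v[i] for i in range(n)]
        v = [v[i] + dt * u[i] for i in range(n)]
    return p, v


# Hypothetical fully connected undirected topology with three agents.
ADJ = [[0, 1, 1], [1, 0, 1], [1, 1, 0]]
DESIRED = [0.0, 1.0, 2.0]          # desired positions along the axis
p_final, v_final = simulate_static_protocol(
    ADJ, p0=[3.0, -1.0, 0.5], v0=[0.0, 0.0, 0.0], desired=DESIRED
)
print([round(x, 4) for x in p_final])
print([round(x, 4) for x in v_final])
```

In this run the relative positions approach the desired offsets and the velocities decay to zero, which is the behavior required by Definition 1. The absolute location of the formation is not fixed by the protocol and depends on the initial conditions.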

Lemma 1 [41].

For an $N \times N$ Laplacian matrix $L$, $e^{-Lt}$, $t > 0$, is a row-stochastic matrix with positive diagonal elements. If $L$ has a simple zero eigenvalue, that is, $\mathrm{Rank}(L) = N - 1$, then the left eigenvector $P_{l} = [P_{l1}, P_{l2}, \dots, P_{ln}]^{T} \geq 0$ associated with the zero eigenvalue satisfies $1_{n}^{T} P_{l} = 1$ and $L^{T} P_{l} = 0$, and $e^{-Lt} \to 1_{n} P_{l}^{T}$ as $t \to \infty$.
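A small worked case may help to read Lemma 1. The two-agent graph below is a hypothetical example, not taken from the paper; because $L^{2} = 2L$ for this graph, the matrix exponential can be written in closed form and its limit checked directly.

```latex
% Illustrative example: two agents joined by one undirected edge.
L = \begin{pmatrix} 1 & -1 \\ -1 & 1 \end{pmatrix}, \qquad L^{2} = 2L
\;\Longrightarrow\;
e^{-Lt} = I + \frac{e^{-2t} - 1}{2}\, L
        = \frac{1}{2}\begin{pmatrix} 1 + e^{-2t} & 1 - e^{-2t} \\ 1 - e^{-2t} & 1 + e^{-2t} \end{pmatrix}

% Each row sums to 1 and the diagonal entries are positive for all t > 0.
\lim_{t \to \infty} e^{-Lt}
  = \frac{1}{2}\begin{pmatrix} 1 & 1 \\ 1 & 1 \end{pmatrix}
  = 1_{2} P_{l}^{T}, \qquad
P_{l} = \begin{pmatrix} 1/2 \\ 1/2 \end{pmatrix}, \quad
1_{2}^{T} P_{l} = 1, \quad L^{T} P_{l} = 0
```

Here the zero eigenvalue of $L$ is simple (the other eigenvalue is 2), so the limit is the rank-one matrix predicted by the lemma.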

Theorem 1.

For the formation state Equation (10), when the communication topology $G = (V, E, A)$ is a connected undirected graph or a directed graph containing a spanning tree, and the gain parameters satisfy the conditions $\alpha > 0$, $\beta > 0$, $\gamma_{1} > 0$, $\gamma_{2} > 0$, $\beta \gg \alpha$, $\gamma_{1} > \beta \gg \alpha$, $\gamma_{2} > \beta \gg \alpha$, and $\gamma_{2} \gamma_{1} \beta > \beta^{2} + \gamma_{2}^{2} \alpha$, the static formation control protocol achieves formation and the system state variables converge.

Proof of Theorem 1.

To describe the communication relationships in system (9), the Laplacian matrix is partitioned as follows:

$$L = \begin{pmatrix} L_{AA} & L_{AS} & 0 \\ L_{SA} & L_{SS} & L_{SU} \\ 0 & L_{US} & L_{UU} \end{pmatrix},$$

where $L_{ii}$ represents the Laplacian matrix among agents of the same type, and $L_{ij}$ ($i \neq j$) represents the communication links from the $i$th agent subsystem to the $j$th agent subsystem, that is, the communication connections between heterogeneous agent subsystems.

The static formation control protocol is written in a unified form:

$$U = T_{d} \tilde{X}$$

where

$$T_{d} = \begin{pmatrix} \alpha L_{AA} \otimes I_{3} & -\beta I_{m} \otimes I_{3} & -\gamma_{1} I_{m} \otimes I_{3} & -\gamma_{2} I_{m} \otimes I_{3} & \alpha L_{AS} \otimes T_{1} & 0 & 0 & 0 \\ \alpha L_{SA} \otimes T_{2} & 0 & 0 & 0 & \alpha L_{SS} \otimes I_{2} & -\beta I_{m} \otimes I_{2} & \alpha L_{SU} \otimes T_{2} & 0 \\ 0 & 0 & 0 & 0 & \alpha L_{US} \otimes T_{1} & 0 & \alpha L_{UU} \otimes I_{3} & -\beta I_{m} \otimes I_{3} \end{pmatrix}$$

$$T_{1} = \begin{pmatrix} 1 & 0 \\ 0 & 1 \\ 0 & 0 \end{pmatrix}, \quad T_{2} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \end{pmatrix}$$
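One way to read $T_{1}$ and $T_{2}$ is as dimension-matching maps between the planar USV states and the three-dimensional UAV and UUV states. The sketch below illustrates that reading with hypothetical position values and helper names: $T_{1}$ pads a 2-D position with a zero $z$ component, and $T_{2}$ keeps only the $x$ and $y$ components of a 3-D position.

```python
T1 = [[1, 0], [0, 1], [0, 0]]      # 3 x 2: lifts a planar vector into 3-D
T2 = [[1, 0, 0], [0, 1, 0]]        # 2 x 3: projects a 3-D vector onto the plane


def apply(matrix, vector):
    """Matrix-vector product for a matrix given as a list of rows."""
    return [sum(m * x for m, x in zip(row, vector)) for row in matrix]


def matmul(a, b):
    """Matrix-matrix product for matrices given as lists of rows."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


usv_position = [4.0, -2.0]          # hypothetical USV position error (x, y)
uav_position = [1.0, 3.0, 10.0]     # hypothetical UAV position error (x, y, z)

lifted = apply(T1, usv_position)    # what a UAV or UUV receives from a USV
projected = apply(T2, uav_position)  # what a USV receives from a UAV or UUV
print(lifted, projected)
print(matmul(T2, T1))               # projecting a lifted vector returns it unchanged
```

Since $T_{2} T_{1} = I_{2}$, lifting followed by projection loses nothing, whereas projection followed by lifting discards the height component. This matches the fact that the USVs are confined to the water surface.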

Substituting $U$ into Equation (10) yields:

$$\dot{\tilde{X}} = A \tilde{X} + B U = A \tilde{X} + B T_{d} \tilde{X} = T \tilde{X}$$

where

$$T = \begin{pmatrix} 0 & I_{m} \otimes I_{3} & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & I_{m} \otimes I_{3} & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & I_{m} \otimes I_{3} & 0 & 0 & 0 & 0 \\ \alpha L_{AA} \otimes I_{3} & -\beta I_{m} \otimes I_{3} & -\gamma_{1} I_{m} \otimes I_{3} & -\gamma_{2} I_{m} \otimes I_{3} & \alpha L_{AS} \otimes T_{1} & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & I_{m} \otimes I_{2} & 0 & 0 \\ \alpha L_{SA} \otimes T_{2} & 0 & 0 & 0 & \alpha L_{SS} \otimes I_{2} & -\beta I_{m} \otimes I_{2} & \alpha L_{SU} \otimes T_{2} & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & I_{m} \otimes I_{3} \\ 0 & 0 & 0 & 0 & \alpha L_{US} \otimes T_{1} & 0 & \alpha L_{UU} \otimes I_{3} & -\beta I_{m} \otimes I_{3} \end{pmatrix}$$

The parameters $\alpha$, $\beta$, $\gamma_{1}$, $\gamma_{2}$ must be chosen so that $T$ has a zero eigenvalue and all of its other eigenvalues have negative real parts. The gain parameters can be determined using the Routh–Hurwitz stability criterion [42] as follows:

$$\alpha > 0, \ \beta > 0, \ \gamma_{1} > 0, \ \gamma_{2} > 0, \ \beta \gg \alpha, \ \gamma_{1} > \beta \gg \alpha, \ \gamma_{2} > \beta \gg \alpha, \ \gamma_{2} \gamma_{1} \beta > \beta^{2} + \gamma_{2}^{2} \alpha$$
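The last inequality has the shape of the Routh–Hurwitz condition for a quartic polynomial. As a hedged sketch, assume (this is not stated in the paper) that one decoupled mode of the fourth-order UAV channel has the characteristic polynomial $s^{4} + \gamma_{2} s^{3} + \gamma_{1} s^{2} + \beta s + \alpha\lambda$, where $\lambda > 0$ stands for a nonzero Laplacian eigenvalue. The function names and the numerical gains below are hypothetical.

```python
def quartic_is_hurwitz(a3, a2, a1, a0):
    """Routh-Hurwitz test for the monic quartic s^4 + a3 s^3 + a2 s^2 + a1 s + a0.
    All roots lie in the open left half-plane iff every condition below holds."""
    return (
        a3 > 0 and a2 > 0 and a1 > 0 and a0 > 0
        and a3 * a2 > a1
        and a3 * a2 * a1 > a1 ** 2 + a3 ** 2 * a0
    )


def uav_gains_stable(alpha, beta, gamma1, gamma2, lam=1.0):
    """Apply the test to the assumed per-mode polynomial
    s^4 + gamma2 s^3 + gamma1 s^2 + beta s + alpha * lam (lam is an assumption)."""
    return quartic_is_hurwitz(gamma2, gamma1, beta, alpha * lam)


# Hypothetical gains with gamma1 > beta, gamma2 > beta, and beta much larger than alpha.
print(uav_gains_stable(alpha=0.1, beta=2.0, gamma1=4.0, gamma2=3.0))
# Hypothetical gains violating gamma2 * gamma1 > beta.
print(uav_gains_stable(alpha=5.0, beta=1.0, gamma1=1.0, gamma2=1.0))
```

With `lam = 1.0` the final condition in `quartic_is_hurwitz` reduces to $\gamma_{2} \gamma_{1} \beta > \beta^{2} + \gamma_{2}^{2} \alpha$, the inequality listed in the gain conditions. For larger Laplacian eigenvalues the same test would have to be repeated with the corresponding `lam`.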

After selecting parameters that stabilize the system, $T$ can be transformed into Jordan canonical form:

$$T = P J P^{-1}.$$

Let $P_{dl}^{T}$, the first row of $P^{-1}$, be the left eigenvector associated with the zero eigenvalue, and let $P_{dr}$, the first column of $P$, be the corresponding right eigenvector, so that $P_{dl}^{T} P_{dr} = 1$. As time approaches infinity, the system state becomes:

$$\lim_{t \to \infty} \tilde{X} = \lim_{t \to \infty} e^{T t} \tilde{X}(0), \qquad e^{T t} \tilde{X}(0) \to (P_{dr} P_{dl}^{T}) \tilde{X}(0), \quad t \to \infty.$$

By Lemma 1, the system converges asymptotically and completes the formation as time approaches infinity. □

Theorem 2.

For the formation state Equation (10), when the communication topology $G = (V, E, A)$ is a connected undirected graph or a directed graph containing a spanning tree, the dynamic formation control protocol achieves formation and the system state variables converge.

Proof of Theorem 2.

Refer to the proof of Theorem 1. □

3.2. Optimal Control

The main problem of optimal control is to determine, from the mathematical model of the controlled object, a control law within the admissible control domain that drives the performance index of the system to its extreme value. The controlled objects in this paper are UAVs, USVs, and UUVs. In practical control problems, most control quantities are limited by physical conditions and can only take values within a certain range; such controls are called admissible controls. The performance index is a measure of system performance, and its content and form depend on the task to be completed by the optimal control problem. Based on [43,44], the optimal control law is obtained as follows:

Consider the state equation of the system:

$$\dot{x} = f(x, u, t)$$

The control vector $u$ is chosen to minimize the following performance index:

$$J = \int_{0}^{\infty} F(x, u, t) \, dt$$

Define the Hamiltonian function as:

$$H(x, u, \lambda, t) = F(x, u, t) + \lambda^{T} f(x, u, t)$$

The optimal control law is obtained when the Hamiltonian function satisfies the following conditions:

$$\dot{\lambda} = -\frac{\partial H}{\partial x}, \qquad \dot{x} = \frac{\partial H}{\partial \lambda}, \qquad \frac{\partial H}{\partial u} = 0.$$

For the UAV system, let $\tilde{X}_{A1} = [\tilde{P}_{A1}, \tilde{V}_{A1}, \tilde{\Omega}_{A1}, \tilde{\dot{\Omega}}_{A1}]^{T}$. An integral performance index composed of error variables and control variables is constructed as follows:

$$J_{i} = \int_{0}^{\infty} [\tilde{X}_{A1}^{T} Q \tilde{X}_{A1} + u_{ia}^{T} R u_{ia}] \, dt \tag{13}$$

where $Q \geq 0$ is a symmetric positive semidefinite matrix of appropriate dimension, and $R > 0$ is a symmetric positive definite matrix of appropriate dimension.

To facilitate engineering application, $Q$ and $R$ in the performance index are taken as diagonal matrices. With $Q = \mathrm{diag}[q_{1}, q_{2}, \dots]$, the first part of the performance index can be expressed as $\int_{0}^{\infty} \tilde{X}_{A1}^{T} Q \tilde{X}_{A1} \, dt = \int_{0}^{\infty} \sum_{j} q_{j} \tilde{x}_{j}^{2} \, dt$, where $\tilde{x}_{j}$ is the $j$th component of $\tilde{X}_{A1}$; this is the total measure of the tracking error during the motion of the system. With $R = \mathrm{diag}[r_{1}, r_{2}, \dots]$, the second part of the performance index can be expressed as $\int_{0}^{\infty} u_{ia}^{T} R u_{ia} \, dt = \int_{0}^{\infty} \sum_{j} r_{j} u_{j}^{2} \, dt$, where $u_{j}$ is the $j$th component of $u_{ia}$; this is the total measure of the system's energy consumption.

From the above analysis, the physical meaning of the quadratic performance index is to jointly minimize the system's dynamic error and energy consumption during the control process. The optimal control law is $u_{a}^{*} = -R^{-1} B_{A}^{T} P_{A1} \tilde{X}_{A1}$, where $P_{A1}$ is the solution of the Riccati equation:

$$A_{A}^{T} P_{A1} + P_{A1} A_{A} - P_{A1} B_{A} R^{-1} B_{A}^{T} P_{A1} + Q = 0 \tag{14}$$
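The step from the Hamiltonian conditions to the Riccati equation is not spelled out in the text. The following is a sketch of the standard linear-quadratic argument, written for the UAV error dynamics $\dot{\tilde{X}} = A_{A} \tilde{X} + B_{A} u$ and the index (13). The ansatz $\lambda = 2 P \tilde{X}$ with a constant symmetric matrix $P$ (playing the role of $P_{A1}$) is an assumption of this sketch.

```latex
% Hamiltonian for the quadratic index (13) and linear error dynamics
H = \tilde{X}^{T} Q \tilde{X} + u^{T} R u + \lambda^{T} (A_{A} \tilde{X} + B_{A} u)

% Stationarity in u
\frac{\partial H}{\partial u} = 2 R u + B_{A}^{T} \lambda = 0
\;\Longrightarrow\; u = -\tfrac{1}{2} R^{-1} B_{A}^{T} \lambda

% Ansatz \lambda = 2 P \tilde{X} gives the state-feedback form
u^{*} = -R^{-1} B_{A}^{T} P \tilde{X}

% Costate equation, with \dot{\lambda} = 2 P \dot{\tilde{X}}
\dot{\lambda} = -\frac{\partial H}{\partial \tilde{X}} = -2 Q \tilde{X} - A_{A}^{T} \lambda
\;\Longrightarrow\;
2 P (A_{A} - B_{A} R^{-1} B_{A}^{T} P) \tilde{X} = -2 Q \tilde{X} - 2 A_{A}^{T} P \tilde{X}

% This must hold for every \tilde{X}, which yields the algebraic Riccati equation
A_{A}^{T} P + P A_{A} - P B_{A} R^{-1} B_{A}^{T} P + Q = 0
```

The same argument applies unchanged to the USV and UUV models once $(A_{A}, B_{A}, Q, R)$ are replaced by the corresponding state and weighting matrices.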

The control gain matrix of the UAV is $K_{a} = R^{-1} B_{A}^{T} P_{A1}$. The dimension of $K_{a}$ is $3 \times 12$, and it has the form $K_{a} = [k_{a1} \ k_{a2} \ k_{a3} \ k_{a4}] \otimes I_{3}$.

For the USV system, let $\tilde{X}_{S1} = [\tilde{P}_{S1}, \tilde{V}_{S1}]^{T}$. An integral performance index composed of error variables and control variables is constructed as follows:

$$S_{i} = \int_{0}^{\infty} [\tilde{X}_{S1}^{T} G \tilde{X}_{S1} + u_{is}^{T} T u_{is}] \, dt \tag{15}$$

where $G \geq 0$ is a symmetric positive semidefinite matrix of appropriate dimension, and $T > 0$ is a symmetric positive definite matrix of appropriate dimension.

To facilitate engineering application, $G$ and $T$ in the performance index are taken as diagonal matrices. With $G = \mathrm{diag}[g_{1}, g_{2}, \dots]$, the first part of the performance index can be expressed as $\int_{0}^{\infty} \tilde{X}_{S1}^{T} G \tilde{X}_{S1} \, dt = \int_{0}^{\infty} \sum_{j} g_{j} \tilde{x}_{j}^{2} \, dt$, where $\tilde{x}_{j}$ is the $j$th component of $\tilde{X}_{S1}$; this is the total measure of the tracking error during the motion of the system. With $T = \mathrm{diag}[t_{1}, t_{2}, \dots]$, the second part of the performance index can be expressed as $\int_{0}^{\infty} u_{is}^{T} T u_{is} \, dt = \int_{0}^{\infty} \sum_{j} t_{j} u_{j}^{2} \, dt$, where $u_{j}$ is the $j$th component of $u_{is}$; this is the total measure of the system's energy consumption.

From the above analysis, the physical meaning of the quadratic performance index is to jointly minimize the system's dynamic error and energy consumption during the control process. The optimal control law is $u_{s}^{*} = -T^{-1} B_{S}^{T} P_{S1} \tilde{X}_{S1}$, where $P_{S1}$ is the solution of the Riccati equation:

$$A_{S}^{T} P_{S1} + P_{S1} A_{S} - P_{S1} B_{S} T^{-1} B_{S}^{T} P_{S1} + G = 0 \tag{16}$$

The control gain matrix of the USV is $K_{s} = T^{-1} B_{S}^{T} P_{S1}$. The dimension of $K_{s}$ is $2 \times 4$, and it has the form $K_{s} = [k_{s1} \ k_{s2}] \otimes I_{2}$.

For the UUV system, let

{\tilde{X}}_{U 1} = {[{\tilde{P}}_{U 1} {\tilde{V}}_{U 1}]}^{T}

, an integral performance index composed of error variables and control variables is constructed as follows:

w_{i} = \int_{0}^{\infty} [{\tilde{X}}_{U 1}^{T} F {\tilde{X}}_{U 1} + u_{i u}^{T} Y u_{i u}] d t

(17)

where

F \geq 0

is the symmetric non-negative definite matrix of the appropriate dimension, and

Y > 0

is the symmetric positive definite matrix of the appropriate dimension.

In order to facilitate engineering application,

F

and

Y

in the performance index are taken as diagonal matrices. When

F = \mathrm{diag} [f_{1}, f_{2}, \dots]

is taken, the first part of the performance index can be expressed as

\int_{0}^{\infty} [{\tilde{X}}_{U 1}^{T} F {\tilde{X}}_{U 1}] d t = \int_{0}^{\infty} [\sum_{j} f_{j} {\tilde{x}}_{U 1 , j}^{2}] d t

, where

{\tilde{x}}_{U 1 , j}

is the j-th component of

{\tilde{X}}_{U 1}

; this is the total measure of the tracking error during the motion of the system. When

Y = \mathrm{diag} [y_{1}, y_{2}, \dots]

is taken, the second part of the performance index can be expressed as

\int_{0}^{\infty} [u_{i u}^{T} Y u_{i u}] d t = \int_{0}^{\infty} [\sum_{j} y_{j} u_{i u , j}^{2}] d t

, which is the total measure of the system’s energy consumption.

Through the aforementioned analysis, the physical meaning of the quadratic performance index is to make the system’s dynamic error and energy consumption in the control process optimal. The optimal control law is as follows:

u_{u}^{*} = - Y^{- 1} {B_{U}}^{T} P_{U 1} {\tilde{X}}_{U 1}

, where

P_{U 1}

is the solution of the Riccati equation:

A_{U}^{T} P_{U 1} + P_{U 1} A_{U} - P_{U 1} B_{U} Y^{- 1} B_{U}^{T} P_{U 1} + F = 0

(18)

The control parameter equation of UUV is:

K_{u} = Y^{- 1} {B_{U}}^{T} P_{U 1}

, the dimension of

K_{u}

is 3 × 6, and it also has the following form

K_{u} = [\begin{matrix} k_{u 1} & k_{u 2} \end{matrix}] \otimes I_{3}

.
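The gain structure above can be checked numerically. The following Python sketch is a minimal illustration, not the authors' MATLAB code. It assumes that each axis of the USV and UUV error dynamics is a double integrator (position error driven by velocity error, velocity error driven by the control input), as the USV and UUV blocks of the system matrix in Equation (27) suggest, and it takes the scalar weights from Section 4. The helper names `double_integrator_lqr` and `riccati_residual` are hypothetical. Under these assumptions, Riccati Equations (16) and (18) can be solved in closed form for a single axis.

```python
import math

def double_integrator_lqr(q, r):
    """Per-axis LQR for x1' = x2, x2' = u with cost integral of q*(x1^2 + x2^2) + r*u^2.

    Returns (k1, k2, P), where u = -(k1*x1 + k2*x2) and P is the 2x2 Riccati solution.
    """
    # Entries of the symmetric matrix P = [[p1, p2], [p2, p3]] that solve
    # A^T P + P A - P B r^{-1} B^T P + Q = 0 with A = [[0, 1], [0, 0]], B = [0, 1]^T.
    p2 = math.sqrt(q * r)                # (1,1) entry: q - p2^2 / r = 0
    p3 = math.sqrt(r * (2.0 * p2 + q))   # (2,2) entry: 2*p2 + q - p3^2 / r = 0
    p1 = p2 * p3 / r                     # (1,2) entry: p1 - p2*p3 / r = 0
    k1, k2 = p2 / r, p3 / r              # K = r^{-1} B^T P = [p2, p3] / r
    return k1, k2, [[p1, p2], [p2, p3]]

def riccati_residual(P, q, r):
    """Largest absolute entry of the Riccati residual for the double integrator."""
    (p1, p2), (_, p3) = P
    entries = [q - p2 * p2 / r, p1 - p2 * p3 / r, 2.0 * p2 + q - p3 * p3 / r]
    return max(abs(e) for e in entries)

# Weights from Section 4: G = 1*I, T = 5*I (USV) and F = 2*I, Y = 4*I (UUV).
ks1, ks2, P_s = double_integrator_lqr(q=1.0, r=5.0)
ku1, ku2, P_u = double_integrator_lqr(q=2.0, r=4.0)

print(round(ks1, 4), round(ks2, 4))  # 0.4472 1.0461
print(round(ku1, 4), round(ku2, 4))  # 0.7071 1.3836
```

Under these assumptions, the computed gains agree to four decimal places with the values of k_{s1}, k_{s2}, k_{u1}, and k_{u2} listed in Section 4.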

Theorem 3.

For UAV state Equation (2), the optimal control law corresponding to performance indicator Equation (13) is

u_{a}^{*}

.

Proof of Theorem 3.

Suppose

u_{a}^{*}

is the optimal control satisfying the performance index Equation (13), then the minimum principle must be satisfied, and the Hamilton function is constructed according to Equation (13):

H = \frac{1}{2} {\tilde{X}}_{A 1}^{T} Q {\tilde{X}}_{A 1} + \frac{1}{2} u_{i a}^{T} R u_{i a} + λ^{T} A_{A} {\tilde{X}}_{A 1} + λ^{T} B_{A} u_{i a}

(19)

where

λ

is the costate (adjoint) vector. Since

u_{a}^{*}

is unconstrained, the minimum condition requires Hamilton Function (19) to attain an unconstrained minimum with respect to the control input

u_{i a}

. Computing the derivative of the Hamilton function with respect to the control input

u_{i a}

gives

\frac{\partial H}{\partial u_{i a}} = R u_{i a} + {B_{A}}^{T} λ

. Setting

\frac{\partial H}{\partial u_{i a}} = 0

yields:

u_{i a} = - R^{- 1} {B_{A}}^{T} λ

(20)

Since

\frac{\partial^{2} H}{\partial u_{i a}^{2}} = R > 0

, the control in Formula (20) minimizes Hamilton Function (19); that is, it is the optimal control.

According to the canonical equations:

\{\begin{matrix} {\dot{\tilde{X}}}_{A 1} = \frac{\partial H}{\partial λ} = A_{A} {\tilde{X}}_{A 1} + B_{A} u_{i a} \\ \dot{λ} = - \frac{\partial H}{\partial {\tilde{X}}_{A 1}} = - Q {\tilde{X}}_{A 1} - {A_{A}}^{T} λ \end{matrix}

(21)

Let

λ = P_{A 1} {\tilde{X}}_{A 1}

, where the constant matrix

P_{A 1}

is to be determined. Then

\dot{λ} = P_{A 1} {\dot{\tilde{X}}}_{A 1}

, and substituting this into the second line of Equation (21) gives:

P_{A 1} {\dot{\tilde{X}}}_{A 1} = - Q {\tilde{X}}_{A 1} - {A_{A}}^{T} P_{A 1} {\tilde{X}}_{A 1}

(22)

Substituting the state equation in Equation (21) into Equation (22), we obtain:

P_{A 1} A_{A} {\tilde{X}}_{A 1} + P_{A 1} B_{A} u_{i a} = - Q {\tilde{X}}_{A 1} - {A_{A}}^{T} P_{A 1} {\tilde{X}}_{A 1}

(23)

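Substituting Equation (20) and the assumed form of the costate into Equation (23), and requiring the result to hold for every error state, gives the Riccati equation: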

{A_{A}}^{T} P_{A 1} + P_{A 1} A_{A} - P_{A 1} B_{A} R^{- 1} {B_{A}}^{T} P_{A 1} + Q = 0

(24)

Let

Q > 0

,

R > 0

; then the solution

P_{A 1}

is positive definite. Select the Lyapunov function:

V ({\tilde{X}}_{A 1}) = {\tilde{X}}_{A 1}^{T} P_{A 1} {\tilde{X}}_{A 1} \geq 0

.

Differentiating

V ({\tilde{X}}_{A 1})

along the closed-loop trajectories and using Equations (20) and (24) gives:

\dot{V} ({\tilde{X}}_{A 1}) = {\dot{\tilde{X}}}_{A 1}^{T} P_{A 1} {\tilde{X}}_{A 1} + {\tilde{X}}_{A 1}^{T} P_{A 1} {\dot{\tilde{X}}}_{A 1} = - {\tilde{X}}_{A 1}^{T} (P_{A 1} B_{A} R^{- 1} {B_{A}}^{T} P_{A 1} + Q) {\tilde{X}}_{A 1}

. Because

Q > 0

,

R > 0

, it follows that

(P_{A 1} B_{A} R^{- 1} {B_{A}}^{T} P_{A 1} + Q) > 0

, so

\dot{V} ({\tilde{X}}_{A 1}) \leq 0

, with equality only at the origin. According to the Lyapunov stability theorem, the system with optimal control

u_{a}^{*}

is asymptotically stable. □
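The stability conclusion can be spot-checked for the UAV gains reported in Section 4. The sketch below is an illustration under stated assumptions, not part of the original proof. It assumes that each UAV axis is a fourth-order integrator chain (as the UAV block of the system matrix in Equation (27) suggests), so that the closed-loop characteristic polynomial per axis is s^4 + k_{a4} s^3 + k_{a3} s^2 + k_{a2} s + k_{a1}. It then applies the Routh-Hurwitz conditions for a quartic polynomial. The function name `quartic_is_hurwitz` is hypothetical.

```python
def quartic_is_hurwitz(a3, a2, a1, a0):
    """Routh-Hurwitz test for s^4 + a3 s^3 + a2 s^2 + a1 s + a0.

    Returns True when every root has a strictly negative real part.
    """
    all_positive = min(a3, a2, a1, a0) > 0.0
    # Leading principal minors of the Hurwitz matrix of order 2 and 3.
    # The order-1 minor is a3, and the order-4 minor is a0 times the order-3 minor.
    minor2 = a3 * a2 - a1
    minor3 = a1 * minor2 - a3 * a3 * a0
    return all_positive and minor2 > 0.0 and minor3 > 0.0

# UAV gains reported in Section 4 (k_a1 weights the position error,
# k_a4 weights the attitude-angle-rate term).
k_a1, k_a2, k_a3, k_a4 = 2.5495, 6.9756, 8.2681, 4.7996

# Assumed per-axis closed loop:
# x1' = x2, x2' = x3, x3' = x4, x4' = -(k_a1*x1 + k_a2*x2 + k_a3*x3 + k_a4*x4)
uav_stable = quartic_is_hurwitz(k_a4, k_a3, k_a2, k_a1)
print(uav_stable)  # True
```

A polynomial with roots on or to the right of the imaginary axis, such as s^4 + s^3 + s^2 + s + 1, fails the same test.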

3.3. Cooperative Control

Since the agent models in a heterogeneous system differ, the key to collaboration is to find the part that the models have in common. The heterogeneous system studied consists of unmanned aerial vehicles (UAVs), unmanned surface vehicles (USVs), and unmanned underwater vehicles (UUVs). The UAV state contains position, velocity, attitude angle, and attitude angle rate; the USV and UUV states contain position and velocity only. Position and velocity are therefore the common states of the three agent types. However, UAVs and UUVs move in three-dimensional space, whereas USVs move in two-dimensional space, so position and velocity vectors must be converted between dimensions. The transformation matrices

m_{S A}

,

m_{A S}

,

m_{S U}

,

m_{U S}

are proposed, where

m_{A S} = [\begin{matrix} 1 & 0 \\ 0 & 1 \\ 0 & 0 \end{matrix}], m_{S A} = [\begin{matrix} 1 & 0 & 0 \\ 0 & 1 & 0 \end{matrix}], m_{U S} = [\begin{matrix} 1 & 0 \\ 0 & 1 \\ 0 & 0 \end{matrix}], m_{S U} = [\begin{matrix} 1 & 0 & 0 \\ 0 & 1 & 0 \end{matrix}]
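As an illustration of how these matrices act, the following Python sketch applies them to sample vectors. The helper `matvec` is hypothetical. The sample vectors are the initial positions of USV1 and UAV1 from Tables 1 and 2 and are used here only as convenient numbers, whereas the protocols below apply the matrices to error states.

```python
def matvec(M, x):
    """Multiply a matrix (list of rows) by a vector (list of numbers)."""
    return [sum(m_ij * x_j for m_ij, x_j in zip(row, x)) for row in M]

# m_AS and m_US lift a planar USV vector into three dimensions (zero third
# component); m_SA and m_SU project a UAV or UUV vector onto the plane.
m_AS = m_US = [[1, 0], [0, 1], [0, 0]]
m_SA = m_SU = [[1, 0, 0], [0, 1, 0]]

usv_vector = [90, 50]      # initial position of USV1 (Table 2)
uav_vector = [30, 50, 50]  # initial position of UAV1 (Table 1)

lifted = matvec(m_AS, usv_vector)      # USV data as seen by a UAV neighbor
projected = matvec(m_SA, uav_vector)   # UAV data as seen by a USV neighbor
print(lifted, projected)  # [90, 50, 0] [30, 50]
```

Projecting after lifting returns the original planar vector, so no USV information is lost in the exchange, whereas projecting a UAV or UUV vector discards its vertical component.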

3.4. Collaborative Optimal Control for Mixed-Order Heterogeneous Systems

Collaborative optimal control of the system is obtained by changing only the gain parameters, without altering the structure of the formation control protocol, so the protocol remains distributed.

|N_{A i}|

represents the number of neighbors of agent

i

in the UAVs,

|N_{S i}|

represents the number of neighbors of agent

i

in the USVs, and

|N_{U i}|

represents the number of neighbors of agent

i

in the UUVs. Introducing optimal control theory and cooperative control into the static formation control protocol gives the cooperative optimal static formation control protocol:

u_{i a} = \frac{k_{a 1}}{|N_{A i}|} \sum_{j \in N_{A i}} a_{i j} ({\tilde{p}}_{j} - {\tilde{p}}_{i}) + \frac{k_{a 1}}{|N_{S i}|} \sum_{j \in N_{S i}} a_{i j} (m_{A S} {\tilde{p}}_{j} - {\tilde{p}}_{i}) - k_{a 2} {\tilde{v}}_{i} - k_{a 3} Ω_{i} - k_{a 4} {\dot{Ω}}_{i}

u_{i s} = \frac{k_{s 1}}{|N_{S i}|} \sum_{j \in N_{S i}} a_{i j} ({\tilde{p}}_{j} - {\tilde{p}}_{i}) + \frac{k_{s 1}}{|N_{A i}|} \sum_{j \in N_{A i}} a_{i j} (m_{S A} {\tilde{p}}_{j} - {\tilde{p}}_{i}) + \frac{k_{s 1}}{|N_{U i}|} \sum_{j \in N_{U i}} a_{i j} (m_{S U} {\tilde{p}}_{j} - {\tilde{p}}_{i}) - k_{s 2} {\tilde{v}}_{i}

u_{i u} = \frac{k_{u 1}}{|N_{U i}|} \sum_{j \in N_{U i}} a_{i j} ({\tilde{p}}_{j} - {\tilde{p}}_{i}) + \frac{k_{u 1}}{|N_{S i}|} \sum_{j \in N_{S i}} a_{i j} (m_{U S} {\tilde{p}}_{j} - {\tilde{p}}_{i}) - k_{u 2} {\tilde{v}}_{i}

(25)

Cooperative optimal dynamic formation control protocol:

u_{i a} = \frac{k_{a 1}}{|N_{A i}|} \sum_{j \in N_{A i}} a_{i j} ({\tilde{p}}_{j} - {\tilde{p}}_{i}) + \frac{k_{a 1}}{|N_{S i}|} \sum_{j \in N_{S i}} a_{i j} (m_{A S} {\tilde{p}}_{j} - {\tilde{p}}_{i}) + \frac{k_{a 2}}{|N_{A i}|} \sum_{j \in N_{A i}} a_{i j} ({\tilde{v}}_{j} - {\tilde{v}}_{i}) + \frac{k_{a 2}}{|N_{S i}|} \sum_{j \in N_{S i}} a_{i j} (m_{A S} {\tilde{v}}_{j} - {\tilde{v}}_{i}) - k_{a 3} Ω_{i} - k_{a 4} {\dot{Ω}}_{i}

u_{i s} = \frac{k_{s 1}}{|N_{S i}|} \sum_{j \in N_{S i}} a_{i j} ({\tilde{p}}_{j} - {\tilde{p}}_{i}) + \frac{k_{s 1}}{|N_{A i}|} \sum_{j \in N_{A i}} a_{i j} (m_{S A} {\tilde{p}}_{j} - {\tilde{p}}_{i}) + \frac{k_{s 1}}{|N_{U i}|} \sum_{j \in N_{U i}} a_{i j} (m_{S U} {\tilde{p}}_{j} - {\tilde{p}}_{i}) + \frac{k_{s 2}}{|N_{S i}|} \sum_{j \in N_{S i}} a_{i j} ({\tilde{v}}_{j} - {\tilde{v}}_{i}) + \frac{k_{s 2}}{|N_{A i}|} \sum_{j \in N_{A i}} a_{i j} (m_{S A} {\tilde{v}}_{j} - {\tilde{v}}_{i}) + \frac{k_{s 2}}{|N_{U i}|} \sum_{j \in N_{U i}} a_{i j} (m_{S U} {\tilde{v}}_{j} - {\tilde{v}}_{i})

u_{i u} = \frac{k_{u 1}}{|N_{U i}|} \sum_{j \in N_{U i}} a_{i j} ({\tilde{p}}_{j} - {\tilde{p}}_{i}) + \frac{k_{u 1}}{|N_{S i}|} \sum_{j \in N_{S i}} a_{i j} (m_{U S} {\tilde{p}}_{j} - {\tilde{p}}_{i}) + \frac{k_{u 2}}{|N_{U i}|} \sum_{j \in N_{U i}} a_{i j} ({\tilde{v}}_{j} - {\tilde{v}}_{i}) + \frac{k_{u 2}}{|N_{S i}|} \sum_{j \in N_{S i}} a_{i j} (m_{U S} {\tilde{v}}_{j} - {\tilde{v}}_{i})

(26)

By adopting the above cooperative optimal control protocol, the system’s communication topology is not required to be a complete graph but only a connected graph. The above cooperative optimal control protocol takes into account the advantages of optimal control and cooperative control, which do not affect the cooperative control of the multi-agent system and can realize different kinds of task requirements according to the set performance indicators.
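To make the structure of the static protocol (25) concrete, the following Python sketch evaluates the USV control input for one agent. It is a minimal illustration under stated assumptions: the adjacency weights a_{ij} are taken as 1 for every neighbor, each neighbor class is normalized by its own neighbor count, and a class with no neighbors contributes zero. The function names `neighbor_term` and `usv_static_control` and the sample numbers are hypothetical.

```python
def matvec(M, x):
    """Multiply a matrix (list of rows) by a vector (list of numbers)."""
    return [sum(a * b for a, b in zip(row, x)) for row in M]

def neighbor_term(gain, p_i, neighbor_errors, transform=None):
    """(gain / |N|) * sum_j (T p_j - p_i) over one neighbor class, with a_ij = 1."""
    if not neighbor_errors:
        return [0.0] * len(p_i)  # no neighbors of this class: the term vanishes
    total = [0.0] * len(p_i)
    for p_j in neighbor_errors:
        if transform is not None:
            p_j = matvec(transform, p_j)  # bring the neighbor into 2-D
        total = [t + (pj - pi) for t, pj, pi in zip(total, p_j, p_i)]
    return [gain * t / len(neighbor_errors) for t in total]

def usv_static_control(p_i, v_i, usv_nbrs, uav_nbrs, uuv_nbrs, k_s1, k_s2):
    """u_is of protocol (25) for one USV, expressed in 2-D error coordinates."""
    m_SA = m_SU = [[1, 0, 0], [0, 1, 0]]
    terms = [
        neighbor_term(k_s1, p_i, usv_nbrs),        # same-type neighbors
        neighbor_term(k_s1, p_i, uav_nbrs, m_SA),  # UAV neighbors, projected
        neighbor_term(k_s1, p_i, uuv_nbrs, m_SU),  # UUV neighbors, projected
    ]
    return [sum(t[d] for t in terms) - k_s2 * v_i[d] for d in range(2)]

# Hypothetical error states for one USV and its neighbors.
u = usv_static_control(
    p_i=[1.0, 0.0], v_i=[0.5, 0.0],
    usv_nbrs=[[0.0, 0.0], [2.0, 2.0]],
    uav_nbrs=[[3.0, 0.0, 5.0]],
    uuv_nbrs=[[1.0, 1.0, -4.0]],
    k_s1=0.4472, k_s2=1.0461,
)
print([round(x, 5) for x in u])  # [0.37135, 0.8944]
```

Only neighbor information enters each term, which is why the protocol stays distributed when the gains are replaced by their optimal values.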

Theorem 4.

For the formation equation of state (10), when the communication topology G = (V, E, A) is a connected undirected graph or a directed graph containing a spanning tree, the static formation control protocol augmented with optimal control theory and cooperative control achieves optimal cooperative formation, and the system state variables converge.

Proof of Theorem 4.

As mentioned above, the UAVs, USVs, and UUVs are combined into a single system whose system matrix is square, and it can take the following form:

\dot{\tilde{X}} = \bar{A} \tilde{X} + \bar{B} U

(27)

where

\bar{A} = (\begin{matrix} 0 & I & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & I & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & I & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & I & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & I \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \end{matrix})

\bar{B} = (\begin{matrix} 0 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \\ I & 0 & 0 \\ 0 & 0 & 0 \\ 0 & I & 0 \\ 0 & 0 & 0 \\ 0 & 0 & I \end{matrix})

The matrix is constructed according to the PBH criterion as follows:

(\begin{matrix} S I - \bar{A} & \bar{B} \end{matrix}) = (\begin{matrix} S & - I & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & S & - I & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & S & - I & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & S & 0 & 0 & 0 & 0 & I & 0 & 0 \\ 0 & 0 & 0 & 0 & S & - I & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & S & 0 & 0 & 0 & I & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & S & - I & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & S & 0 & 0 & I \end{matrix})

From matrix

\bar{A}

, its eigenvalues are obtained as:

λ_{1} = λ_{2} = \dots = λ_{8} = 0

. Letting

S = λ_{1} = λ_{2} = \dots = λ_{8} = 0

, the rank of the matrix is

\mathrm{Rank} (\begin{matrix} S I - \bar{A} & \bar{B} \end{matrix}) = 8

, which is full row rank, so according to the PBH criterion, the heterogeneous system is controllable.
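The rank condition can be reproduced with a few lines of Python. The sketch below is a simplified illustration: every identity block I in Equation (27) is replaced by the scalar 1, so the rank is counted in blocks as in the text, and the rank is computed by exact Gaussian elimination. The helper name `rank` is hypothetical.

```python
from fractions import Fraction

def rank(matrix):
    """Rank of a matrix (list of rows) by Gaussian elimination over rationals."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    found, n_cols = 0, len(rows[0])
    for col in range(n_cols):
        pivot = next((r for r in range(found, len(rows)) if rows[r][col] != 0), None)
        if pivot is None:
            continue  # no pivot in this column
        rows[found], rows[pivot] = rows[pivot], rows[found]
        for r in range(len(rows)):
            if r != found and rows[r][col] != 0:
                factor = rows[r][col] / rows[found][col]
                rows[r] = [a - factor * b for a, b in zip(rows[r], rows[found])]
        found += 1
        if found == len(rows):
            break
    return found

# Block pattern of A-bar and B-bar from Equation (27), with I replaced by 1.
n = 8
A = [[0] * n for _ in range(n)]
for i, j in [(0, 1), (1, 2), (2, 3), (4, 5), (6, 7)]:
    A[i][j] = 1
B = [[0] * 3 for _ in range(n)]
for i, j in [(3, 0), (5, 1), (7, 2)]:
    B[i][j] = 1

s = 0  # the only eigenvalue of A-bar
pbh = [[(s if i == j else 0) - A[i][j] for j in range(n)] + B[i] for i in range(n)]
print(rank(pbh))  # 8
```

Without the input columns, the same matrix has rank 5 at S = 0, so the three input channels are exactly what restores full row rank.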

The cooperative optimal static formation control protocol is expressed in matrix form as follows:

(\begin{matrix} U_{A} \\ U_{S} \\ U_{U} \end{matrix}) = (\begin{matrix} U_{a 1} \otimes I_{3} & U_{a 2} \otimes m_{A S} & U_{a 3} \otimes I_{3} \\ U_{s 1} \otimes m_{S A} & U_{s 2} \otimes I_{2} & U_{s 3} \otimes m_{S U} \\ U_{u 1} \otimes I_{3} & U_{u 2} \otimes m_{U S} & U_{u 3} \otimes I_{3} \end{matrix}) \times \tilde{X}

where

(\begin{matrix} U_{a 1} & U_{a 2} & U_{a 3} \\ U_{s 1} & U_{s 2} & U_{s 3} \\ U_{u 1} & U_{u 2} & U_{u 3} \end{matrix}) = (\begin{matrix} \frac{k_{a 1}}{| N_{A i} |} L_{A A} & - k_{a 2} I_{m} & - k_{a 3} I_{m} & - k_{a 4} I_{m} & \frac{k_{a 1}}{| N_{S i} |} L_{A S} & 0 & 0 & 0 \\ \frac{k_{s 1}}{| N_{A i} |} L_{S A} & 0 & 0 & 0 & \frac{k_{s 1}}{| N_{S i} |} L_{S S} & - k_{s 2} I_{m} & \frac{k_{s 1}}{| N_{U i} |} L_{S U} & 0 \\ 0 & 0 & 0 & 0 & \frac{k_{u 1}}{| N_{S i} |} L_{U S} & 0 & \frac{k_{u 1}}{| N_{U i} |} L_{U U} & - k_{u 2} I_{m} \end{matrix})

Substituting

(\begin{matrix} U_{a 1} & U_{a 2} & U_{a 3} \\ U_{s 1} & U_{s 2} & U_{s 3} \\ U_{u 1} & U_{u 2} & U_{u 3} \end{matrix}) \times \tilde{X}

into Equation (27), we can get:

\dot{\tilde{X}} = \bar{T} \tilde{X}

(28)

where

\bar{T} = (\begin{matrix} 0 & I & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & I & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & I & 0 & 0 & 0 & 0 \\ \frac{k_{a 1}}{| N_{A i} |} L_{A A} & - k_{a 2} I & - k_{a 3} I & - k_{a 4} I & \frac{k_{a 1}}{| N_{S i} |} L_{A S} & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & I & 0 & 0 \\ \frac{k_{s 1}}{| N_{A i} |} L_{S A} & 0 & 0 & 0 & \frac{k_{s 1}}{| N_{S i} |} L_{S S} & - k_{s 2} I & \frac{k_{s 1}}{| N_{U i} |} L_{S U} & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & I \\ 0 & 0 & 0 & 0 & \frac{k_{u 1}}{| N_{S i} |} L_{U S} & 0 & \frac{k_{u 1}}{| N_{U i} |} L_{U U} & - k_{u 2} I \end{matrix})

By applying elementary row and column transformations, this matrix becomes:

Λ = (\begin{matrix} - I & I & 0 & 0 & 0 & 0 & 0 & 0 \\ - I & 0 & I & 0 & 0 & 0 & 0 & 0 \\ - 2 I & 0 & I & I & 0 & 0 & 0 & 0 \\ r_{1} & r_{2} & - k_{a 3} I & - k_{a 4} I & \frac{k_{a 1}}{| N_{S i} |} L_{A S} & 0 & 0 & 0 \\ - I & 0 & 0 & 0 & 0 & I & 0 & 0 \\ r_{3} & I & 0 & 0 & \frac{k_{s 1}}{| N_{S i} |} L_{S S} & - k_{s 2} I & \frac{k_{s 1}}{| N_{U i} |} L_{S U} & 0 \\ - I & 0 & 0 & 0 & 0 & 0 & 0 & I \\ r_{4} & I & 0 & 0 & \frac{k_{u 1}}{| N_{S i} |} L_{U S} & 0 & \frac{k_{u 1}}{| N_{U i} |} L_{U U} & - k_{u 2} I \end{matrix})

where

r_{1} = k_{a 2} I + k_{a 4} I + \frac{k_{a 1}}{|N_{A i}|} L_{A A} - I

r_{2} = k_{a 3} I - k_{a 2} I + I

r_{3} = k_{s 2} I - I

r_{4} = k_{u 2} I - I

λ I - Λ = (\begin{matrix} λ I + I & - I & 0 & 0 & 0 & 0 & 0 & 0 \\ I & λ I & - I & 0 & 0 & 0 & 0 & 0 \\ 2 I & 0 & λ I - I & - I & 0 & 0 & 0 & 0 \\ - r_{1} & - r_{2} & k_{a 3} I & λ I + k_{a 4} I & - \frac{k_{a 1}}{| N_{S i} |} L_{A S} & 0 & 0 & 0 \\ I & 0 & 0 & 0 & λ I & - I & 0 & 0 \\ - r_{3} & - I & 0 & 0 & - \frac{k_{s 1}}{| N_{S i} |} L_{S S} & λ I + k_{s 2} I & - \frac{k_{s 1}}{| N_{U i} |} L_{S U} & 0 \\ I & 0 & 0 & 0 & 0 & 0 & λ I & - I \\ - r_{4} & - I & 0 & 0 & - \frac{k_{u 1}}{| N_{S i} |} L_{U S} & 0 & - \frac{k_{u 1}}{| N_{U i} |} L_{U U} & λ I + k_{u 2} I \end{matrix})

From the matrix transformation described above, it can be inferred that

λ I - Λ ≅ λ I - \bar{T}

, and the matrix

Λ

is similar to the matrix

\bar{T}

. A non-singular transformation matrix

\bar{Q}

exists, resulting in

Λ = \bar{Q} \bar{T} {\bar{Q}}^{- 1}

,

\dot{\tilde{X}} = Λ \tilde{X}

, and

Λ

is the matrix where the sum of each row is zero.

As a result, at least one eigenvalue is zero. Elementary column and row transformations can be performed on

\bar{T}

:

\bar{T} = (\begin{matrix} I & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & I & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & I & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & I & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & I \\ 0 & 0 & \frac{k_{a 1}}{| N_{A i} |} L_{A A} & \frac{k_{a 1}}{| N_{S i} |} L_{A S} & 0 & 0 & 0 & 0 \\ 0 & 0 & \frac{k_{s 1}}{| N_{A i} |} L_{S A} & \frac{k_{s 1}}{| N_{S i} |} L_{S S} & \frac{k_{s 1}}{| N_{U i} |} L_{S U} & 0 & 0 & 0 \\ 0 & 0 & 0 & \frac{k_{u 1}}{| N_{S i} |} L_{U S} & \frac{k_{u 1}}{| N_{U i} |} L_{U U} & 0 & 0 & 0 \end{matrix}) = (\begin{matrix} I & 0 & 0 \\ 0 & 0 & I \\ 0 & k L & 0 \end{matrix}) = E

If

\mathrm{Rank} (L) = N - 1

,

\mathrm{Rank} (\begin{matrix} I_{1} \\ I_{2} \end{matrix}) = r

, then:

\mathrm{Rank} (\bar{T}) = \mathrm{Rank} (Λ) = \mathrm{Rank} (E) = r + N - 1

. Combined with the proof of Theorem 1,

\bar{T}

can be converted to Jordan standard form:

\bar{T} = P J P^{- 1}

Let

P_{d l}^{T}

be the first row of

P^{- 1}

and the left eigenvector of zero eigenvalue, and

P_{d r}

be the first column of

P

and the right eigenvector of zero eigenvalue. Therefore,

P_{d l}^{T} P_{d r} = 1

. The system state as time reaches infinity is:

\lim_{t \to \infty} \tilde{X} (t) = \lim_{t \to \infty} e^{\bar{T} t} \tilde{X} (0) = (P_{d r} P_{d l}^{T}) \tilde{X} (0)

By Lemma 1, the system (27) converges and the error vector tends to zero as time tends to infinity, which completes the cooperative optimal formation. □

4. Simulation

The simulated system comprises three UAVs, three USVs, and three UUVs. The Laplacian matrix of the communication topology is as follows:

L = (\begin{matrix} 3 & - 1 & - 1 & - 1 & 0 & 0 & 0 & 0 & 0 \\ - 1 & 3 & - 1 & 0 & - 1 & 0 & 0 & 0 & 0 \\ - 1 & - 1 & 3 & 0 & 0 & - 1 & 0 & 0 & 0 \\ - 1 & 0 & 0 & 4 & - 1 & - 1 & - 1 & 0 & 0 \\ 0 & - 1 & 0 & - 1 & 4 & - 1 & 0 & - 1 & 0 \\ 0 & 0 & - 1 & - 1 & - 1 & 4 & 0 & 0 & - 1 \\ 0 & 0 & 0 & - 1 & 0 & 0 & 3 & - 1 & - 1 \\ 0 & 0 & 0 & 0 & - 1 & 0 & - 1 & 3 & - 1 \\ 0 & 0 & 0 & 0 & 0 & - 1 & - 1 & - 1 & 3 \end{matrix}) = (\begin{matrix} L_{A A} & L_{A S} & 0 \\ L_{S A} & L_{S S} & L_{S U} \\ 0 & L_{U S} & L_{U U} \end{matrix})
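The properties of this Laplacian that the theorems rely on can be verified directly. The Python sketch below is a minimal check, with the agent ordering UAV1 to UAV3, USV1 to USV3, UUV1 to UUV3 assumed from the block partition above. The helper name `is_connected` is hypothetical.

```python
L = [
    [ 3, -1, -1, -1,  0,  0,  0,  0,  0],
    [-1,  3, -1,  0, -1,  0,  0,  0,  0],
    [-1, -1,  3,  0,  0, -1,  0,  0,  0],
    [-1,  0,  0,  4, -1, -1, -1,  0,  0],
    [ 0, -1,  0, -1,  4, -1,  0, -1,  0],
    [ 0,  0, -1, -1, -1,  4,  0,  0, -1],
    [ 0,  0,  0, -1,  0,  0,  3, -1, -1],
    [ 0,  0,  0,  0, -1,  0, -1,  3, -1],
    [ 0,  0,  0,  0,  0, -1, -1, -1,  3],
]
n = len(L)

row_sums = [sum(row) for row in L]  # zero row sums
symmetric = all(L[i][j] == L[j][i] for i in range(n) for j in range(n))
degrees = [L[i][i] for i in range(n)]  # number of neighbors of each agent

def is_connected(lap):
    """Depth-first search on the graph whose edges are the nonzero off-diagonals."""
    seen, stack = {0}, [0]
    while stack:
        i = stack.pop()
        for j, w in enumerate(lap[i]):
            if j != i and w != 0 and j not in seen:
                seen.add(j)
                stack.append(j)
    return len(seen) == len(lap)

# Off-diagonal blocks of the partition: UAV-USV coupling and UAV-UUV coupling.
L_AS = [row[3:6] for row in L[0:3]]
L_AU = [row[6:9] for row in L[0:3]]

print(row_sums == [0] * n, symmetric, is_connected(L))  # True True True
print(degrees)  # [3, 3, 3, 4, 4, 4, 3, 3, 3]
```

The graph is undirected and connected but not complete: the UAV-UUV block is zero, so UAVs and UUVs exchange information only through the USVs.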

Table 1 lists the system state variables of the UAVs, and Table 2 lists the state variables of the USVs and UUVs systems. The control parameters of the formation control protocol are:

α = 0.2, β = 1.5, γ_{1} = 5, γ_{2} = 2

. The control parameters of cooperative optimal formation control protocol are:

Q = 13 * I_{12}, R = 2 * I_{3}, G = 1 * I_{4}, T = 5 * I_{2}, F = 2 * I_{6}, Y = 4 * I_{3}, k_{a 1} = 2.5495, k_{a 2} = 6.9756, k_{a 3} = 8.2681, k_{a 4} = 4.7996, k_{s 1} = 0.4472, k_{s 2} = 1.0461, k_{u 1} = 0.7071, k_{u 2} = 1.3836

.
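The reported UAV gains can be cross-checked against the weights Q and R without solving Riccati Equation (24). The Python sketch below uses a different technique from the one in the text, namely the return-difference (spectral factorization) identity for single-input linear quadratic regulators. It assumes that each UAV axis is a fourth-order integrator chain with scalar weights q = 13 and r = 2. Under that assumption the closed-loop polynomial p(s) = s^4 + k_{a4} s^3 + k_{a3} s^2 + k_{a2} s + k_{a1} must satisfy p(s) p(-s) = s^8 + (q/r)(1 - s^2 + s^4 - s^6). Variable names are hypothetical.

```python
# UAV gains and weights as listed above (per axis: Q = q*I, R = r).
k1, k2, k3, k4 = 2.5495, 6.9756, 8.2681, 4.7996
q, r = 13.0, 2.0

# Coefficients of p(s) * p(-s), which is an even polynomial in s:
# (s^4 + k3 s^2 + k1)^2 - (k4 s^3 + k2 s)^2
coeffs = {
    "s^6": 2.0 * k3 - k4 ** 2,
    "s^4": k3 ** 2 + 2.0 * k1 - 2.0 * k2 * k4,
    "s^2": 2.0 * k1 * k3 - k2 ** 2,
    "s^0": k1 ** 2,
}

# Coefficients predicted by the identity for an integrator chain.
expected = {"s^6": -q / r, "s^4": q / r, "s^2": -q / r, "s^0": q / r}

max_error = max(abs(coeffs[name] - expected[name]) for name in coeffs)
print(max_error < 5e-3)  # True
```

The residual is of the order of the four-decimal rounding of the gains, so under the stated assumption the listed UAV gains are consistent with the chosen Q and R.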

4.1. Simulation of Static and Dynamic Formation Control Protocol

Figure 3, Figure 4 and Figure 5 show the simulation under the static formation control protocol. Figure 3 shows the actual position trajectory of each agent; each agent finally completes the triangular formation.

Figure 4 and Figure 5 show the convergence of each agent’s state variables. Figure 4 shows that the position states converge after 30 s, and Figure 5 shows that the velocity states reach consensus after 30 s.

Figure 4 and Figure 5 also show that, as time increases, the velocities of all agents become equal and the positions converge gradually. Since the velocity under the static formation control protocol is 0, the slope of the position curves is 0.

Figure 6, Figure 7 and Figure 8 show the simulation under the dynamic formation control protocol. Figure 6 shows the actual position trajectory of each agent; each agent finally completes the triangular formation.

Figure 7 and Figure 8 show the convergence of each agent’s state variables. Figure 7 shows that the position states converge after 25 s, and Figure 8 shows that the velocity states reach consensus after 25 s.

Figure 7 and Figure 8 also show that, as time increases, the velocities of all agents become equal and the positions converge gradually. Since the velocity under the dynamic formation control protocol is not zero and velocity is the time derivative of position, the positions keep changing.

4.2. Simulation of Cooperative Optimum Formation Control Protocol

Figure 9, Figure 10 and Figure 11 show the simulation after introducing optimal control and cooperative control into the static formation control protocol. Figure 9 shows the actual position trajectory of each agent; each agent completes the cooperative optimum triangular formation.

Figure 10 and Figure 11 show the convergence of each agent’s state variables. Figure 10 shows that position-state convergence is achieved after 20 s, and Figure 11 shows that velocity-state consensus is achieved after 20 s. Comparing Figure 10 and Figure 11 with Figure 4 and Figure 5, the system reaches the expected values and completes the formation more quickly (20 s instead of 30 s).

Figure 12, Figure 13 and Figure 14 show the simulation after introducing optimal control and cooperative control into the dynamic formation control protocol. Figure 12 shows the actual position trajectory of each agent; each agent completes the cooperative optimum triangular formation.

Figure 13 and Figure 14 show the convergence of each agent’s state variables. Figure 13 shows that position-state convergence is achieved after 20 s, and Figure 14 shows that velocity-state consensus is achieved after 15 s. Comparing Figure 13 and Figure 14 with Figure 7 and Figure 8, the system achieves convergence and cooperative formation more quickly (20 s and 15 s instead of 25 s).

5. Conclusions

This paper proposes a cooperative optimal formation control strategy for mixed-order heterogeneous multi-agent systems based on optimal control theory and cooperative control.

Firstly, the heterogeneous multi-agent system, whose agents have different dimensions and models, is written in state-space form using the block Kronecker product. Secondly, graph theory is used to prove the effectiveness of the proposed dynamic and static formation control protocols. Further, optimal control theory and cooperative control are introduced: cooperative control resolves the dimensional inconsistency, optimal control provides the optimal gain parameters, and the cooperative optimal formation control protocols are then presented. Finally, simulation verifies the effectiveness of the cooperative optimal formation control protocol and shows that incorporating optimal control theory and cooperative control speeds up the system’s convergence and the completion of the cooperative formation. In practice, the system is affected by its environment: the USVs and UUVs by ocean conditions and the UAVs by wind. In future work, such environmental disturbances will be taken into consideration in the system formation.

Author Contributions

M.L. and Y.L.; methodology, M.L.; software, L.Z.; validation, M.L., Y.L. and B.L.; formal analysis, M.L.; resources, Y.L.; data curation, Y.L. and M.L.; writing—original draft preparation, M.L.; writing—review and editing, M.L.; supervision, L.Z.; project administration, Y.L.; funding acquisition, Y.G. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (61872204) and the Scientific Research Project of Heilongjiang Provincial Universities, China (Grant No. 145109143).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data that support the findings of this study are available from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.


Figure 1. The illustration of UAV kinematics.

Figure 2. The illustration of UUV.

Figure 3. Formation state with static formation control.

Figure 4. Position state with static formation control.

Figure 5. Velocity state with static formation control.

Figure 6. Formation state with dynamic formation control.

Figure 7. Position state with dynamic formation control.

Figure 8. Velocity state with dynamic formation control.

Figure 9. Static formation state with cooperative control and optimal control.

Figure 10. Position state with static cooperative optimum formation control.

Figure 11. Velocity state with static cooperative optimum formation control.

Figure 12. Dynamic formation state with cooperative control and optimal control.

Figure 13. Position state with dynamic cooperative optimum formation control.

Figure 14. Velocity state with dynamic cooperative optimum formation control.

Table 1. Status variables of the UAV.

UAVs	Initial Position (m)	Initial Speed (m/s)	Initial Attitude Angle (°)	Initial Attitude Angle Rate (°/s)	Expected Position (m)	Expected Speed (m/s)	Expected Attitude Angle (°)	Expected Attitude Angle Rate (°/s)
UAV1	(30,50,50)	(1,1,1)	(3,2,0)	(0,0,0)	(60,70,30)	(0,0,0)	(0,0,0)	(0,0,0)
UAV2	(90,30,50)	(1,−2,1)	(4,1,0)	(0,0,0)	(80,100,30)	(0,0,0)	(0,0,0)	(0,0,0)
UAV3	(60,30,15)	(−2,1,1)	(2,2,0)	(0,0,0)	(80,50,30)	(0,0,0)	(0,0,0)	(0,0,0)

Table 2. Status variables of the USV and UUV.

USVs and UUVs	Initial Position (m)	Initial Speed (m/s)	Expected Position (m)	Expected Speed (m/s)
USV1	(90,50)	(0,−1)	(60,70)	(0,0)
USV2	(65,10)	(0,1)	(80,100)	(0,0)
USV3	(30,20)	(1,1)	(80,50)	(0,0)
UUV1	(40,50,−10)	(−1,−1,−1)	(50,60,−30)	(0,0,0)
UUV2	(60,30,−30)	(2,−2,−1)	(70,90,−30)	(0,0,0)
UUV3	(30,20,−20)	(−1,1,−1)	(60,50,−30)	(0,0,0)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
