Multi-UAV Reconnaissance Task Assignment for Heterogeneous Targets Based on Modified Symbiotic Organisms Search Algorithm

Chen, Hao-Xiang; Nan, Ying; Yang, Yi

doi:10.3390/s19030734

Open AccessArticle

Multi-UAV Reconnaissance Task Assignment for Heterogeneous Targets Based on Modified Symbiotic Organisms Search Algorithm

by

Hao-Xiang Chen

^*

,

Ying Nan

and

Yi Yang

Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China

^*

Author to whom correspondence should be addressed.

Sensors 2019, 19(3), 734; https://doi.org/10.3390/s19030734

Submission received: 22 January 2019 / Revised: 2 February 2019 / Accepted: 5 February 2019 / Published: 12 February 2019

(This article belongs to the Section Physical Sensors)

Download

Browse Figures

Versions Notes

Abstract

:

This paper considers a reconnaissance task assignment problem for multiple unmanned aerial vehicles (UAVs) with different sensor capacities. A modified Multi-Objective Symbiotic Organisms Search algorithm (MOSOS) is adopted to optimize UAVs’ task sequence. A time-window based task model is built for heterogeneous targets. Then, the basic task assignment problem is formulated as a Multiple Time-Window based Dubins Travelling Salesmen Problem (MTWDTSP). Double-chain encoding rules and several criteria are established for the task assignment problem under logical and physical constraints. Pareto dominance determination and global adaptive scaling factors is introduced to improve the performance of original MOSOS. Numerical simulation and Monte-Carlo simulation results for the task assignment problem are also presented in this paper, whereas comparisons with non-dominated sorting genetic algorithm (NSGA-II) and original MOSOS are made to verify the superiority of the proposed method. The simulation results demonstrate that modified SOS outperforms the original MOSOS and NSGA-II in terms of optimality and efficiency of the assignment results in MTWDTSP.

Keywords:

UAV; reconnaissance task assignment; sensor; Pareto dominance determination; symbiotic organisms search

1. Introduction

Unmanned aerial vehicles (UAVs) have played increasingly important roles in military reconnaissance during the last decade. When dealing with multiple reconnaissance tasks [1], it is often required that UAVs form teams and work collaboratively, which necessitates cooperative task assignments [2]. The purpose of cooperative task assignments is to allocate necessary tasks and determine the appropriate task execution sequence to the UAVs to maximize overall performance. The battlefield situation and the range of permissible performance of the UAV set are commonly considered in the cooperative task assignments.

The cooperative task assignment problem (CTAP) is usually considered as NP-Hard Problems. In some prior works, the basic CTAP was formulated as a mixed-integrated linear programming (MILP) problem [3], which aims to obtain an optimal assignment solution. Decentralized approaches were taken to deal with the dynamic, unexpected situations by using the information obtained by itself or through communications with neighboring UAVs [4]. With the intention of reducing the computational burden, optimization algorithms such as genetic algorithms (GA) were adopted to obtain suboptimal solutions [5]. Shaferman et al. [6] took UAVs’ dynamic constrains into consideration to ensure flyable trajectories and applied a stochastic search method to improve assignment solutions. Gyeongtaek et al. [7] put forward a market-based distributed assignment algorithm for timing constrained tasks and determine the allocation sequence by the estimated time of arrival (ETA) of each UAV. In the works mentioned above, UAVs’ paths to the task positions were not precisely calculated during the process of assignment.

In the actual assignment planner, the path for the UAVs to perform the tasks should also be taken into consideration. Thus, in some other prior studies, the basic CTAP were formulated as a travelling salesman problem (TSP) [8], which aims to find the optimal scheme for a salesman to visit all the cities with the shortest path. For the original TSP, the length between two cities is usually considered as Euclidean length. According to the dynamic characteristics and constraints of UAVs, Dubins vehicle model and Dubins path were introduced into the CTAP for route length computation [9]. Thus, the CTAP was formulated as a Dubins travelling salesman problem (DTSP) [10]. Furthermore, Zhang et al. [11] took the effective range of UAVs’ sensors into consideration and formulated the CTAP as a Dubins travelling salesman problem with neighborhood (DTSPN). Wang et al. [12] considered the multi-UAV reconnaissance task allocation as an extended multiple DTSP, where the visit paths to the heterogeneous targets must meet specific constraints due to the targets’ feature. Since the reconnaissance tasks are time-sensitive, the execution time required will affect UAVs’ choices of tasks and flight paths. Considering the time windows of the tasks (targets), Karabulut et al. [13] formulated the TSP with a service time containing ready time and due time, which was so called TSPTW. Nunes et al. [14] explained the different circumstance, under which the time window was took as a hard temporal constraint and a soft temporal constraint.

As for the subject of CTAP, the literature mentioned above concentrated on the heterogeneity features and kinematic constraints of UAVs, but the features of targets were ignored or considered to be homogeneous. In this paper, we focus on the reconnaissance task allocation problem for UAVs, considering ground targets with heterogeneous features and sizes are considered. To describe the CTAP, a novel cooperative reconnaissance task assignment model for multi-UAV is formulated: multiple time window-based Dubins travelling salesmen problem (MTWDTSP). Compared with the MILP form, MTWDTSP takes into account the influence the influence of various flight paths on the assignment results; as for compared with common TSP form, MTWDTSP could deal with the effect of different time windows on task sequence. In this particular problem, heterogeneous reconnaissance targets are also presented as point, strip, and surface targets, according to the features of targets and the performance of UAV sensor. To accomplish the reconnaissance of each target, UAVs must cover all the heterogeneous targets with the airborne sensors.

The MTWDTSP is a typical NP-hard multi-objective optimization problem. Group intelligent optimization methods (such as Bayesian approach [15]) are employed to solve the problem with independent task points in a static environment within limited space, which can reduce the complexity of modeling and calculation. In recent years, heuristic and bio-inspired algorithms based group intelligent optimization methods have been adopted to solve the multi-objective optimization problem, including the simulated annealing algorithm [16], the Tabu search algorithm [17], genetic algorithms (NSGA-II) [18,19], the artificial fish school algorithm [20], ant colony algorithm [21] and the particle swarm optimization (niching PSO) [22,23,24,25]. These algorithms perform better than most traditional mathematical techniques in solving these problems, because they do not require substantial gradient information. Traditional group intelligent optimization algorithms can obtain locally optimal solutions for low-dimensional problems. As the number of UAVs and tasks (targets) grows, the traditional group intelligent optimization algorithms can easily premature and fall into local optimum. In order to search over the designed possible solution spaces as much as possible, Cheng and Prayogo [26] first proposed in 2014 a new meta-heuristic optimization algorithm known as symbiotic organisms search (SOS). In this paper, we introduce a modified multi-objective symbiotic organisms search (MOSOS) [27] to solve the MTWDTSP with dynamic time window constraints, task type constraints, and UAV sensor constraints. To improve the performance of MOSOS, a globally adaptive parasitism parameter [28] is used to increase convergence speed. Pareto optimal solution set is introduced to improve the variety of population and increase the probability of converging to the optimal solution. Also, a double-chain chromosomes encoding is adopted to pre-process the designed solution space for the efficiency of the algorithm.

2. Mathematical Model of the MTWDTSP

In this section, the models of sensors, targets, and UAVs are established. The paths of covering targets with different shapes are also designed to calculate the estimated reconnaissance time.

2.1. Modeling of the UAV and Sensor

In the particular problem, a set of heterogeneous targets need to be reconnoitered by different types of UAVs subject to several types of constraints. In this article, Dubins model for UAV [10] is introduced with the following basic assumptions: (1) the velocity of each UAV is constant; (2) the UAVs perform the reconnaissance task at given altitudes; (3) the UAVs fly at different altitudes without inter-UAV collision; (4) the flight time of each UAV is limited. At this point, the spatial dimension of the UAV can be reduced from three dimensions to two dimensions (X_U-Z_U plane):

S_{U} = (x, z, ψ, V_{U}, r_{\min})

(1)

{\begin{matrix} \dot{x} = V_{U} \cos ψ \\ \dot{z} = V_{U} \sin ψ \\ \dot{ψ} = c \cdot V_{U} / r_{\min} \end{matrix}

(2)

where x and z represent the UAV’s coordinates in plane space, V_U represents for the speed, r_min represents the minimum turning radius, and

ψ

is the flight yaw angle.

Airborne sensors are required to reconnoiter ground targets. It is assumed that the reconnaissance task on this target is finished when a ground target is fully covered by the sensor’s field of view. In this study, a sensor’s field of view is considered as a circle region with the constant opening angle on the ground, neglecting the influence of altitude and noise, as illustrated in Figure 1. In Figure 1, H_U is the flight altitude, W_U is the detection width of the sensor, the axis X_U-Z_U-Y_U denotes the body/dynamic system of the UAV.

2.2. Modeling of Heterogeneous Ground Targets

In general, one of the fundamental features is the target shape. To facilitate the determination of the reconnaissance duration and path, the shape of a target can be divided into point targets, strip targets, and surface targets. Due to the different sizes of targets, the same UAV will have different detection times for each type of target. Assuming that the detection time is always lower than the time window, then the following provisions are made to determine a suitable target detection time:

(1): For point targets, the default detection time is a fixed duration: $t_{d} = t_{D e c}$ ;
(2): For strip targets, the UAV detects targets along the direction of strip extension (shown as Figure 2), and the detection duration is $t_{d} = L_{s t} / V_{U}$ , where $L_{s t}$ represents the extension length of the target and $V_{U}$ is the UAV’s speed.
(3): For surface targets, the UAV must fly along the “Z” path to detect the entire area. Considering the irregularity of the detection area, as shown in Figure 3a, an equal interval rotation method should first be performed to find the smallest circumscribed rectangle (SCR) of the irregular area. Then the circumscribed rectangle should be divided into several smaller rectangles, and finally, the UAV can detect the whole area along the “Z” path.
Denote the SCR as $H_{m i n} = L_{s f} * W_{s f}$ , where L_sf and W_sf represent the length and width, respectively, shown in Figure 3b. Therefore, the detection duration of surface targets is $t_{d} = \frac{1}{V_{U}} (L_{s f} \cdot ⌈ W_{s f} / w_{U} ⌉ + π w_{U} \cdot ⌊ W_{s f} / w_{U} ⌋)$ , where $w_{U}$ is the detection width of a single UAV, $⌈ \cdot ⌉, ⌊ \cdot ⌋$ represent for the upward and downward rounding operators.

It is worth pointing out that the way to select the entry point of the surface or the strip and the generation of Dubins-Path may be found in [12].

The second feature of heterogeneous target is the detectable time. The time attribute of the reconnaissance task means that the target has a fixed time window (as shown in Figure 4):

T_{t a s k} = (t_{b e g i n}, t_{e n d})

, where

t_{b e g i n}

is the earliest time available for detection and

t_{e n d}

is the latest time available, which is considered as a hard-constraint. Denote

G e t_{u a v}

as the arrival time of the UAV, and then the waiting time (shown in Figure 5.) before execution

t_{c}

can be calculated as follows:

t_{C} = {\begin{matrix} t_{b e g i n} - G e t_{u a v} & i f t_{b e g i n} > G e t_{U A V} \\ 0 & i f t_{b e g i n} \leq G e t_{U A V} < t_{e n d} \\ - 1 & e l s e \end{matrix}

(3)

where

t_{c} = - 1

means the failure of this task.

2.3. Assignment Model of Reconnaissance Tasks for Multi-UAVs

Task assignment problem for multiple cooperative UAVs can be described as follows: for

N_{U}

UAVs and

N_{T}

reconnaissance tasks of geographically dispersed targets, under the decision variable

{Χ |}_{N_{U}}^{N_{T}}

, the whole group of the UAVs is able to complete all tasks satisfying all types of constraints

C_{A s n} (c_{1}, c_{2} \dots c_{N_{C}})

while ensuring that the overall performance indicator

J_{A s n}

will be optimized. The objective of the task assignment is to minimize the total cost (performance indicator), which can be expressed as:

{\begin{matrix} {Χ |}_{N_{U}}^{N_{T}} = \arg \min_{X} (J_{A s n}) \\ C_{A s n} (c_{1}, c_{2} \dots c_{N_{C}}) \leq 0 \end{matrix}

(4)

Considering the reconnaissance model, the number of the UAVs, the type of restrictions, and the time window of the task itself, certain tasks may not be successfully implemented. Therefore, determining how to prioritize the important tasks and make the total flight distance as short as possible become the optimization goals of the problem. To describe the above progress in a mathematical form, the decision variable of task assignments can be given as follows:

{X |}_{N_{U}}^{N_{T}} = {χ_{i j}^{k p} | i = 1, 2 \dots N_{U}; j \in T; k = 1, 2 \dots N_{S}; p = 1, 2 \dots N_{P}}

(5a)

χ_{i j}^{k p} = {\begin{matrix} 1, & U A V i f o r T a s k j \\ 0, & e l s e \end{matrix}

(5b)

where T = < 1,2….

N_{T}

> denotes the set of the number for task points, k denotes the type of UAV, p denotes the pth drone of the kth type, and the number of all reconnaissance tasks is

N_{T}

. Therefore, the decision variable has the following matrix form:

{X |}_{N_{U}}^{N_{T}} = [\begin{matrix} a_{00} & a_{01} & \dots & \dots & \dots & a_{0 T_{\max}} \\ a_{10} & a_{11} & \dots & \dots & \dots & a_{1 T_{\max}} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ ⋮ & ⋮ & ⋱ & ⋮ \\ ⋮ & ⋮ & ⋱ & ⋮ \\ a_{T_{\max} 0} & a_{T_{\max} 1} & \dots & \dots & \dots & a_{T_{\max} T_{\max}} \end{matrix}] = [\begin{matrix} 0 & 0 & \dots & 1 & \dots & 0 \\ 0 & 0 & \dots & \dots & 1 & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 1 & ⋮ & 0 & ⋮ \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & 1 & \dots & \dots & 0 \end{matrix}]

(6)

Given that the base node number is 0, it can be seen that the element has the form of a sparse matrix. In this study, considering that each task reconnoiters only once at most, there is at most one nonzero item per row in the sparse matrix.

2.4. Mathematical Model of the Performance Indicator (PI) and Constraints

In this problem, the constraints of assignment involve two parts: physical and logical. Physical constraints are related to UAVs’ flight characteristics and task properties, while logical constraints focus on the requirements of each type of task and the strategies for UAVs to implement the reconnaissance tasks. Some of the fundamental constraints are as follows [29,30]:

(1): All UAVs start from the base and eventually return to the base.
(2): Each task can be completed no more than once.
(3): The assignment scheme should match the time window and imaging requirements of each individual task.
(4): Each UAV can be assigned only one task at the same time period.
(5): The total time for each UAV to implement the tasks should not be longer than the longest flight time.

The details of the constraints are described as below:

(1) PIs based on task assignment:

\min J_{1} (Χ) = λ (N_{U} + 2 - S (Χ)) + (1 - λ) N (Χ)

(7)

S (Χ) = \sum_{k = 1}^{N_{k}} \sum_{p = 1}^{N_{p}} \sum_{i = 1}^{N_{U}} \sum_{j = 1}^{N_{T}} w_{j} χ_{i j}^{k p}

(8)

N_{U} = \sum_{k = 1}^{N_{s}} N_{p}

(9)

where

S (X)

represents the total weight of the tasks that have been assigned,

w_{j}

reflects the importance of the jth reconnaissance task,

N_{U}

is the total number of all UAVs, and

N (X)

is the number of the employed UAVs to execute the assigned tasks.

The PI (7) is designed in a flexible form with the purpose to let the task assignment model choose the proper objective function to optimize depending on the conditions of UAVs and tasks.

λ

represents the adjustment parameter, and divides

m i n J_{1} (X)

into two parts: when

λ = 1

,

\min J_{1} (Χ) = \min (N_{U} + 2 - S (Χ))

, which indicates that the PI chooses to optimize the maximum weight of assigned tasks; when

λ = 0

,

\min J_{1} (Χ) = \min N (Χ)

, which indicates that the PI chooses to optimize the minimum number of the employed UAVs.

Defining the symbolic functions as the following form:

sgn (χ) = {\begin{matrix} 1 & χ = 1 \\ 0 & e l s e \end{matrix}

(10)

so that:

N (X) = \sum_{i = 1}^{N_{U}} sgn (\sum_{j = 1}^{N_{T}} χ_{i j}^{k p})

(11)

If the value of

sgn (χ)

is kept within the range of 0 to 1, the following condition is satisfied:

λ = {\begin{matrix} 1, S (X) < 1 \\ 0, S (X) = 1 \end{matrix}

(12)

Then, it can be ensured that the PI optimizes the maximum number of tasks, and after all the tasks are successfully executed, the number of UAVs is minimized at the same time. Since

N (X) \leq N_{U}

, the following must hold:

J_{1} (X_{1}) = N (X_{1}) < J_{1} (X_{2}) = N_{U} + 2 - S (X_{2})

(13)

This equation is a universal calculation formula for all the possible decision variables X. Therefore, when designing the task weight coefficient w, it is necessary to satisfy the following requirement:

\sum_{j = 1}^{N_{T}} w_{j} = 1

(14)

Denote

w = [w_{1}, w_{2}, w_{3}, \dots, w_{N_{T}}]

, and

w^{'} = [0, w]

represents the extension vector of a weight vector. Then, there is the following expression:

S (X) = \sum_{k = 1}^{N_{k}} \sum_{p = 1}^{N_{p}} χ^{k p} {w^{'}}^{T} \leq 1

(15)

If and only if the non-zero row element of the matrix of all elements in decision variable X is N_T, then all targets have been successfully reconnoitered.

(2) PI based on flight distance:

\min J_{2} (X) = \sum_{k = 1}^{N_{s}} \sum_{p = 1}^{N_{p}} \sum_{i \in T} \sum_{j \in T} D i s_{i j} χ_{i j}^{k p}

(16)

where

D i s_{i j}

represents the distance from node i to node j. The total optimization PI can be written as follows:

m i n J (X) = f (J_{1} (X), J_{2} (X))

(17)

The PI must satisfy the following constraints [29]:

(a): Each target is scouted for only once:

$\sum_{k = 1}^{N_{s}} \sum_{p = 1}^{N_{p}} \sum_{i \in T} \sum_{j \in T} χ_{i j}^{k p} - 1 \leq 0$

(18)
(b): The number of UAVs cannot exceed the maximum number in the base:

$\sum_{p = 1}^{N_{p}} sgn (\sum_{j = 1}^{N_{T}} χ_{i j}^{k p}) - N_{k} \leq 0$

(19)
(c): Each UAV departs from the base and eventually returns to the base (as mentioned above, the base is noted as 0):
$\forall j = 1, 2, 3, \dots, N_{T}$ , and since $χ_{i j}^{k p} = 1$ , then:

${\begin{matrix} \sum_{l = 1}^{N_{T}} x_{0 l}^{k p} = 1 \\ \sum_{l = 1}^{N_{T}} x_{l 0}^{k p} = 1 \end{matrix}$

(20)
(d): Time window constraints:
Define the time window of task j $: [m s_{j}, m o_{j}]$ , $T_{j}$ indicates when the UAV will detect the target; therefore, the constraint can be given as follows:

$m s_{j} \leq T_{j} \leq m o_{j}$

(21)

$T_{j} = T_{i} + S e r_{i}^{k} + t_{i j}^{k} + t_{C}_{j}$

(22)

where $S e r_{i}^{k}$ is the reconnaissance time of the kth type of UAV, $t_{i j}^{k}$ indicates the time for the kth type of UAV to fly from node i to node j, and $t_{C j}$ is the wait time before executing the task.
(e): Sensor constraints of tasks:
The minimum sensor accuracy required for the task j is $S e n T_{j}$ , the sensor accuracy of the kth type of UAV is $S e n V_{j}^{k}$ , and the constraint can be given as follows:

$S e n V_{j}^{k} - S e n T_{j} \leq 0$

(23)
(f): The total flight time of each UAV cannot exceed the maximum flight time:

$\sum_{i = 0}^{N_{T}} \sum_{j = 1}^{N_{T}} χ_{i j}^{k p} (t_{i j}^{k} + t_{C}_{j} + S e r_{j}^{k}) + x_{l a s t - 0}^{k p} t_{l a s t - 0}^{k} - T L_{k} \leq 0$

(24)

where node 0 represents the base by default, last represents the last reconnaissance task point of the UAV, and the latter item $t_{l a s t - 0}^{k}$ represents the time to return from the last task point.

3. Modified SOS for MTWDTSP

The original MOSOS algorithm works on the cooperative behavior seen among organisms in nature. During the search process, each organism benefits from continuous interaction with others in three different phases [24]: mutualism, commensalism and parasitism. Mutualism allows the two sides to benefit from each other; commensalism benefits one party, while the other party is not affected; parasitism benefits one party, and the other party suffers. These three phases are adopted from the most common symbioses used by organisms to increase their fitness and survival advantage over the long term. The mechanisms for updating the best organism are triggered after one generation of organisms has completed its three phases. The phases are repeated until the stopping criterion is achieved. The original MOSOS enjoys some advantages such as simplicity in operation, few control parameters, good stability, and strong optimization ability, but there are several shortcomings such as early maturity and delay in later-search [30]. As for the proposed problem, combining with the motivation to improve the diversity and quality of the solution, the accuracy of the algorithm convergence, and reduce the computational complexity, several modifications are introduced to the original MOSOS algorithm [31]. A “Pareto dominance determination” approach [32] is adopted to preprocess the performance indicator of the proposed problem to improve the diversity of non-inferior solutions. Meanwhile, globally adaptive approach is used to improve speed, accuracy and convergence characteristics of the original MOSOS.

3.1. Double-Chain Encoding of the Decision Variable

Considering the constraints of the multi-UAV reconnaissance task assignment problem, a double-chain encoding method is developed, where both chromosomes are encoded by integers. In Section 2, the decision variable is given by Formulas (4)–(6); and then the individual variable dimensions for each particle are as follows:

N_{p} \times N_{k} \times T_{m a x} \times T_{m a x}

(25)

Considering the dimension, the number of individuals and the iteration equations, undoubtedly, there will be a complication of the problem and a slow convergence of the algorithm. As noted above, each element of the decision variable

χ

is in the form of a sparse matrix. If the sparse matrix is compressed and the location of nonzero elements is stored, then the particle’s variable dimension will be compressed as follows:

N_{p} \times N_{k} \times M \times 2 \times 2

(26)

where M represents the number of nonzero elements in each sparse matrix. Compared to the original coding, the decision space is greatly compressed, but it is still large. It can be imagined that there will still be a large number of zero matrices in the optimal decision variable

χ

.

Considering the constraint functions (a), (b) and (c) proposed in Section 2.4, the double-chain encoding is used from the perspective of the task point, and the decision code string is then mapped to the task sequence of each UAV. For example, for the assignment of six tasks, there are currently three types of UAVs, each of which has 3, 2, and 3 UAVs, so that if T_UAV = (i,j) represents the jth drone of the ith type, the example can be easily constructed as in Table 1.It is clear that for the UAV (1,1), the UAV passes through mission points 3 and 5, with mission point 5 flown to first and mission point 3 flown to last. A detailed decoding table is constructed as shown in Table 2.

Considering the uniform dimensions of the double-chain encoding, T_UAV = (i,j) is reduced dimensionally so that the following formulas are satisfied:

If N = \sum_{k = 1}^{N_{k}} n_{k} and {‖ T_{U A V} = (i, j) ‖}_{1} = M,

(27a)

then {\begin{matrix} i = N_{p} \\ j = n_{N_{k}} - (\sum_{p = 1}^{N_{p}} n_{p} - M) \\ \sum_{p = 1}^{N_{p} - 1} n_{p} < M \leq \sum_{p = 1}^{N_{p}} n_{p} \end{matrix}

(27b)

In summary, the decision variables of

2 \times N_{T}

dimension can be defined to describe the mission assignment status of the UAV, and

N_{T}

represents the number of tasks. The assignment scheme generated by this encoding determines that each task point is visited by a UAV, but in reality, due to constraints such as the time window, there will be situations where certain task points cannot be executed. M can be set to zero to indicate that a UAV was not available to execute the task.

3.2. Pareto Dominance Determination and Optimal Solution Set

According to the mathematical model constructed in Section 2, the task assignment problem can be summarized as a nonlinear multi-objective optimization (minimum) problem:

\begin{matrix} m i n f (X) = (f_{1} (X), f_{2} (X), f_{3} (X) \dots f_{n} (X)) \\ \begin{matrix} X \in Ω \in R^{n} & s . t c_{i} (X) \leq 0 \end{matrix} \end{matrix}

(28)

where

X = (χ_{1}, χ_{2}, χ_{3}, \dots, χ_{n})

is the n-dimensional decision vector in

R^{n}

space and

c_{i} (X) \leq 0

(

i = 1, 2, 3, \dots, p

) represents the constraints of the optimization problem. The series of sub-PIs may conflict with each other; this nonlinear multi-objective optimization problem will produce multiple solutions. French economist V. Pareto proposed the concept of Pareto solution set for comparing the different feasible solutions. Assuming that for the minimization optimization problem, there are two feasible solutions

X_{1}

and

X_{2}

, if the following holds:

\forall f_{i} (X), f_{i} (X_{1}) \leq f_{i} (X_{2}) and \exists J_{k} (X), f_{i} (X_{1}) < f_{i} (X_{2})

(29)

Then,

X_{1}

dominates

X_{2}

, and

X_{1}

is called a noninferior solution, and

X_{2}

is called an inferior solution. If solutions

X_{1}

and

X_{2}

are not dominated by each other, then both solutions are noninferior solutions and are added to the noninferior solution set. This solution set is the so-called “Pareto solution set” and is often referred to as the optimal frontier (as shown in Figure 6).

Figure 6 illustrates the process of minimizing the optimization of the two indicator functions. It can be clearly seen that as the number of iterations increases, the distribution of the optimal solution set tends to increasingly form a curve, and the red squares indicate the best frontier after 200 iterations. Then, using the artificial knowledge, the feasible solutions in the set are analyzed and compared based on different weights of PIs.

This paper focuses on the assignment problem for UAVs with obstacles and threats in the flight zone and presents a modified SOS algorithm based on Pareto dominance determination inspired by traditional evolutional algorithms and PSO. At the same time, a suitable coding operator is designed to effectively address all types of constraints to solve the task assignment problem.

3.3. Modified SOS with Global Adaptive Scaling Factor

Specific steps of the SOS algorithm can be expressed as follows:

Step 1. Initialize the population: First, generate N_b random individual “biologicals” according to Equation (30); each "biological" is an initial solution.

X_{i} = l_{b} + r a n d (1, D_{Χ}) \cdot (u_{b} - l_{b})

(30)

where

X_{i}

represents the ith (i = 1, 2,..., N_b) "biological" in the ecosystem, D_X is the dimension of the solution, and rand(1,D_X) is a 1 × D_X dimension scaling factor vector. u_b and l_b are upper and lower bounds of the search space, respectively.

Step 2. Mutualism: At this stage a “biological” X_j is picked randomly from the population to interact with X_i and produce mutual benefits so that each progresses toward the optimal solution. X_i and X_j generate new solutions of X_i and X_j according to Equation (31). In the formula, X_best is the optimal individual in the ecosystem, V_M is a symbiotic vector and represents the middle point of the two individuals, and BF is the benefit factor; since each individual has a different degree of benefit, BF takes a value of 1 or 2.

\begin{array}{l} X_{i n e w} = X_{i} + r a n d (0, 1) \cdot (X_{b e s t} - V_{M} \cdot B F_{1}) \\ X_{j n e w} = X_{j} + r a n d (0, 1) \cdot (X_{b e s t} - V_{M} \cdot B F_{2}) \\ V_{M} = \frac{X_{i} + X_{j}}{2} \end{array}

(31)

Step 3. Globally Adaptive Parasitism [31]: For the original MOSOS algorithm, only one individual can benefit in a symbiotic relationship, while another individual is not affected. For the ith individual

X_{i}

, one individual

X_{j}

is randomly selected from other individuals. The original MOSOS algorithm always uses the search strategy of “current-to-best.” Because of the randomness of the scaling factor, the search equation limits the guiding effect of the global optimal value, resulting in insufficient convergence accuracy and a long convergence time, which does not meet the requirement of the task assignment problem. For this reason, this paper proposes an adaptive hybrid search strategy with global optimal guidance. The specific form is as follows:

X_{i n e w} = X_{i} + (1 - μ) (X_{i} - X_{j}) + μ (X_{b e s t} - X_{j})

(32a)

μ = μ_{\min} + (μ_{\max} - μ_{\min}) \frac{G_{I}}{G_{\max}}

(32b)

where

μ \in [0, 1]

is the scaling factor;

μ_{\max}

,

μ_{\min}

are the maximum and minimum scaling factors, respectively; G_max is the maximum number of cycles; and G_I is the current number of iterations. This strategy maintains the difference between individuals by adding a differential disturbance factor. It simultaneously introduces an adaptive scaling factor. The initial cycle has a smaller

μ

. The algorithm emphasizes the global search, reduces the tendency to move toward a single point in the search space, and prevents the algorithm from falling into a local minimum. With the progress of the algorithm,

μ

gradually increases, the guiding role of the current optimal individual is strengthened, the leading algorithm advances, and the search process is no longer just a complete random search but instead is more purposeful and directional.

3.4. Optimal Solution Selection of the Proposed Algorithm

The core of the MOSOS algorithm is to determine the global optimal solution

X_{b e s t}

according to Pareto dominance determination, and only one set of noninferior solutions can be identified. Since the optimal solution must exist in the noninferior solution set, in this study the external noninferior profile C of the “biological” population is constructed for the selection of global optimal solution

X_{b e s t}

. Refer to the selection technique used in [29], solutions in the noninferior solution set is sorted by their dominance firstly, then compared with the external noninferior profile C, and replace the dominated solutions in profile C with better ones.

At the meantime, since each iteration of the MOSOS algorithm will inevitably generate many new solutions, the scale of external non-inferior profile will increase dramatically. With the purpose to distribute the optimal front edge as evenly as possible, it is necessary to eliminate a part of the densely distributed individuals and avoid falling into local optimum. Therefore, the external noninferior profile need to be “pruned” during the iterations. Refer to the crowded function defined in NSGA-II, an individual density function for sorting the solutions with normalized PIs and constraints is developed in this study as follows:

D e n_{i} = \frac{1}{{‖ (J_{i + 1, 1} - J_{i - 1, 1}) ‖}_{2} + {‖ (J_{i + 1, 2} - J_{i - 1, 2}) ‖}_{2}}

(33)

As shown in Figure 7,

{‖ (J_{i + 1, 1} - J_{i - 1, 1}) ‖}_{2}, {‖ (J_{i + 1, 2} - J_{i - 1, 2}) ‖}_{2}

are the length and width of the quadrilateral, and the density of the individual i is reciprocal of their sum. Which reflects, the larger

{‖ (J_{i + 1, 1} - J_{i - 1, 1}) ‖}_{2}, {‖ (J_{i + 1, 2} - J_{i - 1, 2}) ‖}_{2}

, the lower density of individual i.

Considering the efficiency of the proposed algorithm, the size of the external non-inferior profile C is limited to less than 1/5 of the size of the ecosystem. The following operations are required during the iterations of MOSOS: when the number of non-inferior solutions in the profile C exceeds its maximum size, calculate the density of each non-inferior solution. Then eliminate the individuals with the highest density until the number of non-inferior solutions does not exceed the largest scale of profile C.

In the end, in order to let individuals to track the position of X_best quickly and accelerate the convergence of the algorithm, the individual with the lowest density in the external non-inferior profile C is selected as the global optimal solution X_best. The algorithm of optimal solution selection is given as follows:

Algorithm 1: Algorithm of the optimal solution selection.

Step 1.: Initialize all the “biologicals” individual variables and calculate their PI values and constraint function values.
Step 2.: According to Section 3.1, all optimal noninferior solutions are added to the external nontrivial file C.
Step 3.: Execute the SOS search iteration formula to obtain a new “biologicals” individual X and calculate its index function value and constraint function value.
Step 4.: Compare all the solutions in the new X with the external noninferior files C according to the criterion. If the new solution dominates the solution in C, then the inferior solution in C is deleted and the new solution is added to C; if the new solution and the solution in C do not dominate each other, the new solution is also added to C; if the new solution is dominated by the C solution, nothing is done.
Step 5.: Conduct pruning operations on the non-inferiority external profile C, and select X_best.
Step 6.: Repeat Steps 2–5 until the SOS algorithm reaches the maximum number of iterations.

5. Numerical Simulation Results and Analysis

The proposed method for task assignments was tested via numerical simulation experiments. The numerical experiments were created by using Visual C++ (Ver. 10.0), and the figures illustrating the results were created using MATLAB. The entire simulation was performed on a workstation consisting of a 3.5 GHz Intel Core-i7 CPU and 16 GB of physical RAM running 64-bit Microsoft Windows 10.

A total of 12 tasks were set in the battlefield. The parameter settings for all task points are shown in Table 3. The minimum turning radius r_min is set as 1 km, and the detection width W_U is set as 2 km.

The distribution map of all task points is shown in Figure 8. Circles represent for the point targets, short strips and rectangles stand for strip targets and surface targets, respectively.

There are the categories of UAVs on the battlefield, located at the starting positions (50, 0), of which parameters are shown in Table 4.

The MOSOS algorithm is set to have a population size of 30; the number of termination iterations G_max is 200, with a maximum external profile size of 30 and an external file of “biological” individual size of 5, scaling factors

μ_{\max} = 0.87

and

μ_{\min} = 0.13

.

Scenario 1: One UAV of each type, with a total of 3 UAVs are employed in the base for task assignment. Since the number of UAVs is much smaller than the number of tasks, UAVs are unable to complete of the tasks, then the adjustment parameter

λ

is set to 1 in this scenario. Figure 9 displayed the distribution of PIs of the initial positions of the population and the optimal frontier distribution of the initial generation. Figure 10 shows the evolution of the history frontier distribution in the iterative process, including the initial generation, the 50th generation, the 100th generation, and the 200th generation.

From the perspective of implementing tasks more economically in the real combat, it is expected that existing UAVs could implement the tasks as much as possible, or implement all the tasks with less UAVs. Thus, the final feasible solution is selected as the one with the smallest value of PI J1, then code of assignment variables to the task is constructed as follows:

[\begin{matrix} 0 & 0 & 0 & 1 & 1 & 0 & 3 & 2 & 2 & 0 & 0 & 3 \\ 1 & 1 & 1 & 2 & 1 & 1 & 1 & 2 & 1 & 1 & 1 & 2 \end{matrix}]

The serial number and detailed flight sequence of each UAV is shown in Table 5, while the paths of each employed UAV are displayed in Figure 11. It can be seen that all the UAVs are employed to execute tasks with significant weight under permitted conditions. Besides, the reconnaissance sequences of the assigned targets for the UAVs are in a good order, and the total flight distance is 456.19 km. Thus, proposed method can obtain satisfied task allocation results for multi-UAV cooperative reconnaissance problems on heterogeneous targets.

Scenario 2: Three UAVs of each type, with a total of 9 UAVs are prepared in the base for task assignment. Figure 12 displayed the distribution of PIs of the initial positions of the population and the optimal frontier distribution of the initial generation.

Figure 13 shows the evolution of the history frontier distribution in the iterative process, each point in the graph represents the best non-inferior solution to the number of iterations. Since there exists solutions for UAVs to complete all the tasks, the adjustment parameter

λ

is set to 0 in the latter stage of optimization. As seen in the figure, with the increase in the number of iterations of the algorithm, the external archive is continuously approaching the best frontier, indicating that the algorithm is convergent. The front-end distribution of all generations of noninferior solutions is uniform, which reflects the distribution of the real optimal frontier.

If taking the most reconnaissance tasks completed as the main PI, that is, the index function J₁, then in this simulation process, a total of 6 drones were deployed to perform reconnaissance tasks of all 9 UAVs, and a total of 12 task points were visited. The code of assignment variables to the task is constructed as follows:

[\begin{matrix} 2 & 1 & 5 & 2 & 6 & 5 & 9 & 2 & 3 & 1 & 6 & 5 \\ 2 & 2 & 1 & 1 & 1 & 2 & 1 & 3 & 1 & 1 & 2 & 3 \end{matrix}]

The detailed flight sequence of each UAV is shown in Table 6, while the paths of each employed UAV are displayed in Figure 14. It can be seen that all the targets are visited by the employed UAVs. The total flight distance of all employed UAVs is 937.52 km. Thus, it is proved once again that the proposed method can obtain satisfied task allocation results for multi-UAV cooperative reconnaissance problems on heterogeneous targets.

With the intention to verify the superiority of the proposed method (IMOSOS) in Section 3, improved MOPSO (IMOPSO) [33], NSGA-II [12] and original MOSOS with the same simulation conditions and stopping criterion are implemented for comparison. The initial parameters of IMOPSO are set as: w_max = 0.9; w_min = 0.4; factor C1 = C2 = 1.454. The initial parameters of NSGA-II are set as: probabilities of crossover and mutation are 0.9 and 0.1, respectively; weight factors α and β of sub-objectives are both set to be 0.5. The initial parameters of original MOSOS is the same as the proposed method. The final feasible solutions of all the algorithms are selected as the one with the smallest value of PI J₁.

Monte-Carlo simulations on Scenario 2 are carried out, as the 12 tasks mentioned above with random positions. And the statistical results of 200 runs displayed in Table 7. Among the results are the best, worst, and average (i.e. Avg.) value of the PIs proposed in Section 3. Also, the compute efficiency of each algorithm is given in Table 7. Table 8 shows the improvement ratio of IMOSOS compared with other algorithms. From the statistical results of the referred indicators, the IMOSOS performs better in searching optimal solutions to the certain task assignment problem than the other three algorithms. Meanwhile, according to the average results of the referred indicators, the improved MOSOS can provide stable solutions as the three algorithms. Thus, with the help of problem formulation of MTWDTSP and modification to the original MOSOS, this proposed method enjoys certain superiority and efficiency for multi-UAV reconnaissance task allocation problems with heterogeneous targets.

Three main aspects have considerable influence on the computational efficiency of the algorithm in the simulation process: the number of “biological” individual, the iterations of the algorithm and the scale of the external non-inferior archive. The increased number of task points and the number of the UAVs have little effect on the computational efficiency of the algorithm. The encoding rules ensure that the number of task points is linearly related to the calculation time of the algorithm. If more tasks, UAVs or bases are considered in this particular problem, in order to solve the problem effectively, the size of encoded decision variables and iterations may increase correspondingly. Therefore, for the problem of multi-UAV multi-reconnaissance tasks, the proposed method can provide a valid suboptimal solution.

6. Conclusions

This paper considers a reconnaissance task assignment problem for multi-UAVs as a multi-objective, multi-constraint nonlinear optimization problem. A model of cooperative UAVs and reconnaissance tasks of heterogeneous ground targets with time windows is built to describe this problem. To solve the task assignment problem, two PIs were constructed and need to be optimized. When dealing with the multi-UAV task assignment optimization model, this paper adopts the modified MOSOS algorithm based on the Pareto dominance determination and the global adaptive operation. To address the various constraints of the task assignment model, the decision variables designed from each UAV itself are mapped to the task point of view. Therefore, the double-chain encoding for modified MOSOS inherently meets the constraint requirements of the task assignment model and greatly reduces the dimensionality of the decision variables, thereby speeding up the convergence of the algorithm. The global best external archive and the best external individual archive are designed in this study. Furthermore, this study defines the density function of each individual, thus ensuring that the solution to the problem meets the requirements of diversity and homogeneity. Finally, numerical simulation results and statistical results of Monte-Carlo simulations are proposed to verify the superiority and efficiency of the introduced method.

Author Contributions

H.-X.C. wrote the original article, designed experiments and analyzed simulation results; Y.N. reviewed the original article and provided part of original simulation data; Y.Y. provided funding project and part of original simulation data.

Funding

This work was supported by funding of Jiangsu Innovation Program for Graduate Education, KYLX15 0319, and the Fundamental Research Funds for the Central Universities.

Conflicts of Interest

The authors declare no conflict of interest.

Data Availability

The values of some of the simulation parameters are available online (literatures), and other data is original from the authors of this paper.

Abbreviations

x	The UAV’s coordinates in plane space
z	The UAV’s coordinates in plane space
V_U	Speed of the UAV
r_min	Minimum turning radius of the UAV
$ψ$	The flight yaw angle of the UAV
H_U	The flight altitude of the UAV
W_U	The detection width of the sensor
t_d	Target detection time
t_Dec	Detection time of the point target
L_st	Extension length of the strip target
L_sf	Length of the smallest circumscribed rectangle
W_sf	Width of the smallest circumscribed rectangle
T_Task	Time window of the task
$t_{b e g i n}$	The earliest time available for detection
$t_{e n d}$	The latest time available for detection
$t_{c}$	The waiting time before execution
$N_{U}$	Number of UAVs
$N_{T}$	Number of reconnaissance tasks
${Χ \|}_{N_{U}}^{N_{T}}$	The decision variable
$J_{A s n}$	Performance indicator of the task assignment problem
$C_{A s n}$	Constraint of the task assignment problem
$χ_{i j}^{k p}$	Element of the decision variable
T	The set of the number for task points
N_k	The number of types of the UAV
N_p	The number of the UAV of the kth type
$J_{1}$	Performance indicator of tasks/UAVs
$λ$	Adjustment parameter
$S (Χ)$	The total weight of the tasks that have been assigned
$N (X)$	The number of the employed UAVs to execute the assigned tasks
$sgn (\cdot)$	Symbol function
$w$	Weight vector
$J_{2}$	Performance indicator of flight distance
$D i s_{i j}$	The distance from task i to task j
$S e n T_{j}$	The minimum sensor accuracy required for the task j
$S e n V_{j}^{k}$	The sensor accuracy of the kth type of UAV
$X_{i}$	The ith (i = 1, 2,..., N_b) "biological" in the ecosystem
D_X	The dimension of the solution
$B F$	The benefit factor
$μ$	The scaling factor
G_max	The maximum number of cycles
G_I	The current number of iterations
C	The external noninferior profile of the noninferior solution set

References

Chandler, P.R.; Pachter, M.; Rasmussen, S.J.; Schumacher, C. Multiple task assignment for a UAV team. In Proceedings of the AIAA Guidance, Navigation, and Control Conference, Monterey, CA, USA, 5–8 August 2002. [Google Scholar]
Shima, T.; Rasmussen, J. UAV Cooperative Decision and Control: Challenges and Practical Approaches; SIAM: Philadelphia, PA, USA, 2009. [Google Scholar]
Shima, T.; Schumacher, C. Assigning cooperating UAVs to simultaneous tasks on consecutive targets using genetic algorithms. J. Oper. Res. Soc. 2009, 60, 973–982. [Google Scholar] [CrossRef]
Choi, H.; Kim, Y.; Kim, H. Genetic Algorithm Based Decentralized Task Assignment for Multiple Unmanned Aerial Vehicles in Dynamic Environments. Int. J. Aeronaut. Space Sci. 2011, 12, 163–174. [Google Scholar] [CrossRef]
Edison, E.; Shima, T. Integrated Task Assignment and Path Optimization for Cooperating Uninhabited Aerial Vehicles Using Genetic Algorithms. Comput. Oper. Res. 2011, 38, 34–356. [Google Scholar] [CrossRef]
Shaferman, V.; Shima, T. Unmanned Aerial Vehicles Cooperative Tracking of Moving Ground Target in Urban Environments. J. Guid. Control Dyn. 2008, 31, 1360–1371. [Google Scholar] [CrossRef]
Oh, G.; Kim, Y.; Ahn, J.; Choi, H.-L. Market-Based Distributed Task Assignment of Mulstiple Unmanned Aerial Vehicles for Cooperative Timing Mission. J. Aircr. 2017, 54, 1–13. [Google Scholar] [CrossRef]
Larranaga, P.; Kuijpers, C.M.H.; Murga, R.H. Genetic Algorithms for the Travelling Salesman Problem: A Review of Representations and Operators. Artif. Intell. Rev. 1999, 13, 129–170. [Google Scholar] [CrossRef]
Dubins, L.E. On curves of minimal length with a constraint on average curvature and with prescribed initial and terminal positions and tangents. Am. J. Math. 1957, 79, 497–516. [Google Scholar] [CrossRef]
Savla, K.; Frazzoli, E.; Bullo, F. Traveling Salesperson Problems for the Dubins Vehicle. IEEE Trans. Autom. Control 2008, 53, 1378–1391. [Google Scholar] [CrossRef] [Green Version]
Zhang, X.; Chen, J.; Xin, B.; Peng, Z. A memetic algorithm for path planning of curvature-constrained uavs performing surveillance of multiple ground targets. Chin. J. Aeronaut. 2014, 27, 622–633. [Google Scholar] [CrossRef]
Wang, Z.; Liu, L.; Long, T.; Wen, Y. Multi-UAV Reconnaissance Task Allocation for Heterogeneous Targets Using an Opposition-Based Genetic Algorithm with Double-Chromosome Encoding. Chin. J. Aeronaut. 2018, 31, 339–350. [Google Scholar] [CrossRef]
Karabulut, K.; Tasgetiren, M.F. A variable iterated greedy algorithm for the traveling salesman problem with time windows. Inform. Sci. 2014, 279, 383–395. [Google Scholar] [CrossRef]
Nunes, E.; Manner, M.; Mitiche, H.; Gini, M. A taxonomy for task allocation problems with temporal and ordering constraints. Rob. Auto. Syst. 2017, 90, 55–70. [Google Scholar] [CrossRef]
Lanillos, P.; Yañez-Zuluaga, J.; Ruz, J.J.; Besada-Portas, E. A bayesian approach for constrained multi-agent minimum time search in uncertain dynamic domains. In Proceedings of the Fifteenth Annual Conference on Genetic and Evolutionary Computation Conference (GECCO 2013), Amsterdam, The Netherlands, 6–10 July 2013; pp. 391–398. [Google Scholar]
Darrah, M.A.; Niland, W.; Stolarik, B. Multiple UAV Task Allocation for an Electronic Warfare Mission Comparing Genetic Algorithms and Simulated Annealing. 2006; preprint. [Google Scholar]
Ryan, J.L.; Bailey, T.G.; Moore, J.T.; Carlton, W.B. Reactive tabu search in unmanned aerial reconnaissance simulations. In Proceedings of the 30th conference on Winter simulation, Washington, DC, USA, 13–16 December 1998. [Google Scholar]
Shima, T.; Rasmussen, S.; Sparks, A.; Passino, K. Multiple task assignments for cooperating uninhabited aerial vehicles using genetic algorithms. Comput. Oper. Res. 2006, 33, 3252–3269. [Google Scholar] [CrossRef]
Wang, S.; Ali, S.; Yue, T.; Liaaen, M. Integrating Weight Assignment Strategies with NSGA-II for Supporting User Preference Multiobjective Optimization. IEEE Trans. Evolut. Comput. 2018, 22, 378–393. [Google Scholar] [CrossRef]
Cai, Y. Artificial Fish School Algorithm Applied in a Combinatorial Optimization Problem. Int. J. Intell. Syst. Appl. 2010, 2, 37–43. [Google Scholar] [CrossRef] [Green Version]
Perez-Carabaza, S.; Besada-Portas, E.; Lopez-Orozco, J.A.; Jesus, M. Ant colony optimization for multi-UAV minimum time search in uncertain domains. Appl. Soft Comput. 2018, 62, 789–806. [Google Scholar] [CrossRef]
Geng, N.; Gong, D.W.; Zhang, Y. PSO-Based Robot Path Planning for Multisurvivor Rescue in Limited Survival Time. Math. Probl. Eng. 2014, 2014, 187370. [Google Scholar] [CrossRef]
Gong, D.-W.; Zhang, Y.; Qi, C.-L. Localising odour source using multi-robot and anemotaxis-based particle swarm optimisation. IET Contr. Theor. Appl. 2012, 6, 1661–1670. [Google Scholar] [CrossRef]
Geng, N.; Gong, D.; Zhang, Y. Robot path planning in an environment with many terrains based on interval multi-objective PSO. In Proceedings of the 2013 IEEE Congress on Evolutionary Computation (CEC), Cancun, Mexico, 20–23 June 2013. [Google Scholar]
Zhang, J.; Gong, D.; Zhang, Y. A niching PSO-based multi-robot cooperation method for localizing odor sources. Neurocomputing 2014, 123, 308–317. [Google Scholar] [CrossRef]
Cheng, M.-Y.; Prayogo, D. Symbiotic Organisms Search: A new meta-heuristic optimization algorithm. Comput. Struct. 2014, 139, 98–112. [Google Scholar] [CrossRef]
Tran, D.H.; Cheng, M.Y.; Prayogo, D. A novel Multiple Objective Symbiotic Organisms Search (MOSOS) for time–cost–labor utilization tradeoff problem. Knowledge-Based Syst. 2016, 94, 132–145. [Google Scholar] [CrossRef] [Green Version]
Panda, A.; Pani, S. A Symbiotic Organisms Search algorithm with adaptive penalty function to solve multi-objective constrained optimization problem. Appl. Soft Comput. 2016, 46, 344–360. [Google Scholar] [CrossRef]
Korsah, G.A.; Kannan, B.; Browning, B.; Stentz, A.; Dias, M.B. xBots: An approach to generating and executing optimal multi-robot plans with cross-schedule dependencies. In Proceedings of the 2012 IEEE International Conference on Robotics and Automation, Saint Paul, MN, USA, 14–18 May 2012. [Google Scholar]
Lanillos, P.; Besada-Portas, E.; Lopez-Orozco, J.A.; de la Cruz, J.M. Minimum time search in uncertain dynamic domains with complex sensorial platforms. Sensors 2014, 14, 14131–14179. [Google Scholar] [CrossRef] [PubMed]
Tejani, G.G.; Savsani, V.J.; Patel, V.K. Adaptive symbiotic organisms search (SOS) algorithm for structural design optimization. J. Comput. Design Eng. 2016, 3, 226–249. [Google Scholar] [CrossRef] [Green Version]
Van Den Berg, E.; Friedlander, M.P. Probing the Pareto Frontier for Basis Pursuit Solutions. Siam J. Sci. Comput. 2008, 31, 890–912. [Google Scholar] [CrossRef]
Ding, S.; Chen, C.; Xin, B. A bi-objective load balancing model in a distributed simulation system using NSGA-II and MOPSO approaches. Appl. Soft Comput. 2018, 63, 249–267. [Google Scholar] [CrossRef]

Figure 1. Body/dynamic system and airborne sensor of UAV.

Figure 2. Reconnaissance of strip targets.

Figure 3. Reconnaissance of surface targets. (a) Smallest circumscribed rectangle of surface targets; (b) Reconnaissance path.

Figure 4. Time window of the reconnaissance task.

Figure 5. Schematic diagram of the waiting time and arrival time. (a) Arrive before available; (b) Arrive when available.

Figure 6. Distribution of Pareto optimal front.

Figure 7. Density function between individuals.

Figure 8. Distribution map of all task points.

Figure 9. Initial “biological” performance index distribution of Scenario 1.

Figure 10. The best frontier distribution in history of Scenario 1.

Figure 11. Paths of all UAVs in Scenario 1.

Figure 12. Initial “biological” performance index distribution in Scenario 2.

Figure 13. The best frontier distribution in history in Scenario 2.

Figure 14. Paths of employed UAVs.

Table 1. Example of UAV Task Assignment.

Task Number	1	2	3	4	5	6
T_UAV	(2,1)	(3,2)	(1,1)	(2,1)	(1,1)	(3,1)
Task Order	1	1	2	2	1	1

Table 2. Task Assignment Decoding.

T_UAV	(1,1)	(2,1)	(3,1)	(3,2)
Task Timing	5-3	1-4	6	2

Table 3. Parameters of Task Point Parameters.

Task No.	Position/km	Time Win./s	Sensor Req.	Shape	Size/km	Weight Para.
1	(22,55)	(100,300)	3	Point	-	0.04587509
2	(47,80)	(100,200)	3	Strip	4.2	0.10454398
3	(80,13)	(100,200)	3	Strip	5.1	0.12473234
4	(36,46)	(250,500)	2	Surface	6*2	0.13173999
5	(50,50)	(250,500)	2	Strip	4.0	0.09815578
6	(75,50)	(250,500)	2	Point	-	0.03243373
7	(33,75)	(750,1000)	1	Strip	5.6	0.11667218
8	(50,75)	(750,1000)	1	Point	-	0.09324776
9	(75,75)	(650,1000)	1	Surface	8*4	0.15580116
10	(63,72)	(500,750)	2	Point	-	0.02624827
11	(13,45)	(1100,1300)	2	Point	-	0.03795899
12	(25,20)	(560,1500)	2	Point	-	0.03259074

Table 4. Parameters of UAVs.

Category	Max. Speed	Max. Cru. Time	Sensor Level
1	230 m/s	800 s	3
2	310 m/s	420 s	3
3	170 m/s	1100 s	2

Table 5. Flight Sequence of Each UAV in Scenario 1.

UAV No.	UAV Category	Task Sequence
1	1	5-4
2	2	9-8
3	3	7-12

Table 6. Flight Sequence of Each UAV in Scenario 2.

UAV No.	UAV Category	Task Sequence
1	1	10-2
2	1	4-1-8
3	1	9
5	2	3-6-12
6	2	5-11
9	3	7

Table 7. Statistical results of Monte-Carlo simulations on Scenario 2.

Indicator	J1			J2 (km)			Compute Efficiency (s)
Algorithm	Best	Worst	Avg.	Best	Worst	Avg.	Best	Worst	Avg.
IMOSOS	6	8	6.47	937.52	1089.71	956.45	9.2	17.5	12.7
MOSOS	6	9	7.33	941.27	1146.49	1017.92	14.3	22.9	17.1
NSGA-II	6	9	7.21	932.31	1123.56	998.41	16.4	26.6	21.3
IMOPSO	6	8	7.21	939.26	1109.56	979.9	12.6	20.9	15.3

Table 8. Improvement ratio of IMOSOS compared with other algorithms.

Indicator	J1	J2	Compute Efficiency
MOSOS	11.7%	6.1%	25.7%
NSGA-II	10.2%	4.2%	40.2%
IMOPSO	10.2%	2.0%	16.9%

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Chen, H.-X.; Nan, Y.; Yang, Y. Multi-UAV Reconnaissance Task Assignment for Heterogeneous Targets Based on Modified Symbiotic Organisms Search Algorithm. Sensors 2019, 19, 734. https://doi.org/10.3390/s19030734

AMA Style

Chen H-X, Nan Y, Yang Y. Multi-UAV Reconnaissance Task Assignment for Heterogeneous Targets Based on Modified Symbiotic Organisms Search Algorithm. Sensors. 2019; 19(3):734. https://doi.org/10.3390/s19030734

Chicago/Turabian Style

Chen, Hao-Xiang, Ying Nan, and Yi Yang. 2019. "Multi-UAV Reconnaissance Task Assignment for Heterogeneous Targets Based on Modified Symbiotic Organisms Search Algorithm" Sensors 19, no. 3: 734. https://doi.org/10.3390/s19030734

APA Style

Chen, H. -X., Nan, Y., & Yang, Y. (2019). Multi-UAV Reconnaissance Task Assignment for Heterogeneous Targets Based on Modified Symbiotic Organisms Search Algorithm. Sensors, 19(3), 734. https://doi.org/10.3390/s19030734

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Multi-UAV Reconnaissance Task Assignment for Heterogeneous Targets Based on Modified Symbiotic Organisms Search Algorithm

Abstract

1. Introduction

2. Mathematical Model of the MTWDTSP

2.1. Modeling of the UAV and Sensor

2.2. Modeling of Heterogeneous Ground Targets

2.3. Assignment Model of Reconnaissance Tasks for Multi-UAVs

2.4. Mathematical Model of the Performance Indicator (PI) and Constraints

3. Modified SOS for MTWDTSP

3.1. Double-Chain Encoding of the Decision Variable

3.2. Pareto Dominance Determination and Optimal Solution Set

3.3. Modified SOS with Global Adaptive Scaling Factor

3.4. Optimal Solution Selection of the Proposed Algorithm

5. Numerical Simulation Results and Analysis

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

Data Availability

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI