1. Introduction
With the rapid rise of artificial intelligence and aircraft technology, unmanned aerial vehicles (UAVs) are playing an increasing role in engineering applications by virtue of their low cost and comprehensive functionality [1,2]. Nowadays, the coordinated operation of multiple UAVs has become an important mode of operation because it adapts to more complex environments and achieves better mission results [3,4,5]. Cooperative task allocation among multiple UAVs is one of the key technologies in the top-level design of UAV cooperative operations [6,7]. However, most existing investigations of UAV task allocation assume a static process and a deterministic mission environment, without considering the uncertain environments and dynamic changes that arise in actual applications.
It is not enough for multi-UAV task allocation research to stay only in static target allocation in deterministic environments. On the one hand, task allocation for cooperating UAVs is inevitably a dynamic process [8], and a static task allocation scheme cannot match the flexible and changing situations of real applications. On the other hand, environmental parameters influence the task allocation process. These include changes in the target location information, uncertainty in the duration of task execution at a target, the success rate of UAV strikes against ground targets, and the threat that ground positions pose to the UAVs. If they are not considered, an “optimal” task allocation scheme formulated in advance may fail to achieve the desired effect or even to complete the mission [9]. Both static target assignment and deterministic environments are idealized treatments of practical task assignment problems, so further research on dynamic task assignment for multiple UAVs in uncertain environments is necessary.
Dynamic task allocation is a multistage allocation process that is much more complex than static task allocation [10]. For one thing, changing environment information needs to be taken into account throughout the decision-making process, requiring an “observe–fire–observe” strategy. For another, the purpose of dynamic target allocation is to find the global optimal solution of the whole confrontation process, not of a single stage.
Although most of the current research focuses on static task allocation, some scholars have realized the importance of dynamic task allocation and conducted research.
The work in [11] designed a compromise task assignment method for multiple UAVs operating in search and rescue scenarios under multiple constraints, in which a compromised dynamic performance impact handles new tasks that may arise during task execution and thereby accommodates dynamic events. To address the limitation and waste of computational resources, the study in [12] proposed a multitask particle swarm optimization algorithm based on a dynamic on-demand allocation strategy to achieve dynamic allocation. The research in [13] proposed an improved particle swarm optimization algorithm and introduced constraints such as the weapon deflagration time window to verify the effectiveness of the particle swarm algorithm in solving the weapon target allocation problem with multiple missiles. In their research on the heterogeneous UAV task allocation problem, the researchers in [14] constructed a dynamic collaborative task allocation model and then improved the contract net algorithm by using psychological coefficients, the blackboard model, and a buffer pool mechanism, which improved the efficiency of task allocation and produced more reasonable collaborative allocation results. Several researchers [15,16,17] proposed a multiobjective simplified swarm optimization algorithm to address the long computation time and low solving efficiency of weapon target allocation algorithms and demonstrated its high computational efficiency and solving accuracy. Some studies [18,19] proposed a multiobjective particle swarm control allocation method based on adaptive probability guidance and increased the variation factor according to the convergence index to realize the multiobjective control allocation of the manipulation surface. For the multiarmed bandit (MAB) problem in nonstationary environments, the work in [20] utilized the predictive power of large language models (LLMs) to deal with dynamic environments and opened new avenues for applying LLMs to enhance traditional decision-making strategies in dynamic environments. The objective of this paper is to present a dynamic strategy for the allocation of tasks within the context of engineering practice for the Blue side’s multilayered defense system. Consequently, the scheme of the task allocation at each level can be effectively aligned by introducing dynamic time slices. Beyond that, the rest of the literature mentioned above addressed the allocation scheme to some extent in a dynamic process. However, there is still a lack of studies that allow for quantitative adjustment of the allocation frequency. The correlation between the level of dynamics and the efficacy of the entire system requires further characterization.
In an uncertain environment, the actual tasking problem often faces partial information uncertainty as the number of disturbing factors in the mission environment increases. The information related to one’s own heterogeneous multi-UAVs can be easily obtained, but the information related to the ground targets to be attacked may be imprecise, or force majeure may be encountered during flight. As a result, the critical parameters in multi-UAV tasking become fuzzy numbers.
The literature on task allocation in uncertain environments is similarly limited, and the mainstream approaches can be categorized into robust optimization and fuzzy planning. Some scholars have proposed robust optimization methods to solve the problem of multi-UAV task allocation in uncertain environments. The work in [21] proposed a two-stage robust optimization model to solve the problem of drone scheduling under demand uncertainty. The study in [22], considering the uncertainty of drone fuel consumption parameters, used a robust optimization method to construct a robust counterpart of the original model, which achieves an effective balance between the robustness and the optimality of the task assignment. A robust optimization module was proposed in [23] to improve the robustness of the task assignment scheme under time–cost uncertainty. However, the task allocation schemes obtained by robust optimization methods suffer from over-conservatism, which makes it difficult to achieve global optimization in dynamic task allocation problems. Meanwhile, the fuzzy planning method addresses task assignment under uncertain conditions by describing the uncertain parameters in the form of fuzzy sets. In the study in [24], a fuzzy resource scheduling model with time–cost constraints was developed based on fuzzy planning theory for the resource scheduling problem with uncertain execution time, and an improved chaotic ant colony algorithm was proposed to solve the model. In the research in [25], a multiobjective fuzzy chance-constrained model based on credibility measure theory was constructed for the vehicle routing problem with fuzzy demand and fuzzy time windows, and an improved hybrid algorithm was used to solve the model. In the study in [26], a multistage fuzzy multiobjective task allocation planning model was developed to address the fuzzy target threat in the problem of coordinated ground attack by a drone, and a Nash equilibrium solution based on a multi-strategy fusion algorithm was proposed. The fuzzy planning approach emphasizes constraint satisfaction by transforming the objective function and constraints of a fuzzy plan into a general planning model suitable for UAV tasking.
At the level of intelligent algorithmic solutions, there is a growing body of literature that applies the grey wolf optimization (GWO) algorithm to optimization problems associated with UAVs. The works in [27,28,29,30,31] demonstrated the efficacy of the GWO algorithm in addressing optimization issues across diverse domains. Among these, the work in [27] studied the specific trajectory planning problem of bridge inspection, while the works in [28,29,30,31] proposed different improvement strategies for the convergence speed and search capability of the GWO algorithm. The work in [28] improved the grey wolf search strategy to enhance the convergence speed and combined it with the mutation operation of the differential evolution algorithm to improve the search capability. The work in [29] put forward a novel relative distance adaptation strategy to enhance the convergence speed and combined it with the simulated annealing algorithm for alternative position updates, with the intention of improving the search process with different capabilities. In contrast, the work in [30] employed a Gaussian mutation strategy and a spiral function to enhance the search capability, and the work in [31] enhanced the convergence speed by integrating two improved grey wolf algorithms, with reinforcement learning employed for policy switching.
Motivated by the above algorithm research, this paper proposes improvements to the convergence speed and search ability of the algorithm and also designs a novel four-dimensional grey wolf information strategy. Through spatial coverage of the solution vector, this strategy enhances the global optimization ability, which is a weakness of the classical GWO algorithm.
In summary, this paper addresses the dynamic task assignment problem for heterogeneous multi-UAVs in uncertain environments. First, based on binary interval number theory, a fuzzy multiconstraint planning model is constructed for the uncertain environment with the optimization objective of minimizing cost. Second, dynamic time slices are set up so that dynamic task allocation is realized by adjusting the time slices. Finally, the four-dimensional information grey wolf optimization (4DI-GWO) algorithm is designed to search for the global optimal solution.
The main novelties of this study are as follows:
(1) Aiming at the task allocation problem in uncertain environments, a fuzzy multiconstraint planning model is constructed based on binary interval number theory. The model accounts for a series of uncertainties, such as deviations in the target location information and uncertainty in the duration of task execution at a target.
(2) Dynamic task allocation is carried out while the multi-UAVs detect and attack ground targets. The time factor in dynamic task allocation is fully considered, and on this basis time slices are designed so that the number of assignments can be adjusted according to the decision maker’s preference.
(3) A four-dimensional information grey wolf optimization algorithm is proposed that greatly enhances the global search capability and adaptively tunes the initial packaged information of the grey wolf population. Other aspects of performance, such as convergence speed, are ensured by the remaining improvement strategies.
The remainder of this paper is organized as follows: the dynamic task allocation problem in uncertain environments is defined and described in Section 2. The dynamic allocation method based on the 4DI-GWO algorithm for heterogeneous multi-UAVs is described in Section 3. In Section 4, simulations are conducted to validate the algorithms and methods. Finally, we conclude the paper in Section 5.
2. Task Allocation Model for Uncertain Environments
2.1. Problem Description
This paper examines the task allocation of multiple heterogeneous UAVs for coordinated missions against ground targets in an uncertain environment. The scenario considered is a real application environment where target-related parameters, such as the target location information, the mission sustainment time, and the probability of UAV survival, are uncertain. In the proposed scenario, there are no no-fly zones, terrain obstacles, or sudden threats. All UAVs fly at the same altitude and have access to the same information. The task assignment scheme must be optimized to minimize task failure rates and improve scheme robustness with multiple constraints under these uncertainties.
Figure 1 illustrates the confrontation between Red’s heterogeneous multi-UAV swarms and Blue’s ground targets.
There are three types of UAVs, namely Reconnaissance UAVs, Strike UAVs, and Reconnaissance–Strike UAVs, with a total number of M. Reconnaissance UAVs are designed solely for reconnaissance missions, while Strike UAVs are designed solely for strike missions. Reconnaissance–Strike UAVs are capable of performing both reconnaissance and strike missions. The attributes of the ith UAV are represented by a tuple that contains the code, the type, the value of the UAV itself, the probability of striking the target, the position of the UAV, and the cruising speed of the UAV. The cruising speed of the UAV is assumed to be constant during the mission. It is also assumed that both the Reconnaissance–Strike and Strike UAVs carry enough strike resources to accomplish the strike missions for all targets.
With respect to the target parameters, this paper sets N targets in a two-dimensional uncertain mission environment. Each target requires a detection mission and an attack mission, i.e., there are 2N missions in the mission environment. A tuple is used to represent the parameters of the jth ground target, including the code, the type, the intrinsic value, the probability that the target destroys the UAV, the position coordinates, the duration of the reconnaissance mission, and the duration of the strike mission. In this paper, it is assumed that the benefit of destroying a target, as given in Equation (5), is obtained only after the target is destroyed.
2.2. Fuzzy Variables Handling
In a complex application environment, the Red side can easily obtain information about its heterogeneous UAVs. Still, the information about the Blue targets to be attacked may be imprecise, including the relative distance between the UAVs and the targets, the time to execute the reconnaissance or strike mission, and the probability of UAV survival during the mission. Compared with a deterministic mission environment, these imprecise key parameters make it difficult to solve the mission assignment problem accurately. Therefore, this paper abstracts the uncertain variables into interval numbers, each bounded by a lower and an upper value, and constructs a fuzzy-constrained planning model based on binary interval number theory.
Interval numbers are a powerful tool for describing uncertain information and have been widely used in multiattribute decision making under uncertainty and in fuzzy control. When describing uncertain information using interval numbers, it is necessary to design an interval number ordering criterion in addition to solving the basic interval number synthesis problem. Therefore, this paper applies the interval ordering method based on the possibility degree, which defines a measure that reflects the extent to which one interval is larger than another and derives an ordering between intervals based on this measure. The specific sorting rules are as follows.
Given two interval numbers, the possibility degree represents the degree of likelihood that the first is greater than or equal to the second. It measures the relationship between the two intervals and generally takes a value between 0 and 1, determined by the particular possibility formula adopted. Here, the values within the two intervals are assumed to obey uniform distributions and to be independent of each other, so calculating the possibility degree is equivalent to calculating the probability that a random value drawn from the first interval is greater than or equal to a random value drawn from the second. The result is a piecewise expression in the endpoints of the two intervals.
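The possibility degree under the uniform and independent assumption can be sketched in a few lines. This is a minimal illustration rather than the paper's own formula: the function name `possibility_degree` and the `(lower, upper)` pair representation are assumptions introduced here, and the value is obtained by integrating the cumulative distribution of the second interval over the first.

```python
def possibility_degree(a, b):
    """P(x >= y) for independent x ~ U[a] and y ~ U[b].

    a and b are (lower, upper) pairs; this representation is an assumption
    made for illustration.
    """
    a_lo, a_hi = a
    b_lo, b_hi = b
    wa, wb = a_hi - a_lo, b_hi - b_lo

    def cdf_b(x):
        # P(y <= x); a zero-width b behaves like a step function
        if wb == 0:
            return 1.0 if x >= b_lo else 0.0
        return min(1.0, max(0.0, (x - b_lo) / wb))

    if wa == 0:
        return cdf_b(a_lo)          # a is a crisp number
    if wb == 0:
        return min(1.0, max(0.0, (a_hi - b_lo) / wa))  # b is a crisp number

    def integral_cdf_b(x):
        # Antiderivative of cdf_b, equal to zero at or below b_lo
        if x <= b_lo:
            return 0.0
        if x >= b_hi:
            return wb / 2.0 + (x - b_hi)
        return (x - b_lo) ** 2 / (2.0 * wb)

    # Average of cdf_b over the first interval
    return (integral_cdf_b(a_hi) - integral_cdf_b(a_lo)) / wa
```

Two identical intervals give 0.5, disjoint intervals give 0 or 1, and for non-degenerate intervals the two orderings sum to 1, which is what makes the measure usable for ranking candidate solutions.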
In addition, the binary interval theory is used later in the optimization part of the 4DI-GWO algorithm to compute fitness while updating the grey wolf positions. Specifically, the fitness is first split into an upper and a lower bound, which are computed from the upper and lower bounds of the corresponding parameters, respectively. The two bounds are then combined by a weighted sum to obtain the final fitness value. In the position update operation, the possibility degree of the interval ordering serves as the basis for selecting candidate solutions and updating iterations.
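The bound-wise fitness evaluation described above can be sketched as follows. The names `interval_fitness`, `cost_fn`, and `weight_lower`, as well as the default weight of 0.5, are assumptions made for illustration; the source does not state the weights used.

```python
def interval_fitness(cost_fn, lower_params, upper_params, weight_lower=0.5):
    """Evaluate a fitness function on both parameter bounds and blend the results.

    cost_fn is any callable mapping a parameter list to a scalar cost
    (hypothetical placeholder for the model's objective).
    """
    f_lower = cost_fn(lower_params)   # fitness from the lower parameter bounds
    f_upper = cost_fn(upper_params)   # fitness from the upper parameter bounds
    return weight_lower * f_lower + (1.0 - weight_lower) * f_upper
```

For example, with `cost_fn=sum`, lower bounds `[1, 2]`, and upper bounds `[3, 4]`, the two bound fitness values are 3 and 7, and the equal-weight result is 5.0.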
2.3. Benefit and Cost Functions
The cost of destroying targets by heterogeneous multi-UAVs consists of two aspects: (1) the threat cost incurred while the UAVs perform their missions; (2) the range cost of the UAVs in completing their own mission sets.
The survival probability of the ith UAV after passing the jth target is obtained from the probability that the jth target destroys the UAV. Therefore, the threat cost of a single UAV executing its assigned missions is defined in terms of these survival probabilities.
Assuming that obstacles are not taken into account, the UAV will tend to perform tasks closer to its position, resulting in lower fuel consumption. The range cost is expressed in terms of the farthest distance of all UAVs from the task concerned.
The benefit of destroying a target by heterogeneous multi-UAVs refers to the value of the destruction caused to the target. It is defined from the value of the target and the probability of destruction, so its magnitude reflects both the importance of the target and the execution capability of the UAV, and it guides the final allocation result toward maximum mission effectiveness during decision optimization. The benefit of destroying the target is thus expressed through the probability of destruction achieved by the ith UAV when it performs the given mission.
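The three quantities in this subsection can be illustrated with point values. This is only a sketch under stated assumptions: the survival probability is taken as the complement of the target's kill probability, the threat along a mission list is aggregated as the probability of being destroyed at least once, the range cost is a distance divided by the farthest UAV-to-task distance, and the benefit is the product of target value and destruction probability. All function names are hypothetical, and the paper works with interval-valued parameters rather than the crisp numbers used here.

```python
from math import prod

def survival_probability(p_kill):
    # Assumed relation: the UAV survives when the target fails to destroy it
    return 1.0 - p_kill

def cumulative_threat(p_kills):
    # Assumed aggregation: probability of being destroyed at least once
    # while passing every target in the UAV's mission list
    return 1.0 - prod(survival_probability(p) for p in p_kills)

def range_cost(distance, farthest_distance):
    # Assumed normalization by the farthest UAV-to-task distance
    return distance / farthest_distance

def destruction_benefit(target_value, p_destroy):
    # Benefit taken as target value weighted by destruction probability
    return target_value * p_destroy
```

Under these assumptions, a UAV that passes two targets with kill probability 0.5 each faces a cumulative threat of 0.75.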
2.4. Task Allocation Model
The task allocation problem of heterogeneous multi-UAVs in uncertain environments is modeled as a fuzzy multiconstraint model. First, the problem is transformed into a single-objective fuzzy constraint model by the linear weighted sum method, and the linear scale transformation method is used to transform each quantity into a value within [0, 1]. A weight vector is set to balance the influence of each factor on the allocation results; its three weights correspond to the threat cost of attacking the target, the flight cost, and the gain of destroying the target, respectively. Different weights reflect the different decision preferences of commanders and decision-makers. Therefore, based on fuzzy theory, the fuzzy constraint model for the heterogeneous multi-UAV task allocation problem in the above uncertain environment combines this weighted objective with the constraints described next.
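The linear weighted sum step can be sketched as below. The normalization by the maximum value is one common form of linear scale transformation and is an assumption here, as are the example weights and the choice to subtract the gain term so that a lower fitness is better; the source does not give the exact objective.

```python
def linear_scale(values):
    """Map non-negative values into [0, 1] by dividing by the largest one."""
    peak = max(values)
    if peak <= 0:
        return [0.0 for _ in values]
    return [v / peak for v in values]

def weighted_fitness(threat, flight, gain, weights=(0.4, 0.3, 0.3)):
    """Single-objective value from normalized threat cost, flight cost, and gain.

    The weights are hypothetical; costs are added and gain is subtracted
    under the assumption that the objective is minimized.
    """
    w_threat, w_flight, w_gain = weights
    return w_threat * threat + w_flight * flight - w_gain * gain
```

Raising the gain weight relative to the cost weights shifts the preferred scheme toward high-value targets, which is how the decision maker's preference enters the model.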
Equations (7)–(12) define the specific constraints for the execution of each target’s detection and attack mission. Equation (7) ensures that each mission is executed, while Equation (8) ensures that each mission is executed only once. Equation (9) specifies the range constraint of each UAV in terms of its maximum range. Equation (10) defines the values that the decision variable can take. Equation (11) specifies the type of mission for the target, which is either a reconnaissance mission or a strike mission; the strike mission can only be carried out after the reconnaissance mission has been completed for each target. Equation (12) relates the moments at which the UAV swarm conducts the reconnaissance and strike missions against the target. From the perspective of engineering application, a reconnaissance UAV should first be assigned to reconnoiter the target, and a strike UAV should then be assigned to destroy it. Therefore, a chronological order exists when the reconnaissance and strike missions are assigned to the same target, as expressed in Equation (12).
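A feasibility check in the spirit of these constraints might look like the sketch below. The schedule encoding (a list of `(uav, target, mission, time)` tuples with mission 0 for reconnaissance and 1 for strike), the type labels `"R"`, `"S"`, and `"RS"`, and the precomputed `route_length` dictionary are all assumptions introduced for illustration, not the paper's decision variables.

```python
def is_feasible(schedule, n_targets, uav_types, route_length, max_range):
    """Check an allocation against the constraints described in the text.

    schedule: list of (uav, target, mission, time); mission 0 = reconnaissance,
    1 = strike (hypothetical encoding).
    """
    allowed = {"R": {0}, "S": {1}, "RS": {0, 1}}
    seen = {}
    for uav, target, mission, time in schedule:
        if mission not in allowed[uav_types[uav]]:
            return False              # UAV type cannot perform this mission
        if (target, mission) in seen:
            return False              # mission assigned more than once
        seen[(target, mission)] = time
    for target in range(n_targets):
        if (target, 0) not in seen or (target, 1) not in seen:
            return False              # a mission is left unassigned
        if seen[(target, 0)] >= seen[(target, 1)]:
            return False              # strike must follow reconnaissance
    # Every UAV must stay within its maximum range
    return all(route_length[u] <= max_range[u] for u in route_length)
```

A metaheuristic can use such a check either to reject candidate solutions or to add a penalty to their fitness; the latter is consistent with the very large worst-case fitness values that infeasible runs produce.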
4. Discussion
In this section, the effectiveness and superiority of the proposed dynamic task allocation strategy in an uncertain environment are verified through numerical simulation. Experiment 1 confirms that the proposed approach works well in an uncertain setting. Experiment 2 uses several sets of simulations with varying loads to confirm the strategy’s robustness. Experiment 3 confirms the superior algorithmic performance of 4DI-GWO by comparing the simulations of several algorithms under challenging circumstances. Experiment 4 uses multiple tests to confirm the strategy’s dynamic allocation performance. Experiment 5 validates the efficacy of the strategy in environments with varying levels of uncertainty.
The parameters of the algorithm are set as follows: maximum number of iterations MaxIter = 500; population size PS = 30. All algorithms and test programs are simulated using MATLAB 2021a in this study, and the hardware is an Intel(R) Core(TM) i5-10300H CPU @ 2.50 GHz (Intel, Santa Clara, CA, USA), RTX 2060 14 GB, DDR4 16 GB, 512 GB SSD. Symbols and simulation parameters are summarized in Appendix A.
Experiment 1
To test the ability of the 4DI-GWO algorithm to solve the dynamic task allocation scheme for heterogeneous multi-UAVs with uncertain information, such as mission execution time, UAV-to-target relative distances, and the survival probability of the UAVs, this paper assumes that four UAVs conduct reconnaissance and strike missions against 10 enemy targets in a 10 km × 10 km mission area, namely M = 4, N = 10. The dynamic time slice is set to w = 1. The UAV and mission attribute tables are shown in Table 1 and Table 2, respectively.
In Table 1, RS in UAV type stands for the Reconnaissance–Strike UAV. The information on the UAVs is defined using real numbers, considering that existing UAV technology provides interaircraft networking and communication capabilities.
In uncertain environments, parameters related to the target are expressed as binary interval numbers. These parameters include the probability of a ground target striking the UAV, the geographic location of the target, and the time required to perform different tasks. The fitness curve of the 4DI-GWO solution process is depicted in Figure 6.
Table 3 displays the results of task allocation for heterogeneous multi-UAVs.
As shown in Figure 6, the algorithm converges quickly owing to the introduction of the four-dimensional information strategy, the nonlinear factor convergence strategy, and the weighted combinatorial position update strategy. A feasible solution is obtained at iteration t = 26. Additionally, by incorporating the mutation operator strategy, the 4DI-GWO algorithm escapes local optima several times during the later iterations (t = 150 and t = 180), which improves its global optimization capability.
Table 3 shows the mission sequence and flight information for each UAV. All ground targets were assigned and executed in the order of reconnaissance followed by strike. Additionally, the highest-value target was assigned to two Reconnaissance–Strike UAVs with better capability, which is desirable.
In summary, this experiment shows that the 4DI-GWO algorithm is feasible for solving the heterogeneous multi-UAVs’ task assignment problem under conditions of uncertain target location, uncertain target strike probability, and uncertain mission execution time.
Experiment 2
To evaluate the impact of changes in the number of UAVs and targets on the performance of the 4DI-GWO algorithm, this paper conducts tests by manipulating the number of UAVs and targets separately. Specifically, the number of UAVs is increased while keeping the number of targets constant, and vice versa. Experiment 2 allows us to assess the performance of the 4DI-GWO algorithm under different conditions.
Figure 7a,b shows the curve of the fitness value during the solution process.
Figure 7a shows that as the number of UAVs increases, the fitness value also increases at the beginning of the iteration. However, as more UAVs are added, more possibilities arise, causing the blue solid line to converge to a lower fitness value through more rounds of iterative optimization; that is, more UAVs can yield a better allocation scheme. Furthermore, discounting the late escapes from local optima, the convergence times of all three settings are approximately equal. This indicates that the 4DI-GWO algorithm is not only effective but also robust to changes in the number of UAVs.
Figure 7b shows that the convergence speed of the 4DI-GWO algorithm decreases gradually as the number of targets increases. More targets result in more flight attrition and unknown strike risks. In addition, the average number of tasks assigned to each UAV also increases, increasing both the task list completion time and the flight cost of the UAV. Therefore, the final fitness value of the proposed algorithm also increases with the number of targets.
In general, Figure 7a,b demonstrates that the 4DI-GWO algorithm can effectively solve task allocation problems in uncertain environments with favorable convergence speed, regardless of the number of targets or UAVs. The proposed 4DI-GWO algorithm can obtain feasible solutions under all the above settings, and the algorithm’s generalization performance is confirmed to a certain extent.
Experiment 3
To assess the superiority of the proposed algorithm, the most complex and challenging task setup, consisting of 10 UAVs and 16 ground targets, is chosen in this experiment. The proposed 4DI-GWO algorithm and other classical optimization algorithms, namely PSO [35], GWO [32], IGWO [33], GA [36], and ACO [37], are each run independently 100 times under this task load.
The statistics of the final mission assignment results are recorded in Table 4. The best results are shown in bold. BST, AVG, WST, and STD denote the best, average, worst, and standard deviation of the fitness value, and AVGIter denotes the number of iterations needed to obtain the first feasible solution, all over 100 randomized runs. AVGDis and AVGComTime are the average of all UAV travel distances and the average of mission list completion times over the 100 randomized runs, respectively.
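The fitness statistics can be computed from a list of final fitness values, one per run, as in the sketch below. It assumes a minimization problem, so the best value is the minimum, and it uses the sample standard deviation; the source does not say whether the sample or population form was used. The function name `summarize_runs` is hypothetical.

```python
import statistics

def summarize_runs(final_fitness):
    """Summarize final fitness values from repeated independent runs."""
    return {
        "BST": min(final_fitness),               # best run (lowest cost)
        "AVG": statistics.mean(final_fitness),   # average over runs
        "WST": max(final_fitness),               # worst run (highest cost)
        "STD": statistics.stdev(final_fitness),  # sample standard deviation
    }
```

A very large WST alongside a moderate AVG usually signals that a few runs ended on infeasible, heavily penalized solutions.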
Table 4 shows that the proposed 4DI-GWO algorithm outperforms the other five algorithms under the difficult task load setting in terms of the best fitness value (BST), average fitness value (AVG), and worst fitness value (WST), demonstrating its superior optimization capability. Although the number of iterations at convergence is slightly higher for the 4DI-GWO algorithm than for GWO and IGWO, this is because 4DI-GWO searches over a wider range and has more initial solution vectors. In other words, only a few more iterations are needed to carry out a better global search over a wider range, which is an acceptable trade-off that reflects the algorithm’s strategy for escaping local optima and its global convergence capability. Notably, in some instances, the algorithms other than IGWO and 4DI-GWO fail to locate a feasible solution, as indicated by their excessively large worst fitness values. Additionally, the standard deviation (STD) of the fitness value of the 4DI-GWO algorithm is lower than that of the IGWO algorithm, which demonstrates the stability of 4DI-GWO. The allocation schemes produced by the proposed algorithm also outperform those of the other algorithms in terms of both flight range and flight time.
Figure 8 shows the curves of the average fitness values of the six algorithms under the above task load setting. The PSO algorithm converges to a local optimum early and has a higher fitness value than the other algorithms because it cannot find a feasible solution. Similarly, the ACO algorithm is limited by its global search ability. Although the GA is less likely to fall into local optima, its optimization speed is slower. Compared with other intelligent algorithms, the classical GWO algorithm has strong optimization ability on its own. The IGWO algorithm, which adds the DLH search strategy, converges faster and achieves a lower final fitness value. The proposed 4DI-GWO algorithm not only achieves a convergence speed similar to that of the IGWO algorithm in the early stages through the combination of various improvement strategies but also exhibits a strong global search capability. Additionally, its final average fitness value is lower than that of the other algorithms.
It is well demonstrated that the 4DI-GWO algorithm, which incorporates multiple improvement strategies, outperforms the other five algorithms in solving the heterogeneous multidrone task assignment problem in uncertain environments.
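For reference, one iteration of the classical GWO baseline used in the comparison can be sketched as follows. This is the standard textbook update with a linearly decreasing control parameter, not the 4DI-GWO algorithm; the function names and the list-of-lists population layout are assumptions, and the population must contain at least three wolves.

```python
import random

def linear_a(t, max_iter):
    """Classical control parameter, decreasing linearly from 2 to 0."""
    return 2.0 * (1.0 - t / max_iter)

def gwo_step(wolves, fitness, a, rng=random):
    """One classical GWO position update for a minimization problem."""
    leaders = sorted(wolves, key=fitness)[:3]   # alpha, beta, delta
    updated = []
    for wolf in wolves:
        position = []
        for d, x in enumerate(wolf):
            candidates = []
            for leader in leaders:
                A = 2.0 * a * rng.random() - a  # step coefficient in [-a, a]
                C = 2.0 * rng.random()          # leader weighting in [0, 2]
                distance = abs(C * leader[d] - x)
                candidates.append(leader[d] - A * distance)
            # New coordinate is the mean of the three leader-guided moves
            position.append(sum(candidates) / 3.0)
        updated.append(position)
    return updated
```

When `a` reaches 0, every wolf moves to the mean of the three leaders, which illustrates why the classical algorithm tends to lose diversity late in the run and why strategies for escaping local optima matter.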
Next, this paper evaluates the time and space resources required by the 4DI-GWO algorithm to process the problem by extrapolating the computational complexity, a crucial indicator of algorithm efficiency.
In terms of time complexity, generating the initial population in Steps 1–2 takes O(N), where N here denotes the population size (PS in the parameter settings above). Assuming that the time complexity of the performance metric f is T(f), evaluating the individuals in Step 3 takes O(N·T(f) + N). After that, updating the iterative optimal solution in Steps 4–6 takes O(MaxIter·N·T(f)), where MaxIter is the maximum number of iterations. The time complexity of 4DI-GWO is therefore O(N) + O(N·T(f) + N) + O(MaxIter·N·T(f)) = O(MaxIter·N·T(f)). This is equivalent to that of the classical GWO algorithm, PSO algorithm, and GA and lower than that of the ACO algorithm, which additionally updates node information.
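A quick count of fitness evaluations shows why the iterative term dominates. The sketch assumes one fitness evaluation per individual at initialization and one per individual per iteration, which is a simplification; the function name is hypothetical.

```python
def fitness_evaluations(population_size, max_iter):
    """Count fitness evaluations under a one-evaluation-per-individual assumption."""
    initial = population_size               # evaluating the initial population
    iterative = max_iter * population_size  # evaluations inside the main loop
    return initial + iterative
```

With the settings reported for the experiments (population size 30 and 500 iterations), this gives 30 + 15,000 = 15,030 evaluations, so the initialization cost is negligible next to the main loop.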
The spatial complexity of the 4DI-GWO algorithm refers to the storage space required during algorithm operation. Specifically, the 4DI-GWO algorithm typically only needs to store the current best solution, candidate solutions, and some data structures related to the search and optimization process. In implementation, the spatial complexity of the 4DI-GWO algorithm is approximately O(3N), where 3N signifies the candidate solution’s expansion to three dimensions as a consequence of the four-dimensional grey wolf information strategy. While the transformation does not significantly alter the solution time, it does affect the storage space. Although the space complexity of the proposed algorithm is larger than that of the classical GWO algorithm, it remains within an acceptable range.
Experiment 4
To verify the effect of the number of dynamic time slices on the effectiveness of the task assignment scheme and the flight condition, this experiment chooses to change the number of dynamic time slices and performs the simulation verification based on the eight abovementioned types of task loads.
Figure 9a–h shows the three fitness values and task completion times for 10 random runs of the 4DI-GWO algorithm with different numbers of time slices and different task loads.
Figure 9a–h shows that as the number of time slices increases, the time cost decreases under all eight task load settings, resulting in a general downward trend in the fitness value of the allocation scheme. This is because the dynamic allocation algorithm more frequently analyzes the observed allocation results, the target threat level, UCAV destruction, and the force information of each party, and the 4DI-GWO algorithm is well suited to optimizing such complex situations. As the number of time slices increases, mission decision making for multi-UAVs in uncertain environments becomes more refined.
Comparing Figure 9a,e–g, it is evident that the dynamic time-slice setting can significantly enhance the optimization effect and reduce the overall task elapsed time when a certain number of UAVs is available. Furthermore, Figure 9h shows that increasing the number of time slices yields a markedly lower fitness value and reduces the elapsed time of the complex and computationally intensive task, despite the large number of targets involved. The dynamic time-slice setting results in a more efficient task allocation scheme.
Experiment 5
To ascertain the impact of varying degrees of uncertainty in environmental information on the efficacy of the proposed algorithm, Experiment 5 employs a simulation validation approach, utilizing environmental information with distinct degrees of uncertainty. These degrees of uncertainty encompass deterministic, slightly uncertain, moderately uncertain, and extremely uncertain environments.
In this experiment, the boundaries of the intervals describing the critical parameter information in the uncertain environment are borrowed from the extreme values of the corresponding ternary fuzzy number. In this way, the uncertain parameter information can be expressed in terms of the degree of uncertainty. The intermediate number of the ternary fuzzy number is set as a constant value in the experiment to keep the control variables fixed. To ascertain the impact of the degree of uncertainty associated with the uncertain information on the efficacy of the task assignment scheme, Figure 10 presents the fitness value and the average task completion time of the task assignment scheme of the 4DI-GWO algorithm for 10 random runs in the M = 4, N = 10 case with the degree of uncertainty equal to 0, 0.3, 0.6, and 0.9, respectively.
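As a rough illustration of how a degree of uncertainty can parameterize an interval around a fixed intermediate value, the sketch below assumes a symmetric relative form, with lower and upper bounds equal to the intermediate value minus and plus the degree of uncertainty times its magnitude. This form, the function name `uncertain_interval`, and the sample intermediate value are assumptions made for illustration and may differ from the exact relation used in the experiment.

```python
def uncertain_interval(mid, delta):
    """Return (lower, upper) bounds around a fixed intermediate value.

    Assumed symmetric relative form: mid -/+ delta * |mid|.
    `delta` is the degree of uncertainty, expected in [0, 1].
    """
    if not 0.0 <= delta <= 1.0:
        raise ValueError("degree of uncertainty must lie in [0, 1]")
    half_width = delta * abs(mid)
    return (mid - half_width, mid + half_width)


# Hypothetical intermediate value, held constant across the four settings.
intermediate = 10.0
intervals = {d: uncertain_interval(intermediate, d) for d in (0.0, 0.3, 0.6, 0.9)}
```

Under this assumed form, a degree of uncertainty of 0 collapses the interval to a single point, which corresponds to the deterministic environment, and the interval widens monotonically as the degree of uncertainty grows.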
As illustrated in Figure 10, when the UAV swarm is situated in a deterministic environment, namely when the degree of uncertainty is 0, the task allocation is relatively straightforward and readily converges to the optimal solution with the minimum fitness value and task completion time. As the degree of uncertainty increases, the problem becomes more challenging to solve, which delays the start of the task and in turn increases both the average task completion time of the UAVs and the fitness value of the task allocation scheme. A comparison of the deterministic and extremely uncertain environments reveals a significant difficulty gap. This finding corroborates the assertion that the proposed algorithm can perform task assignments in the presence of diverse levels of uncertainty and is resilient to fluctuations in the uncertainty of key parameters.
Experiment 6
To determine the maximum number of targets that the proposed allocation strategy can handle, Experiment 6 increases the number of targets beyond that of the existing simulations while keeping the number of UAVs unchanged. The simulation results give the upper limit of the number of targets that the UAV swarm can handle in the current posture, M = 4.
It is evident from Figure 11 that when the number of targets increases from 11 to 19, the proposed strategy reaches an optimal configuration in every case, and the fitness values converge stably below 120. Nevertheless, when the number of targets is increased to 20, the algorithm’s optimized fitness value remains at approximately 180, indicating a local optimum. The current UAV swarm is therefore unable to handle 20 targets: the allocation strategy sacrifices the fitness value to satisfy the multiple constraints in the performance metrics. This also demonstrates that the number of UAVs should be selected to approximately align with the number of targets and that the number of targets handled by the current UAV swarm should remain below 20.