1. Introduction
With the development of unmanned aerial vehicles (UAVs) towards intelligence, autonomy, and miniaturization, the types of UAVs are increasing, and their mission fields are continuously expanding, with a wider range of mission types [
1]. In recent years, driven by the military competition among major countries and the surge of artificial intelligence technology, swarm combat has received widespread attention and has excelled on the battlefield. Swarms are typically low-cost and have a single function, and can be obtained through appropriate modifications based on existing mature technologies. In combat, they can rely on the advantages of quantity to achieve victory in local conflicts. At the same time, the system’s robustness is very strong. Even if some UAVs are shot down, the system can still be reconfigured to continue executing combat missions. This subversive combat model has drawn the attention of various countries, with many nations initiating projects and research on swarm combat [
2,
3,
4].
Faced with the offensive of a large number of low-cost swarms, there is no very effective solution. Even “low, slow, and small” swarms without intelligence pose significant challenges to the existing air defense system when conducting counter-swarm operations. Currently, the research on anti-swarm technology significantly is lagging behind the development of swarm technology, and there is an urgent need to make breakthroughs to address the subversive combat method posed by UAV swarm operations. Anti-swarm technologies are primarily categorized into three areas: detection and early warning, jamming and deception, and destructive interception [
4]. Detection and early warning primarily involve technologies such as radar, electro-optical, infrared, radio, and acoustic detection and tracking systems. Jamming and deception are considered soft-kill methods and include techniques such as electromagnetic interference, cyber-attacks, information deception, and camouflage deception. Hard-kill methods generally employ air defense missiles, laser weapons, high-power microwave weapons, artillery shells, and swarm-anti-swarm tactics. The use of air defense missiles for interception is too costly. Meanwhile, laser weapons, high-power microwave weapons, and artillery shells are short-range destructive means with a limited strike capacity and suboptimal effectiveness. They are also affected by the weight of the platforms, which can make deployment difficult. Additionally, there is the risk of further damage caused by the crash and fall of the UAVs. The swarm-anti-swarm method is a novel approach that encompasses many advanced technologies, including intelligent decision-making and control [
5]. It has a natural advantage in anti-swarm combat. This method can engage incoming swarms with as many strikes as possible at medium to short range, using either suicidal or recoverable UAVs [
6]. For the remaining swarms, short-range artillery, directed energy weapons, and other means can be employed for attack. This approach allows for a more efficient and targeted response to the threat posed by UAV swarms.
For the swarm-anti-swarm approach, systematic research is currently lacking. One line of research concentrates on multi-agent systems. Simonjan et al. [
7] have employed the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm to model two multi-agent deep reinforcement learning strategies, but the number of agents is restricted to seven due to the suboptimal performance of multi-agent learning in large groups. SEO et al. [
8] have based their threat modeling on Partially Observable Markov Decision Processes (POMDPs), aiming to asymmetrically maximize defensive advantage and attack complexity while minimizing the success rate of network attacks. These studies, similar to intelligent air combat, mainly focus on the action confrontation between UAVs [
9,
10,
11]. Another area of research is more focused on the construction of swarm defense systems, encompassing reconnaissance, interception, and striking capabilities. Brust et al. [
12] have proposed a drone defense system for intercepting and escorting malicious drones outside the flight zone. The defense drone swarm can self-organize defense formations upon detecting intruders and hunt down malicious drones in the form of a network swarm. This is a system where multiple UAVs surround and capture one UAV. Li et al. [
13] have investigated the optimization problem of deploying air defense systems against reconnaissance swarms. Shi et al. [
3] at Zhejiang University have developed an anti-UAV system, which integrates various passive surveillance technologies for the detection, localization, and radio frequency interference of UAVs.
In real scenarios of swarm attacks on targets, the offensive actions of UAV swarms differ from those in air combat or missile attacks. The actual attack process often relies on a large number of small, slow, and low-flying UAVs that directly assault the target, with the swarm not demonstrating a high degree of intelligence. For the effective method of swarm-anti-swarm, there is a lack of research that closely mirrors real scenarios. Previous research on air combat is not applicable because the small, slow, and low-flying UAVs are not equivalent to the anti-swarm UAVs, including aspects such as the performance, cost, and level of intelligence. The scenario of high-speed anti-swarm UAVs hard-killing small, slow, and low-flying UAVs is akin to a “cavalry assault.” In this scenario, the anti-swarm UAVs rapidly penetrate the swarm at mid-short range and, while satisfying constraints, aim to strike as many UAVs as possible. However, research on this scenario is limited. Studies that are relevant include UAV inspection scheduling, task allocation, and weapon–target assignment [
14], which provide a reference but still have significant differences and require a reassessment of the issue.
This paper conducts research on the target allocation for high-speed, recoverable anti-swarm UAVs during a single penetration of an approaching swarm. The anti-swarm UAV is a tail-sitter vertical takeoff and landing (VTOL) configuration, with a speed greater than that of the approaching swarm UAVs, subject to the constraint of turning radius R. The difficulties of this problem include the following: (1) the difficulty of optimized modeling: the maximum number of feasible solutions is unknown, and it is unknown which solutions are feasible, with the two being coupled; (2) the coupling of constraints and the problem process: when searching for the optimal solution, it is also necessary to simultaneously judge whether the trajectory satisfies the constraint of the turning radius; (3) the sparsity of solutions: due to the constraints, the optimal feasible solutions are sparse, as they consist of a set of points whose coordinates satisfy the constraints simultaneously and in the largest number; (4) the difficulty of constraint modeling: trajectory optimization is necessary, otherwise, feasible optimal solutions might be discarded.
Addressing the aforementioned challenges, this paper first analyzes the scenario of a single anti-swarm UAV penetrating a stationary two-dimensional distributed swarm and then extends the study to three-dimensional cases, multiple anti-swarm UAVs penetrating the swarm, and the impact of the movement of the swarm. Finally, a qualitative discussion on solutions for the movement of swarming drones is provided. By analyzing the scale between the turning radius and the spatial distribution of the swarm, a baseline for a single anti-swarm UAV to penetrate is solved using the gradient descent method, filtering out a small number of feasible solution spaces, which greatly reduces the sparsity problem. During the solution process, diversity is designed into the initial population, and an elite strategy is introduced to prevent the overall degradation of the population performance. Each time a feasible point is selected, a judgment must be made on whether the flight trajectory satisfies the constraint of the turning radius. To address this issue, this paper employs a piecewise cubic spline interpolation method to optimize the flight trajectory, minimizing the maximum curvature. The reciprocal of the optimized maximum curvature is used as the criterion for judging whether the turning radius constraint is met. Ultimately, the sequence of swarm UAVs to be engaged by the anti-swarm UAVs and their corresponding trajectories can be obtained.
The outline of this paper is as follows:
Section 2 describes the hard-kill anti-swarm UAVs and their operational styles.
Section 3 presents the process of the improved optimization genetic algorithm and the preparation conditions for optimization, including the calculation of the baseline and the optimization of trajectories.
Section 4 provides the optimization results, ranging from single to multiple anti-swarm UAVs striking, and on this basis, analyzes the impact of the strike frequency, swarm density, and turning radius constraints on the strike effectiveness.
Section 5 qualitatively discusses the solutions for dealing with the movement of the swarm.
2. Problem Description
The anti-swarm UAV is designed with a tail-sitter configuration, capable of vertical takeoff and landing, and is recoverable, which facilitates flexible deployment, a low cost, and superior performance compared to the incoming low, slow, and small swarm UAVs, as illustrated in
Figure 1. We assume that the maximum speed of the anti-swarm UAV is significantly greater than that of the incoming low, slow, and small UAVs, with a constant rate of speed, and that the coordinates of the incoming swarm are known. Due to the higher speed of the anti-swarm UAVs, collisions are intended to effectively destroy incoming drones. Consequently, the flight path is significantly influenced by the turning radius constraints.
Figure 2 illustrates the coordinate distribution of the incoming swarm with the blue dots (in a two-dimensional scenario), while the red circles depict the scale of the turning radius for our anti-swarm UAVs. The larger red circle represents the correct scale, and the smaller, semi-transparent circle indicates an incorrect scale. Given the spacing relative to the incoming swarm, the anti-swarm UAVs focus on “stabs”; hence, the minimum turning radius is relatively large. If there is no turning radius limit, the problem will resemble the Traveling Salesman Problem.
The problem description and optimization process for the three-dimensional scenario are similar to those for the two-dimensional scenario, but the mathematical models for the calculation of the baseline and trajectory optimization are slightly different and require further extension. The schematic diagram for the three-dimensional case is shown in
Figure 3.
After analyzing the scale, we can obtain a basic description of the problem: N anti-swarm UAVs set off from the starting point to penetrate the incoming swarm. The objective is to strike as many drones as possible in a single attack while satisfying the constraint of the turning radius.
3. Modeling and Optimization Method
This section addresses the challenges posed by hard-kill anti-swarm UAVs and provides an overall framework for an improved genetic algorithm that integrates various strategies, along with the necessary preparatory work.
3.1. Improved Genetic Algorithm with Multi-Strategy Integration
Genetic algorithms are heuristic search algorithms that simulate the principles of natural selection and genetics. They are known for their strong global search capabilities, flexible pattern operations, high robustness, and natural parallelism. However, they also have drawbacks such as slow convergence, complex parameter settings, genetic drift, and a tendency to become stuck in local optima. To enhance the efficiency of genetic algorithms and adapt them to specific problems, it is often necessary to improve or integrate them with other strategies.
Given the assumption that the coordinates of the swarm UAVs are known, in order to determine the number of points that the anti-swarm UAV can strike with a single penetration of the swarm and the corresponding coordinates, Equation (1) is established in the form of the following solution:
The form of the population is then as follows:
The matrix of position coordinates for the two-dimensional swarm is defined as
P:
Each row of
represents a possible combination of swarm coordinates during the search process, which is the combination that the anti-swarm UAV can strike in a single penetration. These combinations need to be interpolated using interpolation methods, and the curvature of the curve is judged. If the reciprocal of the curvature, which is the turning radius, is smaller than the minimum value, it indicates that the constraint is not satisfied, and the solution needs to be discarded. In summary, the design of the fitness function is as follows:
where
Cons represents the constraint on the curvature radius, and if it meets the constraint, it is set to 1; otherwise, it is set to 0.
represents the number of strike points, and it is desired to have as many as possible while meeting the constraints. Therefore, we need to process the population further. As shown in the first row of
Figure 4, the population requires a random distribution of fewer individuals, as having too many is likely to make it difficult to meet the constraint of the turning radius, and the iterative convergence process should proceed from few to many, rather than from many to few. Secondly, as shown in the second row of
Figure 4, there are a large number of completely random 0–1 sequences to increase the diversity of the population. Finally, as shown in the third row of
Figure 4, we also need to force the number of solutions to be no less than 2, because a straight line always meets the curvature condition. To further solve the problem of sparse solutions converging slowly, an elite strategy is introduced. As shown in
Figure 4, in each iteration, the population with the
is replicated in proportion to the initial population to prevent the optimization process from diverging. This method can effectively solve the problem of sparse solutions being difficult to converge.
The overall framework of the improved genetic algorithm is depicted in
Figure 5. We first significantly reduce the solution space through the calculation of the baseline, then generate the initial population based on the principles shown in
Figure 4, and proceed into the iterative process. For each iteration, possible combinations are interpolated using piecewise cubic spline interpolation, and the spline curves are optimized. The constraints of the turning radius are then checked to determine the value of the fitness function. Following this, the selection, crossover, and mutation stages are performed. Finally, an elite strategy is employed to forcefully retain the population with the highest fitness, and the process enters the next iteration until the specified number of iterations is reached.
3.2. Baseline Calculation
Due to the low cost of UAVs, incoming swarms are often numerous, as illustrated in
Figure 2, with an assumed number of 300. The problem of maximizing the number of drones that an anti-swarm UAV can strike in a single penetration of the swarm while satisfying the constraint of the turning radius is a complex optimization problem with a vast solution space, and the solutions are extremely sparse. Assuming that a single UAV can strike a maximum of 10 drones, the possible choices would be from a pool of
drones, but whether striking these 10 drones would satisfy the constraint of the turning radius is unknown. In reality, we do not yet know the maximum number of drones that can be struck in a single strike.
To reduce the complexity of the solution, we need to screen the points. From the scale analysis in the problem description, it is known that the anti-swarm UAV penetrates the swarm at a high speed and is constrained by the turning radius, which does not allow it to continuously circle within the swarm. Instead, it “stabs” through, although not necessarily in a straight line, and the degree of curvature is limited. Therefore, we can draw a baseline with screening the swarm coordinates near the baseline as the solution space. Intuitively, it is felt that in areas where the point set has a high distribution density, passing through approximately in a straight line can traverse more points. Another important factor is the depth, because even if the density is not high, if the depth is sufficient, many points can still be traversed. Combining these two points, the two criteria for selecting the baseline are density D and depth L. These two factors can be further integrated in an integral form, which can be expressed as .
3.2.1. Baseline for Two-Dimensional Scenario
For a two-dimensional distribution of points, the baseline can be obtained using the gradient descent method, with the specific calculation steps as follows:
- (1)
Divide the grid and calculate the density distribution of the point set, interpolating to obtain the function .
- (2)
Define the straight line, and use Monte Carlo sampling to generate points in the range [x, y]. The coefficients [a, b] are treated as optimization variables.
- (3)
Apply the gradient descent method to find a set of [a, b] that maximizes the integral of the objective function along the straight line.
The grid division is determined by the interquartile range (
IQR) criterion for histogram width, which is a method that can better extract the characteristics of the data. First, the
IQR is calculated based on the distribution of the point set spacing, as follows:
Among them,
Q1 is the first quartile, meaning that 25% of the data in the dataset is less than or equal to this value;
Q3 is the third quartile, indicating that 75% of the data in the dataset is less than or equal to this value. The grid width (
GW) is as follows:
The final solution is [a, b] = [0.51, 5.34]. The density distribution and the baseline are shown in
Figure 6.
After obtaining the baseline, we select points close to the baseline, and points far from the baseline do not enter the single optimization process. With the coefficients of the baseline known, we calculate the distance from each point to the line using the point-to-line distance formula. The number of points selected to enter the optimization process should be several times the estimated impact value, but it should not be too large, otherwise, the solution space will still be huge. This article takes a value of 80. We then choose 80 of these points for single optimization, reducing the solution space from 300 to 80, as shown in
Figure 7.
3.2.2. Baseline for Three-Dimensional Scene
In three-dimensional cases, the calculation of the baseline is similar to the process in two-dimensional cases, but the mathematical computations are more complex. The parametric equation of a line in space is as follows:
For a plane line, the variables to be optimized are the coefficients [a, b], while for a space line, there are six variables to be optimized, specifically, [
x0,
y0,
z0] and [
l,
m,
n]. The remaining process is similar to the two-dimensional procedure, and the final solution obtained is [
x0,
y0,
z0] = [3.34, 6.18, 15.85] m and [
l,
m,
n] = [51.45, 4.47, 22.65]. The spatial density distribution and the baseline are shown in
Figure 8.
Similarly, 80 points are selected for single optimization, reducing the solution space from 300 to 80, as shown in
Figure 9.
In both the two-dimensional and three-dimensional cases, the benefit of filtering the point set through the baseline is not only in the significant reduction in the solution space but also in making the optimization process more likely to converge. This is because when searching for feasible solutions within the point set, it is necessary to judge the curvature of the trajectory. After filtering, the point set is distributed around the baseline, and when searching for feasible solutions, the likelihood of satisfying the curvature constraints is much greater than searching within the initial point set, making it easier to find solutions. In the initial point set, the set of feasible solutions that satisfy the curvature constraints is extremely sparse, which can easily lead to divergence in the solution process.
3.3. Trajectory Optimization
An important difficulty of the problem lies in the coupling of constraints and the solution process; i.e., when searching for the optimal solution, it is also necessary to simultaneously judge whether the trajectory satisfies the curvature constraints. Suppose a sequence of 10 points is selected and after curve interpolation, it is found that these 10 points do not meet the curvature constraints; then, this solution will be discarded. However, it is also possible that the interpolation curve is not suitable and the curve has not been optimized. By using an appropriate curve and optimizing the curvature of the curve, it may be possible to satisfy the curvature constraints. Therefore, we need to adopt suitable interpolation methods and necessary curvature optimization to ensure that we can obtain a better flight trajectory while also ensuring that feasible solutions are not eliminated.
There has been extensive research on trajectory optimization, with fitting methods based on B-splines, Bezier curves, Dubins curves, etc., but there is less research on interpolation. In this study, the anti-swarm drone needs to collide with the swarm drones, which is an interpolation problem rather than a fitting problem. It has been assumed that the anti-swarm drone moves at a constant speed; therefore, the first and second derivatives of the velocity are continuous. Considering this, we plan to use piecewise cubic spline interpolation to handle the trajectory optimization.
A piecewise cubic spline interpolation function is given on the interval [m, n] with a set of node divisions,
, satisfying the following conditions: (1) The function has continuous first and second derivatives on [m, n]; (2) The function is a polynomial of degree no greater than three on each subinterval. Then, the piecewise function
is called the cubic spline function for the given set of nodes. If the spline function satisfies the interpolation conditions
then the cubic spline function
is referred to as the cubic spline interpolation function. It must also satisfy the following conditions:
In the formula, . Equations (9)–(11) are the continuity conditions that the cubic spline function must satisfy. is an interpolation polynomial of degree no greater than three in each subinterval, so there are four unknown parameters in each subinterval. The number of subintervals is n, which means the total number of unknown parameters is 4n. From Formulas (8) to (11), it can be seen that the interpolation conditions and spline conditions provide only 4n − 2 conditions. Therefore, it is necessary to add two known conditions to solve for the unknown parameters. Usually, the known conditions are increased by imposing endpoint constraints, which are the boundary conditions. Commonly used boundary conditions include natural boundaries, fixed boundaries, and clamped boundaries, among others, or custom boundaries can be defined to adjust the curve. In this problem, we need to find a set of boundary conditions such that the maximum curvature of the spline curve over all the intervals is minimized.
3.3.1. Trajectory for Two-Dimensional Scene
The formula for calculating the curvature of a two-dimensional plane curve is as follows:
Taking a two-dimensional space curve as an example, we are given a set of random points. We use the particle swarm optimization algorithm to optimize this problem. The specific optimization process is similar to the general particle swarm optimization process. Referring to reference [
15], after obtaining the optimized boundary conditions, we compare the resulting curve with the curves of two conventional boundary conditions, as well as the comparison of the curvatures of each curve, as shown in
Figure 10.
After particle swarm optimization, it can be ensured that the maximum curvature is minimized, which means the minimum turning radius of the trajectory is maximized. When assessing the feasibility of the curves generated during the search process, using the results after optimizing the curvature can prevent the search process from discarding feasible optimization solutions.
3.3.2. Trajectory for Three-Dimensional Scene
The expression of a spatial curve is much more complex than that of a plane curve. A spatial line is obtained by the intersection of two surfaces in space. The general equation of a spatial curve is
Another way to express this is by using parametric equations in the following form:
The expression forms of Equations (13) and (14) are not easily extended from the optimization method we use for the two-dimensional problem. In the two-dimensional case, by optimizing the endpoint states of the piecewise cubic spline function, the maximum curvature of the spline curve is minimized. The expression of the two-dimensional plane curve is
. If one wishes to extend this to a three-dimensional space curve, the expression of the space curve needs to be improved. As shown in Equation (14),
t can be represented as
through the inverse function. Therefore, Equation (13) can be converted into a parametric equation similar to Equation (15):
By further setting
,
, Equation (15) can be represented as a set of two plane curves and a line connected by a parameter in the form of a parametric equation:
The curvature of the space curve can be calculated using Equation (17):
Wherein
t and
T are the tangent vector and acceleration vector of the curve, respectively.
denotes the vector cross product, and
represents the magnitude of a vector. In Equation (16),
and
are the two-dimensional planar expressions, which can be controlled by the states of the starting and ending points. After the transformation by Equations (14)–(17), the spatial curve can be controlled by two sets of starting and ending points, i.e., four parameters
. By substituting the parameters into Equation (16) and then using Equation (17), the curvature of the spatial curve can be calculated. By adopting an optimization method similar to the two-dimensional process, an optimized spatial piecewise cubic spline interpolation curve can be obtained, with the minimum maximum curvature in space. Given a set of random points, the curve with optimized boundary conditions is compared with two conventional boundary condition curves, along with the comparison of their curvatures, as shown in
Figure 11. This result can be used to determine whether the turning radius meets the constraints in the optimization process of three-dimensional trajectory problems.
After the extension of the optimization of the planar curve curvature, the optimization results of the spatial curve curvature were obtained. Its function is similar to that in the two-dimensional case, both aiming to ensure the trajectory with the minimum maximum curvature.