Next Article in Journal
Crowd Density Estimation via Global Crowd Collectiveness Metric
Next Article in Special Issue
Improved Grey Wolf Algorithm: A Method for UAV Path Planning
Previous Article in Journal
Visualization of Aerial Droplet Distribution for Unmanned Aerial Spray Systems Based on Laser Imaging
Previous Article in Special Issue
A Pseudo-Exponential-Based Artificial Potential Field Method for UAV Cluster Control under Static and Dynamical Obstacles
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multi-Batch Carrier-Based UAV Formation Rendezvous Method Based on Improved Sequential Convex Programming

by
Zirui Zhang
1,
Liguo Sun
1 and
Yanyang Wang
1,2,*
1
School of Aeronautic Science and Engineering, Beihang University, Beijing 100191, China
2
Jiangxi Research Institute, Beihang University, Nanchang 330096, China
*
Author to whom correspondence should be addressed.
Drones 2024, 8(11), 615; https://doi.org/10.3390/drones8110615
Submission received: 9 September 2024 / Revised: 16 October 2024 / Accepted: 24 October 2024 / Published: 26 October 2024

Abstract

:
The limitations of the existing catapults necessitate multiple batches of take-offs for carrier-based unmanned aerial vehicles (UAVs) to form a formation. Because of the differences in takeoff time and location of each batch of UAVs, ensuring the temporal and spatial consistency and rendezvous efficiency of the formation becomes crucial. Concerning the challenges mentioned above, a multi-batch formation rendezvous method based on improved sequential convex programming (SCP) is proposed. A reverse solution approach based on the multi-batch rendezvous process is developed. On this basis, a non-convex optimization problem is formulated considering the following constraints: UAV dynamics, collision avoidance, obstacle avoidance, and formation consistency. An SCP method that makes use of the trust region strategy is introduced to solve the problem efficiently. Due to the spatiotemporal coupling characteristics of the rendezvous process, an inappropriate initial solution for SCP will inevitably reduce the rendezvous efficiency. Thus, an initial solution tolerance mechanism is introduced to improve the SCP. This mechanism follows the idea of simulated annealing, allowing the SCP to search for better reference solutions in a wider space. By utilizing the initial solution tolerance SCP (IST-SCP), the multi-batch formation rendezvous algorithm is developed correspondingly. Simulation results are obtained to verify the effectiveness and adaptability of the proposed method. IST-SCP reduces the rendezvous time from poor initial solutions without significantly increasing the computing time.

1. Introduction

Compared to manned aircraft, unmanned aerial vehicles (UAVs) have gradually become a research hotspot due to their advantages such as their low cost, strong maneuverability, simple structure, and good stealthiness [1,2,3]. In July 2013, the US military’s unmanned flying-wing layout aircraft, X47-B, accomplished a fully autonomous landing, marking the arrival of the era of carrier-based UAVs [4]. However, due to the limited payload capacity of a single UAV, it is challenging to fulfill complex mission requirements using them [5]. To overcome this drawback, multi-UAV collaboration and clustering have been proposed [6,7,8]. In multi-UAV coordination aboard carriers, the problem of multi-UAV formation rendezvous stands as one of the foundational aspects for achieving efficient collaboration.
The takeoff process of carrier-based UAVs relies on aircraft carriers, which face limitations due to their available deck space [9,10]: aircraft carrier decks typically have only 2–4 takeoff runways/catapults. Moreover, after the preceding aircraft takes off, there are delays due to the recharging of catapults and the aircraft wake vortices [11]. Subsequent aircraft using the same runway need to wait for a certain period before taking off. When the number of aircraft needing to be assembled exceeds the number of takeoff runways, the aircraft awaiting assembly cannot take off simultaneously. This results in the issue of multiple batches of formation rendezvous, causing aircraft from different batches to have different takeoff times. Moreover, during the batch takeoff process, due to the carrier’s movement, UAVs from different batches take off from different locations. The carrier-based UAV formation rendezvous problem aims to allow UAVs to take off at various times and locations and rendezvous into a specified formation.
The current research on UAV formation coordination and rendezvous primarily focuses on two types of problems.
The first type of rendezvous problem entails solving for the rendezvous location. Rucco et al. [12] and Wang et al. [13] addressed the rendezvous of small UAVs towards a moving unmanned ground vehicle (UGV), utilizing a rendezvous time predictive method to pre-determine the rendezvous time. Manyam et al. [14] developed a rendezvous method based on Bayesian quadratic curves, presetting the rendezvous time and planning for the rendezvous location and rendezvous path. McDonald et al. [15] simplified the rendezvous process into uniform and uniformly accelerated motions to predict and determine the feasible rendezvous time, thereby solving for the rendezvous locations and trajectories.
The second type of rendezvous problem entails solving for the rendezvous time. This problem was first investigated by McLain et al. [16,17], who aimed to plan the path for multiple UAVs in hostile environments to minimize the length of the rendezvous path, ensuring they all arrive at the target simultaneously for a coordinated rendezvous. Chen et al. [18] preset the rendezvous location and solved for the rendezvous time by developing a decentralized two-layer coordinative framework. Zhao et al. [19] developed a distributed approach that, under the pre-determined rendezvous location, optimized the rendezvous time for multiple UAVs.
The rendezvous problem studied in this paper is more complex than both the first type of problem and the second type of problem due to the inter-batch coordination. The rendezvous positions and times of the preceding batches influence those of the subsequent batches. When aiming to minimize the rendezvous time for the entire formation, a critical issue that demands resolution is determining the optimal time and location for each batch of UAVs to rendezvous into the formation. Thus, the rendezvous problem of carrier-based UAVs necessitates optimizing for both the rendezvous time and the rendezvous location. Research on this type of rendezvous problem can hardly be found in the existing literature. It is worthwhile to explore a batch-coordinated rendezvous method for carrier-based UAVs.
Currently, classical methods for formation rendezvous include velocity–consistency-based approaches [20,21], guidance-based methods [22], and strategies based on multi-agent cooperative path planning [23]. With the rapid advancement of path planning algorithms in recent years, rendezvous methods based on collaborative path planning strategies are receiving more focus. Jouffroy et al. [24] utilized particle swarm optimization to optimize the parameters of Dubins curves, thereby forming trajectories for the UAVs. Yao et al. [25] employed the A* algorithm to acquire the motion trajectories for multiple UAVs to rendezvous, and subsequently smoothed these paths into curves meeting the flight requirements for the UAVs using third-order B-spline curves. Shao et al. [26] utilized an enhanced particle swarm optimization algorithm to optimize the PH path parameters of individual UAVs, achieving the objective of formation rendezvous.
The aforementioned research focused on optimizing the geometric parameters of the UAV’s path, considering spatial consistency during UAV formation rendezvous. Temporal consistency constraints of the problem are usually transformed into spatial consistency constraints by assuming uniform motion of the aircraft. However, in the context of multi-batch carrier-based UAV formation rendezvous, the departures of aircraft from different batches do not occur simultaneously or at the same location. Under the constraint of consistent final velocities, it is impossible to form a formation using the assumption of uniform motion. Thus, the kinematic constraints of aircraft and the consistency constraints in both the temporal and spatial dimensions need to be considered.
When confronting offline trajectory planning problems that consider aircraft kinematic equations and spatiotemporal coupling consistency, the aforementioned planning approaches, such as swarm optimization algorithms, require the discretization of variables for encoding. During the process of space discretization, such methods might encounter issues like dimension explosion [27], high computational complexity, and slow convergence [28]. Hence, exploring methods suitable for handling dynamic constraints, multi-agent collision avoidance, and time-varying consistency constraints in this problem is crucial.
Convex optimization methods, as typical mathematical programming approaches, have a well-developed foundation in optimization theory and make use of numerous efficient polynomial-time solving algorithms. Based on the convex optimization concept, sequential convex programming (SCP) guarantees optimality while maintaining a high computational efficiency [29].
In the field of UAV formation rendezvous, this method is a promising branch [30]. Wang and McDonald [13,15] optimized the rendezvous location of UAVs based on SCP. Chen [18] and Zhao [19] optimized the rendezvous time based on SCP. Therefore, the SCP method is also applicable to the inter-batch coordination issue addressed in this paper.
When the final rendezvous state varies with time, it is difficult to obtain an appropriate initial guess for the SCP. Mao et al. [31] introduced trust region and virtual control techniques into SCP, thereby allowing the initial solution to not necessarily comply with the original problem’s dynamic constraints. Based on this work, they extended the technique to non-convex state and control constraints [32], thereby enabling the initial solution to not necessarily satisfy the original problem’s other non-convex constraints. The successive convexification algorithm (Scvx) they proposed serves as the baseline SCP employed in this paper. These works reduced the difficulty in obtaining an initial solution, yet there remains room for improvement in the sensitivity to initial solutions of the SCP [33]. Banerjee et al. [34] developed a learning-based method to offer a better initial solution for SCP. Instead of optimizing the initial solution, an initial solution tolerance mechanism is developed which decreases the algorithm’s sensitivity for initial values. Inspired by the simulated annealing (SA) algorithm, a novel SCP method that was improved by introducing an initial solution tolerance mechanism is proposed in this article.
The objective of this paper is to propose an efficient rendezvous method for addressing the formation rendezvous problem for carrier-based UAVs that take off in multiple batches. Considering the time intervals and starting/ending location differences during the takeoff process that occur between different carrier-based UAVs, a reverse solution approach is proposed. Based on this, a non-convex optimization problem is derived mathematically, and the optimization model is established to minimize the formation rendezvous time cost. In the existing literature, there is a lack of research on trajectory planning problems for multi-batch UAVs’ rendezvous; therefore, different rendezvous strategies considering different terminal specifications are urgently needed. Subsequently, under the concept of convex optimization, the relaxation technique is introduced to convexify these original non-convex problems. To develop an efficient solver, a rendezvous algorithm is proposed based on SCP that is improved by introducing an initial solution tolerance mechanism. Consequently, the issue that an inappropriate initial solution reduces the rendezvous efficiency is successfully circumvented. Finally, experimental scenarios for the formation of carrier-based UAVs are designed; thereafter, simulation experiments are conducted to validate the effectiveness and adaptability of the algorithm proposed in this paper. Specifically, the initial solution tolerance SCP (IST-SCP) performs better in terms of the rendezvous time compared with the standard SCP, and it has a shorter computing time compared with the initial solution optimization method.
The rest of the paper is organized as follows: Section 2 presents the multi-batch formation rendezvous optimal control problem formulated by the reverse solution approach. Section 3 offers the details of formation rendezvous algorithms based on IST-SCP. The simulation results and analysis are presented in Section 4. Finally, the conclusions are drawn in Section 5.

2. Problem Formulation

In this section, the dynamics model of UAVs is introduced. Then, the process of the multi-batch formation rendezvous of carrier-based UAVs is illustrated and, based on this, a reverse solution approach for formation rendezvous is proposed. Finally, other constraints for the formation rendezvous problem are added, and the non-convex optimal control problem is formulated.

2.1. Dynamics Model of UAVs

The subject of this study is fixed-wing carrier-based UAVs, and this study aims to solve the problem of their rendezvous into a specified formation through multiple batches. The utilized drone dynamics model, employed as the dynamic constraints for the optimal control problem, is as follows.
x ˙ g = V c o s γ c o s χ y ˙ g = V c o s γ s i n χ z ˙ g = V s i n γ V ˙ = g ( n x s i n γ ) χ ˙ s i n = g n y V c o s γ χ c o s χ ˙ c o s = g n y V c o s γ χ s i n γ ˙ = g ( n z c o s γ ) V
where x g , y g , z g represent the ground-based coordinates; V , χ , γ stand for the UAV’s speed, heading angle, and flight path angle, respectively; u = n x , n y , n z T  represent UAV control variables, indicating the overload in the x y z directions within the flight path coordinated frame. Since the heading angle χ changes periodically, the cosine and sine values are utilized in the state space instead of χ itself, as χ s i n = s i n χ ,   χ c o s = c o s χ .

2.2. Formation Rendezvous Process in Multiple Batches

For analytical convenience, let us assume that each batch of UAVs arrives at the designated formation position simultaneously. For N UAVs launched in q batches, each batch consisting of p 1 , p 2 , . . . , p q UAVs, these UAVs need to assemble into a specific formation. Assuming that UAVs within the same batch enter the formation simultaneously, this problem can be expressed as follows:
t i f = t f j m = 1 j 1 p m < i m = 1 j p m , 1 j q
where i represents the ith UAV, j represents the jth batch of UAVs, t i f denotes the time at which the ith UAV enters the formation, and t f j represents the time at which the jth batch of UAVs enters the formation.
Due to the time intervals between batches during takeoff and the inability of fixed-wing aircraft to hover, this paper adopts a rendezvous mode for UAV formations where the UAVs circle and rendezvous at a specified location and altitude, as illustrated in Figure 1.
In Figure 1, the process of forming a hexagonal formation using three batches of UAVs as an example is described: each batch of UAVs is launched from a path-known carrier at a specified time at a certain interval. They take off and fly towards a circling orbit to enter the formation. The position of each UAV within the formation is predetermined. The first batch of UAVs to take off enters the inner circling orbit, and these UAVs circle within the orbit to await the later UAVs. Upon entering the orbit, they directly assume their designated formation positions, forming a partial formation. The coordinates of the formation centroid when the jth batch of aircraft enters the formation are x c j , y c j . Each UAV uses the radius R c of the orbit where the formation centroid is located as the reference radius, calculates the radius R i of the circling orbit, and calculates the angular position relative to the formation centroid based on the geometric shape of the formation to attain its designated formation position. The angular velocity ω c of the UAV formation remains constant throughout this process, and the formation conducts uniform circular motion at the given orbit radius R c . When the last UAV assumes its formation position, the formation rendezvous task is completed.
In terms of the rendezvous sequence, a Gantt chart illustrating the circling rendezvous process is depicted in Figure 2.
In this problem, the optimization objective is the completion time t f of the rendezvous. Because the UAVs’ velocity is greater than that of the carrier, the rendezvous time of the front batch is shorter than that of the following batch when the circling orbit is designated, which means that t f j 0 . Thus, optimizing the completion time t f is equivalent to optimizing the time when the last batch of aircraft reaches the designated formation position t f q .

2.3. Reverse Solution Approach

To address the challenges of spatiotemporal coupling during the rendezvous process while maintaining the optimality of the rendezvous time, a reverse solution approach is proposed.
As depicted in Figure 3, this approach allows us to calculate the shortest time for the final batch of UAVs to enter the circling orbit. Using this time as a reference, the entry time and trajectories for the other batches of UAVs are reverse solved. This adjustment enables these batches to synchronize their entry into the circling orbit according to the entry time of the final batch of UAVs, meeting the consistency constraints.
The objective function for each batch is represented as follows:
J = w t t f j + w z g h z g
where g h z g = z c z g represents the altitude penalty term, driving the UAVs’ climb to the designated altitude z c .
For the final batch of UAVs, this objective function serves as the optimization goal for this problem. However, for the preceding q − 1 batches of UAVs, utilizing this objective can facilitate these batches’ swift entry into the formation, avoiding any impact on subsequent batches.
In this process, the circling orbit radius and circling angular velocity of the UAV formation are set. Therefore, the final constraint for the ith UAV can be represented in the following form:
x f i = x H + R i c o s ω c t f j t f q π 2 χ i y H + R i s i n ω c t f j t f q π 2 χ i z c V i s i n ω c t f j t f q + χ i c o s ω c t f j t f q + χ i 0
where x H , y H represents the coordinates of the circling orbit’s center, and t f q denotes the time when the final batch of aircraft enters the formation. The formation’s geometric parameters only affect R i , χ i , avoiding the influence of the formation shape on the problem-solving process.

2.4. Other Constraints and Non-Convex Optimization Problem Formulation

Besides meeting the dynamic constraints and the consistency constraints mentioned earlier, the rendezvous process also needs to satisfy the following constraints.
Considering the spatial constraints of the formation rendezvous task and the limitations of UAVs’ flight performance, we impose constraints on the state and control variables mentioned in (1) as follows:
x m i n x x m a x u m i n u u m a x
The UAVs need to avoid the no-fly zones along the path while heading to the formation area. The no-fly zone avoidance constraint is as follows:
C x i x o b l r + R o b l i = 1 ,   2 ,   ,   N l = 1 ,   2 ,   ,   L
where C represents a matrix that extracts the UAV’s state information x i τ , y i τ from the state vector x i ; x o b l represents the center coordinates of the lth no-fly zone; r denotes the safety radius of the aircraft, and R o b l represents the radius of the lth no-fly zone.
To prevent collisions among the UAVs within the formation, the collision avoidance constraint is as follows:
C x i x l 2 r i = 1 ,   2 ,   ,   N l = 1 ,   2 ,   ,   N i l
where x i , x l represent the states of the ith and lth UAVs, respectively.
The initial state constraint is as follows:
x 0 i = c i
where c i is a constant, and the state at which each carrier-based UAV takes off is predetermined.
Thus, the non-convex optimal control problem, the MFRP (multi-batch formation rendezvous problem), is formulated as follows:
m i n J = w t t f j + w z g h z g s . t . 1 2 4 5 6 7 8
Among these, constraints (1), (4), (6), and (7) are non-convex and require convexification.

3. Multi-Batch Formation Rendezvous Algorithm

In this section, convexification and discretization techniques are introduced first. Thereafter, an improved SCP algorithm incorporating an initial solution tolerance mechanism is derived. Subsequently, a rendezvous optimization method is developed with the improved SCP as its core for the formation rendezvous problem.

3.1. Constraints Convexification and Discretization

As mentioned previously, the MFRP contains non-convexity. Therefore, it is necessary to convexify the non-convex constraints around a certain reference solution x ¯ , u ¯ , t ¯ . In this process, introducing a trust region constraint is crucial to ensuring the effectiveness of the convexification approximation. However, the solution space near the reference solution might not always contain a solution after convexification. Hence, the relaxation factors are added to the constraints to prevent situations with no feasible solution.
The linear convexification of (1), (4), (6), (7) is shown in (10–13).
x ˙ i = A i x i + B i u i + D i
C x ¯ i x o b l + ( C x ¯ i x o b l ) T C x ¯ i x o b l ( C x i C x ¯ i ) r + R o b l
( x ¯ i x ¯ l ) T C T C ( x ¯ i x ¯ l ) C ( x i x l ) 2 r
x f i = x H + R i c o s χ ¯ i t f π 2 R i s i n χ ¯ i t f π 2 ω c t f j y H + R i s i n χ ¯ i t f π 2 + R i c o s χ ¯ i t f π 2 ω c t f j z c V i c o s χ ¯ i t f ω c s i n χ ¯ i t f t f j s i n χ ¯ i t f + ω c c o s χ ¯ i t f t f j 0
The coefficient matrices A, B, and D in (10) are as shown in (14).
A i = 0 0 0 c o s χ ¯ c o s γ ¯ V ¯ c o s γ ¯ 0 V ¯ c o s χ ¯ s i n γ ¯ 0 0 0 s i n χ ¯ c o s γ ¯ 0 V ¯ c o s γ ¯ V ¯ s i n χ ¯ s i n γ ¯ 0 0 0 s i n γ ¯ 0 0 V ¯ c o s γ ¯ 0 0 0 0 0 0 g c o s γ ¯ 0 0 0 g n ¯ y V ¯ 2 c o s γ ¯ s i n χ ¯ 0 g n ¯ y V ¯ c o s γ ¯ g n ¯ y V ¯ c o s 2 γ ¯ s i n χ ¯ s i n γ ¯ 0 0 0 g n ¯ y V ¯ 2 c o s γ ¯ c o s χ ¯ g n ¯ y V ¯ c o s γ ¯ 0 g n ¯ y V ¯ c o s 2 γ ¯ c o s χ ¯ s i n γ ¯ 0 0 0 g ( n ¯ z c o s γ ¯ ) V ¯ 2 0 0 g s i n γ ¯ V ¯ B i = 0 0 0 0 0 0 0 0 0 g 0 0 0 g s i n χ ¯ V ¯ c o s γ ¯ 0 0 g c o s χ ¯ V ¯ c o s γ ¯ 0 0 0 g V ¯ D i = x ¯ ˙ i A i x ¯ i B i u ¯ i
Then, introduce the trust region constraint as shown in (15). Note that the radius of the trust region is updated following the solving of each sequential subproblem using the update strategy proposed in Section 3.2.
x x ¯ δ x u u ¯ δ u t t ¯ δ t
Hence, the convexified MFRCS (multi-batch formation rendezvous convex subproblem) is depicted below.
m i n J = w t t f j + w z g h z g s . t . 10 2 11 5 12 13 8 15
For the MFRCS, when the trust region is inappropriate, it might lead to conflicts with other constraints. To ensure stability in the solving process, relaxation factors are added to each constraint and penalized in the objective function. This results in the MFRCSR (multi-batch formation rendezvous convex subproblem with relaxation factors), shown as follows:
m i n J = w t t f + w z g h z g + w p p u v s , v o , v c , v i c , v t c
s . t . x ˙ i = A x i + B u i + D i + v s i
C x ¯ i x o b l + ( C x ¯ i x o b l ) T C x ¯ i x o b l ( C x i C x ¯ i ) r + R o b l + v o i
( x ¯ i x ¯ l ) T C T C ( x ¯ i x ¯ l ) C ( x i x l ) 2 r + v c i
x 0 i = c i + v i c i
x f i = g f x , u , t + v t c i
g c t = 0
t r u x , u , t δ
where v s , v o , v c , v i c , v t c refer to the relaxation factors for the motion constraints, obstacle avoidance constraints, collision avoidance constraints, initial state constraints, and final state constraints, respectively. In (17a), w p is the penalty weight, and the function p u v s , v o , v c , v i c , v t c represents the penalty function for the relaxation factor to eventually converge to 0; as shown in (18), it adopts the sum of the 1-norms of the relaxation factors. By incorporating these relaxation factors, the algorithm’s initial solution and sequential solutions are no longer required to satisfy the original problem’s constraints [33]. Equations (17g,h) correspond to the time consistency constraint and the trust region constraint, respectively.
p u v s , v o , v c , v i c , v t c = v s 1 + v o 1 + v c 1 + v i c 1 + v t c
The MFRCSR still contains time-continuous equations with an infinite-dimensional variable. To simplify the computations, the dynamic equations in (17b) are approximated using linearized rectangular integration. Discretizing the MFRCSR results in the MFRDCS (multi-batch formation rendezvous discrete time convex subproblem), as represented by the following equation.
m i n J = w t t f + w z g h z g + w p p u v s , v o , v c , v i c , v t c
  s . t . x k + 1 i = x k i + t ¯ A x k i + B u k i + D i + t A x ¯ k i + B u ¯ k i + D i + v k s i
( C x ¯ k i x k o b l ) T C x ¯ k i x k o b l ( C x k i C x ¯ k i ) + C x ¯ k i x k o b l r i + R o b l + v k o i
( x ¯ k i x ¯ k l ) T C T C ( x ¯ k i x ¯ k l ) C ( x k i x k l ) 2 r + v k c i
x 1 i = c i + v i c i
x K i = g 5 x k i , u k i , t i + v t c i
g c t = 0
t r u x k i , u k i , t i δ
It should be noted that, due to the time intervals between the takeoffs of the different batches of carrier-based UAVs, the k representing time does not correspond to the same moment across different batches. The avoidance constraint mentioned above applies only to UAVs in the same batch. To handle avoidance between UAVs from different batches, it is necessary to compute the position of each UAV at the current time to ensure collision avoidance.
At this point, we have obtained the MFRDCS, which is suitable for convex optimization solvers.

3.2. Initial Solution Tolerance Sequential Convex Programming

When solving the convexified subproblems, convex optimization solvers may be misguided by these convexified constraints. As shown in Figure 4, when the reference solution is on the right side of the obstacle, the penalty generated by the convexified obstacle constraint can only approximately reflect the true penalty situation on the right side. Therefore, the variation in the sequence solution always stays on one side of the obstacle, and is only determined by the initial guess path’s position relative to the obstacle.
For the problem, in which the final state does not change over time, this limitation of the SCP method seems insignificant. However, when the final rendezvous state is closely related to the time, this limitation can lead the SCP method into a quite unsatisfactory local optimal solution.
For the q − 1 batches of UAVs, the final rendezvous positions circle within the orbit. As shown in Figure 5, the initial trajectory guess of type A (green dotted line) is on the left side of the centroid of the no-fly zone, and the initial path guess of type B (yellow dotted line) is on the right side of the centroid of the no-fly zone. Both A and B share one initial rendezvous position guess. As the SCP algorithm optimizes the rendezvous time, the rendezvous position moves in the opposite direction of the formation’s circling angular velocity (green arrow and yellow arrow). Path A (green curve), obstructed by the no-fly zone, is optimized to a local optimum because of the poor approximation of the penalty value on the right side of the no-fly zone. However, Path B (yellow curve) reached the global optimal solution with a better initial path guess.
It is difficult to accurately guess the initial solution of multiple UAVs for the complex environment/collision-free constraints/spatiotemporal relationship of the rendezvous problem. Instead of optimizing a better initial solution guess, an improved SCP method inspired by the simulated annealing (SA) algorithm is proposed.
The simulated annealing (SA) algorithm was first proposed by Metropolis et al. [35] in 1953. The SA algorithm starts from a higher initial temperature and, with a continuous decrease in temperature parameters, uses probability-based climbing to randomly search for the global optimal solution of the objective function in the solution space. That is, it can probabilistically jump away from the local optimal solution and eventually approach the global optimal solution.
The problem that the solution is trapped in the local optimum because of an inaccurate initial guess is mainly because the SCP method cannot handle the original problem of the non-convexity of obstacle constraints. It is unable to search for the optimal solution in the global solution space.
To fix this problem, an initial solution tolerance mechanism is combined with SCP. The initial solution tolerance SCP (IST-SCP) algorithm is shown in Figure 6.
At the beginning, an initial guess is provided by setting the initial rendezvous position and defining a linear interpolation between the takeoff and rendezvous states which is the same as the conventional SCP algorithm. Using this initial guess, the current minimum cost solution is initialized, which will be used to record the minimum cost solution. After convex relaxation around the reference solution, the probability-based search module is introduced. This module generates random disturbances with the obstacle avoidance penalty value v o . When a random number between 0 and 1 is greater than the search probability P1, we set the obstacle avoidance penalty to 0 for the optimizer (when calculating termination and update conditions, v o is calculated as the other penalty values). The purpose of doing so is to eliminate obstacles that hinder the rendezvous optimization, enabling the algorithm to explore spatiotemporal coupled trajectories in a wider space. Since P 1 = e 1 / T , T is the annealing temperature, which decreases as the iteration increases, and P1 will eventually drop to 0. As the annealing temperature decreases, the algorithm gradually stops the search frequency and ultimately optimizes the solution under the constraints of the MFRDCS.
In determining the termination criteria, two criteria need to be considered:
  • Whether the search module is triggered;
  • Whether the solution of the current iteration matches the solution of the previous iteration.
For criterion 1, when the search module is triggered, the solution obtained by the solver does not consider the penalty of the obstacle avoidance constraints, and this solution may not be feasible for both the MFRDCS and the original MFRP. Thus, the algorithm can only terminate when the search module is not triggered.
For criterion 2, based on the termination strategy in [33], d s , d o , d c , d i c , d t c can be defined as shown in (20). These represent the real constraint deviation during MFRP solving, while their counterparts, represented by the relaxation factors v s , v o , v c , v i c , v t c , indicate the linearly approximated constraint deviation.
d s = x ˙ V c o s γ c o s χ V c o s γ s i n χ V s i n γ g ( n x s i n γ ) g n y V c o s χ g n y V s i n χ g ( n z c o s γ ) V
d o = C x i x o b l r + R o b l
d c = C x i x l 2 r
d i c = x i t 0 i c i
d t c = x i t f i g f x , u , t
Criterion (2) can be expressed as follows:
J 1 = w t t ¯ f t f * + w z g h z ¯ g g h z g *
where t ¯ f , z ¯ g , d ¯ , corresponding to x ¯ , u ¯ , t ¯ , represent the solution obtained from solving the previous iteration’s subproblem, which also serves as the relaxed reference solution for this iteration’s subproblem. t f * , z g * , v * , corresponding to x * , u * , t * , represent the solution obtained from solving the subproblem in the current iteration ( superscript * stands for the optimal solution/objective value of the subproblem). J 1 represents the difference between the real (non-approximate) cost from the previous iteration and the convex approximate cost of the current iteration. When J 1 < ε , it is considered that the convex approximate cost of the current iteration’s solution is close enough to the real cost from the previous iteration, indicating convergence of the algorithm.
Note that v ¯ is not used here to describe the first part of criterion 2. This is because criterion 2 is mainly used to describe whether the solution of the nthiteration is consistent with that of the n − 1th iteration, indicating whether the algorithm has converged. v ¯ represents the deviation of the convex approximate constraint from that of the previous iteration. Due to different trust region selections between iterations, the level of convex approximation varies. Using the deviation in the previous iteration’s real constraints helps eliminate judgment errors caused by differences in the convex approximation between iterations.
If the current solution does not meet the termination condition, the trust region, reference solution, and annealing temperature need to be updated for the next iteration.
The trust region is updated with the following constraints: similar to J 1 , define J 2 :
J 2 = w t t ¯ f t f * + w z g h z ¯ g g h z g * + w p p u ( d ¯ ) p u ( d * )
Relaxation acceptability can be expressed as:
ρ = J 2 J 1
The term J 2 represents the difference between the real cost of the previous iteration and the current iteration. ρ stands for the ratio of the difference between the real cost and the convex approximation cost. A higher value of ρ indicates a smaller convex approximation deviation, signifying that the current relaxation is reasonable. Conversely, a lower value of ρ suggests a larger convex approximation deviation, implying that the current relaxation might not be precise enough, and that it might be necessary to promptly adjust the trust region size.
The strategy for updating the trust region is as follows:
ρ ρ u p : δ = α δ u p d a t e x ¯ , u ¯ , t n e x t ρ m i d ρ < ρ u p : δ = δ u p d a t e x ¯ , u ¯ , t n e x t ρ l o w ρ < ρ m i d : δ = δ / α u p d a t e x ¯ , u ¯ , t n e x t ρ < ρ l o w : δ = δ / α x ¯ , u ¯ , t n e x t = x ¯ , u ¯ , t
where ρ l o w < ρ m i d < ρ u p represents the evaluation parameter for relaxation acceptability, and α is a constant greater than 1, indicating the update speed of the trust region. If ρ ρ u p , it indicates a good convex approximation, and the trust region can be moderately expanded to enhance the solving efficiency; otherwise, if ρ m i d ρ < ρ u p , it indicates an acceptable range for the convex approximation, and the current trust region is maintained; further, if ρ l o w ρ < ρ m i d , it indicates an inadequate convex approximation, yet the obtained solution is within acceptable limits, so the trust region size is decreased for further solving; finally, if ρ < ρ l o w , it indicates a failed convex approximation with an unacceptable solution, meaning that updating the base solution and decreasing the trust region are avoided.
Update reference solution: if the current solution is acceptable, the real cost value of the current solution J * and the minimum cost value J m i n are compared. Similar to the SA algorithm, if J * < J m i n , it means that the current solution is the best, so J m i n is updated and the current solution is used as the reference solution for the next iteration. Otherwise, the current suboptimal solution is accepted based on P2 as the reference solution for the next iteration. According to the metropolis criterion:
P 2 = e J * J m i n T
This means that, at high temperatures, new states with significant energy differences from the current state are acceptable; at low temperatures, only new states with a smaller energy difference from the current state are acceptable. As the temperature gradually decreases, P2 tends to 0, and the algorithm will use the current best solution as the reference solution for the next iteration.
Update annealing temperature: the annealing temperature gradually decreases with each iteration, which can be expressed as:
T = β T
where β is a constant less than 1.

3.3. The Convergence Analysis of IST-SCP

This section focuses on proving the convergence of IST-SCP. First, the general form of the rendezvous problem is presented. Then, according to the theorems proved in [32], the convergence of the SCP without the initial solution tolerance mechanism is illustrated. Finally, the impact of the initial solution tolerance mechanism on the convergence of the SCP is analyzed.
The original non-convex MFRP can be rewritten in the general form, as Equation (27).
m i n J x , u , t
s . t . f i x , u , t = 0 i I e q
f i x , u , t 0 i I i n e q
h j x , u , t = 0 j J e q
h j x , u , t 0 j J i n e q
where J x , u , t in (27a) represents the objective function in (3). f i x , u , t in (27b) represents the non-convex equality constraints, which are the constraints in (1) and (4). f i x , u , t , in (27c), represents the non-convex inequality constraints, which are the constraints in (6) and (7). h j x , u , t , in (27d), represents the convex equality constraints, which are the constraints in (2) and (8). h j x , u , t , in (27e), represents the convex inequality constraints, which are the constraints in (5). I e q I i n e q represents the set of non-convex constraint indices, and J e q J i n e q represents the set of convex constraint indices.
By utilizing the L1 penalty function, the original problem in (27) is reformulated as in Equation (28).
m i n J L 1 x , u , t = J x , u , t + i I e q λ i f i x , u , t + i I i n e q λ i m a x 0 , f i x , u , t + j J e q σ j h j x , u , t + j J i n e q σ j m a x 0 , h j x , u , t
To facilitate the analysis, some fundamental theorems and assumption of convex optimization are provided.
Theorem 1. 
If there exist  λ ¯ 0 , σ ¯ 0  corresponding to  x ¯ , u ¯ , t ¯ , such that, for any  λ λ ¯ , σ σ ¯  within an open neighborhood of  x ¯ , u ¯ , t ¯ , J L 1 x ¯ , u ¯ , t ¯ J L 1 x , u , t , then  x ¯ , u ¯ , t ¯  is a local optimum to the problem in Equation (27).
Assumption 1. 
The set of the active inequality constraints indices are defined as:
I i n e q a c i | f i x ¯ , u ¯ , t ¯ = 0 , i I i n e q J i n e q a c j | h j x ¯ , u ¯ , t ¯ = 0 , j J i n e q
The constraints in the MFRP are continuously differentiable. Therefore, define  F e q x ¯ , u ¯ , t ¯  constructed by  f i x ¯ , u ¯ , t ¯  for  i I e q  as its column, define  H e q x ¯ , u ¯ , t ¯  constructed by  h j x ¯ , u ¯ , t ¯  for  j J e q  as its column, define  F i n e q x ¯ , u ¯ , t ¯  constructed by  f i x ¯ , u ¯ , t ¯  for  i I i n e q  as its column, and define  H i n e q x ¯ , u ¯ , t ¯  constructed by  h i x ¯ , u ¯ , t ¯  for  j J i n e q  as its column. Then, the linear independence constraint qualification (LICQ) is satisfied at  x ¯ , u ¯ , t ¯  if the columns of the following matrix are linearly independent:
A c x ¯ , u ¯ , t ¯ = F e q x ¯ , u ¯ , t ¯ H e q x ¯ , u ¯ , t ¯ F i n e q x ¯ , u ¯ , t ¯ H i n e q x ¯ , u ¯ , t ¯
Theorem 2. 
If the constraints of the problem in Equation (27) satisfy the LICQ assumption in Assumption 1, and  x ¯ , u ¯ , t ¯  is a local optimum of the problem in Equation (27), then there exist Lagrange multipliers  μ ¯ i  for  i I e q , μ ¯ i 0  for  i I i n e q , τ ¯ j  for  j J e q , and  τ ¯ j 0  for  j J i n e q , such that:
J x ¯ , u ¯ , t ¯ + i I e q μ ¯ i f i x ¯ , u ¯ , t ¯ + i I i n e q μ ¯ i f i x ¯ , u ¯ , t ¯ + j J e q τ ¯ j h j x ¯ , u ¯ , t ¯ + j J i n e q τ ¯ j h j x ¯ , u ¯ , t ¯ = 0
The point that satisfies the above conditions is referred to as the KKT point.
Given the consistency in the formulation of the problem, the convergence of the SCP without the initial solution tolerance mechanism applies to the conclusion drawn in [32]. The convergence of the SCP without the initial solution tolerance mechanism is shown in Theorem 3.
Theorem 3. 
Given Assumption 1, the SCP without the initial solution tolerance mechanism always has limit points, and any limit point,  x ¯ , u ¯ , t ¯ , is a stationary point of the non-convex penalty problem in Equation (28). Furthermore, if  x ¯ , u ¯ , t ¯  is feasible for the original non-convex problem in Equation (27), then it is a KKT point of the MFRP.
For IST-SCP, when the annealing temperature T tends to zero, both P 1 and P 2 tend to zero, which means that the function of searching for and accepting a less optimal solution of the IST-SCP is deactivated. At this point, the IST-SCP will degenerate into the conventional SCP with an initial solution x ¯ , u ¯ , t ¯ generated by the initial solution tolerance mechanism.
In conclusion, the initial tolerance mechanism does not influence the convergence of the SCP. The sequence solutions generated by the IST-SCP will converge to the local optimal of the original MFRP.

3.4. Multi-Batch Formation Rendezvous Algorithm

The pseudo-codes for the IST-SCP-based formation rendezvous algorithm are shown in the below Algorithm 1.
Algorithm 1: Multi_batch formation rendezvous algorithm
I n i t i a l i z e   x q , u q , t q x ¯ q , u ¯ q , t ¯ q
I n i t i a l i z e   T q , δ q
w h i l e   J 1 q > ε :
    c a l c u l a t e   P 1 q v o q = 0 ?
    s o l v e   M F R D C S q x q * , u q * , t q *
    c a l c u l a t e   J 1 q , J 2 q , ρ q , P 2 q
    u p d a t e   x ¯ q , u ¯ q , t ¯ q , δ q , T q
e n d
w h i l e   j < q :
    I n i t i a l i z e   x j , u j , t j x ¯ j , u ¯ j , t ¯ j
    I n i t i a l i z e   T j , δ j
    w h i l e   J 1 j > ε :
    c a l c u l a t e   P 1 j v o j = 0 ?
    s o l v e   M F R D S j x j * , u j * , t j *
    c a l c u l a t e   J 1 j , J 2 j , ρ j , P 2 j
    u p d a t e   x ¯ j , u ¯ j , t ¯ j , δ j , T j
    e n d  
e n d
The Algorithm 1 states that the rendezvous problem of the final batch of UAVs, MFRDCSq, is solved first. Then, utilizing the results as the reference to the consistency constraints, the rendezvous problems of other batches of UAVs are solved, denoted as MFRDCSj.

4. Simulation Results and Analysis

In this section, a common scene related to carrier-based UAV formation rendezvous is addressed, validating the effectiveness of the multi-batch formation rendezvous algorithm. Compared with the conventional SCP, the effectiveness of the IST-SCP is verified. Compared with the algorithms that optimize the initial solution (GA+SCP and SA+SCP), the efficiency of the IST-SCP is proved. Additionally, the adaptability of the algorithm concerning (1) the number of batches and the number of UAVs per batch, and (2) the geometric shape of the UAV formation is evaluated.
The conventional SCP method used for comparison was developed by Acikmese et al. [33], who introduced the relaxation factor and the trust region strategy into the SCP algorithm. In recent years, this method has demonstrated commendable performance in various domains such as Mars entry [36], atmospheric re-entry [37], and power system optimization [38].
The simulations are performed in MATLAB with 8 GB of memory and an Intel Core i7 2.2 GHz processor. The convex solver utilized in the SCP is CVX. The parameters of the IST-SCP and the conventional SCP are shown in Table 1.
The scenario of carrier-based UAV formation rendezvous is constructed as: a carrier starts from the coordinate origin (0,0) and travels northeast at a constant speed of V = 42 km/h. During this process, every 2 min, two carrier-based UAVs simultaneously take off. In total, six isomorphic UAVs with a wingspan of 18 m take off. They are directed to a specified altitude of 600 m, at a specified circling angular velocity of 0.04 rad/s, with a circling radius of 3 km around the designated formation center. The objective is to form a regular hexagonal formation with a center-to-center distance of 24 m between the aircraft. During the formation rendezvous process of UAVs, it is necessary to avoid cylindrical no-fly zones. The detailed parameters and rendezvous scenario are illustrated in Table 2 and Figure 7.
The velocity of the carrier is that used in [11], which is around 22 knots. The time interval for takeoff between the batches is that used in [39]. The center-to-center distance in the formation is that used in [40], and adopts a larger value to leave enough space for circling maneuvers.
The green line in the figure represents the trajectory of the carrier’s navigation. The blue circles denote the predetermined circling orbits of the formation centroid. The green squares represent the locations where the UAVs from three different batches take off. The red circular areas delineate the no-fly zones for the carrier-based UAVs.

4.1. Effectiveness Analysis of the Multi-Batch Formation Rendezvous Algorithm

The algorithm 1 proposed in Section 3.4 is applied in the aforementioned task scene. Because of the probability-based search module and metropolis criterion, the algorithm 1 is calculated five times, and takes the shortest rendezvous time as the final result, which is shown in Figure 8.
Figure 8a illustrates the 3D flight trajectory of the six UAVs during the formation rendezvous process. It is noticed that, after sequentially taking off in three batches, the aircraft climbed to the designated altitude because of the altitude penalty term in (3). Then, they avoided the cylindrical no-fly zones. Eventually, they reached the designated formation position and executed the circling flight, forming the intended formation.
Figure 8b displays a projection of the flight trajectories of the six UAVs on the horizontal plane. It indicates that all six UAVs found the optimal paths from their starting positions to the final destinations. Under the constraints of consistency, they ultimately formed the specified regular hexagonal formation.
The effectiveness of the multi-batch formation rendezvous algorithm is well proved.
To highlight the effectiveness of IST-SCP, the optimizer in the multi-batch formation rendezvous algorithm is replaced by conventional SCP. The results are shown in Table 3.
It is obvious that, for each batch of UAVs, the rendezvous time generated by IST-SCP is shorter than conventional SCP.
The rendezvous time of each UAV is related to its path and velocity. The velocities during the rendezvous process of IST-SCP and conventional SCP are shown in Figure 9.
It is shown that the velocity calculated by both algorithms first increases and then decreases due to the formation’s circling velocity, except for that of the climbing process at the beginning. This means that both algorithms can generate the maximum velocity of the UAVs flying on this path. Therefore, the difference in rendezvous time is mainly caused by the path.
As shown in Figure 10, the IST-SCP jumps out of the local optimum caused by inaccurate initial solutions and obtains a path with a shorter rendezvous time.
Taking the numerical results in Figure 10b as an example, the optimization details of the IST-SCP are presented. The initial solution of the second batch is on the left sides of no-fly zone 1 and no-fly zone 3. At this point, the annealing temperature was at its maximum, T = T m a x , and so was P 1 = 0.99 . Thus, the subproblem was more likely to disregard the influence of the relaxation factor v o when being solved, searching for the optimal rendezvous time and location within the entire task space. However, the trajectory obtained in this way had a greater obstacle avoidance penalty. Under the influence of the greatest P 2 , the IST-SCP was encouraged to search even when the obstacle avoidance penalty was high. As the annealing temperature decreased, P 1 gradually decreased, and the probability of considering the relaxation factor v o in solving the subproblems increased. Simultaneously, P 2 was also decreasing, leading the algorithm to favor solutions with a lesser obstacle avoidance penalty as the new reference solution. This continued until the annealing temperature dropped to its minimum,   T = T m i n . At this point, the reference trajectory was on the left side of no-fly zone 2 and was on the right side of no-fly zone 3. The IST-SCP fully degenerated into conventional SCP, optimizing the rendezvous time and location until the algorithm’s stopping criteria were met.
To better demonstrate the reduced sensitivity of the IST-SCP to the initial solutions, computational results for the rendezvous time under various initial solutions for the second batch of UAVs are presented in Table 4. It can be observed that the initial solution tolerance mechanism effectively reduces the sensitivity to the initial values.
The rendezvous time generated by the IST-SCP is reduced by up to 11.02%, and on average by 8.40%, in comparison to the SCP. The effectiveness of the multi-batch formation rendezvous algorithm has been well demonstrated.

4.2. Efficiency Analysis of the Multi-Batch Formation Rendezvous Algorithm

To highlight the efficiency of IST-SCP, the SA+SCP and GA+SCP methods, which optimize the initial solution, are compared with IST-SCP in computing time.
The SA+SCP and GA+SCP methods contain two layers. The first layer, using SA/GA, takes the initial rendezvous position as the optimization variable. Then, the second layer, using conventional SCP, calculates the cost value for each initial solution and converts it back to the first layer to find the best initial rendezvous position for the conventional SCP.
The rendezvous time and computing time for from six to eight UAVs were simulated. The results are shown in Table 5. Compared with the GA+SCP, the computing time of the IST-SCP is reduced by 96.60% on average. Compared with the SA+SCP, the computing time of the IST-SCP is reduced by 97.14% on average. It is obvious that IST-SCP, as an initial solution tolerance algorithm, has higher efficiency than such initial solution optimization algorithms.

4.3. Adaptability Analysis of Multi-Batch Formation Rendezvous Algorithm

This section will prove the strong adaptability of the multi-batch formation rendezvous algorithm concerning the following factors: (1) the number of batches and the number of UAVs per batch, and (2) the geometric shape of the UAV formation.
(1) The number of batches and the number of UAVs per batch:
The solution for (1) four batches, each with two UAVs, totaling eight UAVs, to form a regular octagon, (2) three batches, with one UAV in the first batch, two UAVs in the second batch, and three UAVs in the third batch, to form a regular hexagon, (3) and three batches, with one UAV in the first batch, three UAVs in the second batch, and two UAVs in the third batch, to form a regular hexagon, are shown in Figure 11.
The adaptability to the number of batches and the number of UAVs per batch of the multi-batch formation rendezvous algorithm is well demonstrated.
(2) The geometric shape of the UAV formation:
The solution for (1) three batches, with one UAV in the first batch and two UAVs in the second batch and third batches, to form a diamond-shaped formation, and for (2) three batches, with one UAV in the first batch and three UAVs in the second third batches, to form a herringbone formation are shown in Figure 12.
The adaptability to the geometric shape of the multi-batch formation rendezvous algorithm is well proved.
In conclusion, it is well proved that the multi-batch formation rendezvous algorithm has strong adaptability concerning the number of batches, the number of UAVs per batch, and the geometric shape of the formation.

5. Conclusions

In this paper, the multi-batch formation rendezvous problem for carrier-based UAVs is studied.
(1) An innovative initial solution tolerance SCP (IST-SCP) algorithm is proposed to solve the problem that the conventional SCP algorithm is easily affected by the initial solution and falls into the local optimum. By introducing a probability-based search module and metropolis criterion, the IST-SCP algorithm can accept inappropriate initial solutions and jump out of the local optimum. As the convergency of the IST-SCP is well proved, the IST-SCP algorithm can be applied to a general spatiotemporal coupled trajectory planning problem. Then, utilizing IST-SCP, the multi-batch formation rendezvous method is developed to solve the multi-batch carrier-based UAV formation rendezvous problem.
(2) A new type of UAV rendezvous problem is solved for the first time, considering all kinds of rendezvous limitations of carrier-based UAVs. A reverse solution approach is presented for modeling the optimization problems, which simplifies the spatiotemporal coupling consistency constraints without compromising the optimality of the problem. Then, considering the non-convexity arising from constraints such as the UAVs’ dynamics, obstacle avoidance specifications and convex relaxation techniques are introduced to convexify the original problem.
(3) Simulation experiments are carried out to validate the effectiveness of the multi-batch formation rendezvous algorithm. It is proved that IST-SCP generates better solutions than the conventional SCP and has higher efficiency than the initial solution optimization method. The adaptability of the proposed algorithm is also verified in terms of the following factors: the number of batches, the number of UAVs per batch, and the geometric shape of the formation.
In future studies, we will prioritize considerations of wind field models, the carrier motion, environmental uncertainties, communication delays, optimizations in energy consumption, and computational loads, alongside the challenges inherent in practical deployments.

Author Contributions

Conceptualization, Y.W.; methodology, Z.Z.; software, Z.Z.; validation, Y.W., Z.Z. and L.S.; formal analysis, L.S.; investigation, Z.Z.; resources, L.S.; data curation, Z.Z.; writing—original draft preparation, Z.Z.; writing—review and editing, Y.W. and L.S.; visualization, Z.Z.; supervision, Y.W. and L.S.; project administration, L.S.; funding acquisition, Y.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Hayat, S.; Yanmaz, E.; Muzaffar, R. Survey on Unmanned Aerial Vehicle Networks for Civil Applications: A Communications Viewpoint. IEEE Commun. Surv. Tutor. 2016, 18, 2624–2661. [Google Scholar] [CrossRef]
  2. Mozaffari, M.; Saad, W.; Bennis, M.; Nam, Y.-H.; Debbah, M. A Tutorial on UAVs for Wireless Networks: Applications, Challenges, and Open Problems. IEEE Commun. Surv. Tutor. 2019, 21, 2334–2360. [Google Scholar] [CrossRef]
  3. Zhao, B.; Xian, B.; Zhang, Y.; Zhang, X. Nonlinear Robust Adaptive Tracking Control of a Quadrotor UAV Via Immersion and Invariance Methodology. IEEE Trans. Ind. Electron. 2015, 62, 2891–2902. [Google Scholar] [CrossRef]
  4. Zhen, Z.; Jiang, S.; Ma, K. Automatic carrier landing control for unmanned aerial vehicles based on preview control and particle filtering. Aerosp. Sci. Technol. 2018, 81, 99–107. [Google Scholar] [CrossRef]
  5. Ma, B.; Liu, Z.; Jiang, F.; Zhao, W.; Dang, Q.; Wang, X.; Zhang, J.; Wang, L. Reinforcement learning based UAV formation control in GPS-denied environment. Chin. J. Aeronaut. 2023, 36, 281–296. [Google Scholar] [CrossRef]
  6. Chandhar, P.; Danev, D.; Larsson, E.G. Massive MIMO for Communications With Drone Swarms. IEEE Trans. Wirel. Commun. 2018, 17, 1604–1629. [Google Scholar] [CrossRef]
  7. Morse, B.S.; Engh, C.H.; Goodrich, M.A. UAV Video Coverage Quality Maps and Prioritized Indexing for Wilderness Search and Rescue. In Proceedings of the 5th ACM/IEEE International Conference on Human Robot Interaction, HRI 2010, Osaka, Japan, 2–5 March 2010. [Google Scholar]
  8. Zhang, J.; Xing, J. Cooperative task assignment of multi-UAV system. Chin. J. Aeronaut. 2020, 33, 2825–2827. [Google Scholar] [CrossRef]
  9. Liu, Y.; Han, W.; Su, X.; Cui, R. Optimization of fixed aviation support resource station configuration for aircraft carrier based on aircraft dispatch mission scheduling. Chin. J. Aeronaut. 2023, 36, 127–138. [Google Scholar] [CrossRef]
  10. Wang, X.; Liu, J.; Su, X.; Peng, H.; Zhao, X.; Lu, C. A review on carrier aircraft dispatch path planning and control on deck. Chin. J. Aeronaut. 2020, 33, 3039–3057. [Google Scholar] [CrossRef]
  11. Bardera-Mora, R.; Garcia-Magariño, A.; Rodríguez-Sevillano, A.; Barcala-Montejano, M.A. Aerodynamic Flow Effects on Aircraft Carrier Takeoff Performance. J. Aircr. 2019, 56, 1005–1013. [Google Scholar] [CrossRef]
  12. Rucco, A.; Sujit, P.B.; Aguiar, A.P.; de Sousa, J.B.; Pereira, F.L. Optimal Rendezvous Trajectory for Unmanned Aerial-Ground Vehicles. IEEE Trans. Aerosp. Electron. Syst. 2018, 54, 834–847. [Google Scholar] [CrossRef]
  13. Wang, Z.; McDonald, S.T. Convex relaxation for optimal rendezvous of unmanned aerial and ground vehicles. Aerosp. Sci. Technol. 2020, 99, 105756. [Google Scholar] [CrossRef]
  14. Manyam, S.G.; Casbeer, D.W.; Weintraub, I.E.; Taylor, C. Trajectory Optimization for Rendezvous Planning Using Quadratic Bézier Curves. In Proceedings of the 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Prague, Czech Republic, 27 September–1 October 2021; pp. 1405–1412. [Google Scholar]
  15. McDonald, S.; Wang, Z. Real-Time Optimal Trajectory Generation for UAV to Rendezvous with an Aerial Orbit. In Proceedings of the AIAA Aviation 2019 Forum, Dallas, TX, USA, 17–21 June 2019. [Google Scholar]
  16. Mclain, T.W.; Beard, R.W. Trajectory Planning for Coordinated Rendezvous of Unmanned Air Vehicles. In Proceedings of the AIAA Guidance, Navigation, and Control Conference and Exhibit, Monterey, CA, USA, 5–8 August 2002. [Google Scholar]
  17. McLain, T.W.; Chandler, P.R.; Rasmussen, S.; Pachter, M. Cooperative Control of UAV Rendezvous. In Proceedings of the 2001 American Control Conference(Cat. No. 01CH37148), Arlington, VA, USA, 25–27 June 2001; pp. 2309–2314. [Google Scholar]
  18. Cheng, Z.; Zhao, L.; Shi, Z. Decentralized Multi-UAV Path Planning Based on Two-Layer Coordinative Framework for Formation Rendezvous. IEEE Access 2022, 10, 45695–45708. [Google Scholar] [CrossRef]
  19. Zhao, F.; Yu, J.; Hua, Y.; Dong, X.; Li, Q.; Ren, Z. Decoupled SCP-Based Trajectory Planning in the Complex Environment for Multiple Fixed-Wing UAV Systems. In Proceedings of the 2022 International Conference on Unmanned Aircraft Systems (ICUAS), Dubrovnik, Croatia, 21–24 June 2022; pp. 670–675. [Google Scholar]
  20. Wang, X.; Garcia, E.; Kingston, D.; Casbeer, D. Consensus-Based Simultaneous Arrival of Multiple UAVs with Constrained Velocity. In Proceedings of the American Control Conference, Chicago, IL, USA, 1–3 July 2015; pp. 5732–5737. [Google Scholar]
  21. Shiyu, Z.; Rui, Z. Cooperative guidance for multimissile salvo attack. Chin. J. Aeronaut. 2008, 21, 533–539. [Google Scholar] [CrossRef]
  22. Shi, H.-r.; Lu, F.-x.; Wu, L. Cooperative trajectory optimization of UAVs in approaching stage using feedback guidance methods. Def. Technol. 2023, 24, 361–381. [Google Scholar] [CrossRef]
  23. Zheng, D.; Zhang, Y.-f.; Li, F.; Cheng, P. UAVs cooperative task assignment and trajectory optimization with safety and time constraints. Def. Technol. 2023, 20, 149–161. [Google Scholar] [CrossRef]
  24. Jouffroy, V.; Bovier-Lapierre, X.; Ariff, O.K.; Richer, T. Path Generation for Rendezvous of Dissimilar UAVs Using Particle Swarm Optimization of Dubin’s Curve Sets. In Proceedings of the AIAA Guidance, Navigation, and Control Conference, Portland, OR, USA, 8–11 August 2011. [Google Scholar]
  25. Yao, W.; Qi, N.; Liu, Y. Online Trajectory Generation with Rendezvous for UAVs Using Multistage Path Prediction. J. Aerosp. Eng. 2017, 30, 04016092. [Google Scholar] [CrossRef]
  26. Shao, Z.; Yan, F.; Zhou, Z.; Zhu, X. Path Planning for Multi-UAV Formation Rendezvous Based on Distributed Cooperative Particle Swarm Optimization. Appl. Sci. 2019, 9, 2621. [Google Scholar] [CrossRef]
  27. Ait Saadi, A.; Soukane, A.; Meraihi, Y.; Benmessaoud Gabis, A.; Mirjalili, S.; Ramdane-Cherif, A. UAV Path Planning Using Optimization Approaches: A Survey. Arch. Comput. Methods Eng. 2022, 29, 4233–4284. [Google Scholar] [CrossRef]
  28. Zhang, D.; Yong, X.; Jie, L.; Gang, L.; Yan, C. UAV Path Planning Based on Chaos Ant Colony Algorithm. In Proceedings of the International Conference on Computer Science & Mechanical Automation, London, UK, 29 June–1 July 2016. [Google Scholar]
  29. Bonalli, R.; Lew, T.; Pavone, M. Analysis of Theoretical and Numerical Properties of Sequential Convex Programming for Continuous-Time Optimal Control. IEEE Trans. Autom. Control 2023, 68, 4570–4585. [Google Scholar] [CrossRef]
  30. Wang, Z. A survey on convex optimization for guidance and control of vehicular systems. Annu. Rev. Control 2024, 57, 100957. [Google Scholar] [CrossRef]
  31. Mao, Y.; Szmuk, M.; Açıkmeşe, B. Successive Convexification of Non-Convex Optimal Control Problems and its Convergence Properties. In Proceedings of the 2016 IEEE 55th Conference on Decision and Control (CDC), Las Vegas, NV, USA, 12–14 December 2016; pp. 3636–3641. [Google Scholar]
  32. Mao, Y.; Szmuk, M.; Xu, X.; Açikmese, B. Successive convexification: A superlinearly convergent algorithm for non-convex optimal control problems. arXiv 2018, arXiv:1804.06539. [Google Scholar]
  33. Malyuta, D.; Reynolds, T.P.; Szmuk, M.; Lew, T.; Bonalli, R.; Pavone, M.; Açıkmeşe, B. Convex Optimization for Trajectory Generation: A Tutorial on Generating Dynamically Feasible Trajectories Reliably and Efficiently. IEEE Control Syst. 2022, 42, 40–113. [Google Scholar] [CrossRef]
  34. Banerjee, S.; Lew, T.; Bonalli, R.; Alfaadhel, A.; Alomar, I.A.; Shageer, H.M.; Pavone, M. Learning-Based Warm-Starting for Fast Sequential Convex Programming and Trajectory Optimization. In Proceedings of the 2020 IEEE Aerospace Conference, Big Sky, MT, USA, 7–14 March 2020; pp. 1–8. [Google Scholar]
  35. Kirkpatrick, S.; Gelatt, C.D.; Vecchi, M.P. Optimization by Simulated Annealing. Science 1983, 220, 671–680. [Google Scholar] [CrossRef] [PubMed]
  36. Liu, X.; Li, S.; Xin, M. Mars entry trajectory planning with range discretization and successive convexification. J. Guid. Control Dyn. 2022, 45, 755–763. [Google Scholar] [CrossRef]
  37. Dominguez Calabuig, G.J.; Mooij, E. Optimal on-Board Abort Guidance Based on Successive Convexification for Atmospheric Re-Entry. In Proceedings of the AIAA Scitech 2021 Forum, Virtual, 11–15 & 19–21 January 2021; p. 0860. [Google Scholar]
  38. Tziovani, L.; Hadjidemetriou, L.; Timotheou, S. Successive Convexification Algorithms for Optimizing Power Systems with Energy Storage Models. IEEE Trans. Smart Grid 2023, 15, 1807–1820. [Google Scholar] [CrossRef]
  39. Deng, Z.; Liu, X.; Dou, Y.; Su, X.; Li, H.; Wang, L.; Wang, X. Autonomous sortie scheduling for carrier aircraft fleet under towing mode. Def. Technol. 2024. [Google Scholar] [CrossRef]
  40. Zhang, Q.; Liu, H.H.T. Aerodynamics Modeling and Analysis of Close Formation Flight. J. Aircr. 2017, 54, 2192–2204. [Google Scholar] [CrossRef]
Figure 1. Schematic diagram of the process of circling and rendezvous (the solid airplane represents the actual position of the airplane, the hollow airplane represents the target formation position of the airplane, and the green square represents the position of the airplane at takeoff).
Figure 1. Schematic diagram of the process of circling and rendezvous (the solid airplane represents the actual position of the airplane, the hollow airplane represents the target formation position of the airplane, and the green square represents the position of the airplane at takeoff).
Drones 08 00615 g001
Figure 2. Gantt chart of the rendezvous process (the green rectangle represents the stage of the aircraft waiting on the flight deck, the yellow rectangle represents the stage of the aircraft from takeoff to circling, and the blue rectangle represents the stage of the aircraft circling in the air).
Figure 2. Gantt chart of the rendezvous process (the green rectangle represents the stage of the aircraft waiting on the flight deck, the yellow rectangle represents the stage of the aircraft from takeoff to circling, and the blue rectangle represents the stage of the aircraft circling in the air).
Drones 08 00615 g002
Figure 3. Schematic diagram of the reverse solution approach (the solid red triangle represents the actual position of the formation centroid, while the hollow red triangle represents the expected position of the formation centroid when a certain batch of aircraft is incorporated). First, minimize the rendezvous time of last batch, as shown in (1); second, using this as a reference, reverse-solve the other batches’ trajectories and ensure the temporal consistency constraints are met by adjusting the rendezvous positions, as shown in (2,3); finally, the formation successfully rendezvous, as shown in (4).
Figure 3. Schematic diagram of the reverse solution approach (the solid red triangle represents the actual position of the formation centroid, while the hollow red triangle represents the expected position of the formation centroid when a certain batch of aircraft is incorporated). First, minimize the rendezvous time of last batch, as shown in (1); second, using this as a reference, reverse-solve the other batches’ trajectories and ensure the temporal consistency constraints are met by adjusting the rendezvous positions, as shown in (2,3); finally, the formation successfully rendezvous, as shown in (4).
Drones 08 00615 g003
Figure 4. The initial trajectory leads the sequential solutions to the same side of an obstacle because of the poor approximation of the convexified penalty (the left part of the figure represents the variation in the flight trajectory on the horizontal plane projection as the subproblem is sequentially solved. The right part of the figure shows that this change is to gradually reduce the punishment of obstacle avoidance constraints).
Figure 4. The initial trajectory leads the sequential solutions to the same side of an obstacle because of the poor approximation of the convexified penalty (the left part of the figure represents the variation in the flight trajectory on the horizontal plane projection as the subproblem is sequentially solved. The right part of the figure shows that this change is to gradually reduce the punishment of obstacle avoidance constraints).
Drones 08 00615 g004
Figure 5. Different initial trajectory guesses for one UAV lead to different local optimums of rendezvous time because of the spatiotemporal coupling rendezvous process (the solid line represents the actual trajectory, while the dashed line represents the initial trajectory guess. The green and yellow lines represent different trajectories optimized under different initial guesses).
Figure 5. Different initial trajectory guesses for one UAV lead to different local optimums of rendezvous time because of the spatiotemporal coupling rendezvous process (the solid line represents the actual trajectory, while the dashed line represents the initial trajectory guess. The green and yellow lines represent different trajectories optimized under different initial guesses).
Drones 08 00615 g005
Figure 6. Flow chart of the initial solution tolerance SCP (IST-SCP) algorithm ( superscript * stands for the optimal solution/objective value of the subproblem).
Figure 6. Flow chart of the initial solution tolerance SCP (IST-SCP) algorithm ( superscript * stands for the optimal solution/objective value of the subproblem).
Drones 08 00615 g006
Figure 7. The rendezvous scene of carrier-based UAVs.
Figure 7. The rendezvous scene of carrier-based UAVs.
Drones 08 00615 g007
Figure 8. (a) The 3D flight path for each UAV; (b) the projection of the flight path on the horizontal plane.
Figure 8. (a) The 3D flight path for each UAV; (b) the projection of the flight path on the horizontal plane.
Drones 08 00615 g008
Figure 9. The velocity during the rendezvous process generated by IST-SCP and conventional SCP. Both algorithms can generate the maximum velocity flying on this path. Thus, the difference in rendezvous time is mainly caused by the path.
Figure 9. The velocity during the rendezvous process generated by IST-SCP and conventional SCP. Both algorithms can generate the maximum velocity flying on this path. Thus, the difference in rendezvous time is mainly caused by the path.
Drones 08 00615 g009
Figure 10. (a) The initial path guess for the first batch is between no-fly zone 1 and no-fly zone 2. The path generated by IST-SCP jumped out of the local optimum, while the path generated by conventional SCP was trapped in it; (b) the initial path guess for the second batch is outside no-fly zone 1 and no-fly zone 3. The IST-SCP method still found a faster path to enter the circling orbit, and entered the formation further upstream in the circling orbit compared with the conventional SCP; (c) the initial path guess for the third batch is between no-fly zone 2 and no-fly zone 3. The path generated by conventional SCP failed to enter the orbit from a closer position, while the path by IST-SCP bypassed no-fly zone 2 and entered the orbit from a closer position.
Figure 10. (a) The initial path guess for the first batch is between no-fly zone 1 and no-fly zone 2. The path generated by IST-SCP jumped out of the local optimum, while the path generated by conventional SCP was trapped in it; (b) the initial path guess for the second batch is outside no-fly zone 1 and no-fly zone 3. The IST-SCP method still found a faster path to enter the circling orbit, and entered the formation further upstream in the circling orbit compared with the conventional SCP; (c) the initial path guess for the third batch is between no-fly zone 2 and no-fly zone 3. The path generated by conventional SCP failed to enter the orbit from a closer position, while the path by IST-SCP bypassed no-fly zone 2 and entered the orbit from a closer position.
Drones 08 00615 g010
Figure 11. (a) Four batches, each with two UAVs, form a regular octagonal formation (number of UAVs in each batch = (2, 2, 2, 2)); (b) three batches, one UAV in the first batch, two UAVs in the second batch, and three UAVs in the third batch, form a regular hexagonal formation (number of UAVs in each batch = (1, 2, 3)); (c) three batches, one UAV in the first batch, three UAVs in the second batch, and two UAVs in the third batch, form a regular hexagonal formation (number of UAVs in each batch = (1, 3, 2)).
Figure 11. (a) Four batches, each with two UAVs, form a regular octagonal formation (number of UAVs in each batch = (2, 2, 2, 2)); (b) three batches, one UAV in the first batch, two UAVs in the second batch, and three UAVs in the third batch, form a regular hexagonal formation (number of UAVs in each batch = (1, 2, 3)); (c) three batches, one UAV in the first batch, three UAVs in the second batch, and two UAVs in the third batch, form a regular hexagonal formation (number of UAVs in each batch = (1, 3, 2)).
Drones 08 00615 g011
Figure 12. (a) Three batches, one UAV in the first batch, two UAVs in the second and third batches, form a diamond-shaped formation (geometric shape of formation = diamond); (b) three batches, one UAV in the first batch, three UAVs in the second and third batches, to form a herringbone formation (geometric shape of formation = herringbone).
Figure 12. (a) Three batches, one UAV in the first batch, two UAVs in the second and third batches, form a diamond-shaped formation (geometric shape of formation = diamond); (b) three batches, one UAV in the first batch, three UAVs in the second and third batches, to form a herringbone formation (geometric shape of formation = herringbone).
Drones 08 00615 g012
Table 1. The parameters of the IST-SCP and SCP.
Table 1. The parameters of the IST-SCP and SCP.
ParameterIST-SCPSCP
Weight   of   the   rendezvous   time   w t 11
Weight   of   the   altitude   penalty   w z 33
Weight   of   the   penalty   on   relaxation   factors   w p 1 × 1021 × 102
The   trust   region   update   rate   α 22
The   trust   region   evaluation   ( ρ l o w , ρ m i d , ρ u p ) (0.1, 0.25, 0.75)(0.1, 0.25, 0.75)
The   initial   trust   region   radius   ( δ x 0 , δ u 0 , δ t 0 ) (1 × 102, 10, 10)(1 × 102, 10, 10)
The   temperature   update   rate   β 0.6
The   range   of   temperature   ( T m i n , T m a x ) (0.1, 1 × 102)
Table 2. Task setting parameters.
Table 2. Task setting parameters.
ParameterValue
Start point of carrier(0 m,0 m)
Velocity of carrier42 km/h
Time interval for takeoff between batches2 min
Number of UAVs per batch2
Number of batches3
Rendezvous altitude600 m
Circling angular velocity0.04 rad/s
Radius of rendezvous orbit3 km
Center of rendezvous orbit(11,880 m, 16,120 m)
Geometric shape of formationRegular hexagonal
Center-to-center distance in the formation24 m
Wingspan of UAV18 m
Table 3. Comparison results of IST-SCP and conventional SCP.
Table 3. Comparison results of IST-SCP and conventional SCP.
The Type of Time NodeIST-SCP/sConventional SCP/s
1st batchTakeoff0
Rendezvous203.05213.80
2nd batchTakeoff120
Rendezvous315.86336.99
3rd batchTakeoff240
Rendezvous405.63428.46
Table 4. Comparison of the rendezvous time under various initial rendezvous times.
Table 4. Comparison of the rendezvous time under various initial rendezvous times.
Initial Rendezvous Time/sRendezvous Time/s of IST-SCPRendezvous Time/s of SCP
320314.52314.37
340315.86336.99
360315.93323.31
380318.25357.68
Table 5. Comparison of the computing time between IST-SCP, GA+SCP, and SA+SCP.
Table 5. Comparison of the computing time between IST-SCP, GA+SCP, and SA+SCP.
Rendezvous Time/sComputing Time/s
6 UAVs in 3 batches
IST-SCP405.63485.99
GA+SCP400.3414,266.16
SA+SCP400.0716,948.33
7 UAVs in 3 batches
IST-SCP404.85528.26
GA+SCP405.2715,654.69
SA+SCP406.1318,595.76
8 UAVs in 4 batches
IST-SCP520.27647.70
GA+SCP524.4919,415.76
SA+SCP522.9622,597.06
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhang, Z.; Sun, L.; Wang, Y. Multi-Batch Carrier-Based UAV Formation Rendezvous Method Based on Improved Sequential Convex Programming. Drones 2024, 8, 615. https://doi.org/10.3390/drones8110615

AMA Style

Zhang Z, Sun L, Wang Y. Multi-Batch Carrier-Based UAV Formation Rendezvous Method Based on Improved Sequential Convex Programming. Drones. 2024; 8(11):615. https://doi.org/10.3390/drones8110615

Chicago/Turabian Style

Zhang, Zirui, Liguo Sun, and Yanyang Wang. 2024. "Multi-Batch Carrier-Based UAV Formation Rendezvous Method Based on Improved Sequential Convex Programming" Drones 8, no. 11: 615. https://doi.org/10.3390/drones8110615

APA Style

Zhang, Z., Sun, L., & Wang, Y. (2024). Multi-Batch Carrier-Based UAV Formation Rendezvous Method Based on Improved Sequential Convex Programming. Drones, 8(11), 615. https://doi.org/10.3390/drones8110615

Article Metrics

Back to TopTop