1. Introduction
In recent years, the problem of target tracking and circumnavigation control has garnered significant research attention due to its wide range of applications. These applications include satellite formation flying, dynamic encirclement by multiple Unmanned Aerial Vehicles (UAVs), and cooperative surrounding missions involving multiple vessels [
1,
2,
3,
4]. In general, the position of the target is initially unavailable for the agents, thus requiring an estimator to estimate the target state [
5,
6]. Particularly, in some military scenarios, to maintain radio silence, the passive target locating and tracking techniques, which uses only the bearings measured by camera, has become a hot research field [
7,
8,
9].
Researchers commonly utilized target bearing data to estimate unknown states and update control algorithms, merging localization and circumnavigation control. This strategy enhances system performance via environmental observation and recognition, thus addressing a dual problem [
10]. In early studies [
11,
12,
13], the method employing an orthogonal projection matrix was introduced to handle unit bearing vectors, enabling an agent to estimate the target position and accomplish the circumnavigation mission of static or slowly moving targets. In [
14], this approach has been extended to multi-agent systems for circumnavigation control of multiple static targets. In [
15], the authors established a condition to ensure a safe distance between agents and targets, preventing agents from entering the proximity of targets during circumnavigation. In [
16,
17], corresponding control frameworks are proposed to enable the estimator’s application to a single or multiple UAVs for target encirclement. Later, the authors of [
18,
19,
20] employed the linear filter and finite-time stability theory to enhance the convergence rate of the estimator. This enhancement allows the multi-agent system to rapidly enclose the targets. For multi-drone motion planning, especially for large drone teams, studying collision avoidance strategies between drones is fundamental and crucial [
21,
22]. Researchers in [
23,
24,
25] further investigated the discrete-time and communication-free circumnavigation control problem, incorporating collision avoidance among agents to enhance the system’s applicability in real-world scenarios.
However, the aforesaid consequences [
11,
12,
13,
14,
15,
16,
17,
18,
19,
20,
23,
24,
25] heavily rely on GPS devices for agents to acquire real-time global position data, which is crucial for target estimation. Yet, in hostile environments, GPS signals are susceptible to disruption or attacks, potentially compromising target tracking effectiveness [
26]. Addressing this challenge, researchers proposed the concept of distributed bearing-only formation control [
27,
28,
29]. Within this framework, agents only need to measure the relative bearings between each other, without reliance on GPS information. Target state estimation is often integrated with multi-agent formation control, utilizing bearing measurements for more sophisticated target tracking and cooperative motion. For instance, in [
30], the bearing rigidity theory and a rotation matrix were used to establish a time-varying formation for encircling a moving target. Additionally, a dynamic compensation approach in [
31] modified traditional target estimators to calculate relative target positions without global information. Building on this, refined target estimators in [
32,
33] relaxed the initial condition requirements, enabling adept navigation around one or multiple targets in circular or convex trajectories. Furthermore, the researchers in [
34,
35] analyzed target tracking and circumnavigation within GPS-free systems involving multiple vessels and second-order agents, proposing tailored algorithms for specific dynamic models. Despite their effectiveness, these estimation and control strategies heavily rely on precise sensor data, and may not adequately address high-speed target scenarios. Addressing these limitations is essential for enhancing performance and expanding applicability.
Motivated by the above discussion, this paper is devoted to solving the target tracking and circumnavigation control problem for a moving target within bearing-based networked multi-UAV systems. The main contributions of this paper are presented as follows:
- (1)
A distributed target estimation algorithm and a circumnavigation control protocol are designed by using bearing measurements of the target. In contrast to the proportional (P) estimator used in [
11,
12,
13,
14,
15,
16,
17,
18,
19,
20,
23,
24,
25,
29,
30,
31,
32,
33], we introduce a PI control method to develop the estimator, which significantly enhances its robustness. It can be proven that the algorithm converges to a bounded range near the true value. Moreover, in comparison with [
11,
12,
13,
14,
15,
16,
17,
18,
19,
20,
23,
24,
25], we take into account practical communication range restrictions and employ dynamic compensation to enable the system to track and encircle the target in a GPS-denied environment.
- (2)
A Kalman filter approach is applied to improve the accuracy of target estimation and circumnavigation control. Each UAV in the system utilizes local information, including relative bearing and velocity data from neighbors, to calculate measurements of the target’s relative position. These measurements are then incorporated into a Kalman filter for target state estimation. This approach enhances circumnavigation tracking precision, enabling successful tracking of the dynamic target within the system. Compared to previous methods [
11,
12,
13,
14,
15,
16,
17,
18,
19,
20,
23,
24,
25,
28,
29,
30,
31,
32,
33,
34,
35], our approach leverages effective communication and collaboration among agents to achieve superior target localization and circumnavigation performance, especially in high-speed target scenarios.
The rest of this paper is structured below. In
Section 2, the problem to be investigated and necessary preliminaries are given. In
Section 3, the distributed target-state estimator and circumnavigation control algorithm are designed. In
Section 4, the precise estimator and modified circumnavigation control strategy by Kalman filtering method are developed. In
Section 5, simulation results are presented. Finally, conclusions are summarized in
Section 6.
3. Distributed Estimation and Circumnavigation Algorithms
To successfully estimate the relative target position
and accomplish the control objectives described in Problem 1, an estimator
and the control input (control protocol)
for each UAV are constructed as follows:
where
,
,
,
,
and
are positive design parameters, and
represents the estimated relative distance between the target and UAV
i.
Remark 1. In (9), the term is equivalent to in Equation (10), and is introduced to compensate for the estimation inaccuracies resulting from the movement of UAV i. The term denotes the estimation of target velocity, structured as a PI controller. In (10), the radial term forces the agent to encircle the target with the desired distance , while the tangential term ensures that the UAV rotate around the target while achieving the desired formation with other agents. In motion control, if is set too small, the angular velocity of the UAV may become very large. To limit the angular velocity, the tangential term is adjusted to be dependent on . We will first demonstrate that the estimation error globally converges to a neighborhood of zero, leading to the following lemma.
Lemma 3. Assuming that the estimation error in (3) is bounded, if the control parameters satisfy , where ξ is a given positive scalar, then the estimator (9) and control protocol (10) will cause to exponentially converge to a circular area of radius centered at origin as , where and are defined in Lemma 2.
Proof. Observing that
, we can obtain
Utilizing the property
and defining
as an auxiliary variable, Equation (
10) can be rewritten as
Obviously, System (12) can be divided into a nominal system
and a perturbation term
. According to Lemma 2, if we can prove the global exponential stability of System (13), then we will be able to find some constants
and
such that Equation (
8) holds, thereby proving Lemma 3. And by following Lemma 1, this objective can be redirected to prove that the unit vector
is persistently exciting.
It is known that if there exist
such that
is satisfied for all unit constant vectors
and all
, then
is PE (see [
9]). Since
, we can always find an upper bound
. Next, we shall prove that a lower bound
also exists. Let
be the angle measured counterclockwise from
U to
; its derivative is
Since and , we have . Therefore, if there exists an upper bound such that , then we can obtain ; thus, one can always find a and a such that (14) is satisfied.
The final step is to show that
is indeed bounded. Defining
, we derive the derivative of
:
and introducing the error
between the distance of UAV
i from the target and the desired distance results in
According to the triangular relationship, we have the following inequality:
Choose the Lyapunov candidate function
. Given that the matrix
is negative semi-definite, we have
. This implies that the estimation error always remains
. Since
is bounded, we can assume an upper bound
that satisfies
. Therefore, adding
on both sides of Equation (
17), we can obtain
From (19), we can see is bounded, and we have . Hence, we have completed the proof of Lemma 3. □
Remark 2. For a proportional-integral (PI) controller, the integral term prevents the error from growing indefinitely by capping it. Given that the target speed has been regarded as a bounded disturbance, the estimation error will consequently remain bounded throughout the target estimation process. By integrating the error over time, the PI controller drives the error towards an acceptable range.
Remark 3. Based on (19), we can also determine a lower bound for . This suggests that by reasonably setting the value of the desired radius , we can ensure that for all , thereby guaranteeing UAV i can consistently obtain the bearing vector while avoiding collisions between the UAVs and the target at all times.
Next, on the proof of Lemma 3, we will show that the distance error globally converges to a ball centered at zero for all the UAVs.
Theorem 1. Under the estimation strategy (9), the controller (10) makes the relative distance between the target and UAV i converge exponentially fast to a bounded circular area centered at with radius .
Proof. From (17), and using the triangular inequality (18), we have
Using Inequality (8) in Lemma 2, (20) can be rewritten as
and letting
, we finally obtain
Therefore, as indicated by (22), the distance between the target and each UAV i converges to a neighborhood around the desired radius as . □
Finally, we aim to demonstrate that the UAVs will arrange themselves in a regular polygon formation along the circumference of a circle, thereby achieving an optimal enclosing performance.
Theorem 2. For all and , assume that the communication radius satisfies Then, under estimator (9) and control law (10), each converges exponentially to a bounded circular area centered at .
Proof. According to Theorem 1,
converges exponentially to a ball centered at zero with radius
. Therefore, under (23), there exists a time
such that for all
, we have
. This implies that each UAV
i will always maintain a neighbor
, while simultaneously ensuring that
. Then, we calculate the derivative of
:
Define
, and let
be the Laplacian matrix of the time-varying networked graph
, where
. We can rewrite Equation (
24) as
where
,
,
and
. Since
G is a strongly connected graph, the
in equation
will achieve consensus:
. Thus, let
, and we can obtain
It is obvious that the disturbances , , are bounded, and according to Lemma 2, the will converge exponentially fast to a ball centered at zero. □
Based on Lemma 3, and Theorems 1 and 2, we have successfully achieved the estimation and control objectives (4)–(6). However, when the target’s speed is high, the estimation error given by (9) and the control error for circumnavigation radius and interval angle given by (10) are unsatisfactory. Therefore, in the subsequent section of this paper, we will enhance our estimation and circumnavigation strategy by incorporating the Kalman filtering method to improve the encircling tracking performance.
4. Kalman Filtering Strategy
In this section, we will first utilize the local information exchange between neighboring UAV and UAV i to obtain an observation of the target’s relative position. Then, by employing the Kalman filtering method, along with estimator (9), we can achieve a more accurate estimation of the target state, thereby improving the overall performance of collaborative target encircling tracking in the multi-UAV system.
As shown in
Figure 1, when
, using trigonometric relationships, we have
where
. By Lemma 5 in [
38], for the angle
between any two unit vectors
, we have
. Thus, we can rewrite (27) as
By calculating the derivative of
, we can obtain
In a practical environment, the values of
and
can be measured directly by UAV
i. Then, by substituting Equation (
28), we can finally obtain
as the observation of the target’s relative position for each UAV
i:
Remark 4. From Equations (27)–(30), we can easily see that the relative position observation is calculated using the measurements of , , and their rate of change. In the theoretical analysis section, we do not consider the measurement noises for , , and their rates of change. However, the error of can be affected by communication packet loss and bit errors between drones. We can model the observation noise as white Gaussian noise with zero mean and covariance , where is a Dirac function and is the standard deviation of the noise.
By utilizing
with a Kalman filter, we can apply the following equations [
39]:
Here, X and Z represent the estimated and observed states of the target, determined by and , respectively. B and u represent the control input matrix and the control input. A and H are the state transition and observation models. The terms w and denote process and measurement noises, following distributions and . Q and R are the covariance matrices of process and measurement noises. P represents the state covariance matrix, K is the Kalman gain factor, and denotes the estimated target state obtained through Kalman filtering.
Thus, we can improve the accuracy of the target state estimation in Equation (
9) and obtain
from
in (31), where
and
represent the relative position and relative velocity of the target after the Kalman filtering process. Then, by
, each UAV can estimate the global velocity of the target:
and by letting
, we can modify the controller (10) by
Remark 5. The updated control law (33), in comparison to the preliminary controller (10), replaces with , allowing agent i to employ more accurate relative distance estimation for achieving encirclement. Additionally, the additional term is introduced to compensate the circumnavigation error caused by the target’s movement. If is similar to the actual target velocity, the problem can be treated as a static target circumnavigation problem.
Theorem 3. Through the estimated target state and the modified control law (33), when , we can finally obtain , and .
Proof. Similar to the proof process of Corollary 1 and Corollary 2 in [
16], based on Lemma 3, along with the unbiasedness of the Kalman filter, we can directly conclude that
, for
. Here,
is the real state of the target. And from (32), we can also acquire
Therefore, as
, we have
Thus, the target tracking problem is transferred into the problem of encircling a static target, which means we can let
in Inequality (22), and the disturbances
,
, and
in Equation (
26) will finally vanish. That is, as
,
and
. □
Remark 6. In some cases, there may be situations where the observed value of is lost. This can happen when , or when the target, neighbor UAV, and UAV i are lined up in a straight line. In such situations, if the observed value is lost during the filtering process, we will use the estimated value instead of . This allows us to continue estimating the state of the target.
At last, we demonstrate the implementation process of our algorithm.
- Step 1:
Define two integers, k and N, where k is the number of simulation steps, and N denotes a pre-set threshold value. Additionally, initialize the estimation of the relative target position , the error covariance matrix , prediction covariance matrix Q, and measurement covariance matrix R in the filter.
- Step 2:
When
, we estimate the target state using
in Equation (
9). Then, we update
as
,
as
in control law (33). Finally, apply the updated control law to drive each UAV
i.
- Step 3:
When
, we start the Kalman filtering process. Each UAV
i begins to calculate
in Equation (
30); if the observed value is not available, we let
. Next, for agent
i, by calculating
, we using the modified controller (33) to eventually complete the target encircling tracking.
5. Numerical Simulations
In this section, we will demonstrate several numerical examples to illustrate the theoretical results obtained in the previous sections.
Consider a formation of five circumnavigating UAVs and an unknown dynamic target in a GPS-denied indoor environment; the initial positions of the target and UAVs are assigned as , , , , and . The initial estimates of the relative target position are , , , and . The desired circumnavigating radius is set as .
The designed parameters in Estimator (9) and Controller (10) are given as , , , , and . Let represent the simulation step size, and set to be 0.01 s. We define two different speed levels for the target: a velocity of for low speed and a velocity of for high speed, where .
We first assess the impact of the rough estimator (9) and control protocol (10) on tracking the low-speed target. The simulation results, as depicted in
Figure 2, reveal that while the drones can maintain an accurate estimation of the target’s relative position, the effectiveness of the circumnavigation is not satisfactory. It is evident from the results that, with only the rough distributed estimation and circumnavigation algorithm, the target state estimation ultimately converges to a neighborhood of the accurate value. The distance error
between agent
i and the target also converges to a neighborhood of zero, and the separation angle
between two neighboring UAVs converges to a neighborhood of the desired value
. However, it is apparent that the results are not ideal, as the circumnavigation errors exhibit obvious fluctuations.
Now, we adopt the filtering strategy. We set
, and utilize the observed value
from Equation (
30) and the modified control law (33). Additionally, let the state transition matrix
and the observation matrix
, we select the following target dynamic model [
40,
41]:
where the state vector
has a dimension of
. The
vectors
,
,
refer to the target state variables to be estimated, respectively, representing the position, velocity, and acceleration in the k-th step of the simulation.
represents the observed values of the target state.
and
represent process noise and measurement noise, respectively, assumed to be zero-mean Gaussian white noise. We set
,
and
. The simulation results in low-speed target scenario are shown in
Figure 3.
By comparing
Figure 3 with
Figure 2, we can observe that the application of the Kalman filter has further improved the accuracy of estimating the target’s relative position and the circumnavigation control, enabling the multi-UAV system to accurately encircle and track the low-speed moving target.
Then, we begin to discuss the high-speed scenario; the corresponding results are depicted in
Figure 4,
Figure 5 and
Figure 6. From these results, it can be seen that the system also performs well in circumnavigation tracking of high-speed moving targets. It is able to accurately estimate the target’s state and execute the circumnavigation maneuvers precisely.
Considering the potential disturbance to the UAVs caused by air currents, which could lead to interference in bearing measurements, we conducted a simulation experiment. The simulation involved perturbing the bearing unit vectors
and
with Gaussian noise
. The noise follows
and
. The simulation results, as shown in
Figure 7, indicate that the system still maintains satisfactory circumnavigation performance around the high-speed target, demonstrating a significant level of robustness against noise.
Finally, to substantiate the performance of our algorithm, we have conducted comparisons with the methods in reference [
16,
31]. In reference [
16], the authors employed a single UAV equipped with GPS technology. They used the Virtual Intersecting Location Algorithm (VILA) to calculate a virtual observation value of the target state. Subsequently, by utilizing a Kalman filter and an initial estimation algorithm, the UAV estimated the target state and successfully accomplished the task of encircling the target. We selected the estimator (3) and the controller (17) from that reference, with the UAV’s velocity
v set to 20 and the estimation parameter
set to 200. The circumnavigation tracking results for both low-speed and high-speed targets are shown in
Figure 8.
Additionally, in reference [
31], the authors considered a scenario where five agents track a moving target in a GPS-denied environment. We adopted the estimation method (18) from that reference, and set the parameter
to 12. The initial states of all agents were adjusted to match our situation. The simulation results of the comparative analysis are depicted in
Figure 9. For the high-speed target, simulated control indicators show significant deviations from their desired values. Therefore, corresponding simulation illustrations are not be presented here. It is evident that neither of the two existing methods can ensure accurate target encircling tracking when the target has a high speed.
Through
Figure 8 and
Figure 9, it can be observed that existing typical algorithms can only successfully achieve encircling tracking in low-speed target scenario, while they perform poorly, or even fail, in encircling tracking of the high-speed target. By comparing with existing algorithms, the effectiveness of the algorithm designed in this paper is further validated.