1. Introduction
The need to explore various environmental boundaries has motivated extensive research on using mobile robotic platforms for such a purpose; see, e.g., Refs. [
1,
2,
3,
4,
5,
6,
7,
8,
9,
10,
11,
12,
13] and the literature therein. A typical mission is to find and arrive at the level set where an unknown environmental field assumes a specific value and then sweep the entirety of this set, thus exhibiting and putting under control the border of the region with the greater field values. Examples include finding the flows of air pollutants or contaminant clouds [
14] and tracking zones of turbulence or high radioactivity level, to name just a few. In such missions, typical challenges include a paucity of a priori information about the field, obsolescence of the data collected online due to the field changes, and the capacity of the available sensors to measure only the field value at the current location via immediate contact with the sensed entity, e.g., with a transparent gas.
Recently, much attention has been given to navigation algorithms that enable mobile robots to localize, approach, and cover the environmental boundary of interest. A large group of the algorithms relies on access to the field’s gradient [
8,
12,
15,
16,
17,
18]. This group is exemplified by, e.g., the methods based on multi-agent estimation of the gradient [
17], cooperative contour estimation [
2,
16], and gradient-based artificial potentials [
18]. However, the possibility to directly measure the field’s gradient is uncommon, whereas reliable gradient estimation from noisy measurements of the field value is still an intricate challenge in a practical setting [
19,
20]. Also, such estimation calls for measurements in neighboring locations distributed across all dimensions, whereas exploration of an environmental boundary motivates to place the sensors on this lower-dimensional structure. Finally, communication constraints may hinder transfers of field measurements to the gradient estimator, wherever it may be built.
The alternative, gradient-free methods do not attempt to assess the gradient and are well fitted to the situation of pointwise measurement of the field value only. Some such methods (exemplified by [
4,
21]) implement oscillations around a profitable path, thus enabling the robot to collect field data from a whole corridor hosting this path. This approach raises concerns about a waste of resources due to systematic and mutually nullifying shifts sideways. Common image segmentation techniques are used in [
1] to monitor a forest fire-front by a team of UAVs. However, these findings are not rigorously and completely justified. A PID controller empowered by an extended Kalman filter and adaptive crossing angle correction scheme is justified in [
22] for a holonomic planar mobile robot. To drive a Dubins car-like robot along an isoline of a planar field, a PD controller is presented in [
23], and its local convergence in radial harmonic fields are proven, whereas [
24,
25] offer sliding mode controllers whose global convergence is rigorously justified for generic smooth steady [
24] and time-varying [
25] fields. The findings of [
24] are extended to the case of multiple robots in [
26], where the algorithm also ensures effective self-deployment of the robots over the environmental boundary.
The expansion of drone technology motivates the interest to navigate drones using all three dimensions. However, the literature on the sensor-based robotic tracking environmental boundaries has focused so far on the case of 2D or those reducible to 2D. Few exceptions [
12,
27] deal with a single tracking robot; it is supported by a sensing robot in the context of [
12]. The controller from [
12] assumes communicating robots modeled as simple integrators and also that the field evolves subject to an advection-diffusion equation with the fully known constant parameters; these assumptions about the field are generally challenged in practice. The control algorithm from [
12] requires a computationally expensive online solution of a partial differential equation, and the completeness of the coverage of the level set is not addressed. In [
27], a gradient-free control law is presented that drives a non-holonomic underactuated mobile robot to an unknown and unsteady environmental boundary in 3D and then ensures its exhaustively sweeping.
Contrary to the scenario of a single robot, the strength of drone technology greatly stems from the use of large teams of simple and low-cost devices. Reaping this benefit requires multi-agent control strategies that are robust, fault-tolerant, distributed, and homogeneous in the sense of identical roles of the teammates. Other requirements include low consumption of energy, computational, and communication resources, as well as rigorous guarantees for global convergence. Among the survey of papers on sensor-based robotic tracking environmental boundaries in 3D, the authors, however, failed to come across one addressing these issues.
This paper seeks to fill this gap while combining the above issues with that of constraints due to non-holonomy, under-actuation, and a limited control range of the robots. Whether the plenty of the identified factors and requirements allows solving the mission by a computationally inexpensive, low-level controller that directly converts the current observation into the current control and is, nevertheless, justified by a rigorous global convergence result? The paper answers in the affirmative and offers respective details, including the techniques and concepts of justification.
Specifically, we consider a swarm of UAVs whose kinematics are described by the Dubins vehicle model [
28,
29]. Every UAV moves in 3D with a constant speed in the longitudinal direction and is steered by the yawing and pitching rates, which are limited in magnitude. This model applies to, e.g., fixed-wing UAVs, torpedo-like underwater drones, surface vessels, and various rotorcraft [
28,
29].
The UAVs cannot distinguish among their peers and cannot play distinct roles in the team; they are unaware of the team’s size. Any UAV has access to the vertical direction, is aware of its own speed, can assess the altitudinal and Euclidean distances to the objects within a finite visibility range, and measures the value of an unsteady and unpredictable scalar field at the current location. All UAVs should find and reach the moving and deforming level set (isosurface) where the field assumes a given value. They should also distribute themselves into the densest net across a pre-specified altitudinal range (this may be, e.g., the range of particularly important altitudes or those at which the UAVs can operate). After this, every UAV should repeatedly circumnavigate the isosurface at its own altitude selected in a distributed fashion, thus forming an altitudinally densest and horizontally complete dynamic barrier around the isosurface for exposition, surveillance, processing, or protection. All these goals should be achieved via independent decisions of the UAVs according to a common rule and with no communication among them.
The paper presents a gradient-free navigation law that meets the above requirements. Moreover, we first disclose conditions necessary for the mission to be achievable. Then we show that the proposed law solves the mission under only a slight and somewhat inevitable enhancement of these conditions. This is done by means of a mathematically rigorous global convergence result. Basic theoretical findings are confirmed and complemented by computer simulation tests.
This paper develops some ideas reported in [
25,
27]. However, Ref. [
25] deals with a planar workspace so that its findings are insufficient to cope with special challenges inherent in 3D environments. Meanwhile, [
27] considers a single-robot scenario and so contributes nothing to the issue of inter-UAV cooperation, which is the major focus of the current paper. Moreover, despite some similarity in processing the field measurements, analysis of the entailed behavior with respect to isosurface finding and tracking aspects must be fully updated as compared with [
27] due to the critical coupling of the concerned control loop with that of regulating the inter-UAV altitudinal gaps.
The following general notations are used in the paper: means “is defined to be”, “is used to denote”; the dot · in notations like is the placeholder of the argument of the function f; the symbols , , and × stand for the standard inner product, Euclidean norm, and cross product in , respectively.
2. Problem of Sweep Coverage of an Isosurface
A team of N UAVs travels in 3D. Every UAV moves with a constant speed in the longitudinal direction over paths of bounded curvatures, driven by pitching and yawing rates limited in absolute value. There is an unknown and unsteady environmental field described by a scalar function of time t and space location . All UAVs should arrive at the locus of points (called isosurface) with the pre-specified field value . Then they should sweep this isosurface while uniformly distributing themselves over it. The UAVs are not equipped with communication facilities. So coordination of their motions should be based on only individual sensory data and be achieved in a fully distributed fashion.
Further specification of the targeted distribution assumes that a certain space direction is given by a unit vector ; the associated coordinate of point is loosely referred to as altitude. The mission is confined to a certain altitudinal range ; for example, this may be the range of particularly important altitudes or altitudes at which the UAVs are able to operate. Self-distribution of the UAVs into the densest net across the range should be achieved, whereas each of them should fully circumnavigate the isosurface at its own altitude.
The ith UAV has access only to the field value at its own current location , and has no idea about the distance to or bearing of the targeted isosurface . The ith UAV also has access to in its frame of reference, is aware of its own speed , and can assess both the altitudinal and Euclidean distances to the objects, including the top/bottom of the altitudinal range , that lie within a given “visibility” range . The UAVs do not know their total number and cannot distinguish between their peers or play different roles in the team. The last circumstance implies that all UAVs should be driven by a common control rule and cannot be assigned individual serial numbers that influence the control input.
To unveil the structure of the densest net across
, we denote by
the worst-case distance from a point of
to a finite set
. It is easy to see that the minimum of
over all sets
with
N elements is attained at the set
Now we flesh out the targeted collective behavior of the considered team of UAVs.
Definition 1. The team is said to form the densest horizontally sweeping net on the isosurface in the range from to if the UAVs can be labeled with i from 0 to so that as , where is the altitude of the ith UAV.
The imposed information constraints mean that any UAV is unaware of the altitude assigned to this UAV in Definition 1, may have no access to the end-points of the altitudinal range since they are beyond its range of visibility, and cannot receive these data from more informed teammates (if they exist) due to the lack of communication facilities.
It is required to design a common control rule by executing which every UAV individually builds its own control based on the available data, whereas the entire team acquires the property described in Definition 1. Given an ever-growing use of mass-produced, cheap, relatively small-sized, and, as a result, energy and computationally constrained drones, this rule is welcome to be computationally inexpensive and exhibit a regular energy-efficient behavior. Whether the entire range of the above rather diverse and partly contradictory wishes and goals may be compromised and attained?
For theoretical analysis, we employ a truncated model of the kinematics of a rigid-body robot moving with a constant speed in the longitudinal direction [
30,
31,
32,
33]. This model disregards the roll motion and is used in vector form borrowed from [
31,
32]:
Here
is the control, the upper bound
on its magnitude is given,
is the constant speed of the UAV,
is the unit vector along the centerline of the
ith UAV (see
Figure 1), and the third equation in (2) “keeps” the length of
constant. The model (2) captures the robot’s capacity to travel over space paths whose curvature radii
. The scope of applicability of this model is discussed at length in Remark 2.1 in [
31] and includes fixed-wing UAVs, various rotorcraft, and torpedo-like underwater drones. Due to a one-to-one correspondence
in Remark 2.1 in [
31], the vector control input
can be replaced by the pitching
and yawing
rates. In fact, (2) is a 3D extension of the standard Dubins-vehicle model of an aircraft or boat in a plane [
34,
35,
36,
37].
3. Proposed Hybrid Controller
It uses the following tunable free
parameters
and two
functions and
ð, where
maps
to
and
ð maps
to
. The choice of these parameters and functions is discussed in
Section 6.
For any
i, the
ith UAV builds and updates the set
of its
essential neighbors by executing the following instructions:
The set
can be immediately replenished due to the second line in (4). The use of not equal but different parameters
in the second and third lines, respectively, is aimed at suppressing excessive sliding-mode phenomena via preventing the situations where a just enrolled peer
j should be immediately excluded and vice-versa. The sets
and
of
essential higher and lower neighbors, respectively, are defined as
The
ith UAV can compute these sets from the available sensory data since the definitions of these sets use only the relative altitudes and the distances to the teammates within the visibility range, which data are accessible to the
ith UAV. The
scaled higher and lower altitudinal gaps near the
ith UAV are defined to be
Here either + should be put in place of ± everywhere, or the same should be performed with − instead of +. If and , there are no essential neighbors between the UAV at hands and the top/bottom of the altitudinal range ; then is twice the altitudinal distance to the top/bottom. Given a UAV, the gaps do not depend on the enumeration of the UAVs and so are computable in the current situation where the teammates are anonymous to one another; see Remark 1 for more details.
By the third equation in (2), the control input
must lie in the plane
normal to the centerline vector
(the pitch-yaw plane). We define
in a special orthonormal basis
of
, where
is the orthogonal projection of the vertical vector
onto the pitch-yaw plane
, which projection is then normalized to the unit length:
Here is the angle between the centerline and the vertical . As will be shown in (i) of Theorem 1, our controller ensures that any UAV is always non-vertical, so both the vector and the considered basis are well-defined.
Our control law is hybrid with two modes:
(approaching the isosurface) and
(main mode). Any UAV
i starts in
and then switches to
. The role of mode
is to find the targeted isosurface and to arrive at its close vicinity defined as the locus of points where the field value differs from
by not more than a pre-specified and nominally small
from (3). Vertical distribution of the team is the task of the next mode
so that the global search of the isosurface during mode
is not disturbed by control signals aimed for other purposes. The switching rule is as follows:
The control rule invokes (like (8) does) parameters from (3) and is as follows:
where
Here
and
ð are functions that are to be chosen by the designer of the controller (see (20)–(29) for details),
is the time of the transition
in (8), and
is a linear function of
with saturation at the threshold
mentioned in (3).
The
ith UAV can compute the derivative
since it is aware of its own speed
and has access to the vertical vector
in its local frame illustrated in
Figure 1. Numerical differentiation can be used to assess the time-derivative
of the sensor readings
. Estimating the derivatives from noisy data is a well-researched discipline offering many methods; any of them is acceptable and welcome to implement the controller. Among these methods are, e.g., optimal schemes based on stochastic models, observers with sliding modes, difference methods; see [
19,
38,
39] for a survey.
The proposed design of the control system is illustrated in
Figure 2. The block conditionally entitled “inclinometer” is responsible for access to the vertical vector
in the local frame of reference of the UAV at hand. It thus provides access to the angle
between
and the centerline
of the UAV and the vector (7). The block in the lower right corner of the diagram illustrates the procedure (4) of forming the set of essential neighbors. The lower and upper positions of the switch in the diagram correspond to modes
and
, respectively.
Except for the coefficient , the control rule (9) is common for all robots. Remark 2 discusses when complete uniformity can be achieved by picking common for all i.
Remark 1. To facilitate understanding the procedure for determining , its first step was pictured as building the sets of labels j. However, the robots cannot figure out these labels. So actually, these “absolute” labels are not involved: on its own choice, robot i labels the available relative distances , uses time-invariant labels for continuously changing distances, and processes exactly these labels. All of that is performable based on the available data.
Since the control law (9) and (10) is discontinuous, the solution of the closed-loop system is meant as that of the differential inclusion obtained via Filippov’s convexification method [
40]. Given an initial state, a solution exists and does not blow up in a finite time due to the boundedness of the controls.
4. Mission Feasibility and Assumptions
To avoid overly restrictive assumptions in our theoretical analysis, we first disclose conditions necessary for the mission feasibility. They display the necessary balance between the level of maneuverability of the UAVs and the challenges from the contortions and motion of the targeted isosurface. Our assumptions will be only slight and partly inevitable enhancements of these necessary conditions. To disclose them, we need the following characteristics of the unsteady environmental field:
, spatial gradient of the field;
, unit vector normal to the associated isosurface (AI) that passes through location at time t;
, angle from this vector N to the horizontal planes;
, projection of this vector N onto the horizontal plane normalized to the unit length;
, normalized projection of the unit vertical vector onto the plane tangent to the associated isosurface;
, horizontal section (of the -isosurface) at the altitude h;
, unit vector tangential to the horizontal section that passes through location at time t;
, second fundamental (quadratic) form of AI, i.e., the signed curvature of the intersection of AI with the span of a unit tangent vector
V and
N; see Section 4 in [
41];
, maximal (in absolute value) eigenvalue of this quadratic form;
, front velocity of the associated isosurface;
, front acceleration of the associated isosurface;
, angular velocity of rotation of the associated isosurface;
, density of isosurfaces, which evaluates their number (assessed by the range of the field values) within the unit distance from the associated isosurface;
, proportional growth rate of the density with time;
, proportional tangential gradient of the density ,
, proportional growth rate of the density under the normal shift.
The last seven quantities are rigorously defined in Appendix A in [
27].
By Definition 1, ideally carrying out the mission includes moving over the horizontal section
at a fixed altitude
. Since the size
N of the team and so the altitudes
’s are unknown to the team members and no UAV is assigned its own altitude
a priori, it is fair to require that any UAV be able to trace the whole of
within the
operational zone at any altitude
in the given altitudinal range
. If this requirement is met, the isosurface is said to be
trackable by this UAV. For the sake of convenience, we define the zone
in terms of the extreme values
(
) achievable by the field
F in this zone:
Since any UAV (2) can trace only regular (i.e., differentiable with a nonzero derivative) curves, the above trackability may hold only if any curve is regular. This may be violated not only because of the non-smoothness of the field but also due to the zero spatial gradient. Hence the following is compelled by necessity.
Assumption 1. In an open vicinity of the operational zone (12), the field F is twice continuously differentiable, is not singular , and the horizontal section of the -isosurface at any altitude is not empty.
Conditions necessary for the mission feasibility are as follows.
Lemma 1. Let the isosurface be trackable by the ith UAV. Then at any temporal-spatial point of , the following inequality holds: If, in addition, the normal N to the associated isosurface is not vertical , thenand the inequality holds with any sign in ±. This lemma is immediate from Proposition 4.1 in [
27].
Inequality (13) means that projected onto the normal to AI, the speed of the ith UAV is enough to compensate for the normal displacement of AI in order to remain on the moving AI. Meanwhile, (14) means that while keeping its altitude unchanged, the UAV can remain on AI by meeting the challenges from the translational acceleration of AI and the Coriolis and centrifugal accelerations caused by the motion of the UAV over AI. We slightly enhance inequalities (13) and (14) by assuming that they hold with > put in place of ≥ and do not regress as or (if applicable) .
Assumption 2. In the operational zone , Assumption 1 and inequalities (13) and (14) hold in an enhanced form: there exist , , such that , and for all i, Here (16) guarantees that and so the normal N is not vertical, i.e., the prerequisite for (14) is met.
In the real world, the next assumption is typically satisfied.
Assumption 3. In the operational zone, the basic characteristics of the field stay bounded: The control objective pursued in this paper tacitly assumes that the isosurface can be horizontally circumnavigated and so is “horizontally” bounded. However, Assumptions 1–3 do not guarantee this. So we need another assumption.
Assumption 4. There exists a constant such that for any t, the distance between the horizontal projections of any two points from does not exceed .
Before applying the control law (9) and (10), the UAVs are to be driven to or put in special postures. Since this is trivially performable, we do not come into implementation details and merely describe those postures.
Assumption 5. Initially, all UAVs are oriented horizontally and are in the interior of the operational zone at distinct altitudes.
The requirement to the altitudes can be met with probability 1 if, for example, every teammate is instructed to preliminarily reach its own altitude that is independently drawn for any of them from a common continuous probability distribution.
If Assumption 5 holds and
, then the
ith UAV does not leave its initial horizontal plane. Let, in addition, the control input
continuously depends on time and
where
is the upper bound from (2). Then the
ith UAV moves over the boundary of one of two horizontal discs
, making a full turn for
units of time. We assume that these discs lie in
and that the UAV’s turning rate exceeds the mean rate (over some initial period of time) at which the isosurface rotates about the vertical axis.
Assumption 6. For any UAV i, there exists a natural number such that the following statements hold for the time interval :
- (i)
During this interval, the horizontal projection of the unit vector normal to the associated isosurface rotates through an angle that does not exceed ;
- (ii)
The initial discs lie inside the operational zone (12) during the time interval:
By (16), the normal N is not vertical. Hence its horizontal projection is nonzero, so the vector and its rotation angle are well-defined. If the field is steady , this angle is zero, and (i) does hold with .
5. Chimerical Solutions
Under the control law (9) and (10), the closed-loop system is described by an ordinary differential equation (ODE) with a discontinuous right-hand side (RHS). In the theory of such ODE’s, studies on the phenomenon of sliding have been primarily confined to the case of attractive discontinuity surfaces up to now. Only the slightest attention has been paid to non-attractive ones. For them, the discussion has been typically brief and limited to a reference to the very possibility of sliding, with a two-side repelling surface
S exemplified in
Figure 3a being the most popular subject of focus.
However, there is much more diversity in sliding surfaces than mentioned above.
Figure 3b,c presents two examples, where the surface of interest
S contains a single (green) point and has the zero dimension. This surface hosts a sliding solution
(“staying still at
S”), whereas some other solutions reach
S in a finite time and can be continued by
. These are those starting in the pink domain, which has a nonzero area in
Figure 3c. In
Figure 3a,b, the sliding solution is non-viable and can be treated as nonexistent in reality since it is catastrophically sensitive to arbitrarily small disturbances. Specifically, almost all (for a continuous probability) of them bring the state in the white domain, after which the state essentially deviates from the sliding solution on any finite time interval. This deviation is not small for small disturbances, and a nonzero lower bound on the deviation is determined by the interval. Moreover, disturbance causes an immediate repulsion from the sliding solution in
Figure 3a, which also holds in
Figure 3b if the state is brought to the angles A or C. If it is brought to the angles B or D, repulsion still occurs with probability 1. Still, it commences after a transient, whose duration is proportional to the disturbance magnitude. Meanwhile,
Figure 3c shows that the overall diversity of repulsive behaviors is richer than those just discussed. For example, suppose all directions of disturbance are equiprobable. In that case, the disturbance brings the state to the pink domain in
Figure 3c with a nonzero probability, and then the solution returns to
in a finite time. Simultaneously, the disturbance brings the state in the white domain with a nonzero probability, and then the solution diverges far away from
.
The apathy of the classic theory to the detailed classification of the entire range of behaviors possible near sliding solutions partly stems from being aimed at building a controller that imparts a useful feature to the system, e.g., reduced dimension, robustness against disturbances, etc., which objective calls for an attractive sliding surface. With no intention to fill the identified gap, we offer a general concept of sliding solutions that “do not exist in reality” in the fashion similar to
in
Figure 3a,b. The rationale for this selection is that such solutions can be formally found in our closed-loop system, whereas they can be ignored in a practical setting and so in the results.
With these in mind, we consider a differential inclusion (DI)
on a Riemannian manifold
, where the RHS is a convex, compact, and nonempty subset of the tangential (to
at
) space and the map
is upper-semicontinuous; these properties imply local solvability of the initial value problem by Theorem 6.2 in [
42]. (For the swarm of the UAVs (2) driven by (9) and (10), the points of
are the team’s states
with
and
from the unit sphere in
centered at the origin, and
is obtained via the Fillipov’s covexification procedure [
40]).
Let
be a (maybe infinite) time interval. A solution
of the DI is said to be
fully chimerical if for any finite subinterval
, there is
such that almost all (with respect to the Lebesgue measure or, equivalently, with respect to any continuous probability distribution) initial states
from a sufficiently small ball centered at
give rise to a solution
whose maximal deviation from
on
is no less than
, irrespective of how close
is to
. The solution is said to be
chimerical if it is fully chimerical on some subinterval of
. In
Figure 3b, the sliding solution
is fully chimerical, whereas examples of chimerical solutions are given by those that start on the pink line, then arrive at
S, then stay at
S, and then (possibly) depart from
either to the left or to the right. Fully chimerical and so chimerical solutions are nonexistent in practice since they are not stable against inevitable disturbances and errors in sensors, computational units, and actuators, including quantization errors.
With no intention to fully categorize the converse case, we say that the solution
is
firmly corporeal if, for any closed finite subinterval
, that maximum deviation goes to zero whenever
, and
corporeal if there exists a finite set
F such that the solution is firmly corporeal on every connected component of
. If
, the latter means existence of times
such that the solution is firmly corporeal on
, and
.
Figure 3b shows that the solutions starting in the white domain are firmly corporeal. Any initial state from the pink line gives rise to exactly two corporeal solutions: they go over this line to point
S and then immediately leave it either to the left or right. The set
F consists of a single time when the trajectory passes through
S; at this time, the solution branches in two directions.
6. Main Results
We first skip tedious tuning details and show that the proposed navigation scheme is enough to solve the mission under minimal and partly inevitable assumptions.
Theorem 1. Let Assumptions 1–6 hold, and the visibility range be not overly small: . Then the parameters (3) of the control law and the maps χ and ð in (9) and (10) can be chosen so that the closed-loop system has a corporeal solution defined on , and for any such solution and moreover, for any non-chimerical one the following claims are true:
- (i)
Any UAV is never vertically oriented: ;
- (ii)
The output of the control law (9) is feasible, i.e., the third and fourth relations in (2) do hold;
- (iii)
The team members do not collide with one another;
- (iv)
They are always in the pre-specified altitudinal range and the operational zone (12);
- (v)
The team forms the densest horizontally sweeping net on the targeted isosurface in the range from to , as is specified by Definition 1.
Moreover, let a compact set of initial states be given such that any its element satisfies Assumptions 5 and 6. Then common values of the controller parameters (including the functions χ and ð) can be chosen so that (i)–(v) hold whenever the initial state of the team is in .
Theorem 1 means that our control law ensures the attainment of the posed objective.
By Assumption 4, the requirement means that if the UAVs are close to the targeted isosurface and the even distribution over the altitudinal range is nearly attained, the “altitudinally adjacent” robots “see” each other. Meanwhile, at the initial time the UAVs are permitted to be arbitrarily distributed over the range (modulo that different UAVs should be at distinct altitudes by Assumption 5) so that some “altitudinally adjacent” robots may not “see” each other since the distance between them exceeds . However, (v) in Theorem 1 and the first sentence in the current paragraph imply that this unwanted situation is eventually eliminated under the action of the proposed algorithm.
Theorem 1 neglects chimerical trajectories. According to
Appendix B,
Appendix C and
Appendix D (see Lemmas A8, A18 and A19), such trajectories possess at least one of the following two features. (1) On some time interval
, the state remains on a two-side repelling (like in
Figure 3a) surface described by
with some
i. (2) There exist
and
such that the UAVs
i and
j constantly remain at a common and constant altitude for
. Chimericality means that both features are unrealizable in the closed-loop due to instability against arbitrarily small perturbations, errors, and noises.
When discussing controller tuning, we have in mind the situation of multiple initial states from . (The case of a single initial state is that of a singleton set .)
Preparatory choice of from (9). In (19), we put some in place of and thus acquire larger disks . If , they are close to (uniformly over all initial states from ) and so lie inside the operational zone for by (ii) of Assumption 6, which thus remains true if is replaced by . We pick such , with a view to possibly push it closer to afterward.
General requirements to the functions and ð from (9) and (10). We use continuous and piecewise smooth functions that map
to
and
to
, respectively, and are such that
Examples include
and
Switching and saturation thresholds from (8) and (10), respectively, and the parameter from (4) are chosen so that:
Here and are taken from Assumption 2, and from Assumption 4. In (23), assesses the remoteness of the targeted isosurface from the borders of the operational zone (12). Thanks to the assumption introduced in the body of Theorem 1, the conditions (23) and (24) are feasible: they can be satisfied by picking and close enough to and 0, respectively.
An auxiliary parameter is recommended to be picked at this tuning stage to simplify the subsequent choice of the basic parameters.
The slope of in (10), the parameters in (9), the upper bounds in (21), and the upper bound in (22) are subjected to the following constraints:
Here are taken from Assumption 2, from Assumption 3, and from (2). At least, the requirement (26) to and means that its left-hand side is less than . If this is satisfied, (26) gives an upper bound on the choice of the auxiliary parameter . Putting this bound to (27) in place of results in an “-free” form of the conditions on the controller parameters, whose format is, however, rather cumbersome and so is not user-friendly. This is the rationale for using .
Inequalities (25)–(29) are feasible. Indeed, it suffices to note that the left-hand sides of (25)–(27) go to 0 as , whereas the RHS’s are positive constants. So to meet (25)–(27), it suffices to pick small enough and close enough to . Then (28) can be ensured by further decreasing . After this, (29) can be satisfied (while not violating (25)–(28)) by further decreasing .
These considerations give guidelines for experimentally tuning the control law.
The final choice of the functions and ð from (9) and (10): They are chosen subject to the above general requirements and the already chosen upper bounds .
The time from (8) is chosen so that , where is taken from Assumption 6.
Theorem 2. The statement of Theorem 1 is true if the parameters (3) of the control law and the maps and in (9) and (10) are chosen based on the above recommendations.
So these recommendations can be used for analytically tuning the controller if the involved bounds on the parameters of the field or their estimates are available.
Remark 2. Implementing the above recommendations results in controller parameters common for all robots, with the only exception of . For homogeneous robotic teams (but not only for them), ’s can also be chosen common since and do not depend on i.
Remark 3. Under the assumptions of Theorem 2, the statement (i) of Theorem 1 can be specified: the pitch angle of the ith UAV never exceeds in absolute value. (Here is well-defined due to the first inequality in (25).) Meanwhile, the above recommendations on the choice of the controller parameters are not violated by decreasing the coefficient . By using this, the controller can be tuned so that not only the statement of Theorem 1 is true but also the pitch angle of every UAV is always within a given bound, which can be chosen as small as desired. This observation is of interest whenever large pitch angles are unwelcome, unacceptable, or challenge the employed model (2).
7. Computer Simulation Tests
The numerical values of the basic parameters used in the tests are as follows:
| | | | , |
❢
| s | | m | ❢ |
m | m | |
❢
| | | | |
Here ❢ is the unit of measurement of the field value
f. Zero-mean Gaussian additive noises corrupt the measurements of this value and the altitude
h with the standard deviation of
❢ and
m, respectively. No noise reduction techniques were applied, and the simplest two-point Newton’s quotient
was employed to assess the time derivative
in (9). The simulations were performed in MATLAB. In
Figure 4,
Figure 5,
Figure 6 and
Figure 7, the robots, their paths, and the targeted isosurface are depicted in green, blue, and gray, respectively; obsolete parts of the paths are erased; the targeted isosurface is treated as opaque. In fact, what is depicted is not the targeted isosurface but a close one; otherwise, the UAV’s path would seem overly dashed due to invisible portions that appear whenever the UAV dives, even slightly, behind the isosurface. Multimedia of all tests are available at
https://cutt.ly/fTZsmjC, (accessed on 20 October 2021).
Figure 4 illustrates an experiment where an environmental field slowly translates along the
x-axis so that its isosurfaces retain their form and size. Among the purposes of this experiment, there is complementing Theorems 1 and 2 via testing the capacity of the control law to cope with the non-smoothness of the field and isosurface. Specifically, the cross-like isosurface from
Figure 4a is not smooth (and so Assumption 1 violates) on the red curves, where (except for the yellow point) the field gradient abruptly changes when crossing the curve. Initially, ten UAVs are organized into two groups of five; each group is aligned vertically and evenly distributed. Meanwhile, their vertical spacing is far from the desired one (which is associated with the even deployment from
m to
m), and the groups are out of “eye contact” with each other.
By
Figure 4a, the moment of
s can be viewed as when all UAVs localize the isosurface, though with a small degree of approximation, as can be seen in
Figure 4h.
Figure 4i shows that at this moment, the UAVs over-populate the altitudinal range from 40 m to 60 m, which condition is far from the targeted even distribution from 10 m to 190 m. Approximately at this time, all UAVs pass to mode
and so regulation of the altitudes towards the even distribution is commenced according to (10). By
Figure 4d,i, this goal is attained from
s. The outbursts of the field value errors for two UAVs at ≈
s occur when these UAVs have to pass from encircling the horizontal “beam” of the cross to dealing with the vertical one, as can be seen in
Figure 4c. The height regulation module initially drives them upwards so intensively that they find themselves far enough from the targeted isosurface represented by its vertical beam, which is fairly distant from the just traced tip of the horizontal beam. These outbursts are promptly fixed and never repeated, as shown by
Figure 4h. So the discussed episode can be related to the fact that by
s, an even distribution of the altitudes has not yet been achieved, as shown in
Figure 4i. Overall, the control law ensures the attainment of the control objective despite sensor errors and the non-smoothness of the field.
The experiment in
Figure 5 complements Theorems 1 and 2 from another standpoint: the field gradient is not vertical at the red points in
Figure 5a and their antipodes with respect to the centers of the holes. This means violation of (16) in Assumption 2 and implies that the number
K of the connected components of the horizontal section varies as the cutting horizontal plane runs over the altitudinal range of interest (from 30 m to 170 m); four sample sections A, B, C, and D with different
K’s are depicted in light blue in
Figure 5a. The field and the targeted isosurface rotate about the pink axis. As a result, the number
K varies over time for some fixed altitudes, as illustrated in
Figure 5h,i. For example, the red section in
Figure 5i has three components, unlike
Figure 5a, with no more than two components. When keeping both the field value and the altitude constant, any UAV can trace only a single connected component and so has to “select” it from a variety of those (if applicable), e.g., has to select one of the two loops constituting B in
Figure 5a. From time to time, the UAVs are forced to “reselect” because of changes in circumstances, e.g., alterations in
K. Another trouble is highlighted by considering the horizontal cutting plane that gives rise to B in
Figure 5a. As this plane approaches the upper red point from above, the curvature of both B-loops near that point increases without limits. It so exceeds, sooner or later, the maximal turning capacity of the UAVs. Hence, there are horizontal sections that can be traced by no means due to the limited turning radius of the UAVs. Though the proposed algorithm is not intended to handle the described troublesome issues, the experiment in
Figure 5 is aimed to form an initial pre-judgment on the intrinsic potential of this algorithm for reasonably treating them.
Figure 5j,k show that the above extra challenges do not visibly worsen the performance of the control law with respect to the primary goals of finding and sweeping the isosurface and even self-distribution of the team over the altitudinal range. Moreover,
Figure 5b–g provides evidence that the algorithm manages to attend all “simple” components of the topologically complex surface: two UAVs circumnavigate the “half-donut” B1 from
Figure 5f, one UAV encircles B2, another UAV encircles C1, and two more UAVs circumnavigate the “half-donut” C2, whereas the remaining four UAVs go around the central part of the eight-shaped surface. This trait of “attending all parts” may also be identified at the other stages of the experiment, albeit to various degrees. In
Figure 5j, the splash of the field error for a UAV at ≈
s is due to being too close to a point with an excessive “curvature demand”, as is described in the previous paragraph. However, this splash is promptly fixed and never repeats within the duration of the experiment. Overall, this experiment shows that the algorithm more or less satisfactorily copes with the above extra challenges.
Figure 6 is concerned with an experiment whose purpose is to test the algorithm’s robustness against failures of the team members and its performance when dealing with a deforming isosurface. This isosurface has the form of a curved tube. The tube performs oscillatory displacements along the
x-axis, alternates increasing and decreasing in size, and reshapes, e.g., changes the number of the “waves on the surface”, becomes a right cylinder or a “bottle” at some times, etc. The number of the UAVs is increased up to 20; the targeted altitudinal range is from 0 m to 200 m. Starting from the initial deployment shown in
Figure 6a, all UAVs individually reach the targeted isosurface as early as at ≈
s, according to
Figure 6b,i. By
Figure 6c,j, an even self-distribution over the altitudes is achieved later at ≈
s. Then at
s and
s, a group of five UAVs is withdrawn from the mission (and their color is changed from green to red), whereas the remaining peers continue to run the algorithm “as usual” with ignoring the missed members. As can be inferred from
Figure 6d,e,j, these peers need only ≈
s to rebuild the even distribution with lesser team size. For the missing episode at
s, this entails a slight temporal impairment in the field tracking performance, which is fixed for ≈
s, as shown in
Figure 6i.
Overall, the algorithm exhibits robustness to failures of the team members in a sophisticated scenario with a deforming and moving isosurface.
The last experiment tests the capacity of the algorithm to automatically manage the admission of new team members (
newcomers). The deforming isosurface from the previous test is handled, though without displacement along the
x-axis. The team consists of 10 members initially. Five extra members appear on stage at
s and then another five at
s, as shown in
Figure 7. Meanwhile, the algorithm is run “as usual” at both newcomers and “oldies”, taking into account the UAVs currently present at the stage.
As follows from
Figure 7f, the UAVs autonomously rebuild an even distribution over the altitudes for ≈
s in both events of admission. By
Figure 7e, the second admission implies detrimental effects in terms of the field value. However, they are minor in value and are overcome for ≈
s. This demonstrates the algorithm’s capacity to incorporate extra UAVs in the team on the fly.
8. Conclusions and Future Work
This study aimed to design and analyze a distributed navigation and collision avoidance strategy for a team of UAVs traveling in a 3D environment. The strategy enables the team to first find the isosurface where an unknown and unpredictably varying scalar field assumes a given value and then form the vertically densest net-like barrier horizontally sweeping the isosurface. Among the complicating factors was the lack of access to the field gradient, absence of communication facilities, non-holonomy, under-actuation, and a finite control range of the UAVs. It was shown that even in such circumstances, the mission could be solved by a computationally inexpensive strategy justified by a mathematically rigorous global convergence theorem. Computer simulation tests confirmed the convergence and performance of the algorithm.
The algorithm is individually executed by each UAV and consists of two stages (operating modes). The main objective of the first and second stages is to find and arrive at the isosurface and, respectively, to track and circumnavigate it while distributing the team into the vertically densest net. The proposed regulation rule conforms to the sliding mode control paradigm at any stage. This paradigm has attracted significant interest from industry and academia thanks to well-known benefits such as high insensitivity to disturbances and noises, robustness against uncertainties, good dynamic response, and simple implementation (we refer the reader to [
43,
44,
45,
46,
47,
48,
49] for a survey). The major problem with the practical implementation of sliding-mode controllers is identified as the possibility of a chattering phenomenon. The ever-increasing popularity of the sliding-mode approach to motion control is partly due to the development of rather effective general techniques of chattering elimination and suppression; see, e.g., [
45,
50,
51] for a survey. Among them, there is a smooth approximation of the discontinuous controller using low-pass filters, adaptive controllers, and higher-order sliding modes. Whether the harmful chattering is encountered when implementing the proposed controller, it can be subjected to treatment via these methods. Their practical effectiveness has been widely reported, whereas the phenomenon does not necessarily occur in experiments with real mobile robots. Some examples are reported in, e.g., [
46,
52], where control laws that are similar in some respects to the law proposed in this paper are considered.
A fairly common approach to the design of control systems implements the idea of a two-level hierarchical structure, where a kinematic-level controller generates a reference signal to be tracked by low-level controllers. The findings of this paper are concerned with the first stage and are based on a model of the UAV’s kinematics. Implementation issues concerned with the second stage and controllers somewhat similar to that from this paper are addressed in, e.g., [
46,
53].
Future work includes an extension of the findings of this paper to the case where along with all the previous control goals, the UAVs should be “horizontally synchronized” in some sense, e.g., should all be ultimately contained by a common vertical moving plane. Consideration of more sophisticated isosurfaces and models of UAVs is also on the agenda.