1. Introduction
Modern research on multi-agent systems covers various aspects, including task allocation, communication protocols, coordination strategies, obstacle avoidance, etc. Each of these aspects plays a crucial role in the overall functionality and performance of the multi-agent systems. In this paper, we cover obstacle avoidance and coverage control for multi-agent systems. These aspects are important for maintaining system efficiency in complex environments.
In recent years, there has been a growing focus on research and studies related to multi-agent coverage control. This evolution has come from the idea of a single robot to the collaboration of many agents in the environment [
1]. The main purpose of multi-agent coverage control is to direct the agents to cover a certain area in space, whether it is 2D or 3D. This problem is especially important in applications such as security, surveillance, and search and rescue in disaster areas [
2]. An important application of multi-agent management is the monitoring and protection of certain areas through the use of mobile agents equipped with sensors. Additionally, multi-agent systems can also benefit from the integration of tactile sensors such as those in [
3,
4], which can enhance their sensory capabilities and improve their interaction with complex environments. The benefits of employing multi-agent coverage control methods include increased robustness, scalability, efficiency, and redundancy.
Recent advancements in computational neural network algorithms have further enriched the field of control and navigation in multi-agent systems. Notable methods include observer-based adaptive fuzzy finite-time attitude control for quadrotor UAVs [
5,
6] and anti-saturation fixed-time attitude tracking control for uncertain quadrotor UAVs with external disturbances [
7]. These approaches offer promising solutions for managing computational burdens and enhancing the system’s performance in real-time applications. For instance, observer-based adaptive fuzzy finite-time control leverages neural network techniques to achieve precise and efficient control, while anti-saturation fixed-time control addresses the challenges of computational load and external disturbances, ensuring a robust and reliable performance in practical systems.
Managing the coverage of multiple agents is a complex task, especially in dynamic environments with varying density functions [
8,
9,
10,
11,
12,
13]. Agents in such environments need to adjust their positioning to ensure effective coverage, taking into account changing factors such as moving obstacles, evolving terrain conditions, and fluctuations in environmental variables like temperature, humidity, and pollution levels. Additionally, the density function reflects the spatial distribution or concentration of a specific attribute or entity within the environment. For instance, in surveillance scenarios, the density function might indicate the probability of activity or events happening in different areas.
Controllers available in the literature [
8,
9,
10] can adapt to changing environments with varying density functions and in [
11,
12,
13] address systems with uncertainties. These algorithms are based on the concept of CVT, where agents are represented as points and confined within cells to prevent collisions. However, collisions may still occur when these algorithms are applied to real agents, even if the agents are designed to have zero radius, which is not realistic. To solve this problem, [
14,
15,
16] suggests several approaches based on the buffered Voronoi cell. The buffered Voronoi cell incorporates a safety radius to adjust the edge of each Voronoi cell, ensuring that collisions are avoided as long as the agents remain within their cells. However, this modification also involves path planning and tracking at each step. The effectiveness of this approach in avoiding collisions in the presence of system uncertainties has not been tested. These uncertainties can potentially cause agents to deviate from the BVC and result in collisions. Successful obstacle avoidance can be achieved when an agent gathers information about its surroundings, combines this information to create a symbolic model of the environment, and then uses this model to make informed decisions. Based on this model, the agent can autonomously plan a path to avoid obstacles [
17].
The field of obstacle avoidance algorithms has been extensively researched and developed due to its importance in various applications. However, many algorithms have significant limitations when obstacles are unknown. Traditional obstacle avoidance algorithm methods include the bug algorithm, potential field algorithm, and vector field algorithm [
18,
19,
20], while intelligent obstacle avoidance algorithm methods include the genetic algorithm, fuzzy logic algorithm, and neural network algorithm [
21,
22,
23,
24].
The bug algorithm is the simplest obstacle avoidance algorithm, involving finding an obstacle and walking around its outline to bypass it [
19]. The most notable bug algorithms include Bug1, Bug2, and Tangent Bug. Bug1 operates by having the robot move directly towards its goal until it encounters an obstacle, at which point it follows the obstacle’s boundary until it finds the closest point to the goal before continuing. Bug2 simplifies this by having the robot leave the obstacle boundary at the first intersection point with the line connecting the start and the goal, resuming its direct path towards the goal. Tangent Bug enhances the process by utilizing range sensors to detect obstacles and build a local map, allowing the robot to follow tangent lines along obstacles and dynamically re-plan its path based on real-time sensor data. These algorithms are characterized by their simplicity, sensor-based operation, and completeness, guaranteeing path finding if a path exists under ideal conditions. These algorithms are easy to implement but do not consider robot dynamics and are not reliable in complex environments.
The potential field algorithm is a widely used approach in robotic motion planning for obstacle avoidance, representing the environment as a potential field where the goal exerts an attractive force and obstacles exert repulsive forces. The robot navigates by following the resultant force vector, which is the sum of these attractive and repulsive forces, guiding it towards the goal while avoiding obstacles. The attractive potential pulls the robot towards the goal, increasing with distance, while the repulsive potential pushes the robot away from obstacles, becoming stronger as the robot approaches them. Mathematically, the attractive force is proportional to the distance to the goal and the repulsive force is inversely proportional to the distance to the obstacle within a certain range. The potential field algorithm is simple to implement and efficient for real-time applications, providing smooth and continuous paths for the robot. However, it can suffer from local minima, where the robot gets stuck in non-goal positions with zero resultant force and may exhibit oscillations around obstacles [
20].
The vector field histogram is an advanced real-time obstacle avoidance algorithm used in mobile robotics, designed to navigate around obstacles while moving toward a target. It works by creating a polar histogram that represents the robot’s immediate surroundings, where each bin in the histogram corresponds to a specific direction and contains the density of obstacles in that direction. The algorithm involves three main steps: first, it constructs a histogram grid from sensor readings to represent the environment’s occupancy; second, it reduces this grid to a one-dimensional polar histogram centered on the robot, reflecting the density of obstacles at various angles; third, it identifies candidate directions with low obstacle densities and selects the most suitable direction based on the goal direction and the robot’s dynamic constraints. The vector field histogram algorithm is robust and allows the robot to make quick, reactive decisions to avoid obstacles, balancing the need to reach the goal efficiently with the need to navigate safely. Its ability to dynamically adjust the robot’s path in real-time makes it particularly suitable for complex and cluttered environments. However, it requires careful tuning of parameters and may struggle with very tight spaces or highly dynamic environments [
21].
The genetic algorithm is an optimization technique inspired by natural selection, used for solving complex problems like obstacle avoidance in robotic motion planning. Over successive generations, the algorithm converges towards an optimal or near-optimal path that effectively avoids obstacles while efficiently guiding the robot to its destination. Genetic algorithms are advantageous for their ability to search large and complex solution spaces without requiring gradient information. However, they can be computationally intensive and may require the careful tuning of parameters such as population size, mutation rate, and crossover rate to balance exploration and exploitation. The genetic algorithm is slow in terms of speed [
22].
A fuzzy algorithm for obstacle avoidance in robotics leverages fuzzy logic to handle uncertainties and imprecise information about the environment, providing a flexible and robust approach to navigation. In this method, the robot’s sensors collect data about its surroundings, such as the distance to obstacles and the direction of the goal. This sensory information is then fuzzified into linguistic variables like “close”, “medium”, and “far” for distances, or “left”, “straight”, and “right” for directions. These variables are processed using a set of fuzzy rules designed to mimic human reasoning, such as “if an obstacle is close and to the right, then turn left”. The advantage of a fuzzy algorithm is its ability to smoothly handle the inherent uncertainties and ambiguities in sensor data, resulting in more adaptive and resilient obstacle avoidance behavior compared to rigid, rule-based systems. However, the fuzzy logic algorithm involves a fuzzy controller and requires a large amount of calculation as the number of obstacles increases [
23].
A neural network algorithm for obstacle avoidance leverages the learning capabilities of neural networks to enable robots to navigate complex environments. This approach involves training a neural network to recognize and react to obstacles using data collected from sensors such as cameras, lidars, or ultrasonic sensors. During the training phase, the neural network learns to map sensor inputs to appropriate movement commands by being exposed to numerous scenarios involving various obstacle configurations. The network’s architecture typically includes multiple layers of neurons that process the sensor data, extract relevant features, and make decisions about the robot’s actions. Once trained, the neural network can generalize from the examples it has seen to handle new, unseen obstacles effectively. This method is particularly advantageous in dynamic and unstructured environments where traditional algorithms might struggle. The neural network can adapt to different types of obstacles and complex terrains, providing robust and flexible obstacle avoidance. However, this approach requires a substantial amount of training data and computational resources. Additionally, the performance of the neural network heavily depends on the quality and diversity of the training data, and it may not guarantee safety in all scenarios, particularly if it encounters situations significantly different from its training set [
24].
Considering these limitations, carefully selecting a better obstacle-avoidance algorithm is essential. The increasing use of multi-agent systems for surveillance and security purposes has raised concerns about safety and efficiency. Traditional approaches often focus on individual agents, neglecting interactions that can lead to collisions or ineffective coverage. The controversy lies in balancing agents to ensure safe and optimal coverage. This study addresses the concern of proposing a collision avoidance algorithm that guarantees safety while in a complex static environment. The CBF-based controller offers guaranteed safety, scalability, and adaptability to complex real-world environments. Implementing a CBF-based controller for obstacle avoidance has been identified as a feasible solution to address the limitations of traditional algorithms in challenging environments. By leveraging the principles of CBF, the controller has demonstrated the potential to improve the reliability and performance of autonomous systems [
25]. Its ability to generate robust and safe trajectories while guaranteeing collision avoidance has made it a promising option for developing advanced navigation systems.
The primary objective of this paper is to explore the combination of CVT and CBFs with obstacle navigation, evaluate the controller through simulations, and validate the outcomes with mobile agents. Integrating CBF-based obstacle avoidance would enable agents to navigate while avoiding collisions with obstacles. Meanwhile, implementing CVT for coverage control would allow agents to efficiently cover the area by dividing it into regions for optimal coverage. This integration of CVT and CBF techniques would improve the agents’ ability to navigate challenging terrain while effectively covering the area. The combination of CBFs with CVT presents a robust approach to ensuring safety and coordination in multi-agent systems. However, integrating these complex methodologies poses challenges, such as algorithm and computational complexity, non-convexity, real-time implementation, and parameter tuning. Yet, this integration offers numerous benefits, including safety assurance, collision avoidance, formation control, uncertainty robustness, global optimality, real-time adaptation, flexible applications, scalability, and compliance with safety standards.
This paper’s first contribution involves developing a safe coverage controller using CBFs with CVT to simultaneously address safety and spatial optimization in multi-agent systems. The combined approach aims to address the challenges of coordinating multiple agents and facilitating formation control that upholds safety constraints. Another contribution is developing and testing the controller in ROS/Gazebo simulators for mobile robots with obstacles. The ROS/Gazebo simulator provides a realistic and complex simulation environment, supporting sensor integration, ROS integration, dynamic environments, and multi-robot simulation, making it an industry-standard tool. Additionally, this paper involves setting up and testing the CVT-CBF-based controller through experiments using RT mobile robots to examine its effectiveness in various scenarios involving obstacles. In summary, this paper presents a promising solution for ensuring safe and efficient mobile robot navigation.
The rest of the paper is organized as follows. In
Section 2, preliminaries about CBF are introduced. In
Section 3, the coverage control problem is stated for multi-agent systems and a corresponding control algorithm is given based on the CBF technique. The simulation tools and experimental setup are explained in
Section 4. The structure of the proposed framework with the flowchart is explained in
Section 5. A comparison case study is given in
Section 6. The proposed algorithm is verified under simulation in
Section 7 and under experiments with RT robots in
Section 8. Finally, conclusions are drawn in
Section 9.
3. Controller Design
This section describes the multi-agent security problem and the coverage control based on CBF technology. The aforementioned controller utilizes the CVT algorithm to optimally distribute the agents, effectively solving the coverage control problem. Additionally, it integrates the CVT with the CBF to ensure that the system is able to avoid obstacles with ease.
By the definition of Voronoi tessellations, assuming it is in a two-dimensional case, a Voronoi diagram is a partition of the plane. Every partition carries one generator or more than one generator. Thus, we have the relation
where
. A Voronoi tessellation is formed as
where
indicates the Voronoi region corresponding to the
i-th agent, while
is the partition of the Voronoi tessellation. In addition to Voronoi tessellation, the CVT will generate the center of mass for each Voronoi region.
The Voronoi tessellation also includes density function
, which expresses the relative significance of points in the region and, to calculate how well a given point
is covered by the agent at position
, one defines the locational cost
where
denotes the associated density function, which is assumed to be positive and bounded. This captures the relative importance of a point
[
27].
From (
6), the time derivative of
can be expressed by
Assume that, as the density
changes slowly with respect to time, one has
[
8]. In [
27], it was shown that
where the mass
and the center of mass
of the
i-th Voronoi cell
are defined as
Note that
because the density function
is strictly positive [
28].
At a given time
t, an optimal coverage for the domain
requires a configuration of agents
to minimize
. From (
8), one can see that a critical point is
When (
10) is satisfied, an agent’s network is said to be in a locally optimal coverage configuration [
28]. The corresponding
defines the CVT.
The concept of the controller for CBF is illustrated in
Figure 1. The safety range depicts the safety distance between the agents. The CBF controller is designed to function within the sensing range, which is its designated operating range. The sensing range of agent
i and the safety range of agent
i refers to the distances at which an agent can detect other agents or obstacles in its surroundings.
For the safety controller in this study, we model the agent by the first-order integrator,
where
n is the number of agents,
is the planar position of robot
i, and
is its input velocity. A construction of safe set
where
is defined as a set of a continuous function
where
is the position of the agent
i and
is the position of the obstacle or agent that is closest to agent
i;
is the minimum safety distance of the agents between each other.
We want to enforce conditions on
to guarantee the system’s safety. However, simply enforcing the condition that
at a particular time is not useful since
does not explicitly depend on the control input. The control input influences the state evolution, so we can enforce a safety condition on the time derivative
. To achieve this, we first adapt the definition of barrier functions from
Definition 1 of [
29].
Definition 1 (Nonsmooth Control Barrier Function). A local Lipschitz continuous function is a nonsmooth control barrier function (NCBF) if there exists a measurable control law such that the set is forward invariant for the closed-loop system.
If
h is a local Lipschitz continuous function, it is also absolutely continuous and differentiable almost everywhere; then, the following lemma [
29] can be used to guarantee safety:
Lemma 1. Let be a constant and be an absolutely continuous function. Iffor almost all and , . To represent the admissible control space for the given barrier function
, we need to consider the time derivative of barrier function
. The time derivative can be calculated as
where
is the input velocity.
The admissible control space ensures that the safety function
remains non-negative for all time. Therefore, the admissible control
is defined as
Combining (
5) and (
16), the velocity input
needs to satisfy
This inequality can be treated as a linear constraint on
when the state
is given, i.e., by selecting
which becomes
The CBF-based obstacle avoidance minimizes the difference between the user’s desired control signal and the actual control signal using a quadratic program (QP) controller
where
such that
where
is a nominal input and
is the actual control command or actual input. The nominal input
can be interpreted as the desired or reference value for the control input in each dimension. It serves as a target or baseline for the actual control input
and is compared against the cost function. The goal of minimizing this cost function is often to bring the actual control input as close as possible to the desired or nominal input. The linear constraints
ensure safety and are derived from the safety conditions and admissible control space considerations. This QP formulation aims to find the control inputs
for all agents that minimize the deviation from the user-specified signals while satisfying safety constraints. The solution to this QP provides the optimal control inputs that balance user preferences with safety requirements.
9. Conclusions
This paper introduces safe coverage control algorithms for multi-agent systems that utilize CVT and CBFs. The main goal of this paper has been achieved by integrating the concept of CVT and CBFs with obstacles, testing the controller under simulation, and verifying the results with RT mobile robots.
The proposed approach in this paper has presented the integration of guaranteeing safety provided by CBF with the spatial optimization capabilities of CVT. This integration achieves a synergy that addresses safety and spatial optimization simultaneously in multi-agent systems, where both are crucial. Simulator setups have been built to test the validity of the proposed controller. The proposed controller has been verified through experiments in the ROS Gazebo simulator and with mobile RT robots. The use of CVT- and CBF-based controllers ensures the optimal distribution of robots and obstacle avoidance.
In developing and verifying the proposed algorithm for safe coverage control in multi-agent systems, it is crucial to consider the following constraints and assumptions. Firstly, the communication constraint is addressed by taking into account latency in a setup that involves less than 10 robots. However, it is noted that, as the number of robots increases, latency may become more noticeable. Secondly, the sensing constraint revolves around the accurate positioning data required by the algorithm. While the algorithm assumes ideal coverage, variations in accuracy and a limited field of view could impact the algorithm’s performance in practical scenarios. These considerations are essential for understanding the potential limitations and practical implications of the algorithm in real-world applications.
These findings provide a better understanding of the integration of CVT and CBF in the scope of coverage control with obstacle avoidance for multi-agent systems. Throughout the simulations carried out in ROS/Gazebo and experiments with RT robots, CVT and CBF have displayed their role in determining safe coverage while taking into account the desired trajectory. In particular, the ROS/Gazebo simulator gave us a clear picture of how CVT and CBF could be implemented. The experiment with RT robots has shown how a collision-free coverage system is achieved in practice despite the size limitations.
As we scale up the system, it is important to consider some potential challenges and limitations. Firstly, our current approach relies on a centralized framework, which may pose scalability challenges as the number of agents increases. The current centralized approach can strain computational resources and cause delays, making real-time decision making challenging. To tackle this issue, a shift from a centralized to a decentralized or distributed control architecture can be implemented. A decentralized approach can distribute the computational load among agents, reducing the dependence on a single central unit and improving scalability. Secondly, our current system uses a centralized camera setup for position tracking with ARTags, but it has limitations when managing a large number of robots due to the limited field of view. To address this, we are considering decentralized solutions commonly used in field robotics. These systems use onboard sensors like LiDAR, GPS, IMUs, and local cameras for tracking and navigation without relying on a central camera system, which can improve coverage and reliability.
In our future work, we plan to explore additional aspects to develop a more comprehensive approach to multi-agent systems. This will include scenarios where the obstacles are equidistant from agents, which may require the design of an extra layer in the control scheme. Furthermore, we will conduct research on the non-convex shape of obstacles in the combined CVT and CBF framework, and transition from a centralized framework to a decentralized framework to enhance the overall efficiency and versatility of the systems which can lead to interesting findings and challenges.