1. Introduction
Swarm robotics represents an innovative approach to coordinating large numbers of robots, drawing inspiration from the collective behaviors observed in social insects [
1]. It represents the application of Swarm Intelligence (SI) within Multi-Robot Systems (MRSs). This paradigm emphasizes the significance of physical embodiment and realistic interactions among robots and their environment. Swarm robotics is characterized by the emergence of synchronized behaviors at the system level, even when individual agents may be relatively limited, centralized coordination is absent, and interactions are kept simple [
2].
Multi-robot exploration involves deploying a group of robots with the task of searching for multiple targets dispersed in an unknown environment. In such environments, employing a random walk strategy presents a viable choice as it offers the potential to discover more targets. However, this approach can be time-consuming, as robots may repeatedly cover already explored areas. Moreover, there is a risk that robots may not reach the most distant areas, thereby reducing the likelihood of finding additional targets. A widespread dispersion of robots facilitates the exploration of more areas, thereby increasing the chances of locating more targets. Another alternative is distributing the search areas among robots, enabling rapid exploration of environments. However, achieving this in swarms of robots without direct communication can be challenging.
The practical implementation of multi-target search and foraging algorithms poses challenges in swarm robotics, as there remains a disparity between proposed algorithms and their actual deployment. One contributing factor to this disparity could be the communication methods employed among robots. For instance, while stigmergic communication is often proposed, its real-world application is often limited by cost considerations. In contrast, light-based communication has emerged as a promising alternative. This method entails wireless data transmission through light, where Light-Emitting Diodes (LEDs) emit light pulses modulated with data. Light-based communication facilitates the transmission of various data types, including position information, sensor data, and commands. Compared to radio-based communication, light-based communication offers several advantages in the context of swarm robotics search tasks:
- 1.
It avoids bandwidth problems compared to radio-frequency communication.
- 2.
It offers a more reliable and interference-free communication channel compared to other electronic devices and radio signals.
- 3.
It is an energy-efficient communication mechanism designed specifically for robots with limited power. This will expand the life of the robots and thus provide longer mission duration.
- 4.
The signals can be directed toward specific robots. This will help in improving communication efficiency and reduce interference.
- 5.
Light-based communication can scale to large swarm sizes without performance degradation.
In this paper, we propose two lightweight multi-target search algorithms named Random Bounce Firefly Algorithm (RBFA) and Global Lawnmower Firefly Algorithm (GLFA). These algorithms integrate inverse light-based behavior with two random walks: random bounce and global lawnmower. Light-based communication is used here to realize the following three main objectives:
- 1.
Ensuring robots disperse widely throughout their environment to maximize object discovery;
- 2.
Implicitly partitioning the search space among robots through indirect communication;
- 3.
Ensuring effective local exploration.
The proposed algorithms are implemented in the multi-robots ArGOS simulator. The obtained results are compared with two other related algorithms (LFA [
3] and the original Firefly algorithm (FF) [
4]). The comparisons are promising and prove the superiority of the proposed algorithms in some simulation scenarios.
The remainder of the paper is structured as follows: we present related works and background algorithms in
Section 2. In
Section 3, we introduce the finite state machine and provide the pseudo-code for the proposed algorithms. Following this, we present and discuss the experimental results in
Section 4.
Section 5 highlights real-world applications and limitations, while
Section 6 offers insights into implementing the proposed algorithms with real robots. Finally, the paper concludes in
Section 7.
2. Related Works
In multi-robot exploration, it is crucial for the robots to cover various search areas to explore the environment and create a useful map effectively. This requirement applies to diverse fields, including industrial applications, like automated lawnmowers or vacuum cleaners, military operations, such as mine clearance, and humanitarian efforts like search and rescue operations [
5]. This section summarizes relevant works related to multi-target search and compares them with our proposal.
The paper by Palmieri et al. [
6] presents two multi-robot algorithms: “Firefly-based Team Strategy for Robot Recruitment (FTS-RR)” and “Ant-based Team Strategy for Robot Recruitment (ATS-RR)”. These algorithms aim to address the challenge of locating and disarming distributed mines. FTS-RR combines Ant Colony Optimization (ACO) and Firefly (FF) algorithms, while ATS-RR relies solely on ACO. In this strategy, robots disperse into cells and independently explore using ACO, leaving pheromones on visited cells. Upon detecting a mine, robots are mobilized for disarmament. The detecting robot acts as a firefly, attracting others based on intensity values influenced by distance. Additionally, a cooperating robot becomes a coordinator, attracting robots with different pheromones to guide them to mine locations. Simulations show that FTS-RR outperforms in terms of mission completion time, cell visits, and overall robot distribution, leading to more efficient disarmament times.
Dimidov et al. [
7] conducted a comprehensive study on random walk models for Kilobots searching for a static target under various environmental conditions. The research focused on a swarm of robots searching for a single static target, with the N robots varying from 10 to 100. Communication among robots was limited to a radius of 10 cm, and a multi-agent simulation abstracted the kilobots’ physical details. Experiments in a confined 90 cm square arena revealed that the Coherent Random Walk (CRW) with high persistence was most effective in bounded spaces, while the Levy Walk (LW) performed poorly due to collisions. LW emerged as the optimal strategy in unlimited space, with some movement correlation providing an advantage. Additionally, the distance from the target to the central location significantly influenced system performance, with LW demonstrating scalability in information dissemination.
The authors in [
8] propose a hybrid approach that integrates the
FTS-RR and
ATS-RR algorithms [
6] to tackle the challenge of mine detection and disarmament using a robotic team. Compared to other algorithms like
PSO and
ATS-RR,
FTS-RR has exhibited superior performance. Experiments conducted by the authors using
FTS-RR in a Java-based grid environment involved varying parameters, such as the number of targets, robots, and the size of the environment. The findings emphasize the importance of parameter values, as task complexity increases with larger environments and more mines. Specifically, achieving a balance between the attractiveness of the fireflies and introducing random movements is crucial. This balance enhances robot distribution, mitigates repetition in specific areas, and reduces the time required to complete all tasks.
In Katada et al. [
9], the authors tackle the challenge of multi-target detection in enclosed environments using two exploration algorithms: the conventional Random Walk (
RW) and Lévy Flight (
LF). Their proposed subsumption architecture comprises three layers: transmission, obstacle avoidance, and target exploration. The authors conducted experiments with three prototypes: random walk with step sizes of
1 (RN2) and
6 (RN6), and
Lévy Flight (LF6).
LF6 involves two phases: forward movement and rotation, with step sizes determined by a Lévy probability distribution. The target, an infrared emitter ball, is placed in the lower left corner, while a wireless station is positioned in the upper right corner. The robots commence from a fixed initial position near the station and are oriented randomly. The target is located 80 m away, and robots must maintain connectivity with the station. Experiments involved varying the number of robots (10, 15, 20) and concluded when the station receives a message or after 1800 s without a message. The exploration strategy changed among
RN2, RN6, and LF6, with ten experiments for each configuration. The success rate was notably high, with
LF6 demonstrating the best performance among the explored strategies.
Palmieri et al. [
10] conducted a comparative analysis of hybridized algorithms for mine detection and disarmament in a distributed environment. They compared the hybridization of
Ant Colony Optimization (ACO) and
Firefly algorithm (FF) with
Particle Swarm Optimization (PSO) and
Artificial Bee Colony (ABC). The three hybrid proposals
(ACO-FF, ACO-PSO, and ACO-ABC) were evaluated using a Java-based simulator with robots having limited energy stores.
ACO-FF uses a combination of
ACO and
FF for search and collaboration in mine disarmament, as previously explained in [
6]. A coordination mechanism was introduced in
ACO-ABC, where robots employed
ABCs to determine movement decisions when receiving multiple requests.
ACO-PSO uses the
PSO algorithm for mine disarmament upon detection. Simulations revealed higher energy consumption with the
PSO approach, especially with small swarm sizes and high task complexity.
FF and
ABC methods were compatible with less complex tasks, but differences became apparent with more targets and smaller robot swarms. In complex tasks, the coordination mechanism became more intricate, and the FF-based strategy outperformed in terms of performance.
The authors in [
11] focus on the application of the algorithms proposed in [
6] in simulated environments and compare them to the
PSO algorithm. The authors assessed the solution’s quality by analyzing a specific performance metric—the relative error indicating the number of accesses in the cells, providing a measure of the efficiency with which the robots are distributed in the area. The simulations show that the relative error is low by applying the ACO and the FF algorithms together.
In Suarez et al. [
12], the authors present a new exploration strategy inspired by bats, specifically microbats that use sonar for navigation in the dark. This algorithm involves a prime number of individuals (
P) representing bats, each assigned a location
and a velocity
in the search space. The algorithm initializes these variables randomly and calculates each bat’s pulse frequency and volume. The swarm evolves through generations, each representing a temporal state, until reaching the maximum number of generations
. New frequency, position, and speed are calculated for each generation
g and each bat. The current best solution is determined by evaluating the objective function for all bats, and local solutions are selected based on specific criteria. The algorithm then intensifies the search through a local random walk, perturbing the chosen solution from the best current solutions. Experimental results demonstrate effectiveness, particularly in scenarios involving challenges like complex configurations (e.g.,
U and
V shapes), traffic jams, or narrow passages. The authors highlight the adaptability of the bat-inspired strategy in navigating challenging environments.
In Santos et al. [
13], the authors introduce a novel exploration strategy utilizing a continuous multi-modal utility function and metaheuristic optimization. They frame the exploration problem as an optimization task, generating data for a
Firefly-based algorithm (FF) tasked with searching solution space cells to construct a map. The environment is represented as a
two-dimensional objective function, with minimum energy points as potential robot targets. A metaheuristic, based on
FF, seeks to minimize the objective function, directing the robot towards optimal targets. The exploration task includes searching boundaries for maximum information gain, devising collision-minimizing paths based on map characteristics, and avoiding previously explored areas to minimize redundancy. These directives are weighted in a behavior, defining the objective function as the weighted sum of their component functions for position selection. Considering computational efficiency,
FF is utilized for objective function minimization. The strategy is implemented on the
V-REP robotic platform under the
ROS (Robot Operating System). Tests conducted in 10 m environments utilize conditional, linear, and propositional loss metrics, with propositional loss effectively guiding time and trajectory length exploration.
Abuomar and Al-Aubidy propose [
14] an enhanced swarm robotics-based search and rescue algorithm, a refinement of the Binary Dragonfly Optimization Algorithm (
BDA). This improved version, the Robotic Binary Dragonfly Algorithm (
RBDA), integrates two additional behaviors: obstacle avoidance and communication. The original
BDA includes five fundamental behaviors: obstacle avoidance, speed matching, approaching a center, attraction to an area of interest, and fleeing from negative influences. The
RBDA algorithm is tailored to adapt to environments with obstacles and communication constraints through adjustments to the step vector equation. Evaluation is performed using Benchmark functions (
sphere, Rosenbrock, De Jong, Grie Wank, Rastrifin, Ackliu) and compared against the original
BDA. Results reveal that
RBDA minimizes error and converges more effectively to optimal solutions. Simulations are executed using the SIMROBOT virtual Toolbox in Matlab, with a simulated environment spanning
300 × 300 m and featuring randomly distributed obstacles. The number of robots ranges from
5 to
15, with a victim robot randomly placed.
RBDA displays enhanced performance in achieving optimal solutions, with faster convergence times observed as the number of robots increases.
The authors of [
15] address the challenge of a multi-robot target search. They improve the
ATS-RR strategy (explained in [
6]) by integrating a nature-inspired distributed wireless communication protocol. This protocol facilitates on-site recruitment of the required robot, minimizing traffic. When a robot detects a target, it sends message announcements among the robots. Simulation results demonstrate reduced time compared to the
Inverse Ant System-Based Surveillance System (IAS-SS). Moreover, increasing the number of targets prolongs the convergence time.
In Khaluf et al. [
16], the
Collective Lévy Walk (CLW) algorithm is introduced for robot swarms, utilizing local communication to create a collective Lévy walk pattern. The algorithm prioritizes longer steps in robot trajectories by exchanging information between robots.
CLW is a deterministic automaton where robots start exploration in a running state, moving at a constant linear speed while determining the duration of their next step (
TL) from a Lévy distribution. Upon completing the
TL interval, robots enter a spinning state, rotating at a
constant angular velocity (TU) sampled from a uniform distribution. If an obstacle is detected via proximity sensors, a walking robot switches to collision avoidance behavior, rotating based on the obstacle’s distance, and then returns to the walking state after avoidance, sampling a new
TL interval to continue the Lévy walk. The algorithm’s crucial aspect is utilizing communication among robots to generate a collective Lévy walk, where each robot broadcasts its sampled
TL to local neighbors within its communication range and sector of vision. Steps are categorized as short or long based on a predefined threshold (
F). Simulations conducted in a
20 × 20 m² arena using the ARGoS simulator demonstrate the algorithm’s efficacy. Results, averaged over
30 executions lasting
5000 time steps each, show
CLW’s ability to generate collective trajectories even for larger swarm sizes. Simulation parameters include an exponent (
) set to
1, robot communication range of
1.35 m, linear speed of
5 m/s, and a step threshold (
F) set to
1.53 m (with
0.17 m as the simulated robot’s diameter).
In Huang et al. [
17], the authors focus on implementing an exploration scenario involving a swarm of heterogeneous robots capable of detecting various types of targets simulating different radiation levels. The study investigates the influence of experimental parameters on swarm efficiency using the
Mona simulation platform. The exploration algorithm comprises three steps: (1) random walk, where the robot continuously explores the environment at a constant speed using its sensory system; (2) obstacle detection, where the robot adjusts motor speeds based on sensors to change direction and continues the random walk upon detecting obstacles; (3) target detection, where the robot halts to investigate detected targets, records their relative positions, and resumes random walking. Simulations involve preventive radiation, petroleum, or oil drums in
V-REP, each with distinct radiation levels requiring separate detection. Three versions of the Mona robot are designed, each capable of detecting a specific type with different colors. Performance evaluation is based on the total number of target detections and the time to locate all targets. Two simulation scenarios explore the impact of increasing robot numbers and changes in environmental configurations. Results indicate a slight increase in the total number of detected targets with more robots. Statistical analysis using
ANOVA reveals that environmental complexity does not significantly affect performance.
In Pang et al. [
18], the authors introduce an advanced random exploration strategy where robots adjust their step sizes dynamically based on the estimated density of robots in specific areas. This adaptive approach aims to evenly distribute robots in the environment, thereby enhancing exploration efficiency. The study includes both computer simulations using the
Webots simulator and experiments with real robots (
e-puck). Results from simulations demonstrate the superiority of the proposed method over existing strategies such as
Brownian Walk (BW) and
Lévy Flight (LF) in terms of coverage ratio. Real experiments also show the effectiveness of the proposed method compared to
BW and
LF, measured by the number of found objects.
In [
19], the authors introduce a novel multi-robot exploration strategy named
Distributed Particle Swarm Optimization (DPSO) designed for search and rescue operations. Their focus lies in enabling a swarm of robots to effectively navigate the search space, avoid collisions and obstacles, and locate victims. The
DPSO algorithm incorporates two primary functions: the artificial potential function, which attracts forces to unknown areas and victims, and the repulsion forces function, aiding in collision avoidance. The repulsion function comprises intra and inter-repulsion forces. The experiments were conducted by implementing a multi-agent model and environment using Python and
VRep. Results from the experiments validate that the proposed
DPSO algorithm assists robots in navigating away from local minima, enabling them to discover alternative paths through disaster scenarios to reach optimal solutions.
In [
20], a search strategy named Velocity-inspired Robotic Fruit Fly Algorithm (VRFA) is introduced for expediting victim search and enabling real-time assessment and management of search and rescue operations. This strategy combines the
Fruit Fly Optimization algorithm (FOA) for searching and tracking multiple targets and the
Particle Swarm Optimization (PSO) algorithm for updating the position and velocity of fruit flies. Each independent robot engages in four activities: local recording and data processing, inter-robot data sharing, development of a movement plan, and creation of timelines for executing movement. The algorithm is evaluated alongside other techniques such as
PSO, FOA, and
BSO, using two cooperation strategies (centralized and decentralized) for static and dynamic targets. The study investigates the impact of the number of targets, environment complexity, and the number of robots on system performance. Results indicate that decentralized cooperation enhances performance, with
VRFA demonstrating the best average performance in reducing search and rescue time.
In [
21], a novel neural network algorithm using the
Artificial Bee Colony (ABC) method is introduced to tackle the challenges of full-coverage path planning in search and rescue operations. This algorithm aims to comprehensively cover the area of interest within a constrained time frame and autonomously navigate around obstacles in unfamiliar terrains. The neural network takes inputs about obstacles and coverage in five directions and produces the speed of the left and right wheels as output. By leveraging the advantages of the
ABC algorithm, such as increased probability of finding the optimal solution and system robustness, the authors optimize the parameters of the neural network, enhancing the overall training efficiency and effectiveness. Performance evaluation of the algorithm is based on an evaluation function divided into three parts: coverage rate, path repetition rate, and failure rate. Experimental results show that integrating the
ABC method with a neural network path planning algorithm effectively guides rescue robots to plan comprehensive coverage paths. Additionally, this approach exhibits high robustness across diverse environments.
Table 1 has been populated to highlight all the features of the reviewed work. According to such a table, several observations can be made:
The majority of the studies used an obstacle-free environment.
Most of the studies employed static targets with random distribution.
Only a few studies fixed the position of the robots, such as [
18], which followed both central and random strategies; [
9], where the robots had predefined positions; and [
14], which followed the random strategy.
Half of the studies considered unlimited robot energy, while the other half considered limited energy.
The majority of the studies considered limited robot perception.
All the studies used homogeneous robots, most of which used without memory.
Most of the studies employed bio-inspired algorithms as exploration strategies and direct and indirect communication behaviors for the robots, with the majority also utilizing collaborative work.
The experiments were tested either in the real world, such as [
16] that used e-puck robots and [
22], or in simulations on platforms like ARGoS, Player/Stage, Gazbo, etc., and other custom-developed platforms.
The studies using the FF algorithm utilize its attractive light behavior. None of the studies based on the firefly algorithm considered light to repel the robots.
Previous works have primarily employed the Firefly (FF) algorithm to attract robots to regions of interest. To the best of our knowledge, our study marks the first and only instance where the FF algorithm is utilized to repel robots from each other within covered areas. Upon discovering a resource region, a robot emits light to deter other robots from approaching. This repulsion mechanism ensures several benefits: (1) Implicit distribution of robots across the search space, enhancing the likelihood of discovering additional regions; (2) prevention of congestion and collisions in search areas; (3) expedited exploration of local regions, facilitated by the algorithm’s utilization of spiral search patterns.