Next Article in Journal
New Supplementary Photography Methods after the Anomalous of Ground Control Points in UAV Structure-from-Motion Photogrammetry
Next Article in Special Issue
The Development of a Visual Tracking System for a Drone to Follow an Omnidirectional Mobile Robot
Previous Article in Journal
Four-Dimensional Gait Surfaces for a Tilt-Rotor—Two Color Map Theorem
Previous Article in Special Issue
Mathematical Modeling and Stability Analysis of Tiltrotor Aircraft
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Multi-Colony Social Learning Approach for the Self-Organization of a Swarm of UAVs

1
Electronic Engineering Department, Sir Syed University of Engineering & Technology, Karachi 75300, Pakistan
2
Department of Computer Science, College of Computers and Information Technology, Taif University, P.O. Box 11099, Taif 21944, Saudi Arabia
3
Department of Computer Sciences, College of Computer and Information Science, Princess Nourah Bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia
*
Author to whom correspondence should be addressed.
Drones 2022, 6(5), 104; https://doi.org/10.3390/drones6050104
Submission received: 11 April 2022 / Accepted: 20 April 2022 / Published: 23 April 2022
(This article belongs to the Special Issue Advances in UAV Detection, Classification and Tracking)

Abstract

:
This research offers an improved method for the self-organization of a swarm of UAVs based on a social learning approach. To start, we use three different colonies and three best members i.e., unmanned aerial vehicles (UAVs) randomly placed in the colonies. This study uses max-min ant colony optimization (MMACO) in conjunction with social learning mechanism to plan the optimized path for an individual colony. Hereinafter, the multi-agent system (MAS) chooses the most optimal UAV as the leader of each colony and the remaining UAVs as agents, which helps to organize the randomly positioned UAVs into three different formations. Afterward, the algorithm synchronizes and connects the three colonies into a swarm and controls it using dynamic leader selection. The major contribution of this study is to hybridize two different approaches to produce a more optimized, efficient, and effective strategy. The results verify that the proposed algorithm completes the given objectives. This study also compares the designed method with the Non-Dominated Sorting Genetic Algorithm II (NSGA-II) to prove that our method offers better convergence and reaches the target using a shorter route than NSGA-II.

1. Introduction

In the last decade, research has been exponentially increasing in the domains of flight control, path planning, and obstacle avoidance of unmanned aerial vehicles (UAVs) [1,2,3]. The analyses get increasingly complex when dealing with multiple UAVs in different formations. The natural behaviors of birds, ants, and fishes have been proven to be significant in formulating successful bio-inspired algorithms for the formation control, route planning, and trajectory tracking of a swarm of multiple UAVs [4,5,6]. Some of the important algorithms inspired by nature include ant colony optimization [7], pigeon-inspired optimization [8], and particle swarm optimization [9].
The primary inspiration for this study is to utilize the knowledge obtained from studying the natural flocking and swarming activities of ants and use them for controlling UAVs. Researchers have used these nature-inspired algorithms for numerous purposes including cooperative path planning of multiple UAVs [10], distributed UAV flocking among obstacles [11], and forest fire fighting missions [12]. We also find multiple studies that hybridize a bio-inspired algorithm with another method to increase its efficiency [13,14,15].
There are many existing solutions regarding the problems of path planning and multi-UAV cooperation. One such research study [16] deals with the inspection of an oilfield using multiple UAVs while avoiding obstacles. The researchers achieve this using an improved version of Non-Dominated Sorting Genetic Algorithm (NSGA). Another existing solution to tackle a multi-objective optimization is addressed in reference [17]. In [17], the researchers use a hybrid of NSGA and local fruit fly optimization to solve interval multi-objective optimization problems. In reference [18], academics use a modified particle swarm optimization algorithm for the dynamic target tracking of multiple UAVs.
Our proposed method consists of many concepts, which are explained as follows:
Ant colony optimization (ACO) is an optimization technique used by ant colonies to find the shortest route to take to get to their food [19]. Using pheromones left behind from earlier ants, the ACO mimics ants looking for food. The path used by the most ants contains the most pheromones, which aids the next ant in choosing the shortest way [20]. The ACO technique is used in a variety of applications, such as the routing of autonomous vehicles and robots, which need to find the shortest path to a destination, and the design of computer algorithms, which need to find the optimal solution for a given problem.
Sometimes, however, the ACO is slow to converge and falls into the local optimum. To help solve these issues, researchers introduced a modified version of ACO called max-min ant colony optimization (MMACO). It operates by controlling the maximum and minimum amounts of pheromone that can be left on each possible trial [21]. The range of possible pheromone amounts on each possible route is limited to avoid stagnation in the search process.
In social animals, social learning plays a significant part in behavior learning. Social learning, as opposed to asocial (individual) learning, allows individuals to learn from the actions of others without experiencing the costs of individual trials and errors [22]. That is why this study incorporates social learning mechanisms into MMACO. Unlike traditional MMACO variations, which update ants based on past data, each ant in the proposed SL-MMACO learns from any better ants (called demonstrators) in the present swarm.
Some of the state-of-the-art work in the field of optimization algorithms include research [23] that proposes the use of social learning-based particle swarm optimization (SL-PSO) for integrated circuits manufacturing. The SL-PSO is used to increase the imaging performance in extreme ultraviolet lithography. Results showed that the errors were reduced significantly compared to conventional methods. Similarly, another recent study [24] uses improved ant colony optimization (IACO) for human gait recognition. The IACO is used to enhance the extracted features, which are then passed on to the classifier. Compared with current methods, the IACO technique in [24] is more accurate and takes less time to compute.
A multi-agent system (MAS) is a collection of agents that interact with each other and the environment to achieve a common goal. The main function of the MAS is to tackle issues that a single agent would find difficult to solve. To achieve its objective, another important function of the MAS is to be able to interact with each agent and respond accordingly. In MAS, each agent can determine its state and behavior based on the state and behavior of its neighbors [25]. MAS has multiple uses in fields such as robotics, computer vision, and transportation [26,27,28]. In most MAS scenarios, an external entity is required to direct the agents toward the destination [29].
A leader is an agent that can alter the states of the follower agents. A leader can be outside or inside the MAS or can even be virtual. A large swarm of UAVs can be controlled more efficiently by selecting fewer leaders than follower agents. For example, when we want to control a fleet of UAVs, we can select a few leader agents, let the remaining UAVs follow the leader, and use the leading UAVs to control the state of the system.
The major contributions of this study are as follows:
  • Designing a novel hybrid algorithm that combines MMACO with the social learning mechanism to enhance its performance.
  • Using the designed algorithm in tandem with the MAS and dynamic leader selection to connect the three colonies.
  • Achieving the self-organization of the three colonies and synchronizing them into one swarm.
  • Demonstrating the validity of the designed method by performing computer simulations that mimic real-life scenarios.
The paper is organized into seven sections. Section 1 presents the introduction and the literature review. In Section 2, we break down the problem into three scenarios and describe each one in detail. Section 3 provides the framework of the proposed solution. Section 4 defines the proposed method with each of its constituent parts discussed at length and offers the algorithm and the flowchart. In Section 5, we discuss the simulations and their outcomes. Section 6 presents the conclusion of the study.

2. Research Design

We broke our problem into three different scenarios to help make it easier. In the first scenario, the UAVs are in random positions and then using our proposed algorithm, they organize themselves into a formation. In the second scenario, we navigate the newly organized formations through some obstacles. In the last scenario, we combine the three formations into one swarm and then navigate it through the same environment. Below, we describe each scenario in detail:
Scenario 1:
Figure 1 presents the first scenario. In this scenario, there are three UAVs in three different colonies and the UAVs are placed at random positions within each colony. The environment contains different obstacles like mountains and rough terrain. The goal is to reach the target using the shortest route possible without colliding with other UAVs. The main objective here is to maintain the formation throughout the journey.
Scenario 2:
Figure 2 illustrates the second scenario. In this scenario, there are again three UAVs in three different colonies and the UAVs are placed at random positions within each colony. The environment is also the same. The goal is to reach the target using the shortest route possible without colliding with the obstacles or other UAVs. The main distinction between the first and the second scenario is that, in the first task, we only had to demonstrate the ability of the algorithm to maintain a formation. However, the second task also requires navigating through the obstacles without any collision.
Scenario 3:
Figure 3 illustrates the third scenario. In this scenario, we pick up where the second scenario left off, i.e., the three colonies are now in the desired formations. The environment is the same as in the second scenario. The goal is to first synchronize the three colonies into one big swarm, and then, while maintaining the swarm, reach the target using the shortest route possible without colliding with the obstacles or other UAVs.

3. Solution Architecture

Figure 4 presents the framework of our proposed solution for the aforementioned problems. It is clear from the figure that, initially, the UAVs in the three different colonies are at random positions. Here, we apply the social learning-based max-min ant colony optimization (SL-MMACO) to each colony. SL-MMACO works by first finding the most optimal routes for each colony, and then the social learning mechanism sorts the ants from best to worst. Afterward, the multi-agent system (MAS) appoints the best ant as the leader and the remaining ants as agents. In the next step, we see that all the colonies are now synchronized to avoid any collision between the UAVs. Finally, we connect all three colonies into one big swarm and then select its leader dynamically according to the mission requirements.

4. Proposed Algorithm

This section introduces the different concepts and theories that we used for the development of our proposed algorithm. In this research paper, we are using a graph-based approach.

4.1. Ant Colony Optimization

Here, we are using a graph-based approach. The concept of nodes, edges, and legs is important to understand path planning using ant colony optimization (ACO). Figure 5 presents the relationship between the nodes, edges, and legs. The ACO generates intermediary points between the initial and final positions. These intermediate points are called the nodes. An edge is a link between two nodes. For instance, edge (b,c) is the length from edge b to c, whereas a leg is generated whenever a UAV turns.
Suppose that the mth ant is at node i on time t, the probability of transition can be written as:
p i j m t = τ i j α η i j β c a l l o w e d i τ i c α η i c β
where the probability of transition from node i to node j of the mth ant is p i j m t , the pheromone on the edge (i,   j ) is τ i j t , the transit feasibility from node i to node j is η i j t , the set of nodes that are neighboring i is a l l o w e d i , the constant influencing the τ i j t is α , and the constant influencing the η i j t is β .
After the method begins, the starting pheromone rate varies according to the edges. The pheromone rate is then reset by each ant that generated the result, which starts the next cycle of the process. τ i j t on the edge ( i ,   j ) is:
τ i j t + 1 = 1 ρ × τ i j t + m = 1 k τ i j m t
where the rate of pheromone evaporation is ρ ( 0 ρ 1 ) , total ants are represented by k , and the pheromone rate of the edge i , j is τ i j m t . τ i j m t can be further defined as
τ i j m t = Q / L m ; a n t   m   u s e s   e d g e s   o f i , j 0 ; o t h e r w i s e
where the length of the route built by the mth ant is L m and Q is the constant.

4.2. Max-Min Ant Colony Optimization

We need to improve the traditional method to ensure that the ACO converges quickly. MMACO delivers some remarkable results in this area by restricting the pheromones on each route. To understand MMACO, we must first examine the path’s cost. The average cost of path J a , k t can be given as:
J a , k t = 1 k m = 1 k J a , m t
Note that the mth ant only updates the pheromone when the cost of path of the mth ant in the tth iteration fulfill J a , k t J a , m t .
The MMACO updates the route using Equation (3) after every iteration. After each iteration, the algorithm determines the most optimum and least optimal paths. To improve the probability of discovering the global best route, it discards the least optimum route. As a result, Equation (3) may be revised as follows:
τ i j m t = Q / L o ; r o u t e   i , j   r e f e r   t o   t h e   o p t i m a l   r o u t e Q / L w ; r o u t e   i , j   r e f e r   t o   t h e   w o r s t   r o u t e   0 ; o t h e r w i s e
In the above equation, L O is the most optimal route and L w is the current iteration’s worst route. The quantity of pheromone produced by MMACO is limited to specified values. This limitation aids in accelerating convergence and avoiding stagnation.
The algorithm restricts the pheromone on each route to a specified minimum and maximum value, denoted by τ m i n and τ m a x , respectively. This can be represented mathematically as:
τ i j t = τ m a x ; τ i , j t τ m a x τ i j t ; τ m i n < τ i , j t < τ m i n ; τ i , j t τ m i n t τ m a x

4.3. Social Learning Mechanism

The conduct of a person to learn from their surroundings is referred to as social learning. You should learn not just from the top students in class, but also from students who are better than you. Most biological groups follow the same idea. Initially, for the map and compass process, locations and velocities of the ants are produced at random and are indicated as Xi and Vi (i = 1, 2, …, m). At the next iteration, the new location Xi and velocity Vi are calculated by the formula [30]:
X a N c = X a N c 1 + V a N c
V a N c = V a N c 1 × e R . N c + r a n d × c 1 × ( X m o d X a N c 1 )
c 1 = 1 log N c m
where R represents the map and compass factor, which is between 0 and 1, N c is the current number of iterations, c1 the learning factor, Xmod the demonstrator ant superior to the current ant, and m is the total number of ants. Each follows the ant better than itself, and this is known as the learning behavior. Figure 6 illustrates the process for the selection of demonstrator Xmod.
For the landmark operation, the social behavior occurs when ants from the center are removed, and other ants migrate toward the center. The procedure can be given as:
X c e n t N c 1 = a = 1 N c 1 X a N c 1 N N c 1
X a N c = X a N c 1 + r a n d × c 2 × ( X c e n t N c 1 X a N c 1 )
c 2 = α s N c m
where the social influence factor is c2, and α s is called the social coefficient.

4.4. Multiple Agent Systems

The multi-agent system comprises n independent agents that move with the same absolute velocity. Every agent’s direction is updated to its neighbor’s status. At time t, the neighbors of an agent A (1 ≤ An) are those who are located within a circle of radius r (r > 0) centered on the position of agent a. At time t, the neighbor of the agent a is Na(t),
N a t = b | d a b t < r
where the Pythagorean Theorem can be used to compute d a b t .
The coordinates of the agent at time t are (xa(t), ya(t)), where agent a is a neighbor to agent b. The absolute velocity v (v > 0) of each agent in the system is the same.
x a t + 1 = x a t + v cos θ a t
y a t + 1 = y a t + v sin θ a t
At time t, the heading angle of agent a is θa(t). The following equation is used by the algorithm to update the heading angle:
θ a t + 1 = tan 1 b N a t sin θ b t b N a t cos θ b t
The equation above is used to discover obstructions by examining their surroundings. If a missing node in the neighbor (i.e., a hurdle) exists, the heading angle will be changed to prevent colliding with the obstruction.
We can analyze this algorithm using basic graph theory. Please note that each agent’s neighbors are not always the same. The undirected graph set   G t = { V ,   ε t } is used for agent coordination. Where the set containing every agent is V = {1, 2, ···, N}, and the time-varying edge set is ε t . A graph is connected if any two of its vertices are connected.
Equation (16) can be modified as:
tan θ a t + 1 = b N a t cos θ b t m N a t cos θ m t tan θ a t
To further simplify Equation (17), we use a matrix,
tan θ t + 1 = I t tan θ t
where tan θ t tan θ 1 t ,   , tan θ N t τ . For the graph G t , the weighted average matrix is I t i a b t .
i a b t = cos θ b t m N a t cos θ m t i f   a , b ε t 0 , o t h e r w i s e
For the synchronization, we study the linear model of Equation (16) as follows:
θ a t + 1 = 1 n a t b N a t θ b t
where the number of elements in N a t is n a t . Equation (18) can be rewritten as,
tan θ t + 1 = I ˜ t θ t
where θ t θ 1 t ,   , θ N t τ , and the entries of the matrix I ˜ t are,
i ˜ a b t = 1 n a t , i f   a , b ε t 0 , o t h e r w i s e

4.5. Synchronization and Connectivity

To continue the study of the synchronization of the designed algorithm and the connectivity of the associated neighboring graphs, we should formally describe synchronization. If the headings of each agent match the following criteria, the system will achieve synchronization.
lim t θ a t = θ ,   a = 1 ,     ,   N
where θ varies to the starting values θ a 0 ,   x a 0 ,   y a 0 ,   a = 1 , , N and the system parameters v, and r.
Considering the model in Equations (14)–(16), let   { θ a 0 π 2 , π 2 ,   a = 1 ,     ,   N } , and assume that the neighbor at the start, G 0 = V ,   ε 0 is connected. Therefore, to achieve synchronization, the system will have to satisfy,
v d Δ 0 cos θ ¯ N N
whereas the number of agents is represented by N. Meanwhile,
θ ¯ = max a θ a 0
d = r max a , b ε 0 d a b 0
Δ 0 = max a , b tan θ a 0 tan θ b 0
Considering the models in Equations (14), (15) and (20), let θa(0) ∈ [0, 2π), and let us assume that the starting neighbor graph is connected. Therefore, to achieve synchronization, the system will have to satisfy,
v d 1 N N 2 π
whereas d is the same as described in Equation (26).

4.6. Dynamic Leader Selection

Due to communication problems between agents, the formation structure of multi-agent systems might occasionally change. Considering a random failure of communication, each connection (a,b)∊E fails independently with probability p. Let G ^ be the graph topology and E G ^   t c o n v be the expected convergence time for this model of communication failure, while E G ^ ( ) denotes the argument’s expectation across the group of network structures represented by G ^ . The convergence rate will be maximized by reducing E G ^   t c o n v ). As a result, the formula for choosing a leader k to maximize the convergence rate is as follows:
max Y E G ^ min x 0 x 0 T Y L ^ + L ^ Y x 0
So that it meets the below conditions,
t r Y n k Y a a 0 , 1 a V Y a b = 0 a b
As such, the objective function is in line with the projected convergence rate for potential network topologies. Where min x 0 x 0 T Y L ^ + L ^ Y x 0 is a convex function of Y.

4.7. B-Spline Path Smoothing

The hybrid algorithm’s route consists mostly of a combination of line segments. The B-spline curve method is utilized to ensure the smoothness of the route created. The B-spline method is an improvement over the Bezier approach that preserves the convexity and geometrical invariability.
The B-spline path smoothing can be written as:
P u = i = 0 n d i N i j u
Considering Equation (31), d i i = 0 , 1 , , n are control points, and N i j u are the normalized b-order functions of the B-spline. These can be described as:
N i j u = 1 , i f   u i u u i + 1 0 , o t h e r w i s e N i j u = u u i u i + j u i N i , j 1 u + u i + j + 1 u u i + j + 1 u i + 1 N i + 1 , j + 1 u d e f i n e 0 0 = 0
The essential functions of the B-spline curve are determined by the parametric knots u 0 u 1 u n + j . In contrast to the Bezier curve, the B-spline curve is unaffected by altering a single control point. Another benefit of the control points over the Bezier curve is that the degree of polynomials does not increase when the control points are increased.

4.8. Flowchart and Algorithm

Figure 7 illustrates the flowchart of the whole system. As we can see, the system first initializes the parameters in all colonies and places ants at the starting random positions. Then, the ants keep moving to the next node until they reach the destination. Hereinafter, they compute the path cost and update the pheromone. However, the system only updates the path if the pheromone is within the desired range. This process is repeated for the predefined number of iterations. Next, the social learning mechanism sorts the ants from best to worst according to the path cost, and the algorithm assigns the best ant to be the leader of each colony and the remaining ants to be agents. Finally, all the colonies are synchronized into one big swarm, and the system dynamically selects its leader.
Algorithm 1 Designed Strategy is Given As.
Drones 06 00104 i001

5. Simulation Results

In this section, we verify the effectiveness of the proposed strategy by applying it to a MATLAB simulation. The conclusions that we want to verify are the following: firstly, we apply the algorithm to three colonies with each having three UAVs, and see if they can organize themselves into desired formations. For the second scenario, we navigate these newly organized formations through some obstacles to see if they can maintain their formations. Lastly, we validate if the designed strategy can successfully synchronize and connect the three colonies into one swarm.
First Scenario:
For the first scenario, the UAVs are at random positions within each colony. The objective is to arrange the UAVs into the desired formations and then reach the target using the shortest possible path. Figure 8 presents the simulation result of the first scenario. As we can see, the algorithm successfully arranges the UAVs into formations, and they maintain these formations throughout the journey. The environment contains different obstacles like mountains and rough terrain. Please note that in this scenario, we are only testing the capability of the algorithm to maintain formations, and hence, we fly the UAV formations in an upward trajectory and not through the obstacles. The second and the third scenarios deal with passing the formations through the obstacles.
Second Scenario:
In this scenario, after the algorithm successfully maintains the formation, and we check whether it is also capable of obstacle avoidance. The environment contains different obstacles like mountains and rough terrain. The goal is to reach the target using the shortest route possible without colliding with the obstacles or other UAVs. Figure 9 illustrates the simulation result of the second scenario. It is evident from the result that the algorithm achieved the desired goal, and we can see that the three colonies navigated the obstacles while maintaining formations.
Third Scenario:
In this scenario, we pick up where the second scenario left off, i.e., the three colonies are now in the desired formations. The environment contains different obstacles like mountains and rough terrain. The goal is to first synchronize the three colonies into one big swarm, and then while maintaining the swarm, reach the target using the shortest route possible without colliding with the obstacles or other UAVs. Figure 10 illustrates the simulation result of the third scenario. Again, we see that the algorithm successfully synchronized the three colonies into one swarm, and it reaches its target without any collision.
Comparison with NSGA-II:
Lastly, we also compare our proposed algorithm with the Non-Dominated Sorting Genetic Algorithm II (NSGA II). We compare our proposed method with the NSGA II because it is a fast multi-objective genetic algorithm and is a highly regarded evolutionary algorithm. Figure 11 compares our proposed method with NSGA-II. Here, we can see that our designed strategy stays close to the reference while NSGA-II sometimes strays too far. Additionally, our strategy follows a shorter and quicker path to the target.

6. Conclusions

This research presents a strategy for the self-organization of a swarm of UAVs consisting of three colonies with three UAVs each. To plan the path of each colony to the target, we used max-min ant colony optimization (MMACO), and we used the social learning mechanism to sort the ants from worst to best. To organize the randomly positioned UAVs into different formations, this study used the multi-agent system (MAS). The designed algorithm also synchronized and connected the three colonies into a swarm with the help of dynamic leader selection. The proposed algorithm completed the given objectives in the simulation results.
The salient results and the findings in this research include successfully maintaining the formations in the first scenario. In the second scenario, the algorithm not only maintained the formations, but also navigated them through the obstacles. In the third scenario, the algorithm merged the three formations into one big swarm and then successfully navigated the swarm through the obstacles. By comparing the proposed method with the Non-Dominated Sorting Genetic Algorithm II (NSGA-II), it was clear that our strategy offered better convergence, optimized routes, and reached the destination using a shorter route than NSGA-II.

Author Contributions

Investigation, M.S.; supervision, Z.A.A.; validation, A.I.; methodlogy, E.H.A.; writing-original draft preparation, M.S.; software, Z.A.A.; funding acquision, E.H.A.; writing-review and editing, A.I. and M.H. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by Taif University Researchers Supporting Project number (TURSP-2020/292) Taif University, Taif, Saudi Arabia and this work is also supported by Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2022R193), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

Data Availability Statement

All data used to support the findings of this study are included within the article.

Acknowledgments

The authors would like to acknowledge Taif University Researchers Supporting Project number (TURSP-2020/292) Taif University, Taif, Saudi Arabia. The authors would like also to acknowledge, Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2022R193), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

Conflicts of Interest

The authors declare that they have no conflict of interest.

References

  1. Radmanesh, M.; Kumar, M.; Guentert, P.H.; Sarim, M. Overview of path-planning and obstacle avoidance algorithms for UAVs: A comparative study. Unmanned Syst. 2018, 6, 95–118. [Google Scholar] [CrossRef]
  2. Yao, P.; Wang, H.; Su, Z. Real-time path planning of unmanned aerial vehicle for target tracking and obstacle avoidance in complex dynamic environment. Aerosp. Sci. Technol. 2015, 47, 269–279. [Google Scholar] [CrossRef]
  3. Seo, J.; Kim, Y.; Kim, S.; Tsourdos, A. Collision avoidance strategies for unmanned aerial vehicles in formation flight. IEEE Trans. Aerosp. Electron. Syst. 2017, 53, 2718–2734. [Google Scholar] [CrossRef] [Green Version]
  4. Oh, H.; Shirazi, A.R.; Sun, C.; Jin, Y. Bio-inspired self-organising multi-robot pattern formation: A review. Robot. Auton. Syst. 2017, 91, 83–100. [Google Scholar] [CrossRef]
  5. Ganesan, R.; Raajini, X.M.; Nayyar, A.; Sanjeevikumar, P.; Hossain, E.; Ertas, A.H. Bold: Bio-inspired optimized leader election for multiple drones. Sensors 2020, 20, 3134. [Google Scholar] [CrossRef] [PubMed]
  6. Xie, Y.; Han, L.; Dong, X.; Li, Q.; Ren, Z. Bio-inspired adaptive formation tracking control for swarm systems with application to UAV swarm systems. Neurocomputing 2021, 453, 272–285. [Google Scholar] [CrossRef]
  7. Dorigo, M.; Birattari, M.; Stutzle, T. Ant colony optimization. IEEE Comput. Intell. Mag. 2006, 1, 28–39. [Google Scholar] [CrossRef]
  8. Duan, H.; Qiao, P. Pigeon-inspired optimization: A new swarm intelligence optimizer for air robot path planning. Int. J. Intell. Comput. Cybern. 2014, 7, 24–37. [Google Scholar] [CrossRef]
  9. Poli, R.; Kennedy, J.; Blackwell, T. Particle swarm optimization. Swarm Intell. 2007, 1, 33–57. [Google Scholar] [CrossRef]
  10. Ali, Z.A.; Han, Z.; Wang, B.H. Cooperative path planning of multiple UAVs by using max–min ant colony optimization along with Cauchy mutant operator. Fluct. Noise Lett. 2021, 20, 2150002. [Google Scholar] [CrossRef]
  11. Qiu, H.; Duan, H. A multi-objective pigeon-inspired optimization approach to UAV distributed flocking among obstacles. Inf. Sci. 2020, 509, 515–529. [Google Scholar] [CrossRef]
  12. Ghamry, K.A.; Kamel, M.A.; Zhang, Y. Multiple UAVs in forest fire fighting mission using particle swarm optimization. In Proceedings of the 2017 IEEE International Conference on Unmanned Aircraft Systems (ICUAS), Miami, FL, USA, 13–16 June 2017; pp. 1404–1409. [Google Scholar]
  13. Vijh, S.; Gaurav, P.; Pandey, H.M. Hybrid bio-inspired algorithm and convolutional neural network for automatic lung tumor detection. Neural Comput. Appl. 2020, 1–14. [Google Scholar] [CrossRef]
  14. Domanal, S.G.; Guddeti, R.M.R.; Buyya, R. A hybrid bio-inspired algorithm for scheduling and resource management in cloud environment. IEEE Trans. Serv. Comput. 2017, 13, 3–15. [Google Scholar] [CrossRef]
  15. Dhiman, G. ESA: A hybrid bio-inspired metaheuristic optimization approach for engineering problems. Eng. Comput. 2021, 37, 323–353. [Google Scholar] [CrossRef]
  16. Li, K.; Yan, X.; Han, Y.; Ge, F.; Jiang, Y. Many-objective optimization based path planning of multiple UAVs in oilfield inspection. Appl. Intell. 2022, 1–16. [Google Scholar] [CrossRef]
  17. Ge, F.; Li, K.; Han, Y. Solving interval many-objective optimization problems by combination of NSGA-III and a local fruit fly optimization algorithm. Appl. Soft Comput. 2022, 114, 108096. [Google Scholar] [CrossRef]
  18. Wang, Y.; Li, K.; Han, Y.; Yan, X. Distributed multi-UAV cooperation for dynamic target tracking optimized by an SAQPSO algorithm. ISA Trans. 2021, in press. [CrossRef]
  19. Shetty, A.; Shetty, A.; Puthusseri, K.S.; Shankaramani, R. An improved ant colony optimization algorithm: Minion Ant (MAnt) and its application on TSP. In Proceedings of the 2018 IEEE Symposium Series on Computational Intelligence (SSCI), Bangalore, India, 18–21 November 2018; pp. 1219–1225. [Google Scholar]
  20. Shi, B.; Zhang, Y. A novel algorithm to optimize the energy consumption using IoT and based on Ant Colony Algorithm. Energies 2021, 14, 1709. [Google Scholar] [CrossRef]
  21. Shafiq, M.; Ali, Z.A.; Alkhammash, E.H. A cluster-based hierarchical-approach for the path planning of swarm. Appl. Sci. 2021, 11, 6864. [Google Scholar] [CrossRef]
  22. Cheng, R.; Jin, Y. A social learning particle swarm optimization algorithm for scalable optimization. Inf. Sci. 2015, 291, 43–60. [Google Scholar] [CrossRef]
  23. Zhang, Z.; Li, S.; Wang, X.; Cheng, W.; Qi, Y. Source mask optimization for extreme-ultraviolet lithography based on thick mask model and social learning particle swarm optimization algorithm. Opt. Express 2021, 29, 5448–5465. [Google Scholar] [CrossRef] [PubMed]
  24. Khan, A.; Javed, M.Y.; Alhaisoni, M.; Tariq, U.; Kadry, S.; Choi, J.; Nam, Y. Human Gait Recognition Using Deep Learning and Improved Ant Colony Optimization. Comput. Mater. Cont. 2022, 70, 2113–2130. [Google Scholar] [CrossRef]
  25. Kamdar, R.; Paliwal, P.; Kumar, Y. A state of art review on various aspects of multi-agent system. J. Circuits Syst. Comput. 2018, 27, 1830006. [Google Scholar] [CrossRef]
  26. Gulzar, M.M.; Rizvi, S.T.H.; Javed, M.Y.; Munir, U.; Asif, H. Multi-agent cooperative control consensus: A comparative review. Electronics 2018, 7, 22. [Google Scholar] [CrossRef] [Green Version]
  27. González-Briones, A.; Villarrubia, G.; De Paz, J.F.; Corchado, J.M. A multi-agent system for the classification of gender and age from images. Comput. Vis. Image Underst. 2018, 172, 98–106. [Google Scholar] [CrossRef]
  28. Hamidi, H.; Kamankesh, A. An approach to intelligent traffic management system using a multi-agent system. Int. J. Intell. Transp. Syst. Res. 2018, 16, 112–124. [Google Scholar] [CrossRef]
  29. Herrera, M.; Pérez-Hernández, M.; Parlikad, A.K.; Izquierdo, J. Multi-agent systems and complex networks: Review and applications in systems engineering. Processes 2020, 8, 312. [Google Scholar] [CrossRef] [Green Version]
  30. Ruan, W.; Duan, H. Multi-UAV obstacle avoidance control via multi-objective social learning pigeon-inspired optimization. Front. Inf. Technol. Electron. Eng. 2020, 21, 740–748. [Google Scholar] [CrossRef]
Figure 1. Illustration of the first scenario.
Figure 1. Illustration of the first scenario.
Drones 06 00104 g001
Figure 2. Illustration of the second scenario.
Figure 2. Illustration of the second scenario.
Drones 06 00104 g002
Figure 3. Illustration of the third scenario.
Figure 3. Illustration of the third scenario.
Drones 06 00104 g003
Figure 4. Solution architecture.
Figure 4. Solution architecture.
Drones 06 00104 g004
Figure 5. Relationship between nodes, edges, and legs.
Figure 5. Relationship between nodes, edges, and legs.
Drones 06 00104 g005
Figure 6. Social learning mechanism.
Figure 6. Social learning mechanism.
Drones 06 00104 g006
Figure 7. Illustration of the flowchart.
Figure 7. Illustration of the flowchart.
Drones 06 00104 g007
Figure 8. Simulation result of the first scenario.
Figure 8. Simulation result of the first scenario.
Drones 06 00104 g008
Figure 9. Simulation result of the second scenario.
Figure 9. Simulation result of the second scenario.
Drones 06 00104 g009
Figure 10. Simulation result of the third scenario.
Figure 10. Simulation result of the third scenario.
Drones 06 00104 g010
Figure 11. Comparison of our proposed method with NSGA-II.
Figure 11. Comparison of our proposed method with NSGA-II.
Drones 06 00104 g011
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Shafiq, M.; Ali, Z.A.; Israr, A.; Alkhammash, E.H.; Hadjouni, M. A Multi-Colony Social Learning Approach for the Self-Organization of a Swarm of UAVs. Drones 2022, 6, 104. https://doi.org/10.3390/drones6050104

AMA Style

Shafiq M, Ali ZA, Israr A, Alkhammash EH, Hadjouni M. A Multi-Colony Social Learning Approach for the Self-Organization of a Swarm of UAVs. Drones. 2022; 6(5):104. https://doi.org/10.3390/drones6050104

Chicago/Turabian Style

Shafiq, Muhammad, Zain Anwar Ali, Amber Israr, Eman H. Alkhammash, and Myriam Hadjouni. 2022. "A Multi-Colony Social Learning Approach for the Self-Organization of a Swarm of UAVs" Drones 6, no. 5: 104. https://doi.org/10.3390/drones6050104

APA Style

Shafiq, M., Ali, Z. A., Israr, A., Alkhammash, E. H., & Hadjouni, M. (2022). A Multi-Colony Social Learning Approach for the Self-Organization of a Swarm of UAVs. Drones, 6(5), 104. https://doi.org/10.3390/drones6050104

Article Metrics

Back to TopTop