1. Introduction
Infectious diseases are a class of diseases that can be transmitted between humans, animals, or between humans and animals. They are caused by various pathogens, which pose a serious threat to human life and property security. One example is the outbreak of the COVID-19 pandemic, which has had huge impacts on people’s health, the global economy, and the international community [
1]. As of March 2023, there have been more than 470 million confirmed cases and over 6 million deaths worldwide [
2]. As a result of COVID-19, economic growth has declined and unemployment has risen globally in recent years [
3]. According to a United Nations report, the global economic growth is expected to be 1.9% in 2023, one of the lowest growth rates in decades.
Infectious diseases like COVID-19 exhibit strong infectivity and long incubation periods, leading to widespread transmission and difficulties in controlling the spread. Delayed detection of outbreaks has resulted in city-wide lockdowns in many countries. For urgent public health emergencies, the Centers for Disease Control and Prevention (CDC) must ensure continuous and sensitive monitoring for early warning, which is beneficial in preventing virus spread and disease transmission [
4]. The CDC typically conducts regular sample testing in high-risk communities during the initial stages of an outbreak. Sample collection and transportation are crucial links that require high levels of safety and timeliness.
When collecting samples within the community, residents traditionally undergo testing at hospitals or community sites, a process known for its lengthy sampling times and the potential for gatherings that contribute to the spread of viruses. Moreover, the transportation of these samples via delivery vehicles traveling between testing centers and community sites elevates the risk of contact with infected individuals, potentially turning vehicles into carriers of the virus. This increased exposure and potential for the spread of virus through sample movement highlights the importance of implementing the necessary measures to minimize the risk of infection transmission during transport. Addressing this challenge requires the development of an innovative sample collection method that prioritizes effectiveness while minimizing virus transmission risk. Ideally, this new method should be rapid and automated, facilitating early epidemic prevention actions and mitigating gatherings and cross-infection during collection and transportation. Thus, it is meaningful to propose an infection sample collection system. The development of technologies such as IoT and blockchain provide feasible solutions for this work.
In recent years, the technology behind drones has undergone rapid development, resulting in significant improvements in endurance and payload capacity [
5]. As a result, drones offer a new solution for the collection of infectious disease samples. Many companies have attempted to use drone delivery technology for sample transport. In Hangzhou, China, drones are already being utilized for the transportation of nucleic acid samples (
Figure 1). During the epidemic period, up to 100,000 samples can be delivered daily. The employed drones can support a payload of 10 kg, a flight time of at least 20 min, and a range of 18 km. Compared to traditional vehicle transportation, using drones for sample delivery has several advantages: firstly, replacing traditional modes with unmanned systems saves medical resources; secondly, transporting samples by drone avoids the temporal and spatial interaction between sample transfer and individuals, thereby reducing the risk of epidemic spread during transportation to the greatest extent possible; thirdly, drones can avoid the impact of traffic control and terrain factors on sample transport, enabling even more timely delivery; and finally, drones can be equipped with sensors, cameras and other electronic devices to ensure sample safety by recording the transportation process [
6].
However, existing drone-assisted sample collection solutions still have the potential to improve the timeliness and safety of sample collection [
7]. Additionally, existing solutions have significant shortcomings in terms of epidemic information security and the risk of epidemic spread. Drones are only used for point-to-point sample transportation between sampling points and testing centers, rather than participating in the entire collection task. Participants still need to personally queue up at sampling points, which increases the risk of personnel gathering and cross-infection. The traceability of infectious disease samples is also difficult to realize, resulting in medical institutions being unable to take effective measures to control the spread of epidemics. Moreover, public health information that lacks management and protection is at risk of being tampered with. Essentially, in existing drone-assisted sample collection solutions, drones are only replacing vehicles in the task mode, without any fundamental difference. In addition, regarding sample collection, remote areas have always been a challenge due to sparse populations and inadequate infrastructure. For these scenarios, both existing drone-assisted sample collection solutions and traditional shuttle-based sample collection solutions make it difficult to achieve the safe and rapid collection and delivery of samples.
To overcome the shortcomings discussed and improve the safety and timeliness of sample collection, we integrate emerging technologies such as IoT, drones, and blockchain to propose the Blockchain-enabled Infection Sample Collection system (BISC). The blockchain’s chain structure improves information security and makes infection samples traceable. We also propose a two-echelon drone-assisted mechanism that utilizes two types of drones for transporting samples to testing centers: the agile and fast collector drone for collecting infection samples at user points, and the deliverer drone for long-distance sample delivery. The brief workflow of the system is as follows: The collector drone will transfer the samples to the designated transit points, where they will be handed over to the deliverer drone. The deliverer drones receive samples at transit points and transport them to the nearest testing center. All samples are stored and tested at the testing center. This solution realizes contactless sample collection so that users complete sampling without the extra risk of infection. The data generated during the operation of the system and the test results are stored in the blockchain to ensure data security. The two types of drones can effectively reduce the risk of cross-infection during sample collections due to different activity areas, thus improving the biological safety and reliability of sample collection. In addition, the collaboration between collector drones and deliverer drones can significantly improve the effectiveness of sample transmission in remote areas. Our main contributions can be summarized as follows:
We propose a blockchain-enabled infection sample collection system that is built with an infrastructure layer, a blockchain layer, and an application layer. The information security of infection samples is ensured using the hash function, chain structure, timestamp, and consensus mechanism. The routing algorithm for sample collection is integrated into the smart contract to improve the reliability and automaticity of BISC.
We propose a model of a Two-Echelon Heterogeneous Drone Routing Problem with Transit point Synchronization (2E-HDRP-TS). The model integrates fixed transit points, synchronized handover operations, and heterogeneous drones with limited energy and capacity. It is specifically designed for collecting infection samples from all user points with two types of drones. It improves the time efficiency of samples, especially in remote areas, and reduces the probability of cross-infection during sample collection. To measure sample safety, we introduce the exposure index.
We present a multi-objective Adaptive Large Neighborhood Search algorithm for Routing of Drones (ALNS-RD). By incorporating flight distance-based and exposure index-based operators, this algorithm can better plan the drone’s route and improve the total collection time for infection samples and the exposure index. Based on the dominance relationship, we refine the criteria for multiple objectives and adaptive mechanisms. To increase diversity in the Pareto set, we determine the search direction based on the crowding distances.
We conducted a series of experiments based on practical cases. First, we optimized and adjusted the key parameters used in ALNS-RD. Then, we validated the effectiveness and superiority of using the flight distance operator and exposure index operator to improve the quality of the Pareto set through experiments. Finally, we compared the proposed ALNS-RD algorithm with existing ones (i.e., NSGA-II, MOLNS). The experimental results demonstrated that ALNS-RD performs much better when solving the 2E-HDRP-TS for infection sample collection.
The rest of this article is organized as follows.
Section 2 discusses related work,
Section 3 describes the architecture of BISC,
Section 4 describes the two-echelon drone-assisted mechanism,
Section 5 specifically addresses the solution approach,
Section 6 provides the experimental results, and
Section 7 concludes the article.
2. Related Work
With the continuous performance improvement of drones, they have demonstrated significant advantages in efficiency and flexibility. Research on the application of drones in logistics delivery is also increasing. We first review the literature on drone delivery systems and then review the literature on 2E-VRP and corresponding solution methods.
2.1. Drone-Assisted Delivery System
Due to drones’ enormous potential, extensive research has been conducted on drone delivery. Andreato et al. [
8] demonstrated the feasibility of drone delivery from a cost perspective and highlighted the challenges that must be overcome to make it practical. Sundar et al. [
9] proposed a routing problem that considered the fuel constraints of a single drone while minimizing total fuel and enabling multiple refueling stations. Song et al. [
10] proposed a scheduling model for a drone-delivery logistics system, which allows drones to share multiple charging stations due to the constraints of the load capacity and flight time. Dorling et al. [
11] proposed a Drone Delivery Problem (DDP), implementing a mixed-integer linear program that considered battery weight, payload weight, and drone reusability. Tian et al. [
12] proposed the Solar-Powered UAV Delivery System (SPU), which eliminates the need for charging facilities. When the SPUs run out of power, they recharge themselves at a designated landing site instead of at charging stations. Other studies have used a combination of drones and ground vehicles for delivery. Murray et al. [
13] first introduced drones into the study of the Traveling Salesman Problem (TSP), calling it the Flying Sidekick Traveling Salesman Problem (FSTSP), in which drones assist vehicles in completing package deliveries and specifying that drones only serve one customer in a single journey. Ferrandez et al. [
14] established a mathematical model to expand the truck–drone delivery system by optimizing the number and location of drone launch sites. Sajid et al. [
15] proposed mixed-integer linear programming models for UAV-routing and UAV-route scheduling problems, taking into account the effect of incidental processes and the varying payload on travel time. Aggarwal et al. [
16] analyzed different types of UAV path planning techniques and discussed issues related to UAV network communications, providing a comprehensive research perspective. Laghari et al. [
17] investigated the design of UAV systems, especially communication systems, and proposed sensor-based data processing methods as well as applications. Deng et al. [
18] proposed a new solution for vehicle-assisted UAV delivery, allowing UAVs to serve multiple customers in a single take-off while considering energy consumption. She et al. [
19] proposed self-organized UAV traffic flow in low-altitude 3D airspace, formulating the user equilibrium condition using partial differential equations and proposing a scheme for numerical solution and system performance computation.
In the field of medical delivery, many studies have shown that drones can improve the efficiency and cost-effectiveness of blood delivery, laboratory testing, and medication delivery [
20,
21]. Hii et al. [
22] pointed out that the process of drone medical delivery can potentially affect the quality of medical supplies. In their study, they indicated that medical supplies such as insulin are very sensitive to the environment. Therefore, the potential impact of temperature and vibration on the effectiveness of medical supplies should be fully considered in drone-assisted medical delivery. Scott et al. [
23] designed a new model for drone medical delivery systems. The model combines traditional land transportation with drone delivery and uses budget as a constraint to provide location decisions for drones. In addition, as the variety of IoT devices increases and technology advances, the integration of medical systems and IoT becomes a feasible solution. Zeadally et al. [
24] presented the devices and technologies that need to be incorporated into the IoT ecosystem for medical systems. They emphasize that providing reliable connectivity for medical devices and sensors through the IoT ecosystem is the key to unlocking the digital medical system. Paganelli et al. [
3] proposed a detailed description of an IoT-based conceptual architecture for a COVID-19 patient monitoring system. The architecture also provides modules for analyzing health data to provide patients with feedback and insights. Otoom et al. [
25] proposed a solution for the detection and monitoring of COVID-19 cases, which collects real-time data from wearable sensors and uses machine learning to predict suspected cases. We summarize the comparison between our work and the existing literature in
Table 1.
Currently, drone-assisted delivery is a hot topic in the logistics field. However, research on drone-assisted delivery of medical items, especially with regard to the delivery of infection samples, is still relatively rare. Unlike ordinary delivery objects, medical items often have characteristics such as being fragile and susceptible to contamination. Compared with traditional delivery processes, medical delivery not only considers costs and time but also the safety and effectiveness of medical items. In particular, for drone-assisted delivery of infection samples, we need unique transportation strategies to minimize the exposure risks of the samples. In addition, we also need to consider the drone’s energy consumption and capacity to ensure that it can successfully complete the delivery task. In addition, as far as we know, there are no papers on systems combining blockchain and drone collection of infection samples. The introduction of blockchain is beneficial for improving the traceability and information security of infection samples.
2.2. 2E-VRP
The two-echelon drone-assisted mechanism described in this paper can be characterized as a two-echelon heterogeneous drone routing problem with transit point synchronization, which is a variant of the two-echelon vehicle routing problem (2E-VRP). The 2E-VRP has been widely studied in academia over the past decade, and the classic model is as follows: there is a central station and a fixed number of intermediate sites with operational capacity restrictions on a two-level logistics network; a specific number of vehicles are used on each level of the network; the demand of all customers is fixed and known, and cannot be divided. The 2E-VRP has attracted widespread attention in the past decade, and the methods for solving this problem mainly include exact algorithms and heuristic algorithms. González-Feliu et al. [
26] were the first to propose the model of 2E-VRP, and solved an instance with 32 customers and satellites using the branch-and-cut algorithm. Perboli et al. [
27] introduced some new optimality cut classes. They improved the algorithm by adding valid inequalities based on the network flow formulation and the connectivity of the transportation system graph. Jepsen et al. [
28] proposed a modified branch-and-cut algorithm to address the problem of an incorrect upper bound when the number of transfer stations exceeds two in Perboli et al.’s model [
27]. Baldacci et al. [
29] proposed a new mathematical model in which two binary variables were used to decide whether a feasible route was adopted. They proposed an exact algorithm that decomposes the 2E-VRP into a set of VRPs with multiple depots, further enhancing the scalability and accuracy of the algorithm for solving instances. If the problem size expands further, exact algorithms cannot be efficiently solved. Therefore, in recent years, researchers have turned to using heuristic algorithms and metaheuristic algorithms to solve the 2E-VRP. Hemmelmayr et al. [
30] proposed the Adaptive Large Neighborhood Search (ALNS) heuristic algorithm, which is specifically designed for the structure of the 2E-VRP. Breunig et al. [
31] proposed a hybrid metaheuristic that combines enumeration local search with destruction and repair heuristics. Compared with other algorithms, the ALNS algorithm has a greater advantage in solving problems due to its extensive heuristic operators providing directions for neighborhood search.
In recent years, many improved models have emerged based on the classic model of 2E-VRP. Regarding the two-echelon vehicle routing problem, Crainic et al. [
32] first proposed a 2E-VRP model with a synchronization concept, which integrated multiple depots, multiple tours, heterogeneity, and time windows. Perboli et al. [
33] proposed a two-echelon vehicle path planning problem with satellite synchronization. Granger et al. [
34] proposed the 2E-VRP problem with time windows, satellite synchronization constraints, and multi-trip based on urban scenarios. Li et al. [
35] proposed the two-echelon vehicle path planning problem with satellite dual synchronization and solved it with an improved ALNS algorithm. Belgin et al. [
36] proposed a two-echelon vehicle routing problem with simultaneous pickup and delivery (2E-VRPSPD) and developed a hybrid heuristic algorithm based on variable neighborhood descent and local search. Chen et al. [
37] proposed a new two-echelon delivery system based on hybrid machine learning and heuristic optimization. The method pre-distributes predicted demands from depots to regional facilities to improve the solution quality. Yu et al. [
38] proposed the two-echelon van-based robot routing problem with hybrid pickup and delivery, including five new one-to-one pickup and delivery modes. Paredes-Belmar et al. [
39] proposed a milk transportation problem with vehicle routing and the location of milk collection centers. They addressed the location of milk collection centers to reduce costs. Xue et al. [
40] proposed a two-echelon dynamic vehicle routing problem with proactive satellite stations, which converts customers with available idle storage to satellite stations and optimizes the transportation cost and storage cost.
The above research mainly focuses on the delivery problem based on vehicle transportation and has made remarkable progress in problem modeling and algorithm design. However, these models and algorithms are not applicable to the infection sample collection scenario: (1) existing research mainly takes delivery time and cost as the main optimization objectives. However, in the sample collection scenario, the exposure level of the sample during the delivery process is a crucial factor that must be considered in addition to the above two factors. Therefore, we should minimize the exposure risk of the samples by minimizing their residence time in the handover link; (2) existing research only addresses the problem of vehicle routes without considering the relationship between effective load and energy consumption, but in drones-assisted sample collection, we must consider the impact of energy consumption and payload on the drone routes. Therefore, in this paper, we propose a 2E-HDRP-TS model. In this model, we consider factors such as the degree of cross-infection and the collection time of samples during the delivery process. To measure the safety of the samples, we propose the concept of exposure index and use it as one of the optimization objectives. Also, we focus on minimizing the collection time to improve the delivery efficiency. Our goal is to find the optimal delivery route while satisfying various constraints.
5. ALNS-RD
To minimize the exposure risk and delivery delay of infection sample collection, we integrate the ALNS-RD algorithm into the smart contract. The algorithm is executed automatically at the beginning of the BISC to obtain near-optimal flight route solutions for drones. The solution includes path sequences for collector drones to collect samples from user points and path sequences for deliverer drones to transfer samples from transit points to the testing center. The ALNS algorithm was initially proposed by Ropke and Pisinger [
46], allowing for the use of different destroy and repair operators during the searching process. Based on ALNS, Rifai et al. [
47] proposed the Multi-Objective Adaptive Large Neighborhood Search (MOALNS) algorithm to provide a good set of non-dominated solutions for multi-objective problems. However, the above algorithms are not suitable for solving 2E-HDRP-TS, which has strict constraints for two-echelon synchronization. Moreover, MOALNS simply determines the solution to be modified by randomly selecting a non-dominated solution from the archive, which is not conducive to the solution of 2E-HDRP-TS. Thus, this study defines four operators (i.e., flight distance-based destroy operator, flight distance-based repair operator, exposure index-based destroy operator, exposure index-based repair operator) for 2E-HDRP-TS to optimize the route of drones and improve the synchronization performance of handover operations. In addition, in order to ensure the diversity of the solutions, we set the solution with high crowding distances as the current solution for the next iteration.
5.1. ALNS-RD Framework
The principle of ALNS-RD is described in detail in Algorithm 1. ALNS-RD begins by generating a solution and ultimately obtains the Pareto set, in which each member represents a flight route solution for drones. The steps for each iteration of ALNS-RD are as follows: During the first iteration,
s is added to
. In the subsequent iterations, the algorithm continuously executes a looping process. First, the roulette wheel is used to select operators;
s is removed and inserted to generate
; if
meets the acceptance criterion, then
is chosen as the
s for the next iteration;
and the operator weights are updated;
s is replaced based on the crowding distance every
iterations. Finally, we obtain the Pareto solution set
. ALNS-RD utilizes two types of operators: destroy
and repair
operators. The two types of operators iteratively destroy and repair the flight route solution for drones
s to obtain a new solution
. The destroying refers to removing a number
p of user points from the current solution
s, while repairing refers to inserting all pending user points in the solution. All operators initially have the same weight, which determines their probability of being selected at each iteration. The weights are updated based on the operators’ historical performance. If
satisfies the acceptance criterion, it is accepted and becomes the current solution
for the next iteration. Additionally, if
is non-dominated, it is inserted into the Pareto set. The acceptance criterion and the update of the Pareto set are described in detail in
Section 5.7. This procedure continues until the stop-criterion is met.
Algorithm 1: ALNS-RD |
|
5.2. Initial Solution
We propose using a two-phase construction algorithm to obtain the initial solution, as described in Algorithm 2. Firstly, a best insertion heuristic is used to generate routes for deliverer drones to visit all used transit points, as described in lines 3–7. In this heuristic, the closest transit points in terms of distance are inserted into the routes of drones. Then, based on the deliverer drone’s route, multiple routes of collector drones are generated to visit user points, as described in lines 8–12. Similarly, the best insertion heuristic is used to insert user points into existing routes of drones.
Algorithm 2: Procedure for generating initial solutions |
|
5.3. Non-Dominated Solution
2E-HDRP-TS is a multi-objective problem, which requires minimization of the total completion time of sample collection and the exposure index of samples during the handover process. In most cases, these two objectives are in conflict. When the total collection time of infection samples is prioritized, the vast majority of samples are likely to be sent to the closest transit point to the testing center to ensure that the entire sample collection task can be completed quickly. However, considering that the completion time of the handover mainly depends on the last drone to arrive, doing so may result in these samples staying at this transit point for too long, which in turn increases the risk of sample exposure. This conflict between objectives prevents us from obtaining a dominant optimal solution. Therefore, we generate a Pareto solution set
to preserve the various non-dominated solutions
s. In the 2E-HDRP-TS, the Pareto dominance relation can be defined when a solution
dominates a solution
, denoted as
, if and only if the following conditions are fulfilled:
where
O is the set of optimization objectives. Equations (
20) and (
21) indicate that
s can be equal to
on one objective, but another objective must be better than
. During the iteration process,
s will be inserted into the Pareto set
. When a new solution that dominates
s appears,
s will be removed from the Pareto set
. The Pareto front of
moves forward with the optimization and is closer to the true Pareto front.
5.4. Destroy Operators
We generate a new solution by destroying and repairing the current flight route solution for drones. This process assigns infection samples to different collector drones and replans their visit routes. To destroy a current solution, we first select an operator and continuously remove user points until the removal requirement is met. There are two types of destroy operators: route-based destroy operators and user-based destroy operators.
5.4.1. Route-Based Destroy Operators
Route-based destroy operators remove the entire route of drones and contain the following three operators.
Random route removal: For the current solution, this operator randomly removes a route of the drone, which means removing all user points on this route.
Random drone removal: This operator randomly removes all user points collected by the drone. It is hoped that the repair operator will assign uncollected user points to other drones. Random drone removal is a more effective method than random route removal.
Exposure index-based route removal: Based on the low-exposure requirement of the two-echelon infection sample collection with synchronization, we proposed a destroy operator based on the exposure index. When the drones participating in the handover arrive almost synchronously, the exposure index at this handover naturally is very low. If the collector drone arrives a long time before (or after) the deliverer drone, the handover operation will involve a long waiting time and will pose a threat to the sample’s safety. We are inspired by the worst removal [
46] and directly remove the poorly performing routes in the handover. The operator begins by calculating the exposure index of each handover operation. After sorting, all user points on the worst route are removed until the removal requirement is met. This operator removes routes with poor synchronization performance to improve the safety of the sample.
5.4.2. User-Based Destroy Operators
User-based destroy operators remove certain user points from the drone’s route and contain the following four operators.
Random user point removal: This operator randomly removes user points from the drone’s route until the removal requirement is met.
Worst user point removal: For each user point, this operator calculates the difference between the cost function with and without this user point. The cost function represents the distance required to collect samples at that point. Then, the operator sorts the user points from the highest to the lowest cost and removes the worst user points from the drone’s route.
Related user point removal: This operator is based on the Shaw removal heuristic [
48], and it deletes a certain number of related user points from the drone’s route. We measure the relatedness between two user points based on their distance. A shorter distance indicates a great degree of relatedness. This operator is designed to delete closely related user points.
Flight distance-based removal: To minimize the total collection time of infection samples, we design a flight distance-based destroy operator.
As shown in
Figure 9, the flight distance of the sample at user point 1 follows the route of the collector drone to transit point 4 first, and then follows the route of the deliverer drone to the testing center (node 0). In order to evaluate the potential impact of an individual user point on the objective value
T, the following analysis is conducted. First, according to the flight time of the collector drone in Equation (
3), we calculate the time increment
for user point 1 on this route:
where
q is the quality of a single sample.
is the flight distance from user point 1 to transit point 4 along the route. The other parts of Equation (
22) can be considered as constants when the drone’s parameters are determined. Therefore,
is proportional to
but independent of the drone’s weight and total payload.
Figure 10 further explains the relationship between the flight time and flight distance of the user point. The
x axis represents the drone’s weight and the quality of each user point’s sample
q (
). The drone’s route from the starting point passes through each user point, and finally returns to the endpoint. The size of each column rectangle represents the specific time increment for the corresponding weight portion in this route.
The Equation (
22) can also be refined by considering the flight distance in the route of the deliverer drone. We assume that the handover operation is perfectly synchronized, meaning there is no waiting time for both drones involved in the handover link. The time increment of user point 1 with respect to the objective value
T is given as
In summary, there are two methods that can be used to shorten the collection time when the sample quality is fixed. The first method is to optimize the route and find shorter circuits. The second method is to find and remove user points with large time increments from the flight route and assign them to drones with lower collection costs. Since the time increment is proportional to the flight distance of the user point, we propose a flight distance-based destroy operator. The operator aims to remove user points with the longest flight distances. Instead of calculating the difference in objective value with and without these user points, it calculates the increment in the objective value in terms of flight distance. Removing the user point with the farthest flight distance after sorting can significantly reduce the total collection time of infection samples.
5.5. Repair Operators
In
Section 5.4, we remove some user points from the drone’s routes in order to generate a new solution, which results in some infection samples being uncollected in the solution. Therefore, we use the repair operator to insert the uncollected user points into the drone’s route. In the repair process, the insertion can either be conducted along the existing route or another route can be created for the drone. User-based destroy operators contain the following four operators.
Greedy insertion: For an uncollected user point, after traversing all possible insertion positions, this operator chooses the position with the lowest cost for insertion.
Random insertion: This operator randomly selects insertion positions and chooses the most suitable position among them to insert the uncollected user points.
Flight distance-based insertion: Similar to the flight distance-based destroy operator, we propose a flight distance-based repair operator. This operator traverses each route and selects the most suitable insertion position based on the minimum flight distance principle.
Exposure index-based insertion: This operator is aimed at routes that still have room for improvement in terms of exposure index. Uncollected user points are continuously inserted at suitable positions along the route to reduce the waiting time for drones in handover operations. During this process, the selected route keeps expanding until it reaches saturation, where both drones arrive at the transit point almost synchronously.
5.6. Adaptive Mechanism
The ALNS-RD algorithm utilizes an adaptive mechanism to select multiple destroy and repair operators. The adaptive mechanism can adjust throughout the entire iteration process to better adapt to different situations. The algorithm adopts a roulette wheel strategy to calculate the probability of different operators being selected. At the beginning of the iteration, the probability of selecting each operator is the same. Each operator is assigned a dynamic weight
, which depends on its contribution to the objective value. The weights of the repair operators
and the destruction operators
do not interfere with each other. Specifically, we evaluate the performance of the operators in previous iterations by their scores, and update their weights. Finally, the selection probability of each operator is calculated based on the weights and shown as
At the beginning of the iteration, all operators are initialized and have an equal probability of regenerating routes for drones. After entering the iteration, based on the score
obtained by the operator
i in the
j-th loop, the weight of selecting the operator
i in the next loop is updated as
. The score
represents the performance of the operator
i in the
j-th iteration, and
is the number of times that the operator
i was used in the
j-th iteration. When the current iteration ends, the scores obtained by the operators are used to calculate the new weights. The weight update process is shown as follows:
where
is the response factor. The
r controls the sensitivity of the adaptive selection of the algorithm and ranges from 0 to 1. As
r approaches one, the new weights essentially depend on the score of this iteration. The parameter
reflects the performance of the selected destroy and repair operators in the current iteration in the form of a score. The calculation of the score is as follows:
where
represents the new solution generated by destroying and repairing a current solution
s.
is a solution of the Pareto set
. The acceptance criteria will be detailed in the next section.
5.7. Acceptance Criteria
The acceptance criteria of ALNS-RD are developed based on the Archived Multi-Objective Simulated Annealing (AMOSA) [
49]. In AMOSA, the acceptance probability of a new solution becoming the current solution is determined by considering the dominant relationship between the new solution, the current solution, and the solutions in the archive. We propose similar acceptance criteria.
If dominates a solution in , will be accepted as the current solution and inserted into . At the same time, all solutions in dominated by will be deleted. The score of the operator in this situation is .
If and a solution in are non-dominating, will also be accepted as the current solution and will become a new Pareto solution. The score of the operator in this situation is .
If dominates the current solution s, although will be accepted as the current solution, it cannot be added to the Pareto set . The score of the operator in this situation is .
If
and the current solution
s are non-dominated, whether to accept the new solution
is determined by the difference with the Pareto solution set
. The closer the distance to the Pareto set
, the higher the probability of being accepted. In this case, the amount of domination is used to measure the difference between two solutions [
49]. Between two solutions
a and
b, the amount of domination is defined as
where
represents the range of the objective
, obtained using Pareto set
. When
is dominated by
m solutions in set
, the average amount of domination can be defined as
. And we pick
as the current solution with probability
, which is defined as
.
represents the current temperature, which is decreased in each iteration by being multiplied by the cooling rate
c. The score of the operator is
if the solution is accepted and
otherwise.
5.8. Crowding Distance and Current Solution Replacement
To prevent premature convergence to a local optimum, we select a solution from the Pareto set as the current solution at an appropriate time. Specifically, if a new solution meets the acceptance criterion, then it is promoted to the current solution for the next iteration. Otherwise, ALNS-RD continues to generate new solutions based on s or select the point with the maximum crowding distance from the Pareto set as the current solution.
In the Pareto set
, crowding distance refers to the density around a point, which is the difference in objective values between the point and its neighbors. As shown in
Figure 11, the crowding distance of solution
n can be calculated by
where
represents the range of the objective
obtained using Pareto set
. If the left or right neighbors are empty, the crowding distance
is regarded as infinitely large. A higher crowding distance typically implies a lower similarity between this solution and other solutions, indicating that the neighborhood search for this solution is not extensive enough. Consequently, if there is still no improvement after
iterations, ALNS-RD selects the solution with the maximum crowding distance as the current solution for the next iteration. The diversity of the Pareto set is maintained in this way.
6. Experimental Results
In this section, we first describe the instances used for the experiments. In
Section 5.2, we introduce two performance metrics used to measure the quality of the solution set. In
Section 5.3, we present the actual generated routes. The impact of ALNS-RD parameters on the solutions is discussed in
Section 5.4. In
Section 5.5, we discuss the impact of operators on the quality of solutions. In
Section 5.6, we conduct algorithm comparisons.
6.1. Experimental Setup
We use Pycharm as the simulation platform and run on a computer with an Intel Xeon Gold 6230R CPU (Intel, Santa Clara, CA, USA) and an NVIDIA Quadro RTX 5000 GPU (NVIDIA, Santa Clara, CA, USA). In the experiment, we generate 1500 × 1500 m instances by using an actual layout. The drone’s flight height is set at 30 m to fly in a free space area. The value of parameters related to collector drones and deliverer drones is set according to typical ones in practical use [
45]. The weight of the deliverer drone
is set to 9 kg, the maximum capacity
is 11 kg, the maximum energy
is 0.8 kW·h, and the maximum power
is 1.5 kW. The collector drone should carry a lighter load, so the weight
is set to 2 kg, the maximum capacity
is 2 kg, the maximum energy
is 0.3 kW·h, and the maximum power
is 0.5 kW. We assume that the area is covered by the 4G network and the drones are capable of maintaining real-time communication with ground facilities (user points and agencies). We assume that the communication link between the drones and ground facilities is line-of-sight communication. The communication channel modeling and parameter settings refer to [
50]. All simulation parameters are presented in
Table 3. In this simulation, it is assumed that one deliverer drone and four collector drones complete the collection task.
6.2. Instance Description
We first generate instances used for the experiments and then present the geographical configuration of the BISC used in the instances.
6.2.1. Generating Instance
We generate instances by using an actual layout of residential users near the countryside of Ekron, Kentucky, United States. As shown in
Figure 12, this area is located far from the city and is suitable for two-echelon sample collection with synchronization. Additionally, there are no buildings in the countryside to obstruct drone flights, so the route of the drone does not need to consider obstacle avoidance. The drone flies between user points and transit points, delivering all infection samples from the user points to the testing center.
Specifically, we obtain the real geographic locations through Google Earth and measure the scale of the entire scene. Then, we label the locations of the user points using ImageJ (
https://imagej.net/ij/), an image analysis software program. The instances are generated with the map origin at the bottom-left corner, and the coordinates of all user points are computed to obtain the distance matrix.
6.2.2. Geographical Configuration
Based on the geographical configuration proposed by Grangier et al. [
34], we adopt the following
notation to describe the positions of the testing center and the transit points.
X and
Y are expressed as a percentage of the map size, indicating the location of a testing center.
M and
N represent the number of rows and columns of the grid. A transit point is located at each intersection of the grid.
Figure 13 shows a −50/50/3/3 configuration, which is used in all experiments. Finally, we combine the actual layout of residential users and geographical configuration to create an instance called TDR1. The following route results and parameter tuning of ALNS-RD are based on TDR1.
6.3. Performance Metrics
To evaluate the performance of ALNS-RD, we use the following two performance metrics to measure the quality of the Pareto set.
6.3.1. Perfect Point
In order to evaluate the quality of the obtained solutions, we introduce the perfect point
. The perfect point
is the best possible point, which has the lowest value in each objective of the Pareto set generated by ALNS-RD.
represents the perfect value of the perfect point
in objective
and is calculated by
where
O represents the set of optimization objectives,
O = {1, 2}.
represents the optimal total collection time of infection samples
T of the entire Pareto set
, and
represents the optimal exposure index
Q of the entire Pareto set
.
6.3.2. Average Euclidean Distance
The average Euclidean distance is commonly used to evaluate the convergence of a solution set. The Euclidean distance from the solution
s to the perfect point
is denoted as
, which can be calculated as follows
where
is the range of the objective
in the Pareto set
. The average Euclidean distance of all solutions in the
is denoted as
, which can be calculated as follows
where
is the number of solutions in the Pareto set
. A lower value of average Euclidean distance
implies that all solutions are closer to the center of the solution space. This also means that the Pareto set
is closer to the true Pareto front.
6.4. Simulated Route Results
In this section, we show the simulated route results based on instance TDR1 using ALNS-RD.
As shown in
Figure 14, we select the three most representative solutions from the Pareto set for route presentation, which are the solution that minimizes the total collection time of infection samples
T, the solution that minimizes the exposure index
Q, and the trade-off solution. In this solution, the deliverer drone has one route, while the collection drones have two routes. Therefore, we present the routes of drones in two separate figures, with each drone represented by a different color.
Figure 15 shows the flight routes of the drones presented by the solution that minimizes the total collection time of infection samples
T. As shown in
Figure 15a, the first routes of the collector drones depart from the transit point and return to the same transit point for handover, completing the collection of infection samples at each user point in the process. There is no intersection between routes of collector drones. This indicates that the drones tend to make the shortest route decision, resulting in the shortest collection time. As shown in
Figure 15b, the collector drones all choose the nearest transit point to the testing center, aiming to minimize the total collection time of infection samples.
Figure 16 shows the flight routes of the drones presented by the solution that minimizes the exposure index
Q. In the Pareto set, to optimize the exposure index as much as possible and improve the synchronization of handover operations, the total collection time of infection samples
T has to be increased. In this case, the solution with the shortest route may not lead to the optimal objective value
Q. We observe that in
Figure 16a,b, there are route intersections, which means that these routes are not the shortest circuits. ALNS-RD can control the arrival time of the collector drone at the transit point by extending its route, so that the collector drone synchronizes with the deliverer drone at the transit point.
Figure 17 shows the flight routes of the trade-off solution, in which the drones balance the collection time and exposure index. This solution maintains a relatively low collection time of infection samples and the exposure index. This trade-off ensures efficient collection while keeping the risk of infection spread acceptable. Although this solution may not be optimal, it is highly possible for it to be chosen in practical applications.
Figure 18 shows the payload variation of the drones during the collection and delivery process in the trade-off solution. Collector drones 1, 2, and 3 each experience two payload drops, resulting in two transit operations. Collector drone 4 visits numerous user points located on the edges and performs one transit operation.
In real applications, the Pareto set can provide multiple alternative solutions. The decision maker can select a flight route solution for drones from the Pareto set that best fulfills requirements.
6.5. Parameter Tuning
In this section, we describe how the ALNS-RD parameters are used in the following experiments.
Despite the large number of parameters used in the ALNS-RD, it turns out that it is possible to find a set of parameters that works well for multiple instances. We use the following strategy to tune the parameters. We first generate a set of fair parameter settings. Then, we allow a particular key parameter to take a number of values while the rest of the parameters are kept fixed. With an insufficient number of iterations (1000), the perfect value will be affected by the change of parameter values. Insufficient iterations can amplify the differences caused by different parameter values. For each parameter setting, the perfect values are calculated ten times, and the average is taken. Deviation refers to the difference between the average perfect value and the best perfect value in the entire experiment, which is used to evaluate the rationality of the experimental parameter settings. The lower the deviation, the more likely it is that the solution set obtained based on this parameter is close to the true Pareto front. We choose the settings with the best average perfect value and move on to the next parameter. This process continues until all parameter values have been determined.
First, we determine the value of the destroy ratio
(
). It represents the ratio of the number of user points removed to the total number of user points. Generally, the removal rate
has a significant impact on the convergence speed of ALNS-RD and the objectives. The removal rate
ranges from 0.05 to 0.5, with a step size of 0.05.
Table 4 shows the impact of
on perfect value
. Obviously,
ranges from 0.2 to 0.3 is a good choice and we choose 0.3.
The reaction coefficient
r is another key parameter that controls the sensitivity of the adaptive selection operator during the weight update process. This parameter is set in the range of 0.1 to 0.8 with a step size of 0.1. By applying the same parameter tuning strategy, the results in
Table 5 are obtained. The results show that the deviation of average perfect value is minimized when
r is set between 0.4 and 0.5. Therefore, we set the reaction coefficient
r to 0.5.
In addition to the damage rate
and the reaction coefficient
r, the ALNS-RD algorithm includes the following parameters
. Referring to the study of Ropke & Pisinger [
46], the values of
are taken as (0.99975, 20, 16, 10, 6, 2, 10), respectively.
6.6. Impact of Operators on Solutions Quality
In this section, we discuss the impact of neighborhoods composed of different operators on the quality of solution sets. To make the experimental results convincing enough, we run multiple neighborhoods on instances of different scales.
Table 6 provides detailed descriptions of the instances (TDR1-TDR5).
We combine different destroy and repair operators, as shown in
Table 7. Neighborhood 1 consists of the basic operators mentioned in
Section 5.4 and
Section 5.5. Neighborhood 2 incorporates flight distance-based destroy and repair operators in addition to those in Neighborhood 1. Neighborhood 3 further includes exposure index-based destroy and repair operators. We perform insufficient iterations (1000) for the convergence speed and solution quality of different neighborhoods. We record the best perfect values from 10 runs and the average perfect value. Deviation refers to the difference between the average perfect value in the neighborhood and the best perfect value in the entire experiment. The results are presented in
Table 8 and
Table 9.
Table 8 and
Table 9 show the impact of the different neighborhoods on the solution quality. We observe that the flight distance-based operator is highly effective at optimizing the total collection time of infection samples
T. Neighborhood 2 incorporates this operator into the search process and significantly improves the quality of the objective
T. However, we found that when the exposure index-based operator is used to optimize the total collection time of infection samples
T, its impact on the quality of the Pareto set is very limited. Specifically, in
Table 8, we can observe that the results obtained from Neighborhood 2 and Neighborhood 3 are very close. In contrast, in
Table 9, the best perfect value and average perfect value for neighborhood 3 with the exposure index-based operator are improved significantly. In general, the flight distance-based operators and the exposure index-based operators play different roles in minimizing the objective value. The former is very effective at optimizing the total collection time of infection sample
T, while the latter can significantly improve the exposure index
Q. The results prove that the proposed flight distance-based operators and the exposure index-based operators can generate a Pareto set that is closer to the true Pareto front. Based on the experimental results, we configure neighborhood 3 for algorithm comparison.
6.7. Algorithm Comparison
In this section, we demonstrate the effectiveness of the proposed ALNS-RD algorithm by comparing it with benchmark algorithms (i.e., NSGA-II [
51], AMOSA [
49], MOLNS [
52] and MOALNS [
47]). NSGA-II (Non-dominated Sorting Genetic Algorithm II) is commonly used in multi-objective optimization and is one of the classic multi-objective optimization algorithms. In addition, MOLNS (Multi-Objective Large Neighborhood Search), which is similar to ALNS, is also included to evaluate the significance of the improvement. All algorithms are run using the same equipment and software to avoid the bias. The ALNS-RD algorithm is configured with a neighborhood 3 and iterated 10,000 times. The MOLNS, MOALNS and AMOSA follow the same parameter settings as ALNS-RD to ensure the same number of iterations. As NSGA-II is a population-based optimization, the number of iterations is set differently compared to other algorithms. The parameters in NSGA-II were configured using the recommended parameters in [
51], which are shown in
Table 10.
As shown in
Table 11 and
Table 12, for the Pareto sets obtained by running the different algorithms ten times, we calculate the best perfect value Best.
, the average Euclidean distance of the Pareto Set with Best.
, the average perfect value Avg.
, and the deviation. Except for TDR4, the Pareto set obtained using ALNS-RD significantly outperforms those obtained using other algorithms in all performance metrics. ALNS-RD consistently finds the Pareto set which has the best perfect value
in the entire experiment. This may be due to the fact that the proposed operators provide directions for the neighborhood search, making the solution set of good quality. In particular, in instances with more than one hundred user points, the deviation of the perfect value and average Euclidean distance of the Pareto set obtained using ALNS-RD are significantly better than those obtained by other algorithms. This indicates that ALNS-RD is able to find solutions that are closer to the true Pareto front in instances with a high level of complexity. This is a highly desirable characteristic, which means that ALNS-RD is more reliable when it comes to exploring the solution space and discovering high-quality solutions. The above results show that for 2E-HDRP-TS, the ALNS-RD has good performance and generates solution sets closer to the true Pareto front.
7. Conclusions
This study proposes a Blockchain-enabled Infection Sample Collection system (BISC) architecture, which is built with an infrastructure layer, a blockchain layer, and an application layer, in order to improve the safety and timeliness of infection sample collection. Additionally, a two-echelon drone-assisted mechanism termed the Two-Echelon Heterogeneous Drone Routing Problem with Transit point Synchronization (2E-HDRP-TS) is proposed, representing an unexplored area in this field. This problem integrates fixed transit points, synchronized handover operations, and heterogeneous drones with limited energy and capacity. The 2E-HDRP-TS can be divided into two interrelated decisions: the flight routes of collector drones for collecting infection samples from user points, and the transport routes of deliverer drones from transit points to the testing center. To generate near-optimal solutions, we introduce a multi-objective Adaptive Large Neighborhood Search algorithm for Routing of Drones (ALNS-RD), aiming to minimize the total collection time of infection samples and the exposure index. Alongside traditional search operators, the ALNS-RD algorithm incorporates innovative flight distance-based and exposure index-based operators to enhance the efficiency and accuracy of the solution. Our comparative analysis against benchmark algorithms NSGA-II, AMOSA, MOLNS, and MOALNS demonstrates the superior performance of the proposed ALNS-RD algorithm across diverse levels of complexity in all five instances.
Even though the performance of the ALNS-RD algorithm is very good, due to its nature as an offline heuristic algorithm, drones are unable to handle new task requests during the sample collection process. Hence, our future work will focus on investigating online route planning algorithms for drones, with the aim of enabling dynamic adjustment of flight routes during the sample collection and delivery process. We envision the possibility of using neural networks and reinforcement learning to develop a viable online drone route planning algorithm. In addition, Clustering users will help to complement the biological safety of infection samples. Therefore, it is necessary to propose an innovative framework for implementing the user points clustering in the infection sample collection system.