1. Introduction
In recent decades, the topic of logistics has received a lot of attention from service and production companies. Logistics management means planning, organizing, leading, coordinating, and controlling logistics activities and implementing them in the best way in order to improve the efficiency of the logistics system and increase profits. The logistics distribution system serves as a bridge between the supply and demand points in the logistics network, and its purpose is to deliver materials, goods, or products from the supply points to the demand points at the right time and in the right size, keeping in mind the economic conditions. The transportation sector in the distribution of logistics accounts for the highest cost, and cost management in this sector is highly sensitive because the cost of product transportation significantly affects the final product price. As a result, minimizing the costs of the entire distribution system is among the primary objectives and goals of the logistics companies. Nowadays, various logistics firms seek to effectively plan for decreasing their operational costs and rendering appropriate services to their customers in order to increase customer satisfaction and survive in this competitive global market. Several documents and evidence demonstrate that optimizing the distribution system has led to significantly decreasing the costs of transportation [
1]. Moreover, it should be noted that losing only 5% of customers will result in reducing the income of the logistics system between 25% and 85%, which is pertinent to the nature of the industry. Customer satisfaction as an essential requirement for retaining customers may enhance the life cycle of the customer [
2]. These issues have resulted in the significant development of the vehicle routing problem (VRP). VRP directly affects the level of customer service and distribution costs. Distribution centers and customers are known as the main components of logistics distribution, and routing is between distribution centers and customers.
Moreover, the Sustainable Development Goals (SDG) of the United Nations (UN) require world-wide countries to pay more attention to the environment. To this end, with the increase in urbanization, tremendous efforts have been made in the transportation industry to address confined fossil fuels and natural resources, climate change, and environmental issues. Decreasing air pollution compels the logistics industry, which is accused of 20% of the entire air pollution. [
3]. As a result, developing environmentally friendly transportation and logistics systems has been addressed in the last decade because of transportation operations that affect environmental pollution as well as global warming. There are also growing social and governmental concerns about environmental issues in the form of new rules, regulations, and laws, as well as social incentives leading to the proposal of novel distribution and transportation models [
4]. Various studies and research works have been conducted in the field of green transportation and logistics, aiming to enhance transportation sustainability while considering social, environmental, and social problems. The use of EVs instead of traditional vehicles and their impacts on the environment is a major topic for these studies on green transportation and distribution [
5,
6,
7]. Several developed countries have been cooperating to produce environmentally friendly vehicles ranging from small cars to large trucks in order to decrease greenhouse gas emissions and optimize fuel usage [
8].
The above-mentioned problems explicitly show the importance and motivation of studying environmentally friendly logistics and transportation systems. As a result, the aim of this paper is to address the use of EVs instead of vehicles that consume fossil fuels. The present study aims to develop a multi-objective mathematical programming model for the Capacitated Electric Vehicle Routing Problem (CEVRP), considering all sustainability aspects. In the proposed model, the capacity of EVs, the possibility of recharging, and time windows for availability are taken into account. In addition, the hard time window for customer service is assumed to increase customer satisfaction. It is presumed that there is a maximum time or deadline for customer service in order to return the EVs to the final (destination) point in such a way that EVs are available for re-serving in the upcoming time periods. Moreover, the early return of the EVs to the terminus point ultimately leads to enhanced customer service and customer satisfaction. The first objective function of the proposed model is to minimize the total cost, which is one of the most crucial factors for any logistics company, by choosing the optimal routes. On the other hand, the second objective function attempts to minimize the negative environmental effects by optimizing the amount of charging and loading EVs. On the contrary, the third objective function tries to maximize customer satisfaction by fulfilling customer demand within an appropriate time window. In order to solve this model and find the optimal solution, the EC exact method is employed for small-sized problems. Also, three MOSA, MOGWO, and NSGA-II-TLBO meta-heuristic algorithms are exploited to deal with large-sized problems, as the problem is HP-Hard. In addition, the impact of changing the main model parameters is analyzed on the results.
The research questions can be mentioned as follows:
What are the indicators of pollution reduction in the studied CVRP problem in this research?
What is the effect of the time window on reducing the emission of pollution by the green vehicle?
How is it possible to recharge green vehicles along the way?
What is the effect of using green vehicles with different capacities on reducing pollution?
It should be noted that the MOSA algorithm is an extension of SA that is inspired by the annealing process in metallurgy and is used to change the chemical and physical properties of a material. The MOGWO algorithm is an extended gray wolf optimization algorithm that is inspired by the behavior of a type of wolf called the gray wolf. It was simply created by studying and examining the characteristics and behaviors of gray wolves. The NSGA-II algorithm is one of the leading and most widely used algorithms for multi-objective optimization problems. This algorithm, which was inspired by the principles of natural selection and genetics, mimics the process of natural evolution to find optimal solutions to complex problems. The TLBO optimization algorithm, such as other meta-heuristic algorithms, is a population-based algorithm derived from nature and works based on the influence of a teacher on learning in the classroom. In the present research, the NSGA-II algorithm is combined with the TLBO algorithm.
The remainder of this paper is organized as follows: The literature is thoroughly reviewed in
Section 2.
Section 3 proposes the sustainable multi-objective optimization model. Subsequently,
Section 4 elaborates on the solution approaches. Then, the proposed optimization model is validated in
Section 5. The efficiency of NSGA-II-TLBO is compared with two recent meta-heuristic algorithms, MOSA and MOGWO, and the findings are supplied in
Section 6. At the end, a conclusion and suggestions for further research are presented in
Section 7.
2. Literature Review
The Vehicle Routing Problem (VRP) as a substantial problem in goods distribution management has attracted the attention of numerous practitioners and researchers. The VRP problem is categorized into several classifications according to the objective functions and constraints of the distribution systems. This diversification leads to a broad range of problems with various structural and operational features. Since the VRP problem has been extensively investigated by several scholars, it is intricate to examine the improvement of all its classifications during the last decade. In this section, a review of the relevant studies on Capacitated VRP (CVRP) and Electric VRP (EVRP) is presented.
VRP was introduced by Golden et al. in 1977 for the first time. Since then, this problem has been widely investigated [
9]. Solomon [
10] proposed the concept of a time window. Then, Erdoğan, and Miller-Hooks [
11] presented the Green VRP (GVRP), assuming that the vehicles are fully refueled with a constant refueling duration in the defined stations on the predetermined routes. They developed a single-objective mixed integer linear programming (MILP) mathematical model for minimizing the route traversed by the non-capacitated vehicles, considering the confined tour distance and distinctive duration. Subsequently, Felipe et al. [
12] studied GVRP considering partial recharging and diverse technologies, in which the technologies of the charging stations are different and green vehicles can be partially recharged. They designed a MILP optimization model aiming to minimize the total costs of refueling and the total distance traversed, and they solved the model using the Simulated Annealing (SA) meta-heuristic algorithm with a local search heuristic algorithm.
Schneider et al. [
13] investigated EVRP, considering recharging stations and time windows, and solved numerous problem instances. Then, Koç and Karaoglan [
14] introduced a VRP model and employed the branch-and-cut exact method and SA to find better upper bounds and initial solutions. Subsequently, Xiao and Konak [
15] developed a MIP model for the scheduling of CVRP, assuming the time window for customer service and routes’ traffic. Followingly, Leggieri and Haouari [
16] formulated both capacitated and non-capacitated VRPs.
Sedighizadeh and Mazaheripour [
17] introduced an optimization model with multiple objectives for CVRP, including the objective functions of minimizing the number of vehicles, travel duration, and cost, and maximizing the discrepancy between the earliest and latest times for all customers. They assumed hard time windows for serving and visiting customers and exploited a hybrid of the Particle Swarm Optimization (PSO) and Artificial Bee Colony (ABC) meta-heuristic algorithms as a solution approach. Then, Montoya et al. [
18] presented an optimization model for EVRP considering the recharging durations of EVs. Later on, Keskin and Catay [
19] applied an Adaptive Large Neighborhood Search (ALNS) meta-heuristic algorithm for a type of GVRP, considering various types of vehicle recharging. Zhang et al. [
20] computed the EVs’ energy consumption and exploited a meta-heuristic approach to deal with it. Then, Jie et al. [
5] proposed a kind of two-echelon CEVRP, considering several assumptions such as diverse battery capacity, load capacity, and battery recharge cost. Finally, a hybrid meta-heuristic algorithm was exploited to tackle this problem. Subsequently, Mackrina et al. [
21] investigated GVRP, considering two kinds of fossil fuel and EVs, as well as time windows and partial recharging. Basso et al. [
22] designed a mathematical model for EVRP for calculating energy consumption, taking several assumptions such as topology, traffic, speed, and number of intersections into account. Xu et al. [
23] addressed the capacitated green VRP considering soft time windows and time-varying vehicle speed and solved this problem by using the NSGA-II meta-heuristic algorithm.
Zhang et al. [
24] proposed a fuzzy mathematical model for EVRP considering recharging stations and time windows and exploited the Adaptive Large Neighborhood Search (ALNS) meta-heuristic algorithm to solve the model. Lu et al. [
25] investigated a time-dependent EVRP with the objective of cost minimization to specify the departure time and speed of the vehicle and dealt with this problem by employing a meta-heuristic algorithm named Iterated Variable Neighborhood Search (IVNS). Zulvia et al. [
26] investigated the green VRP for perishable products to minimize costs and carbon emissions while maximizing customer satisfaction and solved it by implementing the many-objective gradient evolution (MOGE) meta-heuristic algorithm. Zhou et al. [
27] proposed a novel collaborative multi-heterogeneous-depot electric vehicle routing problem with mixed time windows and battery swapping. Bahrami et al. [
28] proposed a type of VRP in which the vehicles consume gasoline and electricity and solve the problem using the branch-and-price algorithm. Zhu et al. [
29] developed an optimization model to decrease air pollution in the multi-depot EVRP and applied meta-heuristic algorithms as the solution approached. Mavrovouniotis et al. [
30] proposed a mathematical programming model for EVRP and employed a meta-heuristic algorithm to solve it. Kozák et al. [
31] presented a model for EVs.
Moreover, Froger et al. [
32] designed a mathematical programming model for EVRP considering recharging stations. Furthermore, Futalef et al. [
33] modeled the single-objective CEVRP and solved the model using the GA meta-heuristic algorithm. On the other hand, Gupta et al. [
34] developed another optimization model with multiple objectives, considering multiple depots for minimizing fuel emissions in VRP, and solved it using a hybrid GA. Karakatič [
35] introduced a multi-objective mathematical programming model for recharging EVs in CVRP in order to minimize stops as well as travel and recharging times and employed a Two-Layer GA for solving the problem. Arias-Londoño et al. [
36] proposed a MILP model for CVRP, taking the last-mile delivery, battery capacity, and recharging station into account, and solved a test problem with the GAMS optimization software. Furthermore, Pan et al. [
37] studied an urban CVRP considering time windows, travel and loading time, maximum trip duration, and vehicle multiple-trip aiming to reduce the total traveling distance. They also used a hybrid of ALNS and VND meta-heuristic algorithms to tackle the problem. Fan et al. [
38] developed an integer model for multi-depot VRP considering different assumptions such as vehicle load, speed, and time-varying road gradient to minimize the total costs comprising fixed costs, penalty costs, and fuel costs. This problem was solved by using the combination of GA and VNS. The numerical problems demonstrated the effectiveness of their model and solution approach. Olgun et al. [
39] investigated GVRP, considering customer pickup and delivery demand at the same time to minimize cost, and exploited a hyper-heuristic algorithm based on iterative local search (HH-ILS) to solve the problem. The numerical problems indicate the efficiency of this algorithm. Moreover, Moghdani et al. [
40] conducted a systematic review of the literature on GVRP, considering different objective functions, uncertainty, and methodologies published from 2006 to 2019. They concluded that studies on GVRP are comparatively fresh and that significant improvements can be made in various areas. Hesam Sadati et al. [
41] applied the Variable Tabu Neighborhood Search (VTNS) meta-heuristic algorithm to solve VRP with multiple depots and time windows. The findings display the effectiveness of the solution method. Wang et al. [
42] studied VRP with multiple depots and multiple time periods for sharing logistics resources to minimize transportation cost, number of vehicles, and service waiting time and employed a hybrid NSGA-III meta-heuristic algorithm to solve the problem. They also implemented their model in a case study in China to validate the model and algorithm.
Xue et al. [
43] studied the two-echelon dynamic VRP, which transforms customers with useless storage into satellite stations, aiming to minimize make-to-stock and operational costs. They exploited a hybrid of GA and TS to solve this problem. Finally, a case study in China was considered to demonstrate the model’s efficiency. Xiang et al. [
44] implemented an Ant Colony Optimization meta-heuristic algorithm based on coverage diversity on a type of VRP to retain the variety of customers covered within routes in order to respond to new customer demands. Several test problems were also used to show the effectiveness of the model and algorithm. Li et al. [
45] developed the Markov decision model to identify the optimal time allocation policy for the fleet of EVs and proposed the dynamic programming algorithm for solving this problem. They implemented the model using numerical examples for validation. Bruglieri et al. [
46] suggested the Greedy Randomized Adaptive Search Procedure (GRASP) for solving a classification of the GVRP to minimize the total distance traveled. Erdem [
47] proposed a sustainable MIP model with the goals of optimizing transportation operations and waste collection for an extension of EVRP in which EVs visit the waste bins and employed the Adaptive Variable Neighborhood Search (AVNS) algorithm for solving the problem. Wen et al. [
48] exploited an improved ALNS meta-heuristic algorithm to solve GVRP with time windows and multiple depots. The results indicate the high accuracy and speed of the ALNS meta-heuristic algorithm.
Kuo et al. [
49] modeled a VRP considering time windows and the objectives of minimizing costs and carbon emissions. They also applied the improved MOPSO meta-heuristic algorithm to solve this problem. The findings demonstrate the efficiency of MOPSO. Zhang et al. [
50] introduced a type of CVRP considering stochastic demands. They used Monte Carlo simulation and scenario analysis to minimize the expected total logistics costs and employed a hybrid meta-heuristic algorithm based on Adaptive Tabu Search (ATS) to solve this problem. Wang and Zhao [
51] investigated EVRP, considering the strategy for partial linear recharging, time window, and heterogeneous vehicles, and developed an efficient heuristic algorithm based on LNS to solve the problem. Moreover, Wang et al. [
52] proposed a bi-objective non-linear model for EVRP considering multiple depots, shared charging stations, and time windows in order to minimize the number of EVs and total operational costs. They also suggested a hybrid of Multi-Objective GA and TS algorithms for solving this problem. The results reveal that the algorithms can considerably reduce operational costs. Furthermore, Xiao et al. [
53] studied EVRP considering mixed backhauls and time windows and proposed the Diversity-Enhanced Memetic meta-heuristic algorithm (DEMA) to deal with the problem. They finally implemented their proposed model and algorithm on several test problems for validation. Amiri et al. [
54] addressed GVRP and presented a bi-objective model to minimize transportation costs and greenhouse gas emissions and exploited the EC method and ALNS meta-heuristic algorithm to solve this problem. Asghari et al. [
55] studied EVRP, considering recovering a pre-determined schedule of an EV in case of an unpredicted disruption to determine the optimal vehicle velocity and the policy for battery recharging. They solved the model by using a PSO-based meta-heuristic algorithm. The findings indicate that the application of the recovery actions can greatly decrease disruption costs. Ma et al. [
56] incorporated congestion and energy consumption into EVRP with the objective of minimizing cost and solved this problem by applying the ALNS meta-heuristic algorithm. The performance of the algorithm was also examined using test problems. Zhang et al. [
57] studied the long-term bus electrification that simultaneously optimizes the charging infrastructure deployment and bus fleet transition. Han et al. [
58] proposed a MIP model for the routing problem of electric trucks, considering multiple charging alternatives with the objective of minimizing total transportation costs. They applied the ALNS algorithm to solve the model on test problems, and the findings demonstrated the efficiency of the algorithm. Cai et al. [
59] investigated some of the main issues of EVRP with backup batteries, battery swapping stations, and time windows to simultaneously optimize different tasks by exploiting a novel meta-heuristic algorithm. They also validated their model and algorithm using 30 numerical examples as well as four real-world problems. Abid et al. [
60] attempted to iterate real-world parameters such as energy consumption, heterogenous fleets, and infrastructure data in order to minimize the total number of vehicles used, distance traveled, travel time, and energy consumption. In addition, several studies have been conducted on EVs and energy consumption [
61,
62,
63,
64].
A comprehensive study of the literature demonstrates that despite several research studies conducted on EVRP, some gaps still exist in this field. As a result, this paper seeks to contribute to the existing literature by filling these gaps. The primary contributions of the present study can be mentioned as follows:
Presenting a multi-objective mathematical programming model for the Capacitated Electric Vehicle Routing Problem (CEVRP), taking both soft and hard time windows as well as the possibility of EV charging into consideration,
Proposing a hybrid of Non-dominated Sorting Genetic Algorithm (NSGA-II) and Teaching–Learning-Based Optimization (TLBO) meta-heuristic algorithms to solve this problem,
Comparing the performance of the proposed hybrid NSGA-II-TLBO meta-heuristic algorithm with other well-known meta-heuristic algorithms by solving different test problems based on six criteria,
Performing statistical analysis for determining the meaningful difference between the performance of the algorithms according to six criteria.
4. Multi-Objective Mathematical Model
The proposed model includes the following assumptions: The set C represents the service (customer) points, and the set R denotes the charging stations. Also, there is only one depot. In addition, a graph of A = (V, G) is assumed in which denotes the present nodes containing the depot, charging stations, and customers. Moreover, A shows the set of arcs. EVs with various capacities begin their tours at the depot. Each vehicle consumes energy, whose amount is computed based on the distance traveled.
A group of customers that need to receive goods are included in a distribution network.
The EV fleet is heterogeneous.
The tour of each EV starts at the depot with the total amount of goods that must be delivered to the customers, and each EV has a full charge while departing the depot.
Partial delivery is not allowed.
Each customer must be serviced (receive goods) once with a certain fleet of identical EVs.
Each tour terminates in the depot, and a time window is assumed for the time to reach the depot and can go beyond the allowable time; however, this amount is taken into consideration in the environmental (second) objective function.
Every EV is chosen for one route.
A hard time window is assumed for the customer’s visiting time for rendering service, which must not go beyond the time interval.
A certain time interval is assumed for the vehicles’ accessibility.
There are various types of EVs with a defined capacity.
The vehicle load cannot exceed the vehicle capacity at any time during the tour.
The capacity of the depot is large enough.
Charging stations may be visited more than once by an EV.
Partial charging is also possible.
EVs are assumed to be charged at a fixed rate at charging stations.
The battery charge of the EV and its load must be zero when it returns to the depot.
The load flow is calculated for each arc.
Sets
and : Node
: number of EVs e
: number of charging stations
: The set of initial depots and charging stations
: number of customers
: The set of initial depots and customers
: The set of charging stations and customers
: The set of initial depots, charging stations, and customers
Parameters
: The weight of each customer’s order
: The consumption rate of vehicle e
: Distance between nodes I and j
: Time needed for passing from node i to node j with vehicle e
: Service time for customer i
: Earliest start time i for customer service
: Latest start time i for customer service
: Customer demand i
: Charging time rate
: The energy capacity of EV e
: The loading capacity of EV e
: Remained charge for each eVper node
: Penalty cost for the remaining charge for each EV
: The cost of travel from node i to node j with EV e
: Penalty for early arrival to the customer for each EV
: Penalty for late arrival to the customer for each EV
: A large enough number
Decision variables
: Time to arrive at the nod with EV
: The remaining load of EV n when reaching node i
: The remained charge of EV n when reaching node i
: 1; if the EV n travels between nodes i and j, otherwise, 0.
: The amount of load in each arc that the EV carries
: The earliest time that the EV reaches the customer
: The latest time that the EV reaches the customer
Equation (1) shows the economic objective function, which consists of four parts. The first part calculates the cost of the route traveled by each vehicle
e; the second part calculates the penalty associated with the remaining charge of each vehicle; and the third and fourth parts calculate the cost of arriving late or early to the customer. Equation (2) expresses the environmental objective function that seeks to minimize electricity consumption by reducing the distance traveled by EVs. Equation (3) shows the social objective function that minimizes the vehicle’s arrival time to the depot.
Constraints (4)–(7) represent the visited nodes and ensure that each customer is served by only one EV at a time so that customer demand is met. In fact, these restrictions determine the EV’s route. Constraint (8) represents the time to reach the nodes. Constraint (9) indicates the arrival time to the following nodes when EV is in the node of the charging station. Constraint (10) deals with the assumed hard time window associated with the arrival time to the customer node since each customer must be served at a determined time interval. Constraint (11) indicates the amount of the EV load in the following nodes in such a way that when the necessary load (that is less than or equal to the loading capacity of EV) is loaded at the pickup depot, it is decreased when visiting each customer. Constraint (12) dictates that the vehicle load in the pickup depot must be equal to the maximum loading capacity of an EV. Constraint (13) displays the amount of EV charge in the successive nodes. Constraint (14) represents the amount of vehicle charge in the consecutive nodes. In other words, this constraint states that the energy level of an EV is at its maximum when it leaves the initial depot or charging station. Constraints (15) to (18) control the load flow of the vehicle. Constraints (19) and (20) are related to the hard time window for satisfying customer demand.
5. Solution Approaches
Goal programming (GP) is one of the most practical multi-objective approaches. Decision makers assign specific values to objective functions as their expectation levels or goals in the GP approach and then try to sum up the deviations from the expectation levels to minimize. In this method, the ideal of the objective functions must be expressed with a specific value, and since there is uncertainty in real problems, it is almost impossible to determine a certain value for the desired goal. One of the most effective approaches to dealing with uncertainty in the decision-making process is the fuzzy sets theory. Therefore, the fuzzy goal programming (FGP) approach, as a combination of fuzzy sets theory and GP, has been introduced as a practical approach [
65].
PFGP is a new development of FGP that was first introduced by Mirzaei et al. in 2018. In this method, goals are prioritized, and the degree of achievement of the most important goal should be higher than the degree of achievement of other goals. These priorities are added to the model through some equations as priority constraints. PFGP can be implemented on multi-objective problems with uncertainty in objectives and objectives with varying importance. The variables and parameters of the PFGP method are described as follows [
66]:
: The degree of achievement of the goal
: The upper limit for the goal
: The lower limit for the goal
: Positive changes from the expected level
: Negative changes from the expected level
: The expected goal of the objective function
The general formulation of PFGP is described below:
Equation (21) represents the goal of the PFGP method for maximizing the degree of achievement of all goals. Constraints (21)–(28) represent the system constraints and represents the objective function o.
5.1. Solution Representation Scheme
In this research, meta-heuristic algorithms and MATLAB software (2020b) were used, but first, the way to reach a feasible solution in MATLAB software should be presented. In this research, in order to obtain an initial feasible solution, a solution representation scheme is used to determine the sequence of visits and assign customers to vehicles.
For this purpose, a random vector of numbers between zero and one with the dimension of
l is formed, where
l is the number of customers. Then, we multiply the resulting vector by
K (the number of EVs) and round the result upwards. In this way, customers are allocated to EVs. In order to determine the sequence of customers for each EV, random numbers between zero and one are sorted in ascending order in the initial vector, and customers are visited based on the sorting (
Figure 2,
Figure 3,
Figure 4,
Figure 5 and
Figure 6). For example, if we have 5 customers and 2 EVs, a random vector is formed as follows:
Figure 2.
An example of the solution chromosome.
Figure 2.
An example of the solution chromosome.
By multiplying the above vector by the number of EVs and rounding up, we will have:
Figure 3.
An example of the solution chromosome.
Figure 3.
An example of the solution chromosome.
Now, based on the sorting of the primary vector in ascending order, the order of their visits will be as follows:
The first vehicle: Customer 3, Customer 1, and Customer 4
The second vehicle: Customer 5 and Customer 2.
Considering that all the variables are related to the departure of EVs to the customers, if an EV has visited the customer, its needed goods are delivered and its demand is met.
Guaranteeing the solution feasibility
While updating chromosomes based on mutation and crossover operators, chromosomes may be generated whose values are infeasible and do not satisfy the model constraints. In order to resolve this issue, we use a penalty function in which the violation value is multiplied by a large number and added to the objective function values. This problem has one penalty function. The penalty function of this problem is the amount of violation of the maximum available time of EV (denoted by
r). According to this penalty function, we obtain the violation amount of the maximum available time of EV, multiply this amount by a large number, and add it to the objective function values. If the violation value is equal to zero, no penalty is considered, but if there is a violation, the value of a large number is added to the objective function values and causes their values to deteriorate.
In the above equation,
denotes the violation amount,
represents the time to return to the depot, and
r denotes the maximum available time for the EV
k. Based on the violation functions, the values of the objective functions are obtained by Equations (30)–(32):
Crossover operator
The arithmetic crossover is used to create new generations. In this type of crossover operator, if
is the first parent and
is the second parent, and each has
n members, a random vector
between
with the length of
n is formed, where
is the parameter. The first child
and the second child
are obtained by using Equations (33) and (34):
Suppose that the first and second parents are displayed as the following vectors (shown in
Figure 4):
Figure 4.
An example of a crossover operator.
Figure 4.
An example of a crossover operator.
Considering the value of 0.1 for parameter
δ, a random solution for the vector
is obtained as follows (depicted in
Figure 5):
Figure 5.
An example of a crossover operator.
Figure 5.
An example of a crossover operator.
Using the aforementioned equations, the first and second children are obtained as follows (displayed in
Figure 6):
Figure 6.
An example of crossover operator.
Figure 6.
An example of crossover operator.
Mutation operator
The mutation operators are used in the optimization algorithms for diversifying the solutions and preventing convergence to the local optimum. The mutation operators are applied to the solutions as well as to the best solution that has the highest fitness value. In each step, the best population or solution is saved as the solution. The mutation operator used in this research is that members are selected from each chromosome, and those members are added with a continuous random number between [−1, +1].
5.2. Multi-Objective Simulated Annealing (MOSA) Algorithm
The Simulated Annealing (SA) algorithm was introduced by Kirkpatrick et al. in 1983 [
67]. This probabilistic discrete meta-heuristic algorithm has been widely employed to solve numerous combinatorial optimization problems due to its simplicity and efficiency. The primary notion of SA originates from the metal annealing processes. The gradual and step-by-step annealing technique is employed by metallurgists to attain a state where solids are well-adjusted and energy is downgraded. The aim is to achieve the crystal size of the solid material at the highest level in the annealing process. In the SA algorithm, the temperature of the material is raised to the highest level, and then the temperature is gently lowered. The annealing process is simulated to obtain a globally minimum response. The SA algorithm begins with a random initial solution, and the temperature of the system is equal to the beginning temperature (T = T
0). At each iteration, a neighborhood solution is found. The value of the objective function of the consequent solution is compared to the objective function value of the present solution. If the new solution is better than the previous solution, it will be swapped with the previous solution; otherwise, it will be swapped with the previous solution with a probability computed by the Baltsman’s function, which avoids being trapped in a locally optimal solution. A sufficient number of iterations may be considered the stopping criterion for the SA algorithm. When the algorithm stopping criterion is satisfied, the system temperature diminishes. This procedure carries on until the SA algorithm gets to the endpoint [
68,
69,
70]. The extended version of this algorithm, known as Multi-Objective Simulated Annealing (MOSA), is exploited to solve optimization problems with conflicting objectives.
5.3. Multi-Objective Grey Wolf Optimization (MOGWO) Algorithm
The Grey Wolf Optimization (GWO) algorithm is a meta-heuristic algorithm inspired by the hierarchical structure and social behavior of hunting gray wolves. This population-based algorithm has a simple process and can easily be generalized to large-scale problems. The extension of this algorithm, known as Multi-Objective Grey Wolf Optimization (MOGWO), is used for solving optimization problems with conflicting objectives. MOGWO considers an external archive with a fixed size to store and retrieve Pareto optimal solutions. This archive is utilized to define social hierarchy and simulate the hunting behavior of gray wolves in multi-objective search spaces. For more details about this algorithm, we refer to Mirjalili et al. [
71].
5.4. Non-Dominated Sorting Genetic Algorithm II (NSGA-II)
The Genetic Algorithm (GA) as a population-based method was proposed by Deb et al. [
72]. The Non-dominated Sorting Genetic algorithm (NSGA-II) has been extensively applied to solve multi-objective optimization problems. For more details about NSGA-II, we refer to Goldberg and Holland [
73]. In this algorithm, chromosomes are used to present solutions. At each iteration, the next offspring is generated using the crossover and mutation operators, and then the obtained solutions are sorted regarding the non-domination theory. In order to maintain diversity in the population, a crowding distance measure is allocated to each member of the population. If the number of non-dominated solutions goes beyond the population size, the NSGA-II algorithm chooses the least crowded solutions according to the crowding distance measure and ignores the remaining non-dominated solutions. Therefore, both diversity and convergence of the Pareto solutions are assured [
74].
The fundamental steps of the NSGA-II algorithm can be described as follows:
Generating the initial population,
Calculating the fitness values (according to the objective functions),
Sorting non-dominated population and computing the swarm distance,
Implementing the crossover and mutation operators for generating the next population (offspring),
Combining the initial population with the next population generated by the crossover and mutation operators,
Swapping the initial population with the fittest (best) population members generated in the previous steps,
All the aforementioned steps are iterated to achieve the optimal conditions.
5.5. Teaching–Learning-Based Optimization (TLBO) Algorithm
The TLBO algorithm was inspired by the teacher’s educational influence on students’ output (learning) and is appropriate for engineering applications. A teacher is generally defined as someone with a higher level than the students and who is capable of sharing knowledge with them. Hence, at each iteration, the teacher is a solution with the best objective function value. However, the teacher may be different at any iteration. The teacher literally lectures the lessons and assesses the students based on their scores. For more information about the TLBO algorithm, we refer to Rao et al. [
75] and Rao et al. [
76].
5.6. Hybrid of Non-Dominated Sorting Genetic Algorithm (NSGA-II) and Teaching–Learning-Based Optimization (TLBO) Algorithms
The Teaching–Learning-Based Optimization (TLBO) algorithm includes two phases for solution search, leading to a more profound exploration of finding a globally optimal solution. The combination of TLBO and NSGA-II increases the diversification of solutions and eludes trapping in a local optimum. This hybrid algorithm begins with a random initial population. At each iteration, the NSGA-II crossover and mutation operators are first applied to the initial population to generate the next population; subsequently, the TLBO operators improve the obtained solutions. As a result, both intensification and diversification of the meta-heuristic algorithm enhance. The process of the proposed algorithm is shown in
Figure 7.
As depicted in
Figure 7, first the parameters of the NSGA-II and TLBO algorithms are defined and set, then the initial solutions for the given problem are generated by using the method described in the “Solution representation scheme” section. Subsequently, the operators of the two algorithms are implemented on the chromosomes, and the solutions are ranked. Finally, the set of Pareto solutions is obtained by using the non-dominated sorting function [
45,
46,
48,
50,
54,
77,
78].
5.7. Multi-Objective Particle Swarm Optimization (MOPSO) Algorithm
The particle swarm optimization (PSO) algorithm is one of the most important intelligent optimization algorithms in the field of swarm intelligence. This algorithm was inspired by the social behavior of animals such as fish and birds that live together in different groups [
43,
44,
46,
48,
52,
70,
71].
6. A Numerical Example for Validating the Proposed Model and Methodology
In order to validate the proposed mathematical programming model and compare the solutions of the proposed meta-heuristic algorithms, a numerical example was considered and solved by the FPGP exact method along with the MOSA, MOGWO, NSGA-II-TLBO, and MOPSO meta-heuristic algorithms. The data for the numerical example are given in
Table 1. According to
Table 1, this problem comprises 7 customers, 2 charging stations, and 2 heterogeneous vehicles.
The results of solving the mathematical model using the FPGP method are presented in
Table 2. According to
Table 2, improving the value of the economic (first) objective function leads to the deterioration of the value of the environmental (third) objective function, as well as the deterioration of the value of the social responsibility (third) objective function dealing with customer service time, which demonstrates the conflict between the objective functions.
Table 2 shows the range of epsilon that was achieved from the upper limit and lower limit of each objective. For example, the minimum value of the first objective function is 7125.3, and the maximum value of this objective function is 31,247.1. Also, the minimum value of the second objective function is 22.054, and the maximum value of this objective function is 41.08. In addition, the minimum value of the third objective function is 23, and the maximum value of this objective function is 117.
In the following, by choosing six ideals in the acceptable ranges for economic, environmental, and social goals, a three-dimensional Pareto front was obtained from the FPGP method, which is presented in
Table 3. To compare the Pareto front obtained from the FPGP method with the Pareto fronts obtained from the meta-heuristic algorithms, the aforementioned problem was implemented in MATLAB version 2021a software, and the results are presented in
Figure 8. The results show that the Pareto fronts obtained from all meta-heuristic algorithms are close to the Pareto fronts obtained from the FPGP exact method.
According to
Figure 9, to reduce the arrival time for customers and electricity consumption, higher costs are incurred. Similarly, if the focus of the decision-makers is on cost minimization, they should be indifferent to environmental and social objectives. This issue is always raised as one of the challenges in multi-criteria decision-making, and managers should make the best decision and select one of the Pareto solutions, considering the importance of the three sustainability goals to the company, which can affect the business future of the company.
The route of vehicles 1 and 2 and the arrival time for each customer at the selected Pareto point are depicted in
Figure 10. As seen in this figure, the order of meeting customers is based on soft and hard time windows, and the vehicle continued to meet customers as much as possible (as long as it has charge), and if necessary, it was recharged. The route of the first vehicle is [O-C4-C2-F1-O] and the route of the second vehicle is [O-C7-C3-C5-F1-C6-C1-F2-O] so that all the customers’ demands were met. It should be noted that
Figure 10 shows the moving path of the truck, the black path is for truck number 1 and the brown path is for truck number 2.
6.1. Comparison of the Proposed Methods for Solving the Mathematical Programming Model
In this section, 20 different problem examples (according to
Table 4) have been created to compare the performance of the proposed algorithms in terms of three indexes and solve them using the provided solution methods. These examples have different parameters, including the number of vehicles (2 to 15), the number of customer nodes (5, 10, 15, and 100), charging stations (1 to 10), and coordinates of points. The values of the problem parameters are generated by the random distribution functions shown in
Table 1 [
13].
The first three test problems were solved using the MOSA, MOGWO, MOPSO, and NSGA-II-TLBO meta-heuristic algorithms, as well as the FPGP exact method (the CPLEX solver). It should be noted that each test problem was run 30 times by each algorithm, and the average for each objective function value is reported. The findings are provided in
Table 5. As can be seen in
Table 5, the values of each objective function are close to the values obtained from the FPGP method (the CPLEX solver) as the exact method.
It can be concluded from
Table 5 that the values obtained by the meta-heuristic algorithms are close to the values obtained by the FPGP exact method (the CPLEX solver); hence, the proposed meta-heuristic algorithm is reliable for solving large-sized test problems.
6.2. The Criteria for the Performance Comparison of the Meta-Heuristic Algorithms
The performance of these three meta-heuristic algorithms is compared in this section. If a single-objective optimization problem (maximization problem) is in hand, it is clear that any feasible solution with a higher objective function value is better. However, in the case of MODM, the evaluation method is different. There are different ways to evaluate the performance of algorithms. One of these approaches is to fully investigate the solution space, obtain all the non-dominated points, and compare the solutions obtained by the algorithms with them. However, in practice, this is suitable only for small-sized problems, not for large-sized problems. To this end, the well-known indices for comparing the performance of multi-objective meta-heuristic algorithms are applied. Algorithm evaluation criteria are often divided into two categories. The first category deals with the convergence and quality of the solutions, and the second category focuses on the diversification of the solutions in the solution space. In this study, four indexes of CPU time, Mean Ideal Distance (MID), Multi Objective Coefficient of Variation (MOCV), and Hypervolume Indicator (HV) are employed to compare the performance of the MOGWO, MOSA, MOPSO, and NSGA-II-TLBO meta-heuristic algorithms [
79,
80].
Mean Ideal Distance (MID)
The MID index measures the distance between the Pareto solutions and the ideal point of those solutions using Equation (30). In this equation,
denotes the distance of the objective functions of the
it Pareto point from the objective functions of the ideal point. The lower the value of the MID index, the higher the performance of the algorithm [
81].
Multi Objective Coefficient of Variation (MOCV)
This index is actually calculated by dividing the Mean Ideal Distance (MID) index by the diversity index. It is clear that the lower values of this criterion are desirable for comparing meta-heuristic algorithms [
82].
Hypervolume Indicator (HV)
The HV indicator (or s-metric) is a performance metric for measuring the quality of a non-dominated approximation set, which was presented by Zitzler and Thiele in 1998 and described as the size of the space covered or the size of the dominated space [
83].
10. Conclusions
In this study, a sustainable multi-objective mathematical programming model was presented for the electric vehicle routing problem (EVRP), considering the soft and hard time windows as well as the limitations related to load calculation. Minimization of system costs, minimization of vehicle electricity consumption, and minimization of the vehicle’s arrival time to customers were considered as the economic, environmental, and social objective functions, respectively. The main goal of the proposed optimization model was to reach a logical balance between the three sustainability aspects. The proposed model was validated using the FPGP exact method and the CPLEX solver. Since the problem is NP-Hard, the MOGWO, MOSA, MOPSO, and NSGA-II-TLBO meta-heuristic algorithms were employed to solve the large-sized problems. To this end, several test problems were generated to solve the proposed model using the meta-heuristic algorithms.
The results show that decreasing costs leads to increased service time and electricity consumption. In addition, improving the value of the environmental objective function leads to incurring more costs. Also, increasing customer satisfaction requires a significant increase in costs.
Moreover, the results demonstrate that it is very important to consider time windows to provide better services to customers and avoid high penalty costs.
Finally, the performance of the meta-heuristic algorithms was compared based on four indexes: time, MID, MOCV, and HV. The results indicate that the MOSA algorithm performs better in terms of the time index, and this difference is meaningful. On the other hand, the NSGA-II-TLBO algorithm outperforms in terms of the MID, MOCV, and HV indexes. The following suggestions are made for further research:
Considering multiple depots in EVRP,
Considering the conditions of uncertainty to achieve results similar to reality,
Using artificial intelligence to predict system costs in the future,
Solving the mathematical model with exact methods such as the Augmented Epsilon Constraint (AEC) method.