A Hybrid Genetic Algorithm Based on Imitation Learning for the Airport Gate Assignment Problem
Abstract
:1. Introduction
- We transform the mathematical model of the gate assignment problem to a Markov decision process. The state space, action space, rewards, and state transition of the problem have been defined.
- We propose a hybrid genetic algorithm based on imitation learning (IL-GA) to solve the gate assignment problem. This algorithm includes two stages: In the first stage, a deep policy network is designed to learn the decision-making process of human allocation experts. The network is trained by historical flight assignment trajectories and outputs the gate selection probability of flights. Considering that it is impossible to obtain a better solution only by learning the decision-making trajectory of human experts, a genetic algorithm is used in the second stage in order to optimize the number of flights assigned to gates. At the same time, the optimal solution can be provided to the policy network as dataset aggregation for further training.
- We use real airport data from Lijiang Airport in Yunnan, China, to verify the rationality and effectiveness of the model and algorithm. The experimental results show that the genetic algorithm combined with imitation learning can not only better meet the airport gate allocation preference but also help to provide a good initial population, shorten the iteration time, and accelerate the convergence of the algorithm to better solution.
2. Literature Review
3. Model Construction
3.1. Mathematical Model of AGAP
- Flight pair
- 2.
- Contact gates
- 3.
- Remote gates
- 4.
- Gate preference
3.1.1. Constraints
- One aircraft (flight pair) can only park at one gate.
- The aircraft type and international/domestic attribute must be within the allowable range of gates.
- Two consecutive aircraft assigned to the same gate should add a necessary safety time interval between them. For the convenience of calculation, the conflict relation matrix of any two flights is calculated in advance according to the arrival and departure times. If the interval between flight and flight is less than the minimum safety time interval , ; otherwise, it is 0. Then, this constraint can be expressed as follows:
- Two flights assigned to adjacent gates cannot arrive or leave at the same time. Because the distance between adjacent gates is close, sliding in or pushing out at the same time can easily cause wing scratching. Similarly, a conflict matrix is generated in advance to indicate that the arrival or departure time of two flights does not meet the safety interval . Then, the constraint can be expressed as Equation (6):
3.1.2. Objective Function
- Maximize the number of flight pairs assigned to contact gates
- 2.
- Maximize the total gate preferences
3.2. Markov Decision Process of AGAP
- State Space
- is the coding set of flight attributes, including aircraft type, international/domestic type, VIP type, and overnight type. We use the one-hot code method [34] to represent the attributes of flights and then splice them together.
- is the gate occupancy time of the flight. We divide 24 h into 288 bits every 5 min. The bits between the arrival and departure time of the flight are set to 1; otherwise, it is 0.
- is the coding set of gate status for a current flight. We use the constraints to filter out the available gates and assign 1 to indicate availability; otherwise, 0 is assigned.
- 2.
- Action Space
- 3.
- Reward
- 4.
- State Transition
4. A Hybrid Genetic Algorithm Based on Imitation Learning for AGAP
4.1. General Scheme
4.2. Imitation Learning Stage
4.3. Genetic Algorithm Stage
- Chromosome coding
- 2.
- Generation of the initial population
Algorithm 1: Generation of the initial population for AGAP |
Input: An instance of flights waiting for assignment; Gate list ; Policy network ; Greedy probability ; Population size . Process: //The set for population for do //A single chromosome for do //Calculate gate selection probability if then Randomly add a gate to according to else Add a gate with to end if end for end for |
Output: Initial population |
- 3.
- Selection, crossover, and mutation operators
4.4. Data Aggregation
- Training the policy network through the human allocation trajectory dataset ;
- Using to solve new instances and save assignment results as dataset ;
- Optimizing the result of each instance in through genetic algorithm;
- Data aggregation: ;
- Return to the first step.
5. Experiment
5.1. Experimental Data and Analysis
5.2. Policy Network Construction and Imitation Learning
5.3. Genetic Algorithm Stage
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Daş, G.S.; Gzara, F.; Stützle, T. A review on airport gate assignment problems: Single versus multi objective approaches. Omega 2020, 92, 102146. [Google Scholar] [CrossRef]
- Guépet, J.; Acuna-Agost, R.; Briant, O.; Gayon, J.P. Exact and heuristic approaches to the airport stand allocation problem. Eur. J. Oper. Res. 2015, 246, 597–608. [Google Scholar] [CrossRef]
- Marinelli, M.; Dell’Orco, M.; Sassanelli, D. A Metaheuristic Approach to Solve the Flight Gate Assignment Problem. Transp. Res. Procedia 2015, 5, 211–220. [Google Scholar] [CrossRef]
- Zhou, Y.; Fu, R.; Wang, C.; Zhang, R. Modeling Car-Following Behaviors and Driving Styles with Generative Adversarial Imitation Learning. Sensors 2020, 20, 5034. [Google Scholar] [CrossRef] [PubMed]
- Lioutikov, R.; Neumann, G.; Maeda, G.; Peters, J. Learning movement primitive libraries through probabilistic segmentation. Int. J. Robot. Res. 2017, 36, 879–894. [Google Scholar] [CrossRef]
- Shou, Z.; Di, X.; Ye, J.; Zhu, H.; Zhang, H.; Hampshire, R. Optimal passenger-seeking policies on E-hailing platforms using Markov decision process and imitation learning. Transp. Res. Part C Emerg. Technol. 2020, 111, 91–113. [Google Scholar] [CrossRef]
- Srihari, K.; Muthukrishnan, R. An expert system methodology for aircraft-gate assignment. Comput. Ind. Eng. 1991, 21, 101–105. [Google Scholar] [CrossRef]
- Cheng, Y. A knowledge-based airport gate assignment system integrated with mathematical programming. Comput. Ind. Eng. 1997, 32, 837–852. [Google Scholar] [CrossRef]
- Geoffrey, D. Gosling. Design of an expert system for aircraft gate assignment. Transp. Res. Part A Gen. 1990, 24, 59–69. [Google Scholar]
- Babic, O.; Teodorović, D.; Tošić, V. Aircraft Stand Assignment to Minimize Walking. J. Transp. Eng. 1984, 110, 55–66. [Google Scholar] [CrossRef]
- Yan, S.; Tang, C.H. A heuristic approach for airport gate assignments for stochastic flight delays. Eur. J. Oper. Res. 2007, 180, 547–567. [Google Scholar] [CrossRef]
- Kim, S.H.; Feron, E.; Clarke, J.P. Gate assignment to minimize passenger transit time and aircraft taxi time. J. Guid. Control Dyn. 2013, 36, 467–475. [Google Scholar] [CrossRef]
- Bi, J.; Wu, Z.; Wang, L.; Xie, D.; Zhao, X. A Tabu Search-Based Algorithm for Airport Gate Assignment: A Case Study in Kunming, China. J. Adv. Transp. 2020, 2020, 8835201. [Google Scholar] [CrossRef]
- Kaliszewski, I.; Miroforidis, J.; Stańczak, J. Multiobjective optimization in the Airport Gate Assignment Problem, exact versus evolutionary multiobjective optimization. Comput. Sci. 2017, 18, 41–52. [Google Scholar] [CrossRef]
- Deng, W.; Zhao, H.; Yang, X.; Li, D.; Li, Y.; Liu, J. Research on a robust multi-objective optimization model of gate assignment for hub airport. Transp. Lett. Int. J. Transp. Res. 2018, 10, 229–241. [Google Scholar] [CrossRef]
- Tan, C.; He, J. Robust Airport Gate Assignment Based on the Analysis of Flight Arrival Time. Math. Probl. Eng. 2021, 2021, 6693127. [Google Scholar] [CrossRef]
- Maharjan, B.; Matis, T.I. Multi-commodity flow network model of the flight gate assignment problem. Comput. Ind. Eng. 2012, 63, 1135–1144. [Google Scholar] [CrossRef]
- Xiao, M.; Chien, S.; Schonfeld, P.; Hu, D. Optimizing flight equencing and gate assignment considering terminal configuration and walking time. J. Air Transp. Manag. 2020, 86, 101816. [Google Scholar] [CrossRef]
- Zhang, D.; Klabjan, D. Optimization for gate re-assignment. Transp. Res. Part B Methodol. 2017, 95, 260–284. [Google Scholar] [CrossRef]
- Aktel, A.; Yagmahan, B.; Özcan, T.; Yenisey, M.M.; Sansarcı, E. The comparison of the metaheuristic algorithms performances on airport gate assignment problem. Transp. Res. Procedia 2017, 22, 469–478. [Google Scholar] [CrossRef]
- Behrends, J.A.; Usher, J.M. Aircraft Gate Assignment: Using a Deterministic Approach for Integrating Freight Movement and Aircraft Taxiing. Comput. Ind. Eng. 2016, 102, 44–57. [Google Scholar] [CrossRef]
- Bi, J.; Wang, F.; Ding, C.; Xie, D.; Zhao, X. The airport gate assignment problem: A Branch-and-Price Approach for improving utilization of jetways. Comput. Ind. Eng. 2022, 164, 107878. [Google Scholar] [CrossRef]
- Şeker, M.; Noyan, N. Stochastic optimization models for the airport gate assignment problem. Transp. Res. Part E-Logist. Transp. Rev. 2012, 48, 438–459. [Google Scholar] [CrossRef]
- Yu, C.; Lau, H. Airport Gate Reassignment Based on the Optimization of Transfer Passenger Connections. J. Traffic Logist. Eng. 2015, 3, 25–30. [Google Scholar] [CrossRef]
- Deng, W.; Zhao, H.; Zou, L.; Li, G.; Yang, X.; Wu, D. Study on an improved adaptive PSO algorithm for solving multi-objective gate assignment. Appl. Soft Comput. 2017, 59, 288–302. [Google Scholar] [CrossRef]
- Deng, W.; Sun, M.; Zhao, H.; Li, B.; Wang, C. Study on an airport gate assignment method based on improved ACO algorithm. Kybernetes 2018, 47, 20–43. [Google Scholar] [CrossRef]
- Stollenwerk, T.; Lobe, E.; Jung, M. Flight Gate Assignment with a Quantum Annealer; Springer International Publishing: Cham, Switzerland, 2019; pp. 99–110. [Google Scholar]
- Aoun, O.; El Afia, A. Using Markov decision processes to solve stochastic gate assignment problem. In Proceedings of the 2014 International Conference on Logistics Operations Management (GOL 2014), Rabat, Morocco, 5–7 June 2014; IEEE: Piscataway, NJ, USA, 2014; pp. 42–47. [Google Scholar]
- Aoun, O.; El Afia, A. Application of multi-agent Markov decision processes to gate assignment problem. In Proceedings of the 2014 Third IEEE International Colloquium in Information Science and Technology, Tetouan, Morocco, 20–22 October 2014; IEEE: Piscataway, NJ, USA, 2014; pp. 196–201. [Google Scholar]
- Oussama, A.O.; Abdellatif, E.L. Time-Dependence in Multi-Agent MDP Applied to Gate Assignment Problem. Int. J. Adv. Comput. Sci. Appl. 2018, 9, 331–340. [Google Scholar]
- Bouras, A.; Ghaleb, M.A.; Suryahatmaja, U.S.; Salem, A.M. The airport gate assignment problem: A survey. Sci. World J. 2014, 2014, 923859. [Google Scholar] [CrossRef]
- Wang, H.; Luo, Y.; Shi, Z. Real-Time Gate Reassignment Based on Flight Delay Feature in Hub Airport. Math. Probl. Eng. 2013, 2013, 646241. [Google Scholar] [CrossRef]
- Maia, T.V. Reinforcement learning, conditioning, and the brain: Successes and challenges. Cogn. Affect. Behav. Neurosci. 2009, 9, 343–364. [Google Scholar] [CrossRef]
- Klimo, M.; Lukáč, P.; Tarábek, P. Deep Neural Networks Classification via Binary Error-Detecting Output Codes. Appl. Sci. 2021, 11, 3563. [Google Scholar] [CrossRef]
- Bolat, A. Models and a Genetic Algorithm for Static Aircraft-Gate Assignment Problem. J. Oper. Res. Soc. 2001, 52, 1107–1120. [Google Scholar] [CrossRef]
- Xie, J.; He, Z.; Zhu, Y. A DRL based cooperative approach for parking space allocation in an automated valet parking system. Appl. Intell. 2022, 53, 5368–5387. [Google Scholar] [CrossRef]
- James, J.Q.; Yu, W.; Gu, J. Online Vehicle Routing with Neural Combinatorial Optimization and Deep Reinforcement Learning. IEEE Trans. Intell. Transp. Syst. 2019, 20, 3806–3817. [Google Scholar]
- Maher, M.; Liu, R.; Dong, N. Signal optimisation using the cross entropy method. Transp. Res. Part C 2013, 27, 76–88. [Google Scholar] [CrossRef]
- Liang, B.; Li, Y.; Bi, J.; Ding, C.; Zhao, X. An Improved Adaptive Parallel Genetic Algorithm for the Airport Gate Assignment Problem. J. Adv. Transp. 2020, 2020, 8880390. [Google Scholar] [CrossRef]
- Daftry, S.; Bagnell, J.A.; Hebert, M. Learning Transferable Policies for Monocular Reactive MAV Control. In Proceedings of the International Symposium on Experimental Robotics, Nagasaki, Japan, 3–8 October 2016; Springer: Cham, Switzerland, 2016. [Google Scholar]
- Macedo, M.; Siqueira, H.; Figueiredo, E.; Santana, C.; Lira, R.C.; Gokhale, A.; Bastos-Filho, C. Overview on Binary Optimization Using Swarm-Inspired Algorithms. IEEE Access 2021, 9, 149814–149858. [Google Scholar] [CrossRef]
Parameters | |
---|---|
The set of flight pairs, the total number is | |
The set of gates, the total number is | |
The arrival time of flight pair , | |
The departure time of flight pair , | |
The aircraft type of flight pair | |
The international/domestic type of flight pair | |
The VIP type of flight pair | |
The overnight type of flight pair | |
The international/domestic type of gate , | |
Whether gate is a contact gate | |
Types of aircraft allowed to park in gate | |
The preference score for flight pair and gate | |
The minimum safety time interval of the same gate | |
The minimum safety time interval between two neighbor gates | |
Decision variable | |
0–1 decision variable, |
Gate. No | Gate Type | Nation Type | Aircraft Types |
---|---|---|---|
1~2 | remote gate | domestic | B733-B738/A319/A320 |
3 | contact gate | international | B733-B738/A319/A320 |
4 | contact gate | domestic | Business aircraft |
5 | contact gate | international | B733-B738/A319/A320 |
6 | remote gate | domestic | B733-B738/A319/A320 |
7 | contact gate | domestic | B737 |
8–10 | contact gate | domestic | B733-B738/A319/A320 |
12–14 | remote gate | domestic | B733-B738/A319/A320/A321/B763 |
15–16 | contact gate | domestic | B733-B738/A319/A320 |
17–19 | remote gate | domestic/international | B733-B738/A319/A320 |
20–28 | remote gate | domestic | B733-B738/A319/A320 |
Flight. No | Aircraft Type | Arrival Time | Departure Time | VIP | Overnight | Nation | Assigned Gate |
---|---|---|---|---|---|---|---|
1 | A319 | 20:01 | 06:55 (+1) | N | Y | D | 14 |
2 | A320 | 23:31 | 09:47 (+1) | N | Y | D | 3 |
3 | A320 | 23:48 | 07:42 (+1) | N | Y | D | 10 |
4 | A319 | 00:12 | 09:10 | N | Y | D | 17 |
5 | B737 | 00:43 | 08:54 | N | Y | D | 7 |
6 | B737 | 07:05 | 11:24 | N | N | D | 1 |
7 | A320 | 07:28 | 08:35 | Y | N | D | 15 |
8 | B737 | 08:13 | 09:20 | N | N | D | 11 |
9 | B737 | 09:22 | 10:29 | N | N | D | 9 |
10 | A320 | 09:56 | 11:39 | N | N | D | 12 |
11 | A320 | 10:17 | 11:28 | N | N | I | 3 |
12 | A320 | 10:33 | 14:20 | N | N | D | 5 |
… | … | … | … | … | … | ... | … |
14,000 | A320 | 15:07 | 07:16 (+1) | N | Y | D | 25 |
Layer | Settings |
---|---|
Input layer | Number of units = 313 |
Hidden layer 1 | Number of units = 512, activation function = ReLU |
Hidden layer 2 | Number of units = 512, activation function = ReLU |
Output layer | Number of units = 28 |
Softmax layer | Number of units = 28 |
Parameter | Value |
---|---|
Epochs | 200 |
Batch size | 64 |
Learning rate | 0.01 |
Flight. No | Arrive Time | Departure Time | Predict Gates | Actual Gate |
---|---|---|---|---|
1 | 15:07 | 07:16 (+1) | (25, 24, 27, 19, 1) | 25 |
2 | 20:04 | 08:02 (+1) | (2, 1, 17, 14, 13) | 2 |
3 | 20:10 | 07:01 (+1) | (14, 13, 17, 12, 2) | 17 |
4 | 07:15 | 08:36 | (16, 8, 7, 15, 5) | 5 |
5 | 07:52 | 09:05 | (8, 11, 10, 16, 15) | 8 |
6 | 08:08 | 10:05 | (11, 15, 9, 12, 14) | 12 |
7 | 08:34 | 09:41 | (9, 8, 15, 16, 11) | 15 |
8 | 09:04 | 11:36 | (17, 12, 6, 1, 2) | 2 |
9 | 09:13 | 10:20 | (16, 10, 15, 9, 11) | 11 |
10 | 09:35 | 10:42 | (5, 10, 8, 16, 9) | 9 |
… | … | … | … | … |
Parameter | Value |
---|---|
Population size | 200 |
Iterations | 200 |
Greedy principle | 0.3~0.6 |
Number of gates selected by the policy network | 5, 10 |
Minimum safety time interval of the same gate | 10 min |
Minimum safety time interval between two neighbor gates | 5 min |
Instance | Flights | ε | GP | Flights Assigned to Contact Gate | |||
---|---|---|---|---|---|---|---|
Gurobi | IL-GA | GA | Policy Network | ||||
1 | 81 | 0.3 | 5 | 70 | 69 | 66 | 61 |
2 | 88 | 0.3 | 10 | 72 | 72 | 67 | 63 |
3 | 85 | 0.4 | 5 | 73 | 73 | 68 | 58 |
4 | 82 | 0.4 | 10 | 70 | 69 | 66 | 53 |
5 | 84 | 0.5 | 5 | 68 | 65 | 63 | 54 |
6 | 75 | 0.5 | 10 | 60 | 59 | 56 | 50 |
7 | 83 | 0.6 | 5 | 69 | 67 | 64 | 55 |
8 | 74 | 0.6 | 10 | 62 | 58 | 56 | 48 |
Average flight rate assigned to contact gates | 83.4% | 81.5% | 77.5% | 67.7% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Ding, C.; Bi, J.; Wang, Y. A Hybrid Genetic Algorithm Based on Imitation Learning for the Airport Gate Assignment Problem. Entropy 2023, 25, 565. https://doi.org/10.3390/e25040565
Ding C, Bi J, Wang Y. A Hybrid Genetic Algorithm Based on Imitation Learning for the Airport Gate Assignment Problem. Entropy. 2023; 25(4):565. https://doi.org/10.3390/e25040565
Chicago/Turabian StyleDing, Cong, Jun Bi, and Yongxing Wang. 2023. "A Hybrid Genetic Algorithm Based on Imitation Learning for the Airport Gate Assignment Problem" Entropy 25, no. 4: 565. https://doi.org/10.3390/e25040565
APA StyleDing, C., Bi, J., & Wang, Y. (2023). A Hybrid Genetic Algorithm Based on Imitation Learning for the Airport Gate Assignment Problem. Entropy, 25(4), 565. https://doi.org/10.3390/e25040565