1. Introduction
The air transport system is formed of many highly interconnected components. In such a complex system, delays and inefficiencies arise for many different reasons, such as tight aircraft and crew rotations or imbalances between demand and capacity of the airspace. The constant growth in air traffic in recent years has exacerbated the situation, making the efficient management of air traffic a necessary and challenging endeavour. Although the COVID-19 pandemic has dramatically interrupted this growth (at the time of writing, in Europe, the traffic level is reduced by −65% with respect to 2019), a recovery is expected (+3% in 2024 with respect to 2019, according to some optimistic scenarios [
1]). Therefore, the inefficiencies and problems of the past will certainly recur if not adequately addressed. In Europe, the most relevant cause of delay are the imbalances between capacity and demand [
2], with an average cost of €105 per minute in 2019 [
3]. These imbalances, on top of causing delays and consequent costs for air transportation stakeholders, are also associated with overall network inefficiency. The latter is a relevant issue for the sustainability of the aviation system, as it generates unnecessary fuel burn and emissions due to longer trajectories travelled by flights. For example, it is well known that in case of congestion in the airspace, one of the very few options currently available to airlines is to have the aircraft make a detour around the congested area, travelling a longer distance. As a measure of the size of these issues, in 2017 there was an additional 5.8% gate-to-gate CO
emissions at the European level due to the difference between the actual trajectories of flights and their unimpeded trajectories [
4]. Increasing the efficiency of air transport management is therefore one of the directions to take to improve the sustainability of aviation. Longer-than-necessary routes are also chosen to minimise route costs, by avoiding to overfly countries with high charges per unit distance (unit rate) [
5], for instance Sweden [
6]. For this reason, pricing mechanisms should not only be designed to alleviate congestion [
7], but also take into account environmental impacts [
8].
Over the last few decades, various mathematical models have been proposed to improve the efficiency of air traffic management and mitigate the effects of congestion. Ideally, these models should be applied to one entire day of air traffic. In fact, different days are roughly independent from each other, because all delays and congestion issues are buffered by the night hours, when a small number of flights are operated. Nevertheless, the air traffic network of an entire day contains a large number of elements with many interactions of various types. Before the pandemic, more than 35,000 flights could be flown in Europe in one day. For instances of this size, the models become very computationally demanding to solve. Consequently, previous studies have typically considered problems restricted to one or a few countries and to a short period of time, e.g., a few hours.
Focusing on some of the most influential and recent papers concerning the management of demand-capacity balance, we observe that computational experiments have always been carried out on instances that range from 1000 to a maximum of 6500 flights, see, e.g., Bertsimas et al. [
9], Sherali et al. [
10], Castelli et al. [
11], Agustín et al. [
12], Barnhart et al. [
13], Bertsimas and Gupta [
14], Xu et al. [
15], Xu et al. [
16]. All these studies consider the tactical phase of flight planning, where many details must be taken into consideration, such as the entry time in a sector and its capacity, or the aircraft speed. For instance, Bertsimas et al. [
9] present a model that “covers all the phases of each flight—i.e., takeoff, en route cruising, and landing—and solves for an optimal combination of flow management actions, including ground-holding, rerouting, speed control, and airborne holding on a flight-by-flight basis”. Not surprisingly, the mathematical model that tries to solve these problems is indeed computationally demanding.
A similar situation occurs in some of the few models that address the flight planning at a strategic level as in Bolić et al. [
7], Ivanov et al. [
17], and Starita et al. [
18]. Also in these cases, the complexity of the mathematical formulations requires that the dimensions of the problem instances be limited. As an example, the pricing mechanism introduced in Bolić et al. [
7] could only be tested on 2400 flights crossing the French airspace in a four-hour interval.
All these previous applications of models to a subset of the flights of one day have the inherent weakness of establishing arbitrarily the subset to consider. No preliminary analysis of the level of interactions between the analysed sub-network with the rest of the network is performed. Restricting a model to a subset of flights leads unavoidably to approximations or simplifications, such as the need to exclude from the analysis flights whose operations start before or end after the interval considered. It follows, for example, that demand or residual airspace capacity values may be under or overestimated. Due to these approximations, the solution of the model on the restricted subset is different from the solution that would be found for the flights of the subset by solving the model on the entire day. Additionally, there is no control on how different these solutions are, and no element to assess if the approximation is acceptable.
This work proposes an approach to overcome these issues. First, it identifies a criterion to divide the set of flights of a day into subsets such that the approximations made when solving the model independently on one subset are reduced to a minimum. This criterion provides a rationale according to which it is appropriate to apply the aforementioned models to a subset of flights rather than to another. Additionally, it goes a step further, as it proposes and analyses a strategy to reduce the resolution time of large instances of a model, at the price of obtaining an approximate solution. This study aims to show that this solution is acceptable, when compared to the exact one, and, therefore, that the strategy is viable. A way to reduce the resolution time that also preserves the quality of the solution is necessary to make mathematical models for flight planning a tool that can actually be used in practice.
Figure 1 summarises our approach. The proposed strategy to solve models on a large set of flights (
Figure 1A) is to partition them into smaller subsets, or clusters (
Figure 1C), and solve the model independently on each one. In this way, the resolution time is reduced both because the solving time of these models increases faster than linearly with the number of flights to optimise simultaneously, and because the solution of the model can be parallelised (
Figure 1D). This approach, however, can yield an acceptable solution only if flights belonging to different subsets have a weak influence on each other for the optimisation process. Otherwise, when the solutions found on the different subsets are aggregated, violations of some constraints may arise (
Figure 1D). The trajectories of two flights do not overlap if they do not cross the same sector of the airspace, or they cross it at different times (
Figure 1B). Flights with non overlapping trajectories do not influence each other directly in the optimisation, as their solutions affect the occupation of different sectors. We can say that two such flights do not “interact”. However, the network of interactions between flights is quite complex, and there is no obvious or immediate way to partition the flights in weakly interacting sets. Therefore, we propose to resort to community detection algorithms to determine weakly interacting clusters of flights. The use of complex network methods in air traffic management has been used in several contexts [
19,
20,
21,
22,
23,
24]. For example, community detection algorithms have been applied to the networks of airports, sectors, or navigation points to investigate their structure [
22,
23].
As a test case, we consider the whole European air traffic volume of a single day and test the benefits of partitioning flights against the results of an integer programming model, named SAIPE [
25]. SAIPE balances demand and capacity through a
strategic redistribution of flights that respects nominal sector capacities. The model solves large-scale instances that include around 30,000 flights in approximately 300 s on standard software and hardware equipment (64 bit Intel
® Xeon
® W-2145 CPU @ 3.70 GHz 16 core CPU computer, with 32 GB of RAM memory and Ubuntu 18.04 operating system). To the best of our knowledge, SAIPE is the only model that is computationally able to address demand–capacity imbalances for an entire day of the European airspace. For this reason, we use it as a benchmark.
In order to obtain the desired flight partition, we build a network where the nodes represent flights and a link is present between two nodes if the corresponding flights are likely to influence each other in the flight planning optimisation. This depends on whether two flights’ possible trajectories overlap and to what extent. We test different choices of conditions on the overlap necessary to have a link, yielding different networks (see
Materials & Methods Section). We apply existing community detection algorithms to the obtained networks, showing that they find clusters of flights that are differentiated spatially and/or temporally (see Results). Finally, we run the SAIPE model on each cluster and compare the aggregate solution with the exact one in terms of resolution time and constraints’ violations.
In summary, the main contributions of this paper are as follows:
- -
we propose and compare different ways of defining flight interactions, which lead to different networks representing one day of flights across Europe;
- -
we show that it is possible to divide the set of all European flights of one entire day into components that are temporally and/or spatially distinct, such that different components interact weakly;
- -
we verify, for a relevant problem of strategic flight planning optimisation, that solving the problem on smaller components and then aggregating the solutions can be significantly faster than solving the whole network, at the only expense of a slight deterioration of the optimal solution.
3. Results and Discussion
The application of each scenario, consisting of a method to build the network and an algorithm to cluster it, yields the partition of a network in a set of clusters. The first two sections of the results are devoted to analysing the results of this procedure. In
Section 3.1, we compare the results of the application of different scenarios in terms of two important aspects: the run time needed to perform the clustering, and the number of clusters obtained. We also check if the clusters obtained are similar across different runs of the same scenario, and across different scenarios. In
Section 3.2, we analyse the clusters obtained, to understand what characterises these groups of weakly interacting flights. Then, applying the SAIPE optimisation to each partitioned network yields an optimised strategic plan.
Section 3.3 and
Section 3.4 analyse the output of the optimisation in terms of resolution time and quality of the solution.
3.1. Comparison of Results of Community Detection in Different Scenarios
The Infomap algorithm has smaller run times with respect to the Louvain algorithm (see
Figure 2A, the scale is logarithmic). Both algorithms are significantly faster on networks 1 and 2 than on networks 3 and 4, due to the smaller number of links. The combination of Infomap on network 2 yields the shortest run time, 8.4 s on average over 30 runs. Run times are consistent, in terms of order of magnitude, across runs. This can be seen from the limited height of the boxes in
Figure 2A.
The number of clusters detected in each scenario is also quite consistent across repetitions (see
Figure 2B). An exception is Infomap on Network 4, where around a third of the runs yield only 3 clusters, while the others yield 14–18 clusters.
The number of clusters depends both on the algorithm and on the underlying network. Infomap on networks 1 and 2 finds a smaller number of clusters (typically 3 and 4, respectively) if compared to the same algorithm on the denser networks, while for the Louvain algorithm the converse is true. Therefore, we cannot draw a general conclusion on the relationship between the network-building method and the number of clusters found.
Both clustering algorithms typically produce consistent results when they are run on a given network several times. In fact, the similarity of the network partitions obtained across different runs of the same scenario is high for most scenarios, as measured both by the Rand Index and the Mutual Information (
Figure 3A). The most consistent scenario is Infomap on network 2. At the other extreme, Infomap on network 4 produces very variable results. This is consistent with what is observed in
Figure 2B, i.e., that different runs of Infomap on network 4 produce partitions with a different number of clusters. Partitions obtained with different runs of the same scenario are more similar to each other than those obtained with different scenarios (
Figure 3B). The outlier in panel B (blue circle) is, again, Infomap on network 4, with low similarity across runs. The greater similarity of clustering results across runs with respect to those across scenarios suggests that each scenario produces a partition that is, to a certain extent, characteristic of that scenario. In other words, each scenario partitions the network differently. Nevertheless, there are some similarities among the partitions obtained with different scenarios, according to both indexes. Namely, the results on networks 3 and 4 with different methods are similar (
Figure 3C), and the Louvain algorithm finds similar results on networks 1 and 2 (
Figure 3D).
3.2. Temporal and Spatial Analysis of Clusters
To characterise the results of the partitioning algorithm, we looked at the temporal and spatial properties of the obtained clusters. Specifically, we looked at the distribution of departure times and at the spatial arrangement of the routes of flights contained in each cluster. We found that the clusters have a clear temporal or spatial characterisation. For example, in
Figure 4A, we observe that the Infomap algorithm on network 1 finds two clusters with well-separated departure times. The two clusters are spatially overlapping (panel B). The Louvain algorithm on network 3, instead, finds clusters that have some temporal overlap (panel C). However, clusters that overlap temporally are spatially distinct. In fact, in panel D we see that the two clusters that both contain flights departing around 10 a.m. (yellow and dark violet) have different geographical positions. Similarly, the cluster that is temporally distributed along the entire afternoon and overlaps with two other clusters, is very specific geographically, being localised in the Scandinavian area. Results for other algorithms are in
Supplementary Figures S2 and S3.
Therefore, we conclude that both community detection algorithms succeed in finding clusters of flights that have in common the time of the day, the geographical position, or both. The fineness of this separation depends on the algorithms and on the network. For example, the temporal division performed by the Infomap algorithm on network 1 (panel A) is rougher than the one performed by the Louvain algorithm on network 3 (panel C). The latter isolates four different time periods: early morning, late morning, early afternoon, and late afternoon.
3.3. A Trade-Off between the Number of Capacity Violations and the Resolution Time
We apply the SAIPE optimisation to each partitioned network obtained with the different scenarios. The optimisation is run in parallel on the clusters. For each run, we obtain an optimised strategic plan for each cluster, and by aggregating over clusters we obtain the global strategic plan for all flights.
The advantage that we expected in optimising independently the clusters was to have a reduced resolution time. Indeed, we find that the resolution time of the SAIPE decreases with the number of clusters (
Figure 5A). This is expected because solving in parallel more, but smaller, clusters is faster than solving a smaller number of larger clusters. For any number of clusters, the resolution time is shorter than the time needed to find the global exact solution on the unpartitioned network, which is roughly 260s.
The advantage of a reduced resolution time must be evaluated in parallel to the disadvantage of obtaining an approximate global solution. To quantify the quality of this global solution, we measure the number and size of the violations of capacity constraints. Given a global solution, i.e., a strategic flight plan, we compare the sectors-hour occupation with the nominal capacities, to assess constraint violations. We call the violations computed according to the strategic flight plan strategic violations.
First, we look at the
number of capacity violations, i.e., the number of sectors where the nominal capacity is exceeded. The number of violations tends to increase with the numbers of clusters (
Figure 5B). This pattern means that when the network is partitioned in more clusters, it happens more often that two interacting flights are in two distinct clusters, and are therefore optimised independently.
There is therefore a trade-off between the number of violations, lower when there are few clusters, and the resolution time, lower when there are many clusters.
Figure 5C shows that in the present case, a good trade-off is represented by Infomap on network 1. This scenario in fact reduces the resolution time by roughly one-third with respect to time needed to run the SAIPE on the entire network (baseline), at the cost of a small number of violations. Further reducing the resolution time comes at a higher relative cost in terms of violations.
The particular community detection method yielding the best solution might be specific to the problem. However, we can draw the general conclusion that partitioning flights in small number of clusters (3–5) represents a good compromise between reducing the resolution time and keeping the number of violations low.
We also compute the
size of the violations, i.e., the number of flights exceeding the capacity, in absolute and in percentage. The mean and variance of the violation size do not show a dependence on the number of clusters and are similar for all scenarios (see
Figure 6, blue bars). Infomap on network 2 shows the lowest mean violation size, both absolute and percentage.
3.4. Comparison with Actual Capacity Violations
To understand if the violations found in the approximate solutions produced by our approach are acceptable, we compare them with the violations that actually occurred on 1 September 2017. We find that the strategic violations are in all cases smaller in number and size with respect to the capacity violations that actually occurred on 1 September 2017 (represented as dotted lines in
Figure 6). As explained in the Methods section (
Section 2.6), a fairer comparison is that with the tactical violations that we estimate would arise from each strategic plan due to ATFM delays. These estimated tactical violations are more, in number, with respect to the strategic ones, but smaller on average (both in absolute value and in percentage) (see
Figure 6, red bars). Importantly, they remain in all cases smaller in number and size with respect to the occurred violations. We can therefore conclude that, for all tested scenarios, the capacity violations produced by our approach are not a cause for concern.
4. Conclusions
The need for an air traffic system that is sustainable both from an environmental and an economic point of view requires the availability of mathematical and/or simulation models that allow to solve traffic management problems in a reasonable amount of time.
A classic approach to solving problems associated with complex systems consists of finding decompositions of the system in smaller subsystems that interact weakly with each other. In this work, we apply this approach to the set of flights, aiming to identify subsets of flights that interact as little as possible with other subgroups, where interactions are defined within the context of the specific problem at hand. Such decomposition allows to solve the problem independently on each of the subsystems. The resolution can thus be parallelised, with obvious computational advantages.
We partition the set of flights by applying community detection algorithms. To this end, the system must be appropriately formalised as a network, whose links represent the interactions that should be minimised between subsystems. We propose a method to build this network, with four possible variations, specifically for the problem of mitigating imbalances between capacity and demand. The methods use information available at the time of strategic flight planning on flights routes and departure times to identify which pairs of flights might interact during the resolution of the problem. The exact result of this partitioning depends on the method chosen to build the network and the clustering algorithm. However, analysing the clusters of flights that we obtained applying community detection algorithms to each of the networks, we found out that these clusters are temporally and/or spatially differentiated. This indicates that the minimisation of between-clusters interaction sought by the algorithms is consistently realised by gathering flights by temporal or spatial position.
We test four alternative methods to build the network and two clustering algorithms, which produce different results in terms of quality of the solution and computational time. We can use these differences to identify the most suited methodological choices. In the context of the present application, Infomap on network 1 offers the best combination of characteristics: low run time to perform the clustering and good trade-off between resolution time of the strategic flight planning model and number of capacity violations, their size also being significantly smaller than the realised ones. While this specific conclusion might not directly generalise to all datasets and problems, we can draw some general indications. First, in terms of capacity violations, all solutions for the problem of strategic flight planning obtained with any method and clustering algorithm are acceptable. In fact, all choices yield violations that are, in number and size, considerably smaller than the actual ones. Therefore, the quality of the solution should not be a strong concern in the choice of the method. Second, the shorter run time of the Infomap algorithm to perform the clustering make it preferable with respect to the Louvain algorithm. Third, if a good guess of the most likely trajectory is available (e.g., from historical data), it seems preferable to use methods 1 or 2 to build the network. In fact, these two methods require a shorter run time for the clustering (due to the smaller number of links) and yield either a better (in the case of Infomap) or comparable (in the case of Louvain) trade-off between violations and resolution time.
The results of this work also offer significant developments that might go beyond the simple reduction of computational resolution times. In fact, the possibility of identifying groups of flights that only weakly interact with each other allows, for example, to isolate the downstream effects that can be caused by a flight’s delay (reactionary delay) and also the consequences that it can have on passenger connectivity (missed connections). To perform this type of analysis, however, the definition of interaction between flights (used to establish links in the network) should be expanded. It should include not only interactions in the airspace, but also at airports, a study which is currently under development.
With the inclusion of interactions at airports, our approach could also lead to significant enhancements of models that optimise the allocation of airport slots. Currently, these slots are allocated according to the IATA World Schedule Guidelines (WSG) on an airport-by-airport basis. The allocation does not take place for a single day, but for a series of slots. A series is composed of at least five slots allocated for the same or approximately same time on the same day-of-the-week, distributed regularly in the season [
32]. Since airports are clearly interconnected, the current single airport allocation is likely to produce allocations at two different airports that may not be compatible with the flight time between them. For this reason, twice a year, a Slot Coordination conference takes place principally to resolve conflicts stemming from the timing of slots allocated across multiple airports [
33]. The current contributions that address the allocation of airport slots at the network level are based on very strong simplifications that severely limit their applicability—only for single days and not in series [
34], with computer-generated instances under a number of severely simplifying assumptions [
35], or considering a network composed of only 3 airports in southern China [
36]. An appropriate decomposition of the network, as proposed in this work, could foster the development of optimisation models that limit the use of face-to-face and manually conducted negotiations, as is the case today in the Slot Coordination conference.