1. Introduction
Ensuring climate-friendly and sustainable mobility requires strong traffic growth on the railways. However, in many places, numerous line sections and nodes are already operating at high capacity today. Therefore, the railway infrastructure must be optimally used, upgraded, and expanded. Future bottlenecks in the network must be detected at an early stage to derive expansion and new construction measures. Railway capacity studies are a means of identifying existing bottlenecks on the one hand and making a statement about feasible train numbers before the realization of a measure on the other hand. Capacity analyses are thus a key component for the planning and dimensioning of railway infrastructure.
Depending on the investigated problem, different capacity calculation methods are applied. The internationally widely applied UIC Code 406 [
1] introduced by the International Union of Railways (UIC) determines the capacity usage of a specific timetable and is mostly limited to lines [
2]. Simulation methods are also based on a specific timetable while being mainly used for the identification of bottlenecks, the assessment of timetable stability and thus for the comparison of different timetable or infrastructure options [
3]. They model interactions between different infrastructure elements and are hence also suitable to analyze railway networks. Since they rely on microscopic infrastructure data and are therefore very complex and computationally time-consuming, the network size that can be examined is limited.
Analytical methods offer a possibility for capacity determination that is independent of a concrete timetable. These methods are mainly used for the assessment of different infrastructure and operating program variants and have short computing times due to being based on rough information on the operating program instead of concrete schedules [
4]. Analytical methods are hence especially suitable for long-term infrastructure planning, which is the application area this paper focuses on.
Recognized analytical capacity calculation methods, however, determine capacities for individual building blocks such as line sections, track groups, and route nodes separately, as well as according to different quality standards. As a result, the respective capacities are not comparable with one another and do not sufficiently consider interactions between the single elements. However, since the capacity of an individual element changes depending on the surrounding elements, it is of particular interest for long-term infrastructure planning to make statements about the interaction of elements in larger contiguous areas. Simulations as well as queueing theory, which is used as the main tool in analytical capacity assessments, quickly reach their computational limits when considering networks. Therefore, most alternative approaches to capacity determination in networks make use of other techniques such as very macroscopic mathematical model descriptions or the formulation of optimization models that are solved numerically. Even though these models are more suitable for network descriptions than models based on queueing theory, their complexity also increases rapidly. For this reason, current optimization approaches are largely based on strong simplifications or consider only the details that are relevant regarding the specifically investigated problem.
In this paper, an optimization model that aims at determining the usable line capacities in a railway network under consideration of influences by surrounding infrastructure elements and of railway-specific details is presented. These details include, for example, the modeling of minimum headway times on lines and route conflicts in nodes. Furthermore, the model ensures that the load on each line can maintain given quality values through the application of the STRELE-formula [
5,
6]. This practice guarantees that the developed model is in line with state-of-the-art methods in Germany and therefore prepares for a potential practical implementation.
To introduce the model for capacity determination and discuss its contribution, this paper is structured as follows: First, an overview of the currently existing approaches for capacity calculation is given in
Section 2, before the model description and the details of the mathematical formulation are presented in
Section 3. In
Section 4, the application of the model to a sample network is described, and the results obtained are discussed. Finally, a summary as well as suggestions for future research are provided in
Section 5.
2. Literature Review
This section provides an overview of different capacity calculation methods, where first a summary of the state-of-the-art methods for determining the capacity of individual elements is given. Further, it is described which approaches have been pursued to calculate a network-wide capacity. The focus is on analytical methods and optimization approaches relying on methods from flow theory, while a complete overview of methods for capacity analysis in railways can be found in [
7].
One widely used method to determine capacity consumption on lines is the UIC Code 406 compression method [
1], which was developed with the aim of standardizing capacity assessment across different countries. It is applied to a specific timetable and based on the occupancy rate, which is obtained from the total occupancy derived via the compression of the single train rides’ blocking time stairways in relation to the investigation period. To ensure a certain quality, the UIC defined permissible standard values for the occupancy rate depending on the type and usage of the line. For the assessment of node capacity, however, the method is not unrestrictedly suitable [
2].
Analytical methods are often applied to determine capacities independently of an actual timetable. These methods mostly rely on the division of the infrastructure into nodes and lines and are based on queueing theory. In the case of nodes, the corresponding queueing system is solved for the scheduled waiting times occurring in the investigated infrastructure. For lines, in Germany, unscheduled waiting times are determined using the STRELE-formula [
5,
6]. In both cases, the resulting waiting times serve as quality criteria for which a defined quality standard can be applied to relate to the optimal number of trains, i.e., the infrastructure element’s capacity.
Since the respective capacities for nodes and lines are not comparable and do not sufficiently consider interactions between the single elements, the question arises whether capacities can also be determined network-wide to be used as a basis for long-term infrastructure planning. There are already a few approaches for deriving such a capacity, selected ones of which are presented below.
In [
8], queueing theory is applied to calculate unscheduled waiting times in a network, for example. However, the calculations assume simplified station layouts and there is no possibility for trains to pass each other. In [
9], the methods for lines and nodes described above are modified such that the resulting capacities are approximately comparable. Subsequently, the train rides in the network are iteratively optimized based on the calculated capacities for the single elements. Another approach that explicitly includes interactions between the individual infrastructure elements on a corridor is presented in [
10]. Here, individual infrastructure sections are modeled by queueing systems, the throughput for a corridor is calculated, and the influence of travel time variability on it is examined. Due to the quickly increasing complexity of the problem, the approach is only suited for corridors, though, and cannot be applied to branched networks. Also, in [
11], interactions are considered by investigating the influence of junctions on the capacity of lines by means of Markov models. Within the scope of the analysis, different operating strategies are mapped and compared with each other. This approach is extended in [
12] to include the state of the infrastructure. In [
13], the queueing approach is also extended to calculate capacity indicators for networks. Also, for the model developed in this approach, drawbacks consist in limited network sizes and a substantial generalization of the underlying infrastructure models, for example.
In addition to analytical calculation methods, there are approaches that rely on the formulation of an optimization model to calculate a network-wide capacity. Here, the idea is that calculating a network capacity can be traced back to calculating a maximum flow in a restricted flow network. Network flow problems in general are a well-known class of combinatorial optimization problems. Within such a problem, a so-called flow network is used to map the flow of objects such as vehicles or goods in a network consisting of vertices and directed arcs. There are numerous types and applications of network flow problems. In the most general maximum flow problem, the goal is to transport as many flow units as possible from one certain vertex to another while capacity restrictions on the arcs are respected and flow conservation in the network holds. Applications of maximum flow problems range from capacity assessments (e.g., of an oil pipeline network or an urban street network) to more abstract issues like scheduling or matching problems. There are excellent surveys in the literature of the various formulations and applications of network flows. Two examples are given by [
14] and [
15]. The first reference especially examines the vast area of static network flows, while the latter focuses on the topic of dynamic network flows.
To calculate the network capacity using network flows in the railway context, the underlying flow network is defined in such a way that vertices correspond to the stations or junctions, and arcs represent the line sections in between.
An important and practice-oriented part of the flow network in the railway context is the existence of travel times assigned to the arcs. Since the travel time of an arc depends on which train is running on it, another important aspect of the study is the distinction between different trains. To map these properties, for the application in the railway context, the standard maximum flow problem is extended to a dynamic multicommodity maximum flow problem.
Among the previous applications of dynamic multicommodity flow problems or of optimization methods in general, there are examples where capacity studies in railway networks have been conducted. The most relevant approaches for this study are described in more detail below.
In [
16], a study on the topological features of transportation networks is conducted. Since insights into network structure provide clear indications of flow properties, this approach establishes a solid graph theoretical foundation for flow analysis.
A direct application of flow analysis to a railway network is presented in [
17]. It estimates the network capacity by finding the maximum flow through the network applying the Ford–Fulkerson [
18] and Gomory–Hu methods. However, a notable limitation is the lack of accounting for time dependency such as travel times on the edges.
In [
19], a model is developed in the form of an integer linear program whose binary decision variables are defined based on the arcs of a time-expanded network. If the value of a variable for a specific line at a specific time or for a certain station at a certain time equals one, then a train is running on this line or stopping at this station at that time. Maximizing the sum over these variables calculates the maximum number of trains that can run on a sequence of line sections with stations in between. In this way, the capacity of an entire railway line is examined. Various limiting aspects are considered by the restrictions defined in the linear program. These include, for example, that a certain train mix must be maintained on the line, and that the number of station tracks that can be used in parallel by stopping trains is never exceeded.
The methodical approach of [
20] is similar. Again, the capacity is calculated for a single railway line, which consists of a sequence of line sections with stations in between. However, compared to [
19], further railway-specific restrictions are considered. Minimum headway times applicable to the line sections, and the minimum and maximum dwell times of the trains in the stations are modeled as additional influencing factors on the capacity. The level of the calculated capacity is also affected by the specification of temporary disruptions on individual line sections for specific time intervals. In addition, each train is assigned an earliest departure time and a latest arrival time in a discrete time horizon. Further binary variables are used to depict whether a train reaches its destination within the specified time horizon or not. As part of the model’s constraints, it is required that a certain minimum percentage of trains arrive at their destination on time. Finally, it is compulsory that freight trains are only scheduled up to a certain maximum percentage of the train mix. Similar to the previously described approach, one main drawback of the method is that it is only applicable to railway lines and does not extend to more complex and branched networks.
Since requests for freight train paths are often received at very short notice, a model with which as many train paths for freight traffic as possible can be planned ad hoc into an existing timetable is developed in [
21]. A larger network area is investigated, which is modeled using a time-expanded network. The resulting optimized timetable can then be used to determine a maximum number of trains. The results are strongly dependent on the existing timetable. An investigation based on the same methodology is presented in [
22]. Here, too, the usable residual capacities in an existing timetable are determined for a network. First, the number of trains is maximized, and, in a further step, the travel times of the trains are minimized. The capacity of the network can also be inferred from the optimized train journeys. It should be noted, though, that the results are only valid for the respective input timetable, meaning that the method is not suitable for planning processes.
In [
23], the authors concentrate on the differences between freight and passenger traffic from another perspective. They especially investigate the effect of traffic separation on capacity. Therefore, they optimize train flows, i.e., routes are not fixed and minimize different objective functions such as travel time or energy intensity.
Another capacity study that takes infrastructure as well as vehicle circulation into account using a hybrid time-space network is given in [
24]. Capacity is defined as maximal transportation, which does include the number of passengers instead of absolute train numbers with the result of a saturated timetable.
The approach described in [
25] focuses on the differences in train properties such as different speed or stopping patterns. It uses multiobjective optimization to create a saturated timetable whereas the application of the solutions in practice requires further research according to the authors.
Also [
26] includes different properties of trains. For fixed sets of trains, the time that is needed for all of them to pass the network is calculated. Hereby, different sequences within the set are considered and capacity of a set is defined by the time in which a certain percentage of all sequences can pass the network. This hence makes the approach timetable independent with the limitation that the sets are fixed beforehand.
In summary, the methods currently used in practice to determine capacity are very much geared to the subdivision of the railway network into its single elements. The capacity parameters used to derive the number of trains that can be operated differ for the individual elements. To include interactions between elements in capacity assessments, several studies, a selection of which is described above, present approaches that consider longer line sections or the network as a whole. However, these methods often rely on many assumptions and were mostly used to investigate specific problems. For example, some of the methods are limited to sequences of line sections and do not extend to branched networks. Others are based and very much dependent on a specific input timetable or relate to a capacity definition that is based on passenger numbers. They are therefore only suitable to a limited extent as a method for calculating a generally defined capacity in the context of long-term infrastructure planning. This paper, in contrast, introduces an optimization model to calculate a network-wide capacity that is independent of an actual timetable and includes a realistic representation of railway-specific constraints. These constraints concentrate on capacity determining properties. The objective of the optimization is to find the maximum number of trains that can run through the network. The capacity term thereby does not refer to this single number for the network. Instead, the resulting capacities for the single line sections, i.e., the extent to which they can use their capacities under the consideration of the surrounding network is analyzed. A further aim is the facilitation of an application in practice that fits in with the state-of-the-art methods, which is why the model is tested on a sample network based on a real network in Germany.
4. Exemplary Application to a Sample Network
4.1. Definition of the Sample Network and the Train Routes
To validate the developed optimization model, it is tested using a sample network that is based on a real network in North Rhine-Westphalia, Germany. The core network consists of ten vertices connected by eleven line sections, each of which is depicted by two arcs, as described in
Section 3.1. The linking lines contain eight virtual vertices connected to the core network by ten line sections (see
Figure 3).
Apart from two operationally relevant junctions (vertices 3 and 4 in
Figure 3), all vertices of the core network correspond to stations, whereas smaller junctions are neglected in favor of computability. Most of the line sections are double-track line sections, but the network also comprises two short single-track line sections, so that all aspects covered by the model can be tested in the validation process. The distances covered by the different line sections lie between three and fifteen kilometers.
To apply the model to the sample network, it is first necessary to distinguish the different route numbers and, based thereon, to determine the sets of scheduled trains and of potentially to be planned trains .
In the presented case, we consider 13 local passenger and three freight train route numbers in each of the two directions. All the considered route numbers run on at least one arc of the core network. The set of scheduled trains is defined over a specified number of trains per route number and hour. Here, the freight train route numbers are included in once per hour and direction and the passenger route numbers are included according to their frequency in the current timetable. Note that the exact schedule itself is not fixed and only frequencies are extracted from the timetable. The defined trains of a route number can be distributed arbitrarily by the model within the hourly slices of the time horizon according to their frequency.
In the set of potentially to be planned trains , only trains of the freight train route numbers are considered, as this is a realistic scenario in the actual practical application. Usually, timetables for passenger trains are fixed and additional (short-term) slots saturating the timetable are then mostly allocated to freight trains.
For all passenger and freight train route numbers, the departure frequencies and the assigned routes through the sample network of
Figure 3 are listed in
Table 2.
With the route numbers defined, it is possible to determine the basic load on the individual arcs of the sample network shown in
Figure 3. The resulting numbers of trains are listed in
Table 3.
4.2. Calculation of the Input Parameters
The necessary input parameters for the optimization model are determined with LUKS
® [
28], which is a software tool for microscopic railway operation studies, that, in addition to the analytical capacity investigation of nodes and lines, also enables the simulation of timetable creation and operational processes. The input parameters include the travel times of the different route numbers listed in
Table 2, the average minimum headway times between two trains on each line section and the hourly capacity limits for each line section. The minimum headway times as well as the capacity limits of the line sections thereby depend on the train mix. Since the scenario considered in the present study assumes a fixed set of trains that is filled up with freight trains only (cf.
Section 4.1), the train mix in the optimization process changes in each iteration (cf.
Section 4.3 for a detailed description of the optimization process). An important note in this context is therefore that the calculation of the input parameters is based on the train mix that results from the initially specified operating program through the set
(cf.
Table 3). Furthermore, since the time resolution here is set to minute steps, all parameters must be rounded to full minutes.
Travel times are calculated for every arc of the core network and each route number running on it. All dynamic vehicle properties are considered. If the resulting travel times exceed full minutes by more than six seconds (0.1 min), they are rounded up to the next full minute. Otherwise, they are rounded down. This approach ensures that the network-wide capacity is rather underestimated than overestimated in the end. Stops for passenger trains are defined in each station of the core network so that the calculated travel times correspond to realistic values for the modeling. No stops are planned for freight trains, but the optimization model allows them to stop in stations if they should be overtaken by passenger trains (for capacity reasons). To consider the time that is necessary for the start-up process and brake application in case of such stops, the travel times of freight trains on each arc of the core network are blanketly increased by one minute.
The use of headway-specific minimum headway times would quickly increase the complexity of the model and thus the runtime. Therefore, average minimum headway times based on the initial train mix (cf.
Table 3) are calculated for each arc of the core network. The average minimum headway times are rounded in the same way as the travel times. The minimum buffer times are set to one minute for all arcs of the core network in accordance with DB regulations [
29].
The line capacities are determined using the STRELE-formula, which is integrated in the LUKS
® software tool, assuming an optimal operational quality according to the standards applied by DB in Germany. The calculation requires, among others, the definition of delay characteristics for the different route numbers that are assumed to take the standard values defined by DB for highly loaded lines. The resulting capacities are rounded down to the next integer. Instead of the STRELE-formula, other methods can also be used to specify an upper limit for the line capacity. For example, a limit for utilization according to UIC Code 406 could be provided. Through iterative addition of train services (cf. procedure described in
Section 4.3), where it is checked in each iteration whether this limit can be adhered to or if it is already exceeded by the scheduled trains, the optimal utilization can be determined.
For nodes, a capacity equivalent to that for lines does not exist. Instead, a realistic limit on the number of trains that can pass a node is approximated by including the number of tracks that can be occupied by stopping trains at the same time. On top of that, route conflicts between the different trains are considered. The minimum time interval between two conflicting trains is set to two minutes (see
Section 3.2.2 for details).
Moreover, the following assumptions are made for dwell times. Passenger trains have a minimum dwell time of five minutes at their start and destination stations as well as stations where they change the direction of travel. At other planned stops on their routes, they have a minimum dwell time of one minute and a maximum dwell time of two minutes. Additionally, passenger trains are not allowed to stop where they are not planned to stop. For freight trains, there are no restrictions regarding dwell times.
The following
Section 4.3 elaborates on how the optimal number of trains is determined, before the results for the network described in
Section 4.1 are given in
Section 4.4.
4.3. Implementation of an Iterative Optimization Process
One problem in running the optimization model presented in
Section 3.3 only once is given in the definition of an appropriate number of trains in the set
. If, on the one hand, the number of trains that are defined to run in addition to the trains scheduled in any case is too small, all trains will be routed, but it remains unclear whether the network’s capacity is already exhausted or whether additional trains could be scheduled. If, on the other hand, there are too many trains, the optimization process may not terminate within a reasonable time, since the time-expanded network grows rapidly, and with it the number of variables and constraints in the model.
For that reason, an iterative optimization process is developed in this paper. At the beginning, only the trains scheduled in any case () are defined, and the optimization model is used to check whether a valid schedule exists. In case all trains have been routed, one train is added to the (initially empty) set for each freight train route number. Subsequently, the model is solved again. That means that this time it is checked whether a valid schedule containing both the trains from and the additionally defined trains from can be generated. For each of the following iterations, it is evaluated for which of the freight train route numbers an additional train could be planned in the last iteration. The route numbers for which this is the case are again included in the set by one additional train in the following iteration. Trains that could additionally be planned in one iteration are also scheduled again in the next iterations. On the contrary, the route numbers for which no additional train could be planned in one iteration are excluded for the following iterations. As soon as there are no more trains that can be scheduled for any of the specified additional route numbers, the program terminates. The total number of trains that could be scheduled determines the capacity of the core network.
The iterative optimization process described is depicted in the flow chart of
Figure 4.
4.4. Results
The optimization model described in the previous sections is implemented in Python 3.9 and solved for the sample network specified in
Section 4.1 on the compute cluster of the RWTH Aachen University using Gurobi 9.5. For technical details of the compute cluster see [
30]. For the application, a time period of three hours (
) is considered. In the following, the results are first analyzed for the entire network as depicted in
Figure 3.
For the entire network (see
Figure 3), 38 trains of 32 different route numbers are initially scheduled per hour, 6 of which are freight trains. This means that, within the whole investigation period, 114 trains are initially scheduled (cf.
Table 3). All freight train route numbers are then considered for the saturation of the network, and in each iteration, it is checked whether another train of each route number can be incorporated. This procedure leads to a total number of 132 trains in the network. Hence, eighteen additional trains that were added in seven iterations could be scheduled.
The runtime of the entire optimization process was approximately 477 min. To achieve this, further accelerating constraints were implemented in the optimization model. Most of these constraints helped the solver make irrelevant but necessary decisions that do not change the objective value of the best possible solution. An example of these constraints is the specification that no trains use holdover arcs of the time-expanded network outside of the core network. One example of constraints that can have a minor impact on the results but contribute significantly to the acceleration of the optimization process, is the specification for trains of passenger route numbers in the set to maintain a minimum time interval of half their frequency at their origin vertices.
Apart from the evaluation regarding the number of totally planned trains, the train numbers can also be evaluated with respect to the single line sections of the considered network. The resulting train numbers can then be compared to the line capacities that were used as input parameters for the model as an upper bound.
Figure 5 shows this comparison for the entire network, where the first number indicates the actual number of trains on each line section and the second number indicates the maximum number of trains that could theoretically run on that section according to the calculated line section capacity.
To allow for the derivation of network effects, the network is divided into two subnetworks, each of which the analysis is repeated for. The division into the partial networks is illustrated in
Figure 6.
In total, 84 trains were initially planned for the “upper” part of the network and 21 trains were added during the following seven iterations. The runtime of the complete optimization process was approximately 17 min. The subnetwork, consisting of the “lower” part of the entire network, is already saturated after six iterations, and 12 additional trains could be added to the originally planned 78 trains. The runtime of the entire optimization process was approximately 91 min. The load of the respective subnetworks after saturation can also be seen in
Figure 6.
In comparison to the analysis of the entire network (cf.
Figure 5), it can be seen that in these partial analyses at least as many (and sometimes even more) trains can be scheduled on the individual line sections.
The optimization thus allows the capacities that would be available when looking at a single element to be compared with the actually usable capacity when considering the interaction with surrounding elements. As expected, the usable capacity tends to decrease the larger the network under consideration becomes, as the influences between the elements increase.
5. Conclusions and Discussion
In this paper, an approach to calculate a network-wide capacity was developed. The approach is based on existing optimization methods and considers characteristic capacity values for lines that are determined using established procedures from railway operations science. This ensures that the developed approach is as consistent as possible with existing procedures and quality standards. The derived model maximizes the number of trains in the considered network while ensuring that railway-specific constraints are met. These constraints include, for example, minimum headway times and hourly capacity limits on lines, route conflicts between arriving and departing trains, the number of available tracks, as well as permissible dwell time intervals for trains in stations. As the input of the optimization model, an individual travel time can be specified for each train on each arc of the network; thereby, different train characteristics as well as the maximum permissible speeds of the line sections are reflected in the model.
In addition to the total number of trains determined for the entire network under consideration, the number of trains that can be operated can also be evaluated in relation to individual lines of the network. In this way, the train runs determined for the individual routes can be compared with the capacities that were entered into the model as input parameters.
The application of the model to a sample network illustrated how the model can be used to examine the extent to which the capacities theoretically available on the individual line sections can actually be used, taking network effects into account. The amount by which the usable capacity is smaller also depends on the network size and the utilization of the network at its edges. Therefore, the network boundaries for such a network analysis should be carefully chosen to minimize the interactions that are neglected by the network selection. The exemplary application of the model is intended to show in a comprehensible way that certain sections of a railway network may already be at their capacity limit, while there is still residual capacity on other sections. Such an investigation can therefore be used, especially with regard to economic aspects, to identify crucial bottlenecks in a network and to derive targeted investment decisions that can increase the capacity of the overall system.
The presented study should be considered as a preliminary investigation. Further research is required regarding, for instance, the following aspects: The input parameters of the model are based on the train mix that is given by the initially scheduled trains. However, the train mix actually changes as soon as additional freight trains are added to the network. It should therefore be investigated whether it is possible either to adapt the input parameters in each iteration, or to not exclusively add freight trains but to increase the number of trains in such a way that the initially defined train mix is maintained. Another possibility to extend the model is the inclusion of stochastic travel times to consider deviations from ideal conditions such as adverse weather conditions or time restrictions due to track construction and maintenance. Furthermore, travel times of freight trains should depend on whether a freight train stops or not. One approach would be to define a variable for each freight train that ensures that in the event of an unscheduled stop at a node, the required travel times on the adjacent arcs are increased for this train to adequately model braking and acceleration processes. One more drawback of the presented approach is the rather rough modeling of nodes. It should hence be checked whether the processes in nodes can be modeled in more detail or by node capacities that are derived with a more accurate approach.
A key factor in the capacitive analysis of networks remains that it would be useful to increase the network size compared to the scenario investigated in this study. However, the computing time increases significantly with an increasing network size. Therefore, an important direction for future research is, on the one hand, the development of powerful heuristics. In the best-case scenario, those heuristics can be used to achieve a possibly suboptimal, but nevertheless particularly good result in a significantly reduced computing time. On the other hand, it should be analyzed how the examined network and its associated input parameters can be aggregated into a more macroscopic view, so that fewer variables must be calculated, which also leads to shorter computing times and therefore allows for the examination of larger networks.