1. Introduction
The metro system plays an essential role in urban public transportation. Because of its high speed, high capacity, and low energy consumption characteristics, the metro system has become the most popular type of urban transportation mode over the years, especially in large cities with many inhabitants, such as London, Tokyo, Beijing, Hong Kong, etc. Thus, plenty of effort is devoted to optimising the metro operation based on different demands.
From the macroscopic perspective, a typical metro operation problem is to make schedules for trains regarding systems’ constraints. Unlike a mainline railway system operated and restricted to a predetermined timetable, periodic operation strategies are the most common solution for a metro system. However, this traditional method cannot always satisfy the variation of passenger flow. In general, the variation of passenger flow is catalogued into two main kinds: regular variation and irregular variation. Regular variation occurs when predictable changes occur, such as the passenger flow caused by rush hours, summer holidays, significant events, etc. These regular variations could be predicted based on historical experiences in the early stage, and operators have enough time to design specialised dynamic timetables. For example, the London Underground established a specialised timetable for massive passenger flow caused by the football match at Arsenal Football Club to ensure sufficient system capacity. Contrarily, irregular variations occur occasionally, and they are unpredictable; for example, due to a sudden change of weather, many tourists choose to take the metro instead of walking to a tourist attraction; these irregular variations will invalidate well-designed timetables, which can cause passenger oversaturation and impact the service quality. Usually, though the approximate passenger flow distribution could be counted based on historical data, these real-time irregular variations will always exist, and predetermined dynamic timetables could not always satisfy these variations.
With the development of the automatic fare collection system and the face recognition technology, the real-time variations of passenger flow are easier to be detected. As a result, a new operation method could be proposed to adjust operation strategies to satisfy the real-time detected passenger flow variation. Though scheduling in a fixed timetable has the advantage of predictability for the passengers, it also makes the operation of the train easier for the drivers. Unlike a mainline railway system which limits the number of passengers boarding and provides a prepared timetable for passengers to plan their journey before purchasing the tickets, with its high frequency characteristic, most metro systems only publish the timetables for the first and last service (such as the Wuhan Metro system, the Beijing Subway system, etc.), most passengers do not plan their journey and board the first train arriving at the platform. Furthermore, based on a field study in the London Underground Bakerloo line operation department, because of the uncertainty in actual operation, metro drivers have to modify the operation strategies according to the metro operators’ real-time instructions instead of following a prepared timetable. Thus, a real-time passenger flow-oriented operation method is more advantageous than a preprepared timetable for a metro system. This real-time operation method is essential to guarantee the quality of services, avoid safety issues caused by passengers’ oversaturation in metro stations, and ensure the robust operation of the metro system. The associated service quality function can be formulated by a single or by multiple features of the considered metro system, including the overall travelling time of passengers, the waiting time of passengers, the transport capacity of the metro system in a time window, and the particular definition of passenger satisfaction, etc.
2. Literature Review
In recent years, rail operation problems have drawn tremendous attention from academia and industry [
1,
2,
3,
4].
A trendy area of rail operation research is train schedule. This research aims to propose different timetables according to different objectives. In general, three main goals are involved: avoiding train delays [
5,
6,
7,
8], reducing fuel consumption costs [
9,
10], and improving the quality of services [
11]. For a passenger railway system, especially for an urban rail system, it is essential to design a passenger flow-oriented scheduling strategy to improve the service quality, system reliability, and passenger satisfaction [
12].
The research on meeting the requirements of passenger flow-oriented scheduling can be divided into two categories. The first kind of research proposes periodic timetables [
13,
14,
15], which means that trains are scheduled at a specific frequency and that timetables keep repeating in a time horizon. This periodic scheduling method is easy to formulate and optimise based on regular passenger flow variations and periodic events [
4]. Thus, most metro systems in real-life apply this periodic operation strategy. However, because more passenger flow data can be collected and analysed, in recent years, instead of considering periodic timetables, more research has focused on scheduling to propose dynamic timetables based on dynamic origin–destination (OD) passenger demands [
3,
12,
16,
17]. For example, Yin et al. used the passenger OD matrix data at stations through a single day to minimise total passenger waiting time [
18]. Halim et al. built an OD matrix to minimise the passenger waiting time and avoid high passenger loads [
19].
The modelling method for scheduling is generally based on four main techniques: linear programming, integer programming (IP), nonlinear programming, and graph theory. For example, Petersen et al. used linear programming to develop a computer-assisted train dispatch system [
20]. Ghoseiri et al. built a multi-objective train scheduling model using IP [
21]. Li et al. proposed a green train scheduling model and fuzzy multi-objective optimisation algorithm based on nonlinear programming [
22]. D’Ariano et al. used graph theory to solve the real-time perturbations for a rail system [
23]. The solution methodologies can be divided into three types: (1) commercial optimisation software, (2) heuristic algorithms (e.g., evolutionary algorithms [
24]), and (3) simulation methods [
25,
26].
Another form of rail operation research is train rescheduling, which mainly focuses on adjusting the train operation into a regular timetable when operational incidents occur [
27]. There are two main research directions: (1) Disturbance management deals with short incidents. In this situation, the dispatch strategy needs to be modified to reduce the influence on the timetable in the future [
23,
28]. (2) Disruption management deals with those events that cause a long delay. In this situation, it is not only the modification of the dispatch strategy that is needed—other measures, such as cancellation of trains, the addition of trains, and skipping stops, also need to be considered [
29,
30]. In general, rescheduling strategies can be divided into five categories [
31]: (1) Holding, which is used to modify the headway between trains by holding the early-departing trains [
32]. (2) Zone scheduling, where the whole line is divided into different zones, and all undeparted trains are divided into different groups, and a specific train only stops at all stations within a single zone. This scheduling method can deal with passengers piling up in a few specific stations [
33]. (3) Short turning, a strategy that arranges several trains to only serve some zones with high demand defined by passenger flow speed and turn back [
34,
35]. (4) Deadheading, a strategy in which several trains are dispatched without stopping and collecting any passengers at the beginning of their trips to reduce the headways with other trains in front [
36]. (5) Skipping strategy, used by most metro systems with high passenger demands, allows some late trains online to skip low-demand stations to recover the timetable.
Real-time rail operation research is also popular in the current decade. Some existing research has focused on fast optimisation for a specific operation in a particular circumstance to increase the stability of the system and avoid delay, such as the authors of [
24,
27,
37] considering the optimisation of the trains’ order in the junction area to minimise delay. The other real-time research is about rescheduling after an incident; this research focused on correcting the timetable to normal to minimise the impacts from disturbance or disruption that already occurs, such as [
29,
38,
39]. As is apparent from the above review, most rail operation research is based on a predefined timetable, and most scheduling-related research is engaged to propose a robust timetable (dynamic or periodic) according to different requirements. Most rescheduling-related research focuses on reducing the impact on the timetable from incidents to recover trains’ operation or to optimise a specific operation to increase system stability. The comparisons of representative metro operation research and this research are listed in
Table 1 below.
In this paper, the authors focus on proposing a new metro operation method to satisfy real-time detected passenger flow without timetables. As a result, the overall objective is to establish a systematic methodology to keep adjusting operation strategies for an urban metro line to replace predetermined timetables, which is applied to reduce the total waiting time for all passengers under the vehicle number constraint. The main innovations of this research are:
Proposal for a real-time passenger flow-monitoring-based urban metro operation method without timetables;
Formal formulation of the proposed metro operation method;
Development of a modified genetic algorithm to solve the problem.
This paper is organised as follows. The overall research motivations and background are introduced in
Section 1;
Section 2 gives a general introduction and review of related research.
Section 3 formulates the real-time passenger flow-oriented operation problems for urban metro systems. A “real-time passenger-oriented operation method (RPOM)” based on nonlinear integer programming is proposed. To solve RPOM,
Section 4 introduces an innovative algorithm, GA_RPOM, based on a modified generic genetic algorithm and a macroscopic metro system simulator. In
Section 5, the proposed method is tested and evaluated based on the data from the Beijing Metro Line 19, and the results are compared with the other operation strategies with timetables in real life. Conclusions are presented in
Section 6.
3. Mathematical Description
Two figures are applied to describe the problem. In
Figure 1, there is a metro system; in general, trains will depart and stop at all the stations. We assume the passenger flow monitoring system, which can record passengers’ origins and destinations (ODs) based on their tickets, has been installed in all the stations. Thus, the average speed of entering passengers in a fixed period with different ODs could be counted. Passengers can only travel from a station labelled with a lower number to one marked with a higher number.
The process flow chart of RPOM is shown in
Figure 2. The passenger flow entering speed will be kept up-to-date in every new detecting period, and the operation strategy will be optimised based on the objective function after the updating. As this process needs to be completed rapidly, the optimising method is essential.
Based on the above description, the considered problem should be formulated to a RPOM that constantly optimises the operation strategy based on the updated data. The following part focuses on the model’s details, including parameters, decision variables, constraint conditions, and objective functions. Several assumptions and explanations should be given first to make the scheduling process straightforward.
(A1) Discrete time, i.e., 1, 2, …, T, will be used in this research;
(A2) Between stations, trains will run based on the predetermined running time, unless it is against safety constraints;
(A3) With the implementation of platform screen doors (PSD) and the deployment of station staffs, the impacts of passenger flow on the trains’ dwelling times have been significantly decreased in recent years. Thus, we assume the generated dwelling strategies can be followed strictly.
Based on the Field Study in the London Underground Bakerloo Line Operation Department, Modifying the Number of Trains in a Specific Operation Time Horizon Will Severely Affect the Economic Operation of the Metro System and Lead to More Operation Problems, such as the Staffs’ Redeployment. Thus, the Number of Trains Active in the Research Time Horizon Is Fixed for in this Research.
3.1. Decision Variables
The arrival and departure times have always been defined as two crucial decision variables for a traditional railway operation scheduling problem. However, with a set of optional departure intervals and dwells, arrival and departure times can be represented by the accumulation of departure intervals, dwell times, and running times. Thus, the decision variables involved in this problem are the choices of the intervals’ level and the dwell time level for each train at different stations, and binary codes could represent these choices. For clarity, the detailed definitions of decision variables are listed in
Table 2.
3.2. Notation and Parameters
Table 3 lists all the relevant notations and parameters in the problem.
3.3. System Constraints
The dwelling time could be adjusted in operation, and, in doing so, the time interval between trains would be changed. Therefore, a lower bound for the time interval should be provided to guarantee safety in operation.
Meanwhile, the departure time interval between two trains in the first station should be included in the collection of optional intervals.
The dwelling time for each train should be chosen from the optional dwelling times set.
The number of passengers on a train after its departure from a station should be less than or equal to the train’s maximum capacity.
The arrival time of train
at the first station equals the departure time of train
plus the departure time interval according to level choice for train
.
The operational running time between different stations is predetermined if the time interval between two trains is larger than the minimum interval; otherwise, it should be increased to guarantee the minimum interval.
The arrival time
of train
at station
equals the sum of the departure time at station
and the operational running time between these two stations.
The departure time
of train
at station
is equal to the sum of the arrival time
and the dwell time
.
The time interval between train
and train
at station
is equal to the arrival time
minus the departure time
.
3.4. Passenger Constraints
The number of passengers at station
waiting for train
can be represented by the passengers who were left by train
and newly arrived passengers.
The remaining capacity of train
at station
immediately after passengers alight is given by:
The number of passengers who can get on train
at station
should be chosen from a comparison of the remaining train capacity and the number of passengers at the station.
If the remaining capacity is enough, all passengers can get on the train. Otherwise, some passengers will be left. The number of passengers left by train
at station
is given by:
The number of passengers on train
when it is departing from station
is given by:
3.5. Objective Function
The waiting time for all passengers(WAT) can be used to evaluate the operation strategy. It includes the waiting time of passengers who are taken to their destination and the passengers who are left in the station, which means that passengers who cannot be taken by the last train become a penalty factor of this problem. Thus, the operation strategy should balance taking more passengers and faster arrival. Based on Equations (1)–(14), the objective function can be formulated below:
In real-time operation, passenger flow’s average entering speed will keep changing; the operation strategy needs to be optimised continually. However, it is impossible to adjust the station stop strategies that have already passed by the train. Thus, to optimise the operation based on detected variations in real-time, the departure time generated should be compared with the detecting time. If the departure time is earlier than or equal to the detecting time, it could not be optimised, and these departure times will be regarded as parameters in the objective function in future optimisation. Otherwise, the departure time is still a variable that could be optimised in real-time:
As the constraints need to be met to guarantee the safety of the system, the optimisation function of RPOM based on nonlinear integer programming can be given as follows:
4. Algorithm
From last section, the formulation of real-time passenger flow-oriented metro operation without timetables has been proposed. There are three requirements for the solving method. First, the operation strategies need to be adjusted and constantly based on optimising the result in real-time. Thus, the solving method needs to be fast enough. Second, the average passenger entering speed is a dynamic parameter. Third, the number of decision variables changes with the detecting time period.
Thus, to meet the requirements, a modified genetic algorithm, GA_RPOM, integrated with a macroscopic metro simulator, is proposed to solve the RPOM problem.
4.1. Macroscopic Metro Simulator
The four sections of the simulator will be introduced separately below. Based on this simulator, the locations of trains and passenger flow distribution could be tracked at any specific time in the research time horizon.
4.1.1. Metro Infrastructure Section and Passenger Flow Section
This section mainly focuses on data input and the descriptions of the passenger flow and the metro system. For passenger flow, the most critical data is the OD matrix, which describes passengers’ journeys between specific stations. For metro infrastructure, the input data mainly focus on the constraint conditions in the last section, which contains the running time, departure interval levels, different dwell time levels, and train capacity.
4.1.2. Timetable Section
Based on different departure intervals and dwell levels settled in the metro system section, each train has many optional combinations of interval and dwell time choices. This section assembles these combinations and generates, modifies, and records all trains’ arrival times and departure times based on these combinations. The process of the timetable section is shown in Algorithm 1 below:
Algorithm 1: Timetable generator |
Input: Departure interval and dwell level choice for trains in the time horizon |
Output: Arrival and departure time |
1 | for train do |
2 | for station do |
3 | if i = 0 and s = 0 |
4 | |
5 |
|
6 | else if i = 0 and s ! = 0 |
7 |
|
8 |
|
9 | else if i ! = 0 and s = 0 |
10 |
|
11 |
|
12 | else if
|
13 |
|
14 |
|
15 | else |
16 |
|
17 |
|
18 | return
|
19 | end if |
20 | end for |
21 | end for |
4.1.3. Passenger–Train Interaction Section
This part mainly concentrates on the interaction between the metro system and the passenger flow; it calculates all passengers’ waiting time and evaluates the system’s capability. Trains are dispatched based on the strategies generated in the last section, and passengers arrive at each station according to the average passenger flow entering speed recorded by the passenger flow section. With trains departing and stopping based on the chosen departure interval and dwell time, passengers get off firstly to make more space for passengers preparing to board. The metro infrastructure section should define train capacity. If there is free space on the train, passengers will board in sequence according to different destinations until the train’s maximum capacity is filled. Remaining passengers will keep waiting in the station. The process of the passenger–train interaction simulator is shown in Algorithm 2 below:
Algorithm 2: Passenger–train interaction simulator |
Input: Passenger data from passenger flow section; system data from metro infrastructure section; departure and arrival time from timetable section |
Output: Total passengers’ waiting time WAT. |
1 | WAT = 0 |
2 | for time t <= T do, t++ |
3 | for station do |
4 |
|
5 | end for |
6 | for train do |
7 | do |
8 | = t |
9 | > 0 do |
10 | -- |
11 | ++ |
12 | end for |
13 | end if |
14 | = t |
15 | |
16 | do |
17 | ++
|
18 |
-- |
19 | -- |
20 | end for |
21 | else |
22 | continue |
23 | end if |
24 | end for |
25 | end for |
26 | end for |
27 | return WAT |
4.2. GA_RPOM Algorithm
GA (genetic algorithm) is a metaheuristic inspired by the process of natural selection that belongs to the larger class of evolutionary algorithms (EA). GA is commonly used to generate high-quality solutions to search and optimise problems by relying on bio-inspired operators. In the GA, a population of candidate solutions (individuals, creatures, or phenotypes) to an optimisation problem evolves toward better solutions. Each candidate solution has a set of properties (its chromosomes or genotype) which can be mutated and altered; traditionally, solutions are represented in binary as strings of 0 s and 1 s.
Because of its clear concept, GA has become increasingly popular in the optimal train operational problems. For the RPOM problem, with the binary coding, the GA is more suitable to simulate the choices of dwell time and departure time interval in the metro system than other heuristic algorithms (such as the ant colony algorithm). With a large number of variables in the objective function, the GA could calculate the optimised result in a shorter time and also jump from the local optimal solutions with the crossover and the mutation process easily. Furthermore, the GA can be easily modified and integrated with the simulator for this problem. Based on the traditional GA, the modified process of GA_RPOM can been shown with the following steps:
Initialize population: in general, the population of a GA consists of fixed-length binary arrays called gene sequences which represent the specific variables in the objective functions. At first, the population will be generated randomly according to the designed population size.
For this research, we use binary numbers to represent the chosen departure time interval’s level and dwelling time’s level. Thus, each train can obtain a unique binary array representing its operation strategy. After integrating different trains’ binary arrays, an enormous binary array is generated as the gene sequence.
However, different from the traditional GA, the number of adjustable departure time intervals and dwelling times keeps changing. By integrating the macroscopic simulator, whenever the strategy needs to be optimised, the best gene sequence generated in the last detecting period would be recorded and compared with the new detecting time in the metro infrastructure section to modify the gene sequence length in real-time. The process to generate the gene sequence can be shown in
Figure 3.
Evaluation: This step evaluates all the gene sequences from the population generated in the previous stage. Typically, the evaluation process is based on the objective function. First, the binary arrays of gene sequence would be transformed into the variables in the objective function. Then, the evaluation indicators mark a better result with higher (or lower) scores.
In this research, the timetable section of the metro simulator could transform the binary arrays into specific operation strategies and ensure the objective function’s constraints. The passenger–train interaction section of the simulator could record the waiting time for all passengers, which could be regarded as the evaluation indicator.
Selection: The gene sequence with a higher score is adopted from this step. Generally, a roulette algorithm is applied to prove the fairness of selection. Then, selected gene sequences will become parent gene sequences for the next step.
Crossover: In this step, all the parent gene sequences selected in the last step will mate (cross a part of binary arrays) to produce new baby gene sequences which inherit a part of their features, as shown in
Figure 4.
Mutation: A part of the baby gene sequences generated in the last step is mutated. Based on the mutation rate, a part of the gene sequence will switch to opposite numbers (0 to 1, 1 to 0). The combination of mutating gene sequences constitutes the new population for the next period. The flow chart of GA_RPOM in the optimisation process and the interaction with the simulator is shown in
Figure 5.
With massive experiments, GA_RPOM could solve most metro operation problems in 30 s. Based on this algorithm, metro systems’ operation strategy could be kept generating and optimising in real-time.
5. Case Study
For this research, a case study based on Beijing Metro Line 19 is used to demonstrate the new operation method and evaluate the performance of GA_RPOM. The total length of the Beijing Metro Line 19 is 22.4 km with ten stations. The speed limit for the whole line is 120 km/h, and this line will be allocated with new 5G network communication and face recognition technology from Beijing Infrastructure Investment Co Ltd, Beijing, China. Before passengers enter the platforms, they need to choose their origin and destination from the ticket machines. Then, when they pass the platforms’ gates, an Automated Fare Collection (AFC) system can collect and generate real-time OD data. Furthermore, the EUHT communication units will be set up in all stations to transfer the OD data. Generally, this detecting and transferring process will take 15 min. Thus, the 15 min is regarded as the detecting period. The case studies use a single-direction line from Mudanyuan to Xingong, whose infrastructure data is shown in
Table 4. In macroscopic terms, the simplified line information and running time between stations are shown in
Figure 6.
Because this metro line is still under construction, the passenger data in the case studies is mocked and based on the early-stage desk research of the metro operator and the pre-designed capacity target of this line with the consideration of its connections with other commuting systems. Some extreme cases are also applied for testing. The passenger entering speed OD data is shown in
Figure 7. The vertical axis represents different OD pairs, while the horizontal axis represents passenger entering speed; different colours represent the different detecting periods.
The case study compares the quality of service based on all passengers’ waiting time in the research horizon between two traditional periodic timetables and the real-time passenger-oriented operation method presented in this paper. From
Table 3, there are two optional dwelling times and two optional departure intervals; one periodic timetable is applied with the short dwelling time and short departure interval, named ‘short periodic timetable’, and the other one uses the long dwelling time and long departure interval, named ‘long periodic timetable’. Contrarily, the RPOM method will keep generating operation strategies and optimising based on detecting passenger flow data in real-time.
The case study was carried out on a Windows 10 laptop with a Core i5 8th generation CPU and 8 GB of RAM. The GA_RPOM program is coded in the Java language based on JDK 1.8. The population size is 200, the mutation rate is 0.2 with a maximum mutation number of 5, and the maximum iteration number is 600. The parameters of the GA_RPOM for this case study are shown in
Table 5. We can obtain the operation strategies in 30 s based on the algorithm program. The convergence graph of the GA_RPOM based on the dynamic passenger flow data in the research time horizon is shown in
Figure 8. The operation strategies based on RPOM and the comparison of all passenger waiting times between RPOM and periodic timetables are shown in
Table 6.
From the above table, RPOM could optimise and modify any later operation strategy for both dispatched trains and follow-up trains at the beginning of every detecting period. In the 90 min research time horizon, the operation strategies’ optimisation process has been implemented six times, or 53 dwelling times, and the departure intervals have been rescheduled. Because the waiting time of passengers who have been left is also counted, from time 0 to 45 min, RPOM set each train’s departure interval to ‘long’ to take more passengers, and it kept adjusting the strategies to lead trains to stop longer in the station with high passenger flow entering speed. Between 45 and 60 min, an extreme situation happened: only two passengers entered every minute in the last station. Thus, RPOM changes the strategy of Train 8 rapidly to a short departure interval. After 60 min, it kept the long dwelling strategies to take more passengers to their destination. Typically, these modifications were implemented after noticeable passenger flow variations occurred, especially when a sudden, massive passenger flow occurred. Unlike the fixed dwelling times strategies, with the GA_RPOM method, trains aim to dwell shorter in the stations with lower passenger entering speeds and stop longer in the stations with high passenger entering speeds. Furthermore, GA_RPOM keeps modifying its operation strategies, making the massive passenger flows in the busy stations embark earlier and preventing these passengers from being stranded in these stations in real-time. With high flexibility, in most situations, the performance of RPOM was better than the periodic timetables, which is reflected by all of the passenger waiting time data. Though in some extreme situations, with the number of modifiable strategies decreased, the performance of RPOM was slightly worse than the short periodic timetable, which occurred once from 45 to 60 min, the overall performance of RPOM is much better than two periodic timetables. Moreover, as passenger flow variations are always in real-life operation, a flexible operation method is more valuable than a perfectly prepared fixed timetable.
With a fixed number of trains, compared with the periodic strategies such as the ST and LT, there is no significant increase in trains’ running costs of GA_RPOM, and it is easily applied in practice. Of course, extra operational training for the metro operators and drivers is necessary. However, compared with the experience-based operation method, more accurate indicators could be provided to the operators and drivers by GA_RPOM.
6. Conclusions
In this paper, an innovative passenger flow-oriented metro operation method without timetables has been presented. A mathematical model is proposed to minimise all passenger waiting times with a limited rolling stock number in the research horizon. A new algorithm program based on a GA, GA_RPOM, integrated with a simulator, has been developed to solve the model and optimise the metro operation plan in an acceptable computational time. Different from the traditional GA for other dynamic scheduling problems, GA_RPOM could modify the gene sequence length to optimise the operation strategies for the metro system in real-time. A case study based on the Beijing Metro Line 19 has been carried out to validate the effectiveness of the proposed method. Compared with traditional periodic timetables, the passenger waiting time has been reduced in most scenarios and the flexibility of the operation has been significantly improved.
The limitations found for this research were that no real-life passenger flow data were collected for the test, as the Beijing Metro Line is still under construction, and no other test metro systems were found for the proposed optimisation method.
In future works, we aim to design more test instances for the proposed method and also to improve the computing time by integrating the genetic algorithm with a machine learning method such as the decision tree algorithm.