1. Introduction
The traction control of high-speed trains is a factor critical to ensuring their safety, punctuality, and reduced energy consumption. However, traditional traction control methods often make it challenging to achieve the optimal traction distribution in the face of complex dynamic characteristics and environmental conditions during the operation of high-speed trains. Especially in regards to the issue of energy consumption, drivers often need to make a difficult trade-off between traction and speed, resulting in an unnecessary waste of energy and reduced operating efficiency.
Saving energy during the operation of trains is essentially an optimization control problem and has attracted the attention of many scholars. Control strategies based on dynamic models are more common than others, but such methods face challenges in accurately describing the dynamic characteristics of train operation [
1]. Scholars have applied modern intelligent control methods to the field of train operational control; thus, a series of methods using deep learning have been developed. In terms of the driving strategy of high-speed railway trains, Lei et al. optimized the genetic algorithm based on the speed limit, slope of the actual route, and the constraints of the vehicle. Although it has an obvious energy-saving effect, it has a high degree of computational complexity and strong data dependence, and it is difficult to adjust its parameters [
2]. Ning et al. proposed a high-speed railway train track optimization model considering continuous train constraints but could not guarantee the global optimal solution [
3]. Cao et al. proposed a method based on mixed integer linear programming to optimize the trajectory of high-speed trains, but the process of transforming the problem into a mathematical model is more complicated [
4]. Based on the Soft Actor Critic method, Su et al. proposed a method to optimize the train driving strategy, but it is sensitive to the initial strategy, which may cause the algorithm to fall into the local optimal solution or the convergence speed to be slow [
5]. Zhu et al. transformed the inverse problem of training energy-saving control into a decision-making machine learning algorithm solved by a finite Markov process through deep reinforcement. They adopted the algorithm of the Soft Actor Critic method to determine the optimal driving strategy, but selecting the appropriate hyperparameters often requires a lot of experiments and tuning [
6]. Zhang et al. proposed a real-time energy-saving optimization method of timetable based on an improved differential evolution algorithm and fast iteration method, but the effect of reducing energy consumption is not obvious [
7]. Ying et al. optimized the curve of energy saved during train operation by quadratic constraint linear programming, but it was limited to only the neutral zone system [
8]. Havaei et al. proposed a new intelligent proportional–integral–differential controller to optimize the velocity trajectory. However, when there is integral saturation in the system, the performance of the controller will decrease [
9].
A high-speed railway system usually contains complex nodes and edges, and the interaction and influence of these are complex to model by traditional methods. The GNN (graph neural network) can capture important information in the network through mechanisms such as convolution, attention addition, and information dissemination, thus providing more accurate energy consumption prediction and energy-saving optimization [
10]. The VGAE developed from the variational auto-encoder (VAE) is a graph neural network that uses latent variable learning to approximate the Gaussian distribution and can effectively solve the link prediction problem [
11,
12,
13]. Nowadays, an algorithm based on graph neural networks is mainly applied in the field of transportation to solve the problem of traffic flow prediction in urban traffic networks and is rarely applied to issues related to high-speed trains [
14,
15,
16].
The VGAE can effectively learn the relationship between nodes in complex systems, which is crucial for modeling the interaction between components in high-speed rail systems. By modeling the high-speed rail system as a graph, the VGAE can learn the low-dimensional representation of nodes to encode and represent the system state effectively. In addition, the VGAE can also expand the training data and improve the generalization ability of the model by generating synthetic samples. Based on previous research, this paper applies the link prediction method of the VGAE to the optimal control problem of trains. To solve this problem, the implicit graph structure is established, and the train traction problem is abstracted into a multi-classification problem. The VGAE is employed to derive an intelligent traction control, which is continuously adjusted to calculate the traction’s energy consumption.
2. The Formation of Energy-Saving Problem of High-Speed Trains
To address the energy-saving control problem of high-speed trains, we aim to reduce energy consumption, improve traction efficiency, and reduce operating costs during train operation. The multiple aspects covered by this question are shown in
Table 1. This paper applies advanced traction control and braking strategies to model the traction process, accurately and dynamically adjusting the strategy to adapt to different operating conditions. The force analysis of the train operation is shown in
Figure 1. The train is regarded as a single-particle model, and it is subjected to five forces during its operation: train gravity G, support force N, traction force F, running resistance W, and braking force B. Because the traction and braking force are two different working conditions of a train power system, traction F and braking force B cannot exist simultaneously.
The high-speed train adopts a composite braking system with electric braking as the main method and air braking as the auxiliary method. The most commonly used braking method prioritizes non-wearing electric braking. In contrast, air braking or composite braking is used in emergency braking situations and is difficult to describe in specific application scenarios. The basic braking control modes include regular, emergency, anti-skid, and parking braking [
17]. Compared with pure mechanical braking, the driving strategy combining regenerative and mechanical braking saves more energy [
18]. The brake handling level contains different braking modes, which can meet the requirements of high-speed trains under different operating conditions. These include common brake levels (1A, 1B, 2, 3, 4, 5, 6, 7, 8), OC, REL, and EB. In the same operating environment, different braking operations consume varying amounts of traction electrical energy [
19].
Without considering the influence of other conditions, the average increase in traction energy consumption per second for each brake lever position is shown in
Table 2. REL corresponds to the traction position, and its traction energy consumption is greater than that of other operations. In the event of an emergency, the brake lever needs to be placed in the “EB” position. However, if the train is stopped on a track segment where evacuation is not feasible, the driver can place the brake lever in the “OC” position to cancel the excessive braking force. Therefore, compared to common braking positions, it results in higher energy consumption, which is consistent with the statistical results in
Table 2.
By improving the operation mode of high-speed trains, such as by optimizing the acceleration and deceleration strategies of vehicles, improving the speed control of trains on different road sections, and optimizing the parking strategies of trains, the traction energy consumption during train operation can be effectively reduced [
20,
21]. The optimal traction control is studied to determine the application time of the brakes and the type of operating brake handling. The braking problem can be transformed into the following equivalent decision problem.
In Equations (1) and (2) above, represents the running state of the train at decision time , which is comprehensively reflected by the running environment, the attributes of the train, and the running state. maps the train state to the braking operation that the drivers may perform; and represents the specific braking operation, and its value is a specific number of discrete quantities, namely . Therefore, the process of obtaining an intelligent braking application strategy by learning from actual driving data can be considered as solving a multi-classification problem. operates for the drivers to change the current running state from to .
The energy saving can be determined by whether the resultant force ∆ of the train is greater than 0. The energy-saving auxiliary labels defined for each sample are shown in (3):
where, when
, the direction of the resultant force on the train is consistent with the running direction. In this case, most of the trains are idle, and the energy consumption does not increase much. When
, the resultant force on the train in the horizontal direction is 0. In this case, most trains are in the cruise state, and the energy consumption does not increase much. When
, the direction of the resultant force on the train is opposite to the running direction. In this case, most of the trains are running with an increased level of energy consumption.
3. Energy-Saving Traction Control Model Based on VGAE
The proposed energy-saving traction control model based on the VGAE is divided into two parts: one constructs the equilibrium training set, and the other constructs the implicit graph structure according to the problem and uses the VGAE to solve the problem. The overall process is shown in
Figure 2 as follows:
The class of data with the least number of brake handling operations is defined as , and the other classes are defined as .
3.1. Constructing the Equilibrium Training Set
Like most multi-classification studies, the braking decision problem of high-speed trains includes the phenomenon of sample imbalance; that is, the number of samples varies greatly between categories, and it is difficult to extract effective information from classes with too few samples.
The training set without any preprocessing leads to poor performance in multi-class classification results, with unsatisfactory levels of model accuracy, robustness, and other performance indicators. Moreover, the classification error of the minority class is more serious than that of the majority class [
22]. The phenomenon of data imbalance is often ignored in the braking problem. While solving the air-braking decision problem of heavy-haul trains, the literature [
23] mentions that the information embodied in the few data sets that are ignored may be related to some special conditions encountered by the train.
Therefore, this paper needs to overcome the problem of data imbalance in the training samples so that the number of samples in each case is as balanced as possible. The implementation steps of this module are divided into two parts.
3.1.1. Eliminating Redundant Noise Data Based on KNN
The redundant data generated by high-speed train operations includes repeated data and some useless surplus data. Noise data are generated by incorrect operation when drivers are nervous. This information has a greater impact on a small number of data sets [
24], so this part only focuses on a small number of data sets. Considering the classification performance and computational cost, the redundant and noisy data are filtered by the KNN-based method. An equal number of samples were randomly selected from small data sets and big data sets as known categories, and samples from the small data sets were chosen successively as unknown samples. Equation (4) was used to calculate the distance between unknown samples and known samples:
Wherein represents the Euclidean distance between the unknown sample and the known sample , and r represents the rth attribute.
We found the k samples with a high degree of similarity to the unknown samples and determined the category of the unknown samples as the category with the most occurrence times among the k samples. The selection of the k value was determined by the five-fold cross-validation method, and the k value with the higher average accuracy was selected. After determining k, we compared the actual value and the predicted value in the original high-speed train driving data for consistency. If they were not consistent, the data were removed from a small number of data sets by filtering.
3.1.2. The Undersampling Algorithm to Solve the Imbalance Problem
After obtaining the filtered data set, the undersampling algorithm is used to randomly select samples from big data sets to construct a training set with a balanced sample number so that the number of samples extracted from each class satisfies Equation (5) as follows:
The difference between this study and other undersampling algorithms is that each class of big data sets satisfies; that is, under the premise of ensuring that the number of each braking operation in the training set is the same, the number of energy-saving state transitions in each class is also as consistent as possible. We suppose there is no mapping of a certain relation under a certain operation or the number of samples is less than the specified number of samples extracted. In that case, we should increase the number of samples in the other operation cases to ensure the balance of the training sets.
3.2. Solution of Variational Graph Auto-Encoder Strategy
By constructing the graph structure of a high-speed rail system, the VGAE strategy represents the complex relationship between the braking operation and the train attributes in the operation process as a network of nodes and edges. It uses the neural network to learn the posterior distribution and the potential representation of the nodes to design the corresponding optimization algorithm or decision strategy so that the train drivers can flexibly adjust the braking intensity and timing of the train according to the current operational situation and environmental information to achieve the optimal level of energy consumption.
3.2.1. Diagram Construction
According to the description of the problem in
Section 2, the braking operation and running condition of a high-speed train are modeled as graph G = (V, E, F), as shown in
Figure 3:
Where node V represents the set of running states of high-speed trains, and node set contains attributes that reflect the running state of high-speed trains, in which the number of attributes is m, including the energy-saving label . The edge set contains the direction information, and the node representing points to the node representing , indicating that the specific braking operation executed at a certain time makes the train’s running state change from to .
The problem of this study is transformed into a link prediction problem of a directed graph, which predicts the possibility of a certain type of link between nodes. It is similar to the description of the possibility of synergistic lethality in the medical field [
25]; that is, according to the attributes of high-speed operations and their energy-saving labels, the most likely braking operation is speculated.
3.2.2. Variational Graph Autoencoder Solution
The VGAE is an unsupervised learning framework that can utilize neural networks to learn posterior distributions and obtain interpretable latent representations of undirected graphs.
Compared with other tasks that can make link predictions [
26], the VGAE model can naturally integrate the characteristics of nodes, so the model has achieved good results in the link prediction task. The basic structure of the VGAE defined in this study is shown in
Figure 4:
In the figure, the first layer is the shared GCN, and the second layer is composed of parallel
and
. The brake diagram has
f nodes, where each node contains
train operation attributes, and all the attributes are expressed as the attribute information characteristic matrix
of
. The node represented by the current train state
has a connection with all the nodes represented by the next train state
, indicating that the drivers have performed the operation, and the relationship between the nodes with the connection is 1; otherwise, the edge relation value is 0, and all the edge relations form an adjacency matrix
A of
f ×
f. The feature matrix
X and the adjacency matrix
A of the graph under each perspective are input into the graph autoencoder of each perspective, respectively. The mean and variance of the target distribution are calculated, and the Gaussian distribution of the target is obtained by using the graph convolution network, as shown in Equations (6) and (7).
The mean vector
and the covariance matrix
are learned by the GCN, where the first layer parameter
in
and
is shared, and the second layer parameter
is not shared. The mean and variance are sampled by reparameterization, and the posterior distribution of the input is determined by the mean and variance. Then, the potential vector
Z is obtained, as shown in (8) and (9).
Here,
is the element in the latent vector
Z, and
is the obtained posterior distribution. The adjacency matrix is reconstructed using the inner product of hidden variables, and the reconstruction is realized by calculating the probability of edges between points. Finally, a reconstruction diagram of the energy-saving traction for high-speed train driving is obtained as shown in (10) and (11).
is the process of calculating probability;
takes the inner product of latent variables sampled from the distribution obtained by the encoder as the decoder, where
is an element of
A, and
is the Sigmoid activation function. The loss function needs to determine the similarity between the reconstructed graph and the original graph, as well as between the distribution calculated by the GCN and the standard Gaussian distribution, which are measured by the cross entropy and
KL divergence, respectively, as shown in (12):
4. Simulation Process and Results of Train Energy-Saving Optimization
In the previous section, an energy-saving traction control model based on the VGAE was proposed. This section describes our supervised training and reconstruction of train traction based on real data of high-speed train operation, a simulation of this, and a comparison of the results. Through the analysis of the algorithm and its energy-saving performance, the effectiveness of the proposed model is verified.
4.1. Simulation Experiment Settings
The simulation experiment is carried out on the basis of the proposed scheme. Firstly, it is necessary to select and set the experimental circuit, including the sample size and circuit information of the brake handling. In addition, various parameters need to be set, such as the sample size of the training set, the number of iterations, and the embedding dimension. These parameter settings will directly affect the results and reliability of the simulation experiment.
4.1.1. Experimental Circuit
The original data are the actual data of more than 190 thousand Tangshan Rail Car Company high-speed trains running from 4 December to 22 December 2012. The unprocessed data include 202 attributes, which can be divided into train attributes, operational characteristics, and the predicted values of braking operations. The statistics of the level of brake handling during the operation of the high-speed trains are shown in
Table 3.
During the operation of high-speed trains, the number of samples at each brake handling level varies greatly, and there are too few cases in which the brake handling level is “EB”. This study focuses on the normal driving process. Therefore, a few data sets in this study are defined as cases in which the brake handling level is at “OC”. The line containing all the braking level operations is used to verify whether the optimal control method proposed in this paper is effective. The specific line data are shown in
Table 4, and the statistics of the brake handling levels in the simulation experiment data are shown in
Table 5.
4.1.2. Parameter Setting
The number of balanced training sets extracted from big datasets and small datasets is 30. The k value determined by cross-validation is four, and we use the European distance function. The number of valid minority datasets obtained by filtering is 65. The VAGE model uses a two-layer GCN as the encoder and uses the Adam optimizer to train 300 iterations at a learning rate of 0.01. The embedding dimension of the VAGE is 32.
4.2. Simulation Process
Firstly, the static attributes that cannot describe the sample distribution are eliminated. The chi-square test is used to reduce the dimension of feature selection and eliminate the attributes that are not related to the classification problem. The average interpolation method is used for filling, and attributes such as train operation status and arrival distance are increased. Finally, each sample contains 51 attribute values. The first part of the strategy is used to obtain a balanced training set, and the attributes of these samples are extracted at the next moment according to the time stamp to illustrate the changes in the train operation caused by the braking level. In order to eliminate the influence of the graph direction on model training, the braking problem is divided into three perspectives according to the value of the energy-saving label
under the current high-speed train
of the training set, as shown in
Figure 5.
The center of each figure represents the node under the current status of the high-speed train operation, and the possible situation for the next status is around the center node. The figure is based on the true statistics of all possible situations in the training set. Multiple VAGE models are used to reconstruct the braking relationship diagram of high-speed trains from each perspective, and supervised training is performed in combination with the braking value of the training set. In the process of verification, the energy-saving braking label at each moment is applied to the reconstruction map from the corresponding perspective in turn, and the edge with the highest probability of occurrence is selected as the most likely operation of the current driver. The predicted value constitutes an initial strategy for train operation.
4.3. Analysis of Simulation Results
Our analysis of the simulation results covers many aspects. The first is the evaluation of the performance of the algorithm, which includes a comprehensive evaluation of the performance of the proposed method in the experimental environment. The second is the evaluation of its energy-saving performance. Compared with the traditional method, a reduction in energy consumption and improvement in system energy efficiency are achieved. Finally, the proposed method is compared with other existing energy-saving methods to evaluate its advantages and disadvantages in practical applications so as to verify its effectiveness and practicability.
4.3.1. Evaluation of Algorithm Performance
In this paper, the AUC and loss are used to measure the performance of the model, and their changes with the number of iterations are shown in
Figure 6 as follows:
The AUC value in the figure tends to be 0.9; the loss value gradually decreases with the increase in training rounds and finally stabilizes. This shows the validity of the model.
In addition, in order to further compare and illustrate our model, the confusion matrix is defined to describe the classification effects of multi-classification models, and
,
, and
are used to represent the accuracy, recall, and overall performance of model classification, respectively, as shown in (13)–(15):
Wherein
is the number of correctly predicted samples in the class, and
is the number of samples that incorrectly classify brake operation
as brake operation
. The results of
,
, and
are shown in
Table 6,
Table 7 and
Table 8.
From the above table, it can be seen that the classification effect of the VGAE for traction control in this paper is not much different from that of the naive Bayesian and BP neural networks. The classification effect of REL braking is better than that of others because the number of samples of other classes in the validation section is too small. If the number of samples of each class in the validation set can reach a certain value, the accuracy, recall, and overall evaluation will be greatly improved. In addition, this study does not have requirements that are too strict on the accuracy of classification. It is reasonable that the predicted braking operation is inconsistent with the original operation, as we focus on determining the amount of energy saved.
4.3.2. Energy-Saving Performance Evaluation
The running curve of the final running strategy is shown in
Figure 7. The simulated high-speed train reaches the speed limit when it goes from the starting point to more than 20,000 m, continues to cruise to more than 70,000 m at a speed close to the speed limit, and then begins to slow down. Compared with the original real running curve, the optimized traction control makes the change in speed smoother and ensures that the train runs within the speed limit range, which ensures safety. Based on the original manual driving data, the time error of two minutes is acceptable. The line data after the implementation of the strategy are shown in
Table 9.
In the table, the running time of the optimized driving scheme obtained in this study is 1 min 24 s faster than that of the original scheme. This can be explained by the fact that the strategy in this study achieves the punctuality and accurate parking of trains on the basis of safe driving, which are the preconditions for verifying energy saving.
The traction energy consumption of the train is a piecewise function, and the calculation needs to be segmented. Combined with whether the brake handling is switched and the speed changes, the energy consumption of different segments is counted, and the curve of the total energy consumption with the driving distance based on the strategy proposed in this paper is obtained, as shown in
Figure 8.
After the high-speed train goes 30,000 m in the diagram, the energy saving of this research strategy begins to show. For each interval of 10,000 m, the cumulative energy consumption of the original and the optimized strategy is shown in
Table 10.
In
Table 10, with the increase in driving distance, the energy-saving percentage in this study increases continuously, saving about 18.78% of the traction energy for the whole journey. In the normal driving environment, compared with the manual driving strategy, the optimized driving strategy in this study shows that some energy is saved.
4.3.3. Performance Comparison Test with Other Methods
In order to comprehensively evaluate the effect of the high-speed rail energy-saving strategy proposed in this paper, based on existing data, comparative experiments are designed to discover the performance differences between the VAGE method and the Jaccard and Adamic–Adar methods in depth. The energy-saving percentage of the traction energy consumption with the same distance interval under the VAGE, Adamic–Adar, and Jaccard methods compared with the original strategy is shown in
Table 11.
The VGAE algorithm saves about 17.68% of the traction power throughout the simulated driving route and shows energy savings earlier than the other algorithms. A comparison of the speed and travel time between the proposed method and other existing methods is shown in
Table 12.
The maximum speed of the VGAE method does not exceed 300 km per hour, and the travel time is shorter than that of the other methods. The experimental results can verify the effectiveness of using a multi-view variational graph autoencoder to solve the energy-saving problem of high-speed trains, that is, using less energy in a shorter period of travel.
5. Conclusions
This paper studies the energy consumption algorithm of high-speed train operations and proposes a graph optimization algorithm based on the VGAE, which can fully consider the complex relationship between nodes in the operation of trains. Through a simulation and comparative tests, under the premise of ensuring safe, on-time, and accurate parking, the control method in this study shows increased energy savings.
In the process of solving the driving strategy, this study balanced the sample size of each class of the training set and considered some special cases. A algorithm based on the KNN and undersampling is used to balance the negative impact of the number imbalance between the data sets, and the braking problem of the high-speed train is transformed into the link prediction problem of the implicit graph. The variational graph autoencoder is used to solve the initial strategy of train braking, and the time of intermediate trips is adjusted to ensure that the driving distance is consistent with the length of the route. The actual feasible braking application time and brake handling operation are obtained. However, the training data are based on relatively safe driving conditions, which makes the actual driving situation, especially in the face of a complex or dangerous driving environment, too idealistic. Therefore, for future research, we suggest inputting more train operation factors and real-time data into the VGAE model to obtain a more optimized strategy.
In detail, in order to further evaluate the proximity between research and practical applications, the following aspects need to be considered. First of all, the simulation data used in this study may not be able to fully simulate the various complex situations and changes during the operation of actual trains. The actual train operation environment may be affected by many factors, such as the weather, road conditions, traffic conditions, etc. These factors may have an important impact on the energy consumption of trains and the driving strategy of their drivers. Secondly, our research focuses on the safety, punctuality, and accuracy of trains, but in practical applications, safety is always the primary consideration. Therefore, any control method that optimizes energy consumption must ensure that the safety of the train is not sacrificed. Finally, the training data in the study are relatively restricted, which may limit the generalizability of the algorithm to actual driving environments. Therefore, incorporating more actual operating data and factors into the model should be the focus of future research to ensure the effectiveness and reliability of the algorithm in practical applications.