Expressway Speed Prediction Based on Electronic Toll Collection Data

Zou, Fumin; Ren, Qiang; Tian, Junshan; Guo, Feng; Huang, Shibin; Liao, Lyuchao; Wu, Jinshan

doi:10.3390/electronics11101613


by Fumin Zou ¹, Qiang Ren ¹,*, Junshan Tian ¹, Feng Guo ², Shibin Huang ¹, Lyuchao Liao ³ and Jinshan Wu ¹

¹ Fujian Key Lab for Automotive Electronics and Electric Drive, Fujian University of Technology, Fuzhou 350118, China

² College of Mathematics and Computer Science, Fuzhou University, Fuzhou 350108, China

³ Fujian Provincial Big Data Research Institute of Intelligent Transportation, Fujian University of Technology, Fuzhou 350118, China

* Author to whom correspondence should be addressed.

Electronics 2022, 11(10), 1613; https://doi.org/10.3390/electronics11101613

Submission received: 19 April 2022 / Revised: 14 May 2022 / Accepted: 16 May 2022 / Published: 18 May 2022

(This article belongs to the Special Issue Advanced Intelligent Transportation Systems and Automated Vehicles in Smart Cities)


Abstract:

Expressway section speed directly reflects the operating condition of a section, and accurate short-term section speed prediction has a wide range of applications in path planning and traffic guidance. However, the data used in existing expressway speed prediction have defects, such as sparse detector density and incomplete coverage of vehicles. Thus, this paper proposes a combined expressway traffic speed prediction framework based on wavelet transform and a spatial-temporal graph convolutional network (WSTGCN), using Electronic Toll Collection (ETC) gantry transaction data. First, the framework pre-processes the ETC gantry transaction data to construct the section speeds. Then, wavelet decomposition and single-branch reconstruction are performed on the section speed sequences; for each reconstructed single-branch sequence, the spatial features are captured by a graph convolutional network (GCN) and the temporal features are extracted by a connected gated recurrent unit (GRU). The experiments use the ETC gantry transaction data of the expressway from Quanzhou to Xiamen. The results indicate that the WSTGCN model makes notable improvements over the baseline models for different prediction ranges.

Keywords:

expressway speed prediction; Electronic Toll Collection; graph convolutional network; gated recurrent unit; wavelet transform

1. Introduction

With its high capacity and low time cost, the expressway has become the preferred way to travel between cities [1]. With social and economic development, traditional traffic management techniques struggle to cope with the increasing traffic pressure, and there is an urgent need to develop Intelligent Transportation Systems (ITS) for expressways. With accurate traffic predictions from ITS, travelers can plan reasonable routes before departure and travel more efficiently, and road management departments can conduct traffic guidance and alleviate traffic congestion and other problems based on reliable road traffic information [2]. In recent years, China’s expressway ETC system has networked 29 provinces nationwide, built a total of 24,588 sets of ETC gantry systems, renovated 48,211 ETC lanes, and generated an average of nearly one billion ETC gantry transaction records per day [3], which has further improved the efficiency of expressways. The transaction data collected by the ETC gantry system record the travel information of almost every vehicle on the expressway. Compared with detector data [4,5] and floating car data [6,7], the ETC gantry transaction data are more comprehensive and reliable and cover the expressway road network. Therefore, to further improve the service quality of the expressway ETC system, research on traffic speed prediction based on ETC gantry transaction data has great theoretical significance and practical value [8].

Many researchers in China and abroad have worked on traffic prediction [9]. Traffic prediction methods fall into three main types: statistical learning models, shallow machine learning, and deep learning. Among statistical learning models, Emami et al. [10] proposed a Kalman filter-based method for traffic flow prediction, and Xu et al. [11] proposed an Autoregressive Integrated Moving Average (ARIMA) and Kalman filter method for predicting road traffic status. Statistical learning models usually assume linearity, so they do not reflect the nonlinear features of traffic speed well, and their high complexity makes them ill-suited to high-dimensional data.

Shallow machine learning can address these problems. For example, Hu et al. [12] proposed a Support Vector Regression (SVR)-based method for estimating the average speed of expressway sections to overcome the sparse density of existing expressway vehicle detectors. Evans et al. [13] performed road section state prediction based on Random Forest (RF); with different prediction ranges and training data amounts, the algorithm achieved the best results compared with others. Sun et al. [14] dynamically adjusted the K-nearest neighbor (KNN) parameters for traffic flow prediction and achieved good results in different periods. However, the performance of shallow machine learning methods depends heavily on manually designed features, and they usually fail to produce the best results for prediction tasks with complex regularities and complicated factors.

In recent years, deep learning models have shown superior predictive capabilities. They can automatically extract features and capture correlations in the data, so a large number of deep learning models have been used for traffic prediction. Since traffic data can be represented as time series, GRU [15] and LSTM [16] have gradually been applied to speed prediction. Fu et al. [17] used wavelet transform to decompose the original data and then constructed GRU and Autoregressive Moving Average (ARMA) models to predict low-frequency and high-frequency sequences. Despite the impressive capability of these methods in temporal modeling, accurate traffic prediction is still limited because the spatial characteristics of traffic data are not considered. To solve this problem, Lu et al. [18] proposed a spatial-temporal deep learning network (ST-TrafficNet) for traffic flow forecasting, which captures high-dimensional temporal features while also extracting latent spatial features. Bogaerts et al. [19] combined CNN and LSTM into a spatio-temporal recurrent convolutional neural network that extracts the temporal features and spatial feature variations of traffic speed and achieves high-accuracy traffic speed prediction. However, CNN is designed for Euclidean structures and cannot completely capture the spatial correlation of the actual expressway road network. Zhao et al. [20] proposed a temporal graph convolutional network for traffic prediction, in which GCN and GRU are combined to capture the spatial and temporal features of traffic flow. Pan et al. [21] proposed the dual-channel based graph convolutional network (DC-STGCN) model to fully extract the spatio-temporal characteristics between traffic flows, and achieved good results in long-term prediction.

Regarding expressway speed prediction, most of the data used in existing studies are vehicle detector data and floating car data. However, these data have shortcomings, such as the low density and high damage rate of vehicle detectors and the incomplete vehicle coverage of floating car data. To solve these issues, this paper uses the nearly full-sample ETC gantry transaction data and the WSTGCN model to predict expressway speeds, which can effectively reduce the disturbance in the section speeds and extract the spatio-temporal characteristics between ETC gantries. The main contributions of this work are as follows.

A data pre-processing method on ETC gantry transaction data is designed. The fusion of expressway network topology data, ETC and manual toll collection (MTC) transaction data constitutes spatio-temporal origin-destination (OD) data. Anomaly cleaning, missing repair and vehicle travel time statistics are performed on OD data, and a vehicle travel time outlier detection algorithm is proposed to eliminate outlier samples. In this way, the speed of the expressway section is constructed.
The proposed WSTGCN model consists of wavelet transform, GCN and GRU. It reduces the disturbance of section speed and also captures the spatio-temporal correlation of section speed.
The proposed WSTGCN model is evaluated on the ETC gantry transaction data of the Quanzhou-Xiamen Expressway. The results show that the model has the best prediction of section speed compared with the baseline method. Furthermore, the accuracy is still higher than that of the baseline prediction model in different prediction ranges.

The rest of the paper is organized as follows. The concepts related to expressways and the problem description are introduced in detail in the “Preliminary” section. The model construction for expressway speed prediction is described in detail in the “Methodology” section. In the “Experimental Results and Analysis” section, the WSTGCN model is evaluated using ETC gantry transaction data from Fujian Province. Finally, we present the “Conclusions” of our paper.

2. Preliminary

2.1. Related Concepts

Definition 1. Each ETC gantry of the expressway is called a $\mathit{Node}$, and two adjacent $\mathit{Nodes}$ on the road compose an expressway section, which is referred to as $QD = \{Q, \mathit{Distance}\}$, $Q = \langle \mathit{Node}_{1}, \mathit{Node}_{2} \rangle$, where $\mathit{Node}_{1}$ is the start of the section, $\mathit{Node}_{2}$ is the end of the section, and $\mathit{Distance}$ is the actual distance of the section.

Definition 2. Expressway road network: all $QD$ within the research area form the expressway road network, referred to as $LW = \{QD_{1}, \dots, QD_{n}\}$.

Definition 3. Vehicle trajectory: the sequence of nodes, arranged in chronological order, formed by a vehicle passing the ETC gantries on the expressway is called $\mathit{Traj} = \{\mathit{Node}_{1}, \dots, \mathit{Node}_{n}\}$, where $\mathit{Node}_{1}$ is called the trajectory start point, and $\mathit{Node}_{n}$ is called the trajectory end point.

Definition 4. The average speed of a vehicle passing through a section is called the average vehicle speed. It is calculated as follows:

v_{i} = \frac{\mathit{Distance}}{t_{2} - t_{1}}

(1)

where $t_{1}$ represents the time when the vehicle passes through the starting point of the section, and $t_{2}$ represents the time when the vehicle passes through the end point of the section.

Definition 5. The average speed of vehicles passing through the same section in a certain period of time is called the section speed, and the calculation method is shown in Equation (2):

V = \frac{\sum_{i = 1}^{n} v_{i}}{n}

(2)

where $v_{i}$ represents the average vehicle speed of the $i$th vehicle passing through the section within the period, and $n$ is the number of vehicles passing through the section within the period.

Definition 6. The time a vehicle takes to pass through a certain section is called the vehicle travel time, and the calculation method is shown in Equation (3):

Δ t_{i} = t_{2} - t_{1}

(3)

where $t_{2}$ represents the time of passing the gantry at the end of the section, and $t_{1}$ represents the time of passing the gantry at the start of the section.
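
As a minimal sketch of Definitions 4 to 6, the following Python snippet computes the travel time, the average vehicle speed, and the section speed for one section. The function names, the units (kilometres, seconds, km/h) and the sample values are assumptions made for illustration and are not taken from the experimental data.

```python
def travel_time(t1, t2):
    """Equation (3): time difference (s) between the two gantries of a section."""
    return t2 - t1


def average_vehicle_speed(distance_km, t1, t2):
    """Equation (1): section distance divided by travel time, returned in km/h."""
    hours = travel_time(t1, t2) / 3600.0
    return distance_km / hours


def section_speed(vehicle_speeds):
    """Equation (2): arithmetic mean of the average vehicle speeds."""
    return sum(vehicle_speeds) / len(vehicle_speeds)


# Two hypothetical vehicles crossing the same 10 km section.
speeds = [
    average_vehicle_speed(10.0, t1=0.0, t2=360.0),   # 6 min for 10 km
    average_vehicle_speed(10.0, t1=60.0, t2=510.0),  # 7.5 min for 10 km
]
V = section_speed(speeds)
```

With these made-up timestamps the two vehicles travel at about 100 km/h and 80 km/h, so the section speed is about 90 km/h.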

2.2. Problem Description

The expressway road network can be abstracted as a graph. Generally, the unweighted graph $G = (φ, E)$ can be used to represent the topology of the expressway road network, where $φ = \{φ_{1}, φ_{2}, \dots, φ_{N}\}$ represents the set of all nodes on the expressway road network, and $N$ represents the number of $\mathit{Nodes}$. $E$ represents the set of edges connecting the $\mathit{Nodes}$. All the connection information between $\mathit{Nodes}$ is stored in the adjacency matrix $A \in R^{(N - 1) \times (N - 1)}$, which contains only 0 and 1 elements: 0 indicates that there is no connection between $\mathit{Nodes}$, and 1 indicates that there is a connection between $\mathit{Nodes}$. The section speed can be regarded as the attribute feature of the expressway road network $\mathit{Node}$ and is represented by the feature matrix $X \in R^{(N - 1) \times P}$, where $P$ is the number of attribute features of the $\mathit{Node}$, i.e., the length of the historical time series. $X_{t} \in R^{(N - 1) \times i}$ represents the section speed of all sections at time interval $i$.

Therefore, the problem of expressway speed prediction is to learn a mapping function F based on the feature matrix X of the section speed in the past under the topology graph G of the expressway road network to predict the section speed at T times in the future.

X_{t + T} = F ((X_{t - n}, \dots, X_{t - 1}, X_{t}); G)

(4)

where t is the time interval, n is the length of the historical time series, and T is the length of the time series to be forecasted.
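
To make the input and output of Equation (4) concrete, the sketch below slices a section speed series into supervised samples. The helper name `make_samples`, the list-of-lists layout (one row per time interval, one column per section) and the toy values are assumptions; the mapping F itself is not implemented here.

```python
def make_samples(speed_series, n, T):
    """Build (history, target) pairs in the form of Equation (4).

    speed_series[t] holds the speeds of all sections at time interval t.
    Each history is (X_{t-n}, ..., X_t) and each target is X_{t+T}.
    """
    samples = []
    for t in range(n, len(speed_series) - T):
        history = speed_series[t - n:t + 1]  # n + 1 consecutive intervals
        target = speed_series[t + T]         # T intervals ahead of t
        samples.append((history, target))
    return samples


# Toy series: 6 time intervals, 2 sections (made-up speeds in km/h).
series = [[90, 80], [92, 81], [88, 79], [85, 75], [87, 78], [91, 82]]
samples = make_samples(series, n=2, T=1)
```

Each sample pairs the most recent intervals with the speeds one step ahead, which is the form a model trained to approximate F would consume.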

3. Methodology

3.1. Overview of the Overall Framework

This paper predicts expressway section speed with the WSTGCN model, which is divided into four modules: the expressway road network spatio-temporal OD data construction module, the OD data pre-processing module, the spatio-temporal feature extraction module, and the output module. Figure 1 shows the overall framework of expressway speed prediction. The spatio-temporal OD data construction module takes expressway road network topology data, ETC transaction data and MTC transaction data as input. Guided by the road network topology data, the ETC and MTC transaction records are grouped by OBU plate and flag ID, and each group is then sorted by trade time to form the spatio-temporal OD data set. The data pre-processing module includes a data interpolation module, a vehicle travel time construction module, a travel time abnormality detection module, and a section speed generation module. After the missing data of some trajectories are interpolated, the vehicle travel time of each section is constructed; anomalies are then detected using the vehicle travel time outlier detection algorithm, and finally the section speed data set is constructed. The spatio-temporal feature extraction module includes wavelet transform, GCN, and GRU. Multi-scale wavelet decomposition is applied to the section speed time series, and each branch is reconstructed to obtain the single-branch section speed series; GCN is then used to capture the spatial features of section speed, and GRU is used to capture its temporal features. Finally, the output module sums the predicted values of the reconstructed single-branch series to obtain the overall speed prediction result, which accounts for the spatio-temporal characteristics.

3.2. Data Pre-Processing

Vehicles enter the expressway through the ETC channel and MTC channel of the expressway toll station, and the expressway ETC gantry system can record the driving information of vehicles entering from the ETC channel and MTC channel at the same time. Therefore, ETC gantry transaction data are more complete. ETC gantry transaction data includes ETC transaction data and MTC transaction data. According to the ETC gantry transaction data statistics of this experiment, the percentages of ETC transaction data and MTC transaction data are shown in Figure 2.

3.2.1. Raw Data Cleaning

During ETC gantry transaction data collection, factors beyond control, such as equipment abnormalities, wireless crosstalk, and bad weather, cause three main kinds of abnormal records, as shown in Figure 3. (1) Data redundancy: records are duplicated across multiple sets of data. (2) Missing data: data are not collected effectively; for example, fields such as date, time, and vehicle type are missing at the entrance and exit station. (3) Data errors: records that do not match normal traffic rules, such as an entrance station date later than the exit station date, or entrance and exit numbers that do not correspond to actual toll stations. These abnormal data greatly reduce the value of ETC big data-mining applications. To reduce the impact of erroneous data on the accuracy of the prediction model and increase the reliability of prediction, such data are removed.
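
The three cleaning rules can be sketched as a single filtering pass. The record layout (dictionaries with `plate`, `entry_time`, `exit_time`, `entry_station`, `exit_station`) and the station identifiers are hypothetical; the actual ETC gantry transaction schema has many more fields and may use different names.

```python
from datetime import datetime

# Hypothetical field names, chosen only for this illustration.
REQUIRED_FIELDS = ("plate", "entry_time", "exit_time", "entry_station", "exit_station")


def clean_records(records, valid_stations):
    """Drop duplicate, incomplete and logically inconsistent records."""
    seen = set()
    cleaned = []
    for rec in records:
        # (2) Missing data: a required field is absent or empty.
        if any(not rec.get(field) for field in REQUIRED_FIELDS):
            continue
        # (3) Data errors: unknown station number or entry later than exit.
        if rec["entry_station"] not in valid_stations:
            continue
        if rec["exit_station"] not in valid_stations:
            continue
        entry = datetime.fromisoformat(rec["entry_time"])
        leave = datetime.fromisoformat(rec["exit_time"])
        if entry > leave:
            continue
        # (1) Data redundancy: keep only the first copy of identical records.
        key = tuple(rec[field] for field in REQUIRED_FIELDS)
        if key in seen:
            continue
        seen.add(key)
        cleaned.append(rec)
    return cleaned


def _rec(plate, entry, leave, s_in="S1", s_out="S2"):
    return {"plate": plate, "entry_time": entry, "exit_time": leave,
            "entry_station": s_in, "exit_station": s_out}


records = [
    _rec("A1", "2020-06-01 08:00:00", "2020-06-01 09:00:00"),
    _rec("A1", "2020-06-01 08:00:00", "2020-06-01 09:00:00"),  # duplicate
    _rec("B2", "", "2020-06-01 09:00:00"),                     # missing field
    _rec("C3", "2020-06-01 10:00:00", "2020-06-01 09:00:00"),  # entry after exit
    _rec("D4", "2020-06-01 08:00:00", "2020-06-01 09:00:00", s_out="S9"),  # unknown station
]
cleaned = clean_records(records, valid_stations={"S1", "S2"})
```

Only the first record survives in this toy example; the other four each trigger one of the three rules.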

3.2.2. Vehicle Travel Time Construction

After the abnormal ETC gantry transaction data are eliminated, the travel trajectory of each vehicle is constructed in time order. Each trajectory is then traversed pairwise: for every two consecutive ETC gantries in the trajectory, the ETC gantry topology data of the expressway road network are checked to determine whether the two gantries form an adjacent section. If they do, the travel time of the vehicle through the section is calculated directly. If they do not, the path between the two gantries is searched in the topology data, and the trajectory is interpolated with the gantries on that path. The average speed of the vehicle over the searched path is calculated and taken as its average speed on every section between the two gantries, so the travel time of each such section can be derived from the section distance. The specific construction method is shown in Algorithm 1.

Algorithm 1 Vehicle travel time construction algorithm.

Input: Vehicle trajectory data $\mathit{Traj}$; expressway road network topology data $LW$;

Output: Vehicle travel time data $Δ t$;

$\mathit{Traj} = \{\mathit{Node}_{1}, \dots, \mathit{Node}_{n}\}$, $LW = \{QD_{1}, \dots, QD_{n}\}$, $QD = \{Q, \mathit{Distance}\}$;
for $i = 1$ to $n - 1$ do
    $\mathit{Node}_{i}, \mathit{Time}_{i}, \mathit{Node}_{i + 1}, \mathit{Time}_{i + 1}$ //extract information of adjacent nodes
    $Δ t_{i} = \mathit{Time}_{i + 1} - \mathit{Time}_{i}$ //compute the time difference of adjacent nodes
    $Q = (\mathit{Node}_{i}, \mathit{Time}_{i}, \mathit{Node}_{i + 1}, \mathit{Time}_{i + 1})$ //save the information of adjacent nodes
    if $\langle \mathit{Node}_{i}, \mathit{Node}_{i + 1} \rangle$ in $LW$ then //the adjacent nodes form a section in the topology data
        $Δ t \leftarrow (Q, Δ t_{i})$ //the vehicle passage time is kept unchanged
    else //the adjacent nodes do not form a section in the topology data
        $\{\mathit{Node}_{j}\} \leftarrow$ shortest path from $\mathit{Node}_{i}$ to $\mathit{Node}_{i + 1}$ in $LW$ //search for the shortest path
        $\{\mathit{distance}_{j}\} \leftarrow LW$ //distances of the sections on the shortest path
        $v = \sum_{j} \mathit{distance}_{j} \div Δ t_{i}$ //average speed between the front and back gantries
        for each section $j$ on the shortest path do
            $Δ t_{j} = \mathit{distance}_{j} \div v$ //travel time of the interpolated section
            $Δ t \leftarrow (Q_{j}, Δ t_{j})$ //replace the original pass time with the new pass time
return $Δ t$;
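
A runnable sketch of the idea behind Algorithm 1 is given below. It assumes the topology is a dictionary mapping directed gantry pairs to section distances in kilometres and that a trajectory is a time-ordered list of `(gantry, time in seconds)` tuples; these structures, the function names, and the use of Dijkstra's algorithm for the path search are illustrative choices rather than the implementation used in the experiments.

```python
import heapq


def shortest_path(topology, start, goal):
    """Dijkstra search over directed sections {(node_a, node_b): distance_km}."""
    graph = {}
    for (a, b), dist in topology.items():
        graph.setdefault(a, []).append((b, dist))
    queue = [(0.0, start, [start])]
    visited = set()
    while queue:
        cost, node, path = heapq.heappop(queue)
        if node == goal:
            return path
        if node in visited:
            continue
        visited.add(node)
        for nxt, dist in graph.get(node, []):
            if nxt not in visited:
                heapq.heappush(queue, (cost + dist, nxt, path + [nxt]))
    return None


def build_travel_times(trajectory, topology):
    """Return [((node_a, node_b), travel_time_s), ...] for one vehicle."""
    result = []
    for (a, t_a), (b, t_b) in zip(trajectory, trajectory[1:]):
        dt = t_b - t_a
        if (a, b) in topology:
            # Adjacent gantries: keep the observed travel time.
            result.append(((a, b), dt))
            continue
        path = shortest_path(topology, a, b)
        if path is None or dt <= 0:
            continue  # nothing sensible to interpolate
        sections = list(zip(path, path[1:]))
        total_km = sum(topology[s] for s in sections)
        speed = total_km / dt  # average speed over the whole gap (km/s)
        for s in sections:
            # Each skipped section gets a time proportional to its length.
            result.append((s, topology[s] / speed))
    return result


topology = {("G1", "G2"): 5.0, ("G2", "G3"): 10.0, ("G3", "G4"): 5.0}
trajectory = [("G1", 0.0), ("G2", 200.0), ("G4", 800.0)]  # G3 was missed
times = build_travel_times(trajectory, topology)
```

In this toy trajectory the gantry G3 is missing, so the 600 s observed between G2 and G4 are split over the two skipped sections in proportion to their lengths.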

3.2.3. Vehicle Travel Time Outlier Detection Algorithm

The constructed vehicle travel time data are objectively recorded, but they still contain ETC gantry transaction data produced by abnormal driving behavior, for example, a travel time that is much longer or much shorter than the normal value for vehicles of a similar type. Therefore, a vehicle travel time outlier detection algorithm is constructed to further reject such data. This algorithm combines the outlier information detection algorithm in [22] with the outlier elimination algorithm in [23]. In the expressway ETC gantry transaction data, there are only a very few cases where the vehicle travel time is shorter than the normal value, and most of the outliers are long vehicle travel times, resulting in an asymmetric error interval. If only the outlier information detection algorithm is used, the 75% quantile value is relatively high, while the 25% quantile value is closer to the sample mean, so data whose vehicle travel time is much lower than the sample mean cannot be eliminated. Combining the two methods solves this problem. The basic idea of the vehicle travel time outlier detection algorithm is to use both the upper and lower limits of the box plot and the mean-centered threshold of the statistical distribution of the data to determine the threshold interval for filtering abnormal travel times, and to eliminate the data outside this interval, so that abnormal data can be filtered quickly from the massive ETC gantry transaction data, as shown in Figure 4.

t_{\mathrm{down}} = \max (t_{25\%} - 1.5 \times (t_{75\%} - t_{25\%}), t_{\mathrm{mean}} - 2 σ)

(5)

t_{\mathrm{up}} = \min (t_{75\%} + 1.5 \times (t_{75\%} - t_{25\%}), t_{\mathrm{mean}} + 2 σ)

(6)

where $t_{25\%}$ is the 25th percentile of the vehicle travel time, $t_{75\%}$ is the 75th percentile of the vehicle travel time, $t_{\mathrm{mean}} = \frac{\sum_{i = 1}^{N} t_{i}}{N}$ is the mean value of the vehicle travel time over the $N$ samples, and $σ$ is the standard deviation of the vehicle travel time. The final valid interval of the vehicle travel time is $Δ t \in [t_{\mathrm{down}}, t_{\mathrm{up}}]$. If the travel time of a vehicle passing through a section in a certain time period falls within this interval, the average vehicle speed of the vehicle through the section is generated directly, and the section speed is generated with a statistical window of 15 min.
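
Equations (5) and (6) can be sketched with the Python standard library as follows. The quantile method (the default "exclusive" method of `statistics.quantiles`) and the use of the population standard deviation are assumptions, since the text does not specify them; the travel times are made-up values in minutes.

```python
import statistics


def travel_time_bounds(times):
    """Equations (5) and (6): intersect the box-plot and mean +/- 2 sigma limits."""
    q25, _, q75 = statistics.quantiles(times, n=4)
    iqr = q75 - q25
    mean = statistics.fmean(times)
    sigma = statistics.pstdev(times)
    t_down = max(q25 - 1.5 * iqr, mean - 2 * sigma)
    t_up = min(q75 + 1.5 * iqr, mean + 2 * sigma)
    return t_down, t_up


def filter_travel_times(times):
    """Keep only the travel times inside the valid interval [t_down, t_up]."""
    t_down, t_up = travel_time_bounds(times)
    return [t for t in times if t_down <= t <= t_up]


# Nine ordinary travel times and one very long one (minutes, hypothetical).
times = [10, 11, 12, 13, 14, 15, 16, 17, 18, 100]
t_down, t_up = travel_time_bounds(times)
valid = filter_travel_times(times)
```

In this example the single long travel time inflates the standard deviation so much that the mean ± 2σ band alone would keep it, while the box-plot limit removes it; taking the tighter of the two limits on each side is what Equations (5) and (6) express.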

3.3. Spatio-Temporal Feature Extraction

Based on the GCN-GRU model, wavelet transform is added to decompose and reconstruct the expressway traffic speed so that its spatio-temporal trend can be captured. The structure of the prediction model is shown in Figure 5, which contains three parts: (a) wavelet transform, (b) GCN, and (c) GRU.

3.3.1. Wavelet Transform

The section speed derived from expressway ETC gantry transaction data contains considerable noise due to its periodic volatility, and noisy data severely degrade speed prediction. In real traffic analysis, real speed signals are usually low-frequency or relatively stable signals, while noise signals are mostly high-frequency signals [24]. Therefore, wavelet transform is used to filter the noise out of the calculated section speed signals and obtain relatively accurate section speed data. To separate the low-frequency part and the high-frequency part of the original signal, Mallat et al. proposed a multiscale decomposition and reconstruction algorithm for signals, the principle of which is shown in Figure 6.

A_{j} [f (t)] = \sum_{k} H (2 t - k) A_{j - 1} [f (t)]

(7)

D_{j} [f (t)] = \sum_{k} G (2 t - k) A_{j - 1} [f (t)]

(8)

In the formula, $t$ is the index of the time series data, $t = 1, 2, \dots, n$; $f (t)$ is the original signal; $j$ is the number of decomposition layers; $H$ and $G$ are the wavelet decomposition filters in the time domain; $A_{j}$ is the wavelet coefficient of the low-frequency part of the signal $f (t)$ in the $j$th layer; and $D_{j}$ is the wavelet coefficient of the high-frequency part of the signal $f (t)$ in the $j$th layer. The decomposed signal can be reconstructed using Equation (9).

f (t) = D_{1} + \dots + D_{j} + A_{j}

(9)

In expressway speed prediction, the original section speed data form a non-stationary time series. Among the many wavelet families, the sym wavelet is an approximately symmetric, biorthogonal function with near-linear phase. It offers good smoothness and simple computation, and it has achieved good results in related research [25,26]. The sym5 wavelet is one of the commonly used wavelets in the sym wavelet group, so it is chosen as the basis function in this paper. The number of decomposition layers should be neither too large nor too small. If it is too large, the variation pattern and trend of the section speed series are weakened. If it is too small, the signals with different frequency characteristics in the original section speed signal cannot be separated effectively. Following existing research on wavelet transform for noise reduction of time series data [27], the number of decomposition layers is set to 3. The high-frequency part of the decomposed signal in each layer is processed using a threshold function. Finally, the low-frequency speed signal of the last layer is reconstructed together with the thresholded high-frequency speed signal of each layer to obtain the noise-reduced section speed data. The decomposition results are shown in Figure 7, and the specific method is described in Algorithm 2.

Algorithm 2 Wavelet transform algorithm.

Input: the set of section speed time series $v$;

Output: the set of section speed time series $v^{'}$;

$v = \{v_{1}, \dots, v_{n}\}$, $ca = []$, $cd = []$, $reca = []$, $recd = []$;
Select the sym5 wavelet as the basis function;
$j = 3$ //the number of decomposition layers is specified as 3 layers
for $i$ in range($j$) do
    $ca \leftarrow ca_{i} + \dots + ca_{j}$ //store the trend signal
    $cd \leftarrow cd_{i} + \dots + cd_{j}$ //store the noise signal
for $i$ in $ca$ do
    $reca \leftarrow ca_{1} + \dots + ca_{i}$ //store the reconstructed trend signal
for $i$ in $cd$ do
    $recd \leftarrow cd_{1} + \dots + cd_{i}$ //store the reconstructed noise signal
$v^{'} \leftarrow reca$ //wrap the reconstructed trend signal into $v^{'}$
return $v^{'}$;
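
The decomposition and single-branch reconstruction in Equations (7) to (9) can be illustrated with a small self-contained example. To keep it dependency-free, the sketch uses the Haar wavelet instead of sym5 and omits the thresholding step; it therefore shows the mechanism, not the exact filter used in this paper. The function names and the speed values are assumptions, and the series length must be divisible by $2^{j}$.

```python
import math

SQRT2 = math.sqrt(2.0)


def haar_step(signal):
    """One decomposition level: low-frequency (approx) and high-frequency (detail)."""
    pairs = [(signal[i], signal[i + 1]) for i in range(0, len(signal), 2)]
    approx = [(a + b) / SQRT2 for a, b in pairs]
    detail = [(a - b) / SQRT2 for a, b in pairs]
    return approx, detail


def haar_inverse_step(approx, detail):
    """Invert one decomposition level."""
    signal = []
    for a, d in zip(approx, detail):
        signal.extend([(a + d) / SQRT2, (a - d) / SQRT2])
    return signal


def decompose(signal, levels):
    """Return (A_J, [D_1, ..., D_J]) coefficient lists."""
    approx, details = list(signal), []
    for _ in range(levels):
        approx, detail = haar_step(approx)
        details.append(detail)
    return approx, details


def single_branch_series(signal, levels=3):
    """Reconstruct each branch alone by zeroing every other coefficient set."""
    approx, details = decompose(signal, levels)
    branches = {}
    for keep in ["A"] + list(range(1, levels + 1)):
        current = list(approx) if keep == "A" else [0.0] * len(approx)
        for level in range(levels, 0, -1):
            coeffs = details[level - 1]
            detail = coeffs if keep == level else [0.0] * len(coeffs)
            current = haar_inverse_step(current, detail)
        name = "A%d" % levels if keep == "A" else "D%d" % keep
        branches[name] = current
    return branches


# Eight made-up section speeds (km/h), one per 15 min window.
speeds = [92.0, 95.0, 90.0, 88.0, 70.0, 65.0, 85.0, 91.0]
branches = single_branch_series(speeds, levels=3)
restored = [sum(branch[i] for branch in branches.values()) for i in range(len(speeds))]
```

Because the transform is linear, summing the branches `A3`, `D1`, `D2` and `D3` recovers the original series, which mirrors Equation (9). With a library such as PyWavelets, the same structure could be obtained with the sym5 wavelet.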

3.3.2. Graph Convolutional Networks (GCN)

The ETC gantries of the expressway have different topological relationships in different sections, and the mutual influence between gantries differs with their topological relationship. If the topological relationship between the ETC gantries can be fully extracted and used, the speed prediction will be more accurate. An ordinary CNN can only handle Euclidean data with a regular structure and cannot handle irregular non-Euclidean data. Therefore, the GCN model was proposed in [28] to deal with non-Euclidean data. The spatial distribution of the ETC gantries of the expressway is a non-Euclidean structure, so the GCN model is used to model it. We treat the data as signals on a graph and process these graph signals to capture meaningful spatial patterns and features. The connection relationship and mutual influence in the graph are represented by the Laplacian matrix of the graph, which is defined as:

L = D - A

(10)

The normalized Laplacian matrix is:

L = I_{n} - D^{- \frac{1}{2}} A D^{- \frac{1}{2}}

(11)

where $I_{n} \in R^{N \times N}$ is the identity matrix, and $D$ is the degree matrix with $D_{i i} = \sum_{j} A_{i j}$. The eigendecomposition of $L$ gives $L = U Λ U^{T}$, where $Λ = \mathrm{diag} ([λ_{1}, \dots, λ_{N}])$ is a diagonal matrix composed of the eigenvalues of $L$, and $U = \{u_{1}, \dots, u_{N}\}$ is an orthonormal matrix consisting of the eigenvectors of $L$. For an input signal $x \in R^{N}$, the graph Fourier transform is $\hat{x} = U^{T} x$, and its inverse is $x = U \hat{x}$. The convolution of the kernel $g$ with the input signal $x$ in the vertex domain can be converted into an element-wise product in the frequency domain:

g * x = U ((U^{T} g) ⨀ (U^{T} x)) = U g_{Θ} (Λ) U^{T} x

(12)

where $g_{Θ} (Λ) = U^{T} g = \mathrm{diag} (Θ)$, ⨀ represents the Hadamard product, and $U^{T} g$ means mapping $g$ to the frequency domain based on $U$. Due to the high computational complexity of $g_{Θ}$, the layer-wise linear model constraints [29] and Chebyshev polynomials [30] are used to approximate the calculation. This paper adopts the simplified first-order polynomial form of $g * x$:

g * x = U g_{Θ} U^{T} x \approx Θ (I_{n} + D^{- \frac{1}{2}} A D^{- \frac{1}{2}}) x

(13)

Here, $I_{n} + D^{- \frac{1}{2}} A D^{- \frac{1}{2}}$ is replaced by its renormalized form ${\tilde{D}}^{- \frac{1}{2}} \tilde{A} {\tilde{D}}^{- \frac{1}{2}}$, where $\tilde{A} = I_{n} + A$ and ${\tilde{D}}_{i i} = \sum_{j} {\tilde{A}}_{i j}$, so the output of layer $l$ is:

X^{l} = σ ({\tilde{D}}^{- \frac{1}{2}} \tilde{A} {\tilde{D}}^{- \frac{1}{2}} X^{l - 1} W^{l - 1})

(14)

$A$ represents the adjacency matrix, which describes the connection relationship between expressway nodes. Each row in $A$ represents a section, and each value in $A$ represents the connection between sections. $\tilde{A} = A + I$ adds self-connections so that each ETC gantry retains its own feature information (section speed) while capturing that of its adjacent nodes. $\tilde{D}$ is the degree matrix of $\tilde{A}$, and the normalization $\tilde{D}^{- \frac{1}{2}} \tilde{A} \tilde{D}^{- \frac{1}{2}}$ prevents the gradient from exploding or vanishing when feature information is propagated layer by layer, which would make further training impossible. $W^{l - 1}$ is the weight matrix of the GCN model, $X^{l - 1}$ represents the feature matrix of the section speed, in which each row represents a different section and each column represents the section speed in the same time interval, and $σ$ represents an activation function.

As shown in Figure 8, assuming that each node in the figure represents an expressway ETC gantry, the essence of the GCN model is to capture a linear combination of the features of a gantry's adjacent nodes and its own features. Therefore, $\mathit{Node}_{1}$ can obtain the spatial characteristics of itself and its surrounding nodes through the GCN model.
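
The propagation rule of Equation (14) can be sketched on a three-section chain. The example is written with plain Python lists to stay self-contained; the ReLU activation, the single-column feature matrix and the identity weight are assumptions chosen so that the effect of the normalized adjacency matrix is easy to read off.

```python
import math


def normalized_adjacency(adj):
    """Return D~^{-1/2} (A + I) D~^{-1/2} for a square 0/1 adjacency matrix."""
    n = len(adj)
    a_tilde = [[adj[i][j] + (1 if i == j else 0) for j in range(n)] for i in range(n)]
    degree = [sum(row) for row in a_tilde]
    return [[a_tilde[i][j] / math.sqrt(degree[i] * degree[j]) for j in range(n)]
            for i in range(n)]


def matmul(a, b):
    """Plain matrix product of two lists of lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]


def gcn_layer(adj, features, weights):
    """Equation (14), with ReLU assumed as the activation function."""
    propagated = matmul(matmul(normalized_adjacency(adj), features), weights)
    return [[max(0.0, value) for value in row] for row in propagated]


# Chain of three sections: 0 - 1 - 2 (hypothetical topology).
adj = [[0, 1, 0],
       [1, 0, 1],
       [0, 1, 0]]
features = [[90.0], [60.0], [80.0]]  # one speed value per section (km/h)
weights = [[1.0]]                    # identity weight, for illustration only
a_hat = normalized_adjacency(adj)
output = gcn_layer(adj, features, weights)
```

Each output row is a weighted combination of the section's own speed and the speeds of its neighbours, which is the linear combination described for Figure 8. Sections 0 and 2 are not connected, so they do not influence each other within a single layer.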

3.3.3. Gated Recurrent Unit (GRU)

Currently, RNN models are commonly used to process sequence data. However, the traditional RNN suffers from vanishing and exploding gradients during training [31]. To solve this problem, GRU and LSTM were proposed as variants of RNN. Compared with LSTM, the GRU model has a simpler structure, fewer parameters, and a shorter training time. Therefore, the GRU model was selected to obtain temporal features from the section speed, and its structure is shown in Figure 9.

There are two gate units in the hidden layer of GRU, the reset gate ($r_{t}$) and the update gate ($z_{t}$). $z_{t}$ indicates how much of the state information of the previous moment is transferred to the current state, and $r_{t}$ indicates how much of the state information of the previous moment is ignored. The calculation process of GRU is as follows:

z_{t} = σ (W_{z} \cdot [x_{t}, H_{t - 1}])

(15)

r_{t} = σ (W_{r} \cdot [x_{t}, H_{t - 1}])

(16)

Equations (15) and (16) show how to set the update gate $z_{t}$ and the reset gate $r_{t}$. $W_{z}$ and $W_{r}$ represent the weights of the update gate and the reset gate, respectively, $σ$ represents the activation function, $x_{t}$ represents the section speed at the current moment, and $H_{t - 1}$ represents the hidden state at time $t - 1$.

{\tilde{H}}_{t} = σ (W_{h} \cdot [r_{t} \times H_{t - 1}, x_{t}])

(17)

H_{t} = (1 - z_{t}) \times H_{t - 1} + z_{t} \times {\tilde{H}}_{t}

(18)

In Equation (17), the output of the reset gate at the current time is multiplied by the hidden state of the previous time, and the candidate hidden state ${\tilde{H}}_{t}$ is then calculated through a fully connected layer followed by the activation function. In Equation (18), the update gate $z_{t}$ at the current time weights the hidden state $H_{t - 1}$ at the previous time and the candidate hidden state ${\tilde{H}}_{t}$ at the current time, and their weighted average gives the hidden state $H_{t}$.

In general, this paper feeds the section speeds of the previous n time steps into the GRU model, which obtains the section speed at time t from the hidden state at time $t - 1$ and the current section speed. The model thus captures the section speed at the current moment while retaining the trend of the historical traffic information, and obtains the dynamic temporal characteristics of the section speed.
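A single GRU update following Equations (15)–(18) can be sketched as below. The sketch makes several assumptions that are not stated in the text: the update and reset gates use separate weight matrices (named `w_z` and `w_r` here), biases are omitted as in the equations, and the candidate state uses tanh, the usual choice in GRU implementations, whereas Equation (17) writes a generic activation σ. All weights and inputs are illustrative.

```python
import math


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def affine(weights, vec):
    """Multiply a weight matrix (list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, vec)) for row in weights]


def gru_step(x_t, h_prev, w_z, w_r, w_h, candidate_act=math.tanh):
    """One GRU update; x_t and h_prev are lists of floats."""
    concat = x_t + h_prev                                     # [x_t, H_{t-1}]
    z_t = [sigmoid(v) for v in affine(w_z, concat)]           # Eq. (15), update gate
    r_t = [sigmoid(v) for v in affine(w_r, concat)]           # Eq. (16), reset gate
    gated = [r * h for r, h in zip(r_t, h_prev)] + x_t        # [r_t * H_{t-1}, x_t]
    h_cand = [candidate_act(v) for v in affine(w_h, gated)]   # Eq. (17), candidate state
    # Eq. (18): weighted average of the previous and candidate states.
    return [(1.0 - z) * h + z * c for z, h, c in zip(z_t, h_prev, h_cand)]


def run_gru(sequence, h0, w_z, w_r, w_h):
    """Feed a sequence of inputs through the cell and return the last hidden state."""
    h = h0
    for x_t in sequence:
        h = gru_step(x_t, h, w_z, w_r, w_h)
    return h


# Illustrative weights: input size 1, hidden size 2.
# Columns of w_z and w_r follow [x, h1, h2]; columns of w_h follow [h1, h2, x].
w_z = [[0.5, -0.3, 0.1], [0.2, 0.4, -0.6]]
w_r = [[-0.4, 0.3, 0.2], [0.1, -0.2, 0.5]]
w_h = [[0.3, -0.1, 0.8], [-0.5, 0.2, 0.6]]
speeds = [[0.62], [0.58], [0.40]]          # normalized section speeds (made up)
h_last = run_gru(speeds, [0.0, 0.0], w_z, w_r, w_h)
```

Because Equation (18) is a convex combination, the hidden state stays within the range of the candidate activation whenever the initial state does.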

4. Experimental Results and Analysis

4.1. Data Description and Pre-Processing

The experimental data consist of two parts. The first is the ETC gantry transaction data of the Fuzhou South to Xiamen North Expressway in Fujian Province for the 30 days from 1 to 30 June 2020, provided by Fujian Expressway Information Technology Co., Ltd. (2F, Building 1, No.27 Jinji Shan Road, Jinan District, Fuzhou City, China). The transaction data contain 103 dimensions, such as transaction identifier, trade time, gantry number, OBU plate, OBU status and user type, with a total of about 20.53 million data samples. The main attributes of the ETC gantry transaction data used in this paper are shown in Table 1. The second is the topological relationship data of the expressway ETC gantries: according to the longitude and latitude coordinates of each gantry, the section distances are crawled from Amap, and the resulting data include the names of the ETC gantries of each section and the actual section distance.

The ETC transaction data are matched with the topological data to construct the vehicle travel time. Because some vehicles enter a service area or break down, a certain amount of abnormal data is generated. Taking 15 min as the statistical window for each section, the vehicle travel time outlier detection algorithm is used to detect and eliminate the abnormal data and retain the valid data. The average speed of each vehicle passing through each section is then calculated, and the speed dataset of the expressway sections is constructed at 15-min intervals; its main attributes are shown in Table 2.
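The construction of 15-min section speeds from travel times might look roughly like the sketch below. It rests on assumptions: a vehicle is assigned to a window by the time it passes the downstream gantry, the window speed is the mean of the per-vehicle speeds, and a simple median-based rule (`max_ratio`, hypothetical) stands in for the paper's travel time outlier detection algorithm, which is defined in Section 3.2.3 and not reproduced here. The records and the 10 km distance are made up.

```python
from collections import defaultdict
from datetime import datetime
from statistics import median


def window_start(ts, minutes=15):
    """Floor a timestamp to the start of its statistical window."""
    return ts.replace(minute=ts.minute - ts.minute % minutes, second=0, microsecond=0)


def section_speeds(records, distance_km, max_ratio=2.0):
    """Average speed (km/h) per 15-min window for one section.

    records: iterable of (passing_time, travel_time_seconds).
    max_ratio: placeholder outlier rule, NOT the paper's algorithm; travel
    times above max_ratio times the window median are discarded.
    """
    windows = defaultdict(list)
    for passing_time, travel_s in records:
        windows[window_start(passing_time)].append(travel_s)

    speeds = {}
    for start, times in sorted(windows.items()):
        med = median(times)
        kept = [t for t in times if t <= max_ratio * med]
        per_vehicle = [distance_km / (t / 3600.0) for t in kept]
        speeds[start] = sum(per_vehicle) / len(per_vehicle)
    return speeds


records = [
    (datetime(2020, 6, 1, 13, 16), 360),    # 100 km/h
    (datetime(2020, 6, 1, 13, 20), 400),    # 90 km/h
    (datetime(2020, 6, 1, 13, 25), 1800),   # e.g. a stop in a service area
    (datetime(2020, 6, 1, 13, 31), 450),    # 80 km/h
]
speeds = section_speeds(records, distance_km=10.0)
```

In this toy input the 1800 s record is dropped by the placeholder rule, so the 13:15 window averages the two remaining vehicles.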

In this paper, min-max normalization is used to map the data to the [0, 1] interval, and the normalization formula is shown in Equation (19):

z = \frac{x - x_{\min}}{x_{\max} - x_{\min}}

(19)

In Equation (19), z represents the normalized data, x represents the original data, $x_{\min}$ is the minimum value of x, and $x_{\max}$ is the maximum value of x. After the above pre-processing, section speeds are obtained for 16 sections in both directions of travel every 15 min from 00:00 to 24:00. This yields a time series of section speed with 96 data samples per day and a total of 2880 data samples in 30 days. The first 80% of the 30 days of section speed data were used as the training set and the remaining 20% as the test set. The section speeds were predicted for the next 15 min, 30 min, and 45 min.
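The normalization in Equation (19) and the sample construction can be illustrated with a short sketch. Several details are assumptions: the series is synthetic, min-max statistics are taken over the whole series, the history length of 12 follows the experimental setup described in Section 4.4, and the 80/20 split is applied chronologically to the windowed samples; the text does not say whether the split happens before or after windowing.

```python
import math


def min_max_normalize(values):
    """Map values to [0, 1] as in Equation (19); also return the bounds."""
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values], lo, hi


def make_windows(series, seq_len=12, horizon=1):
    """Build (history, target) pairs.

    horizon counts 15-min steps ahead: 1, 2, 3 correspond to 15, 30, 45 min.
    """
    samples = []
    for start in range(len(series) - seq_len - horizon + 1):
        history = series[start:start + seq_len]
        target = series[start + seq_len + horizon - 1]
        samples.append((history, target))
    return samples


def chronological_split(samples, train_ratio=0.8):
    """Keep time order: the earliest samples train, the latest ones test."""
    cut = int(len(samples) * train_ratio)
    return samples[:cut], samples[cut:]


# Synthetic stand-in for one section: 30 days x 96 samples per day.
series = [60.0 + 40.0 * math.sin(2 * math.pi * i / 96) for i in range(2880)]
normalized, x_min, x_max = min_max_normalize(series)
samples = make_windows(normalized, seq_len=12, horizon=1)
train, test = chronological_split(samples)
```

To report errors in km/h, predictions would be mapped back with `z * (x_max - x_min) + x_min`.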

4.2. Evaluation Indicators

A total of five metrics were used in the experiments to compare and evaluate the prediction results of the models: Root Mean Square Error ($RMSE$), Mean Absolute Error ($MAE$), Accuracy, Coefficient of Determination ($R^{2}$) and Explained Variance Score ($Var$). Here, $y_{i}$ is the actual section speed, ${\hat{y}}_{i}$ is the predicted section speed, $\bar{y} = \frac{1}{N} \sum_{i = 1}^{N} y_{i}$ is the mean of the actual section speeds, and N is the sample size.

RMSE = \sqrt{\frac{1}{N} \sum_{i = 1}^{N} {(y_{i} - {\hat{y}}_{i})}^{2}}

(20)

MAE = \frac{1}{N} \sum_{i = 1}^{N} |y_{i} - {\hat{y}}_{i}|

(21)

Accuracy = 1 - \frac{{∥y_{i} - {\hat{y}}_{i}∥}_{F}}{{∥y_{i}∥}_{F}}

(22)

R^{2} = 1 - \frac{\sum_{i = 1}^{N} {(y_{i} - {\hat{y}}_{i})}^{2}}{\sum_{i = 1}^{N} {(y_{i} - \bar{y})}^{2}}

(23)

Var = 1 - \frac{var \{y_{i} - {\hat{y}}_{i}\}}{var \{y_{i}\}}

(24)

$RMSE$ and $MAE$ both measure the prediction error; smaller values indicate better predictions. $R^{2}$ and $Var$ measure how well the prediction results represent the actual data; the larger the value, the better the prediction.
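As a worked example of Equations (20)–(24), the sketch below evaluates the five metrics on four made-up speed values. For a flat list of values the Frobenius norm in Equation (22) reduces to the Euclidean norm, which is what the code uses; the function name `evaluate` and the sample numbers are illustrative.

```python
import math


def evaluate(y_true, y_pred):
    """Return RMSE, MAE, Accuracy, R2 and Var for two equal-length lists."""
    n = len(y_true)
    err = [a - p for a, p in zip(y_true, y_pred)]
    mean_y = sum(y_true) / n
    mean_err = sum(err) / n

    sse = sum(e * e for e in err)                      # sum of squared errors
    sst = sum((a - mean_y) ** 2 for a in y_true)       # total sum of squares

    return {
        "RMSE": math.sqrt(sse / n),                                   # Eq. (20)
        "MAE": sum(abs(e) for e in err) / n,                          # Eq. (21)
        "Accuracy": 1.0 - math.sqrt(sse)
                    / math.sqrt(sum(a * a for a in y_true)),          # Eq. (22)
        "R2": 1.0 - sse / sst,                                        # Eq. (23)
        # Eq. (24): variance of the errors over variance of the targets
        # (the 1/N factors cancel).
        "Var": 1.0 - sum((e - mean_err) ** 2 for e in err) / sst,
    }


actual = [100.0, 90.0, 80.0, 70.0]       # km/h, made up
predicted = [98.0, 91.0, 83.0, 70.0]
metrics = evaluate(actual, predicted)
```

$R^{2}$ and $Var$ differ only when the errors have a non-zero mean: the explained variance score ignores a constant bias, so it is never smaller than $R^{2}$.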

4.3. Parameter Design

In this experiment, the sym5 wavelet is chosen with 3 decomposition levels, the learning rate is set to 0.001, the batch size to 64, and the number of training epochs to 3000. Because the prediction accuracy may be affected by the number of hidden units, different numbers of hidden units from [16, 32, 64, 128] are tried and the resulting predictions are compared. As shown in Figure 10, the horizontal axis indicates the number of hidden units and the vertical axis indicates the change in metrics. When the number of hidden units is 64, the $RMSE$ and $MAE$ are the smallest. As the number of hidden units increases, the prediction accuracy first increases and then decreases. The main reason is that when the number of hidden units exceeds a certain level, the complexity and computational difficulty of the model increase greatly, which lowers the prediction accuracy. Therefore, the number of hidden units is set to 64 in all experiments. Furthermore, the Adam optimizer is used during training to update the network parameters.
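To make the three-level decomposition and single-branch reconstruction concrete, the sketch below uses the Haar wavelet in pure Python. This is a deliberate substitution: the experiments use sym5, whose longer filters are not reproduced here, and Haar is chosen only because it needs no external library. The speed values are made up, and the signal length must be divisible by 2 to the power of the number of levels.

```python
import math

SQRT2 = math.sqrt(2.0)


def haar_dwt(signal):
    """One Haar analysis step: pairwise scaled sums and differences."""
    approx = [(signal[i] + signal[i + 1]) / SQRT2 for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / SQRT2 for i in range(0, len(signal), 2)]
    return approx, detail


def haar_idwt(approx, detail):
    """Inverse of haar_dwt."""
    out = []
    for a, d in zip(approx, detail):
        out.extend([(a + d) / SQRT2, (a - d) / SQRT2])
    return out


def decompose(signal, levels=3):
    """Return (approximation, details); details[0] is the finest level."""
    details = []
    approx = list(signal)
    for _ in range(levels):
        approx, detail = haar_dwt(approx)
        details.append(detail)
    return approx, details


def single_branch(approx, details, keep):
    """Reconstruct one branch with every other coefficient set to zero.

    keep is "A" for the approximation or a detail index (0 = finest).
    """
    cur = list(approx) if keep == "A" else [0.0] * len(approx)
    for lvl in reversed(range(len(details))):
        d = details[lvl] if keep == lvl else [0.0] * len(details[lvl])
        cur = haar_idwt(cur, d)
    return cur


speeds = [98.0, 97.5, 96.0, 99.0, 80.0, 62.0, 55.0, 70.0,
          88.0, 95.0, 97.0, 98.5, 99.0, 98.0, 97.0, 96.5]
approx, details = decompose(speeds, levels=3)
branches = [single_branch(approx, details, "A")]
branches += [single_branch(approx, details, k) for k in range(3)]
reconstructed = [sum(vals) for vals in zip(*branches)]
```

Each branch has the same length as the original series, and because the transform is linear the four branches sum back to the original signal, which is what allows each branch to be modelled separately and the results recombined.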

4.4. Experimental Results and Analysis

The experiment uses 12 historical data samples to predict the section speeds for the next 15 min, 30 min, and 45 min. Figure 11 visualizes the 15-min predictions of the WSTGCN model for four sections selected from the 16 sections. Black indicates the predicted section speed, red indicates the real section speed, the values marked $RMSE$ and $MAE$ give the overall evaluation indices of the section, and the orange and green bars show the $RMSE$ and $MAE$ calculated every three hours. The predicted and actual speeds follow similar trends, and the variation in $RMSE$ and $MAE$ is small, indicating that the model can accurately predict traffic speed. Figure 11c,d show that the prediction accuracy is reduced in sections where the traffic speed changes are complicated. As seen at the position of the red rectangle, the model still captures the trend when the speed changes suddenly. To verify the reliability of the WSTGCN model, eight baseline methods are used for comparison: HA, SVR, ARIMA, GCN, GRU, LSTM, the Spatial-Temporal Dynamic Network (STDN), which consists of CNN and LSTM, and GCN-GRU.

Table 3 shows the evaluation metrics of the different models for predicting the section speed for the next 15 min. WSTGCN is better than the baseline methods on all five evaluation indicators. Compared with the GCN-GRU model, the $RMSE$ of WSTGCN is 20.28% lower and the $MAE$ is 11.69% lower. The GCN-GRU model does not consider the volatility of the data, which leads to lower prediction accuracy; the result also shows that using WSTGCN to improve the prediction accuracy is feasible.
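The percentage improvements quoted here follow from the Table 3 entries as a relative reduction with respect to the baseline. The short sketch below reproduces the GCN-GRU comparison; the helper name `relative_reduction` is illustrative, and the values are copied from Table 3.

```python
def relative_reduction(baseline, model):
    """Percentage by which `model` is lower than `baseline`."""
    return (baseline - model) / baseline * 100.0


# (RMSE, MAE) for the 15-min horizon, from Table 3.
table3 = {
    "GCN-GRU": (2.7992, 1.4481),
    "WSTGCN": (2.2316, 1.2788),
}

rmse_drop = relative_reduction(table3["GCN-GRU"][0], table3["WSTGCN"][0])
mae_drop = relative_reduction(table3["GCN-GRU"][1], table3["WSTGCN"][1])
print(f"RMSE {rmse_drop:.2f}% lower, MAE {mae_drop:.2f}% lower")
# prints: RMSE 20.28% lower, MAE 11.69% lower
```

Note that the denominator is the baseline value, so "X% lower than the baseline" and "the baseline is Y% higher" give different numbers for the same pair of errors.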

Compared with STDN, WSTGCN has a 21.24% lower $RMSE$ and a 12.72% lower $MAE$, which indicates that, for a realistic expressway road network structure, GCN can capture spatial correlation better than CNN. The $RMSE$ of WSTGCN is reduced by 55.68% and the $MAE$ by 59.13% compared with GCN, which shows that good prediction results cannot be obtained by considering only the spatial characteristics of expressway nodes without the temporal characteristics of their attributes. Compared with GRU and LSTM, the $RMSE$ of WSTGCN is reduced by 20.74% and 21.00%, and the $MAE$ by 13.18% and 14.01%, respectively, which shows that ignoring the correlation between expressway nodes also prevents good section speed prediction. Therefore, combining GCN with GRU, which takes into account both the spatial characteristics and the time series characteristics of the nodes, improves the prediction results. Second, according to Table 3, the $RMSE$ of GCN-GRU is 44.41% lower and its $MAE$ is 53.72% lower than those of GCN, because GCN only considers the spatial characteristics of expressway nodes and not the temporal characteristics of the node feature vectors; this also indicates that the future section speed depends more on the historical section speed series. The $RMSE$ of GCN-GRU is about 46.87%, 54.00%, and 1.19% lower than those of HA, ARIMA, and SVR, and the $RMSE$ of WSTGCN is about 57.64%, 63.33%, and 21.22% lower than those of HA, ARIMA, and SVR, respectively. The weaker results of these three methods are mainly caused by their poor nonlinear fitting ability on complex spatio-temporal data.

Next, the prediction performance of WSTGCN and the other models is examined at longer horizons: the models are used to predict the section speeds for the next 30 min and 45 min, and their performance is compared in Table 4 and Table 5. For the 30-min and 45-min speed predictions, the $RMSE$ of WSTGCN is reduced by 18.85% and 8.67% compared with GCN-GRU, by 22.34% and 19.21% compared with GRU, and by 45.66% and 33.19% compared with GCN. This shows that the WSTGCN model also captures the spatial and temporal characteristics of the section speed well in longer-term prediction.

Figure 12 shows the $RMSE$ and $MAE$ of the different models at 30 min and 45 min. WSTGCN still has the lowest $RMSE$ at both horizons and the lowest $MAE$ at 30 min; at 45 min its $MAE$ (2.0231 in Table 5) is essentially level with that of GCN-GRU (2.0209). As the prediction horizon increases, the accuracy of all models gradually decreases, while WSTGCN retains high accuracy. Therefore, WSTGCN has good long-term prediction ability.

5. Conclusions

In this paper, an expressway speed prediction method based on wavelet transform and a spatio-temporal graph convolutional network is proposed. First, the ETC gantry transaction data are matched with the topological data to construct the vehicle travel time data. Then, the vehicle travel time outlier detection algorithm is used to eliminate the travel time anomalies of each section, and the section speed dataset is constructed. Finally, the section speed data and topological data are input into the WSTGCN model for training, and the model is compared with various other models. The experimental results show that the prediction accuracy of the WSTGCN model in expressway speed prediction is better than that of the other methods, and that it can accurately predict the section speed of the expressway. This paper also has some shortcomings: other factors that affect traffic speed (e.g., weather conditions) are not considered. In future work, we plan to introduce other data sources to make the section speed prediction more accurate.

Author Contributions

Conceptualization, F.Z. and Q.R.; methodology, F.Z.; software, Q.R.; validation, J.T., F.G. and S.H.; formal analysis, F.Z.; investigation, Q.R.; resources, J.T.; data curation, Q.R.; writing—original draft preparation, Q.R.; writing—review and editing, F.Z.; visualization, L.L.; supervision, J.W.; project administration, S.H.; funding acquisition, F.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work is funded by the National Natural Science Foundation of China (41971340), the Special Funds for the Central Government to Guide Local Scientific and Technological Development (2020L3014), the 2020 Fujian Province “the Belt and Road” Technology Innovation Platform (2020D002), the Provincial Candidates for the Hundred, Thousand and Ten Thousand Talent of Fujian (GY-Z19113), and the Crosswise project (No. GY-H-21021).

Data Availability Statement

Restrictions apply to the availability of these data. The data were obtained from Fujian Expressway Information Technology Co., Ltd. and are available from the authors with the permission of Fujian Expressway Information Technology Co., Ltd.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Smith, M.; Huang, W.; Viti, F.; Tampère, C.; Lo, H.K. Quasi-dynamic traffic assignment with spatial queueing, control and blocking back. Transp. Res. Part B Methodol. 2019, 122, 140–166.
2. Yang, X.X.; Zou, Y.J.; Tang, J.J.; Liang, J.; Ijaz, M. Evaluation of short-term freeway speed prediction based on periodic analysis using statistical models and machine learning models. J. Adv. Transp. 2020, 2020, 9628957.
3. Qian, M. Exploration of Multi-dimensional Data Fusion Application of ETC Gantry System. China Its J. 2021, 6, 109–112.
4. Abduljabbar, R.L.; Dia, H.; Tsai, P.W. Unidirectional and Bidirectional LSTM Models for Short-Term Traffic Prediction. J. Adv. Transp. 2021, 2021, 5589075.
5. Hua, C.H.; Shao, Y.M.; Ao, G.C.; Zhang, H.L. Speed prediction by online map-based GCN-LSTM neural network. J. Traffic Transp. Eng. 2021, 21, 183–196.
6. Elleuch, W.; Wali, A.; Alimi, A.M. Towards an efficient traffic congestion prediction method based on neural networks and big GPS data. IIUM Eng. J. 2019, 20, 108–118.
7. Gao, Y.; Zhao, J.D.; Qin, Z.Y.; Feng, Y.Z.; Yang, Z.Z.; Jia, B. Traffic speed forecast in adjacent region between highway and urban expressway: Based on MFD and GRU model. J. Adv. Transp. 2020, 2020, 108–118.
8. Zeng, X.; Guan, X.F.; Wu, H.Y.; Xiao, H.P. A data-driven quasi-dynamic traffic assignment model integrating multi-source traffic sensor data on the expressway network. ISPRS Int. J. Geo-Inf. 2021, 10, 113.
9. Zhang, N.; Guan, X.F.; Cao, J.; Wang, X.L.; Wu, H.Y. Wavelet-HST: A wavelet-based higher-order spatio-temporal framework for urban traffic speed prediction. IEEE Access 2019, 7, 118446–118458.
10. Emami, A.; Sarvi, M.; Bagloee, S.A. Short-term traffic flow prediction based on faded memory Kalman Filter fusing data from connected vehicles and Bluetooth sensors. Simul. Model. Pract. Theory 2020, 102, 102025.
11. Xu, D.W.; Wang, Y.D.; Jia, L.M.; Qin, Y.; Dong, H.H. Real-time road traffic state prediction based on ARIMA and Kalman filter. Front. Inf. Technol. Electron. Eng. 2017, 18, 287–302.
12. Hu, Y.C.; Wu, H.; Huang, J.X.; Lv, H.Y.; Zhang, Z.J. A method for estimating expressway section average speed based on Support Vector Regression. J. Highw. Transp. Res. Dev. 2019, 36, 137–143, 151.
13. Evans, J.; Waterson, B.; Hamilton, A. Forecasting road traffic conditions using a context-based random forest algorithm. Transp. Plan. Technol. 2019, 42, 554–572.
14. Sun, B.; Cheng, W.; Goswami, P.; Bai, G.H. Short-term traffic forecasting using self-adjusting k-nearest neighbours. IET Intell. Transp. Syst. 2018, 12, 41–48.
15. Zhang, D.; Kabuka, M.R. Combining weather condition data to predict traffic flow: A GRU-based deep learning approach. IET Intell. Transp. Syst. 2018, 12, 578–585.
16. Ma, X.L.; Tao, Z.M.; Wang, Y.H.; Yu, H.Y.; Wang, Y.P. Long short-term memory neural network for traffic speed prediction using remote microwave sensor data. Transp. Res. Part C Emerg. Technol. 2015, 54, 187–197.
17. Fu, X.; Luo, W.; Xu, C.Y.; Zhao, X.X. Short-term traffic speed prediction method for urban road sections based on wavelet transform and gated recurrent unit. Math. Probl. Eng. 2020, 9, 3697625.
18. Lu, H.K.; Huang, D.M.; Song, Y.Y.; Jiang, D.Z.; Zhou, T.; Qin, J. St-trafficnet: A spatial-temporal deep learning network for traffic forecasting. Electronics 2020, 9, 1474.
19. Bogaerts, T.; Masegosa, A.D.; Angarita-Zapata, J.S.; Onieva, E.; Hellinckx, P. A graph CNN-LSTM neural network for short and long-term traffic forecasting based on trajectory data. Transp. Res. Part C Emerg. Technol. 2020, 112, 62–77.
20. Zhao, L.; Song, Y.J.; Zhang, C.; Liu, Y.; Wang, P.; Lin, T.; Deng, M.; Li, H.L. T-gcn: A temporal graph convolutional network for traffic prediction. IEEE Trans. Intell. Transp. Syst. 2019, 21, 3848–3858.
21. Pan, C.S.; Zhu, J.; Kong, Z.X.; Shi, H.F.; Yang, W.S. DC-STGCN: Dual-channel based graph convolutional networks for network traffic forecasting. Electronics 2021, 10, 1014.
22. Zou, F.M.; Guo, F.; Tian, J.S.; Luo, S.J.; Yu, X.; Gu, Q.; Liao, L.C. The Method of Dynamic Identification of the Maximum Speed Limit of Expressway Based on Electronic Toll Collection Data. Sci. Program. 2021, 2021, 4702669.
23. Liao, L.C.; Jiang, X.H.; Lin, M.Z.; Zou, F.M. Recognition method of road speed limit information based on data mining of traffic trajectory. J. Traffic Transp. Eng. 2015, 15, 118–126.
24. Zhao, R.M.; Cui, H.M. Improved threshold denoising method based on wavelet transform. In Proceedings of the 2015 7th International Conference on Modelling, Identification and Control (ICMIC), Sousse, Tunisia, 18–20 December 2015; pp. 1–4.
25. Liu, C.; Liu, W.M. Wavelet transform based traffic flow predicting and model optimization. Sci. Technol. Eng. 2008, 21, 5858–5862.
26. Liu, Q.L.; Dai, H.L. Wavelet Filtering of the BP Neural Network of Highway Congestion Forecast Analysis during the Holidays. Highw. Eng. 2016, 6, 98–102.
27. Yang, X.; Xue, Q.C.; Yang, X.X.; Yin, H.D.; Qu, Y.C.; Li, X.; Wu, J.J. A novel prediction model for the inbound passenger flow of urban rail transit. Inf. Sci. 2021, 566, 347–363.
28. Zhang, C.X.; Song, D.J.; Huang, C.; Swami, A.; Chawla, N.V. Heterogeneous graph neural network. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Anchorage, AK, USA, 4–8 August 2019; pp. 793–803.
29. Kipf, T.N.; Welling, M. Semi-supervised classification with graph convolutional networks. In Proceedings of the 5th International Conference on Learning Representations (ICLR), Toulon, France, 24–26 April 2017.
30. Defferrard, M.; Bresson, X.; Vandergheynst, P. Convolutional neural networks on graphs with fast localized spectral filtering. In Proceedings of the 30th Congress on Advances in Neural Information Processing Systems (NIPS), Barcelona, Spain, 5–10 December 2016.
31. Ma, L.; Liu, Y.; Zhang, X.L.; Ye, Y.X.; Yin, G.F.; Johnson, B.A. Deep learning in remote sensing applications: A meta-analysis and review. ISPRS J. Photogramm. Remote Sens. 2019, 152, 166–177.

Figure 1. Overall framework.

Figure 2. Percentages of ETC transaction data and MTC transaction data.

Figure 3. Types of abnormal data (**** indicates that the data was desensitized) (a) Data redundancy (b) Missing data (c) Data errors.

Figure 4. Principle diagram of travel time outlier detection (a) Upper and lower limits are outside the normal distribution (b) Upper and lower limits are inside the normal distribution (c) Lower limit is outside the normal distribution and upper limit is inside the normal distribution (d) Lower limit is inside the normal distribution and upper limit is outside the normal distribution.

Figure 5. WSTGCN model.

Figure 6. Wavelet transform principle.

Figure 7. Section speed decomposition results.

Figure 8. Topological relationship between expressway gantries used to obtain spatial features.

Figure 9. GRU basic structure.

Figure 10. Comparison of prediction performance under different hidden units.

Figure 11. Visualization results for 15 min. (a) Section a (b) Section b (c) Section c (d) Section d.

Figure 12. The left half shows the $RMSE$ and $MAE$ of the different models for 30 min; the right half shows the $RMSE$ and $MAE$ of the different models for 45 min.

Table 1. Partial transaction data attribute table of ETC gantry system.

Attribute Name	Examples	Attribute Name	Examples
Trade ID	340,119…2698	OBU Plate	Blue Fujian A12345
Trade Time	2020/6/1 21:35:51	Vehicle Class	1
Flag ID	352,305	Enter Time	2020/6/1 21:27:35
Flag Type	0	Enter Station	2507
Flag Index	1	OBU ID	12A2B3F8

Table 2. Expressway section speed data attribute table.

Attribute Name	Examples
Before Flag ID	354,505
After Flag ID	354,507
Speed (km/h)	98.46
Time Window	2020-06-01 13:15:00

Table 3. Comparison of 15-min speed predictions.

Model	$RMSE$	$MAE$	$Accuracy$	$R^{2}$	$Var$
HA	5.2682	2.9406	0.9457	0.4764	0.4764
ARIMA	6.0858	4.2199	0.9364	0.0025	0.0011
SVR	2.8328	1.4903	0.9708	0.8489	0.8489
LSTM	2.8248	1.4872	0.9708	0.8491	0.8491
GCN	5.0358	3.1289	0.9481	0.5243	0.5263
GRU	2.8158	1.4729	0.9708	0.8513	0.8493
STDN	2.8335	1.4652	0.9709	0.8518	0.8518
GCN-GRU	2.7992	1.4481	0.9711	0.8525	0.8525
WSTGCN	2.2316	1.2788	0.9771	0.9060	0.9063

Table 4. Comparison of 30-min speed predictions.

Model	$RMSE$	$MAE$	$Accuracy$	$R^{2}$	$Var$
HA	5.4618	3.0503	0.9457	0.4379	0.4379
ARIMA	6.0858	4.2199	0.9364	0.0025	0.0011
SVR	4.1162	2.2221	0.9575	0.6821	0.6821
LSTM	3.7531	2.1942	0.9586	0.7106	0.7106
GCN	5.0358	3.1289	0.9481	0.5145	0.5166
GRU	3.5607	2.1305	0.9604	0.7258	0.7289
STDN	3.4516	1.9482	0.9638	0.7684	0.7692
GCN-GRU	3.4074	1.8117	0.9649	0.7816	0.7819
WSTGCN	2.7651	1.5906	0.9715	0.8559	0.8569

Table 5. Comparison of 45-min speed predictions.

Model	$RMSE$	$MAE$	$Accuracy$	$R^{2}$	$Var$
HA	5.6406	3.1526	0.9418	0.4011	0.4011
ARIMA	6.0858	4.2199	0.9364	0.0025	0.0011
SVR	4.9536	2.7176	0.9489	0.5411	0.5411
LSTM	4.6462	2.6085	0.9556	0.6208	0.6221
GCN	5.2113	3.1912	0.9462	0.4912	0.4937
GRU	4.3096	2.5291	0.9582	0.6625	0.6284
STDN	4.1096	2.2354	0.9597	0.7146	0.7159
GCN-GRU	3.7873	2.0209	0.9609	0.7305	0.7311
WSTGCN	3.4818	2.0231	0.9641	0.7718	0.7736

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
