With its high capacity and low time cost, the expressway has become the preferred way to travel between cities [
1]. Due to social and economic development, traditional traffic management techniques struggle to cope with the increasing traffic pressure, and there is an urgent need to develop Intelligent Transportation Systems (ITS) for expressways. With the accurate prediction of traffic information from ITS, travelers can develop reasonable travel routes before departure and improve the efficiency of travel, and road management departments can effectively conduct traffic guidance and alleviate traffic congestion and other problems based on reliable road traffic information [
2]. In recent years, China’s expressway ETC system has realized the networking of 29 provinces nationwide, built a total of 24,588 sets of ETC gantry systems, renovated 48,211 ETC lanes, and averaged nearly one billion ETC gantry transaction data per day [
3], which has further improved the efficiency of expressways. The transaction data collected by the ETC gantry system can record the travel information of almost every vehicle on the expressway, and compared with detector data [
4,
5] and floating car data [
6,
7], the ETC gantry transaction data are more comprehensive and reliable, covering the expressway road network. Therefore, to further improve the service quality of the expressway ETC system, it is of great theoretical significance and practical value to research the traffic speed prediction based on ETC gantry transaction data [
8].
Several domestic and foreign researchers have invested in the field of traffic prediction research [
9]. The three main types of traffic prediction methods are statistical learning models, shallow machine learning and deep learning. Statistical learning models. For example, Emami et al. [
10] proposed a Kalman filter-based method for traffic flow prediction. Xu et al. [
11] proposed an Autoregressive Integrated Moving Average (ARIMA) and Kalman filter method for predicting road traffic status. Statistical learning models are usually performed under the assumption of linearity, which does not reflect the nonlinear feature of traffic speed well, and also cannot handle high-dimensional data well due to its high complexity. Shallow machine learning can solve the above problems well, for example, Hu et al. [
12] proposed a Support Vector Regression (SVR)-based method for estimating the average speed of expressway sections to overcome the sparse density of existing expressway vehicle detectors. Evans et al. [
13] performed road section state prediction based on Random Forest (RF), and with different prediction ranges and training data amounts, the algorithm achieved the best results compared with others. Sun et al. [
14] used a method to dynamically adjust the K-nearest neighbor (KNN) parameters for traffic flow prediction and achieved good results in different periods. However, the performance of shallow machine learning methods depends heavily on the artificially designed features and they usually fail to produce the best results for prediction tasks with complex regularities and complicated factors. In recent years, deep learning models have shown their superior predictive capabilities. Deep learning models can automatically extract features and capture the correlation of data, so a large number of deep learning models are used for traffic prediction. Since traffic data can be represented as time series data, GRU [
15] and LSTM [
16] are gradually used for speed prediction. Fu et al. [
17] used wavelet transform to decompose the original data and then constructed GRU and Autoregressive Moving Average (ARMA) models to predict low-frequency and high-frequency sequences. Despite the impressive capability of these methods in temporal modeling, accurate traffic prediction is still limited because of the lack of consideration of the spatial characterization of traffic data. To solve this problem, Lu et al. [
18] proposed a spatial-temporal deep learning network (ST-TrafficNet) for traffic flow forecasting, which is capable of capturing high-dimensional temporal features while also extracting latent spatial features. Bogaerts et al. [
19] used a combination of CNN and LSTM to construct a spatio-temporal recurrent convolutional neural network to effectively extract temporal features and spatial feature variations of traffic speed and achieve high accuracy traffic speed prediction. However, CNN is designed for Euclidean spatial structure, and for the actual expressway road network structure, CNN is unable to capture the spatial correlation completely. Zhao et al. [
20] proposed a temporal graph convolutional network for traffic prediction, in which the combination of GCN and GRU goes to capture the spatial and temporal features of traffic flow. Pan et al. [
21] proposed the dual-channel based graph convolutional network (DC-STGCN) model to fully extract the spatio-temporal characteristics between traffic flows, and achieved good results in long-term prediction.
Regarding expressway speed prediction, most of the data used in this paper are vehicle detector data and floating car data. However, there are shortcomings, such as low density of vehicle detectors, high damage rate, and incomplete objects for floating car data. To solve these issues, this paper considers the nearly full sample of ETC gantry transaction data and use the WSTGCN model to predict exressway speeds, which can effectively eliminate the section speeds and extract the spatio-temporal characteristics between ETC gantries. The main contributions of this work as follows.
The rest of the paper is organized as follows. The concepts related to expressways are introduced and problem description in detail in the “Preliminary” section. The model construction for expressway speed prediction is described in detail in the “Methodology” section. In the “Experimental Results and Analysis” section, the WSTGCN model is evaluated using ETC gantry transaction data from Fujian Province, and finally we present the ‘Conclusions” of our paper.