Convolutional Long Short-Term Memory Two-Dimensional Bidirectional Graph Convolutional Network for Taxi Demand Prediction
Abstract
:1. Introduction
- We build a neural network to connect two regions to get useful information from the demand distribution of the destination.
- We design two GCNs to mine the implicit information of the origin and the destination individually, which can help balance the influence of each dimension in the results.
- Since the traditional relationship defined by static graphs may be not accurate in the real world, we use dynamic graphs to describe each region’s relationship. This method not only solves the intricate graph correlations that need much previous data to support and calculate but also reduces pre-defined difficulties if the raw data are particularly anomalous.
2. Related Works
2.1. Traditional Methods
2.2. Deep Learning Methods
3. Preliminaries and Problem Statement
3.1. OD Graph
3.2. OD Features
3.3. Problem Definition
4. The Proposed CTBGCN Model
4.1. CTBGCN Framework
Algorithm 1 CTBGCN Algorithm |
|
4.2. Temporal Section
4.3. Spatial Section
4.3.1. Adjacency Matrix Definition
4.3.2. TBGCN
4.4. Fusion Section
5. Experiments and Results
5.1. Dataset
5.2. Evaluation Metrics and Experiment Settings
- Evaluation metrics: To evaluate the performance of the proposed model, we adopt performance metrics of mean square errors (MSE), root mean square errors (RMSE), and mean absolute percentage errors (MAPE). and are used for test evaluation at the OD-level. and refer to the comparisons at the region level in RMSE and MAPE. is regarded as the loss function to prompt the network convergence. Their mathematical formulas are listed here:
- Data preparation: We make use of all of the data in the prediction and set the ratio of the training, verification, and test to 4:1:1 in chronological order. According to the Min-Max method, the data transformed in the range of [−1,1], which can be expressed as:After the transformation, we utilize the demands of five continuous intervals to predict the demand of the next interval. For instance, the data during 4:00 to 6:30 are used to forecast the demand during 6:30 to 7:00 within one day.
- Experiment settings: All experiments were run on one GeForce RTX 2080 Ti GPU and one Intel(R) Xeon(R) CPU E5-2678 v3 @ 2.50 GHz. The k-th order of the Chebyshev net is set as 3. We set 500 epochs to train the proposed model and set the early stopper at 50 to ensure the best results. The optimized method is the adaptive moment estimation (Adam). The initial learning rate is 0.001. After 100 epochs, we reduce the learning rate by half if the loss does not decrease within 5 epochs. Finally, we set the batch size at 16 because of the limit of video memory.
5.3. Methods for Comparison
- HA [12]: Predict the future needs by averaging the historical demands. We utilize the same data from S time intervals to average the taxi demands of the previous n time intervals.
- Lasso Regression [13]: Lasso Regression is a linear regression method with ℓ1-norm regularization. The data used are the same as in the HA method above.
- LSTM [19]: LSTM is a temporal neural network for many time-interval problems. Three layers are added, and the hidden dimensions are 16, 32, and 1.
- MLP: MLP is a classical deep learning method. We set the hidden neurons at 64, 32, 16, and 1.
- ST-ResNet [33]: ST-ResNet is a residual neural network framework that models the time distance, cycle, and trend attributes of crowd flow.
- CSTN [11]: CSTN is an OD predicting method. It makes use of the 3DCNN, Conv-LSTM, and a global correlation context section from global and local views.
5.4. Comparisons
- CTBGCN achieves the best performance in metrics. Compared with CSTN, there is an obvious improvement in at the region level, with a relative performance improvement of 4.5%. Other metrics improve slightly, with a relative improvement of just 0.2% in and 0.14% in . However, other baselines have gaps in terms of both RMSE and MAPE since they all ignore spatio-temporal correlations.
- The extensibility of the proposed model is wonderful. The model do not need any pre-defined additional information in the prediction process in addition to the necessary OD information.
- The proposed model has a excellent generalization in long-term predictions. Many other methods do not make sense for long time intervals. There is no validation set or the validation set time is very short in methods such as CSTN and ST-ED-RMGC. We allocate two months to the verification set in our experiment, which means our model needs to fit the data after two months correctly. It proves that the proposed model has great robustness even in the face of some weather and seasonal changes.
5.5. Forecating Shows
5.6. Time Consumption
5.7. Ablation Experiments
6. Conclusions and Future Direction
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Zhao, K.; Khryashchev, D.; Freire, J.; Silva, C.; Vo, H. Predicting taxi demand at high spatial resolution: Approaching the limit of predictability. In Proceedings of the 2016 IEEE International Conference on Big Data (Big Data), Washington, DC, USA, 5–8 December 2016; pp. 833–842. [Google Scholar] [CrossRef]
- Yang, H.; Wong, S.; Wong, K. Demand–supply equilibrium of taxi services in a network under competition and regulation. Transp. Res. Part B Methodol. 2002, 36, 799–819. [Google Scholar] [CrossRef]
- Ma, W.; Qian, Z.S. Estimating multi-year 24/7 origin-destination demand using high-granular multi-source traffic data. Transp. Res. Part C Emerg. Technol. 2018, 96, 96–121. [Google Scholar] [CrossRef]
- Wang, Y.; Yin, H.; Chen, H.; Wo, T.; Xu, J.; Zheng, K. Origin-Destination Matrix Prediction via Graph Convolution: A New Perspective of Passenger Demand Modeling. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Anchorage, AK, USA, 4–8 August 2019; Association for Computing Machinery: New York, NY, USA, 2019. KDD ’19. pp. 1227–1235. [Google Scholar] [CrossRef]
- Ke, J.; Zheng, H.; Yang, H.; Chen, X.M. Short-term forecasting of passenger demand under on-demand ride services: A spatio-temporal deep learning approach. Transp. Res. Part C Emerg. Technol. 2017, 85, 591–608. [Google Scholar] [CrossRef]
- Yao, H.; Wu, F.; Ke, J.; Tang, X.; Jia, Y.; Lu, S.; Gong, P.; Ye, J.; Zhenhui, L. Deep Multi-View Spatial-Temporal Network for Taxi Demand Prediction. In Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, New Orleans, LA, USA, 2–7 February 2018. [Google Scholar]
- Geng, X.; Li, Y.; Wang, L.; Zhang, L.; Yang, Q.; Ye, J.; Liu, Y. Spatiotemporal Multi-Graph Convolution Network for Ride-Hailing Demand Forecasting. In Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence and Thirty-First Innovative Applications of Artificial Intelligence Conference and Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, Honolulu, HI, USA, 27 January–1 February 2019; AAAI Press: Palo Alto, CA, USA, 2019. AAAI’19/IAAI’19/EAAI’19. [Google Scholar] [CrossRef]
- Blume, S.O.; Corman, F.; Sansavini, G. Bayesian origin-destination estimation in networked transit systems using nodal in- and outflow counts. Transp. Res. Part B Methodol. 2022, 161, 60–94. [Google Scholar] [CrossRef]
- Ding, F.; Zhu, Y.; Yin, Q.; Cai, Y.; Zhang, D. MS-ResCnet: A combined spatiotemporal modeling and multi-scale fusion network for taxi demand prediction. Comput. Electr. Eng. 2023, 105, 108558. [Google Scholar] [CrossRef]
- Xu, J.; Rahmatizadeh, R.; Bölöni, L.; Turgut, D. A taxi dispatch system based on prediction of demand and destination. J. Parallel Distrib. Comput. 2021, 157, 269–279. [Google Scholar] [CrossRef]
- Liu, L.; Qiu, Z.; Li, G.; Wang, Q.; Ouyang, W.; Lin, L. Contextualized Spatial–Temporal Network for Taxi Origin-Destination Demand Prediction. IEEE Trans. Intell. Transp. Syst. 2019, 20, 3875–3887. [Google Scholar] [CrossRef]
- Campbell, J.Y.; Thompson, S.B. Predicting excess stock returns out of sample: Can anything beat the historical average? Rev. Financ. Stud. 2008, 21, 1509–1531. [Google Scholar]
- Tibshirani, R. Regression Shrinkage and Selection via the Lasso. J. R. Stat. Soc. Ser. B-Methodol. 1996, 58, 267–288. [Google Scholar]
- Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; Association for Computing Machinery: New York, NY, USA, 2016. KDD ’16. pp. 785–794. [Google Scholar] [CrossRef]
- Liang, X.; Xu, C.; Shen, X.; Yang, J.; Tang, J.; Lin, L.; Yan, S. Human Parsing with Contextualized Convolutional Neural Network. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 115–127. [Google Scholar] [CrossRef]
- Li, Y.; Yu, R.; Shahabi, C.; Liu, Y. Graph Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting. CoRR 2017, abs/1707.01926. Available online: http://xxx.lanl.gov/abs/1707.01926 (accessed on 29 January 2023).
- Yu, B.; Yin, H.; Zhu, Z. Spatio-Temporal Graph Convolutional Networks: A Deep Learning Framework for Traffic Forecasting. In Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, IJCAI-18, International Joint Conferences on Artificial Intelligence Organization, Stockholm, Sweden, 13–19 July 2018; pp. 3634–3640. [Google Scholar] [CrossRef]
- Zaremba, W.; Sutskever, I.; Vinyals, O. Recurrent Neural Network Regularization. CoRR 2014, abs/1409.2329. Available online: http://xxx.lanl.gov/abs/1409.2329 (accessed on 29 January 2023).
- Yu, Y.; Si, X.; Hu, C.; Zhang, J. A Review of Recurrent Neural Networks: LSTM Cells and Network Architectures. Neural Comput. 2019, 31, 1235–1270. [Google Scholar] [CrossRef]
- Chung, J.; Gülçehre, Ç.; Cho, K.; Bengio, Y. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling. CoRR 2014, abs/1412.3555. Available online: http://xxx.lanl.gov/abs/1412.3555 (accessed on 29 January 2023).
- Shi, X.; Chen, Z.; Wang, H.; Yeung, D.Y.; Wong, W.k.; Woo, W.c. Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting. In Proceedings of the 28th International Conference on Neural Information Processing Systems—Volume 1, Sanur, Bali, Indonesia, 8–12 December 2021; MIT Press: Cambridge, MA, USA, 2015. NIPS’15. pp. 802–810. [Google Scholar]
- Monti, F.; Bronstein, M.M.; Bresson, X. Geometric Matrix Completion with Recurrent Multi-Graph Neural Networks. In Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; NIPS’17. Curran Associates Inc.: Red Hook, NY, USA, 2017; pp. 3700–3710. [Google Scholar]
- Shi, H.; Yao, Q.; Guo, Q.; Li, Y.; Zhang, L.; Ye, J.; Li, Y.; Liu, Y. Predicting Origin-Destination Flow via Multi-Perspective Graph Convolutional Network. In Proceedings of the 2020 IEEE 36th International Conference on Data Engineering (ICDE), Dallas, TX, USA, 20–24 April 2020; pp. 1818–1821. [Google Scholar] [CrossRef]
- Nguyen, K.P. Demand, supply, and pricing in urban road transport: The case of Ho Chi Minh City, Vietnam. Res. Transp. Econ. 1999, 5, 107–154. [Google Scholar] [CrossRef]
- Cascetta, E. Estimation of trip matrices from traffic counts and survey data: A generalized least squares estimator. Transp. Res. Part B Methodol. 1984, 18, 289–299. [Google Scholar] [CrossRef]
- Spiess, H. A maximum likelihood model for estimating origin-destination matrices. Transp. Res. Part B Methodol. 1987, 21, 395–412. [Google Scholar] [CrossRef]
- Maher, M. Inferences on trip matrices from observations on link volumes: A Bayesian statistical approach. Transp. Res. Part B Methodol. 1983, 17, 435–447. [Google Scholar] [CrossRef]
- Mukai, N.; Yoden, N. Taxi Demand Forecasting Based on Taxi Probe Data by Neural Network. In Intelligent Interactive Multimedia: Systems and Services; Watanabe, T., Watada, J., Takahashi, N., Howlett, R.J., Jain, L.C., Eds.; Springer Berlin Heidelberg: Berlin/Heidelberg, Germany, 2012; pp. 589–597. [Google Scholar]
- Bai, L.; Yao, L.; Kanhere, S.S.; Wang, X.; Sheng, Q.Z. STG2Seq: Spatial-temporal Graph to Sequence Model for Multi-step Passenger Demand Forecasting. CoRR 2019, abs/1905.10069. Available online: http://xxx.lanl.gov/abs/1905.10069 (accessed on 29 January 2023).
- Ye, J.; Sun, L.; Du, B.; Fu, Y.; Xiong, H. Coupled Layer-wise Graph Convolution for Transportation Demand Prediction. In Proceedings of the AAAI Conference on Artificial Intelligence, Vancouver, BC, Canada, 2–9 February 2021; Volume 35, pp. 4617–4625. [Google Scholar]
- Ke, J.; Qin, X.; Yang, H.; Zheng, Z.; Zhu, Z.; Ye, J. Predicting origin-destination ride-sourcing demand with a spatio-temporal encoder-decoder residual multi-graph convolutional network. Transp. Res. Part C Emerg. Technol. 2021, 122, 102858. [Google Scholar] [CrossRef]
- Defferrard, M.; Bresson, X.; Vandergheynst, P. Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering. CoRR 2016, abs/1606.09375. Available online: http://xxx.lanl.gov/abs/1606.09375 (accessed on 29 January 2023).
- Zhang, J.; Zheng, Y.; Qi, D. Deep Spatio-Temporal Residual Networks for Citywide Crowd Flows Prediction. CoRR 2016, abs/1610.00081. Available online: http://xxx.lanl.gov/abs/1610.00081 (accessed on 29 January 2023).
Methods | OD_RMSE | OD_MAPE | O_RMSE | O_MAPE |
---|---|---|---|---|
HA | 1.893 | 35.46% | 54.33 | 47.59% |
Lasso | 1.652 | 33.85% | 33.00 | 34.89% |
MLP | 1.665 | 34.35% | 34.79 | 28.84% |
LSTM | 1.618 | 33.55% | 33.86 | 40.18% |
ST-ResNet | 1.380 | 28.53% | 22.43 | 24.16% |
CSTN | 1.322 | 27.39% | 19.85 | 18.48% |
CTBGCN (ours) | 1.318 | 27.25% | 18.95 | 18.16% |
Conditions | OD_RMSE | OD_MAPE | O_RMSE | O_MAPE |
---|---|---|---|---|
all regions | 1.326 | 27.44% | 19.07 | 21.82% |
20 regions | 2.161 | 26.33% | 31.74 | 19.56% |
Methods | Time Consumption/Epoch |
---|---|
CSTN | 1013 s |
CTBGCN (ours) | 420 s |
Methods | OD_RMSE | OD_MAPE | O_RMSE | O_MAPE |
---|---|---|---|---|
LSTM+ 1 layer TGCN | 1.434 | 29.57% | 22.91 | 21.46% |
LSTM+ 3 layers TGCN | 1.391 | 28.86% | 21.30 | 19.61% |
LSTM+ 3 layers TBGCN | 1.362 | 28.02% | 21.21 | 23.71% |
LSTM+ 3 layers TBGCN (static graph) | 1.413 | 29.65% | 23.28 | 23.78% |
CL+ 3 layers TBGCN | 1.320 | 27.34% | 19.06 | 19.04% |
CNN+ CL+ 3 layers TBGCN (cor) | 1.322 | 27.31% | 19.23 | 18.31% |
CL+ 3 layers TBGCN (cor) (ours) | 1.318 | 27.25% | 18.95 | 18.16% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Cao, Y.; Liu, L.; Dong, Y. Convolutional Long Short-Term Memory Two-Dimensional Bidirectional Graph Convolutional Network for Taxi Demand Prediction. Sustainability 2023, 15, 7903. https://doi.org/10.3390/su15107903
Cao Y, Liu L, Dong Y. Convolutional Long Short-Term Memory Two-Dimensional Bidirectional Graph Convolutional Network for Taxi Demand Prediction. Sustainability. 2023; 15(10):7903. https://doi.org/10.3390/su15107903
Chicago/Turabian StyleCao, Yibo, Lu Liu, and Yuhan Dong. 2023. "Convolutional Long Short-Term Memory Two-Dimensional Bidirectional Graph Convolutional Network for Taxi Demand Prediction" Sustainability 15, no. 10: 7903. https://doi.org/10.3390/su15107903
APA StyleCao, Y., Liu, L., & Dong, Y. (2023). Convolutional Long Short-Term Memory Two-Dimensional Bidirectional Graph Convolutional Network for Taxi Demand Prediction. Sustainability, 15(10), 7903. https://doi.org/10.3390/su15107903