Trajectory Prediction with Attention-Based Spatial–Temporal Graph Convolutional Networks for Autonomous Driving
Abstract
:1. Introduction
- The interdependent correlations of the interactions between vehicles and traffic rules in trajectory prediction problems are elaborated for the first time in our work. To further address the issue of data heterogeneity from diverse or varied sources, we also provide an innovative solution to transfer the time-series data of traffic rules into the spatial–temporal ones and produce a homogeneous joint graph along with trajectory data, which provides a fresh idea for the self-driving vehicle to predict the trajectory of nearby vehicles more accurately.
- The interactive intensities between agents are effectively learned and distinguished. We introduce an attention mechanism to learn about discrepancies in multi-agent interactions, which impels the graph dynamic in the spatial domain with growing learning ability. In such a way, we boost the model performance in capturing the spatial–temporal dependencies combined with the higher efficiency of GCN and GRU, which is conducive to the improvement of the prediction precision.
- A novel framework is developed for trajectory anticipation. It integrates the multi-agent interaction mechanisms and the scene context. Based on the real-world dataset tests, the effectiveness and superiority of our proposal have been verified through quantitative and qualitative analyses.
2. Related Work
3. Framework Statement
3.1. Problem Definition
3.2. Proposal Construction
4. Methodology
4.1. Generation of Multi-Agent Graph
4.2. Attention-Based Spatial–Temporal Fusion Graph Module (ASTFG)
4.2.1. Spatial Dependency Modeling
4.2.2. Temporal Dependency Modeling
4.3. Vehicle Trajectory Prediction Module (VTP)
4.4. Training Loss and Optimizer Construction
5. Experiments and Results
5.1. Dataset Description
5.2. Benchmark Model Comparison and Estimation Metrics
5.2.1. Baselines
- Constant Velocity (CV) model [39]: It assumes a constant speed for the vehicle and predicts its future trajectory based on historical states. In our work, with the 3 s historical trajectory input, the next 5 s movements are indicated through calculating the trajectory average speed of the previous time window.
- GRU model [40]: The model is a common GRU network with no aggregation mechanism, and it considers all trajectories to be independent of each other.
- Vanilla LSTM (V-LSTM) model [23]: This baseline conducts the vehicle trajectory task with an LSTM-based encoder–decoder process. Generally, the vehicle historical trajectories are taken into the encoder, and the decoder outputs the vehicle anticipation positions.
- Generative Adversarial Imitation Learning GRU (GAIL-GRU) model [41]: This baseline uses ground truth data from surrounding vehicles as input in the prediction task.
5.2.2. Estimation Metrics
5.3. Implementation Details
5.3.1. Input Description
5.3.2. Training
5.3.3. Time Consumption
5.4. Ablation Study
5.4.1. Effectiveness of Traffic Rules Features
5.4.2. Neighbor Features with Different Distance Thresholds
5.5. Quantitative Analysis of Prediction Results
5.6. Qualitative Analysis of Prediction Results
5.7. Interactions and Attention Mechanism Evaluation
6. Conclusions
- The prediction method considers the interaction between traffic agents and the influence of traffic rules on vehicle trajectories, such as the signal phase. This research proposes a novel framework, called TPASTGCN, which fully utilizes multilevel features for vehicle trajectory anticipation tasks. The effectiveness of our proposal is validated by a real-world dataset and several mainstream methods, which demonstrates the excellent performance in generating future trajectories at the highly complicated intersection scenes. The results manifest that compared with the baseline models, the proposed scheme achieves the prediction accuracy improvement with over 23% on average and 61% at most.
- Additionally, the experiment also indicates that the model is able to address the intention prediction issue to some extent, and the model and its effectiveness are superior to existing trajectory prediction methods in intersection scenarios. But the trajectory prediction model accuracy has further room for improvement. A vehicle’s future trajectory is under the influence of more factors than we are able to consider here, such as vehicle interactions, driver intentions, traffic rules, etc. In this paper, only the factors of vehicle interaction and signal phasing are considered, but the remaining time of traffic signals may improve the accuracy of prediction. Thus, we encourage the authors of future research to model more influencing factors within the the model.
- The accuracy and real-time requirements for the trajectory are very high. The proposed model achieves an equilibrium of prediction speed and accuracy, but its effectiveness on real vehicles is not verified. In the future, we need to conduct extensive testing, validation, and additional research at the vehicle end to verify the validity of our models, and we intend to build spatial–temporal navigation maps and make path-planning tasks for autonomous vehicles based on this study to improve the safety and comfort of autonomous driving.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Zhao, Y.; Wang, Y.; Cheng, X.; Chen, H.; Yu, H.; Ren, Y. RFAP: A Revocable Fine-Grained Access Control Mechanism for Autonomous Vehicle Platoon. IEEE Trans. Intell. Transp. Syst. 2022, 23, 9668–9679. [Google Scholar] [CrossRef]
- Yu, H.; Liu, R.; Li, Z.; Ren, Y.; Jiang, H. An RSU Deployment Strategy based on Traffic Demand in Vehicular Ad Hoc Networks (VANETs). IEEE Internet Things J. 2022, 9, 6496–6505. [Google Scholar] [CrossRef]
- Mo, X.; Huang, Z.; Xing, Y.; Lv, C. Multi-Agent Trajectory Prediction With Heterogeneous Edge-Enhanced Graph Attention Network. IEEE Trans. Intell. Transp. Syst. 2022, 23, 9554–9567. [Google Scholar] [CrossRef]
- Lefèvre, S.; Vasquez, D.; Laugier, C. A survey on motion prediction and risk assessment for intelligent vehicles. ROBOMECH J. 2014, 1, 1. [Google Scholar] [CrossRef]
- Li, X.; Ying, X.; Chuah, M.C. GRIP: Graph-based Interaction-aware Trajectory Prediction. In Proceedings of the 2019 IEEE Intelligent Transportation Systems Conference (ITSC), Auckland, New Zealand, 27–30 October 2019; pp. 3960–3966. [Google Scholar] [CrossRef]
- Chandra, R.; Guan, T.; Panuganti, S.; Mittal, T.; Bhattacharya, U.; Bera, A.; Manocha, D. Forecasting Trajectory and Behavior of Road-Agents Using Spectral Clustering in Graph-LSTMs. IEEE Robot. Autom. Lett. 2020, 5, 4882–4890. [Google Scholar] [CrossRef]
- Mozaffari, S.; Al-Jarrah, O.Y.; Dianati, M.; Jennings, P.; Mouzakitis, A. Deep Learning-Based Vehicle Behavior Prediction for Autonomous Driving Applications: A Review. IEEE Trans. Intell. Transp. Syst. 2022, 23, 33–47. [Google Scholar] [CrossRef]
- Sheng, Z.; Xu, Y.; Xue, S.; Li, D. Graph-Based Spatial-Temporal Convolutional Network for Vehicle Trajectory Prediction in Autonomous Driving. IEEE Trans. Intell. Transp. Syst. 2022, 23, 17654–17665. [Google Scholar] [CrossRef]
- Shi, X.; Qi, H.; Shen, Y.; Wu, G.; Yin, B. A Spatial–Temporal Attention Approach for Traffic Prediction. IEEE Trans. Intell. Transp. Syst. 2021, 22, 4909–4918. [Google Scholar] [CrossRef]
- Zhao, L.; Song, Y.; Zhang, C.; Liu, Y.; Wang, P.; Lin, T.; Deng, M.; Li, H. T-GCN: A Temporal Graph Convolutional Network for Traffic Prediction. IEEE Trans. Intell. Transp. Syst. 2020, 21, 3848–3858. [Google Scholar] [CrossRef]
- Ren, Y.; Jiang, H.; Ji, N.; Yu, H. TBSM: A traffic burst-sensitive model for short-term prediction under special events. Knowl.-Based Syst. 2022, 240, 108120. [Google Scholar] [CrossRef]
- Park, S.H.; Kim, B.; Kang, C.M.; Chung, C.C.; Choi, J.W. Sequence-to-Sequence Prediction of Vehicle Trajectory via LSTM Encoder-Decoder Architecture. In Proceedings of the 2018 IEEE Intelligent Vehicles Symposium (IV), Changshu, China, 26–30 June 2018; pp. 1672–1678. [Google Scholar] [CrossRef]
- Wang, L.; Zhang, L.; Yi, Z. Trajectory Predictor by Using Recurrent Neural Networks in Visual Tracking. IEEE Trans. Cybern. 2017, 47, 3172–3183. [Google Scholar] [CrossRef]
- Xing, Y.; Lv, C.; Cao, D. Personalized Vehicle Trajectory Prediction Based on Joint Time-Series Modeling for Connected Vehicles. IEEE Trans. Veh. Technol. 2020, 69, 1341–1352. [Google Scholar] [CrossRef]
- Su, Y.; Du, J.; Li, Y.; Li, X.; Liang, R.; Hua, Z.; Zhou, J. Trajectory Forecasting Based on Prior-Aware Directed Graph Convolutional Neural Network. IEEE Trans. Intell. Transp. Syst. 2022, 23, 16773–16785. [Google Scholar] [CrossRef]
- Rainbow, B.A.; Men, Q.; Shum, H.P.H. Semantics-STGCNN: A Semantics-guided Spatial-Temporal Graph Convolutional Network for Multi-class Trajectory Prediction. In Proceedings of the 2021 IEEE International Conference on Systems, Man, and Cybernetics (SMC), Melbourne, Australia, 17–20 October 2021; pp. 2959–2966. [Google Scholar] [CrossRef]
- Zyner, A.; Worrall, S.; Ward, J.; Nebot, E. Long short term memory for driver intent prediction. In Proceedings of the 2017 IEEE Intelligent Vehicles Symposium (IV), Los Angeles, CA, USA, 11–14 June 2017; pp. 1484–1489. [Google Scholar] [CrossRef]
- Shirazi, M.S.; Morris, B. Observing behaviors at intersections: A review of recent studies & developments. In Proceedings of the 2015 IEEE Intelligent Vehicles Symposium (IV), Seoul, Republic of Korea, 28 June–1 July 2015; pp. 1258–1263. [Google Scholar] [CrossRef]
- Lefèvre, S.; Laugier, C.; Ibañez-Guzmán, J. Exploiting map information for driver intention estimation at road intersections. In Proceedings of the 2011 IEEE Intelligent Vehicles Symposium (IV), Baden-Baden, Germany, 5–9 June 2011; pp. 583–588. [Google Scholar] [CrossRef]
- Streubel, T.; Hoffmann, K.H. Prediction of driver intended path at intersections. In Proceedings of the 2014 IEEE Intelligent Vehicles Symposium Proceedings, Dearborn, MI, USA, 8—11 June 2014; pp. 134–139. [Google Scholar] [CrossRef]
- Qian, L.P.; Feng, A.; Yu, N.; Xu, W.; Wu, Y. Vehicular Networking-Enabled Vehicle State Prediction via Two-Level Quantized Adaptive Kalman Filtering. IEEE Internet Things J. 2020, 7, 7181–7193. [Google Scholar] [CrossRef]
- Vemula, A.; Muelling, K.; Oh, J. Social attention: Modeling attention in human crowds. In Proceedings of the 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, Australia, 21–25 May 2018; pp. 4601–4607. [Google Scholar]
- Altche, F.; Fortelle, A.D. An LSTM network for highway trajectory prediction. In Proceedings of the 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC), Yokohama, Japan, 16–19 October 2017; pp. 353–359. [Google Scholar]
- Kim, B.; Kang, C.M.; Kim, J.; Lee, S.H.; Chung, C.C.; Choi, J.W. Probabilistic vehicle trajectory prediction over occupancy grid map via recurrent neural network. In Proceedings of the 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC), Yokohama, Japan, 16–19 October 2017; pp. 399–404. [Google Scholar] [CrossRef]
- Phillips, D.J.; Wheeler, T.A.; Kochenderfer, M.J. Generalizable intention prediction of human drivers at intersections. In Proceedings of the 2017 IEEE Intelligent Vehicles Symposium (IV), Los Angeles, CA, USA, 11–14 June 2017; pp. 1665–1670. [Google Scholar] [CrossRef]
- Deo, N.; Trivedi, M.M. Multi-Modal Trajectory Prediction of Surrounding Vehicles with Maneuver based LSTMs. In Proceedings of the 2018 IEEE Intelligent Vehicles Symposium (IV), Changshu, China, 26–30 June 2018; pp. 1179–1184. [Google Scholar] [CrossRef]
- Zhao, T.; Xu, Y.; Monfort, M.; Choi, W.; Baker, C.; Zhao, Y.; Wang, Y.; Wu, Y.N. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: Piscataway, NJ, USA, 2019; pp. 12126–12134. [Google Scholar]
- Mercat, J.; Gilles, T.; Zoghby, N.E.; Sandou, G.; Beauvois, D.; Gil, G.P. Multi-Head Attention for Multi-Modal Joint Vehicle Motion Forecasting. In Proceedings of the 2020 IEEE International Conference on Robotics and Automation (ICRA), Paris, France, 31 May–31 August 2020; pp. 9638–9644. [Google Scholar] [CrossRef]
- Zhang, T.; Song, W.; Fu, M.; Yang, Y.; Wang, M. Vehicle Motion Prediction at Intersections Based on the Turning Intention and Prior Trajectories Model. IEEE/CAA J. Autom. Sin. 2021, 8, 1657–1666. [Google Scholar] [CrossRef]
- Zhong, Z.; Luo, Y.; Liang, W. STGM: Vehicle Trajectory Prediction Based on Generative Model for Spatial-Temporal Features. IEEE Trans. Intell. Transp. Syst. 2022, 23, 18785–18793. [Google Scholar] [CrossRef]
- Xie, X.; Zhang, C.; Zhu, Y.; Wu, Y.N.; Zhu, S.-C. Congestion-aware Multi-agent Trajectory Prediction for Collision Avoidance. In Proceedings of the 2021 IEEE International Conference on Robotics and Automation (ICRA), Xi’an, China, 30 May–5 June 2021; pp. 13693–13700. [Google Scholar] [CrossRef]
- Carrasco, S.; Llorca, D.F.; Sotelo, M.A. SCOUT: Socially-COnsistent and UndersTandable Graph Attention Network for Trajectory Prediction of Vehicles and VRUs. In Proceedings of the 2021 IEEE Intelligent Vehicles Symposium (IV), Nagoya, Japan, 11–17 July 2021; pp. 1501–1508. [Google Scholar] [CrossRef]
- Liu, C.; Chen, Y.; Liu, M.; Shi, B.E. AVGCN: Trajectory Prediction using Graph Convolutional Networks Guided by Human Attention. In Proceedings of the 2021 IEEE International Conference on Robotics and Automation (ICRA), Xi’an, China, 30 May–5 June 2021; pp. 14234–14240. [Google Scholar] [CrossRef]
- Guo, S.; Lin, Y.; Feng, N.; Song, C.; Wan, H. Attention based spatial-temporal graph convolutional networks for traffic flow forecasting. Proc. AAAI Conf. Artif. Intell. 2019, 33, 922–929. [Google Scholar] [CrossRef]
- Wang, S.; Zhang, M.; Miao, H.; Peng, Z.; Yu, P.S. Multivariate Correlation-aware Spatio-temporal Graph Convolutional Networks for Multi-scale Traffic Prediction. ACM Trans. Intell. Syst. Technol. 2022, 13, 1–12. [Google Scholar] [CrossRef]
- Peng, H.; Wang, H.; Du, B.; Bhuiyan, M.Z.A.; Ma, H.; Liu, J.; Wang, L.; Yang, Z.; Du, L.; Wang, S.; et al. Spatial temporal incidence dynamic graph neural networks for traffic flow forecasting. Inf. Sci. 2022, 521, 277–290. [Google Scholar] [CrossRef]
- Zhang, Z. Improved Adam Optimizer for Deep Neural Networks. In Proceedings of the 2018 IEEE/ACM 26th International Symposium on Quality of Service (IWQoS), Banff, AB, Canada, 4–6 June 2018; pp. 1–2. [Google Scholar] [CrossRef]
- NGSIM—Next Generation Simulation [EB/OL]. Available online: https://ops.fhwa.dot.gov/trafficanalysistools/ngsim.htm (accessed on 1 October 2022).
- Zhao, Y.; Yu, H.; Yang, Y.; Xu, L.; Pan, S.; Ren, Y. Flexible and Secure Cross-Domain Signcrypted Data Authorization in Multi-Platoon Vehicular Networks. IEEE Trans. Intell. Transp. Syst. 2023, 1–11. [Google Scholar] [CrossRef]
- Cho, K.; van Merrienboer, B.; Gulcehre, C.; Bahdanau, D.; Bougares, F.; Schwenk, H.; Bengio, Y. Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv 2014, arXiv:1406.1078,2014. [Google Scholar]
- Ho, J.; Ermon, S. Generative adversarial imitation learning. Adv. Neural Inf. Process. Syst. 2016, 29. [Google Scholar]
- Paszke, A.; Gross, S.; Chintala, S.; Chanan, G.; Yang, E.; DeVito, Z.; Lin, Z.; Desmaison, A.; Antiga, L.; Lerer, A. Automatic differentiation in PyTorch. In Proceedings of the Advances in Neural Information Processing Systems Workshops, Long Beach, CA, USA, 4–9 December 2017; pp. 1–4. [Google Scholar]
- Jiang, H.; Ren, Y.; Fang, J.; Yang, Y.; Xu, L.; Yu, H. SHIP: A State-Aware Hybrid Incentive Program for Urban Crowd Sensing with For-Hire Vehicles. IEEE Trans. Intell. Transp. Syst. 2023, 1–13. [Google Scholar] [CrossRef]
- Schöller, C.; Aravantinos, V.; Lay, F.; Knoll, A. What the constant velocity model can teach us about pedestrian motion prediction. IEEE Robot. Autom. Lett. 2020, 5, 1696–1703. [Google Scholar] [CrossRef]
Serial No. | Data Fields | Explanations |
---|---|---|
1 | Vehicle | Vehicle ID |
2 | Local | X value in regional coordinate frame |
3 | Local | Y value in regional coordinate frame |
4 | v | Vehicle length |
5 | v | Vehicle width |
6 | v | Vehicle velocity |
7 | v | Vehicle acceleration |
8 | Lane | The lane number of the vehicle |
9 | Int | The number of intersection |
Time Consumption (s) | ||||
---|---|---|---|---|
CV | GRU | V-LSTM | GAIL-GRU | TPASTGCN |
0.02864 | 0.87395 | 0.08742 | 0.99733 | 0.07945 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Li, H.; Ren, Y.; Li, K.; Chao, W. Trajectory Prediction with Attention-Based Spatial–Temporal Graph Convolutional Networks for Autonomous Driving. Appl. Sci. 2023, 13, 12580. https://doi.org/10.3390/app132312580
Li H, Ren Y, Li K, Chao W. Trajectory Prediction with Attention-Based Spatial–Temporal Graph Convolutional Networks for Autonomous Driving. Applied Sciences. 2023; 13(23):12580. https://doi.org/10.3390/app132312580
Chicago/Turabian StyleLi, Hongbo, Yilong Ren, Kaixuan Li, and Wenjie Chao. 2023. "Trajectory Prediction with Attention-Based Spatial–Temporal Graph Convolutional Networks for Autonomous Driving" Applied Sciences 13, no. 23: 12580. https://doi.org/10.3390/app132312580
APA StyleLi, H., Ren, Y., Li, K., & Chao, W. (2023). Trajectory Prediction with Attention-Based Spatial–Temporal Graph Convolutional Networks for Autonomous Driving. Applied Sciences, 13(23), 12580. https://doi.org/10.3390/app132312580