Public Transportation Operational Health Assessment Based on Multi-Source Data
Abstract
:Featured Application
Abstract
1. Introduction
2. Concept Definition and Calculation
2.1. Factors Influencing the Health of Public Transportation Operations
2.2. Definition of Eigenvalues of Influencing Factors
2.3. Calculation of the Eigenvalues of Influencing Factors
3. Causes of Health in Public Transport Operations
3.1. Random Forest Algorithm Based on Bagging
3.2. Random Forest Model Optimization and Evaluation
3.3. Analysis of the Causes of Public Transportation Operational Health
4. Conclusions
- (1)
- The study quantitatively determines the extent to which different factors affect the health of bus operations. It changes the limitations of previous studies that only find the symptoms, but not the causes of the disease.
- (2)
- The model is based on basic information about public transport operations, which is easily accessible. The method is uncomplicated to implement, and the results are highly accurate and usable. It can be universally applied to conventional public transport scenarios.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Qian, L.H. The fundamental way to solve the traffic problems of big cities is to vigorously develop public transportation. Beijing Plan. Constr. 1998, 6, 25–27. [Google Scholar]
- Shanghai Comprehensive Transportation Annual Report (2020). Traffic Transp. 2019, 34, 10–12.
- Wang, P.; Lin, J.J.; Barnum, D. Data envelopment analysis of bus service reliability using automatic vehicle location data. In Proceedings of the 86th Annual Meeting of Transportation Research Board, Washington, DC, USA, 9–13 January 2008. [Google Scholar]
- Liu, X.; Graham, D.J. Development of a Key Performance Indicator to Compare Regularity of Service between Urban Bus Operations. Transp. Res. Rec. J. Transp. Res. Board 2011, 2216, 33–41. [Google Scholar]
- Wei, Q.B.; Yang, J.F.; Chen, C.J. Study on the evaluation index system of public transport operation service in Guangzhou. Transp. Res. 2016, 2, 17–23. [Google Scholar]
- Wen, H.Y.; Wu, L.F.; Mei, J.J. Fuzzy Comprehensive Evaluation of Guangzhou-Foshan Intercity Bus Satisfaction Based on Improved AHP Method. J. Zhongshan Univ. (Nat. Sci. Ed.) 2018, 5, 64–71. [Google Scholar]
- Chen, Y.Y.; Cai, Y.W.; Hou, Y.M. Evaluation method of “health index” of bus routes in large cities. J. Changan Univ. Nat. Sci. Ed. 2015, 35, 1–6. [Google Scholar]
- Lin, Y.; Yang, X.; Zou, N. Real time bus arrival time prediction: Case study for Jinan. J. Transp. Eng. 2013, 11, 1133–1140. [Google Scholar] [CrossRef]
- Wang, C. Research on the evaluation system of road traffic health status in old urban areas. Urban Roads Flood Control 2016, 12, 143–147. [Google Scholar]
- Zheng, C.J.; Zhang, Y.H.; Feng, X.J. Improved iterative prediction for multiple stop arrival time using a support vector machine. Transport 2012, 2, 158–164. [Google Scholar] [CrossRef]
- Ke, J.Y. Evaluation of transportation efficiency of bus routes based on improved DEA. In Proceedings of the 2018 Annual Conference on Urban Transportation Planning in China, Qingdao, China, 16–18 October 2018. [Google Scholar]
- Lu, H.P.; Yang, M.; Zhang, Y.B. Assessment of integrated development of comprehensive transportation hubs and suggestions for countermeasures. Compr. Transp. 2019, 4, 25–30. [Google Scholar]
- Shen, L.; Feng, T.; Mo, Y. Passenger perception-based service quality assessment of pick-up bus service. Highw. Transp. Technol. 2019, 8, 152–158. [Google Scholar]
- Zhang, L.; Panos, D.P. Signalized Intersection LOS that Accounts for User Perceptions. In Proceedings of the 83th Annual Meeting of the Transportation Research Board, National Research Council, Washington, DC, USA, 11–15 January 2004. [Google Scholar]
- Xin, G.Z. Research on Multi-Modal Bus Network Effectiveness Assessment Method. Master’s Thesis, Southeast University, Nanjing, China, 2015. [Google Scholar]
- Fan, L.J. Analysis of Bus Rapid Transit Operation Status and Service Level Classification. Master’s Thesis, Shandong University, Jinan, China, 2017. [Google Scholar]
- Ouyang, Y.; Nourbakhsh, S.M.; Cassidy, M.J. Continuum approximation approach to bus network design under spatially heterogeneous demand. Transp. Res. Part B Methodol. 2014, 68, 333–344. [Google Scholar] [CrossRef]
- Daniel, T.; Peng, Z.R. Timetable optimization for single bus line based on hybrid vehicle size model. J. Traffic Transp. Eng. 2015, 2, 179–186. [Google Scholar]
- Chou, C.H. Machine Learning; Tsinghua University Press: Beijing, China, 2016. [Google Scholar]
- Zhang, J.Q.; Shi, W.B.; Ji, X.J. Machine learning in healthcare and public health applications. China Public Health 2019, 10, 1449–1452. [Google Scholar]
- Zhang, R.; Wang, Y.B. Research on machine learning and its algorithms and development. J. Commun. Univ. China Nat. Sci. Ed. 2016, 2, 10–18. [Google Scholar]
- Zhao, J.Y. Research and Application of Machine Learning Methods for Health Assessment. Master’s Thesis, University of Electronic Science and Technology, Chengdu, China, 2013. [Google Scholar]
- Fan, X.H. Research on Support Vector Machine Classification Algorithm. Master’s Thesis, Northwestern Polytechnical University, Xian, China, 2016. [Google Scholar]
- Zheng, Y.; Hu, X.P.; Yin, J. A multi-task support vector machine based approach to health data fusion. Syst. Eng. Theory Pract. 2019, 2, 418–428. [Google Scholar]
- Yu, M.J.; Wang, Y.L. Research on plain Bayesian classification algorithm. Bus. Intell. 2012, 8, 226–227. [Google Scholar]
- Zhang, G.W.; Wang, L.; Wang, X.Y. Research on sand information extraction method based on CART decision tree. Arid Zone Geogr. 2019, 5, 1133–1140. [Google Scholar]
- Champahom, T. Evaluating user’s satisfaction of bus service in Mauritius: Decision tree approach. Lowl. Technol. Int. 2019, 4, 478–489. [Google Scholar]
- Zhu, S.Z. A study based on the implementation of k-nearest neighbor algorithm. Comput. Digit. Eng. 2015, 10, 1771–1774. [Google Scholar]
- Xiao, J. Research on the Classification Method of Imbalanced Data Based on Random Forest. Master’s Thesis, Harbin Institute of Technology, Harbin, China, 2013. [Google Scholar]
- Li, Q. Characterizing the importance of criminal factors affecting bus ridership using random forest ensemble algorithm. Transp. Res. Rec. 2019, 4, 864–876. [Google Scholar] [CrossRef]
- Zhou, Y. Evaluation Method of Bus Operation Based on Vehicle Positioning Data. Master’s Thesis, Southeast University, Nanjing, China, 2015. [Google Scholar]
- Chawla, N.V.; Bowyer, K.W.; Hall, L.O. SMOTE: Synthetic Minority Over-sampling Technique. J. Artif. Intell. Res. 2002, 1, 321–357. [Google Scholar] [CrossRef]
Date | Weather | Assignment |
---|---|---|
21-11-2016 | Cloudy | 2 |
22-11-2016 | Light rain | 4 |
23-11-2016 | Medium Rain | 4 |
24-11-2016 | Light rain | 3 |
25-11-2016 | Sunny | 1 |
Station Order | Site Name | Station Delay | Station Order | Site Name | Station Delay |
---|---|---|---|---|---|
0 | Train Station | - | 14 | Yuejin Road Middle | 19 |
1 | Bus Station A | 13 | 15 | Meishu Ceramic Factory | 5 |
2 | Chinese Hospital | 20 | 16 | Hualong Palace | 6 |
3 | Cyclone Hotel | 23 | 17 | Sha Gang | 24 |
4 | Zumiao A Station | 8 | 18 | Tamcheong-ri | 3 |
5 | Global International Plaza | 35 | 19 | Home Expo City South Gate | 8 |
6 | Customs | 15 | 20 | Lanshi Bridge West | 10 |
7 | Foshan Institute of Technology | 0 | 21 | Foshan Home Expo City | 8 |
8 | Pil Tang | 4 | 22 | Lanshi Road East | 5 |
9 | Tea Pavilion | 4 | 23 | Yin Yuan Market | 10 |
10 | Shiwan Bus Station | 16 | 24 | Lanshi High School | 7 |
11 | China Ceramic City | 3 | 25 | Provincial Spinning Institute | 5 |
12 | Shiwan Park | 24 | 26 | Stone Village | - |
13 | Tao Du | 8 | Total | 283 |
Signal No. | Cross Delay | Signal No. | Cross Delay |
---|---|---|---|
ID_1 | 23.696 | ID_11 | 13.004 |
ID_2 | 41.4 | ID_12 | 44.2 |
ID_3 | 33.85 | ID_13 | 58.93 |
ID_4 | 10.43 | ID_14 | 48.187 |
ID_5 | 13.727 | ID_15 | 17.404 |
ID_6 | 25.2 | ID_16 | 37.617 |
ID_7 | 15.986 | ID_17 | 12.711 |
ID_8 | 61.4 | ID_18 | 24.788 |
ID_9 | 31.416 | ID_19 | 54.329 |
ID_10 | 21.4 | Total | 589.675 |
Section No. | Road Delay | Section No. | Road Delay |
---|---|---|---|
road_1 | 38.966 | road_11 | 922.869 |
road_2 | 272.049 | road_12 | 378.659 |
road_3 | 660.432 | road_13 | 232.953 |
road_4 | 313.809 | road_14 | 91.732 |
road_5 | 948.306 | road_15 | 716.192 |
road_3 | 660.432 | road_13 | 232.953 |
road_6 | 17.342 | road_16 | 299.874 |
road_7 | 557.481 | road_17 | 28.525 |
road_8 | 102.213 | road_18 | 966.225 |
road_9 | 60.817 | road_19 | 49.424 |
road_10 | 807.132 | Total | 7396.55 |
Weather | Driver | Road | Lamp | Station | ||
---|---|---|---|---|---|---|
Health Level | Pearson Correlation | −0.050 | 0.056 | −0.693 ** | −0.260 ** | −0.305 ** |
Significance | 0.298 | 0.250 | 0.000 | 0.000 | 0.000 |
Random Forest Algorithm |
---|
1: Input: Dataset , Feature set k: k-base learners (Decision Tree Learners) 2: Take 70% of the data as training set and 30% of the data as test set randomly |
2: for i = i to k do 3: N samples are taken from data set D as the training set for the i decision tree randomly and with put-back 4: Specify the constant and randomly select a subset of m features from M features to form the feature set 5: Build a decision tree model, TreeGenerate ), where each tree is grown to the maximum extent possible 6: end for 7: Generate k decision tree models |
8: All k models make classification predictions for test set and return the most classified classification labels |
Parameters | Value |
---|---|
n_estimators | 111 |
max_depth | 15 |
max_features | 52 |
min_samples_leaf | 1 |
min_impurity_decrease | 0 |
Algorithms | Accuracy |
---|---|
Knn | 0.721 |
Decision Trees | 0.842 |
Random Forest | 0.926 |
Feature | Importance | Rank | Feature | Importance | Rank | Feature | Importance | Rank |
---|---|---|---|---|---|---|---|---|
road_10 | 0.1997 | 1 | ID_6 | 0.0129 | 22 | ID_4 | 0.0059 | 43 |
road_4 | 0.0945 | 2 | road_2 | 0.0123 | 23 | 6station | 0.0056 | 44 |
ID_10 | 0.0586 | 3 | road_1 | 0.012 | 24 | 19station | 0.0055 | 45 |
road_3 | 0.0361 | 4 | 3station | 0.0119 | 25 | road_19 | 0.0053 | 46 |
road_16 | 0.0342 | 5 | ID_11 | 0.0114 | 26 | ID_3 | 0.0052 | 47 |
road_5 | 0.0256 | 6 | ID_13 | 0.0109 | 27 | ID_15 | 0.0052 | 48 |
road_9 | 0.0252 | 7 | ID_1 | 0.0102 | 28 | ID_12 | 0.0052 | 49 |
road_12 | 0.0247 | 8 | ID_14 | 0.0102 | 29 | 9station | 0.0052 | 50 |
road_11 | 0.0239 | 9 | 21station | 0.0101 | 30 | ID_16 | 0.005 | 51 |
road_7 | 0.0223 | 10 | 4station | 0.0098 | 31 | 2station | 0.005 | 52 |
ID_18 | 0.0185 | 11 | ID_9 | 0.0095 | 32 | ID_7 | 0.0047 | 53 |
ID_5 | 0.0174 | 12 | 14station | 0.0088 | 33 | 5station | 0.0047 | 54 |
16station | 0.0174 | 13 | 25station | 0.0081 | 34 | road_15 | 0.0045 | 55 |
road_8 | 0.0163 | 14 | road_17 | 0.008 | 35 | ID_17 | 0.0045 | 56 |
ID_2 | 0.0152 | 15 | road_13 | 0.0078 | 36 | 15station | 0.0038 | 57 |
1station | 0.0151 | 16 | 22station | 0.0069 | 37 | 7station | 0.0035 | 58 |
18station | 0.0144 | 17 | ID_19 | 0.0066 | 38 | 8station | 0.0034 | 59 |
13station | 0.0143 | 18 | 12station | 0.0066 | 39 | 23station | 0.0032 | 60 |
road_6 | 0.0137 | 19 | 17station | 0.0064 | 40 | 11station | 0.003 | 61 |
road_14 | 0.0137 | 20 | 20station | 0.0063 | 41 | 24station | 0.0025 | 62 |
road_18 | 0.0131 | 21 | ID_8 | 0.006 | 42 | 10station | 0.0025 | 63 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhou, X.; Guan, Z.; Xi, J.; Wei, G. Public Transportation Operational Health Assessment Based on Multi-Source Data. Appl. Sci. 2021, 11, 10611. https://doi.org/10.3390/app112210611
Zhou X, Guan Z, Xi J, Wei G. Public Transportation Operational Health Assessment Based on Multi-Source Data. Applied Sciences. 2021; 11(22):10611. https://doi.org/10.3390/app112210611
Chicago/Turabian StyleZhou, Xuemei, Zhen Guan, Jiaojiao Xi, and Guohui Wei. 2021. "Public Transportation Operational Health Assessment Based on Multi-Source Data" Applied Sciences 11, no. 22: 10611. https://doi.org/10.3390/app112210611
APA StyleZhou, X., Guan, Z., Xi, J., & Wei, G. (2021). Public Transportation Operational Health Assessment Based on Multi-Source Data. Applied Sciences, 11(22), 10611. https://doi.org/10.3390/app112210611