Reinforcement Learning for Transit Signal Priority with Priority Factor
Abstract
:Highlights
- What are the main findings?
- Effective Reduction of Bus Travel Times: The study’s proposed transit signal priority (TSP) control system that incorporates a priority factor (PF) significantly reduces average travel times for buses—17% in the arterial network and 25% in the grid network.
- Mitigation of Conflicting Requests: The RL-based TSP with PF effectively resolves conflicting priority requests when multiple buses approach an intersection from different directions.
- Minimal Impact on Passenger Cars: While the average travel time for passenger cars increased (by up to 12% in the arterial network and 7% in the grid network), the impact is not significant compared to the improvements in bus travel times.
- Dynamic Assignment of PF: The PF can be dynamically assigned based on the number of passengers on each bus, enhancing the system’s adaptability to varying traffic conditions.
- Longer Cycle Length and Signal Splits: An increase in buses with higher PF leads to longer cycle lengths and extended signal phase durations for directions with buses.
- What is the implication of the main finding?
- Improved Public Transit Efficiency: The findings suggest that integrating advanced control strategies like RL and dynamic PFs can enhance the efficiency of public transportation systems, making them more appealing and potentially increasing ridership.
- Balanced Traffic Management: The study indicates a feasible approach for balancing the needs of both transit and non-transit users at signalized intersections, which is crucial for urban mobility and reducing overall congestion.
- Scalability and Adaptability: The ability to dynamically adjust the PF based on passenger load suggests that this approach can be tailored to different traffic conditions and demands, making it adaptable for various urban environments.
- Future Research Directions: The findings underscore the need for further exploration into real-time PF determination and the integration of additional factors (like passenger waiting times) to enhance the operational efficiency and reliability of transit systems.
Abstract
1. Introduction
- First, develop a TSP control method that utilizes RL-based strategies in a mixed-traffic environment.
- Second, provide a comprehensive theoretical framework for designing the key components of the RL-based system, including the state, action, and reward structures.
- Third, address the negative externalities experienced by non-transit users and the necessity of effectively managing conflicting priority requests.
- Reduction in Bus Travel Times: The method significantly decreases bus travel times while maintaining minimal negative impacts on passenger car operations, thereby enhancing overall traffic efficiency.
- Conflict Resolution: The approach effectively addresses and resolves conflicting priority requests between buses through the implementation of the PF, ensuring smoother transit operations at signalized intersections.
- Dynamic Assignment of Priority Factor: The research reveals that the PF can be dynamically assigned to individual buses based on their passenger loads. This adaptability indicates the potential for the proposed method to be tailored to various traffic management scenarios, accommodating the specific needs of different transit agencies.
- Examines signal timing: the paper examines signal timing within the framework of the proposed RL-based TSP control method. The analysis reveals that an increase in the number of buses assigned a higher PF is associated with longer cycle lengths. Furthermore, with respect to signal splits, the duration of the traffic signal phases is extended when servicing directions that include buses with PF. This outcome aligns with expectations because the RL agent prioritizes the swift passage of buses through intersections. Examining signal timing provides insights into the control behavior of RL agents.
2. Literature Review
2.1. Transit Signal Priority (TSP)
2.2. Reinforcement Learning (RL)-Based Traffic Signal Control
2.3. TSP with RL
2.4. Summary
3. Methodology
3.1. State, Action and Reward Design
3.1.1. State Design
3.1.2. Action Design
3.1.3. Reward Design
- Land-based MP control derivation
- Determination of positive coefficient values
- Reward function
3.2. Justification for Reward Function
3.3. Learning Process
3.4. TSP Control with Priority Factor (PF)
4. Experimental Setting
4.1. Simulator
4.2. Road Networks
4.3. Traffic Light Signal Plan
4.4. Traffic Volume and Turning Ratio
4.5. Bus Routes
5. Results
5.1. No Priority Control
- Light-to-Medium (LM) Volume
- In the Arterial1×3 network, the proposed model consistently exhibits the shortest ATT. Specifically, for routes B310 and B320, the ATT ranges from 133 to 134 s, while for routes B330 and B340, the ATT ranges from 45 to 46 s.
- In the Grid2×2 network, the proposed model outperforms both MP and fixed-time control strategies, with the ATT ranging from 154 to 162 s.
- Both in Arterial1×3 and Grid2×2 networks, MP control shows moderate performance, and fixed-time control results in the highest travel times highlighting its inadequacy in dynamic traffic situations.
- Medium-to-Heavy (MH) Volume
- In the Arterial1×3 network, the proposed model with an ATT between 160 and 52 s maintains its superior performance, which is significantly lower than the other methods.
- In the Grid2×2 network, the proposed model continues to lead in performance with an ATT ranging from 195 to 220 s.
- MP control shows a decline in efficiency under heavier traffic conditions. Fixed-time control exhibits the highest travel times which further underscores its limitations in handling medium-to-heavy traffic volumes.
- Summary
- The proposed model demonstrates superior performance relative to both fixed-time control and MP control. As traffic volume increases, the advantages of the proposed model become more pronounced.
- MP control, which is a greedy algorithm, achieves optimal solutions at each time step. However, it frequently switches signal phases, leading to increased lost time.
- Fixed-time control, which is predicated on overly simplistic assumptions or prior knowledge of traffic patterns, is prone to failure in dynamic traffic scenarios.
5.2. Priority Control
5.2.1. Cases
5.2.2. ATT Analysis
- Bus ATTs
- Bus routes with a PF greater than 1 exhibit a lower ATT compared to the Base case with a PF of 1 under identical traffic conditions. The reduction in the ATT is notable, with the highest decrease observed at 17% in the Arterial1×3 network and 25% in the Grid2×2 network.
- In Case 1 of both the Arterial1×3 and Grid2×2 networks, agents are inclined to grant right-of-way to priority bus routes at intersections. Consequently, non-priority bus routes experience longer crossing times at these intersections. This represents a trade-off for allowing priority buses to traverse intersections more rapidly.
- In Case 2 for both Arterial1×3 and Grid2×2, all bus routes demonstrate reduced ATTs compared to the Base case due to the application of PF across the board. However, the ATT for routes B310 and B320 in Arterial1×3, as well as B310 in Grid2×2 in Case 2, is longer than in Case 1. This outcome indicates that agents do not prioritize a specific direction for bus routes at intersections uniformly; rather, all bus routes compete for the right-of-way. In the Arterial1×3 network, arterial buses compete with side-street buses for priority, while in the Grid2×2 network, bus routes from different directions vie for precedence. These findings illustrate that the proposed TSP control with PF effectively addresses conflicting priority requests.
- 2.
- Passenger car ATTs
- Agents prioritize buses over passenger cars to enhance the efficiency of bus travel times. Consequently, the ATT for passenger cars can increase by as much as 12% in Arterial1×3 and 7% in Grid2×2, while the ATT for buses may be reduced by up to 17% and 25%, respectively. This increase in passenger car travel time is considered acceptable due to its relatively minor impact.
- Generally, it is observed that the ATT for passenger cars tends to lengthen as the PF increases.
- Across both Arterial1×3 and Grid2×2 networks, passenger cars in Case 2 experience shorter ATTs compared to Case 1. This improvement can be attributed to the facilitation provided by all buses with a PF greater than 1, which enables passenger cars traveling in multiple directions to cross intersections more efficiently.
5.2.3. Signal Timing Analysis
- 1.
- Average cycle length
- 2.
- Split
5.3. Dynamic PF
6. Conclusions
7. Future Research
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Xu, M.; Ye, Z.; Sun, H.; Wang, W. Optimization model for transit signal priority under conflicting priority requests. Transp. Res. Rec. J. Transp. Res. Board 2016, 2539, 140–148. [Google Scholar] [CrossRef]
- Zeng, X.; Zhang, Y.; Balke, K.N.; Yin, K. A real-time Transit Signal Priority Control Model considering stochastic bus arrival time. IEEE Trans. Intell. Transp. Syst. 2014, 15, 1657–1666. [Google Scholar] [CrossRef]
- Wang, X.; Yin, Y.; Feng, Y.; Liu, H.X. Learning the max pressure control for urban traffic networks considering the phase switching loss. Transp. Res. Part C Emerg. Technol. 2022, 140, 103670. [Google Scholar] [CrossRef]
- Abdulhai, B.; Pringle, R.; Karakoulas, G.J. Reinforcement Learning for True Adaptive Traffic Signal Control. J. Transp. Eng. 2003, 129, 278–285. [Google Scholar] [CrossRef]
- Arel, I.; Liu, C.; Urbanik, T.; Kohls, A.G. Reinforcement learning-based multi-agent system for network traffic signal control. IET Intell. Transp. Syst. 2010, 4, 128–135. [Google Scholar] [CrossRef]
- Chu, T.; Wang, J.; Codecà, L.; Li, Z. Multi-agent deep reinforcement learning for large-scale traffic signal control. IEEE Trans. Intell. Transp. Syst. 2020, 21, 1086–1095. [Google Scholar] [CrossRef]
- Levin, M.W. Max-pressure traffic signal timing: A summary of methodological and experimental results. J. Transp. Eng. Part A Syst. 2023, 149, 4. [Google Scholar] [CrossRef]
- Ma, D.; Xiao, J.; Song, X.; Ma, X.; Jin, S. A back-pressure-based model with fixed phase sequences for traffic signal optimization under oversaturated networks. IEEE Trans. Intell. Transp. Syst. 2020, 22, 5577–5588. [Google Scholar] [CrossRef]
- Levin, M.W.; Hu, J.; Odell, M. Max-pressure signal control with cyclical phase structure. Transp. Res. Part C Emerg. Technol. 2020, 120, 102828. [Google Scholar] [CrossRef]
- Wei, H.; Chen, C.; Zheng, G.; Wu, K.; Gayah, V.; Xu, K. PressLight: Learning max pressure control to coordinate traffic signals in arterial network. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Anchorage, AK, USA, 4–8 August 2019; pp. 1290–1298. [Google Scholar]
- Wei, H.; Xu, N.; Zhang, H.; Zheng, G.; Zang, X.; Chen, C.; Zhang, W.; Zhu, Y.; Xu, K.; Li, Z. CoLight: Learning network-level cooperation for traffic signal control. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management, Beijing, China, 3–7 November 2019; pp. 1913–1922. [Google Scholar]
- Wei, H.; Zheng, G.; Gayah, V.; Li, Z. A survey on traffic signal control methods. arXiv 2019, arXiv:1904.08117. Available online: http://arxiv.org/abs/1904.08117 (accessed on 1 September 2024).
- Noaeen, M.; Naik, A.; Goodman, L.; Crebo, J.; Abrar, T.; Abad, Z.S.H.; Bazzan, A.L.; Far, B. Reinforcement learning in urban network traffic signal control: A systematic literature review. Expert Syst. Appl. 2022, 199, 116830. [Google Scholar] [CrossRef]
- Xu, T.; Barman, S.; Levin, M.W.; Chen, R.; Li, T. Integrating public transit signal priority into Max-Pressure Signal Control: Methodology and Simulation Study on a downtown network. Transp. Res. Part C Emerg. Technol. 2022, 138, 103614. [Google Scholar] [CrossRef]
- Lin, Y.; Yang, X.; Zou, N.; Franz, M. Transit signal priority control at signalized intersections: A comprehensive review. Transp. Lett. 2014, 7, 168–180. [Google Scholar] [CrossRef]
- Ngan, V.W.K. A comprehensive strategy for transit signal priority. Master’s Thesis, University of British Columbia, Vancouver, BC, Cananda, 2002. [Google Scholar]
- Smith, H.R.; Hemily, P.B.; Ivanovic, M. Transit Signal Priority (TSP): A Planning and Implementation Handbook; ITS America: Washington, DC, USA, 2005. [Google Scholar]
- Furth, P.G.; Muller, T.H. Conditional bus priority at signalized intersections: Better Service with less traffic disruption. Transp. Res. Rec. J. Transp. Res. Board 2000, 1731, 23–30. [Google Scholar] [CrossRef]
- Mohammadi, R.; Roncoli, C.; Mladenovic, M.N. Transit Signal Priority in a connected vehicle environment: User throughput and schedule delay optimization approach. In Proceedings of the 2020 Forum on Integrated and Sustainable Transportation Systems (FISTS), Delft, The Netherlands, 3–5 November 2020. [Google Scholar] [CrossRef]
- He, S.; Han, H.; Zhang, H.; Sun, S.; Qiu, T.Z. Connected Transit Bus Dynamic Priority Weight Modeling and conflicting request resolution control at the signalized intersection. J. Adv. Transp. 2022, 2022, 1–22. [Google Scholar] [CrossRef]
- Ma, W.; Liu, Y.; Yang, X. A dynamic programming approach for optimal signal priority control upon multiple high-frequency bus requests. J. Intell. Transp. Syst. 2012, 17, 282–293. [Google Scholar] [CrossRef]
- Xu, M.; An, K.; Ye, Z.; Wang, Y.; Feng, J.; Zhao, J.; Xu, M.; An, K.; Ye, Z.; Wang, Y.; et al. A bi-level model to resolve conflicting transit priority requests at Urban Arterials. IEEE Trans. Intell. Transp. Syst. 2019, 20, 1353–1364. [Google Scholar] [CrossRef]
- Zeng, X.; Zhang, Y.; Jiao, J.; Yin, K. Route-based transit signal priority using connected vehicle technology to promote bus schedule adherence. IEEE Trans. Intell. Transp. Syst. 2021, 22, 1174–1184. [Google Scholar] [CrossRef]
- Sutton, R.S.; Barto, A.G. Reinforcement Learning: An Introduction; MIT Press: Cambridge, MA, USA, 2018. [Google Scholar]
- Arulkumaran, K.; Deisenroth, M.P.; Brundage, M.; Bharath, A.A. Deep reinforcement learning: A brief survey. IEEE Signal Process. Mag. 2017, 34, 26–38. [Google Scholar] [CrossRef]
- Chanloha, P.; Chinrungrueng, J.; Usaha, W.; Aswakul, C. Cell transmission model-based multiagent Q-learning for network-scale signal control with transit priority. Comput. J. 2013, 57, 451–468. [Google Scholar] [CrossRef]
- Shabestray, S.M.A.; Abdulhai, B. Multimodal iNtelligent deep (MiND) traffic signal controller. In Proceedings of the 2019 IEEE Intelligent Transportation Systems Conference (ITSC), Auckland, New Zealand, 27–30 October 2019; pp. 4532–4539. [Google Scholar] [CrossRef]
- Cheng, H.K.; Kou, K.P.; Wong, K.I. Transit Signal Priority Control with Deep Reinforcement Learning. In Proceedings of the 2022 10th International Conference on Traffic and Logistic Engineering (ICTLE), Macau, China, 12–14 August 2022. [Google Scholar] [CrossRef]
- Varaiya, P. Max pressure control of a network of signalized intersections. Transp. Res. Part C Emerg. Technol. 2013, 36, 177–195. [Google Scholar] [CrossRef]
- Zheng, G.; Zang, X.; Xu, N.; Wei, H.; Yu, Z.; Gayah, V.; Xu, K.; Li, Z. Diagnosing reinforcement learning for traffic signal control. arXiv 2019, arXiv:1905.04716. [Google Scholar]
- “PTV vissim”, Traffic Simulation Software|PTV Vissim|PTV Group. Available online: https://www.ptvgroup.com/ (accessed on 9 December 2023).
Disclaimer/Publisher’s Note: The statements, opinions, and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions, or products referred to in the content. |
TSP Types | Advantages | Disadvantages | Control Strategy |
---|---|---|---|
Passive | Does not require a transit detection system and is easy to implement | Not suitable for high fluctuation traffic situation | Predetermining signal timings |
Active | Considers transit conditions or specific criteria | May fall short of efficiency under high transit volumes situation | Adjustments of signal timing; common strategies are green extension and red truncation |
Adaptive | Considers trade-offs between transit and traffic conditions | Sophisticated and complex systems | Signal control algorithm that provides priority while explicitly considering the impacts on the rest of the traffic |
Arterial1×3 | Grid2×2 | |||||||
---|---|---|---|---|---|---|---|---|
Bus Route | B310 | B320 | B330 | B340 | B310 | B320 | B330 | B340 |
LM volume | ||||||||
Proposed | 134 | 133 | 46 | 45 | 158 | 154 | 163 | 162 |
MP | 175 | 176 | 46 | 46 | 175 | 176 | 174 | 171 |
Fixed time | 184 | 181 | 50 | 53 | 167 | 165 | 209 | 184 |
MH volume | ||||||||
Proposed | 160 | 156 | 53 | 52 | 199 | 195 | 212 | 220 |
MP | 216 | 219 | 56 | 56 | 259 | 260 | 239 | 227 |
Fixed time | 318 | 367 | 132 | 145 | 347 | 358 | 410 | 451 |
Arterial1×3 | Grid2×2 | |||||||
---|---|---|---|---|---|---|---|---|
Bus Route | B310 | B320 | B330 | B340 | B310 | B320 | B330 | B340 |
Base case | NP | NP | NP | NP | NP | NP | NP | NP |
Case 1 | PF | PF | NP | NP | PF | NP | NP | NP |
Case 2 | PF | PF | PF | PF | PF | PF | PF | PF |
Base Case | Case 1 | Case 2 | |||||
---|---|---|---|---|---|---|---|
Priority Level | NP | Low PF = 2 | Medium PF = 5 | High PF = 8 | Low PF = 2 | Medium PF = 5 | High PF = 8 |
LM volume | |||||||
B310 | 134 | 127 (−5%) | 118 (−12%) | 110 (−17%) | 131 (−2%) | 120 (−10%) | 115 (−14%) |
B320 | 133 | 127 (−5%) | 118 (−12%) | 111 (−17%) | 127 (−4%) | 120 (−9%) | 115 (−13%) |
B330 | 46 | 44 (−3%) | 49 (+7%) | 50 (+9%) | 45 (−2%) | 44 (−3%) | 41 (−10%) |
B340 | 45 | 46 (+3%) | 47 (+5%) | 50 (+13%) | 44 (−2%) | 42 (−6%) | 41 (−7%) |
MH volume | |||||||
B310 | 160 | 153 (−4%) | 141 (−12%) | 135 (−16%) | 145 (−10%) | 146 (−9%) | 133 (−17%) |
B320 | 156 | 151 (−3%) | 135 (−14%) | 134 (−14%) | 149 (−4%) | 137 (−12%) | 140 (−11%) |
B330 | 53 | 52 (−1%) | 54 (+3%) | 56 (+6%) | 49 (−7%) | 49 (−7%) | 45 (−15%) |
B340 | 52 | 54%) (+5%) | 58 (+12%) | 57 (+11%) | 49 (−6%) | 49 (−5%) | 48 (−8%) |
Base Case | Case 1 | Case 2 | |||||
---|---|---|---|---|---|---|---|
Priority Level | Non PF = 1 | Low PF = 2 | Medium PF = 5 | High PF = 8 | Low PF = 2 | Medium PF = 5 | High PF = 8 |
LM volume | |||||||
B310 | 158 | 148 (−6%) | 129 (−18%) | 118 (−25%) | 153 (−3%) | 133 (−16%) | 134 (−15%) |
B320 | 154 | 155 (0%) | 160 (+3%) | 158 (+2%) | 151 (−3%) | 135 (−13%) | 133 (−14%) |
B330 | 163 | 163 (0%) | 176 (+8%) | 172 (+5%) | 162 (−1%) | 153 (−6%) | 147 (−10%) |
B340 | 162 | 159 (−2%) | 172 (+6%) | 173 (+7%) | 161 (−1%) | 145 (−11%) | 148 (−9%) |
MH volume | |||||||
B310 | 199 | 196 (−2%) | 178 (−10%) | 167 (−16%) | 185 (−7%) | 177 (−11%) | 164 (−17%) |
B320 | 195 | 215 (+10%) | 202 (+3%) | 210 (+8%) | 192 (−2%) | 167 (−15%) | 164 (−16%) |
B330 | 212 | 230 (+8%) | 235 (+11%) | 229 (+8%) | 207 (−2%) | 181 (−15%) | 188 (−11%) |
B340 | 220 | 254 (+16%) | 255 (+16%) | 266 (+21%) | 205 (−7%) | 205 (−7%) | 185 (−16%) |
Base Case | Case 1 | Case 2 | |||||
---|---|---|---|---|---|---|---|
Priority Level | Non PF = 1 | Low PF = 2 | Medium PF = 5 | High PF = 8 | Low PF = 2 | Medium PF = 5 | High PF = 8 |
Side-street passenger cars in Arterial1×3 | |||||||
LM | 45 | 45 (+1%) | 47 (+6%) | 50 (+12%) | 45 (0%) | 46 (+4%) | 48 (+7%) |
MH | 51 | 52 (+2%) | 54 (+5%) | 57 (+11%) | 50 (−3%) | 52 (+2%) | 54 (+6%) |
All passenger cars in Grid2×2 | |||||||
LM | 123 | 123 (0%) | 126 (+3%) | 126 (+3%) | 124 (+1%) | 128 (+4%) | 129 (+5%) |
MH | 156 | 167 (+7%) | 162 (+4%) | 167 (+7%) | 159 (+2%) | 157 (0%) | 158 (+1%) |
Base Case | Case 1 | Case 2 | |||||
---|---|---|---|---|---|---|---|
PF = 1 | PF = 2 | PF = 5 | PF = 8 | PF = 2 | PF = 5 | PF = 8 | |
LM volume | |||||||
Controller A | 49 | 47 | 51 | 54 | 48 | 50 | 55 |
Controller B | 48 | 48 | 51 | 54 | 50 | 55 | 56 |
Controller C | 48 | 47 | 50 | 54 | 49 | 51 | 54 |
MH volume | |||||||
Controller A | 59 | 60 | 61 | 64 | 59 | 61 | 67 |
Controller B | 60 | 61 | 64 | 66 | 60 | 65 | 71 |
Controller C | 59 | 62 | 65 | 68 | 61 | 63 | 64 |
Base Case | Case 1 | Case 2 | |||||
---|---|---|---|---|---|---|---|
PF = 1 | PF = 2 | PF = 5 | PF = 8 | PF = 2 | PF = 5 | PF = 8 | |
LM volume | |||||||
Controller A | 47 | 48 | 50 | 52 | 48 | 56 | 60 |
Controller B | 47 | 48 | 50 | 52 | 50 | 55 | 57 |
Controller C | 48 | 48 | 48 | 48 | 50 | 57 | 59 |
Controller D | 47 | 47 | 49 | 49 | 48 | 55 | 60 |
MH volume | |||||||
Controller A | 63 | 68 | 73 | 68 | 66 | 71 | 73 |
Controller B | 63 | 66 | 68 | 69 | 61 | 67 | 65 |
Controller C | 62 | 66 | 64 | 68 | 64 | 70 | 78 |
Controller D | 59 | 69 | 66 | 70 | 60 | 68 | 74 |
Base Case | Case 1 | Case 2 | |||||
---|---|---|---|---|---|---|---|
PF = 1 | PF = 2 | PF = 5 | PF = 8 | PF = 2 | PF = 5 | PF = 8 | |
LM volume | |||||||
Phase 1 | 22% | 22% | 21% | 19% | 21% | 19% | 19% |
Phase 2 | 29% | 29% | 33% | 34% | 29% | 32% | 32% |
Phase 3 | 21% | 22% | 20% | 19% | 21% | 19% | 19% |
Phase 4 | 28% | 27% | 27% | 27% | 30% | 30% | 31% |
MH volume | |||||||
Phase 1 | 21% | 20% | 22% | 19% | 21% | 20% | 20% |
Phase 2 | 30% | 31% | 31% | 32% | 31% | 31% | 31% |
Phase 3 | 19% | 19% | 19% | 21% | 19% | 18% | 18% |
Phase 4 | 30% | 30% | 28% | 29% | 29% | 32% | 31% |
Base Case | Case 1 | Case 2 | |||||
---|---|---|---|---|---|---|---|
PF = 1 | PF = 2 | PF = 5 | PF = 8 | PF = 2 | PF = 5 | PF = 8 | |
LM volume | |||||||
Phase 1 | 22% | 21% | 20% | 20% | 21% | 19% | 18% |
Phase 2 | 28% | 29% | 30% | 33% | 28% | 28% | 30% |
Phase 3 | 24% | 24% | 23% | 22% | 24% | 27% | 27% |
Phase 4 | 27% | 26% | 26% | 25% | 27% | 26% | 25% |
MH volume | |||||||
Phase 1 | 20% | 18% | 19% | 17% | 19% | 17% | 17% |
Phase 2 | 30% | 29% | 31% | 32% | 29% | 29% | 30% |
Phase 3 | 24% | 24% | 22% | 24% | 23% | 27% | 25% |
Phase 4 | 27% | 29% | 29% | 27% | 28% | 28% | 28% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Cheng, H.-K.; Kou, K.-P.; Wong, K.-I. Reinforcement Learning for Transit Signal Priority with Priority Factor. Smart Cities 2024, 7, 2861-2886. https://doi.org/10.3390/smartcities7050111
Cheng H-K, Kou K-P, Wong K-I. Reinforcement Learning for Transit Signal Priority with Priority Factor. Smart Cities. 2024; 7(5):2861-2886. https://doi.org/10.3390/smartcities7050111
Chicago/Turabian StyleCheng, Hoi-Kin, Kun-Pang Kou, and Ka-Io Wong. 2024. "Reinforcement Learning for Transit Signal Priority with Priority Factor" Smart Cities 7, no. 5: 2861-2886. https://doi.org/10.3390/smartcities7050111
APA StyleCheng, H. -K., Kou, K. -P., & Wong, K. -I. (2024). Reinforcement Learning for Transit Signal Priority with Priority Factor. Smart Cities, 7(5), 2861-2886. https://doi.org/10.3390/smartcities7050111