1. Introduction
Signal control is one of the key factors affecting road performance and traffic congestion. A well-optimized signal control strategy at an intersection can alleviate urban road congestion and improve traffic efficiency.
To alleviate traffic pressure, researchers have proposed many advanced traffic signal control methods based on different control criteria. Webster’s well-known signal plan [1], put forward in 1958, was mainly used to calculate the green split so as to reduce delay. Subsequently, the adaptive traffic signal optimization systems SCATS [2] and SCOOT [3,4,5] were proposed and widely applied around the world. Choy et al. developed a hybrid neural network (SPSA-NN) for distributed real-time control of intersections in road networks. Their experimental results show that the algorithm can significantly reduce vehicle delay and the number of stops in road networks; however, it requires a large number of training samples [6]. In recent years, reinforcement learning has become the mainstream method in urban traffic signal control research. Chen et al. proposed an adaptive traffic signal control scheme for urban road networks based on multi-agent reinforcement learning (MARL). A delay-time estimation model predicts the total vehicle delay on each road section, and the traffic signals are adaptively controlled according to this estimate [7]. Genders and Razavi proposed an adaptive traffic signal control system trained with function-approximation reinforcement learning. In the process, they developed a dynamic and random traffic simulation method, at the cost of a slight increase in the delay of left-turning vehicles [8]. Louati et al. used an artificial immune network algorithm to optimize intersection control after traffic disturbance events [9]. Based on data-driven intelligent transportation systems (D2ITSs), Zhang et al. designed the MA-DD-DACC method, which uses real-time queue-length data to balance the queuing strength (the number of queuing vehicles divided by the length of the link) among multiple signal phases, and proved by Lyapunov stability analysis that the balance error of the queuing strength remains bounded. Most of the time, however, the queues in each direction do not saturate the road and lane space, so balancing the number of queuing vehicles is of greater significance [10].
The change in traffic conditions on a local road network can be regarded as uncertain competition in a game, so it is appropriate to use the Nash equilibrium to improve existing intersection signal control strategies. Abdelghaffar et al. developed a Nash bargaining game-theoretic framework in which each signal phase acts as a player competing for the green light, realizing phase-free and cycle-free distributed real-time signal control. However, the algorithm is not driver-friendly and is poorly suited to vehicle-to-infrastructure cooperative control [11]. Liu et al. designed a multi-agent coordination framework for traffic signal control based on deep reinforcement learning, which corrects actions using a spatial differential coordination method, and proved that the model can converge to the Nash equilibrium [12]. Cournot and Stackelberg game models have been used in previous studies to balance the flow at a single intersection and in road networks [13,14,15,16]. A cooperative game has been used to optimize the distribution of vehicles among multiple intersections [17]. Tan et al. used a two-layer cooperative game model to control a single intersection, but this method is only applicable to two-phase intersections [18].
In this paper, game theory is introduced into intersection signal control to coordinate and optimize the control. A single intersection is the research object, the store-and-forward method is used for modeling, and a payoff index based on the equilibrium of the number of queued vehicles is proposed. The game strategy is designed so that the Nash equilibrium reached by the game is the queuing payoff equilibrium of the key signal phases, which is then used to optimize the green time.
2. Queuing-Length Model
In this section, the store-and-forward model of the intersection is presented to prepare for the design of a control strategy based on the equilibrium of the number of queued vehicles. The store-and-forward model of traffic networks was first proposed in 1963 and is still widely used in traffic signal control [19,20]. The mathematical queuing-length model of each direction at an intersection with a variable cycle is as follows:
$$q_{i,j}(k+1) = q_{i,j}(k) + \lambda_{i,j}(k)\,C(k) - s_{i,j}\,g_i(k)$$
where $i \in \{1, 2, 3, 4\}$ indexes the phases of the intersection (there are four phases); $j \in \{1, 2, 3, 4\}$ indexes the directions of the intersection (there are four directions); $k$ indexes the cycles of the intersection signal control; $q_{i,j}(k)$ represents the in-queue vehicle number (pcu) of the $i$th phase in the $j$th direction during the $k$th cycle, whose length is $C(k)$ (s); $\lambda_{i,j}(k)$ represents the vehicle arrival rate (pcu/s) of the intersection in phase $i$, direction $j$, and cycle $k$; $s_{i,j}$ means the saturation flow rate (pcu/s) of phase $i$ in direction $j$; and $g_i(k)$ indicates the effective green time (s) of the $i$th phase in the $k$th cycle.
The maximum queuing length of a phase at a single intersection is defined as the maximum of the queuing lengths of the directions served by that phase:
$$q_i(k+1) = \max\{\,q_{i,j_1}(k+1),\ q_{i,j_2}(k+1),\ 0\,\}$$
where directions $j_1$ and $j_2$ belong to phase $i$. The third term, $0$, prevents the queue length from becoming negative during the calculation, which would be inconsistent with the actual situation.
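The per-direction store-and-forward update and the per-phase maximum described above can be sketched in Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names and the numeric values are hypothetical, and the update is assumed to be the previous queue plus the arrivals over the cycle minus the vehicles discharged during the effective green.

```python
def update_direction_queue(queue, arrival_rate, cycle, sat_flow, green):
    """One store-and-forward step for a single direction.

    queue: vehicles in queue at the start of the cycle (pcu)
    arrival_rate: arrivals (pcu/s); cycle: cycle length (s)
    sat_flow: saturation flow (pcu/s); green: effective green time (s)
    The result may be negative; the floor at zero is applied per phase.
    """
    return queue + arrival_rate * cycle - sat_flow * green


def phase_queue(direction_queues):
    """Longest queue among the directions of one phase, floored at zero."""
    return max(max(direction_queues), 0.0)


# Hypothetical example: two directions served by the same phase,
# a 100 s cycle and 30 s of effective green.
q_east = update_direction_queue(10.0, 0.15, 100.0, 0.5, 30.0)
q_west = update_direction_queue(4.0, 0.10, 100.0, 0.5, 30.0)
print(q_east, q_west, phase_queue([q_east, q_west]))
```

In this example the second direction would end below zero, so only the first direction determines the phase queue; if both directions were cleared, the floor would return a queue of zero.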
The mathematical queuing-length model of the intersection with a variable cycle is as follows:
$$q_i(k+1) = q_i(k) + \lambda_i(k)\,C(k) - s_i\,g_i(k)$$
where $q_i(k)$ represents the in-queue vehicle number (pcu) of the $i$th phase during the $k$th cycle; $\lambda_i(k)$ means the vehicle arrival rate (pcu/s) in the $k$th cycle of the longest-queue direction in phase $i$ of the intersection; and $s_i$ indicates the saturation flow rate (pcu/s) of the longest-queue direction in phase $i$ of the intersection.
The green time of each phase and the signal cycle at the intersection should satisfy the following constraint:
$$C(k) = \sum_{i=1}^{4} g_i(k) + L$$
where $L$ indicates the lost time (s).
To avoid green times that are too long or too short, maximum and minimum green times need to be set. Therefore, the constraints on the green time and the signal cycle in the $i$th phase and $k$th cycle of the intersection are defined as:
The queue length model with a green time constraint at a single intersection is considered as follows:
3. The Equilibrium Control
In intersection signal control, the allocation of green time among the phases can be regarded as a game in which each phase acts as a participant and competes for green time within a signal cycle. Suppose the queue length at the beginning of the following cycle is the payoff indicator: the longer the queue, the smaller the payoff. Each participant (phase) then seeks as much green time as possible to maximize its own payoff.
To facilitate calculation, Equation (7) and green time are split (
), and the final payoff function
is:
Equation (8) indicates that each participant (signal phase) competes for green time to clear the queue in its phase; a payoff of 0 means that the queue is cleared just in time. Equation (9) suggests that each participant is assigned a fixed green time that can clear all the vehicles arriving in the current signal cycle.
Since all the variables except
in Equation (9) are known, it can be set that:
Substituting Equation (10) into (6) gives:
According to the cake-cutting game, the Nash equilibrium is reached when the participants’ payoffs are equal. For example, if two participants split a cake, with one cutting it and the other choosing a piece first, the cutter will divide the cake equally to prevent the other player from taking the larger piece. The two players then reach the Nash equilibrium with equal payoffs. According to the above analysis, when the intersection is a two-phase game, there are:
Suppose
and
, combine Equations (8) and (12) to have:
At this time, the length $C(k)$ (s) of the $k$th cycle of the intersection signal control is just enough to clear the intersection. It can also be seen from Equation (13) that the cycle is the sum of the time needed in each phase to clear the original queue and the vehicles arriving during the current cycle, plus the lost time. Substitute Equations (9) and (10) into (13):
In the actual calculation, the queue may not be cleared, in which case the computed cycle is too long; a maximum cycle should therefore be set according to practical experience. The cycle finally substituted into the algorithm is the smaller of the computed cycle and this maximum cycle.
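The idea of a cycle that is just long enough to clear every phase, capped by a maximum cycle, can be sketched in Python. This is a hedged illustration and not a transcription of the paper's equations: it assumes each phase needs a green time of (queue + arrival rate × cycle) / saturation flow, and that the cycle equals the sum of these green times plus the lost time, which gives a closed form for the cycle. All names and numbers are hypothetical.

```python
def clearing_cycle(queues, arrival_rates, sat_flows, lost_time, max_cycle):
    """Cycle length (s) that just clears all phases, capped at max_cycle.

    Solves C = lost_time + sum_i (q_i + lam_i * C) / s_i for C.
    If the total flow ratio sum_i lam_i / s_i reaches 1, no finite cycle
    clears the queues, so the maximum cycle is returned.
    """
    flow_ratio = sum(lam / s for lam, s in zip(arrival_rates, sat_flows))
    if flow_ratio >= 1.0:
        return max_cycle
    backlog_time = sum(q / s for q, s in zip(queues, sat_flows))
    return min((lost_time + backlog_time) / (1.0 - flow_ratio), max_cycle)


# Hypothetical two-phase example.
cycle = clearing_cycle(
    queues=[10.0, 5.0],          # pcu
    arrival_rates=[0.10, 0.05],  # pcu/s
    sat_flows=[0.5, 0.5],        # pcu/s
    lost_time=10.0,              # s
    max_cycle=180.0,             # s
)
print(round(cycle, 2))
```

When the demand is close to saturation, the closed form grows without bound, which is why the cap on the cycle is needed in practice.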
When the intersection is a four-phase game:
Other phases’ green times can be obtained similarly. Resembling the two-phase, let
, where
, and the same two-phase case
can be obtained as follows:
As in Section 2, to avoid green times that are too long or too short, the green time calculated by the algorithm in this paper is subject to the maximum and minimum green-time constraints shown in Equation (5), and the corresponding signal cycle is then updated according to Equation (6).
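The equal-payoff idea, followed by the green-time bounds and the cycle update, can be sketched in Python. This is one possible reading under stated assumptions rather than the paper's Equations (12) to (16): for a given cycle, the green times are chosen so that every phase is left with the same residual queue, each green time is then clamped to hypothetical minimum and maximum values, and the cycle is recomputed as the sum of the green times plus the lost time. All names and numbers are illustrative.

```python
def equal_queue_greens(queues, arrival_rates, sat_flows, cycle, lost_time,
                       g_min, g_max):
    """Split the available green so all phases end with the same queue.

    Returns (bounded_greens, updated_cycle). A negative common level means
    that every queue could be cleared with green time to spare.
    """
    # Vehicles each phase must serve: existing queue plus arrivals in the cycle.
    demand = [q + lam * cycle for q, lam in zip(queues, arrival_rates)]
    total_green = cycle - lost_time
    # Common residual queue that makes the green times sum to total_green.
    level = (sum(d / s for d, s in zip(demand, sat_flows)) - total_green) \
        / sum(1.0 / s for s in sat_flows)
    greens = [(d - level) / s for d, s in zip(demand, sat_flows)]
    # Apply the green-time bounds, then update the cycle accordingly.
    bounded = [min(max(g, g_min), g_max) for g in greens]
    return bounded, sum(bounded) + lost_time


# Hypothetical two-phase example with an 80 s cycle and 10 s lost time.
greens, new_cycle = equal_queue_greens(
    queues=[20.0, 10.0], arrival_rates=[0.2, 0.1], sat_flows=[0.5, 0.5],
    cycle=80.0, lost_time=10.0, g_min=10.0, g_max=60.0,
)
print(greens, new_cycle)
```

With these numbers both phases are left with the same residual queue, and the bounds are inactive, so the cycle is unchanged; raising the minimum green above the smaller green time would lengthen the updated cycle.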
4. Simulation Verification
To verify the feasibility of the proposed algorithm, this paper implements the queuing equilibrium control algorithm based on game theory in Matlab and compares it with an algorithm from the literature. Two groups of experiments are carried out: (1) using the queuing equilibrium game control algorithm of this paper; (2) using the MA-DD-DACC queuing strength balance control algorithm. As shown in Figure 1 and Figure 2, MA-DD-DACC balances the queuing strength (the number of queuing vehicles divided by the length of the link) while clearing the intersection, which takes 651 s. However, most of the time the queue in each direction does not fill the lane, so the experience of each vehicle in the queue is of greater significance. The algorithm in this paper takes the number of queued vehicles as the index: the phases compete with each other for green time and finally clear the entire intersection, which takes 639 s. The experimental parameters are shown in Table 1, where the flow rate of each phase follows a cosine function that is shifted successively from phase to phase.
We use the traffic simulation software VISSIM to carry out the experiments. The intersection of Nanhuan Road and Yanping Road in Changping District, Beijing, is selected as the simulation intersection, and the simulated period is 10 a.m. to 11 a.m. (an off-peak period). The field traffic data from the intersection are used as the vehicle inputs and route ratios in VISSIM, as shown in Table 2. The vehicle composition is similar in all directions of the intersection, as shown in Table 3. The original signal timing and the magnetic detector data of the intersection were obtained from the traffic management department. The original intersection timing plan is shown in Table 4. The magnetic detectors are located 120 m from the stop line in the east–west direction and 50 m from the stop line in the north–south direction. The east–west direction is the main road, so east–west traffic is given higher priority than north–south traffic in the conflict zones.
Model calibration is an important step in VISSIM simulation. The car-following model in this study is Wiedemann-74, and the calibration parameters include the average standstill distance (AvgSD), the additive part of safety distance (AddSD), the multiplicative part of safety distance (MulSD), the number of look-ahead observed vehicles (LaVeh), and the maximum look-ahead distance (MaxLD). The lane-change model is free lane selection, and the calibration parameters include the minimum headway (MinHF), the waiting time before diffusion (WaiBD), the maximum deceleration (MaxDC), and the maximum deceleration for cooperative braking (MaxDB). The detectors are placed in VISSIM at the positions of the actual magnetic detectors. The calibration parameters are optimized using a genetic algorithm (GA) with the goal of best matching the number of vehicles passing through each direction of the intersection. The objective function is the mean absolute percentage error (MAPE) of the flow:
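$$\mathrm{MAPE} = \frac{1}{4}\sum_{j=1}^{4}\left|\frac{f_j^{\mathrm{real}} - f_j^{\mathrm{sim}}}{f_j^{\mathrm{real}}}\right| \times 100\%$$
where $j \in \{1, 2, 3, 4\}$ indexes the four directions of the intersection, $f_j^{\mathrm{real}}$ indicates the flow measured by the actual magnetic detector, and $f_j^{\mathrm{sim}}$ represents the flow measured by the VISSIM detector. Finally, the calibrated VISSIM model parameters are obtained, as shown in Table 5. The other VISSIM parameters keep their default values. After calibration, the VISSIM detector data and the magnetic detector data are shown in Figure 3, and the MAPE is 4.7% (the random seed is 42).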
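The calibration objective described above, the mean absolute percentage error between field and simulated flows, can be sketched in Python. The flow values below are hypothetical and are not the data of this study; the function name is likewise illustrative.

```python
def mape(observed, simulated):
    """Mean absolute percentage error (%) between two equal-length sequences.

    observed: field detector flows (must be non-zero)
    simulated: simulated detector flows for the same directions
    """
    if len(observed) != len(simulated):
        raise ValueError("observed and simulated must have the same length")
    relative_errors = [abs(o - s) / o for o, s in zip(observed, simulated)]
    return 100.0 * sum(relative_errors) / len(relative_errors)


# Hypothetical hourly flows (veh/h) for four directions.
field_flows = [400.0, 500.0, 200.0, 250.0]
sim_flows = [380.0, 525.0, 210.0, 240.0]
print(round(mape(field_flows, sim_flows), 2))
```

In a calibration loop, an optimizer such as a genetic algorithm would propose parameter sets, run the simulation for each, and keep the set with the lowest value of this objective.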
The VISSIM-VB-MATLAB platform is used to simulate and compare the two algorithms. The VISSIM evaluation data are shown in Figure 4 and Figure 5. The algorithm proposed in this paper yields more balanced queuing in all directions, and the average delay and the number of stops per vehicle are also more balanced. The standard deviation is used here to express the degree of balance. The standard deviations of the average delay and of the number of stops per vehicle over all directions are 2.3 s and 0.18, respectively, for the proposed algorithm, whereas the corresponding standard deviations for MA-DD-DACC are 6.2 s and 0.56. The average delay and the number of stops in the four directions are also summed and compared. The total average delay and total number of stops are 50.2 s and 2.74, respectively, for the proposed algorithm, and 51.8 s and 2.93 for MA-DD-DACC. These results show that the queuing equilibrium game algorithm in this paper can reduce the delay and the number of stops at the intersection and improve its operating efficiency.
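The use of the standard deviation as a balance measure can be illustrated with a short Python sketch. The per-direction delays below are hypothetical and are not the values behind Figure 4 and Figure 5; the population standard deviation is assumed here, since the text does not state which variant was used.

```python
import statistics


def balance_spread(values):
    """Population standard deviation; smaller means more balanced directions."""
    return statistics.pstdev(values)


# Hypothetical average delays (s) per direction for two control schemes.
delays_balanced = [10.0, 12.0, 14.0, 16.0]
delays_unbalanced = [5.0, 10.0, 15.0, 22.0]

print(balance_spread(delays_balanced), balance_spread(delays_unbalanced))
```

The two hypothetical schemes have the same total delay, yet the first has the smaller spread, which is the sense in which one scheme can be called more balanced than another.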
5. Conclusions
We propose a queuing equilibrium control algorithm based on game theory to help urban traffic controllers deal with control problems in different traffic environments. The VISSIM simulation results show that the algorithm balances the queues at the intersection and improves its operating efficiency. Although this is only a preliminary exploration, it suggests that game theory can be applied well to traffic signal control. At present, the algorithm is only used at a single intersection. Next, we will combine game theory with multi-agent methods at multiple intersections to further improve the operating environment of urban road traffic and the efficiency of traffic controllers.
Author Contributions
Conceptualization, L.W. and K.P.; methodology, K.P.; software, K.P.; validation, K.P. and Q.Z.; formal analysis, K.P.; resources, L.W.; data curation, L.Z. (Lingyu Zhang); writing—original draft preparation, K.P.; writing—review and editing, Q.Z.; visualization, L.Z. (Lili Zhang); supervision, L.Z. (Lili Zhang); project administration, L.W. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by Beijing Natural Science Foundation, grant number 4214070 and Beijing Municipal Science and Technology Project, grant number Z221100008122006.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
The authors confirm that the data supporting the findings of this study are available within the article.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Webster, F.V. Traffic Signal Settings; Road Research Technical Paper 39; Her Majesty’s Stationery Office: London, UK, 1958.
- Sims, A.G.; Dobinson, K.W. The Sydney Coordinated Adaptive Traffic (SCAT) system philosophy and benefits. IEEE Trans. Veh. Technol. 1980, 29, 130–137.
- Robertson, D.I.; Bretherton, R.D. Optimizing networks of traffic signals in real time - The SCOOT method. IEEE Trans. Veh. Technol. 1991, 40, 11–15.
- Bretherton, D. Current developments in SCOOT: Version 3. Transp. Res. Rec. J. Transp. Res. Board 1996, 1554, 48–52.
- Bretherton, D.; Wood, K.; Raha, N. Traffic Monitoring and Congestion Management in the SCOOT Urban Traffic Control System. Transp. Res. Rec. 1998, 1634, 118–122.
- Choy, M.C.; Srinivasan, D.; Cheu, R.L. Neural networks for continuous online learning and control. IEEE Trans. Neural Netw. 2006, 17, 1511–1531.
- Chen, Y.; Yao, J.; He, C.; Chen, H.; Jin, H. Adaptive Traffic Signal Control with Network-Wide Coordination. In International Conference on Algorithms and Architectures for Parallel Processing; Springer International Publishing: Berlin/Heidelberg, Germany, 2017; pp. 180–194.
- Genders, W.; Razavi, S. Asynchronous n-step Q-learning adaptive traffic signal control. J. Intell. Transp. Syst. 2019, 23, 319–331.
- Louati, A.; Darmoul, S.; Elkosantini, S.; ben Said, L. An artificial immune network to control interrupted flow at a signalized intersection. Inf. Sci. Int. J. 2018, 433–434, 70–95.
- Zhang, H.; Liu, X.; Ji, H.; Hou, Z.; Fan, L. Multi-Agent-Based Data-Driven Distributed Adaptive Cooperative Control in Urban Traffic Signal Timing. Energies 2019, 12, 1402–1421.
- Abdelghaffar, H.M.; Yang, H.; Rakha, H.A. Developing a De-centralized Cycle-free Nash Bargaining Arterial Traffic Signal Controller. In Proceedings of the IEEE International Conference on Models and Technologies for Intelligent Transportation Systems, Naples, Italy, 26–28 June 2017; IEEE: New York, NY, USA, 2017; pp. 544–549.
- Liu, J.; Zhang, H.; Fu, Z.; Wang, Y. Learning Scalable Multi-Agent Coordination by Spatial Differential for Traffic Signal Control. September 2020. Available online: https://arxiv.org/abs/2002.11874v2 (accessed on 16 September 2020).
- Chen, O.; Ben-Akiva, M. Game-Theoretic Formulations of Interaction Between Dynamic Traffic Control and Dynamic Traffic Assignment. Transp. Res. Rec. 1998, 1617, 179–188.
- Alvarez, I.; Poznyak, A. Game theory applied to urban traffic control problem. In Proceedings of the 2010 International Conference on Control, Automation and Systems (ICCAS 2010), Gyeonggi-do, Republic of Korea, 27–30 October 2010.
- Ukkusuri, S.; Doan, K.; Aziz, H.A. A Bi-level Formulation for the Combined Dynamic Equilibrium based Traffic Signal Control. Procedia Soc. Behav. Sci. 2013, 80, 729–752.
- Bui, K.-H.N.; Jung, J.E.; Camacho, D. Game theoretic approach on Real-time decision making for IoT-based traffic light control. Concurr. Comput. Pract. Exp. 2017, 29, e4077.
- Bui, K.-H.N.; Jung, J.J. Cooperative game-theoretic approach to traffic flow optimization for multiple intersections. Comput. Electr. Eng. 2017, 63, 1–13.
- Tan, L.; Zhao, X.; Hu, D.; Shang, Y.; Ren, W. A Study of Single Intersection Traffic Signal Control Based on Two-player Cooperation Game Model. In Proceedings of the 2010 WASE International Conference on Information Engineering, Beidai, China, 14–15 August 2010.
- Gazis, D.C.; Potts, R.B. Development of traffic control and queue management procedures for oversaturated arterials. Transp. Res. Rec. 1997, 1603, 119–127.
- Aboudolas, K.; Papageorgiou, M.; Kosmatopoulos, E. Store-and-forward based methods for the signal control problem in large-scale congested urban road networks. Transp. Res. Part C Emerg. Technol. 2009, 17, 163–174.
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).