An Intersection-Based Routing Scheme Using Q-Learning in Vehicular Ad Hoc Networks for Traffic Management in the Intelligent Transportation System
Abstract
:1. Introduction
- IRQ presents a dissemination mechanism of traffic status information to provide both global and local views in the network. The purpose of this mechanism is to update traffic information constantly and inform the network server relative to traffic status in the network at any moment. According to this mechanism, beacon messages are periodically disseminated on the network and sent to the central server by roadside units (RSUs). This information is stored in the central server. IRQ utilizes this information in the global view-based routing process and the local view-based routing process.
- In IRQ, the central server executes a global view-based routing algorithm to calculate various paths between different intersections based on the traffic status information. In this process, the agent (central server) trains a Q-table based on traffic status information such as node density, connection time, and delay in road segments. In this method, Q-value indicates how much is each intersection suitable for sending data packets to the destination. Furthermore, the central server constantly evaluates the created paths to penalize paths with high congestion and improves the packet delivery rate properly.
- In IRQ, each vehicle uses a local view-based routing algorithm to find the best route in any road segment. This method applies a greedy routing technique to reach the destination.
2. Related Works
3. Basic Concepts
4. Network Model
- Central server: This server uses traffic status messages to achieve a global view of the whole network. This robust and high-energy node plays the agent role in the global view-based routing process to discover the VANET environment. The central server learns the best routing strategy between intersections by interacting with the network environment using the Q-learning algorithm. Finally, this node sends the Q-table to RSUs in the network.
- Roadside units (RSUs): These entities are directly connected to the central server and are at intersections. These nodes are responsible for monitoring the network, sending traffic status messages to the central server, and controlling congestion on each road segment. Furthermore, RSUs are connected to the central server and periodically send traffic messages to it. In addition, each RSU is responsible for finding the best route from Q-table stored in its memory and sending data packets to the next intersection based on this route.
- Vehicles: These entities periodically exchange beacon messages between themselves. The information obtained from this message is stored in the neighborhood table of each vehicle. Moreover, each vehicle is equipped with a positioning system to obtain its spatial and speed information at any moment. In each road segment, vehicles utilize a local view-based routing method to communicate with other vehicles in a multi-hop manner.
5. Proposed Method
- Dissemination mechanism of traffic status information;
- Global view-based routing algorithm;
- Local view-based routing algorithm.
5.1. Dissemination Mechanism of Traffic Information
5.1.1. Calculating the Connection Time of Two Vehicles
5.1.2. Calculating the Delay between Two Vehicles
5.2. Dissemination of Traffic State Information
- ID of RSU: This field represents the identification of RSU, which transmits the traffic information packet.
- Intersection ID: This field indicates the identification of the intersection having the transmitter RSU.
- Time to live: This field is adjusted based on the validity time of the traffic message (i.e., 5 s). After ending this period, the traffic information message is invalid and removed from the network.
- Road ID: Each intersection is connected to the four road segments, including the upper road, the down road, the left road, and the right road. This field indicates the identification of the corresponding road segment based on the traffic table.
- Vehicle density: This field is equal to the number of vehicles in a road segment (i.e., up, down, left, and right), which is inserted into the traffic table.
- Average connection time: This field represents the average connection time related to each road segment, which is obtained from Equation (7) and recorded in the traffic table.
- Average road delay: This field indicates the average single-hop delay corresponding to each road segment. It is calculated according to Equation (8) and stored in the traffic table.
Algorithm 1 Dissemination of traffic status information |
End |
5.3. Global View-Based Routing Algorithm
Algorithm 2 Global view based-routing process |
End |
5.4. Local View-Based Routing Algorithm
- V2V routing: In a road segment, each vehicle uses the V2V routing process for choosing the next-hop node. When the source node produces the data packet, it also achieves the position of the destination vehicle. If the two vehicles are in a similar road segment, then the destination node is regarded as . Otherwise, the source intersection is considered as . Then, the V2V routing process is executed based on the position of . Accordingly, the source vehicle selects the closest vehicle to as the next-hop node and sends the data packets to it. If there is no neighboring node closer to compared to the current node, the local optimal issue occurs. In this case, the current vehicle calculates a score for its neighbors using Equation (15) and sends the data packet to a node with the maximum score.
Algorithm 3 Local view based-routing process (V2V) |
End |
- V2I routing strategy: A data packet reaches the next intersection using V2V greedy forwarding strategy. Then, RSU at the intersection selects the next intersection using Q-table obtained from the Q-learning-based routing process described in Section 5.3 and sends the data packet to the corresponding road segment using the V2I greedy forwarding strategy. In this process, if the destination node is in this road segment, it is considered as , and RSU sends the packet to the closest node to the destination vehicle. Otherwise, RSU sends the packet to the closest node to the next intersection, which is considered as . If there is no vehicle for sending the packet to , RSU carries this packet until it finds a suitable next-hop node. The pseudo-code of this process is described in Algorithm 4.
Algorithm 4 Local view based-routing process (V2I) |
End |
6. Simulation and Evaluation of Results
6.1. Packet Delivery Rate
6.2. End-to-End Delay
6.3. Hop Count
6.4. Communication Overhead
7. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Al-Shareeda, M.A.; Anbar, M.; Hasbullah, I.H.; Manickam, S. Survey of authentication and privacy schemes in vehicular ad hoc networks. IEEE Sens. J. 2020, 21, 2422–2433. [Google Scholar] [CrossRef]
- Rashid, S.; Khan, M.A.; Saeed, A.; Hamza, C.M. A Survey on Prediction based Routing for Vehicular Ad-hoc Networks. In Proceedings of the IEEE 2021 International Congress of Advanced Technology and Engineering (ICOTEN), Online, 4–5 July 2021; pp. 1–8. [Google Scholar] [CrossRef]
- Ameur, A.I.; Lakas, A.; Bachir, Y.M.; Oubbati, O.S. Peer-to-peer overlay techniques for vehicular ad hoc networks: Survey and challenges. Veh. Commun. 2022, 34, 100455. [Google Scholar] [CrossRef]
- Xia, Z.; Wu, J.; Wu, L.; Chen, Y.; Yang, J.; Yu, P.S. A comprehensive survey of the key technologies and challenges surrounding vehicular ad hoc networks. Acm Trans. Intell. Syst. Technol. (TIST) 2021, 12, 1–30. [Google Scholar] [CrossRef]
- Shahwani, H.; Shah, S.A.; Ashraf, M.; Akram, M.; Jeong, J.P.; Shin, J. A comprehensive survey on data dissemination in Vehicular Ad Hoc Networks. Veh. Commun. 2021, 34, 100420. [Google Scholar] [CrossRef]
- Gaurav, A.; Gupta, B.B.; Peñalvo, F.J.G.; Nedjah, N.; Psannis, K. Ddos attack detection in vehicular ad-hoc network (vanet) for 5g networks. In Security and Privacy Preserving for IoT and 5G Networks; Springer: Cham, Switzerland, 2022; pp. 263–278. [Google Scholar] [CrossRef]
- Piper, J.; Rodger, J.A. Longitudinal Study of a Website for Assessing American Presidential Candidates and Decision Making of Potential Election Irregularities Detection. Int. J. Semant. Web Inf. Syst. (IJSWIS) 2022, 18, 1–20. [Google Scholar] [CrossRef]
- Yang, H.; Vijayakumar, P.; Shen, J.; Gupta, B.B. A location-based privacy-preserving oblivious sharing scheme for indoor navigation. Future Gener. Comput. Syst. 2022, 137, 42–52. [Google Scholar] [CrossRef]
- Jeong, H.; Lee, S.W.; Hussain Malik, M.; Yousefpoor, E.; Yousefpoor, M.S.; Ahmed, O.H.; Hosseinzadeh, M.; Mosavi, A. SecAODV: A secure healthcare routing scheme based on hybrid cryptography in wireless body sensor networks. Front. Med. 2022, 9, 829055. [Google Scholar] [CrossRef] [PubMed]
- Rahmani, A.M.; Ali, S.; Yousefpoor, E.; Yousefpoor, M.S.; Javaheri, D.; Lalbakhsh, P.; Ahmed, O.H.; Hosseinzadeh, M.; Lee, S.W. OLSR+: A new routing method based on fuzzy logic in flying ad-hoc networks (FANETs). Veh. Commun. 2022, 36, 100489. [Google Scholar] [CrossRef]
- Abdel-Halim, I.T.; Fahmy, H.M.A. Prediction-based protocols for vehicular Ad Hoc Networks: Survey and taxonomy. Comput. Netw. 2018, 130, 34–50. [Google Scholar] [CrossRef]
- Grover, J. Security of Vehicular Ad Hoc Networks using blockchain: A comprehensive review. Veh. Commun. 2022, 34, 100458. [Google Scholar] [CrossRef]
- Yousefpoor, M.S.; Yousefpoor, E.; Barati, H.; Barati, A.; Movaghar, A.; Hosseinzadeh, M. Secure data aggregation methods and countermeasures against various attacks in wireless sensor networks: A comprehensive review. J. Netw. Comput. Appl. 2021, 190, 103118. [Google Scholar] [CrossRef]
- Tripp-Barba, C.; Zaldívar-Colado, A.; Urquiza-Aguiar, L.; Aguilar-Calderón, J.A. Survey on routing protocols for vehicular ad hoc networks based on multimetrics. Electronics 2019, 8, 1177. [Google Scholar] [CrossRef] [Green Version]
- Senouci, O.; Harous, S.; Aliouat, Z. Survey on vehicular ad hoc networks clustering algorithms: Overview, taxonomy, challenges, and open research issues. Int. J. Commun. Syst. 2020, 33, e4402. [Google Scholar] [CrossRef]
- Ardakani, S.P.; Kwong, C.F.; Kar, P.; Liu, Q.; Li, L. CNN: A Cluster-Based Named Data Routing for Vehicular Networks. IEEE Access 2021, 9, 159036–159047. [Google Scholar] [CrossRef]
- Nazib, R.A.; Moh, S. Routing protocols for unmanned aerial vehicle-aided vehicular ad hoc networks: A survey. IEEE Access 2020, 8, 77535–77560. [Google Scholar] [CrossRef]
- Aggarwal, A.; Gaba, S.; Nagpal, S.; Vig, B. Bio-Inspired Routing in VANET. In Cloud and IoT-Based Vehicular Ad Hoc Networks; Wiley: Hoboken, NJ, USA, 2021; pp. 199–220. [Google Scholar] [CrossRef]
- Ramamoorthy, R.; Thangavelu, M. An enhanced distance and residual energy-based congestion aware ant colony optimization routing for vehicular ad hoc networks. Int. J. Commun. Syst. 2022, 35, e5179. [Google Scholar] [CrossRef]
- Liu, J.; Weng, H.; Ge, Y.; Li, S.; Cui, X. A Self-Healing Routing Strategy based on Ant Colony Optimization for Vehicular Ad Hoc Networks. IEEE Internet Things J. 2022. [Google Scholar] [CrossRef]
- Nazib, R.A.; Moh, S. Reinforcement learning-based routing protocols for vehicular ad hoc networks: A comparative survey. IEEE Access 2021, 9, 27552–27587. [Google Scholar] [CrossRef]
- Mchergui, A.; Moulahi, T.; Zeadally, S. Survey on artificial intelligence (AI) techniques for vehicular ad-hoc networks (VANETs). Veh. Commun. 2021, 34, 100403. [Google Scholar] [CrossRef]
- Vijayakumar, P.; Rajkumar, S.C. Deep Reinforcement Learning-Based Pedestrian and Independent Vehicle Safety Fortification Using Intelligent Perception. Int. J. Softw. Sci. Comput. Intell. (IJSSCI) 2022, 14, 1–33. [Google Scholar] [CrossRef]
- Rahmani, A.M.; Ali, S.; Yousefpoor, M.S.; Yousefpoor, E.; Naqvi, R.A.; Siddique, K.; Hosseinzadeh, M. An area coverage scheme based on fuzzy logic and shuffled frog-leaping algorithm (sfla) in heterogeneous wireless sensor networks. Mathematics 2021, 9, 2251. [Google Scholar] [CrossRef]
- Lee, S.W.; Ali, S.; Yousefpoor, M.S.; Yousefpoor, E.; Lalbakhsh, P.; Javaheri, D.; Rahmani, A.M.; Hosseinzadeh, M. An energy-aware and predictive fuzzy logic-based routing scheme in flying ad hoc networks (fanets). IEEE Access 2021, 9, 129977–130005. [Google Scholar] [CrossRef]
- Rahmani, A.M.; Ali, S.; Malik, M.H.; Yousefpoor, E.; Yousefpoor, M.S.; Mousavi, A.; Hosseinzadeh, M. An energy-aware and Q-learning-based area coverage for oil pipeline monitoring systems using sensors and Internet of Things. Sci. Rep. 2022, 12, 9638. [Google Scholar] [CrossRef]
- Ji, X.; Xu, W.; Zhang, C.; Yun, T.; Zhang, G.; Wang, X.; Wang, Y.; Liu, B. Keep forwarding path freshest in VANET via applying reinforcement learning. In Proceedings of the 2019 IEEE First International Workshop on Network Meets Intelligent Computations (NMIC), Dallas, TX, USA, 7–9 July 2019; pp. 13–18. [Google Scholar] [CrossRef]
- Saravanan, M.; Ganeshkumar, P. Routing using reinforcement learning in vehicular ad hoc networks. Comput. Intell. 2020, 36, 682–697. [Google Scholar] [CrossRef]
- Wu, J.; Fang, M.; Li, H.; Li, X. RSU-assisted traffic-aware routing based on reinforcement learning for urban vanets. IEEE Access 2020, 8, 5733–5748. [Google Scholar] [CrossRef]
- Yang, X.; Zhang, W.; Lu, H.; Zhao, L. V2V routing in VANET based on heuristic Q-learning. Int. J. Comput. Commun. Control. 2020, 15, 1–17. [Google Scholar] [CrossRef]
- Wu, C.; Yoshinaga, T.; Bayar, D.; Ji, Y. Learning for adaptive anycast in vehicular delay tolerant networks. J. Ambient. Intell. Humaniz. Comput. 2019, 10, 1379–1388. [Google Scholar] [CrossRef]
- Karp, B.; Kung, H.T. GPSR: Greedy perimeter stateless routing for wireless networks. In Proceedings of the 6th Annual International Conference on Mobile Computing and Networking, Boston, MA, USA, 6–11 August 2000; pp. 243–254. [Google Scholar] [CrossRef]
- Li, F.; Song, X.; Chen, H.; Li, X.; Wang, Y. Hierarchical routing for vehicular ad hoc networks via reinforcement learning. IEEE Trans. Veh. Technol. 2018, 68, 1852–1865. [Google Scholar] [CrossRef]
- Luo, L.; Sheng, L.; Yu, H.; Sun, G. Intersection-based V2X routing via reinforcement learning in vehicular Ad Hoc networks. IEEE Trans. Intell. Transp. Syst. 2021, 23, 5446–5459. [Google Scholar] [CrossRef]
- Padakandla, S. A survey of reinforcement learning algorithms for dynamically varying environments. Acm Comput. Surv. (CSUR) 2021, 54, 1–25. [Google Scholar] [CrossRef]
- Qiang, W.; Zhongli, Z. Reinforcement learning model, algorithms and its application. In Proceedings of the IEEE 2011 International Conference on Mechatronic Science, Electric Engineering and Computer (MEC), Jilin, China, 19–22 August 2011; pp. 1143–1146. [Google Scholar] [CrossRef]
- Rezwan, S.; Choi, W. A survey on applications of reinforcement learning in flying ad-hoc networks. Electronics 2021, 10, 449. [Google Scholar] [CrossRef]
- Al-Rawi, H.A.; Ng, M.A.; Yau, K.L.A. Application of reinforcement learning to routing in distributed wireless networks: A review. Artif. Intell. Rev. 2015, 43, 381–416. [Google Scholar] [CrossRef]
- Rahmani, A.M.; Yousefpoor, E.; Yousefpoor, M.S.; Mehmood, Z.; Haider, A.; Hosseinzadeh, M.; Ali Naqvi, R. Machine learning (ML) in medicine: Review, applications, and challenges. Mathematics 2021, 9, 2970. [Google Scholar] [CrossRef]
- Issariyakul, T.; Hossain, E. Introduction to network simulator 2 (NS2). In Introduction to Network Simulator NS2; Springer: Boston, MA, USA, 2009; pp. 1–18. [Google Scholar] [CrossRef]
Scheme | Advantages | Disadvantages |
---|---|---|
RHR [27] | Decreasing broadcast storms, using an adaptive broadcast technique by predicting the position and movement of vehicles, suitable for rural areas | Not describing how to select a fixed number of neighbors, not considering a recovery process in sparse network conditions |
VRDRT [28] | Improving the performance of the routing process by DRL, predicting road traffic conditions, reducing delay in the data transmission process | Not considering a suitable method for calculating the density of vehicles on the road, depending significantly on RSUs, not suitable for highways and urban areas |
QTAR [29] | Using Q-learning to improve PDR and throughput, presenting a traffic-aware routing method, high reliability, reducing end-to-end delay | Implementation capability only for urban areas, not estimating the movement direction of vehicles on the roads |
HQVR [30] | Determining the learning rate based on the link quality, increasing packet delivery rate, reducing the effect of the node mobility on the convergence speed of Q-learning algorithm, low dependence on infrastructure (RSUs) | High dependence of Q-learning algorithm to beacon messages, slow convergence speed of the learning algorithm, applying an exploration technique based on a specific probability |
QVDRP [31] | High delay-tolerant, increasing packet delivery rate, reducing the number of duplicated control messages, considering relative velocity of vehicles | Slow convergence speed of the learning algorithm |
GPSR [32] | Reducing routing overhead, reducing delay in the network | Not considering parameters such as speed, movement direction, and link lifetime in the routing process |
QGrid [33] | Reducing the number of states in Q-learning algorithm, appropriate convergence speed, reducing communication overhead, determining the discount factor based on vehicle density | Designing an off-line routing, not designing a congestion control mechanism in the network, fixing Q-table during the simulation process, not considering the effect of intersections and buildings on the transmission quality in each grid, not considering parameters such as speed, movement direction, and link lifetime in the routing process |
IV2XQ [34] | Determining the discount factor based on the density and distance of vehicles on the road, reducing communication overhead, designing a congestion control mechanism, appropriate convergence speed, reducing the number of states in the Q-learning algorithm | Not considering parameters such as speed, movement direction, and link lifetime in the routing process, not relying on new traffic information in the network |
Vehicle ID | Road ID | Spatial Coordinates | Velocity | Connection Time | Delay | Validity Time |
---|---|---|---|---|---|---|
Road ID | Vehicle Density | Average Connection Time | Average Delay | Validity Time |
---|---|---|---|---|
The number of vehicles on the upper road | Average connection time at the upper road | Average delay at the upper road | ||
The number of vehicles on the down road | Average connection time at the down road | Average delay at the down road | ||
The number of vehicles on the left road | Average connection time at the left road | Average delay at the left road | ||
The number of vehicles on the right road | Average connection time at the right road | Average delay at the right road |
ID of RSU | ID of Intersection | Time to Live | |
---|---|---|---|
The vehicle density on the upper road | Average connection time at the upper road | Average delay at the upper road | |
The vehicle density on the down road | Average connection time at the down road | Average delay at the down road | |
The vehicle density on the left road | Average connection time at the left road | Average delay at the left road | |
The vehicle density on the right road | Average connection time at the right road | Average delay at the right road |
Parameter | Value |
---|---|
Simulator | NS2 |
Simulation environment (km2) | |
Simulation time (second) | 1000 |
Total number of vehicles | 450 |
Number of road segments | 38 |
Number of intersections | 24 |
Vehicle density (vehicles/m) | 0.005–0.02 |
The velocity of vehicles (m/s) | 14 |
Transmission radius of vehicles (m) | 250–300 |
Transmission radius of RSUs (m) | 300 |
Packet size (byte) | 512 |
Packet sending rate (packets/s) | 1–6 |
Beacon broadcast interval (second) | 1 |
Traffic broadcast interval (second) | 5 |
Learning rate () | 0.1 |
Probability of | 0.2 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Khan, M.U.; Hosseinzadeh, M.; Mosavi, A. An Intersection-Based Routing Scheme Using Q-Learning in Vehicular Ad Hoc Networks for Traffic Management in the Intelligent Transportation System. Mathematics 2022, 10, 3731. https://doi.org/10.3390/math10203731
Khan MU, Hosseinzadeh M, Mosavi A. An Intersection-Based Routing Scheme Using Q-Learning in Vehicular Ad Hoc Networks for Traffic Management in the Intelligent Transportation System. Mathematics. 2022; 10(20):3731. https://doi.org/10.3390/math10203731
Chicago/Turabian StyleKhan, Muhammad Umair, Mehdi Hosseinzadeh, and Amir Mosavi. 2022. "An Intersection-Based Routing Scheme Using Q-Learning in Vehicular Ad Hoc Networks for Traffic Management in the Intelligent Transportation System" Mathematics 10, no. 20: 3731. https://doi.org/10.3390/math10203731
APA StyleKhan, M. U., Hosseinzadeh, M., & Mosavi, A. (2022). An Intersection-Based Routing Scheme Using Q-Learning in Vehicular Ad Hoc Networks for Traffic Management in the Intelligent Transportation System. Mathematics, 10(20), 3731. https://doi.org/10.3390/math10203731