Reinforcement-Learning-Based Software-Defined Edge Task Allocation Algorithm
Abstract
:1. Introduction
- (1)
- The synchronisation of the state information between edge servers is addressed through the east–west architecture of SDNs.
- (2)
- A software-defined edge-computing architecture is proposed. By fusing the control layer with the edge layer, global information about edge servers, network states and tasks can be obtained, thus enabling multiple edge servers to perform tasks together in an edge environment.
- (3)
- A reinforcement-learning-based edge task allocation algorithm in software-defined IoT is proposed, which can effectively reduce the cost of finding the optimal edge server, lower the task completion time and reduce energy consumption. We conduct extensive experiments to evaluate the performance of the scheme. The experimental results show that the algorithm can reduce task completion time as well as energy consumption compared to random and uniform task computation offloading.
2. Related Work
2.1. Distributed Controller Architecture
2.2. Allocation of Edge Tasks
3. Software-Defined Edge-Computing Architecture
3.1. Network Global View
3.2. Global View Information Exchange Mechanism
4. Optimal Edge Task Allocation Algorithm
4.1. Problem Definition
- Distance factor: Distance here refers to the transmission distance between the wireless sensor to be assigned the task and the edge server to be selected. Minimising the distance reduces the transmission delay of the task and the energy consumption of the sensor’s emission.
- CPU processing frequency of the edge server: The higher the operating frequency of the processing unit on the edge server, the less time a task of the same size will take to compute on the edge server and the faster it will be computed. Denoting the operating frequency of the processing unit on the edge server by , the execution time of the task on the edge server is defined as follows:
- Remaining computing resources of the edge server: The edge servers to be selected that are rich in remaining computational resources are also prioritised as the optimal edge servers to facilitate the allocation of more task-dependent subsequent subtasks. The remaining computational resources can be measured by the average system load, which can be calculated based on the average number of processes in the running queue during a given time interval. We used the following formula to define the remaining computational resources of an edge server:
4.2. Energy Consumption Model
4.3. Task Allocation Issues
4.3.1. State Space S
4.3.2. Action Space A
4.3.3. Reward R
4.3.4. Environment E
Algorithm 1 Optimal value find algorithm. |
Input: 1: Give calculation amount of task 2: Give global view information 3: Give weight before algorithm execution Output: 4: Optimal value selection 5: Initialize random data 6: Calculate the Q-value of the edge server in the global view by Equation (9) 7: while state S != ’terminal’ do 8: Select an action A from the state table corresponding to state S 9: With probability , select a random action A 10: Otherwise, select 11: Execute action A and generate a new state and a reward R 12: Adjustment parameter 13: if != ’terminal’ then 14: Calculate by R, according to Equation (18) 15: else 16: Assign the reward R to 17: end if 18: Update the state table by parameter according to Equation (20) 19: Update new state to state S 20: end while 21: return state table, |
Algorithm 2 RL-SDETA algorithm |
Input: 1: Give computing tasks 2: Give global view information Output: 3: Optimal edge server selection 4: Define initial 5: Convert global view information to array form 6: Call Algorithm 1 with global view information in the form of index groups 7: Calculate the Q-value of the edge server to be selected according to Equation (9) 8: Sort the Q-values of edge servers 9: Select the best edge server by the Q-value 10: return Optimal server selection schemes |
5. Simulation Experiment
5.1. Experimental Setup
5.2. Performance Evaluation
5.2.1. Contrast Programme
5.2.2. Impact of Calculation Volume on Task Calculation Time
5.2.3. Impact of Data Volume on Task Completion Time
5.2.4. Impact of Data Volume on Energy Consumption in Wireless Sensing Networks
5.2.5. Impact of The Number of Edge Servers on Task Processing
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
IoT | Internet of things |
MEC | Multiaccess edge computing |
SDN | Software-defined network |
SDEC | Software-defined edge computing |
RL-SDETA | Reinforcement-learning-based software-defined edge task allocation algorithm |
OXP | Open exchange protocol |
CIDC | Communication interface for distributed control |
WECAN | West–east control association network |
QoS | Quality of Service |
References
- Premsankar, G.; Di Francesco, M.; Taleb, T. Edge computing for the Internet of Things: A case study. IEEE Internet Things J. 2018, 5, 1275–1284. [Google Scholar] [CrossRef]
- Hu, F.; Hao, Q.; Bao, K. A survey on software-defined network and openflow: From concept to implementation. IEEE Commun. Surv. Tutorials 2014, 16, 2181–2206. [Google Scholar] [CrossRef]
- Ghaffar, Z.; Alshahrani, A.; Fayaz, M.; Alghamdi, A.M.; Gwak, J. A Topical Review on Machine Learning, Software Defined Networking, Internet of Things Applications: Research Limitations and Challenges. Electronics 2021, 10, 880. [Google Scholar] [CrossRef]
- Geng, Y.; Yang, Y.; Cao, G. Energy-efficient computation offloading for multicore-based mobile devices. In Proceedings of the 2018-IEEE Conference on Computer Communications, Honolulu, HI, USA, 16–19 April 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 46–54. [Google Scholar] [CrossRef]
- Hu, P.; Chen, W.; He, C.; Li, Y.; Ning, H. Software-defined edge computing (SDEC): Principle, open IoT system architecture, applications, and challenges. IEEE Internet Things J. 2019, 7, 5934–5945. [Google Scholar] [CrossRef]
- Feng, W.; Liu, C.; Cheng, B.; Chen, J. Secure and cost-effective controller deployment in multi-domain SDN with Baguette. J. NETW. Comput. Appl. 2021, 178, 102969. [Google Scholar] [CrossRef]
- Hu, T.; Yi, P.; Zhang, J.; Lan, J. Reliable and load balance-aware multi-controller deployment in SDN. China Commun. 2021, 15, 184–198. [Google Scholar] [CrossRef]
- Ahmad, S.; Mir, A.H. Scalability, consistency, reliability and security in SDN controllers: A survey of diverse SDN controllers. J. Netw. Syst. Manag. 2021, 29, 1–59. [Google Scholar] [CrossRef]
- Contreras, L.M.; Solano, A.; Cano, F.; Folgueira, J. Efficiency Gains due to Network Function Sharing in CDN-as-a-Service Slicing Scenarios. In Proceedings of the IEEE 7th International Conference on Network Softwarization, Tokyo, Japan, 28 June–2 July 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 348–356. [Google Scholar] [CrossRef]
- Lin, P.; Bi, J.; Wang, Y. East-West Bridge for SDN Network Peering. In Communications in Computer and Information Science; Springer: Berlin/Heidelberg, Germany, 2013; pp. 170–181. [Google Scholar] [CrossRef]
- Yang, F.; Cheng, L.I.; Huang, T. OXP: An efficient west-east protocol for SDN in Ad hoc. Telecom Eng. Tech. Stand. 2016, 9, 1–6. [Google Scholar] [CrossRef]
- Benamrane, F.; Benaini, R. An East-West interface for distributed SDN control plane: Implementation and evaluation. Comput. Electr. Eng. 2017, 57, 162–175. [Google Scholar] [CrossRef]
- Yu, H.; Qi, H.; Li, K. WECAN: An Efficient west-east control associated network for large-scale SDN systems. Mobile Netw. Appl. 2020, 25, 114–124. [Google Scholar] [CrossRef]
- Wu, D.; Nie, X.; Asmare, E.; Arkhipov, D.I.; Qin, Z.; Li, R.; McCann, J.A.; Li, K. Towards distributed SDN: Mobility management and flow scheduling in software defined urban IoT. IEEE Trans. Parall. Distr. Syst. 2020, 31, 1400–1418. [Google Scholar] [CrossRef]
- Wang, Z.; Cai, Y. Quality of service (QoS) control in mobile edge computing (MEC). IEEE Wirel. Commun. Mob. Comput. 2022, 12, 7291954. [Google Scholar] [CrossRef]
- Lim, Y. Federated Deep Reinforcement Learning Based Task Offloading with Power Control in Vehicular Edge Computing. Sensors 2022, 22, 9212. [Google Scholar] [CrossRef] [PubMed]
- Masip-Bruin, X.; Marin-Tordera, E.; Juan-Ferrer, A.; Queralt, A.; Jukan, A.; Garcia, J.; Lezzi, D.; Jensen, J.; Cordeiro, C.; Leckey, A.; et al. mF2C: Towards a coordinated management of the IoT-fog-cloud continuum. In Proceedings of the 4th ACM MobiHoc Workshop on Experiences with the Design and Implementation of Smart Objects, Los Angeles, CA, USA, 25 June 2018; ACM: New York, NY, USA, 2018; pp. 1–8. [Google Scholar] [CrossRef]
- Ramirez, W.; Masip-Bruin, X.; Marin-Tordera, E.; Souza, V.B.C.; Jukan, A.; Ren, G.-J.; Gonzalez de Dios, O. Evaluating the benefits of combined and continuous Fog-to-Cloud architectures. Comput. Commun. 2017, 113, 43–52. [Google Scholar] [CrossRef]
- Wu, B.; Zeng, J.; Ge, L.; Su, X.; Tang, Y. Energy-latency aware offloading for hierarchical mobile edge computing. IEEE Access 2019, 7, 121982–121997. [Google Scholar] [CrossRef]
- Chen, S.; Li, Q.; Zhou, M.; Abusorrah, A. Recent advances in collaborative scheduling of computing tasks in an edge computing paradigm. Sensors 2021, 21, 779. [Google Scholar] [CrossRef]
- Wang, B.; Wang, C.; Huang, W.; Song, Y.; Qin, X. A survey and taxonomy on task offloading for edge-cloud computing. IEEE Access 2020, 8, 186080–186101. [Google Scholar] [CrossRef]
- Guo, X.; Lin, H.; Li, Z.; Peng, M. Deep-reinforcement-learning-based QoS-aware secure routing for SDN-IoT. IEEE Internet Things J. 2019, 7, 6242–6251. [Google Scholar] [CrossRef]
- Rivera, A.V.; Refaey, A.; Hossain, E. A blockchain framework for secure task sharing in multi-access edge computing. IEEE Netw. 2020, 35, 176–183. [Google Scholar] [CrossRef]
- Ranji, R.; Mansoor, A.M.; Sani, A.A. EEDOS: An energy-efficient and delay-aware offloading scheme based on device to device collaboration in mobile edge computing. Telecommun. Syst. 2020, 73, 171–182. [Google Scholar] [CrossRef]
- Sellami, B.; Hakiri, A.; Yahia, S.B.; Berthou, P. Deep Reinforcement Learning for Energy-Efficient Task Scheduling in SDN-based IoT Network. In Proceedings of the IEEE 19th International Symposium on Network Computing and Applications, Cambridge, MA, USA, 24–27 November 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 1–4. [Google Scholar] [CrossRef]
- Zhang, H.; Wang, Z.; Liu, K. V2X offloading and resource allocation in SDN-assisted MEC-based vehicular networks. China Commun. 2020, 17, 266–283. [Google Scholar] [CrossRef]
- Zhou, X.; Hu, J.; Liang, M.; Liu, Y. An Efficient Computation Offloading Strategy in Wireless Powered Mobile-Edge Computing Networks. In Proceedings of the International Conference on Algorithms and Architectures for Parallel Processing, Virtual Event, 3–5 December 2021; Springer: Cham, Swtizerland, 2022; pp. 334–344. [Google Scholar] [CrossRef]
- Cho, Y.H.; Byun, W.J. Generalized Friis transmission equation for orbital angular momentum radios. IEEE Trans. Antenn. Propag. 2019, 67, 2423–2429. [Google Scholar] [CrossRef]
- Sonmez, C.; Ozgovde, A.; Ersoy, C. Edgecloudsim: An environment for performance evaluation of edge computing systems. Trans. Emerg. Telecommun. Technol. 2018, 29, e3493. [Google Scholar] [CrossRef]
- Lent, R. A generalized reinforcement learning scheme for random neural networks. Neural Comput. Appl. 2019, 31, 2699–2716. [Google Scholar] [CrossRef]
Key | Column |
---|---|
Controller_ID | IP address, Port number, System version, Edge Server_ID, Supplier name, Device type, Device function |
Link_ID | Source Controller_ID, Destination Controller_ID, Source Port_ID, Destination Port_ID, Is_Link_Active, Bandwidth |
Port_ID | Controller_ID, Port_MAC, Is_Active, Throughput |
Edge Server_ID | Controller_ID, CPU model, CPU frequency, Memory type, Memory capacity, Remaining computing resources |
Controller_Capability | Protocol name, version |
Reachability | Edge Server IP prefixes, Length |
Link_Utilities | Link_ID, Link utilities |
Message Type | Function |
---|---|
HELLO | The first message sent after the TCP connection is established |
KEEPALIVE | Send at regular intervals to confirm the connection |
VIEW-REQUEST | Request network global view |
UPDATE | Global view update information sent to other controllers |
ERROR | Report problems with itself or adjacent controllers to all other controllers |
State | Action | |||||
---|---|---|---|---|---|---|
… | ||||||
Initial state | 2.5 | 1 | 2 | 0.2 | 0.3 | … |
State | 2.5 | 0.4 | 2.2 | 0.1 | 1.1 | … |
State | 1.1 | 2.5 | 0.5 | 2.6 | 1.7 | … |
… | … | … | … | … | … | … |
Parameter | Value |
---|---|
Number of edge servers | 10 |
Number of tasks | 500 |
Transmission bandwidth | 20 MHz |
Greed | 0.9 |
Learning efficiency | 0.8 |
Attenuation degree | 0.9 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhang, T.; Zhu, X.; Wu, C. Reinforcement-Learning-Based Software-Defined Edge Task Allocation Algorithm. Electronics 2023, 12, 773. https://doi.org/10.3390/electronics12030773
Zhang T, Zhu X, Wu C. Reinforcement-Learning-Based Software-Defined Edge Task Allocation Algorithm. Electronics. 2023; 12(3):773. https://doi.org/10.3390/electronics12030773
Chicago/Turabian StyleZhang, Tianhao, Xiaojuan Zhu, and Cai Wu. 2023. "Reinforcement-Learning-Based Software-Defined Edge Task Allocation Algorithm" Electronics 12, no. 3: 773. https://doi.org/10.3390/electronics12030773
APA StyleZhang, T., Zhu, X., & Wu, C. (2023). Reinforcement-Learning-Based Software-Defined Edge Task Allocation Algorithm. Electronics, 12(3), 773. https://doi.org/10.3390/electronics12030773