Typical Power Grid Operation Mode Generation Based on Reinforcement Learning and Deep Belief Network
Abstract
:1. Introduction
- A power grid operation mode set extraction method based on Q-learning and DBSCAN clustering is proposed. Based on several typical operation modes used in previous years, the load level is expanded by 5% year-on-year in order to simulate the growth trend of load, and through the reinforcement learning algorithm Q-learning, the operation modes of different optimal combinations of generators located in different regions are intelligently generated. Then, based on the data characteristics of each operation mode, the DBSCAN clustering algorithm is applied to divide the operation modes into different clusters to extract operation mode sets. It should be noted that in this paper, operation modes are generated without considering power grid structure changes.
- A rational evaluation index system for the operation modes collection is established. The key indexes to meet the demand of actual work procedures are selected from three perspectives: steady state, transient state, and economy mode. Index calculation can replace the system operator calculation process. At the same time, in order to facilitate the system operator to compare each mode, and the application of the analytical hierarchy process–entropy weight method (AHP-EWM) for the fusion of the weights of the multi-dimensional indexes, the results of the operation of the multi-indicator mode are synthesized into a single composite value, which effectively reduces the workload of the system operator.
- An evaluation model of operation mode sets is established based on DBN. This paper proposes a fast evaluation method of operation modes based on DBN, which no longer needs to calculate each index of operation modes, but constructs the correlation relationship between the feature data of operation modes and the composite values through a neural network, to quickly and accurately obtain the composite values of operation modes. In this way, the system operator can select the highest value of each operation mode set according to the comprehensive value, and then obtain the typical operation mode.
2. Methods
2.1. Reinforcement-Learning-Based Power Grid Operation Mode Generation Model
2.1.1. Actual Engineering Needs
2.1.2. Computational Process Modeling in Operation Modes
2.2. Feature Extraction for Power Grid Operation Modes
2.2.1. Data Preprocessing
- Center all operation modes vectors:
- Calculate the covariance matrix to obtain the eigenvalue and the corresponding eigenvectors , where ;
- Take the first K eigenvectors to form a new matrix :
- Perform a linear transformation to obtain operation mode data :
2.2.2. Operation Mode Clustering
- Calculate the distance from each data point to the k-th nearest neighbor, denoted as ;
- Incrementally sort and display the resulting k-distance sequence , with data points on the horizontal axis and the incremental k-distance sequence on the vertical axis;
- Determine the location of the inflection point in the graph; the y-value of the inflection point is .
- Arbitrarily select an operation mode as the current point from the operation mode data and create a new cluster C for . The cluster count is initialized to 1.
- Find all the operation modes in the neighborhood of the current operation mode . If the number of all operation modes in the neighborhood is less than , mark the current operation mode as noise; otherwise, mark the current operation mode as the core point of cluster C.
- Traverse each operation mode (new current point) in the neighborhood and repeat step 2 until no new operation mode that can be marked as belonging to the current cluster C is found.
- Choose the subsequent unlabeled operation mode of the operation mode data as the current point and increase the cluster count by 1.
- Repeat steps 2–4 until all the operation modes in are labeled and the clustering results obtained are the different operation modes.
2.3. Operation Mode Evaluation System and Index Calculation
2.3.1. Indexes for the Comprehensive Evaluation of Operation Mode
- (a)
- In the N − 1 verification inspection of the whole power grid, the percentage of times that the power flow converges after removing any line and transformer in the system, where the voltage and frequency are not out of the limits, is defined as the grid-wide N − 1 pass rate:
- (b)
- The transmission section safety margin is a key parameter for inter-regional power supply, where is the transmission power of the selected section in the current operation mode and is the power limit of the selected section:
- (c)
- The voltage pass rate can reflect the voltage pass level of the nodes and show the voltage quality of the current operation mode. The ratio of the number of nodes with a qualified voltage to the total number of nodes in the whole grid is defined as the voltage qualification rate:
- (d)
- Power angle stability refers to the power angle swing of the generator caused by an expected accident and its severity. It is calculated by taking the maximum power angle offset between generators:
- (e)
- Frequency stability refers to the degree of generator frequency shift caused by large disturbances due to expected accidents. It is calculated by taking the maximum frequency shift in the line :
- (f)
- The total power generation of the power grid minus the total load of the power grid is defined as the total network loss of the power grid. The ratio of total network loss to total power generation defines the network loss rate:
2.3.2. Model for Calculating Index Weights
- Step 1.
- Normalize the indexes using the ideal point approximation method to convert the values of the indexes into the (0, 100) interval. The formula is as follows:
- Step 2.
- Construct a judgment matrix . In this paper, we use a nine-level numerical scale layer to illustrate the relationship between indexes:
- Step 3.
- Compute the concatenated product of the elements of each row, take the nth root and the eigenvectors of the matrix , normalize them, and check for consistency.
- Step 4.
- Calculate the objective weights and combine with the method used in [33] to obtain the following formula:
- Step 5.
- Combining the strengths of subjective prior knowledge and objective data analysis, calculate the combined weights using the following formula:
2.4. Rapid Operation Mode Evaluation Model
- Step 1.
- Generation of operation modes. According to the historical operation data of the power grid, operation mode generation is carried out according to the reinforcement learning algorithm proposed in this paper. Through data preprocessing methods such as standardized processing, feature extraction, and feature dimension reduction, the operation mode feature dataset for clustering and DBN training is obtained.
- Step 2.
- For the operation mode characterization data and DBSCAN clustering algorithm, the operation modes are divided into different operation mode sets.
- Step 3.
- According to the steady-state indexes, transient state indexes, and economy indexes, operation mode evaluation is carried out, and then the respective weights from these three perspectives are calculated using the AHP-EWM. The multi-dimensional index values and weights are then integrated into a comprehensive value.
- Step 4.
- All the operation modes are randomly divided into the training set and test set, where the ratio of training set to test set is 8:2, and the model is trained according to the forward unsupervised pre-training and reverse supervised parameter fine-tuning method.
- Step 5.
- After the training is completed, the test set feature data are input into the DBN model. The evaluation value of the operation mode is then obtained to obtain the rapid operation mode evaluation model based on DBN.
3. Results
3.1. Intelligent Generation of Operation Modes
3.2. Distribution of Operation Mode Sets
3.3. Indexes and Analysis of Comprehensive Operation Mode Evaluation
3.4. Rapid Operation Mode Evaluation Model
4. Discussion
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Iqbal, S.; Habib, S.; Khan, N.H.; Ali, M.; Aurangzeb, M.; Ahmed, E.M. Electric Vehicles Aggregation for Frequency Control of Microgrid under Various Operation Conditions Using an Optimal Coordinated Strategy. Sustainability 2022, 14, 3108. [Google Scholar] [CrossRef]
- Lan, J.; Guo, Q.; Zhou, Y.; Sun, H. Generation of Power System Typical Operation Mode Samples: A Generation Adversarial Network and Model-based Transfer Learning Approach. Proc. CSEE 2022, 42, 2889–2900. [Google Scholar]
- Iqbal, S.; Xin, A.; Jan, M.U.; Abdelbaky, M.A.; Rehman, H.U.; Salman, S.; Aurangzeb, M. Aggregation of EVs for primary frequency control of an industrial microgrid by implementing grid regulation & charger controller. IEEE Access 2020, 8, 141977–141989. [Google Scholar]
- Li, C.; Wu, Z.; Dong, L.; Liu, M.; Xiao, Y. Fast Generation of Power System Operation Modes Based on Optimal Power Flow. In Proceedings of the 2021 11th International Conference on Power and Energy Systems (ICPES), Shanghai, China, 18–20 December 2021; pp. 767–771. [Google Scholar]
- Xu, H.; Yu, Z.; Zheng, Q.; Hou, J.; Wei, Y. Improved deep reinforcement learning based convergence adjustment method for power flow calculation. In Proceedings of the 16th IET International Conference on AC and DC Power Transmission (ACDC 2020), Online Conference, 2–3 July 2020; pp. 1898–1903. [Google Scholar]
- Li, R.Y.; Peng, H.M.; Li, R.G.; Zhao, K. Overview on Algorithms and Applications for Reinforcement Learning. Comput. Syst. Appl. 2020, 29, 13–25. [Google Scholar]
- Jang, B.; Kim, M.; Harerimana, G.; Kim, J.W. Q-learning algorithms: A comprehensive classification and applications. IEEE Access 2019, 7, 133653–133667. [Google Scholar] [CrossRef]
- Feng, C.; Sun, M.; Zhang, J. Reinforced deterministic and probabilistic load forecasting via Q-learning dynamic model selection. IEEE Trans. Smart Grid 2019, 11, 1377–1386. [Google Scholar] [CrossRef]
- Shin, K.S.; Choi, H.H.; Lee, H. Knowledge Transfer-Based Multiagent Q-Learning for Medium Access in Dense Cellular Networks. IEEE Wirel. Commun. Lett. 2022, 11, 2542–2545. [Google Scholar] [CrossRef]
- Xi, B.; Lei, D. Q-learning-based teaching-learning optimization for distributed two-stage hybrid flow shop scheduling with fuzzy processing time. Complex Syst. Model. Simul. 2022, 2, 113–129. [Google Scholar] [CrossRef]
- Kushwaha, A.; Gopal, M.; Singh, B. Q-learning based maximum power extraction for wind energy conversion system with variable wind speed. IEEE Trans. Energy Convers. 2020, 35, 1160–1170. [Google Scholar] [CrossRef]
- Cao, Y.; Zhao, C.; Li, D. Carbon Management for Intelligent Community with Combined Heat and Power Systems. Sustainability 2023, 15, 13257. [Google Scholar] [CrossRef]
- Xiang, W.; Zhao, J.J.; Gu, Z.G.; Zhang, Q.R.; Fang, J.L.; Li, S.Y. A Dynamic Model Based Method for Evaluating the Operation State of Provincial Power Grid Company. In Proceedings of the 2020 8th International Conference on Power Electronics Systems and Applications (PESA), Hong Kong, China, 7–10 December 2020; pp. 1–5. [Google Scholar]
- Li, H.; Wen, C.; Gu, R.; Zhou, Y. Research on intelligent decision evaluation method for substation based on fuzzy analysis and decision theory. In Proceedings of the 2016 International Conference on Fuzzy Theory and Its Applications (iFuzzy), Taichung, Taiwan, 9–11 November 2016; pp. 1–6. [Google Scholar]
- Liu, Y.; Ouyang, Z.; Yi, H.; Qin, H. Study of the Multilevel Fuzzy Comprehensive Evaluation of Rock Burst Risk. Sustainability 2023, 15, 13176. [Google Scholar] [CrossRef]
- Wang, J.; Hao, Y.; Yang, C. A comprehensive prediction model for VHF radio wave propagation by integrating entropy weight theory and machine learning methods. IEEE Trans. Antennas Propag. 2023, 71, 6249–6254. [Google Scholar] [CrossRef]
- Zhuo, Y.; Cao, Y.; Chen, L.; Nie, J.; Huang, Y.; Liang, Y. Research on evaluation model based on analytic hierarchy process and entropy weight method for smart grid. In Proceedings of the 2022 5th International Conference on Energy, Electrical and Power Engineering (CEEPE), Chongqing, China, 22–24 April 2022; pp. 729–734. [Google Scholar]
- Zhou, K.; Li, Z.; Gong, W.; Zhao, S.; Wen, C.; Song, Y. Influence of magnetic field generated by air core reactors in SVC-based substation and an optimal suppression method based on fuzzy comprehensive evaluation. IEEE Trans. Electromagn. Compat. 2019, 62, 1961–1970. [Google Scholar] [CrossRef]
- Lo, H.W.; Hsu, C.C.; Huang, C.N.; Liou, J.J. An ITARA-TOPSIS based integrated assessment model to identify potential product and system risks. Mathematics 2021, 9, 239. [Google Scholar] [CrossRef]
- Wang, T.; Du, Z.A.; Zhang, K.; Chen, K.; Xiao, F.; Ye, P. Reliability evaluation of high voltage direct current transmission protection system based on interval analytic hierarchy process and interval entropy method mixed weighting. Energy Rep. 2021, 7, 90–99. [Google Scholar] [CrossRef]
- Iqbal, S.; Habib, S.; Ali, M.; Shafiq, A.; ur Rehman, A.; Ahmed, E.M.; Khurshaid, T.; Kamel, S. The Impact of V2G Charging/Discharging Strategy on the Microgrid Environment Considering Stochastic Methods. Sustainability 2022, 14, 13211. [Google Scholar] [CrossRef]
- Tran, S.N.; Garcez, A.S.D.A. Deep logic networks: Inserting and extracting knowledge from deep belief networks. IEEE Trans. Neural Netw. Learn. Syst. 2016, 29, 246–258. [Google Scholar] [CrossRef] [PubMed]
- Kong, X.; Li, C.; Zheng, F.; Wang, C. Improved deep belief network for short-term load forecasting considering demand-side management. IEEE Trans. Power Syst. 2019, 35, 1531–1538. [Google Scholar] [CrossRef]
- Tao, C.; Wang, X.; Gao, F.; Wang, M. Fault diagnosis of photovoltaic array based on deep belief network optimized by genetic algorithm. Chin. J. Electr. Eng. 2020, 6, 106–114. [Google Scholar] [CrossRef]
- Zheng, L.; Hu, W.; Zhou, Y.; Min, Y.; Xu, X.; Wang, C.; Yu, R. Deep belief network based nonlinear representation learning for transient stability assessment. In Proceedings of the 2017 IEEE Power & Energy Society General Meeting, Chicago, IL, USA, 16–20 July 2017; pp. 1–5. [Google Scholar]
- Su, T.; Liu, Y.; Zhao, J.; Liu, J. Deep belief network enabled surrogate modeling for fast preventive control of power system transient stability. IEEE Trans. Ind. Inform. 2021, 18, 315–326. [Google Scholar] [CrossRef]
- Li, B.; Wu, J. Adaptive assessment of power system transient stability based on active transfer learning with deep belief network. IEEE Trans. Autom. Sci. Eng. 2022, 20, 1047–1058. [Google Scholar] [CrossRef]
- Zhang, B.; Xu, X.; Xing, H.; Li, Y. A deep learning based framework for power demand forecasting with deep belief networks. In Proceedings of the 2017 18th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT), Taipei, Taiwan, 18–20 December 2017; pp. 191–195. [Google Scholar]
- Wu, W.; Yang, D.; Yu, Z.; Zhou, B.; Lv, C.; Gu, J. Determination of key transmission section and strong correlation section based on matrix aggregation algorithm. Int. J. Electr. Power Energy Syst. 2023, 153, 109387. [Google Scholar] [CrossRef]
- Wu, S.; Hu, W.; Lu, Z.; Gu, Y.; Tian, B.; Li, H. Power system flow adjustment and sample generation based on deep reinforcement learning. J. Mod. Power Syst. Clean Energy 2020, 8, 1115–1127. [Google Scholar] [CrossRef]
- Chen, B.; Tao, C.; Tao, J.; Jiang, Y.; Li, P. Bearing Fault Diagnosis Using ACWGAN-GP Enhanced by Principal Component Analysis. Sustainability 2023, 15, 7836. [Google Scholar] [CrossRef]
- Lin, Y.; Giacoumidis, E.; O’Duill, S.; Barry, L.P. DBSCAN-based clustering for nonlinearity induced penalty reduction in wavelength conversion systems. IEEE Photonics Technol. Lett. 2019, 31, 1709–1712. [Google Scholar] [CrossRef]
- Xue, T.; Chang, Z.; Zhang, N. Comprehensive evaluation of slope stability using entropy-weight fuzzy hierarchy analysis. J. Heilongjiang Inst. Sci. Technol. 2011, 21, 454–457. [Google Scholar]
- Aslam, N.; Xia, K.; Hadi, M.U. Optimal wireless charging inclusive of intellectual routing based on SARSA learning in renewable wireless sensor networks. IEEE Sens. J. 2019, 19, 8340–8351. [Google Scholar] [CrossRef]
- Bajpai, D.; He, L. Evaluating KNN performance on WESAD dataset. In Proceedings of the 2020 12th International Conference on Computational Intelligence and Communication Networks (CICN), Bhimtal, India, 25–26 September 2020; pp. 60–62. [Google Scholar]
- Hanshuo, M.; Xiaodong, Z.; Xuan, T.; Fei, Q. Research on fault prediction method of electronic equipment based on improved SVR algorithm. In Proceedings of the 2020 Chinese Automation Congress (CAC), Shanghai, China, 6–8 November 2020; pp. 3092–3096. [Google Scholar]
- Sheng, C.; Yu, H. An optimized prediction algorithm based on XGBoost. In Proceedings of the 2022 International Conference on Networking and Network Applications (NaNA), Urumqi, China, 3–5 December 2022; pp. 1–6. [Google Scholar]
Weight | Steady-State Security Index | Transient State Security Index | Economy Index |
---|---|---|---|
Subjective weight | 0.2973 | 0.5389 | 0.1638 |
Objective weight | 0.4234 | 0.3219 | 0.2547 |
Comprehensive weight | 0.3691 | 0.5086 | 0.1223 |
Mode Number | Steady-State Security Index | Transient State Security Index | Economy Index | Total Value |
---|---|---|---|---|
Basic mode | 32.8604 | 46.5159 | 11.6732 | 91.0495 |
Mode 1 | 30.9461 | 45.9558 | 10.9218 | 87.8237 |
Mode 2 | 30.8144 | 43.7940 | 11.5482 | 86.1566 |
Mode 3 | 31.7483 | 46.6853 | 10.6337 | 89.0673 |
Mode 4 | 30.1596 | 44.6487 | 10.9485 | 85.7568 |
Mode Number | Steady-State Security Index | Economy Index | Total Value |
---|---|---|---|
Basic mode | 68.3560 | 22.1629 | 90.5189 |
Mode 1 | 64.3739 | 20.7362 | 85.1101 |
Mode 2 | 64.1000 | 21.9255 | 86.0255 |
Mode 3 | 66.0426 | 20.1893 | 86.2319 |
Mode 4 | 62.7379 | 20.7869 | 83.5248 |
Algorithm | RMSE | MAPE |
---|---|---|
DBN | 0.506 | 0.458 |
KNN | 0.684 | 0.619 |
SVR | 0.824 | 0.726 |
XGBoost | 1.354 | 1.068 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wang, Z.; Zhou, B.; Lv, C.; Yang, H.; Ma, Q.; Yang, Z.; Cui, Y. Typical Power Grid Operation Mode Generation Based on Reinforcement Learning and Deep Belief Network. Sustainability 2023, 15, 14844. https://doi.org/10.3390/su152014844
Wang Z, Zhou B, Lv C, Yang H, Ma Q, Yang Z, Cui Y. Typical Power Grid Operation Mode Generation Based on Reinforcement Learning and Deep Belief Network. Sustainability. 2023; 15(20):14844. https://doi.org/10.3390/su152014844
Chicago/Turabian StyleWang, Zirui, Bowen Zhou, Chen Lv, Hongming Yang, Quan Ma, Zhao Yang, and Yong Cui. 2023. "Typical Power Grid Operation Mode Generation Based on Reinforcement Learning and Deep Belief Network" Sustainability 15, no. 20: 14844. https://doi.org/10.3390/su152014844
APA StyleWang, Z., Zhou, B., Lv, C., Yang, H., Ma, Q., Yang, Z., & Cui, Y. (2023). Typical Power Grid Operation Mode Generation Based on Reinforcement Learning and Deep Belief Network. Sustainability, 15(20), 14844. https://doi.org/10.3390/su152014844