Defect Texts Mining of Secondary Device in Smart Substation with GloVe and Attention-Based Bidirectional LSTM
Abstract
:1. Introduction
- o
- Considering the incompleteness of the existing word segmentation thesaurus, a professional dictionary in the field of secondary equipment is constructed to achieve the efficient and accurate word segmentation of defect texts, which effectively ensures the integrity of the semantic information of the text in the word segmentation stage.
- o
- In order to solve the problems of low training efficiency and an easy loss of semantic information of word2vec, FastText and other models, the GloVe model based on global corpus statistics is used to vectorize defect texts. It takes global information into full consideration and ensures the integrity of semantic information.
- o
- The attention mechanism is innovatively introduced into the field of power equipment defect text mining, and a defect text classification model based on BiLSTM-Attention is proposed, which improves the ability of feature extraction and the mining of text information. The classification accuracy of the model was up to 94.9%, which is 5%~—37% higher than the traditional classification models such as TextCNN and Decision Tree.
2. Defect Texts of Secondary Device
2.1. Description and Grading of Defect Texts
2.2. Natural Language Characteristic of Defect Texts
- (1)
- The content of defect texts is unstructured Chinese short text, which is mixed with many numbers and symbols;
- (2)
- The text involves the field of electric power, including a large number of secondary device professional vocabulary;
- (3)
- The content of the text record is slightly different, and the length of a text varies from a few words to a hundred words;
- (4)
- The content and format of records vary from person to person, and different operation and maintenance personnel have different descriptions of the same phenomenon.
3. Text Representation
3.1. Cleaning and Pretreatment
3.2. Texts Segmentation
3.3. Text Representation Based on GloVe
3.3.1. Construct Co-Occurrence Matrix
3.3.2. Word Vector Training
4. Text Classification Model
4.1. Network Architecture of LSTM
4.2. Extracting Semantic Information Based on BiLSTM
4.3. Model Optimization Method Based on Attention Mechanism
5. Case Study and Analysis
5.1. Preformance Comparison of Word Vector Representation
5.2. Preformance Comparison of Different Classification Models
6. Conclusions
Author Contributions
Funding
Conflicts of Interest
Nomenclature
the attention weight matrix after normalization | |
the bias of word vectors and | |
bf,bi,bo,bc | the bias of forget gate, input gate, output gate and cell |
the bias of BiLSTM and attention model | |
the cell state of the LSTM at time and | |
the new information after transformation at time | |
the weight function | |
the forget gate, input gate and output gate at time | |
the output of LSTM at time and | |
the forward and reverse output of LSTM at time | |
the output of BiLSTM at time | |
hs | hierarchical softmax |
the cost function of the GloVe model | |
the size of the vocabulary | |
the sigmoid function | |
the semantic vector at the sentence level | |
sg | skip-gram |
the input weight of forget gate, input gate, output gate and cell | |
the implicit representation of the output of BiSLTM | |
a randomly initialized context vector | |
the cyclic weight of forget gate, input gate, output gate and cell | |
the weight matrix of forward output and reverse output in BiLSTM | |
the weight of attention model | |
the word vectors of word and | |
the co-occurrence matrix | |
the element of the co-occurrence matrix | |
the input of LSTM at time |
References
- Chen, X.; Xiong, X.; Qi, X.; Zheng, C.; Zhong, J. A big data simplification method for evaluation of relay protection operation state. Proc. CSEE 2015, 35, 538–548. [Google Scholar]
- Chen, G.; Wang, D.; Qiu, Y.; Wang, S.; Qi, X. Challenges and development prospects of relay protection technology. Autom. Electr. Power Syst. 2017, 41, 1–11. [Google Scholar]
- Li, G.; Zhang, B.; Zhao, W.; Liu, Y.; Gao, S. Data science issues in state evaluation of power equipment: Challenges and prospects. Autom. Electr. Power Syst. 2018, 42, 10–20. [Google Scholar]
- Miao, X.; Zhang, D.; Sun, D. The opportunity and challenge of big data’s application in power distribution networks. Power Syst. Technol. 2015, 39, 3122–3127. [Google Scholar]
- Zhang, X.; Cheng, X.; Zhao, D. Rule extraction of network operation ticket for power system based on the rough sets. Power Syst. Technol. 2014, 38, 1600–1605. [Google Scholar]
- Liu, Y.; Xu, Z.; Li, G.; Xia, Y.; Gao, S. Review on applications of artificial intelligence driven data analysis technology in condition based maintenance of power transformers. High Volt. Eng. 2019, 45, 337–348. [Google Scholar]
- Qiu, J.; Wang, H.; Ying, G.; Zhang, B.; Zou, G.; He, B. Text mining technique and application of lifecycle condition assessment for circuit breaker. Autom. Electr. Power Syst. 2016, 40, 107–112. [Google Scholar]
- Du, X.; Qin, J.; Guo, S.; Yan, D. Text mining of typical defects in power equipment. High Volt. Eng. 2018, 44, 1078–1084. [Google Scholar]
- Hu, J.; Yin, L.; Li, Z.; Guo, L.; Duan, L.; Zhang, Y. Fault diagnosis method of transmission and transformation equipment based on big data mining technology. High Volt. Eng. 2017, 43, 3690–3697. [Google Scholar]
- Wang, C.; Jiang, Q.; Tang, Y.; Zhu, B.; Xiang, Z.; Tang, J. Fault diagnosis of power dispatching based on alarm signal text mining. Electr. Power Autom. Equip. 2019, 39, 126–132. [Google Scholar]
- Rudin, C.; Waltz, D.; Anderson, R.N.; Boulanger, A.; Salleb-Aouissi, A.; Chow, M.; Dutta, H.; Gross, P.N.; Huang, B.; Ierome, S.; et al. Machine learning for the New York city power grid. IEEE Trans. Pattern Anal. Mach. Intell. 2012, 34, 328–345. [Google Scholar] [CrossRef] [Green Version]
- Zhang, D.; Han, X.; Deng, C. Review on the research and practice of deep learning and reinforcement learning in smart grids. CSEE J. Power Energy Syst. 2018, 4, 362–370. [Google Scholar] [CrossRef]
- Deng, J.; Zhang, W.; Yang, X. Recognition and Classification of Incipient Cable Failures Based on Variational Mode Decomposition and a Convolutional Neural Network. Energies 2019, 12, 2005. [Google Scholar] [CrossRef] [Green Version]
- Ma, B.; Dong, H.; Ren, W. Status assessment of UHV substation secondary equipment based on fuzzy comprehensive support vector machine method. Comput. Syst. Appl. 2014, 23, 191–197. [Google Scholar]
- Zhang, Y.; Hu, C.; Huang, S.; Feng, S.; Lin, G. Apriori algorithm based data mining and analysis method for secondary device defects. Autom. Electr. Power Syst. 2017, 41, 147–151. [Google Scholar]
- Liu, Z.; Wang, H.; Cao, J.; Qiu, J. A classification model of power equipment defect texts based on convolutional neural network. Power Syst. Technol. 2018, 42, 644–651. [Google Scholar]
- Jiang, Y.; Li, L.; Li, Z.; Su, C.; Wang, G.; Xiao, X. An information mining method of power transformer operation and maintenance texts based on deep semantic learning. Proc. CSEE 2019, 39, 4162–4172. [Google Scholar]
- Cao, J.; Chen, S.; Qiu, J.; Wang, H.; Ying, G.; Zhang, B. Semantic framework-based defect text mining technique and application in power grid. Power Syst. Technol. 2017, 41, 637–643. [Google Scholar]
- Shao, G.; Wang, H.; He, B. Quality assessment and improvement method for power grid equipment defect text. Power Syst. Technol. 2019, 43, 1472–1479. [Google Scholar]
- Zhao, L.; Zeng, G.; Wang, W.; Zhang, Z. Forecasting Oil Price Using Web-based Sentiment Analysis. Energies 2019, 12, 4291. [Google Scholar] [CrossRef] [Green Version]
- Ballı, S.; Karasoy, O. Development of content-based SMS classification application by using Word2Vec-based feature extraction. IET Softw. 2019, 13, 295–304. [Google Scholar] [CrossRef]
- Zhou, J.; Lu, Y.; Dai, H.; Wang, H.; Xiao, H. Sentiment analysis of Chinese microblog based on stacked bidirectional LSTM. IEEE Access 2019, 7, 38856–38866. [Google Scholar] [CrossRef]
- Zhang, D.; Xu, H.; Su, Z.; Xu, Y. Chinese comments sentiment classification based on word2vec and SVM perf. Expert Syst. 2015, 42, 1857–1863. [Google Scholar] [CrossRef]
- Dai, L.; Jiang, K. Chinese text classification based on FastText. Comput. Mod. 2018, 5, 39–44. [Google Scholar]
- Pennington, J.; Socher, R.; Manning, C. Glove: Global vectors for word representation. In Proceedings of the Conference Empirical Methods Natural Language Process, Doha, Qatar, 25–29 October 2014; pp. 1532–1543. [Google Scholar]
- Ren, X.; Zhou, Y.; He, J.; Chen, K.; Yang, X.; Sun, J. A convolutional neural network-based Chinese text detection algorithm via text structure modeling. IEEE Trans. Multimed. 2017, 19, 506–518. [Google Scholar] [CrossRef] [Green Version]
- Wei, D.; Wang, B.; Lin, G.; Liu, D.; Dong, Z.; Liu, H.; Liu, Y. Research on Unstructured Text Data Mining and Fault Classification Based on RNN-LSTM with Malfunction Inspection Report. Energies 2017, 10, 406. [Google Scholar] [CrossRef] [Green Version]
- Bai, Z.; Sun, G.; Zang, H.; Zhang, M.; Shen, P.; Liu, Y.; Wei, Z. Identification Technology of Grid Monitoring Alarm Event Based on Natural Language Processing and Deep Learning in China. Energies 2019, 12, 3258. [Google Scholar] [CrossRef] [Green Version]
- Li, W.; Li, D.; Yin, H.; Zhang, L.; Zhu, Z.; Liu, P. Lexicon-Enhanced Attention Network Based on Text Representation for Sentiment Classification. Appl. Sci. 2019, 9, 3717. [Google Scholar] [CrossRef] [Green Version]
- Xu, G.; Meng, Y.; Qiu, X.; Yu, Z.; Wu, X. Sentiment Analysis of Comment Texts Based on BiLSTM. IEEE Access 2019, 7, 51522–51532. [Google Scholar] [CrossRef]
- Dias, M.; Boné, J.; Ferreira, J.C.; Ribeiro, R.; Maia, R. Named Entity Recognition for Sensitive Data Discovery in Portuguese. Appl. Sci. 2020, 10, 2303. [Google Scholar] [CrossRef] [Green Version]
- Sun, Y.; Wang, P.; Zhai, S.; Hou, D.; Wang, S.; Zhou, Y. Ultra short-term probability prediction of wind power based on LSTM network and condition normal distribution. Wind Energy 2020, 23, 63–76. [Google Scholar] [CrossRef]
- Xu, J.; Wei, H.; Li, L.; Fu, Q.; Guo, J. Video Description Model Based on Temporal-Spatial and Channel Multi-Attention Mechanisms. Appl. Sci. 2020, 10, 4312. [Google Scholar] [CrossRef]
- Zheng, J.; Cai, F.; Shao, T.; Chen, H. Self-Interaction Attention Mechanism-Based Text Representation for Document Classification. Appl. Sci. 2018, 8, 613. [Google Scholar] [CrossRef] [Green Version]
- Li, S.; Tang, M.; Guo, Q.; Lei, J.; Zhang, J. Deep neural network with attention model for scene text recognition. IET Comput. Vis. 2017, 11, 605–612. [Google Scholar] [CrossRef]
- Meng, W.; Wei, Y.; Liu, P.; Zhu, Z.; Yin, H. Aspect based sentiment analysis with feature enhanced attention CNN-BiLSTM. IEEE Access 2019, 7, 167240–167249. [Google Scholar] [CrossRef]
- Guan, P.; Li, B.; Lv, X.; Zhou, J. Attention enhanced bi-directional LSTM for sentiment analysis. J. Chin. Inf. Process. 2019, 33, 105–111. [Google Scholar]
- Xie, Y.; Wang, Y.; Zhu, P.; You, X. Grid-search-based hybrid TOA/AOA location techniques for NLOS environments. IEEE Commun. Lett. 2009, 13, 254–256. [Google Scholar] [CrossRef]
- Blanco, M.A.; Marti-Puig, P.; Gibert, K.; Cusidó, J.; Solé-Casals, J. A Text-Mining Approach to Assess the Failure Condition of Wind Turbines Using Maintenance Service History. Energies 2019, 12, 1982. [Google Scholar] [CrossRef] [Green Version]
- Shi, L.; He, Y.; Li, B.; Cheng, T.; Huang, Y.; Sui, Y. Tilt Angle Monitoring by Using Sparse Residual LSTM Network and Grid Search. IEEE Sens. J. 2019, 19, 8803–8812. [Google Scholar] [CrossRef]
- Feng, Y.; Qu, B.; Xu, H.; Wang, R.; Zhang, Y. Chinese FastText Short Text Classification Method Integrating TF-IDF and LDA. J. Appl. Sci. 2019, 37, 378–388. [Google Scholar]
Level of Equipment Defects | Maintenance Decision |
---|---|
General Defect | No maintenance required. |
Serious Defect | Real time monitoring of equipment operation status, priority to arrange maintenance. |
Critical Defect | Immediately cut off the power for maintenance. |
Text to be Segmented | Segmentation Result |
---|---|
(Abnormal high frequency protection channel of 220 kv wumi line) | (220 kv/wu/miline/high frequency/protection/channel/abnormal) |
Category Name | Proper Noun Set |
---|---|
Transformer Name | (duolang substation) (wusu substation) (bachu substation) (wucaiwan substation) |
Name of Protection Line | (chuba line) (yaningzhou line) (qichai line) |
Terminology of Equipment Protection | (high-frequency protection) (intelligent terminal) (automatic safety device) (high resistance protection) (stability control device) |
Window Label | Central Word | Window Contents |
---|---|---|
0 | 220 kv | 220 kv—(rainbow line) |
1 | rainbow line | 220 kv—(rainbow line)—(merging unit) |
2 | merging unit | (rainbow line)—(merging unit)—cpu |
3 | cpu | (merging unit)—cpu—(plug-in is abnormal) |
4 | plug-in is abnormal | cpu—(plug-in is abnormal) |
Defect Classification | Training Set | Verification Set | Test Set |
---|---|---|---|
General Defect | 200 | 67 | 67 |
Serious Defect | 492 | 164 | 164 |
Critical Defect | 388 | 129 | 129 |
Parameter | Value |
---|---|
Learning Rate | 0.01 |
Dropout Rate | 0.5 |
L2RegLambda | 0.5 |
Number of Neurons in LSTM Hidden Layer | 64 |
Batch Size | 64 |
Word Vector Dimension | 200 |
GloVe | word2vec | FastText | |||
---|---|---|---|---|---|
Parameters | Value | Parameters | Value | Parameters | Value |
vector size | 200 | vector size | 200 | vector size | 200 |
window size | 8 | window size | 6 | window size | 5 |
min count | 2 | min count | 2 | min count | 2 |
batch_size | 512 | batch_size | 512 | batch_size | 512 |
xmax | 100 | sg | 1(skip-gram) | sg | 1(skip-gram) |
α | 0.75 | hs | 1(negative sampling) | loss function | hierarchical softmax |
Text Classification Model | Parameter | Value |
---|---|---|
TextCNN | Filter Size | (2,3,4) |
Pooling | Max_pooling | |
Dropout Rate | 0.2 | |
Iter | 5 | |
Optimization | Gradient Descent | |
Batch Size | 64 | |
Decision Tree | criterion | gini |
splitter | best | |
min_samples_split | 2 | |
min_samples_leaf | 1 |
Classification Model | Training Time |
---|---|
BiLSTM | 8 min 50 s |
BiLSTM-Attention | 9 min 12 s |
TextCNN | 10 min 20 s |
Decision Tree | 5 min 30 s |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Chen, K.; Mahfoud, R.J.; Sun, Y.; Nan, D.; Wang, K.; Haes Alhelou, H.; Siano, P. Defect Texts Mining of Secondary Device in Smart Substation with GloVe and Attention-Based Bidirectional LSTM. Energies 2020, 13, 4522. https://doi.org/10.3390/en13174522
Chen K, Mahfoud RJ, Sun Y, Nan D, Wang K, Haes Alhelou H, Siano P. Defect Texts Mining of Secondary Device in Smart Substation with GloVe and Attention-Based Bidirectional LSTM. Energies. 2020; 13(17):4522. https://doi.org/10.3390/en13174522
Chicago/Turabian StyleChen, Kai, Rabea Jamil Mahfoud, Yonghui Sun, Dongliang Nan, Kaike Wang, Hassan Haes Alhelou, and Pierluigi Siano. 2020. "Defect Texts Mining of Secondary Device in Smart Substation with GloVe and Attention-Based Bidirectional LSTM" Energies 13, no. 17: 4522. https://doi.org/10.3390/en13174522
APA StyleChen, K., Mahfoud, R. J., Sun, Y., Nan, D., Wang, K., Haes Alhelou, H., & Siano, P. (2020). Defect Texts Mining of Secondary Device in Smart Substation with GloVe and Attention-Based Bidirectional LSTM. Energies, 13(17), 4522. https://doi.org/10.3390/en13174522