Transmission Line Fault-Cause Identification Based on Hierarchical Multiview Feature Selection
Abstract
:1. Introduction
- To the best of our knowledge, this is the first time that multiview learning is introduced for transmission line fault-cause identification in view of the nature of multiview fault data.
- We propose a novel approach, HMVFS, based on the ε-dragging and two regularization terms to select the discriminative features across views. We also develop an iterative algorithm to solve the optimization problem and prove its convergence theoretically.
- The performance of HMVFS is evaluated on field data and compared with classical feature selection methods. Experimental results prove the effectiveness of combining waveform and contextual features and demonstrate the feasibility and superiority of HMVFS.
2. Hierarchical Multiview Feature Selection (HMVFS)
2.1. Notation
2.2. The Objective Function
2.3. Optimization
2.4. Convergence
Algorithm 1 The optimization algorithm for (8) |
Input: The feature matrix across all views, ; the label matrix, ; the parameters and |
Output: The weight matrix across all views, |
1: Calculate B from Y via (2) |
2: Initialize W0 and M0 |
3: Initialize |
4: Repeat |
5: Calculate Ct and Dt from Wt |
6: |
7: |
8: |
9: Calculate residue via (1) |
10: Until convergence or maximum iteration number achieved |
3. Material and Characterization
3.1. Data Collection and Cleaning
3.2. Waveform Characteristics
- 1.
- Maximum Change of Sequence Components: Instantaneous magnitude is calculated relative to prefault amplitude in order to be compatible with measurements from different voltage levels and operation conditions. Karenbauer transformation is used to obtain zero, positive and negative components of three-phase signals, denoted by s, s = 0, 1, 2.
- 2.
- Maximum Rate of Change of Sequence Components:
- 3.
- Sequence Component Values at t-cycle: t is set to be 0, 0.5, 1 and 1.5. For instance, t = 0.5 means the measuring point is 1/2 cycle from the start.
- 4.
- Custom Time Constant of Sequence Current: Inspired by a linear time-invariant system, time content is introduced to reflect the dynamic response of the network [23]. Time content is the time required to rise from the zero point to 1/e of the maximum current. In this study, 1/e is replaced with a custom value, m. These features are denoted as
- 5.
- DC and Harmonic Content: Hilbert–Huang transform is used to conduct spectrum analysis [17]. The harmonic content and DC content are calculated from the ratio of the specific component to the fundamental component. DC and harmonic content are denoted as
- 6.
- Wavelet Energy and Energy Entropy: Discrete wavelet transform is applied to decompose fault-phase current signals into three wavelet scales. Wavelet energy E and energy entropy S are calculated for each scale.
- 7.
- Maximum DC Current: Equation (30) is used to calculate the maximum DC current on three-phase signals. Ns is the number of data points in one cycle, and n = 0 means the triggering point.
- 8.
- Time Domain Factors: Form factor, crest factor, skewness and kurtosis, denoted as t1–t4, respectively, are introduced to reflect characteristics of waveform shape and the shock for fault-phase current signals. SD denotes their standard deviation.
- 9.
- Approximation Constants for Neural Waveform: In order to learn more from the front wave, the waveform of rms neutral voltage/current is approximated by (32), as introduced in [33].
- 10.
- Fault Inception Phase Angle (FIPA): FIPA is calculated based on the trigger time after the last zero crossing point prior to fault happening.
3.3. Contextual Characteristics
4. Experiments and Discussion
4.1. Experiment Setup
4.2. Comparison Feature Algorithms
4.3. Overall Classification Performance
4.4. Parameter Sensitivity
4.5. Comparison between ML Classifiers
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Ferreira, V.H.; Zanghi, R.; Fortes, M.Z.; Sotelo, G.G.; Silva, R.; Souza, J.; Guimarães, C.; Gomes, S., Jr. A survey on intelligent system application to fault diagnosis in electric power system transmission lines. Electr. Power Syst. Res. 2016, 136, 135–153. [Google Scholar]
- Chen, Y.; Fink, O.; Sansavini, G. Combined fault location and classification for power transmission lines fault diagnosis with integrated feature extraction. IEEE Trans. Ind. Electron. 2018, 65, 561–569. [Google Scholar] [CrossRef]
- Minnaar, U.J.; Gaunt, C.T.; Nicolls, F. Characterisation of power system events on South African transmission power lines. Electr. Power Syst. Res. 2012, 88, 25–32. [Google Scholar] [CrossRef]
- Cai, Y.; Chow, M. Cause-Effect modeling and spatial-temporal simulation of power distribution fault events. IEEE Trans. Power Syst. 2011, 26, 794–801. [Google Scholar] [CrossRef]
- Gui, M.; Pahwa, A.; Das, S. Bayesian network model with Monte Carlo simulations for analysis of animal-related outages in overhead distribution systems. IEEE Trans. Power Syst. 2011, 26, 1618–1624. [Google Scholar] [CrossRef]
- Núñez, V.B.; Meléndez, J.; Kulkarni, S.; Santoso, S. Feature analysis and automatic classification of short-circuit faults resulting from external causes. Int. Trans. Power Syst. 2013, 23, 510–525. [Google Scholar] [CrossRef]
- Liang, Y.; Li, K.; Ma, Z.; Lee, W. Typical fault cause recognition of single-phase-to-ground fault for overhead Lines in nonsolidly earthed distribution networks. IEEE Trans. Ind. Appl. 2020, 56, 6298–6306. [Google Scholar] [CrossRef]
- Xu, L.; Chow, M.; Taylor, L.S. Power distribution fault cause identification with imbalanced data using the data mining-based fuzzy classification E-algorithm. IEEE Trans. Power Syst. 2007, 22, 164–171. [Google Scholar] [CrossRef] [Green Version]
- Xu, L.; Chow, M.; Timmis, J.; Taylor, L.S. Power distribution outage cause identification with imbalanced data using artificial immune recognition system (AIRS) algorithm. IEEE Trans. Power Syst. 2007, 22, 198–204. [Google Scholar] [CrossRef]
- Xu, L.; Chow, M. A classification approach for power distribution systems fault cause identification. IEEE Trans. Power Syst. 2006, 21, 53–60. [Google Scholar]
- Cai, Y.; Chow, M.; Lu, W.; Li, L. Statistical feature selection from massive data in distribution fault diagnosis. IEEE Trans. Power Syst. 2010, 25, 642–648. [Google Scholar] [CrossRef]
- Chang, G.W.; Hong, Y.; Li, G. A hybrid intelligent approach for classification of incipient faults in transmission network. IEEE Trans. Power Deliv. 2019, 34, 1785–1794. [Google Scholar] [CrossRef]
- Morales, J.; Orduña, E.A.; Rehtanz, C. Identification of lightning stroke due to shielding failure and backflashover for ultra-high-speed transmission line protection. IEEE Trans. Power Deliv. 2014, 29, 2008–2017. [Google Scholar] [CrossRef]
- Jiang, X.; Stephen, B.; McArthur, S. Automated distribution network fault cause identification with advanced similarity metrics. IEEE Trans. Power Deliv. 2021, 36, 785–793. [Google Scholar] [CrossRef]
- Liang, H.; Liu, Y.; Sheng, G.; Jiang, X. Fault-cause identification method based on adaptive deep belief network and time–frequency characteristics of travelling wave. IET Gener. Transm. Distrib. 2019, 13, 724–732. [Google Scholar] [CrossRef]
- Tse, N.C.F.; Lai, L.L. Wavelet–based algorithm for signal analysis. EURASIP J. Adv. Signal Process. 2007, 2007, 1–10. [Google Scholar] [CrossRef] [Green Version]
- Malik, H.; Sharma, R. Transmission line fault classification using modified fuzzy Q learning. IET Gener. Transm. Distrib. 2017, 11, 4041–4050. [Google Scholar] [CrossRef]
- Tse, N.C.F.; Chan, J.Y.C.; Lau, W.H.; Poon, J.T.Y.; Lai, L.L. Real-time power-quality monitoring with hybrid sinusoidal and lifting wavelet compression algorithm. IEEE Trans. Power Deliv. 2012, 27, 1718–1726. [Google Scholar] [CrossRef]
- Tse, N.C.F.; Chan, J.Y.C.; Lau, W.H.; Lai, L.L. Hybrid wavelet and Hilbert transform with frequency shifting decomposition for power quality analysis. IEEE Trans. Instrum. Meas. 2012, 61, 3225–3233. [Google Scholar] [CrossRef]
- Asman, S.H.; Aziz, N.; Amirulddin, U.; Kadir, M. Decision tree method for fault causes classification based on RMS-DWT analysis in 275 kV transmission lines network. Appl. Sci. 2021, 11, 4031–4051. [Google Scholar] [CrossRef]
- Qin, X.; Wang, P.; Liu, Y.; Guo, L.; Sheng, G.; Jiang, X. Research on distribution network fault recognition method based on time-frequency characteristics of fault waveforms. IEEE Access 2018, 6, 7291–7300. [Google Scholar] [CrossRef]
- Dehbozorgi, M.; Rastegar, M.; Dabbaghjamanesh, M. Decision tree-based classifiers for root-cause detection of equipment-related distribution power system outages. IET Gener. Transm. Distrib. 2020, 14, 5809–5815. [Google Scholar] [CrossRef]
- Minnaar, U.J.; Nicolls, F.; Gaunt, C. Automating transmission-line fault root cause analysis. IEEE Trans. Power Deliv. 2016, 31, 1692–1700. [Google Scholar] [CrossRef]
- Shi, Y.; Ji, H. Kernel canonical correlation analysis for specific radar emitter identification. Electron. Lett. 2014, 50, 1318–1319. [Google Scholar] [CrossRef]
- Ramachandram, D.; Taylor, G. Deep multimodal learning: A survey on recent advances and trends. IEEE Signal Process. Mag. 2017, 34, 96–108. [Google Scholar] [CrossRef]
- Lai, C.S.; Yang, Y.; Pan, K.; Zhang, J.; Yuan, H.L.; Wing, W.; Gao, Y.; Zhao, Z.; Wang, T.; Shahidehpour, M.; et al. Multi-view neural network ensemble for short and mid-term load forecasting. IEEE Trans. Power Syst. 2020. [Google Scholar] [CrossRef]
- Lai, C.S. Compression of power system signals with wavelets. In Proceedings of the 2014 International Conference on Wavelet Analysis and Pattern Recognition, Lanzhou, China, 13–16 July 2014. [Google Scholar] [CrossRef]
- Lai, C.S. High impedance fault and heavy load under big data context. In Proceedings of the 2015 IEEE International Conference on Systems, Man, and Cybernetics, Hong Kong, China, 9–12 October 2016. [Google Scholar] [CrossRef]
- Xu, Z.; Yang, P.; Zhao, Z.; Lai, C.S.; Lai, L.L.; Wang, X. Fault diagnosis approach of main drive chain in wind turbine based on data fusion. Appl. Sci. 2021, 11, 5804. [Google Scholar] [CrossRef]
- Xiang, S.; Nie, F.; Meng, G.; Pan, C.; Zhang, C. Discriminative least squares regression for multiclass classification and feature selection. IEEE Trans. Neural Netw. Learn. Syst. 2012, 23, 1738–1754. [Google Scholar] [CrossRef] [PubMed]
- Zhu, X.; Li, X.; Zhang, S. Block-row sparse multiview multilabel learning for image classification. IEEE Trans. Cybern. 2016, 46, 450–461. [Google Scholar] [CrossRef]
- Zhang, Y.; Wu, J.; Cai, Z.; Yu, P.S. Multiview multilabel learning with sparse feature selection for image annotation. IEEE Trans. Multimed. 2020, 22, 2844–2857. [Google Scholar] [CrossRef]
- Zin, A.; Karim, S. Protection system analysis using fault signatures in Malaysia. Int. J. Electr. Power Energy Syst. 2013, 45, 194–205. [Google Scholar] [CrossRef]
- Lai, C.S.; Zhong, C.; Pan, K.; Ng, W.W.Y.; Lai, L.L. A deep learning based hybrid method for hourly solar radiation forecasting. Expert. Syst. Appl. 2021, 177, 114941. [Google Scholar] [CrossRef]
- Yamada, M.; Jitkrittum, W.; Sigal, L.; Xing, E.; Sugiyama, M. High-dimensional feature selection by feature-wise Kernelized Lasso. Neural Comput. 2014, 26, 185–207. [Google Scholar] [CrossRef] [Green Version]
- Bennasar, M.; Hicks, Y.; Setchi, R. Feature selection using Joint Mutual Information Maximisation. Expert Syst. Appl. 2015, 42, 8520–8532. [Google Scholar] [CrossRef] [Green Version]
- Haghighat, M.; Abdel-Mottaleb, M.; Alhalabi, W. Discriminant correlation analysis: Real-time feature level fusion for multimodal biometric recognition. IEEE Trans. Inf. Forensics Secur. 2016, 11, 1984–1996. [Google Scholar] [CrossRef] [Green Version]
Article | Waveform Characteristics | Time Characteristics | External Characteristics | Classification Methods | |||
---|---|---|---|---|---|---|---|
Signal Amplitude | Sequence Component | Spectrum Analysis | Phase or Phase Angle | ||||
* Núñez, Meléndez [6] | √ | √ | √ | √ | CN2 | ||
Liang, Li [7] | √ | √ | FIS | ||||
* Xu, Chow [8,9,10] | √ | √ | √ | FIS/LR/ANN | |||
* Cai, Chow [11] | √ | √ | √ | LR | |||
Chang, Hong [12] | √ | √ | SVM | ||||
* Jiang, Liu [14] | √ | √ | √ | √ | √ | KNN | |
Liang, Liu [15] | √ | √ | √ | DBN | |||
Asman, Aziz [20] | √ | √ | decision tree | ||||
* Qin, Wang [21] | √ | √ | √ | √ | √ | logic flow | |
* Dehbozorgi, Rastegar [22] | √ | √ | decision tree | ||||
Minnaar, Nicolls [23] | √ | √ | √ | √ | √ | KNN |
Pool Type | Feature | Total Number |
---|---|---|
Waveform | Maximum sequence voltage/current | 5 |
Maximum change of three-phase signals and sequence components | 6 | |
Sequence component values | 24 | |
Custom time constant of sequence current | 30 | |
DC and harmonic content | 6 | |
Wavelet energy and energy entropy | 6 | |
Maximum DC current | 1 | |
Form factor, crest factor, skewness and kurtosis | 4 | |
Approximation constants | 2 | |
FIPA | 1 | |
Contextual | Time stamp: season, day/night, mouth, hour | 4 |
Location: landform, zone | 2 | |
Meteorological data: weather, temperature, humidity, rainfall, cloud cover, maximum wind speed, wind scale | 7 | |
Protection data: reclosing, fault phase, fault duration, tripping time, breaker quenching time, reclosing time, number of triggering | 7 | |
Others: voltage level, number of faults | 2 |
Classifier | Feature Selection | Feature Number | Gmean | ACC | AUC |
---|---|---|---|---|---|
CN2 | F_Score | 39 | 0.707 | 0.581 | 0.834 |
ReliefF | 33 | 0.707 | 0.580 | 0.836 | |
HMVFS | 28 | 0.730 | 0.612 | 0.841 | |
LR | F_Score | 16 | 0.833 | 0.756 | 0.889 |
ReliefF | 15 | 0.833 | 0.756 | 0.896 | |
HMVFS | 33 | 0.831 | 0.752 | 0.896 | |
KNN | F_Score | 14 | 0.838 | 0.764 | 0.891 |
ReliefF | 11 | 0.835 | 0.760 | 0.895 | |
HMVFS | 7 | 0.848 | 0.778 | 0.909 | |
SVM | F_Score | 18 | 0.812 | 0.728 | 0.908 |
ReliefF | 18 | 0.837 | 0.761 | 0.906 | |
HMVFS | 14 | 0.849 | 0.779 | 0.921 | |
ANN | F_Score | 18 | 0.837 | 0.761 | 0.891 |
ReliefF | 15 | 0.850 | 0.780 | 0.911 | |
HMVFS | 36 | 0.842 | 0.769 | 0.915 | |
RF | F_Score | 27 | 0.878 | 0.821 | 0.926 |
ReliefF | 12 | 0.876 | 0.819 | 0.935 | |
HMVFS | 9 | 0.875 | 0.817 | 0.935 | |
AdaBoost | F_Score | 36 | 0.781 | 0.684 | 0.797 |
ReliefF | 19 | 0.777 | 0.679 | 0.830 | |
HMVFS | 14 | 0.784 | 0.690 | 0.846 | |
META-DES | F_Score | 19 | 0.876 | 0.816 | 0.930 |
ReliefF | 11 | 0.872 | 0.812 | 0.928 | |
HMVFS | 12 | 0.881 | 0.824 | 0.937 | |
DES-Clustering | F_Score | 32 | 0.872 | 0.812 | 0.916 |
ReliefF | 13 | 0.875 | 0.817 | 0.932 | |
HMVFS | 10 | 0.882 | 0.827 | 0.945 | |
KNORA-U | F_Score | 15 | 0.872 | 0.812 | 0.926 |
ReliefF | 14 | 0.870 | 0.809 | 0.932 | |
HMVFS | 12 | 0.884 | 0.829 | 0.942 | |
Stacking | F_Score | 16 | 0.880 | 0.824 | 0.930 |
ReliefF | 13 | 0.874 | 0.814 | 0.936 | |
HMVFS | 11 | 0.886 | 0.831 | 0.939 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Jian, S.; Peng, X.; Yuan, H.; Lai, C.S.; Lai, L.L. Transmission Line Fault-Cause Identification Based on Hierarchical Multiview Feature Selection. Appl. Sci. 2021, 11, 7804. https://doi.org/10.3390/app11177804
Jian S, Peng X, Yuan H, Lai CS, Lai LL. Transmission Line Fault-Cause Identification Based on Hierarchical Multiview Feature Selection. Applied Sciences. 2021; 11(17):7804. https://doi.org/10.3390/app11177804
Chicago/Turabian StyleJian, Shengchao, Xiangang Peng, Haoliang Yuan, Chun Sing Lai, and Loi Lei Lai. 2021. "Transmission Line Fault-Cause Identification Based on Hierarchical Multiview Feature Selection" Applied Sciences 11, no. 17: 7804. https://doi.org/10.3390/app11177804
APA StyleJian, S., Peng, X., Yuan, H., Lai, C. S., & Lai, L. L. (2021). Transmission Line Fault-Cause Identification Based on Hierarchical Multiview Feature Selection. Applied Sciences, 11(17), 7804. https://doi.org/10.3390/app11177804