1. Introduction
Due to the limitations of urban land resources and the requirements for power reliability, power cables are widely used in urban power transmission and distribution systems [
1,
2,
3]. With the rapid increase in the scale of power cable applications, the number of short-circuit faults has also increased [
4,
5]. Although the overall short-circuit fault rate of the power cables is low, the origin of the short-circuit faults has many possible factors, which are difficult to predict, and there will be wide social impacts and large economic losses caused by the faults. In order to shorten the repair time of the faulty power cables, efforts are needed to develop reliable and accurate fault location methods.
According to the fault location principles, currently, online fault location methods used in power systems are mainly based on the impedance and the traveling wave. Both of the impedance-based and the traveling-wave-based methods originated from the fault location for overhead lines. The impedance-based method [
6,
7,
8] mainly uses the relay protection device to collect fault voltage/current data, and locates the fault based on the parameter identification of the system impedance. Since the impedance monitored by this method may have a nonlinear relationship with the distance between the fault point, it is difficult to directly locate the fault accurately through the relationship between the impedance and the distance between the fault point. The traveling-wave-based methods [
9,
10,
11] locate the fault through the identification of the propagation time of the fault traveling wave. When the line distribution parameters are large or non-uniform, it is difficult to accurately extract the head and/or the arrival time of the traveling wave.
Recently, efforts have been made to improve the accuracy and applicability of impedance-based methods. In [
12], mutual inductance effects have been taken into account to make the method suitable for the medium voltage distribution network in the double-circuit lines. In [
13], the fault location criteria for different fault sections of the distributed network have been proposed, taking into account the electrical characteristics of nodes in the distributed network under fault and non-fault conditions. In [
14], a fault line selection method of distributed network has been proposed based on the direction vector characteristics of the fault currents. In [
15], a Fibonacci search algorithm has been used to analyze a specific network structure under the premise of low fault resistance, the results indicate that this method improves the robustness of the impedance-based method with respect to the fault resistance. In [
16], the influence of the load uncertainty on the accuracy of fault location has been studied, and a method to reduce the influence has been proposed. The research above can be summarized in
Table 1.
As shown in
Table 1, the improvements of the impedance-based method mainly focus on the applicability of different application scenarios. With the different focuses, the fault location criterion cannot be uniform or mutually applicable.
Additionally, efforts have been made to improve the accuracy and the applicability of the traveling-wave-based method in recent years. In [
17], the regular pattern of the reflection of voltage and current traveling waves at the end of the line has been analyzed, then a fault searching algorithm has been proposed based on mid-point time, which can be used for the fault location of mixed line with discontinuous wave velocity. In [
18], time-frequency analysis of fault traveling waves has been carried out using continuous wavelet transform method, so that the arrival time of the traveling wave could be identified according to the characteristic frequency of the fault traveling wave. In [
19], the identification accuracy of the high frequency part of the fault traveling wave has been improved using a multilayer neural network. In [
20], the problem of multi-channel signal synchronization has been solved by using the time difference between the first arrival wave and the first reflected wave. In [
21], a normalized fault location criterion has been proposed, which does not require external time reference and accurate traveling wave velocity. However, the time difference may be aliased in multiple discharges of the fault, which is common in cable breakdown discharges. The above studies have improved the traveling-wave-based method in terms of traveling wave propagation, time-frequency analysis, and feature recognition, which can be summarized in
Table 2.
It can be seen from the above-mentioned literature analysis that, as time goes by, there are continuous researches to improve the traveling-wave-based method, which also makes the method supplemented and improved. With the improvement of computing performance and signal synchronization accuracy, many applications of intelligent algorithms have emerged in recent years, but these improvements and applications are limited to a single type of line or specific application scenarios, and it is difficult to form a unified fault location criterion. The improvements have not fundamentally solved the inherent problems of traveling wave head attenuation and noises.
The authors of this work previously proposed an unsupervised learning method for fault location of power cables and to improve on the traditionally applied traveling-wave-based method [
22,
23]. The present paper further expands on their previously published work, and makes modifications of the fault location process. There are four steps for the fault location in the previous contribution [
23], i.e., (1) build a matrix of the traveling waves associated with the sheath currents of the cables; (2) apply t-distributed Stochastic Neighbor Embedding (t-SNE) to reconstruct the matrix into a low dimension; (3) cluster the data in the matrix according to its closeness, using Density-Based Spatial Clustering of Applications with Noise (DBSCAN); (4) search for the maximum slope point of the non-noise cluster with the fewest samples to identify the arrival time of the traveling wave. The improvement mainly lies in the high-dimensional clustering for the identification of the arrival time of the traveling waves of multiple signals. Thus, Steps 2 and 3 can be merged as one, the information of the original signals can be reserved without the dimensionality reduction, and the waveforms of the traveling waves can be identified directly via an unsupervised learning algorithm. In addition, the presented method can output a condensed cluster tree which is believed to correspond with the traveling waves. The specific relation between the condensed cluster tree and the traveling wave will be studied in further research, which could help to make the process of fault occurrence and traveling wave propagation clearer.