Unsupervised Anomaly Detection Applied to Φ-OTDR

Almudévar, Antonio; Sevillano, Pascual; Vicente, Luis; Preciado-Garbayo, Javier; Ortega, Alfonso

doi:10.3390/s22176515

Open AccessArticle

Unsupervised Anomaly Detection Applied to Φ-OTDR

by

Antonio Almudévar

^1,*

,

Pascual Sevillano

²

,

Luis Vicente

¹

,

Javier Preciado-Garbayo

³

and

Alfonso Ortega

¹

ViVoLab, Aragón Institute for Engineering Research (I3A), University of Zaragoza, 50009 Zaragoza, Spain

²

Applied Physics Department, Aragón Institute for Engineering Research (I3A), University of Zaragoza, 50009 Zaragoza, Spain

³

Aragon Photonics Labs (APL) and Electronic Engineering and Communications Department, University of Zaragoza, 50009 Zaragoza, Spain

^*

Author to whom correspondence should be addressed.

Sensors 2022, 22(17), 6515; https://doi.org/10.3390/s22176515

Submission received: 28 July 2022 / Revised: 21 August 2022 / Accepted: 24 August 2022 / Published: 29 August 2022

(This article belongs to the Section Optical Sensors)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Distributed acoustic sensors (DASs) based on direct-detection Φ-OTDR use the light–matter interaction between light pulses and optical fiber to detect mechanical events in the fiber environment. The signals received in Φ-OTDR come from the coherent interference of the portion of the fiber illuminated by the light pulse. Its high sensitivity to minute phase changes in the fiber results in a severe reduction in the signal to noise ratio in the intensity trace that demands processing techniques be able to isolate events. For this purpose, this paper proposes a method based on Unsupervised Anomaly Detection techniques which make use of concepts from the field of deep learning and allow the removal of much of the noise from the Φ-OTDR signals. The fact that this method is unsupervised means that no human-labeled data are needed for training and only event-free data are used for this purpose. Moreover, this method has been implemented and its performance has been tested with real data showing promising results.

Keywords:

distributed acoustic sensors; Φ-OTDR; deep learning; autoencoder; Unsupervised Anomaly Detection

1. Introduction

Distributed acoustic sensing comprises several techniques which make use of light–matter interaction phenomena to turn the fiber into a distributed sensor. Those based on Rayleigh scattering have been widely used because of the relatively high scattering coefficient compared to Raman or Brillouin [1]. In 1993, an enhanced version of an Optical Time Domain Reflectomer (OTDR) employing a highly coherent laser, named phase-Sensitive OTDR (Φ-OTDR), was demonstrated and tested as an intrusion detector [2,3]. Φ-OTDRs are based on the variation of the backscattered intensity trace between pulses propagating through the fiber. The jagged shape of this trace is generated by the random interference of the light backscattered from different points along the fiber. Strain or temperature variations in the fiber cause a localized change in the relative phase between scattering points that severely affect the interference at that point. Thus, changes in the profile of this trace between pulses reveal minute local variations of physical conditions at any point of the fiber. Due to this feature, Φ-OTDR has been widely employed as a high sensitivity distributed sensor in many fields, such as pipeline protection [4,5], live traffic monitoring [6] or third party intrusion (TPI) [7,8].

The performance of this technology in practical application fields, especially in TPI, rests on its ability to locate and identify threats without increasing the nuisance alarm rate. For that purpose, the system needs to provide the best possible SNR level in order to detect events accurately. In its most simple version, the detection stage of the Φ-OTDR is based on a direct-detection scheme that provides only the intensity backscattered trace [9]. These recovered intensity traces are always noisy and present high fluctuations because of the non-linear response of the interference with the phase changes induced by the stimulus [10]. Traditionally, the most employed methods to reduce this impairment are based on the demodulation of the phase from the backscattered trace. These techniques implement schemes such as I/Q demodulation [11,12], heterodyne detection [13,14] or self-correlation algorithms of the trace [15,16,17]. The first two categories are based on the beat of two or more intensity signals that have a controlled delay between them and require coherent detection and high performance acquisition, among other hardware requirements, which increases the complexity and the cost of the solution. In contrast, self-correlation algorithms keep the direct-detection scheme while recovering the phase from the intensity trace. However, these last algorithms are based on specific characteristics of the signal that appear only for certain types of stimuli, narrowing its practical field of use. Even though these techniques either demand costly hardware or cannot be applied for any previously unknown stimuli, they have been widely employed in phase-sensitive OTDR. This is because the direct-detection configuration has not been considered as an alternative due to the low SNR than can be achieved and its effect in event detection for real scenarios.

Recently, there has been a growing interest in the use of deep-learning techniques applied to distributed acoustic sensors. These techniques have been proven to enhance the performance of the direct-detection Φ-OTDR by increasing the SNR without requiring extra hardware on the optical setup [18]. Post-processing algorithms applied to the raw intensity trace have been reported in multiple studies to denoise the signal and increase the event detect ratio. In this work, we have experimentally analyzed the Unsupervised Anomaly Detection (UAD) paradigm for event detection in direct-detection Φ-OTDR. Throughout this work, we analyze the SNR-enhanced techniques already reported in the literature and evaluate the UAD algorithm for event detection in real scenarios.

This paper is organized as follows. Section 2 describes the properties of the backscattered signal registered in Φ-OTDR and the non-hardware related approaches that can be found in the literature to increase its effective SNR and introduces the novel approach presented in this work. Section 3 details the experimental setup and the field experiments performed to validate the algorithm. Section 4 explains the proposed algorithms and all the steps involved from the raw signal to its output. Section 5 shows the results of the experiments and the discussion. Finally, Section 6 presents our conclusions.

2. Unsupervised Anomaly Detection in Φ-OTDR

2.1. Signal in Direct-Detection Φ-OTDR

The signal received at the detector in phase-sensitive OTDR is continuous. At any precise time, the signal acquired corresponds to the coherent sum of the fields reflected by the scatter centers in the fiber located within the region stimulated by the pulse at that time. This sum can be expressed as:

E {(t)}_{z = 0} = E_{0} e^{- 2 α \bar{z}} e^{i ω t} \sum_{m = 1}^{M} r_{m} e^{i ϕ_{m}}

(1)

where

E_{0}

is the reference amplitude,

α

is the attenuation coefficient of the fiber,

ω

accounts for the frequency of the pulsed light and

\bar{z}

is the physical location of the center of the pulse and is defined as

\bar{z} = (1 / 2) (t v_{g} - w_{p} / 2)

, with

v_{g}

being the group velocity in the fiber and

w_{p}

being the pulse width. The summation term accounts for all the M scatter centers; with reflectivity

r_{m} \in [0, 1]

and

ϕ_{m}

, the optical phase change causes uniform distribution in the interval

[0, 2 π)

, which lies within the spatial region stimulated by half of the pulse width and thus is located in

z \in [(1 / 2) (t v_{g} - w_{p}), (1 / 2) (t v_{g})]

[19]. In the most general case, fiber attenuation within the pulse width can be neglected as the spatial resolution is assumed to be small enough. The intensity registered by the detector, omitting the losses, can be expressed in terms of the resulting field as

\begin{matrix} I = {| E |}^{2} = {| \sum_{m = 1}^{M} r_{m} e^{i ϕ_{m}} |}^{2} = \sum_{m = 1}^{M} r_{m}^{2} + 2 \sum_{k = 1}^{M - 1} \sum_{l = k + 1}^{M} r_{k} r_{l} cos (ϕ_{k} - ϕ_{l}) \end{matrix}

(2)

The term regarding

cos (ϕ_{k} - ϕ_{l})

accounts for the relative phase change between scatter centers and can be considered non-variant for static mechanical conditions of the fiber. Phase noise of the laser source or environmental fluctuations such as temperature can affect this value, generating a slow drift than can be neglected in a short time range compared to pulse frequency. Now, if we consider a perturbation that involves a phase change

θ_{p}

between the reflector

q - 1

and q in a region of M scattering centers, the difference of the intensity signals between before,

I_{M}

, and after the perturbation,

I_{M}^{'}

, can be described as

\begin{matrix} I_{M} - I_{M}^{'} = 2 \sum_{k = 1}^{q - 1} \sum_{l = q}^{M} r_{k} r_{l} [cos (ϕ_{k} - ϕ_{l}) (1 - cos θ_{p}) - sin (ϕ_{k} - ϕ_{l}) sin θ_{p}] \end{matrix}

(3)

This dependence shows the non-linear response of the phenomena to perturbations of the fiber [20]. Direct-detection schemes are based on the analysis of this intensity signal and thus this non-linearity is a strong impairment in its use for sensing due to the high nuisance alarm rate it implies when using event-detection algorithms in real scenarios [20]. The most common and simple approach to overcome this nonlinearity in the intensity trace is the implementation of a coherent stage that could reveal the phase of the backscattered signal. For that, the laser source is split and a small portion of the original signal, aided by an acousto-optic modulator, is frequency shifted and directly coupled into the detector with the signal coming from the fiber [21]. By doing this, the phase of the backscattered signal can be measured via the electric signal originating from the beat of the two optical signals, which presents a low frequency that can be handled by standard electronics. I/Q demodulation or Hilbert transform demodulation are common techniques that can be performed to recover the phase in this architecture [21,22]. Other classical approaches have been proved valid in order to recover the phase such as those based on the Kramers–Kroning receiver [23], Rayleigh backscattering self-interference detection [16] and the phase-generated carrier [24]. However, all these solutions require extra hardware or rely on some specific conditions of the signal that limit its range of application or increase its deployment cost in real scenarios.

2.2. Event Detection for Φ-OTDR

Recently, we have seen a different approach in order to increase the SNR and reduce the false alarm rate in phase-sensitive OTDR based on the application of advanced algorithms to the raw data. The use of machine-learning techniques and tools has been studied as an alternative to hardware-oriented solutions in order to enhance the performance of phase-sensitive OTDR without increasing the complexity and cost of the setup. So far, different methods based on machine-learning techniques have been proposed to process signals from DAS sensors.

Depending on the data to be worked with and the objective, two working scenarios are possible. The first aims to classify the different types of situations that the DAS can capture. That is, the objective in this scenario is to indicate whether there is an event or not and, if there is, to say what type of event it is. This scenario requires a very large amount of data of each type of event labeled by a human, which is costly and time-consuming. The second scenario, on the other hand, aims to discern between event-absence and event-presence situations. That is, in cases where events are present, there is no indication of which possible event it is. The main advantage of this scenario is that only event-free data are needed for its design and, therefore, it does not require human-provided labeling. In addition, in the case of wanting to classify events, as is the aim in the first scenario, the fact of previously separating between absence and presence of events can facilitate the task considerably. Equivalently, the objective of the latter scenario can be seen as that of denoising the signals, so that the SNR in event areas is much higher after processing them and, therefore, it is easier to detect events. This work belongs to the second scenario, but it is convenient to analyze works belonging to both groups, since some techniques used are similar.

As for the first scenario, in [25], principal component analysis (PCA) is used to obtain features of reduced dimension that are subsequently classified using support vector machine (SVM). In [26,27], convolutional neural networks (CNNs) whose input are the Mel-frequency cepstral coefficients (MFCCs) [28] of the signal corresponding to each DAS distance are used to classify different types of events. Moreover, it is proposed in [29] to use a network based on VGG16 [30] to classify events; it is trained with signals generated by Generative Adversarial Networks (GANs) [31]. Finally, in [7], CNN and the long short-term memory network (LSTM) are combined to both detect and classify events by taking spatio-temporal patches as inputs.

Regarding the second scenario, a solution based on real-time event-detection methods, in particular YOLOv3 [32], is proposed in [33]. On the other hand, in [34,35] it is proposed to generate artificial data and noise, sum them and process the result through a CNN that is able to eliminate the added noise. This idea is based on the concept of a denoising autoencoder [36]. Finally, in [37], the previous idea is combined with a GAN, the generator of which is a denoiser that takes as input the summed signal and noise. The reason for introducing a GAN to the previous proposal is that the generator is improved and overcomes the discriminator.

This work belongs to the second scenario, i.e., it tries to detect events, but not to classify them. However, the way of approaching the task is different from those described above. For this purpose, it is assumed that the events that the Φ-OTDR aims to capture are anomalies and Unsupervised Anomaly Detection (UAD) techniques are introduced. UAD refers to the task of detecting out-of-normal events using only anomaly-free data (also known as normal data) for its design. UAD techniques proposed in this work also use neural networks, as some of the previously described systems do, but the workflow is different, as is explained in Section 4.

3. Experimental Setup and Field Experiment

The setup employed is based on a conventional direct-detection

ϕ

-OTDR and it is depicted in Figure 1. The emission source is a coherent laser centered in 1550.12 nm. Due to the availability for the field test, the linewidth of the laser employed was 1 Khz, although for the direct-detection scheme this is not a requirement. The continuous wave emission is pulsed by an acousto-optic modulator fed with a square signal provided by an electrical Signal Generator (SG). The pulse width employed is 200 ns and the pulse frequency is 1 kH, which corresponds to a 20 m spatial resolution and an effective bandwidth of 500 Hz. The pulsed light is amplified by an Erbium Doped Fiber Amplifier (EDFA) and the undesired noise generated by spontaneous emission is filtered out with the Optical Bandpass Filter (OBPF) of 100 GHz conveniently centered at the emitting wavelength. The amplified and filtered pulse is coupled into the fiber under test (FUT) by means of a circulator. Finally, the backscattered signal is guided into the detection stage, where it is amplified and filtered with the same scheme as in the emission stage. At the end of the detection stage, the optical signal is registered with a photodetector (PD) and sampled by an analog–digital converter (ADC) at 50 MS/s, which yields to a sampling resolution of 2 m. The maximum reach of the system will be determined by the degradation of the OSNR along the measurement setup, and will be settled by these configuration parameters and the distributed attenuation in the layout. Due to the aforementioned nonlinearity nature of the signals in the phase-sensitive OTDR, the OSNR analysis requires a statistical analysis of the signal at the ending point where a reference stimulus is placed [38]. For this setup and the configured parameters, the maximum reach was set to 41 km.

To train and evaluate the systems explained in the next section, the layout was designed so the same stimulus could be recorded at several distances at the same time and the effect of the distance for the perturbation could be precisely evaluated. The FUT in this case was a standard single mode fiber deployed inside and optical fiber cable, which was buried in the ground and hosted several optical fibers. By placing reels of fiber with different lengths in one side of the cable, and splicing the fibers together in the other end of the cable accordingly, we could define an optical path where the same pulse travelled the event area, and thus the same physical location, several times in the same trip. In this scenario, depicted in Figure 1, we used reels of 5, 10 and 20 km, so we have the same event replicated in distances ≈ 0, 5, 15 and 35 km at the same time.

Based on the most common threats for TPI in underground infrastructures, four different events, listed in Table 1, were performed in the sensing area: Hydraulic— this consisted of a hydraulic hammer hitting the ground above the cable. Digging—this was performed with an excavator scraping in the ground and located at three different surface distances perpendicular to the fiber cable. Compactor—this was carried out by a plate compactor compacting the ground, and again at three different distances perpendicular to the fiber. Moving along—this event was recorded when the excavator was being positioned at the different locations and was passing over the cable in the surface. Although this last event is not considered an actual threat, it usually precedes others and its detection plays a key role in early warning systems. In addition, 10 min of event-free data were captured in each set to train the network. Having data captured in this controlled environment has two main advantages. The first one is that we can be sure that the data used to train Unsupervised Anomaly Detection systems are indeed free of anomalies. The second one is that the abnormal data are labeled and the time and position at which they occurred is known, so evaluation is straightforward.

4. Methodology

This section explains the proposed method to detect events in Φ-OTDR signals. This method uses deep-learning techniques applied to anomaly detection.

4.1. Deep Learning and Unsupervised Anomaly Detection

As already explained in Section 2, in the field of anomaly detection, methods based on deep-learning techniques have been proposed in recent years. In this particular case, the term “unsupervised” refers to the fact that no labels are used to train the neural networks. In particular, autoencoders [39] have been proposed for this task [40].

An autoencoder is a neural network that has two main parts—an encoder and a decoder—and whose objective is to make the input to the encoder and the output of the decoder as similar as possible by passing through a compressed version of the data, which is called code. Equivalently, this means choosing a cost function that compares input and output of the network. In particular, when working with bidimensional data such as images, it is common to choose the mean square error, i.e., given an image

X = (x_{i j}) \in R^{N \times M}

and an autoencoder

A_{θ} : R^{N \times M} \to R^{N \times M}

whose parameters are

θ

, the loss function is given by:

L (X, A_{θ}) = \sum_{i = 1}^{N} \sum_{j = 1}^{M} {(x_{i j} - A_{θ} {(x)}_{i j})}^{2}

(4)

These structures have been proposed as a method for Unsupervised Anomaly Detection [41]. For this purpose, the network is trained only with normal data in order to be able to accurately reconstruct this type of data. Thus, in the prediction stage, when the network tries to process another type of data (anomalies), the performance will be clearly worse.

As an illustrative example, an autoencoder was designed and trained to encode and decode images with a handwritten number two from the MNIST dataset [42]. Subsequently, in the prediction stage, two images serve as inputs. The first one contains a handwritten number two (Figure 2a) and, as can be seen in Figure 2b, the autoencoder reconstructs it precisely. The second image contains a handwritten number eight (Figure 2c), which the autoencoder is not able to reconstruct it properly, as can be seen in Figure 2d. Thus, we could easily detect that the eight is an anomaly by comparing the input and the output of the network.

4.2. Φ-OTDR Signal to Patches

Signals from Φ-OTDR contain time and distance information. Therefore, as a result of this, a large matrix

S = (s_{i j}) \in R^{L_{t} \times L_{d}}

is obtained, where

s_{i j}

contains information about instant i at position j, and

L_{t}

and

L_{d}

are the number of temporal and spatial samples, respectively.

The first step is to normalize the matrix so that the network training converges. To do this,

S^{N}

is defined as:

S^{N} = (s_{i j}^{N}) \in R^{L_{t} \times L_{d}}, s_{i j}^{N} = \frac{s_{i j} - μ_{j}}{σ_{j}}

(5)

where

μ_{j}

and

σ_{j}

are the mean and standard deviation at distance

j \in {1, 2, \dots, L_{t}

}, respectively. These values are taken from training data and they are used both for training and the prediction stage.

The input to the network is not the large matrix

S^{N}

; smaller overlapping patches are included. Figure 3 clarifies the way in which information is passed as input to the network. Thus, four parameters are defined:

N_{t}

and

N_{d}

, which are the patch length in the temporal and spatial dimensions, respectively, and

M_{t}

and

M_{d}

, which are the patch offset in the temporal and spatial dimensions, respectively. It follows that:

\begin{matrix} S^{P} = (s_{i_{t}, i_{d}, j_{t}, j_{d}}^{P}) \in R^{W_{t} \times W_{d} \times N_{t} \times N_{d}}, s_{i_{t}, i_{d}, j_{t}, j_{d}}^{P} = s_{M_{t} (i_{t} - 1) + j_{t}, M_{d} (i_{d} - 1) + j_{d}}^{N} \end{matrix}

(6)

where

W_{t} = ⌈\frac{L_{t} - N_{t}}{M_{t}} + 1⌉

and

W_{d} = ⌈\frac{L_{d} - N_{d}}{M_{d}} + 1⌉

.

Subsequently,

S^{P}

is flattened into a 3D array

S^{F}

, so that each

s_{i}^{F}

makes up the input to the network.

\begin{matrix} S^{F} = (s_{i, j_{t}, j_{d}}^{F}) \in R^{W_{t} \cdot W_{d} \times N_{t} \times N_{d}}, s_{(i_{t} - 1) W_{d} + i_{d}, j_{t}, j_{d}}^{F} = s_{i_{t}, i_{d}, j_{t}, j_{d}}^{P} \end{matrix}

(7)

4.3. Autoencoder Scheme Proposed

As explained in Section 4.1, autoencoders were used for Unsupervised Anomaly Detection. For this working scenario, the inputs to the network are the patches of size

N_{t} \times N_{d}

obtained according to the process described previously. These inputs are processed through an autoencoder with convolutional layers to take full advantage of the spatio-temporal correlation within each patch. Only anomaly-free data are used to train the network; mean square error was chosen as the cost function and Adam with a learning rate equal to 0.0001 was used as an optimizer.

Figure 4 shows the encoder and decoder structures. The output of the encoder is used as input for the decoder. Furthermore, as can be seen, in addition to pooling and upsampling layers, we use residual blocks. The structure of these blocks is shown in Figure 5 and their purpose within the architecture is explained below.

The residual blocks scheme can be seen in Figure 5; they are based on residual networks presented in [43]. In them, the inputs to the convolutional layers are connected (as a sum generally) with their outputs. It is argued that in this way, the backpropagation algorithm does not suffer from the vanishing-gradient problem [44] and, therefore, the depth of the networks can be increased. Most current architectures introduce residual paths in some way between the inputs and outputs of their layers, improving performance on almost any task. In particular, the architecture used for this work is shown in Figure 5.

4.4. Reconstruction—Calculation of Anomaly Scores

Once the network has been trained, the data from which we want to obtain information concerning whether they are anomalous are processed through the network in the prediction stage. When the network output is available, the instantaneous square error between each pixel of input and output is defined as the anomaly score. This means that no resolution is lost and it is possible to have information about whether each element of the input is anomalous. Therefore, if we assume that

A_{θ}

is the autoencoder whose parameters are

θ

and

s_{i}^{F} \in R^{N_{t} \times N_{d}}

is the input to the network, the output of the network is

A_{θ} (s_{i}^{F}) \in R^{N_{t} \times N_{d}}

.

We can then define the anomaly marker for each patch

s_{i}^{F}

as the instantaneous square error between it and its output from the network, i.e:

\begin{matrix} E^{F} = (e_{i, j_{t}, j_{d}}^{F}) \in R^{W_{t} \cdot W_{d} \times N_{t} \times N_{d}}, \\ e_{i, j_{t}, j_{d}}^{F} = | s_{i, j_{t}, j_{d}}^{F} - A_{θ} {(s_{i}^{F})}_{j_{t}, j_{d}} | \end{matrix}

(8)

Carrying out the inverse procedure to the one explained in Section 4.2, we obtain the following:

\begin{matrix} E^{P} = (e_{i_{t}, i_{d}, j_{t}, j_{d}}^{P}) \in R^{W_{t} \times W_{d} \times N_{t} \times N_{d}}, \\ e_{i_{t}, i_{d}, j_{t}, j_{d}}^{P} = e_{(i_{t} - 1) W_{d} + i_{d}, j_{t}, j_{d}}^{F} \end{matrix}

(9)

\begin{matrix} E^{N} = (e_{i j}^{N}) \in R^{L_{t} \times L_{d}}, \\ e_{M_{t} (i_{t} - 1) + j_{t}, M_{d} (i_{d} - 1) + j_{d}}^{N} = e_{i_{t}, i_{d}, j_{t}, j_{d}}^{F} \end{matrix}

(10)

\begin{matrix} E = (e_{i j}) \in R^{L_{t} \times L_{d}}, \\ e_{i j} = σ_{j} \cdot e_{i j}^{N} \end{matrix}

(11)

Therefore, the anomaly score E is obtained, which is a matrix of the same size as the signal coming from the DAS S and which provides information on whether or not an event has occurred at each of the points of the signal. Thus, when

e_{i j}

is greater than a threshold, an event is considered to have occurred at instant i and distance j.

5. Results

This section presents the results obtained for the database we worked with. As explained in Section 2, the resolution of this problem can be seen from two perspectives. The first one consists of treating the task as an event-detection problem, while the second one consists of treating it as a denoising problem. Therefore, when it comes to showing and analyzing the results, a metric reflecting the performance for each of the two perspectives will be presented. For the first one, the area under the ROC curve (AUC) [45] will be used. AUC can be interpreted as the probability that a binary classifier system correctly identifies two samples from different classes. For this particular case, it indicates the probability that, given normal and anomalous data, the system is able to tell which is which. For the second one, the improvement in terms of the SNR (ΔSNR) will be used. ΔSNR is the difference between SNR_E and SNR_S, which are the SNR for the prediction error and the SNR for the signal coming from the sensor, respectively. To calculate SNR_E and SNR_S, the average power of the event zones is divided by the average amplitude in the non-event zones E and S, respectively. Due to the fact that we have different types of events to measure performance, each of these scores is going to be specified for each type of event. In addition, since the signal power decreases with distance from the DAS, the detection capability and SNR can be expected to degrade with distance.

To carry out these two types of analyses, some design parameters have been chosen empirically. In concrete terms, it has been chosen that

N_{t} = 256

,

M_{t} = 256

,

N_{d} = 8

,

M_{d} = 6

. The AUC and ΔSNR for different distance sections are presented in Table 2 and Table 3, respectively.

Two conclusions can be drawn from Table 2. The first one is that some stimuli are more easily detectable than others. For example, Hydraulic or Digging are more easily detectable than Compactor or Moving along. It is also easier to detect events that have occurred closer to the DAS than those that have occurred further away. In particular, a considerable decrease is observed in the 20 to 35 km range compared to the others. Table 3 shows conclusions similar to the two previous ones, but referring to the SNR improvement. In particular, here it is especially remarkable that a much lower SNR increase is achieved in the 20 to 35 km section compared to the others. The fact that this improvement in SNR is obtained is due to the fact that the system reconstructs the event-free zones (noise) much more accurately than the event zones (signal), since it has been trained to reconstruct the event-free zones. Therefore, the signal prediction error is much higher in the event zones than in the event-free zones.

If we analyze each of the events, we can draw different conclusions that are of interest. The first is that the Hydraulic event is almost always detectable even at the highest distances. On the other hand, in the Moving along event there is a clear degradation of the detection capability for distances greater than 20 km. For the events produced by Digging, quite similar performances are obtained for those produced at 0, 5 and 10 m from the fiber. Finally, for the events produced by Compactor, there is a significant degradation of the system for the 10-meter distance compared to the 0- and 5-meter distances.

In addition, Figure 6 and Figure 7 show a comparison between the input and output of the proposed anomaly-detection system for the Hydraulic event for distances of 5 and 35 km. It can be seen in these figures that the event is more easily detectable at 5 km than at 35 km from the DAS. In these figures it can be seen that the events (yellow zone) are easily detectable after applying our processing, while before the processing the signal in the event zone is even weaker than in the non-event zone. Furthermore, it can be seen that the magnitude of the prediction error is larger in the 5 km event than in the 35 km event, which is due to the direct dependence between attenuation and distance. These figures complement Table 2 and Table 3. First, it can be perfectly perceived how the detection capacity of the events is very high, since the event and event-free zones are distinguished with a great contrast. On the other hand, it can be seen how the power level is practically similar in the event and noise zones in the original signal, so that the SNR is around 0 dB, while in the denoised signal the power difference is much higher.

Finally, Figure 8a shows the histogram of the normal data and the anomalous data corresponding to each trench before the proposed processing. It can be easily observed that only some of the events close to the sensor can be detected and that all those occurring at distances greater than 5 km are completely indistinguishable with respect to the normal data. Therefore, without signal processing, the intensity trace has no use in detecting events at distances beyond 5 km. On the other hand, Figure 8b shows the histogram of the data after processing. Here, it can be seen that only some data for events occurring at 35 km from the sensor are confused with the normal situation and that the overlap between events at distances less than 35 km and the normal situation is null. Comparing these two histograms, it is possible to understand the importance of introducing the proposed processing, since it allows to considerably increase the range of distances with which the sensor is able to work.

6. Conclusions

This paper has presented a method to detect events in signals obtained from direct-detection DAS Φ-OTDR sensors. The method is based on considering the potential events as anomalies and using anomaly-detection techniques to detect them. For this purpose, unsupervised deep-learning systems using only event-free signals were used. The main advantage of this approach compared to other techniques is that no human-provided labels are needed. This loosening of the requirements for the training set widens the usability of this technique for event detection in real environments without previous knowledge of the potential events.

To train the neural networks, event-free data corresponding to 10 min and 35.2 km of fiber were used. On the other hand, to test the performance of the proposed method, a small database with different intentionally triggered and controlled events was also used. This method provides an effective noise reduction from the signals coming from the Φ-OTDR that enables precise detection of the aggression events up to 35 km. All the results obtained in this work were analyzed in terms of the AUC and the SNR improvement that applying the proposed method provides.

Author Contributions

Conceptualization, A.A., A.O., L.V. and P.S.; methodology, A.A., P.S., A.O. and L.V.; software, A.A., A.O. and L.V.; validation, A.A. and A.O.; formal analysis, A.A., P.S., A.O. and L.V.; investigation, A.A., P.S., A.O., L.V. and J.P.-G.; data curation, P.S., J.P.-G., L.V., A.O. and A.A.; writing—original draft preparation, A.A. and P.S.; writing—review and editing, A.A., P.S., A.O. and L.V.; funding acquisition, L.V., P.S. and A.O. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by MCIN/AEI/10.13039/501100011033 Grant RTC2019-007207-4. Government of Aragón (Reference Group T36_20R and T20_20R). Antonio Almudévar is supported by Government of Aragón predoctoral grant (2021–2025) and Javier Preciado by the Spanish the Ministry of Science and Innovation (MICINN) under Grant DI-17-09169.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data used for the development of this system have been obtained within the framework of the project. At the moment, they are not available online. For more information on the data, please contact the authors.

Conflicts of Interest

The authors declare no conflict of interest.

References

Agrawal, G. Nonlinear Fiber Optics; Academic Press: Cambridge, MA, USA, 2013. [Google Scholar]
Taylor, H.F.; Lee, C.E. Apparatus and Method for Fiber Optic Intrusion Sensing. U.S. Patent US5194847A, 16 March 1993. [Google Scholar]
Juarez, J.C.; Taylor, H.F. Field test of a distributed fiber-optic intrusion sensor system for long perimeters. Appl. Opt. 2007, 46, 1968. [Google Scholar] [CrossRef] [PubMed]
Tanimola, F.; Hill, D. Distributed fibre optic sensors for pipeline protection. J. Nat. Gas Sci. Eng. 2009, 1, 134–143. [Google Scholar]
Shi, Y.; Wang, Y.; Zhao, L.; Fan, Z. An Event Recognition Method for Φ-OTDR Sensing System Based on Deep Learning. Sensors 2019, 19, 3421. [Google Scholar] [CrossRef]
Wang, Z.; Pan, Z.; Ye, Q.; Lu, B.; Fang, Z.; Cai, H.; Qu, R. Novel distributed passive vehicle tracking technology using phase sensitive optical time domain reflectometer. Chin. Opt. Lett. 2015, 13, 100603. [Google Scholar] [CrossRef]
Li, Z.; Zhang, J.; Wang, M.; Zhong, Y.; Peng, F. Fiber distributed acoustic sensing using convolutional long short-term memory network: A field test on high-speed railway intrusion detection. Opt. Express 2020, 28, 2925–2938. [Google Scholar] [PubMed]
Tejedor, J.; Macias-Guarasa, J.; Martins, H.F.; Martin-Lopez, S.; Gonzalez-Herraez, M. A Contextual GMM-HMM Smart Fiber Optic Surveillance System for Pipeline Integrity Threat Detection. J. Light. Technol. 2019, 37, 4514–4522. [Google Scholar] [CrossRef]
Park, J.; Taylor, H.F. Fiber Optic Intrusion Sensor using Coherent Optical Time Domain Reflectometer. Jpn. J. Appl. Phys. 2003, 42, 3481. [Google Scholar] [CrossRef]
Martins, H.F.; Martin-Lopez, S.; Corredera, P.; Filograno, M.L.; Frazao, O.; Gonzalez-Herraez, M. Coherent Noise Reduction in High Visibility Phase-Sensitive Optical Time Domain Reflectometer for Distributed Sensing of Ultrasonic Waves. J. Light. Technol. 2013, 31, 3631–3637. [Google Scholar] [CrossRef]
Wang, Z.; Zhang, L.; Wang, S.; Xue, N.; Peng, F.; Fan, M.; Sun, W.; Qian, X.; Rao, J.; Rao, Y. Coherent Φ-OTDR based on I/Q demodulation and homodyne detection. Opt. Express 2016, 24, 853–858. [Google Scholar] [CrossRef]
Dong, Y.; Chen, X.; Liu, E.; Fu, C.; Zhang, H.; Lu, Z. Quantitative measurement of dynamic nanostrain based on a phase-sensitive optical time domain reflectometer. Appl. Opt. 2016, 55, 7810–7815. [Google Scholar] [CrossRef]
He, H.; Yan, L.; Qian, H.; Zhang, X.; Luo, B.; Pan, W. Enhanced range of the dynamic strain measurement in phase-sensitive OTDR with tunable sensitivity. Opt. Express 2020, 28, 226–237. [Google Scholar] [CrossRef] [PubMed]
Liu, H.; Pang, F.; Lv, L.; Mei, X.; Song, Y.; Chen, J.; Wang, T. True Phase Measurement of Distributed Vibration Sensors Based on Heterodyne φ-OTDR. IEEE Photonics J. 2018, 10, 7101309. [Google Scholar] [CrossRef]
Sha, Z.; Feng, H.; Zeng, Z. Phase demodulation method in phase-sensitive OTDR without coherent detection. Opt. Express 2017, 25, 4831. [Google Scholar] [CrossRef] [PubMed]
Chen, M.; Masoudi, A.; Brambilla, G. Performance analysis of distributed optical fiber acoustic sensors based on φ-OTDR. Opt. Express 2019, 27, 9684–9695. [Google Scholar] [CrossRef] [PubMed]
Shi, Y.; Chen, G.; Wang, Y.; Zhao, L.; Fan, Z. Dynamic Phase Demodulation Algorithm for Phase-Sensitive OTDR with Direct Detection. IEEE Access 2020, 8, 77511–77517. [Google Scholar] [CrossRef]
Shao, L.Y.; Liu, S.; Bandyopadhyay, S.; Yu, F.; Xu, W.; Wang, C.; Li, H.; Vai, M.I.; Du, L.; Zhang, J. Data-Driven Distributed Optical Vibration Sensors: A Review. IEEE Sens. J. 2020, 20, 6224–6239. [Google Scholar] [CrossRef]
Wojcik, A.K. Signal Statistics of Phase Dependent Optical Time Domain Reflectometry. Ph.D. Thesis, Texas A&M University System, College Station, TX, USA, 2006. [Google Scholar]
Yang, G.; Fan, X.; Liu, Q.; He, Z. Frequency Response Enhancement of Direct-Detection Phase-Sensitive OTDR by Using Frequency Division Multiplexing. J. Light. Technol. 2018, 36, 1197–1203. [Google Scholar] [CrossRef]
Pan, Z.; Liang, K.; Ye, Q.; Cai, H.; Qu, R.; Fang, Z. Phase-sensitive OTDR system based on digital coherent detection. In Proceedings of the Communications and Photonics Conference and Exhibition, Shanghai, China, 13–16 November 2011; pp. 1–6. [Google Scholar] [CrossRef]
Xue, N.; Fu, Y.; Lu, C.; Xiong, J.; Yang, L.; Wang, Z. Characterization and Compensation of Phase Offset in Φ-OTDR with Heterodyne Detection. J. Light. Technol. 2018, 36, 5481–5487. [Google Scholar] [CrossRef]
Lu, X.; Krebber, K. Direct detection based φOTDR using the Kramers-Kronig receiver. Opt. Express 2020, 28, 37058–37068. [Google Scholar] [CrossRef]
Muanenda, Y.; Faralli, S.; Oton, C.J.; Pasquale, F.D. Dynamic phase extraction in a modulated double-pulse Φ-OTDR sensor using a stable homodyne demodulation in direct detection. Opt. Express 2018, 26, 687–701. [Google Scholar] [CrossRef]
Qi, X.; Ji, J.; Han, X.; Yuan, Z. An Approach of passive vehicle type recognition by acoustic signal based on SVM. In Proceedings of the 2009 Third International Conference on Genetic and Evolutionary Computing, Guilin, China, 14–17 October 2009; IEEE: Piscataway, NJ, USA, 2009; pp. 545–548. [Google Scholar]
Jiang, F.; Li, H.; Zhang, Z.; Zhang, X. An event recognition method for fiber distributed acoustic sensing systems based on the combination of MFCC and CNN. In Proceedings of the 2017 International Conference on Optical Instruments and Technology: Advanced Optical Sensors and Applications, Beijing, China, 28–30 October 2017; International Society for Optics and Photonics: Bellingham, WA, USA, 2018; Volume 10618, p. 1061804. [Google Scholar]
Che, Q.; Wen, H.; Li, X.; Peng, Z.; Chen, K.P. Partial discharge recognition based on optical fiber distributed acoustic sensing and a convolutional neural network. IEEE Access 2019, 7, 101758–101764. [Google Scholar] [CrossRef]
Logan, B. Mel frequency cepstral coefficients for music modeling. In Proceedings of the In International Symposium on Music Information Retrieval, Plymouth, MA, USA, 23–25 October 2000; Citeseer: Princeton, NJ, USA, 2000. [Google Scholar]
Shiloh, L.; Eyal, A.; Giryes, R. Efficient processing of distributed acoustic sensing data using a deep learning approach. J. Light. Technol. 2019, 37, 4755–4762. [Google Scholar] [CrossRef]
Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv 2014, arXiv:1409.1556. [Google Scholar]
Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative adversarial nets. Adv. Neural Inf. Process. Syst. 2014, 27, 1384. [Google Scholar]
Redmon, J.; Farhadi, A. Yolov3: An incremental improvement. arXiv 2018, arXiv:1804.02767. [Google Scholar]
Stork, A.L.; Baird, A.F.; Horne, S.A.; Naldrett, G.; Lapins, S.; Kendall, J.M.; Wookey, J.; Verdon, J.P.; Clarke, A.; Williams, A. Application of machine learning to microseismic event detection in distributed acoustic sensing data. Geophysics 2020, 85, KS149–KS160. [Google Scholar] [CrossRef]
Feng, Q.; Li, Y. Denoising Deep Learning Network Based on Singular Spectrum Analysis–DAS Seismic Data Denoising with Multichannel SVDDCNN. IEEE Trans. Geosci. Remote Sens. 2021, 60, 5902911. [Google Scholar] [CrossRef]
Zhao, Y.; Li, Y.; Wu, N. Coupled Noise Reduction in Distributed Acoustic Sensing Seismic Data Based on Convolutional Neural Network. IEEE Geosci. Remote Sens. Lett. 2022, 19, 8025605. [Google Scholar] [CrossRef]
Vincent, P.; Larochelle, H.; Bengio, Y.; Manzagol, P.A. Extracting and composing robust features with denoising autoencoders. In Proceedings of the 25th International Conference on Machine Learning, Helsinki, Finland, 5–9 July 2008; pp. 1096–1103. [Google Scholar]
Dong, X.; Li, Y. Denoising the optical fiber seismic data by using convolutional adversarial network based on loss balance. IEEE Trans. Geosci. Remote Sens. 2020, 59, 10544–10554. [Google Scholar] [CrossRef]
Preciado, J.; Sevillano, P.; Heras, C.; Subías, J.; Sanahuja, D.; Carretero, E.; Martínez, J.J. Range and Bandwidth Extension in Direct-Detection C-OTDR Using Optical Repeaters. IEEE Sens. J. 2021, 21, 27526–27531. [Google Scholar] [CrossRef]
Bank, D.; Koenigstein, N.; Giryes, R. Autoencoders. arXiv 2021, arXiv:2003.05991. [Google Scholar]
Zhou, C.; Paffenroth, R.C. Anomaly detection with robust deep autoencoders. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, 13–17 August 2017; pp. 665–674. [Google Scholar]
Sakurada, M.; Yairi, T. Anomaly detection using autoencoders with nonlinear dimensionality reduction. In Proceedings of the MLSDA 2014 2nd Workshop on Machine Learning for Sensory Data Analysis, Gold Coast, Australia, 2 December 2014; pp. 4–11. [Google Scholar]
Deng, L. The MNIST database of handwritten digit images for machine learning research. IEEE Signal Process. Mag. 2012, 29, 141–142. [Google Scholar] [CrossRef]
He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
Hochreiter, S. The vanishing gradient problem during learning recurrent neural nets and problem solutions. Int. J. Uncertain. Fuzziness Knowl.-Based Syst. 1998, 6, 107–116. [Google Scholar] [CrossRef]
Bradley, A.P. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognit. 1997, 30, 1145–1159. [Google Scholar] [CrossRef] [Green Version]

Figure 1. Setup.

Figure 2. Comparison of reconstruction of a handwritten number two and a handwritten number eight with an autoencoder trained to reconstruct images with a handwritten number two.

Figure 3. The signal coming from the Φ-OTDR does not serve as input directly to the network, but fixed size patches are taken in.

Figure 4. Autoencoder architecture used to obtain anomaly scores.

Figure 5. Residual blocks used to cause the autoencoder to conform.

Figure 6. Comparison of the original and denoised signals around 5 km ditch for Hydraulic.

Figure 7. Comparison of the original and denoised signals around 35 km ditch for Hydraulic.

Figure 8. Comparison of histograms before and after processing the data through our system for Hydraulic.

Table 1. Events performed that form the test dataset.

Event ID	Description of the Event
Hydraulic	Hydraulic hammer working
Moving along	Heavy machinery moving along the ground
Digging 0 m	Excavator digging right on top of fiber
Digging 5 m	Excavator digging from 5 m of fiber
Digging 10 m	Excavator digging from 10 m of fiber
Compactor 0 m	Removed soil is added back and compacted right on top of the fiber
Compactor 5 m	Removed soil is added back and compacted 5 m from the fiber
Compactor 10 m	Removed soil is added back and compacted 10 m from the fiber

Table 2. AUC (%) for different events and distances.

Event ID	0–2.5 km	2.5–10 km	10–20 km	20–35.2 km
Hydraulic	99.994	99.997	99.997	99.954
Moving along	99.343	99.523	99.677	91.167
Digging 0 m	99.997	99.988	99.934	95.301
Digging 5 m	99.924	99.959	99.981	95.606
Digging 10 m	99.932	99.947	99.971	98.631
Compactor 0 m	97.587	98.820	98.474	74.853
Compactor 5 m	97.127	97.968	98.556	78.776
Compactor 10 m	94.849	98.191	95.505	52.326

Table 3. ΔSNR (dB) for different events and distances.

Event ID	0–2.5 km	2.5–10 km	10–20 km	20–35.2 km
Hydraulic	13.80	15.00	13.40	9.77
Moving along	11.58	10.49	10.02	5.63
Digging 0 m	13.10	12.78	11.84	4.53
Digging 5 m	13.63	12.51	12.27	3.72
Digging 10 m	14.58	13.22	12.00	3.97
Compactor 0 m	7.64	7.42	6.09	1.36
Compactor 5 m	11.03	9.21	9.05	3.75
Compactor 10 m	5.92	4.50	4.21	0.489

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Almudévar, A.; Sevillano, P.; Vicente, L.; Preciado-Garbayo, J.; Ortega, A. Unsupervised Anomaly Detection Applied to Φ-OTDR. Sensors 2022, 22, 6515. https://doi.org/10.3390/s22176515

AMA Style

Almudévar A, Sevillano P, Vicente L, Preciado-Garbayo J, Ortega A. Unsupervised Anomaly Detection Applied to Φ-OTDR. Sensors. 2022; 22(17):6515. https://doi.org/10.3390/s22176515

Chicago/Turabian Style

Almudévar, Antonio, Pascual Sevillano, Luis Vicente, Javier Preciado-Garbayo, and Alfonso Ortega. 2022. "Unsupervised Anomaly Detection Applied to Φ-OTDR" Sensors 22, no. 17: 6515. https://doi.org/10.3390/s22176515

APA Style

Almudévar, A., Sevillano, P., Vicente, L., Preciado-Garbayo, J., & Ortega, A. (2022). Unsupervised Anomaly Detection Applied to Φ-OTDR. Sensors, 22(17), 6515. https://doi.org/10.3390/s22176515

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Unsupervised Anomaly Detection Applied to Φ-OTDR

Abstract

1. Introduction

2. Unsupervised Anomaly Detection in Φ-OTDR

2.1. Signal in Direct-Detection Φ-OTDR

2.2. Event Detection for Φ-OTDR

3. Experimental Setup and Field Experiment

4. Methodology

4.1. Deep Learning and Unsupervised Anomaly Detection

4.2. Φ-OTDR Signal to Patches

4.3. Autoencoder Scheme Proposed

4.4. Reconstruction—Calculation of Anomaly Scores

5. Results

6. Conclusions

Author Contributions

Funding

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI