1. Introduction
Acoustic emission testing is a non-destructive testing method used to detect several kinds of faults in structures such as piping, bridges and aerospace structures. Its main advantages over other non-destructive testing methods are the high sensibility, the wide coverage area and the possibility of monitoring structures in real time [
1,
2,
3,
4,
5,
6]. In the acoustic emission framework, sensors are spread over the surface to be monitored, and elastic waves emitted by passive sources (such as cracks or rivets under stress) propagate through the medium and are detected by these sensors. The segment of signal acquired by a sensor that is detected as a wave is called a
hit. In order to reduce memory requirements, the sampled signals are usually not saved, only the estimated times of arrival (TOAs) are recorded and later used to estimate the source position.
Many TOA estimation algorithms are presented in the literature [
7,
8,
9,
10], but the fixed threshold method [
11,
12,
13] is a very popular technique due to its simplicity and low complexity. Defining
as the signal sampled with period
T by the
i-th sensor at instant
, this method estimates the TOA
by comparing the absolute value of the signal to a fixed threshold
K:
Thus, the estimated TOA is the first instant the absolute value of the signal crosses the threshold. The value of K should be high enough to be larger than the noise level most of the time, but not too high so that hits will be missed.
There are many approaches in the literature to estimate the source position given the TOAs [
14,
15,
16,
17,
18,
19,
20], usually based on the minimization of different choices of cost function that may receive as input the measured TOAs or the time-differences of arrival (TDOAs, defined as the difference of TOAs measured by different sensors). One of the main contributions of this paper is to compare different cost functions and show that they do not usually lead to optimal estimates of source localization in general, except in the particular situation of small noise variance. We concentrate here in the isotropic case, i.e., we assume that the wave velocity is known and independent of the direction, but there are algorithms that are applicable to the anisotropic and unknown velocity case [
14,
21,
22].
Passive source localization algorithms based on TOAs usually assume a Gaussian distribution for the uncertainties in the TOA estimates and a zero-mean Gaussian distribution for TDOA estimates [
20,
21,
22,
23,
24,
25,
26]. We prove theoretically that this assumption holds approximately for the threshold method in the case of low noise. In a more general setting, the TOA distribution depends on the method used to estimate it, and we prove in
Section 3 that for larger noise variances the fixed threshold method produces TOAs that can be modeled as a mixture of Gaussians (see also [
27]). Furthermore, TDOAs estimated by the fixed threshold method may present a large bias since signals reach sensors with different amplitudes due to attenuation (see
Section 4 and [
27]).
The fixed threshold method only works if the threshold
K is sufficiently higher than the noise level so that noise has a low probability of triggering the threshold without an incoming wave. However, choosing a high value for
K increases the TOA bias, given by the difference between the expected value of the estimated and the actual time of arrival of the wave. If all sensors received hits with similar amplitudes, the TOA bias would be constant accross sensors, and there would be no TDOA bias. However, because sensors lie at different distances from the acoustic emission source, they usually receive hits with different amplitudes due to attenuation. As such, TOA bias varies accross sensors, generating a bias in TDOA estimates, which may bias the position estimate. An illustration of the fixed threshold method and the bias in TOA and TDOA estimates is presented in
Figure 1.
The contributions of this paper are:
To derive the probability distribution for TOA estimates obtained through the fixed threshold method. We show that the TOAs can be well approximated by a mixture of Gaussian distributions when (as is usually the case) the sensor has a bandpass frequency response, such that the observed signals may be well described by a model of the form
where
is a low-pass envelop function and
is the peak of the sensor frequency response. The mixture of Gaussians is a reasonable approximation if the sampling rate
satisfies
. We also show that the Gaussian distribution usually assumed is a good approximation in the low-noise regime.
To propose a TOA estimation method that generates unbiased Time Differences of Arrival (TDOAs) for structures where all frequencies propagate in the medium with the same velocity.
To derive the Cramér–Rao lower bound for the source position assuming TOAs follow a mixture of Gaussian distributions.
To derive the optimal TOA-based source position estimator on a surface (that is, the estimator whose covariance matrix is the inverse of the Fisher information matrix) assuming TOAs follow a mixture of Gaussian distributions whose parameters are known given the source position. A procedure to estimate these parameters from noisy signals is also presented. The optimal estimator is tested in simulation in several scenarios and compared with other passive source localization methods based on TOAs.
To show that the optimal estimator reduces to one of the algorithms available in the literature if the TOA estimates are unbiased and the measurement noise is sufficiently small, in which case TOAs are approximately Gaussian-distributed.
Notation: We denote the expected value operator as , the probability of an event as , the multivariate normal distribution with mean vector and covariance matrix as , the covariance matrix of a random vector as , the trace of matrix as . is the diagonal matrix such that , is the indicator function, equal to 1 if and 0 otherwise, and is the complement of the set . Vectors and matrices are written in boldface, and the derivative of the k-th element of a vector with respect to x is denoted as .
2. Source Localization Methods Based on TOAs
Hits identified at different sensors with similar TOAs are grouped into a single
event related to a possible source [
12]. Assuming the grouping is correct, the source position can be estimated by solving the system of equations:
where
is the TOA at sensor
k,
N is the number of sensors,
t is the instant the event occurred (an unknown variable) and
is the time the wave on the surface under test takes to propagate from the tentative source position
to the position
of the
k-th sensor assuming the propagation is isotropic with known velocity
c:
The system of Equations (
3) has three unknowns (
and
t) and
N equations, thus it is overdetermined for
unless there are co-linear sensors. Since due to noise the system usually has no solution, the problem is recast as an optimization problem using an appropriate cost function to obtain an approximate solution. Three popular cost functions for TOA-based source localization are (
5), (
8) and (
9) below [
12,
20,
28]. The quality of the position estimates depends on the choice of the cost function, as well as on the distribution of the uncertainty in the TOA estimates
.
The least squares solution of (
3) is obtained by minimizing the cost function
, defined as
is a generalization of the cost function
[
23] that is used in applications where the instant of emission of the wave
is known, in which case it is common to use
without loss of generality. In our application,
is unknown and it must be taken into account to localize the source. Fortunately, it is possible to obtain a cost function equivalent to (5) that only depends on
by solving for
t as a function of
. In fact, setting
to zero and isolating
t yields
.
Hence, minimizing
is equivalent to minimizing:
where
. Another approach to approximately solve (
3) is to subtract the first equation from the others to eliminate
t, yielding a modified system of
equations in two unknowns whose
k-th equation is:
The least squares solution of (
7) is obtained by minimizing the cost function
below, which only depends on the measured time differences of arrival (TDOAs)
[
12,
29]:
It is also possible obtain an approximate solution to (
3) using a constrained least squares approach, as done in [
28]. Without loss of generality, consider that sensor 1 is positioned at
. Rewriting each equation from (
7) as
, substituting (
4), expanding the squares and performing simple algebraic manipulations, we obtain
, where
is the measured range difference between sensors
k and 1.
In order to find an approximate solution for this over-determined system of equations, Ref [
28] defines the auxiliary unknown variable
, which is estimated by defining the cost function
(where the acronym ’CLS’ stands for ’constrained least squares’) as
where
and
, and solving the constrained optimization problem:
in which we denote by
the
i-th element of vector
. A low-complexity method for obtaining an approximate solution for (
10) is proposed in [
28], but since the focus of this paper is on the quality of the estimators, in our simulations we solve (
10) directly.
3. TOA Probability Distribution
In [
27], the probability mass function (pmf) for TOAs obtained by the fixed threshold algorithm was derived considering the measurement noise to be white. In this section, we extend that result, deriving the TOA pmf for non-white noise, and obtaining a simplified expression in the case where the noise is white.
3.1. TOA Pmf for Correlated Noise
Let us model the signal sampled by a sensor as the sum of a deterministic component and a zero-mean noise , that is, . To simplify the notation, the index i representing the sensor is omitted.
The fixed threshold method estimates the TOA as the first instant the signal
crosses the threshold. Thus, denoting as
the probability of the estimated TOA being the instant
n, we have:
Note that the threshold level should be adjusted so that the noise cannot trigger the threshold by itself [
11]. Assume then that the threshold is well adjusted (i.e.,
). Then,
if and only if:
Let us prove (
12): If
, then
because
. Thus,
leading to:
To prove the converse of (
12), assume that
. If
, this implies
, thus
. On the other hand, if
, we have
, also leading to
.
Our goal is to write (
11) in terms of the noise joint cumulative distribution. For this, define the intervals
and the modified noise
for
. Using these definitions, we conclude from (
12) that
if and only if
, thus
is the probability of
belonging to
. Define the vector of noise samples
. We consider that the joint noise pdf
is an even function with respect to each variable, that is,
=
for all
j (e.g., a zero-mean Gaussian distribution). In this case,
has the same distribution as
. Hence, we have
, which can be expanded into:
As the second integral in (
15) is equal to the joint cumulative distribution of
evaluated at
and the first integral is the cumulative distribution of
evaluated at
, we obtain:
where
is the joint cumulative distribution of
.
3.2. TOA Pmf for Independent and Identically Distributed Noise
If the noise is an independent and identically distributed (i.i.d.) stochastic process, a much simpler expression for (
16) can be obtained. In this case, denoting the cumulative density function (cdf) of a single noise sample as
, the joint cumulative distribution
can be written as the product of the cdfs of each noise sample, that is,
, and thus:
Note that
and
depend on the noiseless signal
, which is unknown in practical applications. In
Section 5.3,
is estimated using the noisy signal
instead of
, and (
17) is used to build a source position estimator.
3.3. Approximating the TOA Pdf as a Gaussian Mixture
Many authors consider that TOAs follow a Gaussian distribution to derive passive source localization methods [
23,
24,
25,
26], but (
17) is not only discrete, but may also be multimodal. In this section, we investigate in which cases the Gaussian distribution is a good approximation to the TOA pmf (
17). We show via an example that, for high sampling rate and small noise variance, TOAs can be well approximated by a Gaussian distribution, and for high sampling rate and high noise variance they can be approximated by a mixture of Gaussian distributions. This approximation is exploited in
Section 5.1, where the expression for the optimal TOA-based estimator for TOAs distributed according to a mixture of Gaussian distributions is derived.
Using a sampling rate of
(sampling period
), the received signal was generated following (
2) as
(a von Hann window modulated with a frequency of
, a common resonance frequency for piezoelectric sensors used in acoustic emission), where
is a Gaussian white noise with standard deviation
, and
is the threshold. We then applied the fixed threshold method to
to obtain the experimental TOAs and generate the empirical TOA pmf and compare with the theoretical expression (
17).
The obtained TOA pmf and cdf for this hit is shown in
Figure 2. The pmf
was fitted into a Gaussian Mixture Model (GMM), leading to the pdf
. The details of the fitting technique we used are described in
Appendix C. The cdf of the fitted distribution was plotted along with the TOA cdf, and in order to allow the comparison between the fitted pdf
and
, we plotted
along with
.
We conclude from
Figure 2 that the TOA distribution is well approximated by a GMM. If the noise level is high enough, the threshold may be triggered in different oscillation periods of the signal, creating secondary lobes spaced by the oscillation period of
(half the oscillation period of
), explaining the different Gaussian components of
. On the other hand, if the noise were small,
would present a single lobe, reducing to a Gaussian distribution.
Since we are approximating a discrete probability distribution by a continuous distribution, this approximation is only reasonable if the sampling rate is high enough so that each oscillation period of contains multiple samples, leading to multiple samples for each lobe of . Furthermore, each lobe of should correspond to a single oscillation period of , thus there must be at least one sample below the threshold between oscillation periods of . Hence, the sampling rate must satisfy , where is the carrier frequency. Nevertheless, if the sampling rate is small enough so that each lobe of contains only one sample, may be approximated by a mixture of Gaussian distributions with very small variances. It is worth noting that if both the sampling rate and the noise are small, TOAs may become deterministic, in which case any unbiased estimator produces the same outcome.
4. Estimating Unbiased TDOAs
In general, sensors receive hits generated by the same source with different amplitudes, since the distance of propagation is different for each sensor and waves attenuate as they propagate. Moreover, the attenuation may be frequency-dependent, so that the signal waveforms may differ at each sensor. Because of the difference of attenuation among sensors, the lag between the TOA estimated by the fixed threshold method and the actual time of arrival is not the same for all sensors (that is, even for noise variance tending to zero). As such, the fixed threshold method produces biased TDOA estimates, which will lead to biased position estimates. The fixed threshold method thus only produces unbiased estimates in scenarios with small attenuation difference between hits received by sensors.
We now derive a low-complexity method to obtain unbiased TDOAs in the noiseless case that consists of using different thresholds for each sensor, extending the fixed threshold method to a scenario where attenuation does not necessarily produce biased TDOA estimates. For this, we assume that waveforms at different sensors have the same shape with different amplitudes. In this model, the analog signals received by sensors
are attenuated and delayed versions of each other:
is a known function of the distance between the source and the
i-th sensor (denoted as
) that models the attenuation and
is an auxiliary function that controls the shape of the received signals. In
Section 6, we use
, where the term
models the energy loss and the term
is due to the wave dispersion in the 2D medium [
30]. Note that (
18) assumes the frequency response of all sensors to be similar. Furthermore, if the structure generates reflections of the source signal, then (
18) may be valid only for instants
t smaller than the instant the first reflection arrives at the sensor.
Define as the RMS value of the hit corresponding to the i-th sensor, and . We propose to obtain TOAs in real time by comparing with the modified threshold instead of using a fixed threshold. This way, the normalized threshold is used to estimate the TOAs given that a hit was detected.
We now show that if signals follow the propagation model (
18), this procedure produces unbiased TDOAs in the noiseless case. Since
, we have
, where
. Hence,
.
The estimated TOA is
, and substituting
, we obtain:
The term
in (
19) does not depend on
i, thus the TDOA between hits measured by sensors
i and
j is
, which is an unbiased TDOA measurement in the noiseless case.
Note that even though the waveform is required to compute the modified threshold
, the operations needed to calculate them have very low complexity and can be easily implemented in real time, thus waveforms do not need to be stored to apply this method. There are other TOA estimation methods that generate unbiased TDOAs, as methods based on Akaike information criterion (AIC) [
8,
31], but these methods present higher computational complexity than the fixed threshold method (which only needs comparisons) and our TOA estimation technique, whose complexity is dominated by the calculation of the energy of the signal.
This method only works in structures where waves propagate according to (
18), thus it will not be effective if hits contain wave reflections. In this case, hits must be windowed to avoid adding the energy of reflections into the energy of the hit. The proposed TOA estimation method is employed in
Section 5.1 to compare the source position estimators presented in this work. The performance of these estimator using the fixed threshold method and the proposed TOA estimator are also compared.
5. The Optimal Source Position Estimator
In this section, we derive an expression for the optimal cost function (that is, the one that yields the minimum mean-square localization error among all possible TOA-based estimators) assuming TOAs follow a Gaussian mixture model (GMM), which is a good approximation for high sampling rate as discussed in
Section 3.3. We prove that
is the optimal cost function when the noise is small and independent and identically distributed (i.i.d.) in time and across sensors. In order to determine the optimal estimator, the Fisher information matrix must be calculated and compared to the inverse of the covariance matrix of our position estimator. It is worth noting that the Fisher information matrix for passive localization problem using Gaussian TOA or TDOA measurements (that is, a Gaussian mixture with
components) was calculated in [
32].
We consider that TOAs follow a mixture of
M Gaussian distributions with weights
, covariance matrices
and means
. In practice, all these parameters depend on the source position, but in order to derive the optimal estimator we assume that
and
are independent of the source position, while
is modeled as
, where
is a TOA bias model that may depend on
,
,
and
t is the instant of emission of the wave. Thus, defining the vector of measured TOAs
, the TOA pdf is given by:
We also assume that for each
there is only one dominant term in the sum (
20), that is, there is an index
such that for each set
of possible measured TOAs, the only term in (
20) that is not approximately zero is the
m-th component, so:
This is a reasonable hypothesis in the case where the sampling rate is such that there will be at least one sample below the threshold in (
2) after each maximum, that is, the sampling rate should satisfy
, as in the example given in
Section 3.3.
5.1. Optimal Source Position Estimator
Let be a partition of such that the k-th mixture component is different than zero for and approximately zero for . Each Gaussian mixture component lies in a single set , thus can be viewed as the "support" of the k-th Gaussian component. The proposed optimal estimator defined below consists on estimating m such that and estimating the source position with the conditional maximum likelihood estimator given that (we show below how to estimate m even without knowing and t).
The conditional probability distribution of
given that
is
, thus the conditional maximum likelihood estimator given that
is obtained by maximizing
, or equivalently minimizing
.
can be rewritten in terms of only two variables by setting
and isolating
t in terms of
, yielding the equivalent cost function:
Assume we could choose
, that is, the index of the mixture component that most contributes to
, where
,
are the actual source position and instant of emission. Hence, recalling that
is the vector of measured TOAs,
where
is
computed at the actual source position
, i.e.,
. Even though
are unknown variables, in
Section 5.3 we show how to obtain
from the TOA pmf, which is estimated using the noisy signals acquired by sensors.
can be correctly chosen even if an imprecise value for
is used because
is chosen by picking the maximum of a finite set, thus a perturbation on these values only affects
if it is large enough to change the index of the maximum value.
We claim that the cost function
yields the optimal estimator when minimized if the noise level is not very high and the Hessian of
is approximately constant around the actual source position, in which case
can be approximated by its second-order Taylor polynomial around
. In order to prove this, we calculate the Fisher information matrix in
Appendix A and the covariance matrix of our estimator in
Appendix B. The covariance matrix (
A11) coincides with the inverse of the submatrix associated with the spatial components of the Fisher information matrix (
A2), which proves that the proposed estimator is indeed optimal.
It must be highlighted that we made the following hypotheses to compute the covariance matrix:
If these conditions do not hold, (
23) may not be optimal. However, we show in
Section 5.3 how to obtain an approximation to the optimal estimator when the assumptions are not true, and in
Section 6 we show that the resulting estimator usually outperforms other estimators.
5.2. The Maximum Likelihood Estimator
The Maximum likelihood estimator (MLE) is obtained by finding the values of
that maximize (
20). Given the definition of
, this can be obtained by solving:
where
. Since
does not depend on
, the MLE coincides with the arguments that minimize
for
.
Therefore, the optimal estimator may coincide with the MLE if they share the same value for m, but they are different estimators in general. However, their estimates always coincide if the noise is small enough so that the number of components in the Gaussian mixture is , as in this case both estimators always pick . Note that if and TOAs are unbiased and i.i.d, both the MLE and the optimal estimator coincide with .
5.3. The Gaussian Mixture TOA Estimator
In order to show that (
23) is optimal, we assumed that the parameters of the Gaussian mixture are known and that
m is such that
, but these parameters must be estimated in practical problems. In this section, we describe a procedure to choose
m and to estimate the mixture parameters using the noisy signals received by the sensors, leading to a suboptimal estimator we call the
Gaussian mixture TOA estimator (GMTOA). To simplify the explanation of our method, we assume that the signals are corrupted by white noise and that the noise at different sensors is independent.
We estimate the TOA pmf for each sensor using (
17), but instead of the noiseless waveform we compute
using the noisy signal, that is,
Note that evaluating this pmf does not depend on previous knowledge of the source position, which we use to extract the GMM parameters. Thus, we convert the pmf
to a pdf
and then we fit
into a Gaussian mixture model, as done in
Section 3.3. Then, the means
, variances
and weights
are extracted for each mixture component
and sensor
. Note that, since we are assuming the noise at different sensors to be independent, the TOAs in different sensors will be independent and the full mixture model will be the product of the distributions for each sensor, and have in total
components.
Now, it is necessary to choose the index
in the range
. Since the TOAs are independent, it is possible to choose the active index
for each sensor independently. Index
is chosen by finding the Gaussian component that most contributes to the fitted pdf:
Finally, the TOA bias for the
i-th sensor
must be extracted from the mean of the chosen GMM component
. Ideally,
and
would be related by
, but
and
are not known. As such,
is estimated by approximating
as the first instant the estimated probability distribution
crosses a very small fixed threshold
, that is, the instant where it assumes a non-negligible value (in our simulations, we use
). Hence,
is estimated as:
Therefore, the source position estimated by our proposed estimator is:
where
,
,
and
. The proposed algorithm is summarized in Algorithm 1.
Note that our method requires the signals
sampled by each sensor to localize the source, thus it does not use only TOAs as the cost functions
,
and
. Storing waveforms may be an issue for long acoustic emission tests that last weeks or months, as the waveforms may require an enormous storage capacity. However, it is possible to use our method without the need to store the whole waveforms because it uses
only to calculate
, which can be computed recursively: Defining
and recalling that
, we have:
Since the computational cost of computing
is small, it is possible to compute
in real time and discard the signals
instead of storing them. Storing
instead of the signals is much more efficient because
assumes a non-negligible value for only a small number of samples, thus it is possible to store only these non-negligible samples. Therefore, our method does not present the problem of storing a huge number of waveforms in long acoustic emission tests as methods that explicitly require the sampled signals as [
33,
34,
35].
Algorithm 1: GMTOA source position estimator |
|
6. Simulations
In this section, we assess the performance of our proposed source position estimator through simulations. Four sensors were positioned at the coordinates
and
in order to estimate the position of sources located inside the convex polygon whose vertices are the positions of the sensors (in this case, this polygon is a square) in several scenarios. In all cases, the cost functions were minimized using the mean position of the sensors
as initial condition, except for the MLE which was implemented by finding the global maximum of the exact Gaussian mixture pdf (
20) using
M different initial conditions, each one obtained by minimizing
for
.
6.1. Unbiased i.i.d. Gaussian TOAs, Different Source Positions
In order to test the optimality of our algorithm in ideal conditions, we generated TOAs as Gaussian random variables with no bias and standard deviation of s and s. Denoting the source position as , y was fixed at and x was swept from 0 to 1. For each position, the Mean Square Error (MSE) of the estimators was calculated using samples and used to compute the efficiency of the estimator, defined as . Note that if the estimator is optimal.
The efficiency in terms of the source position is shown in
Figure 3. For
, the efficiency of the cost function
is very close to
, while
and
show worse performance for all source positions, indicating that
is indeed the optimal estimator for unbiased i.i.d. Gaussian TOAs.
On the other hand, for
, the estimates obtained through
are very close to optimal when the source lies inside the region of interest (the convex polygon whose vertexes are the positions of the sensors). However, outside this polygon,
is no longer the optimal cost function because the approximations used to prove the estimator is optimal are not precise in this region. This happens because the Taylor approximation (
A3) does not hold when
is in a region where the Hessian of
is not approximately constant, which may happen if the source is close to a sensor.
Even though has worse accuracy than in most cases, its efficiency does not fall down abruptly for x outside the region of interest, especially in the case of high noise level, in which the performance of the other methods is close to zero outside the region between sensors.
It is worth noting that in practice the sensors are scattered around the monitored region, thus the precision of localization algorithms is more important for sources lying inside the region of interest, in which case performs better than the other methods for both noise variances.
6.2. Fixed Source Position and Different SNRs
Next, we performed simulations estimating the TOAs from noisy signals, as happens in practice. Note that:
For low SNR, the Taylor approximation (
A3) (
Appendix B) does not hold and the estimator may be biased.
The MSE of GMTOA depends on the quality of the estimated GMM parameters, being optimal only if these estimates match the actual ones.
The GMM assumption for TOAs is an approximation, so an estimator may show lower MSE than the derived CRLB when empirical TOAs are used.
We have done four simulations in similar conditions as before, except that the source position is fixed at
. In all simulations, hits were generated according to (
18) using
and
(a modulated von Hann window), with white Gaussian noise of different variances. The sampling rate is
, hence the frequency of the carrier is
, the resonance frequency for some acoustic emission sensors.
In these simulations, TOAs were calculated using the TOA estimation technique proposed in
Section 4 with a fixed threshold
. The GMTOA estimator was implemented using Algorithm 1, and the parameters obtained by this algorithm were used in the MLE.
6.2.1. TOAs Following a GMM and Known Parameters
In this simulation, the noisy hits were used to obtain auxiliary TOAs through the fixed threshold method. Then, the mean and variance of these auxiliary TOAs were computed and used to generate TOAs following a Gaussian mixture distribution with known parameters. Note that is not known because it depends on the tentative source position.
The MSE in terms of noise standard deviation for the presented methods is shown in
Figure 4a,b. The MSE of the GMTOA estimator is very close to the CRLB for all noise variances, confirming our derivations. Moreover, the maximum likelihood estimator does not coincide with the optimal one if
is high, but the MLE is optimal for low noise level.
6.2.2. Empirical TOAs and Unknown Parameters
In this simulation, the actual TOAs obtained from the fixed threshold method are used to localize the source. Hence, TOAs are not exactly distributed according to a Gaussian mixture in this case. The Cramér–Rao lower bound was calculated based on the parameters estimated by the GMTOA Estimator from noisy hits using Algorithm 1.
Figure 4c,d shows that if the noise variance is very high, the GMTOA estimator does not perform better than the others, but it does have a significantly better performance for intermediate variances where the TOA pdfs cannot be approximated as single Gaussian distributions.
On the other hand, the proposed estimator has lower MSE than the other estimators for very low variances, but it may perform worse than them if the variance is not very low, but low enough so that the TOA pdf is nearly Gaussian. This is because the MSE of , and fall abruptly when the TOA distribution becomes approximately Gaussian (or a mixture with only one component), while the MLE and the GMTOA estimator do not because they depend on the noisy parameters of the estimated Gaussian pdf. For smaller variances, the parameters from the pdf approximate well the actual ones, thus the GMTOA estimator becomes again better than and .
Note that the MSEs of the estimators fall below the Cramér–Rao lower bound if is small. This happens because we derived the CRLB for TOAs following a known Gaussian mixture distribution, but the true distribution is discrete, not exactly a mixture of Gaussians.
6.2.3. TOAs Obtained from Filtered Hits and Unknown Parameters
The main issue with the maximum likelihood and GMTOA estimators is that they depend on the noise pdf, which is not accessible in practice and must be estimated. If the noise variance is large, these parameters can be poorly estimated and cause performance loss in the localization methods, as verified in the last simulation. To test the robustness of these estimators in a scenario of non-white noise, a lowpass filter is applied to hits before calculating the threshold in this simulation. The filter used was a Butterworth filter of order 15 and digital cutoff frequency (or 1 MHz), but the TOA pmf was calculated assuming white Gaussian noise.
The results are shown in
Figure 4e,f. With the filter, the parameters of the GMM are better estimated, and the GMTOA estimator has better performance than all other methods even for high variance. The MLE also presents much lower MSE than
and
for high noise level.
Since in this simulation GMTOA assumes white noise to calculate the TOA pmf (which is used to extract the GMM parameters), it may have worse performance using filtered hits (which are corrupted by non-white noise) instead of the original hits for some values of , but in most cases using filtered hits implied in lower MSE.
6.2.4. Using Biased TDOAs
In all previous simulations, TOAs were obtained using a proposed modification of the fixed threshold method described in
Section 4, which generates unbiased TDOAs.
Figure 5 shows a simulation with the same setup as in
Section 6.2.2 and
Section 6.2.3, but the fixed threshold method was used to obtain TOAs instead of the proposed modification. As such, TDOAs are biased in this simulation.
Comparing
Figure 4c,e with
Figure 5, the proposed TOA estimation method significantly decreases the MSE of TOA-based localization methods. These figures also show that the GMTOA estimator achieves better performance than other methods in most cases for TOAs obtained from original hits and in all cases if hits are filtered before TOA extraction.
It is worth noting that if biased TOAs are used directly for estimation, the MSE may decrease for higher noise variance because bias may be reduced.
7. Conclusions
In this work, we derived the expression for the TOA pmf obtained through the threshold method, and showed that it is not Gaussian-distributed as usually assumed. We show that the TOA pmf can only be approximated as a Gaussian distribution if the sampling rate is high and the noise level is small. The distribution can be modeled by a mixture of Gaussian distributions for higher noise level. We also presented a TOA estimation method based on the fixed threshold algorithm that generates unbiased TDOAs in structures where hits are approximately delayed and attenuated versions of each other.
The optimal source position estimator was derived assuming TOAs follow a mixture of Gaussian distributions of known parameters. This estimator coincides with the maximum likelihood estimator for small noise level, in which case TOAs are Gaussian. Since the mixture parameters are not known in practical applications, we proposed a method to estimate them from noisy waveforms, resulting in the GMTOA estimator, a suboptimal method to localize the source. We verified in simulation that GMTOA yields better results than other methods for both biased and unbiased TDOAs in most cases, even if the TOA pmf is not exactly a mixture of Gaussian distributions, and its performance is enhanced if a low-pass filter is applied to hits before extracting TOAs.
Even though the proposed estimator performed better than other methods in simulation, it is necessary to further validate it with data collected from an acoustic emission test. Further work also includes extending our TOA estimation method to generate unbiased TDOAs for more complex propagation models, in particular for structures where hits contain many reflections of the wave emitted by the source.