Robust Underwater Direction-of-Arrival Tracking Based on AI-Aided Variational Bayesian Extended Kalman Filter

Hou, Xianghao; Qiao, Yueyi; Zhang, Boxuan; Yang, Yixin

doi:10.3390/rs15020420

Open AccessArticle

Robust Underwater Direction-of-Arrival Tracking Based on AI-Aided Variational Bayesian Extended Kalman Filter

by

Xianghao Hou

^1,2,

Yueyi Qiao

¹,

Boxuan Zhang

^1,2 and

Yixin Yang

^1,2,*

¹

School of Marine Science and Technology, Northwestern Polytechnical University, Xi’an 710072, China

²

Shaanxi Key Laboratory of Underwater Information Technology, Xi’an 710072, China

^*

Author to whom correspondence should be addressed.

Remote Sens. 2023, 15(2), 420; https://doi.org/10.3390/rs15020420

Submission received: 7 December 2022 / Revised: 4 January 2023 / Accepted: 5 January 2023 / Published: 10 January 2023

(This article belongs to the Special Issue Underwater Communication and Networking)

Download

Browse Figures

Versions Notes

Abstract

:

The AI-aided variational Bayesian extended Kalman filter (AI-VBEKF)-based robust direction-of-arrival (DOA) technique is proposed to make reliable estimations of the bearing angle of an uncooperative underwater target with uncertain environment noise. Considering that the large error of the guess of the initial mean square error matrix (MSEM) will lead to inaccurate DOA tracking results, an attention-based deep convolutional neural network is first proposed to make reliable estimations of the initial MSEM. Then, by utilizing the AI-VBEKF estimating scheme, the uncertain measurement noise caused by the unknown underwater environment along with the bearing angle of the target can be estimated simultaneously to provide reliable results at every DOA tracking step. The proposed technique is demonstrated and verified by both of the simulations and the real sea trial data from the South China Sea in July 2021, and both the robustness and accuracy are proven superior to the traditional DOA-estimating methods.

Keywords:

extended Kalman filter; robust tracking; underwater direction-of-arrival tracking; variational Bayesian; attention-based neural network

1. Introduction

Estimating the bearing angle of an uncooperative underwater target is vitally important in underwater acoustic engineering [1,2,3,4]. Considering that the unknown underwater environment sometimes causes uncertain measurement noise that will degrade the estimation accuracy of the interested target’s bearing angle, the robust bearing angle estimating technique is necessary and has attracted a number of researchers’ attention [5,6,7,8]. In addition, the techniques of passive bearing angle estimation can be based on two different ways, namely the DOA estimation and the DOA tracking methods. The DOA estimation merely depends on the measurements from a certain period. Besides the measurements, the DOA tracking techniques also utilize the prior motion information of an interested target. Although the DOA estimation has already made many milestones in various bearing angle estimation missions [9,10,11], its ignorance of the target’s kinematics makes the DOA estimation techniques always have superior estimating results when the target is nearly static and the environment is stable. When the target is maneuvering or the measurement noise shifts because the underwater environment is uncertain, the DOA estimation methods can hardly have satisfied results. In addition, in order to make reliable and accurate estimations of an uncooperative underwater target’s bearing angle, the DOA estimation techniques usually need a wide observation window, which will lead to heavy on-board computational load and which will occupy a lot of on-board limited storage.

Considering the advantages and drawbacks of the DOA estimation methods, DOA tracking techniques have been proposed. Instead of merely utilizing the measurement information, DOA tracking methods take both the target’s motion model and the current measurements into consideration. Based on the Bayesian estimation principle, the DOA tracking techniques can reach higher estimating accuracy and can occupy less on-board computing or storage resources, especially when the underwater target is maneuvering [12,13,14,15]. In addition, the unpredicted moving objects or environmental changes in the ocean can sometimes make the measurements uncertain, which will degrade the bearing angle estimation accuracy or even make the estimating procedure diverge. Under this circumstance, the robust DOA tracking technique is necessary to guarantee the precision of the whole estimation process. Among all the robust DOA tracking techniques, two categories can be divided depending on the basic mathematical principles, namely the data-inspiring robust tracking techniques [16] and the Bayesian inferring techniques [17,18]. These two categories of robust DOA tracking techniques are both derived based on the widely used Kalman filter technique [19,20,21]. The former one utilizes good estimations from a long tracking period to adjust the weight of the bad measurements when the measurement noise becomes uncertain. As a result, the weights of the poor measurements can become very small to make the current bearing angle estimations mainly depend on the historical good estimations. These kinds of data-inspiring robust DOA tracking techniques have proven their efficiency in a number of research studies [22,23,24,25,26] nevertheless, their convergence is still an open question. When the hyper-parameters of the inspired robust DOA tracking are set inappropriately, the estimations usually have poor accuracy and sometimes diverge.

To overcome the drawbacks of the data-inspiring robust tracking techniques, the Bayesian inferring techniques, or the so-called variational Bayesian (VB) robust tracking techniques, are proposed and have been utilized in a various of research studies [27,28,29]. Unlike the data-inspiring robust tracking methods, the VB robust tracking techniques are strictly derived mathematically. By assuming the prior probability density functions (PDFs) of the tracking parameters and the interested time-varying parameters (i.e., covariance matrix of the uncertain measurements noise) along with optimizing the Kullback–Leibler Divergence (KLD) among the posterior PDFs, the VB technique can analytically estimate the tracking parameters and other interested parameters simultaneously. Different from the data-inspiring robust tracking methods, the VB robust tracking techniques have strict mathematical foundations and can guarantee convergence and accuracy if the prior PDFs are set properly. Under this circumstance, the VB robust tracking techniques do not need good estimations from a long period to output reliable results when measurements become uncertain, and they are more theoretically complete.

In addition, the error between the guess of the initial state and its real value will influence the final DOA tracking accuracy since the DOA tracking system has high nonlinearities. If the error is large enough, the tracking results will even be diverged [30]. Under this circumstance, before operating a DOA tracking algorithm, the initial MSEM should be set properly. However, in the real DOA tracking scenario, it is almost impossible to obtain an accurate error covariance matrix since the real initial states cannot be accurately obtained. In addition, the uncertain underwater environment always affects the state estimating results and makes the initial MSEM much harder to accurately predetermine. To deal with this problem, some researchers suggested utilizing a large guess of the initial MSEM to make the tracking system have fast converging speed and to deal with the large error between the initial guesses of the states and their real values. However, since the underwater target DOA tracking system has high nonlinearities, setting a large guess of the initial MSEM sometimes makes the whole tracking procedure become unstable or even diverge. As a consequence, how to set a proper guess of the initial MSEM with respect to the error of the initial guess of the state and its real value is still an open question. Usually, the initial MSEM is set by engineering experience, which is similar to an expert system that tries to have common solutions. Although some adaptive tracking techniques have been designed to weaken the influences by the initial error [31], the unproperly set initial MSEM still affects the overall tracking performance. Under these circumstances, a robust and accurate technique for properly setting the guess of the initial MSEM is necessary. Usually, the traditional Bayesian estimating techniques rely heavily on the mathematical model of the systems or parameters to be estimated, but the guess of the initial MSEM does not have a determined and accurate mathematical model with respect to the system state, which makes the traditional methods fail. However, the value of the guess of the initial MSEM has a relationship with the theoretical measurements calculated by the state by the measurement model, initial guess of the state, and its real value. Under this circumstance, the data characteristics other than the mathematical characteristics of the guess of the initial MSEM can be analyzed and utilized. As a result, although the traditional techniques have few general solutions for the problem of properly setting the guess of the initial MSEM, the latest data-driven AI techniques allow the problem discussed above to be solved.

Except for depending heavily on the mathematical derivations, the AI techniques can only rely on the statistical characteristics to infer the inner nonlinear relationship between the inputs and the outputs from a certain system. As a result, the AI techniques are more suitable for the problems that are hard to be mathematically or analytically depicted but have plenty of data resources. After the back propagated (BP) training method was proposed by Rumelhart et al. [32] and after the deep learning scheme that brought the deep artificial neural network (DANN) into light by Hinton et al. [33], the DANN has shown its powerful abilities in many fields, such as automation, parameter estimation, target tracking, etc. Among all of the DANNs, the deep convolutional neural networks (DCNNs) and the variations of the DCNNs (i.e., Resnet [34], Senet [35]) are the most notable ones, with robust and accurate performances, especially for regression and classification problems that are hard to be analytically modeled. By adopting the convolutional and pooling ideas into one DANN, the DCNN can be modeled much deeper and can easily be trained. Based on the DCNN technique, Lecun et al. [36] proposed a DCNN named Lennet-5 to deal with the digital image classification task. Ref. [37] considered the training efficiency and the gradient vanishing problem caused by the DCNNs and proposed the shortcut concept to make the DCNNs have a much deeper structure so that the DCNNs can deal with more complicated classification and regression issues. Ref. [38] considered that the different features may have different weights during the training process of the designed DCNNs and proposed an attention-based scheme to make the designed DCNNs more robust and accurate. Besides the principle developments of the DCNNs, the applications of them have also attracted a number of researchers from various fields, especially from the field of underwater target localization, detection, classification, and tracking [37,38,39,40,41,42,43]. Niu et al. [37,38,39,40] have published a series of research studies to make the underwater target localization task more accurate and robust to the uncertain underwater environment. Wang et al. [41,42] utilized the DCNNs technique in estimating the unknown parameters in the underwater acoustical channel. Based on the DCNNs technique, Ref. [40] developed intelligent target classification algorithms that can obtain higher accuracy and classification speed than the traditional methods. Ref. [43] proposed a DCNN-based seabed parameters inversion method. However, to the best of the authors’ knowledge, very few research studies have considered using the advanced DCNN technique for the underwater target DOA tracking problem. As a result, the potential and strength of the DCNNs in the underwater target DOA tracking scenario, especially in dealing with the proper setting of the guess of the initial MSEM, have not been proven yet.

Based on the above analysis, an AI-aided variational Bayesian extended Kalman filter (AI-VBEKF)-based robust direction-of-arrival (DOA) technique is proposed to make reliable estimations of the bearing angle of an uncooperative underwater target with uncertain environment noise. The main contributions of this study are summarized as follows.

Firstly, a uniform circular array (UCA) is considered to provide measurements of the underwater target. By adopting the UCA as the measurement system, the port and starboard ambiguity problem is overcome. In addition, the uniform aperture at all bearing angles is processed.

Secondly, considering the effects caused by the unproperly set guess of the initial MSEM, an attention-based DCNN is designed to make reliable initial guesses of it and to make the later DOA tracking process steady and accurate.

Thirdly, considering that the unknown underwater disturbance can sometimes make the measurement noise uncertain, the AI-VBEKF is designed to robustly estimate the bearing angle of the underwater target with a shifting covariance matrix of the uncertain measurement noise.

Finally, based on the sea trail data from the South China Sea in July 2021, the proposed AI-VBEKF is verified. The robust and accurate estimation results proved the superior characteristics of the proposed DOA tracking method.

The remainder of this paper is organized as follows: Section 2 shows the kinematic model and the measurement model of the DOA tracking problem. In Section 3, the VB-EKF using a UCA is derived first. In addition, the attention-based DCNN is proposed to make a good estimation of the guess of the initial MSEM. Then, based on the VB-KEF and proposed DCNN, the whole frame of the AI-VBKEF is proposed. In Section 4, the simulation and experiment verification results are shown. Finally, the conclusions are drawn in Section 5.

2. Problem Formulation

2.1. Kinematic Model of the DOA Tracking Process

Since the underwater target is usually performing at a constant velocity to keep invisibility and is far from the hydrophone arrays, the motion model of the target’s bearing angle can be modeled as the constant velocity (CV) model.

Let

θ_{k}

denote the bearing angle of the target (

θ_{k}

is the angle between the target and the positive x-axis with the positive counterclockwise direction) and

{\dot{θ}}_{k}

denote the change rate of

θ_{k}

. Then, the CV can be expressed as:

x_{k} = F_{k | k - 1} x_{k - 1} + G_{k - 1} w_{k - 1}

(1)

where

x_{k} = {(θ_{k}, {\dot{θ}}_{k})}^{T}

is the system state at time step k,

{(\cdot)}^{T}

denotes the matrix transposition, and

w_{k - 1}

denotes the zero mean Gaussian process noise with the covariance matrix

Q_{k - 1}

caused by the unknown underwater environment.

Q_{k - 1}

is named the process noise covariance matrix (PNCM).

F_{k | k - 1}

and

G_{k}

are the state transition matrix for CV motion and the noise driving matrix at time step k, which are given by:

F_{k | k - 1} = [\begin{matrix} 1 & T \\ 0 & 1 \end{matrix}]

(2)

G_{k - 1} = [\begin{matrix} T^{2} / 2 \\ T \end{matrix}]

(3)

where T is the tracking interval.

2.2. Measurement Model Based on the Received Signal of UCA

Considering that the bearing angle of the target varies in a wide range in long-time tracking, the UCA is used since it is free from the port and starboard ambiguity problem. The configuration of the UCA and the underwater target is shown in Figure 1.

Consider a narrowband acoustic signal is emitted by a target. The frequency, amplitude and the initial phase are f, a, and

ϕ_{0}

. Then, we express the target signal at time step k as:

s (k) = α \exp (j (2 π f k T + ϕ_{0}))

(4)

The emitted signal given by Equation (4) is received by the UCA with P elements at time step k. The P elements are uniformly placed around the circle with the radius r and the center at the origin of the cartesian coordinates. It is assumed that the range of the target satisfies the far-field condition to regard the arrived signal as a plane wave, and the sound velocity is c. Then, the time delay of the signal relative to the center of the UCA at the p-th element (

p = 1, 2, \dots, P

) is

\cos (θ_{k} - 2 π p / P) r / c

. In addition, considering the signal given by Equation (4), the shift of the signal caused by the time delay at the p-th element is expressed as:

Consider that a P-element UCA with radius of r received the emitted signal at time step k. The arrived signal is assumed to be the plane wave, and the sound velocity is denoted by c. Then, the array manifold at the p-th element is expressed as:

A_{p} (θ_{k}) = \exp (j 2 π f \frac{r}{c} \cos (θ_{k} - 2 π \frac{p}{P}))

(5)

Thus, the received signal at the p-th element is expressed as:

z_{p} (k) = real (A_{p} (θ_{k}) s (k)) + v_{k}

(6)

where

real (\cdot)

denotes taking real part of a complex number,

θ_{k}

denotes the bearing angle of the target at time step k, and

v_{k}

denotes the zero mean Gaussian measurement noise with the variance

σ_{r, k}^{2}

at time step k from the measurement system and the unknown ocean environment.

We define the received signal vector of the UCA as

z_{k} = {[z_{1} (k), z_{2} (k), \dots, z_{P} (k)]}^{T}

, the output of the UCA at time step k is:

z_{k} = real (A (θ_{k}) s (k)) + v_{k}

(7)

where

A (θ_{k})

denotes the array manifold vector, which is given by

A (θ_{k}) = {[A_{1} (θ_{k}), A_{2} (θ_{k}), \dots, A_{P} (θ_{k})]}^{T}

(8)

M snapshots of the received signal are taken in a measurement. The snapshots are taken from a small period within every tracking interval T to guarantee that the target remains stationary. Then, we can express the measurement vector at tracking step k as:

We take M snapshots of the received signal from a small period within every tracking interval T in one measurement to guarantee that the target remains stationary. Then, the measurement vector at time step k is given as follows:

Z_{k} = [\begin{array}{l} z (k) \\ z (k + τ) \\ z (k + 2 τ) \\ ⋮ \\ z (k + (M - 1) τ) \end{array}] = h_{k} (X_{k}) + V_{k} = [\begin{array}{l} real (A (θ_{k}) s (k)) \\ real (A (θ_{k}) s (k + τ)) \\ real (A (θ_{k}) s (k + 2 τ)) \\ ⋮ \\ real (A (θ_{k}) s (k + (M - 1) τ)) \end{array}] + V_{k}

(9)

where τ denotes the interval between adjacent snapshots,

M τ ≪ T

,

h_{k} (x_{k})

denotes the nonlinear measurement function, and

V_{k}

denotes the measurement noise.

The measurement noise of each element is assumed to be independent and identically distributed. Then,

V_{k}

at time step k is defined as a zero mean Gaussian noise with covariance matrix

R_{k} = I_{M P} σ_{r, k}^{2}

, where

σ_{r, k}^{2}

is the variance of the measurement noise of each element given in Equation (6), and

I_{M P}

is the MP-order identity matrix. The measurement noise covariance matrix (MNCM)

R_{k}

is usually assumed to be fixed in the underwater tracking problem. However, in the real scenario of underwater tracking, the unknown ocean environment can result in uncertain measurement noise, which makes the MNCM become fluctuant and hard to deal with by using traditional tracking methods. This problem seriously degrades the tracking performance.

3. Methods

3.1. EKF for DOA Tracking

Since the measurement model depicted by Equation (9) has high nonlinearities, the extended Kalman filter (EKF) scheme is used to derived a DOA tracking algorithm in this section.

From Equation (1), the one-step prediction of the state estimate

{\hat{x}}_{k | k - 1}

is depicted as:

{\hat{x}}_{k | k - 1} = F_{k | k - 1} {\hat{x}}_{k - 1}

(10)

where

F_{k | k - 1}

is the state transition matrix given by Equation (2), and

{\hat{x}}_{k - 1}

is the state estimate at tracking step k − 1. The one-step prediction of the MSEM

P_{k | k - 1}

is expressed as:

P_{k | k - 1} = F_{k | k - 1} P_{k - 1} F_{k | k - 1}^{T} + G_{k} Q_{k} G_{k}^{T}

(11)

where

P_{k - 1}

is the MSEM at step k − 1,

G_{k}

is the noise driving matrix given by Equation (3), and

Q_{k}

is the covariance matrix of the process noise.

Then, considering the UCA-based measurement model given in Equation (9), the Kalman filter gain

K_{k}

is expressed as:

K_{k} = P_{k | k - 1} H_{k}^{T} {(H_{k} P_{k | k - 1} H_{k}^{T} + R_{k})}^{- 1}

(12)

where

H_{k}

is the first order Taylor expansion of the nonlinear measurement model.

R_{k}

is the MNCM. According to Equation (9),

H_{k}

can be calculated as:

H_{k} = {[\begin{matrix} {h^{'}}_{1, 1} & {h^{'}}_{2, 1} & \dots & {h^{'}}_{P, 1} & {h^{'}}_{1, 2} & {h^{'}}_{2, 2} & \dots & {h^{'}}_{P, 2} & \dots & {h^{'}}_{1, M} & {h^{'}}_{2, M} & \dots & {h^{'}}_{P, M} \\ 0 & 0 & \dots & 0 & 0 & 0 & \dots & 0 & \dots & 0 & 0 & \dots & 0 \end{matrix}]}^{T}

(13)

The elements of the matrix

{h^{'}}_{p, m}

(

p = 1, 2, \dots, P

,

m = 1, 2, \dots, M

) are given by [44]:

{h^{'}}_{p, m} = - 2 π f \frac{r}{c} \sin ({\hat{θ}}_{k | k - 1} - 2 π \frac{p}{P}) real (j A_{p} ({\hat{θ}}_{k | k - 1}) \hat{s} (k + (m - 1) τ))

(14)

where

\hat{s} (k + (m - 1) τ)

denotes the estimate of the target signal, which is given by [45]:

{\hat{s}}_{k | k}^{(i)} (k) = {(A^{H} ({\hat{θ}}_{k | k}^{(i)}) A ({\hat{θ}}_{k | k}^{(i)}))}^{- 1} A^{H} ({\hat{θ}}_{k | k}^{(i)}) Hilbert (z (k))

(15)

where

{\hat{θ}}_{k | k}^{(i)}

is the predicted bearing angle, i.e., the first term of

{\hat{x}}_{k | k - 1}

,

A ({\hat{θ}}_{k | k}^{(i)})

is obtained by using Equations Equations (5) and (8)

{(\cdot)}^{H}

and

Hilbert (\cdot)

denote the conjugate transposition and the Hilbert transform, respectively.

The state estimate

{\hat{x}}_{k}

at tracking step k is expressed as:

{\hat{x}}_{k} = {\hat{x}}_{k | k - 1} + K_{k} (Z_{k} - h ({\hat{x}}_{k | k - 1}))

(16)

where

Z_{k}

is the measurement of the UCA at step k. The one-step prediction of MSEM

P_{k | k - 1}

is modified by

K_{k}

, i.e.,

P_{k} = (I - K_{k} H_{k}) P_{k | k - 1}

(17)

where

P_{k}

denotes the MSEM at tracking step k. From Equations (10)–(17), the current bearing angle estimation can be calculated by the inputs of

{\hat{x}}_{0}

and

P_{0}

.

From the above analysis, the DOA tracking process not only depends on the measurements but also utilizes the target’s prior motion information. Therefore, the DOA tracking techniques can be robust to the target’s motion; especially, the kinematic model of the target is accurate. However, from Equation (12), it is obvious that the measurement noise

R_{k}

will affect the Kalman gain, which will hugely influence the final tracking precision. In addition, from Equation (10), it is obvious that the initial values of the state and MSEM will influence the final tracking accuracy. As a result, in the scenario of accurate and robust underwater DOA tracking, both the uncertain measurement noise and proper setting of the

{\hat{x}}_{0}

and

P_{0}

need to be considered.

3.2. VB-EKF for Robust DOA Tracking

Regarding the measurement model with high nonlinearity depicted by Equation (7), the extended Kalman filter (EKF) technique is used for DOA tracking in this section. Furthermore, considering the fluctuations of the MNCM caused by the unknown underwater environment, the variational Bayesian approach is utilized to improve the tracking performance by estimating the MNCM. Thus, the VB-EKF for DOA tracking using the UCA is proposed, and the details are given as follows.

3.2.1. Choice of Prior Distribution

In the framework of the standard EKF [5], the one-step predicted probability density distribution (PDF)

p (x_{k} | z_{1 : k - 1})

and the likelihood PDF

p (z_{k} | x_{k})

are assumed to be subject to Gaussian distributions as follows:

p (x_{k} | z_{1 : k - 1}, P_{k | k - 1}) = N (x_{k}; {\hat{x}}_{k | k - 1}, P_{k | k - 1})

(18)

p (z_{k} | x_{k}, R_{k}) = N (z_{k}; h (x_{k}), R_{k})

(19)

where

N (\cdot; μ, Σ)

denotes the PDF of the Gaussian distribution with mean μ and covariance matrix Σ, and

h (x_{k})

is the nonlinear measurement function given by Equation (7).

{\hat{x}}_{k | k - 1}

and

P_{k | k - 1}

denote the one-step prediction of state and MSEM, respectively, which are given by Equations (10) and (11).

In order to infer

x_{k}

along with

R_{k}

, a conjugate prior distribution needs to be selected for the fluctuant MNCM

R_{k}

since a conjugate distribution can guarantee the same functional forms of the prior distribution and the posterior distribution. In the Bayesian theory, inverse Wishart distribution is usually used as the conjugate prior to the covariance matrix of a Gaussian distribution with known mean [6]. Since

R_{k}

is the covariance matrix of Gaussian distribution, the prior distribution

p (R_{k} | z_{1 : k - 1})

is selected as an inverse Wishart distribution given by:

p (R_{k} | z_{1 : k - 1}) = IW (R_{k}; {\hat{u}}_{k | k - 1}, {\hat{U}}_{k | k - 1})

(20)

where

IW (\cdot; λ, Ψ)

denotes the PDF of the inverse Wishart distribution with degree of freedom (dof) λ and inverse scale matrix Ψ [3],

{\hat{u}}_{k | k - 1}

and

{\hat{U}}_{k | k - 1}

are the dof and the inverse scale matrix of

p (R_{k} | z_{1 : k - 1})

, respectively.

The posterior distribution

p (R_{k - 1} | z_{1 : k - 1})

is also subject to an inverse Wishart distribution as follows:

p (R_{k - 1} | z_{1 : k - 1}) = IW (R_{k - 1}; {\hat{u}}_{k - 1 | k - 1}, {\hat{U}}_{k - 1 | k - 1})

(21)

To guarantee that

p (R_{k} | z_{1 : k - 1})

is the inverse Wishart distribution given by Equation (20), the previous approximate posteriors is spread through a forgetting factor

ρ \in (0, 1]

, which indicates the extent of the time-fluctuations of the MNCM. Then, the prior dof

{\hat{u}}_{k | k - 1}

and the prior inverse scale matrix

{\hat{U}}_{k | k - 1}

are given as follows:

{\hat{u}}_{k ∣ k - 1} = ρ ({\hat{u}}_{k - 1 ∣ k - 1} - n - 1) + n + 1

(22)

{\hat{U}}_{k ∣ k - 1} = ρ {\hat{U}}_{k - 1 ∣ k - 1}

(23)

where n denotes the order of the MNCM

R_{k}

.

3.2.2. Variational Approximations of Posterior PDFs

According to the variational Bayesian approximation, the joint posterior PDF of the state

x_{k}

and the MNCM

R_{k}

is approximated to

p (x_{k}, R_{k} ∣ z_{1 : k}) \approx q (x_{k}) q (R_{k})

(24)

where

q (x_{k})

and

q (R_{k})

are the approximate posterior PDF of

x_{k}

and

R_{k}

, respectively. The variational Bayesian approximation is formed by minimizing the Kullback–Leibler divergence (KLD) between the true joint distribution

p (x_{k}, R_{k} ∣ z_{1 : k})

and the approximate distribution

q (x_{k}) q (R_{k})

, i.e.,

\{q (x_{k}), q (R_{k})\} = \arg \min KLD (q (x_{k}) q (R_{k}) ∥ p (x_{k}, R_{k} ∣ z_{1 : k}))

(25)

where

KLD (q (x) ∥ p (x))

denotes the KLD between

q (x)

and

p (x)

, and

KLD (q (x) ∥ p (x)) = \int q (x) \log \frac{q (x)}{p (x)} d x

(26)

The optimal solution of Equation (25) satisfies the following Equations [4]:

\log q (x_{k}) = E_{R_{k}} [\log p (x_{k}, R_{k}, z_{1 : k})] + c_{x}

(27)

\log q (R_{k}) = E_{x_{k}} [\log p (x_{k}, R_{k}, z_{1 : k})] + c_{R}

(28)

where

E_{x_{k}} [\cdot]

and

E_{R_{k}} [\cdot]

denote the expectation with regard to

x_{k}

and

R_{k}

, respectively, and

c_{x}

and

c_{R}

denote the constants with respect to

x_{k}

and

R_{k}

, respectively. Since the variational parameters of

q (x_{k})

and

q (R_{k})

are coupled, a fix-point iteration process is applied to solve Equations (27) and (28), i.e., the approximate posterior PFD

q (x_{k})

is updated to

q^{(i + 1)} (x_{k})

at the

i + 1

-th iteration using the posterior PDF

q^{(i)} (R_{k})

, and

q (R_{k})

is updated to

q^{(i + 1)} (R_{k})

using the posterior

q^{(i)} (x_{k})

.

According to Equations (18)–(20), the joint PDF is expressed as

\begin{array}{l} p (x_{k}, R_{k}, z_{1 : k}) \\ = p (z_{k} | x_{k}, R_{k}) p (x_{k} | z_{1 : k - 1}) p (R_{k} | z_{1 : k - 1}) p (z_{1 : k - 1}) \\ = N (z_{k}; h (x_{k}), R_{k}) N (x_{k}; {\hat{x}}_{k | k - 1}, P_{k | k - 1}) \times IW (R_{k}; {\hat{u}}_{k | k - 1}, {\hat{U}}_{k | k - 1}) p (z_{1 : k - 1}) \end{array}

(29)

(1): Update of $x_{k}$

The posterior

q^{(i + 1)} (x_{k} | z_{1 : k - 1})

is updated according to Equations (12), (16) and (17) as:

q^{(i + 1)} (x_{k} | z_{1 : k - 1}) = N (x_{k}; {\hat{x}}_{k | k}^{(i + 1)}, {\hat{P}}_{k | k}^{(i + 1)})

(30)

where the mean vector

{\hat{x}}_{k | k}^{(i + 1)}

and the covariance matrix

{\hat{P}}_{k | k}^{(i + 1)}

are given as follows:

K_{k}^{(i + 1)} = P_{k | k - 1} {(H_{k}^{(i)})}^{T} {(H_{k}^{(i)} P_{k | k - 1} {(H_{k}^{(i)})}^{T} + {\hat{R}}_{k}^{(i)})}^{- 1}

(31)

{\hat{x}}_{k | k}^{(i + 1)} = {\hat{x}}_{k | k - 1} + K_{k}^{(i + 1)} (z_{k} - h ({\hat{x}}_{k | k - 1}))

(32)

P_{k | k}^{(i + 1)} = P_{k | k - 1} - K_{k}^{(i + 1)} H_{k}^{(i)} P_{k | k - 1}

(33)

where

H_{k}^{(i)}

denotes the Jacobian matrix of the measurement function.

H_{k}^{(i)}

is obtained by substituting

{\hat{x}}_{k | k}^{(i)}

into

H_{k}

given by Equation (13).

Based on the new estimated state

{\hat{x}}_{k | k}^{(i)}

and MSEM

P_{k | k}^{(i)}

, a more accurate approximation of

h (x_{k})

can be obtain by performing linearization with

{\hat{x}}_{k | k}^{(i)}

[6], i.e.,

h (x_{k}) = h ({\hat{x}}_{k | k}^{(i)}) + H_{k}^{(i)} (x_{k} - {\hat{x}}_{k | k}^{(i)})

(34)

The estimate of the signal

{\hat{s}}_{k | k}^{(i)} (k)

is also used instead of the real one to calculate the measurement function

h ({\hat{x}}_{k | k}^{(i)})

by using Equation (7). Thus, Equation (32) is upgraded to

{\hat{x}}_{k | k}^{(i + 1)} = {\hat{x}}_{k | k - 1} + K_{k}^{(i + 1)} (z_{k} - h ({\hat{x}}_{k | k}^{(i)}) - H_{k}^{(i)} ({\hat{x}}_{k | k - 1} - {\hat{x}}_{k | k}^{(i)}))

(35)

(2): Update of $R_{k}$

According to Equations (18)–(20),

\log q^{(i)} (R_{k})

is given by

\begin{array}{l} \log q^{(i + 1)} (R_{k}) \\ = - 0.5 (n + {\hat{u}}_{k | k - 1} + 2) \log |R_{k}| - 0.5 tr ({\hat{U}}_{k | k - 1} R_{k}^{- 1}) - 0.5 {(z_{k} - h (x_{k}))}^{T} R_{k}^{- 1} (z_{k} - h (x_{k})) + c_{R} \\ = - 0.5 (n + {\hat{u}}_{k | k - 1} + 2) \log |R_{k}| - 0.5 tr ((B_{k}^{(i)} + {\hat{U}}_{k | k - 1}) R_{k}^{- 1}) + c_{R} \end{array}

(36)

where

B_{k}^{(i)} = E^{(i)} [(z_{k} - h (x_{k})) {(z_{k} - h (x_{k}))}^{T}]

(37)

Similar to Equation (34),

h (x_{k})

is linearized as

h (x_{k}) = h ({\hat{x}}_{k | k}^{(i + 1)}) + H_{k}^{(i + 1)} (x_{k} - {\hat{x}}_{k | k}^{(i + 1)})

(38)

where

H_{k}^{(i + 1)}

denotes the Jacobian matrix of the measurement function

h (x_{k})

at

{\hat{x}}_{k | k}^{(i + 1)}

, and

H_{k}^{(i + 1)}

is obtained by substituting

{\hat{x}}_{k | k}^{(i + 1)}

into Equation (13). Substituting Equation (38) into Equation (37), we obtain:

\begin{array}{l} B_{k}^{(i)} \\ = E^{(i)} [(z_{k} - h (x_{k})) {(z_{k} - h (x_{k}))}^{T}] \\ = E^{(i)} [(z_{k} - h ({\hat{x}}_{k | k}^{(i)}) - H_{k}^{(i)} (x_{k} - {\hat{x}}_{k | k}^{(i)})) {(z_{k} - h ({\hat{x}}_{k | k}^{(i)}) - H_{k}^{(i)} (x_{k} - {\hat{x}}_{k | k}^{(i)}))}^{T}] \\ = (z_{k} - h ({\hat{x}}_{k | k}^{(i)})) {(z_{k} - h ({\hat{x}}_{k | k}^{(i)}))}^{T} + H_{k}^{(i)} E^{(i)} [(x_{k} - {\hat{x}}_{k | k}^{(i)}) {(x_{k} - {\hat{x}}_{k | k}^{(i)})}^{T}] H_{k}^{T} \\ = (z_{k} - h ({\hat{x}}_{k | k}^{(i)})) {(z_{k} - h ({\hat{x}}_{k | k}^{(i)}))}^{T} + H_{k}^{(i)} P_{k | k}^{(i)} H_{k}^{T} \end{array}

(39)

From Equation (36),

q^{(i + 1)} (R_{k})

is updated as

q^{(i + 1)} (R_{k}) = IW (R_{k}; {\hat{u}}_{k}^{(i + 1)}, {\hat{U}}_{k}^{(i + 1)})

(40)

where the dof

{\hat{u}}_{k}^{(i + 1)}

and the inverse scale matrix

{\hat{U}}_{k}^{(i + 1)}

are given as follows:

{\hat{u}}_{k}^{(i + 1)} = {\hat{u}}_{k ∣ k - 1} + 1

(41)

{\hat{U}}_{k}^{(i + 1)} = B_{k}^{(i)} + {\hat{U}}_{k ∣ k - 1}

(42)

Then, according to Equation (29),

\log q^{(i)} (x_{k})

is given by:

\begin{matrix} \log q^{(i + 1)} (x_{k}) = - 0.5 {(z_{k} - h (x_{k}))}^{T} E^{(i + 1)} [R_{k}^{- 1}] (z_{k} - h (x_{k})) \\ - 0.5 {(x_{k} - {\hat{x}}_{k | k - 1})}^{T} P_{k | k - 1}^{- 1} (x_{k} - {\hat{x}}_{k | k - 1}) + c_{x} \end{matrix}

(43)

where

E^{(i + 1)} [R_{k}^{- 1}]

is given by:

E^{(i + 1)} [R_{k}^{- 1}] = ({\hat{u}}_{k}^{(i + 1)} - m - 1) {({\hat{U}}_{k}^{(i + 1)})}^{- 1}

(44)

The modified one step predicted PDF

p^{(i + 1)} (z_{k} | x_{k})

at the I + 1th iteration is defined as:

p^{(i + 1)} (z_{k} | x_{k}) = N (z_{k}; H_{k} x_{k}, {\hat{R}}_{k}^{(i + 1)})

(45)

where the modified MNCM

{\hat{R}}_{k}^{(i + 1)}

are formulated as:

{\hat{R}}_{k}^{(i + 1)} = {\{E^{(i + 1)} [R_{k}^{- 1}]\}}^{- 1} = {\hat{U}}_{k}^{(i + 1)} / ({\hat{u}}_{k}^{(i + 1)} - m - 1)

(46)

Finally, after N fixed-point iterations, the variational approximations of the posterior PDFs are given as follows:

q (x_{k}) \approx q^{(N)} (x_{k}) = N (x_{k}; {\hat{x}}_{k | k}^{(N)}, P_{k | k}^{(N)}) = N (x_{k}; {\hat{x}}_{k | k}, P_{k | k})

(47)

q (R_{k}) \approx q^{(N)} (R_{k}) = IW (R_{k}; {\hat{u}}_{k}^{(N)}, {\hat{U}}_{k}^{(N)}) = IW (R_{k}; {\hat{u}}_{k ∣ k}, {\hat{U}}_{k ∣ k})

(48)

Combining the EKF depicted by Equations (10)–(17) and the VB estimating technique, the state can be robustly estimated under uncertain measurement noise.

3.3. AI-VBEKF for Robust DOA Tracking

Considering the high nonlinearities of the measurement model by Equation (7), the initial values will affect the tracking results. Since the EKF is based on the Taylor series expansion to linearize the nonlinear system model, the truncation error will become unacceptable if the error between the initial guess of the state and its real value is quite large. In addition, if the initial MSEM cannot be set properly, the DOA tracking results will be in low accuracy or even diverged. In the real underwater DOA tracking scenario, the initial DOA of the target can be obtained by traditional DOA estimation methods (i.e., CBF, MVDR, MUSIC, et.al.), but the initial MSEM cannot be determined since the initial error between the real state and the estimated one is unknown. If the initial MSEM is set too small, the DOA tracking algorithm will converge very slowly or cannot converge at all. On the contrary, if the initial MSEM is set too large, the DOA tracking process will be soon diverged since the Kalman gain will become uncontrolled during the measurement update steps for nonlinear tracking systems. As a result, a proper preset initial MSEM not only affects the final DOA tracking accuracy but also determines the convergence of the total tracking procedure. However, in the real DOA tracking scenario, the initial MSEM can only be set by engineering experience. To solve this problem and deal with the uncertain measurement noise, an AI-aided robust DOA tracking algorithm is proposed in this subsection.

In order to minimalize the final DOA tracking error caused by the inaccurate initial guess of the MSEM, an AI-aided robust DOA tracking algorithm is proposed by this subsection. Firstly, considering that the different initial state guesses will lead to different theoretical measurement calculations by the measurement model, the error between the real measurements and the theoretical ones contains the information of the error of the initial guess of the MSEM between its true value. Thus, based on the difference between the theoretical measurement calculations and the real measurements, the initial MSEM can be estimated via deep learning techniques. By utilizing the covariance matrix of the difference between the theoretical measurement calculations and the real measurements from all bearing angles with different initial errors as inputs, an attention-based deep convolutional neural network is proposed by this subsection to output reliable initial MSEM. Then, utilizing the estimated MSEM as the input of the VB-EKF, the AI-VBEKF for a robust DOA tracking algorithm is finally carried out in this subsection.

3.3.1. Input Data Processing

Supposing the initial state is selected as:

x_{0} = {(θ_{0}, {\dot{θ}}_{0})}^{T}

(49)

Using the measurement model depicted by Equation (9), the error between the theoretical measurement and the real measurement can be presented as:

{\tilde{Z}}_{k} = Z_{k} - h_{0} (x_{0}) = [\begin{array}{l} z (0) \\ z (τ) \\ z (2 τ) \\ ⋮ \\ z ((M - 1) τ) \end{array}] - [\begin{array}{l} real (A (θ_{0}) s (0)) \\ real (A (θ_{0}) s (τ)) \\ real (A (θ_{0}) s (2 τ)) \\ ⋮ \\ real (A (θ_{0}) s ((M - 1) τ)) \end{array}]

(50)

Using Equation (50), the quadratic form of the error between the theoretical measurement and the real measurement can be presented as:

e_{V} = {\tilde{Z}}_{k} {\tilde{Z}}_{_{k}}^{T}

(51)

where

e_{V}

is a matrix with the dimension of

M P \times M P

.

From Equation (7), it can be found that the real measurement has a strong relationship with the real DOA angle of the target and its angular velocity. When the value of the real state changes, the measurement will also change. In addition, the estimated state influences the value of the theoretical measurement computed by the measurement model. Thus, the computed

e_{V}

has the information of the difference of the real values of the state and its estimated values. From the view of data-driven techniques,

e_{V}

can be utilized to output this error and can make good estimation of the guess of the initial MSEM.

In addition, from Equation (51), it is obvious that the size of

e_{V}

depends on the number of the snapshots in one measurement time interval. In order to make reliable DOA estimations and DOA tracking, the value of M is usually set high during one measurement updating step (i.e., the same as the sampling rate of the sonar system). As a result, the size of

e_{V}

is usually large, and the nonlinear relationship between the real guess of the initial MSEM and its real values is hard to be dug out by shallow BP neural networks. In order to make sufficient utilization of the data resources obtained by the sonar system and make reliable estimations of the guess of the initial MSEM, an attention-based DCNN is proposed in the following subsection.

3.3.2. Attention-Based Deep Convolutional Neural Network

As mentioned in Section 3.3.1, since the traditional BP neural networks have limited ability in dealing with a large matrix and the computed

e_{V}

has the form of a symmetric matrix such as an image, an attention-based DCNN is proposed by this subsection. Among the latest DCNNs, the attention-based DCNN is the most popular one and has been utilized in a number of target classification, localization, and tracking missions. Unlike the basic DCNN that cannot be developed very deep and the Resnet that cannot give different features different weights, which make the overall performance degrade, the attention-based DCNN not only utilizes the advantages of the Resnet to make the DCNN much deeper than the traditional ones, but also develops the attention scheme that values different features with different weights to make the DCNN have better tracking performance. The basic structure of the attention-based DCNN is shown in Figure 2 [35].

From Figure 2 [35], the main difference between the attention-based DCNN and the other DCNNs is the “squeeze and excitation” block. For any given data with the dimension

C' \times W' \times H'

, firstly a transformation mapping

F_{t r}

is operated to generate the feature maps U, which can be depicted as:

F_{t r} : X \to U, X \in ℝ^{H^{'} \times W^{'} \times C^{'}}, U \in ℝ^{H \times W \times C}

(52)

From Equation (52), it is obvious that the

F_{t r}

operation has no difference with other DCNNS (i.e.,

F_{t r}

is the traditional DCNN or Resnet). As a result, the attention-based DCNNs can be fitted into any existing DCNNs to enhance their performance.

After the

F_{t r}

mapping operation, the “squeeze and excitation” block can operate. Firstly, the feature maps U is operated via a squeeze process that aims to compress the information of the feature maps into a one-dimensional vector. By operating in this way, the compressed feature of every channel of the feature maps U is generated. The squeeze operation can be depicted as the following equation:

z_{c} = F_{s q} (u_{c}) = \frac{1}{H \times W} \sum_{H}^{i = 1} \sum_{W}^{j = 1} u_{c} (i, j)

(53)

where

u_{c}

is the

c

-th channel of the feature maps U, and

F_{s q}

is the squeeze operation.

From Equation (53), it can be found that the squeeze operation operates on every feature map with the dimension of

H \times W

. After the squeeze operation, every feature map of U turns to a real number so that the information is maximally compressed. Then, the excitation operation is processed to calculate the weights of every feature map. The excitation operation aims to fully capture the channel-wise dependencies of the input feature maps U. By utilizing full connection layers and the ReLU activation function, the nonlinear mapping process made by the excitation operation can be depicted as:

s = F_{e x} (z_{c}, W) = σ (g (z_{c}, W)) = σ (W_{2} δ (W_{1} z_{c}))

(54)

where

s

is the output of the excitation operation,

F_{e x}

is the excitation operation,

W_{1}

is the first fully connected (FC) layer that can be regarded as a nonlinear operation to extract the inner information of the input

z_{c}

,

δ (\cdot)

is the ReLU activating function,

W_{2}

is the second FC layer, and

σ (\cdot)

is a typical activating function that can be selected as a sigmoid function in general.

From Equation (54), after the excitation operation, the inner relationships between every feature map in U can be represented by

s

. Thus,

s

is the core of the attention-based DCNN, for it depicts the weights of every feature inside the whole feature maps U shown by Equation (52). As long as

s

is obtained, the weights vector of every feature map is known. As a result, the final output of the attention-based DCNN can be represented as:

{\tilde{x}}_{c} = F_{scale} (u_{c}, s_{c}) = s_{c} \cdot u_{c}

(55)

where

F_{scale}

is the final scaling operation that refers to the channel-wise multiplication between the

c

-th element of the output of the excitation operation and the

c

-th feature map of U. It is obvious from Equation (52) and Equation (55) that the original feature maps U is weighted by the “squeeze and excitation” block. This weighting mechanism is similar to paying attention to different features such that the whole deep learning technique is called the attention-based DCNN.

From Equation (52) and the above statements, the first mapping operation

F_{t r}

can be any type of DCNN technique. As the Resnet has proven its superior performance in underwater target localization missions [38], the

F_{t r}

is selected as the residual model [38] and the attention-based DCNN blocks combined with the residual model can be shown as the following block as shown in Figure 3:

Equations (52)–(55) represent the whole theoretical derivations of the attention-based DCNN. In addition, since this DCNN is mainly based on the “squeeze and excitation” block, the mentioned attention-based DCNN is also called the Senet. Here, substitute Equation (51) into Equation (52), and change the initial input

X

to

e_{V}

. The Senet can then be directly utilized to analyze the inner nonlinear relationship between the real values of the state and its estimated values.

3.3.3. Design of the AI-VBEKF

From Equation (9), if the measurement noise varies for the same initial error between the initial guess of the state and its real value,

e_{V}

computed by Equation (51) will change by different

R_{k}

. Supposing the number of varying covariance matrix of

R_{k}

is

H'

and the measurement

Z_{k}

has snapshots with the number of M,

e_{V}

will have the dimensions of

H' \times M \times M

. Then, substituting Equation (51) into Equation (52) and changing the initial input

X

to

e_{V}

with

H'

covariance matrix of the measurement noise and M snapshots, Equation (52) can be represented as:

F_{t r} : e_{V} \to U, e_{V} \in ℝ^{H^{'} \times M \times M}, U \in ℝ^{H \times W \times C}

(56)

In addition, if the transformation mapping

F_{t r}

is chosen as the residual model, and the following steps are the same as the aforementioned ones (Equations (53)–(55)), the attention-based DCNN combined with the residual model can be proposed. By using

e_{V}

with different

R_{k}

as inputs and the differences between the initial guess of the state and its real value as outputs, the attention-based DCNN for the initial guess of the error of the covariance matrix of the states can be proposed as in Table 1.

Then, by combining the proposed attention-based DCNN, the EKF DOA tracking scheme and the VB robust estimating technique, the AI-VBEKF, which has the abilities of estimating the guess of the initial MSEM and performing robust DOA tracking, can be depicted as Algorithm 1.

Algorithm 1: AI-VBEKF

Input

x_{0}

.
Calculate

{\tilde{Z}}_{k}

as Equation (50).
Calculate

e_{V}

as Equation (51).
Initial MSEM

P_{0}

estimation by attention-based DCNN.
Inputs:

{\hat{x}}_{k - 1 | k - 1}

,

P_{k - 1 | k - 1}

,

z_{k}

,

{\hat{u}}_{k - 1 | k - 1}

,

{\hat{U}}_{k - 1 | k - 1}

,

Q_{k - 1}

, ρ, N.
Time update

{\hat{x}}_{k | k - 1} = F_{k | k - 1} {\hat{x}}_{k - 1 | k - 1}

.

P_{k | k - 1} = F_{k | k - 1} P_{k - 1 | k - 1} F_{k | k - 1}^{T} + G_{k - 1} Q_{k - 1} G_{k - 1}^{T}

.

{\hat{u}}_{k ∣ k - 1} = ρ ({\hat{u}}_{k - 1 | k - 1} - n - 1) + n + 1

,

{\hat{U}}_{k | k - 1} = ρ {\hat{U}}_{k - 1 | k - 1}

.
Iteration measurement update
Initialization:

{\hat{x}}_{k | k}^{(0)} = {\hat{x}}_{k | k - 1}

,

{\hat{R}}_{k}^{(0)} = {\hat{U}}_{k | k - 1} / {\hat{u}}_{k | k - 1}

.
For

i = 0 : N - 1

Update

q^{(i + 1)} (x_{k}) = N (x_{k}; x_{k | k}^{(i + 1)}, P_{k | k}^{(i + 1)})

h ({\hat{x}}_{k | k}^{(i)})

is calculated by Equation (7),

H_{k}^{(i)}

is calculated by Equation (13).

K_{k}^{(i + 1)} = P_{k | k - 1} {(H_{k}^{(i)})}^{T} {(H_{k}^{(i)} P_{k | k - 1} {(H_{k}^{(i)})}^{T} + {\hat{R}}_{k}^{(i)})}^{- 1}

,

{\hat{x}}_{k | k}^{(i + 1)} = {\hat{x}}_{k | k - 1} + K_{k}^{(i + 1)} (z_{k} - h ({\hat{x}}_{k | k}^{(i)}) - H_{k}^{(i)} ({\hat{x}}_{k | k - 1} - {\hat{x}}_{k | k}^{(i)}))

,

P_{k | k}^{(i + 1)} = P_{k | k - 1} - K_{k}^{(i + 1)} H_{k}^{(i)} P_{k | k - 1}

.
Update

q^{(i + 1)} (R_{k}) = IW (R_{k}; {\hat{u}}_{k | k}^{(i + 1)}, {\hat{U}}_{k | k}^{(i + 1)})

h ({\hat{x}}_{k | k}^{(i + 1)})

is calculated by Equation (7),

H_{k}^{(i + 1)}

is calculated by Equation (13).

B_{k}^{(i + 1)} = (z_{k} - h ({\hat{x}}_{k | k}^{(i + 1)})) {(z_{k} - h ({\hat{x}}_{k | k}^{(i + 1)}))}^{T} + H_{k}^{(i + 1)} P_{k | k}^{(i + 1)} {(H_{k}^{(i + 1)})}^{T}

,

{\hat{U}}_{k | k}^{(i + 1)} = {\hat{U}}_{k | k - 1} + B_{k}^{(i + 1)}

,

{\hat{u}}_{k | k}^{(i + 1)} = {\hat{u}}_{k | k - 1} + 1

,

{\hat{R}}_{k}^{(i + 1)} = {\hat{U}}_{k | k}^{(i + 1)} / {\hat{u}}_{k | k}^{(i + 1)}

.
End for

{\hat{x}}_{k | k} = {\hat{x}}_{k | k}^{(N)}

,

P_{k | k} = P_{k | k}^{(N)}

,

{\hat{u}}_{k | k} = {\hat{u}}_{k | k}^{(N)}

,

{\hat{U}}_{k | k} = {\hat{U}}_{k | k}^{(N)}

.
Outputs:

{\hat{x}}_{k | k}

,

P_{k | k}

,

{\hat{u}}_{k | k}

,

{\hat{U}}_{k | k}

.

3.3.4. Performance Metrics

In the underwater target DOA tracking scenario, the angular velocity of the target is quite small, for the target usually moves at a relatively far distance from the observer. To obtain the accurate initial MNCM of the target, only the difference between the real initial angle and its estimation will be determined by the attention-based DCNN. According to the goal of the estimation of the initial guess of the error of the covariance matrix of the states, the metrics for quantifying the performance of the proposed attention-based DCNN can be described as:

E_{a c c, θ} = \frac{1}{N} \sum_{i = 1}^{N} \frac{|{\bar{θ}}_{0} - θ_{0}|}{θ_{0}}

(57)

where

{\bar{θ}}_{0}

is the estimated initial guess of the DOA angle, and

θ_{0}

is the real initial values of the DOA angle and its angular velocity.

N

is the total number of the training data.

4. Results and Discussions

4.1. Simulations

4.1.1. Simulation Scenario and Data Set Generation of the Attention-Based DCNN

An underwater target is assumed to perform in CV motion mode. The initial state

x_{0}

is set to [270, −0.03], and process noise covariance matrix

Q_{k - 1}

is set to

2 \times 10^{- 4} ° / s^{2}

. The total simulation time is 5000 s with a 1 s interval. The frequency f, amplitude a and the initial phase

ϕ_{0}

of the target signal are set to 170 Hz, 1 and 0 rad, respectively. The underwater sound velocity c is assumed to be 1500 m/s. A UCA with 12 elements is used to obtain the measurements. The sample frequency of the array signal is 1000 Hz.

The MNCM

R_{k}

is set to

I_{M P} σ_{r}^{2}

, where

σ_{r}^{2}

is set as 20 to simulate a low SNR tracking scenario. The uncertainties of the MNCM appear from 3600 to 4000 s to test the robustness of the VB-EKF to the uncertain measurement noise caused by the unknown underwater environment. Then,

R_{k}

is expressed as:

R_{k} = \{\begin{cases} I_{M P} σ_{r}^{2} & 0 \leq k < 3600, 4000 < k \leq 5000 \\ 10 I_{M P} σ_{r}^{2} & 3600 \leq k \leq 4000 \end{cases}

(58)

The proposed AI-VBEKF for DOA tracking given by Algorithm 1 is tested by the simulated measurement data. The EKF, the SH-EKF for DOA tracking [44] are also tested for comparison. In addition, to illustrate the superior performance of the proposed AI-VBEKF,

P_{k}

is initialized to different initial guesses as

P_{0} = diag ([25, 10^{- 5}])

,

P_{0} = diag ([50, 10^{- 5}])

, and

P_{0} = diag ([100, 10^{- 5}])

. The process noise covariance matrix

Q_{k}

for EKF, SH-EKF and VB-EKF is set to the true value. The MNCM of EKF is set to a constant

I_{M P}

. The initial estimate of the MNCM and the forgetting factor b are set to

I_{M P}

and 0.9 for the SH-EKF. The dof

{\hat{u}}_{0}

, the inverse scale matrix

{\hat{U}}_{0}

of the inverse Wishart distribution and the forgetting factor ρ for VB-EKF are set to 16,

16 I_{M P}

, and 0.9995, respectively.

To numerically compare the tracking precision of the above methods, the bearing angle estimation error (BEE) is defined as a performance metrics, which is given as follows:

B E E (k) = |{\hat{θ}}_{k} - θ_{k}^{}|

(59)

where

{\hat{θ}}_{k}

is the bearing angle estimate at step k, and

θ_{k}^{}

is the real bearing angle.

Furthermore, 500 Monte Carlo simulations are performed to statistically evaluate the performance of the VB-EKF. The root mean square error (RMSE) for DOA tracking results at time step k is defined as

{RMSE}_{θ} = \sqrt{\frac{1}{N} {\sum_{n = 1}^{N} ({\hat{θ}}_{k, n} - θ_{k})}^{2}}

(60)

where N is the number of the Monte Carlo trials,

{\hat{θ}}_{k, n}

is the estimate of bearing angle in the n-th trail. The average RMSE of the bearing angle estimate in total time is defined as:

{\overline{R M S E}}_{θ} = \sqrt{\frac{1}{K N} {\sum_{k = 1}^{K} \sum_{n = 1}^{N} ({\hat{θ}}_{k, n} - θ_{k})}^{2}}

(61)

where K is the total number of tracking steps. A smaller RMSE represents a higher precision of the DOA tracking.

4.1.2. Data Set Generation and Training of the Attention-Based DCNN

The training data for the attention-based DCNN for initial MSEM estimations are generated as the following steps:

Firstly,

θ_{0}

of the initial state

x_{0}

is set as

θ_{0} \in [0, 360]

deg with intervals of 1 deg.

In addition, the offset of the initial guess of

θ_{0}

is set as:

Δ θ_{0} \in [0, 20]

deg with the interval of 0.1 deg.

Then, the MNCM

R_{k}

is set as:

R_{k} = α I_{M P} σ_{r, k}^{2}

, where the parameter

α

is set as

α \in [1, 2]

with the interval of 0.1 to generate different measurement noise.

Finally, assuming that a UCA has 12 elements and the number of snapshots in one measurement is set as 12, by utilizing the different generated

θ_{0}

along with

R_{k}

and using Equations (4)–(9), (50) and (51), the data set for training and testing can be generated.

From the above data-generating methods, the dimension of every input data of the attention-based DCNN is

144 \times 144

, and the total number of data are 1,440,000.

By utilizing the generated training data and the proposed attention-based DCNN depicted by Table 1, the proposed attention-based DCNN can be properly trained, and the averaged cross entropy loss for the training process is shown in Figure 4.

4.1.3. Simulation Results

The DOA tracking results of the EKF, SH-EKF and AI-VBEKF are shown in Figure 5, and Table 1 shows the average RMSEs. Since the CBF is only a DOA estimation technique, we merely compared different DOA tracking algorithms in the simulation section to show the robustness and accuracy of the proposed AI-VBEKF. When the initial MSEM is set properly (

P_{0} = diag ([25, 10^{- 5}])

), all of the DOA tracking techniques can output acceptable DOA tracking results. The trajectory obtained by the EKF, SH-EKF, and AI-VBEKF fluctuated in the beginning. After about 1000 s, the algorithms converged, and then, the tracking stabilized. The reason is that the error in the initial state estimate and MSEM affect the performance at the beginning of the tracking. The error was gradually eliminated by the correction of the new measurements after a certain period. When the uncertainties of the measurement noise appear, the DOA estimation accuracy of the EKF degrades. However, by utilizing the prior information of the kinematics model of the underwater target, the DOA tracking results by the EKF are still within a certain threshold (see in Figure 5b).

Since the SH-EKF estimates the MNCM in real time by using the Sage–Husa algorithm to provide accurate Kalman filter gain, the precision of SH-EKF is higher than EKF. By giving the prior distribution of MNCM, the AI-VBEKF uses the VB iteration method to jointly estimate the MNCM and the target state, which provides a more accurate estimation of the MNCM. Thus, the AI-VBEKF provides the most high-precision tracking trajectory. Therefore, by utilizing the online measurement noise covariance estimating techniques, the SH-EKF and AI-VBEKF can achieve much higher DOA tracking accuracy.

From the RMSE results (

P_{0} = diag ([25, 10^{- 5}])

) shown by Table 2, similar to the conclusion above, the EKF, SH-EKF and AI-VBEKF converged after a period, and SH-EKF and AI-VBEKF can provide high-precision tracking even with the uncertain measurement noise. The AI-VBEKF can achieve the highest DOA tracking results so that the superior performance of the proposed AI-VBEKF is verified again.

Figure 5c,f and Table 2 (

P_{0} = diag ([50, 10^{- 5}])

,

P_{0} = diag ([100, 10^{- 5}])

) give the DOA tracking results under different initial MSEMs. From the DOA tracking results, it is obvious that the properly set initial MSEM can strongly affect the overall DOA tracking results. By utilizing the attention-based DCNN to estimate the initial MSEM before operating the DOA tracking procedure, the AI-VBEKF can achieve good tracking accuracy and robustness. However, without properly setting the initial MSEM, the DOA tracking results of the SH-EKF and EKF degrade, especially for the case that the initial MSEM is largely deviated from its proper values (

P_{0} = diag ([100, 10^{- 5}])

).

4.2. Experimental Scenario

4.2.1. Experimental Scenario

We used the data from an acoustic experiment in the South China Sea in July 2021 to verify the performance of the proposed method. A ship with an acoustic signal emission system kept moving and emitting an acoustic signal at 170 Hz for 7500 s with a source level of 145 dB during the experimental procedure.

Figure 6a shows the underwater buoy system used in the experiment, which was placed 20 m under the sea surface. A 12-element uniform circular hydrophone array with a radius of 1 m was fixed on this underwater buoy along with a digital acquisition system. As shown in Figure 6b, a compass system (the rectangle) was fixed on the underwater buoy in order to measure the bearing of hydrophone 1 to offset the rotation angle of the buoy φ (measured clockwise wrt north). The digital acquisition system sampled the array signal at a frequency of 8192 Hz during the acoustic signal emission. A CTD profiler was used to measure the sound velocity to be constant at 1544 m/s at a depth from 0 to 20 m.

In order to provide the true bearing angle of the target with regard to the circular hydrophone array for reference, the GPSs are used to record the locations of the acoustic signal emission system and the underwater buoy system. Then, the bearing angle trajectory of the target was calculated clockwise with regard to north by using the GPS data.

4.2.2. Verification of the AI-VBEKF for DOA Tracking

The CBF-based DOA estimation [46], the EKF, SH-EKF [44] and the proposed AI-VBEKF are tested by using the experimental data. The interval of adjacent steps was set to 1 s. The bearing angle of the CBF-based DOA estimation scans from 0° to 360° every 1°. The initial state estimates of the EKF, SH-EKF and the AI-VBEKF are both set to

{\hat{X}}_{0} = [{\hat{θ}}_{C B F}^{0}, 0]

, where

{\hat{θ}}_{C B F}^{0}

is the estimate of bearing angle obtained by using CBF-based DOA estimation at k = 0. In addition, to illustrate the superior performance of the proposed AI-VBEKF,

P_{k}

is initialized to different initial guesses as

P_{0} = diag ([25, 10^{- 5}])

,

P_{0} = diag ([50, 10^{- 5}])

, and

P_{0} = diag ([100, 10^{- 5}])

. The covariance matrix of the process noise

Q_{k}

is set as

10^{- 4}

. The MNCM was set as

I_{M P}

for the EKF. The estimation of the MNCM

{\hat{R}}_{k}

is initialized to

I_{M P}

, and the forgetting factor b is set to 0.995 for the SH-EKF. The parameters

{\hat{u}}_{0}

,

{\hat{U}}_{0}

and ρ of the AI-VBEKF are set to 16,

16 I_{M P}

and 0.997, respectively.

To numerically compare the tracking precision of the above methods, the bearing angle estimation error (BEE) and the average BEE (ABEE) are defined as the performance metrics, which are given as follows:

B E E (k) = |{\hat{θ}}_{k} - θ_{k}^{G P S}|

(62)

A B E E (k) = \sqrt{\frac{1}{K} {\sum_{k = 1}^{K} ({\hat{θ}}_{k} - θ_{k}^{G P S})}^{2}}

(63)

where

{\hat{θ}}_{k}

is the bearing angle estimate at step k, and

θ_{k}^{G P S}

is the true bearing angle obtained by the GPS data.

The trajectories obtained by the AI-VBEKF under different

P_{0}

are shown in Figure 7 and Table 3, which show the BEEs and the ABEEs, respectively. Figure 7 and Table 3 also show the results of the CBF, EKF and the SH-EKF for comparison. Specifically, considering the complicated actual underwater environment, the CBF used 10 measurements to obtain bearing angles every 10 s for a higher precision. In order to further test the robustness of the proposed AI-VBEKF to uncertain measurement noise, a period of Gaussian noise with the covariance of

2 I_{M P}

was added to the raw measurement data from 2600 to 3000 s, and then the methods were tested again. The results are shown in Figure 7 and Table 3.

Figure 7 shows that the trajectories of all methods fluctuated around the true trajectory under the effect of the underwater environmental noise. Figure 7 shows that the trajectories obtained by all methods using raw experimental data fluctuated around the true trajectory due to the effect of the underwater environmental noise. In addition, it can be found that the initial guess of

P_{0}

influences the final tracking accuracy. From Figure 7a,b, when

P_{0}

is properly set, all the DOA tracking and DOA estimation methods can output reliable results. The trajectory obtained by the CBF-based DOA estimation is the most fluctuant and provided a large error. The precision of the EKF is higher than that of the CBF-based DOA estimation since the EKF considers the kinematic model of the underwater target.

The trajectory obtained by the robust DOA tracking algorithms, namely the SH-EKF and the AI-VBEKF, are closest to the true trajectory. The reason is that the robust DOA tracking algorithms provide more accurate Kalman filter gains than EKF due to the high-precision estimation of the MNCM, which can significantly improve the precision of DOA tracking. In addition, it can be found that the proposed AI-VBEKF has higher DOA tracking accuracy compared the SH-EKF since the VB technique is more analytical, such that it makes a precise mathematical model of the unknown MNCM and makes it more accurately estimated. In addition, the AI-VBEKF utilizes the attention-based DCNN to estimate the initial MSEM, which makes the whole DOA tracking process more stable and accurate. However, the data-inspiring SH-EKF outputs the DOA tracking results with lower accuracy since the precision of the experimental data is not accurate enough such that the tracking window of good estimations is not sufficient. Table 3 shows the high precision and robustness of the AI-VBEKF.

Figure 7c–f and Table 3 illustrate the DOA tracking results under different

P_{0}

. It can be found that the tracking results of the EKF and SH-EKF degrade when

P_{0}

is not set properly. As analyzed in Section 3.3, the high nonlinearities and complexed underwater environment make the unproperly set

P_{0}

seriously influence the DOA tracking result. However, since the proposed AI-VBEKF does not rely on the preset

P_{0}

, its DOA tracking accuracy only has slight changes with different initial MSEMs. As a result, the proposed AI-VBEKF is robust to the initial MSEMs and has the highest tracking accuracy.

Figure 8a–f and Table 3 give the results obtained by using the data with added noise. Similar conclusions can be made as the above. Furthermore, the AI-VBEKF kept robust tracking even during the existence of the added noise under different initial MSEMs, while the performance of the SH-EKF, EKF and the CBF-based DOA estimation obviously degraded when the noise was added or the initial MSEMs were not properly set.

5. Conclusions

An AI-VBEKF robust DOA tracking method was proposed and verified. The theoretical derivations were comprehensively given, and the performances were proven by simulations and the real sea trial data. It can be found in the experiment results that the proposed attention-based DCNN can make good estimation of the initial MSEM after being well trained by the training data with different initial angles, initial errors, and covariance matrix of the measurement noise. Based on the experimental results, the proposed AI-VBEKF can not only have more accurate estimation results, but it is also robust to the uncertain underwater disturbance and the initial guesses of the MSEMs. From the overall consideration of accuracy and robustness, the proposed AI-VBEKF can be considered as an alternative underwater DOA tracking algorithm, especially when the underwater environment is uncertain and the initial MSEM cannot be determined properly.

Author Contributions

Conceptualization, X.H. and B.Z.; Methodology, B.Z.; Software, X.H. and Y.Q.; Validation, Y.Q.; Writing—original draft, X.H. and Y.Q.; Supervision, Y.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (grant no. 12104113), the foundation of Central University Operating Expenses Project (grant no. G2022KY05102), and the fundamental research funding for characteristic disciplines (grant no. G2022WD0235).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

No new data were created and data is unavailable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Rideout, B.P.; Dosso, S.E.; Hannay, D.E. Underwater passive acoustic localization of Pacific walruses in the northeastern Chukchi Sea. J. Acoust. Soc. Am. 2013, 134, 2534–2545. [Google Scholar] [CrossRef] [Green Version]
Zhu, X.; Buck, J.R. Designing nonuniform linear arrays to maximize mutual information for bearing estimation. J. Acoust. Soc. Am. 2010, 128, 2926–2939. [Google Scholar] [CrossRef] [PubMed]
Miller, B.; Dawson, S. A large-aperture low-cost hydrophone array for tracking whales from small boats. J. Acoust. Soc. Am. 2009, 126, 2248–2256. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Rooney, I.M.; Buck, J.R. Spatial power spectral density estimation using a Welch coprime sensor array processor. J. Acoust. Soc. Am. 2019, 145, 2350–2362. [Google Scholar] [CrossRef] [Green Version]
Zeng, W.-J.; So, H.C.; Huang, L. ℓ_p-MUSIC: Robust Direction-of-Arrival Estimator for Impulsive Noise Environments. IEEE Trans. Signal Process. 2013, 61, 4296–4308. [Google Scholar] [CrossRef]
Wang, B.; Zhang, Y.D.; Wang, W. Robust DOA Estimation in the Presence of Miscalibrated Sensors. IEEE Signal Process. Lett. 2017, 24, 1073–1077. [Google Scholar] [CrossRef]
Mao, Y.; Zhang, G.; Leung, H. Harmonic Retrieval Joint Multiple Regression: Robust DOA Estimation for FMCW Radar in the Presence of Unknown Spatially Colored Noise. IEEE Commun. Lett. 2021, 25, 2240–2244. [Google Scholar] [CrossRef]
Hu, R.; Fu, Y.; Chen, Z.; Xu, J.; Tang, J. Robust DOA Estimation via Sparse Signal Reconstruction with Impulsive Noise. IEEE Commun. Lett. 2017, 21, 1333–1336. [Google Scholar] [CrossRef]
Li, T.; Nehorai, A. Direction-of-Arrival Estimation of Hydroacoustic Signals from Marine Vessels Containing Random and Sinusoidal Components. IEEE Signal Process. Lett. 2012, 19, 503–506. [Google Scholar] [CrossRef]
Zheng, Z.; Yang, C. Direction-of-Arrival Estimation of Coherent Signals Under Direction-Dependent Mutual Coupling. IEEE Commun. Lett. 2021, 25, 147–151. [Google Scholar] [CrossRef]
Zheng, G. DOA Estimation in MIMO Radar With Non-Perfectly Orthogonal Waveforms. IEEE Commun. Lett. 2016, 21, 414–417. [Google Scholar] [CrossRef]
Zhao, J.; Gui, R.; Dong, X.; Wu, S. Time-Varying DOA Tracking Algorithm Based on Generalized Labeled Multi-Bernoulli. IEEE Access 2021, 9, 5943–5950. [Google Scholar] [CrossRef]
Saucan, A.-A.; Chonavel, T.; Sintes, C.; Le Caillec, J.-M. Marked poisson point process PHD filter for DOA tracking. In Proceedings of the 2015 23rd European Signal Processing Conference (EUSIPCO), Nice, France, 31 August–4 September 2015; pp. 2621–2625. [Google Scholar] [CrossRef] [Green Version]
Saucan, A.-A.; Chonavel, T.; Sintes, C.; Le Caillec, J.-M. CPHD-DOA Tracking of Multiple Extended Sonar Targets in Impulsive Environments. IEEE Trans. Signal Process. 2015, 64, 1147–1160. [Google Scholar] [CrossRef]
Zhai, Y.; Yeary, M. A New Particle Filter Tracking Algorithm for DOA Sensor Systems. In Proceedings of the 2007 IEEE Instrumentation & Measurement Technology Conference IMTC, Warsaw, Poland, 1–3 May 2007; pp. 1–4. [Google Scholar] [CrossRef]
Mehra, R.K. Approaches to adaptive filtering. In Proceedings of the Adaptive Processes (9th) Decision and Control, 1970 IEEE Symposium on IEEE, Austin, TX, USA, 7–9 December 1970; IEEE: Piscataway, NJ, USA, 2010. [Google Scholar]
Sarkka, S.; Nummenmaa, A. Recursive Noise Adaptive Kalman Filtering by Variational Bayesian Approximations. IEEE Trans. Autom. Control 2009, 54, 596–600. [Google Scholar] [CrossRef]
Hartikainen, S.S.J. Variational Bayesian adaptation of noise covariances in non-linear Kalman filtering. arXiv 2013, arXiv:1302.0681. [Google Scholar]
Mohinder, S.G. Kalman Filtering: Theory and Practice Using Matlab; Angus, P.A., Ed.; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2001; pp. 133–144. [Google Scholar]
Zhao, Y.; Zhang, J.; Hu, G.; Zhong, Y. Set-membership based hybrid Kalman filter for nonlinear state estimation under system uncertainty. Sensors 2020, 20, 627. [Google Scholar] [CrossRef] [Green Version]
Pesce, V.; Silvestrini, S.; Lavagna, M. Radial basis function neural network aided adaptive extended Kalman filter for spacecraft relative navigation. Aerosp. Sci. Technol. 2019, 96, 105527. [Google Scholar] [CrossRef]
Huang, Y.; Zhang, Y.; Wu, Z.; Li, N.; Chambers, J. A novel adaptive Kalman filter with inaccurate process and measure-ment noise covariance matrices. IEEE Trans. Autom. Control 2017, 63, 594–601. [Google Scholar] [CrossRef] [Green Version]
Huang, Y.; Zhang, Y.; Shi, P.; Chambers, J. Variational Adaptive Kalman Filter with Gaussian-Inverse-Wishart Mixture Distribution. IEEE Trans. Autom. Control 2020, 66, 1786–1793. [Google Scholar] [CrossRef]
Huang, Y.; Bai, M.; Li, Y.; Zhang, Y.; Chambers, J. An Improved Variational Adaptive Kalman Filter for Cooperative Localization. IEEE Sens. J. 2021, 21, 10775–10786. [Google Scholar] [CrossRef]
Xu, G.; Huang, Y.; Gao, Z.; Zhang, Y. A Computationally Efficient Variational Adaptive Kalman Filter for Transfer Alignment. IEEE Sens. J. 2020, 20, 13682–13693. [Google Scholar] [CrossRef]
Hu, G.; Gao, S.; Zhong, Y.; Gao, B.; Subic, A. Modified strong tracking unscented Kalman filter for nonlinear state estimation with process model uncertainty. Int. J. Adapt. Control. Signal Process. 2016, 29, 1561–1577. [Google Scholar] [CrossRef]
Wang, J.; Zhang, T.; Xu, X.; Li, Y. A Variational Bayesian Based Strong Tracking Interpolatory Cubature Kalman Filter for Maneuvering Target Tracking. IEEE Access 2018, 6, 52544–52560. [Google Scholar] [CrossRef]
Lan, H.; Sun, S.; Wang, Z.; Pan, Q.; Zhang, Z. Joint Target Detection and Tracking in Multipath Environment: A Variational Bayesian Approach. IEEE Trans. Aerosp. Electron. Syst. 2019, 56, 2136–2156. [Google Scholar] [CrossRef] [Green Version]
Ban, Y.; Alameda-Pineda, X.; Girin, L.; Horaud, R. Variational Bayesian Inference for Audio-Visual Tracking of Multiple Speakers. IEEE Trans. Pattern Anal. Mach. Intell. 2019, 43, 1761–1776. [Google Scholar] [CrossRef] [PubMed]
Hou, X.; Yuan, J.; Ma, C.; Sun, C. Parameter estimations of uncooperative space targets using novel mixed artificial neural net-work. Neurocomputing 2019, 339, 232–244. [Google Scholar] [CrossRef]
Wenhui, W.; Shesheng, G.; Yongmin, Z.; Gu, C.; Hu, G. Adaptive Square-Root Unscented Particle Filtering Algorithm for Dynamic Nav-igation. Sensors 2018, 18, 2337. [Google Scholar]
Rumelhart, D.E.; Hinton, G.E.; Williams, R.J. Learning Internal Representations by Error Propagation. Read. Cogn. Sci. 1988, 323, 399–421. [Google Scholar]
Hinton, G.E.; Osindero, S.; Teh, Y.W. A Fast Learning Algorithm for Deep Belief Nets; MIT Press: Cambridge, MA, USA, 2006. [Google Scholar]
He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 27–30 June 2016; IEEE: Piscataway, NJ, USA, 2016. [Google Scholar]
Hu, J.; Shen, L.; Sun, G. Squeeze-and-excitation networks. arXiv 2017, arXiv:1709.01507. [Google Scholar] [CrossRef] [Green Version]
Lecun, Y.; Bottou, L.; Bengio, Y.; Haffner, P. Gradient-based learning applied to document recognition. Proc. IEEE 1998, 86, 2278–2324. [Google Scholar] [CrossRef] [Green Version]
Niu, H.; Reeves, E.; Gerstoft, P. Source localization in an ocean waveguide using supervised machine learning. J. Acoust. Soc. Am. 2017, 142, 1176–1188. [Google Scholar] [CrossRef]
Niu, H.; Gong, Z.; Ozanich, E.; Gerstoft, P.; Wang, H.; Li, Z. Deep-learning source localization using multi-frequency magnitude-only data. J. Acoust. Soc. Am. 2019, 146, 211–222. [Google Scholar] [CrossRef] [Green Version]
Liu, Y.; Niu, H.; Li, Z. A multi-task learning convolutional neural network for source localization in deep ocean. J. Acoust. Soc. Am. 2020, 148, 873–883. [Google Scholar] [CrossRef]
Ozanich, E.; Gerstoft, P.; Niu, H. A feedforward neural network for direction-of-arrival estimation. J. Acoust. Soc. Am. 2020, 147, 2035–2048. [Google Scholar] [CrossRef] [PubMed]
Huang, Z.; Xu, J.; Gong, Z.; Wang, H.; Yan, Y. A Deep Neural Network Based Method of Source Localization in a Shallow Water Environment. In Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, AB, Canada, 15–20 April 2018; pp. 3499–3503. [Google Scholar]
Huang, Z.; Xu, J.; Gong, Z.; Wang, H.; Yan, Y. Multiple Source Localization in a Shallow Water Waveguide Exploiting Subarray Beamforming and Deep Neural Networks. Sensors 2019, 19, 4768. [Google Scholar] [CrossRef] [Green Version]
Benson, J.; Chapman, N.; Antoniou, A. Geoacoustic model inversion with artificial neural networks. In Proceedings of the 1998 IEEE Symposium on Advances in Digital Filtering and Signal Processing. Symposium Proceedings (Cat. No.98EX185), Victoria, BC, Canada, 5–6 June 2002; pp. 121–125. [Google Scholar] [CrossRef]
Zhang, B.; Hou, X.; Yang, Y. Robust underwater direction-of-arrival tracking with uncertain environmental disturbances using a uniform circular hydrophone array. J. Acoust. Soc. Am. 2022, 151, 4101–4113. [Google Scholar] [CrossRef] [PubMed]
Kong, D.; Chun, J. A fast DOA tracking algorithm based on the extended Kalman filter. In Proceedings of the IEEE 2000 National Aerospace and Electronics Conference (NAECON), Dayton, OH, USA, 12 October 2000; pp. 235–238. [Google Scholar]
Van Veen, B.; Buckley, K. Beamforming: A versatile approach to spatial filtering. IEEE ASSP Mag. 1988, 5, 4–24. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Configuration of the UCA-based measurement system.

Figure 2. The basic structure of the attention-based DCNN [35].

Figure 3. The structure of the attention-based DCNN blocks combined with the residual model [35].

Figure 4. The cross-entropy loss of the attention-based DCNN on generated data sets.

Figure 5. Simulation results of EKF, SH-EKF and AI-VBEKF. (a) Trajectories tracking results under

P_{0} = diag ([25, 10^{- 5}])

, (b) BEEs obtained under

P_{0} = diag ([25, 10^{- 5}])

, (c) trajectories tracking results under

P_{0} = diag ([50, 10^{- 5}])

, (d) BEEs obtained under

P_{0} = diag ([50, 10^{- 5}])

, (e) trajectories tracking results under

P_{0} = diag ([100, 10^{- 5}])

, and (f) BEEs obtained under

P_{0} = diag ([100, 10^{- 5}])

.

Figure 5. Simulation results of EKF, SH-EKF and AI-VBEKF. (a) Trajectories tracking results under

P_{0} = diag ([25, 10^{- 5}])

, (b) BEEs obtained under

P_{0} = diag ([25, 10^{- 5}])

, (c) trajectories tracking results under

P_{0} = diag ([50, 10^{- 5}])

, (d) BEEs obtained under

P_{0} = diag ([50, 10^{- 5}])

, (e) trajectories tracking results under

P_{0} = diag ([100, 10^{- 5}])

, and (f) BEEs obtained under

P_{0} = diag ([100, 10^{- 5}])

.

Figure 6. Configuration of the underwater buoy system: (a) photo of the underwater buoy system; (b) configuration of the UCA and the compass system.

Figure 7. Experimental results of CBF-based DOA estimation, EKF, SH-EKF and AI-VBEKF on raw data. (a) Trajectories obtained using the raw data under

P_{0} = diag ([25, 10^{- 5}])

, (b) BEEs obtained using the raw data under

P_{0} = diag ([25, 10^{- 5}])

, (c) trajectories obtained using the raw data under

P_{0} = diag ([50, 10^{- 5}])

, (d) BEEs obtained using the raw data under

P_{0} = diag ([50, 10^{- 5}])

, (e) trajectories obtained using the raw data under

P_{0} = diag ([100, 10^{- 5}])

, and (f) BEEs obtained using the raw data under

P_{0} = diag ([100, 10^{- 5}])

.

Figure 7. Experimental results of CBF-based DOA estimation, EKF, SH-EKF and AI-VBEKF on raw data. (a) Trajectories obtained using the raw data under

P_{0} = diag ([25, 10^{- 5}])

, (b) BEEs obtained using the raw data under

P_{0} = diag ([25, 10^{- 5}])

, (c) trajectories obtained using the raw data under

P_{0} = diag ([50, 10^{- 5}])

, (d) BEEs obtained using the raw data under

P_{0} = diag ([50, 10^{- 5}])

, (e) trajectories obtained using the raw data under

P_{0} = diag ([100, 10^{- 5}])

, and (f) BEEs obtained using the raw data under

P_{0} = diag ([100, 10^{- 5}])

.

Figure 8. Experimental results of CBF-based DOA estimation, EKF, SH-EKF and AI-VBEKF on data with noise. (a) Trajectories obtained using the data with added noise under

P_{0} = diag ([25, 10^{- 5}])

, (b) BEEs obtained using the data with added noise under

P_{0} = diag ([25, 10^{- 5}])

, (c) trajectories obtained using the data with added noise under

P_{0} = diag ([50, 10^{- 5}])

, (d) BEEs obtained using the data with added noise under

P_{0} = diag ([50, 10^{- 5}])

, (e) trajectories obtained using the data with added noise under

P_{0} = diag ([100, 10^{- 5}])

, and (f) BEEs obtained using the data with added noise under

P_{0} = diag ([100, 10^{- 5}])

.

Figure 8. Experimental results of CBF-based DOA estimation, EKF, SH-EKF and AI-VBEKF on data with noise. (a) Trajectories obtained using the data with added noise under

P_{0} = diag ([25, 10^{- 5}])

, (b) BEEs obtained using the data with added noise under

P_{0} = diag ([25, 10^{- 5}])

, (c) trajectories obtained using the data with added noise under

P_{0} = diag ([50, 10^{- 5}])

, (d) BEEs obtained using the data with added noise under

P_{0} = diag ([50, 10^{- 5}])

, (e) trajectories obtained using the data with added noise under

P_{0} = diag ([100, 10^{- 5}])

, and (f) BEEs obtained using the data with added noise under

P_{0} = diag ([100, 10^{- 5}])

.

Table 1. The attention-based DCNN for estimation of the guess of the initial MSEM.

Layer Name	[Kernel Size, Filters] $\times$ No. of Blocks
SE-Comv1	$[5 \times 5, 64] \times 1$
SE-Conv2	$[\begin{matrix} 1 \times 1 & 64 \\ 3 \times 3 & 64 \\ 1 \times 1 & 256 \\ f u l l c o n n e c t i o n & [16, 256] \end{matrix}] \times 3$
SE-Conv3	$[\begin{matrix} 1 \times 1 & 128 \\ 3 \times 3 & 128 \\ 1 \times 1 & 512 \\ f u l l c o n n e c t i o n & [32, 512] \end{matrix}] \times 4$
SE-Conv4	$[\begin{matrix} 1 \times 1 & 256 \\ 3 \times 3 & 256 \\ 1 \times 1 & 1024 \\ f u l l c o n n e c t i o n & [64, 1024] \end{matrix}] \times 6$
SE-Conv5	$[\begin{matrix} 1 \times 1 & 512 \\ 3 \times 3 & 512 \\ 1 \times 1 & 2048 \\ f u l l c o n n e c t i o n & [128, 2048] \end{matrix}] \times 3$
SE-Global	Global average pool, fully-connected layer with ReLU

Table 2. The average RMSEs of EKF, SH-EKF and VB-EKF.

	$P_{0}$	EKF	SH-EKF	VB-AEKF
$\bar{R M S E_{θ}}$ (°)	$P_{0} = diag ([25, 10^{- 5}])$	7.4	2.8	2.7
	$P_{0} = diag ([50, 10^{- 5}])$	13.2	7.4	2.7
	$P_{0} = diag ([100, 10^{- 5}])$	61.7	7.3	2.7

Table 3. ABEEs of different DOA tracking methods under different initial MSEMs.

Data			ABEE (°)
Data		CBF	EKF	SH-EKF	AI-VBEKF
$P_{0} = diag ([25, 10^{- 5}])$	Raw data	13.2	8.2	7.1	6.3
$P_{0} = diag ([25, 10^{- 5}])$	With noise	20.1	12.2	7.3	6.5
$P_{0} = diag ([50, 10^{- 5}])$	Raw data	13.2	11.3	11.6	6.8
$P_{0} = diag ([50, 10^{- 5}])$	With noise	20.1	12.9	11.8	7.2
$P_{0} = diag ([100, 10^{- 5}])$	Raw data	13.2	18.3	14.2	6.4
$P_{0} = diag ([100, 10^{- 5}])$	With noise	20.1	18.5	14.9	6.6

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Hou, X.; Qiao, Y.; Zhang, B.; Yang, Y. Robust Underwater Direction-of-Arrival Tracking Based on AI-Aided Variational Bayesian Extended Kalman Filter. Remote Sens. 2023, 15, 420. https://doi.org/10.3390/rs15020420

AMA Style

Hou X, Qiao Y, Zhang B, Yang Y. Robust Underwater Direction-of-Arrival Tracking Based on AI-Aided Variational Bayesian Extended Kalman Filter. Remote Sensing. 2023; 15(2):420. https://doi.org/10.3390/rs15020420

Chicago/Turabian Style

Hou, Xianghao, Yueyi Qiao, Boxuan Zhang, and Yixin Yang. 2023. "Robust Underwater Direction-of-Arrival Tracking Based on AI-Aided Variational Bayesian Extended Kalman Filter" Remote Sensing 15, no. 2: 420. https://doi.org/10.3390/rs15020420

APA Style

Hou, X., Qiao, Y., Zhang, B., & Yang, Y. (2023). Robust Underwater Direction-of-Arrival Tracking Based on AI-Aided Variational Bayesian Extended Kalman Filter. Remote Sensing, 15(2), 420. https://doi.org/10.3390/rs15020420

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Robust Underwater Direction-of-Arrival Tracking Based on AI-Aided Variational Bayesian Extended Kalman Filter

Abstract

1. Introduction

2. Problem Formulation

2.1. Kinematic Model of the DOA Tracking Process

2.2. Measurement Model Based on the Received Signal of UCA

3. Methods

3.1. EKF for DOA Tracking

3.2. VB-EKF for Robust DOA Tracking

3.2.1. Choice of Prior Distribution

3.2.2. Variational Approximations of Posterior PDFs

3.3. AI-VBEKF for Robust DOA Tracking

3.3.1. Input Data Processing

3.3.2. Attention-Based Deep Convolutional Neural Network

3.3.3. Design of the AI-VBEKF

3.3.4. Performance Metrics

4. Results and Discussions

4.1. Simulations

4.1.1. Simulation Scenario and Data Set Generation of the Attention-Based DCNN

4.1.2. Data Set Generation and Training of the Attention-Based DCNN

4.1.3. Simulation Results

4.2. Experimental Scenario

4.2.1. Experimental Scenario

4.2.2. Verification of the AI-VBEKF for DOA Tracking

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI