1. Introduction
State estimation of nonlinear discrete-time stochastic dynamic systems from noisy or incomplete measurements has been a subject of considerable research interest. The topic plays an important role in signal processing, control, and fault detection of both small and large scale systems in virtually all fields of industry [
1] including power generation and transmission, battery live prediction [
2,
3,
4,
5,
6,
7,
8], automotive [
9,
10], navigation [
11,
12], speech and image processing etc. State estimation methods has also become important in non-technical fields including, for example, economics, biology, or weather forecasting.
In the Bayesian framework, a general solution to the state estimation problem is given by the Bayesian recursive relations (BRRs), which compute the probability density functions (PDFs) of the state conditioned on the measurements. The conditional PDFs fully describe the estimate of the immeasurable state of a nonlinear or non-Gaussian stochastic dynamic system. Besides a set of linear models leading e.g., to the Kalman filter (KF), the BRRs are not analytically tractable and an approximation in their solution must be used. The resulting approximate methods can be divided with respect to the validity of the resulting estimates into global and local filters [
13,
14].
The local filters provide computationally efficient estimates in the form of the conditional mean and covariance matrix with potentially limited performance due to inherent underlying Gaussian assumption, which is not always realistic (as the first two moments usually do not represent a full description of the immeasurable state). These filters are represented by the extended Kalman filter, the unscented Kalman filter, or the stochastic integration filter [
14,
15,
16,
17,
18].
As opposed to the local filters, the global filters provide estimates in the form of conditional PDFs without any assumption on the conditional distribution family. The global filters are capable of estimating the state of a strongly nonlinear or non-Gaussian system but usually at the cost of higher computational demands. Among these, the Gaussian sum filter [
19], the particle filter [
20], and the point-mass filter [
21,
22,
23,
24] have attracted a considerable attention.
The point-mass filter (PMF) considered in this paper is based on a numerical solution to the BRRs using deterministic grid-based numerical integration rules, which means the BRRs are solved and the conditional PDF is computed at the grid points only. Although a conceptual design of the PMF is known from the seventies [
13], an application in the terrain-aided navigation (TAN) [
22] is reported as late as the nineties mainly because of the computational complexity of the filter. Therefore, since the development of the PMF, extensive attention is devoted to the efficient filter design reducing the computational complexity. In the literature, two main directions of reduction can be found;
Design of the Rao-Blackwellized (or marginalized) PMF [
23,
25],
Efficient convolution in the PMF prediction step [
21,
22,
24,
26,
27,
28,
29].
The former approach is based on the decomposition of the state vector into two parts; nonlinearly and linearly modeled. As a consequence, just the nonlinearly modeled part of the state is estimated by the computationally expensive PMF and the remaining part of the state vector is estimated by a set of computationally cheap KFs. The Rao–Blackwellized PMF is thus suitable for possibly high-dimensional models with specific conditionally linear structure and Gaussian noises. The latter approach typically does not restrict the class of the allowed state-space models, but it rather focuses on computationally effective implementation of the convolution, as the most demanding step, in the prediction of the PMF.
Recently, a novel method for the efficient convolution, based on the the density-specific grid (DSG) design, was proposed [
24]. Compared to other methods (typically assuming one equidistant grid), the DSG assumes two different grids (each with different properties);
Denser grid supporting symmetrical vicinity of the conditional PDF mean (where a higher volume of the conditional PDF is expected),
Sparser grid supporting conditional PDF tail region (where a marginal volume of the conditional PDF is expected).
The DSG brings significant improvement to the estimation quality especially for unimodal and rather symmetric conditional PDFs with light tails. If the density is multimodal, heavy-tailed, or heavy-skewed, the benefit of the DSG is marginal.
The goal of the paper is, thus, to design an advanced version of the DSG, further denoted as the density difference grid (DDG). The proposed DDG is based on the differentiation of the conditional PDF at the sparse grid, which allows accurate determination of the regions with a significant spatial change of the PDF. In these regions, the denser grid is created to reasonably well capture the PDF shape. As a consequence, the DDG has an ability to better approximate asymmetric, heavy-tailed, and multi-modal PDFs, i.e., the PDFs with dominant higher-order moments, with a negligible increase of the overall PMF computational complexity.
The rest of the paper is organized as follows. In
Section 2, the system description and a brief introduction to the Bayesian state estimation is presented. Then, in
Section 3, the points-mass filter is overviewed. This overview is accompanied with a short recapitulation of the state-of-the-art grid designs.
Section 4 proposes a novel DDG design and
Section 5 shows the DDG performance in a numerical study. Finally, concluding remarks are drawn in
Section 6.
2. System Description and Bayesian State Estimation
Let the following discrete-time state-space model of a nonlinear stochastic dynamic system
be considered, where the vectors
,
, and
represent the unknown state of the system and the known input and measurement at time instant
k, respectively. The state and measurement functions
and
are known general transformations. Particular realizations of the state and measurement noises
and
are unknown, but their PDFs, i.e., the state noise PDF
and the measurement noise PDF
, are supposed to be known and independent of the known initial state PDF
.
The goal of the state estimation, particularly of the filtering, is to find an estimate of the state
conditioned on all measurements
up to the time instant
k. In the Bayesian framework, the state estimate in the form of the conditional PDF
, is sought. Note that, considering the model (
1), (2), the state transition PDF and the BRRs (
3)–(
5) should be conditioned also on available sequence of the input
. However, for the sake of notational simplicity, the input signal is assumed to be implicitly part of the condition and it is not explicitly stated, i.e.,
,
, and
.
The Bayesian state estimator design stems from the solution of the Bayesian recursive relations (BRRs) for the recursive computation of the conditional PDFs [
15]
where
is the filtering PDF computed by the Bayes’ rule (
3) and
is the one-step predictive PDF computed by the Chapman–Kolmogorov Equation (4). The PDFs
and
are the state transition PDF obtained from system dynamics (
1) and the measurement PDF obtained from measurement Equation (2), respectively. The PDF
is the one-step predictive PDF of the measurement. The recursion (
3), (4) starts from the initial PDF, i.e.,
.
4. Density Difference Grid Design
The DDG design is based on the idea that such part of the support, where the conditional PDF is significantly spatially varying, should be covered by the dense grid (to sufficiently well capture PDF behavior). On the other hand, the remaining part of the support, i.e., the parts where the conditional PDF is almost flat, can be reasonably well approximated by the sparse grid. To realize this basic idea of the DDG, the following three steps should be performed:
Design of the sparse (outer) grid and evaluation of the predictive PMD at these grid points,
Differentiation of the “sparse” predictive PMD,
Design of the dense (inner) grids in the vicinity of such “sparse” grid points, which are associated with a large value of the difference.
Compared to the DSG design discussed in [
24], the DDG design utilizes full information about the shape of the predictive PMD (and not only the moments) and also the denser (inner) grid need not be hyper-rectangular, which further improves the fit of the PDF and its grid support. The steps are particularized below.
4.1. Sparse Grid Design
The sparse grid design takes advantage of the PMF time-update, that the PDF value of a single point
can be counted independently of any other grid points. The sparse grid design is given by Algorithm 2:
Algorithm 2: DDG Sparse Grid |
Define a threshold below which the probability at a grid point is considered to be negligible and can be neglected. The threshold setting can be related to the computation precision of a given platform; State Similarly as in the DSG design [ 24], compute the predictive moments ( 18) and (19) and set the number of standard deviations (STDs) of the marginal distributions of ( 20), which are used for specification of an a priori (hyper-)rectangular region to be covered by the grid points; At the boundaries of the region select a small set of grid points and evaluate the predictive probability at these points. If the probability of the point is (significantly) below the threshold , then the particular bound can be shifted towards the mean. On the other hand, if the probability of the point is (significantly) over the threshold , then the particular bound can be shifted in an opposite direction from the mean. The shifted points are, then, re-evaluated; When all boundary points are properly set, create the smallest possible hyper-rectangle defining including all the boundary points. As a consequence, the resulting region need not be symmetrical with respect to the mean; Define a cardinality of the sparse grid and dense grid . Fill the region by equidistantly placed hyper-rectangular grid points (each sparse grid point is associated with a neighbourhood ). A reasonable choice ;
|
This algorithm is illustrated in
Figure 2 for the two dimensional state-space.
4.2. Differentiation of Sparse Grid PMD
To find the state-space regions, where the conditional predictive PDF
is expected to significantly spatially vary (i.e., the regions to be covered by the dense grid), the PMD evaluated at the sparse grid points is differenced according to Algorithm 3:
Algorithm 3: Sparse Grid PMD Differentiation |
Compute the PMD for sparse grid points according to the convolution ( 14); For each sparse grid point compute the divided difference vector [ 14, 33] according to
where denotes the Euclidean norm and the index vector is selected so that the closest neighbouring points of the j-th point are chosen. It means two neighbouring points are selected for , four neighbouring points for , eight neighbouring points for , etc.; Compute the grid points rating as a norm of the vector ( 21) according to
which characterizes the overall spatial variability of the PDF in the vicinity of the j-th sparse grid point .
|
This algorithm is illustrated in
Figure 3 for the two dimensional state-space.
4.3. Design of Dense Grid
Having defined sparse grid points (Algorithm 2) and computed their difference rating (Algorithm 3), which in fact precisely denotes the state-space regions with significant change or variability of the conditional PDF, it is possible to construct the dense grid according to Algorithm 4:
Algorithm 4: Dense Grid Design |
|
This algorithm is illustrated in
Figure 4 for the two dimensional state-space.
4.4. Summary of DDG Design and Properties
The DDG design is given by the three steps realized by Algorithms 2–4. An example of the final DDG for the PMD shown in
Figure 3 can be found in
Figure 5.
Compared to the recently proposed DSG, the DDG dense (inner) grid need not be rectangular and, thus, it more realistically covers the support of the conditional PDF with the significant volume without any requirement on computation and utilization of a set of higher-order moments. The DDG design also reduces the user interaction and minimizes the number of the user-defined parameters (compared to the standard grid design, the DDG design requires specification of three additional parameters only; the number of sparse grid points and the threshold in Algorithm 2 and splitting ration r in Algorithm 4). On the other, the DDG design is slightly more computationally intensive as illustrated in the following section.