1. Introduction
Kalman filters were first introduced into onboard guidance and navigation systems during NASA’s Apollo Project in the 1960s [
1]. However, digital simulations of the filter for onboard trajectory estimation using finite-precision arithmetic uncovered numerical stability problems. The filter simulations on the IBM 704 computer with 36-bit floating-point arithmetic were determined to be numerically unreliable. On the Apollo flight computer constrained with only 15-bit fixed-point arithmetic operations, onboard implementation of the filter was infeasible [
2]. Since then, efforts to improve filter accuracy and stability under finite-precision hardware constraints have been a focus for practical realization of navigation filters. Square-root filtering was invented as a solution for execution on the Apollo guidance computer to solve the historic circumlunar navigation problem [
3]. Square-root algorithms for state estimation were well-established by the 1970s [
4,
5] to address numerical stability issues due to quantization effects. With technological advancements in microprocessors and hardware accelerators, research on navigation filters continued to focus on high-speed, hard real-time calculations [
6]. Recent research on quantized filtering algorithms has been enabled by optimized software packages, dedicated hardware for parallel computing, and purpose-built solutions [
7,
8,
9].
The quantization problem in state estimation and optimal control has been widely studied in the literature [
10,
11,
12]. Quantization is commonly utilized in wireless sensor networks (WSNs) with low-quality sensors with limited computation and communication capabilities, and where transmission bandwidth is severely constrained. A class of Kalman filters where the difference between measurements and its predictions (i.e., innovations) are quantized has been developed for decentralized state estimation [
12,
13]. In these quantized Kalman filters, measurement predictions are distributed from a fusion center to a sensor node, where innovations are quantized and sent back to the center for filter update. The sign of innovation Kalman filter (SOI-KF) [
13] operates by quantizing innovations to only
-bit, which may give rise to large estimation errors. Filters are also proposed for multi-bit transmission, such as the multiple-level quantized innovation Kalman filter (MLQ-KF) [
12]. It is shown that with multi-bit quantization, the performance of the quantized filter can be recovered to nearly match the mean squared errors of standard the Kalman filter [
12]. However, to have such performance, uniformly distributed sensor nodes collectively convey their quantized information to a fusion center for processing into a combined target state estimate. Having just one isolated sensor significantly undermines estimator tracking performance, and a standard Kalman filter operating on high-resolution measurements from even one sensor is relatively more accurate. Work has also been reported in utilizing quantized measurements in the development and analysis of particle filters [
14], unscented Kalman filters [
15], and their fixed-point implementations [
14,
16]. These implementations are purpose-built for accelerating filter algorithms for specific applications and are not easily scalable for generic state estimation problems. To a great extent, the usual practice for high-speed onboard realization of filtering algorithms vastly depends upon the hardware resources and also on the choice of an optimal state estimator. This paper aims to bridge this gap by reformulating minimum mean square estimation algorithms to explicitly account for finite-precision numerical errors in states, inputs, and measurements. By integrating quantization noise models into the filter structure, the proposed approach enables the design of quantized filters that are better suited for fixed-point implementations on embedded systems.
This research focuses on optimal quantized filtering methods with an application focus on optomechanical acceleration sensors for inertial navigation. Optomechanical accelerometers rely on the coupling between the mechanical displacement of a test mass and light captured using an optical detection system [
17]. Such a sensor requires optical data processing on high-speed hardware modules such as Field Programmable Gate Arrays (FPGAs) [
18] for estimation of acceleration forces from test-mass displacement measurements. These precision force measuring units are deployed in geodesic applications, including the Gravity Recovery and Climate Experiment (GRACE) mission [
19], the Laser Interferometer Space Antenna (LISA) [
20], and the Laser Interferometer Gravitational-Wave Observatory (LIGO) [
21].
For deployment on a spacecraft or resource-constrained hardware (e.g., FPGA [
22,
23]), dynamic range and sensor resolution limitations shall restrict the ability of standard filtering algorithms to precisely estimate states, especially when the dynamical system exhibits large variations in state variables. This necessitates the adoption of accurate error models and the use of novel filtering techniques to achieve sufficient estimation accuracy of desired state variables from the sensor measurements [
10,
12,
24]. Furthermore, in scenarios where transmitting high-resolution measurements or performing double-precision operations is impossible or memory-intensive, quantization errors become significant. With increased performance requirements, the noise from the quantization effects has become an important aspect to analyze in efforts to maximize signal-to-noise ratio [
12,
13,
25,
26]. For the state estimation problem of dynamical stochastic processes, finite-precision implementation necessitates estimation to be based on quantized parameters of state, input, and observations. This requires revisiting the state estimation algorithms to include quantization errors in the filter implementation. Although using a sufficiently large word length for real-valued state variables can minimize the effects of quantization, even with a large number of bits, the system cannot be completely detached from finite word-length effects. In some instances, the luxury of large dynamic range for storage and computations may not be practicable. Consequently, the errors due to quantization have to be modeled as process and output noises, which can be accounted for in the system design using classical state-space and estimator modeling schemes.
The key contributions of this work are briefly summarized as follows. Given a finite-precision representation for system variables, linear state estimation filters are proposed. They consist of a (1) minimum variance estimator (least squares), quantized forms of (2) a discrete-time Kalman filter (QDKF), and (3) a square-root Kalman filter (QSRKF). The filter algorithms are applied to estimate states of an optomechanical oscillator model. The model formulation and assumptions are based upon the efforts of Kelly et al. [
27] and the bench top experiments from Hines et al. [
28]. Numerical simulations of the estimators are reported to show the prominence of modeling quantization errors in state estimation filters. The results offer much improved estimator performances over standard implementations of the respective filters when system variables are quantized. Steady-state filter performance is analyzed to provide insights into the best possible filter accuracy achievable for a given numerical representation. A least squares-based dual oscillator model is implemented on the FPGA board for the state estimation of acceleration forces from simulated measurements. A reliable estimator performance is reported by comparing the fixed-point implementation on the FPGA with the floating-point software implementation as a reference. Note that, in this work, quantization noise is from rounding off of numerical data to a desired number of bits. From here on, quantization errors and round-off errors are used interchangeably and refer to the same idea. Moreover, bit overflows are assumed to be negligible due to the careful selection of dynamic ranges for internal variable representation.
The rest of the article is organized as follows. The sensor model is described in
Section 2. The quantization effects are incorporated into the model and state estimation filters are proposed in
Section 3. Respective filter algorithms are derived in
Appendix A,
Appendix B and
Appendix C. Numerical simulations are reported in
Section 4 to illustrate the performance of the proposed filter structures. In
Section 5, an FPGA architecture is proposed for on-board implementation of a dual-oscillator filter for acceleration estimation problem. Implementation results are reported. Concluding remarks are drawn in
Section 6.
4. Numerical Simulations
The optomechanical inertial sensor parameters and the corresponding noise processes modeled in this simulation are highlighted in
Table 1. These parameters are derived from laboratory experiments and sensor benchtop prototypes described in references [
28,
31,
38]. The contributing process noise sources, including thermal, mechanical, and cavity drifts, have been thoroughly investigated in the cited works. The oscillator parameters, process noise terms (
,
) driving the discrete-time state evolution, calibrated sensor bias as well as the readout noise floor [
27], are set to values consistent with the experimental observations.
Noisy position measurements are simulated from true position states corrupted by a stochastic measurement noise process (
) along with an additive quantization process (
). The stochastic processes (
) driving the discrete-time state evolution, the state quantization processes (
), the standard deviation of the calibrated bias estimate (
), and its corresponding quantization process (
) are also presented in
Table 1.
In this simulation, sensor bias is assumed to be estimated through an independent calibration step that is briefly described in
Section 2.2 and detailed in [
27]. The bias estimate
is randomly drawn from
, and the acceleration input is simulated as a sinusoidal signal of frequency
Hz and amplitude
g. In this analysis, the state and bias values are stored with fractional word length of internal registers configured to signed 16 bits (
). This means that the fractional parts of the states and input nodes are rounded off to 16 bits, with the word-length of the measurements being the only variable. The measurements are quantized to fractional lengths (FLs) of 8 to 16 bits (
) to emulate different ADC resolutions available on development boards. To prevent numerical underflows and to maintain precision, the covariance matrix elements are assigned higher word lengths compared with the state and measurement variables. For FPGA implementation, dedicated digital signal processing (DSP) blocks can be utilized to efficiently multiply the covariance matrix elements with the corresponding state or measurement variables. Modern FPGAs typically provide DSP blocks that support multiplication of 18-bit and
-bit operands. To handle higher word lengths of the covariance matrix elements, multiple DSP blocks can be cascaded to perform pipelined multiply-and-accumulate operations, enabling accurate computations while maintaining high throughput.
Figure 1 shows the results for the errors in the estimated acceleration and the corresponding
estimation error bounds from the moving average least squares method. In this figure, the errors and the
bounds for the least squares implementation are compared for measurements of 12 versus
-bit fractional length. The least squares filter structure defined in Equation (
34) supplements the measurement noise covariance with the quantization error covariance that reflects the bit resolution. Hence, the simulation in
Figure 1 demonstrates additional errors in the acceleration estimates due to less precise representation of measurements. The simulation also signifies the expansion of the
bounds, indicative of added uncertainties in the estimated errors due to increased quantization noise in the measurements. For instance, acceleration estimates with the measurements rounded off to a fractional length of
-bits have larger errors and covariance bounds than those of
-bit measurements.
Figure 2 shows the estimation error results for the Kalman filter formulation, where the process noise associated with the acceleration channel is modeled as a zero-mean process with a variance of
. As in the least squares method, the errors and error bounds from the Kalman filter are sensitive to the precision in the measurements. This sensitivity is illustrated in
Figure 2a,b, which respectively compare 8 and
-bit measurements against measurements with
-bit fractional resolution. The analysis suggests that as the measurement precision increases, the errors and the error bounds tend to align statistically with those of a floating-point implementation of the discrete-time Kalman filter, despite finite-precision hardware constraints. In another observation, the least squares filter has lower variance than that of the QDKF for this application. This is because the least squares filter need not account for the process noise associated with the acceleration channel.
Furthermore, accounting for finite word-length effects in the filter implementation not only accurately predicts the uncertainty in the estimation errors but also reduces these errors. Shown in
Figure 3 is discrete-time Kalman filter (DKF) implementation without considering round-off errors in the filter design versus the quantized discrete-time Kalman filter (QDKF) proposed in this work (Algorithm 1). In this comparison, the measurements are quantized to
-bits in both the DKF and the QDKF implementations, but the filter formulation in DKF does not incorporate quantization error covariances as the proposed QDKF algorithm does. Evidently, the QDKF filter resulted in lower error values than the DKF and also higher
bounds to represent increased covariance due to quantization errors. Moreover, the DKF errors are numerically inconsistent with the corresponding error covariances, as its formulation does not account for quantization noise statistics. The same phenomenon is also evident in the least squares estimation, as observed in
Figure 1. Furthermore, numerical round-off errors exacerbate the lack of observability in the bias state, resulting in growing uncertainty in acceleration estimation errors.
4.1. Steady-State Performance
In practice, the Kalman filter is often run for long periods of time. As
and if given input remains within reasonable magnitude, the error covariance (
) converges to a bounded steady-state value
. For a large
k,
, and Equation (
46) satisfies the discrete-time algebraic Riccati equation [
24]:
with
given in Equation (
47).
Figure 4 shows the standard deviation contours for steady-state acceleration estimation errors. These contours illustrate how the steady-state error bounds change as a function of the fractional length in measurement quantization and the process noise associated with the acceleration channel. In the absence of quantization errors, the contour isolines maintain constant values for given levels of process noise. As the numerical precision of measurements increases, the contours asymptotically approach the steady-state performance achieved by floating-point implementation. The sensitivity analysis curves that graphically illustrate the impact of quantization noise on the state estimation accuracy are an important contribution of this work.
The steady-state behavior reflects the filter performance over an extended period, depicting characteristics of the estimation errors once the transient effects have diminished and the error dynamics have stabilized. The steady-state analysis offers a quantitative measure of the best possible accuracy achievable by the filter and is based on sensor parameters, process, and measurement noise characteristics. Consequently, it serves as a valuable tool in sensor design and parameter tuning. In interferometric sensing, the design and modeling of mechanical elements, signal processing, and estimation filters can be tailored for accuracy and reliability using steady-state covariance analysis.
Furthermore, in the context of quantized Kalman filtering, the Mahalanobis distance is chosen as a metric to quantify the impact of quantization noise on the consistency of estimation errors. The Mahalanobis distance (
d) between the quantized observation (
) and its prediction (
) reflects the variance between them, scaled by the inverse of the associated covariance matrix (
). That is,
where, from Algorithm 1,
Figure 5 illustrates the Mahalanobis distance of the estimates from the QDKF method for observations with varying fractional lengths over time. The Mahalanobis distance quantifies the dissimilarity between the estimated and true states, considering the covariance of the estimation errors. A smaller Mahalanobis distance indicates better filter performance, as the estimated states are closer to the true states. To visualize the trend, a moving average of 10,000 Mahalanobis distance samples is computed. The results demonstrate that higher quantization noise in the observations leads to greater disparities between the observed and predicted measurements. While sequential filtering helps the filter learn the quantization noise statistics over time, it cannot fully match the lower bound achieved in double-precision simulations. Notably, this metric indicates that 16-bit measurements closely track the lower bound, suggesting an optimal word length for sensor data transmissions in this application.
4.2. Square Root Kalman Filter Simulations
For
-bit measurements, the variances from the quantized DKF were observed to take negative values, which is theoretically nonviable. The SRKF, by construction, avoids loss of positive definiteness of the error covariance matrices and thereby offers resistance to numerical overflows and underflows. For covariance matrices that are appropriately scaled and quantized to avoid numerical degeneracy for this application, the performance of the QSRKF proposed in Algorithm 2 is comparable to that of the QDKF (Algorithm 1).
Figure 6 compares the estimation errors and the corresponding
bounds from QDKF and the QSRKF filters. Although this result indicates that the errors are not significantly reduced using the QSRKF version, the loss of positive definiteness in QDKF is a weakness of the standard implementation. Therefore, the square-root filters are preferable for onboard implementation even though they are computationally burdensome.
At the same time, it is evidently important to model quantization noise sources into the standard SRKF algorithm for reliable filter performance.
Figure 7 compares the performance of the square-root filters with the quantization noise modeled into the filter structure (QSRKF in Algorithm 2) versus a standard SRKF implementation that does account for the round-off error statistics. In this comparison, the measurements are quantized to
-bits in both the QSRKF and the SRKF runs.
Table 2 presents the time-averaged Mahalanobis distance for each filter (DKF, QDKF, SRKF, and QSRKF) as a function of measurement resolutions. The results show that the quantized filters (QDKF and QSRKF) proposed in this work outperform filter implementations that do not account for round-off errors (DKF and SRKF). The quantized filters maintain a lower average Mahalanobis distance, indicating better estimation accuracy and robustness to the effects of measurement quantization. Furthermore, the average Mahalanobis distance decreases as the fractional length increases for all filters. This trend suggests that higher measurement resolutions lead to improved filter performance, as more precise measurements provide better information for state estimation. The last column in the table (DP) represents the average Mahalanobis distance for the ideal filter implementations using double-precision measurements. These values serve as a benchmark for the best achievable performance without quantization effects. The quantized filters (QDKF and QSRKF) approach the performance of the ideal filters, demonstrating their effectiveness in mitigating the impact of measurement quantization.
Numerical simulations show that the Kalman filters appear to be under-confident because of the uncertainty in the evolution of the acceleration state. The least squares filter, however, does not require handling process noise associated with the acceleration state and is therefore much more confident about the estimation errors. Moreover, the process noise of the acceleration input is well studied to have low uncertainty [
28]. Therefore, a least-squares based moving average filter is considered for hardware implementation.
6. Conclusions
In aerospace applications, system states are filtered on onboard embedded compute elements using measurements from a sensor network. The sensors and embedded flight computing systems are resource-constrained, limiting the precision of stored or transmitted data and consequently impacting the signal-to-noise ratio of the filter output. Accurate state estimation in finite-precision embedded implementations depends on the precision of the measurements and the word lengths of the state and input variables stored on the embedded computer.
This work presents an optomechanical sensor model for estimating forcing accelerations from simulated displacement measurements of a proof mass. The state estimation algorithms are reformulated to incorporate rounding errors into classical estimator frameworks. A least squares estimator, a discrete-time Kalman filter, and a square-root Kalman filter are developed for optimal state estimation with quantized measurement, state, and input variables. Numerical simulations demonstrate that the modified filter frameworks account for finite-precision effects, ensuring proper management of errors and uncertainties in the acceleration estimates. This approach maximizes the performance of filters implemented on fixed-point hardware architectures. Steady-state performance analysis shows that the best possible accuracy achievable by the filter is tightly coupled with the numerical precision of the internal variables. Metrics such as Mahalanobis distance give concrete insights into the word-length versus performance trade-offs.
Additionally, a dual-oscillator system for estimating acceleration states from independent measurements belonging to two oscillator models is proposed for hardware implementation. A covariance-weighted average of independent acceleration estimates is realized on an FPGA-SoC using a finite-precision implementation of the least squares method. The pipelined FPGA realization with simulated model and measurements reasonably tracks a double-precision MATLAB implementation of the same least squares-based estimation. In summary, this article addresses the realization of state estimation on embedded architectures, emphasizing the importance of managing finite word-length implementation errors to design and implement high-performance, resource-efficient estimation algorithms on memory-constrained computing systems.
It is worth noting that while this work thoroughly addresses quantization effects, the potential impacts of bit overflows are neglected. Scaling digital filter realizations to prevent overflow errors remains an avenue for future investigation.