3.1. Error Representation of Filter Equations
Since the error covariance of stochastic variables plays the key role in geometric analysis in the Euclidean space, this section defines the errors in the states and the measurement residual in order to rewrite the state update and observation equations in terms of error variables.
In the above, the three error terms denote the CBPKF analysis error, the KF analysis error, and the forecast error, respectively; the innovation is also defined; and the four state variables represent the truth, the CBPKF-updated state, the KF-updated state, and the state forecast, respectively.
State update equations for the KF and the CBPKF are rewritten as Equations (27) and (28), assuming that the same a priori state is used in both filters.
where the two gain matrices are the Kalman gains of the KF and the CBPKF, respectively.
The updated states from the two filters have the following mathematical relationship.
The observation model in Equation (3) can be rewritten as follows:
From Equation (3), the covariance matrix of the residual can be written in terms of the covariance matrix of the state forecast and the covariance of the observation error.
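In the standard linear-Gaussian setting, and in generic notation assumed here for illustration (not necessarily the symbols of the original equations), with observation model z = Hx + v, forecast error covariance Σ, and observation error covariance R, the residual covariance takes the familiar form, assuming independent forecast and observation errors:
$$\mathrm{Cov}\!\left(z - H\hat{x}^{f}\right) = H\,\Sigma\,H^{\mathsf{T}} + R.$$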
3.2. KF and CBPKF Solutions for a Bi-State Model
A bi-state model is used to illustrate the geometric relation of errors in states and observations from the CBPKF and the KF. An observation exists for the first state but not for the second in order to investigate how state updating can be illustrated in the two-dimensional (2-D) Euclidean space for observable as well as unobservable states; a correspondingly simple observation matrix is used. Kronhamn [19] used the same model and observation matrices as those described in this Section for the geometric illustration of the KF, whereas we focus on the CBPKF in reference to the KF. For the bi-state model with a given forecast error covariance and observation error variance, the matrices of the Kalman gain (Equations (32) and (33)), the analysis error covariance (Equations (34) and (35)), and the correlation (Equations (36)–(38)) for the CBPKF and the KF are evaluated in the equations below, where the off-diagonal term of the forecast error covariance is written as the product of the two forecast error standard deviations and the correlation coefficient between the forecast errors of the two states.
Equation (32) indicates that, for the observable state, the CBPKF gain is always larger than its KF counterpart, whereas for the unobservable state the sign of the gain depends on the sign of the forecast error correlation. The CBPKF analysis error covariance matrix and its relation to the KF equivalent are given in Equation (34).
In Equation (34), the variances of both updated states are larger than their KF equivalents, and the sign of the covariance term depends on the sign of the forecast error correlation. Equations (34) and (35) imply that the CBPKF-updated state ensembles have larger spreads in the state space than the KF-updated ensembles, and that the matrix norm of the CBPKF analysis error covariance is larger than that of the KF. From the covariance matrices, the Pearson product-moment correlation matrix can be computed by Equation (36), where diag( ) denotes a diagonal matrix. Equation (37) shows that the correlation coefficient of the CBPKF-updated states can be either larger or smaller than the KF equivalent.
From Equations (32)–(38), limiting relationships among the CBPKF and KF gains, analysis error variances, and correlations follow directly for special values of the forecast error correlation.
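To make the bi-state setting concrete, the following Python sketch evaluates the standard KF gain, analysis error covariance, and updated-state correlation for an assumed forecast error covariance and a scalar observation of the first state only; the numerical values, variable names, and the observation operator are illustrative assumptions, and the CBPKF counterparts in Equations (32)–(38) would be obtained by replacing the gain below with the CB-penalized gain.

```python
import numpy as np

# Illustrative forecast error covariance for the bi-state model (assumed values).
s1, s2, rho = 2.0, 3.0, 0.5              # forecast error standard deviations and their correlation
Sigma = np.array([[s1**2,     rho*s1*s2],
                  [rho*s1*s2, s2**2   ]])  # forecast error covariance

H = np.array([[1.0, 0.0]])               # only the first state is observed
R = np.array([[1.0]])                    # observation error variance (assumed)

# Standard KF gain, analysis error covariance, and updated-state correlation.
S = H @ Sigma @ H.T + R                  # innovation covariance
K = Sigma @ H.T @ np.linalg.inv(S)       # KF gain (2x1)
P = (np.eye(2) - K @ H) @ Sigma          # KF analysis error covariance
Dinv = np.diag(1.0 / np.sqrt(np.diag(P)))
C = Dinv @ P @ Dinv                      # Pearson correlation matrix of the updated states

print("KF gain:\n", K)
print("KF analysis covariance:\n", P)
print("Updated-state correlation:\n", C)
```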
3.3. Geometric Representation of KF and CBPKF Solutions
This Section begins by describing the relation between stochastic variables and vectors in the Euclidean domain in Equations (39)–(41), and then describes the geometric representation of the KF and CBPKF equations. The covariance of two stochastic variables can be used to compute the scalar product of the two corresponding vectors in the Euclidean domain (Equation (39)). The vector norm corresponds to the standard deviation of the stochastic variable (Equation (40)). The angle between the two vectors can be computed from the correlation of the two stochastic variables (Equation (41)) [19,22].
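In generic notation (assumed here for illustration; the paper's own symbols appear in Equations (39)–(41)), these correspondences may be summarized as:
$$\langle \mathbf{u}, \mathbf{v} \rangle = \mathrm{Cov}(U, V), \qquad \lVert \mathbf{u} \rVert = \sigma_{U}, \qquad \cos\angle(\mathbf{u}, \mathbf{v}) = \frac{\mathrm{Cov}(U, V)}{\sigma_{U}\,\sigma_{V}} = \rho_{UV}.$$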
Figure 1 shows the vector representation of stochastic variables in the Euclidean space.
Figure 2 shows the error vectors from (a) the KF and (b) the CBPKF for the observable state of the bi-state model; the relations annotated in the figure follow from the definitions above, with the last equality given by Equation (30). In Figure 2a, the state forecast error vector is orthogonal to the observation error vector owing to the independence assumption. Being a minimum variance solution, the KF analysis error vector is orthogonal to the gain-weighted innovation [19,23].
Figure 2a also shows that the forecast error is the vector sum of the gain-weighted innovation and the analysis error, as expected from Equation (27). The KF analysis error variance may then be obtained in Figure 2a via the Pythagorean theorem, as expected from Equation (35). Applying the Pythagorean theorem in the same way to Figure 2b gives the CBPKF analysis error variance, as expected from Equation (34). In Figure 2, the CBPKF analysis error variance is larger than the KF analysis error variance because the CBPKF solution minimizes not the error variance but a weighted sum of the error variance and the variance of the Type-II CB.
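For reference, in the illustrative notation used above (the symbols are assumptions, not necessarily the paper's), the Pythagorean relation for the KF panel (Figure 2a) reads:
$$\lVert \boldsymbol{\varepsilon}^{f} \rVert^{2} = \lVert K\,\mathbf{d} \rVert^{2} + \lVert \boldsymbol{\varepsilon}^{KF} \rVert^{2},$$
with K the KF gain and d the innovation, so that the analysis error variance equals the forecast error variance minus the variance of the gain-weighted innovation; the corresponding relation for the CBPKF panel (Figure 2b) is constructed analogously in the original equations.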
Figure 3 is the same as Figure 2 but for the unobservable state of the bi-state model. In Figure 3a, the proportionality among the error vectors gives the KF gain for the unobservable state, as expected from Equation (33). The orthogonality and equality relations in Figure 3a then yield the KF analysis error variance for the unobservable state, in agreement with Equation (35). In Figure 3b, the CBPKF analysis error variance for the unobservable state may be written via the Pythagorean theorem as Equation (42). Using Equations (35), (32), and (31), we may rewrite Equation (42) in a form that agrees with Equation (34).
Figure 2 and Figure 3 show that the KF- and the CBPKF-updated state error vectors point in different directions in the state space. Figure 4 overlays the updated state error vectors of Figure 2 and Figure 3 to visually compare the differences in angle, magnitude, and direction. The angles between the state error vectors for the CBPKF (Equation (44)) and the KF (Equation (45)) can be computed from the correlations in Equations (37) and (38), respectively. Below, we develop a set of geometric expressions in the 2-D state space for the analysis error covariance via eigenvalue decomposition (EVD, [24]).
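As a quick numerical illustration (the correlation values below are assumed, not taken from the paper), the angle between two error vectors follows from the arccosine of their correlation coefficient, consistent with Equation (41):

```python
import numpy as np

rho_cbpkf, rho_kf = 0.6, 0.4                       # assumed correlation coefficients
theta_cbpkf = np.degrees(np.arccos(rho_cbpkf))     # angle between CBPKF-updated state error vectors
theta_kf = np.degrees(np.arccos(rho_kf))           # angle between KF-updated state error vectors
print(theta_cbpkf, theta_kf)                       # ~53.1 and ~66.4 degrees
```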
3.4. Geometric Analysis in the State Space
Geometric characteristics of state ensembles in the 2-D state space can be quantified by confidence regions (CR), eigenvectors, eigenvalues, and the angle between the eigenvector and the basis vector of the x-axis. Assuming normal distributions for the state ensembles in a 2-D state space, a CR, or covariance ellipse, can be constructed based on the EVD of the covariance matrix together with the Chi-Square probability table. The presence of the CB results in different variances (eigenvalues) and directions (eigenvectors) of the updated state ensembles in the 2-D state space. The major and minor axis lengths of the covariance ellipse are twice the square root of the product of the Chi-Square value and the largest and smallest eigenvalues, respectively, where the Chi-Square value is taken from the Chi-Square probability table for a given confidence level, e.g., 4.605 for a 90% confidence region with 2 degrees of freedom. In a 2-D state space, the error in the orientation of the covariance ellipse with respect to the truth can be estimated by the angle, θ, between the largest eigenvector and the vector connecting the truth and the ensemble mean.
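The construction above may be sketched in Python as follows; the covariance matrix, confidence level, and variable names are illustrative assumptions.

```python
import numpy as np
from scipy.stats import chi2

P = np.array([[4.0, 1.2],
              [1.2, 2.0]])               # assumed 2x2 analysis error covariance
conf = 0.90                              # confidence level for the ellipse

evals, evecs = np.linalg.eigh(P)         # eigenvalues in ascending order, eigenvectors in columns
c = chi2.ppf(conf, df=2)                 # 4.605 for a 90% region with 2 degrees of freedom

major_axis = 2.0 * np.sqrt(c * evals[1])                   # full length of the major axis
minor_axis = 2.0 * np.sqrt(c * evals[0])                   # full length of the minor axis
angle = np.degrees(np.arctan2(evecs[1, 1], evecs[0, 1]))   # orientation of the largest eigenvector
                                                           # relative to the x-axis basis vector
print(major_axis, minor_axis, angle)
```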
The EVD of the CBPKF analysis error covariance may be written as the product U E Uᵀ, where U is the eigenvector matrix and E is the eigenvalue matrix. The eigenvector matrix U rotates the white data (W), or uncorrelated standard normal variates, by the angle θ. The eigenvalue matrix E explains the variance along the principal error directions, i.e., the directions of the eigenvectors. In a 2-D state space, the square root of E is a scale factor applied to W. The dataset (D) resulting from scaling W by the square root of E and rotating by U has the covariance matrix U E Uᵀ, i.e., the CBPKF analysis error covariance. Below, we apply the EVD to the CBPKF analysis error covariance from the bi-state model in Section 3.2.
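A minimal Python sketch of this construction (with an assumed covariance matrix) generates white data, scales it by the square roots of the eigenvalues, and rotates it by the eigenvector matrix:

```python
import numpy as np

rng = np.random.default_rng(0)
P = np.array([[4.0, 1.2],
              [1.2, 2.0]])                   # assumed analysis error covariance

evals, U = np.linalg.eigh(P)                 # EVD: P = U @ diag(evals) @ U.T
W = rng.standard_normal((2, 100000))         # white data: uncorrelated standard normal variates
D = U @ np.diag(np.sqrt(evals)) @ W          # scale by sqrt(eigenvalues), then rotate by U

print(np.cov(D))                             # sample covariance of D approximates P
```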
With appropriate substitutions of the terms defined above, the CBPKF analysis error covariance in Equation (34) may be rewritten in a form amenable to the EVD. Appendix A describes how eigenvalues and eigenvectors may be evaluated for a 2 × 2 covariance matrix. Using Equation (A11) in Appendix A, the largest eigenvalue of the CBPKF analysis error covariance is obtained, and the difference in the largest eigenvalue between the KF and the CBPKF analysis error covariances follows, where the comparison is against the largest eigenvalue of the KF analysis error covariance. Using Equation (A12), the smallest eigenvalues of the two covariance matrices may be written similarly. Using Equation (A14), the orientation angles of the two covariance matrices may be written as well, where the reference vector is the basis vector of the x-axis and the angle is that between the largest eigenvector and this basis vector. With the geometric attributes established above, we now carry out the comparative geometric analysis of the KF and the CBPKF analysis results using the Lorenz 63 model [25].
With the EVD of the analysis error covariance and the ensemble mean, the minimum percentage confidence required to contain the verifying truth within the confidence region can be computed from Equation (55), in which the Chi-Square probability with 2 degrees of freedom is used; the minimum percentage confidence is the value that satisfies Equation (55). Figure 5 shows an example of computing these attributes, including the eigenvalues and eigenvectors of a covariance matrix.
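One plausible way to evaluate this quantity, sketched here under the assumption that Equation (55) relates the squared Mahalanobis distance of the truth from the ensemble mean to the Chi-Square quantile (the values and names below are illustrative), is:

```python
import numpy as np
from scipy.stats import chi2

x_mean = np.array([1.0, 2.0])              # assumed ensemble mean
x_truth = np.array([2.5, 1.0])             # assumed verifying truth
P = np.array([[4.0, 1.2],
              [1.2, 2.0]])                 # assumed analysis error covariance

d2 = (x_truth - x_mean) @ np.linalg.inv(P) @ (x_truth - x_mean)  # squared Mahalanobis distance
alpha_min = 100.0 * chi2.cdf(d2, df=2)     # smallest confidence region (%) that contains the truth
print(alpha_min)
```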
3.5. Numerical Experiment with the Lorenz 63 Model
In the sections above, a linear two-state model was used for theoretical simplicity. In this section, we use the three-state Lorenz 63 model to illustrate the differences between the EnKF and the CBEnKF solutions in terms of the geometric attributes introduced above. In this experiment, synthetically generated observations of all three states of the Lorenz 63 model were assimilated at every time step using the EnKF and the CBEnKF [26]. Preliminary experiments suggested that observation error variances of 10 or 400 can be used for the cases of assimilating less uncertain or largely uncertain observations, respectively, based on the ensemble spread. To render the assimilation problem more challenging, the observation error variance of 400 is used to compare the performance of the CBEnKF with that of the EnKF in Figure 6, Figure 7, Figure 8, Figure 9, Figure 10 and Figure 11, where the ensemble size used is 2000 to minimize filter performance degradation owing to a small ensemble size. Figure 12 and Figure 13 compare assimilation results from the two filters for observation error variances of 10 or 400 as a function of the ensemble size, where the ensemble sizes used are 10, 20, 30, 50, 70, 100, 200, 300, 500, 700, 1000, and 2000.
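For reference, a minimal sketch of the Lorenz 63 dynamics and of synthetic observation generation is given below; the standard parameter values, the time step, the initial condition, and the integration scheme are assumptions for illustration, as the paper's exact configuration is not specified in this section.

```python
import numpy as np

def lorenz63(x, sigma=10.0, rho=28.0, beta=8.0 / 3.0):
    """Time derivative of the three-state Lorenz 63 model (standard parameters assumed)."""
    return np.array([sigma * (x[1] - x[0]),
                     x[0] * (rho - x[2]) - x[1],
                     x[0] * x[1] - beta * x[2]])

def step(x, dt=0.01):
    """Advance one step with fourth-order Runge-Kutta (assumed time step)."""
    k1 = lorenz63(x)
    k2 = lorenz63(x + 0.5 * dt * k1)
    k3 = lorenz63(x + 0.5 * dt * k2)
    k4 = lorenz63(x + dt * k3)
    return x + dt / 6.0 * (k1 + 2 * k2 + 2 * k3 + k4)

# Generate a short truth trajectory and synthetic observations of all three states.
rng = np.random.default_rng(0)
obs_var = 400.0                               # largely uncertain observations, as in Figures 6-11
x = np.array([1.0, 1.0, 1.0])                 # assumed initial condition
truth, obs = [], []
for _ in range(1000):
    x = step(x)
    truth.append(x)
    obs.append(x + rng.normal(0.0, np.sqrt(obs_var), 3))
truth, obs = np.array(truth), np.array(obs)
```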
Figure 6 presents the results for two of the three states of the Lorenz 63 model in a 2-D state space. The attributes shown include the geometric attributes defined above together with the ensemble mean errors, which reflect errors in the ensemble mean measured along the horizontal and vertical directions of the state space, respectively; the large dots overlaid in the scatter plots represent the mean values of the samples in each of the ten bins that equally divide the entire state space.
Figure 7 and Figure 8 are the same as Figure 6 but for the other two 2-D state spaces, respectively. The following summarizes general observations from Figure 6, Figure 7 and Figure 8. The spread of the ensemble mean errors in the scatter plots shows that the CBEnKF reduces CBs more effectively than the EnKF or the OL, particularly at the extremes. In some cases, however, their mean values appear similar in certain state spaces. The OL ensemble mean errors show patterns in the state space similar to those of the state-space plot in the top left of Figure 6, Figure 7 and Figure 8; this indicates the notable dependency of the ensemble mean errors on the model dynamics, e.g., the larger ensemble mean errors at the extremes, which are also seen in the EnKF solutions but less so in the CBEnKF. This may be explained by the CBEnKF assigning a larger weight to the observations than the EnKF in the case of largely uncertain observations (observation error variance of 400), which reduces the reliance of the CBEnKF solutions on the model dynamics. Based on the eigenvalues, the CBEnKF covariances are generally larger than the EnKF's in all state spaces. The larger covariances and smaller ensemble mean errors of the CBEnKF relative to the EnKF yield a consistently smaller minimum percentage confidence than the EnKF at both the extremes and the median of all three variables. This signifies the benefit of using the CB-informed KF for the estimation of extremes, given that the EnKF's minimum percentage confidence quickly increases towards the extremes, i.e., the EnKF is less confident in estimating extremes than the CBEnKF. For example, the 3% confidence regions for selected extreme values presented in the bottom-right plots show the truth (green dots) contained within the CBEnKF's confidence regions (red ellipses) but not within the EnKF's (blue ellipses). In these plots, the arrows indicate the principal directions of the error covariance. The orientation-angle attributes do not clearly indicate differences between the two filters.
Figure 9 shows the attributes of Figure 6, Figure 7 and Figure 8 as a function of the exceedance probability to highlight the CB-informed KF performance at the extremes. At the extremes, i.e., at low exceedance probabilities, the differences between the CBEnKF and the EnKF are vivid in the ensemble mean errors. The CBEnKF's error covariance, on the other hand, is consistently larger than those of the EnKF and the OL across exceedance probabilities. As the exceedance probability increases, the EnKF's ensemble mean errors become similar to the CBEnKF's, implying that the two filters are similarly unbiased in the unconditional sense. The CBEnKF keeps the minimum percentage confidence consistently low at all exceedance probabilities owing to its small ensemble mean errors and large error covariance, compared to the EnKF or the OL. Since the EnKF seeks orthogonal solutions that minimize the analysis error covariance, its covariance is always smaller than the OL's as well as the CBEnKF's. The CBEnKF, on the other hand, increases the analysis error covariance to address CBs, which helps keep the minimum percentage confidence low enough to contain the truth. The orientation-angle attributes show no consistent patterns across the different state spaces or exceedance probabilities.
In Figure 10, the ensemble mean error time series indicates that the CBEnKF's improvement differs among the three variables: the improvement is the largest for one state, mainly remedies the underestimation seen in the EnKF for another, and is slight for the third. These observations may imply that different amounts of CB are present in different states, hence the need to apply a separate weight to the CB penalty for each individual state, which warrants a future effort. To compare the corresponding matrices from the two filters, the time series of their Frobenius norms are computed by Equations (56) and (57), respectively. Compared to the EnKF, the CBEnKF yields both norms consistently larger at all assimilation cycles, with mean values five and three times larger, respectively.
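As a small illustration (the array name, shape, and values below are assumed), the Frobenius norm of a covariance time series can be computed at every assimilation cycle as:

```python
import numpy as np

# P_series: assumed array of analysis error covariance matrices, shape (n_cycles, 3, 3)
P_series = np.stack([np.eye(3) * (1.0 + 0.1 * k) for k in range(100)])

frob = np.linalg.norm(P_series, ord="fro", axis=(1, 2))  # Frobenius norm at each assimilation cycle
print(frob.mean())                                       # mean over all cycles
```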
Figure 11 shows the mean Frobenius norms as a function of the exceedance probability. At the extremes, both the CBEnKF and the EnKF show mean norms larger than those at high exceedance probabilities, and the large differences in the mean norms between the CBEnKF and the EnKF are consistent across exceedance probabilities.
Figure 6, Figure 7, Figure 8, Figure 9, Figure 10 and Figure 11 are based on the case of largely uncertain observations (observation error variance of 400), where the CBEnKF may supposedly outperform the EnKF. To explore the CBEnKF performance with less uncertain observations (observation error variance of 10) and to assess the sensitivity to the ensemble size, Figure 12 presents results for the combinations of ensemble sizes of 10, 20, 30, 50, 70, 100, 200, 300, 500, 700, 1000, and 2000 with observation error variances of 10 and 400. In Figure 12, the ensemble mean error plots indicate that, with the observation error variance of 10, the accuracy of the ensemble mean continuously increases with the ensemble size both at the extremes (an exceedance probability of 0.1; red and blue dots for the CBEnKF and the EnKF, respectively) and for all data (red and blue lines for the CBEnKF and the EnKF, respectively). With the observation error variance of 10, the EnKF's ensemble mean error is slightly smaller than the CBEnKF's, but the CBEnKF's analysis covariance is slightly larger than the EnKF's; the resulting minimum percentage confidences from both filters are very similar. This implies that, when observations are less uncertain, the EnKF solutions are as accurate and as confident as the CBEnKF solutions at the extremes as well as over the whole range. With ensemble sizes of 200 or larger and the observation error variance of 10, the mean minimum percentage confidence stays at about 1%; with ensemble sizes below 200, it quickly increases as the ensemble size decreases because of inaccurate error covariance estimates with an insufficient ensemble size. When observations are largely uncertain (observation error variance of 400), the CBEnKF clearly shows more accurate ensemble means (smaller ensemble mean errors) and higher confidence in its covariance estimates (smaller minimum percentage confidence) than the EnKF, particularly at the extremes. Compared to the observation error variance of 10, assimilating largely uncertain observations (variance of 400) reduces the accuracy of the covariance estimates, resulting in a larger minimum percentage confidence in both filters, although the CBEnKF's covariance, which addresses the CB, is larger than the EnKF's. With the observation error variance of 400, the ensemble mean errors and the minimum percentage confidence tend to be less sensitive to the ensemble size than with the variance of 10. The orientation-angle attributes show neither consistent patterns nor sensitivity to the ensemble size but are included in Figure 12 for completeness.
Finally, Figure 13 presents the mean Frobenius norms as a function of the ensemble size. The less uncertain observations result in larger norms in both filters owing to the bigger weights given to the observations. With the observation error variance of 400, the CBEnKF maintains relatively large norms to account for the CB, whereas the EnKF's are conspicuously small. Both quantities tend to be little sensitive to the ensemble size, except for the all-data case of the CBEnKF with the observation error variance of 400 (pink line). With largely uncertain observations (variance of 400), the CBEnKF's norm becomes large at the extremes (pink dots) as well as for all data (pink line) at all ensemble sizes used, reflecting the CBs in all states.