Nonlinear Observability Algorithms with Known and Unknown Inputs: Analysis and Implementation

Martínez, Nerea; Villaverde, Alejandro F.

doi:10.3390/math8111876

Open AccessArticle

Nonlinear Observability Algorithms with Known and Unknown Inputs: Analysis and Implementation

by

Nerea Martínez

^1,2,3 and

Alejandro F. Villaverde

^1,*

¹

BioProcess Engineering Group, IIM-CSIC, 36208 Vigo, Galicia, Spain

²

Department of Applied Mathematics II, University of Vigo, 36310 Vigo, Galicia, Spain

³

Department of Applied Mathematics, University of Santiago de Compostela, 15782 Santiago de Compostela, Galicia, Spain

^*

Author to whom correspondence should be addressed.

Mathematics 2020, 8(11), 1876; https://doi.org/10.3390/math8111876

Submission received: 30 September 2020 / Revised: 14 October 2020 / Accepted: 22 October 2020 / Published: 29 October 2020

(This article belongs to the Special Issue Recent Advances in Differential Equations and Applications)

Download

Browse Figures

Versions Notes

Abstract

:

The observability of a dynamical system is affected by the presence of external inputs, either known (such as control actions) or unknown (disturbances). Inputs of unknown magnitude are especially detrimental for observability, and they also complicate its analysis. Hence, the availability of computational tools capable of analysing the observability of nonlinear systems with unknown inputs has been limited until lately. Two symbolic algorithms based on differential geometry, ORC-DF and FISPO, have been recently proposed for this task, but their critical analysis and comparison is still lacking. Here we perform an analytical comparison of both algorithms and evaluate their performance on a set of problems, while discussing their strengths and limitations. Additionally, we use these analyses to provide insights about certain aspects of the relationship between inputs and observability. We found that, while ORC-DF and FISPO follow a similar approach, they differ in key aspects that can have a substantial influence on their applicability and computational cost. The FISPO algorithm is more generally applicable, since it can analyse any nonlinear ODE model. The ORC-DF algorithm analyses models that are affine in the inputs, and if those models have known inputs it is sometimes more efficient. Thus, the optimal choice of a method depends on the characteristics of the problem under consideration. To facilitate the use of both algorithms, we implemented the ORC-DF condition in a new version of STRIKE-GOLDD, a MATLAB toolbox for structural identifiability and observability analysis. Since this software tool already had an implementation of the FISPO algorithm, the new release allows modellers and model users the convenience of choosing between different algorithms in a single tool, without changing the coding of their model.

Keywords:

observability; identifiability; nonlinear systems; control theory; differential geometry; software

1. Introduction

Mathematical models of ordinary differential equations (ODEs) are used in all areas of science and technology for describing nonlinear systems. The ODEs define the evolution of the system state variables

x (t)

with respect to time, while the measurable quantities

y (t)

are defined by the output function. The model equations (ODEs and output function) may contain unknown parameters

θ,

and external inputs that may be known (

u (t)

) or unknown (

w (t)

). The structure of the model equations determines whether it is possible to estimate the model unknowns from the outputs. The theoretical possibility of inferring the states (respectively parameters) from the outputs is called observability (respectively structural identifiability) [1,2]. Since a parameter can be considered as a state variable with time derivative equal to zero, structural identifiability can be considered as a particular case of observability. Additionally, the possibility of recovering the unknown inputs is called reconstructibility or input observability. For simplicity, in this manuscript we use the word observability for all model unknowns, that is, to refer to the possibility of determining states, parameters, and/or inputs from the output. We assume that the model structure—the ODEs and output function—is known and correct. The assessment of the possible existence of alternative structures capable of generating exactly the same output is a different albeit related problem [3] that is not addressed in the present article.

The concept of observability arose in systems and control theory. It was initially defined for linear models and extended to the nonlinear case afterwards [4]. The concept of structural identifiability, on the other hand, was motivated by the analysis of biological models [5], due to the specific challenges that parameter identification poses in mathematical biology and other biosciences. Hence, many observability analysis methods developed in that context aimed at analysing structural identifiability and were named accordingly, even though they could be applied or adapted to the more general task of analysing observability. Examples of software tools include DAISY [6], COMBOS [7], IdentifiabilityAnalysis [8], STRIKE-GOLDD [9], GenSSI [10], and SIAN [11].

The existence of external inputs affects the observability of a model and determines which methods can be applied for its analysis. A key distinction is between known and unknown inputs, where “known” is interpreted as “quantified”; thus, we are aware of the existence of an unknown input but not of its magnitude. A known input that can be manipulated is also called a control input, or simply a control. An unknown input can be considered as an unmeasured disturbance or as a time-varying parameter. Some techniques are applicable specifically to uncontrolled systems [12], while others allow for the existence of known inputs. To the best of our knowledge, the first works describing algorithms capable of handling unknown inputs were presented by [13,14]. These methods are not applicable to systems in which the outputs are direct functions of the inputs, and do not analyse the observability of the unknown input itself.

To address these issues, two differential geometry algorithms called ORC-DF—observability rank criterion with direct feedthrough [15]—and FISPO—full input, state, and parameter observability [16]—have been recently presented. Both methods are capable of determining the observability of states and parameters, and unknown inputs of nonlinear ODE models. ORC-DF is applicable to affine-in-the-inputs systems, while FISPO does not have this requirement. The latter algorithm is already implemented in the STRIKE-GOLDD toolbox [9].

In the present paper we perform a critical examination of the ORC-DF and FISPO algorithms. First we provide the necessary background on observability analysis and differential geometry in Section 2. Then we perform a theoretical analysis of the two methods in Section 3, showing that they are equivalent for systems without known inputs, and that they differ for other classes of models, as a result of building different observability matrices. Realising the convenience of having both algorithms available in the same software environment, we provide their implementations in a new version of the MATLAB toolbox STRIKE-GOLDD, which is described in Section 4. The new release includes an implementation of ORC-DF, and a seamless integration with the already existing FISPO. Furthermore, it enables the automatic analysis of multi-experiment observability throughout any of the two implemented algorithms. Since ORC-DF and FISPO are symbolic algorithms that can be computationally expensive, in Section 5 we evaluate their performances by applying them to a number of modelling problems of different domains, from mechanical engineering to biology, and report their applicability and computational costs. The analysis of the selected case studies is also helpful for obtaining detailed insights about the inner workings of the algorithms. Finally, we conclude with a discussion of the results in Section 6.

2. Materials and Methods

2.1. Notation and Model Classes

We are interested in studying the observability of nonlinear systems of the following (fairly general) form:

Σ = {\begin{matrix} (1) & \dot{x} = f (x, θ, u, w), \\ (2) & y = h (x, θ, u, w), \end{matrix}

where f and h are nonlinear and analytical (infinitely differentiable) functions of the model variables (which depend on time

t \in I = [0, T),

T > 0

), i.e., system states

x (t) \in R^{n_{x}},

a set of inputs, both known

u (t) \in R^{n_{u}},

and unknown

w (t) \in R^{n_{w}},

and unmeasured parameters

θ \in R^{n_{θ}} .

On the other hand, the system output vector

y (t) \in R^{m}

consists of measured functions of model variables. We assume that

n_{x}, m \geq 1,

while

n_{θ},

n_{u}

, and

n_{w}

are non-negative integers. Note that the dependence of model variables over time is omitted from the Equations (1)–(2) for simplicity of notation.

As a special case of

Σ

we also study affine-in-inputs systems, which are given by the expressions:

Σ_{A} = {\begin{matrix} (3) & \dot{x} = f_{0} (x, θ) + \sum_{i = 1}^{n_{u}} f_{u_{i}} (x, θ) u_{i} + \sum_{i = 1}^{n_{w}} f_{w_{i}} (x, θ) w_{i}, \\ (4) & y = h_{0} (x, θ) + \sum_{i = 1}^{n_{u}} h_{u_{i}} (x, θ) u_{i} + \sum_{i = 1}^{n_{w}} h_{w_{i}} (x, θ) w_{i}, \end{matrix}

where

f_{0}, f_{u_{i}}, f_{w_{j}}

and

h_{0}, h_{u_{i}}, h_{w_{j}}

are—possibly nonlinear—analytical functions for

1 \leq i \leq n_{u},

1 \leq j \leq n_{w}

, and:

\begin{matrix} f = f_{x w} + \sum_{i = 1}^{n_{u}} f_{u_{i}} u_{i}, h = h_{x w} + \sum_{i = 1}^{n_{u}} h_{u_{i}} u_{i}, \end{matrix}

(5)

where, following [15]:

f_{x w} = f_{0} + \sum_{i = 1}^{n_{w}} f_{w_{i}} w_{i}, h_{x w} = h_{0} + \sum_{i = 1}^{n_{w}} h_{w_{i}} w_{i} .

(6)

Note that, as in Equations (3) and (4), the functions

f_{0}, f_{u_{i}}, f_{w_{i}}, h_{0}, h_{u_{i}}, h_{w_{i}}

can depend on

(x, θ)

, but we omit this dependency in the expressions above and elsewhere when convenient.

In what follows, a vector

v \in R^{n}

is assumed to be a one column matrix and

v^{T}

its transpose. The Jacobian matrix of a function

ϕ = (ϕ_{1}, \dots, ϕ_{s})

with respect to a vector field

v = (v_{1}, \dots, v_{n}),

will be denoted as:

\begin{matrix} \frac{\partial ϕ}{\partial v} = {[\frac{\partial ϕ_{i}}{\partial v_{j}}]}_{i j}, 1 \leq i \leq s, 1 \leq j \leq n . \end{matrix}

2.2. Background

2.2.1. Structural Identifiability, Observability, and Differential Geometry

Roughly speaking, a nonlinear system

Σ

is structurally observable if it is possible to distinguish between its state trajectories from the data provided by its output, and structurally reconstructible (or input observable) if its disturbances can be tracked from the aforementioned measurements. Similarly,

Σ

is said to be structurally identifiable if it is possible to infer the values of its unknown parameters from the output. In practice, it is often not necessary distinguish between every pair of unmeasured states in the phase mapping of

Σ

—a property called structural global observability—and it is sufficient to distinguish neighbouring states—a property called local “weak” observability in some texts [4]. In this work we will not further make this distinction, and local “weak” structural observability will be simply called observability. Likewise, we will refer to structural local “weak” identifiability simply as identifiability.

Structural identifiability can be addressed as a particular case of observability. This is because any unknown parameter

θ_{i}

of

Σ

can be considered as a zero-dynamics state, that is, the equation

\dot{θ_{i}} = 0

holds for

1 \leq i \leq n_{p} .

It is then possible to augment the system state as:

\begin{matrix} \bar{x} = {(\begin{matrix} x & θ \end{matrix})}^{T}, \end{matrix}

(7)

which consists of

n_{\bar{x}} = n_{x} + n_{θ}

components and follows the augmented dynamics:

\begin{matrix} \dot{\bar{x}} = {(\begin{matrix} f {(x, θ, u, w)}^{T} & 0_{1 \times n_{p}} \end{matrix})}^{T} = \bar{f} (\bar{x}, u, w) . \end{matrix}

(8)

Thus, the identifiability and observability of

Σ

can be studied as the observability of the augmented system (totally equivalent to the original one) with state vector (7), dynamics (8), and output (2), the same as

Σ .

The algorithms analysed in this work adopt a differential geometry approach, which relies on the use of the concept of Lie derivative to bring out algebraic (and in some sense, geometric) tests suitable to address the study of observability of analytic systems

Σ .

Let us consider first the simpler case in which

Σ

is not dependent on unknown inputs:

\begin{matrix} Σ^{'} \{\begin{matrix} \dot{x} = f (x, θ, u), \\ y = h (x, θ, u) . \end{matrix} \end{matrix}

Definition 1

(Lie derivative [17]). Consider the system

Σ^{'}

with augmented state vector (7) and augmented dynamics (8). Fixing the control variables

u,

the Lie derivative of the output function

h (, u)

in

\bar{x}

along the tangent vector field

\bar{f} = \bar{f} (\cdot, u)

is:

\begin{matrix} L_{\bar{f}} h (\bar{x}, u) = \frac{\partial h}{\partial \bar{x}} (\bar{x}, u) \bar{f} (\bar{x}, u) . \end{matrix}

Furthermore, setting

L_{\bar{f}}^{0} h = h,

the i-order Lie derivative of

h (\cdot, u)

(along the vector field

\bar{f} = \bar{f} (\cdot, u))

can be recursively computed in

\bar{x}

as:

\begin{matrix} L_{\bar{f}}^{i} h (\bar{x}, u) = L_{\bar{f}} (L_{\bar{f}}^{i - 1} h (\bar{x}, u)), i \geq 1 . \end{matrix}

The above definition does not capture the effect of possible time variations in the control variables. To account for it, this notion can be extended as follows:

Definition 2

(Extended Lie derivative [8]). Consider the system

Σ^{'}

with augmented state vector (7), augmented dynamics (8), and assume that the controls

u = (u_{1}, \dots, u_{n_{u}})

are analytical functions on

I = [0, T), T > 0 .

The extended Lie derivative of the output function h in

\bar{x}

by the tangent vector field

\bar{f} = \bar{f} (\cdot, u)

is:

\begin{matrix} L_{\bar{f}}^{e} h (\bar{x}, u) = \frac{\partial h}{\partial \bar{x}} (\bar{x}, u) \bar{f} (\bar{x}, u) + \frac{\partial h}{\partial u} (\bar{x}, u) \dot{u} . \end{matrix}

Moreover, setting

L_{\bar{f}}^{e, 0} h = h,

the i-order extended Lie derivative of h can be recursively computed in

\bar{x}

as:

\begin{matrix} L_{\bar{f}}^{e, i} h (\bar{x}, u) = \frac{\partial L_{\bar{f}}^{e, i - 1} h}{\partial \bar{x}} (\bar{x}, u) \bar{f} (\bar{x}, u) + \sum_{j = 0}^{i - 1} \frac{\partial L_{\bar{f}}^{e, i - 1} h}{\partial u^{j)}} (\bar{x}, u) u^{j + 1)}, i \geq 1, \end{matrix}

(9)

where

u^{j + 1)}

(j \geq 0)

stands for the

(j + 1)

-order time derivative of

u .

Remark 1.

For any fixed controls

u (t),

assuming Σ is complete and denoting

{\bar{x}}_{0}^{u} (t)

the state trajectory such that:

\{\begin{matrix} {\dot{\bar{x}}}_{0}^{u} (t) = \bar{f} ({\bar{x}}_{0}^{u} (t), u (t)), t \in I, \\ {\bar{x}}_{0}^{u} (0) = {\bar{x}}_{0}, \end{matrix}

then, in the case of constant inputs

u (t) = u_{0} \in R^{n_{u}},

the Lie derivative introduced in Definition (1) verifies:

\begin{matrix} y (0) = & h ({\bar{x}}_{0}^{u} (t), u_{0}) (0) = L_{\bar{f}}^{0} h ({\bar{x}}_{0}, u_{0}), \\ y^{'} (0) = & \frac{d}{d t} h ({\bar{x}}_{0}^{u} (t), u_{0}) (0) = \frac{\partial h}{\partial \bar{x}} ({\bar{x}}_{0}^{u} (t), u_{0}) {\dot{\bar{x}}}_{0}^{u} (t) |_{t = 0} = \frac{\partial h}{\partial \bar{x}} ({\bar{x}}_{0}, u_{0}) \bar{f} ({\bar{x}}_{0}, u_{0}) = L_{\bar{f}} h ({\bar{x}}_{0}, u_{0}), \\ y^{″} (0) = & \frac{d}{d t} L_{\bar{f}} h ({\bar{x}}_{0}^{u} (t), u_{0}) (0) = \frac{\partial L_{\bar{f}} h}{\partial \bar{x}} ({\bar{x}}_{0}^{u} (t), u_{0}) {\dot{\bar{x}}}_{0}^{u} (t) |_{t = 0} = \frac{\partial L_{\bar{f}} h}{\partial \bar{x}} ({\bar{x}}_{0}, u_{0}) \bar{f} ({\bar{x}}_{0}, u_{0}) = L_{\bar{f}}^{2} h ({\bar{x}}_{0}, u_{0}), \\ ⋮ \\ y^{i)} (0) = & \frac{d}{d t} L_{\bar{f}}^{i - 1} h ({\bar{x}}_{0}^{u} (t), u_{0}) (0) = L_{\bar{f}} (L_{\bar{f}}^{i - 1} h ({\bar{x}}_{0}^{u} (t), u_{0})) |_{t = 0} = L_{\bar{f}}^{i} h ({\bar{x}}_{0}, u_{0}), i \geq 0, \end{matrix}

which can be proven by using repeatedly the chain rule. Likewise, if

u (t)

are time-varying and

u (0) = u_{0},

the extended Lie derivative of Definition (2) verifies:

\begin{matrix} y^{i)} (0) = L_{\bar{f}}^{e, i} h ({\bar{x}}_{0}, u_{0}), i \geq 0 . \end{matrix}

Given a nonlinear system

Σ^{'}

with augmented state (7) and analytical inputs, it is possible to use the extended Lie derivatives of the output to build (symbolically) the following

m n_{\bar{x}} \times n_{\bar{x}}

matrix:

\begin{matrix} O_{I} (\bar{x}, u) = \frac{\partial}{\partial \bar{x}} {(\begin{matrix} L_{\bar{f}}^{0} h {(\bar{x}, u)}^{T} & L_{\bar{f}} h {(\bar{x}, u)}^{T} & L_{\bar{f}}^{2} h {(\bar{x}, u)}^{T} & \dots & L_{\bar{f}}^{n_{\bar{x}} - 1} h {(\bar{x}, u)}^{T} \end{matrix})}^{T}, \end{matrix}

(10)

which is the observability-identifiability matrix of

Σ^{'} .

Note that the assumption of analytic controls is conservative, since it suffices that they are differentiable up to order

n_{d} = n_{\bar{x}} - 1 .

By calculating the rank of the above matrix, it is possible to establish the observability and identifiability of

Σ^{'}

using the following condition.

Theorem 1

(Observability-identifiability condition, OIC [8]). If the identifiability-observability matrix (10) of a model

Σ^{'}

satisfies

rank (O_{I} ({\bar{x}}_{0}, u)) = n_{\bar{x}},

with

{\bar{x}}_{0}

being a (possibly generic) point in the augmented state space (8) of

Σ^{'}

, then the system is structurally locally observable and structurally locally identifiable.

Remark 2.

The rank of (10) is constant except for a zero-measurement subset in the augmented state space (8) of

Σ^{'}

where the rank is smaller, as a consequence of the system being analytical [18]. Thus, to verify the condition of the Theorem 1 it is sufficient to calculate the rank of (10) at any non-singular point of the phase space.

2.2.2. FISPO

Unknown inputs w can be taken into account by further augmenting the phase mapping of

Σ,

including w as unmeasured states. Thus, for any non-negative integer l we have the l-augmented state vector:

x^{l} = (x, θ, w, \dots, w^{l)}),

(11)

which follows the l-augmented dynamics:

{\dot{x}}^{l} = f^{l} (x^{l}, u) = {(\begin{matrix} f {(x^{0}, u)}^{T} & 0_{1 \times n_{p}} & w^{T} & \dots & {(w^{l + 1)})}^{T} \end{matrix})}^{T},

leading to the l-augmented system:

\begin{matrix} Σ^{l} & \{\begin{matrix} {\dot{x}}^{l} = f^{l} (x^{l}, u, w^{l + 1)}), \\ y = h (x^{0}, u) . \end{matrix} \end{matrix}

(12)

An analogous extension for affine systems

Σ_{A}

exists. Using the notation given in (5) and (6), the l-augmented system described above takes the form [13]:

\begin{matrix} Σ_{A}^{l} \{\begin{matrix} {\dot{x}}^{l} = f_{x w}^{l} (x^{l}, w^{l + 1)}) + \sum_{i = 1}^{n_{u}} f_{u_{i}}^{l} (x, θ) u_{i}, \\ y = h_{x w} (x^{0}) + \sum_{i = 1}^{n_{u}} h_{u_{i}} (x, θ) u_{i}, \end{matrix} \end{matrix}

where the l-augmented dynamics is decomposed as follows:

f_{x w}^{l} (x^{l}, w^{l + 1)}) = {(\begin{matrix} f_{x w} {(x^{0})}^{T} & 0_{1 \times n_{p}} & {\dot{w}}^{T} & \dots & {(w^{l + 1)})}^{T} \end{matrix})}^{T},

(13)

f_{u_{i}}^{l} (x, θ) = {(\begin{matrix} f_{u_{i}} (x, θ) & 0_{1 \times n_{p}} & 0_{1 \times (l + 1) n_{w}} \end{matrix})}^{T}, 1 \leq i \leq n_{u} .

(14)

We note that, in order to build l-augmented systems

Σ^{l}

and

Σ_{A}^{l},

it must be possible to calculate the

l + 1

-time derivative of disturbances

w (t)

, and therefore, they will be considered as analytical functions from now on (again, this hypothesis is more restrictive than necessary). We also note that the l-augmented form of

Σ

and

Σ_{A}

is totally equivalent to the original system, and consists of

n^{l} = n_{x} + n_{θ} + (l + 1) n_{w}

states,

n_{u}

controls,

n_{w}

disturbances (the

l + 1

-order time derivatives of

w

) and m outputs, that have not changed due to state augmentation [13].

As an additional hypothesis we assume that a non-negative integer s exists (possibly

s = + \infty)

such that

w^{s)} (t) \neq 0

and

w^{i)} (t) = 0

for all

i > s .

In principle, this assumption introduces a restriction on the type of allowed inputs, and it is equivalent to assuming that the disturbances are

(s

order) polynomial functions of time. Nevertheless, in practice, the method may still provide relevant information about the analytical case, as is discussed in [16].

In what follows, if a vector function

ϕ = (ϕ_{1}, \dots, ϕ_{r}),

r \geq 1,

depends on variables

x^{l}

we denote:

\begin{matrix} d^{l} ϕ (x^{l}) = & \frac{\partial ϕ}{\partial x^{l}} (x^{l}) = {[\frac{\partial ϕ_{i}}{\partial x_{j}^{l}} (x^{l})]}_{i j}, 1 \leq i \leq r, 1 \leq j \leq n^{l}, \\ L_{f^{l}} ϕ (x^{l}) = & d^{l} ϕ (x^{l}) f^{l} (x^{l}, w^{l + 1)}), \end{matrix}

and if

ϕ = ϕ (\cdot, u)

(where controls u are considered to be analytical on the time interval I), then:

L_{f^{l}}^{e} ϕ (x^{l}, u) = d^{l} ϕ (x^{l}, u) f^{l} (x^{l}, u, w^{l + 1)}) + \frac{\partial ϕ}{\partial u} (x^{l}, u) \dot{u} .

Definition 3

(Full input-state-parameter observability, FISPO [16]). Consider the system Σ and the augmented state vector

z (t) = (x (t), θ, w (t)) .

We say that Σ has the FISPO property if, for every

t_{0} \in I

and

1 \leq i \leq n^{0},

z_{i} (t_{0})

can be locally determined from the output

y (t)

and the known inputs

u (t) = (u_{1} (t), \dots, u_{n_{u}} (t))

in a finite time interval

[t_{0}, t_{f}] \subset I .

Thus, a system Σ is FISPO if, for every

z (t_{0})

and for almost any vector

z^{*} (t_{0}),

there is a neighbourhood

N (z^{*} (t_{0}))

such that, for all

\hat{z} (t_{0}) \in N (z^{*} (t_{0})),

the following condition holds:

y (t, \hat{z} (t_{0})) = y (t, z^{*} (t_{0})) \Rightarrow {\hat{z}}_{i} (t_{0}) = z_{i}^{*} (t_{0}), 1 \leq i \leq n^{0} .

Remark 3.

The original definition of the term FISPO reproduced above refers to a model property. Here we also use it to refer to the algorithm presented for its evaluation by [16].

Using the system augmentation (12) and taking the unique

l = s

such that

w^{s)} (t) \neq 0

and

w^{i)} (t) = 0

for all

i > s,

it is possible to build the following matrix,

O_{I}^{g} (x^{s}, u) = d^{s} {(\begin{matrix} L_{f^{s}}^{0} h {(x^{s}, u)}^{T} & L_{f^{s}} h {(x^{s}, u)}^{T} & L_{f^{s}}^{2} h {(x^{s}, u)}^{T} & \dots & L_{f^{s}}^{n^{s} - 1} h {(x^{s}, u)}^{T} \end{matrix})}^{T},

(15)

which is the generalised observability matrix of

Σ .

Note that (15) coincides with the observability matrix (10) of

Σ^{s}

without disturbances. Thus, the rank of

O_{I}^{g}

provides a condition for assessing the observability of

Σ

as follows:

Theorem 2

(FISPO condition [16]). A nonlinear system Σ given by (1-2) with analytic inputs is FISPO if, for

x_{0}^{s}

being a (possibly generic) point in the state space of the s-augmented system

Σ^{s},

the generalised observability matrix (15) verifies

rank (O_{I}^{g} (x_{0}^{s}, u)) = n^{s} .

Remark 4.

For

1 \leq i \leq n^{s},

the observability of the i-th state of

x^{s}

can also be studied using the matrix (15) Thus, if

O_{I}^{g, i} (x_{0}^{s}, u)

denotes the matrix obtained from

O_{I}^{g} (x_{0}^{s}, u)

after removing its i-th column, state

x_{i}

is observable if

rank (O_{I}^{g, i} (x_{0}^{s}, u)) < rank (O_{I}^{g} (x_{0}^{s}, u))

for almost any

x_{0}^{s}

in the phase space of

Σ^{s} .

2.2.3. ORC-DF

The observability of affine systems

Σ_{A}

with bounded measurable controls

u (t)

can also be analysed by building a different observability matrix, as explained below. For a full description of the procedure, see [15].

Definition 4

(Observability rank criterion for systems with direct feedthrough, ORC-DF). A system

Σ_{A}

is classified as k-row observable if almost any initial state

x^{k} (t_{0}),

t_{0} \in I,

in the state space of the k-augmented system

Σ_{A}^{k}

can be separated locally from its neighbours based on the output at

k + 1

consecutive times

t_{0},

t_{1},

\dots,

t_{k} .

If there exists

k \geq 1

such that

Σ_{A}

is k-row observable, it is said that

Σ_{A}

satisfies the ORC-DF.

Lemma 1.

Consider the system

Σ_{A}

, and the vector field

Ω_{k},

which is recursively defined for

k \geq 0

by:

\begin{matrix} Ω_{0} = & {(\begin{matrix} h_{x w}^{T} & h_{u_{1}}^{T} & \dots & h_{u_{n_{u}}}^{T} \end{matrix})}^{T}, Δ Ω_{0} = Ω_{0}, \\ Δ Ω_{k + 1} = & {(\begin{matrix} L_{f_{x w}^{k}} {(Δ Ω_{k})}^{T} & L_{f_{u_{1}}^{k}} {(Δ Ω_{k})}^{T} & \dots & L_{f_{u_{n_{u}}}^{k}} {(Δ Ω_{k})}^{T} \end{matrix})}^{T}, \\ Ω_{k + 1} = & {(\begin{matrix} Ω_{k}^{T} & Δ Ω_{k + 1}^{T} \end{matrix})}^{T}, \end{matrix}

then

Σ_{A}

is k-row observable if

rank (d^{k} Ω_{k} (x_{0}^{k})) = n^{k}

for almost any

x_{0}^{k}

in the phase space of

Σ_{A}^{k} .

Lemma 2.

If

Σ_{A}

satisfies the ORC-DF, then

Σ_{A}

is observable in the presence of unmeasured inputs.

Corollary 1.

Let

d^{k} Ω_{k}^{i}

denote the matrix that is obtained after removing the i-th column from

d^{k} Ω_{k} .

The i-th state of

x^{k}

is k-row observable if and only if

rank (d^{k} Ω_{k}^{i} (x_{0}^{k})) < rank (d^{k} Ω_{k} (x_{0}^{k}))

for almost any

x_{0}^{k}

in the phase space of

Σ^{k} .

3. Theory: Analysis of the FISPO and ORC-DF Algorithms

In this section we discuss the similarities and differences between FISPO and ORC-DF, whose pseudo-code is provided in Algorithms 1 and 2.

3.1. Preliminary Remarks

We begin by recalling three facts that are relevant for the analysis: (i) The FISPO algorithm does not always require building the full matrix (15) (a full matrix can be obtained for undisturbed systems

Σ^{'}

or assuming

w^{j)} = 0

for

j > s,

where s is a non-negative integer), (ii) Both ORC-DF and FISPO can be inconclusive for certain models, and (iii) ORC-DF and FISPO can handle different types of inputs, although both methods allow the study of observability in the presence of generic (measurable) controls.

Remark 5

(The FISPO algorithm does not always require building the full matrix). In each iteration

k \geq 1

the FISPO algorithm builds the matrix

O_{I}^{k} (x^{k}, u),

composed by extended Lie derivatives of output up to order

k,

and then calculates its rank and partial ranks, instead of directly building the full matrix (15). The algorithm is programmed in this way because the matrix

O_{I}^{k} (x^{k}, u)

can reach full rank for some

k \leq n_{s} - 1

, and if the number of states increases indefinitely

(s = \infty),

(15) can never be built in practice. In addition, the above procedure may classify some states as observable before obtaining the full matrix (15), since any observable state in the k-augmented system

Σ^{k}

remains observable in

Σ^{l},

for

l \geq k

[13]. Moreover, if the system does not have unknown inputs it is possible to classify it as unobservable or unidentifiable using fewer than

n_{\tilde{x}} - 1

Lie derivatives [19].

Remark 6

(Both ORC-DF and FISPO can be inconclusive for certain models). If the model under study has unknown inputs and their time derivatives

w^{s)} (t)

do not vanish for any non-negative integer

s < + \infty,

both FISPO and ORC-DF algorithms can be inconclusive. This happens when the rank of the observability matrices grows at each iteration without reaching a value equal to the number of states (which also increases with each iteration). Therefore, a computational implementation of both algorithms should include shutdown conditions based on computation time or number of iterations. This impediment can also be avoided by assuming that

s < \infty,

which implies:

\begin{matrix} x^{k} = & x^{s}, \\ f^{k} = & {(\begin{matrix} {(f^{s})}^{T} & 0_{1 \times (k - s) n_{w}} \end{matrix})}^{T}, \end{matrix}

for any

k > s,

so the observability matrix constructed by FISPO at k-th iteration is:

O^{k} (x^{k}, u) = d^{k} (\begin{matrix} L_{f^{k}}^{0} h (x^{k}, u) & \dots & L_{f^{k}}^{k} h (x^{k}, u) \end{matrix}) = d^{s} (\begin{matrix} L_{f^{s}}^{0} h (x^{s}, u) & \dots & L_{f^{s}}^{k} h (x^{s}, u) \end{matrix}),

while, in the case of affine-in-inputs systems, ORC-DF can be modified at each iteration

k > s

to account for the above assumption, by taking:

\begin{matrix} x^{k} = & x^{s}, \\ f_{x w}^{k} = & {(\begin{matrix} {(f_{x w}^{s})}^{T} & 0_{1 \times (k - s) n_{w}} \end{matrix})}^{T}, \\ d^{k} Ω_{k} = & d^{s} Ω_{k} . \end{matrix}

Actually, ORC-DF is designed to work with unknown inputs such that

w^{s)} (t)

are piecewise constant for

s = \infty

but, obviously, if

w^{s)} (t) = 0

for a finite integer

s,

this condition is trivially verified.

Algorithm 1 The FISPO algorithm [16].

Algorithm 2 The ORC-DF algorithm [15].

Remark 7

(ORC-DF and FISPO can handle different types of inputs). Both ORC-DF and FISPO construct an observation space generated by Lie derivatives of the output; its dimension determines observability. FISPO builds an observation space spanned by extended Lie derivatives (2) considering analytical inputs, while ORC-DF assumes piecewise constant inputs and exploits certain properties specific to affine systems in order to build a different observation space. If an affine system is classified as observable by ORC-DF or FISPO, it is observable when a generic measurable input is considered [15,19].

In the next subsections we present the main novel insights of our theoretical analysis of the algorithms.

3.2. For Systems without Known Inputs, ORC-DF and FISPO Reduce to the Same Algorithm

Here we prove by induction that, if no inputs u are involved in

Σ_{A},

the FISPO algorithm reduces to ORC-DF (note that

Σ_{A}

is a particular case of

Σ,

so FISPO is directly applicable). Before presenting the result, we remark that, in the case

n_{u} = 0,

the extended Lie derivative reduces to:

L_{f}^{e} (\cdot) = L_{f} (\cdot)

and denoting the composition of functions with ∘, the

k + 1

-order Lie derivative verifies:

L_{f^{k}}^{k + 1} (h) = L_{f^{k}} \circ \overset{k + 1}{\overset{⏞}{\dots}} \circ L_{f^{k}} (h) = L_{f^{k}} \circ L_{f^{k - 1}} \circ \dots \circ L_{f^{0}} (h) = L_{f^{k}} (L_{f^{k - 1}}^{k} h) k \geq 1,

(16)

since

L_{f^{k}}^{j} h

depends only on time derivatives

w^{i)} (t)

for

1 \leq i \leq j \leq k + 1 .

Proof.

Setting

n_{u} = 0

in (5), the dynamics and output of

Σ_{A}

are given by:

\begin{matrix} f (x (t), θ, w (t)) = f_{x w} (x (t), θ, w (t)) \\ h (x (t), θ, w (t)) = h_{x w} (x (t), θ, w (t)) \end{matrix}

Let

k = 0 .

By the recursion given in Lemma (1), it is verified that:

Δ Ω_{0} = h_{x w} = h,

(17)

so the induction hypothesis holds for

k = 0 :

d^{0} Ω_{0} = d^{0} Δ Ω_{0} = d^{0} h = O_{I}^{0} .

Consider now any non-negative integer

k \geq 0

and suppose that the induction hypothesis holds for

0 \leq j \leq k,

then:

d^{k + 1} Ω_{k + 1} = d^{k + 1} {(\begin{matrix} Ω_{k}^{T} & Δ Ω_{k + 1}^{T} \end{matrix})}^{T} = (\begin{matrix} d^{k} Ω_{k} & 0 \\ d^{k + 1} Δ Ω_{k + 1} \end{matrix}) = (\begin{matrix} O_{I}^{k} & 0 \\ d^{k + 1} Δ Ω_{k + 1} \end{matrix})

and the result is proven if for every

k \geq 0

it holds that:

Δ Ω_{k + 1} = L_{f^{k}}^{e, k + 1} h = L_{f^{k}}^{k + 1} h .

(18)

The above equality is fulfilled for

k \geq 0 .

Indeed, for

k = 0,

using (17) we have:

Δ Ω_{1} = L_{f_{x w}^{0}} (Δ Ω_{0}) = L_{f_{x w}^{0}} h = L_{f^{0}} h,

so, if

k \geq 0

and the condition (18) holds for

0 \leq j \leq k - 1,

then:

Δ Ω_{k + 1} = L_{f_{x w}^{k}} (Δ Ω_{k}) = L_{f^{k}} (Δ Ω_{k}) = L_{f^{k}} (L_{f^{k - 1}}^{k} h) = L_{f^{k}}^{k + 1} h

where in the last equality we have applied (16). Thus, condition (18) holds for

k \geq 0 .

□

3.3. For Systems with Known Inputs, ORC-DF and FISPO Lead to Different Observability Matrices

Excluding the case

n_{u} = 0,

an important difference between the observability matrices built by algorithms ORC-DF and FISPO is the number of Lie derivatives (rows) they include in each iteration. Indeed, for

k \geq 0,

O_{I}^{k} (x^{k}, u) \in m (k + 1) \times n^{k}, d^{k} Ω^{k} (x^{k}) \in \sum_{i = 0}^{k} m {(1 + n_{u})}^{i + 1} \times n^{k},

so the observability matrix constructed by FISPO grows in m rows in each iteration, while the matrix constructed by ORC-DF includes

m {(1 + n_{u})}^{k + 1}

new rows in the k-th stage. This fact can be an advantage for ORC-DF, as it makes it possible to reach full rank more rapidly, i.e., with lower order Lie derivatives. However, it may also be a disadvantage if this growth makes the problem dimension increase rapidly while adding little new information. Hence the faster growth may be beneficial or not depending on the form of the mathematical expressions of the dynamics and output functions in which the known inputs are present. For example, suppose that there exists an integer

1 \leq i \leq n_{u}

and

n_{u} - 1

real numbers

λ_{j}

not simultaneously zero, such that:

f_{u_{i}} = \sum_{j \neq i = 1}^{n_{u}} λ_{j} f_{u_{j}},

which, using (13), implies:

f_{u_{i}}^{k} = \sum_{j \neq i = 1}^{n_{u}} λ_{j} f_{u_{j}}^{k}, k \geq 0 .

Since, by definition, it holds that:

\begin{matrix} d^{k + 1} Ω_{k + 1} = d^{k + 1} {(\begin{matrix} Ω_{k}^{T} & Δ Ω_{k + 1}^{T} \end{matrix})}^{T} = d^{k + 1} {(\begin{matrix} Ω_{k}^{T} & L_{f_{x w}^{k}} {(Δ Ω_{k})}^{T} & L_{f_{u_{1}}^{k}} {(Δ Ω_{k})}^{T} & \dots & L_{f_{u_{n_{u}}}^{k}} {(Δ Ω_{k})}^{T} \end{matrix})}^{T}, \end{matrix}

the matrix built by ORC-DF algorithm in the

(k + 1)

-th iteration includes

m {(1 + n_{u})}^{k + 1}

dependent rows; the rows forming

d^{k + 1} L_{f_{u_{i}}^{k}} (Δ Ω_{k})

can be written as a linear combination of the remaining rows. Indeed, by a property of Lie derivative [18] it holds that:

L_{f_{u_{i}}^{k}} (Δ Ω_{k}) = d^{k} Δ Ω_{k} f_{u_{i}}^{k} = d^{k} Δ Ω_{k} (\sum_{j \neq i = 1}^{n_{u}} λ_{j} f_{u_{j}}^{k}) = \sum_{j \neq i = 1}^{n_{u}} λ_{j} d^{k} Δ Ω_{k} f_{u_{j}}^{k} = \sum_{j \neq i = 1}^{n_{u}} λ_{j} L_{f_{u_{j}}^{k}} (Δ Ω_{k}),

so, using linearity of the derivative, the following linear combination has been obtained:

d^{k + 1} L_{f_{u_{i}}^{k}} (Δ Ω_{k}) = d^{k + 1} (\sum_{j \neq i = 0}^{n_{u}} λ_{j} L_{f_{u_{j}}^{k}} (Δ Ω_{k})) = \sum_{j \neq i = 0}^{n_{u}} λ_{j} d^{k + 1} L_{f_{u_{j}}^{k}} (Δ Ω_{k}) \in m {(1 + n_{u})}^{k + 1} \times n^{k} .

(19)

Likewise, if there exists

1 \leq i \leq n_{u}

such that

h_{u_{i}}

is linearly dependent on vector fields

h_{u_{j}}

for

1 \leq j \leq n_{u},

j \neq i,

the observability matrix built by ORC-DF includes

m {(1 + n_{u})}^{k + 1}

dependent rows in the

k + 1

-th iteration, for

k \geq 0 .

Note that the real values

λ_{j}

can be replaced by functions of unknown parameters,

λ_{j} = λ_{j} (θ),

as they are constant variables, so the equality (19) holds for this case as well.

4. Implementation

4.1. The STRIKE-GOLDD Software Toolbox

STRIKE-GOLDD (Structural identifiability taken as extended-generalized observability using Lie derivatives and decomposition) is an open source MATLAB toolbox that analyses the observability of nonlinear systems of the form Equations (1) and (2). It is available at https://sites.google.com/site/strikegolddtoolbox/ and https://github.com/afvillaverde/strike-goldd/. STRIKE-GOLDD versions up to v2.1.6 implemented the FISPO algorithm, including a number of additional features that go beyond the core instructions described in Algorithm 1, with the purpose of facilitating the analysis of large models. Furthermore, they also allowed one to indicate a given number of non-zero time derivatives of inputs, both known and unknown.

4.2. Implementation of the ORC-DF Algorithm

We have released a new version of STRIKE-GOLDD (v2.2) that includes an implementation of ORC-DF (Algorithm 2) along with the already existing implementation of FISPO (Algorithm 1). The algorithm is chosen with the newly introduced option opts.affine in the options.m file (set it to 1 for ORC-DF, and to 0 for FISPO). The ORC_DF.m function checks whether a model is indeed affine in the inputs, and if that is the case, converts it to the appropriate form

Σ_{A}

(3) and (4) and stores it in a mat-file to avoid repeating this calculation in the future. Thus, the user only needs to enter the model once, using the same format for ORC-DF and FISPO. New specific options for the ORC-DF algorithm include the possibility of setting a maximum number of iterations through the variable opts.kmax, limiting the computation time of each stage with opts.tStage, plotting graphs of the results obtained by the algorithm with opts.graphics, and using the MATLAB Parallel Toolbox by setting the option opts.parallel to 1.

4.3. Multiple Experiments and Piecewise Constant Inputs

FISPO analyses the observability of a model for a single experiment with an infinitely differentiable ("smooth") input. However, it is possible to use it to consider multiple experiments with possibly different inputs by applying it to a modified model: if we create as many replicates of the model states, inputs, and outputs as the number of experiments that we want to consider, we obtain a new model whose analysis for a single input has the same observability properties as the original model with multiple inputs [20]. Until now, this feature was only available in the GenSSI 2.0 toolbox [10]. We have included the possibility of carrying out this multi-experiment analysis automatically (using any of the available algorithms) in the new version of STRIKE-GOLDD, by setting the option opts.multiexp=1. The number of experiments can be chosen with opts.numexp, and a set of initial conditions can be chosen by the user through the options.m file.

5. Computational Results and Discussion

We have applied the ORC-DF and FISPO algorithms to a set of illustrative case studies from different areas of science and technology, ranging from civil engineering to different biological disciplines. They are listed in Table 1, along with the computation times of the algorithms.

5.1. An Identifiable and Observable Model with Known Input: “C2M”

Our first case study is a deceivingly simple compartmental model [20]:

\begin{matrix} \{\begin{matrix} {\dot{x}}_{1} = - (k_{1 e} + k_{12}) x_{1} + k_{21} x_{2} + b u, \\ {\dot{x}}_{2} = k_{12} x_{1} - k_{21} x_{2}, \\ y = x_{1}, \end{matrix} \end{matrix}

where each state

x_{i}

(i = 1, 2)

corresponds to a compartment, and

θ = (k_{1 e}, k_{12}, k_{21}, b)

is the unknown parameter vector. The augmented state vector is

\bar{x} = (x_{1}, x_{2}, k_{1 e}, k_{12}, k_{21}, b),

with extended dynamics given by:

\bar{f} (\bar{x}, u) = f_{x w} (\bar{x}) + f_{u} (\bar{x}) u = {(\begin{matrix} {\dot{x}}_{1} & {\dot{x}}_{2} & 0 & 0 & 0 & 0 \end{matrix})}^{T},

with the following vector fields for the affine-in-inputs formulation (5):

\begin{matrix} f_{x w} (\bar{x}) = f_{x w} (x, θ) = {(\begin{matrix} - (k_{1 e} + k_{12}) x_{1} + k_{21} x_{2} & k_{12} x_{1} - k_{21} x_{2} & 0 & 0 & 0 & 0 \end{matrix})}^{T}, \\ f_{u} (\bar{x}) = f_{u} (x, θ) = {(\begin{matrix} b & 0 & 0 & 0 & 0 & 0 \end{matrix})}^{T} . \end{matrix}

In addition, the output is given by the function:

y = h (x) = h_{x w} (x) = x_{1} .

The results obtained by ORC-DF and FISPO are shown in Figure 1, where the first important difference between both methods can be seen: while FISPO works with analytical, time-varying controls

u (t)

(which implies that it depends on functions

u (t)

throughout their time derivatives; see Equations (9) and (10)), ORC-DF assumes piecewise constant inputs. For this reason, the results obtained by FISPO vary according to the number of non-zero

u (t)

derivatives, while those of ORC-DF do not. Considering different types of inputs can be of interest in biological applications, where it is sometimes experimentally impossible to apply sufficiently exciting signals. Due to its reduced size, this model is well suited for illustrating this difference, so we derive the equations of the Lie derivatives calculated by each algorithm in Appendix A, where we also discuss other aspects and implications.

This model is classified as observable and identifiable by ORC-DF after three iterations. The result yielded by FISPO depends on the number of input derivatives assumed to be zero: the unmeasured variables are classified as unobservable with a constant input, while they become observable in the fifth iteration if the input is any non-constant analytical function. The variables classified as observable by both algorithms at each iteration are illustrated in Figure 1A,B. Figure 1C shows the ranks of the matrices built by both algorithms in each iteration. As can be seen, the observability matrix constructed by ORC-DF reaches full rank after considering Lie derivatives up to order three. The matrix built by FISPO stagnates from the fourth iteration onward with a constant input, while it reaches full rank after five iterations with a non-constant input.

5.2. A Non-Identifiable, Non-Observable Model with Known Inputs: “Bolie”

Our second example is a model with similarities to the previous one, given by [21]:

\begin{matrix} \{\begin{matrix} {\dot{q}}_{1} = p_{1} q_{1} - p_{2} q_{2} + u, \\ {\dot{q}}_{2} = p_{4} q_{1} + p_{3} q_{2}, \\ y = q_{1} / V_{p}, \end{matrix} \end{matrix}

where

x = (q_{1}, q_{2})

is the state vector,

θ = (p_{1}, p_{2}, p_{3}, p_{4}, V_{p})

are the unknown parameters, and u is the control variable. The augmented state vector is

\bar{x} = (q_{1}, q_{2}, p_{1}, p_{2}, p_{3}, p_{4}, V_{p})

, and the extended dynamics are given by:

\bar{f} (\bar{x}, u) = f_{x w} (\bar{x}) + f_{u} (\bar{x}) u = {(\begin{matrix} {\dot{q}}_{1} & {\dot{q}}_{2} & 0 & 0 & 0 & 0 & 0 \end{matrix})}^{T},

and can be separated into the vector fields:

\begin{matrix} f_{x w} (\bar{x}) = & f_{x w} (x, θ) = {(\begin{matrix} p_{1} q_{1} - p_{2} q_{2} & p_{4} q_{1} + p_{3} q_{2} & 0 & 0 & 0 & 0 & 0 \end{matrix})}^{T}, \\ f_{u} (\bar{x}) = & f_{u} (x, θ) = {(\begin{matrix} 1 & 0 & 0 & 0 & 0 & 0 & 0 \end{matrix})}^{T} . \end{matrix}

The output is a function of the state

q_{1}

and the unknown parameter

V_{p} :

h (x, θ) = h_{x w} (x, θ) = q_{1} / V_{p},

so, in this case, there are no directly measured states or parameters.

The model is classified as non-identifiable and non-observable by FISPO and ORC-DC, as shown in Figure 2. A detailed analysis of the calculations performed by both algorithms is provided in Appendix B.

5.3. A Model with Known and Unknown Inputs: “2DOF”

We consider now an affine-in-the-inputs model with a known and an unknown input, proposed by [15]. It describes the behaviour of a mechanical system consisting of two masses connected by a spring. In the form (1) and (2), its dynamics and output functions are given by:

\begin{matrix} f (x, θ, u, w) = & (\begin{matrix} {\dot{x}}_{1} \\ {\dot{x}}_{2} \\ (- (k_{1} + δ k_{1} x_{1}) x_{1} + k_{2} (x_{2} - x_{1}) - c_{1} {\dot{x}}_{1} + c_{2} ({\dot{x}}_{2} - {\dot{x}}_{1}) + F_{1}) / m_{1} \\ (k_{2} (x_{1} - x_{2}) + c_{2} ({\dot{x}}_{1} - {\dot{x}}_{2}) + F_{2}) / m_{2} \end{matrix}), \end{matrix}

\begin{matrix} h (x, θ, w) = & h_{x w} (x, θ, w) = {(\begin{matrix} x_{1} & (k_{2} (x_{1} - x_{2}) + c_{2} ({\dot{x}}_{1} - {\dot{x}}_{2}) + F_{2}) / m_{2} \end{matrix})}^{T} . \end{matrix}

The state vector is

x = (x_{1}, x_{2}, {\dot{x}}_{1}, {\dot{x}}_{2})

and the unknown parameters are

θ = (k_{1}, δ k_{1}, m_{2}) .

Two external forces act on the system as inputs, one of known magnitude,

u (t) = F_{1} (t),

and another of unknown value,

w (t) = F_{2} (t) .

The remaining parameters

k_{2},

m_{1},

c_{1}

and

c_{2}

are known.

Since there is an unknown input acting on the model, it is necessary to include its time derivatives in the extended state vector. The 0-augmented state is

x^{0} = (x_{1}, x_{2}, {\dot{x}}_{1}, {\dot{x}}_{2}, k_{1}, δ, k_{1}, m_{2}, w)

, which follows the dynamics:

{\dot{x}}^{0} = f^{0} (x^{0}, u, \dot{w}) = f_{x w}^{0} (x^{0}, \dot{w}) + f_{u}^{0} (\bar{x}) u = {(\begin{matrix} f {(x, θ, u, w)}^{T} & 0 & 0 & 0 & \dot{w} \end{matrix})}^{T},

where the contribution of the known input is:

f_{u}^{0} (\bar{x}) = {(\begin{matrix} 0 & 0 & 1 / m_{1} & 0 & 0 & 0 & 0 & 0 \end{matrix})}^{T} .

First we consider the case in which the unknown disturbance

w (t)

is assumed constant,

\dot{w} (t) = 0

. The results are shown in Figure 3. After calculating three Lie derivatives both algorithms conclude that the system is identifiable, observable and invertible. It should be noted that for this model FISPO always leads to the result shown in Figure 3A, regardless of the number of known input derivatives assumed to be non-zero (note that the figure shows the constant case

\dot{w} (t) = 0,

but this property is fulfilled for any analytical disturbance

w (t)) .

Next, we consider a time-varying unknown input, assuming that

w^{s)} = 0

for some

s > 1 .

For this case, the model is again classified as fully observable by both algorithms. However, the paths that they follow to reach that conclusion are different. The number of Lie derivatives required by FISPO to classify the model as observable increases as s grows, due to the number of states in each stage also increasing without reaching full rank, so FISPO does not reach any conclusion when

s = + \infty .

In contrast, ORC-DF ends at most in four iterations, regardless of the value of

s \geq 1

(including the case

s = \infty) .

This situation is illustrated in Figure 4, which shows the number of Lie derivatives required by each algorithm to achieve a result for

0 \leq s \leq 10 .

A difference between the procedures carried out by both algorithms for this model is that—similarly to the case of parameter b in the C2M example, mentioned in Appendix A—the expressions obtained by ORC-DF determine that the unknown parameter

m_{2}

can be calculated directly from the measurements since the first iteration as:

L_{f_{u}^{0}} h (x, θ, u, w) = c_{2} / (m_{1} m_{2}) .

In the case of FISPO, in contrast, the parameter

m_{2}

is the last to be classified as identifiable, which happens at the same time in which the entry

w (t)

is classified as invertible for a sufficiently large s

(s \geq 5)

.

5.4. A Model with a Known or Unknown Input: “HIV”

Next we consider a model of HIV dynamics in the human body given by [22]:

\begin{matrix} \{\begin{matrix} {\dot{T}}_{U} = λ - ρ T_{U} - η T_{U} V, \\ {\dot{T}}_{I} = η T_{U} V - δ T_{I}, \\ \dot{V} = N δ T_{I} - c V, \\ y_{1} = V, \\ y_{2} = T_{I} + T_{U}, \end{matrix} \end{matrix}

where the states are

x = (T_{U}, T_{I}, V),

the unknown parameters vector is

θ = (λ, ρ, δ, N, c),

and

η (t)

is a time-varying input, the infection rate.

As was established in Section 3.2, if

η (t)

is unknown, ORC-DF and FISPO become the same algorithm (leaving aside implementation details). If

η (t)

is known and time-varying, they differ.

This model was analysed with FISPO by [16] considering two possibilities, i.e.,

η (t)

known and unknown. In both cases the model is classified as observable and identifiable by FISPO. In the latter case, the number of Lie derivatives necessary to achieve this conclusion grows with the number of derivatives of

η (t)

assumed to be non-zero, as happened with the 2DOF model analysed in Section 5.3.

It is possible to analyse the HIV model with the ORC-DF algorithm, since it is affine in input. With the infection rate considered known, i.e.,

u (t) = η (t)

, the functions of the affine formulation (13) and (14) are written as:

\begin{matrix} f_{x w} (x, θ) = & {(\begin{matrix} λ - ρ T_{U} & - δ T_{I} & N δ T_{I} - c V \end{matrix})}^{T}, \\ f_{u} (x, θ) = & {(\begin{matrix} - T_{U} V & T_{U} V & 0 \end{matrix})}^{T}, \\ h_{x w} (x, θ) = & h (x, θ) = {(\begin{matrix} V & T_{I} + T_{U} \end{matrix})}^{T}, \end{matrix}

while if it is unknown,

w (t) = η (t)

, we have:

\begin{matrix} f_{x w} (x, θ, w) = f (x, θ, w), \\ h_{x w} (x, θ, w) = h (x, θ, w) . \end{matrix}

If the infection rate is considered known, both algorithms classify the model as observable and identifiable, regardless of the number of input derivatives assumed non-zero by FISPO. Figure 5 illustrates this fact, where the results obtained by FISPO have been represented for a generic analytical input.

The observability matrix calculated by ORC-DF has full rank after including Lie derivatives up to second order, so the number of its rows is

\sum_{i = 0}^{2} m {(1 + n_{u})}^{i + 1} = \sum_{i = 0}^{2} 2^{i + 2} = 26

(actually, 14 rows after excluding dependent rows arising from the equality

h_{u} = 0)

while the matrix constructed by FISPO needs to include Lie derivatives up to order three to achieve full rank, so it has

m (k + 1) = 8

rows.

This is an example of a model for both algorithms perform similarly; although FISPO needs to calculate one more Lie derivative to classify the system as observable, ORC-DF calculates ranks of matrices of greater dimension, resulting in similar computational cost of the calculations involved in each algorithm. As can be seen in Figure 5C, the ranks of both observability matrices coincide up to the first iteration, as a consequence of:

h_{u} (x, θ) = L_{f_{u}} h (x, θ) = {(\begin{matrix} 0 & 0 \end{matrix})}^{T} .

5.5. A Genetic Toggle Switch with Two Inputs: “TS”

Let us now consider the following model of a genetic toggle switch [23]:

\begin{matrix} \{\begin{matrix} {\dot{x}}_{1} = k_{01} + \frac{k_{1}}{1 + {(x_{2} / (1 + {(a T c / θ_{a T c})}^{η_{a T c}}))}^{η_{T e t R}}} - x_{1}, \\ {\dot{x}}_{2} = k_{02} + \frac{k_{2}}{1 + {(x_{1} / (1 + {(I P T G / θ_{I P T G})}^{η_{I P T G}}))}^{η_{L a c I}}} - x_{2}, \\ y_{1} = x_{1}, \\ y_{2} = x_{2}, \end{matrix} \end{matrix}

where

x = (x_{1}, x_{2})

is the state vector and the inputs are

a T c (t)

and

I P T G (t) .

The remaining variables are unknown parameters.

This model is an example that cannot be analysed by ORC-DF algorithm, since it is not affine in inputs. It was analysed with FISPO in [16], considering both measured and unmeasured inputs. If both inputs are known, FISPO classifies the model as structurally identifiable, as long as neither input is constant. If the inputs are unknown FISPO concludes that some parameters become unidentifiable. For more details we refer the reader to [16].

5.6. A Signalling Pathway with Five Known Inputs: “JAK-STAT”

To show the computational limitations of the two algorithms, we analyse here a model that pushes them to their limits. It is a classic model of the JAK-STAT signalling pathway presented by [24], which has 25 states, 26 unknown parameters, and 5 inputs. The output consists on 15 measured functions of the model variables that depend only on one of the external signals

u_{i},

which is not involved in system dynamics, that is,

\begin{matrix} f_{u_{i}} (x, θ) = 0_{25 \times 1} \end{matrix}

(20)

\begin{matrix} h_{u_{j}} (x, θ) = 0_{15 \times 1}, 1 \leq j \leq 5, j \neq i \end{matrix}

(21)

The model equations are provided in Appendix C.

This model was analysed with FISPO in [25], concluding that all its parameters are structurally identifiable but two of its 25 states are non-observable. The calculations are computationally expensive, requiring the use of procedures supported in STRIKE-GOLDD—such as model decomposition or successive executions after removing parameters previously classified as identifiable—in order to reach the conclusion. Thus, the model was first analysed after setting the maximum computation time of each Lie derivative to 100 seconds, which allowed FISPO to calculate 5 Lie derivatives and to classify 17 parameters and 4 states as observable. Next, the 17 parameters were specified as previously classified in the FISPO options, thereby removing them from further consideration and decreasing the size of the problem, and the model was decomposed. The post-decomposition analysis classified five additional parameters as identifiable. After removing them, it was possible to analyse the remainder of the model and reach the aforementioned conclusion.

With ORC-DF we did not manage to analyse the model due to computational limitations (specifically, insufficient memory). The different computational requirements of ORC-DF and FISPO are shown in Table 2.

Table 2 shows that the number of rows of the matrix built by ORC-DF grows rapidly at each iteration (even though the implementation removes null rows arising from dependencies in (20) and (21), which is why the number of rows of the ORC-DF matrix does not match the number given in Section 3.3). Although this matrix leads to higher ranks than the one built by FISPO, especially at the beginning of the execution (i.e., with few Lie derivatives), the difference decreases soon and the ranks of the two matrices are similar despite the big difference in the number of rows. With five Lie derivatives, ORC-DF labels 27 model variables as observable (9 states and 18 parameters), including those classified as observable by FISPO (21: 4 states and 17 parameters). However, the computation time of ORC-DF at that point is roughly ten times higher than FISPO, and memory requirements impede further progress with this algorithm.

6. Conclusions

In this paper we have analysed two recent algorithms for observability analysis of nonlinear systems with known and/or unknown inputs, which we refer to as ORC-DF and FISPO. Our analyses have revealed the key similarities and differences between them. The main conclusions can be summarized as follows.

First, we have proven theoretically that for models without known inputs both algorithms are basically equivalent, since they calculate the rank of the same observability matrix. In contrast, for models with known inputs—e.g., for controlled systems—the two algorithms differ, since they build different observability matrices. Specifically, the number of rows of the matrix built by ORC-DF increases more at each iteration than the one built by FISPO. We have shown that this increased growth is often an advantage of ORC-DF, since it makes it possible to reach full rank—and thus, to conclude that a model is observable—with less Lie derivatives, and hence less computational cost; an example was shown in Section 5.1. However, said increased growth is not always advantageous: as we have noted in Section 3.3, it can also be detrimental to the efficiency of ORC-DF. The latter situation may happen when the structure of the model equations is such that the increase in problem dimension outweighs the increase in information resulting from the inclusion of a new Lie derivative; we provided an example in Section 5.6.

When applying FISPO to a model with unknown input(s) it is usually necessary to assume that their derivatives,

w^{s)}

, are zero for orders higher than a finite s, which amounts to assuming that the unknown inputs are polynomial functions. This is not a theoretical requirement—and in fact, a counter-example that did not require this assumption was shown in [16]—but it is often necessary in practice in order to reach a conclusion in finite time. This was indeed the case for the models with unknown inputs that we analysed in this paper. It should be noted that the results obtained under this assumption are not necessarily valid if the unknown input has nonzero derivatives of order higher than s.

We remark that this assumption of polynomial inputs does not apply to known inputs; FISPO does not require this assumption for their analysis, and it is not used by ORC-DF either. That being said, it must be taken into account that in certain applications there are experimental limitations that do not allow one to use sophisticated input signals for system identification. This situation is common in biological modelling, where often times only constant or ramp inputs can be applied. The implications of such limitations in observability can be taken into account by specifying that the derivatives of the known inputs,

u^{s)}

, are zero for orders higher than a finite s. This option is available in STRIKE-GOLDD.

Another difference between both algorithms lies in the types of models and known inputs that they can analyse. In regard to model types, FISPO is applicable to a general class of nonlinear ODE models, while ORC-DF is applicable to a subclass of those models: the ones that are affine in the inputs. There is an additional, albeit subtle, difference between both algorithms in regard to the known inputs: FISPO considers infinitely differentiable ("smooth") functions, while ORC-DF considers piecewise constant inputs. It should be noted however that an affine system that is observable for piecewise constant inputs is also observable for smooth inputs; therefore, ORC-DF can also establish the observability of (affine) models with continuous inputs.

In a new release of the STRIKE-GOLDD toolbox (v2.2) we have included an implementation of the ORC-DF algorithm, thus allowing the user to apply different algorithms with the same tool and model definition. It should be noted that the implementations of the FISPO and ORC-DF algorithms included in the STRIKE-GOLDD toolbox have a number of additional features that increase the efficiency of the core algorithms analysed here, as noted in Section 4. We used the new implementations in STRIKE-GOLDD 2.2 to benchmark the algorithms with several models taken from the literature. Our selection of case studies included both simple models, used for illustrating the inner workings of the algorithms in detail, and more complex models whose analysis is computationally challenging, which we used for pushing the algorithms to their limits. We also provided an example of a model that cannot be analysed with ORC-DF due to being not affine in the inputs.

In conclusion, the theoretical and computational analyses presented here have informed us about the differences between the ORC-DF and FISPO algorithms, showing that they represent complementary techniques for solving an often challenging problem, and clarifying when one may be preferred over the other. The release of a new version of the MATLAB toolbox STRIKE-GOLDD that includes implementations of both algorithms provides the convenience of performing different analyses with minimal intervention from the user.

Author Contributions

A.F.V. designed and supervised the research; N.M. implemented the software and performed the computational experiments; N.M. and A.F.V. analysed the algorithms and the results; N.M. and A.F.V. wrote the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Spanish Ministry of Science, Innovation and Universities through the project SYNBIOCONTROL (reference DPI2017-82896-C2-2-R). We acknowledge support of the publication fee by the CSIC Open Access Publication Support Initiative through its Unit of Information Resources for Research (URICI) and by the Xunta de Galicia through grant ref. IN607B 2020-03.

Conflicts of Interest

The authors declare no competing interests.

Data and Materials Availability

The methods and models used in this paper are available in the GitHub repository as part of release v2.2 of the STRIKE-GOLDD toolbox: https://github.com/afvillaverde/strike-goldd.

Appendix A. Analysis of the Lie Derivatives of the C2M Case Study

Let us first consider the FISPO algorithm and the model with a non-constant input. As shown in Figure 1C, in this case there are six independent extended Lie derivatives of the output, which are given by:

\begin{matrix} L_{f}^{0} h (x) = & x_{1}, \end{matrix}

(A1)

\begin{matrix} L_{f} h (x) = & - (k_{1 e} + k_{12}) x_{1} + k_{21} x_{2} + b u, \end{matrix}

(A2)

\begin{matrix} L_{f}^{2} h (x) = & ({(k_{1 e} + k_{12})}^{2} + k_{21} k_{12}) x_{1} - (k_{1 e} + k_{12} + k_{21}) k_{21} x_{2} - (k_{1 e} + k_{12}) b u + b \dot{u}, \end{matrix}

(A3)

\begin{matrix} L_{f}^{3} h (x) = & - ({(k_{1 e} + k_{12})}^{3} + k_{12} k_{21} (k_{1 e} + k_{12}) + k_{12} k_{21} (k_{1 e} + k_{12} + k_{21})) x_{1} + (k_{1 e} + k_{12} + k_{21}) k_{21}^{2} x_{2} + \\ ({(k_{1 e} + k_{12})}^{2} + k_{12} k_{21}) k_{21} x_{2} + ({(k_{1 e} + k_{12})}^{2} + k_{21} k_{12}) b u - (k_{1 e} + k_{12}) b \dot{u} + b \ddot{u}, \end{matrix}

(A4)

\begin{matrix} L_{f}^{4} h (x) = & ({(k_{1 e} + k_{12})}^{4} + 2 k_{12} k_{21} {(k_{1 e} + k_{12})}^{2} + k_{12} k_{21} {(k_{1 e} + k_{12} + k_{21})}^{2} + k_{12}^{2} k_{21}^{2}) x_{1} - \\ ({(k_{1 e} + k_{12})}^{3} + 2 k_{12} k_{21} (k_{1 e} + k_{12}) + 2 k_{12} k_{21}^{2} + k_{21} {(k_{1 e} + k_{12})}^{2} + k_{21}^{2} (k_{1 e} + k_{12} + k_{21})) k_{21} x_{2} - \\ ({(k_{1 e} + k_{12})}^{3} + 2 k_{12} k_{21} (k_{1 e} + k_{12}) + k_{12} k_{21}^{2}) b u + ({(k_{1 e} + k_{12})}^{2} + k_{21} k_{12}) b \dot{u} - b (k_{1 e} + k_{12}) \ddot{u} + b u^{3)}, \end{matrix}

(A5)

\begin{matrix} L_{f}^{5} h (x) = & - ({(k_{1 e} + k_{12})}^{5} + 3 k_{12} k_{21} {(k_{1 e} + k_{12})}^{3} + 3 k_{12}^{2} k_{21}^{2} (k_{1 e} + k_{12}) + k_{12} k_{21}^{2} {(k_{1 e} + k_{12})}^{2} + 2 k_{12}^{2} k_{21}^{2}) x_{1} + \\ (k_{12} k_{21} (k_{1 e} + k_{12}) {(k_{1 e} + k_{12} + k_{21})}^{2} + k_{12} k_{21}^{3} (k_{1 e} + k_{12} + k_{21})) x_{1} + {(k_{1 e} + k_{12})}^{4} k_{21} x_{2} + \\ (k_{21} {(k_{1 e} + k_{12})}^{2} (2 k_{12} + k_{21}) + k_{12} k_{21} {(k_{1 e} + k_{12} + k_{21})}^{2} + k_{21}^{3} (k_{1 e} + k_{12} + k_{21})) k_{21} x_{2} + \\ (k_{21} {(k_{1 e} + k_{12})}^{3} + 2 k_{12} k_{21}^{2} (k_{1 e} + k_{12}) + 2 k_{12} k_{21}^{3} + k_{12}^{2} k_{21}^{2} + 3 k_{12}^{2} k_{21}^{2} (k_{1 e} + k_{12})) k_{21} x_{2} + \\ ({(k_{1 e} + k_{12})}^{4} + 2 k_{12} k_{23} {(k_{1 e} + k_{12})}^{2} + k_{12}^{2} k_{21}^{2} + k_{12} k_{21} {(k_{1 e} + k_{12} + k_{21})}^{2}) b u + {(k_{1 e} + k_{12})}^{3} b \dot{u} \\ (2 k_{12} k_{21} (k_{1 e} + k_{12}) + k_{12} k_{21}^{2}) b \dot{u} + ({(k_{1 e} + k_{12})}^{2} + k_{21} k_{12}) b \ddot{u} - b (k_{1 e} + k_{12}) u^{3)} + b u^{4)} . \end{matrix}

(A6)

Assuming (for simplicity) that

u^{k)} = 0

for

k > 1,

and writing

L_{f}^{i} h = y^{i)}

(1 \leq i \leq 5),

from (A1)–(A6) we obtain:

\begin{matrix} x_{1} = y, \end{matrix}

(A7)

\begin{matrix} b u = y^{'} + (k_{1 e} + k_{12}) y - k_{21} x_{2}, \end{matrix}

(A8)

\begin{matrix} k_{21}^{2} x_{2} = k_{12} k_{21} y - (k_{1 e} + k_{12}) y^{'} - y^{″} + b \dot{u}, \end{matrix}

(A9)

\begin{matrix} k_{21} b \dot{u} = y^{‴} + k_{1 e} k_{21} y^{'} + (k_{1 e} + k_{12} + k_{21}) y^{″}, \end{matrix}

(A10)

\begin{matrix} k_{1 e} k_{21} y^{″} = - y^{4)} - (k_{1 e} + k_{12} + k_{21}) y^{‴}, \end{matrix}

(A11)

\begin{matrix} (k_{1 e} + k_{12} + k_{21}) (y^{‴ 2} - y^{4)} y^{″}) = y^{5)} y^{″} - y^{4)} y^{‴} . \end{matrix}

(A12)

Therefore, from the above system we can extract directly the following input–output expressions:

\begin{matrix} x_{1} = & ϕ_{0} (y, y^{'}, \dots, y^{5)}, u, \dot{u}), \end{matrix}

(A13)

\begin{matrix} k_{1 e} + k_{12} + k_{21} = & ϕ_{1} (y, y^{'}, \dots, y^{5)}, u, \dot{u}), \end{matrix}

(A14)

\begin{matrix} k_{1 e} k_{21} = & ϕ_{2} (y, y^{'}, \dots, y^{5)}, u, \dot{u}), \end{matrix}

(A15)

\begin{matrix} k_{21} b = & ϕ_{3} (y, y^{'}, \dots, y^{5)}, u, \dot{u}), \end{matrix}

(A16)

where

ϕ_{i}

(0 \leq i \leq 3)

are functions that depend only on the output, the input, and their time derivatives. It is possible to obtain similar input–output expressions for any variable involved in the model by determining the unique solution of the system (A7)–(A12), which consists of six independent equations and six unknowns. Note that the assumption

u^{k)} = 0

for

k > 1

does not imply loss of generality, because the presence of higher order derivatives does not affect the unicity of the solution of (A1)–(A6) (if anything, they would contribute to the independence of the equations).

By inspecting Equations (A7)–(A12) it is possible to explain the classification obtained by FISPO and shown in Figure 1A (blue line), since it is not possible to obtain such an input–output expression for any unmeasured variable until calculating the fifth Lie derivative (A6), when the ambiguities in the equations are finally resolved. (Note that this result requires excluding those states and parameters in the phase space for which the denominators in the input–output expressions vanish. However, since the system is analytical these states form a zero measurement subset.)

Let us consider now the FISPO algorithm in the constant input case. From Figure 1C it follows that the equations obtained (A1)–(A6) are dependent. The following system of equations is extracted from (A1)–(A5):

\begin{matrix} x_{1} = y, \end{matrix}

(A17)

\begin{matrix} k_{21} x_{2} = y^{'} + (k_{1 e} + k_{12}) y - b u, \end{matrix}

(A18)

\begin{matrix} k_{21} b u = y^{″} + k_{1 e} k_{21} y + (k_{1 e} + k_{12} + k_{21}) y^{'}, \end{matrix}

(A19)

\begin{matrix} k_{1 e} k_{21} y^{'} = - y^{‴} - (k_{1 e} + k_{12} + k_{21}) y^{″}, \end{matrix}

(A20)

\begin{matrix} (k_{1 e} + k_{12} + k_{21}) (y^{″ 2} - y^{‴} y^{'}) = y^{'} y^{4)} - y^{″} y^{‴} . \end{matrix}

(A21)

Using Equations (A17) and (A19–A21), it is possible to write the combinations

x_{1},

k_{21} b,

k_{1 e} + k_{12} + k_{21}

y

k_{1 e} k_{21}

exclusively in terms of the input and output of the model. Thus, after replacing in (A18), the system to be solved is:

\begin{matrix} x_{1} = & ϕ_{0} (y, y^{'}, \dots, y^{4)}, u), \end{matrix}

(A22)

\begin{matrix} k_{1 e} + k_{12} + k_{21} = & ϕ_{1} (y, y^{'}, \dots, y^{4)}, u), \end{matrix}

(A23)

\begin{matrix} k_{1 e} k_{21} = & ϕ_{2} (y, y^{'}, \dots, y^{4)}, u), \end{matrix}

(A24)

\begin{matrix} k_{21} b = & ϕ_{3} (y, y^{'}, \dots, y^{4)}, u), \end{matrix}

(A25)

\begin{matrix} k_{21} x_{2} = & y^{'} + (ϕ_{1} (y, y^{'}, \dots, y^{4)}, u) - k_{21}) y - ϕ_{3} (y, y^{'}, \dots, y^{4)}, u) u / k_{21}, \end{matrix}

(A26)

which has six unknowns and five independent equations (infinite solutions), so it is not possible to write (at least, locally) any of the parameters or the unknown state

x_{2}

as a function of the input and output. This scenario is shown in Figure 1A (red line).

We consider now the ORC-DF algorithm. Instead of (A1)–(A6), this method computes the following Lie derivatives (independent of the control values, since they are assumed to be piecewise constant):

\begin{matrix} L_{0} = & h (x) = x_{1}, \end{matrix}

(A27)

\begin{matrix} L_{1} = & L_{f_{x w}} h (x) = - (k_{1 e} + k_{12}) x_{1} + k_{21} x_{2}, \end{matrix}

(A28)

\begin{matrix} L_{2} = & L_{f_{u}} h (x) = b, \end{matrix}

(A29)

\begin{matrix} L_{3} = & L_{f_{x w}}^{2} h (x) = ({(k_{1 e} + k_{12})}^{2} + k_{21} k_{12}) x_{1} - (k_{1 e} + k_{12} + k_{21}) k_{21} x_{2}, \end{matrix}

(A30)

\begin{matrix} L_{4} = & L_{f_{u}} L_{f_{x w}} h (x) = - b (k_{1 e} + k_{12}), \end{matrix}

(A31)

\begin{matrix} L_{5} = & L_{f_{x w}}^{3} h (x) = - ({(k_{1 e} + k_{12})}^{3} + 2 k_{21} k_{12} (k_{1 e} + k_{12}) + k_{21}^{2} k_{12}) x_{1} + \\ ({(k_{1 e} + k_{12})}^{2} + k_{21}^{2} + k_{21} (k_{1 e} + 2 k_{12})) k_{21} x_{2}, \end{matrix}

(A32)

\begin{matrix} L_{6} = & L_{f_{u}} L_{f_{x w}}^{2} h (x) = b ({(k_{1 e} + k_{12})}^{2} + k_{21} k_{12}) . \end{matrix}

(A33)

The system (A27)–(A33) has a unique solution, in which one of the equations depends on the others. The solution, obtained from (A27)–(A31) and (A33), is given as a function of Lie derivatives

L_{i},

i \in \{0, 1, 2, 3, 4, 6\},

as follows:

\begin{matrix} x_{1} = & L_{0}, \\ x_{2} = & {(L_{1} L_{2} - L_{0} L_{4})}^{2} / (L_{2} (L_{0} L_{6} - L_{2} L_{3}) + L_{4} (L_{1} L_{2} - L_{0} L_{4})), \\ k_{1 e} = & L_{2} (L_{3} L_{4} - L_{1} L_{6}) / (L_{2} (L_{0} L_{6} - L_{2} L_{3}) + L_{4} (L_{1} L_{2} - L_{0} L_{4})), \\ k_{12} = & (L_{1} L_{2} - L_{0} L_{4}) (L_{2} L_{6} - L_{4}^{2}) / (L_{2} (L_{0} L_{6} - L_{2} L_{3}) + L_{4} (L_{1} L_{2} - L_{0} L_{4})), \\ k_{21} = & (L_{2} (L_{0} L_{6} - L_{2} L_{3}) + L_{4} (L_{1} L_{2} - L_{0} L_{4})) / (L_{1} L_{2}^{2} - L_{0} L_{2} L_{4}), \\ b = & L_{2} . \end{matrix}

Therefore, the ORC-DF algorithm classifies the C2M model as observable and identifiable after the third iteration. We note that Equation (A29) implies that parameter b can be calculated directly from the output.

Appendix B. Analysis of the Lie Derivatives of the Bolie Case Study

Assuming non-constant input, FISPO calculates six independent Lie derivatives of the output, as shown in Figure 2C:

\begin{matrix} L_{f}^{0} h (x, θ, u) = & q_{1} / V_{p}, \end{matrix}

(A34)

\begin{matrix} L_{f} h (x, θ, u) = & (p_{1} q_{1} - p_{2} q_{2} + u) / V_{p}, \end{matrix}

(A35)

\begin{matrix} L_{f}^{2} h (x, θ, u) = & ((p_{1}^{2} - p_{2} p_{4}) q_{1} - (p_{1} + p_{3}) p_{2} q_{2} + p_{1} u + \dot{u}) / V_{p}, \end{matrix}

(A36)

\begin{matrix} L_{f}^{3} h (x, θ, u) = & ((p_{1} (p_{1}^{2} - p_{2} p_{4}) - p_{2} p_{4} (p_{1} + p_{3})) q_{1} - (p_{1}^{2} - p_{2} p_{4} + p_{3} (p_{1} + p_{3})) p_{2} q_{2}) / V_{p} + \\ ((p_{1}^{2} - p_{2} p_{4}) u + p_{1} \dot{u} + \ddot{u}) / V_{p}, \end{matrix}

(A37)

\begin{matrix} L_{f}^{4} h (x, θ, u) = & (({(p_{1}^{2} - p_{2} p_{4})}^{2} - p_{2} p_{4} {(p_{1} + p_{3})}^{2}) q_{1} + (p_{1} + p_{3}) (p_{1}^{2} - 2 p_{2} p_{4} + p_{3}^{2}) p_{2} q_{2}) / V_{p} + \\ ((p_{1}^{3} - 2 p_{1} p_{2} p_{4} - p_{2} p_{3} p_{4}) u - ((p_{1}^{2} - p_{2} p_{4}) \dot{u} + p_{1} \ddot{u} + \overset{⃛}{u})) / V_{p}, \end{matrix}

(A38)

\begin{matrix} L_{f}^{5} h (x, θ, u) = & ((p_{1}^{2} {(p_{1} - p_{2} p_{4})}^{2} - p_{2} p_{4} (p_{1} + p_{3}) (2 p_{1}^{2} + p_{3}^{2} - p_{2} p_{4})) q_{1} - {(p_{1} - p_{2} p_{4})}^{2} p_{2} q_{2}) / V_{p} - \\ ((p_{3} (p_{1} + p_{3}) (p_{1}^{2} - 2 p_{2} p_{4} + p_{3}^{2}) - p_{2} p_{4} {(p_{1} + p_{3})}^{2}) p_{2} q_{2} + {(p_{1}^{2} - p_{2} p_{4})}^{2} u) / V_{p} + \\ (- p_{2} p_{4} {(p_{1} + p_{3})}^{2} u + (p_{1}^{3} - 2 p_{1} p_{2} p_{4} - p_{2} p_{3} p_{4}) \dot{u} + (p_{1}^{2} - p_{2} p_{4}) \ddot{u} + p_{1} u^{3)} + u^{4)}) / V_{p} . \end{matrix}

(A39)

System (A34)–(A39) has infinite solutions, since it is composed by six equations and seven unknowns, but similarly to the C2M case study, it is possible to perform a series of algebraic manipulations on the above equations to obtain an input–output expression for the variables

q_{1},

p_{1},

p_{3}

and

V_{p},

as it shown in Figure 2.

In the case of constant input, Equations (A34)–(A39) become redundant, so there are only five independent Lie derivatives, as is shown in Figure 2. The combinations

p_{1} + p_{3},

p_{1} p_{3} + p_{2} p_{4}

are the only observable functions of the variables that can be extracted from (A34)–(A38). Since the rational combinations of these functions are insufficient to determine any of the parameters or states, all variables are non-observable, as shown in Figure 2A (red line).

On the other hand, ORC-DF calculates the following Lie derivatives:

\begin{matrix} h (x, θ) = & q_{1} / V_{p}, \end{matrix}

(A40)

\begin{matrix} L_{f_{x w}} h (x, θ) = & (p_{1} q_{1} - p_{2} q_{2}) / V_{p}, \end{matrix}

(A41)

\begin{matrix} L_{f_{u}} h (x, θ) = & 1 / V_{p}, \end{matrix}

(A42)

\begin{matrix} L_{f_{x w}}^{2} h (x, θ) = & ((p_{1}^{2} - p_{2} p_{4}) q_{1} - (p_{1} + p_{3}) p_{2} q_{2}) / V_{p}, \end{matrix}

(A43)

\begin{matrix} L_{f_{u}} L_{f_{x w}} h (x, θ) = & p_{1} / V_{p}, \end{matrix}

(A44)

\begin{matrix} L_{f_{x w}}^{3} h (x, θ) = & ((p_{1} (p_{1}^{2} - p_{2} p_{4}) - (p_{1} + p_{3}) p_{2} p_{4}) q_{1} - ((p_{1}^{2} - p_{2} p_{4}) + p_{3} (p_{1} + p_{3})) p_{2} q_{2}) / V_{p}, \end{matrix}

(A45)

\begin{matrix} L_{f_{u}} L_{f_{x w}}^{2} h (x, θ) = & (p_{1}^{2} - p_{2} p_{4}) / V_{p} . \end{matrix}

(A46)

From Equations (A40), (A42), and (A44) it is easy to obtain the state

q_{1}

and the parameters

V_{p}

and

p_{1}

uniquely from the measurements, using Lie derivatives up to order two. This property is not fulfilled by the system formed by (A34)–(A39); Figure 2A shows that the aforementioned variables are classified as observable by FISPO only after considering fifth order derivatives. Using the input–output expressions of

q_{1},

V_{p}

, and

p_{1}

extracted from (A40), (A42) and (A44), in conjunction with Equations (A41), (A43) and (A46), it is also possible to determine parameter

p_{3}

as a function of the Lie derivatives of the output. However, Figure 2C shows that system (A40)–(A46) contains one redundant equation, so it is not possible to determine uniquely an input–output expression of the remaining unmeasured states. It can also be noted that any rational combination of the observable variables with the functions

p_{2} q_{2}

and

p_{2} p_{4}

is also observable.

Appendix C. Equations of the JAK-STAT Model

The dynamics of the JAK-STAT model analysed in Section 5.6 is given by:

\begin{matrix} {\dot{x}}_{1} & = & x_{2345} x_{8} θ_{11} / θ_{26} - k_{5} x_{1} θ_{10} / M_{1}, \\ {\dot{x}}_{2} & = & k_{5} x_{1} θ_{10} / M_{1} - x_{2} θ_{7} / M_{1} - x_{2} x_{8} θ_{11} / θ_{26} - 3 x_{2} θ_{7} / ((θ_{8} x_{6} + 1) M_{1}), \\ {\dot{x}}_{3} & = & θ_{7} x_{2} / M_{1} - θ_{11} x_{8} x_{3} / θ_{26} - 3 θ_{7} x_{3} / ((θ_{8} x_{6} + 1) M_{1}), \\ {\dot{x}}_{4} & = & 3 x_{2} θ_{7} / ((θ_{8} x_{6} + 1) M_{1}) - θ_{7} x_{4} / M_{1} - θ_{11} x_{8} x_{4} / θ_{26}, \\ {\dot{x}}_{5} & = & θ_{7} x_{4} / M_{1} - θ_{11} x_{8} x_{5} / θ_{26} + 3 θ_{7} x_{3} / ((θ_{8} x_{6} + 1) M_{1}), \\ {\dot{x}}_{6} & = & - x_{6} (θ_{9} / θ_{25}) (x_{5} + x_{3}), \\ {\dot{x}}_{7} & = & θ_{13} x_{8} - x_{7} (θ_{12} / θ_{25}) x_{2345}, \\ {\dot{x}}_{8} & = & x_{7} (θ_{12} / θ_{25}) x_{2345} - θ_{13} x_{8}, \\ {\dot{x}}_{9} & = & k_{6} θ_{23} x_{11} / k_{7} - x_{9} (θ_{22} / θ_{25}) x_{2345} / M_{1} - x_{9} θ_{21} {(x_{5} + x_{3})}^{2} / ((x_{18} θ_{3} / θ_{1} + 1) M_{1} θ_{25}^{2}), \\ {\dot{x}}_{10} & = & x_{9} θ_{22} x_{2345} M_{1} / θ_{25} - θ_{24} x_{10} + x_{9} θ_{21} {(x_{5} + x_{3})}^{2} / (θ_{25}^{2} (x_{18} θ_{3} / θ_{1} + 1) M_{1}), \\ {\dot{x}}_{11} & = & k_{7} θ_{24} x_{10} / k_{6} - θ_{23} x_{11}, \\ {\dot{x}}_{12} & = & - x_{12} θ_{4} - θ_{5} x_{11} (k_{1} - 1) / θ_{27}, \\ {\dot{x}}_{13} & = & x_{12} θ_{4} - x_{13} θ_{4}, \\ {\dot{x}}_{14} & = & x_{13} θ_{4} - x_{14} θ_{4}, \\ {\dot{x}}_{15} & = & x_{14} θ_{4} - x_{15} θ_{4}, \\ {\dot{x}}_{16} & = & x_{15} θ_{4} - x_{16} θ_{4}, \\ {\dot{x}}_{17} & = & x_{16} θ_{4} k_{6} / k_{7} - x_{17} θ_{5}, \\ {\dot{x}}_{18} & = & x_{17} θ_{1} θ_{6} - x_{18} θ_{6} + k_{2} θ_{6} θ_{2} θ_{1}, \\ {\dot{x}}_{19} & = & - x_{19} θ_{18} - θ_{19} x_{11} (k_{1} - 1) / θ_{27}, \\ {\dot{x}}_{20} & = & x_{19} θ_{18} - x_{20} θ_{18}, \\ {\dot{x}}_{21} & = & x_{20} θ_{18} - x_{21} θ_{18}, \\ {\dot{x}}_{22} & = & x_{21} θ_{18} - x_{22} θ_{18}, \\ {\dot{x}}_{23} & = & x_{22} θ_{18} - x_{23} θ_{18}, \\ {\dot{x}}_{24} & = & k_{6} x_{23} θ_{18} / k_{7} - x_{24} θ_{19}, \\ {\dot{x}}_{25} & = & x_{24} θ_{15} θ_{20} - x_{25} θ_{20} + k_{3} θ_{20} θ_{16} θ_{15} \end{matrix}

where the auxiliary variables

x_{2345} = x_{2} + x_{3} + x_{4} + x_{5}

and

M_{1} = x_{25} θ_{17} / θ_{15} + 1

have been used.

The 25 states

x_{1}, x_{2}, \dots, x_{25}

are, respectively, the following species: EpoRJAK2, EpoRpJAK2, p1EpoRpJAK2, p2EpoRpJAK2, p12EpoRpJAK2, EpoRJAK2_CIS, SHP1, SHP1Act, STAT5, pSTAT5, npSTAT5, CISnRNA1, CISnRNA2, CISnRNA3, CISnRNA4, CISnRNA5, CISRNA, CIS, SOCS3nRNA1, SOCS3nRNA2, SOCS3nRNA3, SOCS3nRNA4, SOCS3nRNA5, SOCS3RNA, and SOCS3.

The 27 unknown parameters,

θ_{i}

, were written in the original publication [24] as: CISEqc, CISEqcOE, CISInh, CISRNADelay, CISRNATurn, CISTurn, EpoRActJAK2, EpoRCISInh, EpoRCISRemove, JAK2ActEpo, JAK2EpoRDeaSHP1, SHP1ActEpoR, SHP1Dea, SHP1ProOE, SOCS3Eqc, SOCS3EqcOE, SOCS3Inh, SOCS3RNADelay, SOCS3RNATurn, SOCS3Turn, STAT5ActEpoR, STAT5ActJAK2, STAT5Exp, STAT5Imp, init_EpoRJAK2, init_SHP1, and init_STAT5.

The model has seven known constants (

k_{1}

–

k_{7}

), five of which correspond to experimental conditions that can be considered as constant inputs (

k_{1}

–

k_{5}

), including the external signal (

k_{5} \equiv

Epo).

The output equations are:

\begin{matrix} y_{1} & = & 2 (x_{2} + x_{3} + x_{4} + x_{5}) θ_{25}, \\ y_{2} & = & 16 (x_{3} + x_{4} + x_{5}) θ_{25}, \\ y_{3} & = & x_{18} θ_{1}, \\ y_{4} & = & x_{25} / θ_{14}, \\ y_{5} & = & (x_{9} + x_{10}) / θ_{27}, \\ y_{6} & = & x_{10} θ_{27}, \\ y_{7} & = & x_{9}, \\ y_{8} & = & x_{7} + x_{8}, \\ y_{9} & = & x_{18}, \\ y_{10} & = & x_{25}, \\ y_{11} & = & 100 x_{10} / (x_{10} + x_{9}), \\ y_{12} & = & x_{24}, \\ y_{13} & = & x_{17}, \\ y_{14} & = & (x_{7} + x_{8}) (1 + (k_{4} θ_{27})) / θ_{26}, \end{matrix}

References

Chatzis, M.N.; Chatzi, E.N.; Smyth, A.W. On the observability and identifiability of nonlinear structural and mechanical systems. Struct. Control Health Monit. 2015, 22, 574–593. [Google Scholar] [CrossRef]
Villaverde, A.F. Observability and Structural Identifiability of Nonlinear Biological Systems. Complexity 2019, 2019, 8497093. [Google Scholar] [CrossRef] [Green Version]
Tuza, Z.A.; Ács, B.; Szederkényi, G.; Allgöwer, F. Efficient computation of all distinct realization structures of kinetic systems. IFAC-PapersOnLine 2016, 49, 194–200. [Google Scholar] [CrossRef]
Hermann, R.; Krener, A.J. Nonlinear controllability and observability. IEEE Trans. Autom. Control 1977, 22, 728–740. [Google Scholar] [CrossRef] [Green Version]
Bellman, R.; Åström, K.J. On structural identifiability. Math. Biosci. 1970, 7, 329–339. [Google Scholar] [CrossRef]
Bellu, G.; Saccomani, M.P.; Audoly, S.; D’Angio, L. DAISY: A new software tool to test global identifiability of biological and physiological systems. Comput. Methods Programs Biomed. 2007, 88, 52–61. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Meshkat, N.; Eisenberg, M.; DiStefano, J.J. An algorithm for finding globally identifiable parameter combinations of nonlinear ODE models using Gröbner Bases. Math. Biosci. 2009, 222, 61–72. [Google Scholar] [CrossRef] [PubMed]
Karlsson, J.; Anguelova, M.; Jirstrand, M. An Efficient Method for Structural Identiability Analysis of Large Dynamic Systems. In Proceedings of the 16th IFAC Symposium on System Identification, Brussels, Belgium, 11–13 July 2012; Volume 16, pp. 941–946. [Google Scholar]
Villaverde, A.F.; Barreiro, A.; Papachristodoulou, A. Structural identifiability of dynamic systems biology models. PLoS Comput. Biol. 2016, 12, e1005153. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Ligon, T.S.; Fröhlich, F.; Chiş, O.T.; Banga, J.R.; Balsa-Canto, E.; Hasenauer, J. GenSSI 2.0: Multi- experiment structural identifiability analysis of SBML models. Bioinformatics 2018, 8, 1421–1423. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Hong, H.; Ovchinnikov, A.; Pogudin, G.; Yap, C. SIAN: A tool for assessing structural identifiability of parametric ODEs. ACM Commun. Comput. Algebra 2019, 53, 37–40. [Google Scholar] [CrossRef]
Evans, N.D.; Chapman, M.J.; Chappell, M.J.; Godfrey, K.R. Identifiability of uncontrolled nonlinear rational systems. Automatica 2002, 38, 1799–1805. [Google Scholar] [CrossRef]
Martinelli, A. Extension of the observability rank condition to nonlinear systems driven by unknown inputs. In Proceedings of the 2015 23rd Mediterranean Conference on Control and Automation (MED), Torremolinos, Spain, 16–19 June 2015; pp. 589–595. [Google Scholar]
Martinelli, A. Nonlinear Unknown Input Observability: Extension of the Observability Rank Condition. IEEE Trans. Autom. Control 2019, 64, 222–237. [Google Scholar] [CrossRef] [Green Version]
Maes, K.; Chatzis, M.; Lombaert, G. Observability of nonlinear systems with unmeasured inputs. Mech. Syst. Signal Process. 2019, 130, 378–394. [Google Scholar] [CrossRef]
Villaverde, A.F.; Tsiantis, N.; Banga, J.R. Full observability and estimation of unknown inputs, states, and parameters of nonlinear biological models. J. R. Soc. Interface 2019, 16, 20190043. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Vidyasagar, M. Nonlinear Systems Analysis; Prentice Hall: Englewood Cliffs, NJ, USA, 1993. [Google Scholar]
Isidori, A. Nonlinear Control Systems; Springer Science & Business Media: Berlin/Heidelberg, Germany, 1995. [Google Scholar]
Anguelova, M. Nonlinear observability and identifiability: General theory and a case study of a kinetic model for S. cerevisiae. Master’s Thesis, Chalmers University of Technology and Göteborg University, Göteborg, Sweden, 2004. [Google Scholar]
Villaverde, A.F.; Evans, N.D.; Chappell, M.J.; Banga, J.R. Input-Dependent Structural Identifiability of Nonlinear Systems. IEEE Control Syst. Lett. 2019, 3, 272–277. [Google Scholar] [CrossRef] [Green Version]
Bolie, J. Coefficients of normal blood glucose regulation. J. Appl. Physiol. 1961, 16, 783–788. [Google Scholar] [CrossRef] [PubMed]
Miao, H.; Xia, X.; Perelson, A.S.; Wu, H. On identifiability of nonlinear ODE models and applications in viral dynamics. SIAM Rev. 2011, 53, 3–39. [Google Scholar] [CrossRef] [PubMed]
Lugagne, J.B.; Carrillo, S.S.; Kirch, M.; Köhler, A.; Batt, G.; Hersen, P. Balancing a genetic toggle switch by real-time feedback control and periodic forcing. Nat. Commun. 2017, 8, 1671. [Google Scholar] [CrossRef] [PubMed]
Bachmann, J.; Raue, A.; Schilling, M.; Böhm, M.E.; Kreutz, C.; Kaschek, D.; Busch, H.; Gretz, N.; Lehmann, W.D.; Timmer, J.; et al. Division of labor by dual feedback regulators controls JAK2/STAT5 signaling over broad ligand range. Mol. Syst. Biol. 2011, 7, 516. [Google Scholar] [CrossRef] [PubMed]
Villaverde, A.F.; Banga, J.R. Análisis de observabilidad e identificabilidad estructural de modelos no lineales: Aplicación a la vía de señalización JAK/STAT. In XL Jornadas de Automática. Universidade da Coruña; Servizo de Publicacións—UDC: Coruña, Spain, 2019; pp. 631–638. [Google Scholar]

Figure 1. Analysis of the C2M model with the FISPO and ORC-DF algorithm. For FISPO two cases are considered:

\dot{u} = 0

, which is labelled as “(const)”, and

\dot{u} \neq 0

, which is labelled as “(var)”. In the first case, higher order derivatives are assumed to be zero too. (A,B) Results of the FISPO and ORC-DF algorithms, respectively: the panel shows the states classified as observable or unobservable as a function of the number of Lie derivatives calculated by each algorithm. (C) Observability rank obtained by each algorithm as a function of the number of Lie derivatives. The full rank is equal to the number of states, i.e., six.

Figure 1. Analysis of the C2M model with the FISPO and ORC-DF algorithm. For FISPO two cases are considered:

\dot{u} = 0

, which is labelled as “(const)”, and

\dot{u} \neq 0

, which is labelled as “(var)”. In the first case, higher order derivatives are assumed to be zero too. (A,B) Results of the FISPO and ORC-DF algorithms, respectively: the panel shows the states classified as observable or unobservable as a function of the number of Lie derivatives calculated by each algorithm. (C) Observability rank obtained by each algorithm as a function of the number of Lie derivatives. The full rank is equal to the number of states, i.e., six.

Figure 2. Analysis of the Bolie model with the FISPO and ORC-DF algorithms. For FISPO two cases are considered:

\dot{u} = 0

, which is labelled as “(const)”, and

\dot{u} \neq 0

, which is labelled as “(var)”. In the first case, higher order derivatives are assumed to be zero too. (A,B) Results of the FISPO and ORC-DF algorithms, respectively: the panel shows the states classified as observable or unobservable as a function of the number of Lie derivatives calculated by each algorithm. (C) Observability rank obtained by each algorithm as a function of the number of Lie derivatives. The full rank is equal to the number of states, i.e., seven.

Figure 2. Analysis of the Bolie model with the FISPO and ORC-DF algorithms. For FISPO two cases are considered:

\dot{u} = 0

, which is labelled as “(const)”, and

\dot{u} \neq 0

, which is labelled as “(var)”. In the first case, higher order derivatives are assumed to be zero too. (A,B) Results of the FISPO and ORC-DF algorithms, respectively: the panel shows the states classified as observable or unobservable as a function of the number of Lie derivatives calculated by each algorithm. (C) Observability rank obtained by each algorithm as a function of the number of Lie derivatives. The full rank is equal to the number of states, i.e., seven.

Figure 3. Analysis of the 2DOF model. (A,B) Results of the FISPO and ORC-DF algorithms, respectively, with

\dot{w} = 0 .

The panels show the states classified as observable or unobservable as a function of the number of Lie derivatives calculated by each algorithm. (C) Observability rank obtained by each algorithm as a function of the number of Lie derivatives. The full rank is equal to the number of states, i.e., eight.

Figure 3. Analysis of the 2DOF model. (A,B) Results of the FISPO and ORC-DF algorithms, respectively, with

\dot{w} = 0 .

The panels show the states classified as observable or unobservable as a function of the number of Lie derivatives calculated by each algorithm. (C) Observability rank obtained by each algorithm as a function of the number of Lie derivatives. The full rank is equal to the number of states, i.e., eight.

Figure 4. Number of Lie derivatives needed by each algorithm for building the full-rank observability matrix for the 2DOF model as a function of

s,

the number of non-zero

w (t)

time derivatives, for

0 \leq s \leq 10 .

Figure 4. Number of Lie derivatives needed by each algorithm for building the full-rank observability matrix for the 2DOF model as a function of

s,

the number of non-zero

w (t)

time derivatives, for

0 \leq s \leq 10 .

Figure 5. Analysis of the HIV model with the FISPO and ORC-DF algorithms, with the input

η (t)

considered known. (A,B) Results of the FISPO and ORC-DF algorithms, respectively: the panel shows the states classified as observable or unobservable as a function of the number of Lie derivatives calculated by each algorithm. (C) Observability rank obtained by each algorithm as a function of the number of Lie derivatives. The full rank is equal to the number of states, i.e., eight.

Figure 5. Analysis of the HIV model with the FISPO and ORC-DF algorithms, with the input

η (t)

considered known. (A,B) Results of the FISPO and ORC-DF algorithms, respectively: the panel shows the states classified as observable or unobservable as a function of the number of Lie derivatives calculated by each algorithm. (C) Observability rank obtained by each algorithm as a function of the number of Lie derivatives. The full rank is equal to the number of states, i.e., eight.

Table 1. Computation times of the two algorithms for the models analysed in this study. The computation times of case studies with unknown inputs depend on the highest order of the derivatives of the unknown inputs that are assumed to be non-zero, s. Three different cases are shown for those models:

s = {0, 1, 5}

. For models without unknown inputs this setting does not apply. Cases in which an algorithm cannot be applied are labelled as N/A. Results were obtained on a personal computer with 16 GB RAM and processor Intel(R) Core(TM) i7-8550U 1.80 GHz.

Table 1. Computation times of the two algorithms for the models analysed in this study. The computation times of case studies with unknown inputs depend on the highest order of the derivatives of the unknown inputs that are assumed to be non-zero, s. Three different cases are shown for those models:

s = {0, 1, 5}

. For models without unknown inputs this setting does not apply. Cases in which an algorithm cannot be applied are labelled as N/A. Results were obtained on a personal computer with 16 GB RAM and processor Intel(R) Core(TM) i7-8550U 1.80 GHz.

Model	Section	Reference	$# x$	$# θ$	$# u$	$# w$	Total Computation Time [s]
							FISPO			ORC-DF
							$s = 0$	$s = 1$	$s = 5$	$s = 0$	$s = 1$	$s = 5$
C2M	5.1	[16]	2	4	1	0		$0.47$			$0.41$
Bolie	5.2	[21]	2	5	1	0		$1.42$			$0.59$
2DOF	5.3	[15]	4	3	1	1	$0.51$	$1.26$	$5.68$	$0.80$	$0.89$	$1.54$
HIV	5.4	[22]	3	5	1	0		$0.42$			$0.44$
			3	5	0	1	$0.42$	$0.43$	$57.8$	$1.14$	$1.29$	$47.1$
TS	5.5	[23]	2	10	2	0		$99.9$			N/A
			2	6	0	2	$1.47$	$36.2$	> $10^{4}$		N/A
JAK-STAT	5.6	[24]	25	26	5	0		Table 2			Table 2

Table 2. Results and computation times of ORC-DF and FISPO for the JAK-STAT model.

	ORC-DF					FISPO
Iteration	1	2	3	4	5	1	2	3	4	5
Number of rows	180	930	4680	23430	117180	30	45	60	75	90
Rank	24	35	40	42	44	20	28	34	39	43
Rank computation time [s]	$0.54$	$2.18$	$15.36$	$170.95$	$3600.54$	$0.40$	$0.76$	$3.46$	$59.55$	$362.78$
Observable variables	14	21	29	29	29	8	12	15	15	21

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Martínez, N.; Villaverde, A.F. Nonlinear Observability Algorithms with Known and Unknown Inputs: Analysis and Implementation. Mathematics 2020, 8, 1876. https://doi.org/10.3390/math8111876

AMA Style

Martínez N, Villaverde AF. Nonlinear Observability Algorithms with Known and Unknown Inputs: Analysis and Implementation. Mathematics. 2020; 8(11):1876. https://doi.org/10.3390/math8111876

Chicago/Turabian Style

Martínez, Nerea, and Alejandro F. Villaverde. 2020. "Nonlinear Observability Algorithms with Known and Unknown Inputs: Analysis and Implementation" Mathematics 8, no. 11: 1876. https://doi.org/10.3390/math8111876

APA Style

Martínez, N., & Villaverde, A. F. (2020). Nonlinear Observability Algorithms with Known and Unknown Inputs: Analysis and Implementation. Mathematics, 8(11), 1876. https://doi.org/10.3390/math8111876

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Nonlinear Observability Algorithms with Known and Unknown Inputs: Analysis and Implementation

Abstract

1. Introduction

2. Materials and Methods

2.1. Notation and Model Classes

2.2. Background

2.2.1. Structural Identifiability, Observability, and Differential Geometry

2.2.2. FISPO

2.2.3. ORC-DF

3. Theory: Analysis of the FISPO and ORC-DF Algorithms

3.1. Preliminary Remarks

3.2. For Systems without Known Inputs, ORC-DF and FISPO Reduce to the Same Algorithm

3.3. For Systems with Known Inputs, ORC-DF and FISPO Lead to Different Observability Matrices

4. Implementation

4.1. The STRIKE-GOLDD Software Toolbox

4.2. Implementation of the ORC-DF Algorithm

4.3. Multiple Experiments and Piecewise Constant Inputs

5. Computational Results and Discussion

5.1. An Identifiable and Observable Model with Known Input: “C2M”

5.2. A Non-Identifiable, Non-Observable Model with Known Inputs: “Bolie”

5.3. A Model with Known and Unknown Inputs: “2DOF”

5.4. A Model with a Known or Unknown Input: “HIV”

5.5. A Genetic Toggle Switch with Two Inputs: “TS”

5.6. A Signalling Pathway with Five Known Inputs: “JAK-STAT”

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

Data and Materials Availability

Appendix A. Analysis of the Lie Derivatives of the C2M Case Study

Appendix B. Analysis of the Lie Derivatives of the Bolie Case Study

Appendix C. Equations of the JAK-STAT Model

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI