1. Introduction
Mathematical models are useful tools that can be used to interpret experimental or observational data, make quantitative predictions that are amenable for experimental testing, and identify the mechanisms that underlie the generation of pattern of activity in terms of the interactions among the system’s components [
1,
2,
3]. An important step in connecting models with experimental or observational data is to estimate the model parameters by fitting the model outputs to the available data (
Figure 1).
The ability of models to make predictions, provide mechanistic explanations, and be useful for decision making all depends on the accuracy and reliability of the parameter estimation process. A large number of parameter estimation tools are available to scientists as well as methods to discover data-drive non-linear dynamic equations [
4,
5,
6,
7,
8,
9,
10,
11,
12,
13,
14,
15,
16,
17,
18,
19,
20] (and references therein) and tools to link data with models continue to develop.
A key feature of these tools is the minimization of an error function, a metric of the difference between the model output (simulations using estimated parameters) and the available (target) experimental/observed data. These data can be continuous (data collected with a high-frequency sampling, as compared to the scale of the process of interest, so that one can make the continuous approximation for mathematical purposes) or discrete (point process capturing the occurrence of events of interest). Error functions can be constructed by using all the data available in a point-to-point fashion (e.g.,
Figure 2A) or by computing one or more
attributes that characterize these data (e.g., oscillation frequency, oscillation amplitude, oscillation duty cycle, stationary states, characteristic raise and decay times;
Figure 2B).
There are several difficulties associated with the implementation of parameter estimation tools. These difficulties are data related (lack of access to all state variables, inconsistent gaps across trials), computational (algorithmic nature), statistical (data is noisy and therefore one can at best expect to estimate distributions of parameter values around a “true” mean), and structural (degeneracy, mathematical nature).
In an ideal situation, there would be a unique parameter set that fits the experimental/observational data (e.g.,
Figure 1A). We refer to the underlying model as
identifiable [
24]. In mathematical terms, the function linking parameter sets and data is bijective. In practice, these systems are subject to fluctuations from uncertain sources and one obtains a distribution of parameter sets (in a high-dimensional spaces whose dimensions are equal to the number of parameters of interest) centered at the “true” parameter set. The former has been termed
structural identifiability, while the second has been termed
practical identifiability or
estimability [
25]. Roughly speaking, the narrower the estimated parameters distribution (the smaller the variance), the more precise the parameter estimation process. However, this assumes that the center of the distribution is a good approximation to the “true” value. When necessary, the necessary validation of this assumption and the circumstance under which it is true can be achieved by using ground truth data generated by mathematical models. When continuous data is too noisy to be amenable for parameter estimation purposes, one might be able to extract useful discrete data in the form of attributes that capture the most relevant aspects of these data or point processes that are embedded in the data (e.g.,
Figure 2C and
Figure 3). However, this is performed at the expense of loosing information and therefore has consequences for the accuracy of the parameter estimation process. Additional issues emerge when not all state variables are accessible or if gaps in discrete datasets are inconsistent across trials, requiring the use of imputation tools.
Unidentifiability, the opposite of identifiability [
25,
26,
27,
28,
29,
30,
31], is associated with the concept of degeneracy. Degeneracy refers to these situations where multiple sets of parameter values can produce the same observable output (e.g.,
Figure 1B and
Figure 2D1 if the output is the period), therefore making the inverse problem (of finding parameters given the data) ill posed. In mathematical terms, the function linking parameter sets and data is not injective (at most surjective). (There is in fact an implicit assumption that these links are surjective.) Degeneracy is not a problem associated with the statistical uncertainty in the knowledge of a unique parameter set from which the data is generated referred to above, but a structural problem inherent to mathematical models where the same patterns (e.g., temporal, spatial) can be obtained from multiple parameter sets. Degeneracy gives rise to the concept of
level sets in parameter space (e.g.,
Figure 2D1) [
32,
33]. These are geometric objects (curves, surfaces, hypersurfaces) along which a given attribute of activity remains constant (see additional examples and a detailed discussion in the
Appendix A), and are the geometric instantiation of the idea that a number of well-defined combinations of unidentifiable parameters can form an identifiable set. In other words, one can at most identify combinations of unidentifiable parameters.
Identifiability and unidentifiability have been studied in a number of systems. Structural identifiability has been studied using different approaches, including differential algebra [
27,
34,
35,
36,
37,
38,
39], Taylor series [
40], similarity transformations [
41,
42] and the Fisher information matrix [
26,
43,
44,
45] (see also [
46,
47,
48]). In recent work [
32], we have used numerical simulations and dynamical systems tools to characterize the frequency and duty cycle level sets and explain what are the dynamic mechanisms responsible for their generation in the FitzHugh–Nagumo (FHN) [
49] model and the Morris–Lecar model [
50,
51] in the oscillatory regimes. Within a given attribute level set (e.g., period), the oscillatory patterns are non-identical (similarly to
Figure 2D), and therefore dynamic balances operate to maintain the attribute of interest. The Morris—Lecar model is a neuronal model describing the interplay of two ionic currents (calcium and potassium), a leak (linear) current and a capacitive current. The FHN model is a caricature model that has been used to capture the oscillatory behavior in a number of fields (see
Appendix A.3). In the relaxation oscillations regime, the oscillatory behavior of the FHN model is qualitatively similar to the one exhibited by the Morris–Lecar model and other models of biological relevance [
2,
52,
53]. When the time scales are not well separated, the FHN model exhibits oscillations qualitatively similar to the HIV model in
Figure 2.
The use of non-trivial attributes makes the comparison between data and model output computationally less expensive; instead of having to use a large set of points (e.g., the histogram envelope captured by the red curve in
Figure 3B or the full time courses in
Figure 2) one can use a significantly smaller number of attributes. In general, attributes refer to the parameters that characterize the data according to the way one chooses the data to be organized. On one extreme, the attributes are the whole set of data points and one ideally has access to all the model’s state variables. On the other extreme, the attributes are the minimal number of “numbers” necessary to fully characterize the available data. For example, frequency and amplitude are enough to describe a sinusoidal oscillation. A third attribute would be necessary if this oscillation is not centered at zero. Additional attributes such as the time constant governing the transition from an active (up) to a passive (down) phases would be necessary to discriminate between sinusoidal and relaxation oscillations. In the example in
Figure 2, the oscillations can be characterized by the frequency, amplitude and duty cycle. In the example in
Figure 3, the data consists of the interspike (or interpeak) intervals (ISIs or IPIs) histogram, which releases us from making detailed spike-time comparisons that could be strongly affected by noise and therefore complicates the error minimization process. This histogram can be characterized by four attributes (
Figure 3B): the histogram peak
, the characteristic rise and decay constants
and
, respectively, and the value of
at which the histogram “begins”,
. Additional attributes are the mean and variance of the ISI distribution. (Neuronal spiking behavior typically approximates a Poisson process [
54,
55] and therefore the mean ISI is equal to its standard deviation.) Similar type of histograms could be computed for the frequency, amplitude and duty cycle in the HIV-1 viral kinetics example in
Figure 2C. Note that attributes are used in connection with the data, while parameters are used in connection to the model used to generate or describe these data. A detailed discussion about the relationship between attributes (of patterns of activity) and model parameters in simple models including the prototypical FitzHugh–Nagumo oscillator [
49] and a Hepatitis C viral model [
56] (see also [
57]) is presented in the
Appendix A.
Oscillations are ubiquitous in dynamical systems, particularly in biological systems [
2,
21,
52,
53,
58,
59,
60,
61,
62,
63] where degeneracy has been observed, not only in models, but also in experiments [
64,
65,
66] (see also [
32,
33]). The mechanisms underlying the presence of degeneracies in biological oscillations are not well understood. It is also unclear how the presence of degeneracies affect the parameter estimation process for models describing biological oscillations and oscillations in general.
We have identified a family of canonical models for unidentifiability/degeneracy where the limit cycle trajectories are unidentifiable and the degeneracy can be analytically computed. There are infinitely many parameter sets that produce identical stable oscillations, except possible for a for a phase shift, which depends on the initial phase. In other words, all the attributes of the of the oscillatory patterns are degenerate and have the same level sets, which can be analytically computed. We refer to the generic formulation of this model as the
-
model. It is a generalization of the so-called
-
system [
67,
68], also referred to as the Poincaré oscillator [
69,
70]. These are real-valued special cases of the complex Ginzburg–Landau equation [
71,
72,
73,
74] and their role for modeling biological systems are discussed in detail in [
58] (see references therein). The general formulation can exhibit bistability and a number of bifurcations. The classical
-
exhibits sinusoidal oscillations. Extended versions of this model produce oscillations with more complex waveforms, including relaxation and spike-like oscillations, and a rich repertoire of dynamic behavior, including bistability and even multistability. In contrast to other models showing structural unidentifiability due to redundancy in the number of parameters with physical meaning (see
Appendix A for a detailed discussion), the number of parameters in the
-
models cannot be reduced without reducing the model complexity. To our knowledge, the issues of unidentifiability and degeneracy in the family of
-
models has not been raised before.
Because of the properties described above, the family of - models are an ideal system to ask fundamental questions about degeneracy of dynamical systems and their relationship with parameter estimation unidentifiability. The goal of this paper is to address these issues. Our results contribute to the development of a framework for the investigation of unidentifiability in dynamic models.
3. Results
3.1. Unidentifiability and Degeneracy: A Preliminary Discussion Using Simple Models
Here, we introduce some ideas related to unidentifiability and degeneracy in dynamical systems, including the concepts of activity attributes, level sets and parameter redundancy in models and their outputs. We do this in the context of relatively simple models. A detailed mathematical description of these models and the unidentifiability analysis are presented in the
Appendix A.
In the 1D minimal model example discussed in the
Appendix A.1, the minimal model has no level sets. The number of minimal model parameters is equal to the number of attributes that characterize the pattern. In fact, the minimal model parameters are themselves the attributes. In the 2D minimal model example discussed in the
Appendix A.2, there are no level sets for the time constant
and this attribute is determined uniquely by the minimal model parameter
b in Equation (
A6). In contrast, the damped oscillations frequency
(
A7) is a combination of the minimal model parameters
b and
c and has level sets in this parameter space. Clearly, if one can estimate the two attributes
and
, one can identify the minimal model parameters
b and
c. If one can only estimate
, then the model has degeneracy. Typically, by design, the minimal model parameters have no physical (or biological) meaning, but only dynamic meaning. They are a minimal set of model parameters in that eliminating one of them reduces the dynamic complexity.
The situation changes when models are created to reproduce or explain experimental or observational data. The number of physically meaningful model parameters is larger than the number of minimal model parameters needed to maintain the model dynamic complexity. The level sets and degeneracy that we find in the realistic models in both cases (
Appendix A.1 and
Appendix A.2) emerge because the minimal model parameters (with dynamic meaning) are combinations of (realistic) parameters with physical meaning. From that perspective, the realistic parameters generate a dynamic redundancy, which is reflected by the fact that the minimal model parameters are combinations of physical parameters. This type of degeneracy cannot be resolved unless one has access to additional data.
The FHN model discussed in the
Appendix A.3 is a non-linear minimal model in the oscillatory regime. In the example presented in the
Appendix A.3, there are level sets, which emerge as two or more minimal model parameter sets dynamically balance each other to maintain an attribute constant (e.g., frequency, see
Figure A3A,B) [
32] similarly to the HIV model oscillations discussed in the context of
Figure 2. Attribute level sets such as frequency, amplitude, and duty cycle do not coincide except perhaps in some specific cases [
32]. Additional degeneracies emerge in realistic models (e.g., the Morris–Lecar model [
32]) as the number of parameters with biological meaning increases with the consequent increase in parameter redundancies. However, it is not always possible to reduce non-linear models to minimal models.
Figure A3C in the
Appendix A.1 illustrates the fact that additional attributes may be necessary to characterize oscillatory patterns. The non-identical sinusoidal and square-wave functions have identical frequency, amplitude and duty cycle. This raises the need of identifying the appropriate attributes to characterize a given pattern and their optimal number.
3.2. The - Systems Are Canonical Models for Unidentifiability/Degeneracy
Here, we discuss the role of the
-
models generically described by Equations (
4)–(
6) as canonical models for unidentifiability where a limit cycle with the same frequency can be generated by multiple combinations of model parameters.
A change of coordinates from Cartesian to polar
transforms system (
4) and (
5) into
The equation for the radius of the limit cycle is independent of the the angular velocity
. Therefore, the limit cycles are circles around zero and their radii
are the solutions of
The number of limit cycles depends on the form and complexity of the function . Since , a limit cycle of radius is stable if and only if . The frequency of the solution along this limit cycle is and the period is . Because, if a stable limit cycle exists, in the limit of the dynamics are described completely by , - systems are often referred to as phase oscillators. Note that corresponds to a fixed point and therefore the - systems always have a fixed point whose stability properties also depend on the sign of .
Limit cycle and frequency degeneracy emerges when the function is defined by two or more parameters. Additional degeneracy in the frequency results from the parameters defining the function .
3.3. Lambda-Omega Systems of Order 2: Limit Cycle Degeneracy
-
systems of order 2 (
-
) correspond to the following quadratic choices for
and
.
where
,
,
a and
b are constant. Equations (
4) and (
5) become
Following [
67], we refer to these models as
-
models. They have been also referred to as Poincaré oscillators [
69] and are real-valued special cases of the complex Ginzburg–Landau equation as mentioned above [
71,
72,
73,
74].
The parameters and control the linear dynamics and the parameters a and b control the contribution of the non-linear terms to the system’s dynamics. In this sense, the - model is minimal. As we will see, the parameters and play a stronger role in controlling the transient dynamics than the parameters a and b.
3.3.1. Limit Cycles, Fixed Points, Stability and Oscillation Frequency
These
-
systems have a single limit cycle for
if
, which is stable if and only if
. Otherwise, the fixed point
is stable. Along the limit cycle, the frequency is constant, and given by
In Cartesian coordinates, along the limit cycle
whose solution satisfying
and
is given by
Solutions for other initial conditions are phase shifts of this solution. This implies that limit cycles are identical and trajectories moving along the limit cycle have the same frequency.
Figure 4a illustrates the
x and
y traces for representative parameter values.
Figure 4b shows the corresponding phase-plane diagram, including the circular limit cycle characteristic of sinusoidal trajectories. Finally,
Figure 4c shows the speed diagram, also characteristic of sinusoidal trajectories.
3.3.2. Limit Cycle Degeneracy: The Level Sets for All Possible Attributes Coincide
The limit cycles have the same amplitude for the infinitely many combinations of values and b for which their quotient is constant and they have the same frequency for the infinitely many combinations of the four parameters such that is constant. The amplitude level sets are curves in the two-dimensional -b parameter space and the frequency level sets are hypersurfaces in the four-dimensional -b--a parameters space. Note that even along an amplitude level set there is frequency degeneracy along the limit cycle.
Figure 5 illustrates the identical limit cycles for different combinations of parameter values. The different phase-plane diagrams supporting this limit cycle degeneracy indicate that the transient dynamics are different for the different cases. This is also clear from Equation (
10). The time constant associated with the evolution of the envelope oscillations approaching the limit cycle (or moving away from it, if it is unstable) is
dependent.
3.3.3. Limit Cycle Degeneracy: Symmetries in the Phase-Plane Diagram
The
x- and
y-nullclines for the
-
system are given by the solutions
and
of
and
respectively.
The phase-plane diagram in
Figure 4b shows the nullclines and trajectory for the solution to the
-
system for representative parameter values whose solution is shown in
Figure 4a. We express the
y-nullcline as a function of
y, and not of
x, because as a function of
x this nullcline is not invertible (
Figure 4b).
System (
14) and (
15) is preserved under the following changes of coordinates: (i)
, (ii)
, and (iii)
, but not under
and
. The preservation property (i) reflects the fact that the two nullclines are odd-functions:
and
. The preservation properties (ii) and (iii) reflect the fact that the two nullclines are the negative of one another
. Taken together, these properties reflect the fact that the two nullclines are orthogonally identical in the sense that they can be obtained from one another by a
rotation (
Figure 4b). The limit cycles are preserved if both nullclines are rotated by the same angle [
74].
Figure 5 illustrates that these asymmetries are preserved for parameter values for different values of the amplitude and frequency of the limit cycle.
3.4. Compensation Mechanisms of Generation of Level Sets in the - Model
Here, we investigate the dynamic mechanisms of generation of level sets in terms of the model parameters by focusing on the geometric properties of the phase-plane diagrams in Cartesian coordinates. We aim to identify the geometric constraints in the phase-plane diagrams for the parameter values that belong to the same level sets and give rise to the limit cycle degeneracy.
In our analysis, we assume to have no information about the properties of the limit cycles and level sets from the polar coordinates analysis above. The reason for doing so is to develop tools to analyze the models when the symmetries of the - model are broken and the level sets for the different attributes are split. In these cases, Cartesian coordinates are more amenable for analysis than polar coordinates.
Because of the model symmetries reflected in the geometric properties of the
x- and
y-nullclines, it is natural to assume that if a limit cycle exists, then it has circular shape and it is centered at the origin (
Figure 6).
3.4.1. Level Sets in the -b Parameter Space
Setting
in the
x-nullcline (
21) shows that it crosses the
x-axis at
, independently of the values of
and
. Likewise, setting
in the
y-nullcline (
22) shows that it crosses the
y-axis at
, independently of the values of
and
(
Figure 7a).
The assumption that the limit cycle is circular and centered at the origin implies that the limit cycle trajectory is vertical on the x-axis and horizontal on the y-axis. This requires limit cycle trajectory to intersect the x-nullcline when the latter intersects the x-axis (at ) and to intersect the y-nullcline when the latter intersects the y-axis (at ). In other words, the intersection of the x-nullcline with the x-axis (or the intersection of the y-nullcline with the y-axis) determines the radius of the limit cycle trajectory without the need of the transformation to polar coordinates.
In general, changes in
and
b change the shape of the nullclines (compare
Figure 5a–c). However, when
and
b change in such a way that
remains constant, then the intersection of the
x-nullcline with the
x-axis is constant and therefore the radius of the limit cycle is constant. In other words, along a given level set,
and
b compensate for each other so as to maintain the intersection of the
x-nullcline with the
x-axis constant (or, by symmetry, the intersection of the
y-nullcline with the
y-axis constant).
Along these level sets, the frequency of the oscillations on the limit cycle is constant since from (
14) and (
15) with
it follows that
whose solution is a sinusoidal function of frequency
and phase
. This implies that solutions belonging to the same level set are identical.
3.4.2. Frequency Level Sets in the -a Parameter Space: Static Compensation Mechanisms When and Belong to the Same Level Set
Computation of the derivative of the
x-nullcline (
21) at
gives
Similarly, computation of the derivative of the
y-nullcline (
22) at
gives
Therefore, the nullclines for parameter values on the same frequency level set (
) for the same value of
are tangent at their point of intersection (
Figure 7b). This constraint is lost when
changes, even though the parameter values may belong to the same amplitude level set (
Figure 7c).
3.4.3. Frequency Level Sets in the -a Parameter Space: Dynamic Compensation Mechanisms When and b Do Not Belong to the Same Level Set
Here, we focus on frequency level sets (
) for parameter values such that
. In these cases, the oscillation amplitudes are different (
Figure 8A,B). The
x- and
y-nullclines still intersect the corresponding limit cycles at the
x- and
y-axis, respectively, but, in order to maintain a constant frequency, the speed of the limit cycle trajectory needs to be greater than the limit cycle radius (
Figure 8C).
3.5. Unfolding of Attribute Level Sets for Different Attributes Due to Symmetry Breaking
The complete degeneracy of the limit cycle oscillations’ in the
-
model results from a number of symmetries present in the model, and a break in this symmetry is expected to disrupt the degeneracy. However, from previous work [
32,
33] and our discussion above in the context of the HIV model (
Figure 2), we expect degeneracies to remain for the attributes characterizing the oscillation patterns. Here, we address this issue by using the modified
-
model (
7) and (
8) where all the functions are quadratic and
r is given by (
6). The resulting modified
-
model reads
and reduces to the
-
model discussed above for
,
,
and
.
Geometrically, from the dynamical systems point of view, there are four primary ways of breaking the symmetry between the x- and y-nullclines: (i) generating of a time scale separation between the two variables, (ii) expanding or shrinking the linear component of one equation with respect to the other, (iii) rotating one nullcline while keeping the other fixed, and (iv) displacing one nullcline while keeping the other fixed.
Here, we systematically analyze the effects of these symmetry breaking actions by choosing the appropriate values of the parameters in Equation (
28) while keeping the parameter values in Equation (
27) fixed. For simplicity, we focus on the amplitude and frequency level sets that emerge for each one of the models. In all cases, the canonical level sets are not preserved and new level sets emerge, which are different for different attributes and the four different ways of breaking the model symmetry. As expected, degeneracy remains for these attributes. As the parameter values move away from symmetry, both the shapes of the limit cycles and the shapes of the speed diagrams change, indicating that new dynamic compensation mechanisms to maintain the attributes’ degeneracy arise.
3.5.1. Onset of Frequency and Amplitude Level Sets Due to the Presence of a Time Scale Separation
A time scale separation between the first and second equations is generated by multiplying each one of the parameters in the
y-equation by a parameter
Figure 8A shows the frequency and amplitude level sets for representative parameter values. The presence of a time scale separation does not break the rotational symmetry of the two nullclines, but only the relative dynamics of the two variables, which is reflected in shapes of the limit cycles (
Figure 8B2,C2) and the non-uniform speed along the limit cycle (
Figure 8B3,C3) as well as the shapes of the oscillatory patterns (
Figure 8B1,C1). The speed diagrams show the presence of dynamic compensation mechanisms responsible for the generation of the level sets.
3.5.2. Onset of Frequency and Amplitude Level Sets Due to Changes in the Linear Properties of the y-Nullcline
The region of linear behavior around the origin of the
y-nullcline can be expanded or shrunk by multiplying the linear parameters
and
by a constant
, while leaving the non-linear parameters
a and
b unchanged
Figure 9 shows our results for three values of
c:
(symmetry),
and
. The amplitude level sets almost coincide, but the frequency level sets are more separated (
Figure 9a). This is reflected in the shapes of the limit cycles (
Figure 9b2,c2) and speed diagrams (
Figure 9b3,c3). The speed diagrams show the presence of dynamic compensation mechanisms responsible for the generation of the level sets. These mechanisms are different from the ones discussed above.
3.5.3. Onset of Frequency and Amplitude Level Sets Due to Rotation of the y-Nullcline with Respect to the x-Nullcline
In order for the
y-nullcline to be rotated by an angle
, the parameters
,
,
and
must be given by (see
Appendix B.1)
Figure 10 shows our results for three values of
:
(symmetry),
and
. The frequency and amplitude level sets do not coincide. The speed diagrams show the presence of dynamic compensation mechanisms responsible for the generation of the level sets. Comparison with the previous cases shows that these mechanisms are different from the ones discussed above. In particular, the speed diagrams in
Figure 10b3,c3 have prominent intersections, which are almost entirely absent in the cases discussed above.
3.5.4. Onset of Frequency and Amplitude Level Sets Due to the Displacement of the y-Nullcline with Respect to the x-Nullcline
The modified
-
system where the
y-nullcline is displaced
units in the horizontal direction from the position determined by system (
14) and (
15) is given by
where
Figure 11 shows our results for three values of
:
(symmetry),
and
. These results are similar to the previous cases, in that the frequency and amplitude level sets do not coincide and the speed diagrams show the presence of dynamic compensation mechanisms responsible for the generation of the level sets with prominent intersections (
Figure 11b3,b3) as is only observed in
Section 3.5.3.
3.6. Lambda-Omega Systems of Order 4: Degeneracy, Hopf Bifurcations and the Emergence of Bistability
-
systems of order 4 (
-
) correspond to the following quartic choices for
and
They are the normal forms of the Hopf bifurcation [
84], and they have been used to model the dynamics of oscillatory systems in response to external inputs [
85,
86].
The limit cycles, if they exist are given by
The fixed points and limit cycles are stable (unstable) if ().
Bistability emerges when there is a stable fixed point, a stable limit cycle and an unstable limit cycle in between. There are a number of alternative scenarios that include the presence of one or two limit cycles. Along the limit cycles, the frequency is given by
The degeneracies that emerge in this system, particularly in the bistable case, are more complex than in the - system discussed above and the level sets live in higher-dimensional parameter spaces. The analysis of these degeneracies and the compensation mechanisms that give rise to them goes along the same lines of the discussion above for the - system; however, a detailed analysis is beyond the scope of this paper.
3.7. Degeneracy in Extended - Systems with Generic Waveforms: From Sinusoidal to Square and Spike-like Waveforms
The limit cycles in the standard
-
systems have sinusoidal waveforms. However, non-sinusoidal waveforms are ubiquitous in biological systems. Variations of the
-
system showing square-wave and spike-like waveforms have been introduced in [
87]. Here, we discuss the presence of degeneracy, in particular the complete limit cycle degeneracy in these systems.
The extended lambda-omega
-
systems (or extended Poincare oscillators) have the form
where
and
The parameters
p (
,
,
a,
b,
c and
d) are non-negative constants. For
, system (
4) and (
5) reduces to the standard
-
system. The parameters
and
control the linear dynamics, the parameters
a and
b control the cubic-like non-linear terms (the non-linearities in the standard
-
system) and the parameters
c and
d control the remaining non-linear terms.
A change of coordinates from Cartesian to polar
transforms system (
4) and (
5) into
where
r is given by (
6)
3.7.1. Sinusoidal, Spike-like and Square-Wave Sustained Oscillatory Patterns for the Extended - System
From Equation (
10), the extended
-
system (
4) and (
5) has a circular limit cycle of radius
provided
(
Figure 4). The limit cycle is stable if and only if
. Otherwise, the fixed point
is stable. The properties of the right-hand side of Equation (
A16)
determine the various types of temporal patterns along these limit cycles.
For
,
is constant (
Figure 12A2), and therefore
-
evolves linearly with time (
Figure 12A3). The solutions are sinusoidal (
Figure 12A1) with period
where
For
,
peaks at
(
Figure 12B2), causing
to evolve in a fast-slow manner (
Figure 12B3) giving rise to spike-like patterns (
Figure 12B1). The period is given by (see
Appendix B)
For
,
peaks twice at
(
Figure 12C2), causing
to evolve in a fast slow manner, changing from fast to slow and vice versa twice within a cycle (
Figure 12C3) and give rise to square-wave patterns (
Figure 12C1). The period is given by (see
Appendix B)
3.7.2. Amplitude and Period Level Sets
The amplitude level sets are as for the standard - system discussed above. The frequency level sets for the extended - system depend on the values of c and d. It is illustrative to consider two separate cases: (i) and (ii) for which the period T can be easily computed.
From (
52), for
, the period level sets are given by the solution to
Similarly, from (
52), for
, the period level sets are given by the solution to
Similarly to the standard
-
system (
), for all combinations of parameter values satisfying
and either (
55) or (
57) the system has the same amplitude and frequency level sets.
3.8. Parameter Estimation Algorithms Recover at Most Parameter Values on a Level Set
One potential way to disambiguate degenerate models is to obtain information about the transient behavior of trajectories. For example, for the - models, the parameter controls the evolution of the envelope of the oscillations with increasing amplitude converging to the limit cycle (with amplitude ). However, in realistic situations one does not have clean access to the transient evolution of trajectories unless one is able to perform perturbations to the oscillatory patterns. It has been argued that the presence of noise is useful to improve parameter estimation results since noise causes trajectories to explore wider regions of the phase space and therefore more information about the dynamics, particularly the transient behavior of trajectories, is available to estimate parameters.
We used the two parameter estimation algorithms described in
Section 2.4. In
Figure 13, we fixed the values of
and attempted to estimate the values of
and
b using ground truth data (GTD) generated by using
. In
Figure 14, we used the same GTD and attempted to estimate the four parameters.
Our results show that relatively small levels of noise are not enough to provide useful information, while higher levels of noise affect the efficacy of the parameter estimation algorithm. While these results are not conclusive, they illustrate that the degeneracy present in the models poses an obstacle to the ability to estimate the model parameters.
4. Discussion
Parameter estimation from experimental or observational data is a critical stage in any modeling study. There are several issues that conspire against the accurate estimation of model parameters, particularly the parameters in dynamic models (e.g., described by differential equations). Data are noisy and therefore one can at best expect to be able to identify a distribution of model parameters centered around the “true” parameter set (if it exists and is unique). In addition, one may not have access to data for all the state variables assumed to describe the process. Moreover, one may have experimental/observational access to discrete events (e.g., sequence neuronal spike times, record of a viral infection), but not the data that underlie the generation of these events (e.g., membrane potential, concentrations of the virus, infected and uninfected cells). Besides, if the data are too noisy or exhibit strong irregularities, one may only have reliable access to a number of activity attributes (e.g., oscillation preferred frequency, neuronal spiking mean firing rate) that may not be enough to capture the complete dynamic structure of these data (e.g., sinusoidal and relaxation oscillations may have the same frequency, amplitude and duty cycle). Furthermore, there may be gaps in the datasets and these gaps may be inconsistent across trials. Finally, even if one has clean experimental/observational access to all the state variables and the data are noiseless, the models may exhibit parameter degeneracy (multiple combinations of parameter give rise to the same pattern), rendering them structurally unidentifiable.
Classical structural unidentifiability is associated with the notion that one can at most identify combinations of unidentifiable model parameters [
25,
26]. This is due to an excess in the number of parameters with physical meaning as compared to the number of parameters necessary to compute the attributes that describe these data. We discussed these ideas in detail using a number of simple representative model examples with increasing levels of complexity in the
Appendix A. A more thorough discussion can be found in [
25,
26] (and references therein).
In this paper, we set out to analyze a different type of structural degeneracy/unidentifiability present in the family of
-
models [
67,
68,
69,
70,
87]. We found that
-
models have infinitely many parameter sets that produce identical stable oscillations, except possible for a phase shift (reflecting the initial phase), but this parameters are not identifiable combinations of unidentifiable parameters as is the case in structural degeneracy. The number of model parameters in the
-
models is minimal in the sense that each one controls a different aspect of the model dynamics and the dynamic complexity of the system would be reduced by reducing the number of parameters. The level sets and the phase-plane diagrams can be computed analytically as well as the degenerate solutions. This adds clarity to the problem, leaving out other possible causes for unidentifiability (e.g., numerical, algorithmic), and facilitates the analytical investigation of the underlying compensation mechanisms.
Mathematically, degeneracy can be described in terms of level sets in parameter space for a given attribute (e.g., frequency, amplitude, duty cycle). An attribute level set is the set of all combinations of parameter values that produce solutions with a constant attribute [
32]. In general, for a given model, level sets for different attributes may or may not coincide or be constrained by one another. For the
-
models, the level sets for all possible attributes coincide.
The family of
-
models produces a rich dynamic behavior, including monostable oscillations, bistability between oscillations and equilibria, Hopf bifurcations, and saddle-node bifurcations of limit cycles. The classical
-
models show sinusoidal oscillations, while the extended
-
models exhibit more complex waveforms, including square-wave (relaxation-type) and spike-like oscillations. Therefore, the
-
models serve as canonical models for degeneracy/unidentifiability for a large variety of realistic oscillatory processes. In addition, these canonical models can be used to investigate the consequences of degeneracy in single cells for their response to external perturbations [
87] and the dynamics of networks in which these cells are embedded.
Degeneracy of oscillatory patterns has been the focus of various studies in the context of neuronal systems both experimentally and theoretically [
32,
33,
64,
88,
89,
90,
91,
92,
93,
94]. Degeneracy is an inherent property of dynamical systems and is believed to arise from compensation mechanisms that generate balances among the participating processes (e.g., neuronal ionic currents) that control the dynamics [
32,
33]. Because the dynamic mechanisms of generation of biological oscillations involve the interplay of positive and negative feedback effects operating at different time scales, unidentifiability is likely to be present in the majority, if not all biological oscillatory systems. Therefore, the
-
models can serve as a first, guiding step to understand the mechanisms leading to degeneracy in biological oscillators and biological oscillatory networks [
67,
68].
The full degeneracy present in the - models results from the symmetries present in the model, which are captured by the phase-plane diagrams. We found that a systematic break of these symmetries for the - models leading to the phase-plane diagrams characteristic of more realistic models (e.g., relaxation oscillators) preserve the degeneracies of the attributes, although the level sets for different attributes do not necessarily coincide. Therefore, the - models serve as reference models to investigate the mechanisms of generation of attribute level sets for biological oscillators as perturbations to the corresponding - models that exhibits full degeneracy. We hypothesize that this is the case for the whole family of - models. More research is needed to test this hypothesis and clarify these issues.
In addition to biological oscillators, our results predict that the phenomenon of degeneracy may arise in other oscillatory systems in physics and chemistry since the
-
models are real-valued special cases of the complex Ginzburg–Landau equation [
71,
72,
73,
74,
95,
96]. Establishing these ideas requires additional research.
Unidentifiability in parameter estimation is a fundamental problem arising in the field of data science in connection to the models needed to analyze the available data. Degeneracy, the other side of the coin, is also a biological fact reported in biological experiments [
64,
65,
66] (see also [
32,
33]) and is expected to be pervasive in oscillatory systems. While on one hand degeneracy reflects the lack of information necessary to understand the underlying mechanism in term of the participating process, on the other hand, degeneracy has been proposed to endow systems with functional flexibility [
97], thus adding a new dimension to the investigation of degenerate/unidentifiable systems. The family of
-
models serves as a framework to investigate these issues and is expected to have implications for the data-driven discovery of non-linear dynamic equations [
19,
20]. Furthermore, the family of
-
models serves as the canonical models to calibrate parameter estimation algorithms. In contrast to the classical unidentifiable models, the family of
-
models is potentially identifiable if one has enough information about the transient behavior. However, as we pointed out above, this is information is not available in realistic situations. Our results using noisy ground truth data where transients are activated by noise were not useful to resolve the unidentifiability.
The family of - models also serves as a framework for the systematic investigation of degeneracy in dynamic models and the interplay between structural unidentifiability and the various forms of unidentifiability raised above resulting for lack of experimental/observational access to the state variables. The development of this framework and the implications for the investigation of degeneracy and identifiability in more realistic models require more research.