Fast Approximation of Over-Determined Second-Order Linear Boundary Value Problems by Cubic and Quintic Spline Collocation

Seiwald, Philipp; Rixen, Daniel J.

doi:10.3390/robotics9020048


by Philipp Seiwald * and Daniel J. Rixen

Department of Mechanical Engineering, Chair of Applied Mechanics, Technical University of Munich, Boltzmannstraße 15, 85748 Garching, Germany

* Author to whom correspondence should be addressed.

Robotics 2020, 9(2), 48; https://doi.org/10.3390/robotics9020048

Submission received: 3 May 2020 / Revised: 8 June 2020 / Accepted: 23 June 2020 / Published: 25 June 2020


Abstract:

We present an efficient and generic algorithm for approximating second-order linear boundary value problems through spline collocation. In contrast to the majority of other approaches, our algorithm is designed for over-determined problems. These typically occur in control theory, where a system, e.g., a robot, should be transferred from a certain initial state to a desired target state while respecting characteristic system dynamics. Our method uses polynomials of maximum degree three/five as base functions and generates a cubic/quintic spline, which is C^{2}/C^{4} continuous and satisfies the underlying ordinary differential equation at user-defined collocation sites. Moreover, the approximation is forced to fulfill an over-determined set of two-point boundary conditions, which are specified by the given control problem. The algorithm is suitable for time-critical applications, where accuracy only plays a secondary role. For consistent boundary conditions, we experimentally validate convergence towards the analytic solution, while for inconsistent boundary conditions our algorithm is still able to find a “reasonable” approximation. However, to avoid divergence, collocation sites have to be appropriately chosen. The proposed scheme is evaluated experimentally through comparison with the analytical solution of a simple test system. Furthermore, a fully documented C++ implementation with unit tests as example applications is provided.

Keywords:

cubic; quintic; spline; collocation; second-order; over-determined; boundary value problem

1. Introduction

1.1. Literature Review

In almost every field of natural science and engineering we face differential equations, which are typically used for modeling dynamical systems. Especially in engineering, the more specific case of boundary value problems (BVP) is very prominent. In other words, one often searches for the behavior of the investigated system between some kind of fixed, i.e., known, spatial or temporal boundaries. One can try to derive the analytical solution to the problem, however for complex systems this can be difficult or even impossible. In such cases, it can be sufficient to approximate the solution for which a variety of techniques exists.

Methods based on finite differences approximate the derivatives by difference quotients to obtain a system of equations which depends only on the primal function. This allows a solution to be found by formulating a linear system of (algebraic) equations representing the transformed system at certain grid points. In contrast, shooting methods aim at the iterative solution of an equivalent initial value problem (IVP), which is typically easier to handle than the original BVP. While for single shooting the IVP is evaluated over the complete time interval, multiple shooting considers a partitioned time domain. Another technique is given by the finite element approach, which is mainly used for partial differential equations (PDE) as they occur, for example, in structural-, thermo-, and fluid dynamics. Typically finite element methods are based on a weak formulation of the residual and on splitting the considered domain into elements on which the local base functions to approximate the solution are defined. Although designed for PDEs, they can be applied to the simpler case of ordinary differential equations (ODE), which are the focus of this contribution (see, for example, [1]). A variety of other techniques exist. However, these three groups appear to be used the most in the field of engineering.

In the following, we approximate the solution through spline collocation using piecewise polynomial trial functions, which is a well-known technique to solve BVPs, see, for example, [2,3,4]. Over the past decades, various algorithms emerged, which can be classified into two types. The first type, also known as smoothest spline collocation (or just spline collocation as in [5]), aims at matching the differential equation at one collocation site per spline segment, which is typically a knot or the mid-point of the segment, while simultaneously forcing the spline to have maximum smoothness, i.e., highest possible continuity [5]. The second type, in [5] called Gaussian collocation, removes the constraints on higher order continuity and instead uses more collocation sites, which are typically chosen to be the Gaussian points of each segment. This class, which is also called orthogonal spline collocation [6], originates from [3] and aims at maximizing the order of convergence. In order to also provide a competitive convergence for the first type of methods, special variants for quadratic and quintic spline collocation have been proposed in [7] and [8], respectively. Optimal methods for quadratic and cubic splines on non-uniform grids, i.e., for an inhomogeneous segmentation of the spline, have been presented in [5].

Note that in addition to general purpose codes, such as COLSYS (FORTRAN) for non-linear mixed-order systems of multi-point boundary value ODEs [9], highly specialized algorithms, which aim at obtaining the best possible approximation for certain use cases, have also been published. Recent examples are methods for integro-differential equations [10] or fractional differential equations [11], which occur in certain material models exhibiting memory effects. Along with ODEs, various kinds of (multi-variable) PDEs have been investigated. See [6] for a survey on corresponding Gaussian collocation methods. Note that, for PDEs, the time domain is typically discretized using finite differences, e.g., by the Crank–Nicolson approach as used in [6], or the second-order backward difference in [10], while spline collocation is used for approximating the spatial variables. Lastly, methods for differential algebraic equations (DAE) have also been developed, e.g., the COLSYS extension COLDAE [12] for semi-explicit DAEs of index 2 and fully implicit DAEs of index 1.

In our opinion, despite the variety of techniques, there is still a lack of simple methods prioritizing execution time over approximation quality, which is essential for time critical control applications. The aim of this contribution is to provide an algorithm satisfying these needs while focusing on the special case of second-order linear ODEs, which are very common for dynamic systems. Moreover, we focus on over-determined BVPs, i.e., where more boundary conditions (BC) are given than necessary. This may at first seem to be a restriction of our algorithm since it needs more information than other implementations; however, it allows us to also consider inconsistent BVPs, i.e., the case where no exact solution to the problem exists.

1.2. Motivation

It might appear strange to search for an approximation of something that actually does not exist. However, we face exactly this situation during walking pattern generation for our humanoid robot LOLA, cf. Figure 1 left. In particular, we use a simplified model of the robot’s multi-body dynamics, cf. Figure 1 right, to plan the center of mass (CoM) motion over a certain time horizon. The planned CoM motion resembles the dynamics of the model, which can be formulated as a second-order linear ODE, and is constrained to certain values on position-, velocity- and acceleration-level at the boundaries of the planning horizon. This leads to an over-determined BVP of the type investigated in this contribution. The BCs at the beginning reflect the current state of the robot, e.g., the standing pose in front of a platform, while the BCs at the end represent the target state, e.g., the standing pose on the other side of the obstacle, cf. Figure 2. For a seamless motion of the robot it is crucial to guarantee the satisfaction of the BCs, i.e., a perfect match of the boundary states. In contrast, it is sufficient to approximate the dynamics of the underlying ODE since it is derived from a simplified model which is an approximation in itself. This is the key idea behind the formulation of an over-determined BVP, which may thus not have a proper “real” solution. To the authors’ knowledge, all comparable algorithms assume that a solution of the BVP exists, while most of them also require it to be unique and “sufficiently” smooth.

Explaining the technical details of the walking pattern generation framework of LOLA goes beyond the scope of this paper. Instead, the interested reader is referred to [13], in which the application of the proposed algorithm is presented. The publication [13] comes with an accompanying video, see [14], which visualizes the sequential steps of the walking pattern generation pipeline of LOLA. The simulations and experiments presented in the video show the successful application of our algorithm. In the following, we do not further restrict ourselves to this special application. Since the proposed algorithm is generic and may also be useful for different applications in robotics, we derive it in the most general way.

1.3. Additional Remarks

Our method can be seen as smoothest spline collocation, i.e., it belongs to the “first” type as classified in Section 1.1. We do not apply special techniques to increase the order of convergence, but instead adhere to its basic form. This leads to a much simpler derivation and implementation. In addition it makes the algorithm faster by sacrificing approximation quality. This complies with the needs of our target application as explained in Section 1.2. We emphasize that our focus lies on simplicity, robustness, and efficiency. Thus we will not search for a mathematical formulation of the convergence order. Instead, the algorithm is evaluated mainly through experiments, where runtime performance is our primary concern.

As summarized in Section 1.1, there exist numerous methods for solving linear BVPs or spline interpolation/collocation in general. For most approaches, solving a large-sparse or small-dense linear system of equations (LSE) represents the main workload. To overcome this bottleneck, parallelized algorithms have been developed, which typically exploit the special structure of the involved matrices, e.g., the scheme proposed in [15] for staircase matrix structures. Although we aim at efficiency, we do not consider explicit parallelization as an acceleration technique. This is because our algorithm is designed for embedded systems, which typically feature only a few physical CPU cores that also run other time-critical tasks. Moreover, the use of GPUs, e.g., through CUDA or OPENCL, is often not feasible since the CPU–GPU interface lacks capabilities for hard real-time requirements. Finally, dedicated to our target application, we only consider (comparatively) small problems with runtimes \approx 1 ms. For such systems the performance boost obtained through parallelization is likely to be canceled out by the synchronization overhead. Nevertheless, our algorithm may also be used for large scale problems. In this case an “off the shelf” parallel solver for dense LSEs may be used, cf. [16]. However, one should keep in mind that by using an iterative scheme execution time is not deterministic anymore, and, even worse, the solver might not converge. As an alternative, there exists an efficient way for the parallel solution of decoupled, multi-dimensional BVPs, which takes advantage of intermediate results, see Appendix C for details.

2. Materials and Methods

In the following, two versions of our algorithm are presented: one using a cubic spline and the other using a quintic spline for approximating the BVP. As the derivations and resulting algorithms are similar, we show the connecting links by presenting both methods in parallel. The version based on cubic splines is naturally simpler; however, using quintic splines leads to a smoother approximation. Indeed, it is C^{4}- instead of C^{2}-continuous, which can be preferable for some use cases. Moreover, quintic splines allow us to directly preset first- and second-order derivatives at both boundaries, which otherwise requires the introduction of virtual control points and in turn can lead to poor results, as discussed in Section 3. Although deriving the proposed collocation algorithm for quintic splines is more involved than for its cubic counterpart, overall performance is superior, because the same approximation quality can be obtained with fewer collocation sites and hence with less computational effort, as shown in Section 3. However, this requires the underlying ODE to be sufficiently smooth. Note that, in contrast to our intention, most other investigations choose quintic splines for approximating fourth-order ODEs, e.g., in [8,10,11,17] (similar to choosing cubic splines for second-order ODEs).

We highlight that the proposed method is not only inspired by, but is also heavily based on the interpolation and collocation algorithms presented in [18,19], respectively. The main contribution of this paper is the combination, extension and runtime optimization of those methods. Moreover, we provide a detailed and self-contained derivation together with a fully documented open-source C++ reference implementation.

As the authors have a background in mechanical engineering, their intention is to present a simple and self-contained derivation of the proposed algorithm, which can be easily understood, implemented, and extended by readers who lack a dedicated mathematical background.

2.1. Problem Statement

Consider the second-order linear time-variant ODE

α (t) \ddot{F} (t) + β (t) \dot{F} (t) + γ (t) F (t) = τ (t) for t \in [t_{0}, t_{n}]

(1)

where \dot{F} (t) and \ddot{F} (t) denote the first and second derivative of the unknown function F (t) with respect to time t, i.e.,

\dot{F} (t) = \frac{d F (t)}{d t} \quad \text{and} \quad \ddot{F} (t) = \frac{d^{2} F (t)}{d t^{2}} .

Note that t does not have to represent time. However, this interpretation is used in the following due to the typical appearance of (1) in dynamical systems. The coefficients α, β, γ, and the right-hand side τ are arbitrary, in general nonlinear, but known functions of t. Let system (1) be constrained by the BCs

\begin{matrix} F (t_{0}) & = F_{0}, & \dot{F} (t_{0}) & = {\dot{F}}_{0}, & \ddot{F} (t_{0}) & = {\ddot{F}}_{0}, \\ F (t_{n}) & = F_{n}, & \dot{F} (t_{n}) & = {\dot{F}}_{n}, & \ddot{F} (t_{n}) & = {\ddot{F}}_{n}, \end{matrix}

(2)

where t_{0} and t_{n} define the considered time interval t \in [t_{0}, t_{n}] and F_{0}, {\dot{F}}_{0}, {\ddot{F}}_{0}, F_{n}, {\dot{F}}_{n}, {\ddot{F}}_{n} are user-defined constants. Then system (1) together with the BCs (2) represents the second-order linear two-point BVP for which an approximation is to be found. Note that (2) considers Dirichlet, Neumann, and second-order BCs independently of each other. In contrast, various other algorithms assume Robin BCs, i.e., a linear combination of Dirichlet and Neumann BCs, which is not equivalent to our approach. Due to (2), the BVP is over-determined and the existence of a solution F (t) depends on the consistency of the BCs with the ODE.

In the following, we investigate the approximation of F (t) through spline collocation, i.e., we generate a spline y (t) which satisfies the underlying ODE (1) at a user-defined set of distinct collocation sites \{t_{k}\}, numbered in increasing order, which lie within the considered interval (t_{0}, t_{n}), i.e.,

α (t_{k}) \ddot{y} (t_{k}) + β (t_{k}) \dot{y} (t_{k}) + γ (t_{k}) y (t_{k}) = τ (t_{k}) for t_{0} < t_{k} < t_{k + 1} < t_{n} .

(3)
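As a minimal sketch of what condition (3) demands at a single collocation site, the following C++ fragment evaluates the residual of the ODE (1) for given values of the spline and its first two time derivatives. The names `LinearOde` and `collocationResidual` are hypothetical and chosen for illustration only; they are not taken from the reference implementation. Collocation requires this residual to vanish at every site t_{k}.

```cpp
#include <cassert>
#include <functional>

// Coefficient functions of the ODE (1): alpha(t)*ddF + beta(t)*dF + gamma(t)*F = tau(t).
// (Hypothetical container for illustration.)
struct LinearOde {
    std::function<double(double)> alpha, beta, gamma, tau;
};

// Residual of the ODE at the site tk for the candidate values y, dy, ddy.
// Condition (3) is satisfied at tk exactly when the returned value is zero.
double collocationResidual(const LinearOde& ode, double tk, double y, double dy, double ddy)
{
    assert(ode.alpha && ode.beta && ode.gamma && ode.tau); // all coefficients must be set
    return ode.alpha(tk) * ddy + ode.beta(tk) * dy + ode.gamma(tk) * y - ode.tau(tk);
}
```

For example, with α = 1, β = t, γ = -2, and τ = 2, the function y = t^2 satisfies the ODE, so the residual evaluated with y = t^2, \dot{y} = 2t, \ddot{y} = 2 is zero at any site.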

Moreover, y (t) is forced to fulfill the BCs (2) at t_{0} and t_{n}, i.e.,

\begin{matrix} y (t_{0}) & = y_{0} = F_{0}, & \dot{y} (t_{0}) & = {\dot{y}}_{0} = {\dot{F}}_{0}, & \ddot{y} (t_{0}) & = {\ddot{y}}_{0} = {\ddot{F}}_{0}, \\ y (t_{n}) & = y_{n} = F_{n}, & \dot{y} (t_{n}) & = {\dot{y}}_{n} = {\dot{F}}_{n}, & \ddot{y} (t_{n}) & = {\ddot{y}}_{n} = {\ddot{F}}_{n} . \end{matrix}

(4)

Here we use y (t) to denote the approximating spline while the exact solution is represented by F (t). For clarity, we also use different denominations for \{t_{k}\} and \{y_{k}\} by using the terms collocation sites and collocation points, respectively. While \{t_{k}\} are user-defined parameters, \{y_{k}\} describe the solution to be found.

The proposed collocation algorithm is strongly related to the interpolation of cubic/quintic splines, which may not be familiar to all readers, and it reuses core elements of the interpolation algorithm. We therefore cannot omit the derivation of spline interpolation, which is recapitulated in Section 2.3, before the proposed collocation method is derived in Section 2.7.

2.2. Spline Parametrization

Before diving into the derivation of algorithms, one first has to decide which spline representation to use. In the literature, formulations such as basis splines (B-splines) are common, since they feature inherent continuity and local control, which typically leads to banded systems [20]. In general, B-splines do not pass through their control points, which seems to make interpolation difficult at first sight; however, efficient algorithms for interpolation and collocation exist, see, for example, [21]. In [20], it has been shown that B-splines might not be as stable and efficient as other representations, namely monomial and Hermite type bases, especially when it comes to implementation. In particular, monomial bases have been recommended due to their superior conditioning, and thus lower roundoff errors. For all three forms, B-spline, Hermite type, and monomial, the core operation during Gaussian collocation is typically the solution of an almost block diagonal (ABD) linear system of equations [6,21]. A generic solver for this type of system is SOLVEBLOK [22], while the special structure occurring for monomial bases is exploited by ABDPACK [23] with increased speed and lower memory consumption. Unfortunately, for smoothest spline collocation as presented in the following, the corresponding collocation matrix is dense, thus we cannot apply these algorithms. However, when compared to Gaussian collocation, the count of collocation sites and thus the dimension of the corresponding LSE is much smaller, which can lead to comparable performance.

Despite the popularity of B-splines, we use the so-called piecewise polynomial (PP) form [21], which describes the spline through the coefficients of interconnected, but independently defined, polynomial segments. We use a special type of monomial bases, namely the canonical form of the polynomials, which may not be as efficient as the choice in [20]; however, it makes our algorithm much simpler. By using this formulation, continuity between the spline segments needs to be explicitly established. The evaluation of the resulting spline, however, boils down to the evaluation of a single polynomial belonging to the corresponding segment, which is in general much quicker than evaluating the equivalent B-spline form. The evaluation of B-splines of degree p with the well-known de Boor’s algorithm [24] takes O (p^{2}) + O (p) operations [25]. There are optimized versions of it as proposed in [25,26]; however, these are numerically less stable [27]. In contrast, evaluating a polynomial of degree p with Horner’s method [28] takes only 2 p, i.e., O (p), operations [29]. This is essential for time-critical applications, where the resulting spline has to be evaluated as quickly as possible. Note that we are free to construct the spline in B-spline formulation and convert it to the corresponding PP form in a post-processing step, see [21]. However, this introduces an additional (expensive) step which we try to avoid since, in our case, not only the evaluation but also the construction of the spline is time critical.
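The operation count of Horner’s method quoted above can be read directly from a short sketch. The function name `horner` and the storage order of the coefficients (highest power first) are assumptions made for this illustration: for a polynomial of degree p, the loop body runs p times and performs one multiplication and one addition per pass, i.e., 2 p operations in total.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Evaluates c[0]*x^p + c[1]*x^(p-1) + ... + c[p] with Horner's method.
// Assumption of this sketch: coefficients are stored from highest to lowest power.
double horner(const std::vector<double>& c, double x)
{
    assert(!c.empty());
    double result = c[0];
    for (std::size_t k = 1; k < c.size(); ++k)
        result = result * x + c[k]; // one multiplication and one addition per pass
    return result;
}
```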

Let the spline y (t) be defined as

y (t) = s_{i} (ξ_{i} (t)) for t_{0} \leq t_{i} \leq t < t_{i + 1} \leq t_{n} with i = 0, \dots, n - 1,

(5)

where s_{i} represents the i-th of the n > 1 spline segments parametrized by the normalized interpolation parameter ξ_{i}. We call t \in [t_{0}, t_{n}] and ξ_{i} \in [0, 1] the global and local interpolation parameters, respectively, for which we choose the linear mapping

ξ_{i} (t) = \frac{1}{h_{i}} t - \frac{t_{i}}{h_{i}} = (t - t_{i}) g_{i}

(6)

with h_{i} > 0 as the duration of the i-th segment, h_{i} = t_{i + 1} - t_{i}, and its reciprocal g_{i} = 1 / h_{i}. The partitioning of the spline into n segments is visualized in Figure 3. In the following, we predominantly derive expressions in local segment space, i.e., with respect to ξ_{i}, since this makes the notation clearer, especially in Section 2.7. Note that, in contrast to some other approaches, we do not require homogeneous partitioning, thus the segmentation can be chosen arbitrarily as long as spline knots do not coincide. However, in Section 2.3, we show that for best numerical stability uniform partitioning should be used.
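The mapping from the global parameter t to a segment index i and local parameter ξ_{i} in (5) and (6) might be sketched as follows. The names `LocalParameter` and `toLocal` are hypothetical. The sketch assumes strictly increasing knots and uses a binary search, which is only one possible lookup strategy; it assigns t = t_{n} to the last segment with ξ = 1, a convention that (5) leaves open.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

struct LocalParameter {
    std::size_t segment; // index i of the segment with t_i <= t < t_{i+1}
    double xi;           // local parameter xi_i in [0, 1]
};

// knots = {t_0, ..., t_n}, strictly increasing, with at least two entries.
LocalParameter toLocal(const std::vector<double>& knots, double t)
{
    assert(knots.size() >= 2);
    assert(t >= knots.front() && t <= knots.back());
    // First knot strictly greater than t; its predecessor starts the segment.
    const auto upper = std::upper_bound(knots.begin(), knots.end(), t);
    const std::size_t i = (upper == knots.end())
        ? knots.size() - 2 // t == t_n: fall back to the last segment
        : static_cast<std::size_t>(upper - knots.begin()) - 1;
    const double g = 1.0 / (knots[i + 1] - knots[i]); // g_i = 1 / h_i
    return {i, (t - knots[i]) * g};                   // Equation (6)
}
```

For the non-uniform knots {0, 1, 3}, the value t = 2 lies in segment i = 1 with ξ_{1} = 0.5.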

In the case of cubic splines (left subscript C), each segment _{C} s_{i} represents a polynomial of degree three,

_{C} s_{i} (ξ_{i}) =_{C} a_{i} ξ_{i}^{3} +_{C} b_{i} ξ_{i}^{2} +_{C} C_{i} ξ_{i} +_{C} d_{i} with i = 0, \dots, n - 1

(7)

where _{C} a_{i}, _{C} b_{i}, _{C} C_{i}, and _{C} d_{i} are its constant coefficients. The first two derivatives of _{C} s_{i} (ξ_{i} (t)) with respect to t are obtained by applying the chain rule:

\begin{matrix} _{C} {\dot{s}}_{i} (ξ_{i}) & = (\frac{d_{C} s_{i}}{d t}) = (\frac{\partial_{C} s_{i}}{\partial ξ_{i}}) (\frac{d ξ_{i}}{d t}) = \frac{3}{h_{i}}_{C} a_{i} ξ_{i}^{2} + \frac{2}{h_{i}}_{C} b_{i} ξ_{i} + \frac{1}{h_{i}}_{C} C_{i}, \end{matrix}

(8)

_{C} {\ddot{s}}_{i} (ξ_{i}) = (\frac{d^{2}_{C} s_{i}}{d t^{2}}) = (\frac{\partial^{2}_{C} s_{i}}{\partial ξ_{i}^{2}}) {(\frac{d ξ_{i}}{d t})}^{2} + (\frac{\partial_{C} s_{i}}{\partial ξ_{i}}) \underbrace{(\frac{d^{2} ξ_{i}}{d t^{2}})}_{= 0} = \frac{6}{h_{i}^{2}}_{C} a_{i} ξ_{i} + \frac{2}{h_{i}^{2}}_{C} b_{i} .

(9)
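Equations (7) to (9) translate almost literally into code. The following sketch evaluates a single cubic segment together with its first two time derivatives; `CubicSegment`, `Sample`, and `evaluate` are hypothetical names, and the nested (Horner-like) arrangement of the terms is a choice made here, not something prescribed by the derivation.

```cpp
#include <cassert>

struct CubicSegment {
    double a, b, c, d; // coefficients of Equation (7)
    double h;          // segment duration h_i > 0
};

struct Sample {
    double value;            // s_i, Equation (7)
    double firstDerivative;  // ds_i/dt, Equation (8)
    double secondDerivative; // d^2 s_i/dt^2, Equation (9)
};

// Evaluates the segment at the local parameter xi in [0, 1].
Sample evaluate(const CubicSegment& s, double xi)
{
    assert(s.h > 0.0);
    const double g = 1.0 / s.h; // d(xi)/dt from Equation (6)
    Sample out;
    out.value = ((s.a * xi + s.b) * xi + s.c) * xi + s.d;
    out.firstDerivative = ((3.0 * s.a * xi + 2.0 * s.b) * xi + s.c) * g;
    out.secondDerivative = (6.0 * s.a * xi + 2.0 * s.b) * g * g;
    return out;
}
```

Each time derivative contributes one factor g_{i} = 1 / h_{i}, which is the only place where the segment duration enters the evaluation.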

For quintic splines (left subscript Q), each segment _{Q} s_{i} represents a polynomial of degree five,

_{Q} s_{i} (ξ_{i}) =_{Q} a_{i} ξ_{i}^{5} +_{Q} b_{i} ξ_{i}^{4} +_{Q} C_{i} ξ_{i}^{3} +_{Q} d_{i} ξ_{i}^{2} +_{Q} e_{i} ξ_{i} +_{Q} f_{i} with i = 0, \dots, n - 1

(10)

where _{Q} a_{i}, _{Q} b_{i}, _{Q} C_{i}, _{Q} d_{i}, _{Q} e_{i}, and _{Q} f_{i} are its constant coefficients. As in the cubic case, we obtain the first four derivatives of _{Q} s_{i} (ξ_{i} (t)) with respect to t through the chain rule:

\begin{matrix} _{Q} {\dot{s}}_{i} (ξ_{i}) & = (\frac{d_{Q} s_{i}}{d t}) = \frac{5}{h_{i}}_{Q} a_{i} ξ_{i}^{4} + \frac{4}{h_{i}}_{Q} b_{i} ξ_{i}^{3} + \frac{3}{h_{i}}_{Q} C_{i} ξ_{i}^{2} + \frac{2}{h_{i}}_{Q} d_{i} ξ_{i} + \frac{1}{h_{i}}_{Q} e_{i}, \end{matrix}

(11)

\begin{matrix} _{Q} {\ddot{s}}_{i} (ξ_{i}) & = (\frac{d^{2}_{Q} s_{i}}{d t^{2}}) = \frac{20}{h_{i}^{2}}_{Q} a_{i} ξ_{i}^{3} + \frac{12}{h_{i}^{2}}_{Q} b_{i} ξ_{i}^{2} + \frac{6}{h_{i}^{2}}_{Q} C_{i} ξ_{i} + \frac{2}{h_{i}^{2}}_{Q} d_{i}, \end{matrix}

(12)

\begin{matrix} _{Q} s_{i}^{(3)} (ξ_{i}) & = (\frac{d^{3}_{Q} s_{i}}{d t^{3}}) = \frac{60}{h_{i}^{3}}_{Q} a_{i} ξ_{i}^{2} + \frac{24}{h_{i}^{3}}_{Q} b_{i} ξ_{i} + \frac{6}{h_{i}^{3}}_{Q} C_{i}, \end{matrix}

(13)

\begin{matrix} _{Q} s_{i}^{(4)} (ξ_{i}) & = (\frac{d^{4}_{Q} s_{i}}{d t^{4}}) = \frac{120}{h_{i}^{4}}_{Q} a_{i} ξ_{i} + \frac{24}{h_{i}^{4}}_{Q} b_{i} . \end{matrix}

(14)
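The numeric factors in (11) to (14) follow from repeatedly differentiating the monomials and scaling by 1 / h_{i} once per derivative. As a cross-check, the sketch below generates them programmatically instead of hard-coding them. The alias `Quintic` and the function `quinticDerivative` are hypothetical; coefficients are assumed to be ordered from the highest to the lowest power.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

// Coefficients {a, b, c, d, e, f} of Equation (10), highest power first.
using Quintic = std::array<double, 6>;

// k-th derivative of the segment with respect to t, evaluated at local parameter xi.
double quinticDerivative(const Quintic& coeff, double h, std::size_t k, double xi)
{
    assert(h > 0.0);
    Quintic c = coeff;
    std::size_t count = c.size(); // number of coefficients still in use
    double scale = 1.0;
    for (std::size_t d = 0; d < k && count > 0; ++d) {
        // Differentiate once with respect to xi: c[j] belongs to the power count-1-j.
        for (std::size_t j = 0; j + 1 < count; ++j)
            c[j] *= static_cast<double>(count - 1 - j);
        --count;    // the constant term vanishes
        scale /= h; // chain rule: one factor d(xi)/dt = 1/h per time derivative
    }
    double result = 0.0;
    for (std::size_t j = 0; j < count; ++j)
        result = result * xi + c[j]; // Horner evaluation of the remaining polynomial
    return result * scale;
}
```

With all coefficients equal to one, h_{i} = 2, and ξ_{i} = 1, the fourth derivative evaluates to (120 + 24) / 2^4 = 9, in agreement with (14).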

2.3. Spline Interpolation: Preliminaries

In the following, we recall how a given set of n + 1 data points \{(t_{i}, y_{i})\} with i = 0, \dots, n can be interpolated with a C^{2} or C^{4} smooth cubic or quintic spline, respectively. The derivation explicitly uses the PP form of the spline and leads to the same algorithm as presented in [18], except for slight modifications in notation. Note that [18] only deals with quintic splines. However, the method for cubic segments presented in this paper is a simplified version of the same scheme. Moreover, the derivation is investigated in more detail than it is in [18]. Readers not interested in these details are referred to Algorithm 1 which summarizes the results of this section.

In contrast to [18], we only consider the case of predefined first- and second-order derivatives at the boundaries of the quintic spline, i.e., we assume {\dot{y}}_{0}, {\ddot{y}}_{0}, {\dot{y}}_{n}, and {\ddot{y}}_{n} to be given (as indicated in Figure 3). For the cubic counterpart, we lose two degrees of freedom, allowing us to predefine only two constraints out of \{{\dot{y}}_{0}, {\ddot{y}}_{0}, {\dot{y}}_{n}, {\ddot{y}}_{n}\}. For the remainder of this section, we restrict ourselves to the case of predefined second-order time derivatives {\ddot{y}}_{0} and {\ddot{y}}_{n} as this allows an efficient algorithm similar to the one presented for quintic splines. Note that this choice includes the common case of natural cubic splines, i.e., {\ddot{y}}_{0} = {\ddot{y}}_{n} = 0. For cubic splines, we postpone the enforcement of the remaining boundary conditions, i.e., {\dot{y}}_{0} and {\dot{y}}_{n}, to the end of Section 2.7.

In the following, we only consider distinct and ascending interpolation sites, i.e., t_{0} \leq t_{i} < t_{i + 1} \leq t_{n} for i = 0, \dots, n - 1.

2.4. Cubic Spline Interpolation: Derivation

Since our goal is a spline which passes through the given data points \{(t_{i}, y_{i})\}, we enforce the interpolation constraints

\begin{matrix} _{C} s_{i} (ξ_{i}) |_{ξ_{i} = 0} & \overset{!}{=} y_{i} & for i = 0, \dots, n - 1 (n eqs .), \\ _{C} s_{i} (ξ_{i}) |_{ξ_{i} = 1} & \overset{!}{=} y_{i + 1} & for i = 0, \dots, n - 1 (n eqs .) . \end{matrix}

(15)

Furthermore, we aim at C^{2} continuity, thus we further require that

\begin{matrix} _{C} {\dot{s}}_{i} (ξ_{i}) |_{ξ_{i} = 1} & \overset{!}{=}_{C} {\dot{s}}_{i + 1} (ξ_{i + 1}) |_{ξ_{i + 1} = 0} = {\dot{y}}_{i + 1} & for i = 0, \dots, n - 2 (n - 1 eqs .), \\ _{C} {\ddot{s}}_{i} (ξ_{i}) |_{ξ_{i} = 1} & \overset{!}{=}_{C} {\ddot{s}}_{i + 1} (ξ_{i + 1}) |_{ξ_{i + 1} = 0} = {\ddot{y}}_{i + 1} & for i = 0, \dots, n - 2 (n - 1 eqs .) . \end{matrix}

(16)

Inserting (7) and (9) into (15) and in the second row of (16) allows us to reformulate the spline coefficients as

\begin{matrix} _{C} a_{i} & = & - \frac{1}{6} {\ddot{y}}_{i} h_{i}^{2} & + \frac{1}{6} {\ddot{y}}_{i + 1} h_{i}^{2}, \\ _{C} b_{i} & = & + \frac{1}{2} {\ddot{y}}_{i} h_{i}^{2}, \\ _{C} C_{i} & = & - y_{i} & + y_{i + 1} & - \frac{1}{3} {\ddot{y}}_{i} h_{i}^{2} & - \frac{1}{6} {\ddot{y}}_{i + 1} h_{i}^{2}, \\ _{C} d_{i} & = & + y_{i}, \end{matrix}

(17)

where {\ddot{y}}_{i} for i = 1, \dots, n - 1 are the n - 1 unknowns which have to be computed using the remaining n - 1 equations given by the first row of (16). For this purpose, we expand the first row of (16) with (8) and insert _{C} a_{i}, _{C} b_{i}, _{C} C_{i}, and _{C} C_{i + 1} from (17) to obtain

h_{i - 1} {\ddot{y}}_{i - 1} + 2 (h_{i - 1} + h_{i}) {\ddot{y}}_{i} + h_{i} {\ddot{y}}_{i + 1} = \frac{6}{h_{i}} (y_{i + 1} - y_{i}) - \frac{6}{h_{i - 1}} (y_{i} - y_{i - 1}),

(18)

which holds for i = 1, \dots, n - 1. This equation can be considered as defining the intermediate unknowns {\ddot{y}}_{i} and can be written as an LSE of the form

_{C} A_{C} X =_{C} B

(19)

with _{C} A \in \mathbb{R}^{(n - 1) \times (n - 1)} and _{C} X, _{C} B \in \mathbb{R}^{n - 1} given by

_{C} A = [\begin{matrix} _{C} D_{1} & _{C} U_{1} & 0 \\ _{C} L_{2} & _{C} D_{2} & _{C} U_{2} \\ ⋱ & ⋱ & ⋱ \\ _{C} L_{i} & _{C} D_{i} & _{C} U_{i} \\ ⋱ & ⋱ & ⋱ \\ _{C} L_{n - 2} & _{C} D_{n - 2} & _{C} U_{n - 2} \\ 0 & _{C} L_{n - 1} & _{C} D_{n - 1} \end{matrix}],_{C} X = [\begin{matrix} _{C} X_{1} \\ _{C} X_{2} \\ ⋮ \\ _{C} X_{i} \\ ⋮ \\ _{C} X_{n - 2} \\ _{C} X_{n - 1} \end{matrix}],_{C} B = [\begin{matrix} _{C} B_{1} \\ _{C} B_{2} \\ ⋮ \\ _{C} B_{i} \\ ⋮ \\ _{C} B_{n - 2} \\ _{C} B_{n - 1} \end{matrix}],

(20)

and their elements

\begin{matrix} _{C} D_{i} & = 2 (h_{i - 1} + h_{i}),_{C} X_{i} = {\ddot{y}}_{i} & for i = 1, \dots, n - 1, \end{matrix}

(21)

\begin{matrix} _{C} U_{i} & = h_{i},_{C} L_{i + 1} =_{C} U_{i} & for i = 1, \dots, n - 2, \end{matrix}

(22)

\begin{matrix} _{C} B_{i} & = \frac{6}{h_{i}} (y_{i + 1} - y_{i}) - \frac{6}{h_{i - 1}} (y_{i} - y_{i - 1}) +_{C} Λ_{i} & for i = 1, \dots, n - 1, \end{matrix}

(23)

_{C} Λ_{i} = \{\begin{matrix} - h_{0} {\ddot{y}}_{0} - h_{n - 1} {\ddot{y}}_{n} & for n = 2, & (\equiv -_{C} L_{1}_{C} X_{0} -_{C} U_{n - 1}_{C} X_{n}) \\ - h_{0} {\ddot{y}}_{0} & for n > 2 \land i = 1, & (\equiv -_{C} L_{1}_{C} X_{0}) \\ - h_{n - 1} {\ddot{y}}_{n} & for n > 2 \land i = n - 1, & (\equiv -_{C} U_{n - 1}_{C} X_{n}) \\ 0 & else \end{matrix} .

(24)

Herein _{C} L_{i}, _{C} D_{i}, and _{C} U_{i} \in \mathbb{R} represent the lower diagonal, diagonal, and upper diagonal elements of _{C} A, respectively. Note that _{C} A only depends on the choice of \{t_{i}\}, thus it can be reused in case we need to perform further interpolations with the same segmentation \{t_{i}\}. While _{C} A represents the interpolation sites \{t_{i}\}, the interpolation points \{y_{i}\} are encoded in _{C} B. The additional term _{C} Λ_{i} incorporates the BCs such that we can write the system in the neat form of (20). From the solution _{C} X we can directly obtain the unknowns {\ddot{y}}_{i}, which in turn can be used together with the data points \{(t_{i}, y_{i})\} and BCs to compute the segment coefficients (17) such that the spline y (t) is fully defined. Note that _{C} A is symmetric, i.e., _{C} A =_{C} A^{T}; however, we do not make use of this property.
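The assembly of the tridiagonal system according to (21) to (24) might look as follows. This is a sketch under stated assumptions, not the reference implementation: `TridiagonalSystem` and `assembleCubicSystem` are hypothetical names, and the three diagonals are stored as separate vectors in which entry 0 corresponds to the row i = 1.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Diagonals and right-hand side of Equation (19); entry r holds row i = r + 1.
struct TridiagonalSystem {
    std::vector<double> lower;    // L_i (first entry unused, kept at zero)
    std::vector<double> diagonal; // D_i
    std::vector<double> upper;    // U_i (last entry unused, kept at zero)
    std::vector<double> rhs;      // B_i including the boundary term Lambda_i
};

// t, y: the n + 1 data points; ddy0, ddyn: prescribed second derivatives at t_0 and t_n.
TridiagonalSystem assembleCubicSystem(const std::vector<double>& t, const std::vector<double>& y,
                                      double ddy0, double ddyn)
{
    assert(t.size() == y.size() && t.size() >= 3); // n > 1 segments
    const std::size_t n = t.size() - 1;
    std::vector<double> h(n);
    for (std::size_t i = 0; i < n; ++i) {
        h[i] = t[i + 1] - t[i];
        assert(h[i] > 0.0); // distinct, ascending sites
    }
    TridiagonalSystem sys;
    sys.lower.assign(n - 1, 0.0);
    sys.diagonal.assign(n - 1, 0.0);
    sys.upper.assign(n - 1, 0.0);
    sys.rhs.assign(n - 1, 0.0);
    for (std::size_t i = 1; i < n; ++i) {
        const std::size_t r = i - 1;
        sys.diagonal[r] = 2.0 * (h[i - 1] + h[i]);      // Equation (21)
        if (i > 1) sys.lower[r] = h[i - 1];             // L_i = U_{i-1}, Equation (22)
        if (i < n - 1) sys.upper[r] = h[i];             // U_i, Equation (22)
        sys.rhs[r] = 6.0 / h[i] * (y[i + 1] - y[i])
                   - 6.0 / h[i - 1] * (y[i] - y[i - 1]); // Equation (23) without Lambda_i
    }
    // Boundary terms of Equation (24); for n = 2 both act on the same (single) row.
    sys.rhs.front() -= h[0] * ddy0;
    sys.rhs.back() -= h[n - 1] * ddyn;
    return sys;
}
```

Because the diagonals depend only on the sites, a caller performing several interpolations on the same segmentation would only need to recompute `rhs`.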

Although one can compute _{C} X from (19) using an arbitrary solver for linear systems of equations, there is a more efficient way of doing so: since _{C} A is tridiagonal, we can solve (19) with the Thomas algorithm [30,31]. Derived from an L U decomposition of _{C} A, one performs a recursive forward elimination

\begin{matrix} _{C} H_{i} & : = \{\begin{matrix} \frac{_{C} U_{1}}{_{C} D_{1}} & for i = 1, \\ \frac{_{C} U_{i}}{_{C} D_{i} -_{C} L_{i}_{C} H_{i - 1}} & for i = 2, \dots, n - 2 \end{matrix}, \end{matrix}

(25)

\begin{matrix} _{C} P_{i} & : = \{\begin{matrix} \frac{_{C} B_{1}}{_{C} D_{1}} & for i = 1, \\ \frac{_{C} B_{i} -_{C} L_{i}_{C} P_{i - 1}}{_{C} D_{i} -_{C} L_{i}_{C} H_{i - 1}} & for i = 2, \dots, n - 1 \end{matrix} \end{matrix}

(26)

followed by a backward substitution

\begin{matrix} _{C} X_{i} = \{\begin{matrix} _{C} P_{n - 1} & for i = n - 1, \\ _{C} P_{i} -_{C} H_{i}_{C} X_{i + 1} & for i = n - 2, n - 3, \dots, 1 \end{matrix} . \end{matrix}

(27)
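The recursion (25) to (27) can be sketched in a few lines. The function name `solveTridiagonal` and the vector-based storage (all four inputs of equal length m = n - 1, with the first entry of `lower` and the last entry of `upper` ignored) are assumptions of this illustration. No pivoting is performed, which relies on the matrix being diagonally dominant as is the case for _{C} A.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Solves a tridiagonal system with the Thomas algorithm, Equations (25)-(27).
std::vector<double> solveTridiagonal(const std::vector<double>& lower,
                                     const std::vector<double>& diagonal,
                                     const std::vector<double>& upper,
                                     const std::vector<double>& rhs)
{
    const std::size_t m = diagonal.size();
    assert(m >= 1 && lower.size() == m && upper.size() == m && rhs.size() == m);
    std::vector<double> H(m, 0.0), P(m, 0.0), X(m, 0.0);

    // Forward elimination, Equations (25) and (26).
    double denominator = diagonal[0];
    P[0] = rhs[0] / denominator;
    if (m > 1) H[0] = upper[0] / denominator;
    for (std::size_t i = 1; i < m; ++i) {
        denominator = diagonal[i] - lower[i] * H[i - 1];
        if (i + 1 < m) H[i] = upper[i] / denominator; // H is not needed for the last row
        P[i] = (rhs[i] - lower[i] * P[i - 1]) / denominator;
    }

    // Backward substitution, Equation (27).
    X[m - 1] = P[m - 1];
    for (std::size_t i = m - 1; i-- > 0;)
        X[i] = P[i] - H[i] * X[i + 1];
    return X;
}
```

As a small check, the system with diagonal (4, 4), off-diagonal entries 1, and right-hand side (36, 54) has the solution (6, 12); these are the interior second derivatives of y = t^3 at t = 1 and t = 2 for unit spacing.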

Computing

_{C} X

out of

_{C} A

and

_{C} B

thus boils down to

5 n - 9

operations. Note that in contrast to [31] where

A \in α R^{n \times n}

in our case

_{C} A \in α R^{(n - 1) \times (n - 1)}

, thus n changes to

n - 1

for computing the count of operations in total [31]. Since

_{C} A

is symmetric and positive definite, one may think of using an algorithm based on Cholesky factorization instead of

L U

decomposition, as this has proven to be approximately twice as efficient where applicable. However, this rule of thumb seems to be no longer valid for the special case of tridiagonal matrices: the Cholesky factorization

T = L D_{L}^{- 1} L^{T}

in [32], where the computation of the lower diagonal matrix L exploits the special structure and the diagonal matrix

D_{L}

is used to avoid evaluating expensive square roots, leads to

7 n - 10

operations in total.
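The recursion (25)–(27) can be sketched in a few lines of C++. This is a minimal illustration under stated assumptions, not the BROCCOLI implementation: the function name `solveTridiagonal`, the 0-based storage of the three diagonals in `std::vector`, and the convention that the unused corner entries are simply ignored are all hypothetical choices made for this example.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Solves a tridiagonal system with the Thomas algorithm, cf. (25)-(27).
// lower[i] (L), diag[i] (D), upper[i] (U) and rhs[i] (B) describe row i (0-based).
// lower[0] and upper[m-1] do not belong to the matrix and do not affect the result.
// No pivoting is performed, so all intermediate denominators must be non-zero.
std::vector<double> solveTridiagonal(const std::vector<double>& lower,
                                     const std::vector<double>& diag,
                                     const std::vector<double>& upper,
                                     const std::vector<double>& rhs)
{
    const std::size_t m = diag.size();
    assert(m > 0 && lower.size() == m && upper.size() == m && rhs.size() == m);
    std::vector<double> H(m, 0.0), P(m, 0.0), X(m, 0.0);

    // Forward elimination, cf. (25) and (26)
    H[0] = upper[0] / diag[0];
    P[0] = rhs[0] / diag[0];
    for (std::size_t i = 1; i < m; ++i) {
        const double denom = diag[i] - lower[i] * H[i - 1];
        H[i] = upper[i] / denom; // computed for the last row too, but never used there
        P[i] = (rhs[i] - lower[i] * P[i - 1]) / denom;
    }

    // Backward substitution, cf. (27)
    X[m - 1] = P[m - 1];
    for (std::size_t i = m - 1; i-- > 0;)
        X[i] = P[i] - H[i] * X[i + 1];
    return X;
}
```

For the cubic spline, the vectors would have length n - 1 and the returned vector would correspond to the unknown second derivatives at the interior knots.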

The numerical stability of cubic spline interpolation is shown in Appendix A.1.

2.5. Quintic Spline Interpolation: Derivation

As previously mentioned, the following is a detailed version of the derivation given in [18]. Just as in the cubic case, our task is to pass through the given data points

{(t_{i}, y_{i})}

, thus we enforce the interpolation constraints

\begin{matrix} _{Q} s_{i} (ξ_{i}) |_{ξ_{i} = 0} & \overset{!}{=} y_{i} & for i = 0, \dots, n - 1 (n eqs .), \\ _{Q} s_{i} (ξ_{i}) |_{ξ_{i} = 1} & \overset{!}{=} y_{i + 1} & for i = 0, \dots, n - 1 (n eqs .) . \end{matrix}

(28)

We use the additional degrees of freedom to enforce not merely

C^{2}

but

C^{4}

continuity with

\begin{matrix} _{Q} {\dot{s}}_{i} (ξ_{i}) |_{ξ_{i} = 1} & \overset{!}{=}_{Q} {\dot{s}}_{i + 1} (ξ_{i + 1}) |_{ξ_{i + 1} = 0} = {\dot{y}}_{i + 1} & for i = 0, \dots, n - 2 (n - 1 eqs .), \\ _{Q} {\ddot{s}}_{i} (ξ_{i}) |_{ξ_{i} = 1} & \overset{!}{=}_{Q} {\ddot{s}}_{i + 1} (ξ_{i + 1}) |_{ξ_{i + 1} = 0} = {\ddot{y}}_{i + 1} & for i = 0, \dots, n - 2 (n - 1 eqs .), \\ _{Q} s_{i}^{(3)} (ξ_{i}) |_{ξ_{i} = 1} & \overset{!}{=}_{Q} s_{i + 1}^{(3)} (ξ_{i + 1}) |_{ξ_{i + 1} = 0} = y_{i + 1}^{(3)} & for i = 0, \dots, n - 2 (n - 1 eqs .), \\ _{Q} s_{i}^{(4)} (ξ_{i}) |_{ξ_{i} = 1} & \overset{!}{=}_{Q} s_{i + 1}^{(4)} (ξ_{i + 1}) |_{ξ_{i + 1} = 0} = y_{i + 1}^{(4)} & for i = 0, \dots, n - 2 (n - 1 eqs .) . \end{matrix}

(29)

Inserting (10), (11), and (12) into (28) and into the first two rows of (29) allows us to reformulate the spline coefficients to

\begin{matrix} _{Q} a_{i} & = & - 6 y_{i} & + 6 y_{i + 1} & - 3 {\dot{y}}_{i} h_{i} & - 3 {\dot{y}}_{i + 1} h_{i} & - \frac{1}{2} {\ddot{y}}_{i} h_{i}^{2} & + \frac{1}{2} {\ddot{y}}_{i + 1} h_{i}^{2}, \\ _{Q} b_{i} & = & + 15 y_{i} & - 15 y_{i + 1} & + 8 {\dot{y}}_{i} h_{i} & + 7 {\dot{y}}_{i + 1} h_{i} & + \frac{3}{2} {\ddot{y}}_{i} h_{i}^{2} & - {\ddot{y}}_{i + 1} h_{i}^{2}, \\ _{Q} c_{i} & = & - 10 y_{i} & + 10 y_{i + 1} & - 6 {\dot{y}}_{i} h_{i} & - 4 {\dot{y}}_{i + 1} h_{i} & - \frac{3}{2} {\ddot{y}}_{i} h_{i}^{2} & + \frac{1}{2} {\ddot{y}}_{i + 1} h_{i}^{2}, \\ _{Q} d_{i} & = & + \frac{1}{2} {\ddot{y}}_{i} h_{i}^{2}, \\ _{Q} e_{i} & = & + {\dot{y}}_{i} h_{i}, \\ _{Q} f_{i} & = & + y_{i}, \end{matrix}

(30)

where

{\dot{y}}_{i}

and

{\ddot{y}}_{i}

for

i = 1, \dots, n - 1

are the

2 (n - 1)

unknowns which we still have to determine using the remaining

2 (n - 1)

equations given by the last two rows of (29). In particular, we expand the fourth row of (29) with (14) and insert

_{Q} a_{i}

,

_{Q} b_{i}

, and

_{Q} b_{i + 1}

from (30) to obtain

\begin{matrix} - 56 g_{i - 1}^{3} {\dot{y}}_{i - 1} - 8 g_{i - 1}^{2} {\ddot{y}}_{i - 1} - 64 {\dot{y}}_{i} (g_{i - 1}^{3} + g_{i}^{3}) + 12 {\ddot{y}}_{i} (g_{i - 1}^{2} - g_{i}^{2}) - 56 g_{i}^{3} {\dot{y}}_{i + 1} + 8 g_{i}^{2} {\ddot{y}}_{i + 1} = \\ = - 120 g_{i}^{4} (y_{i + 1} - y_{i}) - 120 g_{i - 1}^{4} (y_{i} - y_{i - 1}) . \end{matrix}

(31)

In the same manner, we expand the third row of (29) with (13) and insert

_{Q} a_{i}

,

_{Q} b_{i}

,

_{Q} c_{i}

and

_{Q} c_{i + 1}

from (30) which leads to

\begin{matrix} - 8 g_{i - 1}^{2} {\dot{y}}_{i - 1} - g_{i - 1} {\ddot{y}}_{i - 1} - 12 {\dot{y}}_{i} (g_{i - 1}^{2} - g_{i}^{2}) + 3 {\ddot{y}}_{i} (g_{i - 1} + g_{i}) + 8 g_{i}^{2} {\dot{y}}_{i + 1} - g_{i} {\ddot{y}}_{i + 1} = \\ = 20 g_{i}^{3} (y_{i + 1} - y_{i}) - 20 g_{i - 1}^{3} (y_{i} - y_{i - 1}) . \end{matrix}

(32)

Both (31) and (32) hold for

i = 1, \dots, n - 1

which allows casting them in the form of

_{Q} A_{Q} X =_{Q} B

(33)

with

_{Q} A \in \mathbb{R}^{2 (n - 1) \times 2 (n - 1)}

and

_{Q} X

,

_{Q} B \in \mathbb{R}^{2 (n - 1)}

given by

_{Q} A = [\begin{matrix} _{Q} D_{1} & _{Q} U_{1} & 0 \\ _{Q} L_{2} & _{Q} D_{2} & _{Q} U_{2} \\ ⋱ & ⋱ & ⋱ \\ _{Q} L_{i} & _{Q} D_{i} & _{Q} U_{i} \\ ⋱ & ⋱ & ⋱ \\ _{Q} L_{n - 2} & _{Q} D_{n - 2} & _{Q} U_{n - 2} \\ 0 & _{Q} L_{n - 1} & _{Q} D_{n - 1} \end{matrix}],_{Q} X = [\begin{matrix} _{Q} X_{1} \\ _{Q} X_{2} \\ ⋮ \\ _{Q} X_{i} \\ ⋮ \\ _{Q} X_{n - 2} \\ _{Q} X_{n - 1} \end{matrix}],_{Q} B = [\begin{matrix} _{Q} B_{1} \\ _{Q} B_{2} \\ ⋮ \\ _{Q} B_{i} \\ ⋮ \\ _{Q} B_{n - 2} \\ _{Q} B_{n - 1} \end{matrix}],

(34)

and their block components

\begin{matrix} _{Q} D_{i} & = [\begin{matrix} 64 (g_{i - 1}^{3} + g_{i}^{3}) & 12 κ (g_{i - 1}^{2} - g_{i}^{2}) g_{i} \\ 12 κ (g_{i - 1}^{2} - g_{i}^{2}) g_{i} & 3 κ^{2} (g_{i - 1} + g_{i}) g_{i}^{2} \end{matrix}],_{Q} X_{i} = [\begin{matrix} - {\dot{y}}_{i} \\ \frac{1}{κ g_{i}} {\ddot{y}}_{i} \end{matrix}] & for i = 1, \dots, n - 1, \end{matrix}

(35)

\begin{matrix} _{Q} U_{i} & = [\begin{matrix} 56 g_{i}^{3} & 8 κ g_{i}^{2} g_{i + 1} \\ - 8 κ g_{i}^{3} & - κ^{2} g_{i}^{2} g_{i + 1} \end{matrix}],_{Q} L_{i + 1} =_{Q} U_{i}^{T} & for i = 1, \dots, n - 2, \end{matrix}

(36)

\begin{matrix} _{Q} B_{i} & = [\begin{matrix} - 120 g_{i}^{4} (y_{i + 1} - y_{i}) - 120 g_{i - 1}^{4} (y_{i} - y_{i - 1}) \\ 20 κ g_{i}^{4} (y_{i + 1} - y_{i}) - 20 κ g_{i - 1}^{3} g_{i} (y_{i} - y_{i - 1}) \end{matrix}] +_{Q} Λ_{i} & for i = 1, \dots, n - 1, \end{matrix}

(37)

_{Q} Λ_{i} = \{\begin{matrix} [\begin{matrix} 56 g_{0}^{3} {\dot{y}}_{0} + 8 g_{0}^{2} {\ddot{y}}_{0} + 56 g_{n - 1}^{3} {\dot{y}}_{n} - 8 g_{n - 1}^{2} {\ddot{y}}_{n} \\ 8 κ g_{0}^{2} g_{1} {\dot{y}}_{0} + κ g_{0} g_{1} {\ddot{y}}_{0} - 8 κ g_{n - 1}^{3} {\dot{y}}_{n} + κ g_{n - 1}^{2} {\ddot{y}}_{n} \end{matrix}] & \begin{matrix} for n = 2 \\ (\equiv -_{Q} L_{1}_{Q} X_{0} -_{Q} U_{n - 1}_{Q} X_{n}) \end{matrix}, \\ [\begin{matrix} 56 g_{0}^{3} {\dot{y}}_{0} + 8 g_{0}^{2} {\ddot{y}}_{0} \\ 8 κ g_{0}^{2} g_{1} {\dot{y}}_{0} + κ g_{0} g_{1} {\ddot{y}}_{0} \end{matrix}] & \begin{matrix} for n > 2 \land i = 1 \\ (\equiv -_{Q} L_{1}_{Q} X_{0}) \end{matrix}, \\ [\begin{matrix} 56 g_{n - 1}^{3} {\dot{y}}_{n} - 8 g_{n - 1}^{2} {\ddot{y}}_{n} \\ - 8 κ g_{n - 1}^{3} {\dot{y}}_{n} + κ g_{n - 1}^{2} {\ddot{y}}_{n} \end{matrix}] & \begin{matrix} for n > 2 \land i = n - 1 \\ (\equiv -_{Q} U_{n - 1}_{Q} X_{n}) \end{matrix}, \\ 0 & else \end{matrix} .

(38)

Just as in the cubic case,

_{Q} L_{i}

,

_{Q} D_{i}

, and

_{Q} U_{i} \in \mathbb{R}^{2 \times 2}

represent the lower diagonal, diagonal, and upper diagonal blocks of

_{Q} A

, respectively. As suggested in [18], the additional parameter

κ

in the definitions (35)–(38) is chosen as

κ = \sqrt{\frac{64}{3}} .

(39)

From an analytical point of view,

κ

has no influence on the solution

_{Q} X

(at least if

κ \neq 0

). However, it improves numerical stability which is verified in Appendix A.2.
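As a small illustration of the scaling effect (not a substitute for the analysis in Appendix A.2), consider the special case of a uniform partitioning, which is an assumption made only for this example. Inserting it into (35) shows that the choice (39) equalizes the two diagonal entries of the diagonal block:

```latex
% Illustration only: uniform partitioning g_{i-1} = g_{i} = g is assumed
{}_{Q}D_{i}\,\Big|_{g_{i-1} = g_{i} = g}
  = \begin{bmatrix} 64\,(g^{3} + g^{3}) & 0 \\ 0 & 3\,\kappa^{2}\,(g + g)\,g^{2} \end{bmatrix}
  = \begin{bmatrix} 128\,g^{3} & 0 \\ 0 & 6\,\kappa^{2} g^{3} \end{bmatrix}
  \;\overset{\kappa^{2} = 64/3}{=}\;
  128\,g^{3} \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}
```

Without the scaling, the two rows of the block would differ in magnitude by powers of g, which suggests why a balanced formulation behaves better numerically.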

As in the cubic case,

_{Q} A

is symmetric and only depends on the choice of

{t_{i}}

, while the interpolation points

{y_{i}}

are contained in

_{Q} B

. As before, the additional term

_{Q} Λ_{i}

incorporates the BCs such that we can write the system in the form of (34). In contrast to the cubic case, the solution

_{Q} X

represents not only

{\ddot{y}}_{i}

, but also

{\dot{y}}_{i}

, which is now additionally required to compute the segment coefficients from (30).
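Once the derivatives at the knots are known, evaluating (30) is a purely explicit step. The following sketch shows this for a single segment. It assumes that the segment polynomial is ordered as a ξ^5 + b ξ^4 + c ξ^3 + d ξ^2 + e ξ + f with ξ in [0, 1], which is consistent with the last three rows of (30); the function names `quinticCoefficients` and `evaluateSegment` are hypothetical.

```cpp
#include <array>
#include <cassert>

// Coefficients {a, b, c, d, e, f} of one quintic segment, cf. (30), for
// s(xi) = a xi^5 + b xi^4 + c xi^3 + d xi^2 + e xi + f with xi in [0, 1] (assumed ordering).
// y0, dy0, ddy0: value, first and second time derivative at the left knot,
// y1, dy1, ddy1: the same quantities at the right knot, h: segment duration.
std::array<double, 6> quinticCoefficients(double y0, double dy0, double ddy0,
                                          double y1, double dy1, double ddy1, double h)
{
    assert(h > 0.0);
    const double h2 = h * h;
    return {
        -6.0 * y0 + 6.0 * y1 - 3.0 * dy0 * h - 3.0 * dy1 * h - 0.5 * ddy0 * h2 + 0.5 * ddy1 * h2,
        15.0 * y0 - 15.0 * y1 + 8.0 * dy0 * h + 7.0 * dy1 * h + 1.5 * ddy0 * h2 - ddy1 * h2,
        -10.0 * y0 + 10.0 * y1 - 6.0 * dy0 * h - 4.0 * dy1 * h - 1.5 * ddy0 * h2 + 0.5 * ddy1 * h2,
        0.5 * ddy0 * h2,
        dy0 * h,
        y0};
}

// Evaluates the segment polynomial at xi using Horner's scheme.
double evaluateSegment(const std::array<double, 6>& coefficients, double xi)
{
    double s = 0.0;
    for (double c : coefficients)
        s = s * xi + c;
    return s;
}
```

By construction, evaluating the segment at ξ = 0 and ξ = 1 reproduces the two data points, which is a convenient sanity check for an implementation.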

Since

_{Q} A

is block-tridiagonal, we can again solve (33) efficiently with the generalization of the Thomas algorithm to block-tridiagonal matrices [31]. Based on an

L U

decomposition of

_{Q} A

, we first run a recursive forward elimination

\begin{matrix} _{Q} H_{i} & : = \{\begin{matrix} _{Q} D_{1}^{- 1}_{Q} U_{1} & for i = 1, \\ {(_{Q} D_{i} -_{Q} L_{i}_{Q} H_{i - 1})}^{- 1}_{Q} U_{i} & for i = 2, \dots, n - 2 \end{matrix}, \end{matrix}

(40)

\begin{matrix} _{Q} P_{i} & : = \{\begin{matrix} _{Q} D_{1}^{- 1}_{Q} B_{1} & for i = 1, \\ {(_{Q} D_{i} -_{Q} L_{i}_{Q} H_{i - 1})}^{- 1} (_{Q} B_{i} -_{Q} L_{i}_{Q} P_{i - 1}) & for i = 2, \dots, n - 1 \end{matrix} \end{matrix}

(41)

followed by a backward substitution

\begin{matrix} _{Q} X_{i} = \{\begin{matrix} _{Q} P_{n - 1} & for i = n - 1, \\ _{Q} P_{i} -_{Q} H_{i}_{Q} X_{i + 1} & for i = n - 2, n - 3, \dots, 1 \end{matrix} . \end{matrix}

(42)

Computing

_{Q} X

out of

_{Q} A

and

_{Q} B

requires at most

36 n - 60

operations. Again, [31] uses

A \in \mathbb{R}^{2 n \times 2 n}

while in our case

_{Q} A \in \mathbb{R}^{2 (n - 1) \times 2 (n - 1)}

holds, thus n has to be replaced by

n - 1

when computing the total operation count given in [31]. For this upper bound, explicit computation of the inverse of

_{Q} D_{1}

and

(_{Q} D_{i} -_{Q} L_{i}_{Q} H_{i - 1})

by Gaussian elimination is assumed, which in practice should be avoided by solving a

2 \times 2

LSE instead [31]. Thus, a corresponding implementation can be expected to require even fewer operations.

In Appendix A.2, the interested reader can find considerations on the numerical stability of quintic spline interpolation.
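The block recursion (40)–(42) can be sketched without any linear algebra library, since only 2 × 2 blocks occur. The following is a minimal illustration under stated assumptions: the type aliases `Vec2` and `Mat2`, the helper names, and the use of Cramer's rule for the 2 × 2 solves are choices made for this example and are not taken from BROCCOLI, which relies on EIGEN for these steps.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <vector>

using Vec2 = std::array<double, 2>;
using Mat2 = std::array<double, 4>; // row-major storage: {m00, m01, m10, m11}

Vec2 mul(const Mat2& M, const Vec2& v)
{
    return {M[0] * v[0] + M[1] * v[1], M[2] * v[0] + M[3] * v[1]};
}

Mat2 mul(const Mat2& A, const Mat2& B)
{
    return {A[0] * B[0] + A[1] * B[2], A[0] * B[1] + A[1] * B[3],
            A[2] * B[0] + A[3] * B[2], A[2] * B[1] + A[3] * B[3]};
}

// Solves the 2x2 system M x = v by Cramer's rule (no explicit inverse is formed).
Vec2 solve2(const Mat2& M, const Vec2& v)
{
    const double det = M[0] * M[3] - M[1] * M[2];
    assert(det != 0.0);
    return {(v[0] * M[3] - M[1] * v[1]) / det, (M[0] * v[1] - M[2] * v[0]) / det};
}

// Block variant of the Thomas algorithm, cf. (40)-(42).
// L[i], D[i], U[i], B[i] describe block row i (0-based); L[0] and U[m-1] do not affect the result.
std::vector<Vec2> solveBlockTridiagonal(const std::vector<Mat2>& L, const std::vector<Mat2>& D,
                                        const std::vector<Mat2>& U, const std::vector<Vec2>& B)
{
    const std::size_t m = D.size();
    assert(m > 0 && L.size() == m && U.size() == m && B.size() == m);
    std::vector<Mat2> H(m);
    std::vector<Vec2> P(m), X(m);
    for (std::size_t i = 0; i < m; ++i) {
        Mat2 pivot = D[i]; // D_1 for the first row, D_i - L_i H_{i-1} otherwise
        Vec2 rhs = B[i];
        if (i > 0) {
            const Mat2 LH = mul(L[i], H[i - 1]);
            const Vec2 LP = mul(L[i], P[i - 1]);
            for (std::size_t j = 0; j < 4; ++j)
                pivot[j] -= LH[j];
            rhs = {B[i][0] - LP[0], B[i][1] - LP[1]};
        }
        // The two columns of H_i = pivot^{-1} U_i are obtained from two 2x2 solves, cf. (40)
        const Vec2 h0 = solve2(pivot, {U[i][0], U[i][2]});
        const Vec2 h1 = solve2(pivot, {U[i][1], U[i][3]});
        H[i] = {h0[0], h1[0], h0[1], h1[1]};
        P[i] = solve2(pivot, rhs); // cf. (41)
    }
    // Backward substitution, cf. (42)
    X[m - 1] = P[m - 1];
    for (std::size_t i = m - 1; i-- > 0;) {
        const Vec2 HX = mul(H[i], X[i + 1]);
        X[i] = {P[i][0] - HX[0], P[i][1] - HX[1]};
    }
    return X;
}
```

Solving small systems instead of forming inverses mirrors the recommendation in the text; a production implementation would additionally guard against ill-conditioned pivot blocks.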

2.6. Algorithm for Cubic/Quintic Spline Interpolation

Since the presented derivation is rather lengthy, the key steps for interpolating cubic/quintic splines following the proposed method are summarized in Algorithm 1.

Algorithm 1: Cubic/Quintic Spline Interpolation.

For details on convergence order and approximation error of quintic spline interpolation, the interested reader is referred to [18] where these issues have been experimentally investigated for various examples.

2.7. Spline Collocation: Derivation

The following is based on the collocation algorithm presented in [19,33]. However, we extend the method from cubic to quintic splines. Moreover, we do not use natural splines, but instead integrate the boundary conditions directly into the scheme. Lastly, in contrast to [19,33], we do not need to modify the right-hand side of (1), thus leading to a “true” collocation of the ODE for all collocation sites, which are chosen to be the interior spline knots. As runtime performance is of the highest priority for our application, we choose smoothest spline collocation. This minimizes the count of (expensive) collocation sites, thus reduces the count of equations to solve, and instead uses the available degrees of freedom to force

C^{2}

(cubic spline) or

C^{4}

(quintic spline) continuity. Moreover, in our application,

y (t)

is used as input for controlling the motion of a robot, thus a smooth

y (t)

is equivalent to small changes in joint accelerations, i.e., motor jerks, which in turn improves overall stability during locomotion.

As stated in Section 2.1, we require the approximation

y (t)

to fulfill the underlying ODE at certain collocation sites

{t_{k}}

, see (3), while simultaneously satisfying the boundary conditions as specified in (4). Note that we use the index k instead of i to highlight that our new task consists in collocating the ODE at the interior knots, i.e.,

k = 1, \dots, n - 1

rather than the previously investigated interpolation at all knots, i.e.,

i = 0, \dots, n

. Furthermore, it should be pointed out that although (3) holds, this does not imply that

y (t_{k}) = F (t_{k})

,

\dot{y} (t_{k}) = \dot{F} (t_{k})

, or

\ddot{y} (t_{k}) = \ddot{F} (t_{k})

. In other words,

y (t)

will not coincide with the real solution

F (t)

at the collocation sites

{t_{k}}

. However, it will behave similarly at these spots (meaning that they will satisfy the same Equation (1)), which is illustrated in Figure 4.

As a first step, we introduce the auxiliary variables

λ

,

η

, and

r

which are defined as

\begin{matrix} _{C} λ & : = {[\begin{matrix} y_{1}, \dots, y_{n - 1} \end{matrix}]}^{T}, & _{C} η & : = {[\begin{matrix} {\ddot{y}}_{1}, \dots, {\ddot{y}}_{n - 1} \end{matrix}]}^{T}, & _{C} r & : = {[\begin{matrix} y_{0}, {\ddot{y}}_{0}, y_{n}, {\ddot{y}}_{n} \end{matrix}]}^{T}, \\ _{Q} λ & : = {[\begin{matrix} y_{1}, \dots, y_{n - 1} \end{matrix}]}^{T}, & _{Q} η & : = {[\begin{matrix} {\dot{y}}_{1}, {\ddot{y}}_{1}, \dots, {\dot{y}}_{n - 1}, {\ddot{y}}_{n - 1} \end{matrix}]}^{T}, & _{Q} r & : = {[\begin{matrix} y_{0}, {\dot{y}}_{0}, {\ddot{y}}_{0}, y_{n}, {\dot{y}}_{n}, {\ddot{y}}_{n} \end{matrix}]}^{T}, \end{matrix}

(43)

where

_{C} λ

,

_{C} η

,

_{C} r

and

_{Q} λ

,

_{Q} η

,

_{Q} r

are the corresponding counterparts for the case of cubic and quintic splines, respectively. While

λ

represents the (yet unknown) collocation points

{y_{k}}

,

η

contains their corresponding time derivatives (second derivatives in the cubic case, first and second derivatives in the quintic case), which can be seen as “internal” unknowns, as they will be implicitly defined through an embedded spline interpolation. Lastly,

r

depicts the BCs, where we lack

{\dot{y}}_{0}

and

{\dot{y}}_{n}

in the case of cubic splines as has been previously explained.

From (17), (30), and (43) we observe that the spline segments

s_{i}

are linear with respect to

λ

,

η

, and

r

, i.e.,

s_{i} (ξ_{i}) = \underset{known}{\underset{︸}{(\frac{\partial s_{i} (ξ_{i})}{\partial λ})}} λ + \underset{known}{\underset{︸}{(\frac{\partial s_{i} (ξ_{i})}{\partial η})}} η + \underset{known}{\underset{︸}{(\frac{\partial s_{i} (ξ_{i})}{\partial r}) r}} for i = 0, \dots, n - 1

(44)

holds. The gradients are fully defined by the spline partitioning

{t_{i}}

, which is assumed to be known. Thus the construction of the spline

y (t)

is equivalent to the search for a corresponding

λ

and

η

. Note that to obtain (44), we used (17) and (30), which in turn were derived from fulfilling the interpolation condition together with enforcing continuity of the second time derivative (cubic spline), or first and second time derivative (quintic spline) at the interior knots. In order to accomplish full

C^{2}

and

C^{4}

continuity, we further make use of

A X = B

from (19) and (33), which represents continuity of the first time derivative (cubic spline) or third and fourth time derivative (quintic spline), respectively. In particular, we observe from (23), (24), (37), and (38), that

B

is linear with respect to

λ

and

r

, thus

A X = B = \underset{known}{\underset{︸}{(\frac{\partial B}{\partial λ})}} λ + \underset{known}{\underset{︸}{(\frac{\partial B}{\partial r}) r}}

(45)

holds, where the gradients again depend only on the known partitioning

{t_{i}}

. We further observe that, according to the definitions (21), (35) and (43), we can write the mapping

X = S η \text{ with } _{C} S : = I \text{ and } _{Q} S : = diag (_{Q} S_{1}, \dots,_{Q} S_{n - 1}) \text{ where } _{Q} S_{i} : = [\begin{matrix} - 1 & 0 \\ 0 & \frac{1}{κ g_{i}} \end{matrix}]

(46)

with

I

being the identity matrix of appropriate size. Since

A

and

S

are constant (in this context, “constant” means that an expression does not depend on the yet unknown

λ

or

η

) and assumed to be non-singular, it is clear from (45) that not only

B

, but also

X

and thus

η

are linear with respect to

λ

and

r

. Hence one can write

X = (\frac{\partial X}{\partial λ}) λ + (\frac{\partial X}{\partial r}) r and η = (\frac{\partial η}{\partial λ}) λ + (\frac{\partial η}{\partial r}) r .

(47)

Note that

A

and

_{Q} S

only depend on the known

{t_{i}}

, which allows us to safely differentiate (45) with respect to

λ

and

r

to obtain

A \underset{S (\frac{\partial η}{\partial λ})}{\underset{︸}{(\frac{\partial X}{\partial λ})}} = \underset{known}{\underset{︸}{(\frac{\partial B}{\partial λ})}} and A \underset{S (\frac{\partial η}{\partial r})}{\underset{︸}{(\frac{\partial X}{\partial r})}} = \underset{known}{\underset{︸}{(\frac{\partial B}{\partial r})}} .

(48)

These relations allow us to compute the yet unknown gradients in (47). Note that this can be done very efficiently due to the (block-)tridiagonal form of

A

as already discussed in Section 2.3. Since

S

is diagonal, this property also holds for the product

A S

. However, for best numerical stability, one should solve for the gradients of

X

first and use the mapping (46) to obtain the gradients of

η

afterwards, which is of negligible cost since

S

, and thus also

S^{- 1}

, is diagonal. The right-hand sides necessary to solve (48) only depend on

{t_{i}}

and are derived in Appendix B. Lastly, we insert

η

from (47) into (44) and obtain

s_{i} (ξ_{i}) = [(\frac{\partial s_{i} (ξ_{i})}{\partial λ}) + (\frac{\partial s_{i} (ξ_{i})}{\partial η}) (\frac{\partial η}{\partial λ})] λ + [(\frac{\partial s_{i} (ξ_{i})}{\partial r}) + (\frac{\partial s_{i} (ξ_{i})}{\partial η}) (\frac{\partial η}{\partial r})] r

(49)

or equivalently

s_{i} (ξ_{i}) = \nabla_{λ} s_{i} (ξ_{i}) λ + \nabla_{r} s_{i} (ξ_{i}) r

(50)

with the known spline gradients

\nabla_{λ} s_{i} (ξ_{i}) : = [(\frac{\partial s_{i} (ξ_{i})}{\partial λ}) + (\frac{\partial s_{i} (ξ_{i})}{\partial η}) (\frac{\partial η}{\partial λ})], \nabla_{r} s_{i} (ξ_{i}) : = [(\frac{\partial s_{i} (ξ_{i})}{\partial r}) + (\frac{\partial s_{i} (ξ_{i})}{\partial η}) (\frac{\partial η}{\partial r})] .

(51)

We can obtain the corresponding expressions for the first and second time derivatives by following the exact same scheme. In particular, we get

{\dot{s}}_{i} (ξ_{i}) = \nabla_{λ} {\dot{s}}_{i} (ξ_{i}) λ + \nabla_{r} {\dot{s}}_{i} (ξ_{i}) r, {\ddot{s}}_{i} (ξ_{i}) = \nabla_{λ} {\ddot{s}}_{i} (ξ_{i}) λ + \nabla_{r} {\ddot{s}}_{i} (ξ_{i}) r

(52)

with

\begin{matrix} \nabla_{λ} {\dot{s}}_{i} (ξ_{i}) & : = [(\frac{\partial {\dot{s}}_{i} (ξ_{i})}{\partial λ}) + (\frac{\partial {\dot{s}}_{i} (ξ_{i})}{\partial η}) (\frac{\partial η}{\partial λ})], & \nabla_{r} {\dot{s}}_{i} (ξ_{i}) & : = [(\frac{\partial {\dot{s}}_{i} (ξ_{i})}{\partial r}) + (\frac{\partial {\dot{s}}_{i} (ξ_{i})}{\partial η}) (\frac{\partial η}{\partial r})], \end{matrix}

(53)

\begin{matrix} \nabla_{λ} {\ddot{s}}_{i} (ξ_{i}) & : = [(\frac{\partial {\ddot{s}}_{i} (ξ_{i})}{\partial λ}) + (\frac{\partial {\ddot{s}}_{i} (ξ_{i})}{\partial η}) (\frac{\partial η}{\partial λ})], & \nabla_{r} {\ddot{s}}_{i} (ξ_{i}) & : = [(\frac{\partial {\ddot{s}}_{i} (ξ_{i})}{\partial r}) + (\frac{\partial {\ddot{s}}_{i} (ξ_{i})}{\partial η}) (\frac{\partial η}{\partial r})] . \end{matrix}

(54)

Although computing the spline gradients can be done very efficiently, their mathematical representation is rather lengthy. A formulation of the gradients, which is ready for implementation, is given in Appendix B.
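As a bookkeeping aid for (51), the sizes of the involved quantities can be read off from the definitions in (43). The shorthand m and q for the lengths of η and r is introduced here only for this overview and is not part of the original notation:

```latex
% m := dim(eta), q := dim(r) are shorthand introduced for this overview only
\lambda \in \mathbb{R}^{n-1}, \qquad \eta \in \mathbb{R}^{m}, \qquad r \in \mathbb{R}^{q},
\qquad (m,\, q) =
\begin{cases}
  (n - 1,\; 4)    & \text{cubic spline}, \\
  (2(n - 1),\; 6) & \text{quintic spline},
\end{cases}
\\[1ex]
\underbrace{\nabla_{\lambda} s_{i}(\xi_{i})}_{1 \times (n-1)}
  = \underbrace{\frac{\partial s_{i}(\xi_{i})}{\partial \lambda}}_{1 \times (n-1)}
  + \underbrace{\frac{\partial s_{i}(\xi_{i})}{\partial \eta}}_{1 \times m}\,
    \underbrace{\frac{\partial \eta}{\partial \lambda}}_{m \times (n-1)},
\qquad
\underbrace{\nabla_{r} s_{i}(\xi_{i})}_{1 \times q}
  = \underbrace{\frac{\partial s_{i}(\xi_{i})}{\partial r}}_{1 \times q}
  + \underbrace{\frac{\partial s_{i}(\xi_{i})}{\partial \eta}}_{1 \times m}\,
    \underbrace{\frac{\partial \eta}{\partial r}}_{m \times q}
```

The same sizes apply to the gradients of the first and second time derivatives in (53) and (54).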

Lastly, we fulfill the dynamics of the ODE by inserting (50) and (52) into (3), which leads to

α (t_{k}) {\ddot{s}}_{k} (ξ_{k} = 0) + β (t_{k}) {\dot{s}}_{k} (ξ_{k} = 0) + γ (t_{k}) s_{k} (ξ_{k} = 0) = τ (t_{k})

(55)

for

t_{0} < t_{k} < t_{n}

and

t_{k} < t_{k + 1}

with

k = 1, \dots, n - 1

. This can be formulated as LSE

A_{coll} λ = B_{coll}

(56)

with

A_{coll} = [\begin{matrix} A_{coll,1} \\ ⋮ \\ A_{coll, k} \\ ⋮ \\ A_{coll, n - 1} \end{matrix}] \in \mathbb{R}^{(n - 1) \times (n - 1)}, B_{coll} = [\begin{matrix} B_{coll,1} \\ ⋮ \\ B_{coll, k} \\ ⋮ \\ B_{coll, n - 1} \end{matrix}] \in \mathbb{R}^{(n - 1)}

(57)

where the k-th row of

A_{coll}

and

B_{coll}

corresponds to the collocation site

t_{k}

and is given by

\begin{matrix} A_{coll, k} & = α (t_{k}) \nabla_{λ} {\ddot{s}}_{k} (0) + β (t_{k}) \nabla_{λ} {\dot{s}}_{k} (0) + γ (t_{k}) \nabla_{λ} s_{k} (0) \in \mathbb{R}^{1 \times (n - 1)}, \end{matrix}

(58)

\begin{matrix} B_{coll, k} & = τ (t_{k}) - [α (t_{k}) \nabla_{r} {\ddot{s}}_{k} (0) + β (t_{k}) \nabla_{r} {\dot{s}}_{k} (0) + γ (t_{k}) \nabla_{r} s_{k} (0)] r \in \mathbb{R} . \end{matrix}

(59)

Note that we choose a partitioning of the spline such that the collocation sites coincide with the starting (“left”) knot of each segment, i.e.,

ξ_{k} = 0

, see Figure 4. This allows us to skip the computation of certain spline gradients. Since the underlying ODE is of order two, only gradients of the last three coefficients, i.e.,

_{C} b_{i}

,

_{C} c_{i}

,

_{C} d_{i}

(cubic) or

_{Q} d_{i}

,

_{Q} e_{i}

,

_{Q} f_{i}

(quintic), have to be computed, for details see Appendix B. This simplifies the implementation and improves the overall performance. Solving (56) for

λ

represents the key operation (i.e., bottleneck for large n) of the proposed collocation method, since

A_{coll}

is in general dense while all other operations are either simple explicit expressions or linear systems in (block-)tridiagonal form, which can be solved efficiently. This justifies our strategy to minimize the count of collocation points (and thus the dimension of

A_{coll}

) and instead force high order continuity.

As soon as

λ

has been obtained, we can compute

η

directly from (47), since the gradients of

η

with respect to

λ

and

r

are already available as by-products of computing

λ

, see (48). From

λ

,

η

and

r

the segment coefficients can finally be computed using (17) or (30).
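The assembly of a single row of (56) according to (58) and (59) can be sketched as follows. The sketch assumes that the spline gradients at the collocation site have already been computed and are passed in as plain vectors; the types `GradientSet` and `CollocationRow` and the function `assembleRow` are hypothetical names introduced for this example.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Gradients of s_k(0) and of its first and second time derivative (illustrative container).
struct GradientSet {
    std::vector<double> s, ds, dds;
};

// One row of the collocation system (56).
struct CollocationRow {
    std::vector<double> A; // A_coll,k, cf. (58)
    double b;              // B_coll,k, cf. (59)
};

// alpha, beta, gamma, tau: ODE coefficients and right-hand side evaluated at t_k,
// gradLambda / gradR: spline gradients with respect to lambda and r at xi_k = 0,
// r: vector of boundary conditions, cf. (43).
CollocationRow assembleRow(double alpha, double beta, double gamma, double tau,
                           const GradientSet& gradLambda, const GradientSet& gradR,
                           const std::vector<double>& r)
{
    const std::size_t nLambda = gradLambda.s.size();
    assert(gradLambda.ds.size() == nLambda && gradLambda.dds.size() == nLambda);
    assert(gradR.s.size() == r.size() && gradR.ds.size() == r.size() && gradR.dds.size() == r.size());

    CollocationRow row{std::vector<double>(nLambda), tau};
    for (std::size_t j = 0; j < nLambda; ++j)
        row.A[j] = alpha * gradLambda.dds[j] + beta * gradLambda.ds[j] + gamma * gradLambda.s[j];
    for (std::size_t j = 0; j < r.size(); ++j)
        row.b -= (alpha * gradR.dds[j] + beta * gradR.ds[j] + gamma * gradR.s[j]) * r[j];
    return row;
}
```

Stacking the rows for k = 1, …, n - 1 yields the dense system (56), which is then passed to a general-purpose solver.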

2.8. Satisfying First Order Boundary Conditions for Cubic Splines

The presented method for cubic spline collocation respects

y_{0}

,

{\ddot{y}}_{0}

,

y_{n}

, and

{\ddot{y}}_{n}

as BCs. However, our task was to fulfill all BCs given in (4), which at first seems to be possible only with quintic spline collocation. Nevertheless, also satisfying the first-order BCs, i.e.,

{\dot{y}}_{0}

and

{\dot{y}}_{n}

, can be achieved with moderate effort. For that purpose we insert two auxiliary knots, the so-called virtual control points

t_{virt,1}

and

t_{virt,2}

, which give us the necessary degrees of freedom to introduce additional constraints. The term “virtual” highlights that these points are not used as collocation sites. Both,

t_{virt,1}

and

t_{virt,2}

, have to lie within the specified start and end time and must not coincide with the collocation sites. This way the spline remains properly partitioned. For simplicity, we place the virtual control points at the centers of the (originally) first and last segments, i.e.,

t_{virt,1} : = \frac{t_{orig,0} + t_{orig,1}}{2} and t_{virt,2} : = \frac{t_{orig, n - 1} + t_{orig, n}}{2} .

(60)

Obviously, inserting two knots leads to a different segmentation of the spline, i.e.,

n \to n + 2

; however, the boundaries and collocation sites remain unchanged. If we adapt the indexing such that

t_{1} : = t_{virt,1}

and

t_{n - 1} : = t_{virt,2}

, all findings derived so far are also valid for this case. The only difference is that we do not force

y (t)

to fulfill the underlying ODE at

t_{1}

and

t_{n - 1}

anymore. Instead, we satisfy the BCs

{\dot{y}}_{0}

and

{\dot{y}}_{n}

by replacing the first and last row of

A_{coll}

and

B_{coll}

with

\begin{matrix} A_{coll,1} & = \nabla_{λ} {\dot{s}}_{0} (ξ_{0} = 0), & B_{coll,1} & = {\dot{y}}_{0} - \nabla_{r} {\dot{s}}_{0} (ξ_{0} = 0) r, \\ A_{coll, n - 1} & = \nabla_{λ} {\dot{s}}_{n - 1} (ξ_{n - 1} = 1), & B_{coll, n - 1} & = {\dot{y}}_{n} - \nabla_{r} {\dot{s}}_{n - 1} (ξ_{n - 1} = 1) r . \end{matrix}

(61)

In this way the resulting cubic spline fulfills all BCs given in (4), satisfies the underlying ODE at the specified collocation sites, and is

C^{2}

continuous at the interior spline knots (which include

t_{virt,1}

and

t_{virt,2}

).
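The placement rule (60) and the subsequent re-indexing can be sketched as follows. The function name `insertVirtualControlPoints` and the representation of the partitioning as a sorted `std::vector` of knot times are assumptions made for this example.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Inserts the two virtual control points of (60) into a strictly increasing knot vector
// {t_0, ..., t_n}. The result has n + 2 segments; its second and second-to-last entries
// are the virtual control points, all original knots are kept unchanged.
std::vector<double> insertVirtualControlPoints(const std::vector<double>& knots)
{
    assert(knots.size() >= 3); // at least two original segments, so the virtual points differ
    const std::size_t n = knots.size() - 1; // original segment count
    std::vector<double> result;
    result.reserve(knots.size() + 2);
    result.push_back(knots[0]);
    result.push_back(0.5 * (knots[0] + knots[1])); // t_virt,1
    for (std::size_t i = 1; i < n; ++i)
        result.push_back(knots[i]);                    // original interior knots (collocation sites)
    result.push_back(0.5 * (knots[n - 1] + knots[n])); // t_virt,2
    result.push_back(knots[n]);
    return result;
}
```

For the original knots {0, 1, 2, 3}, this yields {0, 0.5, 1, 2, 2.5, 3}, where only 1 and 2 remain collocation sites.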

2.9. Algorithm for Cubic/Quintic Spline Collocation

We summarize our findings in Algorithm 2. Note that, for the case of cubic splines, the modification necessary to satisfy first-order BCs is already integrated.

Algorithm 2: Cubic/Quintic Spline Collocation.

3. Results

3.1. Implementation

Algorithm 2 delivers a detailed description of the proposed collocation method which can be used as a reference during implementation. However, we also provide a fully documented C++ implementation through our free and open-source library BROCCOLI [34] (see Supplementary Materials). For best usability, the library is designed to be header-only. Although BROCCOLI as a whole also has other dependencies, the functionality discussed in this paper only requires the (also header-only) cross-platform linear algebra system EIGEN [35]. Aside from basic matrix and vector operations, we use EIGEN to solve dense linear systems of equations such as

A_{coll}^{- 1} B_{coll}

in (56), but also

_{Q} D_{1}^{- 1} (\cdot)

and

{(_{Q} D_{i} -_{Q} L_{i}_{Q} H_{i - 1})}^{- 1} (\cdot)

in (40) and (41). In particular, we use a Householder rank-revealing QR decomposition with column-pivoting, see ColPivHouseholderQR in EIGEN, since it provides the best trade-off between speed, accuracy, and robustness for our use case. Note that for large scale systems solving

A_{coll}^{- 1} B_{coll}

turns out to be the bottleneck, cf. Figure 11. Thus, in this case one might choose a parallel solver for (56) instead, for example PartialPivLU in EIGEN with OPENMP enabled. In addition to the variants CubicSplineCollocator and QuinticSplineCollocator for spline collocation (see the ode module of BROCCOLI), cubic and quintic spline interpolation as specified in Algorithm 1 is also implemented (see the curve module of BROCCOLI). An overview of the structure of the source code related to our collocation method is given in Appendix C. Lastly, unit tests, similar to the evaluation presented in the following, are shipped with BROCCOLI. These can be used as example applications.

3.2. Test System

In order to evaluate the proposed method and its variants, a simple mass-spring-damper system is considered, see Figure 5 (left). For the sake of simplicity, we do not consider external excitation, i.e.,

τ (t) : = 0

. The corresponding ODE (1) describing the system dynamics simplifies to

α \ddot{F} (t) + β \dot{F} (t) + γ F (t) = 0

(62)

where

α

,

β

, and

γ

are constants representing the mass and (linear) damping/stiffness coefficients of the system, respectively.

Using textbook mathematics, we can find the analytical solution given by

F (t) = \{\begin{matrix} e^{σ_{1} t} [F_{0} cos (σ_{2} t) + (\frac{{\dot{F}}_{0} - F_{0} σ_{1}}{σ_{2}}) sin (σ_{2} t)] & \begin{matrix} for \frac{β^{2}}{4 α^{2}} - \frac{γ}{α} < 0 \\ (underdamped) \end{matrix}, \\ (\frac{{\dot{F}}_{0} - F_{0} (σ_{1} - σ_{2})}{2 σ_{2}}) e^{(σ_{1} + σ_{2}) t} + (\frac{F_{0} (σ_{1} + σ_{2}) - {\dot{F}}_{0}}{2 σ_{2}}) e^{(σ_{1} - σ_{2}) t} & \begin{matrix} for \frac{β^{2}}{4 α^{2}} - \frac{γ}{α} > 0 \\ (overdamped) \end{matrix}, \\ e^{σ_{1} t} [F_{0} + ({\dot{F}}_{0} - F_{0} σ_{1}) t] & \begin{matrix} for \frac{β^{2}}{4 α^{2}} - \frac{γ}{α} = 0 \\ (critically damped) \end{matrix} \end{matrix}

(63)

where we used the initial conditions

F (t_{0} = 0) = F_{0}

and

\dot{F} (t_{0} = 0) = {\dot{F}}_{0}

, and the abbreviations

σ_{1} : = - \frac{β}{2 α} and σ_{2} : = \sqrt{|\frac{β^{2}}{4 α^{2}} - \frac{γ}{α}|} .

(64)

The characteristic shape of each branch of (63) is visualized in Figure 5 (right) for the parametrization

α = 1

,

β = 1

,

γ = 10

in the underdamped case,

α = 1

,

β = 10

,

γ = 10

in the overdamped case, and

α = 1

,

β = 10

,

γ = 25

in the critically damped case. For the rest of this paper we will adhere to this parametrization. Moreover, we assume the initial conditions to be given with

F_{0} = 1

,

{\dot{F}}_{0} = 0

. Note that by choosing

σ_{1} < 0

we obtain asymptotically stable behavior. As we never constrained the underlying ODE to be stable, our algorithm can also be used to approximate unstable systems. Since tests with

β = - 1

showed results comparable to the underdamped case (with

β = 1

), we omit an explicit discussion of this case for brevity. Lastly, we point out that although (62) describes a very simple system, we also successfully applied the proposed algorithm to the much more complex walking-pattern generation system of our humanoid robot LOLA, cf. [13]. However, the properties and characteristics of the proposed method can be better investigated and explained by means of a less complex test system.
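For reference, the analytical solution (63) with the abbreviations (64) can be evaluated as sketched below. The function name `analyticSolution` is hypothetical, and distinguishing the critically damped case by an exact comparison with zero is a simplification that works for the parametrizations used here but would need a tolerance in general.

```cpp
#include <cassert>
#include <cmath>

// Analytical solution (63) of alpha*F'' + beta*F' + gamma*F = 0
// with initial conditions F(0) = F0 and F'(0) = dF0.
double analyticSolution(double alpha, double beta, double gamma, double F0, double dF0, double t)
{
    assert(alpha != 0.0);
    const double discriminant = beta * beta / (4.0 * alpha * alpha) - gamma / alpha;
    const double sigma1 = -beta / (2.0 * alpha);             // cf. (64)
    const double sigma2 = std::sqrt(std::fabs(discriminant)); // cf. (64)

    if (discriminant < 0.0) // underdamped
        return std::exp(sigma1 * t)
               * (F0 * std::cos(sigma2 * t) + (dF0 - F0 * sigma1) / sigma2 * std::sin(sigma2 * t));
    if (discriminant > 0.0) // overdamped
        return (dF0 - F0 * (sigma1 - sigma2)) / (2.0 * sigma2) * std::exp((sigma1 + sigma2) * t)
               + (F0 * (sigma1 + sigma2) - dF0) / (2.0 * sigma2) * std::exp((sigma1 - sigma2) * t);
    return std::exp(sigma1 * t) * (F0 + (dF0 - F0 * sigma1) * t); // critically damped
}
```

A quick plausibility check is to insert finite-difference approximations of the derivatives into (62) and to confirm that the residual is close to zero for all three parametrizations.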

3.3. Convergence for Consistent and Inconsistent Boundary Conditions

As already mentioned, we focus on over-determined BVPs. Thus, we consider two cases for evaluating our algorithm. For the first analysis, we use the initial conditions

y_{0} = F_{0} = 1

,

{\dot{y}}_{0} = {\dot{F}}_{0} = 0

and correspondingly

{\ddot{y}}_{0} = {\ddot{F}}_{0} = - γ / α

, see (62), together with the analytical solution given in (63) to compute the BCs

y_{n} = F_{n}

,

{\dot{y}}_{n} = {\dot{F}}_{n}

, and

{\ddot{y}}_{n} = {\ddot{F}}_{n}

at

t_{n} = 5

. In this way, the BCs are guaranteed to be consistent because they belong to the same analytic solution

F (t)

. In the second case, we keep the initial BCs unchanged, but force

y_{n} = {\dot{y}}_{n} = {\ddot{y}}_{n} = 0

for

t_{n} = 5

which deviates from the previous solution

F_{n}

,

{\dot{F}}_{n}

, and

{\ddot{F}}_{n}

, especially in the underdamped case, as can be clearly seen in Figure 5 right. Thus, the second analysis handles inconsistent BCs.

In the following, we only focus on the underdamped and overdamped case, since the critically damped case can be seen as a special form of these with

σ_{2} \to 0

. By evaluating both cases, we aim at covering oscillating and non-oscillating dynamics. Moreover, we run tests for different counts of collocation sites

ν : = | {t_{k}} |

, where we use a uniform segmentation of the spline, i.e.,

h_{i} = h_{i + 1} \forall i

. For cubic and quintic spline collocation without virtual control points, all interior knots are collocation sites and thus

ν = n - 1

holds. In contrast,

ν = n - 3

holds for cubic spline collocation with virtual control points, since two of the interior knots are virtual. Note that an inhomogeneous partitioning

h_{i} \neq h_{i + 1}

is evaluated per default in the unit tests provided with BROCCOLI. The approximation of the BVP by spline collocation with consistent BCs and

ν = 1, \dots, 100

is depicted in Figure 6. Note that for cubic spline collocation without virtual control points, the BCs

{\dot{y}}_{0}

and

{\dot{y}}_{n}

are violated (see Section 2.7).

For the case of consistent BCs, we observe that the approximation

y (t)

indeed converges for increasing

ν

to the analytical solution

F (t)

. From a qualitative point of view, the convergence order using a quintic spline (Figure 6 right column) is clearly higher than that of the corresponding cubic counterparts (Figure 6 left and center column). In order to compare convergence using a quantitative measure, we use the root mean square (RMS) of the approximation error

e (t)

and of the residual

r (t)

, defined as

\begin{matrix} RMS (e) & : = \sqrt{\frac{1}{t_{n} - t_{0}} \int_{t_{0}}^{t_{n}} {[e (t)]}^{2} d t} & with e (t) : = F (t) - y (t), \end{matrix}

(65)

\begin{matrix} RMS (r) & : = \sqrt{\frac{1}{t_{n} - t_{0}} \int_{t_{0}}^{t_{n}} {[r (t)]}^{2} d t} & with r (t) : = α \ddot{y} (t) + β \dot{y} (t) + γ y (t) . \end{matrix}

(66)

For numerical evaluation of (65) and (66), we discretize the embedded integral using a time step size of

Δ t = 0.01

for which we obtain the results presented in Figure 7 left. Since we stop the test series at

ν_{\max} = 100

, we find the optimum for all variants to be at

ν_{opt} = ν_{\max}

. However, due to convergence, we expect the theoretical optimum to be at

ν_{opt} \to \infty

. We observe that, also from a quantitative point of view, quintic spline collocation clearly outperforms the cubic variants for the same

ν

. Furthermore, forcing first-order BCs through virtual control points of the cubic spline seems to have a negative influence, especially for increasing

ν

. The influence of virtual control points on the residual is visualized in Figure 8 where peaks in

r (t)

occur at these spots. Note that without virtual control points, the first-order boundary conditions at

t_{0} = 0

and

t_{n} = 5

are missed. Thus, in contrast to the collocation sites, the residual does not drop to zero at the boundaries. By comparing the left and right plot of Figure 8, we observe that

r (t)

behaves similarly for consistent and inconsistent BCs.
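The discretized evaluation of the RMS measures (65) and (66) can be sketched as follows. The text only specifies the step size, so the use of the composite trapezoidal rule is an assumption made for this example, as are the function name `rms` and the callback-based interface.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <functional>

// Root mean square of f over [t0, tn], cf. (65) and (66).
// The integral is approximated by the composite trapezoidal rule (assumed quadrature rule)
// with a step size close to dt, adjusted such that the interval is divided evenly.
double rms(const std::function<double(double)>& f, double t0, double tn, double dt)
{
    assert(tn > t0 && dt > 0.0);
    const long steps = std::max(1L, std::lround((tn - t0) / dt));
    const double h = (tn - t0) / static_cast<double>(steps);
    double integral = 0.0;
    for (long k = 0; k <= steps; ++k) {
        const double value = f(t0 + static_cast<double>(k) * h);
        const double weight = (k == 0 || k == steps) ? 0.5 : 1.0; // end points count half
        integral += weight * value * value * h;
    }
    return std::sqrt(integral / (tn - t0));
}
```

For (65), `f` would return the difference between the analytical solution and the spline, while for (66) it would return the residual of the ODE evaluated on the spline.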

For consistent BCs, we can state that, at least for the investigated test system, the proposed algorithm leads to an approximation which converges to the real solution where the “speed” of convergence depends on the chosen variant of Algorithm 2. For inconsistent BCs, however, our analysis draws a different picture: while we can find an optimal count of collocation sites

ν_{opt}

for each variant and test case, the approximation diverges for

ν \to \infty

, see Figure 7 right and Figure 9. Note that this was expected since we are attempting to find an approximation of a solution which does not actually exist. For small

ν

, we still find a “reasonable”

y (t)

where the spline is still able to smooth out the wrong BCs. However, as we refine the segmentation of the spline, it gets harder to compensate for the error, which in turn leads to undesired oscillations of

y (t)

. Note that although we call this behavior divergence, our collocation algorithm still satisfies all constraints that we defined: using a given partitioning, it fulfills the BCs and satisfies the underlying ODE at the given collocation sites. Thus, the divergence is not a fault of the proposed algorithm, but rather is due to the non-existence of the solution

F (t)

. Moreover, the sensitivity to undesired oscillations depends heavily on the underlying BVP: for our target application of real-time motion planning, we did not observe any oscillations up to

ν = 10^{4}

. The critical value for

ν

is expected to be even higher. However, we could not determine it due to memory limitations of our test system.

In order to avoid undesired oscillations, an optimal

ν

has to be chosen, which seems difficult at first sight, since in practice we do not know the analytical solution and thus cannot evaluate

e (t)

. However, one can use the residual

r (t)

as a measure of the approximation error and use this to formulate a governing optimization for finding

ν_{opt}

. Moreover, one can also use a non-uniform segmentation to give specific sections more weight. Such adaptive techniques for automatic mesh refinement have also been developed for other collocation methods, see, for example, [36]. However, for our application it is sufficient to choose

h_{i}

once (fixed), thus we withhold this idea for future investigations.
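As a rough illustration of the residual-based selection mentioned above, the following sketch picks the count of collocation sites by a brute-force sweep. The sweep is a simple stand-in for the governing optimization, which the text leaves open. The names `pick_nu` and `residual_rms` are hypothetical, and the callable that maps a count of collocation sites to RMS(r) has to be supplied by the user, e.g., by running the collocation and evaluating (66).

```python
def pick_nu(candidates, residual_rms):
    """Return (nu, value) for the candidate with the smallest RMS of the residual.

    residual_rms is a user-supplied callable nu -> RMS(r); how it is obtained
    is outside the scope of this sketch.
    """
    best_nu, best_value = None, float("inf")
    for nu in candidates:
        value = residual_rms(nu)
        if value < best_value:
            best_nu, best_value = nu, value
    return best_nu, best_value


# Toy stand-in for RMS(r): it first decreases and then grows again,
# mimicking the behavior observed for inconsistent BCs.
toy_rms = lambda nu: 1.0 / nu + 1e-4 * nu ** 2
print(pick_nu(range(1, 101), toy_rms))
```

Since every candidate requires one full collocation run, the cost of such a sweep grows with the number of candidates. This is one reason to fix the partitioning once in time-critical applications.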

3.4. Runtime Analysis

In the following, we present measurements obtained with the implementation given in BROCCOLI in the version primo (v1.0.0, commit 89280c69). We used an AMD Ryzen 7 1700X 8x (16x) @ 3.40 GHz with 32 GB DDR4 RAM @ 2133 MHz as hardware backend and Ubuntu 18.04.2 LTS 64-bit (Linux kernel 4.15.0-51) together with Clang (version 6.0.0-1) on optimization level 3 as software basis. Although our algorithm is sequential, we ran different test cases on four physical cores of the CPU in parallel. For all tests, simultaneous multi-threading (SMT) was disabled in the BIOS. For runtime evaluation, we take 1000 samples for every code section in Algorithm 2 and choose the minimum execution time as reference to minimize the risk of wrong measurements due to high system load and context-switching effects.
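The measurement strategy described above (many samples, minimum as reference) can be sketched as follows. This is not the benchmarking code of BROCCOLI. The helper name `min_runtime` is hypothetical, and Python's `time.perf_counter` merely stands in for whatever high-resolution clock the C++ implementation uses.

```python
import time


def min_runtime(section, samples=1000):
    """Run `section` repeatedly and return the smallest wall-clock time in seconds."""
    best = float("inf")
    for _ in range(samples):
        start = time.perf_counter()
        section()
        elapsed = time.perf_counter() - start
        best = min(best, elapsed)
    return best


# Toy usage: time a small summation.
print(min_runtime(lambda: sum(range(1000))))
```

Taking the minimum rather than the mean discards samples that were slowed down by other processes, which matches the motivation given in the text.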

In Figure 10 (left), we present runtime measurements for all three variants of our algorithm. We restrict our analysis to the case of an underdamped parametrization and consistent BCs since we expect comparable results for the other test cases. Since our algorithm has a deterministic runtime which only depends on the chosen variant (cubic/quintic, with/without virtual control points) and

ν

, there is no reason why it should be faster or slower with another parametrization. We observe that quintic spline collocation is more expensive than the cubic counterparts for the same

ν

, which complies with theory since the (block-)tridiagonal system for quintic splines is twice the size of the tridiagonal system for cubic splines. However, for increasing

ν

, the gap becomes smaller since solving the collocation equations, where

A_{coll}

is of the same dimension for cubic and quintic splines, becomes the bottleneck, see Figure 11. Note that there is only a small difference in runtime between cubic spline collocation with and without virtual control points. This also complies with theory, since the only difference is two additional spline knots, which increases the dimension of

A_{coll}

by two. Lastly, the small ripples in Figure 10 (left) are due to vectorization and SIMD optimizations handled by EIGEN and the compiler, which give slightly better performance if the dimension of arrays in memory is a multiple of two.

Comparing runtimes for the same

ν

might not be a meaningful basis for choosing a method. Instead, one is typically interested in getting the best approximation in the shortest time. For this purpose, the RMS of the residual

r (t)

is plotted over runtime in Figure 10 (right). As can be seen, quintic spline collocation significantly outperforms both other variants. Moreover, the poor convergence of cubic spline collocation with virtual control points is also visible in this comparison. In addition to measuring total runtime, we also performed a detailed analysis of the relative cost of each code section of Algorithm 2 during quintic spline collocation. Figure 11 demonstrates that in the vicinity of

ν \approx 160

, evaluating the block-tridiagonal systems to enforce BCs and continuity, and solving

A_{coll} λ = B_{coll}

for actual collocation share approximately the same portion of total runtime. With increasing

ν

, solving for

λ

becomes more relevant since the corresponding system is dense while the block-tridiagonal LSE can be solved efficiently using the recursive scheme discussed in Section 2.3.

Lastly, we want to point out that the condition of

A_{coll}

gets worse for increasing count of collocation sites

ν

, see Figure 10 left. Thus, there might be an upper limit of

ν

for the proposed algorithm.

4. Discussion

As shown in the previous section, the presented algorithm performs well if the BCs are consistent and fully known. Even in the case of inconsistent BCs, we still obtain a reasonable approximation as long as we carefully pick the collocation sites. However, if we exceed the optimum, undesired oscillations may occur, which is an indicator for putting too much emphasis on satisfying the underlying ODE while simultaneously trying to compensate the “broken” BCs. In order to automatically determine an optimal partitioning of the spline, a higher level optimization may be applied, which will be the focus of further investigations.

Obviously, if the investigated BVP is well-posed, i.e., if it is not over-determined, other techniques should be preferred over our approach. However, for applications where the enforcement of certain BCs is more important than approximating the underlying dynamics, the proposed method seems to be a valid approach. Moreover, we want to emphasize that no variable recursion or iteration is involved. This makes execution time predictable, which is especially relevant for real-time applications such as in our use case.

Comparing different variants of our algorithm showed that collocation using a quintic spline is in general superior to using the somewhat simpler cubic splines. Note that although its derivation is more involved, the final implementation is of approximately the same complexity, since virtual control points have to be introduced in the cubic case. At this point, we also want to emphasize that, based on the results of our study, we do not recommend using cubic spline collocation with virtual control points. Although the full set of BCs is satisfied, the additional knots seem to significantly degrade convergence. However, other choices of

t_{virt,1}

and

t_{virt,2}

may lead to different results.

Within this contribution, we only considered second-order BVPs. An extension to ODEs of higher order seems to be straightforward, since only the collocation equations, see (58) and (59), have to be extended by the corresponding gradients while the overall dimension of

A_{coll}

and

B_{coll}

stays the same. Moreover, our approach may be applied to nonlinear systems as well, by using the common approach of linearization and embedding the scheme into a Newton iteration.

Lastly, we want to highlight that our focus lies on runtime efficiency. However, for embedded systems especially, memory consumption may also be a limiting factor. Although we expect our algorithm to have similar requirements when compared to other techniques, we have not looked into this issue so far.

Supplementary Materials

An archived version of the free and open-source C++ header-only library BROCCOLI in the version primo (v1.0.0) which includes the proposed algorithm is available online at https://www.mdpi.com/2218-6581/9/2/48/s1. For the most recent version of BROCCOLI visit https://gitlab.lrz.de/AM/broccoli.

Author Contributions

Conceptualization, P.S.; Data curation, P.S.; Formal analysis, P.S.; Funding acquisition, D.J.R.; Investigation, P.S.; Methodology, P.S.; Project administration, D.J.R.; Resources, P.S.; Software, P.S.; Supervision, D.J.R.; Validation, P.S.; Visualization, P.S.; Writing—original draft, P.S.; Writing—review & editing, P.S. and D.J.R. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the German Research Foundation (DFG), project number 407378162. Moreover, this work was supported by the DFG and the Technical University of Munich (TUM) in the framework of the Open Access Publishing Program.

Acknowledgments

We would like to express special thanks to Nora-Sophie Staufenberg and Felix Sygulla for the constructive discussions during development, implementation and analysis of the presented method.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:

BVP	Boundary Value Problem
IVP	Initial Value Problem
PDE	Partial Differential Equation
ODE	Ordinary Differential Equation
DAE	Differential Algebraic Equation
BC	Boundary Condition
CoM	Center of Mass
ABD	Almost Block Diagonal
LSE	Linear System of Equations
PP	Piecewise Polynomial
RMS	Root Mean Square
SMT	Simultaneous Multi-Threading
DFG	German Research Foundation (Deutsche Forschungsgemeinschaft)
TUM	Technical University of Munich

Appendix A. Considerations on Numerical Stability

Appendix A.1. Cubic Spline Interpolation

The Thomas algorithm is guaranteed to be numerically stable if

_{C} A

is diagonally dominant, i.e., if

|_{C} D_{i}| = |2 (h_{i - 1} + h_{i})| > |_{C} L_{i}| + |_{C} U_{i}| = |h_{i - 1}| + |h_{i}| \quad \text{for} \quad i = 1, \dots, n - 1

(A1)

holds [30], whereas for

i = 1

and

i = n - 1

we use

_{C} L_{1} = 0

and

_{C} U_{n - 1} = 0

, respectively. As is easily verified with (A1), this holds true for any choice of

h_{i - 1} > 0

and

h_{i} > 0

(i.e., for distinct and ascending interpolation sites), thus the presented method is always stable.
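The following sketch illustrates the check (A1) together with a textbook form of the Thomas algorithm. It is a minimal illustration under the assumption that the bands of the cubic system are exactly those appearing in (A1), i.e., a diagonal of 2 (h_{i-1} + h_{i}) with off-diagonals h_{i-1} and h_{i}. The function names are hypothetical and do not mirror BROCCOLI.

```python
def is_diagonally_dominant(lower, diag, upper):
    """Check |D_i| > |L_i| + |U_i| row by row, cf. (A1)."""
    return all(abs(d) > abs(l) + abs(u) for l, d, u in zip(lower, diag, upper))


def thomas_solve(lower, diag, upper, rhs):
    """Solve a tridiagonal system without pivoting.

    lower[0] and upper[-1] lie outside the matrix and do not affect the result.
    """
    n = len(diag)
    c = [0.0] * n  # modified upper diagonal
    d = [0.0] * n  # modified right-hand side
    c[0] = upper[0] / diag[0]
    d[0] = rhs[0] / diag[0]
    for i in range(1, n):  # forward elimination
        denom = diag[i] - lower[i] * c[i - 1]
        c[i] = upper[i] / denom
        d[i] = (rhs[i] - lower[i] * d[i - 1]) / denom
    x = [0.0] * n
    x[-1] = d[-1]
    for i in range(n - 2, -1, -1):  # back substitution
        x[i] = d[i] - c[i] * x[i + 1]
    return x


def cubic_bands(h):
    """Bands for the interior knots i = 1..n-1 from n >= 2 segment lengths h."""
    n = len(h)
    diag = [2.0 * (h[i - 1] + h[i]) for i in range(1, n)]
    lower = [0.0] + [h[i - 1] for i in range(2, n)]
    upper = [h[i] for i in range(1, n - 1)] + [0.0]
    return lower, diag, upper


h = [0.5, 1.0, 0.25, 2.0]  # arbitrary positive segment lengths
lower, diag, upper = cubic_bands(h)
print(is_diagonally_dominant(lower, diag, upper))  # True for any h_i > 0
```

Because the elimination never divides by a small pivot for a diagonally dominant matrix, no row exchanges are needed, which keeps the runtime deterministic.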

Appendix A.2. Quintic Spline Interpolation

The presented scheme for solving block-tridiagonal systems is a special form of block-Gaussian elimination without pivoting and is guaranteed to be numerically stable if

_{Q} A

is block-diagonally dominant, i.e., if

Γ_{i} := ∥_{Q} D_{i}^{- 1}∥ (∥_{Q} L_{i}∥ + ∥_{Q} U_{i}∥) \leq 1 \quad \text{for} \quad i = 1, \dots, n - 1

(A2)

holds for an arbitrary matrix norm

∥\cdot∥

[37]. Note that for

i = 1

and

i = n - 1

we set

_{Q} L_{1} = 0

and

_{Q} U_{n - 1} = 0

, respectively. In order to verify (A2) for

_{Q} D_{i}

,

_{Q} L_{i}

, and

_{Q} U_{i}

as specified in (35) and (36), we assume a constant ratio

ω = g_{i} / g_{i - 1} = g_{i + 1} / g_{i}

to obtain

_{Q} D_{i} = g_{i}^{3} \underbrace{[\begin{matrix} 64 (\frac{1}{ω^{3}} + 1) & 12 κ (\frac{1}{ω^{2}} - 1) \\ 12 κ (\frac{1}{ω^{2}} - 1) & 3 κ^{2} (\frac{1}{ω} + 1) \end{matrix}]}_{= :_{Q} {\hat{D}}_{i} (ω)}, \quad _{Q} L_{i} = g_{i}^{3} \underbrace{[\begin{matrix} 56 \frac{1}{ω^{3}} & - 8 κ \frac{1}{ω^{3}} \\ 8 κ \frac{1}{ω^{2}} & - κ^{2} \frac{1}{ω^{2}} \end{matrix}]}_{= :_{Q} {\hat{L}}_{i} (ω)}, \quad _{Q} U_{i} = g_{i}^{3} \underbrace{[\begin{matrix} 56 & 8 κ ω \\ - 8 κ & - κ^{2} ω \end{matrix}]}_{= :_{Q} {\hat{U}}_{i} (ω)}

(A3)

and further

Γ_{i} (ω) = ∥_{Q} D_{i}^{- 1} (ω)∥ (∥_{Q} L_{i} (ω)∥ + ∥_{Q} U_{i} (ω)∥) = ∥_{Q} {\hat{D}}_{i}^{- 1} (ω)∥ (∥_{Q} {\hat{L}}_{i} (ω)∥ + ∥_{Q} {\hat{U}}_{i} (ω)∥)

(A4)

which shows that

Γ_{i}

does not depend on

g_{i}

for a constant ratio

ω

. It can be easily verified through numerical evaluation of (A4) that a homogeneous partitioning of the spline, i.e.,

ω = 1

, results in the lowest value for

Γ_{i}

. Using the spectral matrix norm and the special choice of

κ

given in (39) leads to

Γ_{i} (ω = 1) \approx 1.24 \nleq 1 \quad \text{for} \quad i = 2, \dots, n - 2 .

(A5)

Unfortunately, condition (A2) is violated, even for the ideal case of a homogeneously partitioned spline. However, (A2) represents a sufficient but not necessary condition for numerical stability, thus we can still use

Γ_{i}

as a measure for the pivotal growth [37] which should be minimized. In doing so, we can conclude that for best numerical stability one should use a “reasonable” ratio

ω

which is close to 1.
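The numerical evaluation of (A4) referred to above can be reproduced with a few lines. The sketch below takes the scaled blocks exactly as printed in (A3) and uses the value κ = \sqrt{64 / 3} for the special choice (39), which is the value stated at the end of this appendix. The helper names are hypothetical, and the closed-form spectral norm is valid for 2x2 matrices only.

```python
import math

KAPPA = math.sqrt(64.0 / 3.0)  # special choice of kappa, cf. (39)


def spectral_norm_2x2(m):
    """Largest singular value of a 2x2 matrix [[a, b], [c, d]]."""
    (a, b), (c, d) = m
    frob2 = a * a + b * b + c * c + d * d  # trace of M^T M
    det = a * d - b * c                    # det(M^T M) equals det^2
    disc = math.sqrt(max(frob2 * frob2 - 4.0 * det * det, 0.0))
    return math.sqrt(0.5 * (frob2 + disc))


def inverse_2x2(m):
    """Inverse of a regular 2x2 matrix."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]


def gamma(omega, kappa=KAPPA):
    """Evaluate (A4) for the scaled blocks of (A3) using the spectral norm."""
    k, w = kappa, omega
    D = [[64.0 * (1.0 / w**3 + 1.0), 12.0 * k * (1.0 / w**2 - 1.0)],
         [12.0 * k * (1.0 / w**2 - 1.0), 3.0 * k * k * (1.0 / w + 1.0)]]
    L = [[56.0 / w**3, -8.0 * k / w**3],
         [8.0 * k / w**2, -k * k / w**2]]
    U = [[56.0, 8.0 * k * w],
         [-8.0 * k, -k * k * w]]
    return spectral_norm_2x2(inverse_2x2(D)) * (
        spectral_norm_2x2(L) + spectral_norm_2x2(U))


print(round(gamma(1.0), 2))  # prints 1.24, matching (A5)
for omega in (0.5, 2.0):
    print(omega, gamma(omega))  # larger than the value at omega = 1
```

Evaluating `gamma` on a grid of ratios gives a quick impression of how strongly a non-uniform partitioning moves the system away from block-diagonal dominance.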

Note that the special choice of

κ

in (39) has been suggested in [18] to minimize

Γ_{i}

and thus optimize numerical stability. This intention becomes clear when observing that

{∥_{Q} {\hat{D}}_{i}^{- 1} (ω = 1)∥}_{2} = \sqrt{λ_{\max} (_{Q} {\hat{D}}_{i}^{- T} (ω = 1)_{Q} {\hat{D}}_{i}^{- 1} (ω = 1))} = \{\begin{matrix} \frac{1}{6 κ^{2}} > \frac{1}{128} & for κ^{2} < \frac{64}{3}, \\ \frac{1}{128} & for κ^{2} \geq \frac{64}{3} \end{matrix}

where

λ_{\max} (\dots)

denotes the maximum eigenvalue of a given matrix. Since both

{∥_{Q} {\hat{L}}_{i} (ω)∥}_{2}

and

{∥_{Q} {\hat{U}}_{i} (ω)∥}_{2}

increase with growing

|κ|

, the optimum (39) is chosen. This finding can be easily verified through numerical investigation. Note that numerical stability only depends on

|κ|

, thus one could also choose the negative form

κ = - \sqrt{64 / 3}

.

Appendix B. Spline Gradients

In the following, explicit expressions for the spline gradients used in (51), (53), and (54) are given. Note that the gradients differ depending on the type of the underlying spline (cubic/quintic).

Appendix B.1. Cubic Spline Gradients

From (7), (8), and (9) we obtain for each variable

_{C} ρ \in {_{C} λ,_{C} η,_{C} r}

and each segment

_{C} s_{i}

with

i = 0, \dots, n - 1

\begin{matrix} (\frac{\partial_{C} s_{i} (ξ_{i})}{\partial_{C} ρ}) & = (\frac{\partial_{C} a_{i}}{\partial_{C} ρ}) ξ_{i}^{3} + (\frac{\partial_{C} b_{i}}{\partial_{C} ρ}) ξ_{i}^{2} + (\frac{\partial_{C} C_{i}}{\partial_{C} ρ}) ξ_{i} + (\frac{\partial_{C} d_{i}}{\partial_{C} ρ}), \end{matrix}

(A6)

\begin{matrix} (\frac{\partial_{C} {\dot{s}}_{i} (ξ_{i})}{\partial_{C} ρ}) & = \frac{3}{h_{i}} (\frac{\partial_{C} a_{i}}{\partial_{C} ρ}) ξ_{i}^{2} + \frac{2}{h_{i}} (\frac{\partial_{C} b_{i}}{\partial_{C} ρ}) ξ_{i} + \frac{1}{h_{i}} (\frac{\partial_{C} C_{i}}{\partial_{C} ρ}), \end{matrix}

(A7)

\begin{matrix} (\frac{\partial_{C} {\ddot{s}}_{i} (ξ_{i})}{\partial_{C} ρ}) & = \frac{6}{h_{i}^{2}} (\frac{\partial_{C} a_{i}}{\partial_{C} ρ}) ξ_{i} + \frac{2}{h_{i}^{2}} (\frac{\partial_{C} b_{i}}{\partial_{C} ρ}) . \end{matrix}

(A8)

Moreover, by using the indices

u = 1, \dots, n - 1

,

v = 1, \dots, n - 1

, and

w = 1, \dots, 4

to specify the elements of

\begin{matrix} _{C} λ & = {[_{C} λ_{1}, \dots,_{C} λ_{u}, \dots,_{C} λ_{n - 1}]}^{T} & = {[y_{1}, \dots, y_{u}, \dots, y_{n - 1}]}^{T} \\ _{C} η & = {[_{C} η_{1}, \dots,_{C} η_{v}, \dots,_{C} η_{n - 1}]}^{T} & = {[{\ddot{y}}_{1}, \dots, {\ddot{y}}_{v}, \dots, {\ddot{y}}_{n - 1}]}^{T} \\ _{C} r & = {[_{C} r_{1}, \dots,_{C} r_{w}, \dots,_{C} r_{4}]}^{T} & = {[y_{0}, {\ddot{y}}_{0}, y_{n}, {\ddot{y}}_{n}]}^{T} \end{matrix}

(A9)

we obtain

(\frac{\partial_{C} a_{i}}{\partial_{C} λ}) = 0, (\frac{\partial_{C} b_{i}}{\partial_{C} λ}) = 0, (\frac{\partial_{C} C_{i}}{\partial_{C} λ_{u}}) = \{\begin{matrix} - 1 & for u = i, \\ 1 & for u = i + 1, \\ 0 & else \end{matrix} (\frac{\partial_{C} d_{i}}{\partial_{C} λ_{u}}) = \{\begin{matrix} 1 & for u = i, \\ 0 & else \end{matrix},

(A10)

\begin{matrix} (\frac{\partial_{C} a_{i}}{\partial_{C} η_{v}}) & = \{\begin{matrix} - \frac{h_{i}^{2}}{6} & for v = i, \\ \frac{h_{i}^{2}}{6} & for v = i + 1, \\ 0 & else \end{matrix}, & (\frac{\partial_{C} C_{i}}{\partial_{C} η_{v}}) & = \{\begin{matrix} - \frac{h_{i}^{2}}{3} & for v = i, \\ - \frac{h_{i}^{2}}{6} & for v = i + 1, \\ 0 & else \end{matrix} \end{matrix}

(A11)

\begin{matrix} (\frac{\partial_{C} b_{i}}{\partial_{C} η_{v}}) & = \{\begin{matrix} \frac{h_{i}^{2}}{2} & for v = i, \\ 0 & else \end{matrix}, & (\frac{\partial_{C} d_{i}}{\partial_{C} η}) & = 0, \end{matrix}

(A12)

\begin{matrix} (\frac{\partial_{C} a_{i}}{\partial_{C} r_{w}}) & = \{\begin{matrix} - \frac{h_{i}^{2}}{6} & for i = 0 \land w = 2, \\ \frac{h_{i}^{2}}{6} & for i = n - 1 \land w = 4, \\ 0 & else \end{matrix}, & (\frac{\partial_{C} C_{i}}{\partial_{C} r_{w}}) & = \{\begin{matrix} - 1 & for i = 0 \land w = 1, \\ - \frac{h_{i}^{2}}{3} & for i = 0 \land w = 2, \\ 1 & for i = n - 1 \land w = 3, \\ - \frac{h_{i}^{2}}{6} & for i = n - 1 \land w = 4, \\ 0 & else \end{matrix}, \end{matrix}

(A13)

\begin{matrix} (\frac{\partial_{C} b_{i}}{\partial_{C} r_{w}}) & = \{\begin{matrix} \frac{h_{i}^{2}}{2} & for i = 0 \land w = 2, \\ 0 & else \end{matrix}, & (\frac{\partial_{C} d_{i}}{\partial_{C} r_{w}}) & = \{\begin{matrix} 1 & for i = 0 \land w = 1, \\ 0 & else \end{matrix}, \end{matrix}

(A14)

which can easily be derived from (17) and the definition of

_{C} λ

,

_{C} η

, and

_{C} r

. Note that for

ξ_{i} = 0

the computation of

(\frac{\partial_{C} a_{i}}{\partial_{C} λ}), \quad (\frac{\partial_{C} a_{i}}{\partial_{C} η}), \quad \text{and} \quad (\frac{\partial_{C} a_{i}}{\partial_{C} r})

(A15)

can be skipped entirely since up to the second time derivative of the gradients of

s_{i} (ξ_{i} = 0)

these terms are multiplied by zero and thus have no effect. In order to solve (48), we further have to compute the corresponding right-hand sides, which are given by

(\frac{\partial_{C} B_{i}}{\partial_{C} λ_{u}}) = \{\begin{matrix} \frac{6}{h_{i - 1}} & for u = i - 1, \\ - \frac{6}{h_{i}} - \frac{6}{h_{i - 1}} & for u = i, \\ \frac{6}{h_{i}} & for u = i + 1, \\ 0 & else \end{matrix}, (\frac{\partial_{C} B_{i}}{\partial_{C} r_{w}}) = \{\begin{matrix} \frac{6}{h_{i - 1}} & for i = 1 \land w = 1, \\ - h_{i - 1} & for i = 1 \land w = 2, \\ \frac{6}{h_{i}} & for i = n - 1 \land w = 3, \\ - h_{i} & for i = n - 1 \land w = 4, \\ 0 & else \end{matrix}

(A16)

for

i = 1, \dots, n - 1

, where we used (23), (24), and the definition of

_{C} λ

and

_{C} r

.
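To make the role of the gradients (A6)–(A14) more tangible, the following sketch evaluates a single cubic segment. The coefficient formulas are an assumption: they are reconstructed from the partial derivatives listed in (A10)–(A12), since each coefficient is linear in the node values, and are not copied from (17). The function names are hypothetical.

```python
def cubic_coefficients(y0, y1, ydd0, ydd1, h):
    """Coefficients of s(xi) = a xi^3 + b xi^2 + c xi + d on one segment.

    y0, y1 are the knot values and ydd0, ydd1 the second derivatives at the
    left and right knot; h is the segment duration. Reconstructed from the
    gradients (A10)-(A12).
    """
    a = h * h / 6.0 * (ydd1 - ydd0)
    b = h * h / 2.0 * ydd0
    c = (y1 - y0) - h * h / 3.0 * ydd0 - h * h / 6.0 * ydd1
    d = y0
    return a, b, c, d


def evaluate_cubic(coeffs, xi, h):
    """Return (s, s_dot, s_ddot) at the normalized position xi in [0, 1]."""
    a, b, c, d = coeffs
    s = ((a * xi + b) * xi + c) * xi + d  # Horner scheme
    s_dot = ((3.0 * a * xi + 2.0 * b) * xi + c) / h
    s_ddot = (6.0 * a * xi + 2.0 * b) / (h * h)
    return s, s_dot, s_ddot


# Because the coefficients are linear in the node values, the gradient of the
# segment with respect to one node value is the segment for a unit input:
h = 0.4
grad_wrt_right_knot = evaluate_cubic(cubic_coefficients(0.0, 1.0, 0.0, 0.0, h), 0.25, h)
print(grad_wrt_right_knot)
```

At the normalized position zero only b, c, and d contribute, which is why the gradients of a listed in (A15) can be skipped there.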

Appendix B.2. Quintic Spline Gradients

From (10), (11), and (12) we obtain for each variable

_{Q} ρ \in {_{Q} λ,_{Q} η,_{Q} r}

and each segment

_{Q} s_{i}

with

i = 0, \dots, n - 1

\begin{matrix} (\frac{\partial_{Q} s_{i} (ξ_{i})}{\partial_{Q} ρ}) & = (\frac{\partial_{Q} a_{i}}{\partial_{Q} ρ}) ξ_{i}^{5} + (\frac{\partial_{Q} b_{i}}{\partial_{Q} ρ}) ξ_{i}^{4} + (\frac{\partial_{Q} C_{i}}{\partial_{Q} ρ}) ξ_{i}^{3} + (\frac{\partial_{Q} d_{i}}{\partial_{Q} ρ}) ξ_{i}^{2} + (\frac{\partial_{Q} e_{i}}{\partial_{Q} ρ}) ξ_{i} + (\frac{\partial_{Q} f_{i}}{\partial_{Q} ρ}), \end{matrix}

(A17)

\begin{matrix} (\frac{\partial_{Q} {\dot{s}}_{i} (ξ_{i})}{\partial_{Q} ρ}) & = \frac{5}{h_{i}} (\frac{\partial_{Q} a_{i}}{\partial_{Q} ρ}) ξ_{i}^{4} + \frac{4}{h_{i}} (\frac{\partial_{Q} b_{i}}{\partial_{Q} ρ}) ξ_{i}^{3} + \frac{3}{h_{i}} (\frac{\partial_{Q} C_{i}}{\partial_{Q} ρ}) ξ_{i}^{2} + \frac{2}{h_{i}} (\frac{\partial_{Q} d_{i}}{\partial_{Q} ρ}) ξ_{i} + \frac{1}{h_{i}} (\frac{\partial_{Q} e_{i}}{\partial_{Q} ρ}), \end{matrix}

(A18)

\begin{matrix} (\frac{\partial_{Q} {\ddot{s}}_{i} (ξ_{i})}{\partial_{Q} ρ}) & = \frac{20}{h_{i}^{2}} (\frac{\partial_{Q} a_{i}}{\partial_{Q} ρ}) ξ_{i}^{3} + \frac{12}{h_{i}^{2}} (\frac{\partial_{Q} b_{i}}{\partial_{Q} ρ}) ξ_{i}^{2} + \frac{6}{h_{i}^{2}} (\frac{\partial_{Q} C_{i}}{\partial_{Q} ρ}) ξ_{i} + \frac{2}{h_{i}^{2}} (\frac{\partial_{Q} d_{i}}{\partial_{Q} ρ}) . \end{matrix}

(A19)

Moreover, by using the indices

u = 1, \dots, n - 1

,

v = 1, \dots, 2 (n - 1)

, and

w = 1, \dots, 6

to specify the elements of

\begin{matrix} _{Q} λ & = {[_{Q} λ_{1}, \dots,_{Q} λ_{u}, \dots,_{Q} λ_{n - 1}]}^{T} & = {[y_{1}, \dots, y_{u}, \dots, y_{n - 1}]}^{T} \\ _{Q} η & = {[_{Q} η_{1}, \dots,_{Q} η_{v}, \dots,_{Q} η_{2 (n - 1)}]}^{T} & = {[{\dot{y}}_{1}, {\ddot{y}}_{1}, \dots, {\dot{y}}_{(v + 1) / 2}, {\ddot{y}}_{v / 2}, \dots, {\dot{y}}_{n - 1}, {\ddot{y}}_{n - 1}]}^{T} \\ _{Q} r & = {[_{Q} r_{1}, \dots,_{Q} r_{w}, \dots,_{Q} r_{6}]}^{T} & = {[y_{0}, {\dot{y}}_{0}, {\ddot{y}}_{0}, y_{n}, {\dot{y}}_{n}, {\ddot{y}}_{n}]}^{T} \end{matrix}

(A20)

we obtain

\begin{matrix} (\frac{\partial_{Q} a_{i}}{\partial_{Q} λ_{u}}) & = \{\begin{matrix} - 6 & for u = i, \\ 6 & for u = i + 1, \\ 0 & else \end{matrix}, & (\frac{\partial_{Q} d_{i}}{\partial_{Q} λ}) & = 0, \end{matrix}

(A21)

\begin{matrix} (\frac{\partial_{Q} b_{i}}{\partial_{Q} λ_{u}}) & = \{\begin{matrix} 15 & for u = i, \\ - 15 & for u = i + 1, \\ 0 & else \end{matrix}, & (\frac{\partial_{Q} e_{i}}{\partial_{Q} λ}) & = 0, \end{matrix}

(A22)

\begin{matrix} (\frac{\partial_{Q} C_{i}}{\partial_{Q} λ_{u}}) & = \{\begin{matrix} - 10 & for u = i, \\ 10 & for u = i + 1, \\ 0 & else \end{matrix}, & (\frac{\partial_{Q} f_{i}}{\partial_{Q} λ_{u}}) & = \{\begin{matrix} 1 & for u = i, \\ 0 & else \end{matrix}, \end{matrix}

(A23)

\begin{matrix} (\frac{\partial_{Q} a_{i}}{\partial_{Q} η_{v}}) & = \{\begin{matrix} - 3 h_{i} & for v = 2 i - 1, \\ - \frac{h_{i}^{2}}{2} & for v = 2 i, \\ - 3 h_{i} & for v = 2 i + 1, \\ \frac{h_{i}^{2}}{2} & for v = 2 i + 2, \\ 0 & else \end{matrix}, & (\frac{\partial_{Q} d_{i}}{\partial_{Q} η_{v}}) & = \{\begin{matrix} \frac{h_{i}^{2}}{2} & for v = 2 i, \\ 0 & else \end{matrix}, \end{matrix}

(A24)

\begin{matrix} (\frac{\partial_{Q} b_{i}}{\partial_{Q} η_{v}}) & = \{\begin{matrix} 8 h_{i} & for v = 2 i - 1, \\ \frac{3 h_{i}^{2}}{2} & for v = 2 i, \\ 7 h_{i} & for v = 2 i + 1, \\ - h_{i}^{2} & for v = 2 i + 2, \\ 0 & else \end{matrix}, & (\frac{\partial_{Q} e_{i}}{\partial_{Q} η_{v}}) & = \{\begin{matrix} h_{i} & for v = 2 i - 1, \\ 0 & else \end{matrix}, \end{matrix}

(A25)

\begin{matrix} (\frac{\partial_{Q} C_{i}}{\partial_{Q} η_{v}}) & = \{\begin{matrix} - 6 h_{i} & for v = 2 i - 1, \\ - \frac{3 h_{i}^{2}}{2} & for v = 2 i, \\ - 4 h_{i} & for v = 2 i + 1, \\ \frac{h_{i}^{2}}{2} & for v = 2 i + 2, \\ 0 & else \end{matrix}, & (\frac{\partial_{Q} f_{i}}{\partial_{Q} η}) & = 0, \end{matrix}

(A26)

\begin{matrix} (\frac{\partial_{Q} a_{i}}{\partial_{Q} r_{w}}) & = \{\begin{matrix} - 6 & for i = 0 \land w = 1, \\ - 3 h_{i} & for i = 0 \land w = 2, \\ - \frac{h_{i}^{2}}{2} & for i = 0 \land w = 3, \\ 6 & for i = n - 1 \land w = 4, \\ - 3 h_{i} & for i = n - 1 \land w = 5, \\ \frac{h_{i}^{2}}{2} & for i = n - 1 \land w = 6, \\ 0 & else \end{matrix}, & (\frac{\partial_{Q} d_{i}}{\partial_{Q} r_{w}}) & = \{\begin{matrix} \frac{h_{i}^{2}}{2} & for i = 0 \land w = 3, \\ 0 & else \end{matrix}, \end{matrix}

(A27)

\begin{matrix} (\frac{\partial_{Q} b_{i}}{\partial_{Q} r_{w}}) & = \{\begin{matrix} 15 & for i = 0 \land w = 1, \\ 8 h_{i} & for i = 0 \land w = 2, \\ \frac{3 h_{i}^{2}}{2} & for i = 0 \land w = 3, \\ - 15 & for i = n - 1 \land w = 4, \\ 7 h_{i} & for i = n - 1 \land w = 5, \\ - h_{i}^{2} & for i = n - 1 \land w = 6, \\ 0 & else \end{matrix}, & (\frac{\partial_{Q} e_{i}}{\partial_{Q} r_{w}}) & = \{\begin{matrix} h_{i} & for i = 0 \land w = 2, \\ 0 & else \end{matrix}, \end{matrix}

(A28)

\begin{matrix} (\frac{\partial_{Q} C_{i}}{\partial_{Q} r_{w}}) & = \{\begin{matrix} - 10 & for i = 0 \land w = 1, \\ - 6 h_{i} & for i = 0 \land w = 2, \\ - \frac{3 h_{i}^{2}}{2} & for i = 0 \land w = 3, \\ 10 & for i = n - 1 \land w = 4, \\ - 4 h_{i} & for i = n - 1 \land w = 5, \\ \frac{h_{i}^{2}}{2} & for i = n - 1 \land w = 6, \\ 0 & else \end{matrix}, & (\frac{\partial_{Q} f_{i}}{\partial_{Q} r_{w}}) & = \{\begin{matrix} 1 & for i = 0 \land w = 1, \\ 0 & else \end{matrix}, \end{matrix}

(A29)

which can easily be derived from (30) and the definition of

_{Q} λ

,

_{Q} η

, and

_{Q} r

. Note that for

ξ_{i} = 0

the computation of

(\frac{\partial_{Q} a_{i}}{\partial_{Q} λ}), (\frac{\partial_{Q} a_{i}}{\partial_{Q} η}), (\frac{\partial_{Q} a_{i}}{\partial_{Q} r}), (\frac{\partial_{Q} b_{i}}{\partial_{Q} λ}), (\frac{\partial_{Q} b_{i}}{\partial_{Q} η}), (\frac{\partial_{Q} b_{i}}{\partial_{Q} r}), (\frac{\partial_{Q} C_{i}}{\partial_{Q} λ}), (\frac{\partial_{Q} C_{i}}{\partial_{Q} η}), (\frac{\partial_{Q} C_{i}}{\partial_{Q} r}),

(A30)

can be skipped entirely since up to the second time derivative of the gradients of

s_{i} (ξ_{i} = 0)

these terms are multiplied by zero and thus have no effect. In order to solve (48), we further have to compute the corresponding right-hand sides, which are given by

\begin{matrix} (\frac{\partial_{Q} B_{i}}{\partial_{Q} λ_{u}}) & = \{\begin{matrix} [\begin{matrix} 120 g_{i - 1}^{4} \\ 20 κ g_{i - 1}^{3} g_{i} \end{matrix}] & for u = i - 1, & [\begin{matrix} - 120 g_{i}^{4} \\ 20 κ g_{i}^{4} \end{matrix}] & for u = i + 1, \\ [\begin{matrix} 120 (g_{i}^{4} - g_{i - 1}^{4}) \\ - 20 κ g_{i} (g_{i}^{3} + g_{i - 1}^{3}) \end{matrix}] & for u = i, & 0 & else \end{matrix}, \end{matrix}

(A31)

\begin{matrix} (\frac{\partial_{Q} B_{i}}{\partial_{Q} r_{w}}) & = \{\begin{matrix} [\begin{matrix} 120 g_{i - 1}^{4} \\ 20 κ g_{i - 1}^{3} g_{i} \end{matrix}] & for i = 1 \land w = 1, & [\begin{matrix} - 120 g_{i}^{4} \\ 20 κ g_{i}^{4} \end{matrix}] & for i = n - 1 \land w = 4, \\ [\begin{matrix} 56 g_{i - 1}^{3} \\ 8 κ g_{i - 1}^{2} g_{i} \end{matrix}] & for i = 1 \land w = 2, & [\begin{matrix} 56 g_{i}^{3} \\ - 8 κ g_{i}^{3} \end{matrix}] & for i = n - 1 \land w = 5, \\ [\begin{matrix} 8 g_{i - 1}^{2} \\ κ g_{i - 1} g_{i} \end{matrix}] & for i = 1 \land w = 3, & [\begin{matrix} - 8 g_{i}^{2} \\ κ g_{i}^{2} \end{matrix}] & for i = n - 1 \land w = 6, \\ 0 & else \end{matrix} \end{matrix}

(A32)

for

i = 1, \dots, n - 1

, where we used (37), (38), and the definition of

_{Q} λ

and

_{Q} r

.
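Analogously to the cubic case, a single quintic segment can be sketched as follows. The coefficient formulas are again an assumption, reconstructed from the partial derivatives in (A21)–(A26) rather than taken from (30), and the function names are hypothetical.

```python
def quintic_coefficients(y0, yd0, ydd0, y1, yd1, ydd1, h):
    """Coefficients of s(xi) = a xi^5 + b xi^4 + c xi^3 + d xi^2 + e xi + f.

    (y0, yd0, ydd0) and (y1, yd1, ydd1) are value, first, and second time
    derivative at the left and right knot; h is the segment duration.
    Reconstructed from the gradients (A21)-(A26).
    """
    dy = y1 - y0
    a = 6.0 * dy - 3.0 * h * (yd0 + yd1) + 0.5 * h * h * (ydd1 - ydd0)
    b = -15.0 * dy + h * (8.0 * yd0 + 7.0 * yd1) + h * h * (1.5 * ydd0 - ydd1)
    c = 10.0 * dy - h * (6.0 * yd0 + 4.0 * yd1) + h * h * (0.5 * ydd1 - 1.5 * ydd0)
    d = 0.5 * h * h * ydd0
    e = h * yd0
    f = y0
    return a, b, c, d, e, f


def evaluate_quintic(coeffs, xi, h):
    """Return (s, s_dot, s_ddot) at the normalized position xi in [0, 1]."""
    a, b, c, d, e, f = coeffs
    s = ((((a * xi + b) * xi + c) * xi + d) * xi + e) * xi + f
    s_dot = ((((5.0 * a * xi + 4.0 * b) * xi + 3.0 * c) * xi + 2.0 * d) * xi + e) / h
    s_ddot = (((20.0 * a * xi + 12.0 * b) * xi + 6.0 * c) * xi + 2.0 * d) / (h * h)
    return s, s_dot, s_ddot


# Toy usage: the segment reproduces the prescribed data at both knots.
h = 0.5
coeffs = quintic_coefficients(1.0, -0.2, 0.3, 2.0, 0.7, -1.1, h)
print(evaluate_quintic(coeffs, 0.0, h))
print(evaluate_quintic(coeffs, 1.0, h))
```

At the normalized position zero only d, e, and f contribute, which is why the gradients of a, b, and c listed in (A30) can be skipped there.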

Appendix C. Software Design in BROCCOLI

The basic structure of the source code related to the classes CubicSplineCollocator and QuinticSplineCollocator is illustrated in Figure A1 (left). Both classes are derived from the abstract base class SplineCollocatorBase, which declares the interface for data input/output and the main processing operators. In order to trigger Algorithm 2 for a given input dataset, a simple call of process() is sufficient. For more fine-grained control, process() is split up into publicly available subroutines (names starting with “substep_”). Note that providing public access to the subroutines not only allows detailed runtime measurements by the user, but also enables an efficient parallel solution for decoupled, multi-dimensional BVPs, see Figure A1 (right). For this, we exploit that large parts of Algorithm 2 depend only on the spline segmentation, i.e., the collocation sites (green box). In contrast, the remaining subroutines require explicit information about the investigated BVP (blue box). If all dimensions of the BVP, e.g., the x- and y-component of LOLA’s CoM motion, share the same spline segmentation, the subroutines covered by the green box in Figure A1 have to be called only once. Subsequently, the used instance of CubicSplineCollocator or QuinticSplineCollocator (holding intermediate results) is copied such that the remaining subroutines, covered by the blue box in Figure A1, can be run in parallel. Note that this includes solving

A_{coll}^{- 1} B_{coll}

(substep_solveCollocationLSE()), which is the bottleneck for large-scale systems. We plan to use this feature in our target application to efficiently generate the CoM motion of LOLA (the x- and y-component represent a decoupled, two-dimensional BVP).
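The "prepare once, copy, solve per dimension" strategy described above can be sketched in a language-neutral way as follows. The class `ToyCollocator` and its methods are purely hypothetical placeholders and do not reproduce the BROCCOLI interface. The per-dimension step is a trivial interpolation that only stands in for the BVP-specific subroutines. Note also that Python threads illustrate the structure only and do not yield the speed-up of a native multi-threaded implementation.

```python
import copy
from concurrent.futures import ThreadPoolExecutor


class ToyCollocator:
    """Hypothetical stand-in; NOT the BROCCOLI interface."""

    def __init__(self, knots):
        self.knots = list(knots)
        self.shared = None    # result of the segmentation-only steps
        self.solution = None  # result of the BVP-specific steps

    def prepare_segmentation(self):
        # Placeholder for all steps that depend only on the knots ("green box").
        self.shared = [b - a for a, b in zip(self.knots, self.knots[1:])]

    def solve_dimension(self, boundary_values):
        # Placeholder for the BVP-specific steps ("blue box").
        start, end = boundary_values
        total = sum(self.shared)
        acc, out = 0.0, [start]
        for h in self.shared:
            acc += h
            out.append(start + (end - start) * acc / total)
        self.solution = out
        return out


def solve_decoupled(knots, boundary_values_per_dimension):
    """Run the shared steps once, then one task per dimension on a copy."""
    base = ToyCollocator(knots)
    base.prepare_segmentation()
    copies = [copy.deepcopy(base) for _ in boundary_values_per_dimension]
    with ThreadPoolExecutor() as pool:
        return list(pool.map(lambda pair: pair[0].solve_dimension(pair[1]),
                             zip(copies, boundary_values_per_dimension)))


# Toy usage: two decoupled dimensions sharing the same segmentation.
print(solve_decoupled([0.0, 1.0, 3.0], [(0.0, 3.0), (1.0, 1.0)]))
```

Copying the prepared instance keeps the intermediate results of the shared steps while giving every task its own state, so no synchronization between the dimensions is required.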

Figure A1. Design of the classes CubicSplineCollocator and QuinticSplineCollocator in BROCCOLI. Left: class inheritance and segmentation of process() into subroutines. Right: proposed strategy for efficient parallelization in the case of a decoupled, multi-dimensional BVP. Note that the first and last subroutine (grayed out) are optional and perform a validity check of the given input parameters and convert the final result into a corresponding broccoli::curve::Trajectory data structure for convenient evaluation of the generated polynomial spline, respectively.

References

Lee, J.F.; Lee, R.; Cangellaris, A. Time-Domain Finite-Element Methods. IEEE Trans. Antennas Propag. 1997, 45, 430–442.
Ahlberg, J.H.; Nilson, E.N.; Walsh, J.L. The Theory of Splines and Their Applications; Mathematics in Science and Engineering; Academic Press: New York, NY, USA, 1967.
De Boor, C.; Swartz, B. Collocation at Gaussian Points. SIAM J. Numer. Anal. 1973, 10, 582–606.
Ahlberg, J.; Ito, T. A Collocation Method for Two-Point Boundary Value Problems. Math. Comput. 1975, 29, 761–776.
Christara, C.C.; Ng, K.S. Optimal Quadratic and Cubic Spline Collocation on Nonuniform Partitions. Computing 2005, 76, 227–257.
Bialecki, B.; Fairweather, G. Orthogonal spline collocation methods for partial differential equations. J. Comput. Appl. Math. 2001, 128, 55–82.
Houstis, E.N.; Christara, C.C.; Rice, J.R. Quadratic-Spline Collocation Methods for Two-Point Boundary Value Problems. Int. J. Numer. Methods Eng. 1988, 26, 935–952.
Irodotou-Ellina, M.; Houstis, E.N. An O(h⁶) Quintic Spline Collocation Method for Fourth Order Two-Point Boundary Value Problems. BIT Numer. Math. 1988, 28, 288–301.
Ascher, U. Solving Boundary-Value Problems with a Spline-Collocation Code. J. Comput. Phys. 1980, 34, 401–413.
Zhang, H.; Han, X.; Yang, X. Quintic B-spline collocation method for fourth order partial integro-differential equations with a weakly singular kernel. Appl. Math. Comput. 2013, 219, 6565–6575.
Akram, G.; Tariq, H. Quintic spline collocation method for fractional boundary value problems. J. Assoc. Arab Univ. Basic Appl. Sci. 2017, 23, 57–65.
Ascher, U.; Spiteri, R. Collocation Software for Boundary Value Differential-Algebraic Equations. SIAM J. Sci. Comput. 1995, 15.
Seiwald, P.; Sygulla, F.; Staufenberg, N.S.; Rixen, D. Quintic Spline Collocation for Real-Time Biped Walking-Pattern Generation with variable Torso Height. In Proceedings of the IEEE-RAS 19th International Conference on Humanoid Robots (Humanoids), Toronto, ON, Canada, 15–17 October 2019; pp. 56–63.
Seiwald, P. Video: Smooth Real-Time Walking-Pattern Generation for Humanoid Robot LOLA. Available online: https://youtu.be/piQm_oTYXIc (accessed on 22 June 2020).
Wright, S.J. Stable Parallel Algorithms for Two-Point Boundary Value Problems. SIAM J. Sci. Statistical Comput. 1992, 13, 742–764.
Magoulés, F.; Roux, F.X.; Houzeaux, G. Parallel Scientific Computing; Wiley & Sons: New York, NY, USA, 2015.
Usmani, R.A. Smooth spline approximations for the solution of a boundary value problem with engineering applications. J. Comput. Appl. Math. 1980, 6, 93–98.
Mund, E.H.; Hallet, P.; Hennart, J.P. An algorithm for the interpolation of functions using quintic splines. J. Comput. Appl. Math. 1975, 1, 279–288.
Buschmann, T.; Lohmeier, S.; Bachmayer, M.; Ulbrich, H.; Pfeiffer, F. A Collocation Method for Real-Time Walking Pattern Generation. In Proceedings of the IEEE-RAS 7th International Conference on Humanoid Robots, Pittsburgh, PA, USA, 29 November–1 December 2007; pp. 1–6.
Ascher, U.; Pruess, S.; Russell, R.D. On Spline Basis Selection for Solving Differential Equations. SIAM J. Numer. Anal. 1983, 20, 121–142.
De Boor, C. A Practical Guide to Splines; Springer: New York, NY, USA, 2001; ISBN 978-0387953663.
De Boor, C.; Weiss, R. SOLVEBLOK: A Package for Solving Almost Block Diagonal Linear Systems. ACM Trans. Math. Software 1980, 6, 80–87.
Majaess, F.; Keast, P.; Fairweather, G. The Packages for solving almost block diagonal linear systems arising in spline collocation at Gaussian points with monomial basis functions. In Scientific Software Systems; Springer: Dordt, The Netherlands, 1988; pp. 47–58.
De Boor, C. On calculating with B-splines. J. Approximation Theor. 1972, 6, 50–62.
Böhm, W. Efficient Evaluation of Splines. Computing 1984, 33, 171–177.
Lee, E.T.Y. A Simplified B-Spline Computation Routine. Computing 1982, 29, 365–371.
Lee, E.T.Y. Comments on Some B-Spline Algorithms. Computing 1986, 36, 229–238.
Horner, W.G.; Gilbert, D. A New Method of Solving Numerical Equations of All Orders, by Continuous Approximation; Philosophical Transactions of the Royal Society of London: London, UK, 1819; pp. 308–335.
Higham, N.J. Accuracy and Stability of Numerical Algorithms, 2nd ed.; Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 2002.
Quarteroni, A.; Sacco, R.; Saler, F. Numerical Mathematics, 2nd ed.; Springer: Berlin, Germany, 2007. [Google Scholar]
Isaacson, E.; Keller, H.B. Analysis of Numerical Methods; Dover Publications, Inc.: New York, NY, USA, 1966. [Google Scholar]
Meurant, G. A Review on the Inverse of Symmetric Tridiagonal and Block Tridiagonal Matrices. SIAM J. Matrix Anal. Appl. 1992, 13, 707–728. [Google Scholar] [CrossRef] [Green Version]
Buschmann, T. Simulation and Control of Biped Walking Robots. Ph.D. Thesis, Technical University of Munich, Munich, Germany, November 2010. Available online: https://mediatum.ub.tum.de/997204 (accessed on 22 June 2020).
Seiwald, P.; Sygulla, F. Broccoli: Beautiful Robot C++ Code Library. Available online: https://gitlab.lrz.de/AM/broccoli (accessed on 22 June 2020).
Guennebaud, G.; Jacob, B. Eigen. Available online: http://eigen.tuxfamily.org (accessed on 22 June 2020).
Christara, C.C.; Ng, K.S. Adaptive Techniques for Spline Collocation. Computing 2006, 76, 259–277.
Varah, J.M. On the Solution of Block-Tridiagonal Systems Arising from Certain Finite-Difference Equations. Math. Comput. 1972, 26, 859–868.

Figure 1. Humanoid robot LOLA developed at the Lehrstuhl für Angewandte Mechanik, Technical University of Munich (TUM). The proposed algorithm is used within the walking pattern generation framework of LOLA, see [13] for details. The robot is 1.8 m tall and weighs about 60 kg. Left: photo and kinematic configuration of the system with 24 actuated degrees of freedom. Right: simplified model of multi-body dynamics with torso mass $m_t$, foot mass $m_f$, and torso mass moment of inertia $\Theta_t$, which (together with the ground reaction forces/torques) contribute to the ordinary differential equation (ODE) describing the center of mass (CoM) dynamics (blue).

Figure 2. Visual interpretation of the over-determined boundary value problem (BVP): the start and end pose (left/right) represent the boundary conditions, while the intermediate motion (transparent) tries to approximate the inherent system dynamics of the simplified model.

Figure 3. Segmentation and parametrization of the investigated spline $y(t)$. The spline consists of $n$ interconnected segments, which share the interior knots with their neighbors. Each segment is described through the local interpolation parameter $\xi_i \in [0, 1]$ (blue).

Figure 4. The computed spline $y(t)$ (black) approximating the real solution $F(t)$ (green). The approximation satisfies the underlying ODE at the specified collocation sites $\{t_k\}$ (blue) and fulfills the boundary conditions (BC) at $t_0$ and $t_n$ (orange), but $y(t)$ and $F(t)$ do not necessarily coincide at the collocation sites.

Figure 5. Left: mass-spring-damper system used for validation. Right: analytical solution for the underdamped case ($\alpha = 1$, $\beta = 1$, $\gamma = 10$), the overdamped case ($\alpha = 1$, $\beta = 10$, $\gamma = 10$), and the critically damped case ($\alpha = 1$, $\beta = 10$, $\gamma = 25$). The solution is plotted for the initial conditions $F_0 = 1$, $\dot{F}_0 = 0$.

Figure 6. Consistent BCs: convergence of the approximation $y(t)$ (blue and green) towards the analytic solution $F(t)$ (black, dashed) for $\nu = 1, \dots, 9, 30, 50, 70, 100$. The top row belongs to the underdamped case, while the bottom row represents the overdamped case. From left to right: approximation with cubic spline without virtual control points (left), cubic spline with virtual control points (center), and quintic spline (right). The corresponding best approximation $\nu_{\max}$ is drawn in bold blue.

Figure 7. Root mean square (RMS) of error $e(t)$ and residual $r(t)$ as defined in (65) and (66). The left subscript distinguishes between cubic (C) and quintic (Q) spline collocation. Moreover, for cubic spline collocation, we identify the variants without virtual control points (i.e., free first-order boundaries $\dot{y}_0$, $\dot{y}_n$) and with virtual control points by the right subscripts $\mathrm{free}$ and $\mathrm{virt}$, respectively. The left plot belongs to the case of consistent BCs, while the right one was obtained using inconsistent BCs. Note that for inconsistent BCs an analytic solution does not exist; thus, the error $e(t)$ is not defined and we consider only the residual $r(t)$.

Figure 8. Residual $r(t)$ as defined in (66) for consistent (left) and inconsistent (right) BCs in the underdamped case. For clarity of presentation, the count of collocation sites is chosen as $\nu = 4$ such that the collocation sites (dots) are given by $\{t_k\} = \{1, 2, 3, 4\}$. The left subscript C denotes cubic spline collocation and Q its quintic counterpart. Moreover, the right subscripts $\mathrm{free}$ and $\mathrm{virt}$ are used to differentiate between the variants without and with virtual control points, respectively. The virtual control points are highlighted with circles.

Figure 9. Inconsistent BCs: approximation $y(t)$ (blue, green, and orange) and reference system $F(t)$ (black, dashed) for $\nu = 1, \dots, 4, \nu_{opt}, 13, 15, 17, 20$. The top row belongs to the underdamped case, while the bottom row represents the overdamped case. From left to right: approximation with cubic spline (no virtual control points, left), cubic spline (with virtual control points, center), and quintic spline (right). The corresponding best approximation $\nu_{opt}$ is drawn in bold blue. Diverging approximations for $\nu > \nu_{opt}$ are colored orange.

Figure 10. Left: (minimum) runtime $T$ and condition $C$ of $A_{coll}$ for running Algorithm 2 over the count of collocation sites $\nu$. Right: root mean square (RMS) of residual $r(t)$ as defined in (66) over (minimum) runtime $T$. The left subscripts C and Q belong to the cubic and quintic spline versions of the algorithm, respectively. Moreover, the right subscript indicates whether cubic spline collocation was performed without ($\mathrm{free}$) or with ($\mathrm{virt}$) virtual control points. All measurements were performed using the underdamped parametrization and consistent BCs.

Figure 11. Runtime (percentile) of relevant steps of Algorithm 2 relative to total runtime for quintic spline collocation. Code sections with negligible execution time are not plotted and are not counted towards the total runtime. The measurements were obtained by using an extended time horizon $t_n = 50$ and the parametrization $\alpha = 1$, $\beta = 0.1$, $\gamma = 10$ (underdamped) to allow a better representation of high counts of collocation sites of up to $\nu = 300$.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
