1. Introduction
The generalized Schur algorithm (GSA) allows computing well-known matrix decompositions, such as the QR and Cholesky factorizations [1]. In particular, if the involved matrix is structured, i.e., Toeplitz, block-Toeplitz or Sylvester, the GSA computes the R factor of the QR factorization with a complexity one order of magnitude lower than that of the classical QR algorithm [2], since it relies only on the knowledge of the so-called generators [2] associated with the given matrix, rather than on the knowledge of the matrix itself. The stability properties of the GSA are described in [3,4,5], where it is proven that the algorithm is weakly stable provided the involved hyperbolic rotations are performed in a stable way.
In this manuscript, we first show that, besides its efficiency, the GSA provides new theoretical insight into bounds on the entries of the R factor of the QR factorization of some structured matrices. In particular, if the involved matrix is a symmetric positive definite (SPD) Toeplitz or a Sylvester matrix, we prove that all or some of the diagonal entries of R monotonically decrease in absolute value.
We then propose a faster implementation of the algorithm described in [6] for computing the rank of a Sylvester matrix whose entries are the coefficients of two polynomials of degree m and n, respectively. This new algorithm is based on the GSA for computing the R factor of the QR factorization of the Sylvester matrix. The proposed modification of the GSA-based method has a reduced computational cost in floating point operations, where r denotes the computed numerical rank.
It is well known that the upper triangular factor R of the QR factorization of a matrix A is equal, up to a diagonal sign matrix D, to the upper triangular Cholesky factor of A^T A. In this manuscript, we assume, without loss of generality, that the diagonal entries of both factors are positive; since the two matrices are then equal, we denote both by R.
Finally, we propose a GSA-based approach for computing a null-space basis of a polynomial matrix, which is an important problem in several systems and control applications [7,8]. For instance, the computation of the null-space of a polynomial matrix arises when solving the column reduction problem of a polynomial matrix [9,10].
The manuscript is structured as follows. The main features of the GSA are provided in Section 2. In Section 3, a GSA implementation for computing the Cholesky factor R of a SPD Toeplitz matrix is described, which allows proving that the diagonal entries of R monotonically decrease. In Section 4, a GSA-based algorithm for computing the rank of a Sylvester matrix S is introduced, based on the computation of the Cholesky factor R of S^T S. In this case, it is additionally proven that the first diagonal entries of R monotonically decrease. The GSA-based method to compute the null-space of polynomial matrices is proposed in Section 5. Numerical examples are reported in Section 6, followed by the conclusions in Section 7.
2. The Generalized Schur Algorithm
Many of the classical factorizations of a symmetric matrix can be obtained by the GSA. If the matrix is Toeplitz-like, the GSA computes these factorizations in a fast way. For the sake of completeness, the basic concepts of the GSA for computing the R factor of the QR factorization of structured matrices, such as Toeplitz and block-Toeplitz matrices, are introduced in this Section. A comprehensive treatment of the topic can be found in [1,2].
Let A ∈ R^{n×n} be a symmetric positive definite (SPD) matrix. The semidefinite case is considered in Section 4 and Section 5. The displacement of A with respect to a matrix Z of order n is defined as
∇_Z A = A − Z A Z^T, (1)
while the displacement rank k of A with respect to Z is defined as the rank of ∇_Z A. If the displacement rank of A is k, Equation (1) can be written as the sum of k rank-one matrices,
∇_Z A = Σ_{i=1}^{p} u_i u_i^T − Σ_{j=1}^{q} v_j v_j^T, (2)
where (p, q), with p + q = k, is the inertia of ∇_Z A, and the vectors u_i and v_j are called the positive and the negative generators of A with respect to Z, respectively, or, if there is no ambiguity, simply the positive and negative generators of A. The matrix G = [u_1, …, u_p, v_1, …, v_q]^T is called the generator matrix.
The matrix Z is a nilpotent matrix. In particular, for Toeplitz and block-Toeplitz matrices, Z can be chosen as the shift matrix (with ones on the first subdiagonal and zeros elsewhere) and the block shift matrix, respectively.
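As a concrete illustration, the displacement in Equation (1) can be checked numerically. The following is a minimal NumPy sketch (the matrix entries are made up for the example and are not taken from the original references): for an SPD Toeplitz matrix, the displacement A − Z A Z^T has rank 2.

```python
import numpy as np

def shift_matrix(n):
    """Shift matrix Z: ones on the first subdiagonal, zeros elsewhere."""
    return np.eye(n, k=-1)

# SPD Toeplitz matrix built from an arbitrary first column t
t = np.array([4.0, 1.0, 0.5, 0.25, 0.125])
n = len(t)
A = np.array([[t[abs(i - j)] for j in range(n)] for i in range(n)])

Z = shift_matrix(n)
D = A - Z @ A @ Z.T                # displacement of A with respect to Z
print(np.linalg.matrix_rank(D))   # 2: the displacement rank of a Toeplitz matrix
```

Only the first row and column of D are nonzero here, which is why two generators suffice to represent A.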
The implementation of the GSA relies only on the knowledge of the generators of A rather than on the knowledge of the matrix itself [1]. Since Z^n = 0, adding all members of the left and right-hand sides of Equation (2), shifted by successive powers of Z, yields
A = Σ_{i=0}^{n−1} Z^i (∇_Z A) (Z^T)^i,
which expresses the matrix A in terms of its generators.
Exploiting Equation (2), we show how the GSA computes R by describing its first iteration. Observe that the rank-one products involved in the right-hand side of Equation (2) have their first row equal to zero, with the exception of the first one, u_1 u_1^T.
A key role in GSA is played by J-orthogonal matrices [11,12], i.e., matrices Θ satisfying Θ^T J Θ = J, with J = I_p ⊕ (−I_q). Any such matrix Θ can be constructed in different ways [11,12,13,14]. For instance, it can be considered as the product of Givens and hyperbolic rotations. In particular, a Givens rotation acting on rows i and j of the generator matrix is chosen if the two rows have the same signature, i.e., both correspond to positive or both to negative generators. Otherwise, a hyperbolic rotation is considered. Indeed, suitable choices of Θ allow efficient implementations of GSA, as shown in Section 4.
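The defining relation Θ^T J Θ = J is easy to verify numerically. In the following NumPy check (the parameter value is arbitrary), a 2 × 2 hyperbolic rotation H = (1/√(1 − ρ²)) [[1, −ρ], [−ρ, 1]] is J-orthogonal with respect to J = diag(1, −1):

```python
import numpy as np

rho = 0.6                              # |rho| < 1, chosen arbitrarily
c = 1.0 / np.sqrt(1.0 - rho**2)
H = c * np.array([[1.0, -rho],
                  [-rho, 1.0]])        # hyperbolic rotation
J = np.diag([1.0, -1.0])               # signature matrix

# J-orthogonality: H.T @ J @ H equals J
print(np.allclose(H.T @ J @ H, J))     # True
```

A Givens rotation satisfies the same relation with J = I, which is why rotations of both kinds can be combined into one J-orthogonal transformation of the generator matrix.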
Let G be the generator matrix of A, let Θ_1 be a J-orthogonal matrix such that the first column of Θ_1 G is a multiple of e_1, with e_i the ith column of the identity matrix, and let g_1^T and G_2 be the first row and the last k − 1 rows of Θ_1 G, respectively. From Equation (4), it turns out that the first column of G_2 is zero. Let J_2 be the matrix obtained by deleting the first row and column from J. Then, Equation (2) can be rewritten in terms of g_1 and G_2, where the updated first generator is obtained from g_1^T by multiplying it with Z. If A is a Toeplitz matrix, this multiplication with Z corresponds to displacing the entries of g_1^T one position downward, while it corresponds to a block-displacement downward in the first generator if A is a block-Toeplitz matrix.
Thus, the first column of the updated generator matrix is zero and, hence, the first row of the rotated generator matrix is the first row of the R factor of the QR factorization of A. The above procedure is recursively applied to the updated generator matrix to compute the other rows of R.
The jth iteration of GSA, j = 1, …, n, involves the product of the J-orthogonal matrix with the generator matrix and the multiplication of the first generator by Z. The former can be computed in O(k(n − j)) operations [11,12], and the latter is done for free if Z is either a shift or a block-shift matrix. Therefore, if the displacement rank k of A is small compared to n, the GSA computes the R factor in O(kn²) rather than in O(n³) operations, as required by standard algorithms [15].
For the sake of completeness, the described GSA implementation is reported in the following matlab style function. (The function givens is the matlab function having as input two scalars x1 and x2 and as output an orthogonal 2 × 2 matrix Θ that annihilates the second entry of [x1; x2]. The function Hrotate computes the coefficients of the hyperbolic rotation H that, given two scalars x1 and x2 with |x1| > |x2|, annihilates the second entry of [x1; x2]. The function Happly applies H to two rows of the generator matrix. Both functions are defined in [12]).
function R = GSA(G, n, k1, k2)
for i = 1 : n,
for j = 2 : k1,
Θ = givens(G(1, i), G(j, i));
G([1, j], i : n) = Θ ∗ G([1, j], i : n);
end % for
for j = k1 + 2 : k1 + k2,
Θ = givens(G(k1 + 1, i), G(j, i));
G([k1 + 1, j], i : n) = Θ ∗ G([k1 + 1, j], i : n);
end % for
[c1, s1] = Hrotate(G(1, i), G(k1 + 1, i));
G([1, k1 + 1], i : n) = Happly(c1, s1, G([1, k1 + 1], i : n), n − i + 1);
R(i, i : n) = G(1, i : n);
G(1, i + 1 : n) = G(1, i : n − 1); G(1, i) = 0;
end % for
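The function above can be prototyped in NumPy as follows. This is a hedged sketch: it uses plain textbook formulas for the Givens and hyperbolic rotations rather than the numerically stable variants of the cited references, and the test matrix is made up. For an SPD Toeplitz matrix with first column t, the generators are t/√t₀ (positive) and (t − t₀e₁)/√t₀ (negative), and the computed R satisfies Rᵀ R = A.

```python
import numpy as np

def gsa(G, k1, n):
    """Generalized Schur algorithm: G is a (k1+k2) x n generator matrix,
    rows 0..k1-1 positive, the rest negative. Returns upper triangular R."""
    G = G.copy()
    R = np.zeros((n, n))
    for i in range(n):
        # Givens rotations: fold column i of each signature block into its pivot row
        for pivot, rows in ((0, range(1, k1)), (k1, range(k1 + 1, G.shape[0]))):
            for j in rows:
                a, b = G[pivot, i], G[j, i]
                r = np.hypot(a, b)
                if r > 0:
                    c, s = a / r, b / r
                    G[[pivot, j], i:] = np.array([[c, s], [-s, c]]) @ G[[pivot, j], i:]
        # hyperbolic rotation annihilating G[k1, i] against the pivot G[0, i]
        rho = G[k1, i] / G[0, i]
        h = 1.0 / np.sqrt(1.0 - rho**2)
        G[[0, k1], i:] = h * np.array([[1.0, -rho], [-rho, 1.0]]) @ G[[0, k1], i:]
        R[i, i:] = G[0, i:]                 # row i of the R factor
        G[0, i + 1:] = G[0, i:-1].copy()    # shift the positive generator right
        G[0, i] = 0.0
    return R

# SPD Toeplitz example (arbitrary entries) and its two generators
t = np.array([4.0, 1.0, 0.5, 0.25, 0.125])
n = len(t)
A = np.array([[t[abs(i - j)] for j in range(n)] for i in range(n)])
G = np.vstack([t, t - t[0] * np.eye(n)[0]]) / np.sqrt(t[0])
R = gsa(G, 1, n)
print(np.allclose(R.T @ R, A))  # True: R is the Cholesky factor of A
```

Note that only the generator matrix is touched inside the loop; the matrix A itself is needed only to build G.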
The GSA has been proven to be weakly stable [3,4], provided the hyperbolic transformations involved in the construction of the J-orthogonal matrices are performed in a stable way [3,11,12].
3. GSA for SPD Toeplitz Matrices
In this section, we describe the GSA for computing the R factor of the Cholesky factorization of a SPD Toeplitz matrix A, with R upper triangular, i.e., A = R^T R. Moreover, we show that the diagonal entries of R decrease monotonically.
Let A = (t_{|i−j|})_{i,j=1}^{n} be a SPD Toeplitz matrix, let Z be the shift matrix of order n, and let t = [t_0, t_1, …, t_{n−1}]^T denote the first column of A. Then,
A − Z A Z^T = e_1 t^T + t e_1^T − t_0 e_1 e_1^T,
i.e., the displacement of A is a symmetric rank-2 matrix. Moreover, the generator matrix G is given by
G = [t, t − t_0 e_1]^T / √t_0.
In this case, the GSA can be implemented in matlab-like style as follows.
function R = GSA_chol(G, n)
for i = 1 : n,
[c1, s1] = Hrotate(G(1, i), G(2, i));
G(:, i : n) = Happly(c1, s1, G(:, i : n), n − i + 1);
R(i, i : n) = G(1, i : n);
G(1, i + 1 : n) = G(1, i : n − 1); G(1, i) = 0;
end % for
The following lemma holds.
Lemma 1. Let A be a SPD Toeplitz matrix and let R be its Cholesky factor, with R upper triangular. Then, the diagonal entries of R monotonically decrease, i.e., r_{1,1} ≥ r_{2,2} ≥ ⋯ ≥ r_{n,n} > 0. Proof. At each step i of GSA_chol, first a hyperbolic rotation is applied to the two rows of the generator matrix in order to annihilate the element G(2, i). Hence, the first row of G becomes the row i of R. Finally, the updated first generator is obtained displacing the entries of the first row of G one position to the right, while the second generator is left unchanged. Taking into account that a hyperbolic rotation with parameter ρ_i, |ρ_i| < 1, scales the pivot entry by the factor √(1 − ρ_i²) ≤ 1, the diagonal entries of R satisfy r_{i+1,i+1} = r_{i,i} √(1 − ρ_{i+1}²) ≤ r_{i,i}. □
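The monotonic decrease stated in Lemma 1 is easy to observe numerically; the following NumPy check uses an arbitrary SPD Toeplitz matrix (any standard Cholesky routine exposes the same diagonal, so the GSA itself is not needed for the check):

```python
import numpy as np

# arbitrary diagonally dominant (hence SPD) Toeplitz matrix
t = np.array([5.0, 2.0, 1.0, 0.5, 0.25, 0.1])   # first column
n = len(t)
A = np.array([[t[abs(i - j)] for j in range(n)] for i in range(n)])

L = np.linalg.cholesky(A)   # A = L @ L.T, so R = L.T
d = np.diag(L)              # diagonal entries of R
print(np.all(np.diff(d) <= 1e-14))  # True: r_11 >= r_22 >= ... >= r_nn
```

For non-Toeplitz SPD matrices no such ordering is guaranteed, which is what makes the Toeplitz case special.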
4. Computing the Rank of Sylvester Matrices
In this section, we focus on the computation of the rank of Sylvester matrices. The numerical rank of a Sylvester matrix is a useful information for determining the degree of the greatest common divisor of the involved polynomials [
6,
16,
17].
A GSA-based algorithm for computing the rank of S has been recently proposed in [6]. It is based on the computation of the Cholesky factor R of S^T S, with R upper triangular, i.e., S^T S = R^T R. Here, we propose a more efficient variant of this algorithm that also allows proving that the first diagonal entries of R monotonically decrease.
Denote by p(x) and q(x) two univariate polynomials of degree m and n, respectively. Let S be the Sylvester matrix of order m + n associated with p and q, obtained by stacking the two blocks S_p and S_q, with S_p and S_q band Toeplitz matrices whose rows contain the shifted coefficients of p and q, respectively.
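For concreteness, the Sylvester matrix can be assembled and its rank inspected as follows (a NumPy sketch; the two example polynomials are made up and share the single common root x = 1, so rank(S) = m + n − deg gcd = 3):

```python
import numpy as np

def sylvester(p, q):
    """Sylvester matrix of p (degree m) and q (degree n), size (m+n) x (m+n).
    Coefficients are listed from the highest degree down."""
    m, n = len(p) - 1, len(q) - 1
    S = np.zeros((m + n, m + n))
    for i in range(n):                    # n shifted copies of p (block S_p)
        S[i, i:i + m + 1] = p
    for i in range(m):                    # m shifted copies of q (block S_q)
        S[n + i, i:i + n + 1] = q
    return S

p = [1.0, -3.0, 2.0]   # x^2 - 3x + 2 = (x - 1)(x - 2)
q = [1.0, -4.0, 3.0]   # x^2 - 4x + 3 = (x - 1)(x - 3)
S = sylvester(p, q)
# gcd(p, q) = x - 1 has degree 1, so the nullity of S is 1
print(np.linalg.matrix_rank(S))  # 3
```

The rank deficiency of S equals the degree of the greatest common divisor, which is exactly the property exploited by the rank-detection algorithm discussed below.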
We now describe how the GSA-based algorithm proposed in [6] for computing the rank of S can be implemented in a faster way. This variant is based on the computation of the Cholesky factor R of S^T S, with R upper triangular, i.e., S^T S = R^T R. Defining Z as the shift matrix of order m + n, the generator matrix G of S^T S with respect to Z is then given in [6]; it consists of four generators, two positive and two negative, expressed in terms of the columns of S and of the vectors e_j of the canonical basis.
The algorithm proposed in [6] is based on the following GSA implementation for computing the R factor of the QR factorization of S.
function R = GSA_chol2(G, n)
for i = 1 : n,
Θ1 = givens(G(1, i), G(2, i)); Θ2 = givens(G(3, i), G(4, i));
G(1 : 2, i : n) = Θ1 ∗ G(1 : 2, i : n); G(3 : 4, i : n) = Θ2 ∗ G(3 : 4, i : n);
[c1, s1] = Hrotate(G(1, i), G(3, i));
G([1, 3], i : n) = Happly(c1, s1, G([1, 3], i : n), n − i + 1);
R(i, i : n) = G(1, i : n);
G(1, i + 1 : n) = G(1, i : n − 1); G(1, i) = 0;
end % for
At the ith iteration of the algorithm, the Givens rotations Θ1 and Θ2 are computed and applied, respectively, to the first and second generators, and to the third and fourth generators, to annihilate G(2, i) and G(4, i). Hence, the hyperbolic rotation is applied to the first and the third row of G to annihilate G(3, i). Finally, the first row of G becomes the ith row of R and the first row of G is multiplied by Z^T.
Summarizing, at the first step of the ith iteration of GSA, all entries of the ith column of G but the first one are annihilated. If the number of rows of G is greater than two, this can be accomplished in different ways (see [5,14]).
Analyzing the pattern of the generators in Equation (8), we are able to derive a different implementation of the algorithm with a reduced computational cost. Moreover, this implementation allows proving that the first l diagonal entries of R are monotonically decreasing.
We observe that the matrix appearing in Equation (6) gives rise to a SPD Toeplitz structure. From Equation (9), it turns out that two of the rows of the generator matrix have their first entry equal to zero and differ only in the entry of a single column. This particular pattern of G is close to the ones described in [13,14,18], allowing the design of an alternative GSA implementation with respect to that considered in [6], thereby reducing the complexity, where r denotes the computed rank of S.
Since the description of the above GSA implementation is quite cumbersome and similar to the algorithms reported in [13,14,18], we omit it here. The corresponding matlab pseudo–code can be obtained from the authors upon request.
If the matrix S has rank r, then at the (r + 1)st iteration the pivot entry vanishes in exact arithmetic [6]. Therefore, at each iteration k of the algorithm, we check whether the pivot exceeds a fixed tolerance τ, as in Equation (10). If Equation (10) is not satisfied, we stop the computation, considering k as the computed numerical rank of S.
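This stopping test can be mimicked with a standard QR factorization in place of the GSA (a sketch only: the tolerance value is arbitrary, and in general column pivoting would be needed for a fully reliable rank test): the numerical rank is the number of leading diagonal entries of R whose absolute value exceeds τ.

```python
import numpy as np

def numerical_rank_from_R(S, tol=1e-10):
    """Numerical rank of S read off the diagonal of the R factor of its
    QR factorization, stopping at the first negligible pivot."""
    R = np.linalg.qr(S, mode='r')
    k = 0
    for d in np.abs(np.diag(R)):
        if d <= tol:
            break              # pivot below tolerance: rank found
        k += 1
    return k

# Sylvester matrix of (x-1)(x-2) and (x-1)(x-3): nullity 1
S = np.array([[1., -3., 2., 0.],
              [0., 1., -3., 2.],
              [1., -4., 3., 0.],
              [0., 1., -4., 3.]])
print(numerical_rank_from_R(S))  # 3
```

The GSA-based variant performs the same test on its pivot, but obtains each diagonal entry of R from the generators at a fraction of the cost of a full QR factorization.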
The R factor of the QR factorization of S is unique if the diagonal entries of R are positive. The considered GSA implementation, yielding the rank of S and based on computing the R factor of the QR factorization of S, allows us to prove that the first l entries of the diagonal of R are ordered in decreasing order. In fact, the following theorem holds.
Theorem 1. Let S^T S = R^T R be the Cholesky factorization of S^T S, with S the Sylvester matrix defined in Equation (6) with rank r. Then, the first l diagonal entries of R monotonically decrease, i.e., r_{1,1} ≥ r_{2,2} ≥ ⋯ ≥ r_{l,l}. (11) Proof. Each entry i of the diagonal of R is determined by the ith entry of the first row of G at the end of iteration i − 1, for i = 1, …, l. Consider the following alternative implementation of the GSA for computing the first l columns of the Cholesky factor of S^T S.
for i = 1 : l,
Θ = givens(G(1, i), G(2, i)); G(1 : 2, i : n) = Θ ∗ G(1 : 2, i : n);
[c1, s1] = Hrotate(G(1, i), G(4, i)); G([1, 4], i : n) = Happly(c1, s1, G([1, 4], i : n), n − i + 1);
[c2, s2] = Hrotate(G(1, i), G(3, i)); G([1, 3], i : n) = Happly(c2, s2, G([1, 3], i : n), n − i + 1);
end % for
We observe that, for i = 1, the pivot G(1, 1) is the only entry in the first column of G different from zero. Hence, the first iteration amounts only to shifting the first row one position rightward.
At the beginning of iteration i, the second and the fourth row of G are equal, by Equation (8). Hence, when applying a Givens rotation to the first and the second row in order to annihilate the entry G(2, i), and when subsequently applying a hyperbolic rotation to the first and fourth row of G in order to annihilate G(4, i), it turns out that the second and the fourth rows are then modified but still equal to each other, while the third row remains unchanged. The equality between the second and the fourth row is maintained throughout the iterations. Therefore, the second and the fourth row of G do not play any role in computing the first l diagonal entries of R and can be neglected. Hence, the GSA for computing these entries reduces only to applying a hyperbolic rotation to the first and the third generators, as described in the following algorithm.
for i = 2 : l,
[c1, s1] = Hrotate(G(1, i), G(3, i)); G([1, 3], i : n) = Happly(c1, s1, G([1, 3], i : n), n − i + 1);
end % for
Since, at the beginning of iteration i, |G(1, i)| > |G(3, i)|, the involved hyperbolic rotation with parameter ρ_i, |ρ_i| < 1, is such that the updated pivot is equal to G(1, i) √(1 − ρ_i²). Therefore, r_{i,i} = r_{i−1,i−1} √(1 − ρ_i²), and thus r_{i,i} ≤ r_{i−1,i−1}. □
Remark 1. The above GSA implementation allows proving the inequality in Equation (11). This property is difficult to obtain if the QR factorization is performed via Householder transformations or if the classical Cholesky factorization of S^T S is used.
5. GSA for Computing the Null-Space of Polynomial Matrices
In this section, we consider the problem of computing a polynomial basis for the null-space of a polynomial matrix of given degree and rank. As described in [8,19,20], the above problem is equivalent to that of computing the null-space of a related block-Toeplitz matrix. Algorithms to solve this problem are proposed in [8,19], but they do not explicitly exploit the structure of the involved matrix. Algorithms to solve related problems have also been described in the literature, e.g., in [8,19,21,22].
In this paper, we propose an algorithm for computing the null-space of polynomial matrices based on a variant of the GSA for computing the null-space of a related band block-Toeplitz matrix [8].
5.1. Null-Space of Polynomial Matrices
A polynomial vector is said to belong to the null-space of the polynomial matrix in Equation (12) if the corresponding matrix–vector product is identically zero. A polynomial vector belongs to the null-space of the polynomial matrix iff the vector of its stacked coefficients belongs to the null-space of the band block-Toeplitz matrix T in Equation (13), whose number of block columns can be determined, e.g., by the algorithm described in [8]. Hence, the problem of computing the null-space of the polynomial matrix in Equation (12) is equivalent to the problem of computing the null-space of the matrix in Equation (13). To normalize the entries of this matrix, it is appropriate to first perform a QR factorization of each block column of T and to absorb the upper triangular factor U in the coefficient vector. The convolution equation then becomes an equation of the same type, but where the coefficient matrices form together an orthonormalized matrix.
Remark 2. Above, we have assumed that there are no constant vectors in the kernel of the polynomial matrix. If there are, then the block column of coefficient matrices has rank less than n, and the above factorization will discover it, in the sense that the matrix U is nonsquare and the resulting coefficient matrices have fewer columns than before. This trivial null-space can be eliminated, and we therefore assume that the rank was full. For simplicity, from now on, we also assume that the coefficient matrices of the polynomial matrix were already normalized in this way; the block columns of T are thus orthonormalized. This normalization proves to be very useful in the sequel.
Denote by 0_k the null matrix of order k. If a vector belongs to the null-space of T, then the vectors obtained by padding it with zero blocks belong to the null-space of the correspondingly enlarged matrix. In this case, the original vector is said to be a generator vector of a chain of the null-space of T. The proposed algorithm for the computation of the null-space of polynomial matrices is based on the GSA for computing the R factor of the QR factorization of the matrix T in Equation (13) and, if R is full column rank, its inverse R^{−1}.
Let us first assume that the matrix T has full column rank. If it does not, the algorithm still computes the R factor in trapezoidal form [23]. Moreover, in this case, we compute the leading rows of the inverse of the matrix obtained by appending the last rows of an identity matrix to T. Let us consider the SPD block-Toeplitz matrix W built from the blocks of T. Notice that, because of the normalization introduced before, the diagonal blocks of W are identity matrices; this is used below. The matrix W admits a factorization whose triangular factor is the R factor of the QR factorization of T, i.e., the Cholesky factor of T^T T. Hence, R and its inverse R^{−1} can be retrieved from the first block columns of the computed factor.
The displacement matrix and the displacement rank of W with respect to the corresponding block shift matrix are defined as in Section 2. Then, taking the order of the matrices into account, it turns out that Equation (18) can be written as the difference of two matrices of rank at most n. Since the block columns of T are orthonormalized, the construction of G does not require any computation: it is easy to check that G can be read off directly from the blocks of W.
Remark 3. Observe that, as the number of block columns increases, the structures of W and of its displacement do not change, due to the block band structure of the matrix W. Consequently, the length of the corresponding generators changes, but their structure remains the same, since only a few blocks are different from zero in the first block row.
The computation by the GSA of the R factor of T and of its inverse is performed by using only the matrix G rather than the matrix T. Its implementation is a straightforward block-matrix extension of the GSA described in Section 2.
Remark 4. By construction, the initial generator matrix has only its first block rows and one further block row different from zero. Therefore, the multiplication by the J-orthogonal matrix does not modify the structure of the generator matrix.
At each iteration i, we start from the generator matrix having two blocks (of length n) different from zero in the relevant columns. We then look for a J-orthogonal matrix such that, in the transformed generator matrix, these blocks become a nonsingular upper triangular matrix and a zero matrix, respectively.
Then, the updated generator matrix is obtained from the transformed one by multiplying its first n columns with the block shift matrix. The J-orthogonal matrix at the ith iteration of the GSA can be constructed as a product of n Householder matrices and n hyperbolic rotations. The multiplication by the Householder matrices modifies the last n columns of the generator matrix, annihilating all but one of the last n entries in each involved row, while the multiplication by the hyperbolic rotations acts on pairs of rows, annihilating the remaining entry.
Given the two entries to be combined, a hyperbolic matrix can be computed such that one of them is annihilated.
The modifications of the sparsity pattern of the generator matrix after the first and the ith iteration of the GSA are displayed in Figure 1 and Figure 2, respectively.
The reliability of the GSA strongly depends on the way the hyperbolic rotations are computed. In [4,5,24], it is proven that the GSA is weakly stable if the hyperbolic rotations are implemented in an appropriate manner [3,11,12,24].
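One such scheme is the mixed application of the hyperbolic rotation (a sketch of the standard formulas, not necessarily the exact variant used in the cited references): instead of forming H explicitly, one computes x′ = (x − ρy)/√(1 − ρ²) and then y′ = √(1 − ρ²)·y − ρ·x′, which is algebraically equivalent to applying H but reorders the operations to limit cancellation.

```python
import numpy as np

def happly_mixed(rho, x, y):
    """Apply the hyperbolic rotation with parameter rho (|rho| < 1) to the
    row pair (x, y) in mixed form: the second output reuses the first,
    which improves the numerical behavior of the downdate."""
    d = np.sqrt((1.0 - rho) * (1.0 + rho))
    x_new = (x - rho * y) / d
    y_new = d * y - rho * x_new     # equals (y - rho * x) / d exactly
    return x_new, y_new

# consistency check against the explicit rotation H = [[1, -rho], [-rho, 1]] / d
rho = 0.8
x = np.array([3.0, 1.0, 2.0])
y = np.array([2.4, 0.5, -1.0])
H = np.array([[1.0, -rho], [-rho, 1.0]]) / np.sqrt(1.0 - rho**2)
xm, ym = happly_mixed(rho, x, y)
print(np.allclose(np.vstack([xm, ym]), H @ np.vstack([x, y])))  # True
```

The equivalence follows by substituting x′ into the expression for y′; in floating point, however, the two orderings can differ markedly when |ρ| is close to one.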
As previously mentioned, the GSA relies only on the knowledge of the generators of W rather than on the matrix itself. Their construction involves a single matrix product, whose cost is modest. The ith iteration of the GSA involves the multiplication of n Householder matrices of size n times the generator matrix. Therefore, since the cost of the multiplication by the hyperbolic rotations is negligible with respect to that of the multiplication by the Householder matrices, the overall computational cost of the GSA is dominated by the Householder multiplications.
5.2. GSA for Computing the Right Null-Space of Semidefinite Block Toeplitz Matrices
As already mentioned in Section 5.1, the number of block columns of the matrix T in Equation (13) can be computed as described in [8]. For the sake of simplicity, in the considered examples, we choose it large enough to compute the null-space of T. The structure and the computation via the GSA of the R factor of the QR factorization of the singular block-Toeplitz matrix T with deficient rank are considered in [23]. A modification of the GSA for computing the null-space of Toeplitz matrices is described in [25]. In this paper, we extend the latter results to compute the null-space of T by modifying the GSA.
Without loss of generality, let us assume that the first columns of T are linearly independent and that the following column depends linearly on the previous ones. Therefore, the corresponding leading principal minors of T^T T are positive, while the next one is zero. Consider the spectral decomposition of this singular leading principal submatrix, with an orthogonal matrix of eigenvectors and a diagonal matrix of eigenvalues, and let a perturbed matrix be obtained by replacing the zero eigenvalue with a small parameter ε > 0.
Let R_ε be the Cholesky factor of the perturbed matrix, with R_ε upper triangular. Hence, as ε tends to zero, the last column of the inverse of R_ε becomes closer and closer to a multiple of the eigenvector corresponding to the 0 eigenvalue of the singular matrix. Therefore, a vector of the null-space can be recovered in the limit.
We observe that one column of the generator matrix is not involved in the GSA until the very last iteration, since only one of its entries is different from zero. At the very last iteration, a hyperbolic rotation is applied to the nth row of one generator block and the first row of the following one. Since the involved matrix is singular, the corresponding rotation parameter tends to one (see [23,25]).
Since a vector of the right null-space of T is determined up to multiplication by a constant, such a vector can be computed at the last iteration, neglecting higher-order terms in ε, as the first column of an appropriate product. When a vector of the null-space is detected as a linear combination of row n of one generator block and row one of the following one, the new generator matrix G for the GSA is obtained by removing the corresponding columns from G [23,25].
The implementation of the modified GSA for computing the null-space of the band block-Toeplitz matrices in Equation (13) is rather technical and can be obtained from the authors upon request.
The stability properties of the GSA have been studied in [4,5,24]. The proposed algorithm inherits the stability properties of the GSA, which means that it is weakly stable.