1. Introduction
In this paper, we are interested in solving the linear response eigenvalue problem (LREP):
where K and M are real symmetric positive definite matrices. Such a problem arises in the study of the excitation energies of many-particle systems in computational quantum chemistry and physics [1,2,3]. It is also known as the Bethe-Salpeter (BS) eigenvalue problem [4] or the random phase approximation (RPA) eigenvalue problem [5]. There has been immense past and recent work on developing efficient numerical algorithms and attractive theories for LREP [6,7,8,9,10,11,12,13,14,15].
Since all the eigenvalues of  are real, nonzero, and appear in pairs  [6], we order the eigenvalues in ascending order, i.e.,
In this paper, we focus on a small portion of the positive eigenvalues of LREP, i.e., , with  and , together with their corresponding eigenvectors. We consider only the real case; all the results can easily be extended to the complex case.
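This pairing property is easy to observe numerically. The following sketch (matrix sizes and random seed are our own choices) builds the structured matrix from random symmetric positive definite K and M and checks that its spectrum is real, nonzero, and symmetric about the origin:

```python
import numpy as np

# Illustrative check (sizes and seed are ours): for symmetric positive
# definite K and M, the eigenvalues of H = [[0, K], [M, 0]] are real,
# nonzero, and occur in +/- pairs.
rng = np.random.default_rng(0)
n = 5
A = rng.standard_normal((n, n)); K = A @ A.T + n * np.eye(n)
C = rng.standard_normal((n, n)); M = C @ C.T + n * np.eye(n)
H = np.block([[np.zeros((n, n)), K], [M, np.zeros((n, n))]])
lam = np.linalg.eigvals(H)
print(np.max(np.abs(lam.imag)))           # tiny: the spectrum is real
pos = np.sort(lam.real[lam.real > 0])
neg = np.sort(-lam.real[lam.real < 0])
print(np.allclose(pos, neg, atol=1e-6))   # True: +/- pairing
```

The realness follows because KM is similar to the symmetric positive definite matrix M^{1/2}KM^{1/2}, so the squared eigenvalues are positive.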
The weighted Golub-Kahan-Lanczos method (wGKL) for LREP was introduced in [16]. At the j-th iteration it recursively produces a much smaller projection  of , where  is upper bidiagonal. The eigenpairs of  can then be constructed from the singular value decomposition of . The convergence analysis shows that running k iterations of wGKL is equivalent to running  iterations of a weighted Lanczos algorithm for  [16]. In fact,  can also be a lower bidiagonal matrix, and the same discussion carries over from the case in which  is upper bidiagonal. In the following, we consider only the upper bidiagonal case.
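Since the recurrence itself is not reproduced in this excerpt, the following Python sketch reconstructs a plausible single-vector wGKL iteration under the relations M U = V B and K V = U Bᵀ, with U M-orthonormal, V K-orthonormal, and B upper bidiagonal; the variable names, normalization choices, and the full re-orthogonalization are ours:

```python
import numpy as np

def wgkl(K, M, u1, steps):
    """Sketch of a single-vector weighted Golub-Kahan-Lanczos recurrence
    (upper-bidiagonal variant) with full re-orthogonalization.  It builds
    U with U^T M U = I, V with V^T K V = I, and an upper bidiagonal B
    such that, in exact arithmetic, M U = V B and K V = U B^T + residual."""
    n = K.shape[0]
    U = np.zeros((n, steps)); V = np.zeros((n, steps))
    alpha = np.zeros(steps); beta = np.zeros(max(steps - 1, 0))
    U[:, 0] = u1 / np.sqrt(u1 @ M @ u1)            # M-normalize the start
    for j in range(steps):
        w = M @ U[:, j]
        if j > 0:
            w = w - beta[j - 1] * V[:, j - 1]
        w = w - V[:, :j] @ (V[:, :j].T @ (K @ w))  # re-orthogonalize (K-inner)
        alpha[j] = np.sqrt(w @ K @ w)
        V[:, j] = w / alpha[j]
        if j < steps - 1:
            s = K @ V[:, j] - alpha[j] * U[:, j]
            s = s - U[:, :j + 1] @ (U[:, :j + 1].T @ (M @ s))  # M-inner reorth
            beta[j] = np.sqrt(s @ M @ s)
            U[:, j + 1] = s / beta[j]
    return U, V, np.diag(alpha) + np.diag(beta, 1)
```

On a small dense problem run for the full n steps, the singular values of B should match the positive eigenvalues of H = [[0, K], [M, 0]] up to round-off, consistent with the equivalence to a weighted Lanczos process noted above.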
It is well known that the single-vector Lanczos method is widely used for computing a small number of extreme eigenvalues, but it may converge very slowly when the wanted eigenvalues lie in a cluster [17]. In contrast, a block Lanczos method with a suitable block size is capable of computing a cluster of eigenvalues, including multiple eigenvalues, very quickly. Motivated by this idea, we develop a block version of the wGKL method of [16] in order to find efficiently all or some positive eigenvalues within a cluster for LREP. Based on the standard block Lanczos convergence theory in [17], error bounds for the approximation of an eigenvalue cluster, as well as of the corresponding eigenspace, are established to illustrate the advantage of our weighted block Golub-Kahan-Lanczos algorithm (wbGKL).
As the size of the Krylov subspace increases, the storage demands, computational costs, and numerical stability of a simple block Lanczos method may suffer [18]. Several efficient restarting strategies have been developed for the classic Lanczos method to eliminate these effects, such as the implicitly restarted method [19] and the thick restart method [20]. To make our block method more practical, and to exploit the special structure of LREP, we apply the thick restart strategy to our block method.
The rest of this paper is organized as follows. Section 2 gives some necessary preliminaries for later use. In Section 3, the weighted block Golub-Kahan-Lanczos algorithm (wbGKL) for LREP is presented and its convergence analysis is discussed. Section 4 proposes the thick restart weighted block Golub-Kahan-Lanczos algorithm (wbGKL-TR). Numerical examples are given in Section 5 to illustrate the efficiency of the new algorithms. Finally, some conclusions are drawn in Section 6.
Throughout this paper,  is the set of all  real matrices, , and .  (or simply I if its dimension is clear from the context) is the identity matrix, and  is an  zero matrix. The superscript “” denotes transpose.  denotes the Frobenius norm of a matrix, and  denotes the 2-norm of a matrix or a vector. For a matrix ,  denotes the rank of X and  denotes the column space of X; the submatrices  and  of X consist of rows i to j and of columns k to ℓ, respectively. For matrices or scalars ,  denotes the block diagonal matrix whose i-th diagonal block is .
2. Preliminaries
For a symmetric positive definite matrix , the W-inner product is defined as follows:
If , we write , and say that x and y are W-orthogonal. The projector  is called the W-orthogonal projector onto  if, for any ,
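Assuming the usual closed form, the W-orthogonal projector onto 𝒵 = span(Z) can be written Π = Z(ZᵀWZ)⁻¹ZᵀW. The following minimal check (matrix sizes and seed are ours) verifies idempotency and the W-orthogonality of the residual:

```python
import numpy as np

rng = np.random.default_rng(1)
n, k = 6, 2
A = rng.standard_normal((n, n))
W = A @ A.T + n * np.eye(n)                  # symmetric positive definite
Z = rng.standard_normal((n, k))

# W-orthogonal projector onto span(Z): Pi = Z (Z^T W Z)^{-1} Z^T W
Pi = Z @ np.linalg.solve(Z.T @ W @ Z, Z.T @ W)

x = rng.standard_normal(n)
print(np.allclose(Pi @ Pi, Pi))              # True: idempotent
print(np.allclose(Pi @ Z, Z))                # True: fixes span(Z)
print(abs((x - Pi @ x) @ W @ (Pi @ x)))      # tiny: residual is W-orthogonal
```

Note that Π is not symmetric in general, but WΠ is, which is exactly the W-self-adjointness one expects of a W-orthogonal projector.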
For two subspaces  and  with , suppose  and  are W-orthonormal bases of  and , respectively, i.e.,  and . If  for , where  are the singular values of , then the W-canonical angles  from  to  are defined by
If , these angles are said to be between  and . Obviously, . Set
In particular, if , i.e., X is a vector, there is only one W-canonical angle from  to . In the following, we may use a matrix in one or both arguments of , i.e., , with the understanding that it means the subspace spanned by the columns of the matrix argument.
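Following this definition, the W-canonical angles can be computed by W-orthonormalizing bases of the two subspaces and taking the singular values of the cross-Gram matrix in the W-inner product; the helper names below are ours:

```python
import numpy as np

def w_orthonormalize(X, W):
    # If X^T W X = R^T R (Cholesky), then X R^{-1} has W-orthonormal columns.
    R = np.linalg.cholesky(X.T @ W @ X).T      # upper triangular factor
    return X @ np.linalg.inv(R)

def w_canonical_angles(X, Y, W):
    Xo = w_orthonormalize(X, W)
    Yo = w_orthonormalize(Y, W)
    cos = np.linalg.svd(Xo.T @ W @ Yo, compute_uv=False)
    return np.arccos(np.clip(cos, 0.0, 1.0))   # clip guards round-off

# With W = I this reduces to the ordinary principal angles:
t = 0.3
X = np.array([[1.0], [0.0]])
Y = np.array([[np.cos(t)], [np.sin(t)]])
print(w_canonical_angles(X, Y, np.eye(2)))     # approximately [0.3]
```

With W = I the routine recovers the classical principal angles between subspaces, and the angles between a subspace and itself are zero for any symmetric positive definite W.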
The following two lemmas are important for our later analysis; for proofs and more details, the reader is referred to [12,16].
Lemma 1 ([12], Lemma 3.2). Let  and  be two subspaces of  with equal dimension . Suppose . Then, for any set of basis vectors  in , where , there is a set of linearly independent vectors  in  such that  for , where  is the W-orthogonal projector onto .
Lemma 2 ([16], Proposition 3.1). The matrix  has N positive eigenvalues  with . The corresponding right eigenvectors can be chosen K-orthonormal, and the corresponding left eigenvectors can be chosen M-orthonormal. In particular, for given , one can choose , and for given , , for .
4. Thick Restart
As the number of iterations increases, Algorithm 1 may encounter the dilemma that the computational and storage costs grow sharply while the numerical stability gradually deteriorates. In this section, we apply the thick restart strategy [20] to improve the algorithm. After running n iterations, Algorithm 1 yields the following relations for LREP:
with .
Recalling the SVD (2), let  and  be the first  columns of  and , respectively, i.e.,
Thus it follows that
where .
To use the approximate eigenvectors of  for the thick restart, we post-multiply Equation (15) by  and , respectively, and get
From (16), letting , , , , , , we can rewrite (17) as
and .
Next,  and  will be generated. Firstly, we compute
From the second equation in (18), we know . Computing the Cholesky decomposition  and setting , , we compute , and with
we have
Secondly, from the above equation, we can compute
Again using (19), it is easy to see that . Similarly, computing the Cholesky decomposition  and setting , , we compute , and letting , we get
Continuing the same procedure for  and , we obtain the new M-orthonormal matrix , the new K-orthonormal matrix , and the new matrix , together with the relations
with , and
Note that  is no longer a block bidiagonal matrix. Algorithm 2 is our thick restart weighted block Golub-Kahan-Lanczos algorithm for LREP.
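The basic building block of the derivation above — orthogonalizing a candidate block against the kept basis in the M-inner product and then M-orthonormalizing it through a Cholesky factor of its Gram matrix — can be sketched as follows (function and variable names are ours):

```python
import numpy as np

def append_m_orthonormal_block(Um, S, M):
    """One Cholesky-based re-orthonormalization step, as used when the
    restarted recurrence is resumed: M-orthogonalize the candidate block S
    against the kept M-orthonormal basis Um, then M-orthonormalize it.
    Returns the new block and the upper triangular factor."""
    S = S - Um @ (Um.T @ (M @ S))       # remove components along range(Um)
    G = S.T @ M @ S                     # Gram matrix in the M-inner product
    R = np.linalg.cholesky(G).T         # G = R^T R with R upper triangular
    return S @ np.linalg.inv(R), R
```

The same routine with K in place of M handles the K-orthonormal side of the recurrence.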
Remark 6. In fact, from the construction of , the procedure for obtaining  and  is the same as applying Algorithm 1 to  for  iterations; thus, when restarting, we use Algorithm 1 directly in Step 2 of the following Algorithm 2.
Algorithm 2: wbGKL-TR
1. Given an initial guess  satisfying , a tolerance , an integer k (the number of blocks of approximate eigenvectors to add to the solving subspace), an integer n (the block dimension of the solving subspace), and the desired number of eigenpairs;
2. Apply Algorithm 1 from the current point to generate the rest of , , and . In the first cycle the current point is ; otherwise it is ;
3. Compute an SVD of  as in (2), select the wanted singular values  and their associated left singular vectors  and right singular vectors . Form the approximate eigenpairs  for . If the stopping criterion is satisfied, stop; otherwise continue;
4. Generate the new , , and : compute , , , , , ; compute , perform the Cholesky decomposition , and set , , ; compute , perform the Cholesky decomposition , and set , , ; let , , , and go to Step 2.
Remark 7. In Step 3, we compute the harmonic Ritz pairs after n iterations. In practice, we do the computation at each iteration . When restarting, the information chosen to add to the solving subspaces consists of the wanted singular values of  together with their corresponding left and right singular vectors. In practice, we use the MATLAB command “sort” to choose the smallest or the largest ones; which singular values to choose depends on the desired eigenvalues of .
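For illustration, here is one way the approximate eigenpairs in Step 3 can be assembled from the SVD of the projected matrix, assuming wGKL-type relations M U = V B and K V ≈ U Bᵀ with U M-orthonormal and V K-orthonormal; the function name, the selection logic, and the synthetic diagonal data in the test are ours:

```python
import numpy as np

def approx_eigenpairs(U, V, B, how_many, which="smallest"):
    """Sketch of Step 3: given M-orthonormal U, K-orthonormal V and the
    projected matrix B with M U = V B and K V ~ U B^T, the SVD B = P S Q^T
    yields approximate positive eigenvalues sigma_i of H = [[0, K], [M, 0]]
    with approximate eigenvectors [U q_i; V p_i]."""
    P, s, Qt = np.linalg.svd(B)
    order = np.argsort(s)                  # ascending singular values
    if which == "largest":
        order = order[::-1]
    idx = order[:how_many]
    vals = s[idx]
    vecs = np.vstack([U @ Qt.T[:, idx], V @ P[:, idx]])
    return vals, vecs
```

The sorting step mirrors the use of “sort” mentioned in the remark: ascending order when the smallest positive eigenvalues are wanted, descending order for the largest.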
At the end of this section, we list the computational costs in a generic cycle of four algorithms: the weighted block Golub-Kahan-Lanczos algorithm, the thick restart weighted block Golub-Kahan-Lanczos algorithm, the block Lanczos algorithm [12], and the thick restart block Lanczos algorithm [12], denoted by wbGKL, wbGKL-TR, BLan, and BLan-TR, respectively. Detailed pseudocodes for BLan and BLan-TR can be found in [12].
The comparisons are presented in Table 1 and Table 2. Here, a “block vector” is an  rectangular matrix, and “mvb” denotes the number of products of an  matrix with a block vector. “dpb” denotes the number of dot products of two block vectors X and Y, i.e., . “saxpyb” denotes the number of additions of two block vectors or of multiplications of a block vector by an  small matrix. “Ep() (with sorting)” denotes the number of -size eigenvalue problems, with sorting of the eigenvalues and their corresponding eigenvectors, in one cycle. Similarly, “Sp()” denotes the number of -size singular value decompositions in one cycle. Because wbGKL and BLan are non-restarted algorithms, we count only the first n Lanczos iterations.
5. Numerical Examples
In this section, two numerical experiments are carried out using MATLAB 8.4 (R2014b) on a laptop with an Intel Core i5-6200U CPU at 2.3 GHz and 8 GB of memory under the Windows 10 operating system.
Example 1. In this example, we check the bounds established in Theorems 1 and 2. For simplicity, we take , the number of weighted block Golub-Kahan-Lanczos steps , and the diagonal matrix , where
and , , , , . There are three positive eigenvalue clusters: , ,  or . Obviously, . We seek two groups of approximate eigenpairs, the first related to the first cluster and the second related to the last cluster, i.e.,  approximates  and  approximates . To see the effect of  on the upper bounds of the approximation errors of the eigenpairs in the weighted block Golub-Kahan-Lanczos method for LREP, we vary the parameter  to control the tightness of the eigenvalues within  and . First, we choose the same matrix  as in [12,17], i.e.,
Obviously,  and . Since K is symmetric positive definite, we compute the Cholesky decomposition  and let ; hence  satisfies (5), i.e.,  is singular. We take ; then Z satisfies (6). We run the weighted block Golub-Kahan-Lanczos method with full re-orthogonalization for LREP in MATLAB and check the bounds in (7), (8), (13), and (14). Since the approximate eigenvalues are  and , we have  and , and we measure the following two groups of errors:
and
In fact,  and  are upper bounds of  and , while  and  are upper bounds of  and .
Table 3 and Table 4 report the results for , ,  as the parameter  goes to 0. From the two tables, we can see that the bounds for the eigenvalues lying in a cluster and for their corresponding eigenvectors are sharp, and that they are not sensitive to  as  goes to 0.
Example 2. In this example, we are going to test the effectiveness of our weighted block Golub-Kahan-Lanczos algorithms. Four algorithms are tested, i.e.,wbGKL,wbGKL-TR,BLan, andBLan-TR. We choose 3 test problems used in [12,13], which are listed in Table 5. All the matrices K and M in the problems are symmetric positive definite. Specifically, Test 1 and Test 2, which are derived by the turboTDDFT command in QUANTUM ESPRESSO [22], are from the linear response research for Na2 and silane (SiH4) compound, respectively. The matrices K and M in Test 3 are from the University of Florida Sparse Matrix Collection [23], where the order of K is , and M is the leading principal submatrix of . We aim to compute the smallest 5 positive eigenvalues and the largest 5 eigenvalues, i.e.,
for , together with their associated eigenvectors. The initial guess is chosen as  with block size , where  is the MATLAB command. As in Example 1, since K is symmetric positive definite, we compute the Cholesky decomposition  and let ; hence  satisfies .
In wbGKL-TR and BLan-TR, we select , , i.e., a restart occurs once the dimension of the solving subspace exceeds 90, and the information of 60 Ritz vectors is kept. For wbGKL and BLan, since there is no restart, we compute the approximate eigenpairs when the number of Lanczos iterations equals , ; hence the number of Lanczos iterations is the same as in wbGKL-TR and BLan-TR. The following relative eigenvalue error and relative residual 1-norm are calculated for each of the 10 approximate eigenpairs:
where the “exact” eigenvalues  are computed by the MATLAB code . A computed approximate eigenpair  is regarded as converged if .
Table 6 and Table 7 give the number of Lanczos iterations (denoted by ) and the CPU time in seconds (denoted by ) for the four algorithms; Table 6 is for the smallest 5 positive eigenvalues and Table 7 for the largest 5 eigenvalues. From the tables, one can see that, for both the smallest and the largest eigenvalues, the iteration numbers of the four algorithms are comparable, but wbGKL and wbGKL-TR cost significantly less time than BLan and BLan-TR; in particular, wbGKL-TR consumes the least time. Because BLan and BLan-TR need to compute the eigenvalues of , which is a nonsymmetric matrix, these two algorithms are slower than wbGKL and wbGKL-TR. Owing to the savings in the orthogonalization procedure and to solving a much smaller  problem, wbGKL-TR is the fastest algorithm.
The accuracy of the last two approximate eigenpairs in Test 1 is shown in Figure 1. From the figure, we can see that, for the last two eigenpairs, wbGKL and BLan require almost the same number of iterations to reach the same accuracy, and wbGKL-TR and BLan-TR likewise require almost the same number of iterations, amounting to one or two more restarts than wbGKL and BLan. On the one hand, by avoiding a nonsymmetric eigenproblem, wbGKL and wbGKL-TR save much more time than BLan and BLan-TR. On the other hand, since the dimension of the solving subspace for wbGKL-TR is bounded by , the savings in the orthogonalization process and in the much smaller singular value decomposition problem are sufficient to cover the additional restart steps.