1. Introduction
Recently, many iterative methods without memory have been published for approximating the inverse or some generalized inverse of a complex matrix A of arbitrary order (see, for example, [1,2,3,4,5,6] and the references therein). This topic plays a significant role in many areas of applied sciences and engineering, such as multivariate analysis, image and signal processing, approximation theory, and cryptography (see [7]).
The discretization of boundary value problems or partial differential equations by means of divided differences or finite elements leads to a large number of linear systems to be solved. This holds both for equations with integer-order derivatives and for equations with fractional derivatives (see, for example, [8,9]). In these linear problems, the coefficient matrix is usually too large or too ill-conditioned for the system to be solved analytically. Thus, iterative methods can play a key role.
The main purpose of this manuscript is to design a secant-type iterative scheme with memory, free of inverse operators and efficient from the point of view of CPU time, for estimating the inverse of a nonsingular complex matrix. We also discuss the generalization of the proposed scheme for approximating the Drazin inverse of singular square matrices and the Moore–Penrose inverse of complex rectangular matrices. As far as we know, this is the first time that this kind of method with memory is applied to estimate generalized inverses, and it may be a first step toward higher-order methods with memory. Schemes of this kind have proven to be very stable for scalar equations; we expect a similar performance in the case of matrix equations.
Let us consider a nonsingular complex matrix $A$ of size $n \times n$. The extension of the iterative methods for the real equation $f(x) = \frac{1}{x} - a = 0$ to obtain the inverse of $A$, that is, the zero of the matrix function $F(X) = X^{-1} - A$, gives us the so-called Schulz-type schemes.
The best known of these schemes for estimating $A^{-1}$ is the Newton–Schulz method [10], whose iterative expression is
\[ X_{k+1} = X_k (2I - A X_k), \quad k = 0, 1, 2, \ldots, \tag{1} \]
where $I$ denotes the identity matrix of order $n$. Schulz [11] demonstrated that the eigenvalues of matrix $I - A X_0$ must be less than 1 in absolute value to assure the convergence of the scheme in Equation (1). Taking into account that the residuals $E_k = I - A X_k$ in each iteration of Equation (1) satisfy $E_{k+1} = E_k^2$, the Newton–Schulz method has quadratic convergence. In general, it is known that this scheme converges to $A^{-1}$ with $X_0 = \frac{A^*}{\|A\|_1 \|A\|_\infty}$ or $X_0 = \alpha A^*$, where $0 < \alpha < 2/\rho(A A^*)$, $\rho(\cdot)$ denotes the spectral radius, and $A^*$ is the conjugate transpose of $A$. Such schemes are also used for sensitivity analysis, when accurate approximate inverses are needed for both square and rectangular matrices.
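For illustration, a minimal Matlab sketch of the Newton–Schulz iteration in Equation (1) follows; the test matrix, its size, the scaling of the initial guess $X_0 = A^*/\rho(A A^*)$, and the tolerance are illustrative assumptions, not values taken from the experiments of Section 5.

```matlab
% Newton-Schulz iteration (Equation (1)): X_{k+1} = X_k (2I - A X_k).
A = rand(50);                          % hypothetical nonsingular test matrix
I = eye(size(A));
X = A' / max(eig(A*A'));               % X0 = A^*/rho(A A^*), so rho(I - A*X0) < 1
for k = 1:100
    Xold = X;
    X = X * (2*I - A*X);               % one Newton-Schulz step, no inverses
    if norm(X - Xold, 'fro') < 1e-10   % illustrative stopping tolerance
        break
    end
end
norm(A*X - I, 'fro')                   % residual, close to zero at convergence
```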
On the other hand, for a nonsingular matrix $A$, Li et al. [12] suggested the scheme
\[ X_{k+1} = X_k \left( I + R_k + R_k^2 + \cdots + R_k^{m-1} \right), \quad k = 0, 1, 2, \ldots, \]
with $R_k = I - A X_k$. They proved the convergence of $m$-th order of $\{X_k\}$ to the inverse of $A$. This result was extended by Chen et al. [13] for computing the Moore–Penrose inverse. Other iterative schemes without memory have been designed for approximating the inverse or some generalized inverses.
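As a sketch, this $m$-th order scheme can also be implemented without any inverse operator; the matrix, its size, the order $m$, and the tolerance below are again illustrative assumptions.

```matlab
% m-th order scheme: X_{k+1} = X_k (I + R_k + ... + R_k^{m-1}), R_k = I - A X_k.
m = 3;                                 % chosen order of convergence
A = rand(50);  I = eye(50);
X = A' / max(eig(A*A'));               % same initial guess as Newton-Schulz
for k = 1:60
    R = I - A*X;                       % residual R_k = I - A X_k
    S = I;  P = I;
    for j = 1:m-1                      % accumulate S = I + R + ... + R^(m-1)
        P = P * R;
        S = S + P;
    end
    X = X * S;                         % X_{k+1} = X_k * S
    if norm(R, 'fro') < 1e-12, break; end
end
```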
In this paper, we construct an iterative method with memory (that is, iterate $X_{k+1}$ is obtained not only from the iterate $X_k$ but also from other previous iterates) for computing the inverse of a nonsingular matrix. In the iterative expression of the designed method, inverse operators do not appear. We prove the order of convergence of the proposed scheme and we extend it for approximating the Moore–Penrose inverse of rectangular matrices and the Drazin inverse of singular square matrices.
For analyzing the order of convergence of an iterative method with memory, we use the concept of R-order introduced by Ortega and Rheinboldt in [14] and the following result.
Let us consider an iterative method with memory (IM) that generates a sequence $\{x_k\}$ of estimations of the solution $\alpha$, and let us also assume that this sequence converges to $\alpha$. If there exist a nonzero constant $\eta$ and nonnegative numbers $t_i$, $i = 0, 1, \ldots, m$, such that the inequality
\[ |e_{k+1}| \le \eta \prod_{i=0}^{m} |e_{k-i}|^{t_i} \]
holds, where $e_k = x_k - \alpha$ is the error of iterate $x_k$, then the R-order of convergence of (IM) satisfies
\[ O_R(\mathrm{IM}, \alpha) \ge s, \]
where $s$ is the unique positive root of the polynomial
\[ p(s) = s^{m+1} - \sum_{i=0}^{m} t_i s^{m-i}. \]
The proof of this result can be found in [14].
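For instance, when the error relation involves only one previous iterate ($m = 1$, $t_0 = t_1 = 1$), as happens for secant-type schemes,
\[ |e_{k+1}| \le \eta\, |e_k|\, |e_{k-1}| \;\Longrightarrow\; p(s) = s^2 - s - 1, \]
whose unique positive root is $s = \frac{1+\sqrt{5}}{2} \approx 1.618$; this is the super-linear order that appears repeatedly in the rest of the paper.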
The rest of the work is organized as follows. In the next section, we describe how a secant-type method, free of inverse operators, is constructed for estimating the inverse of a nonsingular complex matrix, proving its order of convergence. In Section 3 and Section 4, we study the generalization of the proposed method for computing the Moore–Penrose inverse of a rectangular complex matrix and the Drazin inverse of a singular square matrix. Section 5 is devoted to the numerical tests for analyzing the performance of the proposed schemes and confirming the theoretical results. The paper finishes with a section of conclusions.
2. A Secant-Type Method for Matrix Inversion
Let us recall that, for a scalar nonlinear equation $f(x) = 0$, the secant method is an iterative scheme with memory such that
\[ x_{k+1} = x_k - s_k^{-1} f(x_k), \quad k = 1, 2, \ldots, \]
with $s_k$ satisfying the secant equation
\[ s_k (x_k - x_{k-1}) = f(x_k) - f(x_{k-1}), \quad k = 1, 2, \ldots, \]
given $x_0$ and $x_1$ as initial approximations.
For a nonlinear matrix equation $F(X) = 0$, where $F : \mathbb{C}^{n \times n} \longrightarrow \mathbb{C}^{n \times n}$, the secant method can be described as
\[ X_{k+1} = X_k - S_k^{-1} F(X_k), \quad k = 1, 2, \ldots, \]
where $X_0$ and $X_1$ are initial estimations, and being $S_k$ a suitable linear operator satisfying
\[ S_k (Y_k) = Z_k, \]
where $Y_k = X_k - X_{k-1}$ and $Z_k = F(X_k) - F(X_{k-1})$. Thus, it is necessary to solve, at each iteration, the linear system $S_k (X_{k+1} - X_k) = -F(X_k)$. It is proven in [15] that, with this formulation, the secant method converges to the solution of $F(X) = 0$.
Let us consider an $n \times n$ nonsingular complex matrix $A$. We want to construct iterative schemes for computing the inverse $A^{-1}$ of $A$, that is, iterative methods for solving the matrix equation
\[ F(X) = X^{-1} - A = 0. \tag{5} \]
The secant method was adapted by Monsalve et al. [15] to estimate the solution of Equation (5), that is, the inverse of $A$, when the matrix is diagonalizable. The secant method applied to $F(X) = X^{-1} - A$ (see [15]) gives us:
\[ X_{k+1} = X_k + X_{k-1} - X_k A X_{k-1}, \quad k = 1, 2, \ldots \tag{6} \]
Now, we extend the result presented in [15] to any nonsingular matrix, not necessarily diagonalizable. If $A$ is a nonsingular complex matrix of size $n \times n$, then there exist unitary matrices $U$ and $V$, of size $n \times n$, such that
\[ A = U \Sigma V^*, \quad \Sigma = \operatorname{diag}(\sigma_1, \sigma_2, \ldots, \sigma_n), \tag{7} \]
being $\sigma_1 \ge \sigma_2 \ge \cdots \ge \sigma_n > 0$ the singular values of $A$.
Let us define $D_k = V^* X_k U$, that is, $X_k = V D_k U^*$. Then, from Equation (6),
\[ V D_{k+1} U^* = V D_k U^* + V D_{k-1} U^* - (V D_k U^*) (U \Sigma V^*) (V D_{k-1} U^*). \]
Several algebraic manipulations allow us to assure that
\[ D_{k+1} = D_k + D_{k-1} - D_k \Sigma D_{k-1}, \quad k = 1, 2, \ldots \tag{8} \]
If we choose initial estimations, $X_0$ and $X_1$, such that $D_0 = V^* X_0 U$ and $D_1 = V^* X_1 U$ are diagonal matrices, then all matrices $D_k$ are diagonal and therefore $D_k \Sigma D_{k-1} = D_{k-1} \Sigma D_k$, for all $k \ge 1$. Thus, from Equation (8), we assure that the matrix iteration decouples into $n$ scalar secant-type iterations and, from this expression, we propose the secant-type method:
\[ X_{k+1} = X_k + X_{k-1} - X_k A X_{k-1}, \quad k = 1, 2, \ldots, \tag{9} \]
being $X_0$ and $X_1$ given initial approximations.
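A minimal Matlab sketch of the scheme in Equation (9) follows, assuming initial approximations that are scalar multiples of $A^*$ (so that $V^* X_0 U$ and $V^* X_1 U$ are diagonal); the matrix, the scaling factors, and the tolerance are illustrative assumptions.

```matlab
% Secant-type method (Equation (9)): X_{k+1} = X_k + X_{k-1} - X_k A X_{k-1}.
A  = rand(50);
s  = max(eig(A*A'));                   % rho(A A^*)
Xp = A'/s;                             % X0 = A^*/rho(A A^*)  (previous iterate)
Xc = A'/(2*s);                         % X1, a second scalar multiple of A^*
for k = 1:80
    Xn = Xc + Xp - Xc*A*Xp;            % one secant-type step, no inverses
    if norm(Xn - Xc, 'fro') < 1e-10
        Xc = Xn;  break
    end
    Xp = Xc;  Xc = Xn;
end
norm(A*Xc - eye(50), 'fro')            % residual check
```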
The analysis of the convergence of the iterative method with memory in Equation (9) is presented in the following result.
Theorem 1. Let $A$ be an $n \times n$ nonsingular matrix, with singular value decomposition $A = U \Sigma V^*$. Let $X_0$ and $X_1$ be such that $V^* X_0 U$ and $V^* X_1 U$ are diagonal matrices. Then, the sequence $\{X_k\}$, obtained by Equation (9), converges to $A^{-1}$ with super-linear convergence. Proof. Let us consider $U$ and $V$ unitary matrices such that the singular value decomposition in Equation (7) is satisfied, where $\sigma_1, \sigma_2, \ldots, \sigma_n$ are the singular values of $A$.
We define $D_k = V^* X_k U$, that is, $X_k = V D_k U^*$, for $k \ge 0$. From Equation (9), we have
\[ V D_{k+1} U^* = V D_k U^* + V D_{k-1} U^* - (V D_k U^*) A (V D_{k-1} U^*), \]
then
\[ D_{k+1} = D_k + D_{k-1} - D_k \Sigma D_{k-1}, \]
and therefore all matrices $D_k$ are diagonal, where $D_k = \operatorname{diag}(d_1^{(k)}, d_2^{(k)}, \ldots, d_n^{(k)})$.
Then, component by component, we obtain
\[ d_j^{(k+1)} = d_j^{(k)} + d_j^{(k-1)} - \sigma_j d_j^{(k)} d_j^{(k-1)}, \quad j = 1, 2, \ldots, n. \tag{10} \]
By subtracting $1/\sigma_j$ from both sides of Equation (10) and denoting $e_j^{(k)} = d_j^{(k)} - 1/\sigma_j$, we get
\[ e_j^{(k+1)} = -\sigma_j\, e_j^{(k)} e_j^{(k-1)}. \tag{11} \]
From Equation (11), we conclude that, for each value of $j$ from 1 to $n$, $d_j^{(k)}$ in Equation (10) converges to $1/\sigma_j$ with order of convergence the unique positive root of $p^2 - p - 1 = 0$, that is, $p = \frac{1+\sqrt{5}}{2}$ (by using the result of Ortega–Rheinboldt mentioned in the Introduction).
Then, for each $j$, $j = 1, 2, \ldots, n$, there exists a constant $K_j > 0$ satisfying
\[ |e_j^{(k+1)}| \le K_j |e_j^{(k)}|^p, \quad p = \frac{1+\sqrt{5}}{2}, \]
and $|e_j^{(k)}|$ tends to zero when $k$ tends to infinity. Moreover,
\[ \|X_k - A^{-1}\| = \|V (D_k - \Sigma^{-1}) U^*\| = \|D_k - \Sigma^{-1}\|. \]
Therefore,
\[ \lim_{k \to \infty} \|X_k - A^{-1}\| = 0, \]
which allows us to affirm that $\{X_k\}$ converges to $A^{-1}$. □
On the other hand, Higham in [10] introduced the following definition for the stability of the iterative process $X_{k+1} = H(X_k)$, with a fixed point $\bar{X}$. If we assume that $H$ is Fréchet differentiable at $\bar{X}$, the iteration is stable in a neighborhood of $\bar{X}$ if the Fréchet derivative $H'(\bar{X})$ has bounded powers, that is, there exists a positive constant $C$ such that
\[ \|H'(\bar{X})^k\| \le C, \quad \text{for all } k \ge 1. \]
Therefore, the following result can be stated for the secant method.
Theorem 2. The secant method in Equation (9) for the estimation of the inverse matrix $A^{-1}$ is a stable iterative scheme. Proof. The proof is made by demonstrating that the Fréchet derivative of the iteration operator at the fixed point has bounded powers; in fact, its square is the zero (hence idempotent) operator.
The secant-type method, described as a fixed point scheme, can be written as
\[ (X_{k+1}, X_k) = H(X_k, X_{k-1}) = \left( X_k + X_{k-1} - X_k A X_{k-1},\; X_k \right). \]
It is easy to deduce that
\[ H'(\bar{X}, \bar{X})(E, F) = \left( E (I - A \bar{X}) + (I - \bar{X} A) F,\; E \right), \]
where $\bar{X} = A^{-1}$ is the fixed point. Then, for $\bar{X} = A^{-1}$, we have
\[ H'(\bar{X}, \bar{X})(E, F) = (0, E), \qquad H'(\bar{X}, \bar{X})^2 (E, F) = (0, 0). \]
Thus, $H'(\bar{X}, \bar{X})^2$ is an idempotent operator, the powers of $H'(\bar{X}, \bar{X})$ are bounded, and the iteration is stable. □
4. A Secant-Type Method for Approximating the Drazin Inverse
Drazin, in 1958 (see [10]), proposed a different kind of generalized inverse, in which some conditions of the Moore–Penrose inverse are replaced and the index of the matrix appears. The importance of this inverse has motivated many researchers to propose algorithms for its calculation.
It is known (see [10]) that the smallest nonnegative integer $l$ such that $\operatorname{rank}(A^{l+1}) = \operatorname{rank}(A^{l})$ is called the index of $A$, and it is denoted by $\operatorname{ind}(A)$. If $A$ is a complex matrix of size $n \times n$, the Drazin inverse of $A$, denoted by $A^D$, is the unique matrix $X$ satisfying
\[ A^{l} X A = A^{l}, \qquad X A X = X, \qquad A X = X A, \]
where $l$ is the index of $A$.
If $\operatorname{ind}(A) = 1$, then $X$ is called the g-inverse or group inverse of $A$ and, if $\operatorname{ind}(A) = 0$, then $A$ is nonsingular and $A^D = A^{-1}$. Let us observe that the idempotent matrix $A A^D$ is the projector on $R(A^{l})$ along $N(A^{l})$, where $R(A^{l})$ and $N(A^{l})$ denote the range and null space of $A^{l}$, respectively.
In [16], the following result is presented, which is used in the proof of the main result.
Proposition 1. If $P_{S,T}$ is the projector on a space $S$ along a space $T$, the following statements hold:
- (a) $P_{S,T} X = X$ if and only if $R(X) \subseteq S$.
- (b) $X P_{S,T} = X$ if and only if $N(X) \supseteq T$.
Li and Wei [1] proved that the Newton–Schulz method in Equation (1) can be used for approximating the Drazin inverse, using as initial estimation $X_0 = \alpha A^{l}$, where parameter $\alpha$ is chosen so that the condition $\|A A^D - A X_0\| < 1$ is satisfied. One way of selecting the initial matrix, used by different authors, is
\[ X_0 = \frac{1}{\operatorname{tr}(A^{l+1})} A^{l}, \]
where $\operatorname{tr}(\cdot)$ is the trace of a square matrix. Another fruitful initial matrix is any other suitably scaled multiple of $A^{l}$. Using two initial matrices of this form, $X_0 = X_1 = \beta A^{l}$, with $\beta$ a constant, we want to prove that the sequence obtained by the secant-type method in Equation (9) converges to the Drazin inverse $A^D$. In this case, we use a different type of demonstration than those used in the previous cases.
Theorem 4. Let $A$ be a square matrix with $\operatorname{ind}(A) = l$. We choose as initial estimations $X_0 = \beta A^{l}$ and $X_1 = \beta A^{l}$, with $\beta$ a constant. Then, the sequence $\{X_k\}$ generated by Equation (9) satisfies the following error equation
\[ A^D - X_{k+1} = (A^D - X_k)\, A\, (A^D - X_{k-1}). \]
Thus, $\{X_k\}$ converges to $A^D$ with order of convergence $p = \frac{1+\sqrt{5}}{2}$, that is, with super-linear convergence. Proof. Let us define $P = A A^D$ and $l = \operatorname{ind}(A)$. Then,
\[ A A^D = A^D A = P \quad \text{and} \quad P^2 = P. \]
Therefore, $P$ is the projector on $R(A^{l})$ along $N(A^{l})$. In addition, it is easy to prove that, if we choose $X_0$ and $X_1$ such that $R(X_i) \subseteq R(A^{l})$ and $N(X_i) \supseteq N(A^{l})$, $i = 0, 1$, then $R(X_k) \subseteq R(A^{l})$ and $N(X_k) \supseteq N(A^{l})$, for all $k \ge 0$.
Now, we denote by $e_k = A^D - X_k$ the error of iterate $k$. From the selection of $X_0$ and $X_1$ and by applying Proposition 1 with $P = A A^D$, we establish
\[ A A^D X_k = X_k \quad \text{and} \quad X_k A A^D = X_k, \quad \text{for all } k \ge 0. \]
Thus,
\[ e_{k+1} = e_k A e_{k-1}. \]
From this identity, there exists $K > 0$ such that
\[ \|e_{k+1}\| \le K \|e_k\| \|e_{k-1}\|. \]
Thus, if the initial errors are small enough, $\|e_k\|$ tends to zero and therefore $X_k$ tends to $A^D$.
On the other hand,
\[ e_k A e_{k-1} = (A^D - X_k) A (A^D - X_{k-1}) = A^D A A^D - A^D A X_{k-1} - X_k A A^D + X_k A X_{k-1}. \]
Now, we analyze the term $A^D A X_{k-1}$:
\[ A^D A X_{k-1} = A A^D X_{k-1} = X_{k-1}, \]
but
\[ X_k A A^D = X_k. \]
In the last equalities, we use that $A^D A = A A^D$; in fact, $R(X_{k-1}) \subseteq R(A^{l})$ and $N(X_k) \supseteq N(A^{l})$, so Proposition 1 applies. In addition, $A^D A A^D = A^D$ and $X_{k+1} = X_k + X_{k-1} - X_k A X_{k-1}$. Therefore,
\[ e_k A e_{k-1} = A^D - X_{k-1} - X_k + X_k A X_{k-1} = A^D - X_{k+1} = e_{k+1}. \]
Finally, by applying the theorem of convergence for iterative methods with memory, as mentioned in the Introduction, we assure that the order of convergence of the secant-type method is the unique positive root of $p^2 - p - 1 = 0$, that is, $p = \frac{1+\sqrt{5}}{2}$. □
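As an illustration of Theorem 4, the following Matlab sketch applies the scheme in Equation (9) to a small singular matrix; the test matrix, the rank-based index computation, and the choice $\beta = 1/\operatorname{tr}(A^{l+1})$ are our own assumptions for demonstration purposes, not the matrices of the experiments below.

```matlab
% Secant-type scheme (9) for the Drazin inverse, with X0 = X1 = beta*A^l.
A = [1 1 0; 0 1 0; 0 0 0];             % assumed singular test matrix, ind(A) = 1
l = 0;
while rank(A^(l+1)) ~= rank(A^l)       % smallest l with rank(A^{l+1}) = rank(A^l)
    l = l + 1;
end
beta = 1/trace(A^(l+1));               % scaling discussed above
Xp = beta*A^l;  Xc = Xp;               % X0 = X1 = beta*A^l
for k = 1:60
    Xn = Xc + Xp - Xc*A*Xp;            % secant-type step
    if norm(Xn - Xc, 'fro') < 1e-12, Xc = Xn; break; end
    Xp = Xc;  Xc = Xn;
end
% Check the three Drazin conditions: A^l*X*A = A^l, X*A*X = X, A*X = X*A
norm(A^l*Xc*A - A^l) + norm(Xc*A*Xc - Xc) + norm(A*Xc - Xc*A)
```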
5. Numerical Experiments
In this section, we check the behavior of the secant method for the calculation of the inverse, the Moore–Penrose inverse, and the Drazin inverse of different test matrices $A$, comparing it with the Newton–Schulz scheme in Equation (1). Numerical computations were carried out in Matlab R2018b (MathWorks, Natick, MA, USA) with a processor Intel(R) Xeon(R) CPU E5-2420 v2 at 2.20 GHz. As stopping criterion, we used
\[ \|X_{k+1} - X_k\| < \varepsilon \quad \text{or} \quad \|A X_{k+1} - I\| < \varepsilon, \]
for a prescribed tolerance $\varepsilon$.
To numerically check the theoretical results, Jay [17] introduced the computational order of convergence (COC), defined as
\[ \mathrm{COC} \approx \frac{\ln\left( \|X_{k+1} - A^{-1}\| / \|X_k - A^{-1}\| \right)}{\ln\left( \|X_k - A^{-1}\| / \|X_{k-1} - A^{-1}\| \right)}. \]
In a similar way, the authors of [18] presented another numerical approximation of the theoretical order, denoted by ACOC and defined as
\[ \mathrm{ACOC} \approx \frac{\ln\left( \|X_{k+1} - X_k\| / \|X_k - X_{k-1}\| \right)}{\ln\left( \|X_k - X_{k-1}\| / \|X_{k-1} - X_{k-2}\| \right)}. \]
We use either of these computational order estimates to show the accuracy of these approximations on the proposed method. When the vector of COC (or ACOC) values is not stable, we write "-" in the corresponding table.
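For reference, ACOC can be estimated from the last four iterates as in the following sketch (the function and variable names are ours):

```matlab
% ACOC estimate from four consecutive iterates X0, X1, X2, X3
% (that is, X_{k-2}, X_{k-1}, X_k, X_{k+1}), using Frobenius norms.
function p = acoc(X0, X1, X2, X3)
    p = log(norm(X3 - X2,'fro')/norm(X2 - X1,'fro')) / ...
        log(norm(X2 - X1,'fro')/norm(X1 - X0,'fro'));
end
```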
Example 1. In this example, matrix A is a random matrix of size n × n, for different values of n. The initial estimation used for the Newton–Schulz scheme is a scalar multiple of $A^*$, as described in the Introduction, and the two initial matrices of the secant method are chosen in the same way.
In Table 1, we show the results obtained by the Newton–Schulz and secant-type methods for the different random matrices: the number of iterations, the residuals, and the value of COC. The results are in concordance with the order of convergence of each scheme. All generated random matrices are nonsingular and both methods give us an approximation of the inverse of A. The Newton–Schulz method needs a lower number of iterations than the secant scheme, as expected, since the former is quadratic and the latter super-linear.
Example 2. In this example, matrix A is a random rectangular matrix of size m × n, for different values of m and n. The initial matrices are calculated in the same way as in the previous example.
In Table 2, we show the results obtained by the Newton–Schulz and secant-type methods for the different random matrices: the number of iterations, the residuals, and the value of ACOC. The results are in concordance with the order of convergence of each scheme, despite the matrices being non-square. Both methods give us an approximation of the Moore–Penrose inverse of A.
Example 3. In this example, we want to analyze the performance of the secant method for computing the Drazin inverse of a singular square matrix A with known index, whose Drazin inverse is available in closed form.
By using the initial matrix $X_0 = \frac{1}{\operatorname{tr}(A^{l+1})} A^{l}$ and the same stopping criterion as in the previous examples, the Newton–Schulz method gives us the following information:
On the other hand, the secant method is used with two equal initial matrices of the same form, obtaining:
Example 4. This is another example for computing the Drazin inverse, now of a singular square matrix B (see [1]) with known index, whose Drazin inverse is also available in closed form.
By using the initial matrix $X_0 = \frac{1}{\operatorname{tr}(B^{l+1})} B^{l}$ and the same stopping criterion as in the previous examples, the Newton–Schulz method gives us the following information:
On the other hand, the secant method is used with two equal initial matrices of the same form, obtaining:
Again, the numerical tests confirm the theoretical results.
Example 5. Finally, in this example, we test the Newton–Schulz and secant methods on several known square matrices of size n × n, constructed by using different Matlab functions (a possible construction is sketched after the list). Specifically, the test matrices are:
- (a) A Hankel matrix of size n × n.
- (b) A Toeplitz matrix of size n × n.
- (c) A symmetric and positive definite matrix of size n × n.
- (d) A Leslie matrix of size n × n. This type of matrix appears in population model problems.
- (e) An ill-conditioned matrix of size n × n, with a very large condition number.
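One possible Matlab construction of matrices in these five families is sketched below; the concrete generator calls and the size are our own assumptions, since the text only names the families.

```matlab
% Illustrative generators for the five families of test matrices.
n  = 100;
Ma = hankel(1:n);                      % (a) Hankel matrix
Mb = toeplitz(1:n);                    % (b) symmetric Toeplitz matrix
Mc = gallery('minij', n);              % (c) SPD matrix with entries min(i,j)
Md = gallery('leslie', n);             % (d) Leslie population-model matrix
Me = hilb(n);                          % (e) ill-conditioned Hilbert matrix; for
                                       %     large n it is numerically singular
                                       %     in double precision
```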
By using the same stopping criterion as in the previous examples and initial matrices $X_0$ and $X_1$ chosen as scalar multiples of $A^*$, we obtain the numerical results that appear in Table 3. In these cases, as in the previous ones, the proposed method shows good performance in terms of stability, precision, and number of iterations needed. We must take into account that both schemes have different orders of convergence, which is displayed in Table 3.