Some New Algebraic Method Developments in the Characterization of Matrix Equalities

Tian, Yongge

doi:10.3390/axioms13100657

Open AccessArticle

Some New Algebraic Method Developments in the Characterization of Matrix Equalities

by

Yongge Tian

College of Business and Economics, Shanghai Business School, Shanghai 201400, China

Axioms 2024, 13(10), 657; https://doi.org/10.3390/axioms13100657

Submission received: 30 July 2024 / Revised: 19 September 2024 / Accepted: 19 September 2024 / Published: 24 September 2024

(This article belongs to the Special Issue Advances in Linear Algebra with Applications)

Download Versions Notes

Abstract

:

Algebraic expressions and equalities can be constructed arbitrarily in a given algebraic framework according to the operational rules provided, and thus it is a prominent and necessary task in mathematics and applications to construct, classify, and characterize various simple general algebraic expressions and equalities. As an update to this prominent topic in matrix algebra, this article reviews and improves upon the well-known block matrix methodology and matrix rank methodology in the construction and characterization of matrix equalities. We present a collection of fundamental and useful formulas for calculating the ranks of a wide range of block matrices and then derive from these rank formulas various valuable consequences. In particular, we present several groups of equivalent conditions in the characterizations of the Hermitian matrix, the skew-Hermitian matrix, the normal matrix, etc.

Keywords:

block matrix; Hermitian matrix; generalized inverse; matrix equality; normal matrix; range; rank

MSC:

15A09; 15A24

1. Introduction

Throughout this article, the symbol

C^{m \times n}

denotes the collection of all

m \times n

matrices over the field of complex numbers,

A^{*}

denotes the conjugate transpose of

A \in C^{m \times n}

,

r (A)

denotes the rank of

A \in C^{m \times n}

, i.e., the maximum order of the invertible submatrix of A, and

R (A) = {A x | x \in C^{n}}

denotes the range (column space) of a matrix

A \in C^{m \times n}

. A matrix

A \in C^{m \times m}

is said to be Hermitian if and only if

A = A^{*}

; to be skew-Hermitian if and only if

A = - A^{*}

; to be normal if and only if

A A^{*} = A^{*} A

; to be EP if and only if

R (A) = R (A^{*})

. The Moore–Penrose generalized inverse of

A \in C^{m \times n}

, denoted by

A^{†}

, is the unique matrix

X \in C^{n \times m}

satisfying the four Penrose equations:

A X A = A, X A X = X, {(A X)}^{*} = A X, {(X A)}^{*} = X A .

(1)

A square matrix

A \in C^{m \times m}

is said to be group invertible if and only if there exists an

X \in C^{m \times m}

satisfying the following three matrix equations:

A X A = A, X A X = X, A X = X A .

(2)

The matrix X satisfying the three equations, called the group inverse of A, is unique and is denoted by

X = A^{#}

. As we know, the theory of generalized inverses of matrices, as ubiquitous tool to deal with singular matrices, has been widely utilized to approach various complicated theoretical and computational problems in mathematics and applications. The most important fact is that we can use generalized inverses of matrices in the case when ordinary inverses of matrices do not exist in order to deal with general matrix equations. As shown above, generalized inverses can be defined to be common solutions to one or more algebraic equations as certain extensions of the ordinary inverses of matrices. In comparison, the Moore–Penrose inverse and the group inverse of a matrix are two highly recognized generalized inverses, which are known to have many remarkable algebraic and computational properties, and thus they have been widely studied in mathematics and other applications. For more detailed information regarding generalized inverses of matrices, we refer the reader to [1,2,3].

Let us start with the introduction of the problem considered in this article. Given two matrices

A = (a_{i j})

and

B = (b_{i j})

of the same size, the matrix equality

A = B

is directly defined by the fact that

a_{i j} = b_{i j}

holds for all elements in A and B. In matrix theory and applications, we may encounter various problems with solving a simple or general matrix equation

f (A, B) = g (A, B)

, where

f (\cdot)

and

g (\cdot)

are certain algebraic operations of two given matrices A and B of appropriate size. One of the specified cases related to this type of problem can be formulated as:

\begin{matrix} f (A, B) = 0 \Leftrightarrow A = B; \end{matrix}

(3)

namely,

A = B

is the unique solution to the matrix equation on the left-hand side. Due to the arbitrariness of the matrix expression

f (A, B)

, there does not exist a general and significant method to construct matrix equations such as (3) or to find their solutions, except in some special cases (cf. [4,5]).

Increasingly complex research on algebraic expressions and equalities has created a need for sophisticated methods that go beyond conventional operations of elements in a given algebraic framework. The aim of this article is to review some well-known matrix rank equalities in the theory of generalized inverses and to establish a diversity of new expansion formulas for calculating ranks of many specified block matrices. We also provide some novel and insightful methods for constructing and characterizing matrix equalities and use them in the study of problems such as (3).

The article is organized as follows. In Section 2, we introduce some established useful formulas, results, and facts regarding ranks, ranges, and generalized inverses of matrices in linear algebra and matrix theory. In Section 3 and Section 4, we give a series of well-known or novel concrete block matrices, calculate their ranks, and derive various consequences and applications from the rank formulas, including the characterization of the Hermitian/skew-Hermitian matrix, normal matrix, etc. Conclusions are given in Section 5.

2. Preliminary Results

In this section, we recall some basic or well-known simple general formulas, results, and facts related to ranks, ranges, and generalized inverses of matrices in linear algebra and matrix theory that we need to use in the remainder of the paper. We first describe the simple but intrinsic idea of studying matrix equalities that have been frequently used by algebraists in the past several decades. Recall that the rank of a matrix, or the dimension of the subspaces generated by the row or column of a matrix, is one of the most fundamental concepts in linear algebra. It is straightforward to know from the definition of the rank of matrix that

A = 0

if and only if

r (A) = 0

. As a consequence, we see that the matrix equality

A = B

holds if and only if

r (A - B) = 0

holds for two given matrices of the same size. In order to use this proposed assertion in the characterization of the matrix equalities in this paper, we need to use the following well-known results on matrix rank equalities and their consequences.

Lemma 1

([6]). Let

A \in C^{m \times n}, B \in C^{m \times k},

C \in C^{l \times n},

and

D \in C^{l \times k} .

Then, the following rank equalities hold:

\begin{matrix} r [A, B] & = r (A) + r (B - A A^{†} B) = r (B) + r (A - B B^{†} A), \end{matrix}

(4)

\begin{matrix} r [\begin{matrix} A & B \\ C & D \end{matrix}] & = r [\begin{matrix} A & B - A A^{†} B \\ C - C A^{†} A & D - C A^{†} B \end{matrix}] = r (A) + r [\begin{matrix} 0 & B - A A^{†} B \\ C - C A^{†} A & D - C A^{†} B \end{matrix}], \end{matrix}

(5)

and the following facts hold:

\begin{matrix} r [A, B] = r (A) = r (B) \Leftrightarrow R (A) = R (B) \Leftrightarrow A A^{†} = B B^{†}, \end{matrix}

(6)

\begin{matrix} r [A, B] = r (A) + r (B) \Leftrightarrow R (A) \cap R (B) = {0}, \end{matrix}

(7)

\begin{matrix} r [\begin{matrix} A & B \\ C & D \end{matrix}] = r (A) \Leftrightarrow A A^{†} B = B, C A^{†} A = C a n d C A^{†} B = D . \end{matrix}

(8)

In particular, if

R (A) \supseteq R (B)

and

R (A^{*}) \supseteq R (C^{*}),

then

\begin{matrix} r [\begin{matrix} A & B \\ C & D \end{matrix}] - r (A) = r (D - C A^{†} B), \end{matrix}

(9)

and the following fact holds:

\begin{matrix} C A^{†} B = D \Leftrightarrow r [\begin{matrix} A & B \\ C & D \end{matrix}] = r (A) . \end{matrix}

(10)

Lemma 2

([7]). Let

A \in C^{m \times n}, B \in C^{m \times k},

C \in C^{l \times n},

and

D \in C^{l \times k} .

Then, the following rank equality holds:

r (D - C A^{†} B) = r [\begin{matrix} A^{*} A A^{*} & A^{*} B \\ C A^{*} & D \end{matrix}] - r (A),

(11)

and following fact holds:

C A^{†} B = D \Leftrightarrow r [\begin{matrix} A^{*} A A^{*} & A^{*} B \\ C A^{*} & D \end{matrix}] = r (A) .

The facts and results in the following three lemmas are obvious or well-known in matrix theory (cf. [1,8]).

Lemma 3.

Let

A_{1} \in C^{m \times n_{1}},

B_{1} \in C^{m \times p_{1}},

A_{2} \in C^{m \times n_{2}},

and

B_{2} \in C^{m \times p_{2}} .

If

R (A_{1}) = R (A_{2})

and

R (B_{1}) = R (B_{2}),

then

r [A_{1}, B_{1}] = r [A_{2}, B_{2}]

holds.

Lemma 4.

Let

A \in C^{m \times n}

and

B \in C^{m \times p} .

Then, the following three statements are equivalent:

(a): The matrix equation $A X = B$ is solvable for $X .$
(b): $R (A) \supseteq R (B) .$
(c): $r [A, B] = r (A) .$

Lemma 5.

The following results hold:

(I): Let $A, B \in C^{m \times m}$ be two normal matrices of the same size. Then, A and B can simultaneously be unitarily diagonalizable if and only if $A B = B A .$
(II): $R (A) = R (A^{*}) \Leftrightarrow A A^{†} = A^{†} A$ .
(III): $r (A^{2}) = r (A) \Leftrightarrow A^{k} {(A^{k})}^{†} = A A^{†} \Leftrightarrow {(A^{k})}^{†} A^{k} = A^{†} A$ for $k \geq 2 .$

3. How to Establish Equalities for Ranks of Block Matrices

As described at the beginning of Section 2, the usefulness of matrix rank formulas in the characterization of matrix equalities has been noted by some earlier mathematicians. In particular, in an earlier paper [6], a wide variety of fundamental matrix rank equalities were systematically established, and various remarkable applications of the matrix rank methodology were proposed and established in the characterizations of matrix equalities composed of generalized inverses. In the past several decades, the original work developed in the seminal paper [6] has been viewed as one of the essential contributions to the current development of matrix theory and applications, and indeed, some important and valuable studies have been done in this domain. As a fruitful investigation in this respect, we intend to establish in this section a wide range of simple and useful formulas for calculating the ranks of certain specified partitioned matrices and present their essential consequences in characterizing a series of basic matrix equalities.

Theorem 1.

Let

A, B \in C^{m \times n} .

Then, the following rank equalities hold:

\begin{matrix} r (A - A B^{†} A) & = r [\begin{matrix} B^{*} B B^{*} & B^{*} A \\ A B^{*} & A \end{matrix}] - r (B) \\ = r (B^{*} A B^{*} - B^{*} B B^{*}) + r (A) - r (B) . \end{matrix}

(12)

If

R (A) \supseteq R (B)

and

R (A^{*}) \supseteq R (B^{*}),

then

r (A - A B^{†} A) = r [\begin{matrix} B & A \\ A & A \end{matrix}] - r (B) = r (A - B) + r (A) - r (B) .

(13)

If

R (A) = R (B)

and

R (A^{*}) = R (B^{*}),

then

\begin{matrix} r (A - A B^{†} A) & = r (B - B A^{†} B) = r (A - B), \end{matrix}

(14)

\begin{matrix} r (A^{†} - A^{†} B A^{†}) & = r (B^{†} - B^{†} A B^{†}) = r (A^{†} - B^{†}) . \end{matrix}

(15)

In particular, for

A \in C^{m \times m},

the following rank equality holds:

\begin{matrix} r [\begin{matrix} A & A {(A^{†})}^{*} \\ {(A^{†})}^{*} A & {(A^{†})}^{*} \end{matrix}] & = r (A) + r ({(A^{†})}^{*} - {(A^{†})}^{*} A {(A^{†})}^{*}) \\ = r (A) + r (A - A {(A^{†})}^{*} A), \end{matrix}

(16)

\begin{matrix} r (A - A {(A^{†})}^{*} A) & = r (A^{3} - A A^{*} A) . \end{matrix}

(17)

Hence,

A {(A^{†})}^{*} A = A \Leftrightarrow {(A^{†})}^{*} A {(A^{†})}^{*} = {(A^{†})}^{*} \Leftrightarrow A^{3} = A A^{*} A .

Proof.

The first equality in (12) directly follows from (11). Consequently, simplifying it leads to the second equality in (12). Equation (13) directly follows from (9). Note that

R (A) = R (B)

and

R (A^{*}) = R (B^{*})

are equivalent to

A A^{†} = B B^{†}

,

A^{†} A = B^{†} B

, and

r (A) = r (B)

by (6). Thus, (14) and (15) follow from (13). Equation (16) follows from (9). Equation (17) follows from (12) and from the simple fact

r ({(A^{†})}^{*}) = r (A^{†}) = (A)

. □

Theorem 2.

Let

A \in C^{m \times n}, B \in C^{m \times k},

C \in C^{l \times n},

and

D \in C^{l \times k} .

Then, the following facts hold:

(a): $R [\begin{matrix} A & B \\ C & D \end{matrix}] = R [\begin{matrix} A & 0 \\ 0 & D \end{matrix}] = R [\begin{matrix} 0 & B \\ C & 0 \end{matrix}] \Leftrightarrow R [\begin{matrix} A^{*} \\ B^{*} \end{matrix}] \cap R [\begin{matrix} C^{*} \\ D^{*} \end{matrix}] = {0},$ $R (A) = R (B)$ , and $R (C) = R (D) .$
(b): $r [\begin{matrix} A & B \\ C & D \end{matrix}] = r [\begin{matrix} A \\ C \end{matrix}] = r [\begin{matrix} B \\ D \end{matrix}] = r [A, B] = r [C, D] \Leftrightarrow r [\begin{matrix} A & B \\ C & D \end{matrix}] = r (A) = r (B) = r (C) = r (D) .$
(c): If $r [\begin{matrix} A & B \\ C & D \end{matrix}] = r (A) = r (B) = r (C)$ , then $r [\begin{matrix} A & B \\ C & D \end{matrix}] = r (D) .$

Proof.

It is easily seen that the first range equalities in Result (a) imply:

\begin{matrix} r [\begin{matrix} A & B \\ C & D \end{matrix}] & = r (A) + r (D) = r (B) + r (C), \end{matrix}

(18)

\begin{matrix} r [\begin{matrix} A & B \\ C & D \end{matrix}] & = r [\begin{matrix} A & 0 & 0 & B \\ 0 & D & C & 0 \end{matrix}] = r [A, B] + r [C, D], \end{matrix}

(19)

where (19) implies

R [\begin{matrix} A^{*} \\ B^{*} \end{matrix}] \cap R [\begin{matrix} C^{*} \\ D^{*} \end{matrix}] = {0}

by (7). Combining (18) and (19) leads to:

r [A, B] + r [C, D] = r (A) + r (D) = r (B) + r (C),

which further implies that:

r [A, B] = r (A) = r (B), r [C, D] = r (C) = r (D),

by noticing that

r [A, B] \geq r (A)

,

r [A, B] \geq r (B)

,

r [C, D] \geq r (C)

, and

r [C, D] \geq r (D)

. Therefore, the last two range equalities in Result (a) hold. Conversely, by (7), the three range equalities on the right-hand side of Result (a) imply that:

r [\begin{matrix} A & B \\ C & D \end{matrix}] = r [A, B] + r [C, D] = r (A) + r (D) = r (B) + r (C) .

Correspondingly,

r [\begin{matrix} A & B & A & 0 \\ C & D & 0 & D \end{matrix}] = r [\begin{matrix} A & 0 \\ 0 & D \end{matrix}] = r (A) + r (D) = r [\begin{matrix} A & B \\ C & D \end{matrix}],

r [\begin{matrix} A & B & 0 & B \\ C & D & C & 0 \end{matrix}] = r [\begin{matrix} 0 & B \\ C & 0 \end{matrix}] = r (B) + r (C) = r [\begin{matrix} A & B \\ C & D \end{matrix}] .

These rank equalities imply that the first two range equalities in Result (a) hold by (6).

The first four rank equalities in Result (b) imply:

R [\begin{matrix} A \\ C \end{matrix}] = R [\begin{matrix} B \\ D \end{matrix}], R [\begin{matrix} A^{*} \\ B^{*} \end{matrix}] = R [\begin{matrix} C^{*} \\ D^{*} \end{matrix}]

by (6), and further imply

R (A) = R (B)

,

R (C) = R (D)

,

R (A^{*}) = R (C^{*})

, and

R (B^{*}) = R (D^{*})

. In this case, the first four rank equalities are reduced to the rank equalities on the right-hand side of Result (b). Conversely, combining the following simple rank inequalities:

\begin{matrix} r [\begin{matrix} A & B \\ C & D \end{matrix}] \geq r [A, B] \geq r (A), r [\begin{matrix} A & B \\ C & D \end{matrix}] \geq r [A, B] \geq r (B), \\ r [\begin{matrix} A & B \\ C & D \end{matrix}] \geq r [C, D] \geq r (C), r [\begin{matrix} A & B \\ C & D \end{matrix}] \geq r [C, D] \geq r (D), \\ r [\begin{matrix} A & B \\ C & D \end{matrix}] \geq r [\begin{matrix} A \\ C \end{matrix}] \geq r (A), r [\begin{matrix} A & B \\ C & D \end{matrix}] \geq r [\begin{matrix} A \\ C \end{matrix}] \geq r (C), \\ r [\begin{matrix} A & B \\ C & D \end{matrix}] \geq r [\begin{matrix} B \\ D \end{matrix}] \geq r (B), r [\begin{matrix} A & B \\ C & D \end{matrix}] \geq r [\begin{matrix} B \\ D \end{matrix}] \geq r (D) \end{matrix}

with the rank equalities on the right-hand side of Result (b) yields the first four rank equalities on the left-hand side of Result (b). Result (c) is a direct consequence of Result (b). □

As we know, constructive methodology in the field of mathematics is an effective strategy when solving certain specified mathematical proof problems. This method is particularly suitable for approaching the given topics that are difficult to deal with by means of conventional analysis methods. The methodology usually requires observing, analyzing, and understanding research objects from new perspectives and viewpoints based on the characteristics and properties of the problem conditions and conclusions. By firmly grasping the intrinsic connection between the conditions and conclusions that reflect the given problem, utilizing the characteristics of the problem and known mathematical relationships and theories as tools, a mathematical object that satisfies the conditions or conclusions is then constructed properly. In this way, the implicit relationships and properties in the original problem are clearly presented in the newly constructed mathematical object, making it convenient and efficient to solve mathematical problems.

In what follows, we construct a selection of block matrices, calculate their ranks, and derive various consequences from the rank formulas.

Theorem 3.

Let

A, B \in C^{m \times n} .

Then, the following rank equalities hold:

\begin{matrix} r [\begin{matrix} A & B \\ B & A \end{matrix}] & = r (A + B) + r (A - B), \end{matrix}

(20)

\begin{matrix} r [\begin{matrix} A & - B \\ B & A \end{matrix}] & = r (A + i B) + r (A - i B), \end{matrix}

(21)

\begin{matrix} r [\begin{matrix} A + B & A - B \\ A - B & A + B \end{matrix}] & = r (A) + r (B), \end{matrix}

(22)

\begin{matrix} r [\begin{matrix} A + B & B - A \\ A - B & A + B \end{matrix}] & = r ((1 + i) A + (1 - i) B) + r ((1 - i) A + (1 + i) B), \end{matrix}

(23)

where

i = \sqrt{- 1} .

Proof.

Equation (20) follows from the following block matrix decomposition equality:

\begin{matrix} [\begin{matrix} I_{m} & I_{m} \\ I_{m} & - I_{m} \end{matrix}] [\begin{matrix} A & B \\ B & A \end{matrix}] [\begin{matrix} I_{n} & I_{n} \\ I_{n} & - I_{n} \end{matrix}] = 2 [\begin{matrix} A + B & 0 \\ 0 & A - B \end{matrix}] \end{matrix}

(24)

and the fact that the block matrices

[\begin{matrix} I_{m} & I_{m} \\ I_{m} & - I_{m} \end{matrix}]

and

[\begin{matrix} I_{n} & I_{n} \\ I_{n} & - I_{n} \end{matrix}]

are obviously nonsingular. Replacing B in (20) with

i B

and noting that

r [\begin{matrix} A & i B \\ i B & A \end{matrix}] = r [\begin{matrix} A & i^{2} B \\ B & A \end{matrix}] = r [\begin{matrix} A & - B \\ B & A \end{matrix}]

leads to (21). Replacing A and B in (20) and (21) with

A + B

and

A - B

and simplifying leads to (22) and (23), respectively. □

Observe that the construction of matrix equality in (24) is simple and explicit so that the reader can properly understand the occurrence of the two basic rank expansion formulas in (20)–(23) and their consequences. Obviously, (20)–(23) reveal certain essential links between the two specified block matrices and the operations of their sub-matrices so that we are able to derive from them a series of possible new matrix rank equalities by choosing A and B as certain specified forms. As some illustrative examples, we give below diverse expansion formulas for calculating the ranks of several concrete block matrices.

Corollary 1.

Let

A, B \in C^{m \times m}

and let integer

k \geq 2 .

Then, the following rank equalities hold:

\begin{matrix} r ({[\begin{matrix} A & B \\ B & A \end{matrix}]}^{k}) & = r ({(A + B)}^{k}) + r ({(A - B)}^{k}), \end{matrix}

(25)

\begin{matrix} r ({[\begin{matrix} A & - B \\ B & A \end{matrix}]}^{k}) & = r ({(A + i B)}^{k}) + r ({(A - i B)}^{k}), \end{matrix}

(26)

\begin{matrix} r ({[\begin{matrix} A + B & A - B \\ A - B & A + B \end{matrix}]}^{k}) & = r (A^{k}) + r (B^{k}), \end{matrix}

(27)

\begin{matrix} r ({[\begin{matrix} A + B & B - A \\ A - B & A + B \end{matrix}]}^{k}) & = r ({((1 + i) A + (1 - i) B)}^{k}) + r ({((1 - i) A + (1 + i) B)}^{k}) . \end{matrix}

(28)

Proof.

Follows from expanding the left-hand sides of the matrices in (25)–(28) and applying (20)–(23). □

Theorem 4.

Let

A, B \in C^{m \times n}

, and

C, D \in C^{n \times p} .

Then, the following rank equalities hold:

\begin{matrix} r ([\begin{matrix} A & B \\ B & A \end{matrix}] [\begin{matrix} C & D \\ D & C \end{matrix}]) & = r ((A + B) (C + D)) + r ((A - B) (C - D)), \end{matrix}

(29)

\begin{matrix} r ([\begin{matrix} A & - B \\ B & A \end{matrix}] [\begin{matrix} C & - D \\ D & C \end{matrix}]) & = r ((A + i B) (C + i D)) + r ((A - i B) (C - i D)) . \end{matrix}

(30)

Proof.

Follows from expanding the left-hand sides of the matrices in (29) and (30) and applying (20) and (21). □

Theorem 5.

Let

A \in C^{m \times n}, B \in C^{m \times k},

C \in C^{l \times n},

and

D \in C^{l \times k} .

Then, the following rank equalities hold:

\begin{matrix} r [\begin{matrix} A & B & A & 0 \\ C & D & 0 & 0 \\ A & 0 & A & B \\ 0 & 0 & C & D \end{matrix}] & = r [\begin{matrix} 2 A & B \\ C & D \end{matrix}] + r [\begin{matrix} 0 & B \\ C & D \end{matrix}], \end{matrix}

(31)

\begin{matrix} r [\begin{matrix} A & B & 0 & 0 \\ C & D & 0 & D \\ A & 0 & A & B \\ 0 & D & C & D \end{matrix}] & = r [\begin{matrix} A & B \\ C & 2 D \end{matrix}] + r [\begin{matrix} A & B \\ C & 0 \end{matrix}], \end{matrix}

(32)

\begin{matrix} r [\begin{matrix} A & B & A & 0 \\ C & D & 0 & D \\ A & 0 & A & B \\ 0 & D & C & D \end{matrix}] & = r [\begin{matrix} 2 A & B \\ C & 2 D \end{matrix}] + r (B) + r (C), \end{matrix}

(33)

\begin{matrix} r [\begin{matrix} A & B & - A & 0 \\ C & D & 0 & D \\ - A & 0 & A & B \\ 0 & D & C & D \end{matrix}] & = r [\begin{matrix} A & B \\ C & 0 \end{matrix}] + r [\begin{matrix} 0 & B \\ C & D \end{matrix}], \end{matrix}

(34)

\begin{matrix} r [\begin{matrix} A & B & 0 & B \\ C & D & C & 0 \\ 0 & B & A & B \\ C & 0 & C & D \end{matrix}] & = r [\begin{matrix} A & 2 B \\ 2 C & D \end{matrix}] + r (A) + r (D), \end{matrix}

(35)

\begin{matrix} r [\begin{matrix} A & B & 0 & - B \\ C & D & C & 0 \\ 0 & - B & A & B \\ C & 0 & C & D \end{matrix}] & = r [\begin{matrix} A & 0 \\ C & D \end{matrix}] + r [\begin{matrix} A & B \\ 0 & D \end{matrix}], \end{matrix}

(36)

\begin{matrix} r [\begin{matrix} A & B & 0 & B \\ C & D & C & D \\ 0 & B & A & B \\ C & D & C & D \end{matrix}] & = r [\begin{matrix} A & 2 B \\ 2 C & 2 D \end{matrix}] + r (A), \end{matrix}

(37)

\begin{matrix} r [\begin{matrix} A & B & A & B \\ C & D & C & 0 \\ A & B & A & B \\ C & 0 & C & D \end{matrix}] & = r [\begin{matrix} 2 A & 2 B \\ 2 C & D \end{matrix}] + r (D), \end{matrix}

(38)

\begin{matrix} r [\begin{matrix} 2 A & B & A & B \\ C & 2 D & C & D \\ A & B & 2 A & B \\ C & D & C & 2 D \end{matrix}] & = r [\begin{matrix} 3 A & 2 B \\ 2 C & 3 D \end{matrix}] + r (A) + r (D), \end{matrix}

(39)

\begin{matrix} r [\begin{matrix} 2 A & B & A & 2 B \\ C & 2 D & 2 C & D \\ A & 2 B & 2 A & B \\ 2 C & D & C & 2 D \end{matrix}] & = 2 r [\begin{matrix} A & B \\ C & D \end{matrix}], \end{matrix}

(40)

\begin{matrix} r [\begin{matrix} A & B & - A & B \\ C & D & C & D \\ - A & B & A & B \\ C & D & C & D \end{matrix}] & = r [\begin{matrix} 0 & B \\ C & D \end{matrix}] + r (A), \end{matrix}

(41)

\begin{matrix} r [\begin{matrix} A & B & - A & B \\ C & D & C & - D \\ - A & B & A & B \\ C & - D & C & D \end{matrix}] & = r (A) + r (B) + r (C) + r (D), \end{matrix}

(42)

\begin{matrix} r [\begin{matrix} 3 A & B & - A & B \\ C & 3 D & C & - D \\ - A & B & 3 A & B \\ C & - D & C & 3 D \end{matrix}] & = r [\begin{matrix} A & B \\ C & D \end{matrix}] + r (A) + r (D), \end{matrix}

(43)

\begin{matrix} r [\begin{matrix} A & B & - A & 0 \\ C & D & 0 & 0 \\ A & 0 & A & B \\ 0 & 0 & C & D \end{matrix}] & = r [\begin{matrix} (1 + i) A & B \\ C & D \end{matrix}] + r [\begin{matrix} (1 - i) A & B \\ C & D \end{matrix}], \end{matrix}

(44)

\begin{matrix} r [\begin{matrix} A & B & 0 & 0 \\ C & D & 0 & - D \\ 0 & 0 & A & B \\ 0 & D & C & D \end{matrix}] & = r [\begin{matrix} A & B \\ C & (1 + i) D \end{matrix}] + r [\begin{matrix} A & B \\ C & (1 - i) D \end{matrix}], \end{matrix}

(45)

\begin{matrix} r [\begin{matrix} A & B & - A & 0 \\ C & D & 0 & - D \\ A & 0 & A & B \\ 0 & D & C & D \end{matrix}] & = r [\begin{matrix} (1 + i) A & B \\ C & (1 + i) D \end{matrix}] + r [\begin{matrix} (1 - i) A & B \\ C & (1 - i) D \end{matrix}], \end{matrix}

(46)

\begin{matrix} r [\begin{matrix} A & B & 0 & - B \\ C & D & - C & 0 \\ 0 & B & A & B \\ C & 0 & C & D \end{matrix}] & = r [\begin{matrix} A & (1 + i) B \\ (1 + i) C & D \end{matrix}] + r [\begin{matrix} A & (1 - i) B \\ (1 - i) C & D \end{matrix}], \end{matrix}

(47)

\begin{matrix} r [\begin{matrix} A & B & 0 & - B \\ C & D & - C & - D \\ 0 & B & A & B \\ C & D & C & D \end{matrix}] & = r [\begin{matrix} A & (1 + i) B \\ C & D \end{matrix}] + r [\begin{matrix} A & (1 - i) B \\ C & D \end{matrix}], \end{matrix}

(48)

\begin{matrix} r [\begin{matrix} A & B & - A & - B \\ C & D & - C & 0 \\ A & B & A & B \\ C & 0 & C & D \end{matrix}] & = r [\begin{matrix} A & B \\ (1 + i) C & D \end{matrix}] + r [\begin{matrix} A & B \\ (1 - i) C & D \end{matrix}], \end{matrix}

(49)

\begin{matrix} r [\begin{matrix} A & B & - A & B \\ C & D & C & - D \\ A & - B & A & B \\ - C & D & C & D \end{matrix}] & = r [\begin{matrix} (1 + i) A & (1 - i) B \\ (1 - i) C & (1 + i) D \end{matrix}] + r [\begin{matrix} (1 - i) A & (1 + i) B \\ (1 + i) C & (1 - i) D \end{matrix}], \end{matrix}

(50)

\begin{matrix} r [\begin{matrix} A & B & - A & - B \\ C & D & - C & D \\ A & B & A & B \\ C & - D & C & D \end{matrix}] & = r [\begin{matrix} (1 + i) A & (1 + i) B \\ (1 + i) C & (1 - i) D \end{matrix}] + r [\begin{matrix} (1 - i) A & (1 - i) B \\ (1 - i) C & (1 + i) D \end{matrix}] . \end{matrix}

(51)

Proof.

Follows from replacing the two matrices in (20) and (21) with the given

2 \times 2

sub-block matrices on the left-hand sides of (37)–(51). □

Theorem 6.

Let

A, B, C, D \in C^{m \times n} .

Then, the following rank equalities hold:

\begin{matrix} r [\begin{matrix} A & B & C & D \\ B & A & D & C \\ C & D & A & B \\ D & C & B & A \end{matrix}] & = r (A + B + C + D) + r (A + B - C - D) \\ + r (A - B + C - D) + r (A - B - C + D), \end{matrix}

(52)

\begin{matrix} r [\begin{matrix} A & B & - C & - D \\ B & A & - D & - C \\ C & D & A & B \\ D & C & B & A \end{matrix}] & = r (A + B + i C + i D) + r (A + B - i C - i D) \\ + r (A - B + i C - i D) + r (A - B - i C + i D), \end{matrix}

(53)

\begin{matrix} r [\begin{matrix} A & B & C & D \\ D & A & B & C \\ C & D & A & B \\ B & C & D & A \end{matrix}] & = r (A + B + C + D) + r (A - B + C - D) \\ + r (A + i B - C - i D) + r (A - i B - C + i D) . \end{matrix}

(54)

For

A, B, C, D \in C^{m \times m}

, and

k \geq 2,

the following rank equalities hold:

\begin{matrix} r ({[\begin{matrix} A & B & C & D \\ B & A & D & C \\ C & D & A & B \\ D & C & B & A \end{matrix}]}^{k}) & = r ({(A + B + C + D)}^{k}) + r ({(A + B - C - D)}^{k}) \\ + r ({(A - B + C - D)}^{k}) + r ({(A - B - C + D)}^{k}), \end{matrix}

(55)

\begin{matrix} r ({[\begin{matrix} A & B & - C & - D \\ B & A & - D & - C \\ C & D & A & B \\ D & C & B & A \end{matrix}]}^{k}) & = r ({(A + B + i C + i D)}^{k}) + r ({(A + B - i C - i D)}^{k}) \\ + r ({(A - B + i C - i D)}^{k}) + r ({(A - B - i C + i D)}^{k}), \end{matrix}

(56)

\begin{matrix} r ({[\begin{matrix} A & B & C & D \\ D & A & B & C \\ C & D & A & B \\ B & C & D & A \end{matrix}]}^{k}) & = r ({(A + B + C + D)}^{k}) + r ({(A - B + C - D)}^{k}) \\ + r ({(A + i B - C - i D)}^{k}) + r ({(A - i B - C + i D)}^{k}) . \end{matrix}

(57)

Proof.

Follows from replacing the two matrices in (20) and (21) with the given

2 \times 2

sub-block matrices on the left-hand sides of (55)–(57). □

Theorem 7.

Let

A, B \in C^{m \times m} .

Then, the following rank equalities hold:

\begin{matrix} r [\begin{matrix} A^{2} & A B & B^{2} & B A \\ B A & B^{2} & A B & A^{2} \\ B^{2} & B A & A^{2} & A B \\ A B & A^{2} & B A & B^{2} \end{matrix}] & = r ({(A + B)}^{2}) + r ({(A - B)}^{2}) \end{matrix}

(58)

\begin{matrix} + r ((A + B) (A - B)) + r ((A - B) (A + B)), \end{matrix}

(59)

\begin{matrix} r [\begin{matrix} - A^{2} & A B & B^{2} & B A \\ B A & - B^{2} & A B & A^{2} \\ B^{2} & B A & - A^{2} & A B \\ A B & A^{2} & B A & - B^{2} \end{matrix}] & = r ({(A + i B)}^{2}) + r ({(A - i B)}^{2}) \end{matrix}

(60)

\begin{matrix} + r ((A + i B) (A - i B)) + r ((A - i B) (A + i B)) . \end{matrix}

(61)

Proof.

Follows from replacing the two matrices in (20) and (21) with the given

2 \times 2

sub-block matrices on the left-hand sides of (59) and (61). □

The rank expansion formulas in the following lemma are well-known in elementary linear algebra, which can be viewed as entry-level examples for illustrating the constructive matrix methodology.

Lemma 6.

Let

A \in C^{m \times n},

B \in C^{n \times m},

and let

λ_{1}, λ_{2}, \dots, λ_{k}

be scalars. Then, the following rank equalities hold:

\begin{matrix} r [\begin{matrix} I_{m} & A \\ B & I_{n} \end{matrix}] = m + r (I_{n} - B A) = n + r (I_{m} - A B), \\ r [\begin{matrix} I_{m} & λ_{1} A + λ_{2} A B A + \dots + λ_{k} {(A B)}^{k - 1} A \\ B & I_{n} \end{matrix}] \\ = n + r (I_{m} + λ_{1} A B + λ_{2} {(A B)}^{2} + \dots + λ_{k} {(A B)}^{k}) \\ = m + r (I_{n} + λ_{1} B A + λ_{2} {(B A)}^{2} + \dots + λ_{k} {(B A)}^{k}) . \end{matrix}

If

m = n,

then the following rank equalities hold:

\begin{matrix} r (I_{m} - A B) & = r (I_{m} - B A), \\ r (I_{m} + λ_{1} A B + λ_{2} {(A B)}^{2} + \dots + λ_{k} {(A B)}^{k}) & = r (I_{m} + λ_{1} B A + λ_{2} {(B A)}^{2} + \dots + λ_{k} {(B A)}^{k}) . \end{matrix}

Theorem 8.

Let

A \in C^{m \times n},

X \in C^{n \times p},

Y \in C^{q \times m},

and

B \in C^{p \times q} .

Then, the following rank equalities hold:

\begin{matrix} r [\begin{matrix} A & A X B \\ B Y A & B \end{matrix}] & = r (A) + r (B - B Y A X B) = r (B) + r (A - A X B Y A) . \end{matrix}

(62)

In particular, if

r (A) = r (B),

then

r (B - B Y A X B) = r (A - A X B Y A)

holds. Hence, the two equalities

A X B Y A = A

and

B Y A X B = B

are equivalent under

r (A) = r (B) .

Proof.

Applying (9) to the block matrix

[\begin{matrix} A & A X B \\ B Y A & B \end{matrix}]

and simplifying, we first obtain

r [\begin{matrix} A & A X B \\ B Y A & B \end{matrix}] = r [\begin{matrix} A & 0 \\ 0 & B - B Y A X B \end{matrix}] = r (A) + r (B - B Y A X B),

r [\begin{matrix} A & A X B \\ B Y A & B \end{matrix}] = r [\begin{matrix} A - A X B Y A & 0 \\ 0 & B \end{matrix}] = r (B) + r (A - A X B Y A) .

Combining the two rank equalities yields (62). □

As immediate applications of (62), we are able to obtain the following concrete matrix rank formulas and their consequences.

Theorem 9.

Let

A \in C^{m \times n}

and

B \in C^{n \times m} .

Then, the following rank equalities hold:

\begin{matrix} r [\begin{matrix} A & A B \\ B A & B \end{matrix}] & = r (A) + r (B - B A B) \\ = r (B) + r (A - A B A), \\ r [\begin{matrix} A & {(A B)}^{2} \\ B A & B \end{matrix}] & = r (A) + r (B - B {(A B)}^{2}) \\ = r (B) + r (A - {(A B)}^{2} A), \\ r [\begin{matrix} A & {(A B)}^{2} \\ {(B A)}^{2} & B \end{matrix}] & = r (A) + r (B - B {(A B)}^{3}) \\ = r (B) + r (A - {(A B)}^{3} A), \\ r [\begin{matrix} A & A B + {(A B)}^{2} \\ B A & B \end{matrix}] & = r (A) + r (B - B A B - B {(A B)}^{2}) \\ = r (B) + r (A - A B A - {(A B)}^{2} A), \\ r [\begin{matrix} A & A B + {(A B)}^{2} \\ {(B A)}^{2} & B \end{matrix}] & = r (A) + r (B - B {(A B)}^{2} - B {(A B)}^{3}) \\ = r (B) + r (A - {(A B)}^{2} A - {(A B)}^{3} A) . \end{matrix}

In consequence, the following facts hold:

\begin{matrix} A B A = A & \Leftrightarrow r (B - B A B) = r (B) - r (A), \\ B A B = B & \Leftrightarrow r (A - A B A) = r (A) - r (B), \\ r (A) = r (B) a n d A B A = A & \Leftrightarrow r (A) = r (B) a n d B A B = B, \\ {(A B)}^{2} A = A & \Leftrightarrow r (B - B {(A B)}^{2}) = r (B) - r (A), \\ B {(A B)}^{2} = B & \Leftrightarrow r (A - {(A B)}^{2} A) = r (A) - r (B), \\ r (A) = r (B) a n d {(A B)}^{2} A = A & \Leftrightarrow r (A) = r (B) a n d B {(A B)}^{2} = B, \\ {(A B)}^{2} A + A B A = A & \Leftrightarrow r (B - B A B - B {(A B)}^{2}) = r (B) - r (A), \\ B {(A B)}^{2} + B A B = B & \Leftrightarrow r (A - A B A - {(A B)}^{2} A) = r (A) - r (B), \\ r (A) = r (B) a n d {(A B)}^{2} A + A B A = A & \Leftrightarrow r (A) = r (B) a n d B {(A B)}^{2} + B A B = B, \\ {(A B)}^{3} A + {(A B)}^{2} A = A & \Leftrightarrow r (B - B {(A B)}^{2} - B {(A B)}^{3}) = r (B) - r (A), \\ B {(A B)}^{3} + B {(A B)}^{2} = B & \Leftrightarrow r (A - {(A B)}^{2} A - {(A B)}^{3} A) = r (A) - r (B), \\ r (A) = r (B) a n d {(A B)}^{3} A + {(A B)}^{2} A = A & \Leftrightarrow r (A) = r (B) a n d B {(A B)}^{3} + B {(A B)}^{2} = B . \end{matrix}

All the rank expansion formulas in the following theorems can directly be established by (9) and elementary block matrix operations and, therefore, their derivations are omitted.

Theorem 10.

Let

A, B \in C^{m \times m} .

Then, the following rank equalities hold:

\begin{matrix} r [\begin{matrix} A B & A B^{2} \\ B A^{2} & B A \end{matrix}] & = r [\begin{matrix} A B & 0 \\ B A^{2} & B A - B A^{2} B \end{matrix}] = r [\begin{matrix} A B - A B^{2} A & A B^{2} \\ 0 & B A \end{matrix}], \\ r [\begin{matrix} A B & B^{2} A \\ A^{2} B & B A \end{matrix}] & = r [\begin{matrix} A B & B^{2} A \\ 0 & B A - A B^{2} A \end{matrix}] = r [\begin{matrix} A B - B A^{2} B & B^{2} A \\ 0 & B A \end{matrix}], \end{matrix}

and the following facts hold (cf. [9]):

\begin{matrix} r (A B) = r (B A) a n d A B^{2} A = A B & \Leftrightarrow r (A B) = r (B A) a n d B A^{2} B = B A, \\ r (A B) = r (B A) a n d A B^{2} A = B A & \Leftrightarrow r (A B) = r (B A) a n d B A^{2} B = A B . \end{matrix}

Theorem 11.

Let

A \in C^{m \times n},

B \in C^{m \times n},

and let λ be a nonzero scalar. Then, the following rank equalities hold:

\begin{matrix} r [\begin{matrix} A B & A B A \\ B A B & λ B A \end{matrix}] & = r (A B) + r (λ B A - {(B A)}^{2}) \\ = r (B A) + r (λ A B - {(A B)}^{2}), \\ r [\begin{matrix} A B & {(A B)}^{2} A \\ B A B & λ B A \end{matrix}] & = r (A B) + r (λ B A - {(B A)}^{3}) \\ = r (B A) + r (λ A B - {(A B)}^{3}), \\ r [\begin{matrix} A B & {(A B)}^{2} A \\ {(B A)}^{2} B & λ B A \end{matrix}] & = r (A B) + r (λ B A - {(B A)}^{4}) \\ = r (B A) + r (λ A B - {(A B)}^{4}), \\ r [\begin{matrix} A B & A B A + {(A B)}^{2} A \\ B A B & λ B A \end{matrix}] & = r (A B) + r (λ B A - {(B A)}^{2} - {(B A)}^{3}) \\ = r (B A) + r (λ A B - {(A B)}^{2} - {(A B)}^{3}) . \end{matrix}

In consequence, the following facts hold:

\begin{matrix} {(A B)}^{2} = λ A B & \Leftrightarrow r (λ B A - {(B A)}^{2}) = r (B A) - r (A B), \\ {(B A)}^{2} = λ B A & \Leftrightarrow r (λ A B - {(A B)}^{2}) = r (A B) - r (B A), \\ r (A B) = r (B A) a n d {(A B)}^{2} = λ A B & \Leftrightarrow r (A B) = r (B A) a n d {(B A)}^{2} = λ B A . \\ {(A B)}^{3} = λ A B & \Leftrightarrow r (λ B A - {(B A)}^{3}) = r (B A) - r (A B), \\ {(B A)}^{3} = λ B A & \Leftrightarrow r (λ A B - {(A B)}^{3}) = r (A B) - r (B A), \\ r (A B) = r (B A) a n d {(A B)}^{3} = λ A B & \Leftrightarrow r (A B) = r (B A) a n d {(B A)}^{3} = λ B A . \\ {(A B)}^{4} = λ A B & \Leftrightarrow r (λ B A - {(B A)}^{4}) = r (B A) - r (A B), \\ {(B A)}^{4} = λ B A & \Leftrightarrow r (λ A B - {(A B)}^{4}) = r (A B) - r (B A), \\ r (A B) = r (B A) a n d {(A B)}^{4} = λ A B & \Leftrightarrow r (A B) = r (B A) a n d {(B A)}^{4} = λ B A \end{matrix}

and

\begin{matrix} r (A B) = r (B A) a n d {(A B)}^{3} + {(A B)}^{2} = λ A B \\ \Leftrightarrow r (A B) = r (B A) a n d {(B A)}^{3} + {(B A)}^{2} = λ A B . \end{matrix}

Theorem 12.

Let

A \in C^{m \times n}

B \in C^{m \times n},

and let λ be a nonzero scalar. Then, the following rank equalities hold:

\begin{matrix} r [\begin{matrix} A B & {(A B)}^{2} A \\ {(B A)}^{2} B & λ {(B A)}^{2} \end{matrix}] & = r (A B) + r (λ {(B A)}^{2} - {(B A)}^{4}) \\ = r ({(B A)}^{2}) + r (λ A B - {(A B)}^{3}), \\ r [\begin{matrix} {(A B)}^{2} & {(A B)}^{2} A \\ {(B A)}^{2} B & λ {(B A)}^{2} \end{matrix}] & = r ({(A B)}^{2}) + r (λ {(B A)}^{2} - {(B A)}^{3}) \\ = r ({(B A)}^{2}) + r (λ {(A B)}^{2} - {(A B)}^{3}), \end{matrix}

and the following facts hold:

\begin{matrix} {(A B)}^{3} = λ A B & \Leftrightarrow r (λ {(B A)}^{2} - {(B A)}^{4}) = r ({(B A)}^{2}) - r (A B), \\ {(B A)}^{4} = λ {(B A)}^{2} & \Leftrightarrow r (λ A B - {(A B)}^{3}) = r (A B) - r ({(B A)}^{2}), \\ {(A B)}^{3} = λ {(A B)}^{2} & \Leftrightarrow r (λ {(B A)}^{2} - {(B A)}^{3}) = r ({(B A)}^{2}) - r ({(A B)}^{2}), \\ {(B A)}^{3} = λ {(B A)}^{2} & \Leftrightarrow r (λ {(A B)}^{2} - {(A B)}^{3}) = r ({(A B)}^{2})) - r ({(B A)}^{2}) \end{matrix}

and

\begin{matrix} r ({(B A)}^{2}) = r (A B) a n d {(A B)}^{3} = λ A B \\ \Leftrightarrow r ({(B A)}^{2}) = r (A B) a n d {(B A)}^{4} = λ {(B A)}^{2}, \\ r ({(B A)}^{2}) = r ({(B A)}^{2}) a n d {(A B)}^{3} = λ {(A B)}^{2} \\ \Leftrightarrow r ({(B A)}^{2}) = r ({(B A)}^{2}) a n d {(B A)}^{3} = λ {(B A)}^{2} . \end{matrix}

Theorem 13.

Let

A \in C^{m \times n},

B \in C^{m \times n},

and let λ be a nonzero scalar. Then, the following rank equalities hold:

\begin{matrix} r [\begin{matrix} A B A & {(A B)}^{2} \\ {(B A)}^{2} & λ B A B \end{matrix}] & = r (A B A) + r (λ B A B - {(B A)}^{2} B) \\ = r (B A B) + r (λ A B A - {(A B)}^{2} A), \\ r [\begin{matrix} A B A & {(A B)}^{3} \\ {(B A)}^{2} & λ B A B \end{matrix}] & = r (A B A) + r (λ B A B - {(B A)}^{3} B) \\ = r (B A B) + r (λ A B A - {(A B)}^{3} A), \\ r [\begin{matrix} A B A & {(A B)}^{2} + {(A B)}^{3} \\ {(B A)}^{2} & λ B A B \end{matrix}] & = r (A B A) + r (λ B A B - B {(A B)}^{2} - B {(A B)}^{3}) \\ = r (B A B) + r (λ A B A - {(A B)}^{2} A - {(A B)}^{3} A), \\ r [\begin{matrix} A B A & {(A B)}^{2} + {(A B)}^{3} \\ {(B A)}^{3} & λ B A B \end{matrix}] & = r (A B A) + r (λ B A B - B {(A B)}^{3} - B {(A B)}^{4}) \\ = r (B A B) + r (λ A B A - {(A B)}^{3} A - {(A B)}^{4} A), \end{matrix}

and the following facts hold:

\begin{matrix} {(A B)}^{2} A = λ A B A \Leftrightarrow r (λ B A B - {(B A)}^{2} B) = r (B A B) - r (A B A), \\ {(B A)}^{2} B = λ B A B \Leftrightarrow r (λ A B A - {(A B)}^{2} A) = r (A B A) - r (B A B) \end{matrix}

and

\begin{matrix} r (A B A) = r (B A B) a n d {(A B)}^{2} A = λ A B A \\ \Leftrightarrow r (A B A) = r (B A B) a n d {(B A)}^{2} B = λ B A B, \\ r (A B A) = r (B A B) a n d {(A B)}^{3} A = λ A B A \\ \Leftrightarrow r (A B A) = r (B A B) a n d {(B A)}^{3} B = λ B A B, \\ r (A B A) = r (B A B) a n d {(A B)}^{3} A + {(A B)}^{2} A = λ A B A \\ \Leftrightarrow r (A B A) = r (B A B) a n d B {(A B)}^{3} + B {(A B)}^{2} = λ B A B, \\ r (A B A) = r (B A B) a n d {(A B)}^{4} A + {(A B)}^{3} A = λ A B A \\ \Leftrightarrow r (A B A) = r (B A B) a n d B {(A B)}^{4} + B {(A B)}^{3} = λ B A B . \end{matrix}

Moreover, we have the following expansion formulas for calculating the ranks of block matrices composed of multiple products of matrices.

Theorem 14.

Let

A \in C^{m \times n},

B \in C^{n \times p},

C \in C^{p \times m},

and let λ be a nonzero scalar. Then, the following rank equalities hold:

\begin{matrix} r [\begin{matrix} A B & A B C \\ B C A B & λ B C \end{matrix}] & = r (A B) + r (λ B C - B C A B C) \\ = r (B C) + r (λ A B - A B C A B), \\ r [\begin{matrix} A B & {(A B C)}^{2} \\ B C A B & λ B C \end{matrix}] & = r (A B) + r (λ B C - B C {(A B C)}^{2}) \\ = r (B C) + r (λ A B - {(A B C)}^{2} A B), \\ r [\begin{matrix} A B & A B C \\ {(B C A)}^{2} B & λ B C \end{matrix}] & = r (A B) + r (λ B C - B C {(A B C)}^{2}) \\ = r (B C) + r (λ A B - {(A B C)}^{2} A B), \\ r [\begin{matrix} A B & {(A B C)}^{2} \\ {(B C A)}^{2} B & λ B C \end{matrix}] & = r (A B) + r (λ B C - B C {(A B C)}^{3}) \\ = r (B C) + r (λ A B - {(A B C)}^{3} A B), \\ r [\begin{matrix} A B & A B C + {(A B C)}^{2} \\ B C A B & λ B C \end{matrix}] & = r (A B) + r (λ B C - B C A B C - B C {(A B C)}^{2}) \\ = r (B C) + r (λ A B - A B C A B - {(A B C)}^{2} A B), \\ r [\begin{matrix} A B & A B C + {(A B C)}^{2} \\ {(B C A)}^{2} B & λ B C \end{matrix}] & = r (A B) + r (λ B C - B C {(A B C)}^{2} - B C {(A B C)}^{3}) \\ = r (B C) + r (λ A B - {(A B C)}^{2} A B - {(A B C)}^{3} A B) . \end{matrix}

Remark 1.

The versatile and engaging findings in the above theorems actually reveal the remarkable fact that we are able to establish certain nontrivial links between different algebraic matrix equalities composed of general matrices through certain constructions of block matrices and the calculations of the ranks of the block matrices. Undoubtedly, numerous expressions of block matrices and the corresponding rank expansion formulas can be reasonably established for two or more given matrices and their algebraic operations, and consequently, many kinds of essential equivalent facts concerning matrix equalities can be obtained from the rank formulas as more and deeper approaches are devoted to this research topic.

4. Characterizations of Matrix Equalities by the Matrix Rank Methodology

In this section, we continue to construct some block matrices, calculate their ranks, and use them to characterize various matrix equalities related to the Hermitian matrix, the skew-Hermitian matrix, the normal matrix, etc.

Theorem 15.

Let

A \in C^{n \times n}

and

B, C \in C^{m \times n} .

Then, the following five conditions are equivalent:

(a): $r [\begin{matrix} A & A^{*} \\ B A^{*} A & C A A^{*} \end{matrix}] = r (A) .$
(b): $r [\begin{matrix} A A^{†} & A^{†} A \\ B A^{*} & C A \end{matrix}] = r (A) .$
(c): $R [\begin{matrix} A \\ B A^{*} A \end{matrix}] = R [\begin{matrix} A^{*} \\ C A A^{*} \end{matrix}] .$
(d): $R [\begin{matrix} A A^{†} \\ B A^{*} \end{matrix}] = R [\begin{matrix} A^{†} A \\ C A \end{matrix}] .$
(e): $R (A) = R (A^{*})$ and $B A^{*} = C A .$

Proof.

Applying (5) to the block matrix

[\begin{matrix} A & A^{*} \\ B A^{*} A & C A A^{*} \end{matrix}]

, we first obtain

\begin{matrix} r [\begin{matrix} A & A^{*} \\ B A^{*} A & C A A^{*} \end{matrix}] = r (A) + r [\begin{matrix} 0 & A^{*} - A A^{†} A^{*} \\ 0 & C A A^{*} - B A^{*} A^{*} \end{matrix}], \\ r [\begin{matrix} A & A^{*} \\ B A^{*} A & C A A^{*} \end{matrix}] = r (A^{*}) + r [\begin{matrix} A - A^{†} A^{2} & 0 \\ B A^{*} A - C A^{2} & 0 \end{matrix}] . \end{matrix}

In this case, the matrix rank equality in Condition (a) is equivalent to the following facts:

\begin{matrix} A^{2} A^{†} = A^{†} A^{2} = A, B A^{*} A = C A^{2}, B {(A^{*})}^{2} = C A A^{*} \end{matrix}

(63)

by (8). Obviously, the first two matrix equalities in (63) are equivalent to

R (A) = R (A^{*})

by (6). Next, post-multiplying the second equality with

A^{†}

and simplifying by the first pair of equalities leads to

B A^{*} = C A

, thus establishing the equivalence of Conditions (a) and (e).

Also, by (4), we are able to obtain the following two rank equalities

r [\begin{matrix} A A^{†} & A^{†} A \\ B A^{*} & C A \end{matrix}] = r (A) + r [\begin{matrix} 0 & A^{†} A - A A^{†} A^{†} A \\ 0 & C A - B A^{*} \end{matrix}],

r [\begin{matrix} A A^{†} & A^{†} A \\ B A^{*} & C A \end{matrix}] = r (A) + r [\begin{matrix} A A^{†} - A A^{†} A^{†} A & 0 \\ B A^{*} - C A & 0 \end{matrix}] .

Consequently, Condition (b) is equivalent to

A A^{†} A^{†} A = A A^{†} = A^{†} A

and

B A^{*} = C A

by (8), where

A A^{†} = A^{†} A

is well-known to be equivalent to

R (A) = R (A^{*})

(cf. [1]). Thus, Conditions (b) and (e) are equivalent.

Note that the two rank equalities in Conditions (a) and (b) can also be written as

r [\begin{matrix} A & A^{*} \\ B A^{*} A & C A A^{*} \end{matrix}] = r [\begin{matrix} A \\ B A^{*} A \end{matrix}] = r [\begin{matrix} A^{*} \\ C A A^{*} \end{matrix}], r [\begin{matrix} A A^{†} & A^{†} A \\ B A^{*} & C A \end{matrix}] = r [\begin{matrix} A A^{†} \\ B A^{*} \end{matrix}] = r [\begin{matrix} A^{†} A \\ C A \end{matrix}] .

In such cases, applying (6) to the two pairs of equalities leads to the equivalences of Conditions (a)–(d). □

For different choices of B and C in Theorem 15, we are able to derive various concrete equivalent facts concerning matrix equalities that involve a matrix and its conjugate transpose. We next present five corollaries that display how rank and range equalities of block matrices are involved in the characterizations of some well-known matrix equalities composed of matrix products of a matrix and its conjugate transpose (cf. [2,10,11,12,13,14,15,16,17]).

As we know, Hermitian matrices possess many elegant and pleasing formulas and facts, have many significant applications in the research areas of both theoretical and applied mathematics, and have already been recognized as one of the basic conceptual objects and building materials in matrix theory and its applications.

Corollary 2.

Let

A \in C^{m \times m} .

Then, the following five conditions are equivalent:

(a): $r [\begin{matrix} A & A^{*} \\ A^{*} A & A A^{*} \end{matrix}] = r (A) .$
(b): $r [\begin{matrix} A A^{†} & A^{†} A \\ A^{*} & A \end{matrix}] = r (A) .$
(c): $R [\begin{matrix} A \\ A^{*} A \end{matrix}] = R [\begin{matrix} A^{*} \\ A A^{*} \end{matrix}] .$
(d): $R [\begin{matrix} A A^{†} \\ A^{*} \end{matrix}] = R [\begin{matrix} A^{†} A \\ A \end{matrix}] .$
(e): $A = A^{*} .$

Proof.

Replacing

B = C = I_{m}

in Theorem 15 leads to the equivalences of Conditions (a)–(e), where

A = A^{*}

obviously implies

R (A) = R (A^{*})

and thus is removed from Condition (e) of Theorem 15. □

Given the above preparations, we are able to derive our main results as follows.

Theorem 16.

Let

A, B \in C^{m \times n} .

Then, the following five conditions are equivalent:

(a): $A = B$ .
(b): $R (A) = R (B),$ $R (A^{*}) = R (B^{*})$ and $A B^{†} A = A .$
(c): $R (A) = R (B),$ $R (A^{*}) = R (B^{*})$ and $B A^{†} B = B .$
(d): $R (A) = R (B),$ $R (A^{*}) = R (B^{*})$ and $A^{†} B A^{†} = A^{†} .$
(e): $R (A) = R (B),$ $R (A^{*}) = R (B^{*})$ and $B^{†} A B^{†} = B^{†} .$

Proof.

The equivalences of Conditions (a)–(e) follow from (14) and (15). □

Theorem 17.

Let

A, B \in C^{m \times m}

be two Hermitian matrices. Then, the following three conditions are equivalent:

(a): $A = B .$
(b): $r (A B) = r (A) = r (B),$ $A B = B A,$ and $A B A = B A B .$
(c): $r (A B) = r (A) = r (B),$ $A B A = B A B,$ and ${(A B)}^{2} = {(B A)}^{2} .$

Specifically, for two invertible Hermitian matrices A and B of the same size, the following three conditions are equivalent:

(d): $A = B .$
(e): $A B = B A$ and $A B A = B A B .$
(f): $A B A = B A B$ and ${(A B)}^{2} = {(B A)}^{2} .$

Proof.

Condition (a) obviously implies Conditions (b) and (c) under the Hermitian matrix assumption. Under Condition (b), we apply the well-known Frobenius rank inequality

r (P N Q) \geq r (P N) + r (N Q) - r (N)

for the product of any three matrices of appropriate sizes to the two products

A B A

and

B A B

to obtain:

r (A B A) \geq r (A B) + r (B A) - r (B) = r (B) = r (A),

r (B A B) \geq r (B A) + r (A B) - r (A) = r (A) = r (B) .

Combining the facts with the two regular rank inequalities

r (A B A) \leq r (A)

and

r (B A B) \leq r (B)

leads to:

r (A B A) = r (B A B) = r (A) = r (B),

which further implies that the following range equalities

R (A B A) = R (A) and R (B A B) = R (B)

hold as well, since

R (A B A) \subseteq R (A)

and

R (B A B) \subseteq R (B)

always hold. In this case, we obtain, by

A B = B A

, Lemma 3, and elementary block matrix operations, that

\begin{matrix} r [\begin{matrix} A B A & B A B \\ A & B \end{matrix}] & = r [\begin{matrix} A B A - A B A & B A B - A B B \\ A & B \end{matrix}] \\ = r [\begin{matrix} 0 & 0 \\ A & B \end{matrix}] = r [A, B] = r [A B A, B A B], \end{matrix}

which implies that

X [A B A, B A B] = [X A B A, X B A B] = [A, B]

holds for matrix X by Lemma 4. Finally, pre-multiplying

A B A = B A B

by X and simplifying yields

A = B

, thus establishing Condition (a).

Under Condition (c), we apply the Frobenius rank inequality to the two products

{(A B)}^{2}

and

{(B A)}^{2}

to obtain

r ({(A B)}^{2}) \geq r (A B A) + r (B A B) - r (B A) = r (A) = r (B),

r ({(B A)}^{2}) \geq r (B A B) + r (A B A) - r (A B) = r (A) = r (B),

which further imply

r ({(A B)}^{2}) = r ({(B A)}^{2}) = r (A) = r (B),

and thus,

R ({(A B)}^{2}) = R (A) and R ({(B A)}^{2}) = R (B)

hold. In this case, we obtain, by

A B A = B A B

, Lemma 3, and elementary block matrix operations, that

\begin{matrix} r [\begin{matrix} {(A B)}^{2} & {(B A)}^{2} \\ B & A \end{matrix}] & = r [\begin{matrix} {(A B)}^{2} - A B A B & {(B A)}^{2} - A B A A \\ B & A \end{matrix}] \\ = r [\begin{matrix} 0 & 0 \\ B & A \end{matrix}] = r [A, B] = r [{(A B)}^{2}, {(B A)}^{2}], \end{matrix}

which implies that the following equalities

X [{(A B)}^{2}, {(B A)}^{2}] = [X {(A B)}^{2}, X {(B A)}^{2}] = [B, A]

hold for matrix X by Lemma 4. Finally, pre-multiplying

{(A B)}^{2} = {(B A)}^{2}

with X and simplifying yields

B = A

, thus establishing (a).

If A and B are invertible, then

r (A B) = r (A) = r (B)

hold naturally. Thus, Conditions (a), (b), and (c) are reduced to Conditions (d), (e), and (f), respectively. □

As direct consequences, we replace A with

[\begin{matrix} 0 & A \\ A^{*} & 0 \end{matrix}]

and B with

[\begin{matrix} 0 & B \\ B^{*} & 0 \end{matrix}]

in Theorem 17, where A and B are any two

m \times n

matrices, to obtain the following facts.

Corollary 3.

Let

A, B \in C^{m \times n} .

Then, the following three statements are equivalent:

(a): $A = B .$
(b): $r (A B^{*}) = r (A^{*} B) = r (A) = r (B),$ $A B^{*} = B A^{*},$ $A^{*} B = B^{*} A,$ and $A B^{*} A = B A^{*} B .$
(c): $r (A B^{*}) = r (A^{*} B) = r (A) = r (B),$ $A B^{*} A = B A^{*} B,$ ${(A B^{*})}^{2} = {(B A^{*})}^{2},$ and ${(A^{*} B)}^{2} = {(B^{*} A)}^{2} .$

Similar to the preceding results and facts, the equivalent facts in Theorems 16 and 17 and their derivations also reveal some essential links between the matrix equality

A = B

and some other weaker conditions composed of different matrix rank equalities and algebraic matrix equalities for A and B, where

A B A = B A B

is known as the Yang–Baxter matrix equation in the literature (cf. [18,19,20,21]).

Theorem 18.

Let

A \in C^{m \times m} .

Then, the following 12 conditions are equivalent:

(a): $r [\begin{matrix} A & A^{*} \\ {(A^{*} A)}^{2} & {(A A^{*})}^{2} \end{matrix}] = r (A) .$
(b): $r [\begin{matrix} A A^{†} & A^{†} A \\ A^{*} A A^{*} & A A^{*} A \end{matrix}] = r (A) .$
(c): $R [\begin{matrix} A \\ {(A^{*} A)}^{2} \end{matrix}] = R [\begin{matrix} A^{*} \\ {(A A^{*})}^{2} \end{matrix}] .$
(d): $R [\begin{matrix} A A^{†} \\ A^{*} A A^{*} \end{matrix}] = R [\begin{matrix} A^{†} A \\ A A^{*} A \end{matrix}] .$
(e): $A^{3} = A^{*} A A^{*}$ and $A^{5} = {(A^{*} A)}^{2} A^{*} .$
(f): A is Hermitian.
(g): $A A^{*} A$ is Hermitian.
(h): ${(A A^{*})}^{2} A$ is Hermitian.
(i): $r (A^{2}) = r (A),$ $A^{2}$ and $A^{3}$ are Hermitian.
(j): $r (A^{2}) = r (A),$ $A^{2}$ and $A^{5}$ are Hermitian.
(k): $r (A^{2}) = r (A),$ $A^{3}$ and $A^{4}$ are Hermitian.
(l): $r (A^{2}) = r (A),$ $A^{3}$ and $A^{5}$ are Hermitian.

Proof.

Replacing

B = A^{*} A

and

C = A A^{*}

in Theorem 15 leads to the equivalences of Conditions (a)–(d) and (g), where

A^{*} A A^{*} = A A^{*} A

implies

R (A) = R (A^{*})

and thus is removed in Condition (g).

The equivalences of Conditions (f), (g) and (h) follow from [4], Theorem 7.

The first equality in (e) implies

R (A) \supseteq R (A^{3}) = R (A^{*} A A^{*}) = R (A^{*})

. So that

R (A) = R (A^{*})

holds by noting

r (A) = r (A^{*})

. Hence,

A A^{†} = A^{†} A

and

{(A^{4})}^{†} A^{4} = A^{†} A

hold by Lemma 5 (II) and (III). Now substituting the first equality into the second in (e) leads to

A^{5} = A^{4} A^{*}

. Pre-multiplying the equality with

{(A^{4})}^{†}

and simplifying with the above two equalities, we obtain

A = (A^{†} A) A = {(A^{4})}^{†} A^{5} = {(A^{4})}^{†} A^{4} A^{*} = A^{†} A A^{*} = A^{*}

. Thus, Condition (e) implies Condition (f). Conversely, Condition (f) implies Condition (e) as well.

From

r (A^{2}) = r (A)

and

A^{2} = {(A^{2})}^{*}

in Condition (i), we first obtain

R (A) = R (A^{2}) = R ({(A^{2})}^{*}) = R (A^{*})

. Hence,

A A^{†} = A^{†} A

and

{(A^{2})}^{†} A^{2} = A^{†} A

hold by Lemma 5 (II) and (III). In this case, pre-multiplying

A^{3} = (A^{2}) A^{*}

with

{(A^{2})}^{†}

and simplifying yields

A = {(A^{2})}^{†} A^{3} = {(A^{2})}^{†} A^{2} A^{*} = A^{†} A A^{*} = A^{*}

, thus Condition (i) implies Condition (f). The equivalences of Conditions (f), (j), (k), and (l) can be established by a similar approach. □

Theorem 19.

Let

A \in C^{m \times m} .

Then, the following 12 conditions are equivalent:

(a): $r [\begin{matrix} A & - A^{*} \\ {(A^{*} A)}^{2} & {(A A^{*})}^{2} \end{matrix}] = r (A) .$
(b): $r [\begin{matrix} A A^{†} & A^{†} A \\ A^{*} A A^{*} & - A A^{*} A \end{matrix}] = r (A) .$
(c): $R [\begin{matrix} A \\ {(A^{*} A)}^{2} \end{matrix}] = R [\begin{matrix} - A^{*} \\ {(A A^{*})}^{2} \end{matrix}] .$
(d): $R [\begin{matrix} A A^{†} \\ A^{*} A A^{*} \end{matrix}] = R [\begin{matrix} A^{†} A \\ - A A^{*} A \end{matrix}] .$
(e): $A^{3} = A^{*} A A^{*}$ and $A^{5} = - {(A^{*} A)}^{2} A^{*} .$
(f): A is skew-Hermitian.
(g): $A A^{*} A$ is skew-Hermitian.
(h): ${(A A^{*})}^{2} A$ is skew-Hermitian.
(i): $r (A^{2}) = r (A),$ $A^{2}$ is Hermitian, and $A^{3}$ is skew-Hermitian.
(j): $r (A^{2}) = r (A),$ $A^{2}$ is Hermitian, and $A^{5}$ is skew-Hermitian.
(k): $r (A^{2}) = r (A),$ $A^{3}$ is skew-Hermitian, and $A^{4}$ is Hermitian.
(l): $r (A^{2}) = r (A),$ and $A^{3}$ and $A^{5}$ are skew-Hermitian.

Recently, the present author showed in [22] the following equivalent facts:

A^{3} = \pm A A^{*} A \Leftrightarrow A = \pm A^{*} .

Combining this result with (17) and Theorems 1 and 16, we are able to obtain the following consequences.

Theorem 20.

Let

A \in C^{m \times m} .

Then, the following 21 conditions are equivalent:

(a): $A = A^{*} .$
(b): $A^{3} = A A^{*} A .$
(c): $A^{†} = {(A^{†})}^{*} .$
(d): $A^{*} A^{†} A^{*} = A^{*} .$
(e): $A^{†} A^{*} A^{†} = A^{†} .$
(f): $r (A^{2}) = r (A)$ and $A^{†} = {(A^{#})}^{*} .$
(g): $r (A^{2}) = r (A)$ and $A^{#} = {(A^{#})}^{*} .$
(h): $r (A^{2}) = r (A)$ and $A^{*} A^{#} A^{*} = A^{*} .$
(i): $r (A^{2}) = r (A)$ and $A^{†} A^{*} A^{†} = A^{#} .$
(j): $r (A^{2}) = r (A)$ and $A^{#} A^{*} A^{#} = A^{†} .$
(k): $r (A^{2}) = r (A)$ and $A^{#} A^{*} A^{#} = A^{#} .$
(l): $r (A^{2}) = r (A)$ and $A^{*} A^{†} A^{*} A^{†} A^{*} = A^{*} A^{†} A^{*} .$
(m): $r (A^{2}) = r (A)$ and $A^{*} A^{#} A^{*} A^{#} A^{*} = A^{*} A^{#} A^{*} .$
(n): $r (A^{2}) = r (A)$ and $A^{†} A^{*} A^{†} A^{*} A^{†} = A^{†} A^{*} A^{†} .$
(o): $r (A^{2}) = r (A)$ and $A^{†} A^{*} A^{†} A^{*} A^{†} = A^{#} A^{*} A^{#} .$
(p): $r (A^{2}) = r (A)$ and $A^{†} A^{*} A^{#} A^{*} A^{†} = A^{†} A^{*} A^{†} .$
(q): $r (A^{2}) = r (A)$ and $A^{†} A^{*} A^{#} A^{*} A^{†} = A^{#} A^{*} A^{#} .$
(r): $r (A^{2}) = r (A)$ and $A^{#} A^{*} A^{†} A^{*} A^{#} = A^{†} A^{*} A^{†} .$
(s): $r (A^{2}) = r (A)$ and $A^{#} A^{*} A^{†} A^{*} A^{#} = A^{#} A^{*} A^{#} .$
(t): $r (A^{2}) = r (A)$ and $A^{#} A^{*} A^{#} A^{*} A^{#} = A^{†} A^{*} A^{†} .$
(u): $r (A^{2}) = r (A)$ and $A^{#} A^{*} A^{#} A^{*} A^{#} = A^{#} A^{*} A^{#} .$

Theorem 21.

Let

A \in C^{m \times m} .

Then, the following 21 conditions are equivalent:

(a): $A = - A^{*} .$
(b): $A^{3} = - A A^{*} A .$
(c): $A^{†} = - {(A^{†})}^{*} .$
(d): $A^{*} A^{†} A^{*} = - A^{*} .$
(e): $A^{†} A^{*} A^{†} = - A^{†} .$
(f): $r (A^{2}) = r (A)$ and $A^{†} = - {(A^{#})}^{*} .$
(g): $r (A^{2}) = r (A)$ and $A^{#} = - {(A^{#})}^{*} .$
(h): $r (A^{2}) = r (A)$ and $A^{*} A^{#} A^{*} = - A^{*} .$
(i): $r (A^{2}) = r (A)$ and $A^{†} A^{*} A^{†} = - A^{#} .$
(j): $r (A^{2}) = r (A)$ and $A^{#} A^{*} A^{#} = - A^{†} .$
(k): $r (A^{2}) = r (A)$ and $A^{#} A^{*} A^{#} = - A^{#} .$
(l): $r (A^{2}) = r (A)$ and $A^{*} A^{†} A^{*} A^{†} A^{*} = - A^{*} A^{†} A^{*} .$
(m): $r (A^{2}) = r (A)$ and $A^{*} A^{#} A^{*} A^{#} A^{*} = - A^{*} A^{#} A^{*} .$
(n): $r (A^{2}) = r (A)$ and $A^{†} A^{*} A^{†} A^{*} A^{†} = - A^{†} A^{*} A^{†} .$
(o): $r (A^{2}) = r (A)$ and $A^{†} A^{*} A^{†} A^{*} A^{†} = - A^{#} A^{*} A^{#} .$
(p): $r (A^{2}) = r (A)$ and $A^{†} A^{*} A^{#} A^{*} A^{†} = - A^{†} A^{*} A^{†} .$
(q): $r (A^{2}) = r (A)$ and $A^{†} A^{*} A^{#} A^{*} A^{†} = - A^{#} A^{*} A^{#} .$
(r): $r (A^{2}) = r (A)$ and $A^{#} A^{*} A^{†} A^{*} A^{#} = - A^{†} A^{*} A^{†} .$
(s): $r (A^{2}) = r (A)$ and $A^{#} A^{*} A^{†} A^{*} A^{#} = - A^{#} A^{*} A^{#} .$
(t): $r (A^{2}) = r (A)$ and $A^{#} A^{*} A^{#} A^{*} A^{#} = - A^{†} A^{*} A^{†} .$
(u): $r (A^{2}) = r (A)$ and $A^{#} A^{*} A^{#} A^{*} A^{#} = - A^{#} A^{*} A^{#} .$

As applications of the above results, we present below a group of characterizations of normal matrix.

Theorem 22.

Let

A \in C^{m \times m} .

Then, the following 13 conditions are equivalent:

(a): $r [\begin{matrix} A & A^{*} \\ A A^{*} A & A^{*} A A^{*} \end{matrix}] = r (A) .$
(b): $r [\begin{matrix} A A^{†} & A^{†} A \\ A A^{*} & A^{*} A \end{matrix}] = r (A) .$
(c): $R [\begin{matrix} A \\ A A^{*} A \end{matrix}] = R [\begin{matrix} A^{*} \\ A^{*} A A^{*} \end{matrix}] .$
(d): $R [\begin{matrix} A A^{†} \\ A A^{*} \end{matrix}] = R [\begin{matrix} A^{†} A \\ A^{*} A \end{matrix}] .$
(e): $r (A^{2}) = r (A),$ $A {(A^{*})}^{2} A = A^{*} A^{2} A^{*},$ and $A {(A^{*})}^{2} A^{2} A^{*} = A^{*} A^{2} {(A^{*})}^{2} A .$
(f): $r (A^{2}) = r (A),$ $A {(A^{*})}^{2} A^{2} A^{*} = A^{*} A^{2} {(A^{*})}^{2} A,$ and ${(A {(A^{*})}^{2} A)}^{2} = {(A^{*} A^{2} A^{*})}^{2} .$
(g): $r (A^{2}) = r (A),$ $A^{2} A^{*} = A^{*} A^{2},$ and $A^{3} A^{*} = A^{*} A^{3} .$
(h): $A A^{*} = A^{*} A,$ namely, A is normal.
(i): $r (A^{2}) = r (A),$ and $A^{2}$ and $A^{3}$ are normal.
(j): $r (A^{2}) = r (A),$ and $A^{2}$ and $A^{5}$ are normal.
(k): $r (A^{2}) = r (A),$ and $A^{3}$ and $A^{4}$ are normal.
(l): $r (A^{2}) = r (A),$ and $A^{3}$ and $A^{5}$ are normal.
(m): $r (A^{2}) = r (A),$ and $A^{4}$ and $A^{5}$ are normal.

Proof.

Replacing

B = A

and

C = A^{*}

in Theorem 15 leads to the equivalences of Conditions (a)–(d) and (h), where

A A^{*} = A^{*} A

also implies

R (A) = R (A^{*})

and thus is removed from Condition (h). The equivalences of Conditions (e), (f), and (h) follow from replacing A and B with

A A^{*}

and

A^{*} A

in Theorem 17 (a), (b), and (c).

Note that

r (A^{2}) = r (A)

is obviously equivalent to the fact that

A = A^{2} X = Y A^{2}

hold for certain matrices X and Y. In this case, substituting the second given equality in Condition (g) into both sides of the third given equality in Condition (g), respectively, yields

A^{3} A^{*} = A^{2} A^{*} A

and

A^{*} A^{3} = A A^{*} A^{2}

. Consequently, pre-and post-multiplying the two equalities with the above two matrices Y and X respectively yields

A^{2} A^{*} = A A^{*} A

and

A^{*} A^{2} = A A^{*} A

. Next pre-and post-multiplying these two equalities with

A^{*}

respectively yields

A^{*} A^{2} A^{*} = {(A A^{*})}^{2} = {(A^{*} A)}^{2} = A {(A^{*})}^{2} A

. In this case,

(A A^{*} - A^{*} A) {(A A^{*} - A^{*} A)}^{*} = {(A A^{*})}^{2} + {(A^{*} A)}^{2} - A {(A^{*})}^{2} A - A^{*} A^{2} A^{*} = 0,

which further implies

A A^{*} - A^{*} A = 0

, i.e., (h) holds. Conversely, pre-multiplying

A A^{*} = A^{*} A

with A and

A^{2}

and applying the given condition yield

A^{2} A^{*} = A A^{*} A = A^{*} A^{2}

and

A^{3} A^{*} = A^{2} A^{*} A = A^{*} A^{3}

, as required for the second and third equalities on the right-hand side. Pre-and post-multiplying

A A^{*} = A^{*} A

with A and

A^{*}

yields

A^{2} {(A^{2})}^{*} = {(A A^{*})}^{2}

, which implies that

r (A^{2}) = r (A^{2} {(A^{2})}^{*}) = r ({(A A^{*})}^{2}) = r (A A^{*}) = r (A)

, since

A A^{*}

and

A^{2} {(A^{2})}^{*}

are positive semi-definite. Thus, Condition (h) implies Condition (g).

Note from Condition (i) that the two normal matrices

A^{2}

and

A^{3}

commute. Then, we first obtain from Lemma 5 (I) that there exists an unitary matrix P such that

A^{2} = P B P^{*}

and

A^{3} = P C P^{*}

hold, where B and C are two diagonal matrices. In this case,

{(A^{2})}^{†} = P B^{†} P^{*}

by the definition of the Moore–Penrose inverse, where

B^{†}

is diagonal as well. Further, we obtain from

r (A^{2}) = r (A)

and

A^{2} {(A^{2})}^{*} = {(A^{2})}^{*} A^{2}

that

R (A) = R (A^{2}) = R (A^{2} {(A^{2})}^{*}) = R ({(A^{2})}^{*} A^{2}) = R ({(A^{2})}^{*}) = R (A^{*})

. Hence,

A A^{†} = A^{†} A

and

{(A^{2})}^{†} A^{2} = A^{†} A

hold by Lemma 5 (II) and (III). Consequently, we obtain from the above facts the following result:

A = A A^{†} A = A^{†} A^{2} = {(A^{2})}^{†} A^{3} = P B^{†} P^{*} P C P^{*} = P B^{†} C P^{*},

where the product

B^{†} C

of the two diagonal matrices is also diagonal. This fact means that

A A^{*} = P {(B^{†} C)}^{*} B C P^{*} = P B C {(B^{†} C)}^{*} P^{*} = A^{*} A

holds. Thus, Condition (i) implies Condition (h). The equivalences of Conditions (h), (j), (k), (l), and (m) can be established by a similar approach. □

Corollary 4.

Let

A \in C^{m \times m}

and denote

B = A {(A^{2})}^{*} A .

Then,

r (B^{2}) = r (B) = r (A^{2})

holds, and the following 36 conditions are equivalent:

(a): $(A A^{*}) (A^{*} A) = (A^{*} A) (A A^{*}),$ namely, $A {(A^{2})}^{*} A$ is Hermitian.
(b): $B B^{*} B$ is Hermitian.
(c): $B^{2}$ and $B^{3}$ are Hermitian.
(d): $B^{2}$ and $B^{5}$ are Hermitian.
(e): $B^{3}$ and $B^{4}$ are Hermitian.
(f): $B^{3}$ and $B^{5}$ are Hermitian.
(g): ${(B B^{*})}^{3} = {(B^{*} B)}^{3}$ and ${(B B^{*})}^{4} B = {(B^{*} B)}^{4} B^{*} .$
(h): ${(B B^{*})}^{4} B = {(B^{*} B)}^{4} B^{*}$ and ${(B B^{*})}^{6} = {(B^{*} B)}^{6} .$
(i): $B^{3} = B B^{*} B .$
(j): $B^{†} = {(B^{†})}^{*} .$
(k): $B^{*} B^{†} B^{*} = B^{*} .$
(l): $B^{†} B^{*} B^{†} = B^{†} .$
(m): $B^{†} = {(B^{#})}^{*} .$
(n): $B^{#} = {(B^{#})}^{*} .$
(o): $B^{*} B^{#} B^{*} = B^{*} .$
(p): $B^{†} B^{*} B^{†} = B^{#} .$
(q): $B^{#} B^{*} B^{#} = B^{†} .$
(r): $B^{#} B^{*} B^{#} = B^{#} .$
(s): $B^{*} B^{†} B^{*} B^{†} B^{*} = B^{*} B^{†} B^{*} .$
(t): $B^{*} B^{#} B^{*} B^{#} B^{*} = B^{*} B^{#} B^{*} .$
(u): $B^{†} B^{*} B^{†} B^{*} B^{†} = B^{†} B^{*} B^{†} .$
(v): $B^{†} B^{*} B^{†} B^{*} B^{†} = B^{#} B^{*} B^{#} .$
(w): $B^{†} B^{*} B^{#} B^{*} B^{†} = B^{†} B^{*} B^{†} .$
(x): $B^{†} B^{*} B^{#} B^{*} B^{†} = B^{#} B^{*} B^{#} .$
(y): $B^{#} B^{*} B^{†} B^{*} B^{#} = B^{†} B^{*} B^{†} .$
(z): $B^{#} B^{*} B^{†} B^{*} B^{#} = B^{#} B^{*} B^{#} .$
(a1): $B^{#} B^{*} B^{#} B^{*} B^{#} = B^{†} B^{*} B^{†} .$
(b1): $B^{#} B^{*} B^{#} B^{*} B^{#} = B^{#} B^{*} B^{#} .$
(c1): $r [\begin{matrix} B & B^{*} \\ B^{*} B & B B^{*} \end{matrix}] = r (B) .$
(d1): $r [\begin{matrix} B B^{†} & B^{†} B \\ B^{*} & B \end{matrix}] = r (B) .$
(e1): $R [\begin{matrix} B \\ B^{*} B \end{matrix}] = R [\begin{matrix} B^{*} \\ B B^{*} \end{matrix}] .$
(f1): $R [\begin{matrix} B B^{†} \\ B^{*} \end{matrix}] = R [\begin{matrix} B^{†} B \\ B \end{matrix}] .$
(g1): $r [\begin{matrix} B & B^{*} \\ {(B^{*} B)}^{2} & {(B B^{*})}^{2} \end{matrix}] = r (B) .$
(h1): $r [\begin{matrix} B B^{†} & B^{†} B \\ B^{*} B B^{*} & B B^{*} B \end{matrix}] = r (B) .$
(i1): $R [\begin{matrix} B \\ {(B^{*} B)}^{2} \end{matrix}] = R [\begin{matrix} B^{*} \\ {(B B^{*})}^{2} \end{matrix}] .$
(j1): $R [\begin{matrix} B B^{†} \\ B^{*} B B^{*} \end{matrix}] = R [\begin{matrix} B^{†} B \\ B B^{*} B \end{matrix}] .$

Proof.

Replacing A with

A {(A^{*})}^{2} A

in Theorems 2, 18, and 20 leads to the equivalences of Conditions (a)–(j1). □

Corollary 5.

Let

A \in C^{m \times m} .

Then, the following five conditions are equivalent:

(a): $r [\begin{matrix} A & A^{*} \\ A {(A^{*})}^{2} A^{2} & A^{*} A^{2} {(A^{*})}^{2} \end{matrix}] = r (A) .$
(b): $r [\begin{matrix} A A^{†} & A^{†} A \\ A {(A^{*})}^{2} A & A^{*} A^{2} A^{*} \end{matrix}] = r (A) .$
(c): $R [\begin{matrix} A \\ A {(A^{*})}^{2} A^{2} \end{matrix}] = R [\begin{matrix} A^{*} \\ A^{*} A^{2} {(A^{*})}^{2} \end{matrix}] .$
(d): $R [\begin{matrix} A A^{†} \\ A {(A^{*})}^{2} A \end{matrix}] = R [\begin{matrix} A^{†} A \\ A^{*} A^{2} A^{*} \end{matrix}] .$
(e): $R (A) = R (A^{*})$ and $(A A^{*}) (A^{*} A) = (A^{*} A) (A A^{*}) .$

Proof.

Replacing

B = A {(A^{*})}^{2}

and

C = A^{*} A^{2}

in Theorem 15 leads to the equivalences of Conditions (a)–(e). □

Corollary 6.

Let

A \in C^{m \times m} .

Then, the following five conditions are equivalent:

(a): $r [\begin{matrix} A & A^{*} \\ A^{k} A^{*} A & A^{*} A^{k} A^{*} \end{matrix}] = r (A)$ for any integer $k \geq 1 .$
(b): $r [\begin{matrix} A A^{†} & A^{†} A \\ A^{k} A^{*} & A^{*} A^{k} \end{matrix}] = r (A)$ for any integer $k \geq 1 .$
(c): $R [\begin{matrix} A \\ A^{k} A^{*} A \end{matrix}] = R [\begin{matrix} A^{*} \\ A^{*} A^{k} A^{*} \end{matrix}]$ for any integer $k \geq 1 .$
(d): $R [\begin{matrix} A A^{†} \\ A^{k} A^{*} \end{matrix}] = R [\begin{matrix} A^{†} A \\ A^{*} A^{k} \end{matrix}]$ for any integer $k \geq 1 .$
(e): $R (A) = R (A^{*}) a n d A^{k} A^{*} = A^{*} A^{k}$ for any integer $k \geq 1 .$

Proof.

Replacing

B = A^{k}

and

C = A^{*} A^{k - 1}

in Theorem 15 leads to the equivalences of Conditions (a)–(e). □

Finally, we present a group of identifying conditions for the commutativity of two Hermitian matrices and leave their proofs for the reader.

Corollary 7.

Let

A, B \in C^{m \times m}

be two Hermitian matrices, and denote

C = A B .

Then,

r (C^{2}) = r (C)

holds, and the following 41 statements are equivalent:

(a): $A B = B A,$ namely, $A B$ is Hermitian.
(b): $A^{†} B = B A^{†} .$
(c): $A B^{†} = B^{†} A .$
(d): $A^{†} B^{†} = B^{†} A^{†} .$
(e): $A^{2} B A = A B A^{2}$ and $R (B A) \subseteq R (A) .$
(f): $B^{2} A B = B A B^{2}$ and $R (A B) \subseteq R (B) .$
(g): ${(A B)}^{2}$ and ${(A B)}^{3}$ are Hermitian.
(h): ${(A B)}^{2}$ and ${(A B)}^{5}$ are Hermitian.
(i): ${(A B)}^{3}$ and ${(A B)}^{4}$ are Hermitian.
(j): ${(A B)}^{3}$ and ${(A B)}^{5}$ are Hermitian.
(k): $C C^{*} C = C^{*} C C^{*} .$
(l): ${(C C^{*})}^{3} = {(C^{*} C)}^{3}$ and ${(C C^{*})}^{4} C = {(C^{*} C)}^{4} C^{*} .$
(m): ${(C C^{*})}^{4} C = {(C^{*} C)}^{4} C^{*}$ and ${(C C^{*})}^{6} = {(C^{*} C)}^{6} .$
(n): $C^{3} = C C^{*} C .$
(o): $C^{†} = {(C^{†})}^{*} .$
(p): $C^{*} C^{†} C^{*} = C^{*} .$
(q): $C^{†} C^{*} C^{†} = C^{†} .$
(r): $C^{†} = {(C^{#})}^{*} .$
(s): $C^{#} = {(C^{#})}^{*} .$
(t): $C^{*} C^{#} C^{*} = C^{*} .$
(u): $C^{†} C^{*} C^{†} = C^{#} .$
(v): $C^{#} C^{*} C^{#} = C^{†} .$
(w): $C^{#} C^{*} C^{#} = C^{#} .$
(x): $C^{*} C^{†} C^{*} C^{†} C^{*} = C^{*} C^{†} C^{*} .$
(y): $C^{*} C^{#} C^{*} C^{#} C^{*} = C^{*} C^{#} C^{*} .$
(z): $C^{†} C^{*} C^{†} C^{*} C^{†} = C^{†} C^{*} C^{†} .$
(a1): $C^{†} C^{*} C^{†} C^{*} C^{†} = C^{#} C^{*} C^{#} .$
(b1): $C^{†} C^{*} C^{#} C^{*} C^{†} = C^{†} C^{*} C^{†} .$
(c1): $C^{†} C^{*} C^{#} C^{*} C^{†} = C^{#} C^{*} C^{#} .$
(d1): $C^{#} C^{*} C^{†} C^{*} C^{#} = C^{†} C^{*} C^{†} .$
(e1): $C^{#} C^{*} C^{†} C^{*} C^{#} = C^{#} C^{*} C^{#} .$
(f1): $C^{#} C^{*} C^{#} C^{*} C^{#} = C^{†} C^{*} C^{†} .$
(g1): $C^{#} C^{*} C^{#} C^{*} C^{#} = C^{#} C^{*} C^{#} .$
(h1): $r [\begin{matrix} C & C^{*} \\ C^{*} C & C C^{*} \end{matrix}] = r (C) .$
(i1): $r [\begin{matrix} C C^{†} & C^{†} C \\ C^{*} & C \end{matrix}] = r (C) .$
(j1): $R [\begin{matrix} C \\ C^{*} C \end{matrix}] = R [\begin{matrix} C^{*} \\ C C^{*} \end{matrix}] .$
(k1): $R [\begin{matrix} C C^{†} \\ C^{*} \end{matrix}] = R [\begin{matrix} C^{†} C \\ C \end{matrix}] .$
(l1): $r [\begin{matrix} C & C^{*} \\ {(C^{*} C)}^{2} & {(C C^{*})}^{2} \end{matrix}] = r (C) .$
(m1): $r [\begin{matrix} C C^{†} & C^{†} C \\ C^{*} C C^{*} & C C^{*} C \end{matrix}] = r (C) .$
(n1): $R [\begin{matrix} C \\ {(C^{*} C)}^{2} \end{matrix}] = R [\begin{matrix} C^{*} \\ {(C C^{*})}^{2} \end{matrix}] .$
(o1): $R [\begin{matrix} C C^{†} \\ C^{*} C C^{*} \end{matrix}] = R [\begin{matrix} C^{†} C \\ C C^{*} C \end{matrix}] .$

5. Conclusions

As one of the fundamental problems in mathematics, algebraists love to construct, classify, and characterize various possible algebraic equalities according to the pre-assumed definitions and operation rules in a given algebraic framework. However, increasingly complicated problems related to these problems in matrix equalities have created the essential need for sophisticated methods that go beyond standard contents in matrix theory. As a new attempt, we gave an innovative approach to this kind of problem by establishing a variety of explicit expansion formulas for calculating the ranks of block matrices and displaying their fruitful consequences and applications in the complex matrix theory. As displayed in the preceding sections, establishing non-trivial exact matrix rank equalities properly requires careful designs and experiences. We believe that the algebraic methodologies and the fruitful results and facts developed in this paper can bring some profound mathematical insights to the performance and properties of matrix expressions and matrix equalities and hope that they can be employed in the algebraic analysis of various matrix expression and equality problems that have occurred in mathematics and other applications. In addition, we expect to get more mileage out of the ideas and findings by searching for other situations in which the work can be used or extended. As one such reasonable consideration, it is not difficult to symbolically extend the main contributions in this paper to some general algebraic settings, such as rings and operator algebras, in which generalized inverses of elements can be defined accordingly.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Acknowledgments

The author would like to express his sincere thanks to the handling editor and anonymous reviewers for their helpful comments and suggestions.

Conflicts of Interest

The author declares no conflict of interest.

References

Ben-Israel, A.; Greville, T.N.E. Generalized Inverses: Theory and Applications, 2nd ed.; Springer: New York, NY, USA, 2003. [Google Scholar]
Campbell, S.L.; Meyer, C.D. EP operators and generalized inverses. Can. Math. Bull. 1975, 18, 327–333. [Google Scholar] [CrossRef]
Cvetković-Ilić, D.S.; Wei, Y. Algebraic Properties of Generalized Inverses; Springer: Singapore, 2017. [Google Scholar]
Basavappa, P. On the solutions of the matrix equation f(X, X^*) = g(X, X^*). Canad. Math. Bull. 1972, 15, 45–49. [Google Scholar] [CrossRef]
McCullough, S.A.; Rodman, L. Hereditary classes of operators and matrices. Am. Math. Mon. 1997, 104, 415–430. [Google Scholar] [CrossRef]
Marsaglia, G.; Styan, G.P.H. Equalities and inequalities for ranks of matrices. Linear Multilinear Algebra 1974, 2, 269–292. [Google Scholar] [CrossRef]
Tian, Y. Reverse order laws for the generalized inverses of multiple matrix products. Linear Algebra Appl. 1994, 211, 185–200. [Google Scholar] [CrossRef]
Penrose, R. A generalized inverse for matrices. Proc. Camb. Philos. Soc. 1955, 51, 406–413. [Google Scholar] [CrossRef]
Opincariu, M.; Pop, V. Problem 12416. Am. Math. Mon. 2023, 130, 765. [Google Scholar]
Campbell, S.L. Linear operators for which T^*T and TT^* commute. Proc. Am. Math. Soc. 1972, 34, 177–180. [Google Scholar] [CrossRef]
Campbell, S.L. Linear operators for which T^*T and TT^* commute II. Pac. J. Math. 1974, 53, 355–361. [Google Scholar] [CrossRef]
Cheng, S.; Tian, Y. Two sets of new characterizations for normal and EP matrices. Linear Algebra Appl. 2003, 375, 181–195. [Google Scholar] [CrossRef]
Djordjević, D.S. Characterizations of normal, hyponormal and EP operators. J. Math. Anal. Appl. 2007, 329, 1181–1190. [Google Scholar] [CrossRef]
Hartwig, R.E.; Katz, I.J. Products of EP elements in reflexive semigroups. Linear Algebra Appl. 1976, 14, 11–19. [Google Scholar] [CrossRef]
Hartwig, R.E.; Spindelböck, K. Matrices for which A^* and A^† can commute. Linear Multilinear Algebra 1983, 14, 241–256. [Google Scholar] [CrossRef]
Pearl, M.H. On normal and EPr matrices. Mich. Math. J. 1959, 6, 1–5. [Google Scholar] [CrossRef]
Mosić, D.; Djordjević, D.S. New characterizations of EP, generalized normal and generalized Hermitian elements in rings. Appl. Math. Comput. 2012, 218, 6702–6710. [Google Scholar] [CrossRef]
Ding, J.; Rhee, N.H. Spectral solutions of the Yang–Baxter matrix equation. J. Math. Anal. Appl. 2013, 402, 567–573. [Google Scholar] [CrossRef]
Ding, J.; Tian, H. Solving the Yang–Baxter-like matrix equation for a class of elementary matrices. Comput. Math. Appl. 2016, 72, 1541–1548. [Google Scholar] [CrossRef]
Ding, J.; Zhang, C.; Rhee, H.H. Commuting solutions of the Yang–Baxter matrix equation. Appl. Math. Lett. 2015, 44, 1–4. [Google Scholar] [CrossRef]
Tian, H. All solutions of the Yang–Baxter-like matrix equation for rank-one matrices. Appl. Math. Lett. 2016, 51, 55–59. [Google Scholar] [CrossRef]
Tian, Y. Some new characterizations of a Hermitian matrix and their applications. Complex Anal. Oper. Theory 2024, 18, 2. [Google Scholar] [CrossRef]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Tian, Y. Some New Algebraic Method Developments in the Characterization of Matrix Equalities. Axioms 2024, 13, 657. https://doi.org/10.3390/axioms13100657

AMA Style

Tian Y. Some New Algebraic Method Developments in the Characterization of Matrix Equalities. Axioms. 2024; 13(10):657. https://doi.org/10.3390/axioms13100657

Chicago/Turabian Style

Tian, Yongge. 2024. "Some New Algebraic Method Developments in the Characterization of Matrix Equalities" Axioms 13, no. 10: 657. https://doi.org/10.3390/axioms13100657

APA Style

Tian, Y. (2024). Some New Algebraic Method Developments in the Characterization of Matrix Equalities. Axioms, 13(10), 657. https://doi.org/10.3390/axioms13100657

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Some New Algebraic Method Developments in the Characterization of Matrix Equalities

Abstract

1. Introduction

2. Preliminary Results

3. How to Establish Equalities for Ranks of Block Matrices

4. Characterizations of Matrix Equalities by the Matrix Rank Methodology

5. Conclusions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI