1. Introduction
Let $\mathbb{R}^n$ denote the set of all real column vectors with n coordinates. A real matrix A is called non-negative, denoted by $A \geq 0$, if all its entries are non-negative. Let $\Omega_n$ denote the set of all doubly stochastic matrices of order n, i.e., real non-negative $n \times n$ matrices with each row sum and each column sum equal to 1. For a vector $x \in \mathbb{R}^n$, let $x^{\downarrow}$ denote the vector obtained from x by rearranging the coordinates in nonincreasing order. For vectors $x, y \in \mathbb{R}^n$, x is said to be majorized by y, denoted by $x \prec y$, if $\sum_{i=1}^{k} x^{\downarrow}_i \leq \sum_{i=1}^{k} y^{\downarrow}_i$ for all positive integers k such that $1 \leq k \leq n$, with equality when $k = n$.
It is well known that, for two vectors $x, y \in \mathbb{R}^n$, $x \prec y$ iff $x = Dy$ for some $D \in \Omega_n$.
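For concreteness, the partial-sum condition can be checked numerically. The following is a minimal illustrative sketch only; it assumes NumPy, and the helper name vector_majorized is chosen here for convenience.

```python
import numpy as np

def vector_majorized(x, y, tol=1e-9):
    """Check whether x is majorized by y (x ≺ y) via the partial-sum test."""
    xd = np.sort(np.asarray(x, dtype=float))[::-1]  # coordinates of x in nonincreasing order
    yd = np.sort(np.asarray(y, dtype=float))[::-1]  # coordinates of y in nonincreasing order
    partial = np.all(np.cumsum(xd) <= np.cumsum(yd) + tol)  # sums of the k largest entries
    total = abs(xd.sum() - yd.sum()) <= tol                  # equality for k = n
    return bool(partial and total)

# (2, 2, 2) ≺ (3, 2, 1): here x = Dy with D the 3x3 matrix all of whose entries are 1/3.
print(vector_majorized([2, 2, 2], [3, 2, 1]))  # True
print(vector_majorized([3, 2, 1], [2, 2, 2]))  # False
```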
In [1], the polytope of doubly stochastic matrices D for which $x = Dy$ was investigated. The notion of vector majorization is naturally extended to that of matrix majorization as follows. For two matrices X and Y in $\mathbb{R}^{m \times n}$, X is said to be majorized by Y, denoted by $X \prec Y$, if $X = DY$ for some doubly stochastic matrix D. In [2], a new notion of matrix majorization was introduced, called weak matrix majorization, where X is said to be weakly majorized by Y if there exists a row stochastic matrix R such that $X = RY$; a small computational sketch of this condition is given below. Additionally, relations between this concept and strong majorization (usual majorization) are considered. In [3], the polytope of row-stochastic matrices R for which $x = Ry$ was investigated, and generalizations of the results for vector majorization were obtained. The notion of matrix majorization was referred to as multivariate majorization in [4]. In [4], it was proved that, if for , then for any real matrix C with m rows.
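Whether such a stochastic factorization exists can, for small explicit matrices, be decided by linear programming, since all of the constraints on the unknown factor are linear. The following is a rough illustrative sketch (assuming NumPy and SciPy; the helper name weakly_majorized and the test matrices are our own choices); it tests the weak matrix majorization of [2], i.e., the existence of a row stochastic R with $X = RY$. Appending column-sum equality constraints on R would test for a doubly stochastic factor instead.

```python
import numpy as np
from scipy.optimize import linprog

def weakly_majorized(X, Y, tol=1e-9):
    """Decide whether X = RY for some row stochastic R (weak matrix majorization)."""
    X, Y = np.asarray(X, float), np.asarray(Y, float)
    m = X.shape[0]                                 # R is m x m; unknowns are the entries of R, row by row
    # Equality constraints: vec(RY) = vec(X) and every row sum of R equals 1.
    A_prod = np.kron(np.eye(m), Y.T)               # maps vec(R) (row-major) to vec(RY) (row-major)
    A_rows = np.kron(np.eye(m), np.ones((1, m)))   # row sums of R
    A_eq = np.vstack([A_prod, A_rows])
    b_eq = np.concatenate([X.ravel(), np.ones(m)])
    res = linprog(c=np.zeros(m * m), A_eq=A_eq, b_eq=b_eq,
                  bounds=[(0, None)] * (m * m), method="highs")
    return res.success

Y = np.array([[3.0, 0.0], [0.0, 3.0], [0.0, 0.0]])
X = np.array([[2.0, 1.0], [1.0, 2.0], [1.5, 1.5]])  # each row of X is a convex combination of rows of Y
print(weakly_majorized(X, Y))  # True
```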
For X, Y in $\mathbb{R}^{m \times n}$, if $X \prec Y$, then certainly $Xv \prec Yv$ for all $v \in \mathbb{R}^n$. However, the converse does not hold in general, as simple examples show.
For $x, y \in \mathbb{R}^n$, consider the set of all $D \in \Omega_n$ satisfying $x = Dy$. This set is known to contain various special types of doubly stochastic matrices [5,6]. While quite a lot of progress has been made in the theory of vector majorization, very little is known about multivariate majorization. In [7], it is proved that X is majorized by Y if $Xv$ is majorized by $Yv$ for every real n-vector v, under the assumption that a certain expression involving e and $Y^{\dagger}$ is nonnegative, where e denotes the m-vector of ones and $Y^{\dagger}$ denotes the Moore–Penrose generalized inverse of Y. In [8], a new matrix majorization order for classes (sets) of matrices was introduced, which generalizes several existing notions of matrix majorization.
Matrix majorization has several applications in mathematical statistics. Some applications of matrix majorization were discussed by Marshall, Olkin and Arnold [4] and by Tong [9]. Majorization of (0,1) matrices has important applications in classification theory and principal/dominant component analysis. Eduard Jorswieck and Holger Boche [10] reviewed the basic definitions of majorization theory and matrix monotone functions, described these concepts clearly with many illustrative examples, and then showed their applications in wireless communications. In [11], a majorization-based algorithm was developed for the problem of finding a low-rank correlation matrix nearest to a given correlation matrix. The problem of rank reduction of correlation matrices occurs when pricing a derivative that depends on a large number of assets, where the asset prices are modeled as correlated log-normal processes; such applications mainly concern interest rates. Matrix majorization also has applications in the comparison of eigenvalues [12].
Hwang and Park [13] obtained some necessary and sufficient conditions for two matrices X and Y to satisfy $X \prec Y$ for the case that the rank of Y is any one of or n and for the case that . In Section 2, we obtain some necessary and sufficient conditions for X and Y to satisfy $X \prec Y$ for the case that the rank of Y is and for the general case that the rank of Y is k. We also obtain some necessary and sufficient conditions for X to be majorized by Y with some conditions on X and Y. Additionally, we obtain some necessary and sufficient conditions for X to be doubly stochastic majorized by Y with some conditions on X and Y. Dahl, Guterman and Shteyner [14] obtained several results concerning matrix majorizations of (0,1) matrices and characterizations for certain matrix majorization orders. We extend these results for (0,1) matrices in Section 3. We introduce a new concept of column stochastic majorization: a matrix X is said to be column stochastic majorized by Y if there exists a column stochastic matrix S such that $X = SY$. We obtain some characterizations for column stochastic majorization and doubly stochastic majorization of (0,1) matrices.
2. Matrix Majorization
Let $I_n$ denote the identity matrix of order n. For two real matrices A and B of the same size, let $A \geq B$ (resp. $A \leq B$) denote that each entry of A is greater (resp. less) than or equal to the corresponding entry of B. Let e denote the all-ones vector in $\mathbb{R}^l$. For a matrix A, let and denote the sum of all of the entries of A, the row sum vector of A and the column sum vector of A, respectively. A vector v is called a stochastic vector if $v \geq 0$ and the sum of its entries is equal to 1.
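These quantities are straightforward to compute; the following small sketch (our own illustration, assuming NumPy; the helper name is_stochastic_vector and the matrix are chosen for this example) evaluates them for a concrete matrix and checks the stochastic-vector property.

```python
import numpy as np

A = np.array([[1.0, 0.0, 2.0],
              [0.0, 3.0, 1.0]])

entry_sum = A.sum()        # sum of all entries of A
row_sums = A.sum(axis=1)   # row sum vector of A
col_sums = A.sum(axis=0)   # column sum vector of A
print(entry_sum, row_sums, col_sums)   # 7.0 [3. 4.] [1. 3. 3.]

def is_stochastic_vector(v, tol=1e-9):
    """A stochastic vector is entrywise nonnegative with entries summing to 1."""
    v = np.asarray(v, float)
    return bool(np.all(v >= -tol) and abs(v.sum() - 1.0) <= tol)

print(is_stochastic_vector([0.2, 0.5, 0.3]))  # True
```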
Theorem 1. Let Then if and only if the following hold.
Proof. Let us assume . Then there exists satisfying It is easy to see that . Let with Then Since , we obtain that there exist stochastic vectors and such that Since , we obtain that there exist stochastic vectors and such that . Conversely, suppose (1) and (2) hold. For the vectors in (2), let , where . Then . It remains to show that . We see from that From (1) and (2) we have Additionally, since and This implies that □
The above theorem can be extended to any Y of rank k, as follows.
Theorem 2. Let Then if and only if the following hold.
There exist stochastic vectors such that and
Proof. The proof is similar to that of Theorem 1, extended to k vectors. □
In the next theorem, we give a necessary condition for $X \prec Y$, for any two matrices X and Y.
Theorem 3. Let If then there exists such that for and
Proof. Suppose so that for some Let be a matrix. Then the j-th element in the row vector is . Let , …
This implies
Since □
Example 1. Let and Then and satisfy the required conditions of the theorem, and so this example validates Theorem 3.
The converse of Theorem 3 is not true.
Example 2. Let and .
Take and Then and satisfy the conditions of the above theorem, but there exists no matrix S such that Assume that there exists a matrix S such that Since when we multiply the first row of S with the first column of Y, we obtain This implies that Similarly, for we have This implies that Similarly, for we have This implies that and However, this is a contradiction to Therefore, there exists no such matrix S such that .
In the next theorem, we give sufficient conditions for $X \prec Y$, for any two matrices X and Y.
Theorem 4. Let . A sufficient condition for X to be majorized by Y is that and for and there exists such that for and for
Proof. Assume that and are such that the conditions given in the theorem hold. Let Then S is a doubly stochastic matrix. When we multiply the first row of S with the first column of Y, we obtain Hence, This implies that X is majorized by Y. □
Example 3. Let and
Then X and Y satisfy the conditions of Theorem 4, and so X is majorized by Y. Take and Then and So X is majorized by Y.
A matrix X is said to be doubly stochastic majorized by Y, denoted by , when there is such that . In the next theorem, we prove a necessary condition for X to be doubly stochastic majorized by Y.
Theorem 5. Let If then there exists such that for and
Proof. Suppose so that for some The j-th element of the vector is Let
This implies, Since □
Example 4. Let and Then and satisfy the required conditions of the theorem, and so this example validates Theorem 5.
The converse of Theorem 5 is not true.
Example 5. Let and Take and Then and satisfy the conditions of the above theorem, but there exists no matrix such that Assume that there exists a matrix S such that Since when we multiply the first row of Y with the first column of we obtain This implies that Similarly, for we have This implies that Similarly, for we have This implies that which is a contradiction since and Therefore, there exists no such matrix S such that .
Theorem 6. Let A sufficient condition for X to be doubly stochastic majorized by Y is that and for and there exist such that for and
Proof. The proof is similar to the proof of Theorem 4. □
Example 6. Let and Then X and Y satisfy the conditions of Theorem 6, and so X is doubly stochastic majorized by Y.
Take and Then and So X is doubly stochastic majorized by Y.
3. Majorization for (0,1) Matrices
There are two main motivations for the study of matrix majorization for (0,1) matrices. First, it is of interest to see if this restriction to the subclass of (0,1) matrices leads to simpler characterizations of the majorization order in question. Secondly, (0,1) matrices are essential to represent combinatorial objects, and therefore one may look at the meaning of such a matrix majorization order for these objects (each associated with a (0,1) matrix). In [14], weak, directional and strong majorizations of (0,1) matrices are characterized, and matrix majorization on (0,1) matrices is investigated.
Definition 1. A matrix X is said to be column stochastic majorized by Y if there exists a column stochastic matrix S such that $X = SY$.
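As an illustration of Definition 1, the following minimal sketch (assuming NumPy; the helper name is_column_stochastic and the matrices are our own toy choices) verifies a small instance of $X = SY$ with a column stochastic S, and also confirms the column-sum identity $e^{T}X = e^{T}Y$ used in the next theorem.

```python
import numpy as np

def is_column_stochastic(S, tol=1e-9):
    """Nonnegative with every column summing to 1, i.e. e^T S = e^T."""
    S = np.asarray(S, float)
    return bool(np.all(S >= -tol) and np.allclose(S.sum(axis=0), 1.0, atol=tol))

S = np.array([[0.5, 0.0, 1.0],
              [0.5, 0.5, 0.0],
              [0.0, 0.5, 0.0]])
Y = np.array([[1.0, 0.0],
              [1.0, 1.0],
              [0.0, 1.0]])
X = S @ Y                                         # X = SY, so X is column stochastic majorized by Y
print(is_column_stochastic(S))                    # True
print(np.allclose(X.sum(axis=0), Y.sum(axis=0)))  # True: e^T X = e^T S Y = e^T Y
```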
Theorems 7 and 8 give characterizations of these majorization orders for (0,1) matrices.
Theorem 7. Let X and Y be (0,1) matrices such that X is column stochastic majorized by Y. Then, for every i, the number of 1s in the i-th column of X is equal to the number of 1s in the i-th column of Y.
Proof. By assumption, $X = SY$ for some column stochastic matrix S. Then $e^{T}X = e^{T}SY = e^{T}Y$, where we use that $e^{T}S = e^{T}$ since S is column stochastic. Since both matrices are (0,1) matrices, the i-th entry of $e^{T}X$ is the number of 1s in the i-th column of X. The same holds for Y. Hence the result follows. □
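Numerically, the invariant of Theorem 7 is simply the vector of column sums, since for a (0,1) matrix the i-th column sum counts the 1s in the i-th column. A small check (assuming NumPy; the matrices below are our own toy example, with S taken to be a permutation matrix, which is column stochastic):

```python
import numpy as np

Y = np.array([[1, 0],
              [0, 1],
              [0, 0]])
P = np.array([[0, 0, 1],
              [1, 0, 0],
              [0, 1, 0]])       # a permutation matrix, hence column stochastic
X = P @ Y                       # X = PY, so X is column stochastic majorized by Y

def ones_per_column(M):
    return M.sum(axis=0)        # for a (0,1) matrix this counts the 1s in each column

print(ones_per_column(X), ones_per_column(Y))   # [1 1] [1 1], equal as Theorem 7 predicts

Z = np.array([[1, 0],
              [1, 0],
              [0, 0]])
# Z has column counts [2 0] != [1 1], so by Theorem 7 no column stochastic S gives Z = SY.
print(ones_per_column(Z))       # [2 0]
```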
Theorem 8. Let Assume but Then
- (i)
There exists a column stochastic matrix S satisfying $X = SY$ such that S contains a zero row, and for each row sum of S it holds that either or
- (ii)
If Y does not contain a zero row, then for any column stochastic S satisfying $X = SY$, it holds that S contains a zero row, and for each row sum of S it holds that or .
- (iii)
X contains a zero row.
Proof. (i) Suppose $X = SY$, where S is column stochastic. Since , the sum of all elements in S is Since by assumption there is i such that the ith row sum of and there is k such that the kth row sum of If then is a zero row. Indeed, each element of is less than or equal to Hence, it is zero. We are going to modify S in order to construct such that and is zero. Fix some k such that Suppose Since we obtain Consider arbitrary We consider the matrix which is obtained from S by changing to and to We do the same for the rest of the nonzero elements in Finally, we obtain such that is a zero row, and for We repeat this procedure for every q with After several such substitutions, we obtain such that for every q either or contains a zero row, as required.
(ii) Suppose Y does not contain a zero row. Let i be such that From , is zero, and all summands are non-negative; it follows that for all Finally, if then and S contains a zero row.
(iii) By (i) we obtain that contains a zero row and, as a consequence, contains a zero row. □
In [14], it was proved that if then , but matrix majorization cannot be described in terms of row/column inclusion. The following examples show that strong majorization also cannot be described in terms of row/column inclusion. The first example shows that column inclusion does not follow from , and the second one shows that the converse implication does not hold either.
Example 7. Let
Then so Also,
Example 8. Let and Then and But it is easy to verify that Suppose that there exists a matrix R such that Since Since it follows that Then and For similar reasons, and then R is not a row-stochastic matrix, a contradiction.