1. Introduction
Let
denote the space of all
n-square matrices over the complex field
and let
be the symmetric group of degree
n. For
, the permanent function is defined as:
The permanent function therefore resembles the determinant, which in turn is given by:
where
denotes the sign function.
Although not as prominent as the determinant, the permanent function is still a well-known matrix function, with many applications in combinatorics and graph theory. However, while the determinant can be easily computed, no efficient algorithm for computing the permanent is known. The difficulty for a direct computation of the permanent leads to the idea of trying to compute it by using determinants. This problem dates back to 1913 in a work by Pólya [
1], and it has been under intensive investigation since then. While it is clear that the permanent of a
matrix:
equals the determinant of the related matrix:
Szegö [
2] proved that for
, there is no way to generalize this procedure. That is, there is no uniform way of changing the signs in the entries of a matrix
in order to obtain a matrix
B satisfying
.
In [
3], Gibson proved that if
A is an
n-square
-matrix, and if the permanent of
A can be converted to a determinant by affixing ± signs to the elements of the matrix, then
A has at most
positive entries.
Later, Little [
4] proved that an
n-square
-matrix can be conveniently represented by means of a bipartite graph and reinterpreted the problem of characterizing convertible matrices as the problem of characterizing bipartite graphs whose 1-factors can be counted by using Pfaffians in the manner suggested by Kasteleyn [
5].
The computational complexity for determining if a
-matrix is convertible was studied by Vazirani and Yannakakis [
6] and Robertson, Seymour, and Thomas [
7], leading the latter authors to design a polynomial-time algorithm capable of determining whether or not the permanent of the matrix is convertible into a determinant.
In this article, we consider mostly
n-square
-matrices with the maximum number of positive entries
. For these cases, we present a procedure to determine whether or not a given matrix is convertible. Compared with previously available algorithms, this method does not rely on the associated bipartite graph, and is more efficient. This result is presented in
Section 4, where we introduce a new concept: the imprint. Before that, in
Section 3, we define Hessenberg-type matrices, generalizing the well-known notion of Hessenberg matrices, and present some preliminary results. Our main results are presented in
Section 5. We extend Fonseca’s result [
8] by presenting an explicit characterization of convertible Hessenberg-type matrices and corresponding subspaces. We conclude that convertible matrices can be reduced to a basic set.
2. Basic Definitions and Preliminary Results
Let us start with a brief summary of definitions and results that are subsequently used throughout the article.
In the present work, we exclusively consider square matrices, so we will drop the qualifier square in what follows. In general, the order n of the considered matrices is arbitrary, except when explicitly stated otherwise.
A matrix
is said to be convertible if there exists a
-matrix
, such that:
where ★ denotes the Hadamard product. As already mentioned in the Introduction, a convertible matrix of order
n has at most
nonzero entries [
3].
For a
-matrix
S of order
n, we define the associated coordinate subspace as:
It is clear that if
S is convertible then every element of
is also convertible, with respect to the same matrix
C; i.e., there exists a
-matrix
such that:
We will also say that is convertible.
A well-known set of convertible matrices is the set of Hessenberg matrices. The matrix
is said to be a lower (upper) Hessenberg matrix if
for
(
). In [
9], Gibson proved that the linear space of lower (or upper) Hessenberg matrices is a convertible subspace of
. In [
8], Fonseca extended Gibson’s result to a broader class of matrices. In the next section we introduce a definition that further extends the class of matrices considered in [
8].
Definition 1. Two matrices A and B are permutation equivalent if there exist permutation matrices P and Q such that . In such a case, we denoted it by .
It is clear that if A is convertible and , then B is also convertible.
Proposition 1. If a matrix is convertible, then it is sufficient to change at most signs in order to convert the permanent into the determinant.
Proof. Trivial by Lemma 3 and by the Theorem in [
3]. ☐
Let . We denote by the submatrix of A obtained after removing rows and columns .
The following result is given in the proof of Lemma 1 of [
3]:
Lemma 1. Let be a convertible -matrix, with , where is a -matrix . If , then is also convertible, with: A subspace version of this lemma follows immediately.
Proposition 2. If is a convertible subspace and , then is also a convertible subspace.
Corollary 1. If is a convertible subspace and are k nonzero elements of S, then is also a convertible subspace.
Proof. Trivial by induction. ☐
3. Hessenberg-Type Matrices
In this section we will extend further the class of matrices considered in [
8]. Throughout what follows, the adjective “lower” for the lower Hessenberg matrices will be dropped, since no ambiguity will appear.
Definition 2. An n-square matrix is a Hessenberg-type matrix if it has at most nonzero entries above the main diagonal. A coordinate subspace V of is said to be a Hessenberg-type subspace if there is a Hessenberg-type -matrix with if , such that: If S has exactly nonzero entries above the main diagonal, we call V a -Hessenberg-type subspace, or simply a -subspace if there is no ambiguity. If , then S is called a full Hessenberg-type matrix and the corresponding subspace is called a full Hessenberg-type subspace.
Note that a full Hessenberg-type -matrix has precisely the maximum allowed number of nonzero entries for convertibility.
Standard Hessenberg matrices are of course special cases of Hessenberg-type matrices. In particular, a matrix in a -subspace, with all nonzero entries located in the second upper diagonal, will be referred to as a full Hessenberg matrix.
Fonseca’s extension result [
8], concerning a particular Hessenberg-type subspace, can be stated as follows:
Theorem 1. [
8]
Let be a full Hessenberg-type -matrix, and , with . If the positions of the nonzero entries, above the main diagonal, satisfy:then is a convertible subspace. The -matrix such that for all satisfies: Example 1. For example, if , , and , then:and the coordinate subspace is convertible. Next, we present some results concerning nonfull Hessenberg-type subspaces.
Proposition 3. The -subspace is convertible.
Proof. Trivial by Theorem 1. ☐
Proposition 4. Let S be a -Hessenberg-type -matrix with a 1 in position , . S is convertible if and only if the position of the second 1 above the main diagonal is not in one of the following regions: Proof. Without loss of generality, let us consider the scheme below. Note that if
, then we only have region I.
Let us first suppose that the second 1 is in region I. Assume, without loss of generality, that the second 1 is in the upper-right corner of
S. If this is not the case, then there exists a submatrix
of
S satisfying this condition, and the non-convertibility of
implies the non-convertibility of
S. Note that
is an
-square matrix with
nonzero entries above the main diagonal—which cannot be convertible because it has more than
nonzero entries [
3]. Then, by Lemma 1, we conclude that
S is not convertible.
The situation where the second 1 is in region II is no different from the previous case, since one can interchange the role of the two 1’s.
Next, let us suppose that the second 1 is in region III. So, we have
and we may assume—without loss of generality—that
and
(an example of a matrix
S of this form is given in Equation (
16) below). If this is not the case, then as we argued for the region I case, there exists a submatrix
of
S satisfying this condition.
Since the number of nonzero entries of S is , it follows that is an -square matrix with nonzero entries, because there are nonzero entries eliminated in S. Since in region III, the number of nonzero entries of this -square matrix is greater than the maximum admissible , and is therefore not convertible. By Corollary 1, this implies that the initial matrix is also nonconvertible.
Finally, if the second 1 is in region IV, one can interchange the role of the 1’s above the main diagonal, thus falling in the previous case.
If the second 1 is not in any of the four regions, then we have three cases:
- •
if and (positions labeled by ⊙), then S is permutation equivalent to a convertible matrix by Theorem 1, permuting columns and .
- •
if and (positions labeled by ⊗), then S is permutation equivalent to a convertible matrix by Theorem 1, permuting rows and .
- •
otherwise (positions labeled by ∗) S is convertible by Theorem 1. ☐
Lemma 2. For , the number of convertible -subspaces with a nonzero entry at a fixed position , , is: Proof. Consider first the case
. By Proposition 4, the number of convertible
-subspaces is given by:
where
is the
nth triangular number.
For
the number of convertible
-subspaces is likewise given by:
☐
Proposition 5. For , the number of convertible -subspaces is .
Proof. The formula clearly holds for
. Let us prove it for general
n by induction. Suppose then that the formula holds for
n. Let
S be an arbitrary
-Hessenberg-type
-matrix. Two cases may occur: apart from the
position, there is either none or at least one nonzero entry in the first row of
S. In the first case, it is clear that the number of convertible subspaces of
coincides with the number of convertible subspaces of
, which is, by hypothesis:
In the second case, one applies Lemma 2 at fixed positions
. For position
the counting is provided by Formula (18), which gives:
For positions
,
, the counting is provided by Formula (
17). Summing over all different possibilities gives:
where the term
takes care of the double counting (since there may be two nonzero entries in the first row, apart from the position
). This sum can be easily performed, yielding:
Finally, summing all contributions (
21), (
22), and (
24) for the number of convertible
-subspaces, we get:
It follows by induction that the number of convertible -subspaces is . ☐
5. Charaterization of Full Hessenberg-Type Subspaces
In this section we consider full Hessenberg-type matrices and the corresponding subspaces. Our aim is to obtain an explicit characterization of convertible full Hessenberg-type subspaces.
Hessenberg-type matrices can be composed to produce new higher-dimensional matrices as follows.
Definition 4. Let and be two Hessenberg-type -matrices with dimensions m and n, respectively, such that the submatrix of formed by the lower-right corner coincides with the submatrix of formed by its upper-left corner. The k-overlap is the -dimensional matrix obtained by superposition of and by their main diagonal, overlapping the coincident submatrix, where the missing entries below and above the main diagonal are 1 and 0, respectively.
Note that the new matrix is not necessarily of the Hessenberg-type, as the following counting of nonzero entries above the main diagonal shows. Let a, b, and c respectively denote the number of 1’s above the main diagonal in , , and the common submatrix. The number of 1’s above the main diagonal in is then . In the least favorable case when both matrices and are full (i.e., and ), it follows that is Hessenberg-type only if .
Example 3. For the two following (full) Hessenberg-type matrices:two examples of overlaps are: In the first case, the matrix is Hessenberg-type, whereas in the second case it is not.
Lemma 4. Let and be two full Hessenberg-type -matrices. is a full Hessenberg-type -matrix if and only if the common matrix is a full Hessenberg-type -matrix.
Proof. If the matrix is full, the conclusion follows trivially from the counting below Definition 4. By the same counting, if is full, we have , and thus the number of nonzero entries above the main diagonal in the matrix is . ☐
Proposition 7. Let and be two full and convertible Hessenberg-type -matrices. If a k-overlap is a full Hessenberg-type matrix, then it is convertible.
Proof. Let and be two full and convertible Hessenberg-type -matrices of order and , respectively, and S the k-overlap matrix of order n, , which by hypothesis is a full Hessenberg-type matrix. If , then or , and it is therefore convertible.
Suppose that
. Without loss of generality, one can consider the scheme depicted below:
where the
overlap submatrix will be denoted by
. It follows from the previous lemma that
is a full Hessenberg-type matrix. Moreover, it follows from Corollary 1 that
is convertible, since
.
Let us calculate
. For the last
rows of
S, the number of nonzero entries in each row is obtained by adding
to each value of the first row of
,
Since
is a full convertible matrix, we have:
and the contribution of these
k rows to the first row of
is:
which correspond to the last
k values of the first row of
. It follows that the first
values of the first row of
correspond to the first
rows of
, and therefore to the first
rows of
S. Hence, after taking into account the first
values of the first row of
, the first row of
turns out to be:
Since a similar argument is clearly valid for the columns, we get:
and it follows from Proposition 6 that
S is convertible. ☐
We will now show that a small family of matrices generates—by means of the above defined overlap—all possible full convertible Hessenberg-type subspaces.
For
, consider the family of full
n-square Hessenberg-type
-matrices represented below:
where the
nonzero entries above the main diagonal are in the following regions: position
, positions
, …,
in the first line and positions
, …,
in the last column, where the integer
k can take any of the values
. In particular, if
, all the 1’s above the main diagonal are positioned in the first row and if
all the 1’s are positioned in the last column.
We will refer to these matrices as basic, with the understanding that for the (only) basic matrix is the -Hessenberg matrix.
Proposition 8. The basic matrices are convertible.
Proof. The basic matrices are permutation equivalent to the Hessenberg matrix (or coincide with it, for ). ☐
Proposition 9. If a full Hessenberg-type -matrix is convertible, then it is basic or an overlap of two or more basic matrices.
Proof. We will prove it by induction.
For
there are three convertible subspaces
, where:
The first matrix S is a 1-overlap of two 2-square -hessenberg matrices, whereas the last two are basic.
Suppose now that the proposition is valid for n. Note first that validity for n actually guarantees validity for , and therefore for , with , for the following reason. Suppose there is a -square full convertible Hessenberg-type -matrix that is neither basic nor an overlap of basic matrices. The 1-overlap of this matrix (by means of the diagonal position ) with the basic matrix is then a full Hessenberg-type matrix of order n, and is convertible by Proposition 7, but fails to be basic or an overlap of basic matrices, which is a contradiction.
Let then be an -square full convertible Hessenberg-type -matrix. In what follows, we will repeatedly use Proposition 6, which establishes that for all integer , there is exactly one row and one column of S with exactly ℓ nonzero entries. Proposition 2 and Corollary 1 are also used.
Let , be the number of 1’s in the first row of S. It follows that rows 2, 3, …, until row k are fully determined. In particular, for these rows, for . Moreover, all entries in the first row at positions with must be 1’s, for the following reason. Matrices are convertible , and must therefore have less nonzero entries above the diagonal than S. However, for (and ), no 1’s would be removed from S, unless they are at positions . So, , …, in S are necessarily 1’s, with the remaining 1 in the first row appearing in an arbitrary position , (of course, the above constraints are void for ). Note that the column is also completely determined: it is already established that for , and it follows from Proposition 6 that the remaining entries must be 1’s. Now, if , we are done, since S is then seen to be basic (in particular, this includes the case ). Otherwise, we have and let be the square matrix formed by the first m rows and columns of S. This matrix is clearly convertible, by Corollary 1. On the other hand, since the combined number of 1’s above the diagonal in the first row and last column of is clearly , it follows that the maximum number of admissible nonzero entries is saturated, showing that is of the basic type. Furthermore, is certainly Hessenberg-type, convertible, and full, since the number of 1’s above the diagonal in is precisely , given that exactly k nonzero entries above the diagonal are removed from S. It is clear that S is an -overlap of with , thus completing our proof, since is, by hypothesis, basic or an overlap of basic matrices. ☐
We will conclude by counting the number of convertible full Hessenberg-type subspaces of order n, or the corresponding full convertible Hessenberg-type -matrices. Given an -subspace, there are different combinations of distributing the nonzero entries above the diagonal, where is the th triangular number. Of course, not all of them correspond to a convertible subspace. The number of convertible subspaces is established by the following proposition.
Proposition 10. For , there are different full convertible Hessenberg-type -matrices of order n.
Proof. We will prove it by induction. The hypothesis is clearly verified for . Suppose then that the proposition is valid for n and let us consider the case. It follows again from Proposition 6 that full convertible Hessenberg-type -matrices S fall in two classes: there are either two 1’s in the first row, with one of them necessarily at position , or two 1’s in the second row, necessarily at positions and . In the first case, assigning the remaining 1 to the position puts no restriction on the (full convertible n-square Hessenberg-type) matrix , and so there are as many possibilities for S as there are for (i.e., ). Assigning the remaining 1 to one of the positions , , restricts the form of , but one can easily convince oneself that the reunion of all these cases exhausts all possibilities for , so one again gets possibilities. In the second case, it follows from the arguments in the proof of Proposition 9 that there must be a 1 at position . In this case, there is no restriction on the matrix , and one gets again possibilities. So, there are full Hessenberg-type convertible matrices of order , thus completing the proof. ☐