1. Introduction
Using concepts from spectral graph theory, in this paper, we document new theoretical results on Geary’s
c, which is one of the most-prominent numerical measures of spatial autocorrelation. (Moran’s
I is another prominent numerical measure. For the measure, see, e.g., [
1,
2].) Here, spatial autocorrelation describes the similarity between signals on adjacent vertices and is central to spatial science ([
3]). Therefore, the results about the measure contribute to the development of spatial science.
More specifically, we provide a new representation of Geary’s c. It is an expansion of it into a linear combination of variables with different degrees of spatial autocorrelation. By using the distribution of the coefficients, we can characterize spatial data. It is somewhat similar to Fourier series. Subsequently, we develop a way to compute the graph Laplacian eigenvectors needed for the graph Fourier transform. MATLAB/GNU Octave user-defined functions are also provided.
This paper can be considered complementary to [
4]. As in this paper, using concepts from spectral graph theory, [
4] provided three types of representations for it: (a) graph Laplacian representation, (b) graph Fourier transform representation, and (c) Pearson’s correlation coefficient representation. Our new representation can be regarded as an addition to them. Moreover, the way to compute the graph Laplacian eigenvectors is useful not only for this paper, but also for [
4].
We make two remarks on Geary’s
c. First, Geary’s
c was developed by [
5] and modified by [
6,
7,
8,
9]) into what it is today. It is a spatial generalization of the von Neumann ratio ([
10]). Second, unlike Pearson’s correlation coefficient, there exists the following:
See, e.g., [
11] (Equation (
6)).
This paper is organized as follows. In
Section 2, we provide some preliminaries for the following two sections. In
Section 3, we present a new representation of Geary’s
c. In
Section 4, we develop a way to compute the graph Laplacian eigenvectors needed for the graph Fourier transform.
Section 5 concludes. In
Appendix A, we provide MATLAB/GNU Octave user-defined functions.
Some Notations
Let , be the identity matrix of order n and be the n-dimensional column vector of ones, i.e., . For an full column rank matrix , denote the column space of and its orthogonal complement by and , respectively. For square matrices , denotes the block diagonal matrix, whose diagonals are .
2. Preliminaries
Let denote the realization of a variable on a spatial unit i for . Here, we exclude the case where , i.e., . Accordingly, , where . In addition, let be the nonnegative spatial weight between the spatial units i and j. Here, we suppose that and for . Accordingly, the spatial weights matrix is an symmetric hollow matrix. In addition, let , which is assumed positive.
Geary’s
c for
, denoted by
, is defined by
Let
, where
for
. Then, the graph Laplacian in spectral graph theory (see, e.g., [
12,
13,
14]) is defined as
Accordingly, as shown in, e.g., [
4,
14],
is a nonnegative definite matrix such that
.
Ref. [
4] (Proposition 3.1) showed that
can be represented using
as
where
, which is a symmetric idempotent matrix, i.e.,
and
.
Given that
is a real symmetric matrix, it can be spectrally decomposed as
where
and
is an orthogonal matrix. Here,
denotes an eigenpair of
for
, and the eigenvalues,
, are in ascending order. Given that
is a nonnegative definite matrix and
, we can suppose that
. Let
m denote the number of connected components. Then, it is known that
. See, e.g., [
14]. Let
and
. We show how to obtain
, as well as
from
in
Section 4.
In spectral graph theory, the linear transformation given by
is referred to as the graph Fourier transform of
([
15]). In addition,
and
for
are referred to as graph Laplacian eigenvalues and graph Laplacian eigenvectors, respectively. Let
.
Given that
,
for
, and
is an orthogonal projection matrix onto
, it follows that
, which yields
In addition, given that
is symmetric and
, it follows that
Moreover,
is an orthogonal matrix. By combining these results,
can be represented as
Here, given that
, it follows that
. Finally, we note that (
7) is a part of [
4] (Proposition 3.3).
3. A New Representation of Geary’s
Given that
for
, we can consider
which can be regarded as Geary’s
c when
. We note that, given
, Geary’s
c when
is excluded. (Actually, it cannot be defined. This is because
.)
For
, it follows that
and
, where
is the
k-th column of
. Thus,
in (
8) can be represented as
Then, from the inequalities,
, it follows that
Moreover, given that
, it follows that
By combining (
7) and (
9), we obtain
where
Note that
in (
13) satisfy
and
.
The next proposition summarizes the above-mentioned results.
Proposition 1. (a) in (1) can be represented as , where for . Here, for are nonnegative and sum to unity. (b) for satisfy the inequalities given by (10), and their simple average equals unity. Remark 1. Concerning Proposition 1, we make three remarks:
- (i)
Proposition 1(a) implies that Geary’s c can be represented as a weighted average of . Concerning the weight, , the larger is, the larger is. We note that .
- (ii)
Proposition 1(b) implies that the graph Laplacian eigenvectors, , can be sorted in the spatial autocorrelation measured by Geary’s c. In other words, is more positively spatially autocorrelated than or equal to for . Accordingly, is positively (respectively negatively) spatially autocorrelated if is a monotonically decreasing (respectively increasing) sequence. can be characterized by the distribution of the coefficients, . It is somewhat similar to the Fourier series.
- (iii)
If , then regardless of .
Let
, where
and
. Then,
equals
. That is, for all
and
, it follows that
Given that
,
is an example of (
14). From (
14) and Proposition 1(a), we obtain
.
The next corollary summarizes the above result.
Corollary 1. For all and , equals .
In addition, from Proposition 1, it immediately follows that
Likewise,
follows.
The next corollary summarizes the above result.
Corollary 2. belongs to the closed interval given by .
Remark 2. Concerning Corollary 2, we make two remarks:
- (i)
If , then the interval given by reduces to a singleton. For example, if , which is the binary adjacency matrix of the complete graph with n vertices, then and, accordingly, . Then, in this case, regardless of .
- (ii)
Ref. [11] showed that belongs to the closed interval given by . Given (9), Corollary 2 is its equivalent.
4. A Way to Compute the Eigenvectors in (4)
In this section, we develop a way to compute
, which also provides
. Here, we explain the reason why it is useful. When there is only one connected component, i.e.,
, there is no problem. This is because
has single 0 eigenvalue and the corresponding normalized eigenvector is
. However, when there is more than one connected component, i.e.,
,
has multiple 0 eigenvalues, and then,
is not necessarily one of the eigenvectors returned from a computer program. For example, when
there are two connected components and, accordingly,
(
Figure 1). In this case, both
and
, where
and
, and
and
, where
and
, are eigenpairs of the corresponding graph Laplacian. The results below can handle such a situation.
Let
denote any orthonormal basis of
and
, where
and
. Then,
is an
orthogonal matrix. Accordingly, from (
4) and (
6), it follows that
where
. Here, from
,
, and
, it follows that
where
, which is an
orthogonal matrix. This is because
and
. In addition, given that
, it follows that
By combining (
18) and (
19), (
17) becomes
. Therefore, it follows that
Here, recall that
is an orthogonal matrix. In addition, given that
is an orthogonal matrix, premultiplying (
18) by
yields
. Therefore, it follows that
The next proposition summarizes the above-mentioned results.
Proposition 2. Denote the k-th column of by for , i.e., . Then, for are the eigenpairs of . In addition, is obtainable from by (21). Remark 3. Concerning Proposition 2, we make two remarks:
- (i)
The following matrix is an example of :where . (The use of is inspired by [16].) Here, , where , is a Helmert orthogonal matrix ([17]). Instead of , we may use , where Δ is the matrix such that for an n-dimensional vector . This is because satisfies that and . Here, is a positive definite matrix. - (ii)
MATLAB/GNU Octave user-defined functions required for the calculation of , , and the bounds of Geary’s c are provided in Appendix A.