Next Article in Journal
Parametrization and Optimal Tuning of Constrained Series PIDA Controller for IPDT Models
Next Article in Special Issue
A New Perspective on Moran’s Coefficient: Revisited
Previous Article in Journal
Non-Parametric Test for Decreasing Uncertainty of Residual Life Distribution (DURL)
Previous Article in Special Issue
Multi-View Learning-Based Fast Edge Embedding for Heterogeneous Graphs
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Geary’s c and Spectral Graph Theory: A Complement

Graduate School of Humanities and Social Sciences, Hiroshima University, 1-2-1 Kagamiyama, Higashihiroshima 739-8525, Japan
Mathematics 2023, 11(20), 4228; https://doi.org/10.3390/math11204228
Submission received: 14 September 2023 / Revised: 4 October 2023 / Accepted: 7 October 2023 / Published: 10 October 2023
(This article belongs to the Special Issue Advances in Graph Theory: Algorithms and Applications)

Abstract

:
Spatial autocorrelation, which describes the similarity between signals on adjacent vertices, is central to spatial science, and Geary’s c is one of the most-prominent numerical measures of it. Using concepts from spectral graph theory, this paper documents new theoretical results on the measure. MATLAB/GNU Octave user-defined functions are also provided.

1. Introduction

Using concepts from spectral graph theory, in this paper, we document new theoretical results on Geary’s c, which is one of the most-prominent numerical measures of spatial autocorrelation. (Moran’s I is another prominent numerical measure. For the measure, see, e.g., [1,2].) Here, spatial autocorrelation describes the similarity between signals on adjacent vertices and is central to spatial science ([3]). Therefore, the results about the measure contribute to the development of spatial science.
More specifically, we provide a new representation of Geary’s c. It is an expansion of it into a linear combination of variables with different degrees of spatial autocorrelation. By using the distribution of the coefficients, we can characterize spatial data. It is somewhat similar to Fourier series. Subsequently, we develop a way to compute the graph Laplacian eigenvectors needed for the graph Fourier transform. MATLAB/GNU Octave user-defined functions are also provided.
This paper can be considered complementary to [4]. As in this paper, using concepts from spectral graph theory, [4] provided three types of representations for it: (a) graph Laplacian representation, (b) graph Fourier transform representation, and (c) Pearson’s correlation coefficient representation. Our new representation can be regarded as an addition to them. Moreover, the way to compute the graph Laplacian eigenvectors is useful not only for this paper, but also for [4].
We make two remarks on Geary’s c. First, Geary’s c was developed by [5] and modified by [6,7,8,9]) into what it is today. It is a spatial generalization of the von Neumann ratio ([10]). Second, unlike Pearson’s correlation coefficient, there exists the following:
Positive spatial autocorrelation if c < 1 ; No spatial autocorrelation if c = 1 ; Negative spatial autocorrelation if c > 1 .
See, e.g., [11] (Equation (6)).
This paper is organized as follows. In Section 2, we provide some preliminaries for the following two sections. In Section 3, we present a new representation of Geary’s c. In Section 4, we develop a way to compute the graph Laplacian eigenvectors needed for the graph Fourier transform. Section 5 concludes. In Appendix A, we provide MATLAB/GNU Octave user-defined functions.

Some Notations

Let y = [ y 1 , , y n ] , I n be the identity matrix of order n and ι be the n-dimensional column vector of ones, i.e., ι = [ 1 , , 1 ] . For an n × m full column rank matrix A , denote the column space of A and its orthogonal complement by S ( A ) and S ( A ) , respectively. For square matrices A 1 , , A p , diag ( A 1 , , A p ) denotes the block diagonal matrix, whose diagonals are A 1 , , A p .

2. Preliminaries

Let y i denote the realization of a variable on a spatial unit i for i = 1 , , n . Here, we exclude the case where y 1 = = y n , i.e., y S ( ι ) . Accordingly, i = 1 n ( y i y ¯ ) 2 > 0 , where y ¯ = 1 n i = 1 n y i . In addition, let w i j be the nonnegative spatial weight between the spatial units i and j. Here, we suppose that w i i = 0 and w j i = w i j for i , j = 1 , , n . Accordingly, the spatial weights matrix W = [ w i j ] is an n × n symmetric hollow matrix. In addition, let Ω = i = 1 n j = 1 n w i j , which is assumed positive.
Geary’s c for y , denoted by c ( y ) , is defined by
c ( y ) = n 1 2 Ω i = 1 n j = 1 n w i j ( y i y j ) 2 i = 1 n ( y i y ¯ ) 2 .
Let D = diag ( d 1 , , d n ) , where d i = j = 1 n w i j for i = 1 , , n . Then, the graph Laplacian in spectral graph theory (see, e.g., [12,13,14]) is defined as
L = D W .
Accordingly, as shown in, e.g., [4,14], L is a nonnegative definite matrix such that L ι = 0 .
Ref. [4] (Proposition 3.1) showed that c ( y ) can be represented using L as
c ( y ) = n 1 Ω y L y y Q ι y ,
where Q ι = I n ι ( ι ι ) 1 ι , which is a symmetric idempotent matrix, i.e., Q ι = Q ι and Q ι 2 = Q ι .
Given that L is a real symmetric matrix, it can be spectrally decomposed as
L = U Λ U ,
where Λ = diag ( λ 1 , , λ n ) and U = [ u 1 , , u n ] is an orthogonal matrix. Here, ( λ k , u k ) denotes an eigenpair of L for k = 1 , , n , and the eigenvalues, λ 1 , , λ n , are in ascending order. Given that L is a nonnegative definite matrix and L ι = 0 , we can suppose that ( λ 1 , u 1 ) = 0 , 1 n ι . Let m denote the number of connected components. Then, it is known that 0 = λ 1 = = λ m < λ m + 1 λ n . See, e.g., [14]. Let Λ 2 = diag ( λ 2 , , λ n ) and U 2 = [ u 2 , , u n ] . We show how to obtain U 2 , as well as Λ 2 from L in Section 4.
In spectral graph theory, the linear transformation given by U y is referred to as the graph Fourier transform of y ([15]). In addition, λ k and u k for k = 1 , , n are referred to as graph Laplacian eigenvalues and graph Laplacian eigenvectors, respectively. Let [ α 1 , , α n ] = U y .
Given that u 1 S ( ι ) , u k S ( ι ) for k = 2 , , n , and Q ι is an orthogonal projection matrix onto S ( ι ) , it follows that Q ι U = [ 0 , u 2 , , u n ] , which yields
y Q ι U = [ 0 , α 2 , , α n ] .
In addition, given that L is symmetric and L ι = 0 , it follows that
L = Q ι L Q ι .
Moreover, U is an orthogonal matrix. By combining these results, c ( y ) can be represented as
c ( y ) = n 1 Ω y Q ι U Λ U Q ι y y Q ι U U Q ι y = n 1 Ω k = 2 n λ k α k 2 j = 2 n α j 2 .
Here, given that y S ( ι ) , it follows that j = 2 n α j 2 = y Q ι y > 0 . Finally, we note that (7) is a part of [4] (Proposition 3.3).

3. A New Representation of Geary’s c

Given that u k S ( ι ) for k = 2 , , n , we can consider
c ( u k ) = n 1 Ω u k L u k u k Q ι u k , k = 2 , , n ,
which can be regarded as Geary’s c when y = u k . We note that, given u 1 S ( ι ) , Geary’s c when y = u 1 is excluded. (Actually, it cannot be defined. This is because u 1 Q ι u 1 = 0 .)
For k = 2 , , n , it follows that u k L u k = u k U Λ U u k = e k Λ e k = λ k and u k Q ι u k = u k 2 = 1 , where e k is the k-th column of I n . Thus, c ( u k ) in (8) can be represented as
c ( u k ) = n 1 Ω λ k , k = 2 , , n .
Then, from the inequalities, 0 λ 2 λ n , it follows that
0 c ( u 2 ) c ( u n ) .
Moreover, given that k = 2 n λ k = k = 1 n λ k = tr ( L ) = Ω , it follows that
1 n 1 k = 2 n c ( u k ) = 1 n 1 k = 2 n n 1 Ω λ k = 1 .
By combining (7) and (9), we obtain
c ( y ) = n 1 Ω k = 2 n λ k α k 2 j = 2 n α j 2 = k = 2 n n 1 Ω λ k α k 2 j = 2 n α j 2 = k = 2 n c ( u k ) α k 2 j = 2 n α j 2 = k = 2 n α k 2 j = 2 n α k 2 c ( u k ) = k = 2 n ψ k c ( u k ) ,
where
ψ k = α k 2 j = 2 n α j 2 , k = 2 , , n .
Note that ψ k in (13) satisfy ψ k 0 and k = 2 n ψ k = 1 .
The next proposition summarizes the above-mentioned results.
Proposition 1.
(a) c ( y ) in (1) can be represented as k = 2 n ψ k c ( u k ) , where ψ k = α k 2 j = 2 n α j 2 for k = 2 , , n . Here, ψ k for k = 2 , , n are nonnegative and sum to unity. (b) c ( u k ) for k = 2 , , n satisfy the inequalities given by (10), and their simple average equals unity.
Remark 1.
Concerning Proposition 1, we make three remarks:
(i) 
Proposition 1(a) implies that Geary’s c can be represented as a weighted average of c ( u 2 ) , , c ( u n ) . Concerning the weight, ψ k , the larger | α k | = | u k y | = | ( u k u k ) 1 u k y | is, the larger ψ k is. We note that α k = arg min ϕ k y ϕ 1 u 1 ϕ n u n 2 = arg min ϕ k y ϕ k u k 2 .
(ii) 
Proposition 1(b) implies that the graph Laplacian eigenvectors, u 2 , , u n , can be sorted in the spatial autocorrelation measured by Geary’s c. In other words, u k is more positively spatially autocorrelated than or equal to u k + 1 for k = 2 , , n 1 . Accordingly, y is positively (respectively negatively) spatially autocorrelated if { ψ k } is a monotonically decreasing (respectively increasing) sequence. y can be characterized by the distribution of the coefficients, ψ 2 , , ψ n . It is somewhat similar to the Fourier series.
(iii) 
If c ( u 2 ) = = c ( u n ) = μ , then c ( y ) = k = 2 n ψ k c ( u k ) = μ k = 2 n ψ k = μ regardless of y .
Let η = η 1 + η 2 , where η 1 S ( ι ) and η 2 S ( y ) { 0 } . Then, c ( η ) equals c ( y ) . That is, for all γ 1 R and γ 2 R { 0 } , it follows that
c ( γ 1 ι + γ 2 y ) = c ( y ) .
Given that Q ι y = y y ¯ ι , c ( Q ι y ) = c ( y ) is an example of (14). From (14) and Proposition 1(a), we obtain c ( γ 1 ι + γ 2 y ) = k = 2 n ψ k c ( u k ) .
The next corollary summarizes the above result.
Corollary 1.
For all γ 1 R and γ 2 R { 0 } , c ( γ 1 ι + γ 2 y ) equals k = 2 n ψ k c ( u k ) .
In addition, from Proposition 1, it immediately follows that
c ( y ) = k = 2 n ψ k c ( u k ) k = 2 n ψ k c ( u n ) = c ( u n ) k = 2 n ψ k = c ( u n ) .
Likewise, c ( u 2 ) c ( y ) follows.
The next corollary summarizes the above result.
Corollary 2.
c ( y ) belongs to the closed interval given by c ( u 2 ) , c ( u n ) .
Remark 2.
Concerning Corollary 2, we make two remarks:
(i) 
If c ( u 2 ) = = c ( u n ) , then the interval given by c ( u 2 ) , c ( u n ) reduces to a singleton. For example, if W = ι ι I n , which is the binary adjacency matrix of the complete graph with n vertices, then L = ( n 1 ) I n ( ι ι I n ) = n Q ι and, accordingly, c ( u 2 ) = = c ( u n ) = n 1 n ( n 1 ) × n = 1 . Then, in this case, c ( y ) = k = 2 n ψ k c ( u k ) = k = 2 n ψ k = 1 regardless of y .
(ii) 
Ref. [11] showed that c ( y ) belongs to the closed interval given by n 1 Ω λ 2 , n 1 Ω λ n . Given (9), Corollary 2 is its equivalent.

4. A Way to Compute the Eigenvectors in (4)

In this section, we develop a way to compute U 2 = [ u 2 , , u n ] , which also provides Λ 2 = diag ( λ 2 , , λ n ) . Here, we explain the reason why it is useful. When there is only one connected component, i.e., m = 1 , there is no problem. This is because L has single 0 eigenvalue and the corresponding normalized eigenvector is 1 n ι . However, when there is more than one connected component, i.e., m 2 , L has multiple 0 eigenvalues, and then, 1 n ι is not necessarily one of the eigenvectors returned from a computer program. For example, when
W = 0 1 0 0 0 1 0 1 0 0 0 1 0 0 0 0 0 0 0 1 0 0 0 1 0 ,
there are two connected components and, accordingly, m = 2 (Figure 1). In this case, both ( 0 , u 1 * ) and ( 0 , u 2 * ) , where u 1 * = 1 3 [ 1 , 1 , 1 , 0 , 0 ] and u 2 * = 1 2 [ 0 , 0 , 0 , 1 , 1 ] , and ( 0 , u 1 ) and ( 0 , u 2 ) , where u 1 = 1 5 [ 1 , 1 , 1 , 1 , 1 ] and u 2 = 2 15 , 2 15 , 2 15 , 3 10 , 3 10 , are eigenpairs of the corresponding graph Laplacian. The results below can handle such a situation.
Let { g 2 , , g n } denote any orthonormal basis of S ( ι ) and G = [ g 1 , G 2 ] , where g 1 = 1 n ι and G 2 = [ g 2 , , g n ] . Then, G is an n × n orthogonal matrix. Accordingly, from (4) and (6), it follows that
G Q ι L Q ι G V = V Λ ,
where V = G U . Here, from g 1 u 1 = 1 , g 1 U 2 = 0 , and G 2 u 1 = 0 , it follows that
V = diag ( 1 , V 2 ) ,
where V 2 = G 2 U 2 , which is an ( n 1 ) × ( n 1 ) orthogonal matrix. This is because V V = U G G U = I n and V V = diag ( 1 , V 2 V 2 ) . In addition, given that Q ι G = [ 0 , G 2 ] , it follows that
G Q ι L Q ι G = diag ( 0 , G 2 L G 2 ) .
By combining (18) and (19), (17) becomes diag ( 0 , G 2 L G 2 ) · diag ( 1 , V 2 ) = diag ( 1 , V 2 ) · diag ( λ 1 , Λ 2 ) . Therefore, it follows that
G 2 L G 2 V 2 = V 2 Λ 2 .
Here, recall that V 2 is an orthogonal matrix. In addition, given that G is an orthogonal matrix, premultiplying (18) by G = [ g 1 , G 2 ] yields U = [ g 1 , G 2 ] · diag ( 0 , V 2 ) = [ g 1 , G 2 V 2 ] . Therefore, it follows that
U 2 = G 2 V 2 .
The next proposition summarizes the above-mentioned results.
Proposition 2.
Denote the k-th column of V 2 by v k + 1 for k = 1 , , n 1 , i.e., V 2 = [ v 2 , , v n ] . Then, ( λ k , v k ) for k = 2 , , n are the eigenpairs of G 2 L G 2 . In addition, U 2 is obtainable from V 2 by (21).
Remark 3.
Concerning Proposition 2, we make two remarks:
(i) 
The following n × ( n 1 ) matrix F 2 is an example of G 2 :
F 2 = 1 1 1 0 2 1 0 0 ( n 1 ) Γ 1 ,
where Γ = diag ( 1 · 2 , , ( n 1 ) · n ) . (The use of F 2 is inspired by [16].) Here, F = [ f 1 , F 2 ] , where f 1 = 1 n ι , is a Helmert orthogonal matrix ([17]). Instead of F 2 , we may use H 2 = Δ ( Δ Δ ) 1 2 , where Δ is the ( n 1 ) × n matrix such that Δ ζ = [ ζ 2 ζ 1 , , ζ n ζ n 1 ] for an n-dimensional vector ζ = [ ζ 1 , , ζ n ] . This is because H 2 satisfies that H 2 ι = ( Δ Δ ) 1 2 Δ ι = 0 and H 2 H 2 = ( Δ Δ ) 1 2 Δ Δ ( Δ Δ ) 1 2 = I n 1 . Here, Δ Δ is a positive definite matrix.
(ii) 
MATLAB/GNU Octave user-defined functions required for the calculation of Λ 2 , U 2 , and the bounds of Geary’s c are provided in Appendix A.

5. Concluding Remarks

In this paper, we showed new theoretical results on Geary’s c, which included (i) a new representation of Geary’s c and (ii) a way to compute the graph Laplacian eigenvectors. The obtained results are summarized in Propositions 1 and 2 and Corollaries 1 and 2. The required MATLAB/GNU Octave user-defined functions are also provided. Finally, as stated, this paper can be considered complementary to [4].

Funding

The Japan Society for the Promotion of Science supported this work through KAKENHI (grant number: 23K013770A).

Acknowledgments

The author thanks three anonymous referees for their valuable comments. The usual caveat applies.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. MATLAB/GNU Octave User-Defined Functions

In this section, we provide three MATLAB/GNU Octave user-defined functions. Among them, Lam2U2 is a function for calculating U 2 , as well as Λ 2 from L . Gearycbounds is a function for calculating the bounds of Geary’s c corresponding to L . Note that (A+A’)/2 in the functions is to ensure symmetry. Finally, these two functions depend on Fmat, which is a function to make F .
  1 
 
  2 
function [Lam2,U2]=Lam2U2(W)
  3 
   n=size(W,1);
  4 
   L=diag(sum(W,2))-W;
  5 
   F=Fmat(n); F2=F(:,2:n); A=F2’*L*F2;
  6 
   [X,E]=eig((A+A’)/2);
  7 
   [e,ind]=sort(diag(E),’ascend’);
  8 
   Lam2=diag(e); V2=X(:,ind);
  9 
   U2=F2*V2;
10 
end
  1 
function [c_lb,c_ub]=Gearycbounds(W)
  2 
   n=size(W,1);
  3 
   Omega=sum(sum(W));
  4 
   L=diag(sum(W,2))-W;
  5 
   F=Fmat(n); F2=F(:,2:n); A=F2’*L*F2;
  6 
   eigv=sort(eig((A+A’)/2),’ascend’);
  7 
   c_lb=((n-1)/Omega)*eigv(1);
  8 
   c_ub=((n-1)/Omega)*eigv(n-1);
  9 
end
  1 
function [F]=Fmat(n)
  2 
   F=zeros(n,n);
  3 
   F(:,1)=ones(n,1)/sqrt(n);
  4 
   for k=2:n
  5 
      F(:,k)=[ones(k-1,1);-(k-1);zeros(n-k,1)]/sqrt((k-1)*k);
  6 
   end
  7 
end

References

  1. Dray, S. A new perspective about Moran’s coefficient: Spatial autocorrelation as a linear regression problem. Geogr. Anal. 2011, 43, 127–141. [Google Scholar]
  2. Yamada, H. A unified perspective on some autocorrelation measures in different fields: A note. Open Math. 2023, 21, 20220574. [Google Scholar] [CrossRef]
  3. Getis, A. A history of the concept of spatial autocorrelation: A geographer’s perspective. Geogr. Anal. 2008, 40, 297–309. [Google Scholar] [CrossRef]
  4. Yamada, H. Geary’s c and spectral graph theory. Mathematics 2021, 9, 2465. [Google Scholar] [CrossRef]
  5. Geary, R.C. The contiguity ratio and statistical mapping. Inc. Stat. 1954, 5, 115–127+129–146. [Google Scholar] [CrossRef]
  6. Cliff, A.D.; Ord, J.K. The problem of spatial autocorrelation. In Studies in Regional Science; Scott, A.J., Ed.; Pion: London, UK, 1969; pp. 25–55. [Google Scholar]
  7. Cliff, A.D.; Ord, J.K. Spatial autocorrelation: A review of existing and new measures with applications. Econ. Geogr. 1970, 46, 269–292. [Google Scholar] [CrossRef]
  8. Cliff, A.D.; Ord, J.K. Spatial Autocorrelation; Pion: London, UK, 1973. [Google Scholar]
  9. Cliff, A.D.; Ord, J.K. Spatial Processes: Models and Applications; Pion: London, UK, 1981. [Google Scholar]
  10. Von Neumann, J. Distribution of the ratio of the mean square successive difference to the variance. Ann. Math. Stat. 1941, 12, 367–395. [Google Scholar] [CrossRef]
  11. De Jong, P.; Sprenger, C.; van Veen, F. On extreme values of Moran’s I and Geary’s c. Geogr. Anal. 1984, 16, 17–24. [Google Scholar] [CrossRef]
  12. Bapat, R.B. Graphs and Matrices, 2nd ed.; Springer: London, UK, 2014. [Google Scholar]
  13. Estrada, E.; Knight, P. A First Course in Network Theory; Oxford University Press: Oxford, UK, 2015. [Google Scholar]
  14. Gallier, J. Spectral theory of unsigned and signed graphs. Applications to graph clustering: A survey. arXiv 2016, arXiv:1601.04692. [Google Scholar]
  15. Shuman, D.I.; Narang, S.K.; Frossard, P.; Ortega, A.; Vandergheynst, P. The emerging field of signal processing on graphs: Extending high-dimensional data analysis to networks and other irregular domains. IEEE Signal Process. Mag. 2013, 30, 83–98. [Google Scholar] [CrossRef]
  16. Maruyama, Y. An alternative to Moran’s I for spatial autocorrelation. arXiv 2015, arXiv:1501.06260v1. [Google Scholar]
  17. Lancaster, H.O. The Helmert Matrices. Am. Math. Mon. 1965, 72, 4–12. [Google Scholar] [CrossRef]
Figure 1. Undirected graph whose binary adjacency matrix is W in (16).
Figure 1. Undirected graph whose binary adjacency matrix is W in (16).
Mathematics 11 04228 g001
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yamada, H. Geary’s c and Spectral Graph Theory: A Complement. Mathematics 2023, 11, 4228. https://doi.org/10.3390/math11204228

AMA Style

Yamada H. Geary’s c and Spectral Graph Theory: A Complement. Mathematics. 2023; 11(20):4228. https://doi.org/10.3390/math11204228

Chicago/Turabian Style

Yamada, Hiroshi. 2023. "Geary’s c and Spectral Graph Theory: A Complement" Mathematics 11, no. 20: 4228. https://doi.org/10.3390/math11204228

APA Style

Yamada, H. (2023). Geary’s c and Spectral Graph Theory: A Complement. Mathematics, 11(20), 4228. https://doi.org/10.3390/math11204228

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop