1. Introduction
The precursor of a concept of statistical ensembles and the related ergodic hypothesis formulated by Boltzmann [
1,
2] were met with a violently negative reaction by the great majority of scientists for clumsiness, absurd, and paradoxical consequences [
3], although it allowed the theoretical calculation of the equations of state for the first time. The study of statistical ensembles related to graphs and networks suffers from a similar inhospitable reception from scientists playing cup-and-ball with a swarm of heuristic parameters and giving any importance to their connection with each other, which is often responsible for spurious conclusions on the graph’s structure and function. The thermodynamic approach to graphs was initiated in complex network theory concerned with the thermodynamic limit of infinitely large graph size
[
4], in which a graph’s structural “fluctuations” become negligible. The major result of the theory on structurally homogeneous infinite graph (random trees) is the Bose–Einstein condensation mechanism explaining the growth of complex evolving networks as a topological phase transition between a “rich-get-richer” phase and a “winner-takes-all” phase [
5,
6,
7]. In contrast to complex network theory, we consider the statistical ensembles of walks defined on a finite connected undirected graph in the thermodynamic limit of very long walks
which has previously never been addressed. Statistics of lengthy walks elucidates the graph structure, quantifies navigability of the graph, and evaluates the fugacity of graph nodes with respect to the entire system of infinite paths available in the graph—all of these characteristics are introduced and discussed in our work for the first time. The probability measuring the tendency of a graph to shrivel or expand at a node follows the Fermi–Dirac distribution function. Although we have sketched a set of “ideal gas laws” for the structure of networks and graphs (in the last section of our work), we have not formulated a comprehensive structural "equation of sate" for graphs and networks yet.
The probability we assign to an event depends on whether we count it as
one of many, considered all at once, or as a single event of its kind. In other words, an estimated likelihood of events hinges on their assumed membership in an
ensemble described by some probability distribution. The famous
Two-Child Paradox [
8] serves a good example for this point: “
Mr. Smith has two children. At least one of them is a boy. What is the probability that both children are boys?” Given that a child is either a boy (
B) or a girl (
G) with equal probability
two incompatible answers may be given to this question, depending on the assumptions taken.
On the one hand, as the probability of getting a boy equals
uniformly and unconditionally for all families, using the Bayes’ Theorem, we obtain that the probability of having at least one boy in a two-kid family will be the same as just having a boy, viz.,
On the other hand, as having a boy in an
ensemble of two-child families with at least one boy obviously comprises three possible events, i.e.,
, or
, or
, the probability of getting a boy in a family of two equals
, and therefore
The ensemble interpretation, in which each admissible event in a family of two with a boy appears
equally probable, is preferable in the context of
ergodic hypothesis blind to family history. In the context of the Two-Child Paradox, there is no way in probability theory to discern if the gender composition in such a family stays put, or children change their sex exploring possible gender identities during an infinite lifetime provided at least one of them stays a boy. The ergodic hypothesis helps to avoid this awkward question by equating the ensemble and time averages while replacing a dynamic description of identity changes by the probabilistic description within the ensemble over a very long period of time. Switching temporal and ensemble perspectives under the spell of ergodic hypothesis is assumed in thermodynamics, equilibrium statistical mechanics, and the theory of dynamical systems.
The concepts of ensembles and ergodicity have a long history [
3]. Boltzmann introduced a "
monode", a family of possible
stationary probability distributions over a single cyclic trajectory of a system of gas particles on an energy surface in the phase space as early as in 1844 [
1,
2]. According to the Boltzmann hypothesis (
1), the time spent by a system in some region of the phase space is proportional to the volume of this region, so that all accessible microstates are
equiprobable over a long period of time, viz.,
where
is the probability density of microstates on the iso-energetic surface, whose area element is
. With this hypothesis, Boltzmann [
1,
2] and later Helmholtz [
9,
10] were able to explain the classical equilibrium thermodynamics, which successfully describes the behavior of gases. The concept of
thermodynamic ensembles was further developed and coined into the English-speaking world by Gibbs [
11].
In our work, we review three classical thermodynamic ensembles defined by Gibbs [
11]—the
microcanonical (
Section 2),
canonical (
Section 4), and
grand canonical (
Section 8) ensembles of very long walks defined in finite connected undirected graphs—and demonstrate that the concept of ergodic ensembles might be applied to quite abstract objects of discrete mathematics. The
thermodynamic limit in our approach is defined as the limit of
very long walks
in a
finite graph rather than the limit for a large number of graph nodes
N. In the limit
, “fluctuations” of graph structural features are negligible, and therefore the graph can be considered as structurally homogeneous across all scales—random, in the limit
fluctuations of the growth rate of the number of distinguishable, long walks in the graph can be ignored, and then graph’s
topological entropy (the log of graph’s
spectral radius) and the corresponding Perron eigenvector of the graph adjacency matrix describe the degree of structural complexity, anisotropy, and navigability of the graph.
Each thermodynamic ensemble permits specific statistical behavior. For example, the microcanonical ensemble representing an
isolated system (with constant energy) is defined by assigning equal probability to every walk of a given length existing in the graph. All very long walks that fit some
probability distribution over graph’s nodes constitute a
macrostate in the canonical ensemble of walks defined in the graph. For example, the series of
intrinsic random walks (introduced in
Section 5) make up equal probabilities to all walks of a given length starting at a node providing an example of the canonical ensemble of walks defined on the finite graph. This canonical ensemble contains not only the very well-known isotropic nearest-neighbor random walks on finite graphs [
7,
12], but also infinitely many types of less known
anisotropic random walks on graphs—and the
Ruelle–Bowen random walk [
13,
14] making up all infinite walks starting at each node equally probably is one among them. While the ergodic theory for isotropic random walks on finite graphs is well developed [
15,
16] (We profoundly thank our referee for this remark), the ergodic properties of anisotropic random walks, including their statistical confinement in the best structurally integrated sub-graphs (see
Section 5 and
Section 7), have not been discussed in literature yet. Finally, in an
open system of long walks represented by the grand canonical ensemble,
chemical potential (free energy absorbed by a very long walk seizing graph’s edge) is kept fixed and equal the graph’s topological entropy
.
We also discuss applications of ergodic walks to the structural analysis of and navigation through finite undirected connected graphs. Graph’s structural defects and boundaries repel very long walks that can be be expressed in terms of entropic pressure and force (
Section 3). Intrinsic random walks forming the canonical ensemble in a graph can be used to measure the degree of graph’s structural anisotropy (
Section 5), to estimate the amount of predictable (
navigable) information about present navigator’s location (
Section 6) and assess the
navigability to each graph node in proportion to its relative
visiting frequency (
Section 7). Navigation focuses on locating a navigator’s position compared to known locations, paths, and structural patterns [
17]. The navigability to a node comprises two information components compatible with two major navigation strategies, known as
path integration (that allows for keeping track of the position and heading while exploring a new space) and
landmark-based piloting (re-calculating position when in a familiar environment), working in concert during navigation in humans and animals [
17]. Finally, the grand canonical ensemble describes the statistics of local fluctuations of the growth rate of the numbers of long walks around the chemical potential as
(
Section 8). The distribution of these fluctuations follows Fermi–Dirac statistics and marks graph’s defects and boundary nodes hosting dramatically less very long walks than others.
We conclude in the last section.
2. The Micro-Canonical Ensemble of Equiprobable Walks in Finite Connected Undirected Graphs
The number of walks of length
n (i.e.,
n–walks) in a lattice
in
d-dimensional space grows exponentially with
n,
. The
micro-canonical ensemble is defined by assigning
equal probability to every
n–walk, viz.,
where the (Boltzmann constant and) temperature
, the
free energy of the
n–walks is
and
is the
entropy in a micro-canonical ensemble. As the free energy
is the Legendre transformation of the
internal energy , with
as the independent variable [
18,
19], viz.,
comparing this definition with (
3), we conclude that the internal energy of all
n–walks is
in a micro-canonical ensemble.
We also readily extend the statistical description of micro-canonical ensemble of equiprobable walks to
-regular graphs, in which every vertex has the same number of neighbors,
, by using the substitution
. As the free energy value (
3) grows linearly with
n, the
intensive free energy (per absorbed edge), viz.,
plays the role of
chemical potential describing the change to free energy after absorbing a new edge to a very long walk in a
-regular graph in a micro-canonical ensemble.
Given a finite connected undirected graph
) where
V,
, is a set of vertices, and
is a set of edges, we assume that its
adjacency matrix (such that
,
, and
, otherwise) has the following spectral decomposition
with ordered eigenvalues
. The free energy in the micro-canonical ensemble equals
and, since
the intensive free energy amounts to the logarithm of the spectral radius
of the graph, viz.,
In
Section 8, the quantity (
6) plays the role of
chemical potential of an edge absorbed by a very long walk. For a
—regular graph, its spectral radius
, so that
in accordance with (
4). The log of graph spectral radius (
6) is also called the
topological entropy of the graph [
20,
21] because it is the exponential growth rate of the number of distinguishable walks, being a measure of complexity of the graph structure. According to (
4), the topological entropy of the graph
can also be interpreted as the
effective dimension of space of the graph,
in a micro-canonical ensemble of very long walks.
3. Entropic Pressure and Force in Micro-Canonical Ensemble of Walks
Missing nodes and edges might dramatically reduce the number of very long walks available in a graph, reshaping the global mobility patterns in a micro-canonical ensemble of walks. Statistical changes in mobility patterns due to graph defects that can be described in terms of entropic pressure and entropic force are as follows.
Namely, a missing node depletes the number of very long walks available in the graph, and therefore reduces the corresponding free energy,
by the following amount of
local energy,
corresponding to the number of very long walks anchoring at
i, viz.,
In the thermodynamic limit
, the resulting local increment of free energy measuring its sensitivity to the disappearance of node from the graph is as follows:
We call the resulting quantity (
9)
entropic pressure , as it accounts for the local stress characterizing the transfer of walker’s mobility from
i to the rest of the graph if
i is not available (see
Figure 1 Left).
Similarly, by eliminating an edge
from the graph, we reduce the local energy
of the node
(
7) by the following amount,
corresponding to the number of
-walks available from the node
adjacent to
i, viz.,
The direction dependent
entropic force introduced in (
10) emerges from the statistical tendency of very long walks to follow the
preferential transition to the neighboring nodes hosting many infinitely long walks, as in the Ruelle–Bowen random walk (
19) [
13,
14]. It is worth-mentioning that the expression for the entropic force (
10) has the structure of a Laplacian operator
related to random walks defined in the graph
G by the transition matrix
.
In
Figure 1, we have presented a membrane graph with a defect and highlighted its nodes according to the values of entropic pressure (
9) (left) and the elements of Perron eigenvector of the matrix
(
10) (right) in the membrane graph.
In
Figure 2, we use the graph representation of Lubbock, TX, USA acquired from the
OpenStreetMap service (The
OpenStreetMap database is publicly available at
https://dataverse.harvard.edu/dataverse/osmnx-street-networks). To construct the spatial graph of the city, we used
Python’s lxml library to parse the raw data and obtain the spatial graph adjacency matrix. The data set was cleaned further by removing disconnected neighborhoods, such as the Preston Smith International airport that is not a structural part of the city. The resulting connected city graph of Lubbock contains 10,421 nodes representing all spaces of movement, including but not limited to residential, secondary, tertiary roads, trunk links, and highways.
The value of entropic pressure in the spatial graph of Lubbock attains maximum at the contemporary structural focus of the city, far apart from the city historical downtown (
Figure 2 Left). The nodes of the city spatial graph on the right-hand side of
Figure 2 are highlighted according to elements of the Fiedler eigenvector belonging to the second largest eigenvalue of the entropic force matrix
(i.e., the second smallest eigenvalue of the associated Laplacian matrix
). The Fiedler eigenvector is used in spectral graph partition, as it bisects the graph into only two connected communities based on the sign of the second vector entry. The Fiedler eigenvector indicates the direction of the fastest decrease of the entropic force over the city spatial graph of Lubbock (
Figure 2 Right). The entries of the Fielder eigenvector are zero everywhere, except for a narrow band extended from the historical city center (where the magnitudeof entropic force is positive) toward the contemporary structural focus of the city (where the magnitude of entropic force is negative). The structural focus of the city absorbs very long walks while the historical center anchored at the abolished city railway station expels long walks. Although railway construction enhanced the city status of Lubbock in early days, its maintenance has a continuing negative impact on the urban development, since railways barricade streets, dramatically cutting down the number of possible paths people can drive or walk and create isolated neighborhoods [
22].
4. The Canonical Ensemble of Walks in Finite Connected Undirected Graphs
The
canonical ensemble represents the possible states of a system in equilibrium that does not evolve over time, even though the underlying system might be in constant motion [
11]. The canonical ensemble is a collection of very long walks (
microstates) of length
, where
counts the number of visits paid by a walker to the
r-th vertex of a connected undirected graph
compatible with a
-
macrostate, a discrete probability density vector
,
, taken over the set of graph vertices, viz.,
.
The total number of microstates (i.e., long walks) lumped into a single
-macrostate is then given by the following multinomial coefficient:
Using Stirling’s approximation,
we readily obtain that
and therefore, as
in which
is the
Boltzmann–Gibbs–Shannon entropy [
23,
24] in the canonical ensemble. If every very long walk lumped to the
-macrostate is chosen with equal probability among the other walks suited for the same macrostate,
, then the most probable walks would be those compatible with the uniform density
, maximizing the value of entropy (
14),
The free energy over the canonical ensemble of very long
-walks (
) is given by
and, therefore, the intensive free energy (chemical potential) equals
where
is the amount of
information (in bits) revealed at every step of the
-walk.
5. The Canonical Ensemble of Intrinsic Random Walks in Finite Connected Undirected Graphs
Discrete time random walks defined in a finite connected undirected graph by an irreducible row -stochastic transition probability matrix , are the natural candidates for the -macrostates in the canonical ensemble of walks. Indeed, as the row-stochastic transition matrix does not evolve over time, the unique stationary distribution of the random walk is the major left eigenvector of the transition matrix, such that .
Given the graph adjacency matrix
and
otherwise, we define the
-
order degree of the vertex
as the number of
n-walks available at
viz.,
Taking further into account that
, we derive an infinite sequence of transition probability matrices [
25], viz.,
defining a countable set of
intrinsic random walks in the graph
G.
The first order intrinsic random walk defined by the transition matrix
has been discussed in literature for more than a century [
12,
26]. The walk
is locally
isotropic, as the random walker chooses the next node to visit among all nearest neighbors of the current node with equal probability. In
Figure 3, we presented densities of nodes in the membrane graph with respect to the different types of intrinsic random walks. Density of nodes with respect to
is proportionate to their
degree centrality, i.e., the numbers of links incident upon the nodes (
Figure 3, left). Other intrinsic random walks following the transition probabilities,
,
make all
n-walks starting at the node
i to occur with equal probability. These random walks are locally biased (
anisotropic), as transitions to the nearest neighbors providing more lengthy walks are more preferable under (
18) for
[
25]. In the limit
, the series of transition matrices
converges [
25] to the
Ruelle–Bowen random walk [
21] (also known as the
maximal entropy random walk [
27]), viz.,
The anisotropic random walk
is confined in the central nodes of the membrane graph (
Figure 3, right). The stationary distribution for the intrinsic random walks (
18) reads as follows [
25]:
For the isotropic random walks
, the stationary distribution
, where
E is the total number of edges in the graph [
12], and
, for the Ruelle–Bowen random walks [
27]. The stationary distribution
reports on the
degree centrality of the graph nodes (i.e., the number of links incident upon a node), and
is naturally related to the
eigenvector centrality of the node
i in the graph
G [
28].
The time until a random walk approaches the stationary distribution (
Figure 3) (i.e., the mixing time) is determined by the spectral gap, the difference between the two largest eigenvalues of the transition matrix. Spectral gaps is maximum (mixing time is minimum) over the canonical ensemble of intrinsic random walks for the anisotropic random walk
(
Figure 4).
The
relative entropy rate [
29] between two Markov chains defined by their transition matrices,
can be used for measuring information divergence over the canonical ensemble of intrinsic random walks in connected undirected graphs and the degree of
graph directional anisotropy [
25].
In (
21), we have introduced
a local counterpart of the space dimension parameter (
4), and its generalization to
n-walks, the directional
graph space dimension tensor
measuring the degree of
directional anisotropy in transitions of the intrinsic random walks making up all
n-walks available from the node
i with equal probability. For
, the graph space dimension tensor (
22) reduces to the space dimension,
, as
for all nodes. In the thermodynamic limit
the graph space dimension tensor reduces to a
direction dependent counterpart of the effective space dimension of the graph
(
6) (or the graph topological entropy), viz.,
6. Navigation through Graphs over Canonical Ensembles of Walks
The problem of effective navigation in graphs and networks can be considered in the framework of canonical ensemble of walks, since the navigator location prediction requires a density of locations that is known. Frequently visited sites are predicted more efficiently than little frequented, especially in the long-run [
30].
Given a
-walk
defined in a connected undirected graph
, Bayes’ theorem [
29,
31] describes the probability of navigator’s present location
X based on prior knowledge of her previous location
t steps before,
Namely,
may be a
t-
step precursor of
X with the following probability:
where
is the probability of walking from
to
X precisely in
t steps;
and
are the densities of locations
and
X with respect to the
-walk, respectively.
is a
density of the t-
step precursors for the location
X induced by the density of walks
. If
it follows from (
24) that the location
X is
unpredictable (as any other location
is a precursor for
X). The available information about visiting the location
X at present is therefore scattered over the entire graph in the past and can be assessed by observing all possible
t-step precursors
, viz.,
The information divergence [
29] (
25) vanishes if and only if the density of
t-step precursors
for the location
X over the graph
G is identical to
, so that visiting the location
in the past is statistically independent of visiting the present location
X t steps later, and therefore
is not a
t-step precursor of
X [
30]. The amount of information (
25) attains its
maximum value, viz.,
whenever the marginal probability
is the major left eigenvector of the
t-step transition matrix
, so that
for all
t and
, i.e., visiting any location in the graph
G by
-walk with probability 1 is a predictor for visiting any other location
X t steps later.
According to the Boltzmann equation (
1), for ergodic observables, the time average of the maximal information (
26) over the entire history of
-walks equals the entropy of the
-walk (
16), viz.,
However, the actual amount of predictable (
navigable) information about present navigator’s location may be quite modest, much less than the amount information revealed at every step of the
-walk ((
16) and (
27)): different graphs have different degrees of
navigability.
7. Navigability of Graphs and Graph Nodes over Canonical Ensembles of Walks
The information function (
27) can be represented as a sum of the
predictable and
unpredictable information components [
32], viz.,
The predictable information component measures the amount of apparent uncertainty about the navigator’s location that can be resolved with some navigation strategy compatible with the -walks, and gauges the amount of true uncertainty about the navigator’s location that cannot be inferred anyway. In the following, we attribute the predictable information component to the navigability of the graph G by the -walk.
Assuming that both information components in (
28) have the same form as the information function (
27), viz.,
with some partition functions
and
, such that
, we obtain
We call the partition function the navigability to the node in the graph G by the -walk. Obviously, the navigability to the node is proportional to its relative visiting frequency —as the more frequent the location, the higher its forecast accuracy—and inverse proportional to the partition function assessing uncertainty of visiting the node r by the -walk.
There are two major navigation strategies—
landmark-based piloting and
walk integration—working in concert during wayfinding in humans and animals [
17]. First, the next visit location
can be guessed from the present navigator’s position
in the graph, and the degree of accuracy of such a guess can be assessed by the mutual information between the present and future navigator’s location conditioned on the walk history,
. This strategy can be naturally associated with landmark-based piloting.
If the
-walk is a random walk defined by a transition matrix
, the
conditional mutual information for such a Markov chain depends only upon the immediate past navigator’s location
, but not on the entire historical sequence of locations visited by the navigator in the more distant past [
32], so that
Second, some degree of uncertainty about the navigator’s future location
might be resolved after all revisiting, and a possible correlation between walks are taken into account in the course of walk integration over the presumably infinite motion history of
-walk. The latter quantity is given by the
excess entropy [
33,
34,
35],
where the
entropy rate [
29],
quantifies the mean amount of uncertainty consisting in the whole (infinite) path history of the
-walks. However, it is intuitive that the values of conditional entropies
in the r.h.s. of (
33) do not increase with the length of walks and, therefore,
, so that
.
For a random walk defined by a transition matrix
, the Markov property simplifies the expression for the entropy rate (
33), viz.,
so that the excess entropy (
32) reads as follows:
By summing (
31) and (
35), we obtain the total amount of predictable information
revealed by the
-walk in the graph
G, viz.,
so that
the navigability to the node r in the graph
G by the random walks defined by the transition matrix
is
Navigability to a node evaluated by the partition function (
37) depends on the strategy of walkers. In
Figure 5, we illustrate the difference by highlighting the nodes of the membrane graph according to the degrees of navigability by the isotropic random walks
(left) and anisotropic random walks
(right). For the isotropic random walks, the movement of walkers along the low-dimensional boundaries and at the corners of the graph are more predictable than their movements in the bulk, as all bulky locations of the same connectivity are visited with equal probability by the random walk
. In contrast with the isotropic random walks, a navigator following the anisotropic strategy
is statistically confined within the region hosting the most of infinitely long paths available in the graph, where the navigator’s position is very likely.
As demonstrated in [
32], the entropy function
allows for the following decomposition involving the conditional entropies:
Therefore, the remaining part of the information function (
28), the last part in the decomposition (
38), is the conditional entropy of the present navigator’s location conditioned on her past and future locations, viz.,
assesses the amount of
true uncertainty about a navigator’s location that can neither be inferred from integrating over the past history of the
-walk nor have any repercussion for the navigator walk in the future [
34]. For a random walk defined by the transition matrix
, we readily obtain that
where the partition function
assesses the amount of uncertainty about navigator’s visiting the node
.
8. A Grand-Canonical Ensemble of Ergodic Walks in Finite Connected Undirected Graphs
The grand canonical ensemble represents the possible states of a system exchanging energy and particles with a heat bath in thermodynamic equilibrium [
11]. The growth rate of the number of distinguishable paths in a graph tends to its topological entropy
in the thermodynamic limit
. However, the local growth rate of the number of distinguishable paths available from a node,
might differ from the graph topological entropy. In the grand canonical ensemble, the probability to observe such a "fluctuation" of the long paths growth rate inferior to the topological entropy at the node
i is taken to be
where
is a fluctuation of free energy (
8) associated with heterogeneity of growth rate of the number of very long walks in the graph. The
grand partition function amasses the
fugacity of all nodes in the graph, playing the role of a normalization factor in (
41). In the thermodynamic limit
,
=
, and
, so that the expression for the
node’s fugacity takes the following form:
the grand partition function reads as follows:
and, finally, the grand canonical probability (
41) takes the form of a Fermi–Dirac distribution in the thermodynamic limit
, viz.,
The
grand potential playing the role of free energy with respect to the grand partition function
in grand-canonical ensemble equals:
Having a form of the
relative fugacity of a node, the grand canonical probability (
44) can be regarded as measuring the
ease of separation of the vertex from the rest of graph with respect to the entire system of infinite paths. The nodes with the long paths growth rate inferior to the topological entropy are insufficiently integrated into the graph structure and might be lost or acquire new connections in the course of prospective graph structural modifications.
In
Figure 6, we have presented the membrane graph (left) and the spatial graph of the city of Lubbock, Texas (right), with their nodes colored according to values of grand canonical probabilities (
44). The nodes located on the low-dimensional graph boundaries, at the corners of membrane graph and in the loosely connected south suburbs of the city of Lubbock have distinctly higher relative fugacity than others. These nodes can also be regarded as the
points of prospective network growth in where the graph as a system of infinite paths remains open. Interestingly, the highlighted nodes in the spatial graph of Lubbock (
Figure 6 right) mark the city neighborhoods currently under construction.
9. Discussion and Conclusions
We have defined three major thermodynamic ensembles of ergodic walks in connected undirected graphs, in the thermodynamic limit of infinitely long walks and showed that the ergodic mindset might be applied not only to particles of ideal gases, but also to quite abstract objects of discrete mathematics, such as graphs.
We have demonstrated that graph structural defects and irregularities, such as missing nodes and edges, might dramatically reduce the number of available very long paths, globally reshaping the mobility patterns in the entire graph. In the framework of micro-canonical ensembles, we may consider their effect as resulting from actions of the entropic pressure and force repelling walkers from structural irregularities and boundaries toward the best integrated region of the network: the laxer the connection, the stronger the repelling. Perhaps, the cumulative effect of entropic forces generated by railways and other structural obstacles along with the unbalanced growth of urban neighborhoods might be responsible for the urban decay process in the historical districts of some cities.
We have also shown that the problem of effective navigation [
36] in graphs and networks can be considered with respect to a canonical ensemble of walks, as an effective location prediction of a navigator’s position, and requires a density of locations in the walk be known. According to the probabilistic setting, frequently visited sites are predicted more efficiently than little frequented, especially in the long run:
the more frequent a node, the more predictable the navigator’s position visiting it. Regular lattices and homogeneous graphs lacking structural salience and landmarks might also be confusing environments dramatically, reducing predictability of navigator’s position.
Finally, we have studied the grand canonical ensemble of very long paths describing the statistics of fluctuations of the local path growth rate with respect to the graph topological entropy. In the thermodynamic limit of infinite paths, the distribution of the relative fugacity over the graph nodes takes the form of Fermi–Dirac distribution function. The high relative fugacity value of a node assumes that the degree of its integration into the system of infinite paths is insufficient, indicating that the graph is open for the prospective structural modifications associated with the node. In the urban spatial graphs, the nodes of high fugacity might be concentrated in the neighborhood under construction, marking the points of city network growth.
Future research should consider a comprehensive structural "equation of sate" for networks and graphs.