1. Introduction
It is often convenient to present big volumes of data as a graph, i.e., as a set of objects and binary relations (bonds) between them. This approach naturally arises in numerous contexts ranging from physics of disordered systems [
1] and biology [
2] to sociology [
3] and linguistics (see, e.g., [
4,
5,
6]). The rapid growth in information technology ensures that larger and larger datasets of this type are becoming available. This naturally stimulates interest in the tools to analyze these datasets and simple (or not so simple) reference mathematical models, which can be used to probe their properties. Thus, a rapid development in the last 20 years of a new interdisciplinary field on the boundary of the random graph theory, the data analysis and the statistical physics, known as complex network theory [
7,
8,
9], has occurred.
Among the structural characteristics typical for many experimentally observed networks, there are three especially common and striking (see, e.g., [
7]): (i) the small-world property (a very small average node-to-node distance measured along the network), (ii) extremely wide, approximately a power-law distribution of the node degrees (the networks with this property are often called ‘scale-free’), and (iii) large, as compared to referent randomized networks, the concentration of the short circles (e.g., triangles). It is reasonably easy to construct a model network that has one or two of these characteristics, e.g., Erdos–Rényi graphs [
10,
11] are small-world, random geometrical graphs [
12] that have a large clustering coefficient. The Barabási–Albert model [
13] generates small-world scale-free networks. The Watts–Strogatts model [
14] generates small-world graphs with a large clustering coefficient, etc. Generating all three properties simultaneously is much harder. Random geometric networks in a hyperbolic space [
15,
16,
17] constitute one example of networks with these properties. Another one is the Apollonian network.
The Apollonian network [
18,
19] is a planar graph that arises naturally as a network representation of the Apollonian gasket, a remarkable object, which is, apparently, the first known fractal (interestingly, its exact fractal dimension is still unknown) [
20,
21]. The construction of this network can be explained recursively as follows; see
Figure 1. Take a triangle, pick a point inside it and connect it to the three corners of the triangle. As a result, one obtains a set of 3 adjacent triangles that form the first-generation Apollonian network. Now, pick a point inside each of the three triangles, and connect it to its corners, this gives a second-generation Apollonian network, then repeat ad infinitum. The resulting network has been studied extensively in recent years, and it has been shown to have many beautiful properties. For example, the degree distribution and the clustering coefficient have been calculated [
18], as well as the average path length [
22]. Notably, there is an interesting non-planar interpretation of the Apollonian network. Namely, it can be thought of as a simplicial complex in the following way [
23]. A first-generation Apollonian network is a tetrahedron (3-simplex). A second-generation Apollonian network consists of four tetrahedra: the original one and another three, each having a common two-face with the original one. A third-generation Apollonian network consists of 13 tetrahedra: one produced in the first generation, three produced in the second generation and nine new ones attached to each free face of the three second-generation tetrahedra, etc. Thus, one can think of an Apollonian network as a regular rooted tree of tetrahedra touching each other by common faces. This construction is easy to visualize in a 3-dimensional (3D) space (see
Figure 1b,c), and it makes the Apollonian network a natural discretization of the 3D hyperbolic space in the same way as a regular tree is a natural discretization of the hyperbolic plane. Many properties of the Apollonian network can be calculated exactly, which makes it a nice toy model for the study of various properties of real scale-free networks. As a result, there have been a significant number of papers in recent years studying percolation [
23], spin models [
24], signal spreading [
25], synchronization [
26], traffic [
27], random walks [
28,
29], etc., on the Apollonian network.
Despite being such a beautiful and well-studied object, the Apollonian network has certain drawbacks as a model of real networks. Most importantly, it is a single deterministic object with certain fixed properties, e.g., a fixed degree distribution with a fixed power law exponent
. Importantly, that degree distribution is not a true power law but rather a log-periodic distribution consisting of a sequence of atoms at points
and a power-law envelope. This means that the network is scale-invariant only with respect to certain discrete renormalizations and thus do not have the full set of properties of a true power law distribution; see [
30] for a recent discussion. One natural generalization is a random Apollonian network [
31,
32,
33], which is constructed, instead of a regular generation-by-generation process, by sequential partitioning of arbitrarily chosen triangles. The average degree distribution in such network is a true power law with exponent
[
32]. Notably, to the best of our knowledge, random Apollonian graphs remain the only scale-free planar graph model with a continuously growing size for which the exact degree distribution exponent is known. Another way of generalizing the network is to consider the
k-simpliceswith
as building blocks of the network construction procedure. This gives rise to multidimensional Apollonian networks [
34,
35].
In this paper, the authors suggest another way of generalizing the Apollonian network construction. As a result, a novel famility of small-world, scale-free planar networks is obtained. The main idea is to construct an Apollonian-style iteration procedure based on polygons with different numbers of edges. The paper is organized as follows. In
Section 2, a tetragon-based Apollonian-style network is constructed and the corresponding degree distribution is explicitely calculated. Then, the suggested procedure is generalized to polygons with an arbitrary even number of edges. In
Section 4, it is shown that it is possible to construct a continuous one-parametric family of models interpolating between the tetragon- and hexagon-based models and demonstrate that the models in this family have a power low degree distribution with an exponent depending on the parameter, so it is possible to adjust it to fit the desired degree distribution (note that the adjustable exponent of the degree distribution can be obtained by different means in the so-called Evolving Apollonian networks [
26,
33]). Last section, summarizes the results of the paper and discusses further open questions and possible generalizations.
3. Polygon-Based Networks for Polygons with Any Even Number of Edges
The procedure suggested in
Section 2 can be easily generalized for any even number of edges
(
) in the generating polygon; see generalization for
in
Figure 4. This procedure results in a sequence of planar scale-free network models with degree distributions converging to
with
m-dependent exponents
. At each generation, each polygon is split by a path connecting directly opposite nodes. There are
m different ways of such a splitting, so each node of a polygon participates in the splitting with probability
. This allows generalizing the recurrence relations (
3) and (
7) for the degree distribution of a node
n generations after its creation in the following way:
for the original
nodes, and
for all the rest. The number of nodes created at
n-th generation (
) is
. Proceeding in exactly the same way as before, gives
and
for the generating functions of the degree distributions of the original, newly created and all nodes of the network,
,
and
, respectively. The limiting function,
satisfies
and its behavior is easy to analyze both in the vicinity of
and
. Expanding Equation (
52) for small
, one obtains:
where a short-hand notation,
, is introduced.
In turn, in the vicinity of
, it takes the form of Equation (
18). Substituting the ansatz (
18) into Equation (
52), one obtains:
Thus, for any
, the second moment of
converges. The moments are controlled by the coefficients
:
Since the second moment of
is now controlled by the values of distribution at small
k, the initial
nodes do not contribute to the second moment.
Once again, in order to check the predictions
realizations of the networks of up to the 12th generation were generated. The results are shown in
Figure 5a, and in
Figure 5b the plot in the renormalized coordinates is shown. The collapse of the data onto a single master curve is apparent, although it is somewhat worse than in
Figure 3b. Presumably, this happens because the typical degrees in the hexagon-based network are much smaller than in the tetragon-network of the same size, and finite size effects are therefore more important.
4. Polygon-Based Networks with Smoothly Changing Exponent of the Degree Distribution
As a result of
Section 3, one now has a sequence of Apollonian-like models that generate planar scale-free networks with a discrete sequence of degree-distribution exponents
Is it possible to further generalize the model to make
change continuously and take any intermediate values, including, for example,
, corresponding to the point where the second moment of the degree distributions diverges for the first time?
It turns out that this is indeed possible. One way to do that is as follows. Assume that when introducing a new shortcut dividing a polygon in two, one makes the resulting partition to be a pair of tetragons with probability p and a pair of hexagons with probability . That is to say, if the original polygon is a tetragon, then with probability p introduce a 2-step path connecting opposite vertices, and with probability , a 4-step path; if the original polygon is a hexagon, the new path connecting two opposite vertices is a 1-step path with probability p and a 3-step path with probability q.
Here, we restrict ourselves to this simplest construction, although it is possible to create more complicated rules. For example, one can introduce correlations between generations in a Markovian way so that there is a matrix of probabilities for a tetragon to give birth to a couple of tetragons, a hexagon to give birth to a couple of tetragons, etc. As a result, it might be possible to construct a network that is, for example, tetragon-dominated at large scales (early generations) and hexagon-dominated at small scales (later generations).
Once again, consider a node, which is created at generation , and let us study its degree distribution at generation . This degree distribution depends only on n and on the number of edges of the initial two faces adjacent to the node where tetragons or hexagons.
Let the average fraction of tetragons at a given generation be
p and the fraction of hexagons be
. Then, for each face adjacent to a given node, the probability that this face is a tetragon is
Assume now that different faces adjacent to a node are tetragons (hexagons) independently from each other. Generally speaking, that is not true: when a new edge is created, the two faces on the sides of it have a similar number of edges. However, one might expect that as the degree of the node grows the correlations become less and less relevant. In this approximation, the probability for a node of degree
k to have exactly
l adjacent tetragons is
when the next generation is created, a new edge adjacent to the node under consideration is created with probability
for each tetragon face and with probability
for each hexagon face. Therefore, one can write down the following approximate equation for the probability
of the node that has degree
at the next generation given that it had degree
k in the previous one:
where the binomial coefficients
are assumed to be zeros if
or
. Now, introduce the probability
for a node to have degree
k n generations after its creation, and the corresponding generation function,
Then,
,
and
Using Equation (
58), it is easy to calculate the second sum on the right-hand side:
which leads to the following equation for the generating function:
where Equation (
56) is taken into account to obtain to the last expression. Proceeding as before, one gets the equation for the generating function of the full limiting degree distribution
(except for the initial set of nodes):
Expanding
for
in the form (compare Equation (
18)):
and equating the coefficients term by term exactly in the same way as in
Section 2, one gets the following equation for the degree distribution exponent
:
Thus, for example, the interesting case
when the second moment of the degree distribution diverges for the first time corresponds to
In
Figure 6, the numerical data for the degree distribution of the mixed networks are shown. One can clearly see that the slope of the distribution gradually changes with changing
p. Moreover, after rescaling the degree distribution with the power law prescribed by Equation (
66), all curves are approximately flat for small
k, validating the approximation of independent phases.
5. Concluding Remarks and Open Questions
This paper presents one possible class of planar random graphs constructed from polygons with an even number of edges using a procedure similar to the construction of Apollonian graphs [
18]. The
-polygon-based graphs have a limiting power law degree distribution with the exponent,
and the moments of the degree distribution are given by Equations (
21) and (55). The second moment of the degree distribution diverges as
with the number of generations
n in the case of tetragon-based graphs (see Equation (
41)) and converges to a finite value in Equation (55) for the polygons with a larger number of edges. Moreover, as described in
Section 4, it is possible to construct a mixed model based on two different polygons (tetragons and hexagons in our example) so that on all stages of construction, tetragons are formed with probability
p and hexagons with probability
. By varying
p, one can adjust the slope of the degree distribution in order to achieve a desired value in a way reminiscent of evolving Apollonian networks [
26].
Clearly, all graph classes presented here are small world. Indeed, the diameter of the graphs grows at most linearly with the number of generations:
where
is the diameter of the
n-generation graph. In turn, the total number of nodes grows exponentially with the number of generations; thus, the diameter is, at most, proportional to the logarithm of the number of nodes.
The shortest cycles in the graphs presented here are , and, in particular, there are no triangles in them, so, generally speaking, the clustering coefficient is zero. However, this should not obscure the fact that there is actually a huge number of short cycles in these graphs. Indeed, consider the following auxiliary construction: let the polygon-based construction be exactly as presented above up to n-th generation, but then connect all the nodes belonging to the same face on the last generation of the procedure, so the smallest faces (i.e., faces constructed on the last step) are considered to be complete graphs (-simplices). The large-scale structure of the resulting graph (including, e.g., the slope of the degree distribution) will be the same as in the original polygon-based procedure, but a finite fraction of nodes (those created in the n-th generation of the construction) will have clustering coefficient 1, guaranteeing that the average clustering coefficient of the whole graph remains finite as . In order to use polygon-based graphs as a toy model for experimental systems, it might be reasonable to add a random fraction of them in order to fit the observed clustering coefficient instead of adding all possible links connecting the vertices in the smallest faces.
Interestingly, polygon-based graphs with even
m are bipartite, see
Figure 7a. In that sense, the tetragon-based graph seems to be a natural generalization of the Apollonian construction for the case of bipartite graphs. We expect that there might be a connection between bipartite polygon-based graphs and space-filling bearings, which allow only cycles of even lengths [
37] in a way similar to the connection between original Apollonian networks and space-filling systems of embedded disks. Exploring this question goes, however, beyond the scope of this paper.
This paper restricts itself to just one class of possible generalizations of the Apollonian construction based on polygons of arbitrary sizes. It is quite easy to suggest various other generalizations. The most obvious example is, probably, the random polygon construction where new graphs are constructed not generation-by-generation by splitting all the polygons of the previous generation at once, but rather by randomly choosing on each step a face to split.
Figure 7b presents an example realization of such a tetragon-based random graph. In the standard Apollonian case, it is known that the exponent of the degree distribution is different for the regular and random constructions. Calculating the degree distribution of random Apollonian-like polygon-based graphs remains an interesting open question.
Another, this time a completely deterministic generalization, is as follows. Consider a polygon with an odd number of edges
. Put a point inside the polygon and connect it with all vertices of the polygon by chains of
m edges and
nodes. This splits a polygon into
faces, each having exactly
edges. On the next step, repeat this procedure for each of the faces and proceed ad infinitum.
Figure 7c shows the second-generation pentagon-based graph obtained via such procedure. Clearly, this construction is an even more direct generalization of the Apollonian graph construction (indeed,
case is just the Apollonian graph itself). However, it means that it has standard drawbacks of the Apollonian graph in a sense that it is a single deterministic object rather than a stochastic ensemble of graphs and that its limiting degree distribution is not a power law but rather a log-periodic function with a power law envelope.
We think that the classes of graphs presented here are a useful addition to the toolkit of toy models to model scale-free graphs. Indeed, while having the main advantages of the Apollonian networks, they have additional flexibility in a sense that one might regulate the slope of the degree distribution and the average clustering coefficient in the way described above. In particular, such graphs might be, in our opinion, useful in the applications where graph planarity is essential [
38], for example, in quantitative geography, such as the study of the formation of the systems of interconnected cities. On the other side, studying percolation, spectral properties, diffusion, synchronization, epidemic spreading, etc., on these generalized graphs might allow to systematically study the influence of varying degree exponents on these phenomena, which, to the best of our knowledge, have not yet been done for the scale-free planar networks.