1. Introduction
The performance of modern supercomputers is summarised by the TOP500 world rankings, which are published twice a year. By going through this list of high performance machines, which also gives computer architecture information, one can see that the torus topology is a popular interconnection network structure. This is the case, for instance, for the Fujitsu Fugaku supercomputer, which was ranked world no. 1 until May 2022 and is the world no. 2 at the time of the writing of this article [
1]. Other highly ranked supercomputers also rely on this topology, for example, the IBM BlueGene/Q, rated in 2010s the “Greenest Supercomputer in the World” [
2,
3], and the Fujitsu K, a former world number one supercomputer, which are both based on the torus network topology [
4,
5,
6]. An overview of the interconnection networks of the fastest supercomputers is given in
Table 1 (data taken from the TOP500 list of June 2023).
Together with this adoption of the torus topology for interconnects, another trend is the popularity of hierarchical interconnection networks (HINs) [
8,
9,
10,
11,
12,
13,
14]. Effectively, as HINs enable one to lower the diameter of the compute nodes network, they are key to reducing the network cost, which is defined as the product network diameter × network degree [
15,
16]. The network cost is a major measure in establishing how efficient an interconnect topology is. The interconnection network of the Fujitsu supercomputers K and Fugaku, called Tofu, is an HIN that combines two layers: the higher layer consists of a 3-dimensional torus and the lower layer is a 3-dimensional
-ary torus [
4]. Nodes of the higher layer are said to be “meta-nodes”, consisting of several compute nodes. As mentioned previously, the network cost, induced by the network degree and diameter, is now critical given the large amount of compute nodes that equip the TOP500 machines, in the order of one million; a low diameter and a small degree are two very desirable topological properties of a modern interconnect. Approaches similar to that of Tofu are also mentioned in [
17,
18].
In this paper, acknowledging on the one hand the torus topology and HIN trends of the high-performance computing industry, and aiming on the other hand at further lowering the diameter and degree of the network, for instance, compared to the Tofu interconnect, we propose the torus-connected toroids (TCT) interconnection network topology for supercomputers.
In order to have simple routing methodologies, proposing an HIN that links meta-nodes according to the
n-dimensional torus topology demands that one meta-node be made of
compute nodes. Based on older topologies such as the seminal cube-connected cycles (CCC) [
19], the torus-connected cycles (TCC) network topology was introduced as a first attempt to this end [
20]. However, the fact that a meta-node in a TCC is isomorphic to a cycle (ring) induces that the network diameter grows at a fast pace as the dimension of the network increases; thus, the network cost rapidly rises. Thanks to the design of the meta-nodes of the TCT topology proposed in this paper, meta-nodes that hold a greater number of compute nodes than their counterparts in the case of the Tofu topology, the arity of the higher layer torus is reduced in comparison to the same higher layer torus of Tofu when numerous processing nodes are to be connected as it is the case with modern supercomputers. Precisely, and as detailed hereinafter, a meta-node (i.e., toroid) of an
n-dimensional
k-ary TCT consists of
nodes, which is to be compared with the constant order twelve of a meta-node of Tofu.
The rest of this paper is organised as follows. After recalling notations and definitions, toroids are formally defined and evaluated in
Section 2. Then, the TCT topology is described in
Section 3. It is evaluated theoretically in
Section 4 and empirically in
Section 5. Finally,
Section 6 concludes this paper.
2. Toroids
The lower layer of a torus-connected toroids network consists of toroids. So, as explained, we shall consider in this paper one toroid in a TCT network as one meta-node (a.k.a. one cluster).
2.1. Preliminaries
Several notations and definitions used hereinafter are recalled in this section.
The two symbols ∧ and ∨ conventionally designate the logical conjunction and the logical disjunction, respectively. A path is an alternate sequence of distinct nodes and edges, for instance . The same path p can also be conveniently denoted with right-pointing arrows, in which case such arrows denote path concatenation between two paths, between one path and one node, or between two nodes, for instance, . The length of a path is its number of edges: the length of p is thus equal to . A path between two nodes u and v can also be abbreviated to when it is not ambiguous.
It is recalled that the graph order is its number of nodes, that the degree of a node is its number of adjacent nodes (a.k.a. neighbours), and that the diameter of a graph is the maximum length of a shortest path between any two nodes. In addition, the network cost is defined as the product of the average degree (i.e., network degree) by the diameter. For other graph theory notations and definitions, we generally abide by [
21].
Next, we recall the definition of the torus network topology, which is essential to our proposal.
Definition 1. An n-dimensional -ary torus is an undirected graph that consists of nodes, which are of the form with and . Two nodes and are adjacent if and only if holds, where .
In a torus, we call wrap-around edges the ones that do not exist in the corresponding mesh [
22].
2.2. Definition
A toroid is formally defined as follows.
Definition 2. An n-toroid consists of nodes, where . Let V be the set of the nodes of an n-toroid and define P the Cartesian product . Define when n is even and when n is odd. Two nodes and of V are adjacent if and only if one of the following conditions holds:
The
n-toroids where
are illustrated in
Figure 1.
Next, we make the following two remarks. First, when
n is even, an
n-toroid is isomorphic to a
-torus. Second, when
n is odd, an
n-toroid is isomorphic to a
-torus but with its two edges
and
being contracted [
21].
Regarding notations, since a toroid is structured according to three dimensions, they are naturally named X, Y, and Z, and they correspond to the coordinates , , and , respectively, of node . In addition, in an n-toroid with , the edges that link a node of Z coordinate 0 and one of Z coordinate are said to be wrap-around edges.
Finally, in an n-toroid, the address of the adjacent nodes of node u can be expressed function of n and u. We give the detailed expressions of the adjacent nodes of node u in an n-toroid.
The neighbour of u on the Y dimension is as follows:
Next, we distinguish between the two cases n even and n odd. First, we consider the case when n is even. The neighbours of u on the Z dimension are as follows:
Case .
Then, the neighbour of u on the X dimension is as follows:
Next, we consider the case when n is odd. The neighbours of u on the Z dimension are conditionally defined as follows:
Case or .
Case and .
Case and .
Then, the neighbours of u on the X dimension are as follows:
Finally, it should be noted that overlapping can occur: for instance, when n is even and , we have . This is why the above () expressions induce a set of adjacent nodes for node u, set whose cardinality is at most five.
2.3. Topological Properties
First, we consider the degree of a toroid. The following property can be directly derived from Definition 2.
Property 1. The degree of node in an n-toroid () is as follows: Second, we introduce the following theorem for the diameter of a toroid.
Theorem 1. The diameter of an n-toroid () is as follows: Proof. It is first recalled that the diameter is the maximal length of a shortest path between any two nodes.
Assume that n is even. Ignoring the wrap-around edges, the maximal distance on the Z dimension is equal to . Then, considering this time the wrap-around edges, the maximal distance on the Z dimension becomes . Moreover, the maximal distance on the X dimension is equal to one, and it is also equal to one on the Y dimension. Hence, in an n-toroid with n even, the maximal distance is equal to .
Assume that n is odd. The case is trivial since the graph consists of one single edge. So, we can assume that . Ignoring the wrap-around edges, the maximal distance on the Z dimension is equal to . Then, considering this time the wrap-around edges but ignoring the two nodes with and , that is, considering a -torus, the maximal distance on the Z dimension is equal to . Lastly, a maximum of two edges are needed to either move on both the X and Y dimensions, or to both reach a node of Z coordinate and move on the Y dimension (no move is possible on the X dimension from, or to, a node of Z coordinate ). Hence, in an n-toroid with n odd, the maximal distance is equal to . □
2.4. Shortest-Path Routing in an n-Toroid
Below, we first introduce a shortest-path routing algorithm in a toroid and next quantitatively evaluate it. This algorithm is then used for routing in a TCT as described in
Section 3.
2.4.1. Algorithm Description
The main idea of this routing method is to first distinguish the two cases
n even and
n odd. Then, dimension-order routing with support for wrap-around edges is applied, although it is adjusted for the Z dimension in the case
n odd. Concretely, this adjustment is about further distinguishing cases depending on the value of the X coordinate of the source node. The details of the algorithms for an
n-toroid are provided with pseudo-code in the following: Algorithm 1 is used for routing on either the X or Y dimension in an
n-toroid, while Algorithm 2 is used for routing on the Z dimension. Algorithm 3 is the main function that returns a shortest path between any two nodes.
Algorithm 1: route-xy(u, v) |
Input: two coordinates u, v of either the X or Y dimension, say dimension Output: a sequence of coordinates on the dimension if then return u; else return ; |
Algorithm 2: route-z(n, u, v) |
|
Algorithm 3: toroid-spr(n, u, v) |
|
2.4.2. Empirical Evaluation
In order to confirm the maximum path length induced by the described shortest-path routing algorithm in an n-toroid and to inspect its practical behaviour, we implemented it and ran a computer experiment. The experimental conditions were as follows. First, the toroid parameter n ranged from 1 to 8. For each value of n, 1000 random problem instances were automatically generated, that is, a source node and a destination node were randomly selected from the set of nodes of the corresponding toroid. Then, the routing algorithm was applied to select a path between the source node and the destination node.
The results of this simulation experiment are illustrated in
Figure 2: for each value of
n, the maximum length and the average length of the selected 1000 paths are calculated and plotted. The standard deviation with respect to the average path length is shown with vertical bars below and above the corresponding length. In addition, the theoretically established diameter in the corresponding toroid is also plotted for reference.
From these experimental results, it can be observed that the empirically measured maximum path length confirms the theoretically established toroid diameter. Precisely, the length of a selected path is at most in the case n is even, and it is at most in the case n is odd. So, considering that the length of some of the selected paths equals the network diameter, this means that the source and destination nodes of some of the random problem instances were diagonally opposed.
4. Theoretical Evaluation
We conduct, in this section, a formal and quantitative evaluation of the proposed TCT topology by comparing it to the Tofu interconnection network of the two Fujitsu supercomputers K and Fugaku.
The Tofu topology—precisely, that of Tofu 1 [
4], Tofu 2 [
25], and Tofu D [
5], indistinctly—connects meta-nodes according to a 3-dimensional torus network, with every meta-node being isomorphic to a
-torus. The network order of each meta-node is thus twelve. So, first, the degree of a TCT node is smaller than that of a Tofu node; precisely, in a TCT the node degree is at most six (refer to Properties 1 and 2), whereas it is ten for a Tofu node.
Next, we consider the network diameter of TCT and Tofu. On the one hand, the topology of each meta-node of Tofu does not depend on any variable as just explained, and on the other hand the network dimension of the higher layer of Tofu is constant: it is equal to three. As a result, there is no other choice with Tofu but to increase the arity of the network to augment the number of compute nodes included in the network. And this subsequently forces the network diameter to increase rapidly.
To further investigate this diameter issue, we compare the maximum path length induced by a dimension-order routing algorithm, which supports torus wrap-around edges, for both TCT and Tofu. This algorithm has been detailed for TCT in
Section 3.2, and the same approach can be applied to Tofu as well considering that meta-nodes are connected in the higher layer according to the torus topology (i.e., external edges) in both cases (i.e., Tofu and TCT), with a meta-node path being eventually transformed to a Tofu path after routing inside meta-nodes (i.e., internal edges). Let this algorithm be denoted by R.
We start with the Tofu topology in the following lemma.
Lemma 1. In a k-ary Tofu, algorithm R induces a path of length at most .
Proof. By definition, the Tofu topology requires a maximum of external edges for each dimension of the higher layer torus. Because this higher layer torus is 3-dimensional, in total a maximum of external edges are selected. Lastly, three is added to count the internal edges required inside the meta-node of either the source node or the destination node; it is recalled that one single such routing inside a meta-node is required in the case of Tofu. □
Next, we consider the TCT topology in the following theorem.
Theorem 2. In a , algorithm R induces a path of length at most .
Proof. For each dimension, a maximum of external edges are required. Since there are n dimensions, the algorithm selects a maximum of external edges in total. Regarding internal edges, a maximum of two are required inside a toroid at a dimension change, a maximum of inside the toroid of the source node and that of the destination node, and one for each of all the other toroids (i.e., meta-nodes) included in the path. Moreover, since there are n dimensions, there are a maximum of toroids inside which routing is needed for a dimension change. Hence, a maximum of internal edges are selected in total. Therefore, the length of a selected path is at most . □
Corollary 1. In a , the worst-case time complexity of algorithm R is .
Proof. As explained in
Section 3.2, each edge of the path can be selected in constant time
. Moreover, from Theorem 1, it is clear that
is
. Hence, from Theorem 2 and considering that
n and
k are positive integers, the worst-case time complexity of algorithm R is
. □
Corollary 2. The value is an upper bound on the diameter of a .
Proof. From Theorem 2, a path of length at most exists between any two nodes of a . Hence, the diameter of a network is at most equal to l. □
To further clarify the evolution of the network order and maximum path length depending on the TCT parameters n and k, and to investigate with actual values how TCT compares to Tofu, we consider below sample dimension and arity values. It should be noted that for the comparison to be fair, both networks, that is, TCT and Tofu, should be of same or near same order. It is recalled that a k-ary Tofu includes compute nodes whereas a has .
For example, on the one hand in a
the network order is equal to 104,976 and the maximum path length induced by algorithm R is 30. On the other hand, for the same maximum path length 30, a Tofu network only connects a maximum of 82,308 compute nodes (i.e., Tofu arity
). A Tofu network, with a maximum path length of 36, can accommodate a maximum of 146,004 compute nodes (i.e., Tofu arity
). This is significantly less than the 1,180,980 compute nodes supported by a TCT network with the same maximum path length (i.e.,
). In order to link at least that same number of compute nodes, a Tofu network has a maximum path length of at least 72, for instance, a 46-ary Tofu has 1,168,032 nodes and a 47-ary Tofu 1,245,876, with these two cases both inducing a maximum path length of 72. Although large, these network orders remain definitely realistic: modern supercomputers include hundreds of thousands, and in some cases millions, of compute nodes (e.g., the Fujitsu Fugaku, based on Tofu D, includes 7,630,848 and the Sunway Taihulight 10,649,600). In order to complete the illustration of the network order–maximum path length evolution trend and comparison, we give additional values in
Table 2.
Next, we focus on the bisection bandwidth to compare the proposed TCT network topology to Tofu.
Definition 4 ([
22])
. The bisection bandwidth of an interconnection network is the minimum number of edges to remove in order to split the network into two sets of nodes of same size. On the one hand, it is recalled, for instance in [
26,
27], that the bisection bandwidth of an
n-dimensional
k-ary torus network is equal to
when
and to
when
. Hence, given that an
n-dimensional
k-ary TCT network connects its meta-nodes, that is, its toroids, according to the torus topology, its bisection bandwidth is that of an
n-dimensional
k-ary torus.
On the other hand, the bisection bandwidth of the Tofu interconnection network can be derived from its topology [
4,
5,
28] as follows. Each node has degree 10, out of which four links are used for connection inside the meta-node and six links are used to connect meta-nodes together according to a 3-dimensional
k-ary torus network. In other words, between any two adjacent meta-nodes of Tofu, there are in total 12 links. Therefore, the bisection bandwidth of Tofu is
when
and
when
.
As a result, the bisection bandwidth of the proposed TCT network topology is greater than that of Tofu when
(
) and when
(
). Hence, as soon as the dimension and arity of the network increase (cf.
Table 2 for sample dimension and arity values), the bisection bandwidth of a TCT overcomes that of Tofu. Assuming a link bandwidth of 5 GB/s [
5], we obtain the bisection bandwidth values detailed in
Table 3.
Finally, we consider the more advanced routing scenarios described earlier: fault-tolerant routing and parallel routing with disjoint paths: one-to-many (a.k.a. node-to-set) routing, many-to-many (a.k.a. set-to-set) routing, and pairwise routing (i.e., a special case of many-to-many routing).
First, it is recalled from [
23] that in an
n-dimensional
k-ary torus with a maximum of
faulty nodes, a fault-free path of length at most
can be selected in
time, where
is the diameter of an
n-dimensional
k-ary torus with
as recalled and explained in the proof of Theorem 2. Hence, by applying the routing strategy described in
Section 3.3, in an
n-dimensional
k-ary TCT network with a maximum of
faulty nodes, a fault-free path of length in the higher layer at most
can be selected in
time. This is to be compared to Tofu: since it is based on a 3-dimensional torus, only a maximum of
faulty nodes are tolerated with this approach, and we have already discussed the diameter and path length issues for TCT and Tofu, with the former being more performant than the latter.
We can then consider tolerance to faulty clusters: it has been shown that in an
n-dimensional
k-ary torus with a maximum of
faulty clusters of diameter one, a fault-free path of length at most
can be selected in
time, with
F the set of faulty nodes induced by the faulty clusters [
24]. So, by applying the routing strategy described in
Section 3.3, in an
n-dimensional
k-ary TCT network with a maximum of
faulty clusters of diameter one, a fault-free path of length in the higher layer at most
can be selected. This is to be compared to Tofu: again, a maximum of
faulty nodes only are tolerated with this approach, and, as discussed previously, with a large value of
k (cf.
Table 2) the path in the higher layer would be significantly longer, thus inducing a greater maximum path length in Tofu.
It is here essential to note that unlike Tofu, the proposed TCT topology enables
efficient reuse of existing routing algorithms such as those for fault-tolerant point-to-point and node-to-set routing [
23], set-to-set routing [
29], pairwise routing [
30], and cluster-fault-tolerant routing [
24]. Precisely, “efficient” here stands for increased fault tolerance: a maximum of
faulty nodes in the case of TCT and only 5 in the case of Tofu, an increased number of selectable parallel paths, and a reduced maximum path length as further detailed below.
In the case of fault-tolerant one-to-many routing, it is recalled from [
23] that in an
n-dimensional
k-ary torus with a maximum of
faulty nodes,
fault-free disjoint paths of lengths at most
can be selected in
time. Therefore, by applying the routing strategy described in
Section 3.3, in an
n-dimensional
k-ary TCT network with a maximum of
faulty nodes,
p fault-free disjoint paths of lengths in the higher layer at most
can be selected in
time. This is to be compared to Tofu: since it is based on a 3-dimensional torus as explained previously, a maximum of
faulty nodes only are tolerated with this approach, with
. The maximum number of selectable parallel paths for the disjoint-path one-to-many routing scenario is thus significantly reduced compared to that in the case of the proposed TCT topology given its higher values of
n (cf.
Table 2).
Regarding many-to-many routing, it is known that in an
n-dimensional
k-ary torus,
disjoint paths of lengths at most
can be selected in
[
29]. Hence, by applying the routing strategy described in
Section 3.3, in an
n-dimensional
k-ary TCT network,
disjoint paths of lengths in the higher layer at most
can be selected. This is once again to be compared to Tofu, which would allow only six disjoint paths to be selected and used in parallel, and, as discussed previously, with a large value of
k (cf.
Table 2) the path in the higher layer would be significantly longer, thus inducing a greater maximum path length in Tofu.
Finally, regarding pairwise routing, it has been established that in an
n-dimensional
k-ary torus,
n disjoint paths of lengths at most
can be selected in
time. Therefore, by applying the routing strategy described in
Section 3.3, in an
n-dimensional
k-ary TCT network,
n disjoint paths can be selected and used in parallel, and this number of parallel paths would be reduced to only 3 paths in the case of Tofu. And, once again, as discussed previously, with a large value of
k (cf.
Table 2) the path in the higher layer would be significantly longer in the case of Tofu, thus inducing a greater maximum path length in Tofu.