4.3.1. The Effectiveness of The Proposed Method
First, we took traffic analysis zones (TAZs) as the research units and processed the taxi trajectory data within the fifth ring road of Beijing on 13 March 2017 (Monday). We aggregated these data to construct a network of spatial interaction. The network consisted of 584 TAZ units as nodes, and the edges represent the interaction volume between the research units. Based on the start timestamps, we split all the trip data into 12 time snapshots separated by two hours.
To better illustrate the training effects of the proposed method, the loss function during training is visualized (
Figure 7). Clearly, the loss of the proposed method converged during training, and it can also be seen that the modularity gradually increased and then stabilized.
Firstly, for the quantitative evaluation, all communities detected during the four time snapshots, including both the morning and evening peak hours on Mondays, were selected for assessment. We conducted three repeated experiments and reported the mean and standard deviation of each metric.
Table 4 shows the comparison of the Modularity metric,
Table 5 shows the comparison of the Average Density (AD) metric,
Table 6 shows the comparison of the Average Conductance (AC) metric, and
Table 7 shows the comparison of the Average Clustering Coefficient (ACC) metric.
As shown in
Table 4, the modularity of the communities detected by the proposed method and the comparison methods was always higher than 0.3, and the modularity of the proposed method was slightly higher than that of the comparison methods. This indicates that the proposed method can effectively distinguish the community structures in the spatial interaction network. As shown in
Table 5, the average density of the communities detected by the proposed method was slightly higher than that of the Leiden method and the GCN-based method, suggesting that the communities detected by the proposed method were, on average, more closely connected, which is generally considered to be better in community detection. As shown in
Table 6, the average conductance of the communities detected by the proposed method and the method based on GCN was higher than that of the Leiden method, indicating that the communities detected by the Leiden method had less connectivity between communities, which is its advantage; compared to the GCN-based method, the communities detected by the proposed method had less connectivity, implying that the proposed method detected communities that were more separable than those detected by the GCN-based method. As shown in
Table 7, the average clustering coefficient of the communities detected by the proposed method was higher than that of the Leiden method and the GCN-based method, indicating that the communities detected by the proposed had have a higher degree of aggregation among community nodes.
In summary, the communities detected by the proposed method exhibited a good performance on all four evaluation metrics. Considering these metrics together, it can be concluded that the proposed method achieved a satisfactory level of rationality in detecting spatial community structures.
Secondly, for the qualitative analysis, we selected four time snapshots containing two special time periods: the morning rush hour and the evening rush hour. We compared our method with the widely recognized Leiden method, and the community detection results of both methods are shown in
Figure 8. The number of communities detected by the Leiden method was generally less, with larger community sizes and more continuous structures [
30]. In contrast, the communities detected by our method tended to have smaller sizes compared to those detected by the Leiden method.
Meanwhile, we further analyzed the community detection results of our method by combining them with a geographic base map and highlighted three core communities. Compared to the Leiden method, the communities detected by our method, as shown in
Figure 9a, primarily included scenic spots such as the Summer Palace, Fragrant Hills Park, and West Lake in Community 4; educational institutions like Tsinghua University, Peking University, and Renmin University in Community 6; and areas with transportation facilities like Beijing West Railway Station, Beijing South Railway Station, and Beijing Fengtai Railway Station in Community 5. However, the communities detected by the Leiden method failed to effectively differentiate functional areas. Overall, the Leiden method is an algorithm based on modularity optimization and suffers from the problem of resolution limitation. That is, the modularity is sensitive to the size of the community. The value of modularity may overestimate the existence of large communities and ignore the existence of small communities. In contrast, the proposed method is based on the original network structure for the optimization of community results, which is able to solve the resolution limitation problem due to the modularity and detects a more fine-grained community structure.
Since the proposed community detection method in this paper utilizes hyperbolic graph convolutional modules for embedding in the network embedding part, to verify the advantage of introducing hyperbolic space in our method, we conducted ablation experiments using models based on Graph Convolutional Networks (GCNs) as the backbone network. The GCN-based community detection method employs Euclidean embeddings, which require a large number of dimensions to capture complex relationships. In contrast, our method integrates hyperbolic geometry into the network embedding module to handle complex networks, particularly those in spatial interaction networks. Specifically, we achieved this by substituting Euclidean space with hyperbolic space. We applied the t-SNE algorithm to reduce dimensionality and visualize the features obtained from both the GCN embedding and hyperbolic graph convolutional embedding, as shown in
Figure 10.
To verify the superiority of the community detection results, under the same conditions, we further conducted community detection using a model based on Graph Convolutional Networks (GCNs) as the backbone network. We selected four time periods, including both the morning and evening rush hours, and compared the results with our method. The community detection results are shown in
Figure 11.
Based on the feature visualization after the t-SNE dimensionality reduction (
Figure 10), we found that the embedded features of our method exhibited a better out-of-cluster separability and in-cluster cohesion by introducing hyperbolic space. Thus, the separability of communities can be improved by introducing the hyperbolic space. In addition, we found that the features obtained after the convolutional embedding of the hyperbolic map approximated a circular distribution.
Comparing the community detection results of our method and the ablation experiments based on GCN modules (
Figure 11), we found that the proposed method detected more fine-grained communities. For example, during the time period from 6 to 8 a.m., community C1 identified by the GCN-based method corresponded to community C2 and community C12 identified by the proposed method; during the time period from 8 to 10 a.m., community C7 identified by the GCN-based method corresponded to community C4 and community C6 identified by the proposed method; during the time period from 4 to 6 p.m., community C5 identified by the GCN-based method corresponded to community C3 and community C10 identified by the proposed method; and in the time period from 6 to 8 p.m., community C2 identified by the GCN-based method corresponded to community C4 and communities C10 and C14 identified by the proposed method.
Since most residents have the same travelling purpose during weekday commuting time, there was some similarity in their community structure. To further validate the stability of the community detection results of the proposed method, we used this as a reference to conduct experiments on snapshots containing weekday commute times.
Figure 12 shows the results of the community detection, comparing the vertical distribution of communities in each column, where the locations of larger communities were basically the same. The results show that our method could consistently detect relatively stable community structures, indirectly confirming the stability of our model.
In addition, we conducted experiments using data from the same time snapshots on weekends, and the community detection results obtained are shown in
Figure 13. Comparing the community detection results between weekdays and weekends, it is evident that the community structure on weekends differed significantly from that on weekdays, especially during Sunday mornings from 8 to 10 a.m., where the community structure appeared fragmented, indicating a diversified travel pattern during this time period.
Overall, our comparative analysis between weekdays and weekends confirms the differences in travel patterns between these two time periods.