1. Introduction
Urban functional zones serve as crucial spatial components for fulfilling various economic functions within cities. Different types of functional zones contribute to the formation of distinct urban spaces [
1,
2]. Accurate classification of these zones provides essential data support for urban management, planning, land policy formulation, and residents’ daily lives [
3,
4,
5]. Research on urban functional zone classification primarily focuses on utilizing diverse datasets including POI data, trajectory data, and hyperspectral remote sensing imagery to classify spatial boundaries and identify zones with similar urban characteristics or functions in order to effectively describe urban morphologies [
6,
7,
8,
9,
10]. Among these approaches, POI data have received increasing attention due to their rich semantic features, low acquisition cost, high timeliness, and high presentationality [
10,
11,
12,
13,
14].
Numerous studies have been conducted by scholars to explore the effective utilization of POI data for delineating and understanding urban functional zones, yielding notable results. For instance, Deng et al. [
10] employed grids as classification units and functional categories to analyze urban traffic patterns. Meanwhile, Deng et al. [
12] established urban functional zones by calculating the proportion of POI categories within each grid. Similarly, Chen et al. [
13] identified neighborhood functional categories using POI data, facilitating the evaluation of urban planning and design effectiveness. Additionally, Du et al. [
14] correlated POI data with zones delineated by the road network, enabling the classification of functional categories and further investigation into the driving factors behind urban heat island phenomena. A comprehensive analysis of the aforementioned literature reveals that many existing methods for delineating urban functional zones rely on dividing the study area into grids or blocks as basic spatial units. These methods then generalized the functional attributes of POIs within these units to generate functional zones. However, utilizing grids or neighborhoods as basic spatial units may pose challenges such as inaccurate classification of functional categories due to the large spatial extent of these units and limited correlation with POI data.
Functional zone classification based on traditional spatial units is no longer sufficient to meet the current demand for refined and intelligent urban construction, management, and planning [
15,
16]. The increasingly complex and diversified internal structures of cities are driving the demand for more refined spatial units, as seen in current smart city and real-life three-dimensional construction scenarios. Buildings are crucial for people’s production, life, and urban economic development [
15]. Building data, serving as the fundamental spatial cognitive objects in urban morphology and structure, play a crucial role in numerous practical applications, including urban planning, land use analysis, population estimation, etc. [
17]. POI data, which are closely tied to buildings, exhibit a strong spatial correlation with building data. Utilizing POI data enriched with semantic features to classify the functional categories of building spatial units contributes to a more refined and precise expression of urban functional zones, enhancing the granularity perceived within a city [
9,
16]. However, achieving this refined expression often necessitates efficient processing of vast amounts of building and POI data. Automating or even imbuing intelligence into the task of classifying building functional categories is pivotal to enhancing the capability for fine-grained urban functional zone delineation.
Knowledge engineering offers a novel solution for efficiently processing large datasets, advancing data processing from automation to intelligence. The knowledge graph, a significant outcome of knowledge engineering since its inception by Google in 2012 [
18,
19], has revolutionized critical fields such as intelligent searches, Q&A, and recommendations. It functions as a semantic network and knowledge base with a directed graph structure that explicates entities (concepts) and their relations in the physical world through ternary <Entity, Relation, Entity> [
18] representations, thereby empowering computers with explainable, comprehensible, and rational intelligence [
20]. In recent years, scholars have increasingly applied knowledge graphs in geoscience, particularly in geographic information retrieval [
21,
22], spatio–temporal information mining [
23], epidemic prevention, and disaster management [
24,
25]. However, effectively applying knowledge graphs to classify building functional categories remains an area ripe for further exploration, promising to enhance the refinement and intelligent expression of urban functional zones. Leveraging the advantages of a comprehensive knowledge graph based on POI data and building data with rich semantic characterizations enables the establishment of a POI semantic characterization knowledge graph. This knowledge graph can classify building space units into functional categories, enhancing the refinement and intelligent of urban functional zone expressions and improving the perceived granularity of cities [
9,
16].
Therefore, to achieve a more refined and intelligent expression of urban functional zones, we propose a method supported by a POI semantic characterization knowledge graph for classifying the functional categories of buildings. The approach involves designing a Delaunay triangular network combined with buffer zones to match buildings with POIs. Subsequently, entity extraction and relation establishment are used to construct a knowledge graph. Furthermore, a Z-score-supported functional category delineation model is developed to enhance the rationality of the functional category delineation results. Combined with the functional category classification model, a building functional category reasoning method is designed to effectively reason about building functional categories. Finally, the effectiveness of this method is validated through experiments, aiming to advance the refinement and intelligent division of urban functional zones and elevate the service level of urban management and planning.
4. Discussion
Most of the existing studies on urban functional districts achieve the construction of functional districts by summarizing the POI attributes within the spatial units of blocks or grids. To promote a more refined and intelligent division of functional zones, we have conducted research on constructing and reasoning with a knowledge graph to categorize building functions. This research has achieved a high recognition rate and correctness rate for building functional categories, providing robust support for city management and planning.
In terms of the use of reference data for delineating building functional categories, we relied solely on POI data for classifying building functions. However, for achieving more precise classification of building functional categories, relying solely on POI data was inadequate, and we required diverse types of data. In the future, we will incorporate data related to daily human activities (e.g., human mobility [
43,
44], travel behaviors [
45,
46], mobile signaling [
47,
48], etc.) to explore the correlations between buildings and peoples’ daily routines. This approach will help us extend the existing knowledge graph and will enable more accurate and intelligent classification of building functional categories.
In terms of the recognition rate for inferring building functional categories, this paper achieved a lower rate compared to Method 2. One significant reason was that our method relied solely on buildings with first-order neighboring relationships (with matched POIs) to infer functional categories for buildings with unmatched POIs. Therefore, in cases like B1 in
Figure 15, we could not determine the functional category of B1 because none of its first-order neighboring buildings (B2, B3, and B4) had matched POIs. To address this limitation, we proposed considering higher-order neighboring buildings (e.g., second, third, fourth, etc.) of B1 for functional category inference when first-order neighbors were insufficient. Given that spatially proximate entities often exhibited similar characteristics due to the first law of geography, our approach involved carefully considering the interaction degree of functional categories and spatial distances when reasoning about building functional categories using higher-order neighboring entities. In our future work, we aim to improve the recognition rate of knowledge-graph-supported methods for classifying building functional categories. This will be achieved by carefully considering distance factors and conducting research on higher-order-based inference methods.
In terms of the accuracy of building functional category inference, the method proposed in this paper achieved a higher accuracy rate compared to Method 2. However, there were instances of misclassification in specific areas within the district. For example, as shown in
Figure 16, buildings B1 to B4 are located in the residential area “Mengzi Road 601 Long Unit” and should be classified as residential based on their functional categories. Nonetheless, due to B1 containing two commercial POIs, it was incorrectly classified as commercial using the method described in this paper. To address this issue, we initially attempted to improve the accuracy by removing some low-recognition POI points, such as convenience stores. However, this approach alone does not provide a comprehensive solution to such problems. In the future, we will integrate AOI (area of interest) data, enhance our knowledge graph by establishing associations between AOI data and building data, and develop corresponding reasoning rules for building functional categorization. This approach will aim to improve the accuracy of building functional category classification results.
Further integration of POI semantic embedding technology will enhance the recognition rate and accuracy of functional category classification in our approach. POI semantic embedding improves the semantic understanding and reasoning performance of computers for POIs by virtue of its expression of POI information in the form of a continuous vector space [
49]. Consequently, POI semantic embedding has been constantly utilized for delineating functional zones by enhancing the accuracy of urban zone delineation through embedded representations of multiple POI functional categories [
50,
51]. In our forthcoming work, we will focus on POI semantic embedding technology and explore its integration with a knowledge graph to develop a method for constructing functional category classification supported by POI semantic embedding. Leveraging POI embedding will enrich the representation of POIs’ diverse functional category information and enhance the granularity of the knowledge graph’s depiction of POI functional categories. Additionally, we will design reasoning rules that combine POI embedding to improve the accuracy and correctness of building functional category classification.
Our research results show promising outcomes in multi-scale representation of functional zones. The knowledge graph we constructed accurately classified building functional categories, fulfilling the requirements for generating and depicting functional zones at various scales (such as at the building level, neighborhood level, block level, township level, city level, etc.). Enabling the expansion of building functional areas to larger scales often involves cartographic synthesis guided by existing knowledge graphs [
52] to enhance their capability to reason and represent functional categories across different scales. In our future work, we intend to integrate knowledge graph technology with cartographic generalization technology [
30,
37]. This will involve refining our current knowledge graph and optimizing reasoning rules by implementing techniques such as merging, deleting, and shifting to enable a cross-scale representation of buildings. Ultimately, we aim to develop a versatile knowledge graph capable of classifying building functional categories and generating multi-scale functional zones. This approach will support automatic and intelligent representation of urban functional areas and will meet various scales of urban management and planning requirements.
5. Conclusions
To enhance the sophistication and intelligent representation of urban functional zones, we proposed a building functional classification method supported by a knowledge graph. The method primarily included four main processes: building and POI matching, building knowledge graph construction, building functional classification model design, and building functional category reasoning based on the knowledge graph. The effectiveness of our method was verified through a case study using building and POI data from Shanghai. The following conclusions were drawn: (1) In terms of the recognition rate, our method had a high recognition rate, reaching 94.94%, and demonstrated strong adaptability to buildings with low or missing POI density. (2) In terms of the correctness rate, our method was equivalent to Method 2 with a high correctness rate of 92.40%, making it applicable for the automatic reasoning and classification of large-scale building functional categories.
Our method can be adapted to the delineation of functional categories of buildings in large-scale and large areas and can promote the refinement and intelligent expression of urban functional zones. However, our method has some limitations in the delineation of building functional categories, and there are inaccuracies in the delineation of building functional categories for similar residential zones. In our future work, we will consider expanding the knowledge graph by adding multivariate data to improve the correctness of the classification of building functional categories. Simultaneously, we will integrate comprehensive cartographic knowledge into the knowledge graph to enhance the multi-scale representation of building functional zones.