1. Introduction
The rapidly developing urban subway system has greatly improved the accessibility of the area around the station and attracted a large number of people and various services to gather in this area, thus forming an urban space with the station as the center and people’s daily activities influenced by the station to a certain extent [
1,
2]. This is the rail transit station influence realm (referred to as station realm in this paper). In Hong Kong, rail transit shares 43.9% of the daily public transportation trips of permanent residents, and nearly 50% of citizens live within a radius of 500 m from the station [
3]. In Tokyo, where rail transit is developed, the rail transit travel share rate in Tokyo’s 23 districts exceeded 77% in 2018 [
4]. In Northwest China, where the development of rail transit is relatively late, the share of rail transit in urban public transport in Xi’an will be over 50% in 2021 [
5]. In addition, rail transit–oriented development models such as TOD [
6], station-city integration [
7], and station integrated development [
8] are favored by urban planners and managers around the world, especially for developing countries facing traffic congestion problems caused by rapid urbanization. Thus, the urban rail station area has become an important region to play the function of gathering the city, to create public transportation to support and guide urban development, and to improve the quality of the urban environment [
9,
10]. Therefore, in order to develop a policy for sustainable spatial development of the station realm that meets public demands, urban managers and designers must improve their knowledge of the spatial pattern of the area. This is especially true when understanding the intrinsic influence relationships between functional facilities and population activities.
In general, the construction of major transportation infrastructure affects the distribution of urban elements and thus reshapes the spatial pattern of cities [
11,
12]. In previous studies, many scholars have revealed the spatial features of the rail transit station domain in terms of land use structure [
13,
14], industrial distribution [
15,
16], etc., but there is little literature addressing the detailed revelation of the spatial features of human activities in the station domain, especially considering the dynamic distribution changes of population activities in space under different time periods. Meanwhile, most studies have focused on the mechanisms of rail transit’s influence on land use [
17,
18], accessibility [
19,
20], and land price [
21,
22] around stations, while few studies have explored the relationship between urban functions and population activities, especially considering the dynamic influence mechanisms under different time periods of working days and off days. The public space around the rail station is a special space dominated by the activity trajectory of residents, and people are the main actors of activities in the public space [
23]. Therefore, population activities are an important part of the space of the rail transit station realm. Consequently, to complement the above research status gap, we attempt to identify the spatial point pattern of population activities and urban functions in the station realm using more refined and real-time positioning big data on the ArcGIS platform and further develop regression models to explore the dynamic influence mechanism between population activities and urban functions.
In today’s rapidly developing information technology, big data has greatly enriched the methods and ways for researchers to obtain data. Compared with the traditional survey methods based on observation and experience, big data has the advantages of rich sources, large sample size, and real-time, which can reflect the characteristics of survey subjects more objectively [
24]. Although many researchers have used multi-source data such as mobile phone signaling [
25,
26], the Metro Automatic Fare Collection System (Metro AFC) [
27,
28], social media [
29,
30,
31], and Wi-Fi signaling technology [
32] to explore user travel activities and distribution features, the above big data sources have some limitations in the use of the microscopic station space region. Mobile phone signaling technology is usually used to extract the Original and Destination (OD) data of residents’ traffic travel [
33], but the distribution density of communication base stations is small (the coverage radius of urban areas is generally 100~500 m) [
34]. Using the base station to send and receive data may cause spatial positioning errors, which in turn affects the accuracy of crowd identification. Metro AFC is a system that uses automatic ticket gates to record the exit and entry station and time of each passenger and uploads this data to a database with a specific frequency [
35], the data can provide a large amount of passenger travel information, including passenger codes, entrance times, exit times, departure and destination stations, etc. However, the AFC data are only applicable to the statistics of the total passenger flow of the rail transit network, and the real-time passenger distribution in space is not available [
36]. In addition, the use of social media platforms such as Facebook, Twitter, and Weibo to collect analytical data related to population activity trajectories is not sufficient to represent the activity characteristics of people of all ages, since most of the contributors are young people [
37]. The Wi-Fi signaling technology needs to plan and deploy significant signal equipment to ensure full coverage of signals in the station area region in order to ensure the accuracy of data collection; however, this greatly increases the research cost [
38]. In this study, we choose the mobile terminal program developed by WeChat, Easygo applet (which displays the thermal distribution of people flow in the currently selected area based on the map). These data have the following advantages [
39,
40]: (1) strong real-time performance. The data of WeChat comes from the GPS positioning information of the mobile intelligent terminals of the crowd. It has the characteristics of dynamic update and real-time feedback; (2) high accuracy: the extraction range of the data of Easygo is only a 25 m × 25 m grid, which is more suitable for research at the micro-scale; (3) high coverage: the positioning data of Easygo mainly comes from WeChat, a social software owned by Tencent, which covers a wider population in China and has strong utilization value; (4) easy availability: Easygo directly provides an API interface to make data easier to obtain directly. Therefore, the real-time positioning data of Easygo is suitable for this paper to identify the population distribution and changes in the station realm at the microscopic scale.
The study has two main objectives. The first one is to demonstrate the use of positioning big data to explore the distribution features of population activities and functional facilities within the study area at the microscopic scale. The second objective is to analyze the relationship between population activities and functional facilities over time using ordinary least squares (OLS) models. In summary, the main purpose of our study is to take the rail transit station realm as the research unit, use real-time positioning data from Easygo and POI data from AutoNavi Map, and visualize and analyze the spatial distribution of population activities and functional facilities within the station realm based on the spatial pattern tool of geographic information system (GIS). Then, an ordinary least squares (OLS) model is used to further explore the correlation between the spatial characteristics of population activity distribution and functional facilities and to analyze the dynamic influence relationship between them.
The main contributions of this paper are as follows:
Expansion of theoretical research. Focusing on the dynamic spatial distribution changes of population activities, we study in depth the inner mechanism of population activities and various functional facilities, which enriches the theoretical system of spatial pattern research in the station realm.
Innovation in research scale. By taking the station realm as an independent and complete spatial unit, we break the limitation of traditional urban spatial boundaries, and thus realize the spatial pattern features of population activities and functional facilities from a more microscopic perspective.
Innovation of technical means. We use real-time and more accurate positioning data (Easygo data), an analysis framework combining spatial point pattern identification technology, and an ordinary least squares (OLS) regression model. Based on identifying the spatial distribution features of population activities and functional facilities in the station realm, the dynamic influence relationship between the two is further reflected.
The rest of this paper is organized as follows.
Section 2 presents the study area.
Section 3 explains the basis for selecting research stations, the method for defining the station realm, the source of the study data, and the analysis method.
Section 4 shows the results of the kernel density analysis and OLS regression analysis.
Section 5 and
Section 6 provide a discussion and summary of the analysis results.
2. Study Area
Xi’an is the capital city of Shaanxi Province and the political, economic, and cultural center of Northwest China, with a population of 12.95 million [
41]. At present, there are eight rail transit lines in operation in Xi’an. In 2011, the first Metro Line 2 was put into operation, with a total length of 26.7 km and a total of 21 stations [
42]. It is not only the line with the highest average daily passenger flow in Xi’an (
Figure 1), but also the core line that overlaps with the city’s central axis and connects multiple functional development areas in the north and south. Metro Line 2 is a very important case study, because it has a profound impact on the urban spatial structure along and around the station. Therefore, fully grasping its characteristics and laws has strong reference significance for the sustainable development of the future metro station surrounding areas in Shaanxi Province and even the entire Northwest China.
In this paper, each station of Line 2 is classified based on the method proposed by Duan Degang et al. [
43] to classify station types by the land use composition around the station (
Table 1). Under the premise of including all types and considering the geographical location in Xi’an where the stations are located (
Figure 2), we selected a total of four typical stations, namely Xing Zheng Zhong Xin Station, Long Shou Yuan Station, Bei Da Jie Station, and Wei Yi Jie Station, as the research objects of this paper (
Table 2).
4. Results
4.1. The Spatial Distribution of Population Activities
The size of the search radius in the kernel density analysis has a significant effect on the refinement of the analysis results. According to the related literature [
62], we took the average side length of 200 m of the natural blocks composed of roads in each study station realm as the threshold value and took the radius of 50 m, 100 m, 150 m, 200 m and above, respectively, for the detection analysis. The results show that there are too many first-level hotspots within 100 m (
Figure 9a) and too many integrated above 200 m (
Figure 9e), while the hotspot levels between 100 and 200 m are basically stable (
Figure 9b–d), which only affects the smaller levels of hotspots. Therefore, this paper takes the middle number of 150 m as the search radius.
The visualization of the kernel density results of the population activities in each research station realm is shown in
Figure 10. The results show that the location, shape, and size of the population activity aggregation area during the day change spatially along with time, which indicates that the population activity in the station realm is spatially non-stationary. In terms of the overall spatial distribution, we measured the distance between the peak areas of the nuclear density of population activity (dark areas) and the stations in each research station realm using the metric tool of ArcGIS, and we found that these peak areas were mainly distributed within 500 m of the station perimeter (
Figure 11), which indicates a high level of population activity within this range. In terms of spatial position, the population activities in the XZZX station realm are mainly concentrated in the large shopping mall on the south side; the population activities in the LSY station realm are mainly distributed in the neighborhoods and commercial complexes around the station; the population activities in the BDJ station realm are mainly concentrated in a number of large medical institutions on the east side of the station and the large shopping mall on the south side; there are several universities and urban villages in the WYJ station realm, so the population activities are mainly concentrated in these areas.
In terms of temporal distribution features, the peak kernel density of population activities in the four study station realms for each period on working days and off days is shown in
Figure 12. During working days, population activity is basically dominated by commuting characteristics, with the morning peak period generally lasting from 7:00 a.m. to 9:00 a.m., and the evening peak mainly concentrated between 6:00 p.m. and 8:00 p.m. On off days, the peak time for population activity is generally at or after 11:00 a.m. This indicates that on off days people choose to be out later compared to working days due to the absence of demands such as work and school. In addition, the mean value of the kernel density of population activity is higher on off days than on working days after 11:00 a.m., which could indicate that the population is more active on off days than on working days during that period.
It is worth mentioning that the population activity in the LSY Station realm shows a continuous increase between 6:00 and 10:00 p.m., and the kernel density of population activity reaches its highest value at 10:00 p.m. The field survey reveals that a large number of night markets appear around the station during this period, which attracts the population to gather here.
4.2. The Spatial Distribution of Functional Facilities
The results of the kernel density analysis for each type of POI in the four research station realms are shown in
Figure 13.
As shown in
Figure 13a, the POIs of various functions in the Xing Zheng Zhong Xin station realm show a spatial distribution feature of high in the south and low in the north. The POIs of various functional facilities in the Long Shou Yuan station realm are generally distributed around the station (
Figure 13b), especially the six types of facilities, namely shopping POIs, catering POIs, living POIs, transportation POIs, government agency POIs, and accommodation POIs, which have the characteristic of higher density the closer they are to the station. As shown in
Figure 13c, the distribution features of each functional facility in the Bei Da Jie station realm differ significantly. Five types of facilities, namely shopping POI, catering POI, living POI, accommodation POI, and financial POI, are more densely distributed in the south of the station realm than in the north. In addition, the density distribution of medical and health POIs is much higher in the east-central part of the station realm than in other regions. The results of the kernel density analysis of the station realm of Wei Yi Jie (
Figure 13d) show that we find that the POIs of each functional facility are distributed along both sides of Chang’an South Road (north–south direction) and Chang Ming Road (east–west direction).
In general, shopping service POIs, catering service POIs, and living service POIs show spatial distribution features around the stations in each research station realm. However, the different land composition within diverse types of station realms makes the density distribution features of functional facilities vary significantly, such as entertainment POIs, government agency POIs, and medical and health POIs. However, through comparative analysis, we found that population activities and functional facilities have geographically similar peak kernel density coverage areas in the station realm, which indicates that areas with a high-density distribution of functional facilities within the station realm are more attractive for people to move around in.
4.3. Spatiotemporally Varying Impact of Functional Facilities on Population Activities
In order to more accurately assess the dynamic influence relationship between functional facilities and population activities, based on the boundaries of each study station realm, a grid with an image element scale of 25 × 25 m was created on the ArcGIS platform as the spatial statistical unit for the regression analysis in this paper (which is consistent with the collection scale of our population activity data). Then, the dependent variable (population activity) and the explanatory variable (functional facility) were connected to this grid for OLS regression analysis. However, variance inflation factors (VIFs) also needed to be calculated to avoid multicollinearity among the explanatory variables of the model and to exclude variables with VIF values greater than 10 [
63]. As shown in
Table 5, the VIF values of each explanatory variable in the four research station realms are less than 10, indicating that there is no cointegration among the explanatory variables and that they are suitable to form a regression model.
As shown in
Table 6, the explanatory variables in the Xing Zheng Zhong Xin station realm (hybrid type)—medical and health POIs (V4), catering POIs (V6), living POIs (V7), shopping POIs (V8), and transportation POIs (V9)—have a positive relationship with population activity during almost all hours of the working days and off days. This indicates that the higher number of these functional facilities is also more attractive for population activities. In particular, the catering POIs and living POIs have a significant positive correlation. The entertainment POI (V1) has a negative relationship with population activity most of the time, indicating that an increase in the number of such functional facilities does not attract more activity. In addition, the financial service POI (V2) and government agency POI (V3) have a positive relationship with population activities on working days, while the off days have more of a negative relationship.
The OLS regression results for the Long Shou Yuan station realm (residential type) are shown in
Table 7. Among them, the financial POIs (V2), accommodation POIs (V5), and catering POIs (V6) show a significant positive relationship on both working days and off days, indicating that these three types of facilities are highly attractive to population activities most of the time. The medical and health POIs (V4) have a negative effect on population activity most of the time. In addition, the entertainment POIs (V1) have a positive relationship on working days most of the time compared to off days, indicating that the entertainment POIs (V1) are more attractive to population activity on working days than off days.
As shown in
Table 8, the medical and health POIs (V4), catering POIs (V6), shopping POIs (V8), and transportation POIs (V9) in the Bei Da Jie station realm (commercial service type) have a positive relationship with population activity during most of the working days and off days. In particular, the medical and health POIs (V4) have a significant positive impact on all time periods, due to the presence of several large or comprehensive public hospitals within the Bei Da Jie station realm, and the high accessibility of the metro makes it even more convenient for residents from other areas to come here for medical care; this indicates that quality medical resources can be more attractive to population activity. The financial POIs (V2) and the government institution POIs (V3) have a positive relationship with population activity only during specific times of the working day. It is worth mentioning that the accommodation services POIs (V5) have a positive relationship with population activity during off days, while it is mostly negative during working days. Combined with the results of the kernel density analysis of population activity, it can be found that the Bei Da Jie station realm may be visited by more outsiders on off days, thus enhancing the attractiveness of the accommodation service POIs for population activity.
As shown in
Table 9, most of functional facilities in the Wei Yi Jie station realm (public service type) have a positive relationship with population activity, except for the entertainment POIs (V1) and government institution POIs (V3). In addition, through the field survey, we found that there are several higher education schools and educational training institutions in the Wei Yi Jie Station realm, leading to the existence of many inexpensive rental rooms, restaurants, school supply stores, courier stations, and other facilities that are closely related to campus life within the area. Therefore, accommodation POIs (V5), catering POIs (V6) and living service POIs (V7) have a strong attraction to population activities most of the time.
In terms of overall spatial relationships, the “Accommodation POIs (V5)”, “Catering POIs (V6)”, “Living POIs (V7)”, “Shopping POIs (V8)”, and “Transportation POIs (V9)” in each research station realm have positive relationships with population activity most of the time. In particular, the catering service POIs have a strong positive attraction to population activities, the reason for which can be traced back to the long-standing Xi’an food culture, where restaurants have become one of the most important places for social activities for citizens. On the other hand, the transportation facility POIs generally have a significant positive relationship with population activity from 12:00 p.m. to 6:00 p.m., indicating that people have high utilization of public transportation during this time.
In addition, from a temporal perspective, the entertainment services POIs (V1) usually have a positive relationship with the population activity only in the evening. The financial services POIs (V2) generally have a positive impact on working days (excluding BDJ Station). Since the financial institutions are usually closed on off days, they have a negative impact for most of the day. Government agency POIs (V3) are generally only open to their staff, so there is a significant negative correlation for population activity within each research station realm for most of the time, whether it is a working day or an off day.
It is worth mentioning that medical and health facilities (V4) have a positive correlation with population activity most of the time, except for LSY Station. Several provincial or municipal public hospitals (e.g., Xi’an Central Hospital, the Second Affiliated Hospital of Xi’an Jiaotong University, etc.) are concentrated within the BDJ station realm, and the medical and health facilities (V4) have a significant positive attraction for population activities almost all day long. On the contrary, although there are several private clinics and community medical service centers in the LSY station realm, the POI of medical and health facilities has a negative correlation with population activity most of the time due to the lack of large public hospitals.
5. Discussion
This study investigated the spatial features of the four study station realms of Xi’an Metro Line 2 based on positioning data from Easygo applet and POI data from AutoNavi Map and further analyzed the dynamic influence relationships of functional facilities with population activities in the station realm under different time periods through OLS models.
The results of the spatial point pattern identification show the following:
There is a clustering of population activities within the metro station realm, especially in the areas within 500 m of the station. Consistent findings were revealed by several studies in different cities [
64,
65,
66]. In addition, the peak periods of population activity in the station realm are between 7:00 and 9:00 a.m. and between 6:00 and 8:00 p.m. on working days. On the off days, the population activity is more active during the time period from 11:00 a.m. to 5:00 p.m. In Xi’an, although the proportion of public transportation trips is increasing year by year, with the metro ratio exceeding 50% in 2021 [
5], urban traffic congestion is still a serious problem. Therefore, identifying changes in the active time of population activities helps city managers accurately and effectively optimize the operation time and structure of public transportation, thus alleviating problems such as traffic congestion in the city.
In this paper, we found that the three types of functional facilities, namely shopping services, catering services, and living services, in each study station realm exhibit the feature that their density distribution is higher when they are closer to the station. The density distribution features of other functional facilities in different types of station realms vary greatly. In fact, the development of rail transit in China started late compared to developed countries, and stations are often built in already developed urban environments [
67], which also leads to the spatial development of station realms being restricted by public policies, transportation supply, station location, land use structure, etc. [
68,
69], making it difficult to form a high-density, mixed functional structures around stations as advocated by TOD and station-city integration models. In this paper, we summarize the spatial distribution features of various functional facilities in different types of station realms. However, the realization of diverse urban functions clustered around the station with the support of national policies and high-quality urban renewal, thus promoting comprehensive, composite, and efficient development of the station realm, remains a complex issue. The relationship between rail transit and urban functional facilities needs further research.
It is worth mentioning that although POI data cannot record “informal activity” facilities that are mobile and self-organized, using real-time location data and field surveys, we found the presence of night markets in some station realms, which cause population clustering at night. The existing literature shows that night markets have an impact on the economic development and quality of life of residents in a region [
70]. On the one hand, night markets can provide a means of subsistence for some of the unemployed [
71] and can also serve as a vehicle for a region’s unique culture and customs [
72], contributing to economic development. On the other hand, the excessive lighting, noise, and waste generated by night market activities can have a negative impact on the health of suppliers as well as local residents [
73,
74]. Therefore, the development of a night market economy in station realms with a base of night market activities can be effective in enhancing the vitality of the area, but the resulting public safety issues should be of concern to city managers and urban researchers.
The results of the regression analysis showed the following:
The POI variables, such as catering, living, shopping, and transportation, showed a high positive attraction for population activities within the station realm. This indicates that these types of services and facilities will be more likely to attract activities in this area, which is also consistent with the results of the current related studies [
37,
75]. In particular, catering service POIs and population activity have a significant positive correlation in all four research station realms, which indicates that catering strongly attracts population activity. For city managers, focusing on developing the local food culture can effectively improve the vitality of station realms. In addition, the study found that medical and health care POIs have a more significant positive correlation with population activity in station realms with a relatively high concentration of general or public hospitals. Especially under the influence of the COVID-19 pandemic, the demand for medical resource capacity and medical service facilities has greatly increased, making the allocation of quality medical and health resources more attractive for population activity. This finding helps city managers to better optimize the allocation of public health care resources in station realms.
From a dynamic perspective, there are different coefficients of variation on weekdays and off days for all POI categories in different types of station realms. For example, the financial services POIs, government POIs, and accommodation services POIs in the Xing Zheng Zhong Xin station realm have a positive relationship with population activity only on working days. In the Bei Da Jie station realm, the accommodation service POI is more attractive to demographic activities on off days than on working days. In addition, transportation facilities are significantly more positively related to population activity on working days than on off days. This suggests that on working days people are more likely to take public transportation to work, school, and other daily activities, while on off days people may be more likely to rest at home or choose to travel by private car. In conclusion, the analytical framework developed in this paper basically describes the relationship between various types of POI facilities and population activities, and these research results can help city managers understand the needs of people’s daily activities in different types of station realms from a dynamic perspective, so as to improve the allocation of urban public resources more effectively.
In terms of data, unlike exploring population distribution features by using big data such as mobile phone signaling, social media, and Wi-Fi signaling technologies [
25,
26,
27,
28,
29,
30,
31,
32], this paper uses the higher precision and better timeliness of Easygo positioning data, which provides a reliable data source for researchers to identify the dynamic distribution features of population activities from a microscopic perspective. Existing studies have shown the importance of POI data for studying the spatial and temporal distribution features and mechanisms of urban population density [
76]. Therefore, the real-time positioning data of Easygo and the POI data of AutoNavi Map used in this paper help to enable researchers to obtain more convincing analysis results.
However, there are still some limitations in this study, which may be a feasible direction for improvement. First, due to the lack of staff and time constraints, only four stations in Xi’an Metro Line 2 were selected for this study to conduct specific investigation and analysis. If all stations of Metro Line 2 could be investigated and studied, the spatial features of rail transit station realms would be more completely understood. In addition, the survey data in the study are not comprehensive enough to obtain data on the land use status and population flow before the completion of Metro Line 2. If the spatial features of the station realm before and after the completion of the metro can be compared and analyzed in the future, the practical application value of the research results will be further enhanced to support the development of metro station realms.