Spatial-Temporal Analysis of Human Dynamics on Urban Land Use Patterns Using Social Media Data by Gender
Abstract
:1. Introduction
2. Materials and Methods
2.1. Overview
2.2. Data Collection
2.3. Methods
2.3.1. Skewness and Kurtosis
2.3.2. Spatial-Temporal Aggregation Granularity
2.3.3. Local Moran’s I
3. Results
3.1. Analysis of the Intensity of User Activity
3.1.1. Statistical Analysis
- (1)
- We utilized SPSS version 22.0 to calculate the descriptive statistics and the histogram of the number of users during the month. Of the total of 56,000 microblog users, 56% were registered as female, which means the sex ratio () was 78.6. As shown in Table 1 and Figure 3a, the maximum number of users was 4225 on 14 June, and the minimum was 248 on 1 June. The skewness was −1.355, which was less than 0, indicating that the mass of the distribution was concentrated on the right of Figure 3b and the left tail was longer. Therefore, the distribution was left-skewed. The kurtosis was 2.03, which was greater than 0, indicating that the distribution was leptokurtic.
- (2)
- In Table 2, the minimum and maximum total numbers of users () were 0 and 2649 in the grids, respectively. The maximum numbers of female users () and male users () were 1837 and 828 in the grids, respectively.The maximum was observed in the dormitories of Beijing Language and Culture University (Grid ID 1284, DOBLCU), indicating that the users here were primarily students (Table 2). The sex ratio () of Grid 1284 was 44.2, meaning that female users outnumbered male users by a ratio greater than 2:1. The second highest was in Yongtaixili (Grid ID 4652, YTXL), which is one of the largest communities in the Qinghe District of Beijing, established in 1996. This mature community has a high concentration of service-type businesses within and around the community that causes a concentrated area with a large and active population. The third highest was in Yiyuan of Anhuibeili (Grid ID 1904, YYAH), which has more than 4000 residents and conditions similar to those of YTXL. The of YYAH was the lowest in the top five grids, meaning that the proportion of female users was the highest. The fourth highest was at the Peking University Health Science Center School of Nursing (Grid ID 253, PUHSCSN), which is a workplace where the users primarily consisted of students and staff. The was 119.3, indicating that male users outnumbered females. The fifth highest was in the Olympic Torch Square (Grid ID 1568, OTS), which is in the northern National Stadium area; during and after the 2008 Olympic Games in Beijing, the National Stadium became a landmark building in Beijing attracting large numbers of visitors. Its was 53.3, indicating that female users outnumbered males by a ratio greater than 2:1. Grids with no users were mainly in the center of Dongxiaokou Forest Park (DFP), which has large areas without pedestrian access or trails, or that are covered by water. Figure 4 shows the result of grid transformation of the total number of Sina Location Microblog users in June 2014.
- (3)
- In Table 3 and Table 4, the maximum total numbers of users on weekdays () and weekends ( were 1613 and 1036, respectively, with a minimum total number of 0. The maximum average numbers of users on weekdays () and weekends () were 77 and 115, respectively.The top seven grids contained residential, workplace, and public land (Table 3 and Table 4). The grid with the highest total and average numbers of users on weekdays and weekends was DOBLCU (Grid ID 1284), with a of 577 users greater than the , while the was 38 users fewer than the . The sex ratio () of DOBLCU increased on weekends; however, there were still twice as many female users as males on weekends. DOBUPT is also residential land; however, its s were higher than those of DOBLCU, meaning the proportion of male users was higher in DOBUPT than that in DOBLCU. Among YTXL, YYAH, HYJB, and OTS, the first three places represent residential land with a large and concentrated population. Their s were both lower than 100. On weekdays, the of YYAH was the lowest, showing that there were three times as many female users as male. On weekends, the s of these four places showed that there were approximately twice as many female users as male. OTS is public land, so the number of people here on weekends was usually larger than that of weekdays and the s were both less than 100. The and of PUHSCSN were greater than the and of PUHSCSN. Moreover, the s were more than 100 and the was higher on weekends than on weekdays. These quantitative characteristics of the users were opposite when compared with the quantitative characteristics of residential and public land. The grids with no users were primarily in the center of Dongxiaokou Forest Park, where public access is limited due to a lack of sidewalks or due to areas covered by water.
3.1.2. Local Moran’s I
- The Moran’s I is 0.0145, and its z-value is 2.8957, meaning that the spatial distribution of the local indicator has a positive spatial correlation.
- (1)
- High–High: this represents a grid and its surrounding grids with a relatively large difference between and , meaning that these grids have more weekday than weekend users, and the spatial differentiation among them is small. High–High grids are mainly distributed in residential land, education and scientific research land, and in areas with some companies (Figure 5). For example, the position shown by the number 1 in Figure 5 contains the Peking University Health Science Center and the Beijing University of Science and Technology. These grids and their surrounding grids have more weekday than weekend users; thus, they are distribution areas with positive values.
- (2)
- Low–Low: this represents a grid and its surrounding grids with relatively little difference between and , meaning that these grids have fewer weekday than weekend users, and the spatial differentiation among them is small. Low–Low grids are mainly distributed in public green land (Figure 5). For example, the position shown by the number 3 in Figure 5 contains Olympic parks and forest parks. These grids and their surrounding grids have fewer weekday than weekend users; thus, they have distribution areas with negative values.
- (3)
- Low–High: this represents a grid with relatively little difference between and , while its surrounding grids have a relatively greater difference between and , and the spatial differentiation among them is large. Low–High grids are randomly distributed around High–High grids (Figure 5). For example, the position shown by the number 2 in Figure 5 is a recreational park near residential land. These grids have fewer weekday than weekend users.
- (4)
- High–Low: this represents a grid with a relatively large difference between and , while its surrounding grids have relatively little difference between and , and the spatial differentiation among them is large. High–Low grids are mainly distributed around Low–Low grids (Figure 5). For example, the position shown by the number 4 in Figure 5 is a subway station. These grids have more weekday than weekend users, and fewer weekday than weekend users in the areas around these grids.
- The Moran’s I of the difference between the number of female users and male users on weekdays is 0.0346, and its z-value is 5.9051; the Moran’s I of the difference between the number of female users and male users on weekends is 0.0269, and its z-value is 4.7393, meaning that the spatial distribution of the local indicators has a positive spatial correlation.
- (1)
- High–High: this represents a grid and its surrounding grids with relatively large differences between the number of female users and male users, meaning that these grids have more female users than male users, and the spatial differentiation among them is small. High–High grids are mainly distributed at education and scientific research land and residential land. For example, the zone shown by the letter a in Figure 6A,B contains several universities and some communities. These grids and their surrounding grids have more female users than male users; thus, they are distribution areas with positive values.
- (2)
- Low–Low: this represents a grid and its surrounding grids with relatively little differences between the number of female users and male users, meaning that these grids have fewer female users than male users, and the spatial differentiation among them is small. Low–Low grids are mainly distributed at sports centers and workplaces. For example, the positions shown by the letter b contains the National Olympic Sports Center comprehensive training center, swimming pool, and tennis hall as well as several sports management centers, and the position shown by the letter c contains an office building. Additionally, the b and c zones show an obvious change between Figure 6A and Figure 6B, respectively, meaning that male users outnumber female users at playgrounds and workplaces on weekdays. These grids and their surrounding grids have fewer female users than male users; thus, they have distribution areas with negative values.
- (3)
- Low–High: this represents a grid with relatively little difference between the numbers of female users and male users, and its surrounding grids have relatively greater differences between the number of female users and male users, and the spatial differentiation among them is large. Low–High grids are distributed around High–High grids. For example, the position shown by the letter d contains a shopping center mall and Olympic park, and Figure 6B shows that there are more Low–High grids and High–High grids on weekends than on weekdays in the d zone, meaning the number of female users and male users increases over the weekends. In particular, male users obviously outnumber female users in several grids.
- (4)
- High–Low: this represents a grid with a relatively large difference between the numbers of female users and male users, and its surrounding grids have relatively little differences between the number of female users and male users, and the spatial differentiation among them is large. High–Low grids are mainly distributed around Low–Low grids. These grids have more female users than male users, as well as fewer male users than female users in areas around these grids.
3.2. The Relationship between the Activity of Users and Land Use Types
4. Discussion and Conclusions
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
References
- Liu, Y.; Xiao, Y.; Gao, S. A Review of Human Mobility Research Based on Location Aware Devices. Geogr. Geo-Inf. Sci. 2011, 27, 8–13. [Google Scholar]
- Li, H. Population information management and application analysis under the era of big data. Mod. Manag. Sci. 2014, 10, 111–113. (In Chinese) [Google Scholar]
- Yang, W.; Ai, T. POI Information Enhancement Using Crowdsourcing Vehicle Trace Data and Social Media Data: A Case Study of Gas Station. ISPRS Int. J. Geo-Inf. 2018, 7, 178. [Google Scholar] [CrossRef]
- Liu, Y.; Liu, X.; Gao, S.; Gong, L.; Kang, C.; Zhi, Y.; Chi, G.; Shi, L. Social Sensing: A New Approach to Understanding Our Socioeconomic Environments. Ann. Assoc. Am. Geogr. 2015, 105, 512–530. [Google Scholar] [CrossRef]
- Zhou, X.; Yue, Y.; Yeh, A.G.O. Uncertainty in Spatial Analysis of Dynamic Data-Identifying City Center. Geomat. Inf. Sci. Wuhan Univ. 2014, 39, 701–705. [Google Scholar]
- Long, Y.; Sun, L.J.; Tao, S. A Review of Urban Studies Based on Transit Smart Card Data. Urban Plan. Forum. 2015, 3, 70–77. [Google Scholar]
- Long, Y.; Zhang, Y.; Cui, C.Y. Identifying commuting pattern of Beijing using bus smart card data. Acta Geogr. Sin. 2012, 67, 1339–1352. [Google Scholar]
- Wu, J.S.; Li, B.; Huang, X.L. Spatio-temporal dynamics and driving mechanisms of resident trip in small cities. J. Geo-Inf. Sci. 2017, 19, 176–184. (In Chinese) [Google Scholar]
- Pan, G.; Qi, G.; Wu, Z.; Zhang, D.; Li, S. Land-Use Classification Using Taxi GPS Traces. IEEE Trans. Intell. Transp. Syst. 2013, 14, 113–123. [Google Scholar] [CrossRef]
- Medina, S.A.O.; Erath, A. Estimating Dynamic Workplace Capacities by Means of Public Transport Smart Card Data and Household Travel Survey in Singapore. Transp. Res. Rec. J. Transp. Res. Board 2013, 2344, 20–30. [Google Scholar] [CrossRef]
- Zheng, Y.; Zhang, L.; Xie, X.; Ma, W.Y. Mining correlation between locations using human location history . In Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, Seattle, WA, USA, 4–6 November 2009; pp. 472–475. [Google Scholar]
- Ye, Y.; Zheng, Y.; Chen, Y.; Feng, J.; Xie, X. Mining Individual Life Pattern Based on Location History. In Proceedings of the Tenth International Conference on Mobile Data Management: Systems, Services and Middleware, Taipei, Taiwan, 18–20 May 2009. [Google Scholar]
- Hu, Q.W.; Wang, M.; Li, Q.Q. Urban Hotspot and Commercial Area Exploration with Check-in Data. Acta Geod. Cartogr. Sin. 2014, 43, 314–321. [Google Scholar]
- Jiao, Y.; Liu, W.; Shi, E. Research on the spatial distribution and mechanism of commercial activities of Guangzhou based upon multi-source POI data. Urban Insight 2015, 6, 11. [Google Scholar]
- Chen, W.; Liu, L.; Liang, Y. Retail center recognition and spatial aggregating feature analysis of retail formats in Guangzhou based on POI data. Geogr. Res. 2016, 35, 703–716. [Google Scholar]
- Leng, B.R.; Yu, Y.; Huang, D.Q.; Yi, Z.H. Big data Based Job-residence relation in Chongqing Metropolitan area. Planners 2015, 5, 17. [Google Scholar]
- Comito, C.; Falcone, D.; Talia, D. Mining human mobility patterns from social geo-tagged data. Pervasive Mob. Comput. 2016, 33, 91–107. [Google Scholar] [CrossRef]
- Altomare, A.; Cesario, E.; Comito, C.; Marozzo, F.; Talia, D. Trajectory Pattern Mining for Urban Computing in the Cloud. IEEE Trans. Parallel Distrib. Syst. 2017, 28, 586–599. [Google Scholar] [CrossRef]
- Gustave, L.B. The Crowd: A Study of the Popular Mind; Cosimo Classics: New York, NY, USA, 2006. [Google Scholar]
- Li, X.H.; Zhao, Y.D. Analysis of social network user behavior. Comput. Era 2017, 6, 29–35. [Google Scholar]
- He, Y.; Zhang, Y. On the user characteristics of different topics on Sina Microblog. J. Intell. 2016, 35, 107–110. (In Chinese) [Google Scholar]
- Olympic Village Subdistrict Office. Olympic Village Subdistrict Office Basic Situation. Available online: http://aycdq.bjchy.gov.cn/web/620/index.html (accessed on 12 April 2017).
- Mao, X.; Xu, R.R.; Li, X.S.; Wang, Y.; Li, C.; Zeng, B.; He, Y.; Liu, J. Fine Grid Dynamic Features of Population Distribution in Shenzhen. Acta Geogr. Sin. 2010, 65, 443–453. [Google Scholar]
- Lu, A.M.; Li, C.M.; Lin, Z.J.; Shi, W.Z. Spatial Continuous Surface Model of Population Density. Acta Geod. Cartogr. Sin. 2003, 32, 344–348. [Google Scholar]
- Qi, W.; Li, Y.; Liu, S.H.; Gao, X.L.; Zhao, M.F. Estimation of urban population at daytime and nighttime and analyses of their spatial pattern: A case study of Haidian District, Beijing. Acta Geogr. Sin. 2013, 68, 1344–1356. [Google Scholar]
- Chen, R.; Wang, H.Q.; Meng, B.; Gui, L.; Liu, Y. Urban Spatial analysis and visualization based on LBSN Check-in data. Geomat. World 2017, 24, 85–91. [Google Scholar]
- Lei, C.C.; Zhang, A.; Qi, Q.W.; Su, H.M. Grid-based location Microblog data fetching and human information extraction. Sci. Surv. Mapp. 2017, 42, 125–129. (In Chinese) [Google Scholar]
- Zhang, Z.A.; Huang, Z.F.; Jin, C.; Guan, J.; Cao, F.-D. Research on spatial-temporal characteristics of scenic tourist activity based on Sina Microblog: A case study of Nanjing Zhongshan Mountain National Park. Geogr. Geo-Inf. Sci. 2015, 31, 121–126. [Google Scholar]
Statistics | Maximum | Minimum | Std. Dev | Skewness | Kurtosis | Percentiles | ||
---|---|---|---|---|---|---|---|---|
25 | 50 | 75 | ||||||
Value | 4225 | 248 | 886.207 | −1.355 | 2.03 | 2556.25 | 2852 | 3251.25 |
Grid ID | Grid Location | Land Use | ||||
---|---|---|---|---|---|---|
1284 | The dormitories of Beijing Language and Culture University (DOBLCU) | Residential land | 2649 | 1837 | 812 | 44.2 |
4652 | Yongtaixili (YTXL) | Residential land | 2048 | 1220 | 828 | 67.9 |
1904 | Yiyuan of Anhuibeili (YYAH) | Residential land | 1768 | 1307 | 461 | 35.3 |
253 | Peking University Health Science Center School of Nursing (PUHSCSN) | Workplace | 1147 | 523 | 624 | 119.3 |
1568 | Olympic Torch Square (OTS) | Public land | 1122 | 732 | 390 | 53.3 |
… | … | … | ||||
6397 | Dongxiaokou Forest Park (DFP) | Green space land | 0 | 0 | 0 | - |
6398 | Dongxiaokou Forest Park (DFP) | Green space land | 0 | 0 | 0 | - |
6399 | Dongxiaokou Forest Park (DFP) | Green space land | 0 | 0 | 0 | - |
Grid ID | Grid Location | Land Use | |||||
---|---|---|---|---|---|---|---|
1284 | The dormitories of Beijing Language and Culture University (DOBLCU) | Residential land | 1613 | 1173 | 440 | 77 | 37.5 |
4652 | Yongtaixili (YTXL) | Residential land | 1221 | 700 | 521 | 58 | 74.4 |
1904 | Yiyuan of Anhuibeili (YYAH) | Residential land | 1173 | 916 | 257 | 56 | 28.1 |
253 | Peking University Health Science Center School of Nursing (PUHSCSN) | Workplace | 825 | 377 | 448 | 39 | 118.8 |
1543 | The dormitories of Beijing University of Posts and Telecommunications in Erlizhuang (DOBUPT) | Residential land | 718 | 406 | 312 | 34 | 76.8 |
1568 | Olympic Torch Square (OTS) | Public land | 653 | 441 | 212 | 31 | 48.1 |
1334 | Huiyuan Apartment J block (HYJB) | Residential land | 247 | 149 | 98 | 12 | 65.8 |
Grid ID | Grid Location | Land Use | |||||
---|---|---|---|---|---|---|---|
1284 | The dormitories of Beijing Language and Culture University (DOBLCU) | Residential land | 1036 | 664 | 372 | 115 | 56 |
4652 | Yongtaixili (YTXL) | Residential land | 827 | 520 | 307 | 92 | 59 |
1904 | Yiyuan of Anhuibeili (YYAH) | Residential land | 595 | 391 | 204 | 66 | 52.2 |
1334 | Huiyuan apartment J block (HYJB) | Residential land | 545 | 362 | 183 | 61 | 50.6 |
1568 | Olympic Torch Square (OTS) | Public land | 469 | 291 | 178 | 52 | 61.2 |
1543 | The dormitories of Beijing University of Posts and Telecommunications in Erlizhuang (DOBUPT) | Residential land | 433 | 236 | 197 | 48 | 83.5 |
253 | Peking University Health Science Center School of Nursing (PUHSCSN) | Workplace | 322 | 146 | 176 | 36 | 120.5 |
© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Lei, C.; Zhang, A.; Qi, Q.; Su, H.; Wang, J. Spatial-Temporal Analysis of Human Dynamics on Urban Land Use Patterns Using Social Media Data by Gender. ISPRS Int. J. Geo-Inf. 2018, 7, 358. https://doi.org/10.3390/ijgi7090358
Lei C, Zhang A, Qi Q, Su H, Wang J. Spatial-Temporal Analysis of Human Dynamics on Urban Land Use Patterns Using Social Media Data by Gender. ISPRS International Journal of Geo-Information. 2018; 7(9):358. https://doi.org/10.3390/ijgi7090358
Chicago/Turabian StyleLei, Chengcheng, An Zhang, Qingwen Qi, Huimin Su, and Jianghao Wang. 2018. "Spatial-Temporal Analysis of Human Dynamics on Urban Land Use Patterns Using Social Media Data by Gender" ISPRS International Journal of Geo-Information 7, no. 9: 358. https://doi.org/10.3390/ijgi7090358
APA StyleLei, C., Zhang, A., Qi, Q., Su, H., & Wang, J. (2018). Spatial-Temporal Analysis of Human Dynamics on Urban Land Use Patterns Using Social Media Data by Gender. ISPRS International Journal of Geo-Information, 7(9), 358. https://doi.org/10.3390/ijgi7090358