1. Introduction
Urban agglomeration can be defined as a highly developed spatial form of integrated cities. Urban agglomeration can happen due to continuous ribbon development along the main transport routes [
1,
2]. Further, urban agglomeration can be considered a contiguous build-up area shaped by one core city or several adjacent cities along the transport routes [
3]. These adjacent cities share industries, infrastructure, housing, and land uses, attracting more and more daily movements of people. Simply, urban agglomeration takes place when the relationships among cities start to the corporate which each other than competing with each other—e.g., Greater Mumbai and Delhi in India [
4,
5,
6].
Urban agglomeration takes place because of unplanned urban growth. In addition, urban growth in developing countries is mostly encouraged by biased national policies, such as a centralized urban development paradigm implying the tendency of agglomerating all key facilities in the cities and metropolitan towns [
7]. Compared to urban areas, the conflict between rapid urbanization and environmental protection in urban agglomeration areas is more significant and serious. Urban agglomeration especially creates more complex environmental issues, such as solid waste management, an increase in air and water pollution, and economic issues [
8]. In the long term, it could lead to significant regional disparities, leading to serious socioeconomic inequalities and even social unrest [
9]. This situation ultimately deviates cities and their societies from being smart and sustainable [
10].
The concept of urban agglomeration is studied using different concepts, i.e., local coupling and telecoupling concepts [
11]; theories, i.e., field strength theory and cluster theory [
12,
13]; models, i.e., iCN Model [
14]; criteria, i.e., population density, development pressure [
15]; and indices, i.e., an index developed by [
16]. Most of these studies have used data types directly related to defining the spatial form of a city, such as a road and building density, which will not change for a long period [
15,
17]. Other studies used remote sensing and image interpretation techniques to demarcate urban footprints based on the land cover [
18,
19]. Unlike the urban form, urban agglomeration needs to be demarcated through human-centric data sources as agglomeration occurs with people’s movements [
20,
21].
According to [
6], a functional urban region and urban agglomeration are highly interlinked functional areas. These provide complementary functions of different levels at different places to supply the population with all necessities—ranging from residential functions to workplaces to education, shopping, and using various services. Making use of these functions requires communication or traveling between the places where those functions are provided.
To address this issue, the study employs a multisource geospatial—location-specific, open-source big data fusion approach—which seeks to combine information from multiple sources and sensors through various applications. This is to achieve decision-supporting inferences that cannot be achieved through a single source or sensor. Accordingly, this study attempts to use multiple data sources related to urban form—data that would not change often, and function—data that changes within a shorter period, to understand the agglomeration footprints.
The multisource geospatial big data fusion approach uses different location-specific big data types collected from different sources [
22]. This is to integrate such big data types to understand a considered phenomenon [
23]. Recently, a study [
24] used a multisource big data fusion approach to evaluate the polycentric urban form of cities. For that, they have used Night Time Light (NTL) data, Point of Interest (PoI) data, and Tencent Migration Data (TMG) as the big data sources. Further, another study [
20] used Weibo’s Application Programming Interface (API) to obtain data, an online service that has emerged in China in recent years. Therein, the authors identified urban agglomeration trends following the connection strength and user numbers, which the urban form-related parameters—population and road density—were used to identify urban agglomeration were ignored.
Against this backdrop, as an emerging area of research, this study examines the applicability of different location-specific big data sources as parameters to demarcate urban agglomeration patterns. Accordingly, this study aims to explore the use of the multisource geospatial big data fusion approach as a novel method to demarcate urban agglomeration footprint in the case of the Southern Coastal Belt of Sri Lanka.
4. Results
Urban agglomeration footprints were first identified using urban form-related parameters and, secondly, using urban function-related parameters. Thirdly, a holistic image was derived by developing a composite map using the urban form and urban function-related parameters (
Figure 4).
According to the administrative activities, only one main urban center was identified around the Colombo city area. This is mainly due to the location of most of the condominium administrative complexes in Colombo, i.e., “Nila Piyasa—Colombo” Government Quarters, Sethsiripaya Stage I and II—accommodate more than 25 government offices, Suhurupaya, and Isurupaya. No other city centers in the study area were identified as main urban centers, which implies a higher agglomeration of administrative activities to one place. Unlikely, six main urban centers: Colombo, Dehiwala, Panadura, Kalutara, Aluthgama, and Galle city centers, were identified according to the distribution of schools and higher education centers. Most significantly, the area from Colombo to Panadura acts as an urban region with multiple nuclei: Colombo, Dehiwala, and Panadura. Furthermore, out of the six main urban centers, five are in the Western Province of the country, which highlights a regional disparity in terms of the distribution of education facilities.
Only two main urban centers were identified according to the distribution of health activities, and these two centers were also identified in Colombo and Dehiwala. Four suburban centers were identified in Aluthgama, Ambalangoda, Galle, and Matara. The commercial building distribution also identified Colombo and Galle as the main urban centers, and Panadura, Beruwala, Hikkaduwa, and Matara were identified as suburban centers. Basically, Colombo and Galle act as district capitals where most of the commercial activities, such as shopping centers, shopping malls and groceries, are located at.
The identified tourism main center was in Mirissa, and the only suburban center identified was in Unawatuna. Unlike the administrative, education and health activities, new suburban centers such as Beruwala, Hikkaduwa, and Unawatuna have emerged for commercial and tourism-related activities. This is mainly due to the location of hotels, motels, homestays, and tourism-oriented markets along the study area. These suburban centers also act as world-famous tourist destinations for beach tourism, i.e., Ventura beach, Moragalla beach of Beruwala, Barberyn Island Lighthouse, and Coral Garden of Hikkaduwa. Still, there is no significant expansion of urban centers according to the distribution of tourism activities. This shows that in terms of tourist activities, all the above centers mainly try to serve as individual service centers without networking and connecting with the adjacent centers.
Similarly, in terms of recreational activities, Colombo and Kalutara are identified as the main urban centers. Colombo occupies many community gathering parks compared to other areas, i.e., Galleface, Viharamahadevi Park, Torrington Park, Gangarama Park, Diyatha Uyana, Galle Face, Urban Forest Park, etc. Kalutara is important in recreational activities because of the widespread Calido beach.
While Colombo, the commercial capital of Sri Lanka, acts as the main urban center for most parameters, Koggala and Matara show significant prominence by acting as the main urban centers with higher industrial agglomeration. This is mainly due to the location of the Export Processing Zone in Koggala. In addition, Matara accommodates many factory stations, such as Matara Polythene Center, Elcardo Industries, Freelan factories, Nippon Paint, Jay Jay Mills, etc. According to the results of banking point density values, Colombo was identified as a main urban center. Moratuwa, Kalutara, Galle and Matara were identified as suburban centers. The population density and the road density also identified Colombo and its surrounding area as the main urban center, and the road density identified Galle as a suburban center.
In terms of residential activities, three main urban centers were identified. They are: Moratuwa, Panadura and Galle. However, Colombo was classified as a third-grade urban center. However, Colombo accommodates many luxury apartments such as Prime Residencies, The Grand Ward Place, Capital Twin peaks, and Altaire. The areas such as Panadura and Galle accommodate a higher number of middle-income residential apartments, including high-rise and low-rise residences. Especially, Moratuwa accommodates more low-income residences.
According to the transportation networks, main urban centers were identified in Colombo, Dehiwala, Moratuwa and Panadura, and the identified suburban patches were Kalutara and Ambalangoda. The composite map of all those mentioned above urban form-related parameters is provided in
Figure 5. The identified urban centers and the urban expansion types are presented in
Table 7.
As of
Figure 5, three main agglomeration footprints were identified. They are Colombo’s main urban center-based urban agglomeration footprint (12.5 km
2), Galle’s suburban center-based urban agglomeration footprint (10.34 km
2), and Matara’s suburban center-based urban agglomeration footprint (4.64 km
2). Among them, Colombo’s main urban center-based urban agglomeration region is prominent and experiences a widespread agglomeration. This area can be identified as an urban region with multiple nuclei from Colombo to Kalutara. It consists of all kinds of urban centers. For instance, the main urban region spread along the coastal road from Colombo to Kalutara connects third-grade urban centers, i.e., Moratuwa, Panadura, Kalutara, and fourth-grade urban centers, i.e., Wadduwa, Ambalangoda. Further, the multiple nuclei urban patches in the region are experiencing an infilling urban expansion by converting non-urban areas from green areas into urban built-up areas.
The next urban agglomeration footprint emerged, centering the Galle suburban center. In between the Colombo main urban center-based urban agglomeration footprint and the Galle suburban center-based urban agglomeration footprint, several isolated urban centers, such as Ambalangoda and Hikkaduwa, can be identified. Such centers act as newly emerged urban centers with no spatial relationship to the surrounding urban centers, which experience an outlying urban expansion type. Galle’s suburban center-based urban agglomeration footprint is surrounded by third- and fourth-grade urban centers.
Secondly, urban agglomeration footprints were identified using urban function-related parameters of NTL Data, Point of Interest Data, and Vehicle Speed.
Figure 6 illustrates the maps related to the urban function-related parameters.
As
Figure 6a shows, only Galle and Matara were identified as the main urban centers according to the analyzed social media data. Most significantly, Colombo was identified as a suburban center, and Panadura was identified as a third-grade center. The other important thing is the emergence of Ambalangoda, Aluthgama and Hikkaduwa as outlying urban centers with fourth-grade urban characteristics. Although the cities of the Western Province, such as Colombo, Panadura, and Kalutara, acted as the main urban centers, as per the circulation of location-specific social media data, Southern Province cities, such as Galle and Matara, act as the main urban centers. The locations of famous tourist destinations in and around Galle and Matara cities can be identified as a reason for the surge in the higher number of social media data around these cities.
The NTL data has only identified Colombo as a main urban center. This urban agglomeration footprint expands towards Panadura, creating the largest footprint among all the footprints analyzed so far. This again shows the higher agglomeration concentrating on the country’s commercial capital. Galle was the only suburban center identified through the NTL data.
According to the travel speed, three main urban centers were identified. They are the Colombo, Galle, and Matara main urban centers. Colombo’s main urban center extends from Colombo to Dehiwala, performing a linear agglomeration along the A2 coastal road, creating an urban region from Colombo to Kalutara where Colombo and Panadura act as the main urban centers.
Figure 7 shows the composite output of the considered parameters under urban functions.
Table 8 lists the identified urban centers in detail with the expansion type through the composite map of urban function.
As of
Figure 7, four main agglomeration footprints were identified. They are the Colombo, Matara, Galle, and Panadura main urban center-based urban agglomeration footprints. Colombo’s main urban center-based urban agglomeration footprint significantly expand from Colombo to Dehiwala. The area of the footprint is 66.12 km
2. However, Panadura acts as another main urban center; its area is 3.16 km
2. In addition, Panadura and Colombo’s main urban centers are connected through several suburban centers, i.e., Panadura, Wadduwa, and third-grade urban centers, i.e., Kalutara and Aluthgama. Therefore, Colombo’s main urban center towards Kalutara’s urban main urban center can be identified as one urban region with multiple nuclei, which is expected to increase the agglomeration levels with future developments.
Matara’s main urban center-based urban agglomeration footprint is the second-largest urban agglomeration footprint, with an area of 12.94 km2. Therein, Weligama acts as a third-grade urban center. Galle’s main urban center-based urban agglomeration footprint is the third-largest urban agglomeration footprint, with an area of 10.49 km2. Furthermore, there are other town centers, such as Ambalangoda and Hikkaduwa, that experience a lying urban expansion, which emerge as isolated urban centers.
5. Findings and Discussion
A clear change in the urban agglomeration footprints can be identified by comparing the composite maps of urban form and urban function-related parameters separately. Accordingly, the composite map of urban form-related parameters (see
Figure 5) has identified Colombo’s main urban center-based agglomeration footprint as the only footprint with infilling urban expansion. In contrast, the composite map of urban function-related parameters (see
Figure 7) has identified a larger urban agglomeration footprint that extends from Colombo to Dehiwala.
Even when comparing the extent of the urban footprints, the composite maps of urban form-related parameters and urban function-related parameters depict a clear difference. For instance, Colombo is the only main urban center-based urban agglomeration footprint with over 12 km2 on urban form-related parameters, the same urban agglomeration footprint has extended for 66.12 km2 on the urban function-related parameters. Although the urban form-based parameters, such as distribution of residential activities, residential population, hospitals, education institutions, etc., are widely used to demarcate urban agglomeration footprints, they provide a misinterpretation of the urban agglomeration patterns, which could misguide the policymakers and practitioners in making decisions.
The footprint extent gap between the Colombo urban agglomeration footprint identified through the composite maps of urban form and functions-related parameters was 53.62 km2, which is considerably high. This hints at a possible overestimation or an underestimation of the urban agglomeration footprints.
Therefore, the study developed a composite map of all urban form and function-related parameters to compare with the real ground situation for validation purposes. Unlike the composite map of urban form-related parameters, the composite map of urban function-related parameters has identified multiple main urban centers-based urban agglomeration footprints, i.e., Colombo, Galle, and Matara, which further justify the inadequacy of using one or a few parameters to understand the urban agglomeration footprints which have been the popular practice so far [
14,
16,
40]. The composite map of all urban form and function-related parameters given in
Figure 8 and
Figure 9a–c shows the validation.
Table 9 shows the identified urban centers and the expansion types of each urban center.
In contrast to the composite map of urban form-related parameters but like the urban function-related parameters,
Figure 8 has identified three main urban centers-based agglomeration footprints, i.e., Colombo, Galle and Matara. Unlike the composite map of urban function-related parameters, the extent of the Colombo urban agglomeration-based urban footprint has declined significantly. To validate the differences mentioned above and similarities, the study overlaid the urban agglomeration footprints derived under urban form and functions analyses on a Landsat satellite image.
Accordingly, the identified urban agglomeration footprints using the big data fusion approach can be used as a strategy to develop urban areas. As understood, the urban agglomeration boundaries have expanded beyond the administrative boundaries. Policymakers and urban planners need to pay attention to these identified urban centers. National Physical Planning Policy and the Plan (NPPD 2017–2050) also has identified the main urban centers as Colombo, Kalutara, Galle, and Matara along the coastal belt, and these centers have been identified based on population density as the only parameter. Further, the footprint’s extent and possible expansion type were not analyzed. As an example, although NPPD 2017–2015 has identified Kalutara as a main urban center, this study has identified Panadura as a town experiencing more agglomeration than Kalutara, which shows the importance of adopting a multisource geospatial big data analytics approach.
Higher urban agglomeration can lead to different environmental pollution [
55,
56,
57,
58,
59], extreme heat events [
60], urban sprawl [
45,
61], demands for administrative restructuring [
62], and regional imbalances [
63]. Therefore, understanding urban agglomeration footprints and expansion types are important to orient urban planning-related policy decisions to better manage the negative effects of urban agglomeration. Controlling urban agglomerations should not only limit the city’s growth but also promote the development of the identified urban clusters within the limitations of environmental constraints [
56]. Rather than applying the same policies to guide all agglomerating footprints that experience different expansion types with different agglomeration levels, applying a separate set of policies could lead to an increase in the city’s sustainability [
64,
65].
As underlined in the literature [
66], urban agglomeration is a problem for a city, as its magnitude, expansion levels, and types are not well studied. An updated, rich dataset with limited time- and cost-consuming methods would serve this need. If the nature of the urban agglomeration is not well studied and relevant actions are not taken, uncontrolled/unplanned/spontaneous urban growth, consequential pollution, and environmental abuse, and urban–rural conflicts in the form of unsustainable peri-urban growth could happen [
67]. Alternatively, in a context where urban agglomeration is well studied, and relevant actions are carefully taken, urban planning practices could orient towards a more sustainable development that limits environmental abuse/degradation and socioeconomic inequalities [
19,
66,
68].
Previous studies conducted to understand urban agglomeration footprints either have used small datasets, i.e., interview and survey results [
69,
70], or a dataset that represents only urban form, i.e., road density, building density, and land use [
14,
70]. Most recent studies have only attempted to investigate urban agglomeration footprints using the datasets related to defining an urban function, i.e., NTL [
71,
72,
73], while totally neglecting the urban form-related parameters. Nonetheless, none of the studies have tried to follow an integrated approach to understanding urban agglomeration footprints. This study has found that technological innovation and advanced data analytics are required to adequately demarcate urban agglomeration footprints [
74,
75,
76,
77,
78]. Accordingly, this study emphasized the importance of combining both urban form and function-related parameters using a multisource open geospatial big data fusion approach to accurately understand urban agglomeration footprints, also incorporating near-rea-time dimension to the approach.