1. Introduction
Official maps often omit the existence of deprived areas [
1] or declare them to be homogeneous [
2,
3]. However, deprived areas generally differ in their histories, their morphologies, services, socioeconomic, conditions and tenure (ranging from pavement dwellers and large slum areas to deprived resettlement colonies). Finding reliable information on deprived areas is a complex problem, as illustrated by population estimates in the large Mumbai slum Dharavi, which, according to [
4], range from 300,000 to 900,000 inhabitants. Furthermore, deprivation mapping is often carried out at the administrative ward level (c.f. [
5,
6]), hiding spatial differences within wards and clustering across ward boundaries. This is a particular problem if wards are rather large, as is the case of the health wards in Mumbai (of which there were 88 at the time of the 2001 Census, with an average population of 136,000). In the 2011 Census data, the metropolitan area of Mumbai is divided into 24 administrative wards, with populations ranging from 127,290 (city [
7]) to 941,366 people (suburban [
8]). Linking and integrating spatially detailed information on slums to such large and aggregated spatial units is a problem [
9], thus even when data on slums are available they are often not used as useful spatial relationships cannot be built.
Very high resolution (VHR) remote sensing imagery has become a valuable information source regarding urban morphologies [
10,
11,
12,
13], “providing spatially disaggregated data in a more timely fashion for urban planning processes” [
14] (p. 2) compared to traditional ground-based surveys [
15]. The utility of VHR imagery covering large areas is particularly relevant for complex megacities with rapid changes. With respect to the megacity of Mumbai, several studies have shown the potential of VHR imagery to map urban land uses (e.g., [
16]), and particularly deprived areas (e.g., [
4,
15,
17,
18,
19]) that house a large share of the population. Nevertheless, to date, little research has quantified their specific morphological characteristics. A first attempt by [
4], measuring morphological characteristics of slums in Mumbai, showed them to have similar characteristics in terms of high densities, building size, and height, but stressed their heterogeneous morphological characteristics. However, they did not focus on typologies of deprivation.
A recent review of slum mapping via remote sensing [
20] revealed a range of methods and image features, e.g., object-based image analysis (OBIA), grey-level co-occurrence matrix (GLCM), and spatial metrics, where [
21,
22] showed the effectiveness of index-based approaches to reduce feature dimensionality for urban mapping. OBIA allows the extraction of roof objects as well as the extraction of homogeneous settlements depending on the way the scale parameters are set [
23]. Homogenous settlements are also referred to as homogenous urban patches (HUP), following [
24]. However, the capacity to automatically extract roof objects depends on the image resolution and urban morphology, which is challenging in many Asian cities, where often large areas of relatively small buildings display high clustering [
20,
25]. Several remote sensing studies have extracted slum settlements (or slum HUPs) (e.g., [
17,
23]). These studies used typical morphological characteristics of deprived areas (i.e., small building sizes, high densities, and organic layout pattern), allowing their mapping via image features (spectral, texture, or spatial metrics). Spectral information assists in differentiating typical roofing materials between deprived and other built-up areas. However, the use of different roofing materials, ranging from plastic, wood and metal to concrete and asbestos, makes mapping relying on spectral information alone problematic. An alternative is employing the GLCM, which calculates several textural measures within a user-defined window size and shift [
26]. Previous studies employing GLCM-derived texture measures for mapping deprived areas include contrast [
23,
27], entropy [
28,
29], and variance [
17,
30]. Spatial metrics are increasingly used to analyze and quantify the urban morphology (e.g., [
15,
31,
32]), where [
23] showed the utility of combining both texture and spatial metrics for extracting slums in Pune (India), but also illustrated uncertainties in slum identification [
33]. However, they did consider slums a ‘homogeneous zone’, while uncertainties might also be caused by different types of deprived areas.
Therefore, the aim of this paper is to analyze the capacity of VHR imagery and image processing methods to map locally specific types of deprived areas in Mumbai, which can help in analyzing their diversity and clustering. The structure of the paper is as follows.
Section 2 develops a framework for analyzing the diversity of deprived areas in VHR images.
Section 3 describes the methodology to create a typology of deprived areas. In
Section 4, a random forest classifier and logistic regression (LR) model are employed using VHR imagery to model deprived areas. The output provides significant image features of the LR model and an accuracy assessment. In
Section 5, we discuss the main findings and the application relevance followed by conclusions on the scope, capacities and limitations for extracting such typologies from VHR imagery.
5. Discussion
The aim of the study was to analyze the capability of image processing methods to spatially distinguish different deprived areas in Mumbai from VHR imagery. Deprived areas in Mumbai have diverse and complex morphological characteristics, often overlooked in previous studies, e.g., [
17,
18]. The morphological characteristics were conceptualized into four dimensions, i.e., environment, texture pattern, density, and geometry, and further utilized in the image-based analysis to extract spatial information about their morphological differences. This not only improved our understanding of how to extract such information, but also has practical value. For instance, [
51] stressed that deprived areas with a more regular pattern offer a better “base for subsequent improvements and installation of infrastructure”(p. 7) than areas with more irregular patterns, which often require more investment for upgrading. Thus, if different morphologies require different action for upgrading, detailed knowledge on the morphology of deprivation will support planning and decision-making for implementing upgrading policies [
66]. However, the employed dimensions and their features have an inherent challenge, which refers to the spatial dimension used for its measurement [
12]; for instance, density measures vary considerably depending on the reference unit used. Thus utilizing a different spatial aggregation level, e.g., via smaller or larger HUPs or using more regular outlined blocks will give different feature values and impact final mapping results. Nevertheless, we argue that HUPs optimized for the local context are much better adapted to reflect the urban morphology compared to administrative units, which are often not suitable due to the modifiable areal unit problem (MAUP) [
12] and their overly large and variable size.
The extracted morphological features allowed us to capture the diversity of four deprived and one formal built-up area type. The significance of these image features was analyzed within a LR model, resulting in a set of coefficients and constants for the most significant features (i.e., GLCM variance, built-up mean area, land cover/use evenness (SHEI), DEM mean, GLCM second moment mean, GLCM entropy mean, and built-up patch density). This allowed us to calculate class probabilities for all HUPs, which resulted in a fuzzy probability layer at the HUP level. The final typology of deprived areas was based on the highest class probability. Due to the logistical challenges of collecting a large set of ground-truth data spread over a large urban area, the number of training points was relatively small. Collecting such data based on visual image interpretation, as is often done, would introduce a lot of uncertainty, as experts often disagree on the delineation of deprived areas in VHR imagery [
33,
67]. The increasing availability of crowdsourced data and Google Street View (e.g., in Indonesian cities) combined with visual image interpretation might, in the future, facilitate the extraction of suitable training data. Therefore, it would be interesting to repeat the approach for other cities using a larger set of training data.
Through this study we distinguished different types of deprivation with an overall classification accuracy of 79%. Obtained accuracy levels differed by type, showing that slums with small buildings had the highest classification accuracy while slums with mixed building sizes and the transition type between chawls and basic formal areas had the lowest classification accuracy. The aggregation of deprived areas to HUPs allowed for mapping the dominant type of entire neighborhoods. However, this aggregation often led to very small clusters of slum pockets (e.g., small pavement dwellings) being omitted as they are frequently part of a larger (e.g., formal) HUP. Employing a LR model helped to reduce the computational demand, because all feature values were aggregated to HUPs stored as vector data (in a raster data structure, image features would consume several GB). HUPs are also a more meaningful spatial unit for informing pro-poor policies. Furthermore, LR modeling allowed the extraction of the most significant features per type, while the fuzzy classification facilitated a better optimization of class threshold (probability) values compared to standard image classification methods.
The presented approach to capture the diversity of deprivation in a large and complex megacity was tailored to the local morphology of deprivation (in Mumbai) via the selected image features. However, the conceptual level of the four dimensions of the diversity of deprivation has the potential of being transferable (for concepts on measuring transferability and robustness, see [
59,
67,
68,
69]) to other cities in the Global South. Further studies are recommended to better understand and analyze the diversity of deprivation across the globe, as well as to decide which image features are relevant for specific regional conditions.
The application potential of mapping the diversity and clustering of deprived areas was illustrated by overlaying the result with the health ward boundaries. This showed that large administrative units have limited use in mapping fine-grained patterns of deprivation in a complex megacity [
15]. The ward boundaries sometimes cut across larger clusters of deprivation, splitting them into smaller subunits. For informing pro-poor policy, ward-based information hides the spatial heterogeneity of deprivation within wards and across boundaries, hampering effective planning and service provision [
66]. Thus, more disaggregated and clustered information on deprivation that also measures its diversity could improve planning and decision-making in complex and dynamic megacities. It also points to the possible benefit of coordinating anti-deprivation action across ward jurisdictions, so that spatial coherent investments and improvements are made. Thus, VHR imagery, with its potential for covering larger areas with high temporal frequency, is fit for capturing details of the urban morphology beyond the aggregated view of administrative units.