Next Article in Journal
Does Hazardous-Waste Testing Follow Technical Guidance, Thus Help Protect Environmental Justice and Health?
Next Article in Special Issue
Study on Spatial Distribution Equilibrium of Elderly Care Facilities in Downtown Shanghai
Previous Article in Journal
Exposure to Toxic Metals and Health Risk Assessment through Ingestion of Canned Sardines Sold in Brazil
Previous Article in Special Issue
Prediction and Trend Analysis of Regional Industrial Carbon Emission in China: A Study of Nanjing City
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Using Open-Access Data to Explore Relations between Urban Landscapes and Diarrhoeal Diseases in Côte d’Ivoire

1
School of Architecture, Civil and Environmental Engineering, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
2
Centre Suisse de Recherches Scientifiques en Côte d’Ivoire, Abidjan 01 BP 1303, Côte d’Ivoire
3
Swiss Tropical and Public Health Institute, 4123 Allschwil, Switzerland
4
University of Basel, 4001 Basel, Switzerland
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2022, 19(13), 7677; https://doi.org/10.3390/ijerph19137677
Submission received: 29 May 2022 / Revised: 17 June 2022 / Accepted: 21 June 2022 / Published: 23 June 2022

Abstract

:
Unlike water and sanitation infrastructures or socio-economic indicators, landscape features are seldomly considered as predictors of diarrhoea. In contexts of rapid urbanisation and changes in the physical environment, urban planners and public health managers could benefit from a deeper understanding of the relationship between landscape patterns and health outcomes. We conducted an ecological analysis based on a large ensemble of open-access data to identify specific landscape features associated with diarrhoea. Designed as a proof-of-concept study, our research focused on Côte d’Ivoire. This analysis aimed to (i) build a framework strictly based on open-access data and open-source software to investigate diarrhoea risk factors originating from the physical environment and (ii) understand whether different types and forms of urban settlements are associated with different prevalence rates of diarrhoea. We advanced landscape patterns as variables of exposure and tested their association with the prevalence of diarrhoea among children under the age of five years through multiple regression models. A specific urban landscape pattern was significantly associated with diarrhoea. We conclude that, while the improvement of water, sanitation, and hygiene infrastructures is crucial to prevent diarrhoeal diseases, the health benefits of such improvements may be hampered if the overall physical environment remains precarious.

1. Introduction

Diarrhoeal diseases still pose a considerable public health problem. Although diarrhoea is preventable and treatable, it remains one of the main causes of death worldwide, especially among young children and the elderly in sub-Saharan Africa [1]. With over 800,000 deaths attributed to inadequate water, sanitation and hygiene (WASH) in 2016, diarrhoea is the main component of the disease burden from unsafe WASH [2].
It is widely acknowledged that safe access to well-managed WASH services is key to interrupting the transmission pathways of pathogens causing diarrhoeal diseases [3,4,5]. Nonetheless, important research gaps remain, notably regarding potential risk factors related to the physical environment. For example, although previous studies have explained the relevance of understanding spatial patterns of diarrhoea [6,7], little is known about the impact of landscape patterns—i.e., the form and composition of the physical environment—on the occurrence of diarrhoeal diseases. Besides socio-economic and biological factors, the physical environment can also be considered as a key health determinant [8,9]. Moreover, understanding the relationship between diarrhoea and the spatial characteristics of human settlements becomes even more relevant in the case of child health, considering that the main exposure pathway for toddlers is hand-to-mouth contamination [10], which relates directly to the living environment.
In this study, we adopted a landscape ecology perspective [11] to deepen the current knowledge on risk factors for diarrhoea, by advancing “urban landscapes” as the key exposures. We purposively refer to urban landscapes in the plural, positing that they result from specific, local socio-spatial arrangements that come to bear in different forms. Indeed, cities are “patchy” ecosystems [12], formed by different types of land use and occupation that materialize into extremely diverse urban habitats—arguably, with some of these habitats being more healthy than others. In this sense, landscape ecology methods are instrumental to analyse the form and composition of urban habitats, by addressing landscape as spatial mosaic of discrete land use/land cover “patches” [11,13,14].
Thanks to advances in geospatial technologies and access to georeferenced health data, spatial patterns of disease have been extensively studied [7,15], while mathematical models have been developed to analyse and predict the spatiotemporal spread of infectious diseases [16]. However, most studies conducted at large geographic scales (e.g., national, or metropolitan scales) focus only on disease clustering, and inadequately address space as an abstract, homogenous entity. In fact, variables related to the physical environment (e.g., entropy levels of the built environment, density or land-use) are rarely integrated in large-scale studies—exceptions occur when both health and settlement data are available at a high spatial resolution, or are collected directly by the investigators [17,18].
The lack of high-resolution data systematically collected over large areas constitutes a barrier to ecological studies that aim to explore risk factors derived from the physical environment [19], especially in low- and middle-income countries [20]. Such research barriers are being overcome with rapidly growing open-access data portals (e.g., OpenStreetMap and Humanitarian Data Exchange) and the increased contribution of open science projects that freely share their data. There is a high potential to combine these openly accessible datasets to deepen the understanding of how the physical environment relates to health. Yet, there is a paucity of studies that use open-access data to investigate this relationship at a large geographic scale. Such analyses could provide a useful framework, given their low cost and high reproducibility.
Moreover, understanding how the physical environment affects health is key for decision-making, notably in contexts marked by high levels of socio-spatial segregation that lead to disparate health outcomes [9]. Indeed, in cities characterised by inequitable spatial development, the spatial dimension of disease must be addressed to properly identify priorities and to tailor targeted interventions. In sub-Saharan Africa, given the dynamics and intensity of recent urbanisation trends [21], decision makers would benefit from a deeper understanding of the relations between urban landscape patterns and diarrhoeal diseases.
The purpose of this study was to explore the possible relations between urban landscapes and diarrhoea, addressing the hypothesis that specific, spatial patterns of urban settlements can be associated with the prevalence of diarrhoea. In this way, the spatial characteristics of the physical environment were defined as key variables of exposure. At the same time, our study put forward an analytical framework that was strictly based on open-access data and open-source software. We focused on Côte d’Ivoire, a West African country that faces considerable challenges related to rapid urbanisation and has a high burden of diarrhoeal diseases, which are among the country’s top 10 causes of death [22]. In addition, Ivorian cities are marked by contrasting landscapes, as economic growth has failed to keep pace with the rapid urban and demographic growths [23], while transposed town planning models have failed to address local socio-economic needs [24]. Notably, access to water and sanitation services remain a great challenge. Between 2000 and 2020, access to basic water services in Ivorian cities decreased from 91 to 85%, while 52% of the urban population still lacked access to basic sanitation in 2020 [25]. The concentration of demographic growth around a few cities (“urban primacy”) has certainly exacerbated the issues related to the lack of basic services, as these cities have become saturated and witnessed the emergence of precarious, informal settlements [23]. In 2014, over half of the urban population in Côte d’Ivoire lived in slums [26], i.e., deprived settlements where the physical environment itself exacerbates the risk of several illnesses, including diarrhoeal diseases [9,27].

2. Materials and Methods

2.1. Datasets

This cross-sectional, ecological study combined open-access data on health, land cover, and basic infrastructures, readily obtained from different sources (Table 1). Geospatial data on the occurrence of diarrhoea among children under the age of five years can be obtained from the Demographic and Health Surveys (DHS) programme in vector format [28]. The DHS provides anonymised survey data at the individual and household levels that include cases of diarrhoea that had occurred in the 2 weeks preceding their survey. The DHS also provides data on access to WASH facilities and education, which were relevant to the current analysis. DHS data can be georeferenced by linking the observations to the point locations of survey clusters. To ensure anonymity, these locations do not correspond to the precise locations of participating households, but rather to the mean location of households belonging to a same cluster—i.e., for every cluster of surveyed households, there is one single GPS position that is attributed to all households belonging to the same cluster. For the datasets used in this analysis [29,30,31,32], there are 351 clusters distributed across Côte d’Ivoire, for a total of 9686 surveyed households. These clusters contain a median number of 27 households, or 134 people. Models of weather conditions can be obtained from Terraclimate in raster format, at a spatial resolution of ~4 × 4 km [33], and these data were included in our study because climatic conditions can be associated with diarrhoea [34]. Night illumination was used as a proxy for the presence of urban infrastructures, and can be obtained in raster format (500 × 500 m) from the National Aeronautics and Space Administration’s (NASA) Earth Observatory [35]. Land cover data can be obtained in raster format (300 × 300 m) from the European Space Agency’s Land Cover Climate Change Initiative Project (ESA Land Cover CCI) [36]. Population estimates at high spatial resolutions are provided by WorldPop [37], also in raster format (100 × 100 m). Finally, vector data on mobility infrastructures can be obtained from OpenStreetMap.

2.2. Data Pre-Processing

Combining health and environmental data is often a challenge, considering the differences in terms of the spatial and temporal resolutions of openly available data [38]. In this study, the areal units used to aggregate data were based on the geolocations of DHS clusters, which had the coarsest spatial resolution. All data processing steps were done in Python language, using the Jupyter computing environment. The following packages were used to process and visualise the data: PyLandStats 2.3.0 [39], rasterstats 0.15.0 [40], rasterio 1.2.9 [41], earthpy 0.9.2 [42], Fiona 1.8.20 [43], pandas 1.3.4 [44], geopandas 0.9.0 [45], numpy 1.20.3 [46], statsmodels 0.13.0 [47], pysal 2.5.0 [48], matplotlib 3.4.3 [49], and seaborn 0.11.2 [50].
The different data layers were harmonised spatially and temporally based on the buffer areas generated from DHS clusters, hereinafter called “spatial units”. Based on previous studies [51], we generated circular buffer zones originating from each cluster centroid, with radii based on the geographic blur determined by the DHS data anonymisation protocol: urban clusters had a buffer radius of 2 km, while rural clusters had a buffer radius of 5 km (Figure 1). The different data layers were aggregated at the cluster level: environmental data were aggregated based on the respective buffer areas, while DHS survey data were aggregated based on the clusters’ unique identifiers. In this way, for each spatial unit we obtained the following: (i) the prevalence of diarrhoea among children under the age of five; (ii) the proportion of the cluster’s population with access to at least “basic” water and sanitation, as defined by the World Health Organisation (WHO) and the United Nations Children’s Fund (UNICEF) Joint Monitoring Programme [25]; (iii) the proportion of the cluster’s female population who never attended school; (iv) local climatic conditions; (v) a list of landscape metrics derived from remotely sensed data (NASA and ESA Land Cover CCI), hybrid models (WorldPop), and volunteered geographic information (OpenStreetMap). Table 2 shows the variables obtained from this pre-processing in more detail.
Landscape metrics have been widely used to quantify and describe spatial patterns of “patches” of similar land cover categories [11,14,39], and thus are useful to analyse spatial patterns of urban settlements. These metrics were key features to our study, allowing us to relate the prevalence of diarrhoea (dependent variable) to indicators describing the form and composition of urban settlements (independent variables). We used the Python package PyLandStats [39] to calculate a series of landscape metrics for each land cover class contained within each spatial unit’s perimeter (Table 2), based on the data provided by ESA’s Land Cover Climate Change Initiative Project [36]. The metrics employed here—i.e., the proportion, edge and shape index of land cover patches—were based on previous studies that referred to land cover data to analyse spatial patterns of urban settlements [14] and environmental determinants of disease [18].
We added a level of detail to our landscape metrics by reclassifying patches originally categorised as “urban”, based on the levels of night illumination and demographic density (Figure 2 and Figure 3). Our study used the intensity of night illumination as a proxy for the presence of urban infrastructures—or, in other words, for the “quality” of urbanisation. Moreover, the quality of illumination affects the use of WASH amenities, especially by females [52], and thus may impact the risk of diarrhoea. We defined “precarious” and “regular” urban areas, building on the hypothesis that, if basic infrastructures are present, the level of illumination follows the level of demographic density. In this sense, urban pixels were considered “precarious” if they presented a high demographic density but a low (or relatively low) intensity of night illumination; on the other hand, “regular” urban patches had night illumination levels that matched the demographic density. This categorisation was done based on quantiles (Figure A1, Appendix A): for each urban pixel, if its density value was situated in a higher quantile than its night illumination value, it was considered “precarious”.
Another indicator of the quality of urbanisation was the presence of roads, for which we calculated two indicators: road availability (km of roads per ha of built-up area) and linearity (ratio between edge length and linear distance between the two vertices of the same edge). The latter also served as an indicator of the urban form.
Finally, based on the literature, a selection of features that have been associated with diarrhoea were included as control variables. Basic water and sanitation services are key to prevent diarrhoea [2], and were therefore included. Access to these services was measured using the DHS household datasets [30]. Maternal education has been associated with a lower risk of diarrhoea [53]; at the same time, it has been related with reporting bias, as households with higher levels of maternal education have shown increased rates of reported child diarrhoea [54]. A proxy variable of maternal education (i.e., women’s educational attainment) was therefore added, based on individual DHS data [31]. Climatic conditions also have been associated with diarrhoea [34], but here they showed no significant correlation (see Table A3, Appendix D). Hence, these data were discarded (for more details, see the computer code section at the end of this article).
To preserve some level of detail in the data, the variables resulting from the aggregation at the cluster level consisted of proportions (percentage) rather than simple means or medians. For example, demographic density in each spatial unit was given by the ratio between the number of built-up pixels classified as “dense” (with a value higher than the statistical series’ median) and the total number of built-up pixels contained in the respective spatial unit.
By the end of the pre-processing, we obtained a geo-dataset with 351 spatial units. Each spatial unit had three control variables (water, sanitation, and women’s educational attainment) and a total of 90 independent variables, i.e., the landscape metrics described in Table 2 (the full list of variables is given in Table S1, see Supplementary Materials).

2.3. Statistical Models and Feature Selection

We used four different regression models to assess the significance of the associations between the calculated landscape metrics and diarrhoea, while accounting for the effects of access to water, sanitation, and education, as well as spatial autocorrelation. The input data were standardised with a min/max scaler function, so that the effects of the different features could be compared through their coefficients.
We started by running a feature selection algorithm to identify the most important variables to be included in the regression models. Given the high number of independent variables that were derived from the landscape metrics (n = 90), a preliminary feature selection was necessary to avoid multicollinearity and to clarify the scope of the analysis. The feature selection algorithm was composed of two “filters”. The first filter was a stepwise selection, which is a process that adds variables from a predefined list to the model, one-by-one, rechecking at each step the importance of all previously included variables [55]. In other words, the stepwise process combines forward and backward feature selection processes, consisting of iterative linear regressions allowing to identify the “best” features based on predefined significance thresholds (maximum p-value was set to 0.1) and model performance (residual sum of squares). The second and final filter was based on Spearman’s rank correlations (ρ): we used the Python package statsmodels [47] to calculate bivariate correlations between each feature selected with the stepwise method and the dependent variable (i.e., prevalence of diarrhoea at cluster level); only features with a p-value smaller than 0.1 were kept. We purposively opted for relative high thresholds of p-values because of the exploratory nature of this study.
Once we concluded the feature selection, we ran both weighted and unweighted regression models. First, we built an unweighted ordinary least squares (OLS) model containing the dependent, control, and independent variables, as well as a constant. Then, we built a weighted model with the same features using the cluster weights given by the DHS. In fact, when conducting country-level analyses, the DHS suggests using cluster weights to adjust for eventual biases resulting from their sampling method. Given the infectious nature of diarrhoeal diseases, the analysis also needed to account for spatial dependence [56]. To this end, we used the Python package pysal [48] to run two models of spatial regression with the same features, spatial lag and spatial error, as explained in Section 2.4.

2.4. Addressing Spatial Dependence

Spatial dependence, or spatial autocorrelation, is the phenomenon by which values of observations are associated with each other through geographic distance (e.g., high values close to other high values) [57]. Accounting for spatial dependence is essential because linear regressions assume a normal, random distribution of error terms and the absence of spatial autocorrelation in the dependent variable. We estimated the probability of spatial autocorrelation in the dependent variable by calculating the global Moran’s I, which indicated whether the observed values of prevalence of diarrhoea were clustered, or randomly distributed, in space. As for the error terms in the OLS regression, we detected the probability of spatial dependence through the Lagrange multiplier test for spatial error.
Contrary to an OLS model, spatial regressions can account for the spatial autocorrelation of the dependent variable (spatial lag dependence) and of the error term (spatial error dependence) [58]. The spatial lag model used in this study incorporates the spatial autocorrelation of the dependent variable by introducing the average values of neighbours as an additional variable into the regression specification (Equation (1)):
y = α + ρWy + Xβ + ε
where y is an N × 1 vector of observations on a dependent variable taken at each of N locations, α is the intercept, ρ is a scalar spatial lag parameter, W is an N × N matrix of weights indicating the spatial framework of neighbourhood effects among the dependent variable, X is an N × k matrix of explanatory variables, β is a k × 1 vector of parameters, and ε is an N × 1 vector of error terms. Similarly, the spatial error model used in this study also incorporates spatial autocorrelation by introducing the average values of neighbours as an additional variable into the regression specification, but this time using the values of the error terms (Equation (2)):
y = α + Xβ + u,     u = λWu + ε
where u is the vector of spatially autocorrelated residuals with constant variance and covariance terms, specified by an N × N matrix of weights indicating the spatial framework of neighbourhood effects among the error terms (W) and a spatial error coefficient (λ).

2.5. Inclusion Criteria and Stratification of Analysis

The unit of analysis was the buffer area generated from each DHS cluster’s centroid (spatial unit). Out of the 351 spatial units, 10 were excluded as they did not have valid geographic coordinates. Furthermore, because our analysis focused on human settlements, we opted to keep only those spatial units with at least 1 “urban” pixel (300 × 300 m). Hence, we excluded 74 spatial units where no human occupation was detected—including units with settlements not sufficiently large to be detected at the spatial resolution used here. In the end, 267 spatial units (out of 351) were included in our regression analyses. Details about the discarded units are given in Table A1 (Appendix B).
To determine whether the size and proportion of urban areas affected the association between landscape features and diarrhoea, the processes described in Section 2.3 were stratified into two levels. First, we conducted the regression analyses with all the 267 spatial units that met our inclusion criteria. Then, we conducted the same analyses with an “urban” subset, which contained 105 spatial units. The criterion for a spatial unit to be classified as “urban” was to have a proportion of urbanised area (ratio between the surface of “urban” pixels and the spatial unit’s total area) that was above the average of the retained 267 spatial units. Figure 4 shows the location of the 267 spatial units included in the analysis, specified by subset.

3. Results

3.1. Overall Clustering of Data and Need for Spatial Regressions

The tests for spatial dependence confirmed the need for spatial regression models when processing the data of the full dataset (n = 267). The global Moran’s I statistics for the spatial distribution of the dependent variable showed a significant, positive spatial autocorrelation of the prevalence of diarrhoea (Moran’s I = 0.11, p = 0.002). Regarding the distribution of error terms in the OLS regression, a significant spatial dependence was detected by the Lagrange multiplier test (p = 0.002). For the urban subset (n = 105), however, no spatial dependence was detected. Nevertheless, we also ran spatial regressions with this subset, for the sake of comparison with the full dataset analysis, and overall coherence in the analysis.

3.2. Significant Landscape Feature: Dense, Precarious Urban Areas

One feature passed our selection filter and, hence, was retained as an independent variable: the proportion of urban patches (ratio between patch area and buffer area) that were characterised as being very dense (demographic density values situated within the last decile of the series) and with low-to-medium night illumination levels (values above the median but below the last decile of the series, see Figure A1, Appendix A). Following the rationale explained in Section 2.2, these patches correspond to “precarious” urban areas, as the level of density is higher than the illumination level. A total of 44 spatial units (all located around Abidjan and San Pédro, two large cities) had at least one urban pixel with these characteristics. We denominated this type of urban patch as “dense, precarious urban areas”.
Table 3 and Table 4 summarise the results of the different regression models used, following the stratification explained in Section 2.5. The proportion of dense, precarious urban areas was included as single independent variable (as it was the only significant landscape metric), while indicators of access to water, sanitation, and education were included as control variables.
For the full dataset (n = 267 spatial units), the weighted OLS regression and the spatial lag model performed better than the other two, based on their R2 and Akaike information criterion (AIC) values. Two features consistently showed significant coefficients (p < 0.05): (i) proportion of the cluster population with access to basic sanitation facilities; and (ii) proportion of dense, precarious urban areas. In the four models, basic sanitation was negatively associated with diarrhoea, while dense, precarious urban areas showed a positive association and had the strongest beta coefficients. The proportion of women without any education showed significant coefficients in the weighted OLS regression model and, globally, was negatively associated with the prevalence of diarrhoea. There was no significant association between access to basic water facilities and diarrhoea. We observed that neither the directions nor the levels of association between diarrhoea and the tested variables changed significantly between the different models. Also, although spatial dependence was detected, the spatial lag model only improved the coefficient of determination when compared to the OLS model, while it was the weighted model using DHS cluster weights that explained most of the variance of diarrhoea.
For the urban subset (n = 105 spatial units), there was considerably less variation in the models’ R2 and AIC values. In terms of the explained variance of the dependent variable, the models performed better with the urban subset than the full dataset. In this subset, however, the only feature that consistently showed significant coefficients was the proportion of dense, precarious urban areas—having once again the strongest beta coefficients. Except for sanitation, the control variables showed poor coefficients for the urban subset. As in the full dataset analysis, neither the directions nor the levels of association between diarrhoea and the tested variables changed significantly between the different models. Similarly, the model that explained most of the variance of diarrhoea was the weighted OLS model (with DHS cluster weights).

3.3. Stages of Urbanisation and Landscape Patterns

Our stratified regression analyses showed that the proportion of urbanised area within the spatial units affected the coefficients of the selected landscape metric, as well as those of the control variables. Indeed, in the urban subset, the independent variable’s coefficients became more important and significant, while the coefficients of the control variables became much less significant than in the analyses with the full dataset.
In addition to the regression coefficients, urbanisation also affected the independent variable’s rank correlation with the outcome of interest (Figure 5). For the full dataset (n = 267), the Spearman’s correlation coefficient, which was calculated between dense, precarious urban patches and the prevalence of diarrhoea, was +0.11 (p = 0.079), while for the urban subset (n = 105; see orange dots in Figure 5) it increased to +0.24 (p = 0.013).
The size of urbanised area (total “urban” pixels counted in each buffer area) was positively correlated with the percentage of urban patches that were characterised as dense and precarious (ρ = +0.50, p < 0.001). Put differently, as the extent of urbanised areas within the spatial units (buffer areas) increased, the probability of urban areas being classified as “dense and precarious” also increased. Moreover, the cumulated area covered by urban patches was also significantly correlated to the percentage of women without any education (ρ = −0.35, p < 0.001), access to basic water facilities (ρ = +0.53, p < 0.001), and to basic sanitation (ρ = +0.50, p < 0.001).
The correlation between urbanisation and access to basic sanitation facilities is less clear than with the other two control variables (Figure 6, Figure 7 and Figure 8). Additionally, the correlations changed according to the stage of urbanisation: we found higher significance levels (p-values) for Spearman’s correlations among spatial units having low proportions of urban areas, as compared to the urban subset. We found p = 0.002 for water (against 0.318 for the urban subset); p ≈ 0.000 for sanitation (against 0.089); p = 0.002 for women’s education (against 0.939).

4. Discussion

4.1. Towards Spatial Predictors of Health Outcomes in Urban Areas

Spatial predictors of socio-economic indicators based on the morphology of the built environment have already been studied [59,60]. At the same time, socioeconomic indicators have been assessed as potential risk factors for diarrhoeal diseases [53]. However, to our knowledge there are no studies that explore direct associations between the urban landscape and diarrhoea. In this sense, our study advanced a selection of landscape metrics as potential predictors of disease. One must note that this research for spatial predictors of diarrhoea was facilitated by the availability of open-source software and open-access data on human settlements and populations, which should be encouraged and valorised both by scientists and planning authorities.
We defined “dense, precarious areas” through demographic and night illumination data. Although hypothetical, this initial assumption proved to be useful in identifying potential vulnerabilities in terms of health. The coefficients of both linear regressions and Spearman’s rank correlation consistently showed a positive, significant association between “dense, precarious areas” and the prevalence of diarrhoea. Among the most urbanised clusters (“urban” subset), the selected landscape metric was a better predictor of the prevalence of diarrhoea than the “usual suspects”, such as access to basic water and sanitation facilities, or women’s educational attainment. In fact, the proportion of dense, precarious urban areas was the only feature to consistently show significant coefficients in all regression models.
These observations call for further investigations on predictors of diarrhoea that are more related to global aspects of the urban habitat, and not strictly focused on household-level indicators or facilities. Indeed, in the case of Côte d’Ivoire, we may argue that poor urban development has led to environmental conditions that limit the potential benefits of basic water and sanitation infrastructures. Without proper urban habitats, the sole presence of these infrastructures may not be sufficient to prevent diarrhoeal diseases. If this is confirmed, it would have policy implications, as decision-makers should aim to improve the overall urban habitat—instead of punctually improving infrastructures, as it may be the case in “slum upgrading” projects. Further research on this aspect could be useful to shed light on recent controversies regarding the health impacts of WASH interventions, which are not always so clear or significant as expected [61].
Although urbanisation in Côte d’Ivoire was correlated to key predictors of diarrhoea, such as the access to basic sanitation and the proportion of dense, precarious areas, it was not directly correlated with diarrhoea. In fact, neither the size nor the form of urban patches showed significant coefficients. Generally, there was no apparent association between city size and the prevalence of diarrhoea, based on aggregated cluster data (Table A2, Appendix C). Nevertheless, the “quality” of urbanisation (indicated here by the extent of “precarious” areas) was a significant feature.

4.2. Saturation of Urban Settlements and Health Inequities

Urbanisation has been associated with better aggregate indicators of social and health outcomes [8,62,63]. In Côte d’Ivoire, our results raised some nuances to this assumption. While access to basic water and sanitation, as well as access to education, were significantly associated with urbanisation, the latter alone was not associated in any way with the health outcome of interest—i.e., diarrhoea. On the contrary, depending on the urban landscape characteristics (e.g., the proportion of dense, precarious areas), the risk of diarrhoeal diseases appeared to be significantly higher.
Moreover, our results showed that the size of urban areas was positively associated with the percentage of urban patches that were characterised as dense and precarious. This suggests that infrastructures have failed to keep up with demographic growth, which visibly poses considerable challenges to town planners. The concentration of demographic growth in a few Ivorian cities, and their subsequent saturation, has been discussed previously [23]. As economic limitations hamper investment in infrastructures and access to adequate housing [64], the urban habitat becomes a risk factor for disease. If spatial development is not equitable, then urbanisation leads to health inequities that inflict a high burden on the most deprived populations—notably those living in slums [9,19].
Our analysis also showed that, at early stages of urbanisation, access to basic services significantly increases with urban growth; however, when the extent of urban areas reaches a larger size, this correlation disappears. An explanation could be that it is easier to expand infrastructures in smaller cities than in bigger cities; in the latter, demographic densities would tend to increase without a corresponding increase in infrastructures. In fact, all urban pixels reclassified as “dense, precarious urban areas” were concentrated in two of Côte d’Ivoire’s largest cities, namely Abidjan and San Pédro.
Africa is characterised by the fastest urbanisation rate in the world, and most of the future urbanisation is expected to occur on this continent [21]. Given the important spatial and social transformations that this engenders, specific attention of researchers and urban planners is needed to ensure that the urban environment becomes a catalyst for social development, rather than a health hazard. To this end, early interventions are key to prevent urban areas from becoming saturated and to ensure that basic infrastructures and services keep up with demographic growth. In this sense, focusing on small- and medium-sized cities would be crucial as they approach the “tipping point” (in terms of size of urban area), where it becomes more challenging to expand infrastructures to follow demographic (and spatial) growth.

4.3. Study’s Limitations and Need for Further Research

Our study has several limitations that are offered for consideration and, possibly, point towards paths for future research on spatial predictors of diarrhoea. First, the cross-sectional design is susceptible to what is known as the ecological fallacy [65], i.e., limitations to causal inference due to the use of aggregate data. Although precarious spatial development was associated with diarrhoea, there are other features in the complex urban system that may impact the prevalence of diarrhoea. We addressed these limitations by adding control variables (to mitigate effects of confounders) and by reducing the number of independent variables (which reduced multicollinearity in the statistical models).
Second, study design limitations impacted the prediction power of the regression models, which had R2 values situated between 0.06 and 0.20. However, we note that low R2 values are expected in such ecological studies, given the “noise” of environmental variables and the complexity of the studied ecosystems [66]. In this regard, our model focused on the material aspects of the urban environment, while omitting social dynamics that could be relevant to explain the prevalence of diarrhoea. For instance, mobility can be a key variable to model diarrhoeal diseases, as pathogens may travel long distances with humans [67], but this could not addressed here due to the lack of data. Third, although the stepwise feature selection process is a powerful screening tool to identify contender models, it is prone to statistical issues that merit acknowledgment. For instance, searching through a large number of potential features and keeping only the ones that best fit the sample data may lead to “overfitting” the model [55]. In addition, the multiple testing of a large number of contender features makes such selection methods prone to the “false discovery” of features that appear as significant, but in reality are not relevant to estimate the dependent variable [68]. In this sense, the explanatory variable in this study could be further assessed using datasets with a higher spatiotemporal resolution, based on a larger, random populational sample.
Lastly, the study material was strictly limited to open-access data, and hence, there were limitations in terms of spatial accuracy. While settlement data can be found at a fine spatial resolution, between 100 and 500 m (and potentially up to 0.5 m in the case of very-high resolution satellite imagery), socio-demographic data are only available at low spatial resolutions (in this case, 2 to 5 km). This is certainly due to the ethical considerations regarding privacy. Indeed, researchers must preserve the anonymity of their study participants, and hence, any georeferenced data must be transformed with a geographic blur so that the specific locations of participating households cannot be identified. In our study, this blur made it impossible to determine in which specific type of landscape patch the study subjects lived—which is why we worked with buffer areas and global landscape metrics. Moreover, we note that the socio-demographic data aggregated by clusters came from samples that were not necessarily representative of that same cluster’s population.
Further research is needed to verify to which extent the urban landscape effectively impacts the risk of diarrhoea. Similar study designs could be implemented, but this time using high-resolution socio-demographic data—the latter would certainly need to be primary data collected through household surveys. More generally, further research addressing WASH interventions and diarrhoea from an ecological perspective—i.e., focusing not only on household-level indicators, but also on features observed at community and urban scales—could significantly contribute to fill in the knowledge gaps regarding the health impacts (and efficiency) of different WASH solutions.

5. Conclusions

In this exploratory study, we addressed diarrhoeal diseases from a landscape ecology perspective by considering the physical environment as a key explanatory variable. We presented a framework strictly based on secondary, open-access data to assess whether specific patterns of urban landscapes could be associated with diarrhoea. We combined remotely sensed data from different sources to classify urban areas based on demographic density and the intensity of night illumination (used as a proxy for the presence of infrastructures). This allowed us to identify different types of “precarious” urban areas, and to confront them, along with other landscape metrics, with the prevalence of diarrhoea among children under the age of five years.
Based on the results of four different regression models, we found that patches of dense, precarious urban areas were a statistically significant feature, consistently showing a positive association with diarrhoea. This association was stronger in larger urban settlements, where the selected landscape metric was a better predictor of the outcome of interest than the control variables (water, sanitation, and education level). Given these results, we may argue that poor urban development may lead to environmental conditions that hamper the potential benefits of basic water and sanitation infrastructures—thus reducing their ability to prevent diarrhoeal diseases. These observations raise the question of the scope of urban interventions, which should focus on the overall quality of the landscape, and not be limited to punctual infrastructural improvements.
We acknowledge the experimental nature of the framework put forward by this study. There is a need for further research on how the urban environment may be associated with diarrhoea, and how it may impact on potential benefits of water and sanitation infrastructures. Urban planners and public health professionals will benefit from a better understanding of how WASH services interact with their spatial and social contexts, which certainly requires design adaptations.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijerph19137677/s1, Table S1: Spearman correlation coefficients between all environmental variables (n = 90) and the prevalence of diarrhoea, with the respective p-value (for each variable, the table also indicates whether it was selected by the stepwise regression).

Author Contributions

Conceptualisation, V.P.C. and J.C.; methodology, V.P.C. and M.B.; software, V.P.C. and M.B.; validation, V.P.C., J.C., B.K. and J.U.; formal analysis, V.P.C.; investigation, V.P.C.; resources, J.C.; data curation, V.P.C.; writing—original draft preparation, V.P.C.; writing—review and editing, J.U. and B.K.; visualisation, V.P.C.; supervision, J.C., B.K. and J.U.; project administration, V.P.C.; funding acquisition, J.C. and J.U. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Swiss National Science Foundation, grant number CRSII5_183577, and the APC was funded by École Polytechnique Fédérale de Lausanne.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data available in a publicly accessible repository are listed below. Data presented in this study (processed by the authors, i.e., buffers with aggregated household and landscape data) will openly available in Zenodo. Until the publication, they will be provisionally available on Git Hub, with the computer code (see “Computer Code and Software”). DHS data: restrictions apply to the availability of these data. Data was obtained from the Demographic and Health Surveys Program and are available at https://dhsprogram.com/data/dataset/Cote-d-Ivoire_Standard-DHS_2012.cfm?flag=0 (accessed on 11 May 2022) with the permission of the Demographic and Health Surveys Program. ESA Land Cover CCI data: restrictions apply to the availability of these data. Data was obtained from ESA Land Cover CCI and are available at https://maps.elie.ucl.ac.be/CCI/viewer/download.php (accessed on 11 May 2022) with the permission of ESA Land Cover CCI. NASA: publicly available datasets were analysed in this study. This data can be found here: https://earthobservatory.nasa.gov/features/NightLights/page3.php (accessed on 11 May 2022). OpenStreetMap: publicly available datasets were analysed in this study. This data can be found here: https://download.geofabrik.de/africa.html (accessed on 11 May 2022). Terraclimate: publicly available datasets were analysed in this study. This data can be found here: https://www.climatologylab.org/terraclimate.html (accessed on 11 May 2022). WorldPop: publicly available datasets were analysed in this study. This data can be found here: https://www.worldpop.org/project/categories?id=3 (accessed on 11 May 2022).

Acknowledgments

We are thankful to the kind support of Günther Fink (Swiss TPH) and Dominik Dietler (Swiss TPH), who provided useful insights regarding the choice and processing of data used in this study.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Computer Code and Software

A repository with the code materials used in this study is available at https://github.com/ceat-epfl/landscape-diarrhoea-civ (accessed on 28 May 2022).

Appendix A

Figure A1 illustrates the rationale behind the reclassification of urban pixels (300 × 300 m), based on the levels of demographic density and night illumination. Three thresholds were established for these two statistical series, corresponding to: (i) first decile; (ii) median; (iii) last decile. These thresholds allowed to classify each pixel into 4 classes of demographic density and night illumination (1 to 4, with “4” corresponding to the highest values, i.e., within the last decile of the series). Urban pixels with a density class higher than its illumination class were considered “precarious”.
Figure A1. Scatterplot showing the reclassification of urban pixels (300 × 300 m) according to demographic density and night illumination. Elaborated by the authors.
Figure A1. Scatterplot showing the reclassification of urban pixels (300 × 300 m) according to demographic density and night illumination. Elaborated by the authors.
Ijerph 19 07677 g0a1

Appendix B

Table A1 shows the general characteristics of the excluded and retained spatial units, based on the variables provided by the DHS surveys. The reader must note that it was not in the scope of this study to analyse these differences. In fact, our analysis focused on human settlements, which requires the presence of urban land cover “patches”—which is why we only included spatial units containing at least 1 pixel (300 × 300 m) classified as “urban”.
Globally, the 84 units that did not meet the inclusion criteria (explained in Section 2.5) showed similar prevalence rates of diarrhoea, as well as access to basic water facilities, to the 267 units retained for the analysis. As for the levels of access to basic sanitation and women’s education, the values were significantly different.
Table A1. General characteristics of the excluded and retained spatial units (obtained from DHS data).
Table A1. General characteristics of the excluded and retained spatial units (obtained from DHS data).
StatisticPrevalence of
Diarrhoea (%)
Access to Basic 1
Water (%)
Access to Basic 1
Sanitation (%)
Women who Never Went to School (%)
Excluded
obs. (n = 84)
Retained
obs. (n = 267)
Excluded
obs. (n = 84)
Retained
obs. (n = 267)
Excluded
obs. (n = 84)
Retained
obs. (n = 267)
Excluded
obs. (n = 84)
Retained
obs. (n = 267)
Median value16.716.767.187.43.718.583.050.0
Mean value17.818.261.078.16.227.080.151.0
Standard deviation10.912.627.724.97.926.115.422.2
Minimum value0.00.00.00.00.00.034.84.2
Maximum value42.956.3100.0100.033.3100.0100.0100.0
1 Definitions of “basic” and “safe” WASH facilities based on the WHO & UNICEF Joint Monitoring Programme.

Appendix C

Table A2 shows aggregated DHS data for a selection of cities across Côte d’Ivoire. These cities were selected based on their population sizes and, also, based on their role as “connectors” within the urban system of Côte d’Ivoire [23]. The data was aggregated spatially: the mean cluster-level prevalence of diarrhoea was calculated by selecting the specific clusters that were located in each city. There was no apparent association between city size (suggested here by the number of clusters comprised in each city) and the mean prevalence of diarrhoea; although Abidjan showed the highest maximum value (54.5%), small- and medium-sized towns such as Daloa and Douékoué showed much higher, mean prevalence rates of diarrhoea (26.0 and 43.8%, respectively).
Table A2. Observed prevalence of diarrhoea among children under five in a selection of Ivoirian cities.
Table A2. Observed prevalence of diarrhoea among children under five in a selection of Ivoirian cities.
LocationSize and Category 1N° ClustersMean Prevalence of Diarrhoea 2Standard Deviation of SampleRange (Min. and Max. Values)
AbidjanLarge (“global connector”)4821.413.80.0/54.5
YamoussoukroMedium (“global connector”)514.511.00.0/29.4
San PédroMedium (“global connector”)414.513.20.0/28.6
BouakéMedium-large (“regional connector”)1612.19.40.0/33.3
KorhogoMedium (“regional connector”)612.511.50.0/29.4
DaloaMedium (“regional connector”)426.07.520.0/36.8
KatiolaSmall (“local connector”)25.98.30.0/11.8
DouékouéSmall (“local connector”)143.8--
DivoSmall (“local connector”)115.4--
1 “Connector” categories based on the work by Fall and Coulibaly [23]. 2 Mean values resulting from aggregated DHS data.

Appendix D

Table A3 shows the results of the weighted OLS regression done with a pre-selection of control variables. The regression used the cluster weights provided by the DHS and was run with the 267 spatial units included in the study. The pre-selection of control variables was done based on the literature, including features that have been associated with diarrhoeal diseases (as explained in Section 2.2). Considering the extremely poor performance of variables related to climatic conditions and hygiene facilities, we opted to exclude them from the analysis. Only the variables related to water, sanitation and women’s education were retained as control variables.
Table A3. Results of the weighted OLS regression models for the pre-selected control variables (full dataset, n = 267 spatial units).
Table A3. Results of the weighted OLS regression models for the pre-selected control variables (full dataset, n = 267 spatial units).
Pre-Selected Control VariablesVariance
Inflation
Factor
Weighted OLS (DHS Cluster Weights)
R2 = 0.059/AIC = 46.45
Jarque-Bera Test for Normality of Errors: 31.090 (p < 0.001)
Breusch-Pagan Test for Heteroskedasticity: 4.152 (p = 0.656)
Coef.SEProb.
Constant-0.4480.0790.000
% of the population with access to basic water facilities 11.4680.0740.0670.270
% of the population with access to basic sanitation facilities 11.970−0.2540.0760.001
% of the population with access to safe hygiene facilities 11.363−0.0340.0540.535
% of the female population who never went to school1.744−0.2610.0790.001
Mean accumulated precipitation (monthly values) in 20121.2600.0490.0630.436
Mean maximal temperature (monthly values) in 20121.1650.00080.0910.993
1 Definitions of “basic” and “safe” WASH facilities based on the WHO & UNICEF Joint Monitoring Programme.

References

  1. Troeger, C.; Blacker, B.F.; Khalil, I.A.; Rao, P.C.; Cao, S.; Zimsen, S.R.; Albertson, S.B.; Stanaway, J.D.; Deshpande, A.; Abebe, Z.; et al. Estimates of the global, regional, and national morbidity, mortality, and aetiologies of diarrhoea in 195 countries: A systematic analysis for the Global Burden of Disease Study 2016. Lancet Infect. Dis. 2018, 18, 1211–1228. [Google Scholar] [CrossRef] [Green Version]
  2. WHO. Water, Sanitation, Hygiene and Health: A Primer for Health Professionals; World Health Organization: Geneva, Switzerland, 2019; p. 36. Available online: https://www.who.int/publications/i/item/WHO-CED-PHE-WSH-19.149 (accessed on 17 May 2022).
  3. Prüss, A.; Kay, D.; Fewtrell, L.; Bartram, J. Estimating the burden of disease from water, sanitation, and hygiene at a global level. Environ. Health Perspect. 2002, 110, 6. [Google Scholar] [CrossRef]
  4. Koné, B.; Doumbia, M.; Sy, I.; Dongo, K.; Agbo-Houenou, Y.; Houenou, P.V.; Fayomi, B.; Bonfoh, B.; Tanner, M.; Cissé, G. Étude des diarrhées en milieu périurbain à Abidjan par l’approche écosanté. VertigO 2014, 19, 14976. [Google Scholar] [CrossRef]
  5. Wolf, J.; Prüss-Ustün, A.; Cumming, O.; Bartram, J.; Bonjour, S.; Cairncross, S.; Clasen, T.; Colford, J.M.; Curtis, V.; De France, J.; et al. Systematic review: Assessing the impact of drinking water and sanitation on diarrhoeal disease in low- and middle-income settings: Systematic review and meta-regression. Trop. Med. Int. Health 2014, 19, 928–942. [Google Scholar] [CrossRef]
  6. Thiam, S.; Cissé, G.; Stensgaard, A.-S.; Niang-Diène, A.; Utzinger, J.; Vounatsou, P. Bayesian conditional autoregressive models to assess spatial patterns of diarrhoea risk among children under the age of 5 years in Mbour, Senegal. Geospat. Health 2019, 14, 321–328. [Google Scholar] [CrossRef] [PubMed]
  7. Reiner, R.C.; Wiens, K.E.; Deshpande, A.; Baumann, M.M.; Lindstedt, P.A.; Blacker, B.F.; Troeger, C.E.; Earl, L.; Munro, S.B.; Abate, D.; et al. Mapping geographical inequalities in childhood diarrhoeal morbidity and mortality in low-income and middle-income countries, 2000–2017: Analysis for the Global Burden of Disease Study 2017. Lancet 2020, 395, 1779–1801. [Google Scholar] [CrossRef]
  8. Galea, S.; Vlahov, D. Urban health: Evidence, challenges, and directions. Annu. Rev. Public Health 2005, 26, 341–365. [Google Scholar] [CrossRef] [Green Version]
  9. Ezeh, A.; Oyebode, O.; Satterthwaite, D.; Chen, Y.-F.; Ndugwa, R.; Sartori, J.; Mberu, B.; Melendez-Torres, G.J.; Haregu, T.; Watson, S.I.; et al. The history, geography, and sociology of slums and the health problems of people who live in slums. Lancet 2017, 389, 547–558. [Google Scholar] [CrossRef]
  10. Mattioli, M.C.M.; Davis, J.; Boehm, A.B. Hand-to-Mouth Contacts Result in Greater Ingestion of Feces than Dietary Water Consumption in Tanzania: A Quantitative Fecal Exposure Assessment Model. Environ. Sci. Technol. 2015, 49, 1912–1920. [Google Scholar] [CrossRef]
  11. Turner, M.G.; Gardner, R.H. Landscape Ecology in Theory and Practice: Pattern and Process, 2nd ed.; Springer: New York, NY, USA, 2015; ISBN 978-1-4939-2793-7. [Google Scholar]
  12. Collins, J.P.; Kinzig, A.; Grimm, N.B.; Fagan, W.F.; Hope, D.; Wu, J.; Borer, E.T. A New Urban Ecology: Modeling human communities as integral parts of ecosystems poses special problems for the development and testing of ecological theory. Am. Sci. 2000, 88, 416–425. [Google Scholar]
  13. Forman, R.T.T.; Godron, M. Patches and Structural Components for a Landscape Ecology. BioScience 1981, 31, 733–740. [Google Scholar] [CrossRef]
  14. Bosch, M.; Chenal, J.; Joost, S. Addressing Urban Sprawl from the Complexity Sciences. Urban Sci. 2019, 3, 3020060. [Google Scholar] [CrossRef] [Green Version]
  15. Baptista, E.A.; Queiroz, B.L. Spatial analysis of mortality by cardiovascular disease in the adult population: A study for brazilian micro-regions between 1996 and 2015. Spat. Demogr. 2019, 7, 83–101. [Google Scholar] [CrossRef]
  16. Arino, J. Spatio-temporal spread of infectious pathogens of humans. Infect. Dis. Model. 2017, 2, 218–228. [Google Scholar] [CrossRef] [PubMed]
  17. Araujo, R.V.; Albertini, M.R.; Costa-da-Silva, A.L.; Suesdek, L.; Franceschi, N.C.S.; Bastos, N.M.; Katz, G.; Cardoso, V.A.; Castro, B.C.; Capurro, M.L.; et al. São Paulo urban heat islands have a higher incidence of dengue than other urban areas. Braz. J. Infect. Dis. 2015, 19, 146–155. [Google Scholar] [CrossRef] [Green Version]
  18. Rossi, J.-P.; Kadaouré, I.; Godefroid, M.; Dobigny, G. Landscape epidemiology in urban environments: The example of rodent-borne Trypanosoma in Niamey, Niger. Infect. Genet. Evol. 2018, 63, 307–315. [Google Scholar] [CrossRef]
  19. WHO. Urban HEART: Urban Health Equity Assessment and Response Tool; World Health Organization, The WHO Centre for Health Development: Kobe, Japan, 2010; ISBN 978-92-4-150014-2.
  20. Elsey, H.; Thomson, D.R.; Lin, R.Y.; Maharjan, U.; Agarwal, S.; Newell, J. Addressing inequities in urban health: Do decision-makers have the data they need? Report from the urban health data special session at International Conference on Urban Health Dhaka 2015. J. Urban Health 2016, 93, 526–537. [Google Scholar] [CrossRef] [Green Version]
  21. OECD; Sahel and West Africa Club. Africa’s Urbanisation Dynamics 2020: Africapolis, Mapping a New Urban Geography; West African Studies; OECD: Paris, France, 2020; ISBN 978-92-64-57958-3. [Google Scholar]
  22. Vos, T.; Lim, S.S.; Abbafati, C.; Abbas, K.M.; Abbasi, M.; Abbasifard, M.; Abbasi-Kangevari, M.; Abbastabar, H.; Abd-Allah, F.; Abdelalim, A.; et al. Global burden of 369 diseases and injuries in 204 countries and territories, 1990–2019: A systematic analysis for the Global Burden of Disease Study 2019. Lancet 2020, 396, 1204–1222. [Google Scholar] [CrossRef]
  23. Fall, M.; Coulibaly, S. (Eds.) Diversified Urbanization: The Case of Côte d’Ivoire; World Bank: Washington, DC, USA, 2016; ISBN 978-1-4648-0809-8. [Google Scholar]
  24. Chenal, J. La Ville Ouest-Africaine: Modèles de Planification de L’espace Urbain; Vuesdensemble; Metispresses: Genève, Switzerland, 2013; ISBN 978-2-940406-18-0. [Google Scholar]
  25. WHO; UNICEF. WHO/UNICEF Joint Monitoring Programme for Water Supply, Sanitation and Hygiene. Available online: https://washdata.org/data/household#!/ (accessed on 11 May 2022).
  26. UN-Habitat. World Cities Report 2016. Urbanization and Development: Emerging Futures; World Cities Report; UN-Habitat: Nairobi, Kenya, 2016; ISBN 978-92-1-133395-4. [Google Scholar]
  27. Mberu, B.U.; Haregu, T.N.; Kyobutungi, C.; Ezeh, A.C. Health and health-related indicators in slum, rural, and urban communities: A comparative analysis. Glob. Health Action 2016, 9, 33163. [Google Scholar] [CrossRef]
  28. DHS. The DHS Program—Available Datasets. Available online: https://dhsprogram.com/data/available-datasets.cfm (accessed on 11 May 2022).
  29. INS [Côte d’Ivoire]; ICF International. Demographic and Health Survey 2011–2012 [Dataset]. CIGE61FL.SHP; Institut National de la Statistique-INS/Côte d’Ivoire and ICF International [Producers]; ICF International [Distributor]: Calverton, MD, USA, 2013. [Google Scholar]
  30. INS [Côte d’Ivoire]; ICF International. Demographic and Health Survey 2011–2012 [Dataset]. CIHR62FL.DTA; Institut National de la Statistique-INS/Côte d’Ivoire and ICF International [Producers]; ICF International [Distributor]: Calverton, MD, USA, 2013. [Google Scholar]
  31. INS [Côte d’Ivoire]; ICF International. Demographic and Health Survey 2011–2012 [Dataset]. CIIR62FL.DTA; Institut National de la Statistique-INS/Côte d’Ivoire and ICF International [Producers]; ICF International [Distributor]: Calverton, MD, USA, 2013. [Google Scholar]
  32. INS [Côte d’Ivoire]; ICF International. Demographic and Health Survey 2011–2012 [Dataset]. CIKR62FL.DTA; Institut National de la Statistique-INS/Côte d’Ivoire and ICF International [Producers]; ICF International [Distributor]: Calverton, MD, USA, 2013. [Google Scholar]
  33. Abatzoglou, J.T.; Dobrowski, S.Z.; Parks, S.A.; Hegewisch, K.C. TerraClimate, a high-resolution global dataset of monthly climate and climatic water balance from 1958–2015. Sci. Data 2018, 5, 170191. [Google Scholar] [CrossRef] [Green Version]
  34. Carlton, E.J.; Woster, A.P.; DeWitt, P.; Goldstein, R.S.; Levy, K. A systematic review and meta-analysis of ambient temperature and diarrhoeal diseases. Int. J. Epidemiol. 2016, 45, 117–130. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. NASA. Earth at Night. Available online: https://earthobservatory.nasa.gov/features/NightLights/page3.php (accessed on 11 May 2022).
  36. ESA. Land Cover CCI Product User Guide Version 2. 2017. Available online: http://maps.elie.ucl.ac.be/CCI/viewer/download/ESACCI-LC-Ph2-PUGv2_2.0.pdf (accessed on 11 May 2022).
  37. Stevens, F.R.; Gaughan, A.E.; Linard, C.; Tatem, A.J. Disaggregating census data for population mapping using random forests with remotely-sensed and ancillary data. PLoS ONE 2015, 10, e0107042. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  38. Friesen, J.; Friesen, V.; Dietrich, I.; Pelz, P.F. Slums, space, and state of health—a link between settlement morphology and health data. Int. J. Environ. Res. Public. Health 2020, 17, 2022. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  39. Bosch, M. PyLandStats: An open-source Pythonic library to compute landscape metrics. PLoS ONE 2019, 14, e0225734. [Google Scholar] [CrossRef] [Green Version]
  40. Perry, M.; Goodman, S.; Özak, Ö.; Smith, T.J.; Rykov, D.; Taylor, S.; Collins, B.; Valgur, M.; Amdal, H.; Pounder, N.; et al. Rasterstats. Available online: https://github.com/perrygeo/python-rasterstats (accessed on 13 May 2022).
  41. Gillies, S.; Perry, M.; Wurster, K.; Ward, B.; Jonas; Snow, A.D.; McBride, J.; Talbert, C.; Seglem, E.; Fitzsimmons, S.; et al. Rasterio: Geospatial Raster I/O for Python Programmers. Available online: https://github.com/rasterio/rasterio (accessed on 13 May 2022).
  42. Wasser, L.; Joseph, M.; McGlinchy, J.; Palomino, J.; Korinek, N.; Holdgraf, C.; Head, T. EarthPy: A Python package that makes it easier to explore and plot raster and vector data using open source Python tools. J. Open Source Softw. 2019, 4, 1886. [Google Scholar] [CrossRef] [Green Version]
  43. Gillies, S.; Buffat, R.; Arnott, J.; Wurster, K.; Cochran, M.; Perry, M.; de Andrade, E.S.; Jordahl, K.; Young, P.; Norris, S.; et al. Fiona. Available online: https://github.com/Toblerity/Fiona (accessed on 13 May 2022).
  44. McKinney, W. Data structures for statistical computing in Python. In Proceedings of the 9th Python in Science Conference, Austin, TX, USA, 28 June–3 July 2010; pp. 56–61. [Google Scholar]
  45. Jordahl, K.; den Bossche, J.V.; Fleischmann, M.; McBride, J.; Wasserman, J.; Gerard, J.; Badaracco, A.G.; Snow, A.D.; Tratner, J.; Perry, M.; et al. Geopandas/Geopandas: v0.9.0. Available online: https://zenodo.org/record/4569086 (accessed on 17 May 2022).
  46. Harris, C.R.; Millman, K.J.; van der Walt, S.J.; Gommers, R.; Virtanen, P.; Cournapeau, D.; Wieser, E.; Taylor, J.; Berg, S.; Smith, N.J.; et al. Array programming with NumPy. Nature 2020, 585, 357–362. [Google Scholar] [CrossRef]
  47. Seabold, S.; Perktold, J. Statsmodels: Econometric and statistical modeling with Python. In Proceedings of the the 9th Python in Science Conference (SciPy 2010), Austin, TX, USA, 28 June–3 July 2010; pp. 92–96. [Google Scholar]
  48. Rey, S.J.; Anselin, L. PySAL: A Python library of spatial analytical methods. Rev. Reg. Stud. 2007, 37, 5–27. [Google Scholar] [CrossRef]
  49. Hunter, J.D. Matplotlib: A 2D graphics environment. Comput. Sci. Eng. 2007, 9, 90–95. [Google Scholar] [CrossRef]
  50. Waskom, M. Seaborn: Statistical data visualization. J. Open Source Softw. 2021, 6, 3021. [Google Scholar] [CrossRef]
  51. Grace, K.; Nagle, N.N.; Burgert-Brucker, C.R.; Rutzick, S.; Van Riper, D.C.; Dontamsetti, T.; Croft, T. Integrating environmental context into DHS analysis while protecting participant confidentiality: A new remote sensing method. Popul. Dev. Rev. 2019, 45, 197–218. [Google Scholar] [CrossRef]
  52. UNICEF; WaterAid; WSUP. Female-Friendly Public and Community Toilets: A Guide for Planners and Decision Makers; WaterAid: London, UK, 2018; p. 56. Available online: https://washmatters.wateraid.org/publications/female-friendly-public-and-community-toilets-a-guide-for-planners-and-decision-makers (accessed on 17 May 2022).
  53. Mulatya, D.M.; Ochieng, C. Disease burden and risk factors of diarrhoea in children under five years: Evidence from Kenya’s demographic health survey 2014. Int. J. Infect. Dis. 2020, 93, 359–366. [Google Scholar] [CrossRef] [PubMed]
  54. Manesh, A.O.; Sheldon, T.A.; Pickett, K.E.; Carr-Hill, R. Accuracy of child morbidity data in demographic and health surveys. Int. J. Epidemiol. 2008, 37, 194–200. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  55. Rawlings, J.O.; Pantula, S.G.; Dickey, D.A. Applied Regression Analysis: A Research Tool, 2nd ed.; Springer Texts in Statistics; Springer: New York, NY, USA, 1998; ISBN 978-0-387-98454-4. [Google Scholar]
  56. Nilima, N.; Kamath, A.; Shetty, K.; Unnikrishnan, B.; Kaushik, S.; Rai, S.N. Prevalence, patterns, and predictors of diarrhea: A spatial-temporal comprehensive evaluation in India. BMC Public Health 2018, 18, 1288. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  57. Rey, S.J.; Arribas-Bel, D.; Wolf, L.J. Geographic Data Science with Python [Open Book]. Available online: https://geographicdata.science/book/intro.html# (accessed on 14 June 2022).
  58. Anselin, L. Spatial regression. In The SAGE Handbook of Spatial Analysis; Fotheringham, A.S., Rogerson, P., Eds.; SAGE Publications: Los Angeles, CA, USA; London, UK, 2009; pp. 255–275. ISBN 978-1-4129-1082-8. [Google Scholar]
  59. Wurm, M.; Taubenböck, H. Detecting social groups from space–assessment of remote sensing-based mapped morphological slums using income data. Remote Sens. Lett. 2018, 9, 41–50. [Google Scholar] [CrossRef]
  60. Wang, J.; Georganos, S.; Kuffer, M.; Abascal, A.; Vanhuysse, S. On the knowledge gain of urban morphology from space. Comput. Environ. Urban Syst. 2022, 95, 101831. [Google Scholar] [CrossRef]
  61. Pickering, A.J.; Null, C.; Winch, P.J.; Mangwadu, G.; Arnold, B.F.; Prendergast, A.J.; Njenga, S.M.; Rahman, M.; Ntozini, R.; Benjamin-Chung, J.; et al. The WASH Benefits and SHINE trials: Interpretation of WASH intervention effects on linear growth and diarrhoea. Lancet Glob. Health 2019, 7, e1139–e1146. [Google Scholar] [CrossRef] [Green Version]
  62. Njoh, A.J. Urbanization and development in sub-Saharan Africa. Cities 2003, 20, 167–174. [Google Scholar] [CrossRef]
  63. Rydin, Y.; Bleahu, A.; Davies, M.; Dávila, J.D.; Friel, S.; De Grandis, G.; Groce, N.; Hallal, P.C.; Hamilton, I.; Howden-Chapman, P.; et al. Shaping cities for health: Complexity and the planning of urban environments in the 21st century. Lancet 2012, 379, 2079–2108. [Google Scholar] [CrossRef] [Green Version]
  64. Nana, N.; Kouakou, A.B.; Kougouindiga, A. Chroniques D’investissements dans le Logement en Côte d’Ivoire; Centre for Affordable Housing Finance in Africa: Johannesburg, South Africa, 2019; p. 34. Available online: https://housingfinanceafrica.org/fr/documents/chroniques-dinvestissement-du-logement-cas-de-la-cote-divoire/ (accessed on 13 May 2022).
  65. Morgenstern, H. Ecologic studies in epidemiology: Concepts, principles, and methods. Annu. Rev. Public Health 1995, 16, 61–81. [Google Scholar] [CrossRef]
  66. Møller, A.P.; Jennions, M.D. How much variance can be explained by ecologists and evolutionary biologists? Oecologia 2002, 132, 492–500. [Google Scholar] [CrossRef]
  67. Mari, L.; Bertuzzo, E.; Righetto, L.; Casagrandi, R.; Gatto, M.; Rodriguez-Iturbe, I.; Rinaldo, A. Modelling cholera epidemics: The role of waterways, human mobility and sanitation. J. R. Soc. Interface 2012, 9, 376–388. [Google Scholar] [CrossRef] [PubMed]
  68. G’Sell, M.G.; Wager, S.; Chouldechova, A.; Tibshirani, R. Sequential selection procedures and false discovery rate control. J. R. Stat. Soc. 2016, 78, 423–444. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Buffer areas (red circles generated from DHS clusters’ centroids) were used as reference areas to calculate landscape metrics. Elaborated by the authors with QGIS, from: DHS [28] and ESA Land Cover CCI [36]. © ESA Climate Change Initiative—Land Cover led by UCLouvain (2017).
Figure 1. Buffer areas (red circles generated from DHS clusters’ centroids) were used as reference areas to calculate landscape metrics. Elaborated by the authors with QGIS, from: DHS [28] and ESA Land Cover CCI [36]. © ESA Climate Change Initiative—Land Cover led by UCLouvain (2017).
Ijerph 19 07677 g001
Figure 2. Layers used to reclassify urban patches based on demographic density and night illumination (zoom on Abidjan’s metropolitan area). Elaborated by the authors with QGIS, from: NASA [35], ESA Land Cover CCI [36] and WorldPop [37]. © ESA Climate Change Initiative—Land Cover led by UCLouvain (2017).
Figure 2. Layers used to reclassify urban patches based on demographic density and night illumination (zoom on Abidjan’s metropolitan area). Elaborated by the authors with QGIS, from: NASA [35], ESA Land Cover CCI [36] and WorldPop [37]. © ESA Climate Change Initiative—Land Cover led by UCLouvain (2017).
Ijerph 19 07677 g002
Figure 3. Reclassified urban patches (zoom on Abidjan’s metropolitan area). Elaborated by the authors with QGIS, from: NASA [35], ESA Land Cover CCI [36] and WorldPop [37]. © ESA Climate Change Initiative—Land Cover led by UCLouvain (2017).
Figure 3. Reclassified urban patches (zoom on Abidjan’s metropolitan area). Elaborated by the authors with QGIS, from: NASA [35], ESA Land Cover CCI [36] and WorldPop [37]. © ESA Climate Change Initiative—Land Cover led by UCLouvain (2017).
Ijerph 19 07677 g003
Figure 4. Location of DHS clusters, by subset. Elaborated by the authors with QGIS, from: DHS [28], GADM and OpenStreetMap.
Figure 4. Location of DHS clusters, by subset. Elaborated by the authors with QGIS, from: DHS [28], GADM and OpenStreetMap.
Ijerph 19 07677 g004
Figure 5. Relation between dense, precarious urban areas (x-axis) and the prevalence of diarrhoea (y-axis), by level of urbanisation (urban subset shown in orange). Elaborated by the authors.
Figure 5. Relation between dense, precarious urban areas (x-axis) and the prevalence of diarrhoea (y-axis), by level of urbanisation (urban subset shown in orange). Elaborated by the authors.
Ijerph 19 07677 g005
Figure 6. Relationship between the size of the urbanised area (x-axis) and access to basic water facilities (y-axis), by level of urbanisation (urban subset shown in red). Elaborated by the authors.
Figure 6. Relationship between the size of the urbanised area (x-axis) and access to basic water facilities (y-axis), by level of urbanisation (urban subset shown in red). Elaborated by the authors.
Ijerph 19 07677 g006
Figure 7. Relationship between the size of the urbanised area (x-axis) and access to basic sanitation facilities (y-axis), by level of urbanisation (urban subset shown in red). Elaborated by the authors.
Figure 7. Relationship between the size of the urbanised area (x-axis) and access to basic sanitation facilities (y-axis), by level of urbanisation (urban subset shown in red). Elaborated by the authors.
Ijerph 19 07677 g007
Figure 8. Relationship between the size of the urbanised area (x-axis) and the percentage of women without any education (y-axis), by level of urbanisation (urban subset shown in red). Elaborated by the authors.
Figure 8. Relationship between the size of the urbanised area (x-axis) and the percentage of women without any education (y-axis), by level of urbanisation (urban subset shown in red). Elaborated by the authors.
Ijerph 19 07677 g008
Table 1. List of open-access datasets used to explore relations between landscape and diarrhoea in Côte d’Ivoire.
Table 1. List of open-access datasets used to explore relations between landscape and diarrhoea in Côte d’Ivoire.
Data LayerSourceDescriptionAvailable YearsSpatial
Resolution
Type
Geolocation of DHS clusterDHSCluster location with a geographic blur of 2 to 5 km1998/1999
2011/2012
2 to 5 kmVector
(shp 1)
Cases of diarrhoeaDHSCases of diarrhoea (under-5), geocoded to cluster location1998/1999
2011/2012
2 to 5 kmVector
(table)
Access to water and sanitationDHSType of facility used by household, geocoded to cluster location1998/1999
2011/2012
2 to 5 kmVector
(table)
Education attainmentDHSEducation attainment of women (15–49 years), geocoded to cluster location1998/1999
2011/2012
2 to 5 kmVector
(table)
Climatic conditionsTerra-climateAccumulated precipitation and mean temperature1958–20201/24th degr.
(~4 km)
Raster
Illumination (night lights)NASAIntensity of night illumination2012 & 2016500 mRaster
Land useESA Land Cover CCIDiscrete categories of land cover1992–2019300 mRaster
Population densityWorldPopEstimated demographic densities (WorldPop’s model)2000–2020100 mRaster
RoadsOpenStreetMapSurveyed roads and pathways20195 to 20 mVector (shp 1)
1 shapefile.
Table 2. List of variables calculated for each spatial unit (variables aggregated by buffer area).
Table 2. List of variables calculated for each spatial unit (variables aggregated by buffer area).
VariableRole in AnalysisAggregation Operation
Prevalence of diarrhoea, under −5Dependent variable N c a s e s N p o p .     u n d e r 5
% access to basic 1 waterControl variable N p o p .     w i t h   b a s i c   w . N t o t a l   p o p .
% access to basic 1 sanitationControl variable N p o p .     w i t h   b a s i c   s . N t o t a l   p o p .
% women 2 with no educationControl variable N p o p .     w o m e n   n o   e d . N p o p .     w o m e n
Edge of land cover patches 3Independent variablesTotal length (m) of edges of given land cover
Shape index of land cover patches 3Independent variables i n P a r e a i × P s h p   i n d e x i N t o t a l   p a t c h e s
Proportion of land cover patches 3Independent variables N p i x e l s   o f   g i v e n   l a n d   c o v e r N t o t a l   p i x e l s
% dense urban areasIndependent variable N u r b .     p i x .   v a l u e > m e d i a n   u r b .   d e n . N t o t a l   u r b a n   p i x e l s
% precarious urban areasIndependent variable N p r e c a r i o u s   u r b .     p i x e l s N t o t a l   u r b a n   p i x e l s
Km of roads per urban areaIndependent variable L e n g t h   R d k m U r b a n   A r e a p i x e l s
1 As defined by the WHO and UNICEF Joint Monitoring Programme. 2 Aged between 15 and 49 years. 3 Applied to all land cover categories.
Table 3. Regression results of four models for the full dataset (n = 267 spatial units), by model type.
Table 3. Regression results of four models for the full dataset (n = 267 spatial units), by model type.
Included
Features
Variance Inflation FactorUnweighted OLS
R2 = 0.06/AIC = −50.4
JB 2: 12.020 (p = 0.003)
BP 3: 4.880 (p = 0.300)
Weighted OLS
R2 = 0.129/AIC = 21.75
JB 2: 10.676 (p = 0.005)
BP 3: 2.881 (p = 0.578)
Spatial Lag
Pseudo R2 = 0.09/AIC = −56.3
JB 2: 11.407 (p = 0.003)
BP 3: 5.807 (p = 0.214)
Spatial Error
Pseudo R2 = 0.059/AIC = −58.2
JB 2: 11.859 (p = 0.003)
BP 3: 4.693 (p = 0.320)
Coef.SEProb.Coef.SEProb.Coef.SEProb.Coef.SEProb.
Constant-0.4260.0700.0000.4600.0660.0000.2970.0760.0000.3660.0730.000
% basic water1.3720.0010.0630.9860.0140.0620.8200.0320.0610.6060.0490.0640.439
% basic sanitation1.810−0.1860.0690.007−0.2570.0690.000−0.1810.0670.007−0.1660.0680.015
% women with no ed.1.674−0.1410.0750.061−0.2210.0760.004−0.1230.0730.092−0.1080.0740.148
Dense, prec. areas 11.0740.2570.0810.0020.2910.0620.0000.2270.0810.0050.2750.0940.004
1 Selected landscape metric: proportion of dense, precarious urban areas (total patch area/buffer area). 2 Jarque-Bera test for normality of errors. 3 Breusch-Pagan test for heteroskedasticity. Bold characters indicate that the association is significant (p < 0.05).
Table 4. Regression results of four models for the urban subset (n = 105 spatial units), by model type.
Table 4. Regression results of four models for the urban subset (n = 105 spatial units), by model type.
Included
Features
Variance Inflation FactorUnweighted OLS
R2 = 0.141/AIC = −14.04
JB 2: 4.316 (p = 0.116)
BP 3: 4.444 (p = 0.349)
Weighted OLS
R2 = 0.196/AIC = 23.56
JB 2: 6.395 (p = 0.041)
BP 3: 7.101 (p = 0.131)
Spatial Lag
Pseudo R2 = 0.141/AIC = −12.05
JB 2: 4.364 (p = 0.113)
BP 3: 4.391 (p = 0.356)
Spatial Error
Pseudo R2 = 0.141/AIC = −14.09
JB 2: 4.285 (p = 0.117)
BP 3: 4.424 (p = 0.352)
Coef.SEProb.Coef.SEProb.Coef.SEProb.Coef.SEProb.
Constant-0.2820.1370.0420.3250.1570.0400.2900.1430.0430.2750.1340.041
% basic water1.1720.0820.1180.4880.1120.1360.4120.0810.1150.4830.0890.1150.439
% basic sanitation1.769−0.1680.1070.119−0.2520.1070.021−0.1670.1050.111−0.1680.1050.110
% women with no ed.1.810−0.0130.1270.917−0.0880.1360.520−0.0120.1240.925−0.0120.1240.922
Dense, prec. areas 11.0290.3150.0900.0010.3160.0790.0000.3180.0930.0010.3180.0900.000
1 Selected landscape metric: proportion of dense, precarious urban areas (total patch area/buffer area). 2 Jarque-Bera test for normality of errors. 3 Breusch-Pagan test for heteroskedasticity. Bold characters indicate that the association is significant (p < 0.05).
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Pessoa Colombo, V.; Chenal, J.; Koné, B.; Bosch, M.; Utzinger, J. Using Open-Access Data to Explore Relations between Urban Landscapes and Diarrhoeal Diseases in Côte d’Ivoire. Int. J. Environ. Res. Public Health 2022, 19, 7677. https://doi.org/10.3390/ijerph19137677

AMA Style

Pessoa Colombo V, Chenal J, Koné B, Bosch M, Utzinger J. Using Open-Access Data to Explore Relations between Urban Landscapes and Diarrhoeal Diseases in Côte d’Ivoire. International Journal of Environmental Research and Public Health. 2022; 19(13):7677. https://doi.org/10.3390/ijerph19137677

Chicago/Turabian Style

Pessoa Colombo, Vitor, Jérôme Chenal, Brama Koné, Martí Bosch, and Jürg Utzinger. 2022. "Using Open-Access Data to Explore Relations between Urban Landscapes and Diarrhoeal Diseases in Côte d’Ivoire" International Journal of Environmental Research and Public Health 19, no. 13: 7677. https://doi.org/10.3390/ijerph19137677

APA Style

Pessoa Colombo, V., Chenal, J., Koné, B., Bosch, M., & Utzinger, J. (2022). Using Open-Access Data to Explore Relations between Urban Landscapes and Diarrhoeal Diseases in Côte d’Ivoire. International Journal of Environmental Research and Public Health, 19(13), 7677. https://doi.org/10.3390/ijerph19137677

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop