1. Introduction
It is expected that 66% of the world population will be urban by 2050 [
1]. This on-going urbanization process entails major environmental and socioeconomic impacts that compromise the function and stability of cities. As complex systems, cities generate interesting patterns and trends that are neither easily described nor predicted [
2]. Systems that produce complexity consist of diverse rule-following entities of which the behaviors are interdependent [
3,
4]: Those entities interact over a contact structure or network, and they often adapt (i.e., by learning in a social system or by natural selection in an ecological one) [
5]. Due to their adaptive behavior and complexity, cities have been also identified as centers for economic development, productivity, creativity, innovation, and cultural transformation [
6,
7,
8]. In this scenario, understanding cities’ complexity will play a key role in facing current and future sustainability challenges at the regional and global scales [
9,
10,
11].
One of these challenges will be to improve the resilience of cities [
12,
13,
14,
15,
16], and this is not straightforward since one action may not give the expected transformation of the complex system. Resilience refers to the ability of a system to adapt to a modifying process while retaining its functionality and not necessarily returning to the previous equilibrium state [
17,
18,
19]. This definition comes from the non-equilibrium paradigm of ecology [
17] and fits the complexity and uncertainty inherent to cities, as it includes adaptation as a key process that cannot be easily predicted. Specifically, urban resilience has been defined by Meerow et al. [
16] as “the ability of an urban system—and all its constituent socio-ecological and socio-technical networks across temporal and spatial scales—to maintain or rapidly return to desired functions in the face of a disturbance, to adapt to change, and to quickly transform systems that limit current or future adaptive capacity”. This definition comprises the three pathways to a resilient state: persistence, transition, and transformation [
20]. To some extent, measuring resilience is tightly related to inspecting the property of the system to remain functional over time—from persistence to adaptation—and hence, to sustain itself. In any case, there are five questions that need to be addressed during the planning of effective resilience policy in a city: Who? Resilience for whom? Who benefits with that strategy?; What? Which part of the city is going to be more resilient?; When? Is the policy oriented to face short-term disruptions or long-term stress?; Where? Is the policy limited to a spatial scale? Is it portraying cross-scalar interactions?; and Why? What is the goal of the program [
21].
We are interested in the relationship between diversity and resilience in the urban system and in the effects of diversity fostering resilience, basically, in the development of a statistical framework to evaluate urban systems, which may be helpful in the formulation of urban policy and in answering these questions. When we speak about diversity, we refer to any of the three characteristics of a population [
5]: (1) diversity within a type or variation in some attribute (such as differences in the length of finches’ beaks); (2) diversity of types and kinds or species in biological systems (such as different types of stores in a mall); and (3) diversity of composition, which refers to differences in configuration (such as different connections between atoms in a molecule). The diversity concept has characterized populations or collections of entities like ecosystems, with multiple types of flora and fauna, but it is also suitable to be applied in cities, having different types of people, organizations, infrastructural systems, etc. Regardless that the influence of diversity in the stability of a system is still a controversial issue [
5,
22,
23], a number of studies in socio-ecological systems have pointed out a positive correlation between diversity and resilience [
12,
24,
25,
26,
27]. From an urban ecology perspective, it is argued that cities with higher levels of environmental, economic, and social diversity have a higher resilience capacity [
13,
14,
25]. This argument relies on the assumption that a highly diverse system possesses many different entity types and, therefore, a bunch of individuals belonging to these different types that are able to perform similar functions at a wider range of conditions—response diversity— [
13,
24]. Furthermore, it has been shown that diversity is a fundamental property that ensures the city’s functionality in the face of disturbances; it is an essential factor for economic growth, attractiveness, and liveability of cities [
6]. In the last decade, some scholars have supported this thesis by arguing that diversity fosters productivity, innovation, and therefore economic development in cities [
28,
29,
30].
From a formal perspective, the assessment and characterization of a system’s diversity have been increasingly performed by means of a multiplicity of variables instead of aggregated attributes. This has been motivated by the enormous increase of computer power and accessibility to open datasets in recent years. However, direct interpretation of multivariate information is not easy and some level of aggregation is required to display an integrated picture of the system: Aggregated measures surely reduce the amount of information, highlight patterns, and enhance the communication of results. Nevertheless, gathering all the information into one single diversity index is usually not convenient for the following reasons. Firstly, most of the methods for aggregating indicators into indices produce a disconnection between the more intuitively original variables and the quantities resulting from these indices [
31]. Secondly, aggregation also requires some form of human judgment so it relies on potentially distorting assumptions [
31,
32]. Lastly, aggregate indices (e.g.,variance, information entropy, or phylogenetic distances [
5]) barely capture interactions within complex systems [
32].
Here, we explore diversity in urban complex systems through multivariate statistical methods (i.e., Principal Component Analysis (PCA) and Multiple Factor Analysis (MFA)). These techniques are robust methodologies in the sense that they provide a moderate trade-off between the large volume of initial data and its final aggregation. In this regard, PCA [
33,
34] is probably the most widely used multivariate statistical technique. It has been applied in urban geography since the 1970s to analyze the landscape change [
35,
36,
37], but it has lately emerged as a meaningful statistical tool to understand the geographical complexity of globalization processes [
38], sustainable development [
32,
39,
40,
41], and socioeconomic resilience trends [
42]. At the urban scale, it has been used to identify urban typologies—of neighborhoods or cities—in metropolitan areas. By identifying typologies, these studies have exhibited—to some extent—the degree of diversity of the urban systems, being described by several socioeconomic variables (i.e., race, ethnicity, age, family structure, education, employment, and income) [
43,
44,
45]. MFA [
46,
47], on the other hand, is a multivariate data analysis technique—an extension of PCA—for summarizing and visualizing complex data, where variables are organized into groups of variables: It reveals an integrated picture displaying the relationship between observations and groups of variables [
33,
46,
48]. MFA has been applied mostly in sensory analysis; see for example the recent work of References [
49,
50,
51,
52]. In the urban context, few studies have applied MFA [
53,
54].
The way to measure diversity with these methods is explained by their capacity to reduce the dimensionality of the multivariate dataset by transforming the original data of correlated variables into a set of a few representative variables—known as principal components—that extract the most relevant information [
34,
55]. The objective is to visualize the relations between the system’s components, such that, the more differentiated entities, the more diversity indication. Since distanced results in the dimensionally reduced description indicate diverse attributes of districts as described by the multivariate system of indicators, we may seek to expose those differentiated results: One basic visualization method is the plot of variables and individuals in relation with the principal components (i.e., biplot). Another is the quantitative measurement of such differences with Euclidean distances.
In this paper, we apply a descriptive multivariate statistical analysis (i.e., PCA and MFA) to an urban system’s abstraction of the city’s functioning. Specifically, we relate the environmental, economical, and social characters of the city of Barcelona (Spain) in a multivariate system of indicators by collecting measurements of those variables at the district scale. We analyze diversity qualitatively and quantitatively by means of the dimensionality reduction of the dataset and the recognition of interactions and patterns among districts. The main purpose of this work is to exhibit the diversity of the urban system since—in a broader sense—we are interested in its relationship with resilience. In this sense, the temporal span of the dataset—from the year 2003 to the year 2015—not only brings a wider and reliable perspective to the city’s description but also allows for a temporal analysis: It constitutes the main tool to understand diversity trends and to identify key points towards the achievement of resilience. We also apply MFA to expose relations between urban metabolism groups of variables and the principal components.
This article is organized as follows. In
Section 2, we present our case study, the multivariate dataset, and the statistical techniques that we apply to expose diversity. The results of the multivariate statistical methods are presented in
Section 3. A discussion is given next in
Section 4. The paper ends with some conclusions and future research inquiries in
Section 5.
2. Materials and Methods
In this section, we present the statistical methodology for exposing diversity in the urban system. For doing so, the case study is described in the first part of the section. Then, we introduce the main hypothesis and transform the city into an abstraction of multiple variables that allows us to quantify diversity. Finally, the implemented statistical techniques are presented in the last part of this section.
2.1. Case Study
Barcelona city—our case study—can be studied at its largest scale or as a composition of smaller divisions. The choice of the scale influences the overall interpretation of diversity: Cities may seem diverse on one scale, but when looking at another scale, it may appear homogeneous. At its largest scale, Barcelona has a population of approximately 1.6 million inhabitants living in
km
(15,865 inhabitants per km
) [
56]. We choose to represent the city at the district scale with the aim of revealing particular functions and patterns of these urban subsystems while covering efficiently most of the city’s territory.
Since 1987, Barcelona has been divided into 10 administrative districts which can be compared with neighborhoods in a common metropolitan area. Our selection of districts—and therefore, of the city’s representation—ends up with Ciutat Vella, Eixample, Les Corts, and Gràcia as our case study. These districts are chosen to characterize Barcelona since the surface covered by the case study comprises a total area of
km
and represents
of the total city’s surface, as shown in
Figure 1. With respect to population, the case study comprises 571,183 inhabitants, accounting for
of the city’s population.
We also consider the following criteria in our selection: urban form, socioeconomic characteristics of the population, age of urban development—year or period of establishment and proximity to the city center—distance to historic center, and type of urban planning—organic or planned.
The urban form refers to the physical layout of the city: It is set by the distribution of urban elements, such as streets, blocks, plots, and buildings. There are different types or urban forms: grid or orthogonal, linear, radial, and irregular. Representative samples of the urban form for each one of the districts are shown in
Figure 2, where plans are drawn at the same scale, blocks are shown in black, streets are in white, and pedestrian streets are in light brown. Ciutat Vella exhibits an organic and irregular structure—distinctive of the Middle Ages—Eixample is a planned grid structure, Les Corts recalls a radial plan, and Gràcia is based on an orthogonal structure of different size plots. Hence, a fine-grain urban fabric characterizes Ciutat Vella and Gràcia and a coarse-grain urban fabric distinguishes Eixample and Les Corts. Based on previous measurements of the urban street networks [
57,
58,
59,
60,
61,
62], we compute Shannon’s entropy of the urban structure to provide quantitative information that is useful in the diversity analysis. In this methodology, the street orientation, length of the built tract of the street, and height of the adjacent buildings are measured in the four districts’ area. Then, the values for each variable and district are arranged—by absolute frequencies—in fixed bins
: Histograms illustrate this distribution (e.g., histograms for streets’ orientation in
Figure 3). Once the measures are binned, Shannon’s entropy
is computed for each
ith distribution (e.g., the streets’ orientation in a district) following
for
.
represents a positive constant, and
is the probability of the variable falling in the
jth bin. When the probability is zero,
, the supposition
is implemented. Entropy values for each one of the
ith variables and districts are shown in
Table 1. This information entropy metric reveals the degree of order of the urban structure: The low entropy exhibited by Eixample corresponds to its grid structure with a high degree of order. Les Corts and Ciutat Vella, on the other hand, present a high entropy that displays their heterogeneous urban structure, with neighborhoods having a dissimilar urban layout not only in formal aspects but also in the size of their entities (streets, plots, blocks, and empty urban spaces, etc.).
Regarding the socioeconomic component, we consider characteristics such as employment rate, income, education, population age, and percentage of foreigners (see
Table 2 for relevant demographic data of the case study, including percentages for each district with respect to the city). The district of Ciutat Vella is the oldest one, constituting itself the historical city center. The district of Eixample was designed by Ildefons Cerdà in 1859 as part of the widely known urbanism plan called “Plan Cerdà” [
63,
64,
65]. Eixample district is also the densest district of Barcelona with a population density of 35,617 inhabitants per km
. In the case of the districts of Les Corts and Gràcia, those were former independent municipalities annexed to the city in the last decade of the 19th century after a major conurbation process. As commented before, we select these districts to be representative and to constitute the urban essence of Barcelona.
2.2. Multivariate Description of the Case Study
The urban system description of the case study is achieved through a multivariate dataset that incorporates measurements of environmental and socioeconomic variables from the years 2003 to 2015. This dataset is a matrix containing 40 indicators (variables) and 52 individuals: Each individual represents a district in a given year (e.g., “CiutatVella 2003”, “Ciutat Vella 2004”, “Eixample 2003”, etc.). To construct the multivariate description of the case study, we considered the availability of time-dependent disaggregated data in the case of the four districts. Our dataset collects the information in the open databases of Departament d’Estadística [
56] and Agencia de Salut Pública de Barcelona [
66], and it is shown in
Table 3 for the district of Ciutat Vella between 2003 and 2015. The datasets for the remaining districts (Eixample, Les Corts, and Gràcia) are presented in Appendices
Appendix A,
Appendix B and
Appendix C. This particular time-span—from 2003 to 2015—has been chosen regarding the data availability at the district scale: Before 2003, only a few indicators were reported at this scale.
Most of the variables have been measured on a per capita basis in order to neutralize the effect of districts’ population size. This fact allows us to make a comparison focused on services and consumptions for each inhabitant of a given district. However, the discussion on analyzing urban systems based on aggregated or disaggregated data, normalized either by surface or population, remains an open debate. Particularly, we consider raw values for general or disaggregated indicators: Security level in the neighborhood (A12), Natural growth rate (B1), Population density (B2), Available household income per capita (B8), Average levels of PM (C1), Average levels of NO (C2), and Average values of CO (C3).
Within the framework of urban ecology, we use the concept of urban metabolism in order to describe and classify the interactions between the different components that take place in the city. One of the main ideas of this methodology is to integrate social, environmental, and economic variables into categories that characterize the ecological structure, diversity of amenities, population pressures, and consumption of the urban system: It is intended to portray the flows of matter, information, and energy in an ecosystem. We arrange our system of variables based on some previous indexes developed within the frameworks of urban metabolism and urban sustainability [
67,
68,
69,
70], but we include some additional variables that characterize social interactions, diversity of land uses, and liveability. Following Zhang et al. [
67], we group the variables (presented in
Table 3) into four metabolic categories:
Input Supportive. This category comprises 18 indicators and portrays the natural capital of the city and the liveability and the complexity of urban services. Some indicators in this category describe the diversity of economic activities and institutions associated with each district. We refer to the land use areas and buildings: housing, commerce, offices, education, health services, sport, religion, and entertainment, among others.
Output Pressure. It includes 12 indicators that describe the pressure exerted by the economic and social subsystems over the natural subsystem. This category is mostly related to the population size (social subsystem) and its consumption demand.
Destructive Metabolism. It includes 5 indicators that reveal the environmental pollution within the urban system, particularly described by air pollution and waste generation indicators. We also include the number of traffic accidents in order to measure one particular aspect of the non-liveability character of an urban system.
Regenerative Metabolism. It comprises 5 indicators related to ecological construction upon regenerative infrastructure and environmental care.
2.3. Multivariate Statistical Analysis of the Dataset
Since there is a large amount of information comprised in the multivariate dataset, its examination is complicated; thus, statistical methods are implemented to ease the analysis of the urban system. In the first place, we display in
Figure 4 the correlation matrix among the 40 variables (that has been produced in R [
71] using the corrplot package [
72]), which is colored according to the correlation coefficient. We realize that the interactions that take place in the city between districts and indicators are far too complex, and therefore, the dimensionality reduction given by the PCA is justified. Specifically, we rely on this mathematical procedure to transform the original
—dimensional dataset space into a reduced
—dimensional component space—being the cardinality of
y lesser or equal than
x—by solving the eigenvalues problem [
55]. For the sake of concision, we omit the mathematical formulation of the eigenvalues problem of the correlation matrix (see Jolliffe [
34] for a thorough explanation on this topic). Here, we are mainly interested in the transformation given by the solution of the eigenvalue problem.
The PCA transformation leads to the description of the original dataset by means of a set of linearly independent variables, which are the so-called principal component [
34,
73]. The principal components are uncorrelated and ordered so that the first component describes most of the variance of the dataset, and each successive component describes the largest possible variance after the previous components [
34]. Therefore, if only the subset of the first
y principal components is selected,
, the dimensionality can be reduced whilst a great percentage of the variance of the original correlation matrix is maintained. The main advantage of PCA lies in its ability to reduce the dimensionality of the multivariate system while extracting its main characteristics in the dimensionally reduced description. In the case study, we reduce the 40-dimension description to only 3 dimensions per individual. The PCA calculation is conducted in R Software [
71] using the FactoMineR package [
74]; the results are displayed using the ggplot2 [
75] and factoextra [
76] packages.
Since the variables are measured in different units and the measurements for each variable can vary up to two orders of magnitude, we perform a scaling of the raw data matrix prior to the application of the PCA. The data preparation is intended to make comparable data of different units and scales such that the standardized data is scaled to have standard deviation one and mean zero.
In order to visualize the outcome of the analysis, we display the relationship among the variables (indicators) and individuals (districts) using covariance biplots [
34,
77,
78]. In biplots, we make profit of the new low-dimensionality of the system by plotting individuals and variables against the three selected principal components, which are placed at the axes of the plots [
77]. Using biplots and loadings (
Table 4), we identify correlations (direct, inverse, or null) between variables and principal components such that a conceptual meaning is given to each principal component in a process that we call “labelling”. Individuals can be analyzed in their relative positions to the labeled principal components. This will be further exemplified in the next sections.
Also, distance measures can be used to quantify the differences—dissimilarities—between individuals. We perform some statistics of the districts’ performance, where districts’ positions are accounted as events: Each district’s location is accounted as a single observation such that the average value of the districts’ positions—or centroids—can be used as a representative location. We propose to calculate the component-wise and Euclidean distances between districts’ centroids in order to quantify diversity. Thus, the next step of analyzing diversity fostering resilience can be done based on quantitative results.
Furthermore, we implement MFA [
46] for summarizing and visualizing urban metabolism categories and their relation with the principal components’ description of districts. Particularly, MFA is an extension of PCA that takes advantage of the categorical groups of variables by weighting—or balancing—the contribution of the variables in each group [
48]. Conclusions on the metabolism of the city can be addressed by understanding the contribution of variables in each urban metabolism category to the principal component’s description of districts and, hence, of the city.
4. Discussion
Now, we interpret the comprised information in the dimensionally reduced description by considering the districts’ performance. Since the principal components have been labeled, in the first part of this section, we read the description of each district: Urban diversity can be recognized and quantified using the results of the previous section. Then, we evaluate the metabolic performance of districts by correlating individuals, principal components, and metabolic groups of variables, as given by the MFA results. We further discuss the link between urban diversity and resilience. Finally, we analyze the time-dependent evolution of the districts and the possible increase—or decrease—of urban diversity.
4.1. Urban Diversity
We follow the previous studies in References [
79,
80,
81,
82,
83,
84], which have taken advantage of biplot graphs to display diversity in multiple systems. Using the PCA representation (i.e., biplots), we perform a descriptive analysis of diversity with respect to principal components. To achieve this reading, we determine each district’s performance [
78] (positions in biplots of
Figure 5,
Figure 6 and
Figure 7) and quantify diversity as the Euclidean distance [
85] between districts’ centroids.
The first notable characteristic of the districts’ performance in the PC1-Social Background is that this component separates the districts of Ciutat Vella and Les Corts roughly by 10.5 units of distance (see
Table 8), one unit more than the distance between Ciutat Vella and Eixample (9.7), and four units more than distance between Ciutat Vella and Gràcia (6.7). The districts of Eixample and Les Corts are closely together (i.e., the distance between them is of 0.75), and relatively near to Gràcia (being 3.0 units apart from Eixample and 3.8 from Les Corts). Certainly, differentiated social interactions for Ciutat Vella are identifiable from the raw data: This district owns community amenities for the whole city, and in this sense, it exhibits high loads on the indicators related to religious and entertainment buildings, exclusive pedestrian areas, market stalls, and cultural and sport centers, among others. This can be explained because, in the last decade, an intensive culture-based program of urban regeneration has been developed in the district—mainly in The Raval, one of the neighborhoods of Ciutat Vella—in order to face urban marginality and social stigmatization, constituting itself a center for culture and creativity [
86]. On the other hand, Les Corts—and Eixample, to a lesser extent—are mainly residential districts, which are characterized, among other features, by higher income, higher security level, a large number of vehicles, and large parking areas. Therefore, PCA results agree with the actual nature of the districts, where districts are differentiated by the quantity and quality of social interactions associated with the diversity of land uses and functional typologies.
Figure 6 offers a remarkable characteristic, it clearly shows the emergence of “two Barcelonas”, with PC2-Ecological Background drawing a division between two groups of districts: Eixample and Gràcia on one hand (
Figure 6, left) and Les Corts on the other (
Figure 6, right), with an average distance of eight units between them. Les Corts has a good performance in the indicators related to the quality of public spaces and with green and recreational areas. Particularly, this district is defined by the following indicators: urban parks (A17), urban green areas (A18), educational buildings (A4), sports area (A6), playgrounds (A14), and street trees (A16). We observe that these indicators are inversely correlated with air pollution indicators (see
Figure 4). In contrast, the group of districts concerning Eixample and Gràcia is described mainly by the average levels of PM
, NO
, and CO. Therefore, Eixample and Gràcia exhibit a great flow of pollutants—compared to all the districts of the city—is linked to high population density (B2), and lacks green and recreational spaces (see the correlation in
Figure 4). According to Barcelona City Council [
87], the road transport activity in 2013 was responsible for 67.6% and 68.9% of the NO
emissions in Eixample and Gràcia, respectively. Furthermore, because of its high traffic, Eixample experiences the highest noise levels (over 70 dB(A)) in the city [
88]. In order to improve the environmental quality of these districts and to move forward to positive performances in PC2-Ecological Background, some planning strategies should be applied. Since the compact form and the mixed use development of these districts encourages walkability [
89], actions should focus on reducing road activity and on promoting cycling. Moreover, taking into account the high density of these districts and the scarcity of green areas, green roofs, and green walls represents some alternative solutions.
Regarding PC3-Urban Form and Architectural Typology, this principal component segregates Eixample and Gràcia (see
Figure 6 and
Figure 7), with a gap of around seven units of distance (6.5). This can be explained since Eixample is mainly associated with a larger amount of offices and greater commercial and industrial areas than those in Gràcia. In fact, Gràcia has the lowest records in these indicators for the whole case study. On the one side, Eixample displays a metropolitan scale due to its size, compactness, high density, and proximity to the city center (see
Figure 1). Eixample, being designed in 1859, has a compact urban form (see
Figure 2) consisting of a strict grid pattern with large straight streets, square blocks with chamfered corners (133 m side), and wide crossing avenues [
63,
64,
65]. In terms of architectural typology, Eixample is characterized by large-surface and high-rise buildings (up to 7 floors) compared with the average height of buildings in Barcelona. Hence, it allows a mixed-used development. Gràcia, on the other hand, is more related to a human scale of interactions since it was originally an independent and smaller town. This district was established originally in 1626 as a village—out of Barcelona’s boundaries—and displays a distinct urban layout: It has an orthogonal grid, with small and medium size blocks, crossed by narrow streets (see
Figure 2 and
Figure 3, and
Table 1). It is defined by low and medium-rise buildings of medium-size plots.
The indicator of traffic accidents also unveils the urban form and the amount and scale of interactions: Because of its straight streets, Eixample’s urban layout encourages car mobility with higher levels of street traffic and higher car speeds [
88]; hence, traffic accidents are more frequent. Instead, the human scale and car-restricted streets of Gràcia possess lower traffic and consequently less road and car accidents. In summary, both districts exhibit differentiated neighborhood characters and scales which are captured in the multivariate analysis.
The PCA applied on the multivariate system description reveals correlations among indicators and districts, which are difficult to identify only with the raw data (as it that has been shown in
Figure 4 or with the urban structure metrics in
Table 1), and allows us to understand the city at a macro level. It exhibits distinctive attributes of the four districts of the case study, as demonstrated by each one of the principal component descriptions presented before. In the case of the first component, it exposes an “inner diversity” of land uses, contrasting Ciutat Vella and Les Corts that own different functional typologies and are the most distant or are the most different of the case study. Our findings portray the different types of “Barcelonas” and, thus, the re-evaluation of the a priori homogeneity of the city. This is more evident when looking at the second principal component; the description made by this component draws a clear division between two distinguishable groups or types of districts: Les Corts on one side, characterized by large public spaces and green and recreational areas, and Eixample and Gràcia on the other, identified as dense and highly polluted.
Moreover, the measurement of the relative distances between the districts as described by PCA gives the main instrument for quantifying diversity at the city scale. The Euclidean distance among districts (i.e., three-dimensional distance between districts’ centroids) determine that Ciutat Vella and Les Corts are the most distant—different—districts of the case study, with an average distance of 11.17 units; Ciutat Vella is also widely separated—10.82 units—from Eixample; and Eixample and Gràcia are the closest—similar—ones, with an average distance among them of 7.19 units (as presented in
Table 9). These distances in a non-diverse case study would give close to zero values and imply districts’ stacks in specific locations. This has not been the case of the present results, in which significant distances among the districts indicate a diverse behavior of the city.
This diverse behavior among districts can be explained by factors such as building construction type, urban development period, economic prosperity, urban form, geographic setting, prevailing land use, and other biophysical and economic characteristics of the neighborhoods. The amount and type of activities within districts are also related to the previous factors. This has been illustrated by the third principal component, where there is a clear differentiation in the types of land uses between Eixample and Gràcia, mainly explained by their distinctive building type, urban form, and neighborhood characters.
4.2. Urban Trend Analysis
The multivariate system description of the current case study is a time-dependent data: It is, therefore, possible to analyze districts’ evolution between 2003 and 2015. Trends are clearly drawn by biplot graphs [
90]: The districts’ displacements with respect to principal components are shown in
Table 8. For example, with regard to PC1, we observe that none of the districts move—change—in the time span. Contrarily, the displacements of districts are substantial in PC2: Although there is no relative divergence or convergence movement between the districts, all of them move simultaneously towards positive values. In the case of Gràcia, Eixample, and Les Corts, they advance about two units of distance (2.55, 1.99, and 2.49, respectively) during the 12 years. Specifically, Eixample and Gràcia (districts at the bottom of
Figure 5) move forward to a reduction of the indicators related to air pollution (C1, C2, and C3). This trend agrees with the findings from Barcelona City Council [
87] and Baró et al. [
91]: The measures from the municipal monitoring stations show a steady trend for NO
values and a minor decrease for PM
since 2006. In the case of Les Corts (see the top of
Figure 5), there is an increase in units and areas of urban services such as education (A4), sport (A6), playgrounds (A14), street trees (A16), and urban green (A18), upgrading its carrying capacity—input supportive category. Ciutat Vella, on the other hand, is the district with the highest displacement in the time span: It moves more than three units (3.26) in the time period.
The displacement of the districts along PC3 demonstrates a particular divergence behavior of Ciutat Vella compared to the other districts, and therefore, the urban diversity increases. While Ciutat Vella stagnates, the other districts move backward in this component about two units of distance in the time span (Eixample: −1.75, Gràcia: −2.27, and Les Corts: −2.63).
4.3. Contribution of the Metabolism Categories to the PCA Description
Although we acknowledge that the urban metabolism approach can be developed in greater depth (e.g., by applying other urban metabolism measures [
92]), here, we compare the contribution of the metabolic groups of variables against the districts’ performance to have some insights on the metabolic character of districts.
Figure 7, for example, details the location of Les Corts and Eixample against the first and third principal components. If we superpose this figure with the results given in
Table 10, we observe that both districts are characterized by the input and output metabolic groups of variables. Ciutat Vella is also segregated when described by PC1, and therefore, it can be said that the input and output categories of variables contribute the most to its description. As indicated previously, this district is related to the urban amenities of the city. Instead, it is not correlated with industry and pollution.
In the case of the third principal component for which the contribution from all metabolism categories is similar, except for the regenerative metabolism type, we can see a gap between Gràcia and the rest. Indeed, Gràcia is not described by PC1, and therefore, not characterized by the input and output metabolisms. Instead, it is mainly described by PC2 and PC3, which are contributed the most by the destructive type and few by the regenerative type. This district displays correlation with industry and infrastructure, such as industrial areas (B12), water supplied to domestic consumption (B2), traffic accidents (C5), and air pollution indicators (C1, C2, and C3) and can be labeled as a destructive type of district. Thereby, planning actions must be taken in order to reduce its destructive metabolism and to enhance its regenerative metabolism (e.g., plans to reduce air pollution, to mitigate industrial activity, and to encourage recycling).
4.4. A Link with Resilience
Although the concept of differentiated characteristics has been reported in other urban studies at the neighborhood scale (see for example Codoban and Kennedy [
93]), the relation with resilience was not considered. Given the definition of resilience as the ability of a system to adapt to a modifying process while retaining its functionality and not necessarily returning to the previous equilibrium state [
24,
27], it can be said that the case study of 4 districts of Barcelona has been resilient between 2003 and 2015. The system, or the subsystems (if we consider the four districts as isolated systems), have adapted and responded to disturbances (i.e., 2008—to present Spanish financial crisis), performing in diverse circumstances while retaining their basic functions. Particularly, if we look at the system of indicators and specifically at those included in the input supportive category which increases in the time period (e.g., buildings of commerce, offices, education, health services, playgrounds, sport, and cultural centers), it can be stated that urban planning has built adaptable social and economic infrastructures, thus strengthening urban resilience.
Despite the above, an extensive analysis of urban resilience is required for each district. For example, it has also been reported in other studies [
86] that, after some urban renewal programs, Ciutat Vella improved its cultural infrastructure. Since those programs were mainly focused on physical aspects, gentrification processes have taken place [
94], threatening the district’s social resilience. Precisely, this is one of the challenges within the framework of urban resilience: cross-scale trade-offs among the different strategies that aim at fostering resilience [
20,
21,
95]. Hence, the application of the PCA to the multivariate dataset representing the districts and the visualization and measurement of the individuals can support effective urban planning.
In order to strengthen a system’s resilience, planners must identify the processes and disturbances that the urban system is likely to face. However, to reach accurate conclusions about urban factors fostering diversity and influencing urban resilience, complementary types of analysis are needed. Recent studies have suggested that urban resilience must be evaluated at multiple scales, including the scale of the city’s subsystems [
20,
95]. This is exactly the purpose of the present approach, where the analysis is undertaken at the district scale. Certainly, resilience is understood as a property of the system that is not subject to a specific scale (e.g., the city’s scale) [
95]. Focusing only on one scale could enhance resilience in a single district without considering the effects in the other districts of the city. On the other hand, a general perspective may neglect subsystems that can help further improve resilience in the whole city.
An insight we can provide is that there are “many types of Barcelonas”, which apparently should strength the city’s resilience as a whole. As remarked by Folke et al. [
24], the functions and services provided by those types can be sustained over a wider range of conditions and the system will have a greater capacity to recover from disturbance. Nonetheless, an immediate consequence of this fact is a reductionist approach to properly assess resilience at the lower levels of a system, in our case, at the neighborhood scale. At the city scale, diversity improves the resilience capacity of the system by having several types of districts that are capable of performing different and specialized urban functions. For example, Ciutat Vella has been identified as a diverse and pedestrian zone that owns amenities for the whole city. In contrast, Les Corts emerges as a predominant housing and car-usage zone. The answer to how this trade-off between global diversity and local specialized function at the district’s level favors resilience remains elusive, and at the same time, it raises serious ethical concerns for the governance of a city.
MFA results can also contribute to the understanding of the resilience and sustainability level of the urban system. Actually, the metabolism of the city can be seen as a living organism that struggles towards life—or sustainability: the input and regenerative categories must exceed (in quantity) the output and destructive ones for life to be possible and sustain. In this sense, we raise the question on how to quantify these categories for each district and to possibly conclude about the metabolism of the city itself. The differences between the contributions of the metabolism categories in the present results are somehow faint. Nevertheless, we believe that including these categories is important as a departure step to link this exploratory multivariate statistical approach with the urban metabolism framework. In this regard, metabolism types—categories—can highlight trends and patterns and can provide measures, which can be easily identified by decision makers for assessing the degree of sustainability in the urban development [
96]. Therefore, urban planning strategies could and should be applied to improve the input and regenerative capacity of districts and thus the sustainability of the whole system.
5. Conclusions
In this article, we have applied exploratory multivariate statistical techniques to expose the diversity between the districts of a city. Specifically, we applied PCA to a multivariate system of indicators that collects measurements of environmental, economical, and social indicators of the city of Barcelona (Spain) between 2003 and 2015. The assembly of the dataset for the particular case study of four districts of the city allowed us to evaluate the methodology in a complex urban system, which also constitutes itself a reliable dataset for future research. After applying PCA to the dataset, three principal components have been found to account for 88% of the total variance: “Social Background” (PC1), “Ecological Background” (PC2), and “Urban and architectural typology” (PC3). One of the main findings is that many types of “Barcelonas” emerge in the dimensionally reduced description: Districts exhibit differentiated neighborhood characters and distinctive functions at the city scale. We could also quantify diversity as the Euclidean distances between separated districts. The distances allow for the identification of clustered districts as well as those that are separated apart. At least two groups of districts are clearly identifiable, one that is characterized by large public spaces and green and recreational areas and another one that is dense and highly polluted.
The results demonstrated that Ciutat Vella and Les Corts are the most distant—different—districts, with an average distance of 11.17 units; Ciutat Vella is also widely separated—10.82 units—from Eixample; and Eixample and Gràcia are the closest—similar—ones, with an average distance of 7.19 units. Moreover, the temporal-dependency of the dataset reveals information about urban diversity trends, where the district of Ciutat Vella seems to diverge from the rest, increasing the diversity of the city. The temporal results also demonstrate that, for example, some districts have moved forward to an increase in urban services, while some others have decreased air pollution throughout the years.
The potential applications of this methodology are not only restricted to the development of urban policy, for which this framework constitutes another way to evaluate urban systems, but also can be exploited in the understanding of cities from the perspective of the science of cities [
97], where the aim is to quantify the complex relationships between the different elements that compose the urban system. Since this work is based on novel algorithmic developments in the field of exploratory multivariate statistical analysis (i.e., MFA), it can also be used by researchers to contrast and validate their own methods.
Some open questions left by the elaboration of this article are the following. Should diversity be assessed at the city scale or at the district scale? Why does robustness, a system level property, emerge in urban systems as well as in natural ecosystems? How to relate all the temporal trends and the direction of the city towards the increase of sustainability? Future studies on the science of cities and on urban metabolism will help to unveil these open questions and will allow us to more fully understand the links between diversity, urban resilience, and their trade-offs with sustainability [
95]. Nevertheless, our multivariate statistical approach provides a framework for further mapping of Barcelona’s diversity. The present methodology is feasible to be replicated at other scales and urban systems, for which particular indicators to each case study should be selected. One future work is focused on reviewing accurate indicators that describe urban sustainability in terms of diversity at the city scale, as measured with the present methodology. Cross-city profiles could be developed in future multivariate statistical analyses in order to find diversity patterns and causes influencing urban resilience and the scaling of those particular patterns in different cities. Another future work is to apply the exploratory multivariate analysis including all 10 districts of Barcelona. In that case, we could improve the urban description by contrasting this methodology with some recent statistical methods such as Self-Organising Maps (SOMs) [
98,
99], compositional data techniques [
100], and cluster analysis [
101].