2. Background
Liddle [
5]—in an analysis that considered city-based data from three datasets [
6,
7,
8]; a total of 167 cities—calculated the ratio of a city’s gross domestic product (GDP) per capita to the associated country GDP per capita. GDP per capita is higher for most cities than for their respective countries as a whole—the ratio of city-to-country GDPs is less than one for only 35 cities, and less than 0.85 for only 14 [
5]. Furthermore, the relative economic importance of cities is stronger in countries with lower GDP per capita [
5]. That second point (i) illustrates that cities in developing countries exercise an important migratory pull; and (ii) helps explain that urban areas will be the locus of all UN projected future population growth (over the next 40 years). Indeed, rural-urban migration accounts for around half of the total urban population in Africa [
9].
Urbanization and economic development do tend to accompany one another; in part, this is because the industrialization process involves the agricultural labor force migrating from rural farms to urban manufacturing plants. However, slow economic growth has spurred rural-urban migration and led to the phenomenon of urbanization without growth, particularly in Africa [
10,
11,
12,
13]. Similarly, in the case of India, how fast a state urbanizes is negatively correlated with that state’s rate of economic growth [
14]. Ultimately, urbanization may be more evidence of economic progress than a catalyst for economic growth, and the/any relationship between urbanization and development may be an equilibrium one [
13,
15].
Table 1 displays data from the World Bank’s World Development Indicators database [
16] on various measures of urban poverty and rural-urban differences. Data were collected for all countries not classified as high-income by the World Bank (this meant a maximum of 137 countries, but not all countries have data for each variable). For data that are collected annually, the most recent year was used (typically either 2014 or 2012); for data that are less frequently collected, the average over 2010–2014 was used.
The data do suggest that people in cities tend to have more opportunities than people in rural areas (since people in urban areas likely have more access to resources—electricity, quality water, education, health care, transport, capital, culture/arts—than do people in rural areas, and since access to such resources tends to increase opportunities for quality of life). Nearly all urban dwellers have access to electricity and quality water (the median shares of urban dwellers with such access are 98% and 97%, respectively). The ratio of the rural to the urban share of population with access to electricity/water demonstrates that urban dwellers are more likely to have such access than their rural counterparts. The median ratios are 0.71 and 0.89 for electricity and water, respectively, suggesting the urban population is about 10–30% more likely to have such access.
However, there is poverty in urban areas, too; almost half of urban populations live in settlements defined as slums (the World Bank defines a slum as a house that lacks one or more of the following conditions: access to improved water, access to improved sanitation, sufficient living area, and durability of housing). Moreover, over a fifth of the urban population lives below their national poverty lines. Yet, the share of people living below the poverty line in rural areas is double that, since the median ratio of rural-to-urban poverty share is 2.
3. Analysis and Discussion
Initially, we revisit Kuznets’ analysis by also considering urbanization levels via cross-sectional regressions. Cross-country data is particularly useful in studying developmental change; such data allow for the greatest degree of generalization, and are in the spirit of Kuznets’ original analysis. We perform pure cross-sectional regressions (in
Table 2,
Table 3 and
Table 4), since the measurements of the dependent variables differ much more across countries than across time. Furthermore, a cross-section is more appropriate when including nonlinear transformations of variables, since using pure cross-sectional data avoids the statistical complications that arise when such operations are performed on time-series data (see e.g., [
17,
18]).
Specifically, we use the Gini index as the measure of inequality, and estimate it (i) as a nonlinear function of GDP per capita (in constant 2011 international dollars that are adjusted for purchasing power parity); and (ii) as a nonlinear function of urbanization. The Gini index is based on the Lorenz curve, which plots the proportion of total income of a population that is cumulatively earned by the bottom x% of the population. So, a 45 degree line represents perfect equality of incomes, and the Gini index is the ratio of the area that lies between the 45 degree line and the Lorenz curve. Hence, a Gini index of 0 represents perfect equality.
The data are from the World Bank’s World Development Indicators [
16], and the observations are from 2013 for GDP per capita and urbanization and from the average over 2007–2013 for the Gini index (which is stable over time, and, for many countries, is observed/recorded only occasionally). All variables have been converted to natural logs so that their estimated coefficients can be interpreted as elasticities. The cross-section contains 133 countries (which include high-income countries), and the regression results are displayed in
Table 2.
For the polynomial model considered here, an inverted-U relationship between the dependent and independent variables is said to exist if the coefficient for the linear term,
, is statistically significant and positive, while the coefficient for the squared term,
, is statistically significant and negative. Moreover, whenever the estimations suggest a parabolic relationship (i.e., one coefficient is positive and the other negative), the implied turning point,
τ—or the level of the independent variable at which the sign of the relationship between dependent and independent variable changes—can be calculated by:
The regression confirms the expected inverted-U relationship between income inequality and income. The turning point of under US
$2000 per capita is well within the sample range—indeed, the mean income is over US
$14,500, and the median is over US
$9300. Interestingly, urbanization has a U-shaped relationship with inequality, i.e., initially, increases in urbanization are associated with lower levels of inequality, but ultimately, increases in urbanization are associated with higher levels of inequality. Again, the estimated turning point of 23% urbanization is well within the sample range; both the mean and median levels of urbanization are around 55%. This finding that higher urbanization can lead to more inequality in a cross-sectional context echoes the results of a recent, time-series-based, China-only study [
19].
Next, we revisit the variables on urban poverty and rural-urban differences introduced in
Table 1 to determine whether/how the level of income or urbanization is associated with those indicators via cross-sectional regressions (income and urbanization data are the same as described above). As before, the variables have been converted to natural logs so that their estimated coefficients can be interpreted as elasticities. We believe that income should improve these measures monotonically, so we do not consider a polynomial of GDP per capita. However, we do allow for a possible nonlinear relationship with urbanization level (urbanization squared was usually insignificant, and only significant results for this transformation are shown in the table). The regression results are displayed in
Table 3.
For the share of urban population with electricity access and the share of urban population living in slums (i.e., Regressions I and VI), GDP per capita had the expected, statistically significant impact. For urban electricity access, a one % increase in GDP per capita leads to a 0.3% increase in access; whereas, for urban slum share, a one % increase in GDP per capita leads to a 0.4% fall in the share of people living in slums. For both of those regressions, the level of urbanization was insignificant. For the rural-to-urban share of people living in poverty (i.e., Regression V), both income and urbanization were statistically significant, but the associated R-squared for the regression was very low, i.e., the model had a poor fit.
Urbanization had a more interesting relationship for the other dependent variables. For example, for the rural-to-urban electricity access ratio (i.e., Regression II), urbanization had an inverted-U relationship. A one % increase in GDP per capita raised the ratio by two-thirds of a percent—i.e., the rural-urban electricity access gap narrows at higher income. Initially, higher urbanization levels are associated with a higher access ratio/narrower gap as well, but at urbanization of 49%—approximately the mean and median for the 128-country sample—higher levels of urbanization are associated with a lower access ratio/larger rural-urban gap (i.e., disproportionately less electricity access for rural populations). In other words, at higher levels of urbanization, rural populations are more likely to be “left behind.” Perhaps, at high levels of urbanization, there is less interest/incentive for governments to invest in rural infrastructure.
The urban poverty gap regression (Regression III) reveals a similar nonlinear relationship with respect to urbanization. The urban poverty gap is the urban population’s mean shortfall from the poverty lines (counting the nonpoor as having zero shortfall) as a percentage of the poverty lines—so the measure reflects the depth of poverty and its incidence. Again, not surprisingly, GDP per capita has a strong negative relationship; indeed, increases in GDP per capita reduce this gap more than proportionally. Urbanization now has a U-shaped relationship with the urban poverty gap; however, because of the definition of the dependent variable, the basic relationship is the same as with the rural-to-urban electricity access ratio, i.e., initially, increases in urbanization are associated with an improvement (i.e., lower urban poverty gap), but ultimately, urbanization is correlated with a worsening in poverty.
What is particularly surprising about Regression III is the relatively low turning point for urbanization; indeed, the turning point is only 15%, whereas, the mean/median is approximately 40%. It is possible that when urbanization increases because of rural to urban migration, those rural migrants settle in marginal urban areas so that urban poverty increases. However, one should be mindful of the relatively small sample size of Regression III (only 47 out of 128 countries had the urban poverty gap data). Moreover, those countries included in Regression III were disproportionately poorer than the “full sample” (of Regressions I and II). For example, the mean and median GDP per capita for Regression III were only US$5200 and US$3600, respectively, compared to US$8000 and US$6500, respectively, for the 128-country sample (Regressions I and II). Moreover still, the maximum GDP per capita for Regression III was approximately US$14,500, which means the 21 richest countries in the 128-country sample were not included. By contrast, the range of urbanization levels is similar for the smaller sample (Regression III) and the larger ones (e.g., Regression I): 12–78% compared to 12–89%, respectively.
The urban poverty share regression (Regression IV) is similar to the urban poverty gap regression (Regression III) in two ways: (i) the smaller sample of countries are poorer than the larger sample—mean and median GDP per capita are US
$6600 and US
$4700, respectively; and (ii) increases in urbanization seem to offset improvements from increases in GDP per capita. Indeed, increases in GDP per capita lower the share of urban population living in poverty by nearly one-to-one; however, increases in urbanization appear to more than offset that improvement in poverty reduction (urbanization’s elasticity is 1.2). Again, the results could be interpreted as suggesting that, when urbanization increases because of rural to urban migration, poverty among the (newly designated) urban population would increase as well. Indeed, there is evidence that rapid urbanization has caused worsening urban poverty (e.g., [
20])—a finding that is in concert with both Regressions III and IV.
Lastly, we consider another method for analyzing potential nonlinearities, i.e., the quantile regression (initially proposed by [
21]). A quantile regression estimates the linear relationship between regressors and a specified quantile of the dependent variable (in this example we use quartiles). Among the questions quantile regressions can address are: (i) whether the coefficient estimates vary across different quantiles of the dependent variable; and (ii) whether the coefficients for the median are statistically different from those estimated at the upper and lower quantile.
The quantile regressions typically suggest a uniform impact of urbanization (see
Table 4). The main exception to that result is for the Gini index, for which urbanization is significant only at the higher quantiles of the index, i.e., urbanization matters in societies that are the most unequal. There was more evidence of differences across quantiles for income—this was the case for nearly half of the dependent variables as displayed in
Table 4 (results for the other quantile regressions not shown, but are available upon request).
The impact of GDP per capita declines significantly when the share of urban population with electricity access is sorted by quartiles (consider the reported p-value of the slope equality test between the lower and median quartile and between the upper and median quartile)—an unsurprising saturation effect. A similar pattern is observed for both the rural-to-urban electricity access ratio and the urban poverty share; however, the difference between estimations is only statistically significant when comparing the median quartile with the top quartile. For the rural-to-urban electricity access ratio, that relationship is in concert with the same finding for urban electricity access, as would be expected. Whereas, for the urban poverty share, the quantile regression suggests (surprisingly) that the income effect on lowering urban poverty is significantly smaller in societies with the highest share of urban people living in poverty.