6.1. Fixed Effects Estimation
Table 3 presents the estimates of Equation (4) using the fixed effects (FE) estimation methodology. As can be seen at the bottom of this table, the Hausman test statistic suggests that in all cases the fixed effects model is preferred to the random effects model. In addition, from the Wooldridge test for autocorrelation, we can conclude that the data do not have first-order autocorrelation.
The coefficients in
Table 3 are shown sequentially for the three alternative measures of migration diversity (FI, EI and AI, respectively). The most outstanding result of these regressions refers to the significance of the indexes of cultural diversity in the three regressions. This result is consistent with our main hypothesis of the existence of positive spillovers in terms of higher wages from greater heterogeneity of immigrants. Specifically, the estimated coefficients imply that, on average, an increase in the Fractionalization Index of 10% leads to a rise in the wage of nationals of 1.49%, keeping other regional factors constant. A higher impact is seen with the Entropy Index, as an increase by 10% will predict a growth in the wages of nationals of 1.96%, ceteris paribus. However, this effect is significantly smaller when we consider the Alesina Index. In this case, a higher index, around 10%, implies an increase of 0.8% in wages. The smaller value of this last index may be explained by the very nature of the Alesina Index. As shown previously, this index calculates the diversity strictly among those born abroad in a given place, instead of capturing heterogeneity among all individuals.
Coefficients on other control variables show the expected signs. We find that, except for population and unemployment, all of them have a significant effect on productivity. The lack of significance of the population may be due to the inclusion of other variables, such as a young population, that may capture, in some way, the scale of the region. As can be seen in
Table 3, the results from all the regressions suggest a positive and significant influence of a greater proportion of young population on real wages. Specifically, an increase of 10% in the rate of young population in each region increases the average wage for nationals above 8%. Similarly, our estimates verify the beneficial impact of skilled labour on wages. In particular, an increase of 10% in the share of the population with higher education will result in a rise in wages by 3%. The presence of migrants arriving from countries with a high or very high HDI also appears to be positively correlated with wages of native-born workers. According to our estimates, with a share of 10 higher, real wages will increase by approximately 4%. In contrast, the unemployment rate for natives seems to have a non-significant influence on wages of nationals. Following the recommendations of an anonymous referee, to avoid an omitted variables problem, we have also estimated the model including GDP per capita, as a determinant of real wage. However, probably due to the high correlation of this variable with unemployment (with a correlation coefficient of −0.7034), the GDP per capita is not significant in our regression. Nonetheless, the sign and the statistical significance of the rest of the control variables remains unchanged. The results are available upon request.
Next, we estimate whether these productivity spillovers, and hence the salary improvements for national workers derived from a greater cultural variety, extend to non-national workers too. To do so, we re-estimated the previous model using the total wage of the population as a dependent variable, considering the earnings from both immigrants and nationals. Accordingly, the estimated equation now takes the following form:
where
represents the real wage of the total population and the subscripts and the rest of the variables have the same definition as previously in Equation (4). According to the results of the Hausman test, the coefficients have been estimated once again through the FE methodology. The estimates are presented in
Table 4.
In general, the estimates obtained in these regressions confirm our previous outcomes, although now only two of the three diversity indexes (EI and AI) are significant. The lack of significance of the FI might be justified by the fact that, as pointed out in the literature, the Fractionalization Index might be overrated due to the presence of a large proportion of immigrants in a region, even when these immigrants do not come from a wide range of countries of origin. Thus, we can conclude that the salary benefits from a greater diversity of immigrants in terms of their birthplace are not limited to the national population, but also affect non-native workers.
Note, however, that although the sign of the coefficient on indexes is similar to those previously estimated, the estimated values are now slightly lower and somewhat less significant, revealing a lesser forcefulness of this effect in the case of non-national workers. Increases in the Fractionalization Index and in the Entropy Index of 10% are associated with an increase in the average wage of the total population of 1.34% and 1.82% respectively, whereas the same increase in the Alesina Index will imply a rise in wages of 0.73%.
The estimates for the other control variables confirm the beneficial influence that a greater young and skilled population has on real wages; as well as the positive productivity spillovers of an increase in immigration from countries with a high or very high HDI. Regarding the unemployment rate of natives, we found an expected significant influence. The estimated coefficients indicate a decrease of approximately 0.8% in total wages due to a rise of 10% in the unemployment rate. Finally, as can be seen at the bottom of the table, in all cases, the Wooldridge test for autocorrelation shows us that data do not have first-order autocorrelation.
6.2. Endogeneity and Instrumental Variable (IV) Approach
As mentioned above, the FE estimation takes into account unobserved heterogeneity among regions; however, it does not consider a potential simultaneity problem or reverse causality. Nonetheless, as pointed out by Cadena and Kovak [
53] and Lewis and Peri [
11], among others, the location of immigrants is not a random selection. In contrast, this may depend on the local economic outcomes. Consequently, whenever the amount of diversity of immigrants in a region and its economic performance are interrelated, we need to be cautious in our estimations in order to avoid biased estimates. To solve this, we employ two-stage regression techniques by using an instrumental variable (IV) whose exogenous variation affects migration diversity in a region, but not total worker productivity. Doing this, we try to capture the source of correlation between cultural diversity and wages that is due exclusively to the influence of diversity on wages [
9]. Although the endogeneity of immigrant diversity is clear from a theoretical point of view, we check this fact empirically through the Wu–Hausman endogeneity test (to compute this test, we add the residuals from the reduced form of the endogenous variables as an additional regressor in the structural equations). The results of this test are shown in
Table A3 from the
Appendix (under the null hypothesis of no endogeneity, ordinary least squared (OLS) estimation is consistent and efficient, while two-stage regression is also consistent, but inefficient. However, if endogeneity exists, an IV estimation methodology is required to guarantee consistent estimations.) As can be seen, the outcomes obtained confirm that the diversity indexes are endogenous in the three regressions. Accordingly, to deal with the problem of endogeneity, we next estimate our model using two-stage fixed effects (2SFE) methodology with an instrumental variable.
Traditionally, two kinds of instruments have been used to address the concern of endogeneity of migration. The first based on accessibility measures as ports or land borders (see, Ottaviano and Peri [
9], among others). This, however, does not seem appropriate for the Spanish case currently given the great advances and cost lowering of national transport, which allows people to move easily across the country. The second type of instruments is built according to the procedure often referred to as “shift-share methodology” (Card [
54]). The intuition behind this instrument relies on the fact that the initial share of immigrants by country of origin can be considered a good predictor of subsequent migration inflows, as migrants tend to be attracted to regions where other immigrants from the same country locate (Gagliardi, [
14]). As shown by Alamá et al. [
6], network or agglomeration effects play an important role in the attraction of immigrants in Spain. Accordingly, we can “predict” the immigrant composition of an Autonomous Community based on the current total immigration rate and the shares of immigrants from each region at the beginning of the period. In this line, the instrument used in our regressions is a type of diversity index constructed as the “predicted” change in the number of immigrants from each country in each region during the period 2008–2016. This instrument was initially proposed by Ottaviano and Peri (2006) [
9], which later became a standard instrument in literature, as in the case of Gagliardi [
14]. As mentioned by Ottaviano and Peri [
6], by definition, this index does not depend on any regional economic shock in the current period.
First, the growth rate of immigration is calculated for each group of immigrants according to their birthplace (this has been calculated year by year since 2009; it does not have information for 2007). Thus, using the same notation as in
Section 3, we have:
where
is the growth rate of immigrants born in country
r,
represents the previous period
t, and
represents the current period.
Second, from the above equation, we calculate the “attributed” share of people born in country
r and residing in an Autonomous Community
c in year
t:
As a final stage, we obtain a diversity index,
div, through the attributed share of foreign-born individuals:
As Ottaviano and Peri (2006) [
9] explained, the variable
div is independent of any specific shock in a region during the period, since the attributed diversity for each Autonomous Community in year
t is built using the participation of the Autonomous Community in year
t − 1 and the national growth rates of
of each group of immigrants (consequently, 17 observations corresponding to the year 2008 have been lost). Thus, this variable would meet the exogeneity requirements needed for a good instrument. However, an additional criterion is necessary for an instrument: to be relevant. To confirm the relevance of this instrument,
Table 5 presents the estimates of the first stage regression and the results of tests of both underidentification and weak identification. To test whether the equation is identified, i.e., that the excluded instruments are relevant, we display the Kleibergen–Paap [
55]
rk statistic. The Stock–Wright [
56]
S statistic is employed to test weak identification. As can be seen at the bottom of the table, the results obtained confirm that the instrumental variable,
div, is relevant and does not suffer from weak identification problems given that both tests are rejected at the 5% level in all cases. The explanatory power of the instrument in the diversity regressions is also confirmed through the
F tests of excluded instruments. In conclusion, we can say that the connection between predicted diversity and the different diversity indexes are significant. Accordingly, our instrument should plausibly increase (decrease) when the cultural diversity goes up (down) or vice versa. Therefore, to the extent that the predicted diversity does not depend on regional wages in the current period (by definition),
div can be considered a reasonably acceptable instrument for cultural diversity in the estimation of wages.
Subsequently,
Table 6 illustrates the estimations of the second stage. As in the fixed effects regressions, the three diversity indexes (FI, EI and AI) are now positive and statistically significant in the explanation of the average wage, suggesting that an increase in the diversity of immigrants is associated with higher real wages. In particular, similarly to the FE estimates, we find that when the Fractionalization Index rises by 10%, wages of nationals go up 8.36%. In the regression of the Entropy Index, outcomes are similar with a coefficient of 7.86%. The effect that an increase in the Alesina Index has on wages is lower than those obtained with the previous indexes, but higher than that achieved through the regression with fixed effects. Additionally, the results of the 2SFE regressions confirm the expected benefits derived by a higher rate of young and educated population as well as the benefits for national workers of a higher proportion of skilled immigrants.
Next, for the purpose of comparison, and to obtain a broader view of the impact of a greater cultural diversity on the labour market, we perform a similar analysis considering the wages of total workers (including non-native ones) as a dependent variable. Similarly, the problem of a non-random selection in the location of immigrants is analysed through the Wu–Hausman endogeneity test. The results in
Table A4 from the
Appendix confirm the endogenous nature of the three diversity indexes in the explanation of the total wages. Accordingly, we estimate the model by 2SFE using the predicted change in the number of immigrants coming from each country as the instrumental variable. The validity of this instrument has been previously confirmed by the Kleibergen–Paap [
55]
rk, the Stock–Wright [
56]
S and the F statistic tests of underidentification, weak identification and excluded instruments (see
Table 5). The estimates in
Table 7 support our suspicions that the beneficial effect of greater diversity in the workers’ compensations also affects non-native workers. As before, the three diversity indexes are positive and statistically significant (although these coefficients are not directly comparable due to differences in the number of observations in both regressions.). Furthermore, the roles of the other control variables in the explanation of total wages are similar to those obtained previously.
To sum up, our estimates consistently confirm that immigrant diversity is positively associated with wages (for natives and non-natives) in Spain. These outcomes are robust to the unobserved regional heterogeneity. In addition, in this work, the risk of a potential reverse causality bias between the economic conditions of a region and its cultural diversity is taken into consideration through instrumental techniques which provide us reasonably robust results in this matter. Finally, we prove the important role of an increase in a young and skilled workforce, whether native or foreign, to encourage productivity and regional development.