1. Introduction
The U.S. COVID-19 pandemic unraveled an unprecedented health and economic crisis in which historically racialized and vulnerable minorities paid the highest price with their lives. Federal policy defines “Hispanic” not as a race but as an ethnicity. Additionally, it prescribes that Hispanics can, in fact, be of any race. However, some authors argue that standard U.S. racial categories might either be confusing or not provide relevant options for Hispanics to describe their racial identity. A good share of Hispanics considers their Hispanic background to be part of their racial background, while others think of it as their ethnic background. A good share considers the label both an ethnic and racial identity [
1]. The disproportionally higher death rates decimated the Hispanic population of New York City which made it clear that legacies of structural and social inequalities combined to increase the higher risk of mortality for the group [
2,
3,
4,
5,
6,
7,
8,
9,
10,
11,
12,
13,
14,
15,
16,
17,
18,
19,
20,
21,
22,
23,
24,
25,
26,
27,
28,
29,
30,
31,
32,
33,
34,
35,
36,
37,
38,
39,
40,
41,
42,
43,
44,
45,
46,
47,
48,
49,
50,
51,
52,
53,
54,
55,
56,
57,
58,
59,
60]. In 2020, for example, during the peak year of the pandemic, the Hispanic population experienced a decline in life expectancy at birth, shrinking the group’s survival advantage over the white population from over three years to less than one year [
3]. As of June 2022, in New York City, at the epicenter of the pandemic, Hispanics exhibited the highest cumulative COVID-19 infection rates (28,025 per 100,000 people) and death rates (871 per 100,000) and the second-highest hospitalization rates (2150 per 100,000 people) among all racial and ethnic groups for whom the city collects data [
56]. This is the case even when both Hispanic adults and eligible children in New York City have higher vaccination rates compared to the white and black populations [
57]. Currently, at 60.6 million, Hispanics are the largest minority group and the youngest of all racial and ethnic groups, with an average age of 30 in the U.S. These young adults compose disproportionately the largest share of voters, workers, and consumers among Latinos. During the first and peak year of the pandemic, they suffered the highest excess mortality rates of any other racial or ethnic group in the nation [
58]. In California, for example, Hispanics ages 20 to 54 were 8.5 times more likely than white Americans to die of COVID-19 [
45]. In New Jersey, young Latino men between the ages of 18–49 died at 4.5 times the rate of Latino women and 7 times the rate of white men in the same age category. However, to this date, we lack insights in New York City as to which Hispanic subgroups experienced the worst risks or mortality caused by the COVID-19 pandemic.
On-going qualitative research in New York City by the lead author [
4] suggests that Hispanic families in households with mixed immigrant status experienced the worst forms of economic and health vulnerabilities due to COVID-19. Accordingly, living arrangements, family structures, education, and years of residence help explain the higher rates of infection, hospitalization, and deaths in these households. These preliminary insights also suggest that Hispanic women living with a spouse or male partner experienced higher risks of COVID-19 infections and hospitalizations than those who headed home on their own. In addition, emerging COVID-19 literature has argued that, nationally, while women experienced labor market shocks, losing jobs at much higher rates than men, the latter paid a higher price with their lives [
41]. An established epidemiological scholarship also documents that while women tend to experience higher comorbidities and a lower quality of health, especially as they age, men tend to develop more chronic illnesses and die sooner. These findings contribute to what has been conceptualized as a gender health paradox [
42]. The results have led these and other scholars to advocate for inter-disciplinary studies bridging public health and social sciences scholarships to distinguish the main factors affecting men’s and women’s health choices, resources, and outcomes. Beyond structural drivers of discrimination, implicit bias by health care providers based on gender, race, ethnicity, and other characteristics is now associated with the exclusion or misdiagnosis of women and those of other minority groups’ health conditions [
61].
Given the higher than usual COVID-19 death rates among different populations in specific spatial locations, researchers have also emphasized that we pay attention to geography in explaining the higher COVID-19 risks and death rates among the US Hispanic population [
3]. This is particularly the case in specific regions where the Hispanic population has traditionally concentrated [
5,
6,
60]. The higher-than-average death rates among Hispanic groups are surprising, as close to four decades of studies have consistently documented the health advantages of foreign-born Hispanics when compared to whites or their native-born counterparts. The phenomenon has been conceptualized as the Hispanic health paradox (HHP) [
2,
7,
8,
9,
10,
22]. However, researchers now argue that the COVID-19 pandemic has eroded the health advantage of Hispanics given the higher death rates experienced by the group since the pandemic.
Beyond geography, scholars have also examined the role of structural racism [
11], one especially exacerbated by the nesting effects of intersectional inequalities the Hispanic population confronts in the US, such as concentration in low-wage service occupations with higher risk of COVID-19 exposures [
6]; limited access to health care services [
9] such as vaccinations [
6] or preventive care; and the weathering effect of living in highly concentrated, poor, inner-city neighborhood areas [
11].
Studies have also shown that the positive correlation observed between death rates and the concentration of Hispanics seems to vanish at the county level when socio-economic characteristics linked to structural racism are controlled [
12]. Ample research has documented the role of geography on life expectancy [
13,
14], the rate of COVID-19 spread [
15], and the relation of air pollution concentration to deaths because of COVID-19 [
16]. Most significantly, these emerging insights reveal that COVID-19 deaths are linked to spatial location at the county level [
17]. However, other studies [
17] based on national data at the county level find no spatial correlation between COVID-19 deaths and the Hispanic population. Yet, our analysis of panel data at the level of the unit of metropolitan areas [PUMA] for New York City finds a positive spatial correlation between the Hispanic population’s COVID-19 death rate during the peak year of the pandemic. This result is consistent with previous studies, also based in New York City, documenting the correlation between neighborhood characteristics and spatial distribution in determining the health needs of specific groups [
18].
Informed by the above scholarship, this paper further examines how structural racism (measured by spatial segregation) and intersectional forms of gendered inequality (measured by family structure, employment, and citizenship status) combine to explain the disproportionally higher rates of COVID-19 deaths among Hispanic groups in New York City during the first and most devastating year of the pandemic. Beyond the analysis of spatial correlations, we document differences in gendered vulnerabilities associated with the risk of Hispanic men and women dying from COVID-19. An important caveat in our study is that once we control for neighborhood (PUMA) characteristics, the positive correlation between the death rate and the Hispanic population vanishes for women but not for men. This result is new and invites further research by gender, spatial stratification, and public health scholars, especially among proponents of the Hispanic health paradox. Our analysis of different subsamples reveals that Hispanic men with a shorter time of residence in the U.S., citizenship status, and health insurance have better health outcomes than those with a longer residence. We interpret the importance of the time spent in the US as proof that the weathering process matters [
11] in decreasing the health advantage of Hispanic groups. We also interpret the importance of health insurance and citizenship status as class indicators increasing access to preventive health services, such as vaccination [
6].
We perform further analysis focused on women to understand what characteristics of the PUMA data help eliminate the positive correlation between COVID-19 deaths and the Hispanic population share. A separate, ongoing qualitative study by the lead author [
4] in New York City suggests that Hispanic immigrant women who import at least one year of higher education and who act as sole heads of household experience lower risks of COVID-19 infection and deaths. Our quantitative analysis reveals that for different subsamples of women, only the inequality variables of the Townsend index explain in all the subsamples the positive correlation between deaths and the Hispanic population. These variables include the share of the Hispanic population that lives in overcrowded households, uses public transportation, is unemployed, has children, and lives in poverty.
This paper is organized as follows: the first part discusses our theoretical considerations of the Hispanic health paradox, structural racism, gender inequality and its relation to health outcomes, and the applicability of this scholarship for framing our findings on the role of spatial segregation and social inequality on the disproportionate death rates observed for the Hispanic population in New York City; the second part of the paper introduces our empirical models used to calculate death differentials for the Hispanic population; the third part presents the data and results of findings; the fourth part presents our discussion and conclusion as well as recommendations for future research.
3. Methodology
As a complete data set that disaggregates deaths by PUMA (Public Use Microdata Areas (PUMAs) are non-overlapping, statistical geographic areas that partition each state or equivalent entity into geographic areas containing no fewer than 100,000 people each) by race and ethnicity is not available, researchers have looked at the correlation between specific population shares and the total number of deaths [
12,
35,
36]. In this paper, we follow the approach of McLaren [
12] to indirectly estimate the differential mortality ratio between Hispanics and the rest of the population. The estimation of this mortality differential is the first objective of this paper. The method consists of estimating the partial correlation between total deaths and the Hispanic population, as shown in Equation (1):
where
represents deaths at PUMA
i;
represents the share of Hispanic population in PUMA
i;
represents the usual error term; and
represents the correlation between deaths at PUMA level and the Hispanic population share. Following [
12], define the weighted average deaths in PUMA
i by Equation (2):
where
represents deaths among Hispanics in PUMA
i, and
represents deaths among the rest of the population in PUMA
i.
Differentiating Equations (1) and (2) with respect to
is the death mortality differential for the Hispanic population. If
, the Hispanic population has a higher mortality ratio than the rest of the population. Studies usually introduce variables that can measure structural racism. If the mortality differential does not disappear, then the residual differential is not explained by structural racism [
11,
34]. If
, we would argue that the Hispanic population has a lower death prevalence, which is explained by the Hispanic health paradox [
8,
9,
10].
Equation (1) suffers from biases generated by a model that is not well specified because of the nature of the data and the existence of more covariates. Consequently, we include more control variables that could explain the mortality differentials, taking into consideration the nature of the data. Given the panel data structure of our data for deaths and the cross-section nature of the data on the PUMA characteristics, we estimate a random effect estimation on the change in death rate by PUMA:
where
represents control variable k. As control variables, we include the share of the population with some college or more education, the share of the population with high school completed, the share of the population working in 13 specified occupation categories, the median household income, the share of the insured population, the share of the population that drives to work, the share of the unemployed population, the share of the population that rents, the share of overcrowded households, and the fraction of households with child poverty. These variables are linked to structural racism [
12] and the Townsend Index [
36,
43], a deprivation index that has been found to be correlated with geographic areas with high health problems [
35]. This specification has the advantage of eliminating any potential time trend existing in the death rate.
Equation (7) shows a second specification where we exploit the spatial distribution of the data to consider the potential spatial distribution of deaths [
15,
16]. Specifically,
where
includes all control variables and the Hispanic population,
includes the spatial lags for control variables,
are spatial weighting matrices, and
are scalar parameters. In this specification, each variable is said to have a direct and an indirect effect. Direct effects refer to impacts that are not linked to spatial correlation, while indirect effects are linked to the different sources of spatial correlation that are present in Equation (7).
Exploiting the spatial panel nature of the data has certain advantages over an OLS estimation. First, by considering the spatial correlation, we obtain a better estimation than the one offered by OLS. Second, the spatial specification allows us to obtain direct and indirect effects, which allows the determination of the importance of spatial correlations in determining the effects of each variable.
The second analytical objective of this paper is to obtain gendered estimations for Equations (1), (6), and (7). The data on deaths is aggregated; consequently, we cannot separate it by gender. The data at the PUMA level, however, comes from the American community survey (ACS) and allows for the calculation of characteristics by gender by aggregating the PUMA characteristics only for the population of adult men and women, respectively.
A third objective of the paper is to further analyze the origins of the results. We will do so by generating different subsamples at the PUMA level, which will allow us to do a comparison of the estimations of Equations (1), (6), and (7) for the different partitions of the data. The analysis is explained fully in
Section 4.4.
5. Discussion
This paper studies the relationship between death rates at the PUMA level and the Hispanic population of New York City. Our census-based analysis confirms results by other scholars [
3,
60] documenting a positive correlation between COVID-19 death rates and the share of the New York City population that is Hispanic. Our analysis further shows that the death rate is spatially correlated with the Hispanic population, which offers a new and more varied understanding than is provided by earlier studies [
12,
17]. In addition, when we control for PUMA neighborhood characteristics on death rates, the positive correlation between Hispanic population and death rates vanishes for women but not for men. The first result confirms findings by McLaren [
12], while the second gendered result for women is new. This implies that in neighborhoods with a high concentration of Hispanic population, women may experience higher protective factors that decrease the risks of dying from COVID-19, while we did not observe this for men. Some of these factors may be mediated by women’s access to more diverse networks among people with longer tenure in the US or within local or mainstream institutions, such as schools and migrant-servicing agencies. This in turn increases access to health-preventive information or even access to the safety net.
Different subsamples of Hispanic individuals from different age groups confirm the results of our analysis for women. In fact, for medium-aged women, we find evidence of the Hispanic Health Paradox, since a negative correlation between deaths and the Hispanic population is found among individuals with longer tenure in the U.S. In the case of men, individuals with less time residing in the U.S., access to medical insurance, and U.S. citizenship tend to have better health outcomes than those with more years of residence in the U.S. These results substantiate earlier studies [
4,
11], except for our finding about the higher risk of death for young Hispanic men, which is a new finding in the literature. This may be due to younger men being more likely to live in neighborhood areas with higher population concentration, which exert greater traditional expectations on them as providers; or these men may be concentrated in jobs with higher exposure or less protective factors, such as access to health insurance. These risks translate to higher COVID-19 vulnerabilities for younger or immigrant males in more recent cohorts.
In the case of women, different specifications of the model are analyzed for the different subsamples. We find that in all subsamples, the variables of the Townsend index explain the positive correlation between deaths and the Hispanic population, which confirms results found by McLaren [
12] and other scholars in other geographic areas where the Hispanic population concentrates [
60]. In the case of the occupations, we find that they can explain the positive correlation between subsamples of women that are recent immigrants, heads of households, employed, and never married/single, which confirm results found by García et al. [
11] and Riley et al. [
6]. The median income can explain the positive correlation between deaths and the Hispanic population for those with citizenship status, for women with health insurance, and for those never married/single, confirming results by EDQ [
40]. The variables for education can explain the positive correlation between deaths and the Hispanic population in the case of heads of household, women who are employed, have health insurance, are citizens, and those never married/single, confirming results by EDQ [
40]. These results also imply that women who have never married and are single are over-represented among the subsample of Hispanic individuals with better health outcomes. These results confirm those by Fuentes and Kucheva [
4].
6. Conclusions
The first objective of this paper was to estimate the correlation between deaths caused by COVID-19 and the Hispanic population. If the estimation is performed at neighborhood levels, our results show a positive correlation between deaths and the Hispanic population, revealing a larger death rate among the Hispanic population when compared with the rest of the population of New York City. If the estimation is performed among subgroups, we obtain a negative correlation between deaths and the Hispanic population, substantiating the predictions of the Hispanic Health Paradox. If we control for spatial correlation effects and variables related to structural racism, the correlation between deaths and the Hispanic population vanishes.
The second objective of the paper is to estimate the correlation between deaths and the Hispanic population separately for men and women, controlling for the gendered characteristics of the PUMA. The results show that the same results obtained for the entire sample are obtained for women, while for men, we find a positive direct effect that is offset by the indirect effects of spatial correlation, rendering a non-significant total effect. For men, four elements are found to negatively affect their health in the U.S.: (1) the time spent in the U.S.; (2) their employment status; (3) their civil status or never married/single, and (4) their young age. Three elements are found to positively affect health outcomes: (1) US citizenship, (2) access to health insurance, and (3) being of medium age. In the case of women, only the characteristic being of medium age is found to positively affect the health status of individuals. In all other cases, the control variables linked to structural racism explain the positive correlation observed at different levels.
These findings reveal that women, despite suffering infections from COVID-19, had lower death rates, as has been found for all women nationally [
50]. We argue that these results highlight the need for additional studies to better understand why Hispanic women seem to derive a more protective effect from living in areas with a higher concentration of the population. Women appear to be more resilient to COVID-19, particularly when the results are linked to the occupations they perform, their access to better health services, or other biomedical reasons for these results. The results also highlight the need for improving the health services for the immigrant young men population of New York City, since they seem to perform riskier and essential jobs for the city and to experience higher limitations or vulnerability when accessing health insurance, and as research shows, they died at higher rates during the pandemic. Our results also suggest the need for further research on the needs of older immigrant adults, such as women, since we find evidence that the weathering hypothesis continues to matter or to shorten their presumed immigrant health advantages. Our results also emphasize that it is imperative that federal, state, and local governments collect and release comprehensive data on the number of confirmed COVID-19 cases and deaths by race/ethnicity and immigrant status, as well as age, to better gauge the impact of the outbreak across non-white Hispanic groups in mixed-status immigrant households. Finally, we claim that our results also highlight the need for a discussion among academics and policymakers of the different incremental steps to be taken to reduce the disproportionate burden of other potential future pandemics among minorities, given the decimation that the pandemic has caused among older black and vulnerable Hispanic populations in the U.S. Our results mainly emphasize the need for transformative actions that address structural racism to achieve greater health equity among historically racialized U.S. minorities. Finally, as Hispanics and Latinos make up a disproportional share of the young U.S. population, projected by 2030 to compose 22 percent of the U.S.’s labor force [
1], the size and composition of the population and its contribution to the U.S. economy make it more imperative to design policy solutions to reduce structural racism and other forms of social inequalities that threaten the recovery of the group from the ravages of the pandemic and its future members contributions to our society.