1. Introduction
Mental health is defined as a healthy state of managing stress, working productively, and contributing to the community [
1]. Mental disorders are the second leading cause of the global burden of disease, posing a widespread threat to human health. The mental health crisis is particularly pronounced among Chinese residents. More specifically, the cumulative number of people with mental disorders in China is 95 million, equivalent to one-third of the United States’ population. This results in an indirect economic loss of 4.85 trillion dollars annually, which is equivalent to the total losses from the 2008 sub-prime mortgage crisis. Therefore, to safeguard the well-being of all people and reduce health burdens, it is necessary to identity the potential undesired effects associated with poor mental health. Scholars have extensively explored the correlation between mental health and conventional factors, such as drug abuse, maternal infection, physical activity, hormonal changes, lifestyle, urbanization, etc. [
2,
3,
4,
5,
6,
7,
8]. However, despite the severity of the worldwide air pollution problem, no conclusive evidence exists describing the connection between air pollutants and mental health.
The previous literature examining the relationship between air pollutants and mental health suffers from the following three major methodological flaws: (a) air pollutants are highly correlated with the urban economy, industrial structure, and other confounding factors, all of which would influence mental health. Previous studies, despite accounting for personal income, have omitted these confounders, which could underestimate air pollutants’ influence on mental health; (b) there may be a reverse causality between air pollutants and mental health. Deterioration in mental health may lead to lower labor supply productivity and affect the pollutant emissions associated with economic activities. Some studies tried to use the inverse temperature phenomenon as an instrumental variable to reduce the reverse causality. However, they were unable to significantly improve the accuracy of the model; and (c) The irrational behavior of people in choosing where to live (exposure to air pollutants) invokes the problem of selective sorting in previous studies. More specifically, people with better mental health may prefer cleaner air and choose to live in cities with lower air pollution. This irrational behavior would result in a non-random distribution of air pollutants in the population and skew the estimated relationship between air pollution and mental health.
Fortunately, China’s free heating policy for the Huai River boundary could be a possible quasi-experimental design to study the relationship between air pollutants and mental health, overcoming the three limitations described above. Established during Mao’s period, the free heating policy aimed to provide free or highly subsidized coal for indoor heating for cities north of the Huai River boundary. This policy ensured that low temperatures would not seriously affect the daily lives of northern Chinese residents. However, burning a vast quantity of fossil fuels released numerous air pollutants and damaged the mental health of northern residents, which is a problem still existing now. This research aims to illustrate, through an empirical model, that there is no discontinuous change in the confounders (e.g., income, physical activity, or smoking) which affect mental health among urban residents living north and south of the Huai River boundary. Accordingly, this research was able to isolate the net effect of air pollutants on mental health.
Using an RD (Regression Discontinuity) design necessitates ensuring that the dependent variable (mental health) and independent variable (air pollution) are completely authentic, simultaneously. Mental health data used in this research are the resident self-reported mental health index (measured by the Center for Epidemiological Survey Depression Scale, CES-D) from the Chinese Family Panel Studies (CFPS), which objectively reflect the mental health of the Chinese residents. The panel data of air pollutants are from the China Environmental Statistics Yearbook. Though criticized due to the possible political manipulation, the data are nevertheless acceptable, as politicians do not have sufficient incentive to falsify environmental statistics when their performance is evaluated primarily based on economic rather than environmental performance [
9,
10].
The results of the RD regression indicate that a 10 μg/m3 rise in air pollutants would lead to a significant decrease of 4.9 units in mental health (SD = 0.07, p < 0.05). More specifically, with the arbitrary heating policy, China has 3.5 million more mentally unhealthy residents, which is similar to the population of Los Angeles. Meanwhile, different tests were applied to prove the robustness of the above empirical results.
2. Methods
2.1. Population and Study Design
The study is based on the China Family Panel Studies (CFPS), which are large-scale (contain 25 provinces and over 100,000 respondents), multi-ethnic (contain 14 ethnic groups), interdisciplinary social surveys executed by the Chinese Social Science Survey Center (ISSS) of Peking University. The CFPS cover hundreds of thousands of respondents and track 30 percent of them, which provides a detailed description of the socio-economic characteristics of the Chinese residents. More importantly, the CFPS use multilevel random sampling to maintain geographic balance. In other words, we were able to obtain a sample with a relatively balanced north–south distribution. This feature is important for our analysis of the causal relationship between air pollutant concentrations and mental health using the RD design. To enhance the comparability and authenticity of our data, we chose the CFPS 2016–2020 to construct the panel data, and the selected data use a consistent, standardized questionnaire for the measurement of mental health. After screening out invalid data, a final sample of 65,326 was included in this study.
2.2. Huai River Policy and Discontinuous APCs in Northern and Southern China
It is well-known that China is a developing country with a vast territory. Geographically speaking, China is divided by the Huai River into two significantly different climates, namely a Monsoon climate of medium latitudes (geographically located in Northern China) and a Subtropical monsoon climate (geographically located in Southern China). More specifically, northern cities experience extremely low temperatures in winter than their southern counterparts. This phenomenon makes it difficult for the residents of northern China to engage in outdoor production during the winter, which negatively affects the development of the Chinese economy.
Mao proposed an arbitrary heating policy (namely the Huai River policy) in the 1960s to enable northern Chinese residents to participate in outdoor productions during the winter. The Huai River is used as a border dividing China into two heterogeneous parts, with the north of the Huai River being the northern part of China, which receives free heating in winter. South of the Huai River is without heating.
However, due to technological limitations, Chinese heating is mainly accomplished through the burning of fossil fuels, which results in significantly higher concentrations of air pollutants (PM
10 (Particulate Matter 10), NO
2 (Nitrogen Dioxide), SO
2 (Sulfur Dioxide), CO (Carbon monoxide), O
3 (Ozone), etc.) in northern China. A representative article once found that air pollutant concentrations in northern cities were twice as high as those in southern cities for the arbitrary Huai River policy. The influence of the Huai River policy on China is depicted graphically in
Figure 1.
2.3. Air Pollutant Exposure Assessment
Air pollutant data (including the average concentrations of PM10, NO2, SO2, CO, and O3) were collected from the China Environment Yearbook, which contains nearly-comprehensive statistics for 300 cities from 2016 to 2020. To construct a nested database, we solely selected air pollution data from 125 cities where city Id was available in the CFPS.
We chose PM
10 concentration as a proxy for air pollutants for the following reasons. In a pathological context, PM
10 is more likely to affect mental health than other pollutants. A large body of literature has confirmed that PM
10 inhalation exacerbates the development of systemic or brain-based oxidative stress and inflammation, as well as causing severe sadness and anxiety [
11,
12]. Additionally, PM
10 has a large diameter and is more likely to affect air visibility compared to other pollutants. Accordingly, it is more likely to affect residents’ mental health psychologically [
13].
Figure 2 illustrates the graphical relationship between city location and air pollutant concentrations using ARCGIS 17.2 (where the darker the color, the more serious the air pollution). The blue line denotes the Huai River boundary, and each color block shows the location of the city and the concentration of air pollutants. Significant differences in air pollutant concentrations can be clearly identified between the cities on the north and south sides of the Huai River boundary. We detail in
Table 1 the north–south disparities in PM
10, NO
2, SO
2, CO, and O
3 concentrations. It can be found that the concentrations of five major air pollutants in northern cities are significantly higher than those in southern cities. Furthermore, the discontinuous change of air pollutants partly justifies the RD design.
2.4. Mental Health Assessment
The data for mental health were from the mental health module of the CFPS database, which was divided into 6 questions, including the perceptions of depression, low self-efficacy, dyssomnia, upset, loneliness, and hopelessness, respectively. All items were taken from the standard CES-D questionnaire, and we summed them up to obtain an index indicating the mental health of individual Chinese residents, referring to the strategies of other psychology researchers [
14].
Similarly, the difference in mental health between northern and southern residents on the Huai boundary is graphically represented in
Figure 3, where the depth of the color indicates the severity of the resident’s mental problems. The Huai boundary results in a statistically significant decline in the mental health of the northern residents compared to that of their southern counterparts. Similarly, we systematically report North–South differences in mental problems in
Table 2. The results indicate that residents in the North generally suffer severer mental problems compared to their southern counterparts, and were statistically significant. Taken together,
Figure 1 and
Figure 2 demonstrate the discontinuous change in air pollution and mental health near the Huai boundary, which fully justifies the RD design.
2.5. Covariates
Our study includes two levels of covariates: (a) level 1 includes conventional individual demographic characteristics such as age, sex, residence, wage, education, employment, BMI (Body Mass Index), physical activity, and smoking habits (as shown in Panel 3 of
Table 1) and (b) level 2 includes provincial variables that have rarely been considered in previous studies, such as GDP, per capita GDP, proportion of secondary industry, temperature, and wind (as shown in Panel 2 of
Table 1). Relevant studies have revealed that the characteristics listed above have statistically significant effects on mental health [
15,
16,
17]. Level 1 covariates are from the CFPS individual self-reported questionnaire, and level 2 covariates were compiled through a combination of the statistic from the World Meteorological Association and China National Bureau of Statistics.
2.6. Statistical Analysis
We used three approaches to assess the relationship between air pollutant concentrations and mental health. The first approach was to use the traditional ordinary least squares estimation as Equation (1):
where
Xi,j denotes the air pollutant concentration in city
j where individual
i is located,
Zi is a vector indicating the covariates affecting mental health,
Mi represents the mental health of individual
i, and the coefficient γ
1 indicates the impacts of air pollutant concentration on mental health. Consistent estimates of γ
1 require that unobserved mental health influences do not covary with
Xi,j after adjusting for
Zi. However, this condition is hard to satisfy in practice because of omitted variables and sample self-selection problems. More importantly, we cannot use traditional OLS (Ordinary Least Squares) regression because our dependent, independent, and covariate variables are clearly nested data.
Therefore, we used the Hierarchical Linear Models (HLM) to further estimate the relationship between air pollution and mental health as in Equation (2):
where
πi,j is the slope of individual-level variables for individual
i in province
j and
θi,j is the slope of provincial variables for province
j. γ
0,1 is a vector that denotes the coefficient between air pollutant concentrations and mental health based on HLM. However, although HLM can analyze nested data, it cannot cope with the endogeneity problems caused by omitted variables, reverse causality, and sample self-selection, which are not negligible in estimating the relationship between air pollution and mental health.
The RD design is ideal for eliminating endogenous problems and inferring a causal relationship between air pollutant and mental health. This study’s RD design utilizes the discontinuous variation in air pollutant concentrations and mental health as one crosses the Huai River boundary. The necessary precondition is that any covariates affecting mental health are smooth around the boundary. Adjustment for a sufficiently flexible polynomial in distance from the river will eliminate all potential causes of bias and provide the following causal inferences if the precondition is met:
Equation (3a,b) represent the RD estimates of air pollutant concentrations and mental health for year t in city j of individual i, respectively. Xi,j,t and Yi,j,t denote air pollutant concentrations and mental health in city j of individual i in year t. Nj is the driver variable, taking values of 1 and 0 to indicate that city j is on the northern or southern side of the Huai River boundary, respectively. f(Lj) is a polynomial used to assess the distance of city j from the boundary. Cj,i,t refers to all province-level, individual-level covariates affecting mental health in year t. Xi,j,t, Yi,j,t, and Cj,i,t are panel data, simultaneously.
Furthermore, we used the RD design to infer the causal relationship between air pollutant concentrations and mental health. More Specifically, we choose an appropriate bandwidth so that the Huai River boundary only affects mental health through air pollutant concentrations. Then, it was valid to treat Equation (3a) as the first stage in a two-stage least-squares (2SLS) system of equations. In this way, 2SLS estimation could assess the causal relationship between air pollutant concentrations and mental health as in Equation (3c):
By combining Equations (1)–(3), we can obtain an unbiased causal effect of air pollutants on mental health, excluding the endogeneity problem of omitted variables, confounders, reverse causation, and sample self-selection.
3. Results
3.1. Study Population Description
Table 3 presents a summary of numerous associated factors and evidence for the validity of the RD design. The means of key variables for the full sample, north, and south residents are shown in Columns (1)–(3), respectively. Column (4) shows the average difference of key indicators among the North and the South. Similar results are reported in Column (5), adjusted for a cubic polynomial in degrees north of the Huai River. It is a test for a discontinuous change at the Huai River boundary. An essential prerequisite of the RD design is that the covariates change continuously around the boundary, and Column (5) indicates that, after adjusting for a cubic polynomial in degrees north of the Huai River, there is no significant difference in the covariates (Panel 3 in Column (5)). However, it is also a desirable result if the covariates exhibit a smooth change around the boundary. Column (6) reports the
p values associated with the tests, and that the heterogeneity in Columns (4) and (5) are equal to zero. Taken together, the results of
Table 1 prove that the covariates change smoothly around the boundary, and the air pollutant concentration and mental health exhibited a discontinuous variation near the Huai River boundary.
Two preliminary results are clearly presented in
Table 1. First, significant heterogeneity of air pollutant and mental health exists among southern and northern residents, which may indicate a potential correlation between these two factors. Second, there are significant differences in several demographic characteristics (e.g., wage, education, BMI) between the residents north and south of the boundary after accounting for the cubic polynomial in latitude. The above findings systematically fulfill the essential prerequisite of the RD design and make it possible to infer the causal relationship between air pollution and mental health.
3.2. Graphical Analysis
Figure 4 methodically displays the graphical results used to examine the RD design’s validity. The left half of
Figure 4 represents the average air pollutant of each city in relation to their latitude north of the Huai River boundary. The horizontal coordinate takes the value of 0 to indicate the Huai River boundary, and each point represents the means of air pollutant (in
Figure 4) across locations within a standard bin from the boundary. The discontinuous change in air pollutant on the boundary of the Huai River indicates that the arbitrary heating policy has led to a 11.4-(μg/m
3) increase in PM
10 concentrations in the North relative to the South.
The difference in mental health between northern and southern residents on the Huai boundary is graphically represented in the right half of
Figure 4. The Huai boundary causes a 0.47-unit decrease in the mental health of the northern residents compared with their southern counterparts. Taken together,
Figure 4 demonstrates the discrete variations in air pollutants and mental health near the Huai boundary.
In addition,
Figure 5 shows graphically the validity of the RD design by charting expected mental health differences (calculated as the fitted value from a hierarchical linear model (HLM) regression of mental health on all covariates except PM
10) at the Huai boundary. Notably, all covariates cumulatively explained 33.05% of the change in mental health. After removing the impacts of air pollution, it is evident that mental health is practically equivalent north and south of the border (
p value of predicted mental health among the northern and southern residents of the border is 0.32, and statistically insignificant). Taken together, the above findings evidently prove that the potential covariates can’t explain the discontinuous decline in mental health north of the boundary, and uncontrolled air pollutants may be the cause of mental health differences between the North and South.
3.3. Regression Results of Air Pollutant Concentrations and Mental Health
Table 4 reveals the correlation between PM
10 and mental health using OLS and HLM. The set of covariates is shown in the last two rows of the table. A biased conclusion that PM
10 concentrations positively predict mental health will be drawn if we only take the first two columns into consideration, as the mental health of the population of a province is affected by its economic and medical situations. The two-level HLM model in Column 3 clearly reveals that a 100 μg/m
3 increase in PM
10 concentration results in a 0.4-unit decrease in Chinese individuals’ mental health, and is statistically significant (95% CI: 0.001, 0.007).
Table 5 displays the RD regression from the estimation of Equation (3a,3b) in panel 1 and the 2SLS results from Equation (3c) in panel 2. Column 1 introduces the cubic in distance between the location of the residents and the Huai boundary to account for possible deviations, and it limits the sample to locations within 2° latitude of the boundary to improve the accuracy of the RD estimation. Latitude is estimated by the first-order polynomial. Column 2 is identical to column 1, with the exception of the usage of a second-order polynomial to estimate the latitude. For robustness, the findings for 5° and 8° latitude are modeled with the first and second order polynomials, respectively, as shown in Columns 3 through 7.
Panel 1 of
Table 5 shows a statistically significant difference in air quality and mental health between the southern and northern residents near the Huai boundary after adjusting for the covariates. The Huai River boundary resulted in a discontinuous increase in PM
10 concentrations in northern Chinese cities by 11.60–27.28 μg/m
3, which is similar to the annual PM
10 concentration in Australia. Meanwhile, a result of the arbitrary heating practices, the mental health of residents to the north of the boundary has decreased by 0.11 to 0.74 units relative to those to the south.
Panel 2 reports the results of 2SLS in RD design. Column 1 presents the optimal RD model selected according to the AIC (Akaike information criterion) and BIC (Bayesian Information Criterion) optimality principle. Columns 2–7 provide evidence of robustness by correcting for polynomial order and latitudinal distance, respectively. Column 1 reveals that a 100-μg/m3 increase in PM10 concentration leads to a 4.9-unit decrease in the mental health of Chinese residents (SD = 0.07, p < 0.05), equivalent to 36% of the average of Chinese residents.
The heterogeneity of the results across the demographic characteristics is presented in
Table 6. Columns 1 and 2 reveal that the negative impact of air pollutants on mental health is more pronounced for male residents (coeff = 1.37, SD = 0.10,
p < 0.05), compared to female residents (coeff = 0.42, SD = 0.04,
p < 0.05). This is because more male residents work outdoors and are thus more likely to be exposed to air pollutants. Meanwhile, the damage to mental health from air pollutants is more pronounced in the smoking population (coeff = 1.73, SD = 0.46,
p < 0.05) than in the non-smoking population (coeff = 0.83, SD = 0.29,
p < 0.05). Thus, smoking positively moderates the negative mental health impacts of air pollutants.
3.4. Sensitivity Analysis
To prove the robustness of the RD design, we further used alternative approaches. We tested the validity of RD in the following two ways. Firstly,
Figure 6 depicts the correlation between sample density and distance from the Huai River boundary. No significant differences in sample density can be found within a standard bandwidth (h = 5) north and south of the Huai River boundary. This conclusion indicates that the distribution of the drive variable (the distance north from the boundary) is completely random around the breakpoint. Secondly, RD designers pointed out that the likelihood of sample manipulation increases as the sample approaches the breakpoint. Using the donut hole test, we next determined if the results remained statistically significant after deleting 1%, 2%, and 3% of the samples near the breakpoints.
Figure 7 clearly reveals that the coefficients remain statistically significant after removing 1% (95% CI: 0.47, 2.21), 2% (95% CI: 0.16, 3.11), and 3% (95%CI: 0.14, 3.54) of the samples near the breakpoints, respectively, which indicates that our samples were not manipulated near the breakpoint.
4. Discussion
The results of the RD regression indicate that a 100-μg/m
3 rise in air pollution leads to a 49-unit decrease in mental health (SD = 0.07,
p < 0.05). This estimate is ten times larger than results using the conventional HLM. The bias is due in part to sample self-selection, since people with better mental health may prefer cleaner air and choose to move to areas with better air quality, and this self-selection behavior confounds the estimation. In addition, county fixed effects will absorb permanent variations in economic activities across counties, but cannot absorb time-varying differences within counties, which might still bias the estimates downward. Meanwhile, this is approximately 1.25 times the estimated impact of PM
2.5 on the mental health of the Chinese population from Chen’s representative research [
18]. The differences in the results of the two studies stem from three aspects: (a) PM
10 is more sensitive to induce the dysregulation in cytokine signaling and result in the occurrence of mental disorders; (b) the data from the study of Chen et al. are from 2014–2015, whereas we used the longitudinal data of 2016–2020, a period when Chinese residents were more sensitive to air pollutants for the unpredictable epidemic; and (c) this research uses an RD method, which can simulate random distribution near breakpoints and assess the causal effects between air pollutants and mental health.
Several European, American, and Australian researchers have also assessed the relationship between air pollutants and mental health using their own national databases, allowing for a cross-country comparison [
19,
20,
21]. It turns out that PM
10 has the most devastating impact on the mental health of China’s residents (who are exposed to the most polluted air in the world) compared to residents of other countries. This conclusion implies that the effect of air pollutants on mental health is a concave function, i.e., the negative impact of air pollutants on mental health is more pronounced in severely polluted areas.
A natural concern related to the RD design is that the Huai River heating policy might influence the behavioral patterns of Chinese residents and induce bias in our estimation. This policy may promote mental health because northerners are likely to spend more time indoors and be exposed less to air pollutants. However, mental health could also be hindered if more time spent indoors leads to a stronger addiction to cigarettes and alcohol. Moreover, this free heating policy saves certain living costs and increases disposable income for northerners, which may further be spent on mental-health-benefitting (e.g., access to a psychiatrist, increased health insurance) or -harming (e.g., gambling, sex, drug use) behaviors and activities. The above situations could all lead to biased causal effects estimated by the RD design. Eventually, the evaluated effects of air pollutants on mental health should be interpreted with these constraints in mind, given the lack of data to test for these behavioral responses.
Unfortunately, it is hard to take the COVID-19 epidemic into our consideration, as the CFPS did not address the relevant individual characteristics. However, with China’s nationwide lockdown policy, we think that the impact of the epidemic on the causal relationship between APCs and mental health is negligible, especially with the RD design. More specifically, the impact of the epidemic on the mental health of all Chinese residents is almost homogeneous. That is, the impact of the epidemic on residents’ mental health will be the random error term in our regression ( in Equation (3c)).
Additionally, there are several limitations to be interpreted carefully: (a) as with the RD design, the fixed effects are not taken into our consideration. Although consistent with the traditional approach of contemporary empirical researchers dealing with breakpoint regressions on panel data, this may cause some estimation bias; (b) although containing 125 cities distributed on both sides of the Huai River boundary, data used still do not cover the entire potential sample; and (c) research design failed to account for the spatial autocorrelation between cities, which might cause several endogeneity problems. Despite the above limitations, this research is still theoretically and empirically significant, with an accurate and robust assessment of the relationship between air pollutants and mental health.
Results of this research could endorse Chinese ongoing policy and investment in pollution treatment. Because of arbitrary heating policies, China has an additional 3.5 million mentally unhealthy residents, the equivalent of the entire population of Los Angeles, California. Fortunately, since 2020, the Chinese government and environmental NGOs have been strongly appealing to residents to reduce carbon emissions. Empirical evidence indicates that it is vital for China’s government to strengthen its advocacy for a low-carbon lifestyle.
This paper is pioneering in attempting to apply an RD design to infer the causal relationship between air pollutant concentrations and mental health, and accordingly, provides several meaningful insights for future research. Researchers could compare the heterogeneous effects of air pollutant concentrations on the mental health between populations in developing and developed countries, as more cross-national studies based on RD designs are extremely meaningful. Moreover, future research could explore the potential relationship between air pollutant concentrations and mental health with different models such as the structural equation modeling model.
5. Conclusions
The results indicate that air pollutant concentrations significantly impair the mental health of Chinese residents, while the robustness of the results is ensured by changing the RD bandwidth and polynomial order and two unique sensitivity analyses. Meanwhile, the effect of APCs on mental health was more pronounced in male residents (compared to that in female residents), and smokers (compared to non-smokers). Additionally, we compared the results of the study with other literature and found that the effect of air pollutants on mental health is a concave function, i.e., the negative impact of air pollutants on mental health is more pronounced in severely polluted areas (such as China). More specifically, rising air pollutant concentrations due to arbitrary heating policies (shaping the Huai River boundary) have reduced the mental health of Northern Chinese residents by 0.49 units, equivalent to 3.6% of the average Chinese residents. These findings give empirical basis for the Chinese government to invest more in combatting air pollution and ensuring the mental health of its citizens.
Acronyms
RD | Regression Discontinuity |
OLS | Ordinary Least Squares |
HLM | Hierarchical Linear Model |
BMI | Body Mass Index |
APCs | Air Pollutant Concentrations |
AIC | Akaike Information Criterion |
BIC | Bayesian Information Criterion |