1. Introduction
The rapid growth of China’s GDP since 1978 has made it the second-fastest-growing economy in the world in recent times [
1]. However, environmental problems have become increasingly serious, with 135 Chinese cities exceeding air-quality and pollution standards and 34.0% experiencing acid rain in 2020, an increase of 0.7 percentage points from 2019 [
2]. Preventing and controlling pollution and protecting the economic environment have become urgent tasks. The emissions trading system is the first large-scale market-based environmental regulation policy in China. It plays a regulatory role in resource conservation, environmental protection, and green development for environmental resource allocation, mainly using market mechanisms. Emissions rights trading originated in the United States, and the American economist Coase formally proposed pollutant emissions rights in 1960. China’s emissions rights began to be explored early on, but progress has been slow [
3]. Since 1987, emissions trading has been carried out in several pilot cities, one after another. However, problems with the secondary market still persist for local emissions trading and when setting benchmark prices for emissions trading in China [
4]. The effect of emission reduction trading on industrial
reduction has long been debated, so this issue in China is a fascinating and crucial subject to study.
China launched the “4+3+1" program in 2002, but due to the early implementation and lack of practical experience at that time, many pilot regions had zero trading volume. The pilot program was further expanded in 2007, and the pilot system was improved. Each pilot region introduced its own policies to suit the local conditions, and the scope and scale of the transactions also expanded [
5]. Therefore, this paper constructs panel data for 30 provinces from 2001 to 2019 based on the 2007 pilot policy as a quasi-natural experiment. We use 11 approved provinces, such as Jiangsu and Tianjin, as the treatment group and 19 other provinces as the control group and adopt nonlinear
to evaluate the following questions: Has China’s first market-based environmental regulation policy had a positive effect on its emissions reduction? What is the extent of its emissions reduction? Are there differences in the policy treatment effects for the treatment and control groups? Is the implementation of the policy more effective for enterprises or regions with higher
emissions intensities?
2. Literature Review
Much research has been conducted on the emissions trading policy. Montgomery (1971), Woerdman (2000) demonstrated the policy’s feasibility and studied the relevant policy implementation systems, indicating that the policies should be implemented based on environmental conditions. Pollution discharge fees should be increased for systems with higher levels of pollution [
6,
7]. Cason (2003) proposed that the strict management of emission enterprises is insufficient to promote emissions trading effectively. At the same time, it is necessary to strengthen government oversight and emissions management [
8]. Rausch and Abrell (2017) investigated a hybrid emissions trading system based on the uncertain corporate emissions reduction costs and future emissions. The findings show that a hybrid policy that introduces price or reduction boundaries allows for hedging differences in marginal reduction costs across subregions [
9].
With regard to the emissions reduction effect of the emissions trading policy, many studies have been conducted in the related literature. Most scholars believe that the emissions trading policy is effective in reducing emissions. Yan et al. (2012), Li et al. (2016), and Zhang et al. (2017) conducted quasi-natural experiments with pilot implementations of the 2002 emissions trading policy in six provinces and cities of China. Their analyses discovered that the emissions trading policy is able to improve pollution reductions [
10,
11,
12]. Using the
model, Tu et al. (2018) discovered that the emissions trading system not only reduces pollutant emissions but also aids in economic development [
13]. Tang et al. (2017) discovered that the emissions trading policy helps to reduce
emissions while reducing corporate profits [
14]. Wu et al. (2018) discovered a weak “Porter effect” from emissions trading in China [
15].
The
DID method is the most commonly used method in this field. For example, Fu et al. (2018) and Wu et al. (2021) used the
method and the
method to find that the efficiency of green development can be improved by the
emissions trading policy, which has little effect on promoting green development [
16,
17]. Applying the
method, Shi et al. (2020) discovered that the system’s implementation positively impacts energy consumption and improves the green total factor energy efficiency [
18]. Ren et al. (2019) and Fu et al. (2019) used the
method to evaluate the effects of further increasing the policy pilots after 2007. They found that the implementation of the policy has shown positive effects for both
emission reduction and economic growth, which is a win–win in terms of environmental and economic goals [
19,
20]. Si et al. (2020) found significant emission reduction effects of emissions trading policy on pollutants
,
, and
[
21]. Qi et al. (2020) evaluated the effects of emissions trading policies by using the
method. The results showed that there was a significant emission reduction effect of policy on industrial
and industrial wastewater emission results, but there was no significant contribution to green development in the short term ([
22]. Yu et al. (2021), Cai et al. (2022), and Luo et al. (2018) conducted studies on carbon trading rights, demonstrating that carbon trading in China exhibits upward and then downward trends for the development of carbon technical efficiency [
23,
24,
25]. With the use of Support Vector Regression (SVR), Linear Regression (LR), and Analytic Hierarchy Process (AHP), Hong et al. (2022) conducted data mining and the evaluation of pollutant emissions, market prices, marginal benefits of emissions rights, and initial emissions rights [
26]. Qi et al. (2021) implemented an emissions trading policy using multi-period
, which indicates the policy significantly increased Chinese enterprises’ OFDI [
27]. Using the
model, Wu et al. (2022) discovered that actions towards the prevention and control of air pollution can significantly improve air quality in vital resource-based cities [
28]. Zhang et al. (2020) used the
model to examine the impact of the emissions trading system on environmental efficiency. The results indicate that the policy has significantly improved the environmental efficiency in these pilot provinces [
29]. Li et al. (2022) explored the impact of an emissions trading system (ETS) on the regional industrial structure. Their results show that the implementation of ETS can promote upgrades in the regional industrial structure but can hinder the rationalization of these upgrades [
30].
Another body of literature contends that emissions did not decrease with the implementation of the emissionstrading system. Zhang et al. (2020) used the
method to study the environmental and employment impacts of emissions trading policy. They found that piloting the policy did not improve the environment, but significantly increased employment levels [
31].
As mentioned by Xiang et al. (2016), there is a growing body of literature on natural experiments on policy implementation using the
method due to its simplicity and ease of implementation. However, this method also has many shortcomings. One of the biggest criticisms is that it only elaborates on linear relations well and no longer works for nonlinear relations. When using the
method, some studies have obtained biased results because of ignoring the strict assumptions required [
32]. Another shortcoming is that the method can only give average treatment–effect estimates and not other quantile estimates. Athey and Imbens (2006) developed a nonlinear
method, referred to as “changes-in-changes” (
), to assess the impact of policy changes that replaces the widely used
model. Unlike traditional methods, the authors proposed a nonlinear model that allows unobservable effects to change over time. This nonlinear
method estimates the entire counterfactual distribution of outcomes between the treated and untreated groups [
33]. Lucas and Mbiti (2008) investigated the relationship between Kenya’s compulsory primary education policy and entrance examination grades using the nonlinear
model. The inhibitory effect of student attendance was significant at the 1% level, with a positive but small effect at the middle quantile level and a positive effect at the higher quantile [
34].
The literature on the effect of emissions trading policies on emission reductions has three main characteristics. First, the current research primarily employs the method, but in actual natural experiments, meeting the strict requirements of the method is challenging. However, the results are assumed to be accurate. Second, current literature assumes that the application of the policy will have the same treatment effect on various groups, even though various experiments should be conducted to account for individual differences. Third, while all previous studies have been able to determine the average treatment effect of the policy, they have not been able to determine the distribution treatment effect. However, the distribution treatment effect explains which geographic areas the policy is effective for. In this study, we construct panel data of 30 provinces during the period from 2001 to 2019. The 11 approved provinces of emissions trading policy are evaluated as the treatment group, and the other 19 as the control group using the nonlinear model. The significance of this work is as follows. (1) We use the nonlinear method to overcome the problem of errors caused by the fact that natural experiments do not satisfy the strict assumption of . (2) Due to differences in the characteristics of different groups, we expect a different effect. (3) The treatment effect is further obtained to analyze whether a more obvious emissions reduction effect occurs with the implementation of the policy in regions with higher emissions intensities.
3. Materials and Methods
3.1. The Classical DID Method
For
, the data are presented as
. We use the indicator
to denote the treatment group (
) and the control group (
), where
is a treatment variable with
if individual
i receives treatment, and
if individual
i does not receive treatment. We use
to denote the outcome and
and
to denote the outcome of individuals with and without treatment, respectively. Let
; we have
In the classical
model, the outcome of individual
i without the intervention satisfies the following:
where
and
represent the time effect and group effect, respectively, and
is a random error with
. Classical
estimates are obtained using the following:
3.2. The Nonlinear Difference-In-Difference Method
The
method usually focuses on the average causal effect of the treatment group. As the policy effect varied with unobservable features and the distribution of individuals in the group are different, the average treatment effect on the two groups is not the same. Although the classical
method requires relatively few assumptions to calculate the effect of policy interventions on the treatment group, the effect of policy interventions on the control group is rarely considered, which is a disadvantage of the classical
method. Therefore, there is controversy in the literature about the results of using classical
method. Athey and Imbens (2006) proposed the changes-in-changes (
) model—a nonlinear
method—for estimating the effects of policy interventions on control groups. In particular, the method assumed that a treatment effect depends on the unobservable characteristics of an individual rather than directly on the population. The distribution of treatment effects differed due to the different distribution characteristics of the groups [
33].
3.2.1. The Changes-In-Changes Model
First, a random variable
is introduced to represent the region’s unobservable characteristics in order to better identify the
model. To simplify the analysis, we drop the the subscript
i and use
to denote the vector of random variables. Then, we define the following:
where
, and
represents the treatment indicator.
and
denote the corresponding distribution functions.
represents the untreated subgroups, which are subgroups other than
. At the same time, a set of hypotheses about the distribution of the counterfactual status of the treatment groups in terms of the second-period outcomes are analyzed. Therefore, we can use the simultaneous distribution of the observable values
to express the distribution of
. Indeed, these findings suggest that
can be expressed by conditional outcome distributions about the other three subpopulations
, and
, which are observable.
Assumption 1 (Model): In the absence of an intervention, the outcome of an individual satisfies the relationship , where h is an unidentified, nonlinear function. Since h does not change as the group changes, differences between groups result from the distribution of U. To identify with this model, the following hypotheses are required.
Assumption 2 (Monotonicity): For , the production function strictly increases in U.
Assumption 3 (Time Invariance): The structure of a group remains constant over time. Given a group G, U as an unobservable variable is time-stationary; that is, .
Assumption 4 (Support): .
Different from the
model, the classical
model has two additional assumptions:
that is,
. As a result, the classical
model is a special case of the
model, which relaxes and generalizes the classical
model’s assumptions.
Assumptions 1–3 are collectively referred to as the model, and the authors propose that Assumption 4 be invoked selectively as needed. According to Assumption 1, the results are not obtained directly depending on the group, and all associated unobservable variables must be encapsulated in a simple indicator U. According to Assumption 2, higher unobservable values must match higher outcomes. Weak monotonicity is only a standardization in a given subpopulation. Since we assume that unobservable values cause higher values in both periods, it is only limitative. When the unobservable variable is a personal characteristics such as health or ability, this structure naturally arises. Additive models allow for the existence of rich non-additive structures and can satisfy strict monotonicity automatically. As the outcome is continuous in models, the difference between strict and weak monotonicity is minimal. Furthermore, this assumption is overly restrictive if particles exist in the distribution. Assumption 3 requires that the population of a given group remains constant over time. The and methods are based on the assumption that the differences between groups must be stable to achieve the state of affairs in which one group can be used to eliminate trends in the other. Assumption 4 implies that and , and this assumption is relaxed in the authors’ subsequent inferences.
3.2.2. Treatment Effect on Treatment Group
Based on the above assumptions, the counterfactual distribution can be obtained after the following derivation:
where the distribution function of
is unobservable but we can derive the other three distribution functions from the data. Using these three observable distribution functions, we can obtain the unobservable distribution function. The average treatment effect
can be written as follows after a series of transformations:
Clearly, no assumptions are made about the specific form of the h function in the preceding process, so the entire identification process is non-parametric. Moreover, the effect can be estimated using the sample mean and actual distribution. In other words, the model does not require assumptions as strict as those of the model and generalizes the model.
3.2.3. Treatment Effects on the Control Group
To date, we have constructed a model with no intervention. In the presence of an intervention, no outcome model is required to infer the effect of the policy change on the treatment group, that is, the effect of the "treatment" on the treated. We simply need to compare the treatment group’s actual outcome to the counterfactual. However, more structure is needed to analyze the effect of policy intervention on the control group.
First, note that the difference between the counterfactual distribution of
and
is qualitative. After all, three subgroups do not receive treatment, all of which could be used in determining the outcome distribution without treatment in a fourth subgroup. Though only one subgroup received the treatment, we still want to know the distribution of
. Therefore, we can build a transformation based on group 1 and apply it to
, that is, assuming within a group that the distribution of
U does not change over time. More specifically,
,
, and
Because the distribution of
remains constant over time, for
,
The roles of group 0 and group 1 in
are simply switched. Following this logic, we apply the method described previously to calculate the counterfactual distribution of
. In other words, replacing
G with
and taking Assumptions 1–3, we assume that
, where
increases strictly in
U. Then, we can determine the distribution of
from the distributions of
,
, and
on the restricted branch set
as follows:
3.3. Model Selection
To study the effect of emissions trading on industrial emissions reduction, in this paper, we use the model. To be more specific, we begin with . In this case, U represents the industrial emissions intensity, which is demonstrated by the regional emissions trading policy at period 0 after accounting for other control variables. Because of the impact of heterogeneity between groups and the effects caused by the emissions trading policy at period 0, the distribution of should have a discrepancy between different groups. In addition to the assumption that U is normalized in this case, the application of the model must satisfy two assumptions. First, the distribution of U among the group does not change with time. Because U represents the region’s characteristics, changes in the emissions trading policy cannot cause regions to adjust quickly and should not influence whether to implement the emissions trading policy. This assumption is reasonable. Second, in the absence of policy interventions, the resultant functions of the two groups should be the same. This assumption excludes the relationship between the industrial emissions intensity and the pilot area’s emissions trading policy over time. On this basis, two additional assumptions are required by the more stringent model. (1) The main difference between the pilot and non-pilot regions of the emissions trading system is the industrial emissions intensity. (2) The additive effect is the same for all individuals over the process of change over time. Based on natural quasi-experiments, these assumptions are difficult to prove. The distribution of industrial exhibits different shapes with different groups and over different time periods, so we use the model here to investigate the effect of the emissions trading policy.
When applied to the problem studied in this paper, the treatment is the emissions trading policy. The outcome variable Y represents the emissions intensity; G indicates whether the provinces, cities, or regions are pilot areas for emissions trading, presented as either 1 or 0; and T represents the time indicator variable of whether the pilot policy of emissions trading begins to be implemented, and is 1 if the start date is 2007 or later and 0 otherwise.
3.4. Variable Selection and Data Description
3.4.1. Explained Variable
The industrial emissions intensity (lnpso2) is expressed as industrial /gross industrial output (tonnes/billion CNY), providing a more accurate description of industrial emissions.
3.4.2. Core Explanatory Variables
Time, the dummy variables for each group, and the interaction terms of dummy variables for the policy implementation provinces and the policy implementation time are the main explanatory variables. Treat denotes the dummy variable for policy implementation provinces. If the province is one of the 11 provinces approved by the Ministry of Environmental Protection in 2007, the dummy variable Treat takes a value of 1; otherwise, it takes a value of 0. For the time dummy variable (Time), a start date of 2007 or later takes a value of 1, and that before 2007 takes a value of 0. The primary variable of interest is the interaction terms between the two dummy variables: Treat*Time.
3.4.3. Control Variables
The factors influencing the industrial emissions intensity are controlled as thoroughly as possible based on data availability. The following 18 control variables are chosen: population size, industrial pollution control, investment in industrial waste gas control, sewage charges, economic development level, wage level, regional technological innovation level, technology introduction, technology level, fixed assets, industrialization level, industrial structure, scale of the service industry, degree of marketization, number of employees, scale of energy consumption, education development, and fiscal decentralization. At the same time, to carry out the relevant research in this paper, the industrial emissions intensity, economic development level, wage level, regional technological innovation level, technology introduction, technology level, fixed assets, population size, investment in industrial pollution control, investment in industrial waste gas control, sewage charges, scale of energy consumption, and number of employees are all logarithmically treated based on the actual data.
By summarizing all the variables used in this paper,
Table 1 illustrates each variable’s definition and the data source.
Descriptive analysis is performed on all variables, and
Table 2 describes the statistical analysis of the data.
3.5. Random Forest to Select Control Variables
There are two methods for ranking the importance of control variables in random forests, mainly based on mean decrease impurity and mean decrease accuracy. Mean decrease impurity means that for each tree, the control variables are ranked according to their impurity, and finally, the whole forest is averaged. Accuracy reduction is achieved by reordering the values of a column of features and observing how much it reduces the accuracy of the model. It directly measures the effect of each feature on the prediction accuracy of the model. For significant control variables, this method reduces the accuracy of the model significantly, while for insignificant control variables, this method has little effect on the accuracy of the model. Correspondingly, the two representations of feature importance ranking in the regression tree are %IncMSE and IncNodePurity. %IncMSE is the increase in MSE. Specifically, each control variable such as is assigned a random value. If is important, the prediction error will increase. So the increase in error is equivalent to the decrease in accuracy. The same is true for IncNodePurity. In the regression problem, node purity is actually the reduction of RSS (residual sum of squares). Node purity increase is equivalent to the reduction of the Gini coefficient; that is, the data in the node or classification are the same, known as Mean Decrease Gini.
Because the dependent variable in this paper is a continuous variable, a random forest regressor is constructed using the R package random forest. The number of trees used after parameter tuning is 500, and the leaf node defaults to one-third of the control variables. Based on the already constructed random forest regression model, the 18 control variables that influence the industrial emissions intensity can be ranked in order of importance. Then, the cross-validation method is used to select the top-ranking control variables, which have a greater influence on the industrial emissions intensities.
Figure 1 depicts the importance ranking of the 18 control variables in this paper by using random forests. The term “%IncMSE” represents the increase in mean squared error. By substituting random variables at random and observing the error in the model after the substitution, the variable becomes significant as the error increases. That is, the significance of the control variable is proportional to its magnitude; “IncNodePurity” denotes an increase in node purity. The sum of squares of residuals is used to calculate this indicator. Similarly, the magnitude of this variable is positively correlated with its importance. We discovered a discrepancy in the order of importance of control variables based on the different criteria of “%IncMSE” and “IncNodePurity.” This paper chooses the first few control variables with “%IncMSE” as the standard based on the specific situation that affects the industrial
emissions intensity.
To improve the model’s performance, determining how many control variables are best suited for model prediction is critical. In this paper, the number of control variables to be added to the model is determined by ten-fold cross-validation. The smaller the cross-validation error, the better the model fits. At the same time, the number of control variables is optimal.
Figure 2 depicts a ten-fold cross-validation graph, with the horizontal axis representing the number of feature variables and the vertical axis representing the cross-validation error. We discover that adding more irrelevant control variables does not improve prediction accuracy, so having more control variables is not better. The cross-validation curve above clearly shows the relationship between model error and the number of control variables used for fitting. The error increases when the number of control variables is 0–2, but it gradually decreases when the number of control variables is 2–5. It is probably smaller when the number of control variables is 5. As the number of control variables increases, the error becomes larger. As a result, we select five control variables to minimize error and achieve the ideal state. Therefore, using “%IncMSE” as the control variables sorting criterion, the top five essential control variables are chosen to add to the model, which are wage level, energy consumption structure, regional technological innovation level, foreign direct investment level, and education level.
4. Results
This paper evaluates the emission reduction effect of the emissions trading policy on industrial . The difference in the policy treatment effects between the treatment and control groups is also confirmed. Moreover, the study illustrates the effect of policy implementation on emissions reduction for enterprises or regions with higher emissions intensities.
4.1. Empirical Analysis of the Effect of Industrial Emissions Reduction
We use a random forest approach to select the control variables needed in the model, which are wage level, energy consumption structure, regional technological innovation level, foreign direct investment, and education level. Previous studies assumed that policy implementation had the same treatment effect on the treatment and control groups. However, due to individual differences, the treatment and control groups had different treatment effects. This paper primarily employs the model based on relevant panel data from 30 provinces and cities from 2001 to 2019 with the command in STATA16.0 to make an empirical analysis.
Table 3 shows the empirical results for the estimation of the policy effects, and the values in parentheses are the estimated standard errors obtained after 50 iterations of the self-help algorithm. With the highest importance as screened by random forest, these results are obtained by sequentially including the top five control variables, which are wage level, energy consumption structure, regional technology innovation level, foreign direct investment level, and education level.
From the coefficients of the control variables, the magnitude of the coefficient of wage level is −0.6109, which shows that an increase in wages reduces the emissions of industrial
. Wage growth indicates an improvement in the enterprises’ development level. To pursue higher-quality development, enterprises will pay more attention to the scale of pollutants generated and minimize the pollution of the environment while gaining benefits. The state can establish relevant policies that aid in the development of enterprises and raise the development level of enterprises to reduce emissions. The coefficient of the energy consumption structure is 1.6652, which means that more pollutants will be released as it rises. This will have a negative effect on industrial
emission reduction. In order to limit pollutant emissions, coal should be burned as infrequently as possible and fossil fuel energy should be substituted with clean new energy. The regional technological innovation level coefficient is −0.2869. In this study, we analyze the number of granted patent applications as a proxy for the degree of innovation in the region. We find that regional technical innovation is positively correlated with the ability of industrial
emissions reduction. As regional technological innovation levels increase, businesses use new technologies to carry out production tasks and cleaner technologies to address pollutant emissions with the achievement of emissions reduction. Foreign direct investment has a coefficient of −0.0511. The level of technology introduction in this paper is represented by foreign direct investment. It can be seen in
Table 3 that foreign direct investment can promote the effect of industrial
emissions reduction, but only marginally. Foreign investment can improve production technology and increase technological investment, thus improving production technology and reducing industrial
emissions. The coefficient of education level is −0.1910, indicating that increasing the average education level has an inhibitory effect on pollutant emissions. Improving citizens’ education levels positively affects people’s quality of life, which promotes human capital accumulation and reduces industrial
emissions.
4.2. Distribution Effect
In terms of the average treatment effect, the emissions trading policy shows a significant suppression effect on the industrial emissions in both the pilot and non-pilot areas. Furthermore, the average treatment effect in the non-pilot areas is higher than that in the pilot areas. However, there are some subjective factors while selecting pilot areas. The higher the industrial emissions intensity, the more likely the pilot areas are selected. Therefore, the 10th and 90th percentiles are added to the quartiles to analyze at which stage the emissions trading policy is more effective for industrial emissions intensity.
As with the treatment in the previous subsection, the average treatment effect indicates that the emissions trading policy has a significant suppressive effect on the industrial
emissions intensity in the pilot areas. The distribution effect allows us to observe more precisely at which stage the industrial
emission intensity is positively affected by the policy implementation. As can be seen from
Table 4, the implementation of the emissions trading policy has a significant suppressive effect on the industrial
emissions intensity at 10 percentile. The absolute values of the contral group coefficients are larger than those of the treatment group, indicating that the implementation of the policy has shown a stronger effect on emissions reduction in non-pilot areas. Only for industrial
emission intensity at the 25th percentile, the treatment effect for the control group is negative at the 1% level, with a coefficient size of −1.1279. The coefficients for industrial
emissions intensities at the higher percentile do not pass the significance test.
When the control variables are added sequentially, the results show that the policy is more effective at the lower percentile and that none of the coefficients at the higher percentile pass the significance test. This indicates the implementation of the emissions trading policy is less effective for “dirtier” industrial enterprises or regions but more effective for the regions with lower emissions. However, these industrial enterprises or regions with higher pollutant emissions intensities are the main source of environmental damage. Therefore, the government should improve the market-trading mechanism in regions with lower industrial emissions to further implement and promote the policy. Supervision and management should be strengthened in regions with higher industrial emissions. In this way, the policy implementation can have significant effects on these enterprises, thereby achieving the goal of environmental protection.
6. Discussion and Conclusions
Using the panel data from 30 Chinese provinces and cities from 2000 to 2019, this paper employs the nonlinear DID method to analyze the of the emissions trading system’s effect on industrial emissions reduction. Compared with the classical DID methods, there are two major advantages of the nonlinear DID method: first, it relaxes the linear assumption, which does not always hold in practice. Second, the method can give estimates of the distribution effects in different quartiles, whereas the classical DID method can only give an average treatment effect. In particular, this paper evaluates the distribution effects of industrial emissions intensity at the 10th, 25th, 50th, 75th, and 90th percentiles. In addition, we verified the robustness by replacing the dependent variable with domestic emissions, advancing the policy implementation time and excluding any contemporaneous policy effects. The main conclusions of this study are summarized as follows:
First, a certain emissions reduction effect on Chinese industrial emissions occurs with the implementation of the emissions trading policy. Increases in wage level, regional technological innovation level, foreign direct investment, and education level have significant inhibitory effects on industrial emissions. The effect of the policy is slightly greater in non-pilot areas than in pilot areas.
Second, the effect of the policy is slightly greater in non-pilot areas than in pilot areas. That is, the treatment effect of the control group is slightly larger than that of the treatment group. In terms of distribution effects, the areas with lower pollutant emissions intensity take the better policy’s emissions reduction effect. However, some subjective factors influence the selection of the pilot areas. The areas with higher pollutant emissions are more likely to be selected as pilot areas. As a result, the policy intervention has been demonstrated to not have the same treatment effect on every individual.
Third, for both the pilot areas and the non-pilot areas, the implementation of an emissions trading policy has a significant emissions reduction effect on areas (enterprises) with low industrial emissions intensities. However, areas with larger pollutant emissions did not show a significant effect. In fact, the emission reduction effect of the policy diminishes as the emissions intensity increases. This also explains why the policy’s implementation has a slightly higher treatment effect on the control group than on the treatment group.
By reviewing the above conclusions, this study gives the following policy recommendations.
The government should improve the market trading system in the pilot areas, unify the market trading standards, and strengthen the supervision of policy implementation in the pilot areas. The implementation of any policy requires clear laws and regulations as well as implementation standards. Therefore, the government should introduce and clarify the relevant regulations of the emissions trading market and unify the quota-allocation methods in each province and city. Moreover, the pricing and evaluation standards related to trading should be enhanced to promote the fair promotion of the emissions trading policy in the pilot areas.
To ensure the smooth operation of emissions trading, the government should enforce relevant laws and regulations, strengthen the supervision of the emissions rights trading market in pilot areas, and impose appropriate penalties on illegal emissions units. In addition to the relevant regulations at the national level, the relevant regulations of provinces and municipalities should also be very clear to give full play to the positive role of the market in environmental management.
The current emissions-trading policy for enterprises or regions with higher emissions does not play a positive role in reducing emissions. Therefore, the government should strengthen the total volume control and quota allocation for areas with higher industrial emissions and raise the trading fees. Total volume control and quota allocations are critical to the foundation, which are prerequisites for the emissions trading market’s smooth operation. For regions or enterprises with different levels of industrial emissions intensities, a more reasonable total amount of pollutant emissions should be determined in combination with the local environmental capacity. Furthermore, a reasonable emissions reduction task should be prescribed according to the pollutant emissions intensity. At the same time, refining the list of enterprises and industries that conduct emissions trading and making a reasonable plan for the total amount of emissions in a targeted manner are suggested.