4.2.1. Industry Consolidation and Matching
This study attempted to analyze China’s industry-level carbon emissions, economic input and output, and technical data from 1985 to 2011. Since its publication in 1984, China’s National Economic Industry Classification Standard has undergone three revisions in 1994, 2002, and 2011.
These revisions have led to mismatches in the economic industry before and after classification. Therefore, it is necessary to merge the Chinese industry’s classifications over the years. Following Chen [
20], we matched industry classifications from 1985 to 2011. The technical data used in this study are from the U.S. Patent and Trademark Office (USPTO) patent application database, and there is no industry classification information in the USPTO raw data. Hsu et al. in [
21] and Bhattacharya et al. in [
22] matched the three-digit technical classification code of the USPTO patent with the United States double-digit industry code. United States industry classification criteria can be obtained using their classification methods. Then, according to the four-digit industry comparison between China and the United States, the two-digit industry in both countries was matched.
The matching results are listed in
Table A1. To explore whether technological progress and gap play different roles between light and heavy industries, this study refers to Chen [
20] to separate the two industry sectors. The US_SIC code values of 10, 12, 13, 14, 20, 21, and 22 (industry merge with original US_SIC codes 22, 23, 24), 24, 25, 26, 27, and 39 are regarded as light industry, and other industries are regarded as heavy industries.
4.2.2. Variables and Descriptive Statistics
First, we constructed the input and output data of China’s industry level from 1985 to 2011, including the total industrial output value, added value, capital stock, labor, and total energy consumption. This study refers to the method used by Chen [
20] to unify the industrial caliber, using 1990 as the base period for price deflation. The missing value is supplemented by linear interpolation. Ex-factory price indices of industrial producer (1985–2011) (preceding year = 100) that comes from the “2012 China Urban Life and Price Yearbook” are employed as an output deflator index. The capital stock is deflated by the price index for investment in fixed assets, and the data are from the “China Statistical Yearbook”. The total industrial output value data were obtained from the “China Industry Economy Statistical Yearbook”. Referring to Chen [
20], we obtained the industrial added value, capital stock, and labor data from 1985 to 2008.
Since the industrial value-added rate is not given in the 2009–2011 statistical yearbook, this study assumed that the industrial value-added rate is stable and uses the mean value of the industrial value-added rate in 2005–2007 to represent the industrial value-added rate in 2009–2011. We then used the industrial output value data of 2009–2011 to multiply the industrial added-value rate to calculate the industrial added-value data for 2009–2011 and then used the price index to convert the industrial value-added data into a sequence with a constant price of 1990, and the missing value was supplemented by linear interpolation. The capital stock of 2009–2011 was constructed as follows: First, we obtained the original value of fixed assets, net fixed assets, and accumulated depreciation data of the sub-sectors from 2008 to 2011 from the China Industrial Economics Statistical Yearbook. Then, we calculated the depreciation rate from these three data points. The difference between the original value of the fixed assets in the current year and the original value of the fixed assets in the previous year was used to construct the current annual investment amount; we used the fixed asset investment price index to convert it into a sequence based on 1990. Then, we obtained all caliber investment data at comparable prices. Finally, we used the perpetual inventory method to estimate the industrial capital stock. The total energy consumption data of the industry was obtained from the China Energy Statistics Yearbook, and the missing value was supplemented by linear interpolation.
Air pollution variable is the core variable of this paper. This paper used carbon emissions to represent pollution because greenhouse gases are an important part of air pollution caused by economic growth. There are many methods to measure carbon emissions. Some measurement methods pay attention to the carbon emissions caused by production [
23,
24], and some pay attention to the carbon footprint [
25]. This paper focuses on the carbon emissions directly related to a country’s economic growth. Therefore, the carbon emissions were calculated by using the energy consumption used in production to represent air pollution. This study used energy consumption data to estimate CO
2 emissions on the basis of the methodology provided by the United Nations Intergovernmental Panel on Climate Change (IPCC) Guidelines for National Greenhouse Gas Inventories. Since the National Bureau of Statistics adjusted the energy data in 2008, the data for 1995–2011 are based on the data reported in the China Energy Statistics Yearbook since 2008. At the same time, this paper summarized data from the China Energy Statistics Yearbook from 1985 to 1994. Using the above methods and data, this study estimated industry carbon dioxide emissions data from 1985 to 2011 in China. Missing values were supplemented using linear interpolation methods.
There were three methods employed to measure the technology gap: indicator method, index method, and frontier production function method.
The basic logic of using the indicator method to measure the technological gap is to select an indicator to measure the technical level, and then measure the technological gap between the sample points by using the difference in the indicator value of different sample points. The indicators to measure the technical level include production input–output indicators and R&D input–output indicators. The production input–output indicators mainly include labor productivity or total factor productivity [
26,
27,
28]. The R&D input and output indicators mainly include the number of patent applications and the proportion of R&D expenditure in GDP [
7,
21]. The index method such as superlative index method improves the traditional production input–output indicators. It is not restricted by the setting of the production function form and is used to calculate the total factor productivity [
28,
29,
30]. The core of the frontier production function method is to calculate the frontier and calculate the degree of inefficiency of the sample points compared to the technological frontier [
31,
32].
Among the three measurement methods, the index method and the frontier production function method need to use the traditional production input indicators of the world’s technological frontier. We want to obtain comparable data across countries and industries. There are doubts about the comparability of economic and social indicators among different countries, different industries, and different years. Therefore, we selected the R&D input-output index method in the index method to measure the technology gap. Referring to Hsu et al. [
21], we used the patent data of the USPTO to construct technology variables. Using patent information as a “technical” proxy indicator has several advantages: patents are open data, provide a wealth of information, include all countries and technology types, are relatively standardized inventions, and provide longer time-series data [
33].
In this paper, the technology gap was calculated as follows:
where
C denotes China,
i denotes the industry,
t denotes time, and the technology gap value is between 0 and 1; the larger the value, the smaller the gap between the industry and the world technology frontier level.
This study used the Harvard Business School patent inventor database [
34] to obtain patent data to proxy for the technology gap and technological progress. There is a certain time lag between patent approval and patent applications. In the literature, patent data from the previous period is usually put into the model as a technical variable [
22]. We used patent data from the Chinese industry level and from the world technology frontier. We calculated the patent number for each industry in each year on the basis of the year the patent is approved. The technical level of an industry in a certain year is the number of patents approved by the USPTO in the industry in the previous year. This study also used this dataset to construct the world technology frontier and thus obtain a proxy variable of the industry-level technology gap. The world technology frontier of industry
i in year t refers to the largest technical index value of the industry
i among all countries in year t obtained in the data. The technological frontiers of different industries in different years may appear in different countries.
The number of patents approved does not comprehensively reflect the technical level of a particular industry. For example, the number of patents approved in industry
i is larger than that approved in industry
j, but this does not mean that the technical level of industry
i is higher than that of industry
j. The number of patent citations reflects the influence of patents, which can better capture the total quality and market value of patents [
35,
36,
37]. Therefore, this study used patent citation numbers as another proxy indicator of the industry’s technical variables.
When using patent citations as a proxy variable, we should note that it is difficult to assess whether the patents that were approved in 2000 and cited 10 times are of higher quality than those approved in 2008 that were only cited five times. This truncation error must also be considered. In this study, the weighting factor developed by Hall et al. in [
35] was used to adjust for patent citations.
In the empirical estimation, all variable values were logarithmically taken, and the descriptive statistics of the data are shown in
Table 1. In this study, the technical variables were lagged by one year, and the data for 1984–2010 were used. The data for the other variables were data from the period of 1985–2011. In this empirical estimation, all variable values were taken as logarithms. Descriptive statistics of the data are presented in
Table 1. The technical variables were lagged by one year, which means that the technical data were from 1984 to 2010 and industrial panel data were from 1985 to 2011. As shown in
Table 2, the logarithmic average of carbon emissions and carbon emission intensity of the light industry was lower than that of the heavy industry. The capital stock, energy consumption, total number of employed workers, industrial added value, and the total industrial output value of heavy industry were all higher than those of the light industry. From the perspective of the number of patents, the average value of the logarithm of the technological progress of the light industry and the logarithm of the technological gap were slightly smaller than that of the heavy industry. It can be seen from descriptive statistics that although heavy industries bring more added value, they also consume more energy and capital, and the data do not reflect the technical advantages of the heavy industry sector over the light industry sector.
To make a clearer comparison of the technological gap between light and heavy industries, this study depicts the values of the technological gap and the difference in technology gaps in
Figure 2. The left part of
Figure 2 shows the trend of technological gaps over time in terms of patent quantity and quality in various industries in China from 1982 to 2010. The trends of the two indicators were relatively consistent. It can be seen from the figure that the gap between China’s technological level and technological-frontier countries rapidly narrowed from 1982 to 1990; however, it experienced a relatively gradual decline from 1990 to 1995 and continued to shrink rapidly from 1995 to 2010.
The right part of
Figure 2 shows the logarithm of China’s light industry technology gap minus the logarithm of the heavy industry technology gap. A difference greater than 0 means that the technology of the light industry has a smaller gap relative to the world technology frontier, and a difference less than 0 indicates that the heavy industry has a smaller gap relative to the world technology frontier. It can be seen from the figure that before 2000, the technological gap in heavy industry was smaller, whereas after 2000, the technological gap in the light industry was smaller.
The mechanism analysis component of this study examined how technology gaps affect carbon emissions. The analysis believes that narrowing the technological gap will change the comparative advantage of a country relative to international competition and enable the country to improve its industrial structure and engage in more advanced and cleaner production, thereby reducing carbon emissions. What is more important in this mechanism is the industry-level comparative advantage data of a country. This study used data from the World Input–Output Database (WIOD) to estimate the industry’s explicit comparative advantage on the basis of the added value of manufacturing exports.
This study adopted the industry-explicit comparative advantage index to measure the industry’s explicit comparative advantage in export value:
is the export value of country
i and industry
j, and
is the sum of the export value added of industry
j of all countries.
is the sum of export value added of all industries in country
i.
is the sum of export value added of all industries in all countries. Export value-added data can be calculated using the Koopman, Powers, Wang, and Wei (KPWW) [
38] method with the world input–output table. After calculating the explicit comparative advantage of the industry, this study matched the industry classification of the WIOD data with the industry classification of this study and finally obtained the explicit comparative advantage data of industries between 1995 and 2011.