Next Article in Journal
Benefit Sharing of Power Transactions in Distributed Energy Systems with Multiple Participants
Previous Article in Journal
Promoting Underground Cultural Heritage through Sustainable Practices: A Design Thinking and Audience Development Approach
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multivariate Grey Prediction Model Application in Civil Aviation Carbon Emission Based on Fractional Order Accumulation and Background Value Optimization

College of Air Transportation, Shanghai University of Engineering Science, Shanghai 201600, China
*
Author to whom correspondence should be addressed.
Sustainability 2023, 15(11), 9127; https://doi.org/10.3390/su15119127
Submission received: 12 May 2023 / Revised: 1 June 2023 / Accepted: 2 June 2023 / Published: 5 June 2023

Abstract

:
The GM(1,N) model, as a classical multivariate grey prediction model, can make a holistic and dynamic analysis of multiple factors and reflect the dynamic change relationship between the variable series and the related factor series. However, numerous works in the literature show that the GM(1,N) model has mechanistic defects, parametric defects, and structural defects. Therefore, the thesis establishes the OGM(1,N) model based on the GM(1,N) model by adding the linear correction term and the amount of grey action. According to the principle of dynamic optimization, the PSO algorithm is used to determine the background value. On this basis, the fractional order idea is introduced to push the model order from the integer field to the real field, and the FOBGM(1,N) model is established to systematically reduce the model error. Second, the literature in the ScienceDirect database for the last ten years is reviewed, and the carbon emission impact factors of civil aviation are selected. The calculated carbon emission values are taken as sample data based on Method 2 of Civil Aviation in Volume 2 of the 2006 IPCC Guide to National Greenhouse Gas Inventories. The results show that the prediction accuracy of the model has an increasing trend after multi-layer and multi-angle optimization. Among them, the MAPE of the OGM model and FOBGM model decreased by 24.40% and 31.86% compared with the GM(1,N) model. The 5-year average prediction accuracy of the FOBGM model reaches 99.996%, which verifies the effectiveness and practicality of the model improvement and has certain practical significance and application prospects.

1. Introduction

The grey prediction method is one of the effective methods for the analysis and prediction of systems with “small samples, poor information, and uncertainty”, and its biggest advantage is that the amount of information required for prediction is relatively small, which is more suitable in the case of difficult data acquisition and high prediction accuracy. In grey prediction theory, the most basic prediction model is the GM(1,1) model, which is based on a small amount of information and has been widely used in industry, agriculture, medicine, and other fields [1]. However, the GM(1,1) model only analyzes and predicts the change pattern of a single variable, ignoring the influence of changes in other influencing factors on the subject of study. Meanwhile, the GM(1,N) model is the basic model of the multivariate grey system modeling method and is compared with the GM(1,1) grey model. This model makes up for the defect of using historical data directly to build a time series prediction model. It can make a holistic and dynamic analysis of multiple factors and reflect the dynamic change relationship between the study variable series and the related factor series.
At present, many scholars have applied multivariate grey prediction models for forecasting in different fields and optimized the GM(1,N) model from different perspectives to improve its prediction accuracy. Jiang S. et al. (2017) combined grey GM(1,N), Markov theory, and metabolic ideas and empirically showed that the relative error of the metabolism GM(1,N)–Markov model was 32.73% smaller than that of the GM(1,N) prediction model [2].
Wu and Zhang (2018) [3] proposed a new GMC(1,N) model for information priority accumulation, which adjusted the weight of the data by adding a parameter and ultimately improved the accuracy of the prediction model.
Xiong et al. (2018) [4] proposed a nonlinear multivariate NGM(1,N) model based on kernel function and grey radius, which was established based on the kernel number and grey radius sequence of an interval grey number sequence. The kernel number and grey radius of the interval grey number sequence were simulated and predicted. According to the formula of kernel function and grey radius, the upper and lower bounds of the interval gray number were derived so as to simulate and predict the model. The NGM(1,N) model based on the interval grey number was used to forecast AQI under haze weather. Experimental results showed that the model had high prediction accuracy.
Wu et al. (2019) [5] conducted a systematic study of the GM(α, n) model using grey modeling techniques and the forward difference method to calculate the simulated and predicted values by transforming fractional order differential equations into fractional order difference equations and proposed a stochastic testing scheme to verify the accuracy of the new GM(α, n) model. The results showed that the GM(α, n) model had a high potential for energy consumption forecasting in China.
Cheng et al. (2020) [6] derived the modified model of the conventional grey differential equation based on the GM(1,N) whitening equation. Using the grey differential equation estimation parameters of the improved GM(1,N) model, the parameter estimation methods under three different background values were given. Compared with the traditional model, the simulation accuracy and prediction accuracy of the improved GM(1,N) model were significantly improved.
According to Zeng et al. (2020) [7], the extreme value of the independent variable was one of the important factors affecting the simulation and prediction results of the GM(1,N) dependent variable, a new multivariate grey prediction model was constructed on the basis of the smooth generation of variable weight independent variable sequences. The performance of NMGM(1,N) was verified with an example
Shen et al. (2021) [8] proposed an optimized discrete GMC(1,N) model (called ODGMC) to further improve the prediction accuracy and stability of GMC(1,N). In particular, a linear correction term was introduced into the new model, and the time response function of the new model was derived. The ODGMC(1,N) model proposed in this paper not only adjusted the relationship between the dependent and independent variables but also exhibited better stability than GMC(1,N) and its discrete form. The algorithm results showed that the proposed ODGMC(1,N) model had better fitting and prediction accuracy than the traditional GM(1,N) model, GMC(1,N) model, and its discrete form regardless of whether the dependent variable series was increasing, decreasing or fluctuating.
By analyzing the relevant literature, it was found that scholars from different countries have improved and optimized the GM(1,N) model in different research fields and obtained relatively better prediction accuracy compared with the traditional GM(1,N) model. Some studies [5,9] combined the kernel function with GM(1,N) through the combination of embedded models, or they cleverly combined the kernel function with GM(1,N). The multi-variable grey prediction model with higher prediction accuracy was established to reduce the prediction error of the model to a certain extent, but the inherent defects of the GM(1,N) model and the optimization of background values were not studied. The authors [2,3,4,7] studied the optimization of background values of GM(1,N) and GM(1,N) models based on the idea of metabolism and the combination of kernel function and GM(1,N), respectively, to achieve the purpose of improving prediction accuracy. However, these studies did not deeply study the mechanism of GM(1,N) model construction and ignored the inherent defects of the GM(1,N) model, and the prediction accuracy of the model was affected. Aiming at the inherent defects of the GM(1,N) model, some studies [7,8] improved the GM(1,N) model from multiple perspectives and achieved good results. However, these studies lacked the optimization of the cumulative order of GM(1,N) and failed to further improve the prediction accuracy of the multivariate grey model.
This paper provided an in-depth analysis of the inherent flaws of the GM(1,N) model and found that there were still some flaws in the construction process of the GM(1,N) model. GM(1,N) solved the whitening equation and derived the approximate time corresponding function in a relatively idealized manner, resulting in its mechanistic flaws [8]. GM(1,N) treated the parameter column estimates as model parameters of the model approximate time response equation, which was its parametric defects [10]. GM(1,N) did not mine enough grey action from itself, lacked the study of the effect of the GM(1,N) model performance by the linear relationship of the number of terms k , and when N = 1, the model could not achieve structural equivalence with the GM(1,1) model, indicating its structural defects [8].
In all, multi-stage and multi-angle improvement and optimization of the multi-variable grey model were carried out, including background value optimization, fractional optimization, linear increment optimization, and so on. Specifically speaking, the paper added a linear correction term and grey action quantity to the GM(1,N) model to compensate for its inherent defects and built the OGM(1,N) model. Then the Particle Swarm Optimization (PSO) algorithm was used to find the optimal background value, and the OBGM(1,N) model was constructed. The OBGM(1,N) model was improved and optimized using the PSO algorithm and fractional order algorithm at multiple levels and angles, which was called the FOBGM(1,N) model. Then the literature in the ScienceDirect database for the last ten years was reviewed, and the carbon emission impact factors of civil aviation were selected by research frequency statistics and correlation analysis. The carbon emission calculation method for civil aviation of the 2006 IPCC Guidelines for National Greenhouse Gas Inventories was introduced, relevant data were collected, and the prediction accuracy of the model before and after optimization was compared and analyzed to prove the effectiveness of the model optimization, so as to propose a carbon emission prediction model with high prediction accuracy for civil aviation management to evaluate the effectiveness of implementing current emission reduction measures. It should be noted that the GM(1,N) model based on fractional order accumulation and background value optimization developed in this paper can also be applied to the carbon emission forecasting of other transportation systems according to practical needs.

2. Basic Theory

2.1. Analysis of the Correlation Characteristics of Traditional Multivariate Grey Prediction Model

In terms of the number of input variables to the model, the grey prediction model is divided into a multi-variable grey prediction model and a univariable grey prediction model. Among them, the GM(1,1) model (i.e., a grey prediction model with a first-order equation and only one variable) is a typical representative of univariate grey prediction models, which take a single time-series data as the modeling object. According to the system operation law contained in the time series data, the grey generation method is used to mine the data information so as to predict the future development of the system. As the prediction model with the widest application range and the most research achievements, the univariable grey prediction model has a simple structure and is easy to understand without considering the impact of impact factors on the future development of the system. However, its fitting results often show saturated S-shaped characteristics or exponential forms and do not consider the influence of external environment changes on the development trend of the system, so it cannot predict the “inflection point” in the development process of the system. Therefore, the limitations of this model are relatively large. As a representative of a multivariate grey prediction model, the GM(1,N) model is a causal prediction method, which consists of an N-1 series of related factors and one series of system characteristics. In the process of model construction, the influence of external environment changes on the development trend of the system is included in the research scope. The multiple regression algorithm is similar to this model; however, the former is based on probability statistics, and the latter is based on grey theory, and there are essential differences between them. Compared to univariate grey prediction models, GM(1,N) models no longer have a limited single simulation capability in terms of structure. For a long time, the GM(1,N) model has been used more for systematic analysis, and its prediction ability has not been widely utilized. This is because the GM(1,N) model has certain defects in the model construction mechanism and structure, which results in its prediction accuracy is often lower than the GM(1,1) model in the process of use.

2.2. GM(1,N)

2.2.1. Model Definition

Let X 1 0 be the sequence of dependent variables (i.e., the sequence of system characteristics), X 1 0 = x 1 0 1 , x 1 0 2 , , x 1 0 m . Let X i 0 i = 2 , 3 , N be the sequence of independent variables with a high correlation with X 1 0 (i.e., the sequence of explanatory variables), X i 0 = x i 0 1 , x i 0 2 , , x i 0 m . Further, X j 1 is the 1-AGO sequence of X j 0 j = 1 , 2 , , N , X j 1 = x j 1 1 , x j 1 2 , , x j 1 m . Where x j 1 k = g = 1 k x j 0 g , k = 1 , 2 , , m .   Z 1 1 is the immediate mean generating sequence of X 1 1 , which can be calculated by
Z 1 1 = z 1 1 2 , z 1 1 3 , , z 1 1 m .
In Equation (1) z 1 1 k = 0.5 × x 1 1 k + x 1 1 k 1 , k = 2 , 3 , , m . The GM(1,N) model is expressed as
x 1 0 k + a z 1 1 k = i = 2 N b i x i 1 k .

2.2.2. Parameter Estimation and Time Response Equation of GM(1,N) Model

In the GM(1,N) model, a is called the system development coefficient, b i x i 1 k is called the driver term, b i is called the driver coefficient, and a ^ = a , b 1 , b 2 , , b N T is called the parameter column for which the least squares estimate satisfies using
P = B T B 1 B T Y .
Among them, B and Y can be set
B = z 1 1 2 x 2 1 2 x N 1 2 z 1 1 3 x 2 1 3 x N 1 3 z 1 1 m x 2 1 m x N 1 m , Y = x 1 0 2 x 1 0 3 x 1 0 m .
Then we can obtain
d x 1 1 d t + a x 1 1 = i = 2 N b i x i 1 .
Equation (5) is the whitening equation of the GM(1,N) model, also called the shadow equation.
Let the sequence X i 0 , X i 1 i = 1 , 2 , , N , Z 1 1 and the matrix B , Y , a ^ be as described above, then the solution of the whitening equation d x 1 1 d t + a x 1 1 = i = 2 N b i x i 1 is
x 1 1 t = e a t x 1 0 1 t i = 2 N b i x i 1 0 + i = 2 N b i x i 1 t e a t d t .
When the variation in X i 1 i = 1 , 2 , , N is very small, visualizing i = 2 N b i x i 1 k as a grey constant, the approximate time response equation of the GM(1,N) model is
x 1 1 k + 1 = x 1 0 1 1 a i = 2 N b i x i 1 k + 1 e a k + 1 a i = 2 N b i x i 1 k + 1 .
Then the cumulative reduction equation is
x ^ 1 0 k + 1 = α 1 x ^ 1 1 k + 1 = x ^ 1 1 k + 1 x ^ 1 1 k .

2.2.3. Defects of the GM(1,N) Model

In the construction of the GM(1,N) model, there are still some shortcomings that affect the prediction accuracy of the GM(1,N) model.
(1) The GM(1,N) model solves the whitening equation and derives the approximate time equivalent function in a relatively idealized way. In reality, the “small magnitude of variation” of X i 1 is difficult to satisfy. This is because the variables represented by X i 1 i = 1 , 2 , , N are different, so their development trends, dynamic rules, and change characteristics are generally different, and the amplitude of change is difficult to guarantee, which results in the unstable prediction performance of the GM(1,N) model, resulting in the mechanism defects of the GM(1,N) model [10].
(2) GM(1,N) is not a time-response equation derived using x 1 0 k + a z 1 1 k = i = 2 N b i x i 1 k , but is computed using its shadow equation. Therefore, the parameter column a ^ = a , b 1 , b 2 , , b N T calculates its estimates with x 1 0 k + a z 1 1 k = i = 2 N b i x i 1 k . The GM(1,N) model, however, regards the estimated value of the parameter column as the model parameter of the approximate time response of the model. The “dislocation” of the GM(1,N) model parameter estimation and application object causes its unstable prediction performance, which is its parameter defect [4].
(3) The GM(1,N) model has a simple structure and is a state model and factor model. The model does not have a sufficient amount of grey action mined from itself and lacks a study on the effect of the GM(1,N) model performance by the linear relationship between the number of terms k . In addition, the GM(1,N) model serves as a first-order grey prediction model for N variables, but when N = 1, the model cannot achieve structural equivalence with the GM(1,1) model. This indicates that the GM(1,N) model also has structural flaws, resulting in relatively low prediction accuracy [10].

2.3. Prediction Accuracy Evaluation System

Mean absolute percentage error (MAPE) can be used to compare the prediction accuracy of different models [11], which can be obtained with
M A P E = i = 1 n q i n × 100 .
where absolute error e i and relative error q i are calculated as
e i = x i x ^ i , q i = e i x i .
Further, the mean square error (MSE), root mean square error (RMSE), and mean absolute error (MAE) are calculated as
M S E = i = 1 n x i x ^ i 2 n ,
R M S E = 1 n i = 1 n x i x ^ i 2 ,
M A E = 1 n i = 1 n x i x ^ i .

2.4. Carbon Emission Measurement of Civil Aviation

In order to verify the prediction performance of the model, relevant data from the China civil aviation carbon emission prediction study were used in this paper to verify the effectiveness of the model before and after improvement. The World Meteorological Organization (WMO) and the United Nations Environment Programme (UNEP) jointly established the Intergovernmental Panel on Climate Change (IPCC). This paper calculates carbon emissions based on Method 2 of Civil Aviation in Volume 2 of the 2006 IPCC Guidelines for National Greenhouse Gas Inventories (hereinafter referred to as the Guidelines). The flight of an aircraft has two stages: the LTO stage and the cruise stage. The method separates aviation carbon emissions above and below 914 m (3000 feet) during flight, that is, the LTO phase and cruise phase carbon emissions during flight. The specific calculation process is as
E = E L T O + E X
where E is the carbon emissions from air transport. E L T O indicates the carbon emissions from air transport in the LTO phase and E X indicates the carbon emissions from air transport in the cruise phase. E L T O and E X are calculated as
E L T O = N L T O F L T O ,
K L T O = N L T O U L T O ,
E X = K K L T O F X ,
F X = H ( 1 × 10 9 ) / L .
N L T O indicates the number of aircraft landings and takeoffs in the national aviation industry; F L T O indicates the CO2 emission factor in the LTO phase, using the average value of each aircraft type in the Guide, i.e., 4341 kg/LTO; K L T O indicates the fuel consumption in the LTO phase; U L T O indicates the fuel consumption per LTO, using the average value of each aircraft type in the Guide, i.e., 1374 kg/LTO; K indicates the total fuel consumption; F X indicates the CO2 emission factor in the cruise phase H; indicates the amount of CO2 per unit calorific value, using the recommended value of 71,500 kg/TJ in the Guide; and L indicates the low-level heating value of aviation fuel, using the recommended value of 44,100 KJ/kg in the Guide for Accounting Methods and Reporting of Greenhouse Gas Emissions of Chinese Civil Aviation Enterprises prepared by the National Development and Reform Commission.

3. Model Improvement and Optimization

3.1. Model Optimization Ideas

In order to solve the mechanistic defects, parametric defects, and structural defects of GM(1,N), this paper adds linear correction terms and grey action quantities to the GM(1,N) and establishes the OGM(1,N). For the multivariate grey model, the background value coefficient can affect its prediction accuracy to a certain extent, and the value is generally set to 0.5. In this paper, the global search capability of the PSO algorithm is used to search for its background value coefficient, and then the order of OBGGM(1,N) is extended from the integer domain to the real domain using the PSO algorithm and fractional order algorithm to establish the FOBGM model. The improved GM(1,N) model is optimized by multi-level and multi-angle, aiming to further improve the prediction accuracy. The calculation process is shown in Figure 1.

3.2. GM(1,N) Model Structure Improvement

3.2.1. Definition of the OGM(1,N) Model

Let the OGM(1,N) model (optimizing grey model) be
x 1 0 k + a z 1 1 k = i = 2 N b i x i 1 k + h 1 k 1 + h 2
where h 1 k 1 and h 2 are the linear correction term and the grey action of the model.

3.2.2. Parameter Estimation and Time Corresponding Equation of OGM(1,N)

The sequences X 1 0 , Z 1 1 , and X i 1 i = 1 , 2 , , N are as described in Section 2.2.1, then the least squares estimates of the OGM(1,N) model parameter columns p ^ = b 2 , b 3 , , b N , a , h 1 , h 2 T . satisfy
p ^ = B T B 1 B T Y .
Among them, B and Y can be set
B = x 2 1 2 x 3 1 2 x N 1 2 z 1 1 2 1 1 x 2 1 3 x 3 1 3 x N 1 3 z 1 1 3 2 1 x 2 1 m x 3 1 m x N 1 m z 1 1 m m 1 1 Y = x 1 0 2 x 1 0 3 x 1 0 m .
Then the difference model of OGM(1,N) can be described with
x ^ 1 0 k = i = 2 N b i x ^ i 1 k a z 1 1 k + h 1 k 1 + h 2 .
The time-corresponding equation is
x ^ 1 1 k = t = 1 k 1 μ 1 i = 2 N μ 2 t 1 b i x i 1 k t + 1 + μ 2 k 1 x ^ 1 1 1 + j = 0 k 2 μ 2 j k j μ 3 + μ 4 , k = 2 , 3 , .
Among them, μ 1 = 1 1 + 0.5 a , μ 2 = 1 0.5 a 1 + 0.5 a , μ 3 = h 1 1 + 0.5 a , μ 4 = h 2 h 1 1 + 0.5 a .
The cumulative reduction equation of OGM(1,N) is
x ^ 1 0 k = x ^ 1 1 k x ^ 1 1 k 1 , k = 2 , 3 , .
OGM(1,N) does not assume that “ i = 2 N b i x i 1 k is regarded as a grey quantity when the variation of X i 1 i = 1 , 2 , , N is small”. There is no “misalignment” between the shadow equation d x 1 1 / d t + a x 1 1 = i = 2 N b i x i 1 and the parameter column p ^ = b 1 , b 2 , , b N , a , h 1 , h 2 T . Furthermore, the linear correction term and the grey action quantity are added to the model to make the model structure more reasonable. OGM(1,N) is an optimization improvement of GM(1,N), which better compensates for the mechanism defects, parameter defects, and structural defects of the latter.

3.3. Optimization of Background Values for OGM(1,N)

The background value coefficient is a parameter in grey theory that can have an impact on the prediction accuracy of multivariate grey prediction models to a certain extent. However, multivariate grey prediction models generally use 0.5 as their background value coefficients with the aim of simplifying the model construction process. Therefore, this section improves the OGM(1,N) model by optimizing its background value coefficients with the help of the global search capability of the PSO algorithm, calls the improved model OBGM(1,N), and derives the time response function and the final reduced equation of OBGM(1,N).

3.3.1. OBGM(1,N)

OBGM(1,N) is defined by Equation (25), which is an OGM(1,N) model with a background value coefficient of ξ 0 < ξ < 1 .
x 1 0 k + a ξ x 1 1 k + a 1 ξ x 1 1 k 1 = i = 2 N b i x i 1 k + k c + d , k = 2 , 3 , , m .
The parameter column p ^ = b 2 , b 3 , , b N , a , c , d T is computed using the least squares method. Provided that p ^ = b 2 , b 3 , , b N , a , c , d T is known, If a certain ξ minimizes the mean absolute percentage error of sequence X 1 0 , it is the optimal background value coefficient of the OGM(1,N) model. Further, this model is called the OBGM(1,N) model.
The least squares estimate of the parameter column p ^ = b 2 , b 3 , , b N , a , c , d T satisfies
p ^ = E T E 1 E T S .
Among them, E and S can be set
E = x 2 1 2 x 3 1 2 x N 1 2 ξ 1 x 1 1 1 ξ x 1 1 2 2 1 x 2 1 3 x 3 1 3 x N 1 3 ξ 1 x 1 1 2 ξ x 1 1 3 3 1 x 2 1 m x 3 1 m x N 1 m ξ 1 x 1 1 m 1 ξ x 1 1 m m 1
S = x 1 0 2 x 1 0 3 x 1 0 m
then x ^ 1 0 k + a ξ x ^ 1 1 k + a 1 ξ x ^ 1 1 k 1 = i = 2 N b i x i 1 k + k c + d , k = 2 , 3 , , m . Because x ^ 1 0 k = x ^ 1 1 k x ^ 1 1 k 1 , then, we can obtain x ^ 1 1 k x ^ 1 1 k 1 + a ξ x ^ 1 1 k + a 1 ξ x ^ 1 1 k 1 = i = 2 N b i x i 1 k + k c + d . The collation is obtained from x ^ 1 1 k = 1 1 + a ξ i = 2 N b i x i 1 k + 1 a + a ξ 1 + a ξ x ^ 1 1 k 1 + c 1 + a ξ k + d 1 + a ξ . Let v 1 = 1 1 + a ξ , v 2 = 1 a 1 + a ξ , v 3 = c 1 + a ξ , v 4 = d 1 + a ξ , then x ^ 1 1 k = v 1 i = 2 N b i x i 1 k + v 2 x ^ 1 1 k 1 + v 3 k + v 4 , k = 2 , 3 , m . When k = 2 , x ^ 1 1 2 = v 1 i = 2 N b i x i 1 2 + v 2 x 1 1 1 + 2 v 3 + v 4 .
When k = 3 , x ^ 1 1 3 = v 1 i = 2 N b i x i 1 3 + v 2 x ^ 1 1 2 + 3 v 3 + v 4 . Substituting x ^ 1 1 2 , and the equation is reduced to x ^ 1 1 3 = v 1 i = 2 N b i x i 1 3 + v 1 v 2 i = 2 N b i x i 1 2 + v 2 2 x 1 1 1 + 2 v 2 v 3 + v 2 v 4 + 3 v 3 + v 4 .
Continuing the derivation, when k = p   x ^ 1 1 p = v 1 i = 2 N b i x i 1 p + v 1 v 2 i = 2 N b i x i 1 p 1 + + v 1 v 2 p 2 i = 2 N b i x i 1 2 + v 2 p 1 x ^ 1 1 1 + 2 v 2 p 2 v 3 + v 2 p 2 v 4 + 3 v 2 p 3 v 3 + v 2 p 3 v 4 + + p · v 3 + v 4 .
Simplify to get x ^ 1 1 p = u = 1 p 1 v 1 i = 2 N v 2 u 1 b i x i 1 p u + 1 + v = 0 p 2 v 2 v p v v 3 + v 4 + v 2 p 1 x ^ 1 1 1 , which is the time response function of the OBGM(1,N) model, k = p = 2 , 3 , , m .
From x ^ 1 0 k = x ^ 1 1 k x ^ 1 1 k 1 , we can obtain the final reduced form of the OBGM(1,N) model. Thus, the time response equation is
x ^ 1 1 k = u = 1 k 1 v 1 i = 2 N v 2 u 1 b i x i 1 k u + 1 + v = 0 k 2 v 2 v k v v 3 + v 4 + v 2 k 1 x ^ 1 1 1 .
The final reduced equation is
x ^ 1 0 k = v 1 d = 2 k i = 2 N v 2 1 d 2 d v 2 d 2 d d 3 d d 3 b i x i 1 k d + 2 + v = 0 k 3 v 2 k 2 2 v 3 + v 4 + 1 v k 3 v 2 k 2 + v 2 v v 3 .
Equation (30) contains two parts. The first half refers to the effect of the independent variable X i 1 ( i = 2 , 3 , , N ) on the dependent variable X 1 0 . The second half refers to the effect of the constant terms v 1 , v 2 , v 3 , and v 4 on the dependent variable X 1 0 .

3.3.2. Background Value Coefficient of OBGM(1,N) Model Optimized by PSO Algorithm

In 1995, Kennedy and Eberhart proposed a global optimization algorithm, namely particle swarm optimization (PSO) [12]. The PSO algorithm is simple in concept, easy to program, has fewer parameters, and is widely used in the fields of neural network training and function optimization. In this paper, the PSO optimization algorithm is used to find the optimal background value coefficients of the OBGM(1,N) model and the mean absolute percentage error (i.e., MAPE) of the output results of the OBGM(1,N) model is used as the objective function. When the minimum objective function is found, its corresponding background value is the optimal background value coefficient.
At the minimum MAPE, the optimal background value coefficient of the OBGM(1,N) model is essentially to solve the following optimization problems, which can be seen with Equation (31).
min f ξ = 1 m 1 k = 2 n x ^ 1 0 k x 1 0 k x 1 0 k , ξ R + .
The computational procedure for finding the optimal background value coefficients for the OBGM(1,N) model using the PSO algorithm ξ is shown below.
Step 1: Set the PSO algorithm parameters to initialize the velocities and positions of the particles in the particle swarm.
Step 2: Set the current position in the particle to p B e s t and the best particle position in the group to g B e s t = 1 .
Step 3: Calculate the MAPE of the OBGM(1,N) model when ξ = p B e s t .
Step 4: Update all particles as follows.
Step 4.1: Update of particle velocity and position (shown in Equations (32) and (33))
V = ω × V + c 1 × r a n d × p B e s t P r e s e n t + c 2 × r a n d × g B e s t P r e s e n t ,
P r e s e n t = P r e s e n t + V .
Step 4.2: Update p B e s t to the new location if the adaptation is better than the adaptation of p B e s t .
Step 4.3: Update g B e s t to the new position if the adaptation is better than the adaptation of g B e s t .
Step 5: Determine whether the maximum number of iterations is reached or the accuracy requirements are met. If yes, proceed to Step 6; Otherwise, go back to Step 4.
Step 6: Output g B e s t to get the optimal value of the optimal background value coefficient ξ , and output the simulated or predicted value of the OBGM(1,N) model of this optimal background value coefficient ξ = g B e s t , and the run is finished.

3.4. Fractional Order Optimization of OBGM(1,N)

The essence of the OBGM(1,N) model is a multi-variable grey prediction model of first-order equations. By mining the information contained in the data with the help of the first-order cumulative sequence, the fractional order idea is introduced into the OBGM(1,N) model, and the order of the model is extended from the integer domain to the real domain, in order to improve prediction accuracy. The OBGM(1,N) model is further optimized by the optimal cumulative order. Among them, the optimization of cumulative order is determined using the global optimization ability of the PSO optimization algorithm.

3.4.1. Model Definition

Assuming that the original non-negative series sequence is X 1 0 = x 1 0 1 , x 1 0 2 , x 1 0 m , add r-order to the original sequence X i 0 ( i = 2 , 3 , , N ) to obtain the sequence X j r ( j = 1 , 2 , , N ) , X j r = x j r 1 , x j r 2 , x j r m , which can be calculated using Equation (34) [13].
x j ( r ) ( k ) = i = 1 k Γ r + k i Γ k i + 1 Γ r x j ( 0 ) ( i ) , k = 1 , 2 , , m .
The OGM(1,N) model with a background value coefficient of ξ 0 < ξ < 1 and fractional order of r can be described by Equation (35), where p ^ = b 2 , b 3 , , b N , a , c , d T is the column of parameters estimated by the least squares method. Under the condition that p ^ = b 2 , b 3 , , b N , a , c , d T is known, if ξ and r exist to minimize the average relative simulation error percentage of sequence X 1 0 , the OGM(1,N) model is said to have the optimal background value coefficient ξ and the optimal order r , and the multivariate grey prediction model is denoted as the FOBGM(1,N) model.
x 1 0 k + a ξ x 1 r k + a 1 ξ x 1 r k 1 = i = 2 N b i x i r k + k c + d , k = 2 , 3 , , m .

3.4.2. Parameter Estimation and Time Response Equation of FOBGM(1,N)

Let X j ( r ) = ( x j ( r ) ( 1 ) , x j ( r ) ( 2 ) , , x j ( r ) ( m ) ) be the r-order reduction generator of X j 0 , which can be gained using Equation (36).
x j ( r ) ( k ) = i = 0 k 1 1 i Γ r + 1 Γ r i + 1 Γ i + 1 x j 0 ( k i ) , k = 1 , 2 , , m .
Let the least squares estimate of the parameter column p ^ = b 2 , b 3 , , b N , a , c , d T satisfy
p ^ = E T E 1 E T S ,
E = x 2 r 2 x 3 r 2 x N r 2 ξ 1 x 1 r 1 ξ x 1 r 2 2 1 x 2 r 3 x 3 r 3 x N r 3 ξ 1 x 1 r 2 ξ x 1 r 3 3 1 x 2 r m x 3 r m x N r m ξ 1 x 1 r m 1 ξ x 1 r m m 1 ,
S = x 1 0 2 x 1 0 3 x 1 0 m .
Then the time response equation of the FOBGM(1,N) model is
x ^ 1 r k = u = 1 k 1 v 1 i = 2 N v 2 u 1 b i x i r k u + 1 + v = 0 k 2 v 2 v k v v 3 + v 4 + v 2 k 1 x ^ 1 r 1 .
The final reduced equation is
x ^ 1 0 ( k ) = x ^ 1 r r k = i = 0 k 1 1 i Γ r + 1 Γ r i + 1 Γ i + 1 x ^ 1 r ( k i ) , k = 2 , 3 , , m . x ^ 1 0 ( 1 ) = x 1 0 ( 1 )

4. Empirical Study

4.1. Influencing Factor Screening

Although there is relatively little in the literature on China’s civil aviation carbon emission prediction, some scholars have studied the influencing factors of civil aviation carbon emission from different perspectives. In this paper, the literature on carbon emission factors of civil aviation from the most recent ten-year period was reviewed, and some literature on carbon emission of other modes of transport was introduced to calculate the research frequency of carbon emission factors studied so as to preliminarily screen out the impact factors of carbon emission by civil aviation.
As can be seen from Table 1, among the main indicators affecting civil aviation carbon emissions in the past decade, the research frequency of several influencing factors such as civil aviation transportation traffic (passenger/cargo volume), total civil aviation turnover, civil aviation fuel consumption, civil aviation industry-wide operating income, and transport intensity are relatively high. Therefore, in this paper, civil aviation transport traffic is divided into passenger traffic and cargo volume. Finally, six influencing factors were determined to predict China’s civil aviation carbon emissions using civil aviation passenger traffic, civil aviation cargo volume, total civil aviation turnover, civil aviation fuel consumption, civil aviation industry-wide operating income, and civil aviation transportation intensity.

4.2. Original Data Collection and Correlation Analysis

According to the above civil aviation carbon emission accounting standards and selected impact factors, the data on civil aviation fuel consumption, passenger traffic, cargo volume, total civil aviation turnover, and civil aviation industry-wide operating income were collected from 2003 to 2017. Among them, the data on civil aviation fuel consumption was from the Statistical Data on Civil Aviation of China. The data on total civil aviation turnover and the data on aircraft takeoffs and landings were from the Statistical Data on Civil Aviation of China and the China Traffic Yearbook. The data of civil aviation industry-wide operating income were from the Statistical Bulletin on the Development of the Civil Aviation Industry, among which the average value of two years before and after was used because the data for 2008 is missing. The data on civil aviation passenger traffic and cargo traffic volume were from the China Statistical Yearbook.
The prediction model in this paper is based on the GM(1,N) model, the construction of which focuses on the structure of matrix B. If the value difference between rows or columns of matrix B is too large or too close, the condition number of the matrix will be too large, resulting in the phenomenon of matrix drift. The order of magnitude and dimension of the original data selected influencing factors are quite different, so it is necessary to preprocess the original data, such as initializing. After initializing the original data, the predicted value is obtained, and then the predicted value is initialized by inverse transformation so as to restore the order of magnitude and dimension of the predicted sequence. Through the above-screened influencing factors, the Pearson correlation coefficient is used to analyze the correlation between each impact factor and civil aviation carbon emissions, and the results are shown in Figure 2.
As shown in Figure 2, the diagonal plot shows the distribution of China’s civil aviation carbon emissions and the six influencing factors themselves; the upper triangle shows the corresponding Pearson correlation coefficients, and the lower triangle shows the corresponding scatter plots. The Pearson correlation coefficient is −0.31583, while the other five factors are strongly correlated with China’s civil aviation carbon emissions, with Pearson correlation coefficients greater than 0.9, and these five factors are also strongly correlated with each other. Therefore, in summary, this paper selects five influencing factors of carbon emissions in China’s civil aviation, which are civil aviation passenger traffic, civil aviation cargo volume, total civil aviation turnover, civil aviation fuel consumption, and civil aviation industry-wide operating income.

4.3. Empirical Results

According to the five influencing factors screened out above, the background values and order optimization of different models were shown in Table 2, then GM(1,5), OGM(1,5), OBGM(1,5), and FOBGM(1,5) were used to forecast the carbon emissions of civil aviation in China, and the prediction accuracy of the four models was analyzed. In this paper, we collected relevant data from 2003 to 2017 and used the data from 2003 to 2012 as the original data to forecast China’s civil aviation carbon emissions from 2013 to 2017 and then compared it with the actual data to obtain the prediction accuracy of the model. The prediction results are shown in Table 3.
Based on the prediction results, the absolute and relative errors of each model for each year were obtained, as shown in Figure 3 and Figure 4.
It can be found in Figure 3 and Figure 4 that the prediction accuracy of the model gradually improved during the improvement of the model, and the FOBGM(1,5) model had the highest prediction accuracy among the four models, and the prediction error of the FOBGM(1,5) model gradually decreased by 99.56%, 33.46%, 28.43%, 26.67%, and 24.87%, relative to the GM(1,5) model from 2013 to 2017. In order to further simplify the prediction errors of each model and more directly reflect the prediction accuracy of the model, this paper compares the prediction performance of each model through MAPE, MAE, RMSE, and MAE.
As can be seen from Figure 5, through multi-stage optimization and improvement of the model, the prediction accuracy of the model is gradually improved. The prediction error of the OGM(1,5) model was reduced by 24.40% compared with the GM(1,5) model. The prediction error of the OBGM(1,5) model was reduced by 0.43% compared with OGM(1,5) model. The prediction error of the FOBGM(1,5) model was reduced by 9.48% compared with that of the OBGM(1,5) model, which proves the effectiveness of the model improvement, and the FOBGM(1,5) model had the smallest prediction error and the highest prediction accuracy among the four models.

5. Conclusions

In this paper, based on the GM(1,N) model, first, the mechanism defects, parameter defects, and structural defects of the GM(1,N) model were compensated by adding linear correction term and grey action, and the OGM(1,N) model is established. Then, the background value coefficients of the OGM(1,N) model were optimized using the PSO algorithm, and the OBGM(1,N) model was established. Then, by introducing the fractional order idea, this paper uses the PSO algorithm to optimize the cumulative order of the OBGM(1,N) model and extend the order of the OBGM(1,N) model from an integer field to a real number field to establish FOBGM(1,N) model. Five influencing factors were determined to predict China’s civil aviation carbon emissions using civil aviation passenger traffic, civil aviation cargo volume, total civil aviation turnover, civil aviation fuel consumption, civil aviation industry-wide operating income, and civil aviation transportation intensity. Based on the carbon emission prediction data of civil aviation transportation in China, the improvement effect of each model was empirically studied. It can be seen from the prediction results that the prediction error of the model decreases gradually, and the prediction accuracy shows an increasing trend after multi-level and multi-angle improvement and optimization. Among them, the MAPE of the OGM(1,5) model decreased by 24.40% compared with the GM(1,5) model, the MAPE of the OBGM(1,5) model decreased by 24.72% compared with the GM(1,5) model, the MAPE of the FOBGM(1,5) model decreased by 31.86% compared with the GM(1,5) model. It reflects the effectiveness of model improvement and proves the practicability of the FOBGM(1,5) model. From the perspective of algorithm improvement, the idea of improving the GM(1,N) model based on fractional summation and background value optimization has certain practical significance and application prospects and can be applied to the optimization research of other prediction models of grey system theory. Although the paper has carried out multi-angle and multi-level optimization of the model, such as background value, fractional order, linear increase, etc., the model still has room for improvement, such as from the perspectives of initial value improvement, residual correction, and metabolism, as a subject of further research.

Author Contributions

All authors have made valuable contributions to this paper. Conceptualization, C.L. and J.X.; methodology, C.L.; software, J.X.; validation, C.L., J.X. and Y.L.; formal analysis, J.X.; investigation, C.L.; resources, C.L.; data curation, J.X.; writing—original draft preparation, C.L. and J.X.; writing—review and editing, C.L. and Y.L.; visualization, J.X.; supervision, C.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Zhang, K.Z.; Jiang, H.H.; Liu, P.Z.; Zhang, Y. Ginger planting area prediction based on GM(1,N) model. Chin. J. Agric. Chem. 2020, 41, 139–143. [Google Scholar]
  2. Jiang, S.; Lian, M.; Lu, C.; Gu, Q.; Zhao, H. Prediction of the Death Toll of Environmental Pollution in China’s Coal Mine Based on Metabolism-GM (1, n) Markov Model. Ekoloji 2017, 26, 17–23. [Google Scholar]
  3. Wu, L.F.; Zhang, Z.Y. Grey multivariable convolution model with new information priority accumulation. Appl. Math. Model. 2018, 62, 595–604. [Google Scholar] [CrossRef]
  4. Xiong, P.; Yin, Y.; Shi, J.; Gao, H. Nonlinear Multivariable GM(1,N) Model Based on Interval Grey Number Sequence. J. Grey Syst. 2018, 30, 33–47. [Google Scholar]
  5. Wu, W.Q.; Xin, M.; Yong, W.; Zhang, Y.Y.; Zeng, B. Research on a novel fractional GM (α, n) model and its applications. Grey Syst. 2019, 3, 356–373. [Google Scholar] [CrossRef]
  6. Cheng, M.; Li, J.; Liu, Y.; Xiang, M. Parameter estimation of modified gray model GM(1,1) and model application. Commun. Stat-Simul. Comput. 2020, 49, 1367–1384. [Google Scholar] [CrossRef]
  7. Zeng, B.; Li, H.; Ma, X. A novel multi-variable grey forecasting model and its application in forecasting the grain production in China. Comput. Ind. Eng. 2020, 150, 106915. [Google Scholar] [CrossRef]
  8. Shen, Q.Q.; Cao, Y.; Yao, L.Q.; Zhu, Z.K. An optimized discrete grey multi-variable convolution model and its applications. Comp. Appl. Math. 2021, 40, 58. [Google Scholar] [CrossRef]
  9. Ma, X.; Liu, Z.B. The kernel-based nonlinear multivariate grey model. Appl. Math. Model. 2018, 56, 217–238. [Google Scholar] [CrossRef]
  10. Ma, X.; Liu, Z.B. The GMC(1,n) Model with Optimized Parameters and Its Application. J. Grey Syst. 2017, 4, 122–138. [Google Scholar]
  11. Özcan, T. Application of Seasonal and Multivariable Grey Prediction Models for Short-Term Load Forecasting. Alphanumeric J. 2017, 5, 329–338. [Google Scholar] [CrossRef]
  12. Kennedy, J.; Eberhart, R. Particle swarm optimization. In Proceedings of the 1995 IEEE International Conference on Neural Networks, Perth, WA, Australia, 27 November–1 December 1995; pp. 1942–1948. [Google Scholar]
  13. Li, C.; Xing, J. A Novel of Fractional Order Predictive Model on Carbon Emission Intensity in China’s Transportation Sector. Environ. Eng. Manag. J. 2021, 20, 1569–1581. [Google Scholar]
  14. Dong, J.K.; Zong, M.; Chen, J.J. A carbon emission prediction method for civil aviation industry based on STIRPAT model. Environ. Eng. 2014, 32, 165–169. [Google Scholar]
  15. Ding, S.; Dang, Y.G.; Li, X.M.; Wang, J.J.; Zhao, K. Forecasting Chinese CO2 emissions from fuel combustion using a novel grey multivariable model. J. Clean. Prod. 2017, 162, 1527–1538. [Google Scholar] [CrossRef]
  16. Liu, H.; Tian, H.; Hao, Y.; Liu, S.; Liu, X.; Zhu, C.; Wu, Y.; Liu, W.; Bai, X.; Wu, B. Atmospheric emission inventory of multiple pollutants from civil aviation in China: Temporal trend, spatial distribution characteristics and emission features analysis. Sci. Total Environ. 2019, 648, 871–879. [Google Scholar] [CrossRef]
  17. Yang, H.J.; O’Connell, J.F. Short-term carbon emissions forecast for aviation industry in Shanghai. J. Clean. Prod. 2020, 275, 122734. [Google Scholar] [CrossRef]
  18. Cui, Q. Investigating the airlines emission reduction through carbon trading under CNG2020 strategy via a Network Weak Disposability DEA. Energy 2019, 180, 763–771. [Google Scholar] [CrossRef]
  19. Yu, J.; Shao, C.; Xue, C.; Hu, H. China’s aircraft-related CO2 emissions: Decomposition analysis, decoupling status, and future trends. Energy Policy 2020, 138, 111215. [Google Scholar] [CrossRef]
  20. Wang, Z.; Xu, X.; Zhu, Y.; Gan, T. Evaluation of carbon emission efficiency in China’s airlines. J. Clean. Prod. 2020, 243, 118500.1–118500.8. [Google Scholar] [CrossRef]
  21. Tokuslu, A. Estimation of aircraft emissions at Georgian international airport. Energy 2020, 206, 118219. [Google Scholar] [CrossRef]
  22. Liu, X.; Hang, Y.; Wang, Q.; Zhou, D. Flying into the future: A scenario-based analysis of carbon emissions from China’s civil aviation. J. Air Transp. Manag. 2020, 85, 101793. [Google Scholar] [CrossRef]
  23. Demir, A.S.; Eminler, O.E. Estimation of CO2 emissions from air transportation in EU countries by artificial neural networks. Int. J. Glob. Warm. 2020, 21, 234–243. [Google Scholar] [CrossRef]
  24. Shi, Y.T.; Wu, W.W.; Li, X.X. Research on the development characteristics and influencing factors of China’s aviation carbon emissions. J. East China Jiaotong Univ. 2019, 36, 32–38. [Google Scholar]
  25. Ding, S.; Dang, Y.; Xu, N.; Xu, Z. A Novel Grey Model Based on the Trends of Driving Factors and Its Application. J. Grey Syst. 2018, 30, 105–126. [Google Scholar]
  26. Gudmundsson, S.V.; Anger, A. Global carbon dioxide emissions scenarios for aviation derived from IPCC storylines: A meta-analysis. Transp. Res. Part D Transp. Environ. 2012, 17, 61–65. [Google Scholar] [CrossRef]
Figure 1. FOBGM(1,N) operation logic.
Figure 1. FOBGM(1,N) operation logic.
Sustainability 15 09127 g001
Figure 2. Correlation analysis.
Figure 2. Correlation analysis.
Sustainability 15 09127 g002
Figure 3. Absolute error comparison.
Figure 3. Absolute error comparison.
Sustainability 15 09127 g003
Figure 4. Comparison of relative errors.
Figure 4. Comparison of relative errors.
Sustainability 15 09127 g004
Figure 5. Comparison of prediction errors.
Figure 5. Comparison of prediction errors.
Sustainability 15 09127 g005
Table 1. Civil aviation carbon emission influencing factors and representative authors.
Table 1. Civil aviation carbon emission influencing factors and representative authors.
Impact FactorLiterature StatisticsRepresentative Authors
Civil aviation fuel consumption16[14,15,16,17]
Total civil aviation turnover9[18,19,20]
Civil aviation industry-wide operating income9[18,19,20]
Civil aviation transport traffic (passenger/cargo volume)8[14,17,20,21]
Transportation intensity5[16,22]
Number of civil flights4[19,20,21,23]
GDP per capital4[24]
Population4[25]
GDP2[25,26]
Energy intensity2[22]
Flight Type1[23]
Table 2. Background values and order of the different models.
Table 2. Background values and order of the different models.
GM(1,5)OGM(1,5)OBGM(1,5)FOBGM(1,5)
Background value0.50.511
Order1111.0907
Table 3. Prediction results of carbon emissions.
Table 3. Prediction results of carbon emissions.
YearCivil Aviation Carbon Emissions Original DataGM Model Background Value/OrderOGM(1,5)OBGM(1,5)FOBGM(1,5)
Background ValueCumulative OrderBackground ValueCumulative OrderBackground ValueCumulative OrderBackground ValueCumulative Order
0.510.511111.0907
20136302.9162546303.0413426302.9885036302.9875496302.916802
20146990.0868956990.3328866990.2604126990.2595936990.250567
20157901.1073257901.6705487901.53317901.5314297901.510396
20168879.0550038879.860478879.679128879.6768218879.645614
201710,009.6654210,010.6470110,010.443710,010.4409510,010.40286
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, C.; Li, Y.; Xing, J. Multivariate Grey Prediction Model Application in Civil Aviation Carbon Emission Based on Fractional Order Accumulation and Background Value Optimization. Sustainability 2023, 15, 9127. https://doi.org/10.3390/su15119127

AMA Style

Li C, Li Y, Xing J. Multivariate Grey Prediction Model Application in Civil Aviation Carbon Emission Based on Fractional Order Accumulation and Background Value Optimization. Sustainability. 2023; 15(11):9127. https://doi.org/10.3390/su15119127

Chicago/Turabian Style

Li, Cheng, Yangzhou Li, and Jian Xing. 2023. "Multivariate Grey Prediction Model Application in Civil Aviation Carbon Emission Based on Fractional Order Accumulation and Background Value Optimization" Sustainability 15, no. 11: 9127. https://doi.org/10.3390/su15119127

APA Style

Li, C., Li, Y., & Xing, J. (2023). Multivariate Grey Prediction Model Application in Civil Aviation Carbon Emission Based on Fractional Order Accumulation and Background Value Optimization. Sustainability, 15(11), 9127. https://doi.org/10.3390/su15119127

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop