1. Introduction
Water occupies an important strategic position in the development of society, so it is undoubtedly an extremely precious resource [
1]. However, rapid economic development leads to an increased demand for resources, and a new problem arises regarding how to seek a balance between resource protection and economic development. Due to the impact of external complex factors, many cities in China have experienced varying degrees of water shortages [
2]. For example, the functionality of water resources is reduced in cities such as Hangzhou and Ningbo due to the inadequate management of water resources, which has caused adverse consequences that can lead to the local area bearing heavier pressure in future developments. Therefore, accurately predicting the water consumption can help to mitigate possible future conflicts during sustainable development and environmental protection.
In recent years, many scholars have been devoted to the forecasting research of water resources and have achieved many innovative results, which provided some new ideas for the prediction of water consumption, such as neural networks [
3] and support vector machines [
4]. Meanwhile, many well-performing models were used to study the issues related to water resources. A multi-objective model was proposed to study the sustainable management of water resources in the Zarrineh Basin in Iran [
5]. A support vector regression model was used for short-term predictions for eight water supply areas in the Netherlands and Belgium [
6]. Aiming at the prediction problem of urban water demand, the bat algorithm is introduced to improve the prediction accuracy [
7]. The interaction between Xinjiang’s population, water resources and economic development was studied based on the system dynamics model [
8]. A long- and short-term memory model was used to predict the short-term demand in Hefei, China [
9]. A water scarcity pricing model was proposed to promote more effective water resources management in areas with scarce water resources [
10].
Among the studies on water consumption requirements, the influential factors and model specification will have important impacts on the analysis results. Considering the climate and groundwater conditions, the Colne watershed in eastern England was analyzed under different drought conditions, and the population factors of the basin were integrated to predict the water demand [
11]. By combining the experimental design and Monte Carlo simulation, the prediction uncertainty of the water quality was evaluated [
12]. A quantitative method considering water resource distribution was proposed to study the relationship between climate change and water demand [
13]. A new demand forecasting method combining machine learning and statistical methods was proposed to give short-term forecasts for the water demand of British households [
14]. The evaluation and planning model was used to predict the water demand in the central and western regions of Côte d’Ivoire, which assisted with formulating water management optimization measures [
15].
In addition, the problem of predicting water resources using grey models has continued to attract the attention of scholars, which produced a fruitful contribution to this field [
16]. Considering the effect of various factors, some policy suggestions for the safety of agricultural water use were put forward based on the predictive analysis of the change in agricultural water use [
17]. The grey models, by optimizing objectives and parameters, are used to predict the water level quantity and quality [
18,
19]. An improved grey model, which utilizes the exponentially weighted moving average and particle swarm algorithm, was established to predict water emissions from faults [
20]. By combining multiple factors and grey relational analysis, a safety evaluation model was established to analyze the safety problems on water resources in Guizhou Province [
21]. A new grey model was put forward to evaluate the rural water environment quality by comprehensively using the network search information and principal component analysis [
22]. The grey water footprint index was used to study the impact of changes in residents’ consumption levels and patterns on water pollution [
23]. By integrating the grey decision method and entropy weight method, a multivariable model was established to offer a novel way to study mine water sources [
24]. The fractional grey model was used to make water consumption recommendations for 31 provinces [
25].
Since Professor Deng founded the grey system theory in 1982, the grey system theory has been widely used in various fields [
26]. The grey prediction model is an important part of the grey system theory. It uses the grey generating operator or sequence operator to weaken randomness, excavate the potential law, establish the continuous dynamic differential equation based on the discrete data series, and make the quantitative prediction of the research object according to the GM model [
27]. The traditional grey model (GM(1,1)) is represented with one order difference and one variable. In order to improve the prediction performance of the model in forecasting problems, many scholars have improved the grey model in the following aspects: (1) the optimization of the model background value [
28]; (2) the extension of the modeling equation [
29]; (3) the improvement of the cumulative generation method [
30]. In this study, the grey generating operator of the model is optimized by introducing the deformable accumulation generating operator. The detailed modeling process of the prediction model is introduced in
Section 3.2.
Because the traditional statistical model has relatively high requirements regarding sample data, it requires a large sample size of the research object and certain sampling probability distribution [
31]. When insufficient information is used to meet the conditions, the prediction effect of the statistical model is often unsatisfactory. Additionally, traditional econometrics has some shortcomings in the research of forecasting problems. Although traditional econometrics, which is based on economic theory, has excellent interpretability, its forecasting ability is relatively inferior [
32]. The grey prediction model has low requirements for the amount of information, as low as four data points, and has the advantages of easy operation and a high modeling accuracy [
33]. With the development of the grey system theory, the grey prediction model has been widely used in many forecasting fields and has become an effective tool to deal with forecasting problems [
34]. The grey prediction model provides a new way to solve the prediction problem, and the research on water resources prediction has been a hot topic. Finally, the grey prediction model is used as a research tool in this study. We optimize and improve the traditional grey model and use it to forecast water consumption.
Therefore, the accurate prediction of water consumption is particularly important, as traditional models are complicated and require too much relevant information. The forecasting model utilized in this paper integrates a novel type of accumulation method [
35], which has a better forecasting effect through the processing of the original data. According to the principle of using small samples and new information first in the grey system [
36], a new cumulative grey prediction model was established for the data and information from recent years. Because the finiteness of the new information and the validity of the initial value are fully considered, the new model has a better prediction accuracy. At the same time, we learn from the research methods of other scholars and use grey correlation analysis in the selection of factors affecting water consumption [
37]. Finally, through the combination of the new cumulative multivariable grey model and grey correlation analysis, this paper predicts the water consumption in the Yangtze River Delta region.
The rest of this paper is organized as follows.
Section 2 states the research area and data sources. The methods are provided in
Section 3. The forecasting results and discussion are presented in
Section 4.
Section 5 gives the conclusion and policy implications.
3. Methods
The grey correlation in this paper is applied to analyze the relationship between water consumption and influence factors. Then, the DGM(1,N) with deformable accumulation (DDGM(1,N) model) is established to study the change trend of water consumption. Finally, the validity of the model is proven via model comparative analysis.
3.1. Grey Correlation Analysis
The changing trend of water consumption is affected by many factors. Grey correlation analysis is used to explore the correlation between research objects and influencing factors, and the calculation process is shown as follows.
(1) Set as the system characteristic behavior sequence, and as the sequence of relevant factors. Calculate the mean terms of each sequence using .
(2) Calculate the grey relational coefficients using Equation (1). For
,
(3) Calculate the grey relational degree using Equation (2).
3.2. The DDGM(1,N) Model
In this paper, the DGM(1,N) with deformable accumulation (DDGM(1,N) model) is used to predict the water consumption (billion cubic meters) of 17 cities selected in the Yangtze River Delta Economic Zone. The process is introduced as follows.
(1) Assume that the system principal variable data sequence is Equation (3).
The sequences of relevant factors are Equation (4).
is a sequence of cumulative generation operations of
. Consider that the definition of the deformable derivative (Wu and Zhao, 2019)
-order accumulation sequence is Equation (5).
The cumulative generation of the model is calculated using Equation (6).
The particle swarm optimization algorithm is used to find the optimal order () in MATLAB (R2016b). is the deformable accumulation parameter; it can adjust the weight of old and new data during sequence generation.
(2) The grey model DDGM(1,N) of the
-order is Equation (7).
is the grey function system parameter of the model; is the principal variable parameter. are grey action system parameters of different independent variables in the model.
Then, the least squares method is used to solve the parameters in the model (Equation (8)). Therefore, the parameters
and
can be obtained by using Equations (9) and (10).
where
is recursive function of the DDGM(1,N) model. Then, the simulated value of
can be obtained by using Equation (11).
According to Equation (12), we can see that the simulation value of
is
(see Equation (13)).
(4) The fitting error and prediction error are solved by calculating the mean absolute percentage error value (MAPE) according to Equation (14).
3.3. Properties of the Cumulative Generating Operator
Property 1. When , the accumulation sequence is a monotonically increasing sequence. When , the accumulation sequence is a monotonically decreasing sequence. Otherwise, the accumulation sequence is not monotonic.
Proof. Knowing that
Therefore, when , the accumulation sequence is a monotonically increasing sequence; when , the accumulation sequence is a monotonically decreasing sequence. Otherwise, the accumulation sequence is not monotonic.
The proof of Property 1 is completed. □
Property 2. The larger the parameter , the larger the value obtained by accumulating the sequence.
Proof. By assuming that
we can obtain the following formulas.
Therefore, is true. This shows that the larger the parameter , the larger the value obtained by accumulating the sequence.
The proof of Property 2 is completed. □
3.4. Analysis and Discussion of Model Stability
Lemma 1. Let ; the vector norm and matrix norm are compatible. If a matrix norm has , then the solutions of the inhomogeneous linear equations and satisfy where It can be known from Lemma 1 that matrix and matrix are original matrices, and matrix and matrix are perturbation matrices, which are perturbations of the solution. According to the matrix perturbation analysis, the perturbation bound of the solution can be obtained.
Theorem 1. Assume that the solution of the DDGM(1,N) model is . If only is perturbed by , then the perturbation bound of the solution in this case is Proof. If only is perturbed by , it is equivalent to when .
It is obvious that we can obtain the value of
.
We can also obtain the value of
.
Therefore, and can be obtained.
According to Lemma 1, they can be obtained as follows:
Then, the perturbation bound of the solution can be obtained.
Theorem 2. If only is perturbed by , then the perturbation bound of the solution in this case is Proof. If only
is perturbed by
, it is equivalent to when
, where
The value of
can be obtained.
The value of
can be obtained.
Therefore, and can be obtained.
According to Lemma 1, they can be obtained as follows:
Then, the perturbation bound of the solution can be obtained.
Theorem 3. If only is perturbed by , then the perturbation bound of the solution in this case is Proof. If only
is perturbed by
, it is equivalent to when
, where
The value of
can be obtained.
The value of
can be obtained.
Therefore, and can be obtained.
According to Lemma 1, they can be obtained as follows:
Then, the perturbation bound of the solution can be obtained.
According to the analysis and proof presented above:
The disturbance bound of the model is correlated with the size of the sample . In the case of equivalent disturbance, the larger the number of samples, the larger the disturbance bound of the model. This indicates that as the number of samples increases, the disturbance bound of the model increases and stability becomes worse, which means that the grey model is suitable for the modeling of the “less data” problem.
When the sample size is unchanged, the particle swarm optimization algorithm can be used to adjust the parameter , so that the disturbance bound of the solution satisfies . That means is more sensitive to the solution than . It also indicates that new information is given more weight than old information. Therefore, the DDGM(1,N) model conforms to the new information priority principle.
3.5. The Case Study
This paper takes the relationship between water consumption and different factors in Jinhua City as an example. The original data of Jinhua presented in
Table 1 are from 2010 to 2017. The relationship between population and water consumption is analyzed according to the results, as well as the industrial added value and water consumption. Following the method described in
Section 3.1 above, we can obtain the results that
. The results show that the population has the largest impact on the water consumption, followed by the added value of the primary industry and the secondary industry, and finally, the added value of the tertiary industry has the least impact on the water consumption.
DDGM(1,N) model is constructed according to the original data in
Table 1, and the calculation process is shown as follows.
(1) The water consumption sequence of Jinhua City from 2010 to 2017 is
The population of Jinhua City from 2010 to 2017 is
Then, the optimal order
is obtained through the particle swarm optimization (PSO) algorithm in MATLAB (R2016b). The
-order
cumulative sequences are as follows:
According to Equations (9) and (10), the parameters
and
can be obtained.
(3) The simulated value of
is obtained using Equation (11) as follows:
Finally, the fitting and prediction results can be obtained, and the validity can be tested using MAPE with fitting MAPE = 0.9022% and prediction MAPE = 1.36%, respectively.
The fitting and prediction results of DDGM(1,2), GM(1,1) and GM(1,2) are shown in
Table 2. Compared with the two classical grey models, the fitting MAPE and prediction MAPE of DDGM(1,2) model are both the smallest. There is no significant difference between the fitting MAPE of DDGM(1,2) and GM(1,1), but the prediction MAPE of DDGM(1,2) is smaller than that of GM(1,1). As for GM(1,2), both the fitting MAPE and the prediction MAPE of DDGM(1,2) are much smaller than GM(1,2). Therefore, DDGM(1,2) model has better fitting and prediction performance, indicating that this model is more effective in predicting water consumption.
Similarly, the DDGM(1,2) model is constructed according to the water consumption and the added value of the tertiary industry shown in
Table 1. The fitting and prediction results of the three models are shown in
Table 3. It can be found that the fitting MAPE and prediction MAPE of the DDGM(1,2) model are smaller than those of GM(1,1) and GM(1,2). This further illustrates the effectiveness of the DDGM(1,2) model to predict water consumption.
5. Conclusions
This article firstly uses the grey relational analysis to study the related factors affecting water consumption. Then, through the model comparisons, we found that the DDGM(1,N) model has a better fitting and prediction effect. Finally, the DDGM (1,N) model was used to predict the water consumption in 17 cities selected in the Yangtze River Delta region by considering different influencing factors. The results show that as the growth rate of the population increases, the water consumption in 15 cities show a downward trend, and 2 cities show an upward trend. With the increase in the growth rate of the primary industry, the water consumption in 12 cities show a downward trend, and 5 cities show an upward trend. With the rise in the growth rate of the secondary industry, the water consumption in 14 cities show a downward trend, and 3 cities show an upward trend. With the rise in the growth rate of the tertiary industry, the water consumption in 12 cities show a downward trend, and 5 cities show an upward trend. In general, the water consumption in most cities will continue to show a downward trend in the future.
According to the prediction results obtained in this paper, we have some policy recommendations as follows. First of all, urban populations should be controlled. We should avoid a situation where the population density is concentrated, which will cause great pressure on the water consumption. Secondly, for cities where water consumption has increased with the development of the primary industry, the water resource utilization efficiency of agriculture and animal husbandry should be improved. Moreover, for cities where the water consumption increases with the development of the secondary industry, it is suggested that they adjust the internal structure of the secondary industry to reduce the proportion of high water-consuming industries, and meanwhile, strengthen the supervision of industrial water efficiency. It is possible to raise the utilization rate by adjusting the water prices and upgrading the water-saving technologies of water-using equipment. Finally, for cities whose water consumption has increased with the development of the tertiary industry, the service industries of these cities should shift to the direction of eco-environmental protection. We should focus on the accommodation and catering industry, carry out technical transformation of its water equipment and eliminate outdated high water-consuming equipment simultaneously. Through the government’s management and policy guidance, the goal of improving the utilization effect of water resources is realized.