Next Article in Journal
Impact and Recovery of Coastal Tourism Amid COVID-19: Tourism Flow Networks in Indonesia
Next Article in Special Issue
Evaluation of the Effectiveness of and the Extent to Which Large and Medium Logistics Organisations Report on Social Sustainability—The Case of South Africa
Previous Article in Journal
A Contribution to Social Acceptance of PV in an Oil-Rich Country: Reflections on Governmental Organisations in Iran
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Regional Logistics Demand Prediction: A Long Short-Term Memory Network Method

School of Logistics and Transportation, Central South University of Forestry and Technology, Changsha 410000, China
*
Author to whom correspondence should be addressed.
Sustainability 2022, 14(20), 13478; https://doi.org/10.3390/su142013478
Submission received: 26 September 2022 / Revised: 14 October 2022 / Accepted: 17 October 2022 / Published: 19 October 2022
(This article belongs to the Special Issue Sustainable Logistics Operations and Management)

Abstract

:
With the growth of e-commerce and the recurrence of the novel coronavirus pneumonia outbreak, the global logistics industry has been deeply affected. People are forced to shop online, which leads to a surge in logistics needs. Conversely, the novel coronavirus can also be transmitted through goods, so there are some security risks. Thus, in the post-epidemic era, the analysis of regional logistics needs can serve as a foundation for logistics planning and policy formation in the region, and it is critical to find a logistics needs forecasting index system and a effective method to effectively exploit the logistics demand information in recent years. In this paper, we use the freight volume to assess the logistics needs, and the Long short-term memory (LSTM) network to predict the regional logistics needs based on time series and impact factors. For the first time, the Changsha logistics needs prediction index system is built in terms of e-commerce and the post-epidemic era and compared with some well-known methods such as Grey Model (1,1), linear regression model, and Back Propagation neural network. The findings show that the LSTM network has the smallest prediction errors, and the logistics needs are not affected by the epidemic. Therefore, the authors suggest that the government and businesses pay more attention to regional logistics needs forecasting, choosing scientific prediction methods.

1. Introduction

Since the twentieth century, with the booming development of the Internet industry, E-commerce has gradually entered people’s lives, widely attracted people’s attention, and the world has gradually entered a new economic era characterized by the development of e-commerce [1,2]. Due to the further promotion and application of e-commerce, people’s needs for regional logistics is constantly rising.
Regional logistics needs forecasting is the advanced estimation and projection of the flow of goods, sources, flow direction, flow rate and composition of goods that have not yet occurred or are not clear at present in the region, so that it can be used to study the size of regional logistics needs and its hierarchical structure of needs, to inform decision-making and provide a foundation for regional logistics planning, and to further provide a scientific and reliable basis for government departments to formulate logistics policies and coordinate the allocation of logistics resources, etc. In addition, the regional logistics needs forecasting is accurate and reasonable. In addition, an accurate and reasonable forecast of regional logistics needs will help the Government to integrate regional logistics resources and establish an economical and efficient regional logistics system.
With the rapid advancement of economic globalization and regional economic integration, logistics needs is on the rise year by year and has emerged as a new engine for promoting healthy and stable regional economic development, with regional logistics needs forecasting having a significant impact on regional development planning. In recent years, logistics needs forecasting methods have received extensive attention. For example, some methods [3,4,5] have been proposed for putting forward to solve the issue of poor logistics and distribution ability from different angles. Mladenow et al. [3] proposed that introducing crowdsourced logistics would successfully tackle the last-mile problem of urban logistics and distribution, and alleviate the difficulties such as insufficient logistics and distribution capacity, during peak periods. Yu et al. [4] employed different regional logistics needs forecasting models to predict regional logistics needs and provide decision assistance for local economic development. Further, the impact of e-commerce on logistics and distribution cannot be overlooked in the age of e-commerce. Huang et al. [5] considered ground indicators related to e-commerce development and applied Grey Model (1,1) model and Back Propagation neural network model for predicting regional logistics. However, as the novel coronavirus continues to spread internationally, just as Severe acute respiratory syndrome coronavirus swept the world in 2003, it also had a serious impact on the logistics industry at that time. Therefore, 2004–2005 is also known as the post-epidemic era of Severe acute respiratory syndrome coronavirus, makes the models have better training, and one that can help for the prediction of freight volume in the post-epidemic era of the novel corona pneumonia, and has the effect of migration learning, so that its accuracy is more accurate. There is no doubt that this will affect every aspect of our lives including the logistics industry in the age of e-commerce. Previous methods, when performing logistics needs forecasting, did not consider the general environment of the new crown virus. Therefore, unlike the previous methods, this paper for the first time integrates the factors under the influence of the epidemic and the two factors of e-commerce development for forecasting. However, in the field of logistics needs forecasting, model building is still dominated by support vector regression machines, and deep learning networks are not yet studied much at present. Motivated by the issue above, we propose a regional logistics forecasting model based on Long short-term memory (LSTM) network.
The main contributions of our work are as follows:
(1) We create a logistics needs forecasting index system for Changsha city, China, which provides a novel aspect for selecting indicators, combining e-commerce with novel coronavirus pneumonia.
(2) We apply the LSTM network to the regional logistics demand forecasting. Extensive experiments demonstrate that the LSTM network can be well used in regional logistics needs forecasting.
(3) Through the prediction of the future freight volume, the government and logistics enterprises can seize this opportunity to further promote the development of the local economy.

2. Related Work

Recently, a large number of regional logistics needs forecasting methods have been proposed and can be broadly grouped into the three types as follows.
The first is the traditional statistical forecasting methods, mainly mathematical modeling, input-output methods, regression analysis, grey theory clustering, combinatorial forecasting methods and Markov chains [6,7]. Samvedi et al. [6] performed a simulation experiment on the beer game to compare the effectiveness of three proven prediction models and grey methodologies in this disrupted and stable situation. The grey forecasting approach showed the most stability according to the findings. In estimating sales of Chilean supermarkets, Sheu et al. [7] used a combination forecasting technique, combining a neural network architecture and a rolling average algorithm into a combined prediction system. Nuzzolo et al. [8] utilized a univariate nonlinear regression model to forecast logistics needs. Theoretically, the combined forecasting model was constructed using the inverse of variance weighted distribution approach. Although traditional statistics-based forecasting methods can forecast regional logistics needs by some simple models, such models based on linear functions have simpler assumptions and the forecasting results often do not match the actual situation.
With the success of artificial intelligence, new methods based on neural network are constantly being proposed [9,10,11]. Guo et al. [11] proposed a particle swarm optimization-Radial Basis neural network model, which combines a particle swarm optimization algorithm with a radial basis function neural network to forcast regional logistics needs by training the model with indicator data in the region. Li et al. [12] utilized a neural network mapping method to establish the optimal parameters of a second-order grey forecasting model, and the findings indicated that the proposed method was effectively improved by the accuracy of load forecasting. Jaipuria and Mahapatra [13] employed the Discrete Wavelet Transformation-Artificial Neural Network model for a regional logistics needs forecasting study to decrease inventory costs, and due to that the bullwhip effect and three different local manufacturing firms were forecasted using the Discrete Wavelet Transformation-Artificial Neural Network Model and Autoregressive Integrated Moving Average Model as an example, which conducted extensive comparative experiments and verified the effectiveness of the discrete wavelet transformation-artificial neural network model. Huang et al. [5] introduced a Back Propagation neural network based on the regional logistics needs prediction approach. In comparison to the standard grey prediction model, the model exhibits a lower prediction error and more reliable prediction outcomes. However, the artificial neural network’s learning process error is easy to settle to a local optimum, but learning accuracy with a limited amount of learning data is difficult to guarantee. When there are too many learning samples, the neural network also falls into the dilemma of dimensional catastrophe and weak generalization ability.
Further, to alleviate the above problem, Yu et al. [4] used two learning machine forecasting methods i.e., support vector machine and neural network, to predict the distortion needs in the final supply chain and compare the prediction results of these two forecasting methods with those of traditional forecasting methods. Finally, the prediction accuracy of the learning machines was found to be higher than that of the previous models. Subsequently, Methods based on support vector machine have emerged [14,15,16]. To forecast available renewable resources, Zendehboudi et al. [16] proposed a novel hybrid Support Vector Machine method, which achieved a favorable performance. Hu et al. [17] proposed a Principal Components Analysis-Support Vector Machine model-based algorithm for evaluating port logistics parks in order to predict port needs. To improve the Support Vector Machine-based prediction model, Wang et al. [18] employed fuzzy hierarchical analysis to optimize the parameters, and then trained the Support Vector Machine using the optimised parameters to create the final prediction model.
Regional logistics needs forecasting is a complex task that incorporates theories and methods from many disciplines. Previous methods based on traditional statistics and neural network-based methods have their own shortcomings, and this paper argues that regional logistics methods based on support vector machines will slowly replace both of these methods and are a trend for future research.
Therefore, motivated by the previous methods, we propose a time series and indicator system in the era of epidemics based on Changsha city, which employs Long short-term memory (LSTM) network, Grey Model (1,1), Back Propagation neural network model and the linear regression model to predict Changsha logistics needs. According to the comprehensive comparison of the above four methods, the findings reveal that the LSTM network has a smaller forecast error and more reliable predictions, and we first employ such method to predict the logistics needs of Changsha city in 2022–2024 under the impact of e-commerce and the pandemic.

3. Methodology

3.1. Recurrent Neural Networks

Long short-term memory (LSTM) network belong to Recurrent Neural Network, which is a variant in Recurrent Neural Network. To facilitate the distinction, the LSTM model based on time series prediction is called Time Series-Long short-term memory network; the LSTM model based on impact factor prediction is called Impact Factor-Long short-term memory network.
The prototype of Recurrent Neural Network was first introduced by John Hopfield, Michael I. Jordan defined the concept of Recurrent, and then Jeffery L, Elman proposed the earliest and the simplest Recurrent Neural Network with a single self-connected node at present. Recurrent Neural Network usually owns an input layer, a hidden layer and an output layer, as shown in Figure 1, and unlike traditional neural networks that only perform unidirectional propagation, Recurrent Neural Network has a memory function, and there is a loopback between the hidden layer units based on a time series which is expressed as the current output is influenced by the output of the previous moment. Recurrent Neural Network can be very effective in processing data with sequence characteristics, and can mine the effective information in the data. so Recurrent Neural Networks are now widely used in processing text, speech and other tasks. Figure 1 can be converted into Figure 2 when ignoring W , which shows that the structure is a fully connected neural network structure, through this figure, we can see that X is a three-dimensional vector, U denotes a three-row and four-column parameter matrix from the input layer to the hidden layer, S denotes a four-dimensional vector of the hidden layer, V represents a four-row and two-column parameter matrix from the hidden layer to the output layer, and O represents a two-dimensional vector of the output layer. In Figure 1, U represents the value from the input layer to the hidden layer, S denotes the hidden layer output value, W denotes the weight from the previous moment output to the current moment input, V  denotes the weight vector from the hidden layer to the output layer, and O represents the output vector of the output layer.
Expanding Recurrent Neural Network structure by the time line yields a graph of the hidden layer unit structure, and the connection structure between the hidden layer units of Recurrent Neural Network is illustrated in Figure 3. At moment t , the input of the hidden layer unit is s t 1 and x t , the value of the hidden layer unit is s t , and the output is o t .
From Figure 2, the forward calculation process of Recurrent Neural Network can be introduced as Equations (1) and (2)
s t = f ( W h h x t + W h h s t 1 + b s )
o t = σ ( W h o s t + b o )
where σ is the s i g m o i d function, which is a common activation function in neural networks. b is bias, f is full connection layer.

3.2. Long Short-Term Memory Network

Because Recurrent Neural Network frequently fail owing to gradient explosion and disappearance, a new class of Recurrent Neural Networks called Long short-term memory network (LSTM) was developed to overcome this problem. Sepp Hochreiter and Jürgen Schmidhuber first proposed LSTM in 1997. LSTM has more input gates compared to recurrent neural networks, forgetting gate and output gate, which can get the corresponding weights and have different values at different moments with fixed parameters. Thus, LSTM can control the memory of the whole process better compared to recurrent neural networks, and thus successfully avoid the problems of gradient explosion and vanishing.
The cell state lies at the heart of the LSTM, with the horizontal line at the top of the graph running through the upper part of the graph. The cell states extend along with a straight line like a chain, and the linear interactions are small. Information can flow very easily. The LSTM can change information to the cell state according to the gate structure in Figure 4. It can be found that at time t, the LSTM cell owns three inputs, which are the input sequence x t , the output of the previous step h t 1 and the LSTM cell state c t 1 ; there are two outputs, h t , c t ; and three internal gates, which are the forgetting gate f , the input gate i and the output gate O . The LSTM implements protection and control of cell states through three gate cells.
The work of LSTM can be divided into four steps.
(1) By selecting that unnecessary information should be eliminated from the cell state, the forgetting gate produces the outputs shown in Equation (3).
f t = σ ( W f [ h t 1 , x t ] + b f )
where σ is the s i g m o i d function, which is a common activation function in neural networks.
The output is ft which is then multiplied point by point with the cell state of c t 1 . The data is between 0–1. If when the data is 0, it means that all information is discarded, and conversely when the data is 1, it means that all information is retained.
  • c t 1 : Output of the previous moment of the cell state
  • h t 1 : Template output from the previous moment
  • f t : Output of the Oblivion Gate
  • W f : Weight matrix
  • b f : Offset
  • [ h t 1 , x t ] : two matrices stitched together
(2) The input gate has two parts that store information in the cell state: the first is the input gate layer’s decision on which there is information to update, and the second is a vector of optional values created by the layer to decide which information to add to those in the cell state. As in Equations (4) to (5)
i t = σ ( W i [ h t 1 , x t ] + b i )
c ˜ t = tan h ( W c [ h t 1 , x t ] + b c )
(3) Multiply the old state c t 1 by the point f t , indicating that the information that has been decided to be forgotten is discarded, and then add i t c ˜ t , thus forming the new cell state c t , which is the new optional value, scaled in size according to the state value. Equation (6).
c t = f t c t 1 + i t c ˜ t
(4) The output gate performs the output, and the cell state determines the output. As in Equations (7) to (8)
o t = σ ( W o [ h t 1 , x t ] + b o )
h t = o t tan h ( c t )

3.3. Back Propagation Neural Networks

Rumelhart [19] proposed the Back Propagation neural network, which solves the learning problem of multilayer neural networks. It is composed of three layers, including an input layer, a hidden layer, and an output layer, which has a wide range of applications. Firstly, the signal is transmitted from the input layer to the hidden layer through forward propagation and calculated in the hidden layer. Then the results of the hidden layer calculation are transmitted to the output layer and output. Finally, the results are compared with the expected values, and the error is corrected by back propagation. For example, Bai et al. [20], Fuh, K. H. et al., [21] employed the prediction model to medical, creep feed grinding, respectively. To facilitate the distinction, Back Propagation neural network model based on time series prediction is called Time Series-Back Propagation neural network; the Back Propagation neural network model based on impact factor prediction is called Impact Factor-Back Propagation neural network.

3.4. Grey Model (1,1)

Deng [22] fist introduced the Grey Model (1,1), which generated an approximation exponential rule by summarizing the original data and the modelling it. The model has been widely used in various area. For example, Liu et al. [23] and Wang et al. [24] employed the prediction model to tourism and construction. To facilitate the distinction, the Grey Model (1,1) is divided into Time Series-Grey Model (1,1) and Impact Factor-Grey Model (1,1). The modeling steps of Grey Model (1,1) are:
(1)
Build the original sequence, X 0 can be set to the original sequence:
X ( 0 ) = [ X ( 0 ) ( 1 ) , X ( 0 ) ( 2 ) , X ( 0 ) ( 3 ) , , X ( 0 ) ( n ) ]
(2)
Calculate a cumulative sequence, generate a new sequence, set to   X 1
X ( 1 ) = [ X ( 1 ) ( 1 ) , X ( 1 ) ( 2 ) , X ( 1 ) ( 3 ) , , X ( 1 ) ( n ) ]
X 1 ( k ) = j = 1 k X 0 ( j ) , j = 1 , 2 , 3 , , k
(3)
Generate the mean and calculate its background value
Z ( 1 ) ( k ) = 1 2 ( X ( 1 ) ( k ) + X ( 1 ) ( k 1 ) )
(4)
Construct matrices B and Y .
B = [ Z ( 1 ) ( 1 )   1 Z ( 1 ) ( 2 )   1 Z ( 1 ) ( n )   1 ] ,
Y = [ X ( 0 ) ( 2 ) X ( 0 ) ( 3 ) X ( 0 ) ( n ) ] ,
(5)
Construct differential equations
d X ( 1 ) d t + a X ( 1 ) = u ,
Solution of differential equation:
X ( 1 ) ( k + 1 ) = [ X ( 0 ) ( 1 ) u a ] e a k + u a ,
Coefficient vector:   a ^ = [ a , u ] T .

3.5. Linear Regression

Francis Galton proposed the linear regression model, which uses regression equations to model the connection between one or more independent and dependent variables. The model has been widely used in various area. For example, Goldberger et al. [25] and Massie et al. [26] applied the predictive model to studies in biology, temperatures, construction and other industries, respectively. To facilitate the distinction, the Linear regression model based on time series prediction is called Time Series-Linear Regression; the Linear regression model based on impact factor prediction is called Impact Factor-Linear Regression. The modeling steps of Grey Model (1,1) are:
(1)
For a given sequence of n points, ( x 1 , y 1 ) , ( x 2 , y 2 ) ( x n , y n ) Let the linear regression equation be:
y = b x + a
(2)
The total distance from the point in the direction to the straight line can use i = 1 n [ y i ( a + b x i ) ] 2 to quantitatively describe, so it can be regarded as a binary function:
Q ( a , b ) = i = 1 n [ y i ( a + b x i ) ] 2
(3)
Therefore, the problem of finding a straight line and making it closest to a point is transformed into finding two numbers a ^ , b ^ , so that the binary function Q ( a , b ) reaches the minimum at a = a ^ , b = b ^ . Through formula derivation, finally we can obtain:
b = i = 1 n ( x i x ¯ ) ( y i y ¯ ) i = 1 n ( x i x ¯ )
x ¯ = 1 n i = 1 n x i ,   y ¯ = 1 n i = 1 n y i ,   a = y ¯ b x ¯

4. Experimental Results and Analysis

4.1. Experimental Data Selection and Pre-Processing

Indicators for measuring logistics needs include logistics operations, cargo transportation, cargo turnover, logistics equipment, logistics employees, distribution operations and commodity inventory, etc. When selecting logistics indicators, we should not only consider the differences between indicators, but also consider the difficulty of obtaining each indicator. Therefore, we choose the freight volume as the measurement indicator. Since the volume of goods transported in reality is affected by more factors and the period of influencing factor values is long, this paper uses the annual unit for forecasting.
On the selection of impact factors, Huang et al. [5] selected 12 as observed variables including Gross Domestic Product, per capita disposable income, and investment in logistics fixed assets; Fan and Wu [27] chose Gross Domestic Product, the total social logistics cost, overall output value of one, two, three industries, freight volume, etc. to predict logistics needs. Du and Chen [28] employed Gross Domestic Product, postal services, and consumer consumption level as indicators for logistics needs forecasting; Han et al. [29] utilized Gross Domestic Product, overall social logistics cost, social fixed asset investment and import and export volume as indicators for logistics needs measurement and forecasting.
In view of the impact of the new crown pneumonia epidemic, many indicators are relatively lacking, so the impact factor is part of the indicator system constructed in the literature [5,27] as the attribute variables for the target prediction, in order to reflect the influence of the new crown pneumonia virus on logistics needs, and considering the integrity of the data, the more complete six indicators are selected as the impact factor. The influencing factors of logistics needs ( Y ) are Gross Domestic Product ( x 1 ), urban disposable income per capita ( x 2 ), overall retail sales of consumer products ( x 3 ), overall import and export commerce ( x 4 ), total postal and telecoms revenue ( x 5 ) and cargo turnover ( x 6 ).
The above data are presented in Table 1. First, the impact factors were normalized and calculated as follows:
X u = X u X min X max X min
where X u denotes the original data; X u denotes the normalized data; X max and X min denote the maximum and minimum values of each variable, respectively. As shown in Table 2, the pre-processed data can be obtained by normalizing the data from the literature [1].

4.2. Models Accuracy Evaluation

To validate the accuracy and reliability of the four models, Mean Absolute Percentage Error (MAPE), Root Mean Square Error (RMSE) and Mean Absolute Error (MAE) [30] were employed to assess the accuracy. Three error expressions are shown in (20) to (22), respectively.
M A E = 1 m i = 1 m | ( y i y ^ i ) |
R M S E = 1 m i = 1 m ( y i y ^ i ) 2
M A P E = 1 m i = 1 m | ( y i y ^ i ) | 100 % y ^ i
where m denotes the number of predicted outcomes; y ^ i denotes the actual value; y i denotes the predicted value.

4.3. Data Predictions and Results

To demonstrate the accuracy of Long short-term memory (LSTM) model prediction, we compare the LSTM with Grey Model, Back Propagation neural network model and linear regression model. Annual freight volume was used to create Grey Model, which was based on time series and impact variables, as well as the Back Propagation neural network model and linear regression model, all were trained and tested by MATLAB R2021b.

4.3.1. Prediction of LSTM Models

The setting of the hidden layer of Time Series-Long short-term memory network was determined by the grid search method [31] by selecting 19 hidden layer cells with 185 iterations, a step size of 1. 80% of the data set is used as a training set and the remaining 20% as a test set, and the optimization algorithm uses Adaptive Moment Estimation. As shown in Figure 5, the actual and predicted values are compared. The Time Series-Long short-term memory network model prediction error values are illustrated in Table 3.
The prediction model dimension of Impact Factor-Long short-term memory network model is then six dimensions input one dimension output, the Impact Factor-Long short-term memory network hidden layer unit is set to 23, the number of iterations is 250, the step size is 1, and the initial learning rate is 0.005. The prediction results are illustrated in Table 3.
After 2019, both time series and impact factor prediction of freight volume are gradually close to actual freight volume. This shows that LSTM has excellent prediction performance.

4.3.2. Prediction of Back Propagation Neural Network Models

The Time Series-Back Propagation neural network model, based on empirical Equation (23) [32] and repeated training tests, uses a 3-layer network structure with 1 neuron in the input layer and 4 neurons in the hidden layer. The actual and predicted values are compared as shown in Figure 6, and the prediction error values of the Back Propagation neural network time series model are illustrated in Table 4.
Impact Factor-Back Propagation neural network, through the empirical formula and repeated training tests, owns the optimal prediction accuracy when the number of hidden layer neurons is determined as 12. Figure 6 shows Back Propagation neural network prediction model that is created.
It is discovered that there is a tiny discrepancy between the projected and real values of logistics needs size by plotting the actual and forecasted values. Both Time Series-Back Propagation neural network and Impact Factor-Back Propagation neural network are less accurate than LSTMN.
h = m + n + a
where h denotes the amount of hidden layer nodes, m represents the amount of input layer nodes, n represents the amount of output layer nodes, and a denotes a regulation constant between 1 and 10.
Figure 7 demonstrates a comparison of the actual and results of Grey Model (1,1), and Table 5 reveals the prediction fault values of Grey Model (1,1). Figure 7 can be observed that the anticipated values differ significantly from the actual ones. When compared with the neural network method, the accuracy of the traditional prediction method is lower than that of the neural network.

4.3.3. Linear Regression Model

Figure 8 provides a comparison of the linear regression model’s actual and anticipated values, while Table 6 illustrates the model prediction error. Figure 8 demonstrates that the time series of logistics needs is not a linear function and there is also a large error in the multiple linear regression based on the impact factor predictions.
As can be seen from Table 7 and Table 8, the MAE, RMSE and MAPE of the LSTM are the smallest between the four forecasting methods. Among them, the MAPE of time series and impact factor of LSTM network is only 2.2874% and 1.3200%. However, the linear regression model has the worst results, which shows that the single linear regression model is not suitable for predicting regional logistics needs. Compared with neural network prediction, traditional prediction methods have poor accuracy, which demonstrates that the LSTM has the best application and extension values among four models.

4.4. Prediction of Logistics Needs Scale

As shown in Table 3, Time Series-Long short-term memory network is better than Impact Factor-Long short-term memory network from 2019 to 2021, so the logistics needs for Changsha City 2022–2024 is forecasted by the established LSTM time series forecasting model, with predicted results of 512.752, 516.784 and 519.694 million tons, respectively, as illustrated in Table 9. Changsha City’s logistics needs will increase from 58.54 million tonnes in 1998 to 519.694 million tonnes in 2024, and the increase is very large. Although the epidemic is constantly repeated, driven by e-commerce, community group buying and other factors, logistics needs in Changsha city will increase, so the government and logistics companies should seize this opportunity to promote the recovery and development of the local economy. At the same time, epidemic prevention and disinfection of goods still cannot be taken lightly.

5. Conclusions

Regional logistics demand prediction is employed for enterprise taking appropriate adjustment strategies and measures according to the prediction results, effectively avoiding risks, and seeking the maximum benefits. In the context of the ups and downs of the epidemic, if there is no effective prediction model for regional logistics demand, enterprises may not be able to make effective personnel and material scheduling, financial investment and other issues. For example, in the annual shopping carnival, if businesses cannot effectively predict this year’s sales volume, it will cause a large number of products and funds to be overstocked or the number of products to be insufficient, which will damage the interests of businesses. Therefore, the aim of the research is to select an effective model in term of e-commerce and the post-epidemic era by analyzing and comparing Grey Model (1,1), Back Propagation neural network, Linear regression, and Long Short-Term Memory (LSTM) to predict the regional logistics needs of Changsha city. Meanwhile, the LSTM network has the best results among the four models. The results show that LSTM model has a smaller prediction error and more stable prediction results due to its effective use of time series.
However, there are still several drawbacks. The first is that the selection of the index system is somewhat subjective, and the regional logistics needs will also be affected by, for example, the level of logistics service. Therefore, in future research, it is worth to extend the regional logistics forecasting index system to other diversified angles. Secondly, the approaches chosen for comparison have certain drawbacks, and it is evident that forecasting methods such as mixed forecasting models and support vector machines are equally viable options. In the future, we should add different forecasting models. Then, the selected region also has certain limitations, and does not verify whether the logistics demand of all regions will increase under the influence of epidemic and e-commerce. Finally, it is that this paper has not been able to optimize the LSTM network so that it can achieve the idealized results. Thus, it is necessary to take more attention to the related work so as to further study.

Author Contributions

Conceptualization, Y.L. and Z.W.; Formal analysis, Y.L.; Investigation, Y.L.; Methodology, Y.L.; Resources, Z.W.; Writing—original draft, Y.L.; Writing—review & editing, Z.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Conflicts of Interest

The authors do not have any possible conflict of interest.

References

  1. Geng, J.; Li, C. Empirical research on the spatial distribution and determinants of regional e-commerce in China: Evidence from Chinese provinces. Emerg. Mark. Finance Trade 2020, 56, 3117–3133. [Google Scholar] [CrossRef]
  2. Mohdhar, A.; Shaalan, K. The future of e-commerce systems: 2030 and beyond. In Recent Advances in Technology Acceptance Models and Theories; Springer: Cham, Switzerland, 2021; pp. 311–330. [Google Scholar]
  3. Mladenow, A.; Bauer, C.; Strauss, C. “Crowd logistics”: The contribution of social crowds in logistics activities. Int. J. Web Inf. Syst. 2016, 12, 379–396. [Google Scholar] [CrossRef]
  4. Yu, N.; Xu, W.; Yu, K.L. Research on regional logistics needs forecast based on improved support vector machine: A case study of Qingdao city under the New Free Trade Zone Strategy. IEEE Access 2020, 8, 9551–9564. [Google Scholar] [CrossRef]
  5. Huang, L.; Xie, G.; Zhao, W.; Gu, Y.; Huang, Y. Regional logistics needs forecasting: A BP neural network approach. Complex Intell. Syst. 2021, 1–16. [Google Scholar] [CrossRef]
  6. Samvedi, A.; Jain, V. A grey approach for forecasting in a supply chain during intermittentdisruptions. Eng. Appl. Artif. Intell. 2013, 26, 1044–1051. [Google Scholar] [CrossRef]
  7. Sheu, J.B.; Kundu, T. Forecasting time-varying logistics distribution flows in the One Belt-One Road strategic context. Transp. Res. Part E Logist. Transp. Rev. 2018, 117, 5–22. [Google Scholar] [CrossRef]
  8. Nuzzolo, A.; Comi, A. City logistics planning: Needs modelling requirements for direct effect forecasting. Procedia Soc. Behav. Sci. 2014, 125, 239–250. [Google Scholar] [CrossRef]
  9. Choi, T.M.; Wen, X.; Sun, X.; Chung, S.H. The mean-variance approach for global supply chain risk analysis with air logistics in the blockchain technology era. Transp. Res. Part E Logist. Transp. Rev. 2019, 127, 178–191. [Google Scholar] [CrossRef]
  10. He, Y.; Liu, N. Methodology of emergency medical logistics for public health emergencies. Transp. Res. Part E Logist. Transp. Rev. 2015, 79, 178–200. [Google Scholar] [CrossRef]
  11. Guo, H.; Guo, C.; Xu, B.; Xia, Y.; Sun, F. MLP neural network-based regional logistics needs prediction. Neural Comput. Appl. 2021, 33, 3939–3952. [Google Scholar] [CrossRef]
  12. Li, B.; Zhang, J.; He, Y.; Wang, Y. Short-term load-forecasting method based on wavelet decomposition with second-order gray neural network model combined with ADF test. IEEE Access 2017, 5, 16324–16331. [Google Scholar] [CrossRef]
  13. Jaipuria, S.; Mahapatra, S.S. An improved needs forecasting method to reduce bullwhip effect in supply chains. Expert Syst. Appl. 2014, 41, 2395–2408. [Google Scholar] [CrossRef]
  14. Prakash, C.; Barua, M.K. Integration of AHP-TOPSIS method for prioritizing the solutions of reverse logistics adoption to overcome its barriers under fuzzy environment. J. Manuf. Syst. 2015, 37, 599–615. [Google Scholar] [CrossRef]
  15. Wang, D.Z.; Lang, M.X.; Sun, Y. Evolutionary game analysis of co-opetition relationship between regional logistics nodes. J. Appl. Res. Technol. 2014, 12, 251–260. [Google Scholar] [CrossRef] [Green Version]
  16. Zendehboudi, A.; Baseer, M.A.; Saidur, R. Application of support vector machine models for forecasting solar and wind energy resources: A review. J. Clean. Prod. 2018, 199, 272–285. [Google Scholar] [CrossRef]
  17. Hu, B. Application of evaluation algorithm for port logistics park based on PCA-SVM model. Pol. Marit. Res. 2018, 3, 29–35. [Google Scholar] [CrossRef] [Green Version]
  18. Wang, Q.M.; Fan, A.W.; Shi, H.S. Network traffic prediction based on improved support vector machine. Int. J. Syst. Assur. Eng. Manag. 2017, 8, 1976–1980. [Google Scholar] [CrossRef]
  19. Rumelhart, D.E.; Hinton, G.E.; Williams, R.J. Learning representations by back-propagating errors. Nature 1986, 323, 533–536. [Google Scholar] [CrossRef]
  20. Bai, Y.; Jin, Z. Prediction of SARS epidemic by BP neural networks with online prediction strategy. Chaos Solitons Fractals 2005, 26, 559–569. [Google Scholar] [CrossRef]
  21. Fuh, K.H.; Wang, S.B. Force modeling and forecasting in creep feed grinding using improved BP neural network. Int. J. Mach. Tools Manuf. 1997, 37, 1167–1178. [Google Scholar] [CrossRef]
  22. Deng, J.L. Control problems of grey system. Syst. Control Lett. 1982, 1, 5. [Google Scholar]
  23. Liu, X.; Peng, H.; Bai, Y.; Zhu, Y.; Liao, L. Tourism flows prediction based on an improved grey GM (1, 1) model. Procedia Soc. Behav. Sci. 2014, 138, 767–775. [Google Scholar] [CrossRef] [Green Version]
  24. Wang, X.; Chen, Z.; Yang, C.; Chen, Y. Gray predicting theory and application of energy consumption of building heat-moisture system. Build. Environ. 1999, 34, 417–420. [Google Scholar] [CrossRef]
  25. Goldberger, A.S. Best linear unbiased prediction in the generalized linear regression model. J. Am. Stat. Assoc. 1962, 57, 369–375. [Google Scholar] [CrossRef]
  26. Massie, D.R.; Rose, M.A. Predicting daily maximum temperatures using linear regression and Eta geopotential thickness forecasts. Weather Forecast. 1997, 12, 799–807. [Google Scholar] [CrossRef]
  27. Fan, S.X.; Wu, B. Prediction analysis for logistics needs based on multiple kernels. Ind. Eng. Manag. 2018, 23, 40–44. [Google Scholar]
  28. Du, B.; Chen, A. Research on Logistics Needs Forecasting Based on the Combination of Grey GM (1, 1) and BP Neural Network. In Proceedings of the 5th Annual International Conference on Network and Information Systems for Computers (ICNISC2019), Wuhan, China, 19–20 April 2019; IOP Publishing: Bristol, UK, 2019; Volume 1288, p. 012055. [Google Scholar]
  29. Han, H.J.; Han, J.B.; Zhang, R. Study on logistics needs forecasting model based on fuzzy cognitive map. Syst. Eng. Theory Pract. 2019, 39, 1487–1495. [Google Scholar]
  30. Brailsford, T.J.; Faff, R.W. An evaluation of volatility forecasting techniques. J. Bank. Finance 1996, 20, 419–438. [Google Scholar] [CrossRef]
  31. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
  32. Chen, M. MATLAB Neural Network Principles and Examples of Fine Solution; Tsinghua University Press: Beijing, China, 2013; pp. 156–158. [Google Scholar]
Figure 1. Recurrent Neural Network structure diagram.
Figure 1. Recurrent Neural Network structure diagram.
Sustainability 14 13478 g001
Figure 2. Fully connected neural network structure.
Figure 2. Fully connected neural network structure.
Sustainability 14 13478 g002
Figure 3. Recurrent Neural Network hidden layer unit structure diagram.
Figure 3. Recurrent Neural Network hidden layer unit structure diagram.
Sustainability 14 13478 g003
Figure 4. LSTM model structure diagram.
Figure 4. LSTM model structure diagram.
Sustainability 14 13478 g004
Figure 5. LSTM model prediction graph.
Figure 5. LSTM model prediction graph.
Sustainability 14 13478 g005
Figure 6. Back Propagation neural network model predicted values.
Figure 6. Back Propagation neural network model predicted values.
Sustainability 14 13478 g006
Figure 7. Grey model predicted values.
Figure 7. Grey model predicted values.
Sustainability 14 13478 g007
Figure 8. Linear regression model predicted values.
Figure 8. Linear regression model predicted values.
Sustainability 14 13478 g008
Table 1. Data on logistics requirements.
Table 1. Data on logistics requirements.
YearLogistics Needs/Million TonsGross Domestic Product/BillionUrban Disposable Income Per Capita/$Overall Retail Sales of Consumer Products/BillionOverall Import and Export Commerce/$ BillionTotal Postal and Telecoms Revenue/BillionCargo Turnover/Billion Tonne Kilometers
19985834542665022612.0820100
19996456588729719312.7626.71101
20005910656798523416.4437140
20017550728870434416.5149139
20028766812902140116.643679
200310,6329299933451204191
200411,066113311,0215252443101
200510,991151912,4347432647100
200612,478179813,9248652955109
200716,184219016,15310374058129
200817,158300018,28212735168132
200921,074374420,23815244178176
201022,947454722,81418646090219
201125,651561926,451220174104257
201226,145639930,288252186114301
201328,048715333,662280198135334
201430,449782436,8263162125194359
201533,932851039,9613690129250386
201636,767932343,2944117109378387
201741,73910,53546,9484547138557448
201843,79211,00350,7924765193647487
201949,01711,57455,21152472891006567
202049,34612,14257,97144833401271570
202150,36113,27062,14551114301665590
Table 2. Normalized data on logistics requirements.
Table 2. Normalized data on logistics requirements.
YearLogistics Needs/Million TonsGross Domestic Product/BillionUrban Disposable Income Per Capita/$Overall Retail Sales of Consumer Products/BillionOverall Import and Export Commerce/$ BillionTotal Postal and Telecoms Revenue/BillionCargo Turnover/Billaion Tonne Kilometers
199858340.0000.0000.0070.0000.000 0.041
199964560.0030.0120.0000.0020.0040.043
200059100.0090.0240.0080.0100.0100.119
200175500.0150.0370.0300.0110.0180.117
200287660.0210.0430.0410.0110.0100.000
200310,6320.0300.0590.0510.0190.0130.023
200411,0660.0460.0790.0660.0290.0140.043
200510,9910.0770.1040.1090.0330.0160.041
200612,4780.0990.1310.1330.0400.0210.172
200716,1840.1290.1710.1670.0670.0230.098
200817,1580.1930.2100.2140.0930.0300.104
200921,0740.2520.2450.2630.0690.0350.190
201022,9470.3150.2910.3310.1150.0430.274
201125,6510.3990.3570.3970.1480.0510.348
201226,1450.4600.4260.4610.1770.0570.434
201328,0480.5190.4870.5160.2060.0700.499
201430,4490.5720.5430.5870.2700.1060.548
201533,9320.6260.6000.6920.2800.1400.601
201636,7670.6900.6600.7760.2320.2180.603
201741,7390.7850.7260.8610.3010.3260.722
201843,7920.8220.7950.9050.4330.3810.798
201949,0170.8670.8751.0000.6630.5990.955
202049,3460.9110.9250.8490.7850.7600.961
202150,3611.0001.0000.9731.0001.0001.000
Table 3. LSTM model prediction results.
Table 3. LSTM model prediction results.
YearReal ValueTime Series-Long Short-Term Memory NetworkImpact Factor-Long Short-Term Memory Network
Predicted ValueRelative ErrorPredicted ValueRelative Error
201843,79245,130.30.030644,469.60.0155
201949,01747,751−0.025847,334−0.0343
202049,34649,4040.001249,276.8−0.0014
202150,36150,341−0.000450,276.2−0.0017
MAPE/%1.4490%1.3200%
MAE670.5762628.6807
RMSE921.6036908.8091
Table 4. Back Propagation neural network model prediction results.
Table 4. Back Propagation neural network model prediction results.
YearReal ValueTime Series-Back Propagation Neural Network Impact Factor-Back Propagation Neural Network
Predicted ValueRelative ErrorPredicted ValueRelative Error
201843,79246,1230.053242,568−0.0073
201949,01750,120.20.022545,704.6−0.0373
202049,34653,254.80.079247,089.4−0.0266
202150,36155,484.20.101746,339.1−0.0159
MAPE/%6.4169%5.4982%
MAE3116.54842688.7188
RMSE3470.46022883.5193
Table 5. Grey model prediction results.
Table 5. Grey model prediction results.
YearReal ValueTime Series-Grey Model (1,1) Impact Factor-Grey Model (1,1)
Predicted ValueRelative ErrorPredicted ValueRelative Error
201843,79247,496.50.084643,672.5−0.0027
201949,01752,496.70.070925,073−0.4885
202049,34658,023.40.175838,530−0.2192
202150,36164,131.830.273424,172−0.5200
MAPE/%15.1175%30.7600%
MAE7408.107515,267
RMSE8565.911418,548
Table 6. Linear regression model prediction results.
Table 6. Linear regression model prediction results.
YearReal ValueTime Series-Linear RegressionImpact Factor-Linear Regression
Predicted ValueRelative ErrorPredicted ValueRelative Error
201843,79226,501.8−0.394841,363−0.0555
201949,01730,416.1−0.379540,076−0.1824
202049,34627,409.6−0.451842,514−0.1385
202150,36128,214.5−0.439845,919−0.0882
MAPE/%41.4650%11.6150%
MAE19993.55661
RMSE20,103.59116169.4633
Table 7. Evaluation indicators for time series prediction results.
Table 7. Evaluation indicators for time series prediction results.
ModelsRMSEMAEMAPE/%
LSTM921.6036670.57621.4490%
Grey Model (1,1)8565.91147408.107515.1175%
Back Propagation neural network3470.46023116.54846.4169%
Linear regression model20,103.591119,993.500041.4650%
Table 8. Evaluation indicators for impact factor prediction results.
Table 8. Evaluation indicators for impact factor prediction results.
ModelsRMSEMAEMAPE/%
LSTM908.8091628.68071.3200%
Grey Model (1,N)18,548.000015,267.000030.7600%
Back Propagation neural network2883.51932688.71885.4982%
linear regression6169.46335661.000011.6150%
Table 9. Logistics needs scale from 2022 to 2024.
Table 9. Logistics needs scale from 2022 to 2024.
YearFreight Volume
202251,275.2
202351,678.4
202451,969.4
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Li, Y.; Wei, Z. Regional Logistics Demand Prediction: A Long Short-Term Memory Network Method. Sustainability 2022, 14, 13478. https://doi.org/10.3390/su142013478

AMA Style

Li Y, Wei Z. Regional Logistics Demand Prediction: A Long Short-Term Memory Network Method. Sustainability. 2022; 14(20):13478. https://doi.org/10.3390/su142013478

Chicago/Turabian Style

Li, Ya, and Zhanguo Wei. 2022. "Regional Logistics Demand Prediction: A Long Short-Term Memory Network Method" Sustainability 14, no. 20: 13478. https://doi.org/10.3390/su142013478

APA Style

Li, Y., & Wei, Z. (2022). Regional Logistics Demand Prediction: A Long Short-Term Memory Network Method. Sustainability, 14(20), 13478. https://doi.org/10.3390/su142013478

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop