1. Introduction
Since the twentieth century, with the booming development of the Internet industry, E-commerce has gradually entered people’s lives, widely attracted people’s attention, and the world has gradually entered a new economic era characterized by the development of e-commerce [
1,
2]. Due to the further promotion and application of e-commerce, people’s needs for regional logistics is constantly rising.
Regional logistics needs forecasting is the advanced estimation and projection of the flow of goods, sources, flow direction, flow rate and composition of goods that have not yet occurred or are not clear at present in the region, so that it can be used to study the size of regional logistics needs and its hierarchical structure of needs, to inform decision-making and provide a foundation for regional logistics planning, and to further provide a scientific and reliable basis for government departments to formulate logistics policies and coordinate the allocation of logistics resources, etc. In addition, the regional logistics needs forecasting is accurate and reasonable. In addition, an accurate and reasonable forecast of regional logistics needs will help the Government to integrate regional logistics resources and establish an economical and efficient regional logistics system.
With the rapid advancement of economic globalization and regional economic integration, logistics needs is on the rise year by year and has emerged as a new engine for promoting healthy and stable regional economic development, with regional logistics needs forecasting having a significant impact on regional development planning. In recent years, logistics needs forecasting methods have received extensive attention. For example, some methods [
3,
4,
5] have been proposed for putting forward to solve the issue of poor logistics and distribution ability from different angles. Mladenow et al. [
3] proposed that introducing crowdsourced logistics would successfully tackle the last-mile problem of urban logistics and distribution, and alleviate the difficulties such as insufficient logistics and distribution capacity, during peak periods. Yu et al. [
4] employed different regional logistics needs forecasting models to predict regional logistics needs and provide decision assistance for local economic development. Further, the impact of e-commerce on logistics and distribution cannot be overlooked in the age of e-commerce. Huang et al. [
5] considered ground indicators related to e-commerce development and applied Grey Model (1,1) model and Back Propagation neural network model for predicting regional logistics. However, as the novel coronavirus continues to spread internationally, just as Severe acute respiratory syndrome coronavirus swept the world in 2003, it also had a serious impact on the logistics industry at that time. Therefore, 2004–2005 is also known as the post-epidemic era of Severe acute respiratory syndrome coronavirus, makes the models have better training, and one that can help for the prediction of freight volume in the post-epidemic era of the novel corona pneumonia, and has the effect of migration learning, so that its accuracy is more accurate. There is no doubt that this will affect every aspect of our lives including the logistics industry in the age of e-commerce. Previous methods, when performing logistics needs forecasting, did not consider the general environment of the new crown virus. Therefore, unlike the previous methods, this paper for the first time integrates the factors under the influence of the epidemic and the two factors of e-commerce development for forecasting. However, in the field of logistics needs forecasting, model building is still dominated by support vector regression machines, and deep learning networks are not yet studied much at present. Motivated by the issue above, we propose a regional logistics forecasting model based on Long short-term memory (LSTM) network.
The main contributions of our work are as follows:
(1) We create a logistics needs forecasting index system for Changsha city, China, which provides a novel aspect for selecting indicators, combining e-commerce with novel coronavirus pneumonia.
(2) We apply the LSTM network to the regional logistics demand forecasting. Extensive experiments demonstrate that the LSTM network can be well used in regional logistics needs forecasting.
(3) Through the prediction of the future freight volume, the government and logistics enterprises can seize this opportunity to further promote the development of the local economy.
2. Related Work
Recently, a large number of regional logistics needs forecasting methods have been proposed and can be broadly grouped into the three types as follows.
The first is the traditional statistical forecasting methods, mainly mathematical modeling, input-output methods, regression analysis, grey theory clustering, combinatorial forecasting methods and Markov chains [
6,
7]. Samvedi et al. [
6] performed a simulation experiment on the beer game to compare the effectiveness of three proven prediction models and grey methodologies in this disrupted and stable situation. The grey forecasting approach showed the most stability according to the findings. In estimating sales of Chilean supermarkets, Sheu et al. [
7] used a combination forecasting technique, combining a neural network architecture and a rolling average algorithm into a combined prediction system. Nuzzolo et al. [
8] utilized a univariate nonlinear regression model to forecast logistics needs. Theoretically, the combined forecasting model was constructed using the inverse of variance weighted distribution approach. Although traditional statistics-based forecasting methods can forecast regional logistics needs by some simple models, such models based on linear functions have simpler assumptions and the forecasting results often do not match the actual situation.
With the success of artificial intelligence, new methods based on neural network are constantly being proposed [
9,
10,
11]. Guo et al. [
11] proposed a particle swarm optimization-Radial Basis neural network model, which combines a particle swarm optimization algorithm with a radial basis function neural network to forcast regional logistics needs by training the model with indicator data in the region. Li et al. [
12] utilized a neural network mapping method to establish the optimal parameters of a second-order grey forecasting model, and the findings indicated that the proposed method was effectively improved by the accuracy of load forecasting. Jaipuria and Mahapatra [
13] employed the Discrete Wavelet Transformation-Artificial Neural Network model for a regional logistics needs forecasting study to decrease inventory costs, and due to that the bullwhip effect and three different local manufacturing firms were forecasted using the Discrete Wavelet Transformation-Artificial Neural Network Model and Autoregressive Integrated Moving Average Model as an example, which conducted extensive comparative experiments and verified the effectiveness of the discrete wavelet transformation-artificial neural network model. Huang et al. [
5] introduced a Back Propagation neural network based on the regional logistics needs prediction approach. In comparison to the standard grey prediction model, the model exhibits a lower prediction error and more reliable prediction outcomes. However, the artificial neural network’s learning process error is easy to settle to a local optimum, but learning accuracy with a limited amount of learning data is difficult to guarantee. When there are too many learning samples, the neural network also falls into the dilemma of dimensional catastrophe and weak generalization ability.
Further, to alleviate the above problem, Yu et al. [
4] used two learning machine forecasting methods i.e., support vector machine and neural network, to predict the distortion needs in the final supply chain and compare the prediction results of these two forecasting methods with those of traditional forecasting methods. Finally, the prediction accuracy of the learning machines was found to be higher than that of the previous models. Subsequently, Methods based on support vector machine have emerged [
14,
15,
16]. To forecast available renewable resources, Zendehboudi et al. [
16] proposed a novel hybrid Support Vector Machine method, which achieved a favorable performance. Hu et al. [
17] proposed a Principal Components Analysis-Support Vector Machine model-based algorithm for evaluating port logistics parks in order to predict port needs. To improve the Support Vector Machine-based prediction model, Wang et al. [
18] employed fuzzy hierarchical analysis to optimize the parameters, and then trained the Support Vector Machine using the optimised parameters to create the final prediction model.
Regional logistics needs forecasting is a complex task that incorporates theories and methods from many disciplines. Previous methods based on traditional statistics and neural network-based methods have their own shortcomings, and this paper argues that regional logistics methods based on support vector machines will slowly replace both of these methods and are a trend for future research.
Therefore, motivated by the previous methods, we propose a time series and indicator system in the era of epidemics based on Changsha city, which employs Long short-term memory (LSTM) network, Grey Model (1,1), Back Propagation neural network model and the linear regression model to predict Changsha logistics needs. According to the comprehensive comparison of the above four methods, the findings reveal that the LSTM network has a smaller forecast error and more reliable predictions, and we first employ such method to predict the logistics needs of Changsha city in 2022–2024 under the impact of e-commerce and the pandemic.
3. Methodology
3.1. Recurrent Neural Networks
Long short-term memory (LSTM) network belong to Recurrent Neural Network, which is a variant in Recurrent Neural Network. To facilitate the distinction, the LSTM model based on time series prediction is called Time Series-Long short-term memory network; the LSTM model based on impact factor prediction is called Impact Factor-Long short-term memory network.
The prototype of Recurrent Neural Network was first introduced by John Hopfield, Michael I. Jordan defined the concept of Recurrent, and then Jeffery L, Elman proposed the earliest and the simplest Recurrent Neural Network with a single self-connected node at present. Recurrent Neural Network usually owns an input layer, a hidden layer and an output layer, as shown in
Figure 1, and unlike traditional neural networks that only perform unidirectional propagation, Recurrent Neural Network has a memory function, and there is a loopback between the hidden layer units based on a time series which is expressed as the current output is influenced by the output of the previous moment. Recurrent Neural Network can be very effective in processing data with sequence characteristics, and can mine the effective information in the data. so Recurrent Neural Networks are now widely used in processing text, speech and other tasks.
Figure 1 can be converted into
Figure 2 when ignoring
, which shows that the structure is a fully connected neural network structure, through this figure, we can see that
is a three-dimensional vector,
denotes a three-row and four-column parameter matrix from the input layer to the hidden layer,
denotes a four-dimensional vector of the hidden layer,
represents a four-row and two-column parameter matrix from the hidden layer to the output layer, and
represents a two-dimensional vector of the output layer. In
Figure 1,
represents the value from the input layer to the hidden layer,
denotes the hidden layer output value,
denotes the weight from the previous moment output to the current moment input,
denotes the weight vector from the hidden layer to the output layer, and
O represents the output vector of the output layer.
Expanding Recurrent Neural Network structure by the time line yields a graph of the hidden layer unit structure, and the connection structure between the hidden layer units of Recurrent Neural Network is illustrated in
Figure 3. At moment
, the input of the hidden layer unit is
and
, the value of the hidden layer unit is
, and the output is
.
From
Figure 2, the forward calculation process of Recurrent Neural Network can be introduced as Equations (1) and (2)
where
is the
function, which is a common activation function in neural networks.
is bias,
is full connection layer.
3.2. Long Short-Term Memory Network
Because Recurrent Neural Network frequently fail owing to gradient explosion and disappearance, a new class of Recurrent Neural Networks called Long short-term memory network (LSTM) was developed to overcome this problem. Sepp Hochreiter and Jürgen Schmidhuber first proposed LSTM in 1997. LSTM has more input gates compared to recurrent neural networks, forgetting gate and output gate, which can get the corresponding weights and have different values at different moments with fixed parameters. Thus, LSTM can control the memory of the whole process better compared to recurrent neural networks, and thus successfully avoid the problems of gradient explosion and vanishing.
The cell state lies at the heart of the LSTM, with the horizontal line at the top of the graph running through the upper part of the graph. The cell states extend along with a straight line like a chain, and the linear interactions are small. Information can flow very easily. The LSTM can change information to the cell state according to the gate structure in
Figure 4. It can be found that at time t, the LSTM cell owns three inputs, which are the input sequence
, the output of the previous step
and the LSTM cell state
; there are two outputs,
,
; and three internal gates, which are the forgetting gate
, the input gate
and the output gate
. The LSTM implements protection and control of cell states through three gate cells.
The work of LSTM can be divided into four steps.
(1) By selecting that unnecessary information should be eliminated from the cell state, the forgetting gate produces the outputs shown in Equation (3).
where
is the
function, which is a common activation function in neural networks.
The output is ft which is then multiplied point by point with the cell state of . The data is between 0–1. If when the data is 0, it means that all information is discarded, and conversely when the data is 1, it means that all information is retained.
: Output of the previous moment of the cell state
: Template output from the previous moment
: Output of the Oblivion Gate
Weight matrix
Offset
: two matrices stitched together
(2) The input gate has two parts that store information in the cell state: the first is the input gate layer’s decision on which there is information to update, and the second is a vector of optional values created by the layer to decide which information to add to those in the cell state. As in Equations (4) to (5)
(3) Multiply the old state
by the point
, indicating that the information that has been decided to be forgotten is discarded, and then add
, thus forming the new cell state
, which is the new optional value, scaled in size according to the state value. Equation (6).
(4) The output gate performs the output, and the cell state determines the output. As in Equations (7) to (8)
3.3. Back Propagation Neural Networks
Rumelhart [
19] proposed the Back Propagation neural network, which solves the learning problem of multilayer neural networks. It is composed of three layers, including an input layer, a hidden layer, and an output layer, which has a wide range of applications. Firstly, the signal is transmitted from the input layer to the hidden layer through forward propagation and calculated in the hidden layer. Then the results of the hidden layer calculation are transmitted to the output layer and output. Finally, the results are compared with the expected values, and the error is corrected by back propagation. For example, Bai et al. [
20], Fuh, K. H. et al., [
21] employed the prediction model to medical, creep feed grinding, respectively. To facilitate the distinction, Back Propagation neural network model based on time series prediction is called Time Series-Back Propagation neural network; the Back Propagation neural network model based on impact factor prediction is called Impact Factor-Back Propagation neural network.
3.4. Grey Model (1,1)
Deng [
22] fist introduced the Grey Model (1,1), which generated an approximation exponential rule by summarizing the original data and the modelling it. The model has been widely used in various area. For example, Liu et al. [
23] and Wang et al. [
24] employed the prediction model to tourism and construction. To facilitate the distinction, the Grey Model (1,1) is divided into Time Series-Grey Model (1,1) and Impact Factor-Grey Model (1,1). The modeling steps of Grey Model (1,1) are:
- (1)
Build the original sequence,
can be set to the original sequence:
- (2)
Calculate a cumulative sequence, generate a new sequence, set to
- (3)
Generate the mean and calculate its background value
- (4)
Construct matrices
and
.
- (5)
Construct differential equations
Solution of differential equation:
Coefficient vector: .
3.5. Linear Regression
Francis Galton proposed the linear regression model, which uses regression equations to model the connection between one or more independent and dependent variables. The model has been widely used in various area. For example, Goldberger et al. [
25] and Massie et al. [
26] applied the predictive model to studies in biology, temperatures, construction and other industries, respectively. To facilitate the distinction, the Linear regression model based on time series prediction is called Time Series-Linear Regression; the Linear regression model based on impact factor prediction is called Impact Factor-Linear Regression. The modeling steps of Grey Model (1,1) are:
- (1)
For a given sequence of
n points,
,
…
Let the linear regression equation be:
- (2)
The total distance from the point in the direction to the straight line can use
to quantitatively describe, so it can be regarded as a binary function:
- (3)
Therefore, the problem of finding a straight line and making it closest to a point is transformed into finding two numbers
,
, so that the binary function
reaches the minimum at
,
. Through formula derivation, finally we can obtain:
4. Experimental Results and Analysis
4.1. Experimental Data Selection and Pre-Processing
Indicators for measuring logistics needs include logistics operations, cargo transportation, cargo turnover, logistics equipment, logistics employees, distribution operations and commodity inventory, etc. When selecting logistics indicators, we should not only consider the differences between indicators, but also consider the difficulty of obtaining each indicator. Therefore, we choose the freight volume as the measurement indicator. Since the volume of goods transported in reality is affected by more factors and the period of influencing factor values is long, this paper uses the annual unit for forecasting.
On the selection of impact factors, Huang et al. [
5] selected 12 as observed variables including Gross Domestic Product, per capita disposable income, and investment in logistics fixed assets; Fan and Wu [
27] chose Gross Domestic Product, the total social logistics cost, overall output value of one, two, three industries, freight volume, etc. to predict logistics needs. Du and Chen [
28] employed Gross Domestic Product, postal services, and consumer consumption level as indicators for logistics needs forecasting; Han et al. [
29] utilized Gross Domestic Product, overall social logistics cost, social fixed asset investment and import and export volume as indicators for logistics needs measurement and forecasting.
In view of the impact of the new crown pneumonia epidemic, many indicators are relatively lacking, so the impact factor is part of the indicator system constructed in the literature [
5,
27] as the attribute variables for the target prediction, in order to reflect the influence of the new crown pneumonia virus on logistics needs, and considering the integrity of the data, the more complete six indicators are selected as the impact factor. The influencing factors of logistics needs (
) are Gross Domestic Product (
), urban disposable income per capita (
), overall retail sales of consumer products (
), overall import and export commerce (
), total postal and telecoms revenue (
) and cargo turnover (
).
The above data are presented in
Table 1. First, the impact factors were normalized and calculated as follows:
where
denotes the original data;
denotes the normalized data;
and
denote the maximum and minimum values of each variable, respectively. As shown in
Table 2, the pre-processed data can be obtained by normalizing the data from the literature [
1].
4.2. Models Accuracy Evaluation
To validate the accuracy and reliability of the four models, Mean Absolute Percentage Error (MAPE), Root Mean Square Error (RMSE) and Mean Absolute Error (MAE) [
30] were employed to assess the accuracy. Three error expressions are shown in (20) to (22), respectively.
where
denotes the number of predicted outcomes;
denotes the actual value;
denotes the predicted value.
4.3. Data Predictions and Results
To demonstrate the accuracy of Long short-term memory (LSTM) model prediction, we compare the LSTM with Grey Model, Back Propagation neural network model and linear regression model. Annual freight volume was used to create Grey Model, which was based on time series and impact variables, as well as the Back Propagation neural network model and linear regression model, all were trained and tested by MATLAB R2021b.
4.3.1. Prediction of LSTM Models
The setting of the hidden layer of Time Series-Long short-term memory network was determined by the grid search method [
31] by selecting 19 hidden layer cells with 185 iterations, a step size of 1. 80% of the data set is used as a training set and the remaining 20% as a test set, and the optimization algorithm uses Adaptive Moment Estimation. As shown in
Figure 5, the actual and predicted values are compared. The Time Series-Long short-term memory network model prediction error values are illustrated in
Table 3.
The prediction model dimension of Impact Factor-Long short-term memory network model is then six dimensions input one dimension output, the Impact Factor-Long short-term memory network hidden layer unit is set to 23, the number of iterations is 250, the step size is 1, and the initial learning rate is 0.005. The prediction results are illustrated in
Table 3.
After 2019, both time series and impact factor prediction of freight volume are gradually close to actual freight volume. This shows that LSTM has excellent prediction performance.
4.3.2. Prediction of Back Propagation Neural Network Models
The Time Series-Back Propagation neural network model, based on empirical Equation (23) [
32] and repeated training tests, uses a 3-layer network structure with 1 neuron in the input layer and 4 neurons in the hidden layer. The actual and predicted values are compared as shown in
Figure 6, and the prediction error values of the Back Propagation neural network time series model are illustrated in
Table 4.
Impact Factor-Back Propagation neural network, through the empirical formula and repeated training tests, owns the optimal prediction accuracy when the number of hidden layer neurons is determined as 12.
Figure 6 shows Back Propagation neural network prediction model that is created.
It is discovered that there is a tiny discrepancy between the projected and real values of logistics needs size by plotting the actual and forecasted values. Both Time Series-Back Propagation neural network and Impact Factor-Back Propagation neural network are less accurate than LSTMN.
where
denotes the amount of hidden layer nodes,
represents the amount of input layer nodes,
represents the amount of output layer nodes, and
denotes a regulation constant between 1 and 10.
Figure 7 demonstrates a comparison of the actual and results of Grey Model (1,1), and
Table 5 reveals the prediction fault values of Grey Model (1,1).
Figure 7 can be observed that the anticipated values differ significantly from the actual ones. When compared with the neural network method, the accuracy of the traditional prediction method is lower than that of the neural network.
4.3.3. Linear Regression Model
Figure 8 provides a comparison of the linear regression model’s actual and anticipated values, while
Table 6 illustrates the model prediction error.
Figure 8 demonstrates that the time series of logistics needs is not a linear function and there is also a large error in the multiple linear regression based on the impact factor predictions.
As can be seen from
Table 7 and
Table 8, the MAE, RMSE and MAPE of the LSTM are the smallest between the four forecasting methods. Among them, the MAPE of time series and impact factor of LSTM network is only 2.2874% and 1.3200%. However, the linear regression model has the worst results, which shows that the single linear regression model is not suitable for predicting regional logistics needs. Compared with neural network prediction, traditional prediction methods have poor accuracy, which demonstrates that the LSTM has the best application and extension values among four models.
4.4. Prediction of Logistics Needs Scale
As shown in
Table 3, Time Series-Long short-term memory network is better than Impact Factor-Long short-term memory network from 2019 to 2021, so the logistics needs for Changsha City 2022–2024 is forecasted by the established LSTM time series forecasting model, with predicted results of 512.752, 516.784 and 519.694 million tons, respectively, as illustrated in
Table 9. Changsha City’s logistics needs will increase from 58.54 million tonnes in 1998 to 519.694 million tonnes in 2024, and the increase is very large. Although the epidemic is constantly repeated, driven by e-commerce, community group buying and other factors, logistics needs in Changsha city will increase, so the government and logistics companies should seize this opportunity to promote the recovery and development of the local economy. At the same time, epidemic prevention and disinfection of goods still cannot be taken lightly.
5. Conclusions
Regional logistics demand prediction is employed for enterprise taking appropriate adjustment strategies and measures according to the prediction results, effectively avoiding risks, and seeking the maximum benefits. In the context of the ups and downs of the epidemic, if there is no effective prediction model for regional logistics demand, enterprises may not be able to make effective personnel and material scheduling, financial investment and other issues. For example, in the annual shopping carnival, if businesses cannot effectively predict this year’s sales volume, it will cause a large number of products and funds to be overstocked or the number of products to be insufficient, which will damage the interests of businesses. Therefore, the aim of the research is to select an effective model in term of e-commerce and the post-epidemic era by analyzing and comparing Grey Model (1,1), Back Propagation neural network, Linear regression, and Long Short-Term Memory (LSTM) to predict the regional logistics needs of Changsha city. Meanwhile, the LSTM network has the best results among the four models. The results show that LSTM model has a smaller prediction error and more stable prediction results due to its effective use of time series.
However, there are still several drawbacks. The first is that the selection of the index system is somewhat subjective, and the regional logistics needs will also be affected by, for example, the level of logistics service. Therefore, in future research, it is worth to extend the regional logistics forecasting index system to other diversified angles. Secondly, the approaches chosen for comparison have certain drawbacks, and it is evident that forecasting methods such as mixed forecasting models and support vector machines are equally viable options. In the future, we should add different forecasting models. Then, the selected region also has certain limitations, and does not verify whether the logistics demand of all regions will increase under the influence of epidemic and e-commerce. Finally, it is that this paper has not been able to optimize the LSTM network so that it can achieve the idealized results. Thus, it is necessary to take more attention to the related work so as to further study.