1. Introduction
In the context of the 2030 carbon-peaking action plan proposed by the Chinese government in 2021 [
1], in order to rationally regulate oil and gas consumption, gradually reduce the dependence of transportation on fossil energy, and guide the public to choose a low-carbon/carbon green travel mode, the energy industry has high expectations, and the low-carbon transformation of transportation needs to be accelerated [
2]. The power system is optimized to make the power system safer and more efficient. The design of the power system to the smart power system will increase its transparency. This transparency refers to the small particle size running data that contains various equipment status—including the type of electrical appliances, running time, and the operating habits of users—which has great value. By understanding the operating information of a single electrical appliance, that is, the small particle size operation data of the electrical appliances mentioned above, these data can intuitively reflect the operation of the electrical appliances to energy users. This can guide users’ energy-saving behavior. As of October 2020, China has 666,500 public charging piles and 43,300 charging stations, and has built a charging facility system with the largest number of charging facilities, the largest radiation area, and the most complete service vehicles in the world [
3]. However, when a large number of electric vehicles enter the grid for charging, the stability of the grid will be greatly challenged. At the same time, with the rapid development of battery technology and vehicle-to-grid (V2G) control technology, the storage capacity of the battery has further increased, and the battery control foundation of electric vehicles has become more mature. Assisting the operation of new energy power systems has become a hot research topic [
4]. At the same time, the electric vehicle charging state needs to be obtained before the implementation of the V2G algorithm, so how to perceive the electric vehicle load becomes the key to the operation of the cluster electric vehicle auxiliary power system.
There are two different ways to implement load monitoring, namely Intrusive Load Monitoring (ILM) and Non-Intrusive Load Monitoring (NILM). Intrusive load monitoring monitors each device individually by installing sensors on each device to capture the power consumption information of each device. This monitoring solution focuses mainly on the development of hardware and how to extract the above information quickly, efficiently, and accurately while reducing the cost of hardware equipment. Non-invasive load monitoring focuses mainly on software algorithms, especially how to use the power data of the electric vehicle charging station terminal to analyze the charging behavior of each charging pile. Although the ILM scheme based on sensors has high precision advantages, it also has disadvantages such as high prices and huge waste. Therefore, despite the limited accuracy of NILM, due to its low price, it is very popular among energy users. As a result, it has come to the forefront of research in the field of academic and industrial.
NILM was proposed by Professor Hart G. W of MIT in the 1990s [
5]. Due to the high computational complexity and low accuracy of the original NILM, it did not attract much attention at the time. In recent years, due to the rapid development of computer science and artificial intelligence, especially in the extensive application of in-depth learning in model recognition, NILM has now attracted the attention of scholars and has become a hot spot in smart grid research. At present, NILM is classified into two categories: low frequency and high frequency, according to different sampling rates. Due to the inherently simple structure of the data, the analysis results of low-frequency data may lead to performance degradation. Since high-frequency data usually contains more information, such as current harmonics, voltage–current traces, and high-frequency transient waveforms [
6], the results of high-frequency data analysis are usually more accurate. However, compared with low-frequency data analysis, high-frequency data’s high requirements for monitoring equipment show a cost disadvantage. From a practical application point of view and considering the industrial aspect, solutions based on low-frequency data are becoming increasingly more attractive due to stringent cost requirements and acceptance of loss of accuracy, thus becoming the focus of our work.
Various NILM solutions have been proposed. First, some mathematical models are explored [
7,
8,
9]. The fuzzy model obtains the objective function of membership of each sample signal to all home appliance centers through optimization, so as to determine the appliance category of the sample signal and to achieve the purpose of automatically identifying the sample data [
8]. Graph signal processing first simply builds an undirected graph based on the signal, then groups the on/off events of the load, and finally defines an optimization problem to find the signal with the smallest change [
7]. The linear programming model will be an optimization of the NILM problem to form a multi-feature objective function, so as to realize the load decomposition and identification of different characteristics [
9]. Although the current research based on mathematical models has achieved some results, it still has some limitations. The most prominent problem is that most of these explorations are based on optimized NILM schemes, resulting in low scalability of the formula and high algorithm dependence; that is, when the complexity of the problem increases or the model has structural changes, such methods will have large deviation. Therefore, the decomposition performance of these schemes is highly dependent on the scene, and the model needs to be adjusted for the actual application scene at the same time, which reduces the operability caused by this requirement. The rise of deep learning algorithms offers some effective ways to overcome these obstacles. Hidden Markov Models (HMMs) are a typical class of these studies, in which a bi-stochastic process combining electrical states and an explicit stochastic function is used to build the sequence of operations [
10]. In addition, neural networks (NNs) can also be combined with HMMs, where the firing probability of the HMM is modeled by a Gaussian distribution representing the state of a single device and a DNN representing the state of the aggregated signal [
11]. Through the above studies, the great potential of deep learning methods in NILM problems is demonstrated. The optimized deep learning algorithm has high accuracy for non-intrusive load monitoring and high adaptability to changing models, so it has better prospects than simply using mathematical model solutions for non-intrusive load monitoring.
At present, deep learning algorithms in non-intrusive home appliance monitoring learning algorithms can be roughly divided into three categories, namely supervised algorithms [
12], unsupervised algorithms [
13], and semi-supervised algorithms [
14]. The supervised algorithm learns to build a model from the training data, infers new instances according to the model, and realizes load perception and prediction. It has the advantages of simplicity, convenience, speed, accuracy, and small storage space [
12]. The supervision algorithm has simple requirements for data structure. After the neural network training is completed, the model can be used directly for analysis. At the same time, the model decomposition speed is fast and the required storage space is not large. However, when the spatial characteristics of the data are large, the logistic regression performance of the algorithm is poor, and the neural network also has shortcomings, such as underfitting or overfitting and poor self-learning ability. When the structure of the system changes and adjusts, the neural network fails. It is more likely that the model must be established and trained again. Unsupervised algorithm refers to a data processing method that classifies samples by analyzing many samples of the research object without category information. It has strong self-learning ability, and new data can be directly learned without retraining and added to the dataset; however, also has the disadvantage of low accuracy of the analysis results [
15]. Research on semi-supervised regression problems is relatively limited.
Among all the deep learning methods for supervised learning that have been studied, neural networks have attracted much attention due to their excellent performance. In addition to the literature [
16] on the use of neural networks to improve NILM performance, there are many research publications discussing the advantages of neural networks in enhancing NILM. Reference [
17] is from the early literature on using NN to solve NILM, and it was the first to propose the research of neural network NILM and inspired related research. Reference [
16] proposed a non-intrusive harmonic source identification method based on neural network. In order to be suitable for specific problems, some NN methods are modified for different NILM application problems. Reference [
18] proposed an additional optimization to be embedded into the NILM formulation to form a denoising autoencoder method. Reference [
19] proposed a multi-label learning method based on convolutional neural network (CNN). Reference [
20] proposed a general electrical appliance recognition model based on convolutional neural network. Reference [
21] studied the problem of identifying electrical loads connected to a house with a CNN-based NILM method and proposed a system capable of extracting the energy demand of each individual device. At the same time, recurrent neural network (RNN) has also received extensive attention from scholars in NILM research. Reference [
22] used the RNN model to extract the appliance features of the steady-state part as model input for recognition after an event is detected. To overcome the difficulty of RNNs in learning long-term dependencies, Reference [
23] proposed a 1DCNN-based method and solved the NILM problem by considering the current signal and using a long short-term memory (LSTM) neural network. Reference [
24] highlighted neural NILM by building a multilayer perceptron with multiple hidden layers. Reference [
25] used the simplest neural network, the Back Propagation Neural Network (BPNN), using the abrupt values of active power and the corresponding odd harmonics to achieve load identification. It can be seen from a large number of current literature publications and research that neural networks will have a high research interest for a long time in the future.
Unsupervised learning is roughly divided into eight categories, including hierarchical clustering such as
K-means clustering and other clustering algorithms, Principal Component Analysis (PCA), Probabilistic Latent Semantic Analysis (PLSA), Markov Chain Monte Carlo (MCMC), Latent Semantic Analysis (LSA), Singular Value Decomposition (SVD), Latent Dirichlet Analysis (LDA), and PageRank Algorithms. Reference [
26] used a hierarchical approach to cluster data objects in the form of trees called hierarchies. The results of Reference [
27] showed that the GMM method outperformed other methods to a certain extent. Reference [
28] described a method for computing the partial singular value decomposition of a matrix, which is suitable for problems where the matrix is known to have low rank and only the dominant singular vectors are of interest. Reference [
29] proposed a rotation-invariant principal component analysis method based on the maximum correlation entropy criterion (MCC). A semi-quadratic optimization algorithm is used to calculate the relevant entropy objective. Reference [
30] described that probabilistic latent semantic analysis (PLSA) can effectively capture semantic and statistical data for modeling. The
K-means clustering method has the advantage of simple model among many methods, so it has been widely used. Reference [
31] utilized the strong adaptability of unsupervised learning to optimize a non-intrusive load-sensing model of household electrical load using neural networks. Reference [
32] subdivided data user power consumption patterns. On the basis of clustering analysis of neighbor propagation for loads according to working and rest days, the rationality judgment of equipment working state is added, and the load and its working state are decomposed from the total power combined with genetic optimization.
Aimed at the characteristics of similar load characteristics and a large number of random loads in electric vehicle charging stations, a new non-intrusive load monitoring method for electric vehicle charging stations is proposed. In the case of using the neural network to establish the load perception model of the electric vehicle charging station, the K-means algorithm is used to optimize the neural network to realize the perception of random loads. Finally, the controllability evaluation of electric vehicle charging station is realized. In other words, the proposed NILM method can adapt to a random electric vehicle charging station and can even solve random electric vehicle loads without sufficient information. This contribution fills the research gap in related fields. The detailed technical contribution can be summarized as:
A practical and adaptive NILM model is established based on BP neural network.
Non-supervised learning optimization is based on the proposed NILM model to improve the scalability and robustness of the method.
The proposed NILM solution will combine the neural network model based on unsupervised learning optimization and supervision, which can determine the load of random electric vehicle while filling an important research gap.
3. Case Analysis
3.1. Electric Vehicle Charging Station Power Feature Dataset
In real life, the active power of the DC charging pile of the electric vehicle charging station is relatively fixed. There are different powers such as 60 KW, 150 KW, and 300 KW, and when the electric vehicle is not fully charged, the shape of the power–time curve of the electric vehicle charger during charging is roughly a rectangular wave. There will be a stepped descent when the electric car is fully charged, and each brand of electric vehicle charging reactive power has its own characteristics and differences in power factor. In the analysis after this article, the charging power of most of the small car charging station charging piles on the market is kept mainly within 60–90 KW, so we chose 60 KW as the analysis object of the charging power of the charging station in this article. In addition, four charging vehicles with different reactive power characteristics are selected for follow-up research.
To discuss the effectiveness of the program, we first present a case study based on a public dataset generated from electric vehicle charging characteristics [
34]. In order to avoid the excessive noise interference of the real data of electric vehicle charging stations and the safety protection of the information of domestic electric vehicle charging stations at present, we will generate charging data for the EV charging stations used in our experiments based on real EV charging characteristics. It is currently known that the frequency of the low-frequency dataset of the target electric vehicle charging station is 1 Hz.
From the above conditions, we have generated four electric vehicle charging characteristics, code-named EV1, EV2, EV3, and EV_new; there is a slight difference in the active power and reactive power among them, so as to simulate the current electric vehicles of various brands. There are differences in charging characteristics, but it is set so that the three electric vehicles have charging power fluctuations of no more than 10 W to simulate the influence of the power noise of the electric vehicle charging station on our modeling. At the same time, the charging habits of the three electric vehicle users are very different, and the random charging load existing in the electric vehicle charging station is set at the same time. The data for ten electric vehicle charging stations are set by the above rules, and the power–time curve of the electric vehicle charging station composed of them is shown in
Figure 4.
In addition, in order to simulate the randomness of electric vehicles in the electric vehicle charging station, we will add new car data; the impact on the electric vehicle charging station after adding is shown in
Figure 5. The random load added to the trained neural network is a strange load and can be used to detect the effect of the optimized algorithm in this article.
In order to comprehensively evaluate the properties of the research such as accuracy, this paper selects the four most important indicators in the non-intrusive load monitoring and evaluation system: recall rate,
F1 score, accuracy rate, and mean absolute error as evaluation indicators. The calculation formula of each index is as follows [
31]:
TP indicates that the actual work and state of the load are consistent with the work state of the NILM analysis results and both are data sequences that are working.
FP and
FN represent the data sequence in which the actual work and state of the load are inconsistent with the work state of the NILM analysis result; however, in
FP, the load is actually working, but the NILM analysis result is not working. In
FN, the load is not actually working, but the NILM analysis results are working [
31].
is the actual power of the EV charging station at time t,
is the decomposition power at time t, and
MAE is the mean absolute error from
T0 to
T1 [
31].
PRE,
REC, and
F1 scores are the basic indicators of NILM and can be used to reflect the correctness of the analysis results judged by the NILM model.
MAE reflects the accuracy of results across time periods. The lower the resulting value of
MAE, the higher the precision of its decomposed value.
3.2. Load Perception of Electric Vehicle Charging Station Based on Deep Learning
First, after obtaining the experimental data, the depth of the neural network can be extended without multi-layering the neural network. Instead, the normalized data is directly used as the neural network training data. The resulting neural network training results are poor, as shown in
Figure 6.
To sum up, the direct training of the neural network on the electric vehicle charging station data without multi-layer processing will lead to insufficient network depth. The neural network cannot get enough neuron connections and parameters. As a result, the neural network cannot perceive the operating characteristics of the electric vehicle well and cannot effectively decompose it. Therefore, it is necessary to perform multi-layer processing on the input data and expand the original two-column array into a data matrix so that the user’s electricity consumption habits are repeatedly connected to each other and increase the depth of the neural network.
After many trials, when the number of hidden layers is set to 16, the maximum number of failures is set to 10, and the learning rate is set to 0.01, the two requirements of speed and accuracy can be met.
Figure 7 shows the training results of matlab’s BP neural network.
The decomposition result of EV1 is basically in line with the real operating data after filtering, indicating that the neural network can effectively detect the user’s electricity consumption habits and the charging characteristics of the vehicle. The decomposition result is shown in
Figure 8.
As can be seen from the above figure, although there are still some fluctuations in the decomposition of the power data of the electric vehicle charging station by the deep neural network, its waveform can already fit the power curve in the real situation. The data of the first four days is used for training, and the data of the last four days is used for testing, which proves that the deepening of the neural network in this scheme can better realize the load perception of the electric vehicle charging station. At the same time, we use the decomposed data to calculate the precision rate, recall rate, and
F1 score of each load data by using the various indicators in the evaluation system of the non-intrusive load monitoring algorithm mentioned above. The analysis results and calculation indicators are shown in the
Table 1.
3.3. Electric Vehicle Charging Station Load Perception Based on Optimized Deep Learning for Newly Added Electric Vehicle Loads
The vehicles in the electric vehicle charging station are mainly divided into two categories: habitual long-term users and random short-term users. The main focus of our monitoring is habitual long-term users. For the monitoring of short-term users, we can sacrifice a little accuracy. However, if an undetected electric vehicle is added to the currently established electric vehicle charging station load model, the current model will not be able to perceive this vehicle. If the car is a random load, it will not have very much influence on the evaluation of the load perception and controllability of the electric vehicle charging station. However, if it is a habitual long-term user, our algorithm will have a big loophole. How can the parameters of the current deep learning neural network be updated for a long time to keep the model with high accuracy? Therefore, it is proposed to continually update the training set of the deep learning neural network, so that the neural network can always update the parameters in it to complete the perception of the newly added electric vehicle load.
Thus, we added a new EV to the bus based on the 10 EVs we generated earlier. The modified electric vehicle data is shown in
Figure 9.
After increasing the load of EV11, in addition to the inability to perceive the current model of EV11, there will be more or less deviations in the perception of other vehicles. For example, in the perception of EV6, due to the influence of EV11, more fluctuations on the decomposed power curve may affect subsequent identification difficulties, as shown in
Figure 10.
When EV11 data is added to the current EV charging station model, it will have a greater impact on the analytical capabilities of deep neural networks. Therefore, the training set is constantly updated to update the parameters in the deep neural network. First, we input the new load EV charging station bus data into the algorithm to get the initial
K value in the
K-means clustering algorithm. Because the active power of each charging pile in the electric vehicle charging station is the same, we can get the size of
K by comparing the active and reactive power. Then, we perform
K-means clustering on EV charging stations without EV11 added.
Figure 11 shows the clustering results.
After the above cluster centers are obtained,
K calculation and
K-means clustering are also performed on the data of the electric vehicle charging station with the newly added load. The clustering results are shown in
Figure 12.
The cluster center characteristics of newly added and unadded loads can be seen in the
Table 2.
It can be seen from the above table that a new cluster center appears in the current electric vehicle charging station bus data after the new load is added. The appearance of this new cluster center is considered to be a new feature generated by the overlap of the newly added EV11 load with other loads during this time. Therefore, we use time comparison and power data difference to generate EV11 operating data.
Figure 13 is a comparison diagram of EV11 data supplemented by this scheme and its real data.
After using the clustering and decomposition algorithm to obtain the EV11 running curve, we then use it together with the original running data as training data, update the deep neural network training set, and retrain the neural network. The number of hidden layers is set to 16, the maximum number of failures is 10, and the initial learning rate is set to 0.01. The training results are shown in
Figure 14.
Then, we use the updated data neural network to analyze the newly added load EV11 and decompose the running power–time curve of EV11.
Figure 15 is a comparison graph with the real power–time curve.
Although for the exploration of new loads, the accuracy of the neural network’s analysis after updating the parameters is not very high, it still distinguishes the running time of each EV11. This scheme is still very effective and can handle the randomness of users of electric vehicle charging stations. The
Table 3 shows the indicators obtained by running the non-intrusive load monitoring algorithm for EV charging stations after the addition of EV11.
The results of electric vehicles after adding EV11 can be obtained from the deep neural network analysis. The advantages of the non-intrusive load monitoring algorithm proposed in this paper over the traditional neural network algorithm to analyze the non-intrusive load monitoring problem can be seen in the
Figure 16, which shows the comparison of the parameters finally calculated by the three methods. It can be seen that the deep neural network is the most accurate for the perception of regular load. Although the deep neural network optimized by unsupervised learning is generally accurate for the perception of random loads, it solves the non-intrusive load monitoring algorithm’s ability to perceive random loads, and the error is also within an acceptable range.
3.4. Evaluation of Adjustable Capability of Electric Vehicle Charging Station
To assess the controllability of an EV charging station, one needs to focus on the controllability of each EV. The concept of electric vehicle controllability is proposed. Although electric vehicles are connected to the power grid when charging, and V2G interaction can be carried out at any time when the power grid is insufficient, it is necessary to consider whether the current state of electric vehicles allows V2G interaction. If the remaining power of the electric vehicle cannot meet the travel power demand of the owner, it will not be able to interact with the Internet of Vehicles. Therefore, this paper divides the vehicle into charging, idling, and discharging states according to the charging state of the vehicle entering the grid. After the previous analysis, the charging state must be maintained until the car can enter the idle state before it can be discharged, and each car owner has different requirements for entering the idle state. In addition, when entering the grid for charging, there is also a difference in the remaining power in the car. Therefore, it is necessary to analyze the average daily travel distance of users and determine the time when the electric vehicle can enter the idle state after being connected to the grid by judging the remaining power of the electric vehicle and the travel demand of the vehicle owner.
The average daily travel distance of users obeys the log-normal distribution [
34], and its probability density function is:
where
represents the average daily driving distance, and
and
represent the expectation and variance of the log-normal distribution, respectively. In this paper,
= 3 and
= 1.1. From this, the probability density function curve of the average daily travel distance of the user can be obtained.
After obtaining the probability density of the user’s average daily travel distance, the predicted remaining power of the current electric vehicle charging station system when the vehicle enters the power grid is set according to the probability, and we calculate the charging time required for the electric vehicle to enter the idle controllable state:
In the above formula, refers to the power consumption required to travel one kilometer, and the product of the probability density product of the average daily travel distance of electric vehicle users divided by the time is the minimum daily charging amount of this type of electric vehicle. Only electric vehicles that reach this charge amount can be regarded as a controllable idle state and can be included in the evaluation of the controllability of electric vehicle charging stations.
Using neural network to build a model can aid in analyzing the charging habits of electric vehicle charging station users and realize non-intrusive load monitoring of electric vehicle charging stations. It can break down the time each user plugs in the electric car and their charging time. The adjustable time of each electric vehicle is calculated by means of the adjustable ability model of each electric vehicle so as to realize the evaluation of the real-time power-adjustable ability of the electric vehicle charging station.
According to the average daily mileage of electric vehicle users, the average daily mileage of the car in the electric vehicle charging station was simulated according to the probability model. As a result, the charging time required for each electric vehicle is calculated, and then the power controllability is evaluated. That is, when the power grid requires, it can converge at the regulatory capacity of electric vehicles without affecting the normal use of electric vehicles. At different times per day, the number of vehicles that can be regulated at electric vehicle charging stations may different. The prediction of the control capacity of the electric vehicle charging station is shown in
Figure 17.