Deep Adaptive Ensemble Filter for Non-Intrusive Residential Load Monitoring

Kianpoor, Nasrin; Hoff, Bjarte; Østrem, Trond

doi:10.3390/s23041992

Open AccessArticle

Deep Adaptive Ensemble Filter for Non-Intrusive Residential Load Monitoring

by

Nasrin Kianpoor

^*

,

Bjarte Hoff

and

Trond Østrem

Department of Electrical Engineering, UiT—The Arctic University of Norway, 8514 Narvik, Norway

^*

Author to whom correspondence should be addressed.

Sensors 2023, 23(4), 1992; https://doi.org/10.3390/s23041992

Submission received: 18 January 2023 / Revised: 1 February 2023 / Accepted: 7 February 2023 / Published: 10 February 2023

(This article belongs to the Special Issue Practical Nonintrusive Load Monitoring Approaches with Meaningful Performance Evaluation)

Download

Browse Figures

Versions Notes

Abstract

:

Identifying flexible loads, such as a heat pump, has an essential role in a home energy management system. In this study, an adaptive ensemble filtering framework integrated with long short-term memory (LSTM) is proposed for identifying flexible loads. The proposed framework, called AEFLSTM, takes advantage of filtering techniques and the representational power of LSTM for load disaggregation by filtering noise from the total power and learning the long-term dependencies of flexible loads. Furthermore, the proposed framework is adaptive and searches ensemble filtering techniques, including discrete wavelet transform, low-pass filter, and seasonality decomposition, to find the best filtering method for disaggregating different flexible loads (e.g., heat pumps). Experimental results are presented for estimating the electricity consumption of a heat pump, a refrigerator, and a dishwasher from the total power of a residential house in British Columbia (a publicly available use case). The results show that AEFLSTM can reduce the loss error (mean absolute error) by 57.4%, 44%, and 55.5% for estimating the power consumption of the heat pump, refrigerator, and dishwasher, respectively, compared to the stand-alone LSTM model. The proposed approach is used for another dataset containing measurements of an electric vehicle to further support the validity of the method. AEFLSTM is able to improve the result for disaggregating an electric vehicle by 22.5%.

Keywords:

load disaggregation; non-intrusive load monitoring; flexible load; signal processing; deep learning

1. Introduction

The demand for power and energy is increasing due to the growing electrification of society. This creates severe problems of power and energy shortages. The residential sector is one of the sectors that has a large share of the end use of energy after industry and transport. In 2019, energy consumption in the household sector represented 26.3% of the final consumption in the EU, which is a significant amount [1]. This amount can be reduced using demand-side management strategies [2].

In a traditional power system, loads cannot be controlled, leading to higher consumption and subsequently higher pressure on the grid. However, it is possible to control the flexible loads in the residential sector using HEMS [3]. Flexible loads are used to reduce electricity consumption in peak load periods and shift it to off-peak load periods or times when the electricity price is low. Heat pumps, water heaters, floor heating cables, and electric vehicles are some examples of flexible loads in a residential building [4]. A balance between energy consumption and production can be provided using HEMS technologies by participating in flexible loads in different DR programs, considering consumer comfort preferences [5]. Providing feedback to the consumers gives them a better insight into managing their electricity consumption through a suitable DR program [6].

The first step in controlling flexible loads is to monitor them. The process of identifying and obtaining the load signatures in a power system is load monitoring [7]. Load signature is the intrinsic consumption pattern of each individual appliance. Each electrical appliance has its own unique features when it is in operation.

Generally, there are two main methods for load monitoring, the first is intrusive load monitoring and the second is non-intrusive load monitoring. ILM is a traditional method for load monitoring. In this method, each device has its own measuring sensor to record the power consumption of the device with a specific time resolution. Even though ILM is a precise method and the signatures of individual appliances can be obtained using direct measurements, there are drawbacks to this method, because a sensor needs to be installed for each appliance in order to acquire the power consumption data of that specific device. Therefore, it is a cumbersome process, and the large number of home appliances in each house leads to high maintenance and hardware costs [8]. In order to solve this problem, Hart proposed a non-intrusive load monitoring concept for the first time in the 1980s [9]. In simple terms, the process of extracting the load signature of individual appliances from the total power using computational methods is NILM or load disaggregation. In NILM, only one set of measurement sensors is required at the power entrance to record the aggregate power data. Based on the data that are used to solve the problem, it can be categorized into supervised and unsupervised learning methods. In supervised learning, data have a label and there is information about different appliances to train the network, while unsupervised learning data are unlabelled [10]. Depending on the requirements, supervised learning can be defined as a regression or classification problem. Different load monitoring methods are categorized in Figure 1. In this paper, a supervised learning method is used for load monitoring.

1.1. Literature Review

There are two main approaches to NILM [11]: state-based and event-based methods. The event-based approaches use edge detection techniques to identify the events in the signal. The total power consumption in a home is constantly changing, and new events (ON/OFF of one or more appliances and changes in the states of the appliances) can be identified based on the edges. Load signatures of different appliances are extracted based on the intrinsic features of each device. Different classification methods including kNN [12], Decision Trees [13], HMM, SVM [14] and Naive Bayes have been employed. The similarities between appliance signatures and high measurement noise limit the performance of the event-based method. In order to improve the performance of event-based NILM, a graph signal processing algorithm was studied by Zhao et al. [15].

The state-based method represents each operation of the appliance using a state machine with distinct state transitions according to the usage pattern of appliances. When the states of appliances change or appliances turn ON/OFF, they have different edges and each of them has a different probability distribution that fits a specific appliance. The HMM and its different extensions are examples of state-based NILM approaches [16,17,18,19,20]. In the state-based method, prior values of appliances through a long period of training are required, which is a limitation of this method. High computational complexity is another drawback of the state-based method [21,22]. There is some research that has tried to solve NILM using mixed integer programming. In recent years, Wittmann et al. have [23] considered the NILM problem as a mixed integer linear programming problem and achieved high accuracy. A two-stage optimization-based method based on mixed integer nonlinear programming was presented for load disaggregation in [24].

Recently, deep learning methods have received attention in different fields of study such as image processing [25] and speech recognition [26]. Powerful computers and datasets in different fields are the main reasons for increasing the usage of deep learning methods. Due to the extensive installation of smart meters in recent years, energy consumption data in the household sector are now accessible. For this reason, different deep learning methods such as CNN [27,28,29,30,31] and RNN [1] have been widely used in the energy disaggregation problem. In [27], three different deep neural network methods were used for energy disaggregation and seven metrics were used to check the performance of these algorithms on five different appliances. A casual CNN network for load disaggregation on low-frequency data was presented in [32], in which it was concluded that using all four components (current, active power, reactive power, and apparent power) leads to higher performance. In [33], a sequence-to-point learning CNN architecture was proposed, where the sliding window was used for input data to manage long-term time series. Athanasiadis et al. [34] proposed a scalable real-time event-based load disaggregation algorithm using the CNN network. The proposed algorithm included three parts: an event detection algorithm, a CNN classifier, and a power estimation model.

The LSTM model has also been used in the application of NILM. In Ref. [35], an energy disaggregation method based on the LSTM-RNN model was proposed. A bidirectional LSTM model was used in [36] for energy disaggregation. A U-net architecture for one-dimensional data for power estimation and multi-task-multi-appliance state detection was proposed by Faustine et al. [37]. In [38], a subtask-gated network was proposed, in which the main regression network was combined with an on/off classification subtask network.

Although some research has been undertaken on load disaggregation with different deep learning methods, few of them have considered the impact of data preprocessing methods on the proposed models, and most of the proposed methods are considered as a classification problem that detects only switch on/off events.

1.2. Our Contribution

This paper proposes a new adaptive load disaggregation framework (AEFLSTM), powered by the strength of signal processing methods and LSTM. The AEFLSTM is an adaptive framework that can be used for load disaggregation in residential houses by searching among an ensemble of signal processing techniques to improve the LSTM performance for disaggregating that specific load. Signal processing methods are strong tools for filtering frequencies that might not be useful for flexible load disaggregation. The LSTM is also a strong technique to learn the temporal dependencies of flexible loads and address nonlinearities and uncertainties. To the best of the authors’ knowledge, AEFLSTM is the first attempt at flexible load disaggregation which tries to find the best signal processing technique for LSTM to improve accuracy. The contributions of this paper are summarized as follows:

In terms of methodology:
-
An adaptive disaggregation framework powered by the signal processing technique and LSTM is developed that is applicable to any load disaggregation problem;
-
The proposed AEFLSTM is a holistic algorithm that can easily be generalized to have more signal-processing techniques.
From an application point of view, the proposed framework AEFLSTM is used to disaggregate a heat pump and a refrigerator from the total power of a residential building in British Columbia. The accuracy of load disaggregation is significantly improved by using the proposed AEFLSTM framework based on actual data from the use case.

The paper is organized as follows: The data that are used in this study are explained in Section 2. The methodology, including signal processing and deep learning methods, is presented in Section 3. Section 4 shows the results of the experimental evaluation of the proposed methods. Finally, the conclusion of the paper is given in Section 5.

2. Use Case

In this paper, data from a residential building in British Columbia, Canada are used. The dataset, named AMPDs, is an open-source dataset and is available from [39]. It contains different measured factors, such as voltage, current, frequency, real power, reactive power, etc., for different loads in the house such as a the heat pump, refrigerator, dishwasher, washing machine, and other common loads. The main focus of this paper is to disaggregate the major loads from the total power to use them as flexible loads in future research. Therefore, the focus of this study is on the disaggregation of heat pumps, which are the major electricity consumer in AMPDs data. Hence, in Figure 2, the total power of the house and heat pump power consumption is plotted. Moreover, their statistical metrics, including the mean value, standard deviation, and minimum and maximum values of the total power and heat pump power consumption, are presented in Table 1. The metrics in Table 1 show that the heat pump has a large share of electricity consumption in the AMPDs data. Therefore, it will be valuable if controlled as a flexible load.

3. Methodology

In this paper, a deep adaptive ensemble filter based on various signal processing tools integrated with an LSTM is developed for flexible load disaggregation. The overall schematic of the proposed framework AEFLSTM is presented in Figure 3. As is shown, the proposed framework has two main modules, and it should be mentioned that there is a data preprocessing step before applying the main signal to the first module. The first module is an adaptive ensemble filtering block that consists of three famous signal processing methods. The output of this module is a clean total power signal in which irrelevant frequencies are filtered (e.g., noise or high frequencies), depending on the main frequencies of the selected flexible load that need to be disaggregated from the total power. Then, the filtered signal and calendar variables are applied to the second module, which is a supervised deep-based load disaggregation block, to enhance the learning ability of the LSTM for disaggregating the flexible load from the filtered signal. There is feedback from the output of the second module to the input of the first module in order to find the best signal processing method which has the best performance for load disaggregation. In other words, all the available signal processing methods in the first modules are tested by the algorithm, and the one with the best performance is selected. In the following subsections, each module as well as the data preprocessing step is explained in detail.

Consider a house with N different household appliances. If

P (t)

is the total power that is taken from the smart meter at the power entrance of the house and

P_{j} (t)

is the power consumption of j-th appliance (

1 \leq j \leq N

), then the total power is [13]:

P (t) = \sum_{j = 1}^{N} P_{j} (t) + e (t)

(1)

where

e (t)

is the noise. The main goal of the problem in Equation (1) is to estimate

P_{j} (t)

with a given

P (t)

.

3.1. Data Cleaning

The first step before training any machine learning algorithm is data cleaning, which can improve data quality and consistency. Data cleaning can include the following steps such as finding missing values, interpolating or imputing missing values, detecting outlier data, standardization, or normalization. In this work, fortunately, no missing values or outliers were found in the AMPDs dataset. Moreover, the data are standardized as follows [40]:

x_{n e w} = \frac{x - \bar{x}}{σ}

(2)

where

x_{n e w}

, x, and

\bar{x}

are the normalized value, real value, and mean of real values, respectively. In addition,

σ

is the standard deviation of the real data. In this formula, all the new values are centered around a mean value with unit variance. Generally, there is no specific rule for selecting the normalization or standardization methods and it is highly dependent on the problem [41]. In this work, standardization was selected heuristically.

3.2. Adaptive Ensemble Filtering

After performing pre-processing on the data, the normalized data is applied to the adaptive ensemble filtering block. This block consists of three signal-processing methods. Each time series can be broken into two main components: systematic and non-systematic. Systematic parts can be described and modeled due to their recurrence and consistency. Non-systematic parts of the time series that are known as noise cannot be modeled because they are random variations in the time series. Therefore, the model accuracy can be increased by removing the non-systematic components of the time series. In this paper, low pass filtering, discrete wavelet transform, and seasonal decomposition are applied to remove the non-systematic part of the time series. Here, each of them is briefly described.

3.2.1. Low-Pass Filter

A low-pass filter passes a signal with a frequency lower than a certain cut-off frequency. In the other words, the main aim of low-pass filtering is to remove the components above the cut-off frequency. A fast Fourier transform can be used to determine the value of the cut-off frequency because the FFT can find the frequencies, amplitudes, and noise components of the signal. The cut-off frequency value can be set manually as well. A low pass filtering in the frequency domain is shown in Figure 4. The filter passes the signal in the passband and attenuates the signal in the stopband. The cut-off frequency is in the transition band.

3.2.2. Discrete Wavelet Transform

The second method that is used for denoising the time series is the discrete wavelet transform [43]. In practice, DWT is used as a filter bank that can deconstruct a signal into the low pass (approximation) and high pass (detail) coefficients, which is called signal decomposition. If the signal is reconstructed again using detail and approximation coefficients, the output will be the original signal. However, if the detail coefficients that are representative of the high-frequency part of the signal are left out in the reconstructing process, then the output signal is the original signal that is filtered out. The DWT of a time series signal x[n] is as follows:

W_{x} (a, b) = \sum_{n} \frac{1}{\sqrt{a}} x [n] ψ^{*} (\frac{n - b}{a})

(3)

where the parameter a sets the scale or dilation of the wavelet; it decides how squashed or stretched a wavelet is. Increasing the value of a will stretch the wavelet and decreasing its value will squash the wavelet. The location of the wavelet is defined by parameter b. Increasing the value of b moves the wavelet to the right and decreasing its value shifts the wavelet to the left.

ψ^{*}

is the complex conjugate of the mother wavelet,

ψ

. To define DWT, the following assumptions are considered:

\{\begin{matrix} a = 2^{j} \\ b = k 2^{j} \end{matrix}, j = 0, 1, 2, \dots

(4)

where k is an integer. The wavelet function is stretched in the time domain and squashed in the frequency domain by a factor of two if the index j increases by one.

By substituting (4) into (3), the new equation for the DWT is as follows:

W_{x} (j, k) = \sum_{n} \frac{1}{\sqrt{2^{j}}} x [n] ψ^{*} (2^{- j} n - k) .

(5)

The structure of the DWT decomposition model is depicted in Figure 5.

3.2.3. Seasonal-Trend Decomposition Method

The last method that has been used for denoising the signal is the additive decomposition method. Earlier in this section, it was explained that each time series has two components: a systematic part and a non-systematic part. The systematic component includes trend and seasonality. The trend component shows the long-term change (increasing or decreasing) in the time series, and the seasonality component indicates periodic cycles in the data. Therefore, a time series is a function of trend, seasonality, and noise. The equation can be written as follows [45]:

Y_{t} = f (T_{t}, S_{t}, e_{t})

(6)

where

Y_{t}

is a time series,

T_{t}

is the trend,

S_{t}

is the seasonality, and

e_{t}

is the noise. In an additive decomposition model, it is assumed that the time series is a combination of all components. The equation of the additive decomposition model is as follows:

Y_{t} = T_{t} + S_{t} + e_{t} .

(7)

3.3. Supervised Deep-Based Load Disaggregation

The output of the first module and calendar variables are applied to the second module, which is a supervised deep-based load disaggregation block. In this module, the LSTM network, which is a deep learning method, is used for load disaggregation. An LSTM model is composed of different cells that are responsible for remembering information. Each cell includes three parts: the forget gate, the input gate, and the output gate. The information that is less important or is no longer needed for the LSTM is removed through the forget gate. It optimizes the performance of the network. The forget gate has two inputs,

x_{t}

is the input at time t and

h_{t} - 1

is the hidden state from the previous cell.

f_{t}

is the output of the forget gate. It is a vector with values 0 and 1; a 0 output for a specific value implies forgetting the information related to that value, whereas 1 implies that the information is remembered. The input gate adds new information to the cell. It creates a vector containing all possible values that can be added to the cell. The vector is filtered and scaled in the range of

- 1

to 1 by the

t a n h

function to keep only important information. The output gate chooses useful information from the current cell. It sends them out as the hidden state for the next cell and as an output for the current cell. The structure of the LSTM cell is shown in Figure 6.

4. Experimental Results

To evaluate the proposed AEFLSTM model for load disaggregation, the AMPDs dataset was used. The resolution of the data is one minute, which is low frequency; therefore, it has less computational complexity than high-frequency data. The details of the dataset are explained in Section 2. The experiments were implemented in “Google Colab” using Numpy, Pandas, Keras, and sklearn libraries.

In this study, the adaptive ensemble filtering module (Block 1, shown in Figure 3) searches between different denoising methods in the block and at the end compares the result and chooses the best one for load disaggregation. The results were evaluated based on the mean absolute error (MAE) and root mean square error (RMSE), which are commonly used to evaluate the energy disaggregation problem. The formulas of the metrics are as follows:

M A E = \frac{1}{n} \sum_{i = 1}^{n} | y_{i} - {\hat{y}}_{i} |

(8)

R M S E = \frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2} .

(9)

The household load consumption is highly dependent on time. Hence, it is essential to discover the correlation between power consumption and calendar variables including hours, weekdays, and months. At this stage, some analyses have been performed to show these dependencies. In Figure 7, the correlation heat map between total power and power consumption of the heat pump with calendar variables including hour, day of week, day of month, and month is shown. To calculate the pairwise correlation of columns, the

c o r r ()

function in the

p a n d a s

library was used, in which the Pearson method (standard correlation coefficient) was selected for the correlation function. The heat map was plotted using the

s e a b o r n

library in python. As it can be seen in Figure 7, the heat pump power consumption has the highest correlation with the total power and among the calendar variables, it has the highest correlation with hour.

Before implementing AEFLSTM for load disaggregation, a stand-alone LSTM model was compared with the state-of-the-art models, including linear regression and decision tree regression, in order to determine which of them are more suitable for the application of load disaggregation. The total power consumption, history of the appliance, and calendar variables including hour, day of week, and day of month were used to train the LSTM network. The parameters of the LSTM model, including the number of training epochs, batch size, and the number of nodes to use in the hidden layer, were chosen using a grid search. The final LSTM network configuration is shown in Table 2.

The period from 1 November 2012 to 30 November 2012 of the aggregated power consumption data of the AMPDs dataset was selected for training, validation, and testing of the LSTM model. A total of 30% of the data was considered for validation and one day of the data was used for testing the model.

The input layer of the LSTM network receives the data in a three-dimensional format with a moving time window. Here, a time step of 360 min is considered, which means data are segmented into different time windows with a duration of 360.

After network configuration, LR, DTR, and LSTM were used to disaggregate the heat pump power consumption from the aggregated total power. A numerical comparison of all the methods is presented in Table 3. As presented in Table 3, the LSTM model outperforms the other models, as the MAE error was significantly reduced from 319.35 (W) and 97.01 (W) to 90.2 (W). Although the LR and DTR models could detect the switch on/off events of the heat pump, they could not correctly identify the peak of the load. Another problem of these two models is that they cannot correctly distinguish the load cycle (heat pump time period) and show the device as on while it is off in reality based on measured data. For the sake of a better understanding, the DR, DTR, and LSTM performances are presented in Figure 8.

The details of different filtering methods used in the adaptive ensemble filtering block in Figure 3 are presented separately in the following subsections.

4.1. Case 1: LPF

In this section, the specification of the LPF, which is one of the filtering methods in the adaptive ensemble filtering block, is explained. A signal processing toolbox called “scipy.signal” was used. The value of the desired cut-off frequency of the filter is an important factor to design a proper LPF. Improper selection of this parameter may change the nature of the original signal. For this reason, the signal was first transferred to the frequency domain using a fast Fourier transform. The frequencies and amplitudes of the signal vs. the noise components can be identified based on the signal response in the frequency domain. A range of frequencies outside the principle frequency was selected and, based on trial and error, the number

0.35

was considered as the desired cut-off frequency. The LPF frequency response and the aggregate power consumption signal and its LPF smoothed version are depicted in Figure 9.

4.2. Case 2: DWT

In this section, the DWT specification used in the adaptive ensemble filtering block is explained. DWT was used to remove the noise in which the signal is deconstructed into the detail and approximation coefficients. Here “PyWavelets” (an open-source wavelet transform software for python) was applied for signal decomposition. In the simulation, one family of wavelets, called “Daubechies”, was considered. The detail coefficients were filtered out using “pywt.threshold”, which removes coefficients above a certain threshold. The aggregate power consumption signal and its DWT smoothed version are shown in Figure 10.

4.3. Case 3: SD

The last method used for filtering the signal was the seasonal decompose method. In order to implement the SD model, a times series analysis toolbox named “statsmodels.tsa” was utilized. The total power consumption signal was decomposed into different components including trend, seasonality, and residual components using the “seasonal_decompose” function in the “tatsmodels.tsa” toolbox. The main signal and its components are plotted in Figure 11. As it can be seen from Figure 11, the signal does not show seasonality; therefore, the trend is considered as the extracted feature and the residual is filtered out.

In the following, AEFLSTM is implemented using the configuration and parameters mentioned in Section 4.1, Section 4.2 and Section 4.3 and Table 2 for the first and second modules, respectively, to disaggregate the heat pump power consumption from the total power. The AEFLSTM algorithm searches among all the available signal processing methods in the first module and the one with the best performance is selected. The AEFLSTM model shows, DWT-LSTM outperforms the others while disaggregating the heat pump power consumption. The output of the AEFLSTM model, which is the estimated power consumption of the heat pump, is compared with the ground truth in Figure 12. Finally, the performances of all the filtering methods are compared based on the MAE and RMSE metrics in Table 4. As can be seen from the comparison, all the methods have improved the model, and their performances are better than a stand-alone LSTM network. Among them all, DWT-LSTM outperforms with a slight difference from SD-LSTM; therefore, it was chosen as the output of the AEFLSTM model.

The main focus of this paper is to disaggregate the sizeable and major loads to use them as flexible loads in future work. However, in this part, the proposed AEFLSTM method is implemented to estimate the signatures of the refrigerator and the dishwasher of the AMPDs dataset, to show the performance of the proposed methodology on other appliances as well. First, a stand-alone LSTM model, a DTR model, and an LR model were used to disaggregate the power consumption of the appliances from the total power consumption. Again, the LSTM model outperforms the two other methods for both appliances. For the refrigerator, LR failed to identify on/off events. DTR could detect a few events but was not able to identify the peak of the load. However, for the dishwasher, both methods (LR and DTR) failed to identify on/off events. A numerical comparison of these different methods is presented in Table 5.

In the following, AEFLSTM was implemented using the configuration mentioned in Section 4.1, Section 4.2 and Section 4.3 and Table 2 for both modules. This time, as in the previous case for the heat pump, the AEFLSTM chose the DWT-LSTM method as the best result for estimating the power consumption of the refrigerator and it chose LPF-LSTM for disaggregating the signature of the dishwasher (slightly better than DWT-LSTM) from the total power. A comparison of the estimated power consumption using AEFLSTM with ground truth data for the refrigerator and the dishwasher is shown in Figure 13 and Figure 14, respectively. The zoomed-in versions of both figures show how accurately the estimated power consumption tracks the measured data. The numerical results of both case studies are presented in Table 5.

Here, the validation phase is expanded by including another dataset to further support the validity of the proposed approach. For this reason, a dataset from a residential building in the arctic region of northern Norway is considered. The dataset contains the measurements of the electricity consumption of the main circuit and different appliances, where electric vehicles are one the major loads, consuming a large part of the total power. Therefore, the AEFLSTM approach was used to estimate the power consumption of the electric vehicle from the total power. However, before implementing the AEFLSTM, other methods, including LR, DTR, and stand-alone LSTM, were used to estimate the power consumption of electric vehicles from the total power, in which LSTM performed better. The performance metrics of the difference between real and estimated data for electric vehicles are presented in Table 5. A comparison of the proposed method with ground truth for electric vehicles is depicted in Figure 15.

5. Conclusions

This paper presented a new deep adaptive ensemble filter for non-intrusive residential load monitoring. This method is used for non-intrusive residential load monitoring. The proposed AEFLSTM framework searches among ensemble filtering methods in the first module to find the best method for load disaggregation application. Experimental results are presented for disaggregating a heat pump, refrigerator, and dishwasher from the AMPDs dataset. The performance of the stand-alone LSTM is compared with DTR and LR models to determine which is more suitable for the load disaggregation problem. The results show that the LSTM model outperforms in disaggregating the heat pump, refrigerator, and dishwasher. AEFLSTM is implemented to estimate the power consumption of the appliances. AEFLSTM selects DWT-LSTM as the more accurate method for disaggregating the heat pump and refrigerator signatures from the total power and LPF-LSTM for disaggregating the signature of the dishwasher. The results show that AEFLSTM can reduce the loss error (mean absolute error) for the heat pump, refrigerator, and dishwasher by 57.4%, 44%, and 55.5%, respectively, compared to the stand-alone LSTM model. Finally, another dataset containing data on electric vehicle power consumption is considered to further support the validity of the proposed approach. AEFLSTM is able to improve the result of estimating the electricity consumption of the electric vehicle by 22.5%.

Research on NILM has led to a detailed understanding of the energy consumption of home appliances. Appliances used for heating, cooling, ventilation, washing, and drying can be considered as flexible loads where their utilization may be reduced or changed during peak load periods, considering user comfort. Future work will investigate the possibility of deploying a home energy management system for appliance flexibility using NILM.

Author Contributions

Methodology, N.K.; Writing—original draft, N.K.; Writing—review & editing, B.H. and T.Ø.; Visualization, N.K.; Supervision, B.H. and T.Ø. All authors have read and agreed to the published version of the manuscript.

Funding

The work of Nasrin Kianpoor, Bjarte Hoff and Trond Østrem was supported in part by the Project “Transformation to a Renewable and Smart Rural Power System Community (RENEW)” under Grant 310026, and in part by the Arctic Centre for Sustainable Energy (ARC), UiT-The Arctic University of Norway.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

CNN	Convolutional neural network
DR	Demand response
DTR	Decision tree regression
DWT	Discrete wavelet transform
FFT	Fast Fourier transform
HEMS	Home energy management system
HMM	Hidden Markov model
ILM	Intrusive load monitoring
KNN	k-nearest neighbors
LR	Linear regression
LSTM	Long short-term memory
LPF	Low pass filter
MAE	Mean absolute error
NILM	Non-intrusive load monitoring
RMSE	Root mean square error
RNN	Recurrent neural network
SD	Seasonal decomposition
SVM	Support vector method

References

Çimen, H.; Çetinkaya, N.; Vasquez, J.C.; Guerrero, J.M. A microgrid energy management system based on non-intrusive load monitoring via multitask learning. IEEE Trans. Smart Grid 2020, 12, 977–987. [Google Scholar] [CrossRef]
Azizi, E.; Shotorbani, A.M.; Hamidi-Beheshti, M.T.; Mohammadi-Ivatloo, B.; Bolouki, S. Residential household non-intrusive load monitoring via smart event-based optimization. IEEE Trans. Consum. Electron. 2020, 66, 233–241. [Google Scholar] [CrossRef]
Lemes, D.A.M.; Cabral, T.W.; Fraidenraich, G.; Meloni, L.G.P.; De Lima, E.R.; Neto, F.B. Load disaggregation based on time window for HEMS application. IEEE Access 2021, 9, 70746–70757. [Google Scholar] [CrossRef]
Coffman, A.R.; Guo, Z.; Barooah, P. Characterizing capacity of flexible loads for providing grid support. IEEE Trans. Power Syst. 2020, 36, 2428–2437. [Google Scholar] [CrossRef]
Erdem, H.; Uner, A. A multi-channel remote controller for home and office appliances. IEEE Trans. Consum. Electron. 2009, 55, 2184–2189. [Google Scholar] [CrossRef]
Zhai, S.; Zhou, H.; Wang, Z.; He, G. Analysis of dynamic appliance flexibility considering user behavior via non-intrusive load monitoring and deep user modeling. CSEE J. Power Energy Syst. 2020, 6, 41–51. [Google Scholar]
Munoz, O.; Ruelas, A.; Rosales, P.; Acuña, A.; Suastegui, A.; Lara, F. Design and Development of an IoT Smart Meter with Load Control for Home Energy Management Systems. Sensors 2022, 22, 7536. [Google Scholar] [CrossRef]
Devlin, M.A.; Hayes, B.P. Non-intrusive load monitoring and classification of activities of daily living using residential smart meter data. IEEE Trans. Consum. Electron. 2019, 65, 339–348. [Google Scholar] [CrossRef]
Hart, G.W. Nonintrusive appliance load monitoring. Proc. IEEE 1992, 80, 1870–1891. [Google Scholar] [CrossRef]
Massidda, L.; Marrocu, M. A bayesian approach to unsupervised, non-intrusive load disaggregation. Sensors 2022, 22, 4481. [Google Scholar] [CrossRef]
Pereira, L.; Nunes, N. Performance evaluation in non-intrusive load monitoring: Datasets, metrics, and tools—A review. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2018, 8, e1265. [Google Scholar] [CrossRef] [Green Version]
Tabatabaei, S.M.; Dick, S.; Xu, W. Toward non-intrusive load monitoring via multi-label classification. IEEE Trans. Smart Grid 2016, 8, 26–40. [Google Scholar] [CrossRef]
Liao, J.; Elafoudi, G.; Stankovic, L.; Stankovic, V. Non-intrusive appliance load monitoring using low-resolution smart meter data. In Proceedings of the 2014 IEEE International Conference on Smart Grid Communications (SmartGridComm), Venice, Italy, 3–6 November 2014; pp. 535–540. [Google Scholar]
Altrabalsi, H.; Stankovic, V.; Liao, J.; Stankovic, L. Low-complexity energy disaggregation using appliance load modelling. Aims Energy 2016, 4, 884–905. [Google Scholar] [CrossRef]
Zhao, B.; He, K.; Stankovic, L.; Stankovic, V. Improving event-based non-intrusive load monitoring using graph signal processing. IEEE Access 2018, 6, 53944–53959. [Google Scholar] [CrossRef]
Kim, H.; Marwah, M.; Arlitt, M.; Lyon, G.; Han, J. Unsupervised disaggregation of low frequency power measurements. In Proceedings of the SIAM International Conference on data mining (SIAM), Mesa, AZ, USA, 28–30 April 2011; pp. 747–758. [Google Scholar]
Kolter, J.Z.; Jaakkola, T. Approximate inference in additive factorial hmms with application to energy disaggregation. In Proceedings of the Artificial Intelligence and Statistics (PMLR), La Palma, Spain, 21 March 2012; pp. 1472–1482. [Google Scholar]
Parson, O.; Ghosh, S.; Weal, M.; Rogers, A. Non-intrusive load monitoring using prior models of general appliance types. In Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, Toronto, ON, Canada, 22–26 July 2012. [Google Scholar]
Makonin, S.; Popowich, F.; Bajić, I.V.; Gill, B.; Bartram, L. Exploiting HMM sparsity to perform online real-time nonintrusive load monitoring. IEEE Trans. Smart Grid 2015, 7, 2575–2585. [Google Scholar] [CrossRef]
Mauch, L.; Barsim, K.S.; Yang, B. How well can HMM model load signals. In Proceedings of the 3rd International Workshop on Non-Intrusive Load Monitoring (NILM 2016), Vancouver, BC, Canada, 14–15 May 2016. number 6. [Google Scholar]
Zoha, A.; Gluhak, A.; Imran, M.A.; Rajasegarar, S. Non-intrusive load monitoring approaches for disaggregated energy sensing: A survey. Sensors 2012, 12, 16838–16866. [Google Scholar] [CrossRef]
He, K.; Stankovic, L.; Liao, J.; Stankovic, V. Non-intrusive load disaggregation using graph signal processing. IEEE Trans. Smart Grid 2016, 9, 1739–1747. [Google Scholar] [CrossRef]
Wittmann, F.M.; López, J.C.; Rider, M.J. Nonintrusive load monitoring algorithm using mixed-integer linear programming. IEEE Trans. Consum. Electron. 2018, 64, 180–187. [Google Scholar] [CrossRef]
Balletti, M.; Piccialli, V.; Sudoso, A.M. Mixed-Integer Nonlinear Programming for State-based Non-Intrusive Load Monitoring. IEEE Trans. Smart Grid 2022, 13, 3301–3314. [Google Scholar] [CrossRef]
Razzak, M.I.; Naz, S.; Zaib, A. Deep learning for medical image processing: Overview, challenges and the future. In Classification in BioApps. Lecture Notes in Computational Vision and Biomechanics; Dey, N., Ashour, A., Borra, S., Eds.; Springer: Cham, Switzerland, 2018; pp. 323–350. [Google Scholar]
Dokuz, Y.; Tufekci, Z. Mini-batch sample selection strategies for deep learning based speech recognition. Appl. Acoust. 2021, 171, 107573. [Google Scholar] [CrossRef]
Kelly, J.; Knottenbelt, W. Neural nilm: Deep neural networks applied to energy disaggregation. In Proceedings of the 2nd ACM international Conference on Embedded Systems for Energy-Efficient Built Environments, Seoul, Republic of Korea, 4–5 November 2015; pp. 55–64. [Google Scholar]
Ding, D.; Li, J.; Zhang, K.; Wang, H.; Wang, K.; Cao, T. Non-intrusive load monitoring method with inception structured CNN. Appl. Intell. 2022, 52, 6227–6244. [Google Scholar] [CrossRef]
Yang, D.; Gao, X.; Kong, L.; Pang, Y.; Zhou, B. An event-driven convolutional neural architecture for non-intrusive load monitoring of residential appliance. IEEE Trans. Consum. Electron. 2020, 66, 173–182. [Google Scholar] [CrossRef]
Zhou, X.; Feng, J.; Li, Y. Non-intrusive load decomposition based on CNN–LSTM hybrid deep learning model. Energy Rep. 2021, 7, 5762–5771. [Google Scholar] [CrossRef]
Medeiros, A.; Canha, L.; Bertineti, D.; de Azevedo, R. Event classification in non-intrusive load monitoring using convolutional neural network. In Proceedings of the 2019 IEEE PES Innovative Smart Grid Technologies Conference-Latin America (ISGT Latin America), Gramado, Brazil, 15–18 September 2019; pp. 1–6. [Google Scholar]
Harell, A.; Makonin, S.; Bajić, I.V. Wavenilm: A causal neural network for power disaggregation from the complex power signal. In Proceedings of the ICASSP IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, 12–17 May 2019; pp. 8335–8339. [Google Scholar]
Zhang, C.; Zhong, M.; Wang, Z.; Goddard, N.; Sutton, C. Sequence-to-point learning with neural networks for non-intrusive load monitoring. In Proceedings of the AAAI Conference on Artificial Intelligence, New Orleans, LA, USA, 2–7 February 2018; Volume 32. [Google Scholar]
Athanasiadis, C.; Doukas, D.; Papadopoulos, T.; Chrysopoulos, A. A scalable real-time non-intrusive load monitoring system for the estimation of household appliance power consumption. Energies 2021, 14, 767. [Google Scholar] [CrossRef]
Kim, J.; Le, T.T.H.; Kim, H. Nonintrusive load monitoring based on advanced deep learning and novel signature. Comput. Intell. Neurosci. 2017, 2017, 4216281. [Google Scholar] [CrossRef] [PubMed]
Kaselimi, M.; Doulamis, N.; Voulodimos, A.; Protopapadakis, E.; Doulamis, A. Context aware energy disaggregation using adaptive bidirectional LSTM models. IEEE Trans. Smart Grid 2020, 11, 3054–3067. [Google Scholar] [CrossRef]
Faustine, A.; Pereira, L.; Bousbiat, H.; Kulkarni, S. UNet-NILM: A deep neural network for multi-tasks appliances state detection and power estimation in NILM. In Proceedings of the 5th International Workshop on Non-Intrusive Load Monitoring, Virtual Event, Japan, 18 November 2020; pp. 84–88. [Google Scholar]
Shin, C.; Joo, S.; Yim, J.; Lee, H.; Moon, T.; Rhee, W. Subtask gated networks for non-intrusive load monitoring. In Proceedings of the AAAI Conference on Artificial Intelligence, Atlanta, GA, USA, 8–12 October 2019; Volume 33, pp. 1150–1157. [Google Scholar]
Makonin, S.; Ellert, B.; Bajić, I.V.; Popowich, F. Electricity, water, and natural gas consumption of a residential house in Canada from 2012 to 2014. Sci. Data 2016, 3, 160037. [Google Scholar] [CrossRef]
Ali, P.J.M.; Faraj, R.H.; Koya, E.; Ali, P.J.M.; Faraj, R.H. Data normalization and standardization: A technical report. Mach Learn. Tech. Rep. 2014, 1, 1–6. [Google Scholar]
Normalization vs. Standardization Standardization, Which One Is Better. Available online: https://towardsdatascience.com/normalization-vs-standardization-which-one-is-better-f29e043a57eb (accessed on 22 April 2020).
How to filter noise with a low pass filter — Python. Available online: https://medium.com/analytics-vidhya/how-to-filter-noise-with-a-low-pass-filter-python-885223e5e9b7 (accessed on 27 December 2019).
Gao, R.X.; Yan, R. Wavelets: Theory and Applications for Manufacturing; Springer: Berlin/Heidelberg, Germany, 2010. [Google Scholar]
Singh, P.; Pradhan, G.; Shahnawazuddin, S. Denoising of ECG signal by non-local estimation of approximation coefficients in DWT. Biocybernetics and Biomedical Engineering 2017, 3, 599–610. [Google Scholar] [CrossRef]
Damrongkulkamjorn, P.; Churueang, P. Monthly energy forecasting using decomposition method with application of seasonal ARIMA. In Proceedings of the 2005 International Power Engineering Conference, Liege, Belgium, 22–26 August 2005; pp. 1–229. [Google Scholar]

Figure 1. Different load monitoring methods.

Figure 2. Ten days data of (a) total power, (b) heat pump from 1 November 2012 to 11 November 2012.

Figure 3. The proposed framework for load disaggregation.

Figure 4. A low pass filter in the frequency domain [42].

Figure 5. The structure of DWT decomposition model [44].

Figure 6. The structure of the LSTM cell.

Figure 7. The correlation between total power and heat pump power consumption with calendar variables.

Figure 8. Comparison of the estimated power consumption of the heat pump with ground truth using (a) LR, (b) DTR, and (c) LSTM.

Figure 9. LPF smoothing.

Figure 10. Ten-day-data of aggregate power consumption signal and filtered data using DWT smoothing.

Figure 11. The main signal and its decomposed components.

Figure 12. A comparison of the estimated power consumption of the heat pump with ground truth using AEFLSTM.

Figure 13. A comparison of the estimated power consumption of the refrigerator with ground truth using AEFLSTM.

Figure 14. A comparison of the estimated power consumption of the dishwasher with ground truth using AEFLSTM.

Figure 15. A comparison of the estimated power consumption of the elctric vehicle with ground truth using AEFLSTM.

Table 1. The statistic metrics of total power and heat pump.

	Mean (W)	STD (W)	Min. (W)	Max. (W)
Total power	1396.7	1132.4	269.0	10,542.0
Heat pump	407.2	737.7	0.0	3030.0

Table 2. The LSTM network configuration.

LSTM Parameters
	First layer		Second layer
Nodes			100	50
Dropout rate			0.2	0.2
Return sequence			True	False
Activation function			relu	relu
Optimizer	loss	epotchs	batch_size	validation_split
Adam	MSE	15	30	0.3

Table 3. HPE with different methods.

HPE	MAE (W)	RMSE (W)
LR	319.35	500.9
DTR	97.01	268.6
LSTM	90.2	208.68

Table 4. HPE with different methods.

HPE	MAE (W)	RMSE (W)
LSTM	90.2	208.68
DWT-LSTM	38.4	148.75
LPF-LSTM	51.5	157.1
SD-LSTM	39.7	153.2

Table 5. Performance metrics (MAE and RMSE) of the difference between real and estimated data for different appliances.

	FGE		DWE		EV
	MAE (W)	RMSE (W)	MAE (W)	RMSE (W)	MAE (W)	RMSE (W)
LR	59.48	75.84	29.32	106.66	715.76	975.67
DTR	50.94	71.39	48.64	151.65	363.36	782.62
LSTM	25.74	58.82	10.99	36.59	300.04	735.07
AEFLSTM	14.4	50.6	4.89	28.33	232.37	668.01

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Kianpoor, N.; Hoff, B.; Østrem, T. Deep Adaptive Ensemble Filter for Non-Intrusive Residential Load Monitoring. Sensors 2023, 23, 1992. https://doi.org/10.3390/s23041992

AMA Style

Kianpoor N, Hoff B, Østrem T. Deep Adaptive Ensemble Filter for Non-Intrusive Residential Load Monitoring. Sensors. 2023; 23(4):1992. https://doi.org/10.3390/s23041992

Chicago/Turabian Style

Kianpoor, Nasrin, Bjarte Hoff, and Trond Østrem. 2023. "Deep Adaptive Ensemble Filter for Non-Intrusive Residential Load Monitoring" Sensors 23, no. 4: 1992. https://doi.org/10.3390/s23041992

APA Style

Kianpoor, N., Hoff, B., & Østrem, T. (2023). Deep Adaptive Ensemble Filter for Non-Intrusive Residential Load Monitoring. Sensors, 23(4), 1992. https://doi.org/10.3390/s23041992

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Deep Adaptive Ensemble Filter for Non-Intrusive Residential Load Monitoring

Abstract

1. Introduction

1.1. Literature Review

1.2. Our Contribution

2. Use Case

3. Methodology

3.1. Data Cleaning

3.2. Adaptive Ensemble Filtering

3.2.1. Low-Pass Filter

3.2.2. Discrete Wavelet Transform

3.2.3. Seasonal-Trend Decomposition Method

3.3. Supervised Deep-Based Load Disaggregation

4. Experimental Results

4.1. Case 1: LPF

4.2. Case 2: DWT

4.3. Case 3: SD

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI