Next Article in Journal
The Case for Studying Non-Market Food Systems
Next Article in Special Issue
Multiobjective Genetic Algorithm-Based Optimization of PID Controller Parameters for Fuel Cell Voltage and Fuel Utilization
Previous Article in Journal
Credit Constraints on Farm Household Welfare in Rural China: Evidence from Fujian Province
Previous Article in Special Issue
An Analysis of Disparities and Driving Factors of Carbon Emissions in the Yangtze River Economic Belt
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Statistical and Electrical Features Evaluation for Electrical Appliances Energy Disaggregation

by
Pascal A. Schirmer
*,†,‡ and
Iosif Mporas
†,‡
School of Engineering and Computer Science, University of Hertfordshire, Hatfield AL10 9AB, UK
*
Author to whom correspondence should be addressed.
Current address: University of Hertfordshire, College Lane, Hatfield AL10 9AB, UK.
These authors contributed equally to this work.
Sustainability 2019, 11(11), 3222; https://doi.org/10.3390/su11113222
Submission received: 10 May 2019 / Revised: 30 May 2019 / Accepted: 5 June 2019 / Published: 11 June 2019

Abstract

:
In this paper we evaluate several well-known and widely used machine learning algorithms for regression in the energy disaggregation task. Specifically, the Non-Intrusive Load Monitoring approach was considered and the K-Nearest-Neighbours, Support Vector Machines, Deep Neural Networks and Random Forest algorithms were evaluated across five datasets using seven different sets of statistical and electrical features. The experimental results demonstrated the importance of selecting both appropriate features and regression algorithms. Analysis on device level showed that linear devices can be disaggregated using statistical features, while for non-linear devices the use of electrical features significantly improves the disaggregation accuracy, as non-linear appliances have non-sinusoidal current draw and thus cannot be well parametrized only by their active power consumption. The best performance in terms of energy disaggregation accuracy was achieved by the Random Forest regression algorithm.

1. Introduction

With the development of technology and the increasing usage of electrical appliances and automated services, the electric energy needs have been growing steadily for the last century with an annual growth of approximately 3.4% per year in the last decade [1]. Nowadays residential and commercial buildings account already for roughly 36% of the total electrical demand in the USA and 25% in the EU while they are responsible for roughly 43% of carbon dioxide ( CO 2 ) emissions [2,3,4]. To assure balance between renewable energies, CO 2 emissions, political stability and economic growth it is essential to focus on a sustainable development [5]. To achieve sustainable economic growth energy consumption in industrial and residential areas must be minimized under the consideration of rising volatility of nowadays energy production with increasing amounts of renewable energies [6]. Under the consideration of sustainable development several studies investigated real time pricing with additional storage systems [7,8] or large scale energy buffering [6] to reduce electrical energy consumption and peak loads. Other studies indicate that detailed analysis and real-time feedback of energy consumption in residential areas can lead to up to 20% savings in energy consumption through detection of faulty devices and poor operational strategies and thus would improve the sustainability of nowadays consumer households [9,10]. Therefore in the last few decades extensive research in smart grids, smart systems and demand management was carried out and different optimization techniques have been developed to reduce residential energy consumption [11,12,13]. To make use of those techniques, accurate and fine-grained monitoring of electrical energy consumption is needed [14]. However nowadays the energy consumption of most households is monitored via monthly aggregated measurements and therefore cannot provide real-time feedback information.
To measure the energy consumption of a household or building with high resolution in the order of seconds or below smart meters are utilized. According to [15] the largest improvements in terms of energy savings can be made when monitoring energy consumption on device level. Therefore the analysis of energy on device level is performed through energy disaggregation, i.e., the extraction of energy consumption on appliance level based on one or multiple measures from smart meters. When using only one sensor (smart meter) per household or building, therefore measuring only the aggregated consumption, the task is referred to as Non-Intrusive Load Monitoring ( N I L M ) [16], in contrast to Intrusive Load Monitoring ( I L M ) where multiple sensors are used, usually one per device. The goal of N I L M is to find the inverse of the aggregation function through a disaggregation algorithm using as input only the aggregated power consumption, which makes N I L M a highly under-determined problem and thus impossible to solve analytically [17].
In order to solve the N I L M problem different approaches have been proposed in literature, which can be split into methods with or without Source Separation ( S S ). Approaches with S S consider the task of energy disaggregation as a single channel source separation problem and extract the corresponding signal of each device from the aggregated samples using a set of conditions and constraints (e.g., sparseness or sum-to-one) [18,19]. Approaches without S S are based on the decomposition of the aggregated signal to a sequence of feature vectors. These feature vectors are then classified to device labels using a machine learning algorithm [8,20,21] or by predefined set of rules or thresholds [22,23]. As machine learning classification/regression models a wide variety of algorithms have been used such as Artificial Neural Networks ( A N N s ) [8], Decision Trees ( D T s ) [24], Hidden Markov Models ( H M M s ) [24,25,26,27,28,29], K-Nearest-Neighbours ( K N N s ) [30], Random Forests ( R F s ) [20], Support Vector Machines ( S V M s ) [24] and ensemble classifiers [31].
Another classification of N I L M methods is based on the sampling frequency f s of the smart meter and thus the features that can be extracted from the measured data [32]. In detail, depending on the sampling frequency either macroscopic (e.g., active/reactive power [23,33,34]) or microscopic (e.g., transient energy, harmonics, wavelets [22,35]) features are extracted to disaggregate energy consumption on appliance level for steady state and transient behaviour, respectively. Macroscopic features are extracted in low sampling frequencies in the order of 1 60 Hz to 1 Hz while microscopic features are extracted in high sampling frequencies from 50 Hz up to 30 kHz [32]. Many researchers have used microscopic features to efficiently detect transient device behaviour and thus improve energy disaggregation [36,37]. However measuring the power consumption with high sampling frequency has the drawback of higher cost through hardware and increase of computational power [38]. Therefore most studies focus on disaggregation algorithms using macroscopic features or only active power samples in combination with low computational cost disaggregation algorithms utilizing sampling rates in the order of seconds and minutes [28,39,40,41,42,43,44].
Considering the wide range of appliances with either steady-state behaviour [16], where appliances are modelled as finite state machines [16,45] or appliances with transient behaviour including non-linear and continuous appliances [18,36,46], investigation of the effect of different features and classification algorithms is essential. In this paper we evaluate the performance of various well-known and widely used classifiers and various features on the energy disaggregation task for the N I L M task. Specifically, we present a large scale evaluation of several features with respect to the N I L M performance on specific appliance types in combination with several widely used classification algorithms, in order to investigate which feature sets are more appropriate for accurately detecting specific appliance types, e.g non-linear appliances and the effect of using appliance specific features in the overall N I L M performance. The proposed methodology with appliance specific features is evaluated using several combinations of feature sets and classification algorithms.
The remainder of this paper is organized as follows: In Section 2 the baseline N I L M system is presented. In Section 3 the experimental setup is described and in Section 4 the evaluation results are presented. Finally conclusions are provided in Section 5.

2. NILM Architecture

N I L M energy disaggregation can be formulated as the task of determining the power consumption on device level based on the measurements of one sensor, within time windows (frames or epochs). Specifically, for a set of M 1 known devices each consuming power p m with 1 m M , the aggregated power P a g g measured by the sensor will be
P a g g = f ( p 1 , p 2 , , p M 1 , g ) = m = 1 M 1 p m + g = m = 1 M p m
where g = p M is a ‘ghost’ power consumption (noise) consumed by one or more unknown devices and f is the aggregation function. In N I L M the goal is to find estimations p ^ m , g ^ of the power consumption of each device m using an estimation method f 1 with minimal estimation error and p ^ M = g ^ , resulting in the total estimated power P ^ , i.e.,
P ^ = { p ^ 1 , p ^ 2 , , p ^ M 1 , g ^ } f 1 ( P a g g ) s . t . argmin f 1 { ( P a g g m = 1 M p ^ m ) 2 }
The block diagram of the N I L M architecture adopted in the present evaluation is illustrated in Figure 1 and consists of of four stages, namely pre-processing, feature extraction, appliance detection and post-processing. In detail, the aggregated power consumption signal calculated from a smart meter is initially pre-processed, i.e., passed through a median filter [47] and then frame blocked in time frames. After pre-processing feature vectors, v of length v , one for each frame are calculated. In the appliance detection stage the feature vectors are processed by a regression algorithm using a set of pre-trained appliance models to estimate the power consumption of each device. The output of the regression algorithm ( P r e g ) estimates the corresponding device consumption and a set of thresholds T m with 1 m M with T g = T M for the each device including the ghost device ( m = M ) is used to decide whether a device is switched on or off. The detection of appliances and estimation of their power consumption is performed for each frame of the aggregated signal. A post-processing stage is refining the power estimates form the regression model by mapping them to apriori known device states using a Look-Up-Table ( L U T ), i.e., if the distance of the regression output to any state in the device model is larger than 25 W the regression output is mapped to the closest device state. In order to define the number of states per device the K-Means algorithm was used for initialisation followed by Expectation-Maximization ( E M ) clustering to calculate the power consumption for each state of each device and form the L U T for the post-processing stage [48].

3. Experimental Setup

The N I L M architecture presented in Section 2 was evaluated using a number of publicly available datasets and a number of well-known machine learning algorithms for regression.

3.1. Datasets

To evaluate performance five different datasets of the E C O [47] database were used. The E C O database was chosen as it contains power consumption measurements per device as well as the aggregated consumption. The E C O -3 dataset was excluded as it contains only the aggregated signal and not the power consumptions per device. Furthermore the aggregated consumption measurements include not only the active power, but also the line currents ( I x ), line voltages ( V x ) and load angles ( φ x ) for all three phases ( x { 1 , 2 , 3 } ).
The evaluated datasets and their characteristics are tabulated in Table 1 with the number of appliances denoted in the column # A p p . In the same column, the number of appliances in brackets is the number of appliances after excluding devices with power consumption below 25 W , which were added to the power of the ghost device, similarly to the experimental setup followed in [40,49]. The next three columns in Table 1 are listing the sampling period T s , the duration T of the aggregated signal used and the appliance type for each evaluated dataset. The appliances type categorization is based on their operation as described in [50,51], i.e., one-state devices have only on/off status (e.g., resistive lamps, kettles or fridges without significant power spikes), multi-state devices having several discrete power consumption states (e.g., washing machines including different washing cycles) and non-linear loads (e.g., electronic appliances) having various states and stronger power variation. Considering their electrical layout all one- and multi-state appliances consist of a series of resistors, inductors and capacitors and thus can further be classified into resistive, inductive and capacitive devices. Non-linear appliances can include additional active components (e.g., semiconductors) and non-linear passive elements (e.g., diodes).
To evaluate N I L M performance in close to real conditions the aggregated signal including the ghost power from unknown devices was used as proposed in [52], instead of creating an artificial aggregated signal by adding the corresponding power consumption from each device. The aggregated signal includes real power samples, raw current samples, raw voltage samples and load angles, depending on the chosen feature set.

3.2. Pre-Processing and Feature Ranking

During pre-processing the aggregated signal was processed by a median filter of 5 samples as proposed in [47] and then was frame blocked in frames of 10 samples with overlap between successive frames equal to 50% (i.e., 5 samples). For every frame a feature vector v R D consisting of 15 statistical (mean value, minimum and maximum values, Root-Mean-Square ( R M S ) value, median value, percentiles 25% and 75%, variance, standard deviation ( s t d ), skewness, kurtosis, range, energy and Zero Crossings ( Z C )) and four electrical features (line current, neutral current, line voltage and load angle) were calculated resulting in a feature vector of dimensionality equal to D = 19 . In order to calculate the statistical importance of the 19 features the R e l i e f F feature ranking algorithm [53] was used. The R e l i e f F algorithm was chosen as it can deal with noisy data (in our task mainly coming from the ghost power) and is appropriate for feature ranking estimation of multi-class datasets [54], as the multiple devices of the N I L M task. The average R e l i e f F ranking scores across the five evaluated E C O datasets are shown in Figure 2.
As can be seen in Figure 2 statistical and electrical features can be divided into two groups based on their R e l i e f F scores. The first group includes eight statistical and three electrical features with high ranking score (≥0.04) namely the Min/Max, Mean, Energy, R M S , Percentiles75/25 (Per75/Per25), Median and the load angles ( φ 1 , 2 , 3 ), the line currents ( I 1 , 2 , 3 ) and the neutral current ( I N ), respectively. The second group includes features with lower ranking score (<0.04) namely the Zero Crossing rate, Peak2Rms, Range, Standard Deviation, Skewness, Kurtosis, Variance from the statistical features and the line voltages ( V 1 , 2 , 3 ) from the electrical features. For the electrical features it must be mentioned that the neutral current and the load angles are given by the sum of the line currents and the phase-shift between line currents and voltages, respectively, therefore they carry complementary information which affects their ranking scores.
The outcome of the feature ranking was used to design a set of seven experimental protocols, with the first four including only statistical features and the last three employing the additional electrical features. The chosen features for each experiment are tabulated in Table 2 where the first experiment is only considering the mean value of active power samples and thus is considered as baseline system, while for the following experiments features with decreasing feature score from the feature ranking were added under consideration of keeping similar pairs of features together (e.g., M i n / M a x ). There is one exceptional case (in protocol 5/7) for the electrical features were the load angles are added after the line currents/voltages. This is due to two reasons, namely that the combination of line current and voltages contains the same information as the load angles and that load angles can only be computed if line currents and voltages are available.
For the regression stage four different well known and widely used machine learning algorithms have been employed namely the feed-forward Deep Neural Networks ( D N N s ), the k-Nearest Neighbours ( K N N s ), the Random Forests ( R F s ) and the Support Vector Machines ( S V M s ). The free parameters of each regression algorithm were empirically optimized after grid search on a bootstrap training subset including data from the E C O -1/2/4/5/6 datasets with ideal aggregated data (without ghost power) as shown in Table 3. The best performance corresponding to the optimal values of each regression model is shown in bold. A “one vs. all” approach was chosen with the output of each regression model being the prediction of the power of the m t h appliance. In order to avoid overlap between training and test data, each of the evaluated datasets was equally split into two subsets, one for training each regression model and one for evaluating its performance.
As can be seen in Table 3 the parameter optimized regression models are a D N N feed-forward architecture with 3-hidden layers and 32 sigmoid nodes per layer, a K N N with K = 5 nearest neighbours, a R F with 32 trees per forest and a S V M with Radial Basis Function ( R b f ) as kernel and optimized kernel parameters γ = 12.8 , C = 1.45 . The D N N regression model achieved accuracy equal to 88.71% and outperformed all other evaluated regression models on the bootstrap training set.

4. Experimental Results

The N I L M architecture presented in Section 2 was evaluated according to the experimental setup described in Section 3 with the optimized parameters shown in Table 3. The performance was evaluated in terms of power estimation accuracy ( E A C C ), as proposed in [55] and defined in Equation (3). The estimation accuracy is taking into account the estimated power p ^ m and the ground-truth power consumption p m for each device m, where T is the number of frames and M is the number of disaggregated devices including the ghost power. For evaluating estimation accuracy on device level Equation (3) is modified by eliminating the summation over M appliances resulting in Equation (4) measuring the estimation accuracy on device level ( E A C C m ).
E A C C = 1 t = 1 T m = 1 M | p ^ m t p m t | 2 t = 1 T m = 1 M | p m t |
E A C C m = 1 t = 1 T | p ^ m t p m t | 2 t = 1 T | p m t |
The evaluation results for different experimental protocols and different regression models are tabulated in Table 4. As can be seen in Table 4 adding additional statistical and electrical features improves the energy disaggregation performance across all evaluated datasets, with the R F regression model outperforming all other regression algorithms. In detail for the R F regression model the greatest absolute improvement was observed for the E C O -6 dataset (9.32%), followed by the E C O -1 dataset (7.80%), while the lowest absolute improvement was found for the E C O -3 dataset (5.22%). Moreover, for almost all of the evaluated datasets and regression algorithms the best energy disaggregation performance was achieved when using additional electrical features, i.e., in protocols 5–7.
In order to evaluate the appropriateness of each regression algorithm in the seven experimental protocols the average performance across the five datasets for each of the regression models was calculated. The results are tabulated in Table 5.
As can be seen in Table 5, protocol five shows the best average performance for the D N N , S V M and R F regression models, while the K N N model shows a slightly higher performance protocol seven followed by protocol five. Moreover, as also shown in Table 4, the R F regression model outperforms the other models (87.6% with 6.5% performance increase), followed by K N N and D N N with similar performance (∼83.8% with 2.5% performance increase), while S V M achieved the lowest performance (79.6% with 1.9% performance increase).
Further analysis of the evaluated results was conducted on appliance type level, as they are described in Section 3 and tabulated in Table 1. The results for per device improvement using the best performing classifier ( R F ) are tabulated in Table 6. The first experimental protocol uses only the mean value of the active power as feature and thus is considered here as baseline system, against which all performance improvements have been calculated in Table 6 with the corresponding protocol denoted in brackets. Moreover appliances that are not operating during the testing are marked in red and were excluded from further investigation.
As can be seen in Table 6 high improvements of performance do not necessarily appear in experimental protocols with the highest number of features. To investigate the relation between appliance types and features on the energy disaggregation task we consider two types of linear appliances, either with pure resistive equivalent circuit diagram or complex loads with inductive/capacitive behaviour. Therefore three appliances categories are formed, namely one/multi-state appliances with resistive behaviour, one/multi-state appliances that can be modelled as complex loads (mainly inductive) and non-linear appliances. This appliance categorization is illustrated in Table 7.
After examining the results from Table 6 under the consideration of Table 7 it can be seen that for resistive one/multi-state appliances (e.g., kettle, coffee machine or lamp) where the reactive power is zero ( Q = 0 ) the best performing experimental protocol is protocol five in which together with the statistical features the line current is included in the feature vector as an electrical feature. For this appliance type adding the line voltage or the load angle as additional feature is not beneficial, since the load angle or the shift between current and voltage is always zero and thus does not contribute to their parametrized power signature with significant information. Except this, one/multi-state appliances with strong inductive behaviour (e.g., fridges or freezers) benefit from adding the load angle as a feature, as they consume a significant amount of reactive power and thus achieved their best performance with experimental protocol seven. In addition, non-linear devices cannot be described in terms of the active and reactive power consumption including the corresponding load angle, since the current flowing through them is non-sinusoidal as illustrated in Table 7. Thus their power consumption must be described through different techniques, as for example the Fryze power theory [56], where time domain analysis of active and non-active currents is used and the reactive power is split into a reactive component caused by the time domain shift between current and voltage and a component caused by the non-linearity of the device. For such appliances (e.g., entertainment, laptop or TV) the best performing experimental protocol was protocol six where line current and line voltage are added as features hence a time domain description is performed as suggested in [56] and does not include the load angle, since in non-linear appliances the load angle does not carry any device-dependent information.
As regards performance on dataset level the maximum overall performance can be achieved when detecting each device using its own set of optimal features (i.e., the best performing experimental protocol) as tabulated in Table 6. Additionally to selecting appliance-driven features the disaggregation results can be improved when employing the post-processing step from Section 2 where the power estimates from the regression stage P r e g are mapped to the appliance states determined through the appliance model during the pre-processing. The per dataset results when choosing the optimal set of features individually for each device and utilizing the post-processing are tabulated in Table 8 with the best performing datasets shown in bold.
As can be seen in Table 8, employing the optimal set of features for each device results in further improvement of the disaggregation accuracy varying from 0.1% to 3.2% depending on the dataset and the regression model. The maximum average performance increase for the best performing classifier ( R F ) is 0.8% with an overall average disaggregation accuracy of 89.0%. When further employing the post-processing as described in Section 2 another performance increase between 0.5% and 1.2% can be observed when utilizing D N N s , R F s or S V M s . However, no performance increase was observed when using K N N s . The performance increase when using D N N s , R F s or S V M s is mainly due to one/multi-state linear appliances, which can be modelled as finite-state-machines and benefit from the post-processing step where power estimates are mapped to discrete power states of the corresponding appliance. In terms of absolute improvement R F still outperforms all other classifiers when applying the L U T post-processing with overall disaggregation accuracy equal to 89.5%.

5. Conclusions

In this paper the performance of different classifiers in combination with different sets of features for energy disaggregation in non-intrusive load monitoring was investigated. The evaluation results showed significant importance on the selection of features, with the electrical features being more discriminative than the statistical ones. It was also shown that the optimal choice of features strongly depends on the device type and its electrical characteristics, with the non-linear devices better being disaggregated when using line current and line voltage, while linear devices were disaggregated well with statistical features only. After evaluating energy disaggregation across several datasets, random forest ( R F ) was found the best performing regression algorithm outperforming all other evaluated machine learning algorithms by an absolute performance increase of approximately 6.5%. Moreover, it was shown that, when using device dependent features and device state mapping as post-processing, further improvement in energy disaggregation accuracy can be achieved (up-to 1.3%), resulting in a maximum disaggregation performance of 89.5%. Energy disaggregation and especially non-intrusive load monitoring is a very challenging task. The use of detectors which have been designed, adapted or fine-tuned to the specifications of each appliance is a direction which will result in further improvement of disaggregation performance and, in combination with the recent evolution of deep learning [43,57,58] and the use of big data for training device models, e.g., in the order of years [57], is expected to contribute to even more accurate disaggregation methodologies. Moreover multimodal information other than energy data acquired from smart meters like weather, occupancy or socio-economical events could be supportive for disaggregating the energy consumption of households and buildings [59].

Author Contributions

Writing—original draft, P.A.S.; Writing—review & editing, I.M.

Funding

This research received no external funding.

Acknowledgments

This work was supported by the U A Doctoral Training Alliance (https://www.unialliance.ac.uk/) for Energy in the United Kingdom.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Yu, B.; Tian, Y.; Zhang, J. A dynamic active energy demand management system for evaluating the effect of policy scheme on household energy consumption behavior. Energy 2015, 91, 491–506. [Google Scholar] [CrossRef]
  2. Elma, O.; Selamogullar, U.S. A survey of a residential load profile for demand side management systems. In Proceedings of the 2017 the 5th IEEE International Conference on Smart Energy Grid Engineering (SEGE), Oshawa, ON, Canada, 14–17 August 2017; Gabbar, H.A., Ed.; IEEE Press: Piscataway, NJ, USA, 2017; pp. 85–89. [Google Scholar] [CrossRef]
  3. Eurostat. Energy Statistics—An Overview; European Union: Brussels, Belgium, 2018. [Google Scholar]
  4. Pérez-Lombard, L.; Ortiz, J.; Pout, C. A review on buildings energy consumption information. Energy Build. 2008, 40, 394–398. [Google Scholar] [CrossRef]
  5. Bilan, Y.; Streimikiene, D.; Vasylieva, T.; Lyulyov, O.; Pimonenko, T.; Pavlyk, A. Linking between Renewable Energy, CO2 Emissions, and Economic Growth: Challenges for Candidates and Potential Candidates for the EU Membership. Sustainability 2019, 11, 1528. [Google Scholar] [CrossRef]
  6. Sinn, H.W. Buffering volatility: A study on the limits of Germany’s energy revolution. Eur. Econ. Rev. 2017, 99, 130–150. [Google Scholar] [CrossRef]
  7. Miyazato, Y.; Tahara, H.; Uchida, K.; Celestino Muarapaz, C.; Motin Howlader, A.; Senjyu, T. Multi-Objective Optimization for Smart House Applied Real Time Pricing Systems. Sustainability 2016, 8, 1273. [Google Scholar] [CrossRef]
  8. Lin, Y.H.; Tsai, M.S. An Advanced Home Energy Management System Facilitated by Nonintrusive Load Monitoring With Automated Multiobjective Power Scheduling. IEEE Trans. Smart Grid 2015, 6, 1839–1851. [Google Scholar] [CrossRef]
  9. Lee, D.; Cheng, C.C. Energy savings by energy management systems: A review. Renew. Sustain. Energy Rev. 2016, 56, 760–777. [Google Scholar] [CrossRef]
  10. Zeifman, M.; Roth, K. Viterbi algorithm with sparse transitions (VAST) for nonintrusive load monitoring. In Proceedings of the 2011 IEEE Symposium on Computational Intelligence Applications in Smart Grid, Paris, France, 11–15 April 2011; Staff, I., Ed.; IEEE: Piscataway, NJ, USA, 2011; pp. 1–8. [Google Scholar] [CrossRef]
  11. Indragandhi, V.; Logesh, R.; Subramaniyaswamy, V.; Vijayakumar, V.; Siarry, P.; Uden, L. Multi-objective optimization and energy management in renewable based AC/DC microgrid. Comput. Electr. Eng. 2018, 70, 179–198. [Google Scholar] [CrossRef]
  12. Çavdar, İ.; Faryad, V. New Design of a Supervised Energy Disaggregation Model Based on the Deep Neural Network for a Smart Grid. Energies 2019, 12, 1217. [Google Scholar] [CrossRef]
  13. Gabaldón, A.; Álvarez, C.; Ruiz-Abellón, M.; Guillamón, A.; Valero-Verdú, S.; Molina, R.; García-Garre, A. Integration of Methodologies for the Evaluation of Offer Curves in Energy and Capacity Markets through Energy Efficiency and Demand Response. Sustainability 2018, 10, 483. [Google Scholar] [CrossRef]
  14. Chis, A.; Rajasekharan, J.; Lunden, J.; Koivunen, V. Demand response for renewable energy integration and load balancing in smart grid communities. In Proceedings of the 2016 24th European Signal Processing Conference (EUSIPCO), Budapest, Hungary, 29 August–2 September 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 1423–1427. [Google Scholar] [CrossRef]
  15. Meziane, M.N.; Abed-Meraim, K. Modeling and estimation of transient current signals. In Proceedings of the 2015 23rd European Signal Processing Conference (EUSIPCO), Nice, France, 31 August–4 September 2015; pp. 1960–1964. [Google Scholar] [CrossRef]
  16. Hart, G.W. Nonintrusive appliance load monitoring. Proc. IEEE 1992, 80, 1870–1891. [Google Scholar] [CrossRef]
  17. Gaur, M.; Majumdar, A. Disaggregating Transform Learning for Non-Intrusive Load Monitoring. IEEE Access 2018, 6, 46256–46265. [Google Scholar] [CrossRef]
  18. Zhu, Y.; Lu, S. Load profile disaggregation by Blind source separation: A wavelets-assisted independent component analysis approach. In Proceedings of the 2014 IEEE PES General Meeting, National Harbor, MD, USA, 27–31 July 2014; IEEE: Piscataway, NJ, USA, 2014; pp. 1–5. [Google Scholar] [CrossRef]
  19. Figueiredo, M.; Ribeiro, B.; de Almeida, A. Electrical Signal Source Separation Via Nonnegative Tensor Factorization Using On Site Measurements in a Smart Home. IEEE Trans. Instrum. Meas. 2014, 63, 364–373. [Google Scholar] [CrossRef]
  20. Bilski, P.; Winiecki, W. Generalized algorithm for the non-intrusive identification of electrical appliances in the household. In Proceedings of the the Crossing Point of Intelligent Data Aquisition & Advanced Computing Systems and East & West Scientists, Bucharest, Romania, 1–23 September 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 730–735. [Google Scholar] [CrossRef]
  21. Liebgott, F.; Yang, B. Active learning with cross-dataset validation in event-based non-intrusive load monitoring. In Proceedings of the EUSIPCO 2017, Kos, Greece, 28 August–2 September 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 296–300. [Google Scholar] [CrossRef]
  22. Bilski, P.; Winiecki, W. The rule-based method for the non-intrusive electrical appliances identification. In Proceedings of the IDAACS’2015, Warsaw, Poland, 24–26 September 2015; IEEE: Piscataway, NJ, USA, 2015; pp. 220–225. [Google Scholar] [CrossRef]
  23. Zhou, Y.; Zhai, Q.; Li, X.; Yang, Y. A method for recognizing electrical appliances based on active load demand in a house/office environment. In Proceedings of the 2017 Chinese Automation Congress (CAC), Jinan, China, 20–22 October 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 3584–3589. [Google Scholar] [CrossRef]
  24. Basu, K.; Debusschere, V.; Bacha, S.; Maulik, U.; Bondyopadhyay, S. Nonintrusive Load Monitoring: A Temporal Multilabel Classification Approach. IEEE Trans. Ind. Inform. 2015, 11, 262–270. [Google Scholar] [CrossRef]
  25. Li, Y.; Peng, Z.; Huang, J.; Zhang, Z.; Son, J.H. Energy Disaggregation via Hierarchical Factorial HMM. In Proceedings of the 2nd International Workshop on Non-Intrusive Load Monitoring, Austin, TX, USA, 3 June 2014. [Google Scholar]
  26. Valera, I.; Ruiz, F.; Svensson, L.; Perez-Cruz, F. Infinite Factorial Dynamical Model. In Proceedings of the NIPS’15 Proceedings of the 28th International Conference on Neural Information Processing Systems-Volume 1, Montreal, QC, Canada, 7–12 December 2015. [Google Scholar]
  27. Mueller, J.A.; Kimball, J.W. Accurate Energy Use Estimation for Nonintrusive Load Monitoring in Systems of Known Devices. IEEE Trans. Smart Grid 2018, 9, 2797–2808. [Google Scholar] [CrossRef]
  28. Kong, W.; Dong, Z.Y.; Ma, J.; Hill, D.J.; Zhao, J.; Luo, F. An Extensible Approach for Non-Intrusive Load Disaggregation With Smart Meter Data. IEEE Trans. Smart Grid 2018, 9, 3362–3372. [Google Scholar] [CrossRef]
  29. Agyeman, K.; Han, S.; Han, S. Real-Time Recognition Non-Intrusive Electrical Appliance Monitoring Algorithm for a Residential Building Energy Management System. Energies 2015, 8, 9029–9048. [Google Scholar] [CrossRef]
  30. Kim, Y.; Kong, S.; Ko, R.; Joo, S.K. Electrical event identification technique for monitoring home appliance load using load signatures. In Proceedings of the IEEE International Conference on Consumer Electronics (ICCE), Las Vegas, NV, USA, 10–13 January 2014; IEEE: Piscataway, NJ, USA, 2014; pp. 296–297. [Google Scholar] [CrossRef]
  31. Bilski, P. Non-Intrusive Appliance Load Identification with the Ensemble of Classifiers. In Proceedings of the NILM2016 3rd International Workshop on Non-Intrusive Load Monitoring, Vancouver, BC, Canada, 14–15 May 2016. [Google Scholar]
  32. Gao, J.; Kara, E.C.; Giri, S.; Berges, M. A feasibility study of automated plug-load identification from high-frequency measurements. In Proceedings of the 2015 IEEE Global Conference on Signal and Information Processing (GlobalSIP), Orlando, FL, USA, 14–16 December 2015; IEEE: Piscataway, NJ, USA, 2015; pp. 220–224. [Google Scholar] [CrossRef]
  33. Zhao, B.; Stankovic, L.; Stankovic, V. On a Training-Less Solution for Non-Intrusive Appliance Load Monitoring Using Graph Signal Processing. IEEE Access 2016, 4, 1784–1799. [Google Scholar] [CrossRef] [Green Version]
  34. Bouhouras, A.; Gkaidatzis, P.; Chatzisavvas, K.; Panagiotou, E.; Poulakis, N.; Christoforidis, G. Load Signature Formulation for Non-Intrusive Load Monitoring Based on Current Measurements. Energies 2017, 10, 538. [Google Scholar] [CrossRef]
  35. Barriquello, C.H.; Borin, V.P.; Campos, A. Approach for home appliance recognition using vector projection length and Stockwell transform. Electron. Lett. 2015, 51, 2035–2037. [Google Scholar] [CrossRef]
  36. Chang, H.H.; Lian, K.L.; Su, Y.C.; Lee, W.J. Power-Spectrum-Based Wavelet Transform for Nonintrusive Demand Monitoring and Load Identification. IEEE Trans. Ind. Appl. 2014, 50, 2081–2089. [Google Scholar] [CrossRef]
  37. Chang, H.H. Non-Intrusive Demand Monitoring and Load Identification for Energy Management Systems Based on Transient Feature Analyses. Energies 2012, 5, 4569–4589. [Google Scholar] [CrossRef] [Green Version]
  38. Koutitas, G.C.; Tassiulas, L. Low Cost Disaggregation of Smart Meter Sensor Data. IEEE Sens. J. 2016, 16, 1665–1673. [Google Scholar] [CrossRef]
  39. van Cutsem, O.; Lilis, G.; Kayal, M. Automatic multi-state load profile identification with application to energy disaggregation. In Proceedings of the 2017 22nd IEEE International Conference on Emerging Technologies and Factory Automation, Limassol, Cyprus, 12–15 September 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 1–8. [Google Scholar] [CrossRef]
  40. Wang, H.; Yang, W. An Iterative Load Disaggregation Approach Based on Appliance Consumption Pattern. Appl. Sci. 2018, 8, 542. [Google Scholar] [CrossRef]
  41. Zhang, C.; Zhong, M.; Wang, Z.; Goddard, N.; Sutton, C. Sequence-to-point learning with neural networks for nonintrusive load monitoring. arXiv 2018, arXiv:1612.09106. [Google Scholar]
  42. Rehman, A.U.; Lie, T.T.; Valles, B.; Tito, S.R. Low Complexity Event Detection Algorithm for Non- Intrusive Load Monitoring Systems. In Proceedings of the 2018 IEEE Innovative Smart Grid Technologies—Asia (ISGT Asia), Singapore, 22–25 May 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 746–751. [Google Scholar] [CrossRef]
  43. Le, T.T.H.; Kim, H. Non-Intrusive Load Monitoring Based on Novel Transient Signal in Household Appliances with Low Sampling Rate. Energies 2018, 11, 3409. [Google Scholar] [CrossRef]
  44. Wang, H.; Yang, W.; Chen, T.; Yang, Q. An Optimal Load Disaggregation Method Based on Power Consumption Pattern for Low Sampling Data. Sustainability 2019, 11, 251. [Google Scholar] [CrossRef]
  45. Ridi, A.; Hennebert, J. Hidden Markov Models for ILM Appliance Identification. Procedia Comput. Sci. 2014, 32, 1010–1015. [Google Scholar] [CrossRef] [Green Version]
  46. Zheng, Z.; Chen, H.; Luo, X. A Supervised Event-Based Non-Intrusive Load Monitoring for Non-Linear Appliances. Sustainability 2018, 10, 1001. [Google Scholar] [CrossRef]
  47. Beckel, C.; Kleiminger, W.; Cicchetti, R.; Staake, T.; Santini, S. The ECO data set and the performance of non-intrusive load monitoring algorithms. In Proceedings of the BuildSys’14, 1st ACM Conference on Embedded Systems for Energy-Efficient Buildings, Memphis, TN, USA, 3–6 November 2014; Srivastava, M., Ed.; ACM: New York, NY, USA, 2014; pp. 80–89. [Google Scholar] [CrossRef]
  48. Ridi, A.; Gisler, C.; Hennebert, J. Appliance and state recognition using Hidden Markov Models. In Proceedings of the International Conference on Data Science and Advanced Analytics (DSAA), Shanghai, China, 30 October–1 November 2014; Cao, L., Ed.; IEEE: Piscataway, NJ, USA, 2014; pp. 270–276. [Google Scholar] [CrossRef]
  49. Tabatabaei, S.M.; Dick, S.; Xu, W. Toward Non-Intrusive Load Monitoring via Multi-Label Classification. IEEE Trans. Smart Grid 2017, 8, 26–40. [Google Scholar] [CrossRef]
  50. Shaw, S.R.; Leeb, S.B.; Norford, L.K.; Cox, R.W. Nonintrusive Load Monitoring and Diagnostics in Power Systems. IEEE Trans. Instrum. Meas. 2008, 57, 1445–1454. [Google Scholar] [CrossRef]
  51. Zoha, A.; Gluhak, A.; Imran, M.A.; Rajasegarar, S. Non-intrusive load monitoring approaches for disaggregated energy sensing: A survey. Sensors 2012, 12, 16838–16866. [Google Scholar] [CrossRef] [PubMed]
  52. Pereira, L.; Nunes, N. Performance evaluation in non-intrusive load monitoring: Datasets, metrics, and tools-A review. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2018, 8, e1265. [Google Scholar] [CrossRef]
  53. Urbanowicz, R.J.; Meeker, M.; LaCava, W.; Olson, R.S.; Moore, J.H. Relief-Based Feature Selection: Introduction and Review. J. Biomed. Informat. 2018, 85, 189–203. [Google Scholar] [CrossRef] [PubMed]
  54. Kononenko, I.; Robnik-Sikonja, M.; Pompe, U. ReliefF for Estimation and Discretization of Attributes in Classification, Regression, and ILP Problems. In Proceedings of the AIMSA-96, Sozopol, Bulgaria, 21–22 September 1996; pp. 31–40. [Google Scholar]
  55. Kolter, J.Z.; Johnson, M.J. (Eds.) REDD: A Public Data Set for Energy Disaggregation Research. In Proceedings of the SustKDD Workshop on Data Mining Applications in Sustainability, San Diego, CA, USA, 21 August 2011. [Google Scholar]
  56. Staudt, V. Fryze-Buchholz-Depenbrock: A time-domain power theory. In Proceedings of the 2008 International School on Nonsinusoidal Currents and Compensation, Lagow, Poland, 10–13 June 2008; IEEE: Piscataway, NJ, USA, 2008; pp. 1–12. [Google Scholar] [CrossRef]
  57. Harell, A.; Makonin, S.; Bajić, I.V. Wavenilm: A Causal Neural Network for Power Disaggregation from the Complex Power Signal. arXiv 2019, arXiv:1902.08736. [Google Scholar]
  58. Valenti, M.; Bonfigli, R.; Principi, E.; Squartini, S. Exploiting the Reactive Power in Deep Neural Models for Non-Intrusive Load Monitoring. In Proceedings of the 2018 International Joint Conference on Neural Networks (IJCNN), Rio de Janeiro, Brazil, 8–13 July 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 1–8. [Google Scholar]
  59. Ma, M.; Lin, W.; Zhang, J.; Wang, P.; Zhou, Y.; Liang, X. Towards Energy-Awareness Smart Building: Discover the Fingerprint of Your Electrical Appliances. IEEE Trans. Ind. Inform. 2017, 1. [Google Scholar] [CrossRef]
Figure 1. Block diagram of the proposed Non-Intrusive Load Monitoring ( N I L M ) architecture.
Figure 1. Block diagram of the proposed Non-Intrusive Load Monitoring ( N I L M ) architecture.
Sustainability 11 03222 g001
Figure 2. Feature ranking for 19 different statistical and electrical features determined with the R e l i e f F algorithm.
Figure 2. Feature ranking for 19 different statistical and electrical features determined with the R e l i e f F algorithm.
Sustainability 11 03222 g002
Table 1. List of evaluated datasets and their respective properties.
Table 1. List of evaluated datasets and their respective properties.
DatasetParameters
#App T s TAppliance Type
ECO-17 (6)1s7dOne-state/multi-state
ECO-212 (9)1s7dOne-state/multi-state/non-linear
ECO-48 (8)1s7dOne-state/multi-state/non-linear
ECO-58 (6)1s7dOne-state/multi-state/non-linear
ECO-67 (6)1s7dOne-state/multi-state/non-linear
Table 2. Definition of seven different experimental protocols including different numbers of features determined from the feature ranking (experiments 1–4 with statistical features only and experiments 5–7 with additional electrical features).
Table 2. Definition of seven different experimental protocols including different numbers of features determined from the feature ranking (experiments 1–4 with statistical features only and experiments 5–7 with additional electrical features).
ProtocolFeaturesCategory
1MeanStatistical Features
2Mean, Max, Min, RMS, EnergyStatistical Features
3Mean, Max, Min, RMS, Energy, Median, Per25, Per75Statistical Features
4Mean, Max, Min, RMS, Energy, Median, Per25, Per75, Peak2Rms, Range, Std, Variance, Skewness, KurtosisStatistical Features
5Statistical Features, Line CurrentsStatistical/Electrical Features
6Statistical Features, Line Currents, Line VoltagesStatistical/Electrical Features
7Statistical Features, Line Currents, Line Voltages, Load AnglesStatistical/Electrical Features
Table 3. Parametrization results (%) for four different classifiers namely Deep Neural Networks ( D N N s ), Random Forest ( R F s ), K-Nearest-Neighbours ( K N N s ) and Support Vector Machines ( S V M s ).
Table 3. Parametrization results (%) for four different classifiers namely Deep Neural Networks ( D N N s ), Random Forest ( R F s ), K-Nearest-Neighbours ( K N N s ) and Support Vector Machines ( S V M s ).
Deep Neural Network (DNN)
Nodes/ Layers48163264128
180.4287.5487.8583.7386.3881.67
270.0986.3986.9287.5082.6883.62
380.4086.7087.8688.7188.3984.20
475.4087.9587.0287.1585.32x
Random Forest (RF)
Trees8163264128256
85.4585.3185.4785.4285.4485.42
K-Nearest-Neighbours (KNN)
K123456
82.1582.7482.6883.0583.2682.42
Support Vector Machine (SVM)
KernelLinearGaussianRbfPol-2Pol-3Pol-4
55.0272.3376.2959.2463.5867.83
Table 4. Disaggregation results (%) for five different datasets out of the E C O database utilizing four different classifiers ( D N N , K N N , R F , S V M ) with seven experimental protocols each.
Table 4. Disaggregation results (%) for five different datasets out of the E C O database utilizing four different classifiers ( D N N , K N N , R F , S V M ) with seven experimental protocols each.
ECO-1
Protocol1234567
DNN73.1470.9471.9473.1277.1873.5278.38
KNN72.5675.8775.8276.0276.0675.9078.92
RF72.4277.0277.0078.3479.0578.2380.22
SVM67.0466.4366.3566.8469.4167.7567.98
ECO-2
DNN83.7084.6981.9985.7589.7883.3989.17
KNN85.4085.1785.2285.4085.4485.2687.88
RF83.8486.1685.9986.2290.0390.4091.18
SVM80.9771.7170.8578.7081.8180.6481.09
ECO-4
DNN82.4283.0081.3482.4578.3584.1680.78
KNN80.8582.1682.1782.1382.5082.4283.11
RF82.2783.2783.4983.8486.7686.4287.49
SVM77.8878.7978.5580.4182.3582.4681.70
ECO-5
DNN88.1186.8983.4887.5991.4889.4791.05
KNN85.6388.3688.4088.0888.5388.5588.88
RF87.6588.8088.8389.1893.7993.7893.79
SVM88.6287.3586.7888.4291.7191.1490.18
ECO-6
DNN80.0381.3082.8782.4281.9481.8868.06
KNN72.5675.8775.8276.0276.0675.9078.92
RF79.0083.3983.6384.5788.3287.3780.90
SVM72.8570.5870.2371.7472.9372.7759.03
Table 5. Comparison of different regression algorithms for averaged estimation accuracies (% E A C C ) (average performance across the five datasets E C O -1/2/4/5/6) for the seven experimental protocols.
Table 5. Comparison of different regression algorithms for averaged estimation accuracies (% E A C C ) (average performance across the five datasets E C O -1/2/4/5/6) for the seven experimental protocols.
Protocol1234567
DNN81.4881.3680.3282.2783.7582.4881.49
KNN80.4783.0783.1683.0783.2683.1783.89
RF81.0483.7383.7984.4387.5987.2486.72
SVM77.4774.9774.5577.2279.6478.9576.00
Table 6. Maximum improvement (%) with respect to the baseline (first) protocol for the best performing regression model ( R F ). The protocol with the best performance is given in brackets. Further devices that are not operating during the testing period are marked in red.
Table 6. Maximum improvement (%) with respect to the baseline (first) protocol for the best performing regression model ( R F ). The protocol with the best performance is given in brackets. Further devices that are not operating during the testing period are marked in red.
DeviceCategoryECO-1ECO-2ECO-4ECO-5ECO-6
Air-Exhaustone-state-0.00 (1)---
Coffee-Makerone-state---30.97 (5)0.00 (1)
Dryermulti-state24.55 (5)----
Entertainmentnon-linear-6.76 (6)11.12 (6)18.97 (6)16.85 (5)
Freezerone-state8.87 (7)2.91 (7)5.57 (7)--
Fridgeone-state15.78 (7)9.76 (7)22.65 (7)30.09 (7)38.16 (5)
Ghostundefined3.35 (4)4.48 (7)2.46 (5)3.30 (7)3.44 (5)
Kettleone-state28.42 (6)---14.77 (5)
Kitchenundefined--18.59 (4)--
Lampone-state-0.56 (7)46.44 (5)--
Laptopnon-linear-22.30 (5)-2.19 (5)11.30 (5)
Microwavemulti-state--23.10 (7)0.01 (0)-
Stereonon-linear-8.31 (6)0.33 (7)--
TVnon-linear-5.81 (6)---
WMmulti-state32.24 (4)----
Table 7. Impact of employing temporal contextual infromation for three different devices categories.
Table 7. Impact of employing temporal contextual infromation for three different devices categories.
One/Multi-State (Resistive)One/Multi-State (Inductive)Non-Linear
Sustainability 11 03222 i001 Sustainability 11 03222 i002 Sustainability 11 03222 i003
Sustainability 11 03222 i004 Sustainability 11 03222 i005 Sustainability 11 03222 i006
P = V · I Q = 0 P = V · I · c o s ( φ ) Q = V · I · s i n ( φ ) P = 1 T t T T k v k · i k d t Q = Q r e a c t i v e + Q n o n l i n e a r
  • Coffee Maker, Dryer, Kettle, Lamp, Microwave, WM
  • Fridge, Freezer
  • Entertainment, Stereo, Laptop, TV
Table 8. Maximum performance (%) per dataset for all classifiers using appliance driven features with (‘Post’) and without post-processing (‘App’) when compared to the best performing protocol without post-procesing and with uniform appliance features for all appliances (‘Base’).
Table 8. Maximum performance (%) per dataset for all classifiers using appliance driven features with (‘Post’) and without post-processing (‘App’) when compared to the best performing protocol without post-procesing and with uniform appliance features for all appliances (‘Base’).
DNNKNNRFSVM
BaseAppPostBaseAppPostBaseAppPostBaseAppPost
ECO-178.3881.5681.8678.9279.7179.3280.2281.8982.4769.4169.7569.85
ECO-289.7890.5290.6187.8890.5290.4791.1891.7592.3981.8183.6485.25
ECO-484.1685.3486.5683.1183.3283.3487.4987.9087.9982.4684.0284.88
ECO-591.4892.3693.1388.8889.9489.6293.7993.9294.5491.7192.7193.21
ECO-682.8783.0586.6283.7184.6285.2488.3289.5490.1372.9375.1476.58
AVG85.3386.5787.7684.5085.6285.6088.2089.0089.5079.6681.0581.95

Share and Cite

MDPI and ACS Style

Schirmer, P.A.; Mporas, I. Statistical and Electrical Features Evaluation for Electrical Appliances Energy Disaggregation. Sustainability 2019, 11, 3222. https://doi.org/10.3390/su11113222

AMA Style

Schirmer PA, Mporas I. Statistical and Electrical Features Evaluation for Electrical Appliances Energy Disaggregation. Sustainability. 2019; 11(11):3222. https://doi.org/10.3390/su11113222

Chicago/Turabian Style

Schirmer, Pascal A., and Iosif Mporas. 2019. "Statistical and Electrical Features Evaluation for Electrical Appliances Energy Disaggregation" Sustainability 11, no. 11: 3222. https://doi.org/10.3390/su11113222

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop