Open Access
This article is
 freely available
 reusable
Electronics 2019, 8(9), 1002; https://doi.org/10.3390/electronics8091002
Article
Nonintrusive Load Monitoring Based on Complementary Features of Spurious Emissions
School of Electronic and Information Engineering, Beihang University, Beijing 100191, China
^{*}
Author to whom correspondence should be addressed.
Received: 10 August 2019 / Accepted: 4 September 2019 / Published: 7 September 2019
Abstract
:In this paper, a novel method that utilizes the fractional correlationbased algorithm and the Bspline curve fittingbased algorithm is proposed to extract the complementary features for detecting the operating states of appliances. The identification of appliance operating states is one of the key parts for nonintrusive load monitoring (NILM). Considering the individual spurious emissions generated because of nonlinear components in each electronic device, the spurious emissions from the power cord can be picked up to solve the problem of data storage. Five types of common household appliances are considered in this study. The fractional correlationbased algorithm and Bspline curve fittingbased algorithm are used to extract two groups of complementary features from the spurious emissions of those five types of appliances. The experimental results show that the feature vectors extracted using the proposed method are obviously distinguishable. In addition, the features extracted show a good longtime stability, which is verified through a fiveday experiment. Finally, based on support vector machine (SVM) and Dempster–Shafer (DS) evidence theory, the identification accuracy reaches 85.5% using a combining classifier incorporated with the features extracted from the proposed methods.
Keywords:
feature extraction; fractional correlation; Bspline curve fitting; combining classifier; support vector machine (SVM); Dempster–Shafer (DS) evidence theory; nonintrusive load monitoring (NILM)1. Introduction
Buildings account for a major portion of both primary energy and electricity consumption, and previous investigations indicated that building electricity consumption can be reduced by up to 10%–15% using energy management [1,2]. The smart grid and its supporting home automation network can promote residents’ understanding of household appliance consumption and have the potential to be an effective means of energy management. To make household appliance consumption clear and understandable, each appliance requires a sensor, which can be achieved by modifying the existing appliances. This is associated with a low customer acceptance and thus is an obstacle in the promotion of the smart grid. Nonintrusive load monitoring (NILM), instead of placing a sensor on each appliance, reveals appliance usage at home from a single sensing point, which has been shown to increase customer acceptance of the home automation network [3,4,5].
NILM was first proposed by Hart [6] in the 1980s, in which the steadystate active power (P) and reactive power (Q) consumptions were used as load signatures. However, in certain situations, two appliances may consume the same amount of P and Q, which makes the two appliances indistinguishable [7]. Therefore, researchers extract appliance signatures from electrical parameters in switch states [8,9,10]. Nevertheless, transient signals require nonstop monitoring, which imposes data transmission and storage challenges. To observe the phase noise near 60 Hz, the current was measured 170 times per s in [8]. Furthermore, some of the transient signal can be missed, while some of the nontransient signal can be mistaken as transient [11]. In this paper, to tackle this problem, spurious emissions in the spectrum of each appliance are employed to identify different appliances. This technique is based on the fact that spurious emissions coupled into the power cord from various appliances are different due to their unique nonlinear components, such as the diode. The employment of spurious emissions can obviously reduce the amount of data storage and still guarantee identification accuracy.
To recognize the operating states of household appliances, the features of historical data are firstly extracted to establish a classifier model, and then, the operating states of home appliances are identified through realtime data. The issue of how to extract individual features from measured data is critical. In our study, the features of spurious emissions are extracted from the time–frequency distribution and their physical properties, respectively. The peak intensity of the fractional correlation is higher than the traditional correlation, and the correlation operation can be extended to any angle of the time–frequency plane by the fractional Fourier transform (FrFT) [12]. Furthermore, electromagnetic emissions can be divided into different components based on physical characteristics, presented in the “basic emission waveform theory” [13]. The signal can be accurately separated through Bspline curve fitting on the physical layer to obtain the statistical features of different components. Therefore, in this paper, the fractional correlationbased algorithm and Bspline curve fittingbased algorithm are proposed to extract two groups of complementary features, which show longtime stability and then are used in a classifier. There are various classification algorithms, such as an improved fuzzy clustering algorithm [14], neural networks [15,16], and so on. However, considering the amount of training sample, a combining classifier is used. The combining classifier is based on support vector machine (SVM) and Dempster–Shafer (DS) evidence theory, which shows the final identification results.
The main contributions in this paper are summarized as follows:
 Different from previous studies, spurious emissions in the spectrum of each appliance are used to identify the operating states based on the nonlinear components, which can obviously reduce the amount of data storage and still guarantee identification accuracy;
 Two groups of complementary features are extracted from the spurious emissions of household appliances. The fractional correlationbased algorithm exploits the correlation coefficient to evaluate the similarity between transformed sequences in the time–frequency plane, and the Bspline curve fittingbased algorithm obtains statistical features of different components on the physical layer.
The paper is organized as follows. In Section 2, the feature extraction method is proposed based on the fractional correlationbased algorithm and the Bspline curve fittingbased algorithm. Section 3 briefly introduces the combining classifier based on SVM and DS evidence theory. The measurement system and experimental verification are presented in Section 4. The conclusion comes in Section 5.
2. Feature Extraction
To reduce redundant information and improve the accuracy, features are extracted from the original sequence. In this section, two feature extraction algorithms are proposed as follows: the fractional correlationbased algorithm is introduced in Section 2.1, which extracts features by calculating the fractional correlation coefficient between transformed sequences; the Bspline curve fittingbased algorithm is proposed in Section 2.2, which measures the uniformity of different components.
2.1. Fractional CorrelationBased Algorithm
The time–frequency distributions of signals from the same device are similar, while those from different devices are significantly different. Therefore, the peak intensity of the fractional correlation coefficient between transformed sequences is used as the identification feature herein. The fractional correlation coefficient between sequences ${y}^{i}\left(x\right)$ and ${y}^{j}\left(x\right)$ is defined as:
where * denotes the conjugate; ${Y}_{p+1}^{i}\left(u\right)$ and ${Y}_{p+1}^{j}\left(u\right)$ are $p+1$ order FrFT of ${y}^{i}\left(x\right)$ and ${y}^{j}\left(x\right)$, respectively; $\tilde{{K}_{p}}\left(u,x\right)$ is the kernel function, defined as $\tilde{{K}_{p}}\left(u,x\right)={A}_{\alpha}exp\left[j\pi \left({u}^{2}cot\alpha 2uxcsc\alpha +{x}^{2}cot\alpha \right)\right]$ with $p\ne 2a$ (a is an integer) and ${A}_{\alpha}=\sqrt{1jcot\alpha}$, $\alpha \ne p\pi /\phantom{p\pi 2}\phantom{\rule{0.0pt}{0ex}}2$. FrFT is essentially a traditional timefrequency transform. Compared to the common timefrequency distribution, in the fractional Fourier domain, there is no crossterm problem. Besides, FrFT shows all the characteristics of a signal in the time–frequency plane when the order p increases from 0–4 [17,18]. To explore the features of the spectral data in the timefrequency plane, only the idea of FrFT is used in this paper, and different from the traditional FrFT, spectral data are treated as a time domain signal with the same phase. To save computational cost, the multipath parallel fast algorithm of FrFT is utilized, and the flow chart is shown in Figure 1. For discrete signals, its idea is to split the samples into the even ones and the odd ones and do all the operations on these two parts separately [19]. In our deployments, $p=0.1$ is an optimal choice by 10fold crossvalidation [20], and fractional correlation peak intensity is extracted as the feature. For each sequence, the dimension of the feature vector depends on the number of training sequences. The extracting procedures for the training and the test vectors are summarized in Algorithm 1.
$$COR{R}_{p}^{i,j}\left(\rho \right)={F}^{\pi /2}\left[{Y}_{p+1}^{i}\left(u\right){\left({Y}_{p+1}^{j}\left(u\right)\right)}^{*}\right]\left(\rho \right)$$
$${Y}_{p}\left(u\right)={\sum}_{x}\tilde{{K}_{p}}\left(u,x\right)y\left(x\right),$$
Algorithm 1 The extracting procedures for the training and the test vectors of the fractional correlationbased algorithm. 
Training vectors:

2.2. BSpline Curve FittingBased Algorithm
Electromagnetic emissions can be characterized by four basic waveforms: the square wave, sine wave, damped oscillation, and spike wave. The sine wave and square wave appear as a singlefrequency point or clustered singlefrequency points, referred to as narrowband components in the spectrum; while the damped oscillation and spike wave occupy a certain bandwidth and are defined as wideband components [21]. Roughly speaking, the emission spectrum can be decomposed into narrowband components and broadband components based on their physical properties. Since the narrowband components are sensitive to the environment, the broadband components are mainly considered in our study. The uniformity of the narrowband components and broadband components is exploited for identification.
In the proposed method, the broadband component is firstly fitted using the Bspline curve, and then, the sequences are disaggregated. The Bspline fitting function can be expressed as [22]:
where ${N}_{i,k}\left(x\right)$ denotes the basis function, which is obtained recursively and shown in Equation (4), and ${P}_{i}$ is the position vector of polygon control points. Note that the choice of control points directly determines the trend of fitting curves. Here, some extreme points are selected and combined as double control points. Then, the fitting curve $t\left(x\right)$ of the broadband component is obtained, including 1000 sampling points, as shown in Figure 2a, taking a fluorescent table lamp as an example. After removing the broadband components from the original signal, we only keep the remaining signal, denoted as $h\left(x\right)$ in Figure 2b, which mainly consists of the obvious narrowband components. Considering the narrowband components that are an order of magnitude larger than the amplitude of the wideband components, the datum line was set to 10 dB. Both $t\left(x\right)$ and $h\left(x\right)$ were used to construct the other group of features for appliance identification. Spectral similarity was estimated using the mean and standard deviation of the two overall components and the segmented broadband sequence. The segmentation interval was set to 10 through crossvalidation, which can guarantee the integrity of the information and the burden of the classifier. The extracting procedures for the training and the test vectors are summarized in Algorithm 2.
$$B\left(x\right)={\sum}_{i=1}^{n}{P}_{i}{N}_{i,k}\left(x\right),0<x<1,$$
$$\left\{\begin{array}{c}{N}_{i,0}\left(x\right)=\left\{\begin{array}{c}\begin{array}{cc}1,& {x}_{i}\le x\le {x}_{i+1}\end{array}\phantom{\rule{6.375pt}{0ex}}\\ \begin{array}{cc}0,& x\notin \left[{x}_{i},{x}_{i+1}\right)\end{array}\end{array}\right.\phantom{\rule{10.625pt}{0ex}}\hfill \\ \begin{array}{cc}{N}_{i,k}\left(x\right)=\frac{x{x}_{i}}{{x}_{i+k}{x}_{i}}{N}_{i,k1}\left(x\right)+\frac{{x}_{i+k+1}x}{{x}_{i+k+1}{x}_{i+1}}{N}_{i+1,k1}\left(x\right),& k>0\end{array}\phantom{\rule{10.625pt}{0ex}}\hfill \\ 0/0=0\hfill \end{array}.\right.$$
Algorithm 2 The extracting procedures for the training and the test vectors of the Bspline curve fittingbased algorithm. 
Training vectors:

3. Combining Classifier
The features extracted by different algorithms compose different feature spaces [23]; some samples are in an overlapping region of one feature space, while they may be absent from the overlapping region of another feature space. Therefore, highdimensional feature spaces, superimposed directly on lowdimensional feature spaces, may deteriorate the aggregation of samples belonging to the same class. To solve this problem, a combining classifier was introduced, and the combination of several complementary classifiers improved the performance of individual classifiers.
In this paper, considering the high cost of sample collection, the SVM was employed as the individual classifiers. Its core idea is that of using a kernel function to map a nonlinearlyseparable space to a linear separable highdimensional space and then searching for the optimal separating hyperplane [24]. For linearlyseparable data, the decision function of SVM is defined as:
where $\Phi \left(x\right)$ is the kernel function and w and b determine the hyperplane of the mapped space. For linear inseparable data, the decision function of SVM is defined as:
where $K\left({x}_{i},{x}_{j}\right)$ is the kernel function, l is the number of training data, ${\alpha}_{i}$ is the ${i}^{\mathrm{th}}$ mode embedded dimension, and ${y}_{i}$ is the ${i}^{\mathrm{th}}$ label. The classification problem is converted into a constrained optimization function, given by:
where C is a penalty factor. By solving the optimization problem in Equation (7), $\alpha $ and b are sequentially obtained. The classifier model is established.
$$f\left(x\right)=sign\left(w\xb7\Phi \left(x\right)+b\right),$$
$$f\left(x\right)=sign\left[\left(\sum _{i=1}^{l}{\alpha}_{i}{y}_{i}K\left({x}_{i},{x}_{j}\right)\right)+b\right],$$
$$\begin{array}{c}\underset{\alpha}{min}\phantom{\rule{1.em}{0ex}}\frac{1}{2}\sum _{i=1}^{l}\sum _{j=1}^{l}{y}_{i}{y}_{j}{\alpha}_{i}{\alpha}_{j}K\left({x}_{i},{x}_{j}\right)\sum _{j=1}^{l}{\alpha}_{j}\\ s.t.\phantom{\rule{1.em}{0ex}}\sum _{i=1}^{l}{y}_{i}{\alpha}_{i}=0,\phantom{\rule{1.em}{0ex}}\phantom{\rule{1.em}{0ex}}\phantom{\rule{1.em}{0ex}}\phantom{\rule{1.em}{0ex}}\phantom{\rule{1.em}{0ex}}\phantom{\rule{1.em}{0ex}}\phantom{\rule{1.em}{0ex}}\phantom{\rule{1.em}{0ex}}\phantom{\rule{1.em}{0ex}}\\ 0\le {\alpha}_{i}\le C,\phantom{\rule{0.277778em}{0ex}}i=1,2,\cdots ,l\phantom{\rule{1.em}{0ex}}\end{array},$$
Furthermore, the combination of individual classifiers was based on DS evidence theory [25] in this study. By only using the recognition, substitution, and rejection rates of each individual classifier as the prior knowledge, the performance indexes of a classifier were well represented by testing the classifier with a test sample set. Firstly, the basic probability assignment (BPA) function of each classifier was determined. Suppose that the classification result of classifier ${C}_{k}$ is ${j}_{k}$ for an input feature vector X, ${j}_{k}=1,2,\cdots ,M,M+1$; ${j}_{k}=M+1$ means the classifier ${C}_{k}$ does not recognize the input X. M is the total number of identified classes. The error of classifier ${C}_{k}$ is described by its confusion matrix as follows:
where ${n}_{ij}^{k}$ denotes the number that the sample of class i has been assigned a label j by classifier ${C}_{k}$. The uncertainty of event ${C}_{k}\left(X\right)=j$ can be given by conditional probabilities expressed as:
$$P{T}_{k}=\left[\begin{array}{cccc}{n}_{11}^{k}& {n}_{12}^{k}& \cdots & {n}_{1(M+1)}^{k}\\ {n}_{21}^{k}& {n}_{22}^{k}& \cdots & {n}_{2(M+1)}^{k}\\ \vdots & \vdots & \vdots & \vdots \\ {n}_{M1}^{k}& {n}_{M2}^{k}& \cdots & {n}_{M(M+1)}^{k}\end{array}\right],$$
$${n}_{ij}^{k}=sum({C}_{k}\left(X\right)=jX\in i),$$
$$\begin{array}{c}P(X\in i{C}_{k}\left(X\right)=i)=\frac{{n}_{ii}^{k}}{\sum _{j=1}^{M}{n}_{ji}^{k}}\\ P(X\notin i{C}_{k}\left(X\right)=i)=1\frac{{n}_{ii}^{k}}{\sum _{j=1}^{M}{n}_{ji}^{k}}\\ P(X\in {C}_{M+1}{C}_{k}\left(X\right)=i)=\frac{{n}_{i(M+1)}^{k}}{\sum _{i=1}^{M+1}\sum _{j=1}^{M}{n}_{ji}^{k}}\end{array}.$$
The BPA function of classifier ${C}_{k}$ is ${m}_{k}^{0}=1$, if $j=M+1$. While $j=1,2,\cdots ,M$, the focal elements are ${C}_{i}$, $\overline{{C}_{i}}$, $\Theta $, and the corresponding BPA functions can be calculated by the following formula:
$$\begin{array}{c}{m}_{k}^{0}\left({C}_{i}\right)=P\left(X\in i\left{C}_{k}\left(X\right)=i\right.\right)\phantom{\rule{6.375pt}{0ex}}\\ {m}_{k}^{0}\left(\overline{{C}_{i}}\right)=1{m}_{k}^{0}\left({C}_{i}\right)\phantom{\rule{6.375pt}{0ex}}\\ {m}_{k}^{0}\left(\Theta \right)=P\left(X\in {C}_{M+1}\left{C}_{k}\left(X\right)=i\right.\right)\end{array}.$$
${m}_{k}^{0}\left({C}_{i}\right)$, ${m}_{k}^{0}\left(\overline{{C}_{i}}\right)$, and ${m}_{k}^{0}(\Theta )$ can be normalized to obtain ${m}_{k}^{1}\left({C}_{i}\right)$, ${m}_{k}^{1}\left(\overline{{C}_{i}}\right)$, and ${m}_{k}^{1}(\Theta )$. Then, considering the recognition performance of the classifier for different samples of the same class, the BPA functions are modified by the defined posterior probability function, ${p}_{i}^{k}\left(X\right)=\frac{1}{1+exp\left(\u2225{g}^{k}\left(X\right){E}_{i}^{k}\left(X\right)\u22251\right)}$, where ${g}^{k}\left(X\right)$ is the output vector of the classifier ${C}_{k}$ for the input X and ${E}_{i}^{k}\left(X\right)$ is the average output vector of the classifier ${C}_{k}$ for samples of class i. The modified BPA functions are:
$$\begin{array}{c}{m}_{k}\left({C}_{i}\right)={m}_{k}^{l}\left({C}_{i}\right){p}_{i}^{k}\left(X\right)\phantom{\rule{6.375pt}{0ex}}\\ {m}_{k}\left(\overline{{C}_{i}}\right)={m}_{k}^{l}\left(\overline{{C}_{i}}\right)\left[{m}_{k}^{l}\left({C}_{i}\right){m}_{k}\left({C}_{i}\right)\right]\phantom{\rule{6.375pt}{0ex}}\\ {m}_{k}\left(\Theta \right)=1{m}_{k}\left({C}_{i}\right){m}_{k}\left(\overline{{C}_{i}}\right)\end{array}.$$
The different identification results of the individual classifiers can be combined by DS evidence theory using Equation (13), and the belief value $bel\left(j\right)$ expresses the final beliefs with uncertainty on each M mutuallyexclusive propositions $X\in j$; the higher the $bel\left(j\right)$, the more likely $X\in j$ is to be true. The identification result is determined by a threshold $bel\left(j\right)>\alpha $, $\alpha =0.5$ in this deployment.
$$\begin{array}{c}m\left({C}_{i}\right)={m}_{i}\left({C}_{i}\right){\displaystyle \prod _{j=1,j\ne i}^{n}}\left[{m}_{j}\left(\overline{{C}_{j}}\right)+{m}_{j}\left(\Theta \right)\right]+{m}_{i}\left(\Theta \right){\displaystyle \prod _{j=1}^{n}}{m}_{j}\left(\overline{{C}_{j}}\right)\phantom{\rule{6.375pt}{0ex}}\\ m\left(\Theta \right)=0\phantom{\rule{6.375pt}{0ex}}\\ m\left(\Theta \right)=1\sum _{i=1}^{n}m\left({C}_{i}\right)\end{array}.$$
4. Experiment and Verification
In this section, we evaluate the identification performance of the proposed method through various numerical experiments. The measurement system is described in Section 4.1. The distinguishability and time stability of the proposed features are discussed in Section 4.2 and Section 4.3, respectively. The identification performance by the combining classifier is verified in Section 4.4.
4.1. Measurement System
To validate the proposed method, some experiments were carried out in a shielded chamber, and the power of spurious emission was measured, which was different from the measurement of active power in [4]. The measurement system, mimicking the usage of household appliances, is shown in Figure 3, where a line impedance stabilization network (LISN, Schwarzbeck NNBL 8225) accessed the power cord, external noise was filtered out, and a spectral analyzer (CETC41 AV4037MB) was used to provide the realtime spurious emission spectrum on the neutral or live line while the appliances were operating. Since the difference between a neutral and live line signal is not critical for identification, the access point of the spectral analyzer could be selected at either. Spectrum data were recorded with 1200 sample points per 15 s, to reduce data storage effectively [8], in which an WCS2210 Hall sensor and an UNO R3 Arduino were used, as well as the current with a measurement interval of 1/170 s obtained. However, the phase information was missing during the spectrum measurement, leading to an increase in the number of training sample, which was discussed in Section 4.2.
The actual access locations in the power cord of some household appliances were variable, that is the distance between Access Point A, marked as A1, A2, and A3 for Appliance1, Appliance2, and Appliance3 as shown in Figure 3, and Monitoring Point B, the location of LISN, was not fixed in the measurement system. Therefore, lowfrequency spectra were recorded to ensure that the measurement results were not affected by the access location of the appliance. In addition, the frequency of the tested grid was 50 Hz; hence, the measuring spectral range was set from 30 kHz–150 kHz with a resolution of 200 Hz to obtain the spurious emissions from the power cord. To gather as much useful information as possible, both the maximum and minimum were recorded.
4.2. Different Appliances’ Measurement
In this subsection, the proposed method was verified using household appliances including a laptop, fluorescent table lamp, hair dryer, far infrared heater, and ultrasonic humidifier. The brand and model are shown in Table 1. In the following, only 50 groups of measured data, $N=50$, 10 for each appliance, were used to demonstrate the results of our proposed methods for clear visualization. However, in our training procedure (discussed in Section 4.4), 50 groups of measured data for each appliance were used to train the combined classifier.
In the fractional correlationbased algorithm, 50 measured sequences were used as training sequences, thereby the dimension of feature vector $M{C}_{0.1}^{i}=\left(M{C}_{0.1}^{i,1},M{C}_{0.1}^{i,2},\cdots ,M{C}_{0.1}^{i,N}\right)$ was 50 for each sequence, as shown in Figure 4. It can be seen that the correlation coefficients of 10 sequences were almost identical for the same appliance, while distinguishable from those of other appliances. In the Bspline curve fittingbased algorithm, the fitted broadband component of each appliance is shown in Figure 5, which obviously showed differences in the peaks, the amplitude, and other features, which led to obvious differences in the statistical parameters used.
Measurements covering different operating states were taken. Since an additional circuit was developed during the “on” state for the laptop, some narrowband components were introduced, which could be removed using curve fitting, because in our study, broadband components were mainly considered. In addition, the oscillation frequency of the broadband component for the ultrasonic humidifier varied with the states due to different circuit parameters, which could be used for the identification of operating states.
The results in Figure 4 and Figure 5 show that most devices in isolation were identifiable by the extracted features. However, spurious emissions from the power cord involved, in practicality, multiple appliances in parallel and not a simple amplitude addition due to the phase [26]. Therefore, further measurements were required for appliance combinations.
4.3. LongTime Stability Measurement
The power grid was not stable due to the access of highpower appliances and the power quality, which indicated that the measured spectra for each appliance may be different over time. Although the appliances were in parallel, the current on the power cord would increase when a highpower appliance was operating, which would increase the voltage ${V}_{L}$ across the loss resistance ${R}_{L}$ of the power cord and decrease the voltage ${V}_{A}$ across the appliances, as shown in Figure 6. Therefore, the stability of the features extracted for identifying various loads was required. In our study, the table lamp was taken as an example to validate the longtime stability of the extracted features.
The spurious emissions of the table lamp were measured over five consecutive days, and the features were extracted using the proposed methods, as shown in Figure 7. Figure 7a demonstrates that the difference in correlation coefficients was limited to 150, and the coefficients over three days almost overlapped. In addition, Figure 7b indicates that the broadband components of the spectra were consistent over time, although there was some variation around 130 kHz. However, this small variation was acceptable for a classifier.
4.4. Identification Accuracy
To evaluate the identification accuracy of the features extracted from the proposed method, the probability of correct identification was used as a performance measure. Three types of appliances were considered: the fluorescent table lamp, hair dryer, and ultrasonic humidifier. Fifty measured data groups for each appliance were used, and the total number of training sequences was $N=50*{2}^{K}$, where K is the number of appliances, $K=3$ and $N=400$ in this subsection.
Different feature spaces were constructed by the fractional correlationbased algorithm and Bspline curve fittingbased algorithm. Two SVM classifiers were trained by those two feature spaces, respectively, and the results from the two classifiers were combined for high identification accuracy. Here, two SVM classifiers were first implemented where the Gaussian radial basis function (RBF) was selected as the kernel function [27], then the combined results using DS evidence theory were calculated. The accuracy was 85.5% for 800 test sequences, which were collected in different time periods over one day and covered all appliance combinations, showing better performance than individual classifiers.
Note that the household meter included a main switch, toilet switch, and kitchen switch, amongst others, which controlled the power cord of the apartment and the power cords of different rooms covering specific appliances. In our study, the power cord of the room was mainly considered, that is the number of measured appliances K and the total number of training sequences N were acceptable and would not cause an index explosion.
5. Conclusions
In this paper, a method to extract complementary spectrum features from spurious emissions was proposed for detecting the operating states of appliances, using the fractional correlationbased algorithm and Bspline curve fittingbased algorithm, which had the capacity to support NILM and home automation networks. Firstly, the spurious emissions in the spectrum coupled to the power cord were measured based on the existence of nonlinear components inside appliances. Then, two groups of complementary features were extracted using the similarity between transformed sequences in the time–frequency plane and the statistical features of different components on the physical layer. Finally, the feasibility and longtime stability of the proposed method were validated using several measurements, and the identification accuracy reached 85.5% using the combining classifier based on SVM and DS evidence theory. It was found that the proposed feature extraction methods could be used to identify the appliances and showed good performance.
In future studies, further measurements should be performed for more household appliances in an actual home and the extracted features incorporated with other features to improve the identification accuracy. In addition, the identification results of appliance operating states would be applied to recover the energy consumption per appliance.
Author Contributions
Conceptualization, D.S.; methodology and software, Q.S.; validation, Q.S. and W.W.; formal analysis, Q.S.; investigation, Q.S. and W.W.; data curation, Q.S.; writing, original draft preparation, Q.S.; writing, review and editing, W.W. and H.X.; supervision, D.S.; project administration, H.X.; funding acquisition, D.S.
Funding
This research was funded by the National Natural Science Foundation of China Grant Number 61427803.
Conflicts of Interest
The authors declare no conflict of interest.
References
 Zeifman, M.; Roth, K. Nonintrusive appliance load monitoring: Review and outlook. IEEE Trans. Consum. Electron. 2011, 57, 76–84. [Google Scholar] [CrossRef]
 Salerno, V.; Rabbeni, G. An extreme learning machine approach to effective energy disaggregation. Electronics 2018, 7, 235. [Google Scholar] [CrossRef]
 Suzuki, K.; Inagaki, S.; Suzuki, T.; Nakamura, H.; Ito, K. Nonintrusive appliance load monitoring based on integer programming. In Proceedings of the 2008 SICE Annual Conference, Tokyo, Japan, 20–22 August 2008; pp. 2742–2747. [Google Scholar]
 Biansoongnern, S.; Plangklang, B. Nonintrusive load monitoring (NILM) using an Artificial Neural Network in embedded system with low sampling rate. In Proceedings of the 2016 13th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTICON), Chiang Mai, Thailand, 28 June–1 July 2016; pp. 1–4. [Google Scholar]
 Zhong, M.; Goddard, N.; Sutton, C. Signal aggregate constraints in additive factorial HMMs, with application to energy disaggregation. In Proceedings of the Advances in Neural Information Processing Systems, Montréal, QC, Canada, 8–13 December 2014; pp. 3590–3598. [Google Scholar]
 Hart, G.W. Nonintrusive appliance load monitoring. Proc. IEEE 1992, 80, 1870–1891. [Google Scholar] [CrossRef]
 Laughman, C.; Lee, K.; Cox, R.; Shaw, S.; Leeb, S.; Norford, L.; Armstrong, P. Power signature analysis. IEEE Power Energy Mag. 2003, 1, 56–63. [Google Scholar] [CrossRef]
 Lee, D. Phase noise as power characteristic of individual appliance for nonintrusive load monitoring. Electron. Lett. 2018, 54, 993–995. [Google Scholar] [CrossRef]
 Ducange, P.; Marcelloni, F.; Antonelli, M. A novel approach based on finitestate machines with fuzzy transitions for nonintrusive home appliance monitoring. IEEE Trans. Ind. Inform. 2014, 10, 1185–1197. [Google Scholar] [CrossRef]
 Lin, Y.H.; Tsai, M.S. Development of an improved time–frequency analysisbased nonintrusive load monitor for load demand identification. IEEE Trans. Instrum. Meas. 2013, 63, 1470–1483. [Google Scholar] [CrossRef]
 Chen, K.Y.; Gupta, S.; Larson, E.C.; Patel, S. DOSE: Detecting userdriven operating states of electronic devices from a single sensing point. In Proceedings of the 2015 IEEE International Conference on Pervasive Computing and Communications (PerCom), St. Louis, MO, USA, 23–27 March 2015; pp. 46–54. [Google Scholar]
 Zalevsky, Z.; Mendlovic, D.; Caulfield, J.H. Localized, partially spaceinvariant filtering. Appl. Opt. 1997, 36, 1086–1092. [Google Scholar] [CrossRef] [PubMed]
 Su, D.; Xie, S.; Chen, A.; Shang, X.; Zhu, K.; Xu, H. Basic Emission Waveform Theory: A Novel Interpretation and Source Identification Method for Electromagnetic Emission of Complex Systems. IEEE Trans. Electromagn. Compat. 2017, 60, 1330–1339. [Google Scholar] [CrossRef]
 Liu, Q.; Kamoto, K.M.; Liu, X.; Sun, M.; Linge, N. LowComplexity NonIntrusive Load Monitoring Using Unsupervised Learning and Generalized Appliance Models. IEEE Trans. Consum. Electron. 2019, 65, 28–37. [Google Scholar] [CrossRef]
 Zhang, C.; Zhong, M.; Wang, Z.; Goddard, N.; Sutton, C. Sequencetopoint learning with neural networks for nonintrusive load monitoring. In Proceedings of the ThirtySecond AAAI Conference on Artificial Intelligence, New Orleans, LA, USA, 2–7 February 2018. [Google Scholar]
 Bonfigli, R.; Felicetti, A.; Principi, E.; Fagiani, M.; Squartini, S.; Piazza, F. Denoising autoencoders for nonintrusive load monitoring: improvements and comparative evaluation. Energy Build. 2018, 158, 1461–1474. [Google Scholar] [CrossRef]
 Ozaktas, H.M.; Arikan, O.; Kutay, M.A.; Bozdagt, G. Digital computation of the fractional Fourier transform. IEEE Trans. Signal Process. 1996, 44, 2141–2150. [Google Scholar] [CrossRef]
 Almeida, L.B. The fractional Fourier transform and timefrequency representations. IEEE Trans. Signal Process. 1994, 42, 3084–3091. [Google Scholar] [CrossRef]
 Bultheel, A. A twophase implementation of the fractional Fourier transform; TW Reports; Department of Computer Science, KU Leuven: Leuven, Belgium, 2011. [Google Scholar]
 Yadav, S.; Shukla, S. Analysis of kfold crossvalidation over holdout validation on colossal datasets for quality classification. In Proceedings of the 2016 IEEE 6th International Conference on Advanced Computing (IACC), Bhimavaram, India, 27–28 Feburary 2016; pp. 78–83. [Google Scholar]
 Shang, X.; Su, D. Use modified LombScargle method to analyze electromagnetic emission spectrum. In Proceedings of the 2015 IEEE 6th International Symposium on Microwave, Antenna, Propagation, and EMC Technologies (MAPE), Shanghai, China, 28–30 October 2015; pp. 415–420. [Google Scholar]
 Su, D.; Wu, J.; Shang, X.; Zhu, K.; Peng, Z. A novel method of detecting and analyzing electromagnetic emission. In Proceedings of the 2017 IEEE Asia Pacific Microwave Conference (APMC), Kuala Lumpar, Malaysia, 13–16 November 2017; pp. 1156–1158. [Google Scholar]
 Ho, T.K.; Hull, J.J.; Srihari, S.N. Decision combination in multiple classifier systems. IEEE Trans. Pattern Anal. Mach. Intell. 1994, 16, 66–75. [Google Scholar]
 Lopes, F.F.; Ferreira, J.C.; Fernandes, M.A. Parallel Implementation on FPGA of Support Vector Machines Using Stochastic Gradient Descent. Electronics 2019, 8, 631. [Google Scholar] [CrossRef]
 Xu, L.; Krzyzak, A.; Suen, C.Y. Methods of combining multiple classifiers and their applications to handwriting recognition. IEEE Trans. Syst. Man Cybern. 1992, 22, 418–435. [Google Scholar] [CrossRef]
 Lane, R.; Badran, T.; Marshall, G. Measurements for remote identification of electrical equipment. Electron. Lett. 2017, 53, 1001–1005. [Google Scholar] [CrossRef]
 Chang, C.C.; Lin, C.J. LIBSVM: A library for support vector machines. ACM Trans. Intell. Syst. Technol. 2011, 2, 27. [Google Scholar] [CrossRef]
Figure 2.
Bspline curve fittingbased algorithm for the spectrum of a fluorescent table lamp: (a) original data $y\left(x\right)$ and fitting curve $t\left(x\right)$; (b) residual sequence after removing the broadband component and obvious narrowband sequence $h\left(x\right)$.
Figure 7.
Correlation coefficients and fitted broadband components for the fluorescent table lamp over five consecutive days: (a) Fractional correlation coefficient; (b) fitted broadband component by Bspline curve fitting.
Item  Brand and Model 

Laptop charger, connected to a laptop  Lenovo, ADLX65YCC3A, 65 W(Lenovo, ThinkPad E480) 
Fluorescent table lamp  Guandi, GD3021, 11 W 
Hair dryer  Panasonic, EHWNA3B, 1800 W 
Far infrared heater  Qianyou, NX, 1000 W 
Ultrasonic humidifier  DEERMA, DEMF470, 25 W 
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).