A Fusion Load Disaggregation Method Based on Clustering Algorithm and Support Vector Regression Optimization for Low Sampling Data

Yuan, Quanbo; Wang, Huijuan; Wu, Botao; Song, Yaodong; Wang, Hejia

doi:10.3390/fi11020051

Open AccessArticle

A Fusion Load Disaggregation Method Based on Clustering Algorithm and Support Vector Regression Optimization for Low Sampling Data

by

Quanbo Yuan

^1,2,*,

Huijuan Wang

^1,3,

Botao Wu

⁴,

Yaodong Song

¹ and

Hejia Wang

⁵

¹

School of Computer and Remote Sensing Information Technology, North China Institute of Aerospace Engineering, Langfang 065000, China

²

The School of Computer Software, Tianjin University, Tianjin 300354, China

³

State Key Laboratory of Reliability and Intelligence of Electrical Equipment, Hebei University of Technology, Tianjin 300130, China

⁴

Department of Automotive Engineering, Hebei Institute of Machinery and Electricity, Xintai 054000, China

⁵

The School of Computer Software, Huazhong University of Science and Technology, Wuhan 430074, China

^*

Author to whom correspondence should be addressed.

Future Internet 2019, 11(2), 51; https://doi.org/10.3390/fi11020051

Submission received: 11 December 2018 / Revised: 17 January 2019 / Accepted: 21 January 2019 / Published: 19 February 2019

Download

Browse Figures

Versions Notes

Abstract

:

In order to achieve more efficient energy consumption, it is crucial that accurate detailed information is given on how power is consumed. Electricity details benefit both market utilities and also power consumers. Non-intrusive load monitoring (NILM), a novel and economic technology, obtains single-appliance power consumption through a single total power meter. This paper, focusing on load disaggregation with low hardware costs, proposed a load disaggregation method for low sampling data from smart meters based on a clustering algorithm and support vector regression optimization. This approach combines the k-median algorithm and dynamic time warping to identify the operating appliance and retrieves single energy consumption from an aggregate smart meter signal via optimized support vector regression (OSVR). Experiments showed that the technique can recognize multiple devices switching on at the same time using low-frequency data and achieve a high load disaggregation performance. The proposed method employs low sampling data acquired by smart meters without installing extra measurement equipment, which lowers hardware cost and is suitable for applications in smart grid environments.

Keywords:

non-intrusive load monitoring; power consumption pattern; load disaggregation; low sampling data; optimized support vector regression

1. Introduction

With energy consumption growing year after year, carbon emissions have become an important issue for many countries. In order to considerably reduce their annual carbon emissions by 2050, energy conservation has been a national important aim. For this reason, current electricity infrastructure is transitioning to smart grid, in which information flowing between energy generation and consumption points is obtained from dedicated devices called smart meters. As popular electricity measuring devices, smart meters will be collecting vast amounts of electricity consumption data. However, the data from smart meters are mainly household total electricity use, and the energy consumption of a single appliance will remain unavailable. For market utilities, detailed electricity information can improve the power grid planning scheme and ensure the safe and economic operation of power grids by load forecasting. In addition, elaborate feedback of appliances’ consumption creates more savings in electricity [1,2] for consumers. Non-intrusive load monitoring (NILM), which disaggregates a household’s aggregate electricity consumption to individual appliance [3], has been greatly developed.

The NILM approach research focuses on two issues. The first is the feature sets adopted to identify appliances. A variety of features have been extracted and applied, such as the apparent power [4,5,6,7,8], current and voltage [9], active and reactive power [10,11], the power phase angle and power factor. Current features include transient and steady-state features. Steady-state features are observed when a state transition of appliances has been completely over, and transient features are extracted when a state transition is in progress. As higher sampling rate data of kHz are required, transient features need more complex and costly hardware equipment [12]. Except for the time domain features above, Fourier analysis and combination with other features of the time domain are used to identify appliances [13,14,15]. Discrete wavelet transforms of power were employed in References [16,17] and Reference [11].

The second issue for NILM is learning algorithms applied to NILM. In literature, k-nearest neighbor (k-NN) [18,19] and neural networks [14,20] are widely applied to aggregate total consumption. Decision trees were used in References [21] and [16]. Support vector machines have mainly been used as classifiers [22,23,24]. The naïve Bayes classifier is occupied to identify the different combination value of various appliances [25]. In recent years, deep learning based approaches have worked to load disaggregation [26,27]. In addition, a similarity measure between two sequences plays an important role in classification [28]. Dynamic time warping (DTW) has been employed to measure similarity, which can overcome time distortion [29].

To that end, although the existing NILM methods have achieved a good effect, there are still many problems discussed in recent works. The first, NILM technology using transient or high-frequency steady-state features, requires additional hardware equipment installed [30], which leads to high hardware cost. Smart meters are able to read 10-second resolution data. Thus, using electricity consumption data in 10-second resolution or even more is a benefit for the industrialization of NILM technology.

Secondly, event-based methods in literature are not suitable for features from low sampling rate data [5,8,9], because overlapping events inevitably happen, and the events of low power consumption appliances are difficult to detect due to the power variety of high-power consumption appliances. Factorial hidden Markov models (FHMM) have been used to solve the problem [31]. However, FHMM needs prior distribution and has high computational complexity of the algorithm [32] for multi-state appliance.

The third problem to NILM methods is the poor power estimation accuracy caused by the power variety of appliances when operating. Generally, average powers are considered as the estimation value of power [9]. Due to the complex components of electrical appliances, their power consumption shows pattern characteristics. In literature [32], DTW is used to estimate appliance consumption trajectories.

In this work, we considered the above three problems and propose a fusion load disaggregation method for low sampling data from smart meters. The combination of the K-median algorithm and DTW were used to identify the operation states of appliances and then estimate the power using optimized support vector regression (OSVR) according to the consumption pattern feature of the appliance. In literature [33], SVR estimates the power proportion of constant power loads to constant impedance loads. As an important branch of support vector machines (SVM), SVR was applied to NILM a few times. However, it has seldom been used to estimate power consumption. The proposed method is suitable for low sample data scenes and requires low computational complexity (only active power being used).

We developed a series of experiments to test the performance of our proposed method. We provide performance comparisons against the literature using the FHMM method and the approach using DTW, having analyzed the sensitivity of the proposed algorithm with the number of sample feature, proving that OSVR is better than SVR for power estimation. The remainder part of this paper is organized as follows. In Section 2, we describe our proposed method. We discuss our experiment setting and evaluation metrics in Section 3 and report the experimental result in Section 4. Finally, a summary conclusion is offered in Section 5.

2. Proposed Method

Home electrical appliances consist of complex components. After studying the power consumption characteristics of some typical devices, we found that every kind of everyday appliances shows a similar and unique power trend over time when operating. Even multiple devices with similar power have a special consumption pattern. Based on the above fact, we made the assumption that each appliance has its own consumption pattern and can be identified through the pattern.

Our approach includes three steps. At first, we extract power consumption pattern signatures of each appliance contributing to the total power, which will be used to estimate the power value of each appliance. Then, the aggregation powers are clustered, and the cluster with high power level is identified and retrieved. Finally, the result from last step is adjusted. The latter two steps are iteratively executed until every appliance is handled. The flowchart of the proposed method is shown in Figure 1. We detail the method in the following sections.

2.1. Power Consumption Pattern Signatures Extraction

When a kind of appliance is running, an integral duration includes a number of power states with different power level. In this section, we extract the power pattern signatures of each appliance during operation. The signatures are to be used to label the appliance after total power is clustered, and to retrieve the power via SVR training. Considering the need to minimize the intrusiveness and the impact of human activity on the power pattern, a week was taken into account. During the training period, several integral operation durations were extracted of each appliance contributing to the total power. After removing the trace noise by ignoring power value less than a threshold, each signature time series was spitted into a number of segments according to power states level (according to the operation modes of the appliance, different modes of a single appliance are recognized as multiple appliances [3]). The time series segments will be applied later.

2.2. Appliance States Identification and Load Disaggregation

2.2.1. Aggregation Power Clustering and Appliance Identification through DTW

As we mentioned in the previous sections, each turned-on appliance contributes to the total power; the larger the contribution, the greater the impact is on the aggregation power. Therefore, clustering the values in total power time series, the largest contribution appliance will be the first division.

Given time window length

p

, we measured the total power readings, and the set of

p

samples is represented as

{x_{1}, x_{2}, \dots, x_{p}}

. Then, the total power readings were divided into

k

clusters via the following simple version of K-median clustering [34]:

\min_{{C_{1}, \dots, C_{k}}} \sum_{i = 1}^{k} \sum_{x \in C_{i}} | x - \tilde{c_{i}} |,

(1)

where

C_{i}

is the set of total power readings in the i-th cluster and

\tilde{c_{i}}

is the median of the values in

C_{i}

.

The value of k is determined by the minimum sum of the interclass distance. When k changes from 2 to 10, the total power values are clustered into k class, and the sum of the interclass distances is calculated, respectively. The k value corresponding to the minimum sum of the interclass distance is obtained. The largest contribution to total power makes the largest total power values, so the cluster with the maximum cluster center is when the largest contribution appliance is running. Finding the cluster

C_{i}

with the maximum cluster center, each

C_{i}

subsequence is taken from the time series, then the DTW distance between these subsequences and the signatures of each appliance are respectively calculated according to literature [35] to give the similarity of the two trajectories. As a larger DTW distance means a lower similarity, the subsequences are respectively labeled according to the DTW distances.

2.2.2. Power Retrieve

We received several total power consumption trajectories when the appliance with the largest power contribution was operating. Then, we used SVR to retrieve the power of the highest contribution appliance in the time window and then adjusted the result.

Support vector machines (SVM) were first introduced in 1992 [36]. As one of two categories of support vector machines, a version of SVM for regression (SVR) was proposed by Vapnik [37]. The parameters

c

and

g

have a large impact on the predicted results of the SVR model. Parameter

c

is the penalty factor. The higher the

c

, the lower the tolerance of error is, and overfitting is easy to happen. The smaller

c

is, the easier underfitting is. This means that the generalization ability is poor if

c

is too big or too small. Parameter

g

comes with an RBF (Radial Basis Function) as the kernel. The larger the

g

, the fewer the support vectors, and the smaller the

g

, the more the support vectors. The number of support vectors affects the speed of training and prediction. Genetic algorithms [38] are used for parameter

c

and

g

selection to optimize the SVR to estimate the power trajectory of the appliance.

In our paper, the power values of signature segments split in Section 2.1 were numbered from 1 min, and each signature segment represented the duration of an appliance. For example, the power values of signature segments of a refrigerator in an operating cycle are as follows:

[148,135,129,127,127,125,124,124,123,122].

After being numbered, the signature segment was turned to:

[(1,148),(2,135),(3,129),(4,127),(5,127),(6,125),(7,124),(8,124),(9,123),(10,122)].

As each appliance has its own consumption pattern, the SVR model can use it to realize power estimation. We adopted time number in the above example to train the model. In order to improve the performance of the SVR model, the genetic algorithm was used to select

c

and

g

. The flowchart of optimized SVR (OSVR) is shown in Figure 2.

The newest total power time series were obtained by subtracting the retrieved power subsequence by SVR from the total power time series. Repeat the process in Section 2.2 until all the appliances are retrieved.

2.3. Result Correction

As the power estimated errors of the first appliance affect the next clustering and recognition, the current result is adjusted based on backtracking thinking. We considered the following situations:

Check the current total power time sequence, and negative values are considered estimation errors. In order to correct the estimation errors, the negative value was set as the average of the two numbers before and after it, and the estimation value was adjusted accordingly.
If the subsequence obtained after clustering was truncated by a few values which had been estimated formerly, these values were considered estimation errors.
The values, identified as multiple simultaneously operating states of the same appliance, were considered estimation errors.

We needed to correct the values to match right cluster for situation 2 and 3. The values were modified into the average value of their neighbor.

3. Experiment Setting

3.1. Data

We chose the AMPds (Almanac of Minutely Power dataset) [39] to test our proposed method. AMPds contains power consumption readings in a single house in Canada over a long of time (2 years) and provides metered data at the end-use level at 1 min resolution. We considered several typical indoor appliances, including the heat pump, clothes dryer, dish washer, and fridge. Since the measurement data of these appliances were given separately in AMPds, they were suitable for validating our method. As our algorithm does not apply to normally-open appliances (always in ON state), we did not take into account the normally-open appliances and undefined reading. The aggregation powers were attained by summing the powers of each appliance at the same time. The average powers for testing appliances were obtained by statistics as shown in Table 1.

Due to the possible seasonality effects on power consumption, we would like to extract the training and testing periods in the same season with a similar climate condition. As mentioned in Section 2.1, we extracted a week’s data from April 1st 2012 to April 7th 2012 for SVR training, and we chose several weeks’ data in different seasons for validation, which were the week from April 8th 2012 to April 14th 2012 (period 1), from June 1st 2012 to June 7th 2012 (period 2), and December 1st 2012 to December 7th 2012 (period 3).

The length of the time window was set to a day (1440 min). When retrieving, we chose the segment with same length and similar average active power as that of the OSVR training data. Since power values of all the appliances in our tests were greater than 30 watt, the noise threshold was set to 30 watt.

3.2. Performance Evaluation Metrics

The performance evaluation of disaggregation algorithms has been one of the main challenges of NILM. It is important to produce a complete evaluation. However, much of the literature focuses on the accuracy of on/off detection [9,25,29], and only few studies considered the retrieving appliances’ consumption information. We adopted a set of metrics introduced in the literature [32]. The set includes the following metrics:

F-score, evaluating the capability of NILM algorithms on operation state detection of appliance

i

, with ranges from 0 (0% accuracy) to 1 (100% accuracy):

F s_{i} = \frac{2 \times P C_{i} \times R C_{i}}{P C_{i} + R C_{i}},

(2)

where

P C_{i}

and

R C_{i}

are the recall and precision for appliance

i

, which are described as follows:

{\begin{matrix} R C_{i} = \frac{T P_{i}}{T P_{i} + F N_{i}} \\ P C_{i} = \frac{T P_{i}}{T P_{i} + F P_{i}} \end{matrix},

(3)

where

T P_{i}

is the number of examples correctly labeled as ON state (true positive),

F P_{i}

means OFF state examples incorrectly labeled as ON state (false positive), and

F N_{i}

represents the ON state ones labeled as OFF (false negative).

The PCE (power contribution error) estimates the power consumption contribution of each appliance according to the following formula:

{PCE}_{i} = \frac{| \sum_{t = 1}^{H} y_{t}^{i} - \sum_{t = 1}^{H} {\hat{y}}_{t}^{i} |}{\sum_{t = 1}^{H} P_{t}},

(4)

where

y^{i}

and

{\hat{y}}^{i}

are the ground truth and estimated power consumption for appliance

i

. The more accurate the algorithm, the closer to 0 the PCE value is. The PCE provides more information feedback to end-user and demand management utilities.

The R2 score evaluates the accuracy of trace retrieve. For each appliance

i

, it is defined as:

R 2_{i} = 1 - \frac{{(\sum_{t = 1}^{H} y_{t}^{i} - \sum_{t = 1}^{H} {\hat{y}}_{t}^{i})}^{2}}{\sum_{t = 1}^{H} {(y_{t}^{i} - \sum_{t = 1}^{H} y_{t}^{i} / H)}^{2}} .

(5)

In the comparison test, the root mean square error (RMSE) is used to measure estimation error. For each appliance

i

, it is defined as:

{RMSE}_{i} = \sqrt{\frac{\sum_{t = 1}^{H} {(y_{t}^{i} - {\hat{y}}_{t}^{i})}^{2}}{H}}

(6)

4. Results and Discussion

We tested the proposed method on one day data and calculated the three metrics to analyze the performance of the method. To further prove the performance of the algorithm, several comparison tests between our algorithm and another NILM method were conducted. Time consumption comparison of power trajectory estimation showed the estimation performance of OSVR, and estimation performance comparison between OSVR and SVR proved the improvement of the OSVR algorithm in power estimation.

4.1. The Load Disaggregation Results

The comparison of load disaggregation results and measured power for one day using our proposed method are shown in Figure 3. We calculated the cumulative estimated power and cumulative measured power of each appliance in one day and got the result in Figure 4. In Figure 3a, the dish washer and heat pump are simultaneously switched on at the 174th minute, and our method shows good performance in recognizing multiple devices when they are simultaneously turned on. We can see from Figure 3, there is a misidentification at the 345th minute due to an abnormal operating cycle of the fridge. Figure 3 and Figure 4 show that the proposed algorithm has achieved disaggregation of four appliances, and the estimated aggregation consumption of each appliance is close to the measured consumption.

To quantify the performance of our algorithm, Table 2 presents the three metrics values of four appliances using the adopted set of metrics. According to the description in Section 3.2, F-score ranges from 0 (0% accuracy in state detection) to 1 (100% accuracy in state detection), where the closer

P C E_{i}

is to 0, the closer the estimated contribution of appliance

i

is to the true contribution, and the more similar the estimated consumption trajectory is to the measured consumption trajectory, the closer R2 is to 1. Thus, since the F-score and the R2 values for four appliances are close to 1, and the PCE values are close to 0, it shows that the proposed algorithm performs well in identifying appliances states and estimating the power consumption assigned to each appliance and the power trace of each appliance.

Then, in order to test the performance of identification accuracy and decomposition accuracy for different periods, F-score and PCE values for different periods were counted separately. The results are listed in Table 3 and Table 4. Table 3 and Table 4 show that the performance for Period 1 is better than that for Period 2 and 3, due to the possible seasonality effects on power consumption.

4.2. Performance Comparison

Hidden Markov models (HMMs) [31] have been tested and shown higher disaggregation accuracy in this context, so HMM-family methods are often adopted as benchmarks for algorithm testing and comparison. In literature [32], FHMM was adopted to identify the appliance states and only power consumption considered as signature. We compared the F-score values of each appliance between our approach and the FHMM in literature [32]. The result is in Table 5. The F-score comparison shows that our approach is able to correctly detect the operating states of each appliance with a higher rate than FHMM. Based on the analysis in literature, the number of possible states for each appliance is limited to 2 in FHMM, and with the increase in the number of appliances and states, the calculation time of the FHMM algorithm increases dramatically.

Yet, average power and the DTW-based method are used for power estimation separately in References [9] and [32]. We implemented these two methods on AMPds. On the other metrics, the comparison of PCE and R2 values with the result of References [9] and [32] is in Table 6 and Table 7. Obviously, the average power based method cannot show the power change during the operation of appliances, so its estimation performance in R2 is worse than that of the DTW-based method and our proposed method. The results showed that our proposed method performs better in providing information on the consumption patterns and contribution of each appliance to the total power except when it comes to the heat pump. Since the consumption pattern of the heat pump is affected by season, there was a slight deviation in the estimated value.

4.3. Time Consumption Comparison of Power Trajectory Estimation

To further test the performance of our proposed method, the computation time of proposed method was compared to the existing literature. In literature [32], the SDTW (Subsequence Dynamic Time Warping) method was applied to estimate the power trajectory of a single appliance. We operated our proposed method and the SDTW-based method in literature [32] for the power consumption estimation of fridge, and the computation times are shown in Table 8.

From Table 8, we found that the OSVR-based method in our paper has an advantage on time consumption compared with the SDTW. As SDTW needs to scan the whole feature sequence and iteratively using DTW to search for best matched subsequence, the computation time is relative to the length of the signature time series. The OSVR-based method used the time feature for training, so only the subsequences with the same length duration were chosen for training, which reduced the computation time of the estimation algorithm.

4.4. Estimation Performance Comparison between OSVR and SVR

In order to quantify the estimation performance of OSVR, with the same training data as those selected in Section 4.3, RMSE of OSVR and SVR (parameters selection based on rule of thumb) were calculated, shown in Table 9.

We can see from Table 9, through the optimization of the parameters of the SVR, the estimation performance has been significantly improved.

4.5. Method Limitations and Ways to Address Them

One of the method’s limitations is that multiple normally-open appliances could not be disaggregated correctly. In the aggregation power clustering step, if multiple normally-open appliances contribute to the total power, the clustering method could not cluster to obtain the total power values when the largest contribution appliance is running. In order to overcome the limitation, after all the ON–OFF appliances are identified, the clustering phase is skipped, and the total power values are treated as one class for identification and estimation.

Another limitation of the method is it is not applicable to appliances whose features are not extracted. When a new load is added without extracting the feature and training the SVR model, it will be misidentified and estimated as the most similar features appliance whose features have been extracted. To notice misidentification of newly added appliances, a similarity threshold

θ

can be added to recognize whether there is a newly added load. If the minimum DTW distance from Section 2.2.1 is greater than

θ

, a newly added load is considered. When a new appliance is added, the features of the appliance need to be extracted to identify it, and the SVR model must be retrained to realize power estimation.

5. Conclusions

This paper proposed a new load disaggregation method for low sampling data from smart meters based on K-medians and OSVR. Our proposed method, using the power consumption pattern feature of appliances, clusters low sampling power consumption data and labels the appliance based on DTW. According to the fact that each appliance shows a similar and unique power trend over time when operating, OSVR is applied to estimate power consumption. We tested the proposed approach on an AMPds dataset and chose three metrics to evaluate the proposed algorithm. Several comparison tests were performed as well. The results demonstrated that the algorithm shows higher identification accuracy than FHMM, and better power estimation performance than the DTW-based method and average power method. The proposed algorithm has the advantage of quick execution, high applicability to multistates appliances compared to FHMM, and low-cost hardware without extra equipment installation.

In addition, for seasonal appliances (such as heat pumps), since the power consumption pattern is influenced by the surrounding temperature, estimated values deviate from true powers. Appliance features can be extracted from different seasons to avoid seasonal impact on performance, and further studies can focus on improving the estimation performance considering the impact of temperature of the surrounding environment.

Author Contributions

H.W. and Q.Y. conceived and designed the experiments; Q.Y. and Y.S. performed the experiments; Q.Y. analyzed the data and wrote the paper; B.W. and H.W. reviewed and edited the paper.

Funding

This research was funded by Hebei University of Technology and Scientific & Technical Research Foundation of Hebei Province, China (Grant NO. 17270333).

Conflicts of Interest

The authors declare no conflict of interest.

References

Armel, K.C.; Gupta, A.; Shrimali, G.; Albert, A. Is disaggregation the holy grail of energy efficiency? The case of electricity. Energy Policy 2013, 52, 213–234. [Google Scholar] [CrossRef] [Green Version]
Chakravarty, P.; Gupta, A. Impact of Energy Disaggregation on Consumer Behavior. Available online: https://www.mendeley.com/catalogue/impact-energy-disaggregation-consumer-behavior/ (accessed on 15 February 2019).
Hart, G.W. Nonintrusive appliance load monitoring. Proc. IEEE 1992, 80, 1870–1891. [Google Scholar] [CrossRef]
Egarter, D.; Bhuvana, V.P.; Elmenreich, W. PALDi: Online load an unsupervised training method for adisaggregation via particle filtering. IEEE Trans. Instrum. Meas. 2015, 64, 467–477. [Google Scholar] [CrossRef]
Egarter, D.; Elmenreich, W. Autonomous load disaggregation approach based on active power measurements. Presented at the 2015 IEEE International Conference on Pervasive Computing and Communication Workshops Workshop (PerCom Workshops), St. Louis, MO, USA, 23–27 March 2015; pp. 293–298. [Google Scholar]
Elafoudi, G.; Stankovic, L.; Stankovic, V. Power disaggregation of domestic smart meter readings using dynamic time warping. Presented at the Communications, Control and Signal Processing (ISCCSP), Athens, Greece, 21–23 May 2014; pp. 36–39. [Google Scholar]
Figueiredo, M.; Ribeiro, B.; de Almeida, A. Electrical signal source separation via nonnegative tensor factorization using on site measure-ments in a smart home. IEEE Trans. Instrum. Meas. 2014, 63, 364–373. [Google Scholar] [CrossRef]
Guo, Z.; Wang, Z.J.; Kashani, A. Home appliance load modelingfrom aggregated smart meter data. IEEE Trans. Power Syst. 2015, 30, 254–262. [Google Scholar] [CrossRef]
Wang, A.L.; Chen, B.X.; Wang, C.G.; Hua, D.D. Non-intrusive load monitoring algorithm based on features of V-I trajectory. Electr. Power Syst. Res. 2018, 157, 134–144. [Google Scholar] [CrossRef]
Wang, H.; Yang, W.; Yang, Q. An Optimal Load Disaggregation Method Based onPower Consumption Pattern for Low Sampling Data. Sustainability 2019, 11, 251. [Google Scholar] [CrossRef]
Tabatabaei, S.M.; Dick, S.; Xu, W. Towards Non-Intrusive Load Monitoring via Multi-Label Classification. IEEE Trans. Smart Grid 2017, 1, 26–40. [Google Scholar] [CrossRef]
Maitre, J.; Glon, G.; Gaboury, S.; Bouchard, B.; Bouzouane, A. Efficient appliances recognition in smart homes based on active andreactive power, fast fourier transform and decision trees. Presented at the AAAI Workshops: Artificial Intelligence Applied to Assistive Technologies and Smart Environments, Austin, TX, USA, 25–30 January 2015; pp. 24–29. [Google Scholar]
Meehan, P.; Ardle, C.M.; Daniels, S. An efficient, scalable time-frequency method for tracking energy usage of domestic appliancesusing a two-step classification algorithm. Energies 2014, 7, 7041–7066. [Google Scholar] [CrossRef]
Semwal, S.; Singh, M.; Prasad, R.S. Group control and identifi-cation of residential appliances using a nonintrusive method. Turk. J. Electr. Eng. Comput. Sci. 2015, 23, 1805–1816. [Google Scholar] [CrossRef]
Wild, B.; Barsim, K.S.; Yang, B. A new unsupervised event detector for non-intrusive load monitoring. Presented at the 2015 IEEE Global Conference on Signal and Information Processing (GlobalSIP), Orlando, FL, USA, 14–16 December 2015; pp. 73–77. [Google Scholar]
Gillis, J.M.; Alshareef, S.M.; Morsi, W.G. Nonintrusive Load Monitoring Using Wavelet Design and Machine Learning. IEEE Trans. Smart Grid 2017, 7, 320–328. [Google Scholar] [CrossRef]
Gillis, J.M.; Morsi, W.G. Non-Intrusive Load Monitoring Using Semi-Supervised Machine Learning and Wavelet Design. IEEE Trans. Smart Grid 2017, 8, 2648–2655. [Google Scholar] [CrossRef]
Rahayu, D.; Narayanaswamy, B.; Krishnaswamy, S.; Labbé, C.; Seetharam, D.P. Learning to be energy-wise: Discriminative methods for load disaggregation. Presented at the 3rd International Conference on Future Energy Systems: Where Energy, Computing and Communication Meet, Madrid, Spain, 9–11 May 2012; pp. 1–4. [Google Scholar]
Berges, M.; Goldman, M.; Matthews, H.; Soibelman, L.; Anderson, K. User-centered nonintrusive electricity load monitoring for resi-dential buildings. J. Comput. Civ. Eng. 2011, 25, 471–480. [Google Scholar] [CrossRef]
Chang, H.-H.; Lee, M.-C.; Chen, N.; Chien, C.-L.; Lee, W.-J. Feature extraction based Hellinger distance algorithm for non-intrusive agingload identification in residential buildings. Presented at the IEEE Industry Applications Society Annual Meeting, Addison, TX, USA, 18–22 October 2015; pp. 1–8. [Google Scholar]
Nguyen, M.; Alshareef, S.; Gilani, A.; Morsi, W.G. A novel feature extraction and classification algorithm based on power components using single-point monitoring for NILM. Presented at the IEEE 28th Canadian Conference on Electrical and Computer Engineering (CCECE), Halifax, NS, Canada, 3–6 May 2015; pp. 37–40. [Google Scholar]
Jimenez, Y.; Duarte, C.; Petit, J.; Carrillo, G. Feature extraction for nonintrusive load monitoring based on S-transform. Presented at the Power Systems Conference (PSC), Clemson University, Clemson, SC, USA, 11–14 March 2014; pp. 1–5. [Google Scholar]
Singh, M.; Kumar, S.; Semwal, S.; Prasad, R.S. Residential load signature analysis for their segregation using wavelet—SVM. In Power Electronics and Renewable Energy Systems; Kamalakannan, C., Suresh, L.P., Dash, S.S., Panigrahi, B.K., Eds.; Springer: New Delhi, India, 2015; pp. 863–871. [Google Scholar]
Zoha, A.; Gluhak, A.; Nati, M.; Imran, M.A.; Rajasegarar, S. Acoustic and device feature fusion for load recognition. Presented at the IEEE 2012 6th IEEE International Conference Intelligent Systems, Sofia, Bulgaria, 6–8 September 2012; pp. 386–391. [Google Scholar]
Yang, C.C.; Soh, C.S.; Yap, V.V. A non-intrusive appliance load monitoring for efficient energy consumption based on Naive Bayes classifier. Sustain. Comput. Inform. Syst. 2017, 14, 34–42. [Google Scholar] [CrossRef]
Mauch, L.; Yang, B. A new approach for supervised power disaggregation by using a deep recurrent LSTM network. In Proceedings of the 2015 IEEE Global Conference on Signal and Information Processing (GlobalSIP), Orlando, FL, USA, 14–16 December 2015. [Google Scholar]
Do Nascimento, P.P. Applications of Deep Learning Techniques on NILM. Ph.D.Thesis, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil, 2016. [Google Scholar]
Xing, Z.; Pei, J.; Keogh, E. A brief survey on sequence classification. Acm Sigkdd Explor. Newsl. 2010, 12, 40–48. [Google Scholar] [CrossRef]
Liu, B.; Luan, W.; Yu, Y. Dynamic time warping based non-intrusive load transient identification. Appl. Energy 2017, 195, 634–645. [Google Scholar] [CrossRef]
Chang, H.H.; Yang, H.T. Applying a non-intrusive energy-management system to economic dispatch for a cogeneration system and power utility. Appl. Energy 2009, 86, 2335–2343. [Google Scholar] [CrossRef]
Parson, O.; Ghosh, S.; Weal, M.; Rogers, A. An unsupervised training method for non-intrusive appliance load monitoring. Artif. Intell. 2014, 217, 1–9. [Google Scholar] [CrossRef]
Cominola, A.; Giuliani, M.; Piga, D.; Castelletti, A.; Rizzoli, A.E. A Hybrid Signature-based Iterative Disaggregation algorithm for Non-Intrusive Load Monitoring. Appl. Energy 2017, 185, 331–344. [Google Scholar] [CrossRef]
Li, J.; West, S.; Platt, G. Power decomposition based on SVM regression. In Proceedings of the IEEE International Conference on Modelling, Identification Control, Wuhan, China, 24–26 June 2012; pp. 1195–1199. [Google Scholar]
Har-Peled, S.; Mazumdar, S. On coresets for k-means and k-median clustering. ProceedingS of the Thirty-Sixth ACM Symposium on Theory of Computing, Chicago, IL, USA, 13–16 June 2004; pp. 291–300. [Google Scholar]
Wang, H.; Yang, W. An Iterative Load Disaggregation Approach Based on Appliance Consumption Pattern. Appl. Sci. 2018, 8, 542. [Google Scholar] [CrossRef]
Boser, B.E. A training algorithm for optimal margin classifiers. In Proceedings of the ACM Fifth Workshop on Computational Lerning Theory, Pittsburgh, PA, USA, 27–29 July 1992; pp. 144–152. [Google Scholar]
Vapnik, V.; Golowich, S.E.; Smola, A. Support Vector Method for Function Approximation, Regression Estimation, and Signal Processing. Adv. Neural Inf. Process. Syst. 1996, 9, 281–287. [Google Scholar]
Goldberg, D.E. Genetic Algorithms in Search, Optimization, and Machine Learning; Addison-Wesley: Reading, MA, USA, 1989. [Google Scholar]
Makonin, S.; Ellert, B.; Bajić, I.V.; Popowich, F. Electricity, water, and natural gas consumption of a residential house in Canada from 2012 to 2014. Sci. Data 2016, 3, 160037. [Google Scholar] [CrossRef] [PubMed]

Figure 1. The flowchart of the proposed algorithm in this paper.

Figure 2. The flowchart of optimized support vector regression (OSVR).

Figure 3. Load disaggregation result and the measured power of four appliances for a day. From left: (a) Load disaggregation result of four appliances in a day; (b) measured power values of four appliances in a day.

Figure 4. Electricity consumption statistics.

Table 1. Average powers of testing appliances in AMPds.

Appliance	Status	Average Power/W
Clothes dryer	Off	0
	On-state 1	245
	On-state 2	4586
Heat pump	Off	0
	On-state 1	37
	On-state 2	1767
Dish washer	Off	0
	On-state 1	139
	On-state 2	757
Fridge	Off	0
Fridge	On-state1	130

Table 2. The disaggregation accuracy metrics for the proposed method.

Metrics	F-score	PCE (%)	R2
Dish washer	0.995	0.057	0.996
Heat pump	0.980	0.521	0.975
Clothes dryer	1	0.083	0.999
Fridge	0.999	0.090	0.950

Table 3. F-score for four appliances from three periods.

Appliance	Period 1	Period 2	Period 3
Dish washer	0.996	0.991	0.991
Heat pump	0.984	0.976	0.973
Clothes dryer	1	1	1
Fridge	1	0.986	0.978

Table 4. Power contribution error (PCE) (%) for four appliances from three periods.

Appliance	Period 1	Period 2	Period 3
Dish washer	0.052	0.055	0.074
Heat pump	0.511	0.505	0.544
Clothes dryer	0.070	0.203	0.211
Fridge	0.075	0.137	0.216

Table 5. F-score values compared with the factorial hidden Markov models (FHMM) method.

Appliance	FHMM in Literature [32]	Proposed Method
Dish washer	0.87	0.995
Heat pump	0.96	0.980
Clothes dryer	0.72	1
Fridge	0.98	0.999

Table 6. PCE (%) values compared with the method in References [9] and [32].

Appliance	Average Power Method in Literature [9]	DTW-Based Method in Literature [32]	Proposed Method
Dish washer	0.074	0.242	0.057
Heat pump	0.453	0.1	0.521
Clothes dryer	0.463	0.2	0.083
Fridge	0.952	0.077	0.090

Table 7. R2 values compared with the method in literature [32].

Appliance	DTW-Based Method in Literature [32]	Proposed Method
Dish washer	0.924	0.996
Heat pump	0.959	0.975
Clothes dryer	0.999	0.999
Fridge	0.938	0.950

Table 8. Computation time for power consumption estimation.

Power Estimation Method	SDTW	OSVR Based Method
Time(s)	13.14296	2.91480

Table 9. The estimation error of OSVR and SVR.

Algorithm	OSVR	SVR
RMSE	1.93	4.16

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Yuan, Q.; Wang, H.; Wu, B.; Song, Y.; Wang, H. A Fusion Load Disaggregation Method Based on Clustering Algorithm and Support Vector Regression Optimization for Low Sampling Data. Future Internet 2019, 11, 51. https://doi.org/10.3390/fi11020051

AMA Style

Yuan Q, Wang H, Wu B, Song Y, Wang H. A Fusion Load Disaggregation Method Based on Clustering Algorithm and Support Vector Regression Optimization for Low Sampling Data. Future Internet. 2019; 11(2):51. https://doi.org/10.3390/fi11020051

Chicago/Turabian Style

Yuan, Quanbo, Huijuan Wang, Botao Wu, Yaodong Song, and Hejia Wang. 2019. "A Fusion Load Disaggregation Method Based on Clustering Algorithm and Support Vector Regression Optimization for Low Sampling Data" Future Internet 11, no. 2: 51. https://doi.org/10.3390/fi11020051

APA Style

Yuan, Q., Wang, H., Wu, B., Song, Y., & Wang, H. (2019). A Fusion Load Disaggregation Method Based on Clustering Algorithm and Support Vector Regression Optimization for Low Sampling Data. Future Internet, 11(2), 51. https://doi.org/10.3390/fi11020051

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Fusion Load Disaggregation Method Based on Clustering Algorithm and Support Vector Regression Optimization for Low Sampling Data

Abstract

1. Introduction

2. Proposed Method

2.1. Power Consumption Pattern Signatures Extraction

2.2. Appliance States Identification and Load Disaggregation

2.2.1. Aggregation Power Clustering and Appliance Identification through DTW

2.2.2. Power Retrieve

2.3. Result Correction

3. Experiment Setting

3.1. Data

3.2. Performance Evaluation Metrics

4. Results and Discussion

4.1. The Load Disaggregation Results

4.2. Performance Comparison

4.3. Time Consumption Comparison of Power Trajectory Estimation

4.4. Estimation Performance Comparison between OSVR and SVR

4.5. Method Limitations and Ways to Address Them

5. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI