1. Introduction
The depletion of resources owing to the continual increase in energy consumption has long been a global issue, and the efficient management of energy has become a challenging task. Estimating the amount of energy consumed is an essential first step in managing energy successfully. Non-Intrusive Load Monitoring (NILM) [1], which analyzes changes in the voltage and current entering a house to deduce which appliances are in use, is well suited to such estimation. Consequently, various NILM techniques have been applied to appliance classification, which is one of the main purposes of NILM.
The total power consumption measured by the main meter can be considered as a collection of operation and usage patterns in the time sequence because the total power is the sum of the power drawn by all working appliances. A close analysis of these patterns reveals abundant underlying information, such as operation routines: when and for how long an appliance is used. The recognition accuracy can be improved by extracting the most useful patterns from the power signatures. Therefore, it is necessary to develop a careful strategy for obtaining comprehensive patterns from the main power.
NILM techniques based on machine learning and deep learning algorithms have improved with the rapid development of the Internet of Things (IoT) and smart meters. Among machine learning algorithms, the Support Vector Machine (SVM) [2,3], k-Nearest Neighbors (k-NN) [3,4,5], and Hidden Markov Model (HMM) [6,7,8,9,10] are representative algorithms for load classification in NILM. The capabilities of these algorithms in identifying appliances have been verified.
Numerous deep learning approaches are based on images that are preprocessed and transformed from the total power signature because such approaches offer the advantage of processing two-dimensional data. Two characteristic image categories exist: the spectrogram [11,12] and the voltage-current (VI) trajectory [2,13,14,15]. In these methods, high-frequency data (sampled at kHz or higher), which contain abundant information for appliance identification, are preprocessed and converted into images. When the power signal is converted into an image, a high sampling frequency is advantageous because the data carry more information.
On the other hand, low-frequency data generally contain only simple on/off patterns and are not widely used in deep learning methods because they carry so little information. This may be why, to the best of our knowledge, no image-based deep learning approach using low-frequency data has been reported. Low-frequency data can become more useful to such algorithms through temporal patternization. In this paper, we develop a transformation-based method that patternizes the operational features and time-series characteristics together.
The main contributions of this paper are described as follows:
- 1.
We propose a new method called the temporal bar graph, which forms new temporal usage patterns with a circular bar graph to capture more detailed features in the power signals. This method patternizes the characteristics in the time sequence and usage routines of appliances.
- 2.
We visualize the specified patterns in the time sequence by using the temporal bar graph, from which the features can be extracted effectively by convolutional layers.
- 3.
We empirically show that the temporal bar graph achieves higher accuracy and F1-score than state-of-the-art algorithms, including the Very Deep One Dimensional CNN (VDOCNN) [16] and Extreme Inception (Xception) [17], especially as the number of appliances increases.
The rest of this paper is organized as follows. Section 2 explains the background knowledge and summarizes the state-of-the-art related work. Section 3 presents our proposed method for temporal patternization. Section 4 explains our experimental setup. Section 5 presents the results of the evaluation. Finally, Section 6 concludes this study.
2. Background and Related Work
In this section, we review the concepts of NILM and the literature related to our work.
Figure 1 presents some basic concepts of NILM. The appliances in the household are connected to the sub-meters, which are linked to the main meter, and these meters can monitor the overall operational events of the appliances. The aggregated power can be represented as follows:
$$P(t) = \sum_{i=1}^{N} p_i(t) + e(t)$$
where $P(t)$ is the total power read from the meter at time $t$, $p_i(t)$ is the load of a single active appliance $i$ at time $t$, $N$ is the number of active appliances, and $e(t)$ is a small noise or error term. The NILM technique was first introduced by Hart in 1992 [1], and many studies have been conducted since its introduction, including machine learning-based, deep learning-based, and spectral graph-based research. In general, deep learning and machine learning-based approaches exhibit effective performance in load classification.
Numerous machine learning algorithms have been applied to NILM, including the well-known Support Vector Machine (SVM) [2,3], k-Nearest Neighbors (k-NN) [3,4,5], and Hidden Markov Model (HMM) [6,7,8,9,10] algorithms, which have achieved high performance. However, these traditional machine learning techniques exhibit several limitations in NILM. In the SVM, the classes in the classification are normally restricted to two; this number can be increased by applying non-linearity, but the increased computational complexity becomes a problem when training with a larger dataset [19]. The k-NN algorithm is not efficient in recognizing new appliances. Moreover, as the number of classes increases, the classification accuracy of the method decreases [20]. Finally, in the HMM, the entire structure must be retrained if a new class is added. Moreover, the computational complexity increases exponentially as the number of appliance classes increases, which restrains the performance of the algorithm [21].
Over the past several years, various issues in NILM have been examined using deep learning-based approaches, which have frequently outperformed conventional methods, especially in load classification [22]. Kelly and Knottenbelt [23] proposed three deep neural network architectures to extract operational features from the total power: Long Short-Term Memory (LSTM), Denoising Autoencoders (DAE), and a network called Rectangles. The networks with convolutional layers exhibited superior performance, particularly on unseen data. This suggests that a convolutional layer is capable of extracting the inherent patterns from the total power. De Baets [15] proposed voltage-current (VI) trajectory images that were weighted and reformed for appliance recognition.
De Baets used a simple and light CNN architecture, and the approach achieved novel results overall for a large number of appliances. Subsequently, Concatenate-CNN and spectrogram images that were preprocessed by Short-Time Fourier Transform (STFT) were suggested to eliminate noise and background loads from the target appliance to improve the classification performance. The results demonstrated that Concatenate-CNN outperformed the methods of previous works [11].
However, the above image-based approaches only consider the operational events of appliances without temporal characteristics, which are an important factor in making the on–off events more valuable. The Concatenate-CNN and spectrogram image approach was tested primarily in single-load cases. Thus, the results reported in [11] are not guaranteed to hold in multi-load classification. In contrast, the method proposed in this paper can easily patternize the temporal features of the total power, which is the sum of the operational patterns, and a CNN can capture the features in the created pattern. Moreover, the method can be used effectively in multi-load as well as single-load cases.
3. Temporal Bar Graph
We propose a temporal bar graph transformation, which patternizes the power signature in the time sequence. This transformation converts original time series data to a sequence of graphs. Each graph represents a temporal pattern of data in a specific time window, and this can be adopted in the training of various image-based deep learning techniques.
The main idea of the temporal bar graph is as follows. Figure 2 shows the power signature in a time sequence. Whenever an appliance is switched on or off, the power signal steps up or down, as the power signature in the figure shows. Apart from on–off events, no further useful features can be visually observed. With on–off events alone, it is difficult to identify which appliance is turned on or off, particularly when several appliances are activated at the same time. Hence, on–off events offer limited scope for improving load classification, especially in the multi-load case.
To address these limitations of using the on–off status, we propose the temporal bar graph. The temporal bar graph is generated from the power signal, as shown in Figure 3, and offers an advantage in both single-load and multi-load classification because it patternizes on–off events and operational times together. This means that the temporal bar graph reorganizes the features of the on and off states and captures how long each state lasts. Below, we explain the details of the temporal patternization.
In this paper, the temporal bar graph consists of 10 bars, each representing 6 s of temporal features. One temporal bar graph therefore covers 60 s of temporal features and appliance usage patterns. The 6-s time gap can be changed depending on the domain or experimental circumstances. The length of a bar expresses the amount of energy consumption, with a longer bar indicating that more energy is consumed. Every bar graph is labeled at the Labeling Point, which is the last point of the 60-s data, and the starting point is next to the Labeling Point bar, as shown in Figure 3.
The starting and labeling points are automatically set since the first data point becomes the starting point and the last data point becomes the labeling point when the graph is generated.
Thus, rotating or pivoting the graph does not change the starting and labeling points and, consequently, does not degrade performance. When the last point is labeled, it refers to the history of the nine previous statuses, and, as the labeling is carried out repeatedly, the labels form the usage patterns in the time sequence. Therefore, the temporal bar graph itself becomes a combination of operational features and time characteristics. We call this temporal patternization. Subsequently, a convolutional layer can efficiently detect and obtain meaningful patterns by managing the weights of each bar. For convolutional layers, each 60-s bar graph is transformed into an image for the input data.
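The construction described above can be illustrated with a minimal sketch. It assumes that the 10 bars are spread evenly around the circle and that bar lengths are scaled by a maximum power value. The function names, the `stride` parameter (non-overlapping windows by default), and the sample values are hypothetical and not taken from the paper.

```python
import math

def slice_windows(series, size=10, stride=None):
    """Cut a power series into windows of `size` readings.

    stride=None means non-overlapping windows (an assumption).
    """
    stride = size if stride is None else stride
    return [series[s:s + size] for s in range(0, len(series) - size + 1, stride)]

def window_to_bars(window, max_power):
    """Map one window to (angle in radians, bar length in [0, 1]) pairs.

    Bar 0 is the starting point and the last bar is the labeling point;
    on a circle these two bars sit next to each other.
    """
    n = len(window)
    bars = []
    for k, power in enumerate(window):
        angle = 2.0 * math.pi * k / n              # even angular spacing
        clipped = min(max(power, 0.0), max_power)  # keep within [0, max_power]
        bars.append((angle, clipped / max_power))
    return bars

# Hypothetical 6-s readings in watts for one 60-s window.
example_bars = window_to_bars([0, 50, 100, 100, 0, 0, 0, 0, 0, 200], max_power=200)
```

The resulting pairs could then be drawn as a bar chart on polar axes with any plotting library and saved as an image for the convolutional layers.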
Figure 4 depicts two representative graphs of five appliances in a single load: Dish Washer (DW), Kettle, Washing Machine (WM), Microwave (MW), and Fridge. DW and Fridge in Figure 4a,e exhibit round patterns, Kettle and MW in Figure 4b,d exhibit fan-shaped patterns, and WM in Figure 4c exhibits square-like patterns.
Figure 5 represents temporal bar graphs in a multi-load combination (MW + WM). Figure 5a shows a pattern in which neither MW nor WM is activated. However, the round shape and the graph size, which reflects the level of power consumption, are somewhat similar to Figure 4e, so we can assume that the Fridge is operating in Figure 5a.
Likewise, Figure 5b,c are analogous to Figure 4c,d since these graphs were made from the same appliance usage patterns. When MW and WM are working together, the shape of the graph is more likely to be a combined form of Figure 4c,d. Figure 5d is similar to the combined form. Naturally, these shapes depend on the operational characteristics of the appliance and the usage routines of its user. If the shape of a bar graph is analogous to that of another bar graph in certain periods, we consider the two as the same usage pattern to enhance the model performance during training. If the graph is converted into an image, the proposed concept offers the advantage that its features can be extracted efficiently using deep learning techniques, including CNNs.
Two representative image-based approaches using deep learning methods exist: the spectrogram and the VI trajectory. These methods preprocess high-frequency data and transform the preprocessed data into images for appliance classification. Hence, the classification performance is highly affected by the data preprocessing. Moreover, the time-series characteristics, which can be as useful as the operational patterns, are not taken into account in these approaches when the data are converted into images.
However, the proposed method does not require complicated preprocessing and provides a graphical visualization that is understandable by sight. Moreover, our method is advantageous for extracting the detailed patterns of a power signature because it patternizes the operational patterns and temporal features together. The simple application of our approach to raw data can enhance the load identification performance for both single and multiple loads.
4. Experiments
In this section, we demonstrate the performance of the proposed temporal bar graph on state-of-the-art deep learning techniques, namely the VDOCNN and Xception. To verify the proposed approach, experiments were conducted using three cases: (1) a single-load performance comparison between the original current data (raw data) and the bar graph; (2) a multi-load performance comparison between the raw data and bar graph; and (3) a performance comparison of the bar graph with the spectrogram.
4.1. Dataset and Data Preprocessing
The UK Domestic Appliance-Level Electricity (UK-DALE) [24] dataset was used to confirm that the application of the bar graph could enhance the classification performance based on the same models and data. The UK-DALE dataset covers five UK houses. The mains in each house were sampled at 1 Hz, and the data were measured every 6 s. The total recording duration across the five houses was 786 days, and the total number of appliances was 54. Houses 1, 2, and 5 were selected for our experiments because they had more realistic power signals. We used House 1 as training data and Houses 2 and 5 as test data. The House 1 data from 01-01-2014 to 11-01-2014 (11 days, date in the format DD-MM-YYYY) were used as the training dataset, whereas the House 2 data from 20-05-2013 to 31-05-2013 (11 days) and the House 5 data from 29-06-2014 to 10-07-2014 (11 days) were used as the test datasets.
The Tracebase dataset was collected from German households and sampled at 1 Hz, with one data point measured every 1 s. It does not contain the aggregated power. In our experiments, we therefore used the sum of the power consumption of the selected appliances as the aggregated power. The total duration of the data was 1883 days with 43 different types of appliances. We used the complete data of Tracebase and chose 7 days in which all five appliances were present.
Table 1 shows the number of events of UK-DALE and Tracebase in the training and test sessions.
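As a rough illustration of how an aggregated signal can be formed when a dataset provides only per-appliance traces, as described above for Tracebase, the sketch below sums aligned traces sample by sample. The function name and the wattage values are hypothetical, and the traces are assumed to be already aligned to the same timestamps.

```python
def aggregate_power(appliance_traces):
    """Sum per-appliance power traces sample by sample.

    `appliance_traces` maps an appliance name to a list of readings;
    all lists are assumed to cover the same timestamps.
    """
    lengths = {len(trace) for trace in appliance_traces.values()}
    if len(lengths) != 1:
        raise ValueError("traces must be non-empty in number and equally long")
    return [sum(samples) for samples in zip(*appliance_traces.values())]

# Hypothetical 1-s readings in watts.
total = aggregate_power({
    "Fridge": [80, 80, 0],
    "Kettle": [0, 2000, 2000],
})
```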
In each of the above datasets, we chose the following five common appliances for our experiments: Fridge, Washing Machine (WM), Dish Washer (DW), Kettle, and Microwave (MW). These five appliances were selected since they were present in the three houses of UK-DALE and in Tracebase. Additionally, the five appliances are commonly used for evaluating NILM methods [25]. Each 60-s temporal bar graph was converted into an image for the input data since a convolutional layer can efficiently extract the useful features from a graph image. For 60-s intervals, we used 10 data points for UK-DALE and 60 data points for Tracebase.
The detailed procedure of generating a temporal bar graph for single cases was as follows:
- 1.
The total power is sliced into 60-s intervals.
- 2.
Each sliced interval of the data is labeled by the activation status of the single appliance at the last data point of the 60-s interval, and labeling is based on the threshold listed in Table 2.
- 3.
The largest value in the entire dataset is set as the maximum value, and the minimum value is set to 0.
- 4.
Each labeled interval is converted into temporal bar graph images with size.
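The single-load steps above can be sketched as follows. This is only an illustration under stated assumptions: windows do not overlap, an appliance counts as ON when its last reading is at or above the threshold, and the threshold and power values are hypothetical rather than those of Table 2. Rendering the scaled window as an image is left out.

```python
def build_single_load_samples(total_power, appliance_power, threshold, size=10):
    """Return (scaled_window, label) pairs for one appliance.

    total_power     : aggregated readings
    appliance_power : readings of the target appliance, same length
    threshold       : ON threshold (hypothetical value, see Table 2 in the paper)
    size            : readings per 60-s window (10 for 6-s data)
    """
    global_max = max(total_power)          # step 3: maximum over the whole dataset
    if global_max <= 0:
        raise ValueError("total power must contain a positive value")
    samples = []
    for start in range(0, len(total_power) - size + 1, size):      # step 1: slicing
        window = total_power[start:start + size]
        last = appliance_power[start + size - 1]
        label = 1 if last >= threshold else 0                       # step 2: label at last point
        scaled = [p / global_max for p in window]                   # range [0, max] mapped to [0, 1]
        samples.append((scaled, label))
    return samples

# Hypothetical data: a kettle switches on during the second window.
total = [100] * 15 + [2100] * 5
kettle = [0] * 15 + [2000] * 5
samples = build_single_load_samples(total, kettle, threshold=1000)
```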
The procedure of generating a temporal bar graph for multi-load cases is as follows:
- 1.
The total power is sliced into 60-s intervals.
- 2.
Each sliced interval of the data is labeled by the activation status of the appliances at the last point. For instance, in the DW+Fridge case, when neither appliance is activated at the last point, the label is 0, and when only DW is activated at the last point, the label is 1. Likewise, when only Fridge is operational at the last point, the label is 2, and when both appliances are operational at the last point, the label is 3. The three- and five-appliance combinations are labeled in the same way on the basis of the operational threshold listed in Table 2, and the graph images are generated for the different combinations.
- 3.
The largest value in the entire set of data points is set as the maximum value, and the minimum value is set to 0.
- 4.
Each labeled interval is transformed into graph images with the size of .
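One way to reproduce the DW+Fridge labels given above (0, 1, 2, 3) is to treat each appliance as one bit of the label. The sketch below does this. Extending the same bit encoding to three and five appliances is an assumption, as are the function name and the threshold values, which are not those of Table 2.

```python
def multi_load_label(last_readings, thresholds, order):
    """Encode the ON/OFF status of several appliances as one integer label.

    last_readings : appliance name -> reading at the last point of the window
    thresholds    : appliance name -> ON threshold (hypothetical values)
    order         : appliance names; order[0] is the lowest bit
    """
    label = 0
    for bit, name in enumerate(order):
        if last_readings[name] >= thresholds[name]:
            label |= 1 << bit      # set this appliance's bit when it is ON
    return label

order = ["DW", "Fridge"]
thresholds = {"DW": 10, "Fridge": 50}   # hypothetical thresholds in watts

labels = [
    multi_load_label({"DW": 0, "Fridge": 0}, thresholds, order),      # both OFF
    multi_load_label({"DW": 1200, "Fridge": 0}, thresholds, order),   # only DW ON
    multi_load_label({"DW": 0, "Fridge": 90}, thresholds, order),     # only Fridge ON
    multi_load_label({"DW": 1200, "Fridge": 90}, thresholds, order),  # both ON
]
```

Under this encoding, a combination of n appliances yields 2^n classes, that is, 4, 8, and 32 classes for two, three, and five appliances.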
We determined the ON (
) and OFF (
) states of the appliances by using the operational threshold
q of each appliance and the total power
, as indicated in
Table 2. Note that
is equivalent to
, and
is
for each appliance.
4.2. Experimental Setup
Every experiment in this study was carried out using the TensorFlow framework and Keras. The learning rate and optimizer were and Adam, respectively. For the loss function, we used binary cross-entropy in the single-load case and categorical cross-entropy in the multi-load case. The configuration for the experiments was as follows:
4.3. Evaluation Metrics
True positive (TP) was indicated when the working state of an appliance was classified as ON. True negative (TN) was stated when the not-working state was classified as OFF. False positive (FP) was indicated when the not-working state was classified as ON. False negative (FN) was stated when the working state was classified as OFF. The Precision, Recall, F1-score, and Accuracy were used for evaluation in this study and are defined as follows:
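For reference, the standard definitions of these four metrics in terms of TP, TN, FP, and FN can be written as below. They are given here in their usual textbook form, which is assumed to match the definitions used in this study.

```latex
\begin{align}
\text{Precision} &= \frac{TP}{TP + FP}, \\
\text{Recall}    &= \frac{TP}{TP + FN}, \\
\text{F1-score}  &= \frac{2 \times \text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}}, \\
\text{Accuracy}  &= \frac{TP + TN}{TP + TN + FP + FN}.
\end{align}
```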
4.4. Network Architecture
VDOCNN [16] and Xception [17] are commonly used CNN architectures that yield state-of-the-art performance in image classification tasks. We deployed VDOCNN, which consists of 1-dimensional (1D) convolutional layers, as illustrated in Figure 6, and Xception, which consists of 2-dimensional (2D) convolutional layers, as depicted in Figure 7. We selected VDOCNN and Xception because they are state-of-the-art networks built from 1D and 2D convolutional layers, respectively, and show solid performance in image classification tasks. This allows us to evaluate whether our method performs stably when the dimensionality of the convolutional layers changes.
VDOCNN is a 1-dimensional convolutional neural network that can efficiently conduct 2-dimensional image classification tasks. However, there will be some missing values since VDOCNN converts 2-dimensional data to 1-dimensional data. Xception is motivated by the Inception model and showed more efficient performance over the Inception network on the ImageNet dataset. Specifically, 1 × 1 convolutional layers in Inception were replaced with 3 × 3 convolutional layers, and more 3 × 3 layers were added in the Xception model. Xception considerably reduced the convolutional computing cost. However, only a few performance demonstrations are reported since it is a new model.