Short-Term Load Forecasting for a Single Household Based on Convolution Neural Networks Using Data Augmentation

Acharya, Shree Krishna; Wi, Young-Min; Lee, Jaehee

doi:10.3390/en12183560

Open AccessArticle

Short-Term Load Forecasting for a Single Household Based on Convolution Neural Networks Using Data Augmentation

by

Shree Krishna Acharya

¹

,

Young-Min Wi

² and

Jaehee Lee

^3,*

¹

Department of Electronics Engineering, Mokpo National University, Muan 58554, Korea

²

School of Electrical and Electronic Engineering, Gwangju University, Gwangju 61743, Korea

³

Department of Information and Electronics Engineering, Mokpo National University, Muan 58554, Korea

^*

Author to whom correspondence should be addressed.

Energies 2019, 12(18), 3560; https://doi.org/10.3390/en12183560

Submission received: 16 August 2019 / Revised: 11 September 2019 / Accepted: 15 September 2019 / Published: 17 September 2019

(This article belongs to the Special Issue Short-Term Load Forecasting 2019)

Download

Browse Figures

Versions Notes

Abstract

:

Advanced metering infrastructure (AMI) is spreading to households in some countries, and could be a source for forecasting the residential electric demand. However, load forecasting of a single household is still a fairly challenging topic because of the high volatility and uncertainty of the electric demand of households. Moreover, there is a limitation in the use of historical load data because of a change in house ownership, change in lifestyle, integration of new electric devices, and so on. The paper proposes a novel method to forecast the electricity loads of single residential households. The proposed forecasting method is based on convolution neural networks (CNNs) combined with a data-augmentation technique, which can artificially enlarge the training data. This method can address issues caused by a lack of historical data and improve the accuracy of residential load forecasting. Simulation results illustrate the validation and efficacy of the proposed method.

Keywords:

data augmentation; convolution neural network; residential load forecasting

1. Introduction

Short-term load forecasting (STLF) is an important part of power system planning and operation [1]. The STLF, which has a prediction range of one hour to 168 h, is used for controlling and scheduling of daily power system operations. Furthermore, forecasting customer-level energy consumption is essential for many potential applications in the future power system, such as demand response (DR) programs, home load scheduling with renewables, and optimal operation of energy storage systems (ESS) [2].

Statistical methods, including multiple linear regression [3,4], exponential smoothing [5], and the autoregressive integrated moving average (ARIMA) [6] are the most commonly used for the STLF. Recently, deep-learning-based forecasting techniques are gaining attention in the STLF. Recursive neural networks (RNNs) capable of learning long-term dependence are being applied to the assumption of load prediction in [7]. However, the vanishing gradient problem of RNNs makes it hard to improve forecasting accuracy. In [8,9], to overcome this problem in the RNN, the long short-term memory (LSTM) has been proposed. Some studies show the capability of LSTM for more improved forecasting performance at the system-level forecast when there is relatively long-term historical load data. In addition to the above methods, many deep learning algorithms, such as deep feed forward [10,11], deep belief network (DBN) [12], are being applied to load forecasting. In addition, some hybrid spectral methods, including wavelet analysis [13] and empirical mode decomposition (EMD) [14] with neural networks, have been proposed to remove the uncertainty of historical electrical load. However, all of these techniques were implemented in the substation- or system-level load forecasting with training from sufficient historical load data.

In the STLF at the system-level, a lot of historical data are available because small electrical load changes do not significantly affect the overall load pattern trend. However, at the household level, STLF may not have enough data for some households to capture long-term dependencies and properly train the learning network. For example, if a homeowner has recently changed, only a small amount of electrical load data will help its load forecasting. In addition, even if homeowners have not changed for a long time, the pattern of energy consumption can change with new electrical devices and lifestyle changes [15]. Therefore, there is a limit to using the previous data to increase the training data size. In [2,7], RNN and LSTM were applied to single household load prediction, but these studies did not take into account the past data shortage issues.

To address the lack of historical data for training, the convolutional neural networks (CNNs) can be an effective method for residential load forecasting, because CNN can capture short-term trends in load data when the local data points are strongly related to each other [16]. In addition, data augmentation can be one solution to handle the issues. Data augmentation can handle short-duration data collection by enlarging the size of the training data in a way that adds extras copies of training examples in training dataset and minimizes the overfitting in deep-learning techniques [17]. The overall training cost of a deep network will be reduced when input data are large and contain more similar information. With such enlarged training data, the accuracy of residential load forecasting can be improved. Technical paper [7] tries to enlarge the input data using another household’s load series. However, this approach has the potential to obtain data that have different characteristics of the load series of the target household. Moreover, some target households have similar load profiles in their load series, but others do not [2]. Because of load profile diversity within and between households, existing forecasting benchmarks yield cons rather than pros. In fact, forecasting with only a single household’s data may not often have sufficient information to fit a wide variety of the model capacity, particularly in deep-learning techniques. Therefore, proper data augmentation is required, which can artificially create new training data from the historical data of a single household.

Herein, the paper proposes a load-forecasting method for a single residential household based on convolutional neural networks (CNNs). The CNN is a type of deep neural network where its structure is formed by using convolution and pooling layers [18,19,20]. With the help of various filters, CNNs can learn the inherent information of an electric load series. The proposed forecasting introduces a novel data augmentation technique that concatenates the various residual load series generated from the electrical load of a single household. The original load series is converted to another residual load series containing uncertainty information of electrical load. Several residual load series are extracted through multiple k-means clustering to collect sufficient training data [21,22]. Among the extracted residual load series, the less uncertain and more homogeneous ones were fed to the CNNs as training data. The proposed augmentation technique can provide enough homogeneous training data to CNNs for more accurate forecasting. The proposed forecasting method was tested on ten single households for a year and is compared with the results of pooling-based augmentation [7].

The rest of this paper is organized as follows. Section 2 discusses the implementation of the proposed augmentation strategies and the context of residual load series. Similarly, Section 3 describes the proposed algorithm. Section 4 talks about the implementation procedure and discusses the simulation results. Section 5 concludes the paper.

2. Augmentation Implementation

2.1. CNN with Augmentation

Since an electric load series consists of many load profiles, the diversities between the load profiles are a major concern for forecasting using CNN filter networks. In fact, CNNs can facilitate more precisely given less diverse training load profiles rather than highly diverse ones [23]. Because of human actions, weather, and types of day, the differences between load profiles are uncertain, non-linear, and complex [2,24]. On the other hand, the amount of training data is crucial for fitting all the parameters of CNNs.

To address the issues of the uncertainty and necessities of huge training data, the state-of-the-art papers used other households load series for data augmentation. Moreover, the existing approaches do forecasting by hoping that load profiles will repeat and assume that the repetition of load profiles is often similar among households during the same time interval [7,23]. As a consequence, the forecasting accuracy is trivialized when the augmented data contains only correlated households’ load series. However, the effects of human actions and the type of day are totally different in the electricity load of a single household and are too difficult to predict. Thus, incorporating other households’ load series into training data inevitably burdens the learning ability of the CNN rather than optimizing it.

The augmentation strategy can be applied with single households and is valid when each deep-learning-based forecasting framework improved its performance [25,26]. To improve cognition, augmentation techniques need to grasp all the potentially uncertain information about the electric load series. Figure 1a,b describe the concept of enlarging data with the existing pooling and proposed augmentation approach for deep-learning-based forecasting. To enlarge data, the proposed method artificially generates several augmented load series, each of which needs to be extracted in a way that facilitates the CNN learning strategies. To obtain a rationale, each series should have less uncertain and more homogeneous information. The appropriate series are concatenated with each other to turn out a huge amount of training data with homogeneous information. The more homogeneous information there is, the more useful it is for a CNN network to address the granular-level load prediction. Any dissimilar augmented load series in concatenation could destroy a CNN’s optimal cognition. The procedure of extracting a homogeneous information load series is vital and depends totally upon imagination [27]. The proposed augmentation strategies are described in the following section.

2.2. Extraction of Residual Load Series

The data structure of the historical electricity load of a target household can be expressed as the following matrix:

P = [\begin{matrix} p_{1, 1}, p_{2, 1}, \dots, p_{1, t}, \dots, p_{1, T} \\ p_{2, 1}, p_{2, 2}, \dots, p_{2, t}, \dots, p_{2, T} \\ \begin{matrix} ⋮ \\ p_{d, 1}, p_{d, 2}, \dots, p_{d, t}, \dots, p_{d, T} \\ \begin{matrix} ⋮ \\ p_{D, 1}, p_{D, 2}, \dots, p_{D, t}, \dots, p_{D, T} \end{matrix} \end{matrix} \end{matrix}],

(1)

where

p_{d, t}

represents historical electric load at time t in day d; D represents the number of days of historical data, and the time period T is 24 h. This matrix of a historical load can be simplified as follows:

P = [p_{1}, p_{2}, p_{3}, \dots, p_{d}, \dots, p_{D}],

(2)

where vector

p_{d}

are the hourly load profiles of a target household in day d. Figure 2 shows all daily load profiles of a household for one month. In Figure 2, it can be found that the electrical load of a single household fluctuates abruptly. Generally, a smaller electricity load tends to have a higher variation, which makes it difficult to accurately forecast the residential electricity load. In CNN-based forecasting, one way to achieve high performance is to reduce the variations of training data [21].

To improve the learning ability of CNNs, one must extract new features from a residential load series, which has less volatility but still has inherent characteristics of a residential load. One way to reduce the variation is to remove the regular pattern from the load series [28] so that CNNs use only its residuals. With this approach, each load profile is decomposed into centroid and residual load profiles as follows:

p_{d} = c + r_{d},

(3)

where vector

c

is the centroid load profile (average profile) of the historical load, and

r_{d}

is a vector of the residual load series which is the difference between a given centroid

c

and an actual load profile on day

d

. The centroid load profile can be used as the baseline of a particular group of daily load profiles, and the repetition of the centroid load profile yields an average load series for a certain duration. Figure 3 shows an average load series, and its residual load series of a residential household for a month. In Figure 3, the average load series does not contain any uncertain information, but it shows only a regular pattern. On the other hand, the residual load series contains all the uncertain and complex information about the residential load. The residual load series is less volatile than is the original load series but still has inherent characteristics of the residential load. With training from this less volatile data, the CNNs can forecast the small residential load more accurately.

To ensure the appropriateness of the residual load series in learning networks, auto-correlation (AC) or partial auto-correlation (PAC) coefficients [29] can be used. Higher AC or PAC coefficients, out of the confidence interval, mean that the present residual load series is strongly coupled with the historical residual load series. Figure 4 shows AC and PAC coefficients of the residual load series of a selected single household with different time lagging. As shown in Figure 4, when the lags are the multiple of 24 h, the AC coefficients would peak, so that some AC coefficients are out of the confidence interval. Similarly, the PAC coefficient spikes also confirm that the residual load series has vital information about the residential electricity load. The information is in a hidden state and can be learned by using learning networks.

With these residential load series, the paper proposes a CNN-based method for residential load forecasting using data augmentation. The augmentation technique and overall forecasting procedure are described in the following section.

3. Proposed Residential Load Forecasting Method

3.1. Generation of Centroid Load Profiles

To generate the different centroid load profiles from historical residential load P, multiple k-means clustering [21,22] is used in the paper. The multiple k-means clustering to generate the centroid load profiles of a residential load can be express as follows:

\underset{}{Minimize} \sum_{i = 1}^{k} \sum_{p_{d} \in S_{k, i}}^{} ‖ p_{d} - c_{k, i} ‖^{2}, k = 1, 2, \dots, K,

(4)

where

S_{k, i}

is the i-th partition of the load profile set P, which is generated with a clustering number of k, and

c_{k, i}

is the centroid load profile of the corresponding partition of

S_{k, i}

. The clustering starts with a clustering number 1 and ends with the clustering number

K

. Finally,

(K^{2} + K) / 2

centroid load profiles are generated. The paper used all centroids generated with clustering numbers of one to k, and the l-th generated centroid load profile is defined as

c_{l}^{'}

. The centroid load profiles (

c_{l}^{'}

) are

(K^{2} + K) / 2

daily load patterns, which are the mean values of load patterns of similar days. Figure 5 shows the centroid load profiles of a single household for one month using the multiple k-means clustering algorithm.

3.2. Augmentation of Homogeneous Residual Load Profile

Using multiple k-means clustering and its corresponding centroid load profile, the residential load profiles can be generated as follows:

r_{d, l}^{} = p_{d} - c_{l}^{'},

(5)

where

r_{d, l}^{}

is the vector of residual load profiles of the target household on day d, which is generated with the

l

-th centroid load profile. This residual load profile is expected to be less volatile and less uncertain than was the original load profile. In addition, from Equation (5), the amount of training data will be increased much more by using residual load profiles as training data. Namely, one time series (

p_{d}

) of a single household load can be transformed into several time series of residual load series (

r_{d, 1}, r_{d, 2}, \dots, r_{d, l}

).

For the CNN, the appropriate training data are concatenated with each other, and each training data should have less uncertainty and more homogeneous information. The more homogenous information there is, the more useful it will be for a CNN network to address the granular-level load prediction. Any dissimilar augmented load series could destroy the CNN’s optimal cognition. Therefore, one needs to select the more homogeneous residual load profiles from among the augmented load profiles from Equation (5).

To select the more homogenous residual load profiles, the Frobenius norm [30]

Φ_{l}

of each residual load profiles are calculated as follows:

Φ_{l} = \sqrt{\sum_{d = 1}^{D} ‖ r_{d, l}^{} ‖^{2}},

(6)

where each

r_{d, l}^{}

is sequentially observed with time. When residual load profiles (

r_{d, l}^{}

) have lower

Φ_{l}

, most residuals generated with

c_{l}^{'}

are closed to its centroid, and these augmented residual load profiles can be expected to be homogeneous. Finally, the paper used only the residual load series with lower

Φ_{l}

as training set

R_{i n}

,

R_{i n} = {r_{d, l}^{} | Φ_{l} \leq \frac{1}{L} \sum_{l = 1}^{L} Φ_{l}} .

(7)

In addition to the training set, the paper used the residual load profiles with the lowest

Φ_{l}

as the test set for the CNN model. Figure 6 shows the structure of training and testing sets of the proposed method. In Figure 6, the residual load profile is expressed with its elements as

r_{d, l} = {r_{(1, d), l}, r_{(2, d), l}, \dots, r_{(t, d), l}, \dots, r_{(24, d), l}}

. In the training set, several time series of residual load series (

r_{d, 1}, r_{d, 2}, \dots, r_{d, l}

) can be used instead of one original time series (

p_{d}

) of a single household load. With this enragement of training data, forecasting accuracy for individual residential households can be improved.

3.3. CNN Model for Residential Load Forecasting

The training process is yielded by running a program with a given number of iterations. To optimize the CNN model, in each iteration, a root mean square is used for the training process as follows:

argmin \sqrt{\frac{1}{L} . \frac{1}{D} . \sum_{l = 1}^{L} \sum_{d = 1}^{D} {({\hat{r}}_{d, l}, - r_{d, l})}^{2}},

(8)

where

{\hat{r}}_{d, l}

is the predicted vector load profile obtained from the CNN, and L represents the number of selected residual load profiles from Equation (6). A well-trained and converged CNN forecasting network is used for testing the process for predicting the future load. Since the CNN network deals with residual load profiles, the forecasting result can be generated in terms of a residual load profile. In fact, the forecast load profile

{\hat{p}}_{D + 1}

can be obtained by adding both the centroid load profile and the forecast residual load profile as follows:

{\hat{p}}_{D + 1} = \hat{c} + {\hat{r}}_{D + 1, p},

(9)

where

{\hat{r}}_{D + 1, p}

represents the day-ahead forecasted residual load profile, and

\hat{c}

represents the most appropriate centroid load profile which has the lowest Frobenius norm

Φ_{l}

.

Figure 7 explains the entire load forecasting procedure for the proposed methodology. In the first stage, centroid load profiles are generated using multiple k-means clustering, and different types of residual load profiles are extracted with corresponding centroid load profiles. In the second stage, only homogeneous residual load series are selected using the Frobenius norm for training the CNN. In the final stage, the CNN forecasting framework is used to predict day-ahead residential load profiles. The CNN implementation can be summarized into three parts: (1) initialization of the CNN parameters, (2) training the CNN model with the help of input matrix

R_{i n}

, and (3) predicting the day-ahead load profile using the optimally trained CNN model. With the proposed forecasting method, it is expected that the CNN can cognize the characteristics of historical load profiles more accurately, so that the forecasting accuracy is improved interestingly.

4. Simulation Results

4.1. Data Description and Hyper-Parameter Tuning

The proposed method was tested using hourly metering data gathered from 1181 residential households in Seoul, Korea, for one year (August 2016 to July 2017). With this dataset, the results of the proposed method are compared with the results of pooling-based augmentation [7] as well as with the results of other deep-learning models [2,7,20]. For the day-ahead load forecasting, historical load data for the last 30 days were used for the training process, and historical load data of the previous day were used for testing. To evaluate the accuracy of forecasting results, the paper employed the mean absolute percentage error (MAPE) and root mean square error (RMSE) as follows:

MAPE = \frac{1}{T} \sum_{t = 1}^{T} \frac{| {\hat{p}}_{t, (D + 1)} - p_{t, (D + 1)} |}{p_{t, (D + 1)}} \times 100 %,

(10)

RMSE = \sqrt{\frac{1}{T} \sum_{t = 1}^{T} {(({\hat{p}}_{t, (D + 1)} - p_{t, (D + 1)}))}^{2}} .

(11)

The proposed method is developed and tested through Python with the Keras library, whose backend is Tensorflow [31,32]. To avoid over-fitting in the training process, the parameter settings in Table 1 were tested for the proposed method. The tuning process of hyper-parameters was based on [23]. The numbers of hidden layers were selected based on [33,34,35,36,37,38]. The common hyper-parameters, such as activation function, optimizer, loss function, etc., are reported in Table 1. The additional more specific parameters of CNN were settled with the size of filter

3 \times 3

, number of input filters 24, maximum pooling size

3 \times 3

, and size of strides 1. The convolution layers were followed by a fully connected layer with the rectified linear unit (ReLU) activation functions. The final fully connected layer predicted one-hour electric load at a time, which was matched with [36]. The dataset (30 days) was split into validating set (3 days), testing set (1 day), and training set (26 days).

To tackle the overfitting in CNNs, the proposed method was preliminarily tested with different numbers of hidden layers in CNN architecture. Table 2 shows the forecasting results of ten households with six different hidden layers. The ten single households in Table 2 were randomly selected, and the results were monthly average values in July (peak season in Korea). In most households, the forecasting results with two or three hidden layers were more accurate. With these results, the paper set the number of hidden layers to be 2.

4.2. Effects of Proposed Augmentation Method

Table 3 and Table 4 show the MAPE and RMSE results of day-ahead forecasting with and without the proposed augmentation technique. The ten single households in tested Table 3 and Table 4 were randomly selected, and the results were monthly average values in July (peak season in Korea).

By using the proposed augmentation, the forecasting accuracy was improved by 5 percent, at least, as shown in Table 3. It can be concluded that the proposed forecasting method can significantly improve the forecasting accuracy more than can the other forecasting models without augmentation. When comparing only the cases with the proposed augmentation, the CNN provided more accurate forecasting than the LSTM. It is probably because the augmented data contain plenty of similar random information with small variations. With this information, the CNN received the opportunity for obtaining optimal convergence. On the other hand, as LSTM required more repetitive information for training to improve its performance, this augmented data was comparatively ineffective for LSTM.

Figure 8a,b show the average MAPE and RMSE of ten households, calculated using the results of daily forecasting in July. On most days, the proposed forecasting method significantly improved the forecasting accuracy more than did the other forecasting model without augmentation.

Figure 9 shows the daily average of forecasting results for four higher uncertain households. For most times, the proposed forecasting method significantly improved forecasting accuracy. This is probably because the homogeneous information from the proposed augmentation provides the opportunity for the CNN framework to obtain optimal convergence.

4.3. Forecasting Results in Peak Day

Figure 10a,b show the forecasting results of the peak day for two selected households. One was a less uncertain household which showed the lowest monthly average MAPE in Table 2 (household 4). The other household was a more uncertain household which showed the highest monthly average MAPE in Table 2 (household 10). The peak loads of the two households in July were 1.062 kW (23:00, July 17) and 2.033 kW (18:00, July 14), respectively. During a peak day, it is expected that the residential load profiles will be highly uncertain so that the load forecasting is more challenge. To test the forecasting accuracy, the results of the proposed forecasting method were compared to the results of the forecasting models using pooling augmentation techniques [7].

For pooling augmentation, the simulation used historical load data of six neighbors as an additional training set. In Figure 10a,b, the predicted load curves of the proposed method were much closer to the actual load profile at most hours of the day, for both households. Especially at the peak time of the day, the proposed forecasting model can provide significantly accurate forecasting results. On the other hand, the forecasting models using the other pooling techniques showed a worse performance for load forecasting at the peak time.

An important day for a forecasting test is the day of maximum energy. During this day, the residential load profiles would be highly uncertain at all hours of the day, so that the load forecasting is more challenging. The daily maximum energy consumption of the selected two households in July was 19.381 kWh (July 21) and 15.797 kWh (July 30), respectively. Figure 11a,b demonstrate the target load profile and predicted load profile of the less uncertain household and the highly uncertain household, respectively. For the less uncertain household, the predicted load curves of the proposed method were much closer to the actual load profile at most hours of the day. The proposed model reported 7.999% of MAPE for the less uncertain household, which was lower than the 10.6566% of MAPE from the pooled LSTM. Similarly, for the highly uncertain household, the proposed model reported 40.4058% of MAPE, which was lower than the 53.319% of MAPE from the pooled LSTM. These results strongly validate the proposed methodology for load prediction in the residential sector.

4.4. Monthly Results of Day-Ahead Load Forecasting

To examine the efficacy of the proposed technique for one year, this section tested the performance throughout eleven months by picking one of the best performer households and one of the worst performer households.

Table 5 and Table 6 show the monthly average MAPE and RMSE results of the less uncertain household and the highly uncertain household. The test results are from September 2016 to July 2017. In Table 5 and Table 6, the proposed method improved the forecasting accuracy by more than 6 percent throughout the year.

4.5. Impact of Clustering Number $K$

Figure 12a,b show the average MAPE and RMSE results of ten households with different clustering numbers

K

. In Figure 12, very accurate forecasting can be expected when the clustering number

K

is increased for most households. However, for some households, the MAPE is increased when the clustering number

K

is over 4. The higher clustering number

K

provides more training data to the CNNs, so that the CNNs have more chance to learn the load characteristics. However, more training data can increase the variations of the training set, which cause overfitting of the hyper-parameters of CNNs, which degrades the optimal learning of CNNs. Therefore, the optimal clustering number must be selected for each household.

5. Conclusions

The paper proposed a forecasting method based on convolution neural networks (CNNs) combined with a data augmentation technique with consideration of an insufficient period of training data. The proposed data augmentation can enlarge the training data for CNNs using only a target household’s own historical data, without the help of other households. The proposed forecasting method transforms a time series of a single household load data into several time series of residual loads. With this enragement of training data, forecasting accuracy for individual residential households can be improved. The test results indicated that the proposed method can deliver a notable improvement by including homogeneous information for an individual residential household’s load forecasting. The proposed method can be used for energy management at the household level and evaluate the baseline of energy consumption at the household level for demand-response programs.

Author Contributions

For research, S.K.A. developed the idea of augmentation strategies for a CNN-based forecasting framework, performed the simulation, and wrote the paper. Y.-M.W. helped organize the article. J.L. provided guidance for research and revised the paper.

Funding

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2017R1C1B5016244). This research was supported by Korea Electric Power Corporation. (Grant number: R18XA04).

Conflicts of Interest

The authors declare no conflict of interest.

References

Shahidehpour, M.; Yamin, H.; Li, Z. Market Operations in Electric Power Systems: Forecasting Scheduling and Risk Management, 1st ed.; Wiley-IEEE Press: New York, NY, USA, 2002; pp. 21–55. [Google Scholar]
Kong, W.; Dong, Z.Y.; Jia, Y.; Hill, D.J.; Xu, Y.; Zhang, Y. Short-Term Residential Load Forecasting based on LSTM Recurrent Neural Network. IEEE Trans. Smart Grid 2017, 10, 841–851. [Google Scholar] [CrossRef]
Charytoniuk, W.; Chen, M.S.; Van Olinda, P. Nonparametric Regression Based Short-Term Load Forecasting. IEEE Trans. Power Syst. 1998, 13, 725–730. [Google Scholar] [CrossRef]
Song, K.-B.; Baek, Y.-S.; Hong, D.H.; Jang, G. Short-Term Load Forecasting for The Holidays Using Fuzzy Linear Regression Method. IEEE Trans. Power Syst. 2005, 20, 96–101. [Google Scholar] [CrossRef]
Christiaanse, W.R. Short-Term Load Forecasting Using General Exponential Smoothing. IEEE Trans. Power Syst. 1971, 2, 900–911. [Google Scholar] [CrossRef]
Mohamed, N.; Ahmad, M.H.; Ismail, Z. Short Term Load Forecasting Using Double Seasonal ARIMA Model. In Proceedings of the Regional Conference on Statistical Sciences 2010 (RCSS’10), Kota Bharu, Malaysia, 13 June 2010. [Google Scholar]
Shi, H.; Xu, M.; Li, R. Deep Learning for Household Load Forecasting-A Novel Pooling Deep RNN. IEEE Trans. Smart Grid 2018, 9, 5271–5280. [Google Scholar] [CrossRef]
Tian, C.; Ma, J.; Zhang, C.; Zhan, P. A Deep Neural Network Model for Short-Term Load Forecast Based on Long Short-Term Memory Network and Convolutional Neural Network. Energies 2018, 11, 3493. [Google Scholar] [CrossRef]
Bouktif, S.; Fiaz, A.; Ouni, A.; Serhani, M.A. Optimal Deep Learning LSTM Model for Electric Load Forecasting using Feature Selection and Genetic Algorithm: Comparison with Machine Learning Approaches. Energies 2018, 11, 1636. [Google Scholar] [CrossRef]
Torres, J.F.; Galicia, A.; Troncoso, A. A Scalable Approach Based on Deep Learning for Big Data Time Series forecasting. In Proceedings of the International Work-Conference on the Interplay Between Natural and Artificial Computation (IWINAC), A Coruña, Spain, 19–23 June 2017. [Google Scholar]
Guo, Z.; Zhou, K.; Zhang, X.; Yang, S. A Deep Learning Model for Short-Term Power Load and Probability Density Forecasting. Energy 2018, 160, 1186–1200. [Google Scholar] [CrossRef]
Lin, Y. Time Series Forecasting by Evolving Deep Belief Network with Negative Correlation Search. In Proceedings of the 2018 Chinese Automation Congress (CAC), Xi’an, China, 30 November–2 December 2018. [Google Scholar]
Vu, D.H.; Muttaqi, K.M.; Agalagaonkar, A.P. Combinatorial Approach using Wavelet Analysis and Artificial Neural Network for Short-term Load Forecasting. In Proceedings of the 2014 Australasian Universities Power Engineering Conference (AUPEC), Perth, Australia, 28 September–1 October 2014. [Google Scholar]
Haq, M.R.; Ni, Z. A New Hybrid Model for Short-Term Electricity Load Forecasting. IEEE Access 2019, 7, 125413–125423. [Google Scholar] [CrossRef]
Gram-Hanseen, G. Standby consumption in households analyzed with a practice theory approach. J. Ind. Ecol. 2010, 14, 150–165. [Google Scholar] [CrossRef]
Du, S.; Li, T.; Gong, X.; Yang, Y.; Horng, S.J. Traffic Flow Forecasting based on Hybrid Deep Learning Framework. In Proceedings of the 12th International Conference on Intelligent Systems and Knowledge Engineering, Nanjing, China, 24–26 November 2017. [Google Scholar]
Wang, Y.; Chen, Q.; Gan, D.; Yang, J.; Kirchen, D.S.; Kang, C. Deep Learning-Based Socio-demographic Information Identification from Smart Meter Data. IEEE Trans. Smart Grid 2018, 10, 2593–2602. [Google Scholar] [CrossRef]
LeCun, Y.; Bengio, Y. Convolution Neural Networks for Images; Speech and Time Series; MIT Press: Cambridge, MA, USA, 1988. [Google Scholar]
Pattanayek, S. Pro Deep Learning with TensorFlow: A Mathematical Approach to Advanced Artificial Intelligence in Python, 1st ed.; Apress: New York, NY, USA, 2017; pp. 153–222. [Google Scholar]
Amarasinghe, K.; Marino, D.L.; Manic, M. Deep Neural Network for Energy Load Forecasting. In Proceedings of the IEEE 26th International Symposium on Industrial Electronics (ISIE), Edinburgh, UK, 19–21 June 2017. [Google Scholar]
Kwac, J.; Flora, J.; Rajagopal, R. Household Energy Consumption Segmentation Using Hourly Data. IEEE Trans. Smart Grid 2015, 5, 420–430. [Google Scholar] [CrossRef]
Wang, X.D.; Chen, R.C.; Yan, F.; Zeng, Z.Q.; Hong, C.Q. Fast Adaptive K-means Subspace Clustering for High-Dimensional Data. IEEE Access 2019, 7, 42639–42651. [Google Scholar] [CrossRef]
Lai, G.; Chang, W.C.; Yang, Y.; Liu, H. Modelling Long-and Short-Term Temporal Patterns with Deep Neural Networks. In Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, Ann Arbor, MI, USA, 8–12 July 2018. [Google Scholar]
Stephen, B.; Tang, X.; Harvey, P.R.; Galloway, S.; Jennett, K.I. Incorporating Practice Theory in Sub-Profile Models for Short term aggregated Residential Load Forecasting. IEEE Trans. Smart Grid 2017, 8, 1591–1598. [Google Scholar] [CrossRef]
Charalambous, C.C.; Bharath, A.A. A data augmentation methodology for training machine/deep learning gait recognition algorithms. In Proceedings of the British Machine Vision Conference, York, UK, 19–22 September 2016. [Google Scholar]
Lemley, J.; Bazrafkan, S.; Corcoran, P. Smart Augmentation Learning an Optimal Data Augmentation Strategy. IEEE Access 2017, 5, 5858–5869. [Google Scholar] [CrossRef]
Goodfellow, I.; Dengio, Y.; Courville, A. Deep Learning, 1st ed.; The MIT Press: Cambridge, MA, USA, 2016; pp. 224–270. [Google Scholar]
Zhang, Y.; Chen, W.; Xu, R.; Black, J. A Cluster-Based Method for Calculating Baselines for Residential Loads. IEEE Trans. Smart Grid 2015, 7, 1–10. [Google Scholar] [CrossRef]
Farukh, A.; Feng, D.; Habib, S.; Rahman, U.; Rasool, A.; Yan, Z. Short Term Residential Load Forecasting: An Improved Optimal Nonlinear Auto Regressive (NARX) Method with Exponential Weight Decay Function. Electronics 2018, 7, 432. [Google Scholar] [Green Version]
Sharma, T.; Shokeen, D.; Mathur, D. Multiple K Means++ Clustering of Satellite Image Using Hadoop Map Reduce and Spark. Int. J. Adv. Stud. Comput. Sci. Eng. 2016, 5, 23–29. [Google Scholar]
Abadi, M.; Agarwal, A.; Barham, P.; Brevdo, E.; Chen, Z.; Citro, C. Tensor Flow: Large-Scale Machine Learning on Heterogeneous Distributed Systems. arXiv 2016, arXiv:1603.04467. [Google Scholar]
Pedregosa, F.; Varoquax, G.; Gramfort, A. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
Kong, W.; Dong, Z.Y.; Luo, F.; Meng, K. Effect of Automatic Hyper-Parameter Tuning for Residential Load Forecasting via Deep Learning. In Proceedings of the 2017 Australasian Universities Power Engineering Conference (AUPEC), Melbourne, Australia, 9–22 November 2017. [Google Scholar]
Marino, D.L.; Amarasinghe, K.; Manic, M. Building Energy Load Forecasting Using Deep Neural Networks. In Proceedings of the 42nd Annual Conference of the IEEE Industrial Electronics Society, Florence, Italy, 24–27 October 2016. [Google Scholar]
Ozaki, Y.; Yano, M.; Onishi, O. Effective Hyperparameter Optimization Using Nelder–Mead Method in Deep Learning. IPSJ Trans. Comput. Vis. Appl. 2017, 9, 20. [Google Scholar] [CrossRef]
Torres, J.F.; Troncoso, A.; Gutierrez, D.; Martinez-Alvarez, F. Random Hyper-Parameter Search-Based Deep Neural Network for Power Consumption Forecasting. In Proceedings of the International Work-Conference on Artificial Neural Networks Neural Networks, Gran Canaria, Spain, 12–14 June 2019. [Google Scholar]
Neary, P. Automatic Hyper Parameter Tuning in Deep Convolutional Neural Networks Using Asynchronous Reinforcement Learning. In Proceedings of the 2018 IEEE International Conference on Cognitive Computing (ICCC), San Francisco, CA, USA, 2–7 July 2018. [Google Scholar]
Kim, J.Y.; Cho, S.B. Evolutionary Optimization of Hyper-Parameters in Deep Learning Models. In Proceedings of the 2019 IEEE Congress on Evolutionary Computation (CEC), Wellington, New Zealand, 10–13 June 2019. [Google Scholar]

Figure 1. Comparison of data augmentation strategies: (a) pooling technique and (b) proposed augmentation technique.

Figure 2. Daily load profile of a single household for one month.

Figure 3. Average and its residual load series of a single household.

Figure 4. Auto-correlation of residual load series with different time lagging: (a) auto-correlation (AC) coefficients, and (b) partial auto-correlation (PAC) coefficients.

Figure 5. Centroid load profiles generated using multiple k-means clustering algorithm with different clustering number

k

: (a)

k = 1

; (b)

k = 2

; (c)

k = 3

, and (d)

k = 4

.

Figure 5. Centroid load profiles generated using multiple k-means clustering algorithm with different clustering number

k

: (a)

k = 1

; (b)

k = 2

; (c)

k = 3

, and (d)

k = 4

.

Figure 6. Structure of training and testing data for the proposed method.

Figure 7. The overall procedure of the proposed load forecasting method for an individual household.

Figure 8. Average mean absolute percentage error (MAPE) and root mean square error (RMSE) of ten arbitrary selected households in July: (a) MAPE result, and (b) RMSE result.

Figure 9. Daily average of MAPE of higher uncertain households from Table 3: (a) household 10; (b) household 8; (c) household 9 and (d) household 1.

Figure 10. Day-ahead load forecasting for peak day: (a) lower uncertain household (household 4) and (b) higher uncertain household (household 10).

Figure 11. Day-ahead hourly load forecasting for the day of maximum energy consumption: (a) less uncertain household (household 4) and (b) highly uncertain household (household 10).

Figure 12. Effect of clustering number K on forecasting accuracy: (a) MAPE and (b) RMSE.

Table 1. Hyper parameters for selected deep learning models.

Parameters	BPNN	CNN	LSTM
No. of hidden layers	2 or 3	2 or 3	2 or 3
No. of nodes per layer	32	24	20
Activation functions	ReLU	ReLU	tanh and sigmoid
No. of Epochs (iteration)	150	150	300
Optimizer	RMS-Prop	RMS-prop	RMS-prop
Loss Function	MSE	MSE	MSE
Testing samples	24-h	24-h	24-h

Table 2. Load Forecasting results (mean absolute percentage error (MAPE)) with a number of different hidden layers.

Household	Hidden	Hidden	Hidden	Hidden	Hidden	Hidden
Household	Layer 0	Layer 1	Layer 2	Layer 3	Layer 4	Layer 5
1	23.75%	24.44%	19.26%	21.77%	23.43%	26.77%
2	11.96%	11.25%	11.17%	9.63%	11.51%	12.98%
3	32.10%	32.05%	30.08%	35.87%	38.09%	39.73%
4	9.86%	9.53%	9.65%	10.30%	11.05%	11.47%
5	13.13%	13.59%	11.72%	12.44%	15.47%	15.54%
6	12.88%	12.44%	11.77%	11.43%	12.32%	13.82%
7	11.76%	11.34%	9.77%	12.15%	11.68%	12.82%
8	14.78%	14.97%	13.53%	14.60%	15.12%	16.32%
9	22.19%	22.30%	22.06%	21.41%	22.46%	21.69%
10	37.24%	43.59%	34.72%	46.80%	47.12%	49.25%

Table 3. Load Forecasting results (MAPE) with and without proposed augmentation.

Household	Without Augmentation			With the Proposed Augmentation
Household	BPNN (%)	LSTM (%)	CNN (%)	LSTM (%)	CNN (%)
1	32.17	33.49	43.40	31.36	19.26
2	20.54	21.62	24.77	16.60	9.63
3	39.52	38.15	48.48	37.41	30.08
4	15.56	14.61	18.42	15.47	9.53
5	17.36	16.50	20.85	17.15	11.72
6	17.46	16.85	20.77	16.99	11.77
7	14.61	14.85	17.01	13.36	9.77
8	20.38	20.31	24.08	18.17	13.53
9	42.40	43.11	46.03	28.89	21.41
10	53.71	57.02	66.64	51.28	34.72

Table 4. Load Forecasting results (root mean square error (RMSE)) with and without proposed augmentation.

Household	Without Augmentation			With the Proposed Augmentation
Household	BPNN (kWh)	LSTM (kWh)	CNN (kWh)	LSTM (kWh)	CNN (kWh)
1	0.3601	0.3440	0.4092	0.3156	0.1666
2	0.2614	0.2864	0.3169	0.1253	0.1116
3	0.2382	0.2242	0.2570	0.2160	0.1313
4	0.0954	0.0907	0.1114	0.1397	0.0691
5	0.0790	0.0768	0.0882	0.0744	0.0506
6	0.0757	0.0728	0.0859	0.0769	0.0488
7	0.0663	0.0679	0.0743	0.0635	0.0389
8	0.1083	0.1028	0.1160	0.0960	0.0683
9	0.1423	0.1321	0.1638	0.1072	0.0661
10	0.3323	0.3074	0.3421	0.2984	0.1796

Table 5. Monthly-average of MAPE for the lower uncertain household.

Forecasting Model		Sept.	Oct.	Nov.	Dec.	Jan.	Feb.	Mar.	Apr.	May	Jun.	Jul.
Pooled BPNN	MAPE (%)	16.48	16.72	19.33	16.55	18.21	17.30	20.99	20.22	20.88	26.99	15.02
Pooled BPNN	RMSE (kWh)	0.086	0.099	0.105	0.095	0.099	0.111	0.104	0.110	0.110	0.124	0.106
Pooled CNN	MAPE (%)	15.97	20.66	21.11	18.04	19.80	18.71	22.38	22.12	22.74	28.62	15.89
Pooled CNN	RMSE (kWh)	0.085	0.111	0.116	0.104	0.110	0.121	0.112	0.119	0.120	0.132	0.112
Pooled LSTM	MAPE (%)	14.46	16.33	18.23	15.31	17.31	16.70	23.30	18.76	19.90	26.82	14.03
Pooled LSTM	RMSE (kWh)	0.080	0.099	0.101	0.091	0.096	0.069	0.107	0.106	0.110	0.123	0.100
Proposed Method	MAPE (%)	9.662	11.65	10.53	9.50	9.91	10.34	11.25	10.95	12.07	12.25	9.65
Proposed Method	RMSE (kWh)	0.062	0.061	0.060	0.060	0.061	0.105	0.067	0.075	0.072	0.068	0.070

Table 6. Monthly-average of MAPE for the higher uncertain household.

Forecasting Model		Sept.	Oct.	Nov.	Dec.	Jan.	Feb.	Mar.	Apr.	May	Jun.	Jul.
Pooled BPNN	MAPE (%)	20.07	24.65	34.87	46.06	34.32	36.06	43.42	42.87	36.86	43.25	35.39
Pooled BPNN	RMSE (kWh)	0.086	0.119	0.137	0.156	0.160	0.168	0.185	0.191	0.141	0.152	0.241
Pooled CNN	MAPE (%)	18.80	28.08	34.83	55.80	39.61	37.44	48.49	53.90	40.09	50.07	41.40
Pooled CNN	RMSE (kWh)	0.085	0.129	0.140	0.168	0.174	0.175	0.203	0.204	0.151	0.162	0.246
Pooled LSTM	MAPE (%)	16.998	24.81	32.13	46.47	34.02	33.10	43.09	47.59	33.81	45.15	35.99
Pooled LSTM	RMSE (kWh)	0.080	0.116	0.135	0.156	0.164	0.161	0.176	0.198	0.140	0.153	0.228
Proposed Method	MAPE (%)	13.83	12.79	22.26	30.47	23.51	22.09	26.21	30.89	23.05	29.53	29.12
Proposed Method	RMSE (kWh)	0.062	0.075	0.085	0.101	0.092	0.105	0.106	0.119	0.099	0.084	0.131

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Acharya, S.K.; Wi, Y.-M.; Lee, J. Short-Term Load Forecasting for a Single Household Based on Convolution Neural Networks Using Data Augmentation. Energies 2019, 12, 3560. https://doi.org/10.3390/en12183560

AMA Style

Acharya SK, Wi Y-M, Lee J. Short-Term Load Forecasting for a Single Household Based on Convolution Neural Networks Using Data Augmentation. Energies. 2019; 12(18):3560. https://doi.org/10.3390/en12183560

Chicago/Turabian Style

Acharya, Shree Krishna, Young-Min Wi, and Jaehee Lee. 2019. "Short-Term Load Forecasting for a Single Household Based on Convolution Neural Networks Using Data Augmentation" Energies 12, no. 18: 3560. https://doi.org/10.3390/en12183560

APA Style

Acharya, S. K., Wi, Y.-M., & Lee, J. (2019). Short-Term Load Forecasting for a Single Household Based on Convolution Neural Networks Using Data Augmentation. Energies, 12(18), 3560. https://doi.org/10.3390/en12183560

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Short-Term Load Forecasting for a Single Household Based on Convolution Neural Networks Using Data Augmentation

Abstract

1. Introduction

2. Augmentation Implementation

2.1. CNN with Augmentation

2.2. Extraction of Residual Load Series

3. Proposed Residential Load Forecasting Method

3.1. Generation of Centroid Load Profiles

3.2. Augmentation of Homogeneous Residual Load Profile

3.3. CNN Model for Residential Load Forecasting

4. Simulation Results

4.1. Data Description and Hyper-Parameter Tuning

4.2. Effects of Proposed Augmentation Method

4.3. Forecasting Results in Peak Day

4.4. Monthly Results of Day-Ahead Load Forecasting

4.5. Impact of Clustering Number $K$

5. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Short-Term Load Forecasting for a Single Household Based on Convolution Neural Networks Using Data Augmentation

Abstract

1. Introduction

2. Augmentation Implementation

2.1. CNN with Augmentation

2.2. Extraction of Residual Load Series

3. Proposed Residential Load Forecasting Method

3.1. Generation of Centroid Load Profiles

3.2. Augmentation of Homogeneous Residual Load Profile

3.3. CNN Model for Residential Load Forecasting

4. Simulation Results

4.1. Data Description and Hyper-Parameter Tuning

4.2. Effects of Proposed Augmentation Method

4.3. Forecasting Results in Peak Day

4.4. Monthly Results of Day-Ahead Load Forecasting

4.5. Impact of Clustering Number K

5. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

4.5. Impact of Clustering Number $K$