You are currently viewing a new version of our website. To view the old version click .
Sustainability
  • Article
  • Open Access

31 May 2021

An Improved Residential Electricity Load Forecasting Using a Machine-Learning-Based Feature Selection Approach and a Proposed Integration Strategy

,
,
,
and
1
Department of Electrical Engineering, Superior University, Lahore 54000, Pakistan
2
Department of Electrical Engineering, Government College University, Lahore 54000, Pakistan
3
College of Computer Science and Information Technology, Al Baha University, Al Baha 1988, Saudi Arabia
*
Author to whom correspondence should be addressed.
This article belongs to the Special Issue Machine Learning, Data Mining and IoT Applications in Smart and Sustainable Networks

Abstract

Load forecasting (LF) has become the main concern in decentralized power generation systems with the smart grid revolution in the 21st century. As an intriguing research topic, it facilitates generation systems by providing essential information for load scheduling, demand-side integration, and energy market pricing and reducing cost. An intelligent LF model of residential loads using a novel machine learning (ML)-based approach, achieved by assembling an integration strategy model in a smart grid context, is proposed. The proposed model improves the LF by optimizing the mean absolute percentage error (MAPE). The time-series-based autoregression schemes were carried out to collect historical data and set the objective functions of the proposed model. An algorithm consisting of seven different autoregression models was also developed and validated through a feedforward adaptive-network-based fuzzy inference system (ANFIS) model, based on the ML approach. Moreover, a binary genetic algorithm (BGA) was deployed for the best feature selection, and the best fitness score of the features was obtained with principal component analysis (PCA). A unique decision integration strategy is presented that led to a remarkably improved transformation in reducing MAPE. The model was tested using a one-year Pakistan Residential Electricity Consumption (PRECON) dataset, and the attained results verify that the proposed model obtained the best feature selection and achieved very promising values of MAPE of 1.70%, 1.77%, 1.80%, and 1.67% for summer, fall, winter, and spring seasons, respectively. The overall improvement percentage is 17%, which represents a substantial increase for small-scale decentralized generation units.

1. Introduction

The growing global need for electricity has given rise to a smart solution termed a smart grid. A smart grid system is an intelligent system that can sense and control loads to avoid power outages and is flexible enough to incorporate other energy resources and loads. End users can alter their energy consumption according to their preference of time, price, and significance in various aspects. This approach also enables the customers to minimize their bills, and the grid is capable of rapid outage detection. The smart grid covers a variety of functionalities, including two-way secure communication of information between the electric suppliers and users, and provides measurements via smart meters [1,2,3,4]. Controllable power generation groups include biomass and hydropower, while wind and solar are categorized in the variable and intermittent power generation groups. Load forecasting (LF) is especially required when both groups of energy resources are integrated into the utility grid. In the conventional power management system, only the supply side is managed, while it is required to conduct demand-side management via LF and demand response management. Smart appliances assist in reducing the residential peaks by alerting consumers to shift the load to low peak periods or by shedding the load automatically [5]. Currently, LF and implementation of demand response are used to improve efficiency, ensure system reliability, and achieve the balance of load and supply. This approach is much better than conventional schemes as it offers improved response time, reduced cost, and fewer environmental hazards and runs smoothly if integrated with distributed systems [6]. It also increases energy efficiency and helps national electricity markets to grow by incorporating control and communication systems to maintain reliable power systems. However, the physical implementation of the smart grid is facing both technical and financial challenges.
Household load variation occurs at different times of the day, and careful examination is required to assure flexibility. An aggregator can be used for this examination [7,8]. The aggregator depends upon enabling technologies such as controllable loads, micro-generators, and distributed energy resources [9]. Smart meters act as a gateway between the household and the outside world. A community storage system can store excess energy while the owners of the community and their cooperatives manage the generators and batteries. The aggregator assists in signaling and scheduling demand response to maintain the balance between demand and supply [10]. Numerous hybrid approaches that integrate the expert system of load prediction with many other LF designs have been proposed. For instance, Dash [11] blended fuzzy logic and an expert method. A Korean electric power company’s convolutional neural network with a fuzzy export system was also used to forecast load [12]. Integration of a neural network and a distribution method to predict load for the Egyptian Electricity Holding Company [13] was employed to form an hourly forecasting model.
LF will become an important part of the power system soon. In addition to the smart grid, which faces the technical and financial challenges as described above, LF also needs to ensure accurate predictions with minimal MAPE. In the existing LF model, there are two types of concerns, and the model we propose is better than other models due to the following reasons:
  • The first concern of existing models is that the feature selection approach is not employed for analysis as shown in Table 1. Furthermore, the mentioned methods have the problems of high computational time, complex data, repeated data, and issues with extracting relevant data from a huge quantity of data. In our proposed model (BGA-PCA), historical data of four seasons based on different inputs for 10 houses were exploited, whereas the duplicated, irrelevant, and unimportant data were removed without affecting the information. This contribution reduces the computational time and makes it less complex as compared to the other existing models that do not employ the feature selection approach.
  • The second concern is that the different models existing in the literature have their drawbacks under diverse conditions as demonstrated in Table 1. Our proposed integration strategy aims to improve the success rate by using a combined strategy of time series autoregression, the ANFIS model, and the BGA-PCA model in a single collaborative method. In our model, the final decision is not made by considering all algorithms in an autonomous way; only the best model is considered, based on the historical data for MAPE calculation of each season.
We investigated that some researchers have worked on a GA-based feature selection approach for reducing the complexity and computational time. To the best of our knowledge, the proposed hybrid model of BGA-PCA for feature selection purposes has never been deployed yet in electricity LF. Further improvement in MAPE calculation is made by an integration strategy where the optimistic integration model takes maximum inputs compared with other models that have limitations (demonstrated in Table 1) of fewer inputs. It is a novel attempt to integrate a time series autoregression algorithm, ANFIS based algorithm, and BGA-PCA algorithms for optimal MAPE calculation in LF.
In this paper, data are collected from Pakistan Residential Electricity Consumption (PRECON), for the load forecasting model where 10 houses out of 42 are used for data collection [14]. A load of each building shows variation when the different loads with different seasons are attached to the system. The system proposed in this research is based on residential houses with four seasons: Summer, Fall, Winter, and Spring, whereas the data of weekdays and weekends have also been taken into account. The consumers respond to the power system with the sudden event where power consumption varied. In this regard, the proposed system is based on the four seasons as events where LF is optimized and tries to reduce the Mean Absolute Percentage Error (MAPE). The time-series approach is firstly adopted for collecting the data and optimizing LF; then, a blend of the BGA-PCA feature selection technique is proposed.
Paper contributions are accomplished well and summarized as:
  • Data of 10 houses from PRECON have considered where the LF has optimized by reducing the MAPE.
  • Time series and autoregression algorithms have been developed to fetch the data set and to set the objective function, while proposed equations set the base for further validation, which is verified by improved results.
  • Feedforward ANFIS based on the ML approach is proposed which is better as compared to the previous one.
  • BGA-PCA approach for attaining the best feature selection is evaluated for further improvement.
  • MAPE calculations for the integration strategy are formulated for individual building and obtained remarkable improvement.
  • Our results validate the proposed model by providing 17% in the overall system improvement compared to past work. Moreover, the most optimized value of MAPE is obtained for the four seasons.
The rest of the article is organized as follows: Section 2 summarized the latest related research work as given in Table 1. Section 3 is based on the framework where time series and autoregression algorithms are proposed in the first part. The model is further validated by machine learning (ML) in the second part. The third part is a novel feature selection model based on BGA-PCA algorithms. The fourth part validates the model by the proposed integration strategy. Section 4, Section 5 and Section 6 are the model validation, results discussion, and conclusions, respectively.

3. Proposed Framework

This section is based on the proposed LF techniques and framework such as:

3.1. Time Series and Autoregression Method

The time series forecasting models have a standard approach of collecting historical data for predicting load, and nine different regression algorithms are taken in [46]. In our model, we have computed seven estimation models and three autoregression models with one exponential smoothing. The autoregression models are further estimated at level-II with exponential smoothing, and the algorithm with parameters that have been taken into account is defined in Table 3.
Table 3. Time series and autoregression algorithm.

3.2. Machine Learning Approach of Proposed Feedforward ANFIS

The model is further validated by the ML approach for implementing ANN, while it is a computer model influenced by the brain and nervous system in animals or humans. Each network is a “neuron” integration that can generate values from sources that feed data across to the network. It provides a systematic analysis of neural networks. The majority of the research articles indicated that ANNs can be divided into two categories. Thirty US power companies used a software-based ML platform in 1998 [47]. The radial base feature network, self-organizing maps, grouping, and recursive cognitive network are a few of the neural forecasting models used by LF. The ANFIS model is presented in [48] to forecast the potential load, and we have modified it according to our requirement as five stages computed in this section. An algorithm uses an unmonitored training data principle to construct a load–temperature relationship to forecast 24 h of load [48]. This method uses real-time data as error-correcting function input, and simulation findings reveal that acceptable percent mean absolute error percentage is better than the conventional method. In this regard, the proposed system validates the Feedforward Artificial Neural Network of one-year LF data. The proposed Feedforward ANFIS model consists of five stages where Random Forest training schemes are taken into account, and the training process is given as follows:
Stage 1: This layer is known as the fuzzy fiction layer. At this layer, the received signal of any node will transmit to the other layer. Each node in this layer produces a membership function of the linguistic variable. The outputs are given in the equations where the linear data containing weekly, seasonally, and per year demand are illustrated in Table 3 and considered in Equation (1). The nonlinear data include ambient temperature, dew pony, solar irradiation, peak loads time, off-peak loads time, energy price, sunshine duration, fog duration in winter, and robust weather conditions. Table 3 defines it with the exponential method and level-2 AR methods considered in Equations (2) and (3):
O i 1 = μ A i ( Y y i )   for   I = 1 , 2 , 3
O 2 1 = μ A 2 ( Y t )
O 3 1 = μ A 3 ( Y t )
where Y ( t 1 ) ,   y t ,   and   Y t are the inputs to the nodes as shown in Figure 1.
Figure 1. Flowchart for illustration of training and forecasting processes of the proposed feedback ANFIS algorithm.
μ A i is a Member Function.
A is a linguistic variable that is linked with the node function.
The μA in the model is chosen based on Equation (4):
μ A i ( x ) = e x p [ Y t c i 2 a i ] 2
where a i and   c i are the set of premise parameters.
Stage 2: This layer 2 is the rule layer. It is used as a simplification of the predicter training scheme and minimizes the burden. The output of each node demonstrates the firing strength of each rule obtained by the membership functions given in Equation (5):
w n = μ A i ( Y t i ) × μ A 2 ( Y t ) ×   μ A 3 ( Y t )   for   n = 1 , 2 , 3 , 4 , 5
Stage 3: Each node at this stage is a fixed node and is named N. This layer is known as the normalization layer because it calculates a ratio of firing strength of the rule relative and the sum of firing strengths of previous rules. The output is given by Equation (6):
w ¯ n = w w 1 + w 2 + w 3 + w 4 + w 5   n = 1 ,   2 ,   3 ,   4 ,   5
Stage 4: The nodes at this stage depend on adaptive nodes. The output for each rule is calculated by taking the product of normalized firing strength from the previous phase and the first-order Sugeno Model. This layer is known as the defuzzification layer:
θ n 4 = w ¯ n f n   = w ¯ n ( p n + q n + r n )
where w ¯ n is the output of stage 3, and p i ,   q i ,   r i are the parametric set.
Stage 5: The total output for the proposed model is calculated at this layer. The calculation is done based on the output values of each rule. This is why this layer is called the sum layer. There is a single node here that computes the overall output. The overall output is computed in a single node as illustrated in Figure 1 and given in Equation (8):
θ n 5 = n w ¯ n f n   = n w ¯ n f n   n w ¯ n
Using the ANFIS model, the final output for given premise parameters can be represented as a linear combination of consequent parameters:
f = w ¯ 1 ( p 1 + q 1 + r 1 )   +   w ¯ 2   ( p 2 + q 2 + r 3 )   +   w ¯ 3 ( p 3 + q 3 + r 3 )   +   w ¯ 4 ( p 4 + q 4 + r 4 )   +   w ¯ 5 ( p 5 + q 5 + r 5 )

3.3. BGA-PCA for Feature Selection

Feature selection is used in this research for eliminating the extraneous and redundant data that can improve accuracy and reduce computation time as applied to ML in [49]. In this regard, the BGA is based on the evolutionary perception of genetics and natural selection as it is considered a global solution for optimization problems. In this proposed model, the feature selection is based on BGA along with the blend of PCA as shown by the flowchart in Figure 2. The best feature selection is evaluated in five key steps. The steps are:
Figure 2. Flowchart of the BGA-PCA-based feature selection scheme.

3.3.1. Initialization of Data Inputs and Model Paraments

The model is started with objective functions as BGA works on a binary data set of special values for variables (chromosomes) as a genetic algorithm starts with these variables for indicating the optimal variables for the problem [50]. The model parameters of this section are “1” and “0”, where “1” means that the feature selected for the fitness evaluation and “0” is for the feature that is going to be not selected. After defining the objective functions and model parameters, the next step is the creation of the initial population.

3.3.2. Initial Population

As this feature selection is a GA-based solution, there is a need for two matrices for creating the initial population that is “k” matrix and “l” matrix, where k defines the number of chromosomes and l indicates the length. Quantity and length of chromosomes are computed by population size and the number of genes, respectively. The suggestion for every population is given [51].

3.3.3. Determining Fitness from PCA

Although the time series and autoregression method are used in this research for assessing the deployed fitness function, the objective functions are linked with parameters in Table 3 to evaluate the means square error of the autoregression method. After MSE evaluation and for each subset, there is a need for data reduction. All forecasting work consists of a massive amount of data and facing difficulties of a lot of computational work. PCA has been used to overcome the computational complexity by reducing the number of features and processing time for intrusion detection [52]. The autoregression methods are a function of y ( t ) ,   and two matrices k and l are proposed. Then, the PCA of the linear combination for required value forms as:
P C 1 = 1 y 1 + 2 y 2 + 3 y 3 + + k y l
In addition, the large sample variance is reduced by:
j = 1 l j 1 2 = 1
It indicates and optimizes the problem that is needed as a maximize function formed as:
1 n j = 1 k ( j = 1 l k y l ) 2
The data are further minimized by:
1 n j = 1 n j 1 2 = 0
On the other hand, the main objective of BGA is a minimization of MSE as a loss function for ML approach that is formulated by:
f i t f u n = i n i = 1 n ( T i y t ) 2
where T is a vector function of load demand and n is a training sample or assumptions for forecasting. As earlier mentioned, the term 1 defines the selected values, while 0 defines the non-selected values for the assessment of chromosomes. Each irritation of the BGA-PCA feature selection method decreases the MSE and finds the best objective function value.

3.3.4. Selection or Reproduction

The parameters considered for BGA are given in Table 4. The chromosome length is 48, and the number of iterations is set to be 50. The new population is further produced by crossover and mutation sections. The crossover and mutation functions use two chromosomes as parent-1 and parent-2 in this model, while the crossover function is the child selection.
Table 4. The parameters considered for BGA.
In the parents’ chromosomes, the child must be better than the parent. The XOR function is applied for the binary form where the crossover probability is 0.95. Here are two options, one is yes for the violation of any constraints, and the second is no for the evaluation of the new population. This crossover function can operate a single time or multiple times based on the maximum length of chromosomes. In the mutation process, each bit is checked individually for searching for the best possible solution where its operating function single/multiple times is the same as the crossover function. The mutation probability is 0.20 as it is a genetic disorder of chromosomes, and it creates a random number which uniformly distributed with a maximum length of chromosome size. The proposed BGA-PCA model is chosen by setting the initial population, various probability values for mutation and crossover, maximum iterations, and the stopping criteria. It helps in the selection of a new population ultimately for the best feature subset.

3.3.5. Convergence Condition and Final Feature Subset

The new population is further tested in one step if it succeeds in finding the best chromosome within the iteration limit, decoding it, and considering the best feature subset. On the other hand, the new population might not be better than the previous one. Then, it is suggested to keep the initial value as new. The convergence limits are as follows:
  • When a maximum number of iterations is reached;
  • If fitness values are not better than the previous.
After finding the best chromosome, encode the best fitness score, and finally, choose the best feature subset. This flowchart of this complete process is shown in Figure 2 and is ready for further validation.

3.4. Proposed Integration Strategy

The LF approach is based on a time series and autoregression model, ML method, and the proposed feature selection scheme. In this section, the forecasting is further optimized by reducing MAPE. In this regard, this section integrates all approaches, where data fetching and modeling of time series and autoregression model are considered. One-year data having four seasons with the transformation of per week data, ML, and feature selection are not considered as a final decision because the integration strategy looks toward the best algorithm of the related week of each season. Moreover, the MAPE calculations are modified as computed for the integration strategy in [46] for individual buildings as follows:
M A P E i = 1 W s t = 1 W s | f i t f u n ( t ) D ( t ) | D ( t )
where f i t f u n   ( t ) is a function taking from feature selection concerning time t , D is actual data, and w s is weeks per season. The average MAPE is then further calculated as:
M A P E a v g = i = 1 W s C i M A P E i
The PRECON data are modeled by the proposed time series and autoregression system, where it has seven different estimation models. These consist of three AR models, one exponential smoothing, and three AR models with exponential smoothing. The algorithm is elaborated in Table 3 and validated with a Feedforward Artificial Neural Network of one-year LF data. The proposed feedback ANFIS model consists of five stages and uses real-time data as error-correcting function input. The simulation findings reveal that an acceptable mean absolute error percentage is better than the conventional method. The next step for the model evaluation starts from the feature selection, and a novel BGA-PCA model is presented in the article for the best fitness score. Finally, the best feature subset is chosen as indicated in Equation (13). Moreover, the MAPE calculation for the integration strategy is formulated for an individual building in Equations (14) and (15). This complete sequence of simulations is shown in Table 5.
Table 5. The sequence of simulation.

4. Model Evaluation

In this article, the time series and autoregression models are considered for an ML-based feature selection named BGA-PCA. It is further validated using the integration strategy technique. For this purpose, 10 residential houses are taken into consideration where building area in square feet, number of floors, building year, the ceiling height in feet, total number of rooms, bedrooms, living rooms, drawing rooms, kitchen, connection type, number of people, number of adults (14 to 60), number of children (0–13), permanent residents, temporary residents, number of ACs, number of refrigerators, number of washing machines, number of electronic devices, number of fans, number of water dispensers, number of water pumps, number of electric heaters, number of irons, number of lighting devices, and number of UPS are the parts of data set.
As earlier discussed, the data of 10 houses from PRECON are considered for the LF. The proposed work and Random Forest training scheme are shown in Figure 3 systematized as follows:
Figure 3. Flowchart of preprocessing for the time series autoaggregation method with the proposed framework.
Time series and autoregression algorithms are developed to fetch the data set and set the objective function, while the proposed equations in Table 3 set the base for further validation as improved results validate it. In Table 3, the data set fetching with seven AR methods are computed by different linear and nonlinear data.
The preprocessing of the AR method is illustrated in Figure 3, where the three types of data are fed to feedback ANFIS. It contains linear data of Level-1 AR models, nonlinear data of exponential smooth method, and Level-II AR models as computed in Table 3.
The time series AR data have been considered the inputs of proposed Feedforward ANFIS as formulated in Equations (1)–(3). The overall output of Feedforward ANFIS is computed in Equations (8) and (9).   w ¯ 1 , w ¯ 2 , w ¯ 3 , w ¯ 4 , w ¯ 5 are the five best weights obtained from the ANFIS model. For example, weights are w ¯ 1 = 50 % , w ¯ 2 = 40 % , w ¯ 3 = 30 % , w ¯ 4 = 20 % , w ¯ 5 = 10 % , respectively. Then, the new weights according to Equation (6) could be: w ¯ 1 = 50 150 = 33.3 % , w ¯ 2 = 40 150 = 26.6 % , w ¯ 3 = 30 150   = 20 % , w ¯ 4 = 20 150 = 13.3 % , and w ¯ 5 = 10 150 = 6.6 % . The forecasted weight according to Equations (8) and (9), where f n is the forecasting parameter of each weight according to week, weekend, month, season, and year as parameters defined in Table 3. Assume that f n are 30, 25, 20, 15, 10. Then, the forecast weights are computed as: f = 0.33 × 30+ 0.26 × 25+ 0.2 × 20 + 0.13 × 15+ 0.06 × 10 ≈ 23.
BGA-PCA for the proposed feature selection used in this work eliminates the extraneous and redundant data that can improve accuracy and reduce computation time. The main objective of BGA is to minimize the loss function as computed in Equation (14). The extra parameters, repeated terms, and irrelevant data have been removed by the BGA-PCA algorithms, as shown in the flowchart of the algorithm in Figure 2 and simulation parameters in Table 4. It also shows that the child’s chromosomes are better than the parent’s. In the next step, the XOR function is applied where the crossover probability is 0.95. Whereas there are two possibilities, the first one is yes if the violation of any constraints is found, and the second one is no when the new population is evaluated. This crossover function can operate a single time or multiple times based on the maximum length of chromosomes. In the mutation process, each bit is checked individually for searching for the best possible solution where it is an operating function single/multiple times is the same as the crossover function. The mutation probability is 0.20 as it is a genetic disorder of chromosomes, and it creates a random number that is distributed uniformly with a maximum length of chromosome size. The proposed BGA-PCA model is chosen by setting the initial population, various probability values for mutation and crossover, maximum iterations, and the stopping criteria that help in the selection of new population ultimately for the best feature subset.
MAPE calculation for the integration strategy is formulated for an individual building, and average MAPE calculation is done in Equations (15) and (16), respectively. As per MAPE calculation in Table 6, the average MAPE of house 1 for the summer season is 1.72 as calculated from Equation (16). The weekly coefficient from different algorithms C i are 30%, 25%, 20%, 15%, 10%, and M A P E i of house 1 according to different algorithms are 1.70, 1,77,1.68,1.78, 1.69 as per Equation (15). The integration strategy computed the minimum MAPE as per Equation (16) is M A P E   o f   h o u s e   1 = 0.30 × 1.70 + 0.25 × 1.77 + 0.20 × 1.68 + 0.15 × 1.78 + 0.10 × 1.69 = 1.72 . Similarly, the MAPE of each house for four seasons is calculated, and the average is computed as given in Table 6.
Table 6. LF improvement per building.
The MAPE improvements are discussed in Equations (17)–(19), and the calculated results are demonstrated in Table 6, and Figure 4, Figure 5, Figure 6 and Figure 7 validate the proposed model. The MAPE has improved the system by 0.89, for the summer seasons of house 1, as its value was 2.97 with MAPE and 2.08 without MAPE, taking the MAPE value equal to 1.72. Then, E 1 according to Equation (17) is 2.97 2.08 2.97 = 29.99%, and, after four seasons, the house 1 MAPE average becomes 39% as shown in Table 6. The further improvement is calculated in Equation (18) as 2.08 1.72 2.08 = 17%, where after four seasons, the house 1 MAPE average becomes 23% as shown in Table 6. The overall MAPE improvement of house 1 is calculated using Equation (19) as 39 % 23 % = 16 % , and the average of 10 houses is 17%.
Figure 4. Actual vs. forecasted residential consumption demand of the Summer season and average MAPE of a season.
Figure 5. Actual vs. forecasted residential consumption demand of the Fall season and average MAPE of a season.
Figure 6. Actual vs. forecasted residential consumption demand of the Winter season and average MAPE of a season.
Figure 7. Actual vs. forecasted residential consumption demand of the Spring season and average MAPE of a season.
The highest load is in the summer season, in which July and August were at their peak, and the minimum load is in the winter season, in which January and February were on their minimum value. In addition, the feature selection approach has the following features: day hours, day of the week, weekend, holidays, weeks in a month, months of a season, seasons in a year, ambient temperature, dew pony, solar irradiation, peak loads time, off-peak loads time, energy price, sunshine duration, fog duration in winter, and robust weather conditions. One year of data from July 2018 to June 2019 is taken into account. After deploying the complete model modeled in Section 3 and the complete sequence of simulation illustrated in Table 5, the MAPE for the LF improvement and the MAPE of each house are calculated, where the average MAPE without proposed integration strategy is ( M 1 ) . The average MAPE with only the proposed integrated approach without feature selection is ( M 2 ). This improvement percentage is calculated by:
E 1 = ( M 1 M 2 ) M 2
The average MAPE of 10 houses computed by our proposed model is ( M 3 ). Then, the improvement is formulated by:
E 2 = ( M 2 M 3 ) M 3
The overall improvement percentage can be calculated by:
E = E 1 E 2
This complete model is simulated in a MATLAB simulation environment on a research workstation with an Intel Core i5-4300CU, 8 GB RAM, 64-bit operating system, and 2.50 GHz Processor. The proposed model of 10 houses with a one-year hourly database is simulated in 50 min.

5. Results and Discussion

LF improvement for the MAPE of each house is calculated in Table 6. The results of M 1 without proposed integration strategy is 2.90, 3.00, 3.05, and 2.87 percentage for Summer, Fall, Winter, and Spring, respectively. The results are improved with only the proposed integrated approach without feature selection, which is: 2.10, 2.15, 2.20, and 2.05 percentage for Summer, Fall, Winter, and Spring, respectively. The results are more improved with the proposed model illustrated in Table 6, and the average MAPE of 10 houses M 3 is calculated with help of proposed equations and sequence simulation.
The proposed model attained 1.70, 1.77, 1.80, and 1.67 percentage for Summer, Fall, Winter, and Spring, respectively. The overall improvement based on 10 buildings is 17% computed in Table 6.
Randomly, eight weeks of each season are selected to illustrate the forecasting results, where Sunday is considered a weekend and the seasons are: Summer (July–August 2018), Fall (October–November 2018), Winter (January–February 2019), and Spring (March–April 2019). The actual and forecasted results of electricity demand with minimum MAPE are depicted in Figure 4, Figure 5, Figure 6 and Figure 7 for the Summer, Fall, Winter, and Spring seasons, respectively. Besides the overall improvement with our proposed model, being 17% computed in Table 6, the proposed model has some better results of LF as compared with LF models that are given in [53,54], illustrated in Table 7.
Table 7. Comparison: Results in Figure 4, Figure 5, Figure 6 and Figure 7 with past works.
Although the results of our research work are promising and desirable, some limitations of this study are that a new variety of data input like the probability of load shedding, faults occurring, and power failure of the region can also be considered. In addition, seven regression methods are adopted in time series and autoregression algorithms, but this model can consider more AR methods for accurate data preprocessing. Despite these trivial limitations, our trial with a large dataset is sped up and beneficial for the researchers, as it obtained a MAPE of less than 2%.

6. Conclusions and Future Studies

LF is a key tool for better planning and operation of power generation systems in smart grid applications. This paper proposes a new combinational forecasting model based on the following: time series and autoregression algorithm for data fetching, ML-based feedback ANFIS, BGA-PCA algorithm for best feature selection, and a proposed integration strategy for MAPE calculation. We presented a new time series and autoregression algorithm for analyzing the more complex and massive historical data. The time series AR model is further validated using the ML-based feedback ANFIS model where five stages are computed. The BGA-PCA approach for attaining the best feature selection is evaluated and attained more optimized results. A remarkable improvement in MAPE and average MAPE have been achieved by a proposed integration strategy, which gives a 17% better annual improvement compared to previous ones. Mainly, the attained MAPE are as follows: 1.70, 1.77, 1.80, and 1.67 percentage for Summer, Fall, Winter, and Spring, respectively. This model can optimize the performance of power grids by predicting the optimized LF and can overcome the problems related to the planning and operation of the smart grid.
In the future, this proposed model can be implemented for LF with various time horizon locations to validate its effectiveness. Moreover, by taking a new variety of data input for economic studies, the model can be implemented for price forecasting in financial applications. Lastly, we plan to do stability analysis to check the stability of our model through various data sizes of statistically significant data.

Author Contributions

Writing—original draft preparation, Conceptualization, methodology, validation, data curation, and visualization, A.Y.; formal analysis, investigation, and supervision, M.S.; project administration, funding acquisition, Writing—review and editing, R.M.A. and A.U.R. Investigation, Validation, Writing—review and editing, M.S.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not Applicable.

Data Availability Statement

The data presented in this study are openly available in Energy Information Group, PRECON at doi:10.1145/3307772.3328317 [14].

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Gellings, C.W. The perspective of the man who coined the term ‘DSM’. Energy Policy 1996, 24, 285–288. [Google Scholar] [CrossRef]
  2. Asif, R.M.; Ur Rehman, A.; Ur Rehman, S.; Arshad, J.; Hamid, J.; Tariq Sadiq, M.; Tahir, S. Design and analysis of robust fuzzy logic maximum power point tracking based isolated photovoltaic energy system. Eng. Rep. 2020, 2, e12234. [Google Scholar] [CrossRef]
  3. Bayindir, R.; Colak, I.; Fulli, G.; Demirtas, K. Smart grid technologies and applications. Renew. Sustain. Energy Rev. 2016, 66, 499–516. [Google Scholar] [CrossRef]
  4. Malik, F.H.; Lehtonen, M. A review: Agents in smart grids. Electr. Power Syst. Res. 2016, 131, 71–79. [Google Scholar] [CrossRef]
  5. Siddique, M.A.B.; Khan, M.A.; Asad, A.; Rehman, A.U.; Asif, R.M.; Rehman, S.U. Maximum Power Point Tracking with Modified Incremental Conductance Technique in Grid-Connected PV Array. In Proceedings of the 2020 5th International Conference on Innovative Technologies in Intelligent Systems and Industrial Applications (CITISIA), Sydney, Australia, 25–27 November 2020; pp. 1–6. [Google Scholar]
  6. Fallah, S.N.; Ganjkhani, M.; Shamshirband, S.; Chau, K. Computational Intelligence on Short-Term Load Forecasting: A Methodological Overview. Energies 2019, 12, 393. [Google Scholar] [CrossRef]
  7. Wang, F.; Xiang, B.; Li, K.; Ge, X.; Lu, H.; Lai, J.; Dehghanian, P. Smart households’ aggregated capacity forecasting for load aggregators under incentive-based demand response programs. IEEE Trans. Ind. Appl. 2020, 56, 1086–1097. [Google Scholar] [CrossRef]
  8. Khan, F.; Siddiqui, M.A.B.; Rehman, A.U.; Khan, J.; Asad, M.T.S.A.; Asad, A. IoT Based Power Monitoring System for Smart Grid Applications. In Proceedings of the 2020 International Conference on Engineering and Emerging Technologies (ICEET), Lahore, Pakistan, 22–23 February 2020; pp. 1–5. [Google Scholar]
  9. Du, P.; Lu, N.; Zhong, H. Demand Response in Smart Grids. In Demand Response in Smart Grids; Chapter 1; Springer: Cham, Switzerland, 2019; Volume 1, p. 66. [Google Scholar] [CrossRef]
  10. Vahid-Ghavidel, M.; Mahmoudi, N.; Mohammadi-Ivatloo, B. Self-scheduling of demand response aggregators in short-term markets based on information gap decision theory. IEEE Trans. Smart Grid 2019, 10, 2115–2126. [Google Scholar] [CrossRef]
  11. Dash, P.K.; Liew, A.C.; Rahman, S. Fuzzy neural network and fuzzy expert system for load forecasting. IET Proc. Gener. Transm. Distrib. 1996, 143, 106–114. [Google Scholar] [CrossRef]
  12. Bak, G.; Bae, Y. Predicting the Amount of Electric Power Transaction Using Deep Learning Methods. Energies 2020, 13, 6649. [Google Scholar] [CrossRef]
  13. Hamza, A.S.H.; Abdel-Gawad, N.M.; Salama, M.M.; Hegazy, A.; El-Debeiky, S. Electric load forecast for developing countries. In Proceedings of the 11th IEEE Mediterranean Electrotechnical Conference (IEEE Cat. No.02CH37379), Cairo, Egypt, 7–9 May 2002; Volume 1, pp. 429–441. [Google Scholar] [CrossRef]
  14. Nadeem, A.; Arshad, N. PRECON: Pakistan residential electricity consumption dataset. In e-Energy 2019—10th ACM International Conference on Future Energy Systems; Association for Computing Machinery: New York, NY, USA, 2019. [Google Scholar]
  15. Widén, J.; Lundh, M.; Vassileva, I.; Dahlquist, E.; Ellegård, K.; Wäckelgård, E. Constructing load profiles for household electricity and hot water from time-use data—Modelling approach and validation. Energy Build. 2009, 41, 753–768. [Google Scholar] [CrossRef]
  16. Bartels, R.; Fiebig, D.G. Metering and modelling residential enduse electricity load curves. J. Forecast. 1996, 15, 415–426. [Google Scholar] [CrossRef]
  17. Rehman, A.U.; Naqvi, R.A.; Rehman, A.; Paul, A.; Sadiq, M.T.; Hussain, D. A Trustworthy SIoT Aware Mechanism as an Enabler for Citizen Services in Smart Cities. Electronics 2020, 9, 918. [Google Scholar] [CrossRef]
  18. Alahmed, A.S.; Almuhaini, M.M. Hybrid top-down and bottom-up approach for investigating residential load compositions and load percentages. arXiv 2020, arXiv:2004.12940. [Google Scholar]
  19. Narayan, N.; Qin, Z.; Popovic-Gerber, J.; Diehl, J.C.; Bauer, P.; Zeman, M. Stochastic load profile construction for the multi-tier framework for household electricity access using off-grid DC appliances. Energy Effic. 2020, 13, 197–215. [Google Scholar] [CrossRef]
  20. Proedrou, E. A comprehensive review of residential electricity load profile models. IEEE Access 2021, 9, 12114–12133. [Google Scholar] [CrossRef]
  21. Masood, B.; Guobing, S.; Naqvi, R.A.; Rasheed, M.B.; Hou, J.; Rehman, A.U. Measurements and channel modeling of low and medium voltage NB-PLC networks for smart metering. IET Gener. Transm. Distrib. 2021, 15, 321–338. [Google Scholar] [CrossRef]
  22. Hossain, M.S.; Madlool, N.A.; Rahim, N.A.; Selvaraj, J.; Pandey, A.K.; Khan, A.F. Role of smart grid in renewable energy: An overview. Renew. Sustain. Energy Rev. 2016, 60, 1168–1184. [Google Scholar] [CrossRef]
  23. Grosz, B.J.; Altman, R.; Horvitz, E.; Mackworth, A.; Mitchell, T.; Mulligan, D.; Shoham, Y. Artificial Intelligence and Life in 2030: One Hundred Year Study on Artificial Intelligence; Stanford University: Stanford, CA, USA, 2016. [Google Scholar]
  24. Bansal, R.C.; Pandey, J.C. Load forecasting using artificial intelligence techniques: A literature survey. Int. J. Comput. Appl. Technol. 2005, 22, 109–119. [Google Scholar] [CrossRef]
  25. Del Real, A.J.; Dorado, F.; Durán, J. Energy demand forecasting using deep learning: Applications for the French grid. Energies 2020, 13, 2242. [Google Scholar] [CrossRef]
  26. Masood, B.; Khan, M.A.; Baig, S.; Song, G.; Rehman, A.U.; Rehman, S.U.; Asif, R.M.; Rasheed, M.B. Investigation of Deterministic, Statistical and Parametric NB-PLC Channel Modeling Techniques for Advanced Metering Infrastructure. Energies 2020, 13, 3098. [Google Scholar] [CrossRef]
  27. Niu, D.; Wang, Y.; Wu, D.D. Power load forecasting using support vector machine and ant colony optimization. Expert Syst. Appl. 2010, 37, 2531–2539. [Google Scholar] [CrossRef]
  28. Saini, L.M. Peak load forecasting using Bayesian regularization, Resilient and adaptive backpropagation learning based artificial neural networks. Electr. Power Syst. Res. 2008, 78, 1302–1310. [Google Scholar] [CrossRef]
  29. Yoon, B.; Yoon, C.; Park, Y. On the development and application of a self–organizing feature map–based patent map. R&D Manag. 2002, 32, 291–300. [Google Scholar] [CrossRef]
  30. Brandstetter, P.; Chlebis, P.; Simonik, P. Active power filter with soft switching. Int. Rev. Electr. Eng. 2010, 5, 2516–2526. [Google Scholar]
  31. Leung, K.F.; Leung, F.H.F.; Lam, H.K.; Ling, S.H. Application of a modified neural fuzzy network and an improved genetic algorithm to speech recognition. Neural Comput. Appl. 2007, 16, 419–431. [Google Scholar] [CrossRef][Green Version]
  32. Moradzadeh, A.; Zakeri, S.; Shoaran, M.; Mohammadi-Ivatloo, B. Short-Term Load Forecasting of Microgrid via Hybrid Support Vector Regression and Long Short-Term Memory Algorithms. Sustainability 2020, 12, 7076. [Google Scholar] [CrossRef]
  33. Dudek, G. Multilayer perceptron for short-term load forecasting: From global to local approach. Neural Comput. Appl. 2020, 32, 3695–3707. [Google Scholar] [CrossRef]
  34. Khandelwal, I.; Adhikari, R.; Verma, G. Time series forecasting using hybrid arima and ann models based on DWT Decomposition. Procedia Comput. Sci. 2015, 48, 173–179. [Google Scholar] [CrossRef]
  35. Liu, J.; Li, C. The Short-Term Power Load Forecasting Based on Sperm Whale Algorithm and Wavelet Least Square Support Vector Machine with DWT-IR for Feature Selection. Sustainability 2017, 9, 1188. [Google Scholar] [CrossRef]
  36. Siddique, M.A.B.; Asad, A.; Rao, M.; Asif, A.U.R.; Sadiq, M.T.; Ulah, I. Implementation of Incremental Conductance MPPT Algorithm with Integral Regulator by using Boost Converter in Grid Connected PV Array. IETE J. Res. 2021, 67. [Google Scholar] [CrossRef]
  37. Chang, M.-W.; Chen, B. EUNITE Network Competition: Electricity Load Forecasting; National Taiwan University: Taipei, Taiwan, 2001. [Google Scholar]
  38. Wang, H.; Li, G.; Wang, G.; Peng, J.; Jiang, H.; Liu, Y. Deep learning based ensemble approach for probabilistic wind power forecasting. Appl. Energy 2017, 188, 56–70. [Google Scholar] [CrossRef]
  39. Ugurlu, U.; Oksuz, I.; Tas, O. Electricity Price Forecasting Using Recurrent Neural Networks. Energies 2018, 11, 1255. [Google Scholar] [CrossRef]
  40. Zhang, X.; Wang, J.; Zhang, K. Short-term electric load forecasting based on singular spectrum analysis and support vector machine optimized by Cuckoo search algorithm. Electr. Power Syst. Res. 2017, 146, 270–285. [Google Scholar] [CrossRef]
  41. Abedinia, O.; Amjady, N.; Zareipour, H. A New Feature Selection Technique for Load and Price Forecast of Electrical Power Systems. IEEE Trans. Power Syst. 2017, 32, 62–74. [Google Scholar] [CrossRef]
  42. Sheikhan, M.; Mohammadi, N. Neural-based electricity load forecasting using hybrid of GA and ACO for feature selection. Neural Comput. Appl. 2012, 21, 1961–1970. [Google Scholar] [CrossRef]
  43. Urraca, R.; Sanz-Garcia, A.; Fernandez-Ceniceros, J.; Pernia-Espinoza, A.; Martinez-De-Pison, F.J. Improving hotel room demand forecasting with a hybrid GA-SVR methodology based on skewed data transformation, feature selection and parsimony tuning. Log. J. IGPL 2017, 25, 877–889. [Google Scholar] [CrossRef]
  44. Liu, Y.; Yin, Y.; Gao, J.; Tan, C. Wrapper Feature Selection Optimized SVM Model for Demand Forecasting. In Proceedings of the 2008 The 9th International Conference for Young Computer Scientists, Hunan, China, 18–21 November 2008; pp. 953–958. [Google Scholar]
  45. Mangai, U.G.; Samanta, S.; Das, S.; Chowdhury, P.R. A Survey of Decision Fusion and Feature Fusion Strategies for Pattern Classification. IETE Tech. Rev. 2010, 27, 293–307. [Google Scholar] [CrossRef]
  46. Kilimci, Z.H.; Akyuz, A.O.; Uysal, M.; Akyokus, S.; Uysal, M.O.; Atak Bulbul, B.; Ekmis, M.A. An Improved Demand Forecasting Model Using Deep Learning Approach and Proposed Decision Integration Strategy for Supply Chain. Complexity 2019, 2019, 9067367. [Google Scholar] [CrossRef]
  47. Nguyen, G.; Dlugolinsky, S.; Bobák, M.; Tran, V.; López García, Á.; Heredia, I.; Malík, P.; Hluchý, L. Machine Learning and Deep Learning frameworks and libraries for large-scale data mining: A survey. Artif. Intell. Rev. 2019, 52, 77–124. [Google Scholar] [CrossRef]
  48. Jadidi, A.; Menezes, R.; De Souza, N.; De Castro Lima, A.C. Short-term electric power demand forecasting using NSGA II-ANFIS Model. Energies 2019, 12, 1891. [Google Scholar] [CrossRef]
  49. Cai, J.; Luo, J.; Wang, S.; Yang, S. Feature selection in machine learning: A new perspective. Neurocomputing 2018, 300, 70–79. [Google Scholar] [CrossRef]
  50. Eseye, A.T.; Lehtonen, M. Short-Term Forecasting of Heat Demand of Buildings for Efficient and Optimal Energy Management Based on Integrated Machine Learning Models. IEEE Trans. Ind. Inform. 2020, 16, 7743–7755. [Google Scholar] [CrossRef]
  51. Hassanat, A.; Almohammadi, K.; Alkafaween, E.; Abunawas, E. Choosing Mutation and Crossover Ratios for Genetic Algorithms—A Review with a New Dynamic Approach. Information 2019, 10, 390. [Google Scholar] [CrossRef]
  52. Keerthi Vasan, K.; Surendiran, B. Dimensionality reduction using Principal Component Analysis for network intrusion detection. Perspect. Sci. 2016, 8, 510–512. [Google Scholar] [CrossRef]
  53. Haq, M.R.; Ni, Z. A new hybrid model for short-term electricity load forecasting. IEEE Access 2019, 7, 125413–125423. [Google Scholar] [CrossRef]
  54. Elahe, M.F.; Jin, M.; Zeng, P. An Adaptive and Parallel Forecasting Strategy for Short-Term Power Load Based on Second Learning of Error Trend. IEEE Access 2020, 8, 201889–201899. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.