Game Theoretical Demand Response Management and Short-Term Load Forecasting by Knowledge Based Systems on the basis of Priority Index

Demand Response Management (DRM) is considered one of the crucial aspects of the smart grid as it helps to lessen the production cost of electricity and utility bills. DRM becomes a fascinating research area when numerous utility companies are involved and their announced prices reflect consumer’s behavior. This paper discusses a Stackelberg game plan between consumers and utility companies for efficient energy management. For this purpose, analytical consequences (unique solution) for the Stackelberg equilibrium are derived. Besides this, this paper presents a distributed algorithm which converges for consumers and utilities. Moreover, different power consumption activities on the basis of time series are becoming a basic need for load prediction in smart grid. Load forecasting is taken as the significant concerns in the power systems and energy management with growing technology. The better precision of load forecasting minimizes the operational costs and enhances the scheduling of the power system. The literature has discussed different techniques for demand load forecasting like neural networks, fuzzy methods, Naïve Bayes, and regression based techniques. This paper presents a novel knowledge based system for short-term load forecasting. The algorithms of Affinity Propagation and Binary Firefly Algorithm are integrated in knowledge based system. Besides, the proposed system has minimum operational time as compared to other techniques used in the paper. Moreover, the precision of the proposed model is improved by a different priority index to select similar days. The similarity in climate and date proximity are considered all together in this index. Furthermore, the whole system is distributed in sub-systems (regions) to measure the consequences of temperature. Additionally, the predicted load of the entire system is evaluated by the combination of all predicted outcomes from all regions. The paper employs the proposed knowledge based system on real time data. The proposed scheme is compared with Deep Belief Network and Fuzzy Local Linear Model Tree in terms of accuracy and operational cost. In addition, the presented system outperforms other techniques used in the paper and also decreases the Mean Absolute Percentage Error (MAPE) on a yearly basis. Furthermore, the novel knowledge based system gives more efficient outcomes for demand load forecasting. Electronics 2018, 7, 431; doi:10.3390/electronics7120431 www.mdpi.com/journal/electronics Electronics 2018, 7, 431 2 of 34


Introduction
In the modern day world, smart meters offer two way communication between the users and the utilities.This communication leads towards a prevalent computing environment, which develops large-scale data with high velocity and veracity [1].The resultant data also give rise to a time series concept.This phenomenon generally includes power consumption measurements of appliances over a specific time interval [2].The techniques of big data are proficient enough to utilize resultant huge volumes data of sequential time series.Moreover, these techniques also assist in data-driven decision making.Besides, this big data can update utilities to learn power consumption patterns of consumers, predicting demand and averting blackouts.
The utilities are keen on finding the optimal ways for cost reduction.Moreover, electricity companies desire to increase their yields by acquainting their consumers with effective programs like Demand Side Management (DSM) and demand response.Currently, marginal success has been observed in achievement of goals for these programs.However, viable results still need to be achieved [3].Furthermore, implementation of DSM and demand response is a challenging task for utilities.It is difficult to comprehend and conclude the behavior of every individual consumer.Moreover, it is also challenging to customize strategies that include profit contrary to distress from varying behavior of consumers on the basis of energy-saving policies introduced by utilities.Besides, the association between consumer behavior and the constraints that affect power utilization patterns are non-static, i.e., the activities of consumers keep on changing from time to time [4].
Usually, the behavior of consumers is reliant on weather and seasons, which has a capricious effect on power utilization decisions.Thus, active participation of consumers in customized power management is crucial for energy saving schemes.The companies should give timely response on power consumption and associated costs [5].Consequently, it is challenging to design such models that are proficient enough to evaluate energy time series from smart meters.Also, it is stimulating to train the model that predicts power consumption.
The aforementioned discussion helps to study the influence of consumers' behavior on power consumption and to forecast the energy utilization patterns.This analysis can assist the utilities to develop power saving strategies.Moreover, the utilities can design programs to stabilize the demand and supply of energy ahead of time.For instance, short term forecast is related to daily and weekly power usage.This type of prediction is best suitable when there is a need to enhance scheduling and distribution.Alternatively, medium term forecasting is related to weekly and monthly forecasting.Besides, long term forecasting is about yearly predictions of energy consumption.Medium and long term predictions are capable of maintaining the equilibrium between the production of smart grid and strategic scheduling [6].However, such a task is very challenging as it is significant to mine complex interdependencies between appliance usages where numerous data streams are taking place.
Generally, DRM can be characterized in two extents, which are the utilities and consumers.There has been substantial quantity of work done in power systems to maintain the balance between supply and demand [7].However, these studies have laid emphasis on the financial aspects on the planning and production levels.Moreover, these studies are unable to take both consumer and utility as a substantial constituent.Contrariwise, the literature on consumer and utility has presented models to increase user comfort, devoid of taking the cost of power or the profits of the utilities [8].This paper takes motivation from this phenomenon.Moreover, this paper observes the increased profits for consumers and utilities.
This paper analyzes the collaborations between several utilities and consumers.Both entities share mutual objectives, i.e., maximization of their payoffs.The utilities can increase their profits by setting a suitable price per unit.Nonetheless, the users select a specified amount of power to purchase from any utility on the basis of announced prices.Furthermore, the purchasing behavior of consumer is dependent on the prices settled by the company.Likewise, the behavior of utilities is reliant for the prices settled by other utilities.Thus, for solving these challenging collaborations between consumers and utilities, this paper employs a game theoretical framework.This paper presents a Stackelberg game plan between consumers and utilities.In this game, the utilities play a non-cooperative game and the users look for their best optimum response.
The systematic and proficient utilization of electrical power is a hot debate topic in today's world [9].The optimal power management and maintaining balance between demand and supply are considered as challenging tasks for modern power systems [10].Moreover, the prediction of uncertain production of renewable energy resources [11] and short-term load forecasting [12] are measured as significant components of the power grid for optimal power scheduling.Besides, short-term load forecasting has wide applications in the energy market like load scheduling, unit commitment and power production [13].It has been observed in the literature that error maximization in short-term load forecasting can result in substantial growth in the utility operating expenses.Thus, enhancing the accuracy of predicted results is a challenging task and vital issue in power management.
The proximity of choosing a similar day to the target day is very crucial for selecting the similar day along with temperature, according to previous studies.In this regard, this paper proposes a different priority indexing technique for selection of similar days by analyzing the date proximity and temperature similarity.Moreover, the date proximity used in this paper is the total number and nature of days between selected and similar days.In contrast, the historic power load data is categorized according to nature of days in demand prediction.Furthermore, this paper also presents four different day types and two data-sets are presented for utilization of historical power load data.In addition, the proposed knowledge based short-term load forecasting method employs monthly and weekly data for two different data-sets.The best optimum results for short-term load forecasting will be achieved by grouping of prediction results obtained from these two data-sets.
The consideration of exceptional temperature for any region is ineffectual because of variations in temperatures in a vast topographical zone.A vast topographical zone is separated into three climate types in [14].Moreover, the temperature of three cities is labeled as cold, moderate, and warm.The biased integration of these temperatures is presented as the temperature of the huge region.The temperature is taken in [15] and the whole system is distributed in different regions.Besides, the short-term load has been forecasted by some regression techniques.However, the precedence of choosing similar days is also unnoticed in previous studies.
This paper divides the entire system in nine regions.Moreover, the climatic conditions of only one city is chosen from every region.The knowledge based short-term load forecasting is employed to every region after the consideration of temperature.In addition, the predicted power load of the entire system is the aggregate of predicted load of particular regions.The impact of temperature is believed to be much more efficient and result improving when the system is divided.
The proposed system model is employed in Pakistan's National Power Network (PNPN), which is taken as a sample system in this paper.In the proposed system model, Affinity Propagation (AP) [16], and Binary Firefly Algorithm (BFFA) are used as hybrid model.The proposed system model shows a significant decrease in MAPE in comparison with other traditional knowledge based methods.This paper uses algorithms of Deep Belief Network (DBN) and Fuzzy Local Linear Model Tree (F-LOLIMOT) for comparison purposes.The experimental results specifies that the proposed model requires minimum time for computation when associated with DBN and F-LOLIMOT.
The major research contributions of this paper include the proposition of the priority index for selection of similar days by means of temperature of specified regions and date proximity.Moreover, the historic power load is separated in two different data-sets in the paper.Subsequently, the data-sets predict the short-term load and then the final outcome is supposed to be more precise.The final outcomes are achieved by the summation of predicted results from two data-sets.Furthermore, the paper makes the impact of temperature effective by dividing the system in different regions.
The remaining paper is organized in following manner: Section II presents the previous work done, Section III provides a brief discussion of a Stackelberg game and demonstrates the distinctiveness and existence of the Stackelberg Equilibrium.Moreover, Section IV discusses the categorization of knowledge based short-term load forecasting and Section V employs the proposed method on different topographical regions.Moreover, results and their discussion are presented in Section VI and Section VII concludes the paper.

Related Work
The challenges addressed in Section I are also discussed in the literature through methodologies of big-data.A brief discussion of behavioral power consumption data to acquire better energy competence are presented in [6].Likewise, the influence of developmental fluctuations for energy savings was observed by [17].The study also discussed the contribution of consumers to collaborate with the utilities and better energy savings were highlighted.
The literature has proposed many novel methods for short-term load forecasting like fuzzy [18], exponential smoothing [19], regression based [20], neural networks [21], and others.Moreover, every proposed model has incorporated some techniques.For example, regression based processes are usually comprised of Autoregressive Integrated Moving Average (ARIMA) [22], Auto-Regressive Moving Average (ARMA) [23], Support Vector Regression (SVR) [24], and Auto-Regressive Moving Average with Exogenous variable (ARMAX) [25].Nevertheless, it is essential for aforementioned techniques to learn the process by bulks of preceding data for tuning of various parameters.Furthermore, the complexities of these techniques, minimum time of computation and memory essentials of knowledge based model, can initiate a different perspective to knowledge based short-term load forecasting.
In literature, there are some works cited in knowledge based systems that employ a similar day method [26][27][28].Although, there is a lot of room for enhancement in this scenario which can be studied.The authors in [29] proposed a knowledge based system for short-term load demand forecasting.However, the paper overlooked the consequences of temperature.The change in temperature can cause fluctuations in the load demand.Consequently, the effect of temperature must be included in the short-term load forecasting.The different eight day categories are enumerated in [30].
Moreover, average stabilized loads of historic data for every day has been evaluated by means of least and maximum load per hour.Furthermore, the least and maximum load for 11 days was forecasted by means of regression techniques.The Mean Absolute Percentage Error (MAPE) of Irish electrical power system attained was 2.52%.Moreover, the temperature was also incorporated in this study and was associated with 3.86% by the statistical technique in [31].
The authors in [32] calculated the weighted mean load of every hour for three preceding and similar days for short-term load forecasting.Moreover, the impact of temperature on prediction of short-term load is also considered by means of exponential association between power demand and temperature.Likewise, the mean prediction error for a daily peak load of France was attained 2.74% in [32].Besides, the consequences of temperature, wind pressure and humidity, was scrutinized in [33].The MAPE calculated in this study was 1.43%.The study in [23] was almost equivalent to the proposed model presented in [22].Moreover, the MAPE achieved in this study was between 1.23% to 3.35% in seven different states of America [34].
The mean prediction error for daily peak load in [24] was achieved 4.65% for weekdays and 7.08% for weekends of three different states of Turkey [35].This mean prediction error was achieved after smoothing the temperature discrepancies throughout the day.The precedence of similar days is overlooked in previous studies.It is obvious that there are numerous days which are advantageous for the knowledge based forecasting of load.Nevertheless, the best suitable preference of these same days has a substantial effect on forecasting results.
The consequences of temperature are neglected in [36] in terms of priority index.Moreover, in [37] a priority index for medium term load prediction was presented.The proposed model was based on the similarity of temperature for the selected day.The mean error achieved in [37] for Western States of America was 3.25% for summer season.Besides, few values of error were attained that were more than 6%.Though, the temperature was the only parameter, which was assessed in this study and the proximity of chosen day to similar day was ignored.It is a well-known fact that same days do not have alike temperature.Moreover, the similar days must be near to the target days in order to avert the selection of similar days with similar temperature and different power load.
The work presented in [38] used the Bayesian network to forecast activities of different residents by a particular appliance.However, the model was not efficient enough to be functional towards real world circumstances.The authors in [39] and [40] discussed a multi-label and time sequence based classifier model for a decision tree taking appliance association as a correlation.The basic purpose of their model was to predict the power consumption of the appliance.Though, the authors merely observed the past 24 h frame for future forecasting.
The work in [41] presented the association rule mining method to classify the interdependence between power consumption and appliance usage to help power saving, anomaly detection, and demand response.Nevertheless, this work lacked the proper rule mining process and appliance-appliance association.
At present, Artificial Neural Network (ANN) and SVM are considered to work efficiently for non-linear time series sequences.Karatasou et al. [42] demonstrated the practical implementation of ANN in forecasting power expenditure of a building accompanied by statistical study.In [43,44], a model is presented which hybrids the Support Vector Regression (SVR) and Immune Algorithm (IA) to estimate local yearly report and power load in Taiwan.
Zhao et al. [45] presented a framework, which employed SVM to predict residential power utilization in the humid area.Moreover, the study took meteorological conditions of that particular area.Besides, Xuemei et al. [46] suggested Least Square Support Vector Machine (LS-SVM) for chilling load prediction [47] for a residential zone in Singapore.The forecasting was done by hourly weather information.
Wang et al. [48] discussed that the SVM based models have proven to be efficient as compared to ANN and ARIMA configurations.They employed Differential Evolution (DE) and SVM to predict the configurations for yearly energy consumption.Conversely, the development of SVM model is influenced by the category and constraints of the kernel function.Generally, it is discussed in literature that the tuning constraints of SVM is a challenging task [49].In addition, a number of models are presented in literature to tune the parameters of SVM by techniques of machine learning and artificial intelligence.
Ogliari et al. [50] proposed a hybrid model using Neural Network and Genetical Swarm Optimization for energy prediction.The authors in [51] combined SVM with algorithms of Simulated Snnealing to predict yearly load.On the subject of optimization techniques, Jaya Algorithm has achieved attention in the last few years as a metaheuristic computing technique.The authors in [52] and [53] observed that Jaya Algorithm outperforms other optimization techniques.Moreover, Jaya Algorithm has also been employed for various real work applications.
There is a variety of literature available on the topic of game theory and DRM.In [54], the authors have discussed power utilization and forecasting as a non-cooperative game plan.This basic aim was to maximize the cost functions.Likewise, the authors in [55] have proposed a distributed set-up.In this set-up, the cost function is demonstrated by its dependence on inclusive load.The consumers adjusted their behavior for power consumption on the basis of cost function introduced by the utility.The authors in [56] presented a theoretical framework for mutual optimization of investment and functioning of a smart grid.Moreover, the aspects of power storing, renewable energy integration, and demand response were taken into consideration.The paper signified the sharing of portfolio decisions, day-ahead pricing, and scheduling.They also presented the benefits of integrated renewable energy and demand response in terms of minimizing the sharing cost.
A robust optimization has been discussed in [57] in order to increase the utility of the end-user by hourly prediction.The study presented in [58] laid emphasis on the knowledge and interest of users to be aware of the announced electricity prices.The study proposed a technique to cope with preferences of the consumers to increase power competence and consumer satisfaction.Moreover, a dynamic cost price has been introduced to motivate users for attaining a cumulative load [59].Also, this load was handled by different utilities and DRM was scrutinized for bi-directional communication between consumers in the micro-grid.The authors in [60] and [61] discussed the dynamic pricing in detail for smart energy model of a smart grid.The discussed model was dependent on renewable energy sources, which were further integrated with intelligent control that processed information from a smart metering devices.
The studies discussed above are inadequate to meet the needs, i.e., the electricity firms considered utility companies as a single firm.This study differs in this context as this incorporates numerous utilities and consumers.Moreover, the basic aim of both entities is to increase their profits (remunerations) by game theoretic approach.Besides, there is a broad literature and findings available on the Stackelberg game on the topics of profits maximization, congestion control, and interactive communication [62,63].

Game Theoretical Problem Formulation
This study takes n consumers and U C utility companies in consideration.Besides, the energy sources of the utilities include non-renewable and renewable resources.In literature, it is observed that power generators, which are centered on the energy of fossils utilize a definite amount of energy.Moreover, the energy of fossils is also supposed to be harmful for the environment.Contrarily, renewable energy sources are considered environmentally friendly.However, renewable resources have inherent natural stochastic behavior, which makes it difficult to predict and control.The studies show that uncertainties are common with renewable resources.Furthermore, Markov chain (discrete time) has been extensively employed in literature for the generation of power from renewable resources [64].
This study takes residential type consumers into account.In addition, all users have dissimilar requirements for power consumption.The study also distinguishes the users based on their financial plans; i.e., purchasing power of electrical energy.Likewise, this study proposes a utility function for every consumer.The function shows an increment using the total expanse of power that any consumer is able to utilize.Moreover, this paper integrates cost parameters for every consumer.
The U C and n have established a two way communication using the advanced metering infrastructure for pricing swapping and information sharing.Conversely, U C can also communicate with one another.The n collect the value (cost) facts from the U C. In return, the U C then provide their services to n.
Power initiation, dissemination, and expenditure can be divided in three ways [65]: Power generators, U C, and n.This paper emphasizes the communication between n and U C. Moreover, this paper assumes that U C show a fluctuating behavior at the business level.Inspired from the game theory models, the U C can play a vital part in an economical marketplace.No participant is capable enough to affect the market price of electricity through his particular activities.Thus, the market price is such constraint over which U C have no control.Moreover, the U C need to increase their production up to the point where the minimal cost is equivalent to the cost of the market.This phenomenon occurs once the total contributors increase and no contributor is authorized to govern an enormous power generation quantity.Nonetheless, this study proposes a predetermined figure of U C (contributors).This scenario depicts that every utility will announce its own price according to its generation capacity.Table 1 shows the list of symbols used.
Threshold estimate in SVM

Analysis of User and Utility Company
The cost for every consumer shows fluctuation when there are various utility firms having diverse electricity costs.Moreover, the setting of cost is highly reliant on the rates of other U C. In this regard, game theory offers an ordinary pattern to represent the activities of n and U C. Consequently, the U C settle the cost for each unit of energy and then publicize this to consumers.The users then respond back to the cost by demanding an optimal amount of power from the U C. In this case, U C play first.The consumers then decide on the basis of announced prices.Moreover, both events are in sequence.The events are that the utilities play primarily and at that time the consumers decide their verdict based on the cost.Hence, this paper models the communication between the U C and n by a Stackelberg game [66].The proposed game model takes the U C as influential (leaders) and users as followers.Moreover, the proposed model also considers the events as a multiple leaders and followers game.

Analysis of User Side
Assume that d n 0 ,uc 0 is the request of consumer n 0 from a utility uc 0 .Hence, the value of a consumer n 0 , C cons,n 0 can be expressed as: Here γ n 0 and τ n 0 are constants.Also, the ln function is extensively employed in literature for user making decisions [67].The valuable function used for consumer n 0 in Equation ( 1) is interrelated to the function γ n 0 ∑ ln d n 0 ,uc 0 .The consumer will recompense -∞ when the valuable function γ n 0 ∑ ln d n 0 ,uc 0 is used regarding uc 0 , such that, d n 0 ,uc 0 = 0.When d n 0 ,uc 0 and τ n 0 are equivalent to 0, then benefit of n 0 regarding uc 0 begin to be finite.Generally, the representative cost of τ n 0 = 1.
Suppose κ uc 0 is the per unit cost given by any utility company uc 0 and B n 0 ≥ 0 is the total expenditure of any consumer n 0 .Each uc 0 has given a distinct price rates of electrical energy [κ 0 , κ 1 ,......, κ c ] when n 0 ∈ n.Subsequently, the n 0 computes the best demand response through resolving best optimum solution (OS cons ) given in Equation (2). where OS cons is a convex optimization problem.Therefore, the obtained solution is distinctive and optimal.
This paper considers the scrutiny accompanied by U C consumers and three U Cs. Thus, they seek for best optimum solution in this scenario for a specified uc 0 can be expressed as follows: where The paper employs Lagrange multipliers (Λ n 0 ,1 , Λ n 0 ,2 , Λ n 0 ,3 ) for the respective U C and setting of parameters as discussed above.Thus, the Equation ( 4) can be rewritten as: The values of the Lagrange multipliers are used as strategies for finding the local maximal and minimal of the function subjected to inequality constraint.Thus, it improves the performance of Equation ( 5).
The first order optimality condition for linear, best optimum solution and maximization problem is by setting ∇υ cons = 0. Here, υ cons = (υ cons ∀n 0 ∈ n).All of the n are interconnected by κ c .Also, ∇υ cons = 0 shows that, Also, and Next, this paper has considered four of the cases, which the n 0 can avail.

Case 2
If d n 0 ,1 > 0 and d n 0 ,2 are equivalent to 0, then As discussed above that Λ n 0 ,2 d n 0 ,1 corresponds to 0. This paper derives Equation ( 13) by considering the cost of the first utility.
This paper further expands Equation ( 6) to include extra parameter and ease simplification.Thus, Equation ( 14) is derived.
As Λ n 0 ,1 > 0 and γ n 0 − κ 1 Λ n 0 ,1 τ n 0 − B n 0 = 0, which refers to the point that Λ n 0 ,1 = γ n 0 /(κ 1 τ n 0 + B n 0 ).Now, evaluating this in Equation ( 13), Equation ( 15) is now equivalent to B n 0 /κ 1 .Moreover, Equation ( 15) can also be presented as: where If d n 0 ,1 is equivalent to 0 and d n 0 ,2 > 0, then the identical scrutiny can be valuated as specified in Case 2. This paper considered the cost of the second utility; thus, the demand of users with respect to the second utility is given in Equation ( 17).

Case 4
If d n 0 ,1 and d n 0 ,2 both are equivalent to 0, then Λ n 0 ,1 , Λ n 0 ,2 , and Λ n 0 ,3 are real and positive values.It is noted that Case 4 is assumed as best case which rarely occurs only when κ uc 0 = ∞ or else B n 0 ≥ 0. This paper has satisfied the power and cost parameters as equalities in Case 1, 2, and 3.However, this scenario cannot be mapped on Case 4. This study further assumes that there are n consumers in total and U C utilities that satisfies the equality conditions in previous cases for a given set of κ uc 0 .So, Equations ( 12), (16), and ( 17) can be combined in the above discussed scenario as: In Equation ( 18), d n 0 ,uc 0 ≥ 0, n 0 ∈ n and uc 0 ∈ U C. As d n 0 ,uc 0 ≥ 0. So,

Analysis of Utility Companies
This study assumes that E uc 0 (U C ∈ uc 0 ) depicts the available electrical energy of U C. The aim of every U C is to vend the energy to gain maximum profit.For instance, if there is only one U C then this firm will settle the price range according to its ease as there is no competition involved.However, this study takes two basic strategies that decide the cost range of any U C. Firstly, it can be the economical conditions of average consumers and secondly, it could be an aspect of competitiveness among U Cs. Furthermore, the U Cs also take part in choosing the best optimum cost (game) with another.Additionally, this study expresses the maximum profit E prod,uc 0 of any U C as: Here, ξ uc 0 +1 is cost of U C apart from uc 0 .Thus, the best optimum solution for any U C can be related in terms of OP prod and can be expressed as: where ∑ n∈n 0 d n 0 ,uc 0 ≤ E uc 0 and κ uc 0 > 0, ∀U C ∈ uc 0 .The maximum profit of any U C is fluctuating in relation to energy for a constant κ uc 0 .According to Equation (20), this phenomenon leads to parameters of equality.Every U C proffers to vend all its energy to consumers.This paper assumes υ prod,uc 0 to resolve OP prod by: The best optimal solution for the U C furthers presents ℘υ prod,uc 0 /℘κ uc 0 , which is equivalent to 0.
where ρ = ∑ n 0 ∈n τ n 0 and B = ∑ n∈n 0 B n 0 .Moreover, the conditions used in Equations ( 21) and ( 22) express U C equations.Now, solving these three U C, this study sets ,uc 0 can be evaluated by means of κ + .Consequently, employing Equation ( 18) for uc 0 , Now, using the current value of κ uc 0 this study observes that, It can also be deduced from Equation ( 23) that ζ uc 0 = ρκ uc 0 (U C − 1).Also, ρ and B ≥ 0. It refers to the phenomenon that there is no essential need to play any game when U C = 1.Therefore, the study merely focuses on the circumstances when U C ≥ 3. To handle the discussed scenario, Equation ( 23) can now be computed as: Here , and S = BU C −1 .From the above equations, it can be concluded that M is an invertible matrix.However, it could be expressed as: This paper considers some cases to achieve closed-form solution of κ.

Case 1
All the U C have equivalent amount of energy available and capacity to produce, then 26), Likewise, Also, E ∝ 1 κ .Then the Equation ( 27) is used in Equation ( 19), so that the total demand to any U C from n 0 is given as: Here, Equation (30) indicates that B n 0 ≥ 0. This phenomenon indicates that now all U Cs produce equivalent amount of power.Moreover, they have settled some pricing scheme that users have to follow.

Case 2
Contrary to Case 1, this case considers that capacity of power generation is different for all U Cs.The M in Equation ( 26) has some unique aspects, which relates that a real valued matrix where According to [68], a taut diagonal matrix is always non-singular and |M| is positive.It is observed that M is taut and diagonal matrix as Consequently, E uc 0 > 0. Thus, M is invertible.
Theorem 1.The distinctive solution achieved from M is positive.
Proof of Theorem 1.The solution of M is deduced by Since |M| is invertible; thus, |M| is positive if its eigenvalues are non zeros and show a symmetry property.Also, the solution presented in Equation (32) depicts that κ uc 0 > 0.
Theorem 2. The cost function discussed in Equation ( 27) is a best optimum solution for raising profits.
Proof of Theorem 2. Let the solution gained from Equation ( 27) be κ uc 0 for any uc 0 .Moreover, this paper assumes that uc 0 has increased the cost from κ uc 0 to κ * uc 0 , while U C have same cost of power generation.From Equation 19, suppose that any consumer n demands power d n 0 ,uc 0 > 0 from any κ uc 0 then the constraint in Equation ( 33) is satisfied.
Now suppose that κ uc 0 and κ * uc 0 fulfil the requirements of Equation (33).In this regard, the necessities of consumers will show deviating behavior from d n 0 ,uc 0 to d * n 0 ,uc 0 as: The differentiation among the necessities of any n from the firm uc 0 will now be expressed as: From Equation (35), it is obvious that d n 0 ,uc 0 − d * n 0 ,uc 0 > 0. Hence, the consumers are not capable of demanding the total power generated by any uc 0 , i.e., the consumer will then demand for lesser energy as required.Moreover, the profit and cost of uc 0 will increase on the basis of consumer total power demand.Thus, Equation (36) provides the balanced equation of demand and supply.
It is observed that in Equation ( 36), E * prod,uc 0 (κ * uc 0 , ξ uc 0 +1 ) < E prod,uc 0 (κ uc 0 , ξ uc 0 +1 ).Thus, the profit gaining of uc 0 leads towards the loss and it is concluded that the price function presented in Equation ( 27) is the best optimum function as it will result in financial advantage.
On the subject of range of κ uc 0 , κ uc 0 ∈ [κ uc 0min , κ uc 0max ].As a matter of fact, κ uc 0min is owing to the cost functions that is generated by U C. Moreover, any uc 0 is not capable to lessen the price lower than κ uc 0min .Nonetheless, κ uc 0max is the maximum range.According to κ uc 0max , the government has to settle the cost, which consumers have to follow.

Proposed Stackelberg Game Modeling
All the U Cs partake to play the non-cooperative game with one another in order to settle the price that will be further used by consumers.This is a critical point where Nash Equilibrium is required.In a Stackelberg game, the equilibrium strategy for the followers is defined as any strategy that is compromised of the best response.The response is optimal as compared to the strategy that is adopted or announced by the leaders [69].
This study assumes that prod,uc 0 is the game-plan rectified for any uc 0 and cons,n 0 is the scheme planned for n 0 .Subsequently, the game-plan for U C will be prod = prod a × prod b × • • • × prod,U C and for n 0 will be cons Here, is the game plan of all consumers n.Moreover, d and κ + are best feedback of all consumers, i.e., d ∈ cons .The best feedback of any consumer n 0 for any particular Here, χ cons,n 0 = d + n 0 ∈ (κ + )d n 0 .Thus, d + n 0 is supposed to be best optimum scheme for n 0 .Besides, d + and κ + is a Stackelberg equilibrium achieved for the game concerning the U C and n.

Distinctiveness of Stackelberg Equilibrium
The OS cons has an exclusive maximum range (as discussed above) for κ.Whenever the cost planning game is played between the companies with a distinctive Nash Equilibrium, then the Stackelberg game plan holds a special equilibrium.Theorem 3.An exclusive Nash equilibrium occurs in the cost selection game plan between U C. Likewise, a distinctive Stackelberg equilibrium subsists as well.

Proof of Theorem 3.
There is equilibrium if κ is a real value and ⊂ R U C .Moreover, κE * prod,uc 0 is constant in κ.On the topic of cost choosing of all U C in Stackelberg game, prod = ( prod,1 × prod,2 × • • • × prod,U C ).Here, κ uc 0 ⊂ prod,U C .Moreover, prod = [κ uc 0 ,min , κ uc 0 ,max ].Therefore, the game plan is real value and ⊂ R U C .
Furthermore, E prod,uc 0 is constant in κ uc 0 as discussed in Equation (20).Subsequently, the f (E prod,uc 0 ) according to

Distributed Algorithm
The users are now proficient enough to compute their optimum demands on the basis of the cost function provided by the utility companies as discussed in the preceding section.However, the different utilities show the significant response to policies announced by other companies.Moreover, it is essential to calculate the price per unit.For this purpose, uc 0 should know the production capacity of other utilities.Contrary to this, this paper proposes a distributed algorithm that further proves the Stackelberg equilibrium of the game.The equilibrium is established in such a way that utilities are not able to identify the constraints of each other.
The uc 0 establishes a subjective cost and then conduct their cost statistics to the users.This communication is done efficiently by setting an interactive environment for utilities and consumers.As a consequence, the consumers choose specific amount of electricity they need to purchase from uc 0 .
All the U C acquire these demanding conditions from consumers.At that moment, uc 0 will analyze the contrast among the available electrical energy and the entire energy needed by consumers from the company.The U C will upgrade its price per unit with the help of Equation (40).
In Equation (40), r is the repetition number and δ uc 0 is the rate modification constraint of uc 0 .Whenever a uc 0 updates its cost function, it sends this information to n 0 .Furthermore, the n 0 update the demands and send this information back to uc 0 .Subsequently, the U Cs will also update their cost functions sequentially.Thus, the procedure lasts until the cost function shows convergence.Algorithm 1 supposes that n 0 = 1 specifies the first consumer.
Algorithm 1 meets the best optimum solution for all U C and n as the particular game plans are upgraded in a specified order.
Proof of Theorem 4. The feedback of a consumer as specified in Equation ( 18) is best optimum solution for a particular κ uc 0 .Whenever, the cost per unit shows a converging behavior then the demand of every consumer will coincides towards an established set.Therefore, it is necessary to discuss the converging behavior of cost in order to demonstrate the changing performance of Algorithm 1.
Equation (40) suggests that the cost κ uc 0 amplifies only if d n 0 ,uc 0 ,r ∑ n 0 ∈n −E uc 0 gives positive results and vice versa.However, in Equation ( 40), when d n 0 ,uc 0 ,r ∑ n 0 ∈n −E uc 0 = 0 the price value is not changed.This particular condition is the established stage to which Algorithm 1 shows converging behavior.This stage is the Nash Equilibrium of the game plan (Stackelberg game between n and U C). Afterwards, the U C will not show any fluctuating behavior.

Knowledge Based Short-Term Load Forecasting
Knowledge based systems and computational intelligence are considered as major tools of artificial intelligence.The knowledge based systems employs categorical representations of knowledge like symbols and words [70].The knowledge based systems are efficient and simple as the categorical representation makes the knowledge readable and implicit for a human as compared to numerical derived models in computational intelligence.The techniques of knowledge based systems incorporate case based, model based, and rule based systems.
The major difference between a traditional program and knowledge based system is in their structure [71].The knowledge of the domain is closely associated with software for monitoring the performance of that particular knowledge in a traditional program.However, the roles are clearly divided in knowledge based systems.Moreover, there are two basic components of knowledge based systems, which are knowledge base and inference engine.Nonetheless, some interface proficiencies are also compulsory for a real-world system, as presented in Figure 1.The paper categorizes knowledge based short-term load forecasting as classic and proposed.The explanation of each is given below.

Classic Knowledge Based Short Term Load Forecasting
All categories of days are quantified initially in a classic knowledge based forecasting on the basis of annual and weekly load curves.Moreover, this type of categorization of days is usually associated with the user consumption behavior of a particular state.Besides, the annual growth rate in load demand also plays a significant role in typical knowledge based forecasting as historical load data are also required.The annual growth in load demand is mostly reliant on different aspects like growing economy or population.Consequently, normalization and stabilization of load data are considered crucial in order to lessen the consequences of annual growth rate.Likewise, normalization of data is also beneficial to determine similarities in load curves more precisely [72].The hourly data normalization of load demand is attained by distribution of load on hourly basis [73], which is shown in Equation (42).
In Equation ( 42), Γ S d ,H is the load demand, Γ S d ,H is the normalized value of data at any hour H of a similar day S d , and X is the mean of n preceding days.In addition, H = 1, 2, 3, • • • , 24.The load demand at any hour H of a target day can be obtained by normalization of load demand data of chosen similar days and average load of n preceding hours, which is presented as: In Equation (42), tar indicates the target variable, which is predicted by the model for a specified day.Moreover, Γ tar,H is the predicted demand load for any hour H, γ is set of identical days, and D γ is total number of days chosen, which are similar.The minimum value of D γ reduces the utilized historic data and inadequate similar days, which are selected.Contrarily, the maximum value of D γ indicate that γ is comprised of vast historic data.Besides, a few number of days may have not sustainable correlation with selected day according to this scenario.

Proposed Knowledge Based Short-Term Load Forecasting
The paper proposes a novel hybrid data mining technique in order to predict the load demand by knowledge based systems.The proposed algorithm basically consists of two parts.The clustering technique AP is used initially.The AP is employed in this scenario as it looks for noise in data and then removes this noise from data, thus, decreases the instances of data.Subsequently, BFFA is used in the next step for feature selection and classification.Furthermore, Support Vector Regression (SVR) is used as classifier model in this proposed hybrid model.This proposed hybrid model chooses the most relevant target variables and increases the accuracy of the system.Moreover, the proposed knowledge based system is able to minimize the operational cost and maximizes the process of data mining for selection of similar days.
The proposed knowledge based short-term load forecasting is categorized in three parts, which are explained as follows.

Distribution of Historic Load Data
The selection of similar days from historic days is considered as crucial for knowledge based forecasting.Moreover, the selection of similar months and days also have a significant impact on the results of short-term load forecasting.Therefore, this paper presents two historic data-sets, which are well-defined for every type of days.The first data-set is comprised of similar days from preceding month along with the selected date.Furthermore, the second data-set incorporates same days from seven days earlier and subsequent to the target day of the week.The target year and similar days are also chosen from all preceding years in both data-sets.Besides, the data-sets are specified by scrutiny of annual load demand and meteorological conditions of Pakistan.
It is a well-known fact that temperature and load demand have a direct relationship with each other.For example, usage of air conditioners and other cooling devices increases in summers especially.This phenomenon shows variations in load curve and peak hour of the entire system.Moreover, the impact of climatic conditions on the load demand in summers is usually more than other time of year [74].
Figure 2 illustrates the load curves for Thursday as an example.Moreover, this load curve is for Pakistan and depicts all four seasons.It is obvious from Figure 2 that the load level and hourly peaks by day and nights shows a significant fluctuation in different spells.Therefore, it can be determined that by maximization of the measured time, the range of both data-sets may affect the selection of similar days with similar temperature.However, this phenomenon is not suitable for load curves because changes in climate also affect load consumption behavior.
In the first data-set, the same days are chosen from days that have equivalent month along with the target day.Moreover, this paper has assumed that the selected day can also be similar to its month or preceding month.Contrary to this, load curves from seven days earlier and subsequent to the target day is more comparable to the target day when associated to load bends of the preceding month.Consequently, the other data-set specifies the consideration of these days in a data-set.Moreover, this paper assumes that this data-set must have a maximum weightage factor, in contrast to the first data-set.The priority index for both data-sets can be evaluated by Equation (43).The paper valuates the final results from the combination of results achieved from both data-sets as: In Equation ( 44), Γ ds 1 tar,H and Γ ds 2 tar,H are forecasted power load demand specified for each hour H and targeted day tar.Moreover, W 1 and W 2 are weights assigned to each data-set.Thus, Γ tar,H is the final forecasting achieved by system for each hour H and targeted day tar.
The proposed methodology for knowledge based forecasting is comprised of two main constituents, which are W 1 and D γ .Furthermore, the proposed method must also execute for training data-set in order to choose the best optimum values of W 1 and D γ .Subsequently, the proposed method should be proficient enough to select the execution, which gives the least prediction error.Besides, the values of W 1 and D γ are then selected as the optimal ones in order to predict the target day.Moreover, this paper also assumes that the next 24 h are forecasted by preceding load demand data and predicted loads of the day.This load demand data is achieved after prediction of the first hour of tar day by preceding load demand data.

Priority Index for Same Day
In knowledge based short-term load forecasting, temperature has a significant role.The fluctuating behavior of climate and weather throughout a week or month shows a significant effect on load curves.Therefore, it is a vital part in choosing similar days for target year.Conversely, there can be different motives that are the cause of divergence for load curves.For instance, the power evaluating strategies and variations in utilization behaviors of Pakistan alter the levels of load demand.Thus, the selection of similar days along with date proximity is effective to choose for knowledge based forecasting.The paper determines a priority index of similar days as: (temp In Equation ( 45), P I S d reg. is the priority index of S d in specific region, temp C and temp tar C are the average temperatures of a specified city C on the daily basis for a similar day S d and tar days, correspondingly.Furthermore, η reg. is total number of days between tar days and S d days, κ reg.are the chosen cities from every region.This paper separates the system in seven different regions and from every region only one city is selected.
In this paper, ℘ temp,reg.is considered as weighting factor of temperature, while W 1,reg. is taken as weighting factor of η reg. .They are calculated as follows: ℘ temp,reg.= D γ ∑ S d ∈γ reg.

×
(temp and In Equation ( 46), D city is total number of chosen cities from regions.Furthermore, γ reg.are similar days in a specified region in Equation (47).
This paper assumes that if variance of temperatures among tar and S d is more than a determined value temp ds , then this day is overlooked in S d .Moreover, two days having huge differences in temperature can depict different curve shapes of load demand.Likewise, this difference can cause critical impact on knowledge based short-term load forecasting.In addition, this paper also employs the priority index to the historical data and thus, specifies similar days.Equations ( 44) and ( 45) have significant worth in this paper.The impact of temperature can be measured in an efficient way from these equations by dividing the PNPN.The next section specifies this phenomenon.

Distribution of PNPN
The selection of exclusive temperature for huge topographical states usually affects the results in short-term load forecasting.Therefore, an exclusive temperature could not be given to a huge topographical state or zone in order to attain satisfactory forecasted outcomes.However, it is practical to give an exclusive temperature to every region when the entire region is distributed.The distribution of vast topographical zones has been observed in [75,76].Nevertheless, these studies overlooked priority index for similar day selection.
The paper distributes the region separately and then predicts the short-term load by consideration of the proposed priority index for S d selection.Furthermore, the forecasting of short-term load for the entire system can be achieved by summation of predicted results from all regions.Besides, this technique takes the temperature for S d selection knowledge based load forecasting in an efficient way.

Proposed Strategy
The similar days are computed by Equation (45) for every respective region.Subsequently, Γ ds 1 tar,H and Γ ds 2 tar,H are computed.Moreover, Γ tar,H is attained as ultimate forecasting for every region by selected similar days, according to Equation (44).The results obtained from all regions are combined to achieve final forecasted load for the entire system.

Application of Proposed Method on Vast Topographical Zone
This paper employs the knowledge based short-term load forecasting model on a vast topographical region.Moreover, this paper has selected regions of Pakistan for implementation of the proposed model.Pakistan has four seasons and different climates with significant discrepancies throughout the year.PNPN is a huge topographical system, which is distributed in nine regions that are equivalent to regional electric utilities.The primary objective of PNPN in this study is to forecast the demand load for every region.In addition, Figure 3 presents the different colored portions along with the mean of regions having high temperature throughout the year.
A city is selected from every region that is supposed to be the representative of the region.Moreover, a city also specifies the temperature of that particular region.There is no restriction on any system to distribute into specified number of regions.However, the system can be divided according to the requirement of the system and fluctuating behavior of weather.Figure 4 depicts the changing behavior of temperature for Lahore city as a sample.The investigation of PNPN demands more scrutiny of Pakistan's user consumption behavioral analytics.Monday is the first working day of the week while Sunday is the last one.Moreover, the seven days of the week are categorized into four types in Pakistan.The first category of the day is Monday, which is the first working day in Pakistan.Monday has different power demand provisions, especially in early morning (peak-hours).Furthermore, the days from Tuesday to Friday that are also considered week-days in Pakistan, show the same load curve.The difference between Monday and other days of the week is illustrated in Figure 5. Subsequently, another category of day is Friday and Saturday.In this category of days, the operational hours of most workplaces and factories show a fluctuating behavior in contradiction to other week-days.Moreover, Sunday is supposed to be the rest day in Pakistan and is the last category of day.The load curve and load demand depict an entire variating behavior from other categories of day. Figure 6 shows the fluctuating behavior of load curve for a successive week.The paper scrutinizes hourly load for nine regions of PNPN.In this regard, the data form the duration of June 2015 to May 2017 is used as historic data for short-term load forecasting.Besides, the paper predicts the load demand for the duration of June 2017 to May 2018.A city is chosen from every region as a representative of that particular region.It is observed in the literature that there is no concept of splitting the data-set into training and test data in knowledge based systems.Moreover, the knowledge based systems use the entire historic data for choosing the best optimum results and similar days as discussed in Section II.However, the data-sets are divided into training and test data in DBN and F-LOLIMOT.This paper labels 77% of the data as training data and the remaining 23% of the data as test data.
This paper performs sensitivity analysis on the PNPN and concludes that the optimal values achieved for D γ , W 1 , and W 2 are 8, 0.4, and 0.6, respectively.The sensitivity analysis is performed by means of historic data for the duration of June 2015 to May 2018 in order to get the best optimum parameter values.Moreover, the data for the duration of June 2017 to May 2018 is not utilized to get the best optimum parameter values.The load demand for the specified time period of previous data like from the duration of June 2016 to May 2017 is supposed to be the vital goal of prediction by the load information and earlier than that period.This helps in selecting the best optimum parameter values.The best optimal value is achieved when it has least prediction error for the specified period as discussed above.The value of W 1 is changing from 0 to 1. Therefore, it is now obvious that the value of W 2 will be calculated by In addition, the best optimum values of W 1 and W 2 are evaluated by the scrutiny of the historic data.Besides, data for the duration of June 2017 to May 2018 is not used in this analysis as this data is for prediction purposes.Likewise, the value of D γ is also attained from this method.This constraint shows a fluctuating behavior to achieve the least predicting error for a particular time spell.Table 2 presents the prediction error for every execution.In this table, the values of W 1 and W 2 show a variance between 0 and 1.Nonetheless, the value of D γ lies between 5 and 15.The best optimum values for D γ , W 1 and W 2 are 8, 0.4, and 0.6, respectively.This paper further assumes that the proposed methodology employs the similar day load demand data in the preceding years for the distinct days like public and religious holidays.This is done because there is an inadequacy in the historic data.Therefore, the technique of priority index is not applicable for distinct days.Consequently, it is one of the major reasons to observe the effect of temperature in the priority index for normal days instead of distinct days.
The paper only lays emphasis on the short-term forecasting for normal days.Moreover, the distinct days are overlooked from record for selection of similar day.Besides, the paper explains the knowledge based short-term forecasting for Tuesday, 28 June 2016.
1.At first, the days having a similar category of day are chosen on the basis of categorization of target day.In this scenario, Tuesday is included in the second category of day classification as discussed above.Moreover, all the days between Tuesday to Friday are selected.However, all the distinct days is overlooked for analytical purposes.Subsequently, these days are distributed in two data-sets, as discussed in Section II. 2. The priority index of every region is evaluated by Equation ( 45), for all chosen days.Table 3 presents the priority index of selected days for a sample region Islamabad as an example of 30 June 2015.Moreover, in this scenario the value of ℘ temp,reg.is 0.03 and ℘ 1,reg. is 1.5 ×10 −5 .
All the values and Table 3 are associated with the second data-set of Islamabad for the specified date.Every region and every data-set are different from one another.3 depicts that few same days show less difference in temperature rather than choosing same days.However, they are overlooked in this paper as along with the difference in temperature, the proximity of date has also significant worth.For instance, 10 June 2015 and 11 June 2015 will have less difference in temperature as compared to 15 June 2015.However, such days are neglected because they have maximum values of date proximity.Therefore, this paper can choose a similar day that has maximum difference in temperature in the proposed methodology because of proximities in date.Moreover, this phenomenon can produce more similar load curve shapes.Besides, the same chosen days in Islamabad and other regions can cause a discrepancy in selecting the same days from Islamabad for prediction of 28 June 2016.4. The predicted demand load of the entire system is combined load that is obtained from all regions after short-term load forecasting is done for every respective region.

Deep Belief Network
In [77], the basis of DBN is presented briefly.Moreover, the auto-correlation of load demand data has been depicted in  for the previous data.It is obvious from the auto-correlation plots that the preceding data is more auto-correlated to experimental data, to some extent.This paper performs Ljung Box [78] analysis of null supposition to check this assumption more quantitively.The suppositions are as follows: • S 0 : The preceding data are disseminated autonomously, i.e., the correlation is 0 in the preceding data from where the sample is chosen.Therefore, any experimental correlations in the preceding data are the resultant from the unpredictability of the test group.• S 1 : The preceding data are not disseminated autonomously, i.e., the data show serial correlation.
The auto-correlations tests are performed whose outcomes are shown in Table 4.The outcomes show that the preceding data is much more auto-correlated as compared to the experimental data.It is often observed in literature that numerous testing process reject the S 0 for the preceding data.However, S 0 is not rejected by experimental data.Therefore, there subsists a spatial correlation in preceding data.Moreover, if sampling techniques are applied on the historic data then this correlation can be disintegrated.The paper also performs sensitivity analysis and the structure of DBN used for this paper includes one hidden layer with five neurons.Moreover, there are 25 neurons are in input layer and 20 neurons in the output layer in the proposed architecture.These neurons generate the prediction of load demand for the target day (24 h).On the topic of architecture of this network, the input layer is comprised of two constraints for mean and maximum temperature for selected day.Moreover, one constraint is for categorization of the forecasted day while the remaining 22 input constraints are associated with the preceding load demand data, which are as follows: In Equation (48), τ represents the total load demand data, Γ τ m and Γ τ n are demand load for τ th hour (τ = 1, 2, 3, • • • , 24) preceding to selected day.This paper assumes that Γ τ m and Γ τ n represents the τ and τ-1 hours in Equation (48).Moreover, there are 20 neurons in the output (OS τ ) layer of DBN that signifies the difference of load demand on the hourly basis for preceding and selected days, The categorization of days in DBN are entirely divergent from knowledge based system.According to Equations ( 48) and ( 49), Tuesday must be taken apart from days that range from Wednesday to Friday.Therefore, in DBN five categories of days are taken for analysis.

Fuzzy Local Linear Model Tree Algorithm
The paper employs F-LOLIMOT algorithm for training of the linear fuzzy model.The explanatory analysis of F-LOLIMOT algorithm has been discussed in detail in [79].Moreover, the F-LOLIMOT algorithm is capable of predicting the hourly demand load, which is ahead than the current time by means of climatic and load data.Figure 11 depicts that there are different inputs and outputs of demand load and climatic data.This is done after sensitivity analysis on the system.
Furthermore, the lags of climate are the climatic condition of the preceding week and target day.Likewise, the time lags of each hour load demand (inputs) are actually demand load data of similar hour at preceding 9 and 10 days earlier than selected hour.It is obvious that the initial hour of target day by utilizing preceding and recognized load data the upcoming hourly load is forecasted by preceding data.

Results and Discussion
At first, this section presents the evaluational measures that are used in this paper.Subsequently, the results are discussed.

Evaluational Measures
In literature, Daily Maximum Error (DME), Maximum Distance Minimum Error (MDME), and MAPE have been widely used in order to valuate the outcomes, which are achieved from short-term forecasting.This paper has used MAPE, MDME and DME as: and In Equations ( 50) and ( 51), D f are the hours that are forecasted and Γ ν,H is the real demand load at specified hour H of tar day.This paper presents 4 implications to indicate the benefits of the proposed system.The implications are based on the forecasting of load demand for the duration of June 2015 to June 2016.Moreover, these implications are made by climatic and load data, which lies in the range of June 2015 to exactly one day before the target day.The paper takes this data as training data in this scenario.The implications are: 1. MAPE of short-term load forecasting throughout the year (D f = 9750) 2. Average of DME throughout the year, which is referred as maximum distance and minimum error 3. Total number of days, which have MAPE higher than 3% ( 3 ) 4. Total number of days, which have maximum error higher than 5% ( 5 ) The last two implications depict the division of errors, which are achieved from the results of short-term load forecasting.In this paper, the proposed model minimizes the total number of exceeding days from a certain limit and also enhances the performance of MAPE and DME.

Discussion of Results
The paper has evaluated the results on the basis of two assumed evaluations that are discussed as follows.

Evaluation of Priority Index and Splitting Consequences on Knowledge Based Systems
This paper implements the proposed method on PNPN.In this regard, the following cases are observed to discuss the consequences, which are associated with distribution of the forecasting results and taking temperature in priority index.The data distribution is overlooked in Case 1.Therefore, a distinctive temperature is not suitable for the system.Moreover, the priority index is the center of attention in this case along with the date proximity.Besides, the whole system is distributed in different sections in Case 2. Subsequently, the prediction is performed for every respective section.The prediction of the entire system is a combination of predicting outcomes in all sections.Case 2 differs from Case 1 as the data distribution is carried out in this scenario.Nonetheless, the data distribution is also overlooked in Case 3.However, the temperature is taken in consideration in terms of subjective average values in relation to the demand from every region.The consequences of temperature are studied in Case 4. The paper assumes Case 4 as a comprehensive case as it takes temperature in the priority index for selection of similar days from every section.
Tables 5 and 6 presents the outcomes of the aforementioned cases for every category of the day.It can be observed that MAPE of the entire system is minimum in Case 4 as compared to other cases.The data distribution is done in Case 2 and Case 1 has overlooked this phenomenon.Thus, it is proved that distributing the entire system can enhance the prediction outcomes.Moreover, the data distribution also minimizes the MAPE and maximum distance and minimum error.Besides, the data distribution among different regions minimizes the total number of days that go beyond acceptable measures ( 3 and 5 ).The consideration of temperature devoid of distributing the data in different regions is responsible for reduction in valuation constraints when associated with Case 1 and Case 2. Nevertheless, the MAPE of Case 4 is enhanced as compared to Case 3.Moreover, Case 4 has minimum days with maximum error that is larger than 4%.Contrariwise, forecasting results are improved in Case 4 as it distributes the data in different regions and takes temperature in priority index.The MAPE in Case 4 is 1.02 % as depicted in Tables 5 and 6.This achieved MAPE is approximately 8% improved than Case 2 and almost 9% enhanced than Case 1. Besides, Case 4 has the total number of optimum days that exceeds the acceptable criteria.The results achieved for minimum days with maximum error and 5 are also enhanced in Case 4 as compared to Case 3. Nonetheless, 3 has achieved enhanced results in Case 3 in comparison with Case 4.
Table 7 presents D γ and W 1 for target year.The optimum result achieved is for D γ = 7 and W 1 = 0.3.The results achieved for W 1 = 0.3 are approximately near to W 1 = 0.4.Thus, the achieved parameters from training data can give suitable outcomes and are proven appropriate for the proposed method.The preceding data is categorized in two different sets as discussed in Section II.The paper studies three cases in this subsection to depict the consequences of this type of categorization.The MAPE of the entire system in Case 1 is maximum as compared to Case 2 and Case 3 in case of Γ ds 1 tar,H .Thus, it can be concluded that taking same days from selected or last month gives maximum errors in forecasting results.Moreover, the total number of exceeding days from acceptable conditions is not suitable, particularly 5 .Nevertheless, integration of Γ ds 1 tar,H and Γ ds 2 tar,H gives enhanced results for MAPE and minimum days with maximum error in Case 3. Furthermore, passed days from acceptable conditions is lessened in Case 3.
Figures 12-14 depicts the comparative analysis of traditional and proposed forecasting method.According to Figures 12-14, the days presented are four different days and these days belong to dissimilar months.The predicted outcomes are then associated with real load demand.Moreover, the results of the proposed system are much nearer to real load as compared to traditional forecasting techniques.

Comparative Analysis of Proposed Method, DBN, and F-LOLIMOT
The paper compares the results achieved from proposed knowledge based system with DBN and F-LOLIMOT.The results are evaluated in terms of precision and operational time.The short-term load predicting techniques is applied on PNPN to forecast the load demand for the duration of June 2017 to May 2018.Moreover, these predictions are based on temperature and load demand data, which lies in the range of June 2015 to exactly one day before the target day.The results are presented in Table 9, which shows that proposed knowledge based system has enhanced MAPE to 1.01.Besides, the MAPE of 5 and 5 is also decreased.The DBN and F-LOLIMOT techniques show MAPE is approximately higher than 3% for a month and approximately 5% greater in 47-50 days (maximum error).Nonetheless, the proposed method has MAPE, which is greater than 3% in 15-18 days and 5% with 23 days (maximum error).The variances discussed are notable enhancements in forecasting.On the topic of operational cost, the proposed knowledge based method takes minimum time in training and executing in comparison with DBN and F-LOLIMOT.The proposed knowledge based system, DBN, and F-LOLIMOT are executed to predict the days on a yearly basis.Besides, the operational time is distributed to total number of predicted days in order to get the usual operational time of prediction for a specified day.Moreover, the proposed system, DBN, and F-LOLIMOT are executed with the same conditions.Besides, the parameters were tuned for every specified day and forecasted demand load has been achieved for every technique.The paper distributes the day, according to training and operational time in every technique.The proposed knowledge base systems have less operational time as it does not require as much training as compared to DBN and F-LOLIMOT.The proposed method lays emphasis on the selection of similar day and then predicts the load demand as discussed above.
The forecasting of sample day is presented in Figures 15 and 16 by means of DBN, F-LOLIMOT, and proposed knowledge based system.It is obvious that MAPE of the proposed method is 0.69 for a sample day.This MAPE is lesser than MAPEs of DBN and F-LOLIMOT, which are 0.91 and 0.97 respectively.Moreover, the DME is minimized in the presented knowledge based system as compared to others.The phenomenon of priority index is not suitable for special days (public holidays) as discussed in earlier sections.Nevertheless, the special days can be forecasted by the presented knowledge based system devoid of taking a priority index.Besides, the MAPE of the proposed system is 1.30 for all days, together with special days.Nonetheless, the major aim of this paper is to study the consequences of the priority index on the knowledge based system.Moreover, the scrutiny of special days is beyond the scope of this paper.

Conclusions
This paper presents a novel knowledge based short-term load forecasting method.The entire system (region) is distributed in nine sub-systems (zones) by consideration of temperature to predict the demand load more efficiently.The outcomes depict that distribution of huge topographical power network improves the forecasting results.Moreover, this paper presents a novel priority index in which climatic conditions and the date proximity of every particular region is observed.The algorithms of AP and BFFA are hybridized in this paper to achieve better accuracy for a knowledge based system.The proposed knowledge based system is verified on PNPN.The achieved outcomes depict that proposed method minimizes the MAPE and other errors of forecasting in comparison with traditional

8 return9
Price value is not changed by uc 0 Jump

Figure 1 .
Figure 1.Principle components of knowledge based system.

Figure 2 .
Figure 2. Variations in load behavior of sample Thursday during 2015 of Pakistan's National Power Network (PNPN).

Figure 5 .
Figure 5. Fluctuating Behavior of Load Curve in Pakistan and Difference of Monday and a Sample Week-day.

Figure 6 .
Figure 6.Fluctuating behavior of load curve in Pakistan of a particular week.

Figure 10 .
Figure 10.Auto-correlation of preceding demand load data for day lags in DBN for resampled data (1,2).

Figure 11 .
Figure 11.Auto-correlation of preceding demand load data for day lags in Fuzzy Local Linear Model Tree (F-LOLIMOT).

1 .
Case 1: Short-term load forecasting of PNPN without taking temperature and distribution of data 2. Case 2: Short-term load forecasting of PNPN including consequences of data distribution without taking the temperature 3. Case 3: Short-term load forecasting of PNPN including including temperature without taking the consequences of data distribution 4. Case 4: Short-term load forecasting of PNPN with temperature and distribution of data

Figure 12 .
Figure 12.Comparative analysis and effect of proposed and traditional method for Monday, 19 September 2015.

Figure 13 .
Figure 13.Comparative analysis and effect of proposed and traditional method for Wednesday, 13 June 2015.

Figure 14 .
Figure 14.Comparative analysis and effect of proposed and traditional method for Sunday, 4 January 2015.

Figure 15 .Figure 16 .
Figure 15.Short-term load forecasting for a sample day.

Table 1 .
List of Symbols Used.Total budget of consumerΛ n 0 ,1 , Λ n 0 ,2 , Λ n 0 ,3Lagrange multipliers ∇υ consBest condition of first order E uc 0 Available power of uc 0 ξ uc 0 +1Price of U C other than uc 0 M prod Strategy sets for M cons Strategy sets for n 0 d Game plan for all n dκ + Best feedback of all n d +

Table 2 .
Mean Absolute Percentage Error (MAPE) for every pair of D γ and W 1 for training data.

Table 3 .
Selection of similar days on the basis of priority index values for 28 June 2016.

Table 4 .
ρ Values of the Ljung Box auto-correlation test with different region values.

Table 5 .
Consequences of priority index and data distribution on the forecasting results for 5 and 3 .

Table 6 .
Consequences of priority index and data distribution on the forecasting results for Maximum Distance Minimum Error (MDME) and MAPE.

Table 7 .
MAPE for every pair of D γ and W 1 for target data.
1. Case 1: Load forecasting by collected similar days in initial data-set, Γ Load forecasting by collected similar days in last data-set, Γ ds 2 tar,H and Γ ds 2 tar,H , i.e., Γ tar,H =⇒ Equation (44) Table 8 presents the outcomes of Case 1, Case 2, and Case 3 for every category of the day.Table 8 presents the outcomes of Case 1, Case 2, and Case 3 for every category of the day.

Table 8 .
Consequences of taking Γ ds 1 tar,H and Γ ds 2 tar,H on forecasting.