Next Article in Journal
A Lower Bound on the Differential Entropy of Log-Concave Random Vectors with Applications
Previous Article in Journal
Equilibrium States in Two-Temperature Systems
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Fisher Information Based Meteorological Factors Introduction and Features Selection for Short-Term Load Forecasting

1
College of Electrical Information and Engineering, Jiangsu University, Zhenjiang 212013, China
2
Key Laboratory of Facility Agriculture Measurement and Control Technology and Equipment of Machinery Industry, Jiangsu University, Zhenjiang 212013, China
*
Author to whom correspondence should be addressed.
Entropy 2018, 20(3), 184; https://doi.org/10.3390/e20030184
Submission received: 21 January 2018 / Revised: 26 February 2018 / Accepted: 7 March 2018 / Published: 9 March 2018

Abstract

:
Weather information is an important factor in short-term load forecasting (STLF). However, for a long time, more importance has always been attached to forecasting models instead of other processes such as the introduction of weather factors or feature selection for STLF. The main aim of this paper is to develop a novel methodology based on Fisher information for meteorological variables introduction and variable selection in STLF. Fisher information computation for one-dimensional and multidimensional weather variables is first described, and then the introduction of meteorological factors and variables selection for STLF models are discussed in detail. On this basis, different forecasting models with the proposed methodology are established. The proposed methodology is implemented on real data obtained from Electric Power Utility of Zhenjiang, Jiangsu Province, in southeast China. The results show the advantages of the proposed methodology in comparison with other traditional ones regarding prediction accuracy, and it has very good practical significance. Therefore, it can be used as a unified method for introducing weather variables into STLF models, and selecting their features.

1. Introduction

Short-term load forecasting (STLF) plays an important role on ensuring power system security and economic operation [1], and its prediction accuracy is influenced by many interdependent factors. Of all these factors, meteorological factors are the dominant exogenous factors that affects STLF [2,3,4]. More recently, meteorological sensitive load demands such as air conditioning, space heating, agricultural irrigation, etc., increasingly grow, and the influence of weather conditions on electrical demands is further intensified, which makes their relation more complicated. Whether weather factors are properly considered has a significant impact on the prediction accuracy of STLF, while even a small improvement in prediction accuracy means big cost savings and also a great contribution to the environment in which we live [5]. Meanwhile, with the wide deployment of real-time monitoring devices in several 220- and 110-kV substations across the forecasted area, it becomes possible to gather real-time weather data such as temperature, humidity, rainfall, wind speed, etc. [6]. In this context, it would be of immense value to develop methods introducing meteorological variables into the forecasting models to improve prediction accuracy and boost prediction speed.
Numerous studies have shown that, of all meteorological factors, temperature has the most remarkable effect on electrical demands [7,8]. Humidity also has a major effect on the comfort or discomfort felt, because it affects the amount of heat that the human body rejects through evaporation [9,10]. Other factors that have an impact on load demand behavior are the wind speed and cloud cover. Rainfall influences electrical demand rather indirectly by affecting other weather variables [11]. Meteorological variables could be introduced into the forecasting models in a primitive form or in a derived form created by combining two or few of them, namely, Meteorological Composite Index (MCI), such as Temperature–humidity index (THI), Comfort Index of Human Body (CIHB), etc. [12]. The MCIs are more popular among conventional models, especially temperature derivatives [13,14]. The MCIs created by mixing different variables such as temperature, humidity, and wind speed are also popular among artificial intelligence models. These MCIs include: Heat Index (HI), introduced by USA National Weather Service [15], tries to represent perception of temperature by a human depending on relative humidity, namely, human-perceived equivalent temperature [16]. Humidex, proposed by Canadian meteorologists Masterton and Richardson in 1979 [17], is another parameter calculated using temperature and humidity. It was utilized in similar day-based wavelet neural network (NN) models [18], and implied models using the MCI performed better than ones using primitives. Temperature–humidity index (THI) is an analysis index used frequently by forecasting models [19]. Through this, both temperature and humidity are introduced into load prediction, which was employed in some models [11,20], and showed its effectiveness. Real sense of temperature is another type of meteorological parameter integrating together temperature, relative humidity and wind speed in an attempt to capture coupling effect of these factors on human perception. A backpropagation based NN model used it as one of its inputs to forecast load demands, and achieved better results [21]. Comfort Index of Human Body (CIHB) is a measure of synthesized effect combining temperature, relative humidity and wind speed. It is an indicator depicting human body’s comfortable level in atmospheric environment [22]. An improved Elman neural network model utilized it as one of the inputs, and compared it with another one using temperature only. As a result, better prediction accuracy was acquired with the former solution [23]. Enthalpy latent days (ELD) is a parameter depicting the influence of humidity on summer load demand for cooling and air-conditioning [24]. It can be expounded as an amount of energy necessary to decreasing indoor humidity to an acceptable level without reducing the indoor temperature. Wind-chill index (WCI) is an index aiming to capture cooling effect of the wind blowing while temperature is below threshold of 50 °F (10 °C) [25]. The WCI was selected as an input to wavelet neural network to correct similar-day based forecasting model, and showed an improvement of the accuracy [18,26]. Cooling power of the wind (CP) was developed by Taylor and Buizza [27], which is a nonlinear function of wind speed and average temperature trying to capture chill induced by gusts of wind.
All the aforementioned MCIs have greatly enriched the approaches of introduction of weather factors into STLF and lay a firm foundation for the improvement of prediction accuracy in STLF. All these approaches introducing meteorological variables into STLF models, however, have some intrinsic limitations due to the complexity of the problems involved. Current load demand is affected not only by the temperature of the current moment but also by the previous one because of the accumulative effect of temperature (AET) [19]. Although Li and Fang et al. [1,28] presented their respective corrected models by taking AET into account, a cumulative effect coefficient based on the models must be adjusted frequently over time. Jiang et al. [29] acquired a quantitative calculation formula for AET by means of curve fitting algorithm, but a proportion coefficient concerning contribution of both continuous number of high temperature days and the highest daily temperature to AET varied over different regions. On the other hand, all the MCIs above involve only the features of daily meteorological factors (Max, Mean, and Min) as their effects on electrical demand. The investigation by Callaway et al. [30] showed that using a simple daily average temperature cannot provide good representations of weather, therefore, cannot obtain a higher prediction accuracy of STLF. Although Kang et al. [31] developed a NN model based on real-time meteorological factors for STLF, the number of input variables grows dramatically, which would inevitably lead to an increase in computational costs. Hence, it is more important to investigate pointwise influence of meteorological factors on the load demands to establish load forecasting models point by point under real-time weather information circumstances.
Information theory provides a new way of introducing weather variables into STFL models. The reason is that essentially any type of data or model can be converted to information regardless of disciplinary origin [32]. Zhu et al. [33] described STLF as a process of information decision-making, and presented a combined model for STLF based on maximum entropy principle. Sun et al. [34,35] regarded STLF as a process of load information movement to tackle uncertainties in STLF, and then developed the minimization of Information Loss Based Hybrid STLF model to improve prediction accuracy, even though weather information is not involved separately, instead it was considered as a whole. It should be noted that, however, rich weather information available throughout various substations provides us a chance to further investigate the impact of meteorological factors on load demands. Thus, the key is to find an appropriate method of introduction of weather variables into STLF to characterize fully its influence on load demands.
Fisher information (FI) is well known for its ability to measure a system’s stability [32]. Unlike other approaches of system information measures, Fisher information provides a method that can monitor the system states’ changes and shifts by means of measuring the systems’ variables [36]. In fact, Fisher information has been applied to deriving fundamental equations of physics, thermodynamics, and population genetics [37,38]. More recently, Fisher information was widely applied to various areas, such as ecological systems [36,39], climate systems [40], power system fault detection [41], etc. The objective of this article is to demonstrate our beginning steps in the development of a methodology of introducing climatic factors into STLF models using Fisher information, and examine its effectiveness in practical application using a real data collected from a local utility company.
The remainder of the paper is organized as follows. Fisher information theory and its calculation are presented in Section 2. The description of introduction of meteorological factors and input variables selection based on FI for STLF models is given in Section 3. Different forecasting models are constructed with the proposed methodology in Section 4. Case study and discussion are presented in Section 5. Finally, the contributions with concluding remarks are reported in Section 6.

2. Methodology

The statistician Ronald Fisher (Fisher 1922), from a system’s stability point of view, developed a measure of indeterminacy, now called Fisher information. Fisher Information (FI) for a single measurement of one variable is calculated as follows [42]:
I = d s P ( s ) ( d P ( s ) d s ) 2
where P(s) is the probability density function (PDF), and s is a state variable.
A system’s stability is conceptually related to the repeatability of observations. Thus, for a system that is perfectly stable, repeated observations of the variables over time acquire the same values within the limits of measurement uncertainty. Hence, for perfect stability, the probability density function (P(s)) becomes a very sharp spike with a derivative dP/ds that is approaching infinity, and a Fisher information (see Equation (1)) is approaching infinity. For a system that is perfectly unstable, the opposite is true. Here, all observations acquire completely different and uncorrelated values for the variables, the likelihood of observing one value is the same as any other value, the probability density function (P(s), i.e., PDF) is flat, and the derivative dP/ds is approaching zero. In addition, when a system flips from a steady state to another, the derivate of the probability density function of the system shows an obvious transition. Real systems, however, exist between these two extremes of perfect stability and perfect instability and infinite and zero Fisher information. Hence, Fisher information is a theoretically sound measure of system stability.
In practical application, to reduce calculation errors due to dividing by small values of P(s), we replace the probability density function in Equation (1) with its amplitude, which is defined by q2(s) ≡ P(s). Equation (1) then becomes [43]:
I = 4 d s [ d q ( s ) d s ] 2
Note that in specific calculation, we do not know the concrete form of the continuous function q(s), but instead we have a finite number of samples qi. Thus, Fisher information (FI) is usually computed numerically. For that purpose, we shall replace the derivative by the numerical difference Δq = qiqi+1 and Δs = sisi+1, and correspondingly use the sum of finitely many terms to approximate the integral, which then leads to the following formula for calculating Fisher information approximately:
I 4 i = 1 n [ q i q i + 1 s i s i + 1 ] 2 ( s i s i + 1 )
In Equation (3), si is an index denoting a particular state of the system, i.e., s1 is state 1, s2 is state 2, etc. Accordingly, sisi+1 = 1 and the final expression for computing Fisher information is:
I 4 [ q i q i + 1 ] 2
The expression in Equation (4) will be used in all our Fisher information calculations afterwards.

2.1. FI Computation for the One-Dimensional Variables

The initial step in computing FI for the one-dimensional variables is to acquire an observed data series that characterize the state of the system over time. Then the time series data are divided into time windows by defining a parameter w denoting the size of the window. This parameter (w) is determined based on the amount of data available and the behavior of the system. From empirical studies, the w should contain at least eight time steps to ensure that one point in the window does not unduly influence the overall computation. A sequence of overlapping windows is then created to measure a system’s stability that may extend beyond the boundary of the window. This is achieved by moving the time window forward by a time increment (δ). The only rule regarding setting δ is ensuring that w > δ. The parameters w and δ denote the integration window size and window increment (in time steps) used to move through the data over time [43].
Suppose the measured time data serial is D = {d(1),…, d(N)}. We then introduce a series of sliding windows Wm on D as follows
W m = { d ( k ) ,   d ( k + w 1 ) }
where k = 1 + m × δ, wN is the window size, δN is the window increment, i.e., sliding factor, and m = 1, …, M with M = (Nw)/δ being the number of windows. Suppose all the elements in a sliding window Wm can be divided into I intervals:
W ( m , w , δ ) = i = 1 I Z i
where
Z i = [ S i 1 , S i ) ,   i = 1 ,   2 ,   ,   I
and
S 0 < S 1 < S 2 < < S I
S 0 = min [ W ( m , w , δ ) ] = min [ { d ( k ) ,   k = 1 + m × δ ,   , w + m × δ } ]
S I = max [ W ( m , w , δ ) ] = max [ { d ( k ) ,   k = 1 + m × δ ,   , w + m × δ } ]
Then, the probability P(Zi) that the element d(k) falls into an interval Zi is:
P ( Z i ) = the   number   of   elements   falling   into   the   interval   Z i the   total   number   of   elements   in   the   sliding   window
Next, the q(Zi) is calculated for each state ( q ( Z i ) = p ( Z i ) ), and estimate FI value for each Wm.
We summarize the procedures of calculating FI for the one-dimensional variables as follows: (i) Categorize a time series data into a sequence of time windows, overlapping each other. (ii) Divide each window into intervals with the same length by using the above method, and count the number of elements falling into each interval (iii) Construct a probability distribution function for the window. (iv) Calculate the Fisher information from the PDFs constructed in Step (iii) for each time window.

2.2. FI Computation for the Multidimensional Variables

Given a time series data with length N in m-dimensional state space are as follow.
{(X(t1), X(t2), …, X(ti), …, X(tN)}
where X(ti) = (x1(ti), x2(ti), …, xm(ti)) is a vector with m-dimensional components in the state space. Similarly, a series of overlapping windows each other are constructed on the above data series.
As stated above, the state of the system is defined by its state variables. The behavior of these state variables determines the stability of the system. Support the data points of representing the state variables within each window characterize a series of states for a system, and if | X ( t i ) X ( t j ) | X ( X is a measurement’s allowed error), then both X (ti) and X(tj) are deemed as the same state. According to Chebyshev’s theorem [44], when X takes 2 δ (here, δ is the standard deviation of all the data points within a window), 75% of data points in this window will be categorized into the same state, no matter what the probability distribution is. Considering that each data point of the above window is an m-dimensional vector, an m-dimensional hyper-rectangle is constructed, and the lengths of the “m” sides of the hyper rectangles are determined by the uncertainties 2 δ 1 , 2 δ 2 ,   ,   2 δ m in m-dimensional components respectively. (δ1, δ2, …, δM are the standard deviation of data points for corresponding m-dimensional component within the window). More specifically, the first data point in chronological order within the first time window is taken as the center of the first state and a state hyper rectangle is constructed around it. All the points falling within the boundaries of the first state are counted or binned together, denoted as Z1, namely:
Z 1 = { Z 1 : ( | x 1 ( t 1 ) x 1 ( t j ) | 2 δ 1 )   & ( | x 2 ( t 1 ) x 2 ( t j ) | 2 δ 2 )   & & ( | x M ( t 1 ) x M ( t j ) | 2 δ M ) } , j = 2 ,   3 , ,   N .
The next uncounted point within the time window is selected, the second state is built around it, and the points are binned as before. Finally, this procedure is repeated until all the points are binned into states within the current time window, and the process moves to the next time window.
Suppose all the elements in a sliding data window can be put into L bins. Then, we have
Length ( W ) = i = 1 L Length ( Z i )
where Length(W) stands for the total number of data points contained in a window or bin of concern. That is, the total number of elements in a sliding window equals the sum of the subtotals contained in those bins.
We summarize the aforementioned procedures of the binning method: (i) Categorize a multidimensional time series into a sequence of time windows. (ii) In each window, convert data points into states by using the above method. (iii) In each window, construct a probability distribution function for possible states of the system. (iv) Compute the Fisher information from the PDFs constructed in Step (iii).

3. Methods of FI-Based Weather Variables Introduction and Feature Selection Process for STLF

Of all the weather factors, temperature is considered to have most influence on electrical demand. During high-temperature seasons, load demands increase sharply when the temperature rises. Particularly, a constantly high temperature has an accumulative effect on it, and a similar situation was found with the other weather variables. Accumulative effect of weather factors has a significant impact on STLF, and hence, it is taken into account in STLF [29].
FI is a measure of a system’s stability. The degree of the stability of the weather factors in previous period can be described quantitatively by measuring the FI value of past weather variables. In this way, current weather variables value is weighted using the FI value of weather factors in previous period, and then accumulative effect of weather factors on load demands is fully reflected. This approach is well in accordance with one’s general sense of weather variation caused by accumulative effect, that is, “steadily” low or high. It avoids identifying a cumulative effect coefficient regarding weather factors in conventional methods as well.

3.1. Methods of FI-Based Primitive Weather Variables Introduction into STLF Models

Fang et al. [28] investigated the relation between meteorological sensitive load and real-time temperature to examine accumulative effect of temperature on load demands. The result shows the effect comes from not only currently forecasted day, but previously several days. The former refers that a load demand at forecasted time point on the day was influenced by several previous time points before the time point within the day, especially for one previous time point, two previous time points, and three previous time points (in 1 h sampling time interval), while the latter refers that constantly high temperature within several previous days before the day would lead to grow unusually for the load demand. Specifically, these are one previous day and two previous days. Other weather variables show a similar result. Therefore, the sliding window for FI computation of a single weather variable, considering accumulation effect, consists of three sections as follows (in 15 min sampling time interval).
{ ( X d , t X d , t 1 , , X d , t 11 ) ,   ( X d 1 , t X d 1 , t 1 , ,   X d 1 , t 7 ) ,   ( X d 2 , t X d 2 , t 1 , ,   X d 2 , t 3 ) }
Note that the first subscript identifies day type, and the second one identifies time point. The subscript d and t represent currently forecasted day and time point, respectively, and the subscript (di) and (tj) represent previous i-th day before the day and previous j-th time point before forecasted time point, respectively. As shown above, the first group is the sampling data points of the weather variables of 12 previous time points prior to the time point within the day. The second group is that of eight previous time points prior to the same time point on one previous day before the day, and the third group is that of four previous time points prior to the same time point on two previous days before the day. That is, the data window used for FI calculation comprises a total of 24 data points, which meets the principle that more sample real data should be selected when the distance regarding time between the sampled data points and the forecasted time point is much nearer and vice versa in handling those issues. This approach makes it more reasonable and comprehensive to depict accumulative impact of the weather factors on load demands.

3.2. Methods of FI-Based MCIs Introduction into STLF Models

MCIs can track more sensitively and describe more effectively load variation compared to the plain weather variables [19]. This is because one’s feeling of comfort is determined by an integrated effect mixing different weather factors such as temperature, humidity, wind speed, etc., together, while a single weather variable cannot characterize exactly one’s true feeling. In this context, several MCIs were introduced into STLF models, such as HI, THI, CIHB, etc.
Traditionally, the MCIs are defined by two or three weather variables to reflect an effect of interacting and inter coupling between them. One can image the effect is definitely associated with the stability of a system which is defined by corresponding weather variables, and the stability is measured by FI’s value of the multidimensional weather variables. The MCIs, similar to plain weather variables, have an accumulative effect on load demands, which lead to a time lag regarding one’s reaction to outside weather conditions. Accordingly, the effect should be considered when the STLF models are established, which is achieved by the weighted MCIs, namely, the MCIs multiplied by corresponding FI value.
The sliding window data serial for calculation multiple weather variables FI, considering accumulation effect, is constructed as follow.
{ ( X d , t X d , t 1 , ,   X d , t 11 ) ,   ( X d 1 , t X d 1 , t 1 , ,   X d 1 , t 7 ) ,   ( X d 2 , t X d 2 , t 1 , , X d 2 , t 3 ) }
As displayed above, it consists of three parts similar to the single weather variables in Section 3.1. It is worth noting that each element in the above window, e.g., Xd,t = (xd,t(1), xd,t(2),…, xd,t(M)), is a vector with m-dimensional components in the weather state space. Data selection for the sliding window and the symbols’ meanings are the same as Section 3.1.

3.3. FI-Based Feature Selection Process for STLF Models

It is also relevant to notice that the forecasting engine is only a part of an accurate forecasting model and other processes such as feature selection (FS) are very important as well [45]. FS is commonly applied to identifying the most significant input variables influencing STLF’s prediction accuracy. The variables selection is very crucial for artificial intelligence (AI) models training. Input variables, through selecting reasonably, can improve STLF’s prediction accuracy and reduce models’ training costs. Therefore, how to choose the most important factors impacting on load demands from massive historical data to yield a set of effective and sufficient input variables is a challenging task. Drezga et al. [46] proposed a phase-space embedding method to identify input variables for NN models, but it is only able to find out historical loads that have the most impact on the future forecasted time point, and has limited effect for the weather variables. Gao et al. [47] applied orthogonal least square (OLS) approach to select feature for NN models in STLF. It, however, require to identify a measurement’s allowed error in advance. Liu et al. [48] adopted Relief algorithm to address FS matter for STLF, but computation process is more complicated.
Essentially, Fisher Information is the inverse of the Shannon entropy; Where Shannon entropy measures the degrees of disorder or indeterminacy of the system, while Fisher Information measures the degrees of order or determinacy of the system. For a certain thing, the more deterministic it is, the more it outputs information, the more regular its structure is, and the larger its FI value is. By means of this property, the importance degree of all the input variables in STLF can be identified. The concrete implementation process is as follow.
Suppose a historical daily average load consumption dataset with length m is used to analyze the impact of n factors on load, and the following matrix for the analysis is then constructed:
X = ( x i j ) m × n = ( x 11 x 1 n x m 1 x m n )
To avoid the effect of different dimension, each element in the matrix is normalized to [0, 1] to acquire a normalized matrix R = ( r i j ) m × n . FI value regarding per column corresponding to a factor is calculated, then is normalized to [0, 1] once again. The weight of the j-th influencing factor is computed as follow.
W j = 1 FI j n j = 1 n FI j
As can be seen in Equation (14), when the j-th factor all takes the same value in the j-th column, the factor’s FI value takes the maximum 1, while its weight Wj takes the minimum 0, which signifies the factor cannot provide any useful information for STLF. The factor should be ignored or deleted from the above influencing factors. Conversely, when the j-th factor takes completely different and uncorrelated values in the j-th column, the factor’s FI value takes the minimum, while its weight Wj takes the maximum, which signifies the factor can provide useful information and much attention should be paid to the factor in STLF. As a result, the j-th factor’s value is weighted using its weight wj above to acquire a final value as the j-th input variable of STLF models. All the rest of the input variables are determined in the same way. FI-based feature selection process (FSP) can reduce the redundant information and improve the prediction accuracy and speed.

4. Load Forecasting Models

Over the past few decades, using weather variables as an integral part of forecasting process has been the most populous. It is common approach among neural network (NN) ones. Of all these methods using weather variables, backpropagation (BP) NN is one of the most widely applied means. The reason for this is that it can approximate numerically any continuous function to the desired accuracy. In addition, it is data-driven methods, in the sense, it is not necessary for the researchers to postulate tentative models and then estimate their parameters [49]. The most common BP structure has three layers with a sigmoid transfer function for the activation function in the hidden layer and a linear transfer function in the output layer. The possible drawbacks for the method are, first, no theoretical approach can be applied to calculating the appropriate number of hidden layer nodes [50]. Secondly, it is prone to overfit sample data because of the large number of parameters that must be estimated [49]. In consideration of this, it is adopted to demonstrate the effectiveness of FI-based weather variables introduction and feature selection process for STLF, although it should be noted that the choice of modeling technique is not central to this paper.

4.1. Forecasting Model with Primitive Weather Variables

Previous load forecasting methods mainly consider characteristics of daily meteorological factors, yet out of consideration is the influence of meteorological factors on load at different time points over a day. For depicting pointwise influence of meteorological factors on the load by means of primitive weather variables, a BP NN-based STLF model with 19 inputs variables is built [31]. The model consists of three layers, namely the input layer, the hidden layer, and the output layer, and the output layer has only one variable, i.e., the forecasted load at this point. Temperature, humidity, and load, both at different points in the same day and at the same point in different days, and others, are defined as input variables for the model. All the input variables are shown in Table 1.
Nineteen variables are applied to the input neurons of BP neural network algorithm to examine the impact of real-time weather factors on load. This model is adopted as the benchmark method and denoted as Model I, for comparing with the following methods based on FI, in this paper.

4.2. Forecasting Model with Primitive MCIs

For a long time, MCIs have played a significant role in introducing weather variables into STLF models, and tey exhibit a comprehensive effect for outside weather conditions compared to the primitive weather variables. Therefore, it is imperative to construct a BP NN-based STLF model with primitive MCIs in terms of real-time MCI influence on load, which is achieved by using an MCI (specifically, THI) instead of both temperature and humidity in Model I. The model is presented in Table 2.
This model with 13 input variables is denoted as Model II to demonstrate the impact of real-time MCI on load, and also to compare it with Model I above.

4.3. Forecasting Model with Weighted Weather Variables Based on FI

Using the aforementioned Model I as a basis, temperature and humidity are renewably introduced the model by means of FI-based approach, and then, the input variables are reduced to 11. They are displayed in Table 3.
This model with 11 input variables is denoted as Model III. The model can describe well accumulative effect of both temperature and humidity on load demands by means of the proposed method. In addition, it offsets the time differences between load variation and weather conditions, which makes the variation of load with weather conditions to exhibit a real-timely effect.

4.4. Forecasting Model with Weighted MCIs Based on FI

THI is one of the most widely applied MCIs because of its practicability. As a result, the THI is utilized for demonstrating the performance of the method of weighted MCIs introduction into STLF models. The THI is the temperature and humidity index, which can be calculated with the following formula [19]:
THI = T + 1450.8 ( T + 235 ) 4030 ( T + 235 ) ln R 43.4
where T and R are temperature (in degrees Celsius) and relative humidity (integer percentage), respectively. The THI value computed multiplies by two-dimensional FI value of both temperature and relative humidity as a weighted THI (WTHI), as stated in Section 3.2, and is introduced BP NN model. In this manner, the input variables are reduced to nine. They are described in Table 4.
This model with nine input variables is denoted as Model IV. The model is established, using the proposed method, to characterize the accumulation impact of the MCIs on load point by point. It is supposed to be capable of further improving the accuracy and speed of STLF.

5. Case Study and Discussion

Here is a case study designed to demonstrate the performance of the proposed methods. The historical load and meteorological dataset are collected from Zhenjiang utility company and Zhenjiang Weather Bureau, respectively, Jiangsu Province, in southeast China. The load dataset used for the simulation were sampled every 15 min, from 1 August 2016, to 31 August 2016. The weather dataset collected in corresponding period were sampled every 1 h, and then interpolated to produce corresponding sampling interval dataset. Therefore, 96 points of load and weather data (involving temperature, humidity, and wind speed), day type, THI are included in the data of each day. The simulation is implemented with MATLAB 2015b (MathWorks, Natick, MA, USA).

5.1. Comparison of the Correlation between the Load Demands and the Weather Variables

The comparison of the correlation among daily average load and mean temperature before and after weighted with the proposed method is performed. Note that both daily average load and corresponding mean temperature are taken by Per-Unit Value.
As shown in Figure 1, the daily average load has an evolution trend similar to a primitive temperature as a whole, which matches to the anticipation. In contrast, the weighted temperature (WT) has a much better characterization for the evolution rules of the load. It can be observed that the load stays in rising stage when the WT rises, while the load is in decreasing stage when the WT falls. Moreover, the peak and valley values of the curve of the load are well corresponding to that of the WT, which show a much stronger correlation between the load and the WT. It is worth noting that the variations of the load of certain days are not fully consistent with that of the temperature of corresponding days. This is because temperature is not an only influencing factor on load changes.
Figure 2 presents the comparison of the correlation among the load and THI before and after weighted. As displayed in the figure, a similar result is found with the THI.
The values of the measure of the correlations among the aforementioned variables are calculated based on the following formula:
r ( x , y ) = t = 1 n ( x t x ¯ ) ( y t y ¯ ) i = 1 n ( x t x ¯ ) 2 i = 1 n ( y t y ¯ ) 2
r(x, y) represents the correlation coefficient between the variable x and y; xt and yt refer to the value of x and y at t time point, respectively; x ¯   and   y ¯ identify the mean value of x and y, respectively; and n is the number of sampling time points. The measured values are illustrated in Table 5.
It can be seen in Table 5 that, using the proposed method, the correlation coefficient between the load and the temperature increases from 0.88 to 0.94, while that of both the load and the THI grows from 0.91 to 0.96. Obviously, the correlations among them are improved. All these analyses confirm the feasibility of the proposed approach, which lay a foundation for the following discussion.

5.2. Comparison of the Prediction Effect of Different Forecasting Models

Based on the aforementioned load forecasting models and the analysis of feature selection in Section 4, a load in term of every 15 min on 10 August 2016 is forecasted point by point. The mean absolute percentage error (MAPE) e MAPE and the margins of forecasting error AC are adopted as the criteria of evaluating the performances of load forecasting models, respectively:
e MAPE = 1 n i = 1 n | A ( i ) F ( i ) A ( i ) | × 100 %
A C = 1 n i = 1 n ( A ( i ) F ( i ) A ( i ) ) 2 × 100 %
where n is the number of forecasting time points; F(i) is the forecasted load at time point i; A(i) is the actual load at corresponding time point; and AC represents the average prediction variance of load forecasting in 96 points.
Figure 3 exhibits the prediction results, the relative error curve, and MAPE for different forecasting models. Figure 3a depicts the actual load demand and the forecasted load demand using the benchmark model i.e., Model I and Model II. The relative error curves is presented as well. As shown in the figure, the forecasted load demand with Model II better matches the actual one compared to Model I, and the prediction performances with Model II is better than Model I in term of both relative error and MAPE. Figure 3b displays those of Model III before and after FS. From this figure, it can be observed that the forecasted load demand obtained using Model III is better than both Model I and Model II, and Model III with FS is better than the one without FS, which implies FS has an impact on the accuracy of STLF. Overall, the amplitude of the relative error curves plotted in Figure 3b are lower and more homogeneous than both Model I and Model II, and the MAPE obtained in Figure 3b) is less compared to Figure 3a. Figure 3c characterizes the actual load demand and forecasted load demands of Model IV before and after FS. It can be seen clearly that the prediction results shown in Figure 3c is much better compared to Figure 3a,b. Particularly, Model IV with FS displays the best prediction performance among the models, and the forecasted load demand with Model IV through FS closely matches the actual load demand. Correspondingly, the amplitude of the relative error curves of the model is much smaller and more homogeneous than others, and its MAPE is the smallest one in all the models. In summary, the proposed models can achieve better forecasting results in comparison with both Model I and Model II in Figure 3.
The relative error of hourly load forecasting results among different models is listed in Table 6. Subsequently, e MAPE and AC are presented to further assess their respective performance. As can be seen in Table 6, a smaller e MAPE of load forecasting performance can be clearly observed in the proposed models when compared with both Model I and Model II. The e MAPE of Model II decreased from 3.68% to 2.97%, and the AC reduced from 4.54% to 3.84%. Those of Model III decreased from 3.68% to 2.43%, and from 4.54% to 3.13%, respectively. Those of Model IV decreased from 3.68% to 1.98% and from 4.54% to 2.55%, respectively. Those of Model III with FS decreased from 3.68% to 1.69% and from 4.54% to 2.22%, respectively. Those of Model IV with FS decreased from 3.68% to 1.39% and from 4.54% to 1.86%, respectively. overall, the accuracy of the load prediction is improved step by step from Model I to Model IV with FS, which is in accordance with the anticipation.

6. Conclusions

The principal contributions in this paper are to propose a robust methodology as a practical means of both introducing weather factors into STLF models and selecting its input variables, and comparing it with traditional methods.
Weather information is an important factor in load forecasting models. How to effectively introduce meteorological variables and reasonably select feature for STLF models has always been a topic of interest in this area. When looked at from an information point of view, FI can be interpreted as the amount of information that can be extracted from a set of measurements. From this point, the amount of weather information in previous period can be measured from past weather variables by FI. In this way, current weather variables value is weighted using the FI value of weather information in previous period, and then accumulative effect of weather factors on load demands is fully reflected. The same applies to FS in STLF models.
The advantages of the proposed method are easy to understand and convenient to apply in practical load forecasting. As demonstrated above, the method overcomes the main drawback of traditional method in term of considering accumulative effect of both the plain weather variables and the MCIs on load, and displaying a real-time effect on the load demands. In addition, the proposed FS approach reduces the redundant information, saves model training time and improves STLF’s prediction speed. Different forecasting models with the proposed approach are established and implemented, based on the case study of Zhenjiang City in southeast China, to assess the performance of the methodology. The simulation results obtained show the usefulness of the proposed methodology, which can be used as a unified method for weather variables introduction and feature selection for STLF models.

Acknowledgments

This research was funded by National Nature Science Foundation of China (Grant No. 61301138 and 51505195), Nature Science Foundation of Jiangsu Province, China (Grant No. BK20130501) and the Priority Academic Program Development of Jiangsu Higher Education Institutions ((2011) No. 6).

Author Contributions

Shuping Cai conceived the theoretical methodologies and prepared the manuscript. Shuping Cai and Lin Liu designed the experiments. Lin Liu, Huachen Sun and Jing Yan performed the experiments. All authors discussed the results, and commented on the manuscript at all stages. All authors have read and approved the final manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Li, C.B.; Li, X.H.; Zhao, R. A novel algorithm of selecting similar days for short-term power load forecasting. Autom. Electr. Power Syst. 2008, 32, 69–73. [Google Scholar]
  2. Karaki, S.H. Weather sensitive short-term load forecasting using artificial neural networks and time series. Int. J. Power Energy Syst. 1999, 19, 251–256. [Google Scholar]
  3. Xie, H.; Chen, Z.Y.; Niu, D.X. Principle component analysis of data processing for short-term load forecasting. Power Syst. Technol. 2000, 24, 43–46. [Google Scholar]
  4. Li, R.; Liu, Y.; Li, J.H. Study on the daily characteristic load forecasting based on the optimized algorithm of decision tree. Proc. CSEE 2005, 25, 36–41. [Google Scholar]
  5. Raza, M.Q.; Nadarajah, M.; Hung, D.Q. An intelligent hybrid short-term load forecasting model for smart power grids. Sustain. Cities Soc. 2017, 31, 264–275. [Google Scholar] [CrossRef]
  6. Liu, D.; Zeng, L.; Li, C.; Ma, K.; Chen, Y.; Cao, Y.C. A distributed short-term load forecasting method based on local weather information. IEEE Syst. J. 2016, 1, 1–7. [Google Scholar] [CrossRef]
  7. Gross, G.; Galiana, F.D. Short-Term Load Forecasting. Proc. IEEE 1987, 75, 1558–1573. [Google Scholar] [CrossRef]
  8. Hobby, J.D.; Tucci, G.H. Analysis of the residential, commercial and industrial electricity consumption. In Proceedings of the IEEE PES Innovative Smart Grid Technologies Asia (ISGT), Perth, WA, Australia, 13–16 November 2011; pp. 1–7. [Google Scholar]
  9. Rao, G.M.; Narasimhaswamy, I.; Kumar, B.S. Deregulated Power system load forecasting using artificial intelligence. In Proceedings of the IEEE International Conference on Computational Intelligence and Computing Research, Coimbatore, India, 28–29 December 2010; pp. 1–5. [Google Scholar]
  10. Contaxi, E.; Delkis, C.; Kavatza, S.; Vournas, C. The effect of humidity in a weather-sensitive peak load forecasting model. In Proceedings of the IEEE PES Power Systems Conference and Exposition, Atlanta, GA, USA, 29 October–1 November 2006; pp. 1528–1534. [Google Scholar]
  11. Rahman, S.; Hazim, O. A generalized knowledge-based short-term load-forecasting technique. IEEE Trans. Power Syst. 1993, 8, 508–514. [Google Scholar] [CrossRef]
  12. Sailor, D.J.; Munoz, J.R. Sensitivity of electricity and natural gas consumption to climate in the U.S.A.—Methodology and results for eight states. Energy 1997, 22, 987–998. [Google Scholar] [CrossRef]
  13. Hyde, O.; Hodnett, P.F. An adaptable automated procedure for short-term electricity load forecasting. IEEE Trans. Power Syst. 1997, 12, 84–94. [Google Scholar] [CrossRef]
  14. Cancelo, J.R.; Espasa, A.; Grafe, R. Forecasting the electricity load from one day to one week ahead for the Spanish system operator. Int. J. Forecast. 2008, 24, 588–602. [Google Scholar] [CrossRef] [Green Version]
  15. National Weather Service. Available online: http://www.nws.noaa.gov/om/heat/heat_index.shtml (accessed on 12 December 2017).
  16. Chu, W.-C.; Chen, Y.P.; Xu, Z.-W.; Lee, W.-J. Multiregion short-term load forecasting in consideration of HI and load/weather diversity. IEEE Trans. Ind. Appl. 2011, 47, 232–237. [Google Scholar] [CrossRef]
  17. Masterton, J.M.; Richardson, F.A. Humidex: A Method of Quantifying Human Discomfort Due to Excessive Heat and Humidity; Environment Canada, Atmospheric Environment Service, Downsview: Ontario, ON, Canada, 1979; pp. 1–79.
  18. Chen, Y.; Luh, P.B.; Guan, C.; Zhao, Y.; Michel, L.D.; Collbeth, M.A.; Friedlan, P.B.; Rourke, S.J. Short-term load forecasting: similar day-based wavelet neural networks. IEEE Trans. Power Syst. 2010, 25, 322–330. [Google Scholar] [CrossRef]
  19. Kang, C.Q.; Xia, Q.; Liu, M. Power System Load Forecasting, 2nd ed.; China Electric Power Press: Beijing, China, 2017. [Google Scholar]
  20. Rahman, S. Formulation and analysis of a rule-based short-term load forecasting algorithm. Proc. IEEE 1990, 78, 805–816. [Google Scholar] [CrossRef]
  21. Mao, Y.; Yang, F.; Wang, C. Application of BP network to short-term power load forecasting considering weather factor. In Proceedings of the International Conference on Electric Information and Control Engineering (ICEICE), Wuhan, China, 15–17 April 2011; pp. 172–175. [Google Scholar]
  22. Zhang, B.; Shao, C.N.; Zhao, R. Short-term load forecasting based on deviation correction. Electr. Power Autom. Equip. 2015, 35, 152–157. [Google Scholar]
  23. Liu, R.; Fang, G.F. Short-term load forecasting with comprehensive weather factors based on improved Elman neural network. Power Syst. Prot. Control 2012, 40, 113–117. [Google Scholar]
  24. Sailor, D.J. Relating residential and commercial sector electricity loads to climate-evaluating state level sensitivities and vulnerabilities. Energy 2001, 26, 645–665. [Google Scholar] [CrossRef]
  25. Osczevski, R.; Bluestein, M. The new wind chill equivalent temperature chart. Bull. Am. Meteorol. Soc. 2005, 86, 1453–1458. [Google Scholar] [CrossRef]
  26. Ruzic, S.; Vuckovic, A.; Nikolic, N. Weather sensitive method for short term load forecasting in Electric Power Utility of Serbia. IEEE Trans. Power Syst. 2003, 18, 1581–1586. [Google Scholar] [CrossRef]
  27. Taylor, J.W.; Buizza, R. Using weather ensemble predictions in electricity demand forecasting. Int. J. Forecast. 2003, 19, 57–70. [Google Scholar] [CrossRef]
  28. Fang, G.F.; Hu, C.H.; Zheng, Y.H.; Cai, J.M. Study on the method of short-term load forecasting considering summer weather factors. Power Syst. Prot. Control 2010, 38, 100–104. [Google Scholar]
  29. Jiang, J.D.; Cheng, Z.H.; Zhu, M.J. Combined short-term load forecast with accumulated temperature effect. Electr. Power Autom. Equip. 2011, 31, 28–31. [Google Scholar]
  30. Callaway, D.S. Sequential reliability forecasting for wind energy: Temperature dependence and probability distributions. IEEE Trans. Energy Convers. 2010, 25, 577–585. [Google Scholar] [CrossRef]
  31. Kang, C.Q.; Zhou, A.S.; Wang, P. Impact analysis of hourly weather factors in short-term load forecasting and its processing strategy. Power Syst. Technol. 2006, 30, 5–10. [Google Scholar]
  32. Cabezas, H.; Fath, B.D. Towards a theory of sustainable systems. Fluid Phase Equilib. 2002, 3, 194–197. [Google Scholar] [CrossRef]
  33. Zhu, C.Q.; Sun, H.B.; Zhang, B.M. A combinedmodel for short term load forecasting based on maximum entropy principle. Proc. CSEE 2005, 25, 1–6. [Google Scholar]
  34. Sun, H.B.; Chen, J.; Jiang, X.Y.; Zhang, J. Minimization of information loss based hybrid short-term load forecasting part 1—Theory. Proc. CSEE 2016, 36, 4513–4520. [Google Scholar]
  35. Sun, H.B.; Chen, J.; Jiang, X.Y.; Zhang, J. Minimization of information loss based hybrid short-term load forecasting part 2—Algorithm and case study. Proc. CSEE 2016, 36, 4801–4807. [Google Scholar]
  36. Fath, B.; Cabezas, H.; Pawlowski, C.W. Regime changes in ecological systems: An information theory approach. J. Theor. Biol. 2003, 222, 517–530. [Google Scholar] [CrossRef]
  37. Frieden, B.R. Physics from Fisher Information: A Unification; Cambridge University Press: Cambridge, UK, 1998. [Google Scholar]
  38. Frieden, B.R. Probability, Statistical Optics and Data Testing, 3rd ed.; Springer: New York, NY, USA, 2001. [Google Scholar]
  39. Mayer, A.L.; Pawlowski, C.W.; Cabezas, H. Fisher Information and dynamic regime changes in ecological systems. Ecol. Model. 2006, 195, 72–82. [Google Scholar] [CrossRef]
  40. Cai, S.P.; Dai, L.; Ni, H. A new approach to abrupt change detection based on Fisher information. Acta Phys. Sin. 2013, 62, 189204.1–189204.9. [Google Scholar]
  41. Cai, S.P.; Liu, G.H. A novel line selection method for power distribution networks with indirectly earthed neutral. Int. J. Power Energy Syst. 2014, 34, 1–7. [Google Scholar] [CrossRef]
  42. Fisher, R.A. On the mathematical foundations of theoretical statistics. Philos. Trans. R. Soc. Lond. 1922, 222, 309–368. [Google Scholar] [CrossRef]
  43. Eason, T.; Cabezas, H. Evaluating the sustainability of a regional system using Fisher information in the San Luis Basin, Colorado. J. Environ. Manag. 2012, 94, 41–49. [Google Scholar] [CrossRef] [PubMed]
  44. Lapin, L.L. Statistics: Meaning and Method; Harcourt Brace Jovanovich: New York, NY, USA, 1975. [Google Scholar]
  45. Lopez, M.; Valero, S.; Senabre, C.; Gabaldon, A. Analysis of the influence of meteorological variables on real-time short-term load forecasting in Balearic Islands. In Proceedings of the 2017 11th IEEE International Conference on Compatibility, Power Electronics and Power Engineering, CPE-POWERENG, Cadiz, Spain, 4–6 April 2017; pp. 10–15. [Google Scholar]
  46. Drezga, I.; Rahman, S. Input variable selection for ANN-based short-term load forecasting. IEEE Trans. Power Syst. 1998, 13, 1238–1244. [Google Scholar] [CrossRef]
  47. Gao, S.; Shan, Y.D. Novel input variable selection for ANN short-term load forecasting. Autom. Electr. Power Syst. 2001, 22, 41–44. [Google Scholar]
  48. Liu, G. The application of relevance vector machine and feature selection technology in short-term electric load forecasting. Electron. World 2016, 16, 56–58. [Google Scholar]
  49. Hippert, H.S.; Pedreira, C.E.; Souza, R.C. Neural networks for short-term load forecasting: A review and evaluation. IEEE Trans. Power Syst. 2001, 16, 44–55. [Google Scholar] [CrossRef]
  50. Buhari, M.; Adamu, S.S. Short-term load forecasting using artificial neural network. In Proceedings of the International Multi Conference of Engineers and Computer Scientists, Hong Kong, China, 14–16 March 2012; pp. 1–6. [Google Scholar]
Figure 1. Comparison of the curves of daily average load and mean temperature before and after weighted.
Figure 1. Comparison of the curves of daily average load and mean temperature before and after weighted.
Entropy 20 00184 g001
Figure 2. Comparison of the curves of daily average load and THI before and after weighted.
Figure 2. Comparison of the curves of daily average load and THI before and after weighted.
Entropy 20 00184 g002
Figure 3. (a) Forecasting output and error analysis of both Model I and Model II on 10 August 2016; (b) forecasting output and error analysis of Model III before and after FS on 10 August 2016; and (c) forecasting output and error analysis of Model IV before and after FS on 10 August 2016.
Figure 3. (a) Forecasting output and error analysis of both Model I and Model II on 10 August 2016; (b) forecasting output and error analysis of Model III before and after FS on 10 August 2016; and (c) forecasting output and error analysis of Model IV before and after FS on 10 August 2016.
Entropy 20 00184 g003aEntropy 20 00184 g003b
Table 1. The description of input variables.
Table 1. The description of input variables.
Input VariablesDescription
1–2day type and time point for a forecasted load.
3–4load at one and four previous time points before the time point on the day.
5–10temperature and humidity at the time point, one and four previous time points on the day.
11–13load at the time point, one and four previous time points on the same day in last week.
14–19temperature and humidity at the time point, one and four previous time points on the same day in last week.
Table 2. The description of input variables.
Table 2. The description of input variables.
Input VariablesDescription
1–2day type and time point for a forecasted load.
3–4load at one and four previous time points before the time point on the day.
5–7THI at the time point, one and four previous time points on the day.
8–10load at the time point, one and four previous time points on the same day in last week.
11–13THI at the time point, one and four previous time points on the same day in last week.
Table 3. The description of input variables.
Table 3. The description of input variables.
Input VariablesDescription
1–2day type and time point for a forecasted load.
3–4load at one and four previous time points before the time point on the day.
5–6weighted temperature and humidity at the time point on the day.
7–9load at the time point, one and four previous time points on the same day in last week.
10–11weighted temperature and humidity at the time point on the same day in last week.
Table 4. The description of input variables.
Table 4. The description of input variables.
Input VariablesDescription
1–2day type and time point for a forecasted load.
3–4load at one and four previous time points before the time point on the day.
5weighted THI at the time point on the day.
6–8load at the time point, one and four previous time points on the same day in last week.
9weighted THI at the time point on the same day in last week.
Table 5. The similarity among different curves.
Table 5. The similarity among different curves.
ItemLoadTWTTHIWTHI
Load1.000.880.940.910.96
T1.000.920.900.89
WT1.000.910.93
THI1.000.95
WTHI1.00
WT: the weighted temperature; THI: Temperature–humidity index; WTHI: a weighted THI.
Table 6. Comparison of hourly load forecasting output (FS: feature selection).
Table 6. Comparison of hourly load forecasting output (FS: feature selection).
HourActual Load (MW)Model I (%)Model II (%)Model III (%)Model III with FS (%)Model IV (%)Model IV with FS (%)
00:002794.703.040.372.202.783.112.22
01:002718.781.381.690.591.141.010.20
02:002677.851.080.641.031.200.910.23
03:002598.204.540.171.271.220.800.07
04:002524.421.350.191.070.420.750.11
05:002580.443.852.131.511.991.971.05
06:002624.226.321.990.760.230.320.92
07:002617.437.481.852.553.272.571.98
08:002834.886.763.943.934.734.463.60
09:002735.861.722.711.800.251.670.22
10:002852.197.197.191.611.180.011.22
11:002904.613.925.615.150.680.390.06
12:002968.991.394.134.231.471.381.40
13:003148.012.100.541.270.090.520.24
14:003291.500.610.181.052.151.351.88
15:003240.100.532.450.930.050.290.33
16:003175.920.411.930.440.650.060.27
17:003009.971.441.445.860.511.141.18
18:002575.661.951.551.621.844.341.48
19:002548.535.023.670.410.771.890.33
20:002766.079.599.026.436.016.154.72
21:003028.179.535.895.882.160.310.17
22:003047.151.603.130.840.731.882.05
23:002918.876.997.566.772.344.150.30
e MAPE (%)_3.682.972.431.691.981.39
AC (%)_4.543.843.132.222.551.86

Share and Cite

MDPI and ACS Style

Cai, S.; Liu, L.; Sun, H.; Yan, J. Fisher Information Based Meteorological Factors Introduction and Features Selection for Short-Term Load Forecasting. Entropy 2018, 20, 184. https://doi.org/10.3390/e20030184

AMA Style

Cai S, Liu L, Sun H, Yan J. Fisher Information Based Meteorological Factors Introduction and Features Selection for Short-Term Load Forecasting. Entropy. 2018; 20(3):184. https://doi.org/10.3390/e20030184

Chicago/Turabian Style

Cai, Shuping, Lin Liu, Huachen Sun, and Jing Yan. 2018. "Fisher Information Based Meteorological Factors Introduction and Features Selection for Short-Term Load Forecasting" Entropy 20, no. 3: 184. https://doi.org/10.3390/e20030184

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop