Next Article in Journal
Building Integrated Photovoltaics: A Concise Description of the Current State of the Art and Possible Research Pathways
Previous Article in Journal
Review of the Thermo-Physical Properties and Performance Characteristics of a Refrigeration System Using Refrigerant-Based Nanofluids
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Robust Weighted Combination Forecasting Method Based on Forecast Model Filtering and Adaptive Variable Weight Determination

1
College of Mechatronic Engineering, Beifang University of Nationalities, Yinchuan 750021, China
2
State Key Laboratory of Robotics and System, Harbin Institute of Technology (HIT), Harbin 150001, China
3
State Grid Ningxia Electric Power Design Co. Ltd., Yinchuan 750001, China
4
School of Management, Qingdao Technological University, Qingdao 266520, China
5
College of Electrical &Information Engineering, Hunan University, Changsha 410082, China
6
College of Electrical Engineering and Information, Sichuan University, Chengdu 610065, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Energies 2016, 9(1), 20; https://doi.org/10.3390/en9010020
Submission received: 12 August 2015 / Revised: 21 September 2015 / Accepted: 19 October 2015 / Published: 31 December 2015

Abstract

:
Medium-and-long-term load forecasting plays an important role in energy policy implementation and electric department investment decision. Aiming to improve the robustness and accuracy of annual electric load forecasting, a robust weighted combination load forecasting method based on forecast model filtering and adaptive variable weight determination is proposed. Similar years of selection is carried out based on the similarity between the history year and the forecast year. The forecast models are filtered to select the better ones according to their comprehensive validity degrees. To determine the adaptive variable weight of the selected forecast models, the disturbance variable is introduced into Immune Algorithm-Particle Swarm Optimization (IA-PSO) and the adaptive adjustable strategy of particle search speed is established. Based on the forecast model weight determined by improved IA-PSO, the weighted combination forecast of annual electric load is obtained. The given case study illustrates the correctness and feasibility of the proposed method.

1. Introduction

Nowadays, the strong smart grid (SSG) is vigorously being constructed and the renewable distributed electricity generation capacity is steadily increasing. As an important basis to ensure the security and stable operation of the electric system, electric load forecasting is playing a more and more important role in the implementations of energy policies and the investment decision-making of the electric department under this background [1,2,3,4,5,6,7,8,9,10]. However, medium-and-long-term load forecasting has non-linear characteristics caused by the influence of various factors (e.g., national policy, economic and social factors) [7,8,9]. It makes the medium-and-long-term load forecasting much complex and uncertain. Thus, how to improve the robustness and accuracy of annual electric load forecasting is very worthy of study.
On the one hand, the forecast accuracy has reached a high level under the given sample and condition in the existing research [1,2,3,4,5,6,7,8,9,10,11,12,13], but the forecast method’s robustness currently becomes the bottleneck. The main reason lies in the status that the existing methods are mostly based on the error theory. By the error theory, the fact of the unknown amount of the forecast year’s true load is neglected and the accurate forecast error is difficult to obtain. Chen [14,15] pointed out that the validity degree of the forecast model can be expressed by its full and average precision. In the mathematical sense, any forecast model has its inherent attributes which can be measured by its validity rather than the result error reported by Sun et al. [16], Chen et al. [17] and Jin et al. [18]. At the same time, the single forecast models should be filtered so that the better ones will be selected and the worse ones will be eliminated. Therefore, filtering the forecast models to select the better ones based on their comprehensive validity degrees can improve the robustness of the forecast method.
On the other hand, the general load forecast method mainly includes artificial neural network reported by Hernandez et al. [19] and Gofman et al. [20], regression analysis reported by Li et al. [21] and time series analysis reported by Paparoditis et al. [22]. The exponential smoothing method reported by Weron et al. [23], the gray forecast method reported by Li et al. [24] based on time trend extrapolation reported by Ismail et al. [25], the clustering forecast method reported by Kodogiannis et al. [26] and multiple regression analysis method reported by Hong et al. [27] based on the load related factor analysis cannot ensure the satisfactory result in any case. In order to make full use of the advantages and the contained information of each single forecast model, combination forecast [28,29,30,31,32,33] is an effective method. The question of how to determine the weight assignment of single forecast method is a difficult point in combination forecasting. The constant weight and the variable weight are two common weight determination ways, and the variable weight has a better adaptability. Focusing on this point, a large number of research has been carried out such as mathematical programming method reported by Ma et al. [34], genetic algorithm reported by Chaturvedi et al. [2], Bayesian method reported by Niu et al. [35] and neural network method reported by Hernandez et al. [19] and Gofman et al. [20]. These methods mostly have the stabile performance and the accuracy meeting the application requirements, but there are still several problems such as much complexity, slow convergence or strong status dependence.
In this paper, we propose a robust weighted combination forecasting method based on forecast model filtering and adaptive variable weight determination. Firstly, the similar years are selected from the sample history years according to the similarity between the history year and the forecast year. Secondly, the forecast models are filtered based on the comprehensive validity degree which is composed of the fitted validity degree in the history year interval and the estimated forecast validity degree in the forecast year interval. Thirdly, the improved Immune Algorithm-Particle Swarm Optimization (IA-PSO), in which the disturbance variable is introduced and the adaptive adjustable strategy of particle search speed is established, is used to determine the adaptive variable weight of the selected forecast models. Lastly, the weighted combination forecast is carried out. The flowchart of the proposed method is shown in Figure 1.

2. Similar Years Selection

There are a series of factors influencing the annual load. For example, in order to reflect the influence of each factor on the load forecasting result, Zhu et al. [36] have investigated an Artificial Neural Network-based approach for medium-and-long-term load forecasting. In the proposed three-layer back propagation network, seven factors are selected as inputs which include Gross Domestic Product (GDP), heavy industry production, light industry production, agriculture production, primary industry, secondary industry and tertiary industry; Wang et al. [37] have pointed out that there are mainly eight factors affecting the annual load: area GDP, primary industry GDP ratio, secondary industry GDP ratio, tertiary-industry GDP ratio, power consumption per unit of GDP, electricity price, urban per-capita income and rural per-capita income; Lei et al. [38] have analyzed the variation characteristics of the annual maximum load, annual minimum load and typical daily load based on the recent historical load data and meteorological data of Chongqing region. Then, they have studied the interrelationship between load characteristics and major influencing factors. The results show that the temperature, rainfall, holidays and festivals have a significant influence on the region power load; Liao et al. [39] have researched the current load characteristics of Changde region and main factors influencing load variation. Influencing extents of main influencing factors on regional load, respective proportions of these factors in the influences and the time periods influenced by these factors are analyzed, and the quantization analysis on the relation between load and influencing factors is performed. In conclusion, the factors considered by Wang et al. [37] are more than Zhu et al. [36], and the factors considered by Lei et al. [38] and Liao et al. [39] relate to the characteristics in short-term load forecasting. After comparison and summary, we choose the eight factors pointed out by Wang et al. [37] which are more comprehensive and reasonable than others to select the similar years in medium-and-long-term load forecasting.
Figure 1. The flowchart of the proposed robust weighted combination forecasting method.
Figure 1. The flowchart of the proposed robust weighted combination forecasting method.
Energies 09 00020 g001
It is assumed that there are nhis history years {1, 2, …, nhis} and nfo forecast years {nhis + 1, nhis + 2, …, nhis + nfo}. The year characteristic is defined as the factor affecting the annual load and the year characteristic quantity is defined as the value of year characteristic in a history year.
If the year characteristic CHa is efficiency type, the year characteristic quantity CHQa,b which means the value of the year characteristic CHa of the history year HYb is standardized as follows:
C H Q a , b = C H Q a , b max { C H Q a , 1 , C H Q a , 2 , , C H Q a , n h i s }
where a = 1, 2, …, nch, b = 1, 2, …, nhis.
If the year characteristic CHa is cost type, the year characteristic quantity CHQa,b which means the value of the year characteristic CHa of the history year HYb is standardized as follows:
C H Q a , b = min { C H Q a , 1 , C H Q a , 2 , , C H Q a , n his } C H Q a , b
where a = 1, 2, …, nch, b = 1, 2, …, nhis.
For two individuals whose characteristics have the same dimension number, distance and similarity are usually used to measure the difference between them. The distance measures the absolute distance between two individuals in space which is directly related to the position coordinates of each individual (i.e., the numerical value of each characteristic dimension), but the similarity measures the angle between two individual vectors and reflects more difference in direction that in difference in position [40,41,42]. Therefore, similarity is more suitable to measure the difference between a history year and a forecast year.
Here, the most common cosine similarity is chosen [41,42]. The cosine similarity between the history year HYb and the forecast year FYc is as follows:
C S I b , c = a = 1 n c h C H Q a , b C H Q a , c ( a = 1 n c h C H Q a , b 2 ) ( a = 1 n c h C H Q a , c 2 )
where b = 1, 2, …, nhis, c = nhis + 1, nhis + 2, …, nhis + nfo.
Lastly, nshis (nshis < nhis) history years with the highest similarity are selected as the similar years of the forecast year FYc.

3. Forecast Model Filtering

3.1. Forecast Model Validity Degree

As shown in Figure 2, the load sequence of similar years is { y 1 , y 2 , , y n s   h i s } , and the load sequence of forecast years is { y n h i s + 1 , y n h i s + 2 , ... , y n h i s + n f o } . We assume that there are nfm forecast models. By the forecast model FMd (d = 1, 2, …, nfm), y e , d is the fitted value of the similar year e (e = 1, 2, …, nshis), and yc,d is the forecast value of the forecast year c (c = nhis + 1, nhis + 2, …, nhis + nfo).
Figure 2. The history years and the forecast years.
Figure 2. The history years and the forecast years.
Energies 09 00020 g002
The fitted value relative error of FMd for the similar year e is as follows:
R E e , d = y e y e , d y e
The forecast value relative error of FMd for the forecast year c is as follows:
R E c , d = y c y c , d y c
Then, the fitted precision of FMd for the similar year e is as follows:
P e , d = { 1 | R E e , d | , 0 | R E e , d | 1 0    ,      | R E e , d | 1
The forecast precision of FMd for the forecast year c is as follows:
P c , d = { 1 | R E c , d | , 0 | R E c , d | 1 0    ,      | R E c , d | 1
Lastly, the fitted validity degree of FMd is as follows:
F I V d = EXP ( P e , d ) ( 1 σ ( P e , d ) )
where EXP ( P e , d ) = 1 n s h i s e = 1 n s h i s P e , d and σ ( P e , d ) = 1 n s h i s e = 1 n s h i s ( P e , d EXP ( P e , d ) ) 2 are the expectation and the standard deviation of the fitted precision of FMd for the similar year e, respectively.
The forecast validity degree of FMd is as follows:
F O V d = EXP ( P c , d ) ( 1 σ ( P c , d ) )
where EXP ( P c , d ) = 1 n f o c = n h i s + 1 n h i s + n f o P c , d , σ ( P c , d ) = 1 n f o c = n h i s + 1 n h i s + n f o ( P c , d EXP ( P c , d ) ) 2 are the expectation and the standard deviation of the forecast precision of FMd for the forecast year c, respectively.

3.2. Forecast Model Precision Estimation

The validity degree of a forecast model is defined in Equation (9) which can depict its credibility and is a reflection of its inherent property [16,17,18]. Obviously, true value has not yet occurred and the forecast error cannot be obtained in the future forecast interval. We can only estimate the precision and the validity of a forecast model based on its inherent property. Then, the suitable forecast models are selected and the combination forecast model is put forward.

3.2.1. Markov Chain-Based Precision Range Estimation

As an inherent property, the forecast model precision is shown in the form of the fitted precision which is obtained through the virtual forecast for the multi-time load. Using the forecast model FMd to forecast the load of the similar year e (e = 1, 2, …, nshis), we can obtain the fitted precision sequence { P 1 , d , P 2 , d , , P n s   h i s , d } . In this sequence, the expectation and the standard deviation of the fitted precision of each similar year show the property of the forecast model. As is known, randomness and discreteness appear in the fitted precision sequence. Therefore, we can use the Markov chain transition matrix [43,44] to represent the transition rule as follows:
(1) The fitted precision distribution interval of FMd for the similar year e is divided into nsi (nsinshis) sub-intervals with equal distance as S d 1 , S d 2 , ... , S d n s i , where
S d g = [ S d g _ , S d g ¯ ] , S d 1 _ = min { P 1 , d , P 2 , d , , P n s h i s , d } , S d n s i ¯ = max { P 1 , d , P 2 , d , , P n s h i s , d } , S d g ¯ S d g _ = S d h ¯ S d h _ s . t . g h , g , h = 1 , 2 , ... , n s i
Each fitted precision sub-interval can be considered as a fitted precision state.
(2) According to the fitted precision of FMd for the similar year e, the occurrence number of the fitted precision state S d g is OCg (OCg < nshis). That means there are OCg times which belong to the fitted precision state S g d . The transition number from the fitted precision state S d h to S d g is TRNh,g. Thus, the transition probability of FMd from the fitted precision state S d h to S d g can be obtained as follows:
T P d h , g = { S d g = c g | S d h = c k } = T R N h , g O C g
According to Equation (11), the one-step state transition matrix of FMd is as follows:
T M d ( 1 ) = [ T P d 1 , 1 T P d 1 , 2 T P d 1 , n s i T P d 2 , 1 T P d 2 , 2 T P d 2 , n s i T P d n s i , 1 T P d n s i , 2 T P d n s i , n s i ]
The q-step state transition matrix is as follows:
T M d ( q ) = ( T M d ( 1 ) ) q
(3) We construct an initial vector IVd whose elements are the occurrence numbers of each fitted precision state of FMd. Though multiplying the initial vector IVd with the q-step state transition matrix T M d ( q ) , the new state matrix of FMd is obtained as follows:
S M d = I V d T M d ( q )
(4) We calculate the sum of each column vector of SMd. If the column vector CVEi (i = 1, 2, …, nsi) has the maximum sum, the forecast precision will belong to the precision state S d i .

3.2.2. Cloud Model-Based Precision Estimation

Due to the various factors, the fitted precision sequence P d = { P 1 , d , P 2 , d , ... , P n s   h i s , d } of the forecast model FMd in nshis similar history years clearly has the property of random variables. As a result, the forecast precision is uncertain in its precision range. Therefore, we can describe this uncertainty by the expectation, entropy and hyper-entropy in the precision range and make the quantitative precision estimation based on the normal cloud model [45,46] as follows:
(1) Firstly, we construct a backward cloud generator. We can map the fitted precision sequence P d = { P 1 , d , P 2 , d , ... , P n s   h i s , d } into the normal cloud model. In this normal cloud model, the input is P d and the output is the expectation Exd, the entropy End and the hyper-entropy Hed. The algorithm of this backward cloud generator is as follows:
E x d   =   EXP   ( P d )
E n d = π 2 1 n s   h i s e = 1 n s   h i s | P e , d E x d |
H e d = 1 n s   h i s 1 e = 1 n s   h i s ( P e , d E x d ) 2 ( E n d ) 2
(2) Secondly, we construct a forward cloud generator. The input is Exd, End, Hed and the constraint is the precision range S g d . The algorithm using the forward cloud generator for precision estimation is as follows:
P c , d = NORM ( E x d , E n d )
where E n d = NORM ( E n d , H e d ) is a normal random number with the expectation End and the variance Hed, P c , d is the estimated forecast precision with the expectation Exd and the variance E n d in the precision range S g d .

3.3. Forecast Model Filtering Based on Comprehensive Validity Degree

The comprehensive validity degree of the forecast model FMd in the whole interval [1, nshis] [nhis + 1, nhis + nfo] is as follows:
C V d = α F I V d + ( 1 α ) F O V d
where FIVd is the fitted validity degree of FMd in the similar history years interval [1, nshis], FOVd is the estimated forecast validity degree of FMd in the forecast years interval [nhis + 1, nhis + nfo].
The empirical coefficient α ( 0 α 1 ) is determined by the forecast staff based on their experiences. The bigger α is, the more important FIVd is. Here α = 0.5, which means that FIVd and FOVd are equally important.
We use the mean comprehensive validity degree of nfm forecast models as the forecast model filtering threshold:
C V ¯ = d = 1 n f m C V d
If C V d C V ¯ , the forecast model FMd will be selected for the combination forecast, else it will be eliminated. In the applications, the filtering threshold can be adjusted according to the actual situation and forecast decision-makers’ experiences.

4. Forecast Model Weight Determination and Combination Forecast

4.1. Mathematical Description

After the forecast models filtering, nsfm (nsfm < nfm) better forecast models are selected from nfm forecast models for combination forecast.
We assume that the weight of the selected forecast model SFMj (j = 1, 2, …, nsfm) is ω j ( j = 1 n s f m ω j = 1 , 0 ≤ ω j ≤ 1). For the similar history year e (e = 1, 2, …, nshis), the actual load is y e and the forecast load by SFMj is y e , j , so the combination forecast load of nsfm selected forecast models is as follows:
y e = j = 1 n s f m ω j y e , j
The target is to achieve the minimum square sum of the combination forecast error, so the mathematical description is described as follows:
min e = 1 n s h i s ( j = 1 n s f m ω j y e , j y e ) 2 ,  s.t . j = 1 n s f m ω j = 1 , 0 ω j 1

4.2. Improved Immune Algorithm-Particle Swarm Optimization (Improved IA-PSO)

4.2.1. Particle Swarm Optimization (PSO)

As a kind of stochastic optimization algorithm, particle swarm optimization (PSO) [47] is developed based on the simulation of bird-group foraging behavior. To search for the optimal solution, the individuals have to cooperate and compete with each other in PSO.
There is an m-dimensional space and an initial swarm PO composed of npa particles as follows:
P O = { P A 1 , P A 2 , , P A n p a }
For the particle PAl (l = 1, 2, …, npa), its position xl and its speed vl are expressed by two m-dimensional vectors as follows:
xl = (xl,1, xl,1, …, xl,m)T
vl = (vl,1, vl,1, …, vl,m)T
Each particle is moving in the solution space, and its direction is determined by its speed. The speed and position of the particle is continuously updated as follows:
v l ( k + 1 ) = w v l ( k ) + L F 1 r a n d 1 ( k ) ( p b e s t l ( k ) x l ( k ) ) + L F 2 r a n d 2 ( k ) ( G b e s t l ( k ) x l ( k ) )
x l ( k + 1 ) = x l ( k ) + v l ( k + 1 )
where v l ( k ) and x l ( k ) are the speed and position of the particle PAl in the iteration iterk, LF1 and LF2 are the learning factors, and 0 ≤ r a n d 1 ( k ) , r a n d 2 ( k ) ≤ 1 are two random numbers.
  • The momentum part w v l ( k ) represents the trust in its current motion state where w is the inertia coefficient used to control the influence of the speed v l ( k ) on the speed v l ( k + 1 ) . This part provides a necessary momentum which enables the particle to carry on the inertia motion based on its speed.
  • The individual cognitive part L F 1 r a n d 1 ( k ) ( p b e s t l ( k ) x l ( k ) ) represents the particle self-thinking behavior. This part encourages the particle to fly to the best position found by itself.
  • The social cognitive part L F 2 r a n d 2 ( k ) ( G b e s t l ( k ) x l ( k ) ) represents the information sharing and cooperation of different particles. This part guides the particle to fly to the best position of the group.
Therefore, the momentum part w v l ( k ) represents the particles’ diversification; the individual cognitive part L F 1 r a n d 1 ( k ) ( p b e s t l ( k ) x l ( k ) ) and the social cognitive part L F 2 r a n d 2 ( k ) ( G b e s t l ( k ) x l ( k ) ) represent the particles’ centralization. The main performance of PSO is determined by the balance of the three parts.
In early evolution, PSO has the advantage of fast convergence speed and simple operation, so it can be used for solving the nonlinear, non-differentiable and multi-peak complex optimization problems. But in late evolution, PSO has a significantly slower convergence speed and reaches a poor accuracy, so it is easy to fall into the local optimum.

4.2.2. Immune Algorithm-Particle Swarm Optimization (IA-PSO)

To solve this problem, IA-PSO introduces biological immune system’s specific information processing mechanism (e.g., immune memory, immune regulation and immune selection) into PSO’s basic framework [48,49].
  • Immune memory: The immune system keeps the antibodies opposing against the invading antigen as memory cells. If the same antigen invades again, the memory cells will be activated and produce a large number of antibodies. In IA-PSO, this idea is used to preserve the excellent particle. The best position p b e s t l ( k ) searched by each particle up to now is considered as a memory cell. If the new born particles are detected not to meet the requirements, they will be replaced by the memory cells.
  • Immune regulation: In IA-PSO, immune regulation is used for particle selection. If a particle has a strong affinity or a low concentration, it will be promoted. Otherwise, it will be demoted. Therefore, the particle diversification can be always kept. The selected probability [48,49] of the particle PAl is as follows:
    P R O l = χ P R O l 1 + ( 1 χ ) P R O l 2
    In Equation (28), P R O l 1 = A F l / u = 1 all A F u represents the selected probability determined by the affinity where AFl is the affinity of the particle PAl, P R O l 2 = C O N l 1 / u = 1 all C O N u 1 represents the selected probability determined by the concentration where CONl is the concentration of the particle PAl. χ represents the weight of PROl1 and 1 − χ represents the weight of PROl2 (0 ≤ χ ≤ 1).
  • Immune selection: In the immune system, vaccinating means to change several components of the antibody according to the vaccination. In IA-PSO, the group best position G b e s t i ( k ) up to the iteration iterk can be considered as the closest one to the optimal solution. Thus, we use several components of G b e s t i ( k ) as the vaccination to vaccinate the particles and calculate the particle fitness value for immune selection. If the particle fitness value after the vaccination is lower than its parent, the vaccination will be abolished. Otherwise, the particle will be retained.
IA-PSO, which inherits the global optimization ability of PSO and the immune information processing mechanism of IA, can improve the algorithm accuracy. But at the same time, the algorithm complexity is increased because of the introduction of the immune system.

4.2.3. Improved IA-PSO Based on Disturbance Variable

By introducing a disturbance variable and establishing the searching speed adaptive mechanism, we improve IA-PSO in this paper. Through this improvement, not only the diversity of particles can be ensured to avoid the local optimum, but also the accuracy and convergence speed can be increased.
To solve the mathematical problem described in Equation (22), the search space is set as nsfm-dimensional, the particles number is npa, and the maximum iteration number is iterMax. The position of each particle is an nsfm-dimensional vector in which each dimensional represents the weight of a selected forecast model. After the iteration iterk (iterk = 1, 2, …, iterMax), the position and speed of the particle PAl (l = 1, 2, …, npa) are as follows:
x l ( k ) = ( x l , 1 ( k ) , x l , 2 ( k ) , , x l , n s f m ( k ) ) T
v l ( k ) = ( v l , 1 ( k ) , v l , 2 ( k ) , , v l , n s f m ( k ) ) T
p b e s t l ( k ) , g b e s t ( k ) , G b e s t ( k ) are used to represent the best position searched by the particle PAl in the iteration iterk, the best position searched by the particle swarm in the iteration iterk, the best position searched by the particle swarm up to the iteration iterk, respectively.

Introducing Disturbance Variable into IA-PSO

In the production process of a new particle swarm, the particle position is updated according to Equation (27), and the step length is represented by Equation (26). The coefficients of the three parts in Equation (26) are randomly changed, but only the rules of the particle to follow the order are changed. It means that the step length only reflects the rules of the search behavior and the diversification is lacking. Group optimization should be a balance between the order following and random irrational behaving, so we introduce the disturbance term [50,51] to Equation (26). In each iteration, the particle position is updated as follows:
x l ( k ) = x l ( k 1 ) + v l ( k ) + ( r a n d 3 ( k ) 0.5 ) β l ( k )
where 0 ≤ r a n d 3 ( k ) ≤ 1 is a random number subject to uniform distribution, β l ( k ) > 0 is the disturbance variable of the particle PAl in the iteration iterk. The disturbance term ( r a n d 3 ( k ) 0.5 ) β l ( k ) reflects an unpredictable random behavior.
The disturbance variable β l ( k ) controls the random decision-making behavior strength of the particle PAl in the iteration iterk. If β l ( k ) is too big, the awareness of order following will be submerged. If β l ( k ) is too small, the population diversity and global search ability will be reduced. Therefore, the disturbance variable of each particle should be adjusted in the algorithm operation according to its evolution speed. The adjustment can make the particle swarm with a good generalization ability and convergence speed in the evolution process. Thus β l ( k ) is defined as follows:
β l ( k ) = β min F ( p b e s t l ( k 1 θ ) ) F ( p b e s t l ( k 1 ρ ) ) F ( p b e s t l ( k 1 ) ) F ( p b e s t l ( k 1 θ ) ) s.t.  p b e s t l ( k 1 ) p b e s t l ( k 1 θ ) p b e s t l ( k 1 ρ )    θ < ρ , θ = min { 1 , 2 , , k } , ρ = min { 1 , 2 , , k }
where β min is the minimum value of the disturbance variable, F ( ) is the fitness function, p b e s t l ( k 1 ) , p b e s t l ( k 1 θ ) , p b e s t l ( k 1 ρ ) is the best position found by the particle PAl in the iteration iterk−1, i t e r k 1 θ , i t e r k 1 ρ . When the evolution begins, β l is bigger which means that the particle PAl has a step length with strong randomness. After several iterations, β l tend to β min which means that the step length randomness of the particle PAl becomes weak.
Through the introduction of the disturbance term, Equation (31) reflects the positive and negative sides of the particle updating decision. In Equation (31), the first part x l ( k 1 ) is the original position, the second part v l ( k ) reflects the step length of order following, and the third part ( r a n d 3 ( k ) 0.5 ) β l ( k ) reflects the step length of random irrational behaving. Due to the disturbance variable, the particle position updating can be ensured and a strong search desire can be kept even if the local optimum appears when compared with Equation (26). As a result, the premature convergence problem can be overcome, the local best solution can be prevented and the algorithm accuracy can be improved.

Establishing the Adaptive Adjustable Strategy of Particle Searching Speed

In the particle searching process, the searching speed should be adaptively adjusted to accelerate the convergence based on the diversity of particles. For the excellent particles, their searching speeds should be decreased to make them quickly be close to the global best solution, and then the convergence can be accelerated. For the poor particles, their searching speed should be adjusted according to the convergence degree of the particle swarm: if the swarm individuals tend to be dispersed, the searching speed should be reduced and the swarm development ability should be enhanced to strengthen the local optimization; if the swarm individuals tend to be converged (the algorithm falls into local optimum), the searching speed should be increased and the swarm detection ability should be enhanced to effectively jump out of the local optimum and achieve the accelerated convergence [42,43].
In iteration iterk, the fitness of the particle PAl is P F l ( k ) , the fitness of the best particle is P F 0 ( k ) , the average fitness of the swarm is P F avg ( k ) , and the average fitness of the particles whose fitness are bigger than P F avg ( k ) is P F AVG ( k ) . We use Δ ( k ) = | P F 0 ( k ) P F AVG ( k ) | to evaluate the swarm convergence degree in the iteration L. According to the particle fitness, the swarm is divided into three sub-swarms: P F l ( k ) > P F AVG ( k ) , P F avg ( k ) < P F l ( k ) < P F AVG ( k ) , P F l ( k ) < P F avg ( k ) . For the different sub-swarms, we take different adjustment operations to their searching speeds as follows:
(1) P F l ( k ) > P F AVG ( k )
As the excellent individuals in the swarm, these particles have been relatively close to the global optimum. Their searching speeds should be reduced to prevent from jumping out of the global optimum. So the searching speed is adjusted as follows:
v new , l ( k ) = ( 1 0.5 P F l ( k ) P F AVG ( k ) | Δ ( k ) | ) v l ( k )
If the particle is more excellent, it will have a lower searching speed. As a result, the local optimum ability is strengthened and the convergence is accelerated.
(2) P F avg ( k ) < P F l ( k ) < P F AVG ( k )
As the general individuals of the swarm, both the local optimum ability and the global optimum ability of these particles are good. Therefore, we don’t adjust their searching speeds.
(3) P F l ( k ) < P F avg ( k )
These particles are the relatively poor individuals in the swarm. The searching speed is adjusted as follows:
v new , l ( k ) = ( 1.5 1 1 + η 1 exp ( η 2 Δ ( k ) ) ) v l ( k )
where η 1 , η 2 > η 1 , η 2 > 0 and η 1 is used to control the upper limit of v new , l ( k ) . If η 1 is bigger, the upper limit of v new , l ( k ) will be bigger. Here we choose η 1 = η 2 = 4. Δ ( k ) ≥ 0, so v new , l ( k ) [ 0.5 v i ( L ) , 1.3 v i ( L ) ] .
In the searching process, v new , l ( k ) of these particles is dynamically and adaptively adjusted according to the value of Δ ( k ) : if the individuals tend to be dispersed ( Δ ( k ) becomes bigger), v new , l ( k ) will be reduced and the swarm development ability will be enhanced to strengthen the local optimization; if the individuals tend to be converged ( Δ ( k ) becomes smaller), v new , l ( k ) will be increased and the swarm detection ability will be enhanced to effectively jump out of the local optimum.

4.3. Implementation Steps of Forecast Model Weight Determination Based on Improved IA-PSO

The flowchart of forecast model weight determination (FMWD) based on improved IA-PSO is shown in Figure 3.
Figure 3. The flowchart of forecast model weight determination based on improved IA-PSO.
Figure 3. The flowchart of forecast model weight determination based on improved IA-PSO.
Energies 09 00020 g003
The implementation steps are as follows:
Step 1: Initialization. It is assumed that the elements of the particle position vector x all belong to the interval [0, 1], the elements of the particle speed vector v all belong to the interval [−vmax, vmax], the maximum iteration number is itermax and the initial value of the iteration number k is k = 0. npa particles are randomly generated. The particle PAl (l = 1, 2, …, npa) has the position x l ( 0 ) and the flying speed v l ( 0 ) .
Step 2: Calculate p b e s t l ( k ) , g b e s t l ( k ) , G b e s t l ( k ) and the fitness of the particle PAl. Here, the fitness of the particle PAl can be represented by the target function in Equation (22) as follows:
F l ( k ) = e = 1 n s h i s ( j = 1 n s f m x l , j ( k ) y e , j y e ) 2
Step 3: Obtain the new generation of particles. The particle speed is adjusted based on the adaptive adjustable mechanism and the new position x l ( k + 1 ) and new speed v l ( k + 1 ) can be obtained according to Equations (26) and (31). The elements of the new speed v l ( k + 1 ) must belong to the interval [−vmax, vmax].
Step 4: Check the new generation of particles. If the position of a new particle is an infeasible solution (one or more elements of the new position x l ( k + 1 ) don’t belong to the interval [0, 1]), this particle will be replaced with the memory particle p b e s t l ( k ) .
In addition, another napa particles meeting the requirements will be randomly added. According to Equation (28), npa particles are selected from the npa + napa particles based on the affinity and concentration of antibody and antigen.
Step 5: Vaccinate. One particle is randomly selected from npa new particles. Then, one element is randomly selected from the front t − 1 elements of G b e s t l ( k ) to exchange with the selected particle at the corresponding element. The tth element of the selected particle is calculated as follows:
x l , t ( k + 1 ) = 1 j = 1 t 1 x l , j ( k + 1 )
Hence the vaccination is finished.
Step 6: Immune selection. The particle fitness after the vaccination is calculated. If the particle fitness value after the vaccination is lower than its parent, the vaccination will be abolished. Else, the particle will be retained.
Step 7: Looping execution of Steps 4 and 5 for r times (r times vaccination). A new generation of npa particles is obtained.
Step 8: Judge whether the algorithm should be stopped. The stopping of algorithm is usually determined by the maximum iteration number or the precision. If the algorithm meets its stopping condition, the optimization will be stopped. Else, k = k + 1 and go back to Step 2 to continue.

4.4. Weighted Combination Forecast

Based on the improved IA-PSO, the weight ω j of the selected forecast model SFMj (j = 1, 2, …, nsfm) is obtained where j = 1 n s f m ω j = 1 , 0 ≤ ω j ≤ 1. For the selected forecast model SFMj, y c , j is the forecast value of the forecast year c (c = nhis + 1, nhis + 2, …, nhis + nfo). So the weighted combination forecast (WCF) value of the forecast year c is as follows:
y c = j = 1 n s f m ω j y c , j

5. Case Study

We use the proposed method to forecast a Chinese province’s load of 2005 based on its loads of the history years from 1998 to 2004. The year characteristic quantity data from 1998 to 2005 is shown in Table 1.
The cosine similarity between the history years 1998–2004 and the forecast year 2005 is shown in Table 2.
Table 1. The year characteristic quantity data from 1998 to 2005 [37,52].
Table 1. The year characteristic quantity data from 1998 to 2005 [37,52].
YearArea GDP (108 Yuan)Primary Industry GDP Ratio (%)Secondary Industry GDP Ratio (%)Tertiary Industry GDP Ratio (%)Power Consumption per Unit of GDP (kWh/Yuan)Electricity Price (Yuan/kWh)Urban per-Capita Income (Yuan)Rural per-Capita Income (Yuan)
19989686.612.9550.2236.830.1560.4083005.211896.56
19999802.812.9049.9037.200.1520.4083859.862003.63
20009912.312.8850.0737.050.1500.4104663.232150.36
200110,626.612.8350.0637.110.1480.4125551.912340.14
200211,586.512.8049.6937.510.1420.4126599.242485.86
200312,955.212.3850.7536.870.1390.4127370.652657.93
200415,133.912.6851.6335.690.1320.4198245.553103.98
200517,140.812.7949.6237.590.1250.4449227.553391.82
Table 2. The cosine similarity between the history years 1998–2004 and the forecast year 2005.
Table 2. The cosine similarity between the history years 1998–2004 and the forecast year 2005.
YearCosine Similarity (%)
199898.56
199995.21
200094.66
200199.60
200297.52
200396.37
200498.63
According to forecasting decision-makers’ experiences, we choose the threshold value of similar years selection as 96%. Therefore, the five history years (1998 and 2001–2004) with highest similarity are selected as the similar years of the forecast year 2005.
The power consumption of the province in 1998 and 2001–2005 is shown in Table 3.
Table 3. The power consumption of the Chinese province in 1998 and 2001–2005 (108 kWh) [16,52].
Table 3. The power consumption of the Chinese province in 1998 and 2001–2005 (108 kWh) [16,52].
Year199820012002200320042005
Power Consumption437.85557.58628.82725.20833.01946.33
The forecast values by eleven forecast models are shown in Table 4.
By the validity degree calculation method based on Markov chain and cloud model, the comprehensive validity degrees of the eleven forecast models are obtained as shown in Table 5.
The forecast model filtering threshold of the eleven forecast models is c v d ¯ = 81.46 % . Therefore, the forecast models FM4, FM5, FM8, FM9, FM11 are selected for the combination forecast and the others are eliminated. Respectively, we use PSO, IA-PSO and improved IA-PSO for the forecast model weight determination. There are five selected forecast models SFM1 (FM4: Power function model), SFM2 (FM5: S-curve model), SFM3 (FM8: Cubic curve model), SFM4 (FM9: Artificial neural network method) and SFM5 (FM11: Grey system method). The parameters are set as follows: npa = 100, nsfm = 5, vmax = 1, itermax = 1000, w = 0.6, LF1 = LF2 = 2, napa = 30, r = 25, β min = 0.001, η 1 = η 2 = 4. The results of forecast model weights determination (FMWD) by PSO, IA-PSO and improved IA-PSO, which are abbreviated as FMWD-PSO, FMWD-IA-PSO, FMWD-improved-IA-PSO, are shown in Table 6.
Table 4. The forecast power consumption by the eleven forecast models (108 kWh).
Table 4. The forecast power consumption by the eleven forecast models (108 kWh).
Forecast Model199820012002200320042005
FM1: Exponential model (y = 780.65e−0.82/x)343.91636.02662.58680.93694.36704.55
FM2: Logarithm model (y = 362.13 + 188.39lnx)362.09623.33665.32699.65728.73753.90
FM3: Hyperbola model (y = 722.84 − 354.37/x)368.54634.24652.01663.76672.21678.53
FM4: Para-curve model (y = 431.79 − 3.58x + 8.17x2)436.88556.81631.53723.67833.29960.30
FM5:Grey system method [53]437.94567.04642.01727.00823.23932.08
FM6: COMPERTZ model (lny = 6.46 − 1.29ex)400.04626.88636.27639.75641.14641.64
FM7: Power function model (y = 358.90x0.385)358.90612.43667.48716.12759.91800.02
FM8: Cubic curve model (y = 432.10 − 3.94x + 8.81x2 − 0.0087x3)437.04556.81631.56723.84833.17960.04
FM9: Artificial neural network method [36]406.81583.02658.81725.38775.44809.03
FM10: S-curve model (y−1 = 0.0015 + 0.0039ex)345.60649.47669.01676.35679.22680.30
FM11: Exponential smoothing method [54]437.85544.91615.01708.33816.72892.51
Table 5. The comprehensive validity degree of the eleven forecast models based on the real and forecast power consumptions in 1998 and 2001–2005.
Table 5. The comprehensive validity degree of the eleven forecast models based on the real and forecast power consumptions in 1998 and 2001–2005.
Forecast ModelComprehensive Validity Degree (%)
FM181.09
FM277.90
FM375.56
FM484.32
FM585.80
FM679.03
FM774.87
FM888.91
FM987.23
FM1078.45
FM1182.91
Table 6. The results of FMWD-PSO, FMWD-IA-PSO and FMWD-improved-IA-PSO.
Table 6. The results of FMWD-PSO, FMWD-IA-PSO and FMWD-improved-IA-PSO.
AlgorithmIteration NumberThe Forecast Model Weight
SFM1SFM2SFM3SFM4SFM5
FMWD-PSO6230.06110.21200.38760.08610.2532
FMWD-IA-PSO4900.25980.16620.33430.03430.2054
FMWD-improved-IA-PSO1930.41050.04010.48950.00020.0597
Using the forecast model weights shown in Table 6, the results of weighted combination forecast (WCF) based on FMWD-PSO, FMWD-IA-PSO and FMWD-improved-IA-PSO, which are abbreviated as WCF-FMWD-PSO, WCF-FMWD-IA-PSO and WCF-FMWD-improved-IA-PSO, are shown in Table 7.
Table 7. The results of weighted combination forecast methods (108 kWh).
Table 7. The results of weighted combination forecast methods (108 kWh).
Weighted Combination Forecast199820012002200320042005
WCF-FMWD-PSO434.82558.22631.93720.71821.93923.03
WCF-FMWD-IA-PSO436.28556.97630.82721.19826.19936.41
WCF-FMWD-improved-IA-PSO437.05556.52630.98722.97831.81954.96
Four synthesized forecast methods reported by Kang et al. [55] are as follows:
(1) Equal weight method: the weights of forecast models are equal, so the combination forecast value of the forecast year c is as follows:
y c = 1 n f m d = 1 n f m y c , d
where c = nhis + 1, nhis + 2, …, nhis + nfo, d = 1, 2, …, nfm. It is a simple combination forecast method, and both the precision of single forecast model and the relationship between different forecast models are considered.
(2) Variance analysis method: the combination forecast value of the forecast year c is as follows:
y c = d = 1 n f m ω d y c , d
where c = nhis + 1, nhis + 2, …, nhis + nfo, d = 1, 2, …, nfm. All forecast models are independent of each other, so the variance of combination forecast can be expressed as follows:
V a r = d = 1 n f m ω d 2 δ d d
where c = nhis + 1, nhis + 2, …, nhis + nfo, d = 1, 2, …, nfm, δ d d is the variance of the forecast model FMd. To obtain the minimum value of Varon ω d , the Lagrange multiplier method is used and the weight of FMd is defined as follows:
ω d = 1 δ d d ( 1 δ 11 + 1 δ 22 + ... + 1 δ n f m n f m )
(3) Optimum fitting method
The forecast model weight determination of the optimum fitting method is based on the deviations of all single forecast models and the complementarily between different forecast models. The deviations of the forecast model FMd can be expressed as follows:
D e v d = 1 2 ( 1 n s h i s | e = 1 n s h i s ( y y e , d ) | + 1 n s h i s e = 1 n s h i s | y y e , d | )
Therefore, the weight of FMd is defined as follows:
ω d = max 1 d n f m D e v d + min 1 d n f m D e v d D e v d d = 1 n f m ( max 1 d n f m D e v d + min 1 d n f m D e v d D e v d )
(4) Optimum forecast method
In this method, nfm forecast models are used to carry out the forecasting respectively, and then these models are compared according to standard deviations, fitting goodness, correlation degree or relative error et al. Lastly the best one is chosen as the final forecast model.
Percentage error (PE) and mean absolute percentage error (MAPE) are used as evaluating indicators to compare the proposed method (WCF-FMWD-improved-IA-PSO) to WCF-FMWD-PSO, WCF-FMWD-IA-PSO, the single forecast models (SFM1SFM5) and the four synthesized forecast methods [55] (equal weight method, variance analysis method, optimum fitting method and optimum forecast method). They are shown in Table 8.
Table 8. The percentage error of the proposed method (WCF-FMWD-improved-IA-PSO) and others.
Table 8. The percentage error of the proposed method (WCF-FMWD-improved-IA-PSO) and others.
Forecast MethodMean Absolute Percentage Error (%)Percentage Error (%)
199820012002200320042005
SFM10.42−0.22−0.140.43−0.210.031.48
SFM21.120.021.702.100.251.17−1.51
SFM30.40−0.18−0.140.44−0.190.021.45
SFM46.31−7.094.564.770.02−6.91−14.51
SFM52.410−2.27−2.20−2.33−1.96−5.69
Equal weight method [55]0.980.041.561.100.80−1.02−1.33
Variance analysis method [55]1.250.02−1.27−0.20−2.21−1.16−2.63
Optimum fitting method [55]2.78−2.095.560.771.02−2.91−4.32
Optimum forecast method [55]1.16−0.592.110.970.561.211.50
WCF-FMWD-PSO0.93−0.690.120.49−0.62−1.33−2.36
WCF-FMWD-IA-PSO0.53−0.36−0.110.32−0.55−0.82−1.05
WCF-FMWD-improved-IA-PSO0.35−0.18−0.190.34−0.31−0.140.91
From the comparison of PE and MAPE of the proposed method (WCF-FMWD-improved-IA-PSO) and others shown in Figure 4 and Figure 5 and Table 8, we can see the follows:
  • The maximum and minimum PE of the single forecast models (SFM1SFM5) are −14.51% and 0 respectively, the maximum and minimum MAPE of the single forecast models (SFM1SFM5) are 6.31% and 0.40% respectively.
  • The maximum and minimum PE of the four synthesized forecast methods in Reference [55] are 5.56% and 0.02% respectively, the maximum and minimum MAPE of the four synthesized forecast methods in Reference [55] are 2.78% and 0.98% respectively.
  • The maximum and minimum PE of WCF-FMWD-PSO are −2.36% and 0.12% respectively, the MAPE of WCF-FMWD-PSO is 0.93% and the iteration number is 623.
  • The maximum and minimum PE of WCF-FMWD-IA-PSO are 1.05% and −0.11% respectively, the MAPE of WCF-FMWD-IA-PSO is 0.53% and the iteration number is 490.
  • The maximum and minimum PE of WCF-FMWD-improved-IA-PSO are 0.91% and −0.14% respectively, the MAPE of WCF-FMWD-improved-IA-PSO is 0.35% and the iteration number is 193.
Figure 4. The comparison of PE of the proposed method (WCF-FMWD-improved-IA-PSO) and others.
Figure 4. The comparison of PE of the proposed method (WCF-FMWD-improved-IA-PSO) and others.
Energies 09 00020 g004
Figure 5. The comparison of MAPE of the proposed method (WCF-FMWD-improved-IA-PSO) and others.
Figure 5. The comparison of MAPE of the proposed method (WCF-FMWD-improved-IA-PSO) and others.
Energies 09 00020 g005
Through the above analysis and the comparison shown in Figure 4 and Figure 5, the proposed method WCF-FMWD-improved-IA-PSO results in an improved accuracy overall (MAPE is 0.35, maximum and minimum PE are 0.91% and −0.14%) when compared against other described methods over the chosen time period and year characteristics. In addition, the proposed method WCF-FMWD-improved-IA-PSO has the fastest convergence rate in forecast model weight determination when compared against WCF-FMWD-PSO and WCF-FMWD-IA-PSO. Therefore, using the proposed method to forecast the medium-and-long-term load is better than using other methods. The correctness and feasibility of the proposed method are proven.

6. Conclusions

In this paper, we have proposed a robust weighted combination load forecasting method WCF-FMWD-improved-IA-PSO based on forecast model filtering and adaptive variable weight determination to forecast the annual electric load. The contribution and novelty are mainly as follows:
(1)
Due to the fact that the forecast year’s true load is unknown, the comprehensive validity degree of forecast model is defined by the integration of fitted value relative error and forecast value relative error, and then forecast models are filtered based on their comprehensive validity degrees.
  • The definition of validity degree can effectively overcome the inherent shortcomings of error theory. Entirely investigating the fitting level and the validity of forecast model, the comprehensive validity degree definition and the forecast model filtering method can improve the robustness of combination forecasting.
  • Revealing the transition pattern between the natural precision and validity degree, the forecast precision estimation method based on Markov chain and cloud model can provide an important basis for the subsequent weighted combination forecasting. In the forecast models’ filtering, the better ones will be selected and the worse ones will be eliminated. It can also improve the robustness of combination forecasting.
(2)
The improved IA-PSO is used to determine the forecast model weight in combination forecasting. Based on the uniting of immune system’s specific information processing mechanism and PSO’s global convergence ability, disturbance variable and particle searching speed’s adaptive adjustable strategy are introduced to improve the algorithm performance. The particles’ diversity is ensured while the convergence speed is increased. It can avoid the local optimal and improve the accuracy.
As can be seen from the case study, the maximum and minimum of percentage error by the proposed method WCF-FMWD-improved-IA-PSO are 0.91%, −0.14% and the mean absolute percentage error is 0.35%, which are smaller than those by the single forecast models (SFM1SFM5), the four synthesized forecast methods [55], WCF-FMWD-PSO and WCF-FMWD-IA-PSO. These indicate that the proposed method has significant superiority over other methods in the terms of annual electric load forecasting accuracy. The iteration number of FMWD-improved-IA-PSO in the proposed method (193) is far less than the iteration numbers of FMWD-PSO and FMWD-IA-PSO (623 and 490), so its advantage that the global optimal solution is reached faster than PSO and IA-PSO is confirmed. In conclusion, the proposed method WCF-FMWD-improved-IA-PSO has a higher robustness and better accuracy, and it can meet the requirements of the annual electric load forecast and can also be applied in the forecast of related fields. In the forecast with analogous features, the proposed method WCF-FMWD-improved-IA-PSO can also be applied.

Acknowledgments

This work is supported by the National Natural Science Foundation of China (Nos. 61162005 and 61163002), External-Planned Task of State Key Laboratory of Robotics and System, Harbin Institute of Technology (HIT) (No. SKLRS-2013-MS-05), Natural Science Foundation of Ningxia (No. NZ15103), Key Scientific Research Project of Beifang University of Nationalities (No. 2015KJ08), Science & Technology Research Project of Ningxia High School (No. NGY2015149), and Introduction Talent Starting Scientific Research Project of Beifang University of Nationalities for Lianhui Li and Shaohu Ding.

Author Contributions

Lianhui Li and Chunyang Mu are the principal investigators of this work. They proposed the robust weighted combination load forecasting method and wrote the manuscript. Shaohu Ding designed the case study solution and checked the whole manuscript. Zheng Wang processed the data in the case study. Runyang Mo and Yongfeng Song did the data analysis work.

Conflicts of Interest

The authors declare no conflict of interest.

Nomenclature

n
number
CH
year characteristic
CHQ
year characteristic quantity
HY
history year
FY
forecast year
CSI
Cosine similarity
FM
forecast model
y′
similar year load
y
forecast year load
y‴
forecast year load by a forecast model
RE′
fitted value relative error
RE
forecast value relative error
P′
fitted precision
P
forecast precision
FIV
fitted validity degree
FOV
forecast validity degree
S
sub-interval
OC
occurrence number
TRN
transition number
TP
transition probability
TM
state transition matrix
IV
initial vector
SM
state matrix
CVE
column vector
Ex
expectation
En
entropy
He
hyper-entropy
En′
normal random number
CV
comprehensive validity degree
SFM
selected forecast model
PO
particle swarm
PA
particle
x
position
v
speed
LF
learning factor
rand
random number
w
inertia coefficient
PRO
probability
AF
affinity
CON
concentration
iter
iteration
F
fitness function
PF
particle fitness
r
vaccination times
Var
variance
Dev
deviation
Greek letters
 
σ
standard deviation
α
empirical coefficient
χ
weight
β
disturbance variable
Δ
swarm convergence degree
η
coefficient used to control the upper limit
ω
forecast model’s weight
Superscripts
 
g
the gth sub-interval
h
the hth sub-interval
(h, g)
the transition from the hth to the gth
(k)
the kth iteration
(q)
step
q
the qth power
Subscripts
 
his
history year
fo
forecast year
shis
selected history year
fm
forecast model
sfm
selected forecast model
si
sub-interval
pa
particle
apa
added particle
avg
average
AVG
the average of the numbers which are bigger than the global average
a
the ath year characteristic
b
the bth year characteristic quantity
c
the cth forecast year
d
the dth forecast model
e
the eth similar year
i
the ith column vector
j
the jth selected forecast model
l
the lth particle
m
the dimensional number of solution space
t
the tth element

References

  1. Kouhi, S.; Keynia, F. A new cascade NN based method to short-term load forecast in deregulated electricity market. Energy Convers. Manag. 2013, 71, 76–83. [Google Scholar] [CrossRef]
  2. Chaturvedi, D.K.; Sinha, A.P.; Malik, O.P. Short term load forecast using fuzzy logic and wavelet transform integrated generalized neural network. Int. J. Electr. Power Energy Syst. 2015, 67, 230–237. [Google Scholar] [CrossRef]
  3. Yang, Z.C. Electric load evaluation and forecast based on the elliptic orbit algorithmic model. Int. J. Electr. Power Energy Syst. 2012, 42, 560–567. [Google Scholar]
  4. Bennett, C.; Stewart, R.A.; Lu, J. Autoregressive with exogenous variables and neural network short-term load forecast models for residential low voltage distribution networks. Energies 2014, 7, 2938–2960. [Google Scholar] [CrossRef]
  5. Li, F.; Buxiang, Z.; Shi, C. The Medium and Long Term Load Forecasting Combined Model Considering Weight Scale Method. TELKOMNIKA Indones. J. Electr. Eng. 2013, 11, 2181–2186. [Google Scholar] [CrossRef]
  6. Li, R.; Su, H.; Wang, Z. Medium-and long-term load forecasting based on heuristic least square support vector machine. Power Syst. Technol. 2011, 35, 195–199. (In Chinese) [Google Scholar]
  7. Mello, P.E.; Lu, N.; Makarov, Y. An optimized autoregressive forecast error generator for wind and load uncertainty study. Wind Energy 2011, 14, 967–976. [Google Scholar] [CrossRef]
  8. Li, H.; Guo, S.; Zhao, H. Annual electric load forecasting by a least squares support vector machine with a fruit fly optimization algorithm. Energies 2012, 5, 4430–4445. [Google Scholar] [CrossRef]
  9. Filik, Ü.B.; Gerek, Ö.N.; Kurban, M. A novel modeling approach for hourly forecasting of long-term electric energy demand. Energy Convers. Manag. 2011, 52, 199–211. [Google Scholar] [CrossRef]
  10. Suganthi, L.; Samuel, A.A. Energy models for demand forecasting—A review. Renew. Sustain. Energy Rev. 2012, 16, 1223–1240. [Google Scholar] [CrossRef]
  11. Sheikh, S.K.; Unde, M.G. Short Term Load Forecasting using ANN Technique. Int. J. Eng. Sci. Emerg. Technol. 2012, 1, 97–107. [Google Scholar]
  12. Wu, Y. The medium and long-term load forecasting based on improved D-S evidential theory. Trans. China Electrotech. Soc. 2012, 27, 157–162. (In Chinese) [Google Scholar]
  13. Long, R.; Mao, Y.; Mao, L. A combination model for medium-and long-term load forecasting based on induced ordered weighted averaging operator and Markov chain. Power Syst. Technol. 2010, 3, 150–156. (In Chinese) [Google Scholar]
  14. Chen, H.Y. The Validity Theory of Combination Forecasting Method and Its Application; Science Press: Beijing, China, 2008; pp. 76–109. (In Chinese) [Google Scholar]
  15. Chen, H.Y. Research on combination forecasting model based on effective measure of forecasting methods. Forecasting 2001, 20, 72–73. [Google Scholar]
  16. Sun, G.Q.; Yao, J.G.; Xie, Y.X. Combination forecast of medium-and-long-term load using fuzzy adaptive variable weight based on fresh degree function and forecasting availability. Power Syst. Technol. 2009, 33, 103–107. (In Chinese) [Google Scholar]
  17. Chen, C.; Guo, W.; Fan, J.Z. Combined method of mid-long term load forecast based on improved forecasting effectiveness. Relay 2007, 35, 70–74. (In Chinese) [Google Scholar]
  18. Jin, X.; Luo, D.S.; Sun, G.Q. Sifting and combination method of medium-and-long-term load forecasting model. Proc. Chin. Soc. Univ. Electr. Power Syst. Autom. 2012, 24, 150–156. (In Chinese) [Google Scholar]
  19. Hernandez, L.; Baladrón, C.; Aguiar, J.M. Short-term load forecasting for microgrids based on artificial neural networks. Energies 2013, 6, 1385–1408. [Google Scholar] [CrossRef]
  20. Gofman, A.V.; Vedernikov, A.S.; Vedernikova, E.S. Increasing the accuracy of the short-term and operational prediction of the load of a power system using an artificial neural network. Power Technol. Eng. 2013, 46, 410–415. [Google Scholar] [CrossRef]
  21. Li, H.; Guo, S.; Li, C. A hybrid annual power load forecasting model based on generalized regression neural network with fruit fly optimization algorithm. Knowledge-Based Syst. 2013, 37, 378–387. [Google Scholar] [CrossRef]
  22. Paparoditis, E.; Sapatinas, T. Short-term load forecasting: The similar shape functional time-series predictor. IEEE Trans. Power Syst. 2013, 28, 3818–3825. [Google Scholar] [CrossRef]
  23. Weron, R.; Taylor, J. Discussion on “Electrical load forecasting by exponential smoothing with covariates”. Appl. Stoch. Models Bus. Ind. 2013, 29, 648–651. [Google Scholar] [CrossRef]
  24. Li, D.C.; Chang, C.J.; Chen, C.C. Forecasting short-term electricity consumption using the adaptive grey-based approach—An Asian case. Omega 2012, 40, 767–773. [Google Scholar] [CrossRef]
  25. Ismail, N.A.; King, M. Factors influencing the alignment of accounting information systems in small and medium sized Malaysian manufacturing firms. J. Inf. Syst. Small Bus. 2014, 1, 1–20. [Google Scholar]
  26. Kodogiannis, V.S.; Amina, M.; Petrounias, I. A clustering-based fuzzy wavelet neural network model for short-term load forecasting. Int. J. Neural Syst. 2013, 23, 1557–1565. [Google Scholar] [CrossRef] [PubMed]
  27. Hong, T.; Wang, P. Fuzzy interaction regression for short term load forecasting. Fuzzy Optim. Decis. Mak. 2014, 13, 91–103. [Google Scholar] [CrossRef]
  28. Borges, C.E.; Penya, Y.K.; Fernandez, I. Evaluating combined load forecasting in large power systems and smart grids. IEEE Trans. Ind. Inf. 2013, 9, 1570–1577. [Google Scholar] [CrossRef]
  29. Che, J.; Wang, J.; Wang, G. An adaptive fuzzy combination model based on self-organizing map and support vector regression for electric load forecasting. Energy 2012, 37, 657–664. [Google Scholar] [CrossRef]
  30. Liu, Z.; Li, W.; Sun, W. A novel method of short-term load forecasting based on multiwavelet transform and multiple neural networks. Neural Comput. Appl. 2013, 22, 271–277. [Google Scholar] [CrossRef]
  31. Wang, J.; Wang, J.; Li, Y. Techniques of applying wavelet de-noising into a combined model for short-term load forecasting. Int. J. Electr. Power Energy Syst. 2014, 62, 816–824. [Google Scholar] [CrossRef]
  32. Enayatifar, R.; Sadaei, H.J.; Abdullah, A.H. Imperialist competitive algorithm combined with refined high-order weighted fuzzy time series (RHWFTS-ICA) for short term load forecasting. Energy Convers. Manag. 2013, 76, 1104–1116. [Google Scholar] [CrossRef]
  33. Ko, C.N.; Lee, C.M. Short-term load forecasting using SVR (support vector regression)-based radial basis function neural network with dual extended Kalman filter. Energy 2013, 49, 413–422. [Google Scholar] [CrossRef]
  34. Ma, S.; Chen, X.; Liao, Y.; Wang, G.; Ding, X.; Chen, K. The variable weight combination load forecasting based on grey model and semi-parametric regression model. In Proceedings of the TENCON 2013—2013 IEEE Region 10 Conference (31194), Xi’an, China, 22–25 October 2013; pp. 1–4.
  35. Niu, D.; Shi, H.; Wu, D.D. Short-term load forecasting using bayesian neural networks learned by Hybrid Monte Carlo algorithm. Appl. Soft Comput. 2012, 12, 1822–1827. [Google Scholar] [CrossRef]
  36. Zhu, J.P.; Dai, J. Optimization selection of correlative factors for long-term load prediction of electric power. Comput. Simul. 2008, 5, 226–229. [Google Scholar]
  37. Wang, Q.; Wang, Y.L.; Zhang, L.Z. An approach to allocate impersonal weights of factors influencing electric power demand forecasting. Power Syst. Technol. 2008, 32, 82–86. (In Chinese) [Google Scholar]
  38. Lei, S.L.; Gu, L.; Yang, J.; Liu, X.Y. Analysis of electric power load characteristics and its influencing factors in chongqing region. Electr. Power 2014, 12, 61–71. (In Chinese) [Google Scholar]
  39. Liao, F.; Congying, X.U.; Yao, J.; Cai, J.; Chen, S. Load characteristics of change region and analysis on its influencing factors. Power Syst. Technol. 2012, 7, 117–125. [Google Scholar]
  40. Karakoc, E.; Cherkasov, A.; Sahinalp, S.C. Distance based algorithms for small biomolecule classification and structural similarity search. Bioinformatics 2006, 14, 243–251. [Google Scholar] [CrossRef] [PubMed]
  41. Ye, J. Cosine similarity measures for intuitionistic fuzzy sets and their applications. Math. Comput. Model. 2011, 1, 91–97. [Google Scholar] [CrossRef]
  42. Zhu, S.; Wu, J.; Xiong, H.; Xia, G. Scaling up top-K cosine similarity search. Data Knowl. Eng. 2011, 1, 60–83. [Google Scholar] [CrossRef]
  43. Keilson, J. Markov Chain Models—Rarity and Exponentiality; Springer Science and Business Media: Berlin, Germany, 2012. [Google Scholar]
  44. Yoder, M.; Hering, A.S.; Navidi, W.C.; Larson, K. Short-term forecasting of categorical changes in wind power with Markov chain models. Wind Energy 2014, 17, 1425–1439. [Google Scholar] [CrossRef]
  45. Li, D.Y.; Liu, C.Y. Study on the universality of the normal cloud model. Eng. Sci. 2004, 6, 28–34. [Google Scholar]
  46. Wang, G.; Xu, C.; Li, D. Generic normal cloud model. Inf. Sci. 2014, 280, 1–15. [Google Scholar] [CrossRef]
  47. Kennedy, J. Particle swarm optimization. In Encyclopedia of Machine Learning; Springer Publishing Company: New York, NY, USA, 2010; pp. 760–766. [Google Scholar]
  48. Zhao, F.; Li, G.; Yang, C.; Abraham, A.; Liu, H. A human-computer cooperative particle swarm optimization based immune algorithm for layout design. Neurocomputing 2014, 132, 68–78. [Google Scholar] [CrossRef]
  49. Fu, X.; Li, A.; Wang, L.; Ji, C. Short-term scheduling of cascade reservoirs using an immune algorithm-based particle swarm optimization. Comput. Math. Appl. 2011, 6, 2463–2471. [Google Scholar] [CrossRef]
  50. Afshinmanesh, F.; Marandi, A.; Rahimi-Kian, A. A novel binary particle swarm optimization method using artificial immune system. In Proceedings of the International Conference on Computer as a Tool, EUROCON 2005, Belgrade, Serbia, 21–24 November 2005; Volume 1, pp. 217–220.
  51. Wu, J.M.; Zuo, H.F.; Chen, Y. A combined forecasting method based on particle swarm optimization with immunity algorithms. Syst. Eng. Theory Method Appl. 2006, 15, 229–233. (In Chinese) [Google Scholar]
  52. China National Bureau of Statistics. China Energy Statistical Yearbook; China Statistics Press: Beijing, China, 2011. (In Chinese)
  53. Deng, J.L. Introduction to grey system theory. J. Grey Syst. 1989, 1, 1–24. (In Chinese) [Google Scholar]
  54. Gardner, E.S. Exponential smoothing: The state of the art. J. Forecast. 1985, 1, 1–28. [Google Scholar] [CrossRef]
  55. Kang, C.Q.; Xia, Q.; Liu, M. Power System Load Forecasting; China Electric Power Press: Beijing, China, 2007; pp. 73–75. (In Chinese) [Google Scholar]

Share and Cite

MDPI and ACS Style

Li, L.; Mu, C.; Ding, S.; Wang, Z.; Mo, R.; Song, Y. A Robust Weighted Combination Forecasting Method Based on Forecast Model Filtering and Adaptive Variable Weight Determination. Energies 2016, 9, 20. https://doi.org/10.3390/en9010020

AMA Style

Li L, Mu C, Ding S, Wang Z, Mo R, Song Y. A Robust Weighted Combination Forecasting Method Based on Forecast Model Filtering and Adaptive Variable Weight Determination. Energies. 2016; 9(1):20. https://doi.org/10.3390/en9010020

Chicago/Turabian Style

Li, Lianhui, Chunyang Mu, Shaohu Ding, Zheng Wang, Runyang Mo, and Yongfeng Song. 2016. "A Robust Weighted Combination Forecasting Method Based on Forecast Model Filtering and Adaptive Variable Weight Determination" Energies 9, no. 1: 20. https://doi.org/10.3390/en9010020

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop