Next Article in Journal
Visitors’ Attitudes towards Bicycle Use in the Teide National Park
Previous Article in Journal
Future Projected Changes in Local Evapotranspiration Coupled with Temperature and Precipitation Variation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Monthly Load Forecasting Based on Economic Data by Decomposition Integration Theory

1
Economics and Management School, North China Electric Power University, Changping District, Beijing 102206, China
2
Beijing Key Laboratory of New Energy and Low-Carbon Development (North China Electric Power University), Changping District, Beijing 102206, China
3
Institute of Smart Energy, North China Electric Power University, Changping District, Beijing 102206, China
*
Author to whom correspondence should be addressed.
Sustainability 2018, 10(9), 3282; https://doi.org/10.3390/su10093282
Submission received: 3 September 2018 / Revised: 8 September 2018 / Accepted: 10 September 2018 / Published: 14 September 2018
(This article belongs to the Section Energy Sustainability)

Abstract

:
Accurate load forecasting can help alleviate the impact of renewable-energy access to the network, facilitate the power plants to arrange unit maintenance and encourage the power broker companies to develop a reasonable quotation plan. However, the traditional prediction methods are insufficient for the analysis of load sequence fluctuations. The economic variables are not introduced into the input variable selection and the redundant information interferes with the final prediction results. In this paper, a set of the ensemble empirical mode is used to decompose the electricity consumption sequence. Appropriate economic variables are as selected as model input for each decomposition sequence to model separately according to its characteristics. Then the models are constructed by selecting the optimal parameters in the random forest. Finally, the result of the component prediction is reconstituted. Compared with random forest, support vector machine and seasonal naïve method, the example results show that the prediction accuracy of the model is better than that of the contrast models. The validity and feasibility of the method in the monthly load forecasting is verified.

1. Introduction

Power load forecasting plays a key role in the power system operation and electricity market activities. The forecasting is still more important because it is the base of schedule of power system operation [1,2]. Electricity comes from traditional coal-fired power generation, wind power generation [3,4], solar power generation [5,6], biomass power generation [7] and tidal power generation [8]. These directions also require accurate load forecasting. Many activities within the power system such as the maintenance scheduling of generators, renewable-energy integration and even the investment of power plants and power grids depend on the monthly load forecasting. In the electricity market the regulators monitor the activities based upon the forecasting load and power generators [9]. Customers and power brokers decide their action strategies.
Power load forecasting has been studied for decades [10]. Various models, novel algorithms, advanced techniques and ingenious tricks have been developed to improve forecasting accuracy [11,12]. It was based on SVR and fuzzy-rough with PSO algorithms to forecast the residential sector’s electricity demand. And the method can identify relevant variables for developing the forecasting model [13]. It discussed the effects of various models in energy planning and forecasting, with emphasis on the application of ANN in the field of load forecasting [14,15]. It introduced different power load forecasting models and combined regression model with machine learning model to forecast the power load of commercial buildings [16].
The time-series models present good performance with historical data. Taking exogenous variables into account, the econometric models have better forecasting capability [16]. The model combined dynamic model and fuzzy time series. When considering the weather factors, the prediction accuracy of the model was better than that of the traditional model [17]. It based on a hybrid model of autoregressive integrated moving average (ARIMA) and support vector machines (SVMs). The ARIMA was used to predict the linear fundamental part of the load and the SVM predicted the nonlinearly sensitive part of the load [18]. It considered many long memories seasonal data sets and combined Fuzzy Time Series and SARFIMA for short-term load forecasting [19].
Artificial Neural Networks, Support Vector Machine and other intelligence algorithms dominate in the forecasting field for their powerful approximation between the dependent variable and independent variables [20]. It used wavelet analysis to decompose historical load data and then used support vector machine and neural network to select appropriate parameters [21,22]. The single branch predictions for each sequence are separately made and each branch prediction results are reconstructed to achieve ultimate load forecasting [23]. It proposed the goa-svm model, which predicts short-term load for local climate conditions. The results show that the prediction accuracy was better than that of the traditional model and other mixed models [24]. A deep belief network model was incorporated into a feed-forward neural network. And the model was applied to short-term electricity load forecasting [25]. An improved empirical mode decomposition (EMD) model combined with Elman neural network was proposed to predict the building electricity load. Effectiveness of the proposed model was carried out to real-world engineering test case in comparison with other prediction models [26]. Neural network was combined with optimization techniques for finding optimal network parameters to reduce the forecasting error. Further, the proposed algorithm was integrated with neural network for the proper tuning of network parameters to solve the real-world problem of short term load forecasting [27]. It proposed an improved grey model to enhance the disadvantage of the general GM (1,1) model when the load mutation was large. A practical application verifies that, compared with the existing grey forecasting models. The proposed model is a stable and feasible forecasting model with a higher forecasting accuracy [28].
However, these functions require massive training samples and the performances are sensitive to model parameters and input. Until now, more efforts have been put to tune the model parameters and to select proper input. Wavelet Decomposition [29,30,31], Empirical Mode Decomposition(EMD) [32,33], Complete Ensemble Empirical Mode Decompositions with adaptive noise(CEEMDAN) [34] broke down the original sequence into several more regular sequences for modeling. The effectiveness of such techniques has been validated.
Random Forest (RF) is a novel machine learning algorithm developed in recent years [35,36,37,38]. RF is robust to the input and number of samples. The great advantage of RF has been validated in many forecasting contests. Someone proposed a short-term load predictor, able to forecast the next 24 h of load using RF [39].
The above traditional methods consider less economic data but in actual life, economic factors have a great impact on the load. The reason for the small consideration is that there are fewer economic data samples, more variables. There is redundancy between variables and the correlation between variables and load is low. So, the traditional method cannot solve this problem and random forest can solve the problem. At the same time, the law of medium and long-term load fluctuation is complicated and there may be multiple regular superpositions using EEMD decomposition. This paper uses random forests to overcome these problems for accurate monthly load forecasting.
The other sections of this paper are organized as follows: the basic principles and modeling process of EEMD and RF algorithms were introduced in the second section, then an experimental study of monthly electricity consumption forecasting was carried to validate the proposed method and the discussion and conclusion were made at the end section.

2. Methods

In the field of signal analysis, EMD is widely used in various engineering fields. Because it has obvious advantages in dealing with non-stationary and nonlinear data compared with other algorithms. Meanwhile, it has a high signal-to-noise ratio to guarantee data availability. In contrast to EMD, EEMD incorporates normally distributed white noise to aid analysis, which makes the signal continuous at different scales, thereby reducing the degree of modal aliasing. The introduction of the noise in load forecasting is mainly to resist the damage of bad data on the accuracy of prediction results and improve the robustness of the model.
The subsequent RF algorithm is an algorithm based on a combined decision tree. Because of its insensitivity to default on problems and its high tolerance to noise or outliers, it is widely used on the field of classification and regression. This paper uses RF algorithm to make use of its many advantages in the good adaptability of multiple data sets, excellent fitting ability and insensitivity to irrelevant variables.

2.1. EEMD Fundamental Principle

In the signal-processing process of EMD, if there is an uneven distribution of signal extreme points, modal aliasing problems will occur. In response to this problem, Huang proposed adding uniformly distributed white noise to the decomposed signal. The noise with a mean value of zero will be canceled out as the result after several times of average elimination on signals of different time scales. Specific steps are as follows:
(1)
Add a random uniform Gaussian white noise sequence H ( t ) to the original sequence Y ( t ) to get the new sequence Y 0 ( t ) ;
Y 0 ( t ) = Y ( t ) + H ( t )
(2)
Decompose the noise-added sequence Y 0 ( t ) into I M F i ( t ) and a residual series R n ( t ) using EMD;
Y 0 ( t ) = i = 1 n I M F i ( t ) + R n ( t ) ,   ( i = 1 , 2 , , n )
(3)
Repeat steps (1) and (2) until a smooth decomposition signal is obtained;
(4)
Calculate the average value of each decomposed I M F i ( t ) component as the result. N is the number of Gaussian white noise added
I M F i ( t ) = 1 N I M F i j ( t )
Y ( t ) = j = 1 n I M F i ( t ) + R ( t )
The result of EEMD is shown as Formula (4). After increasing the uniform distribution of white noise, the occurrence of modal aliasing can be well improved and the degree of coincidence of each Y_IMF component to the overall trend and the fluctuation trend can be improved.

2.2. RF Fundamental Principle

RF is a collection method that aggregates many decision tree predictions and there is no correlation between each decision tree. The representation of RF is mainly reflected in the random sampling of features when the specimen is put back into a random number of the samples (bootstraps) and a decision tree is constructed. The introduction of this randomness is very helpful to the performance improvement of RF. Because of it, RF is not easy to fall into over-fitting and has good noise immunity (e.g., Insensitive to default). The specific modeling steps are as follows:
(1)
Assume that the number of original data samples is N and the number of decision trees in RF is k. k decision trees are extracted from the N by resampling and the number of training specimens in each decision tree is n.
(2)
Assume that the feature dimension of the input variable is M and any feature set whose number is m (m < M and m remains unchanged) in M. Through these m features, the optimal splitting node is determined.
(3)
RF consists of the k decision trees that grow as much as possible and do not require pruning.
(4)
In the regression algorithm, the result of each decision tree is weighted and averaged to obtain the result.

2.3. Diebold Mariano Test

Diebold and Marino proposed the DM test to determine whether a model’s predictive power is significantly different from another model [40]. Specific assumptions are as follows:
H 0 : E [ F ( e t 1 ) ] = E [ F ( e t 2 ) ]
H 1 : E [ F ( e t 1 ) ] E [ F ( e t 2 ) ]
Formulas (5) and (6) are the null hypothesis and alternative hypothesis of the DM test, respectively e t 1 and e t 2 are the prediction errors between actual values and forecasted values of the different models and the function F is the loss function of forecasting errors.
d ¯ = 1 L t = 1 L [ F ( e t 1 ) F ( e t 2 ) ]
In Formula (7) d ¯ is the sample mean loss differential difference and L is the length of forecasting values.
D M = d ¯ 2 π f d ^ ( 0 ) L N ( 0 , 1 )
From Formula (8) we can see the DM value converges to the normal distribution. f d ^ ( 0 ) is the zero-spectral density and 2 π f d ^ ( 0 ) is a consistent estimate of the asymptotic variance of T d ¯ . So, after calculating DM value | D M | , we draw a conclusion by comparing | D M | with | Z α / 2 | from the standard normal distribution table. If | D M | is less than | Z α / 2 | , we can accept the null hypothesis and consider the difference between the predictive powers of the two models to be inconspicuous. For example, if | D M | ≤ 1.96, we accept the null hypothesis. Otherwise, | D M | > 1.96, then the null hypothesis is rejected at the 5% level.

2.4. Modeling Process

The specific modeling process is shown in Figure 1. The process is divided into three parts. First, the primary, secondary, tertiary industry and residential electricity original sequences are respectively decomposed into six components of Y_IMF1~Y_IMF5 and Y_R by EEMD. Then combine them into three components of high, medium and low frequency separately excepting Y_IMF1. Because the fluctuation frequency of Y_IMF1 is too large to be suitable for modeling. Secondly, the combined sequences are used for correlation analysis with economic and weather variables. The factors with higher correlation are selected as the input of different frequency models. Thirdly, the models are established by using the selected input and relative target value. Afterwards, the predicted values of different frequency sequences of different industries can be obtained through the models. The monthly load of the industry can be obtained by adding different frequency sequences of the same industry.
These industrial loads add up to the total social load.

3. Empirical Research

3.1. Data Description and Data Processing

This article adopts the China’s electricity consumption data from July 2009 to November 2017 from the National Bureau of Statistics of China and some power company. The electricity consumptions of primary, secondary, tertiary industries and resident are shown in Figure 2. Table A1 shows the input variables of economic and weather factors for modeling.
To avoid the influence of seasonal factors and other cyclical factors in the forecast results, the growth rate forecast method is adopted to handle four original power load sequences and the growth rate is to compare the data of the previous year.
In the prediction sequence, there is a lack of power consumption data for December 2009~2012. There are two main methods for missing data imputation. One is to use the 7~11 month-on-year growth average to reverse the missing data from 2013. The other is to use the average of the electricity consumption in December in the second half of the year to calculate the missing data. The specific method is selected based on the estimated errors in 2013~2017.

3.2. Data Set Division and Experimental Evaluation Index

Data sets are divided into training sets, validation sets and test sets according to different time spans. The goal of this article is to make accurate monthly load forecasts for the next six months, so the data from June to November of 2017 will be set as the final test set. The data of six months before November 2016 are randomly selected as the validation set for adjusting experimental parameters and verifying the effectiveness of the algorithm. The rests are training set for training models.
Based on the monthly national load demand forecast, this paper selects the mean absolute error (MAE), mean absolute percentage error (MAPE) and root mean square error (RMSE) as evaluation indicators. The expression is:
MAE = 1 n i = 1 n | T i P i |
RMSE = 1 n i = 1 n ( T i P i ) 2
MAPE = 1 n i = 1 n | T i P i T i |
Among them, T i is a real value. P i is a prediction value and n is the number of selected prediction points. If the obtained MAPE is tinier, there is smaller difference between predicted value and actual load value. It shows that the prediction is more accurate.

3.3. Analysis of Specific Modeling Process

3.3.1. EEMD Factorization Variable

Use EEMD to decompose the primary, secondary, tertiary industrial electricity and residential electricity consumption sequences that need to be predicted. Since July 2009 to November 2017, there are 101 sets of data. In this paper, the electricity consumption is predicted two quarters ahead of time and the influential factors in the third quarter before the forecast periods are selected as the initial input variables. Because of the growth forecast, the data for the initial 12 months will be used as a basis. So, the N that needs to be decomposed is 80 (101 − 6 − 3 – 12 = 80). The number of components after EEMD can be obtained by the Formula (8), where the fix is rounded to 0. The number of solution scores is 5. The decomposition results are shown in Figure 3.
m = fix(log2(N)) − 1
In the process of EEMD decomposing the four original sequences into five components (Y_IMF1~Y_IMF5) and one residual component Y_R respectively, the standard deviation of added Gaussian white noise (Nstd) is 0.2 and the number of noise added is 100. From the decomposition of EEMD in Figure 3, it is unfavorable for the RF modeling prediction in the later period that the frequency of some component oscillations is very fast, so the high-frequency Y_IMF1 is discarded. For the latter low-frequency sequences, the components are superposed and combined in order to avoid that the single decomposition sequence has too great influence on the prediction accuracy. The combination method is that Y_IMF2, Y_IMF3 and Y_IMF4 make up high-frequency data. Y_IMF5 is regarded as a medium frequency sequence (My) and remainder Y_R is regarded as a low-frequency sequence (Ly).

3.3.2. Random Forest Modeling

Since the lead time of this paper is set at six months, the input of random forest is the external economic and weather index from seventh to the ninth before the predicted month. The total number of one month’s relevant indices that can be found is 303. If you enter all the three-month indices into the model, it will undoubtedly bring infinite challenges to the complexity and accuracy of the model. To simplify the model and improve the accuracy of the model, this paper introduces the Kendall correlation coefficient to filter the input variables of the corresponding different frequency sequences of varied industry. By controlling the size of the correlation and the Kendall coefficient return value p, the final size of each model input variable is controlled to be between 15 and 40. Although random forests are highly inclusive for redundant data, proper screening of variables can also improve model prediction accuracy. Meanwhile, random forest is the tree regression model that does not require normalization of selected input variables. However, when the comparison models are established, the input variables must be normalized to remove the dimension of the variables.
In the RF modeling of high, medium and low frequency sequences, the number of random forests is the key to the effect on the model. Using MAPE as a criterion, RF modeling was performed on 50 to 1500 trees in the training set. In the case of different numbers of trees, the verification set MAPE behaves as shown in Figure 4.
From Figure 4, we can see that in the random forest modeling process of the first industry, the different frequency models achieve the optimal at 250 (5 × 50), 50 (1 × 50) and 50 (1 × 50) trees respectively. In the verification, the MAPE of the model is the smallest showing excellent adaptability. The test set modeling is then modeled using its optimal number. Similarly, it can be concluded that the best number of high, medium and low frequency data for the secondary and tertiary industries and residential electricity consumption are respectively 50 (1 × 50), 550 (11 × 50), 100 (2 × 50), 550 (11 × 50), 400 (8 × 50), 50 (1 × 50), 900 (18 × 50), 400 (8 × 50), 350 (7 × 50).

3.3.3. Cross Validation and Contrast Model Establishment

The models are tested multiple times with the set validation set before final testing. The validation set is randomly selected from data other than the test set. Table A2 shows six verification set errors.
This paper selects SVM and seasonal naïve method that has been approved by many experts and scholars in recent years for comparative analysis and compares the effect of adding EEMD on the accuracy of the model. Input variable selection methods of RF and SVM are the same as before. Among them, the SVM needs to be normalized to the input before modeling. Seasonal naïve method only needs to be used for preliminary finishing of the original four power sequences.

4. Results Analysis

The model prediction results are divided into three parts. One is the forecast of electricity consumption by sub-industries. The second part is the forecast of the total social electricity consumption superimposed on the electricity consumption of sub-industries and the last part is the completion of the whole society electricity forecast for the simple use of combination forecasts. At the same time, because there is no essential difference between RMSE and MAE in a single month calculation, only MAE is marked in the table and RMSE is used as a reference when comparing results. The primary, secondary and tertiary industries and residential forecast results are shown in Table 1.
The sub-industry fit and prediction curve is shown in Figure 5. The left side of the vertical line is the fitted curve and the right side is the predicted curve. It can be seen from the fitting curve that seasonal naïve method has a poor fitting effect, because of reflecting volatility earlier or later.
After comparing the four methods of MAE, MAPE and RMSE, it was found that the EEMD-RF model about primary industry had the highest degree of agreement with MAE of 298.96 GWh, MAPE of 2.90% and RMSE of 340.35 GWh. The SVM and RF that were not processed by EEMD were the next with MAE of 451.16 and 509.14 GWh, MAPE of 3.47% and 4.01% and RMSEs of 580.89 and 606.84 GWh. The poor performance of seasonal naïve method in the primary industry forecast is since the algorithm does not reflect the external changes in time. When the EEMD is not used for decomposition the RF cannot recognize the increase in the error caused by the external weather-related load.
The secondary industry fit and prediction curve is shown in Figure 5b. After the three evaluation criteria, the same EEMD-RF model was found to have the highest degree of agreement with MAE of 8255.49 GWh, MAPE of 2.14% and RMSE of 8752.63 GWh. The performance of SVM and seasonal naïve method followed with MAE of 11,380.77 and 17,887.41 GWh, MAPE of 2.91% and 4.55% and RMSE of 12,127.78 and 20,437.14 GWh. The RF model has the same problems as the primary industry forecast. Under the premise of setting a long lead time, the useful information cannot be distinguished well and the accuracy of the model is reduced. In the secondary industry forecast, EEMD optimizes its input variables for RF where the impact of noise is less than the benefits of the EEMD decomposition variables, so the results increase slightly.
The tertiary industry fitting and prediction curve is shown in Figure 5c. After comparing, the RF model was found to be in leading state and the MAE was 607.10 GWh, the MAPE was 0.82% and the RMSE was 774.37 GWh. The RF treated with EEMD and SVM performed second with MAE of 908.27 and 1848.28 GWh, MAPE of 1.22% and 2.37% and RMSE of 1090.27 and 8405.35 GWh. The good fit into the RF model is related to the third industry load characteristics. The tertiary industry is mainly the service industry. The load of such industries is relatively stable and there is no obvious fluctuation in demand. In the parameter, selection is not taken into account but a large-scale input like other industries into the model. This makes the RF insensitivity to the variables fully utilized and achieves results higher than the EEMD decomposition model.
The residential electricity fitting and prediction curve is shown in Figure 5d. Residual electricity forecast results can be found from Table 1. The forecasting effect was consistent with that of the secondary industry. The EEMD-RF model had the highest degree of agreement with MAE of 2212.13 GWh, MAPE of 2.33% and RMSE of 3630.70 GWh. The SVM model and RF followed with MAE of 2607.08 and 3764.43 GWh, MAPE of 3.01% and 4.14% and RMSE of 3312.32 and 5401.80 GWh, respectively. The seasonal naïve method performed poorest.
For each method, the six-months electricity consumption forecast of the whole society is shown in Table 2. We can see that the best method is the EEMD-RF method with MAPE of 1.34% and MAE of 7447 GWh. Followed by SVM and RF, EEMD-RF accuracy stands out. The seasonal naïve method provides a comparison index, which proves that other algorithms have certain feasibility.
The two methods EEMD-RF and SVM with the best average effect on the verification set are used to perform combined forecasting. The two methods are simply averaged to obtain the final forecast result, as shown in Table 3. From the combined forecasting results, we can see that the last forecasting accuracy is about 10% higher than that of the single model, with the MAE raised to 6828 GWh and the MAPE raised to 1.24%. However, this completely depends on the average fitness of the selected model. If there is a poor fit of a single model, it can easily affect the effect of its combined forecast. This is why we have to give up the direct modeling of SVM and RF.

5. Discussion

DM test is used to verify the validity of the model being developed. All other models were compared to the EEMD-RF model. According to the DM test principle proposed above, the null hypothesis is that the prediction abilities of the two models are similar and the other hypothesis is that there are significant differences in the prediction performance of the two models. Table 4 shows us the DM value about EEMD-RF and other models with MAE and MAPE.
Except for residential electricity DM value less than 1.960, other values are more than 1.960. It indicates that the EEMD-RF model is different from the other models at a 5% significance level in sub-industry load forecasting. Thus, the null hypothesis could be rejected at a 5% significance level. The DM value of residential electricity is less than 1.960 and greater than 1.645, indicating EEMD-RF model is different from RF and SVM at a 10% significance level. Thereby, the null hypothesis could be rejected at a 10% significance level. Therefore, the proposed EEMD-RF model significantly outperforms the other models.

6. Conclusions

In this paper, the input of the model, the optimization of the model parameters and combinatorial prediction are used to forecast the electricity load of the whole population in China. On the model input, selecting the economic data from the National Bureau of Statistics of China as the model input increases the accuracy of the model by about 15% over the pure use of power data and other data. At the same time, EEMD is used to decompose the prediction sequence, analyze the original sequence fluctuation trend and improve the correlation between the predictor and the input variable. In the optimization of model parameters due to fewer random variables in the RF, similar enumeration method is used to complete model optimization in a certain range of values. The idea of aggregation is respectively embodied in the model after EEMD decomposition, the prediction results are added together and the accuracy of the model is improved through the combination of prediction methods.
By applying EEMD and RF to the national monthly electricity usage data, it was found that the improved model achieved better accuracy than the traditional SVM and the error was reduced by 10~25% compared to the single random forest. It reflects the dynamic characteristics of the original sequence and verifies the effectiveness of the method in monthly load forecasting. At the same time, the advantage of using RF for variable insensitivity is reduced and the prediction error caused by unstable noise to the model is reduced, the model generalization ability is improved and it can be applied to different prediction fields. Because there are steps in the EEMD of the later reconstruction pre-diction results when each component is modeled in a RF merely the effects of a unitary model is considered. The parameters that make the single model topgallant are sought and the global optimum results cannot be obtained, only getting the sub-optimal results. Therefore, we can continue to carry out the next stage research on how to achieve the optimal parameters of combined forecasting.

Author Contributions

Conceptualization, D.L.; Data curation, K.S.; Formal analysis, K.S.; Funding acquisition, D.L.; Software, K.S.; Writing—original draft, K.S.; Writing—review & editing, H.H. and P.T.

Funding

This research was funded by the National Natural Science Foundation of China (NSFC) (51641701), the 111 Project (B18021) and the Fundamental Research Funds for the Central Universities (2017MS080).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Number and meaning of input variables.
Table A1. Number and meaning of input variables.
NumberMeaningNumberMeaning
001average temperature153Export value of metal products export growth (%)
002Average maximum temperature154Export value of general equipment manufacturing industry increased by year (%)
003Mean minimum temperature155Export value of special equipment manufacturing industry increased by year (%)
004Absolute maximum temperature156Export value of electrical machinery and equipment manufacturing industry increased by year (%)
005Absolute minimum temperature157Export delivery value of computers, telecommunications and other electronic equipment manufacturing industry increased by year (%)
006Precipitation158Export value of instrument manufacturing industry increased by year (%)
007The number of days of precipitation equal to or greater than 1 mm159Exports of other manufacturing exports increased year-on-year (%)
008The number of days of precipitation equal to or greater than 0.1 mm160Export delivery value of waste comprehensive utilization industry increased by year (%)
009The number of snow days161The delivery value of electricity, heat production and supply industry increased by year (%)
010Storm days162The delivery value of gas production and supply industry increased by year (%)
011Foggy days163The delivery value of water production and supply industry increased by year (%)
012Frost days164The value of cargo throughput at the port scale above sea level (ten thousand tons)
013Maximum temperature (reliability index)165The current value of the throughput of foreign trade goods (ten thousand tons)
014Minimum temperature (reliability index)166Value of freight volume (ten thousand tons)
015Precipitation (reliability index)167The current value of railway freight volume (ten thousand tons)
016Industrial producer purchasing price index (same month last year = 100)168The current value of highway freight volume (ten thousand tons)
017Fuel and power purchase price index (same month last year = 100)169Current value of water transport freight volume (ten thousand tons)
018Black metal material purchase price index (same month last year = 100)170Current value of civil aviation freight volume (ten thousand tons)
019Purchase price index of nonferrous metals and wires (same month last year = 100)171Turnover of goods turnover (100 million tons)
020Purchase price index of chemical raw materials (same month last year = 100)172The current value of railway freight turnover (hundreds of millions of tons)
021Wood and pulp purchase price index (same month last year = 100)173The current value of highway cargo turnover (100 million tons)
022Purchase price index of building materials and nonmetallic minerals (same month last year = 100)174The current value of the turnover of waterborne goods (hundreds of millions of tons)
023Purchase price index of other industrial raw materials and semi-finished products (same month last year = 100)175Civil aviation cargo turnover current value (100 million tons)
024Purchase price index of agricultural and sideline products (same month last year = 100)176Value of passenger traffic volume (ten thousand people)
025Purchase price index of textile raw materials (same month last year = 100)177The value of railway passenger traffic volume (ten thousand people)
026Producer price index for industrial producers (same month last year = 100)178Traffic volume of highway passenger volume (ten thousand people)
027Producer price index of producer goods (same month last year = 100)179Water transport passenger traffic volume (10,000 people)
028Producer price index of producer of living goods (same month last year = 100)180Passenger volume of civil aviation passenger volume (10,000 people)
029Producer price index of producer goods (same month last year = 100)181Turnover of passenger turnover (100 million kilometers)
030Producer price index of producer goods for mining industry (same month last year = 100)182Railway passenger turnover in the current period (100 million kilometers)
031Producer price index of producer goods for raw material industry (same month last year = 100)183Highway passenger turnover in the current period (100 million kilometers)
032Processing industry producer goods producer price index (same month last year = 100)184Water transport passenger turnover rate (100 million kilometers)
033Producer price index of producer of living goods (same month last year = 100)185Civil aviation passenger turnover in the current period (100 million kilometers)
034Producer price index for food industry producers (same month last year = 100)186Cumulative value of total retail sales of consumer goods ($100 million)
035Clothing factory producer price index (same month last year = 100)187Retail value of clothing, shoes, hats, needles and textile commodities: cumulative value (100 million yuan)
036Producer price index for general consumer goods manufacturers (same month last year = 100)188Cumulative value of retail sales of clothing commodities (100 million yuan)
037Durable consumer goods producer price index (same month last year = 100)189Cumulative value of retail sales of cosmetics commodities (100 million yuan)
038Producer price index for metallurgical industry (same month last year = 100)190Cumulative value of retail sales of household appliances and audio-visual equipment (100 million yuan)
039Producer price index for power producers (same month last year = 100)191Cumulative value of retail sales of furniture commodities (100 million yuan)
040Producer price index of coal and coking industry producers (same month last year = 100)192Cumulative value of retail sales of building and decoration materials (100 million yuan)
041Producer price index for oil producers (same month last year = 100)193Value of gold, silver and jewelry retail sales value (100 million yuan)
042Producer price index of chemical industry (same month last year = 100)194Cumulative value of retail sales of grain, oil, food, beverages, cigarettes and alcoholic drinks (100 million yuan)
043Producer price index for machinery industry (same month last year = 100)195Cumulative value of grain, oil and food retail sales (100 million yuan)
044Producer price index for producer of building materials industry (same month last year = 100)196Cumulative value of retail sales of beverage commodities ($100 million)
045Producer price index of forest industry producer (same month last year = 100)197Cumulative value of retail value of tobacco and wine commodities (billion yuan)
046Producer price index of food industry (same month last year = 100)198Cumulative value of retail sales of other commodities ($100 million)
047Producer price index of textile industry (same month last year = 100)199Cumulative value of retail sales of automobile commodities (100 million yuan)
048Producer price index for sewing industrial producers (same month last year = 100)200Cumulative value of retail value of daily use commodities (billion yuan)
049Producer price index of leather industry producer (same month last year = 100)201Cumulative value of retail sales of petroleum and products (100 million yuan)
050Producer price index for producer of paper industry (=100 of last year)202Cumulative value of retail sales of books, magazines and magazines (100 million yuan)
051Producer price index of industrial and cultural arts and Crafts Industrial producer (same month last year = 100)203Cumulative value of retail sales of sports and entertainment commodities (100 million yuan)
052Producer price index for other industrial producers (same month last year = 100)204Cumulative value of retail sales of communications equipment (100 million yuan)
053Producer price index for industrial producers (same month last year = 100)205Cultural and office goods retail value (100 million yuan)
054Industrial producer price index for coal mining and washing industry (same month last year = 100)206Cumulative value of retail sales of Chinese and Western medicines (100 million yuan)
055Producer price index for industrial producer of oil and natural gas (=100 of last year)207Cumulative value of real estate construction area (10,000 square meters)
056Black metal ore mining industry producer price index (same month last year = 100)208Cumulative area of newly built real estate (10,000 square meters)
057Producer price index of industrial producer of nonferrous metal mining industry (same month last year = 100)209Cumulative value of completed area of real estate (10,000 square meters)
058Producer price index of industrial producer of non-metallic mineral industry (=100 of last year)210Accumulative value of construction area of office building (ten thousand square meters)
059Ex-factory price index for mining and auxiliary activities (=100 of last year)211The total area of newly built office buildings is accumulated (10,000 square meters)
060Producer price index of industrial producers in agricultural and sideline products processing industry (same month last year = 100)212Accumulated value of office building area (10,000 square meters)
061Producer price index of industrial producers in food industry (=100 of last year)213Accumulative value of construction area of commodity residence (ten thousand square meters)
062Ex-factory price index for industrial producers of wine, beverages and refined tea (same month last year = 100)214Cumulative area of newly built commercial housing area (ten thousand square meters)
063Producer price index for industrial products in tobacco industry (same month last year = 100)215Accumulated value of the completed area of a commodity house (ten thousand square meters)
064Producer price index of textile industrial producers (same month last year = 100)216Cumulative area of commercial business space (10,000 square meters)
065Producer price index of industrial producer of textile, clothing and apparel industry (same month last year = 100)217The accumulative value of construction area for commercial business premises (10,000 square meters)
066Ex-factory price index for leather, fur, feather and its products and footwear manufacturers (same month last year = 100)218Cumulative area of commercial business space (10,000 square meters)
067Ex-factory price index for timber processing and industrial producer of wood, bamboo, rattan, brown and straw products (same month last year = 100)219Cumulative value of sales area of office buildings (10,000 square meters)
068Furniture producer industrial producer price index (same month last year = 100)220The total sales area of office buildings (10,000 square meters)
069Producer price index for industrial producers of paper and paper products (same month last year = 100)221The total sales area of office buildings will be accumulated (10,000 square meters)
070Printing and recording media reproduction industry producer price index (same month last year = 100)222Cumulative value of commodity housing sales area (10,000 square meters)
071Producer price index for industrial producers in culture, education, industry, sports and entertainment products (same month last year = 100)223The total sales area of commercial housing is accumulated (10,000 square meters)
072Petroleum, coal and other fuel processing industry producer price index (same month last year = 100)224The total sales area of commercial housing will be accumulated (10,000 square meters)
073Producer price index for industrial producers of chemical raw materials and chemical products (same month last year = 100)225Accumulative value of sale area of commodity house (ten thousand square meters)
074Factory producer price index for pharmaceutical manufacturers (same month last year = 100)226The total sales area of commercial housing is about 10,000 square meters
075Producer price index for industrial producer of chemical fiber industry (same month last year = 100)227The total sales area of commercial housing will be accumulated (10,000 square meters)
076Producer price index for industrial producer of rubber and plastic products (same month last year = 100)228The total sales area of commercial business space is accumulated (10,000 square meters)
077Ex-factory price index for industrial producers of non-metallic mineral products (same month last year = 100)229The total sales area of commercial business premises is accumulated (10,000 square meters)
078Producer price index of industrial producers in ferrous metal smelting and calendaring processing industry (same month last year = 100)230The total sales area of commercial business premises will be accumulated (10,000 square meters)
079Producer price index of industrial producers in nonferrous metal smelting and calendaring processing industry (same month last year = 100)231Total value of office building sales ($100 million)
080Ex-factory price index for industrial producers of metal products (same month last year = 100)232Total value of present house sales of office building (100 million yuan)
081Producer price index for industrial producer of general equipment manufacturing (=100 of last year)233Total value of office building sales volume ($100 million)
082Producer price index of industrial producer for special equipment manufacturing industry (same month last year = 100)234Cumulative value of commodity housing sales ($100 million)
083Ex-factory price index for industrial vehicle manufacturers (same month last year = 100)235Accumulative value of present house sales of commodity housing (billion yuan)
084Ex-factory price index for industrial producers in railways, ships, aerospace and other transport equipment manufacturing industries (same month last year = 100)236Cumulative value of commodity housing sales volume ($100 million)
085Producer price index of industrial producer for electrical machinery and equipment manufacturing (=100 of last year)237Cumulative value of commodity house sales ($100 million)
086Producer price index for industrial producer computers, telecommunications and other electronic equipment manufacturers (same month last year = 100)238Accumulative value of present housing sales of commodity house (billion yuan)
087Industrial producer price index for instrument manufacturing industry (same month last year = 100)239Cumulative value of commodity house sales volume ($100 million)
088Producer price index for other manufacturing industries (same month last year = 100)240Cumulative value of commercial business housing sales ($100 million)
089Ex-factory price index of industrial producer of waste comprehensive utilization industry (=100 of last year)241The total value of existing commercial housing sales is 100 billion yuan
090Metal producer, machinery and equipment repair industry producer price index (same month last year = 100)242Cumulative sales value of commercial business premises (100 million yuan)
091Producer price index of industrial producers in electricity, heat production and supply industries (same month last year = 100)243Cumulative value of land acquisition area of real estate (10,000 square meters)
092Producer price index of industrial producer for gas production and supply industry (same month last year = 100)244Accumulated value of land transaction price of real estate industry (billion yuan)
093Producer price index of industrial producers in water production and supply industry (same month last year = 100)245Cumulative value of real estate investment ($100 million)
094Cumulative value of export value of industrial exports (100 million yuan)246Cumulative value of real estate investment ($100 million)
095Cumulative value of export value of coal mining and washing industry (100 million yuan)247Cumulative value of housing investment of 90 square meters and below (100 million yuan)
096Cumulative value of export value of oil and natural gas extraction industry (100 million yuan)248Cumulative value of housing investment over 144 square meters (billion yuan)
097The cumulative value of export value of non-ferrous metal mining and export industry (100 million yuan)249Accumulative value of villa and high-grade apartment investment (billion yuan)
098Cumulative value of export value of non-metallic ore mining and export industry (100 million yuan)250Total value of investment in real estate office building (100 million yuan)
099Export value of agricultural and sideline products processing industry accumulated value (100 million yuan)251Cumulative value of investment in commercial real estate business (100 million yuan)
100The cumulative value of the export delivery value of the food manufacturing industry (100 million yuan)252Cumulative value of other real estate investment ($100 million)
101Export value of liquor, beverages and refined tea manufacturing industry value accumulated (100 million yuan)253Cumulative value of investment in real estate development and construction projects (100 million yuan)
102The cumulative value of the export delivery value of the tobacco products industry (billion yuan)254Cumulative value of investment in real estate development and installation projects (100 million yuan)
103The cumulative value of the export delivery value of the textile industry (100 million yuan)255Cumulative value of investment in real estate equipment and equipment purchase (100 million yuan)
104Export value of textile, clothing and apparel industry accumulated value (100 million yuan)256Cumulative value of other investment in real estate investment ($100 million)
105Export value of leather, fur, feathers and their products and footwear industry accumulated value (100 million yuan)257Real estate land purchase fee accumulative value (100 million yuan)
106Export value of timber processing, timber, bamboo, rattan, brown and straw products accumulated value (100 million yuan)258Total investment of real estate development plan (100 million yuan)
107Total value of export delivery value of furniture manufacturing industry (billion yuan)259Added value of fixed assets investment in real estate development (100 million yuan)
108Export value of paper and paper products, accumulated value (100 million yuan)260Accumulative value of the source of investment funds of real estate (billion yuan)
109Cumulative value of export delivery value of printing and recording media reproduction industry (100 million yuan)261Accumulated value of capital surplus in real estate investment last year (100 million yuan)
110The total value of delivery value of exports of culture, education, industry, sports and entertainment products (100 million yuan)262Real estate investment this year, the total amount of funds is accumulated (100 million yuan)
111Export value of oil, coal and other fuel processing industry accumulated value (100 million yuan)263Real estate investment domestic loan accumulative value (billion yuan)
112Export value of chemical raw materials and chemical products manufacturing value accumulated value (100 million yuan)264Total value of investment and utilization of foreign investment in real estate investment (billion yuan)
113The cumulative value of the export delivery value of the pharmaceutical manufacturing industry (100 million yuan)265Accumulative value of self-financing of real estate investment (billion yuan)
114Cumulative value of export value of chemical fiber manufacturing industry (100 million yuan)266Cumulative value of other funds for real estate investment ($100 million)
115Export value of non-metallic mineral products industry accumulated value (100 million yuan)267The total value of the real estate investment should be paid ($100 million)
116Export value of ferrous metal smelting and calendaring processing industry value accumulated value (100 million yuan)268Cumulative value of real estate investment projects ($100 million)
117Export value of non-ferrous metal smelting and calendaring processing industry value accumulated value (100 million yuan)269Fixed long distance telephone call duration (IP) and current value (ten thousand minutes)
118The cumulative value of the export delivery value of the metal products industry (100 million yuan)270The value of a long term (ten thousand minutes) for a mobile phone call
119Export value of general equipment manufacturing industry accumulated value (100 million yuan)271End value of subscriber number (ten thousand households) at the end of a fixed phone
120Export value of special equipment manufacturing industry: cumulative value (100 million yuan)272End of user number (10,000) at the end of city phone year
121Export value of electrical machinery and equipment manufacturing industry accumulated value (100 million yuan)273End of user number (ten thousand households) at the end of rural telephone year
122Export value of computers, telecommunications and other electronic equipment manufacturing industry accumulated value (100 million yuan)274Terminal value of mobile phone users (10,000 households)
123Export value of instrument manufacturing industry: cumulative value (100 million yuan)275Mobile SMS business volume (100 million)
124Cumulative value of export value of other manufacturing exports ($100 million)276Number of letters in the current period (100 million)
125Cumulative value of export delivery value of waste resources comprehensive utilization industry (100 million yuan)277The value of the number of packages (10,000 pieces)
126Cumulative value of delivery value of electricity, heat production and supply industry (100 million yuan)278The current value of the bill of exchange (ten thousand)
127Cumulative value of delivery value of gas production and supply industry (100 million yuan)279Number of newspapers in the current period (ten thousand)
128The cumulative value of delivery value of water production and supply industry (100 million yuan)280Number of magazines in the current period (ten thousand)
129Year-on-year increase in export value of industrial exports (%)281Value of express volume (10,000 pieces)
130Export value of coal mining and washing industry increased by year (%)282The current value of the total telecommunications business (100 million yuan)
131Export value of oil and natural gas extraction industry grew by year on year (%)283The end value of the supply of money and quasi money (M2)
132The delivery value of non-ferrous metal mining and export industry increased by year (%)284Monetary and quasi currency (M2) supply growth (%)
133The delivery value of non-metallic ore mining and export industry increased by year (%)285Money (M1) supply terminal value (billion)
134Export value of agricultural and sideline products processing industry increased by year (%)286Money (M1) supply growth (%)
135Food manufacturing export delivery value increase (%)287Cash (M0) supply at the end of the period (billion yuan)
136Export value of wine, beverages and refined tea manufacturing increased by year (%)288Cash (M0) supply in circulation increased year-on-year (%)
137Export value of tobacco products export growth (%)289Consumer price index (=100 of the same month of the previous year)
138Export delivery value of textile industry increase (%)290Consumer price index for food, tobacco and alcoholic drinks (same month last year = 100)
139Export value of textile, clothing and apparel industry increased by year (%)291Clothing consumer price index (same month last year = 100)
140Export value of leather, fur, feather and its products and footwear industry increased by year (%)292Consumer price index for residential residents (same month last year = 100)
141Export value of timber processing and timber, bamboo, rattan, brown and straw products increased by year (%)293Consumer price index for consumer goods and services (same month last year = 100)
142Export value of furniture manufacturing export increase (%)294Consumer price index for transport and communications (=100 of last year)
143Export delivery value of paper and paper products increased year by year (%)295Education, culture and entertainment consumer price index (same month last year = 100)
144Export delivery value of printing and recording media replication industry increased year by year (%)296Consumer price index for health care residents (same month last year = 100)
145The delivery value of exports of culture, education, industry, sports and entertainment products increased by year (%)297Consumer price index for other supplies and services (=100 of last year)
146Export value of oil, coal and other fuel processing industries increased by year (%)298Food consumer price index (same month last year = 100)
147Export value of chemical raw materials and chemical products manufacturing increased by year (%)299Food consumer price index (same month last year = 100)
148Increase in export delivery value of pharmaceutical manufacturing industry (%)300The consumer price index of egg residents (=100 of the same month of the same year)
149Export value of chemical fiber manufacturing industry increased by year (%)301Consumer price index for aquatic products (same month last year = 100)
150Export value of non-metallic mineral products industry increased by year (%)302Fresh vegetable consumer price index (same month last year = 100)
151Export value of ferrous metal smelting and calendaring processing industry increased by (%)303Fresh fruit consumer price index (same month last year = 100)
152Export value of non-ferrous metal smelting and calendaring processing industry increased by (%)
Table A2. Verification set error.
Table A2. Verification set error.
V1V2V3V4V5V6
primaryMAE0.0340.0170.0750.0610.0680.050
RMSE0.0400.0220.0880.0650.0910.052
MAPE0.0350.0160.0820.0640.0630.050
secondaryMAE0.0370.0460.0430.0350.0130.028
RMSE0.0640.0580.0610.0450.0150.031
MAPE0.0380.0480.0450.0330.0120.027
tertiaryMAE0.0270.0250.0210.0280.0190.027
RMSE0.0300.0280.0240.0400.0220.029
MAPE0.0250.0240.0190.0270.0180.025
residentialMAE0.0460.0260.0440.0340.0450.036
RMSE0.0520.0290.0530.0360.0580.047
MAPE0.0430.0250.0410.0320.0460.034
allMAE0.1000.0680.0790.0920.0360.049
RMSE0.1100.1030.0990.1120.0450.076
MAPE0.0240.0170.0200.0230.0090.012

References

  1. Guo, H.; Chen, Q.; Xia, Q.; Kang, C.; Zhang, X. A monthly electricity consumption forecasting method based on vector error correction model and self-adaptive screening method. Int. J. Electr. Power Energy Syst. 2018, 95, 427–439. [Google Scholar] [CrossRef]
  2. Liu, D.; Ruan, L.; Liu, J.; Huan, H.; Zhang, G.; Feng, Y.; Li, Y. Electricity consumption and economic growth nexus in beijing: A causal analysis of quarterly sectoral data. Renew. Sustain. Energy Rev. 2018, 82, 2498–2503. [Google Scholar] [CrossRef]
  3. Liu, D.; Wang, J.; Wang, H. Short-term wind speed forecasting based on spectral clustering and optimised echo state networks. Renew. Energy 2015, 78, 599–608. [Google Scholar] [CrossRef]
  4. Foley, A.M.; Leahy, P.G.; Marvuglia, A.; McKeogh, E.J. Current methods and advances in forecasting of wind power generation. Renew. Energy 2012, 37, 1–8. [Google Scholar] [CrossRef] [Green Version]
  5. Diagne, M.; David, M.; Lauret, P.; Boland, J.; Schmutz, N. Review of solar irradiance forecasting methods and a proposition for small-scale insular grids. Renew. Sustain. Energy Rev. 2013, 27, 65–76. [Google Scholar] [CrossRef]
  6. Pieri, E.; Kyprianou, A.; Phinikarides, A.; Makrides, G.; Georghiou, G.E. Forecasting degradation rates of different photovoltaic systems using robust principal component analysis and arima. IET Renew. Power Gener. 2017, 11, 1245–1252. [Google Scholar] [CrossRef]
  7. Ferreira, L.R.A.; Otto, R.B.; Silva, F.P.; Souza, S.N.M.D.; Souza, S.S.D.; Junior, O.H.A. Review of the energy potential of the residual biomass for the distributed generation in brazil. Renew. Sustain. Energy Rev. 2018, 94, 440–455. [Google Scholar] [CrossRef]
  8. Theo, W.L.; Lim, J.S.; Ho, W.S.; Hashim, H.; Lee, C.T. Review of distributed generation (DG) system planning and optimisation techniques: Comparison of numerical and mathematical modelling methods. Renew. Sustain. Energy Rev. 2017, 67, 531–573. [Google Scholar] [CrossRef]
  9. O’Connell, N.; Pinson, P.; Madsen, H.; O’Malley, M. Benefits and challenges of electrical demand response: A critical review. Renew. Sustain. Energy Rev. 2014, 39, 686–699. [Google Scholar] [CrossRef]
  10. Pappas, S.S.; Ekonomou, L.; Moussas, V.C.; Karampelas, P.; Katsikas, S.K. Adaptive load forecasting of the hellenic electric grid. J. Zhejiang Univ.-Sci. A (Appl. Phys. Eng.) 2008, 9, 1724–1730. [Google Scholar] [CrossRef]
  11. Khan, A.R.; Mahmood, A.; Safdar, A.; Khan, Z.A.; Khan, N.A. Load forecasting, dynamic pricing and dsm in smart grid: A review. Renew. Sustain. Energy Rev. 2016, 54, 1311–1322. [Google Scholar] [CrossRef]
  12. Ahmad, A.S.; Hassan, M.Y.; Abdullah, M.P.; Rahman, H.A.; Hussin, F.; Abdullah, H.; Saidur, R. A review on applications of ann and svm for building electrical energy consumption forecasting. Renew. Sustain. Energy Rev. 2014, 33, 102–109. [Google Scholar] [CrossRef]
  13. Son, H.; Kim, C. Short-term forecasting of electricity demand for the residential sector using weather and social variables. Resour. Conserv. Recycl. 2017, 123, 200–207. [Google Scholar] [CrossRef]
  14. Debnath, K.B.; Mourshed, M. Forecasting methods in energy planning models. Renew. Sustain. Energy Rev. 2018, 88, 297–325. [Google Scholar] [CrossRef]
  15. Ekonomou, L.; Christodoulou, C.; Mladenov, V. A short-term load forecasting method using artificial neural networks and wavelet analysis. Int. J. Power Syst. 2016, 1, 64–68. [Google Scholar]
  16. Yildiz, B.; Bilbao, J.I.; Sproul, A.B. A review and analysis of regression and machine learning models on commercial building electricity load forecasting. Renew. Sustain. Energy Rev. 2017, 73, 1104–1122. [Google Scholar] [CrossRef]
  17. Lee, W.-J.; Hong, J. A hybrid dynamic and fuzzy time series model for mid-term power load forecasting. Int. J. Electr. Power Energy Syst. 2015, 64, 1057–1062. [Google Scholar] [CrossRef]
  18. Nie, H.; Liu, G.; Liu, X.; Wang, Y. Hybrid of arima and SVMs for short-term load forecasting. Energy Procedia 2012, 16, 1455–1460. [Google Scholar] [CrossRef]
  19. Sadaei, H.J.; Guimarães, F.G.; José da Silva, C.; Lee, M.H.; Eslami, T. Short-term load forecasting method based on fuzzy time series, seasonality and long memory process. Int. J. Approx. Reason. 2017, 83, 196–217. [Google Scholar] [CrossRef]
  20. Voyant, C.; Notton, G.; Kalogirou, S.; Nivet, M.-L.; Paoli, C.; Motte, F.; Fouilloy, A. Machine learning methods for solar radiation forecasting: A review. Renew. Energy 2017, 105, 569–582. [Google Scholar] [CrossRef]
  21. Deo, R.C.; Tiwari, M.K.; Adamowski, J.F.; Quilty, J.M. Forecasting effective drought index using a wavelet extreme learning machine (w-elm) model. Stoch. Environ. Res. Risk Assess. 2017, 31, 1211–1240. [Google Scholar] [CrossRef]
  22. Wang, S.; Zhang, N.; Wu, L.; Wang, Y. Wind speed forecasting based on the hybrid ensemble empirical mode decomposition and ga-bp neural network method. Renew. Energy 2016, 94, 629–636. [Google Scholar] [CrossRef]
  23. Xia, C.; Zhang, M.; Cao, J. A hybrid application of soft computing methods with wavelet svm and neural network to electric power load forecasting. J. Electr. Syst. Inf. Technol. 2017. [Google Scholar] [CrossRef]
  24. Barman, M.; Dev Choudhury, N.B.; Sutradhar, S. A regional hybrid goa-svm model based on similar day approach for short-term load forecasting in Assam, India. Energy 2018, 145, 710–720. [Google Scholar] [CrossRef]
  25. Dedinec, A.; Filiposka, S.; Dedinec, A.; Kocarev, L. Deep belief network based electricity load forecasting: An analysis of macedonian case. Energy 2016, 115, 1688–1700. [Google Scholar] [CrossRef]
  26. Liu, Y.; Wang, W.; Ghadimi, N. Electricity load forecasting by an improved forecast engine for building level consumers. Energy 2017, 139, 18–30. [Google Scholar] [CrossRef]
  27. Singh, P.; Dwivedi, P. Integration of new evolutionary approach with artificial neural network for solving short term load forecast problem. Appl. Energy 2018, 217, 537–549. [Google Scholar] [CrossRef]
  28. Jin, M.; Zhou, X.; Zhang, Z.M.; Tentzeris, M.M. Short-term power load forecasting using grey correlation contest modeling. Expert Syst. Appl. 2012, 39, 773–779. [Google Scholar] [CrossRef]
  29. Rana, M.; Koprinska, I. Forecasting electricity load with advanced wavelet neural networks. Neurocomputing 2016, 182, 118–132. [Google Scholar] [CrossRef]
  30. Bahrami, S.; Hooshmand, R.-A.; Parastegari, M. Short term electric load forecasting by wavelet transform and grey model improved by pso (particle swarm optimization) algorithm. Energy 2014, 72, 434–442. [Google Scholar] [CrossRef]
  31. Liu, J.P.; Li, C.L. The short-term power load forecasting based on sperm whale algorithm and wavelet least square support vector machine with dwt-ir for feature selection. Sustainability 2017, 9, 1188. [Google Scholar] [CrossRef]
  32. Qiu, X.; Ren, Y.; Suganthan, P.N.; Amaratunga, G.A.J. Empirical mode decomposition based ensemble deep learning for load demand time series forecasting. Appl. Soft Comput. 2017, 54, 246–255. [Google Scholar] [CrossRef]
  33. Fan, G.F.; Qing, S.; Wang, H.; Hong, W.C.; Li, H.J. Support vector regression model based on empirical mode decomposition and auto regression for electric load forecasting. Energies 2013, 6, 1887–1901. [Google Scholar] [CrossRef]
  34. Zhang, W.; Qu, Z.; Zhang, K.; Mao, W.; Ma, Y.; Fan, X. A combined model based on ceemdan and modified flower pollination algorithm for wind speed forecasting. Energy Convers. Manag. 2017, 136, 439–451. [Google Scholar] [CrossRef]
  35. Yajima, H.; Derot, J. Application of the random forest model for chlorophyll-a forecasts in fresh and brackish water bodies in Japan, using multivariate long-term databases. J. Hydroinform. 2018, 20, 206–220. [Google Scholar] [CrossRef]
  36. Lahouar, A.; Ben Hadj Slama, J. Hour-ahead wind power forecast based on random forests. Renew. Energy 2017, 109, 529–541. [Google Scholar] [CrossRef]
  37. Lei, C.; Deng, J.; Cao, K.; Ma, L.; Xiao, Y.; Ren, L. A random forest approach for predicting coal spontaneous combustion. Fuel 2018, 223, 63–73. [Google Scholar] [CrossRef]
  38. Wang, Z.; Wang, Y.; Zeng, R.; Srinivasan, R.S.; Ahrentzen, S. Random forest based hourly building energy prediction. Energy Build. 2018, 171, 11–25. [Google Scholar] [CrossRef]
  39. Lahouar, A.; Ben Hadj Slama, J. Day-ahead load forecast using random forest and expert input selection. Energy Convers. Manag. 2015, 103, 1040–1051. [Google Scholar] [CrossRef]
  40. Diebold, F.X.; Mariano, R.S. Comparing predictive ability. J. Bus. Econ. Stat. 1995, 13, 253–262. [Google Scholar]
Figure 1. Monthly load forecasting flow chart based on EEMD-RF. (Notes: Hy: High frequency sequence; My: Medium frequency sequence; Ly: Low frequency sequence).
Figure 1. Monthly load forecasting flow chart based on EEMD-RF. (Notes: Hy: High frequency sequence; My: Medium frequency sequence; Ly: Low frequency sequence).
Sustainability 10 03282 g001
Figure 2. (a) the primary industry electricity consumption; (b) the secondary industry electricity consumption; (c) the tertiary industry electricity consumption; (d) the residential electricity consumption.
Figure 2. (a) the primary industry electricity consumption; (b) the secondary industry electricity consumption; (c) the tertiary industry electricity consumption; (d) the residential electricity consumption.
Sustainability 10 03282 g002
Figure 3. (a) EEMD decomposition of monthly the primary industrial electricity consumption; (b) EEMD decomposition of monthly the secondary industrial electricity consumption; (c) EEMD decomposition of monthly the tertiary industrial electricity consumption; (d) EEMD decomposition of monthly the residential electricity consumption.
Figure 3. (a) EEMD decomposition of monthly the primary industrial electricity consumption; (b) EEMD decomposition of monthly the secondary industrial electricity consumption; (c) EEMD decomposition of monthly the tertiary industrial electricity consumption; (d) EEMD decomposition of monthly the residential electricity consumption.
Sustainability 10 03282 g003
Figure 4. (A) Diverse serial verification set mean absolute percentage error (MAPE) for primary industries electricity for different number of trees; (B) Diverse serial verification set MAPE for secondary industries electricity for different number of trees; (C) Diverse serial verification set MAPE for tertiary industries electricity for different number of trees; (D) Diverse serial verification set MAPE for residential electricity for different number of trees.
Figure 4. (A) Diverse serial verification set mean absolute percentage error (MAPE) for primary industries electricity for different number of trees; (B) Diverse serial verification set MAPE for secondary industries electricity for different number of trees; (C) Diverse serial verification set MAPE for tertiary industries electricity for different number of trees; (D) Diverse serial verification set MAPE for residential electricity for different number of trees.
Sustainability 10 03282 g004
Figure 5. (a) Fitting situation of monthly the primary industrial electricity consumption; (b) Fitting situation of monthly the secondary industrial electricity consumption; (c) Fitting situation of monthly the tertiary industrial electricity consumption; (d) Fitting situation of monthly the residential electricity consumption.
Figure 5. (a) Fitting situation of monthly the primary industrial electricity consumption; (b) Fitting situation of monthly the secondary industrial electricity consumption; (c) Fitting situation of monthly the tertiary industrial electricity consumption; (d) Fitting situation of monthly the residential electricity consumption.
Sustainability 10 03282 g005aSustainability 10 03282 g005b
Table 1. The forecast error of the primary, secondary and tertiary industries and residential electricity.
Table 1. The forecast error of the primary, secondary and tertiary industries and residential electricity.
DateRFSVMSeasonal NaïveEEMD+RF
MAE/GWhMAPE/%MAE/GWhMAPE/%MAE/GWhMAPE/%MAE/GWhMAPE/%
primary industryJune 2017759.016.32932.427.771219.6410.16178.521.49
July 2017885.426.09839.295.771325.859.1235.260.24
August 2017826.595.57628.124.231562.1810.53360.822.43
September 2017336.603.11230.922.14317.482.94560.395.19
October 2017229.352.7339.850.47298.553.55348.284.14
November 201717.890.2336.370.46241.463.06310.503.93
average509.144.01451.163.47827.536.56298.962.90
Secondary industryJune 201710,153.862.668146.922.1319,551.725.113854.711.01
July 201736,299.228.3217,527.414.0238,641.138.8612,005.202.75
August 201718,760.534.8211,782.693.039453.572.436522.621.68
September 201725,011.877.314318.141.2616,010.544.686794.001.99
October 201713,456.313.6913,432.033.6810,273.232.8111,723.033.21
November 201713,253.663.4013,077.423.3513,394.273.438633.362.21
average19,489.245.0311,380.772.9117,887.414.558255.492.14
Tertiary industryJune 2017500.590.711745.412.496995.119.97393.320.56
July 2017628.840.752678.483.199759.5811.62455.030.54
August 2017349.020.372213.772.3810,524.4011.30919.700.99
September 201799.370.121034.561.248814.7310.56281.070.34
October 2017445.600.631321.191.877861.9311.111521.722.15
November 20171619.152.362096.293.055426.427.901878.782.73
average607.100.821848.282.378230.3610.41908.271.22
ResidentialJune 2017511.080.85191.980.324150.096.92136.920.23
July 20171716.822.371808.312.505180.117.15200.420.28
August 201710,646.7510.436006.675.8814,459.1314.167968.427.80
September 20177485.387.854537.754.7610,017.7110.513856.984.05
October 20171065.121.552274.333.305572.248.10705.251.02
November 20171161.401.81823.431.284785.487.46404.790.63
average3764.434.142607.083.017360.799.052212.132.33
Table 2. Six months’ electricity forecast for each method/GWh.
Table 2. Six months’ electricity forecast for each method/GWh.
DateOrigRFMAEMAPE/%SVMMAEMAPE/%SnaïveMAEMAPE/%EEMD-RFMAEMAPE/%
June 2017524,442513,51810,9232.08532,34979071.51492,52531,9176.09526,23617940.34
July 2017607,242567,71239,5306.51590,05517,1872.83552,33654,9079.04592,32314,9192.46
August 2017599,108568,52530,5835.1603,90747990.8563,10935,9996.01595,08840200.67
September 2017531,682499,62132,0616.03531,7931110.02496,52235,1606.61534,14524630.46
October 2017512,995498,25814,7382.87528,29615,3012.98488,98924,0064.68523,04010,0451.96
November 2017531,016518,20212,8142.41546,49915,4842.92507,16823,8484.49542,46111,4452.16
Table 3. Six months’ electricity consumption combined forecasting for the whole society/GWh.
Table 3. Six months’ electricity consumption combined forecasting for the whole society/GWh.
DatePrimary IndustrySecondary IndustryTertiary IndustryResidentialPredOrigMAEMAPE/%
June 201711,587387,47469,03959,778527,878524,44234360.66
July 201714,210425,02881,53572,450593,224607,24214,0182.31
August 201714,225400,25390,89593,865599,237599,1081290.02
September 201711,206348,34882,30790,031531,892531,6822090.04
October 20178652377,70669,23668,610524,204512,99511,2082.18
November 20178128400,53669,90164,418542,984531,01611,9682.25
Table 4. DM test of different models.
Table 4. DM test of different models.
DM-MAEDM-MAPE
PrimarySecondaryTertiaryResidentialPrimarySecondaryTertiaryResidential
RF1.9682.1492.1551.7001.9872.4082.0541.856
SVM1.9772.6872.2841.6532.0152.7412.3681.743
snaïve1.9501.9685.5022.6292.0901.9729.2225.181

Share and Cite

MDPI and ACS Style

Liu, D.; Sun, K.; Huang, H.; Tang, P. Monthly Load Forecasting Based on Economic Data by Decomposition Integration Theory. Sustainability 2018, 10, 3282. https://doi.org/10.3390/su10093282

AMA Style

Liu D, Sun K, Huang H, Tang P. Monthly Load Forecasting Based on Economic Data by Decomposition Integration Theory. Sustainability. 2018; 10(9):3282. https://doi.org/10.3390/su10093282

Chicago/Turabian Style

Liu, Da, Kun Sun, Han Huang, and Pingzhou Tang. 2018. "Monthly Load Forecasting Based on Economic Data by Decomposition Integration Theory" Sustainability 10, no. 9: 3282. https://doi.org/10.3390/su10093282

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop