Next Article in Journal
Anthropogenic Heat Flux Estimation Based on Luojia 1-01 New Nighttime Light Data: A Case Study of Jiangsu Province, China
Next Article in Special Issue
Mapping the Distribution of Coffee Plantations from Multi-Resolution, Multi-Temporal, and Multi-Sensor Data Using a Random Forest Algorithm
Previous Article in Journal
Mapping Forest Fire Risk—A Case Study in Galicia (Spain)
Previous Article in Special Issue
An Improved Spatiotemporal Data Fusion Method Using Surface Heterogeneity Information Based on ESTARFM
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Hybrid Spatio-Temporal Prediction Model for Solar Photovoltaic Generation Using Numerical Weather Data and Satellite Images

Department of Convergence & Fusion System Engineering, Kyungpook National University, Sangju 37224, Korea
*
Author to whom correspondence should be addressed.
Remote Sens. 2020, 12(22), 3706; https://doi.org/10.3390/rs12223706
Submission received: 13 October 2020 / Revised: 4 November 2020 / Accepted: 6 November 2020 / Published: 12 November 2020

Abstract

:
Precise and accurate prediction of solar photovoltaic (PV) generation plays a major role in developing plans for the supply and demand of power grid systems. Most previous studies on the prediction of solar PV generation employed only weather data composed of numerical text data. The numerical text weather data can reflect temporal factors, however, they cannot consider the movement features related to the wind direction of the spatial characteristics, which include the amount of both clouds and particulate matter (PM) among other weather features. This study aims developing a hybrid spatio-temporal prediction model by combining general weather data and data extracted from satellite images having spatial characteristics. A model for hourly prediction of solar PV generation is proposed using data collected from a solar PV power plant in Incheon, South Korea. To evaluate the performance of the prediction model, we compared and performed ARIMAX analysis, which is a traditional statistical time-series analysis method, and SVR, ANN, and DNN, which are based on machine learning algorithms. The models that reflect the temporal and spatial characteristics exhibited better performance than those using only the general weather numerical data or the satellite image data.

Graphical Abstract

1. Introduction

The issue of climate change caused by carbon emission and the depletion of fossil fuels is emerging worldwide. To address this problem, the Kyoto Protocol aimed at reducing greenhouse gas emissions for the purpose of decarbonization. Furthermore, the Paris Agreement, which strengthened global carbon regulations to suppress the global average temperature within 2 °C before industrialization, was signed [1,2]. While the development of many renewable energies are being sought to replace fossil fuels, South Korea announced its renewable energy generation by 2030 along with denuclearization in line with global trends [3]. In addition, more than 95% of the new renewable energy facilities use clean energy such as solar PV and wind power.
In the past decade, the unit price of solar panels and facilities for power generation systems was decreased, and there is an increase in the number of large-scale solar PV farms worldwide. Typically, the United States, Germany, and China have gigawatt-scale farms, and South Korea established a 467 MW solar PV farms in 2013 [4]. Compared to other renewable energy sources, the solar PV power generation has the advantage of low installation, and maintenance costs and an expected life of more than 20 years [5]. In addition, it is possible to minimize the damages to nature that occur when installing power plants, which is one of the emerging environmental problems. However, solar PV power generation requires a large installation area due to its low energy density, and the amount of power generated fluctuates with meteorological factors such as a change in irradiance due to clouds or particulate matter (PM) [6,7]. This phenomenon increases the complexity of the plan for stable supply and demand of power systems, it mostly disrupts the schedule for power grid operations. In particular, the solar PV generation technology, which is considered as one of the key components in smart grids, a technology that integrates information and communication technology (ICT) into the power grid, has the advantage of generating clean and unlimited energy, however, for stable systems, there is a demand for accurate predicting technology [8]. Without establishing an accurate power supply and demand plan, here could be huge financial and social losses. For this reason, there is a rapid increase in the need for accurate prediction of the amount of solar PV power generated. Therefore, accurate prediction of the power generation of renewable energy sources is very important in establishing an efficient power supply and demand plan.
Most of the previous studies on the prediction of solar PV generation can be divided into two categories. The first category involves the use of numerical text weather data such as the most common irradiance, temperature, and precipitation [9,10,11,12,13]. This method increases the predictive accuracy of the power generation by taking advantage of the continuity over various times in the data. The second category uses motion vectors or indexes of irradiance and clouds in satellite images [14,15,16,17,18]. However, this approach sometimes ignores the physical information of the solar PV generation system [19]. The most directly affected factor in predicting solar PV power generation is the irradiance, which is greatly influenced by the cloud shadow. The cloud shadow can reflect the increase or decrease in irradiance by detecting the cloud motion vector through the movement of the cloud shown in the satellite image.
Recently, air pollution caused by PM has been another environmental issue in South Korea [20]. The increase in the concentration of PM in the atmosphere not only has a fatal effect on the human body, such as the respiratory organs, but also reduces the irradiance reaching solar panels due to the scattering of the solar radiation [21]. Most of the previous studies analyzed the effects of accumulation various types of dust as well as PM on solar panels [22,23,24,25,26], however, this study aims at investigating the effect of PM concentration distribution in the atmosphere on the solar PV generation. Both clouds and PM vary with time and affect the atmosphere because of the spatial characteristics shifted from the adjacent region to the measurement point in the wind direction. Therefore, using satellite images, we investigated the influence on the spatial characteristics of the cloud and PM. To reflect the spatial characteristics, the area where the solar PV power plant is located was designated as a region of interest (ROI) and then the adjacent area in eight directions was set as the adjacent region of interest (ROIadj). To investigate the effect of the movement of clouds and PM according to the direction of the wind moving from ROIadj to ROI, the amounts of clouds and PM extracted from the satellite image were first predicted and used as variables for the solar PV generation prediction model. We propose a hybrid solar PV generation forecasting model, which combines the numerical weather data composed of texts with spatial information extracted from satellite images. To develop the proposed model that reflects the temporal and spatial characteristics, a comparative analysis was conducted by dividing the experiments into three groups. The first group uses the numerical text weather data from the Korea Meteorological Administration (KMA), the second group uses the data extracted from the satellite images, and the third group combines the numerical weather data from the KMA and those from the satellite images. The proposed model uses the power generation data from a solar PV power plant located in Incheon, South Korea, to predict the amount of PV generation after one hour. The prediction model employs the autoregressive moving average with exogenous input (ARIMAX), which combines external factors in autoregressive moving average (ARIMA) considering the time-series among the traditional statistical analysis methods. It also uses Support Vector Regression (SVR) and artificial neural network (ANN), which employ machine learning algorithms that have been recently used actively in various fields. In addition, the Deep Neural Network (DNN) with additional hidden layers in ANN was analyzed.

Research Framework

In this study, the numerical text weather data from KMA and the data extracted from satellite images were used together to develop a hybrid spatio-temporal model that not only considers the temporal characteristics of the input parameter over the time flow but also reflect the spatial characteristics. The research framework is shown in Figure 1. The first step in the framework is the collection and preprocessing of the data provided by KMA and the Korea National Meteorological Satellite Center (NMSC). The second step involves the extraction of the necessary data from each of the four satellite images collected. The wind direction and wind speed in the Atmospheric Motion Vector (AMV) image, the amount and thickness of the cloud in the cloud optical thickness image, the amount and concentration of PM in the aerosol optical depth image, and the irradiance from the insolation image were extracted. The third step involves setting the ROI of the desired region in the image and then designating the ROIadj of the same size for the 8 directions adjacent to each edge and vertex. A solar PV power plant located in Incheon, South Korea, was analyzed and set as the ROI. Then, based on the wind direction information of the ROI, the effect of cloud and PM movement from the ROIadj to the ROI was analyzed, and the amount of cloud, cloud thickness, amount and concentration of PM in the ROI were predicted. In the fourth step, the data extracted from satellite images were analyzed together with numerical text weather data collected from KMA to perform the preprocessing process for the development of hybrid spatio-temporal models. In the last step, to develop solar PV generation forecasting analysis models that are based on the ARIMAX, SVR, ANN, and DNN methods, the prediction performance of the model was optimized by choosing optimal parameters for each technique.

2. Methodology

2.1. Numerical Text Data

To predict one hour ahead solar PV generation, the numerical text data used in the experiment are classified into three categories: meteorological data, air pollution data (such as PM) and solar PV generation. The numerical text weather data were provided by Incheon meteorological station, located at 37.47772 lat. and 126.6249 long. in KMA [27]. KMA provides public data on more than 15 types of time-based weather data, including temperature, precipitation and wind speed, for 103 stations across the country. In this study, temperature, precipitation, wind speed and direction, humidity, amount of sunshine, irradiance, cloudiness, and visibility were used as the input parameters to predict the amount of solar PV generated in Incheon. The air pollution data were provided at an Air Korea air pollutant station located within 3 km from the test location [28]. The rapid increase in the use of fossil fuels has raised the issue of air pollution, and the recent increase in the level of air pollutions has great environmental and economic effects in Korea. As the concentration of PM, such as PM10 and PM2.5, in the atmosphere increases, the solar radiation reaching the Earth from the Sun is scattered in the atmosphere, reducing the visibility and irradiance reaching the Earth surface. To analyze the air pollutants expected to affect the performance of solar PV generation, pollutants such as SO2, CO, O3, NO2, PM10, and PM2.5 were considered together with the meteorological data. Lastly, the solar PV generation data was provided by the Open Data Portal [29]. The analyzed solar PV power plant is located in Incheon, and the data on the amount of solar PV power generation were collected on an hourly basis from 0:00 on 1 January to 23:00 on 31 December 2015. K-NN algorithm was used for the interpolation of missing data for all parameters. This algorithm selects the nearest value and does not consider the values of other neighboring points, a constant interpolation can be generated. Therefore, interpolation was performed using 36 h of data before and after the data point at the time of missing. As mentioned above, solar PV generation is highly affected by the irradiance, so only the data from 9:00 to 17:00, which is the daylight time, were used. Table 1 shows the numerical text data samples.

2.2. Satellite Image Data

In this study, satellite images provided by the NMSC were used [30]. The satellite images were provided by the Communication, Ocean, and Meteorological Satellite (COMS), which was launched on 27 June 2010. It is the first geostationary combined satellite in Korea that performs ocean and meteorological observations and communication service mission. Detailed sensor information of the COMS is summarized in Table 2. The COMS provides more than 16 types of image data, including raw images, basic images, and processed images, every 15 min for North-East Asia and the Korean Peninsula. Image data of AMV, including cloud optical thickness image, aerosol optical depth image, and insolation image, were used in this study [31,32,33,34]. Each image shows the Korea Peninsula in a size of 1024 × 1024 and has a ground resolution of 1720.8 m per pixel. Figure 2 shows four types of images used in this experiment, which were provided at 15:00 on 18 April 2015. Detailed information and the data extraction method for each image are described in the subsequent subsections.

2.2.1. Atmospheric Motion Vector (AMV) Image

Figure 2a shows the AMV image, which depicts the information about the movement of the atmosphere, including the information on wind direction and wind speed. The ROI has to be set before extracting the information of the wind direction and speed from the image. The numerous wind direction arrows shown in the AMV image have fixed starting point of the arrow and are referred to as center point in this paper. In order to set the ROI, the center point that exists at the Incheon, South Korea as the target point is selected. As the wind direction changes over time, the angle of the arrow indicating the wind direction in the next time step image is changed, but the coordinates of the center point are fixed because the arrow is rotated based on the center point. Therefore, the ROI was set at 50 × 50, a size that does not interfere with the rotation radius of the arrow when the arrow rotates 360° to the center point. Figure 3 shows the set ROI in magenta.
The wind direction vector information is not provided in the AMV image. Thus, to extract the wind direction information, other previous studies have extracted wind direction vector information from images [14]. In this study, the wind direction arrows are extracted from images and calculated as described in the following sequence. To obtain information on the wind direction, first, the coordinates of the tail point is found using the value of the pixel located farthest from the center point among the pixels that constitute the arrow. Next, the farthest pixel from the center point and tail point at the same time among the pixels that constitute the arrow is designated as the endpoint. Then, the wind direction can be acquired by calculating the angle with respect to the north direction of the straight line connecting the obtained center point and the endpoint. Figure 4 shows a single AMV vector, which includes the information on the wind direction, wind speed, center point, endpoint, and tail point. The process of extracting information on the wind speed is as follows: first, a straight line is drawn parallel to the arrow body in about the distance of the endpoint in the direction of the tail point; secondly, the wind speed is calculated primarily by counting the number of pixels that overlap between the newly drawn straight line and the existing wind direction arrow; thirdly, another new straight line is drawn in the middle of the distance between the arrow body and the tail point; lastly, by calculating the number of overlapping pixels in the same way as in the second step, it is determined whether there is a tail segment of 2 m/s, and the wind speed is finally determined by calculating whether there is a triangle representing 25 m/s.

2.2.2. Cloud Optical Thickness, Aerosol Optical Depth, and Insolation Image

Figure 2b shows the Cloud Optical Thickness Image, which presents the cloud thickness in a color index. Various studies are being conducted to analyze the movement and amount of clouds on satellite images through cloud motion vector [35]. However, in this study, the color index of the image was used to predict the amount of clouds. To get information on the amount and thickness of the cloud, the index from 0 to 100 was divided into four levels: clear, partly cloudy, mostly cloudy, and cloudy. Before extracting the data to analyze the spatial characteristics, ROIadj that is of the same size as the ROI was designated for 8 directions of the areas adjacent to the ROI adjacent in the AMV image. The ROIadj were denoted as S1-8 in order from the top left. To extract the amount and thickness of clouds present in the ROIadj and ROI, the pixels in each region are divided into four color indexes, and the number of pixels per index was recorded. Figure 5 shows the set ROIadj in cyan.
Figure 2c shows the Aerosol Optical Depth Image, which represents the aerosol components, such as yellow dust and PM, in a color index. As in the Cloud Optical Thickness image, the color index is divided into good, moderate, unhealthy, and very unhealthy, and the number of pixels was recorded.
Figure 2d shows the Insolation Image, which represents the irradiance reaching the ground surface in a color index. To extract the irradiance in the ROI, the amount of insolation in the range of 0–1000, which is the size of the color index, was readjusted to the range of 0–488, which is the pixel size of the color index in the insolation image. After that the average value of the numerical index in the ROI was recorded.
The satellite images taken at the NMSC are provided every 15 min, which involves the time-series continuity. However, to combine with the numerical text weather data from KMA, an image of the same time-scale of 1 h was extracted and used. As with the numerical text weather data, the data extracted from the satellite images had temporal continuity, as shown in Figure 6. Finally, the values of wind direction, wind speed, cloud thickness, concentration of PM, and irradiance, which are parameters every hour extracted from each satellite image, were added to the numerical text weather data set.

2.3. Correlation Analysis with the Wind Direction of the Cloud and PM

Clouds and PM in the atmosphere have spatial characteristics that move in the wind direction [36]. Therefore, in order to demonstrate the reflection of spatial characteristics in the prediction of solar PV generation, the mobility of the clouds and PM according to the wind direction from the ROIadj to ROIadj was numerically verified and expressed as Algorithm 1. The sample datasets used for the Algorithm 1 are shown in Table 3 and Table 4, and the composition of Algorithm 1 follows the following sequence: the wind direction of the ROI at time t is identified; the amount of cloud and PM at time t of each ROIadj and ROI are compared and analyzed; the increase or decrease of clouds and PM due to the movement of clouds and PM at time t + 1 of the ROI according to the wind is determined. For example, assuming that the wind direction at the ROI at time t is northwest and the amount of clear cloud in S1 is greater than that in the ROI (ROIIncheon), then, the amount of clear cloud of ROI increases at time t + 1, it is determined as true about movement of clouds and the opposite case as false. Then, the accuracy is derived by calculating the number of true and false for the entire time. PM also uses the same method as Algorithm 1. Table 5 and Table 6 list the results of the determined mobilities of both clouds and PM with respect to the wind direction. PM has high accuracy in all the cases, but clouds have high accuracy only in the mostly cloudy and cloudy levels. In the case of partly cloudy and cloudy, the accuracy is considered inferior because there is a possibility of natural extinction over time. Therefore, it can be determined that clouds and PM, which have a great influence on the fluctuations in irradiance and solar PV generation, have mobility according to the movement of the wind.
Algorithm 1. Algorithm of discriminant for movement of clouds and PM by wind direction.
Denotes W D = W i n d   D i r e c t i o n ; S R O I = R O I I n c h e o n ; S i = R O I a d j ;   N = n u m b e r   o f   s a m p l e s ; c 1 = n u m b e r   o f   1   i n   c o n d i t i o n a l   f o r .
1:Determination of i by identification of W D i in the S R O I
2: W D = { N W ,   N ,   N E ,   W ,   E ,   S W ,   S ,   S E }
3:Determination of the S i through i
4: S = { S 1 ,   S 2 ,   S 3 ,   S 4 ,   S 5 ,   S 6 ,   S 7 ,   S 8 }
5:Initialize the time step t = 1
6:Comparison of the amount of cloud in S i , t at t and the amount of cloud in the S R O I , t
7:Comparison of the amount of cloud in S R O I , t and S R O I , t + 1
8:For n = 1 to N
9: If S i , t   S R O I , t
10:  If S R O I , t + 1   S R O I , t
11:   True
12:  Else
13:   False
14: Else
15:  If S R O I , t + 1   S R O I , t
16:   True
17:  Else
18:   False
19: A c c u r a c y   ( % ) =   c 1 N × 100

3. Forecasting Method of Solar PV Generation

3.1. Prediction of Cloud and PM in ROI

Before directly predicting solar PV generation, the concentration of cloud and PM were first predicted from the image data to reflect the temporal and spatial effects of clouds and PM. There are four variables in both clouds and PM (cloud: clear, partly cloudy, mostly cloudy, cloudy; PM: good, moderate, unhealthy, very unhealthy). Next, the data for predicting the amount of cloud and PM in units of an hour about ROI using the above variables. To predict the amount of the clouds and PM, the SVR using the linear kernel was employed, and the data from January to December 2015, which is the entire experimental period, were randomly sampled and trained 60% of each month and predicted 40% of each month to reflect seasonal characteristics.

3.2. Proposed Models for the Prediction of Solar PV Generation in ROI

The solar PV generation was predicted using ARIMAX, which is a time-series analysis method, and SVR, ANN, and DNN, which are machine learning algorithms widely used in various fields. Each method is described in detail as the subsequent subsections.

3.2.1. Autoregressive Moving Integrated Average Exogenous input (ARIMAX)

The ARIMAX is developed by Newsham and Birt [37], and is a method in which external factors act as additional variables in the ARMIA, which is a traditional statistical time-series analysis method [38]. The ARMIA is a method that satisfies both the autocorrelation model (AR) and the moving average model (MA) at the same time. The AR determines whether the past data affect the future data, and the MA identifies the tendency for the average value of any random variable to increase or decrease continuously over time. The ARIMA is a technique in which differences (I) are added in ARMA, and it can eliminate abnormalities by applying the initial differencing step. The ARIMAX is mainly employed when the variables to be predicted have a time series or a periodicity, and is mainly used for short-term solar PV generation forecasting [39,40]. Since the meteorological and air pollutant variables, including solar PV generation, provide as an hour data that satisfies the time-series characteristic. The ARIMAX has the order of p, d, and q, which represent the autoregressive order, difference order, and moving average order, respectively. In this study, the ARIMAX with and order (1, 0, 1) was used.

3.2.2. Support Vector Regression (SVR)

The SVR is an extended method for regression prediction in the Support Vector Machine (SVM) proposed by Vapnik in 1995 [41]. The key algorithm of SVR is to learn the training data and then find the optimal regression function f(x) in which all the predicted values exist within ε , a specific deviation called the support vector, and the error values of the predicted result are minimized. In general, the datasets used for the actual prediction cannot be solved with only linear problems of one dimension. To this end, vectors existing in high dimensions can be calculated linearly using kernel functions such as RBF, linear and polynomial kernels. The regression function f(x) can be calculated by the Lagrangian method using the value of the calculated dot product. Compared to other techniques, the SVR is more generalization ability and is widely used to solve regression problems. Also, due to the global minimum value, it has remarkable performance in time-series analysis, so it is widely used in solar PV generation prediction research [42,43]. In this study, a linear kernel was used for both the prior prediction model for predicting the cloud and PM and the prediction model for predicting solar PV generation.

3.2.3. Artificial Neural Network (ANN)

ANN, which is actively used for classification and prediction in various fields, is an artificial neural network that mimics the structure of the human brain. It is composed of input, hidden, and output layers [44,45]. Like the human brain has numerous neurons connected to collect and process data, ANN has interconnected nodes in each layer. The output value for input is predicted by the activation function included in each layer. In the prediction process, the activation function of each layer mainly calculates the function value and the first-order derivative, and the learning time depends on the corresponding calculation process. Herein, the proposed model has one hidden layer, and the Relu function was used as the activation function for the solar PV generation prediction model.

3.2.4. Deep Neural Network (DNN)

DNN is used to expand the number of hidden layers in ANN, as shown in Figure 7 [46,47]. The ANN contains a single hidden layer between input and output layers whereas the basic form of the DNN can have one or more hidden layers. As the number of hidden layers increases, the computational complexity also increases, but a combination of nonlinear transformation techniques can lead to high prediction accuracy. As in ANN, d nodes X = (1, x1, x2,, xd)T, excluding the bias nodes, are inputs and O = (O1, O2,, Oc)T with c nodes are outputs. Then, the number of nodes on the hlth layer is denoted as nj [48].
The weight W that connects the (L − 1)th layer to the Lth layer is given by (nhl−1 + 1)∗nl, and it is represented by the matrix Whl, as expressed in Equation (1). As mentioned earlier, the number of added hidden layers in ANN can be expressed as Equation (2), and input parameter vector X is represented as an internal parameter Z0, as expressed in Equation (3), when applied in DNN.
In the operation of DNN, the sum of the product of the edge weights connected to the Z vector and jth node is first obtained and then stored in the S variables. The values of S and Z variables are stored independently because they are used in the later applied backpropagation algorithm. In this case, the computation of the jth node of the hlth layer is represented by Equation (4), where nhl − 1 is the number of nodes in the (hl − 1)th layer. Equation (4) is only for the calculation of a single node, but Equation (5) allows all operations of the hlth layer to be performed simultaneously. The activation function τ1τL−1 included in each layer mainly uses such functions as logistic, sigmoid, and ReLu functions, and the softmax function is mainly used in the output layer.
In this study, the tanh, Relu, and sigmoid functions were applied to the hidden layer and the sigmoid function was applied to the final output layer. Detailed information about each hidden layer is presented in Table 7. In addition, to avoid overfitting, a dropout layer was added, which omits and calculates some of the nodes. Nodes omitted from the dropout do not affect learning, thus, the problem of overfitting can be avoided.
W h l = ( w 10 h l w 11 h l w 20 h l w 21 h l w 1 n h l 1 h l w 2 n h l 1 h l w n h l 0 h l w n h l 1 h l w n h l n h l 1 h l ) ,     h l = 1 ,   2 ,   ,   L ,   1 h l L
O = f ( x ) = f L ( f 2 ( f 1 ( x ) ) )
Z 0 = ( z 0 ,   z 1 ,   z 2 ,   ,   z n 0 ) T = ( 1 ,   x 1 ,   x 2 ,   x d ) T
z j h l = τ l ( s j h l ) , s j h l = u j h l z h l 1 , z h l 1 = ( 1 ,   z 1 h l 1 ,   z 2 h l 1 ,   ,   z n h l 1 h l 1 ) T , u j h l = ( u j 0 h l 1 ,   u j 1 h l 1 ,   ,   u j n n h l 1 h l 1 ) T
z h l = τ 1 ( U h l z h l 1 ) ,     1 h l L

3.3. Analytic Process for Predicting Solar PV Generation

To predict the solar PV generation, an experiment was conducted by considering three groups for the same period from January to December 2015 with respect to the solar PV power plant located at Incheon as described above. In the entire experiment, after predicting the amount of the clouds and PM of 60% per each month to reflect spatial characteristics in the prediction model. After that, remained sample data of 40% per each month was randomly chosen, and then 70% were used for training and 30% for testing. Group 1 used only numerical text data of the commonly used weather and air pollutants, and forecasting was performed hourly. Group 2 used only satellite image data to identify spatial characteristics. Thereafter, the predicted amounts of clouds and PM were used together with the data extracted from the satellite images, including the wind direction and speed, as the input parameters to predict the solar PV generation. Finally, Group 3 performed prediction by combining the numerical text data and the data extracted from the satellite images to develop a hybrid spatio-temporal prediction model. Group 3 conducted an experiment by first predicting the amount of cloud and PM in the same way as in Group 2, and then combining with numerical text data of the same period.
For all groups, the parameters for the month, day, and time, representing time, were added to reflect the temporal characteristics, and one hour previous solar PV generation data was added as the input parameters for the generation model to reflect the presentity of the prediction model. Also, to understand the effect of each group on the cloud and PM for all the groups, three cases were adopted: case 1, where only the cloud is applied as a parameter; case 2, where only PM is applied as a parameter; case 3 that applies the cloud and PM at the same time. All the groups with the cases are shown in Table 8.

4. Results and Discussion

To predict solar PV generation, experiments were conducted using five models in three groups, including ARIMAX, SVR_RBF, SVR_Linear, SVR_Poly, ANN, and DNN. The following error analysis methods were employed to evaluate the performance of each model:
M A E =   1 n i = 1 n | y i y i |
R M S E =   1 n i = 1 n ( y i y i ) 2
S M A P E ( % ) =   1 n i = 1 n | y i y i | | y i | + | y i |
M B E ( % ) =   i = 1 n ( y i y i ) i = 1 n y i
C V ( % ) =   R M S E 1 n i = 1 n y i
where *   y : Predicted   value ,   y : Observed   value ,   n : Number   of   samples .
In general, the methods of analyzing errors in prediction models can be divided into two: the relative error analysis methods, such as Mean Absolute Error (MAE) and Root Mean Square Error (RMSE); absolute error analysis method, which uses percentages, such as Mean Absolute Error (MAPE). In most cases, these error analysis methods are used, but MAPE has the disadvantage of producing distorted results when the actual value is zero or when there are many extreme anomalies. For solar PV generation, MAPE cannot be used for a time other than the daylight time because the power generation converges to zero. To compensate for this problem, the Symmetric Mean Absolute Percentage Error (SMAPE) was employed. The SMAPE can compensate for the above shortcoming because it produces the same values even if the actual and the predicted values change.
In addition, the Mean Bias Error (MBE) and Coefficient of Variation (Cv) were employed according to the criteria of ASHRAE Guideline 14 [49]. These have absolute values like SMAPE, and more objective evaluation is possible because there is a clear standard of ASHRAE Guideline 14. According to the standard, as shown in Table 9, the hourly prediction is specified within MBE of ±10% and Cv of 30%. The MBE has better performance as it goes closer to zero regardless of the sign. However, in this study, the absolute value was taken for the MBE value for intuitiveness and convenience.
Table 10, Table 11 and Table 12 summarize the error analysis results for each case. In each group, the model with the best performance for each error analysis method is shown in bold. From the analyses of the three cases, the prediction performance of Case 2, in which only PM parameters were used, was the lowest and that of Case 3, in which parameters for both the cloud and PM were used, was the best. The analysis of the prediction results of the three groups in each case showed that Group 3, which combined the other two groups to reflect the temporal and spatial characteristics, exhibited the best performance compared with Group 1 and Group 2. In Case 1 and Case 3, the models of Group 2 had better performance than the models of Group 1. Besides, we found that the cloud information extracted from the satellite images has more influence on solar PV generation prediction than the numerical weather information. On the other hand, in the case of the PM, numerical weather text data has a significant impact on solar PV generation prediction compare to satellite images. Among the SVR models for each kernel, the linear kernel model showed the best performance, and overall, the performance was improved in the order of ANN, ARIMAX, SVR_Linear, and DNN. The performance of the DNN model of Group 3 in Case 3 has the best performance in the entire experiments. The DNN and SVR_Linear models satisfied the criteria of the ASHRAE Guideline 14 in all cases and groups. However, all ANN models except for Group 2 in Case 2, Group 1 and 2 in Case 3, and ARIMAX model of Group 1 in Case 3 did not satisfy the criteria.
Table 13 lists the prediction results of the SVR_Linear model, which have the best performance among the SVR models, and those of the ARIMAX, ANN, and DNN models. These graphs are represented by extracting 30 h from the entire test periods. All the nine experimental group models yielded similar observed values, and the DNN models had the best performance.
To further improve the performance of the proposed models, it is necessary to identify and improve the factors that affect the prediction performance. As an impediment factor, the temporal continuity of the temporal characteristics may be interrupted during the removal of the missing values to satisfy the same conditions between each group. Also, there are cases where the clouds disappear naturally and the wind direction is not observed for a long time due to the weakness of the wind. Therefore, there is a need for more precise data interpolation methodologies to complement this. Moreover, the performance of the proposed model could be improved by collecting more data for analysis and optimizing the machine learning techniques used in this study.

5. Conclusions

In this study, a hybrid spatio-temporal prediction model, which combines numerical weather text data and satellite image data, is proposed to develop an accurate solar PV generation prediction model, which is the most popular model in the field of renewable energies. Conventional meteorological data are composed of numerical text data and have a continuity of time, but it is difficult to reflect the effects of spatial characteristics, such as the movement of clouds and PM moving by the wind direction, as raw data in the prediction model. Therefore, numerical weather text data, satellite images, and time series-based solar PV generation data were employed to develop the prediction model. Data from a solar PV power plant located in Incheon, South Korea, was used as the test target, and the entire test was conducted on an hourly basis from January to December 2015. To develop the optimal prediction model, machine learning algorithms and statistical time-series analysis methods were employed, including ARIMAX, SVR_RBF, SVR_Linear, SVR_Poly, ANN, and DNN.
To reflect spatial characteristics in the proposed prediction model, ROI was designated in the satellite image, and then, ROIadj was designated for eight adjacent directions around the ROI. After that, the data extracted from ROIadj and ROI were used to predict the amount of cloud and PM parameters to be used in the prediction model for solar PV generation in the ROI. In addition to the amount of cloud and PM predicted in the ROI, other numerical weather text data and an hour previous solar PV generation data were combined to predict the solar PV generation. The experiment was conducted in three cases and three groups to determine the impact of clouds and PM on the prediction of solar PV generation. Among the different cases, the DNN model of Group 3 in Case 3, one of the hybrid models, yielded MAE of 71.532, RMSE of 92.938, SMAPE of 14.107%, MBE of 4.986%, Cv of 20.633%, indicating the best performance among all the models.
We propose a hybrid spatio-temporal DNN model that reflects the movement of clouds and PM using numerical weather text data and satellite images. From nine different experiments, we found that the spatial characteristics of clouds and PM affect the solar PV generation. Through the deepen data analysis, we confirmed that a proposed prediction model based on both satellite image data of clouds and numerical weather text data of PM improves the performance of solar PV generation prediction. In this study, a hybrid spatio-temporal prediction model supporting a notable performance was developed by utilizing satellite images and numerical weather data in predicting solar PV generation with time-series characteristics. This would be a useful guideline for the development of precise solar PV generation prediction models required to improve the efficient and stable power supply of renewable energy generation systems.

Author Contributions

Conceptualization & Methodology, B.K., D.S.; Writing—original draft, B.K., D.S.; Writing—review & editing, D.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by Korea Electric Power Corporation (grant number R19XO01-04) and results of a study on the “HPC Support” Project, supported by the ‘Ministry of Science and ICT’ and NIPA.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Cullen, D. Climate change. Nature 2011, 479, 267–268. [Google Scholar] [CrossRef]
  2. Horowitz, C.A. Paris Agreement. Int. Leg. Mater. 2016, 55, 740–755. [Google Scholar] [CrossRef]
  3. Ministry of Trade, Industry and Energy. Renewable Energy 3020 Plan. 3020 Plan; Ministry of Trade, Industry and Energy: Sejong, Korea, 2017.
  4. Korea Ministry of Trade, Industry and Energy. Renewable Energy Statistics 2013. 2014. Available online: http://www.motie.go.kr (accessed on 17 December 2017).
  5. Choi, H.; Zhao, W.; Ciobotaru, M.; Agelidis, V.G. Large-scale PV system based on the multiphase isolated DC/DC converter. In Proceedings of the 2012 3rd IEEE International Symposium on Power Electronics for Distributed Generation Systems (PEDG), Aalborg, Denmark, 25–28 June 2012; pp. 801–807. [Google Scholar] [CrossRef]
  6. Javier, M.; Luis, M.; Eduardo, L.; David, A.; Izco, E. Power output fluctuations in large scale PV plants: One year observations with one second resolution anda derived analytic model. Prog. Photovolt. 2011, 19, 218–227. [Google Scholar]
  7. Won, J.-M.; Doe, G.-Y.; Heo, N.-R. Predict Solar Radiation According to Weather Report. J. Korean Navig. Port Res. 2011, 35, 387–392. [Google Scholar] [CrossRef]
  8. Fang, X.; Misra, S.; Xue, G.; Yang, D. Smart Grid—The New and Improved Power Grid: A Survey. IEEE Commun. Surv. Tutor. 2011, 14, 944–980. [Google Scholar] [CrossRef]
  9. Bae, K.Y.; Jang, H.S.; Sung, D.K. Hourly Solar Irradiance Prediction Based on Support Vector Machine and Its Error Analysis. IEEE Trans. Power Syst. 2016, 32, 935–945. [Google Scholar] [CrossRef]
  10. Shi, J.; Lee, W.-J.; Liu, Y.; Yang, Y.; Wang, P. Forecasting Power Output of Photovoltaic Systems Based on Weather Classification and Support Vector Machines. IEEE Trans. Ind. Appl. 2012, 48, 1064–1069. [Google Scholar] [CrossRef]
  11. Yang, H.-T.; Huang, C.-M.; Huang, Y.-C.; Pai, Y.-S. A Weather-Based Hybrid Method for 1-Day Ahead Hourly Forecasting of PV Power Output. IEEE Trans. Sustain. Energy 2014, 5, 917–926. [Google Scholar] [CrossRef]
  12. Lee, D.; Kim, K. Deep Learning Based Prediction Method of Long-term Photovoltaic Power Generation Using Meteorological and Seasonal Information. Soc. e-Bus. Stud. 2019, 24, 1–16. [Google Scholar]
  13. Lee, K.; Kim, W.-J. Forecasting of 24_hours Ahead Photovoltaic Power Output Using Support Vector Regression. J. Korean Inst. Inf. Technol. 2016, 14, 175–183. [Google Scholar] [CrossRef]
  14. Jang, H.S.; Bae, K.Y.; Park, H.-S.; Sung, D.K. Solar Power Prediction Based on Satellite Images and Support Vector Machine. IEEE Trans. Sustain. Energy 2016, 7, 1255–1263. [Google Scholar] [CrossRef]
  15. Jang, H.S.; Bae, K.Y.; Park, H.-S.; Sung, D.K. Effect of aggregation for multi-site photovoltaic (PV) farms. In Proceedings of the 2015 IEEE International Conference on Smart Grid Communications (SmartGridComm), Miami, FL, USA, 2–5 November 2015; pp. 623–628. [Google Scholar] [CrossRef]
  16. Hammer, A.; Heinemann, D.; Lorenz, E.; Lückehe, B. Short-Term Forecasting of Solar Radiation. 1999 ISES Sol. World Congr. 2000, 67, 411–418. [Google Scholar] [CrossRef]
  17. Peng, Z.; Yoo, S.; Yu, D.; Huang, D. Solar irradiance forecast system based on geostationary satellite. In Proceedings of the 2013 IEEE International Conference on Smart Grid Commun. SmartGridComm, Vancouver, BC, Canada, 21–24 October 2013; pp. 708–713. [Google Scholar] [CrossRef]
  18. Kim, I.-J.; Lee, S.-K. A Study on the Design of Testable CAM using MTA Code. Trans. Korean Inst. Electr. Eng. 2019, 106–111. [Google Scholar] [CrossRef]
  19. Zhang, X.; Li, Y.; Lu, S.; Hamann, H.F.; Hodge, B.-M.; Lehman, B. A Solar Time Based Analog Ensemble Method for Regional Solar Power Forecasting. IEEE Trans. Sustain. Energy 2018, 10, 268–279. [Google Scholar] [CrossRef]
  20. Kang, H. An Analysis of the Causes of Fine Dust in Korea Considering Spatial Correlation. Environ. Resour. Econ. Rev. 2019, 28, 327–354. [Google Scholar] [CrossRef]
  21. Peters, I.M.; Karthik, S.; Liu, H.; Buonassisi, T.; Nobre, A. Urban haze and photovoltaics. Energy Environ. Sci. 2018, 11, 3043–3054. [Google Scholar] [CrossRef] [Green Version]
  22. Darwish, Z.A.; Kazem, H.A.; Sopian, K.; Al-Goul, M.; Alawadhi, H. Effect of dust pollutant type on photovoltaic performance. Renew. Sustain. Energy Rev. 2015, 41, 735–744. [Google Scholar] [CrossRef]
  23. Maghami, M.R.; Hizam, H.; Gomes, C.; Hajighorbani, S.; Rezaei, N. Evaluation of the 2013 Southeast Asian Haze on Solar Generation Performance. PLoS ONE 2015, 10, e0135118. [Google Scholar] [CrossRef]
  24. Sarver, T.; Al-Qaraghuli, A.; Kazmerski, L.L. A comprehensive review of the impact of dust on the use of solar energy: History, investigations, results, literature, and mitigation approaches. Renew. Sustain. Energy Rev. 2013, 22, 698–733. [Google Scholar] [CrossRef]
  25. Bergin, M.H.; Ghoroi, C.; Dixit, D.; Schauer, J.J.; Shindell, D. Large Reductions in Solar Energy Production Due to Dust and Particulate Air Pollution. Environ. Sci. Technol. Lett. 2017, 4, 339–344. [Google Scholar] [CrossRef] [Green Version]
  26. El-Shobokshy, M.S.; Hussein, F.M. Degradation of photovoltaic cell performance due to dust deposition on to its surface. Renew. Energy 1993, 3, 585–590. [Google Scholar] [CrossRef]
  27. Korea Meteorolgical Administration. Available online: https://data.kma.go.kr/ (accessed on 22 October 2020).
  28. Air Korea. Available online: https://www.airkorea.or.kr/ (accessed on 22 October 2020).
  29. Open Data Portal. Available online: https://www.data.go.kr/ (accessed on 22 October 2020).
  30. National Meteorological Satellite Center. Available online: https://nmsc.kma.go.kr/ (accessed on 22 October 2020).
  31. National Meteorological Satellite Center. AMV (AMV: Atmospheric Motion Vector) Algorithm Theoretical Basis Document; National Meteorological Satellite Center: Jincheon, Korea, 2012.
  32. National Meteorological Satellite Center. COT Algorithm Theoretical Basis Document; National Meteorological Satellite Center: Jincheon, Korea, 2012.
  33. National Meteorological Satellite Center. AOD Algorithm Theoretical Basis Document; National Meteorological Satellite Center: Jincheon, Korea, 2012.
  34. National Meteorological Satellite Center. INS Algorithm Theoretical Basis Document; National Meteorological Satellite Center: Jincheon, Korea, 2012.
  35. Wolff, B.; Kühnert, J.; Lorenz, E.; Kramer, O.; Heinemann, D. Comparing support vector regression for PV power forecasting to a physical modeling approach using measurement, numerical weather prediction, and cloud motion data. Sol. Energy 2016, 135, 197–208. [Google Scholar] [CrossRef]
  36. Li, L.; Gong, J.; Zhou, J. Spatial Interpolation of Fine Particulate Matter Concentrations Using the Shortest Wind-Field Path Distance. PLoS ONE 2014, 9, e96111. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  37. Newsham, G.R.; Birt, B.J. Building-level occupancy data to improve ARIMA-based electricity use forecasts. In Proceedings of the 2nd ACM Workshop on Embedded Sensing Systems for Energy-Efficiency in Building, Zurich, Switzerland, 3–5 November 2010; pp. 13–18. [Google Scholar] [CrossRef] [Green Version]
  38. Dong-hyun, L.; A-hyun, J.; Jin-young, K.; Chang-Ki, K.; Hyun-goo, K.; Yung-seop, L. Solar Power Generation Forecast Model Using Seasonal ARIMA. Korean Sol. Energy Soc. 2019, 39, 59–66. [Google Scholar]
  39. Paulescu, M.; Badescu, V.; Brabec, M. Tools for PV (photovoltaic) plant operators: Nowcasting of passing clouds. Energy 2013, 54, 104–112. [Google Scholar] [CrossRef]
  40. Alsharif, M.H.; Younes, M.K.; Kim, J. Time Series ARIMA Model for Prediction of Daily and Monthly Average Global Solar Radiation: The Case Study of Seoul, South Korea. Symmetry 2019, 11, 240. [Google Scholar] [CrossRef] [Green Version]
  41. Cortes, C.; Vapnik, V. Support-vector networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar] [CrossRef]
  42. Liu, W.; Liu, C.; Lin, Y.; Ma, L.; Xiong, F.; Li, J. Ultra-Short-Term Forecast of Photovoltaic Output Power under Fog and Haze Weather. Energies 2018, 11, 528. [Google Scholar] [CrossRef] [Green Version]
  43. Kim, K.; Jin, H. Photovoltaic Power Forecasting and Analysis of Forecasting Error for Model Learning Periods Using SVR. In Proceedings of the Korean Institute of Electrical Engineers, Pyeongchang, Korea, 11–13 July 2018; pp. 324–325. [Google Scholar]
  44. McCulloch, W.; Pitts, W. A logical calculus of the ideas immanent in nervous activity. Bull. Math. Biol. 1990, 52, 99–115. [Google Scholar] [CrossRef]
  45. Saberian, A.; Hizam, H.; Radzi, M.A.M.; Ab Kadir, M.Z.A.; Mirzaei, M. Modelling and Prediction of Photovoltaic Power Output Using Artificial Neural Networks. Int. J. Photoenergy 2014, 2014, 1–10. [Google Scholar] [CrossRef] [Green Version]
  46. Teoh, E.J.; Tan, K.C.; Xiang, C. Estimating the Number of Hidden Neurons in a Feedforward Network Using the Singular Value Decomposition. IEEE Trans. Neural Netw. 2006, 17, 1623–1629. [Google Scholar] [CrossRef]
  47. Liu, W.; Wang, Z.; Liu, X.; Zeng, N.; Liu, Y.; Alsaadi, F.E. A survey of deep neural network architectures and their applications. Neurocomputing 2017, 234, 11–26. [Google Scholar] [CrossRef]
  48. Oh, I.-S. Machine Learning; HANBIT Academy, Inc.: Seoul, Korea, 2017. [Google Scholar]
  49. ANSI/ASHRAE. ASHRAE Guideline 14-2002 Measurement of Energy and Demand Savings. ASHRAE 2002, 8400, 170. [Google Scholar]
Figure 1. Research framework.
Figure 1. Research framework.
Remotesensing 12 03706 g001
Figure 2. Four types of satellite images: (a) AMV Image; (b) Cloud Optical Thickness Image; (c) Aerosol Optical depth Image; (d) Insolation Image.
Figure 2. Four types of satellite images: (a) AMV Image; (b) Cloud Optical Thickness Image; (c) Aerosol Optical depth Image; (d) Insolation Image.
Remotesensing 12 03706 g002aRemotesensing 12 03706 g002b
Figure 3. An exemplary satellite image of AMV with ROI.
Figure 3. An exemplary satellite image of AMV with ROI.
Remotesensing 12 03706 g003
Figure 4. A standard station model for wind direction and speed with the location of each point.
Figure 4. A standard station model for wind direction and speed with the location of each point.
Remotesensing 12 03706 g004
Figure 5. An exemplary satellite image of the Cloud Optical Thickness with sub ROI.
Figure 5. An exemplary satellite image of the Cloud Optical Thickness with sub ROI.
Remotesensing 12 03706 g005
Figure 6. Continuity of satellite images with the change in time.
Figure 6. Continuity of satellite images with the change in time.
Remotesensing 12 03706 g006
Figure 7. Structure of the DNN model.
Figure 7. Structure of the DNN model.
Remotesensing 12 03706 g007
Table 1. The sample of the numerical dataset.
Table 1. The sample of the numerical dataset.
DateTemperature
[°C]
Precipitation
[mm]
Wind Speed
[m/s]
Wind
Direction
[0–360 degree]
Humidity
[%]
Amount
of
Sunshine
[hr]
Irradiance
[MJ/m]
Cloudiness
[0–10 level]
Visibility
[10m]
SO2
[ppm]
CO
[μg/m2]
O3
[ppm]
NO2
[ppm]
PM10
[μg/m2]
PM2.5
[μg/m2]
PV
[kW]
1 January 2015
09:00:00
−8.406.7340560.80.21020000.0060.50.0170.0121453360
1 January 2015
10:00:00
−8.106.1226540.00.67120000.0060.50.0190.0111734374
1 January 2015
11:00:00
−7.606.1340530.01.1120000.0060.60.0190.019833638
1 January 2015
12:00:00
−6.906.4340520.01.41120000.0060.60.0210.019030784
1 January 2015
12:00:00
−6.106.4340530.01.53120000.0060.60.0230.018527842
.................
.................
.................
31 December 2015
13:00:00
2.90.03.3360741.01.3848000.0111.20.0130.0427866230
31 December 2015
14:00:00
3.30.03.1360751.01.2438000.0111.20.0230.03210072310
31 December 2015
15:00:00
3.10.03.4340771.00.9338000.0111.20.0240.0348767439
31 December 2015
16:00:00
3.30.03.2340771.00.6729000.0091.20.0240.0359068303
31 December 2015
17:00:00
2.90.02.1320781.00.2607000.0091.20.0170.047836595
Table 2. Basic performance data of COMS.
Table 2. Basic performance data of COMS.
ChannelCenter Wavelength (μm)Wavelength Band (μm)Spatial Resolution (km)
Visible0.670.55~0.81
Shortwave Infrared3.73.5~4.04
Water vapor6.76.5~7.04
Infrared 110.810.3~11.34
Infrared 212.011.5~12.54
Table 3. The example of the cloud data sets.
Table 3. The example of the cloud data sets.
DateWind
Direction
ClearPartly CloudyMostly CloudyCloudy
18 January 2015 11:00:00W0000
18 January 2015 12:00:00W160811460
18 January 2015 13:00:00SW1147935310
18 January 2015 14:00:00W0000
2015-08-02 08:00:00SW0000
2015-08-02 09:00:00W1756361656
2015-08-02 10:00:00W23833222235
2015-08-02 11:00:00W525913640
Table 4. The example of the PM data sets.
Table 4. The example of the PM data sets.
DateWind
Direction
GoodModerateUnhealthyVery Unhealthy
18 January 2015 11:00:00W2713111957
18 January 2015 12:00:00W08820478
18 January 2015 13:00:00SW043020
18 January 2015 14:00:00W0600
2 August 2015 08:00:00SW61441218
2 August 2015 09:00:00W012129
2 August 2015 10:00:00W096239
2 August 2015 11:00:00W00018
Table 5. The result of the discriminant for movement of clouds by wind direction.
Table 5. The result of the discriminant for movement of clouds by wind direction.
CloudClearPartly CloudyMostly CloudyCloudy
Accuracy (%)75.06875.79384.04491.296
Table 6. The result of discriminant for movement of PM by wind direction.
Table 6. The result of discriminant for movement of PM by wind direction.
PMGoodModerateUnhealthyVery Unhealthy
Accuracy (%)87.48985.58589.66593.382
Table 7. The structure of the DNN.
Table 7. The structure of the DNN.
Number of Hidden Layer1234567
Number of Nodes1800.41000.41000.41
Activation FunctiontanhDrop outReluDrop OutSigmoidDrop outSigmoid
Table 8. Input parameters by each group and case.
Table 8. Input parameters by each group and case.
Group 1
(Numerical Text
Weather Data)
Group 2
(Satellite Images)
Group 3
(Mixed, G1 + G2)
Common
Parameters
Month, Day, Time, PV (previous data)
Case 1 (Cloud)Temperature, Precipitation, Wind Speed, Wind Direction, Humidity, Amount of Sunshine, Irradiance, Cloudiness, VisibilityWind Speed, Wind Direction, Clear, Partly cloudy, Mostly cloudy, Cloudy, IrradianceTemperature, Precipitation, Wind Speed, Wind Direction, Humidity, Amount of Sunshine, Irradiance, Clear, Partly cloudy, Mostly cloudy, Cloudy, Visibility
Case 2 (PM)Temperature, Precipitation, Wind Speed, Wind Direction, Humidity, Amount of Sunshine, Irradiance, SO2, CO, O3, NO2, PM10, PM2.5, VisibilityWind Speed, Wind Direction, PM_Good, PM_Moderate, PM_Unhealthy, PM_Very Unhealthy, IrradianceTemperature, Precipitation, Wind Speed, Wind Direction, Humidity, Amount of Sunshine, Irradiance, PM_Good, PM_Moderate, PM_Unhealthy, PM_Very Unhealthy, Visibility
Case 3 (Cloud + PM)Temperature, Precipitation, Wind Speed, Wind Direction, Humidity, Amount of Sunshine, Irradiance, SO2, CO, O3, NO2, PM10, PM2.5, Cloudiness, VisibilityWind Speed, Wind Direction, Clear, Partly cloudy, Mostly cloudy, Cloudy, PM_Good, PM_Moderate, PM_Unhealthy, PM_Very Unhealthy, IrradianceTemperature, Precipitation, Wind Speed, Wind Direction, Humidity, Amount of Sunshine, Irradiance, Clear, Partly cloudy, Mostly cloudy, Cloudy, PM_Good, PM_Moderate, PM_Unhealthy, PM_Very Unhealthy, Visibility
Output ParameterPV (One hour ahead)
Table 9. Acceptable range of ASHRAE Guideline 14.
Table 9. Acceptable range of ASHRAE Guideline 14.
Calibration TypeIndexAcceptable Value
MonthlyMBE month± 5%
Cv (RMSE) month15%
HourlyMBE hour±10%
Cv (RMSE) hour30%
Table 10. Solar PV generation prediction result of case 1.
Table 10. Solar PV generation prediction result of case 1.
GroupErrorARIMIXSVR_RBFSVR_LinearSVR_PolyANNDNN
Group 1
(Numerical Text Data)
MAE81.26190.68681.112102.638122.28572.554
RMSE101.768111.289101.63128.766149.35498.519
SMAPE17.84517.11417.9121.1619.70614.197
MBE0.0170.3940.5260.50423.9580.956
Cv22.59324.70622.56228.58733.15721.872
Group 2
(Satellite Images)
MAE79.186101.17881.147180.10792.73873.496
RMSE101.712124.462103.107232.476117.63595.934
SMAPE15.63518.17816.5128.44716.78614.484
MBE0.0930.0520.45512.66613.1713.142
Cv22.5827.63122.8951.6126.11521.298
Group 3
(Mixed,
G1 + G2)
MAE78.14388.86878.88485.33992.32367.531
RMSE101.496109.402100.88116.526114.27493.642
SMAPE17.79216.84117.61316.5817.85715.039
MBE0.060.5920.143.8912.216.013
Cv22.53224.28822.39625.86925.36920.789
Table 11. Solar PV generation prediction result of case 2.
Table 11. Solar PV generation prediction result of case 2.
GroupErrorARIMIXSVR_RBFSVR_LinearSVR_PolyANNDNN
Group 1
(Numerical Text Data)
MAE79.75394.81283.15276.80886.4475.432
RMSE100.344115.05104.31997.71106.32695.944
SMAPE17.37717.4171816.24119.12414.439
MBE0.2651.81.3011.58510.8073.661
Cv22.27725.54123.15921.69223.60521.3
Group 2
(Satellite Images)
MAE78.3396.578.103190.97382.3475.727
RMSE102.158118.67999.672255.323105.741103.094
SMAPE16.68317.75717.01728.40315.48915.202
MBE0.1061.6650.00918.5863.510.303
Cv22.67926.34722.12756.68223.47522.887
Group 3
(Mixed,
G1 + G2)
MAE78.89590.66380.28277.88293.97471.295
RMSE102.432110.424102.413104.554115.70396.355
SMAPE17.63116.95718.22915.21717.82914.257
MBE0.1950.4930.4821.20712.4812.156
Cv22.7424.51422.73623.21125.68621.391
Table 12. Solar PV generation prediction result of case 3.
Table 12. Solar PV generation prediction result of case 3.
GroupErrorARIMIXSVR_RBFSVR_LinearSVR_PolyANNDNN
Group 1
(Numerical Text Data)
MAE196.12990.09681.15479.12895.9779
RMSE235.048111.22102.268100.929119.607104.059
SMAPE31.62617.04117.59616.4920.80815.123
MBE11.8162.0760.1711.2196.6198.376
Cv52.18124.69122.70422.40626.55323.101
Group 2
(Satellite Images)
MAE78.44493.15581.448126.29982.91177.713
RMSE100.831115.8223102.696169.343107.843102.005
SMAPE15.78317.53917.0421.65716.0814.815
MBE0.1460.8830.7444.9158.3215.343
Cv22.38525.71322.79937.60423.94122.646
Group 3
(Mixed,
G1 + G2)
MAE77.55487.58780.12383.163101.23371.532
RMSE101.943108.229102.889107.278122.50892.938
SMAPE17.66916.6617.9117.37318.77614.107
MBE0.3420.4980.0390.13416.5724.986
Cv22.63224.02722.84223.81627.19720.633
Table 13. Solar PV generation prediction result of each group.
Table 13. Solar PV generation prediction result of each group.
Case 1 (Cloud Only)Case 2 (PM Only)Case 3 (Using Together)
Group 1
(KMA)
Remotesensing 12 03706 i001 Remotesensing 12 03706 i002 Remotesensing 12 03706 i003
Group 2
(Satellite)
Remotesensing 12 03706 i004 Remotesensing 12 03706 i005 Remotesensing 12 03706 i006
Group 3
(Mix)
Remotesensing 12 03706 i007 Remotesensing 12 03706 i008 Remotesensing 12 03706 i009
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Kim, B.; Suh, D. A Hybrid Spatio-Temporal Prediction Model for Solar Photovoltaic Generation Using Numerical Weather Data and Satellite Images. Remote Sens. 2020, 12, 3706. https://doi.org/10.3390/rs12223706

AMA Style

Kim B, Suh D. A Hybrid Spatio-Temporal Prediction Model for Solar Photovoltaic Generation Using Numerical Weather Data and Satellite Images. Remote Sensing. 2020; 12(22):3706. https://doi.org/10.3390/rs12223706

Chicago/Turabian Style

Kim, Bowoo, and Dongjun Suh. 2020. "A Hybrid Spatio-Temporal Prediction Model for Solar Photovoltaic Generation Using Numerical Weather Data and Satellite Images" Remote Sensing 12, no. 22: 3706. https://doi.org/10.3390/rs12223706

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop