Effect of Prediction Error of Machine Learning Schemes on Photovoltaic Power Trading Based on Energy Storage Systems

Photovoltaic (PV) output power inherently exhibits an intermittent property depending on the variation of weather conditions. Since PV power producers may be charged to large penalties in forthcoming energy markets due to the uncertainty of PV power generation, they need a more accurate PV power prediction scheme in energy market operation. In this paper, we characterize the effect of PV power prediction errors on energy storage system (ESS)-based PV power trading in energy markets. First, we analyze the prediction accuracy of two machine learning (ML) schemes for the PV output power and estimate their error distributions. We propose an efficient ESS management scheme for charging and discharging operation of ESS in order to reduce the deviations between the day-ahead (DA) and real-time (RT) dispatch in energy markets. In addition, we estimate the capacity of ESSs, which can absorb the prediction errors and then compare the PV power producer’s profit according to ML-based prediction schemes with/without ESS. In case of ML-based prediction schemes with ESS, the ANN and SVM schemes yield a decrease in the deviation penalty by up to 87% and 74%, respectively, compared with the profit of those schemes without ESS.


Introduction
In recent years, the world has faced a urgent climate change problem and a depletion problem of fossil fuels.The portion of renewable energy sources (RESs) has gradually increased instead of fossil fuel based generators in many countries.We expect that worldwide penetration of solar power rapidly increase until 2030 since it is manageable to build, operate, and maintain PV systems [1].Utility-scale photovoltaic (PV) plants have been rapidly installed in many countries.For example, China, Germany, and the USA have tens of utility-scale PV farms.In addition, China, Japan and European countries have operated a feed-in tariff (FIT) policy to increase investment in PV industry, while the USA and South Korea adopted a renewable portfolio standard (RPS) in order to spread PV systems [2].
However, PV output power generally exhibits an 'intermittent' output characteristic depending on weather conditions [3].Due to the poor predictability, PV systems may experience a power imbalance during those operation.Therefore, it is difficult for MW-scale PV power producers to participate in energy markets due to the prediction errors.Furthermore, this intermittent property may decrease the economical profit for utility-scale PV power producers since higher capacity of energy storage systems (ESSs) is required to absorb the prediction errors of PV power generation and avoid the variability of utility.
Integration of RES output power in the power system would highly depend on accurate forecasts [4,5].Therefore, a number of studies on the prediction of solar irradiance or PV output power have been found in literature.The persistence scheme may be effective only for its use in short-term forecast (e.g., intra-hour) applications [6].Cloud motion vector (CMV) by total sky imagers [7] and satellite images [8] were utilized to analyze cloud movement and predict short-term irradiance at the ground level.For longer time horizons (hours to days), numerical weather prediction (NWP) schemes have been applied [9,10].A number of time series prediction schemes such as autoregressive moving average (ARMA) [11], autoregressive integrated moving average (ARIMA) [12], and autoregressive moving average with exogenous inputs (ARMAX) [13] have also been used to design the prediction model based on stochastic characteristics of solar irradiance.In addition, a number of prediction studies have developed nonlinear solar or PV output power prediction schemes based on machine learning (ML).In specific, artificial neural networks (ANN) [14], support vector machine (SVM) [15,16], and hybrid schemes [17] have been utilized in prediction models.
The Federal Energy Regulatory Commission (FREC) issued the Order 764, which aims to remove a barrier for the integration of renewable energy resources into the power grid [18].In the existing energy market, the FERC noted that the suppliers of renewable energy resources should pay for imbalance penalties for any difference between their settled energy and actual delivery in that hour.In addition, the suppliers of renewable energy resources need to provide bids in RT markets, based on forecasts and then follow dispatch instructions in the current Independent System Operator (ISO)/Regional Transmission Organization (RTO) markets [19].New York (NYISO), PJM Interconnection (PJM), Electric Reliability Council of Texas (ERCOT), and California ISO (CAISO) have the market policy of the imbalance settlements and deviation penalties.If renewable generation is a capacity resource, it must bid in DA market and RT market.Also, a deviation penalty applies.However, the previous PV power prediction studies did not consider the energy market for PV power trading.Few studies dealt with energy markets, which are regulation markets [20,21] and locational marginal price (LMP) markets [4,22] for wind power trading.
ESSs play an important role in integrating RES into power grid.However, there are only few studies considering the contributions of the renewable prediction schemes based on the economic analysis such as an assessment of the profit for the PV power producers.The power capacity of ESSs was calculated to compensate the wind power generation [23].However, the operational environment of the ESS is different for absorbing the deviation due to the prediction errors between the wind and PV systems.For example, ESSs for the PV system can be operated at night and day time from the power grid in order to prevent an unnecessary increase in the energy capacity.The capacity of ESSs was calculated to reduce the power fluctuation of PV output power in [24].However, the authors did not consider the uncertainty of the prediction schemes.As a result, the previous studies on the ESS operation strategy mainly focused on cost minimization of a given system under a Time-of-Use (ToU) pricing or real-time pricing [25,26].It is evident that the prior studies often neglected the uncertainty of forecasting PV output power.Therefore, it should be considered to provide the accurate operation for various operational conditions.In this paper, by using the prediction scheme, we propose an efficient ESS management scheme in order to estimate the accurate deviation penalties in energy markets.
In this paper, we estimate the effect of the prediction errors on energy storage system (ESS)-based PV power trading as shown in Figure 1.First, we evaluate the performance of PV power prediction models with the ANN and SVM schemes, and their prediction errors are fitted by a t location-scale distribution in order to describe a characteristic of heavy tails.And then, we characterize the ESS role in forthcoming energy markets and propose an efficient ESS management scheme for charging and discharging operation of ESS in order to reduce the deviation penalties from differences between day-ahead (DA) schedule and real-time (RT) supply in energy markets.As a case study, we estimate the capacity of ESSs from different energy market constraints in terms of a tolerance limit and a penalty factor.We calculate the PV power producer's profit considering the deviation penalties in case of ML-based prediction schemes with/without ESS in order to quantitatively estimate the effect of ESS for different prediction accuracies of two ML-based prediction schemes.As a result, we analyze the benefit from the accurate prediction scheme with ESS management as a case study on the participation of a utility-scale PV farm at a locational marginal price (LMP) market in the United States.Furthermore, we study the economic analysis of ESS installation for using both the ANN and SVM schemes through the benefit cost analysis (BCA).

Economic Analysis Efficient ESS Management
Charging and discharging operation of ESS under energy market constraints

ESS Capacity Estimation
Assessment of PV power producer's profit Some studies have investigated the economic analysis of PV systems based on the installation cost analysis [26].And, few studies have considered the uncertainty of the renewable power prediction for renewable power trading.A persistence prediction scheme was utilized in order to estimate the profit for wind energy trading [22].But, they did not consider ML-based prediction schemes.In addition, the distributions of PV prediction errors is slightly different from that of wind power prediction errors.Therefore, it is valuable to apply the accurate prediction scheme based on ML in PV power trading.Also, more quantitative analysis is required for analyzing the effect of the prediction accuracy for PV power trading in detail.The contributions of this paper can be summarized as follows: (1) We estimate the accuracy of two ML-based prediction schemes and characterize their error distributions in order to quantitatively estimate the effect of the prediction accuracy for PV power trading; (2) The capacity of ESSs, which can reduce cost associated with PV prediction errors for PV power trading, is estimated from different market parameters; (3) In case of ML-based prediction schemes with/without ESS, the benefit from the accurate prediction scheme with ESS management is analyzed as a case study; (4) The economic analysis of ESS installation for two ML-based prediction schemes is done through the BCA.
The rest of this paper is organized as follows.We briefly introduce two ML-based prediction schemes for PV output power in Section 2. In Section 3, we estimate the prediction accuracy of two ML-based prediction schemes and their error distributions for PV output power prediction.In Section 4, we characterize the ESS role and describe the participation of PV producers in forthcoming energy markets.And then, we determine the efficient ESS management, which consists of the operation and sizing of ESS for PV power trading in forthcoming energy markets.In Section 5, we analyze the benefits from the accurate prediction scheme with ESS management as a case study.Finally, we give conclusive remarks in Section 6.

Artificial Neural Networks
Artificial neural networks (ANNs), which resemble a human brain, have been extensively applied to uncertain problems.They have been widely used as an alternative method in engineering analysis and prediction such as computer vision and speech recognition [27].ANNs do not require any detailed information from the system.Instead, ANNs learn the relationship between the input parameters.The controlled and uncontrolled variables by characterizing the collected data and may depend on a large number of input parameters [28].
ANNs are typically specified using the architecture, activity rules, and learning rules [29].Figure 2 shows a structure of a feed-forward ANN architecture.The ANN network has an input layer, hidden layers and an output layer.The j-th node in the hidden layer, z j is expressed as where g(a) is an activation function, such as unit step, sigmoid, and hyperbolic tangent functions.x k is the k-th input value and w k,j is the weight between the k-th input node and the j-th hidden node.b is a bias value.Furthermore, The value of the output node, y is calculated as where v j is the weight between the j-th hidden node and the output node.c is another bias value.We here use an ANN to model the complex relationship between various meteorological elements and PV output power in order to improve the prediction performance.The ANN model has two-layered feed-forward network with sigmoid functions, which is defined as g(a) = 1/(1 + e −a ) and linear output neurons.We use 20 neurons in the hidden layer and train the network with the Levenberg-Marquardt back propagation algorithm [30].We finally obtain the predicted value ŷi = f (x i ) where ŷi is the i-th predicted value and x i is the i-th input column vector.

Support Vector Machine
Support vector machine (SVM) is one of high-performance machine learning (ML) schemes, which was proposed by Cortes and Vapnik [31].SVM is designed to minimize the error and maximize the separation margin among multiple classes [32].In particular, SVM can be used to train solar PV output power and predict their output in advance.It has been proved that a better learning capability and smaller prediction errors can be achieved than other methods by using a hyperplane trick [16].
We define a observation sample set of N input and output data as where w is the unit normal vector, x i is the input vector, and b is the distance from the origin to the hyperplane.
The main idea of typical SVM is to map the input vector x i into higher-order feature spaces by using a non-linear kernel function, and performs linear regression in the higher-order feature space as follows [33].
subject to where ζ i and M represent a slack variable and the number data for training stage, respectively.C is a positive constant value for regularization.The mapping function of ϕ(x i ) can be replaced by some special kernels K(x i , x j ).We utilize a radial basis function (RBF) as a kernel function in this study.
The RBF kernel is represented as follows: where δ represents the dilation parameter determining the width of the kernal.In addition, we utilize the LibSVM software [34] in order to develop the SVM scheme.

Data for PV Output Power Prediction
We have collected actual meteorological data for 26 months (January 2012 to April 2014) from the Korea Meteorological Administration (KMA) in Daejeon (36.37 • N, 127.37 • E) to develop the prediction models [35].The sampling interval is one hour.The sunshine duration is set from 7:00 A.M. to 6:00 P.M. Table 1 summarizes the various meteorological parameters.Specifically, cloud cover refers to the fraction of the sky obscured by clouds at a particular location with a range of 0 to 10.When 70% of the area was occupied by clouds the cloud cover value becomes 7. Also, there are 10 types of clouds in the measured data from the weather station in Daejeon.Those data is observed by the weather satellite, Chollian, which is independent weather observation system from space.
The solar irradiance at the i-th time sequence is set to a single output, and the other parameters at the i-th time sequence are configured as an input vector.We utilize 80% and 20% of the entire data for the training process and the test step, respectively.In this paper, we utilized the measured meteorological data from the weather station instead of the forecasted meteorological data in both stages due to the difficulty of collecting the forecasted meteorological data.

Prediction Results for PV Output Power
We convert the irradiance into PV output power based on the prediction data in [15].In this paper, we assume that the PV output power is linearly proportional to the irradiance without considering the module temperature.We do not consider some conversion errors in this paper.Therefore, the PV output power of the i-th time, P i is estimated as 1000 without consideration of temperature, where C int is a given PV system capacity.
Figure 3 shows the PV prediction results of the ANN and SVM schemes and the actual PV power values.The prediction horizon is one-hour ahead in this results.At least one-hour ahead prediction is required because PV power producers can submit re-bids at one-hour ahead of the operating hour in the PV power trading [19].Both schemes show lower prediction accuracy for cloudy days, compared with sunny days.In addition, we can observe that SVM scheme has lower errors than the ANN scheme in both sunny and cloudy days.The weather based data clustering enhances the accuracy of the PV power prediction for severe intermittency [15].But, we did not consider the weather classification in this paper.In order to evaluate the prediction accuracy, Root Mean Square Error (RMSE), R 2 (coefficient of determination) are used, i.e.,

Time [h]
where y i is the measured value at each time sequence i and f (x i ) is the prediction result at each time sequence i. (N − M) denotes the total number of data for the test.
The numerical result of R 2 of SVM prediction scheme are 58.72[W/m 2 ], 0.9562, while that of the ANN scheme are 71.41[W/m 2 ], 0.9264, respectively.

Estimation of Prediction Error Distribution
The t location-scale distribution has been utilized for fitting data distributions with heavier tails, compared with the normal distribution [36].The probability density function (PDF) of the t location-scale distribution is expressed as where Γ(•), µ, σ, and ν represent the gamma function, the location parameter, the scale parameter, and the shape parameter, respectively.The distribution is parameterized with mean and variance of µ and σ 2 ν/(ν − 2), respectively.In this paper, ε represents the prediction error expressed as a ratio of the installed PV capacity in [p.u.].
In order to measure the tail weight and the peakedness of the probability distribution the kurtosis κ value can be utilized [37].In particular, the kurtosis with zero mean random variable ε is defined as follows: where E and s represents the expectation operator and the standard deviation.
In general, a higher kurtosis distribution has heavier tails, while a lower kurtosis distribution has thinner tails.If κ value is greater than 3, the distribution is expressed as leptokurtic or fat-tailed with tails that approach zero more slowly than the normal distribution (c.f. the kurtosis of the normal distribution is 3).
In order to obtain the PDF of the prediction errors, we first calculate κ values of three prediction schemes.As a result, κ values of the ANN and SVM schemes are obtained as 4.59 and 6.11, respectively.We observe that the PV output prediction errors of each prediction scheme are not precisely fitted with a normal distribution because those κ values are in the rage between 3.9 and 7.5, which is heavier-tailed or leptokurtic.Hence, we approximately model the PV power prediction errors of ML-based prediction schemes with a t location-scale distribution, which can well fit heavier tail distributions [15].
Figure 4 shows the histograms of the PV output prediction errors and PDF fits of the ANN and SVM schemes.We observe that the prediction errors of the ANN and SVM schemes are well fitted with a t location-scale distribution.The prediction errors of the ANN scheme are accurately fitted in all regions, while the SVM scheme shows fitting errors nearby 0 region due to a higher probability density of small errors.

The Importance of the ESS Role for PV Power Trading
In a utility-scale PV farm, the ESS consists of multiple identical batteries considering the maximum capacity of a PV system [38].The ESS can discharge when the actual PV output power is less than the predicted PV output power in a given system.On the other hand, when the predicted PV output power is larger than the actual PV output power, the ESS can charge the energy in order to absorb an excessive amount of power generation for the stability of power grid.Accordingly, as the PV output power rapidly increases in power grid, energy storage systems (ESSs) are a potential solution to make the PV output power more dispatchable.
However, a large amount of ESS might be required to make the PV output power dispatchable, as shown in the conventional generator [4].Furthermore, the prediction errors may cause a poor economical profit for utility-scale PV power producers because the installation cost will be paid to higher capacity of ESSs for absorbing the prediction errors of a given PV system.Thus, it is essential to manage the ESS-based on the accurate prediction schemes for PV power trading in energy markets.
In Section 4.2 and Section 4.3, we describe the participation of PV producers for PV power trading in forthcoming energy markets and estimate the deviation penalties in those energy markets.And then, considering those energy markets, we determine the efficient ESS management, which consists of the operation and sizing of ESS for PV power trading in Section 4.4.

Participation of PV Producers in LMP Markets
Typical independent system operators (ISOs) and transmission organizations (RTOs) utilize a two-settlement system for market clearing [39].The day-ahead (DA) schedule is settled at the DA LMP, and the deviation between the DA schedule and real-time (RT) delivery is settled by the RT LMP.Deviation penalty apply for the difference between the RT dispatch and advance schedules in LMP markets [22].FREC issued the Order 764 in 2012, which aims to remove a barrier for the integration of renewable energy resources including the the renewable power generation into the power grid [18].
The FERC noted that the renewable energy producers pay for deviation penalties for any imbalance between their schedule and actual delivery in that hour [40].
Figure 5 shows a typical timeline of the market operation [19].The market operations consist of a determination of reserve requirements, day-ahead operations, a reliability assessment commitment, and real-time operations.Therefore, PV power producers must summit bids into the DA and RT market based on their predictions and follow dispatch instructions in current ISO/RTO markets.Deviation penalties have recently been charged to the deviation between their scheduled energy and actual dispatch in real-time operations.The ISO/RTO's need to reduce such deviations in order to save the cost for the balancing reserves in RT delivery [19].The deviation penalty is based on the imbalance between the DA schedule and RT delivery.A deviation penalty is proportional to the absolute deviation between the DA schedule and RT delivery [22].Energy market profit of the PV power producer for a given hour without considering operation costs is calculated as where p DA,h , p RT,h , q d,h , q DA,h , and pen DA denote the DA LMP [$/MWh], RT LMP [$/MWh], acutal RT delivery [MW], a quantity bid into DA market [MW], and penalty for deviation [$/MWh] for a given hour, h, of PV power delivery, respectively.The PV power producer needs to determine the amount of bidding into the DA market, q DA,h .And the PV power producer's energy market profit in Equation ( 11) can be re-formulated as follows: where the first part (p DA,h • q d,h ) is the profit when the actual PV power generation is settled at the DA LMP.When a perfect prediction is achieved, the profit is only determined by this part into the DA market.The second and third parts represent the consequence of a deviation between the DA schedule and RT delivery.Therefore, those parts are decided by the difference between the RT and DA LMPs and the deviation penalty.

Estimation of Deviation Penalties
D h denotes the imbalance between the DA schedule and RT delivery and the penalty for the deviation at a given hour h can be expressed as pen DA = α × p RT,h , where α is the penalty factor for PV power trading in Equation (12).When the penalty factor is 0.7, deviation penalties were paid to pay 70% of the RT LMP of the energy shortage and the energy excess.Then, the deviation penalties incurred by the market participant in n hours can be expressed as Most of the RTO/ISOs allow a tolerance band, which is the no-penalty interval for deviations [41].Therefore, we assume that the deviation penalties are imposed for injecting energy outside the tolerance band.P tol represents the range of the tolerance band defined a fraction of the installed PV capacity in [%].P ESS represents the power capacity defined as a fraction of the capacity of a given PV system in [%].The PV power producer would be penalized according to Equation ( 13) when the absolute prediction error |ε| exceeds the allowable limit P (|ε| > P tol + η p × P ESS = P ) where η p denotes the efficiency of the power conversion system (PCS) of the ESS [4].
Figure 6 shows three different cases of positive deviations between the DA schedule and RT delivery for charging the ESS.In the first time interval, the prediction error does not exceed the allowable limit P = P tol + η p × P ESS (zero deviation).The power output for charging the ESS is P 1 in this time interval.In the second time interval, the PV power producer is penalized according to the deviation of the second time interval, D 2 , because the prediction error exceeds the allowable limit P (positive deviation) and the power output for charging the ESS is equal to η p × P ESS .The prediction error is within the tolerance band limit in the third time interval (zero deviation).In addition, the ESS does not operate in this case.There exist similar cases for the negative prediction errors for discharging the ESS, as shown in Figure 7.In the real world, PV power producers have more interest in the expected hourly deviation penalties, not an instant deviation penalty for each given hour.Dealing with the deviation (D h ) and the RT LMP (p RT,h ) for a given hour h as independent random variables, the expected hourly deviation penalties could be written as From the PDF h(ε) of the PV power prediction errors ε, E(|D h |) can be calculated as [4] In Equation ( 15), The first term represents the expected deviation for the positive prediction errors and the second term represents the expected deviation for the negative prediction errors.

Operation and Sizing of Energy Storage Systems
The stored energy in the ESS is utilized as a state variable and the power output of the ESS can be estimated as three cases for both charging and discahrging operations.All scheduling intervals used here are one hour (h).
For charging operation (positive prediction errors), the power output of the ESS is expressed as follows.
where ε i represents the prediction error at the i-th time sequence in [p.u.].
Similarly, the power output of the ESS for discharging operation (negative prediction errors) can be expressed as In order to absorb the prediction errors as shown in Equations ( 16) and ( 17), the optimum size of energy capacity of ESS, which is denoted by E ESS are calculated as the system reserve.We set the initial state-of-charge (SoC) of ESS to Q I [%], which can be determined as the median of the minimum and maximum state of the ESS.The optimum energy capacity of ESS (E ESS ) is calculated by finding the maximum absolute value of the total sum of P i in Equations ( 16) and ( 17) within scheduling time intervals.E ESS can be formulated as: where η E represents the round-trip energy efficiency, and η DoD represents the depth-of-discharge (DoD) of the ESS.In addition, i, N, and M represents the time sequence, the total number of data samples, and the number of data samples for the training stage, respectively.In Equation (18), we should multiply the maximum absolute value of the total sum of P i by two in order to consider both the maximum positive and negative value of the total sum of P i because the existing market rules do not permit the PV power producers to adjust their power within the short time period (e.g., one-hour) to avoid deviation penalty [40].E ESS represents the energy capacity of ESS defined as a fraction of the installed PV capacity in [p.u.].

Case Study
We calculate the PV power producer's profit by considering the deviation penalties with/without ESS in order to quantitatively estimate the effect of ESS for different prediction accuracies of two ML-based prediction schemes.In addition, we evaluate the PV power trading in energy market for a utility-scale PV farm.We assume that each prediction scheme is applied to a 30-MW PV system with a lithium-ion ESS to absorb prediction errors.The power efficiency (η P ) of the PCS is approximately 95%, and the roundtrip energy efficiency (η E ) of large-scale lithium-ion battery is approximately 85% [42].Since the occurrence probability of both the positive and negative errors are the same (µ = 0), the ESS should maintain 50% of a state-of-charge (SoC) before 7:00 A.M. (Q I = 50%) in order to prevent an unnecessary increase in E ESS .In addition, we assume that the available SoC range is located between 10% and 90% for in order to guarantee the maximum life cycle of the lithium-ion battery [43].The ESS never goes out the available SoC region during the total operating duration.Finally, we estimate the optimum energy capacity of ESS for the ANN and SVM schemes by considering three different power capacities of ESSs, i.e., 5%, 10%, and 15% of the installed PV capacity.
Then, we evaluate the performance of the different prediction schemes by using historical data for DA and RT LMP from the PJM over a 12-month in 2014 [44].The average DA LMP and RT LMP in 2014 are $53.76/MWhand $52.72/MWh, respectively.In addition, the DA bids due is 12 a.m., and we assume that PV power producers can submit RT re-bids at 1-hour ahead of the operating hour.

ESS Sizing for Machine Learning Prediction Schemes
In this section, three different power capacities of the ESS, P ESS , are considered for absorbing the prediction errors.Figure 8 shows the operation of the PCS of the ESS, which utilizes the ANN scheme for varying time intervals of day time (7:00 A.M. to 6:00 P.M.) with a 5% tolerance limit of the installed PV capacity.The power output of the PCS of the ESS is obtained in a straightforward calculation from Equations ( 16) and (17).The ESS power is very frequently charged or discharged in order to reduce the prediction error for the ANN scheme.In addition, it can be inferred that the deviation penalties of the small-sized PCS (5% of the installed PV capacity) would be significantly larger than that of the large-sized PCS (15% of the installed PV capacity) because a large amount of prediction errors is still not absorbed by the ESS in case of 5% of the installed PV capacity.Similarly, the power output of the PCS for the SVM scheme is shown in Figure 9.In case of the SVM scheme, The ESS power is less frequently charged or discharged, compared with that of the ANN scheme and the average power output of the PCS of the ESS is also smaller than that of the ANN scheme.Furthermore, in order to quantitatively compare the amount of exchanged power of ESS between the ANN and SVM schemes, we estimate the average amount of the hourly exchanged power of ESS (P ESS ), which is calculated as 3 summarizes the average amount of the hourly exchanged power of ESS both the ANN and SVM schemes for a 30-MW PV system.This result also shows that the average amount of the hourly exchanged power of ESS for the SVM scheme is smaller than that of the ANN scheme for three different power capacities of the ESS.In addition, compared with the smaller power capacity of ESS for the SVM scheme, the average amount of the hourly exchanged power of the SVM scheme is very slightly increased and most of the prediction errors do not exceed the allowable limit P when the power capacity of the ESS is 15% of the installed PV capacity.The optimum ESS energy capacity, E ESS , has been calculated by Equation (18). Figure 10 shows the energy capacity of the ANN scheme with different tolerance limits.As the power capacity of the ESS P ESS increases, E ESS rapidly becomes larger.On the other hand, the tolerance limit does not have a significant effect on the decrease in E ESS for the ANN scheme because there still exist greater prediction errors than P ESS even if we consider the tolerance limit.

Sampling interval [hour]
Figure 11 shows the energy capacity of the SVM scheme with different tolerance limits.Most prediction errors of the SVM scheme occur in the small error region (−0.05 to 0.05 [p.u.]), as shown in Figure 4 and a large portion of prediction errors for the SVM scheme is reduced by the tolerance limit.Therefore, as the tolerance limit increases, E ESS gradually becomes smaller.Table 4 summarizes the energy capacity of both of the ANN and SVM schemes with different tolerance limits.

Assessment of the Profit for the PV Power Producers
Figure 12 shows the hourly deviation penalties for varying tolerance limits with a penalty factor α of 1.0.The hourly deviation penalties have been calculated by Equations ( 14) and ( 15) considering the 10% power capacity as a fraction of the 30-MW PV farm.It can be observed that the deviation penalties from the SVM scheme without ESS significantly decrease, compared with those of the ANN scheme without ESS due to high prediction accuracy.The SVM-based prediction scheme without ESS yields a decrease in the deviation penalty by up to 48%, compared with that of the ANN scheme.In addition, the deviation penalties can be quite reduced by using ESS.Since the ESS power is more frequently charged or discharged for the ANN scheme, compared with that of the SVM scheme, as shown in Figure 8, the effect of the ESS for the ANN scheme is greater than that of the SVM scheme in terms of the deviation penalty reduction.Considering the 10% power capacity as a fraction of the 30-MW PV farm, the deviation penalty for using ESS for the ANN and SVM schemes decreases by up to 87% and 74%, respectively.Table 5 summarizes the hourly deviation penalties [$/h] for both of the ANN and SVM schemes with different power capacities and tolerance limits.Figure 13 shows the variation of the hourly energy market profit with different tolerance limits and a penalty factor of 1.0.As the tolerance limit P tol increases, the hourly energy market profit for both of the ANN and SVM schemes with ESS gradually approaches the no-penalty market profit ($ 589/h).It implies that a zero deviation penalty would be given to both of the ANN and SVM schemes with ESS when energy markets allow a tolerance limit P tol of larger than 10%.In Equation ( 12), the hourly deviation penalties with a tolerance limit of 5% for both of the ANN and SVM schemes increase in proportion to an increase in the penalty factor, as shown in Figure 14.In addition, in high penalty factors, the hourly deviation penalties for both of the ANN and SVM schemes with ESS is negligible, compared with those for both of the ANN and SVM schemes without ESS.As a result, a reasonable selection of the tolerance limit and the penalty factor is required to satisfy the economic feasibility of the PV producers in order to introduce the successful energy market introduction of ESSs.In Section 5.3, we can find the appropriate market policy or parameters for both the ANN and SVM schemes from the benefit-cost analysis (BCA) of PV power trading.Hourly deviation penalty with a tolerance limit of 5%.

Benefit-Cost Analysis (BCA) for the ESS in PV Power Trading
The BCA provides the comparison of the total expected cost of against the total expected benefit to confirm whether the benefit outweighs the cost and by how much [45].In the BCA, the benefit-cost ratio (B-C ratio) is the present value of benefit divided by the present value of cost.In order that a project may be economically feasible, the B-C ratio should be greater than one [46].The benefit-cost ratio is computed as where B i is the benefit in the i-th year and C i is the cost in the i-th year.Moreover, T li f e is the life of the project (years) and r RI is the rate of interest (fraction) and r IR is the inflation rate (fraction).In this paper, B i represents the profit difference between with/without ESS in the i-th year for BCA in PV power trading.Considering the cost reduction of the ESS, the power system cost ($/kW) is set to $300/kW and the energy storage cost ($/kWh) of the lithium-ion battery is set to $450/kWh [42].Moreover, we consider the life-time of the project of 10 years, the rate of interest (r RI ) of 4%, and the inflation rate (r IR ), the increase rate of electricity price) of 2%.
Figure 15 shows the benefit-cost ratio of the ANN scheme with a power capacity of 10% of the installed PV capacity.When the energy market permits a low tolerance limit (≤6%) and a high penalty factor (≥1.9), the benefit-cost ratio of the ANN scheme is greater than one.In other words, there is economical benefit of installing the ESS when the energy market enforces a harsh policy in terms of the tolerance limit and penalty factor for PV power producers.The maximum B-C Ratio of the ANN scheme is 1.1034 in a tolerance limit of 4% and a penalty factor of 2.0.
Figure 16 shows the benefit-cost ratio of the SVM scheme with a power capacity of 10% of the installed PV capacity.The SVM scheme yields a larger benefit-cost ratio than that of the ANN scheme in the same market parameters because the SVM scheme requires a quite smaller amount of the energy capacity of the ESS than the ANN scheme, as shown in Figure 11.When the tolerance limit is smaller than 11% and the penalty factor is larger than 1.0, the benefit-cost ratio of the SVM scheme is greater than one.When PV power producers adopt the SVM scheme for PV output power prediction, they also obtain much higher profit at a hard penalty market than a soft penalty market.Furthermore, the maximum benefit-cost ratio (BCR) value of the SVM scheme is 2.2043 in a tolerance limit of 2% and a penalty factor of 2.0.Table 6 summarizes the maximum BCR value of the ANN and SVM schemes for three different power capacities of ESSs.The smaller power capacity of the ESS has a greater maximum BCR value for each prediction scheme.However, the smaller power capacity of the ESS is not always the best choice because a power capacity of 10% of the installed PV capacity yields a higher maximum BCR value (2.20) than that of a power capacity of 5% of the installed PV capacity (2.02) for the SVM scheme when a tolerance limit of less than 2% is allowed in a specific energy market.Therefore, PV producers need to select the optimal size of the ESS, which has the highest BCR value to absorb the prediction errors, considering the different energy market policies.In addition, it can be observed that the PV power producer's profit also significantly increases by achieving high accuracy of the prediction scheme.

Conclusions
In this paper, it has been shown that the accurate machine learning (ML)-based PV output power prediction schemes with ESS management could significantly increase the market profit of PV power producers.For that purpose, the ANN and SVM schemes were utilized for predicting the PV output power from various meteorological big data.In addition, the capacity of ESSs, which can compensate the prediction error, was estimated from different energy market constraints.And then, we calculated the PV power producer's profit considering the deviation penalties in case of ML-based prediction schemes with/without ESS in order to quantitatively estimate the effect of ESS for different prediction accuracies of two ML-based prediction schemes.Numerical results were presented as a case study on the participation of a 30-MW PV farm in the LMP market of the PJM.These results showed the SVM scheme without ESS yields a decrease in the deviation penalties by up to 48% than the ANN scheme without ESS.In case of ML-based prediction schemes with ESS, the ANN and SVM schemes yield an decrease in the deviation penalty by up to 87% and 74%, respectively, compared with the profit of the ANN and SVM schemes without ESS.From the benefit-cost analysis (BCA) in the case study, it could be observed that the maximum B-C ratio of PV power producers significantly increases by using a more accurate prediction scheme with ESS and the maximum B-C ratio is 2.32 for the SVM scheme.Furthermore, the BCA results provided the economic regions of market parameters for both of the ANN and SVM schemes.As a result, the proposed ESS-based PV power trading could be suitable for the participation of PV power producers in order to increase the profit in a penalized market.
In this paper, since we utilized the actual meteorological data instead of the forecasted meteorological data, which has already been known in the prediction time, due to a difficulty in obtaining the proper forecasted meteorological data set, the prediction accuracy of the ANN and SVM schemes may slightly degrade in a real environment due to the forecasting accuracy of the meteorological parameters.We need to enhance the prediction scheme by reflecting the forecasted meteorological data for the practicality of prediction in further works.

Figure 2 .
Figure 2. A typical structure of a feed-forward ANN network.

Figure 3 .
Figure 3. Results of PV power prediction of day time for eight consecutive days.

Figure 4 .
Figure 4. PDFs of the prediction errors for the ANN and SVM schemes.

Figure 6 .Figure 7 .
Figure 6.Estimation of three different positive deviations for charging the ESS.

Figure 8 .Figure 9 .
Figure 8. Operation of the ESS for the ANN scheme with varying time intervals of day time (Tolerance limit = 5%).

Figure 14 .
Figure 14.Hourly deviation penalty with a tolerance limit of 5%.

Figure 15 .
Figure 15.The benefit-cost ratio of the ANN scheme with a power capacity of 10%.

Figure 16 .
Figure16.The benefit-cost ratio of the SVM scheme with a power capacity of 10%.

Author
Contributions: K.Y.B., H.S.J., and D.K.S. conceived and designed the experiments; K.Y.B. performed the experiments; K.Y.B. and H.S.J. analyzed the data; B.C.J. contributed analysis tools; B.C.J. took part in the validation and analysis of results; K.Y.B., H.S.J., B.C.J., and D.K.S. wrote the paper.

Table 1 .
Meteorological parameters and value range.
Table 2 summarizes the parameters values of the ANN and SVM schemes.

Table 2 .
Parameters of the t location-scale distribution for the ANN and SVM schemes.

Table 3 .
The average amount of the hourly exchanged power [kW] of ESS both the ANN and SVM schemes for a 30-MW PV system (Tolerance limit = 5%).

Table 4 .
Energy capacity [p.u.] of both the ANN and SVM schemes with different tolerance limits.
Energy capacity of the ANN scheme with different tolerance limits.Energy capacity of the SVM scheme with different tolerance limits.

Table 5 .
Hourly deviation penalties [$/h] of both ANN and SVM schemes with different power capacities and tolerance limits.

Table 6 .
The maximum benefit-cost ratio (BCR) value of the ANN and SVM schemes for different power capacities of ESSs.