Maximum Instantaneous Wind Speed Forecasting and Performance Evaluation by Using Numerical Weather Prediction and On-Site Measurement

: A maximum instantaneous wind speed forecast methodology based on the autoregressive with exogenous inputs (ARX) model is proposed, in which numerical weather prediction and on-site measurement are used as inputs and the model parameters are estimated using non-parametric regression with forgetting factors. The accuracy of prediction using a proposed dynamic model is then evaluated and compared to the conventional static model output statistics (MOS) model. It is found that the prediction accuracy is improved by utilizing numerical weather prediction with a higher horizontal resolution. Finally, the predictability of the maximum instantaneous wind speed higher than 15m/s is evaluated using the receiver operating characteristic (ROC) curve and the area under the curve (AUC). The optimal quantile level of the maximum instantaneous wind speed is derived using a cost function.


Introduction
Many forms of infrastructure such as railways and roads are constructed in complex terrain in Japan. The operation and maintenance of these infrastructures are influenced by strong gusts of wind. For example, train operation has to be suspended when a certain value of instantaneous wind speed is recorded. This requires the forecasting of strong gusts of wind, including information on when the strong gust of wind happens and the intensity of the gust, with a forecast horizon of up to 24 h. Since decisions concerning operation are usually based on the instantaneous wind speed, forecasting of instantaneous wind speed is needed, as shown in Misu and Ishihara [1]. Kobayashi and Shimamura [2] proposed a method of predicting the instantaneous wind speed up to 15 min ahead based on the on-site measurement data and the Kalman filter for the application of train operation regulation. Hoppmann et al. [3] proposed a method for the prediction of instantaneous wind speed with a confidence interval up to two minutes based on past measurement data using a linear trend model for railway operations in Germany. Nevertheless, these methods based on previously measured data do not have the ability to perform forecasts up to 24 h ahead, and for this it is necessary to use forecast methods which can consider weather conditions, such as numerical weather prediction data.
Accordingly, much research has been conducted on wind power forecasting of up to 24 h, which is needed for power grid operation and power trading [4,5]. The most important part of wind power forecasting is the forecasting of the wind speed, which can be done using a combination of a physical model, such as numerical weather prediction (NWP) and a statistical model based on on-site measurement. Landberg [6,7] predicted the wind speed in a wind farm based on numerical weather prediction and measured

Input Data
The Japan Meteorological Agency provides several numerical weather prediction datasets with different resolutions and forecast horizons, as shown in Table 1. In this study, the GPV-MSM dataset, which has the finest horizontal resolution of about 5 km at the surface, and the GPV-GSM dataset (Japan area), which has the second-finest horizontal resolution of about 20 km at the surface, are used in order to investigate the effect of the resolution of the NWP. The forecast interval of the GPV-MSM is three hours (i.e., eight forecasts per day), though only four forecasts per day are used for the sake of the comparison with GPV-GSM (Japan area). Among the forecast variables available in these NWPs, only the east-west component and the north-south components of the surface wind velocity are used as input data for the forecast model in this study.
In the proposed model, real-time on-site measurement of average wind speed, average wind direction, standard deviation of the wind speed and maximum instantaneous wind speed are used as input data. As described in Figure 1, the average wind speed and wind direction are the inputs to the average wind speed forecast model, the standard deviation of the wind speed is the input to the fluctuating wind speed forecast model, and the average wind speed and the standard deviation of the wind speed, as well as the maximum instantaneous wind speed, are the inputs to the peak factor forecast model. The averaging time of the wind speed can be set between 10 min and 1 h. In this study the averaging time is set to 30 min because the maximum instantaneous wind speed is forecasted every 30 min based on the 30-min average wind speed and the standard deviation of the wind speed for every 30 min. The forecast is also conducted every 30 min. Figure 2 shows the forecast schedule when GPV-MSM is used as the input NWP. Forecast data is delivered online from the Meteorological Service Support Center approximately three hours after the initial time but may be delayed for up to three hours. For this reason, each forecast dataset is considered to become available six hours after the initial time and can be used for the wind speed forecast. For example, if the question of whether a strong wind event will occur during the daytime has to be determined at 6:00 am, the NWP data that can be used is based on the initial time at 21:00. This data is the forecast value from the initial value (21:00 the previous day) until 12:00 the next day, which is 39 h after the initial time.
viation of the wind speed is the input to the fluctuating wind speed forecast model, and the average wind speed and the standard deviation of the wind speed, as well as the maximum instantaneous wind speed, are the inputs to the peak factor forecast model. The averaging time of the wind speed can be set between 10 minutes and 1 hour. In this study the averaging time is set to 30 minutes because the maximum instantaneous wind speed is forecasted every 30 minutes based on the 30-minute average wind speed and the standard deviation of the wind speed for every 30 minutes. The forecast is also conducted every 30 minutes. Figure 2 shows the forecast schedule when GPV-MSM is used as the input NWP. Forecast data is delivered online from the Meteorological Service Support Center approximately three hours after the initial time but may be delayed for up to three hours. For this reason, each forecast dataset is considered to become available six hours after the initial time and can be used for the wind speed forecast. For example, if the question of whether a strong wind event will occur during the daytime has to be determined at 6:00 am, the NWP data that can be used is based on the initial time at 21:00. This data is the forecast value from the initial value (21:00 the previous day) until 12:00 the next day, which is 39 hours after the initial time.

Forecast Model
The NWP does not contain the effects of high-resolution terrain and surface roughness because of the limitation of the spatial resolution. In this study, the local forecast is used in which the local wind speed, affected by the high-resolution topography and surface roughness, is expressed as a function of wind speed and direction of NWP and is identified by non-parametric regression using the measurements and the NPW data. After k steps (k hours) at time t, the local wind speed forecast value | | is modeled as a function of the NWP wind speed | and NWP wind direction θ | as shown in Equation (1), where the nearest neighbor of the output grid is used to extract exact-location wind forecasts from the NWP and is expressed as

Forecast Model
The NWP does not contain the effects of high-resolution terrain and surface roughness because of the limitation of the spatial resolution. In this study, the local forecast is used in which the local wind speed, affected by the high-resolution topography and surface roughness, is expressed as a function of wind speed and direction of NWP and is identified by non-parametric regression using the measurements and the NPW data. Afterk steps (k hours) at time t, the local wind speed forecast value | u local t+k|t is modeled as a function of the NWP wind speed u nwp t+k|t and NWP wind direction θ nwp t+k|t as shown in Equation (1), where the nearest neighbor of the output grid is used to extract exact-location wind forecasts from the NWP and is expressed as Note that k =k∆t. u The function f u nwp t+k|t , θ nwp t+k|t is estimated using non-parametric regression with a forgetting factor, which is set to the value of 0.999, as proposed by Joensen et al. [13]. The local wind speed obtained using Equation (1) is considered to contain no bias component of error, but the phase error, which is originally included in NWP, is still included. To reduce this error for short-term forecasts, a model in which the weighted sum of the persistent forecast, which is a forecast that the future weather condition will be the same as the present condition, and the local forecast obtained by Equation (1) are used as the final forecast value, as where a and b are the smooth functions of the forecast horizon k and the wind direction of NWP, and are estimated by non-parametric regression with the forgetting factor of 0.999 [13]. The fluctuating component of the wind speed σ local t+k|t is also assumed to be a function of the mean wind speed and wind direction of NWP.
Similar to the average wind speed, the forecast value of the fluctuating wind speed σ pred t+k|t is modeled as the weighted sum of the persistent forecast and the local forecast so that the phase error of the forecast can be reduced by Equation (5).
where a σ and b σ are the smooth functions of the forecast horizon k and the wind direction of the NWP, and are estimated by non-parametric regression with the forgetting factor of 0.999 [13].
Based on the average wind speed u   where p t is a peak factor and is assumed to be a value that depends only on the initial forecast time, not on the wind speed, wind direction or forecast period. It can be estimated as a constant value by using the regression with the forgetting factor, in which the error shown in Equation (7) is minimized. Since the peak factor fluctuates greatly depending on the weather conditions, the forgetting factor corresponds to a relatively short time scale, and 0.917 is used.
where u max,meas Since the proposed model is based on the average relationship between the predicted and measured wind speeds, the final output will be the expected value of the maximum instantaneous wind speed. During an actual strong wind event, the maximum instantaneous wind speed may fluctuate around this value. In this study, the standard deviation of this fluctuation can be modeled by using the mean square error of the maximum instantaneous wind speed forecast of the past t (k) and only the function of the forecast horizon k, as shown in Equation (8). When the maximum instantaneous wind speed forecast is carried out, Equation (9) is used to estimate the quantile level of the maximum instantaneous wind speed, which is important for engineering applications. The value of γ should be determined depending on the application and will be discussed in Section 3.

Non-Parametric Regression with Forgetting Factor
In this study, non-parametric regression with a forgetting factor is used to estimate the parameters or functions of each forecast model described in Section 2.2. This method can be formulated in the form of Equation (10).
where z s = z s(1) z s(2) · · · z s(M) T and q s = q s(1) q s(2) · · · q s(N) are the ex- is a smooth function of q s estimated by non-parametric regression and s is an model error modeled as white noise. The subscript s is the index of time series data. The models proposed in Section 2.2 can be expressed in the form of Equation (10). For example, Equation (1) can be shown as Equation (10) with the parameters defined in Equation (11): Equation (3) can be shown as Equation (10) with the parameters defined in Equation (12) as In non-parametric regression, a function φ(q s ) is estimated by using the pair of explanatory variables and training data obtained in the past locally around the described points q p = q p(1) q p(2) · · · q p(N) T . When the training data obtained at the time s is defined as y s , z s and q s = q s(1) q s(2) · · · q s(N) T , the functionφ p,t (q) near q p at time t is obtained by minimizing the weighted error, as shown in Equation (13).
where the weight λ t−s w q s , q p can be divided into two parts as follows.
λ is a model parameter called a forgetting factor that takes a value in the range of 0 < λ ≤ 1. Since t is the current time and s is the time when the measurement data were obtained, λ t−s is small for the old measurement data and large for the new data. By using this forgetting factor, a dynamic model which forgets past measurement data and emphasizes the weight of new measurement data is realized. When a larger value of λ is used, longer period data is remembered, and when a smaller value of λ is used, the training focuses more on the latest data. An index N e f f has been proposed [12], as shown in Equation (14), to show the effective number of data used for the training when different λ values are used.
For example, if λ = 1, the effective number of data used will be infinite, which is equivalent to using all the past measurement data. In the field of wind power forecasting, 0.999 is used as a forgetting factor [12].
w q s , q p is a weight function that takes a large value when the distance is small between the training data q s at time s and the point q p , in the vicinity of which the function is estimated as small, and takes a small value when the distance is large. With this weight function, it is possible to increase the weight of the training data in the vicinity of the estimation point of the function. The multidimensional weight function w q s , q p is expressed as the product of the one-dimensional weight function W(x), as shown in Equation (15).
where j is a quantity called bandwidth, which is a parameter that determines the smoothness of the estimated function. A function shown in Equation (16) is used as the onedimensional weight function W(x).
z s and λ, which minimize the error shown in Equation (13), can be calculated using the recurrence formula. The details are shown in Appendix A.

Forecast Results and Error Evaluation
First, the strong wind events and analysis conditions used in this study are explained in Section 3.1. Next, examples of forecasts performed in the study and models for comparison are introduced and the effect of differences in numerical weather forecast data used as inputs on forecast accuracy is clarified in Section 3.2. The performance of the forecast of the strong wind event with a maximum gust wind speed of 15 m/s or more is then quantitatively evaluated using the ROC curve and AUC in Section 3.3. Finally, the optimal quantile level of the maximum instantaneous wind speed is discussed in Section 3.4.

Analysis Conditions
In this study, we used the wind speed measured by an anemometer at a height of 10 m above the ground level in the mountainous region in East Japan during three winters, i.e., from 6 January, 2014 to 31 March, 2014; from 6 December, 2014 to 31 March, 2015; and from 6 December, 2015 to 31 March, 2016. The anemometer used was a N-262LVS from Nippon Electric Instruments, set at 138.769167 degrees East and 36.944089 degrees North. The sampling frequency was 4 Hz and a 3-s averaged value was used as the instantaneous wind speed. Table 2 shows a list of the 36 strong wind events with a maximum instantaneous wind speed of 15 m/s or more during the measurement campaign. All the 36 events are targeted for the maximum instantaneous wind speed forecast. The weather conditions for 36 strong wind events and corresponding weather conditions are also shown in Table 2. Figure 3 shows an aerial photograph and elevation contours of the area. The center of Figure 3 is the location of the anemometer. The measurement data required for correction of the local average wind speed and the local fluctuation wind speed were updated every 30 min using the latest data obtained at the time of the forecast. The initial values of the forecast model were trained by using the measurement data from December 2013 prior to the forecast evaluation.   The non-parametric regressions for each model were carried out under the conditions listed in Table 3. The forgetting factor λ was set to 0.999 in the models other than the peak factor forecast model. The effective number of data in Equation (14) corresponds to approximately 20 days, i.e., the models are always trained by using the past 20 days of data, approximately. On the other hand, 0.917 was used for estimating the peak factor, which corresponds to about 5.5 h. When performing non-parametric regression, it is necessary to set the initial values of the local wind speed conversion and the wind speed correction functions. In this study, a forecast was performed in advance from 1 December to 31 December 2013, prior to the analysis period, and the initial values of each model were obtained by training.

Forecast Examples and Comparative Models
In this section, model functions are identified by non-parametric regression and an example of the identified functions is discussed. Figure 4 shows the function f u nwp t+k|t , θ nwp t+k|t identified by the mean wind speed prediction model for the case of θ nwp t+k|t = 0 • and θ nwp t+k|t = 180 • . It is found that the local wind speed generally does not vary linearly with the wind speed forecasted by the numerical weather prediction and also depends on the wind direction.
In the mean wind speed forecast model, the functions a k, θ nwp t+k|t and b k, θ nwp t+k|t , which are used to take into account the on-site measurement, are also trained. Figure 5 shows the training results of these functions for θ nwp t+k|t = 180 • . When the forecast horizon k is short, the weight of the latest measurement data is large, and if the forecast horizon k time becomes longer, the weight of the forecast value based on the numerical weather forecast increases.

Forecast Examples and Comparative Models
In this section, model functions are identified by non-parametric regression and a example of the identified functions is discussed. Figure 4 shows the functio | , | identified by the mean wind speed prediction model for the ca of | = 0° and | = 180°. It is found that the local wind speed generally does n vary linearly with the wind speed forecasted by the numerical weather prediction an also depends on the wind direction.  In the mean wind speed forecast model, the functions , | an , | , which are used to take into account the on-site measurement, are also traine Figure 5 shows the training results of these functions for | = 180°. When the foreca horizon is short, the weight of the latest measurement data is large, and if the foreca horizon time becomes longer, the weight of the forecast value based on the numeric weather forecast increases.

Forecast Examples and Comparative Models
In this section, model functions are identified by non-parametric regression and an example of the identified functions is discussed. Figure 4 shows the function | , | identified by the mean wind speed prediction model for the case of | = 0° and | = 180°. It is found that the local wind speed generally does not vary linearly with the wind speed forecasted by the numerical weather prediction and also depends on the wind direction.  In the mean wind speed forecast model, the functions , | and , | , which are used to take into account the on-site measurement, are also trained. Figure 5 shows the training results of these functions for | = 180°. When the forecast horizon is short, the weight of the latest measurement data is large, and if the forecast horizon time becomes longer, the weight of the forecast value based on the numerical weather forecast increases.  The concept of using the wind speed ratio for each wind direction has been widely used to evaluate the influence of local topography on wind conditions [1]. In addition, in the wind power output forecast model MOS, the wind speed is multiplied by a constant for each wind direction. Based on this idea, it is possible to perform the forecast by multiplying the coefficient, which is a function of each wind direction, to the numerical weather forecast data released by the Japan Meteorological Agency. In this study, MOS is a function of the NWP with GPV-GSM, and the maximum instantaneous wind speed forecast model based on this concept is constructed as shown in Equations (17) to (19) and is used as the forecast model for comparison. Average wind speed u MOS t+k|t is expressed as where C θ nwp t+k|t is a function of the wind direction. S θ nwp t+k|t in Equation (18), which is used to forecast the fluctuating wind speed σ MOS t+k|t , is also a function of the wind direction and is identified basd on the past measurement data.
The peak factor p MOS is a constant and was determined based on the research by Ishizaki [15]. The maximum instantaneous wind speed u max,MOS t+k|t based on the static MOS model can be calculated by Figure 6 shows a comparison between the forecasted maximum instantaneous wind speed at 6:00 a.m. and measurement on the days when a typical strong wind event occurred. It is found that the prediction accuracy is improved by using the dynamic model proposed in this study compared to the conventional static MOS model, in which there is a large difference between the forecast value based on the numerical forecast and the observed value at 6:00 in the morning. There is also a significant difference between the MOS model and the dynamic model immediately after the start of the forecast. In addition, the prediction accuracy is better when the GPV-MSM data is used as the input NWP.
It is noted that the forecasted maximum instantaneous wind speeds in Figure 6 express the expected values at γ = 0 as shown in Equation (9) and are lower than the optimal values. In this study, γ = 0.9 was used as an optimal quantile level. This implies that the optimal forecasted maximum instantaneous wind speeds can match the measured ones, as shown in Figure 6. However, all model significantly underestimated the maximum instantaneous wind speeds on 30 March, 2014. This strong wind event was caused by atmospheric stratification and the mountain, as pointed out Saito and Ikawa [16]. The operational forecast models such as JMA-GSM or JMA-MSM utilize data assimilation of the global forecast data at all levels to achieve a high forecast score. However, this may be not the best solution for all cases. The use of an ensemble of models may improve the overall performance, as mentioned by Meng and Zhang [17], Mohrlen and Jorgensen [18].

Evaluation of Forecast Results
In this study, the accuracy of the forecast is evaluated using the root mean square error (RMSE) of the maximum instantaneous wind speed as a function of forecast horizon and the ROC curve of the strong wind event. The root mean square error of the maximum instantaneous wind speed is calculated for the entire forecast period of 11 months as a function of the forecast horizon k. Figure 7 shows the RMSE of maximum instantaneous wind speed predicted by the conventional static MOS model and the proposed dynamic method as functions of the forecast horizon. The static MOS model is almost constant regardless of the forecast horizon, whereas the proposed dynamic model uses the most up-to-date measurement data, and the forecast error within 6 h is greatly reduced. In addition, the use of the GPV-MSM forecast value as the input improves the forecast error for all forecast horizons compared to the GPV-GSM forecast value. This is due to the high horizontal resolution of the GPV-MSM model, which reflects the effects of mountainous terrain on the prediction.  Table 2. The strong wind events (a) and (i) were caused by South-coast cyclone, while the other strong wind events were caused by Siberian High and Aleutian Low.  Table 2. The strong wind events (a) and (i) were caused by South-coast cyclone, while the other strong wind events were caused by Siberian High and Aleutian Low.
The model for estimating the quantile level of the maximum instantaneous wind speed described in Section 2.2 is important for the capability of forecasting strong wind events, as shown in Table 2. Since the proposed dynamic model predicts the average value of the maximum instantaneous wind speed, it is necessary to estimate the quantile level of the maximum instantaneous wind speed in order to predict whether a strong wind event will occur. When the larger value of the parameter γ is used, the high quantile level of the maximum instantaneous wind speed is predicted, and the number of predicted strong wind events increases. On the other hand, when the smaller value of the parameter γ is used, the low quantile level of the maximum instantaneous wind speed decreases, and the number of predicted strong wind events is reduced.

Evaluation of Forecast Results
In this study, the accuracy of the forecast is evaluated using the root mean square error (RMSE) of the maximum instantaneous wind speed as a function of forecast horizon and the ROC curve of the strong wind event. The root mean square error of the maximum instantaneous wind speed is calculated for the entire forecast period of 11 months as a function of the forecast horizon . Figure 7 shows the RMSE of maximum instantaneous wind speed predicted by the conventional static MOS model and the proposed dynamic method as functions of the forecast horizon. The static MOS model is almost constant regardless of the forecast horizon, whereas the proposed dynamic model uses the most upto-date measurement data, and the forecast error within 6 hours is greatly reduced. In addition, the use of the GPV-MSM forecast value as the input improves the forecast error for all forecast horizons compared to the GPV-GSM forecast value. This is due to the high horizontal resolution of the GPV-MSM model, which reflects the effects of mountainous terrain on the prediction. The model for estimating the quantile level of the maximum instantaneous wind speed described in Section 2.2 is important for the capability of forecasting strong wind events, as shown in Table 2. Since the proposed dynamic model predicts the average value of the maximum instantaneous wind speed, it is necessary to estimate the quantile level of the maximum instantaneous wind speed in order to predict whether a strong wind event will occur. When the larger value of the parameter is used, the high quantile level of the maximum instantaneous wind speed is predicted, and the number of predicted strong wind events increases. On the other hand, when the smaller value of the parameter is used, the low quantile level of the maximum instantaneous wind speed decreases, and the number of predicted strong wind events is reduced.
It is common to use ROC curves [19,20] to evaluate forecasts including such model parameters. The ROC curve shows the false positive rate on the horizontal axis and the true positive rate (also called sensitivity) on the vertical axis and shows the change in the false positive rate and the true positive rate when the model parameters are changed. Table 4 shows the definitions of the true positive , false positive , false negative and true negative . The false positive rate is defined as /( + ), which means cases were predicted with a strong wind event, in a total of ( + ) cases in which no strong wind event occurred. On the other hand, the true positive rate can be defined as ( ⁄ + ), which means cases of strong wind events were predicted among ( + ) cases in which the strong wind event occurred. With a perfect forecast, the true positive rate is 1 and the false positive rate is 0, which means the ROC curve passes through the upper left corner of the figure and the value of the AUC is equal to one. For the random model, the true positive rate and the false positive rate are expected to be equal and the ROC curve is a straight line which connects the lower left corner and the upper right corner. The ROC It is common to use ROC curves [19,20] to evaluate forecasts including such model parameters. The ROC curve shows the false positive rate on the horizontal axis and the true positive rate (also called sensitivity) on the vertical axis and shows the change in the false positive rate and the true positive rate when the model parameters are changed. Table 4 shows the definitions of the true positive a, false positive b, false negative c and true negative d. The false positive rate is defined as b/(b + d), which means b cases were predicted with a strong wind event, in a total of (b + d) cases in which no strong wind event occurred. On the other hand, the true positive rate can be defined as a/(a + c), which means a cases of strong wind events were predicted among (a + c) cases in which the strong wind event occurred. With a perfect forecast, the true positive rate is 1 and the false positive rate is 0, which means the ROC curve passes through the upper left corner of the figure and the value of the AUC is equal to one. For the random model, the true positive rate and the false positive rate are expected to be equal and the ROC curve is a straight line which connects the lower left corner and the upper right corner. The ROC curve of the actual forecast model is located between these two curves. With a better forecast model, the ROC curve approaches the upper left corner. The forecast model can be evaluated quantitatively using the AUC. A larger AUC value indicates a better model [21]. In this study, the definition of the occurrence and forecast of a strong wind event is as follows. If a maximum instantaneous wind speed of 15 m/s or more is recorded between 6:00 a.m. and 18:00 p.m., it is defined as an occurrence of a strong wind event. Similarly, if a maximum instantaneous wind speed of 15 m/s or more is forecasted, it is defined as a forecast of a strong wind event. Figure 8 shows the ROC curves for the static MOS model and the proposed dynamic models, with GPV-GSM and GPV-MSM as the input NWPs. Regardless of the model used, the true positive rate can be increased by increasing the model parameter γ, but this also causes an increase in the false positive rate. When GPV-MSM is used as the input NWP, the ROC curve is the closest to the upper left corner, indicating that the true positive rate shows the highest value, whereas false positive rate has been kept low. Table 5 shows the AUC for each model. The AUC values were improved from 0.84 in the static MOS model to 0.94 in the proposed dynamic model with GPV-MSM, since NWP with GPV-MSM increases the resolution of land use and topography, resulting in the improvement of NWP performance, as mentioned by Kikuchi et al. [22]. Moreover, it should be noted that NWPs with very high resolution may not improve the performance of predictions, as shown in Goit et al. [23], in which the resolution of the NWP was increased from 2 km to 333 m and an accuracy improvement was not observed.

None
c (False negative) d (True negative) In this study, the definition of the occurrence and forecast of a strong wind event is as follows. If a maximum instantaneous wind speed of 15 m/s or more is recorded between 6:00 a.m. and 18:00 p.m., it is defined as an occurrence of a strong wind event. Similarly, if a maximum instantaneous wind speed of 15 m/s or more is forecasted, it is defined as a forecast of a strong wind event. Figure 8 shows the ROC curves for the static MOS model and the proposed dynamic models, with GPV-GSM and GPV-MSM as the input NWPs. Regardless of the model used, the true positive rate can be increased by increasing the model parameter , but this also causes an increase in the false positive rate. When GPV-MSM is used as the input NWP, the ROC curve is the closest to the upper left corner, indicating that the true positive rate shows the highest value, whereas false positive rate has been kept low. Table 5 shows the AUC for each model. The AUC values were improved from 0.84 in the static MOS model to 0.94 in the proposed dynamic model with GPV-MSM, since NWP with GPV-MSM increases the resolution of land use and topography, resulting in the improvement of NWP performance, as mentioned by Kikuchi et al. [22]. Moreover, it should be noted that NWPs with very high resolution may not improve the performance of predictions, as shown in Goit et al. [23], in which the resolution of the NWP was increased from 2 km to 333 m and an accuracy improvement was not observed.

Optimal Quantile Level of the Maximum Instantaneous Wind Speed
In order to find an optimal quantile level of the maximum instantaneous wind speed for engineering applications, the cost function of Pinson et al. [24] was used. The total loss caused by the false negative c and the false positive b is considered as the cost function. Assuming that the loss caused by one false negative event is l and the loss caused by one false positive is αl, the total loss l total over the forecast period can be written as Equation (20). In this study, by changing the value of α, the cost function as function of γ is investigated.
l total = cl + bαl (20) Figure 9 shows the dimensionless loss l total /l as a function of γ when α is 0.5 and 1. There is an optimum value at γ = 0.9 when α = 1 and at γ = 0.7 when α = 0.5. Since the number of occurrences of strong wind events is limited and the number of false negatives is limited, whereas the number of false positives is not limited, the graph is asymmetrical. It is obvious that the optimal quantile level of the maximum instantaneous wind speed depends on the losses by misses and false alarms. When α = 1, i.e., the losses by misses are equal to those by false alarms, the cost decreases when using a higher quantile level of the maximum instantaneous wind speed. However, when α = 0.5, i.e., the losses by false alarms are lower than those by misses, the cost decreases when using a lower quantile level of the maximum instantaneous wind speed. Figure 9 shows the dimensionless loss / as a function of when is 0.5 and 1. There is an optimum value at = 0.9 when = 1 and at = 0.7 when = 0.5. Since the number of occurrences of strong wind events is limited and the number of false negatives is limited, whereas the number of false positives is not limited, the graph is asymmetrical. It is obvious that the optimal quantile level of the maximum instantaneous wind speed depends on the losses by misses and false alarms. When = 1, i.e., the losses by misses are equal to those by false alarms, the cost decreases when using a higher quantile level of the maximum instantaneous wind speed. However, when = 0.5, i.e., the losses by false alarms are lower than those by misses, the cost decreases when using a lower quantile level of the maximum instantaneous wind speed.

Conclusions
In this study, a maximum instantaneous wind speed forecast model has been proposed, based on numerical weather prediction data and on-site measurement data, and validated using data measured in a mountainous area. The conclusions obtained are as follows: 1. The maximum instantaneous wind speed forecast model is constructed using the ARX model. A non-parametric regression with multiple time scale forgetting factors is proposed to identify the model parameters. The maximum instantaneous wind speeds calculated using the proposed dynamic model show favorable agreement with the measurements, whereas the conventional static MOS model underestimates them. 2. The prediction accuracy of the maximum instantaneous wind speed forecast is improved when high-resolution forecast data are used as the input NWP and when multiple time scale forgetting factors are adopted for the proposed dynamic model. 3. The predictability of a strong wind event with a maximum instantaneous wind speed of 15 m/s or more has been evaluated using the ROC curve and the AUC. The proposed dynamic model increases the true positive rate and the AUC increases from 0.84 in the static MOS model to 0.94 in the proposed dynamic model.

Conclusions
In this study, a maximum instantaneous wind speed forecast model has been proposed, based on numerical weather prediction data and on-site measurement data, and validated using data measured in a mountainous area. The conclusions obtained are as follows: 1.
The maximum instantaneous wind speed forecast model is constructed using the ARX model. A non-parametric regression with multiple time scale forgetting factors is proposed to identify the model parameters. The maximum instantaneous wind speeds calculated using the proposed dynamic model show favorable agreement with the measurements, whereas the conventional static MOS model underestimates them.

2.
The prediction accuracy of the maximum instantaneous wind speed forecast is improved when high-resolution forecast data are used as the input NWP and when multiple time scale forgetting factors are adopted for the proposed dynamic model.

3.
The predictability of a strong wind event with a maximum instantaneous wind speed of 15 m/s or more has been evaluated using the ROC curve and the AUC. The proposed dynamic model increases the true positive rate and the AUC increases from 0.84 in the static MOS model to 0.94 in the proposed dynamic model. study. When the vector q is one-dimensional (N = 1), the local estimation approximation functionφ p,t (q) can be expressed by the following equation.