1. Introduction
Over the past 20 years, the use of numerical weather prediction models has improved the forecast accuracy of tropical cyclone (TC) track by about 50% for lead times of 24–72 h [
1]. One of the main reasons for this improved accuracy is the use of high resolution remote sensing data. Satellite data are very important in the accurate tracking of fast-moving TCs, because they contain effective observational information for optimizing the initial numerical prediction field. Furthermore, high-spatiotemporal-resolution satellite measurements can provide observational data for areas lacking ground observational data [
2]. For example, cloud data from satellite images have been employed to improve the accuracy of numerical prediction [
3], and wind information contained in satellite data has been employed to improve TC forecast accuracy [
4]. Despite the advancements in the use of numerical models, the past two decades have seen little improvement in the accuracy of TC intensity prediction for lead times of 24–72 h [
5], and although meaningful improvements have been seen in TC intensity forecasting, such forecasting remains challenging, as the internal convection and formation mechanisms of such storms are not fully understood [
6].
In addition to conventional numerical methods, the statistical-dynamical approach has been used as an operational guide for TC intensity prediction. The “statistical hurricane prediction scheme” (SHIPS) has used this approach successfully in the North Atlantic and East Pacific to predict TC intensity [
7]. The first version of the “statistical typhoon intensity forecasting scheme” (STIPS) was developed by Knaff et al. [
8] in 2002, to predict TC intensity based on the SHIPS. In 2003, the STIPS was improved and applied in operational settings. Following this, Gao and Chiu [
9] attempted to predict TC intensity using an artificial neural network-based method; they proved that the accuracy of a scheme using neural network models was superior to that of one using multiple linear regression models, leading to the recent adoption of this method to predict TC intensity. In another study, in which the intensity of TCs was forecast using a radial-basis function network (RBFN), multilayer perceptron (MLP) model, and statistical multiple linear and ordinary linear regression [
10], the MLP model was found to have the lowest forecasting error. Chandra and Dayal [
11], predicted cyclone wind intensity in the South Pacific region using Elman recurrent neural networks, and Chaudhuri et al. [
12] found a double hidden layer neural network to be more accurate than a single-layer network for forecasting typhoon intensity. Thus, as described above, most researchers use a non-linear model to predict TC intensity.
Given the advancements in satellite data, some researchers investigated the use of satellite data in TC intensity forecasting [
13]. TC intensity in the North Indian Ocean was predicted by incorporating average values from satellite data into a set of predictors, and it was found that the addition of satellite data improved the prediction model [
11]. Gao et al. predicted TC intensity using a decision tree and satellite data; they showed that employing the ocean coupling potential intensity index with satellite data reduced the accuracy of the TC intensity prediction [
13]. The satellite predictors were extracted as area-averaged data, extracting detailed information using the mean results in the details being hidden or diluted. The satellite data are particularly rich around the center of the TC, and this data can describe the convective characteristics affecting TC intensity [
14]. Wang [
15] suggested that convection near the TC center is a key feature of overall convection in tropical cyclogenesis and that the spatial pattern of the convection intensity might also be important. Given such findings, a new extraction method that can comprehensively glean TC intensity information from remote sensing data could improve TC intensity prediction [
16].
In recent years, some scholars have shown that the decision tree method can improve the accuracy of the TC intensity prediction. TC intensity in the western North Pacific was predicted using the decision tree method [
17]. The theory and application of the decision tree method have developed significantly since Chen’s proposal of the eXtreme Gradient Boosting (XGBoost) model [
18]; this model has been applied in several studies, on topics including image classification [
19], speech recognition [
20], and biomedical studies [
21]. The XGBoost model has also proved to be useful in predicting TC intensity [
22]. TC intensity is affected by several factors that are often ambiguous and uncertain. In this study, we used the XGBoost model to predict TC intensity, because its algorithm can handle many dimensions and be used to conduct multi-factor predictions. Factor mining has proved to be useful in forecasting model constructions [
23,
24]. The XGBoost model can also be used in conjunction with factor mining, which has proved to be useful in forecasting model construction. Furthermore, another advantage of the XGBoost model is its very low computational cost for users with a basic knowledge of TCs.
Therefore, we focused on using satellite predictors extracted via a new method, to establish a machine learning model for predicting TC intensity. This study aimed to establish an XGBoost-based framework to predict TC intensity in the South China Sea, using data from the China Meteorological Administration–Shanghai Typhoon Institute (CMA-STI) [
25], meteorological variables from the European Centre for Medium-Range Weather Forecasts (ECMWF) [
26], and satellite data from the National Satellite Meteorological Center FengYun Satellite Data Center, covering the period of 2006–2017. The differences between the predicted and observed data were used to analyze the dynamics of TC intensity development. The primary objectives of this study were: (1) to establish a South China Sea TC intensity prediction model based on the XGBoost framework and satellite-based potential predictors; (2) to analyze the influence of the FengYun-2 (FY-2) remote sensor data on TC intensity forecast results, and (3) to analyze the influence of the satellite data extraction method used on TC intensity forecast results.
4. Discussion
In this study, we developed a new TC intensity prediction model that is based on the XGBoost model; the proposed model integrates climatology and persistence, environmental, and satellite-based predictors. Climatology and persistence factors can provide a more detailed description of TC motion features. Environmental factors can reflect the synoptic features of TC in the South China Sea. The TBB features are used to characterize the structure and the development of convection. Our experimental results demonstrate that satellite data can improve the accuracy of TC intensity prediction, particularly when TBB is used as input data. The MAE of model A2, which included S_mean satellite predictors that were not considered in A1, produced improvements of 2.27%, 5.90%, 5.84%, and 0.00% over the MAEs of A1 at lead times of 6, 12, 18, and 24 h. However, the MAE of model A3, which includes satellite predictors extracted via method S_2, produced MAE improvements of 2.73%, 7.58%, 7.64%, and 5.04% for lead times of 6, 12, 18, and 24 h over those of model A1. The conclusions of model A3 were also consistent with those of STIPER (Statistical Typhoon Intensity Prediction including surface Evaporation and inner core Rainfall) [
9].
In this study, we developed a new extraction method (the “ring segmentation method”) for satellite image data. Our experimental results suggested that the ring segmentation method can improve the accuracy of TC intensity prediction. The MAE of model A3 (A1 + satellite-based predictors extracted via ring segmentation) decreased by 0.47%, 1.79%, 1.91%, and 5.04% at 6, 12, 18, and 24 h lead times, respectively, compared to those of model A2 (A1 + satellite-based predictors extracted as mean values). There are two possible reasons for this. The first is that high-resolution FY satellite data can provide detailed information on TC structure. For example, we noted that the TC intensity attributes extracted from TBB features in FY-2 images included the TC cloud-top convection strength, distribution, and size, all of which are closely related to TC intensity. The absolute CC values between the TBB values at the TC centers and the TC intensity become smaller at longer lead times. Muramatsu [
55] suggested that TBB has a very high correlation with TC intensity and is therefore very important in TC prediction. High-resolution FY-2 satellite data allow TBB_C data to be extracted at smaller intervals (0.1°), allowing us to obtain 480 features from the TBB images by applying Fitzpatrick’s extraction method [
47]. The factors related to TC intensity at lead times of 6, 12, 18, and 24 h were extracted using the ring segmentation method, and the small intervals at which these data could be extracted provided detailed descriptions of the relationships between the TBB and TC intensity. The pixels colder than a particular TBB at given radii were also helpful in improving TC intensity prediction. Pixels colder than 253.15 K at a radius of 2° were found to be a noteworthy predictor of intensity [
56]. At a 24 h lead time, the highest predictive value was found for pixels colder than 218 K at radii of 0.6–0.7°. Pixels colder than 208 K represent a deep convective region [
57]. At an 18 h lead time, the number of pixels colder than 208 K at radii of 0.7–0.8° was the most significant predictor of TC intensity, indicating that deep convection regions are also very important for predicting TC intensity at lead times close to 18 h. In another study, the highest correlation between the maximum TBB and TB intensity was found in the 1.4° radial area at a 0–36 h lead time [
45]. In this study, the highest correlation between TBB and TC intensity was found in the 1.1–1.5° radial area at a 6–12 h lead time. Because our extraction method allowed us to employ high-resolution FY-2 satellite data, our results can be considered more precise than those found in previous studies.
The second reason that the ring segmentation method led to better results is that the XGBoost model allows multi-predictor input. In this study, we assessed the response of the model to different combinations of input features related to TC intensity in the South China Sea, such as inner-core convection, climatology and persistence factors, and environmental factors. The resulting combination of predictors obtained from the TBB features derived from the FY-2 satellite data, the best track data taken from the CMA-STI, and environmental information obtained from the ERA-Interim dataset were then used to build the final model. Because the model primarily uses randomly selected features, each tree can be built from a random set of features rather than the overall set. Combining all trees constructed in this manner led to the most accurate predictions. In this study, we used approximately 100 predictors that did not produce any serious over-fitting in the XGBoost model, because an appropriate scale was used to select the eta hyperparameters. However, we did identify one limitation in our proposed model. Its errors when predicting the intensity of very strong storms in the “severe typhoon” and “super typhoon” intensity categories were substantially large at lead times of 6, 12, 18, 24 h. This limitation is likely attributable to the relatively low number of samples concerning high-intensity TC events, leading to larger model errors concerning these types of storms.
Figure 8 shows the frequencies of different TC intensities in the South China Sea from 2006 to 2012. Cyclones in the severe typhoon and super typhoon categories have the lowest occurrence in the training data. Owing to the lack of training samples, the accuracy of the model, with regard to severe typhoons and super typhoons, is low.
The XGBoost method and the LSTM method can both predict TC intensity from a non-linear perspective. The XGBoost method essentially incorporates the XGBoost algorithm into the TC intensity prediction model, which improves the average absolute error of the existing TC intensity model predictions. The combination of feature selection and the spatio-temporal consideration of the LSTM model can also improve TC intensity predictions. The performance of each of the models seems to be dependent on lead time; the XGBoost model is best applied to the prediction of TC intensity at short lead times. At a 6 h lead time, the MAE and NRMSE values obtained using the XGBoost model were better than those of the LSTM model, for storms of all intensity levels. The LSTM model can sometimes provide better performance than the XGBoost model at longer lead times. At a 24 h lead time, the LSTM model is better suited for the prediction of tropical storms, severe tropical storms, and super typhoons than is the XGBoost model. However, the XGBoost method and the LSTM method may not fit the prediction of the TC intensity in the South China Sea at 72 h lead time. This limitation is likely attributable to the relatively low number of samples concerning TC events at 72 h lead time. The lack of sample is one of the bottlenecks for predicting TC intensity using non-linear method [
58]. Overall, both models offer improved Fengyun-2 satellite data utilization over that of other models and represent promising advances in TC intensity prediction at 6, 12, 18, and 24 h lead times.
5. Conclusions
Using TBB features derived from FY-2 satellite data, best-track data derived from the CMA-STI, and environmental information taken from the ERA-Interim dataset as input data, we used the XGBoost model to simulate and predict the intensity of TCs in the South China Sea, at lead times of 6, 12, 18, and 24 h. We analyzed the influence of satellite data utilization and explored the most accurate parameter sets under three scenarios. The examined prediction models were built based on climatology and persistence predictors, environmental predictors, and satellite-based predictors. The traditional extraction method for satellite image data directly extracts average values from the images, but it cannot fully reflect the cloud-based characteristics. Based on existing research, this study employed a high-precision and maneuverable ring segmentation method for satellite-based predictor extraction and compared its performance to that of the conventional method. We trained the models using TC event data from 2006–2012 and verified the accuracy of their TC-intensity predictions using TC event data from 2013–2017. The results were as follows.
(1) The Fengyun-2 satellite data can be used to increase the accuracy of TC intensity forecasts for the South China Sea at 6, 12, 18, and 24 h lead times. We analyzed the influence of satellite data on the accuracy of TC intensity forecasts under three models; the MAEs of models A2 and A3, which employed satellite-based prediction factors, were smaller than the MAE of model A1 at 6, 12, 18, and 24 h lead times.
(2) The ring segmentation method was found to be a suitable extraction method for satellite data. Compared with that of model A2 (which considered satellite predictors extracted as averages), the MAE of model A3 decreased by 0.47%, 1.79%, 1.91%, and 5.04% at 6, 12, 18, and 24 h lead times, respectively. The relative biases of model A3 were 8.62%, 12.57%, 16.48%, and 19.04% at 6, 12, 18, and 24 h lead times, respectively; the R squares of model A3 were 0.91, 0.82, 0.73, 0.63 at 6, 12, 18, and 24 h lead times, respectively; the NRMSEs of model A3 were 5.28, 8.19, 9.42, and 10.51 at 6, 12, 18, and 24 h lead times, respectively.
(3) The TC intensity forecast results of the LSTM and XGBoost models for the South China Sea from 2013 to 2017 were analyzed. The XGBoost model was found to be more stable and better suited for 6 h TC intensity predictions than the LSTM model.
Overall, these results suggest that the proposed XGBoost model and extraction method can be used to accurately estimate TC intensities in the South China Sea, using satellite data and optimized parameter sets. The MAE, NRMSE, and CC results used to compare the three different modeling approaches further suggest that satellite data might be the most important type of predictor and that the ring segmentation method might be the best available extraction method for such satellite data.