1. Introduction
Taiwan, located in the Northwest Pacific and at the boundary of temperate and subtropical zones, is often hit by typhoons. The terrain of Taiwan is mostly hillsides with short and fast-flowing rivers. Typhoons cause economic losses in agriculture, fishery, and animal husbandry and cause casualties yearly. This type of loss is mostly caused by the heavy rainfall that occurs with a typhoon. If a high-intensity typhoon hits, the disaster will be more serious. The quality of typhoon forecasting plays an important role in preventing and reducing major disasters.
Typhoon intensity prediction is one of the important items of typhoon forecast as stronger typhoons can cause more severe disasters. However, an accurate forecast is difficult due to complex thermal and dynamical conditions (namely, environmental factors, such as atmosphere and ocean) for typhoon development [
1,
2]. The US Navy’s Joint Typhoon Warning Center (JTWC) uses an ST5D statistical model as the baseline to evaluate the intensity forecast technology. The ST5D model applies the concepts of climate and CLIPER to estimate the 5-day typhoon intensity without considering the possible track in the future. It will significantly underestimate the effects of typhoon intensity over time. DeMaria et al. [
2] pointed out that though the typhoon intensity forecast technology is steadily improved, it still lags the typhoon track forecast technology.
Statistical and dynamical models are commonly used for typhoon intensity forecasts worldwide. Common statistical models based on CLIPER include SHIFOR (Statistical Hurricane Intensity Forecast Model) [
3] and SHF5 (namely, 5-day 6-hourly SHIFOR) [
4,
5,
6,
7,
8,
9]. Dynamical models include GFDI, GHMI, and HWFI (coupled atmosphere-ocean model) [
2]. Due to complex changes in typhoon intensity, statistical models based on CLIPER often struggle to accurately describe the interaction between atmospheric and oceanic environmental factors and typhoon intensity. In contrast, numerical weather models (namely, dynamical models) can describe the interaction between atmospheric and oceanic environmental factors and typhoon intensity. However, as typhoons are mesoscale weather systems, there is still room for improvement in dynamical models’ ability to simulate detailed typhoon structures. Besides statistical and dynamical models, there is a statistical dynamical approach that combines statistical methods with environmental factors output by the numerical weather forecast. SHIPS (Statistical Hurricane Intensity Prediction Scheme) [
1] is used in the North Atlantic and Eastern Pacific. The results show that the error of SHIPS is smaller than that of models based on climate and persistency. STIPS (Statistical Typhoon Intensity Prediction Scheme), proposed by Knaff et al. [
10], is a multiple linear regression (MLR) model developed for the Northwest Pacific and takes the synoptic-scale environmental variable of global models as forecast factors [
10].
The statistical dynamical approach (such as SHIPS and STIPS) mainly uses atmospheric and oceanic environmental factors output by dynamical models as predictors of MLR to forecast changes in typhoon intensity. The interaction of atmospheric and oceanic environmental factors with changes in typhoon intensity is a highly nonlinear system. MLR cannot effectively analyze the nonlinear relationship [
11]. Machine learning (ML) can learn and simulate nonlinear systems more effectively than MLR [
12]. Common ML methods are an artificial neural network (ANN) [
13,
14,
15,
16] and a fuzzy inference system (FIS). A study has shown that ANN is superior to MLR in estimating the depth of the oceanic mixing layer [
17]. By combining genetic algorithms and artificial neural networks, Jin et al. obtained a more efficient method in intensity forecasting than climate and persistency [
18]. Sharma et al. employed ANN to develop a soft-computing cyclone intensity prediction scheme (SCIPS) which, as with STIPS, is a model to forecast typhoon intensity in the Northwest Pacific [
11]. They used ANN to improve the typhoon intensity forecasting in the Northwest Pacific and attempted to take the ocean heat content (OHC) as a predictor. SCIPS improves the previously widely used MLR intensity forecast model. Its performance is improved with increasing intensity and lead time when compared with MLR.
FIS is widely applied in studies on a hydrometeorological forecast to simulate the nonlinear relationship [
13,
19,
20]. Jang developed adaptive neuro-fuzzy inference systems (ANFIS) with the combination of artificial neural networks, which optimizes parameters through self-learning and organizational abilities of artificial neural networks [
21]. In recent years, ANFIS has been widely used in hydrometeorology. The research results show that ANFIS can provide reliable and stable hydrological forecasts [
22,
23,
24,
25].
As mentioned above, the main objective of this paper is that a typhoon intensity forecasting model proposed to utilize the typhoon dataset, SHIPS, from the statistical and dynamical models to improve the method by Knaff et al. However, this study does not specifically analyze the prediction of rapid intensification (RI) of typhoons. The ANFIS is employed to construct a typhoon intensity forecast model, including both the thermal and dynamical conditions, for predicting every 12 h for the next five days to improve the accuracy and quality of typhoon intensity forecast. The stepwise regression procedure (SRP) was employed to simplify model input data and reduce model complexity. A benchmark model was built based on the Knaff’s MLR approach in order to analyze and compare the performance of the ANFIS typhoon intensity forecast, which gives predictions every 12 h for the coming five days.
4. Typhoon Intensity Forecast Model Construction
4.1. Selection of Input Factors for Typhoon Intensity Forecast Models
In this study, input factors of ANFIS were selected by SRP. SRP can be divided into forward selection, backward selection, and stepwise selection according to selection methods. Stepwise selection, combining forward and backward selection, first selects predictors by forward selection and then conducts tests by backward selection.
SRP is used to select a variable combination with maximum explanatory power. It selects input variables according to the regression analysis of input and output variables and the performance of the linear regression test on the variable combination. It selects variables that pass the F-test by forward selection and removes variables that fail to pass the F-test by backward selection. After SRP’s F-test of the input variable combination, the t-test of a causal relationship between independent and dependent variables is conducted.
The input variables selected by SRP for the SHIPSa 12-h forecast model are shown in
Table 2, including VMAX (initial typhoon intensity), VMAX2 (squared initial typhoon intensity), DVMAX (intensity difference 12 h before typhoon), LON (longitude of typhoon center at the current moment), MPI2 (squared maximum potential typhoon intensity), MPI × VMAX (maximum potential typhoon intensity times initial typhoon intensity), POT (difference between maximum potential typhoon intensity times initial intensity), SHRD (vertical wind shear at 850–200 mb), T200 (the average temperature at 200 mb), and RHLO (relative humidity at 850–700 mb). Based on the input variables selected by SRP for SHIPSa 12 to 120-h forecast models,
Table 3 shows the order of factors selected for SHIPSa in all forecast periods. The numbers at the top of the table indicate the selection order by SRP, and the numbers at the left of the table indicate the 12 to 120-h forecast models. According to the selection results, POT has an important influence on various forecast models. VMAX2 and LON enhance their influences over the lead time, and DVMAX and SHRD lose their influences with an increase in the lead time.
The input variables selected by SRP for the SHIPSb 12-h forecast model are shown in
Table 4, including VMAX (initial typhoon intensity), VMAX
2 (squared initial typhoon intensity), DVMAX (intensity difference 12 h before typhoon), LON (longitude of typhoon center at the current moment), MPI × VMAX (maximum potential typhoon intensity times initial typhoon intensity), POT (difference between maximum potential typhoon intensity times initial intensity), SHRD (vertical wind shear at 850–200 mb), T200 (the average temperature at 200 mb), RHLO (relative humidity at 850–700 mb), and RHCN (ocean heat content). The words in bold are oceanic environmental factors. Based on the input variables selected by SRP for the SHIPSa forecast models for 12 to 120 h,
Table 5 shows the order of factors selected for SHIPSb in all forecast periods. In the table, the words in bold are oceanic environmental factors. Except for the 96-h forecast model with the same selection results as SHIPSa, oceanic factors of other forecast models were added to the model input variables. RHCN had a great influence when the lead time was close to the forecast time. Similarly, RSST (surface sea temperature) had a great influence when the forecast was made in advance.
4.2. ANFIS Typhoon Intensity Forecast Model Construction
In ANFIS architecture, the main optimized parameters are nonlinear premise parameters (
) as presented in Equation (3), and the linear conclusion parameter is presented in Equation (6). The hybrid learning rule [
21] used in this study optimized linear and nonlinear parameters separately. The least-square estimator optimized parameters in the linear parameter set, and the steepest gradient descent method optimized the nonlinear parameter set. This compound structure can effectively search for model parameters and improve the speed of model convergence. In the ANFIS model construction process, the radius of influence of SC was first determined. An initial FIS architecture was built after obtaining clustering results. Then, the training data were input into the FIS architecture to solve the linear and nonlinear parameter sets. Finally, the model was trained until the convergence.
This study used MI as the basis for ANFIS model selection to avoid overfitting and underfitting. The ANFIS data used were divided into ANFIS_SHIPSa and ANFIS_SHIPSb. The process of selecting the optimal ANFIS is shown in
Figure 3. A different radius of influence was tested in order to build multiple ANFIS sets to finally obtain the model with the lowest MI as the ANFIS typhoon intensity forecast model network.
Table 6 shows MI, radius of influence, and the number of rules of ANFIS typhoon intensity forecast models for 12 to 120 h.
4.3. MLR Typhoon Intensity Forecast Baseline Model Construction
To evaluate the improvement of the ANFIS on the typhoon intensity forecast by the MLR method, two additional MLR models were built in this study as baseline models for comparison. In the baseline models, all predictors of ANFIS were used, and MLR was used to forecast DELV. Unlike ANFIS, which uses three sets of data (training, validation, and testing data), MLR requires only two sets of data, one for training and one for testing. In the MLR baseline model, data from 2000 to 2005 were used as training data (same as ANFIS), and data from 2009 to 2012 were used as testing data.
In this study, the multiple regression coefficients of MLR were obtained from training data. Then predictive variables of testing data from 2009 to 2012 were used to obtain the multiple regression coefficients of training data to forecast DELV. Finally, two MLR baseline models, MLR_SHIPSa and MLR_SHIPSb, were built.