Figure 1.
Three-dimensional TC atmospheric characterization, where u, v, and z represent the wind field and geopotential features that we chose over the four isobars, and the red box is a subset of the entire TC range for which we visualize the 3D TC features.
Figure 1.
Three-dimensional TC atmospheric characterization, where u, v, and z represent the wind field and geopotential features that we chose over the four isobars, and the red box is a subset of the entire TC range for which we visualize the 3D TC features.
Figure 2.
The network architecture of AFR-SimVP. The Spatial Encoder is used to extract the 2D spatial features of TC variables, the Temporal Encoder is used to extract the temporal features of TCs, and the Decoder is used to reconstruct the 3D TC atmospheric field and use the reconstructed 3D atmospheric field for the soft labeling of subsequent dynamic self-distillation. The red block represents the coordinate attention mechanism, which is used to extract regions with a high response of the 3D features to the TC trajectory. The last multiple MLP layers are used to perform late fusion of the extracted features to predict the TC trajectory after 24 h. The U,V hidden state, Z hidden state, and wide hidden state together are used as the initial hidden state of AFRGRU-SimVP, and the state at 24 h is used as part of the input to AFRGRU-SimVP.
Figure 2.
The network architecture of AFR-SimVP. The Spatial Encoder is used to extract the 2D spatial features of TC variables, the Temporal Encoder is used to extract the temporal features of TCs, and the Decoder is used to reconstruct the 3D TC atmospheric field and use the reconstructed 3D atmospheric field for the soft labeling of subsequent dynamic self-distillation. The red block represents the coordinate attention mechanism, which is used to extract regions with a high response of the 3D features to the TC trajectory. The last multiple MLP layers are used to perform late fusion of the extracted features to predict the TC trajectory after 24 h. The U,V hidden state, Z hidden state, and wide hidden state together are used as the initial hidden state of AFRGRU-SimVP, and the state at 24 h is used as part of the input to AFRGRU-SimVP.
Figure 3.
The prediction results of the model after adding the second-order loss are plotted against the prediction results of the original model, where the blue curve represents the correct trajectory of Nakri, the red curve represents the prediction result curve of the original model for 24 h, and the green curve represents the prediction result curve of the model after adding the second-order loss. The pink line in the small figure represents the distance error between the predicted value and the real value, the yellow line represents the error between the predicted value and the real value at the previous moment, the green line represents the error between the real value and the real value at the previous moment, and the red angle between the green line and the yellow line represents the angle between the real position at the previous moment to the real position at this moment and the real position at the previous moment to the predicted position at this moment.
Figure 3.
The prediction results of the model after adding the second-order loss are plotted against the prediction results of the original model, where the blue curve represents the correct trajectory of Nakri, the red curve represents the prediction result curve of the original model for 24 h, and the green curve represents the prediction result curve of the model after adding the second-order loss. The pink line in the small figure represents the distance error between the predicted value and the real value, the yellow line represents the error between the predicted value and the real value at the previous moment, the green line represents the error between the real value and the real value at the previous moment, and the red angle between the green line and the yellow line represents the angle between the real position at the previous moment to the real position at this moment and the real position at the previous moment to the predicted position at this moment.
Figure 4.
The specific architecture of HTAFR-SimVP, where only the 6 h prediction process is drawn, and all other time points are similar. The Decoder is used to up-sample the feature maps of the students, and the up-sampled three-dimensional atmospheric state feature maps are utilized for auxiliary tasks, namely, TC wind field prediction and three-dimensional geopotential field prediction, where the coordinate attention block and some fully connected layers after up-sampling the network for each student are omitted. “Soft label” represents the soft label of the atmospheric state after the reconstruction of the network by the teacher, and “label” represents the real atmospheric state.
Figure 4.
The specific architecture of HTAFR-SimVP, where only the 6 h prediction process is drawn, and all other time points are similar. The Decoder is used to up-sample the feature maps of the students, and the up-sampled three-dimensional atmospheric state feature maps are utilized for auxiliary tasks, namely, TC wind field prediction and three-dimensional geopotential field prediction, where the coordinate attention block and some fully connected layers after up-sampling the network for each student are omitted. “Soft label” represents the soft label of the atmospheric state after the reconstruction of the network by the teacher, and “label” represents the real atmospheric state.
Figure 5.
The GRU structure for implementing 24 h to 72 h trajectory prediction, where initial hidden states and inputs are obtained from the AFR-SimVP network. H_GRU represents the initial hidden state of the GRU fused from the three parts of the hidden state of the AFR-SimVP network. The state at 24 h serves as part of the input for all time steps of the GRU, and the other part of the input consists of the prediction results for each time step.
Figure 5.
The GRU structure for implementing 24 h to 72 h trajectory prediction, where initial hidden states and inputs are obtained from the AFR-SimVP network. H_GRU represents the initial hidden state of the GRU fused from the three parts of the hidden state of the AFR-SimVP network. The state at 24 h serves as part of the input for all time steps of the GRU, and the other part of the input consists of the prediction results for each time step.
Figure 6.
The visualization of the effect of attention. The left figure represents the predicted geopotential field without adding attention, and the right figure represents the effect after adding attention.
Figure 6.
The visualization of the effect of attention. The left figure represents the predicted geopotential field without adding attention, and the right figure represents the effect after adding attention.
Figure 7.
Distance error box plots of 6-72 h trajectory prediction distances for three recurrent neural networks and AFR-SimVP and AFRGRU-SimVP.
Figure 7.
Distance error box plots of 6-72 h trajectory prediction distances for three recurrent neural networks and AFR-SimVP and AFRGRU-SimVP.
Figure 8.
The scatterplot distributions of latitude and longitude for 12 h (a,b), 24 h (c,d), 48 h (e,f), and 72 h (g,h) forecasts. Colors represent the maximum wind speed at the TC center.
Figure 8.
The scatterplot distributions of latitude and longitude for 12 h (a,b), 24 h (c,d), 48 h (e,f), and 72 h (g,h) forecasts. Colors represent the maximum wind speed at the TC center.
Figure 9.
The predicted results of the geopotential field at the 1000 hPa isobaric surface at a specific point along the track of Typhoon Bavi.
Figure 9.
The predicted results of the geopotential field at the 1000 hPa isobaric surface at a specific point along the track of Typhoon Bavi.
Figure 10.
Track prediction results.
Figure 10.
Track prediction results.
Table 1.
The CLIPER features.
Table 1.
The CLIPER features.
Factors | Feature Name | Description |
---|
- | | Latitude in the last 24 h |
- | | Longitude in the last 24 h |
- | | Wind speed in the last 24 h |
| | Current month |
- | | Six-hour latitude difference |
- | | Six-hour longitude difference |
- | | Six-hour wind speed difference |
| | Sum of squares of six-hour latitude |
difference |
| | Sum of squares of six-hour |
longitude difference |
| | Square root of feature 29 |
| | Square root of feature 30 |
- | | Square root of current latitude |
and longitude |
- | | Physical acceleration |
- | | Zonal angle |
- | | Meridional angle |
- | | Angle of historical location |
| |
- | | Angle of historical path |
| |
Table 2.
Dataset segmentation. To facilitate comparisons with previous methods, we followed the division method used in past studies and chose the 2015–2017 TC data as the test set.
Table 2.
Dataset segmentation. To facilitate comparisons with previous methods, we followed the division method used in past studies and chose the 2015–2017 TC data as the test set.
| Years | TCs | Samples |
---|
Train | 1979–2014, 2018–2021 | 1098 | 24,869 |
Test | 2015–2017 | 82 | 1951 |
Valid | 2022 | 24 | 451 |
Table 3.
Forecast errors (km) for the proposed model and traditional methods in 6 h, 12 h, 18 h, and 24 h prediction. Bold represents the minimum error.
Table 3.
Forecast errors (km) for the proposed model and traditional methods in 6 h, 12 h, 18 h, and 24 h prediction. Bold represents the minimum error.
Methods | 6 h | 12 h | 18 h | 24 h |
---|
Extrapolation (2003) | 33.78 | 79.20 | 135.48 | 201.28 |
CLIPER-BP (1975) | 37.53 | 73.31 | 115.13 | 162.62 |
Fusion CNN (2020) | 32.90 | - | - | 136.10 |
AE-GRU (2020) | - | - | 138.67 | 143.23 |
AM-convgru (2022) | - | - | - | 140.67 |
MMSTN (2022) | | 59.09 | - | 139.19 |
DBF-Net (2022) | 31.30 | 58.94 | 87.60 | 119.05 |
Smoothed-3DGRU (2022) | 27.89 | 52.37 | 79.16 | 112.05 |
HTAFR-SimVP | 31.46 | 52.80 | 77.97 | 108.21 |
AFR-SimVP | 29.31 | | | |
Table 4.
The long-term prediction effectiveness evaluation (MSE/RMSE) of multiple models, where RNN, LSTM, and GRU represent the direct cascading of AFR-SimVP with the recurrent network, SimVP represents our 24 h prediction model, and represents AFRGRU-SimVP. Bold highlights the best performance.
Table 4.
The long-term prediction effectiveness evaluation (MSE/RMSE) of multiple models, where RNN, LSTM, and GRU represent the direct cascading of AFR-SimVP with the recurrent network, SimVP represents our 24 h prediction model, and represents AFRGRU-SimVP. Bold highlights the best performance.
| Lat | Long |
---|
Forecast Hour | 6 h | 12 h | 18 h | 24 h | 48 h | 72 h | 6 h | 12 h | 18 h | 24 h | 48 h | 72 h |
---|
RNN | MAE | 0.307 | 0.422 | 0.536 | 0.683 | 1.587 | 2.884 | 0.489 | 0.564 | 0.695 | 0.854 | 2.001 | 4.078 |
| RMSE | 0.426 | 0.567 | 0.714 | 0.910 | 2.213 | 3.863 | 0.718 | 0.781 | 0.927 | 1.138 | 2.598 | 5.334 |
LSTM | MAE | 0.288 | 0.388 | 0.502 | 0.648 | 1.482 | 2.780 | 0.322 | 0.449 | 0.592 | 0.753 | 1.831 | 3.706 |
| RMSE | 0.369 | 0.501 | 0.666 | 0.870 | 2.128 | 3.806 | 0.430 | 0.596 | 0.783 | 1.003 | 2.449 | 4.892 |
GRU | MAE | 0.208 | 0.322 | 0.445 | 0.583 | 1.429 | 2.789 | 0.278 | 0.423 | 0.583 | 0.746 | 1.771 | 3.498 |
| RMSE | 0.272 | 0.424 | 0.588 | 0.774 | 2.009 | 3.755 | 0.369 | 0.562 | 0.772 | 1.002 | 2.355 | 4.658 |
SimVP | MAE | 0.155 | 0.277 | 0.417 | 0.583 | 1.509 | 2.841 | 0.206 | 0.328 | 0.501 | 0.673 | 1.860 | 3.709 |
| RMSE | 0.210 | 0.374 | 0.564 | 0.787 | 2.073 | 3.846 | 0.277 | 0.449 | 0.678 | 0.919 | 2.571 | 4.843 |
| MAE | | | | | | | | | | | | |
| RMSE | | | | | | | | | | | | |
Table 5.
A comparison of the average distance error (km) of multiple deep learning models. Bold represents the best value.
Table 5.
A comparison of the average distance error (km) of multiple deep learning models. Bold represents the best value.
Methods | 6 h | 12 h | 18 h | 24 h | 48 h | 72 h |
---|
CLIPER (1992) | - | - | - | 213 | 442 | 659 |
RNN | 67.44 | 82.67 | 103.19 | 129.35 | 295.35 | 557.99 |
LSTM | 51.42 | 70.02 | 91.19 | 116.56 | 273.45 | 523.17 |
GRU | 40.57 | 62.72 | 86.43 | 111.00 | 263.70 | 503.02 |
AFR-SimVP | 29.31 | 50.06 | 75.45 | 102.68 | 271.89 | 528.48 |
MMSTN(2022) | | 59.09 | - | 139.18 | 336.16 | 544.16 |
AFRGRU-SimVP | 29.31 | | | | | |
Table 6.
A comparison of AFR-SimVP and numerical weather prediction method trajectory prediction results. The variable “#Samples” represents the number of TCs in the current year used for model testing, while “AVG” represents the average error of the model predictions for all TCs over the three-year period.
Table 6.
A comparison of AFR-SimVP and numerical weather prediction method trajectory prediction results. The variable “#Samples” represents the number of TCs in the current year used for model testing, while “AVG” represents the average error of the model predictions for all TCs over the three-year period.
Year | | T213/T639 | SHTP | AFR-SimVP |
---|
2015 | #Samples | 46 | 440 | 908 |
24 h MDE(km) | 150.6 | 67.8 | 98.28 |
2016 | #Samples | 412 | 194 | 489 |
24 h MDE(km) | 114.9 | 88.5 | 106.02 |
2017 | #Samples | 301 | 253 | 554 |
24 h MDE(km) | 98.7 | 89.1 | 103.73 |
AVG | 121.4 | 81.8 | 102.68 |
Inference time | - | - | 1.51s |
Table 7.
The proposed method is compared with previous deep learning methods and numerical forecasting methods in terms of prediction errors (m/s) for the 24 h central wind speed forecasting task. Bold represents the minimum error.
Table 7.
The proposed method is compared with previous deep learning methods and numerical forecasting methods in terms of prediction errors (m/s) for the 24 h central wind speed forecasting task. Bold represents the minimum error.
Methods | 2015 | 2016 | 2017 | Avg |
---|
CMA | 4.26 | 5.10 | 5.63 | 4.99 |
JMA | 5.08 | 5.30 | 5.05 | 5.14 |
JTWC | 4.88 | 5.40 | 5.23 | 5.17 |
ECMWF-IFS | 5.44 | 9.22 | 6.40 | 7.02 |
NCEP-GFS | 6.42 | 6.42 | 5.92 | 6.25 |
TC-Pred (2022) | - | - | - | 3.98 |
SAF-Net (2022) | 4.54 | 4.78 | 3.95 | 4.42 |
TITP-Net (2022) | | 4.61 | | 4.03 |
Transformer (2023) | - | - | - | 5.04 |
AFR-SimVP | 4.02 | 4.13 | 3.80 | 3.98 |
HTAFR-SimVP | 3.95 | | 3.68 | |
Table 8.
A comparison of the proposed method with previous deep learning approaches in terms of multi-time-step prediction errors (m/s) for the TC central wind speed. Bold represents the minimum error.
Table 8.
A comparison of the proposed method with previous deep learning approaches in terms of multi-time-step prediction errors (m/s) for the TC central wind speed. Bold represents the minimum error.
Methods | 6 h | 12 h | 18 h | 24 h |
---|
Transformer (2023) | | 2.89 | 4.06 | 5.04 |
TC-Pred (2022) | 1.74 | | | 3.98 |
AFR-SimVP | 3.02 | 3.41 | 3.74 | 3.98 |
HTAFR-SimVP | 1.98 | 2.59 | 3.31 | |
Table 9.
Ablation study table. A tick indicates that the module is used, a cross indicates that it is not used.
Table 9.
Ablation study table. A tick indicates that the module is used, a cross indicates that it is not used.
Second-Order Loss | Data Enhancement | Catten | UV Atmospheric Variables | Atmospheric Field Reconstruction | Distance Error |
---|
✗ | ✗ | ✗ | ✗ | ✗ | 128.58 |
✓ | ✗ | ✗ | ✗ | ✗ | 121.28 |
✓ | ✓ | ✗ | ✗ | ✗ | 119.9 |
✓ | ✓ | ✓ | ✗ | ✗ | 112.86 |
✓ | ✓ | ✓ | ✓ | ✗ | 106.98 |
✓ | ✓ | ✓ | ✓ | ✓ | 102.68 |