Enhanced Wind Field Spatial Downscaling Method Using UNET Architecture and Dual Cross-Attention Mechanism
Abstract
:1. Introduction
2. Data and Processing
2.1. Study Area
2.2. Data
- (1)
- CLDAS-V2.0 data [30], provided by the National Meteorological Information Center of the China Meteorological Administration, is coarse-resolution land surface data used as input for the model in this study. This data are generated by assimilating various ground and satellite observations using techniques such as the Spatial and Temporal Multiscale Analysis System (STMAS), Cumulative Distribution Function (CDF) matching, physical inversion, and terrain correction. It produces hourly, 0.0625° spatiotemporal resolution products covering the Asian region (0–60°N, 70–140°E). Compared to similar products, CLDAS-V2.0 data exhibit superior quality and has been widely applied in meteorological and environmental research fields. Each individual grid of the low-resolution wind field in the study area measures 112 × 112.
- (2)
- CLDAS-V3.0 product [1], high-resolution land surface data from the National Meteorological Information Center of the China Meteorological Administration, is used as the label data for the model in this study. This product combines the weather forecast products from the European Centre for Medium-Range Weather Forecasts (ECMWF) with over 60,000 national and regional automatic weather station data deployed by the China Meteorological Administration using the Spatial and Temporal Multiscale Analysis System (STMAS) assimilation method. It generates hourly, 0.01° spatiotemporal resolution merged data on an equally spaced latitude-longitude land grid, providing more detailed and accurate land surface meteorological information such as temperature, humidity, wind speed, and precipitation with higher spatiotemporal resolution. The grid size of each high-resolution wind field label in the study area is 700 × 700.
- (3)
- DEM data, obtained from a joint mapping mission called the Shuttle Radar Topography Mission (SRTM) conducted by the United States, Germany, and Italy’s national space agencies, is used in this study. The SRTM data used are version 4.1, with a resolution of 0.01°, and it has been filled using a new interpolation algorithm to better repair the gaps in the SRTM terrain data [31]. The DEM grid size in the study area is 700 × 700.
- (4)
- Station observation data include data from 339 national-level automatic weather stations and 5903 regional-level automatic weather stations within the study area. The spatial distribution of the weather stations can be seen in Figure 1.
Dataset | Source | Time Frame | Spatial Resolution | Spatial Range |
---|---|---|---|---|
CLDAS-V2.0 | NMIC | 2019.01–2021.12 (hourly) | 0.0625° | 109.0°~116.0°E 34.0°~41.0°N |
CLDAS-V3.0 | NMIC | 2019.01–2021.12 (hourly) | 0.01° | |
SRTM(DEM)-V4.1 | NASA | - | 0.01° | |
Station Observation | NMIC | 2019.01–2021.12 (hourly) | - |
2.3. Data Processing
2.3.1. Grid Data
2.3.2. Station Observation Data
3. Methodology
3.1. Structure of the Model
3.2. Multi-Scale Feature Embedding Module (MSFEM)
3.3. Dual Cross-Attention Module (DCA)
3.3.1. Channel Cross-Attention Module (CCA)
3.3.2. Spatial Cross-Attention Module (SCA)
3.4. Loss Function
4. Experimental Design and Evaluation Criteria
4.1. Experimental Design
4.1.1. Ablation Experiment
4.1.2. Contrast Experiment
- (1)
- Bilinear interpolation
- (2)
- SN-CLDASSD
4.2. Evaluation Criteria
5. Result
5.1. Ablation Results
5.2. Topographic Assessment
5.3. Time Assessment
5.4. Assessed by Wind Speed Rating
6. Discussion
7. Conclusions
- (1)
- The performance of deep learning models significantly surpasses that of the traditional bilinear interpolation method. Models based on the UNET architecture outperform SNCA_CLDASSD, showcasing the UNET’s ability to extract multi-level features and capture richer spatial information in wind field downscaling. UNET models with Cross-Attention mechanisms (CCA and SCA) outperform those without, demonstrating the effectiveness of these mechanisms. UNET_DCA, incorporating both channel and Spatial Cross-Attention mechanisms, outperforms UNET_CCA and UNET_SCA, showing superior performance in RMSE, MAE, and COR metrics. It outperforms BILINEAR by 50.19%, 51.47%, and 33.05%, and outperforms UNET by 6.54%, 8.49%, respectively. Additionally, UNET_DCA_ars, with more auxiliary information, excels in PSNR and SSIM indexes, displaying improvements of 30.21% and 37.07% over BILINEAR and showcasing enhancements of 4.33% and 3.29% over UNET.
- (2)
- Based on the terrain assessment results, UNET_DCA demonstrates superior performance in RMSE, MAE, and COR across mountain, plateau, basin, and valley regions. On the other hand, UNET_DCA_ars excels in PSNR and SSIM metrics across all terrains and also leads in RMSE, MAE, and COR in plain areas. This suggests that UNET_DCA shows a stronger correlation with actual values, while UNET_DCA_ars excels in preserving the quality and structural similarity of wind field images and capturing finer details in plain regions. At the same time, it can be seen from the comparison of visual images that the downscaling result of the bilinear interpolation method increases the number of grids, making it difficult to reconstruct the corresponding details. In contrast, the deep learning model can reconstruct the spatial details of the wind field, and UNET_DCA_ars can capture more delicate details.
- (3)
- The results of the time-based evaluation show that all indexes of all methods have the same trend over time in the intraday variation, and all deep learning models also perform poorly in the period of poor bilinear interpolation performance, indicating that data quality determines the upper limit of downscaling results, and the better the data quality, the better the downscaling results. Except for the COR index, the other four indexes were worse in the daytime and better at night. In general, SNCA_CLDASSD performs significantly better than bilinear interpolation in each season, while UNET is significantly better than SNCA_CLDASSD, and UNET_DCA is slightly better than UNET.
- (4)
- According to wind speed grade, the evaluation results indicate decreasing accuracy with higher wind speeds. UNET_DCA performs best for winds below grade 7, while UNET_DCA_ars excels for winds grade 7 and above. Small wind speed (less than or equal to 2 wind) has low accuracy during the day, high accuracy at night, the lowest accuracy in spring, and the highest accuracy in autumn; moderate wind speed (3~6 wind) has high accuracy during the day, low accuracy at night, the lowest accuracy in summer, and the highest accuracy in spring. For significant wind speeds (grade 7 and above), there are no apparent regular patterns in intra-day and intra-seasonal accuracy changes.
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Han, S.; Shi, C.; Xu, B.; Sun, S.; Zhang, T.; Jiang, L.; Liang, X. Development and evaluation of hourly and kilometer resolution retrospective and real-time surface meteorological blended forcing dataset (SMBFD) in China. J. Meteorol. Res. 2019, 33, 1168–1181. [Google Scholar] [CrossRef]
- Griggs, D.J.; Noguer, M. Climate change 2001: The scientific basis. In Contribution of Working Group I to the Third Assessment Report of the Intergovernmental Panel on Climate Change; Cambridge University Press: Cambridge, UK, 2002; Volume 57, pp. 267–269. [Google Scholar]
- Huang, X.; Rhoades, A.M.; Ullrich, P.A.; Zarzycki, C.M. An evaluation of the variable-resolution CESM for modeling California’s climate. J. Adv. Model. Earth Syst. 2016, 8, 345–369. [Google Scholar] [CrossRef]
- Chen, L.; Liang, X.Z.; DeWitt, D.; Samel, A.N.; Wang, J.X. Simulation of seasonal US precipitation and temperature by the nested CWRF-ECHAM system. Clim. Dyn. 2016, 46, 879–896. [Google Scholar] [CrossRef]
- Stehlík, J.; Bárdossy, A. Multivariate stochastic downscaling model for generating daily precipitation series based on atmospheric circulation. J. Hydrol. 2002, 256, 120–141. [Google Scholar] [CrossRef]
- Hertig, E.; Jacobeit, J. Assessments of Mediterranean precipitation changes for the 21st century using statistical downscaling techniques. Int. J. Climatol. J. R. Meteorol. Soc. 2008, 28, 1025–1045. [Google Scholar] [CrossRef]
- Semenov, M.A. Simulation of extreme weather events by a stochastic weather generator. Clim. Res. 2008, 35, 203–212. [Google Scholar] [CrossRef]
- Ailliot, P.; Allard, D.; Monbet, V.; Naveau, P. Stochastic weather generators: An overview of weather type models. J. Société Française Stat. 2015, 156, 101–113. [Google Scholar]
- Kwon, M.; Kwon, H.H.; Han, D. A spatial downscaling of soil moisture from rainfall, temperature, and AMSR2 using a Gaussian-mixture nonstationary hidden Markov model. J. Hydrol. 2018, 564, 1194–1207. [Google Scholar] [CrossRef]
- Sun, X.; Wang, J.; Zhang, L.; Ji, C.; Zhang, W.; Li, W. Spatial downscaling model combined with the Geographically Weighted Regression and multifractal models for monthly GPM/IMERG precipitation in Hubei Province, China. Atmosphere 2022, 13, 476. [Google Scholar] [CrossRef]
- Chan, K.C.; Zhou, S.; Xu, X.; Loy, C.C. Basicvsr++: Improving video super-resolution with enhanced propagation and alignment. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; pp. 5972–5981. [Google Scholar]
- Ranade, R.; Liang, Y.; Wang, S.; Bai, D.; Lee, J. 3D Texture Super Resolution via the Rendering Loss. In Proceedings of the ICASSP 2022–2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Virtual, 7–13 May 2022; IEEE: Piscataway, NJ, USA, 2022; pp. 1556–1560. [Google Scholar]
- Wang, P.; Bayram, B.; Sertel, E. A comprehensive review on deep learning based remote sensing image super-resolution methods. Earth-Sci. Rev. 2022, 232, 104110. [Google Scholar] [CrossRef]
- Choi, H.; Lee, J.; Yang, J. N-gram in swin transformers for efficient lightweight image super-resolution. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada, 17–24 June 2023; pp. 2071–2081. [Google Scholar]
- Leinonen, J.; Nerini, D.; Berne, A. Stochastic super-resolution for downscaling time-evolving atmospheric fields with a generative adversarial network. IEEE Trans. Geosci. Remote Sens. 2020, 59, 7211–7223. [Google Scholar] [CrossRef]
- Wang, F.; Tian, D.; Lowe, L.; Kalin, L.; Lehrter, J. Deep learning for daily precipitation and temperature downscaling. Water Resour. Res. 2021, 57, e2020WR029308. [Google Scholar] [CrossRef]
- Harris, L.; McRae, A.T.; Chantry, M.; Dueben, P.D.; Palmer, T.N. A generative deep learning approach to stochastic downscaling of precipitation forecasts. J. Adv. Model. Earth Syst. 2022, 14, e2022MS003120. [Google Scholar] [CrossRef] [PubMed]
- Gerges, F.; Boufadel, M.C.; Bou-Zeid, E.; Nassif, H.; Wang, J.T.L. A Novel Deep Learning Approach to the Statistical Downscaling of Temperatures for Monitoring Climate Change. In Proceedings of the 2022 The 6th International Conference on Machine Learning and Soft Computing, Haikou, China, 15–17 January 2022; pp. 1–7. [Google Scholar]
- Vandal, T.; Kodra, E.; Ganguly, S.; Michaelis, A.; Nemani, R.; Ganguly, A.R. Deepsd: Generating high resolution climate change projections through single image super-resolution. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, 13–17 August 2017; pp. 1663–1672. [Google Scholar]
- Mao, Z. Spatial Downscaling of Meteorological Data Based on Deep Learning Image Super-Resolution. Master’s Thesis, Wuhan University, Wuhan, China, 2019. [Google Scholar]
- Tie, R.; Shi, C.; Wan, G.; Hu, X.; Kang, L.; Ge, L. CLDASSD: Reconstructing fine textures of the temperature field using super-resolution technology. Adv. Atmos. Sci. 2022, 39, 117–130. [Google Scholar] [CrossRef]
- Tie, R.; Shi, C.; Wan, G.; Kang, L.; Ge, L. To Accurately and Lightly Downscale the Temperature Field by Deep Learning. Ournal Atmos. Ocean. Technol. 2022, 39, 479–490. [Google Scholar] [CrossRef]
- Shen, Z.; Shi, C.; Shen, R.; Tie, R.; Ge, L. Spatial Downscaling of Near-Surface Air Temperature Based on Deep Learning Cross-Attention Mechanism. Remote Sens. 2023, 15, 5084. [Google Scholar] [CrossRef]
- Höhlein, K.; Kern, M.; Hewson, T.; Westermann, R. A comparative study of convolutional neural network models for wind field downscaling. Meteorol. Appl. 2020, 27, e1961. [Google Scholar] [CrossRef]
- Dupuy, F.; Durand, P.; Hedde, T. Downscaling of surface wind forecasts using convolutional neural networks. Nonlinear Process. Geophys. 2023, 30, 553–570. [Google Scholar] [CrossRef]
- Lin, H.; Tang, J.; Wang, S.; Wang, S.; Dong, G. Deep learning downscaled high-resolution daily near surface meteorological datasets over east asia. Sci. Data 2023, 10, 890. [Google Scholar] [CrossRef]
- Gorkem, C.A.; Prasoon, M.; Emrah, C. Dual Cross-Attention for medical image segmentation. Eng. Appl. Artif. Intell. 2023, 126, 107139. [Google Scholar]
- Zhou, Y.; Huo, C.; Zhu, J.; Huo, L.; Pan, C. DCAT: Dual Cross-Attention-Based Transformer for Change Detection. Comput. Biol. Med. 2023, 15, 2395. [Google Scholar] [CrossRef]
- Fu, Z.; Li, J.; Hua, Z. DEAU-Net: Attention networks based on dual encoder for Medical Image Segmentation. Comput. Biol. Med. 2022, 150, 106197. [Google Scholar] [CrossRef] [PubMed]
- Han, S.; Liu, B.; Shi, C.; Liu, Y.; Qiu, M.; Sun, S. Evaluation of CLDAS and GLDAS datasets for Near-surface Air Temperature over major land areas of China. Sustainability 2020, 12, 4311. [Google Scholar] [CrossRef]
- Reuter, H.I.; Nelson, A.; Jarvis, A. An evaluation of void-filling interpolation methods for SRTM data. Int. J. Geogr. Inf. Sci. 2007, 21, 983–1008. [Google Scholar] [CrossRef]
- QX/T 118-2020; Meteorological Observation Data Quality Control. Chinese Industry Standard: Beijing, China, 2020.
- Olaf, R.; Philipp, F.; Thomas, B. U-net: Convolutional networks for biomedical images egmentation. In Proceedings of the Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, 5–9 October 2015; pp. 234–241. [Google Scholar]
- Jha, D.; Riegler, M.A.; Johansen, D.; Halvorsen, P.; Johansen, H.D. Doubleu-net: A deep convolutional neural network for medical image segmen-tation. In Proceedings of the 2020 IEEE 33rd International Symposium on Computer-Based Medical Systems (CBMS), Rochester, MN, USA, 28–30 July 2020; pp. 558–564. [Google Scholar]
- Bao, K.; Zhang, X.; Peng, W.; Yao, W. Deep learning method for super-resolution reconstruction of the spatio-temporal flow field. Adv. Aerodyn. 2023, 5, 19. [Google Scholar] [CrossRef]
- Xiao, Y.; Zhang, J.; Chen, W.; Wang, Y.; You, J.; Wang, Q. SR-DeblurUGAN: An End-to-End Super-Resolution and Deblurring Model with High Performance. Drones 2022, 6, 162. [Google Scholar] [CrossRef]
- Fan, Z.; Dan, T.; Liu, B.; Sheng, X.; Yu, H.; Cai, H. SGUNet: Style-guided UNet for adversely conditioned fundus image super-resolution. Neurocomputing 2021, 465, 238–247. [Google Scholar] [CrossRef]
- Mela, C.A.; Liu, Y. Application of convolutional neural networks towards nuclei segmentation in localization-based super-resolution fluorescence microscopy images. BMC Bioinform. 2021, 1, 325. [Google Scholar] [CrossRef]
- Chen, F.; Manning, K.W.; LeMone, M.A.; Trier, S.B.; Alfieri, J.G.; Roberts, R.; Tewari, M.; Niyogi, D.; Horst, T.W.; Oncley, S.P.; et al. Description and evaluation of the characteristics of the NCAR high-resolution land data assimilation system. J. Appl. Meteorol. Climatol. 2007, 46, 694–713. [Google Scholar] [CrossRef]
Topography | Serial Number |
---|---|
Mountains | 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, 18, 19, 22, 24, 26, 29, 32, 33, 36 |
Highland | 1, 2, 3, 39, 45, 46 |
Basin | 25, 31, 37, 38, 43, 44 |
Valley | 16, 23, 30 |
Plain | 20, 21, 27, 28, 34, 35, 40, 41, 42, 47, 48, 49 |
Model | CCA | SCA | Auxiliary Information |
---|---|---|---|
UNET | - | - | DEM |
UNET_CCA | √ | - | DEM |
UNET_SCA | - | √ | DEM |
UNET_DCA | √ | √ | DEM |
UNET_DCA_ars | √ | √ | DEM, slope, aspect, relief |
Methods | RMSE | MAE | COR | PSNR | SSIM |
---|---|---|---|---|---|
BILINEAR | 0.803 | 0.577 | 0.699 | 22.277 | 0.642 |
SNCA_CLDASSD | 0.589 | 0.427 | 0.844 | 24.917 | 0.748 |
UNET | 0.428 | 0.306 | 0.912 | 27.801 | 0.852 |
UNET_CCA | 0.401 | 0.286 | 0.928 | 28.363 | 0.876 |
UNET_SCA | 0.412 | 0.288 | 0.926 | 28.205 | 0.878 |
UNET_DCA | 0.400 | 0.280 | 0.930 | 28.806 | 0.877 |
UNET_DCA_ars | 0.410 | 0.289 | 0.928 | 29.006 | 0.880 |
Evaluation Index | Topography | Methods | ||||
---|---|---|---|---|---|---|
BILINEAR | SNCA_CLDASSD | UNET | UNET_DCA | UNET_DCA_ars | ||
RMSE | Mountains | 0.892 | 0.602 | 0.417 | 0.390 | 0.399 |
Highland | 0.776 | 0.638 | 0.446 | 0.408 | 0.419 | |
Basin | 0.724 | 0.592 | 0.437 | 0.424 | 0.431 | |
Valley | 0.989 | 0.561 | 0.339 | 0.321 | 0.326 | |
Plain | 0.569 | 0.524 | 0.434 | 0.443 | 0.421 | |
MAE | Mountains | 0.399 | 0.441 | 0.298 | 0.274 | 0.282 |
Highland | 0.419 | 0.464 | 0.321 | 0.283 | 0.298 | |
Basin | 0.431 | 0.427 | 0.308 | 0.304 | 0.309 | |
Valley | 0.326 | 0.429 | 0.249 | 0.232 | 0.238 | |
Plain | 0.421 | 0.382 | 0.319 | 0.313 | 0.300 | |
COR | Mountains | 0.628 | 0.829 | 0.923 | 0.932 | 0.929 |
Highland | 0.766 | 0.850 | 0.928 | 0.938 | 0.936 | |
Basin | 0.707 | 0.834 | 0.894 | 0.902 | 0.901 | |
Valley | 0.685 | 0.720 | 0.903 | 0.912 | 0.910 | |
Plain | 0.791 | 0.828 | 0.885 | 0.888 | 0.896 | |
PSNR | Mountains | 22.237 | 24.925 | 27.731 | 28.706 | 28.915 |
Highland | 22.283 | 24.564 | 27.862 | 28.812 | 29.031 | |
Basin | 22.452 | 24.732 | 27.615 | 28.693 | 28.762 | |
Valley | 23.035 | 25.154 | 28.061 | 29.120 | 29.210 | |
Plain | 21.398 | 24.281 | 26.914 | 27.062 | 27.235 | |
SSIM | Mountains | 0.621 | 0.748 | 0.852 | 0.877 | 0.881 |
Highland | 0.649 | 0.726 | 0.832 | 0.843 | 0.860 | |
Basin | 0.658 | 0.721 | 0.840 | 0.849 | 0.866 | |
Valley | 0.658 | 0.795 | 0.861 | 0.897 | 0.901 | |
Plain | 0.586 | 0.710 | 0.820 | 0.831 | 0.843 |
Grade | Methods | ||||
---|---|---|---|---|---|
BILINEAR | SNCA_CLDASSD | UNET | UNET_DCA | UNET_DCA_ars | |
≤2 | 0.635 | 0.705 | 0.774 | 0.788 | 0.787 |
3–4 | 0.259 | 0.415 | 0.584 | 0.603 | 0.597 |
5–6 | 0.184 | 0.232 | 0.379 | 0.420 | 0.404 |
≥7 | 0.167 | 0.154 | 0.313 | 0.322 | 0.331 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Liu, J.; Shi, C.; Ge, L.; Tie, R.; Chen, X.; Zhou, T.; Gu, X.; Shen, Z. Enhanced Wind Field Spatial Downscaling Method Using UNET Architecture and Dual Cross-Attention Mechanism. Remote Sens. 2024, 16, 1867. https://doi.org/10.3390/rs16111867
Liu J, Shi C, Ge L, Tie R, Chen X, Zhou T, Gu X, Shen Z. Enhanced Wind Field Spatial Downscaling Method Using UNET Architecture and Dual Cross-Attention Mechanism. Remote Sensing. 2024; 16(11):1867. https://doi.org/10.3390/rs16111867
Chicago/Turabian StyleLiu, Jieli, Chunxiang Shi, Lingling Ge, Ruian Tie, Xiaojian Chen, Tao Zhou, Xiang Gu, and Zhanfei Shen. 2024. "Enhanced Wind Field Spatial Downscaling Method Using UNET Architecture and Dual Cross-Attention Mechanism" Remote Sensing 16, no. 11: 1867. https://doi.org/10.3390/rs16111867
APA StyleLiu, J., Shi, C., Ge, L., Tie, R., Chen, X., Zhou, T., Gu, X., & Shen, Z. (2024). Enhanced Wind Field Spatial Downscaling Method Using UNET Architecture and Dual Cross-Attention Mechanism. Remote Sensing, 16(11), 1867. https://doi.org/10.3390/rs16111867