A Robust Deep Learning Ensemble Framework for Waterbody Detection Using High-Resolution X-Band SAR Under Data-Constrained Conditions
Highlights
- The performance of waterbody segmentation using deep learning models can be improved by incorporating land cover maps and topography information, such as slope and the Height Above Nearest Drainage (HAND), alongside high-resolution X-band SAR images.
- An ensemble of deep learning models provided moderate Intersection over Union (IoU) gains over the best single model but offered critical operational advantages in Precision–Recall balance and prediction consistency.
- Multi-modal data integration for waterbody detection: Our results suggest that incorporating auxiliary geospatial layers (e.g., topography and land cover), when available and sufficiently reliable, can effectively reduce sensitivity to ambiguity arising from any single sensor modality in waterbody detection, particularly in complex terrain like Republic of Korea.
- Ensemble modeling for operational stability: For applications where consistent performance is critical (e.g., flood risk assessment), combining complementary architectures can help mitigate model-specific inductive biases. Although the quantitative gains in this study were moderate, ensemble aggregation improved the Precision–Recall balance and prediction consistency when adopting Optimized Weights via systematic grid search.
Abstract
1. Introduction
2. Materials
2.1. The Capella SAR System
2.2. SAR Data Preprocessing
2.2.1. Standard SAR Intensity Pre-Processing
- (1)
- Dynamic range compression via logarithmic transformation;
- (2)
- Outlier mitigation through percentile-based clipping;
- (3)
- Min–max normalization to scale values to a standardized [0, 1] range.
2.2.2. Incidence Angle Corrected Pre-Processing
2.3. Auxiliary Data Preprocessing
2.3.1. Topography Data
2.3.2. Land Cover Maps
2.4. Waterbody Labeling
- Step 1: Identification of Water Candidates: Regions exhibiting low backscatter values due to specular reflection were identified as potential water body candidates.
- Step 2: Exclusion of Ambiguous Regions: To eliminate potential noise, radar shadows were removed by cross-referencing DEM-based slope information. Furthermore, considering the limited vegetation penetration depth of the X-band, the detection targets were strictly confined to open water.
- Step 3: Cross-validation and Final Labeling: Final validation was performed using the Normalized Difference Water Index (NDWI) and high-resolution aerial imagery. A conservative labeling approach was adopted, where only pixels confirmed as water in both SAR and optical imagery were selected to minimize false positives.
3. Methods
3.1. Overview
3.2. Input Channel Configuration
3.3. Segmentation Models and Ensemble Framework
3.3.1. PIDNet
3.3.2. Mask2Former
3.3.3. Swin Transformer
3.3.4. K-Net
3.3.5. Model Optimization
3.3.6. Weighted Average Ensemble
3.4. Model Performance Evaluation
- TP: Number of waterbody pixels correctly classified as waterbodies;
- TN: Number of non-waterbody pixels correctly classified as non-waterbodies;
- FP: Number of non-waterbody pixels incorrectly classified as waterbodies;
- FN: Number of waterbody pixels incorrectly classified as non-waterbodies.
4. Results
4.1. Input Channel Configurations
4.2. Performance Comparison Among Deep Learning Models
4.3. Blind Test Using Ensemble Model
4.3.1. Baseline Comparison and Ensemble Performance
4.3.2. Evaluation of Ensemble Strategies
5. Discussions
5.1. Ensemble Approach
5.1.1. Ensemble for Constrained Data
5.1.2. Inter-Model Disagreement
5.2. Analyses by Case
5.2.1. Case 1: Estuarine Area
5.2.2. Case 2: Mountainous Terrain
5.2.3. Case 3: Urban Area
5.3. Implications, Limitations and Future Work
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Sheffield, J.; Wood, E.F.; Pan, M.; Beck, H.; Coccia, G.; Serrat-Capdevila, A.; Verbist, K.J.W.R.R. Satellite remote sensing for water resources management: Potential for supporting sustainable development in data-poor regions. Water Resour. Res. 2018, 54, 9724–9758. [Google Scholar] [CrossRef]
- Rango, A. Application of remote sensing methods to hydrology and water resources. Hydrol. Sci. J. 1994, 39, 309–320. [Google Scholar] [CrossRef]
- Li, B.; Liu, K.; Wang, M.; Wang, Y.; He, Q.; Zhuang, L.; Zhu, W. High-spatiotemporal-resolution dynamic water monitoring using LightGBM model and Sentinel-2 MSI data. Int. J. Appl. Earth Obs. Geoinf. 2023, 118, 103278. [Google Scholar] [CrossRef]
- Liu, S.; Wu, Y.; Zhang, G.; Lin, N.; Liu, Z. Comparing water indices for Landsat data for automated surface water body extraction under complex ground background: A case study in Jilin Province. Remote Sens. 2023, 15, 1678. [Google Scholar] [CrossRef]
- Yang, X.; Chen, Y.; Wang, J. Combined use of Sentinel-2 and Landsat 8 to monitor water surface area dynamics using Google Earth Engine. Remote Sens. Lett. 2020, 11, 687–696. [Google Scholar] [CrossRef]
- Markert, K.N.; Chishtie, F.; Anderson, E.R.; Saah, D.; Griffin, R.E. On the merging of optical and SAR satellite imagery for surface water mapping applications. Results Phys. 2018, 9, 275–277. [Google Scholar] [CrossRef]
- Musa, Z.N.; Popescu, I.; Mynett, A. A review of applications of satellite SAR, optical, altimetry and DEM data for surface water modelling, mapping and parameter estimation. Hydrol. Earth Syst. Sci. 2015, 19, 3755–3769. [Google Scholar] [CrossRef]
- Uddin, K.; Matin, M.A.; Meyer, F.J. Operational flood mapping using multi-temporal Sentinel-1 SAR images: A case study from Bangladesh. Remote Sens. 2019, 11, 1581. [Google Scholar] [CrossRef]
- Grimaldi, S.; Xu, J.; Li, Y.; Pauwels, V.R.; Walker, J.P. Flood mapping under vegetation using single SAR acquisitions. Remote Sens. Environ. 2020, 237, 111582. [Google Scholar] [CrossRef]
- Pulvirenti, L.; Pierdicca, N.; Chini, M.; Guerriero, L. An algorithm for operational flood mapping from Synthetic Aperture Radar (SAR) data using fuzzy logic. Nat. Hazards Earth Syst. Sci. 2011, 11, 529–540. [Google Scholar] [CrossRef]
- Cutler, P.J.; Schwartzkopf, W.C.; Koehler, F.W. Robust automated thresholding of SAR imagery for open-water detection. In Proceedings of the 2015 IEEE Radar Conference (RadarCon), Arlington, VA, USA, 11–15 May 2015; IEEE: Piscataway, NJ, USA, 2015; pp. 310–315. [Google Scholar] [CrossRef]
- Wang, Z.; Zhang, R.; Zhang, Q.; Zhu, Y.; Huang, B.; Lu, Z. An automatic thresholding method for waterbody detection from SAR image. In Proceedings of the 2019 IEEE International Conference on Signal, Information and Data Processing (ICSIDP), Chongqing, China, 11–13 December 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 1–4. [Google Scholar] [CrossRef]
- Chen, S.; Huang, W.; Chen, Y.; Feng, M. An adaptive thresholding approach toward rapid flood coverage extraction from Sentinel-1 SAR imagery. Remote Sens. 2021, 13, 4899. [Google Scholar] [CrossRef]
- Bangira, T.; Alfieri, S.M.; Menenti, M.; Van Niekerk, A. Comparing thresholding with machine learning classifiers for mapping complex water. Remote Sens. 2019, 11, 1351. [Google Scholar] [CrossRef]
- Navale, A.; Haldar, D. Evaluation of machine learning algorithms to Sentinel SAR data. Spat. Inf. Res. 2020, 28, 345–355. [Google Scholar] [CrossRef]
- Pech-May, F.; Aquino-Santos, R.; Delgadillo-Partida, J. Sentinel-1 SAR images and deep learning for waterbody mapping. Remote Sens. 2023, 15, 3009. [Google Scholar] [CrossRef]
- Bereczky, M.; Wieland, M.; Krullikowski, C.; Martinis, S.; Plank, S. Sentinel-1-based water and flood mapping: Benchmarking convolutional neural networks against an operational rule-based processing chain. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2022, 15, 2023–2036. [Google Scholar] [CrossRef]
- Tavus, B.; Can, R.; Kocaman, S. A CNN-based flood mapping approach using sentinel-1 data. ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci. 2022, 3, 549–556. [Google Scholar] [CrossRef]
- Ronneberger, O.; Fischer, P.; Brox, T. U-net: Convolutional networks for biomedical image segmentation. In Proceedings of the Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, 5–9 October 2015; Springer International Publishing: Cham, Switzerland, 2015. Proceedings, Part III 18. pp. 234–241. [Google Scholar] [CrossRef]
- Khan, S.; Naseer, M.; Hayat, M.; Zamir, S.W.; Khan, F.S.; Shah, M. Transformers in vision: A survey. ACM Comput. Surv. (CSUR) 2022, 54, 1–41. [Google Scholar] [CrossRef]
- Saleh, T.; Weng, X.; Holail, S.; Hao, C.; Xia, G.S. DAM-Net: Flood detection from SAR imagery using differential attention metric-based vision transformers. ISPRS J. Photogramm. Remote Sens. 2024, 212, 440–453. [Google Scholar] [CrossRef]
- Zhang, Y.; Lu, H.; Ma, G.; Zhao, H.; Xie, D.; Geng, S.; Tian, W.; Sian, K.T.C.L.K. MU-net: Embedding MixFormer into unet to extract water bodies from remote sensing images. Remote Sens. 2023, 15, 3559. [Google Scholar] [CrossRef]
- Ma, D.; Jiang, L.; Li, J.; Shi, Y. Water index and Swin Transformer Ensemble (WISTE) for water body extraction from multispectral remote sensing images. GIScience Remote Sens. 2023, 60, 2251704. [Google Scholar] [CrossRef]
- Chen, B.; Zou, X.; Zhang, Y.; Li, J.; Li, K.; Xing, J.; Tao, P. LEFormer: A hybrid CNN-transformer architecture for accurate lake extraction from remote sensing imagery. In Proceedings of the ICASSP 2024–2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Seoul, Republic of Korea, 14–19 April 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 5710–5714. [Google Scholar] [CrossRef]
- Zhou, Y.; Yang, K.; Ma, F.; Hu, W.; Zhang, F. Water–land segmentation via structure-aware CNN–transformer network on large-scale SAR data. IEEE Sens. J. 2022, 23, 1408–1422. [Google Scholar] [CrossRef]
- Tsyganskaya, V.; Martinis, S.; Marzahn, P.; Ludwig, R. SAR-based detection of flooded vegetation–a review of characteristics and approaches. Int. J. Remote Sens. 2018, 39, 2255–2293. [Google Scholar] [CrossRef]
- Ganaie, M.A.; Hu, M.; Malik, A.K.; Tanveer, M.; Suganthan, P.N. Ensemble deep learning: A review. Eng. Appl. Artif. Intell. 2022, 115, 105151. [Google Scholar] [CrossRef]
- Sharma, N.K.; Saharia, M. DeepSARFlood: Rapid and Automated SAR-based flood inundation mapping using Vision Transformer-based Deep Ensembles with uncertainty estimates. Sci. Remote Sens. 2025, 11, 100203. [Google Scholar] [CrossRef]
- Hosseiny, B.; Mahdianpari, M.; Brisco, B.; Mohammadimanesh, F.; Salehi, B. WetNet: A spatial–temporal ensemble deep learning model for wetland classification using Sentinel-1 and Sentinel-2. IEEE Trans. Geosci. Remote Sens. 2021, 60, 1–14. [Google Scholar] [CrossRef]
- Paul, S.; Ganju, S. Flood segmentation on Sentinel-1 SAR imagery with semi-supervised learning. arXiv 2021, arXiv:2107.08369. [Google Scholar] [CrossRef]
- Hong, S.; Jang, H.; Kim, N.; Sohn, H.G. Water area extraction using RADARSAT SAR imagery combined with Landsat imagery and terrain information. Sensors 2015, 15, 6652–6667. [Google Scholar] [CrossRef]
- Hughes, L.H.; Marcos, D.; Lobry, S.; Tuia, D.; Schmitt, M. A deep learning framework for matching of SAR and optical imagery. ISPRS J. Photogramm. Remote Sens. 2020, 169, 166–179. [Google Scholar] [CrossRef]
- Najem, S.; Baghdadi, N.; Bazzi, H.; Zribi, M. Incidence angle normalization of C-band radar backscattering coefficient over agricultural surfaces using dynamic cosine method. Remote Sens. 2024, 16, 3838. [Google Scholar] [CrossRef]
- Azam, M.; Park, H.; Kim, J. Spatial and Temporal Trend Analysis of Precipitation and Drought in South Korea. Water 2018, 10, 765. [Google Scholar] [CrossRef]
- Jiang, C.; Zhang, H.; Wang, C.; Ge, J.; Wu, F. Water surface mapping from Sentinel-1 imagery based on attention-UNet3+: A case study of Poyang Lake region. Remote Sens. 2022, 14, 4708. [Google Scholar] [CrossRef]
- Hahmann, T.; Roth, A.; Martinis, S.; Twele, A.; Gruber, A. Automatic extraction of waterbodies from TerraSAR-X data. In Proceedings of the IGARSS 2008-2008 IEEE International Geoscience and Remote Sensing Symposium, Boston, MA, USA, 7–11 July 2008; IEEE: Piscataway, NJ, USA, 2008; Volume 3, pp. III-103–III-106. [Google Scholar] [CrossRef]
- Martinis, S.; Kersten, J.; Twele, A. A fully automated TerraSAR-X based flood service. ISPRS J. Photogramm. Remote Sens. 2015, 104, 203–212. [Google Scholar] [CrossRef]
- Yayong, S.; Shifeng, H.; Jiren, L.; Xiaotao, L.; Jianwei, M.; Hui, W. Monitoring seasonal changes in the water surface areas of Poyang Lake using COSMO-SkyMed time series data in PR China. In Proceedings of the 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Beijing, China, 10–15 July 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 7180–7183. [Google Scholar] [CrossRef]
- Tong, X.; Luo, X.; Liu, S.; Xie, H.; Chao, W.; Liu, S.; Liu, S.; Makhinov, A.N.; Makhinova, A.F.; Jiang, Y. An approach for flood monitoring by the combined use of Landsat 8 optical imagery and COSMO-SkyMed radar imagery. ISPRS J. Photogramm. Remote Sens. 2018, 136, 144–153. [Google Scholar] [CrossRef]
- Castelletti, D.; Farquharson, G.; Stringham, C.; Duersch, M.; Eddy, D. Capella space first operational SAR satellite. In Proceedings of the 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, Brussels, Belgium, 11–16 July 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 1483–1486. [Google Scholar] [CrossRef]
- Ignatenko, V.; Laurila, P.; Radius, A.; Lamentowski, L.; Antropov, O.; Muff, D. ICEYE Microsatellite SAR Constellation Status Update: Evaluation of first commercial imaging modes. In Proceedings of the IGARSS 2020-2020 IEEE International Geoscience and Remote Sensing Symposium, Virtual, 26 September–2 October 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 3581–3584. [Google Scholar] [CrossRef]
- Yague-Martinez, N.; Leach, N.R.; Dasgupta, A.; Tellman, E.; Brown, J.S. Towards frequent flood mapping with the Capella SAR system. The 2021 Eastern Australia floods case. In Proceedings of the 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, Brussels, Belgium, 11–16 July 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 6174–6177. [Google Scholar] [CrossRef]
- Popien, P.; D’Hondt, O.; Sunkara, V.; Chakrabarti, S. Deep Learning Based Urban Flood Mapping From High Resolution Capella Space Sar Imagery. In Proceedings of the IGARSS 2023-2023 IEEE International Geoscience and Remote Sensing Symposium, Pasadena, CA, USA, 16–21 July 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 1384–1387. [Google Scholar] [CrossRef]
- Stringham, C.; Farquharson, G.; Castelletti, D.; Quist, E.; Riggi, L.; Eddy, D.; Soenen, S. The capella X-band SAR constellation for rapid imaging. In Proceedings of the IGARSS 2019—2019 IEEE International Geoscience and Remote Sensing Symposium, Yokohama, Japan, 28 July–2 August 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 9248–9251. [Google Scholar] [CrossRef]
- Albinet, C. Technical Note for Capella Data Assessment; ESA Earthnet Data Assessment Pilot; European Space Agency: Paris, France, 2022; p. 28. Available online: https://earth.esa.int/eogateway/documents/20142/37627/Technical%20Note%20for%20Capella%20Data%20Assessment.pdf (accessed on 5 January 2026).
- Capella Space. Capella SAR System Performance v2.0; White Paper; Capella Space: San Francisco, CA, USA, 2020; p. 18. Available online: https://geokom.ba/wp-content/uploads/2020/12/Capella_Space_SAR_System_Performance.pdf (accessed on 5 January 2026).
- Younis, M.; Huber, S.; Patyuchenko, A.; Bordoni, F.; Krieger, G. Performance comparison of reflector-and planar-antenna based digital beam-forming SAR. Int. J. Antennas Propag. 2009, 2009, 614931. [Google Scholar] [CrossRef]
- Wang, X.; Liu, J.; Zhang, S.; Deng, Q.; Wang, Z.; Li, Y.; Fan, J. Detection of oil spill using SAR imagery based on AlexNet model. Comput. Intell. Neurosci. 2021, 2021, 4812979. [Google Scholar] [CrossRef]
- Rousso, R.; Katz, N.; Sharon, G.; Glizerin, Y.; Kosman, E.; Shuster, A. Automatic recognition of oil spills using neural networks and classic image processing. Water 2022, 14, 1127. [Google Scholar] [CrossRef]
- Singha, S.; Bellerby, T.J.; Trieschmann, O. Satellite oil spill detection using artificial neural networks. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2013, 6, 2355–2363. [Google Scholar] [CrossRef]
- Yu, Q.; Liu, W.; Gonçalves, W.N.; Junior, J.M.; Li, J. Spatial Resolution Enhancement for Large-Scale Land Cover Mapping via Weakly Supervised Deep Learning. Photogramm. Eng. Remote Sens. 2021, 87, 405–412. [Google Scholar] [CrossRef]
- Small, D. Flattening gamma: Radiometric terrain correction for SAR imagery. IEEE Trans. Geosci. Remote Sens. 2011, 49, 3081–3093. [Google Scholar] [CrossRef]
- Wang, Z.; Zhang, C.; Atkinson, P.M. Combining SAR images with land cover products for rapid urban flood mapping. Front. Environ. Sci. 2022, 10, 973192. [Google Scholar] [CrossRef]
- Li, Z.; Demir, I. U-net-based semantic classification for flood extent extraction using SAR imagery and GEE platform: A case study for 2019 central US flooding. Sci. Total Environ. 2023, 869, 161757. [Google Scholar] [CrossRef] [PubMed]
- Islam, M.T.; Meng, Q. An exploratory study of Sentinel-1 SAR for rapid urban flood mapping on Google Earth Engine. Int. J. Appl. Earth Obs. Geoinf. 2022, 113, 103002. [Google Scholar] [CrossRef]
- Mason, D.C.; Bevington, J.; Dance, S.L.; Revilla-Romero, B.; Smith, R.; Vetra-Carvalho, S.; Cloke, H.L. Improving urban flood mapping by merging synthetic aperture radar-derived flood footprints with flood hazard maps. Water 2021, 13, 1577. [Google Scholar] [CrossRef]
- Engen, M.; Sandø, E.; Sjølander, B.L.O.; Arenberg, S.; Gupta, R.; Goodwin, M. Farm-scale crop yield prediction from multi-temporal data using deep hybrid neural networks. Agronomy 2021, 11, 2576. [Google Scholar] [CrossRef]
- Johary, R.; Révillion, C.; Catry, T.; Alexandre, C.; Mouquet, P.; Rakotoniaina, S.; Pennober, G.; Rakotondraompiana, S. Detection of large-scale floods using Google Earth Engine and Google Colab. Remote Sens. 2023, 15, 5368. [Google Scholar] [CrossRef]
- Deng, K.; Hu, X.; Zhang, Z.; Su, B.; Feng, C.; Zhan, Y.; Wang, X.; Duan, Y. Cross-modal change detection using historical land use maps and current remote sensing images. ISPRS J. Photogramm. Remote Sens. 2024, 218, 114–132. [Google Scholar] [CrossRef]
- Brown, C.F.; Brumby, S.P.; Guzder-Williams, B.; Birch, T.; Hyde, S.B.; Mazzariello, J.; Czerwinski, W.; Pasquarella, V.J.; Haertel, R.; Ilyushchenko, S.; et al. Dynamic World, Near real-time global 10 m land use land cover mapping. Sci. Data 2022, 9, 251. [Google Scholar] [CrossRef]
- Xu, J.; Xiong, Z.; Bhattacharyya, S.P. PIDNet: A real-time semantic segmentation network inspired by PID controllers. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada, 17–24 June 2023; pp. 19529–19539. [Google Scholar] [CrossRef]
- Cheng, B.; Choudhuri, A.; Misra, I.; Kirillov, A.; Girdhar, R.; Schwing, A.G. Mask2former for video instance segmentation. arXiv 2021, arXiv:2112.10764. [Google Scholar] [CrossRef]
- Liu, Z.; Lin, Y.; Cao, Y.; Hu, H.; Wei, Y.; Zhang, Z.; Lin, S.; Guo, B. Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, QC, Canada, 10–17 October 2021; pp. 10012–10022. [Google Scholar] [CrossRef]
- Zhang, W.; Pang, J.; Chen, K.; Loy, C.C. K-net: Towards unified image segmentation. Adv. Neural Inf. Process. Syst. 2021, 34, 10326–10338. [Google Scholar] [CrossRef]
- Zhou, Z.; Rahman Siddiquee, M.M.; Tajbakhsh, N.; Liang, J. Unet++: A nested u-net architecture for medical image segmentation. In Proceedings of the International Workshop on Deep Learning in Medical Image Analysis, Granada, Spain, 20 September 2018; Springer International Publishing: Cham, Switzerland, 2018; pp. 3–11. [Google Scholar] [CrossRef]
- Peña, F.J.; Hübinger, C.; Payberah, A.H.; Jaramillo, F. DeepAqua: Semantic segmentation of wetland water surfaces with SAR imagery using deep neural networks without manually annotated data. Int. J. Appl. Earth Obs. Geoinf. 2024, 126, 103624. [Google Scholar] [CrossRef]
- Das, P.; Jensen, K.; De, S.; Ganguly, A.R. Flood Depth Estimation Using Synthetic Aperture Radar (SAR) Imagery and Topography: A Case Study of the 2021 and 2022 Floods in Hawkesbury Valley, Australia. In Proceedings of the IGARSS 2023—2023 IEEE International Geoscience and Remote Sensing Symposium, Pasadena, CA, USA, 16–21 July 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 2402–2405. [Google Scholar] [CrossRef]
- Jensen, K.; De, S.; Hughes, L.; Yalla, G. Flood Monitoring with X-Band and C-Band SAR: A Case Study of the 2021 British Columbia Floods. In Proceedings of the IGARSS 2022–2022 IEEE International Geoscience and Remote Sensing Symposium, Kuala Lumpur, Malaysia, 17–22 July 2022; IEEE: Piscataway, NJ, USA, 2022; pp. 5535–5538. [Google Scholar] [CrossRef]

















| Parameter | Wavelength |
|---|---|
| Frequency Band | X-band (9.4–9.9 GHz) |
| Imaging Bandwidth | 500–700 MHz |
| Imaging Modes | Spotlight |
| Sliding Spotlight | |
| Stripmap | |
| Imaging Polarizations | Single-Pol HH and VV |
| Imaging Orbit Directions | Ascending and Descending |
| Imaging Look Directions | Left and Right |
| Accessible Imaging Latitudes | SSO 1 97° Orbital Plane: +87.4°N to −87.4°S |
| MIO 2 53° Orbital Plane: +58.3°N to −58.3°S | |
| MIO 45° Orbital Plane: +48.9°N to −48.9°S | |
| Look Angle Ranges | 25–50° (Standard Products) |
| Up to 15–50° (Extended Products) | |
| Up to 5–50° (Custom Products) | |
| Transmit Power | 600 Watt |
| Acquisition Direction | Left and Right sides |
| Image Product | Imaging Mode | Nominal Scene Size | Azimuth Resolution | Slant Range Resolution | Ground Range Resolution | Look Angle Range |
|---|---|---|---|---|---|---|
| SLC | Stripmap | 5–10 km | 1.2 m | 0.75 m | NA | 25–45° |
| GEC/GEO | Stripmap | 5–10 km | 1.2 m | NA | 1.1–1.6 m | 25–45° |
| Image ID | US0629 | HS0904 | PH0912 | PH0916 | PH0925 | |
|---|---|---|---|---|---|---|
| Image size | 13,154 × 28,246 | 25,865 × 26,585 | 25,680 × 26,748 | 12,950 × 28,188 | 12,992 × 28,168 | |
| Acquisition date | 29 June 2022 | 4 September 2022 | 12 September 2022 | 16 September 2022 | 25 September 2022 | |
| Satellite | Capella-7 | Capella-6 | Capella-6 | Capella-8 | Capella-8 | |
| Frequency | X-band (9.65 GHz) | |||||
| Resolution | Ground range | 1.59 m | 1.59 m | 1.79 m | 1.60 m | 1.43 m |
| Azimuth | 1.41 m | 1.41 m | 1.37 m | 1.43 m | 1.43 m | |
| Pixel spacing | 0.8 m × 0.8 m | |||||
| Angle | Incidence | 36.80° | 36.75° | 32.00° | 36.60° | 41.80° |
| Look | 33.50° | 33.54° | 29.10° | 33.40° | 37.90° | |
| Orbit | Desc. | Desc. | Asc. | Desc. | Desc. | |
| NESZ peak | −17.71 | −13.61 | −16.17 | −18.90 | −17.36 | |
| Feature | Dynamic World Map [60] | MCEE Land Cover Map |
|---|---|---|
| Data Source | Sentinel-2 MSI (Optical Satellite) | Aerial Orthophotos |
| Data Type and Resolution | Raster (10 m) | Vector (Effective res. ≈ 1 m) |
| Update Frequency | Near Real-time (Available within 2–5 days of acquisition) | Periodic (1-year to multi-year update cycles) |
| Geographic Coverage | Global | Republic of Korea (National) |
| Classification Scheme | 9 Land Use/Land Cover (LULC) classes | 41 Detailed classes (Reclassified to 7) |
| Key Advantages | Temporal Synchronization: Aligns with SAR acquisition time, capturing dynamic changes (e.g., floods, seasonal water) | Geometric Precision: High boundary accuracy for static features (e.g., buildings, roads) due to vector format |
| Limitations | Coarser Resolution: Limited capacity to resolve small-scale objects; susceptible to cloud cover | Temporal Discrepancy: Potential misalignment with SAR imagery due to update latency (time lag) |
| Experiment ID | Input Channels | Descriptions | No. of Channels |
|---|---|---|---|
| Exp. 1 | HH | 1 | |
| Exp. 2 | HH, Gamma | 2 | |
| Exp. 3 | HH, Slope, and HAND | 3 | |
| Exp. 7 | HH, Slope, HAND, and Land cover | Land cover layers (urban, agriculture, forest, grassland) | 7 |
| Exp. 8 | HH, Slope, HAND, Land cover, and Gamma | All channels | 8 |
| Setting | PIDNet | Mask2Former | Swin Transformer | K-Net |
|---|---|---|---|---|
| Input size | 512 × 512 | |||
| Input channels | 8 (HH, Gamma, HAND, Slope, and land cover with four classes) | |||
| Backbone | PIDNet | Swin-Large | Swin-Large | Swin-Large |
| Batch size | 4 | |||
| Iterations | 30,000 | 20,000 | 20,000 | 20,000 |
| Learning rate | 1.0 × 10−4 | 2.0 × 10−4 | 2.0 × 10−4 | 2.0 × 10−4 |
| Optimizer | AdamW | |||
| Loss Function | CrossEntropy + OHEM + Boundary | CrossEntropy + Dice | CrossEntropy | CrossEntropy |
| Experiment ID | Exp. 1 | Exp. 2 | Exp. 3 | Exp. 7 | Exp. 8 |
|---|---|---|---|---|---|
| Water IoU | 0.9149 | 0.9166 | 0.9341 | 0.9521 | 0.9550 |
| Accuracy | 0.9859 | 0.9863 | 0.9891 | 0.9921 | 0.9926 |
| F1-score | 0.9556 | 0.9565 | 0.9659 | 0.9755 | 0.9770 |
| Recall | 0.9728 | 0.9800 | 0.9812 | 0.9785 | 0.9810 |
| Precision | 0.9389 | 0.9342 | 0.9511 | 0.9725 | 0.9730 |
| No. of Patches | Scenes Used | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 |
|---|---|---|---|---|---|---|
| Training set (w/Augmentation) | PH0912 PH0916 US0629 PH0925 | 2933 | 2940 | 2772 | 3017 | 3066 |
| Validation set (w/o Augmentation) | 107 | 106 | 130 | 95 | 88 | |
| Test set (w/o Augmentation) | HS0904 | 159 | 159 | 159 | 159 | 159 |
| Model | Water IoU | Accuracy | F1-Score | Precision | Recall |
|---|---|---|---|---|---|
| U-Net++ | 0.915 ± 0.016 | 0.991 ± 0.003 | 0.955 ± 0.009 | 0.962 ± 0.013 | 0.949 ± 0.016 |
| PIDNet | 0.916 ± 0.024 | 0.991 ± 0.003 | 0.956 ± 0.013 | 0.965 ± 0.013 | 0.947 ± 0.020 |
| Swin Transformer | 0.920 ± 0.026 | 0.991 ± 0.002 | 0.958 ± 0.015 | 0.966 ± 0.010 | 0.950 ± 0.021 |
| Mask2Former | 0.928 ± 0.018 | 0.992 ± 0.002 | 0.962 ± 0.010 | 0.967 ± 0.015 | 0.958 ± 0.019 |
| K-Net | 0.932 ± 0.014 | 0.993 ± 0.002 | 0.964 ± 0.010 | 0.968 ± 0.017 | 0.959 ± 0.020 |
| Model | Water IoU | Accuracy | F1-Score | Precision | Recall |
|---|---|---|---|---|---|
| Otsu | 0.6313 | 0.9016 | 0.7740 | 0.7240 | 0.8313 |
| U-Net++ | 0.8551 | 0.9687 | 0.9219 | 0.9333 | 0.9108 |
| PIDNet | 0.8921 | 0.9779 | 0.9430 | 0.9902 | 0.9001 |
| Swin Transformer | 0.9371 | 0.9871 | 0.9675 | 0.9916 | 0.9446 |
| Mask2Former | 0.9155 | 0.9828 | 0.9559 | 0.9930 | 0.9215 |
| K-Net | 0.9247 | 0.9845 | 0.9609 | 0.9851 | 0.9378 |
| Ensemble | 0.9422 | 0.9881 | 0.9703 | 0.9833 | 0.9575 |
| Model | Weights * | Water IoU | Accuracy | F1-Score | Precision | Recall |
|---|---|---|---|---|---|---|
| Best single | NA | 0.9371 | 0.9871 | 0.9675 | 0.9916 | 0.9446 |
| Equal Weights | [0.25, 0.25, 0.25, 0.25] | 0.9408 | 0.9878 | 0.9695 | 0.9861 | 0.9534 |
| Automated Weights (Softmax Normalization) | [0.2478, 0.2512, 0.2506, 0.2504] | 0.9408 | 0.9879 | 0.9695 | 0.9861 | 0.9535 |
| Optimized Weights (Grid Search) | [0.05, 0.40, 0.20, 0.35] | 0.9422 | 0.9881 | 0.9703 | 0.9833 | 0.9575 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Choi, S.; Kim, S.H.; Nghiem, S.V.; Kafatos, M.; Choi, M.; Kim, J.; Lee, Y. A Robust Deep Learning Ensemble Framework for Waterbody Detection Using High-Resolution X-Band SAR Under Data-Constrained Conditions. Remote Sens. 2026, 18, 301. https://doi.org/10.3390/rs18020301
Choi S, Kim SH, Nghiem SV, Kafatos M, Choi M, Kim J, Lee Y. A Robust Deep Learning Ensemble Framework for Waterbody Detection Using High-Resolution X-Band SAR Under Data-Constrained Conditions. Remote Sensing. 2026; 18(2):301. https://doi.org/10.3390/rs18020301
Chicago/Turabian StyleChoi, Soyeon, Seung Hee Kim, Son V. Nghiem, Menas Kafatos, Minha Choi, Jinsoo Kim, and Yangwon Lee. 2026. "A Robust Deep Learning Ensemble Framework for Waterbody Detection Using High-Resolution X-Band SAR Under Data-Constrained Conditions" Remote Sensing 18, no. 2: 301. https://doi.org/10.3390/rs18020301
APA StyleChoi, S., Kim, S. H., Nghiem, S. V., Kafatos, M., Choi, M., Kim, J., & Lee, Y. (2026). A Robust Deep Learning Ensemble Framework for Waterbody Detection Using High-Resolution X-Band SAR Under Data-Constrained Conditions. Remote Sensing, 18(2), 301. https://doi.org/10.3390/rs18020301

