- Article
Evaluation of Machine Learning Methods for Detecting Subcircular Structures Associated with Potential Natural Hydrogen Sources
- Sergio García-Arias,
- Manuel A. Flórez and
- Joaquín Andrés Valencia Ortiz
Natural hydrogen has gained attention as a low-carbon energy vector, and some reported surface expressions have been linked to subcircular patterns, or fairy circles (FC), that may be detectable in multispectral satellite imagery. The Carolina Bays region, on the eastern coast of the United States, was selected because it hosts abundant, well-mapped subcircular depressions. This study aims to comparatively evaluate machine learning algorithms for identifying subcircular structures using Landsat-8 data. We processed 105 Collection 2 Level 2 scenes, masking clouds and shadows using the Level 2 quality band. Pixel-level labels were determined using a well-curated public dataset, derived from a high-resolution LiDAR survey. Traditional models (logistic regression, random forest, and multilayer perceptron) achieved precision scores below 0.66 and enabled a variable-importance analysis, which identified Band 3 (green), Band 6 (SWIR1), and five Normalised Unit Indices as the most predictive features. Deep learning models improved detection, and a U-Net architecture allowed for pixel-level segmentation of FC-like structures, producing false positives mostly in cloudy or shadowed areas. Overall, the results suggest that FC detection from multispectral data alone remains challenging due to class overlap and cloud/shadow contamination. Future work could explore integrating additional non-spectral descriptors, such as morphometric variables, to reduce ambiguities.
6 February 2026



![(A) Location of the study area within the Carolina Bays (USA), where McMaho et al. [7] report subcircular structures associated with potential natural hydrogen emissions. (B) AOI-01 and AOI-02, zones with clear subcircular expression. (C) Landsat-8 coverage used (path 016/row 036). (D) AOI-03 and AOI-04, areas without surface evidence of subcircular structures. The training labels were derived from the public layer provided by the North Carolina Department of Environmental Quality (NCDEQ). The coordinate reference system for Figure (A) is WGS 84 (EPSG: 4326), while Figures (B–D) use UTM Zone 17N (EPSG: 32617).](https://mdpi-res.com/cdn-cgi/image/w=470,h=317/https://mdpi-res.com/geomatics/geomatics-06-00016/article_deploy/html/images/geomatics-06-00016-g001-550.jpg)



