Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (114)

Search Parameters:
Keywords = random forest Kriging

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
17 pages, 5059 KB  
Article
Decision Tree-Based Pilot Workload Prediction Through Optimized HRV Features Selection
by Carmelo Rosario Vindigni, Giuseppe Iacolino, Antonio Esposito, Calogero Orlando and Andrea Alaimo
Aerospace 2026, 13(1), 73; https://doi.org/10.3390/aerospace13010073 - 9 Jan 2026
Viewed by 121
Abstract
This research explores the use of physiological signals derived from heart activity to assess mental effort during flight-related tasks. Data were collected through wearable sensors during simulations with varying cognitive demands. Specific indicators related to heart rate variability (HRV) were extracted and tested [...] Read more.
This research explores the use of physiological signals derived from heart activity to assess mental effort during flight-related tasks. Data were collected through wearable sensors during simulations with varying cognitive demands. Specific indicators related to heart rate variability (HRV) were extracted and tested in different combinations to identify those most relevant for distinguishing levels of mental workload (WL). A Random Forest (RF) ensemble method is applied to classify two conditions, and its performance is examined under various settings, including model complexity and data partitioning strategies. Results showed that certain feature pairs significantly enhanced classification accuracy. The best features settings obtained from the RF are then used to train the other two decision trees-based classifiers, namely the AdaBoost and the XGBoost. Moreover, the decision trees models output is compared with predictions from a Kriging spatial interpolation technique, showing superior results in terms of reliability and consistency. This study highlights the potential of using heart-based physiological data and advanced classification techniques for developing intelligent support systems in aviation. Full article
(This article belongs to the Section Aeronautics)
Show Figures

Figure 1

20 pages, 16754 KB  
Article
GSA-cGAN: A Geospatial-Aware Conditional Wasserstein Generative Adversarial Network for Mineral Resources Interpolation
by Hosang Han and Jangwon Suh
Appl. Sci. 2026, 16(2), 674; https://doi.org/10.3390/app16020674 - 8 Jan 2026
Viewed by 168
Abstract
In the context of mineral resource exploration, spatial prediction must cope with heterogeneous, non-normal data distributions and limited sampling. While conventional geostatistics and standard machine learning provide baseline estimates, they often suffer from excessive smoothing or fail to capture continuous spatial dependencies. This [...] Read more.
In the context of mineral resource exploration, spatial prediction must cope with heterogeneous, non-normal data distributions and limited sampling. While conventional geostatistics and standard machine learning provide baseline estimates, they often suffer from excessive smoothing or fail to capture continuous spatial dependencies. This study proposes a geospatially aware Wasserstein conditional Generative Adversarial Network (GSA-cGAN) to complement existing workflows for multivariate mineral interpolation. The framework augments a baseline cGAN with WGAN-GP for stable adversarial training, CoordConv to encode absolute spatial coordinates and Self-Attention to capture long-range spatial dependencies. Eight model configurations were trained on 272 samples from a mineralized zone in the Taebaek Mountains, Korea, and strictly benchmarked against Ordinary/Universal Kriging and multivariate machine learning baselines (Random Forest, XGBoost). Under the adopted experimental design, the full GSA-cGAN achieved the lowest test root mean squared error and highest coefficient of determination, demonstrating a significant performance improvement over the baselines. Furthermore, distribution analysis confirmed that the model effectively overcomes the smoothing limitations of regression-based methods, generating high-resolution 10 m × 10 m maps that preserve statistical variance, hotspot anomalies, and complex spatial patterns. The results indicate that deep generative models can serve as practical decision-support tools for identifying drilling targets and prioritizing follow-up exploration in geologically complex settings. Full article
Show Figures

Figure 1

17 pages, 5286 KB  
Article
Enhanced Urban Heat Island Modeling with Machine Learning and Regression Kriging in a Topographically Diverse Medium-Sized City
by Iulian-Horia Holobaca, Mircea Alexe, Kinga Temerdek-Ivan and Cosmina-Daniela Ursu
Land 2026, 15(1), 49; https://doi.org/10.3390/land15010049 - 26 Dec 2025
Viewed by 327
Abstract
Urban heat islands (UHIs) represent a major environmental challenge, particularly in cities with complex topography and local air dynamics where spatial variability of the surface temperature is difficult to model. This study presents an enhanced UHI modeling framework that integrates Random Forest regression [...] Read more.
Urban heat islands (UHIs) represent a major environmental challenge, particularly in cities with complex topography and local air dynamics where spatial variability of the surface temperature is difficult to model. This study presents an enhanced UHI modeling framework that integrates Random Forest regression with Regression Kriging (RF–RK) in Cluj-Napoca, Romania. Using a network of 36 air temperature sensors and spatial predictors such as urban environment and location parameters, we evaluated the performance of RF–RK against traditional Multiple Linear Regression (MLR). Results show that RF–RK achieved substantially higher accuracy (R2 = 0.844, RMSE = 0.241 °C) compared to MLR (R2 = 0.653, RMSE = 0.359 °C). Spatial patterns revealed that a notable thermal gradient can be seen between the western and eastern parts of the city and the extension of the heat island core eastward. The combined approach effectively captured both the non-linear relationships of predictors and the spatial autocorrelation of residuals, outperforming single-method models. These findings highlight the potential of hybrid machine learning–geostatistical frameworks for urban climate research and provide insights for urban planning and heat mitigation strategies in topographically diverse cities. Full article
(This article belongs to the Special Issue The Impact of Urban Planning on the Urban Heat Island Effect)
Show Figures

Figure 1

22 pages, 6114 KB  
Article
Remote Sensing Inversion of Full-Profile Topography Data for Coastal Wetlands Using Synergistic Multi-Platform Sensors from Space, Air, and Ground
by Jiabao Zhang, Jin Wang, Yu Dai, Yiyang Miao and Huan Li
Sensors 2025, 25(24), 7405; https://doi.org/10.3390/s25247405 - 5 Dec 2025
Viewed by 569
Abstract
This study proposes a “zonal inversion–fusion mosaicking” technical framework to address the challenge of acquiring continuous full-profile topography data in coastal wetland intertidal zones. The framework integrates and synergistically analyzes data from multi-platform sensors, including satellite, unmanned aerial vehicle (UAV), and ground-based instruments. [...] Read more.
This study proposes a “zonal inversion–fusion mosaicking” technical framework to address the challenge of acquiring continuous full-profile topography data in coastal wetland intertidal zones. The framework integrates and synergistically analyzes data from multi-platform sensors, including satellite, unmanned aerial vehicle (UAV), and ground-based instruments. Applied to the Min River Estuary wetland, this framework employs zone-specific optimization strategies: in the inundated zone, the topography was inverted using Landsat-9 OLI imagery and a Random Forest algorithm (R2 = 0.79, RMSE = 2.08 m); in the bare flat zone, a linear model was developed based on Sentinel-2 time-series imagery using the inundation frequency method, and it achieved an accuracy of R2 = 0.86 and RMSE = 0.34 m; and in the vegetated zone, high-precision topography was derived from UAV oblique photography with Kriging interpolation (RMSE = 0.10 m). The key innovation is the successful generation of a seamless full-profile digital elevation model (DEM) with an overall RMSE of 0.54 m through benchmark unification and precision-weighted fusion algorithms from the sensor data fusion perspective. This study demonstrates that the synergistic sensors framework effectively overcomes the limitations of single-sensor observations, providing a reliable and generalizable integrated solution for the full-profile topographic monitoring of tidal flats, which offers crucial support for coastal wetland research and management. Full article
(This article belongs to the Section Environmental Sensing)
Show Figures

Figure 1

22 pages, 3470 KB  
Article
A Multi-Sensor Machine Learning Framework for Field-Scale Soil Salinity Mapping Under Data-Scarce Conditions
by Joyce Mongai Chindong, Jamal-Eddine Ouzemou, Ahmed Laamrani, Ali El Battay, Soufiane Hajaj, Hassan Rhinane and Abdelghani Chehbouni
Remote Sens. 2025, 17(22), 3778; https://doi.org/10.3390/rs17223778 - 20 Nov 2025
Viewed by 1333
Abstract
Soil salinity severely constrains agricultural productivity and soil health, particularly in arid and semi-arid regions. Conventional salinity assessment methods are labor-intensive, time-consuming, and spatially limited. This study developed a data-scarce workflow integrating proximal sensing (EM38-MK2), very high-resolution multispectral imagery, and machine learning to [...] Read more.
Soil salinity severely constrains agricultural productivity and soil health, particularly in arid and semi-arid regions. Conventional salinity assessment methods are labor-intensive, time-consuming, and spatially limited. This study developed a data-scarce workflow integrating proximal sensing (EM38-MK2), very high-resolution multispectral imagery, and machine learning to map soil salinity at field scale in the semi-arid Sehb El Masjoune area, central Morocco. A total of 26 soil samples were analyzed for Electrical Conductivity (EC), and 500 Apparent Electrical Conductivity (ECa) measurements were collected and calibrated using the field samples. Spectral and topographic covariates derived from Unmanned Aerial Vehicle (UAV) and PlanetScope imagery supported model training using Partial Least Squares Regression (PLSR), Support Vector Regression (SVR), Random Forest (RF), and a Stacked Ensemble Learning Model (ELM). Regression Kriging (RK) was applied to model residuals to improve spatial prediction. ELM achieved the highest accuracy (R2 = 0.87, RMSE ≈ 4.15), followed by RF, which effectively captured nonlinear spatial patterns. RK improved PLSR accuracy (by 11.1% for PlanetScope, 13.8% for UAV) but offered limited gains for RF, SVR, and ELM. SHAP analysis identified topographic covariates as the most influential predictors. Both UAV and PlanetScope delineated similar saline–sodic zones. The study demonstrates the following: (1) a scalable, data-efficient workflow for salinity mapping; (2) model and RK performance depend more on algorithmic design than sensor type; (3) interpretable ML and spatial modeling enhance understanding of salinity processes in semi-arid systems. Full article
Show Figures

Figure 1

27 pages, 3207 KB  
Article
Interpolation and Machine Learning Methods for Sub-Hourly Missing Rainfall Data Imputation in a Data-Scarce Environment: One- and Two-Step Approaches
by Mohamed Boukdire, Çağrı Alperen İnan, Giada Varra, Renata Della Morte and Luca Cozzolino
Hydrology 2025, 12(11), 297; https://doi.org/10.3390/hydrology12110297 - 10 Nov 2025
Cited by 1 | Viewed by 919
Abstract
Complete sub-hourly rainfall datasets are critical for accurate flood modeling, real-time forecasting, and understanding of short-duration rainfall extremes. However, these datasets often contain missing values due to sensor or transmission failures. Recovering missing values (or filling these data gaps) at high temporal resolution [...] Read more.
Complete sub-hourly rainfall datasets are critical for accurate flood modeling, real-time forecasting, and understanding of short-duration rainfall extremes. However, these datasets often contain missing values due to sensor or transmission failures. Recovering missing values (or filling these data gaps) at high temporal resolution is challenging due to the imbalance between rain and no-rain periods. In this study, we developed and tested two approaches for the imputation of missing 10-min rainfall data by means of machine learning (Multilayer Perceptron and Random Forest) and interpolation methods (Inverse Distance Weighting and Ordinary Kriging). The (a) direct approach operates on raw data to directly feed the imputation models, while the (b) two-step approach first classifies time steps as rain or no-rain with a Random Forest classifier and subsequently applies an imputation model to predicted rainfall depth instances classified as rain. Each approach was tested under three spatial scenarios: using all nearby stations, using stations within the same cluster, and using the three most highly correlated stations. An additional test involved the comparison of the results obtained using data from the imputed time interval only and data from a time window containing several time intervals before and after the imputed time interval. The methods were evaluated with reference to two different environments, mountainous and coastal, in Campania region (Southern Italy), under data-scarce conditions where rainfall depth is the only available variable. With reference to the application of the two-step approach, the Random Forest classifier shows a good performance both in the mountainous and in the coastal area, with an average weighted F1 score of 0.961 and 0.957, and an average Accuracy of 0.928 and 0.946, respectively. The highest performance in the regression step is obtained by the Random Forest in the mountainous area with an R2 of 0.541 and an RMSE of 0.109 mm, considering a spatial configuration including all stations. The comparison with the direct approach results shows that the two-step approach consistently improves accuracy across all scenarios, highlighting the benefits gained from breaking the data imputation process in stages where different physical conditions (in this case, rain and no-rain) are separately managed. Another important finding is that the use of time windows containing data lagged with respect to the imputed time interval allows capturing the atmospheric dynamics by connecting rainfall instances at different time levels and distant stations. Finally, the study confirms that machine learning models outperform spatial interpolation methods, thanks to their ability to manage data with complicated internal structure. Full article
Show Figures

Figure 1

28 pages, 22819 KB  
Article
Enhanced Spatially Explicit Modeling of Soil Particle Size and Texture Classification Using a Novel Two-Point Machine Learning Hybrid Framework
by Liya Qin, Zong Wang and Xiaoyuan Zhang
Agriculture 2025, 15(19), 2008; https://doi.org/10.3390/agriculture15192008 - 25 Sep 2025
Viewed by 530
Abstract
Accurately predicting soil particle size fractions (PSFs) and classifying soil texture types are essential for soil resource assessment and sustainable land management. PSFs, comprising clay, silt, and sand, form a compositional dataset constrained to sum to 100%. The practical implications of incorporating compositional [...] Read more.
Accurately predicting soil particle size fractions (PSFs) and classifying soil texture types are essential for soil resource assessment and sustainable land management. PSFs, comprising clay, silt, and sand, form a compositional dataset constrained to sum to 100%. The practical implications of incorporating compositional data characteristics into PSF mapping remain insufficiently explored. This study applies a two-point machine learning (TPML) model, integrating spatial autocorrelation and attribute similarity, to enhance both the quantitative prediction of PSFs and the categorical classification of soil texture types in the Heihe River Basin, China. TPML was compared with random forest regression kriging (RFRK), random forest (RF), XGBoost, and ordinary kriging (OK), and a novel TPML-C model was developed for multi-class classification tasks. Results show that TPML achieved R2 values of 0.58, 0.55, and 0.64 for clay, silt, and sand, respectively. Among all models, the ALR_TPML predictions showed the most consistent agreement with the observed variability, with predicted ranges of 2.63–98.28% for silt, 0.26–36.16% for clay, and 0.64–96.90% for sand. Across all models, the dominant soil texture types were identified as Sandy Loam (SaLo), Loamy Sand (LoSa), and Silty Loam (SiLo). For soil texture classification, TPML with raw, ALR-, and ILR-transformed data reached right ratios of 61.09%, 55.78%, and 60.00%, correctly identifying 25, 26, and 27 types out of 43. TPML with raw data exhibited strong performance in both regression and classification, with superior ability to separate ambiguous boundaries. Log-ratio transformations, particularly ILR, further improved classification performance by addressing the constraints of compositional data. These findings demonstrate the promise of hybrid machine learning approaches for digital soil mapping and precision agriculture. Full article
(This article belongs to the Section Agricultural Soils)
Show Figures

Figure 1

18 pages, 9202 KB  
Article
Fine-Scale Mapping and Uncertainty Quantification of Intertidal Sediment Grain Size Using Geostatistical Simulation Integrated with Machine Learning and High-Resolution Remote Sensing Imagery
by No-Wook Park and Dong-Ho Jang
Remote Sens. 2025, 17(18), 3230; https://doi.org/10.3390/rs17183230 - 18 Sep 2025
Viewed by 810
Abstract
This study presents a geostatistical simulation approach for fine-scale grain size mapping in tidal flats, which complements sparse field survey data with high-resolution optical satellite imagery and quantifies prediction uncertainty at unsampled locations. Within a multi-Gaussian regression kriging (MGRK) framework, a random forest [...] Read more.
This study presents a geostatistical simulation approach for fine-scale grain size mapping in tidal flats, which complements sparse field survey data with high-resolution optical satellite imagery and quantifies prediction uncertainty at unsampled locations. Within a multi-Gaussian regression kriging (MGRK) framework, a random forest (RF) regression model is used to estimate the trend component of grain size variability in Gaussian space. Residual components are estimated using kriging, and the trend and residual components are combined to construct conditional cumulative distribution functions for uncertainty modeling. Sequential Gaussian simulation based on the CCDFs generates alternative realizations of grain size, allowing for quantification of prediction uncertainty. The potential of this integrated approach was tested on the Baramarae tidal flat in Korea using KOMPSAT-2 imagery. Three spectral features, the green band, red band, and normalized difference water index (NDWI), explained 42.74% of the grain size variability, with NDWI identified as the most influential feature, contributing 40.8% compared with 31.7% for the red band and 27.5% for the green band. MGRK effectively captured local grain size variations, reducing the mean absolute error from 0.554 to 0.280 compared with univariate kriging based solely on field survey data, corresponding to an improvement of approximately 49.5%. The benefit of the proposed approach was validated by a reduction in prediction uncertainty, with the mean standard deviation decreasing from 0.743 in simulations based solely on field data to 0.280 in MGRK-based simulations. These findings indicate that the proposed geostatistical approach, integrating satellite-derived features, is a reliable method for fine-scale mapping of intertidal sediment grain size by providing both predictions and associated uncertainty estimates. Full article
(This article belongs to the Section Environmental Remote Sensing)
Show Figures

Graphical abstract

29 pages, 35542 KB  
Article
A Novel Remote Sensing Framework Integrating Geostatistical Methods and Machine Learning for Spatial Prediction of Diversity Indices in the Desert Steppe
by Zhaohui Tang, Chuanzhong Xuan, Tao Zhang, Xinyu Gao, Suhui Liu, Yaobang Song and Fang Guo
Agriculture 2025, 15(18), 1926; https://doi.org/10.3390/agriculture15181926 - 11 Sep 2025
Cited by 2 | Viewed by 812
Abstract
Accurate assessments are vital for the effective conservation of desert steppe ecosystems, which are essential for maintaining biodiversity and ecological balance. Although geostatistical methods are commonly used for spatial modeling, they have limitations in terms of feature extraction and capturing non-linear relationships. This [...] Read more.
Accurate assessments are vital for the effective conservation of desert steppe ecosystems, which are essential for maintaining biodiversity and ecological balance. Although geostatistical methods are commonly used for spatial modeling, they have limitations in terms of feature extraction and capturing non-linear relationships. This study therefore proposes a novel remote sensing framework that integrates geostatistical methods and machine learning to predict the Shannon–Wiener index in desert steppe. Five models, Kriging interpolation, Random Forest, Support Vector Machine, 3D Convolutional Neural Network and Graph Attention Network, were employed for parameter inversion. The Helmert variance component estimation method was introduced to integrate the model outputs by iteratively evaluating residuals and assigning relative weights, enabling both optimal prediction and model contribution quantification. The ensemble model yielded a high prediction accuracy with an R2 of 0.7609. This integration strategy improves the accuracy of index prediction, and enhances the interpretability of the model regarding weight contributions in space. The proposed framework provides a reliable, scalable solution for biodiversity monitoring and supports scientific decision-making for grassland conservation and ecological restoration. Full article
(This article belongs to the Section Artificial Intelligence and Digital Agriculture)
Show Figures

Figure 1

20 pages, 2723 KB  
Article
Downscaling of Urban Land Surface Temperatures Using Geospatial Machine Learning with Landsat 8/9 and Sentinel-2 Imagery
by Ratovoson Robert Andriambololonaharisoamalala, Petra Helmholz, Dimitri Bulatov, Ivana Ivanova, Yongze Song, Susannah Soon and Eriita Jones
Remote Sens. 2025, 17(14), 2392; https://doi.org/10.3390/rs17142392 - 11 Jul 2025
Cited by 1 | Viewed by 3131
Abstract
Urban surface temperatures are increasing because of climate change and rapid urbanisation, contributing to the urban heat island (UHI) effect and significantly influencing local climates. Satellite-derived land surface temperature (LST) plays a vital role in analysing urban thermal patterns. However, current satellite thermal [...] Read more.
Urban surface temperatures are increasing because of climate change and rapid urbanisation, contributing to the urban heat island (UHI) effect and significantly influencing local climates. Satellite-derived land surface temperature (LST) plays a vital role in analysing urban thermal patterns. However, current satellite thermal infrared (TIR) sensors have a low spatial resolution, making it difficult to accurately capture the complex thermal variations within urban areas. This limitation affects the assessments of UHI effects and hinders effective mitigation strategies. We proposed a hybrid model named “geospatial machine learning” (GeoML) to address these challenges, combining random forest and kriging downscaling techniques. This method utilises high spatial resolution data from Sentinel-2 to enhance the LST derived from Landsat 8/9 data. Tested in Perth, Australia, GeoML generated an enhanced LST with good agreement with ground-based measurements, with a Pearson’s correlation coefficient of 0.85, a root mean square error (RMSE) of 2.7 °C, and a mean absolute error (MAE) of less than 2.2 °C. Validation with LST derived from another TIR sensor also provided promising outputs. The results were compared with the high-resolution urban thermal sharpener (HUTS) downscaling methods, which GeoML outperformed, demonstrating its effectiveness as a valuable tool for urban thermal studies involving high-resolution LST data. Full article
(This article belongs to the Special Issue Remote Sensing Applications in Urban Environment and Climate)
Show Figures

Figure 1

20 pages, 4216 KB  
Article
Stochastic Blade Pitch Angle Analysis of Controllable Pitch Propeller Based on Deep Neural Networks
by Xuanqi Zhang, Wenbin Shao, Yongshou Liu, Xin Fan and Ruiyun Shi
Modelling 2025, 6(3), 54; https://doi.org/10.3390/modelling6030054 - 25 Jun 2025
Viewed by 809
Abstract
The accuracy of the blade pitch angle (BPA) motion in controllable pitch propellers (CPPs) is considered crucial for the efficacy and reliability of marine propulsion systems. The pitch adjustment process of CPPs is highly complex and influenced by various uncertain factors. A parametric [...] Read more.
The accuracy of the blade pitch angle (BPA) motion in controllable pitch propellers (CPPs) is considered crucial for the efficacy and reliability of marine propulsion systems. The pitch adjustment process of CPPs is highly complex and influenced by various uncertain factors. A parametric kinematic model for the pitch adjustment process for CPPs was established, incorporating the geometric dimensions and material surface friction coefficients caused during workpiece production as uncertainty parameters. The aim was to establish the correspondence between these uncertainty parameters and the BPA of CPPs. A large dataset was generated by batch calling on Adams. Based on the collected dataset, five surrogate models (e.g., deep neural network (DNN), Kriging, support vector regression (SVR), random forest (RF), and polynomial chaos expansion Kriging (PCK)) were constructed to predict the BPA. Among these, the DNN approach demonstrated the highest prediction accuracy. Accordingly, the influence of uncertainties on the BPA was investigated using the DNN model, focusing on variations in the slider width, crank pin diameter, crank disc diameter, piston rod–slider friction coefficient, crank pin–slider friction coefficient, and hub bearing–crank disc friction coefficient. The high-fidelity model established in this study can replace the kinematic model of the CPP pitch adjustment process, significantly improving computational efficiency. The research findings also provide important references for the design optimization of CPPs. Full article
Show Figures

Figure 1

18 pages, 1063 KB  
Article
Multi-Model and Variable Combination Approaches for Improved Prediction of Soil Heavy Metal Content
by Xiaolong Chen, Hongfeng Zhang, Cora Un In Wong and Zhengchun Song
Processes 2025, 13(7), 2008; https://doi.org/10.3390/pr13072008 - 25 Jun 2025
Cited by 4 | Viewed by 1077
Abstract
Soil heavy metal contamination poses significant risks to ecosystems and human health, necessitating accurate prediction methods for effective monitoring and remediation. We propose a multi-model and variable combination framework to improve the prediction of soil heavy metal content by integrating diverse environmental and [...] Read more.
Soil heavy metal contamination poses significant risks to ecosystems and human health, necessitating accurate prediction methods for effective monitoring and remediation. We propose a multi-model and variable combination framework to improve the prediction of soil heavy metal content by integrating diverse environmental and spatial features. The methodology incorporates environmental variables (e.g., soil properties, remote sensing indices), spatial autocorrelation measures based on nearest-neighbor distances, and spatial regionalization variables derived from interpolation techniques such as ordinary kriging, inverse distance weighting, and trend surface analysis. These variables are systematically combined into six distinct sets to evaluate their predictive performance. Three advanced models—Partial Least Squares Regression, Random Forest, and a Deep Forest variant (DF21)—are employed to assess the robustness of the approach across different variable combinations. Experimental results demonstrate that the inclusion of spatial autocorrelation and regionalization variables consistently enhances prediction accuracy compared to using environmental variables alone. Furthermore, the proposed framework exhibits strong generalizability, as validated through subset analyses with reduced training data. The study highlights the importance of integrating spatial dependencies and multi-source data for reliable heavy metal prediction, offering practical insights for environmental management and policy-making. Compared to using environmental variables alone, the full framework incorporating spatial features achieved relative improvements of 18–23% in prediction accuracy (R2) across all models, with the Deep Forest variant (DF21) showing the most substantial enhancement. The findings advance the field by providing a flexible and scalable methodology adaptable to diverse geographical contexts and data availability scenarios. Full article
(This article belongs to the Special Issue Environmental Protection and Remediation Processes)
Show Figures

Figure 1

23 pages, 8102 KB  
Article
Ensemble Learning for Spatial Modeling of Icing Fields from Multi-Source Remote Sensing Data
by Shaohui Zhou, Zhiqiu Gao, Bo Gong, Hourong Zhang, Haipeng Zhang, Jinqiang He and Xingya Xi
Remote Sens. 2025, 17(13), 2155; https://doi.org/10.3390/rs17132155 - 23 Jun 2025
Viewed by 840
Abstract
Accurate real-time icing grid fields are critical for preventing ice-related disasters during winter and protecting property. These fields are essential for both mapping ice distribution and predicting icing using physical models combined with numerical weather prediction systems. However, developing precise real-time icing grids [...] Read more.
Accurate real-time icing grid fields are critical for preventing ice-related disasters during winter and protecting property. These fields are essential for both mapping ice distribution and predicting icing using physical models combined with numerical weather prediction systems. However, developing precise real-time icing grids is challenging due to the uneven distribution of monitoring stations, data confidentiality restrictions, and the limitations of existing interpolation methods. In this study, we propose a new approach for constructing real-time icing grid fields using 1339 online terminal monitoring datasets provided by the China Southern Power Grid Research Institute Co., Ltd. (CSPGRI) during the winter of 2023. Our method integrates static geographic information, dynamic meteorological factors, and ice_kriging values derived from parameter-optimized Empirical Bayesian Kriging Interpolation (EBKI) to create a spatiotemporally matched, multi-source fused icing thickness grid dataset. We applied five machine learning algorithms—Random Forest, XGBoost, LightGBM, Stacking, and Convolutional Neural Network Transformers (CNNT)—and evaluated their performance using six metrics: R, RMSE, CSI, MAR, FAR, and fbias, on both validation and testing sets. The stacking model performed best, achieving an R-value of 0.634 (0.893), RMSE of 3.424 mm (2.834 mm), CSI of 0.514 (0.774), MAR of 0.309 (0.091), FAR of 0.332 (0.161), and fbias of 1.034 (1.084), respectively, when comparing predicted icing values with actual measurements on pylons. Additionally, we employed the SHAP model to provide a physical interpretation of the stacking model, confirming the independence of selected features. Meteorological factors such as relative humidity (RH), 10 m wind speed (WS10), 2 m temperature (T2), and precipitation (PRE) demonstrated a range of positive and negative contributions consistent with the observed growth of icing. Thus, our multi-source remote-sensing data-fusion approach, combined with the stacking model, offers a highly accurate and interpretable solution for generating real-time icing grid fields. Full article
(This article belongs to the Special Issue Remote Sensing for High Impact Weather and Extremes (2nd Edition))
Show Figures

Figure 1

26 pages, 4304 KB  
Article
A Hybrid Regression–Kriging–Machine Learning Framework for Imputing Missing TROPOMI NO2 Data over Taiwan
by Alyssa Valerio, Yi-Chun Chen, Chian-Yi Liu, Yi-Ying Chen and Chuan-Yao Lin
Remote Sens. 2025, 17(12), 2084; https://doi.org/10.3390/rs17122084 - 17 Jun 2025
Viewed by 2270
Abstract
This study presents a novel application of a hybrid regression–kriging (RK) and machine learning (ML) framework to impute missing tropospheric NO2 data from the TROPOMI satellite over Taiwan during the winter months of January, February, and December 2022. The proposed approach combines [...] Read more.
This study presents a novel application of a hybrid regression–kriging (RK) and machine learning (ML) framework to impute missing tropospheric NO2 data from the TROPOMI satellite over Taiwan during the winter months of January, February, and December 2022. The proposed approach combines geostatistical interpolation with nonlinear modeling by integrating RK with ML models—specifically comparing gradient boosting regression (GBR), random forest (RF), and K-nearest neighbors (KNN)—to determine the most suitable auxiliary predictor. This structure enables the framework to capture both spatial autocorrelation and complex relationships between NO2 concentrations and environmental drivers. Model performance was evaluated using the coefficient of determination (r2), computed against observed TROPOMI NO2 column values filtered by quality assurance criteria. GBR achieved the highest validation r2 values of 0.83 for January and February, while RF yielded 0.82 and 0.79 in January and December, respectively. These results demonstrate the model’s robustness in capturing intra-seasonal patterns and nonlinear trends in NO2 distribution. In contrast, models using only static land cover inputs performed poorly (r2 < 0.58), emphasizing the limited predictive capacity of such variables in isolation. Interpretability analysis using the SHapley Additive exPlanations (SHAP) method revealed temperature as the most influential meteorological driver of NO2 variation, particularly during winter, while forest cover consistently emerged as a key land-use factor mitigating NO2 levels through dry deposition. By integrating dynamic meteorological variables and static land cover features, the hybrid RK–ML framework enhances the spatial and temporal completeness of satellite-derived air quality datasets. As the first RK–ML application for TROPOMI data in Taiwan, this study establishes a regional benchmark and offers a transferable methodology for satellite data imputation. Future research should explore ensemble-based RK variants, incorporate real-time auxiliary data, and assess transferability across diverse geographic and climatological contexts. Full article
Show Figures

Figure 1

13 pages, 2203 KB  
Article
Evaluating Spatio-Temporal Kriging with Machine Learning Considering the Sources of Spatio-Temporal Variation
by Min Jeong and Hyeongmo Koo
ISPRS Int. J. Geo-Inf. 2025, 14(6), 224; https://doi.org/10.3390/ijgi14060224 - 5 Jun 2025
Viewed by 2859
Abstract
Integrating spatio-temporal kriging with machine learning improves estimation accuracy by addressing complex spatial and temporal variations in spatio-temporal phenomena. The improvement can be attributed to the enhanced flexibility of machine learning in capturing non-linear global trends, which traditional methods struggle to model, while [...] Read more.
Integrating spatio-temporal kriging with machine learning improves estimation accuracy by addressing complex spatial and temporal variations in spatio-temporal phenomena. The improvement can be attributed to the enhanced flexibility of machine learning in capturing non-linear global trends, which traditional methods struggle to model, while kriging remains effective in representing spatio-temporal interactions. However, differences in the estimated global trends and spatio-temporal interactions resulting from applying machine learning may influence the spatio-temporal variation patterns of the kriging results. Therefore, this study evaluates the effectiveness of machine learning in spatio-temporal kriging using NO2 concentrations in Seoul, focusing on its impact on overall accuracy and the contributions to global trends and spatio-temporal interactions. The results show that integrating machine learning enhances overall accuracy relative to ordinary spatio-temporal kriging. Global trend estimates differ by the models, with polynomial regression producing smoother patterns but larger errors, while random forest and boosting yield more abrupt patterns with smaller errors. These differences lead to smoother kriging outcomes in the polynomial model and more discrete patterns in the ensemble-based models. This study highlights the importance of considering both overall estimation accuracy and spatio-temporal patterns when selecting kriging methods. Full article
Show Figures

Figure 1

Back to TopTop