MDPI - Publisher of Open Access Journals

34 pages, 8819 KB

Open AccessArticle

Mitigating Overfitting and Physical Inconsistency in Flood Susceptibility Mapping: A Physics-Constrained Evolutionary Machine Learning Framework for Ungauged Alpine Basins

by Chuanjie Yan, Lingling Wu, Peng Huang, Jiajia Yue, Haowen Li, Chun Zhou, Congxiang Fan, Yinan Guo and Li Zhou

Water 2026, 18(7), 882; https://doi.org/10.3390/w18070882 - 7 Apr 2026

Viewed by 277

Abstract

Flood susceptibility mapping in high-altitude ungauged basins faces a structural dichotomy: physically based models often suffer from systematic biases due to uncertain satellite precipitation, whereas data-driven models are prone to overfitting and lack physical consistency in data-scarce regions. To resolve this, this study [...] Read more.

Flood susceptibility mapping in high-altitude ungauged basins faces a structural dichotomy: physically based models often suffer from systematic biases due to uncertain satellite precipitation, whereas data-driven models are prone to overfitting and lack physical consistency in data-scarce regions. To resolve this, this study proposes a Physically constrained Particle Swarm Optimization–Random Forest (P-PDRF) framework, validated in the Lhasa River Basin. The core innovation lies in coupling a hydrological model with statistical learning by utilizing the maximum daily runoff depth as a “Relative Hydraulic Intensity Index.” This approach leverages the topological correctness of physical simulations to circumvent absolute forcing errors. Furthermore, a Physiographically Constrained Negative Sampling (PCNS) strategy and a PSO-optimized “Shallow Tree” configuration are introduced to enforce structural regularization against stochastic noise. Empirical results demonstrate that P-PDRF achieves superior generalization (AUC = 0.942), significantly outperforming standard Random Forest, Support Vector Machine, and Analytic Hierarchy Process models. Ablation studies confirm that the dynamic index outweighs the static Topographic Wetness Index in feature importance, effectively correcting topographic artifacts where static models misclassify arid depressions as high-risk zones. This study offers a scalable Physics-Informed Machine Learning solution for the global “Prediction in Ungauged Basins” initiative. Full article

(This article belongs to the Special Issue Urban Flood Risk Assessment and Management)

► Show Figures

Figure 1

19 pages, 7201 KB

Open AccessArticle

Functional Variation in Morphological and Wood Traits Across 38 Timber Species of the Northern Colombian Amazon

by Carolina Martínez-Guevara, Bernardo Giraldo Benavides, Orlando Martínez Wilches and Jaime Barrera García

Forests 2026, 17(4), 454; https://doi.org/10.3390/f17040454 - 4 Apr 2026

Viewed by 239

Abstract

Functional traits help to understand plant ecological strategies and play a determinant role in restoration. This study evaluated interspecific variability among 38 timber species of bioeconomic importance associated with natural forests and forest trials in the northern Colombian Amazon, identifying Plant Functional Types [...] Read more.

Functional traits help to understand plant ecological strategies and play a determinant role in restoration. This study evaluated interspecific variability among 38 timber species of bioeconomic importance associated with natural forests and forest trials in the northern Colombian Amazon, identifying Plant Functional Types (PFTs) and their implications for productive restoration. Soft and hard traits were integrated, including tree morphological characteristics (diameter at breast height, total height, and crown cover) and wood functional traits (wood basic specific gravity, SG; maximum moisture content; fiber diameter and wall thickness; and vessel diameter and density). Correlations among these traits were also assessed. Five PFTs were identified. PFTs 1 and 2 grouped species with acquisitive strategies and high hydraulic efficiency, making them suitable for rapid vegetation cover recovery. In contrast, PFT 5 included conservative and hydraulically safe species, appropriate for enrichment processes once vegetation cover has been established. PFTs 3 and 4 represented intermediate strategies. Additionally, tree size was found to directly influence stem hydraulic architecture, and distinct anatomical configurations may occur within similar SG ranges, highlighting the need to integrate multi-trait approaches, as this trait alone does not fully capture the hydraulic and mechanical strategies of species. Full article

(This article belongs to the Special Issue Advances in Wood Anatomy: From Species Identification to Ecological and Technological Insights)

► Show Figures

Figure 1

23 pages, 1837 KB

Open AccessArticle

Use of Machine Learning for Solar Power Generation Prediction in the Field of Alternative Renewable Energy Sources

by Juan D. Parra-Quintero, Daniel Ovalle-Cerquera, Edwin Chica and Ainhoa Rubio-Clemente

Technologies 2026, 14(4), 206; https://doi.org/10.3390/technologies14040206 - 31 Mar 2026

Viewed by 415

Abstract

This study focused on the application of supervised learning in the field of renewable energy, specifically for predicting daily solar irradiance in Neiva, department of Huila, Colombia. To this end, decision tree and artificial neural network (DT and ANN, respectively) models were trained [...] Read more.

This study focused on the application of supervised learning in the field of renewable energy, specifically for predicting daily solar irradiance in Neiva, department of Huila, Colombia. To this end, decision tree and artificial neural network (DT and ANN, respectively) models were trained and tested using the online tool Google Colab. The main objective was based on the need to optimize energy planning processes at local and regional levels, motivated by the increase in demand for the integration of non-conventional energy sources and the spatial–temporal variability in solar resources in the country. A dataset consisting of 366 daily records for the year 2024 was obtained from the NASA POWER database at the geographic coordinates (2.930079, −75.255650) and used for training and evaluating the proposed models. Statistical and cleaning techniques were used, including the treatment of outliers using the moving-window median for the latter. Metrics, such as mean absolute error (

M A E

), root mean square error (

R M S E

), and coefficient of determination (

R^{2}

), were used to evaluate the models. Data inclusion and exclusion criteria were applied to ensure the quality and validity of the observations. Model performance was evaluated using a randomized Hold-Out validation strategy (90% training and 10% testing), which was repeated across multiple iterations. The performance metrics reported corresponded to the 10th iteration of the validation process after outlier treatment. Under this configuration, the DT model achieved a higher predictive performance (

R^{2}

= 0.8882) compared with the ANN model (

R^{2}

= 0.7679), demonstrating its effectiveness as a reliable approach for estimating daily solar irradiance under the studied conditions. This result was also confirmed by the decreased

M A E

and

R M S E

for the DT model, which indicated that this model performed better in predicting the real values than the ANN model. Finally, the added value of the study is to consolidate national evidence and open access tools to facilitate the development of sustainable energy policies in intermediate cities such as Neiva. Full article

(This article belongs to the Special Issue Artificial Intelligence for Energy Integration and Efficiency in Photovoltaic and Thermal Solar Systems)

► Show Figures

Figure 1

24 pages, 3168 KB

Open AccessArticle

Application of Machine Learning Models to Oil Refinery Programming

by Evar Umeozor

Processes 2026, 14(7), 1072; https://doi.org/10.3390/pr14071072 - 27 Mar 2026

Viewed by 358

Abstract

Transparent and evidence-based representations of global crude oil refining systems remain limited in the public literature, constraining robust energy systems modeling and policy analysis. This study develops a comprehensive, configuration-based modeling framework for all operating crude oil refineries worldwide using plant-level process unit [...] Read more.

Transparent and evidence-based representations of global crude oil refining systems remain limited in the public literature, constraining robust energy systems modeling and policy analysis. This study develops a comprehensive, configuration-based modeling framework for all operating crude oil refineries worldwide using plant-level process unit data. Forty unique refinery configurations are identified through an unsupervised decision tree-based clustering approach that accounts for process unit presence and relative conversion intensity. An extremely randomized trees (ETR) machine learning model is trained on approximately 11,000 refinery-year observations to predict refined product yields as a function of refinery configuration, capacity, and crude oil diet. The model achieves out-of-sample coefficients of determination exceeding 0.90 for all major products and outperforms multiple linear regression and other ensemble methods. The predictive model is integrated with a differential evolution optimization algorithm to enable refinery programming under operational and feedstock constraints. The application of this model to Gulf Cooperation Council (GCC) refineries shows that, under existing technologies, petrochemical feedstock yields are bounded at approximately 37%, significantly below announced long-term diversification targets of 70–85%. Yield improvements of up to 6 percentage points are feasible through operational optimization but are associated with capacity utilization adjustments and product trade-offs. The framework provides a scalable tool for refinery benchmarking, energy transition analysis, and strategic planning across facility, national, and global levels. Full article

(This article belongs to the Special Issue Feature Review Papers in Section "Chemical Processes and Systems")

► Show Figures

Graphical abstract

27 pages, 4998 KB

Open AccessArticle

Machine Learning-Based Human Detection Using Active Non-Line-of-Sight Laser Sensing

by Semra Çelebi and İbrahim Türkoğlu

Sensors 2026, 26(7), 2046; https://doi.org/10.3390/s26072046 - 25 Mar 2026

Viewed by 340

Abstract

Active non-line-of-sight (NLOS) human detection aims to infer the presence of hidden individuals by analyzing indirectly reflected photons between a relay surface and occluded targets. In this study, a single-photon avalanche diode (SPAD) and time-correlated single-photon counting (TCSPC)-based acquisition system were used to [...] Read more.

Active non-line-of-sight (NLOS) human detection aims to infer the presence of hidden individuals by analyzing indirectly reflected photons between a relay surface and occluded targets. In this study, a single-photon avalanche diode (SPAD) and time-correlated single-photon counting (TCSPC)-based acquisition system were used to measure time–photon waveforms in controlled NLOS environments designed to represent post-disaster rubble scenarios. Although the effective temporal resolution of the system is limited by the detector timing jitter and laser pulse width, the recorded transient signals retain distinguishable intensity and temporal delay patterns associated with the primary and secondary reflections. To construct a representative dataset, measurements were collected under varying subject poses, orientations, and surrounding object configurations. The recorded signals were processed using a unified preprocessing pipeline that included normalization, histogram shaping, and signal windowing. Three machine learning models, namely, Convolutional Neural Network, Gated Recurrent Unit, and Random Forest, were trained and evaluated for human presence classification. All models achieved full sensitivity in detecting human presence; however, notable differences emerged in the classification of human-absent scenarios. Among the tested approaches, random forest achieved the highest overall accuracy and specificity, demonstrating stronger robustness to statistical variations in time–photon histograms under limited photon conditions. These results suggest that tree-based classifiers capture amplitude distribution patterns and temporal dispersion characteristics more effectively than deep neural architectures under the present acquisition constraints. Overall, the findings indicate that low-cost SPAD-based NLOS sensing systems can provide reliable human detection in indirect-observation scenarios. Full article

(This article belongs to the Special Issue AI-Based Sensing and Imaging Applications)

► Show Figures

Figure 1

23 pages, 11235 KB

Open AccessArticle

Programming Air Phytoremediation in Row−Alley Agroforestry Systems to Enhance Environmental Benefits: A Modelling Approach

by Ewa Podhajska, Robert Borek, Aleksandra Anna Halarewicz, Anetta Drzeniecka–Osiadacz, Bronisław Podhajski, Paweł Radzikowski, Małgorzata Głogowska and Barbara Ptak

Forests 2026, 17(4), 405; https://doi.org/10.3390/f17040405 - 24 Mar 2026

Viewed by 284

Abstract

Agroforestry, where trees and shrubs are planted in row-alley systems, can utilize the natural ability of plants to interact with pollutants and serve as a passive biotechnological method for improving air quality. A method for programming air phytoremediation processes is presented, using appropriately [...] Read more.

Agroforestry, where trees and shrubs are planted in row-alley systems, can utilize the natural ability of plants to interact with pollutants and serve as a passive biotechnological method for improving air quality. A method for programming air phytoremediation processes is presented, using appropriately shaped plant structures, considering species characteristics and the spatial configuration of plants in row-alley plantings. The main objectives of this study were: to determine the relationship between pollution reduction and the characteristics of plant communities, considering the parameters of individual plants and group characteristics, to determine strategic parameters for the interaction between plants and pollutant flows, and to identify optimization paths for each stage. The optimization of the air phytoremediation process is presented using the example of changes in the fine particulate matter (PM_2.5) concentration pattern, analyzed through numerical experiments using micrometeorological computational fluid dynamics models (ENVI-met software). Ex-ante analysis of hypothetical scenarios showed that introducing appropriate configurations of variable vegetation structure could lead to pollution reductions of up to 19%. The effectiveness of the presented plant systems qualifies this method as a type of bioengineering technology, supporting the multifunctionality of agroforestry systems. Full article

(This article belongs to the Section Forest Operations and Engineering)

► Show Figures

Figure 1

25 pages, 3363 KB

Open AccessArticle

Spatial Clustering of Front Yard Landscapes: Implications for Urban Soil Conservation and Green Infrastructure Sustainability in the Río Piedras Watershed

by L. Kidany Sellés and Elvia J. Meléndez-Ackerman

Sustainability 2026, 18(6), 2821; https://doi.org/10.3390/su18062821 - 13 Mar 2026

Viewed by 391

Abstract

Current sustainability discourse promotes sustainable yard practices as a means for residents to contribute to urban environmental health and soil conservation. Social–ecological research suggests that yard practices are shaped by multiscale social drivers, including social contagion, whereby visible expressions of individuality in front [...] Read more.

Current sustainability discourse promotes sustainable yard practices as a means for residents to contribute to urban environmental health and soil conservation. Social–ecological research suggests that yard practices are shaped by multiscale social drivers, including social contagion, whereby visible expressions of individuality in front yard design are copied by nearby neighbors. This study evaluated residential areas within the Río Piedras Watershed (RPWS) in the San Juan metropolitan area to assess evidence of social contagion in front yard configuration and vegetation structure, and to examine whether these variables were associated with socio-demographic and economic characteristics when spatial effects were considered. A total of 6858 front yards across six highly urbanized sites were analyzed using Google Earth Street View imagery. Housing lot sizes were quantified, and yards were classified into eight landscape configurations based on green and gray cover elements. Woody vegetation structures, including trees, shrubs, and palms, were also quantified to generate estimates of functional diversity and a front yard quality index. Significant differences in yard characteristics were observed among sites. Spatial analyses revealed significant clustering at distances of 65–80 m, particularly for front yard configuration, while clustering of woody vegetation density was weaker. Local clustering patterns and the distribution of outliers varied across sites. Spatial lag models indicated that lot area positively influenced yard configuration and quality, and the density and diversity of woody vegetation. While socio-economic variables were not significant predictors of yard quality, their effects cannot be discarded. Overall, results are consistent with social contagion processes but also highlight neighborhood design as a key driver of clustering, alongside widespread conversion of green to paved front yards, with implications for soil and green infrastructure loss as well as environmental and human health in the RPWS. Full article

(This article belongs to the Special Issue Ecosystem Sustainability, Soil Biodiversity, and Environmental Restoration)

► Show Figures

Figure 1

48 pages, 6469 KB

Open AccessArticle

Adaptive Instantaneous Frequency Synchrosqueezing Transform and Enhanced AdaBoost for Power Quality Disturbance Detection

by Chencheng He, Yuyi Lu and Wenbo Wang

Symmetry 2026, 18(3), 475; https://doi.org/10.3390/sym18030475 - 10 Mar 2026

Viewed by 189

Abstract

The integration of renewable energy and power electronics has intensified the occurrence of complex power quality disturbances (PQDs), which increasingly threaten grid stability. To address the challenges of multi-class PQD identification under noisy conditions, this paper proposes a novel framework that combines an [...] Read more.

The integration of renewable energy and power electronics has intensified the occurrence of complex power quality disturbances (PQDs), which increasingly threaten grid stability. To address the challenges of multi-class PQD identification under noisy conditions, this paper proposes a novel framework that combines an enhanced time–frequency analysis method with an optimized AdaBoost decision tree. The main contributions are three-fold: (1) We develop an instantaneous frequency adaptive Fourier synchrosqueezing transform (IFAFSST) equipped with a custom adaptive operator that aligns closely with the frequency modulation patterns in PQD signals, thereby improving time–frequency energy localization. (2) The IFAFSST outputs are decomposed into low-frequency and high-frequency components, from each of which a set of 16 discriminative features is extracted. (3) An improved AdaBoost classifier is introduced, incorporating forward feature selection and Hyperband-based hyperparameter optimization to enhance classification performance. Hyperband accelerates the optimization process by dynamically allocating computing resources and iteratively eliminating suboptimal configurations, thereby enabling efficient determination of the optimal hyperparameters. The method proposed in this paper achieved an accuracy rate of 99.50% on simulated data containing 30 dB white noise and 98.30% on hardware platform data. This framework can effectively handle 23 types of interference, including seven types of single interference, 12 types of double compound interference, three types of triple compound interference, and one type of quadruple compound interference. It performs particularly well in identifying composite interference scenarios. This research has made a significant contribution to power quality analysis, providing a powerful solution with high accuracy and practical applicability, and offering great potential for the implementation of smart grid monitoring systems and the integration of renewable energy. Full article

(This article belongs to the Section Engineering and Materials)

► Show Figures

Figure 1

31 pages, 28149 KB

Open AccessArticle

Geospatial Analysis of Land Cover Change During Solar and Wind Energy Installation in the Semi-Arid Region of Paraíba, Brazil

by Ada Liz Coronel Canata, Rafael dos Santos Gonçalves, Ivonete Alves Bakke, Lorena de Moura Melo, Olaf Andreas Bakke, Mayara Maria de Lima Pessoa, Arliston Pereira Leite, Maria Beatriz Ferreira, Elisama Soares dos Santos, Nítalo André Farias Machado and Marcos Vinícius da Silva

Environments 2026, 13(3), 149; https://doi.org/10.3390/environments13030149 - 10 Mar 2026

Viewed by 648

Abstract

Recent large-scale renewable energy projects, such as the Luzia Solar and Chafariz Wind energy plants in Santa Luzia, Paraíba, Brazil, raised environmental concerns due to their impact on vegetation cover and landscape structure. This study used geospatial technologies to evaluate changes in tree [...] Read more.

Recent large-scale renewable energy projects, such as the Luzia Solar and Chafariz Wind energy plants in Santa Luzia, Paraíba, Brazil, raised environmental concerns due to their impact on vegetation cover and landscape structure. This study used geospatial technologies to evaluate changes in tree cover and landscape configuration resulting from the installation of these projects. Sentinel-2 imagery processed in Google Earth Engine generated NDVI, SAVI, NDWIveg, and LAI vegetation index data for the dry and rainy seasons of the six years between 2019 and 2024. With these vegetation index values and considering MapBiomas (version 8.0) and FRAGSTATS software (version 4.2), we analyzed the changes in land use and vegetation cover of Santa Luzia municipality during this six-year period. Land use and vegetation cover remained stable from 2019 to 2020 (before the installation of the energy plants), characterized by an NDVI value of 0.60, while tree cover decreased in the following four years, during or after the installation of the energy plants, as indicated by the consistent decreases in NDVI and NDWIveg values. Grassland class areas declined from 41.80% (18,434.59 ha) in 2019, to 34.36% (15,151.22 ha) in 2023, while non-vegetated areas increased by 148%. Landscape metrics showed increased fragmentation, with patch density rising from 3.31 to 3.88 patches/100 ha and core area decreasing from 3045.60 ha to 1395.01 ha. These data demonstrated measurable ecological impacts linked to the infra-structure built to run the two solar and wind energy plants in the semi-arid region of Santa Luzia, Paraíba, Brazil. Full article

(This article belongs to the Special Issue Balancing Energy and Environmental Sustainability: Innovations, Impacts, and Pathways)

► Show Figures

Figure 1

23 pages, 3889 KB

Open AccessArticle

Enhanced Runoff Prediction in Zijiang River Basin Using Machine Learning and SHAP-Based Interpretability

by Kaiwen Ma, Changbo Jiang, Yuannan Long, Zhiyuan Wu and Shixiong Yan

Water 2026, 18(5), 601; https://doi.org/10.3390/w18050601 - 2 Mar 2026

Viewed by 418

Abstract

To address the limitations of traditional runoff prediction methods—namely, the oversimplification of meteorological factor selection, ambiguous interactions among core variables, and the disruptive influence of redundant inputs—this study focuses on the Zijiang River Basin as a representative case. A suite of machine learning [...] Read more.

To address the limitations of traditional runoff prediction methods—namely, the oversimplification of meteorological factor selection, ambiguous interactions among core variables, and the disruptive influence of redundant inputs—this study focuses on the Zijiang River Basin as a representative case. A suite of machine learning models, including Long Short-Term Memory Neural Network (LSTM), Convolutional Neural Network (CNN)-LSTM, Temporal Convolutional Network (TCN), and Gradient Boosting Regression Tree (GBRT), was constructed and trained using 13 distinct combinations of meteorological variables. These configurations were systematically evaluated to assess their compatibility with each model in simulating daily runoff patterns. Additionally, the Shapley Additive Explanations (SHAP) algorithm was employed to quantitatively assess the contribution of each factor to predictive accuracy. Among the models tested, the TCN model consistently demonstrated superior performance, particularly in mitigating the effects of irrelevant or redundant features. The GBRT model showed distinctive strengths in accurately predicting peak flow timings. Of all input configurations, the combination of “runoff + precipitation + evaporation + temperature” emerged as the most effective. Findings indicate that the predictive value of individual meteorological variables hinges primarily on their direct correlation with runoff, while the effectiveness of multi-factor schemes depends on the degree of functional integration—specifically, the coupling of hydrological recharge, consumption, and regulatory processes. The presence of redundant variables was found to impair model performance unless they contributed to a meaningful synergistic relationship with core inputs. The SHAP analysis further reinforced these insights: precipitation-related variables proved to be the most critical to prediction accuracy, whereas temperature and evaporation served more complementary roles. Notably, the inclusion of relative humidity tended to suppress runoff responses and increased deviation in peak timing estimates. These findings shed light on the nuanced interplay between meteorological input design and model selection, offering a robust foundation for optimizing data-driven runoff prediction frameworks. Full article

(This article belongs to the Special Issue Application of Machine Learning in Hydrological Monitoring)

► Show Figures

Figure 1

28 pages, 4477 KB

Open AccessFeature PaperArticle

Automated Microclimate Model Generation from Remote Sensing Data

by Max Spett, Kevin Lau and Agatino Rizzo

Land 2026, 15(2), 329; https://doi.org/10.3390/land15020329 - 14 Feb 2026

Cited by 1 | Viewed by 514

Abstract

The ongoing climate crisis has highlighted the need for sustainability and resilience in the development and maintenance of urban areas regarding climate comfort. Weather simulation tools can aid researchers in understanding the effects that weather has on the microclimate in urban areas. While [...] Read more.

The ongoing climate crisis has highlighted the need for sustainability and resilience in the development and maintenance of urban areas regarding climate comfort. Weather simulation tools can aid researchers in understanding the effects that weather has on the microclimate in urban areas. While simulations are handled autonomously by computers once set up, the creation of the requisite input urban models is still a highly manual process. In this study, a novel method for the automated generation of urban models using land and cadastral remote sensing data is presented. By analyzing grass, trees, buildings, and roads algorithmically, data can be extracted and configured into spatial models compatible with microclimate simulation software such as ENVI-Met. Comparison to a baseline model shows that our method enables the creation of models fit for use for exploring microclimate scenarios in the urban environment, saving time by eliminating the need for manual processing. Full article

(This article belongs to the Special Issue Big Data in Urban Land Use Planning)

► Show Figures

Figure 1

28 pages, 883 KB

Open AccessArticle

Graph-Guided Genetic Algorithm for Optimal PMU Placement Ensuring Topological and Numerical Observability

by Vladimir Bečejac, Darko Šošić and Aleksandar Savić

Energies 2026, 19(4), 927; https://doi.org/10.3390/en19040927 - 10 Feb 2026

Viewed by 347

Abstract

This paper presents a novel hybrid algorithm for determining the optimal Phasor Measurement Units (PMU) configuration in power networks to ensure full topological and numerical observability through a multi-phase process. In the first phase, a graph-theoretic Heuristic Node Selector (HNS) is developed to [...] Read more.

This paper presents a novel hybrid algorithm for determining the optimal Phasor Measurement Units (PMU) configuration in power networks to ensure full topological and numerical observability through a multi-phase process. In the first phase, a graph-theoretic Heuristic Node Selector (HNS) is developed to rapidly establish topological observability via Core-Tree construction and node dominance evaluation. Unlike most existing studies that implicitly assume topological observability implies numerical observability, the second phase applies a Genetic Algorithm to refine and extend the initial solution from HNS, ensuring complete numerical observability while minimizing number of PMUs. This hybrid method significantly reduces the search space and improves convergence. The HNS procedure is further extended in this work to explicitly handle Zero Injection Buses (ZIB) through rule-based topological modifications, enabling a modified version of the algorithm applicable to real networks with complex structures. Real-world implementation practices from European Transmission System Operators are considered through the adoption of a “one PMU per feeder” configuration. The proposed method is validated on standard IEEE test systems and Serbian transmission networks. Results demonstrate high scalability, adaptability to various network topologies (with and without ZIB nodes), and efficient PMU allocation. Notably, the method consistently achieves high values of the System Observability Redundancy Index, indicating strong robustness and redundancy in measurement placement. Full article

► Show Figures

Figure 1

18 pages, 1796 KB

Open AccessArticle

SpADE-BERT: Multilingual BERT-Based Model with Trigram-Sensitive Tokenization, Tuned for Depression Detection in Spanish Texts

by Abdiel Reyes-Vera, Magdalena Saldana-Perez, Marco Moreno-Ibarra and Juan Pablo Francisco Posadas-Durán

AI 2026, 7(2), 48; https://doi.org/10.3390/ai7020048 - 1 Feb 2026

Viewed by 575

Abstract

This article proposes an automated approach, based on artificial intelligence techniques, for detecting indicators of depression in texts written in Spanish. Among the main contributions is the construction of a new specialized corpus, supervised by mental health professionals and based on the Beck [...] Read more.

This article proposes an automated approach, based on artificial intelligence techniques, for detecting indicators of depression in texts written in Spanish. Among the main contributions is the construction of a new specialized corpus, supervised by mental health professionals and based on the Beck Depression Inventory. Text processing included linguistic techniques such as lemmatization, stopword removal, and structural transformation using trigrams. As part of the work, SpADE-BERT was designed, a model based on multilingual BERT with a tokenization scheme adapted to incorporate trigrams directly from the input phase. This modification allowed for more robust interaction between the local context and semantic representations. SpADE-BERT was evaluated against multiple approaches reported in the literature, which employ algorithms such as logistic regression, support vector machines, decision trees, and Random Forest with advanced configurations and specialized preprocessing. In all cases, our model showed consistently superior performance on metrics such as precision, recall, and F1-score. The results show that integrating deep language models with adapted tokenization strategies can significantly strengthen the automated identification of linguistic signals associated with depression in Spanish texts. Full article

(This article belongs to the Topic Applications of NLP, AI, and ML in Software Engineering)

► Show Figures

Figure 1

26 pages, 60486 KB

Open AccessArticle

Spatiotemporal Prediction of Ground Surface Deformation Using TPE-Optimized Deep Learning

by Maoqi Liu, Sichun Long, Tao Li, Wandi Wang and Jianan Li

Remote Sens. 2026, 18(2), 234; https://doi.org/10.3390/rs18020234 - 11 Jan 2026

Viewed by 642

Abstract

Surface deformation induced by the extraction of natural resources constitutes a non-stationary spatiotemporal process. Modeling surface deformation time series obtained through Interferometric Synthetic Aperture Radar (InSAR) technology using deep learning methods is crucial for disaster prevention and mitigation. However, the complexity of model [...] Read more.

Surface deformation induced by the extraction of natural resources constitutes a non-stationary spatiotemporal process. Modeling surface deformation time series obtained through Interferometric Synthetic Aperture Radar (InSAR) technology using deep learning methods is crucial for disaster prevention and mitigation. However, the complexity of model hyperparameter configuration and the lack of interpretability in the resulting predictions constrain its engineering applications. To enhance the reliability of model outputs and their decision-making value for engineering applications, this study presents a workflow that combines a Tree-structured Parzen Estimator (TPE)-based Bayesian optimization approach with ensemble inference. Using the Rhineland coalfield in Germany as a case study, we systematically evaluated six deep learning architectures in conjunction with various spatiotemporal coding strategies. Pairwise comparisons were conducted using a Welch t-test to evaluate the performance differences across each architecture under two parameter-tuning approaches. The Benjamini–Hochberg method was applied to control the false discovery rate (FDR) at 0.05 for multiple comparisons. The results indicate that TPE-optimized models demonstrate significantly improved performance compared to their manually tuned counterparts, with the ResNet+Transformer architecture yielding the most favorable outcomes. A comprehensive analysis of the spatial residuals further revealed that TPE optimization not only enhances average accuracy, but also mitigates the model’s prediction bias in fault zones and mineralize areas by improving the spatial distribution structure of errors. Based on this optimal architecture, we combined the ten highest-performing models from the optimization stage to generate a quantile-based susceptibility map, using the ensemble median as the central predictor. Uncertainty was quantified from three complementary perspectives: ensemble spread, class ambiguity, and classification confidence. Our analysis revealed spatial collinearity between physical uncertainty and absolute residuals. This suggests that uncertainty is more closely related to the physical complexity of geological discontinuities and human-disturbed zones, rather than statistical noise. In the analysis of super-threshold probability, the threshold sensitivity exhibited by the mining area reflects the widespread yet moderate impact of mining activities. By contrast, the fault zone continues to exhibit distinct high-probability zones, even under extreme thresholds. It suggests that fault-controlled deformation is more physically intense and poses a greater risk of disaster than mining activities. Finally, we propose an engineering decision strategy that combines uncertainty and residual spatial patterns. This approach transforms statistical diagnostics into actionable, tiered control measures, thereby increasing the practical value of susceptibility mapping in the planning of natural resource extraction. Full article

► Show Figures

Figure 1

28 pages, 3866 KB

Open AccessArticle

Motion Pattern Recognition Based on Surface Electromyography Data and Machine Learning Classifiers: Preliminary Study

by Katarzyna Pytka, Natalia Szarwińska, Wiktoria Wojnicz, Marek Chodnicki and Wiktor Sieklicki

Appl. Sci. 2026, 16(1), 233; https://doi.org/10.3390/app16010233 - 25 Dec 2025

Viewed by 615

Abstract

Objective: The aim of this preliminary study was to recognize motion patterns by classifying time series features extracted from electromyography (EMG) data of the upper limb muscles. Methods: In this study, we tested six machine learning (ML) classification models (decision trees, [...] Read more.

Objective: The aim of this preliminary study was to recognize motion patterns by classifying time series features extracted from electromyography (EMG) data of the upper limb muscles. Methods: In this study, we tested six machine learning (ML) classification models (decision trees, support vector machines, linear discriminant, quadratic discriminant, k-nearest neighbors, and efficient logistic regression) to classify time series features segmented from processed EMG data that were acquired from eight superficial muscles of two upper limbs over performing given physical activities in two main stages (supination and neutral forearm configuration) in initial and target (isometric) positions. Results: Findings indicate that in aiming to classify stages of the upper limb with the highest performance, the following ML models should be used: (1) K-NN cityblock (F1 equals 0.973/0.992) and K-NN minkowski (0.966/0.992) for the left limb in initial or target position; (2) K-NN seuclidean (0.959/0.985) and K-NN minkowski (0.957/0.986) for the right limb in initial position; (3) K-NN cityblock (0.966/0.986), K-NN seuclidean (0.959/0.985), and K-NN minkowski (0.957/0.986) for the right limb in target position. Conclusions: Upper limb positions tested in this study can be recognized based on classification of surface EMG data by using the k-nearest neighbors models (K-NN cityblock, K-NN seuclidean or K-NN minkowski) that have to be trained separately for the right and left upper limbs. Full article

(This article belongs to the Special Issue Machine Learning in Biomedical Sciences)

► Show Figures

Figure 1

Search Results (142)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (142)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI