Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (1,314)

Search Parameters:
Keywords = extreme gradient boost (XGBoost)

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
19 pages, 4537 KiB  
Article
Learning the Value of Place: Machine Learning Models for Real Estate Appraisal in Istanbul’s Diverse Urban Landscape
by Ahmet Hilmi Erciyes, Toygun Atasoy, Abdurrahman Tursun and Sibel Canaz Sevgen
Buildings 2025, 15(15), 2773; https://doi.org/10.3390/buildings15152773 - 6 Aug 2025
Abstract
The prediction of real estate values is vital for taxation, transactions, mortgages, and urban policy development. Values can be predicted more accurately by statistical or advanced methods together when the size of the data is huge. In metropolitan cities like İstanbul, where size [...] Read more.
The prediction of real estate values is vital for taxation, transactions, mortgages, and urban policy development. Values can be predicted more accurately by statistical or advanced methods together when the size of the data is huge. In metropolitan cities like İstanbul, where size of the real estate data is vast and complex, mass appraisal methods supported by Machine Learning offer a scalable and consistent alternative. This study employs six algorithms: Artificial Neural Network, Extreme Gradient Boosting, K-Nearest Neighbors, Support Vector Regression, Random Forest, and Semi-Log Regression, to estimate the values of real estate on both the Asian and European continent parts of İstanbul. In total, 168,099 residential properties were utilized along with 30 of their features from both sides of the Bosphorus. The results show that RF yielded the best performance in Beşiktaş, while XGBoost performed best in Üsküdar. ANN also produced competitive results, although slightly less accurate than those of XGBoost and RF. In contrast, traditional SVR and SLR models underperformed, especially in terms of R2 and RMSE values. With its large-scale dataset, focusing on one of the greatest metropolitan areas, Istanbul, and the usage of multiple ML algorithms, this study stands as a comprehensive and practical contribution to the field of automated real estate valuation. Full article
(This article belongs to the Section Architectural Design, Urban Science, and Real Estate)
Show Figures

Figure 1

14 pages, 7406 KiB  
Article
Machine Learning-Driven Calibration of MODFLOW Models: Comparing Random Forest and XGBoost Approaches
by Husam Musa Baalousha
Geosciences 2025, 15(8), 303; https://doi.org/10.3390/geosciences15080303 - 5 Aug 2025
Abstract
The groundwater inverse problem has several challenges such as instability, non-uniqueness, and complexity, especially for heterogeneous aquifers. Solving the inverse problem is the traditional way to calibrate models, but it is both time-consuming and sensitive to errors in the measurements. This study explores [...] Read more.
The groundwater inverse problem has several challenges such as instability, non-uniqueness, and complexity, especially for heterogeneous aquifers. Solving the inverse problem is the traditional way to calibrate models, but it is both time-consuming and sensitive to errors in the measurements. This study explores the use of machine learning (ML) surrogate models, namely Random Forest (RF) and Extreme Gradient Boosting (XGBoost), to solve the inverse problem for the groundwater model calibration. Datasets for 20 hydraulic conductivity fields were created randomly based on statistics of hydraulic conductivity from the available data of the Northern Aquifer of Qatar, which was used as a case study. The corresponding hydraulic head values were obtained using MODFLOW simulations, and the data were used to train and validate the ML models. The trained surrogate models were used to estimate the hydraulic conductivity based on field observations. The results show that both RF and XGBoost have considerable predictive skill, with RF having better R2 and RMSE values (R2 = 0.99 for training, 0.93 for testing) than XGBoost (R2 = 0.86 for training, 0.85 for testing). The ML-based method lowered the computational effort greatly compared to the classical solution of the inverse problem (i.e., using PEST) and still produced strong and reliable spatial patterns of hydraulic conductivity. This demonstrates the potential of machine learning models for calibrating complex groundwater systems. Full article
(This article belongs to the Section Hydrogeology)
Show Figures

Graphical abstract

28 pages, 3960 KiB  
Article
Electric Bus Battery Energy Consumption Estimation and Influencing Features Analysis Using a Two-Layer Stacking Framework with SHAP-Based Interpretation
by Runze Liu, Jianming Cai, Lipeng Hu, Benxiao Lou and Jinjun Tang
Sustainability 2025, 17(15), 7105; https://doi.org/10.3390/su17157105 (registering DOI) - 5 Aug 2025
Abstract
The widespread adoption of electric buses represents a major step forward in sustainable transportation, but also brings new operational challenges, particularly in terms of improving their efficiency and controlling costs. Therefore, battery energy consumption management is a key approach for addressing these issues. [...] Read more.
The widespread adoption of electric buses represents a major step forward in sustainable transportation, but also brings new operational challenges, particularly in terms of improving their efficiency and controlling costs. Therefore, battery energy consumption management is a key approach for addressing these issues. Accurate prediction of energy consumption and interpretation of the influencing factors are essential for improving operational efficiency, optimizing energy use, and reducing operating costs. Although existing studies have made progress in battery energy consumption prediction, challenges remain in achieving high-precision modeling and conducting a comprehensive analysis of the influencing features. To address these gaps, this study proposes a two-layer stacking framework for estimating the energy consumption of electric buses. The first layer integrates the strengths of three nonlinear regression models—RF (Random Forest), GBDT (Gradient Boosted Decision Trees), and CatBoost (Categorical Boosting)—to enhance the modeling capacity for complex feature relationships. The second layer employs a Linear Regression model as a meta-learner to aggregate the predictions from the base models and improve the overall predictive performance. The framework is trained on 2023 operational data from two electric bus routes (NO. 355 and NO. W188) in Changsha, China, incorporating battery system parameters, driving characteristics, and environmental variables as independent variables for model training and analysis. Comparative experiments with various ensemble models demonstrate that the proposed stacking framework exhibits superior performance in data fitting. Furthermore, XGBoost (Extreme Gradient Boosting, version 2.1.4) is introduced as a surrogate model to approximate the decision logic of the stacking framework, enabling SHAP (SHapley Additive exPlanations) analysis to quantify the contribution and marginal effects of influencing features. The proposed stacked and surrogate models achieved superior battery energy consumption prediction accuracy (lowest MSE, RMSE, and MAE), significantly outperforming benchmark models on real-world datasets. SHAP analysis quantified the overall contributions of feature categories (battery operation parameters: 56.5%; driving characteristics: 42.3%; environmental data: 1.2%), further revealing the specific contributions and nonlinear influence mechanisms of individual features. These quantitative findings offer specific guidance for optimizing battery system control and driving behavior. Full article
(This article belongs to the Section Sustainable Transportation)
Show Figures

Figure 1

29 pages, 16357 KiB  
Article
Evaluation of Heterogeneous Ensemble Learning Algorithms for Lithological Mapping Using EnMAP Hyperspectral Data: Implications for Mineral Exploration in Mountainous Region
by Soufiane Hajaj, Abderrazak El Harti, Amin Beiranvand Pour, Younes Khandouch, Abdelhafid El Alaoui El Fels, Ahmed Babeker Elhag, Nejib Ghazouani, Mustafa Ustuner and Ahmed Laamrani
Minerals 2025, 15(8), 833; https://doi.org/10.3390/min15080833 (registering DOI) - 5 Aug 2025
Abstract
Hyperspectral remote sensing plays a crucial role in guiding and supporting various mineral prospecting activities. Combined with artificial intelligence, hyperspectral remote sensing technology becomes a powerful and versatile tool for a wide range of mineral exploration activities. This study investigates the effectiveness of [...] Read more.
Hyperspectral remote sensing plays a crucial role in guiding and supporting various mineral prospecting activities. Combined with artificial intelligence, hyperspectral remote sensing technology becomes a powerful and versatile tool for a wide range of mineral exploration activities. This study investigates the effectiveness of ensemble learning (EL) algorithms for lithological classification and mineral exploration using EnMAP hyperspectral imagery (HSI) in a semi-arid region. The Moroccan Anti-Atlas mountainous region is known for its complex geology, high mineral potential and rugged terrain, making it a challenging for mineral exploration. This research applies core and heterogeneous ensemble learning methods, i.e., boosting, stacking, voting, bagging, blending, and weighting to improve the accuracy and robustness of lithological classification and mapping in the Moroccan Anti-Atlas mountainous region. Several state-of-the-art models, including support vector machines (SVMs), random forests (RFs), k-nearest neighbors (k-NNs), multi-layer perceptrons (MLPs), extra trees (ETs) and extreme gradient boosting (XGBoost), were evaluated and used as individual and ensemble classifiers. The results show that the EL methods clearly outperform (single) base classifiers. The potential of EL methods to improve the accuracy of HSI-based classification is emphasized by an optimal blending model that achieves the highest overall accuracy (96.69%). The heterogeneous EL models exhibit better generalization ability than the baseline (single) ML models in lithological classification. The current study contributes to a more reliable assessment of resources in mountainous and semi-arid regions by providing accurate delineation of lithological units for mineral exploration objectives. Full article
(This article belongs to the Special Issue Feature Papers in Mineral Exploration Methods and Applications 2025)
Show Figures

Figure 1

35 pages, 4098 KiB  
Article
Prediction of Earthquake Death Toll Based on Principal Component Analysis, Improved Whale Optimization Algorithm, and Extreme Gradient Boosting
by Chenhui Wang, Xiaotao Zhang, Xiaoshan Wang and Guoping Chang
Appl. Sci. 2025, 15(15), 8660; https://doi.org/10.3390/app15158660 (registering DOI) - 5 Aug 2025
Abstract
Earthquakes, as one of the most destructive natural disasters, often cause significant casualties and severe economic losses. Accurate prediction of earthquake fatalities is of great importance for pre-disaster prevention and mitigation planning, as well as post-disaster emergency response deployment. To address the challenges [...] Read more.
Earthquakes, as one of the most destructive natural disasters, often cause significant casualties and severe economic losses. Accurate prediction of earthquake fatalities is of great importance for pre-disaster prevention and mitigation planning, as well as post-disaster emergency response deployment. To address the challenges of small sample sizes, high dimensionality, and strong nonlinearity in earthquake fatality prediction, this paper proposes an integrated modeling approach (PCA-IWOA-XGBoost) combining Principal Component Analysis (PCA), the Improved Whale Optimization Algorithm (IWOA), and Extreme Gradient Boosting (XGBoost). The method first employs PCA to reduce the dimensionality of the influencing factor data, eliminating redundant information and improving modeling efficiency. Subsequently, the IWOA is used to intelligently optimize key hyperparameters of the XGBoost model, enhancing the prediction accuracy and stability. Using 42 major earthquake events in China from 1970 to 2025 as a case study, covering regions including the west (e.g., Tonghai in Yunnan, Wenchuan, Jiuzhaigou), central (e.g., Lushan in Sichuan, Ya’an), east (e.g., Tangshan, Yingkou), north (e.g., Baotou in Inner Mongolia, Helinger), northwest (e.g., Jiashi in Xinjiang, Wushi, Yongdeng in Gansu), and southwest (e.g., Lancang in Yunnan, Lijiang, Ludian), the empirical results showed that the PCA-IWOA-XGBoost model achieved an average test set accuracy of 97.0%, a coefficient of determination (R2) of 0.996, a root mean square error (RMSE) and mean absolute error (MAE) reduced to 4.410 and 3.430, respectively, and a residual prediction deviation (RPD) of 21.090. These results significantly outperformed the baseline XGBoost, PCA-XGBoost, and IWOA-XGBoost models, providing improved technical support for earthquake disaster risk assessment and emergency response. Full article
(This article belongs to the Section Earth Sciences)
Show Figures

Figure 1

9 pages, 1436 KiB  
Proceeding Paper
Insights into Air Quality Index (AQI) Variability with Explainable Machine Learning Techniques
by Claudio Andenna and Roberta Valentina Gagliardi
Environ. Earth Sci. Proc. 2025, 34(1), 1; https://doi.org/10.3390/eesp2025034001 - 5 Aug 2025
Abstract
In this study, a combined approach joining the machine learning model Extreme Gradient Boosting (XGBoost) with Shapley Additive Explanation (SHAP) is adopted to simulate the temporal pattern of the air quality index (AQI) and subsequently explore the key factors affecting AQI variability. Based [...] Read more.
In this study, a combined approach joining the machine learning model Extreme Gradient Boosting (XGBoost) with Shapley Additive Explanation (SHAP) is adopted to simulate the temporal pattern of the air quality index (AQI) and subsequently explore the key factors affecting AQI variability. Based on the analysis of air pollutants and meteorological data acquired from two air quality monitoring stations in Rome (Italy), over the 2018–2022 period, the results demonstrate the effectiveness of the proposed methodological approach in elucidating the role of the main factors driving AQI evolution, and their interaction effects. Full article
Show Figures

Figure 1

22 pages, 1247 KiB  
Article
Evaluating and Predicting Urban Greenness for Sustainable Environmental Development
by Chun-Che Huang, Wen-Yau Liang, Tzu-Liang (Bill) Tseng and Chia-Ying Chan
Processes 2025, 13(8), 2465; https://doi.org/10.3390/pr13082465 - 4 Aug 2025
Viewed by 205
Abstract
With the rapid pace of urbanization, cities are increasingly facing severe challenges related to environmental pollution, ecological degradation, and climate change. Extreme climate events—such as heatwaves, droughts, heavy rainfall, and wildfires—have intensified public concern about sustainability, environmental protection, and low-carbon development. Ensuring environmental [...] Read more.
With the rapid pace of urbanization, cities are increasingly facing severe challenges related to environmental pollution, ecological degradation, and climate change. Extreme climate events—such as heatwaves, droughts, heavy rainfall, and wildfires—have intensified public concern about sustainability, environmental protection, and low-carbon development. Ensuring environmental preservation while maintaining residents’ quality of life has become a central focus of urban governance. In this context, evaluating green indicators and predicting urban greenness is both necessary and urgent. This study incorporates international frameworks such as the EU Green City Index, the European Green Capital Award, and the United Nations Sustainable Development Goals to assess urban sustainability. The Extreme Gradient Boosting (XGBoost) algorithm is employed to predict the green level of cities and to develop multiple optimized models. Comparative analysis with traditional models demonstrates that XGBoost achieves superior performance, with an accuracy of 0.84 and an F1-score of 0.81. Case study findings identify “Greenhouse Gas Emissions per Person” and “Per Capita Emissions from Transport” as the most critical indicators. These results provide practical guidance for policymakers, suggesting that targeted regulations based on these key factors can effectively support emission reduction and urban sustainability goals. Full article
(This article belongs to the Section Environmental and Green Processes)
Show Figures

Figure 1

20 pages, 4135 KiB  
Article
A PSO-XGBoost Model for Predicting the Compressive Strength of Cement–Soil Mixing Pile Considering Field Environment Simulation
by Jiagui Xiong, Yangqing Gong, Xianghua Liu, Yan Li, Liangjie Chen, Cheng Liao and Chaochao Zhang
Buildings 2025, 15(15), 2740; https://doi.org/10.3390/buildings15152740 - 4 Aug 2025
Viewed by 256
Abstract
Cement–Soil Mixing (CSM) Pile is an important technology for soft ground reinforcement, and its as-formed compressive strength directly affects engineering design and construction quality. To address the significant discrepancy between laboratory-tested strength and field as-formed strength arising from differing environmental conditions, this study [...] Read more.
Cement–Soil Mixing (CSM) Pile is an important technology for soft ground reinforcement, and its as-formed compressive strength directly affects engineering design and construction quality. To address the significant discrepancy between laboratory-tested strength and field as-formed strength arising from differing environmental conditions, this study conducted modified laboratory experiments simulating key field formation characteristics. A cement–soil preparation system considering actual immersion conditions was established, based on controlling the initial water content state of the foundation soil before pile formation and applying submerged conditions post-formation. Utilizing data mining on 84 sets of experimental data with various preparation parameter combinations, a prediction model for the as-formed strength of CSM Pile was developed based on the Particle Swarm Optimization-Extreme Gradient Boosting (PSO-XGBoost) algorithm. Engineering validation demonstrated that the model achieved an RMSE of 0.138, an MAE of 0.112, and an R2 of 0.961. It effectively addresses the issue of large prediction deviations caused by insufficient environmental simulation in traditional mix proportion tests. The research findings establish a quantitative relationship between as-formed strength and preparation parameters, providing an effective experimental improvement and strength prediction method for the engineering design of CSM Pile. Full article
Show Figures

Graphical abstract

44 pages, 6212 KiB  
Article
A Hybrid Deep Reinforcement Learning Architecture for Optimizing Concrete Mix Design Through Precision Strength Prediction
by Ali Mirzaei and Amir Aghsami
Math. Comput. Appl. 2025, 30(4), 83; https://doi.org/10.3390/mca30040083 - 3 Aug 2025
Viewed by 259
Abstract
Concrete mix design plays a pivotal role in ensuring the mechanical performance, durability, and sustainability of construction projects. However, the nonlinear interactions among the mix components challenge traditional approaches in predicting compressive strength and optimizing proportions. This study presents a two-stage hybrid framework [...] Read more.
Concrete mix design plays a pivotal role in ensuring the mechanical performance, durability, and sustainability of construction projects. However, the nonlinear interactions among the mix components challenge traditional approaches in predicting compressive strength and optimizing proportions. This study presents a two-stage hybrid framework that integrates deep learning with reinforcement learning to overcome these limitations. First, a Convolutional Neural Network–Long Short-Term Memory (CNN–LSTM) model was developed to capture spatial–temporal patterns from a dataset of 1030 historical concrete samples. The extracted features were enhanced using an eXtreme Gradient Boosting (XGBoost) meta-model to improve generalizability and noise resistance. Then, a Dueling Double Deep Q-Network (Dueling DDQN) agent was used to iteratively identify optimal mix ratios that maximize the predicted compressive strength. The proposed framework outperformed ten benchmark models, achieving an MAE of 2.97, RMSE of 4.08, and R2 of 0.94. Feature attribution methods—including SHapley Additive exPlanations (SHAP), Elasticity-Based Feature Importance (EFI), and Permutation Feature Importance (PFI)—highlighted the dominant influence of cement content and curing age, as well as revealing non-intuitive effects such as the compensatory role of superplasticizers in low-water mixtures. These findings demonstrate the potential of the proposed approach to support intelligent concrete mix design and real-time optimization in smart construction environments. Full article
(This article belongs to the Section Engineering)
Show Figures

Figure 1

22 pages, 4943 KiB  
Article
Predicting De-Handing Point in Bananas Using Crown Morphology and Interpretable Machine Learning
by Lei Zhao, Zhou Yang, Chunxia Wang, Mohui Jin and Jieli Duan
Agronomy 2025, 15(8), 1880; https://doi.org/10.3390/agronomy15081880 - 3 Aug 2025
Viewed by 127
Abstract
Banana de-handing is a critical yet labor-intensive step in postharvest processing, with current manual methods resulting in high costs and occupational risks. This study addresses the automation of de-handing point localization by integrating high-resolution 3D scanning and morphometric analysis of banana crowns with [...] Read more.
Banana de-handing is a critical yet labor-intensive step in postharvest processing, with current manual methods resulting in high costs and occupational risks. This study addresses the automation of de-handing point localization by integrating high-resolution 3D scanning and morphometric analysis of banana crowns with machine learning techniques. A total of 210 crown samples were analyzed to extract key morphological features, including inner arc length (Li), inner arc radius (Ri), outer arc radius (Ro), and the distance between inner and outer arcs (Doi), among others. Four machine learning algorithms, namely, Multi-Layer Perceptron (MLP), Gradient Boosted Decision Trees (GBDT), Extreme Gradient Boosting (XGBoost), and Random Forest (RF), were developed to predict the target radius (Rt) and target distance (Dti) of the de-handing point. The RF models achieved the optimal predictive performance on the testing set, with the following results: for Rt, R2 = 0.95, MAE = 1.50, and RMSE = 1.94; for Dti, R2 = 0.91, MAE = 1.33, and RMSE = 1.66. A Shapley Additive Explanations (SHAP) analysis revealed that Li, Ri, and Ro were the most influential features for Rt, while Doi was the most important for Dti. Notably, feature threshold effects were observed, with limited gains in prediction accuracy beyond specific morphological values. These results provide a quantitative foundation for vision-guided automated de-handing systems, advancing intelligent and efficient banana postharvest management. Full article
(This article belongs to the Section Precision and Digital Agriculture)
Show Figures

Figure 1

15 pages, 1635 KiB  
Article
Modeling the Abrasive Index from Mineralogical and Calorific Properties Using Tree-Based Machine Learning: A Case Study on the KwaZulu-Natal Coalfield
by Mohammad Afrazi, Chia Yu Huat, Moshood Onifade, Manoj Khandelwal, Deji Olatunji Shonuga, Hadi Fattahi and Danial Jahed Armaghani
Mining 2025, 5(3), 48; https://doi.org/10.3390/mining5030048 - 1 Aug 2025
Viewed by 149
Abstract
Accurate prediction of the coal abrasive index (AI) is critical for optimizing coal processing efficiency and minimizing equipment wear in industrial applications. This study explores tree-based machine learning models; Random Forest (RF), Gradient Boosting Trees (GBT), and Extreme Gradient Boosting (XGBoost) to predict [...] Read more.
Accurate prediction of the coal abrasive index (AI) is critical for optimizing coal processing efficiency and minimizing equipment wear in industrial applications. This study explores tree-based machine learning models; Random Forest (RF), Gradient Boosting Trees (GBT), and Extreme Gradient Boosting (XGBoost) to predict AI using selected coal properties. A database of 112 coal samples from the KwaZulu-Natal Coalfield in South Africa was used. Initial predictions using all eight input properties revealed suboptimal testing performance (R2: 0.63–0.72), attributed to outliers and noisy data. Feature importance analysis identified calorific value, quartz, ash, and Pyrite as dominant predictors, aligning with their physicochemical roles in abrasiveness. After data cleaning and feature selection, XGBoost achieved superior accuracy (R2 = 0.92), outperforming RF (R2 = 0.85) and GBT (R2 = 0.81). The results highlight XGBoost’s robustness in modeling non-linear relationships between coal properties and AI. This approach offers a cost-effective alternative to traditional laboratory methods, enabling industries to optimize coal selection, reduce maintenance costs, and enhance operational sustainability through data-driven decision-making. Additionally, quartz and Ash content were identified as the most influential parameters on AI using the Cosine Amplitude technique, while calorific value had the least impact among the selected features. Full article
(This article belongs to the Special Issue Mine Automation and New Technologies)
Show Figures

Figure 1

19 pages, 3532 KiB  
Article
Machine Learning Prediction of CO2 Diffusion in Brine: Model Development and Salinity Influence Under Reservoir Conditions
by Qaiser Khan, Peyman Pourafshary, Fahimeh Hadavimoghaddam and Reza Khoramian
Appl. Sci. 2025, 15(15), 8536; https://doi.org/10.3390/app15158536 (registering DOI) - 31 Jul 2025
Viewed by 150
Abstract
The diffusion coefficient (DC) of CO2 in brine is a key parameter in geological carbon sequestration and CO2-Enhanced Oil Recovery (EOR), as it governs mass transfer efficiency and storage capacity. This study employs three machine learning (ML) models—Random Forest (RF), [...] Read more.
The diffusion coefficient (DC) of CO2 in brine is a key parameter in geological carbon sequestration and CO2-Enhanced Oil Recovery (EOR), as it governs mass transfer efficiency and storage capacity. This study employs three machine learning (ML) models—Random Forest (RF), Gradient Boost Regressor (GBR), and Extreme Gradient Boosting (XGBoost)—to predict DC based on pressure, temperature, and salinity. The dataset, comprising 176 data points, spans pressures from 0.10 to 30.00 MPa, temperatures from 286.15 to 398.00 K, salinities from 0.00 to 6.76 mol/L, and DC values from 0.13 to 4.50 × 10−9 m2/s. The data was split into 80% for training and 20% for testing to ensure reliable model evaluation. Model performance was assessed using R2, RMSE, and MAE. The RF model demonstrated the best performance, with an R2 of 0.95, an RMSE of 0.03, and an MAE of 0.11 on the test set, indicating high predictive accuracy and generalization capability. In comparison, GBR achieved an R2 of 0.925, and XGBoost achieved an R2 of 0.91 on the test set. Feature importance analysis consistently identified temperature as the most influential factor, followed by salinity and pressure. This study highlights the potential of ML models for predicting CO2 diffusion in brine, providing a robust, data-driven framework for optimizing CO2-EOR processes and carbon storage strategies. The findings underscore the critical role of temperature in diffusion behavior, offering valuable insights for future modeling and operational applications. Full article
Show Figures

Figure 1

32 pages, 17155 KiB  
Article
Machine Learning Ensemble Methods for Co-Seismic Landslide Susceptibility: Insights from the 2015 Nepal Earthquake
by Tulasi Ram Bhattarai and Netra Prakash Bhandary
Appl. Sci. 2025, 15(15), 8477; https://doi.org/10.3390/app15158477 (registering DOI) - 30 Jul 2025
Viewed by 224
Abstract
The Mw 7.8 Gorkha Earthquake of 25 April 2015 triggered over 25,000 landslides across central Nepal, with 4775 events concentrated in Gorkha District alone. Despite substantial advances in landslide susceptibility mapping, existing studies often overlook the compound role of post-seismic rainfall and lack [...] Read more.
The Mw 7.8 Gorkha Earthquake of 25 April 2015 triggered over 25,000 landslides across central Nepal, with 4775 events concentrated in Gorkha District alone. Despite substantial advances in landslide susceptibility mapping, existing studies often overlook the compound role of post-seismic rainfall and lack robust spatial validation. To address this gap, we validated an ensemble machine learning framework for co-seismic landslide susceptibility modeling by integrating seismic, geomorphological, hydrological, and anthropogenic variables, including cumulative post-seismic rainfall. Using a balanced dataset of 4775 landslide and non-landslide instances, we evaluated the performance of Logistic Regression (LR), Random Forest (RF), and eXtreme Gradient Boosting (XGBoost) models through spatial cross-validation, SHapley Additive exPlanations (SHAP) explainability, and ablation analysis. The RF model outperformed all others, achieving an accuracy of 87.9% and a Receiver Operating Characteristic (ROC) Area Under the Curve (AUC) value of 0.94, while XGBoost closely followed (AUC = 0.93). Ensemble models collectively classified over 95% of observed landslides into High and Very High susceptibility zones, demonstrating strong spatial reliability. SHAP analysis identified elevation, proximity to fault, peak ground acceleration (PGA), slope, and rainfall as dominant predictors. Notably, the inclusion of post-seismic rainfall substantially improved recall and F1 scores in ablation experiments. Spatial cross-validation revealed the superior generalizability of ensemble models under heterogeneous terrain conditions. The findings underscore the value of integrating post-seismic hydrometeorological factors and spatial validation into susceptibility assessments. We recommend adopting ensemble models, particularly RF, for operational hazard mapping in earthquake-prone mountainous regions. Future research should explore the integration of dynamic rainfall thresholds and physics-informed frameworks to enhance early warning systems and climate resilience. Full article
(This article belongs to the Section Earth Sciences)
Show Figures

Figure 1

20 pages, 732 KiB  
Review
AI Methods Tailored to Influenza, RSV, HIV, and SARS-CoV-2: A Focused Review
by Achilleas Livieratos, George C. Kagadis, Charalambos Gogos and Karolina Akinosoglou
Pathogens 2025, 14(8), 748; https://doi.org/10.3390/pathogens14080748 - 30 Jul 2025
Viewed by 430
Abstract
Artificial intelligence (AI) techniques—ranging from hybrid mechanistic–machine learning (ML) ensembles to gradient-boosted decision trees, support-vector machines, and deep neural networks—are transforming the management of seasonal influenza, respiratory syncytial virus (RSV), human immunodeficiency virus (HIV), and severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Symptom-based [...] Read more.
Artificial intelligence (AI) techniques—ranging from hybrid mechanistic–machine learning (ML) ensembles to gradient-boosted decision trees, support-vector machines, and deep neural networks—are transforming the management of seasonal influenza, respiratory syncytial virus (RSV), human immunodeficiency virus (HIV), and severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Symptom-based triage models using eXtreme Gradient Boosting (XGBoost) and Random Forests, as well as imaging classifiers built on convolutional neural networks (CNNs), have improved diagnostic accuracy across respiratory infections. Transformer-based architectures and social media surveillance pipelines have enabled real-time monitoring of COVID-19. In HIV research, support-vector machines (SVMs), logistic regression, and deep neural network (DNN) frameworks advance viral-protein classification and drug-resistance mapping, accelerating antiviral and vaccine discovery. Despite these successes, persistent challenges remain—data heterogeneity, limited model interpretability, hallucinations in large language models (LLMs), and infrastructure gaps in low-resource settings. We recommend standardized open-access data pipelines and integration of explainable-AI methodologies to ensure safe, equitable deployment of AI-driven interventions in future viral-outbreak responses. Full article
(This article belongs to the Section Viral Pathogens)
Show Figures

Figure 1

26 pages, 8762 KiB  
Article
Clustered Rainfall-Induced Landslides in Jiangwan Town, Guangdong, China During April 2024: Characteristics and Controlling Factors
by Ruizeng Wei, Yunfeng Shan, Lei Wang, Dawei Peng, Ge Qu, Jiasong Qin, Guoqing He, Luzhen Fan and Weile Li
Remote Sens. 2025, 17(15), 2635; https://doi.org/10.3390/rs17152635 - 29 Jul 2025
Viewed by 235
Abstract
On 20 April 2024, an extreme rainfall event occurred in Jiangwan Town Shaoguan City, Guangdong Province, China, where a historic 24 h precipitation of 206 mm was recorded. This triggered extensive landslides that destroyed residential buildings, severed roads, and drew significant societal attention. [...] Read more.
On 20 April 2024, an extreme rainfall event occurred in Jiangwan Town Shaoguan City, Guangdong Province, China, where a historic 24 h precipitation of 206 mm was recorded. This triggered extensive landslides that destroyed residential buildings, severed roads, and drew significant societal attention. Rapid acquisition of landslide inventories, distribution patterns, and key controlling factors is critical for post-disaster emergency response and reconstruction. Based on high-resolution Planet satellite imagery, landslide areas in Jiangwan Town were automatically extracted using the Normalized Difference Vegetation Index (NDVI) differential method, and a detailed landslide inventory was compiled. Combined with terrain, rainfall, and geological environmental factors, the spatial distribution and causes of landslides were analyzed. Results indicate that the extreme rainfall induced 1426 landslides with a total area of 4.56 km2, predominantly small-to-medium scale. Landslides exhibited pronounced clustering and linear distribution along river valleys in a NE–SW orientation. Spatial analysis revealed concentrations on slopes between 200–300 m elevation with gradients of 20–30°. Four machine learning models—Logistic Regression, Support Vector Machine (SVM), Random Forest (RF), and Extreme Gradient Boosting (XGBoost)—were employed to assess landslide susceptibility mapping (LSM) accuracy. RF and XGBoost demonstrated superior performance, identifying high-susceptibility zones primarily on valley-side slopes in Jiangwan Town. Shapley Additive Explanations (SHAP) value analysis quantified key drivers, highlighting elevation, rainfall intensity, profile curvature, and topographic wetness index as dominant controlling factors. This study provides an effective methodology and data support for rapid rainfall-induced landslide identification and deep learning-based susceptibility assessment. Full article
(This article belongs to the Special Issue Study on Hydrological Hazards Based on Multi-Source Remote Sensing)
Show Figures

Figure 1

Back to TopTop