
Search Results (909)

Search Parameters:
Keywords = Shapley additive explanation (SHAP)

24 pages, 1964 KiB  
Article
Data-Driven Symmetry and Asymmetry Investigation of Vehicle Emissions Using Machine Learning: A Case Study in Spain
by Fei Wu, Jinfu Zhu, Hufang Yang, Xiang He and Qiao Peng
Symmetry 2025, 17(8), 1223; https://doi.org/10.3390/sym17081223 (registering DOI) - 2 Aug 2025
Abstract
Understanding vehicle emissions is essential for developing effective carbon reduction strategies in the transport sector. Conventional emission models often assume homogeneity and linearity, overlooking real-world asymmetries that arise from variations in vehicle design and powertrain configurations. This study explores how machine learning and explainable AI techniques can effectively capture both symmetric and asymmetric emission patterns across different vehicle types, thereby contributing to more sustainable transport planning. Addressing a key gap in the existing literature, the study poses the following question: how do structural and behavioral factors contribute to asymmetric emission responses in internal combustion engine vehicles compared to new energy vehicles? Utilizing a large-scale Spanish vehicle registration dataset, the analysis classifies vehicles by powertrain type and applies five supervised learning algorithms to predict CO2 emissions. SHapley Additive exPlanations (SHAPs) are employed to identify nonlinear and threshold-based relationships between emissions and vehicle characteristics such as fuel consumption, weight, and height. Among the models tested, the Random Forest algorithm achieves the highest predictive accuracy. The findings reveal critical asymmetries in emission behavior, particularly among hybrid vehicles, which challenge the assumption of uniform policy applicability. This study provides both methodological innovation and practical insights for symmetry-aware emission modeling, offering support for more targeted eco-design and policy decisions that align with long-term sustainability goals. Full article
(This article belongs to the Section Engineering and Materials)
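Nearly every result in this listing leans on SHAP values, so a minimal from-scratch sketch of the exact Shapley computation may help as background. Everything here is illustrative: the linear "emission model", its weights, and the zero baseline are assumptions, not code from the paper above; features outside a coalition are imputed with the baseline value.

```python
from itertools import combinations
from math import factorial

import numpy as np

def shapley_values(predict, x, baseline):
    """Exact Shapley values for one instance; features outside the
    coalition are imputed with their baseline value."""
    n = len(x)
    phi = np.zeros(n)
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            for S in combinations(others, size):
                idx = list(S)
                with_i = baseline.copy()
                with_i[idx + [i]] = x[idx + [i]]
                without_i = baseline.copy()
                without_i[idx] = x[idx]
                # coalition-size weight |S|! (n - |S| - 1)! / n!
                weight = factorial(size) * factorial(n - size - 1) / factorial(n)
                phi[i] += weight * (predict(with_i) - predict(without_i))
    return phi

# Toy linear "emission model" over three standardised vehicle features.
w = np.array([2.0, -1.0, 0.5])
predict = lambda v: float(w @ v)
x = np.array([1.0, 3.0, -2.0])
baseline = np.zeros(3)
phi = shapley_values(predict, x, baseline)
```

For a linear model the attribution collapses to w_i · (x_i − baseline_i), and the values always sum to predict(x) − predict(baseline); practical SHAP libraries approximate this sum, which is exponential in the number of features.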

23 pages, 3427 KiB  
Article
Visual Narratives and Digital Engagement: Decoding Seoul and Tokyo’s Tourism Identity Through Instagram Analytics
by Seung Chul Yoo and Seung Mi Kang
Tour. Hosp. 2025, 6(3), 149; https://doi.org/10.3390/tourhosp6030149 (registering DOI) - 1 Aug 2025
Abstract
Social media platforms like Instagram significantly shape destination images and influence tourist behavior. Understanding how different cities are represented and perceived on these platforms is crucial for effective tourism marketing. This study provides a comparative analysis of Instagram content and engagement patterns in Seoul and Tokyo, two major Asian metropolises, to derive actionable marketing insights. We collected and analyzed 59,944 public Instagram posts geotagged or location-tagged within Seoul (n = 29,985) and Tokyo (n = 29,959). We employed a mixed-methods approach involving content categorization using a fine-tuned convolutional neural network (CNN) model, engagement metric analysis (likes, comments), Valence Aware Dictionary and sEntiment Reasoner (VADER) sentiment analysis and thematic classification of comments, geospatial analysis (Kernel Density Estimation [KDE], Moran’s I), and predictive modeling (Gradient Boosting with SHapley Additive exPlanations [SHAP] value analysis). A validation analysis using balanced samples (n = 2000 each) was conducted to address Tokyo’s lower geotagged data proportion. While both cities showed ‘Person’ as the dominant content category, notable differences emerged. Tokyo exhibited higher like-based engagement across categories, particularly for ‘Animal’ and ‘Food’ content, while Seoul generated slightly more comments, often expressing stronger sentiment. Qualitative comment analysis revealed Seoul comments focused more on emotional reactions, whereas Tokyo comments were often shorter, appreciative remarks. Geospatial analysis identified distinct hotspots. The validation analysis confirmed these spatial patterns despite Tokyo’s data limitations. Predictive modeling highlighted hashtag counts as the key engagement driver in Seoul and the presence of people in Tokyo. Seoul and Tokyo project distinct visual narratives and elicit different engagement patterns on Instagram. 
These findings offer practical implications for destination marketers, suggesting tailored content strategies and location-based campaigns targeting identified hotspots and specific content themes. This study underscores the value of integrating quantitative and qualitative analyses of social media data for nuanced destination marketing insights. Full article
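The geospatial step in this study (Kernel Density Estimation hotspots plus Moran's I) can be made concrete with a from-scratch global Moran's I on a toy chain of six locations; the rook adjacency matrix and engagement values below are assumptions for illustration, not the study's data.

```python
import numpy as np

def morans_i(x, W):
    """Global Moran's I: spatial autocorrelation of x under weight matrix W."""
    x = np.asarray(x, dtype=float)
    z = x - x.mean()
    num = (W * np.outer(z, z)).sum()   # weighted cross-products of deviations
    den = (z ** 2).sum()
    return len(x) / W.sum() * num / den

# Six locations in a chain; neighbours share an edge (rook adjacency).
W = np.zeros((6, 6))
for i in range(5):
    W[i, i + 1] = W[i + 1, i] = 1.0

clustered = [1, 1, 1, 0, 0, 0]    # high engagement clusters together
alternating = [1, 0, 1, 0, 1, 0]  # perfectly dispersed pattern
```

Clustered values give a positive I (spatial clustering), the alternating pattern a negative one (dispersion); values near zero indicate no spatial structure.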

22 pages, 2120 KiB  
Article
Machine Learning Algorithms and Explainable Artificial Intelligence for Property Valuation
by Gabriella Maselli and Antonio Nesticò
Real Estate 2025, 2(3), 12; https://doi.org/10.3390/realestate2030012 - 1 Aug 2025
Abstract
The accurate estimation of urban property values is a key challenge for appraisers, market participants, financial institutions, and urban planners. In recent years, machine learning (ML) techniques have emerged as promising tools for price forecasting due to their ability to model complex relationships among variables. However, their application raises two main critical issues: (i) the risk of overfitting, especially with small datasets or with noisy data; (ii) the interpretive issues associated with the “black box” nature of many models. Within this framework, this paper proposes a methodological approach that addresses both these issues, comparing the predictive performance of three ML algorithms—k-Nearest Neighbors (kNN), Random Forest (RF), and the Artificial Neural Network (ANN)—applied to the housing market in the city of Salerno, Italy. For each model, overfitting is preliminarily assessed to ensure predictive robustness. Subsequently, the results are interpreted using explainability techniques, such as SHapley Additive exPlanations (SHAPs) and Permutation Feature Importance (PFI). This analysis reveals that the Random Forest offers the best balance between predictive accuracy and transparency, with features such as area and proximity to the train station identified as the main drivers of property prices. kNN and the ANN are viable alternatives that are particularly robust in terms of generalization. The results demonstrate how the defined methodological framework successfully balances predictive effectiveness and interpretability, supporting the informed and transparent use of ML in real estate valuation. Full article
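Permutation Feature Importance, used here alongside SHAP, is simple enough to sketch without a library: shuffle one column and measure how much the error grows. The synthetic "housing" data and the stand-in fitted model below are assumptions for illustration, not the Salerno dataset.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic housing data: price depends on area, not on the noise column.
n = 500
area = rng.uniform(50, 200, n)
noise = rng.normal(size=n)
X = np.column_stack([area, noise])
y = 3.0 * area + rng.normal(scale=5.0, size=n)

predict = lambda X: 3.0 * X[:, 0]  # stand-in for a fitted model

def permutation_importance(predict, X, y, n_repeats=10, seed=1):
    """Mean increase in MSE when one column is shuffled in place."""
    rng = np.random.default_rng(seed)
    base = np.mean((y - predict(X)) ** 2)
    imp = np.zeros(X.shape[1])
    for j in range(X.shape[1]):
        for _ in range(n_repeats):
            Xp = X.copy()
            rng.shuffle(Xp[:, j])  # destroy the column's relation to y
            imp[j] += np.mean((y - predict(Xp)) ** 2) - base
    return imp / n_repeats

imp = permutation_importance(predict, X, y)
```

Shuffling the informative column inflates the error sharply, while shuffling the ignored noise column leaves predictions untouched; ranking columns by this inflation is the PFI the abstract refers to.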

17 pages, 1584 KiB  
Article
What Determines Carbon Emissions of Multimodal Travel? Insights from Interpretable Machine Learning on Mobility Trajectory Data
by Guo Wang, Shu Wang, Wenxiang Li and Hongtai Yang
Sustainability 2025, 17(15), 6983; https://doi.org/10.3390/su17156983 (registering DOI) - 31 Jul 2025
Abstract
Understanding the carbon emissions of multimodal travel—comprising walking, metro, bus, cycling, and ride-hailing—is essential for promoting sustainable urban mobility. However, most existing studies focus on single-mode travel, while underlying spatiotemporal and behavioral determinants remain insufficiently explored due to the lack of fine-grained data and interpretable analytical frameworks. This study proposes a novel integration of high-frequency, real-world mobility trajectory data with interpretable machine learning to systematically identify the key drivers of carbon emissions at the individual trip level. Firstly, multimodal travel chains are reconstructed using continuous GPS trajectory data collected in Beijing. Secondly, a model based on the COmputer Programme to calculate Emissions from Road Transport (COPERT) is developed to quantify trip-level CO2 emissions. Thirdly, four interpretable machine learning models based on gradient boosting—XGBoost, GBDT, LightGBM, and CatBoost—are trained using transportation and built environment features to model the relationship between CO2 emissions and a set of explanatory variables; finally, Shapley Additive exPlanations (SHAP) and partial dependence plots (PDPs) are used to interpret the model outputs, revealing key determinants and their non-linear interaction effects. The results show that transportation-related features account for 75.1% of the explained variance in emissions, with bus usage being the most influential single factor (contributing 22.6%). Built environment features explain the remaining 24.9%. The PDP analysis reveals that substantial emission reductions occur only when the shares of bus, metro, and cycling surpass threshold levels of approximately 40%, 40%, and 30%, respectively. Additionally, travel carbon emissions are minimized when trip origins and destinations are located within a 10 to 11 km radius of the central business district (CBD). 
This study advances the field by establishing a scalable, interpretable, and behaviorally grounded framework to assess carbon emissions from multimodal travel, providing actionable insights for low-carbon transport planning and policy design. Full article
(This article belongs to the Special Issue Sustainable Transportation Systems and Travel Behaviors)
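The threshold effects reported above (emission drops once mode shares pass roughly 40%) are exactly what partial dependence plots surface. A from-scratch sketch: clamp one feature to each grid value and average the model's predictions. The toy piecewise model standing in for the trained boosters is an assumption for illustration.

```python
import numpy as np

def partial_dependence(predict, X, feature, grid):
    """PDP: average prediction with one feature clamped to each grid value."""
    pd = []
    for v in grid:
        Xv = X.copy()
        Xv[:, feature] = v          # clamp the feature for every row
        pd.append(predict(Xv).mean())
    return np.array(pd)

rng = np.random.default_rng(0)
X = rng.uniform(0, 1, size=(200, 2))

# Toy emission model with a threshold effect in feature 0 (e.g. bus share):
# emissions drop once the share passes 0.4, echoing the thresholds above.
predict = lambda X: np.where(X[:, 0] > 0.4, 1.0, 2.0) + 0.1 * X[:, 1]

grid = np.linspace(0, 1, 11)
pd_curve = partial_dependence(predict, X, 0, grid)
```

The resulting curve is flat on either side of the threshold and steps down across it, which is how a PDP reveals the non-linear regime changes the abstract describes.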

35 pages, 3218 KiB  
Article
Integrated GBR–NSGA-II Optimization Framework for Sustainable Utilization of Steel Slag in Road Base Layers
by Merve Akbas
Appl. Sci. 2025, 15(15), 8516; https://doi.org/10.3390/app15158516 (registering DOI) - 31 Jul 2025
Abstract
This study proposes an integrated, machine learning-based multi-objective optimization framework to evaluate and optimize the utilization of steel slag in road base layers, simultaneously addressing economic costs and environmental impacts. A comprehensive dataset of 482 scenarios was engineered based on literature-informed parameters, encompassing transport distance, processing energy intensity, initial moisture content, gradation adjustments, and regional electricity emission factors. Four advanced tree-based ensemble regression algorithms—Random Forest Regressor (RFR), Extremely Randomized Trees (ERTs), Gradient Boosted Regressor (GBR), and Extreme Gradient Boosting Regressor (XGBR)—were rigorously evaluated. Among these, GBR demonstrated superior predictive performance (R2 > 0.95, RMSE < 7.5), effectively capturing complex nonlinear interactions inherent in slag processing and logistics operations. Feature importance analysis via SHapley Additive exPlanations (SHAP) provided interpretative insights, highlighting transport distance and energy intensity as dominant factors affecting unit cost, while moisture content and grid emission factor predominantly influenced CO2 emissions. Subsequently, the Gradient Boosted Regressor model was integrated into a Non-Dominated Sorting Genetic Algorithm II (NSGA-II) framework to explore optimal trade-offs between cost and emissions. The resulting Pareto front revealed a diverse solution space, with significant nonlinear trade-offs between economic efficiency and environmental performance, clearly identifying strategic inflection points. To facilitate actionable decision-making, the Technique for Order Preference by Similarity to Ideal Solution (TOPSIS) method was applied, identifying an optimal balanced solution characterized by a transport distance of 47 km, energy intensity of 1.21 kWh/ton, moisture content of 6.2%, moderate gradation adjustment, and a grid CO2 factor of 0.47 kg CO2/kWh. 
This scenario offered a substantial reduction (45%) in CO2 emissions relative to cost-minimized solutions, with a moderate increase (33%) in total cost, presenting a realistic and balanced pathway for sustainable infrastructure practices. Overall, this study introduces a robust, scalable, and interpretable optimization framework, providing valuable methodological advancements for sustainable decision making in infrastructure planning and circular economy initiatives. Full article
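The TOPSIS step that picks a balanced point off the Pareto front can be sketched in a few lines; the three candidate [unit cost, CO2] alternatives and the equal weights below are illustrative assumptions, not the paper's 482-scenario data.

```python
import numpy as np

def topsis(M, weights, benefit):
    """Rank alternatives by closeness to ideal / anti-ideal solutions.
    M: alternatives x criteria; benefit[j] is True if larger is better."""
    M = np.asarray(M, dtype=float)
    N = M / np.linalg.norm(M, axis=0)          # vector-normalise each criterion
    V = N * weights
    ideal = np.where(benefit, V.max(axis=0), V.min(axis=0))
    anti = np.where(benefit, V.min(axis=0), V.max(axis=0))
    d_plus = np.linalg.norm(V - ideal, axis=1)
    d_minus = np.linalg.norm(V - anti, axis=1)
    return d_minus / (d_plus + d_minus)        # higher score = better

# Three Pareto candidates: [unit cost, CO2 emissions], both to be minimised.
M = [[100.0, 50.0],   # cheap but dirty
     [133.0, 27.5],   # balanced
     [180.0, 20.0]]   # clean but expensive
scores = topsis(M, weights=np.array([0.5, 0.5]), benefit=np.array([False, False]))
best = int(np.argmax(scores))
```

Both criteria are costs, so the ideal point takes the column minima; with these numbers the balanced middle candidate scores highest, mirroring the compromise solution the study selects.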

32 pages, 17155 KiB  
Article
Machine Learning Ensemble Methods for Co-Seismic Landslide Susceptibility: Insights from the 2015 Nepal Earthquake
by Tulasi Ram Bhattarai and Netra Prakash Bhandary
Appl. Sci. 2025, 15(15), 8477; https://doi.org/10.3390/app15158477 (registering DOI) - 30 Jul 2025
Abstract
The Mw 7.8 Gorkha Earthquake of 25 April 2015 triggered over 25,000 landslides across central Nepal, with 4775 events concentrated in Gorkha District alone. Despite substantial advances in landslide susceptibility mapping, existing studies often overlook the compound role of post-seismic rainfall and lack robust spatial validation. To address this gap, we validated an ensemble machine learning framework for co-seismic landslide susceptibility modeling by integrating seismic, geomorphological, hydrological, and anthropogenic variables, including cumulative post-seismic rainfall. Using a balanced dataset of 4775 landslide and non-landslide instances, we evaluated the performance of Logistic Regression (LR), Random Forest (RF), and eXtreme Gradient Boosting (XGBoost) models through spatial cross-validation, SHapley Additive exPlanations (SHAP) explainability, and ablation analysis. The RF model outperformed all others, achieving an accuracy of 87.9% and a Receiver Operating Characteristic (ROC) Area Under the Curve (AUC) value of 0.94, while XGBoost closely followed (AUC = 0.93). Ensemble models collectively classified over 95% of observed landslides into High and Very High susceptibility zones, demonstrating strong spatial reliability. SHAP analysis identified elevation, proximity to fault, peak ground acceleration (PGA), slope, and rainfall as dominant predictors. Notably, the inclusion of post-seismic rainfall substantially improved recall and F1 scores in ablation experiments. Spatial cross-validation revealed the superior generalizability of ensemble models under heterogeneous terrain conditions. The findings underscore the value of integrating post-seismic hydrometeorological factors and spatial validation into susceptibility assessments. We recommend adopting ensemble models, particularly RF, for operational hazard mapping in earthquake-prone mountainous regions. 
Future research should explore the integration of dynamic rainfall thresholds and physics-informed frameworks to enhance early warning systems and climate resilience. Full article
(This article belongs to the Section Earth Sciences)
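Spatial cross-validation, which the study credits for honest generalizability estimates, differs from ordinary k-fold in that folds are whole spatial blocks, so nearby (spatially autocorrelated) points never straddle train and test. A minimal sketch with an assumed 2×2 blocking of random point coordinates:

```python
import numpy as np

def spatial_block_folds(coords, n_blocks_x=2, n_blocks_y=2):
    """Leave-one-block-out folds: points are binned into a spatial grid and
    each cell becomes one held-out fold, so train and test never share a block."""
    coords = np.asarray(coords, dtype=float)
    x_edges = np.linspace(coords[:, 0].min(), coords[:, 0].max(), n_blocks_x + 1)[1:-1]
    y_edges = np.linspace(coords[:, 1].min(), coords[:, 1].max(), n_blocks_y + 1)[1:-1]
    block = np.digitize(coords[:, 0], x_edges) * n_blocks_y + np.digitize(coords[:, 1], y_edges)
    for b in np.unique(block):
        yield np.where(block != b)[0], np.where(block == b)[0]

rng = np.random.default_rng(0)
coords = rng.uniform(0, 100, size=(200, 2))  # stand-in landslide locations
folds = list(spatial_block_folds(coords))
```

Because each test fold is a contiguous block, the evaluation penalises models that merely memorise local terrain, which is why block-based folds generalise more honestly than random splits on spatial data.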

14 pages, 6012 KiB  
Article
Decoding the Primacy of Transportation Emissions of Formaldehyde Pollution in an Urban Atmosphere
by Shi-Qi Liu, Hao-Nan Ma, Meng-Xue Tang, Yu-Ming Shao, Ting-Ting Yao, Ling-Yan He and Xiao-Feng Huang
Toxics 2025, 13(8), 643; https://doi.org/10.3390/toxics13080643 - 30 Jul 2025
Abstract
Understanding the differential impacts of emission sources of volatile organic compounds (VOCs) on formaldehyde (HCHO) levels is pivotal to effectively mitigating key photochemical radical precursors, thereby enhancing the regulation of atmospheric oxidation capacity (AOC) and ozone formation. This investigation systematically selected and analyzed year-long VOC measurements across three urban zones in Shenzhen, China. Photochemical age correction methods were implemented to develop the initial concentrations of VOCs before source apportionment; then Positive Matrix Factorization (PMF) modeling resolved six primary sources: solvent usage (28.6–47.9%), vehicle exhaust (24.2–31.2%), biogenic emission (13.8–18.1%), natural gas (8.5–16.3%), gasoline evaporation (3.2–8.9%), and biomass burning (0.3–2.4%). A machine learning (ML) framework incorporating Shapley Additive Explanations (SHAP) was subsequently applied to evaluate the influence of six emission sources on HCHO concentrations while accounting for reaction time adjustments. This machine learning-driven nonlinear analysis demonstrated that vehicle exhaust nearly always emerged as the primary anthropogenic contributor in diverse functional zones and different seasons, with gasoline evaporation as another key contributor, while the traditional reactivity metric method, ozone formation potential (OFP), tended to underestimate the role of the two sources. This study highlights the primacy of strengthening emission reduction of transportation sectors to mitigate HCHO pollution in megacities. Full article

14 pages, 2727 KiB  
Article
A Multimodal MRI-Based Model for Colorectal Liver Metastasis Prediction: Integrating Radiomics, Deep Learning, and Clinical Features with SHAP Interpretation
by Xin Yan, Furui Duan, Lu Chen, Runhong Wang, Kexin Li, Qiao Sun and Kuang Fu
Curr. Oncol. 2025, 32(8), 431; https://doi.org/10.3390/curroncol32080431 - 30 Jul 2025
Abstract
Purpose: Predicting colorectal cancer liver metastasis (CRLM) is essential for prognostic assessment. This study aims to develop and validate an interpretable multimodal machine learning framework based on multiparametric MRI for predicting CRLM, and to enhance the clinical interpretability of the model through SHapley Additive exPlanations (SHAP) analysis and deep learning visualization. Methods: This multicenter retrospective study included 463 patients with pathologically confirmed colorectal cancer from two institutions, divided into training (n = 256), internal testing (n = 111), and external validation (n = 96) sets. Radiomics features were extracted from manually segmented regions on axial T2-weighted imaging (T2WI) and diffusion-weighted imaging (DWI). Deep learning features were obtained from a pretrained ResNet101 network using the same MRI inputs. A least absolute shrinkage and selection operator (LASSO) logistic regression classifier was developed for clinical, radiomics, deep learning, and combined models. Model performance was evaluated by AUC, sensitivity, specificity, and F1-score. SHAP was used to assess feature contributions, and Grad-CAM was applied to visualize deep feature attention. Results: The combined model integrating features across the three modalities achieved the highest performance across all datasets, with AUCs of 0.889 (training), 0.838 (internal test), and 0.822 (external validation), outperforming single-modality models. Decision curve analysis (DCA) revealed enhanced clinical net benefit from the integrated model, while calibration curves confirmed its good predictive consistency. SHAP analysis revealed that radiomic features related to T2WI texture (e.g., LargeDependenceLowGrayLevelEmphasis) and clinical biomarkers (e.g., CA19-9) were among the most predictive for CRLM. Grad-CAM visualizations confirmed that the deep learning model focused on tumor regions consistent with radiological interpretation. 
Conclusions: This study presents a robust and interpretable multiparametric MRI-based model for noninvasively predicting liver metastasis in colorectal cancer patients. By integrating handcrafted radiomics and deep learning features, and enhancing transparency through SHAP and Grad-CAM, the model provides both high predictive performance and clinically meaningful explanations. These findings highlight its potential value as a decision-support tool for individualized risk assessment and treatment planning in the management of colorectal cancer. Full article
(This article belongs to the Section Gastrointestinal Oncology)

22 pages, 1724 KiB  
Article
Development and Clinical Interpretation of an Explainable AI Model for Predicting Patient Pathways in the Emergency Department: A Retrospective Study
by Émilien Arnaud, Pedro Antonio Moreno-Sanchez, Mahmoud Elbattah, Christine Ammirati, Mark van Gils, Gilles Dequen and Daniel Aiham Ghazali
Appl. Sci. 2025, 15(15), 8449; https://doi.org/10.3390/app15158449 - 30 Jul 2025
Abstract
Background: Overcrowded emergency departments (EDs) create significant challenges for patient management and hospital efficiency. In response, Amiens Picardy University Hospital (APUH) developed the “Prediction of the Patient Pathway in the Emergency Department” (3P-U) model to enhance patient flow management. Objectives: To develop and clinically validate an explainable artificial intelligence (XAI) model for hospital admission predictions, using structured triage data, and demonstrate its real-world applicability in the ED setting. Methods: Our retrospective, single-center study involved 351,019 patients consulting in APUH’s EDs between 2015 and 2018. Various models (including a cross-validation artificial neural network (ANN), a k-nearest neighbors (KNN) model, a logistic regression (LR) model, and a random forest (RF) model) were trained and assessed for performance with regard to the area under the receiver operating characteristic curve (AUROC). The best model was validated internally with a test set, and the F1 score was used to determine the best threshold for recall, precision, and accuracy. XAI techniques, such as Shapley additive explanations (SHAP) and partial dependence plots (PDP) were employed, and the clinical explanations were evaluated by emergency physicians. Results: The ANN gave the best performance during the training stage, with an AUROC of 83.1% (SD: 0.2%) for the test set; it surpassed the RF (AUROC: 71.6%, SD: 0.1%), KNN (AUROC: 67.2%, SD: 0.2%), and LR (AUROC: 71.5%, SD: 0.2%) models. In an internal validation, the ANN’s AUROC was 83.2%. The best F1 score (0.67) determined that 0.35 was the optimal threshold; the corresponding recall, precision, and accuracy were 75.7%, 59.7%, and 75.3%, respectively. 
The SHAP and PDP XAI techniques (as assessed by emergency physicians) highlighted patient age, heart rate, and presentation with multiple injuries as the features that most specifically influenced the admission from the ED to a hospital ward. These insights are being used in bed allocation and patient prioritization, directly improving ED operations. Conclusions: The 3P-U model demonstrates practical utility by reducing ED crowding and enhancing decision-making processes at APUH. Its transparency and physician validation foster trust, facilitating its adoption in clinical practice and offering a replicable framework for other hospitals to optimize patient flow. Full article
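The threshold search described above (scanning for the cut-off that maximises F1, reported here as 0.35) can be sketched directly; the toy admission scores and labels below are assumptions for illustration, not the APUH cohort.

```python
import numpy as np

def best_f1_threshold(y_true, scores, thresholds):
    """Pick the decision threshold that maximises F1 on held-out data."""
    best_t, best_f1 = None, -1.0
    for t in thresholds:
        pred = scores >= t
        tp = np.sum(pred & (y_true == 1))
        fp = np.sum(pred & (y_true == 0))
        fn = np.sum(~pred & (y_true == 1))
        f1 = 2 * tp / (2 * tp + fp + fn) if tp else 0.0
        if f1 > best_f1:
            best_t, best_f1 = t, f1
    return best_t, best_f1

# Toy admission scores: positives cluster above ~0.35, negatives below.
y_true = np.array([0, 0, 0, 0, 0, 1, 1, 1, 1, 1])
scores = np.array([0.05, 0.1, 0.2, 0.3, 0.4, 0.36, 0.5, 0.6, 0.8, 0.9])
t, f1 = best_f1_threshold(y_true, scores, np.linspace(0, 1, 101))
```

Lowering the threshold trades precision for recall; maximising F1 picks the balance point, which is how a probabilistic model's output becomes an operational admit/discharge decision rule.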

19 pages, 3509 KiB  
Article
Explainable Machine Learning Model for Source Type Identification of Mine Inrush Water
by Yong Yang, Jing Li, Huawei Tao, Yong Cheng and Li Zhao
Information 2025, 16(8), 648; https://doi.org/10.3390/info16080648 - 30 Jul 2025
Abstract
The prevention and control of mine inrush water has always been a major challenge for mine safety. By identifying the type of water source and analyzing real-time changes in water composition, sudden water inrush accidents can be detected in a timely manner to avoid major accidents. This paper proposes a novel explainable machine learning model for source type identification of mine inrush water. The paper extends the original monitoring system to the XinJi No. 2 Mine in the Huainan Mining Area. Using the Spearman correlation coefficient on the online water composition data, it analyzes the hydrochemical characteristics of different aquifers to extract key discriminant factors. A Conv1D-GRU model was then built to link these factors for precise water source identification. The experimental results show an accuracy rate of 85.37%. In addition, to address interpretability, the experiment quantified the impact of individual features on the model using SHAP (Shapley Additive Explanations). This provides a new reference for source type identification of mine inrush water in mine disaster prevention and control. Full article
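The Spearman-based factor screening can be illustrated with a from-scratch rank correlation (ties are ignored for brevity; real implementations average tied ranks). The ion concentrations below are invented for illustration, not measurements from the mine's aquifers.

```python
import numpy as np

def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the rank vectors.
    Simplification: assumes no tied values (no average ranks)."""
    def rank(v):
        order = np.argsort(v)
        r = np.empty(len(v))
        r[order] = np.arange(len(v), dtype=float)
        return r
    rx, ry = rank(np.asarray(x)), rank(np.asarray(y))
    rx -= rx.mean()
    ry -= ry.mean()
    return float((rx @ ry) / np.sqrt((rx @ rx) * (ry @ ry)))

# Toy ion concentrations from water samples: Cl- rises monotonically
# (though nonlinearly) with Na+, so the rank correlation is perfect.
na = [12.0, 30.0, 45.0, 80.0, 150.0]
cl = [10.0, 25.0, 60.0, 90.0, 300.0]
```

Because only ranks matter, Spearman captures monotone but nonlinear hydrochemical relationships that Pearson would understate, which suits it to screening discriminant factors across aquifers.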

26 pages, 8762 KiB  
Article
Clustered Rainfall-Induced Landslides in Jiangwan Town, Guangdong, China During April 2024: Characteristics and Controlling Factors
by Ruizeng Wei, Yunfeng Shan, Lei Wang, Dawei Peng, Ge Qu, Jiasong Qin, Guoqing He, Luzhen Fan and Weile Li
Remote Sens. 2025, 17(15), 2635; https://doi.org/10.3390/rs17152635 - 29 Jul 2025
Abstract
On 20 April 2024, an extreme rainfall event occurred in Jiangwan Town, Shaoguan City, Guangdong Province, China, where a historic 24 h precipitation of 206 mm was recorded. This triggered extensive landslides that destroyed residential buildings, severed roads, and drew significant societal attention. Rapid acquisition of landslide inventories, distribution patterns, and key controlling factors is critical for post-disaster emergency response and reconstruction. Based on high-resolution Planet satellite imagery, landslide areas in Jiangwan Town were automatically extracted using the Normalized Difference Vegetation Index (NDVI) differential method, and a detailed landslide inventory was compiled. Combined with terrain, rainfall, and geological environmental factors, the spatial distribution and causes of landslides were analyzed. Results indicate that the extreme rainfall induced 1426 landslides with a total area of 4.56 km2, predominantly small-to-medium scale. Landslides exhibited pronounced clustering and linear distribution along river valleys in a NE–SW orientation. Spatial analysis revealed concentrations on slopes at 200–300 m elevation with gradients of 20–30°. Four machine learning models—Logistic Regression, Support Vector Machine (SVM), Random Forest (RF), and Extreme Gradient Boosting (XGBoost)—were employed to assess landslide susceptibility mapping (LSM) accuracy. RF and XGBoost demonstrated superior performance, identifying high-susceptibility zones primarily on valley-side slopes in Jiangwan Town. Shapley Additive Explanations (SHAP) value analysis quantified key drivers, highlighting elevation, rainfall intensity, profile curvature, and topographic wetness index as dominant controlling factors. This study provides an effective methodology and data support for rapid rainfall-induced landslide identification and deep learning-based susceptibility assessment. Full article
(This article belongs to the Special Issue Study on Hydrological Hazards Based on Multi-Source Remote Sensing)
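The NDVI differential extraction is compact enough to sketch: compute NDVI before and after the event and flag pixels whose vegetation signal collapsed. The 2×2 reflectance arrays and the 0.3 drop threshold below are assumptions for illustration, not the Planet imagery or the study's calibrated threshold.

```python
import numpy as np

def ndvi(nir, red, eps=1e-9):
    """Normalized Difference Vegetation Index per pixel."""
    nir, red = np.asarray(nir, float), np.asarray(red, float)
    return (nir - red) / (nir + red + eps)

def landslide_mask(nir_pre, red_pre, nir_post, red_post, drop=0.3):
    """Flag pixels whose NDVI dropped by more than `drop` after the event."""
    return (ndvi(nir_pre, red_pre) - ndvi(nir_post, red_post)) > drop

# 2x2 toy scene: the top-left pixel loses its vegetation after the rainfall.
nir_pre  = np.array([[0.8, 0.8], [0.7, 0.6]])
red_pre  = np.array([[0.1, 0.1], [0.1, 0.2]])
nir_post = np.array([[0.3, 0.8], [0.7, 0.6]])
red_post = np.array([[0.3, 0.1], [0.1, 0.2]])
mask = landslide_mask(nir_pre, red_pre, nir_post, red_post)
```

Thresholding the NDVI difference isolates abrupt vegetation loss from stable cover, which is what makes the method fast enough for post-disaster emergency mapping.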

23 pages, 5330 KiB  
Article
Explainable Reinforcement Learning for the Initial Design Optimization of Compressors Inspired by the Black-Winged Kite
by Mingming Zhang, Zhuang Miao, Xi Nan, Ning Ma and Ruoyang Liu
Biomimetics 2025, 10(8), 497; https://doi.org/10.3390/biomimetics10080497 - 29 Jul 2025
Abstract
Although artificial intelligence methods such as reinforcement learning (RL) show potential in optimizing the design of compressors, there are still two major challenges remaining: limited design variables and insufficient model explainability. For the initial design of compressors, this paper proposes a technical approach that incorporates deep reinforcement learning and decision tree distillation to enhance both the optimization capability and explainability. First, a pre-selection platform for the initial design scheme of the compressors is constructed based on the Deep Deterministic Policy Gradient (DDPG) algorithm. The optimization space is significantly enlarged by expanding the co-design of 25 key variables (e.g., the inlet airflow angle, the reaction, the load coefficient, etc.). Then, the initial design of six-stage axial compressors is successfully completed, with the axial efficiency increasing to 84.65% at the design speed and the surge margin extending to 10.75%. The design scheme is closer to the actual needs of engineering. Secondly, Shapley Additive Explanations (SHAP) analysis is utilized to reveal the influence of the mechanism of the key design parameters on the performance of the compressors in order to enhance the model explainability. Finally, the decision tree inspired by the black-winged kite (BKA) algorithm takes the interpretable design rules and transforms the data-driven intelligent optimization into explicit engineering experience. Through experimental validation, this method significantly improves the transparency of the design process while maintaining the high performance of the DDPG algorithm. The extracted design rules not only have clear physical meanings but also can effectively guide the initial design of the compressors, providing a new idea with both optimization capability and explainability for its intelligent design. Full article
(This article belongs to the Special Issue Advances in Biological and Bio-Inspired Algorithms)
20 pages, 8154 KiB  
Article
Strategies for Soil Salinity Mapping Using Remote Sensing and Machine Learning in the Yellow River Delta
by Junyong Zhang, Xianghe Ge, Xuehui Hou, Lijing Han, Zhuoran Zhang, Wenjie Feng, Zihan Zhou and Xiubin Luo
Remote Sens. 2025, 17(15), 2619; https://doi.org/10.3390/rs17152619 - 28 Jul 2025
Abstract
In response to the global ecological and agricultural challenges posed by coastal saline-alkali areas, this study focuses on Dongying City as a representative region, aiming to develop a high-precision soil salinity prediction mapping method that integrates multi-source remote sensing data with machine learning techniques. Utilizing the SCORPAN model framework, we systematically combined diverse remote sensing datasets and innovatively established nine distinct strategies for soil salinity prediction. We employed four machine learning models—Support Vector Regression (SVR), Random Forest (RF), Extreme Gradient Boosting (XGBoost), and Geographical Gaussian Process Regression (GGPR)—for modeling, prediction, and accuracy comparison, with the objective of achieving high-precision salinity mapping under complex vegetation cover conditions. The results reveal that among the models evaluated across the nine strategies, the SVR model demonstrated the highest accuracy, followed by RF. Notably, under Strategy IX, the SVR model achieved the best predictive performance, with a coefficient of determination (R2) of 0.62 and a root mean square error (RMSE) of 0.38 g/kg. Analysis based on SHapley Additive exPlanations (SHAP) values and feature importance indicated that Vegetation Type Factors contributed significantly and consistently to the model’s performance, maintaining higher importance than traditional salinity indices and playing a dominant role. In summary, this research successfully developed a comprehensive, high-resolution soil salinity mapping framework for the Dongying region by integrating multi-source remote sensing data and employing diverse predictive strategies alongside machine learning models.
The findings highlight the potential of Vegetation Type Factors to enhance large-scale soil salinity monitoring, providing robust scientific evidence and technical support for sustainable land resource management, agricultural optimization, ecological protection, efficient water resource utilization, and policy formulation. Full article
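The R2 and RMSE figures used to compare the nine strategies follow directly from their textbook definitions. A minimal pure-Python version is shown below; the salinity observations (in g/kg) are made-up illustrative values, not the study's data:

```python
from math import sqrt

def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - (residual sum of squares / total sum of squares)."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

def rmse(y_true, y_pred):
    """Root mean square error in the units of the target (here g/kg)."""
    return sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))

# hypothetical soil salinity observations vs. model predictions (g/kg)
y_true = [1.2, 0.8, 2.5, 1.9, 0.4]
y_pred = [1.0, 0.9, 2.2, 2.1, 0.6]
print(round(r2_score(y_true, y_pred), 3), round(rmse(y_true, y_pred), 3))
```

Reporting both metrics together, as the authors do, is informative because R2 is scale-free while RMSE stays in physical units, so a model can look strong on one and weak on the other.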
16 pages, 1471 KiB  
Article
Leveraging Machine Learning Techniques to Predict Cardiovascular Heart Disease
by Remzi Başar, Öznur Ocak, Alper Erturk and Marcelle de la Roche
Information 2025, 16(8), 639; https://doi.org/10.3390/info16080639 - 27 Jul 2025
Abstract
Cardiovascular diseases (CVDs) remain the leading cause of death globally, underscoring the urgent need for data-driven early diagnostic tools. This study proposes a multilayer artificial neural network (ANN) model for heart disease prediction, developed using a real-world clinical dataset comprising 13,981 patient records. Implemented on the Orange data mining platform, the ANN was trained using backpropagation and validated through 10-fold cross-validation. Dimensionality reduction via principal component analysis (PCA) enhanced computational efficiency, while Shapley additive explanations (SHAP) were used to interpret model outputs. Despite achieving 83.4% accuracy and high specificity, the model exhibited poor sensitivity to disease cases, identifying only 76 of 2233 positive samples, with a Matthews correlation coefficient (MCC) of 0.058. Comparative benchmarks showed that random forest and support vector machines significantly outperformed the ANN in terms of discrimination (AUC up to 91.6%). SHAP analysis revealed serum creatinine, diabetes, and hemoglobin levels to be the dominant predictors. To address the current study’s limitations, future work will explore LIME, Grad-CAM, and ensemble techniques like XGBoost to improve interpretability and balance. This research emphasizes the importance of explainability, data representativeness, and robust evaluation in the development of clinically reliable AI tools for heart disease detection. Full article
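The gap the authors report between 83.4% accuracy and an MCC of 0.058 illustrates why MCC matters on imbalanced clinical data. In the sketch below the TP and FN counts come from the abstract (76 of 2,233 positives identified); the TN and FP counts are back-calculated from the stated accuracy and are therefore assumptions, not reported values:

```python
from math import sqrt

def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient from a binary confusion matrix."""
    num = tp * tn - fp * fn
    den = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return num / den if den else 0.0

# TP and FN from the abstract; TN and FP reconstructed (approximately) from
# 13,981 records and 83.4% accuracy -- assumed values, not from the paper.
tp, fn = 76, 2233 - 76
tn, fp = 11584, 164
accuracy = (tp + tn) / (tp + tn + fp + fn)
print(round(accuracy, 3), round(mcc(tp, tn, fp, fn), 3))
```

Because the negative class dominates, a model that rarely flags disease can still score high accuracy and specificity; MCC, which balances all four confusion-matrix cells, collapses toward zero, matching the near-chance 0.058 reported.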
(This article belongs to the Special Issue Information Systems in Healthcare)
28 pages, 5172 KiB  
Article
Machine Learning-Assisted Sustainable Mix Design of Waste Glass Powder Concrete with Strength–Cost–CO2 Emissions Trade-Offs
by Yuzhuo Zhang, Jiale Peng, Zi Wang, Meng Xi, Jinlong Liu and Lei Xu
Buildings 2025, 15(15), 2640; https://doi.org/10.3390/buildings15152640 - 26 Jul 2025
Abstract
Glass powder, a non-degradable waste material, offers significant potential to reduce cement consumption and carbon emissions in concrete production. However, existing mix design methods for glass powder concrete (GPC) fail to systematically balance economic efficiency, environmental sustainability, and mechanical performance. To address this gap, this study proposes an AI-assisted framework integrating machine learning (ML) and Multi-Objective Optimization (MOO) to achieve a sustainable GPC design. A robust database of 1154 experimental records was developed, focusing on five key predictors: cement content, water-to-binder ratio, aggregate composition, glass powder content, and curing age. Seven ML models were optimized via Bayesian tuning, with the Ensemble Tree model achieving superior accuracy (R2 = 0.959 on test data). SHapley Additive exPlanations (SHAP) analysis further elucidated the contribution mechanisms and underlying interactions of material components on GPC compressive strength. Subsequently, a MOO framework minimized unit cost and CO2 emissions while meeting compressive strength targets (15–70 MPa), solved using the NSGA-II algorithm for Pareto solutions and TOPSIS for decision-making. The Pareto-optimal solutions provide actionable guidelines for engineers to align GPC design with circular economy principles and low-carbon policies. This work advances sustainable construction practices by bridging AI-driven innovation with building materials, directly supporting global goals for waste valorization and carbon neutrality. Full article
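The two decision steps named in the abstract, extracting a Pareto front of non-dominated mixes and then ranking it with TOPSIS, can be sketched for a two-objective cost/CO2 minimization. The candidate (unit cost, CO2) pairs below are illustrative numbers, not solutions from the paper:

```python
from math import sqrt

def dominates(q, p):
    """q dominates p when it is no worse in every objective and strictly better in one."""
    return all(a <= b for a, b in zip(q, p)) and any(a < b for a, b in zip(q, p))

def pareto_front(points):
    """Non-dominated solutions of a minimization problem."""
    return [p for p in points if not any(dominates(q, p) for q in points)]

def topsis_best(points, weights=None):
    """TOPSIS ranking (all criteria minimized): return index closest to the ideal."""
    n = len(points[0])
    weights = weights or [1.0 / n] * n
    norms = [sqrt(sum(p[i] ** 2 for p in points)) for i in range(n)]
    v = [[weights[i] * p[i] / norms[i] for i in range(n)] for p in points]
    ideal = [min(col) for col in zip(*v)]  # best attainable value per criterion
    anti = [max(col) for col in zip(*v)]   # worst value per criterion
    def closeness(row):
        d_plus = sqrt(sum((a - b) ** 2 for a, b in zip(row, ideal)))
        d_minus = sqrt(sum((a - b) ** 2 for a, b in zip(row, anti)))
        return d_minus / (d_plus + d_minus)
    return max(range(len(points)), key=lambda i: closeness(v[i]))

# hypothetical (unit cost, CO2 emissions) pairs for candidate GPC mixes
mixes = [(120, 300), (100, 340), (140, 260), (130, 330), (110, 310)]
front = pareto_front(mixes)
print(front, front[topsis_best(front)])
```

This mirrors the paper's pipeline shape only: NSGA-II would generate the candidate set via evolutionary search rather than filtering a fixed list, and TOPSIS then turns the Pareto set into a single recommended mix.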