Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (4,654)

Search Parameters:
Keywords = random forest importance

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
22 pages, 2120 KiB  
Article
Machine Learning Algorithms and Explainable Artificial Intelligence for Property Valuation
by Gabriella Maselli and Antonio Nesticò
Real Estate 2025, 2(3), 12; https://doi.org/10.3390/realestate2030012 - 1 Aug 2025
Abstract
The accurate estimation of urban property values is a key challenge for appraisers, market participants, financial institutions, and urban planners. In recent years, machine learning (ML) techniques have emerged as promising tools for price forecasting due to their ability to model complex relationships [...] Read more.
The accurate estimation of urban property values is a key challenge for appraisers, market participants, financial institutions, and urban planners. In recent years, machine learning (ML) techniques have emerged as promising tools for price forecasting due to their ability to model complex relationships among variables. However, their application raises two main critical issues: (i) the risk of overfitting, especially with small datasets or with noisy data; (ii) the interpretive issues associated with the “black box” nature of many models. Within this framework, this paper proposes a methodological approach that addresses both these issues, comparing the predictive performance of three ML algorithms—k-Nearest Neighbors (kNN), Random Forest (RF), and the Artificial Neural Network (ANN)—applied to the housing market in the city of Salerno, Italy. For each model, overfitting is preliminarily assessed to ensure predictive robustness. Subsequently, the results are interpreted using explainability techniques, such as SHapley Additive exPlanations (SHAPs) and Permutation Feature Importance (PFI). This analysis reveals that the Random Forest offers the best balance between predictive accuracy and transparency, with features such as area and proximity to the train station identified as the main drivers of property prices. kNN and the ANN are viable alternatives that are particularly robust in terms of generalization. The results demonstrate how the defined methodological framework successfully balances predictive effectiveness and interpretability, supporting the informed and transparent use of ML in real estate valuation. Full article
Show Figures

Figure 1

15 pages, 1635 KiB  
Article
Modeling the Abrasive Index from Mineralogical and Calorific Properties Using Tree-Based Machine Learning: A Case Study on the KwaZulu-Natal Coalfield
by Mohammad Afrazi, Chia Yu Huat, Moshood Onifade, Manoj Khandelwal, Deji Olatunji Shonuga, Hadi Fattahi and Danial Jahed Armaghani
Mining 2025, 5(3), 48; https://doi.org/10.3390/mining5030048 (registering DOI) - 1 Aug 2025
Abstract
Accurate prediction of the coal abrasive index (AI) is critical for optimizing coal processing efficiency and minimizing equipment wear in industrial applications. This study explores tree-based machine learning models; Random Forest (RF), Gradient Boosting Trees (GBT), and Extreme Gradient Boosting (XGBoost) to predict [...] Read more.
Accurate prediction of the coal abrasive index (AI) is critical for optimizing coal processing efficiency and minimizing equipment wear in industrial applications. This study explores tree-based machine learning models; Random Forest (RF), Gradient Boosting Trees (GBT), and Extreme Gradient Boosting (XGBoost) to predict AI using selected coal properties. A database of 112 coal samples from the KwaZulu-Natal Coalfield in South Africa was used. Initial predictions using all eight input properties revealed suboptimal testing performance (R2: 0.63–0.72), attributed to outliers and noisy data. Feature importance analysis identified calorific value, quartz, ash, and Pyrite as dominant predictors, aligning with their physicochemical roles in abrasiveness. After data cleaning and feature selection, XGBoost achieved superior accuracy (R2 = 0.92), outperforming RF (R2 = 0.85) and GBT (R2 = 0.81). The results highlight XGBoost’s robustness in modeling non-linear relationships between coal properties and AI. This approach offers a cost-effective alternative to traditional laboratory methods, enabling industries to optimize coal selection, reduce maintenance costs, and enhance operational sustainability through data-driven decision-making. Additionally, quartz and Ash content were identified as the most influential parameters on AI using the Cosine Amplitude technique, while calorific value had the least impact among the selected features. Full article
(This article belongs to the Special Issue Mine Automation and New Technologies)
Show Figures

Figure 1

18 pages, 723 KiB  
Article
A Machine Learning-Based Model for Predicting High Deficiency Risk Ships in Port State Control: A Case Study of the Port of Singapore
by Ming-Cheng Tsou
J. Mar. Sci. Eng. 2025, 13(8), 1485; https://doi.org/10.3390/jmse13081485 - 31 Jul 2025
Abstract
This study developed a model to predict ships with high deficiency risk under Port State Control (PSC) through machine learning techniques, particularly the Random Forest algorithm. The study utilized actual ship inspection data from the Port of Singapore, comprehensively considering various operational and [...] Read more.
This study developed a model to predict ships with high deficiency risk under Port State Control (PSC) through machine learning techniques, particularly the Random Forest algorithm. The study utilized actual ship inspection data from the Port of Singapore, comprehensively considering various operational and safety indicators of ships, including but not limited to flag state, ship age, past deficiencies, and detention history. By analyzing these factors in depth, this research enhances the efficiency and accuracy of PSC inspections, provides decision support for port authorities, and offers strategic guidance for shipping companies to comply with international safety standards. During the research process, I first conducted detailed data preprocessing, including data cleaning and feature selection, to ensure the effectiveness of model training. Using the Random Forest algorithm, I identified key factors influencing the detention risk of ships and established a risk prediction model accordingly. The model validation results indicated that factors such as ship age, tonnage, company performance, and flag state significantly affect whether a ship exhibits a high deficiency rate. In addition, this study explored the potential and limitations of applying the Random Forest model in predicting high deficiency risk under PSC, and proposed future research directions, including further model optimization and the development of real-time prediction systems. By achieving these goals, I hope to provide valuable experience for other global shipping hubs, promote higher international maritime safety standards, and contribute to the sustainable development of the global shipping industry. This research not only highlights the importance of machine learning in the maritime domain but also demonstrates the potential of data-driven decision-making in improving ship safety management and port inspection efficiency. It is hoped that this study will inspire more maritime practitioners and researchers to explore advanced data analytics techniques to address the increasingly complex challenges of global shipping. Full article
(This article belongs to the Topic Digital Technologies in Supply Chain Risk Management)
21 pages, 3532 KiB  
Article
Machine Learning Prediction of CO2 Diffusion in Brine: Model Development and Salinity Influence Under Reservoir Conditions
by Qaiser Khan, Peyman Pourafshary, Fahimeh Hadavimoghaddam and Reza Khoramian
Appl. Sci. 2025, 15(15), 8536; https://doi.org/10.3390/app15158536 (registering DOI) - 31 Jul 2025
Abstract
The diffusion coefficient (DC) of CO2 in brine is a key parameter in geological carbon sequestration and CO2-Enhanced Oil Recovery (EOR), as it governs mass transfer efficiency and storage capacity. This study employs three machine learning (ML) models—Random Forest (RF), [...] Read more.
The diffusion coefficient (DC) of CO2 in brine is a key parameter in geological carbon sequestration and CO2-Enhanced Oil Recovery (EOR), as it governs mass transfer efficiency and storage capacity. This study employs three machine learning (ML) models—Random Forest (RF), Gradient Boost Regressor (GBR), and Extreme Gradient Boosting (XGBoost)—to predict DC based on pressure, temperature, and salinity. The dataset, comprising 176 data points, spans pressures from 0.10 to 30.00 MPa, temperatures from 286.15 to 398.00 K, salinities from 0.00 to 6.76 mol/L, and DC values from 0.13 to 4.50 × 10−9 m2/s. The data was split into 80% for training and 20% for testing to ensure reliable model evaluation. Model performance was assessed using R2, RMSE, and MAE. The RF model demonstrated the best performance, with an R2 of 0.95, an RMSE of 0.03, and an MAE of 0.11 on the test set, indicating high predictive accuracy and generalization capability. In comparison, GBR achieved an R2 of 0.925, and XGBoost achieved an R2 of 0.91 on the test set. Feature importance analysis consistently identified temperature as the most influential factor, followed by salinity and pressure. This study highlights the potential of ML models for predicting CO2 diffusion in brine, providing a robust, data-driven framework for optimizing CO2-EOR processes and carbon storage strategies. The findings underscore the critical role of temperature in diffusion behavior, offering valuable insights for future modeling and operational applications. Full article
Show Figures

Figure 1

28 pages, 5503 KiB  
Article
Feature Selection Framework for Improved UAV-Based Detection of Solenopsis invicta Mounds in Agricultural Landscapes
by Chun-Han Shih, Cheng-En Song, Su-Fen Wang and Chung-Chi Lin
Insects 2025, 16(8), 793; https://doi.org/10.3390/insects16080793 (registering DOI) - 31 Jul 2025
Abstract
The red imported fire ant (RIFA; Solenopsis invicta) is an invasive species that severely threatens ecology, agriculture, and public health in Taiwan. In this study, the feasibility of applying multispectral imagery captured by unmanned aerial vehicles (UAVs) to detect red fire ant [...] Read more.
The red imported fire ant (RIFA; Solenopsis invicta) is an invasive species that severely threatens ecology, agriculture, and public health in Taiwan. In this study, the feasibility of applying multispectral imagery captured by unmanned aerial vehicles (UAVs) to detect red fire ant mounds was evaluated in Fenlin Township, Hualien, Taiwan. A DJI Phantom 4 multispectral drone collected reflectance in five bands (blue, green, red, red-edge, and near-infrared), derived indices (normalized difference vegetation index, NDVI, soil-adjusted vegetation index, SAVI, and photochemical pigment reflectance index, PPR), and textural features. According to analysis of variance F-scores and random forest recursive feature elimination, vegetation indices and spectral features (e.g., NDVI, NIR, SAVI, and PPR) were the most significant predictors of ecological characteristics such as vegetation density and soil visibility. Texture features exhibited moderate importance and the potential to capture intricate spatial patterns in nonlinear models. Despite limitations in the analytics, including trade-offs related to flight height and environmental variability, the study findings suggest that UAVs are an inexpensive, high-precision means of obtaining multispectral data for RIFA monitoring. These findings can be used to develop efficient mass-detection protocols for integrated pest control, with broader implications for invasive species monitoring. Full article
(This article belongs to the Special Issue Surveillance and Management of Invasive Insects)
Show Figures

Figure 1

18 pages, 4863 KiB  
Article
Evaluation of Explainable, Interpretable and Non-Interpretable Algorithms for Cyber Threat Detection
by José Ramón Trillo, Felipe González-López, Juan Antonio Morente-Molinera, Roberto Magán-Carrión and Pablo García-Sánchez
Electronics 2025, 14(15), 3073; https://doi.org/10.3390/electronics14153073 (registering DOI) - 31 Jul 2025
Abstract
As anonymity-enabling technologies such as VPNs and proxies become increasingly exploited for malicious purposes, detecting traffic associated with such services emerges as a critical first step in anticipating potential cyber threats. This study analyses a network traffic dataset focused on anonymised IP addresses—not [...] Read more.
As anonymity-enabling technologies such as VPNs and proxies become increasingly exploited for malicious purposes, detecting traffic associated with such services emerges as a critical first step in anticipating potential cyber threats. This study analyses a network traffic dataset focused on anonymised IP addresses—not direct attacks—to evaluate and compare explainable, interpretable, and opaque machine learning models. Through advanced preprocessing and feature engineering, we examine the trade-off between model performance and transparency in the early detection of suspicious connections. We evaluate explainable ML-based models such as k-nearest neighbours, fuzzy algorithms, decision trees, and random forests, alongside interpretable models like naïve Bayes, support vector machines, and non-interpretable algorithms such as neural networks. Results show that neural networks achieve the highest performance, with a macro F1-score of 0.8786, but explainable models like HFER offer strong performance (macro F1-score = 0.6106) with greater interpretability. The choice of algorithm depends on project-specific needs: neural networks excel in accuracy, while explainable algorithms are preferred for resource efficiency and transparency, as stated in this work. This work underscores the importance of aligning cybersecurity strategies with operational requirements, providing insights into balancing performance with interpretability. Full article
(This article belongs to the Special Issue Network Security and Cryptography Applications)
Show Figures

Graphical abstract

35 pages, 3218 KiB  
Article
Integrated GBR–NSGA-II Optimization Framework for Sustainable Utilization of Steel Slag in Road Base Layers
by Merve Akbas
Appl. Sci. 2025, 15(15), 8516; https://doi.org/10.3390/app15158516 (registering DOI) - 31 Jul 2025
Abstract
This study proposes an integrated, machine learning-based multi-objective optimization framework to evaluate and optimize the utilization of steel slag in road base layers, simultaneously addressing economic costs and environmental impacts. A comprehensive dataset of 482 scenarios was engineered based on literature-informed parameters, encompassing [...] Read more.
This study proposes an integrated, machine learning-based multi-objective optimization framework to evaluate and optimize the utilization of steel slag in road base layers, simultaneously addressing economic costs and environmental impacts. A comprehensive dataset of 482 scenarios was engineered based on literature-informed parameters, encompassing transport distance, processing energy intensity, initial moisture content, gradation adjustments, and regional electricity emission factors. Four advanced tree-based ensemble regression algorithms—Random Forest Regressor (RFR), Extremely Randomized Trees (ERTs), Gradient Boosted Regressor (GBR), and Extreme Gradient Boosting Regressor (XGBR)—were rigorously evaluated. Among these, GBR demonstrated superior predictive performance (R2 > 0.95, RMSE < 7.5), effectively capturing complex nonlinear interactions inherent in slag processing and logistics operations. Feature importance analysis via SHapley Additive exPlanations (SHAP) provided interpretative insights, highlighting transport distance and energy intensity as dominant factors affecting unit cost, while moisture content and grid emission factor predominantly influenced CO2 emissions. Subsequently, the Gradient Boosted Regressor model was integrated into a Non-Dominated Sorting Genetic Algorithm II (NSGA-II) framework to explore optimal trade-offs between cost and emissions. The resulting Pareto front revealed a diverse solution space, with significant nonlinear trade-offs between economic efficiency and environmental performance, clearly identifying strategic inflection points. To facilitate actionable decision-making, the Technique for Order Preference by Similarity to Ideal Solution (TOPSIS) method was applied, identifying an optimal balanced solution characterized by a transport distance of 47 km, energy intensity of 1.21 kWh/ton, moisture content of 6.2%, moderate gradation adjustment, and a grid CO2 factor of 0.47 kg CO2/kWh. This scenario offered a substantial reduction (45%) in CO2 emissions relative to cost-minimized solutions, with a moderate increase (33%) in total cost, presenting a realistic and balanced pathway for sustainable infrastructure practices. Overall, this study introduces a robust, scalable, and interpretable optimization framework, providing valuable methodological advancements for sustainable decision making in infrastructure planning and circular economy initiatives. Full article
Show Figures

Figure 1

30 pages, 10011 KiB  
Article
Machine Learning Methods as a Tool for Analysis and Prediction of Impact Resistance of Rubber–Textile Conveyor Belts
by Miriam Andrejiova, Anna Grincova, Daniela Marasova and Zuzana Kimakova
Appl. Sci. 2025, 15(15), 8511; https://doi.org/10.3390/app15158511 (registering DOI) - 31 Jul 2025
Abstract
Rubber–textile conveyor belts are an important element of large-scale transport systems, which in many cases are subjected to excessive dynamic loads. Assessing the impact resistance of them is essential for ensuring their reliability and longevity. The article focuses on the use of machine [...] Read more.
Rubber–textile conveyor belts are an important element of large-scale transport systems, which in many cases are subjected to excessive dynamic loads. Assessing the impact resistance of them is essential for ensuring their reliability and longevity. The article focuses on the use of machine learning methods as one of the approaches to the analysis and prediction of the impact resistance of rubber–textile conveyor belts. Based on the data obtained from the design properties of conveyor belts and experimental testing conditions, four models were created (regression model, decision tree regression model, random forest model, ANN model), which are used to analyze and predict the impact force of the force acting on the conveyor belt during material impact. Each model was trained on training data and validated on test data. The performance of each model was evaluated using standard metrics and model indicators. The results of the model analysis show that the most powerful model, ANN, explains up to 99.6% of the data variability. The second-best model is the random forest model and then the regression model. The least suitable choice for predicting the impact force is the regression tree. Full article
Show Figures

Figure 1

15 pages, 2057 KiB  
Article
Machine Learning-Based Prediction of Atmospheric Corrosion Rates Using Environmental and Material Parameters
by Saurabh Tiwari, Khushbu Dash, Nokeun Park and Nagireddy Gari Subba Reddy
Coatings 2025, 15(8), 888; https://doi.org/10.3390/coatings15080888 (registering DOI) - 31 Jul 2025
Abstract
Atmospheric corrosion significantly impacts infrastructure worldwide, with traditional assessment methods being time-intensive and costly. This study developed a comprehensive machine learning framework for predicting atmospheric corrosion rates using environmental and material parameters. Three regression models (Linear Regression, Random Forest, and Gradient Boosting) were [...] Read more.
Atmospheric corrosion significantly impacts infrastructure worldwide, with traditional assessment methods being time-intensive and costly. This study developed a comprehensive machine learning framework for predicting atmospheric corrosion rates using environmental and material parameters. Three regression models (Linear Regression, Random Forest, and Gradient Boosting) were trained on a scientifically informed synthetic dataset incorporating established corrosion principles from ISO 9223 standards and peer-reviewed literature. The Gradient Boosting model achieved superior performance with cross-validated R2 = 0.835 ± 0.024 and RMSE = 98.99 ± 16.62 μm/year, significantly outperforming the Random Forest (p < 0.001) and Linear Regression approaches. Feature importance analysis revealed the copper content (30%), exposure time (20%), and chloride deposition (15%) as primary predictors, consistent with the established principles of corrosion science. Model diagnostics demonstrated excellent predictive accuracy (R2 = 0.863) with normally distributed residuals and homoscedastic variance patterns. This methodology provides a systematic framework for ML-based corrosion prediction, with significant implications for protective coating design, material selection, and infrastructure risk assessment, pending comprehensive experimental validation. Full article
(This article belongs to the Special Issue Advanced Anticorrosion Coatings and Coating Testing)
Show Figures

Figure 1

18 pages, 2263 KiB  
Article
Predicting Antimicrobial Peptide Activity: A Machine Learning-Based Quantitative Structure–Activity Relationship Approach
by Eliezer I. Bonifacio-Velez de Villa, María E. Montoya-Alfaro, Luisa P. Negrón-Ballarte and Christian Solis-Calero
Pharmaceutics 2025, 17(8), 993; https://doi.org/10.3390/pharmaceutics17080993 (registering DOI) - 31 Jul 2025
Abstract
Background: Peptides are a class of molecules that can be presented as good antimicrobials and with mechanisms that avoid resistance, and the design of peptides with good activity can be complex and laborious. The study of their quantitative structure–activity relationships through machine [...] Read more.
Background: Peptides are a class of molecules that can be presented as good antimicrobials and with mechanisms that avoid resistance, and the design of peptides with good activity can be complex and laborious. The study of their quantitative structure–activity relationships through machine learning algorithms can shed light on a rational and effective design. Methods: Information on the antimicrobial activity of peptides was collected, and their structures were characterized by molecular descriptors generation to design regression and classification models based on machine learning algorithms. The contribution of each descriptor in the generated models was evaluated by determining its relative importance and, finally, the antimicrobial activity of new peptides was estimated. Results: A structured database of antimicrobial peptides and their descriptors was obtained, with which 56 machine learning models were generated. Random Forest-based models showed better performance, and of these, regression models showed variable performance (R2 = 0.339–0.574), while classification models showed good performance (MCC = 0.662–0.755 and ACC = 0.831–0.877). Those models based on bacterial groups showed better performance than those based on the entire dataset. The properties of the new peptides generated are related to important descriptors that encode physicochemical properties such as lower molecular weight, higher charge, propensity to form alpha-helical structures, lower hydrophobicity, and higher frequency of amino acids such as lysine and serine. Conclusions: Machine learning models allowed to establish the structure–activity relationships of antimicrobial peptides. Classification models performed better than regression models. These models allowed us to make predictions and new peptides with high antimicrobial potential were proposed. Full article
Show Figures

Graphical abstract

16 pages, 1182 KiB  
Article
Machine Learning-Based Identification of Risk Factors for ICU Mortality in 8902 Critically Ill Patients with Pandemic Viral Infection
by Elisabeth Papiol, Ricard Ferrer, Juan C. Ruiz-Rodríguez, Emili Díaz, Rafael Zaragoza, Marcio Borges-Sa, Julen Berrueta, Josep Gómez, María Bodí, Susana Sancho, Borja Suberviola, Sandra Trefler and Alejandro Rodríguez
J. Clin. Med. 2025, 14(15), 5383; https://doi.org/10.3390/jcm14155383 - 30 Jul 2025
Abstract
Background/Objectives: The SARS-CoV-2 and influenza A (H1N1)pdm09 pandemics have resulted in high numbers of ICU admissions, with high mortality. Identifying risk factors for ICU mortality at the time of admission can help optimize clinical decision making. However, the risk factors identified may [...] Read more.
Background/Objectives: The SARS-CoV-2 and influenza A (H1N1)pdm09 pandemics have resulted in high numbers of ICU admissions, with high mortality. Identifying risk factors for ICU mortality at the time of admission can help optimize clinical decision making. However, the risk factors identified may differ, depending on the type of analysis used. Our aim is to compare the risk factors and performance of a linear model (multivariable logistic regression, GLM) with a non-linear model (random forest, RF) in a large national cohort. Methods: A retrospective analysis was performed on a multicenter database including 8902 critically ill patients with influenza A (H1N1)pdm09 or COVID-19 admitted to 184 Spanish ICUs. Demographic, clinical, laboratory, and microbiological data from the first 24 h were used. Prediction models were built using GLM and RF. The performance of the GLM was evaluated by area under the ROC curve (AUC), precision, sensitivity, and specificity, while the RF by out-of-bag (OOB) error and accuracy. In addition, in the RF, the im-portance of the variables in terms of accuracy reduction (AR) and Gini index reduction (GI) was determined. Results: Overall mortality in the ICU was 25.8%. Model performance was similar, with AUC = 76% for GLM, and AUC = 75.6% for RF. GLM identified 17 independent risk factors, while RF identified 19 for AR and 23 for GI. Thirteen variables were found to be important in both models. Laboratory variables such as procalcitonin, white blood cells, lactate, or D-dimer levels were not significant in GLM but were significant in RF. On the contrary, acute kidney injury and the presence of Acinetobacter spp. were important variables in the GLM but not in the RF. Conclusions: Although the performance of linear and non-linear models was similar, different risk factors were determined, depending on the model used. This alerts clinicians to the limitations and usefulness of studies limited to a single type of model. Full article
(This article belongs to the Special Issue Current Trends and Prospects of Critical Emergency Medicine)
Show Figures

Figure 1

18 pages, 1777 KiB  
Article
Machine Learning in Sensory Analysis of Mead—A Case Study: Ensembles of Classifiers
by Krzysztof Przybył, Daria Cicha-Wojciechowicz, Natalia Drabińska and Małgorzata Anna Majcher
Molecules 2025, 30(15), 3199; https://doi.org/10.3390/molecules30153199 - 30 Jul 2025
Abstract
The aim was to explore using machine learning (including cluster mapping and k-means methods) to classify types of mead based on sensory analysis and aromatic compounds. Machine learning is a modern tool that helps with detailed analysis, especially because verifying aromatic compounds is [...] Read more.
The aim was to explore using machine learning (including cluster mapping and k-means methods) to classify types of mead based on sensory analysis and aromatic compounds. Machine learning is a modern tool that helps with detailed analysis, especially because verifying aromatic compounds is challenging. In the first stage, a cluster map analysis was conducted, allowing for the exploratory identification of the most characteristic features of mead. Based on this, k-means clustering was performed to evaluate how well the identified sensory features align with logically consistent groups of observations. In the next stage, experiments were carried out to classify the type of mead using algorithms such as Random Forest (RF), adaptive boosting (AdaBoost), Bootstrap aggregation (Bagging), K-Nearest Neighbors (KNN), and Decision Tree (DT). The analysis revealed that the RF and KNN algorithms were the most effective in classifying mead based on sensory characteristics, achieving the highest accuracy. In contrast, the AdaBoost algorithm consistently produced the lowest accuracy results. However, the Decision Tree algorithm achieved the highest accuracy value (0.909), demonstrating its potential for precise classification based on aroma characteristics. The error matrix analysis also indicated that acacia mead was easier for the algorithms to identify than tilia or buckwheat mead. The results show the potential of combining an exploratory approach (cluster map with the k-means method) with machine learning. It is also important to focus on selecting and optimizing classification models used in practice because, as the results so far indicate, choosing the right algorithm greatly affects the success of mead identification. Full article
(This article belongs to the Special Issue Analytical Technologies and Intelligent Applications in Future Food)
Show Figures

Graphical abstract

16 pages, 1833 KiB  
Article
Prediction of Waste Generation Using Machine Learning: A Regional Study in Korea
by Jae-Sang Lee and Dong-Chul Shin
Urban Sci. 2025, 9(8), 297; https://doi.org/10.3390/urbansci9080297 - 30 Jul 2025
Viewed by 26
Abstract
Accurate forecasting of household waste generation is essential for sustainable urban planning and the development of data-driven environmental policies. Conventional statistical models, while simple and interpretable, often fail to capture the nonlinear and multidimensional relationships inherent in waste production patterns. This study proposes [...] Read more.
Accurate forecasting of household waste generation is essential for sustainable urban planning and the development of data-driven environmental policies. Conventional statistical models, while simple and interpretable, often fail to capture the nonlinear and multidimensional relationships inherent in waste production patterns. This study proposes a machine learning-based regression framework utilizing Random Forest and XGBoost algorithms to predict annual household waste generation across four metropolitan regions in South Korea Seoul, Gyeonggi, Incheon, and Jeju over the period from 2000 to 2023. Independent variables include demographic indicators (total population, working-age population, elderly population), economic indicators (Gross Regional Domestic Product), and regional identifiers encoded using One-Hot Encoding. A derived feature, elderly ratio, was introduced to reflect population aging. Model performance was evaluated using R2, RMSE, and MAE, with artificial noise added to simulate uncertainty. Random Forest demonstrated superior generalization and robustness to data irregularities, especially in data-scarce regions like Jeju. SHAP-based interpretability analysis revealed total population and GRDP as the most influential features. The findings underscore the importance of incorporating economic indicators in waste forecasting models, as demographic variables alone were insufficient for explaining waste dynamics. This approach provides valuable insights for policymakers and supports the development of adaptive, region-specific strategies for waste reduction and infrastructure investment. Full article
Show Figures

Figure 1

23 pages, 7371 KiB  
Article
A Novel Method for Estimating Building Height from Baidu Panoramic Street View Images
by Shibo Ge, Jiping Liu, Xianghong Che, Yong Wang and Haosheng Huang
ISPRS Int. J. Geo-Inf. 2025, 14(8), 297; https://doi.org/10.3390/ijgi14080297 - 30 Jul 2025
Viewed by 44
Abstract
Building height information plays an important role in many urban-related applications, such as urban planning, disaster management, and environmental studies. With the rapid development of real scene maps, street view images are becoming a new data source for building height estimation, considering their [...] Read more.
Building height information plays an important role in many urban-related applications, such as urban planning, disaster management, and environmental studies. With the rapid development of real scene maps, street view images are becoming a new data source for building height estimation, considering their easy collection and low cost. However, existing studies on building height estimation primarily utilize remote sensing images, with little exploration of height estimation from street-view images. In this study, we proposed a deep learning-based method for estimating the height of a single building in Baidu panoramic street view imagery. Firstly, the Segment Anything Model was used to extract the region of interest image and location features of individual buildings from the panorama. Subsequently, a cross-view matching algorithm was proposed by combining Baidu panorama and building footprint data with height information to generate building height samples. Finally, a Two-Branch feature fusion model (TBFF) was constructed to combine building location features and visual features, enabling accurate height estimation for individual buildings. The experimental results showed that the TBFF model had the best performance, with an RMSE of 5.69 m, MAE of 3.97 m, and MAPE of 0.11. Compared with two state-of-the-art methods, the TBFF model exhibited robustness and higher accuracy. The Random Forest model had an RMSE of 11.83 m, MAE of 4.76 m, and MAPE of 0.32, and the Pano2Geo model had an RMSE of 10.51 m, MAE of 6.52 m, and MAPE of 0.22. The ablation analysis demonstrated that fusing building location and visual features can improve the accuracy of height estimation by 14.98% to 69.99%. Moreover, the accuracy of the proposed method meets the LOD1 level 3D modeling requirements defined by the OGC (height error ≤ 5 m), which can provide data support for urban research. Full article
Show Figures

Figure 1

20 pages, 3810 KiB  
Article
Exploring Drought Response: Machine-Learning-Based Classification of Rice Tolerance Using Root and Physiological Traits
by Wuttichai Gunnula, Nantawan Kanawapee, Hathairat Chokthaweepanich and Piyaporn Phansak
Agronomy 2025, 15(8), 1840; https://doi.org/10.3390/agronomy15081840 - 29 Jul 2025
Viewed by 216
Abstract
Drought is a key limitation for rice productivity. While oxidative stress markers like hydrogen peroxide (H2O2) are important for drought adaptation, the predictive value of combining root anatomical and physiological traits is underexplored. We assessed 20 rice cultivars under [...] Read more.
Drought is a key limitation for rice productivity. While oxidative stress markers like hydrogen peroxide (H2O2) are important for drought adaptation, the predictive value of combining root anatomical and physiological traits is underexplored. We assessed 20 rice cultivars under drought and control conditions using a random forest, a multi-layer perceptron, and a SHAP-optimized stacking ensemble. The stacking ensemble achieved the highest classification accuracy (81.8%) and identified hydrogen peroxide, relative water content, and endodermis inner circumference as key predictors. SHAP analysis revealed important interactions between root anatomical and physiological traits, providing new biological insights into drought tolerance. Our integrative approach, supported by robust cross-validation, improves predictive power and transparency for breeding drought-resilient rice cultivars. Full article
(This article belongs to the Section Precision and Digital Agriculture)
Show Figures

Figure 1

Back to TopTop