Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (328)

Search Parameters:
Keywords = random forest regression (RFR)

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
21 pages, 10154 KB  
Article
Sea Ice Concentration Retrieval in the Arctic and Antarctic Using FY-3E GNSS-R Data
by Tingyu Xie, Cong Yin, Weihua Bai, Dongmei Song, Feixiong Huang, Junming Xia, Xiaochun Zhai, Yueqiang Sun, Qifei Du and Bin Wang
Remote Sens. 2026, 18(2), 285; https://doi.org/10.3390/rs18020285 - 15 Jan 2026
Abstract
Recognizing the critical role of polar Sea Ice Concentration (SIC) in climate feedback mechanisms, this study presents the first comprehensive investigation of China’s Fengyun-3E(FY-3E) GNOS-II Global Navigation Satellite System Reflectometry (GNSS-R) for bipolar SIC retrieval. Specifically, reflected signals from multiple Global Navigation Satellite [...] Read more.
Recognizing the critical role of polar Sea Ice Concentration (SIC) in climate feedback mechanisms, this study presents the first comprehensive investigation of China’s Fengyun-3E(FY-3E) GNOS-II Global Navigation Satellite System Reflectometry (GNSS-R) for bipolar SIC retrieval. Specifically, reflected signals from multiple Global Navigation Satellite Systems (GNSS) are utilized to extract characteristic parameters from Delay Doppler Maps (DDMs). By integrating regional partitioning and dynamic thresholding for sea ice detection, a Random Forest Regression (RFR) model incorporating a rolling-window training strategy is developed to estimate SIC. The retrieved SIC products are generated at the native GNSS-R observation resolution of approximately 1 × 6 km, with each SIC estimate corresponding to an individual GNSS-R observation time. Owing to the limited daily spatial coverage of GNSS-R measurements, the retrieved SIC results are further aggregated into monthly composites for spatial distribution analysis. The model is trained and validated across both polar regions, including targeted ice–water boundary zones. Retrieved SIC estimates are compared with reference data from the OSI SAF Special Sensor Microwave Imager Sounder (SSMIS), demonstrating strong agreement. Based on an extensive dataset, the average correlation coefficient (R) reaches 0.9450 in the Arctic and 0.9602 in the Antarctic for the testing set, with corresponding Root Mean Squared Error (RMSE) of 0.1262 and 0.0818, respectively. Even in the more challenging ice–water transition zones, RMSE values remain within acceptable ranges, reaching 0.1486 in the Arctic and 0.1404 in the Antarctic. This study demonstrates the feasibility and accuracy of GNSS-R-based SIC retrieval, offering a robust and effective approach for cryospheric monitoring at high latitudes in both polar regions. Full article
Show Figures

Figure 1

16 pages, 1477 KB  
Article
Machine Learning-Based Modeling of Tractor Fuel and Energy Efficiency During Chisel Plough Tillage
by Ergün Çıtıl, Kazım Çarman, Muhammet Furkan Atalay, Nicoleta Ungureanu and Nicolae-Valentin Vlăduț
Sustainability 2026, 18(2), 855; https://doi.org/10.3390/su18020855 - 14 Jan 2026
Viewed by 31
Abstract
Improving fuel and energy efficiency in agricultural tillage is critical for sustainable farming and reducing environmental impacts. In this study, the effects of forward speed and tillage depth on the fuel efficiency parameters of a tractor–chisel plough combination were investigated under controlled field [...] Read more.
Improving fuel and energy efficiency in agricultural tillage is critical for sustainable farming and reducing environmental impacts. In this study, the effects of forward speed and tillage depth on the fuel efficiency parameters of a tractor–chisel plough combination were investigated under controlled field conditions on clay soil. Specific fuel consumption (SFC), fuel consumption per unit area (FCPA), and overall energy efficiency (OEE) were evaluated at four forward speeds (0.6, 0.95, 1.2 and 1.4 m·s−1) and four tillage depths (15, 19.5, 23 and 26.5 cm). SFC ranged from 0.519 to 1.237 L·kW−1·h−1, while OEE varied between 7.918 and 18.854%. Higher forward speeds significantly reduced fuel consumption and improved energy efficiency, whereas deeper tillage increased fuel use and reduced efficiency. Optimal operation occurred at speeds of 1.2–1.4 m·s−1 and shallow to medium depths. Five machine learning algorithms: Polynomial Regression (PL), Random Forest Regressor (RFR), Gradient Boosting Regressor (GBR), Support Vector Regression (SVR), and Decision Tree Regressor (DTR), were applied to model fuel efficiency parameters. RFR achieved the highest accuracy for predicting SFC, while PL performed best for FCPA and OEE, with the mean absolute percentage error (MAPE) below 2%. Models such as PL and RFR excel in data structures dominated by nonlinear relationships. These results highlight the potential of machine learning to guide data-driven decisions for fuel and energy optimization in tillage, promoting more sustainable mechanization strategies and resource-efficient agricultural production. Full article
Show Figures

Figure 1

21 pages, 3302 KB  
Article
Evaluating Parameter Influences on Planted Concrete Properties via Explainable Machine Learning Models
by Xiansheng Duan, Ming Zhang and Runjuan Zhou
Appl. Sci. 2026, 16(2), 761; https://doi.org/10.3390/app16020761 - 12 Jan 2026
Viewed by 98
Abstract
To investigate the complex functional relationships between pH, effective porosity, and compressive strength of planted concrete and their corresponding mixing ratios, a comprehensive database was developed from the relevant published literature. In this study, four machine learning (ML) algorithms were employed: a single [...] Read more.
To investigate the complex functional relationships between pH, effective porosity, and compressive strength of planted concrete and their corresponding mixing ratios, a comprehensive database was developed from the relevant published literature. In this study, four machine learning (ML) algorithms were employed: a single algorithm—Multi-Layer Perceptron (MLP), and three ensemble algorithms—Gradient Boosting Regression (GBR), Extreme Gradient Boosting (XGBoost), and Random Forest Regression (RFR)—to predict the pH, effective porosity, and compressive strength of planted concrete. Additionally, the interpretable algorithm Shapley Additive Explanations (SHAP) was used to evaluate both global and local interpretations independent of the ML algorithms, providing insight into the decision-making process. The results demonstrate that the RFR algorithm achieved the highest R2 values of 0.93 (pH), 0.97 (effective porosity), and 0.94 (compressive strength) in predicting planted concrete properties, demonstrating optimal predictive performance. Furthermore, cement content was identified as the most influential factor affecting pH, while design porosity and maximum coarse aggregate size were the primary factors influencing effective porosity, in that order. For compressive strength, the two most critical factors were the water reducer and cement content. Full article
Show Figures

Figure 1

37 pages, 7023 KB  
Article
Data-Driven AI Approach for Optimizing Processes and Predicting Mechanical Properties of Boron Nitride Nanoplatelet-Reinforced PLA Nanocomposites
by Sundarasetty Harishbabu, Joy Djuansjah, P. S. Rama Sreekanth, A. Praveen Kumar, Borhen Louhichi, Santosh Kumar Sahu, It Ee Lee and Qamar Wali
Polymers 2026, 18(2), 185; https://doi.org/10.3390/polym18020185 - 9 Jan 2026
Viewed by 237
Abstract
This research explores the optimization of mechanical properties and predictive modeling of polylactic acid (PLA) reinforced with boron nitride nanoplatelets (BNNPs) using data-driven machine learning (ML) models. PLA-BNNP composites were fabricated through injection molding, with a focus on how key processing parameters influence [...] Read more.
This research explores the optimization of mechanical properties and predictive modeling of polylactic acid (PLA) reinforced with boron nitride nanoplatelets (BNNPs) using data-driven machine learning (ML) models. PLA-BNNP composites were fabricated through injection molding, with a focus on how key processing parameters influence their mechanical performance. A Taguchi L27 orthogonal array was applied to assess the effects of BNNP composition (0.02 wt.% and 0.04 wt.%), injection temperature (135–155 °C), injection speed (50–70 mm/s), and pressure (30–50 bar) on properties such as tensile strength, Young’s modulus, and hardness. The results indicated that a 0.04 wt.% BNNP loading improved tensile strength, Young’s modulus, and hardness by 18.6%, 32.7%, and 20.5%, respectively, compared to pure PLA. Taguchi analysis highlighted that higher BNNP concentrations, along with optimal injection temperatures, improved all mechanical properties, although excessive temperatures compromised tensile strength and modulus, while enhancing hardness. Analysis of variance (ANOVA) revealed that injection temperature was the dominant factor for tensile strength (68.88%) and Young’s modulus (86.39%), while BNNP composition played a more significant role in influencing hardness (78.83%). Predictive models were built using machine learning (ML) models such as Random Forest Regression (RFR), Gradient Boosting Regression (GBR), and Extreme Gradient Boosting (XGBoost). Among the ML models, XGBoost demonstrated the highest predictive accuracy, achieving R2 values above 98% for tensile strength, 92–93% for Young’s modulus, and 96% for hardness, with low error metrics i.e., Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE). These findings underscore the potential of using BNNP reinforcement and machine learning-driven property prediction to enhance PLA nanocomposites’ mechanical performance, making them viable for applications in lightweight packaging, biomedical implants, consumer electronics, and automotive components, offering sustainable alternatives to petroleum-based plastics. Full article
(This article belongs to the Special Issue Emerging Trends in Polymer Engineering: Polymer Connect-2024)
Show Figures

Figure 1

31 pages, 13729 KB  
Article
Stage-Wise SOH Prediction Using an Improved Random Forest Regression Algorithm
by Wei Xiao, Jun Jia, Wensheng Gao, Haibo Li, Hong Xu, Weidong Zhong and Ke He
Electronics 2026, 15(2), 287; https://doi.org/10.3390/electronics15020287 - 8 Jan 2026
Viewed by 120
Abstract
In complex energy storage operating scenarios, batteries seldom undergo complete charge–discharge cycles required for periodic capacity calibration. Methods based on accelerated aging experiments can indicate possible aging paths; however, due to uncertainties like changing operating conditions, environmental variations, and manufacturing inconsistencies, the degradation [...] Read more.
In complex energy storage operating scenarios, batteries seldom undergo complete charge–discharge cycles required for periodic capacity calibration. Methods based on accelerated aging experiments can indicate possible aging paths; however, due to uncertainties like changing operating conditions, environmental variations, and manufacturing inconsistencies, the degradation information obtained from such experiments may not be applicable to the entire lifecycle. To address this, we developed a stage-wise state-of-health (SOH) prediction approach that combined offline training with online updating. During the offline training phase, multiple single-cell experiments were conducted under various combinations of depth of discharge (DOD) and C-rate. Multi-dimensional health features (HFs) were extracted, and an accelerated aging probability pAA was defined. Based on the correlation statistics between HFs, kHF, the SOH, and pAA, all cells in the dataset were divided into general early, middle, and late aging stages. For each stage, cells were further classified by their longevity (long, medium, and short), and multiple models were trained offline for each category. The results show that models trained on cells following similar aging paths achieve significantly better performance than a model trained on all data combined. Meanwhile, HF optimization was performed via a three-step process: an initial screening based on expert knowledge, a second screening using Spearman correlation coefficients, and an automatic feature importance ranking using a random forest regression (RFR) model. The proposed method is innovative in the following ways: (1) The stage-wise multi-model strategy significantly improves the SOH prediction accuracy across the entire lifecycle, maintaining the mean absolute percentage error (MAPE) within 1%. (2) The improved model provides uncertainty quantification, issuing a warning signal at least 50 cycles before the onset of accelerated aging. (3) The analysis of feature importance from the model outputs allows the indirect identification of the primary aging mechanisms at different stages. (4) The model is robust against missing or low-quality HFs. If certain features cannot be obtained or are of poor quality, the prediction process does not fail. Full article
Show Figures

Figure 1

19 pages, 3298 KB  
Article
Detection of Cadmium Content in Pak Choi Using Hyperspectral Imaging Combined with Feature Selection Algorithms and Multivariate Regression Models
by Yongkuai Chen, Tao Wang, Shanshan Lin, Shuilan Liao and Songliang Wang
Appl. Sci. 2026, 16(2), 670; https://doi.org/10.3390/app16020670 - 8 Jan 2026
Viewed by 120
Abstract
Pak choi (Brassica chinensis L.) has a strong adsorption capacity for the heavy metal cadmium (Cd), which is a big threat to human health. Traditional detection methods have drawbacks such as destructiveness, time-consuming processes, and low efficiency. Therefore, this study aimed to [...] Read more.
Pak choi (Brassica chinensis L.) has a strong adsorption capacity for the heavy metal cadmium (Cd), which is a big threat to human health. Traditional detection methods have drawbacks such as destructiveness, time-consuming processes, and low efficiency. Therefore, this study aimed to construct a non-destructive prediction model for Cd content in pak choi leaves using hyperspectral technology combined with feature selection algorithms and multivariate regression models. Four different cadmium concentration treatments (0 (CK), 25, 50, and 100 mg/L) were established to monitor the apparent characteristics, chlorophyll content, cadmium content, chlorophyll fluorescence parameters, and spectral features of pak choi. Competitive adaptive reweighted sampling (CARS), the successive projections algorithm (SPA), and random frog (RF) were used for feature wavelength selection. Partial least squares regression (PLSR), random forest regression (RFR), the Elman neural network, and bidirectional long short-term memory (BiLSTM) models were established using both full spectra and feature wavelengths. The results showed that high-concentration Cd (100 mg/L) significantly inhibited pak choi growth, leaf Cd content was significantly higher than that in the control group, chlorophyll content decreased by 16.6%, and damage to the PSII reaction centre was aggravated. Among the models, the FD–RF–BiLSTM model demonstrated the best prediction performance, with a determination coefficient of the prediction set (Rp2) of 0.913 and a root mean square error of the prediction set (RMSEP) of 0.032. This study revealed the physiological, ecological, and spectral response characteristics of pak choi under Cd stress. It is feasible to detect leaf Cd content in pak choi using hyperspectral imaging technology, and non-destructive, high-precision detection was achieved by combining chemometric methods. This provides an efficient technical means for the rapid screening of Cd pollution in vegetables and holds important practical significance for ensuring the quality and safety of agricultural products. Full article
(This article belongs to the Section Agricultural Science and Technology)
Show Figures

Figure 1

28 pages, 6311 KB  
Article
Machine Learning-Assisted Optimisation of the Laser Beam Powder Bed Fusion (PBF-LB) Process Parameters of H13 Tool Steel Fabricated on a Preheated to 350 C Building Platform
by Katsiaryna Kosarava, Paweł Widomski, Michał Ziętala, Daniel Dobras, Marek Muzyk and Bartłomiej Adam Wysocki
Materials 2026, 19(1), 210; https://doi.org/10.3390/ma19010210 - 5 Jan 2026
Viewed by 394
Abstract
This study presents the first application of Machine Learning (ML) models to optimise Powder Bed Fusion using Laser Beam (PBF-LB) process parameters for H13 steel fabricated on a 350 °C preheated building platform. A total of 189 cylindrical specimens were produced for training [...] Read more.
This study presents the first application of Machine Learning (ML) models to optimise Powder Bed Fusion using Laser Beam (PBF-LB) process parameters for H13 steel fabricated on a 350 °C preheated building platform. A total of 189 cylindrical specimens were produced for training and testing machine learning (ML) models using variable process parameters: laser power (250–350 W), scanning speed (1050–1300 mm/s), and hatch spacing (65–90 μm). Eight ML models were investigated: 1. Support Vector Regression (SVR), 2. Kernel Ridge Regression (KRR), 3. Stochastic Gradient Descent Regressor, 4. Random Forest Regressor (RFR), 5. Extreme Gradient Boosting (XGBoost), 6. Extreme Gradient Boosting with limited depth (XGBoost LD), 7. Extra Trees Regressor (ETR) and 8. Light Gradient Boosting Machine (LightGBM). All models were trained using the Fast Library for Automated Machine Learning & Tuning (FLAML) framework to predict the relative density of the fabricated samples. Among these, the XGBoost model achieved the highest predictive accuracy, with a coefficient of determination R2=0.977, mean absolute percentage error MAPE = 0.002, and mean absolute error MAE = 0.017. Experimental validation was conducted on 27 newly fabricated samples using ML predicted process parameters. Relative densities exceeding 99.6% of the theoretical value (7.76 g/cm3) for all models except XGBoost LD and KRR. The lowest MAE = 0.004 and the smallest difference between the ML-predicted and PBF-LB validated density were obtained for samples made with LightGBM-predicted parameters. Those samples exhibited a hardness of 604 ± 13 HV0.5, which increased to approximately 630 HV0.5 after tempering at 550 °C. The LightGBM optimised parameters were further applied to fabricate a part of a forging die incorporating internal through-cooling channels, demonstrating the efficacy of machine learning-guided optimisation in achieving dense, defect-free H13 components suitable for industrial applications. Full article
(This article belongs to the Special Issue Multiscale Design and Optimisation for Metal Additive Manufacturing)
Show Figures

Graphical abstract

29 pages, 14822 KB  
Article
Estimation of Cotton Aboveground Biomass Based on UAV Multispectral Images: Multi-Feature Fusion and CNN Model
by Shuhan Huang, Xinjun Wang, Hanyu Cui, Qingfu Liang, Songrui Ning, Haoran Yang, Panfeng Wang and Jiandong Sheng
Agronomy 2026, 16(1), 74; https://doi.org/10.3390/agronomy16010074 - 26 Dec 2025
Viewed by 373
Abstract
Precise estimation of cotton aboveground biomass (AGB) plays a crucial role in effectively analyzing growth variations and development of cotton, as well as guiding agricultural management practices. Multispectral (MS) sensors mounted on UAVs offer a practical and accurate approach for estimating the AGB [...] Read more.
Precise estimation of cotton aboveground biomass (AGB) plays a crucial role in effectively analyzing growth variations and development of cotton, as well as guiding agricultural management practices. Multispectral (MS) sensors mounted on UAVs offer a practical and accurate approach for estimating the AGB of cotton. Many previous studies have mainly emphasized the combination of spectral and texture features, as well as canopy height (CH). However, current research overlooks the potential of integrating spectral, textural features, and CH to estimate AGB. In addition, the accumulation of AGB often exhibits synergistic effects rather than a simple additive relationship. Conventional algorithms, including Bayesian Ridge Regression (BRR) and Random Forest Regression (RFR), often fail to accurately capture the nonlinear and intricate correlations between biomass and its relevant variables. Therefore, this research develops a method to estimate cotton AGB by integrating multiple feature information with a deep learning model. Spectral and texture features were derived from MS images. Cotton CH extracted from UAV point cloud data. Variables of multiple features were selected using Spearman’s Correlation (SC) coefficients and the variance inflation factor (VIF). Convolutional neural network (CNN) was chosen to build a model for estimating cotton AGB and contrasted with traditional machine learning models (RFR and BRR). The results indicated that (1) combining spectral, textural features, and CH yielded the highest precision in cotton AGB estimation; (2) compared to traditional ML models (RFR and BRR), the accuracy of applying CNN for estimating cotton AGB is better. CNN has more advanced power to learn complex nonlinear relationships among cotton AGB and multiple features; (3) the most effective strategy in this study involves combining spectral, texture features, and CH, selecting variables using the SC and VIF methods, and employing CNN for estimating AGB of cotton. The R2 of this model is 0.80, with an RMSE of 0.17 kg·m−2 and an MAE of 0.11 kg·m−2. This study develops a framework for evaluating cotton AGB by multiple features fusion with a deep learning model. It provides technical support for monitoring crop growth and improving field management. Full article
Show Figures

Figure 1

23 pages, 3135 KB  
Article
Coupling Approach of Crystal Plasticity and Machine Learning in Predicting Forming Limit Diagram of AA7075-T6 at Various Temperatures and Strain Rates
by Hyuk Jong Bong, Seonghwan Choi and Kyung Mun Min
Metals 2026, 16(1), 21; https://doi.org/10.3390/met16010021 - 25 Dec 2025
Viewed by 234
Abstract
This study proposes a data-driven framework for predicting forming limit diagrams (FLDs) of AA7075-T6 aluminum sheets under various temperatures and strain rates. To overcome the limitations of costly and time-consuming experiments, a hybrid dataset combining experimental results and virtual data from rate-dependent crystal [...] Read more.
This study proposes a data-driven framework for predicting forming limit diagrams (FLDs) of AA7075-T6 aluminum sheets under various temperatures and strain rates. To overcome the limitations of costly and time-consuming experiments, a hybrid dataset combining experimental results and virtual data from rate-dependent crystal plasticity finite element (CPFE) simulations coupled with the Marciniak–Kuczyński (M–K) model was developed. Several machine learning (ML) models—including linear regression (LR), random forest regression (RFR), support vector regression (SVR), Gaussian process regression (GPR), and multilayer perceptron (MLP)—were trained to predict FLDs. The nonlinear dependence of the FLD on temperature and strain rate was accurately captured by the ML models, with nonlinear algorithms demonstrating notably improved predictive performance. The proposed approach offers an efficient, accurate, and cost-effective method for FLD prediction and supports data-driven process design in lightweight alloy forming. Full article
(This article belongs to the Section Crystallography and Applications of Metallic Materials)
Show Figures

Figure 1

27 pages, 4169 KB  
Article
Optimizing Mortar Mix Design for Concrete Roofing Tiles Using Machine Learning and Particle Packing Theory: A Case Study
by Jorge Fernando Sosa Gallardo, Vivian Felix López Batista, Aldo Fernando Sosa Gallardo, María N. Moreno-García and Maria Dolores Muñoz Vicente
Appl. Sci. 2026, 16(1), 236; https://doi.org/10.3390/app16010236 - 25 Dec 2025
Viewed by 257
Abstract
The increasing demand for sustainable construction materials has motivated the optimization of mortar mix designs to reduce cement consumption and its environmental impact while maintaining adequate mechanical performance. This study develops a machine learning (ML) model for optimizing mortar mixtures used in concrete [...] Read more.
The increasing demand for sustainable construction materials has motivated the optimization of mortar mix designs to reduce cement consumption and its environmental impact while maintaining adequate mechanical performance. This study develops a machine learning (ML) model for optimizing mortar mixtures used in concrete roofing tiles by integrating aggregate particle packing techniques with non-linear regression algorithms, using an industry-grade dataset generated in the Central Laboratory of Wienerberger Ltd. Unlike most previous studies, which mainly focus on compressive strength, this research targets the transverse strength of industrial roof tile mortar. The proposed approach combines Tarantula Curve gradation limits, experimentally derived packing density (η), and ML regression within a unified and application-oriented workflow, representing a research direction rarely explored in the literature for optimizing concrete mix transverse strength. Fine concrete aggregates were characterized through a sand sieve analysis and subsequently adjusted according to the Tarantula Curve method to optimize packing density and minimize void content. Physical properties of cements and fine aggregates were assessed, and granulometric mixtures were evaluated using computational methods to calculate fineness modulus summation (FMS) and packing density. Mortar samples were tested for transverse strength at 1, 7, and 28 days using a three-point bending test, generating a robust dataset for modeling training. Three ML models—Random Forest Regressor (RFR), XG-Boost Regressor (XGBR), and Support Vector Regressor (SVR)—were evaluated, confirming their ability to capture nonlinear relationships between mix parameters and transverse strength. The analysis of input variables, which consistently ranked as the highest contributors according to impurity-based and permutation-based importance metrics, revealed that the duration of curing, density, and the summation of the fineness modulus significantly influenced the estimated transverse strength derived from the models. The integration of particle size distribution optimization and ML demonstrates a viable pathway for reducing cement content, lowering costs, and achieving sustainable mortar mix designs in the tile manufacturing industry. Full article
(This article belongs to the Topic Software Engineering and Applications)
Show Figures

Figure 1

13 pages, 503 KB  
Article
Rapid Evaluation of Wet Gluten Content in Wheat Using Hyperspectral Technology Combined with Machine Learning Algorithms
by Yan Lai, Yan-Yan Li, Min Sha, Peng Li and Zheng-Yong Zhang
Foods 2026, 15(1), 41; https://doi.org/10.3390/foods15010041 - 23 Dec 2025
Viewed by 327
Abstract
The development of rapid and intelligent methods is urgently needed for wheat quality evaluation. Using the prediction of wet gluten content as a case study, this work systematically investigated the performance of various machine learning algorithms and their optimization for content prediction, based [...] Read more.
The development of rapid and intelligent methods is urgently needed for wheat quality evaluation. Using the prediction of wet gluten content as a case study, this work systematically investigated the performance of various machine learning algorithms and their optimization for content prediction, based on hyperspectral data from the visible and near-infrared ranges of wheat grains and flour. The results revealed that the random forest regression (RFR) algorithm delivered the best predictive performance under two conditions: first, when applied directly to visible spectra; and second, when applied to fused visible and near-infrared spectral data. This held true for both grains and flour. Conversely, its direct application to NIR spectra alone yielded relatively worse performance. Following data optimization, the first-derivative (FD) visible spectra of wheat grains were smoothed using a Savitzky–Golay (SG) filter and subsequently used as input for the RFR model. This optimized approach achieved a coefficient of determination (r2) of 0.8579, a root mean square error (RMSE) of 0.0216, and a relative percent deviation (RPD) of 2.6978. Under the same conditions, for wheat flour, the corresponding values were 0.8383, 0.0231, and 2.5293, respectively. Similarly, for wheat flour, the RFR model was applied to the SG-filtered FD spectra derived from the fused visible and near-infrared data, yielding an r2 of 0.8474, an RMSE of 0.0224, and an RPD of 2.6034. Under the same conditions, wheat grains yielded an r2 of 0.8494, an RMSE of 0.0223, and an RPD of 2.6208. This efficient and rapid intelligent prediction scheme demonstrates considerable potential for the quality assessment and control of relevant food products. Full article
(This article belongs to the Section Food Analytical Methods)
Show Figures

Figure 1

21 pages, 1502 KB  
Article
Failure Analysis and Machine Learning-Based Prediction in Urban Drinking Water Systems
by Salih Yılmaz
Appl. Sci. 2025, 15(24), 12887; https://doi.org/10.3390/app152412887 - 5 Dec 2025
Viewed by 791
Abstract
This work illustrates a machine learning methodology to forecast pipe failure frequencies in drinking water systems to enhance asset management and operational planning. Three supervised regression models—Random Forest Regressor (RFR), Extreme Gradient Boosting (XGB), and Multi-Layer Perceptron (MLP)—were developed and evaluated using historical [...] Read more.
This work illustrates a machine learning methodology to forecast pipe failure frequencies in drinking water systems to enhance asset management and operational planning. Three supervised regression models—Random Forest Regressor (RFR), Extreme Gradient Boosting (XGB), and Multi-Layer Perceptron (MLP)—were developed and evaluated using historical failure data from Malatya, Türkiye. The primary predictive variables identified were pipe diameter, pipe type, pipe age, and seasonal average ambient air temperature. The MLP demonstrated superior performance compared to the other models, attaining the lowest RMSE (1.48) and the highest R2 (0.993) with respect to the training data, effectively capturing the nonlinear characteristics and failure patterns. The MLP was validated using two datasets from 24 District Metered Areas (DMAs) in Sakarya and Kayseri, Türkiye. The model’s anticipated failure frequencies exhibited strong concordance with the observed failure frequencies, even in regions of elevated failure density, indicating the model’s proficiency in identifying high-risk locations and facilitating the prioritization of maintenance activities. The work demonstrates the potential of machine learning in water infrastructure management. It emphasizes the importance of employing a hybrid method with Geographic Information Systems (GISs) in future research to enhance forecast accuracy and spatial analysis. Full article
(This article belongs to the Section Civil Engineering)
Show Figures

Figure 1

20 pages, 3082 KB  
Article
Predicting Structural Traits and Chemical Composition of Urochloa decumbens Using Aerial Imagery and Machine Learning
by Iuly Francisca Rodrigues de Souza, Aureana Matos Lisboa, Igor Lima Bretas, Domingos Sárvio Magalhães Valente, Francisco de Assis de Carvalho Pinto, Filipe Bueno Pena de Carvalho, Lara Gabriely Silva Moura, Priscila Dornelas Valote and Fernanda Helena Martins Chizzotti
AgriEngineering 2025, 7(12), 406; https://doi.org/10.3390/agriengineering7120406 - 2 Dec 2025
Viewed by 402
Abstract
Precision agriculture, including sensors and artificial intelligence, is transforming agricultural monitoring. This study developed predictive models for fresh and dry forage mass, canopy height, forage density, dry matter (%DM), and crude protein (%CP) concentrations in signalgrass [Urochloa decumbens (Stapf) R.D. Webster] pastures [...] Read more.
Precision agriculture, including sensors and artificial intelligence, is transforming agricultural monitoring. This study developed predictive models for fresh and dry forage mass, canopy height, forage density, dry matter (%DM), and crude protein (%CP) concentrations in signalgrass [Urochloa decumbens (Stapf) R.D. Webster] pastures using machine learning and UAV-based multispectral imagery. The experiment was conducted at the Federal University of Viçosa (2019–2020), applying nitrogen doses after each harvest to promote variability. Multiple Linear Regression (MLR), Support Vector Regressor (SVR), and Random Forest Regressor (RFR) models were trained with multispectral and meteorological data. The best results were obtained for fresh forage mass with RFR (R2 = 0.82, RMSE = 2894.10 kg ha−1), dry forage mass with SVR (R2 = 0.68, RMSE = 719.87 kg ha−1), and dry matter concentration with MLR (R2 = 0.64, RMSE = 3.83%). Forage density showed moderate performance (R2 = 0.56), while canopy height demonstrated limited accuracy (R2 = 0.44). Crude protein was not adequately predicted by any model, highlighting multispectral sensor limitations and suggesting hyperspectral sensors usage. Results demonstrate the applicability of remote sensing combined with machine learning in forage management, but indicate the need to expand temporal and spatial data variability and integrate different sensor types to increase model robustness. Full article
Show Figures

Figure 1

14 pages, 1017 KB  
Article
Bio-Methanization of Sheep Manure and Beet Waste in the Meknes–Fès Region, Morocco: Effects of Pretreatment and Machine Learning Applications for Biochemical Methane Potential Prediction
by Meryem Rouegui, Hind Bellabair, Abdelghani El Asli, Amine Amar, Wilfried Zoerner, Fouad Rachidi and Rachid Lghoul
Recycling 2025, 10(6), 213; https://doi.org/10.3390/recycling10060213 - 25 Nov 2025
Viewed by 882
Abstract
Sheep manure and beet waste (the uneatable leaf part of the beet) are promising feedstock for biogas production due to their abundance and organic richness. However, their high lignocellulosic content reduces anaerobic digestibility and controls methane yield. This study investigates the effect of [...] Read more.
Sheep manure and beet waste (the uneatable leaf part of the beet) are promising feedstock for biogas production due to their abundance and organic richness. However, their high lignocellulosic content reduces anaerobic digestibility and controls methane yield. This study investigates the effect of various pretreatment strategies, namely physical, thermal, and combined physical–thermal methods, on the Biochemical Methane Potential (BMP) of sheep manure and beet waste. Batch anaerobic digestion experiments were conducted under mesophilic conditions, with BMP values recorded for each treatment. The highest BMP for sheep manure, 125 Nml CH4/g VS, was achieved using combined physical and thermal pretreatment. This approach enhanced methane production by 16%, 25%, and 60% compared to physical pretreatment (PP) alone, thermal pretreatment (TP) alone, and no pretreatment, respectively, while the one BMP for beet waste is 80 Nml CH4/g VS and obtained with thermal pretreatment. To predict BMP outcomes, three machine learning approaches are applied, namely Linear Regression (LM), Random Forest Regression (RFR), and Gradient Boosting Machine (GBM), using digestion time (N days), total solids (Ts), volatile solids (Vs), pretreatment type, and biomass type. The variance analysis confirmed that the interaction between pretreatment and biomass type significantly improved model performance. While diagnostic checks revealed non-linear patterns limiting the linear model, ensemble methods achieved stronger results. The RFR model explained 79.5% of the variance with a Root Mean Square Error (RMSE) of about 15.7, whereas the GBM model achieved the lowest RMSE of 5.05. GBM captures complex non-linear interactions. In addition, variable importance analyses identified digestion time, solid content, and pretreatment as the most influential factors for methane yield, with the combined chemical and physical pretreatment producing the highest biogas outputs. These findings underscore the potential of advanced machine learning models, particularly GBM (Gradient Boosting Machine), for optimizing anaerobic digestion strategies and maximizing biogas recovery from sheep manure and beet waste. Full article
Show Figures

Figure 1

30 pages, 3563 KB  
Article
Gross Calorific Value Estimation in Coal Using Multi-Model FTIR and Machine Learning Approach
by Arya Vinod, Anup Krishna Prasad, Sameeksha Mishra, Bitan Purkait, Shailayee Mukherjee, Anubhav Shukla, Bhabesh Chandra Sarkar and Atul Kumar Varma
Appl. Sci. 2025, 15(22), 12209; https://doi.org/10.3390/app152212209 - 18 Nov 2025
Viewed by 790
Abstract
The Gross Calorific Value (GCV) is a key indicator used to assess the energy potential and quality of coal. Conventional oxygen bomb calorimetry, though widely used, is inherently time-consuming due to the combustion process involved. Similarly, regression models for GCV prediction based on [...] Read more.
The Gross Calorific Value (GCV) is a key indicator used to assess the energy potential and quality of coal. Conventional oxygen bomb calorimetry, though widely used, is inherently time-consuming due to the combustion process involved. Similarly, regression models for GCV prediction based on ultimate or proximate analyses require extensive laboratory procedures and sample preparation. To address these challenges, this study investigates the use of mid-infrared Fourier Transform Infrared (FTIR) spectroscopy coupled with supervised variable selection to enable rapid, non-destructive, and cost-effective assessment of coal properties. In this work, a detailed mid-infrared FTIR spectral analysis of coal was conducted to identify fifty-six selective absorption bands (supervised input variables) sensitive to the organic functional group content in coal, coupled with several machine learning (ML) techniques to model the GCV of coal samples from the Johilla coal basin, India. The ML techniques employed here are piecewise linear regression (PLR), partial least squares regression (PLSR), support vector regression (SVR), random forest regression (RFR), artificial neural networks (ANN), and extreme gradient boosting regression (XGB). A multi-model estimation of GCV using the simple average output of the three models (PLSR, RFR, and XGB) achieved the best predictive performance (R2 = 0.951, RMSE = 19.050%, MBE = 1.420%, MAE = 4.053 cal/g), reflecting strong consistency between predictions and actual measurements. The FTIR-based approach achieves competitive or improved results relative to conventional methods and models documented in prior studies. The GCVs derived through modeling of FTIR data are also statistically proven (using t-test and F-test at alpha = 0.01) to be significantly similar to those of the bomb calorimeter, an industry standard for GCV measurements. Consequently, this novel FTIR-based methodology establishes an efficient, dependable tool for GCV determination that operates independently of conventional techniques, thereby enabling rapid quality assessment critical for industrial applications. Full article
Show Figures

Figure 1

Back to TopTop