MDPI - Publisher of Open Access Journals

30 pages, 22156 KB

Open AccessArticle

Daily-Scale Meteorological Normalization of Surface Solar Radiation in Varying Pollution Levels: A Statistical Case Study in Beijing (2015–2019)

by Tong Wu, Zhigang Li and Xueying Zhou

Remote Sens. 2026, 18(9), 1368; https://doi.org/10.3390/rs18091368 - 29 Apr 2026

Abstract

Surface solar radiation at the ground is affected by aerosols, clouds, and atmospheric moisture, as well as by circulation-related conditions that influence cloud formation and pollutant transport. In daily observations, these influences are mixed, which makes pollution-related variability difficult to interpret. We analyzed [...] Read more.

Surface solar radiation at the ground is affected by aerosols, clouds, and atmospheric moisture, as well as by circulation-related conditions that influence cloud formation and pollutant transport. In daily observations, these influences are mixed, which makes pollution-related variability difficult to interpret. We analyzed data from Beijing station 54511 (2015–2019), including daily integrated radiation components and collocated meteorological and pollution variables. We used wavelet coherence, pollution-stratified association analysis, and gray relational analysis, and compared two meteorological normalization methods: multiple linear regression (MLR) and random forest (RF). The results show that meteorological–radiation relationships vary systematically across pollution levels, indicating substantial meteorological confounding in daily radiation analyses. Among the radiation components, DR shows the clearest pollution-dependent shift in its relationship with RH, while several direct components become less sensitive to cloud cover under heavier pollution. RF reproduced daily radiation components with strong predictive performance (R² = 0.83–0.88), and the meteorologically adjusted anomalies from RF were consistent with those from MLR (r = 0.63–0.78 across components). These findings suggest that both MLR and RF can be effectively used to normalize meteorological effects in daily station records. The analysis supports routine interpretation of day-to-day surface radiation variability and can be extended to multi-site studies and finer temporal resolution. Full article

(This article belongs to the Special Issue Advanced AI Technology for Remote Sensing Analysis (Second Edition))

► Show Figures

Figure 1

28 pages, 3490 KB

Open AccessArticle

A Multi-Output Deep Learning Framework for Simultaneous Forecasting of PM₁₀ and Air Quality Index in High-Altitude Basins: A Case Study of Igdir, Türkiye

by Hakan Çelikten

Sustainability 2026, 18(8), 3883; https://doi.org/10.3390/su18083883 - 14 Apr 2026

Viewed by 337

Abstract

Air pollution forecasting is particularly challenging in basins with frequent winter seasons and temperature inversions. In this study, we developed and rigorously evaluated deep learning models to forecast PM₁₀ and the Air Quality Index (AQI) in Igdır, Türkiye, using a five-year, hourly [...] Read more.

Air pollution forecasting is particularly challenging in basins with frequent winter seasons and temperature inversions. In this study, we developed and rigorously evaluated deep learning models to forecast PM₁₀ and the Air Quality Index (AQI) in Igdır, Türkiye, using a five-year, hourly dataset (2020–2024) from the Igdır/Central station (PM₁₀, NO₂, O₃, SO₂; meteorology: pressure, temperature, wind speed, relative humidity, precipitation, cloud cover). Using linear interpolation and Z-score normalization, sine/cosine features (hour, month) were used to encode temporal periodicity, and a 72-h lookback → 24-h look-ahead design was employed. LSTM, GRU, BiLSTM, and CNN-LSTM models were compared under a three-stage ablation (meteorology only; +cyclic encoders; +lagged targets), and their hyperparameters were tuned via Bayesian optimization. The deep learning results were further contextualized against a Multiple Linear Regression (MLR) baseline serving as a snapshot persistence model to evaluate the specific advantage of LSTM’s temporal memory in short-horizon forecasting. Multi-output forecasting is central to the proposed design, featuring a multi-task learning (MTL) framework based on a single shared temporal encoder with two task-specific regression heads that simultaneously predict PM₁₀ and AQI. Compared with separate single-task models, the multi-output setup exploits cross-target covariance (AQI’s dependence on pollutant loads under meteorology), improves data efficiency and generalization through shared representations, and promotes coherent, horizon-stable forecasts across targets, which is particularly valuable when winter stagnation regimes couple PM₁₀ and AQI dynamics. Moreover, this study introduces a structured ablation design to explicitly evaluate the added value of multi-output forecasting under inversion-dominated basin conditions. The results show stepwise gains from cyclic encoders and, most strongly, from lagged target histories. Under the optimized 24-h setting, LSTM performs best (R²_{PM₁₀} = 0.7989, RMSE = 48.74 µg/m³; R²_{AQI} = 0.6626, RMSE = 37.81), marginally surpassing GRU and clearly outperforming BiLSTM and CNN-LSTM. Horizon sensitivity confirms the benefit of nowcasting: when retrained for shorter horizons, LSTM attains R² = 0.9991 for PM₁₀ (MAE = 2.44; RMSE = 3.30 µg/m³) and 0.9535 for AQI (MAE = 4.87; RMSE = 14.03) at 1 h, and R² = 0.9792 (PM₁₀; MAE = 9.70; RMSE = 15.67) and 0.8849 (AQI; MAE = 11.19; RMSE = 22.08) at 6 h. Residual diagnostics reveal heteroskedastic, regime-dependent errors peaking near 0 °C and low winds, as well as a conservative bias that underpredicts extremes. Collectively, the findings show that multi-output, temporally aware deep models enable accurate operational forecasting in Igdır. The proposed framework provides real-time air quality alerts and daily planning, providing decision support for sustainable air quality management, public health protection, and evidence-based urban policy and is transferable to similar continental basin environments. Full article

(This article belongs to the Section Air, Climate Change and Sustainability)

► Show Figures

Figure 1

12 pages, 417 KB

Open AccessReview

Source Apportionment Methods for Soil Heavy Metals: Principles and Optimal Scenarios

by Linhua Sun, Weihua Peng, Xianghong Liu and Kai Chen

Processes 2026, 14(7), 1143; https://doi.org/10.3390/pr14071143 - 2 Apr 2026

Viewed by 411

Abstract

Accurate source apportionment of soil heavy metals (HMs) is critical for targeted pollution mitigation and ecological remediation. This review systematically synthesizes and compares five mainstream source apportionment approaches—receptor models (positive matrix factorization, PMF; absolute principal component score-multiple linear regression, APCS-MLR; UNMIX model), stable [...] Read more.

Accurate source apportionment of soil heavy metals (HMs) is critical for targeted pollution mitigation and ecological remediation. This review systematically synthesizes and compares five mainstream source apportionment approaches—receptor models (positive matrix factorization, PMF; absolute principal component score-multiple linear regression, APCS-MLR; UNMIX model), stable isotope tracing, and random forest (RF)-based machine learning—to provide researchers with a comprehensive methodological framework. The methodology includes a systematic literature review, comparative analysis of methodological principles, and synthesis of representative case studies from diverse geographical contexts. The core principles, evolutionary paths, typical use cases (e.g., industrial zones, agricultural fields, regional surveys), and inherent limitations are synthesized for each method. A practical decision framework linking research contexts (study objectives, spatial scales, data availability) to optimal method selection, along with guidelines for multi-method integration, is proposed. This review provides actionable guidance for researchers and practitioners in selecting appropriate methods for specific pollution scenarios, ultimately supporting more effective environmental management and policy development. Full article

(This article belongs to the Section Environmental and Green Processes)

► Show Figures

Figure 1

25 pages, 2400 KB

Open AccessArticle

Machine Learning-Based Production Dynamics Prediction for Chemical Composite Cold Production

by Wenyang Shi, Rongxin Huang, Jie Gao, Hao Ma, Tiantian Zhang, Jiazheng Qin, Lei Tao, Jiajia Bai, Zhengxiao Xu and Qingjie Zhu

Processes 2026, 14(7), 1050; https://doi.org/10.3390/pr14071050 - 25 Mar 2026

Viewed by 373

Abstract

Accurate prediction of production dynamics in chemical composite cold production (CCCP) for heavy oil reservoirs remains challenging due to complex multi-phase fluid interactions and nonlinear flow regime transitions. Traditional numerical simulations are computationally expensive and rely heavily on detailed geological characterization. To address [...] Read more.

Accurate prediction of production dynamics in chemical composite cold production (CCCP) for heavy oil reservoirs remains challenging due to complex multi-phase fluid interactions and nonlinear flow regime transitions. Traditional numerical simulations are computationally expensive and rely heavily on detailed geological characterization. To address these limitations, a data-driven predictive framework integrating physical mechanisms with machine learning is proposed. A dual-driven feature selection strategy combining Spearman rank correlation and the Entropy Weight Method (EWM) was applied to quantify nonlinear parameter correlations and data informativeness, identifying injection-production balance and development and maximum adsorption capacity as dominant factors controlling oil production fluctuations. Latin Hypercube Sampling (LHS) was used to construct a representative parameter space, followed by weighted standardization. A Multiple Linear Regression (MLR) model was then trained to jointly predict key production indicators. Field validation shows strong predictive capability, with a coefficient of determination above 0.94 and relative fitting error below 5%. The method reduces computational time by over two orders of magnitude while maintaining high precision. Full article

(This article belongs to the Section Chemical Processes and Systems)

► Show Figures

Figure 1

18 pages, 9838 KB

Open AccessArticle

Unlocking Roadside Carbon Sequestration Potential: Machine Learning Estimation of AGB in Highway Vegetation Belts Using GF-2 High-Resolution Imagery

by Weiwei Jiang, Heng Tu and Qin Wang

Sensors 2026, 26(5), 1729; https://doi.org/10.3390/s26051729 - 9 Mar 2026

Viewed by 365

Abstract

Aboveground biomass (AGB) is a key indicator of vegetation productivity and terrestrial carbon stocks; therefore, robust AGB estimation is critical for assessing ecosystem services and carbon cycle research. Previous studies have largely focused on forest and cropland ecosystems. In contrast, roadside vegetation along [...] Read more.

Aboveground biomass (AGB) is a key indicator of vegetation productivity and terrestrial carbon stocks; therefore, robust AGB estimation is critical for assessing ecosystem services and carbon cycle research. Previous studies have largely focused on forest and cropland ecosystems. In contrast, roadside vegetation along highways and other linear transport corridors remains comparatively underexplored despite its potentially important role as a carbon sink. Here, we integrate field-measured AGB samples with GF-2 high-resolution satellite imagery to evaluate the suitability of multiple remote-sensing predictors and machine-learning algorithms for estimating AGB in highway roadside vegetation. Six remote-sensing variables were used as predictors, including four vegetation indices (Normalized Difference Vegetation Index (NDVI), Perpendicular Vegetation Index (PVI), Enhanced Vegetation Index (EVI), and Modified Soil-Adjusted Vegetation Index (MSAVI) and two-band ratios (B342 and B12/34). Five regression models—multiple linear regression (MLR), partial least squares regression (PLSR), random forest (RF), support vector regression (SVR), and extreme gradient boosting (XGBoost)—were developed and systematically compared under both single-variable and multi-variable scenarios. Model performance was evaluated using five-fold cross-validation, with the coefficient of determination (R²) and the root mean square error (RMSE) as metrics of evaluation. The results indicate that the RF model under the multi-variable scenario achieved the best overall performance, with a training R² of 0.83 and a testing RMSE of 0.84 kg·m⁻², substantially outperforming the other linear and non-linear models. The optimal RF model was further applied to GF-2 imagery to produce a spatially explicit AGB map for a 32 km highway segment and a 30 m roadside buffer on both sides, yielding an estimated total aboveground biomass of 566.97 t for the corridor. These findings demonstrate that combining high-resolution remote sensing with machine-learning approaches can effectively improve AGB estimation for linear roadside vegetation systems, providing technical support for ecological monitoring, roadside greening management, and carbon accounting for transport infrastructure. Full article

(This article belongs to the Section Remote Sensors)

► Show Figures

Figure 1

28 pages, 2032 KB

Open AccessArticle

Addressing Class Imbalance in Fetal Health Classification: Rigorous Benchmarking of Multi-Class Resampling Methods on Cardiotocography Data

by Zainab Subhi Mahmood Hawrami, Mehmet Ali Cengiz and Emre Dünder

Diagnostics 2026, 16(3), 485; https://doi.org/10.3390/diagnostics16030485 - 5 Feb 2026

Viewed by 880

Abstract

Background/Objectives: Fetal health is essential in prenatal care, influencing both maternal and fetal outcomes. Cardiotocography (CTG) monitors uterine contractions and fetal heart rate, yet manual interpretation exhibits significant inter-examiner variability. Machine learning offers automated alternatives; however, class imbalance in CTG datasets where [...] Read more.

Background/Objectives: Fetal health is essential in prenatal care, influencing both maternal and fetal outcomes. Cardiotocography (CTG) monitors uterine contractions and fetal heart rate, yet manual interpretation exhibits significant inter-examiner variability. Machine learning offers automated alternatives; however, class imbalance in CTG datasets where pathological cases constitute less than 10% leads to poor detection of minority classes. This study aims to provide the first systematic benchmark comparing five resampling strategies across seven classifier families for multi-class CTG classification, evaluated using imbalance-aware metrics rather than overall accuracy alone. Methods: Seven machine learning models were employed: Naïve Bayes (NB), Random Forest (RF), Linear Discriminant Analysis (LDA), k-Nearest Neighbors (KNN), Linear Support Vector Machine (SVM), Multinomial Logistic Regression (MLR), and Multi-Layer Perceptron (MLP). To address class imbalance, we evaluated the original unbalanced dataset (base) and five resampling methods: SMOTE, BSMOTE, ADASYN, NearMiss, and SCUT. Performance was evaluated on a held-out test set using Balanced Accuracy (BACC), Macro-F1, the Macro-Matthews Correlation Coefficient (Macro-MCC), and Macro-Averaged ROC-AUC. We also report per-class ROC curves. Results: Among all models, RF proved most reliable. Training on the original distribution (base) yielded the highest BACC (0.9118), whereas RF combined with BSMOTE provided the strongest class-balanced performance (Macro-MCC = 0.8533, Macro-F1 = 0.9073) with a near-perfect ROC-AUC (approximately 0.986–0.989). Overall, resampling effects proved model dependent. While some classifiers achieved optimal performance on the natural class distribution, oversampling techniques, particularly SMOTE and BSMOTE, demonstrated significant improvements in minority class discrimination and class-balanced metrics across multiple model families. Notably, certain models benefited substantially from resampling, exhibiting enhanced Macro-F1, BACC, and minority class recall without sacrificing overall accuracy. Conclusions: These findings establish robust, model-agnostic baselines for CTG-based fetal health screening. They highlight that strategic oversampling can translate improved minority class discrimination into clinically meaningful performance gains, supporting deployment in cost-sensitive and threshold-aware clinical settings. Full article

(This article belongs to the Special Issue Artificial Intelligence in Biomedical Diagnostics and Analysis 2025)

► Show Figures

Figure 1

17 pages, 2118 KB

Open AccessArticle

Influencing Factors of Pine Wood Milling Force Based on Principal Component Analysis and Multiple Linear Regression

by Bo Shen, Dietrich Buck, Ziyi Yuan and Zhaolong Zhu

Materials 2026, 19(2), 439; https://doi.org/10.3390/ma19020439 - 22 Jan 2026

Cited by 1 | Viewed by 365

Abstract

Milling force is a parameter affecting wood processing quality, tool life, and energy consumption, and its variation is influenced by the multi-factor coupling of cutting parameters and tool geometric factors. This study systematically investigates milling forces during the processing of pine wood ( [...] Read more.

Milling force is a parameter affecting wood processing quality, tool life, and energy consumption, and its variation is influenced by the multi-factor coupling of cutting parameters and tool geometric factors. This study systematically investigates milling forces during the processing of pine wood (Pinus sylvestris var. mongholica Litv.) using a hybrid modeling approach combining principal component analysis (PCA) and multiple linear regression (MLR). Firstly, PCA was employed to reduce the dimensionality of the tool rake angle (γ), helix angle (λ), cutting depth (h), feed per tooth (U_z), and triaxial milling forces (F_x, F_y, F_z); this eliminated the multicollinearity among variables and extracted the integrated features. Subsequently, an MLR model was constructed using the principal components as independent variables to quantitatively evaluate the contribution of each factor to milling forces. The results support the conclusion that PCA successfully extracted the first four principal components (cumulative variance contribution rate: 92.78%), with PC1 (49.16%) characterizing the comprehensive milling force effect and PC2 (15.03%) primarily reflecting the characteristics of the tool geometric parameters. The established MLR model demonstrated a high significance (R²: F_x = 0.915, F_y = 0.907, F_z = 0.852). The cutting depth exerted a significant positive driving effect on the triaxial milling forces via PC1 (each 1 mm increase in depth increased the PC1 score by 0.64 units, resulting in increases of 27.2%, 26.6%, and 21.8% for F_x, F_y, and F_z, respectively). The helix angle significantly suppressed F_y through PC2 (β = −0.090, p < 0.001), whereas the rake angle exhibited a weak negative effect on F_x via PC3 (β = −0.015). Parameter optimization identified the combination γ = 25°, λ = 30°, h = 0.5 mm, and U_z = 0.1 mm∙z⁻¹ as optimal, which reduced the triaxial milling forces by 62.3% compared to the experimental maximum. This study provides a theoretical foundation and novel parameter optimization strategy for the efficient, low-damage processing of wood materials. Full article

(This article belongs to the Section Materials Simulation and Design)

► Show Figures

Figure 1

25 pages, 9139 KB

Open AccessArticle

Meteorological and Air Quality Effects on Bioaerosol Detection Using WIBS-NEO and IBAC-2 in Dublin City

by Emma Markey, Jerry Hourihane Clancy, Moisés Martínez-Bracero, José María Maya-Manzano, Raúl Pecero-Casimiro, Eoin Joseph McGillicuddy, Gavin Sewell, Roland Sarda-Estève, Andrés M. Vélez-Pereira and David J. O’Connor

Atmosphere 2026, 17(1), 86; https://doi.org/10.3390/atmos17010086 - 15 Jan 2026

Cited by 1 | Viewed by 798

Abstract

This study evaluates the performance of two real-time fluorescence-based bioaerosol sensors, the WIBS-NEO and IBAC-2, operating in urban Dublin, Ireland, and assesses the influence of different meteorological and pollution parameters on their outputs. This was done by comparing particle sensor data to meteorological [...] Read more.

This study evaluates the performance of two real-time fluorescence-based bioaerosol sensors, the WIBS-NEO and IBAC-2, operating in urban Dublin, Ireland, and assesses the influence of different meteorological and pollution parameters on their outputs. This was done by comparing particle sensor data to meteorological variables and air quality metrics. Over the 41-day campaign, Urticaceae pollen and Cladosporium spores were the dominant bioaerosols recorded, comprising 78% and 66% of total pollen and fungal spore concentrations, respectively. Correlation analyses revealed several significant variables: fluorescent BC-type particles (>8 μm) detected by WIBS-NEO strongly correlated with pollen concentrations (r = 0.84 after excluding high-wind days). For fungal spores, PM₁₀ and grass minimum temperature were the most significant parameters related to variability. Anthropogenic pollutants, particularly NO_X and combustion-related aerosols, were found to correlate with fluorescence signals, especially for smaller particles (<2 μm), underscoring urban detection challenges. Wind trajectory analysis identified the likely source of Urticaceae pollen as northerly green spaces (e.g., Phoenix Park), while Cladosporium spores showed multidirectional transport. Multiple linear regression (MLR) analysis achieved strong correlation (

R^{2}

= 0.82 for pollen, 0.78 for fungal spores), highlighting the value of incorporating multiple environmental variables to investigate the complex relationships between urban environmental conditions and bioaerosol sensor outputs. Both instruments exhibited operational limitations under the study conditions. The WIBS-NEO outperformed the IBAC-2 in biological discrimination due to its multi-channel single particle fluorescence capabilities. However, operational limitations emerged during higher wind speeds, comparable to moderate breezes (>16.6 km/h), which affected sampling comparability when compared with traditional methods. This study investigates how meteorological conditions and air quality influence bioaerosol detection in an urban environment. The use of MLR techniques to examine the complex relationships between environmental variables and fluorescent sensor outputs may help inform future bioaerosol modelling efforts. Full article

(This article belongs to the Section Aerosols)

► Show Figures

Figure 1

32 pages, 1950 KB

Open AccessArticle

Association of Circulating Irisin with Insulin Resistance and Metabolic Risk Markers in Prediabetic and Newly Diagnosed Type 2 Diabetes Patients

by Daniela Denisa Mitroi Sakizlian, Lidia Boldeanu, Diana Clenciu, Adina Mitrea, Ionela Mihaela Vladu, Alina Elena Ciobanu Plasiciuc, Mohamed-Zakaria Assani and Daniela Ciobanu

Int. J. Mol. Sci. 2026, 27(2), 787; https://doi.org/10.3390/ijms27020787 - 13 Jan 2026

Viewed by 447

Abstract

Circulating irisin, a myokine implicated in energy expenditure and adipose tissue regulation, has been increasingly studied as a potential biomarker of metabolic dysfunction. This study evaluated the relationship between serum irisin and metabolic indices, including the atherogenic index of plasma (AIP), the lipid [...] Read more.

Circulating irisin, a myokine implicated in energy expenditure and adipose tissue regulation, has been increasingly studied as a potential biomarker of metabolic dysfunction. This study evaluated the relationship between serum irisin and metabolic indices, including the atherogenic index of plasma (AIP), the lipid accumulation product (LAP), and hypertriglyceridemic-waist (HTGW) phenotype in individuals with prediabetes (PreDM) and newly diagnosed type 2 diabetes mellitus (T2DM). A total of 138 participants (48 PreDM, 90 T2DM) were assessed for anthropometric, glycemic, and lipid parameters. Serum irisin levels were measured by enzyme-linked immunosorbent assay (ELISA) and correlated with insulin resistance indices (Homeostatic Model Assessment of Insulin Resistance (HOMA-IR), Quantitative Insulin Sensitivity Check Index (QUICKI)), glycemic control (glycosylated hemoglobin A1c (HbA1c)), and composite lipid markers (total triglycerides-to-high-density lipoprotein cholesterol (TG/HDL-C)). Group differences were evaluated using non-parametric tests; two-way ANOVA assessed interactions between phenotypes and markers; multiple linear regression (MLR) and logistic regression models explored independent associations with metabolic indices and HTGW; receiver operating characteristic (ROC) analyses compared global and stratified model performance. Serum irisin was significantly lower in T2DM than in PreDM (median 140.4 vs. 230.7 ng/mL, p < 0.0001). Irisin levels remained comparable between males and females in both groups. Post hoc analysis shows that lipid indices and irisin primarily distinguish HTGW phenotypes, especially in T2DM. In both groups, irisin correlated inversely with HOMA-IR, AIP, and TG/HDL-C, and positively with QUICKI, indicating a possible compensatory role in early insulin resistance. MLR analyses revealed no independent relationship between irisin and either AIP or LAP in PreDM, while in T2DM, waist circumference remained the strongest negative predictor of irisin. Logistic regression identified age, male sex, and HbA1c as independent predictors of the HTGW phenotype, while irisin contributed modestly to overall model discrimination. ROC curves demonstrated good discriminative performance (AUC = 0.806 for global; 0.794 for PreDM; 0.813 for T2DM), suggesting comparable predictive accuracy across glycemic stages. In conclusion, irisin levels decline from prediabetes to overt diabetes and are inversely linked to lipid accumulation and insulin resistance but do not independently predict the HTGW phenotype. These findings support irisin’s role as an integrative indicator of metabolic stress rather than a stand-alone biomarker. Incorporating irisin into multi-parameter metabolic panels may enhance early detection of cardiometabolic risk in dysglycemic populations. Full article

(This article belongs to the Special Issue Molecular Diagnosis and Treatments of Diabetes Mellitus: 2nd Edition)

► Show Figures

Figure 1

30 pages, 3322 KB

Open AccessArticle

Insights into the Feature-Selection Mechanisms for Modeling the Shear Capacity of Stud Connectors in Concrete: A Machine Learning Approach

by Sadi Ibrahim Haruna, Abdulwarith Ibrahim Bibi Farouk, Yasser E. Ibrahim, Mahmoud T. Nawar, Suleiman Abdulrahman and Mustapha Abdulhadi

J. Compos. Sci. 2026, 10(1), 34; https://doi.org/10.3390/jcs10010034 - 8 Jan 2026

Cited by 2 | Viewed by 460

Abstract

Shear connections between concrete structural elements play a vital role in defining performance and overall stability. However, limitations in traditional methods for predicting the shear capacity (Vu) of stud connectors in concrete have been highlighted. Developing strategies that precisely describe the performance of [...] Read more.

Shear connections between concrete structural elements play a vital role in defining performance and overall stability. However, limitations in traditional methods for predicting the shear capacity (Vu) of stud connectors in concrete have been highlighted. Developing strategies that precisely describe the performance of stud-headed connectors requires insight into their failure mechanisms and the corresponding shear transmission. Therefore, leveraging advancements in machine learning, this study aims to predict the Vu of the headed stud connector in concrete structures using various input parameters. A database (1121) of the shear strength collected from the literature was trained using six machine learning (ML) algorithms: extreme learning machine (ELM), decision tree (DT), artificial neural network (ANN), multi-linear regression (MLR), support vector machine (SVM), and hybrid ANN–particle swarm optimization (ANN-PSO). Feature selection methods and system identification were applied to explore the optimal or most relevant input parameters. The feature selection techniques indicated that the geometric properties of the stud connector (diameter and cross-sectional area), the concrete modulus of elasticity (Ec), and the height of the weld collar (h_w) are the most relevant input variables. The ANN-PSO model outperformed the other classical models in estimating the shear capacity at two modeling stages. The hybrid ANN-PSO achieved R² = 0.976, MAE = 7.61 kN, RMSE = 10.8 kN, and MAPE = 8.04%, demonstrating the best predictive accuracy among the classical models. On the other hand, DT is the second-best model, with an R² of 0.958, MAE of 10.27 kN, RMSE of 14.43 kN, and MAPE of 8.53 kN for forecasting the shear capacity of stud connectors in concrete. Full article

(This article belongs to the Section Composites Modelling and Characterization)

► Show Figures

Figure 1

20 pages, 6530 KB

Open AccessArticle

Monthly Temperature Prediction in the Han River Basin, South Korea, Using Long Short-Term Memory (LSTM) and Multiple Linear Regression (MLR) Models

by Chul-Gyum Kim, Jeongwoo Lee, Jeong-Eun Lee and Hyeonjun Kim

Water 2026, 18(1), 98; https://doi.org/10.3390/w18010098 - 31 Dec 2025

Viewed by 544

Abstract

This study compares and evaluates the performance of a statistical model, Multiple Linear Regression (MLR), and a deep learning model, Long Short-Term Memory (LSTM), for predicting monthly mean temperature in the Han River Basin, South Korea. Predictor variables were dynamically selected based on [...] Read more.

This study compares and evaluates the performance of a statistical model, Multiple Linear Regression (MLR), and a deep learning model, Long Short-Term Memory (LSTM), for predicting monthly mean temperature in the Han River Basin, South Korea. Predictor variables were dynamically selected based on lagged correlation analysis between climate indices and temperature over the past 40 years, identifying the top ten variables with the highest correlations for lag times ranging from 1 to 18 months. The MLR model was developed through stepwise regression with cross-validation, while the LSTM model was constructed using an 18-month input sequence to capture temporal dependencies in the data. Model performance was evaluated using percent bias (PBIAS), Nash–Sutcliffe efficiency (NSE), Pearson’s correlation coefficient (r), and tercile-based probability metrics. Both models reproduced the seasonal variability of monthly temperature with high accuracy (NSE > 0.97, r > 0.98). The LSTM model showed slightly higher predictive skill in several periods but also exhibited larger prediction variance, reflecting the sensitivity of nonlinear architectures to variations in predictor–response relationships. In contrast, the MLR model demonstrated more stable predictive behavior with narrower uncertainty bounds, particularly under low signal-to-noise conditions, owing to its structural simplicity. These findings indicate that the two approaches are complementary; the LSTM model better captures nonlinear temporal dynamics, while the MLR model provides interpretability and robustness. Future work will explore advanced hybrid architectures such as CNN–LSTM and Transformer-based models, as well as multi-model ensemble methods, to further enhance the accuracy and reliability of medium-range temperature prediction. Full article

(This article belongs to the Section Hydrology)

► Show Figures

Figure 1

20 pages, 2490 KB

Open AccessArticle

Modeling Moso Bamboo Tree Density and Aboveground Biomass Using Multi-Site UAV-LiDAR Data

by Xinyao Liu, Guiying Li, Longwei Li and Dengsheng Lu

Remote Sens. 2026, 18(1), 115; https://doi.org/10.3390/rs18010115 - 28 Dec 2025

Cited by 2 | Viewed by 706

Abstract

Moso bamboo, widely distributed in subtropical regions of China, plays an important role in forest management and carbon cycle research. However, accurate estimation of tree density and aboveground biomass (AGB) remains challenging due to the unique characteristics of Moso bamboo forests in their [...] Read more.

Moso bamboo, widely distributed in subtropical regions of China, plays an important role in forest management and carbon cycle research. However, accurate estimation of tree density and aboveground biomass (AGB) remains challenging due to the unique characteristics of Moso bamboo forests in their growth and stand structure. This research aims to develop a new procedure for bamboo tree density and AGB estimation based on UAV-LiDAR and sample plots from multiple sites through comparative analysis of the incorporation of two groups of variables—regular point cloud metrics (e.g., height, point density) and layered texture metrics—and three modeling methods—multiple linear regression (MLR), mixed-effects modeling (MEM), and hierarchical Bayesian modeling (HBM). The results showed that incorporating layered texture metrics with regular variables substantially improved the estimation accuracy of both tree density and AGB. Among these models, HBM achieved the highest predictive performance, yielding coefficient of determination (R²) values of 0.54 for tree density and 0.59 for AGB, with corresponding relative root mean square errors (rRMSE) of 21.46% and 17.97%. This study presents a novel and effective method for estimating Moso bamboo tree density and AGB using multi-site UAV-LiDAR and sample plots, offering a scientific basis for precise management and carbon stock assessment. Full article

(This article belongs to the Special Issue Applications of Artificial Intelligence Methods to Agroforestry Remote Sensing)

► Show Figures

Figure 1

26 pages, 13352 KB

Open AccessArticle

Robust Rainfall Gap-Filling in Coastal Arid Regions Using Ensemble Fusion Models

by Badar Al-Jahwari, Ghazi Al-Rawas, Mohammad Reza Nikoo, Talal Etri and Jens Grundmann

Hydrology 2026, 13(1), 1; https://doi.org/10.3390/hydrology13010001 - 20 Dec 2025

Viewed by 962

Abstract

In arid regions, the challenges posed by rainfall data availability, missing data, and limited historical records significantly affect hydrological modeling studies and climate change assessments. For various hydrology applications, it is essential to implement advanced techniques in order to obtain a complete dataset [...] Read more.

In arid regions, the challenges posed by rainfall data availability, missing data, and limited historical records significantly affect hydrological modeling studies and climate change assessments. For various hydrology applications, it is essential to implement advanced techniques in order to obtain a complete dataset series. This study explores the implementation of multiple machine learning techniques to address the complexity of filling daily rainfall data for 88 rainfall stations in the Al-Batinah region of Oman, covering the period from 1993 to 2024. The machine learning models applied in this study include Multiple Linear Regression (MLR), Random Forest (RF), K-Nearest Neighbors (KNN), Support Vector Regression (SVR), and Gradient-Boosting Trees (GBT). A non-clustering approach is used as well as a clustering approach as part of the methodology. In the first method, rainfall stations are not clustered, while in the second method, optimal cluster numbers are calculated using K-means clustering. The target station utilizes the nearby rainfall station data located within a 50 km radius with the highest correlation coefficients. A novel Ensemble Fusion Model has been applied to improve the efficacy of multiple predictive models, including the RF Fusion Model (RF) and Multi-Model Super Ensemble Fusion Model (MMSE). The estimation approaches are further enhanced and evaluated by Bayesian optimization of hyperparameters, dataset imputation utilizing Multiple Imputation by Chained Equations (MICE), and Leave-One-Year-Out (LOYO) cross-validation. Based on the results, it can be concluded that the GBT model performs the best in both cluster and non-cluster approaches. A further benefit of applying Ensemble Fusion Models to rainfall gap-filling methods is that the coefficient of determination (R²) for clustering and non-clustering approaches increases to 22.5% and 22.2%, respectively. Full article

(This article belongs to the Section Hydrological and Hydrodynamic Processes and Modelling)

► Show Figures

Figure 1

28 pages, 4461 KB

Open AccessArticle

Predicting Sea-Level Extremes and Wetland Change in the Maroochy River Floodplain Using Remote Sensing and Deep Learning Approach

by Nawin Raj, Niharika Singh, Nathan Downs and Lila Singh-Peterson

Remote Sens. 2025, 17(17), 2988; https://doi.org/10.3390/rs17172988 - 28 Aug 2025

Viewed by 1461

Abstract

Wetlands are an important part of coastal ecosystems but are under increasing pressure from climate change-induced sea-level rise and flooding, in addition to development pressures associated with increasing human populations. The change in tidal events and their intensity due to sea-level rise is [...] Read more.

Wetlands are an important part of coastal ecosystems but are under increasing pressure from climate change-induced sea-level rise and flooding, in addition to development pressures associated with increasing human populations. The change in tidal events and their intensity due to sea-level rise is also reshaping and challenging the vitality of existing wetland systems, requiring more intensive localized studies to identify future-focused restoration and conservation strategies. To support this endeavor, this study utilizes tide gauge datasets from the Australian Bureau of Meteorology (BOM) for maximum sea-level (Hmax) prediction and Landsat Collection surface reflectance datasets obtained from the United States Geological Survey (USGS) database to detect and project patterns of change in the Maroochy River floodplain of Queensland, Australia. This study developed an efficient hybrid deep learning model combining a Convolutional Neural Network and Bidirectional Long Short-Term Memory (CNNBiLSTM) architecture for the prediction of maximum sea-level and tidal events. The proposed model significantly outperformed three benchmark models (Multiple Linear Regression (MLR), Support Vector Regression (SVR), and CatBoost) in achieving a high correlation coefficient (r = 0.9748) for maximum sea-level prediction. To further address the increasing frequency and intensity of tidal events linked to sea-level rise, a CNNBiLSTM classification model was also developed, achieving 96.72% accuracy in predicting extreme tidal occurrences. This study identified a significant positive linear increase in sea-level rise of 0.016 m/year between 2014 and 2024. Wetland change detection using Landsat imagery along the Maroochy River floodplain also identified a substantial vegetation loss of 395.64 hectares from 2009 to 2023. These findings highlight the strong potential of integrating deep learning and remote sensing for improved prediction and assessment of sea-level extremes and coastal ecosystem changes. The study outcomes provide valuable insights for informing not only conservation and restoration activities but also for providing localized projections of future change necessary for the progression of effective climate adaptation and mitigation strategies. Full article

(This article belongs to the Special Issue Ecological Environment Remote Sensing and Sustainable Development Evaluation in Coastal Zones)

► Show Figures

Figure 1

17 pages, 4999 KB

Open AccessArticle

Simulating the Phylogenetic Diversity Metrics of Plant Communities in Alpine Grasslands of Xizang, China

by Mingxue Xiang, Tao Ma, Wei Sun, Shaowei Li and Gang Fu

Diversity 2025, 17(8), 569; https://doi.org/10.3390/d17080569 - 14 Aug 2025

Viewed by 1271

Abstract

Phylogenetic diversity serves as a critical complement to traditional species diversity metrics. However, the performance variations among different computational models in simulating phylogenetic diversity within plant communities in the alpine grasslands of the Qinghai-Xizang Plateau remain insufficiently characterized. Here, we evaluated nine modeling [...] Read more.

Phylogenetic diversity serves as a critical complement to traditional species diversity metrics. However, the performance variations among different computational models in simulating phylogenetic diversity within plant communities in the alpine grasslands of the Qinghai-Xizang Plateau remain insufficiently characterized. Here, we evaluated nine modeling approaches—random forest (RF), generalized boosting regression (GBR), multiple linear regression (MLR), artificial neural network (ANN), generalized linear regression (GLR), conditional inference tree (CIT), extreme gradient boosting (eXGB), support vector machine (SVM), and recursive regression tree (RRT)—for predicting three key phylogenetic diversity metrics [Faith’s phylogenetic diversity (PD), mean pairwise distance (MPD), mean nearest taxon distance (MNTD)] using climate variables and NDVImax. Our comprehensive analysis revealed distinct model performance patterns under grazing vs. fencing regimes. The eXGB algorithm demonstrated superior accuracy for fencing conditions, achieving the lowest relative bias (−0.08%) and RMSE (9.54) for MPD, along with optimal performance for MNTD (bias = 2.95%, RMSE = 44.86). Conversely, RF emerged as the most robust model for grazing scenarios, delivering the lowest bias (−1.63%) and RMSE (16.89) for MPD while maintaining strong predictive capability for MNTD (bias = −1.09%, RMSE = 27.59). Notably, scatterplot analysis revealed that only RF, GBR, and eXGB maintained symmetrical distributions along the 1:1 line, while other models showed problematic one-to-many value mappings or asymmetric patterns. These findings show that machine learning (especially RF and eXGB) enhances phylogenetic diversity predictions by integrating climate and NDVI data, though model performance varies by metric and management context. This study offers a framework for ecological forecasting, emphasizing multi-metric validation in biodiversity modeling. Full article

(This article belongs to the Topic Effective Strategies for Rangeland Conservation and Sustainable Management)

► Show Figures

Figure 1

Search Results (159)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (159)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI