High-Performance Concrete Strength Regression Based on Machine Learning with Feature Contribution Visualization

Lei Zhen; Chang Qu; Man-Lai Tang; Junping Yin

doi:10.3390/math13243965

,

and

¹

Academy for Advanced Interdisciplinary Studies, Northeast Normal University, Changchun 130024, China

²

Department of Mathematics, Shantou University, Shantou 515063, China

³

Department of Physics, Astronomy and Mathematics, University of Hertfordshire, Hatfield AL10 9AB, UK

⁴

Institute of Applied Physics and Computational Mathematics, Chinese Academy of Sciences, Haidian District, Beijing 100094, China

Mathematics2025, 13(24), 3965;https://doi.org/10.3390/math13243965

This article belongs to the Special Issue Advanced Computational Mechanics

Version Notes

Order Reprints

Abstract

Concrete compressive strength is a fundamental indicator of the mechanical properties of High-Performance Concrete (HPC) with multiple components. Traditionally, it is measured through laboratory tests, which are time-consuming and resource-intensive. Therefore, this study develops a machine learning-based regression framework to predict compressive strength, aiming to reduce experimental costs and resource usage. Under three different data preprocessing strategies—raw data, standard score, and Box–Cox transformation—a selected set of high-performance ensemble models demonstrates excellent predictive capacity, with both the coefficient of determination (

R^{2}

) and explained variance score (EVS) exceeding 90% across all datasets, indicating high accuracy in compressive strength prediction. In particular, stacking ensemble (

R^{2}

-

0.920

,

EVS

-

0.920

), XGBoost regression (

R^{2}

-

0.920

,

EVS

-

0.920

), and HistGradientBoosting regression (

R^{2}

-

0.913

,

EVS

-

0.914

) based on Box–Cox transformation data show strong generalization capability and stability. Additionally, tree-based and boosting methods demonstrate high effectiveness in capturing complex feature interactions. Furthermore, this study presents an analytical workflow that enhances feature interpretability through visualization techniques—including Partial Dependence Plots (PDP), Individual Conditional Expectation (ICE), and SHapley Additive exPlanations (SHAP). These methods clarify the contribution of each feature and quantify the direction and magnitude of its impact on predictions. Overall, this approach supports automated concrete quality control, optimized mixture proportioning, and more sustainable construction practices.

Keywords:

High-Performance Concrete; concrete compressive strength; regression model; SHAP

High-Performance Concrete Strength Regression Based on Machine Learning with Feature Contribution Visualization

Abstract

Article Metrics

Citations

Article Access Statistics