Next Article in Journal
From Strong Fluoride Binding to Reversible Electrodesorption: S, N-Regulated La-MOF-Derived Carbon Electrodes for Capacitive Deionization Defluoridation
Previous Article in Journal
Automatic Segmentation and Recognition of the Microstructure of High-Strength Low-Alloy Steel
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Prediction of the Compressive Strength of Tailings-Based Cement Material Using Machine Learning Models with Experimental Validation

1
School of Civil and Environmental Engineering, Changsha University of Science & Technology, Changsha 410114, China
2
School of Civil Engineering, Guilin University of Technology, Guilin 541004, China
3
Shenzhen Raw Water Co., Ltd., Shenzhen 518000, China
4
School of Hydraulic and Ocean Engineering, Changsha University of Science & Technology, Changsha 410114, China
*
Author to whom correspondence should be addressed.
Materials 2026, 19(12), 2557; https://doi.org/10.3390/ma19122557 (registering DOI)
Submission received: 23 April 2026 / Revised: 24 May 2026 / Accepted: 25 May 2026 / Published: 12 June 2026
(This article belongs to the Section Construction and Building Materials)

Abstract

Partially replacing cement with mine tailings offers a sustainable strategy for solid waste resource utilization. As a cement admixture, the compressive strength of tailings-based cement materials serves as a critical performance indicator. Machine learning (ML) techniques offer high efficiency, cost-effectiveness, and superior predictive accuracy. However, variations in the chemical composition of tailings often introduce uncertainties into model predictions. Consequently, this study developed an integrated approach incorporating chemical composition and activation methods as input parameters. Four optimized ML models were deployed to predict the compressive strength of tailings-based cementitious materials. Multiple metrics were employed to evaluate model performance, which identified the PSO-XGBoost model as the superior predictive architecture. SHAP analysis revealed that mechanical grinding, NaOH concentration, and the proportions of gypsum and tailings were the primary features influencing compressive strength. Experimental validation yielded a low prediction error of 8.7%, confirming the model’s high predictive accuracy. This research establishes a robust framework for predicting the strength of tailings-based cementitious materials, providing a theoretical foundation for solid waste upcycling.

1. Introduction

Mine tailings constitute a major category of industrial solid waste. Intensive mineral extraction leads to the continuous accumulation of these wastes, posing substantial economic and environmental challenges. Large-scale storage not only impedes the recovery of valuable components but also causes environmental degradation, including land encroachment, soil degradation, and heavy metal contamination [1,2,3]. Consequently, optimizing the recycling and valorization of tailings has emerged as a critical research priority.
Given the chemical similarities between tailings and cement, recent research has focused on the partial replacement of cement with properly activated tailings as supplementary cementitious materials (SCMs) for concrete production [4,5,6,7]. This strategy offers a scalable, high-value waste management solution while significantly reducing the carbon footprint of cement manufacturing. Previous studies have demonstrated that incorporating activated tailings inherently influences the performance of cement-based materials. In particular, compressive strength serves as the primary metric for assessing these tailings-based cementitious materials [8,9,10,11], typically evaluated using specimens after 28 days of standard curing. This key property is dictated by the activation method employed, with common techniques including mechanical grinding, alkaline activation, and sulfate activation. However, variations in the chemical composition and dosage of tailings often lead to discrepancies among reported strength values. For instance, Geng [12] reported that the activity of mechanically activated tungsten tailings depends on the replacement ratio, peaking at 20%. Meanwhile, Li [13] emphasized chemical composition as the decisive factor controlling binder performance. Specifically through the optimization of the raw materials’ calcium-to-silicon (Ca/Si) ratio. Notably, the activity of tailings is governed by multiple coupled factors rather than a single variable. For example, Saedi et al. [14] successfully replaced a portion of cement by applying a mechanochemical activation method, demonstrating that tailings activity is simultaneously affected by dosage and the specific activation mode. Although these studies elucidate the independent effects of individual factors, the mechanisms underlying their synergistic interactions remain poorly understood. Consequently, a systematic investigation into these coupling effects is essential to establish a robust theoretical framework for solid waste upcycling.
Recent advances in artificial intelligence have established machine learning (ML) as a powerful tool for predicting the properties of tailings-based cementitious materials [15,16,17], with algorithms such as BPNN, RF, SVM, and XGBoost being widely adopted due to their predictive accuracy and robustness. To further enhance prediction accuracy and generalization, metaheuristic optimization algorithms have been integrated to automate hyperparameter tuning, thereby preventing premature convergence to local optima and enhancing model stability [18,19]. For instance, Alyami et al. [20] hybridized the firefly algorithm (FFA) with SVM to elevate strength predictions for copper tailings concrete, while Sobuz et al. [21] utilized grid search to optimize XGBoost, achieving high-precision predictions for nanomaterial concrete composites. Similarly, Wang et al. [22] and Miao et al. [23] applied Particle Swarm Optimization (PSO) and Bayesian optimization, respectively, to refine BPNN and XGBoost architectures, thereby improving the prediction of concrete compressive strength. While the application of ML frameworks to predict tailings-based material performance is rapidly evolving, hybrid machine learning models systematically achieve superior predictive accuracy compared to standalone baseline models [24,25,26,27].
Although machine learning has been widely implemented to predict the properties of tailings-based concrete, knowledge gaps persist regarding the fundamental behavior of tailings within cement-based systems. To bridge these gaps, this study integrates multidimensional variables—including chemical composition, mechanical grinding, alkali/sulfate activation, and tailings dosage—to develop an optimized ML framework. This model quantifies the synergistic effects of these coupled factors on the compressive strength of the resulting tailings-based cementitious materials. Furthermore, Shapley Additive Explanations (SHAP) analysis is deployed to identify the critical features dictating strength development, the individual relationships of which are further elucidated through partial dependence plots (PDPs). Finally, experimental validation using tungsten mine tailings is conducted to corroborate the predictive accuracy of the optimized framework, the findings of which provide valuable insights and practical guidance for the sustainable upcycling of industrial solid waste.

2. Research Methods

2.1. Machine Learning Algorithms

2.1.1. The Principle of Limit Gradient Boosting (Xgboost)

XGBoost [28] demonstrates superior performance in both regression and classification tasks. The algorithm accelerates optimization by employing a second-order Taylor expansion of the loss function and incorporating a regularization term that penalizes model complexity, thereby mitigating overfitting. The objective function is defined as shown in Equation (1):
o b j ( θ ) = i = 1 n L ( y i , y i ^ ) + j = 1 m Ω ( f j )
where n is the total number of samples, m is the number of decision trees, L ( y i , y i ^ ) denotes the loss function measuring the discrepancy between the experimental value y i and the predicted value y i ^ , and Ω ( f j ) represents the regularization term used to penalize the complexity of the j-th tree to prevent overfitting.

2.1.2. Random Forest (Rf)

Random Forest (RF) [29] is an ensemble learning algorithm built on the architecture of decision trees (DTs). An RF model aggregates an ensemble of individual decision trees to improve predictive stability. Each tree is trained on a bootstrap sample of the dataset and generates independent predictions. The final output is determined through a collective voting or averaging mechanism. The predictive performance of the RF model is primarily governed by hyperparameters such as the number of trees and the maximum depth of each tree.

2.1.3. Support Vector Regression (Svr)

Support Vector Regression (SVR) [30] aims to find a function f ( x ) such that the prediction error for all samples lies within an -insensitive tube. Slack variables ξ i and ξ i * are introduced to allow a certain degree of error. The optimization problem of SVR is shown in Equation (2):
min 1 2 w 2 + C i = 1 n ( ξ i + ξ i )
In order to deal with the nonlinear relationship, SVR introduces the kernel function K and maps the data to the high-dimensional space. The final regression function is shown in Equation (3):
f ( x ) = i = 1 n ( a i a i ) K ( x i , x ) + b
where α i and α i * are Lagrange multipliers.

2.1.4. Back Propagation Neural Network (Bpnn)

The Back Propagation Neural Network (BPNN) [31] is a fundamental multilayer feedforward architecture. Model training consists of two primary phases. First, input signals undergo forward propagation through successive layers of weights, biases, and activation functions to generate the predicted output. Then, the error between the predicted output and the true label is calculated, and the backpropagation algorithm is employed to iteratively update network parameters in reverse order, thereby minimizing the loss function and optimizing model performance. The mathematical framework for parameter updates is given by Equation (4):
y i = j = 1 n ( w i j x j ) + b i
where y i is the output of the neuron, x j is the input from the preceding layer, w i j is the weight, and b i is the corresponding bias term.

2.2. Particle Swarm Optimization (PSO)

Particle Swarm Optimization (PSO) [32] is a metaheuristic algorithm inspired by the social foraging behavior of bird flocks. Within this framework, each candidate solution is conceptualized as a “particle” characterized by position and velocity vectors in the multidimensional search space. During the iterative process, particles dynamically adjust their trajectories by tracking both their individual best position and the swarm’s global best position, thereby facilitating progressive convergence toward the global optimum. Owing to its algorithmic simplicity, rapid convergence, and robust global search capabilities, PSO is widely used for hyperparameter optimization in machine learning. The velocity and position update mechanisms are mathematically expressed in Equations (5) and (6):
v i t = w v i t 1 + c 1 r 1 ( p b e s t i t x i t ) + c 2 r 2 ( g b e s t t x i t )
x i t + 1 = x i t + v i t
v i t and x i t signify the velocity and position of the i-th particle at iteration t, respectively. The parameters c 1 and c 2 represent the cognitive and social acceleration coefficients, while w denotes the inertia weight. Furthermore, p b e s t i t refers to the individual best position of particle i, and g b e s t t indicates the global best position attained by the entire swarm.

2.3. Shap Principle

SHAP (Shapley Additive Explanations) [33] is a model-agnostic interpretability framework rooted in the Shapley values of cooperative game theory. It is designed to quantify the individual contribution of each feature to specific model predictions. Its core principle is to treat the model’s prediction as the payoff of a “cooperative game,” where the input features serve as the participating players. By calculating the marginal contribution of a feature across all potential subsets and applying a rigorous weighted averaging method, the predictive influence is fairly allocated among all features. The mathematical formulation of the Shapley value is given in Equation (7):
y i = y ¯ + f ( x i 1 ) + f ( x i 2 ) + + f ( x i k )
where x i k is the feature value, y ¯ is the mean predicted value across the training set, and y i is the specific model output.

2.4. Experimental Verification

To validate the engineering applicability and generalization capability of the proposed framework, the optimized model was applied to a dataset of tungsten tailings from a mine in China. First, critical features were identified via SHAP analysis, which provided the basis for a targeted experimental activation program. This program was used to fabricate cement mortar specimens containing activated tailings, followed by 28-day compressive strength testing. Finally, the trained model was used to predict the experimental outcomes, facilitating a systematic evaluation of its performance in practical engineering scenarios.
The main chemical composition of the tungsten tailings is summarized in Table 1. Specimen preparation followed the standard “Test method of cement mortar strength (ISO)” (GB/T 17671-2021) [34]. The mix proportions were designed with a cement-to-sand ratio of 1:3 and a water-to-binder (w/b) ratio of 0.5, with activated tungsten tailings replacing 30% of the cement by mass. After 28 days of standard curing, the compressive strength of the specimens was measured.

2.5. Implementation Process

The technical workflow of this study is illustrated in Figure 1. First, a comprehensive dataset was constructed by integrating compressive strength records from various types of tailings-based cement materials. Prior to model training, the data were normalized to eliminate dimensional discrepancies. To optimize model performance, the PSO algorithm was employed to tune the hyperparameters of the ML models. On this basis, the SHAP method was used to attribute features and identify key influencing factors. In addition, partial dependence plots (PDPs) were used to elucidate the nonlinear relationships and underlying mechanisms between these features and compressive strength. Finally, to validate the model’s generalization performance, an experimental activation program for mine tailings was conducted, guided by the insights from the feature analysis. The model’s predictions were then compared with measured strength values to systematically assess its practical performance.

3. Creation and Analysis of Data Sets

3.1. Dataset Creation

In this study, a total of 283 data points were compiled from peer-reviewed literature (Table S1). The dataset encompasses experimental results for various activated mine tailings used as supplementary cementitious materials (SCMs). The target (output) variable is the 28-day compressive strength of the tailings-based cement materials. The input features include the chemical composition of the tailings, the activation methods, and the tailings replacement levels.
Mechanical activation is quantified by the specific surface area (SSA) of the tailings. Chemical activation strategies primarily involve alkaline activation and sulfate activation. NaOH, Na2SO4, and Na2SiO3 are used as activators for alkali excitation, whereas gypsum serves as the primary activator for sulfate excitation. Incorporating the primary chemical constituents enables the model to account for the mineralogical background of the tailings, thereby enhancing predictive accuracy.

3.2. Data Set Processing and Analysis

In this study, the collected dataset for tailings-based cement materials was meticulously curated and processed. Descriptive statistics—including the minimum, maximum, mean, and median values of the independent and dependent variables—are summarized in Table 2. Additionally, frequency distribution histograms for each variable were constructed to visualize data dispersion, as shown in Figure 2. Figure 2 demonstrates that the dataset has adequate size and balanced coverage for model training.
In this study, Pearson and Spearman correlation analyses were used to examine the linear and monotonic relationships within the dataset. Figure 3 shows the correlation coefficient matrices for the input features and between the input features and the output. In the heatmap, red and blue indicate positive and negative correlations, respectively. Coefficients range from −1 to 1, where an absolute value closer to 1 indicates a stronger linear or monotonic dependence. The relationships between the characteristic variables and the compressive strength of tailings concrete were comprehensively analyzed using both Spearman and Pearson correlation analyses. A significant negative correlation was observed between tailings content and compressive strength ( R p = −0.59, R S = −0.54).
This trend is primarily attributed to the “dilution effect,” whereby higher tailings replacement levels reduce the relative cement content, thereby limiting the extent of the hydration reaction. The specific surface area ( R p = 0.36, R s = 0.24) and gypsum content ( R p = 0.11, R s = 0.23) were positively correlated with strength. This positive correlation occurs because mechanical grinding and chemical activation enhance the micro-filling effect and stimulate the secondary (pozzolanic) hydration reaction, thereby improving concrete strength. Furthermore, all correlation coefficients among the input features were below 0.8, indicating the absence of strong multicollinearity. This absence of strong linear redundancy confirms the suitability of the dataset for developing nonlinear machine learning models.

4. Results and Discussion

4.1. Analysis of Prediction Results

To validate the predictive accuracy of the proposed PSO-XGBoost model, its performance was benchmarked against four alternative models: XGBoost, PSO-BPNN, PSO-SVR, and PSO-RF. The comparative results between the predicted and experimental compressive strengths for all five models are shown in Figure 4. In the figure, the red dotted line represents the model’s fit, the green dashed line represents the identity line (y = x), and the blue dashed lines indicate the ±20% error envelope. The closer the fitting slope of the model on the dataset is to 1, the more accurate the model’s predictions are.
As shown in Figure 4, the test-set slope for the PSO-XGBoost model is 0.9881, and the proportion of predicted values within ±20% of the actual values is significantly higher, indicating more accurate predictions.
As depicted in Figure 5, t p r e / t r e is the ratio of predicted compressive strength to experimental compressive strength. The distribution of the prediction-to-experimental ratio ( t p r e / t r e ) highlights the robustness of the PSO-XGBoost model. Specifically, 96% of the samples in the training set and 77% in the testing set lie within the ±5% error margin (0.95–1.05). This concentration of errors near unity is markedly superior to that of the competing models, confirming the high reliability of the proposed approach.
To systematically assess and compare predictive performance, R 2 , RMSE, and MAE were calculated for both the training and testing phases. On the training set, the PSO-XGBoost model exhibited R 2 improvements of 5.36%, 0.60%, 2.17%, and 2.07% over PSO-BPNN, PSO-SVM, PSO-RF, and XGBoost, respectively. Correspondingly, RMSE was reduced by 66.5%, 29.2%, 52.5%, and 51.0% for the same model comparisons. MAE decreased by 75.2%, 51.4%, 70.5%, and 68.7%, respectively. These metrics consistently show that the PSO-XGBoost model achieves the highest training accuracy. As shown in Figure 6, the prediction performance of each model on the training set is better than that on the testing set. This observation is consistent with the phenomenon of overfitting, in which models tend to fit the training data more closely than the testing data. Notably, the PSO-XGBoost model exhibited the narrowest performance gap between the training and testing sets. This further substantiates its superior generalization capability and algorithmic stability (Table 3).
In order to evaluate the reliability of the PSO-XGBoost model, the prediction interval method [35] is used to quantify its prediction uncertainty. First, the ratio r between the actual compressive strength and the predicted compressive strength of each sample is calculated, and then a 95% prediction interval is constructed based on the mean and standard deviation of the ratio r, as shown in Equation (8).
P I = [ μ z σ , μ + z σ ]
where μ is the average value of the ratio of the actual value to the predicted value, σ is the standard deviation of the ratio of the actual value to the predicted value, and z is the value corresponding to the selected probability level. When the probability is 95%, z is 1.96.
The calculation results show that the prediction interval of PSO-XGBoost model is [0.9264, 1.0729]. The 283 sets of data in the dataset are all within the range of 95% prediction interval, indicating that the PSO-XGBoost model has excellent reliability.

4.2. Feature Importance Analysis

To address the “black-box” nature of machine learning models, the SHAP method was used to elucidate the relationships between the input features and the response variable. As shown in Figure 7, a horizontal bar chart presents the percentage importance of each feature in predicting the 28-day compressive strength of tailings-based cementitious materials. The vertical axis represents the input features, the horizontal axis represents the normalized importance (%), and the sum of the importance values of all features is 100%. The longer the bar, the greater the influence of the feature on the model’s predicted output. The SHAP feature importance analysis results are generally consistent with the previous correlation analysis findings. Among these, SiO2, Al2O3, Fe2O3, and tailings content emerged as the most critical determinants of the strength of tailings-based cement materials, accounting for a cumulative contribution of 79.22%. Specific surface area (SSA), NaOH dosage, and gypsum content also exhibited significant importance scores. The cumulative contribution of the above seven features is approximately 95.51%, indicating their dominant role in the predictive performance of the PSO-XGBoost model. Mechanical grinding, NaOH content, and gypsum content—among the activation strategies—exert the greatest influence on the compressive strength of activated tailings-based cement materials.
Figure 8 provides a global summary plot of the input features, quantifying the direction and magnitude of their influence on strength predictions. As shown in the figure, tailings content is negatively correlated with the compressive strength of activated tailings concrete, where higher values consistently reduce the model’s predicted output. In contrast, specific surface area, NaOH content, and gypsum content are positively correlated with the predicted strength, and increases in these feature values lead to higher predicted strength.
To further investigate the influence of critical features on compressive strength, partial dependence plot (PDP) analyses were performed for tailings content, specific surface area (SSA), NaOH dosage, and gypsum content (Figure 9). As tailings content increases, the predicted compressive strength exhibits a clear downward trend, confirming a significant negative correlation. This is attributed to the “dilution effect,” whereby higher tailings replacement levels reduce the cement content, thereby decreasing the Ca(OH)2 yield and lowering the system’s alkalinity [36]. For specific surface area (SSA) in the range of 0 to 800 m2/g, a positive correlation with compressive strength was observed. Exceeding this threshold, however, negatively impacts strength development. This is because moderate mechanical grinding increases the yield of fine particles, enhances the micro-aggregate filling effect, and optimizes the pore structure. Excessive grinding, however, triggers particle agglomeration, hindering the continuous development of mechanical strength [37]. NaOH dosage and gypsum content exhibit similar threshold behavior, with optimal levels of approximately 2% and 9%, respectively. This is because adding appropriate amounts of NaOH and gypsum effectively stimulates the pozzolanic activity of the tailings, promotes secondary hydration reactions, and facilitates the formation of calcium silicate hydrate (C-S-H) gels, thereby fundamentally enhancing matrix strength. Excessive incorporation, however, disrupts the system’s balance and has an adverse effect [38].

4.3. Experimental Verification

To validate the predictive performance and generalization capability of the PSO-XGBoost model, an experimental verification phase was conducted. Guided by the SHAP feature importance analysis, optimized operational ranges for mechanical grinding time, NaOH dosage, and gypsum content were established. The ranges of the parameters are as follows: mechanical grinding time, 40–80 min; NaOH content, 1–3%; and gypsum content, 6–12%. Subsequently, a response surface methodology (RSM) experiment was conducted using tailings from a tungsten mine in China. Table 4 presents the correlation between mechanical grinding duration and the resulting specific surface area (SSA) of the tailings, and Table 5 presents the response levels of the experimental factors. The experimental results are presented in Table 6. The optimized PSO-XGBoost model was then used to predict the compressive strength of tailings-based cement materials, with the predictions benchmarked against measured values. The prediction errors remained consistently within ±8.7%, demonstrating the model’s high fidelity and confirming that the PSO-XGBoost model possesses robust generalization capability and reliability for tailings-based cement materials.

5. Conclusions

(1) The PSO-XGBoost model demonstrated superior predictive performance over the XGBoost, PSO-SVM, PSO-BPNN, and PSO-RF models, with R 2 on the test set improving by 6.19%, 6.10%, 2.67%, and 3.41%, respectively. These results show that the PSO-XGBoost model can accurately and reliably predict the compressive strength of tailings-based cement materials.
(2) Feature importance analysis based on the PSO-XGBoost model identified mechanical grinding, NaOH dosage, gypsum content, and tailings replacement level as the primary determinants of compressive strength.
(3) The partial dependence plot (PDP) shows that tailings content is negatively correlated with compressive strength, where higher substitution levels lead to a systematic reduction in strength due to the dilution effect. The specific surface area (0–800 m2/kg), NaOH content (0–2%), and gypsum content (0–9%) have a positive effect on the compressive strength of activated tailings concrete within these respective thresholds; beyond them, “over-activation” leads to diminished strength.
(4) Based on the key features identified, tailings from a tungsten mine were tested, and the optimized PSO-XGBoost model was used for prediction. Experimental validation using these tungsten tailings confirmed that the model’s prediction errors remained within an 8.7% margin. This demonstrates that the PSO-XGBoost model has high predictive accuracy and provides theoretical guidance for the sustainable resource utilization of tungsten tailings in the mine.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ma19122557/s1, Table S1: dataset.

Author Contributions

Z.Z.: Writing—original draft, Writing—review & editing. J.Y.: Investigation, Supervision. S.D.: Data curation, Investigation. T.L.: review, Supervision. W.Y.: Visualization, Resources. X.L.: Writing—review & editing, Validation. X.Y.: Writing—review & editing, Formal analysis. All authors have read and agreed to the published version of the manuscript.

Funding

Yang Jianyu reports financial support was provided by National Natural Science Foundation of China; Yang Jianyu reports financial support was provided by Hunan Provincial Water Conservancy Department Science and Technology Project. If there are other authors, they declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper. This research was supported by National Natural Science Foundation of China (52209155); Hunan Provincial Water Conservancy Department Science and Technology Project (XSKJ2024064-34).

Data Availability Statement

The original contributions presented in this study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

Author Xiuxin Li was employed by the company Shenzhen Raw Water Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Saedi, A.; Jamshidi-Zanjani, A.; Darban, A.K. A review on different methods of activating tailings to improve their cementitious property as cemented paste and reusability. J. Environ. Manag. 2020, 270, 110881. [Google Scholar] [CrossRef]
  2. Simonsen, A.M.T.; Solismaa, S.; Hansen, H.K.; Jensen, P.E. Evaluation of mine tailings’ potential as supplementary cementitious materials based on chemical, mineralogical and physical characteristics. Waste Manag. 2020, 102, 710–721. [Google Scholar] [CrossRef]
  3. Maruthupandian, S.; Chaliasou, A.; Kanellopoulos, A. Recycling mine tailings as precursors for cementitious binders–Methods, challenges and future outlook. Constr. Build. Mater. 2021, 312, 125333. [Google Scholar] [CrossRef]
  4. Zhu, Q.; Yuan, Y.; Chen, J.; Lv, Y.; Sun, X.; Duan, J. Bond performance between deformed bars and concrete containing iron tailing sand exposed to high temperature. Structures 2023, 56, 104940. [Google Scholar] [CrossRef]
  5. Bai, J.; Ma, J.; Feng, G.; Cui, B.; Wu, Y. Mechanical characteristic and microstructural properties of tailing cemented paste backfill under uniaxial compression. Structures 2025, 82, 110651. [Google Scholar] [CrossRef]
  6. Gu, X.; Zhang, W.; Zhang, X.; Li, X.; Qiu, J. Hydration characteristics investigation of iron tailings blended ultra high performance concrete: The effects of mechanical activation and iron tailings content. J. Build. Eng. 2022, 45, 103459. [Google Scholar] [CrossRef]
  7. Saedi, A.; Jamshidi-Zanjani, A.; Mohseni, M.; Darban, A.K.; Nejati, H. Mechanical activation of lead–zinc mine tailings as a substitution for cement in concrete construction. Constr. Build. Mater. 2023, 364, 129973. [Google Scholar] [CrossRef]
  8. Zhang, M.; Lv, H.; Jiang, Q.; Wu, Y.; Yang, Y.; Dai, H. Study on flexural behavior of laminated slabs constructed with composite limestone powder-tailings mixed sand concrete. Structures 2025, 77, 109169. [Google Scholar] [CrossRef]
  9. Pazouki, G.; Saeed, N.; Tao, Z.; Kang, W.H. Proposing an optimised data-efficient artificial neural network model for estimating the compressive strength of concrete containing SCMs. Structures 2025, 79, 109531. [Google Scholar] [CrossRef]
  10. Zhu, T.; Chen, Z.; Wang, Z.; Cao, J.; Hao, J.; Zhou, Z. Energy damage evolution and mesoscopic failure mechanism of cemented waste rock tailing backfill under axial compression. Structures 2025, 71, 108057. [Google Scholar] [CrossRef]
  11. Mohammed, H.X.; Mohammed, A.S.; Hassan, A.M.T. Soft computing models to evaluate the effect of fly ash and ground granulated blast furnace slag (GGBS) on the compressive strength of concrete in normal and high strength ranges. Structures 2023, 58, 105459. [Google Scholar] [CrossRef]
  12. Yao, G.; Wang, Q.; Wang, Z.; Wang, J.; Lyu, X. Activation of hydration properties of iron ore tailings and their application as supplementary cementitious materials in cement. Powder Technol. 2020, 360, 863–871. [Google Scholar] [CrossRef]
  13. Li, K.; Yao, J.; Li, X.; Li, S.; Min, N.; Li, Z.; Li, X. Activating silicic gangue tailings via high-Ca/Si ratio all-solid-waste cementitious materials: Constructing low-carbon, high-performance cemented backfill and mechanisms. Miner. Eng. 2026, 235, 109847. [Google Scholar] [CrossRef]
  14. Saedi, A.; Jamshidi-Zanjani, A.; Mohseni, M.; Darban, A.K. Effect of mechanochemical activation on hydration properties of lead-zinc sulfide tailings for concrete construction. Case Stud. Constr. Mater. 2023, 18, e01996. [Google Scholar] [CrossRef]
  15. Zhang, Y.; Xi, M.; Liu, J.; Wang, W.; Lyu, X. Axial compressive capacity prediction of coal gangue concrete-filled steel tube stub columns: Genetic programming-augmented code calibration and Bayesian-optimized machine learning. Structures 2025, 80, 110119. [Google Scholar] [CrossRef]
  16. Philip, S.; Marakkath, N. An interpretable machine learning approach with graphical user interface for strength prediction of sustainable slag-based geopolymer concrete. Structures 2025, 81, 110281. [Google Scholar] [CrossRef]
  17. Jafari, A.; Mollaali, M.; Ma, L.; Shahmansouri, A.A.; Zhou, Y.; Dugnani, R. Brittle fracture strength prediction via XML with reliability considerations. Eng. Fract. Mech. 2025, 328, 111555. [Google Scholar] [CrossRef]
  18. Hasan, M.; Hassan, M.S.; Hasan, K.; Tushar, F.H.; Khan, M.; Jaya, R.P. Interpretable machine learning approach for predicting the workability and mechanical properties of betel nut husk fiber-reinforced concrete. Structures 2025, 81, 110152. [Google Scholar] [CrossRef]
  19. Wang, Z.; Liu, S.; Liang, W.; Liu, J.; Zhou, Y.; Lei, K.; Gao, Y.; Ou, W. Predictive Modeling of Compressive Strength in Tailings Concrete Using Explainable Machine Learning Approaches. Results Eng. 2025, 27, 105516. [Google Scholar] [CrossRef]
  20. Alyami, M.; Onyelowe, K.; AlAteah, A.H.; Alahmari, T.S.; Alsubeai, A.; Ullah, I.; Javed, M.F. Innovative hybrid machine learning models for estimating the compressive strength of copper mine tailings concrete. Case Stud. Constr. Mater. 2024, 21, e03869. [Google Scholar] [CrossRef]
  21. Sobuz, M.H.R.; Alzlfawi, A.; Hakeem, I.Y.; Tilak, M.K.C.; Kabbo, M.K.I.; Jameel, M.; Abubakar, S.A. Machine learning-based modeling to predict and parametrically optimize the compressive strength of nanomaterial concrete composites exposed to elevated temperatures. Case Stud. Constr. Mater. 2025, 25, e05546. [Google Scholar] [CrossRef]
  22. Chi, B.; Wang, S.; Zhu, F.; Quan, Y.; Yang, X.; Wang, F. Optimized Prediction of the Compressive Strength of Hollow Concrete Masonry Prisms Using Unified Mortar Index and Hybrid Machine Learning Models. Constr. Build. Mater. 2025, 505, 144800. [Google Scholar] [CrossRef]
  23. Miao, Q.; Gao, Z.; Zhu, K.; Guo, Z.; Sun, Q.; Zhou, L. Interpretable machine learning model for compressive strength prediction of self-compacting concrete with recycled concrete aggregates and SCMs. J. Build. Eng. 2025, 108, 112965. [Google Scholar] [CrossRef]
  24. Alyami, M.; Ullah, I.; AlAteah, A.H.; Alsubeai, A.; Alahmari, T.S.; Farooq, F.; Alabduljabbar, H. Machine learning models for predicting the compressive strength of cement-based mortar materials: Hyper tuning and optimization. Structures 2025, 71, 107931. [Google Scholar] [CrossRef]
  25. Yılmaz, Y.; Nayır, S. Machine learning based prediction of compressive and flexural strength of recycled plastic waste aggregate concrete. Structures 2024, 69, 107363. [Google Scholar] [CrossRef]
  26. Yadav, A.A.; Hirekhan, S.G.; Bhandari, P.S.; Bhagat, R.M.; Ranit, A.B.; Shelare, S.; Soudagar, M.E.M.; Sharma, S.; Raja, V.B.; Mahapatro, A.; et al. AI-driven sustainable concrete mix design: Hybrid deep Q-Learning and genetic algorithms-based multi-objective machine learning optimizations for high structural strength, low cost, and low carbon footprints. Structures 2025, 82, 110443. [Google Scholar] [CrossRef]
  27. Wijesundara, S.; Wijesundara, K.; Bandara, S. Machine learning approach for predicting the compressive strength of ultra-high performance fiber reinforced concrete (UHPFRC). Structures 2025, 75, 108704. [Google Scholar] [CrossRef]
  28. Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
  29. Boulesteix, A.L.; Janitza, S.; Kruppa, J.; König, I.R. Overview of random forest methodology and practical guidance with emphasis on computational biology and bioinformatics. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2012, 2, 493–507. [Google Scholar] [CrossRef]
  30. Awad, M.; Khanna, R. Support Vector Regression. In Efficient Learning Machines: Theories, Concepts, and Applications for Engineers and System Designers; Apress: Berkeley, CA, USA, 2015; pp. 67–80. [Google Scholar]
  31. Asaad, R.R.; Ali, R.I. Back Propagation Neural Network (BPNN) and sigmoid activation function in multi-layer networks. Acad. J. Nawroz Univ. 2019, 8, 216. [Google Scholar] [CrossRef]
  32. Wang, D.; Tan, D.; Liu, L. Particle swarm optimization algorithm: An overview. Soft Comput. 2018, 22, 387–408. [Google Scholar] [CrossRef]
  33. Van den Broeck, G.; Lykov, A.; Schleich, M.; Suciu, D. On the tractability of SHAP explanations. J. Artif. Intell. Res. 2022, 74, 851–886. [Google Scholar] [CrossRef]
  34. GB/T 17671-2021; Test Method of Cement Mortar Strength (ISO Method). National Standards of China: Beijing, China, 2021.
  35. Alavi, S.A.; Noel, M. Uncertainty and prediction intervals of new machine learning approach for non-destructive evaluation of concrete compressive strength. Buildings 2025, 15, 544. [Google Scholar] [CrossRef]
  36. Yang, M.; Sun, J.; Yuan, J.; Li, H.; Xu, D.; Zhang, R.; Ma, X. Iron tailing Powder in Cement-Based Composites: Evaluating Filling Effect from Particle-Size Distribution. Case Stud. Constr. Mater. 2025, 23, e04919. [Google Scholar] [CrossRef]
  37. Yanning, S.; Qiong, F.; Hongxia, Q.; Chao, W. Study on mechanically activated ferrous extraction tailing of nickel slag as cementitious materials: Physical and chemical properties, mechanical properties and reaction mechanism. Constr. Build. Mater. 2024, 422, 135748. [Google Scholar] [CrossRef]
  38. Wang, Z.; Chu, H.; Wang, J.; Feng, E.; Zhang, Y.; Lyu, X. Mechanical activation of siliceous tailings and its application as cement admixtures. Miner. Eng. 2022, 177, 107366. [Google Scholar] [CrossRef]
Figure 1. Flow chart.
Figure 1. Flow chart.
Materials 19 02557 g001
Figure 2. Frequency distribution of characteristic variables and response variables.
Figure 2. Frequency distribution of characteristic variables and response variables.
Materials 19 02557 g002
Figure 3. Variable correlations: (a) Spearman correlation coefficient matrix, (b) Pearson correlation coefficient matrix.
Figure 3. Variable correlations: (a) Spearman correlation coefficient matrix, (b) Pearson correlation coefficient matrix.
Materials 19 02557 g003
Figure 4. The relationship between the actual value and the predicted compressive strength of each model.
Figure 4. The relationship between the actual value and the predicted compressive strength of each model.
Materials 19 02557 g004
Figure 5. Statistical graphs of tpre/tre values of different models.
Figure 5. Statistical graphs of tpre/tre values of different models.
Materials 19 02557 g005
Figure 6. Comparison of evaluation indexes of different prediction models: (a) R2; (b) RMSE; (c) MAE.
Figure 6. Comparison of evaluation indexes of different prediction models: (a) R2; (b) RMSE; (c) MAE.
Materials 19 02557 g006
Figure 7. Feature importance graph.
Figure 7. Feature importance graph.
Materials 19 02557 g007
Figure 8. SHAP global interpretation diagram.
Figure 8. SHAP global interpretation diagram.
Materials 19 02557 g008
Figure 9. Partial dependence graph.
Figure 9. Partial dependence graph.
Materials 19 02557 g009
Table 1. Chemical composition of tungsten tailings/wt.%.
Table 1. Chemical composition of tungsten tailings/wt.%.
SiO2CaOMgOFe2O3Al2O3SO3Others
64.713.696.8912.177.590.194.76
Table 2. Statistical table of each parameter.
Table 2. Statistical table of each parameter.
Input Characteristics and Output VariablesMinimum ValueMaximum ValueAverage ValueMedianStandard Deviation
SiO2 (%)30.4274.8964.4867.7110.34
Al2O3 (%)0.7619.657.057.593.45
CaO (%)2.129.215.653.696.97
Fe2O3 (%)1.8922.1411.4612.173.77
BET Surface Area (m2/g)45.61617782.43770339.45
NaOH Content (%)060.6801.08
Na2SiO3 Content (%)0150.5302.21
Na2SO4 Content (%)020.05400.24
Gypsum Content (%)0121.7603.29
Tailings Dosage (%)08031.77309.81
CS (MPa)8.5758.735.8336.69.44
Table 3. Summary of evaluation results.
Table 3. Summary of evaluation results.
ModelData SetR2RMSE (MPa)MAE (MPa)
PSO-BPNNTrain0.94282.241.45
Test0.91492.802.08
PSO-SVMTrain0.98741.060.72
Test0.88543.351.39
PSO-RFTrain0.97221.571.22
Test0.90842.822.01
PSO-XGBoostTrain0.99330.750.36
Test0.93932.560.59
XGBoostTrain0.97311.531.15
Test0.88462.791.81
Table 4. Relationship between mechanical grinding time and specific surface area.
Table 4. Relationship between mechanical grinding time and specific surface area.
Mechanical grinding time (min)020406080
Specific surface area (m2/g)420580640770670
Table 5. Levels of response experiment factors.
Table 5. Levels of response experiment factors.
FactorsCodeLevel
−101
Mechanical grinding time (min)A406080
NaOH content (%)B123
Gypsum content (%)C6912
Table 6. Response surface design table.
Table 6. Response surface design table.
NOABCTest Strength (MPa)Prediction Strength
(MPa)
Relative Error (%)
1401940.7439.612.77
2801940.1439.621.29
3403940.8939.643.05
4803940.6239.642.41
5402640.6940.440.61
6802640.8740.470.98
74021241.2240.591.53
88021240.5740.730.39
9601641.2442.262.47
10603641.5342.321.90
116011241.1642.372.94
126031241.4542.52.53
13602943.7539.948.70
14602941.3639.943.43
15602942.5639.946.16
16602942.339.945.58
17602943.0439.947.20
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhong, Z.; Deng, S.; Liu, T.; Li, X.; Ye, X.; Yang, W.; Yang, J. Prediction of the Compressive Strength of Tailings-Based Cement Material Using Machine Learning Models with Experimental Validation. Materials 2026, 19, 2557. https://doi.org/10.3390/ma19122557

AMA Style

Zhong Z, Deng S, Liu T, Li X, Ye X, Yang W, Yang J. Prediction of the Compressive Strength of Tailings-Based Cement Material Using Machine Learning Models with Experimental Validation. Materials. 2026; 19(12):2557. https://doi.org/10.3390/ma19122557

Chicago/Turabian Style

Zhong, Zhanming, Senrui Deng, Tao Liu, Xiuxin Li, Xin Ye, Weijun Yang, and Jianyu Yang. 2026. "Prediction of the Compressive Strength of Tailings-Based Cement Material Using Machine Learning Models with Experimental Validation" Materials 19, no. 12: 2557. https://doi.org/10.3390/ma19122557

APA Style

Zhong, Z., Deng, S., Liu, T., Li, X., Ye, X., Yang, W., & Yang, J. (2026). Prediction of the Compressive Strength of Tailings-Based Cement Material Using Machine Learning Models with Experimental Validation. Materials, 19(12), 2557. https://doi.org/10.3390/ma19122557

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop