1. Introduction
Currently, the issues associated with cracking in concrete experienced by clients, design team members, and contractors were more than any other problems according to Materials for Life (M4L) [
1]. Moreover, cracks are primarily responsible for the reduction of the strength and stiffness of concrete. In European countries, the annual cost spent on maintenance, refurbishment, and repair of concrete cracks in extending the service life of infrastructure is estimated to be around 50% of their annual construction budget [
2]. It has been suggested by M4L that self-healing materials have great potential to address the problems associated with concrete cracking and thereby reducing the maintenance costs over a structure’s lifetime [
1].
The inspiration of self-healing comes from the biomimicry concept and the healing process in living nature [
3]. For example, the skin of humans or animals can biologically repair itself from simple injuries. In cement-based materials, the process of crack self-healing can be categorized into two major mechanisms, i.e., autogenous healing and autonomous healing [
4]. The former indicates the self-healing ability results from the physical and/or chemical composition of the cementitious matrix, whereas the self-healing mechanism of the latter is triggered by some biological agents, such as bacteria that are deliberately introduced into the cementitious matrix.
Generally, the autogenous self-healing of concrete is mainly controlled by two mechanisms including (1) further hydration of cement particles and/or swelling of calcium silicate hydrate and (2) calcium hydroxide carbonation [
5,
6]. It has been reported that crack widths of 10
m [
7], 100
m [
8], 200
m [
9], 205
m [
5], and 300
m [
10] of engineered cementitious composite (ECC) can be self-healed completely [
11].
ECC is a high-performance fiber-reinforced cementitious composite, and its matrix design is strongly associated with the autogenous self-healing mechanism [
12]. ECC features high tensile ductility with a typical fiber volume fraction of 2% [
13,
14] to promote the self-healing ability [
4]. However, the intrinsic self-healing ability of ECC is complex and difficult to predict because of different mineral admixture types, interactivity between different composites in the cementitious matrix and its interaction with the exposed environment [
15], and unpredictable crack location, orientation, and width [
16]. Previous studies have explored the influence of several factors such as limestone powders (LPs) [
17,
18], fly ash (FA) [
19,
20], hydrated lime [
21], the water/binder ratio [
22], water permeation [
23], and different curing conditions (air, carbon dioxide, wet/dry, and water) [
24] on the self-healing behavior of ECC. However, these studies did not predict the self-healing efficiency of ECC by modeling their experimental data. In fact, the relationship between multiple factors is complex and non-linear, so it is difficult to predict the self-healing of ECC mathematically based on the available data. Moreover, mathematical models based on empirical data are generally in regression forms, which cannot be used when the problem (e.g., prediction of the self-healing potential of ECC) contains too many independent variables and requires more assumptions to be made [
25].
To account for the drawbacks of using mathematical models, machine learning (ML) techniques have been used for solving many civil engineering problems with multiple variables [
26]. They are model-free approaches that do not rely on predefined models [
27]. Many research works have been conducted using ML algorithms for the prediction of various properties of concrete [
28,
29]. Gilan et al. [
30] developed a hybrid support vector regression (SVR)–particle swarm optimization (PSO) algorithm model to predict the compressive strength and chloride ion penetrability of concretes containing metakaolin. Yan et al. [
31] predicted the bond strength of a glass-fiber-reinforced polymer bar in concrete by an artificial neural network (ANN) with the genetic algorithm (GA). Yaseen et al. [
32] proposed an ML method called extreme learning machine (ELM) to predict the compressive strength of lightweight foamed concrete.
In the literature, the performance of various ML algorithms in predicting concrete properties have been evaluated and compared. Yan and Shi [
33] reported that SVR is better than other individual methods in predicting the elastic modulus of normal and high-strength concrete. Chou [
34] compared the performance of several individual and ensemble methods for predicting the mechanical properties of high-performance concrete. The results revealed that ensemble learning strategies outperform individual learning techniques in predicting the compressive strength of high-performance concrete. Reuter et al. [
27] employed three different individual approaches for modeling concrete failure surfaces. They found that all three individual approaches are able to fit the experimental data with low error. Sobhani et al. [
35] suggested that their proposed fuzzy inference system and ANN are more reliable than traditional regression models in predicting no-slump concrete. Omran et al. [
36] predicted the compressive strengths of an environmentally friendly concrete by using three individual methods, two ensemble methods, and four regression tree models. Their results showed that the individual Gaussian process regression model and its ensemble models outperformed other models.
Although different ML algorithms have been utilized to predict several properties of concrete, the application of ML on self-healing prediction is considerably rare. Recently, Mauludin and Oucif [
37] reviewed the common methods used for modeling autogenous self-healing of concrete and stated that the methods can be classified into two categories: (1) numerical simulation and (2) ML. However, the only ML model reviewed in their study was the GA–ANN method proposed by Ramadan et al. [
3]. They predicted the self-healing ability of cement-based materials using a dataset collected from the literature. The results showed that the GA–ANN model was capable of capturing the complex effects of various self-healing agents (e.g., biochemical material, silica-based additive, expansive and crystalline components) on the self-healing performance of cement-based materials.
Chaitanya et al. [
38] used an ANN model to predict the self-healing property of concrete containing ground granulated blast furnace slag in terms of compressive strength recovery based on 51 samples collected from their experimental studies. Generally, the predicted results by the ANN model were in good agreement with the experimental values. Zhuang and Zhou [
39] conducted a comparative study on six ML algorithms including SVR, decision tree regression (DTR), gradient boosting regression (GBR), ANN, Bayesian ridge regression (BRR), and kernel ridge regression (KRR) for the crack-repairing ability of the bacteria-based self-healing concrete. The results showed that GBR performed much better than other models with
values of 0.93 and 0.74 for the training set and testing set, respectively. However, the
values of most models were less than 0.7 on both the training and testing sets. Although extensive experiments with different combinations of influencing variables were utilized to generate the empirical dataset, their study only selected three variables including the number of bacteria, the healing time, and the initial crack width to predict the crack closure percentage as the output. Huang et al. [
40] used six types of machine learning algorithms to predict the healing performance of self-healing concrete. The data were taken from the open literature; however, different studies used different self-healing indicators (e.g., crack width, permeability, or mechanical properties) to assess self-healing; therefore, their study lacks a specific analysis and discussion on the efficiency of the models for predicting a specific indicator. Ahmad et al. [
41] explored several ML algorithms, including support vector machine (SVM), random forest (RF), AdaBoost, and k-nearest neighbor (KNN) for predicting the shear strength of rockfill materials. The results demonstrated that the SVM achieved the best prediction performance.
To the best of our knowledge, there has been no study to date to predict the self-healing of ECC using the ML approach. It is worthwhile to understand and evaluate the prediction capability of various ML models for the self-healing of ECC. Furthermore, conducting experiments is usually expensive and time consuming; therefore, it is necessary to develop accurate and reliable prediction models for the self-healing ability of ECC. In addition, it is critical to choose an appropriate ML method as each algorithm has a significant effect on the accuracy of the results [
42]. Therefore, this study aims to provide a comparative analysis on the performance of various ML models in predicting the self-healing capability of ECC. The ML model with the best performance can be used as a baseline prediction model for developing advanced models in the future.
In this paper, four ML individual methods including linear regression (LR), SVR, back-propagation neural network (BPNN), and classification and regression tree (CART) were proposed to predict the self-healing capability of ECC. To improve the prediction accuracy, three ensemble methods, namely bagging, AdaBoost, and stacking, were used to construct ensemble models using the individual models as the base learners. A series of experimental works on the self-healing performance of ECC samples was conducted, and the results were used to develop and compare the accuracy among the ML models. Experimental data collected from the experiments were first preprocessed and then divided into a 10-fold cross-validation algorithm (for the details, refer to
Section 4.1) to avoid overfitting.
Figure 1 summarizes the steps that were performed when predicting the self-healing of ECC.
This paper is organized as follows.
Section 2 presents the experimental program detailing the materials used for ECC specimen preparation and the test setup for crack data measurement. The concepts and formulations of individual and ensemble models used for predicting the self-healing of ECC are presented in
Section 3, whereas the validation and evaluation methods are described in
Section 4. In
Section 5, the computational results are presented and compared, and the model with the best prediction performance is identified. Finally,
Section 6 draws the major conclusions from this work and suggests some directions for future research.
5. Results and Discussion
In this section, the prediction performance of individual and ensemble methods are examined by the MAE, RMSE, and according to the ten-fold cross-validation. The abbreviations for labeling models were adopted in such a way that the letters Bag, Ada, and Stack stand for the ensemble methods of bagging, AdaBoost, and stacking, respectively. The letters LR, SVR, BPNN, and CART stand for the base estimators. However, Stack_LR refers to the model combining the base methods including SVR, BPNN, and CART in the first level and using LR as a meta-regressor in the second level.
5.1. Prediction Performance of the Proposed Models
Table 5 shows the ten-fold cross-validation results (MAE, RMSE, and
) for both individual and ensemble models and their deviation with respect to the results of the LR model.
Generally, most of the proposed models were able to learn and predict the empirical data with an acceptable degree of precision. Based on the results, the Stack_LR model showed the best prediction performance as it had the highest
value and the lowest MAE and RMSE values. Among the individual models, SVR performed the best in terms of the MAE (4.296), whereas BPNN had the lowest RMSE value (6.515) and the highest
of 0.899. Among the individual models boosted by either AdaBoost or bagging, Bag_CART gave the best performance in terms of the MAE (4.093), while Bag_BPNN performed better on the RMSE value (6.341). In terms of
, the Bag_CART and Bag_BPNN models showed the same performance (0.901) and were better than other ensemble methods, except Stack_LR. The performances of all ML models described in
Table 5 are depicted in
Figure 10a–c in terms of the MAE, RMSE, and
, respectively.
Overall, all models could reduce the error values and increase the prediction accuracy compared with LR, except Bag_LR. Among the models boosted by AdaBoost, Ada_SVR performed the best with the lowest MAE value, whereas Ada_BPNN performed the best on the RMSE value, showing the highest
value. In the case of bagging, both Bag_CART and Bag_BPNN performed better in terms the MAE, RMSE and
than those of the corresponding models boosted by AdaBoost. However, Bag_LR showed a poor performance compared to LR on the MAE and RMSE values. For a better comparison among the ensemble methods used, the performance results between the ensemble models and their corresponding individual (or benchmark) models are indicated in
Table 6. The results indicate that most ensemble methods improved the performance of individual models. For example, the MAE and RMSE values of the BPNN after bagging reduced by 4.3% and 2.7%, respectively, and its
was much higher than that of the individual BPNN model. Among all the ensemble methods studied, stacking showed the best improvement on all performance measures.
However, the results showed that the effectiveness of the ensemble methods on the individual models varied. For instance, the bagging method enhanced the performance of the BPNN and CART substantially, but not for both LR and SVR models. On the other hand, the AdaBoost method brought a considerable improvement for the LR and SVR models. To improve the performance accuracy, researchers should employ different ensemble methods and compare their effectiveness on different ML models.
5.2. Prediction Performance Comparison
5.2.1. Comparison of the MAE
To reveal the accuracy of the proposed ML models in self-healing prediction, the comparison of the observed crack widths of ECC after self-healing with predicted crack widths are shown in
Figure 11,
Figure 12,
Figure 13 and
Figure 14.
Figure 11a shows the observed crack widths compared with the crack widths predicted by different individual ML models.
Figure 11b–e shows the variations between the observed and the crack widths predicted by each individual ML model corresponding to their initial crack widths before self-healing. In other words, the prediction performance of the models in a particular range of crack widths can be revealed. It should be noted that the horizontal line located at the vertical coordinate of zero (
) is considered as the target line [
25,
31]. Generally, the smaller the variation (i.e., closer to the target line), the better the self-healing prediction was, which means smaller or even no variation between the observed and the predicted crack widths after self-healing.
As shown in
Figure 11, the SVR model generally exhibited better prediction results than other individual models, while the LR model was the worst, showing substantial deviation from the target line (denoting relatively large differences between the observed and the predicted crack widths). For the initial crack widths less than 20
m and over 100
m before self-healing, the variations between the observed and the ones predicted by the SVR model were smaller than other individual ML models. The corresponding MAE values were 1.358 and 2.724. However, for the crack widths between 20
m and 60
m, the CART model performed the best with the lowest MAE of 5.045, while the BPNN model had the lowest MAE of 9.565 for the crack widths between 60
m and 100
m. It seems that the choice of ML models may depend on the initial crack widths. However, in terms of overall accuracy among the individual models, SVR performed the best, followed by CART, the BPNN, and LR. This is consistent with the results shown in
Table 5.
The performance of ensemble methods using AdaBoost and bagging is shown in
Figure 12 and
Figure 13. In general, the ensemble models Ada_CART and bag_CART exhibited lower variations in the self-healing results compared to the other ensemble models. In particular, the MAE values of Ada_CART and bag_CART for crack widths between 20 self heal and 60
m were 5.037 and 5.000, respectively. These values were smaller than that of CART (5.045), as shown in
Figure 11e. However, the variations among the BPNN, Ada_BPNN, and bag_BPNN were not significant. Similar variations can be found when comparing SVR with Ada_SVR and bag_SVR.
After stacking, the error variations shown in
Figure 14 were much reduced when compared to those shown in
Figure 11,
Figure 12 and
Figure 13. More specifically, the MAE of stack_LR for crack widths less than 20
m, between 20
m and 60
m, between 60
m and 100
m, and over 100
m were 1.361, 4.931, 9.789, and 3.177, respectively. These MAE values were the lowest among all the ML models studied. Based on the results, it can be concluded that the stack_LR model performed the best.
It is known that a smaller crack width is favorable for autogenous healing in concrete [
76,
77] as small cracks consume less repair products to complete self-healing [
78]. However, a larger crack width will not heal completely or just heal partially. As shown in
Figure 11b,
Figure 12b,
Figure 13b and
Figure 14b, the variations between the observed and predicted results for the LR, bag_LR, and Ada_LR models increased with the increase of the crack width. For a crack width below 20
m, the MAE values were less than 1.5, which was much lower than those for crack widths between 20
m and 60
m (i.e., 6.23) and between 60
m and 100
m (around 10). Similar trends were observed in other models, but with smaller variations. Specifically, for a crack width over 100
m, the LR, bag_LR, and Ada_LR models showed much higher variations. Their MAE values were over 20 and higher than other ML models with the MAE less than 10.
5.2.2. Comparison of the RMSE
A box plot, as shown in
Figure 15, was created to show the distribution of the RMSE results of each ML model based on the ten-fold cross-validation. The RMSE values were calculated based on the differences between the predicted and observed crack widths. The box plot is a statistical tool that is used to depict numerical data through their quartiles including the maximum, minimum, and median values of a dataset [
79,
80]. The medium value is shown as the red line within the box. The interquartile range (IQR) in each box covers 50% (the lower 25% to the upper 75% quartiles) of the RMSE data point, while the whiskers drawn up and down to the maximum and minimum values represent 1.5-times the IQR from the RMSE data. All other points out of the whiskers range are outliers and shown as red dots. A mean value of the RMSE equal to zero would indicate that the predictions perfectly fit the observed data. However, this is almost never achieved in practice [
81]. In general, the lower the RMSE value, the better the prediction performance of a model is.
Assessment of the box plot revealed that the stack_LR model outperformed all other models because of its shortest IQR length and smallest RMSE values, as shown in
Figure 5. In contrast, the LR and bag_LR models had the longest IQR length and largest RMSE values, thereby suggesting that the LR model and its ensemble methods have low accuracy. Among the individual models, the BPNN had the lowest RMSE, while SVR had the shortest IQR length, but with three outliers (out of ten data points). In general, the BPNN gave the most stable performance, showing reasonable low RMSE values with a short IQR length.
5.3. Limitations of Application
Although the present study provided evidence indicating that ML models can be used to predict the self-healing of ECC, there are some challenges that need to be considered. For example, the prolonged time required to optimize and tune the parameters and the high dependence on engineering datasets are the major concerns. The former can be addressed by using an optimization algorithm or developing a hybrid model to automatically adjust the parameters. In addition, high-performance computing (HPC) [
82] can also be used to achieve parallel data processing so as to improve the computing performance and save time. In terms of the engineering dataset, it can be improved by experimental design and experimental process control [
75].
Besides, it is worth noting that the ML models in this study mainly focused on the internal factors (features) such as materials and mix composition; other external environmental factors such as W/D cycles and healing time should also be considered. More research is required to explore the potential benefits and challenges of using ML models to predict the self-healing of cement-based composites.