3.1. Mechanical Properties
As previously stated, the implementation of the CRISP-DM methodology involved the training and testing of several different ML models, ranging from simpler MR methods that are mainly adopted for comparison purposes to more complex models such as ANN, SVM, and RF. The exploration of these different models was accompanied by the associated iterations concerning the features depicted in
Table 3. One of the advantages of iterating different features over several models is that it allows for a better understanding of their ability to fit a problem, by analyzing their performances in function of the resulting metrics. Thus, the same analysis sequence is followed for all models concerning the four main assessed mortar properties (compressive and flexural strength and water absorption by capillarity and by immersion) throughout
Section 3. This sequence begins with the comparison of the predictive performance of all models for each given mortar property. This comparison is then followed by a selection of the ones featuring a better fit, which, in turn, are then analyzed in more detail.
In this context, concerning the mechanical properties of PCM mortars, specifically unilateral compressive strength (UCS),
Table 4 shows a matrix-like distribution of the model assessment metrics described by Equation (1) across the several adopted models and feature selection alternatives. From the analysis of this Table, one can easily infer that, according to the resulting R
2 and RMSE metrics, the ANN seems to have the better fit of all the models for the combination of data corresponding to “allVars” (highest R
2 of 0.98, with lowest RMSE of 1.03), closely followed by the RF model (0.97 R
2 and 1.17 RMSE). Given this, the regression error characteristic (REC) curve for the “allVars” data was drawn, as a way to provide validation on the previous analysis as well as additional insight on the behavior of these models for this database variation (
Figure 2). The REC curves corroborate the findings related to
Table 4, showing that the ANN outperforms the other models, namely in terms of the area under curve (AUC), closely followed by the RF model.
Accordingly, these results prompted a more in-depth analysis of the performance of the ANN and RF models for the prediction of UCS, which was realized through the plotting of the values predicted by the model during its testing phase vs. the actual values obtained during the experimental campaign, representing the ground truth for the models. In these plots, illustrated by
Figure 3a,b for the ANN and the RF models, respectively, it is evident that the closer the points are to the diagonal line, the better the fit and, consequently, the higher the R² value. The figures reveal that both models effectively replicated the behavior of the target variable (UCS), particularly in the lower-to-middle range (i.e., UCS values up to 20 MPa), though the values at the upper range (i.e., above 40 MPa) were slightly over or underestimated. This discrepancy is attributed to the lower number of records in this upper range in the database, which is anticipated to improve as the database expands during future experimental campaigns. Regardless, the ANN still seems to be slightly more able to provide a relatively close estimation of these values at these upper ranges.
Another significant aspect for consideration is the relative importance of the variables for both models, shown in
Figure 4. The figure illustrates how significant each of the used variables (in this case corresponding to the “allVars” database variation) is for each model’s prediction of UCS. It is noteworthy that, similarly to both models, the variables related to the contents of sand (both sand 1 and sand 2) and cement are considered among the most relevant. Bearing in mind that the parameter being predicted is UCS, it is indeed intuitive that the coarser material, especially sand 2, as it is coarser than sand 1, is thus likely to have a greater impact on compressive strength, together with the main binding agent.
Yet, whereas the ANN model seems to follow the more conservative approach in terms of variation of importance between variables, the RF model seems to be more assertive, nearly neglecting the contributions of aspects such as the content of fibers, fly ash, binder type, or gypsum in favor of a higher significance of sand and cement content, which is more in line with the expert knowledge in the field. In addition, the ANN model seems to allocate a high level of importance to the presence of fibers. Although this can make sense in many cases (depending on the type of polymer and the length and width of the fiber stripes), as mixing fibers into aggregates typically results in a more even distribution of stresses and increased ductility, which may result in higher compressive strength, the experimental campaign results did not emphasize this. In fact, while the presence of fibers may have increased ductility and even tensile strength, the direct analysis of experimental results indicates that compressive strength was not affected by it.
Conversely, the RF model assigns only a minimum importance to the content of PCM, which appears to be undervalued when facing the expert knowledge expectation. This expectation pertains to the fact that the addition of PCM, especially when directly incorporated in the form of a paraffin (as was the case throughout the experimental campaign), delays the hydration process of the binders, which ultimately leads to a reduction in mechanical performance in most cases. Despite this, overall UCS results seem to indicate that, even though both models attained a very good performance in terms of metrics, the prioritization of variable influence seems to be slightly more intuitive in the case of RF when compared to expert knowledge in the field.
As far as mechanical properties are concerned, the other PCM-enhanced mortar parameter studied in this work was flexural strength. Similarly to the process adopted concerning UCS, the first step taken in the analysis of flexural strength was metrics-based, as detailed in
Table 5. Although the observation of the metrics seems to indicate that the SVM model is capable of obtaining a slightly higher performance under the “noFibers” database variation (0.84 R
2 and 1.01 RMSE), the latter also seems to be accompanied with a slightly worse performance regarding every other model when compared to the “allVars” database. Moreover, considering that expert knowledge in the field indicates that the inclusion of fibers in mortars enhances flexural strength by increasing ductility and tensile strength (due to fibers’ ability to bridge cracks forming under tensile stress), the approach adopted for analyzing these parameters was to resort to the “allVars” database. The drive behind this choice is related to the fact that the inclusion of the additional variables (in this case related to fibers) can potentially provide added insights, particularly in what concerns to the relative importance of variables.
Ensuing this decision, the REC curves depicted in
Figure 5 were assessed in order to confirm the metrics-based indication that the SVM, ANN, and RF models outperform most of their peers, with the SVM featuring a slightly higher AUC. The seemingly higher performance of the SVM model is further supported by the predicted vs. actual values plots presented in
Figure 6, at least in the lower-to-mid range of values (i.e., below 10 MPa).
Bearing in mind that flexural strength features a component related to compression and another related to tensile strength, the expectation on the relative importance of variables (
Figure 7) is that not only should the sand and cement content continue to display a high significance on the results (as the main contributors to compressive strength), but the fact that the cement, together with the fiber content, are the major factors influencing tensile strength should enhance their relative importance further.
In this context, and similarly to the UCS case, the ANN model output a more conservative approach once again, while correctly identifying the sand content (especially sand 2), the cement content, and the fibers as highly relevant, aligning with the expectations. Concurrently, the RF model also performed similarly to the UCS case, providing a more assertive choice of most important factors, namely both sands and especially cement content, which fits the expert knowledge. Still, this was achieved at the expense of other factors that seem to be undervalued, specifically the presence of fibers. The SVM model, however, while behaving similarly to the RF in terms of selection assertiveness, allocated an extremely high important to the water content. Notwithstanding the fact that water content is obviously important in the mechanical behavior of mortar, favoring this factor in detriment of those most typically related to mechanical performance hinders the generalization potential of this model, even though its assessment metrics were among the best of all models. In summary, the more conservative approach that characterized the ANN model comes across as the best fit for the estimation of the flexural strength behavior of PCM-enhanced mortars.
3.2. Physical Properties
As mentioned, the physical properties of PCM-enhanced mortars that were considered in this study were water absorption by capillarity and by immersion. Beginning with the former and following the same methodology adopted in the previous subsection on mechanical properties,
Table 6 pertains to the assessment of the seven implemented models for the different database combinations. Once again, though some of the models can perform well over all databases, namely ANN, RF, and, to a slightly lesser extent, SVM, it is clear that there is no clear gain in adopting one of the less encompassing database variations in detriment of the “allVars” variation for this parameter. In fact, except for the RF model, which seems to have a slight increase in R
2 of 1% for the “noFibers” variation, the performance of these models tends to decrease with the reduction of the number of variables, providing an indication that all variables are relevant for the prediction of water absorption by capillarity.
Figure 8, featuring the comparison between the REC curves of the models, seems to support the claim that the ANN model displays the best fit for this parameter, followed by the competing RF and SVM models.
In what concerns the predicted vs. actual value plot analysis, depicted in
Figure 9, the ability of ANN to predict the behavior of the mortar in terms of water absorption by capillarity over the entire range of the data is noteworthy. Indeed, even at the upper ranges, which are characterized by a lack of data, the ANN shows a very good fit to the data, corroborating the high R
2 with low RMSE that characterized this model.
This is further validated by the fact that its selection of variables in what concerns their relative importance (
Figure 10) seems to be very reasonable, as it identified water content and the finer materials, such as the sand content, especially sand 1. It also assigned a moderate importance to superplasticizer and PCM, which fits the expert knowledge in the field. Indeed, whereas the former reduces the amount of water typically added to the mortar mixes (which in turn translates into lower porosity and thus less water absorption by capillarity), the latter tends to enfold the aggregate components of the mixes, especially when directly incorporated in the form of paraffin, hindering the amount of water absorbed by capillarity. The duality between superplasticizer and water content also seems to have been identified by the RF and the SVM models, although these tended to favor the superplasticizer and the water content (respectively) individually much more than its counterpart. Thus, the analysis of results concerning water capillarity confirms the ANN model’s effective fit to the prediction of this parameter.
The second and final physical property of mortars with direct incorporation of PCM at issue in this work is water absorption by immersion. Beginning once again with the interpretation of the assessment metrics of the implemented models, detailed in
Table 7, one can immediately infer that the overall values for R
2 and RMSE are lower in comparison with the metrics obtained in the study of other mortar parameters (both mechanical and physical). This is likely to be related to a much higher dispersion in the results of the experimentally tested samples, as a consequence of the interaction between the directly incorporated PCM and the mortar aggregates. As a matter of fact, the direct incorporation of PCM into the mortar has the tendency to result in the aggregates being enfolded by the PCM paraffin in several layers, which are randomly distributed throughout the mortar. In turn, this comprises a major factor contributing to a high variation in results concerning the absorption of water by immersion, ultimately resulting in a hindrance to the accurate estimation of this aspect in the studied mortars. Naturally, as the experimental campaign proceeds towards gathering additional data, this hindrance is expected to be gradually mitigated over time.
Notwithstanding this fact, the current metrics-based assessment of models seems to indicate that the “noFibers” database variation originated two reasonably consistent models for the prediction of water absorption by immersion, namely the RF (0.70 R
2 and 3.72 RMSE) and the ANN (0.61 R
2 and 4.19 RMSE). It is evident that a reasonable performance can also be found in the database variations with fewer variables, possibly as a result of a higher difficulty for the models to understand the relationships between variables and identify patterns, which is also a consequence of the higher dispersion of results. However, taking into account that the gains in predictive performance with the reduction of variables are not conclusive, the “noFibres” database variation was selected as the one with the most available information for the purpose of comparative analysis as well as potential for additional insight. While the SVM model appears to initially be capable of competing with the RF and the ANN model when observing the subsequent REC curves shown in
Figure 11, the two latter models quickly overcome the former, outperforming it in terms of AUC.
The aforementioned higher dispersion of results is blatant in the predicted vs. actual value plots depicted in
Figure 12, once again showing the clear difficulty experienced by both models in estimating values in the mid-to-high ranges, characterized by a lower amount of data in comparison to the lower ranges. Nonetheless, it is possible to observe that the RF model predictions seem to be closer to the actual values, as conveyed by their closer proximity to the 45° line in the figure, substantiating the higher values achieved by this model in terms of the previous metrics.
In what concerns the relative importance of variables assumed by each model, the RF model strongly points out the PCM and water content as the paramount variables, adding up to being responsible for nearly 75% of the total importance of all variables in this model. This is consistent with the previously described behavior of the PCM paraffin enfolding the mortar aggregates, partially isolating the aggregates randomly throughout the mortar body, and subsequently resulting in a strong influence over its rate of water absorption by immersion. In opposition to this, the typical pattern of the ANN model characterized by the tendency to distribute the weights of relative importances slightly more evenly seems to slightly hamper its predictive ability in this case, ultimately supporting the RF model’s metrics-based indication of a better fit in the context of water absorption by immersion behavior prediction (
Figure 13).