Machine-Learning-Aided Prediction of Flexural Strength and ASR Expansion for Waste Glass Cementitious Composite

: Waste glass (WG) is unsustainable due to its nonbiodegradable property. However, its main ingredient is silicon dioxide, which can be utilised as a supplementary cementitious material. Before reusing WG, the ﬂexural strength (FS) and alkali–silica reaction (ASR) expansion of WG concrete are two essential properties that must be investigated. This study produced mortar containing activated glass powder using mechanical, chemical, and mechanical–chemical (combined) approaches. The results showed that mortar containing 30% WG powder using the combined method was optimal for improving the FS and mitigating the ASR expansion. The microstructure analysis was implemented to explore the activation effect on the glass powder and mortar. Moreover, a random forest (RF) model was proposed with hyperparameters tuned by beetle antennae search (BAS), aiming at predicting FS and ASR expansion precisely. A large database was established from the experimental results based on 549 samples prepared for the FS test and 183 samples produced for the expansion test. The BAS-RF model presented high correlation coefﬁcients for both FS (0.9545) and ASR (0.9416) data sets, showing much higher accuracy than multiple linear regression and logistic regression. Finally, a sensitivity analysis was conducted to rank the variables based on importance. Apart from the curing time, the particle granularity and content of WG were demonstrated to be the most sensitive variable for FS and expansion, respectively.


Introduction
Sustainable construction involves the creation of a resource-efficient and eco-friendly environment. Construction waste reduction and waste recycling are two essential principles of this practice. Waste glass (WG) is a common urban solid waste, which is hazardous to the environment because of its inert and biodegradable properties. However, WG treatment is expansive, and its reuse is still at the initial stage. Statistically, 11,530 kilotons of WG were generated in the United States in 2010, with only 30% recycled, and approximately 300 tonnes per day were produced in Hong Kong in 2016, with only 10% recycled [1,2]. Therefore, developing a feasible method to cope with the increasing WG content is urgent and essential.
Due to its abundance, low cost, and high stiffness, WG has been used in the construction industry. Previous research studies have shown that WG is a good replacement for sand or cement, serving as a supplementary cementitious material. This is mainly attributed to WG's high silica content (over 70%), which can react with calcium hydroxide (CH) to generate calcium silica hydrates (C-S-H). Furthermore, glass can alleviate the corrosion of steel bars within reinforced concrete. Therefore, reusing WG in concrete manufacturing is promising in terms of increasing its recycling degree, reducing carbon dioxide emissions caused by cement production, and mitigating the shortage of high-quality sand.
However, the strength property and expansion arising from alkali-silica reaction (ASR) of concrete containing WG are two key points that must be investigated before applying WG. This is mainly due to WG's chemical and physical properties. Generally, discarded glass cullet from bottles, windows, decorations, etc. has poor geometry, that is, an angular, flat, or elongated shape, leading to poor workability and diminished strength grade [3,4]. In addition, the high silica content of WG will inevitably produce ASR expansion and probably result in structural instability, which is the chief reason for the failed early application of waste glass concrete [5,6]. Therefore, the investigation on ASR expansion and reliable expansion prediction is essential.
To overcome the reduced strength and excessive ASR expansion, researchers made massive efforts to investigate concrete containing WG and proposed many practicable approaches. These include in-depth studies of glass granularity and admixtures. When the glass cullet was milled to less than 300 µm, both the pozzolanic level of waste glass powder (WGP) and the strength of concrete significantly increased [7][8][9]. Several studies even pointed out the improved pozzolanic ability (compared with fly ash) from ground WG with particle sizes below 100 µm [5,10]. Moreover, the smaller particle sizes of waste glass can mitigate ASR expansion, as demonstrated by many reports [11,12]. Corinaldesi et al. [13] observed improved unconfined compression strength and flexural strength (FS) without obvious macroscopic expansion. Petrella et al. [14] conducted the mechanical test on mortar containing waste porous glass and found the good adhesion of the glass to the cement paste through microstructural observations. Furthermore, the addition of admixtures, such as silica fume, fly ash, lithium compounds, and metakaolin was useful in preventing deleterious ASR expansion [15,16]. Adhikary et al. [17] also illustrated that expanded glass produced by mixing waste glass and expansive agents was effective in prohibiting the ASR expansion.
Previous studies concentrated on the aforementioned aspects in solving the strength and expansion problems, but few reports mentioned the chemical activation of WGP. The possible explanations are the complex chemical reactions and the various types of chemical agents. Additionally, the traditional method of analysis is inadequate in seeking optimal mixture proportions and controlling the error when the variables are multiple. Although multiple linear regression (MLR) and logistic regression (LR) have been proposed, an accurate simulation is still challenging mainly due to the curse of dimensionality and co-linearity susceptibility [18][19][20]. Therefore, this study aimed to activate WGP using chemical activators and an accurate machine learning (ML) model to predict the FS and ASR expansion of WGP mortars.
Recently, the performance of cementitious materials has been broadly studied using artificial intelligence (AI) techniques. ML models exhibit an excellent generalisation ability and perfect prediction accuracy in coping with nonlinear tasks [21,22]. The artificial neural network (ANN), support vector regression (SVR), and random forest (RF) are three widely used ML models that are effective for employing various construction materials [23,24]. The first two models are standalone, while the latter (RF) is an ensemble model that exhibits less probability of generating overfitting problems. RF also has better tolerance than standalone ML models when outliers or noise exist in the data set [25,26]. Furthermore, the blessing of dimensionality and the central limit theorem partly contribute to the success of the ensemble models that outperform the individual models. Therefore, the performance of the RF model is expectedly higher than individual models in solving high dimensionality problems [27,28]. As a result, the RF model was chosen for data prediction.
The accuracy of the RF model depends mainly on two hyperparameters: the total number of trees (numTree) and the minimum sample number of a leaf node (minNumLea f ) [29]. The numTree has a remarkable effect on the model's ability, and the minNumLea f controls the splitting condition and determines the relationship between various decision trees. Nevertheless, the process of determining the optimal hyperparameters is time consuming and thus inadequate through the traditional selection method, and if time was not the issue, they would work as well [25]. Recently, optimisation algorithms are preferable since they can automatically seek optimal hyperparameters via iterations. Commonly used metaheuristic algorithms include particle swarm optimisation (PSO) [30], genetic algorithm (GA) [31], and firefly algorithm (FA) [32], which are all computationally intensive. The beetle antennae search (BAS) algorithm uses only one beetle to search the optimal hyperparameters instead of the beetle swarm so that it converges fast and significantly reduces calculation time [33,34]. However, BAS is easily trapped into a local optimum as indicated by previous researchers [35,36]. To reduce the possibility of the local optimum, the BAS algorithm with a changeable step size was proposed for the RF hyperparameter adjustment in this study.
This study investigated the impact of the mechanical, chemical, and mechanicalchemical (combined) activation on the FS and ASR expansion of mortars containing WGP. The sand was replaced with WGP with mean particle sizes of 75 µm and 300 µm. The variables were the replacement ratio, the WGP size, and the content and category of chemical activators (sodium sulphate, calcium hydroxide, and sodium hydroxide). A microscopic test was conducted to investigate the activation effect. In total, 549 samples were prepared for the FS test, and 183 samples were produced for the expansion test. The experimental results are presented as the data set to train the highly accurate BAS-RF model. The baseline models (LR and MLR) were also adopted to compare with BAS-RF. In addition, the sensitivity analysis (SA) was performed on the FS and ASR expansion data sets for a ranking of the input variables.

Materials and Methods
The mean particle sizes of WGP derived from waste glass vessels were 75 µm and 300 µm. Before being crushed and ground, the vessels were cleaned to eliminate labels and taps and then air-dried. Then, the milled WGP was properly reserved in a sealed and dry container. Figure 1 shows the 75 µm and 300 µm WGP, and Table 1 presents the chemical ingredients. The high silicon dioxide content (approximately 74%) demonstrates the excellent potential of WGP as pozzolans [37]. The ordinary Portland cement (with a strength grade of 42.5) was utilised as an essential binder composition, whose physical and chemical indexes are presented in Table 2. Natural sand was used as the filler with over 96% silicon dioxide, and ASTM C778 [38] was used for its gradation before use. performance of the RF model is expectedly higher than individual models in solving high dimensionality problems [27,28]. As a result, the RF model was chosen for data prediction. The accuracy of the RF model depends mainly on two hyperparameters: the total number of trees ( ) and the minimum sample number of a leaf node ( ) [29]. The has a remarkable effect on the model's ability, and the controls the splitting condition and determines the relationship between various decision trees. Nevertheless, the process of determining the optimal hyperparameters is time consuming and thus inadequate through the traditional selection method, and if time was not the issue, they would work as well [25]. Recently, optimisation algorithms are preferable since they can automatically seek optimal hyperparameters via iterations. Commonly used metaheuristic algorithms include particle swarm optimisation (PSO) [30], genetic algorithm (GA) [31], and firefly algorithm (FA) [32], which are all computationally intensive. The beetle antennae search (BAS) algorithm uses only one beetle to search the optimal hyperparameters instead of the beetle swarm so that it converges fast and significantly reduces calculation time [33,34]. However, BAS is easily trapped into a local optimum as indicated by previous researchers [35,36]. To reduce the possibility of the local optimum, the BAS algorithm with a changeable step size was proposed for the RF hyperparameter adjustment in this study.
This study investigated the impact of the mechanical, chemical, and mechanicalchemical (combined) activation on the FS and ASR expansion of mortars containing WGP. The sand was replaced with WGP with mean particle sizes of 75 μm and 300 μm. The variables were the replacement ratio, the WGP size, and the content and category of chemical activators (sodium sulphate, calcium hydroxide, and sodium hydroxide). A microscopic test was conducted to investigate the activation effect. In total, 549 samples were prepared for the FS test, and 183 samples were produced for the expansion test. The experimental results are presented as the data set to train the highly accurate BAS-RF model. The baseline models (LR and MLR) were also adopted to compare with BAS-RF. In addition, the sensitivity analysis (SA) was performed on the FS and ASR expansion data sets for a ranking of the input variables.

Materials and Methods
The mean particle sizes of WGP derived from waste glass vessels were 75 μm and 300 μm. Before being crushed and ground, the vessels were cleaned to eliminate labels and taps and then air-dried. Then, the milled WGP was properly reserved in a sealed and dry container. Figure 1 shows the 75 μm and 300 μm WGP, and Table 1 presents the chemical ingredients. The high silicon dioxide content (approximately 74%) demonstrates the excellent potential of WGP as pozzolans [37]. The ordinary Portland cement (with a strength grade of 42.5) was utilised as an essential binder composition, whose physical and chemical indexes are presented in Table 2. Natural sand was used as the filler with over 96% silicon dioxide, and ASTM C778 [38] was used for its gradation before use.   To investigate the activation effects, five variables were considered: the particle size and substitute proportion of WGP, the category and content of the chemical activators, and the curing times. These five variables were set as the features for the ML modelling. Natural sand was replaced with WGP (75 µm and 300 µm) on three levels (10 wt%, 20 wt%, and 30 wt%). The chemical additive comprises a salt activator (sodium sulphate) and alkaline activators (calcium hydroxide and sodium hydroxide). Additionally, the water-to-cement (W/C) and aggregate-to-cement ratios (A/C) were maintained at 0.47 and 2.25, respectively. The following is the detailed activation methodology:

•
Mechanical activation: By employing a ball mill, a glass cullet was ground into WGP with two particle sizes: 300 µm and 75 µm. Compared to coarse WGP (300 µm), the finer WGP was supposed to acquire better pozzolanic characteristics. • Chemical activation: Sodium hydroxide, calcium hydroxide, and sodium sulphate (anhydrous) were the three agents used to activate WGP and the WGP-cement system. They are all analytically pure (AR) and commercially available. The alkaline activator is composed of 50 wt% sodium hydroxide and 50 wt% calcium hydroxide, and sodium sulphate served as a salt activator. Both activators were jointly applied at 2 wt%, 4 wt%, and 6 wt% of cement. Before their application, sodium sulphate and sodium hydroxide were dissolved in water, while calcium hydroxide was directly mixed with WGP and sand. The mixing time was enough for the thorough dispersion of raw materials as per ASTM C305 [39]. • Combined activation: The mechanical-chemical activation integrated the abovementioned mechanical and chemical approaches, where the 300 µm WG was ground to 75 µm WGP, and the chemical activators were utilised in the meantime.
In total, 60 mixes with one control group are summarised in Table 3, where the WG, sodium sulphate, and alkali activators are expressed as G, S, and H, respectively. The particle sizes of WGP are represented as the numbers 75 and 300 without the unit. The numbers 10, 20, and 30 represent the replacement ratios of WGP, i.e., 10%, 20%, and 30%, respectively. The numbers 2, 4, 6 represent the chemical activators' ratios of 2%, 4%, and 6%, respectively.

Mechanical Test
The FSs at 7, 14, and 28 curing times were tested for each mix of mortar containing WGP. Three parallel specimens were cast and the mean values were calculated as the ultimate results after eliminating the outliers. The FS samples were prepared in a prismatic shape (40 × 40 × 160 mm), complying with ASTM C348 [40]. After casting, the mortar samples with moulds were transferred to a wet closet at 20 ± 1 • C temperature and 95 ± 5% humidity for 24 h. Then, the samples were separated from the moulds and stored in saturated lime water until the curing time ended. A three-point bending apparatus was fabricated at a steady loading level of 0.03 MPa/s and the flexural strength f s (MPa) was formulated in Equation (1). The results of mechanical test are summarised in the Appendix A.
where F is the fracture surcharge (N); L, h, and b are the length, height, and width of the sample (mm), respectively.

Alkali-Silica Reaction
The ASR experiment was conducted per ASTM C1260 [41] to estimate the risk of detrimental expansion. For each mix, the average longitudinal change of three identical bar samples (25 × 25 × 280 mm) was determined after 2, 4, 7, 10, and 14 days. The effective length of the specimen was 260 mm, which was the length between two steel stud gauges, as shown in Figure 2. The prepared samples are shown in Figure 3. After casting, they were stored in the moisture closet for 24 h and were subsequently immersed in water (80 • C) for another 24 h. The initial lengths L 0 of the samples (accurate to 0.002 mm) were measured through a length comparator, and then specimens were transferred to a 1 N 80 • C NaOH solution. The longitudinal length of the sample at day x is symbolised by L x , and the expansion proportion e x can be determined by Equation (2). The deleterious expansion is negligible if the expansion ratio at 14 days is less than 0.1%. The results of ASR test are summarised in the Appendix B.

Mechanical Test
The FSs at 7, 14, and 28 curing times were tested for each mix of mortar containing WGP. Three parallel specimens were cast and the mean values were calculated as the ultimate results after eliminating the outliers. The FS samples were prepared in a prismatic shape (40 × 40 × 160 mm), complying with ASTM C348 [40]. After casting, the mortar samples with moulds were transferred to a wet closet at 20 ± 1 °C temperature and 95 ± 5% humidity for 24 h. Then, the samples were separated from the moulds and stored in saturated lime water until the curing time ended. A three-point bending apparatus was fabricated at a steady loading level of 0.03 MPa/s and the flexural strength (MPa) was formulated in Equation (1). The results of mechanical test are summarised in the Appendix A.
where is the fracture surcharge (N); , ℎ, and are the length, height, and width of the sample (mm), respectively.

Alkali-Silica Reaction
The ASR experiment was conducted per ASTM C1260 [41] to estimate the risk of detrimental expansion. For each mix, the average longitudinal change of three identical bar samples (25 × 25 × 280 mm) was determined after 2, 4, 7, 10, and 14 days. The effective length of the specimen was 260 mm, which was the length between two steel stud gauges, as shown in Figure 2. The prepared samples are shown in Figure 3. After casting, they were stored in the moisture closet for 24 h and were subsequently immersed in water (80 °C) for another 24 h. The initial lengths of the samples (accurate to 0.002 mm) were measured through a length comparator, and then specimens were transferred to a 1 N 80 °C NaOH solution. The longitudinal length of the sample at day is symbolised by , and the expansion proportion can be determined by Equation (2). The deleterious expansion is negligible if the expansion ratio at 14 days is less than 0.1%. The results of ASR test are summarised in the Appendix B.

Scanning Electron Microscopy
To investigate the microscopic influence of chemical activators on WGP and the bonding ability of WGP with hydration products, scanning electron microscopy (SEM) was employed as a supplementary test. The chemically activated WGP samples were fabricated by immersing them into the water that dissolved the alkali and sodium sulphate for 7 days. The mortar specimens for the SEM experiment were derived from the sample fragments after conducting the FS test.

Machine Learning Models
The ML models utilised in this study are RF, LR, and MLR. In this section, the RF model is elaborated, followed by the baseline models (LR, MLR). The BAS algorithm and 10-fold cross-validation (CV) are subsequently explained with the ability to optimise the hyperparameters of RF and increasing the prediction performance of ML, respectively. The model evaluation index and variable sensitivity analysis are finally demonstrated.

Random Forest Model
To acquire more reliable output, random forest generates hundreds of decision trees (RTs) with their own regression functions. RT is a nonparametric model which consists of a root node, decision nodes, and leaf nodes. The training set is represented as shown in Equation (3) where and are the input vector with features (X = { , , … , }) and the output scalar, respectively. In this study, represents the FS and ASR datasets. The and indicate the input vector with five features and the outputs (FS, ASR), respectively. Each sample in the training set will be evaluated by the decision nodes using a split function. Then, the sample will be passed to different branches until the leaf node is reached. The tree is stopped when the minimum sample number of a leaf node is achieved.
During the training process of each RT, samples from the training set are randomly sampled without replacement (1/n possibility for each sample to be selected at each time). This sample collecting process is called 'bootstrap', and the corresponding sample set is represented as . Subsequently, the samples of are split from the root node to the leaf node with each node applying its own split function. At the end of this training process, the prediction function (X, ) is established over . As mentioned before, the RF is an extension of RT constructing numerous RTs. A 'bagging' method is applied to integrate all the results derived from these RTs and utilises the average value as the final output [42]. The bagging technique is proposed by Breiman [43], which effectively reduces the prediction variance to improve prediction performance.

Scanning Electron Microscopy
To investigate the microscopic influence of chemical activators on WGP and the bonding ability of WGP with hydration products, scanning electron microscopy (SEM) was employed as a supplementary test. The chemically activated WGP samples were fabricated by immersing them into the water that dissolved the alkali and sodium sulphate for 7 days. The mortar specimens for the SEM experiment were derived from the sample fragments after conducting the FS test.

Machine Learning Models
The ML models utilised in this study are RF, LR, and MLR. In this section, the RF model is elaborated, followed by the baseline models (LR, MLR). The BAS algorithm and 10-fold cross-validation (CV) are subsequently explained with the ability to optimise the hyperparameters of RF and increasing the prediction performance of ML, respectively. The model evaluation index and variable sensitivity analysis are finally demonstrated.

Random Forest Model
To acquire more reliable output, random forest generates hundreds of decision trees (RTs) with their own regression functions. RT is a nonparametric model which consists of a root node, decision nodes, and leaf nodes. The training set is represented as R n shown in Equation (3) where X and Y are the input vector with m features (X = {x 1 , x 2 , . . . , x m }) and the output scalar, respectively. In this study, R n represents the FS and ASR datasets. The X and Y indicate the input vector with five features and the outputs (FS, ASR), respectively. Each sample in the training set will be evaluated by the decision nodes using a split function. Then, the sample will be passed to different branches until the leaf node is reached. The tree is stopped when the minimum sample number of a leaf node is achieved.
During the training process of each RT, n samples from the training set R n are randomly sampled without replacement (1/n possibility for each sample to be selected at each time). This sample collecting process is called 'bootstrap', and the corresponding sample set is represented as R θ n . Subsequently, the samples of R θ n are split from the root node to the leaf node with each node applying its own split function. At the end of this training process, the prediction functionâ(X, R θ n ) is established over R θ n . As mentioned before, the RF is an extension of RT constructing numerous RTs. A 'bagging' method is applied to integrate all the results derived from these RTs and utilises the average value as the final output [42]. The bagging technique is proposed by Breiman [43], which effectively reduces the prediction variance to improve prediction performance. The RF algorithm is described in Figure 4 in which RF comprises k de-correlated RTs so that k prediction functionsâ(X, R θ k n ), where k = 1, 2, . . . , k, will be constructed. The symbol θ k represents independently distributed random vector to represent different RTs. Finally, the RF produces k outputs Ŷ 1 ,Ŷ 2 , . . . ,Ŷ k corresponding to each RT. The ultimate prediction Y is achieved by averaging these outputs as shown in Equation (4).
Appl. Sci. 2021, 11, x FOR PEER REVIEW 8 of 24 The RF algorithm is described in Figure 4 in which RF comprises de-correlated RTs so that prediction functions (X, ), where = 1,2, … , , will be constructed. The symbol represents independently distributed random vector to represent different RTs. Finally, the RF produces outputs { , , … , } corresponding to each RT. The ultimate prediction is achieved by averaging these outputs as shown in Equation (4).

Baseline Models
Lasso regression is a type of linear regression method using shrinkage theory that can address high levels of multicollinearity problems [44,45]. However, LR and MLR models are more popular than Lasso regression in solving regression problems in the construction field [46,47]. Therefore, these two baseline models (LR and MLR) are proposed to compare with the BAS-RF model and assess their prediction discrepancy.

Beetle Antennae Search (BAS)
To automatically seek the optimal hyperparameters of the RF model, the BAS algorithm is proposed, which is derived from the foraging behaviour of the longhorn beetle [48,49]. The beetle can continuously change the moving orientation according to its two antennae that can perceive the odour concentration. In the BAS algorithm, we use and to symbolise the location of the left and right antennae. Their placements at the time instant can be given by where is a random vector and can be written in Equation (6).
where ( , 1) is a random function having dimensions. The following equations show the location vector of the beetle, the updated antennae length, and the step size.

Baseline Models
Lasso regression is a type of linear regression method using shrinkage theory that can address high levels of multicollinearity problems [44,45]. However, LR and MLR models are more popular than Lasso regression in solving regression problems in the construction field [46,47]. Therefore, these two baseline models (LR and MLR) are proposed to compare with the BAS-RF model and assess their prediction discrepancy.

Beetle Antennae Search (BAS)
To automatically seek the optimal hyperparameters of the RF model, the BAS algorithm is proposed, which is derived from the foraging behaviour of the longhorn beetle [48,49]. The beetle can continuously change the moving orientation according to its two antennae that can perceive the odour concentration. In the BAS algorithm, we use x l and x r to symbolise the location of the left and right antennae. Their placements at the i th time instant can be given by where b is a random vector and can be written in Equation (6).
where rand(k, 1) is a random function having k dimensions. The following equations show the location vector of the beetle, the updated antennae length, and the step size.
where δ means the step length; f (x) represents fitness function; sign() is a sign function that extracts the sign of a real number.

The 10-Fold Cross-Validation Process
Two hyperparameters of the RF model need to be adjusted, which are numTree and minNumLea f . To eliminate the overfitting problems caused by a limited database, a 10-fold cross-validation method was utilised, which is frequently used in predicting cementitious materials performance [47,50]. Meanwhile, all the data can be used for training and testing, giving a fair and comprehensive evaluation metric, although the database is not large. Therefore, the small dataset limitation is supposed to be eliminated. During the process of CV, the data set is firstly randomly separated into two parts: the test group (30%) and the training group (70%) [51]. Secondly, the training group is divided into 10 folds among which the 9 folds are applied to develop the RF model and the rest is for performance validation [52,53]. During the modelling process, the RF hyperparameters are tuned by the BAS algorithm through 50 iterations. Additionally, the root means square error (RMSE) is obtained when finishing validating. This process repeats 10 times to compare the validation outcome. Ultimately, the trained model with the minimum RSME and optimal hyperparameters was used to predict the FS and ASR expansion in this study. The procedure of RF model training by 10-fold CV and BAS is shown in Figure 5.
= 0.95 + 0.01 (8) = 0.95 (9) where means the step length; ( ) represents fitness function; () is a sign function that extracts the sign of a real number.

The 10-Fold Cross-Validation Process
Two hyperparameters of the RF model need to be adjusted, which are and . To eliminate the overfitting problems caused by a limited database, a 10fold cross-validation method was utilised, which is frequently used in predicting cementitious materials performance [47,50]. Meanwhile, all the data can be used for training and testing, giving a fair and comprehensive evaluation metric, although the database is not large. Therefore, the small dataset limitation is supposed to be eliminated. During the process of CV, the data set is firstly randomly separated into two parts: the test group (30%) and the training group (70%) [51]. Secondly, the training group is divided into 10 folds among which the 9 folds are applied to develop the RF model and the rest is for performance validation [52,53]. During the modelling process, the RF hyperparameters are tuned by the BAS algorithm through 50 iterations. Additionally, the root means square error (RMSE) is obtained when finishing validating. This process repeats 10 times to compare the validation outcome. Ultimately, the trained model with the minimum RSME and optimal hyperparameters was used to predict the FS and ASR expansion in this study. The procedure of RF model training by 10-fold CV and BAS is shown in Figure 5.

Performance Evaluation
The RMSE and correlation coefficient (R) are two evaluation indexes to estimate the accuracy of various ML models. The definitions are as follows:

Performance Evaluation
The RMSE and correlation coefficient (R) are two evaluation indexes to estimate the accuracy of various ML models. The definitions are as follows: where n represents the quantity of the data specimens; y * i is the predicted output by the ML models; y i is the actual output in the data set; y * and y are the mean values of the predicted and actual values, separately.

Variable Significance Determination
An approach based upon sensitivity analysis (SA) was used to investigate the connection between variables and outcomes. Within the range of input variables, this method can assess the impact on the output by changing the input values [54,55]. The beginning procedure is to design input and output variables, followed by evaluating each input variable with all the other variables constant. The SA comprises global sensitivity analysis (GSA) and local sensitivity analysis (LSA), but LSA cannot be used to search uncertainties. In contrast, GSA can simultaneously assess all input variables and therefore is applied in this study. Finally, importance sequences (0 to 100%) are visualised through bar charts [56]. The following equations show a gradient metric to estimate the resulting change of the output and the relative importance formulation [56].
where ε is the analysed input variable;ŷ ε,j represents the susceptibility reaction indicator for x ε,j ; R ε is the relative importance of the variable.  Figure 6 shows the FS values of mortar samples containing different WGP substituting ratios without adding chemical agents at 7, 14, and 28 days. Lower FS was observed in specimens containing 300 µm WGP, which agrees with previous findings that coarse glass adversely affected mechanical characteristics [1,57]. In contrast, the mortar specimens containing 75 µm WGP exhibited improved FS, compared to both 300 µm WGP samples and the control one. This phenomenon illustrates the positive effect of mechanical activation in increasing FS. The 10% 75 µm WG proportion showed the highest strength at 9.7 MPa (28 days), which was 26.63% higher than that of the control sample. Moreover, the FS of the sample containing 30% 75 µm WGP was 7.6 MPa, which almost reached the same strength level as the ordinary mortar sample. This is mainly attributed to the bonding strength arising from WGP's pozzolanic reaction [11,58,59]. When glass's particle granularity shrank through physical grinding, its stable silica tetrahedron structure would be destroyed, leading to reduced crystallisation and increased reactivity. Meanwhile, the finer WGP had a larger specific surface area and simultaneously compacted the mortar structure through evenly distributed in the pores of the mortar. Therefore, it is effective to increase the FS of WGP mortars by applying a mechanical approach. Figure 7 compares the 28-day FS results of the mortar samples containing 300 µm and 75 µm WGP (10%, 20%, and 30%) under the chemical activation. When the WGP particle size was 300 µm, the FS increase was only observed in samples containing activated WGP (2% Na 2 SO 4 and 2% alkali), compared to mortars with nonactivated WGP. Meanwhile, the FS in all the 300 µm WGP mortars was less than that in the control sample, illustrating the limited chemical activation effect on coarse glass particles. The opposite phenomenon was observed when the WGP size was mechanically ground into 75 µm. Figure 7c shows the highest FS value (around 9 MPa) on the mortar sample with 30% WGP (75 µm) under the treatment of 2% Na 2 SO 4 and 2% alkali. It was both higher than the 30% nonactivated WGP sample (7.60 MPa) and the normal sample (7.66 MPa), indicating the positive efficacy of the mechanical-chemical activation. However, the FS was reduced provided that the 75 µm activated WGP content was 10%, as shown in Figure 7a. This is mainly due to the excessive ASR expansion that exceeded the ASTM threshold (0.1%) shown in following chapters.  Figure 7 compares the 28-day FS results of the mortar samples containing 300 μm and 75 μm WGP (10%, 20%, and 30%) under the chemical activation. When the WGP particle size was 300 μm, the FS increase was only observed in samples containing activated WGP (2% Na2SO4 and 2% alkali), compared to mortars with nonactivated WGP. Meanwhile, the FS in all the 300 μm WGP mortars was less than that in the control sample, illustrating the limited chemical activation effect on coarse glass particles. The opposite phenomenon was observed when the WGP size was mechanically ground into 75 μm. Figure 7c shows the highest FS value (around 9 MPa) on the mortar sample with 30% WGP (75 μm) under the treatment of 2% Na2SO4 and 2% alkali. It was both higher than the 30% nonactivated WGP sample (7.60 MPa) and the normal sample (7.66 MPa), indicating the positive efficacy of the mechanical-chemical activation. However, the FS was reduced provided that the 75 μm activated WGP content was 10%, as shown in Figure 7a. This is mainly due to the excessive ASR expansion that exceeded the ASTM threshold (0.1%) shown in following chapters.

Results and Discussion
The Na2SO4 and alkali effect on the WGP-cement system is analogous to that on fly ash-cement system considering the slow pozzolanic reaction and similar chemical constituents for WGP and fly ash. According to the previous findings, the Na2SO4 reacts with Ca(OH)2 to generate gypsum and NaOH [60,61]. The gypsum further reacts with the aluminate in cement to generate ettringite (AFt), and NaOH participates in the reaction with Ca(OH)2 and WGP (or fly ash) to produce the additional gels [62,63]. This results in more hydration products compacting the WGP mortar structure and improving the flexural strength. In addition, the Alite (C3S) hydration has been verified to be accelerated under the high concentration of dissolved sulphate ions [60]. Therefore, more calcium silicate hydrates and calcium hydroxide will be generated to further increase mechanical strength.
Regarding the alkali (NaOH and Ca(OH)2) effect, it has been verified to effectively depolymerise WGP structure by destroying the chemical bond between the oxygen atom and silicon [64,65]. Therefore, the WGP stable Silica tetrahedron units can be broken to accelerate the silica dissolution and WGP pozzolanic reaction, as illustrated by many researchers [66,67]. However, the results in Figure 7 also indicated the negative effect of both Na2SO4 and alkali when their contents were over 2%. This was in agreement with the previous investigations that excessive Na2SO4 and alkali would lead to numerous AFt generation and heterogeneous gel production, respectively [64,68]. Therefore, the loose structure can be ultimately formed diminishing the flexural strength. Moreover, the excessive alkali content also increases the risk of detrimental ASR expansion. From the FS test results, the optimal content of chemical agents is 2% Na2SO4 and 2% alkali. The Na 2 SO 4 and alkali effect on the WGP-cement system is analogous to that on fly ash-cement system considering the slow pozzolanic reaction and similar chemical constituents for WGP and fly ash. According to the previous findings, the Na 2 SO 4 reacts with Ca(OH) 2 to generate gypsum and NaOH [60,61]. The gypsum further reacts with the aluminate in cement to generate ettringite (AFt), and NaOH participates in the reaction with Ca(OH) 2 and WGP (or fly ash) to produce the additional gels [62,63]. This results in more hydration products compacting the WGP mortar structure and improving the flexural strength. In addition, the Alite (C 3 S) hydration has been verified to be accelerated under the high concentration of dissolved sulphate ions [60]. Therefore, more calcium silicate hydrates and calcium hydroxide will be generated to further increase mechanical strength.
Regarding the alkali (NaOH and Ca(OH) 2 ) effect, it has been verified to effectively depolymerise WGP structure by destroying the chemical bond between the oxygen atom and silicon [64,65]. Therefore, the WGP stable Silica tetrahedron units can be broken to accelerate the silica dissolution and WGP pozzolanic reaction, as illustrated by many researchers [66,67]. However, the results in Figure 7 also indicated the negative effect of both Na 2 SO 4 and alkali when their contents were over 2%. This was in agreement with the previous investigations that excessive Na 2 SO 4 and alkali would lead to numerous AFt generation and heterogeneous gel production, respectively [64,68]. Therefore, the loose structure can be ultimately formed diminishing the flexural strength. Moreover, the excessive alkali content also increases the risk of detrimental ASR expansion. From the FS test results, the optimal content of chemical agents is 2% Na 2 SO 4 and 2% alkali.   Figure 8 shows the increase in the ASR expansion with curing time. The expansion kept increasing with the rise in the WGP ratio. The 14-day ASR expansion of all mortars containing 300 µm WGP was larger than the expansion of the control specimens. This demonstrates the negative effect of coarse glass powder, which is consistent with the previous investigations [69,70]. In contrast, the mortar sample containing 10% 75 µm WGP showed the lowest expansion value (0.0192% at 14 days), which was much less than the control sample and the ASTM threshold. This resulted from the pozzolanic reaction of WGP, which is prone to generating gels with nonswelling properties [71,72]. However, the excessive ASR expansion was obvious when the WGP (75 µm) ratio was increased to 30%. This phenomenon disagreed with the previous finding that fine glass particle was harmless to the ASR expansion. Du et al. [69] explained that this may be attributed to the microcracking of the glass powder during the grinding process. Although high WGP content can cause detrimental expansion, mechanical grinding's efficacy in mitigating ASR expansion has been verified.  Figure 8 shows the increase in the ASR expansion with curing time. The expansion kept increasing with the rise in the WGP ratio. The 14-day ASR expansion of all mortars containing 300 μm WGP was larger than the expansion of the control specimens. This demonstrates the negative effect of coarse glass powder, which is consistent with the previous investigations [69,70]. In contrast, the mortar sample containing 10% 75 μm WGP showed the lowest expansion value (0.0192% at 14 days), which was much less than the control sample and the ASTM threshold. This resulted from the pozzolanic reaction of WGP, which is prone to generating gels with nonswelling properties [71,72]. However, the excessive ASR expansion was obvious when the WGP (75 μm) ratio was increased to 30%. This phenomenon disagreed with the previous finding that fine glass particle was harmless to the ASR expansion. Du et al. [69] explained that this may be attributed to the microcracking of the glass powder during the grinding process. Although high WGP content can cause detrimental expansion, mechanical grinding's efficacy in mitigating ASR expansion has been verified.  Figure 9 depicts the impact of chemical activation on 14-day ASR results of samples with different WGP particle sizes and contents. When the WGP ratios were 10% and 20%, the chemical activator showed adverse effects because a larger expansion appeared on chemically excited samples. However, this negative effect turned out to be positive when the WGP ratio was increased to 30%, as shown in Figure 9c. The ASR expansion was 0.0654% for 30% WGP samples activated by 2% Na SO and 2% alkali, which was 54% smaller than that of specimen 75G30. This might be because the alkaline activator increased the concentration of calcium ions, thus generating fewer swelling gels [66,73]. Regarding the chemical activator's content, the relationship between it and ASR expansion was not obvious, and therefore, ML models were proposed, as discussed in the next section. Generally, chemical activation was feasible in mitigating the detrimental ASR expansion only for mortars with high WGP content. Specifically, the mitigation effect of chemical activator on ASR expansion mainly depended on the WGP content other than the activator's content. Combined with the FS results, mortars containing 30% 75 μm WGP activated by 2% Na SO and 2% alkali can achieve improved strength and satisfactory potential for ASR expansion.  Figure 9 depicts the impact of chemical activation on 14-day ASR results of samples with different WGP particle sizes and contents. When the WGP ratios were 10% and 20%, the chemical activator showed adverse effects because a larger expansion appeared on chemically excited samples. However, this negative effect turned out to be positive when the WGP ratio was increased to 30%, as shown in Figure 9c. The ASR expansion was 0.0654% for 30% WGP samples activated by 2% Na 2 SO 4 and 2% alkali, which was 54% smaller than that of specimen 75G30. This might be because the alkaline activator increased the concentration of calcium ions, thus generating fewer swelling gels [66,73]. Regarding the chemical activator's content, the relationship between it and ASR expansion was not obvious, and therefore, ML models were proposed, as discussed in the next section. Generally, chemical activation was feasible in mitigating the detrimental ASR expansion only for mortars with high WGP content. Specifically, the mitigation effect of chemical activator on ASR expansion mainly depended on the WGP content other than the activator's content. Combined with the FS results, mortars containing 30% 75 µm WGP activated by 2% Na 2 SO 4 and 2% alkali can achieve improved strength and satisfactory potential for ASR expansion.   Figure 10 shows the microstructures of different WGP and mortar samples, exploring the chemical activation's effect. The relatively smooth surface of the nonactivated WGP was in striking contrast with the eroded surface shown in Figure 10c. This phenomenon indicates the susceptibility of the WGP to chemical activators. Moreover, more hydrates and finer fissures were revealed in the WGP mortar sample activated by 2% alkali and 2% Na 2 SO 4 , compared with the nonactivated one. Figure 10d shows that fewer calcium hydroxide (CH) crystals were detected than in the sample without chemical activation, indicating a more intense pozzolanic reaction. Therefore, the chemical agents promote the pozzolanic activation of WGP and simultaneously increase its mechanical property, as described before, if the activators' content is moderate. Figure 10 shows the microstructures of different WGP and mortar samples, exploring the chemical activation's effect. The relatively smooth surface of the nonactivated WGP was in striking contrast with the eroded surface shown in Figure 10c. This phenomenon indicates the susceptibility of the WGP to chemical activators. Moreover, more hydrates and finer fissures were revealed in the WGP mortar sample activated by 2% alkali and 2% Na SO , compared with the nonactivated one. Figure 10d shows that fewer calcium hydroxide (CH) crystals were detected than in the sample without chemical activation, indicating a more intense pozzolanic reaction. Therefore, the chemical agents promote the pozzolanic activation of WGP and simultaneously increase its mechanical property, as described before, if the activators' content is moderate.

Hyperparameter Tuning
By using BAS and 10-fold CV, the optimal hyperparameters can be obtained from the fold having the minimum RMSE. As shown in Figure 11, the first and fourth folds achieved the minimum RMSE in the process of FS and ASR modelling. The RMSE converged during the 50 iterations, illustrating the success of hyperparameter tuning and model establishment. Finally, the optimal hyperparameters chosen for FS and ASR datasets were both = 1 and = 87.

Hyperparameter Tuning
By using BAS and 10-fold CV, the optimal hyperparameters can be obtained from the fold having the minimum RMSE. As shown in Figure 11, the first and fourth folds achieved the minimum RMSE in the process of FS and ASR modelling. The RMSE converged during the 50 iterations, illustrating the success of hyperparameter tuning and model establishment. Finally, the optimal hyperparameters chosen for FS and ASR datasets were both minNumlea f = 1 and numTree = 87.

Performance of BAS-RF for FS and ASR Expansion Modelling
The capability of the RF model with adjusted hyperparameters was verified by assessing the predictive precision of the test group. In Figure 12, the predicted and actual FS and ASR values in the training and test groups are presented. The relationship between the predicted and actual outcomes was estimated by calculating the RMSE and correlation coefficient. Apart from minor RMSE values, high correlation coefficients were observed for both training (FS: 0.9841, ASR: 0.9582) and test sets (FS: 0.9545, ASR: 0.9416). Therefore, the prediction performance of the ML models was accurate and reliable. Furthermore, the RMSE/R values of the training set and test set were close illustrating that no overfitting problem existed.

Performance of BAS-RF for FS and ASR Expansion Modelling
The capability of the RF model with adjusted hyperparameters was verified by assessing the predictive precision of the test group. In Figure 12, the predicted and actual FS and ASR values in the training and test groups are presented. The relationship between the predicted and actual outcomes was estimated by calculating the RMSE and correlation coefficient. Apart from minor RMSE values, high correlation coefficients were observed for both training (FS: 0.9841, ASR: 0.9582) and test sets (FS: 0.9545, ASR: 0.9416). Therefore, the prediction performance of the ML models was accurate and reliable. Furthermore, the RMSE/R values of the training set and test set were close illustrating that no overfitting problem existed.

Comparison of Two Baseline Models with the BAS-RF
The precision of the BAS-RF model, compared with the other two (LR and MLR), is shown in Figure 13 through boxplots and Taylor diagrams. The difference between the predicted and actual sample values is indicated in the boxplot. In Figure 13a, the RF model had a lower median value (the red line) and a more condensed interquartile range (the space between the upper and lower blue lines). This phenomenon indicated a lower prediction error than the other two baseline models. In addition, the upper limit of the RF was lower than the LR and MLR, although one outlier was observed, illustrating the high accuracy of the RF model. In Figure 13b, the RF model was also more accurate in predicting ASR values than LR and MLR because of its condensed interquartile range although several outliers appeared. Moreover, three model evaluation indices (standard deviation, RMSE, and R) are integrated into the polar coordinates, as shown in Figure 13c,d. The two RF models were both in the closest position to the "actual" points with the uppermost Rvalue, minimum RMSE, and lowest standard deviation values. Owing to the highest prediction accuracy and lowest error among the three ML models, the BAS-RF model is the most suitable to predict the FS and ASR values of WGP modified mortar.

Comparison of Two Baseline Models with the BAS-RF
The precision of the BAS-RF model, compared with the other two (LR and MLR), is shown in Figure 13 through boxplots and Taylor diagrams. The difference between the predicted and actual sample values is indicated in the boxplot. In Figure 13a, the RF model had a lower median value (the red line) and a more condensed interquartile range (the space between the upper and lower blue lines). This phenomenon indicated a lower prediction error than the other two baseline models. In addition, the upper limit of the RF was lower than the LR and MLR, although one outlier was observed, illustrating the high accuracy of the RF model. In Figure 13b, the RF model was also more accurate in predicting ASR values than LR and MLR because of its condensed interquartile range although several outliers appeared. Moreover, three model evaluation indices (standard deviation, RMSE, and R) are integrated into the polar coordinates, as shown in Figure 13c,d. The two RF models were both in the closest position to the "actual" points with the uppermost R-value, minimum RMSE, and lowest standard deviation values. Owing to the highest prediction accuracy and lowest error among the three ML models, the BAS-RF model is the most suitable to predict the FS and ASR values of WGP modified mortar.

Variable Significance for FS and ASR Expansion
The effects of the variables on the FS and ASR results were quantified as the importance ratio presented in Figure 14. The curing time is the most significant factor for both FS and ASR expansion. The WGP particle size was the second important variable for FS (26%), thus remarkably affecting the mechanical strength of mortars containing WGP. This was followed by the alkali content (18%) and the WGP replacement ratio (12%). Further, Na SO appeared less important than alkali, which agrees with the experimental findings. For the ASR expansion, the WGP replacement ratio became the most significant factor (16%) except for the curing time. This agreed with the experimental results that ASR expansion significantly depended on the WGP replacement ratio other than the activator's content. Moreover, the alkali content was less important than Na SO , which is reasonable since the 1 N NaOH solution was used to immerse the samples. In conclusion, the importance measure provides a quantitative analysis of input variables, indicating that the WGP's particle size and replacement ratio are the most crucial parameters (except curing time) for FS and ASR, respectively.

Variable Significance for FS and ASR Expansion
The effects of the variables on the FS and ASR results were quantified as the importance ratio presented in Figure 14. The curing time is the most significant factor for both FS and ASR expansion. The WGP particle size was the second important variable for FS (26%), thus remarkably affecting the mechanical strength of mortars containing WGP. This was followed by the alkali content (18%) and the WGP replacement ratio (12%). Further, Na 2 SO 4 appeared less important than alkali, which agrees with the experimental findings. For the ASR expansion, the WGP replacement ratio became the most significant factor (16%) except for the curing time. This agreed with the experimental results that ASR expansion significantly depended on the WGP replacement ratio other than the activator's content. Moreover, the alkali content was less important than Na 2 SO 4 , which is reasonable since the 1 N NaOH solution was used to immerse the samples. In conclusion, the importance measure provides a quantitative analysis of input variables, indicating that the WGP's particle size and replacement ratio are the most crucial parameters (except curing time) for FS and ASR, respectively.

Conclusions
In this study, the characteristics of mortars containing WGP were investigated by conducting FS and ASR expansion tests. Based on the experimental data set, three ML models were applied to simulate the FS and expansion values. The prediction results were consistent with the actual figures and the essential conclusions were described as follows: 1. WGP's mechanical grinding significantly improved the FS of WGP mortars. The chemical activation was only effective when the amounts of the chemical agents were appropriate. The combined activation exhibited the most remarkable efficacy; the FS of the sample containing 30% WGP that underwent the combined activation method was 8.97 MPa (18% higher than the sample subjected to mechanical excitation only). 2. Mechanical activation mitigated ASR expansion, and chemical activation was also feasible when the WGP content was 30%. The ASR expansion was 0.0654% (54% lower than the nonactivated one) for the sample that had 30% of WGP with combined activation. This illustrates the effectiveness of combined activation. The SEM results also verified its efficacy which improved the pozzolanic activity of WGP and concurrently compacted the mortars' structure. 3 In the future, the activated WGP incorporated 3D-printed concrete and alkali-activated concrete will be investigated. Regarding the ML research, a more comprehensive database must be constructed to increase the model generalisability. The evolved ML models with improved applicability, accuracy, and efficiency should also be developed. Furthermore, it is crucial to investigate more variables, such as glass types, chemical activator categories, and admixtures.

Conclusions
In this study, the characteristics of mortars containing WGP were investigated by conducting FS and ASR expansion tests. Based on the experimental data set, three ML models were applied to simulate the FS and expansion values. The prediction results were consistent with the actual figures and the essential conclusions were described as follows: 1.
WGP's mechanical grinding significantly improved the FS of WGP mortars. The chemical activation was only effective when the amounts of the chemical agents were appropriate. The combined activation exhibited the most remarkable efficacy; the FS of the sample containing 30% WGP that underwent the combined activation method was 8.97 MPa (18% higher than the sample subjected to mechanical excitation only).

2.
Mechanical activation mitigated ASR expansion, and chemical activation was also feasible when the WGP content was 30%. The ASR expansion was 0.0654% (54% lower than the nonactivated one) for the sample that had 30% of WGP with combined activation. This illustrates the effectiveness of combined activation. The SEM results also verified its efficacy which improved the pozzolanic activity of WGP and concurrently compacted the mortars' structure. 3.
The The variable significance ranking showed that WGP's particle size and replacement ratio were the most essential factors (except the curing time) for FS and ASR expansion, respectively. The results also agreed with the experimental findings.
In the future, the activated WGP incorporated 3D-printed concrete and alkali-activated concrete will be investigated. Regarding the ML research, a more comprehensive database must be constructed to increase the model generalisability. The evolved ML models with improved applicability, accuracy, and efficiency should also be developed. Furthermore, it is crucial to investigate more variables, such as glass types, chemical activator categories, and admixtures.