A New Formulation to Estimate the Elastic Modulus of Recycled Concrete Based on Regression and ANN

: A new formulation to estimate the elastic modulus of concrete containing recycled coarse aggregate is proposed in this work using artiﬁcial neural networks (ANN) and nonlinear regression. Up to six predictors variables were used to training 243 ANN. The models were generated based on results obtained from experimental campaigns. Feedforward neural network and Levenberg– Marquardt back propagation algorithm were used for training the ANN. The best ANN was found with the architecture 6-4-2-1 (input -1st hidden layer -2nd hidden layer -output), attaining a root-mean-square error of 2.4 GPa associated with a coefﬁcient of determination of 0.91. Once the ANN model was established, 46,656 concrete samples were created. These were employed to formulate the model using nonlinear regression. The developed model showed a highly efﬁcient performance to predict the elastic modulus. Lastly, considering the parametric study conducted, the results pointed out that the approach can be applied to predict the concrete elastic modulus and can indicate better mix proportions for concretes containing natural and/or recycled coarse aggregates, enabling its use as a simulation tool in the development of engineering projects focused on durability and sustainability. values presented errors below the RMSE , thus indicating this value as the model error. The results indicate that the model was coherently developed and can predict the elastic modulus of concretes made with natural and recycled aggregates. The results of the analytical model proposed in this work indicate a reduction of 7% in elastic modulus for concretes made with 100% recycled aggregate when the WCR increased 25%. On the other hand, when analyzing concrete produced only with natural aggregate, an increase of 25% in the WCR generates a 13% of reduction on the concrete elastic modulus. This is explained by the porosity of the cement matrix, which increases, as the WCR also does. G ó mez-Sober ó n [76] studied the inﬂuence of saturation degree in concretes made with recycled aggregate and observed that as the RCA increases, porosity also increases, which directly inﬂuences material stiffness. Results obtained with the proposed analytical show a reduction of 22% in the elastic modulus for concretes made with 100% recycled aggregate compared with concrete made only with natural aggregate (see Figure 13). These results are consistent with those obtained by Etxeberria et al. [80], where the authors evaluated the inﬂuence of the RCA in concrete properties and veriﬁed that the stiffness of concretes made with recycled aggregate increases with increments of the CC, which resulted from a more compact matrix. The authors also observed that, for a WCR of 0.5 and a CC of 325 kg/m 3 , concretes made with 100% of RCA had a decrease of 20–25% in the elastic modulus.


Introduction
Civil construction is one of the sectors that most contribute to cities' development and economic growth. However, it also produces a large amount of waste and emits air pollutants. China, for example, generates an average of 2 billion tons of construction waste per year, followed by the United States of America, with an average ranging between 200-300 million tons per year [1,2]. Thus, alternative materials that minimize environmental damage are used, such as construction and demolition wastes (CDW) as aggregate to produce new concretes [3,4]. Nevertheless, its employability in structural components is reduced due to some differences in composition and properties of recycled aggregate (depending on the sorting process carried out at the CDW or on the recycling technique used) that alter the mechanical behavior of concrete and oppose the inherent sustainability benefits [5][6][7].
One of the main differences between recycled aggregate concrete (RAC) and natural aggregate concrete (NAC) is the greater porosity of RAC, which results in greater water absorption and interferes in the mortar quality adhered to the aggregate surface [8]. Mortar directly influences the mechanical properties of concrete, such as its compressive strength and elastic modulus [6,9].
The elastic modulus is considered one of the most important mechanical parameters. It is used to design concrete structures, in the same way that other parameters, such as the compressive strength and the Poisson coefficient, also are. The elastic modulus quantifies One reason why neural networks spread out is the backpropagation training algorithm [44]. This algorithm can be easily implemented based on the downward gradient optimization technique. Due to its simplicity, most developed works in civil engineering employ it [45][46][47][48][49][50].
The first publication about machine learning in civil engineering was made by Adeli and Yeh [51]. The authors presented an artificial neural network of perceptron type One reason why neural networks spread out is the backpropagation training algorithm [44]. This algorithm can be easily implemented based on the downward gradient optimization technique. Due to its simplicity, most developed works in civil engineering employ it [45][46][47][48][49][50].
The first publication about machine learning in civil engineering was made by Adeli and Yeh [51]. The authors presented an artificial neural network of perceptron type (networks without hidden layers) to create a design model for steel beams. Many studies were published after that, most of them focused on pattern recognition and function mapping [52][53][54][55][56][57][58][59].
Moselhi et al. [52] analyzed the artificial neural networks' applicability to model problems related to the real estate market in different house flipping scenarios. Subsequently, in the same line of research, Chao and Skibniewski [53] and Li et al. [54] have shown that an ANN is capable of analyzing and estimating workforce productivity in the construction sector.
The largest number of ANN applications in civil engineering are found in structural engineering and materials science. In structural engineering, neural networks have been employed for the design and the analysis of structural components [55,56], in the study of structural optimization [57,58], dynamic analysis of structures due to earthquakes [45], and risk and damage monitoring [60,61].
Duan et al. [66] employed an artificial neural network to estimate the elastic modulus of concretes made with recycled aggregate. The authors used 324 data collected from different works and demonstrated the ANN's efficiency through comparison between predicted values and results from formulations in design standards.
Awoyera et al. [68] developed models based on machine learning to predict the elastic modulus of concretes made with geopolymer cements, in which they verified that the ANN was more efficient than other techniques.
Yoon et al. [69] used neural networks with a backpropagation training algorithm to obtain the elastic modulus of concretes made with lightweight recycled aggregate. The authors carried out a convergence analysis to select the best topology for the ANN and observed that, for this kind of problem, a maximum of four hidden layers were required. Moreover, they observed that the water/cement ratio, the cement consumption, and the aggregate/cement ratio have great influence on the network's learning.
Even though the use of ANNs to forecast the elastic modulus of concrete has increased, most of them are not used in practice by an engineer from the construction sector because they require prior knowledge about artificial neural networks. Therefore, some propose user-friendly software or even an analytical formulation based on a model generated by an ANN [70,71].
In this context, in this research, the possibility of applying machine learning coupled with nonlinear regression was evaluated in order to obtain a new formulation to estimate the elastic modulus of concretes containing recycled aggregate from construction and demolition waste, with a distinct replacement ratio (0-100%). Nonlinear regression is a form of regression analysis in which observational data are modeled by a function that is a nonlinear combination of the model parameters and depends on one or more independent variables. The data are fitted by a method of successive approximations.
Therefore, the possibility of applying machine learning coupled with nonlinear regression to obtain a new formulation to predict the elastic modulus of concretes containing natural and recycled aggregate was evaluated. Artificial neural networks, which have the best learning power among various machine learning models, were applied. The main novelty of this work refers to the methodology employed in the development of the analytical formulation, which uses artificial neural networks coupled with nonlinear regression. The regression modeling considered a dataset generated with the ANN that efficiently mapped the elastic modulus of concrete from the following predictor variables: cement consumption, water/cement ratio, replacement ratio of recycled coarse aggregate, fine aggregate/cement ratio, total aggregate/cement ratio, and coarse aggregate/cement ratio. The model was developed considering predictor parameters that are easy to obtain and do not require destructive testing.
In the second section, a brief description of artificial neural networks' main characteristics and functionalities is presented. In the third section, the methodology is detailed, and the process of ANN training and its use to generate the database for the development of the analytical formulation with nonlinear regression is described. In the fourth section,

Artificial Neural Networks
Artificial neural networks are inspired by biological neural networks and excel in data mining [72]. Artificial neural networks are parallel and distributed systems, composed of simple processing units called artificial neurons, similar to the structure of the human brain, which allow superior performance to that of conventional models [73].
Haykin [74] reports that an ANN presents five basic elements that resemble the biological ones: I.
An input set x j carrying its respective synaptic weight, w kj ; II.
An adder ∑ to the weighted input signals; III.
An activation function, F(·), to bound the output amplitude; IV.
A bias b k to increase or decrease the net input of the activation function (a horizontal translation of the activation function graph); V.
An output of the network y k (see Figure 1b).
In general, a k neuron output of an ANN can be evaluated by Equation (1): where y k represents the k neuron output, F is the activation function, w kj are the synaptic weights, x j represents the inputs, and b k refers to the bias. Every network must go through a learning process to map and approximate a function, and the most used is the supervised learning process [75]. The name "supervised" holds for the network that is initially controlled by a supervisor, who presents the data to the ANN, which has the objective of finding a relation between input-output pairs.
The supervised training method used in this study was the "feedforward backpropagation" [74], the same training method used by Yoon et al. [69] to develop a model to predict the elastic modulus of concrete made with recycled aggregate. Considering a nonlinear mapping, in this work a multilayer perceptron (MLP) was utilized. The MLP is a supplement of feedforward neural networks that consists of three or more layers, as shown in Figure 2. conclusions are presented in the fifth section.

Artificial Neural Networks
Artificial neural networks are inspired by biological neural networks and data mining [72]. Artificial neural networks are parallel and distributed system posed of simple processing units called artificial neurons, similar to the structur human brain, which allow superior performance to that of conventional models [ Haykin [74] reports that an ANN presents five basic elements that resemble logical ones: I.
An input set xj carrying its respective synaptic weight, wkj; II.
An adder ∑ to the weighted input signals; III.
An activation function, F(•), to bound the output amplitude; IV.
A bias bk to increase or decrease the net input of the activation function (a ho translation of the activation function graph); V.
An output of the network yk (see Figure 1b).
In general, a k neuron output of an ANN can be evaluated by Equation (1): where yk represents the k neuron output, F is the activation function, wkj are the s weights, xj represents the inputs, and bk refers to the bias. Every network must go through a learning process to map and approximate tion, and the most used is the supervised learning process [75]. The name "supe holds for the network that is initially controlled by a supervisor, who presents the the ANN, which has the objective of finding a relation between input-output pair The supervised training method used in this study was the "feedforward ba agation" [74], the same training method used by Yoon et al. [69] to develop a m predict the elastic modulus of concrete made with recycled aggregate. Considerin linear mapping, in this work a multilayer perceptron (MLP) was utilized. The M supplement of feedforward neural networks that consists of three or more la shown in Figure 2.

Model Development
In order to determine a predictive model for the elastic modulus of concretes containing natural and recycled aggregates, the methodology presented in the flowchart of Sustainability 2021, 13, 8561 5 of 21 Figure 3 was followed. A program developed in C++, which has already been validated and employed in other work by the authors [47,48,50], was used to train the ANN.

Model Development
In order to determine a predictive model for the elastic modulus of concretes containing natural and recycled aggregates, the methodology presented in the flowchart of Figure 3 was followed. A program developed in C++, which has already been validated and employed in other work by the authors [47,48,50], was used to train the ANN. As seen in Figure 3, in the first step, a database was created from experimental data available from several references [11,20, regarding the elastic modulus of concrete made with coarse aggregate from construction and demolition waste (CDW). The database is divided into three sets: the training set (60% of the database), the validation set (20% of the database), and the testing set (20% of the database). Training, validation, and testing data were randomly selected, and no changes were made afterwards. In the second step, a statistical analysis was carried out to evaluate the data distribution. The input parameters were selected, and three ANN topologies were proposed for the training process.
In the third step, training and validation of the selected networks were carried out. The network with the best performance was identified in the fourth step. Afterwards, in the fifth step, input parameters were randomly generated following statistical distribution built in the second stage. The network selected in the fourth step is used to establish the associated outputs. Thus, a new and large dataset is generated.
At last, the new dataset is used to formulate an analytical model by multivariable and nonlinear regression in the sixth step. The proposed model is then evaluated in a parametric study and compared with the initial dataset.

Database Definition
The first and main stage for the development of an ANN model is the definition of a consistent database with reliable and representative data. Thus, the database of this study was assembled considering thirty experimental campaigns of the last 20 years [11,20,76- As seen in Figure 3, in the first step, a database was created from experimental data available from several references [11,20, regarding the elastic modulus of concrete made with coarse aggregate from construction and demolition waste (CDW). The database is divided into three sets: the training set (60% of the database), the validation set (20% of the database), and the testing set (20% of the database). Training, validation, and testing data were randomly selected, and no changes were made afterwards. In the second step, a statistical analysis was carried out to evaluate the data distribution. The input parameters were selected, and three ANN topologies were proposed for the training process.
In the third step, training and validation of the selected networks were carried out. The network with the best performance was identified in the fourth step. Afterwards, in the fifth step, input parameters were randomly generated following statistical distribution built in the second stage. The network selected in the fourth step is used to establish the associated outputs. Thus, a new and large dataset is generated.
At last, the new dataset is used to formulate an analytical model by multivariable and nonlinear regression in the sixth step. The proposed model is then evaluated in a parametric study and compared with the initial dataset.

Database Definition
The first and main stage for the development of an ANN model is the definition of a consistent database with reliable and representative data. Thus, the database of this study was assembled considering thirty experimental campaigns of the last 20 years [11,20,. All the concrete samples were produced with Portland cement and cured under normal conditions for 28 days, when the value of the elastic modulus of the concrete was determined. In this study, the effect of fine aggregate replacement by recycled aggregate as well as the influence of supplementary cementitious materials in the concrete mix proportions were not considered. The following characterizes the concrete mixture proportions: cement consumption (CC), water/cement ratio (WCR), replacement of natural aggregate by recycled coarse aggregate ratio (RCA), fine aggregate/cement ratio (FACR), total aggregate/cement ratio (TACR), and coarse aggregate/cement ratio (CACR). FACR, TACR, and CACR were defined in weight ratio, and the RCA was defined as a volume replacement of natural aggregate by recycled aggregate, following Behnood et al. [3].
It is well known that the characteristics of the natural and recycled aggregate are relevant to the value of elastic modulus, as indicated by Butler et al. [96] and Etxeberria et al. [80]. Butler et al. [96], for instance, showed that the recycled coarse aggregate from different sources could have different effects on the properties of concrete made with recycled coarse aggregate. Additionally, the amount of impurity in the recycled aggregate, the processing condition, and the moisture condition could affect the mechanical properties of the concrete produced with these aggregates. However, a regression analysis was conducted, considering that the dataset used in the present study showed that the amount and type of impurities were not significant. Similar results were found by Behnood et al. [3]. These results pointed out that the amount and type of aggregate impurities were not statistically significant when generating their predictive model for the elastic modulus of concrete made with recycled aggregate. Results in this sense should be analyzed thoroughly, especially in practical applications, as shown in Gao et al. [104], where the authors developed a study in which a framework was proposed to overcome industry barriers to producing recycled mixture.
The database containing 412 records of concretes produced with natural or recycled coarse aggregates (from CDW) and with elastic modulus between 20 and 45 GPa was divided into three subsets: one for training (60% of the data), one for validation (20% of the data), and another for testing and performance assessment (20% of the data). This subdivision aimed to minimize network overtraining and to increase the model's applicability domain. All these datasets were created considering fixed data from the experimental database.

Statistical Analysis of the Data
Throughout the modeling process, the proper compilation of the model's variables is extremely important since inappropriate selection could prevent the ANN from process information, including mapping input and output data [74,75].
Therefore, a dispersion analysis on the database of the elastic modulus (E c ) was carried out, considering the influence of the available parameters, such as CC, WCR, RCA, FACR, TACR, and CACR. In every analysis, correlation coefficients of Pearson "P" (Equation (2)) and Spearman "S" (Equation (3)) were determined, as well as the average, the standard deviation, the minimum value (Min), the first quartile (Q1), the median, the third quartile (Q3), and the maximum value (Max). Table 1 presents the statistical parameters and Figure 4 shows the frequency distribution curves.
where x i and y i are the variables analyzed, x m and y m are the average values, cov(•, •) is the covariance function, σ(•) is the standard deviation function, and rg X and rg Y are the rank variables of X and Y. be represented by simple functions, such as normal, log-normal, logistic, and exponential functions. Thus, the data dispersion and the correlation coefficients identified (i) an inverse relationship between the WCR and the elastic modulus; (ii) increases in the elastic modulus as the CC increases; (iii) a small inverse correlation of the TACR with the elastic modulus; and (iv) an inverse relation of 0.52 between the elastic modulus and the RCA, therefore indicating that as the RCA increases, the mechanical property decreases.   The data dispersion and the correlation coefficients presented in Figure 4 and Table 1 enable a description of how to spread out a set of data and suggest whether they could be represented by simple functions, such as normal, log-normal, logistic, and exponential functions. Thus, the data dispersion and the correlation coefficients identified (i) an inverse relationship between the WCR and the elastic modulus; (ii) increases in the elastic modulus as the CC increases; (iii) a small inverse correlation of the TACR with the elastic modulus; and (iv) an inverse relation of 0.52 between the elastic modulus and the RCA, therefore indicating that as the RCA increases, the mechanical property decreases.
It was noticeable that the WCR, the RCA, and the CC would have a great influence on the predictive model, as indicated by the values of the correlation coefficients presented in Table 1. However, since neural networks can map complex and nonlinear relations, all variables at disposal in the database were used, letting the neural network choose the best parameters.

ANN Training and Parameters Definition
For network training and, consequently, mapping of the elastic modulus of concrete, some ANN topologies were created with different numbers of neurons in the input layer (from four up to six input parameters) and by the number of neurons present in each of the two hidden layers used (from one up to nine), as recommended by Felix at al. [50], in which the authors relate that no more than two hidden layers containing up to nine neurons are necessary to map complex problems, such as the mechanical parameters of concrete. All the ANN topologies utilized in the training process can be seen in Figure 5.
It was noticeable that the WCR, the RCA, and the CC would have a great influence on the predictive model, as indicated by the values of the correlation coefficients presented in Table 1. However, since neural networks can map complex and nonlinear relations, all variables at disposal in the database were used, letting the neural network choose the best parameters.

ANN Training and Parameters Definition
For network training and, consequently, mapping of the elastic modulus of concrete, some ANN topologies were created with different numbers of neurons in the input layer (from four up to six input parameters) and by the number of neurons present in each of the two hidden layers used (from one up to nine), as recommended by Felix at al. [50], in which the authors relate that no more than two hidden layers containing up to nine neurons are necessary to map complex problems, such as the mechanical parameters of concrete. All the ANN topologies utilized in the training process can be seen in Figure 5. For the training process, a bipolar sigmoid function was adopted. As the network learning is supervised with the backpropagation training algorithm, a learning rate must be established since it is directly related to the network convergence. The learning rate adopted was 0.35, as indicated in Felix et al. [48]. Training and validation were performed simultaneously in order to prevent overtraining-when the network perfectly maps the training set data but is unable to interpolate the validation data, resulting in a low performance index [75,105,106]. Training and validation were performed using the same dataset for every ANN topology. As convergence criteria, root-mean-square error (RMSE-Equation (4)) was used. Additionally, training was interrupted when the number of interactions exceeded 105. The networks were created and trained using the computational package project-yapy, developed in C++ [107]. For the training process, a bipolar sigmoid function was adopted. As the network learning is supervised with the backpropagation training algorithm, a learning rate must be established since it is directly related to the network convergence. The learning rate adopted was 0.35, as indicated in Felix et al. [48]. Training and validation were performed simultaneously in order to prevent overtraining-when the network perfectly maps the training set data but is unable to interpolate the validation data, resulting in a low performance index [75,105,106]. Training and validation were performed using the same dataset for every ANN topology. As convergence criteria, root-mean-square error (RMSE-Equation (4)) was used. Additionally, training was interrupted when the number of interactions exceeded 105. The networks were created and trained using the computational package project-yapy, developed in C++ [107].
where y i refers to network estimated values, t i represents known values (targets), and n is the amount of data used in the analysis.
A robust dataset (46,656 samples) was determined using the ANN model previously created, and it was employed in the nonlinear regression modeling. Input variables were randomly generated following its distribution (see Figure 4) to assemble a dataset similar to the experimental database.
A function base that represents the analytical formulation was defined considering the statistical analysis, which employed correlation coefficients between the selected input parameters and the concrete elastic modulus. It was verified that, when the RCA is combined with the CC, WCR, CACR, or FACR, the correlation with the elastic modulus increases. However, the TACR presented a better correlation with the elastic modulus when analyzed by itself (see Equation (5)).
In the second step, the format of functions f 1 to f 5 in Equation (5) are defined using one of Equations (6)- (12). Whichever equation better represents the relation of the current parameter with the elastic modulus is selected. Adjustments to coefficients α i and β i are made in the third step using least squares and backward elimination [109]. Terms that do not contribute to the model are removed, as long as the removal does not decrease the accuracy of the adjustment. This step is repeated until the convergence of coefficients α i and β i . In the last stage, results obtained with the proposed model are compared with the experimental database [11,20,.
one of Equations (6)- (12). Whichever equation better represents the relation of the current parameter with the elastic modulus is selected. Adjustments to coefficients αi and βi are made in the third step using least squares and backward elimination [109]. Terms that do not contribute to the model are removed, as long as the removal does not decrease the accuracy of the adjustment. This step is repeated until the convergence of coefficients αi and βi. In the last stage, results obtained with the proposed model are compared with the experimental database [11,20,.

Results and Discussion
The development of the formulation was performed using two modeling techniques. ANN was first used to map the elastic modulus of concrete containing coarse recycled aggregate. Once trained and validated, many samples were generated. Then, this new dataset was employed to create an analytical formulation using multivariable and nonlinear regression. Hence, the results regarding the application of ANN and nonlinear and multivariable regression are presented below. Afterwards, a parametrical study of the proposed formulation is presented to demonstrate its applicability.

Analysis of the ANN Modeling
After ANN training, a performance analysis was conducted to select the optimum number of neurons in the hidden layer for each of the adopted topologies, represented in Figure 5. Fifteen models were selected regarding the RMSE obtained in training and validation stages. These results are shown in Table 2, where the maximum error (Emax) and coefficient of determination (R 2 ) are also included.

Results and Discussion
The development of the formulation was performed using two modeling techniques. ANN was first used to map the elastic modulus of concrete containing coarse recycled aggregate. Once trained and validated, many samples were generated. Then, this new dataset was employed to create an analytical formulation using multivariable and nonlinear regression. Hence, the results regarding the application of ANN and nonlinear and multivariable regression are presented below. Afterwards, a parametrical study of the proposed formulation is presented to demonstrate its applicability.

Analysis of the ANN Modeling
After ANN training, a performance analysis was conducted to select the optimum number of neurons in the hidden layer for each of the adopted topologies, represented in Figure 5. Fifteen models were selected regarding the RMSE obtained in training and validation stages. These results are shown in Table 2, where the maximum error (E max ) and coefficient of determination (R 2 ) are also included. * [x-y-w-z] 4-layer topology, where x indicates the input number, y the neuron number in the first hidden layer, w the neuron number in the second hidden layer, and z the output number.
Most ANNs presented good results for R 2 , RMSE, and Emax, but their values worsen when training and validation stages are compared, as can be seen in Figure 7. This remarks the importance of cross-validation in any ANN training process. As shown in Figure 7 and Table 2, the ANN achieved good performance in the training processes. The results point out that the variables cement consumption-CC, water/cement ratio-WCR, replacement of natural aggregate by recycled coarse aggregate ratio-RCA, and total aggregate cement ratio-TACR, would have a great influence on the prediction model and that they are very important to consider as predictor parameters As shown in Figure 7 and Table 2, the ANN achieved good performance in the training processes. The results point out that the variables cement consumption-CC, water/cement ratio-WCR, replacement of natural aggregate by recycled coarse aggregate ratio-RCA, and total aggregate cement ratio-TACR, would have a great influence on the prediction model and that they are very important to consider as predictor parameters of the prediction model. However, it is possible to observe that when more variables are introduced, such as the fine aggregate cement ratio-FACR and the coarse aggregate cement ratio-CACR, the results are improved, demonstrating that ANN performance can be improved when the number of predictor parameters increases since they are thus more representative. Figure 8 shows the mapping performance for the best ANN of each basic topology used in this work (topologies that can be seen in Figure 5). Additionally, Figure 8 indicates the coefficient of determination of each ANN for the training and validation stages.
As seen in Figure 8, the ANN [4-3-3-1], [5-4-2-1], and [6-4-2-1] presented the optimum number of neurons in the hidden layer for the three basic topologies-those with a different number of input parameters, as shown in Figure 5. To select the topology that best maps the elastic modulus of concrete containing natural and recycled coarse aggregates, the results presented in Figure 8 and Table 2 were analyzed, where the performance parameters obtained in the training and validation stages were compared.
The ANN with the topology [6-4-2-1] was selected with the best performance, where the coefficient of determination was 0.95 and 0.92 in the stages of training and validation, respectively. The maximum error of this ANN was 3.79 and 3.20 GPa in the training and validation stages, respectively.
Finally, is it necessary to test the model with regard to its potential for generalizability. Haykin [74] and Patterson [75] relate that if the ANN performs well on the data that it has not trained on, it can be said that it has generalized well to the given data. Considering that, Figure 9 shows the good performance obtained with the model application in the test analysis, where the analysis was performed with experimental results collected from the literature. The results point out the model's generalizability and that it is able to estimate the elastic modulus of concrete containing recycled aggregate from construction and demolition waste.
introduced, such as the fine aggregate cement ratio-FACR and the coarse aggregate cement ratio-CACR, the results are improved, demonstrating that ANN performance can be improved when the number of predictor parameters increases since they are thus more representative. Figure 8 shows the mapping performance for the best ANN of each basic topology used in this work (topologies that can be seen in Figure 5). Additionally, Figure 8 indicates the coefficient of determination of each ANN for the training and validation stages. As seen in Figure 8, the ANN [4-3-3-1], [5-4-2-1], and [6-4-2-1] presented the optimum number of neurons in the hidden layer for the three basic topologies-those with a different number of input parameters, as shown in Figure 5. To select the topology that best maps the elastic modulus of concrete containing natural and recycled coarse aggregates, the results presented in Figure 8 and Table 2 were analyzed, where the performance parameters obtained in the training and validation stages were compared. Finally, is it necessary to test the model with regard to its potential for generalizability. Haykin [74] and Patterson [75] relate that if the ANN performs well on the data that it has not trained on, it can be said that it has generalized well to the given data. Considering that, Figure 9 shows the good performance obtained with the model application in the test analysis, where the analysis was performed with experimental results collected from the literature. The results point out the model's generalizability and that it is able to estimate the elastic modulus of concrete containing recycled aggregate from construction and demolition waste.

Analysis of the Nonlinear Regression
ANN [6-4-2-1] was used to generate 46,656 datasets based on input variables randomly obtained based on their distribution (see Figure 4). Multivariable regression analysis using linear, polynomial, rational, and exponential functions according to Equations 6-12 was employed to propose a formulation to predict the elastic modulus. A backward process was used to set up the parameters for the prediction of the elastic modulus based

Analysis of the Nonlinear Regression
ANN [6-4-2-1] was used to generate 46,656 datasets based on input variables randomly obtained based on their distribution (see Figure 4). Multivariable regression analysis using linear, polynomial, rational, and exponential functions according to Equations 6-12 was employed to propose a formulation to predict the elastic modulus. A backward process was used to set up the parameters for the prediction of the elastic modulus based on the following input variables: CC, WCR, CACR, FACR, TACR, and RCA. Table 3 shows the selected function and the required coefficients for each function. Null terms are omitted. Table 3. Results of the multivariable regression.

Function Equation Number
Thus, the general form of the equation to predict the elastic modulus is given by: where f 1 , f 2 , f 3 , f 4 and f 5 establish the relation between input and output variables, written as: Following Santana et al. [108], normality, homoscedasticity, and independence from residuals of the proposed multivariable and nonlinear regression were evaluated. A Shapiro-Wilk test (of normality) resulted (p-value > 0.05) in 0.917, which indicates that the null hypothesis of the data being normally distributed is not rejected. A Durbin-Watson test (of independence) resulted in 0.803-values below 2.0 indicate that error terms have a positive autocorrelation. Additionally, a Breusch-Pagan test (for homoscedasticity) resulted in 0.286 for a significance level of 5%-low values indicate homoscedasticity. This behavior can be observed in Figure 10c, in which the residuals has a constant dispersion. Figure 10 also shows other performance indicators: R 2 (Figure 10a), the sum of squares error "SQE" (Figure 10b), the PRESS (prediction error sum of squares) (Figure 10b), the RMSE (Figure 10c), the E max (Figure 10c), and the percentage residuals (Figure 10d). Additionally, a coefficient of determination of 0.88 was obtained, pointing out its estimation capacity. The PRESS value of 463.51 has the same magnitude as the sum of squares error (391. 18), an indication of model validity. Errors have a normal distribution with an average close to zero. Ninety-seven percent of the predicted values presented errors below the RMSE, thus indicating this value as the model error. The results indicate that the model was coherently developed and can predict the elastic modulus of concretes made with natural and recycled aggregates. 10d). Additionally, a coefficient of determination of 0.88 was obtained, pointing out its estimation capacity. The PRESS value of 463.51 has the same magnitude as the sum of squares error (391. 18), an indication of model validity. Errors have a normal distribution with an average close to zero. Ninety-seven percent of the predicted values presented errors below the RMSE, thus indicating this value as the model error. The results indicate that the model was coherently developed and can predict the elastic modulus of concretes made with natural and recycled aggregates. Therefore, the results point out the model's applicability and that it is able to estimate the elastic modulus of concrete containing recycled aggregate from construction and demolition waste, with distinct replacement ratio (0-100%), water/cement ratio varying from 0.25 to 0.68, and cement consumption (in kg/m 3 ) varying from 247.00 to 512.50.

Parametric Analysis
In order to assess whether the developed formulation consistently represents the influence of each variable considered in the model and whether the formulation efficiently maps the concrete elastic modulus, a parametric analysis was conducted, and the results obtained were compared with the results available in the literature.
Five analyses were performed considering the combination of two input variables and their effects on the elastic modulus.
A reference scenario with the average values of model input parameters was set up, according to Table 1. The applicability domain was used to establish the range of the variables, as also shown in Table 1.
Initially, the mutual influence of the WCR and the RCA is presented in Figure 11. There is an inverse relation between the WCR and elastic modulus, regardless of the RCA. Therefore, the results point out the model's applicability and that it is able to estimate the elastic modulus of concrete containing recycled aggregate from construction and demolition waste, with distinct replacement ratio (0-100%), water/cement ratio varying from 0.25 to 0.68, and cement consumption (in kg/m 3 ) varying from 247.00 to 512.50.

Parametric Analysis
In order to assess whether the developed formulation consistently represents the influence of each variable considered in the model and whether the formulation efficiently maps the concrete elastic modulus, a parametric analysis was conducted, and the results obtained were compared with the results available in the literature.
Five analyses were performed considering the combination of two input variables and their effects on the elastic modulus.
A reference scenario with the average values of model input parameters was set up, according to Table 1. The applicability domain was used to establish the range of the variables, as also shown in Table 1.
Initially, the mutual influence of the WCR and the RCA is presented in Figure 11. There is an inverse relation between the WCR and elastic modulus, regardless of the RCA.
The results of the analytical model proposed in this work indicate a reduction of 7% in elastic modulus for concretes made with 100% recycled aggregate when the WCR increased 25%. On the other hand, when analyzing concrete produced only with natural aggregate, an increase of 25% in the WCR generates a 13% of reduction on the concrete elastic modulus. This is explained by the porosity of the cement matrix, which increases, as the WCR also does. Gómez-Soberón [76] studied the influence of saturation degree in concretes made with recycled aggregate and observed that as the RCA increases, porosity also increases, which directly influences material stiffness. Figure 11. Effect of the WCR and the RCA in the elastic modulus in (a) 3D and (b) isoline. Figure 12 presents the influence of the CC and the WCR. Higher CC associated with low WCR improves the elastic modulus. However, by fixing the WCR, only a small increment in the elastic modulus is seen when the CC is increased.  The results of the analytical model proposed in this work indicate a reduction of 7% in elastic modulus for concretes made with 100% recycled aggregate when the WCR increased 25%. On the other hand, when analyzing concrete produced only with natural aggregate, an increase of 25% in the WCR generates a 13% of reduction on the concrete elastic modulus. This is explained by the porosity of the cement matrix, which increases, as the WCR also does. Gómez-Soberón [76] studied the influence of saturation degree in concretes made with recycled aggregate and observed that as the RCA increases, porosity also increases, which directly influences material stiffness. Figure 12 presents the influence of the CC and the WCR. Higher CC associated with low WCR improves the elastic modulus. However, by fixing the WCR, only a small increment in the elastic modulus is seen when the CC is increased.
x FOR PEER REVIEW 16 of 22 Results obtained with the proposed analytical model show a reduction of 22% in the elastic modulus for concretes made with 100% recycled aggregate compared with concrete made only with natural aggregate (see Figure 13). These results are consistent with those obtained by Etxeberria et al. [80], where the authors evaluated the influence of the RCA in concrete properties and verified that the stiffness of concretes made with recycled aggregate increases with increments of the CC, which resulted from a more compact matrix. The authors also observed that, for a WCR of 0.5 and a CC of 325 kg/m 3 , concretes made with 100% of RCA had a decrease of 20-25% in the elastic modulus.  Figures 11 and 13 show that concretes made with replacement ratios up to 20% of natural aggregate reach a similar elastic modulus, considering the same CC and WCR. However, concretes made with a replacement ratio above 50% require a reduction of 5-23% of WCR and an increase of 4-18% of CC to sustain around the same elastic modulus Results obtained with the proposed analytical model show a reduction of 22% in the elastic modulus for concretes made with 100% recycled aggregate compared with concrete made only with natural aggregate (see Figure 13). These results are consistent with those obtained by Etxeberria et al. [80], where the authors evaluated the influence of the RCA in concrete properties and verified that the stiffness of concretes made with recycled aggregate increases with increments of the CC, which resulted from a more compact matrix. The authors also observed that, for a WCR of 0.5 and a CC of 325 kg/m 3 , concretes made with 100% of RCA had a decrease of 20-25% in the elastic modulus.
x FOR PEER REVIEW 16 of 22 Results obtained with the proposed analytical model show a reduction of 22% in the elastic modulus for concretes made with 100% recycled aggregate compared with concrete made only with natural aggregate (see Figure 13). These results are consistent with those obtained by Etxeberria et al. [80], where the authors evaluated the influence of the RCA in concrete properties and verified that the stiffness of concretes made with recycled aggregate increases with increments of the CC, which resulted from a more compact matrix. The authors also observed that, for a WCR of 0.5 and a CC of 325 kg/m 3 , concretes made with 100% of RCA had a decrease of 20-25% in the elastic modulus.  Figures 11 and 13 show that concretes made with replacement ratios up to 20% of natural aggregate reach a similar elastic modulus, considering the same CC and WCR. However, concretes made with a replacement ratio above 50% require a reduction of 5-23% of WCR and an increase of 4-18% of CC to sustain around the same elastic modulus as from concretes made with only natural aggregates. Figures 14 and 15 show the influence of the replacement ratio of recycled aggregates  Figures 11 and 13 show that concretes made with replacement ratios up to 20% of natural aggregate reach a similar elastic modulus, considering the same CC and WCR. However, concretes made with a replacement ratio above 50% require a reduction of 5-23% of WCR and an increase of 4-18% of CC to sustain around the same elastic modulus as from concretes made with only natural aggregates. Figures 14 and 15 show the influence of the replacement ratio of recycled aggregates associated with coarse and fine aggregate/cement ratio, respectively. Smaller ratios of aggregate/cement lead to lower elastic modulus. A minimum value of elastic modulus was found for FACR and CACR close to 2.0 and 100% of RCA.  Cabral et al. [110] describe that the FACR has a greater influence than the CACR on the elastic modulus, once the concrete elastic modulus is associated with the volume fraction, specific weight, and aggregate elastic modulus. Mehta and Monteiro [8] point out that the aggregate strain is related to its porosity, to the maximum size, its shape, texture, granulometry, and mineralogical composition.
In addition, Figures 11 to 15 show that the influence of the replacement ratio of natural aggregates by recycled aggregates may decrease the elastic modulus up to 32%. Ajdukiewicz and Kliszczewicz [11], Gómez-Soberón [76], Etxeberria et al. [80], and Cabral et al. [110] point out that 100% RCA generates concretes with an elastic modulus from 25% up to 35% lower than the concretes produced with 100% natural aggregate. Estolano et al. [111] found a decrease of 35.4% of the concrete stiffness due to complete replacement of natural aggregates associated with large increases in void index and water absorption.

Conclusions
In this study, we evaluated the possibility of applying machine learning coupled with  Cabral et al. [110] describe that the FACR has a greater influence than the CACR on the elastic modulus, once the concrete elastic modulus is associated with the volume fraction, specific weight, and aggregate elastic modulus. Mehta and Monteiro [8] point out that the aggregate strain is related to its porosity, to the maximum size, its shape, texture, granulometry, and mineralogical composition.
In addition, Figures 11 to 15 show that the influence of the replacement ratio of natural aggregates by recycled aggregates may decrease the elastic modulus up to 32%. Ajdukiewicz and Kliszczewicz [11], Gómez-Soberón [76], Etxeberria et al. [80], and Cabral et al. [110] point out that 100% RCA generates concretes with an elastic modulus from 25% up to 35% lower than the concretes produced with 100% natural aggregate. Estolano et al. [111] found a decrease of 35.4% of the concrete stiffness due to complete replacement of natural aggregates associated with large increases in void index and water absorption.

Conclusions
In this study, we evaluated the possibility of applying machine learning coupled with Cabral et al. [110] describe that the FACR has a greater influence than the CACR on the elastic modulus, once the concrete elastic modulus is associated with the volume fraction, specific weight, and aggregate elastic modulus. Mehta and Monteiro [8] point out that the aggregate strain is related to its porosity, to the maximum size, its shape, texture, granulometry, and mineralogical composition.
In addition, Figures 11-15 show that the influence of the replacement ratio of natural aggregates by recycled aggregates may decrease the elastic modulus up to 32%. Ajdukiewicz and Kliszczewicz [11], Gómez-Soberón [76], Etxeberria et al. [80], and Cabral et al. [110] point out that 100% RCA generates concretes with an elastic modulus from 25% up to 35% lower than the concretes produced with 100% natural aggregate. Estolano et al. [111] found a decrease of 35.4% of the concrete stiffness due to complete replacement of natural aggregates associated with large increases in void index and water absorption.

Conclusions
In this study, we evaluated the possibility of applying machine learning coupled with nonlinear regression to obtain a formulation to estimate the elastic modulus of concretes made with natural and recycled coarse aggregate. Artificial neural networks, which have the best learning power among various machine learning models, were applied.
The main novelty of this work is the methodology employed in the development of the analytical formulation, which used artificial neural networks coupled with nonlinear regression. The regression modeling considered a dataset generated with an ANN that efficiently mapped the concrete elastic modulus from the following predictor variables: cement consumption, water/cement ratio, replacement ratio of recycled coarse aggregate, fine aggregate/cement ratio, total aggregate/cement ratio, and coarse aggregate/cement ratio. The model was developed considering predictor parameters that are easy to obtain and do not require destructive testing.
Regarding the ANN modeling, it was observed that networks with two hidden layers containing up to six neurons were sufficient to efficiently map the concrete's elastic modulus, reducing the ANN size. In additional, the results show that the replacement ratio of recycled coarse aggregate, fine aggregate/cement ratio, and coarse aggregate/cement ratio are very important parameters to consider as predictors.
Regarding the mathematical expression, the proposed formulation presented a coefficient of determination of 0.88, which indicates its predictive capacity. The error residuals presented a normal distribution with an average close to zero, and 97% of predicted values had errors below the 3.06 GPa, with a maximum error of 3.67 GPa.
In addition to the results, the following conclusions were drawn from the study: • Concretes made with a ratio of natural aggregates replacement with recycled aggregates of up to 20% reaches almost the same stiffness as concrete made with 100% natural aggregate; • Concretes made with a replacement ratio above 50% require lower water/cement ratios (about 5-23%) and higher cement consumption (about 4-18%) than concretes made with 100% natural aggregate; • Results of the analytical model proposed in this work showed a reduction of 7% in elastic modulus for concretes made with 100% recycled aggregate when the WCR increased 25%. On the other hand, when analyzing concrete produced only with natural aggregate, an increase of 25% in the WCR generated a 13% of reduction on the concrete elastic modulus; • Modeling with a nonlinear regression technique coupled with artificial intelligence provides an alternative and efficient methodology to solve problems related to civil and materials engineering.
Finally, the parametric study of the proposed analytical model demonstrated that it can be used to predict the concrete elastic modulus and that it can indicate better mix proportions for concretes containing natural and/or recycled coarse aggregates, thus enabling its use as a simulation tool in the development of engineering projects focused on durability and sustainability.

Funding:
The research support of the Brazilian National Council for Scientific and Technological Development (CNPq 141078/2018 and CNPq 310564/2018-2) is gratefully acknowledged. This study was also financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior-Brasil (CAPES)-Finance Code 001, Universidade Federal da Integração Latino-Americana-(120/ 2020/PRPPG), and by the Centro Universitário Estácio de Ribeirão.

Data Availability Statement:
The data presented in this study are available on request from the corresponding author.