Development of an ANN-Based Lumped Plasticity Model of RC Columns Using Historical Pseudo-Static Cyclic Test Data

Featured Application: Predicting the critical properties of RC columns using the developed ANN model, providing parameters for their numerical simulation based upon a lumped plasticity model. Abstract: This study explores the possibility of using an ANN-based model for the rapid numerical simulation and seismic performance prediction of reinforced concrete (RC) columns. The artiﬁcial neural network (ANN) method is implemented to model the relationship between the input features of RC columns and the critical parameters of the commonly used lumped plasticity (LP) model: The strength and the yielding, capping and ultimate deformation capacity. Cyclic test data of 1163 column specimens obtained from the PEER and NEEShub database and other sources are collected and divided into the training set, test set and validation set for the ANN model. The e ﬀ ectiveness of the proposed ANN model is validated by comparing it with existing explicit formulas and experimental results. Results indicated that the developed model can e ﬀ ectively predict the strength and deformation capacities of RC columns. Furthermore, the response of two RC frame structures under static force and strong ground motion were simulated by the ANN-based, bi-linear and tri-linear LP model method. The good agreement between the proposed model and test results validated that the ANN-based method can provide su ﬃ ciently accurate model parameters for modeling the seismic response of RC columns using the LP model.


Introduction
Reinforced concrete (RC) columns are fundamental structural components that are widely used in civil infrastructures such as buildings and bridges. In seismic-prone regions, the seismic performance of such structural components significantly affects structural safety in seismic events. However, due to the nonlinearity of structural materials and the uncertainty of earthquake excitations, there are still some difficulties exists for researchers and engineers in some fields, e.g., the rapid seismic evaluation of a regional transport network, including a number of bridges [1]. Therefore, it is still necessary to develop a rapid and reliable model for the main components of structures, e.g., RC columns and piers.
Researchers and engineers in seismic engineering have conducted many laboratory experiments, including pseudo-static and shaking table tests, to investigate the mechanical properties of RC columns. Based on them, several explicit formulas have also been established, founded upon a 1.
The shape of the cross section of the RC columns should be rectangular; others like circular columns are not included in this study.

2.
The RC columns only sustained constant axial loading and unidirectional cyclic lateral force. Cyclic tests of the RC column under biaxial lateral or variable axial loads are excluded.

3.
A complete loading process was applied to the specimens until failure; the load carrying capacity of the specimen decreased by more than 20% compared to the peak strength. 4.
Details of the specimens are available, such as the geometrical size and reinforcement, as well as experimental results of the hysteretic curves.

5.
Normal concrete was used in the manufacture of the specimens, without additives such as steel fiber.
A total of 1163 cyclic test results of RC columns were collected to build a database for training, testing and validating the ANN model. Details of the selected specimens, including the data source and geometrical size, are listed in the Appendix A. It can be observed that the main features of the selected columns have a relatively wide distribution.
The numerical simulation of RC columns may depend upon many factors, such as the material properties, geometrical configuration, reinforcement layout and loading protocol. Figure 1 shows a schematic diagram of a typical cantilever column, illustrating some of these factors, as well as its LP model. The section width (B) and depth (D) is defined as the section size which is perpendicular and parallel to the lateral force, respectively. It should be noted that a cantilever and fixed-end columns are two typical structural components used for civil structures. For simplicity, this study mainly focused on cantilever columns, which are widely used in bridge piers; the geometrical size and test results of the fixed-end columns were modified by assuming that the inflection point of the columns occurs at the half height of the column. As illustrated, the optimal feature subset is selected by the genetic algorithm (GA) method, then used to train the ANN model. It can help determine the parameters of the LP model for RC columns. As for the modeling, the LP model of RC columns is usually represented by the bi-linear and trilinear model [28]. As shown in Figure 2, the strength and the yielding, capping and ultimate deformation of the RC columns can be defined according to the method using the hysteretic curves obtained from cyclic tests. The strength is defined as the peak point when the resistance force reaches its maximum value. According to statistical results by Haselton et al. [29], the yielding deformation can be roughly estimated as that corresponding to 85% of the peak strength, and the ultimate deformation is obtained from the drift when the resistance drops to 20% of the peak strength [6]. It should be noted that the actual test results of the hysteretic curve cannot always achieve perfect symmetry in both the positive and negative directions. Therefore, the mean values of the strength and deformation in the positive and negative directions were adopted for the following analysis. The strength and deformation capacities of 1,163 RC columns were determined according to the above approach, and used as the outputs of the ANN model. Consequently, the model parameters of the LP model (e.g., yielding moment My, initial stiffness K1, and the coefficient of the post yielding stiffness r) can be determined by the four parameters: The strength ( max F ), the yielding ( y δ ), capping ( c δ ) and ultimate drift ( u δ ), as shown in Tables 2 and 3. As can be seen,   As is known, there are many factors influencing the performance of RC columns, some of which are dependent. For example, the effective depth (d e ) can be calculated by B, D and the effective cover thickness (d ). Although the independent parameters are simple and easily accessible, the derived parameters may be more effective in some cases. Consequently, the potential influencing features are collected for feature selection, which are categorized into six groups as listed in Table 1. It can be observed from the table that a total of 24 features were included, in order to investigate their effects on RC columns. As for the modeling, the LP model of RC columns is usually represented by the bi-linear and tri-linear model [28]. As shown in Figure 2, the strength and the yielding, capping and ultimate deformation of the RC columns can be defined according to the method using the hysteretic curves obtained from cyclic tests. The strength is defined as the peak point when the resistance force reaches its maximum value. According to statistical results by Haselton et al. [29], the yielding deformation can be roughly estimated as that corresponding to 85% of the peak strength, and the ultimate deformation is obtained from the drift when the resistance drops to 20% of the peak strength [6]. It should be noted that the actual test results of the hysteretic curve cannot always achieve perfect symmetry in both the positive and negative directions. Therefore, the mean values of the strength and deformation in the positive and negative directions were adopted for the following analysis. The strength and deformation capacities of 1163 RC columns were determined according to the above approach, and used as the outputs of the ANN model.

Architecture of the ANN Model
ANN is a humanlike information processing method, which is composed of many highly interconnected processing elements or neurons. In this study, the developed ANN model that maps the features of the RC column and the structural capacities can be written as: where y is the output feature vector of the ANN, including the strength and the yielding, capping and ultimate drift of the columns; x is the input feature vector of the column, as listed in Table 1; and θ represents the learning parameters.
As shown in Figure 3, a typical ANN architecture consists of an input layer, output layer and one or several hidden layers, and each layer has corresponding neurons [31]. In this study, back propagation neural network (BPNN) [32] was adopted to predict the strength and deformation capacity of the columns. In the training process, the inputs propagate toward the output layer through the hidden layers, and errors between the predicted and experimental values back propagate from the output layer to the input layer to adjust the weights and thresholds of the hidden layers. In addition, the activation function defines how the input of a unit combines with its current activation level to complete a new activation. In this study, the commonly-used sigmoid activation function is utilized. Once the optimal connection weights and thresholds are determined, the trained ANN model can be conveniently employed to evaluate the strength and deformation capacities of RC columns. Consequently, the model parameters of the LP model (e.g., yielding moment M y , initial stiffness K 1 , and the coefficient of the post yielding stiffness r) can be determined by the four parameters: The strength (F max ), the yielding (δ y ), capping (δ c ) and ultimate drift (δ u ), as shown in Tables 2 and 3. As can be seen, M y = F y L + Pδ y L and M u = F max L + Pδ c L are the yielding moment and ultimate moment, respectively. Consequently, the four variables (F max , δ u , δ c , δ y ) are used as the outputs of the ANN model. Table 2. Definition and determination of the bi-linear lumped plasticity (LP) model.

Parameter Description Calculation
K e the initial stiffness of the linear segment M y /δ y M y the yielding moment F y L + Pδ y L b the hardening ratio K sh /K e , K sh = (M c − M y )/(δ u − δ y ) Table 3. Definition and determination of the tri-linear LP model [30].

Architecture of the ANN Model
ANN is a humanlike information processing method, which is composed of many highly interconnected processing elements or neurons. In this study, the developed ANN model that maps the features of the RC column and the structural capacities can be written as: where y is the output feature vector of the ANN, including the strength and the yielding, capping and ultimate drift of the columns; x is the input feature vector of the column, as listed in Table 1; and θ represents the learning parameters. As shown in Figure 3, a typical ANN architecture consists of an input layer, output layer and one or several hidden layers, and each layer has corresponding neurons [31]. In this study, back propagation neural network (BPNN) [32] was adopted to predict the strength and deformation capacity of the columns. In the training process, the inputs propagate toward the output layer through the hidden layers, and errors between the predicted and experimental values back propagate from the output layer to the input layer to adjust the weights and thresholds of the hidden layers. In addition, the activation function defines how the input of a unit combines with its current activation level to complete a new activation. In this study, the commonly-used sigmoid activation function is utilized. Once the optimal connection weights and thresholds are determined, the trained ANN model can be conveniently employed to evaluate the strength and deformation capacities of RC columns.

Input and Output Layer
As described previously, the strength and the deformation capacities of RC columns are affected by numerous factors. If all of the features are included in the input layer, the ANN model becomes very complex, and prone to fail due to overfitting. Therefore, it is reasonable to select an optimal input feature subset for the ANN model. To achieve optimization, all the input features listed in Table   1, and the output features are first organized into an n m × dimensional matrix and an 4 n× dimensional matrix, respectively, as follows: is the j th input feature of the i th RC column in the training set, and ik y (k = 1, 2, 3, 4) is the corresponding output feature of the i th RC column. Considering the significant difference in the value ranges of the features, data obtained from the specimens and the test results of the training set are first normalized within the range of [0,1] by: where ij x is j th normalized input feature corresponding to the i th specimen; j x represents the j th input feature vector of all the specimens; and k y denotes the k th output feature vector. After In this study, a genetic algorithm (GA) was adopted to find the optimal input features of the column. GA is a metaheuristic inspired by the process of theoretical Darwinian natural selection in biological systems to generate high-quality solutions of search problems [33]. In this method, each feature subset, such as structural configuration and material properties, is represented by an

Input and Output Layer
As described previously, the strength and the deformation capacities of RC columns are affected by numerous factors. If all of the features are included in the input layer, the ANN model becomes very complex, and prone to fail due to overfitting. Therefore, it is reasonable to select an optimal input feature subset for the ANN model. To achieve optimization, all the input features listed in Table 1, and the output features are first organized into an n × m dimensional matrix and an n × 4 dimensional matrix, respectively, as follows: where x ij (i = 1, 2, · · · , n) is the jth input feature of the ith RC column in the training set, and y ik (k = 1, 2, 3, 4) is the corresponding output feature of the ith RC column. Considering the significant difference in the value ranges of the features, data obtained from the specimens and the test results of the training set are first normalized within the range of [0, 1] by: where x ij is jth normalized input feature corresponding to the ith specimen; x j represents the jth input feature vector of all the specimens; and y k denotes the kth output feature vector. After normalization, n input-output pairs ( . . , y i4 are the normalized input and output features of the ith column, respectively. In this study, a genetic algorithm (GA) was adopted to find the optimal input features of the column. GA is a metaheuristic inspired by the process of theoretical Darwinian natural selection in biological systems to generate high-quality solutions of search problems [33]. In this method, each feature subset, such as structural configuration and material properties, is represented by an individual population. In the selection process, individuals with better phenotypic characteristics have greater probability to survive and reproduce in a population, whereas the less adapted individuals are more likely to disappear. Thus, the GA obtains the optimal solution after a series of iterative computations. Figure 4 shows the chromosome design, fitness function and system architecture for the GA-based optimal feature selection of the columns, which operates in binary spaces (called chromosomes) and manipulates a population of potential solutions for the optimal input subset. A chromosome (genotype of the input features) is represented by binary coding as follows: where g 1 , g 2 , . . . , g m represent the 24 input features that will be selected. The initial chromosome population is randomly generated first, using binary coding. For example, chromosome 100100 means that the first and the fourth input features were selected.
Appl. Sci. 2019, 9, x FOR PEER REVIEW 8 of 21 chromosomes) and manipulates a population of potential solutions for the optimal input subset. A chromosome (genotype of the input features) is represented by binary coding as follows: where 1 2 , , ..., m g g g represent the 24 input features that will be selected. The initial chromosome population is randomly generated first, using binary coding. For example, chromosome 100100 means that the first and the fourth input features were selected. During the selection process, the fitness function is used to evaluate the quality of possible input subsets. There are several measurements can be used to evaluate the accuracy and survival probability of the chromosomes [34]. For convenience, the general fitness function is adopted in this study for the optimal input sets: where ik y and ˆi k y are the k th experimental result and predicted value of the i th specimen, respectively. For the selection of the optimal input features, an ANN model with one hidden layer containing 20 neurons is used, as the optimal ANN architecture is still not obtained in this stage.
Fitter chromosomes from the current population have a higher probability to be selected to generate offspring population using genetic operators, namely crossover and mutation. The population size, number of generations, crossover rate and mutation rate of the GA were selected as 500, 25, 0.8, and 0.3, respectively, using the approach proposed by Oliveira et al. [35]. This evaluation process will be performed iteratively until the termination criterion (10 −3 ) is satisfied. After that, the most fitted features can be determined as the inputs for ANN training.
The optimal input feature subset with 10 nodes, including B , D , ' ρ and n were selected from the 24 features listed in Table 1 using GA. Laboratory test results reported by [36] also demonstrate that the selected optimal features are the most important parameters that affect the seismic performance of these RC columns. However, other parameters, like the effective cover thickness (d') and depth (de), may be also important. The reason for omitting them is that they are either similar in most specimens, or can be easily calculated. For example, the d' of most of the specimens is in the range of 0.02 m to 0.03 m, while the de can be determined based on B and d'. During the selection process, the fitness function is used to evaluate the quality of possible input subsets. There are several measurements can be used to evaluate the accuracy and survival probability of the chromosomes [34]. For convenience, the general fitness function is adopted in this study for the optimal input sets: where y ik andŷ ik are the kth experimental result and predicted value of the ith specimen, respectively. For the selection of the optimal input features, an ANN model with one hidden layer containing 20 neurons is used, as the optimal ANN architecture is still not obtained in this stage.
Fitter chromosomes from the current population have a higher probability to be selected to generate offspring population using genetic operators, namely crossover and mutation. The population size, number of generations, crossover rate and mutation rate of the GA were selected as 500, 25, 0.8, and 0.3, respectively, using the approach proposed by Oliveira et al. [35]. This evaluation process will be performed iteratively until the termination criterion (10 −3 ) is satisfied. After that, the most fitted features can be determined as the inputs for ANN training.
The optimal input feature subset with 10 nodes, including B, D, f c , f yl , λ, f yt , s, ρ l , ρ sv and n were selected from the 24 features listed in Table 1 using GA. Laboratory test results reported by [36] also demonstrate that the selected optimal features are the most important parameters that affect the seismic performance of these RC columns. However, other parameters, like the effective cover thickness (d ) and depth (d e ), may be also important. The reason for omitting them is that they are either similar in most specimens, or can be easily calculated. For example, the d of most of the specimens is in the range of 0.02 m to 0.03 m, while the d e can be determined based on B and d .

Training of the ANN Model
Before training the ANN model, all of the collected structural and material parameters of the 1163 specimens and the corresponding test results on the strength and drift capacities were normalized as the input and output vectors using the above approach. Then, 863 specimens in the database were randomly selected as the training set (n train = 863). While the remaining specimens were used as the test set (n test = 150) and validation set (n validation = 150), respectively.
The number of input and output nodes in the ANN architecture is obviously 10 and 4, respectively. The following parameters of the BPNN were used in training the model: (1) Error tolerance = 10 −3 ; (2) learning parameter = 0.15; (3) maximum number of iterations = 10 4 ; (4) momentum parameter = 0.05; (5) noise = 0.01. The mean absolute error (E) and goodness of fit (R 2 ) were adopted to evaluate the accuracy of the ANN model, as follows: where N is the number of specimens in the training set or test and validation set.
It should be noted that there is no specific rule or heuristics [37] to determine the number of hidden layers and the corresponding number of nodes. In general, an ANN model with too many neurons tends to fail due to an overfitting of the training data, whereas those with few neurons may not be able to capture the complex underlying relationship. Therefore, several trials were conducted for ANN models with different nodes and a single hidden layer to obtain an optimal ANN architecture for the strength and deformation capacity prediction of RC columns. For an optimal ANN architecture, the mean absolute error and goodness of fit of the ANN model should satisfy the following criteria: (1) Well-distributed around 0 (smaller E k ) and (2) as close as possible (larger R 2 k ). The mean value (E = E k , R 2 = R 2 k ) of the four properties was used. Figure 5 shows the test results of the trained ANN models with 10-50 hidden neurons. It can be observed that the mean absolute error and goodness of fit of the three properties are within the range of [0.126, 0.142] and [0.870, 0.882], respectively. As can be seen, the N-10-21-4 ANN architecture has the smallest E and largest R 2 , and thus was determined as the optimal ANN architecture.

Training of the ANN Model
Before training the ANN model, all of the collected structural and material parameters of the 1,163 specimens and the corresponding test results on the strength and drift capacities were normalized as the input and output vectors using the above approach. Then, 863 specimens in the database were randomly selected as the training set ( where N is the number of specimens in the training set or test and validation set. It should be noted that there is no specific rule or heuristics [37] to determine the number of hidden layers and the corresponding number of nodes. In general, an ANN model with too many neurons tends to fail due to an overfitting of the training data, whereas those with few neurons may not be able to capture the complex underlying relationship. Therefore, several trials were conducted for ANN models with different nodes and a single hidden layer to obtain an optimal ANN architecture for the strength and deformation capacity prediction of RC columns. For an optimal ANN architecture, the mean absolute error and goodness of fit of the ANN model should satisfy the following criteria: (1) Well-distributed around 0 (smaller k E ) and (2)   the smallest E and largest 2 R , and thus was determined as the optimal ANN architecture.

Validation
Experimental data in the training set, test set and validation set were used to validate the ANN model. Figure 6

Validation
Experimental data in the training set, test set and validation set were used to validate the ANN model. Figure 6   Different types of explicit formulas have been proposed by researchers or adopted in design codes for calculating the strength of RC columns. In general, concrete and reinforcement contribute to the shear capacity of RC columns with shear failure mode. Therefore, the explicit formula has two distinct parts.
where max V F = is the shear strength for shear-failed columns, while c V and s V are the contributions of concrete and steel reinforcement, respectively. In this study, three types of explicit formulas for predicting the shear strength of RC columns were adopted to validate the ANN model as summarized in Table 4.   Different types of explicit formulas have been proposed by researchers or adopted in design codes for calculating the strength of RC columns. In general, concrete and reinforcement contribute to the shear capacity of RC columns with shear failure mode. Therefore, the explicit formula has two distinct parts.
where V = F max is the shear strength for shear-failed columns, while V c and V s are the contributions of concrete and steel reinforcement, respectively. In this study, three types of explicit formulas for predicting the shear strength of RC columns were adopted to validate the ANN model as summarized in Table 4. Table 4. Explicit formulas for estimating the shear capacity of RC columns.

Ast fytD s
Note: k µ is the ductility-related strength degradation factor, and D is the distance from the extreme compression fiber to the centroid of the tension reinforcement. The meanings of other parameters in this table are the same as those in Table 1.
For flexural-dominated specimens, the lateral loading capacity is strongly dependent on the flexural strength (M c ) and the distance between the critical section of the plastic hinge and point of contra-flexure length (L), as follows: where ∆ c is the capping displacement corresponding to the maximum lateral strength (F max ). Fardis et al. [40] proposed a semi-empirical strength and deformation expression with good average agreement with test results. It has been widely used in the seismic performance assessment in engineering [29] M c /M y = (1.25)(0.89) n (0.91) 0.01 f c (10) where φ y is the yield curvature; δ is the ratio of the distance of the compression reinforcement center from the extreme compression fibers to the span depth (D); k y is the normalized compression zone depth; E s is the reinforcement elastic modulus; ρ, ρ and ρ v are the reinforcement ratios of tension, compression and web reinforcement, respectively. Figure 7 shows a comparison of the experimental results, predicted strength of the ANN model and calculated results from the explicit formula. In Figure 7a, the test results of the shear-failed specimens were obtained using the formula by [5], while the flexural failure specimens were selected from the database of this study. As shown in Figure 7a, the mean absolute error of the predicted shear strength of the ANN model is 0.107, which is smaller than those obtained from the three explicit formulas, 0.235, 0.207 and 0.217. In Figure 7b, it can also be observed that the mean absolute error of the ANN model is 0.087. However, the statistical result of the mean absolute error obtained from the design code is 0.101. It is illustrated that the results indicate that the ANN model yields a more accurate prediction of the structural strength of the columns. Figure 8 also shows a comparison of the test results of the ultimate drift of the column, and predicted results from the ANN model. It can be observed that the ANN model can also achieve a reasonable prediction of the ultimate drift. However, it is evident that the mean absolute error of the ultimate drift is larger than that of the strength prediction, particularly for specimens with large ultimate deformation capacity.
The mean absolute error and goodness of fit are found to be 0.1761 and 0.8114, respectively, for the training set, and 0.1783 and 0.8216 for the test and validation set, respectively. This is because the ultimate deformation of the column is more strongly affected by the nonlinearity of the structures. Another reason for there being less accuracy is that the ultimate deformation is more difficult to measure.  (9) where c Δ is the capping displacement corresponding to the maximum lateral strength ( max F ).
Fardis et al. [40] proposed a semi-empirical strength and deformation expression with good average agreement with test results. It has been widely used in the seismic performance assessment in engineering [29] ( )( ) ( ) where y φ is the yield curvature; ' δ is the ratio of the distance of the compression reinforcement center from the extreme compression fibers to the span depth ( D ); y k is the normalized compression zone depth; s E is the reinforcement elastic modulus; ρ , ' ρ and v ρ are the reinforcement ratios of tension, compression and web reinforcement, respectively. Figure 7 shows a comparison of the experimental results, predicted strength of the ANN model and calculated results from the explicit formula. In Figure 7a, the test results of the shear-failed specimens were obtained using the formula by [5], while the flexural failure specimens were selected from the database of this study. As shown in Figure 7a, the mean absolute error of the predicted shear strength of the ANN model is 0.107, which is smaller than those obtained from the three explicit formulas, 0.235, 0.207 and 0.217. In Figure 7b, it can also be observed that the mean absolute error of the ANN model is 0.087. However, the statistical result of the mean absolute error obtained from the design code is 0.101. It is illustrated that the results indicate that the ANN model yields a more accurate prediction of the structural strength of the columns.   The mean absolute error and goodness of fit are found to be 0.1761 and 0.8114, respectively, for the training set, and 0.1783 and 0.8216 for the test and validation set, respectively. This is because the ultimate deformation of the column is more strongly affected by the nonlinearity of the structures. Another reason for there being less accuracy is that the ultimate deformation is more difficult to measure.  To assess the ultimate deformation of lightly reinforced columns, Elwood et al. [6] proposed an explicit formula given as follows: where the effects of the transverse reinforcement ratio ( t ρ ), axial load ratio ( n ) and strength ( max F ) are taken into account. Furthermore, Lehman et al. [41] also developed an explicit method to evaluate the ultimate drift of flexural-dominated columns as follows: where y Δ , sp Δ and f Δ are the yield, slip and flexural displacement, respectively.
The above explicit formulas for calculating the ultimate drift were also used to validate the accuracy and effectiveness of the ANN model. Figure 9 shows a comparison of results obtained from the formula methods and test results from the collected test database, which are divided into two parts, namely the shear-failed specimens and the flexural-dominated specimens. It can be observed that the ANN model also yields better prediction of the ultimate drift than the two explicit formulas. The mean absolute errors obtained from the ANN model and the Elwood's method are 0.165 and 0.266, respectively, for the shear-failure columns, whereas those obtained from the ANN model and the Lehman's method are 0.188 and 0.316, respectively, for the flexural-failure columns. To assess the ultimate deformation of lightly reinforced columns, Elwood et al. [6] proposed an explicit formula given as follows: where the effects of the transverse reinforcement ratio (ρ t ), axial load ratio (n) and strength (F max ) are taken into account. Furthermore, Lehman et al. [41] also developed an explicit method to evaluate the ultimate drift of flexural-dominated columns as follows: where ∆ y , ∆ sp and ∆ f are the yield, slip and flexural displacement, respectively. The above explicit formulas for calculating the ultimate drift were also used to validate the accuracy and effectiveness of the ANN model. Figure 9 shows a comparison of results obtained from the formula methods and test results from the collected test database, which are divided into two parts, namely the shear-failed specimens and the flexural-dominated specimens. It can be observed that the ANN model also yields better prediction of the ultimate drift than the two explicit formulas. The mean absolute errors obtained from the ANN model and the Elwood's method are 0.165 and 0.266, respectively, for the shear-failure columns, whereas those obtained from the ANN model and the Lehman's method are 0.188 and 0.316, respectively, for the flexural-failure columns.  Since there are currently no explicit formula for the estimation of capping drift corresponding to the maximum strength, Figure 10 only shows the comparison of the experimental results of the capping drift of the column and the predicted results from the ANN model, with the mean absolute error of 0.0794 and 0.1071, goodness of fit of 0.8001 and 0.8345, for the training set and the test and validation set, respectively. Nevertheless, it can still be seen that the ANN-based model for the capping drift estimation is reasonable and relatively accurate. Since there are currently no explicit formula for the estimation of capping drift corresponding to the maximum strength, Figure 10 only shows the comparison of the experimental results of the capping drift of the column and the predicted results from the ANN model, with the mean absolute error of 0.0794 and 0.1071, goodness of fit of 0.8001 and 0.8345, for the training set and the test and validation set, respectively. Nevertheless, it can still be seen that the ANN-based model for the capping drift estimation is reasonable and relatively accurate. Similarly, Figure 11 shows a comparison of the yielding drift of column between experimental results and predicted results from the ANN model. The mean absolute error for the training set and the test-validation set is 0.1081 and 0.0752, and the corresponding goodness of fit is 0.8548 and 0.8638, respectively. Generally, the yielding drift of RC columns always appears earlier with low nonlinearity than the ultimate drift. Therefore, the proposed ANN-based model can get a better prediction on the yielding drifts than the ultimate drift due to the nonlinearity difference between these two physical parameters. Similarly, Figure 11 shows a comparison of the yielding drift of column between experimental results and predicted results from the ANN model. The mean absolute error for the training set and the test-validation set is 0.1081 and 0.0752, and the corresponding goodness of fit is 0.8548 and 0.8638, respectively. Generally, the yielding drift of RC columns always appears earlier with low nonlinearity than the ultimate drift. Therefore, the proposed ANN-based model can get a better prediction on the yielding drifts than the ultimate drift due to the nonlinearity difference between these two physical parameters.  For the estimation of the yielding drift, Fardis et al. [40] proposed a statistical formula based on the results of 963 tests, which has been widely used in engineering [29].  For the estimation of the yielding drift, Fardis et al. [40] proposed a statistical formula based on the results of 963 tests, which has been widely used in engineering [29].
where φ y is the yielding curvature of the section, α sl is the slip coefficient (equals 1 if slippage of longitudinal steel is possible, or 0 if it is not), and ε y is the yield strain of longitudinal steel. Figure 12 presents the comparison results of the yielding drift estimation of RC columns from the test set between the proposed ANN model and Fardis' method. It shows that the ANN model performs better than the empirical formula, with the mean absolute error of 0.2327 and 0.4572, respectively. Therefore, the developed ANN-based model can provide a more reliable and accurate prediction of the critical parameters for the LP model. For the estimation of the yielding drift, Fardis et al. [40] proposed a statistical formula based on the results of 963 tests, which has been widely used in engineering [29].
where y φ is the yielding curvature of the section, sl α is the slip coefficient (equals 1 if slippage of longitudinal steel is possible, or 0 if it is not), and y ε is the yield strain of longitudinal steel. Figure 12 presents the comparison results of the yielding drift estimation of RC columns from the test set between the proposed ANN model and Fardis' method. It shows that the ANN model performs better than the empirical formula, with the mean absolute error of 0.2327 and 0.4572, respectively. Therefore, the developed ANN-based model can provide a more reliable and accurate prediction of the critical parameters for the LP model.

Comparison with the Pseudo-Static Test
The boundary conditions of the RC column during pseudo-static tests are generally close to its real situation. Therefore, it is reasonable to validate the ANN-based bi-linear and tri-linear LP model through a quasi-stable test on a frame. This test was conducted by [42], and the simulation is carried out using the Open System for Earthquake Engineering Simulation (OpenSEES, [28]) platform, as shown in Figure 13a. Each column was simulated by one elastic element and two zero-length elements at the two ends, as shown in Table 5. As illustrated in Figure 13b, the ANN-based LP model can provide an acceptable prediction result, especially for the envelope curve. It also shows that the ANN-based tri-linear model performs better than the bi-linear one during the whole hysteresis curve prediction process, especially after the capping point. Table 5. ANN-based method for the LP model of the RC column sample [42].   Table Test For further validation, a time history response comparison was performed between the proposed ANN-based method and a shaking table test of a single-story RC frame [43]. As shown in Figure 14, the frame consists of two identical columns and a rigid link beam. Each column is modeled by one elastic element and two flexural springs at the two ends. Based on their properties, the ANN model was used to predict their strength and deformation capacities, as presented in Table 6. Therefore, the parameters for bilinear model ( , ,   Table Test For further validation, a time history response comparison was performed between the proposed ANN-based method and a shaking table test of a single-story RC frame [43]. As shown in Figure 14, the frame consists of two identical columns and a rigid link beam. Each column is modeled by one elastic element and two flexural springs at the two ends. Based on their properties, the ANN model was used to predict their strength and deformation capacities, as presented in Table 6. Therefore, the parameters for bilinear model (M y , K e , b) and tri-linear model parameters (K e , M y , β l , β c , M c /M y ) of their LP models were calculated, based on which the FE model was developed on the OpenSEES platform.

Comparison with Shake
During the test, the structure was subjected to a scaled ground motion with PGA = 1.28 g (recorded by TCU076NS in the 1999 Chi-Chi earthquake), as shown in Figure 15a. After performing the dynamic time history analysis, the drift response of the specimen was obtained and compared with the test result, as is shown in Figure 15b. It is observed that the ANN-based LP models can provide an acceptable estimation of the drift response before the collapse of the frame structure, especially for the tri-linear one. The corresponding maximum error of the drift response before the time of 22.6 s between the ANN-based tri-linear model and the test is very small. However, due to the limitation of the LP model itself, the collapse phenomenon of the specimen could not have been directly represented during the simulation. In spite of that, the proposed ANN-based model-free and data-driven method is still an effective and reliable candidate for the researchers and engineers to determine the key parameters of RC columns during finite element analysis.     [43]. During the test, the structure was subjected to a scaled ground motion with PGA = 1.28 g (recorded by TCU076NS in the 1999 Chi-Chi earthquake), as shown in Figure 15a. After performing the dynamic time history analysis, the drift response of the specimen was obtained and compared with the test result, as is shown in Figure 15b. It is observed that the ANN-based LP models can provide an acceptable estimation of the drift response before the collapse of the frame structure, especially for the tri-linear one. The corresponding maximum error of the drift response before the time of 22.6 s between the ANN-based tri-linear model and the test is very small. However, due to the limitation of the LP model itself, the collapse phenomenon of the specimen could not have been directly represented during the simulation. In spite of that, the proposed ANN-based model-free and data-driven method is still an effective and reliable candidate for the researchers and engineers to determine the key parameters of RC columns during finite element analysis. The developed ANN-based model for the LP parameters of RC columns has been already implemented in an efficient Matlab GUI, anyone has access to it in the supplementary materials.

Conclusions
This study explored the possibility of using an ANN-based method to rapidly determine the seismic performance of RC columns, as well as the development of the LP model for them. An ANN The developed ANN-based model for the LP parameters of RC columns has been already implemented in an efficient Matlab GUI, anyone has access to it in the supplementary materials.

Conclusions
This study explored the possibility of using an ANN-based method to rapidly determine the seismic performance of RC columns, as well as the development of the LP model for them. An ANN model was established and validated based on the large database of 1163 cyclic tests of RC columns to predict the strength and the yielding, capping and ultimate deformation capacities, which are critical for the commonly-used LP model. The following conclusions are drawn:

1.
On the basis of a large historical experimental database and advanced humanlike information processing algorithm, the proposed model-free method in this study can rapidly get more accurate input parameters for any bilinear and tri-linear LP model than current explicit formulas. The accuracy of the proposed method can also have been improved with the increment of the sample quantity of the database.

2.
The validation results through both the collected experimental data and several existing functions indicate that the ANN-based method can be effectively used to predict the most important characteristics of RC columns, which are also critical for the further modeling of structures. In addition, another advantage of the proposed model-free method is that the quantity of the input features could be easily changed according to the requirement of an arbitrary multi-linear model.

3.
The ANN-based LP model can help reduce the subjective and experimental errors. The prediction results of the RC frame structures using the well-trained ANN-based LP model show a good agreement with both the quasi-static and shaking table test results, especially for the pre-collapse stage. Thus, the model-free method based on the machine learning theory will be an innovative and promising approach for a fast seismic performance evaluation of the buildings and bridges in the future.
Thus, the model-free method based on the machine learning theory (e.g., ANN method) can be used as a promising surrogate for the rapid seismic performance evaluation of the buildings and bridges in the future.