Acoustic Emission Monitoring of Progressive Damage of Reinforced Concrete T-Beams under Four-Point Bending

Acoustic Emission (AE) is revealed to be highly adapted to monitor materials and structures in materials research and for site monitoring. AE-features can be either analyzed by means of physical considerations (geophysics/seismology) or through their time/frequency waveform characteristics. However, the multitude of definitions related to the different parameters as well as the processing methods makes it necessary to develop a comparative analysis in the case of a heterogeneous material such as civil engineering concrete. This paper aimed to study the micro-cracking behavior of steel fiber-reinforced reinforced concrete T-beams subjected to mechanical tests. For this purpose, four-points bending tests, carried out at different displacement velocities, were performed in the presence of an acoustic emission sensors network. Besides, a comparison between the sensitivity to damage of three definitions corresponding to the b-value parameter was performed and completed by the evolution of the RA-value and average frequency (AF) as a function of loading time. This work also discussed the use of the support-vector machine (SVM) approach to define different damage zones in the load-displacement curve. This work shows the limits of this approach and proposes the use of an unsupervised learning approach to cluster AE data according to physical and time/frequency parameters. The paper ends with a conclusion on the advantages and limitations of the different methods and parameters used in connection with the micro/macro tensile and shear mechanisms involved in concrete cracking for the purpose of in situ monitoring of concrete structures.


Introduction
Reinforced concrete is predominantly used in civil engineering structures for its desirable mechanical properties. However, the decline in strength of such structures is very common and is generally attributed to increased service loading, ageing, fatigue, corrosion, environmental impacts, etc. [1]. The presence of damage and progressive deterioration of an existing concrete structure may result in poor performance, and even incur failure under service loading. Therefore, it is of paramount importance to understand the involved damage mechanisms and their symptoms for proper maintenance of these structures ensuring safety, enhanced durability, and economic operations.
Acoustic emission is revealed to be highly adapted to monitor materials and structures in different fields (e.g., civil engineering) in a nondestructive way [2][3][4]. The AE technique is based on the detection of the rapid release of energy in the form of transient elastic stress waves from a localized source within a material [5,6]. These transient AE signals are usually burst type and can be related to different types of sources such as initiation/propagation of crack fronts, yielding, failure of bonds, fiber failure, delamination, etc. [7,8]. Despite the lack of a standard procedure that can be used for all types of structures [9], AE remains widely accepted and used in structural health monitoring of civil engineering structures [1] thanks to its non-invasive character. The main advantage of the AE-based technique is its ability to perform real-time continuous monitoring of the entire volume of a structure [10]. Consequently, it can provide early information about incipient damage and thereby prevent catastrophic failure of a structure. Although acoustic emission-based studies were introduced long ago [9], AE-based assessment of materials and/or structures has gained significant traction in recent times. Various AE features can be determined from an AE-based experiment, where it becomes possible to link the AE signals with the involved types of damage [6]. AE event rate analysis was used to monitor the deterioration processes during creep and fatigue tests in association with failure prediction models [11,12]. AE event rate was also used to study salt crystallization in limestones [13]. Some research revealed that AE signals can also be used to understand the nonlinear relaxation (slow dynamics) of concrete taken at the initial and micro-cracked states [14,15].
AE features have also been analyzed by means of physical considerations in order to evaluate the main damage mechanisms responsible for the detected AE events. Features such as rise time, signal amplitude, and average frequency were studied in different research in order to access the involved fracture created in different brick and mortar types [16][17][18]. The severity of damage can be evaluated through the seismology parameter called the b-value. Being sensitive to the coalescence of micro-cracks into macro-cracks, it has been used in different studies in order to detect and monitor critical damage development [19][20][21][22]. Based on the shift of the b-value and AE parameters, a statistical and fractal analysis was performed by Carpinteri et al. in which they showed that the energy release during the microcracking happens in a fractal domain and where the b-value was found to be related to the fractal dimension [23][24][25].
A waveform-based analysis of AE signals requires broadband AE sensors and high sampling rates. In the case of heterogeneous materials, the involved elastic waves can be attenuated, which makes the study of high-frequency components difficult [26]. A simplified signal analysis method was then proposed in which one takes into account the rise time, amplitude, and average frequency of the AE signals [18,27,28]. However, the use of the proposed method depends on the distance between the source and the sensor, which should be compensated in some cases as in ref. [29].
Several works have used AE-based techniques to classify crack modes in reinforced concrete beams subjected to bending [17,[30][31][32][33]. Ohno and Ohtsu [27] successfully used simplified Green's functions for moment tensor analysis (SiGMA) to quantitatively classify cracks in terms of tensile (mode I) and shear (mode II) in concrete structures. Probabilistic methods, such as Gaussian mixture modeling (GMM), have also been used to classify cracks using AE data [1,34]. Very few studies were found on support vector machine (SVM)-based classification of AE data. Das et al. [35] classified cracking modes in steel fiber-reinforced concrete beam under bending and strain hardening cementitious composite samples under tension loading using Gaussian Mixture Models and a SVM-combined framework. The crack mode classification showed that more events were caused by matrix cracking during strain hardening, while during the softening phase, a larger number of events were found to be associated with fiber pull out. R. V. Sagar [36] classified the crack modes in reinforced concrete beams using GMM and the results were validated with the help of SVM.
The AE events were localized with the help of localization algorithms [30] by considering the arrival time of longitudinal AE waves across sensors, and it was observed that localization can be improved with the help of a well-arranged network of sensors. Mirgal et al. [37], with the help of their localization algorithm, found that sensors placed in a zigzag fashion can provide better localization of pencil lead break (PLB) sources than using a rectangular or circular arrangement on the surface of concrete slab. Soulioti et al. [31] studied the effect of fiber content on fracture modes in concrete beams using AE data. They observed that for unreinforced concrete, the dominating fracture mode is tensile in nature, whereas in the case of reinforced concrete the dominating mode of fracture changes to shear as fiber content increases. Anay et al. [38] monitored cement paste-based specimens by AE under compressive loading and classified progressive failure mechanisms. They identified three distinct crack behaviors, namely microcrack initiation, crack extension, and unstable crack growth.
The huge quantity of AE data makes the reduction of their dimensionality necessary. For that purpose, principal component analysis (PCA) was often used in different applications including concrete and composites [38,39]. PCA is often followed by a clustering using an unsupervised technique. In that sense, Sena et al. [40] studied the micro cracking of steel fiber-reinforced concrete beams under three-point bending. The multivariable clustering revealed the existence of two types of failures, namely matrix failure and steel fiber/matrix debonding. Calabrese et al. [41] applied PCA to reduce the dimension of AE data obtained from a four-point bending test on concrete beams. The Kohonen's self-organizing map (SOM) algorithm, which is an artificial neural net based on an unsupervised learning method, was implemented with the aim to discern clusters and relate them to the AE patterns. Sun et al. [42] investigated the failure process involved in crumb rubber concrete under four-point bending using AE techniques. Various clustering methods, namely k-means, fuzzy c-means (FCM), self-organizing mapping (SOM), Gaussian mixture model (GMM), hierarchical model, and density peak clustering were used in order to find the best algorithm for that particular scenario, and the density peak algorithm was found to produce best results. The main advantage of the AE-based technique is its ability to monitor the entire volume of a structure in real time [10]. Hence, AE monitoring can provide early information about incipient damage and thereby prevent catastrophic failure of a structure.
The present research work focused on the characterization of civil engineering concrete T-beams reinforced with steel bars subjected to four-point bending quasi-static tests. Steel bars prevent fragility and associated catastrophic failure and improve the post-peak behavior. They consequently offer a considerable enhancement of the load-carrying capacity and delayed failure was obtained [31]. Mechanical tests were monitored with AE sensors in order to extract the different AE signals emitted during the creation and propagation of micro-cracks. In order to highlight the dependence of the AE activity on the loading rates, bending tests were performed at different rates and AE signals were analyzed according to different time and frequency domain parameters. According to the literature, three different definitions of b-value can be used. This work compared the three definitions of the of b-value parameter, used in geophysics to study the fracture process in rocks, and drew a conclusion about their relevance in the case of the damaged concrete samples. AE data analysis was then performed with the help of specific waveform parameters, namely RA-value (rise time to maximum amplitude) and AF (average frequency). The use of a supervised learning approach based on the support vector machine (SVM) allowed us to separate the load-displacement curve into different areas according to the involved damage mechanisms. Moreover, unsupervised machine learning for clustering AE data allowed us to identify three clusters which were revealed to be frequency independent. This work ends by presenting the time/frequency characteristics of the AE signals according to the involved damage mechanisms.

Theoretical Background
Various AE features can be collected during an experiment with the help of a sophisticated AE monitoring system. Upon analyzing the AE features by using suitable algorithms, a good insight about the damage stage can be ascertained. Some useful algorithms that are usually used in AE based analysis are presented in the following.

Average Frequency (AF) and RA Value (RA)
Average frequency and RA value are two important quantities that can be used to determine crack modes in concrete structures [43]. Figure 1a presents the main parameters used to obtain the AF and RA of an AE waveform. Average Frequency (AF) is defined as the ratio between the number of counts and duration of an AE waveform. It basically determines the number of threshold crossings per unit time of an AE waveform.
where "Counts" indicates the number of times the signal amplitude exceeds the fixed threshold over the entire duration of the AE waveform. RA value of an AE waveform is defined as the ratio between the rise time and amplitude: AE waveforms generated due to tensile cracks ( Figure 1b) have shorter rise time [17,44]. Hence, tensile-type cracks usually generate AE signals with lower RA values and higher AF. On the other hand, in the case of shear-type cracks (Figure 1c), the AE waveforms are longer, RA values are relatively higher, and AF is lower [44].

The Three Different b-Values
The b-value analysis methods are generally used in geophysics to study the fracture process in rocks [45]. It uses amplitude of AE events. According to the literature, there are three different types of b-value, namely b 1 -value, b 2 -value, and b 3 -value.

b 1 -Value
The pioneer work using the b 1 -value to assess the progressive failure was carried out by Shiotani et al. [46]. The authors suggested that the b 1 -value has the potential to be a precursor of slope failure. The expression of b 1 -value is given as: where N is the number of recent events, µ is the mean value of the amplitudes of those recent events, σ is the standard deviation of the amplitudes of the same group of events, and α 1 and α 2 are empirical constants that specify which part of the population is considered. Usually, α 1 and α 2 are considered to be 0 and 1, respectively. Although b 1 -value was first used to study the fracture process in rocks, it was later used by many researchers to study fracture process in concrete [47,48]. Researchers in refs. [47,49] found that micro-cracks lead to relatively higher values of b 1 -value and macrocracks lead to lower b 1 -value. Hence, a decrease in the b 1 -value may indicate successive accumulation of stress associated with a propagating rupture front [47].

b 2 -Value
In the field of seismology, C. F. Richter and B. Gutenberg proposed an empirical relationship between earthquake frequency and magnitude, which is commonly known as the GR law, and the exponent of this law is known as the b-value [50]. The expression for the seismic b-value is given as: where N is the number of earthquakes whose magnitude is ≥ M. a and b are constants in an area over a specified span of time. [51,52]. Due to the resemblance between acoustic emission in materials and earthquakes, the b-value analysis has been incorporated in the progressive damage assessment of concrete structures. However, since the peak amplitude of an acoustic emission signal is measured in decibels (dB), and on the other hand, the magnitude of an earthquake is measured in the Richter Scale that considers the logarithm of seismic wave amplitude [45], it is necessary to use an adapted formula for the calculation of the "new" b-value, noted as the b 2 -value: N is the number of acoustic emission hits with an amplitude ≥ A dB , and a 2 is a constant that depends on the background noise (environment). The b 2 -value is basically the slope of the log-linear graph of the frequency-magnitude distribution of AE hits.

b 3 -Value
The b 3 -value was proposed by K. Aki [53]. He considered the discrete frequency distribution of earthquake magnitudes and, by means of statistical analysis, proposed the maximum likelihood of the b-value of the Gutenberg-Richter form of distribution. The expression of the b 3 -value is given as follows [45]: where, a avg is the average amplitude and a c is the threshold magnitude [53,54]. In the context of AE-based damage assessment using the b 3 -value, earlier research works were carried out to find the fracture process in rocks [54,55]. Later, some researchers implemented this method in the progressive damage assessment of concrete structures [50].

Machine Learning Based Approaches
Two types of the machine learning techniques, i.e., supervised learning and unsupervised learning, were used in the present research work for the interpretation of AE data. In the case of the supervised learning technique, a model is trained with labeled data to predict future outputs. On the contrary, an unsupervised learning technique does not need labeled data, but it can find the hidden patterns in the input data. There are many supervised as well as unsupervised learning algorithms available; however, the discussion is limited to only those used in the present study, which are presented below.

Supervised Learning Using Support-Vector Machine
Support-vector machine (SVM) is a widely used supervised learning method for classification [56]. The SVM algorithm searches for an optimal separation boundary (usually called the hyperplane or the decision boundary), which maximizes the margin as shown in Figure 2. Such maximization of the margin between the two classes on the training data results in an efficient classification on test data [57]. The points on the margins are called support vectors [57]. In SVM, the training data are first mapped in space and then the test data are mapped in the same space to determine in which class they fall. The Lagrangian dual problem [58] for linearly and nonlinearly separable classes can be expressed as follows.
, is the dot product (i.e., inner product) of the transformed features using kernel function φ. Some well-known kernel functions are the Gaussian radial basis function (RBF), polynomial, sigmoid (i.e., neural network), and linear. The RBF and linear kernel functions were used in the present study and are given as: Linear : Nonlinear SVM works in the transformed predictor space to find an optimal hyperplane. The advantage of the RBF kernel is that it has an efficient learning ability in nonlinear cases [59].

Unsupervised Learning Approach
The unsupervised learning approach adopted in the present study consists of three steps: feature selection, feature optimization (i.e., dimensionality reduction), and clustering. Although clustering of data is the main goal to acquire an insight into the hidden patterns of the data, the other two steps are very important to improve the abovementioned clustering. In the present study, important features were identified using Laplacian score and feature optimization has been carried out using principal component analysis (PCA). Finally, the clustering of AE data was performed with the help of a k-means algorithm considering optimized data ( Figure 3). Laplacian score (LS) is an advanced variance analysis tool that prefers features with larger variance having more locality preserving ability [60]. An important feature is determined with the help of its Laplacian score. A good AE feature that can cluster data has an LS greater than 0.9 [61]. Thus, with the help of LS analysis only the important features can be selected for the subsequent principal component analysis. Principal component analysis (PCA) [39] is a well-known feature optimization technique [41]. It is based on the eigen decomposition of the feature covariance matrix. The principal components are orthogonal to each other, and each principal component is basically a linear combination of the original variables. Principal components have a decreasing trend in variance, i.e., the first principal component has the maximum variance. Projection of each observation on the first principal component axis would therefore result in maximum variance, while projection of the same observations on second principal component axis would yield second largest variance, and so on. Usually, only the first few PC are used for subsequent analysis since the cumulative variance of these PC is quite large. Finally, the k-means algorithm partitions data into k-number of mutually exclusive clusters [62]. k-means minimizes the sum of the distances between the centroid and all member objects of each cluster. The point to be noted here is that the value of k of the algorithm is not known initially but chosen depending on some cluster validity indices. The widely used cluster validity indices found in the literature are the Davies-Bouldin (DB) index and the Silhouette coefficient (SC), which are defined below.
Davies-Bouldin (DB) index: It is defined as the ratio of the sum of within-cluster scatter to between-cluster separation [63].
where, k is the number of clusters, and d i and d j are average within-cluster distances of clusters i and j, respectively. D ij is the distance between the centers of the i th and j th clusters. It can be easily understood from this formula (Equation (11)) that a lower value of DB indicates a better separation of the clusters with a good compactness inside the clusters. [64] Silhouette coefficient (SC): This is an interpretation and validation method of consistency of data within the clusters. The Silhouette value can be calculated as follows [65]: where A(i) is the average dissimilarity of i with all other data within the same cluster. B(i) is the lowest average dissimilarity of i to any other cluster of which i is not a member. A high Silhouette value indicates that the object is well matched to its own cluster and poorly matched to the neighboring clusters. SC ranges from 0 to 1, and a value of SC greater than 0.6 generally assures that the clustering is of a sufficient quality [66]. DB and SC are therefore used for obtaining the optimal cluster number, which is simultaneously indicated by a lower Davies-Bouldin index and a higher Silhouette coefficient [67,68].

AE Monitoring of Mechanical Tests
Four-point bending tests were conducted on 3.5 m-long samples of reinforced concrete T-beams with the help of a Universal Testing Machine (INSTRON 8801). The T-beams were manufactured by RECTOR ® following the French standard NF EN 15804+A1 and its national complement NF EN 15804/CN. The mechanical properties of the beams are given in Table 1 and the dimensions of the same in Figure 4. The distance between the bottom supports is 3 m and the distance between the loading points is 1 m ( Figure 5). The load is applied with the help of a servo-controlled hydraulic actuator of the Universal Testing Machine, in a displacement-controlled manner. Different displacement rates (1 mm/min, 2 mm/min, and 4 mm/min) were applied on identical samples, namely sample 1, sample 2, and sample 3, respectively. The AE set-up ( Figures 5 and 6) consists of sensors, preamplifiers, and a data acquisition system (i.e., PCI-2 of Physical Acoustics). Four sensors capture acoustic emission hits created during the occurrence of damage events in the tested sample. The sensors used were broad band-type PAC (MICRO-80) and the selected frequency bandwidth was 20 kHz-1 MHz for all the tests. A threshold of 45 dB and a pre-amplifier gain of 40 dB were assigned as inputs. In order to check the sensitivity and coupling of the sensors, pencil lead break (PLB) was performed before the actual AE monitoring of each sample.

Global Analysis of Damage
In the present work, the four-point bending test was coupled with AE monitoring, where the main characteristics of the performed mechanical tests are presented in Table 2. In Figure 7a, it can be observed that results corresponding to the three different tests were very close to each other and within the limit of proportionality. It can also be observed that the load-displacement curves of sample 1 and sample 2 had a good agreement even in the major part of the nonlinear zone. Note that the behavior of sample 3 was different from those of samples 1 and 2 due to its relatively high loading rate. It should be noted that, in the load-displacement curve, a sudden drop in the value of load in the nonlinear zone indicates the formation of a major crack in the sample during the test. We also note that the change in the speed ratios between the three different tests has a clear effect on their durations. This can be verified in Figure 7b with the help of the sudden drop in the value of load in the nonlinear zone, where the final cracking happened approximately at 650 s, 1300 s, and 2300 s for tests 1, 2, and 3, respectively. In the literature, it was found that the main crack proceeds with increasing velocity for increasing loading rates [69]. Indeed, the crack velocity increased by orders of magnitude at high loading rates while it propagated in an almost constant way at lower loading rates [70,71].  In order to explain the cracks in terms of their appearance and orientations, and corresponding typical AE-signals, during the successive loading stages, a schematic presentation is given in Figure 8. In the initial stages of loading, as shown in Figure 8a, a few tiny vertical cracks were observed, mostly in the span between the loading points (upper). These are tensile cracks. With the increase in the loading, these tiny cracks increased in size and crack tips propagated upwards. A few more cracks were also seen to develop during this period. When the applied loading reached considerably high values during final stages of loading, shear cracks, mainly oriented horizontal or inclined, as shown in Figure 8b, started to develop and quickly became significantly large, resulting in a significant reduction in strength, indicating impending failure of the sample. The typical AE signals during the initial and final stages of loading are also shown in Figure 8. The AE signals during initial stages of loading were characterized by high frequency and low rise time; on the contrary, the AE signals during the final stages of loading had low frequency and higher rise time. These explain AE signals of the initial and final stages of loading to possess significantly different RA values. In our tests, various features (i.e., amplitude, rise time, counts, duration, energy, number of hits, etc.) [72] of the acoustic emission signals can be used for analysis with the aim to monitor the progressive damage of the samples under the four-point bending test. It was observed that the AE activity evolves as function of the applied load, where amplitudes of the involved AE hits become higher in the case of a major crack (Figure 9a). On the other hand, the cumulative number of hits can be considered as a global parameter, which can be sensitive to the mechanical test conditions (i.e., load rate). Figure 9b shows that the evolution of the cumulative hits per unit time is constant for each mechanical test. This is in agreement with the results of the literature, which predict a constant velocity for the cracks for moderate loading rates [70,71]. On the other hand, the evolution of the cumulative hits per unit time was found to be proportional to the loading rate, where the highest value was found for the loading rate corresponding to 4 mm/s, which created the shortest nonlinear zone.

Physical Parameters-Based Analysis of Damage
The physical parameters discussed in the theory section can also be used for the interpretation of AE data. Since macro-cracking is accompanied with an important release of stored strain energy within tested materials, relatively small b-values can be expected. The prediction of macro-cracking can therefore be performed on the basis of the dominating micro-cracks events having small energies. Large energies are emitted when macro-crack events are created (through the coalescence of micro cracks) which makes the b-value drop. Results obtained using the three different b-value definitions show that b 1 -value, b 2 -value, and b 3 -value sharply decreased when macro-cracking events occurred ( Figure 10). The three definitions were effectively able to identify both small-scale and large-scale events in accordance with previous studies performed on geomaterials [45,73,74]. Finally, we note that a quantitative distribution based on the abovementioned b-values can also be performed in order to predict large-scale events and/or ultimate failure, as was carried out to predict the fracture damage of sandstone, where it was found that the ultimate failure happened when the three types of b-values were the minimum [45]. This observation is in agreement with our results. The monitoring of the fracture mode can be performed by evaluating the average frequency (AF) and RA value. Indeed, when the fracture mode is dominated by tension-type cracking, the value of AF is higher, and the RA value is lower. However, in the case of a shear-type cracking, there is a sudden decline in AF and a sharp increase in RA value. As shown in Figure 11, the pattern was found to be consistent for the different mechanical tests. According to literature, early stages of damage are mainly due to tensile cracking modes. As damage evolves, more shear cracks are created. This observation, which has been confirmed by many studies [32,75], justify the results presented in Figure 11, in which high sensitivity of both parameters to the type of cracking can be observed.

Machine Learning Based Approaches
In order to classify the AE data, machine learning methods (i.e., supervised and unsupervised) were employed. In the case of supervised learning, support-vector machine (SVM) was used to classify the AE data. On the other hand, in the case of unsupervised learning, principal component analysis (PCA) and k-means were implemented for dimensionality reduction and clustering of the AE data, respectively. However, a point to be noted here is that a large number of features were used in the unsupervised machine learning approach, whereas the supervised machine learning was only based on two features, namely average frequency and RA value.

Supervised Learning Using SVM
Support-vector machine (SVM) is vastly used in supervised learning. However, to implement this method, a labeled data set is required for training the model. Therefore, at first, the AE data set consisting of average frequency (AF) and RA value, which was used in this supervised learning, were labeled. This step is mandatory for the algorithm to work. It is also very important since the performance of the algorithm depends on this. Since the load-displacement characteristic of the tested beam samples has three separate zones (i.e., linear, nonlinear, and shear), the labeling was carried out based on that. In other words, AE data which correspond to linear, nonlinear, and shear zones were labeled as Zone 1, Zone 2, and Zone 3, respectively ( Figure 12).   The results obtained using SVM showed that Zone 1 could not be well separated from Zone 2 ( Figure 13). This is reasonable because Zone 1 data corresponds to tensile micro cracks, while Zone 2 data corresponds to micro as well as moderate cracks. Basically, the characteristics of Zone 1 data are similar to the characteristics of a fraction of the Zone 2 data. Hence, in the subsequent analysis, using SVM only two labels were used. In other words, the linear and nonlinear part have the same label, namely Zone 1 + Zone 2, and the shear part was assigned the same label as before, i.e., Zone 3. On the other hand, it was also observed that the performance of SVM is better for the Gaussian kernel compared with linear kernel. Therefore, the SVM analyses were performed using the Gaussian kernel only. Very few studies are available on the application of SVM for the classification of AE data obtained from concrete structures. SVM was adopted in refs. [35,36] to classify cracks in reinforced concrete using RA-AF data. AE data were classified into two classes, i.e., tensile and shear, in the RA-AF plane using a linear kernel. A point to be mentioned is that the choice of the kernel type depends on the pattern in the data. The Gaussian kernel trick of SVM was found to be very efficient in the classification of AE data (i.e., average frequency and RA value) with the help of its nonlinear hyper plane ( Figure 14). Although the supervised machine learning approach using SVM worked well for the classification of AE data, it is difficult to classify data if multiple mechanisms are present in a particular zone. For example, in the nonlinear zone (i.e., Zone 2), where multiple damage mechanisms are present, it remains difficult to label data of Zone 2 with different labels that correspond to different damage mechanisms. Hence, there is a need for an unsupervised learning approach because it does not require pre-assigned labeled data for the classification.

Unsupervised Machine Learning for Clustering of AE Data
A large number of features can be considered for the unsupervised approach as shown in Table 3. The determination of the Laplacian score (LS) corresponding to each feature helps to identify the most relevant ones. The latter are characterized by a large variance and have more representative power. In this study, the features which have LS> 0.9 were chosen for subsequent principal component analysis (PCA) to reduce the dimensionality of data. Finally, only the first few principal components, which have cumulative variance >90%, were used for the subsequent k-means method to cluster AE data. A point to be noted here is that an optimal number of clusters for the k-means was determined with the help of the Davies-Bouldin (DB) index and Silhouette Coefficient (SC), as discussed in the theory section. Typical results of the pre-requisite steps before clustering by k-means are given in Figure 15 using the AE data of sample 1. Although the Laplacian score of each feature was used for the selection of features that have larger variance with more locality preserving ability, the standard deviation of each feature is also given in Figure 15a to show the correspondence between Laplacian score (LS) and standard deviation (SD) of each feature. A similar trend can be observed through both LS and SD data. Based on the criterion LS > 0.9, as given in Figure 15b, it can be observed that the three first principal components represent more than 90% of the total variance, as shown in Figure 15c. Hence, only these first three principal components were considered for the subsequent k-means algorithm. The optimal number of clusters for k-means analysis was determined based on the Davies-Bouldin (DB) index and Silhouette coefficient (SC) (Figure 15d), which should be simultaneously the minimum and maximum, respectively. Finally, note that similar trends were also found for the AE data of sample 2 and sample 3, but are not included here. According to the explanations given above, the results obtained using k-means for three clusters are presented in Figure 16a. The three clusters do not have the same char-acteristics in terms of time and frequency parameters. Unlike cluster 3, cluster 1 has the highest frequency components, as shown in Figure 16c,e. On the other hand, cluster 2 has frequency components of intermediate values, as shown in Figure 16d. Figure 16b shows that the cumulative hits for cluster 1 increased consistently until the major shear crack happened before keeping a constant value. The cumulative hits for cluster 2 were found to be gradually increasing from start till the occurrence of the major crack, where a moderate jump could be observed during the major shear crack. The cumulative hits for cluster 3 increased slowly from the start and a significant jump could be observed during the occurrence of the major shear crack. Finally, note that these observations were found to be consistent for all three samples. The three clusters mentioned earlier can be associated with microcracking at the matrixaggregate interface/reinforcement and/or micro-cracking in the matrix. An amplitudebased correspondence between the amplitudes of the signals and the mechanisms can be performed as in ref. [76]. However, differences between ranges of amplitude of classes are to be expected depending on the specimens' geometries and recording system parameters [77][78][79][80]. On the other hand, a tensile crack is created at the initial loading stage and evolves throughout the loading, whereas shear movements are restrained by the high-strength steel reinforcements inside the beam [33]. However, we should note that due to shear displacement, more friction at shear cracking is to be expected, where the resulting shear waves filter out high-frequency components more easily than the longitudinal waves created by the tensile cracking [81].
In light of these observations, the results obtained suggested that cluster 3, which has the weakest frequency components (around~40 kHz) is related to friction while cluster 1 is basically related to tensile cracking [82]. Cluster 2 with its intermediate frequency components (around~170 kHz) appears as a representation combining both shear and tensile cracking. This has been reported in the literature as a tensile cracking mixed with friction between crack faces [83]. Therefore, the performed analysis completes the one based on the evolution of RA value and average frequency. Indeed, the RA parameter may be insufficient to distinguish between tensile cracking and friction due to the attenuation of elastic waves emitted by the creation and propagation of cracks within heterogeneous materials such as concrete [84]. Due to attenuation, an increase in RA value with decreasing amplitude (i.e., decrease in the average frequency) will then lead to erroneous signal labeling, which makes the effectiveness of such a classification questionable beyond a certain propagation distance [82].

Conclusions
Acoustic emission (AE)-based techniques were adopted to monitor progressive damage mechanisms in reinforced concrete T-beams subjected to four-point bending test. The test was conducted on three identical samples but with different displacement rates to observe the effect on global damage behavior. Various analysis schemes were adopted. Based on the analysis of the results, the following conclusions are drawn:

•
The load-displacement curves obtained for the three samples were similar, hence repeatable. In the load-time curves, it was found that rate of displacement has a direct impact on the rate of acoustic emission events; a higher displacement rate results in a higher rate of AE events. • Various physical parameter-based algorithms were adopted to discern different damage mechanisms in reinforced concrete T-beams under test. Although different algorithms may use different AE feature/s, they are able to provide useful information about the progressive damage stages. For instance, AE features used in average frequency (AF) and RA value are different; however, these two algorithms separately able to distinguish the stages of tension-type cracking and shear-type cracking, effectively.

•
In the case of b 1 -value, b 2 -value, and b 3 -value analysis, each of these algorithms use AE amplitude. It was found that all these b-value algorithms can identify micro-damage and macro-damage cases by showing a sudden drop in their respective indices in case of a macro-damage. However, b 1 -value and b 3 -value were found to be more sensitive in discriminating micro-damage and macro-damage compared with b 2 -value.

•
With a view to classify and cluster AE data, supervised and unsupervised machine learning methods were adopted, respectively. In case of the supervised machine learning using SVM, two classes were successfully made considering only the average frequency (AF) and RA value as features in the algorithm. The Gaussian kernel trick of SVM was found to be very efficient in the classification of AE data with the help of its nonlinear hyper plane. Although the supervised machine learning approach using SVM worked well for the classification of AE data, it is difficult to classify the data if multiple mechanisms are present in a particular zone of load-displacement curve, because the labeling of data becomes very difficult in such a scenario. Hence, there is a need of an unsupervised learning approach as it does not require labeled data for the classification.

•
In the case of the unsupervised machine learning approach, a large number of features were considered. With a view to reduce the dimension and optimize the AE data, Laplacian score and principal component analysis (PCA) were performed. Finally, the k-means algorithm was used for clustering of data, considering only the first few PCs. Three clusters were obtained. Based on the cumulative hits of different clusters and frequency domain analysis of randomly chosen signals from each of the clusters, it can be concluded that the three clusters correspond to three different types of damage cases, namely, tensile cracking (cluster 1), both shear and tensile cracking (cluster 2), and friction (cluster 3).
Future studies can be focused on analysis of new time/frequency parameters that are to be defined in accordance with the involved micro-mechanisms (e.g., shear and/or tensile) in order to refine the damage analysis in reinforced concrete.