COA-Optimized Kernel K-Means Clustering for Identifying Acoustic Emission Signals Associated with Different Damage Types in RC Beams

Wang, Xianqiang; Feng, Xiaonan; Yi, Fan; Wang, Yaoxuan

doi:10.3390/buildings16132617

Open AccessArticle

COA-Optimized Kernel K-Means Clustering for Identifying Acoustic Emission Signals Associated with Different Damage Types in RC Beams

¹

State Key Laboratory of Safety, Durability and Healthy Operation of Long Span Bridges, JSTI Group, Nanjing 210019, China

²

School of Civil Engineering, Southeast University, Nanjing 211189, China

³

School of Safety Science and Engineering, Nanjing University of Science and Technology, Nanjing 210019, China

^*

Author to whom correspondence should be addressed.

Buildings 2026, 16(13), 2617; https://doi.org/10.3390/buildings16132617

Submission received: 5 June 2026 / Revised: 25 June 2026 / Accepted: 26 June 2026 / Published: 30 June 2026

(This article belongs to the Special Issue Transformable and Modular Building Systems: Mechanics, Design, and Applications Innovations)

Download

Browse Figures

Versions Notes

Abstract

Acoustic emission (AE) signals associated with different damage processes in reinforced concrete (RC) beams often show overlapping feature distributions, making unsupervised identification difficult. In this study, three RC beams were tested under loading-induced damage, freeze–thaw damage and reinforcement corrosion conditions, and 490 valid AE samples were obtained. Seven AE parameters were selected to construct the clustering feature set. To enhance the separation of different damage-related AE signals, a kernel K-means clustering framework was adopted, and the coyote optimization algorithm was used to optimize the kernel function type and key parameters based on the Gap Statistic. Comparative analysis with K-means, FCM and GMM was also conducted. The results show that the COA-optimized kernel K-means method achieved the best overall clustering performance, increasing the mean ACC by 10.35 percentage points compared with K-means and by 2.49 percentage points compared with GMM. Its mean ARI was also higher than that of GMM by 0.0330, while the standard deviation of ACC decreased from 7.73% to 4.19%. Class-level results indicated that loading-induced AE signals were more readily identified, whereas freeze–thaw and corrosion signals were more affected by feature overlap. Feature interpretation further showed that the main misclassified samples were located in transitional feature regions. The results suggest that COA-based kernel optimization can improve the clustering separation and stability of AE signal identification for different damage types in RC beams.

Keywords:

reinforced concrete beam; acoustic emission signal; damage type identification; unsupervised clustering; COA-optimized kernel K-means

1. Introduction

Reinforced concrete bridges are susceptible to vehicle loads, chloride erosion, freeze–thaw cycles, environmental humidity, and alkali–silica reaction (ASR) during long-term service, leading to multiple types of damage, such as cracking, reinforcement corrosion, and material deterioration [1,2]. Such damage is usually hidden and progressive; if not identified in time, the load-bearing capacity and durability of the structure may deteriorate, thereby threatening bridge service safety. As a passive non-destructive testing method, acoustic emission (AE) technology can capture transient elastic waves released by internal microcracking. Owing to its high sensitivity to microcrack initiation and propagation, AE technology can reflect the dynamic process of structural damage development [3,4], and therefore holds considerable application potential for damage identification in concrete structures.

Existing studies have shown that AE signals are closely related to different damage processes in concrete structures. Under loading and mechanical failure, AE parameters such as event counts, ring-down counts, energy, and duration can be used to characterize crack initiation, propagation, and failure processes [5,6,7]. In recent years, Zhou et al. [8], Rather et al. [9], and Cui et al. [10] further verified the capability of AE signals to characterize load-induced damage from the perspectives of fatigue damage assessment, interface degradation identification, and fatigue crack propagation analysis, respectively. For reinforcement corrosion damage, previous studies have shown that hit counts, cumulative signal strength, and absolute energy can reflect corrosion initiation, passive film breakdown, and corrosion-induced cracking processes [11,12,13,14]. For freeze–thaw damage, related studies have indicated that freeze–thaw conditions and saturation state affect AE response characteristics [15,16]. Choi et al. [17], Mainali et al. [18], and Todak et al. [19] further revealed the characterization capability of AE signals for freeze–thaw damage from the perspectives of freeze–thaw damage response, stage-dependent AE activity, and the influence of saturation state.

However, owing to the complex internal structure of concrete, AE signals are inherently burst-like, non-stationary, and multi-parametric. It is difficult to accurately identify different damage modes using only a single parameter or manually defined thresholds. With the development of data processing methods, machine learning has gradually been introduced into AE signal analysis. Thirumalaiselvi et al. [20] used pattern recognition methods to characterize crack development during progressive damage in large concrete structures. Abdeljaber et al. [21] and Ding et al. [22] carried out structural damage identification based on convolutional neural networks and deep belief networks, respectively, demonstrating the potential of machine learning methods for feature extraction and damage identification in complex structural responses. Compared with supervised learning, unsupervised clustering methods do not require pre-labeled samples and can mine the latent internal structure of AE signals. Previous studies have applied clustering methods to AE signal interpretation in different cement-based materials, including polymer concrete damage pattern recognition and steel fiber reinforced concrete fracture classification [23,24]. More recently, K-means clustering has been used to analyze the failure process and damage mechanisms of BFRP-strengthened concrete beams based on AE parameters [25], while GMM-based clustering has been adopted to identify tensile crack AE events and predict crack propagation in lightly reinforced concrete beams [26].

Although these studies demonstrate the applicability of clustering and machine learning methods in AE signal interpretation, many existing AE clustering studies still rely on conventional clustering algorithms in the original or reduced Euclidean feature space. Such methods may be affected by initialization, cluster shape assumptions, and overlapping feature distributions. In particular, AE signals generated by different damage mechanisms may exhibit nonlinear and locally overlapping distributions in multi-parameter feature space, which limits the separability achieved by conventional clustering methods. Kernel-based methods provide a potential way to improve nonlinear separability through implicit feature space mapping. In AE-based structural damage monitoring, kernel methods such as kernel ridge regression have been used for corrosion-induced damage assessment in reinforced concrete structures [27]. In the broader field of structural health monitoring, adaptive kernel spectral clustering has also been applied for automated damage detection under environmental and operational variability [28]. However, the performance of kernel clustering methods is strongly affected by the selected kernel function and its parameters. Although optimization algorithms have been introduced to improve AE data clustering, such as genetic algorithm-based AE clustering optimization [29], automatic optimization of both kernel type and key kernel parameters for unsupervised AE damage-type identification remains insufficiently explored.

Nevertheless, from the perspective of multi-type damage identification in RC beams, most existing studies still focus on a single damage mechanism or a specific experimental scenario. In practical monitoring of RC beams, AE signals may be associated with different damage types, such as loading-induced cracking, reinforcement corrosion, and freeze–thaw deterioration. Due to the overlap of time-domain, frequency-domain, and energy-related features, distinguishing these damage-related AE signals remains challenging [30]. Therefore, improving the identification accuracy of AE signals corresponding to different damage types is still an important issue in AE-based monitoring applications.

To address this issue, this study focuses on AE signals collected from RC beams under three different damage conditions, namely loading-induced damage, corrosion damage, and freeze–thaw damage. A COA-optimized kernel K-means (COA-KKM) clustering method is proposed to improve the unsupervised identification of AE signals associated with different damage types. Compared with existing AE clustering studies, the proposed framework introduces nonlinear kernel mapping to enhance the separability of overlapping AE feature distributions, and uses the coyote optimization algorithm to optimize the kernel function type and key parameters under an unsupervised Gap Statistic criterion. This reduces the dependence on empirically selected kernels and manually tuned parameters, and provides a more adaptive clustering strategy for multi-type AE signal identification. The proposed method is compared with K-means, FCM, and GMM, and its clustering accuracy and stability are evaluated. The results can provide a methodological reference for distinguishing damage-related AE signals and establishing cluster–damage type mappings in RC beam assessment.

2. Materials and Experiments

2.1. Specimen Preparation

The experiment designed three reinforced concrete beams with identical geometric dimensions and reinforcement configurations to ensure comparability of the initial states during subsequent freeze–thaw, corrosion, and loading tests. The specimen dimensions are 1100 mm × 120 mm × 150 mm, with a concrete design strength grade of C40 and a cover thickness of 10 mm. All reinforcement bars are HRB400 grade, with an ultimate tensile strength of 570 MPa. Two 12-mm-diameter longitudinal bars are arranged in the bottom tension zone, two 8-mm-diameter distribution bars are placed at the top, and stirrups of 6 mm diameter are evenly spaced at 100 mm intervals along the beam length. The reinforcement layout is illustrated in Figure 1.

During the reinforcement binding stage, a rust remover is first used to remove all oxide scales and corrosion products on the surface of steel bars, and then anti-rust oil is evenly coated to reduce re-oxidation during exposure. Meanwhile, to meet the connection requirements of external circuits in the electrochemical accelerated corrosion test, a short steel bar is welded to one end of each reinforcing bar as the electrode lead-out point. All three beams are cast with concrete from the same batch, and then placed in a standard curing room for 28-day standard curing to ensure stable concrete performance. According to the mechanical property test results, the 28-day compressive strength of standard cubic concrete specimens from this batch is 55.6 MPa, which meets the design requirements. The test conditions of each beam are shown in Table 1.

2.2. Acoustic Emission Monitoring System

The acoustic emission (AE) monitoring system adopted in this experiment is the Express8 AE acquisition system produced by Physical Acoustics Corporation (PAC), USA. The system mainly consists of an AE sensor with a resonant frequency of 150 kHz, a preamplifier, and a host integrating signal conditioning and data acquisition. The effective bandwidth of its acquisition card ranges from 1 kHz to 1 MHz. Powered by 220 V AC, the system supports up to 16 AE signal channels and 8 external parameter input channels, enabling real-time acquisition and storage of AE signal waveforms generated during the test. Online monitoring and preliminary analysis of AE signals are performed using AEwin64 software, while subsequent data processing and feature parameter analysis are carried out on the MATLAB platform (version R2025a). A photograph of the AE test system is shown in Figure 2.

In this experiment, four AE sensors were arranged on the surface of each beam: Sensors No. 1 and No. 2 were symmetrically placed in the compression zone near the support axis, and Sensors No. 3 and No. 4 were symmetrically placed in the pure bending zone near the mid-span position. The specific layout is shown in Figure 3. Before sensor installation, the beam surface was ground smooth, and iron sheets were attached using AB adhesive to facilitate the magnetic holding of sensor fixtures. Couplant was applied to the contact surface of each sensor, and small sponges were placed inside the fixtures to reduce noise caused by rigid contact among the fixture, sensor, and beam. After all sensors were installed, the coupling efficiency was verified by the lead break test; the test started only after passing the verification. The initial parameters of the AE system are listed in Table 2.

2.3. Acoustic Emission Signal Acquisition Test

2.3.1. Reinforced Concrete Beam Load-to-Failure Test

Trial Beam L1 is subjected to a three-point bending load. Simply supported roller supports are installed at both ends, with a clear span of 100 cm between them. A hydraulic jack is placed at the mid-span of the beam’s top surface, and a pressure sensor is mounted between the jack and the reaction beam to achieve precise control and real-time monitoring of the loading process.

The bearing capacity of the test beam is theoretically checked according to the Standard for Design of Concrete Structures [31]. Specifically, the flexural capacity of the normal section of the non-deteriorated test beam under three-point bending is 49.62 kN, and the shear capacity of the inclined section is 86.32 kN. Since the normal section bearing capacity is significantly lower than that of the inclined section, the failure mode of the test beam is expected to be dominated by normal section bending failure. Based on this, the ultimate bearing capacity of the non-deteriorated beam is initially determined to be approximately 50 kN.

Loading is carried out in a stepped manner, with an increment of 5 kN per step and a loading rate of 5 kN/min. During the loading process, the AE system synchronously acquires the acoustic emission signals of the beam. The specific loading arrangement is shown in Figure 4.

2.3.2. Freeze–Thaw Damage Test

The freeze–thaw cycling test on Beam L2 is conducted in accordance with the Standard for Long-term and Durability Performance Test Methods of Ordinary Concrete [32], employing the slow freezing method. This method features a gradual freezing process and prolonged holding time at minimum temperature, more closely simulating real-world freeze-thaw environments and accurately reflecting the long-term deterioration behavior of concrete under cyclic freezing and thawing.

The test is performed using a Haier ultra-low temperature freezer, 369 L capacity. During the freezing phase, the air temperature is controlled within the range of −20 °C to −18 °C. Approximately 30 min before the end of freezing, a heating device for the electrolyte solution is activated to ensure a stable thawing temperature of 18 °C to 20 °C. One complete freeze–thaw cycle consists of sequential freezing and thawing phases.

Throughout the entire test, the acoustic emission (AE) system synchronously acquires the AE signals emitted from the beam to monitor internal damage evolution in real time. The experimental setup is illustrated in Figure 5.

2.3.3. Rebar Corrosion Test

The electrochemical accelerated corrosion method with external current is adopted to prefabricate corrosion on Beam L3. The corrosion degree of steel bars is controlled by the theoretical corrosion rate, which is calculated based on Faraday’s law. During the electrochemical corrosion process, the mass loss of anode metal is related to the applied current density, energization time and surface area of steel bars. The relationship is as follows:

Δ m = \frac{A t i S}{Z F}

(1)

The symbols in the equation are defined as follows:

Δ m

: mass loss of steel reinforcement, in kg; A: molar mass of iron, taken as 56 g/mol; i: corrosion current density, in A/m²; S: surface area of the corroded steel reinforcement, in m²; t: theoretical energization time for corrosion, in seconds; n: number of electrons lost per iron atom in the electrode reaction, taken as 2; and F: Faraday’s constant, equal to 96,485 C/mol.

Based on the above, the theoretical corrosion rate of the steel reinforcement can be expressed as follows:

η = \frac{Δ m}{m_{0}}

(2)

In the formula,

η

is the theoretical corrosion rate of steel bars and

m_{0}

is the original mass (in kg) of the steel bar to be corroded.

To ensure a stable electrical connection between the external conductors and the steel reinforcement, the corrosion products at the connection points must be removed before energization. A transparent plastic tray without a lid, matching the dimensions of the beam, is used as the immersion container. Before energization, the test beam is immersed in the electrolyte solution for at least 72 h to allow the solution to fully penetrate the pores, forming a continuous ion transport channel, reducing concrete resistivity, and enhancing the corrosion efficiency.

The applied current density is controlled within 250 μA/cm² to ensure that the corrosion morphology during the accelerated corrosion process is comparable to actual engineering conditions. The corrosion process employs a constant current full-immersion method with applied current acceleration. The power supply equipment used is an adjustable DC stabilized voltage source. During the entire energization process, the Acoustic Emission (AE) system is used synchronously to collect acoustic emission signals from the test beam. The experimental setup and the corrosion test process are shown in Figure 6a and Figure 6b, respectively.

3. Model Establishment

3.1. Clustering Analysis

3.1.1. K-Means Clustering Algorithm

The classical K-Means clustering algorithm is a partition-based clustering method. Its core idea is to divide data samples into several pre-specified number of non-overlapping clusters through iterative optimization. Let the dataset be as follows:

D = \{x_{i} ∣ i = 1,2, \dots, N\}

(3)

In the formulation, let

x_{i}

denote the i-th n-dimensional feature vector. Given the number of clusters is pre-specified, the objective of K-means is to partition the dataset into k disjoint clusters such that the within-cluster similarity of samples is maximized and the between-cluster dissimilarity is also maximized. Euclidean distance is adopted to quantify the similarity between samples. In the clustering procedure, the distance between a data point x and the centroid

c_{i}

of the i-th cluster can be formulated as follows:

d (x, c_{i}) = \sqrt{\sum_{j = 1}^{n} {(x_{j} - c_{i j})}^{2}}

(4)

Meanwhile, K-means clustering uses the sum of squared errors (SSE) as an evaluation metric for cluster quality, with its objective function defined as follows:

S S E = \sum_{i = 1}^{k} \sum_{x \in C_{i}} {|x - c_{i}|}^{2}

(5)

In the formula,

c_{i}

represents the i-th cluster, and

c_{i}

denotes the corresponding cluster center. SSE reflects the compactness of samples within each cluster; a smaller SSE value indicates higher intra-cluster consistency in the clustering results.

3.1.2. Fuzzy C-Means Clustering Algorithm

Fuzzy C-Means (FCM) is a representative soft clustering algorithm, whose core mechanism introduces a membership framework that allows each sample to belong to multiple clusters with varying degrees of weight. Unlike hard clustering, FCM does not enforce exclusive assignment; instead, it quantifies the relative proximity of each sample to cluster centroids, making it particularly suitable for datasets with fuzzy boundaries, overlapping distributions, or transitional patterns. Let the input dataset be denoted as D. The objective of FCM is to partition C fuzzy clusters by constructing a membership matrix

u_{i} j

represents the degree to which sample

x_{i}

belongs to cluster j. The membership values must satisfy the following constraints:

0 \leq u_{i} j \leq 1, \sum (j = 1)^{c} u_{i} j = 1, i = 1,2, \dots, N

(6)

The constraint ensures that the sum of membership degrees of each sample over all clusters equals 1, reflecting its fuzzy membership property. Given the membership matrix, the centroid of the j-th cluster is determined by the weighted average of the samples:

C_{j} = (\sum {(i = 1)}^{N} u_{i} j^{m} x_{i}) / (\sum {(i = 1)}^{N} u_{i} j^{m}), j = 1,2, \dots, c

(7)

In the formula, m is the fuzzy weighting exponent, which usually takes a value in the interval [1.5, 2.5]. A larger m makes the membership distribution smoother and enhances fuzziness, while a smaller m makes the result closer to hard partitioning. FCM achieves optimization by minimizing the following objective function:

J (U, C) = \sum (i = 1)^{N} \sum (j = 1)^{c} u_{i} j^{n} ‖ x_{i} - C_{j} ‖^{2}

(8)

The objective function simultaneously incorporates the spatial distance between samples and cluster centroids, as well as the membership weights of samples to each cluster. Through the alternating update of the membership matrix U and the cluster centroids C, the algorithm drives each sample in the feature space to converge toward its high-membership cluster center.

3.1.3. Gaussian Mixture Model

The Gaussian Mixture Model (GMM) is a soft clustering method based on probabilistic generation mechanism. This model assumes that observed samples do not come from a single distribution, but are jointly generated by several latent sub-distributions, and each Gaussian component corresponds to a latent clustering structure in the data space. Through the weighted superposition of each component, the GMM can describe the multimodal characteristics of data under a unified probabilistic framework.

Formally, given the dataset D, the GMM models the probability density function of the overall data as the weighted sum of K elemental Gaussian distributions:

p (x) = \sum_{k = 1}^{K} π_{k} N (x ∣ μ_{k}, Σ_{k})

(9)

The symbols in the equation are defined as follows: K: the number of Gaussian components;

π_{k}

: the mixing coefficient of the k-th component, representing its weight in the overall model, satisfying

π_{k}

> 0 and

\sum_{k = 1}^{K} π_{k}

= 1;

μ_{k}

: the mean vector of the k-th Gaussian component; and

Σ_{k}

: the covariance matrix of the k-th Gaussian component.

The probability density function of the multivariate Gaussian distribution is defined as follows:

N (x ∣ μ_{k}, Σ_{k}) = e x p \{- 1 / 2 {(x - μ_{k})}^{T} Σ_{k}^{(- 1)} (x - μ_{k})\} / ({(2 π)}^{(d / 2)} {|Σ_{k}|}^{(1 / 2)})

(10)

The parameter set in Formula (10) is typically estimated using the Expectation-Maximization (EM) algorithm. The EM algorithm alternately optimizes between the hidden variable space and the parameter space to gradually approach the maximum value of the model’s logarithmic likelihood function of the observed data [33].

3.2. Model Modification

3.2.1. Kernel Clustering

In practical engineering and complex data analysis problems, the distribution relationship between samples often presents significant nonlinear characteristics. Traditional clustering methods based on Euclidean distance can hardly effectively characterize the true similarity between samples in the original feature space. To overcome this limitation, Kernel Clustering was proposed, and Kernel K-Means is the most typical application form of kernel clustering.

Different from traditional methods, Kernel K-Means does not directly operate on high-dimensional coordinates. Instead, it implicitly completes sample assignment and cluster center updating with the help of a kernel function. Therefore, it can effectively identify cluster structures of any shape and has stronger adaptability to nonlinearly distributed data, as shown in Figure 7.

There are many types of kernel functions, and this paper considers the following three types of classical kernel functions that are widely used in pattern recognition and clustering tasks:

Gaussian radial basis kernel function:

K (x_{i}, x_{j}) = e x p (- \frac{{|x_{i} - x_{j}|}^{2}}{2 σ^{2}})

(11)

The parameter

σ

is the kernel width parameter, which controls the smoothness of the kernel function.

2.: Polynomial kernel function:

K (x_{i}, x_{j}) = {(⟨x_{i}, x_{j}⟩ + c)}^{e}

(12)

In the formula, c is the bias constant, and e is the degree parameter of the polynomial kernel function.

3.: Linear kernel function:

K (x_{i}, x_{j}) = ⟨x_{i}, x_{j}⟩

(13)

3.2.2. COA Kernel Optimization K-Means Clustering Method

The performance of kernel K-means clustering is significantly influenced by the type of kernel function and its parameter values. Different kernel functions correspond to distinct feature mapping schemes, while the kernel parameters alter the distance relationships among samples in the kernel space and the expressiveness of cluster structures. When kernel functions and their parameters are selected based on manual experience, the clustering results become susceptible to subjective bias, making it difficult to stably adapt to the nonlinear and overlapping distribution characteristics inherent in acoustic emission signal feature spaces. To address this limitation, this paper proposes a COA-optimized kernel K-means clustering method (COA-KKM) by integrating the Coyote Optimization Algorithm (COA) to adaptively optimize the kernel function type and its critical parameters, with the Gap Statistic employed as the fitness evaluation metric.

The COA is a swarm intelligence optimization algorithm that simulates the social behavior and evolutionary mechanisms of coyote populations. In this algorithm, each candidate solution is regarded as an individual coyote, and the variable combination corresponding to the candidate solution represents the social state of the coyote. During iteration, each coyote individual is jointly influenced by the dominant individual in the group and the cultural trend of the population. The fundamental update process can be expressed as follows:

{new_soc}_{c}^{p, t} = {soc}_{c}^{p, t} + r_{1} (α^{p, t} - {soc}_{r 1}^{p}) + r_{2} ({cult}^{p, t} - {soc}_{r 2}^{p})

(14)

In the formula,

{soc}_{c}^{p, t}

is the social state of the c-th coyote in the p-th group at the t-th iteration;

α^{p, t}

is the dominant coyote in the p-th group;

{cult}^{p, t}

is the cultural trend of the p-th group, which is usually determined by the median of the social states of all dimensions within the group;

{cult}^{p, t}

is two randomly selected coyote individuals within the group; and is random factors in the interval [0, 1]. This updating mechanism allows individuals to move closer to dominant individuals and the overall population trend while retaining a certain random search capability, thus balancing local exploitation and global exploration.

This paper selects the Gaussian radial basis kernel function, polynomial kernel function and linear kernel function (Equations (11)–(13)) as the optimization objects. The kernel function type and its parameters are encoded into a three-dimensional vector as the parameters to be optimized:

soc = [t, θ_{1}, θ_{2}]

(15)

where t denotes the kernel type code takes the value of 1, 2, 3 (1 = RBF, 2 = Poly,3 = Linear);

θ_{1}

,

θ_{2}

denote the kernel parameter.

For any coyote individual’s corresponding kernel function configuration, kernel K-means is adopted to cluster the standardized acoustic emission feature matrix. Let the number of clusters be k, the r-th cluster is

C_{r}

, and the number of samples in it is, then within-cluster dispersion in the kernel space can be expressed as follows:

W_{k} = \sum_{r = 1}^{k} \frac{1}{2 n_{r}} \sum_{x_{i}, x_{j} \in C_{r}} ‖ ϕ (x_{i}) - ϕ (x_{j}) ‖^{2}

(16)

In the formula, is the nonlinear mapping function implicitly defined by the kernel function. According to the properties of kernel functions, the distance in the kernel space can be calculated directly from the kernel function:

{‖ϕ (x_{i}) - ϕ (x_{j})‖}^{2} = K (x_{i}, x_{i}) + K (x_{j}, x_{j}) - 2 K (x_{i}, x_{j})

(17)

To evaluate the clustering effectiveness under different kernel function configurations, this paper adopts the Gap Statistic as the fitness function for the COA. Based on the range of features in the original data, BBB reference datasets are generated, and kernel K-means clustering is performed on each to compute the corresponding within-cluster scatter. With B = 20, the Gap Statistic can be expressed as follows:

Gap (k) = \frac{1}{B} \sum_{b = 1}^{B} \log (W_{kb}^{*}) - \log (W_{k})

(18)

A larger Gap Statistic indicates that the clustering structure obtained under the current kernel function configuration is more distinct compared to a random distribution. Therefore, the optimization objective of the COA can be expressed as follows:

maxf (soc) = Gap (k)

(19)

After random initialization of the coyote population within the given search space, each individual corresponds to a unique combination of kernel function type and parameters. For each individual, the kernel function is constructed based on its encoded representation, followed by kernel K-means clustering on the standardized acoustic emission feature matrix, and the resulting Gap Statistic is computed as the individual’s fitness value. Subsequently, the Coyote Optimization Algorithm (COA) iteratively updates the social state of each individual according to Equation (14), while maintaining population diversity through operations such as newborn individual replacement, inferior individual elimination, and inter-group migration. The optimization process terminates when the global best fitness remains unimproved for a consecutive number of iterations, or when the maximum number of iterations is reached. Finally, the individual with the maximum Gap Statistic is selected as the optimal kernel configuration, and kernel K-means clustering is performed using this optimal kernel function and its corresponding parameters to obtain the final clustering results of acoustic emission signals from RC beams.

3.2.3. Parameter Settings

The main parameter settings used in the COA-KKM algorithm are listed in Table 3.

3.3. Dataset Construction

3.3.1. AE Feature Dataset

Due to the significant non-stationarity and randomness characteristics of acoustic emission signals, it is usually necessary to conduct quantitative characterization via feature parameters to reveal the acoustic response properties during the evolution of internal damage in structures. Although an acoustic emission system can extract a variety of feature parameters, the selection of feature parameters for unsupervised clustering analysis needs to balance the clarity of physical meaning, parameter stability, and the ability to distinguish different damage mechanisms. Meanwhile, excessive or redundant feature parameters will not only increase the dimension of the feature space, but also may reduce the discriminative performance of clustering results.

Based on these considerations and the AE generation characteristics during reinforcement corrosion, freeze–thaw damage, and loading-induced failure of RC beams, seven AE parameters were selected as input features for subsequent clustering analysis, including rise time, ring-down counts, energy, duration, average frequency, counts to peak, and amplitude. These parameters characterize AE events from the perspectives of time-domain response, activity level, energy release, frequency content, and signal intensity, and can provide complementary information for distinguishing different damage-related AE signals. The definitions of several AE parameters are illustrated in Figure 8.

Based on the AE monitoring data obtained from three RC beams under loading, corrosion, and freeze–thaw conditions, as described in Section 2, a seven-dimensional feature dataset containing 490 valid samples was constructed. The distribution ranges of the AE parameters for each damage condition are listed in Table 4.

To provide a more intuitive description of the collected AE dataset, the distributions of the seven AE features under the three damage conditions are shown in Figure 9. It can be observed that the loading-induced damage samples generally exhibit higher values in ring-down counts, duration, average frequency, and amplitude, whereas the freeze–thaw and corrosion samples show closer distributions in several features. Meanwhile, overlaps still exist among the three damage types, indicating that single AE parameters are insufficient for reliable damage identification. Therefore, multi-feature clustering analysis is necessary for further distinguishing different damage-related AE signals.

3.3.2. Data Pretreatment

The seven AE features used in this study have different physical dimensions and numerical ranges. Therefore, min–max normalization was applied to each feature before clustering, so that all features were transformed into the same numerical range while preserving their relative distribution characteristics.

Although the normalized seven-dimensional feature set characterizes AE signals from multiple perspectives, including time-domain response, energy release, amplitude response and frequency content, the Pearson correlation matrix shows that correlations exist among several feature parameters. As shown in Figure 10, rise time is strongly correlated with peak frequency, while ring-down counts show strong correlations with duration and amplitude. Duration is also highly correlated with amplitude. These correlations indicate that the original AE feature set contains partially overlapping information and potential feature redundancy. Therefore, PCA was introduced to reduce feature redundancy and obtain a compact low-dimensional representation for visualization and the corresponding clustering analysis. The cumulative variance contribution rates of the first three and first four principal components were 90.05% and 94.68%, respectively. Therefore, the first three principal components were used for three-dimensional visualization, while the first four principal components were retained as the PCA-reduced input for the corresponding clustering analysis.

To further interpret the physical meaning of the retained principal components, the loading matrix of PC1–PC4 was analyzed. The absolute loading values of the first four principal components are listed in Table 5. For PC1, the dominant features are ring-down counts, amplitude and duration, indicating that PC1 mainly reflects the overall AE activity intensity and sustained response level. PC2 is dominated by average frequency, suggesting that it primarily represents the frequency-related characteristics of AE signals. PC3 is mainly controlled by amplitude, peak frequency and rise time, and can be associated with waveform morphology, peak response and frequency distribution characteristics. PC4 is dominated by energy, amplitude and duration, indicating that it preserves supplementary information related to energy release and signal duration. This also explains why retaining the fourth principal component is meaningful, although its individual variance contribution is relatively small.

The PCA visualization of the AE feature dataset is shown in Figure 11. It can be observed that the three types of damage signals exhibit a certain degree of aggregation in the reduced feature space, indicating that the selected AE parameters contain damage-related discriminative information. However, the category boundaries are not completely separated, and local overlap still exists among different damage types. This further suggests that simple visual separation or single-parameter judgment is insufficient, and clustering algorithms with stronger nonlinear separation capability are required for multi-type AE signal identification.

3.4. Performance Evaluation

The AE dataset used in this study was constructed from three experimentally designed dominant damage conditions, namely loading-induced damage, freeze–thaw damage, and reinforcement corrosion. Accordingly, three clusters were used for all clustering algorithms in the comparative analysis, so that the clustering results could be consistently compared with the experimentally defined damage categories.

Since the cluster labels output by unsupervised clustering algorithms are only numerical identifiers of sample partitions in the feature space, they do not inherently correspond to specific damage categories. Therefore, before calculating external evaluation metrics, the Hungarian algorithm was employed to establish the optimal correspondence between the clustering labels and the experimental damage labels. Specifically, a cost matrix was constructed according to the mismatch between the clustering results and the ground-truth damage categories, and the optimal assignment with the minimum total matching error was obtained. Based on the matched labels, the clustering accuracy was then calculated to evaluate the consistency between the clustering results and the experimental damage categories.

In this paper, the performance of different clustering methods is evaluated using the metrics clustering accuracy (ACC), adjusted Rand index (ARI), and silhouette coefficient (SC), whose calculation formulas are given in Equations (20)–(22). Here,

y_{i}

represents the true label of the i-th sample;

{\hat{y}}_{i}

denotes the cluster label after optimal matching via the Hungarian algorithm; N is the total number of samples; a(i) and b(i) are the mean intra-cluster distance and the mean nearest-cluster distance for sample i, respectively; and RI and denote the Rand index and its expected value under random partitioning. To ensure comparability among different methods, SC was calculated in the normalized original seven-dimensional feature space for all clustering results. It should be noted that, for kernel-based clustering, the distance relationship in the kernel-induced feature space is different from that in the original Euclidean space. Therefore, SC is used only as a supplementary internal validity index in this study, while ACC and ARI are taken as the main metrics for evaluating the consistency between clustering results and experimental damage categories.

ACC = \frac{1}{N} \sum_{i = 1}^{N} 1 \{{\hat{y}}_{i} = y_{i}\}

(20)

ARI = \frac{RI - E [RI]}{\max (RI) - E [RI]}

(21)

SC = \frac{1}{N} \sum_{i = 1}^{N} \frac{b (i) - a (i)}{\max {a (i), b (i)}}

(22)

Among these indexes, the ACC reflects the proportion of correctly classified samples after label matching, where a higher ACC indicates better clustering accuracy. The ARI measures the agreement between the clustering result and the ground truth with correction for chance, with values closer to 1 signifying higher consistency and 0 corresponding to random partitioning. The SC evaluates the compactness and separation of clusters without using true labels; a higher SC suggests that samples are well assigned to their clusters.

Since clustering results may be affected by random initialization, each method was independently repeated 10 times with different random seeds. The mean values were used to compare the overall clustering performance, while the standard deviations were used to evaluate the stability of each method.

4. Result

4.1. Clustering Visualization

Figure 12 presents the visualized clustering results of the four methods, including K-means, FCM, GMM and COA-KKM. It can be observed that the misclassified samples of K-means (Figure 12a) are mainly located in the overlapping regions among different damage types, indicating that the hard partition based on Euclidean distance has limited ability to deal with samples with ambiguous boundaries. After fuzzy membership is introduced, FCM (Figure 12b) reduces the number of misclassified samples compared with K-means, but its distance-based clustering mechanism still limits its ability to describe complex nonlinear boundaries. The GMM (Figure 12c) can describe non-spherical cluster structures through covariance matrices and achieves improved clustering performance. However, some edge samples and transition-region samples are still incorrectly identified, suggesting that the Gaussian distribution assumption cannot fully adapt to the irregular distribution of AE signals associated with different damage types. In contrast, COA-KKM (Figure 12d) further reduces the number of misclassified samples, with most errors confined to local overlapping regions. This indicates that kernel mapping and COA-based parameter optimization can enhance the nonlinear separation capability of the clustering model for different damage-related AE signals.

4.2. Effect of PCA Dimensionality Reduction on Baseline Methods

To clarify the influence of PCA dimensionality reduction on clustering performance, K-means, the FCM and GMM were tested using both the normalized original seven-dimensional AE features and the first four principal components as inputs. Each method was repeated 10 times under the same settings, and the results are summarized in Table 6.

As shown in Table 6, PCA dimensionality reduction slightly improved the ACC and ARI of K-means and FCM, indicating that the reduced feature space helped weaken feature redundancy and improve the identification of the underlying cluster structure for these methods. For GMM, the ACC obtained using PC1–PC4 was slightly lower than that obtained using the original seven-dimensional features, whereas the ARI was slightly higher. This indicates that PCA did not uniformly improve all performance metrics, but the first four principal components preserved the main discriminative information of the original AE features while reducing the input dimension. Therefore, using PC1–PC4 as the reduced-dimensional input for the corresponding baseline methods is considered reasonable.

4.3. COA Optimization Results

4.3.1. Convergence Behavior of COA Optimization

To examine the convergence behavior of the COA-based kernel optimization process, the best Gap Statistic in each iteration was recorded over 10 independent runs. As shown in Figure 13, the mean best Gap Statistic increased rapidly in the early iterations and gradually tended to stabilize as the iteration proceeded. Specifically, the mean best Gap Statistic increased from 1.2101 in the first iteration to 1.2614 in the 10th iteration, and further reached 1.2880 in the 20th iteration. After approximately the 18th iteration, the improvement in the mean fitness became very small, indicating that the optimization process had generally approached a stable state within 20 iterations. Therefore, the 20-iteration setting used in this study was considered acceptable for obtaining stable near-optimal solutions for the present dataset while keeping the computational cost within a reasonable range.

It should be noted that one of the repeated runs remained at a relatively low Gap Statistic after the early iterations and finally selected a polynomial kernel instead of the RBF kernel. This run also produced lower external clustering performance, with ACC and ARI of 81.22% and 0.6482, respectively. This phenomenon indicates that the stochastic COA process may occasionally fall into a local optimum. Nevertheless, this run was retained in the repeated-run statistics to objectively reflect the stochastic nature and robustness of the proposed method.

4.3.2. Optimized Kernel Function and Parameter Analysis

The optimized kernel configurations obtained from the 10 independent COA runs were further examined. As summarized in Table 7, the RBF kernel was selected in 9 out of the 10 runs, whereas the polynomial kernel was selected in only one run and the linear kernel was not selected. For the RBF kernel runs, the optimized kernel parameter was mainly concentrated in the range of 0.1706–0.2363, with an average value of approximately 0.2203. This indicates that the RBF kernel was the dominant optimized kernel configuration for the present AE feature dataset.

The preference for the RBF kernel is consistent with the distribution characteristics of AE signals associated with different damage types. AE parameters collected from loading-induced damage, freeze–thaw damage, and reinforcement corrosion generally show nonlinear and partially overlapping distributions in the feature space. In particular, freeze–thaw and corrosion-related signals may share similar energy- and duration-related characteristics, making them difficult to distinguish using a linear boundary. The RBF kernel maps the original feature space into a nonlinear kernel space and emphasizes local similarity between samples, which is suitable for separating AE signals with nonlinear and locally overlapping distributions. Therefore, the COA-based selection of the RBF kernel suggests that nonlinear local mapping is more appropriate for the unsupervised identification of damage-related AE signals in this study.

It should also be noted that one run selected the polynomial kernel. Although this run yielded a relatively high silhouette coefficient, its ACC and ARI were substantially lower than those of the RBF-kernel runs. This indicates that a compact clustering structure measured by an internal validity index does not necessarily correspond to better consistency with the actual damage labels. This result also reflects the stochastic nature of COA and the possibility of occasionally falling into a local optimum. Therefore, the polynomial kernel run was retained in the repeated-run statistics, while the overall kernel selection results indicate that the RBF kernel provides a more stable and suitable mapping scheme for the present AE signal feature set.

4.4. Clustering Performance Evaluation

4.4.1. Clustering Performance Comparison

As shown in Table 8 and Figure 14, the four clustering methods exhibit clear differences in overall identification performance and stability. K-Means achieves relatively low ACC and ARI values of 82.51% ± 0.45% and 0.6603 ± 0.0052, respectively, indicating limited consistency between its clustering results and the experimental damage categories. Although its SC reaches 0.5680 ± 0.0036, the highest among the four methods, this only suggests that K-Means forms relatively compact clusters in the normalized original feature space. For AE feature data with local overlap and ambiguous boundaries, the Euclidean distance-based hard partition of K-Means is insufficient to characterize the complex boundaries among different damage types.

FCM also shows limited performance, with ACC and ARI values of 75.84% ± 20.66% and 0.6571 ± 0.1861, respectively. The large standard deviations indicate that FCM is highly sensitive to random initialization and lacks stability in repeated runs. The relatively large standard deviation of FCM can be attributed to its sensitivity to initialization and the fuzzy membership updating mechanism. In the 10 repeated runs, FCM showed two distinct convergence patterns: six runs achieved an ACC of 91.84% and an ARI of 0.8013, whereas the other four runs only achieved an ACC of 51.84% and an ARI of 0.4408. This indicates that FCM may converge to different local solutions under different initial membership conditions. In addition, the low-ACC runs showed a relatively high SC of 0.5623, suggesting that the clusters were internally compact but not consistent with the actual damage-type labels. This is mainly because AE signals related to freeze–thaw damage and corrosion damage have partially overlapping feature distributions, and samples near the transition regions may receive similar membership degrees in FCM. Therefore, the fuzzy membership mechanism may amplify the uncertainty of boundary samples and lead to unstable clustering results. In this study, the fuzziness exponent was kept fixed to ensure a consistent comparison among baseline methods.

Compared with K-Means and FCM, the GMM shows a clear improvement in overall performance, with the ACC and ARI increasing to 90.37% ± 7.73% and 0.7885 ± 0.0730, respectively. This indicates that probabilistic distribution modeling can better improve the consistency between clustering results and experimental damage categories. Unlike K-Means and FCM, the GMM can describe non-spherical data distributions through Gaussian components and covariance matrices, which makes it more suitable for AE signal data with complex distribution characteristics. However, its SC is only 0.3336 ± 0.1026, the lowest among the four methods, suggesting that the clustering labels obtained by the GMM show relatively weak compactness and separation in the normalized original feature space. Meanwhile, the relatively large standard deviations of the ACC and ARI indicate that its results still fluctuate under different random initializations.

COA-KKM further outperforms the GMM in overall performance, achieving the highest ACC and ARI values of 92.86% ± 4.19% and 0.8215 ± 0.0662, respectively. Compared with the GMM, which already provides relatively good identification performance, COA-KKM not only further improves the mean values of ACC and ARI, but also reduces their standard deviations, indicating better stability in repeated runs. This suggests that kernel mapping enhances the nonlinear representation capability of AE features, while COA-based adaptive optimization of the kernel function type and key parameters helps reduce the influence of manual parameter selection and random initialization.

It should be noted that SC was calculated in the normalized original seven-dimensional feature space for all methods to ensure comparability, whereas the actual clustering process of COA-KKM is performed in the kernel-induced feature space. Therefore, SC is mainly used as an auxiliary reference and cannot fully characterize the cluster structure of COA-KKM in the kernel space. The SC of COA-KKM is 0.4032 ± 0.0626, which is lower than that of K-Means and FCM but higher than that of the GMM. This does not indicate weaker identification performance; rather, it shows that COA-KKM does not simply pursue intra-cluster compactness in the original Euclidean space, but improves the nonlinear separability of AE signals associated with different damage types through kernel mapping and parameter optimization. Overall, COA-KKM achieves the best ACC and ARI while maintaining good stability, indicating that the proposed method is more suitable for the unsupervised identification of AE signals corresponding to different damage types.

4.4.2. Statistical Significance Analysis

To further examine whether the observed performance improvement of COA-KKM was statistically significant, a two-sided Wilcoxon signed-rank test was conducted based on the results of 10 repeated runs. Since the ACC and ARI are external evaluation metrics that directly measure the consistency between clustering results and the actual damage-type labels, they were selected for the significance analysis. In contrast, SC is an internal validity index that only evaluates the compactness and separation of clusters without using the true damage labels. Therefore, SC was not included in the significance test, but was retained as a supplementary indicator in the overall performance comparison.

The test results are summarized in Table 9. COA-KKM significantly outperformed K-means and FCM in terms of both the ACC and ARI, indicating that the proposed kernel optimization strategy effectively improved the consistency between clustering results and damage-type labels. Compared with the GMM, COA-KKM also achieved higher mean ACC and ARI, with improvements of 2.49 percentage points and 0.0330, respectively. The corresponding p-values were 0.0645 and 0.0566, which are close to but slightly above the 0.05 significance level in the two-sided test. This result indicates that, although the improvement over the GMM cannot be regarded as strictly statistically significant at the 0.05 level, COA-KKM still showed a favorable performance trend against this strong probabilistic clustering baseline. Overall, the statistical results support the effectiveness of COA-KKM, especially in comparison with conventional partition-based and fuzzy clustering methods, while also indicating that further validation with larger datasets is needed to more robustly quantify its advantage over the GMM.

5. Discussion

5.1. Class-Level Performance

Although the overall clustering performance has been evaluated in Section 4 using the ACC, ARI and SC, these global metrics cannot fully reveal the recognition characteristics of different damage types. These metrics compress the classification performance of freeze–thaw damage, corrosion damage and load-induced damage into single overall values. As a result, they may conceal the fact that some damage types are easier to identify while others are more prone to misclassification. For AE-based damage identification of RC beams, this class-dependent recognition performance is important because different damage mechanisms may generate AE signals with different degrees of feature separability. If only the overall clustering accuracy is considered, the model may appear reliable while still performing poorly for a specific damage type. Therefore, further analysis of the recognition performance for each damage category is necessary to clarify the identification difficulty of different AE signals and to reveal the source of misclassification.

To further evaluate the recognition performance of each damage type, the F1-score was introduced in this study. The F1-score is a class-level evaluation metric that combines precision and recall. Precision reflects the proportion of correctly identified samples among all samples predicted as a certain damage type, while recall reflects the proportion of correctly identified samples among all actual samples of that damage type. Therefore, the F1-score can simultaneously consider false positives and false negatives, making it suitable for evaluating the recognition performance of each individual damage category. The precision, recall and F1-score are defined as follows:

Precision = \frac{TP}{TP + FP}

(23)

Recall = \frac{TP}{TP + FN}

(24)

F 1 = \frac{2 \times Precision \times Recall}{Precision + Recall}

(25)

where TP denotes the number of samples correctly identified as a given damage type, FP denotes the number of samples from other damage types incorrectly identified as that damage type, and FN denotes the number of samples of that damage type incorrectly assigned to other categories. A higher F1-score indicates better recognition performance for the corresponding damage type. In contrast, a lower F1-score suggests that the damage type is more likely to be missed or confused with other categories in the current feature space.

As shown in Table 10 and Figure 15, the class-level F1-scores show clear differences in the recognition difficulty of the three damage types. Load-induced damage generally achieves high F1-scores under different methods, with values of 0.9944 ± 0.0008, 0.8760 ± 0.1532, 0.9717 ± 0.0039 and 0.9706 ± 0.0158 for K-Means, FCM, GMM and COA-KKM, respectively. Except for the relatively large fluctuation observed in FCM, the F1-scores of load-induced damage remain at a high level, indicating that the AE signals associated with loading-induced damage are more distinguishable in the current feature space.

In contrast, freeze–thaw and corrosion damage show stronger dependence on the clustering method. For freeze–thaw damage, K-Means and FCM obtain relatively low and unstable F1-scores, especially FCM, whose large standard deviation indicates strong sensitivity to random initialization. The GMM and COA-KKM significantly improve the recognition performance of freeze–thaw damage, with F1-scores of 0.8860 ± 0.0792 and 0.8759 ± 0.1091, respectively. This suggests that methods capable of describing non-spherical or nonlinear feature distributions are more suitable for identifying freeze–thaw-related AE signals.

For corrosion damage, COA-KKM achieves the highest F1-score of 0.9214 ± 0.0538, outperforming K-Means, FCM and the GMM. Although the GMM shows good overall performance in Table 8, its F1-score for corrosion damage is 0.8067 ± 0.2494, with a relatively large standard deviation, indicating that corrosion-related AE signals are still difficult to identify stably under the Gaussian mixture assumption. In comparison, COA-KKM provides a more balanced recognition performance across the three damage categories, especially improving the identification of corrosion damage. These results indicate that the advantage of COA-KKM is mainly reflected in its ability to handle boundary or overlapping samples, thereby improving the class-level consistency of different damage-related AE signals.

For visualization and misclassification analysis, the representative run whose ACC was closest to the mean ACC of the 10 repeated runs was selected. Figure 16 further reveals the specific misclassification paths of different methods in the representative runs. For K-Means, the misclassification mainly occurs between freeze–thaw and corrosion damage: 83 freeze–thaw samples are identified as corrosion, whereas corrosion and load-induced samples are almost all correctly assigned. This indicates that the poor recognition of freeze–thaw damage by K-Means is not caused by general confusion among all three categories, but mainly by the shift of freeze–thaw samples toward the corrosion category in the feature space. FCM partly alleviates this problem, reducing the number of freeze–thaw samples misclassified as corrosion from 83 to 38. However, the misclassified samples are still mainly concentrated between freeze–thaw and corrosion, suggesting that fuzzy membership can improve boundary sample identification to some extent, but cannot fully eliminate the feature overlap between these two categories.

The misclassification pattern of the GMM differs from those of K-Means and FCM. Most freeze–thaw samples are correctly identified, whereas 22 corrosion samples are misclassified as freeze–thaw and 9 corrosion samples are misclassified as load-induced damage. This indicates that corrosion samples show a certain transitional characteristic in the feature space, and may be close to either freeze–thaw or load-induced AE signals. For COA-KKM, the main misclassification is the shift of freeze–thaw samples toward the load-induced category, with 21 freeze–thaw samples misclassified as load-induced damage. In addition, 12 corrosion samples are misclassified as freeze–thaw and 3 corrosion samples are misclassified as load-induced damage, while all load-induced samples are correctly identified. These results show that COA-KKM maintains stable recognition of load-induced damage and effectively reduces the misclassification of corrosion samples; however, some boundary freeze–thaw samples may still be assigned to the load-induced category when their AE features become close to those of loading-related signals.

Overall, the misclassified samples of the four methods are mainly concentrated in adjacent or transitional regions of the feature space. K-Means and FCM mainly confuse freeze–thaw with corrosion, the GMM shows a tendency to disperse corrosion samples toward other categories, while COA-KKM performs better in reducing corrosion misclassification and maintaining stable recognition of load-induced damage. Combined with the F1-score results, the three types of AE signals exhibit different degrees of separability in the feature space. Load-induced AE signals are the easiest to form a stable category, whereas freeze–thaw- and corrosion-related signals remain the main sources affecting the clarity of classification boundaries.

5.2. Feature Interpretation

To further interpret the feature characteristics of the COA-KKM clustering results, the original AE features of the samples identified by the model and the main misclassification paths were analyzed based on the representative run. Figure 17 presents the normalized median values of the seven AE features for different sample groups. The first three rows represent the samples identified by COA-KKM as freeze–thaw, corrosion and load-induced damage, while the last two rows correspond to the two main misclassification paths in Figure 16d, namely corrosion samples misclassified as freeze–thaw damage and freeze–thaw samples misclassified as load-induced damage.

For the three identified damage groups, the samples identified as load-induced damage show the most distinctive feature profile. Their normalized median values of ring-down count, duration, average frequency and counts to peak are 0.37, 0.30, 0.49 and 0.27, respectively, which are generally higher than those of the freeze–thaw and corrosion groups. This indicates that COA-KKM tends to assign samples with stronger AE activity, longer signal duration and more prominent frequency characteristics to the load-induced damage category. In contrast, the samples identified as freeze–thaw damage generally show lower feature values, with only counts to peak reaching a relatively noticeable level. The samples identified as corrosion damage show higher values than the freeze–thaw group in rise time, duration, average frequency and amplitude, but are still generally lower than the load-induced group, presenting an intermediate feature profile between freeze–thaw and load-induced damage.

For the main misclassification paths, the corrosion samples misclassified as freeze–thaw damage show relatively low values in most AE features. Their ring-down count, duration and amplitude are lower than those of the overall samples identified as corrosion damage, and are closer to the feature levels of the freeze–thaw group. This suggests that some corrosion samples with weak AE activity and less pronounced duration-related responses may be located close to the freeze–thaw category in the feature space. On the other hand, the freeze–thaw samples misclassified as load-induced damage show a particularly high normalized median value of counts to peak, reaching 0.71, and their average frequency is also higher than that of the overall freeze–thaw group. Although these samples still have relatively low ring-down count, duration and amplitude, their more prominent rising-stage activity may cause them to shift toward the load-induced category during clustering.

Overall, Figure 17 further explains the misclassification phenomena observed in the COA-KKM confusion matrix. Load-induced samples show more pronounced responses in several AE features and therefore form a relatively stable identified category. Freeze–thaw and corrosion samples exhibit closer feature profiles, especially when corrosion samples have weak AE activity, which may cause them to be assigned to the freeze–thaw category. In addition, some freeze–thaw samples with high counts to peak tend to approach the load-induced category, becoming the main source of freeze–thaw-to-load misclassification.

6. Conclusions

This paper proposed a COA-optimized kernel K-means clustering method for the unsupervised identification of AE signals associated with different damage types in RC beams. Based on AE signals collected from three RC beams under loading-induced damage, freeze–thaw damage and reinforcement corrosion conditions, a dataset containing 490 valid samples with seven AE features was constructed. The main conclusions are as follows:

(1): The AE signals corresponding to the three damage types show distinguishable but partially overlapping feature distributions. Load-induced damage generally exhibits higher ring-down count, duration, average frequency and amplitude, while freeze–thaw and corrosion damage show closer feature distributions in several parameters. This indicates that single AE parameters are insufficient for reliable damage identification, and multi-feature clustering is necessary.
(2): Compared with K-means, FCM and the GMM, the proposed COA-KKM method achieves the best overall clustering performance. Over 10 repeated runs, COA-KKM obtains the highest ACC and ARI, reaching 92.86% ± 4.19% and 0.8215 ± 0.0662, respectively. The improvements in ACC and ARI are statistically significant compared with K-means and FCM, while COA-KKM also shows a higher mean performance and better stability than the GMM. The results show that kernel mapping and COA-based parameter optimization can improve the nonlinear separation ability and stability of clustering for different damage-related AE signals.
(3): The recognition difficulty differs among damage types. Load-induced damage can be identified more stably, while freeze–thaw and corrosion damage are more prone to confusion due to feature overlap. Class-level results show that COA-KKM achieves the highest F1-score for corrosion damage, reaching 0.9214 ± 0.0538, and maintains good recognition performance for both freeze–thaw and load-induced damage.
(4): Feature interpretation based on the representative COA-KKM result further shows that the main misclassified samples are located in feature-overlapping or transitional regions. Corrosion samples with weaker AE activity tend to be misclassified as freeze–thaw damage, while some freeze–thaw samples with higher counts to peak tend to shift toward the load-induced category.

It should be noted that this study mainly focuses on the feasibility validation of the COA-KKM method for the unsupervised identification of AE signals associated with different damage types and does not include a complete engineering deployment study for bridge structural health monitoring systems. Due to the limitation of the experimental conditions, the dataset used in this study was collected from controlled tests on three RC beams, with a relatively limited sample size. In addition, the three types of AE signals were obtained from different dominant damage conditions, which cannot fully represent complex service scenarios in which multiple types of damage coexist or evolve sequentially in actual bridge structures. Nevertheless, the COA-KKM method showed good performance in terms of clustering accuracy, ARI and stability, indicating its potential as an auxiliary analysis tool for AE-based bridge monitoring. For future engineering application, further validation using field AE monitoring data is still needed, with consideration of sensor deployment, real-time computation, traffic-induced noise, environmental variations, sensor coupling conditions and non-damage AE sources, so as to evaluate its robustness and applicability under complex on-site conditions.

Author Contributions

Conceptualization, X.W.; methodology, X.W.; validation, X.F.; formal analysis, X.F.; investigation, Y.W.; data curation, X.F.; writing—original draft preparation, Y.W.; writing—review and editing, X.F. (lead reviewer), F.Y. and X.W.; visualization, F.Y.; supervision, X.W.; project administration, X.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Basic Research Program from the Natural Science Foundation of Jiangsu Province, China (grant number BK20220209), Jiangsu Province Youth Science and Technology Talent Support Project (grant number JSTJ-2024-302).

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

Authors Xianqiang Wang, Xiaonan Feng, and Yi Fan are employed by the company JSTI Group. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:

AE	acoustic emission
RC	reinforced concrete
FCM	Fuzzy C-Means Clustering
GMM	Gaussian Mixture Model
COA	Coyote Optimization Algorithm
COA-KKM	kernel K-means clustering method optimized by the coyote optimization algorithm

References

Ma, J.; Yang, Q.W.; Peng, X.; Xia, K.S. Review on Durability Deterioration and Mitigation of Concrete Structures. Coatings 2025, 15, 982. [Google Scholar] [CrossRef]
Eker, H.; Demir Şahin, D.; Çullu, M. Effect of Reduced Fineness of Fly Ash Used on the Alkali–Silica Reaction (ASR) of Concrete. Iran. J. Sci. Technol. Trans. Civ. Eng. 2023, 47, 2203–2217. [Google Scholar] [CrossRef]
Farhidzadeh, A.; Dehghan-Niri, E.; Salamone, S.; Luna, B.; Whittaker, A. Monitoring Crack Propagation in Reinforced Concrete Shear Walls by Acoustic Emission. J. Struct. Eng. 2013, 139, 04013010. [Google Scholar] [CrossRef]
Rucka, M.; Knak, M.; Nitka, M. A Study on Microcrack Monitoring in Concrete: Discrete Element Method Simulations of Acoustic Emission for Non-Destructive Diagnostics. Eng. Fract. Mech. 2023, 292, 109636. [Google Scholar] [CrossRef]
Farahat, A.M.; Ohtsu, M. Evaluation of Plastic Damage in Concrete by Acoustic Emission. J. Mater. Civ. Eng. 1995, 7, 148–153. [Google Scholar] [CrossRef]
Soulioti, D.; Barkoula, N.M.; Paipetis, A.; Matikas, T.E.; Shiotani, T.; Aggelis, D.G. Acoustic Emission Behavior of Steel Fibre Reinforced Concrete under Bending. Constr. Build. Mater. 2009, 23, 3532–3536. [Google Scholar] [CrossRef]
Xie, C.; Yuan, L.; Zhao, M.; Jia, Y. Study on Failure Mechanism of Porous Concrete Based on Acoustic Emission and Discrete Element Method. Constr. Build. Mater. 2020, 235, 117409. [Google Scholar] [CrossRef]
Zhou, Q.Z.; Wen, B.C. Fatigue Damage Assessment Using Adaptive Acoustic Emission Waveform Analysis. Results Eng. 2025, 27, 105713. [Google Scholar] [CrossRef]
Rather, A.I.; Laskar, A.; Banerjee, S. Monitoring Bond Degradation and Predicting Failure Modes in GFRP-Reinforced Concrete Beams Using Information Entropy Analysis of Acoustic Emission Waveforms. Constr. Build. Mater. 2025, 498, 144014. [Google Scholar] [CrossRef]
Cui, J.; Qu, X.; Lv, C.; Du, J.; Wang, H. Multi-Parameter Acoustic Emission Analysis for Fatigue Crack Evaluation in Structural Health Monitoring. Measurement 2025, 256, 118529. [Google Scholar] [CrossRef]
Idrissi, H.; Liman, A. Study and Characterization by Acoustic Emission and Electrochemical Measurements of Concrete Deterioration Caused by Reinforcement Steel Corrosion. NDT E Int. 2003, 36, 563–569. [Google Scholar] [CrossRef]
Zaki, A.; Chai, H.K.; Aggelis, D.G.; Alver, N. Non-Destructive Evaluation for Corrosion Monitoring in Concrete: A Review and Capability of Acoustic Emission Technique. Sensors 2015, 15, 19069–19101. [Google Scholar] [CrossRef] [PubMed]
Di Benedetti, M.; Loreto, G.; Matta, F.; Nanni, A. Acoustic Emission Monitoring of Reinforced Concrete under Accelerated Corrosion. J. Mater. Civ. Eng. 2013, 25, 1022–1029. [Google Scholar] [CrossRef]
Di Benedetti, M.; Loreto, G.; Matta, F.; Nanni, A. Acoustic Emission Historic Index and Frequency Spectrum of Reinforced Concrete under Accelerated Corrosion. J. Mater. Civ. Eng. 2014, 26, 04014059. [Google Scholar] [CrossRef]
Colombo, I.G.; Colombo, M.; Di Prisco, M. Tensile Behavior of Textile Reinforced Concrete Subjected to Freezing-Thawing Cycles in Un-Cracked and Cracked Regimes. Cem. Concr. Res. 2015, 73, 169–183. [Google Scholar] [CrossRef]
Yun, H.D.; Choi, W.C. Acoustic Emission to Detect Damage in FRP Strength under Freeze and Thaw Cycles. In Acoustic Emission Research and Applications; IntechOpen: Rijeka, Croatia, 2013. [Google Scholar] [CrossRef] [PubMed][Green Version]
Choi, W.C.; Yun, H.D. Acoustic Emission Activity of CFRP-Strengthened Reinforced Concrete Beams after Freeze-Thaw Cycling. Cold Reg. Sci. Technol. 2015, 110, 47–58. [Google Scholar] [CrossRef]
Mainali, G.; Dineva, S.; Nordlund, E. Experimental Study on Debonding of Shotcrete with Acoustic Emission during Freezing and Thawing Cycle. Cold Reg. Sci. Technol. 2015, 111, 1–12. [Google Scholar] [CrossRef]
Todak, H.N.; Tsui, M.; Ley, M.T.; Weiss, W.J. Evaluating freeze-thaw damage in concrete with acoustic emissions and ultrasonics. In Advances in Acoustic Emission Technology; Springer International Publishing: Cham, Switzerland, 2017; Volume 179, pp. 175–189. [Google Scholar] [CrossRef]
Thirumalaiselvi, A.; Sasmal, S. Pattern Recognition Enabled Acoustic Emission Signatures for Crack Characterization during Damage Progression in Large Concrete Structures. Appl. Acoust. 2021, 175, 107797. [Google Scholar] [CrossRef]
Abdeljaber, O.; Avci, O.; Kiranyaz, S.; Gabbouj, M.; Inman, D.J. Real-Time Vibration-Based Structural Damage Detection Using One-Dimensional Convolutional Neural Networks. J. Sound Vib. 2017, 388, 154–170. [Google Scholar] [CrossRef]
Ding, Z.; Li, J.; Hao, H. Structural Damage Identification by Sparse Deep Belief Network Using Uncertain and Limited Data. Struct. Control Health Monit. 2020, 27, e2522. [Google Scholar] [CrossRef]
Yu, X.; Montrésor, S.; Bentahar, M.; Mechri, C. Cluster Analysis of Acoustic Emission Signals for the Damage Pattern Recognition of Polymer Concrete. Appl. Acoust. 2023, 211, 109533. [Google Scholar] [CrossRef]
Saha, I.; Sagar, R.V. Classification of the Acoustic Emissions Generated during the Tensile Fracture Process in Steel Fibre Reinforced Concrete Using a Waveform-Based Clustering Method. Constr. Build. Mater. 2021, 294, 123541. [Google Scholar] [CrossRef]
Liu, K.; Wulan, T.; Yao, Y.D.; Bian, M.; Bao, Y. Assessment of Damage Evolution of Concrete Beams Strengthened with BFRP Sheets with Acoustic Emission and Unsupervised Machine Learning. Eng. Struct. 2024, 300, 117228. [Google Scholar] [CrossRef]
Vishwakarma, V.; Ray, S. Probability-Based Prediction of Crack Propagation in Lightly Reinforced Concrete Beams Using Acoustic Emission Technique. Appl. Acoust. 2026, 241, 111035. [Google Scholar] [CrossRef]
Thirumalaiselvi, A.; Sasmal, S. Machine Learning-Based Acoustic Emission Technique for Corrosion-Induced Damage Monitoring in Reinforced Concrete Structures. Eng. Appl. Artif. Intell. 2024, 137, 109121. [Google Scholar] [CrossRef]
Langone, R.; Reynders, E.; Mehrkanoon, S.; Suykens, J.A.K. Automated Structural Health Monitoring Based on Adaptive Kernel Spectral Clustering. Mech. Syst. Signal Process. 2017, 90, 64–78. [Google Scholar] [CrossRef]
Sibil, A.; Godin, N.; R’mili, M.; Maillet, E.; Fantozzi, G. Optimization of Acoustic Emission Data Clustering by a Genetic Algorithm Method. J. Nondestruct. Eval. 2012, 31, 169–180. [Google Scholar] [CrossRef]
Li, Z.; Huang, Y.; Wu, X.; Wang, X.; Su, H. Acoustic-emission study of freeze-thaw damage of hydraulic concrete due to sustained freezing at minimum temperature. Constr. Build. Mater. 2025, 498, 143950. [Google Scholar]
GB/T 50010-2010; Standard for Design of Concrete Structures. China Architecture and Building Press: Beijing, China, 2010.
GB/T 50082-2024; Standard for Test Methods of Long-Term Performance and Durability of Concrete. China Architecture and Building Press: Beijing, China, 2024.
Dempster, A.P.; Laird, N.M.; Rubin, D.B. Maximum Likelihood from Incomplete Data via the EM Algorithm. J. R. Stat. Soc. B 1977, 39, 1–22. [Google Scholar] [CrossRef]

Figure 1. Schematic diagram of reinforcement arrangement for experimental beam (unit: mm).

Figure 2. EXpress8Acoustic emission acquisition system (Physical Acoustics, West Windsor Township, NJ, USA).

Figure 3. Layout of acoustic emission sensors (Unit: mm).

Figure 4. Layout of loading setup for concrete beam.

Figure 5. Photographs of freeze–thaw test process.

Figure 6. Electrochemical corrosion test: (a) setup; (b) process.

Figure 7. Plot of kernel function mapping.

Figure 8. Schematic diagram of selected AE parameters.

Figure 9. Boxplots of the original AE features: (a) rise time, (b) ring-down count, (c) energy, (d) signal duration, (e) average frequency, (f) counts to peak, and (g) amplitude.

Figure 10. Pearson correlation heatmap of normalized AE features.

Figure 11. Three-dimensional distribution of dataset after PCA dimensionality reduction.

Figure 12. Clustering results of four methods: (a) K-Means, (b) FCM, (c) GMM, and (d) COA-KKM.

Figure 13. Convergence curves of the COA optimization process.

Figure 14. Overall performance and stability of different clustering methods.

Figure 15. Mean values and standard deviations of class-level F1-scores for different clustering methods.

Figure 16. Confusion matrix heatmaps of different clustering methods: (a) K-Means, (b) FCM, (c) GMM, and (d) COA-KKM.

Figure 17. Feature heatmap of AE signal groups identified by COA-KKM.

Table 1. Test conditions of specimens.

Specimen No.	Degradation Mode	Test Purpose
L1	Load	Acquisition of acoustic emission signals under loading
L2	Freeze–Thaw	Acquisition of acoustic emission signals under freeze–thaw
L3	Corrosion	Acquisition of acoustic emission signals under corrosion

Table 2. Initial parameters of acoustic emission system.

Parameter	Value
Sampling frequency	1 MHz
Sampling Length (Number of Sampling Points)	2048
Filter	20–100 kHz
Threshold	40 dB
Peak Discrimination Time (PDT)	50 μs
Hit Identification Time (HIT)	100 μs
Hit Lockout Time (HLT)	300 μs
Built-in Amplifier	40 dB

Table 3. Parameter settings of COA-KKM algorithm.

Parameter	Value
Number of coyote groups	4
Number of coyotes in each group	5
Maximum number of COA iterations	20
Number of reference datasets for Gap Statistic, B	20
Candidate kernel functions	RBF, polynomial, linear
Search range of kernel type code	[1, 3]
Search range of RBF kernel width σ	[0.1, 10]
Search range of polynomial degree d	[1, 5]
Search range of polynomial coefficient c	[0, 2]

Table 4. The distribution characteristics of the dataset.

Damage Type	Number	Rise Time (μs)	Ring-Down Counts	Energy (aJ)	Duration (μs)	Amplitude (dB)	Average Frequency (kHz)	Counts to Peak
Freeze–thaw	151	1–904	31–160	2–95	237–4174	42–71	15–131	1–31
Corrosion	151	11–3336	31–78	20–177	1027–22,113	43–65	3–31	1–58
Load	188	6–13,124	201–588	66–18,680	3318–17,589	59–99	21–75	1–470

Table 5. Absolute loadings of first four principal components.

Feature	PC1	PC2	PC3	PC4
Rise time	0.1848	0.2579	0.4871	0.0492
Ring-down counts	0.6005	0.2356	0.2393	0.0044
Energy	0.1445	0.0100	0.3135	0.7574
Duration	0.4542	0.1753	0.0602	0.4303
Amplitude	0.5718	0.1359	0.5355	0.4659
Average frequency	0.0735	0.9014	0.2441	0.1208
Peak frequency	0.2138	0.1268	0.5072	0.0837

Table 6. Clustering performance of baseline methods before and after PCA dimensionality reduction.

Algorithm	Original 7 Features ACC (%)	Original 7 Features ARI	PC1–PC4 ACC (%)	PC1–PC4 ARI
K-means	81.22 ± 0.00	0.6455 ± 0.0000	82.51 ± 0.45	0.6603 ± 0.0052
FCM	71.94 ± 22.27	0.6328 ± 0.2036	75.84 ± 20.66	0.6571 ± 0.1861
GMM	91.88 ± 3.52	0.7723 ± 0.0933	90.37 ± 7.73	0.7885 ± 0.0730

Table 7. Summary of optimized kernel function selection results.

Kernel Type	Selection Frequency	Optimized Parameter	Mean ACC	Mean ARI	Mean SC
RBF	9/10	0.1706–0.2363	94.15%	0.8407	0.3836
Polynomial	1/10	c = 2, e = 2	81.22%	0.6482	0.5796
Linear	1/10	—	—	—	—

Table 8. Mean and standard deviation of overall clustering metrics.

	ACC (%)	ARI	SC
K-Means	82.51 ± 0.45	0.6603 ± 0.0052	0.5680 ± 0.0036
FCM	75.84 ± 20.66	0.6571 ± 0.1861	0.5121 ± 0.0432
GMM	90.37 ± 7.73	0.7885 ± 0.0730	0.3336 ± 0.1026
COA-KKM	92.86 ± 4.19	0.8215 ± 0.0662	0.4032 ± 0.0626

Table 9. Statistical significance test results based on 10 repeated runs.

Metric	Comparison	Mean Difference	p-Value
ACC	COA-KKM vs. K-means	+10.35%	0.0039
	COA-KKM vs. FCM	+17.02%	0.0020
	COA-KKM vs. GMM	+2.49%	0.0645
ARI	COA-KKM vs. K-means	+0.1612	0.0020
	COA-KKM vs. FCM	+0.1644	0.0039
	COA-KKM vs. GMM	+0.0330	0.0566

Table 10. Mean and standard deviation of F1-scores for each damage type.

	K-Means	FCM	GMM	COA-KKM
Freeze–thaw	0.6108 ± 0.0124	0.5117 ± 0.4404	0.8860 ± 0.0792	0.8759 ± 0.1091
Corrosion	0.7809 ± 0.0048	0.8004 ± 0.1100	0.8067 ± 0.2494	0.9214 ± 0.0538
Load	0.9944 ± 0.0008	0.8760 ± 0.1532	0.9717 ± 0.0039	0.9706 ± 0.0158

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, X.; Feng, X.; Yi, F.; Wang, Y. COA-Optimized Kernel K-Means Clustering for Identifying Acoustic Emission Signals Associated with Different Damage Types in RC Beams. Buildings 2026, 16, 2617. https://doi.org/10.3390/buildings16132617

AMA Style

Wang X, Feng X, Yi F, Wang Y. COA-Optimized Kernel K-Means Clustering for Identifying Acoustic Emission Signals Associated with Different Damage Types in RC Beams. Buildings. 2026; 16(13):2617. https://doi.org/10.3390/buildings16132617

Chicago/Turabian Style

Wang, Xianqiang, Xiaonan Feng, Fan Yi, and Yaoxuan Wang. 2026. "COA-Optimized Kernel K-Means Clustering for Identifying Acoustic Emission Signals Associated with Different Damage Types in RC Beams" Buildings 16, no. 13: 2617. https://doi.org/10.3390/buildings16132617

APA Style

Wang, X., Feng, X., Yi, F., & Wang, Y. (2026). COA-Optimized Kernel K-Means Clustering for Identifying Acoustic Emission Signals Associated with Different Damage Types in RC Beams. Buildings, 16(13), 2617. https://doi.org/10.3390/buildings16132617

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

COA-Optimized Kernel K-Means Clustering for Identifying Acoustic Emission Signals Associated with Different Damage Types in RC Beams

Abstract

1. Introduction

2. Materials and Experiments

2.1. Specimen Preparation

2.2. Acoustic Emission Monitoring System

2.3. Acoustic Emission Signal Acquisition Test

2.3.1. Reinforced Concrete Beam Load-to-Failure Test

2.3.2. Freeze–Thaw Damage Test

2.3.3. Rebar Corrosion Test

3. Model Establishment

3.1. Clustering Analysis

3.1.1. K-Means Clustering Algorithm

3.1.2. Fuzzy C-Means Clustering Algorithm

3.1.3. Gaussian Mixture Model

3.2. Model Modification

3.2.1. Kernel Clustering

3.2.2. COA Kernel Optimization K-Means Clustering Method

3.2.3. Parameter Settings

3.3. Dataset Construction

3.3.1. AE Feature Dataset

3.3.2. Data Pretreatment

3.4. Performance Evaluation

4. Result

4.1. Clustering Visualization

4.2. Effect of PCA Dimensionality Reduction on Baseline Methods

4.3. COA Optimization Results

4.3.1. Convergence Behavior of COA Optimization

4.3.2. Optimized Kernel Function and Parameter Analysis

4.4. Clustering Performance Evaluation

4.4.1. Clustering Performance Comparison

4.4.2. Statistical Significance Analysis

5. Discussion

5.1. Class-Level Performance

5.2. Feature Interpretation

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI