5.1. Analysis of Crack Prediction Results
Figure 8 shows the error bar chart of crack length prediction results under different strain input conditions before strain compensation. The input strain conditions are mainly divided into three types: X represents the input from the
x-axis strain alone, Y represents the input from the
y-axis strain alone, and XY represents the input from all strain channels in both the
x and
y axes. Peak represents the peak value of the dynamic strain, while Valley represents the valley value of the dynamic strain. For example, X-Peak represents the peak value of the dynamic strain along the
x-axis of the input; X-Valley represents the valley value of the dynamic strain along the
x-axis of the input. The chart compares the prediction results for CT specimens with crack lengths of 5 mm, 15 mm, 25 mm, 35 mm, and 45 mm.
Figure 9 shows the error bar chart of prediction results under different input conditions after strain compensation. From
Figure 8 and
Figure 9, it can be observed that the prediction error under the XY input condition is smaller than that under the X and Y strain input conditions.
Table 7 and
Table 8 present a comparison of crack length prediction relative percent errors (RPEs) before and after strain compensation, respectively. The RPE calculation method is shown in the Formula (8).
where
represents the predicted value,
represents the actual value.
The data in the tables indicate that the RPE decreases as the crack length increases. When the crack length exceeds 25 mm, the RPE after strain compensation is generally maintained within 15%. Moreover, when the input utilizes XY strain data, the RPE remains largely within 10%.
Table 9 demonstrates the percentage reduction in RPE after strain compensation compared to that before compensation. The data in the table show that strain compensation effectively reduces the RPE in the majority of cases. The reduction is most pronounced when XY strain data are used as input. For crack lengths beyond 15 mm, the RPEs for both XY-Peak and XY-Valley are reduced by over 50%. The comparison of data in the tables confirms the importance of both the
x-axis and the
y-axis strain for accurate crack prediction. Uniaxial strain alone is insufficient for accurately predicting structural crack length, while the combination of
x-axis and
y-axis strain provides multi-dimensional signal characteristics that enhance prediction accuracy. Different magnitudes of dynamic loads result in varying crack propagation rates; the higher the dynamic load, the faster the crack grows, and vice versa. Comparing the prediction errors at peak and valley loads is intended to establish the maximum and minimum ranges of prediction error. The prediction error is greatest when the load is at its minimum and smallest when the load is at its maximum. In practice, the strain under the peak value of alternating load is typically selected as the model input to predict crack length.
When the crack length is 5 mm, the prediction errors are significantly large, and in some cases even exceed 100%. This occurs because, with small cracks, the strain values at each measurement point remain very low even under relatively high loads. Low strain values amplify the discrepancy between the measured and FEA strain results. This issue persists even after strain compensation, ultimately leading to increased prediction errors for small crack lengths.
Figure 10 and
Figure 11 illustrate the relationship between RPE and load under different crack lengths. As shown in
Figure 10, for crack lengths of 5 mm and 15 mm, the RPE gradually decreases as the load increases. This phenomenon is more pronounced for X and XY input conditions. Furthermore, after compensation for the strain data, the prediction error is also reduced. As shown in
Figure 11, when the crack length increases to 35 mm or more, the RPE of the XY remains relatively low. And under the XY input condition, for a crack length of 35 mm, the RPE can be maintained within 10% across all load states. Moreover, when the crack grows to 45 mm, the RPE remains within 5% across the entire load range. Here, XY-3 represents the input condition where the strain data consists of
x1–
x3 and
y1–
y3. Analysis of
Figure 10 and
Figure 11 indicates that the prediction performance under the XY-3 input condition is suboptimal. Therefore, based on the analysis of the prediction results, it can be concluded that the contributions of the
x-axis and
y-axis strain from each measurement point vary across different stages of crack growth, and the XY input condition yields the best prediction accuracy for all crack lengths.
5.2. Self-Diagnosis of Strain Sensors
The mean absolute percentage error (MAPE) reflects the deviation between predicted results and actual results, as shown in Formula (9).
In the formula,
represents the predicted value,
represents the actual value, and
n represents the number of samples involved in the calculation. The bar chart in
Figure 12 displays the MAPE of the load predicted by the
DF1 model under different crack lengths, both before and after strain compensation. It can be clearly observed from the figure that the load prediction error for the XY case is significantly reduced after strain compensation. Moreover, as the crack length increases, the MAPE of the load prediction results gradually decreases after strain compensation. Compared to the uncompensated case, the reduction in MAPE for load prediction is smallest at a crack length of 5 mm, with a decrease of 31.84%. At a crack length of 45 mm, the percentage reduction in MAPE is largest, reaching 95.22%. The accuracy of load prediction directly determines the performance of the sensor’s self-diagnostic fault prediction results.
The paper conducted a statistical analysis of the prediction results, including the determination coefficient (R
2), mean absolute error (MAE), mean square error (MSE) and root mean square error (RMSE), for all CT specimens under different loading conditions. These metrics were evaluated using a total of 2300 test samples. The results under X, Y, and XY inputs were compared, as shown in
Figure 13. The comparison demonstrates that the XY case achieved the highest R
2 value of 0.9295, while also yielding the relatively lowest MAE, MSE, and RMSE values of 0.2970, 12.2309, and 3.4973, respectively.
Table 10 and
Table 11 present the improvement in prediction performance achieved by using XY compared to X and Y, before and after strain compensation, respectively. Here, XY_X denotes the improvement of XY over X, and XY_Y represents the improvement of XY over Y. The data in both tables indicate that, both before and after strain compensation, the XY condition consistently yields better prediction performance than either X or Y alone. This also confirms that utilizing XY effectively enhances the model’s prediction accuracy. The calculation methods are shown in Formulas (10)–(13).
where
represents the true value,
represents the predicted value, and
represents the mean of the true values.
To ensure the persuasiveness of the comparison results,
Table 12 presents the changes in various statistical indicators after strain compensation relative to those before compensation. Positive values in the table indicate improvement in the indicators, while negative values indicate deterioration. The data in the table demonstrate that, regardless of whether the input condition is X, Y, or XY, the prediction results improve after strain compensation. For the XY input condition, the R
2 value increases by 18.94%, while the MAE, MSE, and RMSE decrease by 53.94%, 67.74%, and 43.2%, respectively. The data in the table validate the effectiveness of strain compensation in improving prediction performance.
In practical applications, strain gauges are inevitably susceptible to certain degrees of damage, such as degumming or fractures. This necessitates fault diagnosis for strain sensors to guide their replacement, thereby promptly preventing increased errors in crack length predictions caused by sensor failures during testing. If the dynamic strain sequences from the normal service condition of strain sensors are directly used as reference values for damage calculation, the strain sequences under normal service conditions for multiple channels would need to be recorded. This would require building a database of standard strain sequences and comparing measured data sequences with these standard sequences to identify the fault or damage state of the strain sensors. However, such an approach would complicate the entire diagnostic process and reduce efficiency.
In this paper, the crack length and load are used as input conditions, while the strain values of each channel serve as the output to train the DF model, resulting in model
DF2.
DF2 can predict the current strain values of each channel based on the existing crack length and load conditions. Building on the Pearson correlation coefficient, this paper proposes a self-diagnostic coefficient
for strain sensors, as shown in the Equation (14) below. The fault determination of the strain sensor is achieved by comparing the self-diagnosis coefficient of the predicted strain by
DF2 with the value of the actual dynamic strain data.
where
r represents the Pearson correlation coefficient, and its calculation method is shown in Formula (15).
In the formula,
x represents the actual measured strain sensor data sequence,
y denotes the strain data predicted by the
DF2 model, and
xm and
ym represent the mean values of the measured and predicted strain data, respectively. The symbol
p denotes the proportionality coefficient, as shown in Formula (16).
In the formula, xmax and xmin represent the maximum and minimum values of the measured strain sequence, respectively, while ymax and ymin represent the maximum and minimum values of the strain sequence predicted by the DF2.
During actual testing, if a strain sensor experiences debonding or aging failure, the actual measured strain value will significantly decrease under the same load conditions. In this study, 0.2 times the actual strain value is used to simulate strain data from a debonded strain gauge, and setting a specific channel’s strain to a constant value simulates a fractured strain gauge. To evaluate the effectiveness of strain sensor damage diagnosis, two channels of each tested CT specimen were configured with faults, with specific faulty channel settings detailed in
Table 13.
Figure 14 shows the diagnostic results of the strain fault index under the XY input condition. As seen in
Figure 14a, under normal sensor conditions, the strain fault indices for channels
y4 and
y5 are relatively small, resulting in false detections with a misdiagnosis rate of 20%. This occurs because the strain values of channels
y4 and
y5 are generally small and minimally influenced by load. The misdiagnosis rate for degumming conditions is 24%, while for fracture conditions, it is 4%.
Figure 15 presents the calculated strain fault indices for each sensor under the XY-3 input condition.
Figure 15a shows the results under normal sensor conditions, while
Figure 15b displays the results under degumming conditions. Analysis of these figures indicates that the damage index values for degumming sensors are relatively small, allowing clear differentiation of their faulty state, which is represented by darker colors in the heatmap. When a strain sensor fractures, the calculated damage index drops to 0, as shown in
Figure 15c. Under the XY-3 input condition, the fault diagnosis accuracy reaches 100%, with a misdiagnosis rate of 0%. These diagnostic results demonstrate that the dimensionality of the input strain data also influences the damage diagnosis outcome to some extent. Higher input strain dimensionality leads to a higher misdiagnosis rate. In practical monitoring, the XY input condition is used for crack length prediction, while the XY-3 condition can be combined with it for cross-validating strain sensor faults. Based on the self-diagnosis results of the predictive sensors, the diagnostic accuracy of XY-3 is higher than that of XY. This is because the
DF2 model takes both crack length and load as inputs and outputs strain values from multiple measurement points. When the number of output variables is excessively large, the predictive accuracy of the model tends to decrease.