1. Introduction
With the rapid development of science, technology, and engineering fields, the performance evaluation of the fiber optic gyroscope (FOG) has become a crucial aspect of the defense and industrial sectors [
1,
2,
3]. As the core sensing unit of inertial navigation systems, FOG provides attitude references through angular velocity measurements and is extensively used in aerospace, marine navigation, and other engineering applications. The performance of FOG can decline due to aging components and external environmental interference. It is crucial to address this degradation promptly to prevent economic losses and, more importantly, safety hazards [
4,
5,
6]. Additionally, due to the high costs associated with FOG, replacement often surpasses maintenance expenses when failures occur. Therefore, assessing FOG performance is of great importance for enhancing instrument reliability and ensuring the accuracy of engineering decisions.
In the context of engineering practice, sensor-monitored data inevitably incorporates imperfect data, which manifests two distinct features. On one hand, low data quality arises from sensor failures and environmental disturbances. On the other hand, diversity in distribution characteristics emerges due to varied monitoring data types in engineering applications, where imperfect data exhibit considerable complexity as they are more susceptible to external interference. Current traditional evaluation methods that rely on ideal data are not capable of accurately reflecting the true performance of FOG [
7]. Imperfect data lead to the misrepresentation of FOG performance in evaluation models, resulting in the production of unreliable results. Consequently, accounting for the effects of imperfect data is of critical importance in engineering practice [
8].
In recent years, as the industry has continued to shift toward digital development, many intelligent and efficient performance evaluation methods have emerged. Analyzing the current literature shows that the main performance evaluation approaches fall into three key categories: mechanism-based, knowledge-based, and data-driven methods. The mechanism-based method is highly dependent on the internal working mechanism of the instrumentation and requires an explicit physical or mathematical model to describe the relationship between inputs and outputs. For example, Yuan et al. [
9] combined event tree analysis, computational fluid dynamics simulation, and evacuation modeling to assess the risk of toxic gas leakage in a chemical plant. Li et al. [
10] proposed an implicit Kalman filtering method to predict the remaining service life of a system and evaluate its health status. The knowledge-based method utilizes the valuable experience accumulated over time by experts in the field of navigation instruments to assess FOG performance. For example, Jo et al. [
11] developed a probabilistic safety evaluation method for personnel reliability analysis by incorporating operator action time. Rahim et al. [
12] developed a deep learning-based engine health monitoring system using a hybrid CNN-BiGRU model, which analyzes sensor data for accurate fault detection. The data-driven method does not require prior knowledge of the FOG’s working mechanisms; instead, performance evaluation is performed by analyzing large amounts of observational data, leading to its widespread use in assessments. Che et al. [
13] developed a predictive health assessment framework for lithium-ion batteries using probabilistic degradation prediction and accelerating aging detection.
As navigation instruments become increasingly complex and their working principles harder to understand, mechanism-based methods are rarely used; due to the accumulated knowledge and experience of domain experts, the knowledge-based method is a convenient and useful evaluation approach, but the results are influenced by subjective judgment from experts. Recently, data-driven methods have gained popularity in performance evaluation. However, most data-driven models have a “black-box” nature, which reduces interpretability in the evaluation process. Additionally, this approach depends on high-quality data and struggles with evaluating navigation instruments when the data is imperfect. On one hand, as the reliability of navigation instruments improves, high-value data becomes scarcer. On the other hand, the accuracy of data-driven evaluation models is affected by the presence of imperfect data.
In summary, existing methods are unable to address the issue of insufficient evaluation accuracy when the data is mostly imperfect. Therefore, a new evaluation framework based on a belief rule base (BRB) is proposed. BRB was introduced by Yang et al. in 2006 [
14]. As a modeling approach using “if-then” belief rules, BRB incorporates the concept of belief degrees during inference, representing all possible results in the form of belief distributions. This structure provides BRB with dual advantages: it combines qualitative knowledge with quantitative data and allows for the incorporation of expert knowledge and domain expertise through its rule-based format, ensuring model interpretability [
15]. As a knowledge–data-hybrid-driven model, BRB has demonstrated effectiveness in performance evaluation applications [
16,
17,
18,
19]. For example, Li et al. [
20] proposed a dynamic performance evaluation method for high-value complex systems based on BRB. Cheng et al. [
21] developed a multidiscounted BRB model for health status assessment in large complex electromechanical systems.
Currently, BRB-based research for imperfect data has mainly focused on low-quality data and non-uniformly distributed data. For low-quality data, Lian et al. [
22] used interval conversion to describe sensor interference, which improved the accuracy and robustness of BRB, but the reliability difference among evaluation attributes was not directly quantified. Feng et al. [
23] introduced SNR as a quality factor to reduce the influence of noisy data, but this description is closely related to signal–noise characteristics and is not always applicable to different types of monitoring attributes. For non-uniformly distributed data, traditional BRB models usually rely on Yang’s rule-based utility conversion, which is more suitable for uniformly distributed data. To improve the information transformation process, Liu et al. [
24] and Lian et al. [
25] introduced Gaussian and idempotent membership functions into BRB, respectively. Han et al. [
26] proposed BRB-RAMF by considering random variables of adaptive membership functions, in which the adaptive coefficient is treated as a random variable to improve the flexibility of membership function adjustment. These studies have enhanced BRB modeling under imperfect data from different perspectives. However, quality-oriented methods mainly describe the reliability of input data, while membership function-oriented methods mainly improve the matching between numerical inputs and referential values. The influence of attribute reliability on rule activation and the influence of data distribution on information transformation are still not sufficiently considered in a unified reasoning process.
This limitation is particularly important in FOG performance evaluation. On the one hand, low-quality measurements caused by environmental interference may reduce the reliability of evaluation attributes and distort the inferred performance state. On the other hand, the numerical values of monitoring attributes are often non-uniformly distributed, typically showing local density and overall sparsity within adjacent reference intervals. Under such conditions, conventional membership functions may produce inaccurate information transformation because they cannot adequately reflect the actual distribution pattern of the data. For example, when most samples cluster near one reference value while only a few are distributed near the other, the same numerical distance does not necessarily imply the same membership degree. Moreover, even if the distribution of input data is considered, attributes with poor stability may still participate in rule activation in the same way as reliable attributes. Therefore, a BRB-based performance evaluation model is needed in which data quality affects rule activation and data distribution affects the transformation from numerical inputs to belief degrees.
To address the above issues, a new adaptive BRB model for data quality and distribution (ABRB-QD) is proposed for FOG performance evaluation under imperfect data. In this model, the stability of monitoring data is used to calculate the quality factor of each evaluation attribute, and this factor is introduced into rule activation to reduce the influence of low-quality inputs. Meanwhile, the distribution characteristics of monitoring data are used to construct an adaptive membership function, so that non-uniformly distributed data can be transformed into belief degrees more reasonably. In this way, ABRB-QD retains the interpretability of BRB while further considering both attribute reliability and distribution-adaptive information transformation in the reasoning process.
The main work and contributions are as follows:
- (1)
An ABRB-QD framework is proposed for FOG performance evaluation under imperfect data. The framework considers both attribute reliability and data distribution in the BRB reasoning process.
- (2)
A stability-based data quality calculation method is proposed to quantify the reliability of evaluation attributes. The obtained quality factor is introduced into rule activation to reduce the influence of low-quality monitoring data.
- (3)
An adaptive membership function is developed according to the distribution characteristics of monitoring data, improving the transformation of non-uniformly distributed inputs into belief degrees.
The main structure of this paper is as follows.
Section 2 presents the basic definition, problem formulation, and performance evaluation model.
Section 3 describes the model construction, inference process, and optimization of model parameters for ABRB-QD. Finally,
Section 4 validates the proposed method using a performance evaluation case of a FOG.
Section 5 concludes the paper.
4. Case Study
In this section, a case study on the performance evaluation of FOG is conducted to validate the practical value of the proposed method.
Section 4.1 introduces the brief background of fiber optic gyroscope performance evaluation.
Section 4.2 calculates the quality factor and membership functions using the proposed method in this paper. In
Section 4.3, a performance evaluation model based on ABRB-QD is established and optimized. In
Section 4.4, comparative studies are performed.
Section 4.5 analyzes and discusses the results.
4.1. Background Description
In the experimental setup, the FOG was horizontally mounted on an isolated foundation to sense the vertical component of the Earth’s rotation rate. The test site was located at a latitude of 34°15′ N. During data acquisition, the gravity acceleration at the test site was 9.7967 m/s
2, and the interval between two consecutive tests was 6 h. The corresponding FOG and its test equipment are shown in
Figure 8.
To determine the evaluation attributes of FOG, this paper introduces frequency difference analysis. The model of FOG’s input angular velocity
and output frequency difference
is as follows:
where
denotes the output frequency difference in the FOG,
represents the input angular velocity,
is the scaling factor, also known as the first-order drift coefficient, and
denotes the zeroth-order drift coefficient. The scaling factor
is related to the structural parameters of the FOG and can be expressed as
, where
represents the area enclosed by the FOG ring resonator loop,
denotes the loop perimeter, and
is the laser wavelength. As shown in Equation (17), the output frequency difference
is determined by the input angular velocity
, the scaling factor
, and the zeroth-order drift coefficient
. Therefore, by measuring multiple sets of
and
, the values of
and
can be obtained. The specific formula is as follows:
The values of
and
are
Since
and
are derived from the raw measurements
,
,
and
, their measurement uncertainties should be considered. Let
. Based on Equation (19), the first-order uncertainty propagation of
can be expressed as
Similarly, the uncertainty propagation of
can be approximated as
The above equations show that the propagated uncertainties of and are related to the denominator . When the two operating angular velocities are too close, the uncertainty may be amplified, leading to an ill-conditioned derivation. In this study, the two angular velocity operating points were selected with sufficient separation, and the angular velocity difference was much larger than the angular velocity control uncertainty. Therefore, the calculation of and is assumed to be well-posed under the present experimental conditions.
From the above frequency difference analysis, it can be seen that the navigation accuracy of FOG is mainly affected by
and
. Therefore,
and
are selected as the evaluation attributes in this study. This selection has clear physical significance and reflects the dominant performance variation of the considered FOG under the current conditions. The gravity acceleration at the test site is 9.7967, with a sampling interval of 6 h. The experiment obtained 180 test datasets for
and
through sensors, as shown in
Figure 9.
A key advantage of ABRB-QD is its flexibility in attribute selection. While these two attributes are used here, other relevant indicators can also be included depending on the application scenario and available engineering knowledge.
4.2. Calculation of Data Quality Factors and Initial Membership Functions
Based on FOG’s test results,
and
represent long-term test data. According to the steps in
Section 3.1, the data quality factors
and
for the two evaluation attributes
and
are obtained as follows:
To partition the SAMF intervals, first provide the reference values and reference levels of FOG’s evaluation attributes. Based on expert experience,
and
are categorized into four reference levels: High (H), Sub-high (SH), Medium (M), and Low (L), as illustrated in
Table 2.
Based on the SAMF computation process in
Section 3.2 and FOG test data,
Figure 10 shows the data distribution and initial membership functions of
and
.
4.3. Construction and Optimization of the ABRB-QD-Based Performance Evaluation Model
In this experiment, the FOG was used to measure the vertical component of the Earth’s rotation rate. Since the vertical component of the Earth’s rotation rate at a fixed test site is theoretically constant, the difference between the FOG measurement and the theoretical Earth-rate component can be regarded as the measurement error of the gyroscope. Therefore, the measurement accuracy of the FOG is used as the basis for assigning its performance state.
According to the distribution of the Earth-rate measurement errors, four reference performance levels are defined: I (Excellent), II (Good), III (Moderate), and IV (Poor). A smaller measurement error indicates better FOG performance. In this study, the measurement error of 2.2 × 10
−5 °/s is taken as the reference value of Level I, while the measurement error of 10.1 × 10
−5 °/s is taken as the reference value of Level IV. The reference values of Levels II and III are obtained by equal-interval partitioning between these two boundary values. For model inference, the four performance levels are further mapped to normalized referential values of 1, 0.67, 0.33, and 0, respectively, as shown in
Table 3.
Since the measurement error of each sample varies continuously during the experiment, the real performance state used in
Figure 11 and in the MSE calculation is represented as a continuous normalized score. Specifically, the real performance state of each sample is obtained by linear interpolation between the two nearest reference measurement errors and their corresponding normalized referential values. If the measurement error is smaller than 2.2 × 10
−5 °/s, the real performance state is set to 1; if it is larger than 10.1 × 10
−5 °/s, the real performance state is set to 0. In this way, the ground-truth performance state used for MSE calculation is consistent with the continuous expected utility output of the BRB model.
Based on the above referential values, the initial rule weights are set to 1, and the initial BRB is established according to expert knowledge, resulting in 16 initial rules. The initial ABRB-QD settings are illustrated in
Table 4.
Since the initial ABRB-QD model fails to achieve the desired evaluation accuracy, the optimization model in Equation (16) is trained using the P-CMAES algorithm. Here, P-CMAES is adopted as an optimization tool for parameter training of the proposed ABRB-QD model, as commonly used in BRB-related studies [
17,
20,
26]. The number of iterations for P-CMAES was set to 500, with 90 random sets of test data selected as the training set and the remaining 90 sets used as the test set. The real performance state used in
Figure 11 and in the MSE calculation is obtained by applying the above piecewise linear mapping to the Earth-rate measurement error calculated from the FOG measurement and the theoretical Earth-rate component.
As shown in
Table 5, some rules with uniform initial belief degrees are adjusted after optimization. For example, rules 4 and 13 correspond to cases where
and
indicate different performance tendencies. Such cases are relatively rare in the available samples and are also difficult for experts to assess with high confidence; therefore, uniform belief degrees are adopted in the initial rule base to represent this uncertainty. After optimization, the corresponding belief degrees are further calibrated according to the available data and the evaluation accuracy objective. For the FOG considered in this study, when either
or
becomes relatively large, the evaluation result tends to assign more belief to worse performance levels, which is consistent with the physical understanding that larger error-related coefficients generally indicate poorer gyroscope performance.
As shown in
Figure 11, the evaluation performance of the ABRB-QD model improved significantly after training.
Figure 11 presents a representative experimental run to illustrate the performance state evaluation results before and after model training. In this representative run, compared with the initial ABRB-QD model, the optimized ABRB-QD model reduces the MSE from 3.28 × 10
−2 to 7.11 × 10
−4, corresponding to a 97.8% reduction in the evaluation error. The evaluation results of the optimized model are therefore closer to the real performance state.
To further examine the sensitivity of the normalization parameter in Equation (5), an additional comparative experiment was conducted in the case study. Specifically, the original half-range-based definition of was replaced by two commonly used scale measures, namely the standard deviation and the interquartile range (IQR), while the remaining model settings were kept unchanged. For each normalization choice, the ABRB-QD parameters were re-optimized under the same training and testing protocol. The obtained AMSE values are 7.11 × 10−4, 7.83 × 10−4, and 8.49 × 10−4 for the half-range, standard deviation, and IQR definitions, respectively. The results indicate that the evaluation accuracy changes only slightly under different normalization choices. Therefore, the proposed ABRB-QD method is relatively robust to the definition of , and the half-range-based normalization is retained in this study.
4.4. Comparative Study
To further evaluate the effectiveness of ABRB-QD, comparative experiments are conducted from two perspectives: different BRB evaluation models and data-driven models. The evaluation metrics include the average MSE (AMSE) and the standard deviation (STD) of twenty repeated experiments. AMSE is used to describe the average prediction error, while STD is used to measure the dispersion of the results across repeated runs. Therefore, these two metrics provide statistically descriptive information on both the accuracy and stability of the compared models.
- a.
Different BRB evaluation models
- (1)
The BRB model with a triangular membership function, called BRB-t. The input information transformation function is shown in Equation (6).
- (2)
The BRB model with a Gaussian membership function, called BRB-g. The input information transformation function is shown in Equation (10).
- (3)
The BRB model with an idempotent membership function, called BRB-n. The input information transformation function is shown in Equation (11), and the values of s in the experiment are shown in
Table 6 and
Table 7.
- (4)
The BRB model with adaptive maximum likelihood ratio considering random variables is called BRB-RAMF. The membership function of BRB-RAMF resembles SAMF, differing in that SAMF determines parameters based on data distribution, while BRB-RAMF employs the maximum likelihood ratio for parameter determination.
The above five BRB models were optimized using the same P-CMAES algorithm under a unified optimization setting and then compared with the proposed ABRB-QD. In this way, the optimization procedure was kept consistent across the BRB-type models. The statistical evaluation results of the five BRB models over twenty repeated experiments are presented in
Table 8, where AMSE and STD are used to compare the average evaluation accuracy and stability, respectively.
- b.
Different data-driven models.
Machine learning methods have been widely applied in performance evaluation. This paper conducts a comparative study on K-Nearest Neighbor (KNN), Backpropagation Neural Network (BPNN), Random Forest (RF), and Naive Bayes (NB) models. The evaluation results are shown in
Table 9.
- (1)
KNN is a machine learning algorithm that does not require training [
27], which finds the k-nearest samples by calculating distances and makes decisions based on the majority class of the samples.
- (2)
BPNN is a neural network that adjusts weights through error backpropagation [
28], where data is transmitted forward from the input layer to the output layer, and connection weights are adjusted in the reverse direction based on the output error.
- (3)
RF is an ensemble learning algorithm [
29], comprising multiple decision trees that collectively vote to determine evaluation results.
- (4)
NB is a probabilistic classification model based on Bayes’ theorem [
30]. It directly calculates the prior and conditional probabilities from the training set and infers the posterior probabilities. The evaluation result for NB can be obtained using Equation (15).
The data-driven baseline models were evaluated under the same training and test setting. Specifically, for KNN, the hyperparameters n_neighbors, weights, and metric were searched in [3, 5, 7, 9], {uniform, distance}, and {euclidean, manhattan}, respectively. For BPNN, the hidden-layer structure was selected from [(64,), (128,), (128, 64)], the activation function was chosen from {relu, tanh}, the initial learning rate was searched in [0.001, 0.01], and the batch size was searched in [32, 64, 128]. For RF, the hyperparameters n_estimators, max_depth, min_samples_split, and min_samples_leaf were searched in [100, 200, 300], [None, 5, 10, 15], [2, 5, 10], and [1, 2, 5], respectively. For NB, the hyperparameter var_smoothing was searched in [10−9, 10−8, 10−7]. All hyperparameter tuning was carried out on the training set using GridSearchCV with five-fold cross-validation.
4.5. Result Analysis and Discussion
In this section, the advantages of the ABRB-QD are analyzed and discussed in conjunction with the results of comparative experiments.
- (1)
Comparison with different BRB evaluation models.
After considering data quality and SAMF, the evaluation accuracy of BRB has improved. This result is well explained, as the triangular membership function of BRB-t cannot reflect the non-uniform distribution of imperfect data and exhibits poor conversion effectiveness. Meanwhile, BRB-g and BRB-n, which show slightly improved data conversion performance, are prone to causing input data fuzzification errors under poor data quality conditions. Therefore, the ABRB-QD model significantly improves evaluation performance under imperfect data. As shown in
Table 8, compared to trained BRB-t, BRB-g, BRB-n, and BRB-RAMF models, the evaluation accuracy improved by 43.6%, 26.3%, 34.3%, and 19.4%, respectively, while evaluation stability increased by 51.5%, 44.2%, 41.0%, and 26.6%. The evaluation performance of ABRB-QD is significantly improved compared to other BRB models.
It is worth noting that, compared with other membership functions, the SAMF can adapt to a broader range of data distributions, especially in cases where data anomalies are more pronounced. This is attributed to its capability to fit data distributions, thereby determining model adjustment parameters to achieve more precise fuzzy processing of quantitative data.
- (2)
Comparison with different data-driven models.
The limited number of high-value samples in FOG exposes the limitations of data-driven models. Since the training set may not fully represent the overall mapping relationship, data-driven models are generally prone to overfitting. Among these, RF demonstrates advantages through dual randomness in feature and sample selection, thereby improving model performance. As shown in
Table 9, compared with post-training KNN, NB, BPNN, and RF models, ABRB-QD achieves evaluation accuracy improvements of 32.5%, 48.1%, 38.7%, and 27.9%, respectively, along with stability enhancements of 33.4%, 42.9%, 38.6%, and 26.1%. These results indicate that ABRB-QD achieves lower average prediction error and better stability than the compared data-driven models under the revised evaluation protocol.
These results demonstrate the advantage of expert knowledge in small-sample modeling, where experts provide an interpretable initial model based on domain knowledge and experience, and the evaluation accuracy is further improved by combining this prior knowledge with the available small-sample data. The computational cost of ABRB-QD mainly depends on the size of the rule base and the number of optimization iterations. In the current experiments, the approximate training time is about 7–20 s, depending on initialization and optimization. As the number of attributes or referential levels increases, the number of rules grows accordingly, which may make inference and training more time-consuming in higher-dimensional cases. Under these experimental conditions, the results primarily demonstrate the effectiveness of ABRB-QD with the two selected evaluation attributes. Future work will further explore additional FOG samples and evaluation attributes to assess the method’s applicability in more complex scenarios.
5. Conclusions
In this paper, a novel BRB-based method is proposed for the performance evaluation of FOG operating under imperfect data conditions. This approach effectively mitigates the impact of low-quality data on evaluation results while capturing the actual distribution characteristics of the data, offering a potentially useful solution for the considered FOG performance evaluation problem.
The main contributions of this paper can be summarized as follows: First, to address the issue of poor data quality, the calculation method of the data quality factor based on data stability is proposed, where quality factors are integrated into the model to reflect the reliability of evaluation attributes. Second, to resolve uneven data distribution, the SAMF is designed for fuzzy processing of quantitative data. To demonstrate the effectiveness of the ABRB-QD model, a FOG performance case study is conducted. The model achieves a minimum MSE of 7.11 × 10−4, demonstrating not only superior accuracy but also enhanced practical utility compared to conventional methods. Third, a parameter optimization model is developed to improve the evaluation accuracy of ABRB-QD.
It is worth noting that SAMF has limited capability in representing more complex data distributions. Therefore, using fewer membership function model parameters to reflect complex distribution situations with imperfect data and improve the model’s generalization ability remains an important challenge. In addition, as the number of attributes and referential levels increases, the growth of the rule base may limit the computational scalability of the proposed method. Under the current experimental conditions, the results primarily demonstrate the effectiveness of ABRB-QD in the present two-attribute FOG performance evaluation case, and further validation with additional FOG samples and evaluation attributes will be considered in future work. Future work will also focus on improving the expressive ability of the membership function, enhancing computational scalability, analyzing uncertainty propagation more rigorously, and validating the proposed method on more FOG systems.