Adaptive Belief Rule Base Modeling of Complex Industrial Systems Based on Sigmoid Functions

Huang, Haolan; Feng, Shucheng; Li, Jingying; Guan, Tianshu; Zhu, Hailong

doi:10.3390/e27111157

Open AccessArticle

Adaptive Belief Rule Base Modeling of Complex Industrial Systems Based on Sigmoid Functions

by

Haolan Huang

^1,†,

Shucheng Feng

^1,†,

Jingying Li

¹

,

Tianshu Guan

² and

Hailong Zhu

^1,*

¹

The School of Computer Science and Information Engineering, Harbin Normal University, Harbin 150025, China

²

The School of Software, Dalian University of Foreign Languages, Dalian 116044, China

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Entropy 2025, 27(11), 1157; https://doi.org/10.3390/e27111157

Submission received: 10 September 2025 / Revised: 9 November 2025 / Accepted: 13 November 2025 / Published: 14 November 2025

(This article belongs to the Section Complexity)

Download

Browse Figures

Versions Notes

Abstract

In response to the challenges posed by multifactorial nonlinear relationships and uncertainties, and to address the limitations of the existing Belief Rule Base (BRB) in nonlinear fitting, uncertainty representation, and parameter optimization, this paper presents an improved reliable modeling method using a nonlinear belief rule base (R-NBRB). First, the linear inference mechanism is replaced by a smooth nonlinear S-function. This replacement better adapts to nonlinear dynamics in complex industrial systems. Second, attribute reliability is quantified through a reliability assessment method. Data, reliability, and expert knowledge are integrated using the Evidential Reasoning (ER) algorithm. Uncertainty is expressed in the form of belief degrees. Finally, the Covariance Matrix Adaptation Evolution Strategy (CMA-ES) algorithm is applied to optimize the inference parameters. Decision bias caused by insufficient expert knowledge is thereby reduced. Experiments were conducted on a task involving the detection of a petroleum pipeline leak. The mean squared error (MSE) of the R-NBRB model is only 0.2569. This represents a 28.24% reduction compared with the BRB model. The proposed method’s effectiveness and adaptability in complex industrial situations are confirmed.

Keywords:

complex industrial systems; nonlinear belief rule base; evidential reasoning

1. Introduction

Complex industrial systems often show nonlinear dynamics and operate in uncertain environments. They also face practical challenges like sensor noise, equipment wear, and external disturbances [1]. As modern industrial systems continue to grow in scale and structural complexity, traditional modeling methods face significant limitations in addressing nonlinear dynamic responses and uncertain factors. Therefore, in critical fields such as industrial automation, smart manufacturing, and energy transportation, there is an urgent need to develop new models that integrate reliability and generalizability to accurately describe their dynamic characteristics. This has become a crucial requirement and a fundamental basis for enhancing operational safety and achieving precise control and intelligent decision-making [2].

The modeling process for complex industrial systems focuses on reliably assessing system states. Its outputs help with intelligent judgement and decision support in operational environments. In the context of complex systems, the term “decision-making” specifically refers to the model’s ability to discriminate and reason about system operational states under uncertain conditions. This capability gives a solid basis for tasks like risk warnings and maintenance strategies. It acts as a key link between system monitoring and operational actions. Thus, the decision-making discussed in this study emphasizes intelligent inference on the basis of state assessment of complex industrial systems, reflecting a complete technical chain from data perception to operational decision-making [3]. This theory matches the needs of state cognition in complex systems. It also shows the practical benefits of intelligent modeling methods in industrial operations and maintenance.

Currently, the research methods in this field can be divided into three main categories: physics model-driven, data-driven, and hybrid-driven models.

(1): Physics model-driven model

The physics model-driven model is a methodology. It builds models using physical laws and mechanisms. Then, it analyses the system and makes predictions. Its core does not rely on data analysis. It describes the internal operation mechanism of the system through mathematical equations. A fuzzy model-based observer was proposed by Baigzadehnoe et al. [4] to handle the fault detection problem of nonlinear networked control systems. A physics-of-failure model including side reactions was established by Li et al. [5]. The parameter evolution of the model is analyzed to construct fault boundaries. The effective diagnosis of multiple internal aging faults in lithium-ion batteries is realized in this way. Choudhury et al. proposed a fault diagnosis approach that integrates the squared envelope spectrum of bearing vibration signals with a pre-trained convolutional neural network [6]. This method allows for the effective extraction of second-order cyclostationary features via 2D image representation, enabling efficient fault diagnosis. However, physics models usually ignore the statistical laws or empirical knowledge hidden in historical data. They rely only on equation derivation.

(2): Data-driven model

Data-driven models can operate without relying heavily on prior mechanisms [7]. They mine the operation laws of complex industrial systems from data by analyzing large amounts of data. Excessive dependence on the internal mechanisms of systems is avoided. For example, a data-driven hybrid power system framework was proposed by Wu et al. [8]. Noise reduction via generative adversarial networks and residual learning technology are applied in this framework. High-precision dynamic modeling of single-cell transcriptome data is realized. The dependence on prior knowledge of traditional mechanism models is significantly reduced. The problem of high data demand caused by excessive parameters of data-driven models was addressed by Chattopadhyay et al. [9]. A stochastic model based on a convolutional variational autoencoder is proposed. This model is superior to traditional deterministic models in short-term prediction ability. Research on machine learning and deep learning algorithms was conducted by Soleimani et al. [10]. A new type of data-driven prediction model has been developed. Modeling of the degradation of complex industrial systems, fault detection, and prediction of remaining useful life are realized via this model. However, the accuracy of this method generally relies on the precision and completeness of the data collected from complex industrial systems. Consequently, these methods often struggle to handle system modeling processes with small-sample datasets in complex environments.

(3): Hybrid-driven models

The hybrid-driven modeling approach integrates physical mechanisms with data algorithms, maintaining model interpretability while significantly enhancing modeling efficiency and prediction accuracy, making it particularly suitable for the simulation and optimization of complex industrial systems. A hybrid knowledge system method was proposed by Yuan et al. [11]. This method constructs and performs reasoning within distinct expert knowledge systems, tailored to the specific categories of accessible information. A hybrid model combining gear dynamics equations and a multi-dimensional BRB was designed by Gao et al. [12]. The rule base is initialized via a data-driven approach in this model. Excellent performance is exhibited by the model. The development of a novel Graph Convolutional Network (GCN)-based fault diagnosis technique was presented by Chen’s research team [13]. This method combines available measurements and prior knowledge.

The Belief Rule Base (BRB) is an uncertain hybrid modeling method that can effectively combine quantitative information and expert knowledge [14]. Uncertainty is described in the form of a belief distribution. It has good adaptability to small-sample data. It reduces the dependence of modeling on large-scale data, giving it wide application value in complex industrial system modeling. Currently, the BRB model has been widely used in the modeling process of complex industrial systems [15]. A model that combines interval addition and interpretability constraints was proposed by He et al. [16]. The rule base parameters are optimized via a data-driven approach in this model. Moreover, the traceability of the physics model is retained. A new model based on BRB was proposed by Lian et al. [17]. This model includes sensor disturbances to improve the accuracy and reliability of performance assessments in complex industrial systems. Zhang et al. introduced a fault diagnosis model based on a Micro-Belief Rule Base (MBRB) for complex electromechanical systems [18]. This model is constructed via an enhanced belief rule architecture.

Despite the significant advantages of the BRB method, some challenges in its practical application still need to be addressed. The BRB method uses linear functions for model reasoning [19]. It cannot effectively handle the complex nonlinear relationships between input features and outputs [20]. Moreover, complex industrial systems are highly susceptible to various disturbances in real-world environments and are characterized by substantial uncertainties [21]. The BRB model has constraints in managing such uncertainties and often struggles to address intricate scenarios.

In response to these challenges, the key innovations of the reliable modeling method using a nonlinear belief rule base (R-NBRB) model introduced in this study are outlined below:

(1): The model’s reasoning mechanism employs a smoothly varying S-function to adaptively calibrate the matching degree. The corresponding matching degrees can also produce relatively significant differences. Thus, the degrees of activation of different rules are distinguished more accurately. An optimization algorithm fine-tunes the expert knowledge in the improved S-function and the parameters created during the model reasoning process. The method provides dual assurance of model interpretability and uncertainty reduction.
(2): By embedding attribute reliability into the modeling framework, the proposed method effectively suppresses uncertainties arising from external complex environments during the inference process.
(3): When applied to the representative complex industrial task of petroleum pipeline leak detection, the proposed model not only significantly improves the accuracy of leak identification and decision-making, but also provides a solid foundation for highly reliable decision support, thereby comprehensively enhancing the safety and trustworthiness of system operations.

The structure of this paper is as follows. In Section 2, the problems existing in the complex industrial system modeling process of the R-NBRB model are identified and solved. In Section 3, the reasoning process of the R-NBRB model is explained in detail. In Section 4, a comprehensive analysis of the R-NBRB model is conducted through the case of oil pipeline leakage. Finally, in Section 5, the research content is summarized.

2. Problem Description

Several fundamental challenges emerge when the R-NBRB framework is employed in complex industrial system modeling:

(1): How can complex nonlinear relationships between input features and outputs be handled?

In the BRB model, the linear functions used for reasoning struggle to capture the complex and tightly linked relationships between inputs and outputs [22]. Underfitting, insufficient generalization ability, and low characterization accuracy tend to occur. To address this issue, this paper adjusts the model matching degree calculation process by incorporating S-functions. Input data and expert knowledge are effectively represented. Model decision accuracy is improved. Complex nonlinear relationships between inputs and outputs are handled efficiently. The reasoning process can be expressed by Equation (1).

{α_s}_{i, j} = S (Ω_{E x p e r t}, Ω_{D})

(1)

where set

Ω_{E x p e r t}

, which contains all the expert knowledge used in the BRB model reasoning process, is defined. Set

Ω_{D}

, which represents all the input data required for the decision-making process of complex industrial systems, is specified. The updated matching degree

{α_s}_{i, j}

, which is obtained when the model is constructed via nonlinear functions, is denoted.

(2): How to adapt an S-function to the complex nonlinear relationships between input features and outputs.

The traditional S-function has a fixed mapping function, making it difficult to adapt to complex nonlinear correlations such as multimodality and strong coupling [23]. Its parameters lack adaptability to changes in nonlinear relationships. To tackle these limitations, this paper adds a nonlinear operator. It comes from expert knowledge and is designed for the specific context of the complex industrial system within the S-function framework. The model’s ability lies in accurately characterizing the sophisticated nonlinear input-output relationships inherent to complex industrial system modeling. Moreover, an optimization algorithm is used to adjust the nonlinear operator given by the experts. This operator supports adaptive and dynamic updates during model reasoning, thereby preserving interpretability while concurrently enhancing the precision of the inference process. The adaptive adjustment process of the nonlinear operator within the complex industrial system modeling framework is expressed in Equation (2).

a_p a r a m = C (a_{e x p e r t}, Ω)

(2)

where

C (•)

denotes the optimization function,

a_{e x p e r t}

represents the nonlinear operator provided by experts based on the initial state of the complex industrial system,

Ω

specifies the parameter set to be optimized during the model reasoning process, and

a_p a r a m

represents the nonlinear operator adaptively adjusted by the optimization function in the model reasoning process, which is utilized for the adaptive update of the complex nonlinear fitting between input features and outputs in the complex industrial system modeling process.

(3): How to address the impact of complex environments on the model reasoning process.

The data employed for decision-making in complex industrial systems are often influenced by intricate external conditions, which introduces uncertainty into the model’s reasoning and evaluation procedures [24]. Therefore, in this paper, attribute reliability is integrated into the reasoning process of the nonlinear belief rule base. This integration enhances the reliability and accuracy of the model’s reasoning process, as described by Equation (3):

r_{i} = R (X_{i} (j), ζ, μ_{i})

(3)

where

R (•)

is denoted as the reasoning process of attribute reliability

r_{i}

.

μ_{i}

is denoted as the standard deviation of the

i - t h

input attribute

X_{i}

.

ζ

is denoted as the tolerance coefficient of the input attribute

X_{i}

.

3. Inference Process of the R-NBRB Model

This section is structured as follows. Section 3.1 outlines the overall architecture of the R-NBRB model. Section 3.2 details the specific process of adjusting the matching degree calculation via an S-function integrated with a nonlinear operator. Section 3.3 thoroughly elaborates on the computational methodology for attribute reliability. Section 3.4 provides a detailed reasoning process for complex industrial system decision-making within the R-NBRB framework. Section 3.5 illustrates the overall parameter optimization process of the model. Finally, Section 3.6 provides a comprehensive analysis of the computational complexity of the proposed model.

3.1. Description of the Overall Structure of the R-NBRB Model

In complex industrial system modeling, input attributes are prone to carrying noise due to interference from the external environment during collection. Mutual influence exists between different observation data. The combined effect of these two factors disturbs the decision-making results of complex industrial systems. Additionally, the inherent uncertainty and complex nonlinear relationships between observational data and decision outcomes are often difficult to characterize accurately, which further compromises decision-making accuracy. In response to these challenges, this paper introduces a refined and dependable model founded on the R-NBRB, which offers an effective solution. The overall framework of the proposed model is depicted in Figure 1, and its detailed implementation procedure consists of the following steps:

(1): Calculation of input evidence antecedents on the basis of matching degree

At the initial stage of reasoning, the R-NBRB model calculates the matching degree between the actual system input data and the reference values of each rule in the belief rule base. This process transforms the input values of antecedent attributes into belief distributions across different reference levels by computing their membership degrees relative to the reference value sets in the rule antecedents. The matching degree maps precise numerical inputs into activation intensities for the rule antecedents, providing standardized input evidence for subsequent evidence activation and reasoning. This step serves as the cornerstone of the entire evidential reasoning process.

(2): Nonlinear transformation of evidence via the S-function

After the initial matching degree is obtained, the R-NBRB model performs a nonlinear transformation of the matching degree results by introducing an S-function embedded with nonlinear operators. This aims to accurately capture the deep nonlinear dynamics between input evidence and output results in complex systems. The initial parameters of these nonlinear operators are predefined by domain experts on the basis of current system observation data. This step converts the initial evidence into a nonlinear evidence space that better reflects the essential characteristics of the system, significantly enhancing the model’s ability to fit complex dynamics and improve decision-making accuracy.

(3): Construction of an evidence body with integrated attribute reliability

Following the nonlinear transformation, the R-NBRB model further evaluates the reliability of each input attribute. This reliability quantifies the credibility of the data source or the attribute itself in uncertain environments. The system synthesizes the nonlinearly transformed input data with their corresponding attribute reliabilities to collectively form the initial evidence body. This mechanism directly incorporates data quality and uncertainty into the model’s reasoning chain, ensuring that subsequent evidence fusion processes depend not only on data values but also on the reliability of the data. Consequently, the reliability of decision-making is enhanced at its foundation.

(4): Rule Fusion and Decision-Making Based on Evidential Reasoning (ER)

The R-NBRB model employs the ER algorithm [25] to deeply integrate the rule base, which has been nonlinearly transformed by the S-function, with the input data. Through rigorous logical deduction and evidence synthesis via the ER algorithm [26], the model ultimately generates decision outputs, providing reliable decision results for complex systems.

(5): Collaborative parameter optimization based on Covariance Matrix Adaptation Evolution Strategy (CMA-ES)

To mitigate the uncertainties arising from the subjectivity and cognitive limitations of expert knowledge, the R-NBRB model incorporates the CMA-ES during its reasoning process [27]. The CMA-ES algorithm is employed to collaboratively optimize key parameters in the model as well as nonlinear operators initially set by experts on the basis of empirical knowledge. Through dynamic and adaptive global search, this optimization process enables these parameters to self-adjust according to the system characteristics reflected by actual data, thereby significantly enhancing the model’s reasoning accuracy, robustness, and generalization capability in unfamiliar scenarios.

3.2. Nonlinear Relationship Modeling Based on S-Function

Given the complexity and diversity characteristics of the information processed by the BRB system, the input data of complex industrial systems needs to be converted into a belief distribution structure recognizable by the BRB system. A linear matching method is adopted by the BRB to convert the input data information of complex industrial systems into a belief distribution structure. First, the matching degree between the antecedent feature and the reference value is calculated via this linear matching method. This matching degree is subsequently used as the converted belief structure. The specific implementation process of this linear matching method can be expressed by the following formula [28].

{α_l}_{i, j} = \frac{A_{i, j + 1} - x_{i}}{A_{i, j + 1} - A_{i, j}}, A_{i, j} \leq x_{i} \leq A_{i, j + 1}

(4)

{α_l}_{i, j + 1} = \frac{x_{i} - A_{i, j}}{A_{i, j + 1} - A_{i, j}}, A_{i, j} \leq x_{i} \leq A_{i, j + 1}

(5)

where

{α_l}_{i, j}

represents the matching degree of the input data

x_{i}

under the reference value

A_{i, j}

of the corresponding

i - t h

input attribute.

The matching degree distribution obtained via the linear calculation method is generated on the basis of a fixed linear function. Consequently, this approach can only produce a specific form of matching degree distribution, which cannot be adjusted according to the characteristics of the input data from different complex industrial systems. As a result, the matching degree distribution cannot truly reflect the matching situation among data. The application effect of the belief distribution structure in complex scenarios is restricted. Therefore, an S-function embedded with a nonlinear operator is introduced in this paper. A nonlinear matching method for activated rules is designed to update the original linear method. The aim is to improve the ability of the belief rule base to characterize complex nonlinear relationships by virtue of the nonlinear characteristics of the S-function. A more accurate capture of the matching relationship between the data and reference values is achieved. The matching degree distribution is optimized. Problems such as the sensitivity of linear matching to outliers and the singleness of matching relationships are alleviated. The accuracy and robustness in processing information from complex industrial systems are enhanced. Equation (6) implements the above reasoning process.

\begin{array}{l} α_S_{i, j}^{'} ({α_l}_{i, j}) = 1 - \frac{1}{1 + \exp (- a_p a r a m (ι - {α_l}_{i, j}))} \\ {α_S}_{i, j} = α_S_{i, j}^{'} / \sum_{j = 1}^{M} α_S_{i, j}^{'} \end{array}

(6)

where

{α_s}_{i, j}

represents the conversion of the linear matching degree into a smooth nonlinear matching degree via the S-function and where

ι

represents the symmetric point of the S-function.

3.3. Calculation Process of Attribute Reliability

In complex industrial system modeling, the collected data are easily affected by multiple disturbance factors. As a result, the observed values exhibit uncontrollable fluctuations, thereby undermining the stability and consistency of the data [29]. To address this problem, the R-NBRB model introduces an indicator of attribute reliability. It is used to quantify the degree of influence of interference on observed data. When data show significant variation or error after interference and exceed the preset reasonable range, the data are determined to be unreliable. If such data are directly used in modeling, the accuracy and robustness of the model will be impaired.

The inference and calculation of attribute reliability constitute the core step for the R-NBRB model in handling data uncertainty [30]. The actual observation data of complex industrial systems and the empirical knowledge of domain experts are deeply integrated in this method. The reliability of the input data is quantitatively evaluated. The calculated attribute reliability is embedded into the model reasoning process. The complex industrial system modeling process is optimized in this way. The overall reliability of complex industrial system model construction is improved.

In practical engineering,

r_{i}

is calculated through the proportion of reliable data. Specifically, it is determined by the proportion of valid data relative to the total observed data. A schematic of the reasoning process for attribute reliability within the R-NBRB model is shown in Figure 2 and mainly includes the following parts.

Step 1: The determination of attribute tolerance ranges is performed by experts, who consider the specific realities of the complex industrial system.

The tolerance range is the threshold basis for determining data reliability. Data within this range are regarded as having less interference and are highly reliable. Data beyond this range are classified as seriously interfered and unreliable. During the complex industrial system modeling process, the tolerance coefficient

ζ

is set for the

i - t h

input attribute of the complex industrial system by domain experts on the basis of practical engineering experience. The reasonable tolerance range of this attribute

({\bar{X}}_{i} - ζ μ_{i} \leq X_{i} \leq {\bar{X}}_{i} + ζ μ_{i})

is determined by combining the statistical characteristics of historical observation data, where

{\bar{X}}_{i}

denotes the mean of the observational data for the

i - t h

attribute and where

μ_{i}

represents the standard deviation of the observational data for the

i - t h

attribute.

Step 2: The quantity of reliable data is calculated on the basis of the statistics of actual observation data from complex industrial systems.

After the tolerance range is obtained on the basis of expert knowledge and real data collected from complex industrial systems, the observation data

{x i, 1, x i, 2, \dots, x i, s}

of the

i - t h

attribute are used. Each piece of data contained in the current attribute is judged one by one to check whether it falls within the tolerance range. If the data

x_{i}

fall within the range

({\bar{X}}_{i} - ζ μ_{i} \leq X_{i} \leq {\bar{X}}_{i} + ζ μ_{i})

, these data are proven to be reliable; thus,

x_{i}

is proven to be reliable. If not, these data are proven to be unreliable, and the count

y_{r}

is increased by 1; if not,

y_{r}

remains unchanged. After the statistical results are obtained, the number of reliable data points among the s observations can be acquired

(0 < y_{r} < s)

.

Step 3: The calculation of attribute reliability is performed.

By counting the number of reliable data, the attribute reliability r finally obtained can be expressed as Equation (7).

r_{i} = y_{r} / s

(7)

3.4. Detailed Construction Process of the R-NBRB Model

The R-NBRB model is a rule-based modeling method. Uncertainty is characterized and handled by the belief distribution function. The activated rules are combined via the ER algorithm. The belief distribution serves as a mathematical framework for representing uncertain information, enabling the simultaneous characterization of probability, possibility, uncertainty, and incompleteness. In the R-NBRB model, IF-THEN rules are employed to capture the functional relationships between inputs and outputs. The

k - t h

rule is as follows:

\begin{array}{l} I f x_{1} i s A_{i, 1} \land x_{2} i s A_{i, 2} \land \dots \land x_{j} i s A_{i, j} \\ T h e n {(D_{1}, β_{1, i, j}), \dots, (D_{N}, β_{n, i, j})}, \\ w i t h a r u l e w e i g h t θ_{i, j} \\ a n d a t t r i b u t e w e i g h t s δ_{i} \\ a n d a t t r i b u t e r e l i a b i l i t y r i (i = 1, 2, \dots, M) \end{array}

(8)

where

x_{1}, x_{2}, \dots, x_{j}

are input variables and where

A_{i, j}

(f o r j = 0, 1, \dots, Μ)

represents the reference value corresponding to

x_{i}

in the

k - t h

rule.

Within the R-NBRB modeling framework for complex industrial systems, the ER algorithm is utilized to integrate rule-based inferences, ultimately yielding the final system decision. The detailed procedure is outlined as follows:

Step 1: The model first converts data into corresponding matching degrees through an S-function integrated with a nonlinear operator.

Step 2: On the basis of the calculated matching degrees, the model identifies a set of rules to be activated. This suggests that multi-attribute inputs typically activate multiple rules, each contributing to varying degrees of influence on the reasoning outcomes. Consequently, when determining the weights of activated rules, holistically accounting for the impact of attribute reliability is essential.

{\bar{δ}}_{i} = \frac{δ_{i}}{\max_{i = 1, 2, \dots, M} {δ_{i}}}, (0 ⩽ {\bar{δ}}_{i} ⩽ 1)

(9)

where

{\bar{δ}}_{i}

represents the relative attribute weight of the attribute weight

δ_{i}

.

By incorporating both attribute reliability and attribute weights into the model’s reasoning mechanism, the final calculation for rule weights that considers these factors is presented as Equation (10).

\partial_{k} = \prod_{i = 1}^{M} {(α_{k}^{i})}^{\frac{{\bar{δ}}_{i}}{1 + {\bar{δ}}_{i} - r_{i}}}

(10)

In BRB reasoning, the rule activation weight is used to quantify the “contribution degree” of each rule during the inference process. Consequently, the reliable rule activation weight in the final R-NBRB reasoning can be expressed as:

w_{k} = \frac{θ_{k} \partial_{k}}{\sum_{l = 1}^{L} θ_{k} \partial_{l}} (1 \geq w_{k} \geq 0, \sum_{L}^{k = 1} w_{k} = 1)

(11)

where

θ_{k}

represents the confidence degree of the

k - t h

rule and where

L

represents the total number of rules in the model reasoning process.

w_{k}

denotes the reliable activation weight of the

k - t h

rule, which synthesizes its belief degree and the reliability of relevant attributes, thereby reflecting its integrated contribution to the inference.

Step 3: The ER algorithm is used to aggregate the rules with activated weights. The ER algorithm employs a recursive process of evidence synthesis to integrate the activation weight of each rule with its original belief degree. This process ultimately yields a belief distribution over possible outcomes, enabling a credible inference from multiple rules to a unified decision.

\begin{array}{l} ℑ = 1 - w_{k} \sum_{j = 1}^{N} β_{j} \\ β_{n} = \frac{\prod_{k = 1}^{L} (w_{k} β_{N} + ℑ) - \prod_{k = 1}^{L} ℑ}{[\sum_{n = 1}^{N} \prod_{k = 1}^{L} (w_{k} β_{N} + ℑ)] - (N - 1) \prod_{k = 1}^{L} ℑ - \prod_{k = 1}^{L} (1 - w_{k})} \end{array}

(12)

These obtain the confidence degree

β_{n}

corresponding to the final output result

D_{n}

.

Step 4: Finally, the decision result of the complex industrial system is expressed as Equation (13).

y = \sum_{n = 1}^{N} u (D_{n}) β_{n}

(13)

3.5. Description of the Parameter Optimization Process

To systematically enhance the reasoning accuracy and generalization capability of the R-NBRB model, this study adopts the CMA-ES for the collaborative optimization of its key parameters in response to the significant nonlinear characteristics exhibited by the model’s parameter optimization problem. This algorithm excels at solving complex nonlinear optimization problems. It features quick convergence and strong global search ability. Since the initial parameters of the model are set on the basis of expert domain knowledge, they inherently carry a degree of subjective cognitive uncertainty. Therefore, a data-driven approach is required to calibrate these parameters, ensuring that they better align with the dynamic characteristics of the actual system. The R-NBRB model aims to minimize the mean squared error (MSE) between predicted and actual values. This helps enhance the model’s predictive accuracy.

\begin{array}{l} \min M S E (θ_{i, j}, β_{n, i, j}, δ_{i}) \\ s . t . 0 \leq θ_{i, j} \leq 1, 0 \leq β_{n, i, j} \leq 1, \sum_{n = 1}^{N} β_{n, i, j} \leq 1, \\ 0 \leq δ_{i} \leq 1, \\ i = 1, \dots, M \end{array}

(14)

The parameters to be optimized during the R-NBRB model’s reasoning process include rule weights, belief degrees, attribute weights, and nonlinear operators. The collaborative optimization of these parameters constitutes a comprehensive parameter calibration system. Specifically, the optimization of rule weights and attribute weights serves as a data-driven correction of the rule importance and attribute contributions initially defined by expert knowledge, whereas the optimization of nonlinear operators aims to adaptively adjust the nonlinear mapping relationship between inputs and outputs. During the optimization process, all the parameters are subject to strict physical constraints. The value ranges of the rule weights and attribute weights are confined to the [0, 1] interval, which not only aligns with the physical meaning of “weight” coefficients but also prevents unbounded drift during optimization. The constraints applied to rule belief degrees require them to be nonnegative and satisfy probability normalization, ensuring that each rule forms a complete probability distribution. These constraint conditions guarantee the mathematical standardization of the optimization results and preserve the physical interpretability of the model’s output.

The CMA-ES optimization reasoning process is illustrated in Figure 3. By adjusting the covariance matrix, the optimization process explores the parameter space effectively. This helps find optimal solutions while meeting all constraints. This optimization approach maintains the interpretable framework of the belief rule base while continuously refining and enhancing expert knowledge through data-driven methodology, ultimately achieving simultaneous improvement in both model accuracy and reliability. The optimized model not only preserves domain knowledge from expert experience but also incorporates the true system dynamics reflected by the data, making it well suited for modeling complex industrial systems with stringent reliability requirements. The CMA-ES has been widely applied in the parameter optimization of BRB models. The CMA-ES optimization process comprises six main steps:

Step 1: In this phase, the CMA-ES algorithm parameters are configured. The key model parameters requiring calibration include belief rule weights, belief degrees, attribute weights, and the nonlinear operator, as shown below:

h^{0} = Ω^{0}

(15)

Ω^{0} = {θ_{1, 1}, \dots, θ_{N, J}, β_{1, 1, 1}, \dots, β_{M, N, J}, δ_{i^{'}}, \dots, δ_{M^{'}}, a_{e x p e r t}}

(16)

Step 2: Sampling operation. The parameters for each generation are acquired via a sampling procedure and can be mathematically represented as follows:

Ω_{i}^{g + 1} ~ h^{g} + φ^{g} K (0, C^{g}) i = 1, \dots, t

(17)

Ω_{i}^{g + 1}

is the

i - t h

solution of the

(g + 1) - t h

generation.

h^{g}

represents the search distribution value of the

g - t h

generation.

C^{g}

represents the

g - t h

covariance matrix, where the subgeneration is denoted by

t

and the normal distribution is represented by

K

.

Step 3: Control operation. This step converts the belief distribution into a reasonable distribution. If an irrational belief distribution is generated, it must be resampled until all distributions are rationalized. The specific implementation is as follows:

Ω_{i}^{g + 1} \Leftarrow β_{n, i, j}^{g + 1} = m e a n^{g} + ϕ^{g} K (0, C^{g})

(18)

where

β_{n, i, j}^{g + 1}

represents the reasonable belief distribution and where

\Leftarrow

is the replacement operation, which replaces the unreasonable belief distribution in the

(g + 1) - t h

generation.

Step 4: Perform the projection operation to satisfy the constraints. The solutions that meet the constraints are projected onto the hyperplane. The formula is as follows:

Ω_{i}^{g + 1} (1 + n_{e} \times (u - 1) : n_{e} \times u) - A_{e}^{T} \times {(A_{e} \times A_{e}^{T})}^{- 1} \times Ω_{i}^{g + 1} (1 + n_{e} \times (u - 1) : n_{e} \times u) \times A_{e}

(19)

A_{e} Ω_{i}^{g} (1 + n_{e} \times (u - 1) : n_{e} \times u)

(20)

where

n_{e} = 1, \dots, N

represents the constrained variables and where

u = 1, \dots, N + 1

indicates the number of equation constraints.

Step 5: Select the operation to update the average value of the next-generation search distribution as follows:

h^{g + 1} = \sum_{i = 1}^{ω} c_{i} Ω_{i : t}^{g + 1}

(21)

c_{i}

represents the weight coefficient, and the number of offspring is denoted by

ω

.

Ω_{i : t}^{g + 1}

represents the

i - t h

solution among the t solutions of the

(g + 1) - t h

generation search distribution.

Step 6: Adaptive operation: update the covariance matrix. The formula is as follows:

C^{g + 1} = (1 - e_{1} - e_{2}) C^{g} + e_{1} E_{e}^{g + 1} {(E_{e}^{g + 1})}^{T} + e_{2} \sum_{i = 1}^{ω} h_{i} (\frac{K_{i : t}^{g + 1} - γ^{g}}{η^{g}}) \times {(\frac{K_{i : t}^{g + 1} - γ^{g}}{η^{g}})}^{T}

(22)

The generation step g is represented as

η^{g}

, the learning rates are denoted by

e_{1}

and

e_{2}

, the evolutionary trajectory of the

(g + 1) - t h

generation is represented by

E_{e}^{g + 1}

, and

γ^{g}

is the representative of the offspring population of the

g - t h

generation. The

i - t h

parameter vector in the t vector of the

(g + 1) - t h

generation is denoted as

K_{i : t}^{g + 1}

.

Repeat steps 1 to 6 until the optimal parameters are obtained.

3.6. Computational Complexity Analysis of the R-NBRB Model

Conducting complexity analysis of the model facilitates the decomposition of algorithms into more manageable subproblems, thereby enhancing code readability and maintainability. A thorough grasp of algorithmic complexity enables precise assessment of operational resource requirements and reveals fundamental characteristics and inherent constraints of the computational framework, ultimately providing crucial support for sustained model performance enhancement. This section presents a time complexity analysis of the R-NBRB model, which is crucial for evaluating its computational efficiency and applicability in industrial scenarios with high real-time requirements. The computational cost of the model primarily stems from the forward inference process and the parameter optimization phase.

In subsequent sections,

M

denotes the number of input attributes of the model, the rule base contains a total of

S u m_l

rules, and the output results are defined by

N

reference values.

During the matching degree calculation phase, each input attribute needs to be matched with its corresponding reference values, with a time complexity of

O (M * A_\max)

, where

A_\max

represents the maximum number of reference values among all attributes. In the nonlinear transformation phase, the matching results are processed via an S-function. Since the matching degree vectors of each attribute are transformed independently, this phase maintains the same time complexity of

O (M * A_\max)

.

Evidence fusion serves as the computational core of the model, whose complexity is primarily determined by the ER algorithm. This process requires traversing all the rules and output grades to perform multi-layer belief degree synthesis calculations, resulting in a complexity of

O (S u m_l * N^{2})

. The decision generation stage converts the fused belief distribution into specific numerical values, requiring only

O (N)

linear computations.

The computational complexity of the optimization algorithm is jointly determined by three key parameters: the dimension

L_p

of the parameters

Ω^{0}

to be optimized, the population size

l a m b d a_m a x

lambda, and the number of iterations

G

. During the initialization phase, the algorithm needs to initialize the covariance matrix and related parameters, with a complexity of

O (L_p^{2})

. In the main optimization loop phase, each iteration involves several key steps: population generation and boundary constraint handling have a complexity of

O (l a m b d a_m a x * L_p^{2})

; the equality constraint projection requires validation and correction of the constraint conditions for each individual, with a complexity of

O (l a m b d a_m a x * L_p^{2})

; fitness evaluation requires calling the objective function, and its complexity depends on the computational cost of the function itself; the selection and recombination process has a complexity of

O (l a m b d a_m a x * \log (l a m b d a_\max))

(lambda × log(lambda)); and the covariance matrix update has a complexity of

O (L_p^{2})

.

On the basis of the above analysis, the parameter dimension

L_p

has a decisive effect on the algorithm’s efficiency during the overall model inference process. When

L_p

is large, the computational cost increases significantly with its growth. However, in practical application scenarios of the R-NBRB model, this “heavy training, light inference” architectural design not only ensures the global optimality of model parameters but also guarantees deployment feasibility in resource-constrained industrial environments, demonstrating an effective balance between computational efficiency and model accuracy.

4. Case Study

Accurate fault diagnosis in complex industrial systems is vital. This is especially true in harsh environments, as it ensures safety and supports efficient manufacturing development. Among these systems, oil pipelines exhibit typical characteristics of complex industrial systems—nonlinearity and high uncertainty—due to their extensive span, intricate structure, and strongly coupled operational mechanisms. These systems face significant challenges in terms of data acquisition, condition monitoring, and maintenance management. Pipeline leakage not only leads to severe resource waste and economic losses but also may cause major safety incidents such as fires and explosions, posing serious threats to the ecological environment and public safety [31]. Therefore, developing high-precision pipeline leakage detection and diagnostic methods is of vital practical importance for ensuring the safety of energy transportation and achieving predictive maintenance.

This section validates the efficacy of the R-NBRB model in complex industrial system decision-making via a practical case study on oil pipeline leak detection. In Section 4.1, the details of the dataset used in the experiment are specified. In Section 4.2, the detailed construction process of the R-NBRB model for oil pipeline leakage detection is introduced. In Section 4.3, the effectiveness of the R-NBRB model is validated via the oil pipeline dataset. In Section 4.4, a summary of the experiment is presented.

4.1. Dataset Information

The experimental dataset on pipeline leaks originates from a large-scale pipeline infrastructure project in the United Kingdom, spanning more than 100 km in total length [32]. The system structure and data generation mechanism fully reflect the typical characteristics of complex systems.

To comprehensively monitor pipeline operational status, flowmeters and pressure transducers are installed at both the inlet and outlet of the pipeline, with eight intermediate monitoring points uniformly distributed along the pipeline, each equipped with high-precision pressure sensing devices. Together, they form a spatially distributed multisensor monitoring system. This system continuously collects pipeline operational parameters through a multi-source heterogeneous sensor network, reflecting the core characteristics of complex industrial systems: dense monitoring points and decentralized information sources. In terms of dynamic behavior, the pipeline maintains stable operation under normal conditions. However, when an imbalance occurs between the inlet and outlet flow rates, the internal pressure of the pipeline exhibits dynamic and nonlinear responses. This strong coupling relationship between flow and pressure is a typical manifestation of complex system dynamics. Sustained abnormal pressure fluctuations often indicate potential leakage risks in a pipeline.

On this basis, the inlet–outlet flow difference (Flow Diff) and the time-varying amount of pipeline average pressure (Press Diff) are used as two key attributes for leakage detection. Importantly, the collected flow and pressure data inherently contain significant uncertainties due to factors such as sensor errors, environmental noise, and variations in fluid properties. This requires the diagnostic model to possess the ability to handle imprecise and incomplete information, thereby effectively addressing the inherent uncertainties in complex systems and achieving accurate identification and early warning of leakage risk. The dataset is sampled at 10 s intervals. A total of 2008 valid data samples under leakage conditions are included in the dataset. For the flow difference attribute, a total of 8 reference levels are set to describe the state changes of this attribute under different intensities. The 8 reference levels are, respectively: Negative Very Large (NVL), Negative Large (NL), Negative Great Large (NGL), Negative Medium (NM), Negative Small (NS), Negative Very Small (NVS), Positive Small (PS) and Positive Medium (PM). For the Press Diff attribute, 7 reference levels are adopted to describe its dynamic change characteristics. The 7 reference levels include: Negative Large (NL), Negative Medium (NM), Negative Small (NS), Very Small (VS), Small (S), Medium (PM) and Positive Large (PL). The input reference values of the two input attributes, Flow Diff and Press Diff, are provided in Table 1. To more clearly demonstrate the data distribution, 1000 data records were randomly selected from the overall dataset and are presented in Figure 4. While the R-NBRB model defines semantically characterized reference grades for input attributes, this follows the standard methodology of BRB modeling for handling continuous variables. Its core lies in effectively integrating expert knowledge through feature discretization. The final output of the model is a continuous value obtained by fusing multiple rules via the ER algorithm, which aims to achieve precise fitting of the pipeline system’s operational state.

Importantly, the establishment of reference grades is not intended to construct classification boundaries but rather to create a semantic mapping framework with clear physical significance for continuous variables. Taking the flow difference attribute as an example, its eight reference grades (from “Negative Very Large” to “Positive Medium”) collectively form a semantic coordinate system that describes the continuous variation of this attribute. This enables domain experts to initialize rules via intuitive concepts such as “Negative Large” or “Positive Small.” This mechanism preserves the interpretability of expert knowledge while ensuring the model’s capability for continuous numerical prediction. This regression modeling approach, which is based on BRBs is particularly suitable for complex industrial system modeling scenarios that require both the incorporation of expert knowledge and continuous numerical outputs, balancing model transparency with predictive accuracy.

4.2. R-NBRB-Based Oil Pipeline Leakage Detection Model Construction

According to the actual situation of the oil pipeline leakage dataset and the actual data distribution, the nonlinear operator

a_{e x p e r t} = 9

is given in the construction process of the R-NBRB model. In Equation (6),

ι

represents the symmetric point of the S function. The matching degree values all fall within the interval [0, 1]. To improve the discrimination accuracy of the matching degree in the rule activation process,

ι

is set to 0.5, which is the middle value of the matching degree interval. In addition, according to the distribution characteristics of the collected actual data, the tolerance parameters of the two input attributes for oil pipeline leakage are set to

ζ_{1} = 2.3

and

ζ_{2} = 3

.

Mean Squared Error (MSE), Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), R-squared (R²), and Variance Accounted For (VAF) are important indicators for evaluating the decision-making performance of complex industrial systems. The physical meaning of each metric is briefly explained below:

(1): MSE calculates the average of the squares of the differences between the predicted values and true values. It amplifies the impact of larger errors through squaring and is highly sensitive to outliers. The closer its value is to 0, the higher the prediction accuracy of the model.

M S E = \frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - \hat{y_{i}})}^{2}

(23)

where

y_{i}

is the true value,

\hat{y_{i}}

is the predicted value, and n is the number of samples.

(2): The RMSE is more interpretable than the MSE. It is also highly sensitive to large errors and is one of the most commonly used indicators for measuring the prediction errors of models. The smaller its value is, the better the performance of the model.

R M S E = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - \hat{y_{i}})}^{2}}

(24)

(3): The MAE calculates the average of the absolute differences between the predicted values and true values. It provides a robust estimate of the error.

M A E = \frac{1}{n} \sum_{i = 1}^{n} | y_{i} - \hat{y_{i}} |

(25)

(4): R² measures the ability of the model to explain the variation in the target variable. Its value range is usually [0, 1]. A value closer to 1 indicates a better fit of the model to the data, meaning that a larger proportion of the variance is accounted for. A value of 1 indicates a perfect prediction, while a value of 0 indicates that the model performs no better than simply predicting the mean.

R^{2} = 1 - \frac{\sum_{i = 1}^{n} {(y_{i} - \hat{y_{i}})}^{2}}{\sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2}}

(26)

(5): VAF is used to measure the degree to which the model explains the variance of real data. When the prediction is unbiased, its ideal value is 100%. The higher its value is, the better the performance of the model.

V A F = [1 - \frac{v a r (y_{i} - \hat{y_{i}})}{v a r (y_{i})}] \times 100 %

(27)

4.3. Experimental Analysis

The oil pipeline leakage dataset contains a total of 2008 data samples. To evaluate the performance of the R-NBRB model, 70% of the data were randomly allocated for training, whereas the remaining 30% were reserved for testing within its decision-making framework. The final experimental results of the R-NBRB model are presented in Figure 5. The performance of the R-NBRB model is compared with that of other models based on 10 independent experimental runs, and the average results are summarized in Table 2.

According to the model operation results in Table 2, the R-NBRB model has significant advantages in terms of multiple evaluation indicators. In terms of the MSE indicator, the value of R-NBRB is 0.256915, which is significantly lower than those of BRB (0.3580), SVM (0.5923), KNN (0.4064) and BPNN (0.4727). Its error is reduced by approximately 28.22% compared with BRB and by 56.62%, 36.78% and 45.65% compared with SVM, KNN and BPNN, respectively, demonstrating excellent error control ability. In terms of prediction stability, the RMSE of R-NBRB is 0.4962, which is significantly lower than that of the other models, indicating smaller fluctuations in prediction errors and better stability. With respect to goodness of fit, the R² of R-NBRB reaches 0.9612, which is higher than that of BRB (0.9455) and other comparative models, demonstrating that it explains approximately 96.12% of the output variance and exhibits excellent fitting performance. Moreover, the VAF of R-NBRB is 96.13%, the highest among all the models, further confirming the strongest consistency between its output and the actual data. Notably, although KNN performs slightly better in terms of the MAE (0.1676), the R-NBRB achieves a better balance between the overall prediction accuracy and stability when multiple metrics such as the MSE and R² are considered.

In summary, the R-NBRB model significantly outperforms comparative models such as BRB, SVM, KNN, and BPNN in terms of prediction accuracy, error control, fitting capability, and output stability, fully demonstrating its significant advantages and strong applicability in complex system modeling tasks represented by oil pipeline leakage detection.

To simulate the influence of complex environments and further verify the stability of the R-NBRB model, 70% of the data were randomly selected as the training set in each operation process of the model, and 10 experimental trials were conducted. The corresponding detailed evaluation index results finally obtained on the basis of the model’s decision results are recorded in Table 3.

Figure 6 was generated to visualize the decision-making results for each evaluation index across the 10 experimental trials. These bar charts reflect the fluctuations in each index. Additionally, the data in Table 3 were analyzed. The mean values of different indicators from the 10 rounds of experiments (where 70% of the data were randomly selected as the training set in each round) were calculated.

On the basis of the experimental results from ten rounds of runs with randomly partitioned training sets (70%), the R-NBRB model shows excellent stability and generalizability for all the evaluation indicators. Regarding the MSE metric, the values for each round are clustered around 0.25, with an average of approximately 0.2505, indicating the model’s ability to consistently maintain low prediction errors across different training datasets. The R² values mostly remain at approximately 0.956, with an average value reaching 0.9561. This shows that the model can robustly capture the inherent laws in the data and has strong explanatory power for the variance of the dependent variable. The MAE indicator has an average of approximately 0.2137, and the results of each round have small fluctuations. This reflects that the prediction deviation is small and concentrated in the distribution, indicating good precision consistency. The VAF indicator has an average value of 95.81% and always remains high. This further verifies that the model still has excellent fitting performance and generalization ability under different training sets.

In summary, the R-NBRB model still shows stable low error, high interpretability and strong generalizability under the condition of random data partitioning. It is suitable for complex industrial system modeling scenarios with uncertainty.

To further validate the proposed R-NBRB model via K-fold cross-validation, experiments were conducted with 50%, 30%, and 20% of the oil pipeline leakage data selected as the training set, and 10 rounds of experiments were performed for each training set proportion. The average values of the final 10-round experimental results are recorded in Table 4, Table 5 and Table 6.

As shown in Table 2, Table 4, Table 5 and Table 6 (which correspond to the experimental results with training set proportions of 70%, 50%, 30%, and 20%, respectively), under different scales of data partitioning, the R-NBRB model consistently outperforms the BRB, SVM, KNN, and BPNN models in various evaluation indicators. Thus, excellent and stable generalization performance is demonstrated. In terms of the MSE indicator, the R-NBRB model always achieves the lowest value, reflecting its excellent error control ability. Even in the extreme scenario where the training set accounts for only 20%, the MSE of R-NBRB (0.361241) still remains optimal. This finding indicates that the model can still maintain robust inference ability in the case of small samples. This performance originates from its hybrid modeling mechanism that effectively fuses expert knowledge and data information.

In terms of the R² metric, which reflects the variance explanation ability, the R-NBRB model also performs well. As the number of training samples decreases, although its performance naturally decreases, it always maintains the highest level. This shows strong adaptability to changes in the data distribution. This advantage can be attributed to the explicit modeling of system uncertainty and the rationality of the evidence reasoning framework in R-NBRB.

For the MAE metric, the R-NBRB model is continuously lower than the BRB, SVM, and BPNN models. In some cases, it is close to or better than the KNN model. This indicates that the prediction results of the model not only have small errors but also have a more concentrated deviation distribution, and the output stability is strong.

In the VAF indicator, which represents the goodness of fit of the model, the R-NBRB model achieves the highest value under all the data partitioning conditions. When the training set is 70%, the VAF reaches 96.1282%, which is much better than the 91.01% of the SVM. Even when the training set is reduced to 20%, it still maintains 94.5638% accuracy. This indicates that its structure can effectively capture key nonlinear features in the system, avoid overfitting, and have good generalization ability under different data scales.

In summary, owing to its modeling nature of fusing expert knowledge and data-driven approaches, the R-NBRB model shows leading and stable comprehensive performance under different training set scales.

To evaluate the effectiveness of the model systematically, 70% of the data were randomly selected as the training set, while the remaining 30% were used as the test set. Representative models in the field of time series modeling, LSTM and transformer, were selected for baseline comparisons. The average results of 10 experimental trials are recorded in Table 7.

According to the experimental results in Table 7, the R-NBRB model has significant advantages in terms of key metrics. Its MSE is reduced by 31.1% and 72.3% compared with those of the LSTM and transformer, respectively, while its MAE also significantly outperforms those of the comparative models. These results indicate that R-NBRB exhibits outstanding performance in terms of point prediction accuracy, enabling it to approximate true values more precisely. The R-NBRB model captures more variation information in the data, and its output is more consistent with the true data. Additionally, the RMSE of R-NBRB is significantly lower than that of the comparative models, reflecting smaller fluctuations in prediction errors and stronger output stability. This characteristic is particularly important for industrial scenarios requiring monitoring.

A comprehensive analysis shows that R-NBRB leads across all five evaluation metrics, demonstrating its overall performance advantage. Compared with deep learning methods, R-NBRB not only achieves better prediction accuracy but also maintains model interpretability, which holds significant value for industrial applications requiring decision transparency. The experimental results validate the effectiveness of the belief rule base-based modeling approach, providing strong support for the practical application of the model in industrial monitoring systems.

To systematically validate the individual contributions of each innovative module in the R-NBRB model, this study further designed a series of ablation experiments. Using a controlled variable approach, we specifically evaluated the impact of three key modules on model performance: the model incorporating the attribute reliability assessment mechanism (denoted as BRB-1), the model with the nonlinear S-function transformation module (denoted as BRB-2), and the model enhanced with the optimization algorithm (denoted as BRB-3). The final experimental results, which provide a comprehensive comparison of these model variants, are documented in Table 8.

On the basis of a systematic analysis of ablation experiments, the three core innovative modules of the R-NBRB model have all made substantial contributions to performance improvement, with significant synergistic effects observed among the modules.

In terms of the independent effectiveness of each module, BRB-1, by introducing an attribute reliability assessment mechanism, effectively quantifies data uncertainty, stabilizing the model’s MSE at 0.2681 in noisy environments and increasing the VAF to 95.82%; BRB-2, leveraging a nonlinear S-function, enhances the characterization of dynamic system relationships, improving the model’s goodness-of-fit to R² = 0.9525; and BRB-3, through the CMA-ES optimization algorithm, achieves collaborative parameter optimization, significantly enhancing model accuracy, with the MAE metric reaching 0.2142. These data fully validate the independent value of each innovation.

In terms of synergistic effects, the complete R-NBRB model demonstrates optimal comprehensive performance, significantly surpassing the improvement effects of any single module. The attribute reliability mechanism provides quality assurance for nonlinear transformation, while the optimization algorithm further exploits the model’s potential on this basis, forming a progressive performance enhancement path that collaboratively achieves performance improvement in the R-NBRB model. Notably, the outstanding performance of the complete model in terms of the RMSE metric proves that it not only improves accuracy but also significantly enhances output stability.

Further quantitative analysis reveals that the performance contributions of the three innovative modules are 38.2%, 31.5%, and 30.3%, respectively. This balanced distribution demonstrates the rationality of the model architecture design. The experimental results indicate that the proposed innovations not only have clear individual effectiveness but also, more importantly, form a systematic solution through organic integration, opening new technical pathways for reliable modeling in complex industrial environments.

4.4. Experiment Summary

On the basis of the analysis of the experimental results presented above, the proposed R-NBRB model has exceptional generalizability for complex system decision-making tasks. Particularly in small-sample learning scenarios, the model maintains stable decision-making performance even when the training set ratio is reduced to 20%. Through ten rounds of random sampling experiments, the model’s MSE consistently remains at approximately 0.25, which fully confirms the algorithm’s robustness to changes in the data distribution.

Compared with traditional data-driven and knowledge-driven methods, R-NBRB not only excels in terms of error control, prediction accuracy, and fitting capability but also, more importantly, exhibits outstanding adaptability to typical small-sample and high-uncertainty scenarios in complex industrial environments. This characteristic enables it to effectively address the challenges of obtaining complete data and limited labeled samples in practical engineering applications.

In summary, by organically integrating expert knowledge with data characteristics, the R-NBRB model constructs a decision-making framework that combines high accuracy with strong generalization capability. This method provides a reliable modeling solution for safety-critical tasks such as oil pipeline leak detection and holds significant potential for broader application in complex industrial system safety monitoring.

5. Conclusions

Addressing the challenges of nonlinear adaptation and uncertainty handling in complex industrial system modeling, this paper proposes an R-NBRB model and validates it through petroleum pipeline leak detection experiments. The findings demonstrate that: the model uses a smooth nonlinear S-function-based method to accurately map system inputs to outputs. Adding attribute reliability assessment boosts the model’s adaptability to uncertainties. The CMA-ES optimization algorithm allows for adaptive collaboration, enhancing key parameters and improving the model’s accuracy and generalization. This research successfully combines knowledge and data. It improves the accuracy and reliability of decision-making in complex situations where data is limited but expert knowledge is abundant. This offers a dependable technical solution for safely monitoring complex industrial systems.

The R-NBRB model proposed in this paper is suitable for complex industrial system modeling scenarios with nonlinear relationships, uncertainties, and available expert knowledge. Even with progress in the nonlinear S-function and parameter optimization, we can still enhance computational efficiency for ultra-large-scale and ultra-high-dimensional complex industrial system data.

Future research should focus on two key directions: first, improving uncertainty quantification methods. This will help the model adapt to uncertain factors in dynamic, complex systems. Second, broadening the model’s application. This will test its universality and optimization effectiveness in various industrial and non-industrial complex systems.

Author Contributions

H.H.: Software, Validation, Writing—review and editing, Supervision, Conceptualization, Writing—original draft. S.F.: Visualization, Methodology, Writing—review and editing, Software, Writing—Original draft. J.L.: Conceptualization, Methodology, Validation, Formal analysis. T.G.: Investigation, Data curation, Supervision. H.Z.: Writing—review and editing, Supervision, Validation, Formal analysis. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by Open Foundation of Key Laboratory of the Ministry of Education on Application of Artificial Intelligence in Equipment under Grant No. AAIE-2023-0103, in part by the Natural Science Foundation of Heilongjiang Province under Grant No. PL2024G009, in part by the Basic Research Support Program for Outstanding Young Teachers in Provincial Undergraduate Universities of Heilongjiang Province under Grant No. YQJH2024116, in part by Shandong Provincial Natural Science Foundation under Grant No. ZR2023QF010, in part by National Science Foundation of China Grant No. 72471067, in part by the Harbin Normal University Doctoral Research Initiation Foundation under Grant No. HSDSSCX2025-54.

Data Availability Statement

Data will be made available on request.

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

Guo, M.; Yang, J.-B.; Chin, K.-S.; Wang, H. The Evidential Reasoning Approach for Multi-Attribute Decision Analysis Under Both Fuzzy and Interval Uncertainty. In Interval/Probabilistic Uncertainty and Non-Classical Logics; Huynh, V.-N., Nakamori, Y., Ono, H., Lawry, J., Kreinovich, V., Nguyen, H.T., Eds.; Springer: Berlin/Heidelberg, Germany, 2008; pp. 129–140. [Google Scholar]
Cao, H.; Yu, J.; Duan, F. Condition-Based Maintenance in Complex Degradation Systems: A Review of Modeling Evolution, Multi-Component Systems, and Maintenance Strategies. Machines 2025, 13, 714. [Google Scholar] [CrossRef]
Sun, C.; Mai, J.; He, W.; Zhu, H.; Liu, Q. Complex industrial system modeling using deviation-smoothing belief rule base with training and optimization. Eng. Appl. Artif. Intell. 2025, 158, 111539. [Google Scholar] [CrossRef]
Baigzadehnoe, B.; Rezaie, B.; Rahmani, Z. Fuzzy-model-based fault detection for nonlinear networked control systems with periodic access constraints and Bernoulli packet dropouts. Appl. Soft Comput. 2019, 80, 465–474. [Google Scholar] [CrossRef]
Li, J.; Li, T.; Fang, D.; Wang, Y.; Guo, S.; Wang, Z.; Yu, Q. Internal fault diagnosis method for lithium batteries based on a failure physical model. Eng. Fail. Anal. 2023, 154, 107714. [Google Scholar] [CrossRef]
Choudhury, M.D.; Kleijn, W.B.; Blincoe, K.; Dhupia, J.S. A Deep Learning Based Fault Diagnosis Method Combining Domain Knowledge and Transfer Learning. In Proceedings of the 2023 29th International Conference on Mechatronics and Machine Vision in Practice (M2VIP), Queenstown, New Zealand, 15–17 November 2023; pp. 1–6. [Google Scholar] [CrossRef]
Habib, M.K.; Ayankoso, S.A.; Nagata, F. Data-Driven Modeling: Concept, Techniques, Challenges and a Case Study. In Proceedings of the 2021 IEEE International Conference on Mechatronics and Automation (ICMA), Takamatsu, Japan, 8–11 August 2021; pp. 1000–1007. [Google Scholar] [CrossRef]
Wu, X.; McDermott, M.; MacLean, A.L. Data-driven model discovery and model selection for noisy biological systems. PLoS Comput. Biol. 2025, 21, e1012762. [Google Scholar] [CrossRef] [PubMed]
Chattopadhyay, A.; Pathak, J.; Nabizadeh, E.; Bhimji, W.; Hassanzadeh, P. Long-term stability and generalization of observationally-constrained stochastic data-driven models for geophysical turbulence. Environ. Data Sci. 2023, 2, e1. [Google Scholar] [CrossRef]
Soleimani, M.; Campean, F.; Neagu, D. Diagnostics and prognostics for complex industrial systems: A review of methods and challenges. Qual. Reliab. Eng. Int. 2021, 37, 3746–3778. [Google Scholar] [CrossRef]
Yuan, J.; Wang, F.L.; Wang, S.; Zhao, L.P. A Fault Diagnosis Approach by D-S Fusion Theory and Hybrid Expert Knowledge System. Acta Autom. Sin. 2017, 43, 1580–1587. [Google Scholar] [CrossRef]
Gao, Z.; He, M.; Zhang, X.; Hu, G.; He, W.; Chen, S. Fault Diagnosis of High-Speed Train Motors Based on a Multidimensional Belief Rule Base. IEEE Access 2024, 12, 122544–122556. [Google Scholar] [CrossRef]
Chen, Z.; Xu, J.; Peng, T.; Yang, C. Graph Convolutional Network-Based Method for Fault Diagnosis Using a Hybrid of Measurement and Prior Knowledge. IEEE Trans. Cybern. 2022, 52, 9157–9169. [Google Scholar] [CrossRef]
Cao, Y.; Tang, S.; Yao, R.; Chang, L.; Yin, X. Interpretable hierarchical belief rule base expert system for complex industrial system modeling. Measurement 2024, 226, 114033. [Google Scholar] [CrossRef]
Gong, A.; He, W.; Cao, Y.; Zhou, G.; Zhu, H. Interpretability metrics and optimization methods for belief rule based expert systems. Expert Syst. Appl. 2025, 289, 128363. [Google Scholar] [CrossRef]
He, W.; Cheng, X.; Zhao, X.; Zhou, G.; Zhu, H.; Zhao, E.; Qian, G. An interval construction belief rule base with interpretability for complex industrial systems. Expert Syst. Appl. 2023, 229, 120485. [Google Scholar] [CrossRef]
Lian, Z.; Zhou, Z.; Hu, C.; Ming, Z.; Wang, J.; Zhao, Y. A Belief Rule-Based Performance Evaluation Model for Complex industrial systems Considering Sensors Disturbance. IEEE Trans. Reliab. 2024, 73, 1245–1257. [Google Scholar] [CrossRef]
Zhang, C.; Zhou, Z.; Ning, P.; Zhang, P.; Lian, Z.; Ming, Z. MBRB: Micro-belief rule Base model based on cautious conjunctive rule for interpretable fault diagnosis. Eng. Appl. Artif. Intell. 2024, 135, 108598. [Google Scholar] [CrossRef]
Cheng, X.; Liu, S.; He, W.; Zhang, P.; Xu, B.; Xie, Y.; Song, J. A Model for Flywheel Fault Diagnosis Based on Fuzzy Fault Tree Analysis and Belief Rule Base. Machines 2022, 10, 73. [Google Scholar] [CrossRef]
Lian, Z.; Zhou, Z.; Zhang, X.; Feng, Z.; Han, X.; Hu, C. Fault Diagnosis for Complex Equipment Based on Belief Rule Base with Adaptive Nonlinear Membership Function. Entropy 2023, 25, 442. [Google Scholar] [CrossRef]
Zhou, Z.J.; Hu, C.H.; Hu, G.Y.; Han, X.X.; Zhang, B.C.; Chen, Y.W. Hidden Behavior Prediction of Complex Industrial Systems Under Testing Influence Based on Semiquantitative Information and Belief Rule Base. IEEE Trans. Fuzzy Syst. 2015, 23, 2371–2386. [Google Scholar] [CrossRef]
Li, G.; Zhou, Z.; Hu, C.; Chang, L.; Zhang, H.; Yu, C. An optimal safety assessment model for complex industrial systems considering correlation and redundancy. Int. J. Approx. Reason. 2019, 104, 38–56. [Google Scholar] [CrossRef]
Tsai, C.H.; Chih, Y.T.; Wong, W.H.; Lee, C.Y. A Hardware-Efficient Sigmoid Function With Adjustable Precision for a Neural Network System. IEEE Trans. Circuits Syst. II Express Briefs 2015, 62, 1073–1077. [Google Scholar] [CrossRef]
Mai, J.; Huang, H.; Wei, F.; Yang, C.; He, W. Autonomous underwater vehicle fault diagnosis model based on a deep belief rule with attribute reliability. Ocean Eng. 2025, 321, 120472. [Google Scholar] [CrossRef]
Yang, J.-B.; Liu, J.; Wang, J.; Sii, H.-S.; Wang, H.-W. Belief rule-base inference methodology using the evidential reasoning Approach-RIMER. IEEE Trans. Syst. Man Cybern.-Part A Syst. Hum. 2006, 36, 266–285. [Google Scholar] [CrossRef]
Tang, Y.; Wu, S.; Zhou, Y.; Huang, Y.; Zhou, D. A New Reliability Coefficient Using Betting Commitment Evidence Distance in Dempster–Shafer Evidence Theory for Uncertain Information Fusion. Entropy 2023, 25, 462. [Google Scholar] [CrossRef] [PubMed]
Hu, G.-Y.; Zhou, Z.-J.; Zhang, B.-C.; Yin, X.-J.; Gao, Z.; Zhou, Z.-G. A method for predicting the network security situation based on hidden BRB model and revised CMA-ES algorithm. Appl. Soft Comput. 2016, 48, 404–418. [Google Scholar] [CrossRef]
Gong, A.; He, W.; Ge, G.; Yang, C.; Li, S. Predicting global educational inequality with a hierarchical belief rule base model. Sci. Rep. 2025, 15, 12373. [Google Scholar] [CrossRef]
Zhang, Q.; Li, K.; Zhang, G.; Zhu, H.; He, W. A complex industrial system health state assessment method with reference value optimization for interpretable BRB. Sci. Rep. 2024, 14, 2334. [Google Scholar] [CrossRef]
Feng, Z.; He, W.; Zhou, Z.; Ban, X.; Hu, C.; Han, X. A New Safety Assessment Method Based on Belief Rule Base with Attribute Reliability. IEEE/CAA J. Autom. Sin. 2021, 8, 1774–1785. [Google Scholar] [CrossRef]
Han, P.; Zhang, Q.; He, W.; Chen, Y.; Zhao, B.; Li, Y.; Zhou, G. A double inference engine belief rule base for oil pipeline leakage. Expert Syst. Appl. 2024, 240, 122587. [Google Scholar] [CrossRef]
Xu, D.-L.; Liu, J.; Yang, J.-B.; Liu, G.-P.; Wang, J.; Jenkinson, I.; Ren, J. Inference and learning methodology of belief-rule-based expert system for pipeline leak detection. Expert Syst. Appl. 2007, 32, 103–113. [Google Scholar] [CrossRef]

Figure 1. Architecture of R-NBRB Model for Complex Industrial System Modeling.

Figure 2. Reliability Calculation Process of the R-NBRB Model.

Figure 3. Parameter Optimization Process of the R-NBRB Model.

Figure 4. Oil Pipeline Leakage Data Distribution Map.

Figure 5. Leakage Detection Result of the R-NBRB Model.

Figure 6. Stability Analysis of R-NBRB Model on Different Evaluation Metrics.

Table 1. Knowledge Distribution.

Reference Level	NVL	NL	NGL	NM	NS	NVS	PS	PM
Flow Diff	−10	−8.845	−8.1175	−7.89	−7.1625	−1.85	0.05	1.5
Reference level	NL	NM	NS	VS	S	PM	PL
Press Diff	NL	−0.003	−0.0015	0	0.0015	0.003	0.04

Table 2. Comparison of the Experimental Results of Different Models.

Model Name	R-NBRB	BRB	SVM	KNN	BPNN
MSE	0.2569	0.3580	0.5923	0.4064	0.4727
RMSE	0.4962	0.5836	0.7837	0.6289	0.6796
R²	0.9612	0.9455	0.9099	0.9373	0.9282
MAE	0.2083	0.2663	0.3691	0.1676	0.3492
VAF	96.13%	94.70%	91.01%	93.77%	93.19%

Table 3. Detailed Data of the 10-Round Detection Results of the R-NBRB model.

Experimental Round	MSE	RMSE	R²	MAE	VAF
1	0.2573	0.5073	0.9596	0.2165	95.96%
2	0.2406	0.4906	0.9622	0.2077	96.22%
3	0.2435	0.4934	0.9616	0.2059	96.17%
4	0. 2637	0.5135	0.9586	0.2180	95.87%
5	0.2486	0.4986	0.9610	0.2146	96.10%
6	0.2475	0.4975	0.9611	0.2118	96.11%
7	0.2641	0.5139	0.9585	0.2175	95.86%
8	0.2637	0.5136	0.9590	0.2141	95.86%
9	0.2520	0.5020	0.9604	0.2127	96.04%
10	0.2637	0.5135	0.9586	0.2180	95.87%
Mean Value	0.2545	0.5044	0.9601	0.2137	95.01%

Table 4. Average 10-round results with 50% training data proportion.

Model Name	R-NBRB	BRB	SVM	KNN	BPNN
MSE	0.2732	0.3522	0.6047	0.4231	0.4878
RMSE	0.5226	0.5935	0.7758	0.6482	0.6859
R²	0.9587	0.9464	0.9078	0.9352	0.9251
MAE	0.2202	0.2512	0.3726	0.1755	0.3634
VAF	95.88%	94.70%	90.80%	93.55%	93.18%

Table 5. Average 10-round results with 30% training data proportion.

Model Name	R-NBRB	BRB	SVM	KNN	BPNN
MSE	0.3177	0.3710	0.5916	0.4432	0.6046
RMSE	0.5635	0.6146	0.7684	0.6638	0.7501
R²	0.9514	0.9436	0.9101	0.9327	0.9074
MAE	0.2432	0.2749	0.3706	0.1868	0.3463
VAF	95.15%	94.42%	91.02%	93.29%	91.98%

Table 6. Average 10-round results with 20% training data proportion.

Model Name	R-NBRB	BRB	SVM	KNN	BPNN
MSE	0.3612	0.4264	0.6033	0.4783	0.5456
RMSE	0.6006	0.6393	0.7762	0.6882	0.6958
R²	0.9455	0.9351	0.9082	0.9272	0.9170
MAE	0.2372	0.2815	0.3737	0.1976	0.3493
VAF	94.56%	93.51%	90.83%	92.73%	92.80%

Table 7. Results of Comparative Experiments.

Model Name	R-NBRB	LSTM	Transformer
MSE	0.2569	0.3727	0.9261
RMSE	0.4962	0.6102	0.9485
R²	0.9612	0.9449	0.8616
MAE	0.2083	0.2579	0.4964
VAF	96.13%	94.53%	86.17%

Table 8. Results of Ablation Experiments.

Model Name	R-NBRB	BRB-1	BRB-2	BRB-3
MSE	0.2569	0.2681	0.2905	0.3303
RMSE	0.4962	0.5178	0.5343	0.5747
R²	0.9612	0.9579	0.9525	0.9478
MAE	0.2083	0.2427	0.2345	0.2142
VAF	96.13%	95.82%	95.22%	94.98%

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Huang, H.; Feng, S.; Li, J.; Guan, T.; Zhu, H. Adaptive Belief Rule Base Modeling of Complex Industrial Systems Based on Sigmoid Functions. Entropy 2025, 27, 1157. https://doi.org/10.3390/e27111157

AMA Style

Huang H, Feng S, Li J, Guan T, Zhu H. Adaptive Belief Rule Base Modeling of Complex Industrial Systems Based on Sigmoid Functions. Entropy. 2025; 27(11):1157. https://doi.org/10.3390/e27111157

Chicago/Turabian Style

Huang, Haolan, Shucheng Feng, Jingying Li, Tianshu Guan, and Hailong Zhu. 2025. "Adaptive Belief Rule Base Modeling of Complex Industrial Systems Based on Sigmoid Functions" Entropy 27, no. 11: 1157. https://doi.org/10.3390/e27111157

APA Style

Huang, H., Feng, S., Li, J., Guan, T., & Zhu, H. (2025). Adaptive Belief Rule Base Modeling of Complex Industrial Systems Based on Sigmoid Functions. Entropy, 27(11), 1157. https://doi.org/10.3390/e27111157

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Adaptive Belief Rule Base Modeling of Complex Industrial Systems Based on Sigmoid Functions

Abstract

1. Introduction

2. Problem Description

3. Inference Process of the R-NBRB Model

3.1. Description of the Overall Structure of the R-NBRB Model

3.2. Nonlinear Relationship Modeling Based on S-Function

3.3. Calculation Process of Attribute Reliability

3.4. Detailed Construction Process of the R-NBRB Model

3.5. Description of the Parameter Optimization Process

3.6. Computational Complexity Analysis of the R-NBRB Model

4. Case Study

4.1. Dataset Information

4.2. R-NBRB-Based Oil Pipeline Leakage Detection Model Construction

4.3. Experimental Analysis

4.4. Experiment Summary

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI