1. Introduction
Over recent decades, human reliability is becoming more significant to the safety and reliability of complex engineering systems [
1,
2]. Human reliability analysis (HRA) is a systematic methodology used to quantify the influence of human errors on human–machine systems [
3,
4]. It can not only realistically assess human errors, but also provide a fundamental platform for testing previously overlooked error mechanisms [
5,
6]. The cognitive reliability and error analysis method (CREAM), as one of the second-generation HRA methods, can quantify the impact of context environment on the behavioral reliability of personnel [
7]. The CREAM can estimate human errors, determine their error mechanisms based on human activities, and emphasize the importance of the working environment on human reliability [
8,
9]. In recent years, the CREAM has been applied to a variety of fields to improve the safety and reliability of socio-technical systems, including the building construction [
8], the nuclear power plant [
10], the offshore platform operation [
11], and the crude oil cargo discharging operation [
12].
In the CREAM, the qualitative opinions from experts are converted into quantitative human failure analysis results for the human error probability (HEP) estimation [
13,
14,
15]. This method allows for the retrospective analysis of historical events as well as a prospective analysis of high-risk systems. Normally, it provides an approximation analysis and determines the error rate intervals of human error events via four control modes due to the lack of sufficient failure data [
16,
17]. However, as indicated by many researchers, the traditional CREAM has several shortcomings in practical applications [
10,
15,
18]. For example, the judgment criteria for experts’ assessment are vague; all common performance conditions (CPCs) are treated as the same weight, and their different impacts on human performance reliability are not considered. Besides, failure rate intervals of HFP values obtained by the traditional CREAM are unacceptably wide and cannot be used for the initial screening of human error events [
19].
Normally, multiple experts are involved in the state assessments of human tasks in solving HRA problems [
8,
12]. Nevertheless, due to the increasing complexity of real-world human error events, the numerical values adopted in the traditional CREAM are insufficient to express experts’ uncertain assessments in the real world [
16,
20]. Moreover, domain experts prefer to express their judgments by using linguistic terms because they are convenient to use [
15]. To effectively capture the linguistic information of decision makers, the probabilistic linguistic term sets (PLTSs) were proposed by Pang et al. [
21] by describing an alternative with several linguistic terms and assigning different probability values to them. Compared to other linguistic computing methods, the PLTSs can represent decision-makers’ assessments much closer to reality and avoid the loss of original linguistic information to the greatest extent [
22,
23]. In recent years, the PLTSs have attracted widespread attention from researchers and have been employed to solve various decision-making problems, such as knowledge representation [
24], online product ranking [
25], occupational health risk assessment [
26], film classification [
27], and drug value assessment [
28].
As a multidisciplinary team activity, the HRA is mainly based on experts’ experience and knowledge to drive HEP estimations [
10,
13]. In some situations, the task state assessments given by experts may conflict because they have different educational backgrounds, knowledge structures, and understanding of HRA problems [
5]. Thus, a consensus-reaching process should be used to achieve a solution with a high level of consensus and ensure the effective human error quantification of systems [
15]. Besides, experts may exhibit non-cooperative behaviors such as making only minor modifications in the consensus-based group HRA problems. These noncooperative behaviors will lead to high intra-group conflicts and low consensus efficiency. To derive a consensual solution, the minimum conflict consensus model (MCCM) was proposed in [
29] by considering the noncooperative behaviors of decision-makers. It can coordinate different assessments provided by decision-makers, alleviate the conflicts among decision-makers, and provide an effective consensus-reaching mechanism [
30]. In MCCM, only the decision-makers’ information with a low consensus level will be adjusted, which can retain the original assessments of decision-makers to the greatest extent. Thus, it is significant to use the MCCM to promote expert consensus and obtain acceptable HEP estimations in the CREAM.
Taking advantage of the PLTSs and the MCCM, the paper aims to develop a new integrated CREAM to estimate the HEPs in human error activities. This paper provided the following valuable contributions to the current HRA methods: First, the PLTSs are introduced to represent the task state assessment information of experts accurately by taking the ambiguity and hesitation of task state assessment information into account. Second, to minimize experts’ conflict task state assessments, the MCCM is employed to assist individual experts in reaching a consensus. Third, the entropy weighting method is introduced in the CREAM model to obtain the relative weights of CPCs. In addition, for the HEP estimation, the CPC effect indexes are adopted to measure how CPCs influence the human activities’ performance reliability quantitatively. Finally, the feasibility and practicality of the proposed CREAM are verified by an example of the polymerase chain reaction (PCR) detector operation.
The rest of the paper is structured as follows.
Section 2 provides an overview of the relevant studies on the CREAM.
Section 3 is concerned with the basic theories associated with the PLTSs.
Section 4 proposes a new CREAM by integrating the PLTSs with the MCCM. In
Section 5, a practical case is presented to demonstrate the applicability and effectiveness of the CREAM proposed in this paper.
Section 6 concludes this paper and makes recommendations for future research.
2. Literature Review
In the literature, a variety of improved CREAMs have been developed to handle the uncertain task state assessment information in HRA. For example, Sezer et al. [
12] suggested a modified CREAM for the quantitative analysis of human errors, in which fuzzy sets were used to manage the uncertainty and ambiguity of CPCs. Shi et al. [
15] proposed an improved CREAM based on linguistic D numbers with the decision-making trial and evaluation laboratory-based analytic network process (DANP) approach to assess human factor reliability. Li et al. [
8] combined CREAM with fuzzy theory and Bayesian network to transform experts’ fuzzy task state assessments into conditional probabilities and calculate the HEPs of building construction work at height. Elidolu et al. [
31] designed a fuzzy bow-tie CREAM to quantify failure probabilities of explosion and fire accidents on tanker vessels. Abbassinia et al. [
32] integrated Bayesian networks and fuzzy CREAM to put forward a dynamic model to estimate the HEP in emergencies. Ahn and Kurt [
33] constructed a CREAM by integrating Bayesian networks and evidential reasoning methods to analyze human reliability in emergency response to engine room fires on ships. Yu et al. [
11] improved the traditional CREAM through Z-number-based grey relation analysis (GRA) and Z-number-based best-worst method. Besides, Ung [
34] utilized a fuzzy Bayesian network-based CREAM to handle the uncertainty of experts’ task state evaluations in oil tanker collision tasks and Lee et al. [
35] applied a Bayesian network-based fuzzy CREAM to address the uncertainty in the collected data for fishing vessels.
Recently, it has become a trend to quantify the weights of CPCs in solving complex HRA problems. For instance, Bafandegan Emroozi et al. [
18] developed a CREAM that takes into consideration the costs associated with enhancing CPCs and assigns weights to CPCs with the Bayesian best-worst method. Zhang et al. [
36] proposed a modified CREAM to estimate the HEP in advanced control rooms and computed the correlative and important weights of CPCs for different cognitive functions. Chen et al. [
37] introduced an extended CREAM model for the high-speed train operation, in which the weights of CPCs were determined by analytic network process (ANP), and the uncertain CPC information was expressed via interval type-2 fuzzy sets. Chen et al. [
38] adopted a triangular fuzzy number to quantify the fuzzy semantics of CPCs and constructed a Bayesian network model to compute the weights of CPCs for the manned submersible diving process. Wang et al. [
39] provided a weighted fuzzy CREAM to estimate HEP in subway construction, in which the multiple correlation analysis and evidence theory were used to compute the weights of CPCs. Bafandegan Emroozi and Fakoor [
40] proposed a modified CREAM for financial services, employed the DANP to weight CPCs, and examined various impacts of work condition factors on CPCs. Lin et al. [
10] presented an improved CREAM for quantifying human reliability and applied the hesitant fuzzy matrix to determine the weights of CPCs on the steam generator tubes of nuclear power plants. Yao et al. [
20] used the fuzzy CREAM for HRA in the digital main control room of nuclear power plants and determined the CPC weights by the analytic hierarchy process (AHP) method.
The above literature review indicated that many approaches have been proposed to address the uncertainty of task state assessments elicited from experts and obtain the CPC weights in CREAM. Nevertheless, there are some limitations associated with the CREAMs in literature. First, due to the complexities of practical HRA problems, the current methods are ineffective in capturing the probabilistic linguistic task state judgments of experts. Second, conflict task state assessment information is not considered in previous studies, but a consensus-reaching process is needed for HRA because of the different educational backgrounds and experiences of experts. To fill these research gaps, this paper develops an improved CREAM for HRA by integrating the PLTSs and the MCCM for describing experts’ uncertain task state assessments and alleviating the conflicts among experts to reach a consensus.
4. The Proposed CREAM
This section proposes a new CREAM by combining the PLTSs and the MCCM to calculate the HEPs of different human tasks. The proposed CREAM involves four stages: (1) aggregating the individual state assessments of experts; (2) achieving expert consensus via the MCCM; (3) calculating the weights of CPCs with the entropy weighting method; (4) identifying the HEPs of tasks according to the CPC effect indexes.
For an HRA problem, l experts are engaged to assess the states of m tasks regarding n CPCs . Let be the probabilistic linguistic state assessment matrix of the kth expert, where is the PLTS denotes the state assessment of expert Ek. Next, a step-by-step procedure of the proposed CREAM is explained.
Stage 1: Aggregate the individual state assessments of experts.
In this stage, the individual state assessments of experts are aggregated to derive the collective probabilistic linguistic state assessments of tasks.
Step 1: Calculate the conflict degrees among experts.
The conflict measurement method is proposed by Yuan et al. [
29] based on task conflict and relationship conflict. The conflict degree
γkh between expert
Ek and expert
Eh is calculated by
where
tkh is the trust degree of expert
Ek on expert
Eh obtained by the expert
Ek. Note that
tkh satisfies
;
indicates that expert
Ek completely trusts expert
Eh,
means that expert
Ek does not trust expert
Eh at all.
Step 2: Compute the individual conflict degree of each expert.
The conflict degree
γk of expert
Ek is computed by
Step 3: Obtain the relative weights of experts.
The weight
λk of expert
Ek can be calculated by
Step 4: Determine the collective probabilistic linguistic state assessment matrix.
The collective probabilistic linguistic state assessment matrix
can be obtained by
Stage 2: Achieve expert consensus via the MCCM.
In this stage, the MCCM with the budget constraint [
29] is used to modify experts’ judgments and reach the consensus process.
Step 5: Measure experts’ individual consensus level.
The individual consensus level
Clk of expert
Ek is computed by
where
.
The consensus level of experts is acceptable if
, where
is the consensus threshold. Note that the consensus threshold
can be determined by experts based on their experiences directly or by the methods suggested in previous studies [
5,
45,
46]. If
, a consensus-reaching process is implemented to arrive at a consensus regarding experts’ state assessments.
Step 6: Obtain the final adjusted state assessment matrix .
A consensus budget
B is determined and the MCCM with the budget constraint is constructed as:
where
z is the total conflict degree of experts,
is the adjusted individual consensus level of expert
Ek,
ck is the unit adjustment cost of expert
Ek,
is the optimally adjusted state assessment matrix of expert
Ek. By solving model (19), the final adjusted state assessment matrix
can be obtained.
Stage 3: Calculate the weights of CPCs by using the entropy weighting method.
In this stage, the entropy weighting method [
47] is used to derive the relative weights of CPCs.
Step 7: Establish the probabilistic linguistic entropy matrix E.
Based on the final adjusted state assessment matrix
, the probabilistic linguistic entropy matrix
can be determined as
Step 8: Calculate the normalized probabilistic linguistic entropy matrix .
The normalized probabilistic linguistic entropy matrix
is obtained by
Step 9: Calculate the importance weights of CPCs.
Finally, the importance weights of CPCs
can be calculated by
Stage 4: Compute the HEPs of tasks based on the CPC effect indexes.
The CPCs present a systematic framework to describe the expected performance conditions in human error problems. The CPC effect indexes assess the overall influence of CPCs quantitatively on human reliability. In this stage, the CPC effect indexes [
24] are used for HEP estimation.
Step 10: Compute the CPC effect indexes of tasks.
According to the final adjusted state assessment matrix
and the weights of CPCs
, the CPC effect index
of the task
is computed by
where
is the middle linguistic term of
S, which implies that the CPCs have no significant effect on human reliability of human error activities.
Step 11: Calculate the HEPs of different tasks.
A natural logarithm function can represent the correlation between HEPs and CPC effect indexes [
48]. For the task
, the
can be represented as:
where the constant coefficients
HEP0 and
are calculated by the upper and lower bounds of the CPC effect indexes and HEP estimations. Based on the correspondence between control modes and the probability of action failure, it is appropriate to let
and
[
49]. In this study,
and
, and thus the HEP for the task
can be represented as:
5. Case Study
In this section, an example of the PCR detector operation process [
50] is provided to illustrate the effectiveness of the CREAM being proposed in this paper.
5.1. Implementation and Results
As the demand for healthcare services increases, the use of PCR detectors has grown rapidly in recent years. The PCR detector operation process involves four tasks: (1) selecting an appropriate condition for the sample (T1); (2) setting up the condition and starting preincubation (T2); (3) changing the condition according to specific requirements (T3); (4) diagnosing and responding to the results (T4). PCR detectors can reduce the burden on healthcare staff and significantly improve diagnostic efficiency and accuracy. Due to the particularity of PCR tests, the reliability of PCR detectors can be affected by many factors, such as working conditions, healthcare staff’s states, and training and experience of healthcare staff. Thus, it is vital to implement the proposed CREAM to qualify the reliability of the PCR detector operation process.
In this case example, nine CPCs are considered in the PCR detector operation process. Five experts from different departments formed an HRA expert panel to assess the states of the tasks via an online questionnaire system. These experts include a medical laboratory technologist, a pathologist, an infectious disease physician, a clinical researcher, and a molecular biologist. The CPCs and their corresponding linguistic terms of the operation process are listed in
Table 1. For the expert
E1, the probabilistic linguistic state assessment matrix
is obtained as shown in
Table 2. The trust degree matrix of experts is exhibited as:
Next, the proposed CREAM is utilized for estimating the HEPs of tasks in the PCR detector operation.
Step 1: Via Equation (14), the conflict degrees among experts are calculated and presented in
Table 3.
Step 2: Through Equation (15), the individual conflict degrees of experts are acquired as: .
Step 3: Via Equation (16), the weights of experts are yielded as: .
Step 4: By Equation (17), the collective probabilistic linguistic state assessment matrix
is obtained as shown in
Table 4.
Step 5: Based on Equation (18), the individual consensus levels of experts are derived as: . The consensus threshold is set as in this example, which is determined by the moderator to ensure obtaining results with a high level of consensus. As the consensus level of the fifth expert is smaller than the threshold , Step 6 is executed.
Step 6: Based on the MCCM (19), the unit adjustment costs of experts are obtained as:
and the moderator sets the consensus budget constraint as
. The optimally adjusted state assessment matrix of the fifth expert
is obtained by adjusting six elements of the task state assessment matrix
. The final adjusted state assessment matrix
is determined as shown in
Table 5. Therefore, the experts’ individual consensus levels are derived as:
, satisfying
.
Step 7: Using Equation (20), the probabilistic linguistic entropy matrix
is determined as listed in
Table 6.
Step 8: Applying Equation (21), the normalized probabilistic linguistic entropy matrix
is displayed in
Table 7.
Step 9: Based on Equation (22), the weights of CPCs are calculated as: , , , , , , , , .
Step 10: Via Equation (23), the effect indexes of the four tasks are yielded as: .
Step 11: By Equations (24) and (25), the HEPs for the four tasks are obtained as:
5.2. Comparison Analysis
To demonstrate the effectiveness of the proposed CREAM, this section carries out a comparative analysis based on the above case study. In the traditional CREAM method, the context influence index CREAM (CII-CREAM) [
51], the evidential reasoning CREAM (ER-CREAM) [
52], the hesitant fuzzy matrix CREAM (HFM-CREAM) [
10], and the modified CREAM [
48] are chosen in this case study.
Table 8 presents the priority of the four tasks in the PCR detector operation process based on the listed approaches.
From
Table 8, it can be seen that
T3 ranks third via the proposed CREAM, the HFM-CREAM, the ER-CREAM, and the modified CREAM. Besides, except for the CII-CREAM, the other four methods place
T4 in second place. Furthermore, the priority of tasks obtained by the proposed CREAM is identical to the results determined by the HFM-CREAM, the ER-CREAM, and the modified CREAM. These results imply the availability and practicality of the proposed CREAM.
However,
Table 8 shows that the ranking results of the task for the PCR detector operation process obtained by the traditional CREAM, the CII-CREAM, and the proposed CREAM are not the same. Inconsistent outcomes may be attributable to the following reasons: First, the numerical information utilized in the traditional CREAM and the CII-CREAM cannot express task state assessments accurately. As a result, the original state assessment information given by experts may be missed in HRA. Second, the importance weights of CPCs are considered to be the same. This may not be in line with practical situations for a real application of CREAM. Furthermore, the HFP values obtained by the traditional CREAM are intervals that are unacceptably wide and cannot be used for the initial screening of human error events.
Moreover, the HEP ranking outcomes derived from the proposed CREAM and the CII-CREAM are different. Specifically, T1 is situated first by the proposed CREAM but is in the fourth position with the CII-CREAM. In addition, T2 occupies the fourth position by using the proposed CREAM. But by the CII-CREAM, T2 stands in the first place. These inconsistent results may be explained by the following points: First, the PLTSs are not used in the CII-CREAM, which cannot express the uncertain assessments of experts accurately and reflect the probabilistic information effectively. Second, the CPCs are treated as the same weight in the CII-CREAM, and the interactions between CPCs are not considered. Third, the CII-CREAM uses the performance influence index to quantify the overall impact of CPCs in solving HRA problems, which cannot realize the continuity of HEPs.
The comparison of different approaches is further analyzed via the aggregation technique [
15] to validate the proposed CREAM model. The optimal method is expected to produce results that closely match the aggregate ranking.
Table 9 presents the HEP ranking matrix for four operational tasks, where the entries indicate the frequency of each task assigned to different rankings. Subsequently,
Table 10 displays the smoothing of task assignments on rankings based on the results computed through the aggregation technique. From
Table 10, the linear programming model is constructed to determine the optimal rankings:
where
Nik is equal to 0 or 1 for all
i and
k. By solving the model above, the optimal ranking of four operational tasks in the case study is determined as
, which is identical to the ranking results calculated by the proposed CREAM model. Thus, the proposed CREAM model provides a more logical and credible HEP ranking in the specified application.
From the analyses above, it can be concluded that a more accurate and reliable HEP ranking result for the PCR detector operation process can be obtained by employing the CREAM proposed in this paper. Compared with extant approaches, the proposed CREAM has the following advantages: First, via the PLTSs, the proposed CREAM can consider the probability of linguistic term sets, accurately describe the uncertain linguistic evaluations of experts, and retain the original assessment information as much as possible. Second, with the MCCM, the proposed CREAM can minimize conflicts among experts and assist experts with different opinions to achieve a consensus. As a result, the proposed CREAM can address the limitations associated with the traditional CREAM and provide a more reasonable estimation of the HEPs.