Towards Dynamic Uncertain Causality Graphs for the Intelligent Diagnosis and Treatment of Hepatitis B

: Hepatitis B is a widespread epidemic in the world, but so far no single drug has been shown to kill or eliminate the Hepatitis B virus and heal people with chronic Hepatitis B virus infection. Based on comprehensive investigations to relevant characteristics of Hepatitis B, a diagnostic modelling and reasoning methodology using Dynamic Uncertain Causality Graph is proposed. The symptoms, physical signs, examinations results, medical histories, etiology, pathogenesis and other factors were included in the diagnosis model. In order to reduce the difﬁculty of building the model, a modular modeling scheme is proposed, which provides multi-perspectives and arbitrary granularity for the expression of disease causality. The chain reasoning algorithm and weighted logic operation mechanism are introduced to ensure the correctness and effectiveness of diagnostic reasoning under incomplete and uncertain information. In addition, the causal view of the potential interactions between diseases and symptoms visually shows the reasoning process in a graphical way. In the relevant model, the model of the diagnostic process and the model of the therapeutic process are symmetrical. The results show that, even with incomplete observations, the proposed methodology achieves encouraging diagnostic accuracy and effectiveness, providing a promising assistance tool for physicians in the diagnosis of Hepatitis B.


Introduction
Hepatitis B is a kind of infectious disease which spreads widely all over the world. As published by the World Gastroenterology Organization, more than 400 million people have been infected with hepatitis B, which is the third leading cause of cancer death in the world [1]. However, until now, there is no drug available for the hepatitis B virus elimination and to cure the chronic hepatitis B virus-infected patients. In response to this disease, researchers have made active efforts, aiming to improve the accuracy of diagnosis and provide more accurate treatment plans.
In recent years, with the development of computer science and artificial intelligence, it is possible to digitize medical diagnosis. Currently, the application of computer technology, in particular the artificial intelligence in medical diagnosis, has become one of the hot spots in the research society. Various decision making systems for medical diagnosis have been developed [2,3]. Moreover, computer-aided systems for clinical diagnoses have been involved in many major medical areas. These works promotes the development of medical science [4][5][6][7].
Rule reasoning method was first applied in clinical diagnosis, and has been widely used, such as MYCIN and INTERNIST-1 [8,9]. Various mixed reasoning method has also been adopted in clinical diagnosis and treatment decision making, such as method combining case-based and rule-based reasoning [10], method combining rule and neural network [11], method combining hybrid fuzzy logic and case [12], method combining neural network and hybrid fuzzy logic [13], method combining fuzzy To overcome these defects, a new model named as Dynamic Uncertain Causality Graph (DUCG) is presented [23][24][25]. DUCG has been successfully applied in the fault diagnoses of industrial systems [26] and medical diagnoses of vertigo [27].
In this paper, an intelligent diagnosis methodology for both the diagnosis and treatment recommendation of hepatitis B is proposed based on DUCG, which is applicable to model uncertain behaviours of complex systems. In order to get a better performance, the DUCG medical diagnosis method is also expanded in the following aspects: quantifying the trend of clinical data, providing a treatment plan, providing personalized treatment plans for patient resistance, and giving interpretable diagnosis results.
The remainder of this paper is organized as follows: Section 2 provides the priori knowledge of DUCG. Section 3 shows a method of the process of intelligent inference. Section 4 shows the experiment cases of the HepatitisB diagnosis and treatment. Section 5 evaluated the results of the experiment. Section 6 concludes this paper and outlines the future work. And the detailed description of DUCG can be found in the Appendix A.

Preliminaries
DUCG is a probabilistic graphical model that explicitly represents uncertain causalities among variables and performs probabilistic reasoning [24]. It is a directed graph consisting of a number of nodes (variables) and directed arcs (causalities).
The corresponding nodes (variables) are divided into several classes: A-type events are virtual events defined in DUCG to represent the random causality between the parent and child nodes.
Please refer to the Appendix A for more details about DUCG theory.

Diagnosis and Treatment of Hepatitis B
The hepatitis B virus (HBV) particle (virion) consists of an outer lipid envelope and a core composed of protein. These virions are 30-42 nm in diameter. These particles are composed of lipids and protein which constitute a part of the surface of virosomes, which are called the hepatitis B surface antigens (HBsAg) and are overproduced in the life cycle of the virus. The center of the virions is about 28 nm in diameter and is the core of the virions, which include the hepatitis B core antigen (HBcAg) and the hepatitis B e antigen (HBeAg). The core contains the viral genomic DNA and the DNA polymerase. Therefore, the hepatitis B virus has three antigen: surface antigens (HBsAg), core antigen (HBcAg) and e antigen (HBeAg). During the natural course of an infection, the presence of these three antigens in a host's serum can cause the immune response of the host, and antibodies to the hepatitis B surface antigen (anti-HBs), antibodies to the hepatitis B core antigen (anti-HBc) and antibodies to the e antigen (anti-HBe) will arise immediately afterwards. These viral antigens and antibodies produced by the host can be used as the markers of hepatitis B virus infection for the diagnosis of hepatitis B. However, it is difficult to detect hepatitis B core antigen (HBcAg) in serum by the usual tests. Therefore, the tests for the detection of hepatitis B virus infection involve serum or blood tests that usually detect HBsAg and anti-HBs, HBeAg, and anti-HBe and anti-HBc.
Under normal circumstances, acute hepatitis B infection does not need treatment, and most adults will recover automatically [28]. On the other hand, in order to reduce the risk of liver cirrhosis and liver cancer, it may be necessary to treat chronic infection. Before discussing the diagnosis of chronic hepatitis B virus infection, we must first distinguish the difference between chronic hepatitis B virus infection and chronic hepatitis B. Chronic hepatitis B virus infection refers to the status that the hepatitis B virus has been in the host's body for a long time, and the host can not clear the infection. Therefore, as long as the hepatitis B surface antigens (HBsAg) and/or the hepatitis B virus DNA (HBV DNA) in host's serum is positive for more than 6 months, it is chronic hepatitis B virus infection. According to serological, virological, biochemical tests, and other clinical and laboratory tests of HBV-infected individuals, chronic hepatitis B virus infection can be divided into chronic HBV carriers, HBeAg-positive chronic hepatitis B, HBeAg-negative chronic hepatitis B, inactivity HBsAg carriers, occult chronic hepatitis B and hepatitis B cirrhosis. Treatment is often required when an infected person has persistently elevated HBV DNA levels and serum alanine aminotransferase (a marker of liver damage) [29]. According to different genotypes and the drug, the treatment time can range from six months to one year. However, oral medications are usually required for a longer duration of treatment, most of which will last more than a year [30]. Although the existing drugs cann't clear the infection, they can prevent the virus from replicating, so as to minimize the liver damage. As of 2015, there are seven medications licensed for the treatment of hepatitis B infection in China. These include the two immune system modulators interferon alpha-2a and PEGylated interferon alpha-2a (Pegasys), and antiviral drugs entecavir (Baraclude), adefovir (Hepsera), telbivudine (Tyzeka), tenofovir (Viread) and lamivudine (Epivir).

Characteristic Analysis to Hepatitis B Diagnosis and Treatment
In the diagnosis of Hepatitis B, medical evidence used usually includes laboratory results, imaging examination results, liver biopsy, symptoms, medical signs, and basic information. First, the patient needs to have a laboratory examination. Detection of hepatitis B virus infection includes serum or blood detection, detection of virus antigens (proteins produced by the virus), or antibodies produced by the patient. Hepatitis B viral antigens and antibodies are detectable in the blood following infection as shown in Figure 1. Test results can distinguish whether the patient has hepatitis B or not. Meanwhile, the Hepatitis B virus antigens and antibodies detected in the blood can also predict the therapeutic effect of antiviral drugs and immune system regulator interferon. Once chronic hepatitis B virus infection is confirmed, the patient need to do some tests to detect and measure HBV DNA. These tests are used to assess a person's infection status and to monitor treatment [31]. Secondly, the levels of these enzymes in serum are collected in detail. The activity of enzymes in the liver is an important indicator of the liver's function. For example, serum alanine aminotransferase (ALT) levels and aspartate transaminase (AST) levels generally reflect the degree of hepatocyte damage. Thirdly, the imaging examinations carried out, such as medical ultrasound and computed tomography. Fourth, collect the results of liver biopsy to evaluate the inflammation of the liver, exclude other liver diseases and monitor treatment. Fifth, the symptoms and medical signs, such as fatigue, abdominal distension and jaundice, are also used in the diagnosis of chronic hepatitis B. Finally, some basic pieces of information are taken into consideration, e.g., sex, age, history of drinking, history of mental illness, and medications that have been taken before. In addition, seven licensed drugs and related basic medicine for treating hepatitis B infection were introduced in the construction of the knowledge base. In fact, the information used in our diagnosis methods is basically based on objective examination results and subjective descriptions of patients. Through this information, the system can diagnose hepatitis B patients in more detail at the same time, and make a treatment plan according to their personal situation.

A Hepatitis B Diagnosis and Treatment Model Based on DUCG
Hepatitis B is a widespread epidemic disease in the world, but, up to now, there is no single drug that can kill or eliminate the hepatitis B virus and cure chronic hepatitis B virus infection. In order to give a more appropriate treatment scheme, hepatitis B infection will be further diagnosed in clinical diagnosis. Chronic hepatitis B virus infection can be divided into chronic HBV carriers, HBeAg-positive chronic hepatitis B, HBeAg-negative chronic hepatitis B, inactivity HBsAg carriers, occult chronic hepatitis B, and hepatitis B cirrhosis. We can diagnose chronic hepatitis B virus infection by detecting viral antigens and antibodies in serum, and these diseases are listed in Table 1. In order to minimize drug abuse and treat patients, different treatment plans are given for different diseases and the condition of the patient, and these treatment plans are listed in Table 2. In the process of building the model, each disease is treated as a separate module, and some variables can be shared with other variables [27]. This is a modular modeling scheme, which allows a causal graph to be divided into semi-independent parts from multiple angles, while still keeping the consistency of events definitions. Therefore, all aspects of a disease can be described at any granularity level, and the difficulty of modeling complex system is greatly reduced. In addition, as we analyzed, incomplete knowledge and imprecise parameters have little influence on the accuracy of reasoning. These characteristics have brought us great flexibility in using statistical data and domain knowledge.
For example, Figures 2 and 3 demonstrate two sub-graphs of HBeAg-positive chronic hepatitis B. Figure 2. The sub-DUCG includes treatment plans of HBeAg-positive chronic hepatitis B. For the sake of simplicity, the variable symbols "B" and "X" are omitted from the system, and only their subscripts are kept. All causality and related parameters, such as event matrices (A n;i ), are decided by domain knowledge.

A Introduction to the Sub-DUCG for HBeAg-Positive Chronic Hepatitis B
Based on the prior knowledge and previous works of DUCG, after the initial diagnosis of hepatitis B, we will make a detailed distinction between hepatitis B diseases and recommend treatment options.
Take HBeAg-positive chronic hepatitis B as an example. The hepatitis B surface antigens in serum (X 10 ) is positive in patients with HBeAg-positive chronic hepatitis B. At the same time, both the hepatitis B surface e antigens in serum (X 12 ) and the hepatitis B virus DNA (X 16 ) are positive. The serum alanine aminotransferase levels (X 18 ) of the patients are abnormal. Through the above laboratory test results, we can distinguish the development degree and type of hepatitis B disease in detail. After simply distinguishing the type of hepatitis B disease, we need to recommend a treatment plan for the patient based on the condition of the hepatitis B disease and the patient's current test results. At this time, the simple diagnosis model needs further modification and addition.
To get more accurate results when we get relevant treatment plans, we need a continuous observation of some of the patient's findings. This involves the problem that the state of the X variable for each inspection result will change over time.
In order to better express and reason about many times involved in the process of patient state observation and detection, in addition to the Duck variable that we have introduced in previous work, we add a variable preprocessing process here.
We keep the patient's test results every time(t m ). For each historical record, we subtract the state value of the current X n (t m+1 ) variable from the state value of the current X n (t m ) variable, and record it as X n . For example, the status of a patient's X 18 (t 1 ) is 0 last time, and the status of X 18 (t 2 ) is 2 this time. Therefore, The positive and negative of X n can express the change trend of X n . After recording the patient's test results multiple times, we can also get the changing trend of X n within a period. X n (t 1 ), X n (t 2 ), X n (t 3 ) . . . X n (t m ) are added to obtain the sum of results in a period of time, It should be noted that X n (t) is not directly equal to the result of the t time test results, it is assigned a different state depending on the detection result. Because in addition to some variables directly as a numerical result, there are still some variables that cannot be directly described by numerical values. Here are two examples.
The status list of Jaundice(X 50 ) as shown as Table 3. The status list of ALT (X 18 ) as shown as Table 4, and the "ULN" in Table 4 means "upper limit of normal". Less than 2 × ULN (<80) 4 Greater than or equal to 2 × ULN and less than or equal to 10 × ULN (80∼400) 6 Greater than 10 × ULN (>400) For example, the weekly ALT value of a patient is 23.7, 49.5, and 115.3, respectively, and the patient has no symptoms of jaundice. The state of X 18 (t 1 ) is 0; the state of X 18 (t 2 ) is 2; the state of X 18 (t 3 ) is 4; and the status of X 50 (t 1 ), X 50 (t 2 ), and X 50 (t 3 ) are 0. During the period from t 1 to t 3 , and It should be noted that when the state of X n is singular, the singular value of X n should be subtracted from the integral calculation. The X n can express the cumulative trend of X n , preventing abnormal test results at certain times from causing erroneous diagnosis. After the concepts of X n and X n were introduced into the DUCG model, the HBeAg-positive chronic hepatitis B patients were taken as examples, a more accurate diagnosis model is as shown in Figure 4.  The definitions of each X variable are shown in the Table 5. The e antigens in serum X 18 Serum alanine aminotransferase levels X 16 Serum hepatitis B virus DNA levels X 41 Inflammatory conditions of liver tissue X 10 The hepatitis B surface antigens in serum X [58][59][60][61][62] Gender; Way of infection; Drinking history; Duration of the disease; Hepatitis viruses X 67 , X 69 Age; Medication history Antiviral therapy was used to treat HBeAg positive chronic hepatitis B. Antiviral therapy can be divided into nucleotide or nucleoside analogue therapy and interferon therapy. However, if the patient is young and the ALT value is very high (X 18 state is more than 6), the doctor will usually observe the patient for 3 to 6 months. With a probability of 1.1% to 9%, the patient will develop antibodies to eliminate the virus and no treatment is needed. This type of treatment that requires time for continuous observation is difficult to describe in previous diagnostic models. And after introducing the X n and X n , we can clearly express this tendency. After observation, the patient did not spontaneously develop HBeAg seroconversion, and the doctor would take antiviral treatment to the patient. At the same time, after introducing the X n and X n , it is convenient to express the drug resistance process in the treatment process, to find out the drug resistance as soon as possible and replace the treatment plan. We found that in the process of causal reasoning, the diagnostic reasoning graph and the treatment reasoning graph have a certain symmetry.

Verification Experiments
In order to verify the effectiveness of the diagnosis and treatment system, and demonstrate the calculation details of diagnostic reasoning algorithm, we collected clinical data from patients with chronic hepatitis B. In the validation experiment, two typical cases were used as diagnostic evidence.

Verification Experiment: Case 1
In DUCG, E j denotes the evidence indexed by j, in which the j is a serial number of positive integers. Note that E = ∏ j E j can be divided as two parts: E = E E , in which E is called the abnormal evidence, E is called the normal evidence.
Suppose that the evidence E collected from a patient is: E 9 = X 50,2 ; E 10 = X 52,2 ; E 11 = X 58,2 ; E 12 = X 59,2 ; Other nonexistent X-type variables indicate that the patient is normal to the symptom, or the related pathological features are not confirmed or determined.
The purpose of the simplification is to remove the invalid or impossible causalities and irrelevant variables based on the evidence were given. In this way, the problem is more focused on, and unnecessary details are removed. Of course, the computation scale is then reduced exponentially without losing accuracy.
By applying the simplification rules to the DUCG-based diagnostic model, the irrelevant variables in the DUCG are simplified and the meaningless causal functions are removed, so as to obtain the simplified DUCG, as shown in Figure 5.
In order to better distinguish the states of X-type variables, ellipses of different colors is used to indicate whether a symptom are normal or not: normal state 0 is represented in green; sky blue and yellow municipal initiates state 1 and state 2; orange and red respectively indicate state 4 and state 6. And state6. Any X-type variable without color in Figure 5 means an unconfirmed/unrecognizable pathological feature. Please note that the green ellipse didn't appear in this experiment, because all normal variables were eliminated in the process of simplifications. The simplification process is an essential step in DUCG diagnostic reasoning, and if some causal relationships lack evidentiary support, they are filtered during the simplification process and are no longer placed within the scope of the reasoning.
As we can see that Figure 5 contains all abnormal evidence, it is meaningful and explainable, and can vividly show the user how the root cause(B 1 in this example) influenced other evidence through causal chains and led to the eventual observed symptoms.   In DCUG, assuming that only one B-type event can be true in one case. The DUCG can be decomposed as a set of sub-DUCGs k, and the weight xi k of sub-DUCG can be calculated based on E of the sub-DUCG. Each decomposed sub-DUCG can be a further simplification. Only the sub-DUCGs which can explain E are needed. And find the B-type variable of those sub-DUCGs, put them in a hypothesis spaceS H . The member in S H called H kj . If there is only one member in S H , the diagnosis is done, because the exact result H kj is found. The event outspread operations are first carried out to generate the hypothesis space In DUCG, assuming that only one type of event can be true in one case. The event outspread operations are first carried out to generate the hypothesis space the event outspread operations are first carried out to generate the hypothesis space S H g . Limited by the length, we only take E 6 and E 13 as examples. The event outspread expressions are as follows.
Then, we get Pr{E } = 0.997288162. By ignoring all the A type events and weighing factors (r n;i /r n ) from the outspread expression of E , we get the hypothesis space: Because of the normal evidence E = null in Figure 5, we have Therefore, the modification factor between h S k,j and h S k,j is 1, and the local state probability of It should be noted that X n (t) is not directly equal to the result of t time test results, it is assigned a different state depending on the detection result. Because in addition to some variables directly as a numerical result, there are still some variables that cannot be directly described by numerical values.
Here are two examples. The status list of Jaundice(X 50 ) as shown as Table 3. The status list of ALT(X 18 ) as shown as Table 4, and the "ULN" in Table 4 means "upper limit of normal".
Similarly, with the above calculation, the probabilistic inference is performed in Figure 5. As we can see that Figure 5 is meaningful because it can account for all the evidence observed. The abnormal evidence E is outspread as E .
Thus, we have and H 2,1 = B 10,1 (H 2 indicates B 10 ) is determined as the only candidate hypothesis of Figure 5. We proceed to perform reasoning calculation in Figure 5, with B 1 being a candidate hypothesis, and we get that Pr{E} = 0.0000234. The reasoning details are omitted for brevity.
Combining all the candidate hypotheses in the meaningful sub-DUCGs, a complete hypothesis space is obtained.
The root causes of the observed symptoms have been found without absence. After modification by a weighting coefficient associated with the prior evidence probability in each sub-DUCG containing the hypothesis H k,j , the global state probabilities of H k,j , S H , h s k,j , k = 1, 2, 3, can be obtained as As a conclusion to this diagnostic case, these three hypotheses were identified as the possible reasons of observed evidence. And each hypothesis was able to justify itself. The probability of different states in turn indicate the possibility that the hypothesis is the root cause. B 10,1 and B 7,1 are considered more likely with larger state probabilities (0.5500382 for B 10,1 and 0.3458932 for B 7,1 ). Being exactly consistent with the doctor's conclusion, such a quantified result can provide physicians with effective assistance in the differential diagnosis.

Verification Experiment: Case 2
Now look at another verified calculation case. It is assumed that Case 1 will undergo further clinical examination and collect more detailed medical history, while the original DUCG and parameters remain unchanged. The evidence in case 1 after supplementing the relevant symptoms is as follows. E 1 = X 10,2 ; E 2 = X 12,2 ; E 3 = X 16,4 ; E 4 = X 18,2 ; E 5 = X 41,2 ; E 6 = X 46,2 ; E 7 = X 47,2 ; E 8 = X 48,2 ; E 9 = X 50,2 ; E 10 = X 52,2 ; E 11 = X 58,2 ; E 12 = X 59,2 ; E 13 = X 60,1 ; E 14 = X 62,2 ; E 15 = X 63,2 ; By applying the simplification rules to Figure 4, we get the simplified DUCG shown in Figure 6. Note that the other two sub-DUCGs of B 1 and B 10 are both invalid and eliminated, for containing no causal explanations to some abnormal evidence such as X 49,1 , X 52,1 , X 55,1 , X 118,1 , X 150,1 and X 154,2 . Therefore, Figure 6 becomes the only meaningful sub-DUCG in this situation. The hypothesis space obtained is where H 1 indicates B 7 . As we can see that in comparison to case 1, the more evidence observed, the smaller the hypothesis space, and the more accurate the inference result. In fact, the root cause of the symptoms can now be determined, even if no numerical calculation is needed. Nevertheless, we complete the reasoning process in order to maintain the integrity: because of H 1,1 E = E , the approximate state probability can be obtained as Since the normal evidence E = φ, the state probability of H 1,1 is h s 1,1 = h s 1,1 = 1 as expected. In conclusion, the hypothesis B 1,1 is uniquely determined as the root cause of the current hepatitis B virus. This result is consistent with the clinical conclusions confirmed by doctors. The causal reasoning process can be demonstrated graphically to the user, making the conclusions and recommendations more convincing and reliable. For example, the reasoning results are shown in case 1 prove that patients with chronic hepatitis B will have better treatment results with ADV drugs in the case of LAM resistance: continuing to use LAM treatment will not achieve the desired therapeutic effect. Therefore, drug selection is not only Related to the currently diagnosed disease. Medication history also needs to be considered. Such a process of diagnostic reasoning not only produces results but also explains why the disease was proposed and why the current treatment options were chosen. Compared to its method, such probabilistic results can be explained to doctors and patients, significantly promoting clinical decisions. Besides, the system can be used as a tool for doctors to test patients' questions and needs. Also, it can be used as an educational tool for medical students by causally mapping and visualizing certain diseases.

Empirical Evaluations
To evaluate the performance of the duck-based intelligent diagnosis and treatment of hepatitis B system, we randomly selected 120 cases of hepatitis B from the medical record database of Beijing Ditan Hospital. Sixty cases were selected for diagnosis and testing of hepatitis B, and 90 cases were selected for testing of treatment plan. The correct diagnosis was confirmed by doctors who are not involved in the system development. In the process of clinical diagnosis, being able to diagnosis based on incomplete information is very important for intelligent diagnosis methods. Therefore, part of the available medical evidence was discarded and used for experiments with incomplete information. Specifically, only part of the medical data is uploaded to the system, and there is no further test data. We divided the data into two groups, and detected cases with complete information and cases with incomplete information, and compared the diagnosis and treatment plan results with the confirmed correct cases. The results are as shown in Table 6.

Remark 1.
The reason why we select only 10 cases for each disease is because most diseases have less than 10 cases. If we increase the test cases for each disease, most of the increased cases would be for only a few common diseases, which would improperly increase the precision, because the common diseases are easy to diagnose. On the other hand, 10 cases have covered most knowledge points in the DUCG and the marginal utility decreases greatly.

Remark 2.
In the actual diagnosis process, there are cases where some of the test results are missing. For example, due to financial constraints, patients did not apply for too expensive examinations. In order to simulate these conditions in the real diagnosis process, part of the information in the case has been hidden randomly. The results obtained are divided into two cases: complete medical information and incomplete medical information. Even in the case of incomplete medical information, the correct rate of the system is still acceptable. This result proves the robustness of the system to incomplete information. The results showed that the DUCG-based system correctly diagnosed 88.3% of disease with complete information and the correct rate of the treatment plan was 95.5% with complete information.
In addition, 20 cases of non-hepatitis B were added to the test, and the system was able to diagnose 100% of non-hepatitis B patients, proving that the correct rate of the DUCG-based system for diagnosing hepatitis B disease could reach 100%. While the other methods have 74.37-98.75% correct rate when diagnosing hepatitis [32]. The comparisons are as shown in Table 7.
As a result, the DUCG-based intelligent diagnosis and treatment of hepatitis B system show shows high accuracy and excellent robustness. The DUCG-based method uses expressions with their own causal semantic to construct models, so that the system has the advantage of being faithful to the pathogenesis. Therefore, the conclusions obtained by the model are inherent and will not change with the application environment. This characteristic ensures the high accuracy of system diagnosis.
In addition, the error results in the test were analysed in detail. Part of them are because of the changing process of the disease samples, and the diagnosis given by the human doctors may be more conservative. Another part of the error results is caused by the inaccuracy of the relevant parameters. In this paper, the parameters in the system are given by domain experts, and may bring in some subjective factors. This leads to deviations in some of the results. One possible solution may be design a learning based parameter tuning method, to overcome the subjective bias in the parameters. Table 7. Comparisons of DUCG and other methods.

Method Accuracy
PNN(10XFC) [33] 91.25% LDFA-SVM [34] 96.77% SVM+SA [35] 96.25% MLP [32] 74.37% 3SVM [32] 98.75% DUCG 100% Remark 3. The DUCG method is 100% correct. The data used is a sample of 80 cases, of which 60 cases are diagnosed with hepatitis B disease and 20 cases are non-hepatitis B disease. The DUCG method correctly diagnosed 60 cases of hepatitis B disease and ruled out another 20 cases, so the correct rate here is 100%. Of course, this result is obtained from existing case samples, and some new and different cases are also in the search process to test the diagnostic accuracy of the system under extreme conditions.

Conclusions and Future Work
In this study, we developed an intelligent diagnosis and treatment of hepatitis b system based on Dung, which can support the diagnosis of hepatitis b in 7 diseases and give the best treatment plan. This system includes diagnosis and treatment models, including 67 variables and 127 causal arcs. The models use clinical data(such as signs, symptoms, examination results, environmental factors, medical history, pathophysiological knowledge, and disease-related psychosocial; for causality analysis. At the same time, a modular modeling scheme is used, which enables the causality diagram to be divided into sub-graphs from multiple angles. As a result, large and complex system model are allowed to be split, and the modeling process becomes simple.
The system experiment results show that the model has good accuracy and robustness. This indicates that the system will play a role in clinical diagnosis. With the help of link reasoning and weighted logic algorithms, the system can achieve high accuracy even if the information is incomplete. In addition, the graphic representations can directly show the pathological mechanisms and express the interpretable reasoning processes. And through the graphical representation, it is also observed that the graph of the diagnostic process and the graph of the treatment process are symmetrical. This makes the diagnosis conclusions and treatment suggestions given by the system easier to accept, and helps doctors to show more objective clinical decisions to patients. The above characteristics show that this method has extensive medical significance, especially for complicated diagnosis and treatment of hepatitis B. The contributions of this paper can be concluded as: 1. Introducing the expression method of the trend brought by the inspection data over time in the system; 2. Providing the treatment plan after the diagnosis of the disease; 3. Providing a new treatment plan immediately after the patient develops resistance to a certain drug; 4. Giving an explanation of the diagnosis and treatment plan.
Of course, the current system still has certain limitations. The methods discussed in the article are mainly for the diagnosis and treatment of hepatitis B. There is not much discussion about treatment options other than interferon. In addition, the diagnosis methods used in this article are more based on the doctor's knowledge accumulation, and there is a lack of coping methods for the existing intractable diseases that cannot be diagnosed.
As future work, we will improve the current system by integrating other diseases that cause hepatitis B. Also, the model will continue to optimize the treatment plan and try to predict the response. In addition, experiments on clinical data will continue, and test data samples will be enriched to further evaluate and improve the accuracy of the model.

Conflicts of Interest:
The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results. The corresponding nodes(variables) are divided into classes as follows. B-type variables represent the root cause variables, drawn as squares, and can only be root causes without input. Each B-type event must have a prior probability, B i is a basic event variable indexed by i. In Figure A1, B 1 represents rat, B 2 represents intruder, and B 3 represents earthquake.
X-type variables drawn as cycles represent the effect or consequence variables, must have at least one input and arbitrarily output. X i is an X-type event variable indexed by i. In Figure A1, X 4 represents infra-red, X 5 represents vibration, both X 4 and X 5 are effect variables, e.g., X 7 represents alarm, and is a consequence variable.
D-type variables are the default cause variables without input, drawn as pentagons. When a X-type variable have no reason to explain, a D-type variable are associated with the X-type variable, representing unknown causes of the X-type variable. A D-type variable can only be connected to one X-type variable, e.g., D 7 represents the default cause event variable of X 7 .
G-type variables are virtual logic gate variables. The state of G-type variable depends on the logical relationship among its parent variables (AND, OR, NOT, etc.). G i is a gate event variable indexed by i. In Figure A1, G 6 represents the logic gate event variable specifying the logic among X 4 and X 5 to X 7 , where the G-type variables are drawn as gates and the details of G i are specified in the LGS i (Logic Gate Specification i).
The A-type event encoded in a directed arc is a virtual event defined in DUCG to represent the random causality between the parent and child. The causal events A connect the parent variables and child variables. A nk;ij represents the effect of state j of parent variable V i (V ∈ {B, X, D, G}) on state k of child variable X n , given that V ij occurs, if A nk;ij occurs, then X nk occurs; otherwise, X nk does not occur.
In the event A nk;ij , ";" is used to separate the child variable (subscripts before) and the parent variable (subscripts after), "," is used to separate the variable index (subscripts before) and the variable state (subscripts after), and sometimes "," can be ignored. Usually, 0 indexes the normal state, for example, in Figure A1, the definitions of {B, X, D}-type events are follows. The probability of independent random event A nk;ij is a nk;ij ≡ Pr{A nk;ij }. A nk;ij is a member of the matrix A n;i . Define F n;i ≡ (r n;i /r n )A n;i . F n;i represents the red directed arcs among variables in DUCG. It is a A n;i with a weight coefficient. r n;i > 0, and it represents the strength of the causal relationship between the parent variable V i and child variable X n , r n ≡ ∑ i r n;i . F n;i is actually an event matrix with F nk;ij as its members.
Define f nk;ij = Pr{F nk;ij } = (r n;i /r n ) Pr{A nk;ij } = (r n;i /r n )a nk;ij . Normal state 0 does not affect any abnormal state and it can not be caused by any abnormal state. It means that, in DUCG, the knowledge representation and casual inference can be done without specifying the cause or result of X n;0 .
Probability calculation is an important part of DUCG. Posterior probability calculation of DUCG needs to expand the event expression. During the expanding, the logic operations such as AND, OR, XOR, absorption, and exclusion are used to simplify the expressions. The algorithm to expand more than one event ANDs (multiplied) together is presented in [36]. The formula of basic event expression is as follows.
In DUCG, assume V i , V ∈ {B, X, G, D}, are the parent variables of X n , the expression formula of event X nk is where j expresses the state of V i , r n;i represents the causal relationship intensity between X n and V i , and r n = ∑ i r n;i . A nk;ij is a random event that V ij affects X nk . Equation (A1) can be applied repeatedly, until only B-type or D-type variables remaining. Therefore, this logical expansion can be for any X variables. In DUCG, the lower case letters represent the probabilities of the corresponding upper case letter events. Replace the upper case letters with the lower case letters in Equation (A1), we have X nk = ∑ i f nk;i v i = ∑ i (r n;i /r n ) a nk;i v i = ∑ i;j (r n;i /r n ) a nk;ij v ij . (A2) In addition to Figure A1, the BX-type variable is used in the clinical diagnosis DUCG, which is a special X-type variable and its state is to be diagnosed just like B-type events.