Next Article in Journal
Predictive Migration Performance in Vehicular Edge Computing Environments
Next Article in Special Issue
Data-Dependent Feature Extraction Method Based on Non-Negative Matrix Factorization for Weakly Supervised Domestic Sound Event Detection
Previous Article in Journal
A Tool for the Rapid Seismic Assessment of Historic Masonry Structures Based on Limit Analysis Optimisation and Rocking Dynamics
Previous Article in Special Issue
Applying Knowledge Inference on Event-Conjunction for Automatic Control in Smart Building
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Comparison of Dengue Predictive Models Developed Using Artificial Neural Network and Discriminant Analysis with Small Dataset

1
Department of Mathematics, Faculty of Mathematics and Natural Sciences, Universitas Indonesia, Kampus Baru UI, Depok 16424, Indonesia
2
Department of Mathematics, Faculty of Science and Information Technology, Institut Sains dan Teknologi Nasional, Jl.Moh Kahfi II Srengseng Sawah Jagakarsa, Jakarta Selatan 12640, Indonesia
3
Department of Biology, Faculty of Mathematics and Natural Sciences, Universitas Indonesia, Kampus Baru UI, Depok 16424, Indonesia
4
Department of Microbiology, Faculty of Medicine, Universitas Indonesia, Jl. Salemba Raya no. 5, Kota Jakarta Pusat, Daerah Khusus Ibu Kota Jakarta 10430, Indonesia
*
Author to whom correspondence should be addressed.
Appl. Sci. 2021, 11(3), 943; https://doi.org/10.3390/app11030943
Submission received: 8 December 2020 / Revised: 29 December 2020 / Accepted: 30 December 2020 / Published: 21 January 2021
(This article belongs to the Special Issue Machine Learning Methods with Noisy, Incomplete or Small Datasets)

Abstract

:
In Indonesia, dengue has become one of the hyperendemic diseases. Dengue consists of three clinical phases—febrile phase, critical phase, and recovery phase. Many patients have died in the critical phase due to the lack of proper and timely treatment. Therefore, we developed models that can predict the severity level of dengue based on the laboratory test results of the corresponding patients using Artificial Neural Network (ANN) and Discriminant Analysis (DA). In developing the models, we used a very small dataset. It is shown that ANN models developed using logistic and hyperbolic tangent activation function with 70% training data yielded the highest accuracy (90.91%), sensitivity (91.11%), and specificity (95.51%). This is the proposed model in this research. The proposed model will be able to help physicians in predicting the severity level of dengue patients before entering the critical phase. Furthermore, it will ease physicians in treating dengue patients early, so fatal cases or deaths can be avoided.

1. Introduction

Dengue is an acute febrile disease caused by dengue virus (DENV). Commonly, dengue incidences happen in tropical and subtropical countries, such as South America, Southeast Asia, etc. [1]. Dengue incidences usually occur in the rainy season, and happen in urban and suburban areas. Based on a study that analyzed 130 countries, there were around 9221 dengue deaths per year from 1990 to 2013, with the lowest of 8277 in 1992, and the highest of 11,302 in 2010 [2]. Dengue had made Indonesia suffered the most significant economic loss in Southeast Asia. The average annual economic burden of dengue in Indonesia was approximately USD 381.5 million. Indonesia has the highest dengue infection rate in Southeast Asia and the second-highest dengue infection rate in the world after Brazil [3]. In 2017, there were 59,047 Dengue Hemorrhagic Fever (DHF) cases and 444 DHF-associated deaths in Indonesia, with 0.75% case fatality rate and 22.55 incidence rate per 100,000 person-years [4]. In 2018, the national incidence of dengue was 24.75 cases per 100,000 population, resulting in 467 deaths [5].
DENV is a member of the family Flaviviridae and genus Flavivirus [6]. DENV is transferred by Aedes aegypti and Aedes albopictus female mosquitoes [7]. Those female mosquitoes consume blood as their regular meal to mature their eggs [8]. They fulfill their need for blood by biting humans. The incubation period of DENV is around 4 to 10 days. After the incubation period ends, an infected mosquito can transfer DENV in its lifetime [9]. There are four serotypes of DENV, which are DENV-1, 2, 3, and 4. A DENV-infected person can be infected by one serotype of DENV or more [10].
Dengue consists of three clinical phases, which are febrile phase (occurs on the first until the third day of fever), critical phase (occurs on the fourth until sixth day of fever), and recovery phase (occurs on the seventh day of fever or afterwards). The common clinical symptoms of dengue are as follows: during the febrile phase, the symptoms are high fever, headache, nausea, myalgia, arthralgia, malaise, retro-orbital pain, and vomiting [11]. During the critical phase, the symptoms are thrombocytopenia, leukocytopenia, and plasma leakage, which is clinically manifested by hemoconcentration, pleural effusion, and/or ascites. Patients may also experience severe bleeding and shock [11]. During the recovery phase, the extravasated fluid is re-absorbed into the intravascular compartment. Some patients may experience an erythematous rash. The severity level of dengue consists of Dengue Fever (DF), Dengue Hemorrhagic Fever (DHF), and Dengue Shock Syndrome (DSS) [12]. DF is commonly classified as dengue, while DHF and DSS are commonly classified as severe dengue. The main difference between dengue and severe dengue is, patients who suffer severe dengue experience plasma leakage, while patients who suffer dengue do not. In this research, we only considered level DF and DHF. DHF itself was divided into two different levels, which were DHF grade 1 and DHF grade 2.
The gold standard to confirm a DENV infection are Reverse Transcription-Polymerase Chain Reaction (RT-PCR) test, culture of DENV, hemagglutination inhibition test, laboratory test, and tourniquet test. A suspected person can go through a laboratory test that measures hematocrit, thrombocyte (platelet count), and white blood cell (WBC). DENV infection will decrease in platelet count below 100,000 per μL between the third to eighth day onset of fever, and an increase in hematocrit of 20% or more. A positive tourniquet test also indicates that the corresponding patient might suffer dengue. Laboratory test parameters that we analyzed consist of hematocrit, hemoglobin, platelet count, WBC, monocyte, lymphocyte, and neutrophil.
In this research, we developed models that can predict the severity level of a dengue patient, based on the values of the seven laboratory test parameters. In other words, the laboratory test parameters were used as predictors to predict the severity level of a dengue patient. We used two methods, which were Artificial Neural Network (ANN) and Discriminant Analysis (DA). We decided to use ANN because ANN is one of machine learning methods that imitates the neuron structure of a human brain that has been used in many fields, including medical diagnosis prediction. Meanwhile, we decided to use DA because DA is one of dependency statistical analysis techniques in Multivariate Analysis that is used to classify an object into one of the statistically independent groups or categories. This is aligned with our purpose of this research, which was to classify data of dengue patients into one of the statistically independent severity levels of dengue.
We only used patients’ data on the third day onset of fever. Because on the third day, dengue patients are going to enter the critical phase. We expected that this model can help physicians to treat dengue patients early before the patients enter the critical phase, so they can have timely treatment and fatal cases or deaths can be avoided. We also used the data on the third day because there haven’t been any previous researches that used data on the third day.
The objectives of this research were to develop models that can predict the severity level of dengue using Artificial Neural Network (ANN) and Discriminant Analysis (DA), to evaluate the performances of the models, and to conclude which model has the best performance. A model with the best performance would be the proposed predictive model in this research. The predictive model will assist physicians in predicting the severity level of dengue.

2. Research Significance

Many patients have died in the critical phase due to the lack of proper and timely treatment. Therefore, we developed models that can predict the severity level of dengue based on the laboratory test results of the corresponding patients using ANN and DA. Our proposed predictive model—the one with the highest accuracy—will be able to help physicians in predicting the severity level of dengue patients before entering the critical phase. Furthermore, it will ease physicians in treating dengue patients early. So, dengue patients can receive proper and timely treatment, and fatal cases or deaths can be avoided.

3. Related Works

Abdiel E. Laureano-Rosario et al. [13] utilized ANN, which was trained with genetic algorithm to predict dengue fever outbreak in Puerto Rico and some areas in the coast of Mexico. They concluded that the model they developed using ANN had a good predictive ability.
Jorge D. Mello-Román et al. [14] compared two machine learning methods, which were Artificial Neural Networks multilayer perceptron (ANN-MLP) and Support Vector Machine (SVM) as the tools to assist medical diagnosis. ANN-MLP produced a better result with an average of 96% accuracy, 96% sensitivity, and 97% specificity. In conclusion, ANN-MLP could be used as a classifier to diagnose dengue infection with high accuracy, sensitivity, and specificity.
Oswaldo Santos Baquero et al. [15] compared Seasonal Autoregressive Integrated Moving Average (SARIMA), Generalized Additive Models (GAM), Artificial Neural Networks (ANN), naïve model, and ensemble model to predict dengue cases in São Paulo for one month ahead.
Tanujit Chakraborty et al. [16] developed a novel hybrid model, which was a combination of Autoregressive Integrated Moving Average (ARIMA) and Neural Network Autoregressive (NNAR). The model was used to analyze time series dengue data of three dengue endemic regions, which were San Juan, Iquitos, and Filipina. They concluded that the proposed hybrid model was easy to interpret, had an excellent performance in forecasting dengue epidemic for three dengue time series data from different regions, and had better forecasting accuracy compared to the other methods used in previous researches, such as traditional methods or other hybrid methods.
Siriyasatien et al. [17] developed models that enable forecasting of outbreaks of dengue, giving medical professionals the opportunity to develop plans for handling the outbreak, well in advance. They utilized several methods, one of them is ANN. Based on their results, the ANN model had two advantages: easy to be used in incremental learning and can learn to ignore irrelevant attributes.
Yulia Resti et al. [18] applied Quadratic Discriminant Analysis (QDA) in mapping the incidence of dengue into five areas in Palembang based on significant factors, such as age, gender, blood group, etc. The overall correct percentage of the mapping results was 66.7%.
Abdul Halim Poh et al. [19] utilized Principal Component Analysis (PCA) and DA to predict the patients’ clinical dengue positivity. The DA method yielded accuracy between 93–98%, sensitivity between 75–89%, and specificity between 94–100%.
ANN and DA have been widely used to predict dengue incidences. That is why we decided to use both methods separately in our research to develop models that can predict the severity level of dengue based on the laboratory test results.

4. Materials and Methods

4.1. Dataset

We used dengue patients’ data from the year 2009 and 2010 to develop the models (https://drive.google.com/drive/folders/1C1ZciLa2Cwsb1IrDpBpULP-2IZrcskBQ?usp=sharing). Many information was written in that data, such as age, gender, patients’ length of stay, clinical symptoms, laboratory test results, and patient diagnosis. The diagnoses were classified into three distinct severity levels, which were DF as the mild level, DHF grade 1 as the intermediate level, and DHF grade 2 as the severe level.
The data was obtained from Department of Microbiology, Universitas Indonesia. The data only consists of 77 dengue patients’ data. It is very small because it is difficult to collect laboratory test results of dengue patients needed in this research. The data was split into training and testing data. We developed the models with three different data splits. That is, with the ratio of training: testing = 70%:30%, 80%:20%, and 90%:10%. Training data was used for learning, which was to fit the parameters (i.e., weights). While testing data was used to assess the (generalization) performance of the neural network [20]. Table 1 is the summary of our data:

4.2. Discriminant Analysis (DA)

Discriminant Analysis (DA) is one of dependency statistical analysis techniques in Multivariate Analysis. It means that in DA, there are dependent and independent variables being analyzed, where the value of the dependent variable depends on the values of the independent variables. In DA, there is only one dependent variable, but there are more than one independent variables. DA can be used if the dependent variable is categorical (in nominal or ordinal scale), and the independent variables are metric (in interval or ratio scale). A categorical dependent variable means the corresponding variable consists of certain categories.
DA is used to classify an object into one of the statistically independent groups or categories based on the values of the independent variables. Classification process in DA is mutually exclusive, that is, if an object is classified into one particular group, it will not be classified into another group.
In DA, discriminant function(s) will be formulated. A discriminant function is used to classify an object into a group or category based on the value of the discriminant function itself. The values of the independent variables of an object will be inputted in the function. Then, the obtained value of the function, which is called the discriminant value, will determine in which group an object belongs. For k categories, k-1 discriminant functions have to be formulated.
Based on the amount of the categories of the dependent variable, there are two types of DA, which are two-group discriminant analysis and Multiple Discriminant Analysis (MDA). Two-group discriminant analysis is a type of DA where the dependent variable consists of two categories. While MDA is another type of DA where the dependent variable consists of more than two categories.
Based on the type of the discriminant function, there are also two types of DA, which are Linear Discriminant Analysis (LDA) and Quadratic Discriminant Analysis (QDA). LDA is a classification method used to find the optimum linear combination of features to separate two or more groups of objects [21]. QDA can be considered as a direct extension of LDA. There are some differences between LDA and QDA. First, the discriminant function in LDA is a linear combination of the independent variables. While the discriminant function in QDA is in the form of quadratic function. Second, in LDA, every covariance matrix of every independent variable has to be the identical. While in QDA, every covariance matrix of every independent variable can be different. The third difference, the decision curve in LDA is in the form of a straight line, while the decision curve in QDA is in the form of a quadratic curve [22].
Assume that observations X j ( k ) of a group k are random vectors of size p sampled from a Gaussian distribution 𝒩 ( μ ( k ) , Σ ( k ) ) , for all j { 1 ,   ,   N } . With a Bayesian approach, an object x can be classified into group k * that reach the maximum value of the posterior density function
k * = argmax k ( f k ( x ) π k ) = argmax k ( log ( f k ( x ) π k ) )
where f k ( x ) = 1 ( 2 π ) p 2 | Σ ( k ) | 1 2 exp { 1 2 ( x μ ( k ) ) ( Σ ( k ) ) 1 ( x μ ( k ) ) T } is the density function of a Gaussian vector and π k is the probability of an object belongs to group k . Formula (1) is the Quadratic Discriminant Analysis (QDA) formula. Linear Discriminant Analysis (LDA) has a similar formula. The only difference is in LDA, the covariance matrices Σ ( k ) are assumed to be identical for all classes [23].

4.3. Artificial Neural Network (ANN)

Artificial Neural Network (ANN) is a simple imitation of neuron structure of a human brain [24]. Similar to human brain, ANN is capable to analyze incomplete or unclear information, and furthermore, evaluate them. ANN imitates human brain in processing input signals and translating it into output signals. ANN is also capable to learn from data without any assumptions of certain functions.
ANN is a part of Artificial Intelligence, along with Support Vector Machines, Expert Systems, and Fuzzy Logic. ANN consists of processing units which are called artificial neuron. Artificial neurons try to imitate the structure and behavior of biological neurons. A neuron can consist of more than one input (dendrite), but commonly consists of only one output (synapsis through axon).
A neuron has a function which determines the activation of the neuron itself. That function is called an activation function. An activation function processes input signals that have been combined together, then transforms them into an output signal. Mathematically, the procedure of signal processing can be expressed as follows:
y ( x ) = Φ ( w i · x i i = 1 n )
where y is the output signal, Φ ( ) is the activation function, x is the input variable, and w is the weight that is given to each of the input variable [25].
There’s a lot of types of activation functions, which are linear, sigmoid, step, ramp, hyperbolic tangent, etc. The most frequently used activation function is sigmoid function. Hyperbolic tangent function has a similar shape to sigmoid function, but its value lies between −1 to +1, unlike sigmoid which the values lies between 0 to 1.
ANN can be widely used in different scopes of problems, such as finding new features of an object, and classifying or predicting an object/event using huge sets of data. Some of the fields where ANN is frequently used are medical diagnosis prediction, character recognition, speech recognition, human face recognition, signature verification application, etc.
ANN has a different way of work with normal computers in many ways. ANN has strengths compared to normal computer programs. Some of the strengths of ANN are as follows: (i) ANN is an adaptive learning method. It imitates human brain on how to do its task while learning, even with different types of inputs; (ii) ANN can organize itself while learning; (iii) ANN works parallelly like human brain; (iv) ANN has a high fault tolerance. It is able to work even on fuzzy, noisy, and incomplete data; and (v) ANN is applicable to classify data, recognize patterns, and any other tasks that involve obscure data.

4.4. Confusion Matrix and Performance Measurement

Confusion matrix is a matrix used to evaluate the performance of classifier models in general. For binary classification problems, confusion matrix size 2 × 2 is used. However, in real life problems, classification can involve more than two classes or categories. These problems are considered as multi-class classification problems. For a multi-class classification problem, confusion matrix size 3 × 3 is used. Principally, 3 × 3 confusion matrix is similar to 2 × 2 confusion matrix. The general form of 3 × 3 confusion matrix is as follows:
A B C A B C [ a a b a c a a b a c b b b c c b c c ]
where the columns denote actual cases, the rows denote predicted cases.
For category A,
True Positive (TP): a a
True Negative (TN): b b + c b + b c + c c
False Positive (FP): b a + c a
False Negative (FN): a b + a c
Calculations for other categories are similar.
In this research, our problem was a multi-class classification problem, because the severity level consisted of three distinct categories, and we aimed to classify each dengue patient into one of those level. So, we used 3 × 3 confusion matrix and these performance measurements below:
  • Accuracy: This metric measures the overall performance of the model. Generally, accuracy is the proportion of true results among the total number of cases examined. Accuracy can be expressed as follows:
    A c c u r a c y = n u m b e r   o f   c o r r e c t l y   c l a s s i f i e d   e x a m p l e s t o t a l   n u m b e r   o f   c a s e s
Accuracy can also be calculated from the elements of the 3 × 3 confusion matrix using the formula below:
A c c u r a c y = i = 1 r x i i i = 1 r j = 1 r x i j
where x denotes an element of the confusion matrix, i denotes the number of rows of the confusion matrix, and j denotes the number of columns of the confusion matrix. Note that the numerator of the Formula (3) is the sum of the main diagonal elements of the confusion matrix, and the denominator is the sum of all of the elements of the confusion matrix.
2
Sensitivity: this metric measures the proportion of the True Positive (TP) cases among the total number of positive cases. The formula to calculate sensitivity is as follows:
S e n s i t i v i t y = T P T P + F N
3
Specificity: This metric measures the proportion of the True Negative (TN) cases among the total number of negative cases. The formula to calculate sensitivity is as follows:
S p e c i f i c i t y = T N T N + F P

5. Results and Discussion

5.1. Model Construction

As mentioned previously, we developed models to predict the severity level of dengue with ANN and DA. The models will be able to predict the severity level of dengue based on the laboratory test results of the corresponding patients. The accuracy of both models developed with ANN and DA would be evaluated, and the one with the higher accuracy would be the proposed model. For short, we mentioned the models as “predictive models” in Figure 1.
By inputting the values of hematocrit, hemoglobin, platelet count, WBC, monocyte, lymphocyte, and neutrophil of the corresponding patient into the predictive model, the model would be able to process those values and predict the severity level of the dengue patient. In other words, the input variables of the model were the independent variables of this research, which were the seven laboratory test parameters. While the output variable was the dependent variable of this research, which was the severity level of dengue.
Before we developed the model, we conducted Spearman’s rank correlation test. This correlation test was conducted for two purposes:
  • To examine whether there is a significant correlation between the independent variables. To be able to use DA, there has to be no multicollinearities between the independent variables. This means all of the independent variables are not correlated. If there’s a couple of correlated independent variables, one of them has to be eliminated.
  • To conduct feature selection. This means to select only the necessary independent variables to develop the models.
First, we conducted a two-ways hypothesis test, where the hypotheses were as follows:
H0: ρ = 0 , which means there is no significant correlation between variable xi and xj, where i, j = 1, …, 7, i j. H1: ρ 0 , which means there is a significant correlation between variable xi and xj.
We used a significance level of 0.05. Below were the results of the correlation test.
The elements of Table 2 were the correlation coefficients between the corresponding independent variables. The starred coefficients mean that there is a significant correlation between the corresponding independent variables, or in other words, the corresponding independent variables were correlated. The correlated independent variables were as follows:
  • Hemoglobin and hematocrit
  • Hemoglobin and platelet count
  • Hematocrit and platelet count
  • WBC and platelet count
  • Neutrophil and lymphocyte
  • Neutrophil and monocyte
Therefore, we eliminated some of the independent variables which were correlated with more than one other independent variables. The eliminated independent variables were hematocrit, platelet count, and neutrophil. Furthermore, we only used the remaining four independent variables to develop the models – hemoglobin, WBC, lymphocyte, and monocyte.
Then, we applied the upSample technique because the data that had been used in this research was imbalanced. upSampling means doing a sampling with replacement, in a random manner, to the data in the minority category until the sample size equals the size of the majority category. upSample is one of the techniques to handle imbalanced dataset. An imbalanced dataset is usually upSampled before it is used to develop the model, so the developed model will tend to have higher accuracy. If the dataset is not upSampled and is directly used to develop a model, it is unlikely to yield a model with high accuracy. The dataset used in our research was imbalanced, in the sense that the classes are not represented equally. Before the data was upSampled, it consisted of 77 cases. From the 77 cases, 38 cases were from category DF, 28 cases were from category DHF grade 1, and 11 cases were from category DHF grade 2. It is clear that the majority category was DF with a size of 38, and the minority category was DHF grade 2 with size 11. After the dataset was upSampled, each category has the size 38. So, in total, the upSampled data consisted of 38 × 3 = 114 cases.
After we applied the upSample technique, we standardized the data. We only standardized the data of the independent variables, which were the seven laboratory test parameters. We didn’t standardize the data of the dependent variable, which was the severity level of dengue, because we wanted the real values of the dependent variable. Data of the independent variables was standardized because the values varied greatly and had different units. It was also standardized in order to develop better models compared to models developed using unstandardized data. After the data was standardized, we started to develop the models. We developed twelve models in total. Six models were developed with ANN, and the other six were developed with DA. We developed the models using R programming version 3.6.3. We decided to use R because it has some advantages, such as: (i) it has data science engine, specifically the statistics and machine learning packages, and (ii) it is an open-source language programming, so it is accessible by anyone.

5.2. Predictive Models Developed Using DA

We developed two DA models. The first model that we developed was Linear Discriminant Analysis (LDA) model, and the second one was Quadratic Discriminant Analysis (QDA) model. Below were the table of performance measurements of both models.
  • LDA
Based on Table 3, it is shown that the LDA model with 70% training data yielded the highest accuracy (45.45%), sensitivity (45.73%), and specificity (72.89%).
2.
QDA
Based on Table 4, it is shown that the QDA model with 80% training data yielded the highest accuracy (60.87%) and sensitivity (59.72%).
Based on the obtained results, QDA model with 80% training data yielded the highest accuracy (60.87%) and sensitivity (59.72%). Therefore, QDA model with 80% training data was the proposed predictive model so far. This model was then compared with the other predictive models developed using ANN.

5.3. Predictive Models Developed Using ANN

We also developed the predictive models using ANN—next will be called ANN model for short. We developed the ANN models using R programming with the nnet package. The nnet trains feed-forward neural networks with traditional backpropagation algorithm [26].
Feed-Forward Neural Networks (FFNN) is a type of ANN that doesn’t have any feedback connection from the output to the input [27]. In FFNN, neurons of the previous layer are entirely connected to the consecutive layer, but there are no intra-layer connections [28]. FFNN is a supervised learning method that is utilized for classification problems and consists of input, hidden, and output layers [29].
Our ANN architecture consisted of one input layer, one hidden layer, and one output layer. There were four neurons in input layer that represented hemoglobin, WBC, lymphocyte, and monocyte.
There was only one hidden layer in this ANN architecture. The number of hidden layers and neurons in each hidden layer were obtained through the iteration process until the accuracy could not be furtherly improved [30]. The number of hidden layers is arbitrary. However, one hidden layer is commonly used for simple problems. The numbers of input and output neurons are determined according to the problem, while the number of hidden neurons was well optimized [20]. The problem in this research could be considered as a simple problem because it only consisted of four independent variables and the purpose was related to medical diagnosis, one of the common purposes of using ANN. That was why one hidden layer was enough for this research. Meanwhile, to ease the iteration process, we decided to use the rule of thumb in determining the hidden neurons. Based on the research conducted by Panchal and Panchal [31], one of the rules in determining the number of hidden neurons is “the number of hidden neurons should be 2/3 of the input layer size, plus the size of the output layer”.
We followed this rule when developing the ANN model. So, the number of hidden neurons is:
Hidden neuron = ( 2 3 × input neuron ) + output neuron = ( 2 3 × 4 ) + 3 = 17 3 = 5.66 ... 6
Meanwhile, the number of input neurons was 4 because there were four remaining laboratory test parameters. The number of the output neuron was 3 because it represented three categories of severity level. The output that would come out was the probability values from each neuron that would sum up to 1. The probability value means the probability of a patient to suffer from dengue in a corresponding level. For example, the output value from neuron DF is 0.3, from DHF grade 1 is also 0.3, and from DHF grade 2 is 0.4. So, the patient is suffering dengue level DHF grade 2 because the probability value is the highest compared to the other two. Figure 2 was the architecture of the developed ANN predictive model.
The activation function is frequently a bounded nondecreasing, differentiable, and nonlinear function such as the hyperbolic tangent or the logistic function [26]. We tried to used logistic and hyperbolic tangent as the activation functions. First, we developed an ANN model with logistic activation function. Then, we developed another ANN model with hyperbolic tangent activation function.
Logistic function may have different forms, but the one used in R programming—the one we used in this research—is the standard logistic function, which is also known as sigmoid function. The formula of the logistic function is as follows
f ( x i ) = 1 1 + e x i = e x i e x i + 1
where x i is the value of the i th independent variable [32].
Meanwhile, hyperbolic tangent function, which is also known as tanh function, has the formula as follows
tan h ( x i ) = sin h ( x i ) cos h ( x i ) = e x i e x i e x i + e x i
where x i is the value of the i th independent variable [33].
We provided the biases and weights of our ANN models, so our developed ANN models can be useful for other researchers who want to conduct similar researches. We displayed them in tables so it would be easier to read. Below is the annotation of the notations in the tables below:
Bi–Hj: weight from Bi to Hj, where i = 1, 2 and j = 1, …, 6.
Ik–Hj: weight from Ik to Hj, where k = 1, …, 4.
Bi–Ol: weight from Bi to Ol, where l = 1, 2, 3.
Hj–Ol: weight from Hj to Ol.
Table 5 and Table 6 were the tables of the biases and weights of our ANN models.
1.
ANN model developed using logistic activation function
2.
ANN model developed using hyperbolic tangent (tanh) activation function
Below were the tables of performance measurements of both models.
  • ANN model developed using logistic activation function
Based on Table 7, it is shown that the ANN model with 70% training data yielded the highest accuracy (90.91%), sensitivity (91.11%), and specificity (95.51%).
2
ANN model developed using hyperbolic tangent (tanh) activation function
Based on Table 8, it is shown that the ANN model with 70% training data yielded the highest accuracy (90.91%), sensitivity (91.11%), and specificity (95.51%).

5.4. The Proposed Predictive Model

Based on the obtained results, both ANN models developed using logistic and hyperbolic tangent activation function with 70% training data yielded the highest accuracy (90.91%), sensitivity (91.11%), and specificity (95.51%). These models also have the highest accuracy, sensitivity, and specificity compared to the previously proposed model, QDA model with 80% training data. Therefore, both ANN models developed using logistic and tanh activation function with 70% training data were the proposed predictive models in this research. We displayed the proposed predictive models’ performances in Table 9.

6. Conclusions and Recommendation

As shown in the previous chapter, both ANN models developed using logistic and hyperbolic tangent activation function with 70% training data yielded the highest accuracy (90.91%), sensitivity (91.11%), and specificity (95.51%). Therefore, both ANN models developed using logistic and tanh activation function with 70% training data are the proposed models to be used as the models to predict the severity level of dengue based on the laboratory test results.
We suggest other researchers consider developing another ANN architecture to be applied to the same data, to which the link has been given to obtain a new model with better performance. Researchers may improvise with the number of hidden neurons and/or hidden layer(s).

Author Contributions

Writing—original draft, developing the ANN models, and algorithm implementation, P.S.; project administrator, designing methods, formal analysis, and final revision, A.B.; Software—developing the DA models, H.M.; Funding acquisition, W.M.; Writing—review & editing, designing methods, and clinical analysis, B.E.D. All authors have read and agreed to the published version of the manuscript.

Funding

This work was fully funded by PDUPT 2020 with the contract no. NKB-2827/UN2.RST/HKP.05.00/2020 from Kementrian Riset dan Teknologi/Badan Riset dan Inovasi Nasional, Indonesia.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are openly available in the link provided in Section 4.1.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Silitonga, P.; Dewi, B.E.; Bustamam, A.; Siswantining, T. Correlation between laboratory characteristics and clinical degree of dengue as an initial stage in a development of machine learning predictor program. AIP Conf. Proc. 2020, 030008, In Symposium on Biomathematics 2019, SYMOMATH 2019. Apri, M., Akimenko, V., Eds.; American Institute of Physics Inc. 2020. 030008. (AIP Conference Proceedings). [Google Scholar] [CrossRef]
  2. Stanaway, J.; Shepard, D.; Undurraga, E.; Halasa, Y.; Coffeng, L.; Brady, O.; Hay, S.I.; Bedi, N.; Bensenor, I.M.; Castañeda-Orjuela, C.A.; et al. The global burden of dengue: An analysis from the Global Burden of Disease Study 2013. Lancet Infect Dis. 2016, 16, 712–723. [Google Scholar] [CrossRef] [Green Version]
  3. Widoretno Sjahrurachman, A.; Dewi, B.; Lischer, K.; Pratami, D.; Flamandita, D.; Sahlan, M. Surface plasmon resonance analysis for detecting non-structural protein 1 of dengue virus in Indonesia. Saudi J. Biol. Sci. 2020, 27, 1931–1937. [Google Scholar] [CrossRef] [PubMed]
  4. Harapan, H.; Michie, A.; Mudatsir, M.; Sasmono, R.; Imrie, A. Epidemiology of dengue hemorrhagic fever in Indonesia: Analysis of fve decades data from the National Disease Surveillance. BMC Res. Notes 2019, 350, 1–6. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Santoso, M.; Yohan, B.; Denis, D.; Hayati, R.; Haryanto, S.; Trianty, L.; Noviyanti, R.; Hibberd, M.L.; Sasmono, R.T. Diagnostic accuracy of 5 different brands of dengue virus non-structural protein 1 (NS1) antigen rapid diagnostic tests (RDT) in Indonesia. Diagn. Microbiol. Infect. Dis. 2020, 98, 1–7. [Google Scholar] [CrossRef] [PubMed]
  6. Beltrán-Silva, S.; Chacón-Hernández, S.; Moreno-Palacios, E.; Pereyra-Molina, J. Clinical and differential diagnosis: Dengue, chikungunya and Zika. Rev. Méd. Hosp. Gen. Méx. 2016, 81, 146–153. [Google Scholar] [CrossRef]
  7. Smartt, C.; Shin, D.; Alto, B. Dengue serotype-specific immune response in Aedes aegypti and Aedes albopictus. Mem. Inst. Oswaldo Cruz. 2017, 112, 829–837. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. Bustamam, A.; Aldila, D.; Yuwanda, A. Understanding Dengue Control for Short- and Long-Term Intervention with a Mathematical Model Approach. J. Appl. Math. 2018, 2018, 1–13. [Google Scholar] [CrossRef] [Green Version]
  9. Götz, T.; Altmeier, N.; Bock, W.; Rockenfeller, R.; Sutimin, S.; Wijaya, K. Modeling dengue data from Semarang, Indonesia. Ecol. Complex. 2016, 30, 57–62. [Google Scholar] [CrossRef]
  10. Lardo, S.; Utami, Y.; Yohan, B.; Tarigan, S.M.; Santoso, W.D.; Nainggolan, L.; Sasmono, R.T. Concurrent infections of dengue viruses serotype 2 and 3 in patient with severe dengue from Jakarta, Indonesia. Asian Pac. J. Trop. Med. 2016, 9, 134–140. [Google Scholar] [CrossRef]
  11. Kuo, H.; Lee, I.; Liu, J. Analyses of clinical and laboratory characteristics of dengue adults at their hospital presentations based on the World Health Organization clinical-phase framework: Emphasizing risk of severe dengue in the elderly. J. Microbiol. Immunol. Infect. 2016, 51, 740–748. [Google Scholar] [CrossRef] [PubMed]
  12. Dewi, B.E.; Lugito, N.P.; Sutrisna, B.; Pohan, H.T.; Syahrurachman, A.; Widodo, D.; Ronoatmodjo, S.; Sudaryo, M.K.; Windiyaningsih, C.; Sudiro, T.M. Scoring Model to Predict Dengue Infection in the Early Phase of Illness in Primary Health Care Centre. Arch. Clin. Microbiol. 2015, 6, 1–8. [Google Scholar]
  13. Laureano-Rosario, A.E.; Duncan, A.P.; Mendez-Lazaro, P.A.; Garcia-Rejon, J.E.; Gomez-Carro, S.; Farfan-Ale, J.; Savic, D.A.; Muller-Karger, F.E. Application of Artificial Neural Networks for Dengue Fever Outbreak Predictions in the Northwest Coast of Yucatan, Mexico and San Juan, Puerto Rico. Trop. Med. Infect. Dis. 2018, 3, 1–16. [Google Scholar] [CrossRef] [Green Version]
  14. Mello-Román, J.; Mello-Román, J.; Gómez-Guerrero, S.; García-Torres, M. Predictive Models for the Medical Diagnosis of Dengue: A Case Study in Paraguay. Comput. Math. Methods Med. 2019, 2019, 1–7. [Google Scholar] [CrossRef] [PubMed]
  15. Baquero, O.; Santana, L.; Chiaravalloti-Neto, F. Dengue forecasting in São Paulo city with generalized additive models, articial neural networks and seasonal autoregressive integrated moving average models. PLoS ONE 2018, 13, e0195065. [Google Scholar] [CrossRef]
  16. Chakraborty, T.; Chattopadhyay, S.; Ghosh, I. Forecasting dengue epidemics using a hybrid methodology. Phys. A: Stat. Mech. Its Appl. 2019, 2019, 1–8. [Google Scholar] [CrossRef]
  17. Siriyasatien, P.; Chadsuthi, S.; Jampachaisri, K.; Kesorn, K. Dengue Epidemics Prediction: A Survey of the State-of-the-Art Based on Data Science Processes. IEEE Access 2018, 6, 53757–53795. [Google Scholar] [CrossRef]
  18. Resti, Y.; Ismail, N.; Verawaty, M. Quadratic Discriminant Analysis of Dengue Viruses Disease Incidence in Palembang. Am. J. Appl. Sci. 2017, 14, 578–582. [Google Scholar] [CrossRef] [Green Version]
  19. Poh, A.H.; Adikan, F.R.M.; Moghavvemi, M.; Omar, S.F.S.; Poh, K.; Mahyuddin, M.B.H.; Yan, G.; Ariffin, M.A.A.; Harun, S.W. Precursors to non-invasive clinical dengue screening: Multivariate signature analysis of in-vivo diffuse skin reflectance spectroscopy on febrile patients in Malaysia. PLoS ONE 2020, 15, e0228923. [Google Scholar] [CrossRef] [Green Version]
  20. Jiang, H.; Xi, Z.; Rahman, A.; Zhang, X. Prediction of output power with artificial neural network using extended datasets for Stirling engines. Appl. Energy 2020, 271, 1–16. [Google Scholar] [CrossRef]
  21. Othman, N.H.; Lee, K.Y.; Radzol, A.R.M.; Mansor, W.; Ramlan, N.N.M. Linear Discriminant Analysis for Detection of Salivary NS1 from SERS Spectra. In Proceedings of the TENCON 2017—2017 IEEE Region 10 Conference, Penang, Malaysia, 5–8 November 2017; pp. 2876–2879. [Google Scholar]
  22. Zhang, M. Identification of protein coding regions in the human genome by quadratic discriminant analysis. Proc. Natl. Acad. Sci. USA 1997, 94, 565–568. [Google Scholar] [CrossRef] [Green Version]
  23. Le, K.; Chaux, C.; Richard, F.; Guedj, E. An adapted linear discriminant analysis with variable selection for the classification in high-dimension, and an application to medical data. Comput. Stat. Data Anal. 2020, 152, 1–12. [Google Scholar] [CrossRef]
  24. Kukreja, H.; Bharath, N.; Siddesh, C.S.; Kuldeep, S. An introduction to artificial neural network. Int. J. Adv. Res. Innov. Ideas Educ. 2016, 1, 27–30. [Google Scholar]
  25. Zhang, Z. A gentle introduction to artificial neural networks. Ann. Transl. Med. 2016. 2016, 4, 1–6. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Günther, F.; Fritsch, S. neuralnet: Training of Neural Networks. R J. 2010, 2, 30–38. [Google Scholar] [CrossRef] [Green Version]
  27. Mansoor, M.; Grimaccia, F.; Leva, S.; Mussetta, M. Comparison of echo state network and feed-forward neural networks in electrical load forecasting for demand response programs. Math. Comput. Simul. 2020, 1–12. [Google Scholar] [CrossRef]
  28. Zhang, A.; Zhou, H.; Li, X.; Zhu, W. Fast and robust learning in Spiking Feed-forward Neural Networks based on Intrinsic Plasticity mechanism. Neurocomputing 2019, 365, 102–112. [Google Scholar] [CrossRef]
  29. Yibre, A.; Koçer, B. Semen quality predictive model using Feed Forwarded Neural Network trained by Learning-Based Artificial Algae Algorithm. Eng. Sci. Technol. Int. J. 2020, 1–9. [Google Scholar] [CrossRef]
  30. Zhao, N.; Charland, K.; Carabali, M.; Nsoesie, E.; Maher-Giroux, M.; Rees, E.; Yuan, M.; Balaguera, C.G.; Ramirez, G.J.; Zinszer, K. Machine learning and dengue forecasting: Comparing random forests and artificial neural networks for predicting dengue burden at national and sub-national scales in Colombia. PLoS Negl. Trop. Dis. 2020, 1–16. [Google Scholar]
  31. Panchal, F.; Panchal, M. Review on Methods of Selecting Number of Hidden Nodes in Artificial Neural Network. IJCSMC 2014, 3, 455–464. [Google Scholar]
  32. Kamson, A.; Sharma, L.; Dandapat, S. Enhancement of the heart sound envelope using the logistic function amplitude moderation method. Comput. Methods Programs Biomed. 2019, 187, 1–9. [Google Scholar] [CrossRef] [PubMed]
  33. Liu, T.; Qiu, T.; Luan, S. Hyperbolic-tangent-function-based cyclic correlation: Definition and theory. Signal Process. 2019, 164, 206–216. [Google Scholar] [CrossRef]
Figure 1. Experiment workflow.
Figure 1. Experiment workflow.
Applsci 11 00943 g001
Figure 2. Artificial Neural Network (ANN) architecture of the predictive models developed using R.
Figure 2. Artificial Neural Network (ANN) architecture of the predictive models developed using R.
Applsci 11 00943 g002
Table 1. Dengue patients’ data summary.
Table 1. Dengue patients’ data summary.
HemoglobinHematocritWBCPlatelet CountNeutrophilLymphocyteMonocyteSeverity Level
Minimum value8.530.3100060002212.100
1st quartile12.638.4233093,0005520.730
Median13.940.63100119,0006425101
Mean 13.941.33178126,18861.7289.50.6
3rd quartile15.4454000161,0007334141
Maximum value17.852.36900278,0008468232
Table 2. The result of Spearman’s rank correlation test between the laboratory test parameters.
Table 2. The result of Spearman’s rank correlation test between the laboratory test parameters.
HemoglobinHematocritWBCPlatelet countNeutrophilLymphocyteMonocyte
Hemoglobin10.957 *0.126−0.475 *−0.1200.1370.001
Hematocrit0.957 *10.186−0.380 *−0.0710.144−0.080
WBC0.1260.18610.262 *0.069−0.1530.028
Platelet count−0.475 *−0.380 *0.262 *10.061−0.1330.099
Neutrophil−0.120−0.0710.0690.0611−0.833 *−0.581 *
Lymphocyte0.1370.144−0.153−0.133−0.833 *10.143
Monocyte0.001−0.0800.0280.099−0.581 *0.1431
* Correlation is significant at the 0.05 level (2-Tailed).
Table 3. Performance measurements of the Linear Discriminant Analysis (LDA) model.
Table 3. Performance measurements of the Linear Discriminant Analysis (LDA) model.
Training Data Percentage
70%80%90%
Accuracy45.45%30.43%27.27%
Sensitivity45.73%31.95%27.78%
Specificity72.89%65.05%63.10%
Table 4. Performance measurements of the Quadratic Discriminant Analysis (QDA) model.
Table 4. Performance measurements of the Quadratic Discriminant Analysis (QDA) model.
Training Data Percentage
70%80%90%
Accuracy60.61%60.87%54.55%
Sensitivity56.07%59.72%55.56%
Specificity82.43%80.67%80.56%
Table 5. The biases and weights of the ANN model developed using logistic activation function.
Table 5. The biases and weights of the ANN model developed using logistic activation function.
Data Training Percentage
70%80%90%
B1–H15.4039439−4.49103004.5469032
I1–H1−15.076642511.3254070−11.0499554
I2–H11.5627953−1.74906822.7245021
I3–H12.7587930−0.91633990.9683699
I4–H1−0.1562694−0.6655746−1.9540237
B1–H2−13.793552−1.4021008−3.2660749
I1–H26.0313709.11469270.7803872
I2–H26.958133−0.39671366.6604275
I3–H24.9423393.8428082−0.7337680
I4–H27.859338−7.95239555.1178065
B1–H3−4.7210114.965760−8.241048
I1–H34.605214−6.5778781.584072
I2–H3−5.5369115.6341946.583553
I3–H3−13.4127747.212313−2.312179
I4–H310.487666−6.1573066.642880
B1–H4−2.739654−5.661526−3.080241
I1–H49.5068761.911992−6.917437
I2–H46.98539610.440174−1.723676
I3–H4−4.91539011.821140−2.475820
I4–H4−5.293412−2.47842210.804223
B1–H510.342958−6.0974985.354134
I1–H5−5.3407654.321218−2.957952
I2–H5−12.0731486.730848−7.420634
I3–H5−1.495051−1.673032−1.183222
I4–H5−9.6814497.487540−4.575078
B1–H61.1958247−10.2314575.565350
I1–H6−12.1059627−1.590068−4.021247
I2–H60.03448648.8651505.553954
I3–H6−9.2652208−4.6161115.455334
I4–H612.018514612.299163−10.527717
B2–O114.765571−11.104739−0.62485862
H1–O1−6.6627085.5251940.90165390
H2–O1−10.577831−6.4031595.92878925
H3–O1−7.2840021.918726−9.87260412
H4–O1−8.7447318.0045590.08648525
H5–O1−12.3193158.997052−7.63236367
H6–O1−4.108802−12.639489−0.07664491
B2–O2−8.337048−0.07675744−3.107592
H1–O2−9.4444448.49491407−9.655582
H2–O28.975756−7.48027294−6.024728
H3–O2−8.7264509.935623675.738516
H4–O24.006490−7.014141239.385994
H5–O210.649440−10.119856712.076983
H6–O214.58352611.582277758.594578
B2–O3−4.4530081−1.135868351.294068
H1–O39.75571320.021803948.395259
H2–O3−3.05769198.99669282−1.187068
H3–O313.5887864−9.102823854.191767
H4–O3−0.8085856−6.27224615−8.351199
H5–O30.8808544−1.029239105.886176
H6–O3−12.49934574.83479909−8.863369
Table 6. The biases and weights of the ANN model developed using tanh activation function.
Table 6. The biases and weights of the ANN model developed using tanh activation function.
Data Training Percentage
70%80%90%
B1–H15.4039439−0.11579674.5469032
I1–H1−15.0766425−5.3827556−11.0499554
I2–H11.5627953−3.42234092.7245021
I3–H12.75879302.94223210.9683699
I4–H1−0.15626944.3960790−1.9540237
B1–H2−13.793552−9.118127−3.2660749
I1–H26.0313702.8364620.7803872
I2–H26.95813312.1011056.6604275
I3–H24.9423391.334247−0.7337680
I4–H27.8593389.3067915.1178065
B1–H3−4.7210114.775554−8.241048
I1–H34.605214−5.1855071.584072
I2–H3−5.5369117.5622616.583553
I3–H3−13.4127748.626503−2.312179
I4–H310.487666−9.5578006.642880
B1–H4−2.73965410.686570−3.080241
I1–H49.506876−1.389448−6.917437
I2–H46.985396−7.260299−1.723676
I3–H4−4.9153904.507227−2.475820
I4–H4−5.293412−11.68031610.804223
B1–H510.3429582.16818605.354134
I1–H5−5.340765−11.4221402−2.957952
I2–H5−12.073148−0.8285786−7.420634
I3–H5−1.495051−5.1982137−1.183222
I4–H5−9.6814499.6726127−4.575078
B1–H61.19582474.91311145.565350
I1–H6−12.1059627−13.9610034−4.021247
I2–H60.03448642.43101585.553954
I3–H6−9.26522082.23660865.455334
I4–H612.01851460.2850172−10.527717
B2–O114.765571−17.24445758−0.62485862
H1–O1−6.6627086.635973090.90165390
H2–O1−10.5778319.957681365.92878925
H3–O1−7.2840023.98047085−9.87260412
H4–O1−8.7447319.884973940.08648525
H5–O1−12.319315−0.06539451−7.63236367
H6–O1−4.108802−7.47772434−0.07664491
B2–O2−8.3370486.135447−3.107592
H1–O2−9.444444−1.290546−9.655582
H2–O28.975756−10.614987−6.024728
H3–O2−8.7264508.3717785.738516
H4–O24.006490−9.9333849.385994
H5–O210.64944011.6917942.076983
H6–O214.583526−8.4216558.594578
B2–O3−4.45300817.26979551.294068
H1–O39.7557132−0.75640208.395259
H2–O3−3.0576919−2.9147165−1.187068
H3–O313.5887864−10.55698714.191767
H4–O3−0.8085856−0.6266332−8.351199
H5–O30.8808544−9.89400815.886176
H6–O3−12.49934579.8877980−8.863369
Table 7. Performance measurements of the ANN model developed using logistic activation function.
Table 7. Performance measurements of the ANN model developed using logistic activation function.
Training Data Percentage
70%80%90%
Accuracy90.91%78.26%72.73%
Sensitivity91.11%81.39%72.22%
Specificity95.51%88.19%86.90%
Table 8. Performance measurements of the ANN model developed using tanh activation function.
Table 8. Performance measurements of the ANN model developed using tanh activation function.
Training Data Percentage
70%80%90%
Accuracy90.91%78.26%72.73%
Sensitivity91.11%79.68%72.22%
Specificity95.51%89.10%86.90%
Table 9. Performance measurements of the proposed predictive ANN model.
Table 9. Performance measurements of the proposed predictive ANN model.
Activation Function
LogisticTanh
Accuracy90.91%90.91%
Sensitivity91.11%91.11%
Specificity95.51%95.51%
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Silitonga, P.; Bustamam, A.; Muradi, H.; Mangunwardoyo, W.; Dewi, B.E. Comparison of Dengue Predictive Models Developed Using Artificial Neural Network and Discriminant Analysis with Small Dataset. Appl. Sci. 2021, 11, 943. https://doi.org/10.3390/app11030943

AMA Style

Silitonga P, Bustamam A, Muradi H, Mangunwardoyo W, Dewi BE. Comparison of Dengue Predictive Models Developed Using Artificial Neural Network and Discriminant Analysis with Small Dataset. Applied Sciences. 2021; 11(3):943. https://doi.org/10.3390/app11030943

Chicago/Turabian Style

Silitonga, Permatasari, Alhadi Bustamam, Hengki Muradi, Wibowo Mangunwardoyo, and Beti E. Dewi. 2021. "Comparison of Dengue Predictive Models Developed Using Artificial Neural Network and Discriminant Analysis with Small Dataset" Applied Sciences 11, no. 3: 943. https://doi.org/10.3390/app11030943

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop