Sensor-Assisted Weighted Average Ensemble Model for Detecting Major Depressive Disorder

Mahendran, Nivedhitha; Vincent, Durai Raj; Srinivasan, Kathiravan; Chang, Chuan-Yu; Garg, Akhil; Gao, Liang; Reina, Daniel Gutiérrez

doi:10.3390/s19224822

Open AccessArticle

Sensor-Assisted Weighted Average Ensemble Model for Detecting Major Depressive Disorder

¹

School of Information Technology and Engineering, Vellore Institute of Technology, Vellore 632014, India

²

Department of Computer Science and Information Engineering, National Yunlin University of Science and Technology, Yunlin 64002, Taiwan

³

State Key Lab of Digital Manufacturing Equipment & Technology, School of Mechanical Science and Engineering, Huazhong University of Science and Technology, Wuhan 430074, China

⁴

Electronic Engineering Department, University of Seville, 41092 Seville, Spain

^*

Author to whom correspondence should be addressed.

Sensors 2019, 19(22), 4822; https://doi.org/10.3390/s19224822

Submission received: 11 October 2019 / Revised: 31 October 2019 / Accepted: 3 November 2019 / Published: 6 November 2019

(This article belongs to the Special Issue Selected Papers from the IEEE International Conference on Consumer Electronics – Taiwan (IEEE 2019 ICCE-TW))

Download

Browse Figures

Versions Notes

Abstract

:

The present methods of diagnosing depression are entirely dependent on self-report ratings or clinical interviews. Those traditional methods are subjective, where the individual may or may not be answering genuinely to questions. In this paper, the data has been collected using self-report ratings and also using electronic smartwatches. This study aims to develop a weighted average ensemble machine learning model to predict major depressive disorder (MDD) with superior accuracy. The data has been pre-processed and the essential features have been selected using a correlation-based feature selection method. With the selected features, machine learning approaches such as Logistic Regression, Random Forest, and the proposed Weighted Average Ensemble Model are applied. Further, for assessing the performance of the proposed model, the Area under the Receiver Optimization Characteristic Curves has been used. The results demonstrate that the proposed Weighted Average Ensemble model performs with better accuracy than the Logistic Regression and the Random Forest approaches.

Keywords:

correlation-based feature selection; random forest; weighted average ensemble; major depressive disorder; smartwatch sensor

1. Introduction

As we live in modern times where computer jobs and fast food chains are used to enhance life, no one is ready to consider exercising or taking time out [1]. With all the stress happening, it is more than typical that the stress comes with anxiety and depression. People these days are confused with what they are doing and do not have a proper vision, which makes them puzzled. In the long term, this can end up in depression [2]. The more dangerous part is that the person may not know they are experiencing depression [3]. The prevailing thought among people is that they should take care of their physical health, but the fact is mental health is just as important as physical health, and in fact, it is more important as anything that affects mental health will eventually affect the physical, too [4]. The reason for Clinical Depression or Major Depressive Disorder (MDD) is still unknown, but experts say it can be one among the following: substance abuse, tragic stressful events, family history, present health issues, or medications [5]. The symptoms usually include long term sadness, hopelessness, thoughts of suicide, and other psychotic symptoms [6]. MDD should be handled at an early stage because there are chances it could lead to suicide or result in mania or bipolar disorder (also known as manic depression), which is more dangerous [7].

Nowadays, mental health issues seem to be growing day by day [8]. It would be easier to treat if there are methods to anticipate the mental health issues [9]. There are two complications in treating a depressed individual. First, it is difficult for the person to identify that MDD is a perilous phenomenon and fail to realize that their mental health is degrading day after day. Then even if they realize the fact that they are affected by depression, getting them to professional help is difficult due to lack of motivation, cost, and mobility [10]. If the patients do seek professional help, the psychiatrists find it difficult to diagnose depression, as it is comorbid, i.e., it occurs with other disorders. Also, the initial level of treatment is a questionnaire, which can be subjective. The psychiatrists may not know whether the patient is truthful or not [11].

Digital smartwatches are a powerful tool in analyzing the behavior of the wearer and use the data for various studies. For instance, how long an average person works out in a day, average calories burnt per day, and much more [12]. Nowadays, researchers are focusing on using smartwatch data to analyze the mental health of individuals. When it is about mental health and seeking help, almost 40% percent of the population reported to have experienced depression or anxiety in their life but never looked for help, and the reason, perhaps, is they did not know where to begin with [13]. That is where the smartwatches and apps come into the picture, as they can gather data without any effort from the wearer. Also, this overcomes the issue with the questionnaire, which can result in a subjective outcome [14]. Smartwatches help in real-time monitoring, and high-frequency data offers an objective and a different variation from the traditional subjective self-reported data.

Machine Learning is one of the tools of Artificial Intelligence [15,16]. Artificial Intelligence is a simulation done with the help of machines, mainly computers of human intelligence. This simulation includes constant learning, reasoning, and adapting by self-correction. Machine Learning is derived from AI, which gives the machines the ability to teach themselves without anyone instructing it what to do [11]. Machine learning algorithms learn themselves from the given data and apply the gained knowledge to make predictions. For diagnosing depression at an early stage, it is better to focus on the overall pattern in the data rather than looking at individual attributes [17]. Machine Learning has algorithms that are experts in finding the pattern from the data. Machine Learning techniques are very good at discovering the best combination of features from the data [5].

The Machine Learning models based on the way they handle the data are classified into three major categories: Supervised Learning, Unsupervised Learning, and Reinforcement Learning [18,19]. The process of prediction or classification involves four stages: collecting data, pre-processing it to handle the missing values and reduce the noise, select the essential features, then implement a model suitable for the data in hand [20].

In this study, we have done all three stages and designed a machine learning model for diagnosing Clinical Depression or Major Depressive Disorder at an early stage where it can be treated easily. We have used two types of data, including objective and subjective. The objective data is recorded from the smartwatch sensors, and the subjective data is taken from the participants through self-rated questionnaires. In the age of modern technologies ruling, we have made use of electronic smartwatches, which has an accelerometer component for gathering data about the participants [21].

Once the features are decided, we have proposed a Weighted Average Ensemble machine learning model by combining the Logistic Regression and Random Forest method. It is always better to combine models than to implement it as individuals, as one’s limits would be compromised by the other. Here, we have proposed a Weight Average Ensemble model by combining the Logistic Regression and Random Forest Model to improve the prediction accuracy.

The key contributions of this work are summarized as follows:

(i): To the best of our knowledge, a Weighted Average Ensemble machine learning model is developed for the first time in this paper detecting Major Depressive Disorder (MDD) using an integrated feature set, and its performance is justified through experimental results.
(ii): A unique integrated feature set is formulated by combining the features from the questionnaire, and the smartwatch sensor encompassing a heart rate monitor.
(iii): The gathered data is pre-processed to handle the missing values with the help of Mean Imputation, and then the significant features are selected using the Correlation-based Feature Selection technique.
(iv): The proposed Weighted Average Ensemble model surpasses the logistic regression, and the random forest approaches in terms of the area under the receiver operating characteristic (ROC) curves measure.
(v): It can be observed from the experimental results that the Weighted Average Ensemble model performs better in terms of accuracy, precision, recall, specificity, and FMeasure in due comparison with Logistic regression and Random Forest Models. Furthermore, the proposed model also illustrates a superior performance with an accuracy of 99.01%.

The remaining portion of the paper is structured as a Review of Literature, Dataset Description, Methodology, Results and Discussion, and Conclusion.

2. Review of Literature

It is an arduous task to differentiate between Parkinson’s disease’s postural re-emergent tremors from essential tremors. So, the authors in this study [22] have used a smartwatch to monitor and record the tremors. They have considered 41 patients for the study, and the recordings were done for an accelerometer from a smartwatch as well as an analog accelerometer in parallel. With the results they got, they concluded that the smartwatch shows more prominence than the analog device. It provides diagnostically accurate and relevant information on postural tremor when compared with other analog methods.

In newborn babies, it is crucial to find out the Initial Heart Rate [23,24]. The heart rate detection using auscultation of heart and palpation of umbilical cords were found to be inaccurate and unreliable [25]. Also, late, the NRP (Neonatal Resuscitation Program) has recommended using pulse oximetry, but that took a long time to detect the heart rate, and sometimes it is even a failure. Then they came up with ECG, which is faster but expensive. Hence, to overcome these issues, the authors have adopted a smartwatch technique to find out the heart rate of newborn babies, which is accurate and inexpensive.

In this approach [14], the authors have implemented a machine learning-based approach using the smartwatches for activity recognition. Furthermore, this involves matching activities with a times series sensor from a smartphone or smartwatch. The activities include Walking, Sitting, Jogging, Standing, Climbing, Eating, and many other activities that involve hands and does not involve hands. They have used the WEKA tool for implementing the machine learning models. The algorithms employed are Random Forest, IB3 instance-based, J48 Decision Tree, Multi-Layered Perceptron, and Naïve Bayes. They have compared the performance of the accelerometer and gyroscope of the smartwatch and found out that the accelerometer of the smartwatch performs better than the gyroscope.

For people with mental disorders such as dementia, it is essential to monitor patients to ensure whether they are performing the proper exercise and the necessary amount of sunlight. For their health and safety, it is imperative to monitor them continuously. The authors in this study [26] have developed a smartwatch with features such as accelerometer, GPS, and illumination sensor. Moreover, this will help in monitoring patients with dementia. The proposed algorithm effectively identifies the amount of exercise the patients are taking and everything else that is required, and it shows 96% of success with its experimental results.

In this approach [27], the authors have implemented a machine learning model by gathering sensor data from the electronic smartwatches to predict the alcohol content in blood in real-time. They have also developed an Android application to connect to the sensors and also to store data from the sensor. They have implemented both classification and regression models. For the regression problem, they have used Linear Regression and Artificial Neural Networks, and for the classification problem, they have using Support Vector Machine and Logistic Regression. They have evaluated their model using RMSE for regression and recall, FMeasure, and Precision for classification. In the end, they found that this problem is tackled effectively if it is considered as a classification problem, and among the two classification algorithms, SVM performed better.

It is vital to diagnose depression at an early stage in order to treat it. In this approach, the authors [28] have done a non-linear analysis of EEG signals to differentiate depressed patients from healthy individuals. They have considered 45 depressed patients and extracted features from the EEG signals. They have used three machine learning classifiers, such as K-Nearest Neighbour, Linear Discriminant Analysis, and Logistic Regression. The results show that the highest classification accuracy is achieved by the Logistic Regression classifier (90%).

The studies mentioned above are sensor-based and machine learning related approaches applied to various healthcare domains. There are very fewer works on sensor-based approaches along with machine learning applied in classifying or predicting MDD. The major drawback of using the smartwatch sensors is that various factors would affect the sensors and heart rate-monitors of the wearer. The traditional method to diagnose depression is using a questionnaire. However, most of the time, the questionnaire seems to be subjective, whereas the sensors generate objective data, i.e., it requires no effort from the subject. Hence, in this study, we have used the combination of both (sensors and questionnaire) to collect the subjective and objective data and validate the results in classifying the subjects with MDD.

3. Methodology

In this study, we have used the Mi band-3 smartwatch to detect the behavior and heart rate of the participants in diagnosing MDD. The participants were given a questionnaire to fill every day in the morning for one week. The participants were made to wear the smartwatch all the time for one week including the weekend, even when they are sleeping. The smartwatches can measure the sleep patterns and have the component for continuous heart-rate monitoring. The participants in this study are a combination of male and female, with an average age of 40, weight of 60 kg, and height 176 cm.

The methodology comprises of three phases: Pre-processing of data, selecting essential features from the data, and applying Machine Learning models individually on the high discriminative features, and also applying the proposed weighted average ensemble machine learning model on the data, then comparing it with the individual implementation of the base models. In this section, we explain the process by which we did the classification. Figure 1 illustrates the overall architecture of the study.

3.1. Dataset Description

We have collected data from 500 average aged people who complained about mood swings. We used data from the Questionnaire, Smartwatch, and Heart rate monitor. The data collection involved 500 people, and among the 500 records, we rejected 50 because of technical issues. Together the dataset consists of 35 features, which are later reduced using feature selection techniques. We have used the Hamilton Depression Rating Scale [29] for the questionnaire and the features from the accelerometer of the smartwatch. All these features are combined after feature selection and then passed through machine learning models.

3.2. Hamilton Depression Rating Scale

Hamilton Depression Rating Scale (HDRS) is the most widely used self-rating report for assessing depression. The questionnaire consists of 21 items to rate the severity of depression. Among the 21 items, 17 are severity measures for the depression while the remaining four are symptoms related to depression, but not considered as severity measures, for instance, paranoia or obsessive symptoms. The interpretation of the scores in HDRS is 0–7 is normal, 8–16 is mild depression, 17–23 is moderate depression, and greater than 24 is considered as severe depression [30].

3.3. Smart Watch Sensors

A smartwatch is similar to the wristwatch in appearance, but does so many things other than keep time. The digital smartwatches, Bluetooth is enabled, and the features can be extended to smartphones. In such cases, the users can use the smartwatch for reading the messages, answer phone calls, check the weather, and many advanced features [12]. In addition to these benefits, the smartwatches help in analyzing the behavior of the wearer and determine their mental health. The smartwatch has two sensors, Accelerometer and Gyroscope [13] for recording the data from the gestures that the wearer makes.

3.3.1. Accelerometer

The accelerometer is used in the smartwatches to detect the movement and orientation of the wearer. With accelerometer in the smartwatch, around two dozens of gestures and actions are detected, irrespective of one or two hands [20]. These gestures are later mapped to the controls or software applications. The typical variation used in the smartwatch is the Tri-axial, which keeps track of the physical activities of the wearers. Unlike the uniaxial variation, which records only the up and down movements, the Tri-axials record up and down, side-to-side, and back-and-forth movements [18].

3.3.2. Gyroscope

Gyroscope sensors are also used to measure the orientation and the angular velocity of the wearer. Gyroscope sensors have advanced functionalities than the accelerometers. These sensors can track the lateral and also the tilt orientations. On the other hand, accelerometers can track only the linear motions [31]. The design of the gyroscope has a rotating disk called the motor, which is mounted on a spinning axis. This sensor determines the orientation of the wearer with the help of Earth’s gravitational force [32].

3.3.3. Heart-Rate Monitor

The heart-rate monitors used in the smartwatches are designed based on a principle called Photo Plethysmography (PPG) [32]. PPG is the process of sending a shining light through the skin and measuring the scattered amount in the blood. The scattering is based on the blood flow dynamics such as the blood volume or the pulse rate. The three essential components in measuring the heart-rate are Optical Emitter (LEDs to send the light), Digital Signal Processor (captures the refracted light from the user and converts them into heart rate data), and Accelerometer (used in combination with the DSP in measuring the motions). With the help of DSP and Accelerometer data, the calories burned, blood pressure, and oxygen levels in the blood can also be measured [12].

3.4. Pre-Processing

Data pre-processing is a technique to remove the missing values and irrelevant values from the data. Data collected from various sources will usually have missing values, redundant and irrelevant data, which will reduce the prediction accuracy of the model [33]. It is essential to clean the data before applying any algorithm to improve the performance of the model [34].

In this study, we have used a single imputation method, which is mean imputation. In mean imputations, the missing values in certain features will be replaced by taking the mean of the available values in the feature column [35]. This is a simple yet effective method for handling missing values.

3.5. Feature Selection

Feature selection is also a part of pre-processing, where the number of features in the original dataset is reduced [36]. The reason for performing feature selection is to avoid features that do not affect the target variable [37]. Feature selection is a critical phase in any machine learning process, which significantly impacts the performance of the model. Partially irrelevant or irrelevant features are considered to negatively impact the model accuracy [38]. It is the process of selecting (automatically or manually) the critical feature that contributes more to the target variable or prediction variable [39]. If the model learns from the data that has missing values and irrelevant values, then the model would become weak.

In this study, we have used a correlation-based feature selection method to remove the features that interact the most and not having any part in the classification or prediction process. The correlation-based feature selection technique will identify the best possible subset from the whole feature set that might have a potential impact on the model outcome. The features are eliminated based on their inter-feature correlation threshold. This technique takes into consideration the individual attribute abilities and the redundancy among them with the help of merits score of subsets [40].

3.6. Machine Learning Models

Machine learning is one of the applications of Artificial Intelligence, which makes the system capable of making its own decision by learning from past events, and also improve and adapt to future changes [8]. Based on the way of handling the data, machine learning mainly consists of three types. Primarily, Supervised Learning where the data is labeled, and the output is known already. Secondly, Unsupervised Learning where the data is unlabelled, and the output is decided later from the inferences. Finally, Reinforcement Learning, which is based on the feedback mechanism, the algorithm is in such a way that it interacts with the environment, finds the rewards or errors [8,20].

In this study, we have implemented two supervised machine learning methods: Logistic Regression and Random Forest, along with the proposed weighted average ensemble model. Logistic Regression is a statistical method usually used in cases where the target variable is categorical (i.e., dichotomous, 1 or 0) [41]. It is used widely for predictive analysis also to interpret the relationship between one binary variable that is dependent and other independent or nominal variables. The logistic regression is represented by a sigmoidal curve [42]. Figure 2 shows a sample sigmoidal curve where ‘x’ can be any dependent attribute; if the curve goes towards the positive side, then the prediction becomes 1. If it goes towards negative, then the prediction becomes 0. The logistic regression equation is given by [43],

P (A) = \frac{1}{1 + e^{- (a_{0} + a_{1} B)}}

(1)

where,

P(A)—Is the probability of A (Dependent Variable)
a₀—moves the curve right and left
a₁—Slope
B—Nominal Variable or Independent Variable.

Random forest is the most widely used machine learning model that can be used for both regression and classification. Random forest is an ensemble of several decision trees, and the prediction is made as a result of taking an average of all the predictions from the decision trees [44]. The random forest has a root node which separates the sample classes, which is further divided into several branches [45]. The classifications are done using the training data, and the predictions are accomplished using the testing data. Figure 3 represents a sample schematic of the random forest approach.

Also, we have implemented the weighted average method by combing the Logistic Regression model and the Random Forest model to improve the prediction accuracy. Two is always better than one. A weighted average is considered to be an extension to the average method where multiple predictions are made on the data points, and then an average of all the predictions would be taken as the final prediction value. On the other hand, in the weighted average, each data point would be given a pre-defined weight in order to show their importance in the prediction, and then the weighted average of all the data points is considered to be the final prediction. In a weighted ensemble, each member of the ensemble contributes to the final prediction. In the case of class label prediction, the mode of the predictions by the members is used for final prediction. In the case of class probability prediction, argmax of the summed probabilities of every class label is used for the final prediction.

4. Results and Discussion

The dataset consists of 500 records combining the questionnaire and smartwatch encompassing the heart monitor sensor data. Of the 500 data, we have rejected 50 due to technical errors and used 450 records. The 450 records are pre-processed using mean imputation technique to handle all the missing values in the accelerometer data from smartwatch data as well as the questionnaire data. After correcting the missing values, the feature selection technique is employed. The feature selection technique is applied to data from the questionnaire and also the accelerometer data from a smartwatch. The data is segregated into a training set and testing set where the training is done with the training set, and the predictions are made using the testing set. The composition of training and testing data is 80% and 20% with 10-Fold cross-validation.

The selected features from the questionnaire excluding the Outcome feature, which is the Target variable, is:

Feeling Sad
Feeling Irritable
Feeling Anxious about Tense
Response to Mood to Good or Desired Events
The mood in Relation to the Time of Day
Thoughts of Death or Suicide
Capacity for Pleasure or Enjoyment
Bodily Symptoms
Panic/Phobic Symptoms

The selected features from the accelerometer of the smartwatch are:

Standard Deviation
Root Mean Square
Root Sum Square
Upper Quartile
Lower Quartile
Kurtosis

Along with the nine features from the questionnaire and six from the accelerometer heart rate from the heart rate monitor, there are totally 16 features selected by the correlation-based feature selection technique.

After removing the missing values and selecting the critical features, two machine learning techniques are implemented, Logistic Regression and Random Forest. The classifiers are evaluated using Confusion Matrix, Accuracy, and cut-off points along with the AUC-ROC Curve (Area under the Receiver Optimization Characteristics). Confusion Matrix is a table used for evaluating a model whose truth-values are known. The confusion matrix consists of features such as Accuracy, Specificity, Sensitivity, Precision, and Recall. The formula and definition of confusion matrix features are given in Table 1. We have utilized the AUC-ROC curve, too, which is the most critical metric for evaluating the performance of the classification model. In AUC-ROC, AUC is the measure or degree of separability and ROC is the probability curve. When the model generates higher AUC, it implies that the model is efficiently classifying the patients without disease and with disease. The AUC-ROC curve is plotted with the True Positive Rate (TPR) on the y-axis against the False Positive Rate (FPR) on the x-axis. AUC-ROC says the capability of the model in differentiating the classes.

Figure 4 and Figure 5 represents the Accuracy vs. cut-off plot for Logistic regression and Random Forest, respectively. With the results, we have found that the Accuracy and cut-off for Logistic regression are 93% and 62%, respectively. The Accuracy and cut-off for Random Forest are 98% and 88%, respectively.

Figure 6, Figure 7 and Figure 8 represents the AUC-ROC curve for Logistic Regression, Random Forest approaches, and Weighted Average Ensemble Model, respectively. The Area under the curve for Logistic Regression is found to be 95%, for Random Forest Approach, it is 99.31%, and for the Weighted Average Ensemble Model, it is 99.76%, which is a reasonable improvement. From Figure 9, we observe that the Weighted Average Ensemble model shows better results when compared to Logistic Regression and Random Forest Approaches.

The values of the confusion matrix for Logistic Regression, Random Forest, and Weighted Average Ensemble Models are given in Table 2. From this table, we can realize that the performance of the Random Forest machine learning model is more accurate and useful than the Logistic Regression model. Random Forest Approach is 98% accurate, and Logistic Regression Model is considerably less, which is 93% accuracy. We can also notice that the Weighted Average Ensemble model performs marginally better than the two models implemented individually. We found out that, in the Weighted Average Ensemble model, as the error rates are minimized, the algorithm provides faster results than the individual implementations of Logistic Regression and Random Forest Approaches.

5. Conclusions and Future Work

In this study, we have implemented the machine learning models to diagnose Clinical Depression at the earliest possible time. We have gathered data from the electronic smartwatch and questionnaire. The smartwatch has a sensor called Accelerometer. We have used accelerometer sensor data and combined it with the features from the questionnaire. We have used the Hamilton Depression Rating Scale as a questionnaire, but it is subjective data, hence why we used the smartwatch sensor data, which was attached to the wrists of patients. The data collected are pre-processed using the mean-imputation technique, and then correlation-based feature selection technique is applied separately on questionnaire data, and accelerometer data and then the selected features are combined together, and then Logistic Regression and Random Forest machine learning models are applied on the features and then combined the two models to form a Weighted Average ensemble model. For evaluating the implemented model, we have employed Confusion Matrix and Area under the Receiver Optimization Characteristics, and the results show that the Random Forest model performs better in predicting the depressed patients than the Logistic Regression model and the Weighted Average Ensemble performs better than the two models.

In this implementation, we have considered only the accelerometer sensor of the smartwatch. The accelerometer will help in orientation only when the object is relative to the earth’s surface. For instance, when the object is under free fall, accelerometer data will show zero acceleration. The other sensor which is used in the smartwatches is the Gyroscope, which will sense rotation and movements that are not relative to the earth’s surface. In our future works, we will consider both the sensors of the smartwatch for better accuracy and employ various other machine learning models to explore this area of research further.

Author Contributions

This research specifies below the individual contributions. “Conceptualization—D.R.V. and N.M.; Data curation—K.S.; Formal analysis—N.M.; Funding acquisition—C.-Y.C.; Investigation—L.G. and A.G.; Methodology—N.M.; Project administration—K.S. and D.G.R.; Resources—C.-Y.C.; Software—A.G.; Supervision—D.R.V., K.S. and C.-Y.C.; Validation—D.R.V. and N.M.; Visualization—N.M. and D.G.R.; Writing—review & editing—D.R.V., K.S. and C.-Y.C.

Funding

Part of this work was financially supported by the “Intelligent Recognition Industry Service Research Center” from The Featured Areas Research Center Program within the framework of the Higher Education Sprout Project by the Ministry of Education (MOE) in Taiwan.

Conflicts of Interest

The authors declare no conflict of interest.

References

Moreno, M.A.; Jelenchick, L.A.; Egan, K.G.; Cox, E.; Young, H.; Gannon, K.E.; Becker, T. Feeling bad on Facebook: Depression disclosures by college students on a social networking site. Depress. Anxiety 2010, 28, 447–455. [Google Scholar] [CrossRef] [PubMed]
McElroy, E.; Fearon, P.; Belsky, J.; Fonagy, P.; Patalay, P. Networks of Depression and Anxiety Symptoms Across Development. J. Am. Acad. Child Adolesc. Psychiatry 2018, 57, 964–973. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Fried, E.I.; Nesse, R.M. Depression is not a consistent syndrome: An investigation of unique symptom patterns in the STAR*D study. J. Affect. Disord. 2015, 172, 96–102. [Google Scholar] [CrossRef] [PubMed]
Fried, E.; Nesse, R.; Zivin, K.; Guille, C.; Sen, S. Depression is more than the sum score of its parts: Individual DSM symptoms have different risk factors. Psychol. Med. 2014, 44, 2067–2076. [Google Scholar] [CrossRef] [PubMed]
Mahendran, N.; Vincent, D.R. Effective Classification of Major Depressive Disorder Patients Using Machine Learning Techniques. Recent Pat. Comput. Sci. 2019, 12, 41–48. [Google Scholar] [CrossRef]
Klakk, H.; Kristensen, P.L.; Andersen, L.B.; Froberg, K.; Møller, N.C.; Grøntved, A. Symptoms of depression in young adulthood is associated with unfavorable clinical- and behavioral cardiovascular disease risk factors. Prev. Med. Rep. 2018, 11, 209–215. [Google Scholar] [CrossRef]
Gerrits, M.M.; van Oppen, P.; van Marwijk, H.W.; Penninx, B.W.; van der Horst, H.E. Pain and the onset of depressive and anxiety disorders. Pain 2014, 155, 53–59. [Google Scholar] [CrossRef]
Dietterich, T.G. Machine learning for sequential data: A review. In Proceedings of the Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR), Windsor, ON, Canada, 6–9 August 2002; Springer: Berlin/Heidelberg, Germany, 2002; pp. 15–30. [Google Scholar]
Cohn, J.F.; Kruez, T.S.; Matthews, I.; Yang, Y.; Nguyen, M.H.; Padilla, M.T.; Zhou, F.; De la Torre, F. Detecting depression from facial actions and vocal prosody. In Proceedings of the 2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops, Amsterdam, The Netherlands, 10–12 September 2009; pp. 1–7. [Google Scholar]
Karasz, A.; Dowrick, C.; Byng, R.; Buszewicz, M.; Ferri, L.; Hartman TC, O.; Reeve, J. What we talk about when we talk about depression: Doctor-patient conversations and treatment decision outcomes. Br. J. Gen. Pract. 2012, 62, e55–e63. [Google Scholar] [CrossRef]
Papakostas, G.I.; Petersen, T.; Mahal, Y.; Mischoulon, D.; Nierenberg, A.A.; Fava, M. Quality of life assessments in major depressive disorder: A review of the literature. Gen. Hosp. Psychiatry 2004, 26, 13–17. [Google Scholar] [CrossRef]
Lu, T.C.; Fu, C.M.; Ma MH, M.; Fang, C.C.; Turner, A.M. Healthcare applications of smart watches. Appl. Clin. Inform. 2016, 7, 850–869. [Google Scholar] [CrossRef]
Bonino, D.; Corno, F.; De Russis, L. Dwatch: A personal wrist watch for smart environments. Procedia Comput. Sci. 2012, 10, 300–307. [Google Scholar] [CrossRef]
Weiss, G.M.; Timko, J.L.; Gallagher, C.M.; Yoneda, K.; Schreiber, A.J. Smartwatch-based activity recognition: A machine learning approach. In Proceedings of the 2016 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI), Las Vegas, NV, USA, 25–27 February 2016; pp. 426–429. [Google Scholar]
Morganti, E.; Angelini, L.; Adami, A.; Lalanne, D.; Lorenzelli, L.; Mugellini, E. A smart watch with embedded sensors to recognize objects, grasps and forearm gestures. Procedia Eng. 2012, 41, 1169–1175. [Google Scholar] [CrossRef]
Sanchez-Riera, J.; Srinivasan, K.; Hua, K.; Cheng, W.; Hossain, M.A.; Alhamid, M.F. Robust RGB-D Hand Tracking Using Deep Learning Priors. IEEE Trans. Circuits Syst. Video Technol. 2017, 28, 2289–2301. [Google Scholar] [CrossRef]
Stetco, A.; Dinmohammadi, F.; Zhao, X.; Robu, V.; Flynn, D.; Barnes, M.; Keane, J.; Nenadic, G. Machine learning methods for wind turbine condition monitoring: A review. Renew. Energy 2019, 133, 620–635. [Google Scholar] [CrossRef]
Lison, P. An Introduction to Machine Learning; Language Technology Group: Edinburgh, UK, 2015. [Google Scholar]
Mitchell, T.; Buchanan, B.; DeJong, G.; Dietterich, T.; Rosenbloom, P.; Waibel, A. Machine learning. Annu. Rev. Comput. Sci. 1990, 4, 417–433. [Google Scholar] [CrossRef]
Carbonell, J.G.; Michalski, R.S.; Mitchell, T.M. An overview of machine learning. In Machine Learning; Morgan Kaufmann: Burlington, MA, USA, 1983; pp. 3–23. [Google Scholar]
Hänsel, K.; Alomainy, A.; Haddadi, H. Large scale mood and stress self-assessments on a smartwatch. In Proceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing: Adjunct, Heidelberg, Germany, 12–16 September 2016; pp. 1180–1184. [Google Scholar]
Wile, D.J.; Ranawaya, R.; Kiss, Z.H. Smart watch accelerometry for analysis and diagnosis of tremor. J. Neurosci. Methods 2014, 230, 1–4. [Google Scholar] [CrossRef] [PubMed]
Chang, C.Y.; Chang, C.W.; Kathiravan, S.; Lin, C.; Chen, S.T. DAG-SVM based infant cry classification system using sequential forward floating feature selection. Multidimens. Syst. Signal Process. 2017, 28, 961–976. [Google Scholar] [CrossRef]
Chen, S.; Srinivasan, K.; Lin, C.; Chang, C. Chapter 10—Neonatal Cry Analysis and Categorization System Via Directed Acyclic Graph Support Vector Machine. In Intelligent Data-Centric Systems, Big Data Analytics for Sensor-Network Collected Intelligence; Hsu, H., Chang, C., Hsu, C., Eds.; Academic Press: Cambridge, MA, USA, 2017; pp. 205–222. [Google Scholar] [CrossRef]
Lin, Y.C.; Wei, K.C. An electronic smart watch monitors heart rate of an extremely preterm baby. Pediatrics Neonatol. 2018, 59, 214–215. [Google Scholar] [CrossRef] [Green Version]
Shin, D.; Shin, D.; Shin, D. Ubiquitous health management system with watch-type monitoring device for dementia patients. J. Appl. Math. 2014, 2014, 878741. [Google Scholar] [CrossRef]
Gutierrez, M.A.; Fast, M.L.; Ngu, A.H.; Gao, B.J. Real-time prediction of blood alcohol content using smartwatch sensor data. In ICSH; Springer: Cham, Switzerland, 2015; pp. 175–186. [Google Scholar]
Hosseinifard, B.; Moradi, H.M.; Rostami, R. Classifying depression patients and normal subjects using machine learning techniques and nonlinear features from EEG signal. Comput. Methods Programs Biomed. 2013, 109, 339–345. [Google Scholar] [CrossRef]
Hamilton, M. The Hamilton rating scale for depression. In Assessment of Depression; Springer: Berlin/Heidelberg, Germany, 1986; pp. 143–152. [Google Scholar]
Williams, J.B. A structured interview guide for the Hamilton Depression Rating Scale. Arch. Gen. Psychiatry 1988, 45, 742–747. [Google Scholar] [CrossRef] [PubMed]
Mekruksavanich, S.; Hnoohom, N.; Jitpattanakul, A. Smartwatch-based sitting detection with human activity recognition for office workers syndrome. In Proceedings of the 2018 International ECTI Northern Section Conference on Electrical, Electronics, Computer and Telecommunications Engineering (ECTI-NCON), Chiang Rai, Thailand, 25–28 February 2018; pp. 160–164. [Google Scholar]
Lee, Y.; Song, M. Recognizing problem behaviors of children with developmental disabilities using smartwatch. In Proceedings of the 2016 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), San Francisco, CA, USA, 10–14 April 2016; pp. 1001–1002. [Google Scholar]
Kotsiantis, S.B.; Kanellopoulos, D.; Pintelas, P.E. Data preprocessing for supervised leaning. Int. J. Comput. Sci. 2006, 1, 111–117. [Google Scholar]
Huang, J.; Li, Y.; Xie, M. An empirical analysis of data preprocessing for machine learning-based software cost estimation. Inf. Softw. Technol. 2015, 67, 108–127. [Google Scholar] [CrossRef]
Donders AR, T.; Van Der Heijden, G.J.; Stijnen, T.; Moons, K.G. A gentle introduction to imputation of missing values. J. Clin. Epidemiol. 2006, 59, 1087–1091. [Google Scholar] [CrossRef] [PubMed]
Chang, C.; Srinivasan, K.; Chen, S.; Chang, M.; Sharma, V. An Efficient SVM Based Lymph Node Classification Approach Using Intelligent Communication Ant Colony Optimization. J. Med. Imaging Health Inform. 2018, 8, 1077–1086. [Google Scholar] [CrossRef]
Hira, Z.M.; Gillies, D.F. A Review of Feature Selection and Feature Extraction Methods Applied on Microarray Data. Adv. Bioinform. 2015, 2015, 198363. [Google Scholar] [CrossRef]
Kira, K.; Rendell, L.A. A practical approach to feature selection. In Machine Learning Proceedings 1992; Morgan Kaufmann: Burlington, MA, USA, 1992; pp. 249–256. [Google Scholar]
Blum, A.L.; Langley, P. Selection of relevant features and examples in machine learning. Artif. Intell. 1997, 97, 245–271. [Google Scholar] [CrossRef] [Green Version]
Mursalin, M.; Zhang, Y.; Chen, Y.; Chawla, N.V. Automated epileptic seizure detection using improved correlation-based feature selection with random forest classifier. Neurocomputing 2017, 241, 204–214. [Google Scholar] [CrossRef]
Dreiseitl, S.; Ohno-Machado, L. Logistic regression and artificial neural network classification models: A methodology review. J. Biomed. Inform. 2002, 35, 352–359. [Google Scholar] [CrossRef]
Harrington, P. Machine Learning in Action; Manning: Greenwich, UK, 2012; Volume 5. [Google Scholar]
Tabaei, B.P.; Herman, W.H. A multivariate logistic regression equation to screen for diabetes: Development and validation. Diabetes Care 2002, 25, 1999–2003. [Google Scholar] [CrossRef]
Ham, J.; Chen, Y.; Crawford, M.M.; Ghosh, J. Investigation of the random forest framework for classification of hyperspectral data. IEEE Trans. Geosci. Remote Sens. 2005, 43, 492–501. [Google Scholar] [CrossRef] [Green Version]
Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]

Figure 1. Architectural Diagram of the Proposed Model.

Figure 2. Sigmoidal Curve. Description: Here, ‘x’ can be any dependent attribute.

Figure 3. The schematic diagram for Random Forest Approach.

Figure 4. Accuracy Vs. cut-off curve for Logistic Regression Approach.

Figure 5. Accuracy Vs. cut-off curve for Random Forest Approach.

Figure 6. AUC-ROC curve for Logistic Regression Approach.

Figure 7. AUC-ROC for Random Forest Approach.

Figure 8. AUC-ROC curve for Weighted Average Ensemble Model.

Figure 9. Performance comparison between Logistic Regression Model, Random Forest Approach, and the proposed Weighted Average Ensemble Model.

Table 1. Confusion Matrix Components.

Confusion Matrix	Definition	Formula
Accuracy	It is the ratio of correctly classified to the whole set. For instance, which answers the question: How many patients did we correctly diagnosed as depressed out of all the patients?	TN + TP/All
Precision	It is the ratio of correctly classified positive subjects to all the positive subjects. For instance, which answers the question: How many of the patients whom we named as depressed are actually depressed?	TP/TP + FP
Sensitivity (Recall)	It is the ratio of correctly classified positive subjects to all those who have the disease in reality. Which answers the question: Of all the depressed people in the dataset, how many did we correctly predict as depressed?	TP/TP + FN
Specificity	It is the ratio of correctly classified negative subjects to all the healthy subjects in reality. Which answers the question: Of all the healthy people in the dataset, how many we correctly predict as not depressed?	TN/TN + FP
FMeasure	It is a combination of both recall and precision. Harmonic average.	2 × (Precision × Recall)/(Recall + Precision)

Table 2. Performance Evaluation of LR, RF, and the proposed Weighted Average Ensemble Model.

Performance Metrics	Logistic Regression	Random Forest	Weighted Average
Accuracy	0.9318	0.9839	0.9901
Precision	0.9539	0.9673	0.9754
Sensitivity (Recall)	0.8430	0.9729	0.9840
Specificity	0.9785	0.9772	0.9887
FMeasure	0.8950	0.9465	0.9795

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Mahendran, N.; Vincent, D.R.; Srinivasan, K.; Chang, C.-Y.; Garg, A.; Gao, L.; Reina, D.G. Sensor-Assisted Weighted Average Ensemble Model for Detecting Major Depressive Disorder. Sensors 2019, 19, 4822. https://doi.org/10.3390/s19224822

AMA Style

Mahendran N, Vincent DR, Srinivasan K, Chang C-Y, Garg A, Gao L, Reina DG. Sensor-Assisted Weighted Average Ensemble Model for Detecting Major Depressive Disorder. Sensors. 2019; 19(22):4822. https://doi.org/10.3390/s19224822

Chicago/Turabian Style

Mahendran, Nivedhitha, Durai Raj Vincent, Kathiravan Srinivasan, Chuan-Yu Chang, Akhil Garg, Liang Gao, and Daniel Gutiérrez Reina. 2019. "Sensor-Assisted Weighted Average Ensemble Model for Detecting Major Depressive Disorder" Sensors 19, no. 22: 4822. https://doi.org/10.3390/s19224822

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Sensor-Assisted Weighted Average Ensemble Model for Detecting Major Depressive Disorder

Abstract

1. Introduction

2. Review of Literature

3. Methodology

3.1. Dataset Description

3.2. Hamilton Depression Rating Scale

3.3. Smart Watch Sensors

3.3.1. Accelerometer

3.3.2. Gyroscope

3.3.3. Heart-Rate Monitor

3.4. Pre-Processing

3.5. Feature Selection

3.6. Machine Learning Models

4. Results and Discussion

5. Conclusions and Future Work

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI