Next Article in Journal
Signal Pattern Recognition Based on Fractal Features and Machine Learning
Previous Article in Journal
Influence of Aggregate Gradation on the Engineering Properties of Lightweight Aggregate Concrete
Previous Article in Special Issue
Comparing Deep Learning and Classical Machine Learning Approaches for Predicting Inpatient Violence Incidents from Clinical Text
Article Menu

Export Article

Open AccessArticle
Appl. Sci. 2018, 8(8), 1325; https://doi.org/10.3390/app8081325 (registering DOI)

Hybrid Prediction Model for Type 2 Diabetes and Hypertension Using DBSCAN-Based Outlier Detection, Synthetic Minority Over Sampling Technique (SMOTE), and Random Forest

1
Department of Industrial and Systems Engineering, Dongguk University, Seoul 100-715, Korea
2
u-SCM Research Center, Nano Information Technology Academy, Dongguk University, Seoul 100-715, Korea
*
Authors to whom correspondence should be addressed.
Received: 16 July 2018 / Revised: 6 August 2018 / Accepted: 6 August 2018 / Published: 8 August 2018
(This article belongs to the Special Issue Data Analytics in Smart Healthcare)
View Full-Text   |   Download PDF [2041 KB, uploaded 15 August 2018]   |  

Abstract

As the risk of diseases diabetes and hypertension increases, machine learning algorithms are being utilized to improve early stage diagnosis. This study proposes a Hybrid Prediction Model (HPM), which can provide early prediction of type 2 diabetes (T2D) and hypertension based on input risk-factors from individuals. The proposed HPM consists of Density-based Spatial Clustering of Applications with Noise (DBSCAN)-based outlier detection to remove the outlier data, Synthetic Minority Over-Sampling Technique (SMOTE) to balance the distribution of class, and Random Forest (RF) to classify the diseases. Three benchmark datasets were utilized to predict the risk of diabetes and hypertension at the initial stage. The result showed that by integrating DBSCAN-based outlier detection, SMOTE, and RF, diabetes and hypertension could be successfully predicted. The proposed HPM provided the best performance result as compared to other models for predicting diabetes as well as hypertension. Furthermore, our study has demonstrated that the proposed HPM can be applied in real cases in the IoT-based Health-care Monitoring System, so that the input risk-factors from end-user android application can be stored and analyzed in a secure remote server. The prediction result from the proposed HPM can be accessed by users through an Android application; thus, it is expected to provide an effective way to find the risk of diabetes and hypertension at the initial stage. View Full-Text
Keywords: type 2 diabetes; hypertension; classification; DBSCAN; SMOTE; Random Forest; Internet of Things type 2 diabetes; hypertension; classification; DBSCAN; SMOTE; Random Forest; Internet of Things
Figures

Figure 1

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. (CC BY 4.0).

Share & Cite This Article

MDPI and ACS Style

Ijaz, M.F.; Alfian, G.; Syafrudin, M.; Rhee, J. Hybrid Prediction Model for Type 2 Diabetes and Hypertension Using DBSCAN-Based Outlier Detection, Synthetic Minority Over Sampling Technique (SMOTE), and Random Forest. Appl. Sci. 2018, 8, 1325.

Show more citation formats Show less citations formats

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Related Articles

Article Metrics

Article Access Statistics

1

Comments

[Return to top]
Appl. Sci. EISSN 2076-3417 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top