Next Article in Journal
Visual Object Tracking Robust to Illumination Variation Based on Hyperline Clustering
Next Article in Special Issue
Full Support for Efficiently Mining Multi-Perspective Declarative Constraints from Process Logs
Previous Article in Journal
A Signal Period Detection Algorithm Based on Morphological Self-Complementary Top-Hat Transform and AMDF
Previous Article in Special Issue
Modeling and Visualizing Smart City Mobility Business Ecosystems: Insights from a Case Study
Article Menu

Export Article

Open AccessArticle
Information 2019, 10(1), 25;

Identifying a Medical Department Based on Unstructured Data: A Big Data Application in Healthcare

Indian Institute of Technology Bhilai, Raipur-492015, India
Indian Institute of Technology Kanpur, Kanpur-208016, India
IBM United Kingdom Limited, London SE1 9PZ, UK
Author to whom correspondence should be addressed.
Received: 5 December 2018 / Revised: 3 January 2019 / Accepted: 9 January 2019 / Published: 11 January 2019
PDF [486 KB, uploaded 17 January 2019]


Health is an individual’s most precious asset and healthcare is one of the vehicles for preserving it. The Indian government’s spend on healthcare system is relatively low (1.2% of GDP). Consequently, Secondary and Tertiary government healthcare centers in India (that are presumed to be of above average ratings) are always crowded. In Tertiary healthcare centers, like the All India Institute of Medical Science (AIIMS), patients are often unable to articulate their problems correctly to the healthcare center’s reception staff, so that these patients to be directed to the correct healthcare department. In this paper, we propose a system that will scan prescriptions, referral letters and medical diagnostic reports of a patient, process the input using OCR (Optical Character Recognition) engines, coupled with image processing tools, to direct the patient to the most relevant department. We have implemented and tested parts of this system wherein a patient enters his symptoms and/or provisional diagnosis; the system suggests a department based on this user input. Our system suggests the correct department 70.19% of the time. On further investigation, we found that one particular department of the hospital was over-represented. We eliminated the department from the data and performance of the system improved to 92.7%. Our system presently makes its suggestions using random forest algorithm that has been trained using two information repositories-symptoms and disease data, functional description of each medical department. It is our informed assumption that, once we have incorporated medicine information and diagnostics imaging data to train the system; and the complete medical history of the patient, performance of the system will improve further. View Full-Text
Keywords: healthcare; big data; unstructured data; tertiary healthcare healthcare; big data; unstructured data; tertiary healthcare

Figure 1

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

Share & Cite This Article

MDPI and ACS Style

Bansal, V.; Poddar, A.; Ghosh-Roy, R. Identifying a Medical Department Based on Unstructured Data: A Big Data Application in Healthcare. Information 2019, 10, 25.

Show more citation formats Show less citations formats

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Related Articles

Article Metrics

Article Access Statistics



[Return to top]
Information EISSN 2078-2489 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top