You are currently viewing a new version of our website. To view the old version click .
  • Article
  • Open Access

3 March 2014

Health Recommender Systems: Concepts, Requirements, Technical Basics and Challenges

and
Department of Medical Informatics, Heilbronn University, Max-Planck-Str. 39, Heilbronn 74081, Germany
*
Author to whom correspondence should be addressed.
This article belongs to the Special Issue Public Health Informatics

Abstract

During the last decades huge amounts of data have been collected in clinical databases representing patients' health states (e.g., as laboratory results, treatment plans, medical reports). Hence, digital information available for patient-oriented decision making has increased drastically but is often scattered across different sites. As as solution, personal health record systems (PHRS) are meant to centralize an individual's health data and to allow access for the owner as well as for authorized health professionals. Yet, expert-oriented language, complex interrelations of medical facts and information overload in general pose major obstacles for patients to understand their own record and to draw adequate conclusions. In this context, recommender systems may supply patients with additional laymen-friendly information helping to better comprehend their health status as represented by their record. However, such systems must be adapted to cope with the specific requirements in the health domain in order to deliver highly relevant information for patients. They are referred to as health recommender systems (HRS). In this article we give an introduction to health recommender systems and explain why they are a useful enhancement to PHR solutions. Basic concepts and scenarios are discussed and a first implementation is presented. In addition, we outline an evaluation approach for such a system, which is supported by medical experts. The construction of a test collection for case-related recommendations is described. Finally, challenges and open issues are discussed.

1. Introduction

Increasing health information needs and changes in information seeking behavior can be observed around the globe [1]. According to recent studies 81% of U.S. adults use the Internet and 59% say they have looked online for health information regarding diseases, diagnoses and different treatments [2]. Such effects influence the patient-physician relationship as educated patients raise questions or discuss treatment options [3,4,5]. Thus, patients tend to become active participants in the decision-making process. This change in the way of thinking is often referred to as patient empowerment [6,7].

However, information overload and irrelevant information are major obstacles for drawing conclusions on the personal health status and taking adequate actions [8]. Faced with a large amount of medical information on different channels (e.g., news sites, web forums, etc.) users often get lost or feel uncertain when investigating on their own. In addition, a manifold and heterogeneous medical vocabulary poses another barrier for laymen [9]. Therefore, improved personalized delivery of medical content can support users in finding relevant information [10,11,12].

Medical information available for patient-oriented decision making has increased drastically but is often scattered across different sites [13]. As as solution, personal health record systems (PHRS) are meant to centralize an individual's health data and to allow access for the owner as well as for authorized health professionals [14].

Recommender systems (RS) suggest items of interest to users of information systems or e-business systems and have evolved in recent decades. A typical and well known example is Amazon's suggest service for products. We believe the idea behind recommender systems can be adapted to cope with the special requirements of the health domain.

1.1. Outline of This Article

This paper contributes to the current state of research by discussing major concepts and challenges revolving around health recommender systems: In following section we give an informal definition of the term “health recommender system” followed by the presentation of an integrated system architecture, which relies on the existence of a PHRS. Moreover, we distinguish usage scenarios for two major user groups (i.e., laymen and health professionals). In Section 2, we give an overview of the many areas that affect our contribution ranging from research in the field of health information seeking via foundations in computer science over to related systems of other researchers. Section 3 details on the requirements to be met for good recommendation quality—many of them incurring natural language processing of PHR entries, which often consist of semi-structured text. The section also introduces our prototypical implementation with its major components as well as its related processing steps. Section 4 describes our evaluation approach: As a basic step we must obtain a gold standard for optimal recommendations under a fixed setting. This will be done by conducting a specialized online survey amongst medical experts. We outline the design of a statistical test in order to compare results of our prototype to those of a naive implementation and to the expert's recommendations. Since the gold standard is not yet finished, Section 5 presents early results for typical recommendation cases. The examples indicate that our system may indeed return relevant recommendations for potential users. The paper closes with a conclusion and a reference to future work.

1.2. Definition & Typical Scenarios

A health recommender systems (HRS) is a specialization of an RS as defined by Ricci et al. ([15], p. 28). In the context of an HRS, a recommendable item of interest is a piece of non-confidential, scientifically proven or at least generally accepted medical information, which in itself is not linked to an individual's medical history. However, an HRS's suggestions are driven by individualized health data such as documented in a personal health record (PHR). According to [16] this source of information is considered the user profile of an recommender system.

The goal of an HRS is to supply it's user with medical information which is meant to be highly relevant to the medical development of the patient associated with that PHR. Related medical information may be recommended to health professionals who work on or with the given PHR but also it may be recommended to laymen inspecting their own PHR. Depending on a user's medical expertise an HRS should suggest medical information, which is comprehendable to that user.

For a successful integration into any health related information system, it is important to consider the system context of an HRS. As depicted in Figure 1, a profile-based HRS component is implemented as an extension of an existing PHR system. Data entries in a PHR database (DB) constitute the medical history of a PHR owner. Supplied with medical facts, an HRS computes a set of potentially relevant items of interest for a target user (e.g., a PHR owner or an authorized health professional). Such items originate from trustworthy health knowledge repositories and may be displayed while he/she inspects the PHR online.

Figure 1. System context of an HRS-enabled PHR system.

Thus, it is possible to compute and deliver potentially relevant information items from trusted health related knowledge repositories. Depending on the expert level of a PHR user, at least two separate use cases can be defined as follows:

  • Use case A = Health professional as end-user

    In this scenario an HRS is used by a health professional to retrieve additional information for a certain case. For instance, related clinical guidelines or research articles from Pubmed (see: http://www.ncbi.nlm.nih.gov/pubmed) can be computed automatically (see Section 2.1). This form of case-related information enrichment might support a physician with the process of clinical diagnostics as latest research results can be used for decision support.

    In addition, laymen-friendly documents can also be retrieved for the purpose of a direct handout (i.e., as a printout) to a patient when he or she is in a doctor's office for consultation. Thus, a user can be supplied with high quality information to cope with a certain disease or adapt his or her lifestyle habits.

  • Use case B = Patient as end-user

    In this scenario a layperson interacts with a HRS-enabled PHR without direct support by a physician. The system computes laymen-friendly content according to the person's longterm individual medical history. The relevant items are presented within the PHR system's user interface. By selecting the highest ranking documents or media content a patient is empowered in terms of health information acquisition. Thereby, the risk to retrieve “incomplete, misleading and inaccurate” content via famous search engines could be lowered (see Section 2.2), as we intend to recommend only evidence based (i.e., high quality) health related content to end-users.

    Both scenarios intend to lower the effects of information overload which originate from the increasing amount of health related data [8,13]. For this article we focus on the patient-centered scenario. Hence, our first prototype of an HRS implementation will suggest laymen-friendly content only, i.e., it realizes use case B.

3. Concepts

Besides just managing medical and health related data, an HRS-enabled PHR system offers specific information to its users that fits their needs and interests. This can be achieved on the basis of a user's PHR and further profile information, which may have been collected during the user's interaction with the system. As the intention of our approach is to recommend individually tailored health information, an HRS implementation feeds a PHR's UI with a highly specific list of documents relevant to a PHR user's medical history. Hence, one should discuss what data is typically stored about users in a PHR system:

  • explicit medical data about a PHR system user (e.g., current medication, care plans, surgery reports, or discharge letters, etc.) such as included in his/her PHR entries

  • terms gathered by the PHR system due to user-initiated searches and queries (e.g., “symptoms myocardial infarction”, “medication flu”)

  • user interaction statistics (i.e., click behavior, duration of page visits, rating of read articles, etc.)

In order to enable a viable HRS solution, all of these data may be considered for a recommendation procedure, since they reflect an individual's historical and current health status and potentially his specific health interests. However, so far our suggested approach only considers data according to (1), because it is of predominant importance with respect to a user's information interest.

3.1. Challenges & Requirements

When integrating an HRS into a PHR system certain requirements need to be addressed in order to optimize the recommendation:

  • The recommendation process must be able to cope with

    (a)

    imprecise terms (e.g., Hepatitis ⇔ chronic viral Hepatitis),

    (b)

    colloquial terms (e.g., Period ⇔ Menstruation) and

    (c)

    misspellings (e.g., Diabedes or Diabedis ⇔ Diabetes mellitus).

  • The system must deal with expert vocabulary primarily used by physicians and other health professionals (e.g., Hypo-insulinism ⇔ Diabetes mellitus).

  • The system needs to detect whether clinical conditions mentioned in clinical reports are negated. In particular, medical facts (i.e., terms) occur in conjunction with abbreviations frequently used by physicians. In this context, a negation detection algorithm must cope with various negation patterns, for instance:

    (a)

    “Patient's third ECG and subsequent ECG's show no signs of STEMI” ⇔ excluded term ‘STEMI’

    (b)

    “Autoimmune retinopathy in the absence of cancer” ⇔ excluded term ‘cancer’

    (c)

    “Patient suffers from snoring. Analysis did not provide evidence of chronic sleep apnea ⇔ excluded term ‘chronic sleep apnea’

  • The confidentiality of PHR user data must be guaranteed under all circumstances. Even administrators of PHR systems must not gain any sort of insight into these data.

  • It must also be capable to cope with a new localization of the system, i.e., a change of the user language must be possible without having to manually rework underlying language resources such as text corpora or localized ontologies.

Data entries of medical records are frequently stored as unstructured plain text. This creates further difficulties for IR term matching approaches. A HRS must also recognize expert vocabulary (i.e., common medical abbreviations) and classification system codes primarily used by physicians and other health professionals, such as

  • STEMI ⇔ myocardial infarction

  • I21 or I22 (ICD-10) for ‘myocardial infarction’

Such obstacles can result in less specific recommendations when integrating classic IR approaches into electronic/personal health record systems. Therefore, our approach uses semantic query expansion techniques (see Section 2.3.1) to enrich concept terms found in health record entries to reformulate any query.

Another requirement arises from the classification of health information artifacts in terms of laymen-friendliness: Depending on a patient's background knowledge and his/her ability to understand expert documents (or in other words the expert level of a system user) an HRS should be capable of pre-filtering medical information artifacts which might be too difficult to understand for the target reader. This also reduces the number of recommendable candidate documents which have to be processed further and therefore increases computational efficiency.

We address this text categorization problem by training and employing a so-called support vector machine (SVM) in a similar fashion as described in [74]. SVMs originate from the field of machine learning in computer science and are known to perform well even for difficult classification tasks [75]. As basis to use a SVM on text, related documents are first transformed into document vectors according to the vector space model (see Section 2.3.1). On the basis of a larger training set the SVM “learns” to distinguish between vectors of laymen-friendly documents and vectors of expert documents. After the training phase, the classifier can be applied to new documents in order to predict their laymen-friendliness with high accuracy.

3.2. Basic HRS Architecture

Our approach tries to recommend personalized health information artifacts to be displayed as part of the PHR system's graphical user interface after the user has logged in to the system. In this context, related artifacts can be pre-computed in the background (e.g., once a day by a batch processing job which runs on a regular basis without any user intervention). Thus, an HRS component can deal with increasing amounts of PHR entries.

By means of linguistic preprocessing, we obtain an extract Q′ of the user's PHR data Q as input to the relevance processing step (see Figure 2).

The extract Q′ consist of terms {q1,…, qk} originating from the user's PHR data entries but also potentially additional terms as added by semantic query expansion. It is meant to represent the user's medical information interest and serves as a user profile according to Section 2.3.2. Given a set of possible recommendation items R = {r1,…, rn} (i.e., health information artifacts) it is our aim to select those elements in R that match best against Q′. Thus, a set of relevant recommendations SR is computed.

Figure 2. Basic architecture of a proposed HRS. It interacts with a PHR system's database to obtain medical facts to compute individual relevance on the basis of G.

3.3. Refined HRS Architecture

We implemented an HRS prototype as an extension to a PHR system in Java. For this purpose, we introduced a modular structure, as depicted in Figure 3.

Figure 3. System structure and processing workflow of our HRS prototype and attached data sources. A PHR system feeds in data elements via Q. The process yields a set of recommendable items S which are highly relevant to a PHR user.

At first the HRS prototype connects to one or more knowledge repositories ➀ containing various health information artifacts which correspond to R. During the next step it creates or updates a so-called inverted index related to all attached knowledge repositories ➁, i.e., it uses indexing techniques from the field of IR (provided by the Apache Lucene framework).

Thereafter, our HRS implementation computes a list of relevant health information artifacts. Q which contains a PHR owner's data entries is used as input to the recommendation process. In ➂ the linguistic query processing module uses a negation detection algorithm that detects whether medical terms in Q are present in a negated context. In case negations are found, such terms are removed from Q and thus a reduction of terms is achieved. Elements in Q might contain ambiguous or misspelled terms not contained in G. Therefore, the query processing module performs additional preprocessing on Q. In particular, we can rely on Levensthein spelling correction and n-gram approximate matching to find equivalent terms which have a semantic context in G.

Step ➃ applies semantic query expansion (see Section 2.3.1.) by means of the Health Graph G. Access to G is provided via the module information management as presented in Figure 3. It selects k-nearest neighbours, i.e., nodes connected to a given qi at a distance of k. Thereby, a semantically enriched set of terms Q′Q is gained. As context resolution can yield a high number of semantically associated nodes, a module concerned with query optimization reduces the number of terms in Q′ to a fixed upper limit for simplicity ➄. Relevant concept nodes are selected according to their related edge types. We prefer edge types REDIRECT and CODE to CATEGORY and ARTICLE, as they are assumed to have a stronger semantic relationship. The result of the module query optimization is a query string formulated according to Lucene's query parser syntax.

Finally, Q′ is submitted to the index search module ➅ which retrieves relevant artifacts via the inverted index. The result is a ranked list of recommended documents S which is likely to match the health information needs of a PHR system owner.

4. Evaluation Approach

The need to evaluate the quality of the proposed approach and a related implementation, mandates a controlled experiment in which a sophisticated system is compared to a naive implementation via a test collection (gold standard). In this section we describe an approach to evaluate such a system via a gold standard in a controlled study involving real world cardiologic cases.

In an initial phase, health professionals will have to assess the relevance of potential matches for a certain clinical case ci. During this phase a gold standard of human expert recommendations is obtained. The second phase of the evaluation involves at least two implementations of an HRS component:

(1)

A naive HRS implementation based on well known but basic techniques of IR, in particular on a standard implementation of the VSM using the Apache Lucene library. Thus, document sets DVSM are collected.

(2)

An advanced HRS implementation which uses query expansion techniques via G and features such as negation recognition according to Section 3.3. Thus, document sets DHRS are collected.

Both implementations will compete against each other in matching the recommendations made by the human experts. Thereby, the retrieval precision ρ of both systems will be measured and compared against each other. A statistical test will then reveal which system performed better in computing the ideal set of documents from the content repository Rc. First simulation results indicate that there might be an improvement provided by our advanced HRS implementation, as presented in Section 5.1.

4.1. Web-Based Assessment System

We implemented a browser-based assessment system, as depicted in Figure 4. In this setting, a cardiologic case ciC is presented (A) on the left.

Figure 4. The main view of our HRS assessment system. Health professionals select matching items based on their expertise. To the left (A): The cardiologic case (here: NSTEMI/myocardial infarction)—To the right (B): Current selections made by an expert (here: ‘Heart attack’ and ‘Sudden cardiac death’).

Every physician selects items from a candidate list (B), displayed on the right of the current case. For every candidate item a preview and a full text version is available in the box below the list of items (C). Finally, if the health professional has made his/her selection the current case ci is closed (D), selected expert documents are persisted and the next case is presented. Thereby, a set of expert recommendations DG for all ci is compiled.

4.2. Data Sources

On the one hand, about 27,000 fully anonymized, real-world discharge letters (provided by the Heidelberg University Hospital) are available in our letter repository Rl. The letters revolve around the field of cardiology and typically contain text sections including anamnesis, diagnoses, laboratory results, outcomes of procedures and recommended medication. These medical facts represent the foundation for our experiment, as they provide semi-structured text data which has to be processed by any HRS component.

On the other hand, the medical content (i.e., health information artifacts) to be rated by the group of physicians is provided by the German Institute for Quality and Efficiency in Health Care IQWiG (see: http://www.iqwig.de). It is an independent publisher of evidence-based consumer health and patient information.

This collection of documents comprises a total of about 800 health information artifacts written especially for laymen. It is guaranteed to be of high quality and the result of evidence-based medicine. A subset of 75 documents which are relevant for the field of cardiology is presented to the participants of the study.

4.3. Setting & Statistical Test

During the next phase of the evaluation (see Section 4), both implementations will compute matching documents pairwise, i.e., for every ciRl we will compute the sets DHRS,ci and DVSM,ci . Thereafter, we evaluate the retrieval precision ρ of both approaches by computing:

ρ H = 1 | R l | c i | D r e l , c i D HRS , c i | | D HRS , c i |
ρ V = 1 | R l | c i | D rel , c i D VSM , c i | | D VSM , c i |
in which Drel,ci represents relevant items, i.e., it contains only elements also contained in DG,ci which originate from recommendations of the test collection previously obtained.

Rather than recommending all relevant documents it is sufficient to deliver only few yet highly relevant ones. Thus, as opposed to typical evaluations in IR, we do not consider the fraction of relevant documents that are retrieved (known as recall in field of IR).

As we will evaluate a large number of cases (i.e., nc ≥ 100) a normal distribution can be assumed for this setting. To test if our HRS implementation approach outperforms the basic VSM implementation we postulate: ρH > ρV. Hence, a dependent t-Test for paired samples can be formulated for a one-sided case:

H 0 : ρ H ρ V , H a : ρ H > ρ V
with a significance level of α = 0.05 and a target power of 1 − β = 0.8. Equation (3) can be reformulated to:
H 0 : μ H μ V ω 0 , H a : μ H μ V > ω 0
given that μH, μV are the mean values of ρH, ρV. ω0 represents the expected effect value (i.e., the change in terms of retrieval precision). For the time being we assume an effect size of:
ω 0 = Δ ρ = ρ H ρ V = 0.1
i.e., an improvement in retrieval precision of 10% is expected.

4.4. Sample Size Estimation & Recruitment

A pre-study sample size calculation for ω0 = 0.1 indicates that a total of nc = 620 cases (for ω0 = 0.15 → nc = 277 and ω0 = 0.2 → nc = 156 accordingly) is needed to ensure that a target power of 1 − β = 0.8 is achieved.

Yet, there is a certain risk for the data collection being biased (e.g., loss of interest of participants, different levels of expertise, physicians being pressed for time, etc.) during phase one of the evaluation. This is especially true if every evaluation case ci would be assessed by just a single physician. In order to prevent any sort of such biases we plan to assess every ci at least with an interjudge agreement factor j of 2. Thus, the required number of physicians is obtained:

n p = n c * j c p
As a consequence, we have to recruit at least np = 25 physicians to achieve the target power of 0.8, for a modest ω0 = 0.1 and the number of cases ci assessed per participant cp = 50.

5. Results & Discussion

5.1. Initial Investigation on Recommendation Quality

Although, the above-described study is not yet completed we found that many phenomena which limit exact term matching (as outlined in Section 3.1.) are indeed present in our collection of discharge letters. We classified health professional language into four categories: (a) abbreviations; (b) expert terms; (c) colloquial terms; (d) disease codes.

Table 1 shows a comparison between our advanced HRS implementation and the naive HRS implementation. Each recommended information artifact is ranked according to its normalized score. Note well, that two scores from the two different approaches must not be compared with each other. This originates from the fact, that for our HRS approach Q is semantically enriched with a large amount of related terms (see Section 3.3, step ➃).

We used some of the most frequent terms contained in the discharge letters (mentioned in Section 4.2) as input query to compute recommendations. These examples of health professional language are chosen as follows: (a) NSTEMInon-ST elevation MI; (b) palpitations; (c) Zuckerkrankheit (German term for Diabetes mellitus, frequently used by laymen); (d) I21Acute MI (ICD-10).

The semantically enhanced HRS approach computes four best matching items within every category. By contrast, the set of recommendations computed by the naive approach contains no elements for category a and d at all. If elements are found (category b and c) the relative ranking order varies from the order computed by our HRS implementation:

Table 1. Listing of top-4 recommendations: Comparison of recommended information artifacts computed by our advanced HRS implementation and a naive HRS implementation based Apache Lucene. Scores normalized to [0..1].
Table 1. Listing of top-4 recommendations: Comparison of recommended information artifacts computed by our advanced HRS implementation and a naive HRS implementation based Apache Lucene. Scores normalized to [0..1].
Category (a) abbreviations—primary query term: NSTEMI
RankArtifacts (Advanced HRS)ScoreArtifacts (Naive HRS)Score
1Coronary disease1.0No results at all-
2Sudden cardiac death0.915--
3Arteriosclerosis0.868--
4Myocardial infarction0.818--
Category (b) expert vocabulary—primary query term: Palpitations

RankArtifacts (Advanced HRS)ScoreArtifacts (Naive HRS)Score
1Palpitation1.0Myocarditis1.0
2Tachycardia1.0Inflammation of heart muscle1.0
3Cardiac dysrhythmia0.822--
4Myocarditis0.802--
Category (c) colloquial terms—primary query term: Zuckerkrankheit

RankArtifacts (Advanced HRS)ScoreArtifacts (Naive HRS)Score

1Zuckerkrankheit1.0Thirst1.0
2Diabetes mellitus1.0Angular cheilitis0.66
3Prader Willi syndrome0.573Prader Willi syndrome0.583
4Gestational diabetes0.448Otitis externa0.583
Category (d) disease codes (icd-10)—primary query term: I21
RankArtifact (Advanced HRS)ScoreArtifact (Naive HRS)Score
1Sudden cardiac death1.0No results at all-
2Coronary disease0.836--
3Arteriosclerosis0.774--
4Atrial fibrillation0.762--

A comparison of the result sets (category b and c) indicates that our HRS offers more relevant results. Apparently, the health information artifact ‘Palpitation’ is not recommended at all by the naive VSM approach, though it is obviously one of the best items that helps to understand ‘palpitations’. This is because no health information artifact contained an entry for the exact term ‘palpitations’ in the inverted index. By contrast, our proposed prototype ranks this artifact as its top recommendation along with ‘Tachycardia’, which is a valid medical equivalent. This is made possible by semantic query expansion implemented via the Health Graph G.

Category (d) illustrates why a regular full-text search is limited. By means of G our prototype determines the context of the disease code. Although this disease code never appears in any of the laymen compatible information artifacts, our HRS approach is capable of computing matching artifacts. The naive approach recommends no artifacts at all.

Still, the results presented in Table 1 are only an initial investigation. For this reason, we are conducting the assessment study as outlined in Section 4 at the time of writing.

5.2. Limitations of the Study

Unfortunately, there is some uncertainty which results from a yet unknown effect size, i.e., we cannot exactly determine what improvement in retrieval precision can be expected. As this effect size influences the absolute number of cases to be evaluated, this problem makes any prior estimation difficult.

The latter poses a risk to the success of the study, as there might be some dropouts during the recruitment phase already. Consequently, a large effort has to be put into the recruitment phase of physicians which tackle the amount of work to be done. However, there is a chance for an early recalculation of the sample size during phase one if the actual effect size of ω0 should be higher than initially estimated, e.g., ω0 = 0.15. Thereby, less physicians would be needed to take part in the evaluation. Moreover, cp could be lowered or the inter-judge agreement factor j could be increased.

Additionally, a bias might occur as the data in Rl originate from the field of cardiology. This might have an effect on how physicians decide which items are relevant. A high number of well selected participants and an accompanying study manual should compensate for this.

5.3. Open Issues & Challenges

PHR adoption rates are rising slowly [17]. Yet, many open questions and unresolved issues exist before large real life setting is available. Patient as well as care provider engagement will also play a major role, as a PHRS must be user-friendly for all ages and technological backgrounds. Lack of education in medical terminology and technology skepticism might cause patients to avoid using PHR systems. In particular, concerns about personal health record privacy may be a key factor for the success of (mobile) PHR systems.

PHR systems need well defined international standards to proliferate. Unfortunately, available solutions lack interoperability and inter-system communication is still a future vision. A standardized access interface to PHR entries is crucial for our proposed approach of embedding a health recommender system into PHR solutions. A standardized PHR data core including patient and provider identification, basic medications and diagnoses, insurance information, allergies etc. would help to avoid the new user problem of recommender systems as described in Section 2.3.2 Well-suited recommendations, might keep PHR users and care-givers motivated in keeping PHR systems up-to-date. In turn, this may improve recommendation quality because recommendations would be based on highly accurate user profiles.

An open problem for HRSs is to select those entries from the PHR which lead to highly relevant recommendations. In particular, PHR entries referring to a patient's past diseases may be irrelevant or, on the contrary, may well affect the patient's current and future health situation. An HRS should be enabled to assess whether diseases described or encoded in a record entry have long term effects for the patient. e.g., an acute disease like cold that happened a long time ago, can probably be ignored for recommendations in the present. On the other hand, chronic diseases like the diagnosis of diabetes affects a patient at any time and should most likely be considered for recommendations in the present as well as in the future.

6. Conclusions

This article has given a general motivation, why there is a need for context-based, individually tailored health information in personal health records. Satisfying this need will help to put patients in control of their own health data and therefore increase patients' autonomy.

An approach of integrating recommender systems into personal health records—termed health recommender system (HRS)—was outlined. A first definition of an HRS in the context of personal health record systems was presented. System requirements for such a software component were discussed. We introduced the so-called Health Graph G—a graph-based data structure of health related concepts extracted from information included in Wikipedia. An HRS prototype which acts as an extension to a PHR system was presented. Given medical facts from PHR data entries, it makes use of techniques like negation detection, spell-correction and semantic query expansion via G. Early results indicate an improvement compared to naive approaches from the field of IR.

To evaluate the quality of the proposed approach we designed a controlled experiment in which an advanced HRS implementation is compared against a naive HRS implementation by means of a test collection (gold standard). In a first phase we make use of a group of physicians to develop the test collection. For this purpose, we implemented a web-based assessment system which helps the experts to select laymen-friendly, recommendable documents matching a particular medical case.

A statistical method for sample size calculation was used to estimate how many medical cases should be evaluated to achieve a fixed target power. For a large number of cases, the retrieval precision is computed and a dependent t-Test for paired samples will be applied. As a test hypothesis we assume that the retrieval precision of the advanced HRS implementation is significantly higher than the one of the naive HRS implementation. Still, the gold standard does not yet exist—we are currently conducting a recruitment of study participants at different universities and hospitals in Germany and Switzerland.

At the moment, we have only access to a German set of discharge letters and German health professionals; however, a second study based on English PHR documents and a related test collection would desirable to validate the results of the first study.

Acknowledgments

The authors would like to thank Monika Pobiruchin of Heilbronn University for her feedback and valuable inputs to the work. This work benefited from comments provided by three anonymous reviewers.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Gavgani, V.Z. Health Information Need and Seeking Behavior of Patients in Developing Countries' Context; an Iranian Experience. Proceedings of the 1st ACM International Health Informatics Symposium (IHI '10), ACM, New York, New York, NY, USA, 11–12 November 2010; pp. 575–579.
  2. Pew Internet & American Life project. Health Online 2013 81 % of U.S. Adults Use the Internet and 59% Say They Have Looked Online for Health Information. Available online: http://www.pewinternet.org/Reports/2013/Health-online.aspx (accessed on 30 January 2014).
  3. Gerber, B.S.; Eiser, A.R. The patient-physician relationship in the Internet age: Future prospects and the research agenda. J. Med. Internet Res. 2001, 3. [Google Scholar] [CrossRef]
  4. Kivits, J. Informed patients and the Internet. A mediated context for consultations with health professionals. J. Health Psychol. 2006, 11, 269–282. [Google Scholar] [CrossRef]
  5. McMullan, M. Patients using the Internet to obtain health information: How this affects the patient–health professional relationship. Patient Educ. Counsel. 2006, 63, 24–28. [Google Scholar] [CrossRef]
  6. Ueckert, F.; Goerz, M.; Ataian, M.; Tessmann, S.; Prokosch, H.U. Empowerment of patients and communication with health care professionals through an electronic health record. Int. J. Med. Inf. 2003, 70, 99–108. [Google Scholar] [CrossRef]
  7. Anderson, R.M.; Funnell, M.M. Patient empowerment: Reflections on the challenge of fostering the adoption of a new paradigm. Patient Educ. Counsel. 2005, 57, 153–157. [Google Scholar] [CrossRef]
  8. Sommerhalder, K.; Abraham, A.; Zufferey, M.C.; Barth, J.; Abel, T. Internet information and medical consultations: Experiences from patients and physicians' perspectives. Patient Educ. Counsel. 2009, 77, 266–271. [Google Scholar] [CrossRef]
  9. Hardey, M. Doctor in the house: The Internet as a source of lay health knowledge and the challenge to expertise. Sociol. Health Illness 1999, 21, 820–835. [Google Scholar] [CrossRef]
  10. Swan, M. Emerging patient-driven health care models: An examination of health social networks, consumer personalized medicine and quantified self-tracking. Int. J. Environ. Res. Public Health 2009, 6, 492–525. [Google Scholar] [CrossRef]
  11. Roitman, H.; Yossi, M.; Yevgenia, T.; Yonatan, M. Increasing Patient Safety Using Explanation-driven Personalized Content Recommendation. Proceedings of the 1st ACM International Health Informatics Symposium (IHI '10), ACM, New York, NY, USA, 11–12 November 2010; pp. 430–434.
  12. Agarwal, D.; Chen, B.; Elango, P.; Ramakrishnan, R. Content recommendation on web portals. Commun. ACM 2013, 56, 92–101. [Google Scholar]
  13. Müller, H.; Hanbury, A.; Al Shorbaji, N. Health information search to deal with the exploding amount of health information produced. Methods Inf. Med. 2012, 51, 516. [Google Scholar]
  14. Tang, P.C.; Ash, J.S.; Bates, D.W.; Overhage, J.M.; Sands, D.Z. Personal health records: Definitions, benefits, and strategies for overcoming barriers to adoption. J. Am. Med. Informatics. Assoc. 2006, 13, 121–126. [Google Scholar] [CrossRef]
  15. Kantor, P.B.; Ricci, F.; Rokach, L.; Shapira, B. Recommender Systems Handbook; Springer: Heidelberg, Germany, 2011. [Google Scholar]
  16. Adomavicius, G.; Tuzhilin, A. Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Trans. Knowl. Data Eng. 2005, 17, 734–749. [Google Scholar]
  17. McCann, E. PHR Progress Still Hangs in Limbo-Interoperability Shortcomings Stand in the Way, Researchers Note. Available online: http://www.healthcareitnews.com/news/mobile-personal-health-record-progress-hangs-in-limbo (accessed on 28 January 2014).
  18. Chase, D. Why Google Health Really Failed—It's about the Money. Available online: http://techcrunch.com/2011/06/26/why-google-really-failed-money/ (accessed on 28 January 2014).
  19. Markle Foundation. The Personal Health Working Group: Final Report. Available online: http://www.policyarchive.org/collections/markle/index?section=5&id=15473 (accessed on 25 January 2014).
  20. Health Informatics–Electronic Health Record–Definition, Scope And Context; Standard ISO/TR 20514:2005; International Organization for Standardization: Geneva, Switzerland, 2005.
  21. Jones, D.A.; Shipman, J.P.; Plaut, D.A.; Selden, C.R. Characteristics of personal health records: Findings of the medical library association/national library of medicine joint electronic personal health record task force. J. Med. Lib. Assoc. JMLA 2010, 98, 243–249. [Google Scholar] [CrossRef]
  22. Baird, A.; North, F.; Raghu, T. Personal Health Records (PHR) and the Future of the Physician-Patient Relationship. Proceedings of the 2011 iConference, ACM, New York, NY, USA, 8–11 February 2011; pp. 281–288.
  23. Liu, L.S.; Shih, P.C.; Hayes, G.R. Barriers to the Adoption and Use of Personal Health Record Systems. Proceedings of the 2011 iConference, ACM, New York, NY, USA, 8–11 February 2011; pp. 363–370.
  24. Raisinghani, M.S.; Young, E. Personal health records: Key adoption issues and implications for management. Int. J. Electron. Healthc. 2008, 4, 67–77. [Google Scholar] [CrossRef]
  25. Huba, N.; Zhang, Y. Designing Patient-Centered Personal Health Records (PHRs): Health care professionals perspective on patient-generated data. J. Med. Syst. 2012, 36, 3893–3905. [Google Scholar] [CrossRef]
  26. Huang, Z.; Lu, X.; Duan, H.; Zhao, C. Collaboration-based medical knowledge recommendation. Artif. Intell. Med. 2012, 55, 13–24. [Google Scholar] [CrossRef]
  27. Johnson, J.D.; Case, D.O. Health Information Seeking; Peter Lang Publishing: New York, NY, USA, 2012. [Google Scholar]
  28. Cline, R.J.; Haynes, K.M. Consumer health information seeking on the Internet: The state of the art. Health Educ. Res. 2001, 16, 671–692. [Google Scholar] [CrossRef]
  29. Niederdeppe, J.; Hornik, R.C.; Kelly, B.J.; Frosch, D.L.; Romantan, A.; Stevens, R.S.; Barg, F.K.; Weiner, J.L.; Schwartz, J.S. Examining the dimensions of cancer-related information seeking and scanning behavior. Health Commun. 2007, 22, 153–167. [Google Scholar] [CrossRef]
  30. Powell, J.; Inglis, N.; Ronnie, J.; Large, S. The characteristics and motivations of online health information seekers: Cross-sectional survey and qualitative interview study. J. Med. Internet Res. 2011, 13. [Google Scholar] [CrossRef]
  31. Eysenbach, G. Design and Evaluation of Consumer Health Information Web Sites. Consum. Health Inform. 2005. [Google Scholar] [CrossRef]
  32. Eysenbach, G.; Köhler, C. How do consumers search for and appraise health information on the world wide web? Qualitative study using focus groups, usability tests, and in-depth interviews. Br. Med. J. 2002, 324, 573. [Google Scholar] [CrossRef]
  33. Wilson, T.D. Information overload: Implications for healthcare services. Health Inf. J. 2001, 7, 112–117. [Google Scholar] [CrossRef]
  34. Silberg, W.M.; Lundberg, G.D.; Musacchio, R.A. Assessing, controlling, and assuring the quality of medical information on the Internet. J. Am. Med. Assoc. 1997, 277, 1244–1245. [Google Scholar]
  35. Pandolfini, C.; Impicciatore, P.; Bonati, M. Parents on the web: Risks for quality management of cough in children. Pediatrics 2000, 105, e1:1–e1:8. [Google Scholar]
  36. Sonnenberg, F.A. Health information on the Internet: Opportunities and pitfalls. Arch. Internal Med. 1997, 157, 151. [Google Scholar] [CrossRef]
  37. HON - Health on the Net Foundation. HONs Fourth Survey on the Use of the Internet for Medical & Health Purposes. Available online: http://www.hon.ch/Survey/ResumeApr99.html (accessed on 17 January 2014).
  38. HON-Health on the Net Foundation. The HON Code of Conduct for Medical and Health Web Sites (HONcode). Available online: http://www.hon.ch/HONcode/Patients/Visitor/visitor.html (accessed on 24 January 2014).
  39. Grossman, D.; Frieder, O. Information Retrieval: Algorithms and Heuristics (The Information Retrieval Series), 2nd ed.; Springer: Heidelberg, Germany, 2004. [Google Scholar]
  40. Baldi, P.; Frasconi, P.; Smyth, P. Modelling the Internet and the Web: Probabilistic Methods and Algorithms; Wiley & Sons: East Grinstead, UK, 2003; Volume 1. [Google Scholar]
  41. Salton, G.; Wong, A.; Yang, C.S. A vector space model for automatic indexing. Commun. ACM 1975, 18, 613–620. [Google Scholar]
  42. Lovins, J.B. Development of a stemming algorithm. Mech. Transl. Comput. Linguist. 1968, 11, 22–31. [Google Scholar]
  43. Porter, M. An algorithm for suffix stripping. Program 1980, 14, 130–137. [Google Scholar]
  44. Ahmed, F.; Nürnberger, A.; Luca, E.W.D. Revised N-gram based automatic spelling correction tool to improve retrieval effectiveness. Res. J. Comput. Sci. Comput. Eng. Appl. (Polibits) 2009, 40, 39–48. [Google Scholar]
  45. Munir, K. others. Ontology Assisted Query Reformulation Using the Semantic and Assertion Capabilities of OWL-DL Ontologies. Proceedings of the 2008 International Symposuim on DB Engineering Applications (IDEAS '08), ACM, New York, NY, USA; 2008; pp. 81–90. [Google Scholar]
  46. Qiu, Y.; Frei, H.P. Concept Based Query Expansion. Proceedings of the 16th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '93), ACM, New York, NY, USA; 1993; pp. 160–169. [Google Scholar]
  47. Resnick, P.; Varian, H. Recommender systems. Commun. ACM 1997, 40, 56–58. [Google Scholar]
  48. Felfernig, A.; Jeran, M.; Ninaus, G.; Reinfrank, F.; Reiterer, S. Toward the Next Generation of Recommender Systems: Applications and Research Challenges. In Multimedia Services in Intelligent Environments; Springer: Heidelberg, Germany, 2013; pp. 81–98. [Google Scholar]
  49. Eysenbach, G.; Jadad, A.R. Evidence-based patient choice and consumer health informatics in the Internet age. J. Med. Internet Res. 2001, 3, e19. [Google Scholar] [CrossRef]
  50. Zhang, Y. Contextualizing Consumer Health Information Searching: An Analysis of Questions in a Social Q&A Community. Proceedings of the 1st ACM International Health Informatics Symposium (IHI '10), ACM, New York, NY, USA, 11–12 November 2010; pp. 210–219.
  51. Wendel, S.; Dellaert, B.G.; Ronteltap, A.; van Trijp, H.C. Consumers intention to use health recommendation systems to receive personalized nutrition advice. BMC Health Serv. Res. 2013, 13, 126. [Google Scholar] [CrossRef]
  52. Fernandez-Luque, L.; Karlsen, R.; Vognild, L.K. Challenges and opportunities of using recommender systems for personalized health education. Stud. Health Technol. Inform. 2009, 150, 903–907. [Google Scholar] [CrossRef]
  53. Fernandez-Luque, L.; Karlsen, R.; Melton, G.B. HealthTrust: A social network approach for retrieving online health videos. J. Med. Internet Res. 2012, 14, e22. [Google Scholar] [CrossRef]
  54. Morrell, T.G.; Kerschberg, L. Personal Health Explorer: A Semantic Health Recommendation System. Proceedings of the 2013 IEEE 29th International Conference on Data Engineering Workshops (ICDEW), Brisbane, Australia, 8–11 April 2013; pp. 55–59.
  55. Rivero-Rodriguez, A.; Konstantinidis, S.; Sanchez-Bocanegra, C.; Fernandez-Luque, L. A Health Information Recommender System: Enriching YouTube Health Videos with Medline Plus Information by the use of SnomedCT Terms. Proceedings of the 2013 IEEE 26th International Symposium on Computer-Based Medical Systems (CBMS), Porto, Portugal, 20–22 June 2013; pp. 257–261.
  56. Wiesner, M.; Pfeifer, D. Adapting Recommender Systems to the Requirements of Personal Health Record Systems. Proceedings of the 1st ACM International Health Informatics Symposium (IHI '10), ACM, New York, NY, USA, 11–12 November 2010; pp. 410–414.
  57. Wikimedia Foundation Inc Encyclopedia online. Wikipedia: The Free Encyclopedia. Available online: http://en.wikipedia.org/wiki/Wikipedia (accessed on 30 January 2014).
  58. Wiesner, M.; Pfeifer, D.; Yilmaz, A. Satisfying Health Information Needs: A German Health Exhibition Example. Proceedings of the 25th International Symposium on IEEE Computer-Based Medical Systems (CBMS), Rome, Italy, 20–22 June 2012; pp. 1–4.
  59. Farrell, R.G.; Danis, C.M.; Ramakrishnan, S.; Kellogg, W.A. Intrapersonal Retrospective Recommendation: Lifestyle Change Recommendations Using Stable Patterns of Personal Behavior. Proceedings of the First International Workshop on Recommendation Technologies for Lifestyle Change (LIFESTYLE 2012), Dublin, Ireland, 13 September 2012; p. p. 24.
  60. Herlocker, J.; Konstan, J.; Terveen, L.; Riedl, J. Evaluating collaborative filtering recommender systems. ACM Trans. Inf. Syst. (TOIS) 2004, 22, 5–53. [Google Scholar]
  61. Chomutare, T.; Arsand, E.; Hartvigsen, G. Mobile peer support in diabetes. Stud. Health Technol. Inform. 2011, 169, 48–52. [Google Scholar]
  62. Ghorai, K.; Saha, S.; Bakshi, A.; Mahanti, A.; Ray, P. An Mhealth Recommender for Smoking Cessation Using Case Based Reasoning. Proceedings of the 46th Hawaii International Conference on System Sciences (HICSS), Wailea, HI, USA, 7–10 January 2013; pp. 2695–2704.
  63. Sanderson, M. Test Collection Based Evaluation of Information Retrieval Systems; Now Publishers, Inc.: Delft, The Netherlands, 2010. [Google Scholar]
  64. Robertson, S.; Hancock-Beaulieu, M. On the evaluation of IR systems. Inform. Process. Manag. 1992, 28, 457–466. [Google Scholar] [CrossRef]
  65. Hripcsak, G.; Rothschild, A.S. Agreement, the f-measure, and reliability in information retrieval. J. Am. Med. Inform. Asso. 2005, 12, 296–298. [Google Scholar] [CrossRef]
  66. Kandula, S.; Zeng-Treitler, Q. Creating a Gold Standard for the Readability Measurement of Health Texts. Proceedings of the AMIA Annual Symposium Proceedings, Washington, DC, USA, 8–12 November 2008; Volume 2008, p. p. 353.
  67. Sowa, J.F. Principles of Semantic Networks: Explorations in the Representation of Knowledge; Morgan Kaufmann Publishers: San Mateo, CA, USA, 1991. [Google Scholar]
  68. Princeton University Cognitive Science Laboratory. Wordnet 3.0, A Lexical Database for the English Language. Available online: http://wordnet.princeton.edu/ (accessed on 9 January 2014).
  69. Hamp, B.; Feldweg, H. Germanet a Lexical Semantic Net for German. Proceedings of the ACL Workshop Automatic Information Extraction and Building of Lexical Semantic Resources for NLP Applications, Madrid, Spain; 1997; pp. 9–15. [Google Scholar]
  70. Bizer, C.; Lehmann, J.; Kobilarov, G.; Auer, S.; Becker, C.; Cyganiak, R.; Hellmann, S. DBpedia— A crystallization point for the Web of Data. Web Seman.: Sci. Serv. Agents World Wide Web 2009, 7, 154–165. [Google Scholar] [CrossRef]
  71. International Health Terminology Standards Development Organisation (IHTSDO). Snomed ct. Available online: http://www.ihtsdo.org/snomed-ct/ (accessed on 16 January 2014).
  72. Medische Informatiekunde, 6500 HB Nijmegen. Open Galen. Available online: http://www.opengalen.org/ (accessed on 16 January 2014).
  73. U.S. National Library of Medicine National Institutes of Health. Unified Medical Language System. Available online: http://www.nlm.nih.gov/research/umls/ (accessed on 16 January 2014).
  74. Joachims, T. Text Categorization with Support Vector Machines: Learning with Many Relevant Features; Springer: Heidelberg, Germany, 1998. [Google Scholar]
  75. Tan, P.N.; Steinbach, M.; Kumar, V. Introduction to Data Mining, 1st ed.; Addison-Wesley, Longman Publishing Co., Inc.: Boston, MA, USA, 2005. [Google Scholar]

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.