Next Article in Journal
The Unequal Effect of the COVID-19 Pandemic on the Labour Market and Income Inequality in China: A Multisectoral CGE Model Analysis Coupled with a Micro-Simulation Approach
Next Article in Special Issue
A Systematic Approach in Developing Management Workforce Readiness for Digital Health Transformation in Healthcare
Previous Article in Journal
The Association between Health-Enhancing Physical Activity and Quality of Life in Patients with Chronic Kidney Disease: Propensity Score Matching Analysis
Previous Article in Special Issue
Construction and Evaluation of a High-Frequency Hearing Loss Screening Tool for Community Residents
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Developing and Validating Risk Scores for Predicting Major Cardiovascular Events Using Population Surveys Linked with Electronic Health Insurance Records

1
Institute of Population Health Sciences, National Health Research Institutes, Maoli 350401, Taiwan
2
Institute of Public Health, National Yang-Ming University, Taipei 112304, Taiwan
3
Health Service Division, Industrial Technology Research Institute, Hsinchu 310401, Taiwan
4
Department of Cardiology, Taipei Veterans General Hospital, Taipei 112201, Taiwan
5
Department of Nephrology, Wei-Gong Memorial Hospital, Maoli 350401, Taiwan
6
Department of Pain Management, Min-Sheng General Hospital, Taoyuan 330056, Taiwan
7
Institute of Biomedical Sciences, Academia Sinica, Taipei 115201, Taiwan
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2022, 19(3), 1319; https://doi.org/10.3390/ijerph19031319
Submission received: 24 November 2021 / Revised: 13 January 2022 / Accepted: 19 January 2022 / Published: 25 January 2022

Abstract

:
A risk prediction model for major cardiovascular events was developed using population survey data linked to National Health Insurance (NHI) claim data and the death registry. Another set of population survey data were used to validate the model. The model was built using the Nutrition and Health Survey in Taiwan (NAHSIT) collected from 1993–1996 and linked with 10 years of events from NHI data. Major adverse cardiovascular events (MACEs) were identified based on hospital admission or death from coronary heart disease or stroke. The Taiwanese Survey on Hypertension, Hyperglycemia, and Hyperlipidemia (TwSHHH), conducted in 2002 was used for external validation. The NAHSIT data consisted of 1658 men and 1652 women aged 35–70 years. The incidence rates for MACE per 1000 person-years were 13.77 for men and 7.76 for women. These incidence rates for the TwSHHH were 7.27 for men and 3.58 for women. The model had reasonable discrimination (C-indexes: 0.76 for men; 0.75 for women), thus can be used to predict MACE risks in the general population. NHI data can be used to identify disease statuses if the definition and algorithm are clearly defined. Precise preventive health services in Taiwan can be based on this model.

1. Introduction

Precise preventive health services emphasize managing chronic diseases and developing individualized risk prediction to improve the quality and effectiveness of health care [1]. The ideal disease prevention should start with screening risk factors for predicting risk and then promote health throughout a patient’s entire life. Cardiovascular disease is a major cause of death and disability worldwide. The World Health Organization (WHO) recognizes the seriousness of cardiovascular disease (https://www.who.int/cardiovascular_diseases/guidelines/Pocket_GL_information/en/, accessed on 30 December 2021) and suggests that most cardiovascular diseases (CVDs) are preventable. A pocket guide can be used to identify people at high risk and provide guidance to prevent heart attacks or strokes. In the U.S., CVD risk is estimated using the Framingham Risk Score (FRS), which was developed in the U.S. when it was facing a high CVD prevalence [2,3]. The ability of the FRS to accurately predict CVD risk for different ethnicities is uncertain. Individual studies examining FRS performance have been conducted in Europe [4,5,6,7] and Asia [8,9,10,11,12]. Knowing the limitations of the original FRS, the American Heart Association (AHA) and the American College of Cardiology (ACC) released a new risk calculator [13]. The calculator, known as the ASCVD, was developed by pooling data from several cohorts, including cohorts from the Atherosclerosis Risk in Community [14], the Cardiovascular Health [15], the Coronary Artery Risk Development in Young Adults [16], and the Framingham, Original [17] and Offspring [18] studies. In November 2013, the AHA and ACC updated the clinical guidelines for managing lipids [19], and risk score was an important feature. The International Atherosclerosis Society provided a calibration factor for each country with different risks [20]. Following their instructions, we generated and submitted our own calibration factor [21]. Based on our experience in generating the calibration factor and observations from Asian countries [8], risk factors for coronary heart disease (CHD) may differ in Asian populations. Liu et al. recalibrated the Framingham prediction function using data from the Chinese Multi-Provincial Cohort study (CMCS) [11]. We applied their new coefficients to our national representative data from the 2002 Taiwan Survey of Hypertension, Hyperglycemia, and Hyperlipidemia (TwSHHH) and found that the prediction did not fit the Taiwanese population [22]. Thus, we developed our own model.
Most of the scores were based on longitudinal follow-up, and the diseases were ascertained either from self-reports or doctors’ diagnoses. Periodic follow-up of patients nationwide would be very costly. As use of electronic medical records (EMRs) increases, one form of the EMR can be used to ascertain events. Since 1995, the National Health Insurance (NHI) program has provided universal coverage to more than 99% of the population in Taiwan. To get reimbursed, medical facilities (either clinics or hospitals) must upload the information for each patient’s clinical or hospital visits. Thus, the NHI database contains personal characteristics (sex, date of birth, and insurance information) and clinical information (date, expenditures, diagnosis, prescription details, and operations) [23]. We conducted this study to construct and validate our own MACE risk prediction model using national surveys linked to claim data and the death registry.

2. Materials and Methods

2.1. Data

Data for developing the model were obtained from a nationwide survey, the 1993–1996 Nutrition and Health Survey in Taiwan (NAHSIT), which asked questions on disease history, food intake, and health-related behaviors. Participants were asked to fast for 8 h before physical examination. Blood samples were centrifuged immediately. Serum samples were frozen on dry ice, then delivered to the Academia Sinica and frozen at −70 °C on the same day. Frozen serum samples were analyzed in a certified laboratory. Coefficients of variation from the blood samples were within acceptable ranges deriving from 5% split blood samples [22]. No written consent form was required during the survey; however, we obtained patients’ oral consent before the survey. The 1993–1996 NAHSIT was the first representative survey with biomarkers. It was also conducted during the time National Health Insurance (NHI) was implemented. It was the beginning of following-up individuals using nationwide insurance claim data electronically.
Validation data were obtained from another nationwide survey, the TwSHHH, conducted in 2002 and followed up in 2007 [24]. The health examination was similar to that of the NAHSIT; however, nutrition intake was not recorded. Participants were informed about the survey, and data were linked only from those who signed the consent form.
The model included the following data: baseline values of log (age), systolic blood pressure (SBP), diastolic blood pressure (DBP), fasting glucose, total cholesterol, high-density lipoprotein (HDL-C), low-density lipoprotein (LDL-C), ratio of total cholesterol to HDL-C, triglycerides, uric acid, body mass index (BMI; kg/m2), waist circumference (cm), waist-to-hip ratio, and smoking status. The Institute Review Board of the National Health Research Institutes approved this study.

2.2. Events

MACEs are a group of disorders of the heart and blood vessels (https://www.who.int/news-room/fact-sheets/detail/cardiovascular-diseases-(cvds), accessed on 30 December 2021). Heart attacks and strokes are usually acute events and are mainly caused by a blockage that prevents blood from flowing to the heart or brain. MACEs, which include CHD and stroke, were extracted from the NHI databank or death registry. Only hospitalization or death due to CHD or stroke was considered. CHD was diagnosed as per ICD-9: 410-414 or ICD-10: I20-I25; stroke was diagnosed as per ICD-9:430-438 or ICD-10: I60-I69. Time-to-event was calculated from the date of the survey to the date of the event.

2.3. Statistical Methods

The model’s discrimination ability was evaluated using Harrell’s C [25], which evaluates the proportion of concordant pairs over all possible pairs. The formula is (Equation (1)),
C = p r { z ( X i ) > z ( X j ) | T i < T j   &   D i = 1 } = π conc π comp
where (Equation (2))
π conc = p r { z ( X i ) > z ( X j )   & T i < T j   &   D i = 1 }
represents the concordant pair, and (Equation (3))
π comp = p r { T i < T j   &   D i = 1 }
represents all pairs. If a variable increased the C value by 0.002, the variable was kept in the model. The Akaike information criterion (AIC) was used to guarantee the goodness of the model fit. The model with the lowest AIC was selected. In other words, the model was selected based on a higher C value and lower AIC. The Hosmer–Lemeshow test was used to evaluate the calibration [26]. This test divided the predicted risk into ten groups then compared the observed risk to the predicted risk. The χ2 test was also used. The same method was applied to the external calibration. All analyses were conducted using R and SAS statistical software, version 9.4 (SAS Institute Inc., Cary, NC, USA).

3. Results

To build the model, we used data from 1658 men and 1652 women aged between 35 and 70 years. For each disease, we excluded those who reported having the disease at baseline. The MACE incidence rates were 13.77 per 1000 person-years for men and 7.76 per 1000 person-years for women. We constructed separate models for CHD and stroke, and then combined them as MACE. Table 1 compares the baseline variables between patients who developed MACE, CHD, or stroke and those who did not. Men who developed MACEs were significantly older, with higher blood pressure, higher waist-hip ratios, higher waist circumferences, and higher proportions of hypertension than their counterparts at baseline. Among women, almost all variables differed except the glucose level. The 10-year event probabilities S0(10) for MACEs were 0.82 for men and 0.90 for women. Thus, approximately 82% of the men and 90% of the women remained MACE-free during the 10-year period.
Table 2 presents the final models. The C values for CHD were 0.73 for men and 0.82 for women. The AIC values were the lowest in both models, at 760 for men and 474 for women. Agreement between the predicted and observed probabilities was classified into deciles and tested via a χ2 test. The χ2 were 22.91 for men and 21.68 for women. The C values for stroke were 0.80 for men and 0.79 for women and for MACEs were 0.76 for men and 0.75 for women.
Models were validated using TwSHHH data linked to the NHI data. The MACE incidence rates were 7.27 per 1000 person-years for men and 3.58 per 1000 person-years for women in the 1993–1996 NAHSIT sample. The 10-year MACE-free probabilities were 0.90 for men and 0.95 for women. Figure 1 shows the validation. The χ2 statistics for comparing the predicted to the observed values were <20 for CHD and MACEs, but slightly higher for stroke (28.67 for men; 20.93 for women).

4. Discussion

In this study, we used data from national surveys linked to the NHI and death registry and extracted events over a 10-year period to develop a risk prediction model. Millions of records were processed. Manuel et al. used population health survey data to develop and validate a model for cardiovascular disease in Canada [27]. They combined the 2001, 2003, and 2005 Canadian Community Health Surveys to develop the model and used the 2007 survey to validate the model. The predictors were the self-reported risk behaviors, and the events were obtained from either the hospital admission for the disease or the death registry. We used a similar method to identify events, but chose surveys with biomarkers. Because the NHI data were for insurance claims, the data contained only the date of the claim and no information on disease conformation. We were advised by experts in the fields on hypertension, heart disease, and cerebrovascular disease to use the records from hospitalization or death to guarantee that real events were selected.
The Framingham score and CMCS used categorical data to calculate the risk scores (Table 3). The Framingham score used blood pressure, total cholesterol, HDL-C, diabetes (yes/no), and smoking (yes/no) as predictors. The CMCS was developed using Chinese data [9]. Figure 2 shows the comparisons of the two models fitting the NAHSIT 1993–1996 data for CHD events. The C statistics were lower in men than in women, whereas the χ2 statistic was much higher in women when using the Framingham score.
Chien et al. developed a point-based prediction model for CHD in Taiwan [28] using data collected in northern Taiwan. CHD was ascertained by physicians. The authors developed three models: a clinical model, a total cholesterol-based model, and an LDL-C-based model. The clinical model included age, sex, BMI, SBP, smoking status, and family history of CHD. The total cholesterol-based model was similar to the LDL-C-based model, except for the total cholesterol. However, the study lacked a nationally representative sample. Our sample was selected using a probability sampling scheme and covered the entire population of Taiwan. Therefore, our model can be used for risk prediction for the population in Taiwan.
The first global estimate of the burden of 135 diseases listed cerebrovascular diseases as the second leading cause of death after ischemic heart disease [29]. The WHO reported that 15 million people suffer a stroke worldwide annually (http://www.emro.who.int/health-topics/stroke-cerebrovascular-accident/index.html, accessed on 30 December 2021). Approximately one-third of these remain disabled for long periods, resulting in heavy burdens on their family and community. Taiwan is no exception to this. The earliest risk prediction model for stroke was developed using the Framingham study in 1991 [30]. A risk prediction model was developed using Taiwanese community data in 2010 [31]. The incidence was ~6.8% in the 16-year follow-up. Two models have been developed based on these data [31]. One was a clinical model that included measures of blood pressure and disease history. The other was a biomedical model that included total cholesterol, white blood cell counts, and fasting glucose in addition to items in the clinical model. A model was developed for Chinese populations using the China Health and Nutrition Survey [32] using the incidence between 2009 and 2015. Separate models were developed for ischemic stroke and hemorrhagic stroke. Each population has different risk factors. Thus, we developed our model using Taiwan population data. We focused on severe stroke that resulted in hospitalization or death. Our model selected SBP, triglycerides, glucose, and uric acid for men and SBP, waist-hip ratio, smoking, hypertension, and diabetes for women.
Because CHD and stroke are major cardiac events, we put them in one model. Our final model included age, sex, SBP, waist-hip ratio, HDL-C, and uric acid for men. The weight (coefficient) was heavy for the waist-hip ratio, implying that obesity may contribute largely to MACEs in men. Waist circumference, SBP, total cholesterol/HDL ratio, and smoking were used in the model for women. Lipid profiles (total cholesterol/HDL ratio) played a relatively important role in MACE development in women. The C statistics reached 0.76 for men and 0.75 for women. Using another national survey with lower incidence rates to validate the models, the C statistics were higher than those in the original population, reaching 0.78 for men and 0.79 for women. The overestimation on risk scores has been observed in many models for the same purpose. The WHO CVD Risk Chart Working Group suggested it might be models were developed using incidence at the population level and might include recurrent cases [33]. We used event-history model and the National Health Insurance data, which only count the event once. The overestimation was high in the highest 10th percentile. It was possibly caused by the linear function in the model. We have tried other functions, but they did not improve the model. In the end, the purpose was risk prevention. A higher estimation might alert individuals to modify their lifestyle in order to lower their risks.
This study had some limitations. First, no behavioral variables other than obesity-related variables were selected; thus, these variables may have all been expressed in patients’ blood pressure or biomarkers. Mediation models may be one solution. Second, we did not stratify stroke into different subtypes. However, our purpose was primary prevention in the general population. We hoped this model would apply to government-funded health check-ups for people aged ≥40 years. Taiwanese residents get free health check-ups when they are ≥40 years old. Those aged between 40 and 64 years get free health check-ups every 3 years; those aged ≥65 years get free health check-ups annually. The health check-up could implement our mode into the report and inform people about their 10-year risk of MACE. There is a website developed for risk prediction as well as guidelines for prevention (https://cdrc.hpa.gov.tw/index.jsp, accessed on 1 January 2022).

5. Conclusions

In conclusion, linking national surveys to health insurance data enabled generating a MACE risk prediction model. Our model was validated using a dataset from another survey conducted a few years later and with fewer incidences. The models performed well, indicating that our model was valid regardless of time.

Author Contributions

Conceptualization, H.-Y.C.; methodology, H.-Y.C. and C.-Y.H.; software, H.-L.F.; validation, H.-L.F.; formal analysis, H.-L.F.; investigation, H.-Y.C.; resources, W.-H.P.; data curation, H.-Y.C. and C.-Y.C.; writing—original draft preparation, H.-Y.C.; writing—review & editing, S.-Y.C., C.-C.H., H.-M.C., T.-W.C. and W.-C.Y.; visualization, H.-Y.C., C.-Y.H. and H.-L.F.; supervision, H.-Y.C.; project administration, C.-Y.C.; funding acquisition, H.-Y.C. All authors have read and agreed to the published version of the manuscript.

Funding

This work was funded by the Health Promotion Administration, Ministry of Health and Welfare (D1060310, D1060310-107, MOHW108-HPA-M-114-000108). The content of this paper reflects only the authors’ views. The sponsors did not place any restrictions on the study design.

Institutional Review Board Statement

This study used secondary data for the study. The study was approved by the Institutional Review Board of the National Health Research Institutes (EC106061-E).

Informed Consent Statement

No written consent form was required during the survey; however, we obtained patients’ oral consent before the 1993–1996 survey. Informed consent was obtained from all subjects in the 2002 TwSHHH.

Data Availability Statement

The survey data and National Health Insurance data used in this study are available at the Ministry of Health and Welfare (https://dep.mohw.gov.tw/DOS/lp-2503-113-xCat-DOS_dc002.html, accessed on 30 December 2021).

Acknowledgments

Authors would like to give their thanks for all to S.-B.C., who is a computer engineer and provides a lot of help in developing the risk prediction system. Appreciation also goes to L.-S.H. who reviewed and organized literature.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Chiu, H.-W.; Li, Y.-C. Toward precise and preventive healthcare with computational tools. Comput. Methods Programs Biomed. 2018, 153, A1. [Google Scholar] [CrossRef]
  2. D’Agostino, R.B.; Grundy, S.; Sullivan, L.M.; Wilson, P.; Group, C.H.D.R.P. Validation of the Framingham coronary heart disease prediction scores: Results of a multiple ethnic groups investigation. JAMA 2001, 286, 180–187. [Google Scholar] [CrossRef] [PubMed]
  3. Eichler, K.; Puhan, M.A.; Steurer, J.; Bachmann, L.M. Prediction of first coronary events with the Framingham score: A systematic review. Am. Heart J. 2007, 153, 722–731.e8. [Google Scholar] [CrossRef] [PubMed]
  4. Brindle, P.; Jonathan, E.; Lampe, F.; Walker, M.; Whincup, P.; Fahey, T.; Ebrahim, S. Predictive accuracy of the Framingham coronary risk score in British men: Prospective cohort study. BMJ 2003, 327, 1267. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Marques-Vidal, P.; Rodondi, N.; Bochud, M.; Chiolero, A.; Pecoud, A.; Hayoz, D.; Paccaud, F.; Mooser, V.; Firmann, M.; Waeber, G.; et al. Predictive accuracy of original and recalibrated Framingham risk score in the Swiss population. Int. J. Cardiol. 2009, 133, 346–353. [Google Scholar] [CrossRef] [Green Version]
  6. Merry, A.H.H.; Boer, J.M.A.; Schouten, L.; Ambergen, T.; Steyerberg, E.; Feskens, E.; Verschuren, W.M.M.; Gorgels, A.P.M.; Brandt, P.V.D. Risk prediction of incident coronary heart disease in the Netherlands: Re-estimation and improvement of the SCORE risk function. Eur. J. Prev. Cardiol. 2011, 19, 840–848. [Google Scholar] [CrossRef]
  7. Hense, H.-W.; Schulte, H.; Löwel, H.; Assmann, G.; Keil, U. Framingham risk function overestimates risk of coronary heart disease in men and women from Germany—results from the MONICA Augsburg and the PROCAM cohorts. Eur. Hear. J. 2003, 24, 937–945. [Google Scholar] [CrossRef]
  8. Asia Pacific Cohort Studies Collaboration Cardiovascular risk prediction tools for populations in Asia. J. Epidemiol. Community Health 2007, 61, 115–121. [CrossRef]
  9. Chow, C.K.; Joshi, R.; Celermajer, D.S.; Patel, A.; Neal, B. Recalibration of a Framingham risk equation for a rural population in India. J. Epidemiol. Community Health 2009, 63, 379–385. [Google Scholar] [CrossRef]
  10. Jee, S.H.; Jang, Y.; Oh, D.J.; Oh, B.-H.; Lee, S.H.; Park, S.-W.; Seung, K.-B.; Mok, Y.; Jung, K.J.; Kimm, H.; et al. A coronary heart disease prediction model: The Korean Heart Study. BMJ Open 2014, 4, e005025. [Google Scholar] [CrossRef]
  11. Liu, J.; Hong, Y.; D’Agostino, S.R.B.; Wu, Z.; Wang, W.; Sun, J.; Wilson, P.W.F.; Kannel, W.B.; Zhao, D. Predictive Value for the Chinese Population of the Framingham CHD Risk Assessment Tool Compared with the Chinese Multi-provincial Cohort Study. JAMA 2004, 291, 2591–2599. [Google Scholar] [CrossRef] [PubMed]
  12. Nishimura, K.; Okamura, T.; Watanabe, M.; Nakai, M.; Takegami, M.; Higashiyama, A.; Kokubo, Y.; Okayama, A.; Miyamoto, Y. Predicting Coronary Heart Disease Using Risk Factor Categories for a Japanese Urban Population, and Comparison with the Framingham Risk Score: The Suita Study. J. Atheroscler. Thromb. 2014, 21, 784–798. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Goff, D.C., Jr.; Lloyd-Jones, D.M.; Bennett, G.; Coady, S.; D’Agostino, R.B.; Gibbons, R.; Greenland, P.; Lackland, D.T.; Levy, D.; O’Donnell, C.J.; et al. 2013 ACC/AHA Guideline on the Assessment of Cardiovascular Risk: A report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines. Circulation 2014, 129 (Suppl. S2), S49–S73. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Li, R.; Bensen, J.T.; Hutchinson, R.G.; Province, M.A.; Hertz-Picciotto, I.; Sprafka, J.M.; Tyroler, H.A. Family risk score of coronary heart disease (CHD) as a predictor of CHD: The atherosclerosis risk in communities (ARIC) study and The NHLBI Family Heart Study. Genet. Epidemiol. 2000, 18, 236–250. [Google Scholar] [CrossRef]
  15. Fried, L.P.; Borhani, N.O.; Enright, P.; Furberg, C.D.; Gardin, J.M.; Kronmal, R.A.; Kuller, L.H.; Manolio, T.A.; Mittelmark, M.B.; Newman, A.B.; et al. The cardiovascular health study: Design and rationale. Ann. Epidemiology 1991, 1, 263–276. [Google Scholar] [CrossRef]
  16. Friedman, G.D.; Cutter, G.R.; Donahue, R.P.; Hughes, G.H.; Hulley, S.B.; Jacobs, D.R.; Liu, K.; Savage, P.J. Cardia: Study design, recruitment, and some characteristics of the examined subjects. J. Clin. Epidemiol. 1988, 41, 1105–1116. [Google Scholar] [CrossRef]
  17. Higgins, M.W. The Framingham Heart Study: Review of epidemiological design and data, limitations and prospects. Prog. Clin. Boil. Res. 1984, 147, 51–64. [Google Scholar]
  18. Feinleib, M.; Kannel, W.B.; Garrison, R.J.; McNamara, P.M.; Castelli, W.P. The Framingham Offspring Study. Design and preliminary data. Prev. Med. 1975, 4, 518–525. [Google Scholar] [CrossRef]
  19. Stone, N.; Robinson, J.G.; Lichtenstein, A.H.; Merz, C.N.B.; Blum, C.B.; Eckel, R.H.; Goldberg, A.C.; Gordon, D.; Levy, D.; Lloyd-Jones, D.; et al. 2013 ACC/AHA Guideline on the Treatment of Blood Cholesterol to Reduce Atherosclerotic Cardiovascular Risk in Adults. J. Am. Coll. Cardiol. 2014, 63, 2889–2934. [Google Scholar] [CrossRef] [Green Version]
  20. Grundy, S.M.; Arai, H.; Barter, P.; Bersot, T.P.; Betteridge, D.J.; Carmena, R.; Cuevas, A.; Davidson, M.H.; Genest, J.; Kesäniemi, Y.A.; et al. An International Atherosclerosis Society Position Paper: Global recommendations for the management of dyslipidemia-Full report. J. Clin. Lipidol. 2014, 8, 29–60. [Google Scholar] [CrossRef] [Green Version]
  21. Chang, H.-Y.; Fang, H.-L.; Huang, C.-Y.; Hung, L.-S.; Chien, K.-L.; Pan, W.-H. Estimating the Calibrating Coefficient of the Framingham Score to Predict Risk of Coronary Heart Disease in the Taiwanese Population. Int. J. Gerontol. 2018. [Google Scholar] [CrossRef]
  22. Chang, H.Y.; Pan, W.H.; Yeh, W.T.; Tsai, K.S. Hyperuricemia and gout in Taiwan: Results from the Nutritional and Health Survey in Taiwan (1993–96). J. Rheumatol. 2001, 28, 1640–1646. [Google Scholar] [PubMed]
  23. Hsing, A.W.; Ioannidis, J.P.A. Nationwide Population Science. JAMA Intern. Med. 2015, 175, 1527–1529. [Google Scholar] [CrossRef] [PubMed]
  24. Su, T.-C.; Bai, C.-H.; Chang, H.-Y.; You, S.-L.; Chien, K.-L.; Chen, M.-F.; Chen, H.-J.; Pan, W.-H.; Tseng, C.-H.; Cheng, S.-H.; et al. Evidence for improved control of hypertension in Taiwan: 1993–2002. J. Hypertens. 2008, 26, 600–606. [Google Scholar] [CrossRef]
  25. Harrell, F.E., Jr.; Lee, K.L.; Mark, D.B. Multivariable prognostic models: Issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Stat. Med. 1996, 15, 361–387. [Google Scholar] [CrossRef]
  26. Cook, N.R. Statistical Evaluation of Prognostic versus Diagnostic Models: Beyond the ROC Curve. Clin. Chem. 2008, 54, 17–23. [Google Scholar] [CrossRef] [Green Version]
  27. Manuel, D.G.; Tuna, M.; Bennett, C.; Hennessy, D.; Rosella, L.; Sanmartin, C.; Tu, J.V.; Perez, R.; Fisher, S.; Taljaard, M. Development and validation of a cardiovascular disease risk-prediction model using population health surveys: The Cardiovascular Disease Population Risk Tool (CVDPoRT). Can. Med Assoc. J. 2018, 190, E871–E882. [Google Scholar] [CrossRef] [Green Version]
  28. Chien, K.-L.; Hsu, H.-C.; Su, T.-C.; Chang, W.-T.; Chen, P.-C.; Sung, F.-C.; Lin, H.-J.; Chen, M.-F.; Lee, Y.-T. Constructing a point-based prediction model for the risk of coronary artery disease in a Chinese community: A report from a cohort study in Taiwan. Int. J. Cardiol. 2012, 157, 263–268. [Google Scholar] [CrossRef]
  29. Murray, C.J.; Lopez, A.D. World Health Organization. The Global Burden of Disease: A Comprehensive Assessment of Mortality and Disability from Diseases, Injuries, and Risk Factors in 1990 and Projected to 2020; Harvard School of Public Health: Boston, MA, USA, 1996. [Google Scholar]
  30. Wolf, P.A.; D’Agostino, R.B.; Belanger, A.J.; Kannel, W.B. Probability of stroke: A risk profile from the Framingham Study. Stroke 1991, 22, 312–318. [Google Scholar] [CrossRef] [Green Version]
  31. Chien, K.-L.; Su, T.-C.; Hsu, H.-C.; Chang, W.-T.; Chen, P.-C.; Sung, F.-C.; Chen, M.-F.; Lee, Y.-T. Constructing the Prediction Model for the Risk of Stroke in a Chinese Population. Stroke 2010, 41, 1858–1864. [Google Scholar] [CrossRef] [Green Version]
  32. Yu, Q.; Wu, Y.; Jin, Q.; Chen, Y.; Lin, Q.; Liu, X. Development and internal validation of a multivariable prediction model for 6-year risk of stroke: A cohort study in middle-aged and elderly Chinese population. BMJ Open 2021, 11, e048734. [Google Scholar] [CrossRef] [PubMed]
  33. WHO CVD Risk Chart Working Group. World Health Organization cardiovascular disease risk charts: Revised models to estimate risk in 21 global regions. Lancet Glob. Health 2019, 7, e1332–e1345. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Validation of the developed models using TwSHHH data for men (a) and women (b).
Figure 1. Validation of the developed models using TwSHHH data for men (a) and women (b).
Ijerph 19 01319 g001
Figure 2. Applying different prediction models for CHD to the Taiwanese population (1993–1996 NAHSIT) for men (a) and women (b).
Figure 2. Applying different prediction models for CHD to the Taiwanese population (1993–1996 NAHSIT) for men (a) and women (b).
Ijerph 19 01319 g002
Table 1. Comparisons of baseline values between patients who developed MACEs, CHD, or stroke over a 10-year period and those did not.
Table 1. Comparisons of baseline values between patients who developed MACEs, CHD, or stroke over a 10-year period and those did not.
MenMACECHD Stroke
No (N = 1139)Yes (N = 193) No (N = 1230)Yes (N = 102) No (N = 1407)Yes (N = 158)
VariableMeanS.D.MeanS.D.p-ValueMeanS.D.MeanS.D.p-ValueMeanS.D.MeanS.D.p-Value
Age (years)51.069.9357.828.58*51.599.9957.428.95*52.4110.1158.657.88*
SBP (mmHg)126.5917.02139.5023.22*127.7918.12136.4822.30*128.9018.42142.3421.76*
DBP (mmHg)82.1512.0885.8914.34*82.6212.5283.6112.190.4482.8612.2088.0415.05*
Cholesterol (mg/dL)196.6937.12194.8639.070.63196.3237.19197.6140.040.79197.4237.80193.8537.720.37
Glucose (mg/dL)96.4218.1597.6021.090.5996.4418.1398.2923.370.5699.8726.33107.6837.770.06
Triglycerides (mg/dL)124.7179.88133.8480.790.28125.0679.84137.0281.910.27128.5680.56139.9187.000.21
HDL54.0517.9153.3228.810.7954.1319.9951.7118.940.3652.9717.8253.2329.690.93
LDL117.1736.54116.3135.670.82116.9936.39117.6436.640.89118.2836.77113.9136.750.27
Uric acid (mg/dL)6.671.676.861.920.316.691.706.761.830.786.781.757.041.910.16
CHOL/HDL3.981.354.221.680.163.991.354.341.960.174.101.484.171.430.66
BMI (kg/m2)23.593.2123.913.160.3123.593.2324.102.970.2323.903.3224.613.45*
Waist-hip ratio0.870.060.890.06*0.870.060.880.060.100.880.060.900.06*
Waist (cm)81.648.5583.749.09*81.758.5684.239.44*82.679.1284.988.96*
Smoker (%)71 78 0.0772 75 0.5871 760.430.27
Diabetes (%)3 6 0.184 5 0.557 170.38*
Hypertension (%)33 56 *35 54 *40 650.48*
WomenNo (N =1116)Yes (N = 106) No (N = 1159)Yes (N = 64) No (N = 1473)Yes (N = 121)
Age (years)50.019.7657.698.25*50.259.8158.637.27*51.589.9058.409.17*
SBP (mmHg)123.5718.77134.1725.48*124.1119.25131.7025.16*126.6520.53142.4723.40*
DBP (mmHg)78.8711.4083.0615.56*79.1011.6281.7015.650.1980.0011.9985.9615.32*
Cholesterol (mg/dL)197.9338.54215.7643.60*198.4038.49219.1648.14*200.3939.34215.2946.08*
Glucose (mg/dL)98.1720.61103.5439.580.2998.1221.05107.9643.940.17102.3429.37123.4559.55*
Triglycerides (mg/dL)106.1161.74142.2587.68*106.9262.86150.5088.19*117.3972.54156.3599.57*
HDL61.0119.1653.3116.25*60.9019.0650.1415.52*59.3219.0054.0617.39*
LDL115.7136.54132.8940.73*116.0936.43137.4344.96*117.6737.25128.1737.37*
Uric acid (mg/dL)5.311.455.671.57*5.321.475.751.400.065.541.625.981.88*
Cholesterol/HDL3.521.184.381.53*3.531.184.731.69*3.691.304.411.97*
BMI (kg/m2)24.253.6925.784.07*24.293.7325.943.75*24.713.7625.834.33*
Waist-hip ratio0.800.070.840.07*0.800.070.850.07*0.800.070.850.07*
Waist (cm)76.008.8881.439.85*76.138.9782.858.99*77.399.3281.949.42*
Smoker (%)6 9 0.346 6 0.866 17 *
Diabetes (%)3 7 0.163 5 0.638 27 *
Hypertension (%)27 44 *28 39 0.0636 64 *
All Ns excluded patients who reported having this disease at baseline. * p ≤ 0.05.
Table 2. Variables used in the models for each disease by sex.
Table 2. Variables used in the models for each disease by sex.
Disease
MenMACECHDStroke
Incidence13.77/10007.14/10009.53/1000
VariableCoefficientCoefficientCoefficient
Age (years)7.27828.30078.9606
SBP (mmHg)0.0257-0.0231
Glucose (mg/dL) -0.0050
Triglycerides (mg/dL) -0.0013
HDL−0.0039-
LDL −0.0081
Uric acid (mg/dL)0.0214-0.0603
Cholesterol/HDL -
Waist-hip ratio3.2778-
Waist (cm) 0.0163
Smoke (yes) -
Diabetes (yes) -
Hypertension (yes) 0.6715
C statistic0.760.730.80
Women
Incidence7.76/10004.63/10006.98/1000
Age (years)6.88339.38914.3538
SBP (mmHg)0.01280.00150.0138
Glucose (mg/dL)
Triglycerides (mg/dL) 0.0001
HDL
LDL
Uric acid (mg/dL)
CHOL/HDL0.30540.3581
Waist-hip ratio 3.9712
Waist (cm)0.02570.0425
Smoke (yes)0.1923 0.7821
Diabetes (yes) 0.5348
Hypertension (yes) 0.5572
C statistic0.750.820.79
Table 3. Framingham and CMCS score coefficients.
Table 3. Framingham and CMCS score coefficients.
MenWomen
Risk FactorsCMCSFraminghamCMCSFramingham
Age0.070.050.070.17
Blood pressure
        Optimal−0.510.09−0.50−0.74
        NormalReferenceReferenceReferenceReference
        High normal0.210.42−0.87−0.37
        Stage 1 hypertension0.330.660.340.22
        Stage 2–4 hypertension0.770.900.470.61
Cholesterol, mg/dL
        <160−0.51−0.380.180.21
        160–199ReferenceReferenceReferenceReference
        200–2390.070.570.130.44
        240–2790.320.740.140.56
        ≥2800.520.831.670.89
HDL, mg/dL
        <35−0.250.610.620.73
        35–440.010.370.300.60
        45–49ReferenceReference0.080.60
        50–59−0.070.00ReferenceReference
        ≥60−0.40−0.46−0.78−0.54
Diabetes0.090.530.180.87
Smoking0.620.73−0.950.98
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Chang, H.-Y.; Fang, H.-L.; Huang, C.-Y.; Chiang, C.-Y.; Chuang, S.-Y.; Hsu, C.-C.; Cheng, H.-M.; Chen, T.-W.; Yao, W.-C.; Pan, W.-H. Developing and Validating Risk Scores for Predicting Major Cardiovascular Events Using Population Surveys Linked with Electronic Health Insurance Records. Int. J. Environ. Res. Public Health 2022, 19, 1319. https://doi.org/10.3390/ijerph19031319

AMA Style

Chang H-Y, Fang H-L, Huang C-Y, Chiang C-Y, Chuang S-Y, Hsu C-C, Cheng H-M, Chen T-W, Yao W-C, Pan W-H. Developing and Validating Risk Scores for Predicting Major Cardiovascular Events Using Population Surveys Linked with Electronic Health Insurance Records. International Journal of Environmental Research and Public Health. 2022; 19(3):1319. https://doi.org/10.3390/ijerph19031319

Chicago/Turabian Style

Chang, Hsing-Yi, Hsin-Ling Fang, Ching-Yu Huang, Chi-Yung Chiang, Shao-Yuan Chuang, Chih-Cheng Hsu, Hao-Min Cheng, Tzen-Wen Chen, Wei-Cheng Yao, and Wen-Harn Pan. 2022. "Developing and Validating Risk Scores for Predicting Major Cardiovascular Events Using Population Surveys Linked with Electronic Health Insurance Records" International Journal of Environmental Research and Public Health 19, no. 3: 1319. https://doi.org/10.3390/ijerph19031319

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop