Analysis of Geospatial Variations in Healthcare Across Rural Communities in the US Using Machine Learning
Abstract
1. Introduction
2. Materials and Methods
2.1. Data Source
2.2. Design of Analysis and Model Specification
3. Results
3.1. Model Performance, Key Predictors, and Impression
3.2. Geospatial Variations in Healthcare Across Rural Communities
4. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
AHRQ | Agency of Healthcare Research and Quality |
AIC | Akaike information criterion |
CRNA | Certified registered nurse anesthetist |
DRF | Distributed random forest |
E&M | Evaluation and management |
ED | Emergency department |
FFS | Fee for service |
FQHCs | Federally qualified health centers |
GBM | Gradient boosting machine |
GLM | Generalized linear model |
ICU | Intensive care unit |
MGWR | Multiscale geographically weighted regression |
NPI | National provider identifier |
RHC | Rural health clinics |
RUCC | Rural–urban continuum code |
SDOH | Social drivers of health |
SHAP | Shapley additive explanations (SHAP) analysis |
XGBoost | Extreme gradient boosting |
References
- Probst, J.C.; Barker, J.C.; Enders, A.; Gardiner, P. Current State of Child Health in Rural America: How Context Shapes Children’s Health. J. Rural Health 2018, 34 (Suppl. 1), s3–s12. [Google Scholar] [CrossRef] [PubMed]
- AHRQ Agency for Healthcare Research and Quality. Social Determinants of Health Database. Available online: https://www.ahrq.gov/sdoh/data-analytics/sdoh-data.html (accessed on 21 June 2025).
- Reddy, G.T.; Reddy, M.P.K.; Lakshmanna, K.; Kaluri, R.; Rajput, D.S.; Srivastava, G.; Baker, T. Analysis of Dimensionality Reduction Techniques on Big Data. IEEE Access 2020, 8, 54776–54788. [Google Scholar] [CrossRef]
- Svynarenko, R.; Lindley, L. Defining Rurality in End-of-Life Research: Evaluation of Common Measures. J. Health Care Poor Underserved 2021, 32, 2167–2180. [Google Scholar] [CrossRef] [PubMed]
- U.S. Department of Health and Human Services. Office of Disease Prevention and Health Promotion Healthy People 2030. Available online: https://health.gov/healthypeople/objectives-and-data/social-determinants-health (accessed on 21 June 2025).
- Svynarenko, R. Social Determinants of Health Geospatial Research Methods. Available online: https://pedeolcare.utk.edu/sdoh-database-reviews/ (accessed on 21 June 2025).
- Fotheringham, A.S.; Yang, W.; Kang, W. Multiscale Geographically Weighted Regression (MGWR). Ann. Am. Assoc. Geogr. 2017, 107, 1247–1265. [Google Scholar] [CrossRef]
- Probst, J.; Eberth, J.M.; Crouch, E. Structural Urbanism Contributes to Poorer Health Outcomes for Rural America. Health Aff. 2019, 38, 1976–1984. [Google Scholar] [CrossRef]
- Li, Z. Extracting Spatial Effects from Machine Learning Model Using Local Interpretation Method: An Example of SHAP and XGBoost. Comput. Environ. Urban Syst. 2022, 96, 101845. [Google Scholar] [CrossRef]
- Tanner, D.; Zhang, Y.; Chang, J.E.; Speyer, P.; Adamson, E.; Aerts, A.; Lavista Ferres, J.M.; Weeks, W.B. Machine Learning to Evaluate the Relationship between Social Determinants and Diabetes Prevalence in New York City. BMJ Public Health 2024, 2, e001394. [Google Scholar] [CrossRef]
- Cutter, S.L.; Boruff, B.J.; Shirley, W.L. Social Vulnerability to Environmental Hazards. Soc. Sci. Q. 2003, 84, 242–261. [Google Scholar] [CrossRef]
- Ekwaru, J.P.; Veugelers, P.J. The Overlooked Importance of Constants Added in Log Transformation of Independent Variables with Zero Values: A Proposed Approach for Determining an Optimal Constant. Stat. Biopharm. Res. 2018, 10, 26–29. [Google Scholar] [CrossRef]
- Boudt, K.; Todorov, V.; Wang, W. Robust Distribution-Based Winsorization in Composite Indicators Construction. Soc. Indic. Res. 2020, 149, 375–397. [Google Scholar] [CrossRef]
- James, G.; Witten, D.; Hastie, T.; Tibshirani, R. An Introduction to Statistical Learning: With Applications in R; Springer Texts in Statistics; Corrected at 8th Printing; Springer: New York, NY, USA; Berlin/Heidelberg, Germany; Dordrecht, The Netherlands; London, UK, 2017; ISBN 978-1-4614-7137-0. [Google Scholar]
- Marcilio, W.E.; Eler, D.M. From Explanations to Feature Selection: Assessing SHAP Values as Feature Selection Mechanism. In Proceedings of the 2020 33rd SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), Ipojuca, Brazil, 7–10 November 2020; IEEE: Ipojuca, Brazil, 2020; pp. 340–347. [Google Scholar]
- Fotheringham, A.S. Quantitative Geography: Perspectives on Spatial Data Analysis; Sage Publications Ltd.: Thousand Oaks, CA, USA, 2000; ISBN 978-0-7619-5948-9. [Google Scholar]
- De Smith, M.J.; Goodchild, M.F.; Longley, P.A. Geospatial Analysis: A Comprehensive Guide to Principles, Techniques and Software Tools, 6th ed.; Drumlin Security: London, UK, 2018; ISBN 978-1-912556-04-5. [Google Scholar]
- Liu, Y.; Liu, Z.; Luo, X.; Zhao, H. Diagnosis of Parkinson’s Disease Based on SHAP Value Feature Selection. Biocybern. Biomed. Eng. 2022, 42, 856–869. [Google Scholar] [CrossRef]
- Wang, H.; Liang, Q.; Hancock, J.T.; Khoshgoftaar, T.M. Feature Selection Strategies: A Comparative Analysis of SHAP-Value and Importance-Based Methods. J. Big Data 2024, 11, 44. [Google Scholar] [CrossRef]
- Chea, H.; Kim, H. Examining Spatial Disparities of Obesity: Residential Segregation and the Urban–Rural Divide. Prof. Geogr. 2020, 72, 206–218. [Google Scholar] [CrossRef]
- Korvink, M.; Biondolillo, M.; Van Dijk, J.W.; Banerjee, A.; Simenz, C.; Nelson, D. Detection of Potential Causal Pathways among Social Determinants of Health: A Data-Informed Framework. Soc. Sci. Med. 2025, 373, 118025. [Google Scholar] [CrossRef]
- Cohen, S.A.; Greaney, M.L. Aging in Rural Communities. Curr. Epidemiol. Rep. 2023, 10, 1–16. [Google Scholar] [CrossRef]
- Coburn, A.F.; Kilbreth, E.H.; Long, S.H.; Marquis, M.S. Urban-Rural Differences in Employer-Based Health Insurance Coverage of Workers. Med. Care Res. Rev. 1998, 55, 484–496. [Google Scholar] [CrossRef] [PubMed]
- United States Government Accountability Office. Federal Social Safety Net Programs: Millions of Full-Time Workers Rely on Federal Health Care and Food Assistance Programs. 2020. Available online: https://www.gao.gov/products/gao-21-45 (accessed on 21 June 2025).
- Feyereisen, S.L.; Puro, N.; McConnell, W. Addressing Provider Shortages in Rural America: The Role of State Opt-Out Policy Adoptions in Promoting Hospital Anesthesia Provision. J. Rural Health 2021, 37, 684–691. [Google Scholar] [CrossRef]
- Fornehed, M.L.C.; Svynarenko, R.; Keim-Malpass, J.; Cozad, M.J.; Qualls, K.A.; Stone, W.L.; Lindley, L.C. Comparison between Rural and Urban Appalachian Children in Hospice Care. South. Med. J. 2022, 115, 192–197. [Google Scholar] [CrossRef]
- Choi, S.; Weech-Maldonado, R.; Powers, T.L.; Hearld, L.R. Antecedents of Geographical Expansion: The Case of Federally Qualified Health Centers. Health Care Manag. Rev. 2022, 47, E32–E40. [Google Scholar] [CrossRef]
- Rural Health Information Hub. Rural Health Clinic Program at 45 Years: Created for Access and Still Delivering Care. 2022. Available online: https://www.ruralhealthinfo.org/rural-monitor/rhc-program (accessed on 21 June 2025).
- Center on Budget and Policy Priorities. Status of Medicaid Expansion in 2020. Available online: https://www.cbpp.org/charts/status-of-state-medicaid-expansion-january-2020 (accessed on 21 June 2025).
- Cecil, G. Sheps Center for Health Services Research 194 Rural Hospital Closures and Conversions Since January 2005. Available online: https://www.shepscenter.unc.edu/programs-projects/rural-health/rural-hospital-closures/ (accessed on 21 June 2025).
- US Census Bureau. 341 U.S. Counties Experiencing Persistent Poverty. Available online: https://www.census.gov/library/stories/2023/05/persistent-poverty-areas-with-long-term-high-poverty.html (accessed on 21 June 2025).
- Gee, R.E.; Rosenbaum, S. The Affordable Care Act: An Overview for Obstetricians and Gynecologists. Obstet. Gynecol. 2012, 120, 1263–1266. [Google Scholar] [CrossRef]
- Statz, M.; Evers, K. Spatial Barriers as Moral Failings: What Rural Distance Can Teach Us about Women’s Health and Medical Mistrust. Health Place 2020, 64, 102396. [Google Scholar] [CrossRef] [PubMed]
- Fornehed, M.L.C.; Mixer, S.J.; Lindley, L.C. Families’ Decision Making at End of Life in Rural Appalachia. J. Hosp. Palliat. Nurs. 2020, 22, 188–195. [Google Scholar] [CrossRef] [PubMed]
- Barua, S. Spatial Inequality and Explaining the Urban-Rural Gap in Obesity in India: Evidence from 2015–2016 Population-Based Survey. PLoS ONE 2023, 18, e0279840. [Google Scholar] [CrossRef] [PubMed]
Models | Cross-Validation Sample (n = 1581) | Test Sample (n = 395) | ||
---|---|---|---|---|
MRD | R2 | MRD | R2 | |
GBM | 0.0040 | 0.6438 | 0.0035 | 0.6626 |
XGBoost | 0.0043 | 0.6110 | 0.0037 | 0.6437 |
DRF | 0.0041 | 0.6345 | 0.0038 | 0.6378 |
Predictors: | Mean | SD | Min | Max |
---|---|---|---|---|
A. Health Insurance Status a (4): | ||||
Employer-based health insurance | 38.60 | 8.61 | 6.32 | 69.21 |
Direct-purchase health insurance only | 6.82 | 3.90 | 0.00 | 39.21 |
Medicare prescription drug plan | 128.97 | 40.28 | 13.49 | 284.79 |
Medicare FFS beneficiaries | 165.83 | 49.35 | 26.98 | 380.71 |
B. Access to Care (14): | ||||
Density of providers b: | ||||
Dentists with NPI | 0.41 | 0.29 | 0.00 | 2.72 |
CRNAs with NPI | 0.09 | 0.14 | 0.00 | 1.25 |
Substance abuse facilities offering HIV testing | 0.01 | 0.03 | 0.00 | 0.51 |
Facilities that provide mental health services | 0.07 | 0.10 | 0.00 | 1.57 |
Ambulatory surgical centers | 0.01 | 0.02 | 0.00 | 0.43 |
Density of FQHCs | 0.10 | 0.21 | 0.00 | 4.63 |
Density of RHCs | 0.30 | 0.37 | 0.00 | 2.87 |
Hospitals with ED | 0.14 | 0.21 | 0.00 | 2.12 |
Hospitals with medical–surgical ICU | 0.05 | 0.09 | 0.00 | 0.95 |
Hospitals with obstetric care | 0.07 | 0.11 | 0.00 | 0.99 |
Hospitals with rehabilitative care | 0.06 | 0.09 | 0.00 | 1.00 |
Hospices | 0.03 | 0.06 | 0.00 | 0.79 |
Other: | ||||
ED visits c | 591.20 | 134.57 | 29.00 | 1517.00 |
Percentage of clinicians who may accept Medicare-approved amounts | 4.69 | 10.49 | 0.00 | 100.00 |
C. Standardized Medicare Payments d (2): | ||||
FQHC/RHC | 218.21 | 189.35 | 0.00 | 1260.51 |
E&M services | 639.20 | 199.34 | 235.02 | 1583.40 |
Dependent Variable: | ||||
Rural–Urban Continuum Code (RUCC) | 6.82 | 1.54 | 4 | 9 |
Explanatory Variables | Opt. BW ** | Sig.: Number of Counties (%) | Mean | SD | Min | Median | Max |
---|---|---|---|---|---|---|---|
Intercept (Scaled) | 57 | 175 (8.9) | 0.062 | 0.264 | −0.734 | 0.058 | 0.860 |
A. Health Insurance Status a (4): | |||||||
Employer-based health insurance | 840 | 1547 (78.3) | −0.121 | 0.055 | −0.212 | −0.121 | 0.012 |
Direct-purchase health insurance only | 1976 | 0 (0.0) | 0.001 | 0.001 | −0.006 | 0.001 | 0.002 |
Medicare prescription drug plan | 397 | 731 (37.0) | 0.277 | 0.260 | −0.127 | 0.197 | 0.869 |
Medicare FFS beneficiaries | 1976 | 1976 (100.0) | 0.155 | 0.001 | 0.150 | 0.156 | 0.156 |
B. Access to Care (14): | |||||||
Density of providers b: | |||||||
Dentists with NPI | 1408 | 1227 (62.1) | −0.082 | 0.051 | −0.157 | −0.076 | 0.007 |
CRNAs with NPI | 1976 | 1976 (100.0) | −0.133 | 0.001 | −0.135 | −0.133 | −0.127 |
Substance abuse facilities offering HIV testing | 882 | 1157 (58.6) | −0.084 | 0.044 | −0.186 | −0.080 | 0.002 |
Facilities that provide mental health services | 882 | 716 (36.2) | −0.065 | 0.042 | −0.150 | −0.056 | 0.012 |
Ambulatory surgical centers | 184 | 746 (37.8) | −0.173 | 0.093 | −0.434 | −0.153 | 0.056 |
Density of FQHCs | 1976 | 0 (0.0) | −0.033 | 0.000 | −0.035 | −0.033 | −0.031 |
Density of RHCs | 1976 | 0 (0.0) | 0.008 | 0.001 | 0.004 | 0.009 | 0.016 |
Hospitals with ED | 1976 | 0 (0.0) | −0.031 | 0.000 | −0.036 | −0.031 | −0.030 |
Hospitals with medical–surgical ICU | 1976 | 38 (1.9) | −0.034 | 0.001 | −0.043 | −0.033 | −0.032 |
Hospitals with obstetric care | 1517 | 462 (23.4) | −0.036 | 0.020 | −0.068 | −0.035 | −0.007 |
Hospitals with rehabilitative care | 1935 | 0 (0.0) | −0.016 | 0.005 | −0.026 | −0.017 | −0.003 |
Hospices | 1976 | 1976 (100.0) | −0.076 | 0.002 | −0.088 | −0.075 | −0.074 |
Other: | |||||||
ED visits c | 1966 | 1885 (95.4) | −0.060 | 0.007 | −0.066 | −0.062 | −0.017 |
Percentage of clinicians who may accept Medicare-approved amounts | 1976 | 1976 (100.0) | −0.080 | 0.002 | −0.083 | −0.081 | −0.066 |
C. Standardized Medicare Payments d (2): | |||||||
FQHC/RHC | 1976 | 0 (0.0) | −0.001 | 0.002 | −0.012 | −0.001 | 0.001 |
E&M services | 489 | 1976 (100.0) | −0.286 | 0.097 | −0.535 | −0.257 | −0.146 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Svynarenko, R.; Kim, H.; Stansberry, T.; Oh, C.; Sarkar, A.; Lindley, L.C. Analysis of Geospatial Variations in Healthcare Across Rural Communities in the US Using Machine Learning. Healthcare 2025, 13, 1504. https://doi.org/10.3390/healthcare13131504
Svynarenko R, Kim H, Stansberry T, Oh C, Sarkar A, Lindley LC. Analysis of Geospatial Variations in Healthcare Across Rural Communities in the US Using Machine Learning. Healthcare. 2025; 13(13):1504. https://doi.org/10.3390/healthcare13131504
Chicago/Turabian StyleSvynarenko, Radion, Hyun Kim, Tracey Stansberry, Changwha Oh, Anujit Sarkar, and Lisa Catherine Lindley. 2025. "Analysis of Geospatial Variations in Healthcare Across Rural Communities in the US Using Machine Learning" Healthcare 13, no. 13: 1504. https://doi.org/10.3390/healthcare13131504
APA StyleSvynarenko, R., Kim, H., Stansberry, T., Oh, C., Sarkar, A., & Lindley, L. C. (2025). Analysis of Geospatial Variations in Healthcare Across Rural Communities in the US Using Machine Learning. Healthcare, 13(13), 1504. https://doi.org/10.3390/healthcare13131504