Next Article in Journal
Exploration of Psychological Resilience during a 25-Day Endurance Challenge in an Extreme Environment
Previous Article in Journal
Spatio-Temporal Effects of Multi-Dimensional Urbanization on Carbon Emission Efficiency: Analysis Based on Panel Data of 283 Cities in China
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Disproportionate Impact of COVID-19 among Undocumented Immigrants and Racial Minorities in the US

by
Mohammad Tawhidul Hasan Bhuiyan
1,†,
Irtesam Mahmud Khan
2,†,
Sheikh Saifur Rahman Jony
1,
Renee Robinson
3,
Uyen-Sa D. T. Nguyen
4,
David Keellings
5,
M. Sohel Rahman
1,‡ and
Ubydul Haque
4,*,‡
1
Department of Computer Science & Engineering, Bangladesh University of Engineering & Technology, West Palasi, Dhaka 1205, Bangladesh
2
Department of Computer Science and Engineering, United International University, Dhaka 1212, Bangladesh
3
Department of Pharmacy Practice and Administration, University of Alaska Anchorage/Idaho State University, Anchorage, AK 99508, USA
4
Department of Biostatistics and Epidemiology, University of North Texas Health Science Center, Fort Worth, TX 76177, USA
5
Department of Geography, University of Florida, Gainesville, FL 32611, USA
*
Author to whom correspondence should be addressed.
Joint first author.
Joint senior author.
Int. J. Environ. Res. Public Health 2021, 18(23), 12708; https://doi.org/10.3390/ijerph182312708
Submission received: 8 September 2021 / Revised: 24 November 2021 / Accepted: 26 November 2021 / Published: 2 December 2021
(This article belongs to the Topic Burden of COVID-19 in Different Countries)

Abstract

:
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the virus responsible for coronavirus disease 2019 (COVID-19), has had an unprecedented effect, especially among under-resourced minority communities. Surveillance of those at high risk is critical for preventing and controlling the pandemic. We must better understand the relationships between COVID-19-related cases or deaths and characteristics in our most vulnerable population that put them at risk to target COVID-19 prevention and management efforts. Population characteristics strongly related to United States (US) county-level data on COVID-19 cases and deaths during all stages of the pandemic were identified from the onset of the epidemic and included county-level socio-demographic and comorbidities data, as well as daily meteorological modeled observation data from the North American Regional Reanalysis (NARR), and the NARR high spatial resolution model to assess the environment. Advanced machine learning (ML) approaches were used to identify outbreaks (geographic clusters of COVID-19) and included spatiotemporal risk factors and COVID-19 vaccination efforts, especially among vulnerable and underserved communities. COVID-19 outcomes were found to be negatively associated with the number of people vaccinated and positively associated with age, the prevalence of cardiovascular disease, diabetes, and the minority population. There was also a strong positive correlation between unauthorized immigrants and the prevalence of COVID-19 cases and deaths. Meteorological variables were also investigated, but correlations with COVID-19 were relatively weak. Our findings suggest that COVID-19 has had a disproportionate impact across the US population among vulnerable and minority communities. Findings also emphasize the importance of vaccinations and tailored public health initiatives (e.g., mask mandates, vaccination) to reduce the spread of COVID-19 and the number of COVID-19 related deaths across all populations.

1. Introduction

The burden of the COVID-19 pandemic, similar to other health conditions, has not been equally distributed and has been particularly detrimental to vulnerable populations, older adults, those with chronic conditions, minority populations, the poor, and those who work essential jobs [1,2]. Strong associations were found between and COVID-19 outcomes and various socio-demographic, socioeconomic, and environmental variables such as race/ethnicity and healthcare access [3,4,5,6]. Nearly 80% of hospitalized patients were over 65 years of age, and the risk of death was 23-times greater in this age group compared with those under 65 years of age [3]. Comorbidities such as cardiovascular disease (CVD), diabetes, obesity, hypertension, and cholesterol levels further increased the risk of COVID-19 fatality [7,8,9,10].
Racial and ethnic minorities in the United States (US) were disproportionately affected by COVID-19, leading to higher morbidity and mortality rates within these groups [11,12,13,14]. A Center for Disease Control (CDC) study concluded minorities in 14 states were disproportionately impacted by COVID-19 [15]. Data compiled from four different sources across seven states indicated that Black communities were disproportionately affected, and race was an independent risk factor for COVID-19 infection and mortality [1]. A study in California found that compared with than non-Hispanic white patients, non-Hispanic African American patients were 2.7 times more likely to be hospitalized, and Alaska Native people had higher death rates (10% in Alaska Native versus 6% Non-Hispanic White population), adjusting for age, sex, comorbidities, and income [11].
The COVID-19 pandemic has also exposed the increased risk of morbidity and mortality to unauthorized segments of the population [12,13,16,17,18]. Health and economic disparities among unauthorized immigrants placed them at increased risk for COVID-19 infections and deaths. The fear of deportation, healthcare access concerns, and environmental challenges (e.g., quarantine issues in multipartite households) can result in a disproportionate number of COVID-19 infections, complications, and deaths in the undocumented population [2,19]. However, COVID-19 infection rates dropped from 9.0% to 4.6% with vaccination, with over 50% reduction among individuals 65 years or older [20]. In the same period, vaccination was associated with a decrease in non-ICU hospitalizations (63.5%), ICU hospitalizations (65.6%), and deaths (69.3%) [20]. In addition to population characteristics, previous studies also suggested cold and dry environments are likely to facilitate the COVID-19 transmission [21,22,23]. For each 1 °C increase in ambient temperature and increased humidity, a decline in daily-confirmed case counts was noted for up to 14 days [24].
Taken together, surveillance of those at high risk of COVID-19 infection and/or mortality is critical for preventing and controlling the pandemic, especially with the emergence of variants and more virulent strains. We hypothesize that there are geographic clusters of COVID-19. These geographic clusters might be associated with spatiotemporal risk factors (e.g., age, sex, race, undocumented immigrants, vaccination, etc.). Advanced knowledge of outbreaks can help tailor COVID-19 mitigation efforts, especially among vulnerable and underserved communities.

2. Methods

Daily reported aggregated data on US county-level COVID-19 cases and deaths from the onset of the epidemic (22 January 2020) to 30 September 2021 were used to establish target variables (outcome variables) [25]. Daily meteorological modeled observation data (1 January 2020 to 30 September 2021) from the North American Regional Reanalysis (NARR), consisting of county-level temperature mean, minimum and maximum, total precipitation, and relative humidity data, and the NARR high spatial resolution model (32 km) was used to assess environmental factors (Table 1) [26] (https://psl.noaa.gov/data/gridded/data.narr.monolevel.html#source, accessed on 1 October 2021). We also used aggregated county-level meteorological features (Table 1), vaccination data, socio-demographic and comorbidities data, and the number of unauthorized people living in each county (January 2020 to 30 September 2021), as features as described in Supplementary Materials Table S1 [27,28,29].

2.1. Time Frames

Except for the (independent) analysis on vaccination data, we divided the whole timeline into three segments, namely, the first wave (from the onset to 30 September 2020), the second wave (1 October 2020 to 11 January 2021), and vaccination (21 January 2021 to 30 September 2021). Notably, the beginning of the second wave was assumed to be from 1 October 2020, based on expert opinions [30,31,32]. In addition to these three time segments, the entire study period is also considered as an overall time segment in our analyses.

2.2. Data Preprocessing

For county-level socio-demographic and comorbidities data, features that vary across populations were normalized using min–max normalization (so that the maximum value is 1 and the minimum is 0), i.e., dividing them by the total population of the respective county. For missing data, the median of the normalized value was used [33]. Meteorological data were preprocessed separately, aligning them with available COVID-19 data for the prevalence of infection and mortality. For each county, when analyzing the prevalence of COVID-19 infection and mortality, we removed data for the days’ field when the number of COVID-19 cases was zero.
Some anomalies were noted in the county-level unauthorized population data (Figure S1). The number of unauthorized populations was unavailable for some of the counties. Rather, the total number of unauthorized populations of a group of counties was available. We used the value for the aggregated unauthorized population for the groups of counties as the value for the unauthorized population for the individual counties included in that aggregated group [34]. Vaccination data and socio-demographic and comorbidities data were available at the county level. Only publicly available, de-identified, unlinked, and aggregated county-level data were used in this study; therefore, Investigational Review Board approval was not required.

2.3. Spearman Rank Order Correlation

The Spearman Rank Order Correlation coefficient is a nonparametric measurement of the relation between two variables. It measures both strength and direction of the association. It is defined as the Pearson correlation between the rank values of two variables. For n samples of two variables X and Y, their Spearman correlation coefficient r s is computed as:
r s = ρ r g X , r g Y = c o v ( r g X , r g Y ) σ r g X σ r g Y
Here,
ρ denotes the Pearson correlation coefficient;
r g X , r g Y denote the rank variables corresponding to X and Y;
c o v ( r g X , r g Y ) is the covariance of the rank variables;
σ r g X , σ r g Y are the standard deviations of the rank variables.
The Spearman correlation coefficient for each feature (predictor) and each outcome variable of our interest was calculated in a county-level manner as follows:
  • The Spearman correlation coefficient between each meteorological feature, and daily COVID-19 prevalence/mortality of the counties was calculated for each time segment. In particular, we present the Spearman Correlation results of mean, maximum, and minimum temperature, total precipitation, and relative humidity against prevalence (Figure S3) and mortality (Figure S4);
  • The Spearman correlation coefficient between the cumulative number of people vaccinated (per hundred) daily (predictor) and daily COVID-19 prevalence/mortality was calculated for the vaccination period;
  • The Spearman correlation coefficient between socio-demographic, comorbidities features, unauthorized population and COVID-19 prevalence/mortality was calculated for each time segment;
  • The Spearman correlation coefficient between each socio-demographic, comorbidities feature, and the number of people vaccinated per hundred (aggregated for the whole study period) was calculated for the vaccination period.
For all analyses, only the predictors with p-value < 0.05 (unadjusted) and a Spearman correlation coefficient > 0.3 were selected. We plotted the Spearman correlation coefficients on a map for analyses A and B.

2.4. Feature Importance

To calculate the importance of each predictor on the outcome variables, Shapley Additive Explanations (SHAP) was used [35]. Shapley Additive Explanations [36] is a method to explain individual predictions. SHAP is used to explain the prediction of an instance x by calculating the contribution of each feature to the prediction. Shapley values show how to distribute the prediction between the features fairly. A variable can be an individual feature value or a group of feature values. Any features with large absolute Shapley values are significant and used to calculate the contribution each feature makes on a single prediction.
In order to analyze the association of socio-demographic and comorbidities data with COVID-19 features, for each period (first wave, second wave, vaccination, and entire study period), we calculated the prevalence of COVID-19 infection and mortality from COVID-19 of each county. We trained several ML models, namely, Random Forest [37], XGBoost [38], and Neural Network, to predict these outcome variables from socio-demographic and comorbidities features. In our analysis, we observed that Random Forest and XGBoost fit the data better than Neural Networks in all cases. Subsequently, we selected the best model and used SHAP to explain the importance and association of each predictor with each outcome variables (Table 2).
Again, to analyze the association between people vaccinated (per hundred) and socio-demographic and comorbidities data, we considered both one dose and two doses vaccinated population as outcome variables. For both of the cases stated above, a summary plot was generated by the SHAP explainer to understand the feature contributions for each time segment using the best model in terms of Mean Squared Error (MSE) and R2 values (Table 2). The higher (in Y-axis) feature is in the plot, the greater is its importance. On the X-axis, we plotted SHapely values for each data point, and on the Y-axis, we plotted the features. The color of each data point indicates the importance of that feature in that particular prediction.
We attempted to identify COVID-19 hotspots in states across the United States over a 542-day timeframe from 6 April 2020 to 30 September 2021. For this purpose, we used Getis-Ord Gi * statistic, which outputs a z-score based on the attribute value and spatial weight [39,40]. Gi statistics is a well-known method to determine hot spots and cold spots in geospatial data. For more details, see the Supplementary Materials (Figure S2).

2.5. Codes, Environment, and Availability

We used both R (version 3.6.1, The R Foundation, Vienna, Austria) and Python (version 3.9.1, Python Software Foundation, Wilmington, DE, USA) for data analysis and model fitting. Several R packages, namely, sf, sp, tmap, tmaptools, RColorBrewer, etc., were used to plot maps. The R package dplyr was used for data analysis. Several Python packages, namely, Pandas, Numpy, Scipy, etc., were used for data preparation and to calculate the Spearman Rank Order Correlation. We used SHAP, scikit-learn, and xgboost packages for calculating feature importance. For plotting and generating images, the packages matplotlib and seaborn were used. All R and python codes were run on a Windows 10 machine with an Intel Core i7-8750H processor and 16 GB RAM. All data and a data dictionary were uploaded as a Supplementary Materials. All code can be found in the Github repository: https://github.com/rizvi23061998/US_DATA_ANALYSIS.git (accessed on 10 October 2021).

3. Results

3.1. Spearman Rank Order Correlation

Spearman correlation coefficients were, for the most part, negative for the associations between cumulative daily number of people vaccinated (per hundred) and prevalence of COVID-19 infections and COVID-19 mortality, especially in the southern regions (Figure 1). The counties that have a higher proportion of the unauthorized population have a highly positive relationship with both COVID-19 prevalence/mortality (Table 3). Therefore, a high percentage of the unauthorized population is attributed to high prevalence/mortality. Table S2 of the Supplementary Materials describes the positive and negative association of each socio-demographic and comorbidities feature with prevalence and mortality.
Against the number of people vaccinated with one dose (per hundred), the Asian population has a positive correlation. As expected, population (per county) and population per square mile have a positive correlation; in contrast, CVD and Diabetes have a negative correlation (Table 4). It is to be noted that people vaccinated (per hundred) with two doses show almost identical correlation values. That is why only the analysis on the vaccinated population with one dose is provided.
Our analysis on socio-demographic and comorbidities features reveals different behavior in different time segments when prevalence is used as the outcome variable. During the first wave, we see a positive association with the African American population and a negative association with the White population (Figure 2, Supplementary Materials Table S3). The relation flips during the second wave and subsequently restores for the African American population in the vaccination period. Next, the prevalence of CVD has a positive relation (most impactful feature over the entire study period). The age group of 45 to 64 years mostly shows a negative association, and the age group of ≥85 years has a positive association before the vaccination period and negative afterward. The age group of <5 years and 25 to 34 show positive associations. The multiracial population shows a negative association (second most impactful feature over the entire study period). Hispanic and Native American populations mostly have a positive association. In contrast, the Asian population has a negative association except during the first wave (Figure 2). Mostly different results are observed when mortality was used as the outcome variable. A female-headed household with children has a significantly positive association (most impactful feature over the entire study period). African American and Hispanic populations have a positive association, whereas the Asian population has a negative association. Except for the first wave, CVD and the prevalence of diabetes were found to have a positive association. Both mortality and prevalence show a similar association with the age groups. The older population also seems to have a positive correlation. The feature importance of the age group of ≥85 years decreased significantly during the vaccination period. The age group 45 to 54 years has negative relation (Figure 3).

3.2. Analysis of Vaccination Data

From our analysis of vaccination data against socio-demographic and comorbidities features, we observed almost identical results for a vaccinated population with one and two doses. The high importance of different races is noticed. Hispanic population shows a negative association, whereas Asian, Native American, Multi-racial, and Hawaiian and Pacific Islander population show positive association. Comorbidities such as CVD and Diabetes have a negative association. The age group of 35 to 54 and the age group of ≥85 show positive association, but, interestingly, the age group 65 to 74 shows a negative association (Figure 4).
From our hotspot analysis, we found that the spread of COVID-19 first started at the upper east coast. Then by 14 May 2020, the whole east coast was exposed and predicted as a hotspot, and moved west gradually. Until 6 June 2020, the whole West Coast was a cold spot, i.e., there were very little to no cases of COVID-19 until 23 June 2020, the hotspot moved from the east coast to the lower west coast through Texas and other lower states. By 19 July 2021, the whole West Coast was predicted as a hotspot, while the whole East Coast was predicted as a cold spot. The hotspot was located on the West coast and lower parts until 9 October 2020, and then, the hotspot moved to the center and again to the east coast. By 14 November 2020, the whole country was a hotspot. Then the hotspot moved again to the lower west coast by 1 January 2021, while most of the East Coast was cold spots. By 17 February 2021, the hotspot moved to the lower right part of the US. Then, the hotspot was located again on the entire east coast by 2 March 2021, when the West Coast was a cold spot. By 7 June 2021, the hotspot moved to the lower part of the US, and by 20 June 2021, it moved to the West Coast and fluctuated from the lower part to West Coast until the end (30 September 2021). See Figure S2 for the choropleth diagram of hotspots in the US.

4. Discussion

Our results are consistent with other published studies [16,41,42,43,44]. COVID-19 prevalence and mortality are significantly high in counties living a high proportion of the unauthorized population and minority communities, in communities with a high prevalence of CVD and diabetes, and among elders [4,12,13,19,45]. Even before the pandemic, access to healthcare facilities was typically low among undocumented immigrants [41]. Barriers such as cost, lack of knowledge of resources, and fear of deportation are among those noted [14]. Extending government assistance is important, and perhaps creating a low-cost “insurance plan” for uninsured individuals regardless of citizenship status would be beneficial if barriers are removed [42]. Further mistrusts on the public-healthcare-for-all concept, with no exception to unauthorized populations, will negate the effectiveness of all identified resolutions. It is crucial to overcome language barriers, resource scarcity, and predatory job sites to seek immediate care for any potential symptoms or even preventive services, and should not tie with the negative impact on future adjustment to their immigration status [16]. Educating communities about available resources in a culturally appropriate manner (through community health workers who speak the same language, can relate to cultural beliefs, etc.) is important to promoting healthcare utilization, reduce risk, provide much-needed service to immigrants, and ensure all Americans are safe [46]. Creating programs that reach out to vulnerable undocumented communities can help to build trust with these communities and relay information that can help to curb the spread of COVID-19.
The findings of this study are also consistent with previous findings that show the infection rate is higher in counties where ethnic minorities such as Black, Hispanic, Alaska Natives, and Hawaiian/Pacific Islanders are higher compared with the White population [1]. A positive association with prevalence (except for the second wave) was seen higher in counties where the Black community has a high population, whereas in the counties where the White community has a higher proportion of population showed an exact inverse relationship. A more rigorous surveillance system, vaccination coverage, promotion of education to fight against vaccine hesitancy, and mask mandates may help curb the spread of infection among minority communities [47,48,49].
Our study, similar to other published studies, suggests counties that have a high proportion of an older population (≥85 years of age group) had a higher prevalence of COVID-19 and higher mortality throughout the four time segments [50,51,52]. However, it is important to note that counties generally composed of older adults were more likely to be vaccinated. Our findings suggest that as the number of vaccinations increases, the prevalence and mortality decrease in the county; however, no direct association was observed between climatic parameters, COVID-19 prevalence, and mortality. However, in counties where more people had a greater number of comorbidities, in particular CVD and/or diabetes (Supplementary Materials Table S1), individuals were at greater risk of morbidity and mortality [3,7,8,9]. Vaccinations were shown to decrease risk in this vulnerable population; therefore, steps need to be taken to increase vaccinations to prevent mortality and morbidity.
Lastly, as expected, the correlation between mean, minimum, and maximum temperature and COVID-19 prevalence follows the same pattern (Figure S3). From our hotspot analysis, it seemed that COVID-19 started to spread from the East Coast and gradually spread to the lower part, and then went to the West Coast. It fluctuated from the East Coast to the West Coast only through the lower part (Texas, NM, USA), but to our surprise, it rarely traveled northward, possibly due to low traffic and rural environment limiting spread.
Relative humidity showed a positive correlation on the eastern and western regions during vaccination, a negative correlation on western and northern middle regions when the whole period is considered, and little to no correlation during other time segments.
There might have some reporting bias, such as misreporting and lack of reporting by various states and counties. This study used aggregated county-level data. The observed associations between COVID-19 prevalence and mortality and the risk factors were ecologic and not at the level of individuals [53,54]. Our findings suggest broad relationships between county-level population characteristics and the impacts of COVID-19, but these associations are likely confounded by many variables not captured by this study, and a causal or individual-level interpretation should not be made.

5. Conclusions

This study applies several ML models and spatiotemporal approaches to support rapid detection of geographic clusters of COVID-19 outbreaks that may focus on COVID-19 mitigation and management strategies on a county level. Strong correlations were found between both COVID-19 cases and deaths with the number of people vaccinated, the prevalence of CVD, and diabetes, age, and minority population. There was also a strong relationship between the number of unauthorized immigrants and the prevalence of COVID-19 cases and deaths. These findings confirm that already vulnerable populations were disproportionately impacted by COVID-19 and that public health initiatives need to focus on targeting and tailoring initiatives that promote health equity, and the fundamental changes are required on a system level.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/ijerph182312708/s1, Table S1: Description of socio-demographic and comorbidities features (Source: https://www.arcgis.com/home/item.html?id=8e5c3c6e1fa94e379553e199dcc4e777&view=list#data accessed on 1 October 2021). Table S2: Association of socio-demographic and comorbidities features with COVID-19 prevalence/mortality (Only features with a spearman coefficient of >0.3 with a 5% significance level were selected). Table S3: Supplementary Materials: Impact of socio-demographic and comorbidities features on prevalence/mortality (only features with high importance were chosen). Figure S1: Spatial distribution of unauthorized population in the US. Figure S2: Geographic hotspots of COVID-19 in the USA. Figure S3: Spearman correlation values were calculated for each meteorological feature with the daily prevalence (at a county-level). We conducted this for each time segment and plotted the values of each county on the map. In the figure, indices (a–d) indicate plots for different time segments: (a) first wave, (b) second wave, (c) vaccination period, and (d) the entire study period. Figure S4: Spearman correlation values were calculated for each meteorological feature with the daily mortality (in a county-wise manner). We conducted this for each time segment and plotted the values of each county on the map. In the figure, indices (a–d) indicate plots for different time segments: (a) first wave, (b) second wave, (c) vaccination period, and (d) the entire study period.

Author Contributions

Conceived and designed the research: U.H. Data preparation: U.H., M.T.H.B., I.M.K. and D.K. contributed to weather data preparation. Drafted the manuscript: U.H., M.T.H.B. and M.S.R. Analyzed and interpreted the data: M.T.H.B., I.M.K., M.S.R. and U.H. Assisted in analysis: S.S.R.J. Critically revised the draft: R.R., U.-S.D.T.N. and D.K. All authors contributed in writing. U.H. and M.S.R. jointly supervised the project. All authors have read and agreed to the published version of the manuscript.

Funding

No external funding was available for this study. U.H. was supported by the Research Council of Norway (grant # 281077).

Institutional Review Board Statement

Not applicable since we used publicly available data.

Informed Consent Statement

Patient consent was waived since we used de-identified delinked publicly available data.

Data Availability Statement

All data were uploaded as Supplementary Materials, and codes were posted in the Github repository.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Mude, W.; Oguoma, V.M.; Nyanhanda, T.; Mwanri, L.; Njue, C. Racial disparities in COVID-19 pandemic cases, hospitalisations, and deaths: A systematic review and meta-analysis. J Glob Health 2021, 11, 05015. [Google Scholar] [CrossRef]
  2. Shadmi, E.; Chen, Y.; Dourado, I.; Faran-Perach, I.; Furler, J.; Hangoma, P.; Hanvoravongchai, P.; Obando, C.; Petrosyan, V.; Rao, K.D.; et al. Health equity and COVID-19: Global perspectives. Int. J. Equity Health 2020, 19, 104. [Google Scholar] [CrossRef]
  3. Mueller, A.L.; McNamara, M.S.; Sinclair, D.A. Why does COVID-19 disproportionately affect older people? Aging 2020, 12, 9959–9981. [Google Scholar] [CrossRef]
  4. Shah, G.H.; Shankar, P.; Schwind, J.S.; Sittaramane, V. The Detrimental Impact of the COVID-19 Crisis on Health Equity and Social Determinants of Health. J. Public Health Manag. Pract. 2020, 26, 317–319. [Google Scholar] [CrossRef]
  5. Shrestha, N.; Shad, M.Y.; Ulvi, O.; Khan, M.H.; Karamehic-Muratovic, A.; Nguyen, U.D.T.; Baghbanzadeh, M.; Wardrup, R.; Aghamohammadi, N.; Cervantes, D.; et al. The impact of COVID-19 on globalization. One Health 2020, 11, 100180. [Google Scholar] [CrossRef] [PubMed]
  6. Peeri, N.C.; Shrestha, N.; Rahman, M.S.; Zaki, R.; Tan, Z.; Bibi, S.; Baghbanzadeh, M.; Aghamohammadi, N.; Zhang, W.; Haque, U. The SARS, MERS and novel coronavirus (COVID-19) epidemics, the newest and biggest global health threats: What lessons have we learned? Int. J. Epidemiol. 2020, 49, 717–726. [Google Scholar] [CrossRef] [Green Version]
  7. Kang, Y.; Chen, T.; Mui, D.; Ferrari, V.; Jagasia, D.; Scherrer-Crosbie, M.; Chen, Y.; Han, Y. Cardiovascular manifestations and treatment considerations in COVID-19. Heart 2020, 106, 1132–1141. [Google Scholar] [CrossRef]
  8. Zaki, N.; Alashwal, H.; Ibrahim, S. Association of hypertension, diabetes, stroke, cancer, kidney disease, and high-cholesterol with COVID-19 disease severity and fatality: A systematic review. Diabetes Metab. Syndr. 2020, 14, 1133–1142. [Google Scholar] [CrossRef] [PubMed]
  9. Rajpal, A.; Rahimi, L.; Ismail-Beigi, F. Factors leading to high morbidity and mortality of COVID-19 in patients with type 2 diabetes. J. Diabetes 2020, 12, 895–908. [Google Scholar] [CrossRef]
  10. Aggarwal, G.; Lippi, G.; Lavie, C.J.; Henry, B.M.; Sanchis-Gomar, F. Diabetes mellitus association with coronavirus disease 2019 (COVID-19) severity and mortality: A pooled analysis. J. Diabetes 2020, 12, 851–855. [Google Scholar] [CrossRef] [PubMed]
  11. Tai, D.B.G.; Shah, A.; Doubeni, C.A.; Sia, I.G.; Wieland, M.L. The Disproportionate Impact of COVID-19 on Racial and Ethnic Minorities in the United States. Clin. Infect. Dis. 2021, 72, 703–706. [Google Scholar] [CrossRef] [PubMed]
  12. Stokes, E.K.; Zambrano, L.D.; Anderson, K.N.; Marder, E.P.; Raz, K.M.; Felix, S.E.; Tie, Y.; Fullerton, K.E. Coronavirus Disease 2019 Case Surveillance—United States, January 22–May 30, 2020. MMWR Morb. Mortal. Wkly. Rep. 2020, 69, 759–765. [Google Scholar] [CrossRef] [PubMed]
  13. Killerby, M.E.; Link-Gelles, R.; Haight, S.C.; Schrodt, C.A.; England, L.; Gomes, D.J.; Shamout, M.; Pettrone, K.; O’Laughlin, K.; Kimball, A.; et al. Characteristics Associated with Hospitalization Among Patients with COVID-19—Metropolitan Atlanta, Georgia, March–April 2020. MMWR Morb. Mortal. Wkly. Rep. 2020, 69, 790–794. [Google Scholar] [CrossRef]
  14. Rossen, L.M.; Ahmad, F.B.; Anderson, R.N.; Branum, A.M.; Du, C.; Krumholz, H.M.; Li, S.-X.; Lin, Z.; Marshall, A.; Sutton, P.D.; et al. Disparities in Excess Mortality Associated with COVID-19—United States, 2020. MMWR Morb. Mortal. Wkly. Rep. 2021, 70, 1114–1119. [Google Scholar] [CrossRef]
  15. Van Dyke, M.E.; Mendoza, M.C.; Li, W.; Parker, E.M.; Belay, B.; Davis, E.M.; Quint, J.J.; Penman-Aguilar, A.; Clarke, K.E.N. Racial and Ethnic Disparities in COVID-19 Incidence by Age, Sex, and Period Among Persons Aged <25 Years—16 U.S. Jurisdictions, January 1–December 31, 2020. MMWR Morb. Mortal. Wkly. Rep. 2021, 70, 382–388. [Google Scholar] [CrossRef]
  16. Page, K.R.; Venkataramani, M.; Beyrer, C.; Polk, S. Undocumented U.S. Immigrants and Covid-19. N. Engl. J. Med. 2020, 382, e62. [Google Scholar] [CrossRef] [PubMed]
  17. Galvan, T.; Lill, S.; Garcini, L.M. Another Brick in the Wall: Healthcare Access Difficulties and Their Implications for Undocumented Latino/a Immigrants. J. Immigr. Minor. Health 2021, 23, 885–894. [Google Scholar] [CrossRef] [PubMed]
  18. Bozio, C.H.; Grannis, S.J.; Naleway, A.L.; Ong, T.C.; Butterfield, K.A.; DeSilva, M.B.; Natarajan, K.; Yang, D.; Rao, S.; Klein, N.P.; et al. Laboratory-Confirmed COVID-19 among Adults Hospitalized with COVID-19–Like Illness with Infection-Induced or mRNA Vaccine-Induced SARS-CoV-2 Immunity—Nine States, January–September 2021. MMWR Morb. Mortal. Wkly. Rep. 2021, 70, 1539–1544. [Google Scholar] [CrossRef] [PubMed]
  19. Clark, E.; Fredricks, K.; Woc-Colburn, L.; Bottazzi, M.E.; Weatherhead, J. Disproportionate impact of the COVID-19 pandemic on immigrant communities in the United States. PLoS Negl. Trop. Dis. 2020, 14, e0008484. [Google Scholar] [CrossRef]
  20. Moghadas, S.M.; Vilches, T.N.; Zhang, K.; Wells, C.R.; Shoukat, A.; Singer, B.H.; Meyers, L.A.; Neuzil, K.M.; Langley, J.M.; Fitzpatrick, M.C.; et al. The impact of vaccination on COVID-19 outbreaks in the United States. Clin. Infect. Dis. 2021. [Google Scholar] [CrossRef]
  21. Fu, S.; Wang, B.; Zhou, J.; Xu, X.; Liu, J.; Ma, Y.; Li, L.; He, X.; Li, S.; Niu, J.; et al. Meteorological factors, governmental responses and COVID-19: Evidence from four European countries. Environ. Res. 2021, 194, 110596. [Google Scholar] [CrossRef]
  22. Kerr, G.H.; Badr, H.S.; Gardner, L.M.; Perez-Saez, J.; Zaitchik, B.F. Associations between meteorology and COVID-19 in early studies: Inconsistencies, uncertainties, and recommendations. One Health 2021, 12, 100225. [Google Scholar] [CrossRef]
  23. Khan, I.M.; Haque, U.; Zhang, W.; Zafar, S.; Wang, Y.; He, J.; Sun, H.; Lubinda, J.; Rahman, M.S. COVID-19 in China: Risk Factors and R0 Revisited. Acta Trop. 2021, 213, 105731. [Google Scholar] [CrossRef] [PubMed]
  24. Liu, J.; Zhou, J.; Yao, J.; Zhang, X.; Li, L.; Xu, X.; He, X.; Wang, B.; Fu, S.; Niu, T.; et al. Impact of meteorological factors on the COVID-19 transmission: A multi-city study in China. Sci. Total Environ. 2020, 726, 138513. [Google Scholar] [CrossRef]
  25. Vollert, N. Gaussian Process Surrogates with Increased Flexibility For Computer Simulation Output. Ph.D. Thesis, AAU Klagenfurt, Klagenfurt, Austria, 2020. [Google Scholar]
  26. Mesinger, F.; DiMego, G.; Kalnay, E.; Mitchell, K.; Shafran, P.C.; Ebisuzaki, W.; Jović, D.; Woollen, J.; Rogers, E.; Berbery, E.H.; et al. North American regional reanalysis. Bull. Am. Meteorol. Soc. 2006, 87, 343–360. [Google Scholar] [CrossRef] [Green Version]
  27. Gramacy, R.B. Surrogates. Gaussian Process Modeling, Design, and Optimization for the Applied Sciences; Chapman and Hall/CRC: Boca Raton, FL, USA, 2020. [Google Scholar]
  28. Shabbir, W.; Pilz, J. Bayesian Spatio-temporal Analysis for Dengue Fever in Major Cities of Pakistan (2006–2017). In Proceedings of the 12th RSEP International Social Sciences Conference, Barcelona, Spain, 23–25 April 2019; Veysel, M., Chodnicka-Jaworska, P., Eds.; pp. 1–9, ISBN 978-605-80676-0-8. ISSN 2547-9385. [Google Scholar]
  29. Wikle, C.K.; Zammit-Mangion, A.; Cressie, N. Spatio-Temporal Statistics with R; Chapman and Hall/CRC: Boca Raton, FL, USA, 2019. [Google Scholar]
  30. Tsang, T.K.; Wu, P.; Lin, Y.; Lau, E.H.Y.; Leung, G.M.; Cowling, B.J. Effect of changing case definitions for COVID-19 on the epidemic curve and transmission parameters in mainland China: A modelling study. Lancet Public Health 2020, 5, e289–e296. [Google Scholar] [CrossRef]
  31. Griffin, S. Covid-19: Second wave death rate is doubling fortnightly but is lower and slower than in March. BMJ 2020, 371, m4092. [Google Scholar] [CrossRef] [PubMed]
  32. Iftimie, S.; Lopez-Azcona, A.F.; Vallverdu, I.; Hernandez-Flix, S.; de Febrer, G.; Parra, S.; Hernandez-Aguilera, A.; Riu, F.; Joven, J.; Andreychuk, N.; et al. First and second waves of coronavirus disease-19: A comparative study in hospitalized patients in Reus, Spain. PLoS ONE 2021, 16, e0248029. [Google Scholar] [CrossRef]
  33. Rothman, K.J. No adjustments are needed for multiple-comparisons. Epidemiology 1990, 1, 43–47. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Batalova, J.; Hooker, S.; Capps, R. DACA at the Two-Year Mark: A National and State Profile of Youth Eligible and Applying for Deferred Action; MPI: Washington, DC, USA, 2014; Available online: https://www.migrationpolicy.org/research/daca-two-year-mark-national-and-state-profile-youth-eligible-and-applying-deferred-action (accessed on 4 November 2021).
  35. Lundberg, S.M.; Lee, S.-I. A Unified Approach to Interpreting Model Predictions. Available online: https://arxiv.org/pdf/1705.07874.pdf (accessed on 4 November 2021).
  36. Shapley, L.S. 17. A Value for n-Person Games. In Contributions to the Theory of Games (AM-28); Princeton University Press: Princeton, NJ, USA, 2016; Volume II, pp. 307–318. [Google Scholar]
  37. Appice, A.; Gel, Y.R.; Iliev, I.; Lyubchich, V.; Malerba, D. A Multi-Stage Machine Learning Approach to Predict Dengue Incidence: A Case Study in Mexico. IEEE Access 2020, 8, 52713–52725. [Google Scholar] [CrossRef]
  38. Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; Association for Computing Machinery: New York, NY, USA, 2016; pp. 785–794. [Google Scholar]
  39. Ord, J.K.; Getis, A. Local spatial autocorrelation statistics: Distributional issues and an application. Geogr. Anal. 1995, 27, 286–306. [Google Scholar] [CrossRef]
  40. Esri: How Hot Spot Analysis (Getis-Ord Gi*) Works. Available online: http://help.arcgis.com/en/arcgisdesktop/10.0/help/index.html#/How_Hot_Spot_Analysis_Getis_Ord_Gi_works/005p00000011000000/] (accessed on 1 November 2021).
  41. Hacker, K.; Anies, M.; Folb, B.L.; Zallman, L. Barriers to health care for undocumented immigrants: A literature review. Risk Manag. Healthc. Policy 2015, 8, 175–183. [Google Scholar] [CrossRef] [Green Version]
  42. Wallace, S.P.; Rodriguez, M.; Padilla-Frausto, I.; Arredondo, A.; Orozco, E. Improving access to health care for undocumented immigrants in the United States. Salud Publica Mex. 2013, 55, S508–S514. [Google Scholar] [CrossRef] [Green Version]
  43. Maldonado-Cabrera, A.; Angulo-Molina, A.; Haque, U.; Velazquez, C.; Alvarez-Villasenor, A.S.; Santacruz-Gomez, K.J.; Gallego-Hernandez, A.L. Acute Inflammatory Mediators in Young Adult Patients with COVID-19 in Mexico. Pathogens 2021, 10, 1056. [Google Scholar] [CrossRef] [PubMed]
  44. Mangla, S.; Pathak, A.K.; Arshad, M.; Ghosh, D.; Sahoo, P.K.; Garg, V.K.; Haque, U. Impact of Environmental Indicators on the COVID-19 Pandemic in Delhi, India. Pathogens 2021, 10, 1003. [Google Scholar] [CrossRef]
  45. Hill, J.; Rodriguez, D.X.; McDaniel, P.N. Immigration status as a health care barrier in the USA during COVID-19. J. Migr. Health 2021, 4, 100036. [Google Scholar] [CrossRef]
  46. Page, K.R.; Flores-Miller, A. Lessons We’ve Learned—Covid-19 and the Undocumented Latinx Community. N. Engl. J. Med. 2021, 384, 5–7. [Google Scholar] [CrossRef] [PubMed]
  47. Mangla, S.; Zohra Makkia, F.T.; Pathak, A.K.; Robinson, R.; Sultana, N.; Koonisetty, K.S.; Karamehic-Muratovic, A.; Nguyen, U.S.D.; Rodriguez-Morales, A.J.; Sanchez-Duque, J.A.; et al. COVID-19 Vaccine Hesitancy and Emerging Variants: Evidence from Six Countries. Behav. Sci. 2021, 11, 148. [Google Scholar] [CrossRef]
  48. Jony, S.S.R.; Haque, U.; Webb, N.J.; Spence, E.; Rahman, M.S.; Aghamohammadi, N.; Lie, Y.; Angulo-Molina, A.; Ananth, S.; Ren, X.; et al. Analyzing Predictors of Control Measures and Psychosocial Problems Associated with COVID-19 Pandemic: Evidence from Eight Countries. Behav. Sci. 2021, 11, 106. [Google Scholar] [CrossRef] [PubMed]
  49. Rahman, M.S.; KaramehicMuratovic, A.; Amrin, M.; Chowdhury, A.H.; Mondol, M.S.; Haque, U.; Ali, P. COVID-19 Epidemicin Bangladesh among Rural and Urban Residents: An Online Cross-Sectional Survey of Knowledge, Attitudes, and Practices. Epidemiologia 2021, 2, 1–13. [Google Scholar] [CrossRef]
  50. Polverino, F.; Stern, D.A.; Ruocco, G.; Balestro, E.; Bassetti, M.; Candelli, M.; Cirillo, B.; Contoli, M.; Corsico, A.; D’Amico, F.; et al. Comorbidities, Cardiovascular Therapies, and COVID-19 Mortality: A Nationwide, Italian Observational Study (ItaliCO). Front. Cardiovasc. Med. 2020, 7, 585866. [Google Scholar] [CrossRef] [PubMed]
  51. Michelozzi, P.; de’Donato, F.; Scortichini, M.; Pezzotti, P.; Stafoggia, M.; De Sario, M.; Costa, G.; Noccioli, F.; Riccardo, F.; Bella, A.; et al. Temporal dynamics in total excess mortality and COVID-19 deaths in Italian cities. BMC Public Health 2020, 20, 1238. [Google Scholar] [CrossRef] [PubMed]
  52. Ioannidis, J.P.A.; Axfors, C.; Contopoulos-Ioannidis, D.G. Population-level COVID-19 mortality risk for non-elderly individuals overall and for non-elderly individuals without underlying diseases in pandemic epicenters. Environ. Res. 2020, 188, 109890. [Google Scholar] [CrossRef] [PubMed]
  53. Fawcett, S.; Holt, C.; Schultz, J.; Rabinowitz, P. Section 22: Using small area analysis to uncover disparities. In Chapter 3: Assessing Community Needs and Resources; Published 2016; Available online: http://ctb.ku.edu/en/table-of-contents/assessment/assessing-community-needs-and-resources/small-area-analysis/main. (accessed on 8 May 2017).
  54. Talbot, T.O.; Haley, V.B.; Dimmick, W.F.; Paulu, C.; Talbott, E.O.; Rager, J. Developing consistent data and methods to measure the public health impacts of ambient air quality for Environmental Public Health Tracking: Progress to date and future directions. Air Qual. Atmos. Health 2009, 2, 199–206. [Google Scholar] [CrossRef] [Green Version]
Figure 1. The cumulative number of people vaccinated (per hundred) daily with daily prevalence and mortality (in a county-wise manner). (a). Spearman correlation between daily number of cases and cumulative number of people vaccinated daily, (b). Spearman correlation between daily number of deaths and cumulative number of people vaccinated daily.
Figure 1. The cumulative number of people vaccinated (per hundred) daily with daily prevalence and mortality (in a county-wise manner). (a). Spearman correlation between daily number of cases and cumulative number of people vaccinated daily, (b). Spearman correlation between daily number of deaths and cumulative number of people vaccinated daily.
Ijerph 18 12708 g001
Figure 2. We used SHAP to understand the impact of different features on the prevalence of COVID-19 and described the results in this figure. Shapely values were calculated for each socio-demographic and comorbidities-related feature. A summary was plotted for each time segment. In the figure, indices (ad) indicate plots for different time segments: (a) first wave, (b) second wave, (c) vaccination period, and (d) the entire study period. In the plot, a higher position implies greater importance for a feature (the actual values in the Y-axis do not have any significance).
Figure 2. We used SHAP to understand the impact of different features on the prevalence of COVID-19 and described the results in this figure. Shapely values were calculated for each socio-demographic and comorbidities-related feature. A summary was plotted for each time segment. In the figure, indices (ad) indicate plots for different time segments: (a) first wave, (b) second wave, (c) vaccination period, and (d) the entire study period. In the plot, a higher position implies greater importance for a feature (the actual values in the Y-axis do not have any significance).
Ijerph 18 12708 g002
Figure 3. We used SHAP to understand the impact of different features on mortality and described the results in this figure. Shapely values were calculated for each socio-demographic and comorbidities-related feature. A summary was plotted for each time segment. In the figure, indices (ad) indicate plots for different time segments: (a) first wave, (b) second wave, (c) vaccination period, and (d) the entire study period. In the plot, a higher position implies greater importance for a feature.
Figure 3. We used SHAP to understand the impact of different features on mortality and described the results in this figure. Shapely values were calculated for each socio-demographic and comorbidities-related feature. A summary was plotted for each time segment. In the figure, indices (ad) indicate plots for different time segments: (a) first wave, (b) second wave, (c) vaccination period, and (d) the entire study period. In the plot, a higher position implies greater importance for a feature.
Ijerph 18 12708 g003
Figure 4. We used SHAP to understand the impact of different features on the number of people vaccinated (per hundred) and described the results in this figure. Shapely values were calculated for each socio-demographic and comorbidities-related feature. A summary is plotted here. Here, a higher position implies greater importance for a feature. We used both (a) vaccinated population with one dose and (b) vaccinated population with two doses as outcome variables.
Figure 4. We used SHAP to understand the impact of different features on the number of people vaccinated (per hundred) and described the results in this figure. Shapely values were calculated for each socio-demographic and comorbidities-related feature. A summary is plotted here. Here, a higher position implies greater importance for a feature. We used both (a) vaccinated population with one dose and (b) vaccinated population with two doses as outcome variables.
Ijerph 18 12708 g004
Table 1. Description of meteorological features.
Table 1. Description of meteorological features.
FeatureDescription
Mean TemperatureThe average temperature of the day in degrees Celsius
Min TemperatureMinimum temperature of the day in degree Celsius
Max TemperatureMaximum temperature of the day in degree Celsius
Total PrecipitationTotal precipitation of the day in millimeter
Relative HumidityRelative humidity of the day in percentage
Table 2. Best models for Feature Importance calculations.
Table 2. Best models for Feature Importance calculations.
Outcome VariableTime SegmentBest Model
COVID-19 prevalenceFirst waveRandom Forest with 500 estimators
Second waveRandom Forest with 500 estimators
VaccinationXGBoost with 300 estimators
The entire study periodXGBoost with 300 estimators
COVID-19 mortalityFirst waveRandom Forest with 500 estimators
Second waveRandom Forest with 300 estimators
VaccinationXGBoost with 150 estimators
The entire study periodXGBoost with 180 estimators
People vaccinated with one dose (per hundred)VaccinationXGBoost with 700 estimators
People vaccinated with two doses (per hundred)XGBoost with 550 estimators
Table 3. Spearman coefficient of prevalence and mortality against the unauthorized population.
Table 3. Spearman coefficient of prevalence and mortality against the unauthorized population.
Feature (County Level)Time SegmentSpearman Correlation Coefficient
COVID-19 prevalenceFirst wave0.83
Second wave0.75
Vaccination0.72
The entire study period0.77
COVID-19 mortalityFirst wave0.74
Second wave0.68
Vaccination0.71
The entire study period0.75
All p-values are <0.01.
Table 4. Spearman correlation between each predictor and people vaccinated with one dose per hundred (only predictors with spearman coefficient > 0.3 with a p-value < 0.05 were selected).
Table 4. Spearman correlation between each predictor and people vaccinated with one dose per hundred (only predictors with spearman coefficient > 0.3 with a p-value < 0.05 were selected).
PredictorSpearman Coefficient
Asian 0.42
Population 0.40
CVD−0.35
Population per sq. mile 0.31
Diabetes−0.30
All p-values are <0.01.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Hasan Bhuiyan, M.T.; Mahmud Khan, I.; Rahman Jony, S.S.; Robinson, R.; Nguyen, U.-S.D.T.; Keellings, D.; Rahman, M.S.; Haque, U. The Disproportionate Impact of COVID-19 among Undocumented Immigrants and Racial Minorities in the US. Int. J. Environ. Res. Public Health 2021, 18, 12708. https://doi.org/10.3390/ijerph182312708

AMA Style

Hasan Bhuiyan MT, Mahmud Khan I, Rahman Jony SS, Robinson R, Nguyen U-SDT, Keellings D, Rahman MS, Haque U. The Disproportionate Impact of COVID-19 among Undocumented Immigrants and Racial Minorities in the US. International Journal of Environmental Research and Public Health. 2021; 18(23):12708. https://doi.org/10.3390/ijerph182312708

Chicago/Turabian Style

Hasan Bhuiyan, Mohammad Tawhidul, Irtesam Mahmud Khan, Sheikh Saifur Rahman Jony, Renee Robinson, Uyen-Sa D. T. Nguyen, David Keellings, M. Sohel Rahman, and Ubydul Haque. 2021. "The Disproportionate Impact of COVID-19 among Undocumented Immigrants and Racial Minorities in the US" International Journal of Environmental Research and Public Health 18, no. 23: 12708. https://doi.org/10.3390/ijerph182312708

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop