Abstract
Background: COVID-19 efforts were often ineffective in controlling the spread of the pandemic. Thus, identifying ineffective controls during a pandemic is vital. Method: Utilizing publicly available data on COVID deaths in the counties of US states, we created an index to capture and interpret ineffectiveness in the efforts to reduce the spread of the pandemic in US counties. This index is based on the Intervened Poisson Distribution (IPD) introduced originally by Shanmugam. Motivation for the research idea occurred when we noticed the data dispersion of the COVID deaths was smaller than the average only in some counties. Under-dispersed data is common in statistical modeling. A novel approach we adapted in this article includes the estimation of an intervention parameter estimated through iterative non-linear optimization. Results: Twenty-five counties in California, Idaho, Minnesota, Mississippi, Montana, Nebraska, North Carolina, North Dakota, Texas, and Utah were found to be ineffective in controlling for fatalities based on the expected probability distribution. A review of the policies enacted in these areas would provide insight into ineffective prevention efforts, and some of these issues are documented in the current literature. Conclusions: The IPD index an innovate way to document efficacy of interventions during pandemics.
Keywords:
positive poisson distribution; under dispersion; bayesian analysis; prediction; index of infectivity MSC:
6204; 62E10; 62F30; 62P10
1. Introduction
As of September 2022, the SARS-CoV2 (COVID-19) pandemic remains a public health concern for the United States of America and the world in general [1]. The number of COVID-19 cases worldwide as of September 2022 was over 611 million with an associated 6.5 million casualties [2]. About 16% of the worldwide cases and fatalities occurred in the United States [2]. Public health professionals all over the world implemented measures intended to reduce death and suffering from COVID-19 [3,4,5]. The efficacy of those measures is something that is not well understood as the efforts varied from county to county, but ongoing work in this area is identifying best practices [6]. Thus, an index that evaluates intervention efficacy can be used post hoc or even during the middle of an epidemic to help facilitate best practice identification and control the spread of disease.
Developing appropriate models for evaluating efficacy of interventions by region is a necessary step to evaluate best practice. Previous studies have identified both state and county variations in response to the epidemic [7]; however, no local efficacy index exists in the literature, rather only country-level assessments which may have limited regional importance [8]. This work addresses that issue directly.
We propose a probability model as an abstraction of the reality for U.S. counties with respect to COVID deaths. Fatality rates themselves provide an indicator of the severity of the disease. However, these rates do not consider interventional controls employed that may have affected the severity and outcomes. Our models address intervention as an unobserved (latent) variable. This research article constructs an appropriate probability model and applies statistical concepts to develop an index to portray the inefficacy level of public health policy to stop or at least reduce the COVID death in some US counties. A novelty in this article is the use of the intervention parameter, estimated using nonlinear optimization implemented in R Statistical Software [9]. We apply the index to categorize a qualifying subset of counties throughout the United States. Extensions of this research are also proffered.
2. Methods
2.1. Model Specification
In an outbreak of an epidemic (appearance of illness among a large number of people), endemic (regularly occurring illness), or pandemic (contagious deadly illness), public health professionals must collect and analyze data to disseminate information necessary to control disease spread and severity. COVID-19 is such a pandemic. Once a SARS-CoV2-related death at a time epoch is reported, the data collection apparatus is activated and efforts to contain/reduce the pandemic are initiated. Let be a random number of COVID deaths at a time the data collection apparatus is activated. The domain for is the set. With an inclusion of zero, the underlying model for the data would have been Poisson with an unknown, finite mortality rate, . As zero is not a possibility for, it is reasonable to assume that the model for is a positive Poisson probability distribution (Equation (1)).
Often, public health professionals do not stay idle but impose preventive efforts to stop any escalation of the mortality.
Let the inefficacy of their efforts to stop an escalation of COVID mortality is a non-observable inefficacy parameter such that the COVID’s mortality rate becomes . Suppose the random number refers the number of new COVID deaths since the imposition of efforts. Note that the frequency pattern of could be a regular Poisson probability distribution (Equation (2)).
When the inefficacy parameter is in this scenario, the preventive efforts ought to have been a greatest success. When the inefficacy parameter was , the scenario would be thought to have reduced the COVID mortality. Beware that when (which is an undesirable, adversarial), the COVID mortality rate might have worsened as the pandemic outpowered the efforts. However, the registry of COVID incidences records only the sum . An analyst ought to consider a convoluted probability distribution of the random number . Such a convoluted probability distribution is the intervened Poisson distribution (IPD, Equation (3)).
Equation (3) was introduced by Shanmugam [10] and studied in Shanmugam [11]. Joyce et al. [12] designed a mixed sampling plan to judge an IPD chance mechanism. Utilizing dispersion and mean, Shanmugam [13] revelated a shrunken quantity to portray the public perception of situations which might spread AIDS or HIV. Earlier, Shanmugam [14] modeled the web changes data to recatch during a spread of internet virus using IPD. Additionally, Shanmugam [15] predicted a “successful” inefficacy of an epidemic, using IPD.
The expected number of the IPD in (3) is Equation (4), and it is intrinsically related to its dispersion, Equation (5).
The dispersion reflects the volatility in the COVID deaths. If the dispersion is lesser than the expected number, the efforts ought to have been effective, no matter what is the level of the expected number of the COVID deaths in a county? Before examining it in the data, realize that the second term is nonnegative, and, hence, in the IPD model. This characteristic property is a litmus test to decide whether the IPD model (4) is indeed the underlying model for the chance mechanism which generated the data on COVID mortality and the existence of successful efforts. In other words, in those counties in which the mortality data-based estimate of the inefficacy parameter is less than one is indicative of successful efforts.
2.2. Index of Inefficacy
Consider two mutually exclusive dichotomous scenarios in an effort to stop escalation of COVID mortality. One scenario encompasses an effective, efforts. The other scenario is an adversarial ineffective, efforts. Only one of these two scenarios could have ever happened in a county with respect to efforts by the health professionals dealing with COVID mortality. Combining the data dispersion, , expected number, and of the COVID mortality, we introduce an index to portray the inefficacy of the efforts by the health professionals. Note that and the balancing factor, .
When the efforts were ideal and the best (that is, ), the target index is whose level is data dependent as the estimates of and vary from a county to another. When the efforts were adversarial and worst (that is, ), the index is negative and is indicative of how much trailing behind the target level, .
Additionally, the factor is indicative of how much the chance mechanism of COVID mortality has tilted away from the regular Poisson chance mechanism at zero level because the mean, and variance, are equal in regular Poisson mechanism. The dynamics of the tilt is visualized in the Figure 1 after denoting in the , in the, and in the .
Figure 1.
The dynamics of the inefficacy confronting COVID pandemic in which in the , in the , and in the .
We name the factor as a unique deflated risk level to die in COVID pandemic because of the efforts by the health professionals. In the deflated risk to die of COVID in a county during the pandemic, the co-proportion in interval [0, 1] gets the name efforts index and the other co-factor could be named pandemic index . The nonlinear configuration of the pandemic index is seen in Figure 2.
Figure 2.
The nonlinear configuration of the pandemic index . The pandemic-index is in vertical z-axis, μ and v are in the breath and x-axes respectively.
The genesis of IPD (intervened Poisson distribution) follows. With no intervention efforts, the expected number of deaths would have been just the incident rate (θ). The intervention efforts impact the expected deaths to change to ρθ. When the intervention parameter ρ is less than one, the efforts have been effective, as the expected number of deaths (after the intervention) reduced. When the intervention parameter ρ is greater than one, the variance is still less than the mean, but the scenario is indicative of inefficient efforts. Hence, we the analysis proceeds in two steps. When variance is less than one, the IPD is the underlying model for the pattern of COVID-19 deaths. When the intervention parameter is less than one, the efforts were effective. Otherwise (that is, when the intervention parameter is greater than one and the variance is lesser than the mean), the efforts were inefficient.
2.3. Estimating the Efficacy Parameter
In this section, we provided an innovative approach to estimate the IPD’s parameters using R Statistical Software [9] and the nloptr non-linear optimization package [16]. First, we cite below the major expressions to be exercised to obtain the estimates of the mortality parameter,and the efficacy parameter, from the data mean, and data variance, of the COVID deaths in a county. The mean and the variance are and . The p-value of the data based estimate, is computed using the expression , which was derived by Shanmugam [11] using the so called Neyman’s procedure. The p-value is the left tail probability area, under the standard Gaussian frequency curve.
2.4. Data
Daily COVID-19 fatality data were obtained from USA Facts [17] where were compiled from the Centers for Disease Control and Prevention (CDC) [18] at the county level for all United States counties from 28 March 2020 through 26 January 2022. County-level data were selected to prevent smoothing due to aggregation, which would be associated with filtering out the ineffectiveness of the prevention efforts. This study specifically focuses on the United States, as disparate control efforts were implemented from state to state and county to county. In many more homogenous countries, controls were constant. One-hundred and thirty-seven counties with the variance less than the mean were retained to develop the inefficiency index, as this is a characteristic of the IPD. A variance greater than the mean implies efficiency in interventions. Further, when the variance is less than the mean, there is evidence that a pure Poisson process is not appropriate. In the pure Poisson process, the mean and variance should be equal, and this equality is the characteristic property. In the COVID-19 data for some counties in US, this characteristic property is not met and is thus indicative of the reality that the data deviated from the Poisson process.
3. Results
3.1. Map of Counties with Potentially Ineffective Interventions
The 137 counties retained in the study where the mean exceeds the variance are depicted in Figure 3 with shading associated with logarithm of the sum of their deaths. Many of the observations are located in the center of the country. The large majority of the counties are in the central United States. The interactive plots designed with leaflet [19] are available online [20].
Figure 3.
Counties where the mean is greater than the variance.
3.2. Estimation of Parameters
To estimate both θ and ρ for each county, a positive semi-definite non-linear objective was formed that minimized the squared difference (sum of squares) between , as specified previously, and the observed mean mortality ( for each of the counties: . For each county, the objective function was solved using Constrained Optimization by Local Approximation (COBYLA), a derivative-free optimization algorithm developed by Powell [21] and implemented in nloptr [16]. While we might have chosen one of a number of optimization algorithms, we chose COBYLA, as it does not require gradient specification. The code is available for review online [20].
3.3. Analysis of COVID Fatalities
In the illustration, we considered COVID deaths in all counties of USA from 22 January 2020 till 9 November 2022. There were approximately 3142 counties in USA with COVID deaths in the above specified duration; however, there were another 50 observations, one per state, reflecting fatalities unallocated to a specific county. Descriptive statistics are shown in Table 1.
Table 1.
Descriptive statistics.
The population in this study was 328.5 million representing the United States population. The ‘average’ county was about 103 thousand (median ~25 thousand) in population and experienced 330 deaths (median of 102). The rate of fatalities per 100,000 population was about 339 on average (median of 259), and the overall fatality rate per 100,000 was 321. The fatality rates per 100,000 and logarithm of the sum of the total fatalities (+1) are depicted in Figure 4 and Figure 5.
Figure 4.
COVID-19 Fatality Rate per 100,000 population (county level). NOTE: counties in gray have in excess of 1600 fatalities per 100,000.
Figure 5.
COVID-19 Logarithm of the Sum of Deaths (+1).
Aggregated at the state level, fatality rates per 100,000 individuals (not adjusted for age) were highest in Mississippi (436.54), West Virginia (420.61), Alabama (419.28), and Arizona (414.97), while they were lowest in Hawaii (121.2), Vermont (122.28), Utah (157.99), and Alaska (184.68). Table 2 provides a complete enumeration of deaths, populations, and fatality rates per 100,000, while Figure 6 provides a map of the logarithm of the sum of COVID-19 Deaths (+1) for counties where the mean is greater than the variance (N = 137).
Table 2.
Deaths, population, and fatality rate per 100,000 persons (sorted by fatality rate).
Figure 6.
Logarithm of the Sum of COVID-19 Deaths (+1) for counties where the mean is greater than the variance (N = 137).
After finding the mean, and dispersion, , we eliminated all those counties in which the dispersion is more than the mean. In other words, we screened and selected the counties with under dispersion, which is a requirement for the IPD model. Exactly 137 counties had under dispersion. These counties are depicted in Figure 6 and online.
The efficacy and the mortality parameters of these counties were estimated using nonlinear optimization as described previously. Of the 137 counties which were observed to have under dispersion, only 25 were statistically significant at the α = 0.05 level. The estimates for the statistically significant counties are summarized in Table 3 and depicted in Figure 7.
Table 3.
Summary of county, state, n, dispersion, mean, estimate of efficacy of efforts, p-value for COVID mortality.
Figure 7.
Logarithm of the Sum of COVID-19 Deaths (+1) for N = 25 counties where the mean is greater than the variance and with p < 0.05.
4. Discussion
The IPD provides a probabilistic method for indexing inefficacious response efforts during pandemics. In our analysis, we discovered that 25 of the 137 counties were likely inefficient. By identifying these counties during a pandemic, efforts to identify and address the inefficiencies might be pursued. Since COVID-19 is likely not to be the last pandemic, the index generated by our analysis will be useful for future pandemics.
Some of these counties may have reasonable explanations for under dispersion. For example, Mono County, California experienced extreme particulate matter due to the wildfires in 2020, and COVID-19 deaths on these ‘wildfire days’ were higher [22]. Thus, a sustained high level of COVID-19 fatalities might be associated with environmental conditions. Kenedy, Texas was considered ‘high-risk’ for transmission of COVID-19 for various reasons, and thus might have experienced under dispersion [23]. Garfield County, Utah experienced a decline in population due to death during the pandemic [24], which might be due to pandemic prevention efforts themselves. Hooker County, Nebraska was previously identified as a high-incident outlier [25]. Neighboring Brown County may be associated with the same problematic interventions. Many of the counties in Texas had populations with high-risk comorbidities [26]. Failure to implement effective prevention measures might have exacerbated death rates. Other counties would need additional investigation to determine explanations.
5. Conclusions
COVID-19 is only a single pandemic; however, it resulted in the death of millions. The ability to identify efficacy of prevention measures using distributions, such as the IPD is critical to policymakers, as a finite number of assets are available for intervention. The human suffering and mortality rate had shaken the trust and peace of mind in everyone. Government agencies did formulate and implement preventive and treatment policies; however, the assessment of these policies might have been assisted through the use of mathematical models, such as the IPD. Policies that were assumed to be effective might have been rapidly assessed using reasonable probability distribution assumptions.
The study is based on secondary analysis, which is common when experimental designs are infeasible. Still, the classification of inefficacious versus efficacious is based on a reasonable probability model that might be affected by other factors not included in the study. Additional models would probe this limitation further. Finally, the study is limited by the validity of the data collected by the Centers for Disease Control and Prevention, although the results are likely not influenced largely by accidental input errors.
The findings in this research article identified those United States counties in which the efforts to stop the escalation of COVID-19 mortality were inefficient. In this research process, we have created an approach of indexing the inefficiency of the healthcare operations during a pandemic prior to formal modeling. This index might be used in future pandemics to identify those entities which are implementing inefficacious policies. The index can provide decision makers areas that require interventional assistance. The techniques proffered herein should prove useful for indexing efforts in future pandemics.
Author Contributions
All four authors equally contributed to all five sections. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Data Availability Statement
The data are open source as referenced in the paper.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Le, T.T.B.; Vasanthakumaran, T.; Hien, H.N.T.; Hung, I.C.; Luu, M.N.; Khan, Z.A.; An, N.T.; Tran, V.P.; Lee, W.J.; Aziz, J.M.A. SARS-CoV-2 omicron and its current known unknowns: A narrative review. Rev. Med. Virol. 2022, e2398. [Google Scholar] [CrossRef] [PubMed]
- WHO. Who Coronavirus (COVID-19) Dashboard. 2022. Available online: https://covid19.who.int/ (accessed on 2 November 2022).
- Rajan, S.; McKee, M.; Hernández-Quevedo, C.; Karanikolos, M.; Richardson, E.; Webb, E.; Cylus, J. What have european countries done to prevent the spread of COVID-19? Lessons from the COVID-19 health system response monitor. Health Policy 2022, 126, 355–361. [Google Scholar] [CrossRef] [PubMed]
- Unruh, L.; Allin, S.; Marchildon, G.; Burke, S.; Barry, S.; Siersbaek, R.; Thomas, S.; Rajan, S.; Koval, A.; Alexander, M. A comparison of 2020 health policy responses to the COVID-19 pandemic in canada, ireland, the united kingdom and the united states of america. Health Policy 2022, 126, 427–437. Available online: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9187506/pdf/main.pdf (accessed on 2 November 2022). [CrossRef] [PubMed]
- Jiao, J.; Shi, L.; Zhang, Y.; Chen, H.; Wang, X.; Yang, M.; Yang, J.; Liu, M.; Sun, G. Core policies disparity response to COVID-19 among brics countries. Int. J. Equity Health 2022, 21, 9. [Google Scholar] [CrossRef] [PubMed]
- Brauner, J.M.; Mindermann, S.; Sharma, M.; Johnston, D.; Salvatier, J.; Gavenčiak, T.; Stephenson, A.B.; Leech, G.; Altman, G.; Mikulik, V. Inferring the effectiveness of government interventions against COVID-19. Science 2021, 371, eabd9338. Available online: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7877495/pdf/371_abd9338.pdf (accessed on 2 November 2022). [CrossRef] [PubMed]
- Feinhandler, I.; Cilento, B.; Beauvais, B.; Harrop, J.; Fulton, L. Predictors of death rate during the COVID-19 pandemic. Presented Healthc. 2020, 8, 339. [Google Scholar] [CrossRef] [PubMed]
- McAleer, M. Prevention is better than the cure: Risk management of COVID-19. J. Risk Financ. Manag. 2020, 13, 46. [Google Scholar] [CrossRef]
- R Development Core Team. R: A Language and Environment for Statistical Computing; R Development Core Team: Vienna, Austria, 2018. [Google Scholar]
- Shanmugam, R. An intervened poisson distribution and its medical application. Biometrics 1985, 41, 1025–1029. Available online: http://www.jstor.org/stable/2530973 (accessed on 2 November 2022). [CrossRef] [PubMed]
- Shanmugam, R. An inferential procedure for the poisson intervention parameter. Biometrics 1992, 48, 559–565. [Google Scholar] [CrossRef] [PubMed]
- Joyce, V.J.; Merlin, G.S.; Edna, K.R.J.; Fenella, S. Designing mixed sampling plan based on ipd. J. Manag. Inf. Decis. Sci. 2021, 24, 1–6. [Google Scholar]
- Shanmugam, R. Revelation of shrunken or stretched binomial dispersion and public perception of situations which might spread aids or hiv. Int. J. Res. Med. Sci. 2014, 2, 462–467. [Google Scholar] [CrossRef][Green Version]
- Shanmugam, R. Modelling web changes data recatched during a spread of internet virus. Math. Comput. Model. 2005, 42, 1175–1190. [Google Scholar] [CrossRef]
- Shanmugam, R. Predicting “successful” prevention of an epidemic. Commun. Stat.-Theory Methods 2006, 30, 93–103. [Google Scholar] [CrossRef]
- Johnson, S.G. The Nlopt Nonlinear Optimization Package. 2022. Available online: https://nlopt.readthedocs.io/en/latest/ (accessed on 2 November 2022).
- Usafacts. Us COVID-19 Cases and Deaths by State. 2022. Available online: https://usafacts.org/visualizations/coronavirus-COVID-19-spread-map (accessed on 2 November 2022).
- CDC. Trends in Number of COVID-19 Cases and Deaths in the Us Reported to CDC, by State/Territory. 2022. Available online: https://covid.cdc.gov/covid-data-tracker/#trends_weeklydeaths_select_00 (accessed on 2 November 2022).
- Cheng, J.; Karambelkar, B.; Xie, Y. Leaflet: Create Interactive Web Maps with the Javascript ‘Leaflet’ Library, 2022.
- Fulton, L. Rpubs Code. 2022. Available online: https://rpubs.com/R-Minator/Ram22 (accessed on 2 November 2022).
- Powell, M.J. A view of algorithms for optimization without derivatives. Math. Today-Bull. Inst. Math. Appl. 2007, 43, 170–174. [Google Scholar]
- Zhou, X.; Josey, K.; Kamareddine, L.; Caine, M.C.; Liu, T.; Mickley, L.J.; Cooper, M.; Dominici, F. Excess of COVID-19 cases and deaths due to fine particulate matter exposure during the 2020 wildfires in the united states. Sci. Adv. 2021, 7, eabi8789. [Google Scholar] [CrossRef] [PubMed]
- Rao, S.; Singh, M.; Hemnani, K.; Arroyo, C. Can we abandon our masks? Deciphering cdc’s new guidelines about masking. DHR Proc. 2022, 2, 1–3. [Google Scholar] [CrossRef]
- Lake, S.; Juan, A.S. State and county population estimates for utah: 2021. Population 2021, 3, 3–4. [Google Scholar]
- Andrews, M.R.; Tamura, K.; Best, J.N.; Ceasar, J.N.; Batey, K.G.; Kearse, T.A., Jr.; Allen, L.V., III; Baumer, Y.; Collins, B.S.; Mitchell, V.M. Spatial clustering of county-level COVID-19 rates in the us. Int. J. Environ. Res. Public Health 2021, 18, 12170. Available online: https://mdpi-res.com/d_attachment/ijerph/ijerph-18-12170/article_deploy/ijerph-18-12170-v3.pdf?version=1637639494 (accessed on 2 November 2022). [CrossRef] [PubMed]
- Chin, T.; Kahn, R.; Li, R.; Chen, J.T.; Krieger, N.; Buckee, C.O.; Balsari, S.; Kiang, M.V. US county-level characteristics to inform equitable COVID-19 response. medRxiv 2020. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).