Next Article in Journal
The Impact of Surgical Delay: A Single Institutional Experience at the Epicenter of the COVID Pandemic Treatment Delays in Women with Endometrial Cancer and Endometrial Intraepithelial Hyperplasia
Previous Article in Journal
Association between Priority Conditions and Access to Care, Treatment of an Ongoing Condition, and Ability to Obtain Prescription Medications among Medicare Beneficiaries during the COVID-19 Pandemic
Previous Article in Special Issue
Genetic Analysis and Epitope Prediction of SARS-CoV-2 Genome in Bahia, Brazil: An In Silico Analysis of First and Second Wave Genomics Diversity
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genomic Surveillance of SARS-CoV-2 Sequence Variants at Universities in Southwest Idaho

1
Biology Department, Northwest Nazarene University, Nampa, ID 83686, USA
2
Biomolecular Research Center, Boise State University, Boise, ID 83725-1511, USA
3
Cleveland Clinic, 9500 Euclid Avenue, Cleveland, OH 44195, USA
4
Genetics and Infectious Disease Laboratory, Boise State University, Boise, ID 83725, USA
5
College of Medicine, Drexel University, 2900 W. Queen Lane, Philadelphia, PA 19129, USA
6
Department of Biological Sciences, Boise State University, Boise, ID 83725-1515, USA
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
These authors also contributed equally to this work.
COVID 2024, 4(1), 23-37; https://doi.org/10.3390/covid4010003
Submission received: 29 November 2023 / Revised: 22 December 2023 / Accepted: 23 December 2023 / Published: 25 December 2023
(This article belongs to the Special Issue SARS-CoV-2 Bioinformatics)

Abstract

:
Although the impact of the SARS-CoV-2 pandemic on major metropolitan areas is broadly reported and readily available, regions with lower populations and more remote areas in the United States are understudied. The objective of this study is to determine the progression of SARS-CoV-2 sequence variants in a frontier and remote intermountain west state among university-associated communities. This study was conducted at two intermountain west universities from 2020 to 2022. Positive SARS-CoV-2 samples were confirmed by quantitative real-time reverse transcription-polymerase chain reaction and variants were identified by the next-generation sequencing of viral genomes. Positive results were obtained for 5355 samples, representing a positivity rate of 3.5% overall. The median age was 22 years. Viral genomic sequence data were analyzed for 1717 samples and phylogeny was presented. Associations between viral variants, age, sex, and reported symptoms among 1522 samples indicated a significant association between age and the Delta variant (B 1.167.2), consistent with the findings for other regions. An outbreak event of AY122 was detected August–October 2021. A 2-month delay was observed with respect to the timing of the first documented viral infection within this region compared to major metropolitan regions of the US.

1. Introduction

The coronavirus disease 2019 (COVID-19) pandemic caused by the novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) wreaked havoc on communities, economies, and healthcare systems as the virus infected over 771 million people and contributed to the deaths of more than 6.9 million people as reported to the World Health Organization as of 8 November 2023 [1]. As the virus evolved newer strains, transmission rates, symptoms, and treatment efficacy also changed. The first case of COVID-19 in Idaho was confirmed on 12 March 2020 by the Idaho Department of Health and Welfare. Since the beginning of the pandemic, a total of 522,919 cases have been reported in Idaho [2]. The impact of the pandemic on Idaho was substantial. The Idaho Department of Health and Welfare issued three separate Crisis Standards of Care declarations during the pandemic, as intensive care units were pushed beyond their limits and emergency care had to be rationed. COVID-19 infections contributed to the deaths of 5582 Idahoans, or 1 in 328 residents [3].
Idaho features a low population density within a relatively large geographical region, which can create economic and social challenges. Southwest Idaho includes a small metro area with fewer than 1 million residents surrounded by extensive frontier and remote (FAR) regions [4], with the nearest metropolitan areas more than 350 miles away in any direction. The low population density and frontier and remote characteristics put the intermountain west at risk of being an understudied region during the SARS-CoV-2 pandemic. As genomic surveillance efforts to monitor the pandemic quickly became mainstream, the bulk of resources and efforts were concentrated in highly populous, urban areas. Understanding the effects of the pandemic in sparsely populated states is crucial because there are unique challenges and considerations associated with these regions. To better understand how the COVID-19 pandemic evolved in Southwest Idaho, this study analyzes the sequences of COVID19-positive samples collected from the Genetics and Infectious Disease Laboratory at Boise State University and the Biology Department at Northwest Nazarene University. The two southwest Idaho counties (Ada and Canyon) from which most of the samples are obtained are part of the Census Bureau’s Boise Metropolitan Statistical Area and are designated as Health Professional Shortage Areas and a Medically Underserved Area [5] and comprise approximately 40% of the population of the state of Idaho. The population density is 420 people per square mile in these two counties compared to the statewide density of 22 people per square mile [6]. Here, we report the results of the genomic surveillance in Southwest Idaho, among college-aged students, university staff and faculty, local preK-12 students and teachers, and state employees in Southwest Idaho.

2. Materials and Methods

2.1. Surveillance Testing

Northwest Nazarene University, Nampa, Idaho, and Boise State University, Boise, Idaho, athletes and staff participating in NCAA sports were tested weekly. Several preK-12 schools in the valley opted to test weekly as well. Students residing in dormitories were tested after arriving on campus prior to the 2021–2022 Fall semester and after holiday travel. Individuals working on campus and commuting students taking on-campus classes were tested as they deemed necessary. The Boise State Public Health testing center tested individuals from local businesses, state and local government employees, and arts organizations, as well as community members in general. Northwest Nazarene University offered daily options for asymptomatic screening for all faculty and students taking classes and working on campus (symptomatic individuals were asked to test at the campus wellness center). Businesses or groups concerned about an outbreak requested on-site testing. Patient demographics and self-reported symptoms were collected.

2.2. Sample Collection

Two types of collection methods were used. Patients’ saliva was collected into a sterile 15 mL tube. Anterior nasal swabs were collected by patients after receiving instructions to swab each nasal cavity with a nylon fiber swab for 15 s and then to place the sample into 3 mL of viral transport media in a sterile tube. The tests were transported to the laboratory and stored at room temperature (saliva samples) or at 4 °C (anterior nasal samples) until further analysis.

2.3. Screening for SARS-CoV-2

Screening tests included the following: Yale’s SalivaDirect™ PCR Assay (SD)1, TaqMan One-Step RT-PCR master mix (Applied Biosystems™, #A15300, Waltham, MA, USA), TaqPath™ COVID-19 Combo Kit (TP) (Applied Biosystems™, #A47814), and the TaqPath™ COVID-19, FluA, FluB Combo Kit (TP-CF) (Applied Biosystems™, #A47814). For the SD test [7], “Workflow 1” was used with ThermoFisher Scientific MagMAX™ Viral/Pathogen Proteinase K (Thermo Fisher, #A42363, Waltham, MA, USA). All samples were run on a QuantStudio™ 5 Real-Time PCR System, using 96 well plates and a volume of 0.1 mL (Applied Biosystems™: #A28568). Samples with CT < 40 were considered positive for COVID-19 and were aliquoted and stored at −80 °C. Samples with CT < 35 were sequenced.

2.4. Sample Sequencing

RNA extraction was performed with a KingFisher™ Flex using the MagMAX™ Viral/Pathogen II (MVP II) Nucleic Acid Isolation Kit (Applied Biosystems™, #A48383), and sequenced on an Illumina NextSeq 1000 using Illumina’s COVIDSeq™ Assay (Illumina, #20049393) and a P2/300 cycle reagent kit (Illumina, #4462230). Illumina COVIDSeq™ v4 Primer Pools were used for sequencing (Illumina, #20065135) (San Diego, CA USA). Library quantity was determined using the Qubit™ 4 Fluorometer with the 1× dsDNA HS Assay Kit (Invitrogen, #Q33231) (Waltham, MA, USA). A 2% PhiX spike-in was added for quality control.

2.5. Genomic Data Analysis

Sequences were analyzed with the DRAGEN COVID Lineage (version 3.5.12) pipeline on Illumina’s BaseSpace hub. The DRAGEN COVID Lineage analysis involves the Kmer-based detection of SARS-CoV-2, the alignment of reads to a reference genome, variant calls, and the generation of consensus genome sequences. This pipeline used the applications Pangolin [8] and Nextclade [9] to analyze the lineage and clade, respectively. Sequences that failed the DRAGEN default quality control analysis, most likely due to low virus titers in the original sample, were removed from the study. Sequences were aligned to GenBank’s “Wuhan-Hu-1/2019” root sequence with a custom-built reference dataset of 21,094 sequences deposited in the Global Initiative on Sharing All Influenza Data (GISAID) with collection dates between January 2020 and 7 May 2022. These represented samples from 36 US states, excluding Idaho (Nextstrain 4.2.0), to determine the lineage and clade. Sequences with the requisite metadata were uploaded to GISAID. All clade and lineage nomenclature used the Pango system with common names (Delta, Omicron) [10].

2.6. Phylogenic Analysis

SARS-CoV-2 phylogeny was reconstructed with Augur and visualized with Auspice using the Nextstrain pipeline (https://github.com/nextstrain/ncov; release version 12 (accessed on 1 January 2023)) [11]. We included all 1717 sequences for the Nextstrain analysis.

2.7. Data Cleaning

Demographic data were aligned with the genomic samples and files were uploaded to GISAID and the National Center for Biotechnology Information (NCBI). Duplicates were removed and the identified viral sequences originating from the same individuals within a 90-day window were assumed to represent the same infection event. Sequences that failed the NextClade quality control were removed from the dataset. We confirmed that sequence quality and the percentage of non-N bases were equivalent across clades using a chi-squared test and a one-way analysis of variance, respectively. As some analyses depended upon the entire testing population, not just COVID-19-positive cases, we summarized the number of tests in each month, including each individual once per month to reduce over-counting due to some groups requiring weekly testing.

2.8. Analysis of Age or Sex with Clade and Lineage

The data were grouped by age into 5 categories: under 5, 5 to 17, 18 to 25, 26 to 40, and over 40 years old. Because testing groups could be overrepresented at specific times of the year (i.e., during the school year) and during specific outbreak periods, age distribution at testing varied throughout the study period. We used our testing demographics to identify the periods when age groups were tested and divided the data into four six-month testing time frames (Fall 2020, Spring 2021, Fall 2021, and Spring 2022). For each time period, age and clade assessments were performed for the available data within these time frames. The association between age group or sex and clades with n > 10 was assessed with a Pearson’s chi-squared test. The relationship was considered significant for p < 0.05. The odds ratio and confidence intervals for sex or age groups to test positive in our total study group were calculated using the epitools package (version 0.5–10.1) [12].

2.9. Analysis of Symptoms with Clade

Fifteen groups of common symptoms were created (e.g., cough and congestion) from the ICD10 codes. Because individuals could report up to 10 symptoms and most people reporting symptoms reported more than 1 symptom (54% of the 1527 who reported symptoms), we analyzed the prevalence of symptoms by clade separately for each symptom. For each symptom, we analyzed its presence or absence across clades using Fisher’s exact test and adjusted the p-value for multiplicity using the false discovery rate [13] To understand the different clade representations when the test was statistically significant, we examined the cell’s contribution to the χ2 statistic in the 2 × n table (presence/absence of symptom group, by clade), flagging values > 3. We used base R and the library gmodels (version 2.18.1.1) for this assessment [14]. We visually examined the occurrence by lineage for 21J (Delta) and 21K (Omicron) to determine if these symptoms could be associated with lineages. This analysis was limited to those individuals that were specifically asked about symptoms and those clades that occurred after June 2021, when consistent symptom collection was employed.

3. Results

The Boise State University Genetics and Infectious Disease Laboratory performed 86,289 SARS-CoV-2 diagnostic tests between 4 November 2020 and 7 May 2022, registering 4620 positive results for a positivity rate of 5.4%. Northwest Nazarene University performed 67,814 tests between 31 August 2020 and 27 April 2022, registering 735 positive results for a 1.1% positivity rate. From these, 1717 unique SARS-CoV-2 isolates were sequenced (Figure 1).
Nationally, when campuses reopened in the Fall of 2020, outbreaks occurred in student living areas, and many students and employees became infected with the virus spreading to the surrounding community. In the Spring of 2021, intervention policies were in place that helped curb the virus’ spread. College-age people were eligible for the vaccine and, by late April 2021, individuals aged 16 years or older could receive a vaccination. Northwest Nazarene University has a student population of approximately 1778 (1100 undergraduate and 700 graduate students), with a 24% minority enrollment. Boise State University has a student population of approximately 26,000 (23,000 undergraduate and 3200 graduate students), with a 25% minority enrollment.
Contingency policies at Northwest Nazarene University included required masking and 6′ spacing during Fall 2020 and throughout April 2021. Asymptomatic campus members were advised to test twice a week during the first academic year and at least once a week in the second year (2021–2022). Campus members who tested positive for COVID-19 were isolated in designated housing (if residential students) or at home according to the CDC guidelines. A contactless food delivery system was implemented for quarantined students. Similar quarantine periods were imposed for individuals identified by contact tracing, unless the individuals were vaccinated.
At Boise State University, measures to prevent and/or minimize the spread were implemented to support safe working, learning, and living environments. Measures included wearing masks, social distancing, online courses, and contact tracing within classrooms using a mobile app, using an online pre-ordering system for meals with contactless grocery and food delivery services, providing isolation housing with a limited number of beds available for residents who tested positive, a notification system provided by Boise State Public Health, guidance during isolation, clearance from the isolation protocol, and making available a real-time status to provide information on clearance to be on campus through the student “myBoiseState” homepage, the “Welcome” page, and “My Bronco ID” on the Boise State mobile app. A QR Code app was implemented 1/11/21 to improve COVID-19 classroom contact tracing and attendance tracking data in Boise State classrooms. Students and instructors used their mobile devices to scan QR codes at assigned seats and locations in classrooms. This launched a web application with a quick prompt for authorization and location data already populated, resulting in more accurate information compared to a previous approach that asked students to fill out a survey. A dashboard was established by the Boise State Hazard and Climate Resilience Institute (HCRI) to facilitate the dissemination of information and collect and generate resources for the community during the COVID-19 pandemic, including information about how to stay informed about cases in Idaho, trends in the data, steps to take to reduce risks, and resources available.
Our study population was 54% female, 80% under the age of 40 years, identifying as White (77%) and non-Hispanic (77%) (Table 1).
Viral sequences were organized by phylogeny by Nextstrain (version 12) and compared against a sampling of 21,094 high-quality viral genomes acquired from GISAID representing 36 US states and excluding other Idaho samples (Figure 2). The first strains sequenced in 2020 were of the 20G clade. Over the course of the 2020–2021 and 2021–2022 academic years, despite substantial and impactful measures to reduce the spread, the SARS-CoV-2 pandemic resulted in infections from 17 Nextstrain clades and 102 Pango lineages within the study population (Figure 2).

3.1. Viral Evolution

SARS-CoV-2 demonstrated a considerable capacity for rapid evolution. The first divergent clades appeared within weeks of sample archiving. By December 2020, four clades and corresponding Pango lineages were detected. Both detected lineages of the 21C clade, B.1.427 and B.1.429, occurred over the same time period as the more dominant 20G clade. An assessment of the B.1.429 infections identified three mutations in the S1 subunit of the spike protein relative to the original 2019 Wuhan strain, and a C10641T mutation unique within our dataset (Figure 3). The last detected B.1.429 infection was first found on 29 March 2021 and was the last detected 21C clade sample in our dataset, marking the end of the 20C-derived SARS-CoV-2 variants. These were eventually supplanted by the 21A-derived Delta variants (Figure 2).
Several short-lived variants of the 20B clade were detected between February and May 2021. These variants represent the early lineage-level evolution detected in our dataset. Testing slowed down following the close of the 2021 Spring semester in May and the presence of the 20C and 20G clades began to decrease while lineages of the 20B clade began to rise. Testing ahead of the Fall 2021 semester in July determined that the detected lineages were all from the newly established Delta variant.
We found 32 unique lineages in the Delta family of variants across three clades co-existing between July 2021 and January 2022. The dominant clade was 21J, representing 93% of our cases. The 21A clade was detectable throughout the Fall 2021 semester but represented just 1.2% of cases (n = 9). The 21I clade, with eight Pango lineages, was also present during this same period and accounted for 5.3% (n = 41) of sequenced Delta variant infections. The lineages found in our Delta sequences largely reflected the same lineages found across North America, although the Idaho subset featured at least one overrepresented mutational site, ORF1b F685Y, detected in 54 Idaho infections and rarely in the north American cohort (Figure 3).
The Nextstrain analysis delineated major nodes within the 21J clade. The Pango lineage AY.44 was the most populous (38% of cases) and featured eight mutations in the S1 subunit of the spike protein, which were mutations common to all Delta variants, and a unique F183V amino acid change in the second open reading frame, ORF1b. All three Delta variants were present from July to November 2021 with 85% or more of the sequences each month presenting the 21J clade. However, this abruptly shifted in December 2021, when the Omicron variant, 21K, became the dominant variant in 71% of cases.
The supplanting Omicron variant family, derived from the 20B clade, consisted of 15 Pango lineages (BA.2 and subtypes) and accounted for 818 sequenced infections collected between December 2021 and May 2022. Nearly all cases were from the 21K clade (90%), and it was the only clade identified in our study group until March 2022. As observed in much of the world, the BA.1 lineage was then supplanted by BA.2. In April 2022 to the end of the study, all cases were BA.2 lineages (clades 21L and 22C). We did not detect any B.1.1.529 Omicron variants in our sample set.
The Omicron samples featured 14 unique spike mutations. The 21K clade had six unique spike mutations (F371L, G446S, G496S, T547K, N856K, and L981F) in 717 samples, while another four mutations were specific to 21L and 22C (T19I, V213G, T376A, and D405N) in the remaining 79 cases.
The genomic surveillance of epidemics allows for unique clinical insights. Several localized outbreaks were identified by genetic likeness within our study population. Between August and October 2021, 380 SARS-CoV-2-positive samples were collected. Among these, 27 were highly related infections of the AY122 Pango lineage (Figure 4). The first occurrence of this variant occurred in August 2021, and samples with this variant were found over the next three weeks. Of these cases, all were from 17–20-year-old college students with 12 of them living on campus. Many were asymptomatic with only five indicating known exposure.

3.2. Characteristic of Infected Individuals

More women than men were tested for SARS-CoV-2 in our study (Table 1). However, men were not more likely than women to test positive for SARS-CoV-2 (OR 0.98, CI 0.93 to 1.05). For clades with 10 or more cases, there was an association between sex and clade (p = 0.015). Men were underrepresented in the 20I (Alpha, V1) and 21L (Omicron) clades (std residuals −1.85 and −1.77, respectively; χ2-statistic 17.3). In our study group, individuals aged 26 to 40 years were significantly more likely to test positive for SARS-CoV-2 than other groups (OR 1.39 (1.2, 1.7)) (Table 1).
During the first academic year (2020–2021), college-aged individuals made up the majority of tests; there was more parity in testing ages in year two (Figure 5). Age groups and clades were significantly associated (χ2, p = 4 × 10 –14). Notably, in the Fall of 2021, the age group of 5 to 17 year olds was underrepresented regarding those infected with the 21K (Omicron) clade, while the 26 to 40-year-old age group was overrepresented (p < 0.001, χ2 std residuals −3.0, 2.5, χ2 = 26.2, df = 8) (Figure 5). There were similar rates of testing for the 26–40-year-old group as for younger adults (Figure 5). These susceptibility differences did not persist into the Spring semester, even though 21K (Omicron) infections continued (Figure 5c).
Twenty-three distinct Pango lineages of at least 10 cases each were identified. Within the three families (AY, BA, and B), there was a significant association of AY and B lineages with age. Those aged 18 to 25 years old were substantially more likely than other age groups to be infected with AY.122 within the AY lineages (χ2 std residual 4.9, χ2 statistic 100, df = 40). As noted previously, there was a 27-student outbreak with this strain, which accounted for the overrepresentation of the 18 to 25-year-old group. There was also an unexpectedly high number of children under five years old infected with the B.1.617.2 lineage relative to all other B lineages (χ2 std residuals 3.6, χ2 = 34.5, df = 12), although this only accounted for four of the 18 cases.

3.3. Symptom Analysis

A total of 1522 sequences out of the original 1717 had identified symptoms or stated they were asymptomatic. The 21J (Delta) variant produced a higher prevalence of congestion and loss of taste and smell and a lower prevalence of a sore throat. The opposite was observed for 21K (Omicron). We did not observe an obvious association of these three symptoms for any specific lineage within 21J and 21K. Approximately 50% of cases with a loss of taste and smell that were also 21J (Delta) were assigned to Pango lineage AY.44, but the observed odds ratio (OR 1.6 (0.9,3.0)) did not reach significance with the relatively small numbers of cases (n = 42 with a loss of taste and smell in 21J (Delta)). Table 2 presents the analysis, including the significant association of 21L with fatigue, 21I with headache and congestion, and the lack of association of 21C with a cough and sore throat.

4. Discussion

The goal of this study was to contribute sequences to the SARS-CoV-2 databases and to track the progression of variants in an intermountain west region during the pandemic. We reported the submission of 1717 sequences to GISAID and NCBI and described the sequences of clades and lineages between 31 August 2020 and May 2022. Information on patient demographics and symptoms provided insights into how infections manifested across different populations in our community.
We noted differences in the positivity rates for the two universities participating in this study: Northwest Nazarene University and Boise State University. The difference in positivity rates was most likely due to the fact that Northwest Nazarene University offered daily options for asymptomatic screening for all faculty and students taking classes and working on campus. In contrast, at Boise State University, asymptomatic screening was more limited. At Boise State University, weekly testing for asymptomatic individuals was provided for athletes and staff participating in NCAA sports and limited preK-12 schools. Students residing in dormitories were tested after arriving on campus prior to the 2021 Fall semester and after holiday travel. However, individuals working at Boise State and commuting students taking on-campus classes were tested as they deemed necessary, and so were more likely to test if they had symptoms or had an exposure risk. Individuals from local businesses, state and local government employees, arts organizations, and community members were tested if an outbreak occurred.
The frontier and remote relative geographic and population Isolation of Idaho may have been protective against the early spread of SARS-CoV-2 when travel restrictions and lockdown measures were implemented in various parts of the world. This may have delayed the arrival and subsequent spread of the virus in the state, but by 2022, the virus was even reported in isolated areas. Idaho was one of the last of the United States to record a case of SARS-CoV-2 (13 March 2020), almost two months after the first case in the US [15]. This later arrival in the state is reflected in our data. While the first Idaho case of the Delta variant was identified in April 2021, no more cases were identified until July 2021 when Idaho initiated more extensive sequencing statewide. Our first sample of Omicron (BA.1) was collected on 14 December 2021, within weeks of the first US case. By that time, masking and group size restrictions were no longer in place, likely contributing to the more rapid spread of later variants.
To design effective disease prevention and management strategies, it is important to differentiate the relative risks for demographic groups. Especially in the first year of the pandemic, it was believed that males had higher clinical severity, mortality, and perhaps susceptibility rates [16,17,18,19]. However, in November 2021, there were fewer cases of COVID-19 among men than women (145,163 vs. 159,058, respectively) and men had a significantly lower population odds ratio of contracting COVID-19 (OR 0.90, CI 0.89 to 0.91) [20]. We did not find a significant association between sex and susceptibility to infection in our study when accounting for the higher COVID testing rate among women. Together, these results suggest that susceptibility is not affected by differences in the testing rates between sexes within our study population.
There were significant associations between age and specific clades or lineages of SARS-CoV-2, consistent with other observations. The Delta variant (B 1.167.2) was identified worldwide as highly transmissible, particularly infecting children and adolescents [21,22]. One hypothesis for this increased transmission was that children were more likely to be in school and group settings. In our study, children under five years old had even higher infection rates than school-aged children or college students. This can potentially be explained by a reduced compliance to masking in this age group, as well as a need for more hands-on playing and learning behaviors. Interestingly, the large wave of 21K (Omicron) and its lineages in Southwestern Idaho did not show a similar higher susceptibility rate for children than other cohorts [23].
The finding that a loss of taste and smell decreases in those infected with the Omicron variant is well-supported in the literature [24,25,26,27,28]. Omicron has been shown to lack specific mutations that are associated with anosmia and ageusia. The pre-Omicron literature that examined the symptoms of the Delta variant used different reference groups and was not consistent, showing both relative increases [29,30] in this symptom and decreases [31]. An increased cough and sore throat in Omicron clades relative to Delta and/or pre-Delta clades was reported [26,29,30]. However, our results show significantly less congestion than expected in the 21K Omicron clade compared to the Delta clades. It has been suggested that, among Omicron cases, symptoms broadened to the upper respiratory system, increasing these symptoms along with fever, a sore throat, fatigue, and cough, with the caveat that these were all symptoms of other common infections and could not be isolated to COVID-19 in the case of co-infection [25].
The limitations of our study included an overrepresentation of individuals from the 18 to 25-year-old age range. University populations are often homogeneous in terms of age, lifestyle, and socio-economic background, which may not reflect the broader population. The population studied here had a median age of 22 years, which can present limitations regarding the conclusions for age susceptibilities. University settings foster close interactions, shared living spaces, and social gatherings, making it challenging to isolate or control the variables related to viral transmission. The findings from studies on university populations may not always be extrapolated to other age groups or settings due to the unique dynamics of campus life. Additionally, those undergoing testing were self-selecting to some extent based on whether they were symptomatic, while those participating in athletic programs were tested regardless of symptoms and most often presented as asymptomatic. Some limitations existed for the recording of symptoms as these records may not have included all the symptoms. Finally, the majority of the samples analyzed in this study were from the Delta and Omicron waves during Fall 2021 and early 2022. These factors could have limited our conclusions regarding the associations between symptoms and clades, which could have been underestimated for the lesser clades.

5. Conclusions

While Idaho is a frontier and remote state, this study focused on a small urban center. In addition to the rapid arrival of new lineages traced from origins well outside the geographically isolated regions studied here, the detection of a missense mutation (ORF1b F685Y) in a high proportion of local infections, which was not detected in the reference set and was rarely found in other datasets, highlighted an example of how such a mutation could spread rapidly and locally, even without a clear or lasting positive selection bias. Both this incident and the broader observation of the general course of SARS-CoV-2 viral evolution underpin the critical nature of genomic surveillance efforts. To better understand the results of our study, Figure 6 illustrates the progression of newly confirmed cases in Idaho highlighting public policy decisions and interventions that are implemented over time.
Significant public resources were used for unprecedented genomic and clinical surveillance during the SARS-CoV-2 pandemic. What is clear from this and other efforts is how unpredictably, rapidly, and even regionally a viral pandemic can evolve.
With the benefit of experience and the analyses these efforts permitted, we can better prepare for a future that may face additional pandemic events. Public awareness and expectations can be established earlier on while minimizing the risk of miscommunications, leading to an improved accuracy of forecasting and increased compliance with public health safety measures.

Author Contributions

Concept and design: J.T.O. and S.F.H. Acquisition, analysis, or interpretation of data: all authors. Writing—original draft preparation and review and editing: all authors. Statistical analysis: J.R.C., L.B. and D.J.V. Administrative, technical, or material support: M.S., J.C., A.R., J.T.O. and S.F.H. Data curation, investigation, methodology, validation, and writing—review and editing: D.J.V., J.R.C. and L.B. All authors have read and agreed to the published version of the manuscript.

Funding

This work was funded by the National Institutes of Health, NIGMS (P20GM109095, P20GM103408, and U54GM104944), and the Office of University Affairs and Public Health at Boise State University.

Institutional Review Board Statement

The Boise State University Office of Research Compliance reviewed the protocol and determined that this research is exempt from further IRB reviews and supervision under 45 CFR 46.101(b) 685-MED21-004; SARS-CoV-2 surveillance of variants by viral genome sequencing; exemption: Category #4.

Informed Consent Statement

Not applicable.

Data Availability Statement

Genomic sequences were submitted to GISAID, the global data science initiative. Sequences were also submitted to the National Center for Biotechnology Information (NCBI).

Acknowledgments

The authors wish to acknowledge the technical support from Diane Smith and Usha Acharya, the administrative support from Alicia Shier Estey, and the case management support from Maureen Welcker. We acknowledge the support from The Biomolecular Research Center at Boise State, BSU-Biomolecular Research Center, RRID:SCR_019174), and Lori and Duane Stueckle.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. World Health Organization. WHO Coronavirus (COVID-19 Dashboard). 2023. Available online: https://covid19.who.int/ (accessed on 8 November 2023).
  2. The New York Times. Track COVID-19 in Idaho. 2023. Available online: https://www.nytimes.com/interactive/2023/us/idaho-covid-cases.html (accessed on 8 November 2023).
  3. Idaho Division of Public Health. Idaho Department of Health and Welfare|Idaho, United States. 2023. Available online: https://public.tableau.com/app/profile/idaho.division.of.public.health (accessed on 8 November 2023).
  4. USDA Economic Research Services, US Department of Agriculture. 2023. Available online: https://www.ers.usda.gov/data-products/chart-gallery/gallery/chart-detail/?chartId=62021 (accessed on 8 November 2023).
  5. Rural Health Information. RHIhub. Health and Healthcare in Frontier Areas. 2023. Available online: https://www.ruralhealthinfo.org/topics/frontier (accessed on 8 November 2023).
  6. United States Census Bureau, Quick Facts, Canyon County, Idaho; Ada County, Idaho; Idaho. 2023. Available online: https://www.census.gov/quickfacts/fact/table/canyoncountyidaho,adacountyidaho/RHI525222 (accessed on 8 November 2023).
  7. Vogels, C.B.; Watkins, A.E.; Harden, C.A.; Brackney, D.E.; Shafer, J.; Wang, J.; Caraballo, C.; Kalinich, C.C.; Ott, I.M.; Fauver, J.R.; et al. SalivaDirect: A simplified and flexible platform to enhance SARS-CoV-2 testing capacity. Med 2021, 2, 263–280. [Google Scholar] [CrossRef]
  8. O’Toole, Á.; Scher, E.; Underwood, A.; Jackson, B.; Hill, V.; McCrone, J.T.; Colquhoun, R.; Ruis, C.; Abu-Dahab, K.; Taylor, B.; et al. Assignment of epidemiological lineages in an emerging pandemic using the pangolin tool. Virus Evol. 2021, 7, veab064. [Google Scholar] [CrossRef] [PubMed]
  9. Aksamentov, I.; Roemer, C.; Hodcroft, E.B.; Neher, R.A. Nextclade: Clade assignment, mutation calling and quality control for viral genomes. J. Open Source Softw. 2021, 6, 3773. [Google Scholar] [CrossRef]
  10. Rambaut, A.; Holmes, E.C.; O’Toole, Á.; Hill, V.; McCrone, J.T.; Ruis, C.; Plessis, L.D.; Pybus, O.G. A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nat. Microbiol. 2020, 5, 1403–1407. [Google Scholar] [CrossRef]
  11. Hadfield, J.; Megill, C.; Bell, S.M.; Huddleston, J.; Potter, B.; Callender, C.; Sagulenko, P.; Bedford, T.; Neher, R.A. Nextstrain: Real-time tracking of pathogen evolution. Bioinformatics 2018, 34, 4121–4123. [Google Scholar] [CrossRef] [PubMed]
  12. Aragon, T.J.; Fay, M.P.; Wollschlaeger, D.; Omidpanah, A.; Omidpanah, M.A. Package ‘Epitools’. 2017. Available online: https://cran.r-project.org/web/packages/epitools/epitools.pdf (accessed on 1 June 2022).
  13. Benjamani, Y.; Hochberg, Y. Controlling the false discovery rate: A practical and powerful approach to multiple testing. J. R. Stat. Soc. B 1995, 57, 289–300. [Google Scholar] [CrossRef]
  14. Warnes, G.R.; Bolker, B.; Lumley, T. SAIC-Frederick RCJCfRCJaC, Program IFbtIR, NIH, Institute NC, NO1-CO-12400. CfCRuNC (2022). _gmodels: Various R Programming Tools for Model Fitting_. R Package Version 2.18.1.1. Available online: https://CRAN.R-project.org/package=gmodels (accessed on 1 June 2022).
  15. Office of the Governor. Governor Brad Little. Governor Little Issues Statement Following First Confirmed Case of Coronavirus in Idaho. 13 March 2020. Available online: https://gov.idaho.gov/pressrelease/governor-little-issues-statement-following-first-confirmed-case-of-coronavirus-in-idaho/ (accessed on 8 November 2023).
  16. Danielsen, A.C.; Lee, K.M.; Boulicault, M.; Rushovich, T.; Gompers, A.; Tarrant, A.; Reiches, M.; Shattuck-Heidorn, H.; Miratrix, L.W.; Richardson, S.S. Sex disparities in COVID-19 outcomes in the United States: Quantifying and contextualizing variation. Soc. Sci. Med. 2022, 294, 114716. [Google Scholar] [CrossRef] [PubMed]
  17. Guan, W.J.; Ni, Z.Y.; Hu, Y.; Liang, W.H.; Ou, C.Q.; He, J.X.; Liu, L.; Shan, H.; Lei, C.L.; Hui, D.S.; et al. Clinical Characteristics of Coronavirus Disease 2019 in China. N. Engl. J. Med. 2020, 382, 1708–1720. [Google Scholar] [CrossRef] [PubMed]
  18. Centers for Disease Control and Prevention. COVID Data Tracker; US Department of Health and Human Services, CDC: Atlanta, GA, USA, 2023. Available online: https://covid.cdc.gov/covid-data-tracker (accessed on 30 May 2023).
  19. Vahidy, F.S.; Nicolas, J.C.; Meeks, J.R.; Khan, O.; Pan, A.; Jones, S.L.; Masud, F.; Sostman, H.D.; Phillips, R.; Andrieni, J.D.; et al. Racial and ethnic disparities in SARS-CoV-2 pandemic: Analysis of a COVID-19 observational registry for a diverse US metropolitan population. BMJ Open 2020, 10, e039849. [Google Scholar] [CrossRef] [PubMed]
  20. Gendersci Lab GSL. GenderSci Lab COVID Project. Updates and New Releases. 2021. Available online: https://www.genderscilab.org/gender-and-sex-in-covid19?rq=gender-and-sex%20in%20covid19 (accessed on 1 November 2021).
  21. Chun, J.Y.; Jeong, H.; Kim, Y. Age-Varying Susceptibility to the Delta Variant (B.1.617.2) of SARS-CoV-2. JAMA Netw. Open 2022, 5, e223064. [Google Scholar] [CrossRef] [PubMed]
  22. Khemiri, H.; Ayouni, K.; Triki, H.; Haddad-Boubaker, S. SARS-CoV-2 infection in pediatric population before and during the Delta (B.1.617.2) and Omicron (B.1.1.529) variants era. Virol. J. 2022, 19, 144. [Google Scholar] [CrossRef] [PubMed]
  23. Chun, J.Y.; Jeong, H.; Kim, Y. Identifying susceptibility of children and adolescents to the Omicron variant (B.1.1.529). BMC Med. 2022, 20, 451. [Google Scholar] [CrossRef] [PubMed]
  24. Butowt, R.; Bilińska, K.; von Bartheld, C. Why Does the Omicron Variant Largely Spare Olfactory Function? Implications for the Pathogenesis of Anosmia in Coronavirus Disease 2019. J. Infect. Dis. 2022, 226, 1304–1308. [Google Scholar] [CrossRef] [PubMed]
  25. Looi, M.K. How are COVID-19 symptoms changing? BMJ 2023, 380, 3. [Google Scholar] [CrossRef]
  26. Marquez, C.; Kerkhoff, A.D.; Schrom, J.; Rojas, S.; Black, D.; Mitchell, A.; Wang, C.Y.; Pilarowski, G.; Ribeiro, S.; Jones, D.; et al. COVID-19 Symptoms and Duration of Rapid Antigen Test Positivity at a Community Testing and Surveillance Site During Pre-Delta, Delta, and Omicron BA.1 Periods. JAMA Netw. Open 2022, 5, e2235844. [Google Scholar] [CrossRef] [PubMed]
  27. Rodriguez-Sevilla, J.J.; Güerri-Fernádez, R.; Bertran Recasens, B. Is There Less Alteration of Smell Sensation in Patients with Omicron SARS-CoV-2 Variant Infection? Front. Med. 2022, 9, 852998. [Google Scholar] [CrossRef]
  28. von Bartheld, C.S.; Wang, L. An Explanation for Reports of Increased Prevalence of Olfactory Dysfunction with Omicron: Asymptomatic Infections. J. Infect. Dis. 2023, jiad394. [Google Scholar] [CrossRef] [PubMed]
  29. Fernández-de-Las-Peñas, C.; Ortega-Santiago, R.; Fuensalida-Novo, S.; Martín-Guerrero, J.D.; Pellicer-Valero, O.J.; Torres-Macho, J. Differences in Long-COVID Symptoms between Vaccinated and Non-Vaccinated (BNT162b2 Vaccine) Hospitalized COVID-19 Survivors Infected with the Delta Variant. Vaccines 2022, 10, 1481. [Google Scholar] [CrossRef] [PubMed]
  30. Fernández-de-las-Peñas, C.; Cuadrado, M.L.; Gómez-Mayordomo, V.; Torres-Macho, J.; Pellicer-Valero, O.J.; Martín-Guerrero, J.D.; Arendt-Nielsen, L. Headache as a COVID-19 onset symptom and post-COVID-19 symptom in hospitalized COVID-19 survivors infected with the Wuhan, Alpha, or Delta SARS-CoV-2 variants. Headache 2022, 62, 1148–1152. [Google Scholar] [CrossRef] [PubMed]
  31. Alexandar, S.; Mathesan, R.; Raju, S.K.; Jakkan, K. A Comprehensive Review on COVID-19 Delta variant. Int. J. Pharmacol. Clin. Res. 2021, 5, 83–85. [Google Scholar]
Figure 1. Flowchart of selection process of samples for sequencing and downstream analysis from 31 August 2020–7 May 2022.
Figure 1. Flowchart of selection process of samples for sequencing and downstream analysis from 31 August 2020–7 May 2022.
Covid 04 00003 g001
Figure 2. Phylogeny of the study samples (n = 1717 sequences with lengths > 27,000 BP) colored by clade in Auspice by Nextstrain. Pango lineage AY.122.
Figure 2. Phylogeny of the study samples (n = 1717 sequences with lengths > 27,000 BP) colored by clade in Auspice by Nextstrain. Pango lineage AY.122.
Covid 04 00003 g002
Figure 3. Genetic diversity across the sequence of the SARS-CoV-2 genome in the study group (a) vs. GSAID sequences in North America (b).
Figure 3. Genetic diversity across the sequence of the SARS-CoV-2 genome in the study group (a) vs. GSAID sequences in North America (b).
Covid 04 00003 g003
Figure 4. Complete Pango lineage AY.122 subset from 2021.
Figure 4. Complete Pango lineage AY.122 subset from 2021.
Covid 04 00003 g004
Figure 5. (a) Total tests administered each month by age category; (b) fraction of administered tests that were positive per month per age group; (c) number of cases (for n > 2) assigned to a clade each month.
Figure 5. (a) Total tests administered each month by age category; (b) fraction of administered tests that were positive per month per age group; (c) number of cases (for n > 2) assigned to a clade each month.
Covid 04 00003 g005
Figure 6. Timeline of weekly newly confirmed cases in Idaho. Weekly cases are shown from 13 March 2020 to April 2023. Key changes to the public policy decisions and interventions are indicated in lower-case letters, a to z, across the timeline. Notes referring to timepoints a to z are shown below the graph. Stage 1: retail stores, places of worship, daycare centers, and youth activities are allowed to reopen if they can maintain appropriate social distancing. Stage 2: gatherings of more than 10 people are not allowed. Social distancing and masks should be utilized during any gatherings indoors and outdoors. Stage 3: gatherings should be limited to 50 or fewer people. Face coverings are strongly recommended and are required for long-term care facilities. Bars, restaurants, and nightclubs should continue to operate with seating. Everyone should follow social distancing and sanitation recommendations. Stage 4: no suggested limits on gathering sizes, but recommendations to adhere to social distancing and sanitation guidelines. Masks are required in long-term care facilities. Vaccines are encouraged for all eligible people and masks are strongly recommended in accordance with the CDC prevention guidance.
Figure 6. Timeline of weekly newly confirmed cases in Idaho. Weekly cases are shown from 13 March 2020 to April 2023. Key changes to the public policy decisions and interventions are indicated in lower-case letters, a to z, across the timeline. Notes referring to timepoints a to z are shown below the graph. Stage 1: retail stores, places of worship, daycare centers, and youth activities are allowed to reopen if they can maintain appropriate social distancing. Stage 2: gatherings of more than 10 people are not allowed. Social distancing and masks should be utilized during any gatherings indoors and outdoors. Stage 3: gatherings should be limited to 50 or fewer people. Face coverings are strongly recommended and are required for long-term care facilities. Bars, restaurants, and nightclubs should continue to operate with seating. Everyone should follow social distancing and sanitation recommendations. Stage 4: no suggested limits on gathering sizes, but recommendations to adhere to social distancing and sanitation guidelines. Masks are required in long-term care facilities. Vaccines are encouraged for all eligible people and masks are strongly recommended in accordance with the CDC prevention guidance.
Covid 04 00003 g006aCovid 04 00003 g006b
Table 1. Demographics of patients surveilled for SARS-CoV-2 sequence study.
Table 1. Demographics of patients surveilled for SARS-CoV-2 sequence study.
No. Cases% Cases
Gender
Females92754.00%
Males78045.40%
Not identified/not recorded100.58%
Age Category, Years
<5231.3%
5–1730017.5%
18–2566538.7%
26–4038222.2%
>4033919.7%
Not indicated80.5%
Race
American Indian or Alaska Native110.6%
Asian613.6%
Black or African American362.1%
Hispanic or Latino583.4%
Native Hawaiian or Pacific Islander150.9%
Other or not indicated21514.5%
White132176.9%
Ethnicity
Hispanic or Latino1146.6%
Not Hispanic or Latino132277.0%
Unknown or not indicated28116.4%
Category
Boise State campus community67239.2%
Boise State NCAA athletics774.5%
NNU campus1257.3%
NNU community211.2%
Local Boise community members53030.9%
School-aged and pre-school27416.0%
State government140.8%
Table 2. Symptoms by clade and statistical results. Independence between the symptom group and clade was assessed using Fisher’s exact test (separately for each symptom group) and adjusting the overall p-value using the false discovery rate [13]; where the adjusted p-value was <0.05, we compared the expected counts to the observed for cells where the contribution to the overall chi-squared value was >3. Individuals could indicate up to 10 symptoms at testing. N = 1522 individuals reporting symptoms or reporting being asymptomatic. Yellow shading indicates that the clade has more cases than expected by chance alone reporting this symptom, while blue shading indicates that the clade has fewer cases than expected by chance alone reporting this symptom.
Table 2. Symptoms by clade and statistical results. Independence between the symptom group and clade was assessed using Fisher’s exact test (separately for each symptom group) and adjusting the overall p-value using the false discovery rate [13]; where the adjusted p-value was <0.05, we compared the expected counts to the observed for cells where the contribution to the overall chi-squared value was >3. Individuals could indicate up to 10 symptoms at testing. N = 1522 individuals reporting symptoms or reporting being asymptomatic. Yellow shading indicates that the clade has more cases than expected by chance alone reporting this symptom, while blue shading indicates that the clade has fewer cases than expected by chance alone reporting this symptom.
Symptom Group 121A Delta21C Epsilon21I Delta21J Delta21K Omicron21L Omicron22C Omicron# CasesChi-
Squared Value 3
Adj. p-Value 4
Asymptomatic0.9%1.6%2.3%48.9%37.6%4.1%0.9%4445.810.444
Congestion0.3%1.6%3.8%50.2%36.4%3.8%0.6%62720.840.009
Cough0.6%0.0%3.1%48.7%40.4%4.7%1.2%51511.670.051
Fatigue0.0%0.6%1.9%38.9%46.3%9.9%1.2%16218.910.030
Fever1.0%0.6%3.8%49.5%40.0%3.5%0.3%3156.600.444
Headache1.0%0.3%5.2%48.7%40.6%2.6%1.0%31015.300.051
Loss of Taste/Smell0.0%2.0%4.0%84.0%8.0%0.0%0.0%5033.290.003
Muscle aches0.9%0.9%3.7%48.2%42.7%1.8%0.5%2185.610.444
Nausea0.0%1.7%6.7%53.3%36.7%0.0%1.7%608.280.199
Sore Throat0.7%0.0%2.0%34.2%55.6%5.6%1.1%4442.720.928
Number of Cases 29174070563364131522
1: Chest pain (0 reports), diarrhea (16 reports), mental symptoms (12 reports), shortness of breath (33 reports), and COVID-19 exposure (9 reports) were not included. 2: Clades with a poor representation, or occurring prior to consistent symptom collection, were not used in this analysis (20A, 20B, 20G, and 20I). 3: All chi-squared tests have six degrees of freedom. 4: Fisher’s exact p-value was adjusted by false discovery rate [13].
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Chase, J.R.; Bond, L.; Vail, D.J.; Sengthep, M.; Rodriguez, A.; Christianson, J.; Hudon, S.F.; Oxford, J.T. Genomic Surveillance of SARS-CoV-2 Sequence Variants at Universities in Southwest Idaho. COVID 2024, 4, 23-37. https://doi.org/10.3390/covid4010003

AMA Style

Chase JR, Bond L, Vail DJ, Sengthep M, Rodriguez A, Christianson J, Hudon SF, Oxford JT. Genomic Surveillance of SARS-CoV-2 Sequence Variants at Universities in Southwest Idaho. COVID. 2024; 4(1):23-37. https://doi.org/10.3390/covid4010003

Chicago/Turabian Style

Chase, Jennifer R., Laura Bond, Daniel J. Vail, Milan Sengthep, Adriana Rodriguez, Joe Christianson, Stephanie F. Hudon, and Julia Thom Oxford. 2024. "Genomic Surveillance of SARS-CoV-2 Sequence Variants at Universities in Southwest Idaho" COVID 4, no. 1: 23-37. https://doi.org/10.3390/covid4010003

APA Style

Chase, J. R., Bond, L., Vail, D. J., Sengthep, M., Rodriguez, A., Christianson, J., Hudon, S. F., & Oxford, J. T. (2024). Genomic Surveillance of SARS-CoV-2 Sequence Variants at Universities in Southwest Idaho. COVID, 4(1), 23-37. https://doi.org/10.3390/covid4010003

Article Metrics

Back to TopTop