Next Article in Journal
Genetic Diversity and Forensic Parameters of 27 Y-STRs in Two Mestizo Populations from Western Mexico
Previous Article in Journal
Alternative Splicing Events and ABA Hormone Regulation in Drought Response of Hippophae gyantsensis L.
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

SARS-CoV-2 Genetic Variants Identified in Selected Regions of Ethiopia Through Whole Genome Sequencing: Insights from the Fifth Wave of COVID-19

1
Aklilu Lemma Institute of Pathobiology, Addis Ababa University, Addis Ababa P.O. Box 1176, Ethiopia
2
Ethiopian Public Health Institute, Addis Ababa P.O. Box 1242, Ethiopia
3
Armaur Hansson Research Institute, Addis Ababa P.O. Box 1005, Ethiopia
4
College of Veterinary and Animal Science, University of Gondar, Gondar P.O. Box 196, Ethiopia
5
Africa Centres for Disease Control and Prevention (Africa CDC), Addis Ababa P.O. Box 3243, Ethiopia
6
Genomics and Metagenomics Platform, Pasteur Institute, Abidjan P.O. Box 75015, Côte d’Ivoire
7
Laboratory of Molecular Virology, International Centre for Genetic Engineering and Biotechnology (ICGEB), Padriciano 99, 34149 Trieste, Italy
8
Bio and Emerging Technology Institute, Addis Ababa P.O. Box 5954, Ethiopia
*
Author to whom correspondence should be addressed.
Genes 2025, 16(3), 351; https://doi.org/10.3390/genes16030351
Submission received: 13 February 2025 / Revised: 19 February 2025 / Accepted: 23 February 2025 / Published: 18 March 2025
(This article belongs to the Section Viral Genomics)

Abstract

:
Background: The COVID-19 pandemic highlighted SARS-CoV-2 variants with increased transmissibility and immune evasion. In Ethiopia, where cases surged, the understanding of the virus’s dynamics was limited. This study analyzed SARS-CoV-2 variants during the fifth wave, crucial for guiding vaccines, therapeutics, diagnostics, and understanding disease severity. Method: From June to August 2022, 150 SARS-CoV-2-positive samples were randomly selected from the Ethiopian Public Health Institute repository. Sixty-three high-quality genome sequences were analyzed. Results: Of the 63 sequences, 70% were from males and 30% from females, with a median age of 34. Omicron dominated (97%, 61/63), primarily clade 22A (64%, 40/63), followed by 22B (18%, 11/63) and 21K (14%, 9/63). Delta accounted for 3.2% (2/63). Omicron was identified in all (25) vaccinated study participants. Ethiopian sequences showed limited evolutionary divergence and lower genetic diversity compared to global sequences. Conclusion: Omicron was the predominant variant during Ethiopia’s fifth wave, indicating recent community transmission. Despite minor genetic diversity differences, ongoing surveillance remains critical for tracking variants and informing public health interventions.

1. Introduction

Coronavirus disease (COVID-19) is an infectious disease caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) which originated in Wuhan City, Hubei Province, in China [1]. SARS-CoV-2 is transmitted from person to person through the inhalation of respiratory droplets from an infected individual or through direct contact with infected droplets through the eyes, mouth, or nose. However, transmissibility, incubation period, and infectivity vary depending on the variants and individual immunity [2].
Coronaviruses are enveloped RNA viruses belonging to the Coronaviridae family, characterized by a positive single-stranded genome exceeding 29,891 nucleotides, encoding for 9890 amino acids [3]. Their genome includes numerous open reading frames (ORFs) that produce structural (SPs) and non-structural proteins (NSPs) essential for the virus’s life cycle and pathogenesis [4]. Structural proteins include the spike (S), envelope (E), membrane (M), and nucleocapsid (N) proteins, while non-structural proteins (nsp1–16) facilitate viral metabolism and interaction with the host immune system [5]. The spike glycoprotein of SARS-CoV-2 is cleaved into two subunits, S1 and S2, facilitating virus–cell membrane fusion [6]. The S1 subunit contains receptor-binding and N-terminal domains, crucial for viral entry and potential targets for neutralization. Host protease primes the S2 subunit [6]. Viral mutations can lead to new pathogenic variants affecting transmission, disease severity, and vaccine efficacy [7].
The COVID-19 pandemic was first reported in Africa on 14 February 2020, with the initial confirmed case detected in Egypt [8]. Ethiopia confirmed its first COVID-19 case in March 2020. The wide spread of COVID-19 led the World Health Organization (WHO) to declare the disease as a pandemic [9]. On 8 April 2020, the national government of Ethiopia declared a five-month state of emergency but allowed economic activities to continue during the public health crisis [10]. Ethiopia has experienced multiple pandemic waves, similar to the global trend. The first wave occurred from July to December 2020, followed by the second wave from January to July 2021. The third wave was recorded between July and December 2021, while the fourth wave spanned from December 2021 to April 2022. The fifth wave then emerged from June to August 2022. As of April 2024, from the total of more than 4 million tested, 501,157 positive cases were reported. From these, 488,171 and 7574 (%) individuals recovered and died, respectively. And only 38% of the population received at least one dose of a COVID-19 vaccine [11].
A sero-epidemiological survey conducted in Addis Ababa and Jimma between August 2020 and April 2022 revealed that over 96% of the study group had been exposed to SARS-CoV-2 at least once [12]. This figure indicates a high level of exposure across the population. Similarly, most Ethiopians have had multiple exposures to SARS-CoV-2, leading to high antibody titers with slow decay characteristics. This suggests the development of hybrid immunity due to recurrent infections with different variants and vaccination among many individuals [13].
At the beginning of the pandemic, Ethiopia, like many countries, primarily dealt with the initial SARS-CoV-2 strains. The virus’s rapid spread and the emergence of variants were closely monitored as part of global efforts to understand the virus’s behavior and adapt public health measures. As the virus continued to circulate, variants began to emerge. Variants of concern (VOCs) like Alpha (B.1.1.7), Beta (B.1.351), Gamma (P.1), Delta (B.1.617.2), and Omicron (B.1.1.529) have been identified in Africa [14,15]. In Ethiopia, similar variants have likely been detected [16]. Unfortunately, SARS-CoV-2 in the country was not sequenced during the actual pandemic period. This was due to a lack of laboratory infrastructure, skilled human resources, and reagents and kits. However, due to the relative improvement of sequencing infrastructure in the country, this project sequenced 63 retrospective samples from the national repository of the Ethiopian Public Health Institute collected during the fifth pandemic wave.
The dynamics of SARS-CoV-2 variants were not consistently monitored as part of the national surveillance systems in Ethiopia. This information is crucial for informing public health interventions, monitoring the genomic epidemiology of the virus, identifying specific regional challenges, and supporting research and collaboration [13]. Only 824 SARS-CoV-2 samples were sequenced and submitted to GISAID from Ethiopia to date. This is relatively very small compared to other African countries like South Africa (56,886), Kenya (13,770), Egypt (5146), and Tunisia (2915). Out of 824 SARS-CoV-2 samples sequenced and submitted, 63 were for this project. This shows how SARS-CoV-2 sequencing activities were not conducted during the actual outbreak period. So, any information generated in this manuscript is very novel for the Ethiopian condition. The global science community can get information about what was the situation of SARS-CoV-2 variants in Ethiopia during the 5th wave of the pandemic from this manuscript as well as from the GISAID/NCBI submissions we made. This was the main rationale for initiating this study project. Therefore, this study was conducted to investigate the SARS-CoV-2 variants that circulated during the fifth wave of the pandemic in selected regions of Ethiopia.

2. Materials and Methods

2.1. Study Design, Period, and Sample Collection

A retrospective experimental study design was used. The study design was a retrospective experimental design. Sample collection was undertaken during the fifth wave of the pandemic (June to August, 2022) in Ethiopia. A total of 150 SARS-CoV-2-positive samples were collected retrospectively from the national virology reference laboratory repository at the Ethiopian Public Health Institute (EPHI). These samples were nasopharyngeal (NP) swab samples collected from patients residing in Addis Ababa, Oromia, Amhara, and Southern Nation Nationalities and Peoples (SNNP) regions (Figure 1). EPHI is a national reference laboratory for SARS-CoV-2 in Ethiopia, and samples were brought from those geographic areas, as well as from diplomatic communities and international travelers, for viral testing. Samples were stored at −80 °C for long-term storage. Samples for sequencing were purposively selected based on their identifiers (IDs) from the electronic national database. Only the IDs that tested positive and had a cycle threshold (Ct) value less than or equal to 30 were selected. As a result, 150 SARS-CoV-2-positive samples were included in this study due to our limited sequencing resources. The patients’ metadata, including demographic information, clinical data, and RT-qPCR results, were anonymously extracted from electronic databases. The study participants were then grouped based on factors such as age, sex, signs and symptoms, pre-existing chronic diseases, travel history, and vaccination status.

2.2. Quality Assurance

To increase the success rate of whole genome sequencing, an RT-qPCR test was performed again after the samples were thawed at room temperature [17]. The RNA from these samples was re-extracted using the BGI extraction kit, and detection was performed using a BIO-RAD CFX96 Deep Well™ Real-Time PCR Detection system (Bio Rad Laboratories, Inc., Hercules, CA, USA) following the standard protocol. The clinical samples were handled and processed in a Biosafety Laboratory-2 (BSL-2) at the national virology reference laboratory. Out of the 150 re-analyzed samples, 70 had a Ct value less than or equal to 30 and were selected for whole genome sequencing, and others were discarded due to their high Ct values (i.e., >30). The RNA extracts from these 70 samples were sequenced using the Illumina NextSeq 550 platform, San Diego, CA, USA, at the virology laboratory of EPHI during March 2023. Out of 70 SARS-CoV-2 sequence data, 63 sequences passed the bioinformatics quality controls requirements (i.e., with genome coverage greater than 80%, no clustered mutations, and no misplaced stop codons) and were selected for further sequence analysis.

2.3. RNA Extraction and qRT-PCR

Total RNA was extracted from 300 μL nasopharyngeal samples using MegaBio plus Virus Purification Kit II (BSC87S1E), (Hangzhou Bioer Technology Co., Ltd., Hangzhou, China), using the Bioner Gene Pure Pro fully automatic Nucleic Acid Purification System (NPA-32P) per the manufacturer’s instructions [18]. Real-time qRT-PCR was performed to confirm the presence of the virus in the samples using primers and probes corresponding to SARS-CoV-2 with the QIAamp Viral RNA Mini Kit (Qiagen, Shanghai, China) according to the manufacturer’s protocol [19]. qRT-PCR was performed using the BIO-RAD CFX96 Deep Well™ Real-Time PCR Detection system (Bio-Rad Laboratories, Inc., USA). All samples were tested for the human RNase P gene as an internal control.

2.4. Complementary DNA (cDNA) Synthesis, DNA Library Preparation, and Whole Genome Sequencing

Seventy RNA extracts, which exhibited low Ct values (Ct < 30) in quantitative real-time PCR assays, were chosen for library preparation. The cDNA synthesis, SARS-CoV-2 sequence enrichment, library amplification, and indexing were performed using the Illumina COVIDSeq test (RUO) (Illumina, USA) following the manufacturer’s instructions. The resulting libraries were pooled, normalized, and quantified using a Qubit® 4 fluorometer (Thermo Fisher Scientific, Waltham, MA, USA) [20] before paired-end sequencing was performed on a NextSeq system (Illumina, USA) with a NextiSeq 500/550 cycle V3 kit (Illumina, USA).

2.5. Sequence and Metadata Analysis

The metadata were exported from the national database and converted into Excel format. The data were encoded into STATA v17, and descriptive analyses were performed. Raw sequencing data were obtained in FASTQ format and processed for variant calling and consensus sequence generation. First, the FASTQ sequences’ quality was evaluated by the FastQC (v0.11.9) program, and then adapter sequences were trimmed from reads using Cutadapt software v.4.6 [21] to remove adaptor sequences and poor-quality reads. An average base quality of Q30 was used for trimming. The trimmed reads were then mapped against the SARS-CoV-2 WuhanHu-1 reference genome sequence (GenBank accession No. NC_045512.2) using the Burrows–Wheeler Alignment BWA mem v0.7.12 [22]. The consensus sequence was then generated using Samtools version 1.12. Moreover, SARS-CoV-2 variants were called with locally installed Pangolin (daily updated version [23], and mutations were identified with online Nextclade v3.8.2 Clade assignment, mutation calling, and sequence quality checks [24]. All viral genome sequences obtained from this study were submitted to the Global Initiative on Sharing All Influenza Data (GISAID) with accession IDs from EPI_ISL_19,147,618 to EPI_ISL_19302657 [25] and National Center for Biotechnology Information (NCBI) repositories with accession numbers from PQ140559 to PP069552 [26].

2.6. Computation of SNP Distances in Samples to the SARS-CoV-2 Reference Genome

SNP-dist v0.70 [27] was used to determine the number of SNPs in the 63 sequences compared to the SARS-CoV-2 reference genome. The hCoV-19/Wuhan/WIV04/2019 sequence from Wuhan, China, which consists of 29,891 base pairs, was used as a reference genome obtained from the GISAID. The snp-dist tool calculates the SNP distance matrix from a dataset of multiple sequence alignments (MSAs), where the sequences are of the same length. These distance matrices are commonly used in studies of infectious disease outbreaks. For this analysis, we only considered informative bases (A, T, C, and G) and not Ns. MSA was conducted using MAFFT v7.526 [28], employing the multiple alignment technique with the Fast Fourier Transform. The following parameters were applied: a gap opening penalty of 1.53 for group-to-group alignment and a gap extension penalty of 0.00 for group-to-group alignment. The block substitution matrix (BLOSUM) was selected, with a coefficient of 62, and the L-INS-I alignment method of MAFFT was chosen, known for its accuracy in alignments of ≤200 sequences.

2.7. Evolutionary Divergence Between Sequences and Overall Sequence Pairs

To estimate the evolutionary divergence between sequences as well as the average evolutionary divergence over all sequence pairs, MSA of the 63 sequences was performed using MAFFT v7.526 [28]. The output of the alignment was used to construct the phylogenetic tree via IQ-tree version v2.3.5-3 [29]. All ambiguous positions were removed for each sequence pair (pairwise deletion option). There was a total of 29,921 positions in the final dataset. The pairwise and overall mean (average) distances were used as proxies for evolutionary divergence.
For comparison purposes, the overall evolutionary divergence among all sequence pairs obtained from the GISAID uploaded during and before the study period was calculated. This analysis included the WIV04 genome and a total of 118 SARS-CoV-2 genome sequences from 62 countries. The sequences encompassed various variants, including former VOCs present in neighboring countries and different regions worldwide (Table 1).

2.8. Phylogenetic Analysis of SARS-CoV-2 Genome Sequences

The IQ-TREE v2.3.5.3 was used to compute the phylogenetic inference of the 63 SARS-CoV-2 genome sequences [29]. In addition to these 63 sequences, after the consensus sequences obtained in this research were aligned, the top 118 SARS-CoV-2 genomes were selected and downloaded from GISAID for phylogenetic analysis. First, an alignment analysis using MAFFT v7.526 was performed [28]. The MSA which had 30,058 nucleotide sites was then imported to IQ-TREE. The phylogeny involved determining the best-fit DNA substitution model using Model Finder [6], which tested 88 DNA models. The initial parsimony tree was created by the phylogenetic likelihood library (PLL). The model chosen according to the Bayesian Information Criterion (BIC) score (115,726.5435) was the TIM + F + I + G4. The model of rate heterogeneity was as follows: proportion of invariable sites: 0.8736 and Gamma shape alpha: 0.9901 with 4 categories. The rate parameter (R) was as follows: A-C: 1.0000, A-G: 1.6270, A-T: 0.5506, C-G: 0.5506, C-T: 3.9021, and G-T: 1.0000. A bootstrap analysis using the UFBoot2 method with 1000 replicates and ML heuristic method was performed. Additionally, branch lengths were optimized through ML on the original alignment. The initial trees for the heuristic search were generated using BioNJ algorithms. From these analyses, we selected the phylogenetic tree with the highest log-likelihood value (−55,910.033). The resulting phylogenetic trees were saved in Newick format and visualized using the Interactive Tree of Life interface [30]. Statistical analysis of the demographic and clinical information was performed using STATA v17. Frequency distribution and tabulation were performed to show the relationship between outcome and independent variables.

3. Results

3.1. Demographics and Clinical Characteristics of the Study Subjects

Out of 150 samples collected, 70 samples had a Ct value of less than 30 and were sequenced. Out of 70 samples sequenced, 63 samples passed the bioinformatics quality control. Of the 63 genome sequences obtained, the samples were collected from patients in the age range of 14 to 75 years, with a median age of 34 years. Of the study participants, 32% (20/63) belong in the age group 21–30 years old, followed by 21% (13/60) in 31–40 years and 16% (10/63) in 41–50 years old. Females accounted for 30% (19/63) and males for 70% (44/63). The majority of the sequenced samples were collected from Addis Ababa (70%; 44/63), followed by Oromia (24%; 15/6), Amhara (5%; 3/63), and Southern Nation Nationalities Region (SNNPR) (2%; 1/63). In terms of clinical data, 70% (44/63) of the study participants exhibited signs and symptoms of cough, fever, and anosmia, and 43% (27/63) of participants reported they had known chronic disease. In addition, 13% (8/63) had a travel history outside Ethiopia, delayed 8–11 days before testing, and approximately 40% (25/63) reported having a vaccination history (Table 2).

3.2. Sequencing and Sequence Analysis

The majority of the sequences (99.8%) had high quality, with gap sizes ranging from 20 to 114 nt, accounting for approximately 0.4% of the total sequence. On average, each genome had 69.21 single nucleotide variations (SNVs) in different genes of SARS-CoV-2. The highest numbers of single nucleotide mutations per clade of SARS-CoV-2 were identified in 22A and 21K (Figure 2).
Nextclade variant analysis shows that out of the 63 sequences examined, on average, 69 nucleotide mutations were identified across the samples in comparison to the Wuhan reference genome (Figure 3).
This analysis also detected an average of 198 nucleotide deletions, 68 substitutions, and 1 insertion in different genes of SARS-CoV-2 (Supplementary Table S1). These SARS-CoV-2 nucleotide mutations were observed in different positions of S, M, and N genes and genes coding for non-structural proteins (nsp1, nsp6, nsp9, nesp10, nsp13, ORF3a, and ORF7a) (Supplementary Materials). Among them, 371 nucleotide deletions were observed in the S gene in the positions of 25–27, 69–70, 124–144, 157–158, 212, 374–376, 415–432, 446–464, and 758–767, followed by 177 deletions in N in the position of 31–33 and 168 deletions in the nsp6 gene in the position of 105–108.
This study identified several mutations in the PLpro (papain-like protease), including K38R, V1069I, A1892T, D821N, E906A, T24I, G489S, I684V, S370L, V932L, P985S, T703I, A1179V, P822L, A488S, P1228L, and P1469S. Additionally, this study explores the presence of the P132H mutation in the 3CLpro (main protease), and P323L mutation in RdRP.
In this study, the transition/transversion bias (R) was 1.693, indicating that transitions occur more frequently than transversions. The base substitution rates (r) for each nucleotide pair were as follows: A => T (0.06), A => C (0.03), A => G (0.07), T => A (0.05), T => C (0.17), T => G (0.03), C => A (0.05), C => T (0.29), C => G (0.03), G => A (0.11), G => T (0.06), and G => C (0.03). These results show that thymine (T) has the highest substitution rate with cytosine (C) (0.17) and the lowest with guanine (G) (0.03). Cytosine (C) has the highest substitution rate with thymine (T) (0.29), while its rates with adenine (A) and guanine (G) are relatively low (both 0.03). Adenine (A) and guanine (G) have generally similar substitution rates across different nucleotides, with A => G having the highest rate among adenine substitutions and G => A having the highest rate among guanine substitutions. In this study, the average Ti/Tv ratio was 1.9734, ranging from 1.5862 to 2.4737, and the most common nucleotide change was the G > A transition.

3.3. SARS-CoV-2 Variants Identified

This study found that omicron was the dominant variant (97%, 61/63) identified in the total sample sequenced during the fifth wave of the pandemic in selected regions of Ethiopia, followed by Delta variants (3%, 2/63). All of the identified cases of the Omicron variant belonged to four age groups (<20, 21–30, 31–40, 41–50, 51–60, and >60 years old). In the age groups of 21–30 and 41–50 years old, Delta accounted for 5% and 10% of the identified cases, respectively, and the remaining infections were caused by Omicron variants. In addition, all of the 25 individuals who were vaccinated before testing were infected with Omicron variants. Out of the 38 non-vaccinated study participants, 95% (36/38) were infected with Omicron, and 5.3% (2/38) were identified with Delta variants (Table 3).
Almost comparable numbers of vaccinated and unvaccinated participants were infected with the BA.4.1 and BA.4.1.1 sub-lineages of SARS-CoV-2 (Figure 4). According to the Nextstrain classification, the dominant clades identified during the study period were 22A (64%, 40/63) and 22B (18%, 11/63), followed by 21K (Omicron; 14%, 9/63), 21J (Delta, 3.2%, 2/63), and 21L (Omicron; 1.6%, 1/63). Additionally, sub-lineages BA.4.1 (30.2%, 19/63) and BA.4.1.1 (28.5%, 18/63) were more prevalent than BA.1.1 (9.5%, 6/63), BA.5.2 (7.9%, 5/63), BF.2 (6.4%, 4/63), and BA.4 (4.8%, 3/63). Interestingly, 90% of the sub-lineages belonged to different strains within the BA lineage, which represents a specific genetic variant or group of closely related variants of the SARS-CoV-2 virus. Additionally, 6.4% (4/63) of the samples were part of the BF lineage, while the AY and B lineages each accounted for 1% of the samples.

3.4. Evolutionary Divergence and Phylogenetic Analysis of SARS-CoV-2

The evolutionary analysis among sequenced data for this study showed there is evidence of a phylogenetic relationship in some of the sequences. The evolutionary divergence between sequences ranges from 0.00 to 3.45 × 10−3 substitution per site. Looking at the broader analysis, the average evolutionary divergence across all sequence pairs is similarly low. For sequences from Ethiopia, the overall distance is calculated to be 9.78 × 10−4 substitutions per site, with a standard error of 8.47 × 10−5 substitutions per site. In comparison, when considering selected global sequences, the overall distance slightly increases to 1.54 × 10−3 substitutions per site, with a standard error of 1.19 × 10−4 substitutions per site. In addition, the maximum likelihood phylogenetic tree from our study, comprising 63 SARS-CoV-2 genomes, demonstrated seven main dominant clades in the viral population (Figure 5).

4. Discussion

COVID-19, caused by the SARS-CoV-2 virus, first emerged in Wuhan, China, before spreading globally, including to Ethiopia. SARS-CoV-2 is primarily transmitted through respiratory droplets and direct contact. The spread of the virus is influenced by various factors, including sunlight, climate, temperature, humidity, and population demographics [31]. Asymptomatic and pre-symptomatic individuals play a significant role in the transmission of the virus [32]. It possesses a large RNA genome that encodes various proteins essential for its life cycle and pathogenesis, with the spike protein being particularly important for viral entry and a key target for neutralization. Mutations in the virus can lead to the emergence of new variants that may impact transmission rates, disease severity, and vaccine efficacy [33].
In Ethiopia, the first COVID-19 case was reported in March 2020, and by April 2024, the country had recorded over 500,000 cases, with approximately 38% of the population vaccinated [34,35]. A sero-epidemiological survey indicated high exposure rates among the population [12]. While Ethiopia initially contended with early virus strains, variants of concern emerged globally, although local sequencing efforts were hampered by infrastructure challenges. This study aims to investigate the SARS-CoV-2 variants circulating during the fifth pandemic wave in selected regions of Ethiopia, addressing a critical gap in the country’s genomic surveillance capabilities. The findings from this study provided significant insights into the demographics, clinical characteristics, and SARS-CoV-2 variants circulating in selected regions of Ethiopia during the fifth wave of the pandemic.
In this study, the age distribution of the study participants ranged from 14 to 75 years old, with a mean of 34 years old, revealing significant insights into the demographic distribution of COVID-19 cases within the sample. Notably, 32% of participants were aged 21–30 years, followed by 21% in the 31–40 age group and 16% in the 41–50 age range. This age distribution suggests a predominance of younger individuals in the sample. This finding is aligned with a previous study conducted in Ethiopia by Sisay A. et al. [16]; a high number of study participants infected with SARS-CoV-2 belonged to the age group of 21 to 30. However, different findings by Varnica B. et al. [33] and Carmola L.R. et al. [34] reported that the elderly population was the most significantly afflicted in terms of the number of COVID-19 cases and deaths. These differences might be due to the predominance of younger individuals in our study sample, which has the potential for the underrepresentation of older populations.
The geographical distribution of participants, with the majority from Addis Ababa, highlights the urban concentration of cases, which is consistent with a study that analyzed COVID-19 infections in Zimbabwe and emphasizes that urban areas, characterized by condensed residences, tend to expedite SARS-CoV-2 transmission, leading to a higher total number of confirmed cases [36]. This is echoed by Sharifi and Khavarian-Garmsir [37], who discussed how urban planning and design can significantly influence the infection and mortality rates of COVID-19 and suggested that urban centers are more susceptible to outbreaks due to their structural characteristics. Moreover, they added that the dynamics of urban life, including increased mobility and social interactions, further exacerbate the spread of the virus.
The clinical presentation of symptoms such as cough, fever, and anosmia in 70% of participants underscores the typical manifestations associated with SARS-CoV-2 infections. This finding is consistent with the study by Elavia et al. [38]. They reported that fever was observed in approximately 88% of cases, while cough was presented in about 68% of infected individuals. They suggested that these symptoms are critical for clinicians to recognize, as they are often the first indicators prompting testing for SARS-CoV-2. Furthermore, similar findings demonstrated that fever and cough, along with anosmia, serve as independent predictors of positive SARS-CoV-2 PCR results, reinforcing their significance in clinical assessments [39]. This also aligns with previous findings that highlight the prevalence of these symptoms in COVID-19 patients, emphasizing their importance in clinical diagnosis and public health surveillance, as shown by Sisay A. et al. [16] and Tarris G. et al. [40].
We observed that 70 out of 150 samples had Ct values below 30 after retesting. This is particularly noteworthy, as lower Ct values are indicative of higher viral loads [41]. This study also identified, from 70 samples that had a cycle threshold (Ct) value of less than 30 and were sequenced, 63 samples passing bioinformatics quality control. This is supported by the study presented in [42] stressing the necessity of rigorous bioinformatics analysis to eliminate low-quality reads that could lead to erroneous conclusions regarding viral evolution. This suggests that the high pass rate of samples in this study highlights that the methodologies employed were robust and effective, which is critical for genomic surveillance efforts.
In this project, the sequencing analysis revealed a high-quality yield from the RNA extracts, with 99.8% of the sequences being of high quality and only 0.4% comprising gaps. This underscores the robustness of the sequencing methodologies employed, which align with the standards set by various genomic surveillance initiatives [26,43]. The genomic analysis of SARS-CoV-2 reveals significant insights into the virus’s mutation landscape and its evolutionary dynamics. This study revealed, on average, approximately 69 single nucleotide variations (SNVs) per genome, particularly concentrated in clades 22A and 21K, which is similar to the study conducted by Saldivar-Espinoza et al. [44] that explored the mutational landscape of SARS-CoV-2 for the Omicron variant by early June 2022 and reported that the average number of SNVs per genome was around 72. In addition, the study by Harvey et al. [45] and Yang et al. [46], highlighted that the identification of high mutation rates, particularly in certain lineages, suggests that these variants may be under selective pressure, possibly due to host immune responses or therapeutic interventions.
The findings from the Nextclade variant analysis identified a total of 4429 nucleotide mutations, 12,664 deletions, 4392 substitutions, and 84 insertions across 63 SARS-CoV-2 sequences compared to the Wuhan reference genome. This provides critical insights into the genomic evolution of the virus [26]. This extensive mutational landscape reflects the ongoing adaptive changes that SARS-CoV-2 undergoes in response to selective pressures, including host immune responses and antiviral treatments [47]. In this study, significant nucleotide mutations were identified in critical genes such as the spike (S), membrane (M), and nucleocapsid (N) genes, as well as non-structural proteins (nsp1, nsp6, nsp9, nsp10, nsp13, ORF3a, and ORF7a). Similar studies have corroborated these findings, highlighting the high mutation rates observed in SARS-CoV-2. For instance, a study conducted in Turkey reported elevated mutation rates in the S and RdRP gene regions, consistent with the findings of this analysis [26,48]. Furthermore, research from Egypt indicated a significant presence of the D614G mutation, which is known to enhance viral infectivity and has been widely documented across various global studies [49].
Moreover, this study identified specific mutations in the papain-like protease (PLpro) and the main protease (3CLpro). This study is aligned with the studies conducted by Gao et al. [50] and Ismail et al. [51] that suggested the mutations K38R, V1069I, and P132H in PLpro, along with P323L in RdRP, could potentially alter the enzymatic activities of these proteins, impacting viral replication and immune evasion.
The phylogenetic analysis performed in this study indicates that the dominant circulating variants in Ethiopia during the study period (June to August, 2022) were primarily Omicron, with a smaller proportion of Delta variants. The Omicron variant (B.1.1.529) of SARS-CoV-2 was first identified in November 2021 in South Africa [52]. It was classified as a VOC by the WHO because of its high transmissibility and evasion from neutralizing antibodies induced by vaccination or natural infection with the wild-type virus [53]. As of 14 April 2022, the Omicron variant has been confirmed in at least 150 countries around the world [54]. From 3 January 2022 to 9 January 2022, the Omicron variant accounted for roughly 62.5% of all SARS-CoV-2 sequences available on the GISAID database [25]. In this study, the majority of the samples belonged to the BA lineage. This is consistent with findings reported from North India, 2022 [55], and the Philippines, 2022 [56], during the study period. The Centers for Disease Control and Prevention (CDC) has developed taxonomic nomenclatures (BA, BF, AY, and B lineages) to categorize different variants of SARS-CoV-2 [57]. A study conducted by Rauseo AM et al. [58] suggests understanding the distribution of these variants is crucial for public health interventions, including vaccination strategies and treatment approaches.
This study explored the evolutionary divergence among genetic sequences of SARS-CoV-2 and revealed a spectrum of genetic distances ranging from nearly identical matches to slight variations. Specifically, comparisons within Ethiopian sequences revealed a perfect match between three samples and a very small distance between others, indicating recent community transmission. On a broader scale, the average genetic distance among Ethiopian sequences was low, suggesting high genetic similarity within the country. This finding is similar to a previous study conducted in Ethiopia through whole genome sequences of SARS-CoV-2 isolates from Ethiopian patients [59].
In addition, in this analysis, a comparison of the Ethiopian sequences with selected global sequences showed that a slightly higher average distance was observed. This indicates slightly more genetic diversity among global sequences. These findings were contrary to the evolution of SARS-CoV-2 being characterized by a steady increase in divergence within major lineages and a stepwise increase associated with each successive new major lineage, leading to a faster overall rate of evolution [60]. However, another study that analyzed 27,977 SARS-CoV-2 sequences from 84 countries obtained throughout the pandemic showed that SARS-CoV-2 genetic diversity was remarkably low [61].

Limitations of This Study

One limitation of our study is the small sample size that limits the power to give more conclusive recommendations. Another limitation is geographical coverage. Due to financial and infrastructure limitations, the sequencing was conducted very late; however, the data are still novel for readers who are interested in the SARS-CoV-2 variant situation in Ethiopia.

5. Conclusions

In conclusion, this study on SARS-CoV-2 variants in Ethiopia during the fifth wave reveals critical insights into the demographics, clinical characteristics, and genomic evolution of the virus, highlighting the predominance of the Omicron variant, particularly sub-lineages BA.4.1 and BA.4.1.1. The findings indicate a significant male predominance among participants, consistent with global trends of higher severity in males, and underscore the typical clinical manifestations of COVID-19, such as cough, fever, and anosmia. The genomic analysis suggests minimal genetic divergence among local sequences, indicating stable transmission dynamics, while the presence of specific mutations raises concerns regarding vaccine efficacy and potential immune evasion. These results emphasize the necessity for ongoing genomic surveillance to monitor variant emergence and inform public health strategies, contributing to a broader understanding of the pandemic’s trajectory in Ethiopia and its implications for global health responses.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes16030351/s1, Table S1. SARS-CoV-2 nucleotide mutations in different genes position during the fifth wave the pandemic in Ethiopia.

Author Contributions

The contribution of the authors to the manuscript is as follows: Conceptualization: G.H. and M.L.; data curation: G.H. and D.H.A.; formal analysis and methodology: G.H., M.M.T., A.M. (Anargachew Mulu), D.H.A., A.M. (Alessandro Marcello), A.G., A.T., A.A. (Adimkewu Aguine), S.K.T., H.O., A.E.S., E.A., H.D., M.B., A.A. (Abaysew Ayele), E.A., G.M. and M.L.; investigation: G.H., M.M.T., D.H.A., A.G. and A.T.; supervision: M.L.; writing original draft: G.H.; writing—review and editing: G.H., M.M.T., A.M. (Andargachew Mulu), D.H.A., A.M. (Alessandro Marcello), A.G., A.T., A.A. (Adimkewu Aguine), S.K.T., H.O., H.D., A.A. (Abaysew Ayele), A.E.S., M.B., E.A., G.M. and M.L. All authors have read and agreed to the published version of the manuscript.

Funding

The sequencing of SARS-CoV-2 was done with the financial support of ICGEB-CRP with the grant code CRP/ETH21-01. This work was supported, in whole or in part, by the Bill & Melinda Gates Foundation [Grant No. INV-034,799]. Under the grant conditions of the Foundation, a Creative Commons Attribution 4.0 Generic License has already been assigned to the Author Accepted Manuscript version that might arise from this submission. The findings and conclusions contained within are those of the authors and do not necessarily reflect positions or policies of the Gates Foundation. The first author got financial support for this research from Addis Ababa University.

Institutional Review Board Statement

This study was conducted according to the guidelines of the Declaration of Helsinki. Ethical review and approval were obtained from the Institutional Review Board (IRB) of the Aklilu Lemma Institute of Pathobiology at Addis Ababa University, under protocol number ALIPB IRB/52/2013/21 dated 19 April 2021. A permission letter was obtained from the Ethiopian Public Health Institute to use leftover samples from repositories.

Informed Consent Statement

Patient consent was waived due to the retrospective nature of this study, as approved by the Institutional Review Board.

Data Availability Statement

The sequence data are available at GISAID and NCBI.

Acknowledgments

The authors are grateful to the Ethiopian Public Health Institute, Armauer Hansen Research Institute, and Addis Ababa University for their support, coordination, and providing whole genome sequencing laboratories. The Bio and Emerging Technology Institute (BETin) and the International Center for Genetic Engineering and Biotechnology (ICGEB) are highly acknowledged for technical and financial support. We also gratefully thank the authors and the originating and submitting laboratories for their sequences and metadata shared through GISAID. The authors would like to express their special thanks to all who participated in data collection and laboratory activities.

Conflicts of Interest

The authors declare no competing interests.

References

  1. Kumar, S.; Tao, Q.; Weaver, S.; Sanderford, M.; Caraballo-Ortiz, M.A.; Sharma, S.; Pond, S.L.K.; Miura, S. An Evolutionary Portrait of the Progenitor SARS-CoV-2 and Its Dominant Offshoots in COVID-19 Pandemic. Mol. Biol. Evol. 2021, 38, 3046–3059. [Google Scholar] [CrossRef] [PubMed]
  2. WHO Transmission of SARS-CoV-2: Implications for Infection Prevention Precautions. Available online: https://www.who.int/news-room/commentaries/detail/transmission-of-sars-cov-2-implications-for-infection-prevention-precautions (accessed on 19 February 2025).
  3. Farhud, D.D.; Mojahed, N. SARS-CoV-2 Notable Mutations and Variants: A Review Article. Iran. J. Public Health 2022, 51, 1494–1501. [Google Scholar] [CrossRef] [PubMed]
  4. Wu, A.; Peng, Y.; Huang, B.; Ding, X.; Wang, X.; Niu, P.; Meng, J.; Zhu, Z.; Zhang, Z.; Wang, J.; et al. Genome Composition and Divergence of the Novel Coronavirus (2019-nCoV) Originating in China. Cell Host Microbe 2020, 27, 325–328. [Google Scholar] [CrossRef]
  5. Huang, Y.; Yang, C.; Xu, X.-F.; Xu, W.; Liu, S.-W. Structural and functional properties of SARS-CoV-2 spike protein: Potential antivirus drug development for COVID-19. Acta Pharmacol. Sin. 2020, 41, 1141–1149. [Google Scholar] [CrossRef] [PubMed]
  6. Cryo-EM Structure of the SARS-Coronavirus Spike Glycoprotein in Complex with Its Host Cell Receptor ACE2|PLOS Pathogens. Available online: https://journals.plos.org/plospathogens/article?id=10.1371/journal.ppat.1007236 (accessed on 15 April 2024).
  7. Malone, B.; Urakova, N.; Snijder, E.J.; Campbell, E.A. Structures and functions of coronavirus replication–transcription complexes and their relevance for SARS-CoV-2 drug design. Nat. Rev. Mol. Cell Biol. 2022, 23, 21–39. [Google Scholar] [CrossRef] [PubMed]
  8. Africa CDC. Africa Identifies First Case of Coronavirus Disease: Statement by the Director of Africa CDC. Available online: https://africacdc.org/news-item/africa-identifies-first-case-of-coronavirus-disease-statement-by-the-director-of-africa-cdc/ (accessed on 9 June 2024).
  9. Cucinotta, D.; Vanelli, M. WHO Declares COVID-19 a Pandemic. Acta Biomed. 2020, 91, 157–160. [Google Scholar] [CrossRef]
  10. Ethiopia Confirms First Coronavirus Case. Available online: https://www.aa.com.tr/en/africa/ethiopia-confirms-first-coronavirus-case/1765130 (accessed on 15 April 2024).
  11. Ethiopia COVID—Coronavirus Statistics—Worldometer. Available online: https://www.worldometers.info/coronavirus/country/ethiopia/ (accessed on 7 May 2024).
  12. Gudina, E.K.; Ali, S.; Girma, E.; Gize, A.; Tegene, B.; Hundie, G.B.; Sime, W.T.; Ambachew, R.; Gebreyohanns, A.; Bekele, M.; et al. Seroepidemiology and model-based prediction of SARS-CoV-2 in Ethiopia: Longitudinal cohort study among front-line hospital workers and communities. Lancet Glob. Health 2021, 9, e1517. [Google Scholar] [CrossRef]
  13. Merkt, S.; Ali, S.; Gudina, E.K.; Adissu, W.; Gize, A.; Muenchhoff, M.; Graf, A.; Krebs, S.; Elsbernd, K.; Kisch, R.; et al. Long-term monitoring of SARS-CoV-2 seroprevalence and variants in Ethiopia provides prediction for immunity and cross-immunity. Nat. Commun. 2024, 15, 3463. [Google Scholar] [CrossRef] [PubMed]
  14. Tegally, H.; Wilkinson, E.; Giovanetti, M.; Iranzadeh, A.; Fonseca, V.; Giandhari, J.; Doolabh, D.; Pillay, S.; San, E.J.; Msomi, N.; et al. Detection of a SARS-CoV-2 variant of concern in South Africa. Nature 2021, 592, 438–443. [Google Scholar] [CrossRef]
  15. Viana, R.; Moyo, S.; Amoako, D.G.; Tegally, H.; Scheepers, C.; Althaus, C.L.; Anyaneji, U.J.; Bester, P.A.; Boni, M.F.; Chand, M.; et al. Rapid epidemic expansion of the SARS-CoV-2 Omicron variant in southern Africa. Nature 2022, 603, 679–686. [Google Scholar] [CrossRef]
  16. Sisay, A.; Tshiabuila, D.; Van Wyk, S.; Tesfaye, A.; Mboowa, G.; Oyola, S.O.; Tesema, S.K.; Baxter, C.; Martin, D.; Lessells, R.; et al. Molecular Epidemiology and Diversity of SARS-CoV-2 in Ethiopia, 2020–2022. Genes 2023, 14, 705. [Google Scholar] [CrossRef] [PubMed]
  17. Sisay, A.; Abera, A.; Dufera, B.; Endrias, T.; Tasew, G.; Tesfaye, A.; Hartnack, S.; Beyene, D.; Desta, A.F. Diagnostic accuracy of three commercially available one step RT-PCR assays for the detection of SARS-CoV-2 in resource limited settings. PLoS ONE 2022, 17, e0262178. [Google Scholar] [CrossRef]
  18. Bioer Technology Introduction. Available online: https://conference.medgenetics.ru/upload/Bioer%20Technology%20Product%20Intro-711_%D0%A0%D0%9C%D0%A2.pdf (accessed on 21 August 2023).
  19. QIAGEN QIAamp® Viral RNA Mini Handbook, July 2020—Google Search. Available online: https://www.google.com/search?q=QIAGEN+QIAamp%C2%AE+Viral+RNA+Mini+Handbook%2C+July+2020.&rlz=1C1GCEU_enET1014ET1014&oq=QIAGEN+QIAamp%C2%AE+Viral+RNA+Mini+Handbook%2C+July+2020.&aqs=chrome.69i57.1026j0j4&sourceid=chrome&ie=UTF-8 (accessed on 21 August 2023).
  20. Burrows-Wheeler Alignment (BWA)—Research Computing Documentation. Available online: https://wiki.rc.usf.edu/index.php/Burrows-Wheeler_Alignment_(BWA) (accessed on 4 September 2024).
  21. Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 2011, 17, 10–12. [Google Scholar] [CrossRef]
  22. Burrows-Wheeler Aligner. Available online: https://bio-bwa.sourceforge.net/ (accessed on 7 May 2024).
  23. O’Toole, Á.; Scher, E.; Underwood, A.; Jackson, B.; Hill, V.; McCrone, J.T.; Colquhoun, R.; Ruis, C.; Abu-Dahab, K.; Taylor, B.; et al. Assignment of epidemiological lineages in an emerging pandemic using the pangolin tool. Virus Evol. 2021, 7, veab064. [Google Scholar] [CrossRef]
  24. Aksamentov, I.; Roemer, C.; Hodcroft, E.B.; Neher, R.A. Nextclade: Clade assignment, mutation calling and quality control for viral genomes. J. Open Source Softw. 2021, 6, 3773. [Google Scholar] [CrossRef]
  25. GISAID Initiative. Available online: https://www.epicov.org/epi3/frontend#28d967 (accessed on 21 August 2023).
  26. NCBI Virus. Available online: https://www.ncbi.nlm.nih.gov/labs/virus/vssi/#/virus?SeqType_s=Nucleotide&VirusLineage_ss=Severe%20acute%20respiratory%20syndrome%20coronavirus%202,%20taxid:2697049&utm_source=nuccore&utm_medium=referral&Authors_idx%20q.op%3DAND=Hailu&SubmitterAffil_idx%20q.op%3DOR=Ethiopian%20Public%20Health%20Institute&SubmitterCountry_idx%20q.op%3DOR=Ethiopia (accessed on 18 September 2024).
  27. GitHub—Tseemann/Snp-Dists: Pairwise SNP Distance Matrix from a FASTA Sequence Alignment. Available online: https://github.com/tseemann/snp-dists (accessed on 26 October 2023).
  28. Katoh, K.; Standley, D.M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 2013, 30, 772–780. [Google Scholar] [CrossRef]
  29. Minh, B.Q.; Schmidt, H.A.; Chernomor, O.; Schrempf, D.; Woodhams, M.D.; von Haeseler, A.; Lanfear, R. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol. Biol. Evol. 2020, 37, 1530–1534. [Google Scholar] [CrossRef]
  30. Interactive Tree of Life (iTOL) v5: An Online Tool for Phylogenetic Tree Display and Annotation|Nucleic Acids Research|Oxford Academic. Available online: https://academic.oup.com/nar/article/49/W1/W293/6246398?login=true (accessed on 26 October 2023).
  31. Matson, M.J.; Yinda, C.K.; Seifert, S.N.; Bushmaker, T.; Fischer, R.J.; Doremalen, N.V.; Lloyd-Smith, J.O.; Munster, V.J. Effect of Environmental Conditions on SARS-CoV-2 Stability in Human Nasal Mucus and Sputum. Emerg. Infect. Dis. 2020, 26, 2276. [Google Scholar] [CrossRef] [PubMed]
  32. Sirakov, I.; Stankova, P.; Bakalov, D.; Mirani, Y.; Bardarska, L.; Paraskova, G.; Popov, I.; Alexandrova, A.; Dimitrov, G.; Mizgova, G.; et al. Retrospective Analysis of the Spread of SARS-CoV-2 in the Mediterranean Part of Bulgaria, During the First Wave of the Pandemic. J. Pure Appl. Microbiol. 2024, 18, 438–450. Available online: https://microbiologyjournal.org/retrospective-analysis-of-the-spread-of-sars-cov-2-in-the-mediterranean-part-of-bulgaria-during-the-first-wave-of-the-pandemic/ (accessed on 19 February 2025). [CrossRef]
  33. Iqbal, S.; Saki, M.; Sirakov, I.; Akrami, S. Pathogenesis of the SARS-Coronavirus-2 and Potential Therapeutic Strategies. J. Commun. Dis. 2022, 202–209. [Google Scholar] [CrossRef]
  34. Improving Response as Ethiopia Marks Two Years Since First COVID-19 Case|WHO|Regional Office for Africa. Available online: https://www.afro.who.int/countries/ethiopia/news/improving-response-ethiopia-marks-two-years-first-covid-19-case (accessed on 19 February 2025).
  35. Anteneh, A.; Getachew, A.; Kenera, M.; Salim, A.; Kedir, F.; Belayihun, B.; Felker-Kantor, E. COVID-19 vaccine hesitancy in Ethiopia: A latent class analysis. BMC Public Health 2024, 24, 2894. [Google Scholar] [CrossRef]
  36. Silwangani, B.; Munodawafa, D.; Chadambuka, A. COVID-19 Infections and Deaths between Rural and Urban Provinces of Zimbabwe, 2020: A Spatial Variations Analysis. Texila Int. J. Public Health 2023, 11, 1–13. [Google Scholar] [CrossRef]
  37. Sharifi, A.; Khavarian-Garmsir, A.R. The COVID-19 pandemic: Impacts on cities and major lessons for urban planning, design, and management. Sci. Total Environ. 2020, 749, 142391. [Google Scholar] [CrossRef]
  38. Elavia, N.; Sharma, N.; Li, S.; Wang, Y.; Milekic, B. An Atypical Presentation of Acute Pulmonary Embolism With Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) Pneumonia. Cureus 2020, 12, e8249. [Google Scholar] [CrossRef]
  39. Lan, F.-Y.; Filler, R.; Mathew, S.; Buley, J.; Iliaki, E.; Bruno-Murtha, L.A.; Osgood, R.; Christophi, C.A.; Fernandez-Montero, A.; Kales, S.N. COVID-19 symptoms predictive of healthcare workers’ SARS-CoV-2 PCR results. PLoS ONE 2020, 15, e0235460. [Google Scholar] [CrossRef] [PubMed]
  40. Tarris, G.; de Rougemont, A.; Estienney, M.-A.; Journet, J.; Lariotte, A.-C.; Aubignat, D.; Rebibou, J.-M.; De La Vega, M.F.; Legendre, M.; Belliot, G.; et al. Chronic kidney disease linked to SARS-CoV-2 infection: A case report. BMC Nephrol. 2021, 22, 278. [Google Scholar] [CrossRef] [PubMed]
  41. Kucirka, L.M.; Lauer, S.A.; Laeyendecker, O.; Boon, D.; Lessler, J. Variation in False-Negative Rate of Reverse Transcriptase Polymerase Chain Reaction–Based SARS-CoV-2 Tests by Time Since Exposure. Ann. Intern. Med. 2020, 173, 262–267. [Google Scholar] [CrossRef] [PubMed]
  42. De Maio, N.; Walker, C.R.; Turakhia, Y.; Lanfear, R.; Corbett-Detig, R.; Goldman, N. Mutation Rates and Selection on Synonymous Mutations in SARS-CoV-2. Genome Biol. Evol. 2021, 13, evab087. [Google Scholar] [CrossRef] [PubMed]
  43. Hadfield, J.; Megill, C.; Bell, S.M.; Huddleston, J.; Potter, B.; Callender, C.; Sagulenko, P.; Bedford, T.; Neher, R.A. Nextstrain: Real-time tracking of pathogen evolution. Bioinformatics 2018, 34, 4121–4123. [Google Scholar] [CrossRef] [PubMed]
  44. Saldivar-Espinoza, B.; Garcia-Segura, P.; Novau-Ferré, N.; Macip, G.; Martínez, R.; Puigbò, P.; Cereto-Massagué, A.; Pujadas, G.; Garcia-Vallve, S. The Mutational Landscape of SARS-CoV-2. Int. J. Mol. Sci. 2023, 24, 9072. [Google Scholar] [CrossRef] [PubMed]
  45. Harvey, W.T.; Carabelli, A.M.; Jackson, B.; Gupta, R.K.; Thomson, E.C.; Harrison, E.M.; Ludden, C.; Reeve, R.; Rambaut, A.; Peacock, S.J.; et al. SARS-CoV-2 variants, spike mutations and immune escape. Nat. Rev. Microbiol. 2021, 19, 409–424. [Google Scholar] [CrossRef]
  46. Yang, L.; Liu, S.; Liu, J.; Zhang, Z.; Wan, X.; Huang, B.; Chen, Y.; Zhang, Y. COVID-19: Immunopathogenesis and Immunotherapeutics. Sig Transduct. Target. Ther. 2020, 5, 128. [Google Scholar] [CrossRef]
  47. Gao, Z.; Xu, Y.; Sun, C.; Wang, X.; Guo, Y.; Qiu, S.; Ma, K. A systematic review of asymptomatic infections with COVID-19. J. Microbiol. Immunol. Infect. 2021, 54, 12–16. [Google Scholar] [CrossRef]
  48. Sahin, E.; Bozdayi, G.; Yigit, S.; Muftah, H.; Dizbay, M.; Tunccan, O.G.; Fidan, I.; Caglar, K. Genomic characterization of SARS-CoV-2 isolates from patients in Turkey reveals the presence of novel mutations in spike and nsp12 proteins. J. Med. Virol. 2021, 93, 6016–6026. [Google Scholar] [CrossRef]
  49. Roshdy, W.H.; Kandeil, A.; El-Shesheny, R.; Khalifa, M.K.; Al-Karmalawy, A.A.; Showky, S.; Naguib, A.; Elguindy, N.; Fahim, M.; Abu Elsood, H.; et al. Insight into Genetic Characteristics of Identified SARS-CoV-2 Variants in Egypt from March 2020 to May 2021. Pathogens 2022, 11, 834. [Google Scholar] [CrossRef] [PubMed]
  50. Gao, X.; Qin, B.; Chen, P.; Zhu, K.; Hou, P.; Wojdyla, J.A.; Wang, M.; Cui, S. Crystal structure of SARS-CoV-2 papain-like protease. Acta Pharm. Sin. B 2021, 11, 237–245. [Google Scholar] [CrossRef] [PubMed]
  51. Ismail, G.; Abdelghaffar, H.; Seadawy, M.G.; El-Hosseny, M.F.; Gad, A.F.; Ageez, A.; ElShafei, A.; Mohammed, S.S.; Ali, M.S.; El-Ashry, M. abd E.-R. Genome sequencing reveals existence of SARS-CoV-2 B.1.1.529 variant in Egypt. J. Genet. Eng. Biotechnol. 2022, 20, 70. [Google Scholar] [CrossRef] [PubMed]
  52. Ren, S.-Y.; Wang, W.-B.; Gao, R.-D.; Zhou, A.-M. Omicron variant (B.1.1.529) of SARS-CoV-2: Mutation, infectivity, transmission, and vaccine resistance. World J. Clin. Cases 2022, 10, 1–11. [Google Scholar] [CrossRef] [PubMed]
  53. Classification of Omicron (B.1.1.529): SARS-CoV-2 Variant of Concern. Available online: https://www.who.int/news/item/26-11-2021-classification-of-omicron-(b.1.1.529)-sars-cov-2-variant-of-concern (accessed on 23 April 2024).
  54. Omicron Variant (COVID-19/SARS-CoV-2) by BNO News. Available online: https://www.google.com/maps/d/viewer?mid=1lKX8ikHpKAkWCvuXn2g-CVW3OmHbN-k- (accessed on 23 April 2024).
  55. Zaman, K.; Shete, A.M.; Mishra, S.K.; Kumar, A.; Reddy, M.M.; Sahay, R.R.; Yadav, S.; Majumdar, T.; Pandey, A.K.; Dwivedi, G.R.; et al. Omicron BA.2 lineage predominance in severe acute respiratory syndrome coronavirus 2-positive cases during the third wave in North India. Front. Med. 2022, 9, 955930. [Google Scholar] [CrossRef] [PubMed]
  56. Li, Y.-T.; Polotan, F.G.M.; Sotelo, G.I.S.; Dolor, A.Y.M.; Tujan, M.A.A.; Gomez, M.R.R.; Onza, J.T.; Chang, A.K.T.; Bautista, C.T.; Carandang, J.C.; et al. Lineage BA.2 dominated the Omicron SARS-CoV-2 epidemic wave in the Philippines. Virus Evol. 2022, 8, veac078. [Google Scholar] [CrossRef]
  57. CDC Coronavirus Disease 2019 (COVID-19). Available online: https://www.cdc.gov/coronavirus/2019-ncov/variants/variant-classifications.html (accessed on 14 December 2023).
  58. Rauseo, A.M.; O’Halloran, J.A. What Are the Clinical Implications of the SARS-CoV-2 Variants. JACC Basic. Transl. Sci. 2021, 6, 305–308. [Google Scholar] [CrossRef] [PubMed]
  59. Alemayehu, D.H.; Adnew, B.; Alemu, F.; Tefera, D.A.; Seyoum, T.; Beyene, G.T.; Gelanew, T.; Negash, A.A.; Abebe, M.; Mihret, A.; et al. Whole-Genome Sequences of SARS-CoV-2 Isolates from Ethiopian Patients. Microbiol. Resour. Announc. 2021, 10, e0072121. [Google Scholar] [CrossRef]
  60. Markov, P.V.; Ghafari, M.; Beer, M.; Lythgoe, K.; Simmonds, P.; Stilianakis, N.I.; Katzourakis, A. The evolution of SARS-CoV-2. Nat. Rev. Microbiol. 2023, 21, 361–379. [Google Scholar] [CrossRef]
  61. Rausch, J.W.; Capoferri, A.A.; Katusiime, M.G.; Patro, S.C.; Kearney, M.F. Low genetic diversity may be an Achilles heel of SARS-CoV-2. Proc. Natl. Acad. Sci. USA 2020, 117, 24614–24616. [Google Scholar] [CrossRef] [PubMed]
Figure 1. A geographic map of Ethiopia showing the regions and districts where SARS-CoV-2 specimens were collected.
Figure 1. A geographic map of Ethiopia showing the regions and districts where SARS-CoV-2 specimens were collected.
Genes 16 00351 g001
Figure 2. Number of single nucleotide mutations per clades of SARS-CoV-2 during the fifth wave of the pandemic in Ethiopia, 2022.
Figure 2. Number of single nucleotide mutations per clades of SARS-CoV-2 during the fifth wave of the pandemic in Ethiopia, 2022.
Genes 16 00351 g002
Figure 3. Example of nucleotide mutations in different gene positions of SARS-CoV-2 during the fifth wave of the pandemic in Ethiopia, 2022.
Figure 3. Example of nucleotide mutations in different gene positions of SARS-CoV-2 during the fifth wave of the pandemic in Ethiopia, 2022.
Genes 16 00351 g003
Figure 4. SARS-CoV-2 vaccination status and infection with different sub-lineages of SARS-CoV-2 identified during the fifth wave of the pandemic in Ethiopia.
Figure 4. SARS-CoV-2 vaccination status and infection with different sub-lineages of SARS-CoV-2 identified during the fifth wave of the pandemic in Ethiopia.
Genes 16 00351 g004
Figure 5. Analysis of the phylogenetic relatedness of SARS-CoV-2 identified in Ethiopia during the fifth wave of the pandemic, 2022, with selected global sequences.
Figure 5. Analysis of the phylogenetic relatedness of SARS-CoV-2 identified in Ethiopia during the fifth wave of the pandemic, 2022, with selected global sequences.
Genes 16 00351 g005
Table 1. The WIV04 genome, along with 118 SARS-CoV-2 genome sequences from 62 countries, were obtained from the GISAID.
Table 1. The WIV04 genome, along with 118 SARS-CoV-2 genome sequences from 62 countries, were obtained from the GISAID.
ClassificationsVariantsNameFirst Reported Country
VOCalphaGRY (B.1.1.7 + Q.*)United Kingdom
VOCBetaGH/501Y.V2 (B.1.351 + B.1.351.2 + B.1.351 + 3)South Africa
VOCGammaGR/501Y.V3 (P.1 + P.1*)Brazil and Japan
VOCOmicronGRA (B.1.1.529 + BA.*)Botswana, South Africa, and Hong Kong
Variant of interest (VOI)LambdaGR/452Q.V1 (C.37 + C.37.1)Peru
VOIMuGH (B.1.621 + B.1.621.1)Colombia
VOI GRA (XBB.1.5 + XBB.1.5.*)Australia, India, and Bangladesh
VOC GRA (XBB.1.16 + XBB.1.1.16.*)India
Variant under monitoring (VUM) GRA (XBB + XBB.*, excluding XBB.1.5, XBB.1.16, XBB.1.9.1, XBB.1.9.2, XBB.2.3)India
VUM GRA (XBB.1.9.1 + XBB.19.1.*)Indonesia, Singapore, and Israel
VUM GH/490R (B.1.640 + B.1.640.*)Congo and France. Except for the Gamma VOC
Note: These were sequences that we collected between 1 January and 31 July 2022 before our samples were collected between June and August 2022. There was a total of 30,150 positions in the final dataset.
Table 2. Socio-demographic characteristics of study participants.
Table 2. Socio-demographic characteristics of study participants.
CharacteristicsNumber (n)Percentage (%)
SexFemale1930
Male4470
Age Group≤20813
21–302032
31–401321
41–501016
51–6046
>60813
Testing SitesOromia1524
Amhara35
Addis Ababa4470
SNNPR11
Signs and SymptomsNo1930
Yes4470
Co-morbidity No3657
Yes2743
Travel HistoryNo5587
Yes813
Vaccination StatusNot vaccinated3860
Vaccinated2540
Reason for Testing Contact with cases1016
Follow-up23
Suspect4368
Travel purpose813
Table 3. Frequency distribution of SARS-CoV-2 variants with demographic and clinical information of patients during the fifth wave of the pandemic in Ethiopia, 2022.
Table 3. Frequency distribution of SARS-CoV-2 variants with demographic and clinical information of patients during the fifth wave of the pandemic in Ethiopia, 2022.
Variants of SARS-CoV-2 Virus
DeltaOmicron
SexFemaleNumber118
%5.394.7
MaleNumber143
%2.397.7
VaccinationNoNumber236
%5.394.7
YesNumber025
%0.0100.0
Signs and SymptomsNoNumber019
%0.0100.0
YesNumber242
%4.595.5
Travel HistoryNoNumber253
%3.696.4
YesNumber08
%0.0100.0
Ethiopian NationalityNoNumber01
%0.0100.0
YesNumber260
%3.296.8
Age Group≤20Number08
%0.0100.0
21–30Number119
%5.095.0
31–40Number013
%0.0100.0
41–50Number19
%10.090.0
51–60Number04
%0.0100.0
>60Number08
%0.0100.0
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Hailu, G.; Legesse, M.; Mulu, A.; Medhin, G.; Tsegaye, M.M.; Alemayehu, D.H.; Ayele, A.; Gebreegziabxier, A.; Tayachew, A.; Aguine, A.; et al. SARS-CoV-2 Genetic Variants Identified in Selected Regions of Ethiopia Through Whole Genome Sequencing: Insights from the Fifth Wave of COVID-19. Genes 2025, 16, 351. https://doi.org/10.3390/genes16030351

AMA Style

Hailu G, Legesse M, Mulu A, Medhin G, Tsegaye MM, Alemayehu DH, Ayele A, Gebreegziabxier A, Tayachew A, Aguine A, et al. SARS-CoV-2 Genetic Variants Identified in Selected Regions of Ethiopia Through Whole Genome Sequencing: Insights from the Fifth Wave of COVID-19. Genes. 2025; 16(3):351. https://doi.org/10.3390/genes16030351

Chicago/Turabian Style

Hailu, Getnet, Mengistu Legesse, Andargachew Mulu, Girmay Medhin, Mesfin Mengesha Tsegaye, Dawit Hailu Alemayehu, Abaysew Ayele, Atsbeha Gebreegziabxier, Adamu Tayachew, Adimkewu Aguine, and et al. 2025. "SARS-CoV-2 Genetic Variants Identified in Selected Regions of Ethiopia Through Whole Genome Sequencing: Insights from the Fifth Wave of COVID-19" Genes 16, no. 3: 351. https://doi.org/10.3390/genes16030351

APA Style

Hailu, G., Legesse, M., Mulu, A., Medhin, G., Tsegaye, M. M., Alemayehu, D. H., Ayele, A., Gebreegziabxier, A., Tayachew, A., Aguine, A., Dejene, H., Tessema, S. K., Onywera, H., Stanislas, A. E., Abate, E., Marcello, A., & Bitew, M. (2025). SARS-CoV-2 Genetic Variants Identified in Selected Regions of Ethiopia Through Whole Genome Sequencing: Insights from the Fifth Wave of COVID-19. Genes, 16(3), 351. https://doi.org/10.3390/genes16030351

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop