Next Article in Journal
A Single-Tube Two-Step MIRA-CRISPR/Cas12b Assay for the Rapid Detection of Mpox Virus
Previous Article in Journal
HIV, Inflammation, and Immunometabolism: A Model of the Inflammatory Theory of Disease
Previous Article in Special Issue
Behavioral Dynamics, Genomic Insights, and Social Drivers of SARS-CoV-2 Waves and Variants in Cali, Colombia (2020–2023)
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Identification of Clinical and Genomic Features Associated with SARS-CoV-2 Reinfections

by
Francisco Muñoz-López
1,
Antoni E. Bordoy
2,3,
Francesc Català-Moll
1,
Verónica Saludes
2,4,
David Panisello Yagüe
2,3,
Mariona Parera
1,
Ignacio Blanco
2,5,
Pere-Joan Cardona
2,6,7,
Cristina Casañ
2,
Ana Blanco-Suárez
2,
Sandra Franco
1,
Álvaro F. García-Jiménez
8,
Roger Paredes
1,9,10,11,
Bonaventura Clotet
1,9,10,12,
Lourdes Mateu
9,10,12,
Marc Noguera-Julian
1,
Elisa Martró
2,3,4,
José Ramón Santos
9,10,* and
Marta Massanella
1,11,*
1
IrsiCaixa, Badalona, 08916 Barcelona, Spain
2
Northern Metropolitan Clinical Laboratory, Microbiology Department, Hospital Universitari Germans Trias i Pujol, 08916 Badalona, Spain
3
Germans Trias i Pujol Health Sciences Research Institute Foundation (IGTP), 08916 Badalona, Spain
4
Epidemiology and Public Health Networking Biomedical Research Center (CIBERESP), Health Institute Carlos III, 28029 Madrid, Spain
5
Clinical Genetics Department, Northern Metropolitan Clinical Laboratory, Hospital Universitari Germans Trias i Pujol, 08916 Badalona, Spain
6
Genetics and Microbiology Department, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, 08193 Catalonia, Spain
7
Respiratory Diseases Networking Biomedical Research Center (CIBERES), Health Institute Carlos III, 28029 Madrid, Spain
8
Department of Immunology and Oncology, National Centre for Biotechnology, Spanish National Research Council (CNB-CSIC), 28049 Madrid, Spain
9
Infectious Diseases Department, Hospital Universitari Germans Trias i Pujol, 08916 Badalona, Spain
10
Fight Against Infections Foundation, 08916 Badalona, Spain
11
Infectious Diseases Networking Biomedical Research Center (CIBERINFEC), 28029 Madrid, Spain
12
Chair in Infectious Diseases and Immunity, Centre for Health and Social Care Research (CESS), Faculty of Medicine, University of Vic-Central University of Catalonia (UVic-UCC), 08500 Vic, Spain
*
Authors to whom correspondence should be addressed.
Viruses 2025, 17(6), 840; https://doi.org/10.3390/v17060840
Submission received: 11 May 2025 / Revised: 5 June 2025 / Accepted: 6 June 2025 / Published: 11 June 2025
(This article belongs to the Special Issue Emerging Variants of SARS-CoV-2)

Abstract

Although SARS-CoV-2 reinfections remain a concern for healthcare systems worldwide, the factors driving them are still not fully understood. In this study, we examined data for 3303 individuals who experienced two SARS-CoV-2 infections between March 2020 and May 2022 from both clinical and viral genomics perspectives. Our findings indicate that viral evolution was the primary driver of reinfection. However, chronic conditions were common among reinfected individuals, including those under 26 years old, suggesting that the presence of underlying and/or chronic conditions increases susceptibility to reinfection. The median time elapsed between infections was one year, often coinciding with the emergence of new variants. While vaccination showed only a limited protective effect against reinfection, it drastically decreased the hospitalization rate, underscoring its role in mitigating disease severity. Our findings point to the need for more flexible vaccination strategies, especially for individuals with chronic conditions. Understanding the interactions between host factors and viral evolution is critical to strengthening prevention strategies and reducing the burden of reinfections and their possible long-term complications.

1. Introduction

As the SARS-CoV-2 pandemic evolves, reinfections are becoming increasingly common. Understanding the clinical characteristics and risk factors associated with reinfection, along with viral evolution, may be useful for guiding public health strategies and shaping vaccination efforts.
Early reports of reinfections highlighted the genetic differences between viral strains and distinct viral lineages approximately three months after the initial infection [1,2]. Since then, SARS-CoV-2 reinfections have received increasing scientific attention due to their potential implications for disease severity, vaccine effectiveness, and long-term health outcomes. While COVID-19 vaccines were primarily designed to prevent severe disease rather than infection, they have shown short-term protection against infection, particularly in the months following vaccination [3]. However, this protection decreases over time [4,5], especially with the emergence of new variants. Booster doses have been shown to enhance and extend immunity, reducing the risk of reinfection [6]. Studying reinfections is crucial for understanding their clinical impact, the evolution of SARS-CoV-2, and the potential long-term consequences, including the risk that patients will develop long COVID [7].
This study aims to provide a comprehensive overview of SARS-CoV-2 reinfections, focusing on the clinical characteristics of affected individuals and the role of viral evolution. For that purpose, we retrospectively selected participants with confirmed SARS-CoV-2 reinfection and then documented their clinical timeline, demographics, health status, vaccination history, and reinfection patterns. Additionally, we performed viral sequencing on a portion of viral samples from both primary infection and reinfection to assess the genetic variability of the virus and its potential impact on reinfection dynamics.

2. Materials and Methods

2.1. Study Design and Participants

We conducted a retrospective cohort study of reinfections using clinical records and SARS-CoV-2 genomic sequencing data for samples (RECOVID study) obtained from the Clinical Laboratory of the Northern Metropolitan Area (Laboratori Clínic de la Metropolitana Nord, LCMN), a referral laboratory located at the Hospital Universitari Germans Trias i Pujol (HUGTiP). This laboratory serves the northern area of Barcelona Province in Catalonia, Spain, and has systematically stored nasopharyngeal or nasal swabs from SARS-CoV-2 tests throughout the pandemic. All samples underwent routine nucleic acid amplification testing (NAAT), including reverse transcription real-time PCR, RT-PCR (Allplex 2019-nCoV assay [RdRp, E, and N genes, Seegene, Seoul, South Korea], Alinity [RdRp and N genes, Abbott, Chicago, IL, USA], Simplexa [ORF1ab and/or S genes, DiaSorin, Saluggia, Italy], and GeneXpert [RdRp, E, and N genes, Cepheid, Sunnyvale, CA, USA]) using CFX96 instruments (Bio-Rad Laboratories, Hercules, CA, USA) or transcription-mediated amplification (TMA) using the Panther System (ORF1ab gene, Hologic or Procleix, Grífols S.A., Sant Cugat del Vallès, Spain) to detect the presence of SARS-CoV-2 infection.
On our request, the HUGTiP’s Information Systems Department (Direcció d’Organització de Sistemes d’Informació, DOSI) identified SARS-CoV-2 reinfections in an anonymized database they manage called CORE-COVID and provided us with the resulting data subset, which we labeled the RECOVID cohort. This cohort consisted of a total of 3303 individuals reinfected with SARS-CoV-2 between March 2020 and May 2022, with at least two positive PCR tests ≥90 days apart. Demographic, clinical, virological, and immunological data from the database were extracted following the European Union’s General Data Protection Regulation (GDPR) to ensure participant anonymity; therefore, no informed consent was required. This study was approved by the HUGTiP’s Institutional Review Board (PI-21-235). All procedures adhered to good clinical practices and complied with GDPR 2016/679 [8] to ensure participant privacy and confidentiality.
Variables recorded included demographic and clinical data (sex, age, comorbidities, adjusted morbidity groups) and SARS-CoV-2-related information (nucleic acid amplification tests [RT-PCR or TMA], rapid antigen tests, hospitalization, symptomatology, serology, vaccination [dates, manufacturer, and number of doses]).
For comparative purposes, health data for the general population of Catalonia (one of Spain’s constituent Autonomous Regions) was gathered from the Modules for the Monitoring of Quality Indicators (MSIQ) platform of the Catalan Health Institute (CatSalut).

2.2. Clinical Timeline Encoding

To facilitate analysis, the clinical timeline of each participant was encoded into a structured string format. Each string began with the date of the first recorded event, followed by a sequence of letters and numbers. The letters represented specific events such as SARS-CoV-2 infection, vaccination, and type of vaccine, and NAAT and antigen tests (among other things), while the numbers indicated the number of days between consecutive events. A detailed description of these timeline strings is shown in Supplementary Figure S1. This encoding was carried out using Python 3.11.

2.3. Definitions and Categorization

A summary of the key definitions used in the study is provided in Table 1 to ensure clarity about the criteria used for important variables including reinfection, extended infection, breakthrough infection, partial breakthrough infection, booster doses and Adjusted Morbidity Groups (AMG, a measure of general health status developed by the Catalan Health Institute [9]). In order to study the association between vaccination and reinfection, participants were categorized into four distinct groups based on the timing of their infections (I = infection) and vaccinations (V = first vaccine, in any possible case {Ad26.COV2.S dose, BNT162b2/mRNA-1273/AZD1222 first dose or first + second doses}; B = booster dose), as follows: (1) VII[B], consisting of individuals vaccinated before their first infection without receiving a booster; (2) IVI[B], individuals vaccinated between infections but without a booster; (3) IVBI, individuals vaccinated and boosted between infections; and (4) VIBI, individuals vaccinated before the first infection and subsequently boosted before the second infection. Except for one individual, all participants received a single booster dose.

2.4. SARS-CoV-2 Whole-Genome Sequencing

To address the role of viral genomic variability in the reinfection process, we selected a subset of 130 pairs of nasopharyngeal swab samples (first episode from August 2020 to September 2021 and second episode from February 2021 to March 2022). All SARS-CoV-2-positive selected samples initially processed by TMA were subjected to real-time RT-PCR to determine the cycle threshold (Ct) value. Only those samples with a Ct  < 30 for the N gene target were selected for whole-genome sequencing (WGS) to maximize the sequencing success rate. Genomic reverse transcription, tiling PCR amplification, and sequencing were performed as previously described [11]. The ARTIC network v3 amplicon panel (Integrated DNA Technologies, Coralville, IA, USA) was used for amplification, the Illumina DNA Prep kit for library preparation (Illumina, San Diego, CA, USA), and the Illumina MiSeq platform for sequencing.

2.5. Bioinformatic and Phylogenetic Analysis

The sequences were analyzed with the nf-core/viralrecon pipeline [12,13], which executes all the steps, from quality control and alignment of raw sequences to SARS-CoV-2 lineage assignment using Nextclade [14] and Pangolin [15]. We discarded 9 sequences due to low quality (<90% coverage), and after some tagging errors, 74 pairs of sequences (one from each episode of the same participant) and 52 unpaired sequences remained for further analysis. All FASTA sequences with good quality were uploaded to the GISAID database [16] (https://www.gisaid.org/; accession ID: EPI_SET_250311pz; accessed on 11 March 2025).
We extracted the spike gene of the paired sequences and used MEGA software version 12 [17] to select a suitable distance model to build a phylogenetic tree. The model with the lowest BIC was T92+G+I. Then, we built the tree in the IQ-TREE [18] web application. The tree was rooted in the reference SARS-CoV-2 sequence (Wuhan-Hu-1/MN908947.3 [19]). The representations of the tree were created using FigTree (http://tree.bio.ed.ac.uk/software/figtree/; version 1.4.4, accessed on 11 March 2025). We focused on the spike gene because it harbors the majority of mutations responsible for immune escape and is the primary target of neutralizing antibodies and vaccines. Analyzing this region provides critical insights into reinfection risk driven by antigenic evolution.
Finally, we devised a genetic distance model to study the importance of several variables in the genetic distance between pairs of sequences. For that purpose, we calculated all the pairwise distances between all whole genome sequences, both paired and unpaired. In this case, the most suitable distance model was GTR+G. The regression models between these distances and the variables “time between infections”, “same individual”, and “type” were fitted using the lm() function in R. Two models were fitted, whose calls were lm(dist~time + same_ind) and lm(dist~time + same_ind + type) for models 1 and 2, respectively. The variables were the pairwise distance between sequences (dist), time in days between infections (time), whether the sequences belonged to the same individual (same_ind; 0–no; 1–yes), and the combination of variants for each pairwise comparison of sequences (type). Model 1 was fitted with all the pairwise distances. Model 2 also added the variable lineage, and sequences with a low-represented lineage, which were those other than the B.1.177 lineage, and Alpha, Delta, and Omicron variants of concern (VOC), were removed to avoid loss of statistical power.

2.6. Statistical Analyses

Epidemiological data were described using medians and interquartile ranges (IQR) for continuous variables and proportions for categorical variables. To compare the prevalence of comorbidities and GMA between the RECOVID cohort and the overall Catalan population, we employed the Chi-square test, assuming a total Catalan population of 7,896,306 individuals (December 2022). The descriptive statistical analysis was performed with base functions of R 4.3 [20]. The graphs were generated using Python 3.12 [21] (matplotlib [22], seaborn [23], plotly [24]) and R 4.4 (ggpubr [25], ggalluvial [26], ggridges [27]).

3. Results

3.1. Participant Characteristics

Of the 3303 participants with at least two positive SARS-CoV-2 test results, we selected 2344 participants with a confirmed reinfection. Participants who did not meet these criteria were excluded from further analysis and classified as either extended infections (n = 21) or uncertain reinfections (n = 938) (Figure 1). Among the 2344 participants selected, only one individual suffered three SARS-CoV-2 infections during the study period, the remainder experiencing only two.
The temporal distribution of all infections revealed distinct peaks corresponding to the infection waves that took place in Catalonia. The first increase in infections occurred in March 2020, followed by subsequent peaks in October 2020, November 2021, and May 2022 (Figure 2a). These peaks represent periods of heightened infection rates, likely associated with the emergence and spread of new variants or the relaxation of public health measures. The median time between the first and second episodes was 364 days, IQR (200–464) (Figure 2b).
The cohort selected consisted of 72.2% females, with a median age of 45 years IQR (28–63) (Figure 2c). The age distribution within the study population revealed different patterns for each sex. For females, three noticeable peaks were observed: the first around age 25, the second around age 50, and the third gradual rise beginning at age 75. In contrast, the distribution among males was flatter, with overall lower density across most age groups and no clearly defined peaks.
Of the cohort, only 421 participants (17.9%) were fully or partially vaccinated before the first infection, while 1 882 participants (80.3%) were vaccinated before their second infection. Most participants (n = 2241, 95.6%) were not hospitalized for either of their SARS-CoV-2 infections. Of the 103 participants (4.4%) who were hospitalized due to their infection, 86 individuals (83.5%) required hospitalization during their first infection, while 11 participants were hospitalized during their second infection, and 6 were hospitalized during both infections. Among them, 18 required ICU admission during their first infection and 1 during their second. A full description of the cohort is shown in Table 2.
We next assessed the health status of the participants included. Most reinfected individuals had at least one chronic condition (n = 2028, 86.5%), regardless of age, suggesting an overall compromised health status in reinfected individuals (Figure 2d). Notably, a high prevalence of chronic conditions was also observed in younger individuals: 70.9% among those aged 0–17 years and 77.3% in the 18–25 age groups, unexpectedly high rates for these age ranges. Overall, 42.9% of the cohort presented a chronic condition affecting four or more systems, compared to 24.5% of the general population of Catalonia (MSIQ, CatSalut) (Table 3). Some comorbidities were underrepresented in our cohort, such as chronic kidney failure (0.2-fold change, p = 0.0001), malignant neoplasm (0.41-fold change, p = 0.0001), arthritis (0.14-fold change, p = 0.0001), and Chronic Obstructive Pulmonary Disease (0.75-fold change, p = 0.0059). Other comorbidities were overrepresented in the RECOVID cohort, including cirrhosis (3.1-fold change, p = 0.0001), type I and II diabetes (1.4-fold change, p = 0.0001), ischemic heart disease (1.3-fold change, p = 0.0097), and high blood pressure (1.2-fold change, p = 0.0001). Additionally, 0.4% of the reinfected participants experienced an organ transplant, while only about 0.01% of the Catalan population receives transplants each year (23.5-fold change, p = 0.0001).

3.2. Impact of Vaccination on SARS-CoV-2 Reinfections

Of all participants, 19.8% experienced SARS-CoV-2 reinfection, having received no vaccination before either the initial or second infection. Among the remaining participants, 80.3% had received at least one dose of the vaccine before their second infection. As illustrated in Figure 3a, we categorized the participants based on their vaccination and reinfection timelines: 4.6% of individuals were vaccinated before their first infection and were reinfected without having received a booster (VII[B] group); 34.9% were vaccinated between infections without a booster (IVI[B] group); 27.2% were both vaccinated and boosted between infections (IVBI group); and 13.5% were vaccinated before their first infection and later boosted between infections (VIBI group). Importantly, approximately half of the participants in the groups vaccinated between infections (IVI[B] and VIBI), with or without a booster, experienced partial breakthrough reinfections. This suggests that the vaccine had not yet reached full efficacy at the time of the second exposure.
The density of days between vaccination and second infection is shown in Figure 3b. We observed a high incidence of reinfections within the first 40 days post-vaccination, followed by a sharp decline between 40 and 100 days, and then a second peak. This trend was consistent regardless of the order of vaccination and initial infection.
Regarding hospitalization in the context of vaccination, 80 of the 86 participants hospitalized during their first infection were not vaccinated (93%), as they were infected during the early waves before vaccines were available. Only one individual requiring ICU admission was vaccinated prior to infection but tested positive for SARS-CoV-2 just 5 days post-vaccination. However, a few individuals were hospitalized during their second infection (n = 11, 0.47%) or both infections (n = 6, 0.26%), with approximately half of these individuals (n = 6, 54.5% and n = 3, 50%, respectively) having been vaccinated before the time of reinfection. Only one individual required ICU admission during the second infection and had received only a single dose of the BNT162b2 vaccine.

3.3. Impact of SARS-CoV-2 Variants on Reinfection Patterns

As WGS was only performed for a subset of cases, we first inferred the SARS-CoV-2 variant infecting each participant (n = 2344) based on the dominant circulating variant in Catalonia at the time of their infection, using GISAID data (www.gisaid.org, Supplementary Figure S2A, accessed on 25 July 2022). The most common reinfection pattern observed was an initial infection with a pre-Alpha lineage followed by an Omicron infection, accounting for nearly 40% of the cases (Supplementary Figure S2B). The next most frequent combination involved pre-Alpha followed by Delta VOC (17%). Cases where both infections involved two pre-Alpha lineages occurred in just over 10% of individuals. In contrast, reinfection events where the first and second infections involved the same VOC—specifically Alpha–Alpha, Delta–Delta, or Omicron–Omicron combinations—were exceptionally uncommon, with frequencies approaching zero.
To perform a more precise genetic analysis of SARS-CoV-2 variants, we analyzed 74 pairs of viral sequences and 52 unpaired sequences using various methods. In the first infection, the most common lineages were pre-Alpha B.1.177 (35.2%) and the Delta variant AY.43 (29.6%). In the second infection, Omicron lineages predominated, with BA.1.1 (20.6%), BA.1.17 (23.5%), and BA.1.1.1 (10.8%) being the most abundant, and some Delta lineages, particularly AY.43 (12.7%). The proportions of all variants found are detailed in Table 4. When the variant pairs were examined, the most common combinations of variants observed between the two infections were Delta–Omicron, followed by B.1.177–Omicron, B.1.177–Delta, and Alpha–Omicron (Figure 4a). Phylogenetic analysis of the 74 sequence pairs revealed substantial genetic divergence between the spike genes of first and second infection variants, particularly between Omicron and earlier strains like pre-Alpha and Delta (Figure 4b). This supports the finding that reinfections typically involve shifts to newer variants, with Omicron emerging as the dominant strain in second infections.
To examine the genetic distances between the sequences, we applied two linear models (Table 5). The first model aimed to analyze the genetic distance between viral sequences from different infections, factoring in two variables: the time between infections and whether the sequences came from the same individual or different individuals (distance~time + same_individual). This model revealed a positive relation between the genetic distance between sequences and the time between infections (estimate = 3.514 × 10−6; p < 2 × 10−16), meaning greater genetic divergence with longer intervals between infections. The model also suggested that viruses infecting the same individual were genetically less related than those infecting different individuals (estimate = 3.243 × 10−4; p = 0.00198). This result could be influenced by data imbalance, as only 148 pairwise distances were from the same individual, compared to 40,253 from different individuals. The adjusted R-squared of 0.254 indicates that the model explains 25.4% of the genetic distance variation, highlighting the importance of time between infections and individual-specific factors.
The second model (distance~time + same_individual + type) included SARS-CoV-2 lineage/VOC as a variable, focusing on B.1.177, Alpha, Delta, and Omicron. The time variable remained significant, with a positive correlation, while the “same individual” variable lost significance. Regarding variants, all combinations of them, except B.1.177-Delta, showed p-values < 2 × 10−16. The estimates were positive for different VOC pairs (e.g., Delta-Omicron) and negative for the same VOC pairs (e.g., Delta–Delta). The adjusted R-squared of this model was 0.9537, showing that the primary factor driving genetic distance is the evolution of SARS-CoV-2 over time.

4. Discussion

Our study provides new insights into the factors contributing to SARS-CoV-2 reinfection, with a particular emphasis on the role of the host’s health status. We found that a significant proportion of participants had chronic conditions, suggesting a potential link between compromised health and increased susceptibility to reinfection—an aspect not commonly explored in previous research. While the genetic variability of the virus remained the primary driver of reinfection [28,29], our findings also underscore the impact of underlying health conditions as an important factor. Additionally, waning immunity seemed to play a significant role, as the median time between infections was one year [30]. Regarding vaccination, while it was crucial in reducing disease severity, its effectiveness in preventing reinfections was limited. We observed a decrease in reinfection incidence between 40 and 100 days post-vaccination, after which the incidence increased, reflecting a diminishing protective effect.
Although SARS-CoV-2 reinfections are a major problem, the scientific literature assessing this topic remains limited. Some established risk factors for SARS-CoV-2 reinfection include waning immunity, viral evolution, and variations in immune responses among individuals. Moreover, studies have demonstrated that reinfections could be associated with specific variants, such as the Delta and Omicron VOCs [31,32,33], which exhibit key mutations in the spike protein. Other risk factors described for reinfection are being unvaccinated, infrequent face mask use, time since first infection, being non-Hispanic black, a severe first infection, or being over 60 years old [34,35]. Our results suggested that the median time between infections was 1 year, reflecting both loss of immunity over time and the emergence of new SARS-CoV-2 variants [36].
Our temporal data illustrates the distribution of reinfections at three key time points: 200, 400, and 600 days between infections, with a median of 364 days. These distributions were directly associated with the emergence of significant new SARS-CoV-2 VOCs, such as Delta, appearing seven months after the summer of 2020, and the Omicron VOC in January 2022 [37]. Therefore, our data align with the high rates of reinfection involving new variants that other researchers have previously documented [38]. The genetic distance models also hint at the impact of the genetic variability of the virus in the reinfection phenomenon.
Our cohort consisted primarily of women, with the age density curve showing three maxima over different age groups being much more prominent in women. The first two peaks correspond to ages 24 and 45. A feasible explanation for this is the nature of the cohort: all the samples analyzed here were taken from hospitalized or in-and-out patients and healthcare workers, which suggests that the higher number of women in our sample is in part attributable to a significant proportion of them being hospital staff and the prominent caregiving role of women in society [39]. In addition, it has been demonstrated that women tend to prioritize their health more than men and are therefore more likely to schedule medical appointments [40]. The third peak, around 85 years, is arguably filled with women living in long-term care facilities, who generally have a longer life expectancy than men [41]. At this point, no sex/gender-related susceptibility to reinfection has been reported, so we assume the sex bias observed in our cohort is due to the sample’s composition rather than biological factors.
Our AMG data revealed that the RECOVID cohort generally had poor health, with more than 86% of participants having at least one chronic condition. However, the median age of the cohort was 45 years, a distribution similar to that of the overall Catalan population (mean of 43.3 years, data obtained from the Statistical Institute of Catalonia, Idescat, 2022). Thus, age does not appear to be the main factor driving the poor health observed in our cohort. Notably, over 75% of younger individuals (<26 years old) also had one or more chronic conditions. This suggests that the probability of reinfection increases with chronic comorbidities [42]. The increased reinfection rate among individuals with chronic conditions may be attributed to several factors. First, chronic diseases can impair immune function, reducing the ability to develop an effective response to SARS-CoV-2. Second, medications commonly used to manage chronic conditions (i.e., corticosteroids or immunosuppressants) may further weaken immune functions. In addition, it is also possible that individuals with chronic conditions have more frequent contact with healthcare services, increasing their exposure risk and likelihood of testing. While we did not assess viral load in this study, we acknowledge that higher viral loads in immunocompromised individuals could enhance RT-PCR detection sensitivity. Further studies exploring the interplay between viral kinetics, host immunity, and comorbidity burden are warranted.
Although the main goal of vaccination has been to prevent severe illness and hospitalization due to SARS-CoV-2 infection [43], its impact on reinfection has not been widely studied. Previous research has shown a 27–35% reduced risk of reinfection after vaccination [44], although this protective effect is limited. Similarly to this, and in line with the fact that the humoral immune response declines after 3 months of vaccination [45], our data showed a protective effect of the vaccination that lasted between 40 and 100 days after administration, but beyond 100 days, reinfection rates increased. As expected, we observed a higher number of hospitalizations during the first infection, with 93% of hospitalizations occurring among unvaccinated individuals. In contrast, the number of individuals requiring hospitalization during their reinfection was 0.68%. This is attributable to the success of the vaccination campaign in Catalonia, which managed to fully vaccinate 85% of the population by May 2022. Although 15% of the general population remained unvaccinated, 50% of study participants hospitalized during their reinfection were unvaccinated, highlighting the effectiveness of current vaccination regimens in preventing hospitalization [3]. Globally, women have generally experienced milder COVID-19 and lower hospitalization rates compared to men [39], which may have also contributed to the relatively low proportion of hospitalizations observed in our predominantly female cohort.
Our viral genomics results and genetic distance analysis suggest that the main driver for SARS-CoV-2 reinfection is the emergence of new variants over time. However, this could be biased by the definition of reinfection, which requires a gap of more than 90 days between the two infections. Nevertheless, the dominance period of each predominant variant exceeded 90 days (pre-Alpha period: 331 days, Alpha: 140 days, Delta: 175 days, and Omicron: 162 days, until the end of the study), allowing two consecutive infections with the same variant. This finding highlights the global need to implement immunological surveillance in the face of possible genetic modifications of the virus that could evade the immune response generated by existing vaccines, especially in sensitive individuals such as those with comorbidities [46].
This study presents some limitations, primarily due to its retrospective design and lack of control groups of participants with a single infection. The absence of this control group limited our ability to apply certain statistical analyses, such as hypothesis testing or odds ratios. Moreover, as the data was completely anonymized, we were unable to access certain relevant information, such as the characteristics of symptoms during the acute infection period or whether participants developed long COVID. The phylogenetic analysis was limited to 74 reinfection pairs with high-quality sequences, which constrains the statistical power and generalizability of the findings. Furthermore, we were unable to retrieve additional sequences due to many of the samples having a high Ct value, making it impossible to sequence them effectively, which would have strengthened the genomic analysis. Finally, we lacked information about the immune response of the participants in any of the infections, which would have been extremely valuable to understanding the development of these reinfections. Nevertheless, our study has some strong points. First, it involved a relatively large number of participants, mainly due to the fact that SARS-CoV-2 testing was carried out on a routine basis in healthcare facilities and nursing homes, increasing the possibility of detecting reinfections even in asymptomatic individuals. Thus, we were able to extensively characterize the clinical timeline of individual participants. In addition, our findings suggest a potential link between compromised health and increased susceptibility to reinfections and their decrease between 40 and 100 days after vaccination, something to be borne in mind by clinicians working with susceptible patient populations, as well as by those involved in designing health policies.

5. Conclusions

Our study sheds light on some of the factors contributing to SARS-CoV-2 reinfection, with genetic variability of the virus being the primary driver, followed by the infected individual’s underlying health status. We also found that vaccination, while essential for reducing disease severity, provided only a short-term protective effect against reinfections lasting around two months. Prospective studies evaluating more risk factors associated with reinfections and their clinical consequences—not least their impact on Long COVID—are needed.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/v17060840/s1, Figure S1: Clinical timeline coding of the participants.; Figure S2: SARS-CoV-2 variant estimations.

Author Contributions

This study was conceived by M.M., J.R.S., B.C. and M.N.-J. The DOSI contributed to data collection and management. The designs and experimental assays were conducted by M.N.-J., J.R.S., M.M., E.M., V.S., D.P.Y., A.E.B., M.P., C.C., A.B.-S., I.B. and P.-J.C. Clinical and demographic analyses were performed by F.M.-L., M.M., J.R.S., L.M. and R.P. Genomic analyses were designed by M.N.-J., F.M.-L., F.C.-M., S.F. and Á.F.G.-J. and carried out by F.M.-L. and F.C.-M. The manuscript was first drafted by F.M.-L., J.R.S. and M.M. All authors made substantial contributions to revising the successive drafts and approved the final version of the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work was funded by La Marató de TV3 Foundation (342/C/2021 to MNJ and EM), CERCA Program/Generalitat de Catalunya grants 2021 SGR 00452, Departament de Salut de la Generalitat de Catalunya (PERIS ref.: SLT028/23/000129), and the crowdfunding initiatives #joemcorono, BonPreu/Esclat, and Correos. FML was supported by a doctoral grant from the Sorigué Foundation. MM received funding from grant RYC2020-028934-I/AEI/10.13039/501100011033 issued by the Spanish Ministry of Science and Innovation and State Research Agency and the EU’s “investing in your future” Social Fund.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki and approved by the Institutional Review Board of the Hospital Universitari Germans Trias i Pujol (protocol code PI-21-235 and 22 October 2021).

Informed Consent Statement

Informed consent was waived due to the study’s retrospective design and the use of anonymized data, in compliance with the Spanish Law on Data Protection (LOPD) and the EU’s General Data Protection Regulation (GDPR) 2016/679.

Data Availability Statement

Viral genomic data generated were uploaded to the GISAID database (www.gisaid.org) and can be accessed through the set ID EPI_SET_250311pz.

Acknowledgments

We are heavily indebted to all the anonymous participants in the RECOVID cohort. We also thank the personnel of the DOSI and the Laboratori Clínic Metropolitana Nord at HUGTiP for their invaluable help in sample management and sequencing. We are also grateful to the IGTP Translational Genomics Core Facility staff for their contribution to sequencing the samples included in this study, and the CERCA Program/Generalitat de Catalunya for their support of the IGTP. We wish to acknowledge Emili Vela and the Catalan Health Institute for providing demographic information for the overall Catalan population. Lastly, we are grateful to Michael Kennedy-Scanlon for his editorial assistance, which contributed to refining the final version of this manuscript.

Conflicts of Interest

The authors declare no competing interests.

References

  1. To, K.K.-W.; Hung, I.F.-N.; Chan, K.-H.; Yuan, S.; To, W.-K.; Tsang, D.N.-C.; Cheng, V.C.-C.; Chen, Z.; Kok, K.-H.; Yuen, K.-Y. Serum Antibody Profile of a Patient with COVID-19 Reinfection. Clin. Infect. Dis. 2020, 72, e659–e662. [Google Scholar] [CrossRef] [PubMed]
  2. Van Elslande, J.; Vermeersch, P.; Vandervoort, K.; Wawina-Bokalanga, T.; Vanmechelen, B.; Wollants, E.; Laenen, L.; André, E.; Van Ranst, M.; Lagrou, K.; et al. Symptomatic Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) Reinfection by a Phylogenetically Distinct Strain. Clin. Infect. Dis. 2021, 73, 354–356. [Google Scholar] [CrossRef] [PubMed]
  3. Berber, E.; Ross, T.M. Factors Predicting COVID-19 Vaccine Effectiveness and Longevity of Humoral Immune Responses. Vaccines 2024, 12, 1284. [Google Scholar] [CrossRef] [PubMed]
  4. Peng, Q.; Zhou, R.; Wang, Y.; Zhao, M.; Liu, N.; Li, S.; Huang, H.; Yang, D.; Au, K.-K.; Wang, H.; et al. Waning Immune Responses against SARS-CoV-2 Variants of Concern among Vaccinees in Hong Kong. EBioMedicine 2022, 77, 103904. [Google Scholar] [CrossRef]
  5. Pérez-Alós, L.; Armenteros, J.J.A.; Madsen, J.R.; Hansen, C.B.; Jarlhelt, I.; Hamm, S.R.; Heftdal, L.D.; Pries-Heje, M.M.; Møller, D.L.; Fogh, K.; et al. Modeling of Waning Immunity after SARS-CoV-2 Vaccination and Influencing Factors. Nat. Commun. 1 2022, 13, 1614. [Google Scholar] [CrossRef]
  6. Stamatatos, L.; Czartoski, J.; Wan, Y.-H.; Homad, L.J.; Rubin, V.; Glantz, H.; Neradilek, M.; Seydoux, E.; Jennewein, M.F.; MacCamy, A.J.; et al. MRNA Vaccination Boosts Cross-Variant Neutralizing Antibodies Elicited by SARS-CoV-2 Infection. Science 2021, 372, 1413–1418. [Google Scholar] [CrossRef]
  7. Bosworth, M.L.; Shenhuy, B.; Walker, A.S.; Nafilyan, V.; Alwan, N.A.; O’Hara, M.E.; Ayoubkhani, D. Risk of New-Onset Long COVID Following Reinfection With Severe Acute Respiratory Syndrome Coronavirus 2: A Community-Based Cohort Study. Open Forum Infect. Dis. 2023, 10, ofad493. [Google Scholar] [CrossRef]
  8. Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on the protection of natural persons with regard to the processing of personal data and on the free movement of such data (General Data Protection Regulation). Off. J. Eur. Union 2016, L119, 1–88. Available online: https://eur-lex.europa.eu/eli/reg/2016/679/oj (accessed on 11 March 2025).
  9. Monterde, D.; Vela, E.; Clèries, M.; Garcia-Eroles, L.; Roca, J.; Pérez-Sust, P. Multimorbidity as a Predictor of Health Service Utilization in Primary Care: A Registry-Based Study of the Catalan Population. BMC Fam. Pract. 2020, 21, 39. [Google Scholar] [CrossRef]
  10. Monterde, D.; Vela, E.; Clèries, M. Adjusted Morbidity Groups: A New Multiple Morbidity Measurement of Use in Primary Care. Aten. Primaria 2016, 48, 674–682. [Google Scholar] [CrossRef]
  11. López, M.G.; Chiner-Oms, Á.; García de Viedma, D.; Ruiz-Rodriguez, P.; Bracho, M.A.; Cancino-Muñoz, I.; D’Auria, G.; de Marco, G.; García-González, N.; Goig, G.A.; et al. The First Wave of the COVID-19 Epidemic in Spain Was Associated with Early Introductions and Fast Spread of a Dominating Genetic Variant. Nat. Genet. 2021, 53, 1405–1414. [Google Scholar] [CrossRef] [PubMed]
  12. Ewels, P.A.; Peltzer, A.; Fillinger, S.; Patel, H.; Alneberg, J.; Wilm, A.; Garcia, M.U.; Di Tommaso, P.; Nahnsen, S. The Nf-Core Framework for Community-Curated Bioinformatics Pipelines. Nat. Biotechnol. 2020, 38, 276–278. [Google Scholar] [CrossRef] [PubMed]
  13. Patel, H.; Monzón, S.; Varona, S.; Espinosa-Carrasco, J.; Garcia, M.U.; Kelly, S.; Heuer, M.L.; Underwood, A.; Gabernet, G.; Ewels, P.; et al. Nf-Core/Viralrecon: Nf-Core/Viralrecon v2.6.0—Rhodium Raccoon, Seqera Labs: Barcelona, Spain. [CrossRef]
  14. Aksamentov, I.; Roemer, C.; Hodcroft, E.B.; Neher, R.A. Nextclade: Clade Assignment, Mutation Calling and Quality Control for Viral Genomes. J. Open Source Softw. 2021, 6, 3773. [Google Scholar] [CrossRef]
  15. O’Toole, Á.; Scher, E.; Underwood, A.; Jackson, B.; Hill, V.; McCrone, J.T.; Colquhoun, R.; Ruis, C.; Abu-Dahab, K.; Taylor, B.; et al. Assignment of Epidemiological Lineages in an Emerging Pandemic Using the Pangolin Tool. Virus Evol. 2021, 7, veab064. [Google Scholar] [CrossRef]
  16. Shu, Y.; McCauley, J. GISAID: Global Initiative on Sharing All Influenza Data—from Vision to Reality. Eurosurveillance 2017, 22, 30494. [Google Scholar] [CrossRef]
  17. Tamura, K.; Stecher, G.; Kumar, S. MEGA11: Molecular Evolutionary Genetics Analysis Version 11. Mol. Biol. Evol. 2021, 38, 3022–3027. [Google Scholar] [CrossRef]
  18. Nguyen, L.T.; Schmidt, H.A.; Von Haeseler, A.; Minh, B.Q. IQ-TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies. Mol. Biol. Evol. 2015, 32, 268–274. [Google Scholar] [CrossRef]
  19. Wu, F.; Zhao, S.; Yu, B.; Chen, Y.M.; Wang, W.; Song, Z.G.; Hu, Y.; Tao, Z.W.; Tian, J.H.; Pei, Y.Y.; et al. A New Coronavirus Associated with Human Respiratory Disease in China. Nature 2020, 579, 265–269. [Google Scholar] [CrossRef]
  20. R Core Team. R: A Language and Environment for Statistical Computing 2021; R Foundation for Statistical Computing: Vienna, Austria, 2021. [Google Scholar]
  21. Van Rossum, G.; Drake, F. Python 3 Reference Manual; CreateSpace: Scotts Valley, CA, USA, 2009; ISBN 1441412697. [Google Scholar]
  22. Hunter, J.D. Matplotlib: A 2D Graphics Environment. Comput. Sci. Eng. 2007, 9, 90–95. [Google Scholar] [CrossRef]
  23. Waskom, M.L. Seaborn: Statistical Data Visualization. J. Open Source Softw. 2021, 6, 3021. [Google Scholar] [CrossRef]
  24. Plotly Technologies Inc. Collaborative Data Science; Plotly Technologies Inc.: Montreal, QC, Canada, 2015. [Google Scholar]
  25. Kassambara, A. Ggpubr: “ggplot2” Based Publication Ready Plots, version 0.6.0, CRAN: Vienna, Austria, 2023.
  26. Brunson, J.C. Ggalluvial: Layered Grammar for Alluvial Plots. J. Open Source Softw. 2020, 5, 2017. [Google Scholar] [CrossRef] [PubMed]
  27. Wilke, C.O. Ggridges: Ridgeline Plots in “Ggplot2”, version 0.5.6, The Comprehensive R Archive Network (CRAN): Vienna, Austria, 2024.
  28. Wei, J.; Stoesser, N.; Matthews, P.C.; Khera, T.; Gethings, O.; Diamond, I.; Studley, R.; Taylor, N.; Peto, T.E.A.; Walker, A.S.; et al. Risk of SARS-CoV-2 Reinfection during Multiple Omicron Variant Waves in the UK General Population. Nat. Commun. 2024, 15, 1008. [Google Scholar] [CrossRef] [PubMed]
  29. Peghin, M.; De Martino, M.; Palese, A.; Chiappinotto, S.; Fonda, F.; Gerussi, V.; Sartor, A.; Curcio, F.; Grossi, P.A.; Isola, M.; et al. Antibody Response and Risk of Reinfection over 2 Years among the Patients with First Wave of COVID-19. Clin. Microbiol. Infect. 2024, 30, 522–530. [Google Scholar] [CrossRef]
  30. Marot, S.; Malet, I.; Leducq, V.; Zafilaza, K.; Sterlin, D.; Planas, D.; Gothland, A.; Jary, A.; Dorgham, K.; Bruel, T.; et al. Rapid Decline of Neutralizing Antibodies against SARS-CoV-2 among Infected Healthcare Workers. Nat. Commun. 2021, 12, 844. [Google Scholar] [CrossRef]
  31. Turabelidze, G.; Womack, A.J.; Mobley, E.; Garikapaty, V.; Finley, S. SARS-CoV-2 Reinfections during the Delta Variant Surge—Missouri, June-October, 2021. MoMed 2021, 118, 539. [Google Scholar]
  32. Pulliam, J.R.C.; van Schalkwyk, C.; Govender, N.; von Gottberg, A.; Cohen, C.; Groome, M.J.; Dushoff, J.; Mlisana, K.; Moultrie, H. Increased Risk of SARS-CoV-2 Reinfection Associated with Emergence of Omicron in South Africa. Science 2022, 376, eabn4947. [Google Scholar] [CrossRef]
  33. Breznik, J.A.; Rahim, A.; Zhang, A.; Ang, J.; Stacey, H.D.; Bhakta, H.; Clare, R.; Liu, L.M.; Kennedy, A.; Hagerman, M.; et al. Early Omicron Infection Is Associated with Increased Reinfection Risk in Older Adults in Long-Term Care and Retirement Facilities. EClinicalMedicine 2023, 63, 102148. [Google Scholar] [CrossRef]
  34. Ellingson, K.D.; Hollister, J.; Porter, C.J.; Khan, S.M.; Feldstein, L.R.; Naleway, A.L.; Gaglani, M.; Caban-Martinez, A.J.; Tyner, H.L.; Lowe, A.A.; et al. Risk Factors for Reinfection with SARS-CoV-2 Omicron Variant among Previously Infected Frontline Workers—Volume 29, Number 3—March 2023—Emerging Infectious Diseases Journal—CDC. Emerg. Infect. Dis. 2023, 29, 599–604. [Google Scholar] [CrossRef]
  35. Sacco, C.; Petrone, D.; Del Manso, M.; Mateo-Urdiales, A.; Fabiani, M.; Bressi, M.; Bella, A.; Pezzotti, P.; Rota, M.C.; Riccardo, F.; et al. Risk and Protective Factors for SARS-CoV-2 Reinfections, Surveillance Data, Italy, August 2021 to March 2022. Eurosurveillance 2022, 27, 2200372. [Google Scholar] [CrossRef]
  36. de Rioja, V.L.; Perramon-Malavez, A.; Alonso, S.; Andrés, C.; Antón, A.; Bordoy, A.E.; Càmara, J.; Cardona, P.J.; Català, M.; López, D.; et al. Mathematical Modeling of SARS-CoV-2 Variant Substitutions in European Countries: Transmission Dynamics and Epidemiological Insights. Front. Public Health 2024, 12, 1339267. [Google Scholar] [CrossRef]
  37. Chu, R.Y.K.; Szeto, K.C.; Wong, I.O.L.; Chung, P.H. A Global Scale COVID-19 Variants Time-Series Analysis across 48 Countries. Front. Public Health 2023, 11, 1085020. [Google Scholar] [CrossRef] [PubMed]
  38. Nguyen, N.N.; Nguyen, Y.N.; Hoang, V.T.; Million, M.; Gautret, P. SARS-CoV-2 Reinfection and Severity of the Disease: A Systematic Review and Meta-Analysis. Viruses 2023, 15, 967. [Google Scholar] [CrossRef] [PubMed]
  39. Wu, C.; Qian, Y. The Gender Peak Effect: Women Are Most Vulnerable to Infections during COVID-19 Peaks. Front. Public Health 2022, 10, 937179. [Google Scholar] [CrossRef] [PubMed]
  40. Bertakis, K.D.; Azari, R.; Helms, L.J.; Callahan, E.J.; Robbins, J.A. Gender Differences in the Utilization of Health Care Services. J. Fam. Pract. 2000, 49, 147–152. [Google Scholar]
  41. Seko, R.; Hashimoto, S.; Kawado, M.; Murakami, Y.; Hayashi, M.; Kato, M.; Noda, T.; Ojima, T.; Nagai, M.; Tsuji, I. Trends in Life Expectancy With Care Needs Based on Long-Term Care Insurance Data in Japan. J. Epidemiol. 2012, 22, 238. [Google Scholar] [CrossRef]
  42. Mensah, A.A.; Lacy, J.; Stowe, J.; Seghezzo, G.; Sachdeva, R.; Simmons, R.; Bukasa, A.; O’Boyle, S.; Andrews, N.; Ramsay, M.; et al. Disease Severity during SARS-COV-2 Reinfection: A Nationwide Study. J. Infect. 2022, 84, 542–550. [Google Scholar] [CrossRef]
  43. Mohammed, I.; Nauman, A.; Paul, P.; Ganesan, S.; Chen, K.H.; Jalil, S.M.S.; Jaouni, S.H.; Kawas, H.; Khan, W.A.; Vattoth, A.L.; et al. The Efficacy and Effectiveness of the COVID-19 Vaccines in Reducing Infection, Severity, Hospitalization, and Mortality: A Systematic Review. Hum. Vaccin. Immunother. 2022, 18, 2027160. [Google Scholar] [CrossRef]
  44. Omoike, E.; Miceli, A.; Busen, K.; Kohrman, A.; Sharma, M.; Harrington, P. Relative Risk of COVID-19 Reinfection by Demographic and Vaccination Factors During the Period March 2020-October 2022. J. Community Health 2024, 49, 339–342. [Google Scholar] [CrossRef]
  45. Trigueros, M.; Pradenas, E.; Palacín, D.; Muñoz-López, F.; Ávila-Nieto, C.; Trinité, B.; Bonet-Simó, J.M.; Isnard, M.; Moreno, N.; Marfil, S.; et al. Reduced Humoral Response 3 Months Following BNT162b2 Vaccination in SARS-CoV-2 Uninfected Residents of Long-Term Care Facilities. Age Ageing 2022, 51, afac101. [Google Scholar] [CrossRef]
  46. Uludağ, M.; Incitti, R.; Gao, X.; Heeney, J.L.; Gojobori, T.; Alam, I. Global Surveillance of Emerging SARS-CoV-2 Variants. Bull. World Health Organ. 2024, 102, 374. [Google Scholar] [CrossRef]
Figure 1. Inclusion flowchart for the RECOVID cohort.
Figure 1. Inclusion flowchart for the RECOVID cohort.
Viruses 17 00840 g001
Figure 2. Cohort demographics. (a) Temporal distribution of all infections. (b) Distribution of number of days between episodes. (c) Age distribution of RECOVID cohort by sex. (d) AMG in different age groups (far-left: 0–17, left-center: 18–25 years; center: 26–50 years; right-center: 51–75 years; far-right: 76–100 years). The intensity of red shading increases with the healthcare complexity of the participants.
Figure 2. Cohort demographics. (a) Temporal distribution of all infections. (b) Distribution of number of days between episodes. (c) Age distribution of RECOVID cohort by sex. (d) AMG in different age groups (far-left: 0–17, left-center: 18–25 years; center: 26–50 years; right-center: 51–75 years; far-right: 76–100 years). The intensity of red shading increases with the healthcare complexity of the participants.
Viruses 17 00840 g002
Figure 3. Vaccination data. (a) Number of participants broken down by vaccination and infection chronology. Group labels are defined as: NV—not vaccinated; I—infection; V—vaccine; B—booster dose; [B]—possible booster dose (not relevant); the timeline of the events for each group is illustrated in the right panel. The color coding indicates vaccination status: Gray—not vaccinated/vaccinated after both infections. Blue—breakthrough infection. Orange—partial breakthrough infection. (b) Density of days between vaccination and reinfection. The days are calculated using the dates of the first dose and the first positive PCR of the second infection. The three colors depict the different types of infection-vaccine chronology. (c) Number of COVID-19-related hospitalized participants and vaccination status. The x-axis shows the COVID-19 episode where hospitalization was required. Green—not vaccinated before hospitalization. Orange—vaccinated before hospitalization. Blue—partially vaccinated before hospitalization.
Figure 3. Vaccination data. (a) Number of participants broken down by vaccination and infection chronology. Group labels are defined as: NV—not vaccinated; I—infection; V—vaccine; B—booster dose; [B]—possible booster dose (not relevant); the timeline of the events for each group is illustrated in the right panel. The color coding indicates vaccination status: Gray—not vaccinated/vaccinated after both infections. Blue—breakthrough infection. Orange—partial breakthrough infection. (b) Density of days between vaccination and reinfection. The days are calculated using the dates of the first dose and the first positive PCR of the second infection. The three colors depict the different types of infection-vaccine chronology. (c) Number of COVID-19-related hospitalized participants and vaccination status. The x-axis shows the COVID-19 episode where hospitalization was required. Green—not vaccinated before hospitalization. Orange—vaccinated before hospitalization. Blue—partially vaccinated before hospitalization.
Viruses 17 00840 g003
Figure 4. Genomic data. (a) Sankey plot of the change in variant between episodes of 74 participants. The sub-variants belonging to Alpha, Delta, and Omicron have been grouped for representational purposes. (b) Phylogenetic tree of the spike sequence of the sequences of both episodes of 74 patients. The labels show the lineage, and the colors depict the episode of the infection (green—first infection; blue—second infection; red—reference sequence).
Figure 4. Genomic data. (a) Sankey plot of the change in variant between episodes of 74 participants. The sub-variants belonging to Alpha, Delta, and Omicron have been grouped for representational purposes. (b) Phylogenetic tree of the spike sequence of the sequences of both episodes of 74 patients. The labels show the lineage, and the colors depict the episode of the infection (green—first infection; blue—second infection; red—reference sequence).
Viruses 17 00840 g004
Table 1. Definitions.
Table 1. Definitions.
ConceptDefinition
ReinfectionReinfection occurs when a person contracts SARS-CoV-2 infection, recovers, and becomes infected again. In our cohort, we define reinfection as two positive PCR tests separated by at least 90 days, with a negative test in between.
Extended infectionAn extended infection is a single infection episode, lasting longer than usual, defined by at least two positive PCR tests separated by less than 30 days. Participants with extended infection were identified by reviewing complex cases in our cohort.
Complete vaccination scheduleA complete vaccination schedule was defined as receiving two doses of BNT162b2 (Pfizer-BioNTech, Mainz, Germany), mRNA-1273 (Moderna, Cambridge, MA, USA), and/or AZD1222 (AstraZeneca, Cambridge, United Kingdom), or a single dose of Ad26.COV2.S (Janssen, Leiden, The Netherlands) vaccine, with at least 14 days elapsed since the final dose.
Breakthrough infectionA breakthrough infection occurs when a fully vaccinated individual contracts SARS-CoV-2 at least two weeks after the final dose (first dose of viral vector vaccine or second dose of mRNA vaccine).
Partial breakthrough infectionA partial breakthrough infection occurs when a partially vaccinated individual contracts SARS-CoV-2, either after a single dose of an mRNA vaccine or less than two weeks after the final dose (first dose of viral vector vaccine or second dose of mRNA vaccine).
Booster doseA booster dose is an additional vaccine dose after the primary vaccination series to enhance or prolong immunity against SARS-CoV-2. We also regard as a booster dose any dose given after a breakthrough or partial breakthrough infection.
Adjusted Morbidity Groups (AMG) [10] Adjusted Morbidity Groups (AMG) [10] is a measure of a person’s general health status, developed by the Catalan Health Institute. It is based on two indicators: group and complexity. The group categories include “healthy”, “acute disease [of any type]”, “pregnant/giving birth”, “chronic condition in one system”, “chronic condition in two or three systems”, “chronic condition in four or more systems”, and “active neoplasia”. Complexity ranges from 0 to 5, reflecting an individual’s personal healthcare needs.
Table 2. Descriptive data for the RECOVID cohort (n = 2344).
Table 2. Descriptive data for the RECOVID cohort (n = 2344).
RECOVID
Number of confirmed reinfections (2 episodes), n2343
Number of confirmed reinfections (3 episodes), n1
Sex, females, n (%)1692 (72.2)
Age in years, median (IQR)45 (28–63)
Time between infections in days, median (IQR)364 (200–464)
Hospitalization related to COVID-19, n (%)103 (4.4)
 Hospitalization—first infection, n (%)86 (3.6)
 Hospitalization—second infection, n (%)11 (0.5)
 Hospitalization—both infections, n (%)6 (0.3)
Vaccinated before first infection 1, n (%)421 (18)
Vaccinated before second infection 1, n (%)1882 (80.3)
1 Vaccination status includes both fully and partially vaccinated individuals.
Table 3. Adjusted morbidity groups and comorbidities of the RECOVID cohort compared with the general population of Catalonia.
Table 3. Adjusted morbidity groups and comorbidities of the RECOVID cohort compared with the general population of Catalonia.
Prevalence (%)Chi-Square p-Value 2
RECOVID
Cohort
Catalonia
Population 1
AMG
 Healthy417.80.0001
 Acute disease (any)7.280.1586
 Pregnancy/birth2.51.30.0001
 Chronic disease 1 system1418.70.0001
 Chronic disease 2–3 systems2825.30.0028
 Chronic disease 4+ systems42.924.50.0001
 Active neoplasm (any)1.64.40.0001
Comorbidities
 Diabetes (Type I and II)11.28.20.0001
 COPD 33.64.80.0059
 Asthma6.870.6809
 Ischemic heart disease3.82.90.0097
 Chronic heart failure3.53.40.7929
 High blood pressure23.620.40.0001
 Chronic kidney failure0.974.80.0001
 Cirrhosis1.10.350.0001
 HIV 4 infection0.240.420.2195
 Malignant neoplasm (any)2.970.0001
 Arthritis0.96.30.0001
 Organ transplant (any)0.40.0170.0001
1 Data regarding the Catalan population was obtained from the Catalan Health Institute (CatSalut, 2021) and the Catalan Transplant Organization (OCATT, 2021). 2 For Chi-square calculations, we assumed a total Catalan population of 7,896,306 individuals. 3 COPD: Chronic obstructive pulmonary disease. 4 HIV: Human Immunodefiency virus
Table 4. SARS-CoV-2 lineages of sequenced samples.
Table 4. SARS-CoV-2 lineages of sequenced samples.
LineageVariantEpisode 1
N (%)
Episode 2
N (%)
AA.1-1 (1.1)0 (0)
AY.122Delta0 (0)1 (0.9)
AY.1251 (1.1)1 (0.9)
AY.1270 (0)1 (0.9)
AY.360 (0)1 (0.9)
AY.40 (0)1 (0.9)
AY.4.2.30 (0)1 (0.9)
AY.422 (2.1)1 (0.9)
AY.4326 (27.7)13 (12.3)
AY.53 (3.2)1 (0.9)
AY.5.40 (0)1 (0.9)
AY.532 (2.1)0 (0)
AY.61 (1.1)0 (0)
AY.710 (0)1 (0.9)
B.1-1 (1.1)0 (0)
B.1.1.269-3 (3.2)0 (0)
B.1.1.39-2 (2.1)0 (0)
B.1.1.406-1 (1.1)0 (0)
B.1.1.420-1 (1.1)0 (0)
B.1.1.7Alpha11 (11.7)2 (1.9)
B.1.160-1 (1.1)0 (0)
B.1.617.2-0 (0)1 (0.9)
B.1.1.177-36 (38.3)0 (0)
BA.1Omicron0 (0)9 (8.5)
BA.1.1 0 (0)22 (20.8)
BA.1.1.1 0 (0)11 (10.4)
BA.1.15 0 (0)4 (3.8)
BA.1.17 1 (1.1)24 (22.6)
BA.1.17.2 0 (0)5 (4.7)
BA.1.18 0 (0)1 (0.9)
BA.1.20 0 (0)1 (0.9)
BA.2 0 (0)1 (0.9)
BA.2.9 0 (0)2 (1.9)
P.1.7Gamma1 (1.1)0 (0)
Total 94106
Table 5. Linear models of genetic distance.
Table 5. Linear models of genetic distance.
Model 1
Coefficients
EstimateStd. errorp-value
(Intercept)1.15 × 10−31.03 × 10−5<2 × 10−16
Time3.51 × 10−64.26 × 10−8<2 × 10−16
Same individual3.24 × 10−41.05 × 10−40.00198
Adjusted R-squared0.2539
F-statistic p-value<2.2 × 10−16
Model 2
Coefficients
EstimateStd. errorp-value
(Intercept)1.59 × 10−31.12 × 10−5<2 × 10−16
Time1.54 × 10−72.9 × 10−81.98E-07
Type Alpha–Delta6.57 × 10−41.30 × 10−5<2 × 10−16
Type Alpha–Omicron1.01 × 10−31.29 × 10−5<2 × 10−16
Type B.1.177–B.1.177−1.17 × 10−31.45 × 10−5<2 × 10−16
Type B.1.177–Delta9.38 × 10−61.20 × 10−50.436
Type B.1.177–Omicron6.99 × 10−41.39 × 10−5<2 × 10−16
Type Delta–Delta−1.15 × 10−31.22 × 10−5<2 × 10−16
Type Delta–Omicron1.39 × 10−31.09 × 10−5<2 × 10−16
Type Omicron–Omicron−1.25 × 10−31.16 × 10−5<2 × 10−16
Same individual −1.44 × 10−52.93 × 10−50.623
Adjusted R-squared0.9537
F-statistic p-value<2.2 × 10−16
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Muñoz-López, F.; Bordoy, A.E.; Català-Moll, F.; Saludes, V.; Panisello Yagüe, D.; Parera, M.; Blanco, I.; Cardona, P.-J.; Casañ, C.; Blanco-Suárez, A.; et al. Identification of Clinical and Genomic Features Associated with SARS-CoV-2 Reinfections. Viruses 2025, 17, 840. https://doi.org/10.3390/v17060840

AMA Style

Muñoz-López F, Bordoy AE, Català-Moll F, Saludes V, Panisello Yagüe D, Parera M, Blanco I, Cardona P-J, Casañ C, Blanco-Suárez A, et al. Identification of Clinical and Genomic Features Associated with SARS-CoV-2 Reinfections. Viruses. 2025; 17(6):840. https://doi.org/10.3390/v17060840

Chicago/Turabian Style

Muñoz-López, Francisco, Antoni E. Bordoy, Francesc Català-Moll, Verónica Saludes, David Panisello Yagüe, Mariona Parera, Ignacio Blanco, Pere-Joan Cardona, Cristina Casañ, Ana Blanco-Suárez, and et al. 2025. "Identification of Clinical and Genomic Features Associated with SARS-CoV-2 Reinfections" Viruses 17, no. 6: 840. https://doi.org/10.3390/v17060840

APA Style

Muñoz-López, F., Bordoy, A. E., Català-Moll, F., Saludes, V., Panisello Yagüe, D., Parera, M., Blanco, I., Cardona, P.-J., Casañ, C., Blanco-Suárez, A., Franco, S., García-Jiménez, Á. F., Paredes, R., Clotet, B., Mateu, L., Noguera-Julian, M., Martró, E., Santos, J. R., & Massanella, M. (2025). Identification of Clinical and Genomic Features Associated with SARS-CoV-2 Reinfections. Viruses, 17(6), 840. https://doi.org/10.3390/v17060840

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop