Next Article in Journal
Ovotransferrin as a Multifunctional Bioactive Protein: Unlocking Its Potential in Animal Health and Wellness
Previous Article in Journal
The Protein Encoded by the UL3.5 Gene of the Duck Plague Virus Affects Viral Secondary Envelopment, Release, and Cell-to-Cell Spread
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Mining Porcine Blood Whole-DNA Sequencing Datasets to Uncover Pig Viromes: An Exploratory Application to Identify Potential Infecting Agents of an Undefined Disease Outbreak

Animal and Food Genomics Group, Division of Animal Sciences, Department of Agricultural and Food Sciences, University of Bologna, 40127 Bologna, Italy
*
Author to whom correspondence should be addressed.
Vet. Sci. 2025, 12(6), 513; https://doi.org/10.3390/vetsci12060513
Submission received: 31 March 2025 / Revised: 14 May 2025 / Accepted: 22 May 2025 / Published: 24 May 2025
(This article belongs to the Section Veterinary Biomedical Sciences)

Simple Summary

Pigs can be infected or co-infected with numerous pathogenic and non-pathogenic viruses, making it challenging to interpret clinical symptoms. Diagnostic assays should provide unbiased results to help identify the infecting viruses. In this study, we tested a new and potentially unbiased method to identify viral sequences in porcine blood. The approach involves whole-genome sequencing of all extracted DNA from blood, isolating short DNA sequences not part of the pig genome (unmapped reads) and identifying viral sequences within this fraction. This method was used to investigate a suspected outbreak of Post-weaning Multisystemic Wasting Syndrome on a pig farm. By utilising advanced DNA sequencing technology and bioinformatic analysis of DNA sequencing data, we identified sequences from 12 different viruses within the sequencing data of nine diseased pigs from the farm. The results suggest a heterogeneous profile among the analysed pigs, potentially indicating potential secondary infections or co-infections. Considering the results of this pilot application, mining unmapped reads from whole-genome sequencing data can provide useful information to better understand complex viral disease states, which can help improve veterinary surveillance.

Abstract

Pigs are affected by a variety of pathogenic agents that need to be identified correctly and diagnosed even when co-infections may complicate the application of specific and targeted assays. Next-generation sequencing can provide new perspective to monitor viruses infecting or co-infecting diseased pigs. In this study, we tested, for the first time for diagnostic purposes in a livestock species, a new method based on whole-genome sequencing of all the DNAs extracted from the blood of nine pigs sampled from a farm where there was a suspected outbreak of Post-weaning Multisystemic Wasting Syndrome. We then used unmapped reads on the porcine reference genome to mine for viral sequences using a specifically designed bioinformatic pipeline. Within this fraction of reads, viral sequences ranged from 0.002% to 4.4% of the total unmapped reads and were derived from twelve different viruses known to infect pigs, where three were herpesviruses, eight were parvoviruses, and one was a circovirus. All pig sequencing datasets were positive for one or more viruses, with various potential viral loads. Suid betaherpesvirus 2, also known as Porcine cytomegalovirus (PCMV), was the most frequently identified virus as five out of the nine pig sequencing datasets contained viral sequences from this virus. The results may suggest a heterogeneous viral profile of the diseased pigs that may be derived from potential secondary infections or co-infections. This pilot application demonstrated that a whole-genome sequencing approach can complement other routine diagnostic assays in veterinary virology. Other studies and improvements are needed to validate the results and apply this approach in routine monitoring applications.

1. Introduction

Livestock species are affected by a variety of pathogenic agents that can cause similar symptoms. In certain environmental conditions or physiological states, many agents are difficult to detect, complicating their diagnosis or identification even when assays are available and correctly applied. Furthermore, assays for many pathogens are not commercially available. Additionally, newly emerging or re-emerging diseases are not yet included in any target assays or are caused by multiple agents, making it challenging to identify diseased animals and monitor their epidemiology and diffusion. All these conditions may create gaps in the ability to detect and identify animal diseases quickly and accurately. This is especially true for viruses that mutate rapidly and frequently cause complex syndromes derived from multiple co-infections, emphasising the need for suitable and comprehensive diagnostic methods [1,2]. It is also evident that viruses are major contributors to animal health emergencies, as demonstrated by recurrent outbreaks of viral diseases in pigs, such as foot and mouth disease, African swine fever, classical swine fever, porcine reproductive and respiratory syndrome (PRRS) and several emerging zoonotic diseases, among many others, causing concerns and large economic losses [3,4]. Therefore, monitoring viruses and being capable of correctly diagnosing viral diseases are essential for ensuring proper management actions to reduce their negative impacts at the pig farm level as well as within the pig production industry as a whole [4].
Pig viral diseases have traditionally been diagnosed using isolation systems; immunoassay-based methods, such as the enzyme-linked immunosorbent assay (ELISA); and targeted PCR assays or similar nucleic acid amplification-based methods [5]. Recently, high-throughput approaches have been proposed to investigate the pig virome by using next-generation sequencing for viral metagenomic analyses [6,7,8,9,10,11,12,13].
Viral metagenomics relies on specific nucleic acid extraction protocols from available specimens that are capable of enriching the isolated material of nucleic acids from viruses [14,15,16]. Some protocols include initial steps for the purification of viral particles, the removal of host nucleic acids and sequence-dependent or -independent amplification of viral nucleic acids (including the common sequence-independent single-primer amplification approach), followed by next-generation sequencing after library preparation and finally the identification of viral sequences by comparing obtained reads against annotated viral sequence databases, with several modifications and improvements in the flow of the various steps listed [17,18,19]. Despite viral metagenomics largely contributing to enhancing our understanding of the composition, diversity and pathogenic roles of animal viromes, the various ways in which different steps are conducted, starting from the viral extraction methodologies from different tissues to the amplification protocols, can produce biases in the obtained viral profiles [19,20,21,22,23,24].
Unbiased approaches not based on the enrichment of viral particles or amplification steps have been proposed to create catalogues of animal tissue-specific viromes. For example, the human blood DNA virome has been explored from whole-DNA sequencing (or whole-genome sequencing) data using sequence reads that did not map to the human reference genome [25,26,27]. A fraction of these unmapped reads was from viruses infecting the sampled individuals, demonstrating that mining whole-genome sequencing data obtained for other purposes (e.g., analysis of the host genome) can be useful for monitoring infections. We applied a similar approach by mining shotgun sequencing datasets obtained from blood DNA pools derived from 100 archived porcine samples to analyse the pig genome and, as a byproduct, identify several viruses of the Parvoviridae family [28]. Subsequently, we further expanded this approach by mining whole-genome sequencing datasets obtained to analyse the cattle, pig, chicken and rabbit genomes and identified a large number of viral sequences from the unmapped reads [29]. This demonstrates the utility of this approach in describing the animal DNA virome with potential diagnostic purposes, especially when the pathogenic condition is not clear and cannot be easily diagnosticated.
In this study, we tested this approach to search for a potential explanation for a disease outbreak that occurred on a pig farm. To our knowledge, this is the first practical pilot application of this approach to detect potential infecting agents in a livestock species. The animals investigated displayed symptoms that could be attributed to Post-weaning Multisystemic Wasting Syndrome (PMWS) or uncharacterised multi-virus infections. To evaluate the presence of undefined viruses, we used porcine blood as a source of DNA. The whole DNA was sequenced using a shotgun next-generation sequencing approach. We then used unmapped reads on the pig reference genome, which may contain sequence data not derived from the host genome, to identify virus sequences.

2. Materials and Methods

2.1. General Disease Description and Standard Laboratory Diagnostic Analyses

In May 2023, a disease outbreak occurred at a growth-to-finish pig farm (open-cycle) located in Northern Italy. The farm was closely monitored by veterinary personnel who provided clinical information. The affected animals, specifically seven-week-old pigs (about three weeks after weaning), displayed severe clinical signs, such as fever, dyspnea and coughing. The animals also experienced weight loss, gradually becoming emaciated, and unfortunately did not recover. Approximately 20–30% of the animals died one/two weeks after clinical signs appeared. Post-weaning Multisystemic Wasting Syndrome (PMWS) was suspected.
Before an experimental design for this study could be implemented, during the outbreak, veterinary inspectors collected lung specimens from four deceased pigs and sent samples to detect the presence of potential viral pathogens associated with respiratory and wasting syndromes in pigs. Diagnostic testing was conducted at Vaxxinova GmbH Diagnostic (Leipzig, Germany), where standard real-time PCR analyses were used to search for several viruses, including porcine circovirus 2 (PCV2), porcine circovirus 3 (PCV3), porcine reproductive and respiratory syndrome virus (PRRSV), and Swine influenza virus A (SIV). These analyses were conducted to monitor the presence of these viruses in the farm production sites.
Animals were not specifically bred, raised or treated for this study. The pig farm was closely inspected by authorised veterinary personnel as part of the routine husbandry and health management practices; therefore, no ethical concerns are applicable.

2.2. Sampled Pigs and Whole-Genome Sequencing

To further investigate the potential involvement of DNA viruses in the outbreak other than those tested directly, we applied a whole-DNA sequencing approach, expanding on the proof-of-concept demonstrated in our previous studies [28,29]. Whole-DNA sequencing analyses were applied to DNA samples obtained from nine pigs (specimens S1–S9) with the following characteristics: (i) the pigs were weaned at 28 days of age, (ii) vaccinated for PRRS at ~14 days, (iii) vaccinated for PCV at ~25 days and (iv) raised in three different sites (site #1, site #2 and site #3) of the farm (3 pigs/site) at ~50 days of age. Pigs were sampled for blood as part of the outbreak investigation by authorised veterinary personnel at 70–90, 50–80 and 95–105 days of age for site #1, site #2 and site #3, respectively. DNA was extracted using the PureLink™ Genomic DNA Mini Kit (Thermo Fisher Scientific, Waltham, MA, USA). A total of 4.0 μg of extracted DNA from each of the nine pig samples was used for library preparation. Nine genomic libraries of 300–400 bp in size were constructed and sequenced on a BGISeq500 machine (BGI, Shenzhen, China), following the provider’s protocol. Approximately 56 Gb of sequenced paired-end reads of 150 bp in length was obtained for each sampled pig.

2.3. Mining Whole-Genome Sequencing Datasets for Virus Sequences

The characterisation of the virome followed the bioinformatics pipeline previously outlined by Bovo et al. [29]. Briefly, the workflow consists of three main steps:
  • The development of a comprehensive virus database was based on the NCBI Virus resource (https://www.ncbi.nlm.nih.gov/genome/viruses; accessed on 5 February 2025), which contains DNA sequences of reference viral genomes and their strains. Processing and curating this resource are essential to create a reliable database.
  • Host-related reads were removed by aligning sequenced reads to the Sus scrofa genome, which was assembled using Sscrofa11.1. Unmapped reads, which represent non-host DNA, likely originating from microorganisms or viruses, were then extracted.
  • Screening for viral sequences involved aligning the extracted unmapped reads to the curated virus database to identify viral sequences. This process included a quick alignment using the BWA v.0.7.17 tool, followed by confirming alignments with a more sensitive approach, like the BLAST+ v.2.7.1 tool. Only viral genomes with a minimum of three aligned read pairs (six reads) were considered representative of the sample.

2.4. Statistical Analyses

Pearson’s correlation was utilised to compare the number of unmapped reads and the number of reads matching virus sequences across whole-genome sequencing datasets.
A positive match was defined as a dataset containing reads assigned to that specific virus. A dataset has the number of positive matches equal to the number of different viruses identified within its unmapped reads. Fisher’s exact tests were used to compare the number of positive matches across pigs from different production sites. This test aimed to assess potential differences in the number of distinct viruses infecting the three production sites.
Additionally, these tests were used to evaluate differences in virus infection loads between two pigs (from the same or different production sites) for the same virus. This was performed by comparing the corresponding pair of datasets and analysing the number of reads assigned to the identified virus in relation to the total number of unmapped reads. A significance threshold was set at p < 0.01 after Bonferroni correction for multiple testing.
A positive whole-genome sequencing dataset was defined as a dataset that contained reads mapping against at least one specific porcine virus. A positive dataset was also considered in relation to a specific virus. The number of positive datasets identified in this study was compared with the number of positive datasets identified by Bovo et al. [29] who mined a total of 464 porcine whole-genome sequencing datasets retrieved from the European Nucleotide Archive (ENA). These ENA datasets, derived from 52 pig populations from various countries, constitute random pig samples without any information on their health status. However, it may be assumed that they were from healthy populations based on the fact that these datasets were derived from animals that were declared to be healthy or for which the experimental analyses from which these datasets have been produced needed to be derived from healthy pigs [29]. For the comparison, Fisher’s exact tests were used. Also, for this group of tests, the significance threshold was set at p < 0.01 after Bonferroni correction for multiple testing.

3. Results

3.1. Description of the Disease Outbreak at the Farm Level and Standard Laboratory Analyses

As part of the preface of this study, we provide information on the routine veterinary practice that was initiated to evaluate the disease outbreak that occurred on the pig farm. Post-weaning Multisystemic Wasting Syndrome (PMWS) was initially suspected based on the clinical description of the affected piglets (which included fever, dyspnea, cough, weight loss and progressive emaciation), as documented by the veterinary personnel on the farm. Therefore, some specimens were collected by the veterinary personnel and submitted for routine investigation at a commercial veterinary laboratory. Given that PMWS may be primarily associated with PCV2 [30,31], specific diagnostic testing focused on detecting this virus and potential co-infecting agents [32,33,34,35], including PCV3, PRRSV and SIV. These tests were conducted on deceased animals. The four tested samples were negative for PCV2, suggesting that this virus may not be directly associated with the disease outbreak on this farm. Similarly, all four samples also tested negative for PCV3 and SIV, indicating that these viruses were not contributing to the observed clinical signs. Instead, PRRSV was detected in all samples, raising the possibility that this virus played a role in the outbreak. PRRSV is known to cause severe respiratory disease in post-weaning pigs, which may explain some of the symptoms observed in the affected animals.

3.2. Overview of Virus Sequences in Whole-DNA Sequencing Datasets

Since the preliminary viral profile obtained from deceased animals may not fully explain the observed clinical signs, we tested the possibility to obtain an unbiased screening of the blood DNA virome. This was performed by performing whole-DNA sequencing on the total DNA extracted from the blood of a select few affected pigs. These pigs were chosen randomly from those displaying the most severe clinical signs and were raised in three different production sites on the same farm (three pigs for each of the three sites). This approach targets DNA and cannot identify RNA viruses unless they utilise DNA intermediates during replication.
Whole-DNA sequencing was performed on each sample, generating approximately 400 million reads. On average, the Sus scrofa reference genome (host genome) had a coverage of 99%, with a sequencing depth of approximately 20×. Summary statistics of the unmapped reads are shown in Table 1. The average proportion of unmapped reads per sample was approximately 0.13% of all sequenced reads (Table 1). All the pigs studied had reads that matched some virus sequences. After further refinement of the alignments, the fraction of unmapped reads that truly mapped to viral genomes ranged from 0.002% to 4.40%. The correlation between the number of unmapped reads and the number of reads assigned to virus sequences was moderate (0.45). This suggests that there is a relationship between the content of DNA viruses and the total DNA not assigned to the pig genome, likely stemming from variations in virus infection loads among pigs.

3.3. Viral DNA Profiles in the Pigs Studied

A total of 25 matches with viral genomes, defined as the combination between pigs and the presence of reads from different viruses, were identified in the nine whole-DNA sequencing datasets (Table 2). Each match was confirmed by a minimum of three read pairs, with some viruses identified in multiple samples. Most of the pigs were positive for more than one virus, meaning that seven out of the nine obtained whole-DNA sequencing datasets contained reads matching two to six different viruses. Only two samples, one from site #1 (S2) and one from site #2 (S6), were positive for only one virus (the same in both samples: Suid betaherpesvirus 2).
These matches included a total of 12 different viruses (Table 2), all known to infect vertebrate species. The genome coverage for these viruses is shown in Figure 1. Among them, eleven viruses are clearly associated with pigs, where three are herpesviruses, seven are parvoviruses, and one is a circovirus. For two parvoviruses, reads consistently matched different Porcine parvovirus 2 (PPV2) reference genomes (Table 2), potentially identifying different strains. For this reason, we maintained separate information for each, counting them as separate viruses.
Among the identified herpesviruses, Suid betaherpesvirus 2 [SuBHV2; also known as Porcine cytomegalovirus (PCMV)] was the most frequently identified virus in terms of the presence of reads matching its genome among the sequenced reads of the investigated pigs. Five out of the nine pig samples from all production sites tested positive. Pig S2 had a significantly potential higher virus load compared to the other positive pigs (Table 2). This virus causes lifelong latent infections in pigs and can occasionally lead to rhinitis in piglets and reproductive disorders in pregnant sows [35,36]. Sequences from other herpesviruses were from Suid gammaherpesvirus 3 [SuGHV3; also known as Porcine lymphotropic herpesvirus 1 (PLVH1)] and Suid gammaherpesvirus 4 [SuGHV4; also known as Porcine lymphotropic herpesvirus 2 (PLHV2)]. These viruses are usually considered harmless or are associated with mild clinical signs [37,38]. Sequences matching Suid gammaherpesvirus 3 were identified in three pigs from two production sites (one from site #1 and two from site #3), with pig S9 from site #3 having a significantly higher load compared to the other two positive pigs. Sequences matching Suid gammaherpesvirus 4 were only found in one pig from site #2, with a relatively higher virus load compared to other herpesviruses in the other pigs (Table 2).
Among the identified parvoviruses, Ungulate copiparvovirus 4 [also known as Porcine parvovirus 6 (PPV6)] was the virus for which sequences were detected in three pigs, all from site #3. This virus has been associated with porcine reproductive failure in sows with mono-infections, although its pathogenic impact is still not completely clear [39]. Notably, the read counts supporting the detection of this virus were very high, reaching up to 4.4% of the unmapped read sets, with an average depth of over 1000 (Table 1 and Table 2). Pig S9 had a significantly lower virus load than the other two pigs.
Sequences matching four other parvoviruses were each identified in more than one pig: Ungulate tetraparvovirus 3, which had reads matching the reference sequence Parvovirus YX-2010/CHN CnPPV_JH13 and is considered a strain of Porcine parvovirus 2 (PPV2); Porcine parvovirus 2, with reads matching the Porcine parvovirus 2 strain BR/GO/ion_09_PPV-2/2011; Ungulate tetraparvovirus 2, also known as Porcine parvovirus 3 (PPV3) or Porcine hokovirus; and Porcine partetravirus. The two pigs (S1 and S4) that tested positive for both Ungulate tetraparvovirus 3 and Porcine parvovirus 2 had significantly different loads of viral reads. Since reads matching both viruses were identified in the same two pigs, it is possible that different strains of PPV2 may have infected the same animals, given the varying numbers of reads that were detected in the same animals for the two strains.
Reads from the other two parvoviruses, Ungulate copiparvovirus 2 [also known as Porcine parvovirus 4 (PPV4)] and Porcine parvovirus 5 (PPV5), were each identified in the sequencing dataset of only one pig.
Reads matching Porcine circovirus 2 (PCV2) (NCBI: NC_005148) were observed in one sample from site #1. PCV2 is the primary causative agent of porcine circovirus-associated disease (PCVAD) and the responsible primary agent of PMWS, including some enteric, respiratory and reproductive disorders [40].
Regarding the virus that was not directly associated with the pig, reads in two pigs (with significantly different viral loads) matched the version of the genome indicated to belong to Rodent protoparvovirus 1 (NCBI: NC_001510). However, reads better aligned with the GenBank sequence AB076669.1 rather than the indicated reference genome. This GenBank sequence has been included in an entry annotated as another parvovirus first identified in pigs from Myanmar [41]. Therefore, we can consider that all twelve different viruses may be regarded as prevalently associated with the pig, increasing the number of viruses assigned to the parvovirus group to eight, among those identified in the nine investigated pigs.

3.4. Putative Association of Virus Profiles with a Disease Condition

Based on the results regarding the prevalence of reads from different DNA viruses in the nine diseased pigs and the seemingly random distribution of these viruses, the results may suggest a heterogeneous profile that does not clearly explain the Post-weaning Multisystemic Wasting-like syndrome suggested by the clinical signs.
One out of the nine pig whole-DNA sequencing datasets contained reads from PCV2, which is usually associated with PMWS [30,31,32,33,34]. This virus was not detected in preliminary investigations that attempted to find explanations for the disease outbreak. The prevalence of the reads from this virus detected through whole-DNA sequencing (one out of the nine pigs) was lower than expected given the small number of pigs that was initially analysed as part of the preface of this study (none of the four pigs were positive for this virus).
We then evaluated whether there were some differences in the number of positive matches of viral sequences between the three production sites. Out of the 25 positive matches of viral sequences in the nine porcine whole-DNA sequencing datasets, 7 matches were found in pigs raised on site #1, while pigs raised in sites #2 and #3 had a total of 9 matches each (Table 2). There were no statistical differences in the number of positive matches between the three production sites.
In this study, we identified reads from seven viruses (Suid gammaherpesvirus 4, Suid gammaherpesvirus 3, Suid betaherpesvirus 2, Ungulate copiparvovirus 4, Ungulate tetraparvovirus 2, Porcine parvovirus 5 and Porcine partetravirus) that were also found in a previous study [29]. The previous study involved analysing 464 whole-DNA sequencing datasets from ENA derived from 52 different pig populations and many different studies. In this control dataset, Suid betaherpesvirus 2, which was the most common virus from which reads were identified in our current study (55.5%), had a frequency of 3.2%. The difference in positive datasets was statistically significant (Fisher’s exact test, p = 0.00009). Significant differences in prevalence between the current study and the reference datasets from ENA [29] were also observed for an additional five viruses: Suid gammaherpesvirus 3 (33.3% in this study and 13.1% in ENA; p = 0.0005), Ungulate copiparvovirus 4 (33.3% in this study and 1.3% in ENA; p = 0.0003), Ungulate tetraparvovirus 3 (22.2% in this study and 0.0% in ENA; p = 0.00032), Porcine parvovirus 2 (22.2% in this study and 0.0% in ENA; p = 0.00032) and Rodent Protoparvovirus 1 (22.2% in this study and 0.0% in ENA; p = 0.00032). Since all the pigs analysed in our study were diseased animals, the higher prevalence of reads from these viruses in diseased pigs may suggest a potential association with Post-weaning Multisystemic Wasting-like syndrome. Furthermore, a total of only 130 out of the 464 control datasets tested positive for at least one virus for which we detected reads in this study. The 100% detection rate of pigs in our group of diseased pigs was significantly different from the 28.0% observed in the ENA control datasets (p = 0.00001). This indicates that the presence of one or more of the detected viruses in pigs may be indicative of a potential association with a complex disease state.

4. Discussion

In this study, we tested the use of whole-DNA sequencing datasets obtained from diseased pigs as a potential source of viral sequence information for diagnostic purposes. This study can be considered a pilot application that can provide insights to evaluate the potential and limitations of this approach, with the final objective being to improve protocols and data analysis methodologies. Additionally, this exploratory approach may help clarify complex and undefined disease outbreaks caused by multiple pathogenic agents. Ultimately, this approach could complement other routine diagnostic assays in veterinary virology and traditional viral metagenomic analyses.
One advantage of the tested approach is that the production of whole-DNA sequencing datasets does not require any specific enrichment protocols of viral particles or amplification steps, eliminating the biases that may occur from these procedures. Standard DNA extractions are traditionally used to obtain DNA that is then sequenced. However, it is not yet clear whether different DNA extraction methods, protocols and kits may affect the possibility of capturing virus particles and producing viral sequences using this approach. A few studies have evaluated the potential impact of various DNA extraction kits and enrichment strategies on the outcomes of traditional viral metagenomic protocols, indicating that biassed results may be derived and could also be tissue-dependent [21,42,43]. However, these issues may be reduced in shotgun whole-DNA sequencing where all DNA is extracted and sequenced without any prefiltering or preference.
One of the main potential problems in viral metagenomics is the high frequency of contaminating virus sequences that are sometimes difficult to separate from the true viromes of the investigated specimens [16,44,45,46]. We did not identify any potential contaminating viral sequences that are commonly present in molecular-grade reagents or that are ubiquitous in some laboratories that routinely investigate viromes. This indicates that our laboratory procedures did not introduce any artefacts into the detected viral sequences.
Another advantage of shotgun whole-DNA sequencing is that novel DNA viruses are also sequenced, providing useful information to discover previously unknown viral diseases [25,26,27,28,29,47]. This should however be paired with specifically designed data mining bioinformatic pipelines that are capable of capturing novel viruses [47]. This may increase the computational effort in analysing the datasets. Computational efforts are already much higher in our approach than in traditional viral metagenomics because we first have to separate reads from those belonging to the host genome (the pig). We used only unmapped reads which are indirectly enriched with virus sequences. However, unmapped reads may also contain bacterial and fungal DNA derived not only from environmental contaminants but also from pathogenic agents, enlarging the possibility of using unmapped reads as a source of multi-kingdom pathogen information. We will further mine the datasets produced from the nine sampled pigs to obtain a more complete picture of the potential microbial agents contributing to the described disease state. Computational efforts may increase with the depth of sequencing. Increasing the depth of sequencing per sample (i.e., the number of reads generated) would also be useful to enhance the sensitivity of this approach in detecting viral sequences, as it is not entirely clear what the correlation may be between the viral load and the pathogenic conditions of the animals in most viruses. On the other hand, increasing the number of reads produced may also increase the cost of the assay. However, since the sequencing cost is continuously dropping and computing power is expected to increase, these conditions may make a routine approach based on whole-DNA sequencing at high depths feasible in the future. For comparative analyses across datasets or assays, it would be important to define some standards in terms of sequencing depth.
In this study, we analysed DNA sequencing datasets to identify DNA viruses. As a result, we were unable to detect any RNA viruses, which could be relevant in understanding the complex disease observed in the analysed pigs. PRRSV is an RNA virus that during the preface of this study has been shown to infect pigs of the farms from which we investigated the nine diseased pigs. To identify RNA viruses, one option is to sequence the entire transcriptome of a tissue/specimen and then analyse the obtained datasets to pinpoint porcine RNA viruses. Some studies have successfully utilised animal transcriptomes to identify RNA viruses using the same method we employed in our study by focusing on unmapped reads in the corresponding reference genome. This strategy has shown promise in identifying infections caused by RNA viruses [48,49]. However, this approach may increase costs not only for sequencing and data analysis but also for sampling to prevent RNA degradation. Another possibility is to simultaneously analyse DNA and RNA viruses by modifying nucleic acid extraction protocols and sequencing procedures, as suggested in viral metagenomics studies [50,51].
Another question arises from the tissue used for whole-DNA sequencing: blood. While blood may be the most convenient tissue to sample in live animals, it may not always be the most suitable target tissue for certain viruses. Some studies have evaluated this question applied to PMWS [52,53]. Sampling other tissues in deceased animals could provide a more accurate description of pathogenic infections. Conducting whole-DNA sequencing on multiple tissues could offer valuable comparative infection data, helping to identify the most appropriate tissue for specific diseases. However, analysing multiple tissues would increase the cost of the assay, making it unrealistic to routinely use more than one target tissue.
By mining whole-DNA sequencing datasets obtained from nine diseased pigs, we identified reads matching twelve viral genomes. There was no consistent viral profile among sites or pigs. In all diseased pigs, we found sequences for at least one pig-specific DNA virus. However, the number of positive matches for each pig varied from one to six different viruses, with a diversity among pigs suggesting that the identified DNA viruses could represent secondary infections or co-infections within another pathogenic condition primarily determined by PRRSV. Co-infection with multiple DNA viruses is common in PRRS, leading to synergistic pathogenicity and complex clinical signs [54,55,56]. A co-infection situation is also common in PMWS [6].
The number of reads identified for the same virus in different pigs may indicate varying viral loads. For some viruses, there were significant differences between pigs. For instance, reads matching Ungulate copiparvovirus 4 were found in all three pigs from site #3. In pig S9, the number of reads was approximately two orders of magnitude lower than that of the other two pigs. Significant variations in the number of virus reads between pigs were also observed for Suid betaherpesvirus 2, Suid gammaherpesvirus 3, Ungulate tetraparvovirus 3, Porcine parvovirus 2 and Rodent protoparvovirus 1. It will be interesting to validate these differences in viral loads with other analytical approaches, such as quantitative PCR.
To indirectly evaluate if the viral profile obtained for the nine investigated pigs could effectively be associated with their disease state, we used the results obtained in a previous study where we identified the presence of viral sequences in 464 porcine whole-DNA sequencing datasets retrieved from ENA as the reference [29]. These datasets have been produced from more than 50 pig populations that could indirectly be considered healthy. Seven out of twelve viruses (Suid betaherpesvirus 2, Suid gammaherpesvirus 3, Suid gammaherpesvirus 4, Ungulate copiparvovirus 4, Ungulate tetraparvovirus 2, Porcine parvovirus 5 and Porcine partetravirus) for which we detected reads in the nine diseased pigs were also present in some datasets derived from ENA. For three of these viruses (Suid betaherpesvirus 2, Suid gammaherpesvirus 3 and Ungulate copiparvovirus 4), there was a significantly lower prevalence in the ENA datasets than in the pigs analysed in this study. For the remaining four other viruses (Suid gammaherpesvirus 4, Ungulate tetraparvovirus 2, Porcine partetravirus and Porcine parvovirus 5), there were no statistically significant differences when considering their prevalence in the datasets of the nine investigated pigs and in the ENA dataset. Reads matching three other viruses (Ungulate tetraparvovirus 3, Porcine parvovirus 2 and Rodent protoparvovirus 1) were identified only in the nine pigs analysed in this study. Based on these comparative analyses, the presence of reads from five of these viruses (associated with the disease state, i.e., the three viruses with significantly higher prevalence in the diseased pigs of this study and the two viruses that were not present in the ENA dataset but were detected in the diseased pigs) may be indicative markers of a complex pathological condition in pigs similar to PMWS. However, this hypothesis should be considered with caution because the reference dataset was not specifically constructed for this purpose, and the precise health condition of the pigs in the 464 datasets is not completely known. A larger reference dataset with additional metadata per sample, sequenced in a standardised way, should be created to better facilitate comparative evaluations in defining potential diseased states in pigs for case and control studies, as we indirectly evaluated in this study.

5. Conclusions

To our knowledge, this exploratory study tested for the first time the possibility of using whole-DNA sequencing datasets as a source of viral information useful for detecting infection states in diseased pigs. For the routine application of this approach, some improvements are needed, specifically the ability to analyse both RNA and DNA viruses simultaneously. Additionally, comparing the viral profile in healthy pigs is necessary to interpret the viral profiles of diseased animals. Therefore, a larger dataset of whole-DNA sequencing data obtained in a standardised way would be necessary to avoid any biases in interpretation. Validation of the results derived from this approach with other viral metagenomic assays or specific target assays is needed to better evaluate the pros and cons of this approach. Looking ahead, mining unmapped reads from whole-DNA sequencing data can provide useful information to better understand disease states that are difficult to interpret with conventional assays and approaches.

Author Contributions

Conceptualization, L.F. and S.B.; methodology, S.B. and L.F.; software, S.B. and G.S.; validation, S.B., A.R., V.T. and M.B.; formal analysis, S.B., A.R. and V.T.; investigation, S.B., G.S., F.B. and L.F.; resources, L.F. and S.B.; data curation, S.B., M.B., G.S. and F.B; writing—original draft preparation, L.F. and S.B.; writing—review and editing, S.B., A.R., G.S., V.T., M.B., F.B. and L.F.; visualisation, S.B.; supervision, L.F., S.B. and F.B.; project administration, L.F.; funding acquisition, L.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the University of Bologna, RFO 2024 Funds; by the European Open Science Cloud (EOSC) Secretariat, project “Application of animal genomics and data mining to predict and monitor novel coronavirus potential infections (VirAnimalOne)”; the EGI call for COVID-19 research projects (AnGen1H project); and the POR FESR Emilia-Romagna 2014–2020 (actions 1.1.4 and 1.2.2—Bando per sostenere progetti di ricerca ed innovazione per lo sviluppo di soluzioni finalizzate al contrasto dell’epidemia da COVID-19—Project LIVESTOCK-STOP-COVI).

Institutional Review Board Statement

The study was conducted in accordance with the National and European legislation on animal health and welfare. Pigs were not specifically bred or treated for this study. Animals were blood sampled by authorised veterinary personnel for routine health monitoring analyses. Aliquoted blood samples were provided to the Animal and Food Genomics Group only for DNA analyses after veterinary approval following the national law for animal inspection and health monitoring. Therefore, no ethical approval was needed for this study.

Informed Consent Statement

Written consent on the use of the biological material for research purposes as described in this study was obtained from the farmer.

Data Availability Statement

The sequencing datasets generated and analysed during the current study are available in the EMBL-EBI European Nucleotide Archive (ENA) repository (http://www.ebi.ac.uk/ena) under the study accession PRJEB87604.

Acknowledgments

We thank Silvio Zavattini (DVM) for the veterinary assistance, activity and support and Maurizio Gallo (Associazione Nazionale Allevatori Suini) for the collaboration in this study.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:
ELISAEnzyme-linked immunosorbent assay
ENAEuropean Nucleotide Archive
GbGiga base pairs
PCV2Porcine circovirus 2
PCV3Porcine circovirus 3
PCVADPorcine circovirus-associated disease
PCVMPorcine cytomegalovirus
PLHV1Porcine lymphotropic herpesvirus 1
PLHV2Porcine lymphotropic herpesvirus 2
PMWSPost-weaning Multisystemic Wasting Syndrome
PPV2Porcine parvovirus 2
PPV3Porcine parvovirus 3
PPV4Porcine parvovirus 4
PPV5Porcine parvovirus 5
PPV6Porcine parvovirus 6
PRRSPorcine reproductive and respiratory syndrome
PRRSVPorcine reproductive and respiratory syndrome virus
SIVSwine influenza virus A
SuBHV2Suid betaherpesvirus 2
SuGHV3Suid gammaherpesvirus 3
SuGHV4Suid gammaherpesvirus 4

References

  1. Cassedy, A.; Parle-McDermott, A.; O’Kennedy, R. Virus Detection: A Review of the Current and Emerging Molecular and Immunological Methods. Front. Mol. Biosci. 2021, 8, 637559. [Google Scholar] [CrossRef] [PubMed]
  2. Neujahr, A.C.; Loy, D.S.; Loy, J.D.; Brodersen, B.W.; Fernando, S.C. Rapid detection of high consequence and emerging viral pathogens in pigs. Front. Vet. Sci. 2024, 11, 1341783. [Google Scholar] [CrossRef] [PubMed]
  3. McLean, R.K.; Graham, S.P. The pig as an amplifying host for new and emerging zoonotic viruses. One Health 2022, 14, 100384. [Google Scholar] [CrossRef]
  4. Alarcón, L.V.; Allepuz, A.; Mateu, E. Biosecurity in pig farms: A review. Porcine Health Manag. 2021, 7, 5. [Google Scholar] [CrossRef]
  5. Yoon, K.-J. Overview of Viruses. In Diseases of Swine, 10th ed.; Zimmerman, J.J., Karriker, L.A., Ramirez, A., Schwartz, K.J., Stevenson, G.W., Eds.; Wiley-Blackwell: West Sussex, UK, 2012; pp. 383–391. [Google Scholar]
  6. Blomström, A.-L.; Fossum, C.; Wallgren, P.; Berg, M. Viral metagenomic analysis displays the co-infection situation in healthy and PMWS affected pigs. PLoS ONE 2016, 11, e0166863. [Google Scholar] [CrossRef]
  7. Shan, T.; Li, L.; Simmonds, P.; Wang, C.; Moeser, A.; Delwart, E. The fecal virome of pigs on a high-density farm. J. Virol. 2011, 85, 11697–11708. [Google Scholar] [CrossRef]
  8. Karlsson, O.E.; Larsson, J.; Hayer, J.; Berg, M.; Jacobson, M. The intestinal eukaryotic virome in healthy and diarrhoeic neonatal piglets. PLoS ONE 2016, 11, e0151481. [Google Scholar] [CrossRef]
  9. Yang, S.; Zhang, D.; Ji, Z.; Zhang, Y.; Wang, Y.; Chen, X.; He, Y.; Lu, X.; Li, R.; Guo, Y.; et al. Viral Metagenomics Reveals Diverse Viruses in Tissue Samples of Diseased Pigs. Viruses 2022, 14, 2048. [Google Scholar] [CrossRef]
  10. He, B.; Gong, W.; Yan, X.; Zhao, Z.; Yang, L.; Tan, Z.; Xu, L.; Zhu, A.; Zhang, J.; Rao, J.; et al. Viral Metagenome-Based Precision Surveillance of Pig Population at Large Scale Reveals Viromic Signatures of Sample Types and Influence of Farming Management on Pig Virome. mSystems 2021, 6, e0042021. [Google Scholar] [CrossRef]
  11. Bhatta, T.R.; Chamings, A.; Alexandersen, S. Exploring the cause of diarrhoea and poor growth in 8–11-week-old pigs from an Australian pig herd using metagenomic sequencing. Viruses 2021, 13, 1608. [Google Scholar] [CrossRef]
  12. Schuele, L.; Lizarazo-Forero, E.; Strutzberg-Minder, K.; Schütze, S.; Löbert, S.; Lambrecht, C.; Harlizius, J.; Friedrich, A.W.; Peter, S.; Rossen, J.W.A.; et al. Application of shotgun metagenomics sequencing and targeted sequence capture to detect circulating porcine viruses in the Dutch-German border region. Transbound. Emerg. Dis. 2022, 69, 2306–2319. [Google Scholar] [CrossRef]
  13. Zhou, L.; Zhou, H.; Fan, Y.; Wang, J.; Zhang, R.; Guo, Z.; Li, Y.; Kang, R.; Zhang, Z.; Yang, D.; et al. Metagenomics to Identify Viral Communities Associated with Porcine Respiratory Disease Complex in Tibetan Pigs in the Tibetan Plateau, China. Pathogens 2024, 13, 404. [Google Scholar] [CrossRef]
  14. Klenner, J.; Kohl, C.; Dabrowski, P.W.; Nitsche, A. Comparing viral metagenomic extraction methods. Curr. Issues Mol. Biol. 2017, 24, 59–70. [Google Scholar] [CrossRef]
  15. Kohl, C.; Brinkmann, A.; Dabrowski, P.W.; Radonić, A.; Nitsche, A.; Kurth, A. Protocol for metagenomic virus detection in clinical specimens. Emerg. Infect. Dis. 2015, 21, 48. [Google Scholar] [CrossRef]
  16. Zhang, D.; Lou, X.; Yan, H.; Pan, J.; Mao, H.; Tang, H.; Shu, Y.; Zhao, Y.; Liu, L.; Li, J.; et al. Metagenomic analysis of viral nucleic acid extraction methods in respiratory clinical samples. BMC Genom. 2018, 19, 773. [Google Scholar] [CrossRef]
  17. Thurber, R.V.; Haynes, M.; Breitbart, M.; Wegley, L.; Rohwer, F. Laboratory procedures to generate Viral metagenomes. Nat. Protoc. 2009, 4, 470–483. [Google Scholar] [CrossRef]
  18. Mokili, J.L.; Rohwer, F.; Dutilh, B.E. Metagenomics and future perspectives in virus discovery. Curr. Opin. Virol. 2012, 2, 63–77. [Google Scholar] [CrossRef]
  19. Conceição-Neto, N.; Zeller, M.; Lefrère, H.; De Bruyn, P.; Beller, L.; Deboutte, W.; Yinda, C.K.; Lavigne, R.; Mae, P.; Van Ranst, M.; et al. Modular approach to customise sample preparation procedures for viral metagenomics: A reproducible protocol for virome analysis. Sci. Rep. 2015, 5, 16532. [Google Scholar] [CrossRef]
  20. Kieft, K.; Anantharaman, K. Virus genomics: What is being overlooked? Curr. Opin. Virol. 2022, 53, 101200. [Google Scholar] [CrossRef]
  21. Parras-Moltó, M.; Rodríguez-Galet, A.; Suárez-Rodríguez, P.; López-Bueno, A. Evaluation of bias induced by viral enrichment and random amplification protocols in metagenomic surveys of saliva DNA viruses. Microbiome 2018, 6, 119. [Google Scholar] [CrossRef]
  22. Callanan, J.; Stockdale, S.R.; Shkoporov, A.; Draper, L.A.; Ross, R.P.; Hill, C. Biases in viral metagenomics-based detection, cataloguing and quantification of bacteriophage genomes in human faeces, a review. Microorganisms 2021, 9, 524. [Google Scholar] [CrossRef] [PubMed]
  23. Haagmans, R.; Charity, O.J.; Baker, D.; Telatin, A.; Savva, G.M.; Adriaenssens, E.M.; Powell, P.P.; Carding, S.R. Assessing Bias and Reproducibility of Viral Metagenomics Methods for the Combined Detection of Faecal RNA and DNA Viruses. Viruses 2025, 17, 155. [Google Scholar] [CrossRef] [PubMed]
  24. Aulicino, P.C.; Kimata, J.T. Beyond surveillance: Leveraging the potential of next generation sequencing in clinical virology. Front. Trop. Dis. 2024, 5, 1512606. [Google Scholar] [CrossRef]
  25. Moustafa, A.; Xie, C.; Kirkness, E.; Biggs, W.; Wong, E.; Turpaz, Y.; Bloom, K.; Delwart, E.; Nelson, K.E.; Venter, J.C.; et al. The blood DNA virome in 8000 humans. PLoS Pathog. 2017, 13, e1006292. [Google Scholar] [CrossRef]
  26. Liu, S.; Huang, S.; Chen, F.; Zhao, L.; Yuan, Y.; Francis, S.S.; Fang, L.; Li, Z.; Lin, L.; Liu, R.; et al. Genomic Analyses from Non-invasive Prenatal Testing Reveal Genetic Associations, Patterns of Viral Infections, and Chinese Population History. Cell 2018, 175, 347–359.e14. [Google Scholar] [CrossRef]
  27. Guo, J.; Huang, X.; Zhang, C.; Huang, P.; Li, Y.; Wen, F.; Wang, X.; Yang, N.; Xu, M.; Bi, Y.; et al. The blood virome of 10,585 individuals from the ChinaMAP. Cell Discov. 2022, 8, 113. [Google Scholar] [CrossRef]
  28. Bovo, S.; Mazzoni, G.; Ribani, A.; Utzeri, V.J.; Bertolini, F.; Schiavo, G.; Fontanesi, L. A viral metagenomic approach on a non-metagenomic experiment: Mining next generation sequencing datasets from pig DNA identified several porcine parvoviruses for a retrospective evaluation of viral infections. PLoS ONE 2017, 12, e0179462. [Google Scholar] [CrossRef]
  29. Bovo, S.; Schiavo, G.; Bolner, M.; Ballan, M.; Fontanesi, L. Mining livestock genome datasets for an unconventional characterization of animal DNA viromes. Genomics 2022, 114, 110312. [Google Scholar] [CrossRef]
  30. Segalés, J.; Domingo, M. Postweaning multisystemic wasting syndrome (PMWS) in pigs. A review. Vet. Q. 2002, 24, 109–124. [Google Scholar] [CrossRef]
  31. Baekbo, P.; Kristensen, C.S.; Larsen, L.E. Porcine circovirus diseases: A review of PMWS. Trans. Emerg. Dis. 2021, 59, 60–67. [Google Scholar] [CrossRef]
  32. Arruda, B.; Piñeyro, P.; Derscheid, R.; Hause, B.; Byers, E.; Dion, K.; Long, D.; Sievers, C.; Tangen, J.; Williams, T. PCV3-associated disease in the United States swine herd. Emerg. Microbes Infect. 2019, 8, 684–698. [Google Scholar] [CrossRef] [PubMed]
  33. Dinh, P.X.; Nguyen, M.N.; Nguyen, H.T.; Tran, V.H.; Tran, Q.D.; Dang, K.H.; Vo, D.T.; Le, H.T.; Nguyen, N.T.T.; Nguyen, T.T. Porcine circovirus genotypes and their copathogens in pigs with respiratory disease in southern provinces of Vietnam. Arch. Virol. 2021, 166, 403–411. [Google Scholar] [CrossRef]
  34. Guo, Z.; Ruan, H.; Qiao, S.; Deng, R.; Zhang, G. Co-infection status of porcine circoviruses (PCV2 and PCV3) and porcine epidemic diarrhea virus (PEDV) in pigs with watery diarrhea in Henan province, central China. Microb. Pathog. 2020, 142, 104047. [Google Scholar] [CrossRef]
  35. Segalés, J.; Calsamiglia, M.; Rosell, C.; Soler, M.; Maldonado, J.; Martín, M.; Domingo, M. Porcine reproductive and respiratory syndrome virus (PRRSV) infection status in pigs naturally affected with post-weaning multisystemic wasting syndrome (PMWS) in Spain. Vet. Microbiol. 2002, 85, 23–30. [Google Scholar] [CrossRef]
  36. Ohlinger, V. Porcine cytomegalovirus (PCMV). In Herpesvirus Diseases of Cattle, Horses, and Pigs; Wittmann, G., Ed.; Kluwer Academic Publishers: Boston, MA, USA, 1989; pp. 326–333. [Google Scholar]
  37. Yoon, K.J.; Edington, N. Porcine cytomegalovirus. In Diseases of Swine, 9th ed.; Straw, B.E., Zimmerman, J., D’Allaire, S., Taylor, D.J., Eds.; Blackwell Publishing Company: Ames, IW, USA, 2006; pp. 323–329. [Google Scholar]
  38. Ehlers, B.; Ulrich, S.; Goltz, M. Detection of two novel porcine herpesviruses with high similarity to gammaherpesviruses. J. Gen. Virol. 1999, 80, 971–978. [Google Scholar] [CrossRef]
  39. Ni, J.; Qiao, C.; Han, X.; Han, T.; Kang, W.; Zi, Z.; Cao, Z.; Zhai, X.; Cai, X. Identification and genomic characterization of a novel porcine parvovirus (PPV6) in China. Virol. J. 2014, 11, 203. [Google Scholar] [CrossRef]
  40. Vargas-Bermudez, D.S.; Mogollon, J.D.; Franco-Rodriguez, C.; Jaime, J. The novel porcine parvoviruses: Current state of knowledge and their possible implications in clinical syndromes in pigs. Viruses 2023, 15, 2398. [Google Scholar] [CrossRef]
  41. Segalés, J. Porcine circovirus type 2 (PCV2) infections: Clinical signs, pathology and laboratory diagnosis. Virus Res. 2012, 164, 10–19. [Google Scholar] [CrossRef]
  42. Hijikata, M.; Abe, K.; Win, K.M.; Shimizu, Y.K.; Keicho, N.; Yoshikura, H. Identification of new parvovirus DNA sequence in swine sera from Myanmar. Jpn. J. Infect. Dis. 2002, 54, 244–245. [Google Scholar] [CrossRef]
  43. Kim, K.H.; Bae, J.W. Amplification methods bias metagenomic libraries of uncultured single-stranded and double-stranded DNA viruses. Appl. Environ. Microbiol. 2011, 77, 7663–7668. [Google Scholar] [CrossRef]
  44. Asplund, M.; Kjartansdóttir, K.; Mollerup, S.; Vinner, L.; Fridholm, H.; Herrera, J.; Friis-Nielsen, J.; Hansen, T.; Jensen, R.; Nielsen, I.; et al. Contaminating viral sequences in high-throughput sequencing viromics: A linkage study of 700 sequencing libraries. Clin. Microbiol. Infect. 2019, 25, 1277–1285. [Google Scholar] [CrossRef] [PubMed]
  45. Duan, J.; Keeler, E.; McFarland, A.; Scott, P.; Collman, R.G.; Bushman, F.D. The virome of the kitome: Small circular virus-like genomes in laboratory reagents. Microbiol. Resour. Announc. 2024, 13, e0126123. [Google Scholar] [CrossRef] [PubMed]
  46. Sangiovanni, M.; Granata, I.; Thind, A.S.; Guarracino, M.R. From trash to treasure: Detecting unexpected contamination in unmapped NGS data. BMC Bioinform. 2019, 20, 168. [Google Scholar] [CrossRef]
  47. Bartoszewicz, J.M.; Seidel, A.; Renard, B.Y. Interpretable detection of novel human viruses from genome sequencing data. NAR Genom. Bioinform. 2021, 3, lqab004. [Google Scholar] [CrossRef]
  48. Buggiotti, L.; Cheng, Z.; Wathes, D.C.; GplusE Consortium. Mining the Unmapped Reads in Bovine RNA-Seq Data Reveals the Prevalence of Bovine Herpes Virus-6 in European Dairy Cows and the Associated Changes in Their Phenotype and Leucocyte Transcriptome. Viruses 2020, 12, 1451. [Google Scholar] [CrossRef]
  49. Usman, T.; Hadlich, F.; Demasius, W.; Weikard, R.; Kühn, C. Unmapped reads from cattle RNAseq data: A source for missing and misassembled sequences in the reference assemblies and for detection of pathogens in the host. Genomics 2017, 109, 36–42. [Google Scholar] [CrossRef]
  50. Sachsenröder, J.; Twardziok, S.; Hammerl, J.A.; Janczyk, P.; Wrede, P.; Hertwig, S.; Johne, R. Simultaneous identification of DNA and RNA viruses present in pig faeces using process-controlled deep sequencing. PLoS ONE 2012, 7, e34631. [Google Scholar] [CrossRef]
  51. Ogawa, H.; Taira, O.; Hirai, T.; Takeuchi, H.; Nagao, A.; Ishikawa, Y.; Tuchiya, K.; Nunoya, T.; Ueda, S. Multiplex PCR and multiplex RT-PCR for inclusive detection of major swine DNA and RNA viruses in pigs with multiple infections. J. Virol. Methods 2009, 160, 210–214. [Google Scholar] [CrossRef]
  52. Brunborg, I.M.; Moldal, T.; Jonassen, C.M. Quantitation of porcine circovirus type 2 isolated from serum/plasma and tissue samples of healthy pigs and pigs with postweaning multisystemic wasting syndrome using a TaqMan-based real-time PCR. J. Virol. Methods 2004, 122, 171–178. [Google Scholar] [CrossRef]
  53. Chianini, F.; Majó, N.; Segalés, J.; Domınguez, J.; Domingo, M. Immunohistochemical characterisation of PCV2 associate lesions in lymphoid and non-lymphoid tissues of pigs with natural postweaning multisystemic wasting syndrome (PMWS). Vet. Immunol. Immunopathol. 2003, 94, 63–75. [Google Scholar] [CrossRef]
  54. Zhao, D.; Yang, B.; Yuan, X.; Shen, C.; Zhang, D.; Shi, X.; Zhang, T.; Cui, H.; Yang, J.; Chen, X.; et al. Advanced Research in Porcine Reproductive and Respiratory Syndrome Virus Co-infection With Other Pathogens in Swine. Front. Vet. Sci. 2021, 8, 699561. [Google Scholar] [CrossRef] [PubMed]
  55. Zhang, J.; Wang, P.; Xie, C.; Ha, Z.; Shi, N.; Zhang, H.; Li, Z.; Han, J.; Xie, Y.; Qiu, X.; et al. Synergistic Pathogenicity by Coinfection and Sequential Infection with NADC30-like PRRSV and PCV2 in Post-Weaned Pigs. Viruses 2022, 14, 193. [Google Scholar] [CrossRef] [PubMed]
  56. Vargas-Bermudez, D.S.; Diaz, A.; Polo, G.; Mogollon, J.D.; Jaime, J. Infection and coinfection of porcine-selected viruses (PPV1 to PPV8, PCV2 to PCV4, and PRRSV) in gilts and their associations with reproductive performance. Vet. Sci. 2024, 11, 185. [Google Scholar] [CrossRef]
Figure 1. Graphical representation of the genome coverage of the 12 detected viruses. Vertical red lines denote the genomic boundaries, indicating the start and end of each genome.
Figure 1. Graphical representation of the genome coverage of the 12 detected viruses. Vertical red lines denote the genomic boundaries, indicating the start and end of each genome.
Vetsci 12 00513 g001
Table 1. Summary statistics on the whole-DNA sequencing datasets, including unmapped reads that matched virus sequences.
Table 1. Summary statistics on the whole-DNA sequencing datasets, including unmapped reads that matched virus sequences.
PigSite 1Sequenced ReadsUnmapped Reads (%) 2Reads of Viral Origin (%) 3
S1#1448,120,122666,430 (0.15%)934 (0.140%)
S2#1327,414,198411,866 (0.13%)44 (0.011%)
S3#1315,609,910407,978 (0.13%)10 (0.002%)
S4#2441,166,336521,010 (0.12%)378 (0.073%)
S5#2400,074,484545,276 (0.14%)84 (0.015%)
S6#2334,788,158492,688 (0.15%)22 (0.004%)
S7#3448,195,724606,724 (0.14%)26,720 (4.40%)
S8#3448,168,478572,470 (0.13%)14,618 (2.55%)
S9#3416,512,760526,100 (0.13%)572 (0.108%)
1 describes the production sites where animals were sampled. 2 represents the reads that did not align on the Sus scrofa reference genome (host genome). The percentage is given in relation to the sequenced reads. 3 represents the reads that aligned on the viral sequences in the virus database. The percentage is given in relation to the unmapped reads.
Table 2. The number of reads identified for the 12 viruses for the analysed pigs (S1–S9) raised in different production sites (#1–3) and the estimated depth of viral sequences on the corresponding reference viral genomes.
Table 2. The number of reads identified for the 12 viruses for the analysed pigs (S1–S9) raised in different production sites (#1–3) and the estimated depth of viral sequences on the corresponding reference viral genomes.
Site #1 3Site #2 3Site #3 3
Virus NameAlternative Name 1NCBI IDDepth 2S1S2S3S4S5S6S7S8S9
Suid betaherpesvirus 2Porcine cytomegalovirus (PCMV)NC_0222330.119-40 a-22 b-20 b12 b8 b-
Suid gammaherpesvirus 3Porcine lymphotropic herpesvirus 1 (PLHV1)NC_0382640.172--6 a----6 a72 b
Suid gammaherpesvirus 4Porcine lymphotropic herpesvirus 2 (PLHV2)NC_0382650.593---236-----
Ungulate copiparvovirus 4Porcine parvovirus 6 (PPV6)NC_0238601027------26,616 a14,596 a496 b
Ungulate tetraparvovirus 3Parvovirus YX-2010/CHN CnPPV_JH13; Porcine parvovirus 2 (PPV2)NC_038883; NC_0351805.400186 a--10 b-----
Porcine parvovirus 2-NC_02596511.88394 a--36 b-----
Ungulate tetraparvovirus 2Porcine hokovirus (p-PARV4) PPV3NC_0385462.932---3466----
Porcine partetravirus-NC_0221040.777--14-14----
Ungulate copiparvovirus 2Porcine parvovirus 4 (PPV4)NC_0146650.406------16--
Porcine parvovirus 5-NC_0230201.756------68--
Porcine circovirus 2-NC_0051486.44176--------
Rodent protoparvovirus 1Minute virus of miceNC_0015108.503268 a--24 b-----
1 Alternative name or identified strain according to the information retrieved from the NCBI Viruses resource. 2 The estimated depth of viral sequences reported assumes an equal coverage of the reference genome. 3 Significantly different loads of viral reads for the same virus among different pigs are identified with different superscript letters (a, b; p < 0.01 after Bonferroni correction for multiple testing).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Bovo, S.; Ribani, A.; Schiavo, G.; Taurisano, V.; Bolner, M.; Bertolini, F.; Fontanesi, L. Mining Porcine Blood Whole-DNA Sequencing Datasets to Uncover Pig Viromes: An Exploratory Application to Identify Potential Infecting Agents of an Undefined Disease Outbreak. Vet. Sci. 2025, 12, 513. https://doi.org/10.3390/vetsci12060513

AMA Style

Bovo S, Ribani A, Schiavo G, Taurisano V, Bolner M, Bertolini F, Fontanesi L. Mining Porcine Blood Whole-DNA Sequencing Datasets to Uncover Pig Viromes: An Exploratory Application to Identify Potential Infecting Agents of an Undefined Disease Outbreak. Veterinary Sciences. 2025; 12(6):513. https://doi.org/10.3390/vetsci12060513

Chicago/Turabian Style

Bovo, Samuele, Anisa Ribani, Giuseppina Schiavo, Valeria Taurisano, Matteo Bolner, Francesca Bertolini, and Luca Fontanesi. 2025. "Mining Porcine Blood Whole-DNA Sequencing Datasets to Uncover Pig Viromes: An Exploratory Application to Identify Potential Infecting Agents of an Undefined Disease Outbreak" Veterinary Sciences 12, no. 6: 513. https://doi.org/10.3390/vetsci12060513

APA Style

Bovo, S., Ribani, A., Schiavo, G., Taurisano, V., Bolner, M., Bertolini, F., & Fontanesi, L. (2025). Mining Porcine Blood Whole-DNA Sequencing Datasets to Uncover Pig Viromes: An Exploratory Application to Identify Potential Infecting Agents of an Undefined Disease Outbreak. Veterinary Sciences, 12(6), 513. https://doi.org/10.3390/vetsci12060513

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop