Next Article in Journal
Deciphering the Population Characteristics of Leiqiong Cattle Using Whole-Genome Sequencing Data
Previous Article in Journal
Postmortem Findings in Free-Ranging North American Beavers (Castor canadensis) Reveal Potential Threats to California’s Freshwater Ecosystems
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genome Wide Identification of Structure Variations in Five Italian Turkey Populations

1
Department of Veterinary Medicine, University of Bari Aldo Moro, 70010 Valenzano, Italy
2
Animal Breeding and Genomics, Wageningen University & Research, P.O. Box 338, 6700 AH Wageningen, The Netherlands
3
Department of Animal Production, Faculty of Agriculture, Benha University, Benha 13736, Egypt
4
Department of Biosciences, Biotechnologies and Environment, University of Bari Aldo Moro, 70125 Bari, Italy
*
Author to whom correspondence should be addressed.
Animals 2025, 15(3), 339; https://doi.org/10.3390/ani15030339
Submission received: 16 December 2024 / Revised: 12 January 2025 / Accepted: 22 January 2025 / Published: 24 January 2025
(This article belongs to the Section Animal Genetics and Genomics)

Simple Summary

Structure variations are believed to be a significant part of evolutionary processes and are crucial factors for phenotypic variations. A total of 73 whole genome sequencing of five Italian turkey populations were used to detect structure variations. A comprehensive structure variations catalog was produced involving deletions, duplications, inversions, and translocations. The annotation of structure variations revealed that intron variants, intergenic variants, coding sequence variants, downstream gene variants, and transcript ablations were the most common structure variation outcomes. This structural variant catalog will help in understanding genetic diversity in turkeys and provide valuable insights into the influence of structure variations on economic traits.

Abstract

Structural variants (SVs) are one of the main sources of genetic variants and have a significant impact on phenotype evolution, disease susceptibility, and environmental adaptations. We used 73 whole genome sequencing (12x) to apply a mapping approach to identify SVs in five turkey populations. A notable degree of genetic isolation was observed between the Basilicata and Apulian populations, as indicated by principal component analysis and admixture results. A total of 11,733 SVs were detected, including 6712 deletions, 2671 duplications, 1430 inversions, and 920 translocations. The Variant Effect Predictor (VEP) analysis predicted various consequences of filtered SVs as follows: intron variants (35.8%), intergenic variants (9.6%), coding sequence variants (8.3%), downstream gene variants (7.5%), and transcript ablations (7.3%). Our functional annotation of genes overlapping with SVs was mainly enriched in recognized pathways governing positive regulation of nucleoplasm, protein binding, mitochondrion, negative regulation of cell population proliferation, identical protein binding, and calcium signaling. We produced a comprehensive SV catalog utilizing unique whole-genome turkey data. This SV catalog not only increases our understanding of genetic diversity in turkeys but also enhances our knowledge of the role of SVs in their phenotypic traits.

1. Introduction

Turkeys are considered one of the most important agricultural species. Turkey meat is the second most popular poultry meat worldwide, and there is significant consumer demand for fresh, lower-fat, protein-rich meat [1]. In 2021, turkey meat production reached 6.1 million tons, representing 4% of the world’s poultry meat and 1.6% of the global meat production [2]. Meleagris gallopavo is the genus name used for both domesticated and wild turkeys, and it is currently classified into six subspecies: M. g. gallopavo, M. g. mexicana, M. g. intermedia, M. g. merriami, M. g. silvestris, and M. g. osceola [3]. Turkey domestication began around 1800 BC in southern Mexico and about 200 BC in the American Southwest [4]. Genetic research reveals that the southern Mexican subspecies (M. g. gallopavo) is the ancestor of the domestic turkeys now raised worldwide through documented trade between Europe and the Americas in the 16th century [5]. Turkeys were introduced to Italy in 1520, and their rapid spread, particularly in the southern regions, confirms their integration into Italy’s agricultural and culinary heritage [6,7]. Also, bone remains indicate that they were also used for food starting in 1600, to the point that they are described in cooking recipes of the time [6,8].
Indigenous Italian turkey breeds are a source of genetic diversity that must be preserved and exploited. These native turkey breeds have unique production traits, are resistant to disease, and are adapted to local conditions [9,10]. Election by farmers over the past five centuries has led to significant variation in feather color, body size, and weight among Italian turkey breeds, contributing to their differentiation [10,11,12]. These divergences may have also been influenced by Italy’s geopolitical structure, which was divided into numerous small states with limited product and population exchange, resulting in genetic isolation among turkey populations [13]. Basilicata and Apulian turkeys (BAS-APUs) are sustainable in the Basilicata and Puglia regions of southern Italy. They have different plumage colors, buff and black, with streaks of buff or white. The body weight of adult birds is up to 9 and 4 kg for males and females, respectively. The number of eggs is 50/60 egg per year. BAS-APU populations are rustic and climate-adapted to local conditions.
The detection and genotyping of genetic variations provide a foundation for genetic evaluation. Over the past decade, advancements in genomics have enabled the reliable identification of millions of single nucleotide polymorphisms (SNPs) through whole-genome sequencing (WGS) and high-throughput genotyping using SNP panels [14,15]. SNP array data are commonly used in animal breeding to identify SNPs and detect unbalanced structural variants (SVs), such as copy number variants (CNVs), a subgroup of SVs that includes deletions and duplications. However, array-based SV detection has limitations, such as low resolution and undefined breakpoints [16]. Alternatively, short-read WGS data offers a more accurate method for identifying SVs, including CNVs and balanced SVs like inversions [17].
SVs are genetic rearrangements larger than 50 base pairs (bp) and include deletions, insertions, inversions, duplications, translocations, and CNVs [16]. Numerous novel SVs and CNVs have been associated with phenotypic variation, production traits, and immune responses in farm animals, including turkeys [16,18,19], chickens [20,21,22], sheep [23,24], cattle [25,26,27], and pigs [28,29]. While much of the research has focused on CNVs, small insertions or deletions, and SNPs in turkey populations, the study of larger SVs remains limited despite their potential impact on phenotypes. Therefore, the objective of this study is to identify structural variations across five Italian turkey populations using whole-genome sequencing data.

2. Materials and Methods

2.1. Animals and Sampling

A total of 73 blood samples (about 2 mL) were collected from wing veins of Basilicata (BAS; 46 samples), Apulian M (APU_M; 7 samples), and Apulian PS (APU_PS; 8 samples) in Potenza, Basilicata region and Apulian PN (APU_PN; 7 samples), and Apulian C (APU_C; 5 samples) populations in marginal area of Brindisi and Lecce, Puglia region, Italy (Supplementary File S1: Table S1). Detailed information about these populations and sampling was mentioned in our previous study [30].

2.2. Sequencing and Mapping

DNA extraction and whole genome sequencing (coverage  =  12x) were assessed at Neogen (Ayr, Scotland, UK) using a commercial kit. The Burrows–Wheeler Aligner BWA-MEM v0.7.17 [31] was used to align raw reads to a new turkey reference assembly [Meleagris gallopavo (Turkey)-GCA_905368555.1 (MGAL_WU_HG_1.0)] [32], generating a BAM file for each animal. These BAM files were sorted and indexed with SAMtools v.1.9 [33]. Duplicate reads were detected and eliminated by the Samtools dedup function [33]. Mapping statistics outputs were generated using Qualimap [34].

2.3. Analysis of Population Structure

To indirectly verify the results of the SV detection study, we compared the population structure derived from SVs with those obtained from SNPs. The principal component analysis (PCA) was employed using the PLINK v.1.9 software [35] using all validated SV information and visualized using the R package ggplot2 [36]. The SNP data of our previous manuscript were used to perform PCA for comparison with the PCA of SVs and to perform the admixture genetic analysis [37]. The ADMIXTURE 1.3 genetic analysis software was used to assess the population genome-wide genetic structure among the populations [38], using different numbers of ancestral populations. The most probable number of ancestral populations was identified in conjunction with the lowest cross-validation error (CV), setting the analysis with an optimal number of clusters (K-value) from 2 to 8, and visualized with BITE V2 R package [39].

2.4. SVs Calling and Annotation

We identified SVs using the Smoove pipeline (https://github.com/brentp/smoove, accessed on 21 August 2021). Smoove uses different pieces of software to call and filter SVs, using aligned BAM files and the turkey reference genome GCA_905368555.1 (MGAL_WU_HG_1.0) as inputs. In the first step, the Lumpy software was used to call SVs [40]. In the second step, Svtyper was used to combine all SV calls into a single variant call format (VCF) file [41]. In the third step, Svtyper was also used to genotype SVs at the population level. The pipeline generated a VCF file in which all detected SVs were referenced, and each sample was assigned a genotype for each SV. Duphold was used in the fourth step to add depth information, which can be implemented to filter out false-positive SVs [42].
SVs were annotated using BCFtools [43], with the Ensembl rapid release [Meleagris gallopavo (Turkey)-GCA_905368555.1 (MGAL_WU_HG_1.0)] annotations.

2.5. Post-Filtration and SV Processing

Four types of SVs are discovered with Lumpy: deletions, duplications, inversions, and breakend variants. To reduce the discovery of false-positive SVs further, we used the following criteria: (1) thresholds for the duphold-generated tag-DHFFC- were set to 0.7 and 1.3 for deletion and duplication, respectively, i.e., for deletion, it should be less than 0.7, while, for duplication, it should be greater than 1.3; (2) Breakend variants that cannot be assigned to one of the three classes were removed. (3) The SVs located in the unplaced contigs were excluded, and (4) all the SVs less than 50 bp and greater than 1 Mbp were removed.

2.6. Functional Annotation

The Ensembl Variant Effect Predictor (VEP) software [44] was implemented to annotate the gene content of SVs using the final VCF file post-filtration.

2.7. Gene Ontology (GO) Terms and Kyoto Encyclopedia of Genes and Genomes (KEGG) Analysis

The DAVID software was used to identify GO terms and KEGG pathways of the function candidate genes of SVs. In this study, we used the David 2021 version for GO and KEGG analysis (https://david.ncifcrf.gov/summary.jsp, accessed on 23 March 2022). The top 20 GO terms based on their p-values were visualized using the R package ggplot2 [36].

3. Results

3.1. Population Genetic Structure

The PCA was conducted to assess the population’s genetic structure using SNP and structural variations (Figure 1). PCA1 revealed a clear separation of the BAS population from the APU_C, APU_M, APU_PN, and APU_PS populations. PCA2 showed that the APU_C population was distinct from the other Apulian groups, while the APU_M, APU_PN, and APU_PS populations were closely related, with overlapping between APU_PN and APU_PS. The PCA1 vs. PCA3 separated the APU_C, APU_M, APU_PN, and APU_PS populations (Supplementary File S2: Figure S1). These results demonstrate the high quality of the structural variation data.
An admixture analysis with ancestral components (K) ranging from 2 to 10 was performed to further investigate population structure. The minimum cross-validation (CV) error identified K = 8 as the optimal number of ancestral populations (Supplementary File S2: Figure S2). The admixture analysis results were consistent with PCA findings (Figure 2). At K = 2, the BAS population was separated from the Apulian populations (APU_C, APU_M, APU_PN, APU_PS). At K = 3, the APU_C and APU_M populations were further separated from the APU_PN population. By K = 4, the APU_C population became distinct from the other Apulian groups, and the APU_PS population was separated from APU_PN. At K = 5, a substructure emerged within the BAS population, reflecting a more complex ancestral composition.

3.2. SV Discovery and Characterization

Whole genome sequences (WGSs) of 73 samples representing five turkey populations were analyzed to detect structural variants (SVs). The sequencing depth of these samples ranged from 6.01x to 15.13x, with an average depth of 11.44x (Supplementary File S1: Table S2). Across the five turkey populations, a total of 11,733 SVs were identified, comprising 6712 deletions, 2671 duplications, 1430 inversions, and 920 translocations. The summary statistics for the number and length of these SVs are provided in Table 1. Deletions were the most common type of SV identified. The highest number of deletions and duplications was observed in the 100–1000 bp length range, while most inversions were longer than 10,000 bp.

3.3. Distribution Across Chromosomes

The distribution of SVs across the chromosomes was analyzed using whole genome sequence data. Most SVs were identified on the macro-chromosomes (Figure 3). Chromosome 1 exhibited the highest number of deletions, duplications, inversions, and translocations. Interestingly, the macro-chromosomes (with chromosome lengths > 50 Mb) had more inversions compared to duplications, except for chromosome 4. In contrast, the intermediate and micro-chromosomes generally displayed a higher number of duplications than inversions, with notable exceptions on chromosomes 8, 13, 14, 22, and 33.
A comprehensive analysis of SV density and distribution across chromosomes was performed. The chromosomal distribution and density of all SVs are presented in Figure 4. The SV density in the micro-chromosomes was higher compared to the macro- and intermediate chromosomes in the turkey genome. Notably, micro-chromosomes 34, 32, and 35 exhibited the highest SV density within the turkey genome. Chromosome 18 also shows a high density of structural variants. This chromosome contains the MHC regions known to be enriched for structural variants.

3.4. Distribution of Structure Variations per Animal

The counts of each type of SV per animal are displayed in (Supplementary File S2: Figure S3) and detailed in (Supplementary File S1: Table S3). On average, the number of deletions per animal is 1452, with a range from 1299 to 1580. The mean number of duplications is 527, ranging from 468 to 557, while the average number of inversions is 95, with a range of from 79 to 110 (Supplementary File S1: Table S4).

3.5. Allele Frequency

The deletions and duplications exhibited similar allele frequency spectra, both containing many rare variants. Notably, most of the inversions were rare, with only a small subset reaching an allele frequency of approximately 0.50 (Figure 5).

3.6. Functional Annotation

After applying post-filtration criteria, a total of 6918 SVs were reminded, comprising 5320 deletions, 1418 duplications, and 180 inversions (Supplementary File S1: Table S5). These SVs overlapped with 3387 genes and 7962 transcripts (Table S6). The VEP (Variant Effect Predictor) analysis predicted various consequences of these SVs: 35.8% were associated with intron variants, 9.6% with intergenic variants, 7.5% with downstream gene variants, and 7.3% with transcript ablations (Figure 6). Additionally, 8.3% of the SVs were found to be coding sequence variants in the turkey genome.

3.7. Gene Ontology and Kyoto Encyclopedia of Genes and Genomes

The 766 genes detected from the SVs annotation were used for functional analysis through GO and KEGG enrichment pathways. The GO enrichment analysis revealed 42 significant (p < 0.05) functional pathways (Supplementary File S2: Table S7). The top 20 GO terms, based on their p-values, are illustrated in Figure 7. Among these, the most significant terms are nucleoplasm (GO:0005654), protein binding (GO:0005515), mitochondrion (GO:0005739), negative regulation of cell population proliferation (GO:0008285), identical protein binding (GO:0042802), nucleolus (GO:0005730), and heparin binding (GO:0008201). KEGG enrichment analysis identified two pathways, one of which is significant: the calcium signaling pathway (mgp04020) (Supplementary File S2: Table S7).

4. Discussion

To further investigate the characteristics of structural variants (SVs) across five turkey populations, we conducted population genetic structure analyses. Notably, the BAS population was distinct from the Apulian populations, as shown in PCA1 vs. PCA2, while PCA1 vs. PCA3 separated the Apulian populations from each other in both SVs and SNPs. Together, our PCA results demonstrated that SV genotypes produced a population structure and ancestral components similar to those derived from SNP genotypes. The findings revealed that the genetic background of the BAS population differs significantly from that of the APU turkey populations. The close genetic relationship observed among APU populations may be linked to their geographic origins in southern Italy. These results were further supported by the ADMIXTURE analysis, which highlighted distinct background compositions for the APU and BAS populations.
Genome structural variations constitute a significant portion of genomic diversity and play critical roles in phenotypic variation and biological functions [21]. Unlike single nucleotide variations, which are relatively well studied, SVs remain largely underexplored due to limitations in detection methods [26,44]. In this study, we present the first comprehensive whole-genome SV landscape in Italian turkeys, which may be associated with traits such as growth and reproduction, providing valuable insights for future research. We detected 11,733 SVs across the turkey populations based on sequencing data. Among these, deletions accounted for the largest proportion of SVs (57.20%), followed by duplications (22.76%), inversions (12.18%), and translocations (7.84%). The prevalence of deletions aligns with previous studies, suggesting that non-allelic homologous recombination (NAHR) events may generate more deletions than duplications. Additionally, there may be a biological or algorithmic bias favoring the detection of deletions [45]. For example, deletions were more frequent than duplications and inversions in the bovine genome [26]. Zhang et al. [21] found 49,501 SVs in chickens, including 23,817 deletions, 3292 duplications, 20,847 insertions, 407 inversions, and 1138 translocations using PacBio data. Our SV length distribution analysis (Table 1) showed that most deletions (48.22%) and duplications (38.48%) ranged between 100 and 1000 bp, while most inversions (69.30%) exceeded 10,000 bp. All translocations measured between 1 and 50 bp. Consistent with other studies, SVs were not uniformly distributed across the genome [46]. For example, Upadhyay et al. [26] found that the average deletion length (~2.4 Kb) was significantly smaller than duplications (~29.9 Kb) and inversions (~46.4 Kb). Previous studies using next-generation sequencing technology to detect SVs in Xiang pigs [47], chickens [48], and cattle [15,49] identified only a few SVs larger than 10 Kb.
The influence of structural variants (SVs) on the genome and their distribution patterns across chromosomes can provide important insights into genetic diversity. In this study, the number of SVs per chromosome ranged from 1308 to 18. Macro-chromosomes exhibited the highest number of SVs, with deletions accounting for the majority, while duplications, inversions, and translocations made up a smaller proportion (Figure 3). Interestingly, micro-chromosomes displayed a higher density of SVs compared to macro-chromosomes, where structural variation was notably lower. In cattle, Boussaha et al. [25] found that Chr12 contained the highest percentage of SVs (approximately 7% of the total), followed by ChrX (5.5%) and Chr23 (5%). No clear relationship between SV types and chromosomal distributions was observed. For the macro-chromosomes (from Chr1 to Chr5), the proportion of total SVs was 21.7%, 16.2%, 9.8%, 7.6%, and 4.3%, respectively, in chickens [21]. Zhang et al. [21] also noted that SVs were more densely distributed on micro-chromosomes than on macro-chromosomes, with one SV every 2.94 kb on chromosome 30 and every 2.06 kb on chromosome 32. Similarly, Qiao et al. [24] observed that the highest number of SVs and their distribution in the sheep genome occurred on chromosomes 1, 2, and 3.
The range of SVs per animal was from 1887 to 2201, showing relatively little variation among individuals. In comparison, a study by Upadhyay et al. [26] identified a broader range of SVs per individual in cattle, from 2502 to 7164, highlighting greater differences across the cattle genome.
The allele frequency of deletions and duplications is skewed towards lower frequencies, with the majority falling between 0.0 and 0.5. This distribution is similar to findings in cattle, where Lee et al. [15] observed that deletions and duplications exhibited comparable allele frequency spectra, with many rare variants present. In particular, most duplications were uncommon, with only a few reaching an allele frequency of approximately 0.25. Similarly, [29] reported that, in pigs, the proportion of rare SVs (allele count, AC = 1 to 10; allele frequency, AF < 0.01) was higher than that of singleton SVs (AC = 1). Most SVs were classified as common (AC > 10, AF > 0.01), suggesting a lower level of diversity in these pig populations. This lower diversity in common SVs may contribute to the smaller percentages of singleton and rare SVs observed in these animals.
In this study, the most prevalent consequences for SVs in this study were intron variants (9994) coding sequence variants (2323), and transcript ablations (2033). In the cattle genome, 361 SVs caused ablation and 482 caused amplification, affecting entire transcript features [26]. Additionally, 790 SVs were found in the coding regions of the cattle genome. In total, 3078 SVs (2.4%) overlapped with 3080 (15.9%) protein-coding genes, according to Yang et al. [29]. This is much lower than the number of SVs found in noncoding regions (8174 SVs, 6.3%), intergenic regions (62,871 SVs, 48.2%), and introns (57,179 SVs, 43.8%, affecting 3307 or 17.1% of genes). Only 749 SVs (0.6%) were located within exons, mostly in the 3′ regulatory regions (3′-UTR) of 595 genes (0.5%), suggesting that SVs could potentially alter gene function by disrupting these regions.
Gene Ontology (GO) enrichment analysis showed that some SVs related to genes in the turkey populations were enriched in pathways for biological processes, molecular function, and cellular components. These pathways included nucleoplasm, protein binding, mitochondrion, negative regulation of cell population proliferation, and identical protein binding, while KEGG pathway enrichment revealed significant association with the calcium signaling pathway. Similarly, Qiao et al. [24] reported that specific SV-related genes were enriched in plasma membrane adhesion molecules and the calcium-signaling pathway of the GO and KEGG pathways, respectively, in Southdown sheep.

5. Conclusions

This study offers a deeper understanding of structural variations in turkeys and presents an SV catalog comprising 11,733 SVs identified through whole-genome sequencing of turkeys. The identified structural variations in turkeys may impact the function of associated genes, potentially influencing key production traits such as nucleoplasm, protein binding, mitochondrion, negative regulation of cell population proliferation, identical protein binding, and calcium signaling pathway. Therefore, our genome-wide SV catalog for turkeys provides valuable insights into the possible genomic basis of economically important traits in turkeys.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/ani15030339/s1: Figure S1: Principal component analysis (PCA) of the whole genome sequenced individuals. (a) PCA1 vs. PCA3 plot based on SNPs. (b) PCA1 vs. PCA3 plot based on structural variations. Apulian C (APU_C), Apulian M (APU_M), Apulian PN (APU_PN), Apulian PS (APU_PS), and Basilicata (BAS). Figure S2: Cross validation error. Figure S3: Distribution of structure variations per animal, (A) Deletions, (B) Duplications and (C) Inversions; Table S1: Detailed sample information on sequenced birds; Table S2: NGS data reads mapping; Table S3: Basic statistics of the different types of SVs identified in this study; Table S4: Summary statistics for SVs; Table S5: Post filtration structure variations detected in this study, in VCF format.; Table S6: Functional annotation of post-filtered SVs in this study; Table S7: GO terms and KEGG pathways of all SVs.

Author Contributions

M.S.S. and V.L. conceived and designed this study. A.C., M.S.S., G.C. and P.D.P. collected the data. M.S.S., M.F.L.D., M.A.M.G. and V.L. performed bioinformatics analysis and visualized the results. M.S.S. wrote the original draft. M.F.L.D., M.A.M.G., V.L., P.D.P., E.C. (Elena Ciani), N.P., E.C. (Elena Circella), G.C. and A.C. reviewed and edited the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the BIO.ZOO.CARE project no. 04250180603: “PSR Puglia 2014–2022. Misura 10–Pagamenti agro-climatico-ambientali. Sottomisura 10.2–Operazione 10.2.1”.

Institutional Review Board Statement

All experimental procedures involving the handling of animals were approved by the Ethical Committee for Animal Experimentation of the Department of Veterinary Medicine, University of Bari Aldo Moro, Italy Prot. N. 2999–III/13 (Approval Number: 19/24).

Informed Consent Statement

Not applicable.

Data Availability Statement

Whole genome data are available at https://www.ncbi.nlm.nih.gov/sra/PRJNA1197539, (accessed on 12 December 2024).

Acknowledgments

We are grateful to the Department of Animal Breeding and Genomics, Wageningen University & Research for using the High-Performance Compute cluster at Wageningen University & Research to perform the genomic analysis of our data.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Baéza, E.; Guillier, L.; Petracci, M. Review: Production Factors Affecting Poultry Carcass and Meat Quality Attributes. Animal 2022, 16, 100331. [Google Scholar] [CrossRef] [PubMed]
  2. FAO FAOSTATA [Internet]. 2023. Available online: https://www.fao.org/faostat/en/#data/QCL/visualize (accessed on 22 June 2023).
  3. Thornton, E.K. Introduction to the Special Issue—Turkey Husbandry and Domestication: Recent Scientific Advances. J. Archaeol. Sci. Rep. 2016, 10, 514–519. [Google Scholar] [CrossRef]
  4. Speller, C.F.; Kemp, B.M.; Wyatt, S.D.; Monroe, C.; Lipe, W.D.; Arndt, U.M.; Yang, D.Y. Ancient Mitochondrial DNA Analysis Reveals Complexity of Indigenous North American Turkey Domestication. Proc. Natl. Acad. Sci. USA 2010, 107, 2807–2812. [Google Scholar] [CrossRef] [PubMed]
  5. Monteagudo, L.V.; Avellanet, R.; Azón, R.; Tejedor, M.T. Mitochondrial DNA Analysis in Two Heritage European Breeds Confirms Mesoamerican Origin and Low Genetic Variability of Domestic Turkey. Anim. Genet. 2013, 44, 786. [Google Scholar] [CrossRef] [PubMed]
  6. De Grossi Mazzorin, I.; Epifani, I. Introduzione e Diffusione in Italia Di Animali Esotici Dal Nuovo Mondo: Il Caso Del Tacchino (Meleagris gallopavo L.). L’Idomeneo 2016, 2015, 55–74. [Google Scholar]
  7. Maltin, E.; Jakobsson, H. The Introduction of the Turkey (Meleagris gallopavo) in Early Modern Sweden–Historical and Zooarchaeological Evidence of Husbandry and Consumption. Postmediev Archaeol. 2023, 57, 1–28. [Google Scholar] [CrossRef]
  8. Eiche, S. Presenting the Turkey: The Fabulous Story of a Flamboyant and Flavourful Bird; Centro Di: Florence, Italy, 2004; ISBN 8870384144. [Google Scholar]
  9. Bernini, F.; Bagnato, A.; Marelli, S.P.; Zaniboni, L.; Cerolini, S.; Strillacci, M.G. Genetic Diversity and Identification of Homozygosity-Rich Genomic Regions in Seven Italian Heritage Turkey (Meleagris gallopavo) Breeds. Genes 2021, 12, 1342. [Google Scholar] [CrossRef]
  10. Marelli, S.P.; Zaniboni, L.; Strillacci, M.G.; Madeddu, M.; Cerolini, S. Morphological Characterization of Two Light Italian Turkey Breeds. Animals 2022, 12, 571. [Google Scholar] [CrossRef]
  11. Marelli, S.P.; Colombo, E.; Mangiagalli, M.G.; Zaniboni, L.; Cassinelli, C.; Cerolini, S.; Guidobono Cavalchini, L. Phenotypical Characterization of Some Italian Turkey (Meleagris gallopavo) Breeds. 2009. Available online: https://www.semanticscholar.org/paper/Phenotypical-characterization-of-some-Italian-Marelli-Colombo/eb04b909f5d4b5bb2bbf51c8f5400a2ebf787030 (accessed on 1 January 2009).
  12. Colli, L.; Negrini, R.; Ajmone Marsan, P.; Consortium, G. Marcatori Molecolari, Genoma e Risorse Genetiche Animali. Fondazione Iniziative Zootecniche. La Salvaguardia della Biodiversita Animale. Iniziative Generali ed Azioni Intraprese in Italia a Tutela delle Razze Minacciate; Fondazione Iniziative Zooprofilattiche e Zootecniche: Brescia, Italy, 2011. [Google Scholar]
  13. Strillacci, M.G.; Gorla, E.; Ríos-Utrera, A.; Vega-Murillo, V.E.; Montaño-Bermudez, M.; Garcia-Ruiz, A.; Cerolini, S.; Román-Ponce, S.I.; Bagnato, A. Copy Number Variation Mapping and Genomic Variation of Autochthonous and Commercial Turkey Populations. Front. Genet. 2019, 10, 982. [Google Scholar] [CrossRef]
  14. Makanjuola, B.O.; Miglior, F.; Abdalla, E.A.; Maltecca, C.; Schenkel, F.S.; Baes, C.F. Effect of Genomic Selection on Rate of Inbreeding and Coancestry and Effective Population Size of Holstein and Jersey Cattle Populations. J. Dairy. Sci. 2020, 103, 5183–5199. [Google Scholar] [CrossRef]
  15. Lee, Y.-L.; Bosse, M.; Takeda, H.; Moreira, G.C.M.; Karim, L.; Druet, T.; Oget-Ebrad, C.; Coppieters, W.; Veerkamp, R.F.; Groenen, M.A.M.; et al. High-Resolution Structural Variants Catalogue in a Large-Scale Whole Genome Sequenced Bovine Family Cohort Data. BMC Genom. 2023, 24, 225. [Google Scholar] [CrossRef] [PubMed]
  16. Alkan, C.; Coe, B.P.; Eichler, E.E. Genome Structural Variation Discovery and Genotyping. Nat. Rev. Genet. 2011, 12, 363–376. [Google Scholar] [CrossRef] [PubMed]
  17. Sudmant, P.H.; Rausch, T.; Gardner, E.J.; Handsaker, R.E.; Abyzov, A.; Huddleston, J.; Zhang, Y.; Ye, K.; Jun, G.; Hsi-Yang Fritz, M. An Integrated Map of Structural Variation in 2504 Human Genomes. Nature 2015, 526, 75–81. [Google Scholar] [CrossRef]
  18. Aslam, M.L.; Bastiaansen, J.W.M.; Megens, H.-J.; Crooijmans, R.P.M.A.; Nasreen, F.; Blomberg, L.A.; Van Tassell, C.P.; Sonstegard, T.S.; Schroeder, S.G.; Groenen, M.A.M.; et al. Genome-Wide Candidate Regions for Selective Sweeps Revealed through Massive Parallel Sequencing of DNA across Ten Turkey Populations. BMC Genet. 2014, 15, 117. [Google Scholar] [CrossRef]
  19. Strillacci, M.G.; Marelli, S.P.; Milanesi, R.; Zaniboni, L.; Punturiero, C.; Cerolini, S. Copy Number Variants in Four Italian Turkey Breeds. Animals 2021, 11, 391. [Google Scholar] [CrossRef]
  20. Fernandes, A.C.; da Silva, V.H.; Goes, C.P.; Moreira, G.C.M.; Godoy, T.F.; Ibelli, A.M.G.; Peixoto, J.D.O.; Cantão, M.E.; Ledur, M.C.; de Rezende, F.M. Genome-Wide Detection of CNVs and Their Association with Performance Traits in Broilers. BMC Genom. 2021, 22, 1–18. [Google Scholar] [CrossRef]
  21. Zhang, J.; Nie, C.; Li, X.; Zhao, X.; Jia, Y.; Han, J.; Chen, Y.; Wang, L.; Lv, X.; Yang, W. Comprehensive Analysis of Structural Variants in Chickens Using PacBio Sequencing. Front. Genet. 2022, 13, 971588. [Google Scholar] [CrossRef]
  22. Cendron, F.; Cassandro, M.; Penasa, M. Genome-Wide Investigation to Assess Copy Number Variants in the Italian Local Chicken Population. J. Anim. Sci. Biotechnol. 2024, 15, 2. [Google Scholar] [CrossRef]
  23. Li, R.; Gong, M.; Zhang, X.; Wang, F.; Liu, Z.; Zhang, L.; Yang, Q.; Xu, Y.; Xu, M.; Zhang, H. A Sheep Pangenome Reveals the Spectrum of Structural Variations and Their Effects on Tail Phenotypes. Genome Res. 2023, 33, 463–477. [Google Scholar] [CrossRef]
  24. Qiao, G.; Xu, P.; Guo, T.; He, X.; Yue, Y.; Yang, B. Genome-wide Detection of Structural Variation in Some Sheep Breeds Using Whole-genome Long-read Sequencing Data. J. Anim. Breed. Genet. 2024, 141, 403–414. [Google Scholar] [CrossRef]
  25. Boussaha, M.; Esquerré, D.; Barbieri, J.; Djari, A.; Pinton, A.; Letaief, R.; Salin, G.; Escudié, F.; Roulet, A.; Fritz, S. Genome-Wide Study of Structural Variants in Bovine Holstein, Montbéliarde and Normande Dairy Breeds. PLoS ONE 2015, 10, e0135931. [Google Scholar] [CrossRef] [PubMed]
  26. Upadhyay, M.; Derks, M.F.L.; Andersson, G.; Medugorac, I.; Groenen, M.A.M.; Crooijmans, R.P.M.A. Introgression Contributes to Distribution of Structural Variations in Cattle. Genomics 2021, 113, 3092–3102. [Google Scholar] [CrossRef] [PubMed]
  27. Braga, L.G.; Chud, T.C.S.; Watanabe, R.N.; Savegnago, R.P.; Sena, T.M.; do Carmo, A.S.; Machado, M.A.; Panetto, J.C.d.C.; da Silva, M.V.G.B.; Munari, D.P. Identification of Copy Number Variations in the Genome of Dairy Gir Cattle. PLoS ONE 2023, 18, e0284085. [Google Scholar] [CrossRef]
  28. Zong, W.; Wang, J.; Zhao, R.; Niu, N.; Su, Y.; Hu, Z.; Liu, X.; Hou, X.; Wang, L.; Wang, L. Associations of Genome-Wide Structural Variations with Phenotypic Differences in Cross-Bred Eurasian Pigs. J. Anim. Sci. Biotechnol. 2023, 14, 136. [Google Scholar] [CrossRef]
  29. Yang, L.; Yin, H.; Bai, L.; Yao, W.; Tao, T.; Zhao, Q.; Gao, Y.; Teng, J.; Xu, Z.; Lin, Q. Mapping and Functional Characterization of Structural Variation in 1060 Pig Genomes. Genome Biol. 2024, 25, 116. [Google Scholar] [CrossRef]
  30. Saleh, M.S.; Landi, V.; Derks, M.F.L.; Centoducati, G.; Groenen, M.A.M.; De Palo, P.; Ciani, E.; Strillacci, M.G.; Bagnato, A.; Pugliese, N.; et al. Genome-Wide Analyses of Genomic Diversity, Population Structure and Selection Signatures in Italian Turkey Populations. Poult. Sci. 2025, 104, 104543. [Google Scholar] [CrossRef]
  31. Li, H.; Durbin, R. Fast and Accurate Short Read Alignment with Burrows–Wheeler Transform. Bioinformatics 2009, 25, 1754–1760. [Google Scholar] [CrossRef]
  32. Barros, C.P.; Derks, M.F.L.; Mohr, J.; Wood, B.J.; Crooijmans, R.P.M.A.; Megens, H.-J.; Bink, M.C.A.M.; Groenen, M.A.M. A New Haplotype-Resolved Turkey Genome to Enable Turkey Genetics and Genomics Research. Gigascience 2023, 12, giad051. [Google Scholar] [CrossRef]
  33. Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Durbin, R. Subgroup, 1000 Genome Project Data Processing the Sequence Alignment/Map Format and SAMtools. Bioinformatics 2009, 25, 2078–2079. [Google Scholar] [CrossRef]
  34. Okonechnikov, K.; Conesa, A.; García-Alcalde, F. Qualimap 2: Advanced Multi-Sample Quality Control for High-Throughput Sequencing Data. Bioinformatics 2016, 32, 292–294. [Google Scholar] [CrossRef]
  35. Chang, C.C.; Chow, C.C.; Tellier, L.C.A.M.; Vattikuti, S.; Purcell, S.M.; Lee, J.J. Second-Generation PLINK: Rising to the Challenge of Larger and Richer Datasets. Gigascience 2015, 4, s13742-015. [Google Scholar] [CrossRef] [PubMed]
  36. Wickham, H.; Chang, W.; Wickham, M.H. Package ‘Ggplot2.’ Create Elegant Data Visualisations Using the Grammar of Graphics, Version 2; MIT: Cambridge, MA, USA, 2016; pp. 1–189. [Google Scholar]
  37. Saleh, M.S.; Landi, V.; Derks, M.F.L.; Centoducati, G.; Groenen, M.A.M.; De Palo, P.; Ciani, E.; Pugliese, N.; Circella, E.; Camarda, A. Genomic Scans for Selection and Runs of Homozygosity in Southern Italian Turkey Populations. Poult. Sci. 2025, 104, 104750. [Google Scholar] [CrossRef] [PubMed]
  38. Alexander, D.H.; Novembre, J.; Lange, K. Fast Model-Based Estimation of Ancestry in Unrelated Individuals. Genome Res. 2009, 19, 1655–1664. [Google Scholar] [CrossRef] [PubMed]
  39. Milanesi, M.; Capomaccio, S.; Vajana, E.; Bomba, L.; Garcia, J.F.; Ajmone-Marsan, P.; Colli, L. BITE: An R Package for Biodiversity Analyses 2017. Available online: https://github.com/marcomilanesi/BITE (accessed on 29 August 2017).
  40. Layer, R.M.; Chiang, C.; Quinlan, A.R.; Hall, I.M. LUMPY: A Probabilistic Framework for Structural Variant Discovery. Genome Biol. 2014, 15, 1–19. [Google Scholar] [CrossRef]
  41. Chiang, C.; Layer, R.M.; Faust, G.G.; Lindberg, M.R.; Rose, D.B.; Garrison, E.P.; Marth, G.T.; Quinlan, A.R.; Hall, I.M. SpeedSeq: Ultra-Fast Personal Genome Analysis and Interpretation. Nat. Methods 2015, 12, 966–968. [Google Scholar] [CrossRef]
  42. Pedersen, B.S.; Quinlan, A.R. Duphold: Scalable, Depth-Based Annotation and Curation of High-Confidence Structural Variant Calls. Gigascience 2019, 8, giz040. [Google Scholar] [CrossRef]
  43. Danecek, P.; McCarthy, S.A. BCFtools/Csq: Haplotype-Aware Variant Consequences. Bioinformatics 2017, 33, 2037–2039. [Google Scholar] [CrossRef]
  44. McLaren, W.; Gil, L.; Hunt, S.E.; Riat, H.S.; Ritchie, G.R.S.; Thormann, A.; Flicek, P.; Cunningham, F. The Ensembl Variant Effect Predictor. Genome Biol. 2016, 17, 122. [Google Scholar] [CrossRef]
  45. Bertolotti, A.C.; Layer, R.M.; Gundappa, M.K.; Gallagher, M.D.; Pehlivanoglu, E.; Nome, T.; Robledo, D.; Kent, M.P.; Røsæg, L.L.; Holen, M.M. The Structural Variation Landscape in 492 Atlantic Salmon Genomes. Nat. Commun. 2020, 11, 5176. [Google Scholar] [CrossRef]
  46. Turner, D.J.; Miretti, M.; Rajan, D.; Fiegler, H.; Carter, N.P.; Blayney, M.L.; Beck, S.; Hurles, M.E. Germline Rates of de Novo Meiotic Deletions and Duplications Causing Several Genomic Disorders. Nat. Genet. 2008, 40, 90–95. [Google Scholar] [CrossRef]
  47. Liao, Z.; Zhang, X.; Zhang, S.; Lin, Z.; Zhang, X.; Ming, R. Structural Variations in Papaya Genomes. BMC Genom. 2021, 22, 335. [Google Scholar] [CrossRef] [PubMed]
  48. Zhou, R.; Li, S.; Yao, W.; Xie, C.; Chen, Z.; Zeng, Z.; Wang, D.; Xu, K.; Shen, Z.; Mu, Y. The Meishan Pig Genome Reveals Structural Variation-mediated Gene Expression and Phenotypic Divergence Underlying Asian Pig Domestication. Mol. Ecol. Resour. 2021, 21, 2077–2092. [Google Scholar] [CrossRef] [PubMed]
  49. Kerstens, H.H.D.; Crooijmans, R.P.M.A.; Dibbits, B.W.; Vereijken, A.; Okimoto, R.; Groenen, M.A.M. Structural Variation in the Chicken Genome Identified by Paired-End Next-Generation DNA Sequencing of Reduced Representation Libraries. BMC Genom. 2011, 12, 94. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Principal component analysis (PCA) of the whole genome sequenced individuals. (a) PCA1 vs. PCA2 plot based on SNPs. (b) PCA1 vs. PCA2 plot based on structural variations. Apulian C (APU_C), Apulian M (APU_M), Apulian PN (APU_PN), Apulian PS (APU_PS), and Basilicata (BAS).
Figure 1. Principal component analysis (PCA) of the whole genome sequenced individuals. (a) PCA1 vs. PCA2 plot based on SNPs. (b) PCA1 vs. PCA2 plot based on structural variations. Apulian C (APU_C), Apulian M (APU_M), Apulian PN (APU_PN), Apulian PS (APU_PS), and Basilicata (BAS).
Animals 15 00339 g001
Figure 2. Admixture circle plot from K = 2 to 8 clusters for five Italian turkey populations. Apulian C (APU_C), Apulian M (APU_M), Apulian PN (APU_PN), Apulian PS (APU_PS), and Basilicata (BAS).
Figure 2. Admixture circle plot from K = 2 to 8 clusters for five Italian turkey populations. Apulian C (APU_C), Apulian M (APU_M), Apulian PN (APU_PN), Apulian PS (APU_PS), and Basilicata (BAS).
Animals 15 00339 g002
Figure 3. Distribution of discovered SVs per chromosome. Deletions (DELs), duplications (DUPs), inversions (INVs), and translocations (TRAs).
Figure 3. Distribution of discovered SVs per chromosome. Deletions (DELs), duplications (DUPs), inversions (INVs), and translocations (TRAs).
Animals 15 00339 g003
Figure 4. Chromosomal distribution of structure variations density on turkey genome. Chromosomes are split between macrochromosomes (>50 Mbp), intermediate chromosomes (>20 Mbp < 50 Mbp), and microchromosomes (<20 Mbp).
Figure 4. Chromosomal distribution of structure variations density on turkey genome. Chromosomes are split between macrochromosomes (>50 Mbp), intermediate chromosomes (>20 Mbp < 50 Mbp), and microchromosomes (<20 Mbp).
Animals 15 00339 g004
Figure 5. Derived allele frequency of structure variations. Deletions (DELs), duplications (DUPs), and inversions (INVs).
Figure 5. Derived allele frequency of structure variations. Deletions (DELs), duplications (DUPs), and inversions (INVs).
Animals 15 00339 g005
Figure 6. Distribution of structure variations situated in different genomic regions.
Figure 6. Distribution of structure variations situated in different genomic regions.
Animals 15 00339 g006
Figure 7. The gene ontology (GO) enrichment analysis of candidate genes. The top 20 enriched GO terms, ranked by p-value, were plotted. Biological processes, molecular functions, and cellular components are visualized by different shapes.
Figure 7. The gene ontology (GO) enrichment analysis of candidate genes. The top 20 enriched GO terms, ranked by p-value, were plotted. Biological processes, molecular functions, and cellular components are visualized by different shapes.
Animals 15 00339 g007
Table 1. The length distribution of structural variations by type.
Table 1. The length distribution of structural variations by type.
Length/TypeDeletionsDuplicationsInversions
0–50 bp274013
50–100 bp1062011
100–1000 bp32371028142
1000–10,000 bp1581984273
>10,000 bp558659991
Total671226711430
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Saleh, M.S.; Landi, V.; Derks, M.F.L.; Centoducati, G.; Groenen, M.A.M.; De Palo, P.; Ciani, E.; Pugliese, N.; Circella, E.; Camarda, A. Genome Wide Identification of Structure Variations in Five Italian Turkey Populations. Animals 2025, 15, 339. https://doi.org/10.3390/ani15030339

AMA Style

Saleh MS, Landi V, Derks MFL, Centoducati G, Groenen MAM, De Palo P, Ciani E, Pugliese N, Circella E, Camarda A. Genome Wide Identification of Structure Variations in Five Italian Turkey Populations. Animals. 2025; 15(3):339. https://doi.org/10.3390/ani15030339

Chicago/Turabian Style

Saleh, Medhat S., Vincenzo Landi, Martijn F. L. Derks, Gerardo Centoducati, Martien A. M. Groenen, Pasquale De Palo, Elena Ciani, Nicola Pugliese, Elena Circella, and Antonio Camarda. 2025. "Genome Wide Identification of Structure Variations in Five Italian Turkey Populations" Animals 15, no. 3: 339. https://doi.org/10.3390/ani15030339

APA Style

Saleh, M. S., Landi, V., Derks, M. F. L., Centoducati, G., Groenen, M. A. M., De Palo, P., Ciani, E., Pugliese, N., Circella, E., & Camarda, A. (2025). Genome Wide Identification of Structure Variations in Five Italian Turkey Populations. Animals, 15(3), 339. https://doi.org/10.3390/ani15030339

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop