Next Article in Journal
Reference Intervals for Biochemical Analytes in Clinically Healthy Adult Lusitano Horses
Previous Article in Journal
Transcriptomic Insights into the Protective Effects of Apigenin and Sodium Butyrate on Jejunal Oxidative Stress in Ducks
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genomic Analysis of Reproductive Trait Divergence in Duroc and Yorkshire Pigs: A Comparison of Mixed Models and Selective Sweep Detection

College of Animal Science, Jilin University, Changchun 130062, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Vet. Sci. 2025, 12(7), 657; https://doi.org/10.3390/vetsci12070657
Submission received: 26 May 2025 / Revised: 23 June 2025 / Accepted: 5 July 2025 / Published: 11 July 2025
(This article belongs to the Special Issue Future Perspectives in Pig Reproductive Biotechnology)

Simple Summary

This study investigated genetic differences related to reproductive traits in Duroc and Yorkshire (Dutch Large White) pigs using two approaches: linear mixed models (LMM) with additive and dominant effects, and selective sweep analysis. The first principal component (PC1) was used as a simulated phenotype to capture population variance. A comparative analysis revealed that 587 SNPs and 171 genes were uniquely identified by the LMM + ADDO framework, not overlapping with the top 5% FST and the top and bottom 5% of θπ values. Some key genes like HSPG2, KAT6B, and SAMD8 were linked to litter size, while DLGAP1 and MYOM1 were associated with teat number traits. This integration of additive/dominant modeling and population genetics provided novel insights into the genetic architecture of reproductive traits in pigs, highlighting the value of complementary approaches for understanding complex trait inheritance.

Abstract

This study aimed to investigate population genetic differences related to reproductive traits between Duroc and Yorkshire (Dutch Large White) pigs using two approaches: linear mixed models that dissect additive and dominant effects, and selective sweep analysis. (1) Methods: Genome-wide single-nucleotide polymorphism (SNP) data of 3917 Duroc and 3217 Yorkshire pigs were analyzed. The first principal component (PC1) was used as a simulated phenotype to capture population-level variance. Additive and dominant genetic effects were partitioned and evaluated by using the combination of the linear mixed models (LMM) and ADDO’s algorithm (LMM + ADDO). In parallel, selective sweep signals were detected using fixation index (FST) and nucleotide diversity (θπ) analyses. A comparative assessment was then conducted between the LMM + ADDO and the selective sweep analysis results. Significant loci were annotated using quantitative trait loci (QTL) databases and the Ensembl genome browser. (2) Results: There are 39040 SNPs retained after quality control. Using the LMM + ADDO framework with PC1 as a simulated phenotype, a total of 632 significant SNPs and 184 candidate genes were identified. Notably, 587 SNPs and 171 genes were uniquely detected by the LMM + ADDO method and not among loci detected by the top 5% of FST and θπ values. Key candidate genes associated with litter size included HSPG2, KAT6B, SAMD8, and LRMDA, while DLGAP1, MYOM1, and VTI1A were associated with teat number traits. (3) Conclusions: This study demonstrates the power of integrating additive and dominant effect modeling with population genetics approaches for the detection of genomic regions under selection. The findings provide novel insights into the genetic architecture of reproductive traits in pigs and have practical implications for understanding the inheritance of complex traits.

1. Introduction

Genomic selection signatures across populations reflect the effects of natural and artificial selection pressures. Cross-population genome-wide comparisons have proven effective in detecting the genetic architecture underlying phenotypic variation. Studies have identified population-specific selection signatures in pigs accounting for adaptive differentiation and trait-specific genetic variation through whole-genome comparisons, such as between Landrace and Yorkshire breeds [1] and Meishan subpopulations [2].
The fixation index (FST) is a widely used statistic for identifying selection signatures across populations, quantifying allele frequency divergence. FST captures only frequency differences, which correspond to additive effects. However, other genetic effects, e.g., dominance, are important components of the genetic architecture of complex traits, especially those with low heritability, such as reproductive traits in mammals [3]. Linear mixed models (LMMs), as employed in genome-wide association studies (GWAS), allow decomposition of these effects [4] but require individual-level phenotypic data, which are often unavailable in selection signature analyses.
To overcome this limitation, we adopted an EigenGWAS-inspired framework using the first principal component (PC1) of genome-wide SNP variation as a surrogate phenotype. This method, initially proposed by Cortés et al. (2016) [5] and further refined in subsequent studies [6], enables the dissection of genetic effects in the absence of phenotypic data. We further integrated the ADDO (Additive-Dominance Decomposition Orthogonalization) algorithm to efficiently distinguish and visualize additive and dominant contributions to trait variation [6].
We applied this approach to SNP data from two reproductively divergent pig breeds, Duroc and Yorkshire, which serve as typical sire and dam lines in commercial breeding, respectively. Notably, Yorkshire pigs demonstrate significantly higher reproductive output—averaging 35.41 lifetime offspring born alive, compared to 24.83 in Duroc [7]—making them an ideal model for investigating the genetic basis of reproductive performance.

2. Materials and Methods

2.1. Populations and Genotypes

Dataset 1 comprised SNP chip genotypes of 3917 Duroc pigs with 40,535 SNP loci, obtained from Ruan et al. [8]. Dataset 2 included 3217 Dutch Large White pigs (a breed of Yorkshire) with 47,314 SNP loci, sourced from Deng et al. [9]. Variant call format (VCF) files from both datasets were merged using custom scripts. Standard quality control (QC) steps were applied to the merged data: samples with less than 90% SNP call rate were excluded; missing genotypes were imputed using Beagle v5.0 [10]; SNPs with a minor allele frequency (MAF) below 0.05 were removed.

2.2. Dissection of Additive and Dominant Genetic Effects

To capture genetic variance components related to fecundity, we performed a genome-wide dissection of additive and dominance effects. Principal component analysis (PCA) was conducted using GCTA [11]. Because PC1 explained 65.4% of the variance between the two populations, while PC2 accounted for only 19.6%, the first principal component (PC1) was extracted as a surrogate phenotype.
The GWAS was conducted using linear mixed models (LMMs) implemented in the HIBLUP platform [12]. The model is formulated as follows:
y = X b + i = 1 k Z i u i + e
where y represents the vector of phenotypic observations (PC1 values in this study), X is the n by n b design matrix for fixed effects and covariates, b is the vector of their corresponding coefficients, and k denotes the total number of genetic random effects. Z i is the n by n i design matrix for the i t h genetic random effect and u i is the vector of its corresponding genetic effects with a length of n i , which follows the multivariate normality of N ( 0 , K i σ i 2 ) , and k i can be the additive or dominant matrix that is derived from genomic information. The residual term e is the vector of model residuals with a zero mean and a variance of σ e 2 . And the Kinship matrices for each population were calculated by HIBLUP.
Furthermore, following the ADDO algorithm [6], we applied constrained least squares optimization to decompose SNP effects into additive and dominance components using singular value decomposition (SVD). Linear regressions were performed on the genomic data in R (v4.4.1) under three models: a null model (excluding genotype effects), an additive model (A model), and a combined additive-dominance model (AD model). Analyses of variance (ANOVA) were conducted to evaluate the significance of SNP effects within the LMM + ADDO framework. The genome-wide significance threshold was set at p = 1.3 × 10−6, based on a Bonferroni correction (0.05/n, n means the total number of SNPs).

2.3. Selection Sweep Detection

To assess genetic differentiation between Duroc and Yorkshire pigs based on allele frequency differentiation, pairwise FST values were calculated at each SNP site using vcftools (v1.16.0) [13]. In addition, nucleotide diversity (θπ) and the ratio between populations were computed for each population using PLINK (v1.9) [14] (www.cog-genomics.org/plink/1.9/, accessed on 22 October 2024), providing complementary evidence for selection signals.

2.4. Functional Annotation of Significant Loci

For functional annotation of significant loci, we focus on the loci (1) co-identified by different models; (2) detected by dominant effects but not by sweep selection analysis; and (3) subjected to co-selection by both populations by sweep selection analysis. The position relative to functional genes of each SNP, e.g., upstream, downstream, coding region, introns, or intergenic region, was annotated using VEP [15]. Due to the large number of identified loci and genes, we chose to focus mainly on those co-identified loci) for the annotation of genes and the subsequent discussion of the analysis The possible functions of candidate genes were also analyzed by querying major databases (NCBI, Ensembl [16], iswine [17], PigBiobank [18], pig QTLdb [19], etc.) and published studies.
In addition, to verify whether the selected SNPs have effects on proliferation traits in real pig populations, we performed association studies of genotypes with litter size traits in a Dongliao pig population with phenotype records, which has been used in our previous study [20]. The Dongliao pig population contains genotype and Total Born Number (TBN) and Number Born Alive (NBA) phenotype of 723 sows with records of 2499 litters. We performed a multifactor ANOVA to assess the effects of genotype and other environmental factors by R4.4.1 [21].

3. Results

3.1. Additive and Dominant Effect Detection

Following quality control, 38,040 SNPs were retained from the Duroc and Yorkshire pig populations. Principal component analysis (PCA) was performed using these SNPs. The first two components (PC1 and PC2) accounted for a combined 85.4% of the total genetic variance, with PC1 explaining 65.4% and PC2 explaining 19.6%. Clear population stratification was observed along PC1, with Duroc and Yorkshire pigs forming distinct clusters (Supplementary Materials Figure S1). PC1 was thus used as the phenotypic surrogate for downstream analyses.
Three linear regressions were fitted, with the null model (no genotype effects), the A model (which includes additive genetic effects), and the AD model (which includes both additive and dominant effects), respectively. And 630 additive loci and 47 dominant loci were identified at the level of p 1.3 × 10 6 (0.05/n, n denotes the number of SNPs). Notably, 45 out of the 47 dominant loci were also among the additive ones. Manhattan and Q–Q plots of the results are shown in Figure 1, while Figure 1A displays additive effects and Figure 1B dominant effects. The share of additive variance was 75.91% and the share of dominant effect variance was 24.09%. And there are two significant SNPs that were exclusively uniquely identified through dominant effects alone.

3.2. The Result of Selection Sweep Detection

Candidate loci under selection were identified using a composite approach combining FST and nucleotide diversity (θπ) metrics. The Z-score normalization (z(FST)) on the values of FST was computed by using the mean and standard deviation. The πratio metric was calculated as the l o g 2 - transformation ratio of θπ between two populations ( l o g 2   π r a t i o ) , specifically defined as l o g 2 θ π _ D u r o c θ π _ Y o r k s h i r e . The genomic regions with both top 5% z(FST) and l o g 2   ( π r a t i o )   (absolute) values were determined as the selected regions in the two populations, as shown in Figure 2. This screening approach identified 844 loci under selection, with 496 in Duroc and 348 in Yorkshire pigs. The detail information of the results is provided in Supplementary Table S1.

3.3. Comparison of LMM + ADDO Framework with FST Analysis

A total of 44 loci were shared between the selective sweep and additive effect categories, while only one SNP was common across all three categories: selective sweep, additive effect, and dominant effect (Figure 3A). These findings suggest that the selective sweep analysis method, which is based on allele frequency differentiation, primarily captures additive genetic effects. However, it fails to detect the majority of additive loci and completely overlooks loci with dominant effects. The F s t values, along with the p-values for additive and dominant effects across all loci, are provided in Supplementary Materials Table S2.

3.4. Functional Annotation Results of Significant Loci

For the 630 significant SNPs with significant additive effect, a total of 20 loci identified in this study overlapped with known reproduction trait QTLs (Table S2). A total of 173 genes were annotated by Ensembl for loci with only significant additive effects, while 11 were for those with both significant additive and dominant effects (Figure 3B).
Of the 844 loci identified by selection sweep detection, 29 overlapped with known reproduction QTLs. A total of 170 genes were annotated by Ensembl for loci under selection in the Duroc population, and 135 for those in the Yorkshire population. Among them, nine genes were annotated by those loci under selection in both populations: HSPG2, DLGAP1, MYOM1, GALNTL6, KAT6B, SAMD8, LRMDA, PRKG1, and VTI1A. The detail information of VEP annotation results of each SNP, including variant types such as the non-coding transcript exon variant, intergenic variant, intron variant, and regulatory region variant, is shown in Supplementary Materials Table S3. Some variants in enhancer regions were annotated, such as ENSSSCR1_BMTR5 (chr1:90098790), ENSSSCR6_CLGC9 (chr6:165850427), etc., but their specific functions and effects on porcine reproductive traits need to be further investigated.
Figure 4 presents the genotype–phenotype association results for litter size traits in Dongliao black pigs. Due to differences in genotyping platforms between the Dongliao black pig dataset and our study, association analyses were restricted to the five loci shared by both datasets. The locus located on chromosome 2 (chr2:148668028) is the locus with only a significant dominant effect (dom), and the other four loci are the loci that are significant in both additive and dominant effects (add_dom), but not in sweep selection analysis. Heterozygotes (0|1) at locus chr2:148669028 had lower average TNB and NBA compared to the homozygotes (0|0 and 1|1), indicating a potential dominant deleterious effect on litter size. At the other four loci, homozygous (1|1) individuals showed significantly reduced TNB and NBA compared to both heterozygous (0|1) and wild-type homozygous (0|0) genotypes, demonstrating combined additive and dominant genetic effects.
Notably, our LMM + ADDO framework identified some known reproduction traits related to QTLs, which were not identified by selection sweep detection (Table 1).

4. Discussion

We hypothesize that dominant genetic effects may play a significant role in shaping the phenotypes of traits with low heritability, such as reproduction characteristics. When aiming to detect dominant effects between populations in genome-wide selection signature analysis, the markers identified as having dominant effects are primarily loci where one population predominantly exhibits heterozygous genotypes, while the other population predominantly exhibits homozygous genotypes. These heterozygous loci can often also be detected by additive and FST analyses, though their significance is typically lower compared to loci with distinct homozygote genotypes in different populations. Therefore, when exploring genetic selection signatures between populations, there is a risk of overlooking these heterozygous loci, which may nonetheless play a crucial role in understanding the genetic basis of dominant effects and heterosis. Extensive evidence supports the importance of incorporating dominant effects in genomic selection. Studies have demonstrated that dominant models improve prediction accuracy for crossbreeding performance using purebred data [28,29] and outperform additive models [30]. Specifically, Sun et al. [31] reported superior prediction for dairy yield traits when combining additive and dominant effects compared to additive-only models. Dominance variance accounts for approximately 13% of total genetic variance for litter size in crossbred populations [32], with dominant effects explaining 11–20% of variance for total number born (TNB) compared to just 2–11% for backfat thickness [33]. These findings collectively highlight the necessity of including dominant effects in pig breeding programs to optimize genetic gain.
To further investigate this, we collected genome-wide SNP data from two commercial pig breeds: Duroc and Yorkshire pigs. In commercial pig breeds, the Yorkshire pig breed has a better performance in reproduction traits than the Duroc pig breed. Yorkshire pigs are known for their superior reproduction performance compared to Duroc pigs, which is why they are typically used as dam lines in pig production, whereas Duroc pigs are often utilized as sire lines. The PCA revealed clear genetic differentiation between the two breeds. Thus, using the PC1 as the phenotype, we investigated both additive and dominant effects through the LMM + ADDO framework to explore the genetic basis of reproduction traits. In this study, a total of 632 significant SNPs and 184 genes were identified. Compared with the FST analysis, 587 SNPs and 171 genes were newly detected. The FST analysis revealed loci selected by Duroc and Yorkshire pigs separately. The analysis results in Duroc and Yorkshire populations demonstrated that both additive and dominant QTLs were identified by LMM + ADDO, with superior performance compared to the FST method. Furthermore, we observed that while the FST method is effective in detecting additive loci, it performs poorly in identifying dominant effects.
Among these genes, nine co-selected genes by Duroc and Yorkshire pigs were identified, namely HSPG2, DLGAP1, MYOM1, GALNTL6, KAT6B, SAMD8, LRMDA, PRKG1, and VTI1A. A query in the PigBiobank database [18] showed that HSPG2 [34], KAT6B, SAMD8, and LRMDA were associated with litter size traits in pigs, while DLGAP1 [35], MYOM1, GALNTL6, PRKG1, and VTI1A exhibited strong correlations with teat size traits. Additionally, HSPG2 regulates morphogenesis and tissue differentiation, influencing birth weight and growth [36] while also affecting follicular quality and fertility in dairy cattle [37]. Notably, human studies indicate an inverse relationship between HSPG2 expression and oocyte quality in normal ovulatory cycles [38]. The GALNTL6 plays a role in male fertility through spermatogenesis regulation in Holstein cattle [39].
Our integrated analysis identified 13 candidate genes (MYO6, OTX2, BNC2, CCDC171, PTPRD, LSM1, AGPS, PDE11A, OSBPL6, ITGA4, FAM117B, SORCS2, and THSD7A) showing significant signals in both FST and additive effect analyses. Notably, 11 of these genes (excluding LSM1 and PDE11A) are documented in PigBiobank as associated with reproductive performance traits. Functional characterization reveals these genes’ possible roles in proliferation: OTX2 regulates embryonic development through epigenetic mechanisms [40,41], with its elevated expression correlating with improved fertilization rates in the avian model [42]. BNC2, expressed in ovarian somatic cells, is essential for normal ovarian morphology and fertility [43]. PDE11A, a known litter size candidate in sheep [44], modulates cAMP hydrolysis critical for gonadotropin signaling [45], ITGA4 influences both gestation length [46] and nuclear transfer embryo development in pigs [47], while SORCS2 participates in lipid metabolism [48]. THSD7A emerges as a key player in high-altitude adaptation across species [49,50,51].
Importantly, we prioritized loci that were not identified by sweep selection analysis but emerged through our LMM + ADDO framework, underscoring their potential role in porcine reproductive traits. Among the genes annotated, 11 genes—including DPYSL3, TRABD2B, CMPK1, STIL, FAAH, MAST2, PRDX1, TESK2, and BTBD19—were found to be associated with the teat numbers, while LRRC41 was associated with the litter size. CMPK1 has been identified across various bovine populations [52], and its association with epithelial ovarian cancer (EOC) has been reported in humans, and knockdown of CMPK affects apoptosis in EOC cells [53].
Notably, the loci of chr6:165892172 (T > C) located on TESK2 were associated with the number of pigs born alive in PigBiobank (Figure 5). Although studies on TESK2 in pigs are limited, Yang et al. [54] reported its involvement in the early growth and development of calves, and another study linked it to sperm function and fertility [55].
The functions of some candidate loci related to litter size in pigs were further validated through a genotype–phenotype association analysis in an independent pig population with both phenotypic and genotypic data. These findings collectively demonstrate the enhanced precision achieved by combining FST analysis with the LMM + ADDO framework to jointly consider both additive and dominant effects. This study also has some limitations since the study was conducted using PC1 as the phenotypic value. More experiments and studies are needed to validate these loci and genes we identified.

5. Conclusions

In the realm of genome-wide selection characterization, investigating additive and dominant effects across populations can enable more precise identification of candidate genes, which serve as valuable targets for breeding enhancement. As an additional result, we identified loci and genes potentially associated with porcine reproduction traits. The insights gained from this research deepen our understanding of the genetic mechanisms underlying phenotypic variation in Duroc and Yorkshire pigs. Moreover, our analytical mindset provides a viable method for exploring genome-wide trait selection within populations, offering new perspectives for future studies.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/vetsci12070657/s1, Figure S1: PCA; Table S1: Results of Fst analysis; Table S2: The Fst value and p-value of additive and dominant; Table S3: The annotation results of VEP.

Author Contributions

Conceptualization, C.C., H.S. and C.B.; methodology, C.C.; software, H.S.; validation, C.C. and C.B.; formal analysis, C.C. and Y.H.; investigation, Y.H.; resources, C.C., J.K. and H.S.; data curation, J.K. and X.Z.; writing—original draft preparation, C.C. and Y.H.; writing—review and editing, Y.H., C.B. and H.S.; visualization, C.C.; supervision, H.S.; project administration, J.F.; funding acquisition, B.S. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by the Science and Technology Development Project of Jilin province (20230202066NC).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data have been included in the main text and Supplementary Materials.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Wang, K.; Wu, P.; Yang, Q.; Chen, D.; Zhou, J.; Jiang, A.; Ma, J.; Tang, Q.; Xiao, W.; Jiang, Y.; et al. Detection of Selection Signatures in Chinese Landrace and Yorkshire Pigs Based on Genotyping-by-Sequencing Data. Front. Genet. 2018, 9, 119. [Google Scholar] [CrossRef] [PubMed]
  2. Sun, H.; Wang, Z.; Zhang, Z.; Xiao, Q.; Mawed, S.; Xu, Z.; Zhang, X.; Yang, H.; Zhu, M.; Xue, M.; et al. Genomic signatures reveal selection of characteristics within and between Meishan pig populations. Anim. Genet. 2018, 49, 119–126. [Google Scholar] [CrossRef] [PubMed]
  3. Alves, K.; Brito, L.F.; Baes, C.F.; Sargolzaei, M.; Robinson, J.A.B.; Schenkel, F.S. Estimation of additive and non-additive genetic effects for fertility and reproduction traits in North American Holstein cattle using genomic information. J. Anim. Breed. Genet. 2020, 137, 316–330. [Google Scholar] [CrossRef] [PubMed]
  4. Bolormaa, S.; Pryce, J.E.; Zhang, Y.; Reverter, A.; Barendse, W.; Hayes, B.J.; Goddard, M.E. Non-additive genetic variation in growth, carcass and fertility traits of beef cattle. Genet. Sel. Evol. 2015, 47, 26. [Google Scholar] [CrossRef]
  5. Cortés, O.; Martinez, A.M.; Cañon, J.; Sevane, N.; Gama, L.T.; Ginja, C.; Landi, V.; Zaragoza, P.; Carolino, N.; Vicente, A.; et al. Conservation priorities of Iberoamerican pig breeds and their ancestors based on microsatellite information. Heredity 2016, 117, 14–24. [Google Scholar] [CrossRef]
  6. Cui, L.; Yang, B.; Pontikos, N.; Mott, R.; Huang, L. ADDO: A comprehensive toolkit to detect, classify and visualize additive and non-additive quantitative trait loci. Bioinformatics 2020, 36, 1517–1521. [Google Scholar] [CrossRef]
  7. Li, Q.; Yuan, X.; Chen, Z.; Zhang, A.; Zhang, Z.; Zhang, H.; Li, J. Heritability estimates and effect on lifetime reproductive performance of age at puberty in sows. Anim. Reprod. Sci. 2018, 195, 207–215. [Google Scholar] [CrossRef]
  8. Ruan, D.; Yang, J.; Zhuang, Z.; Ding, R.; Huang, J.; Quan, J.; Gu, T.; Hong, L.; Zheng, E.; Li, Z.; et al. Assessment of Heterozygosity and Genome-Wide Analysis of Heterozygosity Regions in Two Duroc Pig Populations. Front. Genet. 2021, 12, 812456. [Google Scholar] [CrossRef]
  9. Deng, M.; Qiu, Z.; Liu, C.; Zhong, L.; Fan, X.; Han, Y.; Wang, R.; Li, P.; Huang, R.; Zhao, Q. Genome-wide association analysis revealed new QTL and candidate genes affecting the teat number in Dutch Large White pigs. Anim. Genet. 2024, 55, 206–216. [Google Scholar] [CrossRef]
  10. Browning, B.L.; Tian, X.; Zhou, Y.; Browning, S.R. Fast two-stage phasing of large-scale sequence data. Am. J. Hum. Genet. 2021, 108, 1880–1890. [Google Scholar] [CrossRef]
  11. Yang, J.; Lee, S.H.; Goddard, M.E.; Visscher, P.M. GCTA: A tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 2011, 88, 76–82. [Google Scholar] [CrossRef] [PubMed]
  12. Yin, L.; Zhang, H.; Tang, Z.; Yin, D.; Fu, Y.; Yuan, X.; Li, X.; Liu, X.; Zhao, S. HIBLUP: An integration of statistical models on the BLUP framework for efficient genetic evaluation using big genomic data. Nucleic Acids Res. 2023, 51, 3501–3512. [Google Scholar] [CrossRef] [PubMed]
  13. Danecek, P.; Auton, A.; Abecasis, G.; Albers, C.A.; Banks, E.; DePristo, M.A.; Handsaker, R.E.; Lunter, G.; Marth, G.T.; Sherry, S.T.; et al. The variant call format and VCFtools. Bioinformatics 2011, 27, 2156–2158. [Google Scholar] [CrossRef] [PubMed]
  14. Chang, C.C.; Chow, C.C.; Tellier, L.C.; Vattikuti, S.; Purcell, S.M.; Lee, J.J. Second-generation PLINK: Rising to the challenge of larger and richer datasets. Gigascience 2015, 4, 7. [Google Scholar] [CrossRef]
  15. McLaren, W.; Gil, L.; Hunt, S.E.; Riat, H.S.; Ritchie, G.R.; Thormann, A.; Flicek, P.; Cunningham, F. The Ensembl Variant Effect Predictor. Genome Biol. 2016, 17, 122. [Google Scholar] [CrossRef]
  16. Dyer, S.C.; Austine-Orimoloye, O.; Azov, A.G.; Barba, M.; Barnes, I.; Barrera-Enriquez, V.P.; Becker, A.; Bennett, R.; Beracochea, M.; Berry, A.; et al. Ensembl 2025. Nucleic Acids Res. 2025, 53, D948–D957. [Google Scholar] [CrossRef]
  17. Fu, Y.; Xu, J.; Tang, Z.; Wang, L.; Yin, D.; Fan, Y.; Zhang, D.; Deng, F.; Zhang, Y.; Zhang, H.; et al. A gene prioritization method based on a swine multi-omics knowledgebase and a deep learning model. Commun. Biol. 2020, 3, 502. [Google Scholar] [CrossRef]
  18. Zeng, H.; Zhang, W.; Lin, Q.; Gao, Y.; Teng, J.; Xu, Z.; Cai, X.; Zhong, Z.; Wu, J.; Liu, Y.; et al. PigBiobank: A valuable resource for understanding genetic and biological mechanisms of diverse complex traits in pigs. Nucleic Acids Res. 2024, 52, D980–D989. [Google Scholar] [CrossRef]
  19. Hu, Z.L.; Park, C.A.; Reecy, J.M. Bringing the Animal QTLdb and CorrDB into the future: Meeting new challenges and providing updated services. Nucleic Acids Res. 2022, 50, D956–D961. [Google Scholar] [CrossRef]
  20. Sun, H.; Cheng, Y.; He, Y.; Cheng, C.; Zhao, H.; Yang, S.; Wei, M.; Yang, J.; Liang, S.; Bai, C.; et al. Genome-wide association studies for the number of piglets born alive and dead in Dongliao black pigs. Anim. Genet. 2024, 55, 282–285. [Google Scholar] [CrossRef]
  21. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2024; Available online: https://www.R-project.org/ (accessed on 14 June 2024).
  22. Wang, X.; Wang, L.; Shi, L.; Zhang, P.; Li, Y.; Li, M.; Tian, J.; Wang, L.; Zhao, F. GWAS of Reproductive Traits in Large White Pigs on Chip and Imputed Whole-Genome Sequencing Data. Int. J. Mol. Sci. 2022, 23, 13338. [Google Scholar] [CrossRef] [PubMed]
  23. Duijvesteijn, N.; Veltmaat, J.M.; Knol, E.F.; Harlizius, B. High-resolution association mapping of number of teats in pigs reveals regions controlling vertebral development. BMC Genom. 2014, 15, 542. [Google Scholar] [CrossRef] [PubMed]
  24. Bovo, S.; Ballan, M.; Schiavo, G.; Ribani, A.; Tinarelli, S.; Utzeri, V.J.; Dall’Olio, S.; Gallo, M.; Fontanesi, L. Single-marker and haplotype-based genome-wide association studies for the number of teats in two heavy pig breeds. Anim. Genet. 2021, 52, 440–450. [Google Scholar] [CrossRef] [PubMed]
  25. Chalkias, H.; Jonas, E.; Andersson, L.S.; Jacobson, M.; de Koning, D.J.; Lundeheim, N.; Lindgren, G. Identification of novel candidate genes for the inverted teat defect in sows using a genome-wide marker panel. J. Appl. Genet. 2017, 58, 249–259. [Google Scholar] [CrossRef]
  26. He, L.C.; Li, P.H.; Ma, X.; Sui, S.P.; Gao, S.; Kim, S.W.; Gu, Y.Q.; Huang, Y.; Ding, N.S.; Huang, R.H. Identification of new single nucleotide polymorphisms affecting total number born and candidate genes related to ovulation rate in Chinese Erhualian pigs. Anim. Genet. 2017, 48, 48–54. [Google Scholar] [CrossRef]
  27. Schneider, J.F.; Miles, J.R.; Brown-Brandl, T.M.; Nienaber, J.A.; Rohrer, G.A.; Vallet, J.L. Genomewide association analysis for average birth interval and stillbirth in swine. J. Anim. Sci. 2015, 93, 529–540. [Google Scholar] [CrossRef]
  28. Chen, J.; Dou, T.; Wu, Z.; Bai, L.; Xu, M.; Zhang, Y.; Yang, S.; Xu, S.; Han, X.; Qiao, R.; et al. Genomic prediction accounting for dominance and epistatic genetic effects on litter size traits in Large White pigs. J. Anim. Sci. 2025, 103, skaf004. [Google Scholar] [CrossRef]
  29. Esfandyari, H.; Bijma, P.; Henryon, M.; Christensen, O.F.; Sørensen, A.C. Genomic prediction of crossbred performance based on purebred Landrace and Yorkshire data using a dominance model. Genet. Sel. Evol. 2016, 48, 40. [Google Scholar] [CrossRef]
  30. Zeng, J.; Toosi, A.; Fernando, R.L.; Dekkers, J.C.; Garrick, D.J. Genomic selection of purebred animals for crossbred performance in the presence of dominant gene action. Genet. Sel. Evol. 2013, 45, 11. [Google Scholar] [CrossRef]
  31. Sun, C.; VanRaden, P.M.; Cole, J.B.; O’Connell, J.R. Improvement of prediction ability for genomic selection of dairy cattle by including dominance effects. PLoS ONE 2014, 9, e103934. [Google Scholar] [CrossRef]
  32. Vitezica, Z.G.; Varona, L.; Elsen, J.M.; Misztal, I.; Herring, W.; Legarra, A. Genomic BLUP including additive and dominant variation in purebreds and F1 crossbreds, with an application in pigs. Genet. Sel. Evol. 2016, 48, 6. [Google Scholar] [CrossRef] [PubMed]
  33. Mei, Q.; Vitezica, Z.G.; Li, J.; Zhao, S.; Legarra, A.; Xiang, T. Impacts of additive, dominance, and inbreeding depression effects on genomic evaluation by combining two SNP chips in Canadian Yorkshire pigs bred in China. Genet. Sel. Evol. 2022, 54, 69. [Google Scholar] [CrossRef]
  34. Zhang, J.; Xiong, Y.; Zuo, B.; Lei, M.; Jiang, S.; Li, F.; Zheng, R.; Li, J.; Xu, D. Detection of quantitative trait loci associated with several internal organ traits and teat number trait in a pig population. J. Genet. Genom. 2007, 34, 307–314. [Google Scholar] [CrossRef] [PubMed]
  35. Tortereau, F.; Gilbert, H.; Heuven, H.C.; Bidanel, J.P.; Groenen, M.A.; Riquet, J. Combining two Meishan F2 crosses improves the detection of QTL on pig chromosomes 2, 4 and 6. Genet. Sel. Evol. 2010, 42, 42. [Google Scholar] [CrossRef] [PubMed]
  36. Ogunbawo, A.R.; Hidalgo, J.; Mulim, H.A.; Carrara, E.R.; Ventura, H.T.; Souza, N.O.; Lourenco, D.; Oliveira, H.R. Applying the algorithm for Proven and young in GWAS Reveals high polygenicity for key traits in Nellore cattle. Front. Genet. 2025, 16, 1549284. [Google Scholar] [CrossRef]
  37. Zachut, M.; Sood, P.; Levin, Y.; Moallem, U. Proteomic analysis of preovulatory follicular fluid reveals differentially abundant proteins in less fertile dairy cows. J. Proteom. 2016, 139, 122–129. [Google Scholar] [CrossRef]
  38. Ma, Y.; Jin, J.; Tong, X.; Yang, W.; Ren, P.; Dai, Y.; Pan, Y.; Zhang, Y.; Zhang, S. ADAMTS1 and HSPG2 mRNA levels in cumulus cells are related to human oocyte quality and controlled ovarian hyperstimulation outcomes. J. Assist. Reprod. Genet. 2020, 37, 657–667. [Google Scholar] [CrossRef]
  39. Ghoreishifar, M.; Vahedi, S.M.; Salek Ardestani, S.; Khansefid, M.; Pryce, J.E. Genome-wide assessment and mapping of inbreeding depression identifies candidate genes associated with semen traits in Holstein bulls. BMC Genom. 2023, 24, 230. [Google Scholar] [CrossRef]
  40. Jia, Y.; Liu, Y.; Li, Y.; Guan, X.; Xu, J.; Yan, Z.; Liu, K.; Zhang, Y.; Bai, D.; Xiang, J.; et al. Persistent Wnt signaling affects IVF embryo implantation and offspring metabolism. Sci. Bull. 2025. [Google Scholar] [CrossRef]
  41. Zhang, T.; Peng, Z.; Meng, F.; Li, Z.; Chen, J.; Zhou, Q.; Leng, L.; Bo, H.; Lu, G.; Deng, Y.; et al. Maternal transcription factor OTX2 directly induces SETD1A and promotes embryonic genome activation in human pre-implantation embryos. Sci. China Life Sci. 2025. [Google Scholar] [CrossRef]
  42. Wu, S.J.; Cheng, Y.S.; Liu, H.L.; Wang, H.H.; Huang, H.L. Global transcriptional expression in ovarian follicles from Tsaiya ducks (Anas platyrhynchos) with a high-fertilization rate. Theriogenology 2016, 85, 1439–1445.e1. [Google Scholar] [CrossRef] [PubMed]
  43. Lang, M.R.; Patterson, L.B.; Gordon, T.N.; Johnson, S.L.; Parichy, D.M. Basonuclin-2 requirements for zebrafish adult pigment pattern development and female fertility. PLoS Genet. 2009, 5, e1000744. [Google Scholar] [CrossRef] [PubMed]
  44. Li, J.; Gong, Y.; Wang, X.; He, X.; He, X.; Chu, M.; Di, R. Screening of Litter-Size-Associated SNPs in NOX4, PDE11A and GHR Genes of Sheep. Animals 2024, 14, 767. [Google Scholar] [CrossRef] [PubMed]
  45. Petersen, T.S.; Stahlhut, M.; Andersen, C.Y. Phosphodiesterases in the rat ovary: Effect of cAMP in primordial follicles. Reproduction 2015, 150, 11–20. [Google Scholar] [CrossRef]
  46. Wang, X.Q.; Wang, L.G.; Shi, L.Y.; Tian, J.J.; Li, M.Y.; Wang, L.X.; Zhao, F.P. Imputation strategies for low-coverage whole-genome sequencing data and their effects on genomic prediction and genome-wide association studies in pigs. Animal 2024, 18, 101258. [Google Scholar] [CrossRef]
  47. Wada, Y.; Akita, T.; Awata, T.; Furukawa, T.; Sugai, N.; Inage, Y.; Ishii, K.; Ito, Y.; Kobayashi, E.; Kusumoto, H.; et al. Quantitative trait loci (QTL) analysis in a Meishan x Göttingen cross population. Anim. Genet. 2000, 31, 376–384. [Google Scholar] [CrossRef]
  48. Faggion, S.; Bonfatti, V.; Carnier, P. Genome-Wide Association Study for Weight Loss at the End of Dry-Curing of Hams Produced from Purebred Heavy Pigs. Animals 2024, 14, 1983. [Google Scholar] [CrossRef]
  49. Ahmad, S.F.; Gangwar, M.; Kumar, A.; Kumar, A.; Dige, M.S.; Jha, G.K.; Gaur, G.K.; Dutt, T. Dissecting genomes of multiple yak populations: Unveiling ancestry and high-altitude adaptation through whole-genome resequencing analysis. BMC Genom. 2025, 26, 214. [Google Scholar] [CrossRef]
  50. Du, H.; Zhou, L.; Liu, Z.; Zhuo, Y.; Zhang, M.; Huang, Q.; Lu, S.; Xing, K.; Jiang, L.; Liu, J.F. The 1000 Chinese Indigenous Pig Genomes Project provides insights into the genomic architecture of pigs. Nat. Commun. 2024, 15, 10137. [Google Scholar] [CrossRef]
  51. Ma, Y.F.; Han, X.M.; Huang, C.P.; Zhong, L.; Adeola, A.C.; Irwin, D.M.; Xie, H.B.; Zhang, Y.P. Population Genomics Analysis Revealed Origin and High-altitude Adaptation of Tibetan Pigs. Sci. Rep. 2019, 9, 11463. [Google Scholar] [CrossRef]
  52. Edea, Z.; Dadi, H.; Dessie, T.; Uzzaman, M.R.; Rothschild, M.F.; Kim, E.S.; Sonstegard, T.S.; Kim, K.S. Genome-wide scan reveals divergent selection among taurine and zebu cattle populations from different regions. Anim. Genet. 2018, 49, 550–563. [Google Scholar] [CrossRef] [PubMed]
  53. Zhou, D.; Zhang, L.; Lin, Q.; Ren, W.; Xu, G. Data on the association of CMPK1 with clinicopathological features and biological effect in human epithelial ovarian cancer. Data Brief. 2017, 13, 77–84. [Google Scholar] [CrossRef] [PubMed]
  54. Yang, Y.; Chang, C.; Baiyin, B.; Liu, Z.; Guo, L.; Zhou, L.; Liu, B.; Shi, C.; Zhang, W. Blood Transcriptome Analysis of Beef Cow with Different Parity Revealed Candidate Genes and Gene Networks Regulating the Postpartum Diseases. Genes. 2022, 13, 1671. [Google Scholar] [CrossRef] [PubMed]
  55. Gòdia, M.; Castelló, A.; Rocco, M.; Cabrera, B.; Rodríguez-Gil, J.E.; Balasch, S.; Lewis, C.; Sánchez, A.; Clop, A. Identification of circular RNAs in porcine sperm and evaluation of their relation to sperm motility. Sci. Rep. 2020, 10, 7985. [Google Scholar] [CrossRef]
Figure 1. Manhattan and Q-Q plots for additive effects (A) and dominant effects (B).
Figure 1. Manhattan and Q-Q plots for additive effects (A) and dominant effects (B).
Vetsci 12 00657 g001
Figure 2. Selective sweep analysis plot for Duroc and Yorkshire. The curves represent the stacking percentages of the bar charts. The blue dots in the center are the Duroc group selected region, and the green dots are the Yorkshire group selected region. The orange area on the right is the highest 5% FST.
Figure 2. Selective sweep analysis plot for Duroc and Yorkshire. The curves represent the stacking percentages of the bar charts. The blue dots in the center are the Duroc group selected region, and the green dots are the Yorkshire group selected region. The orange area on the right is the highest 5% FST.
Vetsci 12 00657 g002
Figure 3. Veen diagrams of SNPs (A) and genes (B) identified by F s t , additive effect (Add), and dominant effect (Dom).
Figure 3. Veen diagrams of SNPs (A) and genes (B) identified by F s t , additive effect (Add), and dominant effect (Dom).
Vetsci 12 00657 g003
Figure 4. Loci genotype effects on total number born (A) and number born alive (B). The dom means the locus has a significant dominant effect, add_dom means this locus has a significant additive and dominant effect, 0|0 means no mutant purebred, 0|1 means heterozygous, and 1|1 means mutant purebred. Different letters in the top indicate significant differences (p < 0.05) between genotypes. The values in the boxes mean the average of TNB and NBA for individuals of different genotypes, and the n above the boxes represents the sample size.
Figure 4. Loci genotype effects on total number born (A) and number born alive (B). The dom means the locus has a significant dominant effect, add_dom means this locus has a significant additive and dominant effect, 0|0 means no mutant purebred, 0|1 means heterozygous, and 1|1 means mutant purebred. Different letters in the top indicate significant differences (p < 0.05) between genotypes. The values in the boxes mean the average of TNB and NBA for individuals of different genotypes, and the n above the boxes represents the sample size.
Vetsci 12 00657 g004
Figure 5. PheWAS results for locus chr6:165892172 (T > C) in PigBiobank. The y-axis shows the l o g 10   ( p v a l u e ) , where higher values indicate greater statistical significance of the association between the locus and each trait. The x-axis represents reproduction-related traits. NBA: number born alive; GD: gestation days; TNUM_T: total teat number; DPR: sperm abnormality rate; NBH_P2: number of healthy piglets born at parity 2.
Figure 5. PheWAS results for locus chr6:165892172 (T > C) in PigBiobank. The y-axis shows the l o g 10   ( p v a l u e ) , where higher values indicate greater statistical significance of the association between the locus and each trait. The x-axis represents reproduction-related traits. NBA: number born alive; GD: gestation days; TNUM_T: total teat number; DPR: sperm abnormality rate; NBH_P2: number of healthy piglets born at parity 2.
Vetsci 12 00657 g005
Table 1. Noteworthy SNP loci recorded in the QTL database.
Table 1. Noteworthy SNP loci recorded in the QTL database.
LociAdditive
(p)
Dominant
(p)
F s t Reproduction QTL (QTL ID)References
16_1566478535.24 × 10−70.7201510.096352Gestation length (261,966)Wang et al., 2017 [22]
7_819489411.64 × 10−70.8435020.19554Teat number (37,446)Duijvesteijn et al., 2014 [23]
8_44513192.58 × 10−150.003730.004865Teat number (223,271)Bovo et al., 2021 [24]
10_328226793.64 × 10−70.4240280.258834Nonfunctional nipples (124,209)Chalkias et al., 2017 [25]
14_1073772941.52 × 10−110.0050870.002753Litter size (130,290)He et al., 2017 [26]
15_750639533.91 × 10−90.0497190.516495Number of stillborn (57,518)Schneider et al., 2015 [27]
16_685426701.92 × 10−80.8031310.378008Teat number (223,295)Bovo et al., 2021 [24]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Chen, C.; He, Y.; Ke, J.; Zhang, X.; Fei, J.; Sun, B.; Sun, H.; Bai, C. Genomic Analysis of Reproductive Trait Divergence in Duroc and Yorkshire Pigs: A Comparison of Mixed Models and Selective Sweep Detection. Vet. Sci. 2025, 12, 657. https://doi.org/10.3390/vetsci12070657

AMA Style

Chen C, He Y, Ke J, Zhang X, Fei J, Sun B, Sun H, Bai C. Genomic Analysis of Reproductive Trait Divergence in Duroc and Yorkshire Pigs: A Comparison of Mixed Models and Selective Sweep Detection. Veterinary Sciences. 2025; 12(7):657. https://doi.org/10.3390/vetsci12070657

Chicago/Turabian Style

Chen, Changyi, Yu He, Juan Ke, Xiaoran Zhang, Junwen Fei, Boxing Sun, Hao Sun, and Chunyan Bai. 2025. "Genomic Analysis of Reproductive Trait Divergence in Duroc and Yorkshire Pigs: A Comparison of Mixed Models and Selective Sweep Detection" Veterinary Sciences 12, no. 7: 657. https://doi.org/10.3390/vetsci12070657

APA Style

Chen, C., He, Y., Ke, J., Zhang, X., Fei, J., Sun, B., Sun, H., & Bai, C. (2025). Genomic Analysis of Reproductive Trait Divergence in Duroc and Yorkshire Pigs: A Comparison of Mixed Models and Selective Sweep Detection. Veterinary Sciences, 12(7), 657. https://doi.org/10.3390/vetsci12070657

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop