Next Article in Journal
Functional Analysis of CLE26 in Controlling De Novo Root Regeneration from Detached Arabidopsis Leaves
Previous Article in Journal
Special Issue “The Tight Junction and Its Proteins: From Structure to Pathologies”
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genomic Prediction of Semen Traits in Boars Incorporating Biological Interactions

1
Key Laboratory of Livestock and Poultry Resources (Pig) Evaluation and Utilization, Ministry of Agriculture and Rural Affair, Yuelushan Laboratory, College of Animal Science and Technology, Hunan Agricultural University, Changsha 410128, China
2
Institute of Subtropical Agriculture, Chinese Academy of Sciences, Changsha 410125, China
3
State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou 510006, China
*
Authors to whom correspondence should be addressed.
Int. J. Mol. Sci. 2024, 25(23), 13155; https://doi.org/10.3390/ijms252313155
Submission received: 5 November 2024 / Revised: 3 December 2024 / Accepted: 5 December 2024 / Published: 7 December 2024
(This article belongs to the Section Molecular Genetics and Genomics)

Abstract

In the context of modern pig farming, the central role of boars is underscored by large-scale centralized breeding and the widespread application of artificial insemination techniques. However, previous studies and breeding programs have focused mainly on product efficiency traits, such as growth rate, lean meat yield, and litter size, often neglecting boar semen traits. In this study, we estimated the genetic parameters and assessed the genomic prediction accuracy of boar semen traits with phenotypes evaluated from 274,332 ejections in a large population consisting of 2467 Duroc boars. Heritability of sperm morphological abnormality rate (ABN), fresh semen volume (VOL), sperm concentration (DEN), and motility (MOT) were estimated to be 0.43, 0.22, 0.23, and 0.16, respectively. GBLUP achieved a moderate predictive ability of semen traits, with a range of 0.32–0.50. Incorporating gene interactions indicated by the KEGG pathways (biBLUP) significantly improved predictive accuracy over the classical additive model (GBLUP) and epistatic model (RKHS). Moreover, biBLUP showed an improvement from 9.50% to 20.10% among the studied traits compared with GBLUP, with the greatest improvement (0.40 vs. 0.48) observed in sperm morphological abnormality rate. In conclusion, moderate to low heritability was estimated for the Duroc boar semen traits. Genomic prediction was able to achieve moderate accuracy, with a range from 0.32 to 0.56, for the studied traits. Considering gene interactions within KEGG pathways enhanced the predictive ability of boar semen traits.

1. Introduction

In recent years, with the widespread application of artificial insemination, the inheritance of superior boar genetics plays a crucial role in modern pork production systems. Traditional breeding programs have mainly focused on selecting traits such as growth rate, backfat thickness, and feed efficiency, neglecting the improvement of boar reproductive performance. Therefore, improving boar semen quality and enhancing boar reproductive performance are of urgent necessity. Previous studies reported that boar semen traits have shown moderate to low heritability, reflecting the potential of genetic improvement [1,2,3]. Genomic prediction (GP) [1], which utilizes genetic markers covering the whole genome to predict genetic merits, is able to achieve higher predictive accuracy than the traditional best linear unbiased prediction (BLUP) [4] and thus accelerate the genetic improvement progress of traits with low heritability.
With the development of high-throughput sequencing technology, the contribution of GP to the genetic improvement of economic traits in livestock and plants has become increasingly prominent, providing a basis for understanding the relationship between DNA sequence variations and economic traits [5,6,7]. GP is essential in facilitating trait selection, optimizing resource use, accelerating breeding progress, and promoting sustainable industry development [8]. Currently, statistical models used for GP include Genomic Best Linear Unbiased Prediction (GBLUP) [9], ridge regression best linear unbiased prediction (RR-BLUP) [10], Bayesian methods (BayesA, BayesB, etc.) [10,11], single-step GBLUP (ss-GBLUP) [7], and epistatic models such as reproducing kernel Hilbert space (RKHS) [12]. Depending on the genetic effects utilized, GP models can be categorized into additive effect models and non-additive effect models. Additive effect models assume that each gene’s impact on the phenotype is independent and cumulative, and they are widely used in livestock breeding. In contrast, non-additive effect models further consider gene–gene interactions, providing more accurate genetic parameter estimates and phenotype predictions. Ye, H. et al. H. et al. [13] introduced epistatic effects in genomic prediction using a haplotype-based model in Chinese yellow chickens, significantly improving prediction accuracy. Research in crops like maize [14] and wheat [15,16,17] also found that integrating non-additive effects could significantly enhance the accuracy of phenotype prediction. Moreover, it has been reported that models incorporating non-additive effects could achieve up to twice the predictive ability of additive effect models [18]. These findings demonstrate that models integrating non-additive effects have distinct advantages in phenotype prediction across different species, offering important theoretical guidance for further improving prediction accuracy. The rapid development and combined use of bioinformatics and molecular biology technologies have uncovered and validated the functions of numerous genes, providing resources for exploring gene interaction relationships. Previous studies suggest that estimating gene–gene epistatic effects based on prior gene function information and incorporating them into GP could significantly enhance prediction accuracy [19]. Aiming to incorporate gene interactions revealed by the KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways into GP, Gao et al. [20] recently developed the biBLUP (biological interaction BLUP) model. This model uses KEGG pathways as carriers of biological interactions, integrating the interactive effects among genes within specific pathways into the model, thereby capturing these interactions in genomic prediction. This approach not only bridges the gap between statistics and biology but also enhances the predictive ability for complex traits. The model has improved the prediction accuracy by up to 40.36% and 16.29% for traits of yeast and rice, respectively. This indicates that integrating prior information regarding gene interactions in the GP model can enhance genomic prediction accuracy.
In this study, we estimated the genetic parameters of semen traits and investigated the genomic predictive ability in a large boar population with intensive phenotypic records and sequence level SNPs. Additionally, KEGG pathways were integrated into the prediction model to study the usefulness of pathways in modeling gene interactions and accelerating genomic prediction. This study intends to provide basic knowledge for constructing an early culling scheme for boars and further improve the efficiency of boar selection in large-scale pig breeding enterprises.

2. Results

2.1. Estimation of Genetic Parameters

The heritability of traits’ fresh semen volume (VOL), sperm concentration (DEN), motility (MOT), and abnormality rate (ABN) were estimated to be 0.221, 0.227, 0.164, and 0.432, respectively (Table 1). The parameter estimates indicate low to moderate heritability for the boar semen traits. Repeatability of traits VOL, DEN, MOT, and ABN were estimated to be 0.422, 0.363, 0.322, and 0.587, respectively.
The estimates of genetic and phenotypic correlations for the four semen traits are presented in Figure 1. It was found that VOL is negatively correlated with DEN (−0.736), MOT (−0.248), and ABN (−0.20). This suggests that selecting boars with a larger fresh semen volume may result in reduced DEN, decreased MOT, and the reduced probability of ABN. However, from a biological perspective, there is typically a balance between VOL and DEN. In larger semen samples, the sperm count is often more dispersed, leading to a relatively lower DEN. DEN is positively correlated with MOT (0.552) and ABN (0.064). Higher DEN may be associated with healthier spermatogenic processes and maturation within the testes, with these sperm typically exhibiting stronger MOT and better movement capabilities. Additionally, higher DEN may reflect more optimal hormone levels (such as testosterone), which have a positive impact on sperm production and function. Environmental factors, such as appropriate temperature, nutritional support, and reduced external stress, also contribute to maintaining sperm quality, potentially enhancing the correlation between DEN and MOT. MOT and ABN are highly negatively correlated (−0.837), indicating that higher MOT is associated with a lower probability of ABN. This strong negative correlation provides valuable information for breeding. Sperm with higher motility generally have better quality and are less likely to exhibit morphological ABN. Therefore, in breeding programs, boars with higher MOT should be prioritized, as their sperm not only demonstrate better MOT but also have a lower incidence of morphological defects. This can be crucial for improving the reproductive performance of the herd. The estimates of phenotypic correlations among the four semen traits displayed similar trends as the genetic correlations (Figure 1).

2.2. Predictive Ability of the Classical Models

Overall, the genomic prediction of the semen traits demonstrated moderate predictive ability, with a range from 0.32 to 0.56 (Table 2), reflecting the potential of the implementation of genomic selection in the genetic improvement of these traits in elite boar breeding programs. For the studied boar semen traits, both GFBLUP and biBLUP showed an improvement in genomic prediction accuracy across all traits compared to GBLUP (Table 2). In this study, we found that the biBLUP method significantly outperformed the traditional GBLUP, RKHS, and GFBLUP methods in predicting boar semen traits. For the ABN trait, biBLUP was 20.10% more accurate than GBLUP. And it demonstrated superior predictive accuracy by 9.50% and 15.79% compared to RKHS and GFBLUP, respectively. This trend was similarly evident in the prediction of the VOL trait, where the biBLUP was 13.94% more accurate than GBLUP and performed 13.48% and 10.16% better than RKHS and GFBLUP. For the DEN trait, biBLUP showed the highest accuracy, outperforming GBLUP, RKHS, and GFBLUP by 13.63%, 12.39%, and 10.71%, respectively. In the prediction accuracy for the MOT trait, biBLUP similarly showcased its exceptional predictive power, leading by 15.63%, 10.12%, and 9.14% over GBLUP, RKHS and GFBLUP, respectively. These findings validate the potential of biBLUP in the field of genomic prediction.

2.3. Predictive Ability of the Top Pathways

In this study, we focused on investigating the utility of KEGG pathways in accelerating the prediction of semen traits. The top 10 KEGG pathways showed the highest predictive ability of fresh semen volume, as listed in Table 3. Comparatively, the biBLUP model, which accounts for interactions within the KEGG pathways, showed a better predictive accuracy than the GFBLUP model, with an increase ranging from 0.511 to 0.564. Overall, the biological functions encompassed by the top pathways include energy metabolism, cell cycle regulation, oxidative stress response, protein synthesis and degradation, tissue growth and regeneration, neurotransmission, metabolic regulation, immune response, cell proliferation and differentiation, and the development of blood and immune cells. These functions may indirectly or directly affect sperm vitality, maturation, survival, as well as the development and function of the testes, thereby exerting a significant impact on reproductive health and sperm quality. Notably, the beta-alanine metabolism pathway (ssc00410) showed the highest increase in predictive accuracy, representing a 10.37% increase compared with GFBLUP. Additionally, pathway ssc00410, which is involved in cellular energy metabolism and pH regulation, may indirectly influence sperm vitality and function.
For the trait of sperm concentration, the top pathways encompass a range of biological processes, including viral infections, protein digestion, glycosylation, autophagy, cardiovascular diseases, cytoskeletal dynamics, mRNA surveillance, platelet activation, drug addiction, and thermogenesis (Table 4). These biological functions may indirectly or directly influence sperm vitality, maturation, survival, as well as the development and function of the testes, thereby exerting a significant impact on reproductive health and sperm quality. The herpes simplex virus 1 infection pathway (ssc05168) showed the highest increase in predictive accuracy, representing a 11.18% increase compared to GFBLUP. This suggests a potential underlying connection between viral infection mechanisms and sperm function, possibly through the impact on the host’s immune response and cellular health.
For the trait of sperm MOT, the top pathways encompass biological processes including T cell receptor signaling, beta-alanine metabolism, retinol metabolism, Notch signaling, intestinal immune responses, morphine addiction, pancreatic cancer, and the degradation of branched-chain amino acids, as well as cytokine–cytokine receptor interactions (Table 5). These biological functions may indirectly or directly influence sperm motility, maturation, and survival, as well as the development and function of the testes, thereby exerting a significant impact on reproductive health and sperm quality. The identification of these pathways underscores the complexity of the genetic factors influencing sperm motility and highlights the importance of considering a broad spectrum of biological processes in genomic prediction models. Notably, the cellular senescence pathway (ssc04218) showed the highest increase in predictive accuracy, representing an 11.11% increase compared with GFBLUP. This suggests a potential underlying connection between cellular aging processes and sperm motility, possibly through impacts on the vitality and functional maintenance of sperm cells.
For the trait of sperm morphology abnormality, the top pathways encompass biological processes including VEGF signaling, long-term potentiation, metabolism of xenobiotics by cytochrome P450, insulin resistance, N-Glycan biosynthesis, FOXO signaling, basal cell carcinoma, antigen processing and presentation, and regulation of the actin cytoskeleton (Table 6). These biological functions may indirectly or directly influence sperm morphology, maturation, and survival, as well as the development and function of the testes, thereby exerting a significant impact on reproductive health and sperm quality. Notably, the pertussis pathway (ssc05133) showed the highest increase in predictive accuracy, representing a 16.91% increase compared to GFBLUP. This suggests a potential underlying connection between bacterial infection mechanisms, such as those involved in pertussis, and sperm morphology, possibly through the impact on the host’s immune response and cellular health.

3. Discussion

3.1. Estimation of Heritability for Semen Traits

Heritability is an important measure of the role of genetics in determining traits [21]. In this study, we estimated the heritability of semen traits in a large population of Duroc boars, providing valuable insights into the impact of genetics and the environment on traits. This is significant for further research into the mechanisms of trait formation and the prediction of phenotypes. Our results indicate that VOL and DEN possess moderate heritability, aligning with the results of pioneers in the field [1], suggesting that these traits can be effectively targeted in breeding programs to enhance the overall reproductive capacity of the population. For MOT, our results support the findings of previous researchers [22]. This discrepancy may be attributed to a broad range of variation in environmental factors [23] or potential estimation bias due to an insufficient sample size in previous studies [24]. The high heritability implies that selective breeding could significantly reduce the rate of ABN, thereby improving the overall reproductive quality of boars.
Compared to the study by Marques D et al. [22], the standard error of the estimated heritability in this study is lower, indicating the higher degree of precision and reliability of our estimates. Furthermore, we estimated the variance components using data from 2467 Duroc boars and 274,332 semen collections, a larger sample size than the study by Smital. J et al. [25], which involved 132 Duroc boars. The larger sample size reduces sampling error and improves the stability of the estimation, further enhancing the reliability of our results. This study indicates that incorporating gene interactions within KEGG pathways improved predictive accuracy, surpassing the traditional GBLUP and RKHS methods. Moreover, the biBLUP demonstrated a significant enhancement in predictive ability for the trait of ABN, further emphasizing the importance of enhancing the reliability of predictive models within the broader context of quantitative genetics.

3.2. Integration of Biological a Priori Information for Enhancing Genomic Prediction Accuracy

It is widely accepted in the quantitative genetics community that utilizing biological a priori information is a strategic approach to improve the accuracy of predictive models. Our results align with this consensus, suggesting that integrating biological a priori information into genomic prediction models is more likely to correctly identify biomarkers associated with the traits under study. Hashemi et al. [26] reported that by integrating some dynamical system properties, the predictive ability can be improved by reducing false-positive results. Additionally, the inclusion of gene annotations [27] or the genetic structure of traits [28] into genomic predictions was able to improve the predictive ability and reduce the prediction bias of traits in different species. Considering candidate gene identification, Zakharov et al. [29] highlighted the importance of biological information in clarifying variants that may significantly affect traits but are challenging to detect using conventional analyses, thereby enhancing the effectiveness of GWAS in uncovering associations between rare variants and complex traits. Our results further confirm the value of integrating diverse information sources in quantitative genetics research, especially when considering gene interactions. This indicates that carefully selecting and integrating biological a priori information can significantly improve the performance of genomic prediction models, which is of great importance for animal production.

3.3. Impact of KEGG Pathways on Gene Interaction Effects and Prediction Accuracy

In this study, we investigated the utility of KEGG pathways in capturing gene interaction effects and subsequently enhancing the predictive ability for semen traits in a large Duroc boar population. Compared with the classical additive effects model (GBLUP), the global interactive model (RKHS), and the additive model based on KEGG pathways (GFBLUP), the interaction effects model based on KEGG pathways (biBLUP) demonstrated varying degrees of improvement in predictive accuracy. Notably, the biBLUP model showed particularly prominent performance in the trait of sperm abnormality rate, with an improvement in predictive accuracy ranging from 9.14% to 20.10% over the classical models. These results indicated that the inclusion of non-additive effects significantly improved genomic prediction accuracy compared to models that only included additive effects. Our findings suggested a potential epistatic genetic background for the studied traits (especially for the trait of ABN) and highlighted the capability of KEGG pathways in capturing the underlying gene interactions that contribute to the variation in these traits.
Our study not only reveals the potential epistatic genetic background of these traits but also highlights the key role of KEGG pathways in deciphering these complex genetic interactions. Given the rapid advances in genomics and bioinformatics, we can now delve deeper into non-additive effects, which play a crucial role in determining multiple biological characteristics and have a far greater impact on economically important traits in animals than simple additive effects. The expression of many biological characteristics is influenced by the complex interactions between multiple genes [30]. Non-additive effects can explain a larger proportion of genetic variation compared to additive effects, and they play an important role in economically important traits in animals [31]. For instance, the genetic assessment and prediction of wood and growth traits were significantly improved by incorporating non-additive effects in the prediction model for tree growth traits in balsam fir [32]. By incorporating non-additive effects (such as dominance effects) into the prediction models for Holstein and Jersey cattle, the prediction accuracy of production traits was significantly improved compared to models that only considered additive effects [33]. These findings suggest that models integrating non-additive effects have significant advantages in phenotypic prediction. Moreover, genomic prediction models with non-additive effects can better capture the complex associations between genotypes and phenotypes, thereby enhancing prediction accuracy and stability, providing new ideas and methods for pig breeding practices [34].

3.4. Exploring the Role of Biological Pathways in Sperm Traits and Reproductive Efficiency

In this study, we utilized genomic prediction models, particularly the biBLUP model, to conduct an in-depth analysis of the genetic basis of semen traits in Duroc boars. By investigating predictive ability of the KEGG pathway model on the traits VOL, DEN, MOT, and ABN, we found the potential roles of multiple biological processes in spermatogenesis and sperm function. Firstly, we found that the beta-alanine metabolism pathway (ssc00410) showed an enhancement in predictive accuracy for semen volume, indicating that cellular energy metabolism and pH regulation may significantly influence sperm generation and vitality [35]. Notable, beta-alanine serves as an important intracellular buffering agent [36] and is crucial for maintaining the stability of the sperm cell environment. Additionally, beta-alanine metabolism was the most significant pathway in cauda epididymal lumen fluid [37]. Key enzymes within this pathway may participate in the energy metabolism of sperm, providing the necessary energy support for sperm motility and metabolic activities. Secondly, the significant improvement in predictive accuracy associated with the herpes simplex virus 1 infection pathway (ssc05168) for sperm concentration suggests that viral infections may indirectly affect sperm maturation and quantity by interfering with the host’s immune response and cellular health. A previous study had demonstrated that the asymptomatic seminal infection of HSV plays an important role in male infertility by adversely affecting sperm count [38]. HSV is not only detectable in semen but may also impair sperm quality, leading to male infertility or reduced fertility, and potentially increases the risk of miscarriage and adverse effects on the fetus when assisted reproductive technology (ART) is used [39]. Furthermore, the enhancement in predictive accuracy for sperm motility linked to the cellular senescence pathway (ssc04218) reveals that the process of cellular aging may impact sperm vitality and functional maintenance, subsequently affecting motility. Previous reports have indicated that cellular senescence, in synergy with the MAPK signaling pathway, further affects the normal synthesis of cholesterol and androgens, inhibits the normal synthesis of lactate and pyruvate, and ultimately affects spermatogenesis [40]. Cellular senescence can result in decreased energy metabolism and structural damage within sperm cells, thereby compromising their motility [41]. Lastly, the improvement in predictive accuracy for sperm morphological abnormalities was associated with the pertussis pathway (ssc05133), indicating that bacterial infections may influence sperm morphology and function by activating the host immune response. Bacterial infections can trigger the release of inflammatory mediators, which may exert toxic effects on sperm cells, leading to morphological abnormalities. Previous reports have indicated that treatment with pertussis toxin significantly reduces the ability of sperm to undergo the acrosome reaction [42].
Our results underscore the complex interplay of various biological processes in spermatogenesis and sperm function. These results not only enhance our understanding of the genetic basis of semen traits, but they also provide new perspectives for future reproductive health research and pig breeding practices. Through further functional studies and validations, these pathways may serve as potential targets for improving semen production and reproductive efficiency in boars. Future research could explore the regulatory mechanisms of specific genes and molecules within these pathways, as well as how they interact to influence semen traits. Additionally, genes and molecules within these pathways could act as biomarkers or therapeutic targets for developing strategies to enhance sperm quality and improve the success rate of artificial insemination.

4. Materials and Methods

4.1. Animals and Phenotypes

An expanded dataset from our previous GWAS study [43] with more boars and semen ejection records were involved in this study. Phenotypic records from the years between 2016 and 2020 of 274,332 ejections from 2467 Duroc boars (111 ejections each boar on average, with a range of 11–343) were analyzed. VOL, DEN, MOT and ABN were evaluated after ejaculation using the UltiMateTM CASA system (Hamilton Thorne Inc., Beverly, MA, USA). The CASA system utilizes advanced digital image processing technologies to analyze and evaluate sperm motility and morphological characteristics. The system employs high-resolution microscopy and sophisticated image analysis software to perform the automated, in-depth analysis of sperm samples, including parameters such as sperm velocity, direction, morphology, and other biological factors. The CASA algorithms accurately identify and track the movement trajectories of individual sperm, enabling researchers to assess sperm quality in a highly efficient and objective manner. This approach enhances the precision and reliability of sperm quality evaluation. We removed any records where the VOL was less than 20 mL, any records with DEN higher than 20 × 109/mL, any records where the MOT was less than 0.5, and any data where the ABN was more than 0.6. The outlier values were removed prior to final analysis. Overall, 7664 pigs were included in the pedigree. Boars with less than 10 ejection records were pre-pruned from the dataset. Descriptive statistics of the semen traits were presented in Table 7.

4.2. Genotyping, Sequencing, and Imputation

Among the 2467 boars with phenotypic records, 2261 of which include ear tissue or the fresh semen available were genotyped with the GeneSeek Genomic Profiler (GGP) 50K SNP array. SNP positions were matched to the Sus Scrofa 11.1 reference genome, leaving 48,829 autosome SNPs for further analysis. SNPs with minor allele frequency (MAF) ≤ 0.01 or call rate ≤ 90% were filtered out, and the remained markers were imputed through Beagle v5.1 [44] for further analysis.
Aiming at obtaining sequence level SNP data for further genetic study, we integrated a resequencing-imputation strategy in this study. Briefly, 95 highly representative individuals from the genotyped population were sequenced with depths of 10× coverage on average, and sequence-level SNPs were called via GATK [45]. SNP array data of the other boars were imputed to the sequence level with the sequenced individual serve as reference panel. Concerning the resequencing individual selection, we implemented the strategy introduced by Druet et al. [46]. Briefly, the kinship of the sequencing candidates was calculated based on genotype data, and the average kinship coefficients with other individuals were computed for each candidate. The first pig with the highest average kinship coefficient with the rest was chosen as the first sequencing subject. The remaining candidates were selected following the principle of “maximizing the representativity of the sequenced genomes” until an expected population genome representativity of 70% was achieved.
DNA was extracted from fresh semen (boars) or ear tissues (sows) of the selected 95 individuals and sequenced with 150 bp paired-end reads on the Illumina Novaseq6000 platform. GATK v4.0 [47] software was employed for SNP calling from the resequencing data. Initially, sequencing data were aligned to the pig reference genome (Sus scrofa 11.1) using the BWA software. SAMtools was used to convert sam files to binary bam files, and Picard’s MarkDuplicates was used to remove duplicate sequences. SNP detection and quality control were then conducted with GATK software, applying criteria such as Quality by Depth (QD) < 2.0, FisherStrand > 60.0, Mapping Quality (MAPQ) < 40.0, MQRankSum < −12.5, and ReadPosRankSum < −8.0. This resulted in 8.429 million high-quality SNPs.
GGP 50K chip data were imputed to sequence-level using Beagle v5.1 with the resequencing SNP panel as reference. The imputed sequence-level SNP data were further quality-controlled with the criteria MAF ≥ 0.01 and linkage disequilibrium r 2 < 0.9 , resulting in 2084 boars and 262,766 SNPs for further analysis.

4.3. Genetic Parameter Estimation

The Asreml-R package was used to estimate breeding values for each trait using a repeatability model. Genetic parameters, including heritability ( h 2 ), repeatability ( r 2 ), and correlations among traits, were calculated. The four-trait repeatability model is given as follows:
y = X b + A g e + I n t + Z u + E p + e ,
where y is the matrix of observations, X is the incidence matrix for fixed effects, b is the vector of fixed effects of semen collection herd-year-season, and A g e and I n t denote the age of boar in month and collection interval, respectively. Z is the incidence matrix for polygenic effects; u ~ N ( 0 , A σ u 2 ) is the vector of estimated breeding values; A is the pedigree kinship matrix; E is the incidence matrix for permanent environmental effects; p ~ N ( 0 , I σ p 2 ) is the vector of permanent environmental effects; and e ~ N ( 0 , I σ e 2 ) represents the residuals, σ u 2 , σ p 2 , and σ e 2 , which are the variance components. De-regressed proofs (DRPs) were calculated following the formula introduced by Garrick [48], based on the estimated breeding values, for further predictive ability evaluation.

4.4. SNP-KEGG Pathway Mapping

The KEGGREST package [49] in R v4.2 was used to download the KEGG pathways for pigs (Sus scrofa 11.1) from the KEGG database. Genes were extended 5 kb upstream of the transcription start site and downstream of the transcription termination site to include potential regulatory elements. SNPs were mapped to genes and then to the KEGG pathways, resulting in SNP sets for all pathways [27]. KEGG pathways with fewer representative genes (each gene having at least one SNP located) than 10 were pruned from the analysis, resulting in 323 pathways for further study.

4.5. Genomic Prediction Models

This study intended to integrate KEGG pathways into the genomic prediction model and compare the predictive ability of the new model with the classical additive effect model (GBLUP), global epistatic model (RKHS), and additive effect model integrating single pathways (GFBLUP).
The GBLUP model is written as follows:
y = μ + Z g + e ,
where y is the vector of DRPs; μ is the overall mean, Z is the incidence matrix; g ~ N ( 0 , G σ g 2 ) is the vector of genomic estimated breeding values; σ g 2 is the genetic variance; and e ~ N ( 0 , I σ e 2 ) represents the residuals, where σ e 2 is the residual variance. The kinship matrix G is constructed according to VanRaden (2008) [9], G = M M 2 j = 1 m = 1 p j 1 p j , where p j denotes the minor allele frequency (MAF) for the j t h SNP; m denotes the total number of SNPs; and M denotes the MAF-adjusted genotype matrix with elements { 0 2 p j , 1 2 p j , 2 2 p j } representing genotypes AA, AB, and BB, respectively.
The model for RKHS [12] is the same as Equation (2) except that g is replaced with i ~ N ( 0 , K σ i 2 ) , where i is the epistatic genetic values and σ i 2 is the epistatic genetic variance. The Gaussian kernel (kinship) is calculated as K M i , M i = e x p { | | M i M i | | 2 h } , where h represents the bandwidth parameter. In this study, we set the bandwidth parameter, following the form of our recent study [20], as h = m e d i a n | | M i M i | | 2 / { 0.05,0.1,0.5,1 , 2,4 } .
Concerning the KEGG pathway utilization, we compared two models with different manners for integrating the pathway information. For GFBLUP and biBLUP, additive or epistatic relationship matrices were constructed with SNPs mapped to each pathway, and subsequently, the additive or epistatic genetic values were calculated. The model incorporating KEGG pathways was defined as follows:
y = μ + Z g + Z i + e ,
where y represents is the vector of DRPs, μ represents the overall mean, and Z is the incidence matrix. Additionally, g ~ N 0 , G σ g 2 is the same as defined in Equation (2), with i ~ N ( 0 , K p σ i 2 ) representing the KEGG pathway (s) genetic values. For the GFBLUP [50,51,52] model, K p = G p = M p M p 2 j = 1 m p = 1 p j 1 p j , M p denotes the MAF-adjusted genotype matrix of SNPs mapped to the p t h pathway and m p denotes the number of SNPs mapped to the p t h pathway. For biBLUP, K p is the Gaussian kernel calculated with the formula mentioned above using SNPs mapped to the p t h pathway. If the target trait is genetically controlled by gene interaction effects and certain KEGG pathway can capture these interaction effects, then biBLUP is expected to outperform GFBLUP in predictive ability. The models mentioned above are summarized in Table 8.

4.6. Predictive Accuracy Assessment

The predictive ability of the genomic prediction models was evaluated through 20 repetitions of 5-fold random cross-validation. Predictive ability was defined as the Pearson’s correlation between the predicted total genetic values and the DRPs in the validation set.

5. Conclusions

In this study, we estimated genetic parameters and assessed the genomic predictive ability of semen traits in a large Duroc boar population. Moderate to high heritability and repeatability of ABN, VOL, DEN, and MOT were estimated. Moreover, genomic prediction achieved moderate accuracy of the studied semen traits, indicating the possibility of utilizing genomic selection in the genetic improvement of semen traits. Integrating KEGG pathways (biBLUP) showed a considerable improvement of predictive ability in the studied traits, especially for sperm morphological abnormality (ABN). To our knowledge, this study reported the genetic parameters and genomic predictive ability of semen traits based on the largest boar semen ejection records and number of boars in a single breeding population around the world so far.

Author Contributions

Conceived the study, N.G., F.Y. and Y.C.; Designed the experiments, Y.C., F.Y. and N.G.; Performed the experiments and analyzed the data, Y.C., Y.H., F.Y. and Y.Y. (Yanda Yang); Wrote the paper, Y.C., N.G. and F.Y.; Reviewed the manuscript, N.G., J.H., Y.Z., M.R. and Y.Y.(Yulong Yin); Results visualization, Y.C. and Y.H.; Codes development, N.G., Y.H. and Y.C.; Visualization, Y.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the National Natural Science Foundation of China (32372855); the Scientific Research Fund of Hunan Provincial Education Department (22B0219); the Natural Science Foundation of Guangdong Province General Program (2021A1515010770); and the Science And Technology Innovation Program of Hunan Province (2024RC3181).

Institutional Review Board Statement

The animal study was reviewed and approved, and the experimental animal procedures were carried out in accordance with the approved protocols of Animal Science and Technology College of Hunan Agriculture University (No. 2021-13).

Data Availability Statement

The data presented in this study are available on request from the corresponding author for purely scientific (non-commercial) reasons.

Acknowledgments

We extend our sincere thanks to the Scientific Research Fund of Hunan Provincial Education Department for their support of our research. In addition, we are grateful for the financial support provided by the National Natural Science Foundation of China General Program and the Natural Science Foundation of Guangdong Province General Program. Each author has made significant contributions to this work and has given their final approval to the version of the manuscript submitted.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Li, X.; Jiang, B.; Wang, X.; Liu, X.; Zhang, Q.; Chen, Y. Estimation of Genetic Parameters and Season Effects for Semen Traits in Three Pig Breeds of South China. J. Anim. Breed. Genet. 2019, 136, 183–189. [Google Scholar] [CrossRef] [PubMed]
  2. Kanehisa, M.; Sato, Y.; Kawashima, M.; Furumichi, M.; Tanabe, M. KEGG as a Reference Resource for Gene and Protein Annotation. Nucleic Acids Res. 2016, 44, D457–D462. [Google Scholar] [CrossRef] [PubMed]
  3. Kanehisa, M.; Goto, S.; Kawashima, S.; Okuno, Y.; Hattori, M. The KEGG Resource for Deciphering the Genome. Nucleic Acids Res. 2004, 32, D277–D280. [Google Scholar] [CrossRef] [PubMed]
  4. Henderson, C.R. Best Linear Unbiased Estimation and Prediction under a Selection Model. Biometrics 1975, 31, 423–447. [Google Scholar] [CrossRef]
  5. Wang, H.; Misztal, I.; Aguilar, I.; Legarra, A.; Muir, W.M. Genome-Wide Association Mapping Including Phenotypes from Relatives without Genotypes. Genet Res 2012, 94, 73–83. [Google Scholar] [CrossRef]
  6. Hayes, B.J.; Bowman, P.J.; Chamberlain, A.J.; Goddard, M.E. Invited Review: Genomic Selection in Dairy Cattle: Progress and Challenges. J. Dairy Sci. 2009, 92, 433–443. [Google Scholar] [CrossRef]
  7. Goddard, M.E.; Hayes, B.J. Mapping Genes for Complex Traits in Domestic Animals and Their Use in Breeding Programmes. Nat. Rev. Genet. 2009, 10, 381–391. [Google Scholar] [CrossRef]
  8. Montesinos-López, O.A.; Crossa, J.; Saint Pierre, C.; Gerard, G.; Valenzo-Jiménez, M.A.; Vitale, P.; Valladares-Cellis, P.E.; Buenrostro-Mariscal, R.; Montesinos-López, A.; Crespo-Herrera, L. Multivariate Genomic Hybrid Prediction with Kernels and Parental Information. Int. J. Mol. Sci. 2023, 24, 13799. [Google Scholar] [CrossRef]
  9. VanRaden, P.M. Efficient Methods to Compute Genomic Predictions. J. Dairy Sci. 2008, 91, 4414–4423. [Google Scholar] [CrossRef]
  10. Meuwissen, T.H.; Hayes, B.J.; Goddard, M.E. Prediction of Total Genetic Value Using Genome-Wide Dense Marker Maps. Genetics 2001, 157, 1819–1829. [Google Scholar] [CrossRef]
  11. Habier, D.; Fernando, R.L.; Dekkers, J.C.M. The Impact of Genetic Relationship Information on Genome-Assisted Breeding Values. Genetics 2007, 177, 2389–2397. [Google Scholar] [CrossRef] [PubMed]
  12. Gianola, D.; van Kaam, J.B.C.H.M. Reproducing Kernel Hilbert Spaces Regression Methods for Genomic Assisted Prediction of Quantitative Traits. Genetics 2008, 178, 2289–2303. [Google Scholar] [CrossRef] [PubMed]
  13. Ye, H.; Xu, Z.; Bello, S.F.; Zhu, Q.; Kong, S.; Zheng, M.; Fang, X.; Jia, X.; Xu, H.; Zhang, X.; et al. Haplotype Analysis of Genomic Prediction by Incorporating Genomic Pathway Information Based on High-Density SNP Marker in Chinese Yellow-Feathered Chicken. Poult. Sci. 2023, 102, 102549. [Google Scholar] [CrossRef] [PubMed]
  14. Jiang, Y.; Reif, J.C. Modeling Epistasis in Genomic Selection. Genetics 2015, 201, 759–768. [Google Scholar] [CrossRef]
  15. Martini, J.W.R.; Wimmer, V.; Erbe, M.; Simianer, H. Epistasis and Covariance: How Gene Interaction Translates into Genomic Relationship. Theor. Appl. Genet. 2016, 129, 963–976. [Google Scholar] [CrossRef]
  16. Raffo, M.A.; Sarup, P.; Guo, X.; Liu, H.; Andersen, J.R.; Orabi, J.; Jahoor, A.; Jensen, J. Improvement of Genomic Prediction in Advanced Wheat Breeding Lines by Including Additive-by-Additive Epistasis. Theor. Appl. Genet. 2022, 135, 965–978. [Google Scholar] [CrossRef]
  17. Vojgani, E.; Pook, T.; Simianer, H. Phenotype Prediction Under Epistasis. In Epistasis: Methods and Protocols; Wong, K.-C., Ed.; Springer: New York, NY, USA, 2021; pp. 105–120. ISBN 978-1-07-160947-7. [Google Scholar]
  18. Hu, Z.; Li, Y.; Song, X.; Han, Y.; Cai, X.; Xu, S.; Li, W. Genomic Value Prediction for Quantitative Traits under the Epistatic Model. BMC Genet. 2011, 12, 15. [Google Scholar] [CrossRef]
  19. Sokolov, A.; Carlin, D.E.; Paull, E.O.; Baertsch, R.; Stuart, J.M. Pathway-Based Genomics Prediction Using Generalized Elastic Net. PLoS Comput. Biol. 2016, 12, e1004790. [Google Scholar] [CrossRef]
  20. Gao, N.; Teng, J.; Ye, S.; Lin, Q.; Gao, Y.; Wang, J.; Huang, S.; He, J.; Li, J.; Chen, Y.; et al. Capturing Biological Interactions Improves Predictive Ability of Complex Traits via Epistatic Model. PREPRINT (Version 1), Research Square. 14 April 2024. [Google Scholar] [CrossRef]
  21. Speed, D.; Hemani, G.; Johnson, M.R.; Balding, D.J. Improved Heritability Estimation from Genome-Wide SNPs. Am. J. Hum. Genet. 2012, 91, 1011–1021. [Google Scholar] [CrossRef]
  22. Marques, D.B.D.; Lopes, M.S.; Broekhuijse, M.L.W.J.; Guimarães, S.E.F.; Knol, E.F.; Bastiaansen, J.W.M.; Silva, F.F.; Lopes, P.S. Genetic Parameters for Semen Quality and Quantity Traits in Five Pig Lines. J. Anim. Sci. 2017, 95, 4251–4259. [Google Scholar] [CrossRef]
  23. Visscher, P.M.; Hill, W.G.; Wray, N.R. Heritability in the Genomics Era--Concepts and Misconceptions. Nat. Rev. Genet. 2008, 9, 255–266. [Google Scholar] [CrossRef] [PubMed]
  24. Barton, N.H.; Keightley, P.D. Understanding Quantitative Genetic Variation. Nat. Rev. Genet. 2002, 3, 11–21. [Google Scholar] [CrossRef] [PubMed]
  25. Smital, J.; Wolf, J.; Sousa, L.L.D. Estimation of Genetic Parameters of Semen Characteristics and Reproductive Traits in AI Boars. Anim. Reprod. Sci. 2005, 86, 119–130. [Google Scholar] [CrossRef] [PubMed]
  26. Hashemi, M.; Vattikonda, A.N.; Sip, V.; Diaz-Pier, S.; Peyser, A.; Wang, H.; Guye, M.; Bartolomei, F.; Woodman, M.M.; Jirsa, V.K. On the Influence of Prior Information Evaluated by Fully Bayesian Criteria in a Personalized Whole-Brain Model of Epilepsy Spread. PLoS Comput. Biol. 2021, 17, e1009129. [Google Scholar] [CrossRef] [PubMed]
  27. Gao, N.; Martini, J.W.R.; Zhang, Z.; Yuan, X.; Zhang, H.; Simianer, H.; Li, J. Incorporating Gene Annotation into Genomic Prediction of Complex Phenotypes. Genetics 2017, 207, 489–501. [Google Scholar] [CrossRef]
  28. Gao, N.; Li, J.; He, J.; Xiao, G.; Luo, Y.; Zhang, H.; Chen, Z.; Zhang, Z. Improving Accuracy of Genomic Prediction by Genetic Architecture Based Priors in a Bayesian Model. BMC Genet. 2015, 16, 120. [Google Scholar] [CrossRef]
  29. Zakharov, S.; Teoh, G.H.K.; Salim, A.; Thalamuthu, A. A Method to Incorporate Prior Information into Score Test for Genetic Association Studies. BMC Bioinform. 2014, 15, 24. [Google Scholar] [CrossRef]
  30. Visscher, P.M.; Brown, M.A.; McCarthy, M.I.; Yang, J. Five Years of GWAS Discovery. Am. J. Hum. Genet. 2012, 90, 7–24. [Google Scholar] [CrossRef]
  31. Alves, K.; Brito, L.F.; Baes, C.F.; Sargolzaei, M.; Robinson, J.A.B.; Schenkel, F.S. Estimation of Additive and Non-Additive Genetic Effects for Fertility and Reproduction Traits in North American Holstein Cattle Using Genomic Information. J. Anim. Breed. Genet. 2020, 137, 316–330. [Google Scholar] [CrossRef]
  32. Nadeau, S.; Beaulieu, J.; Gezan, S.A.; Perron, M.; Bousquet, J.; Lenz, P.R.N. Increasing Genomic Prediction Accuracy for Unphenotyped Full-Sib Families by Modeling Additive and Dominance Effects with Large Datasets in White Spruce. Front. Plant Sci. 2023, 14, 1137834. [Google Scholar] [CrossRef]
  33. Sun, C.; VanRaden, P.M.; Cole, J.B.; O’Connell, J.R. Improvement of Prediction Ability for Genomic Selection of Dairy Cattle by Including Dominance Effects. PLoS ONE 2014, 9, e103934. [Google Scholar] [CrossRef] [PubMed]
  34. Mackay, T.F.C. Epistasis and Quantitative Traits: Using Model Organisms to Study Gene-Gene Interactions. Nat. Rev. Genet. 2014, 15, 22–33. [Google Scholar] [CrossRef] [PubMed]
  35. Kuang, W.; Zhang, J.; Lan, Z.; Deepak, R.N.V.K.; Liu, C.; Ma, Z.; Cheng, L.; Zhao, X.; Meng, X.; Wang, W.; et al. SLC22A14 Is a Mitochondrial Riboflavin Transporter Required for Sperm Oxidative Phosphorylation and Male Fertility. Cell Rep. 2021, 35, 109025. [Google Scholar] [CrossRef] [PubMed]
  36. Vaughan, R.A.; Gannon, N.P.; Garcia-Smith, R.; Licon-Munoz, Y.; Barberena, M.A.; Bisoffi, M.; Trujillo, K.A. β-Alanine Suppresses Malignant Breast Epithelial Cell Aggressiveness through Alterations in Metabolism and Cellular Acidity in Vitro. Mol. Cancer 2014, 13, 14. [Google Scholar] [CrossRef]
  37. Hu, S.-G.; Liang, A.-J.; Yao, G.-X.; Li, X.-Q.; Zou, M.; Liu, J.-W.; Sun, Y. The Dynamic Metabolomic Changes throughout Mouse Epididymal Lumen Fluid Potentially Contribute to Sperm Maturation. Andrology 2018, 6, 247–255. [Google Scholar] [CrossRef]
  38. Monavari, S.H.; Vaziri, M.S.; Khalili, M.; Shamsi-Shahrabadi, M.; Keyvani, H.; Mollaei, H.; Fazlalipour, M. Asymptomatic Seminal Infection of Herpes Simplex Virus: Impact on Male Infertility. J. Biomed. Res. 2013, 27, 56–61. [Google Scholar] [CrossRef]
  39. Akhigbe, R.E.; Dutta, S.; Hamed, M.A.; Ajayi, A.F.; Sengupta, P.; Ahmad, G. Viral Infections and Male Infertility: A Comprehensive Review of the Role of Oxidative Stress. Front. Reprod. Health 2022, 4, 782915. [Google Scholar] [CrossRef]
  40. Wang, Z.; Ma, Y.; Chen, Z.; Yang, R.; Liu, Q.; Pan, J.; Wang, J.; Liu, Y.; Zhou, M.; Zhang, Y.; et al. COVID-19 Inhibits Spermatogenesis in the Testes by Inducing Cellular Senescence. Front. Genet. 2022, 13, 981471. [Google Scholar] [CrossRef]
  41. Peña, F.J.; Ortiz-Rodríguez, J.M.; Gaitskell-Phillips, G.L.; Gil, M.C.; Ortega-Ferrusola, C.; Martín-Cano, F.E. An Integrated Overview on the Regulation of Sperm Metabolism (Glycolysis-Krebs Cycle-Oxidative Phosphorylation). Anim. Reprod. Sci. 2022, 246, 106805. [Google Scholar] [CrossRef]
  42. Franken, D.R.; Morales, P.J.; Habenicht, U.F. Inhibition of G Protein in Human Sperm and Its Influence on Acrosome Reaction and Zona Pellucida Binding. Fertil. Steril. 1996, 66, 1009–1011. [Google Scholar] [CrossRef]
  43. Gao, N.; Chen, Y.; Liu, X.; Zhao, Y.; Zhu, L.; Liu, A.; Jiang, W.; Peng, X.; Zhang, C.; Tang, Z.; et al. Weighted Single-Step GWAS Identified Candidate Genes Associated with Semen Traits in a Duroc Boar Population. BMC Genom. 2019, 20, 797. [Google Scholar] [CrossRef] [PubMed]
  44. Browning, B.L.; Zhou, Y.; Browning, S.R. A One-Penny Imputed Genome from Next-Generation Reference Panels. Am. J. Hum. Genet. 2018, 103, 338–348. [Google Scholar] [CrossRef] [PubMed]
  45. McKenna, A.; Hanna, M.; Banks, E.; Sivachenko, A.; Cibulskis, K.; Kernytsky, A.; Garimella, K.; Altshuler, D.; Gabriel, S.; Daly, M.; et al. The Genome Analysis Toolkit: A MapReduce Framework for Analyzing next-Generation DNA Sequencing Data. Genome Res. 2010, 20, 1297–1303. [Google Scholar] [CrossRef] [PubMed]
  46. Druet, T.; Macleod, I.M.; Hayes, B.J. Toward Genomic Prediction from Whole-Genome Sequence Data: Impact of Sequencing Design on Genotype Imputation and Accuracy of Predictions. Heredity 2014, 112, 39–47. [Google Scholar] [CrossRef]
  47. Van der Auwera, G.A.; Carneiro, M.O.; Hartl, C.; Poplin, R.; Del Angel, G.; Levy-Moonshine, A.; Jordan, T.; Shakir, K.; Roazen, D.; Thibault, J.; et al. From FastQ Data to High Confidence Variant Calls: The Genome Analysis Toolkit Best Practices Pipeline. Curr. Protoc. Bioinform. 2013, 43, 11.10.1–11.10.33. [Google Scholar] [CrossRef]
  48. Garrick, D.J.; Taylor, J.F.; Fernando, R.L. Deregressing Estimated Breeding Values and Weighting Information for Genomic Regression Analyses. Genet. Sel. Evol. 2009, 41, 55. [Google Scholar] [CrossRef]
  49. Tenenbaum, D. KEGGREST: Client-Side REST Access to KEGG; R Package Version 1.46.0; Bioconductor: Riverside, CA, USA, 2013. [Google Scholar] [CrossRef]
  50. Edwards, S.M.; Sørensen, I.F.; Sarup, P.; Mackay, T.F.C.; Sørensen, P. Genomic Prediction for Quantitative Traits Is Improved by Mapping Variants to Gene Ontology Categories in Drosophila Melanogaster. Genetics 2016, 203, 1871–1883. [Google Scholar] [CrossRef]
  51. Fang, L.; Sahana, G.; Ma, P.; Su, G.; Yu, Y.; Zhang, S.; Lund, M.S.; Sørensen, P. Use of Biological Priors Enhances Understanding of Genetic Architecture and Genomic Prediction of Complex Traits within and between Dairy Cattle Breeds. BMC Genom. 2017, 18, 604. [Google Scholar] [CrossRef]
  52. Liu, Y.; Zhang, Y.; Zhou, F.; Yao, Z.; Zhan, Y.; Fan, Z.; Meng, X.; Zhang, Z.; Liu, L.; Yang, J.; et al. Increased Accuracy of Genomic Prediction Using Preselected SNPs from GWAS with Imputed Whole-Genome Sequence Data in Pigs. Animals 2023, 13, 3871. [Google Scholar] [CrossRef]
Figure 1. Genetic (a) and phenotypic correlations (b) of boar semen traits.
Figure 1. Genetic (a) and phenotypic correlations (b) of boar semen traits.
Ijms 25 13155 g001
Table 1. Estimation of variance components and genetic parameters for semen traits.
Table 1. Estimation of variance components and genetic parameters for semen traits.
Trait σ α 2 ± s e σ e 2 ± s e σ p e 2 ± s e h 2 ± s e r 2 ± s e
VOL8.496 ± 1.01722.269 ± 0.0607.744 ± 0.6800.221 ± 0.0240.422 ± 0.009
DEN1.217 ± 0.1333.422 ± 0.0090.732 ± 0.0840.227 ± 0.0220.363 ± 0.009
MOT0.001 ± 0.0000.003 ± 0.0000.001 ± 0.0000.164 ± 0.0200.322 ± 0.008
ABN0.003 ± 0.0000.003 ± 0.0000.001 ± 0.0000.432 ± 0.0300.587 ± 0.009
σ α 2 : additive genetic variance; σ p e 2 : permanent environmental effect variance; σ e 2 : residual variance; h 2 : heritability; r 2 : repeatability; s e : standard error; VOL: semen volume; DEN: sperm concentration; MOT: sperm motility; ABN: sperm morphological abnormality rate.
Table 2. Genomic predictive ability on semen traits in the Duroc breeding populations (Mean ± SE).
Table 2. Genomic predictive ability on semen traits in the Duroc breeding populations (Mean ± SE).
TraitsGBLUPRKHSGFBLUPbiBLUP
VOL0.495 ± 0.0020.497 ± 0.0020.512 ± 0.0200.564 ± 0.089
DEN0.455 ± 0.0020.460 ± 0.0020.467 ± 0.0110.517 ± 0.061
MOT0.320 ± 0.0020.336 ± 0.0020.339 ± 0.0130.370 ± 0.043
ABN0.403 ± 0.0030.442 ± 0.0020.418 ± 0.0160.484 ± 0.085
VOL: semen volume; DEN: sperm concentration; MOT: sperm motility; ABN: sperm morphological abnormality rate.
Table 3. Predictive ability of the top ten KEGG pathways for VOL (Mean ± SE).
Table 3. Predictive ability of the top ten KEGG pathways for VOL (Mean ± SE).
ModelsGFBLUPbiBLUPFunction
ssc004100.511 ± 0.0210.564 ± 0.089beta-alanine metabolism
ssc040680.512 ± 0.0200.564 ± 0.089FOXO signaling pathway
ssc009100.511 ± 0.0210.563 ± 0.089Nitrogen metabolism
ssc043900.511 ± 0.0210.563 ± 0.089Hippo signaling pathway
ssc047250.511 ± 0.0210.563 ± 0.089CHOLINERGIC SYNAPSE
ssc049500.511 ± 0.0210.563 ± 0.089Maturity onset diabetes of the young
ssc051500.511 ± 0.0210.563 ± 0.089Staphylococcus aureus infection
ssc052310.511 ± 0.0210.563 ± 0.089Choline metabolism in cancer
ssc046400.508 ± 0.0140.542 ± 0.055Hematopoietic cell lineage
The model denotes the identifiers for the KEGG pathways.
Table 4. Predictive ability of the top ten KEGG pathways for DEN (Mean ± SE).
Table 4. Predictive ability of the top ten KEGG pathways for DEN (Mean ± SE).
ModelsGFBLUPbiBLUPFunction
ssc051680.465 ± 0.0110.517 ± 0.061Herpes simplex virus 1 infection
ssc049740.466 ± 0.0110.516 ± 0.061Protein digestion and absorption
ssc005150.466 ± 0.0110.516 ± 0.061Mannose type O-glycan biosynthesis
ssc041360.466 ± 0.0110.516 ± 0.061Autophagy—other
ssc054100.465 ± 0.0110.516 ± 0.061Hypertrophic cardiomyopathy
ssc048140.466 ± 0.0110.516 ± 0.061Motor proteins
ssc030150.466 ± 0.0110.516 ± 0.061mRNA surveillance pathway
ssc046110.466 ± 0.0110.516 ± 0.061PLATELET ACTIVATION
ssc050320.467 ± 0.0110.498 ± 0.041Morphine addiction
ssc047140.465 ± 0.0110.497 ± 0.041Thermogenesis
The model denotes the identifiers for the KEGG pathways.
Table 5. Predictive ability of the top ten KEGG pathways for MOT (Mean ± SE).
Table 5. Predictive ability of the top ten KEGG pathways for MOT (Mean ± SE).
ModelsGFBLUPbiBLUPFunction
ssc042180.333 ± 0.0130.370 ± 0.043Cellular senescence
ssc046600.335 ± 0.0130.365 ± 0.043T cell receptor signaling pathway
ssc004100.334 ± 0.0140.364 ± 0.044beta-alanine metabolism
ssc008300.334 ± 0.0140.364 ± 0.044Retinol metabolism
ssc043300.334 ± 0.0140.364 ± 0.044Notch signaling pathway
ssc046720.334 ± 0.0140.364 ± 0.044Intestinal immune network for IgA production
ssc050320.334 ± 0.0140.364 ± 0.044Morphine addiction
ssc052120.334 ± 0.0140.364 ± 0.044Pancreatic cancer
ssc002800.334 ± 0.0130.364 ± 0.043Valine, leucine, and isoleucine degradation
ssc040600.333 ± 0.0140.364 ± 0.043Cytokine–cytokine receptor interaction
The model denotes the identifiers for the KEGG pathways.
Table 6. Predictive ability of the top ten KEGG pathways for ABN (Mean ± SE).
Table 6. Predictive ability of the top ten KEGG pathways for ABN (Mean ± SE).
ModelsGFBLUPbiBLUPFunction
ssc051330.414 ± 0.0160.484 ± 0.085Pertussis
ssc043700.418 ± 0.0160.482 ± 0.085VEGF signaling pathway
ssc047200.418 ± 0.0160.480 ± 0.085Long-term potentiation
ssc009800.416 ± 0.0160.477 ± 0.085Metabolism of xenobiotics by cytochrome P450
ssc049310.416 ± 0.0160.474 ± 0.085Insulin resistance
ssc005100.415 ± 0.0160.471 ± 0.085N-Glycan biosynthesis
ssc040680.414 ± 0.0170.471 ± 0.085FOXO signaling pathway
ssc052170.415 ± 0.0160.467 ± 0.085Basal cell carcinoma
ssc046120.414 ± 0.0170.465 ± 0.058Antigen processing and presentation
ssc048100.415 ± 0.0170.461 ± 0.058Regulation of actin cytoskeleton
The model denotes the identifiers for the KEGG pathways.
Table 7. Descriptive statistics for phenotypes.
Table 7. Descriptive statistics for phenotypes.
TraitCountMinimumMaximumMeanStandard
Deviation
Coefficient of
Variation (%)
VOL24670.021.9590.1630.00338.428
DEN24670.0119.975.0632.30345.485
MOT24670.510.8990.0586.466
ABN246700.60.09990.0681.986
VOL: semen volume (L); DEN: sperm concentration (109/mL); MOT: sperm motility; ABN: sperm morphological abnormality rate (%).
Table 8. Genomic prediction models in this study.
Table 8. Genomic prediction models in this study.
ModelFormulaGenetic EffectsRelationship Definition
GBLUP y = μ + Z g + e g ~ N ( 0 , G σ g 2 ) G : additive relationship matrix
RKHS y = μ + Z i + e i ~ N ( 0 , K σ i 2 ) K : Gaussian kernel
GFBLUP y = μ + Z g + Z i + e g ~ N 0 , G σ g 2 , i ~ N 0 , G p σ i 2 G p : pathway additive relationship
biBLUP y = μ + Z g + Z i + e g ~ N ( 0 , G σ g 2 ) , i ~ N 0 , K p σ i 2 K p : pathway Gaussian kernel
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Chen, Y.; Yang, F.; Yang, Y.; Hu, Y.; Meng, Y.; Zhang, Y.; Ran, M.; He, J.; Yin, Y.; Gao, N. Genomic Prediction of Semen Traits in Boars Incorporating Biological Interactions. Int. J. Mol. Sci. 2024, 25, 13155. https://doi.org/10.3390/ijms252313155

AMA Style

Chen Y, Yang F, Yang Y, Hu Y, Meng Y, Zhang Y, Ran M, He J, Yin Y, Gao N. Genomic Prediction of Semen Traits in Boars Incorporating Biological Interactions. International Journal of Molecular Sciences. 2024; 25(23):13155. https://doi.org/10.3390/ijms252313155

Chicago/Turabian Style

Chen, Yantong, Fang Yang, Yanda Yang, Yulong Hu, Yang Meng, Yuebo Zhang, Maoliang Ran, Jun He, Yulong Yin, and Ning Gao. 2024. "Genomic Prediction of Semen Traits in Boars Incorporating Biological Interactions" International Journal of Molecular Sciences 25, no. 23: 13155. https://doi.org/10.3390/ijms252313155

APA Style

Chen, Y., Yang, F., Yang, Y., Hu, Y., Meng, Y., Zhang, Y., Ran, M., He, J., Yin, Y., & Gao, N. (2024). Genomic Prediction of Semen Traits in Boars Incorporating Biological Interactions. International Journal of Molecular Sciences, 25(23), 13155. https://doi.org/10.3390/ijms252313155

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop