Next Article in Journal
Microsatellites as Molecular Markers with Applications in Exploitation and Conservation of Aquatic Animal Populations
Next Article in Special Issue
Transcriptome Profiling of Different Developmental Stages on Longissimus Dorsi to Identify Genes Underlying Intramuscular Fat Content in Wannanhua Pigs
Previous Article in Journal
CRISPR/Cas9 Gene Editing System Can Alter Gene Expression and Induce DNA Damage Accumulation
Previous Article in Special Issue
tsRNA Landscape and Potential Function Network in Subcutaneous and Visceral Pig Adipose Tissue
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Molecular Design-Based Breeding: A Kinship Index-Based Selection Method for Complex Traits in Small Livestock Populations

1
College of Animal Sciences, Zhejiang University, Hangzhou 310058, China
2
Key Laboratory of Livestock and Poultry Resources Evaluation and Utilization, Ministry of Agriculture and Rural Affairs, Hangzhou 310058, China
3
Zhejiang Key Laboratory of Dairy Cattle Genetic Improvement and Milk Quality Research, Hangzhou 310058, China
4
Hainan Institute, Zhejiang University, Building 11, Yongyou Industrial Park, Yazhou Bay Science and Technology City, Yazhou District, Sanya 572025, China
5
Hainan Yazhou Bay Seed Laboratory, Yongyou Industrial Park, Yazhou Bay Sci-Tech City, Sanya 572025, China
6
Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai 200240, China
*
Authors to whom correspondence should be addressed.
Genes 2023, 14(4), 807; https://doi.org/10.3390/genes14040807
Submission received: 7 February 2023 / Revised: 22 March 2023 / Accepted: 23 March 2023 / Published: 27 March 2023
(This article belongs to the Special Issue Advances in Pig Breeding and Genetics)

Abstract

:
Genomic selection (GS) techniques have improved animal breeding by enhancing the prediction accuracy of breeding values, particularly for traits that are difficult to measure and have low heritability, as well as reducing generation intervals. However, the requirement to establish genetic reference populations can limit the application of GS in pig breeds with small populations, especially when small populations make up most of the pig breeds worldwide. We aimed to propose a kinship index based selection (KIS) method, which defines an ideal individual with information on the beneficial genotypes for the target trait. Herein, the metric for assessing selection decisions is a beneficial genotypic similarity between the candidate and the ideal individual; thus, the KIS method can overcome the need for establishing genetic reference groups and continuous phenotype determination. We also performed a robustness test to make the method more aligned with reality. Simulation results revealed that compared to conventional genomic selection methods, the KIS method is feasible, particularly, when the population size is relatively small.

1. Introduction

China is home to approximately 84 indigenous pig breeds, which are rich in genetic resources and possess a wide range of desirable and undesirable traits. Some of the desirable traits include high meat quality, large litter size, high tolerance to roughage, and disease resistance, while undesirable traits include slow growth, high feed conversion ratio, and low lean meat rate. Examples of typical indigenous pig breeds include Jinhua and Laiwu pigs. To achieve pigs with desirable traits, one approach is to select indigenous breeds with superior performance or to create new breeds (lines) through crossbreeding between Chinese indigenous breeds and intensive breeds such as Duroc, Landrace, and Large White.
Before the widespread availability of high-throughput sequencing technology, best linear unbiased prediction (BLUP) was the most commonly used selection method in China, which is based on pedigree information and uses a kinship matrix to estimate breeding values. Despite its widespread use in early animal breeding, the accuracy of BLUP is limited by the quality of the pedigree information and performance test scale [1].
The utilization of genomic prediction, through the correlation of whole-genome genotypes with phenotypes in a large sample of individuals, has been established as a means of enhancing quantitative traits in domesticated animals [2,3,4]. However, the reliance on imputed genomic regions can result in decreased prediction accuracy when applied to populations with divergent linkage disequilibrium structures [5].
The advancement of next-generation sequencing technology has greatly increased the volume of pig genomic data, making it possible to utilize this information in breeding studies. The most commonly used method is genomic best linear unbiased prediction (GBLUP), which allows for the direct estimation of an individual’s genetic merit through the construction of a genomic relationship matrix G, instead of a kinship matrix A based on pedigree information. While GBLUP methods assume that all markers have the same genetic variance. In reality, only a few single nucleotide polymorphisms (SNPs) have an impact and are linked to the quantitative trait loci (QTL) affecting the trait, while most SNPs are ineffective [6,7,8].
In the realm of genetic prediction, Bayesian techniques have gained prominence due to their incorporation of marker effect variance as a predetermined prior distribution. This approach encompasses a range of variations, such as BayesA, BayesB, BayesC, Bayesian Lasso, among others. These methods address limitations in the single-trait genomic best linear unbiased prediction (GBLUP) method by assuming that only a few single nucleotide polymorphisms (SNPs) have significant effects, with the variance of these effects following an inverse chi-squared distribution. This assumption aligns more closely with reality, as only a few SNPs are usually linked to the quantitative trait loci (QTLs) affecting the trait, while most SNPs have no effect. As a result, the BayesB method is often more accurate when few QTLs control genetic variation with large effects [9,10,11].
Compared to GBLUP methods, Bayesian models have more estimated parameters, leading to increased computational effort and improved prediction accuracy. However, the computational speed and robustness are not as favorable as those of GBLUP [12,13]. Both Bayesian and GBLUP methods rely on the accuracy of phenotypic determination and require the creation of a large-scale reference population to ensure selection accuracy [8,9].
In the case of Chinese pig breeds with small foundation populations and the breeding work being carried out involves both intraspecific selection and the hybridization of locally adapted varieties with introduced varieties to create new hybrid varieties, a genetic reference population may not exist, and the determination of quantitative traits may not be precise due to a lack of highly qualified breeding staff and refined breeding equipment.
The main purpose of genomic selection is to estimate genome-wide SNP effects based on a reference population and then to sum the genetic effects of candidate individuals based on the specific alleles they carry. This means first identifying an ideal individual; then the more similar a candidate is to an ideal individual, the higher its breeding value. For this purpose, we propose a kinship index based selection (KIS) method. To validate the efficacy and stability of the KIS method, we performed a series of simulations using the MoBPS R package [14].

2. Materials and Methods

2.1. Methods

In the KIS method, the first step is to identify parental varieties that support the breeding objectives. The second step is to design breeding strategies; common breeding strategies include introgression cross breeding, stepwise cross breeding, and multiple cross breeding. The third step is to design ideal individuals with the desired genotype harboring parental advantageous traits, compare the differences between candidate and ideal individuals, and finally, select and cross the individuals with high similarity with the ideal individual. The visual representation of the process is shown in Figure 1A.
The establishment of the ideal individual is the core component of the KIS method, and the accuracy of this method is entirely dependent on the quality of the ideal individual. The first step in building the ideal individual is to identify the advantageous traits in the parent breed. Different pig breeds have specific advantages; the advantageous traits of Chinese indigenous pigs are mainly in meat quality, roughage tolerance, and disease resistance, while that of the introduced breeds has better leanness, growth rate, and feed conversion ratio [15]. The beneficial genotype corresponding to the desired trait is identified after the determination of the advantageous trait. The beneficial genotype can be established by GWAS, QTL database, pig resource census data, etc. The ideal individual containing the desired traits is established. In the actual operation process, the genome is divided into several subsystems based on traits or trait categories. The KIS method calculates the similarity between candidate individuals and ideal individuals based on each trait or trait category, and then synthesizes a comprehensive index (kinship index KI) by weighting according to importance. Considering the existence of multiple effects for one cause, we analyzed the interactions between genes and selected loci for consideration based on the effect value; loci with small effect values will be ignored. The process of selection is illustrated in Figure 1B.
Based on the positional information of the ideal individual, the genotypes of the candidate individuals at these loci were extracted and compared to that of the ideal individual. The results of the comparison are expressed in the form of ordinals of 0, 1, and 2, where 0 represents both disadvantage alleles, 1 represents allelic heterozygosity with half advantageous alleles, and 2 represents advantageous alleles. The sum of the scores for all loci represents the degree of similarity between the candidate individual and the ideal individual. Candidate individuals are ranked according to their similarities to ideal individuals, and those with high similarities will be selected.

2.2. Simulation

2.2.1. Historical Population

The historical population consisted of 200 animals, with an equal number of males and females. For each individual, an underlying true genetic value is calculated for each trait. Based on this, phenotypes can be generated [14]. Two quantitative traits containing 100 additive QTLs and 500 additive QTLs, respectively, were assigned to the historical population where each individual carries a varying number of QTLs. The genomic variance of these traits was simulated to be 1. According to the KIS method, two purebred populations, each to be selected for one quantitative trait, were created from the historical population. The purebred populations were raised separately for 10 generations. In each generation, 5 sires and 20 dams were crossed to obtain 200 offspring, of which the numbers of males and females were equal. The selection of sires and dams depended on the advantage of genotypic similarity with the ideal individual and was independent for the two purebred populations. Hereon, the two purebred populations are referred to as population A and population B. The simulation process for historical populations is shown in Figure 2A. Figure 2B shows principal components analysis of individuals in the founder population and generation 10 of population A and population B; the x-axis denotes the first principal component and the y-axis represents the second principal component.

2.2.2. Breeding Program

We implemented hybridization and pure breeding programs to test the effectiveness of different breeding approaches. During the hybridization process, the first hybrid generation was obtained by mating five sires from population A and 20 dams from population B from generation 10 of their respective populations. Subsequently, 20 females from the first hybrid generation were mated with five males from population B of generation 10. The two generations in the hybridization process were referred to as generations 11 and 12, respectively. Animals in the purebred reproduction breeding process were mated with their contemporaries, and five males and 20 females were selected to maintain the population in each generation. Five generations of the purebred reproduction breeding process were simulated to examine the effectiveness of diverse breeding methods.

2.2.3. Evaluation Criteria

An animal’s true breeding value was used as an evaluation criterion to evaluate the effectiveness of different breeding methods. The advantageous genotype, as well as, the effect of each QTL was considered in the simulation, therefore, an individual’s true breeding value was calculated as follows:
T B V i = j = 1 n Q T L Z i j a Q T L j ,
where T B V i is the true breeding value of individual i, a Q T L is the vector that includes the additive QTL effect values, and Z is the genotype matrix for all animals, which is coded by 0, 1, 2. All simulations were repeated 10 times and the resultant data were tested for significance using ANOVA.

2.3. Feasibility Test

Four selection methods, including KIS, GBLUP, BayesB, and strategy based on true breeding value (TBV), were implemented in the simulation. The effectiveness of each selection method was evaluated by the true breeding values of the simulated individual. In addition, negative control simulation was implemented by replacing the ideal individual with an individual with 600 noneffective loci and random beneficial alleles.

2.4. Robustness Test

Robustness tests were implemented to fit better replicate the real situation, including QTL with different false negative rates, QTL with different false positive rates, QTL with different quantitative gradients, and the scale of foundation population and selection proportion, and simulation of dominant and epistatic effects.

2.4.1. QTL with Different False Negative Rates

QTL for the ideal individual was sorted considering the smallest to greatest effect values, that is, 10%, 20%, and 30% of the QTL were randomly selected for deletion to reconstruct the ideal individual (false negative individual), and the missing QTL of the ideal individual with a low absence rate was contained in the one with high absence rate. The selection and mating of simulated individuals were optimized according to the false negative individual. Nevertheless, the TBV of the mating individuals was calculated using true QTL effects.
The advanced KIS method did not consider QTL effects when establishing the ideal individual, therefore, we built an ideal individual containing beneficial genotypes and QTL effects. Other basic settings were as previously described.

2.4.2. QTLs with Different Pseudo-Positive Rates

The pseudo-positive rates were the same as those in the false negative test, which included three gradients. However, pseudo-positive QTLs were adjacent to true QTLs on chromosomes. The false positive QTL did not duplicate true QTL, and beneficial genotypes were assigned randomly. Other basic settings were as previously described.

2.4.3. QTLs with Different Quantitative Gradients

In addition to the combination of 100 and 500, which represent the number of QTLs for two quantitative traits, three other combinations, including, 100–350, 150–400, and 200–500 were considered to investigate the robustness of the KIS method. The number of QTLs for the other three selection methods subsequently changed. Other basic settings were as previously described.

2.4.4. Scale of Foundation Population

In this context, different scales of the foundation population were taken into consideration to detect the robustness of the KIS method, while the number of QTLs was the same as in the previous simulation (100–500). Five different gradients of the foundation population sizes, that is, 2000, 1000, 500, 200, and 50, were utilized to examine if the KIS method was still valid in the disparate scale of the foundation population. The ratio of male to female was half in all the selection methods. Five sires and 20 dams were selected for mating according to the criteria corresponding to different selection methods. Other basic settings were as previously described.

2.4.5. Selection Proportion

For the examination of the selection proportion, the scale of the foundation population was fixed at 200, with an equal number of sires and dams. Five selection proportions: 0.2–0.5, 0.15–0.4, 0.1–0.3, 0.05–0.2, and 0.03–0.1 were simulated. In these combinations, each group of figures represented the proportion of male and female animals in the foundation population. Other basic settings were the same as previously described.

2.4.6. Simulation of Dominance and Epistatic Effects

In the simulation of dominance and epistatic effects, the impact of each effect on the accuracy of the KIS method was first simulated separately, followed by a simultaneous consideration of both effects on the simulation results. Different QTL gradient settings were employed in the simulation, representing 10%, 20%, and 30% of the total number of QTLs with additive effects.

3. Result

3.1. Negative Control Simulation

Figure 3 shows the comparison of the KIS method and negative control for determining the breeding values of boars in all generations. The comparison reflected that the breeding value of the boars in the KIS method increased in succeeding generations. Moreover, the breeding values of boars corresponding to the negative control remained unchanged except for in the F1 generation. This may be because the F1 boars were mainly heterozygotes, and heterozygotes were assigned a value of 1 during calculation. The same phenomenon was consistent in the selected group of sows.

3.2. Feasibility Test

The comparison of the KIS method with conventional breeding methods for determining the breeding values of boars and sows is illustrated in Figure 3. The breeding values corresponding to all breeding methods increased in succeeding generations. The TBV method had the highest breeding values among all generations, followed by the KIS method, whose value was higher than those of the BayesB and GBLUP methods. The results of the significance tests for breeding values are presented Table 1. The significance test results show no significant difference between the breeding values corresponding to the various breeding methods until the G3 generation (generation 3). In contrast, G4 generation (generation 4) onwards, there was a considerable difference between TBV and other breeding methods.

3.3. QTLs with Different False Negative Rates

The breeding values corresponding to the KIS method at different deletion rates are shown in Figure 4A,B. The breeding values corresponding to the KIS method increased in succeeding generations at different absence rates, and there was no apparent regularity in the scale of breeding values between absence rates. The significance test results are presented in Table 2; the differences in breeding values corresponding to absence deletion rates were not significant in any generation.
Data corresponding to the KIS method with zero absence rate were selected for comparison with general breeding methods; the results were the same as those shown in Figure 3. Figure 4C,D shows the results of the comparison of four different absence rates with QTL effects taken into consideration. The results of the significance test (Table 2) indicate that there was no significant difference between the breeding values associated with varying rates of absence until generation G3, while the breeding values related to a 30% absence rate in generation G4 were significantly different. The breeding values associated with a 20% absence rate in generation G5 also show significant differences compared to the other absence rates.

3.4. QTLs with Different Pseudo-Positive Rates

The results of the pseudo-positive test are shown in Figure 5A,B. The comparison of breeding values for different pseudo-positive rates shows that the breeding value decreased as the pseudo-positive rate increased in succeeding generations. The significance test shows that the differences between breeding values corresponding to different pseudo-positive rates were not significant in any generation. The results of the significance tests for breeding values are shown in Table 3.
The result of the comparison of the KIS method with the other selection methods is presented in Figure 5C,D. In addition, data on breeding values relating to the KIS method with a 30% false positive rate were compared with those of the conventional methods and showed that the KIS method was less effective than the TBV method but better than the rest of the selection methods. The results of the significance tests for breeding values are presented in Table 3. The significance test results showed no significant difference between the breeding values corresponding to the various breeding methods until the G3 generation. In contrast, G4 generation onwards, there was a considerable difference between TBV and other breeding methods.

3.5. QTLs with Different Quantitative Gradients

The results of the three quantitative gradients regarding the number of different QTL tests are presented in Figure 6, and the significance test results are shown in Table 4. The results of breeding values show the same regularity regardless of the quantitative gradient, i.e., the KIS method is less effective than the TBV method but better than the other methods. Furthermore, the results of the significance test showed that the TBV method showed significant differences compared with the other selection methods only at generation G5 when the number of QTLs was small. In contrast, the TBV method showed significant differences compared with the other selection methods at generation G1 as the number of QTLs increased. The KIS method showed no substantial compared differences with the BayesB and GBLUP methods in any generation.

3.6. Scale of Foundation Population

Figure 7A,B illustrates the results of the foundation population size simulation. The comparison reflected that breeding values increased as the scale of the foundation population increased, however, the difference in breeding values between the foundation population scales was not significant. Only at a foundation population size of 100 in the G5 generation, the breeding value was significantly different from those of the other foundation population scales. The results of the significance tests for breeding values with scales of different foundation population is shown in Table 5.

3.7. Selection Proportion

The selection proportion simulation experiment results are presented in Figure 7C,D and show that the breeding value is essentially inversely proportional to the selection proportion in both males and females. In addition, the significance test results showed that the effect of different selection proportions on breeding values does not differ significantly in all generations. The results of the significance tests for breeding values with different selection proportions are shown in Table 5.

3.8. Simulation Results of Dominance and Epistatic Effects

Figure 8 presents the results of considering only dominant effects, only epistatic effects, and both dominant and epistatic effects, from top to bottom, and the number of QTL from 10% to 30%, from left to right. Notably, when accounting for both dominant and epistatic effects, a declining trend in breeding values was observed for each generation as the number of QTL increased.
Compared to the conventional breeding method, individuals selected through the KIS method exhibited superior performance in terms of true breeding values, with overall higher breeding values than those selected by the conventional method, albeit with a slight reduction in just a few generations. Importantly, individuals selected based solely on true breeding values achieved the highest scores, and all KIS scores were inferior to these. In addition, the results of the significance test show that there is no significant difference in breeding value among KIS method and conventional breeding methods. The results of the significance tests for breeding values with different selection proportions are shown in Supplementary Table S1.

4. Discussion

In this study, we aimed to assess the effectiveness and stability of the KIS method in a breeding program. The KIS method is distinct from previous selection methods in the following ways: (1) The selection criteria are established by identifying an ideal individual, (2) The measurement of phenotypic traits is not a crucial aspect of the breeding process, and (3) A large reference population is not necessary, making the KIS method appropriate for small pig populations.
The effectiveness of genomic selection is dependent on the linkage disequilibrium between causative mutations and the SNP markers (generally numbering approximately 50,000) utilized in genomic predictions [16]. The dairy cattle industry has widely adopted this technology, with over 4 million animals genotyped via SNP arrays. Despite its success in narrowly defined populations, such as Holstein-Friesian dairy cattle [9]. However, in the field of pig breeding, the use of whole-genome markers is not feasible due to the multitude of indigenous pig breeds in China. Furthermore, the SNP markers predominantly utilized are derived from breeds such as Duroc, Landrace, and Large White, which exhibit a significant genetic distance from the indigenous pig breeds of China [17].
Under almost all the mentioned simulation conditions above, the selection effectiveness of the KIS method was better than that of the GBLUP and BayesB methods but lower than that of the TBV method across all generations. This may be due to the limited sample size in the simulation population, which hinders the full potential of the GBLUP and Bayes methods and reduces their accuracy, this also demonstrates that the KIS method has its own unique features in the case of small population size.
In the negative control simulation experiment, the breeding value of F1 was significantly lower than that at other ages. This may be because the F1 generation mainly included heterozygotes, and the heterozygotes were assigned a value of 1 during calculation.
In the simulation experiment for false negatives, the performance of individuals selected through the equally weighted KIS method showed no decrease in their true breeding values as the rate of absence increased. However, this result was not in line with expectations as the individuals selected by the equally weighted KIS method were observed to have a higher abundance of favorable gene loci, however, the total of these loci did not yield the highest values in terms of breeding values, in the scenario of low absence rates. This discrepancy could also be due to a truncated selection mating strategy, as the breeding values of offspring produced through different mating combinations are not always positively correlated with the breeding values of their parents. When assigning equal weights, certain QTLs with large effects may not be favored solely based on the number of favorable genes they possess. However, by removing part of QTLs with minor effects, the weights of these large-effect QTLs are amplified, which could help explain this phenomenon. In other words, removing part of the influence of small-effect QTLs through the equal-weighted approach may enable the large-effect QTLs to receive the recognition they deserve.
The simulation results for false positives were as anticipated, with a decrease in breeding values as the false positive rate increased. Despite this, the KIS method was generally consistent with both the GBLUP and BayesB methods, even when the false positive rate reached 30%.
The patterns observed in the three distinct QTL quantitative gradients were not completely congruent with prior simulations. While the patterns of the breeding values remained unchanged, the results of the significance tests revealed that as the number of QTLs increased, the TBV method demonstrated significant disparities compared to the other methods at earlier generations. For instance, in the first QTL quantitative gradient (350–100), significant differences were only noted in generation G5, while in the third QTL quantitative gradient (500–200), significant disparities appeared as early as generation G1. This may be attributed to the increase in the breeding value of the ideal individual with an increasing number of simulated QTL and the consequent loss of more QTL loci during the transmission hybridization process.
In practice, balancing breeding effectiveness with production costs requires controlling the scale of the foundation population within acceptable limits. Results from simulation studies indicated that increasing the size of the foundation population leads to a corresponding increase in breeding effectiveness. However, statistical significance tests of the KIS method revealed little difference between different foundation population scales in generations other than the fifth, where a foundation population size of 100 led to a substantial difference. Additionally, the results of the seed reservation simulations were as expected, with breeding values rising with each generation as reservation rates decreased. These results suggest that breeders using the KIS method have the flexibility to tailor the reservation rate and foundation population scale to meet specific requirement.
When dominant and epistatic effects were considered in the simulation, the selection effect of KIS gradually decreased as the proportion of dominant and epistatic effects QTL number increased, probably because the proportion of additive effects gradually decreased when dominant and epistatic effects were taken into account in the design of ideal individuals.
When considering only additive effects, we calculated the coefficient of variation corresponding to four different methods within the same generation, and the results are shown in the Supplementary Table S2. In the table, the coefficient of variation for the KIS method is very close to that of the TBV method. After the G2 generation, the CV for the KIS method is slightly higher than that of the TBV method, which may be due to the fact that the KIS method did not assign weights to each locus during selection.
The simulation study presented in this paper showcases the effectiveness of the KIS method in selecting individuals for breeding. Our model selectively bred individuals from the same generation, under similar environmental conditions, and with a focus on a few key traits. This method has proven to be effective in enhancing genetic gain while minimizing the risk of negative traits being unintentionally selected for. However, it is important to note that our model only considered individuals from the same generation and under similar environmental conditions, without accounting for external factors such as field effects and maternal effects. In the process of hybrid breeding, selection is carried out in the same generation and the same gender, considering only the genotypes and excluding the influence of environmental effects.

5. Limitations and the Future of Applying the KIS Method

The effectiveness of the kinship index based selection (KIS) method is limited by the accuracy and coverage of functional genes in pigs. Previous studies have reported on the association of functional genes with traits in pigs, and databases, such as The Farm animal Genotype-Tissue Expression TWAS server (http://twas.farmgtex.org/) [18] and PigVar [19], exist which combine non-redundant variations and evolutionary selective scores. Results from the simulation study indicated that the mating method used may impact selection by the KIS method and that the truncated selection method used may not be the most optimal. Previous research has suggested that incorporating advantageous effects into the mating method can balance genetic progression and genetic diversity, albeit with some loss of genetic progression [20,21,22]. In the KIS method, a truncation selection approach is adopted. After the selection of males and females is completed, they are randomly paired without considering the impact of inbreeding coefficients on breeding results. In actual breeding processes, conventional mating methods can be used to avoid rapid increases in inbreeding coefficients, some mating methods have been mentioned in the previous text. In subsequent studies of the KIS method, we will continue to research mating methods for improvement. In future studies, the authors aim to provide a database of beneficial genotypes and mating methods to support the KIS method.

6. Conclusions

The KIS method solves the problem of phenotypic determination and establishment of genomic reference populations in the breeding process by establishing an ideal individual; it is a breeding method that is easy to implement, has a wide range of applications, and has high accuracy. The application value of the KIS method will continue to improve as research on pig functional genes progresses.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes14040807/s1, Table S1: The result of significance tests for non-additive effect simulations. Table S2: The result of variation corresponding to four different methods within the same generation.

Author Contributions

Conceptualization, Y.P.; methodology, Q.W.; validation, J.G. (Jiamin Gu), J.G. (Jianwei Guo) and Y.X.; formal analysis, J.G. (Jiamin Gu); investigation, J.G. (Jiamin Gu); resources, Y.P. and Q.W.; data curation, J.G. (Jiamin Gu); writing—original draft preparation, J.G. (Jiamin Gu) and J.G. (Jianwei Guo); writing—review and editing, Y.P., Q.W. and Q.R.Q.; supervision, Z.W., Z.Z. (Zhe Zhang) and Z.Z. (Zhenyang Zhang); project administration, Y.P.; funding acquisition, Q.W. and Y.P. All authors have read and agreed to the published version of the manuscript.

Funding

This work was financially supported by the National Natural Science Foundation of China (grant nos. 31972534, 32272833), Joint Research Project of the Ministry of Agriculture and rural affairs (19220034, 19220035), Zhejiang Science and Technology Major Program on Agricultural New Variety Breeding (Grant number: 2021C02068-1) and Hainan Provincial Key R & D Program of China (ZDYF2021XDNY296).

Data Availability Statement

All simulation-related codes are accessible at the following URLs: http://alphaindex.zju.edu.cn/ALPHADB/download.html.

Acknowledgments

Thanks to all the teachers and students of Panlab for their help, especially Pan and Wang for their guidance on design ideas, and Miao, Zhang, and Chen for their technical guidance.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Yan, B.; Yan, J.; Shi, W.; Li, Y. Study on the comprehensive comparative advantages of pig production and development in China based on geographic information system. Clean Technol. Environ. Policy 2020, 22, 105–117. [Google Scholar] [CrossRef]
  2. Goddard, M.E.; Kemper, K.E.; MacLeod, I.M.; Chamberlain, A.J.; Hayes, B.J. Genetics of complex traits: Prediction of phenotype, identification of causal polymorphisms and genetic architecture. Proc. R. Soc. B Boil. Sci. 2016, 283, 20160569. [Google Scholar] [CrossRef] [Green Version]
  3. Hayes, B.J.; Lewin, H.A.; Goddard, M.E. The future of livestock breeding: Genomic selection for efficiency, reduced emissions intensity, and adaptation. Trends Genet. 2013, 29, 206–214. [Google Scholar] [CrossRef]
  4. Hayes, B.; Bowman, P.; Chamberlain, A.; Goddard, M. Invited review: Genomic selection in dairy cattle: Progress and challenges. J. Dairy Sci. 2009, 92, 433–443. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Lund, M.S.; Su, G.; Janss, L.; Guldbrandtsen, B.; Brøndum, R.F. Genomic evaluation of cattle in a multi-breed context. Livest. Sci. 2014, 166, 101–110. [Google Scholar] [CrossRef] [Green Version]
  6. Koivula, M.; Strandén, I.; Su, G.; Mäntysaari, E. Different methods to calculate genomic predictions—Comparisons of BLUP at the single nucleotide polymorphism level (SNP-BLUP), BLUP at the individual level (G-BLUP), and the one-step approach (H-BLUP). J. Dairy Sci. 2012, 95, 4065–4073. [Google Scholar] [CrossRef] [Green Version]
  7. Pérez-Enciso, M.; Zingaretti, L.M. A Guide on Deep Learning for Complex Trait Genomic Prediction. Genes 2019, 10, 553. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. Moeinizade, S.; Kusmec, A.; Hu, G.; Wang, L.; Schnable, P. Multi-trait Genomic Selection Methods for Crop Improvement. Genetics 2020, 215, 931–945. [Google Scholar] [CrossRef] [PubMed]
  9. Meuwissen, T.H.E.; Hayes, B.J.; Goddard, M.E. Prediction of Total Genetic Value Using Genome-Wide Dense Marker Maps. Genetics 2001, 157, 1819–1829. [Google Scholar] [CrossRef]
  10. Habier, D.; Fernando, R.L.; Kizilkaya, K.; Garrick, D.J. Extension of the bayesian alphabet for genomic selection. BMC Bioinform. 2011, 12, 186. [Google Scholar] [CrossRef] [Green Version]
  11. Yi, N.; Xu, S. Bayesian LASSO for quantitative trait loci mapping. Genetics 2008, 179, 1045–1055. [Google Scholar] [CrossRef] [Green Version]
  12. Zeng, J.; Pszczola, M.; Wolc, A.; Strabel, T.; Fernando, R.L.; Garrick, D.J.; Dekkers, J.C.M. Genomic breeding value prediction and QTL mapping of QTLMAS2011 data using Bayesian and GBLUP methods. BMC Proc. 2012, 6 (Suppl. 2), S7. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Teng, J.; Gao, N.; Zhang, H.; Li, X.; Li, J.; Zhang, H.; Zhang, X.; Zhang, Z. Performance of whole genome prediction for growth traits in a crossbred chicken population. Poult. Sci. 2019, 98, 1968–1975. [Google Scholar] [CrossRef]
  14. Pook, T.; Schlather, M.; Simianer, H. MoBPS—Modular Breeding Program Simulator. G3 Genes Genomes Genet. 2020, 10, 1915–1918. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Chen, Z.; Teng, J.; Diao, S.; Xu, Z.; Ye, S.; Qiu, D.; Zhang, Z.; Pan, Y.; Li, J.; Zhang, Q. Insights into the architecture of human-induced polygenic selection in Duroc pigs. J. Anim. Sci. Biotechnol. 2022, 13, 99. [Google Scholar] [CrossRef]
  16. Giuffra, E.; Tuggle, C.K. FAANG Consortium Functional Annotation of Animal Genomes (FAANG): Current Achievements and Roadmap. Annu. Rev. Anim. Biosci. 2019, 7, 65–88. [Google Scholar] [CrossRef]
  17. Tao, J.; Qin, Z.-Q.; Tao, Y.; Wen, L.; Shu, X.-S.; Wang, Z.-C.; Liu, X.-W.; Li, W.-J.; Hu, W.-X. Genetic relationships among Chinese pigs and other pig populations from Hunan Province, China. Anim. Genet. 2007, 38, 417–420. [Google Scholar] [CrossRef] [PubMed]
  18. Zhang, Z.; Chen, Z.; Teng, J.; Liu, S.; Lin, Q.; Gao, Y.; Bai, Z.; Consortium, T.F.; Li, B.; Liu, G.; et al. FarmGTEx TWAS-server: An interactive web server for customized TWAS analysis in both human and farm animals. bioRxiv 2023. bioRxiv:2023.02.03.527092. [Google Scholar] [CrossRef]
  19. Zhou, Z.-Y.; Li, A.; Otecko, N.O.; Liu, Y.-H.; Irwin, D.M.; Wang, L.; Adeola, A.C.; Zhang, J.; Xie, H.-B.; Zhang, Y.-P. PigVar: A database of pig variations and positive selection signatures. Database 2017, 2017, bax048. [Google Scholar] [CrossRef] [Green Version]
  20. González-Diéguez, D.; Tusell, L.; Carillier-Jacquin, C.; Bouquet, A.; Vitezica, Z.G. SNP-based mate allocation strategies to maximize total genetic value in pigs. Genet. Sel. Evol. 2019, 51, 55. [Google Scholar] [CrossRef] [Green Version]
  21. González-Diéguez, D.; Tusell, L.; Bouquet, A.; Legarra, A.; Vitezica, Z.G. Purebred and Crossbred Genomic Evaluation and Mate Allocation Strategies To Exploit Dominance in Pig Crossbreeding Schemes. G3 Genes Genomes Genet. 2020, 10, 2829–2841. [Google Scholar] [CrossRef] [PubMed]
  22. Liu, H.; Henryon, M.; Sørensen, A.C. Mating strategies with genomic information reduce rates of inbreeding in animal breeding schemes without compromising genetic gain. Animal 2017, 11, 547–555. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. (A) The entire progress of the KIS method; (B) The progress of constructing the ideal individual.
Figure 1. (A) The entire progress of the KIS method; (B) The progress of constructing the ideal individual.
Genes 14 00807 g001
Figure 2. (A) The entire progress of simulating the historical population and (B) the principal com-ponent analysis of individuals in the founder population (black circle) and population A (green circle) and population B (red circle) of generation 10.
Figure 2. (A) The entire progress of simulating the historical population and (B) the principal com-ponent analysis of individuals in the founder population (black circle) and population A (green circle) and population B (red circle) of generation 10.
Genes 14 00807 g002
Figure 3. The comparison of the KIS method with negative control and general breeding methods for breeding values of boars and sows.
Figure 3. The comparison of the KIS method with negative control and general breeding methods for breeding values of boars and sows.
Genes 14 00807 g003
Figure 4. The results of comparing four different absence rates for equal-weight KIS method for boars (A) and sows (B); The results of comparing four different absence rates with QTL effects taken into consideration for weighted KIS method for boars (C) and sows (D).
Figure 4. The results of comparing four different absence rates for equal-weight KIS method for boars (A) and sows (B); The results of comparing four different absence rates with QTL effects taken into consideration for weighted KIS method for boars (C) and sows (D).
Genes 14 00807 g004
Figure 5. The comparison of breeding values for different pseudo-positive rates for boars (A) and sows (B); The comparison of the KIS method with general breeding methods for breeding values for boars (C) and sows (D).
Figure 5. The comparison of breeding values for different pseudo-positive rates for boars (A) and sows (B); The comparison of the KIS method with general breeding methods for breeding values for boars (C) and sows (D).
Genes 14 00807 g005
Figure 6. The comparison of the KIS method with general breeding methods for breeding values in different QTL quantitative gradients.
Figure 6. The comparison of the KIS method with general breeding methods for breeding values in different QTL quantitative gradients.
Genes 14 00807 g006
Figure 7. The breeding values corresponding to the KIS method at different scales of foundation population for boars (A) and sows (B); The breeding values corresponding to the KIS method at different selection proportions for boars (C) and sows (D).
Figure 7. The breeding values corresponding to the KIS method at different scales of foundation population for boars (A) and sows (B); The breeding values corresponding to the KIS method at different selection proportions for boars (C) and sows (D).
Genes 14 00807 g007
Figure 8. The result of simulation for non-additive effects. In the figure, from top to bottom, the addition of dominant effects, the addition of supernumerary effects, and the addition of both dominant and supernumerary effects; from left to right, the proportion of the number of non-additive effect QTL to the number of additive effect QTL, from 10% to 30%.
Figure 8. The result of simulation for non-additive effects. In the figure, from top to bottom, the addition of dominant effects, the addition of supernumerary effects, and the addition of both dominant and supernumerary effects; from left to right, the proportion of the number of non-additive effect QTL to the number of additive effect QTL, from 10% to 30%.
Genes 14 00807 g008
Table 1. The results of the significance tests for breeding values. The same letter (A,B) used in the labeling of different methods indicates no significant difference, while different letters indicate a significant difference between the two methods.
Table 1. The results of the significance tests for breeding values. The same letter (A,B) used in the labeling of different methods indicates no significant difference, while different letters indicate a significant difference between the two methods.
GenerationF1F2G1G2G3G4G5
Method
KIS55.068 B56.049 A57.165 B58.196 B59.318 B60.396 B61.446 B
Negative control45.003 A53.809 A53.958 A54.179 A53.873 A54.262 A54.011 A
GBLUP53.685 A55.624 A56.949 A57.648 A58.585 A59.396 B60.199 B
BayesB53.952 A55.665 A56.638 A57.594 A58.583 A59.136 B59.930 B
KIS55.068 A 56.049 A57.165 A58.196 A59.318 A60.396 AB61.446 AB
TBV55.207 A57.896 A59.439 A60.858 A62.190 A63.410 A64.576 A
Table 2. The results of the significance tests for breeding values in false negative simulation. The same letter used in the labeling of different methods indicates no significant difference, while different letters indicate a significant difference between the two methods.
Table 2. The results of the significance tests for breeding values in false negative simulation. The same letter used in the labeling of different methods indicates no significant difference, while different letters indicate a significant difference between the two methods.
GenerationF1F2G1G2G3G4G5
False Negative Rates
0% (equal)55.068 A56.049 A57.165 A58.196 A59.318 A60.396 A61.446 A
10% (equal)55.014 A56.242 A57.620 A58.552 A59.498 A60.633 A61.768 A
20% (equal)54.887 A56.222 A57.497 A58.702 A59.665 A60.559 A61.536 A
30% (equal)55.137 A56.462 A57.636 A58.582 A59.462 A60.473 A61.499 A
0% (weight)55.807 A57.131 A58.755 A60.199 A61.466 A62.903 A64.040 A
10% (weight)54.789 A55.891 A56.724 A57.622 A58.589 A59.534 AB60.148 AB
20% (weight)54.765 A55.707 A56.476 A57.594 A58.342 A59.278 AB59.843 B
30% (weight)54.936 A55.739 A56.601 A57.490 A58.252 A59.106 B59.954 B
Table 3. The results of the significance tests for breeding values in the pseudo-positive simulation. The same letter used in the labeling of different simulations indicates no significant difference, while different letters indicate a significant difference between the two simulations.
Table 3. The results of the significance tests for breeding values in the pseudo-positive simulation. The same letter used in the labeling of different simulations indicates no significant difference, while different letters indicate a significant difference between the two simulations.
GenerationF1F2G1G2G3G4G5
Pseudo-Positive Test
0%55.068 A56.049 A57.165 A58.196 A59.318 A60.396 A61.446 A
10%55.109 A56.088 A57.146 A58.155 A59.193 A60.087 A60.780 A
20%54.733 A56.124 A57.032 A57.928 A58.946 A59.749 A60.649 A
30%54.641 A55.666 A56.673 A57.777 A58.757 A59.703 A60.646 A
GBLUP53.685 A55.624 A56.949 A57.648 A58.585 A59.396 B60.199 B
BayesB53.952 A55.665 A56.638 A57.594 A58.583 A59.136 B59.930 B
KIS (30%)54.641 A55.666 A56.673 A57.777 A58.757 A59.703 B60.646 B
TBV55.207 A57.896 A59.439 A60.858 A62.190 A63.410 A64.576 A
Table 4. The results of the significance tests for breeding values with QTL quantitative gradients. The same letter used in the labeling of different simulations indicates no significant difference, while different letters indicate a significant difference between the two simulations.
Table 4. The results of the significance tests for breeding values with QTL quantitative gradients. The same letter used in the labeling of different simulations indicates no significant difference, while different letters indicate a significant difference between the two simulations.
GenerationF1F2G1G2G3G4G5
QTL Quantitative Gradients
GBLUP (QTL 100-350)48.941 A51.324 A52.115 A53.249 A54.355 A55.065 A56.156 AB
BayesB (QTL 100-350)49.161 A51.138 A52.353 A53.312 A53.979 A54.609 A55.551 B
KIS (QTL 100-350)50.320 A51.661 A52.970 A54.012 A55.304 A56.204 A57.185 AB
TBV (QTL 100-350)50.372 A53.380 A54.931 A56.339 A57.617 A58.834 A59.977 A
GBLUP (QTL 150-400)52.4935 A53.8096 B54.6248 B55.8352 B56.4513 B57.2635 B58.1057 B
BayesB (QTL 150-400)52.5697 A53.9613 B54.6381 B55.6529 B56.6019 B57.3517 B57.8415 B
KIS (QTL 150-400)53.6179 A54.5528 AB55.5732 B56.5278 B57.4771 B58.5359 B59.4759 B
TBV (QTL 150-400)53.4380 A56.2417 A57.8186 A59.1561 A60.5100 A61.7221 A62.9629 A
GBLUP (QTL 200-500)60.5938 A62.6374 A63.8198 B64.7000 B65.5406 B66.1291 B 66.6235 B
BayesB (QTL 200-500)60.5819 A62.8641 A63.9442 B64.6765 B65.7173 B66.5116 B67.1417 B
KIS (QTL 200-500)61.8339 A63.2821 A64.4985 AB65.5540 AB66.4141 B67.6554 B68.6712 B
TBV (QTL 200-500)61.9914 A64.5347 A66.2707 A67.4954 A68.9448 A70.3415 A71.4575 A
Table 5. The results of the significance tests for breeding values with different selection proportions.The same letter used in the labeling of different simulations indicates no significant difference, while different letters indicate a significant difference between the two simulations.
Table 5. The results of the significance tests for breeding values with different selection proportions.The same letter used in the labeling of different simulations indicates no significant difference, while different letters indicate a significant difference between the two simulations.
GenerationF1F2G1G2G3G4G5
Simulation Test
Popsize 200055.391 A56.751 A58.575 A60.159 A61.646 A63.016 A64.532 A
Popsize 100055.363 A56.690 A58.272 A59.692 A60.943 A62.250 A63.591 AB
Popsize 50055.130 A56.260 A57.748 A58.893 A60.252 A61.464 A62.400 AB
Popsize 20055.068 A56.049 A57.165 A58.196 A59.318 A60.396 A61.446 AB
Popsize 10054.641 A55.911 A56.907 A57.583 A58.345 A59.235 A60.039 B
SI_3_10 55.411 A56.738 A57.999 A59.044 A60.615 A61.564 A62.152 A
SI_5_20 55.068 A56.049 A57.165 A58.196 A59.318 A60.396 A61.446 A
SI_10_30 55.219 A56.186 A57.222 A58.179 A59.157 A59.483 A60.948 A
SI_15_40 55.051 A56.009 A56.810 A57.689 A58.574 A59.483 A60.184 A
SI_20_50 54.978 A55.899 A56.666 A57.329 A57.968 A58.675 A59.303 A
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Gu, J.; Guo, J.; Zhang, Z.; Xu, Y.; Qadri, Q.R.; Zhang, Z.; Wang, Z.; Wang, Q.; Pan, Y. Molecular Design-Based Breeding: A Kinship Index-Based Selection Method for Complex Traits in Small Livestock Populations. Genes 2023, 14, 807. https://doi.org/10.3390/genes14040807

AMA Style

Gu J, Guo J, Zhang Z, Xu Y, Qadri QR, Zhang Z, Wang Z, Wang Q, Pan Y. Molecular Design-Based Breeding: A Kinship Index-Based Selection Method for Complex Traits in Small Livestock Populations. Genes. 2023; 14(4):807. https://doi.org/10.3390/genes14040807

Chicago/Turabian Style

Gu, Jiamin, Jianwei Guo, Zhenyang Zhang, Yuejin Xu, Qamar Raza Qadri, Zhe Zhang, Zhen Wang, Qishan Wang, and Yuchun Pan. 2023. "Molecular Design-Based Breeding: A Kinship Index-Based Selection Method for Complex Traits in Small Livestock Populations" Genes 14, no. 4: 807. https://doi.org/10.3390/genes14040807

APA Style

Gu, J., Guo, J., Zhang, Z., Xu, Y., Qadri, Q. R., Zhang, Z., Wang, Z., Wang, Q., & Pan, Y. (2023). Molecular Design-Based Breeding: A Kinship Index-Based Selection Method for Complex Traits in Small Livestock Populations. Genes, 14(4), 807. https://doi.org/10.3390/genes14040807

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop