Preliminary Evaluation of Blending, Tuning, and Scaling Parameters in ssGBLUP for Genomic Prediction Accuracy in South African Holstein Cattle

Mafolo, Kgaogelo Stimela; MacNeil, Michael D.; Neser, Frederick W. C.; Makgahlela, Mahlako Linah

doi:10.3390/ani15192866


by Kgaogelo Stimela Mafolo ¹,²,*, Michael D. MacNeil ¹,²,³, Frederick W. C. Neser ² and Mahlako Linah Makgahlela ¹,²

¹ Agricultural Research Council, Animal Production, Private Bag X2, Irene 0062, South Africa

² University of the Free State, Department of Animal Science, P.O. Box 339, Bloemfontein 9301, South Africa

³ Delta G, Miles City, MT 59301, USA

* Author to whom correspondence should be addressed.

Animals 2025, 15(19), 2866; https://doi.org/10.3390/ani15192866

Submission received: 19 June 2025 / Revised: 4 August 2025 / Accepted: 31 August 2025 / Published: 30 September 2025

(This article belongs to the Section Animal Genetics and Genomics)


Simple Summary

Predicting the genetic merit of dairy cattle is critical for increasing milk production and breeding efficiency. This study looked at how different adjustments to a genomic prediction method known as single-step genomic best linear unbiased prediction (ssGBLUP) affected the accuracy of breeding value estimates in South African Holstein cattle. We specifically tested modifications such as blending, tuning, and scaling to see how well genomic and pedigree information can be combined. Our findings revealed that ssGBLUP was more accurate than traditional pedigree-based methods, but this accuracy was influenced by how genomic and pedigree data were adjusted. Blending strategies with up to 40% polygenic effects increased prediction accuracy. Tuning methods had a smaller impact, and no tuning occasionally led to the best performance. Scaling adjustments influenced prediction accuracy, with some lower scaling options resulting in improved accuracy for certain traits. This study shows the importance of the parameters used in genomic prediction models for ensuring more reliable genetic evaluations, which will ultimately assist farmers in making more accurate breeding decisions and increasing dairy production in South Africa.

Abstract

The objective of this study was to evaluate the impact of blending, tuning, and scaling adjustments in ssGBLUP on the accuracy of genomic estimated breeding values (GEBVs) for South African Holstein cattle. The edited dataset included pedigree information for 541,325 animals, 696,413 phenotypic records (milk, protein, and fat yields), and genotypes for 1221 Holstein cattle. The accuracy of GEBVs was evaluated based on different parameter settings for blending (β = 0.05, 0.10, 0.20, 0.30, and 0.40), tuning (five options for adjusting G relative to A₂₂), and scaling (τ and ω, each ranging from 0.60 to 1.00). The results show that ssGBLUP outperformed the traditional pedigree-based approach (ABLUP), with realized accuracies increasing from 0.01 (ABLUP) to 0.23 (ssGBLUP) for milk yield, 0.03 to 0.29 for protein yield, and 0.03 to 0.30 for fat yield. Blending with β = 0.30–0.40 slightly increased the accuracy, while tuning adjustments showed limited influence on the prediction results. Scaling factors had a notable influence on accuracy, with ω = 0.60 yielding the highest values (0.26 for milk, 0.32 for protein, and 0.34 for fat). The results of this study show the importance of optimizing the integration of pedigree and genomic information in ssGBLUP to improve the accuracy of genomic predictions, ultimately enhancing selection decisions and genetic progress in South African Holstein cattle.

Keywords:

genomic prediction; ssGBLUP; blending; tuning; scaling; Holstein; accuracy

1. Introduction

Predictions of genomic estimated breeding values (GEBVs) using single-nucleotide polymorphism (SNP) markers together with pedigree and phenotypic information provide the information needed for accurate selection of breeding animals [1,2,3,4,5]. Benefits include increasing the rate of genetic progress, reducing costs, and facilitating the use of animals from different groups without pedigree relationships [6,7,8,9]. Genomic prediction accuracy is dependent on factors such as the number of genotypes, relationships among genotyped animals, the density of SNP markers, statistical methods, linkage disequilibrium (LD), heritability, and genetic structure of the traits [10,11,12]. Most importantly, the genomic prediction model is a key factor that affects the prediction accuracy [12,13]. Single-step genomic best linear unbiased prediction (ssGBLUP) is commonly used in many practical applications and considered superior compared to the conventional pedigree-focused best linear unbiased prediction (ABLUP) methods [1,2,3,4,5,14,15].

In ssGBLUP, the inverse of the pedigree relationship matrix (A⁻¹) is replaced by the inverted realized relationship matrix (H⁻¹) [16]. The H⁻¹ is calculated from A⁻¹, the inverse of the genomic relationship matrix (G⁻¹), and the inverse of the pedigree relationship matrix for genotyped animals (A₂₂⁻¹). However, ssGBLUP overlooks factors such as variability of the SNP effects and variation across genomic regions [17]. These issues can be addressed through the weighted ssGBLUP, which assigns different weights to SNP based on their estimated contributions to genetic variance and has shown potential in numerous studies despite potential effects on the dispersion of breeding values and bias [14,18,19]. Furthermore, GEBVs obtained from ssGBLUP are associated with bias linked to various factors, including singularity of the G matrix, incompatible G and A₂₂ matrices, non-random selection of genotyped animals, and allele frequencies used in the G matrix creation [15,20,21,22,23,24]. Therefore, it is important to establish strategies to reduce the influence of these factors to improve the genomic prediction accuracy of ssGBLUP. Some of the strategies used to reduce bias and improve the accuracy of genomic predictions when merging the G and A₂₂ include blending, tuning, and scaling [24,25,26].

Blending conditions of the G and A₂₂ ensure that G is not singular and the relationship matrix is positive definite [15,23,27]. Tuning ensures that A₂₂ is consistent by rebasing and scaling G, which corrects for differences in the variability in elements of G and A₂₂ [15,22,28]. Lastly, scaling restricts G and A₂₂ to minimize over- or under-estimation of the GEBVs [15,23]. Although the BLUPF90 software applies default blending and tuning parameters, these values may not be universally optimal. Their impact on evaluations can be significant, particularly in populations with limited genotyping or incomplete pedigree information, where inappropriate values may exacerbate inconsistencies between G and A₂₂ or introduce additional bias [15,22,23,29]. Thus, exploring the most suitable adjustments related to the construction of the H⁻¹ matrix of the ssGBLUP is essential for addressing bias and reducing the inflation/deflation of the GEBVs, thereby improving their accuracy.

In the absence of substantial information regarding genomic prediction in South African Holstein cattle, there is limited information as to how adjustments to the H⁻¹ matrix could affect the accuracy of the GEBVs. Therefore, the objective of this study was to evaluate the accuracy of GEBVs predicted using ssGBLUP with standard blending, tuning, and scaling parameters, and compare these to alternative parameter configurations applied independently to the inverted relationship matrix for milk production traits in South African Holstein cattle.

South African Holsteins present a unique population for genomic evaluation, characterized by a historical reliance on pedigree-based evaluations [30]. Despite recent national initiatives such as the Dairy Genomic Programme (DGP), which aims to expand genotyping efforts [31,32], progress has been limited by financial and infrastructural constraints. As a result, the number of genotyped animals remains low compared to those in developed countries. Consequently, this study highlights the importance of testing ssGBLUP strategies to determine how parameter adjustments affect prediction accuracy in South African Holstein cattle, for which only a small number of animals have been genotyped.

2. Materials and Methods

2.1. Data Sources and Editing

Phenotypic and pedigree data were obtained from the national database, the Integrated Registration and Genetic Information System, South Africa. The original pedigree data included 3,699,231 data points for the Holstein cattle. The phenotype data consisted of 305-day lactation yield records for 4,779,369 cows. Dairy traits considered were milk, protein, and fat yields. Only the first three lactation records from 1989 to 2016 were used in the analysis; however, only cows with a first lactation record were retained. Cows without a recorded first lactation, even if they had records for later lactations, were excluded.

The pedigree data were edited to exclude records with unknown birth and calving dates. Age at calving was restricted from 20 to 42, 30 to 54, and 40 to 67 months for lactations 1, 2, and 3, respectively [33,34]. Records of milk yields below 1000 kg and more than 30,000 kg, as well as records of butterfat and protein percentages less than 2% or greater than 9%, were also excluded. The data were edited further to remove incomplete lactation records and those deemed unusable for genetic evaluations [33,34].
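The editing rules above can be sketched as a simple record filter. This is a minimal illustration, not the authors' actual pipeline: the record layout (a dict with `lactation`, `age_months`, `milk_kg`, `fat_pct`, and `protein_pct` keys) is hypothetical, and the limits are assumed to be inclusive.

```python
# Age-at-calving windows (months) per lactation, as stated in the text.
AGE_LIMITS = {1: (20, 42), 2: (30, 54), 3: (40, 67)}


def keep_record(rec):
    """Return True if a lactation record passes the editing rules.

    `rec` is a hypothetical dict; the key names are assumptions made
    for this sketch and do not come from the national database.
    """
    limits = AGE_LIMITS.get(rec["lactation"])
    if limits is None:  # only lactations 1 to 3 are analysed
        return False
    low, high = limits
    if not low <= rec["age_months"] <= high:
        return False
    # Milk yields below 1000 kg or above 30,000 kg are excluded.
    if not 1000 <= rec["milk_kg"] <= 30000:
        return False
    # Butterfat and protein percentages outside 2% to 9% are excluded.
    for key in ("fat_pct", "protein_pct"):
        if not 2 <= rec[key] <= 9:
            return False
    return True


records = [
    {"lactation": 1, "age_months": 25, "milk_kg": 7500, "fat_pct": 3.9, "protein_pct": 3.2},
    {"lactation": 1, "age_months": 45, "milk_kg": 7500, "fat_pct": 3.9, "protein_pct": 3.2},
    {"lactation": 2, "age_months": 38, "milk_kg": 900, "fat_pct": 3.9, "protein_pct": 3.2},
]
kept = [r for r in records if keep_record(r)]
```

In this toy input, only the first record passes: the second fails the age window for lactation 1 and the third fails the minimum milk yield.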

The edited phenotypic data used for the analysis contained 696,413 milk production records from 354,228 cows across 1991 herds. The final pedigree data comprised 541,325 animals: 9355 sires and 328,929 dams. Descriptive statistics of the phenotypic data are presented in Table 1. The season variable was categorized based on calving periods, with summer defined as October to March and winter as April to September. These two distinct seasons were used to account for potential environmental and management differences affecting lactation performance. Cows were assigned to contemporary groups defined by herd-year-season of calving. Contemporary groups with fewer than five animals and/or fewer than two sires were excluded, resulting in 22,410 groups.
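The contemporary-group rules described above can be illustrated with a short sketch. The field names (`herd`, `calving_year`, `calving_month`, `sire`) are hypothetical, and the sketch assumes the calendar year of calving defines the year, which the text does not specify for a summer season that spans two calendar years.

```python
from collections import defaultdict


def season_of(month):
    """Summer is October to March; winter is April to September."""
    return "summer" if month >= 10 or month <= 3 else "winter"


def contemporary_groups(cows, min_animals=5, min_sires=2):
    """Group cows by herd-year-season and drop groups that are too small.

    A group is kept only if it has at least `min_animals` cows and
    at least `min_sires` distinct sires.
    """
    groups = defaultdict(list)
    for cow in cows:
        key = (cow["herd"], cow["calving_year"], season_of(cow["calving_month"]))
        groups[key].append(cow)
    return {
        key: members
        for key, members in groups.items()
        if len(members) >= min_animals
        and len({m["sire"] for m in members}) >= min_sires
    }


# Toy data: H1 qualifies, H2 has a single sire, H3 has only four cows.
cows = [
    {"herd": "H1", "calving_year": 2010, "calving_month": m, "sire": s}
    for m, s in [(11, "S1"), (12, "S1"), (1, "S2"), (2, "S2"), (3, "S3")]
]
cows += [{"herd": "H2", "calving_year": 2010, "calving_month": 5, "sire": "S9"} for _ in range(5)]
cows += [{"herd": "H3", "calving_year": 2010, "calving_month": 6, "sire": s} for s in ("S1", "S2", "S3", "S4")]

groups = contemporary_groups(cows)
```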

2.2. Genotypic Data

Genomic data were generated through the DGP, which is a consortium between the ARC, South African universities, and the dairy industry, funded by the South African government. The Illumina 50K chip v3 (Illumina Inc., San Diego, CA, USA), featuring 53,218 SNP markers, was used for genotyping 1473 Holstein cattle. These animals are registered in the INTERGIS database and were sampled through routine evaluations conducted by the DGP. Using PLINK v. 1.07 [35], uninformative markers with MAF < 0.05 and markers with low genotyping rate were removed, as well as animals with an individual call rate < 0.90. Markers that deviated from the Hardy–Weinberg equilibrium (p < 0.0001) were also removed. This resulted in 1221 genotyped animals characterized by 41,407 SNP markers. The genotyped animals included 78 bulls and 1143 cows. Pedigree records indicated that 1218 animals had both parents recorded, while only 3 animals had a single recorded parent.
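The quality control itself was run in PLINK; the sketch below only illustrates the logic of two of the filters (individual call rate and MAF) in plain Python. The data layout, the coding of missing calls as `None`, and the order in which the filters are applied are assumptions. The marker genotyping-rate and Hardy–Weinberg filters are omitted because the sketch does not attempt to reproduce them.

```python
MIN_ANIMAL_CALL_RATE = 0.90  # threshold stated in the text
MIN_MAF = 0.05               # threshold stated in the text


def call_rate(calls):
    """Share of non-missing genotype calls (missing coded as None)."""
    return sum(c is not None for c in calls) / len(calls)


def maf(calls):
    """Minor allele frequency from 0/1/2 allele counts, ignoring missing calls."""
    called = [c for c in calls if c is not None]
    if not called:
        return 0.0
    p = sum(called) / (2 * len(called))
    return min(p, 1.0 - p)


def filter_genotypes(geno):
    """Drop low-call-rate animals, then low-MAF markers.

    `geno` maps a hypothetical animal ID to a list of calls, one per marker.
    Returns the filtered genotypes and the indices of the retained markers.
    """
    animals = {k: v for k, v in geno.items() if call_rate(v) >= MIN_ANIMAL_CALL_RATE}
    n_markers = len(next(iter(geno.values())))
    markers = [
        j for j in range(n_markers)
        if maf([v[j] for v in animals.values()]) >= MIN_MAF
    ]
    return {k: [v[j] for j in markers] for k, v in animals.items()}, markers


geno = {
    "a1": [0, 1, 2, 0],
    "a2": [1, 1, 2, 0],
    "a3": [None, None, 2, 0],  # call rate 0.5, removed
    "a4": [2, 1, 2, 0],
}
filtered, kept_markers = filter_genotypes(geno)
```

In the toy data the last two markers are monomorphic among the retained animals, so only the first two markers survive.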

2.3. Statistical Analysis

Two approaches were used to predict breeding values, namely ABLUP and ssGBLUP. A pedigree-based relationship matrix was used in ABLUP, while the ssGBLUP model incorporated the H matrix. Computations were performed using the BLUPF90 family of programs [36].

2.3.1. Pedigree-Focused Best Linear Unbiased Prediction

The pedigree-based ABLUP model was utilized to estimate variance components and estimated breeding values (EBVs) for milk, protein, and butterfat yields. Variance components were estimated using the average information restricted maximum likelihood method, implemented through AIREMLF90 v1.149. The resulting heritability estimates are shown in Table 1.

Subsequently, EBVs were predicted using BLUPF90 v1.63, which implements the Best Linear Unbiased Prediction (BLUP) model in the BLUPF90 family of programs. The following single-trait repeatability model was used for the estimations:

y = Xb + Za + Wpe + e    (1)

where y is the vector of observations for the traits; X, Z, and W are the known incidence matrices relating records to fixed, additive genetic, and permanent environmental effects, respectively; b is the vector of fixed effects (herd-year-season, age at calving, parity); a is the vector of additive genetic effects for each animal, following a normal distribution N(0, Aσ_{a}^{2}), with A as a pedigree-based additive genetic relationship matrix and σ_{a}^{2} as the additive genetic variance; pe is the vector of permanent environmental effects following a normal distribution N(0, Iσ_{pe}^{2}), where σ_{pe}^{2} is the permanent environmental variance; and e is the vector of residual effects, following a normal distribution N(0, Iσ_{e}^{2}), with σ_{e}^{2} as the residual variance and I as the identity matrix.
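For orientation, the variance components of the repeatability model are usually related to heritability and repeatability through the ratios below. The symbols σ_{p}^{2}, h², and r are introduced here for illustration only; this assumes the conventional definitions and is not a statement of how the estimates in Table 1 were computed.

```latex
\sigma_{p}^{2} = \sigma_{a}^{2} + \sigma_{pe}^{2} + \sigma_{e}^{2}, \qquad
h^{2} = \frac{\sigma_{a}^{2}}{\sigma_{p}^{2}}, \qquad
r = \frac{\sigma_{a}^{2} + \sigma_{pe}^{2}}{\sigma_{p}^{2}}
```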

2.3.2. Single-Step Genomic Best Linear Unbiased Prediction

The GEBVs were estimated using the single-trait repeatability ssGBLUP model, like the ABLUP model. However, in ssGBLUP, the relationship matrix is replaced by the H matrix [2,16,23], which combines genotypes and pedigree data. Thus, the inverse of the matrix H is the following:

H^{-1} = A^{-1} + \begin{bmatrix} 0 & 0 \\ 0 & \tau G^{-1} - \omega A_{22}^{-1} \end{bmatrix}    (2)

where A⁻¹ is the inverse of the numerator relationship matrix (A), including all animals; G⁻¹ is the inverse of the genomic relationship matrix; A₂₂⁻¹ is the inverse of the A matrix for only genotyped animals; and weighting factors for G⁻¹ and A₂₂⁻¹ are represented by the τ and ω, respectively. G is created according to VanRaden [20]. Adjustments of the H matrix inverse were explored to assess the influence of blending, tuning, and scaling factors on prediction accuracies.
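Equation (2) can be sketched numerically with NumPy on a toy pedigree. This is an illustration under stated assumptions rather than the BLUPF90 implementation: the function name `h_inverse`, the three-animal pedigree (sire, dam, offspring), and the values in `G` are all made up for the example, and dense matrix inversion is used only because the example is tiny.

```python
import numpy as np


def h_inverse(a_inv, g_inv, a22_inv, genotyped, tau=1.0, omega=1.0):
    """Assemble H^-1 = A^-1 + [[0, 0], [0, tau * G^-1 - omega * A22^-1]].

    `genotyped` lists the positions of genotyped animals in A^-1, in the
    same order as the rows of G^-1 and A22^-1.
    """
    h_inv = np.array(a_inv, dtype=float)  # np.array copies, so a_inv is untouched
    block = np.ix_(genotyped, genotyped)
    h_inv[block] += tau * np.asarray(g_inv) - omega * np.asarray(a22_inv)
    return h_inv


# Toy pedigree: animal 0 = sire, 1 = dam, 2 = their offspring.
A = np.array([[1.0, 0.0, 0.5],
              [0.0, 1.0, 0.5],
              [0.5, 0.5, 1.0]])
genotyped = [0, 2]                         # sire and offspring are genotyped
A22 = A[np.ix_(genotyped, genotyped)]
G = np.array([[1.02, 0.46],
              [0.46, 0.98]])               # made-up genomic relationships

H_inv = h_inverse(np.linalg.inv(A), np.linalg.inv(G), np.linalg.inv(A22), genotyped)
H = np.linalg.inv(H_inv)
```

With τ = ω = 1, the genotyped block of H equals G, and substituting A₂₂ for G returns A⁻¹ unchanged. Both properties make convenient sanity checks before τ or ω are varied.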

Blending

In ssGBLUP, the blending of the G and A₂₂ matrices was carried out as (1 − β)G⁻¹ + βA₂₂⁻¹, where β represented the amount of residual polygenic variance unaccounted for by G. In the current study, the blending strategies were defined by varying β = 0.05, 0.10, 0.20, 0.30, and 0.40 and abbreviated as ssGBLUP_G0.95, ssGBLUP_G0.90, ssGBLUP_G0.80, ssGBLUP_G0.70, and ssGBLUP_G0.60, respectively. In addition, β = 0.05 was benchmarked as a standard blending coefficient that has been commonly used in ssGBLUP applications. However, β values as low as 0.50 may be appropriate when G captures most of the additive genetic variance [23,37].
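The effect of blending on singularity can be illustrated with a small NumPy sketch. It assumes the weights (1 − β) and β are applied to G and A₂₂ themselves before inversion, which is how blending is usually described; that reading, the function name `blend`, and the toy matrices are assumptions made for the example.

```python
import numpy as np


def blend(g, a22, beta=0.05):
    """Return the blended matrix (1 - beta) * G + beta * A22."""
    g = np.asarray(g, dtype=float)
    a22 = np.asarray(a22, dtype=float)
    return (1.0 - beta) * g + beta * a22


# Toy case: two animals with identical genotypes make G singular.
G = np.array([[1.0, 1.0],
              [1.0, 1.0]])
A22 = np.array([[1.0, 0.5],
                [0.5, 1.0]])

min_eig_raw = np.linalg.eigvalsh(G).min()
min_eig_b05 = np.linalg.eigvalsh(blend(G, A22, beta=0.05)).min()
min_eig_b40 = np.linalg.eigvalsh(blend(G, A22, beta=0.40)).min()
```

The smallest eigenvalue of the raw G is zero, so G cannot be inverted, whereas any β > 0 lifts it above zero in this example, and a larger β moves the blended matrix further from singularity.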

Tuning

Options available for tuning in the BLUPF90 program were explored to determine their impact on predictions. Tuning was accomplished by adjusting the values of G and A₂₂ as follows: ssGBLUP_TG0 = no adjustment; ssGBLUP_TG1 = Mean(diag(G)) = 1 and Mean(offdiag(G)) = 0 proposed by Legarra [38], where diag is the diagonal elements and offdiag is the off-diagonal elements; ssGBLUP_TG2 = Mean(diag(G)) = Mean(diag(A₂₂)) and Mean(offdiag(G)) = Mean(offdiag(A₂₂)) proposed by Chen et al. [28] and referred to as standard; ssGBLUP_TG3 = Mean(G) = Mean(A₂₂) [22]; and ssGBLUP_TG4 = rescaling G using an Fst adjustment [22,39].
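As an illustration of the standard option (ssGBLUP_TG2), the sketch below rescales G linearly so that its mean diagonal and mean off-diagonal elements match those of A₂₂. The linear form G* = a + bG is an assumption about how this constraint is imposed, and the function names and toy matrices are hypothetical; the BLUPF90 implementation may differ in detail.

```python
import numpy as np


def diag_offdiag_means(m):
    """Mean of the diagonal and mean of the off-diagonal elements of a square matrix."""
    m = np.asarray(m, dtype=float)
    n = m.shape[0]
    trace = np.trace(m)
    return trace / n, (m.sum() - trace) / (n * (n - 1))


def tune_to_a22(g, a22):
    """Return a + b * G with a and b chosen to match the A22 means."""
    dg, og = diag_offdiag_means(g)
    da, oa = diag_offdiag_means(a22)
    b = (da - oa) / (dg - og)   # matches the spread between diagonal and off-diagonal
    a = da - b * dg             # shifts the base so that the diagonal means agree
    return a + b * np.asarray(g, dtype=float)


G = np.array([[0.9, 0.1, -0.2],
              [0.1, 1.1, 0.3],
              [-0.2, 0.3, 1.0]])
A22 = np.array([[1.0, 0.5, 0.25],
                [0.5, 1.0, 0.25],
                [0.25, 0.25, 1.05]])

G_tuned = tune_to_a22(G, A22)
```

Solving the two mean constraints gives b as the ratio of the diagonal-minus-off-diagonal spreads and a as the remaining shift, so the tuned matrix reproduces both A₂₂ means exactly.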

Scaling

Scaling was implemented in ssGBLUP with the scaling factors τ and ω as shown in Equation (2). The accuracy of prediction using ssGBLUP was evaluated by varying the scaling parameters τ and ω. The scaling values tested for both parameters were 0.60, 0.70, 0.80, 0.90, and 1.0, with the standard BLUPF90 default set at 1.0. The parameters were varied one at a time, rather than in all combinations, to assess their separate impact on genomic prediction accuracy.

The ssGBLUP model was implemented independently for each scaling value, resulting in 10 analyses: five analyses with varying τ while keeping ω = 1.0, and five analyses with varying ω while keeping τ = 1.0. The default setting (τ = ω = 1.0) is common to both series.
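The one-at-a-time design can be written out explicitly. The sketch below simply enumerates the settings described above; the dictionary layout is an illustrative choice and does not reflect any BLUPF90 parameter-file syntax.

```python
SCALING_VALUES = [0.60, 0.70, 0.80, 0.90, 1.00]

# Five runs varying tau with omega fixed at the default of 1.0 ...
runs = [{"tau": v, "omega": 1.0} for v in SCALING_VALUES]
# ... and five runs varying omega with tau fixed at 1.0.
runs += [{"tau": 1.0, "omega": v} for v in SCALING_VALUES]

# The default (tau = omega = 1.0) appears in both series.
distinct = {(r["tau"], r["omega"]) for r in runs}
```

Because the default appears in both series, the 10 analyses correspond to 9 distinct parameter settings, and a full 5 × 5 grid would have required 25.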

2.3.3. Validation of Prediction Accuracy

To assess the accuracy, a forward validation approach was implemented. Two datasets were used for evaluations: (1) the complete set of phenotype records and (2) datasets with phenotypic records for 390 genotyped cows intentionally omitted. These 390 cows were selected from a total of 1221 genotyped animals. Cows whose phenotype records were omitted had at least one lactation record, and priority was given to those from more recent births. The 831 remaining genotyped animals, in combination with the phenotypic and pedigree data from both genotyped and ungenotyped animals, formed the basis for predicting the breeding values of the 390 validation cows.

This approach differs from cross-validation, which divides genotyped animals into several subsets for iterative training and validation. Due to the relatively small number of genotyped animals available in our study, forward validation was chosen instead. Forward validation closely mirrors the practical implementation of genetic evaluations in livestock populations [40].

The terminology “realized accuracy” was used to indicate the Pearson correlation of the solutions for the 390 cows from the paired mixed model analyses that either do or do not contain their phenotypic data. The formula used to compute realized accuracy is as follows:

Realized Accuracy = cor(EBV_{BLUP_full}, G/EBV_Reduced)    (3)

where EBV_{BLUP_full} refers to the EBVs estimated using the full BLUP model with all phenotypes, and G/EBV_Reduced refers to the EBVs or GEBVs estimated without phenotypic records for the 390 validation cows.
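Equation (3) is a plain Pearson correlation over the validation cows, which can be sketched with the standard library alone. The breeding values below are made-up numbers for four hypothetical cows, not results from the study.

```python
import math


def pearson(x, y):
    """Pearson correlation between two equally long sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical solutions for the same validation cows from the two analyses.
ebv_full = [1.0, 2.0, 3.0, 4.0]      # full model, phenotypes included
gebv_reduced = [1.5, 1.0, 3.5, 3.0]  # reduced model, phenotypes omitted

realized_accuracy = pearson(ebv_full, gebv_reduced)
```

In the study, the two vectors would each hold 390 values, one per validation cow, taken from the full and reduced analyses, respectively.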

3. Results

3.1. Genomic Prediction Accuracy of ABLUP and ssGBLUP Models

Presented in Table 2 are the prediction accuracies from ABLUP and ssGBLUP models for milk, protein, and fat. Use of ssGBLUP produced large increases in accuracy relative to ABLUP. The mean individual accuracies across all models ranged from 0.58 to 0.61. This result shows the advantage of incorporating genomic information in genetic evaluations.

3.2. Accuracy of Predictions Using Different Blending Parameters

Table 3 shows that accuracy increased as the blending parameter (β) increased, that is, as more weight was given to pedigree relationships. Equivalently, the accuracy for milk, protein, and fat decreased slightly as the weight on G (1 − β) increased, with values for milk ranging from 0.26 at ssGBLUP_G0.60 to 0.23 at ssGBLUP_G0.95, protein ranging from 0.32 to 0.29, and fat ranging from 0.33 to 0.30. These results suggest that while genomic information improves accuracy, excessive reliance on genomic relationships may diminish predictive power.

3.3. Accuracy of Predictions Using Different Tuning Options

The prediction accuracy of ssGBLUP was highest with no tuning or with the simplest tuning option (Table 4). The accuracy for milk, protein, and fat was highest at ssGBLUP_TG0 and ssGBLUP_TG1, with values of 0.25 for milk, 0.30 for protein, and 0.31 for fat, and was slightly lower from ssGBLUP_TG2 (default) onwards, where the values stabilized at 0.23 for milk, 0.29 for protein, and 0.30 for fat. This indicates that minimal tuning of the genomic relationship matrix can enhance prediction accuracy, whereas more extensive adjustments may slightly reduce it.

3.4. Accuracy of Predictions Using Different Scaling Parameters for τ and ω

The results for the τ and ω scaling factors are presented in Table 5. For milk, the accuracy was 0.22 at τ = 0.60 and τ = 0.70 and 0.23 from τ = 0.80 onwards. For protein, the accuracy increased from 0.28 at τ = 0.60 to 0.29 at τ = 0.70 and remained stable through τ = 1.00 (default). For fat, the accuracy increased from 0.28 at τ = 0.60 to 0.30 at τ = 0.90 and remained at 0.30 at τ = 1.00. These findings suggest that higher τ values help maintain predictive stability, likely by balancing pedigree and genomic contributions effectively.

In terms of scaling ω, the accuracy for milk decreased from 0.26 at ω = 0.60 and ω = 0.70 to 0.23 at ω = 1.00 (default). For protein, the accuracy declined from 0.32 at ω = 0.60 and ω = 0.70 to 0.29 at ω = 1.00. For fat, the accuracy decreased from 0.34 at ω = 0.60 to 0.30 at ω = 1.00. The decline in accuracy at higher ω values suggests that while genomic information is beneficial, overemphasizing it relative to pedigree data may compromise predictive precision.

4. Discussion

The current study found that ssGBLUP models outperformed ABLUP in predicting breeding values in Holstein cattle. The low realized accuracies of the traditional BLUP models were probably due to limited and perhaps inaccurate parentage in the pedigree records [41]. Similar low realized accuracies have been reported, with ABLUP as low as 0.12 and ssGBLUP at 0.23 [42], and ABLUP accuracy as low as 0.02 [43]. These studies confirm that low ABLUP or ssGBLUP accuracies are not unique to the current study and can occur when data limitations exist.

The correlations between the EBVs and the GEBVs for the full model are presented in Supplementary Table S1. As expected, there were strong correlations (0.86–0.88) between EBVs and GEBVs. However, the mean individual accuracies across models were identical (e.g., 0.61 for milk). This shows that pedigree-based ABLUP may exaggerate EBV confidence [44], particularly for cows, which often have lower accuracies due to fewer or no progeny [45,46,47,48]. The low ABLUP accuracies (Table 2) were most likely caused by the under-representation of bulls, typically more informative in genetic evaluations, and the predominance of younger over older animals. As a result, inadequate pedigree connectedness, limited progeny data, and previous selection all lower parent average accuracy [49]. ABLUP, which lacks genomic data, cannot overcome these constraints when compared to ssGBLUP, which uses both pedigree and genomic information to increase predictive accuracy [50].

This study reiterates the importance of using genomic data through ssGBLUP, which integrates genotyped and non-genotyped animals in a single evaluation [5,11,16,51,52]. However, the realized accuracies found in the current study were generally low but consistent with the ranges reported in previous research (9% to 47%) using small numbers of genotypes in ssGBLUP for traits such as milk yield in the Holstein breed [45,46,47,48]. It is indeed remarkable that having genotypes for only a relatively small fraction of the animals produced improvements in prediction accuracy of this magnitude.

Despite ssGBLUP consistently outperforming ABLUP in this study, the low realized accuracy observed is likely due to the limited number of genotyped animals, the degree of relatedness among them, and the characteristics of the prediction models used [10,11,12]. While ssGBLUP was utilized in this study, the GEBVs obtained are associated with potential biases resulting from the singularity of G and challenges associated with compatibility between G and A₂₂, which could lead to low genomic prediction accuracy [5,20,22,24].

The present study utilized genotypes from medium-density SNP chip panels (50K), which, while not capturing the entire genome, can effectively cover LD blocks for genomic relationship matrix construction [53]. Therefore, studying alternative strategies for blending pedigree and genomic information is necessary to evaluate different amounts of information being captured by polygenic effects [53]. This study shows that prediction accuracy improves when polygenic effects account for up to 40% of the information used in calculating the GEBVs for Holstein cattle, as seen with the ssGBLUP models using blending values of β = 0.40, β = 0.30, and β = 0.20. This contrasts with the standard value of β = 0.05 used in the ssGBLUP_G0.95 model. Our results align with those of Curzon et al. [54], where increasing β was beneficial for a small Israeli Holstein population, and giving more weight to pedigree (β = 0.30–0.50) reduced the upward bias of GEBVs and improved prediction accuracy. According to Lourenco et al. [23], increasing β is often necessary to control inflation when pedigree data are incomplete and can speed up convergence with minimal or no impact on accuracy. In our study, pedigree completeness was generally high among the genotyped animals, with 1218 out of 1221 animals having both parents recorded, indicating that adjustments in β were likely not driven by pedigree gaps but by other structural aspects of the data and model.

Although β = 0.05 is commonly used for blending to address singularity issues in BLUPF90 programs [23], the results of this study demonstrate variations in prediction accuracy when using alternative blending values. Consistent with Piccoli et al. [53], the current study shows that β = 0.05 provides reliable accuracy, and only a marginal improvement in accuracy resulted from using other values for blending. Interestingly, while Gao et al. [55] and Neves et al. [56] reported gains with β = 0.20, the present study shows limited advantage when using this value, suggesting that the optimal blending parameter may be population-dependent. Furthermore, in agreement with Abdalla et al. [27], the results of this study reveal minimal differences across blending strategies, although β = 0.05 and β = 0.10 emerge as practical choices. These findings highlight the importance of polygenic effects, which, as Meyer et al. [57] noted, can provide up to 20% of the relationship information among animals. In this population study, weighting the polygenic effects more heavily improved accuracy when marker-based relationships alone were insufficient to capture additive genetic variance. This means that increasing the reliance on polygenic effects depends on certain conditions and populations [57,58,59]. Overall, these results indicate that the optimal blending value is context-dependent, influenced by population structure, relatedness, and the proportion of information contributed by pedigree and genomic data. Although the current study did not evaluate pedigree depth or accuracy beyond parentage, these factors may influence the optimal blending value in other populations [23,54]. Therefore, developing general strategies for determining appropriate β values under different pedigree structures warrants further investigation, particularly in larger datasets with variable pedigree depth.

Moreover, the effectiveness of blending, and ssGBLUP more broadly, also hinges on the composition of the genotyped reference population. The limited number of genotyped animals (1221) relative to the large population with phenotypes restricts the informativeness of the genomic relationship matrix and thus ssGBLUP accuracy [16,60]. Selecting animals to include in the genotyped reference population is crucial to maximize connectedness, genetic diversity, and family representation so as to improve prediction accuracy [61,62]. Beyond the work of Spangler et al. [63], future efforts should improve on genotyping strategies to enhance genomic evaluations, especially in settings with limited resources.

Exploring additional ways to modify H⁻¹ may result in improved accuracy of prediction. Previously, tuning and scaling have been identified as being crucial [59]. Tuning ensures compatibility between the G and A₂₂ matrices so that both refer to the same genetic base [37]. Nevertheless, the explored tuning methods made only small changes to the realized accuracy compared to setting the means of the off-diagonal and diagonal elements of G to the means of the off-diagonal and diagonal elements of A₂₂, which is standard in BLUPF90 programs [28,64]. McWhorter et al. [37] also found minor differences in accuracy from various tuning options. According to Neshat et al. [65], tuning improves the genomic prediction accuracy based on their research using simulated genomic data. However, some tuning methods that depend on the allele frequencies, such as equating the means of the off-diagonal and diagonal elements of G and A₂₂ or rescaling G by F_st, are computationally demanding [65]. Hence, a tuning strategy that does not depend on allele frequencies with simplified computations was proposed by Bermann et al. [64]. However, it did not perform better than the above-mentioned standard strategies. In the present study, the option of not tuning seemed better than the standard method, although not to a large extent. Few studies have considered not tuning as an option, so there is limited evidence to support this observation. Bermann et al. [64] and Hsu et al. [66] demonstrated lower accuracy of GEBVs without tuning compared to tuning. It is also important to recognize that not tuning may inflate and bias the GEBVs [64]. Further investigation into the no-tuning approach is warranted, particularly when combined with adjustments to the H⁻¹ matrix, such as blending and scaling. This will help identify the most effective combination of adjustments, which will be explored in a subsequent article.

One of the challenges in ssGBLUP is that when the G⁻¹ and A₂₂⁻¹ matrices are not properly scaled in the H⁻¹ matrix, inflated or deflated GEBVs may result [15]. To address this, ω scaling regulates A₂₂⁻¹ while τ regulates G⁻¹ [23]. This makes it necessary to ensure proper scaling of genomic and pedigree matrices using τ and ω values [23]. The findings of this study indicate that the scaling values of τ and ω have a significant effect on the realized accuracy of EBVs estimated with ssGBLUP. Adjusting τ to a value less than 1 did not improve the realized accuracy in Holstein, contrary to previous research [5,67]. This study focused on values of τ less than 1, following the basis established in prior research. However, previous studies have demonstrated that increasing τ values beyond 1 can significantly improve prediction accuracy [68,69]. The decision to limit the exploration to values less than 1 was a limitation of this study. Further exploration of higher τ values could reveal additional benefits.

This study found that reducing ω to 0.6 improved the realized accuracy in the evaluation. Similar trends were observed in other studies when ω was reduced to less than 1 [5,67,68,69]. Conversely, Hong et al. [70] observed reduced prediction accuracy with ω less than 1. According to Lourenco et al. [23], reducing ω below 1 contributes to reducing overestimation of the GEBVs. The current results emphasize the need for careful consideration of population-specific factors and thorough evaluation of scaling parameters in genomic prediction analyses.

Scaling ω down to 0.60 and blending with up to 20% polygenic effects enhanced Holstein prediction accuracy, emphasizing the significance of these modifications in the matrices that were derived from the relationships among animals. Aligning the G⁻¹ with A₂₂⁻¹ has improved accuracy and reduced bias in GEBVs [71]. Therefore, these studies indicate a more effective accounting for additive genetic variation. However, the necessity of fine-tuning was recommended to account for higher proportions of blending or when the model explicitly accounts for a residual polygenic effect [29,72]. Further research needs to focus on optimizing the tuning and blending of additive effects, particularly when exceeding 20%, and explore the benefits of incorporating ancestral genetic contributions through pedigree information to enhance prediction accuracy and reduce bias in genomic prediction models.

Given that the accuracy achieved with the current approach remains low, additional strategies are needed to enhance it. While the focus here was on ssGBLUP, the next step would involve exploring how combinations of parameters such as blending, scaling, and no-tuning could improve prediction accuracy and reduce bias, an aspect not fully addressed in this study. It is also important to determine how these affect bias due to the inflation of differences among the GEBVs. Neshat et al. [59] emphasized the importance of establishing the best configuration of blending, tuning, and scaling parameters to improve prediction accuracy. However, there is ongoing debate about the best way to combine these parameters when adjusting the H⁻¹ matrix [37,70,73]. These adjustments are crucial, as they optimize the blending of pedigree and genomic information and directly impact the accuracy of GEBVs, leading to more informed breeding decisions and achieving genetic progress in livestock populations.

5. Conclusions

Single-step GBLUP outperformed ABLUP across all traits. However, overall prediction accuracies remained low, likely because of factors such as the limited number of genotyped animals, the structure of the reference population, and model assumptions. Blending with up to 40% polygenic effects slightly improved accuracy, suggesting that the optimal blending parameter is population-dependent. Tuning methods showed only marginal differences, and the no-tuning approach produced slightly better results, suggesting the need for further exploration of no-tuning strategies combined with blending and scaling. The scaling parameter ω had a noticeable effect on accuracy, especially when reduced. These findings highlight the influence of blending, tuning, and scaling choices on genomic prediction accuracy. Therefore, it is recommended that preliminary optimization be tailored to the specific population and data before genomic evaluations are implemented. As this study is preliminary and constrained by a limited number of genotyped animals, it is recommended that the SA Holstein Cattle Breeders’ Society and breeders prioritize genotyping not only more animals but also those that are more informative, to enhance the accuracy and reliability of future genomic evaluations.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ani15192866/s1, Table S1: Pearson correlation between EBVs and GEBVs from full (including all phenotypes) and reduced (excluding phenotypes of 390 animals) models for Holstein cattle, with mean individual accuracy shown in brackets.

Author Contributions

Conceptualization, K.S.M. and M.L.M.; methodology, K.S.M.; validation, M.D.M., M.L.M. and F.W.C.N.; formal analysis, K.S.M.; investigation, K.S.M.; writing—original draft preparation, K.S.M.; writing—review and editing, K.S.M., M.L.M., M.D.M. and F.W.C.N.; supervision, M.L.M., M.D.M. and F.W.C.N.; project administration, M.L.M.; funding acquisition, M.L.M. All authors have read and agreed to the published version of the manuscript.

Funding

The authors would like to thank the National Research Foundation (Grant No: 98680 and 99618) and the Technology and Innovation Agency (TIA), an implementing agency of the Department of Science and Innovation (DSI), for funding this work as part of the Dairy Genomics Project (DGP).

Institutional Review Board Statement

The study was approved by the Agricultural Research Council (ARC) Animal Ethics Committee (Ref no APIEC 22/06, approval date: 10/08/2022).

Informed Consent Statement

Not applicable.

Data Availability Statement

The data analyzed in this study were obtained from the Dairy Genomic Programme and the SA Holstein Cattle Breeders’ Society. These datasets are not publicly available because they are owned by third parties. Data may be made available for research purposes upon reasonable request to the corresponding author.

Acknowledgments

The SA Holstein Cattle Breeders’ Society is acknowledged for granting permission to use the data. Lastly, we would like to acknowledge the National Integrated Cyber Infrastructure System (NICIS) and the Centre for High Performance Computing for providing computational resources to perform our analysis.

Conflicts of Interest

Author Michael D. MacNeil is employed by the company “Delta G”. The remaining authors declare no conflicts of interest.

Abbreviations

BLUP	Best linear unbiased prediction
EBVs	Estimated breeding values
GEBVs	Genomic estimated breeding values
SNP	Single-nucleotide polymorphism
ssGBLUP	Single-step genomic best linear unbiased prediction

References

1. Legarra, A.; Aguilar, I.; Misztal, I. A Relationship Matrix Including Full Pedigree and Genomic Information. J. Dairy Sci. 2009, 92, 4656–4663.
2. Christensen, O.F.; Lund, M.S. Genomic Prediction When Some Animals Are Not Genotyped. Genet. Sel. Evol. 2010, 42, 2.
3. Christensen, O.F.; Madsen, P.; Nielsen, B.; Ostersen, T.; Su, G. Single-Step Methods for Genomic Evaluation in Pigs. Animals 2012, 6, 1565–1571.
4. Li, X.; Wang, S.; Huang, J.; Li, L.; Zhang, Q.; Ding, X. Improving the Accuracy of Genomic Prediction in Chinese Holstein Cattle by Using One-Step Blending. Genet. Sel. Evol. 2014, 46, 66.
5. Koivula, M.; Strandén, I.; Pösö, J.; Aamand, G.P.; Mäntysaari, E.A. Single-Step Genomic Evaluation Using Multitrait Random Regression Model and Test-Day Data. J. Dairy Sci. 2015, 98, 2775–2784.
6. Schaeffer, L.R. Strategy for Applying Genome-Wide Selection in Dairy Cattle. J. Anim. Breed. Genet. 2006, 123, 218–223.
7. Saatchi, M.; Miraei-Ashtiani, S.R.; Javaremi, A.N.; Moradi-Shahrebabak, M.; Mehrabani-Yeghaneh, H. The Impact of Information Quantity and Strength of Relationship between Training Set and Validation Set on Accuracy of Genomic Estimated Breeding Values. Afr. J. Biotechnol. 2010, 9, 438–442.
8. Wientjes, Y.C.J.; Bijma, P.; Vandenplas, J.; Calus, M.P.L. Multi-Population Genomic Relationships for Estimating Current Genetic Variances Within and Genetic Correlations Between Populations. Genetics 2017, 207, 503–515.
9. Calus, M.P.L.; Goddard, M.E.; Wientjes, Y.C.J.; Bowman, P.J.; Hayes, B.J. Multibreed Genomic Prediction Using Multitrait Genomic Residual Maximum Likelihood and Multitask Bayesian Variable Selection. J. Dairy Sci. 2018, 101, 4279–4294.
10. Weigel, K.A.; de los Campos, G.; Vazquez, A.I.; Rosa, G.J.M.; Gianola, D.; Van Tassell, C.P. Accuracy of Direct Genomic Values Derived from Imputed Single Nucleotide Polymorphism Genotypes in Jersey Cattle. J. Dairy Sci. 2010, 93, 5423–5435.
11. Uemoto, Y.; Osawa, T.; Saburi, J. Effect of Genotyped Cows in the Reference Population on the Genomic Evaluation of Holstein Cattle. Animals 2017, 11, 382–393.
12. Zhang, H.; Yin, L.; Wang, M.; Yuan, X.; Liu, X. Factors Affecting the Accuracy of Genomic Selection for Agricultural Economic Traits in Maize, Cattle, and Pig Populations. Front. Genet. 2019, 10, 189.
13. Wang, X.; Miao, J.; Chang, T.; Xia, J.; An, B.; Li, Y.; Xu, L.; Zhang, L.; Gao, X.; Li, J.; et al. Evaluation of GBLUP, BayesB and Elastic Net for Genomic Prediction in Chinese Simmental Beef Cattle. PLoS ONE 2019, 14, e0210442.
14. Cesarani, A.; Masuda, Y.; Tsuruta, S.; Nicolazzi, E.L.; VanRaden, P.M.; Lourenco, D.; Misztal, I. Genomic Predictions for Yield Traits in US Holsteins with Unknown Parent Groups. J. Dairy Sci. 2021, 104, 5843–5853.
15. Nilforooshan, M.A. A Note on the Conditioning of the H⁻¹ Matrix Used in Single-Step GBLUP. Animals 2022, 12, 3208.
16. Aguilar, I.; Misztal, I.; Johnson, D.L.; Legarra, A.; Tsuruta, S.; Lawlor, T.J. Hot Topic: A Unified Approach to Utilize Phenotypic, Full Pedigree, and Genomic Information for Genetic Evaluation of Holstein Final Score. J. Dairy Sci. 2010, 93, 743–752.
17. Karaman, E.; Lund, M.S.; Su, G. Multi-Trait Single-Step Genomic Prediction Accounting for Heterogeneous (Co)Variances over the Genome. Heredity 2020, 124, 274–287.
18. Mehrban, H.; Naserkheil, M.; Lee, D.H.; Cho, C.; Choi, T.; Park, M.; Ibáñez-Escriche, N. Genomic Prediction Using Alternative Strategies of Weighted Single-Step Genomic BLUP for Yearling Weight and Carcass Traits in Hanwoo Beef Cattle. Genes 2021, 12, 266.
19. Mancin, E.; Mota, L.F.M.; Tuliozi, B.; Verdiglione, R.; Mantovani, R.; Sartori, C. Improvement of Genomic Predictions in Small Breeds by Construction of Genomic Relationship Matrix Through Variable Selection. Front. Genet. 2022, 13, 814264.
20. VanRaden, P.M. Efficient Methods to Compute Genomic Predictions. J. Dairy Sci. 2008, 91, 4414–4423.
21. Forni, S.; Aguilar, I.; Misztal, I. Different Genomic Relationship Matrices for Single-Step Analysis Using Phenotypic, Pedigree and Genomic Information. Genet. Sel. Evol. 2011, 43, 1.
22. Vitezica, Z.G.; Aguilar, I.; Misztal, I.; Legarra, A. Bias in Genomic Predictions for Populations under Selection. Genet. Res. 2011, 93, 357–366.
23. Lourenco, D.; Legarra, A.; Tsuruta, S.; Masuda, Y.; Aguilar, I.; Misztal, I. Single-Step Genomic Evaluations from Theory to Practice: Using SNP Chips and Sequence Data in BLUPF90. Genes 2020, 11, 790.
24. Tsuruta, S.; Lawlor, T.J.; Lourenco, D.A.L.; Misztal, I. Bias in Genomic Predictions by Mating Practices for Linear Type Traits in a Large-Scale Genomic Evaluation. J. Dairy Sci. 2021, 104, 662–677.
25. Tsuruta, S.; Lourenco, D.A.L.; Masuda, Y.; Misztal, I.; Lawlor, T.J. Controlling Bias in Genomic Breeding Values for Young Genotyped Bulls. J. Dairy Sci. 2019, 102, 9956–9970.
26. Aguilar, I.; Fernandez, E.N.; Blasco, A.; Ravagnolo, O.; Legarra, A. Effects of Ignoring Inbreeding in Model-Based Accuracy for BLUP and SSGBLUP. J. Anim. Breed. Genet. 2020, 137, 356–364.
27. Abdalla, E.E.A.; Schenkel, F.S.; Emamgholi Begli, H.; Willems, O.W.; van As, P.; Vanderhout, R.; Wood, B.J.; Baes, C.F. Single-Step Methodology for Genomic Evaluation in Turkeys (Meleagris gallopavo). Front. Genet. 2019, 10, 1248.
28. Chen, C.Y.; Misztal, I.; Aguilar, I.; Legarra, A.; Muir, W.M. Effect of Different Genomic Relationship Matrices on Accuracy and Scale. J. Anim. Sci. 2011, 89, 2673–2679.
29. Garcia, A.; Aguilar, I.; Legarra, A.; Tsuruta, S.; Misztal, I.; Lourenco, D. Theoretical Accuracy for Indirect Predictions Based on SNP Effects from Single-Step GBLUP. Genet. Sel. Evol. 2022, 54, 66.
30. Banga, C.; Neser, F.; Garrick, D. Breeding Objectives for Holstein Cattle in South Africa. S. Afr. J. Anim. Sci. 2014, 44, 199.
31. van Marle-Köster, E.; Visser, C. Genetic Improvement in South African Livestock: Can Genomics Bridge the Gap between the Developed and Developing Sectors? Front. Genet. 2018, 9, 331.
32. Visser, C.; Lashmar, S.F.; Reding, J.; Berry, D.P.; van Marle-Köster, E. Pedigree and Genome-Based Patterns of Homozygosity in the South African Ayrshire, Holstein, and Jersey Breeds. Front. Genet. 2023, 14, 1136078.
33. Makgahlela, M.; Banga, C.; Norris, D.; Dzama, K.; Ng’ambi, J. Genetic Correlations between Female Fertility and Production Traits in South African Holstein Cattle. S. Afr. J. Anim. Sci. 2007, 37, 180–188.
34. Interbull. National Genetic Evaluation–Republic of South Africa. 2020. Available online: https://interbull.org/ib/geforms (accessed on 31 July 2021).
35. Purcell, S.; Neale, B.; Todd-Brown, K.; Thomas, L.; Ferreira, M.A.R.; Bender, D.; Maller, J.; Sklar, P.; De Bakker, P.I.W.; Daly, M.J.; et al. PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses. Am. J. Hum. Genet. 2007, 81, 559–575.
36. Misztal, I.; Tsuruta, S.; Strabel, T.; Auvray, B.; Druet, T.; Lee, D.H. BLUPF90 and related programs (BGF90). In Proceedings of the 7th World Congress on Genetics Applied to Livestock Production, Montpellier, France, 19–23 August 2002; pp. 1–28.
37. McWhorter, T.M.; Bermann, M.; Garcia, A.L.S.; Legarra, A.; Aguilar, I.; Misztal, I.; Lourenco, D. Implication of the Order of Blending and Tuning When Computing the Genomic Relationship Matrix in Single-Step GBLUP. J. Anim. Breed. Genet. 2022, 140, 60–78.
38. Legarra, A. Comparing Estimates of Genetic Variance across Different Relationship Models. Theor. Popul. Biol. 2016, 107, 26–30.
39. Powell, J.E.; Visscher, P.M.; Goddard, M.E. Reconciling the Analysis of IBD and IBS in Complex Trait Studies. Nat. Rev. Genet. 2010, 11, 800–805.
40. Junqueira, V.S.; Lopes, P.S.; Lourenco, D.; Silva, F.F.e.; Cardoso, F.F. Applying the Metafounders Approach for Genomic Evaluation in a Multibreed Beef Cattle Population. Front. Genet. 2020, 11, 556399.
41. Scholtens, M.; Lopez-Villalobos, N.; Lehnert, K.; Snell, R.; Garrick, D.; Blair, H.T. Advantage of Including Genomic Information to Predict Breeding Values for Lactation Yields of Milk, Fat, and Protein or Somatic Cell Score in a New Zealand Dairy Goat Herd. Animals 2021, 11, 24.
42. Bernstein, R.; Du, M.; Du, Z.G.; Strauss, A.S.; Hoppe, A.; Bienefeld, K. First Large-Scale Genomic Prediction in the Honey Bee. Heredity 2023, 130, 320–328.
43. Naserkheil, M.; Lee, D.H.; Mehrban, H. Improving the Accuracy of Genomic Evaluation for Linear Body Measurement Traits Using Single-Step Genomic Best Linear Unbiased Prediction in Hanwoo Beef Cattle. Genetics 2020, 21, 144.
44. Gorjanc, G.; Bijma, P.; Hickey, J.M. Reliability of Pedigree-Based and Genomic Evaluations in Selected Populations. Genet. Sel. Evol. 2015, 47, 65.
45. Lourenco, D.A.L.; Misztal, I.; Tsuruta, S.; Aguilar, I.; Ezra, E.; Ron, M.; Shirak, A.; Weller, J.I. Methods for Genomic Evaluation of a Relatively Small Genotyped Dairy Population and Effect of Genotyped Cow Information in Multiparity Analyses. J. Dairy Sci. 2014, 97, 1742–1752.
46. Nayee, N.G.; Su, G.; Gajjar, S.G.; Sahana, G.; Saha, S.; Trivedi, K.R.; Guldbrandtsen, B.; Lund, M.S. Genomic prediction by single-step genomic BLUP using cow reference population in Holstein crossbred cattle in India. In Proceedings of the 11th World Congress on Genetics Applied to Livestock Production, Auckland, New Zealand, 11–16 February 2018; Article 11.411. Available online: http://www.wcgalp.org/proceedings/2018/genomic-prediction-single-step-genomic-blup-using-cow-reference-population-holstein (accessed on 31 July 2022).
47. Lee, S.H.; Dang, C.G.; Choy, Y.H.; Do, C.H.; Cho, K.; Kim, J.; Kim, Y.; Lee, J. Comparison of Genome-Wide Association and Genomic Prediction Methods for Milk Production Traits in Korean Holstein Cattle. Asian-Australas J. Anim. Sci. 2019, 32, 913–921.
48. Kudinov, A.A.; Mäntysaari, E.A.; Pitkänen, T.J.; Saksa, E.I.; Aamand, G.P.; Uimari, P.; Strandén, I. Single-Step Genomic Evaluation of Russian Dairy Cattle Using Internal and External Information. J. Anim. Breed. Genet. 2022, 139, 259–270.
49. Bijma, P. Accuracies of Estimated Breeding Values from Ordinary Genetic Evaluations Do Not Reflect the Correlation between True and Estimated Breeding Values in Selected Populations. J. Anim. Breed. Genet. 2012, 129, 345–358.
50. Misztal, I.; Tsuruta, S.; Aguilar, I.; Legarra, A.; VanRaden, P.M.; Lawlor, T.J. Methods to Approximate Reliabilities in Single-Step Genomic Evaluation. J. Dairy Sci. 2013, 96, 647–654.
51. Legarra, A.; Christensen, O.F.; Aguilar, I.; Misztal, I. Single Step, a General Approach for Genomic Selection. Livest. Sci. 2014, 166, 54–65.
52. Tsuruta, S.; Misztal, I.; Lawlor, T.J. Short Communication: Genomic Evaluations of Final Score for US Holsteins Benefit from the Inclusion of Genotypes on Cows. J. Dairy Sci. 2013, 96, 3332–3335.
53. Piccoli, M.L.; Brito, L.F.; Braccini, J.; Brito, F.V.; Cardoso, F.F.; Cobuci, J.A.; Sargolzaei, M.; Schenkel, F.S. A Comprehensive Comparison between Single- and Two-Step GBLUP Methods in a Simulated Beef Cattle Population. Can J. Anim. Sci. 2018, 98, 565–575.
54. Curzon, A.Y.; Ezra, E.; Weller, J.I.; Seroussi, E.; Börner, V.; Gershoni, M. Single-Step Genomic BLUP (ssGBLUP) Effectively Models Small Cattle Populations: Lessons from the Israeli-Holstein Herdbook. Genomics 2024, 25, 1147.
55. Gao, H.; Christensen, O.F.; Madsen, P.; Nielsen, U.S.; Zhang, Y.; Lund, M.S.; Su, G. Comparison on Genomic Predictions Using Three GBLUP Methods and Two Single-Step Blending Methods in the Nordic Holstein Population. Genet. Sel. Evol. 2012, 44, 8.
56. Neves, H.H.R.; Carvalheiro, R.; O’Brien, A.M.P.; Utsunomiya, Y.T.; Do Carmo, A.S.; Schenkel, F.S.; Sölkner, J.; McEwan, J.C.; Van Tassell, C.P.; Cole, J.B.; et al. Accuracy of Genomic Predictions in Bos Indicus (Nellore) Cattle. Genet. Sel. Evol. 2014, 46, 17.
57. Meyer, K.; Tier, B.; Swan, A. Estimates of Genetic Trend for Single-Step Genomic Evaluations. Genet. Sel. Evol. 2018, 50, 39.
58. Hollifield, M.K.; Bermann, M.; Lourenco, D.; Misztal, I. Impact of Blending the Genomic Relationship Matrix with Different Levels of Pedigree Relationships or the Identity Matrix on Genetic Evaluations. JDS Commun. 2022, 3, 343–347.
59. Neshat, M.; Momin, M.; Truong, B.; Van Der Werf, J.H.J.; Lee, S.; Lee, S.H. Finetuning hyper-parameters increases the prediction accuracy in single-step genetic evaluation. In Proceedings of the 12th World Congress on Genetics Applied to Livestock Production, Rotterdam, The Netherlands, 3–8 July 2022; pp. 1352–1355.
60. Van Grevenhof, E.M.; Van Arendonk, J.A.; Bijma, P. Response to Genomic Selection: The Bulmer Effect and the Potential of Genomic Selection When the Number of Phenotypic Records Is Limiting. Genet. Sel. Evol. 2012, 44, 26.
61. Buaban, S.; Prempree, S.; Sumreddee, P.; Duangjinda, M.; Masuda, Y. Genomic Prediction of Milk-Production Traits and Somatic Cell Score Using Single-Step Genomic Best Linear Unbiased Predictor with Random Regression Test-Day Model in Thai Dairy Cattle. J. Dairy Sci. 2021, 104, 12713–12723.
62. Misztal, I.; Lourenco, D.; Legarra, A. Current Status of Genomic Evaluation. J. Anim. Sci. 2020, 98, skaa101.
63. Spangler, M.L.; Sapp, R.L.; Bertrand, J.K.; MacNeil, M.D.; Rekaya, R. Different Methods of Selecting Animals for Genotyping to Maximize the Amount of Genetic Information Known in the Population. J. Anim. Sci. 2008, 86, 2471–2479.
64. Bermann, M.; Lourenco, D.; Misztal, I. Technical Note: Automatic Scaling in Single-Step Genomic BLUP. J. Dairy Sci. 2021, 104, 2027–2031.
65. Neshat, M.; Lee, S.; Momin, M.M.; Truong, B.; van der Werf, J.H.J.; Lee, S.H. An Effective Hyper-Parameter Can Increase the Prediction Accuracy in a Single-Step Genetic Evaluation. Front. Genet. 2023, 14, 1104906.
66. Hsu, W.L.; Garrick, D.J.; Fernando, R.L. The Accuracy and Bias of Single-Step Genomic Prediction for Populations under Selection. G3 2017, 7, 2685–2694.
67. Harris, B.L.; Winkelman, A.M.; Johnson, D.L. Large-Scale Single-Step Genomic Evaluation for Milk Production Traits. In Proceedings of the Interbull, Interbull Bulletin, Cork, Ireland, 28–31 May 2012; Volume 46, pp. 20–24.
68. Misztal, I.; Aguilar, I.; Lawlor, T.J. Choice of Parameters for Single-Step Genomic Evaluation for Type. J. Dairy Sci. 2010, 93, 533.
69. Martini, J.W.R.; Schrauf, M.F.; Garcia-Baccino, C.A.; Pimentel, E.C.G.; Munilla, S.; Rogberg-Muñoz, A.; Cantet, R.J.C.; Reimer, C.; Gao, N.; Wimmer, V.; et al. The Effect of the H⁻¹ Scaling Factors τ and ω on the Structure of H in the Single-Step Procedure. Genet. Sel. Evol. 2018, 50, 16.
70. Hong, J.K.; Kim, Y.S.; Cho, K.H.; Lee, D.H.; Min, Y.J.; Cho, E.S. Application of Single-Step Genomic Evaluation Using Social Genetic Effect Model for Growth in Pig. Asian-Australas J. Anim. Sci. 2019, 32, 1836–1843.
71. Paiva, J.T.; Mota, R.R.; Lopes, P.S.; Hammami, H.; Vanderick, S.; Oliveira, H.R.; Veroneze, R.; Fonseca e Silva, F.; Gengler, N. Genomic Prediction and Genetic Correlations Estimated for Milk Production and Fatty Acid Traits in Walloon Holstein Cattle Using Random Regression Models. J. Dairy Res. 2022, 89, 222–230.
72. Ben Zaabza, H.; Taskinen, M.; Mäntysaari, E.A.; Pitkänen, T.; Aamand, G.P.; Strandén, I. Breeding Value Reliabilities for Multiple-Trait Single-Step Genomic Best Linear Unbiased Predictor. J. Dairy Sci. 2022, 105, 5221–5237.
73. Guarini, A.R.; Lourenco, D.A.L.; Brito, L.F.; Sargolzaei, M.; Baes, C.F.; Miglior, F.; Misztal, I.; Schenkel, F.S. Comparison of Genomic Predictions for Lowly Heritable Traits Using Multi-Step and Single-Step Genomic Best Linear Unbiased Predictor in Holstein Cattle. J. Dairy Sci. 2018, 101, 8076–8086.

Table 1. Characteristics of the data used to assess the accuracy of genomic prediction with different parameters for blending, tuning, and scaling.

Trait	Minimum	Maximum	Mean ± SD	Heritability
Milk yield (kg)	1000	25,993	7940.10 ± 2615.10	0.28
Protein (kg)	25	857.19	290.23 ± 100.25	0.21
Fat (kg)	26	833.88	252.54 ± 82.98	0.25

SD = Standard deviation.

Table 2. Comparison of the realized accuracy of the EBV from pedigree-based BLUP (ABLUP) and the single-step GBLUP (ssGBLUP) analyses.

Model	Milk	Protein	Fat
ABLUP	0.01	0.03	0.03
ssGBLUP	0.23	0.29	0.30

ABLUP = Traditional animal model best linear unbiased prediction; ssGBLUP = Single-step genomic best linear unbiased prediction.

Table 3. Effect of different genetic relationship matrix blending approaches on prediction accuracy.

Model	Milk	Protein	Fat
ssGBLUP_G0.60	0.26	0.32	0.33
ssGBLUP_G0.70	0.26	0.32	0.33
ssGBLUP_G0.80	0.25	0.31	0.32
ssGBLUP_G0.90	0.24	0.30	0.30
ssGBLUP_G0.95	0.23	0.29	0.30

ssGBLUP = Single-step genomic best linear unbiased prediction; G = Genomic relationship matrix blending proportion. The models ssGBLUP_G0.60 to ssGBLUP_G0.95 represent blending strategies with β values of 0.40, 0.30, 0.20, 0.10, and 0.05, respectively.

Table 4. The realized accuracy of the EBV from single-step GBLUP analyses using different tuning options in the construction of the inverse of the combined pedigree and genomic relationship matrix.

Model	Milk	Protein	Fat
ssGBLUP_TG0	0.25	0.30	0.31
ssGBLUP_TG1	0.25	0.30	0.31
ssGBLUP_TG2	0.23	0.29	0.30
ssGBLUP_TG3	0.23	0.29	0.30
ssGBLUP_TG4	0.23	0.29	0.30

ssGBLUP_TG0 = No scaling; ssGBLUP_TG1 = mean(diag(G)) = 1 and mean(offdiag(G)) = 0; ssGBLUP_TG2 = mean(diag(G)) = mean(diag(A₂₂)) and mean(offdiag(G)) = mean(offdiag(A₂₂)); ssGBLUP_TG3 = mean(G) = mean(A₂₂); ssGBLUP_TG4 = rescale G using an Fst adjustment.

Table 5. Effect of different parameter scaling strategies (τ and ω) on prediction accuracy.

	Model	Milk	Protein	Fat
Scaling τ	ssGBLUP_τ 0.60	0.22	0.28	0.28
	ssGBLUP_τ 0.70	0.22	0.29	0.29
	ssGBLUP_τ 0.80	0.23	0.29	0.29
	ssGBLUP_τ 0.90	0.23	0.29	0.30
	ssGBLUP_τ 1	0.23	0.29	0.30
Scaling ω	ssGBLUP_ω 0.60	0.26	0.32	0.34
	ssGBLUP_ω 0.70	0.26	0.32	0.33
	ssGBLUP_ω 0.80	0.25	0.31	0.32
	ssGBLUP_ω 0.90	0.25	0.30	0.31
	ssGBLUP_ω 1	0.23	0.29	0.30

Scaling τ = different parameter scaling strategies for τ; Scaling ω = different parameter scaling strategies for ω.
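The size of the scaling effects in Table 5 is easier to compare as a change from the default setting (τ = 1 or ω = 1). The short sketch below does this arithmetic for the values reported in the table; the dictionary layout and the function name `change_from_default` are illustrative only.

```python
traits = ("Milk", "Protein", "Fat")

# Realized accuracies copied from Table 5, keyed by (parameter, value).
table5 = {
    ("tau", 0.60): (0.22, 0.28, 0.28),
    ("tau", 0.70): (0.22, 0.29, 0.29),
    ("tau", 0.80): (0.23, 0.29, 0.29),
    ("tau", 0.90): (0.23, 0.29, 0.30),
    ("tau", 1.00): (0.23, 0.29, 0.30),
    ("omega", 0.60): (0.26, 0.32, 0.34),
    ("omega", 0.70): (0.26, 0.32, 0.33),
    ("omega", 0.80): (0.25, 0.31, 0.32),
    ("omega", 0.90): (0.25, 0.30, 0.31),
    ("omega", 1.00): (0.23, 0.29, 0.30),
}

def change_from_default(param, value):
    """Accuracy at the given setting minus accuracy at the default (value 1)."""
    base = table5[(param, 1.00)]
    row = table5[(param, value)]
    return {t: round(r - b, 2) for t, r, b in zip(traits, row, base)}

print(change_from_default("omega", 0.60))  # {'Milk': 0.03, 'Protein': 0.03, 'Fat': 0.04}
print(change_from_default("tau", 0.60))    # {'Milk': -0.01, 'Protein': -0.01, 'Fat': -0.02}
```

Read this way, lowering ω to 0.60 raised accuracy by 0.03 to 0.04, whereas lowering τ to 0.60 lowered it by 0.01 to 0.02.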

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
