A Note on the Conditioning of the H−1 Matrix Used in Single-Step GBLUP

Nilforooshan, Mohammad Ali

doi:10.3390/ani12223208

Open AccessArticle

A Note on the Conditioning of the H⁻¹ Matrix Used in Single-Step GBLUP

by

Mohammad Ali Nilforooshan

Livestock Improvement Corporation, Private Bag 3016, Hamilton 3240, New Zealand

Animals 2022, 12(22), 3208; https://doi.org/10.3390/ani12223208

Submission received: 6 October 2022 / Revised: 14 November 2022 / Accepted: 17 November 2022 / Published: 19 November 2022

(This article belongs to the Section Animal Genetics and Genomics)

Download

Browse Figures

Versions Notes

Abstract

Simple Summary

Compared to BLUP, in single-step genomic BLUP,

G^{- 1} - A_{22}^{- 1}

is added to the inverse of the pedigree relationship matrix (

A^{- 1}

), forming

H^{- 1}

, where G is the genomic relationship matrix, and

A_{22}

is the block of A for genotyped animals. Incompatibility between G and A may cause inflated genetic variance. Blending and tuning G with

A_{22}

partially solves the problem. However, conditioning

H^{- 1}

might still be needed, which is usually performed via

τ G^{- 1} - ω A_{22}^{- 1}

. This may violate the properties upon which H is built. Alternative ways of weighting the

H^{- 1}

components are presented to prevent/minimise violations of the properties of H.

Abstract

The single-step genomic BLUP (ssGBLUP) is used worldwide for the simultaneous genetic evaluation of genotyped and non-genotyped animals. It is easily extendible to all BLUP models by replacing the pedigree-based additive genetic relationship matrix (A) with an augmented pedigree–genomic relationship matrix (H). Theoretically, H does not introduce any artificially inflated variance. However, inflated genetic variances have been observed due to the incomparability between the genomic relationship matrix (G) and A used in H. Usually, G is blended and tuned with

A_{22}

(the block of A for genotyped animals) to improve its numerical condition and compatibility. If deflation/inflation is still needed, a common approach is weighting

G^{- 1} - A_{22}^{- 1}

in the form of

τ G^{- 1} - ω A_{22}^{- 1}

, added to

A^{- 1}

to form

H^{- 1}

. In some situations, this can violate the conditional properties upon which H is built. Different ways of weighting the

H^{- 1}

components (

A^{- 1}

,

G^{- 1}

,

A_{22}^{- 1}

, and

H^{- 1}

itself) were studied to avoid/minimise the violations of the conditional properties of H. Data were simulated on ten populations and twenty generations. Responses to weighting different components of

H^{- 1}

were measured in terms of the regression of phenotypes on the estimated breeding values (the lower the slope, the higher the inflation) and the correlation between phenotypes and the estimated breeding values (predictive ability). Increasing the weight on

H^{- 1}

increased the inflation. The responses to weighting

G^{- 1}

were similar to those for

H^{- 1}

. Increasing the weight on

A^{- 1}

(together with

A_{22}^{- 1}

) was not influential and slightly increased the inflation. Predictive ability is a direct function of the slope of the regression line and followed similar trends. Responses to weighting

G^{- 1} - A_{22}^{- 1}

depend on the inflation/deflation of evaluations from

A^{- 1}

to

H^{- 1}

and the compatibility of the two matrices with the heritability used in the model. One possibility is a combination of weighting

G^{- 1} - A_{22}^{- 1}

and weighting

H^{- 1}

. Given recent advances in ssGBLUP, conditioning

H^{- 1}

might become an interim solution from the past and then not be needed in the future.

Keywords:

conditional property; inflated; relationship matrix; single-step GBLUP; weighting

1. Introduction

The unified genetic evaluation of genotyped and non-genotyped animals has been of great interest. In an initial attempt, Misztal et al. [1] suggested a unified pedigree (A) and genomic (G) relationship matrix (

H_{ini}

), in which genomic relationships between genotyped animals replaced their pedigree relationship coefficients in A. Denoting non-genotyped and genotyped animals with 1 and 2:

H_{ini} = [\begin{matrix} A_{11} & A_{12} \\ A_{21} & G \end{matrix}] = A + [\begin{matrix} 0 & 0 \\ 0 & G - A_{22} \end{matrix}] .

(1)

This relationship matrix did not condition the distributions of breeding values for genotyped and non-genotyped animals on each other, leading to incoherencies in the joint distribution of genetic values for genotyped and non-genotyped animals. Legarra et al. [2] presented an augmented (A and G) relationship matrix in which the genetic values of non-genotyped animals were conditioned to the genetic values of genotyped animals. The resulting matrix was:

\begin{matrix} \begin{matrix} H & = [\begin{matrix} H_{11} & H_{12} \\ H_{21} & H_{22} \end{matrix}] \\ = [\begin{matrix} A_{11} + A_{12} A_{22}^{- 1} (G - A_{22}) A_{22}^{- 1} A_{21} & A_{12} A_{22}^{- 1} G \\ G A_{22}^{- 1} A_{21} & G \end{matrix}] \\ = A + [\begin{matrix} A_{12} A_{22}^{- 1} (G - A_{22}) A_{22}^{- 1} A_{21} & A_{12} A_{22}^{- 1} (G - A_{22}) \\ (G - A_{22}) A_{22}^{- 1} A_{21} & G - A_{22} \end{matrix}], \end{matrix} \end{matrix}

(2)

which can be simplified to any of the following:

\begin{matrix} H & = [\begin{matrix} A_{11} - A_{12} A_{22}^{- 1} A_{21} & 0 \\ 0 & 0 \end{matrix}] + [\begin{matrix} A_{12} A_{22}^{- 1} \\ I \end{matrix}] G [\begin{matrix} A_{22}^{- 1} A_{21} \\ I \end{matrix}], \end{matrix}

(3)

\begin{matrix} H & = [\begin{matrix} {(A^{11})}^{- 1} & 0 \\ 0 & 0 \end{matrix}] + [\begin{matrix} A_{12} A_{22}^{- 1} \\ I \end{matrix}] G [\begin{matrix} A_{22}^{- 1} A_{21} \\ I \end{matrix}], \end{matrix}

(4)

\begin{matrix} H & = A + [\begin{matrix} A_{12} A_{22}^{- 1} \\ I \end{matrix}] (G - A_{22}) [\begin{matrix} A_{22}^{- 1} A_{21} \\ I \end{matrix}] . \end{matrix}

(5)

In matrix H, the genomic information in G influences the relationships between non-genotyped and genotyped animals and among non-genotyped animals. Later, it was discovered that

H^{- 1}

can be indirectly obtained without forming and inverting H [3,4].

H^{- 1} = [\begin{matrix} H^{11} & H^{12} \\ H^{21} & H^{22} \end{matrix}] = [\begin{matrix} A^{11} & A^{12} \\ A^{21} & H^{22} \end{matrix}] = A^{- 1} + [\begin{matrix} 0 & 0 \\ 0 & G^{- 1} - A_{22}^{- 1} \end{matrix}] .

(6)

Note that:

\begin{matrix} G^{- 1} - A_{22}^{- 1} & = H_{22}^{- 1} - A_{22}^{- 1} \\ = H^{22} - H^{21} {(H^{11})}^{- 1} H^{12} - A^{22} + A^{21} {(A^{11})}^{- 1} A^{12} \\ = H^{22} - A^{21} {(A^{11})}^{- 1} A^{12} - A^{22} + A^{21} {(A^{11})}^{- 1} A^{12} \\ = H^{22} - A^{22} . \end{matrix}

Matrix G is not always full-rank (e.g., when the number of genotyped animals is greater than the number of loci or when there are duplicated genotypes, such as for identical twins). To force G to be positive-definite and avoid large diagonal values of

G^{- 1}

due to the bad numerical condition of G, the first step of conditioning G often involves blending it with

A_{22}

, which is always positive-definite (except in the existence of identical twins or clones [5]) and of good numerical conditions (i.e.,

G \leftarrow (1 - k) G + k A_{22}

, 0 < k < 1). Blending introduces residual polygenic effects (genetic effects not captured by genetic markers) to the evaluation model without explicitly modelling it, where the scalar k is the ratio of the polygenic to the total additive genetic variance [6].

It is theoretically true that no artificially inflated variance is introduced via the H matrix [2]. However, inflated genetic variances have been observed due to incompatibilities between G and

A_{22}

[6,7,8,9]. Incompatible G and

A_{22}

lead to incorrectly weighted pedigree and genomic information [7,8]. Besides different distributions of G and

A_{22}

elements, incomplete and incorrect pedigree information, and genotyping and imputation errors, incompatibilities between G and

A_{22}

can be due to the non-random selection of genotyped animals [10], and the different bases and scales of the two matrices [7]. Matrices

A_{22}

and G regress data to different means. Matrix

A_{22}

regresses solutions towards pedigree founders, animals in the pedigree with unknown parents or genetic groups if considered in the pedigree. On the other hand, G regresses solutions toward a founder population comprising genotyped animals [5,10] since the real allele frequencies in the founder population are unknown. The average genetic merit of genotyped animals can be different from founders, especially in the presence of selection. Different approaches (referred to as tuning) have been used for correcting the base difference between G and

A_{22}

[7,11] and rebasing and scaling G to improve its consistency with

A_{22}

[10]. Those approaches were tested by Nilforooshan [9] on New Zealand Romney sheep. Christensen [8] and Gao et al. [6] tuned G by regressing its averages to the averages of

A_{22}

(Equations (7) and (8), respectively).

\begin{matrix} \{\begin{matrix} μ (diag (G)) β + α = μ (diag (A_{22})) \\ μ (G) β + α = μ (A_{22}) \end{matrix} \end{matrix}

(7)

\begin{matrix} \{\begin{matrix} μ (diag (G)) β + α = μ (diag (A_{22})) \\ μ (offdiag (G)) β + α = μ (offdiag (A_{22})) \end{matrix} \end{matrix}

(8)

The

α

and

β

scalars obtained by solving either of the equations above are used for transforming G into

β G + α 11^{'}

. Another solution proposed to tackle the problem of inflated genomic evaluations (i.e., an increased variance of genomic predictions) as a result of incorrectly scaled genomic and pedigree information was scaling

G^{- 1} - A_{22}^{- 1}

in the form of

τ G^{- 1} - ω A_{22}^{- 1}

[3,12,13]. Applying

τ G^{- 1} - ω A_{22}^{- 1}

is equivalent to transforming G into

{[τ G^{- 1} - (ω - 1) A_{22}^{- 1}]}^{- 1}

[3,9], which equals

G {[(1 - ω) G + τ A_{22}]}^{- 1} A_{22}

. It is also equivalent to replacing

G - A_{22}

with

{[τ G^{- 1} - (ω - 1) A_{22}^{- 1}]}^{- 1}

in Equation (2) [12].

Reducing

τ

and

ω

values toward 0 brings G closer to

A_{22}

by bringing

H^{22}

closer to

A^{22}

. However, it is not easily quantifiable how G and

A_{22}

are proportionally combined. With

τ

and

ω

deviating from each other and 1, there is a risk of distorting the conditional properties of H, because the changes made in

H^{22}

are not reflected in other blocks of

H^{- 1}

. Whereas 1 – k and k are the commonly used blending coefficients of G and

A_{22}

,

τ

and

ω

are the commonly used blending coefficients of

H^{- 1}

and

A^{- 1}

. i.e.,

A^{- 1} + [\begin{matrix} 0 & 0 \\ 0 & τ G^{- 1} - ω A_{22}^{- 1} \end{matrix}] = ω H^{- 1} + (1 - ω) A^{- 1} + [\begin{matrix} 0 & 0 \\ 0 & (τ - ω) G^{- 1} \end{matrix}] .

(9)

Considering the above equation, there is no legitimate reason for

ω

being out of the boundary of 0 and 1, and

τ - ω

being out of the boundary of –1 and 1. Martini et al. [12] studied

τ

ranging from 0.1 to 2, and

ω

ranging from –1 to 1 by steps of 0.1, leading to 420 analyses. Dealing with two parameters increases the number of analyses and validation tests in a two-dimensional space. It is assuming that the k coefficient has already been chosen and does not need to be validated. The most coherent approach for finding k is by restricted maximum likelihood (REML), as proposed by Christensen and Lund [4], rather than using empirical values by screening and validation.

Weighting

G^{- 1}

and

A_{22}^{- 1}

as

τ G^{- 1} - ω A_{22}^{- 1}

has been used until recently [12,13,14,15,16,17]. Several improvements have been made to ssGBLUP [18] and the use of

τ G^{- 1} - ω A_{22}^{- 1}

is declining. For example, one of the factors leading to the need for an

ω

considerably less than 1 was that inbreeding coefficients were considered in

A_{22}^{- 1}

but not in

A^{- 1}

[19]. The aim of this study was to communicate the problems that might occur using

τ G^{- 1} - ω A_{22}^{- 1}

, and investigate the possible solutions for weighting the

H^{- 1}

components if the modifications in G are not satisfactory and the weighting of the

H^{- 1}

components is still needed for the deflation/inflation of genomic breeding values.

2. Methods

2.1. Possible Problems with $τ G^{- 1} - ω A_{22}^{- 1}$

The

(τ - ω) G^{- 1}

matrix in Equation (9) is unconditional and not reflected in the other blocks of

H^{- 1}

. As such, some combinations of

τ \neq ω

potentially distort the conditional properties of H. However, any

τ = ω

ranging from 0 to 1 is legitimate and can be considered as a blending of

H^{- 1}

and

A^{- 1}

. While it might make sense to weight

G^{- 1}

and

A_{22}^{- 1}

to bring them closer to each other and make them more compatible, weighting

A_{22}^{- 1}

causes incompatibility between

A_{22}^{- 1}

and

A^{- 1}

. Matrix

H^{- 1}

can also be written as:

\begin{matrix} H^{- 1} & = [\begin{matrix} I \\ - A_{22}^{- 1} A_{21} \end{matrix}] A^{11} [\begin{matrix} I \\ - A_{12} A_{22}^{- 1} \end{matrix}] + [\begin{matrix} 0 & 0 \\ 0 & G^{- 1} \end{matrix}] \end{matrix}

(10)

\begin{matrix} = [\begin{matrix} A^{11} & A^{12} \\ A^{21} & A^{22} - A_{22}^{- 1} \end{matrix}] + [\begin{matrix} 0 & 0 \\ 0 & G^{- 1} \end{matrix}] . \end{matrix}

(11)

Weighting the components of

[\begin{matrix} I \\ - A_{22}^{- 1} A_{21} \end{matrix}] A^{11} [\begin{matrix} I \\ - A_{12} A_{22}^{- 1} \end{matrix}]

in Equation (10), the aim is to preserve the existing quadratic form. This study aimed to introduce weighting on the

H^{- 1}

components that are unlikely to introduce distortions to the conditional properties of H. Weighting

H^{- 1}

can be performed on any of the following components:

1.: $H^{- 1}$ itself
2.: $G^{- 1} - A_{22}^{- 1}$
3.: $G^{- 1}$
4.: $A^{- 1}$
5.: $A^{11}$
6.: $A^{22}$
7.: $A_{22}^{- 1}$

2.2. Weighting $H^{- 1}$

This scenario is helpful when the heritability estimate (h²) does not match the data or

H^{- 1}

. Heritability may change over time and as a result of selection. An outdated h² may differ from the current h² of the trait in the population. Estimating variance components is a computationally expensive process. The h² estimate might have been from a population subset or via a matrix other than

H^{- 1}

(

A^{- 1}

or

G^{- 1}

). Different relationship matrices contain different information and may result in different genetic variances and h² estimates [20].

2.3. Weighting $G^{- 1} - A_{22}^{- 1}$

Aguilar et al. [3] suggested using equal

τ

and

ω

. Weighting

G^{- 1} - A_{22}^{- 1}

by

α

is equivalent to

α H^{- 1} + (1 - α) A^{- 1}

.

2.4. Weighting $G^{- 1}$

This scenario can be understood as scaling the h² corresponding to

G^{- 1}

to the h² corresponding to

A^{- 1}

. No violation is made to the conditional properties of

H^{- 1}

, and weighting

G^{- 1}

by

α

is equivalent to using

G / α

in H. Therefore, instead of G,

G / α

is propagated through the blocks of H. A

G / α

more compatible with

A_{22}

would bring G closer to and more compatible with A.

2.5. Weighting $A^{- 1}$

This scenario can be understood as scaling the h² corresponding to

A^{- 1}

to the h² corresponding to

G^{- 1}

. In response to

A^{- 1}

weighted by

α

,

G^{- 1} - A_{22}^{- 1}

in Equation (6) should be changed to

G^{- 1} - α A_{22}^{- 1}

, which is equivalent to multiplying

[\begin{matrix} A^{11} & A^{12} \\ A^{21} & A^{22} - A_{22}^{- 1} \end{matrix}]

in Equation (11) by

α

. With an h² estimate based on pedigree information, weighting

G^{- 1}

is preferred over weighting

A^{- 1}

.

2.6. Weighting $A^{11}$

Considering Equation (10), weighting

A^{11}

is equivalent to weighting all the components of

H^{- 1}

, except

G^{- 1}

, similar to that of the weighting

A^{- 1}

scenario.

2.7. Weighting $A^{22}$

Considering Equation (11), weighting

A^{22}

should coincide with weighting the other blocks of

A^{- 1}

to preserve its conditional properties, as well as weighting

A_{22}^{- 1}

, similar to that of the weighting

A^{- 1}

scenario.

2.8. Weighting $A_{22}^{- 1}$

Considering Equation (10), weighting

A_{22}^{- 1}

is equivalent to:

\begin{matrix} H^{- 1} & = [\begin{matrix} I \\ - \sqrt{α} A_{22}^{- 1} A_{21} \end{matrix}] A^{11} [\begin{matrix} I \\ - \sqrt{α} A_{12} A_{22}^{- 1} \end{matrix}] + [\begin{matrix} 0 & 0 \\ 0 & G^{- 1} \end{matrix}] \\ = [\begin{matrix} A^{11} & \sqrt{α} A^{12} \\ \sqrt{α} A^{21} & α (A^{22} - A_{22}^{- 1}) \end{matrix}] + [\begin{matrix} 0 & 0 \\ 0 & G^{- 1} \end{matrix}] \\ = [\begin{matrix} I & 0 \\ 0 & \sqrt{α} I \end{matrix}] [\begin{matrix} A^{11} & A^{12} \\ A^{21} & A^{22} \end{matrix}] [\begin{matrix} I & 0 \\ 0 & \sqrt{α} I \end{matrix}] + [\begin{matrix} 0 & 0 \\ 0 & G^{- 1} - α A_{22}^{- 1} \end{matrix}] . \end{matrix}

(12)

However, this is not recommended as it imposes a different pedigree-based h² on the genotyped and non-genotyped animals in

A^{- 1}

. Furthermore, as

α

becomes smaller, the relationships between genotyped and non-genotyped animals are weakened.

2.9. The Experiments

Since the scenarios of weighting

A^{11}

and

A^{22}

are equivalent to weighting

A^{- 1}

, and weighting

A_{22}^{- 1}

is not recommended, the four scenarios of weighting

H^{- 1}

,

G^{- 1} - A_{22}^{- 1}

,

G^{- 1}

, and

A^{- 1}

were tested. These scenarios were tested with

α

ranging from 0.8 to 1.2 to know the responses of each

H^{- 1}

conversion to the deviation of

α

from 1. Because weighting

G^{- 1} - A_{22}^{- 1}

requires

α

to be between 0 and 1, it was studied with

α

ranging from 0.8 to 1. Predictive ability was calculated as Pearson’s correlation between the phenotypes and the estimated breeding values. Phenotypes were regressed on the estimated breeding values, where a lower slope means inflation and a higher slope means deflation.

3. Materials

Data were simulated for a species in a 1:1 sex ratio, litter size of 2, and generation overlap of 1. The pedigree, phenotypes, and genotypes were simulated using the R package pedSimulate [21]. Initially, ten generations were simulated, starting with a base generation (F0) of 100 animals (50 of each sex). No non-random pre-mating mortality or selection was applied to F0. Genotypes were simulated on 5000 markers, and allele frequencies were sampled from a uniform distribution ranging from 0.1 to 0.9. Marker (allele substitution) effects were simulated from a gamma distribution with shape and rate parameters equal to 2. The distribution was rebased to have a mean of 0 and scaled to create a variance of (true) marker breeding values in F0,

σ_{g}^{2}

= 9. Residual polygenic and environment (residual) effects were simulated from normal distributions with variances

σ_{a}^{2}

= 1 and

σ_{e}^{2}

= 30, respectively.

Following F0, half of the males were mated to half of the females, which were all randomly selected and mated. Where the numbers of mating animals per sex were not equal, the sex with the higher number of animals underwent random selection to match the number of animals of the opposite sex. These ten generations were followed by ten more generations, in which 50% of male candidates (to become sires of the next generation) were selected for their marker breeding value and mated to the same number of randomly selected females. Genotypes in each subsequent generation were obtained by combining sampled gametes from the parents’ genotypes.

Phenotypes were calculated as

y = μ 1 + g + a + e

, where

μ

is the population mean, and g, a, and e are the vectors of effects corresponding to

σ_{g}^{2}

,

σ_{a}^{2}

, and

σ_{e}^{2}

. Genotypes before F8 and phenotypes for the last generation (F19) and before F7 were set to missing. Randomly, 5% of the known dams and 5% of the known sires (after F0) were set to missing. As such, missing pedigree and phenotype information, genomic pre-selection, and base and scale deviations between A and G were accommodated in the simulation. Data simulation was repeated ten times to reduce the possibility of observing the results specific to a dataset.

No fixed effect was simulated, and the data were analysed using the following mixed model equations:

[\begin{matrix} 1^{'} 1 & 1^{'} Z \\ Z^{'} 1 & Z^{'} Z + H^{- 1} \frac{σ_{e}^{2}}{σ_{g}^{2} + σ_{a}^{2}} \end{matrix}] [\begin{matrix} \hat{μ} \\ \hat{u} \end{matrix}] = [\begin{matrix} \sum y \\ Z^{'} y \end{matrix}],

(13)

where Z is the matrix relating phenotypes to animals, 1 and

\hat{u}

are the vectors of ones and predicted breeding values, and

\hat{μ}

is the mean estimate. Matrix G was used in

H^{- 1}

and built according to method 1 of VanRaden [5], where

G = {WW}^{'} / 2 \sum p (1 - p)

, W is the centred and scaled genotype matrix, and p is the marker allele frequency. Markers with minor allele frequency below 0.02 were discarded before calculating G. Then, G was blended as

G \leftarrow 0.9 G + 0.1 A_{22}

.

4. Results

The simulated pedigrees had a population size of 2162.8 ± 358.3 (

μ \pm

sd), 1326.4 ± 298.2 genotypes, 1324.6 ± 277.2 phenotypes, 1074.7 ± 156.8 males, and 1088.1 ± 202.9 females. Inflation and predictive ability estimates over the ten simulated pedigrees were averaged and presented (Figure 1 and Figure 2).

Different

H^{- 1}

components were weighted by

α

ranging from 0.8 to 1.2, except for

G^{- 1} - A_{22}^{- 1}

, where

α

ranged from 0.8 to 1. Weighting

H^{- 1}

and

G^{- 1}

showed similar trends for inflation (Figure 1) and predictive ability (Figure 2), with the slope of the trends being slightly less for

G^{- 1}

compared to

H^{- 1}

. Weighting

A^{- 1}

(accompanied by weighting

A_{22}^{- 1}

) showed slightly decreasing trends, with the regression slope decreasing by 0.01 (i.e., inflation increasing by 0.01) and the predictive ability decreasing by 4.4

\times 10^{- 3}

over the range of

α

. The inflation and prediction ability increased by weighting

G^{- 1} - A_{22}^{- 1}

with

α

decreasing from 1 to 0.8.

5. Discussion

Matrices G and

A_{22}

indicate different means and variances for genotyped animals. This can cause differently scaled genomic and pedigree information in

H^{- 1}

[3]. Usually, G is blended and tuned (rebased and scaled) with

A_{22}

. If genomic breeding values are still inflated, a complementary weighting of

G^{- 1} - A_{22}^{- 1}

might be needed. A common practice is to weight using

τ G^{- 1} - ω A_{22}^{- 1}

. It was shown that some

τ \neq ω

combinations are likely to distort the properties of H that provide conditionality between the breeding values of genotyped and non-genotyped animals. Other ways of weighting the components of

H^{- 1}

were presented that are unlikely to distort the conditional properties of H.

Weighting

H^{- 1}

with

α

> 1 is equivalent to reducing h² and increasing inflation due to increased dispersion. It is equivalent to adding

(1 - α) / α

to 1/h² or weighting the genetic variance by 1/

α

. Due to selection, h² can be lower than expected. The h² reduction is expected to be greater due to genomic selection. Change of genetic variance by genomic selection is propagated from G throughout H. The predictive ability declined with increasing

α

(Figure 2), which might be concerning. However, predictive ability is a direct function of the slope of the regression line (Figure 1). Therefore, the slope of the regression line (inflation) should be the main concern.

Weighting

A^{- 1}

(accompanied by weighting

A_{22}^{- 1}

) did not influence inflation and predictive ability. Predictive ability and the slope of the regression line decreased slightly (inflation increased slightly) over the increase in

α

. The reason for this is likely that H is a genomic relationship matrix extended from G for genotyped animals to non-genotyped animals via the

A_{12} A_{22}^{- 1}

coefficients (Equations (2)–(5)). As such, G is more influential in defining the variances in H than A. This was confirmed by similar trends for weighting

G^{- 1}

and

H^{- 1}

(Figure 1 and Figure 2). The slopes of the regression line (inflation) and predictive ability were slightly steeper for

H^{- 1}

than for

G^{- 1}

, and that was a result of the combined weighting of

G^{- 1}

,

A^{- 1}

and

A_{22}^{- 1}

. Weighting

G^{- 1} - A_{22}^{- 1}

by

α

< 1 increased the inflation but at a lower rate than weighting

H^{- 1}

or

G^{- 1}

with

α

> 1.

The inflation results are expected to be valid for other data as weighting

H^{- 1}

or its components is equivalent to inversely weighting the genetic variance, regardless of the data. The exception is weighting

G^{- 1} - A_{22}^{- 1}

. Whether weighting

G^{- 1} - A_{22}^{- 1}

with a larger

α

results in inflation or deflation depends on whether using

H^{- 1}

instead of

A^{- 1}

results in inflation or deflation. If using

H^{- 1}

results in inflation, then weighting

G^{- 1} - A_{22}^{- 1}

with a larger

α

(more emphasis on

H^{- 1}

than

A^{- 1}

) results in greater inflation. The predictive ability improved by weighting

G^{- 1} - A_{22}^{- 1}

with

α

decreasing from 1 to 0.8. Generally, predictive ability increases by the increase in the slope of the regression line. Notice that the predictive ability ignoring inflation can be misleading. Since the trends for prediction ability and the slope of the regression line were in opposite directions for weighting

G^{- 1} - A_{22}^{- 1}

, it shows that the predictive ability benefited from blending

H^{- 1}

and

A^{- 1}

, mainly because the h² was more compatible with a blended

H^{- 1}

and

A^{- 1}

than with

H^{- 1}

.

This study does not completely rule out using

τ G^{- 1} - ω A_{22}^{- 1}

. However, weighting

H^{- 1}

components should meet specific conditions to avoid/minimise violating the conditional properties of H. As such,

\begin{matrix} A^{- 1} + [\begin{matrix} 0 & 0 \\ 0 & α (G^{- 1} - A_{22}^{- 1}) \end{matrix}], \\ τ (A^{- 1} + [\begin{matrix} 0 & 0 \\ 0 & α (G^{- 1} - A_{22}^{- 1}) \end{matrix}]), \\ A^{- 1} + [\begin{matrix} 0 & 0 \\ 0 & α G^{- 1} - A_{22}^{- 1} \end{matrix}], \end{matrix}

and

α H^{- 1}

are better alternatives to

τ G^{- 1} - ω A_{22}^{- 1}

. By definition, none of these four options are better than the others. However, achieving good compatibility between the resulting

H^{- 1}

and h² without blending

H^{- 1}

and

A^{- 1}

at a high rate (low emphasis on genomic information) is important.

Concerning pedigree and genomic errors, regardless of the emphasis given to pedigree and genomic information, genotype errors propagate through non-genotyped animals, and pedigree errors incorrectly and insufficiently propagate genotype information through non-genotyped animals. Therefore, the correctness and the completeness of pedigree and genomic information are vital for accurate and unbiased ssGBLUP evaluations.

Future research may focus on changing genetic parameters over time or across populations in genomic predictions. It is possible to reduce inflation in genomic predictions for young animals by using smaller additive genetic variances. This can be done by replacing

H^{- 1}

with

{DH}^{- 1} D

. Considering no overall weight on

H^{- 1}

:

\sum {DH}^{- 1} D = \sum H^{- 1}

. Matrix D is a diagonal matrix of positive values descending in function of the animal’s age. The researcher would need to decide the

\min (d) \leq \frac{σ_{e}}{σ_{g}} \leq \max (d)

range, where d = diag(D). With recent advances in ssGBLUP (mentioned by Misztal et al. [18]), which improve the compatibility between A and G, conditioning

H^{- 1}

might become an interim solution from the past or be reduced to only weighting

H^{- 1}

.

Funding

This work was supported by the NZ Ministry for Primary Industries, SFF Futures Programme: Resilient Dairy-Innovative breeding for a sustainable dairy future (grant number PGP06-17006).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data, code, and the results supporting the findings of this study are openly available in Mendeley Data at doi:10.17632/cn9jzpj7fg.1 [22].

Conflicts of Interest

M.A.N. is employed at Livestock Improvement Corporation, Hamilton, New Zealand. He declares that the research was conducted in the absence of any commercial or financial interest.

References

Misztal, I.; Legarra, A.; Aguilar, I. Computing procedures for genetic evaluation including phenotypic, full pedigree, and genomic information. J. Dairy Sci. 2009, 92, 4648–4655. [Google Scholar] [CrossRef] [PubMed]
Legarra, A.; Aguilar, I.; Misztal, I. A relationship matrix including full pedigree and genomic information. J. Dairy Sci. 2009, 92, 4656–4663. [Google Scholar] [CrossRef] [PubMed]
Aguilar, I.; Misztal, I.; Johnson, D.L.; Legarra, A.; Tsuruta, S.; Lawlor, T.J. Hot topic: A unified approach to utilise phenotypic, full pedigree, and genomic information for genetic evaluation of Holstein final score. J. Dairy Sci. 2010, 93, 743–752. [Google Scholar] [CrossRef] [PubMed]
Christensen, O.F.; Lund, M.S. Genomic prediction when some animals are not genotyped. Genet. Sel. Evol. 2010, 42, 2. [Google Scholar] [CrossRef] [PubMed]
VanRaden, P.M. Efficient methods to compute genomic predictions. J. Dairy Sci. 2008, 91, 4414–4423. [Google Scholar] [CrossRef] [PubMed]
Gao, H.; Christensen, O.F.; Madsen, P.; Nielsen, U.S.; Zhang, Y.; Lund, M.S.; Su, G. Comparison on genomic predictions using three GBLUP methods and two single-step blending methods in the Nordic Holstein population. Genet. Sel. Evol. 2012, 44, 8. [Google Scholar] [CrossRef] [PubMed]
Forni, S.; Aguilar, I.; Misztal, I. Different genomic relationship matrices for single-step analysis using phenotypic, pedigree and genomic information. Genet. Sel. Evol. 2011, 43, 1. [Google Scholar] [CrossRef] [PubMed]
Christensen, O.F. Compatibility of pedigree-based and marker-based relationship matrices for single-step genetic evaluation. Genet. Sel. Evol. 2012, 44, 37. [Google Scholar] [CrossRef] [PubMed]
Nilforooshan, M.A. Application of single-step GBLUP in New Zealand Romney sheep. Anim. Prod. Sci. 2020, 60, 1139–1144. [Google Scholar] [CrossRef]
Vitezica, Z.G.; Aguilar, I.; Misztal, I.; Legarra, A. Bias in genomic predictions for populations under selection. Genet. Res. 2011, 93, 357–366. [Google Scholar] [CrossRef] [PubMed]
Chen, C.Y.; Misztal, I.; Aguilar, I.; Legarra, A.; Muir, W.M. Effect of different genomic relationship matrices on accuracy and scale. J. Anim. Sci. 2011, 89, 2673–2679. [Google Scholar] [CrossRef] [PubMed]
Martini, J.W.R.; Schrauf, M.F.; Garcia-Baccino, C.A.; Pimentel, E.C.G.; Munilla, S.; Rogberg-Muñoz, A.; Cantet, R.J.C.; Reimer, C.; Gao, N.; Wimmer, V.; et al. The effect of the H⁻¹ scaling factors τ and ω on the structure of H in the single-step procedure. Genet. Sel. Evol. 2018, 50, 16. [Google Scholar] [CrossRef] [PubMed]
Misztal, I.; Aguilar, I.; Legarra, A.; Lawlor, T.J. Choice of parameters for single-step genomic evaluation for type. In Proceedings of the 61st Annual EAAP Meeting, Heraklion, Greece, 23–27 August 2010; p. 357. [Google Scholar]
Kang, H.; Ning, C.; Zhou, L.; Zhang, S.; Yan, Q.; Liu, J.-F. Short communication: Single-step genomic evaluation of milk production traits using multiple-trait random regression model in Chinese Holsteins. J. Dairy Sci. 2018, 101, 11143–11149. [Google Scholar] [CrossRef] [PubMed]
Imai, A.; Kuniga, T.; Yoshioka, T.; Nonaka, K.; Mitani, N.; Fukamachi, H.; Hiehata, N.; Yamamoto, M.; Hayashiet, T. Single-step genomic prediction of fruit-quality traits using phenotypic records of non-genotyped relatives in citrus. PLoS ONE 2019, 14, e0221880. [Google Scholar] [CrossRef] [PubMed]
Alvarenga, A.B.; Veroneze, R.; Oliveira, H.R.; Marques, D.B.D.; Lopes, P.S.; Silva, F.F.; Brito, L.F. Comparing alternative single-step GBLUP approaches and training population designs for genomic evaluation of crossbred animals. Front. Genet. 2020, 11, 263. [Google Scholar] [CrossRef] [PubMed]
Fu, C.; Ostersen, T.; Christensen, O.F.; Xiang, T. Single-step genomic evaluation with metafounders for feed conversion ratio and average daily gain in Danish Landrace and Yorkshire pigs. Genet. Sel. Evol. 2021, 53, 79. [Google Scholar] [CrossRef] [PubMed]
Misztal, I.; Lourenco, D.; Tsuruta, S.; Aguilar, I.; Masuda, Y.; Bermann, M.; Cesarani, A.; Legarra, A. How ssGBLUP became suitable for national dairy cattle evaluations. In Proceedings of the 12th World Congress on Genetics Applied to Livestock Production, Rotterdam, The Netherlands, 3–8 July 2022; p. 357. Available online: https://www.wageningenacademic.com/pb-assets/wagen/WCGALP2022/52_009.pdf (accessed on 5 October 2022).
Lourenco, D.A.L.; Legarra, A.; Tsuruta, S.; Masuda, Y.; Aguilar, I.; Misztal, I. Single-step genomic evaluations from theory to practice: Using SNP chips and sequence data in BLUPF90. Genes 2020, 11, 790. [Google Scholar] [CrossRef] [PubMed]
Legarra, A. Comparing estimates of genetic variance across different relationship models. Theor. Pop. Biol. 2016, 107, 26–60. [Google Scholar] [CrossRef] [PubMed]
Nilforooshan, M.A. pedSimulate—An R package for simulating pedigree, genetic merit, phenotype, and genotype data. R. Bras. Zootec. 2022, 51, e20210131. [Google Scholar] [CrossRef]
Nilforooshan, M.A. Code & Data—A Note on the Conditioning of the H-1 Matrix Used in Single-Step GBLUP. Mendeley Data V1. 2022. Available online: https://doi.org/10.17632/cn9jzpj7fg.1 (accessed on 15 November 2022).

Figure 1. Regression coefficients of the phenotypes on genomic breeding values for different components of

H^{- 1}

weighted by

α

. Each data point is an average of ten observations for the simulated populations.

Figure 1. Regression coefficients of the phenotypes on genomic breeding values for different components of

H^{- 1}

weighted by

α

. Each data point is an average of ten observations for the simulated populations.

Figure 2. Correlation coefficients between phenotypes and genomic breeding values for different components of

H^{- 1}

weighted by

α

. Each data point is an average of ten observations for the simulated populations.

Figure 2. Correlation coefficients between phenotypes and genomic breeding values for different components of

H^{- 1}

weighted by

α

. Each data point is an average of ten observations for the simulated populations.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Nilforooshan, M.A. A Note on the Conditioning of the H⁻¹ Matrix Used in Single-Step GBLUP. Animals 2022, 12, 3208. https://doi.org/10.3390/ani12223208

AMA Style

Nilforooshan MA. A Note on the Conditioning of the H⁻¹ Matrix Used in Single-Step GBLUP. Animals. 2022; 12(22):3208. https://doi.org/10.3390/ani12223208

Chicago/Turabian Style

Nilforooshan, Mohammad Ali. 2022. "A Note on the Conditioning of the H⁻¹ Matrix Used in Single-Step GBLUP" Animals 12, no. 22: 3208. https://doi.org/10.3390/ani12223208

APA Style

Nilforooshan, M. A. (2022). A Note on the Conditioning of the H⁻¹ Matrix Used in Single-Step GBLUP. Animals, 12(22), 3208. https://doi.org/10.3390/ani12223208

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Note on the Conditioning of the H⁻¹ Matrix Used in Single-Step GBLUP

Abstract

Simple Summary

Abstract

1. Introduction

2. Methods

2.1. Possible Problems with $τ G^{- 1} - ω A_{22}^{- 1}$

2.2. Weighting $H^{- 1}$

2.3. Weighting $G^{- 1} - A_{22}^{- 1}$

2.4. Weighting $G^{- 1}$

2.5. Weighting $A^{- 1}$

2.6. Weighting $A^{11}$

2.7. Weighting $A^{22}$

2.8. Weighting $A_{22}^{- 1}$

2.9. The Experiments

3. Materials

4. Results

5. Discussion

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

A Note on the Conditioning of the H−1 Matrix Used in Single-Step GBLUP

Abstract

Simple Summary

Abstract

1. Introduction

2. Methods

2.1. Possible Problems with τ G − 1 − ω A 22 − 1

2.2. Weighting H − 1

2.3. Weighting G − 1 − A 22 − 1

2.4. Weighting G − 1

2.5. Weighting A − 1

2.6. Weighting A 11

2.7. Weighting A 22

2.8. Weighting A 22 − 1

2.9. The Experiments

3. Materials

4. Results

5. Discussion

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

A Note on the Conditioning of the H⁻¹ Matrix Used in Single-Step GBLUP

2.1. Possible Problems with $τ G^{- 1} - ω A_{22}^{- 1}$

2.2. Weighting $H^{- 1}$

2.3. Weighting $G^{- 1} - A_{22}^{- 1}$

2.4. Weighting $G^{- 1}$

2.5. Weighting $A^{- 1}$

2.6. Weighting $A^{11}$

2.7. Weighting $A^{22}$

2.8. Weighting $A_{22}^{- 1}$