Advertisement
Research Article| Volume 98, ISSUE 4, P2775-2784, April 2015

Single-step genomic evaluation using multitrait random regression model and test-day data

Open ArchivePublished:February 04, 2015DOI:https://doi.org/10.3168/jds.2014-8975

      Abstract

      The objectives of this study were to evaluate the feasibility of use of the test-day (TD) single-step genomic BLUP (ssGBLUP) using phenotypic records of Nordic Red Dairy cows. The critical point in ssGBLUP is how genomically derived relationships (G) are integrated with population-based pedigree relationships (A) into a combined relationship matrix (H). Therefore, we also tested how different weights for genomic and pedigree relationships affect ssGBLUP, validation reliability, and validation regression coefficients. Deregressed proofs for 305-d milk, protein, and fat yields were used for a posteriori validation. The results showed that the use of phenotypic TD records in ssGBLUP is feasible. Moreover, the TD ssGBLUP model gave considerably higher validation reliabilities and validation regression coefficients than the TD model without genomic information. No significant differences were found in validation reliability between the different TD ssGBLUP models according to bootstrap confidence intervals. However, the degree of inflation in genomic enhanced breeding values is affected by the method used in construction of the H matrix. The results showed that ssGBLUP provides a good alternative to the currently used multi-step approach but there is a great need to find the best option to combine pedigree and genomic information in the genomic matrix.

      Key words

      Introduction

      Most genomic evaluations are based on multi-step approach that requires (1) calculation of traditional EBV without genomic information; (2) extraction of pseudo-observations, typically either daughter yield deviations (DYD) or deregressed EBV (deregressed proofs; DRP); and (3) genomic model for prediction of direct genomic values (DGV;
      • VanRaden P.M.
      Efficient methods to compute genomic predictions.
      ;
      • Hayes B.J.
      • Bowman P.J.
      • Chamberlain A.J.
      • Goddard M.E.
      Invited review Genomic selection in dairy cattle: Progress and challenges.
      ;
      • VanRaden P.M.
      • Van Tassell C.P.
      • Wiggans G.R.
      • Sonstegard T.S.
      • Schnabel R.D.
      Invited review: Reliability of genomic predictions for North American Holstein bulls.
      ). Genomic evaluations can be further improved by combining the DGV and information from traditional EBV (e.g.,
      • VanRaden P.M.
      Efficient methods to compute genomic predictions.
      ) to yield genomic enhanced breeding values (GEBV).
      The multi-step approach to calculate GEBV has an inherent problem. First, the parent averages (PA) of progeny of genomically selected animals do not automatically include genomic information. Second, when animals are selected by their GEBV, the future estimation of unbiased EBV becomes difficult because genomic information is not taken into account in the traditionally calculated EBV. Moreover, genomic selection using the multi-step approach is complex and includes several approximations, all of which reduce accuracy and can inflate the resultant GEBV. None of these issues applies to the single-step approach.
      Single-step evaluation (single-step genomic BLUP; ssGBLUP) is a unified approach to calculate GEBV. The ssGBLUP combines phenotypic records, pedigree information, and genomic information optimally in calculation of GEBV (
      • Misztal I.
      • Legarra A.
      • Aguilar I.
      Computing procedures for genetic evaluation including phenotypic, full pedigree, and genomic information.
      ;
      • Aguilar I.
      • Misztal I.
      • Johnson D.L.
      • Legarra A.
      • Tsuruta S.
      • Lawlor T.J.
      Hot topic: A unified approach to utilize phenotypic, full pedigree, and genomic information for genetic evaluation of Holstein final score.
      ;
      • Christensen O.F.
      • Lund M.S.
      Genomic prediction when some animals are not genotyped.
      ). The approach integrates the pedigree relationship matrix A and genomic relationship matrix G into a single H matrix, which replaces the traditional relationship matrix A in the mixed-model equations (
      • Legarra A.
      • Aguilar I.
      • Misztal I.
      A relationship matrix including full pedigree and genomic information.
      ;
      • Misztal I.
      • Legarra A.
      • Aguilar I.
      Computing procedures for genetic evaluation including phenotypic, full pedigree, and genomic information.
      ;
      • Christensen O.F.
      • Lund M.S.
      Genomic prediction when some animals are not genotyped.
      ). To date, the single-step approach has been rated computationally demanding with large data sets and multi-trait analysis (
      • Su G.
      • Madsen P.
      • Nielsen U.S.
      • Mäntysaari E.A.
      • Aamand G.P.
      • Christensen O.F.
      • Lund M.S.
      Genomic prediction for the Nordic Red Cattle using one-step and selection index blending approaches.
      ). However, ssGBLUP has been successfully applied, for example, for final scores of over 6 million Holsteins with greater accuracy than that of a multi-step procedure (
      • Aguilar I.
      • Misztal I.
      • Johnson D.L.
      • Legarra A.
      • Tsuruta S.
      • Lawlor T.J.
      Hot topic: A unified approach to utilize phenotypic, full pedigree, and genomic information for genetic evaluation of Holstein final score.
      ), and in a multi-trait national genomic evaluation for type traits in US Holsteins (
      • Tsuruta S.
      • Misztal I.
      • Aguilar I.
      • Lawlor T.J.
      Multiple-trait genomic evaluation of linear type traits using genomic and phenotypic data in US Holsteins.
      ). Performance of ssGBLUP has also been evaluated in other species.
      • Chen C.Y.
      • Misztal I.
      • Aguilar I.
      • Tsuruta S.
      • Meuwissen T.H.E.
      • Aggrey S.E.
      • Wing T.
      • Muir W.M.
      Genome-wide marker-assisted selection combining all pedigree phenotypic information and genotypic data in one step: An example using broiler chickens.
      used ssGBLUP to analyze 3 traits in 2 separately selected lines of chickens, and
      • Forni S.
      • Aguilar I.
      • Misztal I.
      Different genomic relationship matrices for single-step analysis using phenotypic, pedigree and genomic information.
      used ssGBLUP to analyze litter size in pigs. Thus, despite its high computational requirements, the single-step method is suitable for large multi-trait analyses. The critical issue with ssGBLUP is compatibility between the marker-based relationship matrix and the pedigree-based relationship matrix for genotyped animals. Applications of the first unified approaches for merging information from animals with or without genotypes by combining the A matrix with the G matrix resulted in biased GEBV (e.g.,
      • Meuwissen T.H.E.
      • Luan T.
      • Woolliams J.A.
      The unified approach to the use of genomic and pedigree information in genomic evaluations revisited.
      ). Since then, it has been demonstrated that accuracy of prediction can be improved and bias reduced by adjusting the G matrix toward their expected values in the A matrix to decrease the scaling problem (e.g.,
      • Vitezica Z.G.
      • Aguilar I.
      • Misztal I.
      • Legarra A.
      Bias in genomic predictions for populations under selection.
      ;
      • Christensen O.F.
      • Madsen P.
      • Nielsen B.
      • Ostersen T.
      • Su G.
      Single-step methods for genomic evaluation in pigs.
      ).
      A random regression test-day (TD) model is currently used for the official Nordic genetic evaluation of production (
      • Lidauer M.H.
      • Pösö J.
      • Pederson J.
      • Lassen J.
      • Madsen P.
      • Mäntysaari E.A.
      • Nielsen U.
      • Eriksson J.-Å.
      • Johansson K.
      • Pitkänen T.
      • Strandén I.
      • Aamand G.P.
      Across-country test-day model evaluations for Nordic Holstein, Red Cattle and Jersey.
      ) in Nordic Red Dairy Cattle (RDC). As more selection decisions are made using genomic information, it is becoming essential that all genomic information is included in national evaluations. The objectives of this study were to evaluate the feasibility of the large random regression TD ssGBLUP, and to estimate the accuracy of GEBV when using this model. We also tested how different combinations of the A and G matrices affect the bias and accuracy of GEBV in the TD ssGBLUP.

      Materials and Methods

      All analyses used the data used in the official Nordic RDC milk production evaluations. The multiple-trait milk production evaluation includes TD records for milk, fat, and protein production. Production records from the first 3 lactations are in the same multiple-trait model. Each trait has random regression function for random genetic and permanent environmental effects. For more information, see
      • Lidauer M.H.
      • Pösö J.
      • Pederson J.
      • Lassen J.
      • Madsen P.
      • Mäntysaari E.A.
      • Nielsen U.
      • Eriksson J.-Å.
      • Johansson K.
      • Pitkänen T.
      • Strandén I.
      • Aamand G.P.
      Across-country test-day model evaluations for Nordic Holstein, Red Cattle and Jersey.
      .
      The routine full evaluation data from May 2014 for the RDC were obtained from the Nordic Cattle Genetic Evaluation (NAV; Aarhus, Denmark). For production traits, the TD data included 3.8 million cows with a total of 85 million records and 5.1 million animals in the Nordic RDC pedigree. To be able to validate the model, a reduced data set was extracted from the full data set, as follows: the last 4 yr of observations were removed and the reduced data included 2.7 million cows with 72 million records. The reduced data set was used to solve GEBV and EBV for all animals in the pedigree, and the full data set was used to solve current EBV for testing purpose. The initial EBV from the reduced data set were denoted EBVr. For the females without observations and bulls without daughters in reduced data, EBVr are hereafter referred to as parent average (PA). Comparing initial predictions from the reduced data set with those from the full data set allowed estimation of validation accuracy (
      • Mäntysaari E.
      • Liu Z.
      • VanRaden P.
      Interbull validation test for genomic evaluations.
      ). The total number of equations in the reduced run was 217,370,251, and in the full run 238,041,030.
      The unified relationship matrix H in single-step evaluations defines the relationships among genotyped and nongenotyped animals. Although H can be expensive to compute, its inverse has a simple structure (
      • Aguilar I.
      • Misztal I.
      • Johnson D.L.
      • Legarra A.
      • Tsuruta S.
      • Lawlor T.J.
      Hot topic: A unified approach to utilize phenotypic, full pedigree, and genomic information for genetic evaluation of Holstein final score.
      ;
      • Christensen O.F.
      • Lund M.S.
      Genomic prediction when some animals are not genotyped.
      ):
      H1=A1+000G1A221,


      where A22 is the sub-matrix of the pedigree-based numerator relationship matrix A for the genotyped animals, and G is the relationship matrix constructed using genomic information. The G matrix had 15,148 genotyped RDC animals, of which 5,534 were bulls and 9,529 cows. The G matrix also included genotypes of animals without offspring or records. Genotypes were obtained from the Illumina Bovine SNP50 Bead Chip (Illumina, San Diego, CA). After application of exclusion criteria, 46,914 SNP markers on the 29 bovine autosomes were available for further analysis. The genotype file was the same as was used in official genomic evaluation of Nordic Cattle Genetic Evaluation in June 2014. Genotypes were used to form the raw G matrix with method 1 in
      • VanRaden P.M.
      Efficient methods to compute genomic predictions.
      ). Before the matrices G and A22 were combined, the raw G matrix was scaled by scalar t=trA22trG, where tr is the trace of matrix. Thus, G has, on average, the same diagonals as the A22 matrix.
      When the mixed-model equation for single-step is considered, the difference from the normal animal model is the matrix block H22 = A22 + G−1A22−1 between genotyped animals. To improve the properties of the ssGBLUP, different weights in building the H22 matrix were tested.
      • Aguilar I.
      • Misztal I.
      • Johnson D.L.
      • Legarra A.
      • Tsuruta S.
      • Lawlor T.J.
      Hot topic: A unified approach to utilize phenotypic, full pedigree, and genomic information for genetic evaluation of Holstein final score.
      and
      • Christensen O.F.
      • Lund M.S.
      Genomic prediction when some animals are not genotyped.
      noted that if not all genetic variance is accounted for by the SNP effects, the residual polygenic effect can be included in the model by changing the genomic matrix G and using H22=A22+Gw1A221, where Gw = (1 − w)G + w A22, and the constant w represents the proportion of polygenic variance not described by markers. So, the smaller w, the more genetic variance that is attributed to genomic markers. We used 3 different proportions w (w = 0.10, w = 0.15, or w = 0.20) in Gw. In
      • Christensen O.F.
      • Madsen P.
      • Nielsen B.
      • Ostersen T.
      • Su G.
      Single-step methods for genomic evaluation in pigs.
      , the optimal w was found to be 0.20, although differences in accuracy with different w were small. In
      • Misztal I.
      • Aguilar I.
      • Legarra A.
      • Lawlor T.J.
      Choice of parameters for single-step genomic evaluation for type.
      ) and
      • Tsuruta S.
      • Misztal I.
      • Aguilar I.
      • Lawlor T.J.
      Multiple-trait genomic evaluation of linear type traits using genomic and phenotypic data in US Holsteins.
      , the H22 matrix was further scaled to be H22=A22+τGw1ωA221.
      • Misztal I.
      • Tsuruta S.
      • Aguilar I.
      • Legarra A.
      • VanRaden P.M.
      • Lawlor T.J.
      Methods to approximate reliabilities in single-step genomic evaluation.
      suggested that optimal weights τ for Gw1 and ω for A221 decrease the possible inflation of GEBV estimated by ssGBLUP. The parameters τ and ω scale the size of the genomic and pedigree relationships, respectively. The larger τ is, the less weight is given to G, whereas larger values of ω decrease the importance of pedigree relationships and increase the importance of genomic relationships. We tested 4 combinations of these parameters. The first was the combination found best in
      • Misztal I.
      • Aguilar I.
      • Legarra A.
      • Lawlor T.J.
      Choice of parameters for single-step genomic evaluation for type.
      : τ = 1.5 and ω = 0.6. In the other set ups, the weights were selected to be τ = 1.6 and ω = 0.5, τ = 1.6 and ω = 1.0, or τ = 1.0 and ω = 0.5. In the tests with different τ and ω, the proportion of polygenic variance was fixed to w = 0.10 in Gw. In the following, these different methods are referred to as follows: w20, w15, w10, τ1.6ω1.0, τ1.0ω0.5, τ1.6ω0.5, and τ1.5ω0.6. Note that the method using w = 0.10 corresponds to the situation where τ = 1.0 and ω = 1.0; however, we refer to this method as w10, instead of τ1.0ω1.0.
      The analyses used the NAV routine EBV milk production evaluation model, in which a multiple-trait TD model is used to estimate EBV of milk, fat, and protein simultaneously. The GEBV were obtained from the TD ssGBLUP. Unknown parents by genetic groups and inbreeding coefficients were taken into account in the model. The mixed model equations were solved by MiX99 software using iteration on data and preconditioned conjugate gradients (PCG) iteration (
      • Strandén I.
      • Lidauer M.
      Solving large mixed models using preconditioned conjugate gradient iteration.
      ). Computations for EBV and GEBV were very similar. Equations to solve EBV used inverse of A−1, which was replaced by alternative H−1 matrices in the single-step method for GEBV. In the PCG algorithm, the iteration involves multiplication of search direction vector v by the MME coefficient matrix. The implementation of the single-step method in MiX99 splits the required matrix multiplications into several steps. The first step is the multiplication of the least squares part of the coefficient matrix and the second is the product H−1v. This is further divided into 2 steps. First, the product A−1v is computed directly by reading the pedigree file as is done in the traditional EBV calculation. Second, the product H22v is calculated by reading [H22A22] from a separate file during each PCG iteration cycle. Thus, the only additional work for the single step in solving MME is the matrix times vector product H22v in each PCG iteration where H22A22=Gw1A221. However, due to different convergence in the iteration with different models, extra work may be required for the single-step method.
      All the analyses of the results used the official 305-d lactation total yields of milk, protein, and fat (Lidauer et al., 2014). For validation of GEBV, the EBV for the lactation totals were obtained from the full data analysis. These were then deregressed for bull GEBV validation and cow GEBV validation using the Secant method in option DeRegress (
      • Strandén I.
      • Mäntysaari E.A.
      A recipe for multiple trait deregression.
      ) in MiX99 software. The effective record contribution (ERC) was used as a weighting factor in the deregressions. The ERC (
      • Taskinen M.
      • Mäntysaari E.
      • Lidauer M.
      • Knürr T.
      • Pösö J.
      • Su G.
      • Aamand G.P.
      • Strandén I.
      Comparison of model reliabilities from single-step and bivariate blending methods.
      ) were calculated by the ApaX99 program (
      • Strandén I.
      • Lidauer M.
      • Mäntysaari E.A.
      • Pösö J.
      Calculation of Interbull weighting factors for the Finnish test day model.
      ) for all animals in the pedigree when full data was used. Variance parameters in ERC approximation were from the average daily TD model, and the same values (h2milk = 0.48, h2protein = 0.48, and h2fat = 0.49) were used throughout the study. Deregressions used the full pedigree in NAV evaluation and EBV for the bulls and cows from the full evaluation model. The 3 traits were deregressed simultaneously but assuming genetic and residual correlations to be zero. We chose to use DRP also for the bulls in the validation calculations instead of DYD to have directly comparable results with sire model GBLUP studies, including those in
      • Koivula M.
      • Strandén I.
      • Su G.
      • Mäntysaari E.A.
      Different methods to calculate genomic predictions—Comparisons of SNP-BLUP, G-BLUP and H-BLUP.
      .
      Bulls born between years 2006 and 2009 and having their EBV based on ERC ≥3.0 in the full data but having PA with ERC = 0.0 in the reduced data were defined as candidate bulls. For a bull, the phenotypic information obtained with ERC = 3.0 corresponds roughly to 20 daughters. For the cow validation, genotyped cows with no TD records in the reduced data and a minimum of 5 TD records in the full data were considered candidate cows. Finally, in the validation test, we had 707 candidate bulls and 7,113 candidate cows. Validation reliability of predictions was assessed using the Interbull validation protocol (
      • Mäntysaari E.
      • Liu Z.
      • VanRaden P.
      Interbull validation test for genomic evaluations.
      ):
      y=1b0+b1aˆ+e,


      where y has the DRP of the candidate bulls or cows in the full data, b0 and b1 are unknown regression coefficients, â has the genomic prediction for bulls or cows based on the reduced data analysis (GEBV), and e is the residual error. The validation reliability of the model was obtained from the coefficient of determination (R2) of the model (R2model), after correcting it by the average reliability of DRP r¯DRP2 of the candidate bulls or cows; that is, R2validation=R2model/r¯DRP2. The reliabilities of DRP were calculated as rDRP2=ERCi/ERCi+λ, where λ = (1 − h2)/h2. To estimate the further gain from the genomic information over the traditional PA (
      • VanRaden P.M.
      • Van Tassell C.P.
      • Wiggans G.R.
      • Sonstegard T.S.
      • Schnabel R.D.
      Invited review: Reliability of genomic predictions for North American Holstein bulls.
      ;
      • Mäntysaari E.
      • Liu Z.
      • VanRaden P.
      Interbull validation test for genomic evaluations.
      ), the same validation tests were also applied to PA. Confidence intervals (CI) were estimated for the regression coefficients (b1), the validation reliabilities, and the differences between the validation reliabilities among the model alternatives (e.g., R2validation,w20 − R2validation,w15) using nonparametric bootstrap. The boot and boot.ci functions of the R package (

      R Core Development Team. 2012. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. http://www.R-project.org/

      ) were used to calculate the bootstrap CI separately for candidate bulls and cows. Number of bootstrap samples was 10,000. Bootstrap confidence intervals calculated using 3 methods: basic, norm, and perc. Confidence intervals by the basic method are given, because all methods gave approximately the same values.

      Results and Discussion

      The number of iterations for EBV estimation was 3,818, that for EBVr was 3,705, and that for the ssGBLUP varied from 3,545 to 4,990 depending on the method used. The models were run into the same level of convergence: Ca=10.5, where Ca = relative difference between left- and right-hand side of the part of the MME that includes the equations of the additive genetic animal effects. All models took about 35 to 54 h to run with 4 Intel Xeon 3.6 GHz processors. There was an approximately 15% difference in iteration time: 33s and, on average, 38s per iteration round for EBV and ssGBLUP, respectively. Although the increase in computing time was mainly due to extra iterations, some differences were apparent between methods. According to the time used per iteration round, w20 and w15 had the best convergence among ssGBLUP methods. Computationally, the ssGBLUP added very little extra computing time to solving the mixed model equations by the PCG method. The only significant extra computations in the single-step method were due to the construction of the H22 matrix, which was done once before applying the PCG method. Presumably, the inclusion of genomic data affects convergence because the variance structure of genotyped animals in the G matrix is less diagonally dominant than with the pedigree-based relationship matrix A only. It has been shown that slight changes in scaling of G and A22 can affect convergence without a negative effect on validation accuracy (
      • Meuwissen T.H.E.
      • Luan T.
      • Woolliams J.A.
      The unified approach to the use of genomic and pedigree information in genomic evaluations revisited.
      ;
      • Vitezica Z.G.
      • Aguilar I.
      • Misztal I.
      • Legarra A.
      Bias in genomic predictions for populations under selection.
      ;
      • Misztal I.
      • Tsuruta S.
      • Aguilar I.
      • Legarra A.
      • VanRaden P.M.
      • Lawlor T.J.
      Methods to approximate reliabilities in single-step genomic evaluation.
      ). Generally, ssGBLUP with w15 or τ1.0ω0.5 needed the fewest iteration rounds to achieve convergence. Also, ω = 0.5 seemed to give better convergence than ω = 0.6 or ω = 1.0. This is in agreement with
      • Misztal I.
      • Tsuruta S.
      • Aguilar I.
      • Legarra A.
      • VanRaden P.M.
      • Lawlor T.J.
      Methods to approximate reliabilities in single-step genomic evaluation.
      , who noted that larger ω causes H to be less positive definite, leading to slower convergence or even divergence when mixed model equations are solved by the iterative method, whereas smaller ω tend to give better convergence.
      We also observed poor convergence with some Gw matrices.
      • Christensen O.F.
      • Madsen P.
      • Nielsen B.
      • Ostersen T.
      • Su G.
      Single-step methods for genomic evaluation in pigs.
      presented a method to scale and adjust the genomic relationship matrix. The proposed method calculates adjusted genomic relationship matrix G* = a Gw + b 11′, where 1 is a vector of ones and the constants a and b are such that in G*, the average of diagonals and average of off-diagonals equal to those in the pedigree-based relationship matrix. We tested G* in ssGBLUP using our data, but did not get solutions for further examination because the PCG algorithm showed poor convergence and failed to converge within 5,000 iterations. The adjustment constants for our data set were a = 0.968 and b = 0.0323; that is, the Gw matrix was inflated by 3% after which all values were increased by a constant b. When the diagonal values of Gw are <1, the G* matrix is diagonally less dominant than Gw, which means increased correlations between animals and possibly poorer convergence of the PCG algorithm. There are other ways to correct genetic differences among genotyped and nongenotyped individuals (
      • Vitezica Z.G.
      • Aguilar I.
      • Misztal I.
      • Legarra A.
      Bias in genomic predictions for populations under selection.
      ;
      • Lourenco D.A.L.
      • Misztal I.
      • Tsuruta S.
      • Aguilar I.
      • Lawlor T.J.
      • Forni S.
      • Weller J.I.
      Are evaluations on young genotyped animals benefiting from the past generations?.
      ).
      Table 1 presents standard deviations of EBV and GEBV for reference and candidate bulls and cows in the full and reduced data sets. Standard deviations were derived within birth year and pooled across years to account for selection. For the reference animals, standard deviations of the GEBV closely followed standard deviations of the standardized EBV, although overall, some variation existed in the standard deviations of bulls and cows among different methods in building the H22 matrix. By using different w, standard deviations of GEBV were close to that of EBV, whereas when using different τ and ω, standard deviations of GEBV were somewhat lower than that of EBV. The reference bulls have a lot of progeny information and, there, changes in w did not affect reference animal GEBV much. In contrast, changes in τ and ω tended to have larger effects on reference bull standard deviations. In candidate bulls, standard deviations of GEBV increased with decreasing w; that is, increased genomic information increased standard deviations of GEBV. Overall, in the candidate animals, the standard deviations of GEBV were higher than that of PA due to added information from genotypes. Standard deviations of the GEBV varied also with different estimation methods. In contrast to reference animals, in candidate bulls and cows, the use of different w gave higher standard deviations than the use of τ and ω in building the H22 matrix. In conclusion, the effect of changing τ and ω was an important one that affected standard deviations of both candidate and reference animals when changes in w affected to a larger degree candidate animals.
      Table 1Standard deviations of EBV in the full data set and the reduced data (EBVr; for candidates, this is parent average, PA), and genomic enhanced breeding values with different single-step genomic BLUP (ssGBLUP) methods for the reference and candidate bulls and cows
      EBV are expressed as standardized EBV with SD of 10 units for all bulls and cows born between the years 1990 and 2012. Standard deviations were calculated within birth year and pooled across years.
      Reference
      w=proportion of polygenic variance; τ=weight for G−1; ωxx = weight for A22−1 matrix.
      MilkProteinFat
      BullsCowsBullsCowsBullsCows
      No. of animals4,442270
       EBV11.118.8310.517.9711.5210.27
       EBVr10.939.4210.399.2111.4011.71
      ssGBLUP
       w2011.088.8910.558.7111.6310.90
       w1511.088.8110.548.6211.6310.80
       w1011.088.7410.548.5211.6210.70
       τ1.0ω0.510.577.799.997.6211.089.61
       τ1.6ω0.510.347.409.767.1610.858.95
       τ1.5ω0.610.447.569.877.3110.968.17
       τ1.6ω1.010.798.1210.247.7711.349.72
      Candidate
       No. of animals7077,113
       EBV10.859.1910.129.0210.079.87
       PA7.016.516.906.497.767.83
      ssGBLUP
       w208.938.928.908.869.5010.52
       w159.069.139.039.059.6110.69
       w109.209.379.179.269.7310.87
       τ1.0ω0.57.257.197.116.837.798.38
       τ1.6ω0.57.267.267.076.817.668.24
       τ1.5ω0.67.447.427.277.017.868.48
       τ1.6ω1.08.738.878.628.609.1010.09
      1 EBV are expressed as standardized EBV with SD of 10 units for all bulls and cows born between the years 1990 and 2012. Standard deviations were calculated within birth year and pooled across years.
      2 w = proportion of polygenic variance; τ = weight for G−1; ωxx = weight for A22−1 matrix.
      The correlations between GEBV from different methods for candidate animals are presented in Table 2. The correlations for reference bulls were all near to 1 (varying from 0.998 to 0.999). Thus, for proven bulls, different methods seem not to affect the GEBV greatly. For candidate bulls and cows, the correlations among different methods differed considerably, varying from 0.952 to 0.999 for bulls and from 0.933 to 0.999 for cows. There, the differences appeared between methods using w and methods using τ and ω. The correlations between the different combinations of τ and ω increased as the amount of the additive genetic variance attributed to the genomic markers increased.
      Table 2Correlations (above diagonal = candidate bulls; below diagonal = candidate cows) among genomic enhanced breeding values from different methods in building genomic relationship matrices
      Method
      w=proportion of polygenic variance; τ=weight for G−1; ωxx = weight for A22−1 matrix.
      Single-step genomic BLUP
      w20w15w10τ1.0ω0.5τ1.6ω0.5τ1.5ω0.6τ1.6ω1.0
      w200.9990.9960.9570.9520.9610.985
      w150.9980.9990.9570.9540.9630.989
      w100.9920.9970.9570.9560.9650.992
      τ1.0ω0.50.9330.9400.9470.9960.9990.966
      τ1.6ω0.50.9330.9420.9500.9960.9990.974
      τ1.5ω0.60.9440.9520.9600.9960.9990.980
      τ1.6ω1.00.9820.9890.9930.9570.9670.975
      1 w = proportion of polygenic variance; τ = weight for G−1; ωxx = weight for A22−1 matrix.
      The model validation results for bulls are in Table 3 and for cows in Table 4. The tables present regression coefficients (b1) and validation reliabilities (R2) with 95% bootstrap CI with 10,000 bootstrap resampling. For the bulls, validation reliabilities from the ssGBLUP with different H22 matrix block varied between 0.48 and 0.49 for milk, between 0.39 and 0.40 for protein, and between 0.41 and 0.44 for fat. The PA based on the same data but without genomic information gave, on average, 14-percentage-units lower reliability for milk, protein, and fat. For bulls, the best choice of parameters for milk was τ1.5ω0.6, and for protein and fat was τ1.6ω0.5. The lower ω for protein and fat means that more weight should be placed on genomic relationships for protein and fat relative to milk. For cows, validation reliabilities with different genomic matrix varied between 0.41 and 0.42 for milk, 0.31 and 0.33 for protein, and 0.32 and 0.34 for fat. Cow PA gave on average 12.6-percentage-unit lower reliabilities than GEBV. For cows, the best choice of parameters for milk and protein was τ1.5ω0.6, and for fat, the best choice was τ1.6ω0.5. Thus, protein has different results in cows compared with bulls. However, for cows, validation reliabilities can also differ because they have fewer close relatives in the reference population
      Table 3Bull validation results from different single-step genomic BLUP (ssGBLUP) methods, showing regression coefficients (b1), validation reliabilities (R2), and their 95% bootstrap confidence intervals (CI) from the parent average (PA) and genomic enhanced breeding values with different methods in ssGBLUP
      MethodMilkProteinFat
      b1 (CI)R2 (CI)b1 (CI)R2 (CI)b1 (CI)R2 (CI)
      PA0.97 (0.87−1.07)0.36 (0.30−0.42)0.79 (0.69−0.89)0.27 (0.20−0.34)0.68 (0.60−0.77)0.26 (0.20−0.33)
      ssGBLUP
      w=proportion of polygenic variance; τ=weight for G−1; ωxx = weight for A22−1 matrix.
       w200.87 (0.80−0.94)0.48 (0.42−0.54)0.73 (0.66−0.81)0.39 (0.33−0.46)0.72 (0.65−0.79)0.43 (0.36−0.50)
       w150.86 (0.79−0.93)0.48 (0.42−0.54)0.72 (0.65−0.80)0.39 (0.33−0.46)0.72 (0.65−0.79)0.43(0.36−0.50)
       w100.84 (0.78−0.91)0.48 (0.42−0.54)0.71 (0.64−0.79)0.39 (0.33−0.46)0.71 (0.64−0.78)0.43 (0.36−0.50)
       τ1.0ω0.51.09 (1.00−1.18)0.49 (0.43−0.55)0.92 (0.83−1.01)0.39 (0.33−0.46)0.87 (0.79−0.95)0.41 (0.35−0.48)
       τ1.6ω0.51.09 (1.00−1.18)0.49 (0.43−0.55)0.93 (0.84−1.02)0.40 (0.33−0.47)0.90 (0.82−0.98)0.43 (0.36−0.49)
       τ1.5ω0.61.06 (0.98−1.15)0.49 (0.43−0.55)0.91 (0.82−1.00)0.40 (0.34−0.47)0.88 (0.80−0.96)0.43 (0.37−0.50)
       τ1.6ω1.00.90 (0.83−0.97)0.49 (0.42−0.55)0.77 (0.69−0.84)0.40 (0.34−0.47)0.77 (0.70−0.84)0.44 (0.38−0.51)
      1 w = proportion of polygenic variance; τ = weight for G−1; ωxx = weight for A22−1 matrix.
      Table 4Cow validation results from different single-step genomic BLUP (ssGBLUP) methods, showing regression coefficients (b1), validation reliabilities (R2), and their 95% bootstrap confidence intervals (CI) from the parent average (PA), and genomic enhanced breeding values with different methods in ssGBLUP
      MethodMilkProteinFat
      b1 (CI)R2 (CI)b1 (CI)R2 (CI)b1 (CI)R2 (CI)
      PA1.10 (1.03−1.17)0.28 (0.24−0.31)0.93 (0.86−1.01)0.20 (0.17−0.24)0.82 (0.75−0.88)0.19 (0.16−0.21)
      ssGBLUP
      w=proportion of polygenic variance; τ=weight for G−1; ωxx = weight for A22−1 matrix.
       w200.89 (0.85−0.94)0.40 (0.36−0.44)0.78 (0.73−0.83)0.30 (0.26−0.33)0.75 (0.70−0.79)0.32 (0.28−0.35)
       w150.87 (0.82−0.91)0.40 (0.36−0.44)0.77 (0.72−0.81)0.30 (0.26−0.33)0.73 (0.69−0.77)0.32 (0.28−0.35)
       w100.84 (0.80−0.88)0.40 (0.36−0.44)0.74 (0.70−0.79)0.30 (0.26−0.33)0.71 (0.67−0.75)0.32 (0.27−0.35)
       τ1.0ω0.51.13 (1.07−1.18)0.42 (0.37−0.45)1.02 (0.96−1.08)0.31 (0.27−0.35)0.97 (0.91−1.03)0.32 (0.28−0.36)
       τ1.6ω0.51.14 (1.09−1.20)0.43 (0.38−0.46)1.05 (0.99−1.12)0.32 (0.28−0.35)0.99 (0.93−1.05)0.34 (0.29−0.37)
       τ1.5ω0.61.12 (1.06−1.17)0.43 (0.38−0.46)1.02 (0.96−1.08)0.32 (0.28−0.36)0.96 (0.91−1.020.34 (0.29−0.37)
       τ1.6ω1.00.93 (0.88−0.97)0.42 (0.38−0.45)0.83 (0.78−0.88)0.32 (0.27−0.35)0.79 (0.74−0.83)0.33 (0.29−0.36)
      1 w = proportion of polygenic variance; τ = weight for G−1; ωxx = weight for A22−1 matrix.
      Table 5 presents the root mean square errors (MSE) for the regressions. The MSE combine all criteria (inflation, trend, and accuracy), and the model with the lowest MSE should be the best one. Only small differences existed among MSE and they were mainly in line with the validation reliability results. However, according to MSE for bulls, the best parameter option for protein and fat would be τ1.6ω1.0, which contradicts the results from Table 3.
      Table 5Root mean square errors (MSE) from different validation regression models
      ModelMilkProteinFat
      Bull
       PA644.8120.9425.77
       w20589.4619.2623.09
       w15589.4619.2623.00
       w10589.4519.2622.99
       τ1.0ω0.5584.5019.2623.26
       τ1.6ω0.5584.5619.1923.03
       τ1.5ω0.6583.0219.1522.97
       τ1.6ω1.0585.8319.1322.74
      Cow
       PA866.9030.8535.23
       w20840.0430.1934.17
       w15839.9430.1034.16
       w10840.2430.2034.17
       τ1.0ω0.5836.9631.0034.09
       τ1.6ω0.5835.2030.0534.02
       τ1.5ω0.6834.5630.0434.01
       τ1.6ω1.0836.2830.0834.17
      1PA = parent average; w = proportion of polygenic variance; τ = weight for G−1; ωxx = weight for A22−1 matrix.
      In general, our results indicate that differences in validation reliabilities among different methods are small, especially for bulls. This is demonstrated in more detail with the pairwise bootstrap comparisons, where only significant differences appear between validation reliabilities from PA and different GEBV, and between some ssGBLUP methods in fat (Table 6). For cows, significant differences seem to arise in validation reliabilities between methods using different values of w, τ, and ω. Narrower CI could be achieved by increasing the number of validation animals. However, this would reduce the reference population size and could lead to lower validation reliability. Thus, significant differences in genomic prediction methods may be difficult to find when the number of genotyped animals is small. In addition, a small validation population has a large CI in validation reliability and this (random noise) may be one reason for different methods giving highest validation reliability, particularly in other studies where no confidence intervals have been given.
      Table 6Pairwise comparisons of validation reliabilities (R
      w=proportion of polygenic variance; τ=weight for G−1; ωxx = weight for A22−1 matrix.
      ) from different methods in single-step genomic BLUP (ssGBLUP) and parent average (PA)
      Letters indicate significant differences between compared methods. Differences in methods for bulls are shown above the diagonal and those for cows below the diagonal; M, P, F indicate milk, protein, and fat.
      MethodPAw20w15w10τ1.0ω0.5τ1.6ω0.5τ1.5ω0.6τ1.6ω1.0
      PAM P FM P FM P FM P FM P FM P FM P F
      w20M P FF
      w15M P FF
      w10M P FF
      τ1.0ω0.5M P FFFF
      τ1.6ω0.5M P FM P FM P FM P FM P FF
      τ1.5ω0.6M P FM P FM P FM P FM P FM
      τ1.6ω1.0M P FM P FM P FM P FM
      1 Letters indicate significant differences between compared methods. Differences in methods for bulls are shown above the diagonal and those for cows below the diagonal; M, P, F indicate milk, protein, and fat.
      2 w = proportion of polygenic variance; τ = weight for G−1; ωxx = weight for A22−1 matrix.
      The degree of inflation is indicated by the coefficient of regression (b1) of true genetic values on (G)EBV. Optimal prediction of genetic merit of young individuals should have a regression coefficient of 1. With b1 <1, the predictions are inflated, and the differences in estimated genetic merit of young individuals are exaggerated compared with their future performance. For bulls, the b1 values were almost always lower than the expected value, indicating that GEBV over-evaluate differences between bulls (Table 3). Changes in τ seemed to have only small effect on the b1 value, unlike changes in ω. Decreasing ω increases weight of pedigree relationship matrix such that GEBV will be more influenced by pedigree information. Decrease in ω increased b1 value. Only for milk and with τ1.6ω0.5, τ1.5ω0.5, or τ1.0ω0.5, the b1 values were >1. However, depending on the method used to build the H22 matrix for ssGBLUP, the over-dispersion was very similar to or even lower than that with PA. This suggests that GEBV by the single-step method are less biased than PA, but it is still essential to determine the best method to build the H22 matrix. The results were similar with cows, although in general, b1 values were higher for cows than for bulls, indicating a smaller bias (Table 4). Several studies have found better accuracies and lower biases of GEBV by fine-tuning w, τ, and ω when constructing the H matrix (
      • Misztal I.
      • Tsuruta S.
      • Aguilar I.
      • Legarra A.
      • VanRaden P.M.
      • Lawlor T.J.
      Methods to approximate reliabilities in single-step genomic evaluation.
      ). Moreover, validation reliability has been observed to be less sensitive to changes in w than accuracy (
      • Christensen O.F.
      • Madsen P.
      • Nielsen B.
      • Ostersen T.
      • Su G.
      Single-step methods for genomic evaluation in pigs.
      ).
      The genetic trends in milk (G)EBV for genotyped bulls are shown in Figure 1. For reference bulls, the genetic trends with different GEBV followed close to the EBV and EBVr trends, although there was a tendency for methods using w to give genetic trends that were closer to EBV compared with methods using a wider range of τ and ω. For the candidate bulls, the ssGBLUP with different w as well as PA seem to overestimate the trend, whereas methods using τ and ω seemed to underestimate the trend. Figure 1 indicates that putting more weight on genomic information is desirable. However, this must be balanced with the proper parameter choice for τ and ω to have the correct combination of genomic and pedigree relationships.
      Figure thumbnail gr1
      Figure 1Genetic trends for milk genomic enhanced breeding values (GEBV) and EBV of reference and candidate bulls from reduced data (i.e., EBVr) with different methods used in single-step genomic BLUP (ssGBLUP). For the candidate bulls, the EBVr is parent average (PA); EBV are from the full test-day model. Solid lines indicate reference bulls and dashed lines indicate candidate bulls. w = proportion of polygenic variance; τ = weight for G−1; ωxx = weight for A22−1 matrix. Color version available online.
      Comparison of different methods to build the H22 matrix for the ssGBLUP showed that alternative parameters could be chosen depending on whether the goal is to achieve higher validation reliabilities or smaller bias. In general, differences in validation reliabilities and their CI (Table 3 and Table 4) with the studied methods were smaller than effects to regression coefficient b1, which indicate bias. Therefore, it is essential to consider the whole picture before choosing which method to use. Moreover, different methods can be optimal for different traits. Differences in optimal weighting factors may be due to differences in genetic architecture. Use of a genomic relationship matrix that weights markers according to analyzed trait (e.g.,
      • VanRaden P.M.
      Efficient methods to compute genomic predictions.
      ;
      • Makgahlela M.
      • Knürr T.
      • Aamand G.P.
      • Strandén I.
      • Mäntysaari E.A.
      Single step evaluations using haplotype segments.
      ) may better account for differences in genetic architecture.
      Validation reliabilities for bulls from the current study were generally higher than the validation reliabilities obtained for RDC with genomic evaluation by multi-step approach and by sire model single-step method (
      • Koivula M.
      • Strandén I.
      • Su G.
      • Mäntysaari E.A.
      Different methods to calculate genomic predictions—Comparisons of SNP-BLUP, G-BLUP and H-BLUP.
      ;
      • Su G.
      • Madsen P.
      • Nielsen U.S.
      • Mäntysaari E.A.
      • Aamand G.P.
      • Christensen O.F.
      • Lund M.S.
      Genomic prediction for the Nordic Red Cattle using one-step and selection index blending approaches.
      ). This is partly due to the larger reference population. It also appears that we achieved better validation reliabilities by using phenotypic data in the ssGBLUP than by using deregressed breeding values (DRP) from either sire or animal model deregression (
      • Mäntysaari E.A.
      • Koivula M.
      • Strandén I.
      • Pösö J.
      • Aamand G.P.
      Estimation of GEBVs using deregressed individual cow breeding values.
      ;
      • Koivula M.
      • Strandén I.
      • Su G.
      • Mäntysaari E.A.
      Different methods to calculate genomic predictions—Comparisons of SNP-BLUP, G-BLUP and H-BLUP.
      ,
      • Koivula M.
      • Strandén I.
      • Pösö J.
      • Aamand G.P.
      • Mäntysaari E.A.
      Single step genomic evaluations for the Nordic Red Dairy cattle test day data.
      ,

      Koivula, M., I. Strandén, G. P. Aamand, and E. A. Mäntysaari. 2014. Effect of cow reference group on validation accuracy of genomic evaluation. Proc. 10th World Congr. Genet. Appl. Livest. Prod., Vancouver, Canada, Aug. 17–22, Comm. 083.

      ). Moreover, based on the regression coefficients b1, the GEBV from the phenotypic records seemed to be less inflated than the DGV from sire model or GEBV from animal model deregressions. Still, the models in the current study might fail the Interbull GEBV validation test. From bootstrapping with 10,000 samples, CI for the regression coefficients of the GEBV reached 1.0 with only some methods in milk and protein; for fat, the upper limits were always <1.0. The bootstrap CI are consistent with results in
      • Mäntysaari E.A.
      • Koivula M.
      GEBV validation test revisited.
      , in which we suggested that to reach a standard error of 0.049 for the estimate of b1 requires 443 bulls. However, the Interbull requirement for unbiasedness will accept b1 estimates between 0.90 and 1.20, even if the CI of b1 does not include 1. Thus, a b1 estimate of 0.9 is not considered biologically significantly biased. Therefore, a standard error of 0.05is an important limit for the power of the test, because with it the statistical and biological significance agree. Reversing the power consideration, we could suggest that the number of validation bulls should always be at least 500.
      The ssGBLUP has been used for several large-scale analyses, including in dairy cattle (
      • Aguilar I.
      • Misztal I.
      • Tsuruta S.
      • Wiggans G.R.
      • Lawlor T.J.
      Multiple trait genomic evaluation of conception rate in Holsteins.
      ;
      • Tsuruta S.
      • Misztal I.
      • Aguilar I.
      • Lawlor T.J.
      Multiple-trait genomic evaluation of linear type traits using genomic and phenotypic data in US Holsteins.
      ;
      • VanRaden P.M.
      Avoiding bias from genomic pre-selection in converting daughter information across countries.
      ), pigs (
      • Forni S.
      • Aguilar I.
      • Misztal I.
      Different genomic relationship matrices for single-step analysis using phenotypic, pedigree and genomic information.
      ;
      • Christensen O.F.
      • Madsen P.
      • Nielsen B.
      • Ostersen T.
      • Su G.
      Single-step methods for genomic evaluation in pigs.
      ), and broiler chickens (
      • Chen C.Y.
      • Misztal I.
      • Aguilar I.
      • Tsuruta S.
      • Meuwissen T.H.E.
      • Aggrey S.E.
      • Wing T.
      • Muir W.M.
      Genome-wide marker-assisted selection combining all pedigree phenotypic information and genotypic data in one step: An example using broiler chickens.
      ). Experiences from these studies have indicated that ssGBLUP gives as high or higher validation reliability than multi-step methods, and the inflation of GEBV is generally smaller. Another reason for using ssGBLUP is the ability to account for selection bias when selection is based on genotypes only (
      • VanRaden P.M.
      Avoiding bias from genomic pre-selection in converting daughter information across countries.
      ).
      The current study shows that ssGBLUP is easy to implement in existing national evaluation models. In this way, phenotypic records are combined directly with genomic information and, thus, resulting GEBV directly combine both sources of information. Moreover, in TD ssGBLUP, genomic information can be accounted for in estimation of environmental effects. Additional computational costs in the single-step approach may be lower than in multiple-step genomic evaluations. In our study, the number of genotyped bulls was relatively low, and the G matrix was easy to invert. For a large number of genotyped animals, algorithms have been proposed that overcome the need of G inverse, for example, by
      • Ducrocq V.
      • Legarra A.
      An iterative implementation of the single step approach for genomic evaluation which preserves existing genetic evaluation models and software.
      ,
      • Legarra A.
      • Ducrocq V.
      Computational strategies for national integration of phenotypic, genomic, and pedigree data in a single-step best linear unbiased prediction.
      , and

      Liu, Z., M. Goddard, F. Reinhardt, and R. Reents. 2013. Computing strategies for a single step SNP model with an across country reference population. No. 19:452 in Book of Abstracts: 64th Annu. Mtg. EAAP, Nantes, France. EAAP, Rome, Italy.

      . This could also make ssGBLUP computationally more appealing. However, it may also cause convergence problems in iterative methods (

      Strandén, I., and E. A. Mäntysaari. 2014. Comparison of some equivalent equations to solve single-step GBLUP. Proc. 10th World Congr. Genet. Appl. Livest. Prod., Vancouver, Canada, Aug. 17–22, Comm. 069.

      ).

      Conclusions

      Our results show that the use of phenotypic test-day records in single step analysis is feasible. The ssGBLUP provides a good alternative to the current multi-step approach used for Nordic RDC. The ssGBLUP is easy to implement and it gives results comparable to those of the original models. Moreover, the use of phenotypic records gave higher validation reliabilities compared with earlier validations that used sire model or animal model deregressions. However, it is essential to find the optimal way to combine G and A matrixes to minimize bias and maximize reliability of GEBV.

      Acknowledgments

      This work was a part of the Genomic Selection project originally established by Aarhus University and the Nordic cattle breeding organizations Viking Genetics (Randers, Denmark), Nordic Cattle Genetic Evaluation (Aarhus, Denmark), and Faba (Hollola, Finland). They are acknowledged for providing the genotype and test-day data.

      References

        • Aguilar I.
        • Misztal I.
        • Johnson D.L.
        • Legarra A.
        • Tsuruta S.
        • Lawlor T.J.
        Hot topic: A unified approach to utilize phenotypic, full pedigree, and genomic information for genetic evaluation of Holstein final score.
        J. Dairy Sci. 2010; 93: 743-752
        • Aguilar I.
        • Misztal I.
        • Tsuruta S.
        • Wiggans G.R.
        • Lawlor T.J.
        Multiple trait genomic evaluation of conception rate in Holsteins.
        J. Dairy Sci. 2011; 94: 2621-2624
        • Chen C.Y.
        • Misztal I.
        • Aguilar I.
        • Tsuruta S.
        • Meuwissen T.H.E.
        • Aggrey S.E.
        • Wing T.
        • Muir W.M.
        Genome-wide marker-assisted selection combining all pedigree phenotypic information and genotypic data in one step: An example using broiler chickens.
        J. Anim. Sci. 2011; 89: 23-28
        • Christensen O.F.
        • Lund M.S.
        Genomic prediction when some animals are not genotyped.
        Genet. Sel. Evol. 2010; 42: 2
        • Christensen O.F.
        • Madsen P.
        • Nielsen B.
        • Ostersen T.
        • Su G.
        Single-step methods for genomic evaluation in pigs.
        Animal. 2012; 6: 1565-1571
        • Ducrocq V.
        • Legarra A.
        An iterative implementation of the single step approach for genomic evaluation which preserves existing genetic evaluation models and software.
        Interbull Bull. 2011; 44: 138-142
        • Forni S.
        • Aguilar I.
        • Misztal I.
        Different genomic relationship matrices for single-step analysis using phenotypic, pedigree and genomic information.
        Genet. Sel. Evol. 2011; 43: 1
        • Hayes B.J.
        • Bowman P.J.
        • Chamberlain A.J.
        • Goddard M.E.
        Invited review Genomic selection in dairy cattle: Progress and challenges.
        J. Dairy Sci. 2009; 92: 433-443
      1. Koivula, M., I. Strandén, G. P. Aamand, and E. A. Mäntysaari. 2014. Effect of cow reference group on validation accuracy of genomic evaluation. Proc. 10th World Congr. Genet. Appl. Livest. Prod., Vancouver, Canada, Aug. 17–22, Comm. 083.

        • Koivula M.
        • Strandén I.
        • Pösö J.
        • Aamand G.P.
        • Mäntysaari E.A.
        Single step genomic evaluations for the Nordic Red Dairy cattle test day data.
        Interbull Bull. 2012; 46 (b): 115-120
        • Koivula M.
        • Strandén I.
        • Su G.
        • Mäntysaari E.A.
        Different methods to calculate genomic predictions—Comparisons of SNP-BLUP, G-BLUP and H-BLUP.
        J. Dairy Sci. 2012; 95 (a): 4065-4073
        • Legarra A.
        • Aguilar I.
        • Misztal I.
        A relationship matrix including full pedigree and genomic information.
        J. Dairy Sci. 2009; 92: 4656-4663
        • Legarra A.
        • Ducrocq V.
        Computational strategies for national integration of phenotypic, genomic, and pedigree data in a single-step best linear unbiased prediction.
        J. Dairy Sci. 2012; 95: 4629-4645
        • Lidauer M.H.
        • Pösö J.
        • Pederson J.
        • Lassen J.
        • Madsen P.
        • Mäntysaari E.A.
        • Nielsen U.
        • Eriksson J.-Å.
        • Johansson K.
        • Pitkänen T.
        • Strandén I.
        • Aamand G.P.
        Across-country test-day model evaluations for Nordic Holstein, Red Cattle and Jersey.
        J. Dairy Sci. 2015; 98 (10.3168/jds.2014-8307): 1296-1309
      2. Liu, Z., M. Goddard, F. Reinhardt, and R. Reents. 2013. Computing strategies for a single step SNP model with an across country reference population. No. 19:452 in Book of Abstracts: 64th Annu. Mtg. EAAP, Nantes, France. EAAP, Rome, Italy.

        • Lourenco D.A.L.
        • Misztal I.
        • Tsuruta S.
        • Aguilar I.
        • Lawlor T.J.
        • Forni S.
        • Weller J.I.
        Are evaluations on young genotyped animals benefiting from the past generations?.
        J. Dairy Sci. 2014; 97: 3930-3942
        • Makgahlela M.
        • Knürr T.
        • Aamand G.P.
        • Strandén I.
        • Mäntysaari E.A.
        Single step evaluations using haplotype segments.
        Interbull Bull. 2013; 47: 217-221
        • Mäntysaari E.
        • Liu Z.
        • VanRaden P.
        Interbull validation test for genomic evaluations.
        Interbull Bull. 2010; 41: 17-24
        • Mäntysaari E.A.
        • Koivula M.
        GEBV validation test revisited.
        Interbull Bull. 2012; 45: 11-16
        • Mäntysaari E.A.
        • Koivula M.
        • Strandén I.
        • Pösö J.
        • Aamand G.P.
        Estimation of GEBVs using deregressed individual cow breeding values.
        Interbull Bull. 2011; 44: 19-24
        • Meuwissen T.H.E.
        • Luan T.
        • Woolliams J.A.
        The unified approach to the use of genomic and pedigree information in genomic evaluations revisited.
        J. Anim. Breed. Genet. 2011; 128: 429-439
        • Misztal I.
        • Aguilar I.
        • Legarra A.
        • Lawlor T.J.
        Choice of parameters for single-step genomic evaluation for type.
        J. Dairy Sci. 2010; 93 (Abstr.): 533
        • Misztal I.
        • Legarra A.
        • Aguilar I.
        Computing procedures for genetic evaluation including phenotypic, full pedigree, and genomic information.
        J. Dairy Sci. 2009; 92: 4648-4655
        • Misztal I.
        • Tsuruta S.
        • Aguilar I.
        • Legarra A.
        • VanRaden P.M.
        • Lawlor T.J.
        Methods to approximate reliabilities in single-step genomic evaluation.
        J. Dairy Sci. 2013; 96: 647-654
      3. R Core Development Team. 2012. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. http://www.R-project.org/

        • Strandén I.
        • Lidauer M.
        Solving large mixed models using preconditioned conjugate gradient iteration.
        J. Dairy Sci. 1999; 82: 2779-2787
        • Strandén I.
        • Lidauer M.
        • Mäntysaari E.A.
        • Pösö J.
        Calculation of Interbull weighting factors for the Finnish test day model.
        Interbull Bull. 2001; 26: 78-81
        • Strandén I.
        • Mäntysaari E.A.
        A recipe for multiple trait deregression.
        Interbull Bull. 2010; 42: 21-24
      4. Strandén, I., and E. A. Mäntysaari. 2014. Comparison of some equivalent equations to solve single-step GBLUP. Proc. 10th World Congr. Genet. Appl. Livest. Prod., Vancouver, Canada, Aug. 17–22, Comm. 069.

        • Su G.
        • Madsen P.
        • Nielsen U.S.
        • Mäntysaari E.A.
        • Aamand G.P.
        • Christensen O.F.
        • Lund M.S.
        Genomic prediction for the Nordic Red Cattle using one-step and selection index blending approaches.
        J. Dairy Sci. 2012; 95: 909-917
        • Taskinen M.
        • Mäntysaari E.
        • Lidauer M.
        • Knürr T.
        • Pösö J.
        • Su G.
        • Aamand G.P.
        • Strandén I.
        Comparison of model reliabilities from single-step and bivariate blending methods.
        Interbull Bull. 2013; 47: 246-251
        • Tsuruta S.
        • Misztal I.
        • Aguilar I.
        • Lawlor T.J.
        Multiple-trait genomic evaluation of linear type traits using genomic and phenotypic data in US Holsteins.
        J. Dairy Sci. 2011; 94: 4198-4204
        • VanRaden P.M.
        Efficient methods to compute genomic predictions.
        J. Dairy Sci. 2008; 91: 4414-4423
        • VanRaden P.M.
        Avoiding bias from genomic pre-selection in converting daughter information across countries.
        Interbull Bull. 2012; 45: 1-5
        • VanRaden P.M.
        • Van Tassell C.P.
        • Wiggans G.R.
        • Sonstegard T.S.
        • Schnabel R.D.
        Invited review: Reliability of genomic predictions for North American Holstein bulls.
        J. Dairy Sci. 2009; 92: 16-24
        • Vitezica Z.G.
        • Aguilar I.
        • Misztal I.
        • Legarra A.
        Bias in genomic predictions for populations under selection.
        Genet. Res. (Camb.). 2011; 93: 357-366