ABSTRACT
Key words
INTRODUCTION
Hayes, B. J., I. M. Macleod, H. D. Daetwyler, P. J. Bowman, A. J. Chamberlian, C. J. Vander Jagt, A. Capitan, H. Pausch, P. Stothard, X. Liao, C. Schrooten, E. Mullaart, R. Fries, B. Guldbrandtsen, M. S. Lund, D. A. Boichard, R. F. Veerkamp, C. P. Vantassell, B. Gredler, T. Fruet, A. Bagnato, J. Vilkki, D. J. deKoning, E. Santus, and M. E. Goddard. 2014. Genomic Prediction from Whole Genome Sequence in Livestock: The 1000 Bull Genomes Project. Proceedings of the 10th World Congress of Genetics Applied to Livestock Production, Vancouver, Canada, 2014.
Moghaddar, N., I. M. Macleod, N. Duijvesteijn, S. Bolormaa, M. Khansefid, A. A. Swan, H. D. Daetwyler, and J. H. J. van der Werf. 2018. Genomic evaluation based on selected variants from imputed whole-genome sequence data in Australian sheep populations. Proceedings of the World Congress on Genetics Applied to Livestock Production, Auckland, New Zealand.
Hayes, B. J., I. M. Macleod, H. D. Daetwyler, P. J. Bowman, A. J. Chamberlian, C. J. Vander Jagt, A. Capitan, H. Pausch, P. Stothard, X. Liao, C. Schrooten, E. Mullaart, R. Fries, B. Guldbrandtsen, M. S. Lund, D. A. Boichard, R. F. Veerkamp, C. P. Vantassell, B. Gredler, T. Fruet, A. Bagnato, J. Vilkki, D. J. deKoning, E. Santus, and M. E. Goddard. 2014. Genomic Prediction from Whole Genome Sequence in Livestock: The 1000 Bull Genomes Project. Proceedings of the 10th World Congress of Genetics Applied to Livestock Production, Vancouver, Canada, 2014.
- Larroque H.
- Astruc J.M.
- Barbat A.
- Barillet F.
- Boichard D.
- Bonaïti B.
- Clément V.
- David I.
- Lagriffoul G.
- Palhière I.
- Piacère A.
- Robert-Granié C.
- Rupp R.
- Tosser-Klopp G.
- Bardou P.
- Bouchez O.
- Cabau C.
- Crooijmans R.
- Dong Y.
- Donnadieu-Tonon C.
- Eggen A.
- Heuven H.C.M.
- Jamli S.
- Jiken A.J.
- Klopp C.
- Lawley C.T.
- McEwan J.
- Martin P.
- Moreno C.R.
- Mulsant P.
- Nabihoudine I.
- Pailhoux E.
- Palhière I.
- Rupp R.
- Sarry J.
- Sayre B.L.
- Tircazes A.
- Jun Wang W.
- Wang
- Zhang W.
MATERIALS AND METHODS
Animals, Phenotypes, and 50K Genotypes
- Larroque H.
- Astruc J.M.
- Barbat A.
- Barillet F.
- Boichard D.
- Bonaïti B.
- Clément V.
- David I.
- Lagriffoul G.
- Palhière I.
- Piacère A.
- Robert-Granié C.
- Rupp R.
Trait | Number of lactations | Number of females with phenotypes | Minimum | Mean | SD | Maximum |
---|---|---|---|---|---|---|
MY (kg) | 3,470,255 | 1,242,020 | 34.51 | 837.05 | 266.03 | 2,615.04 |
PY (kg) | 3,470,255 | 1,274,581 | 1.27 | 25.05 | 8.08 | 84.64 |
FY (kg) | 3,470,255 | 1,271,383 | 1.09 | 28.06 | 10.01 | 111.92 |
LSCS | 1,449,698 | 705,753 | −0.58 | 8.82 | 1.34 | 13.57 |
FU | — | 160,086 | 1 | 3.29 | 1.16 | 9 |
TO | — | 160,086 | 1 | 4.03 | 0.86 | 9 |
UFP | — | 160,086 | 1 | 6.23 | 1.14 | 9 |
UP | — | 160,086 | 1 | 6.28 | 1.34 | 9 |
RUA | — | 160,086 | 1 | 4.83 | 1.67 | 9 |
Sequence-Derived Information
Quality Check of Sequence Data and Imputation.
Illumina GoatSNP50 BeadChip Update.
Evaluation Methods
Single-Step GBLUP.
where y is the vector of performances for the studied trait, β is the vector of fixed effects defined as in the official genomic evaluations. For production traits and LSCS, fixed effects were defined for each year and parity and were: herd, age at kidding × 4 geographic regions, month of kidding × 4 regions, length of dry period × 4 regions; u is a vector of random additive genetic effects assumed to be normally distributed , where is the variance of u; p is the vector of random permanent environmental effects assumed to be normally distributed , where I is the identity matrix; e is a vector of random residuals also normally distributed ; X is the incidence matrix relating phenotypes and the fixed effects; Z and W are the design matrices linking phenotypes to genetic and permanent environmental effects, respectively. The H matrix is the genetic relationship matrix that integrates both genotype and pedigree information implemented as in
where A is a pedigree-based relationship matrix with indices 1 for ungenotyped animals and 2 for genotyped animals, and G is the genomic relationship matrix derived as in
where m is the number of variants, pi is the estimated allele frequency at the locus i, and M is a centered matrix of variant genotypes.
where y, u, and e are the same as previously stated; β includes 3 combined fixed effects: herd × parity × year, age at scoring × year, and days in milk at scoring × year. Variance components were estimated by using the restricted maximum likelihood method in the remlf90 software (
Weighted Single-Step GBLUP.
where ȃ is a vector of variant effects, D is a diagonal matrix of weights (set to 1 in ssGBLUP), M is the centered matrix of variant genotypes, and Ûg is the vector of GEBV from genotyped animals only. The additive variances of the effect of variant i were estimated as
where pi is the allele frequency of variant i. The vector of variances of SNP effects was normalized (the normalization process ensured that the sum of the variances remained constant and was equal to the number of SNP) and used as weights in matrix D to construct the weighted matrix G (G*) as described by
The GEBV were estimated again with models [1a] and [1b] by considering weights for each SNP via the G* matrix included in the H matrix. This process was carried out iteratively with weights estimated at each iteration as described by
- 1.Iteration 1, initialization, D(1) = I; G* = 0.95 × λZD(1)Z' + 0.05 × A22.
- 2.Calculation of G* following the previous formula to obtain the EBV vector Ûg.
- 3.Iteration 2 (it).
- 4.Estimation of variant effects
- 5.Conversion of effects into weights following the formula: Weights are integrated into matrix .
- 6.Weights normalization tr s the trace of the D matrix.
- 7.Building .
- 8.Launch of a WssGBLUP using the newly calculated matrix to obtain new EBV.
- 9.Exit.
Weighted Single-Step GBLUP Using Windows
Tested Scenarios
Name of the scenario | Variants included in the scenario | Total number of variants | ||||
---|---|---|---|---|---|---|
50K markers | 178 SNP in chromosome 19 QTL region on the chip update | 178 randomly selected SNP on chromosome 19 | 539,476 sequence variants over the whole chromosome 19 | 22,269 sequence variants in the QTL region of chromosome 19 | ||
Refgeno_50k | x | 47,147 | ||||
1geno_50kv2QTL | x | x | 47,325 | |||
2geno_50kv2_random | x | x | 47,325 | |||
3geno_50kseqCHI19 | x | x | 586,623 | |||
4geno_50kseqQTL | x | x | 69,416 |
Single-Step GBLUP
Weighted Single-Step GBLUP
Accuracy of Genomic Predictions
RESULTS
Single-Step GBLUP


WssGBLUP
Trait | geno_50k | geno_50v2QTL | geno_50kseqQTL |
---|---|---|---|
FU | 0.60 | 0.62 | 0.60 |
TO | 0.48 | 0.48 | 0.48 |
UFP | 0.66 | 0.66 | 0.67 |
UP | 0.35 | 0.35 | 0.35 |
RUA | 0.63 | 0.64 | 0.63 |
LSCS | 0.42 | 0.43 | 0.42 |
MY | 0.54 | 0.57 | 0.56 |
FY | 0.47 | 0.50 | 0.50 |
PY | 0.50 | 0.53 | 0.53 |
WssGBLUPwindows
Item | geno_50k | geno_50kv2QTL | geno_50kseqQTL | |||
---|---|---|---|---|---|---|
2.4 Mb (≈40 SNP) | 40 SNP | 2.4Mb | 40 SNP | 2.4 Mb | 2.4Mb on CHI19 only | |
FU | 0.62 | 0.64 | 0.64 | 0.63 | 0.63 | 0.57 |
TO | 0.48 | 0.48 | 0.48 | 0.47 | 0.47 | 0.44 |
UFP | 0.67 | 0.69 | 0.68 | 0.68 | 0.67 | 0.66 |
UP | 0.38 | 0.38 | 0.38 | 0.37 | 0.37 | 0.34 |
RUA | 0.65 | 0.67 | 0.67 | 0.66 | 0.65 | 0.65 |
LSCS | 0.48 | 0.46 | 0.46 | 0.45 | 0.44 | 0.43 |
MY | 0.53 | 0.58 | 0.58 | 0.56 | 0.56 | 0.54 |
FY | 0.44 | 0.50 | 0.50 | 0.50 | 0.51 | 0.50 |
PY | 0.49 | 0.54 | 0.54 | 0.52 | 0.52 | 0.49 |
DISCUSSION
Variant Selection

Model Comparisons
Trait | Number of variants |
---|---|
FU | 12 |
UFP | 150 |
RUA | 52 |
LSCS | 0 |
MY | 104 |
FY | 37 |
PY | 62 |

- Larroque H.
- Astruc J.M.
- Barbat A.
- Barillet F.
- Boichard D.
- Bonaïti B.
- Clément V.
- David I.
- Lagriffoul G.
- Palhière I.
- Piacère A.
- Robert-Granié C.
- Rupp R.
CONCLUSIONS
ACKNOWLEDGMENTS
REFERENCES
- Accuracy of imputation to whole-genome sequence in sheep.Genet. Sel. Evol. 2019; 51 (30654735): 1
- Quantitative trait loci markers derived from whole genome sequence data increases the reliability of genomic prediction.J. Dairy Sci. 2015; 98 (25892697): 4107-4116
- A first step toward genomic selection in the multi-breed French dairy goat population.J. Dairy Sci. 2013; 96 (24054303): 7294-7305
- Including αs1 casein gene information in genomic evaluations of French dairy goats.Genet. Sel. Evol. 2016; 48 (27491470): 54
- Genomic prediction when some animals are not genotyped.Genet. Sel. Evol. 2010; 42 (20105297): 2
- Elaboration of a total merit index combining dairy and udder type traits.Renc. Rech. Rumin. 2006; 1: 209-212
- Incorporation of causative quantitative trait nucleotides in single-step GBLUP.Genet. Sel. Evol. 2017; 49 (28747171): 59
- Alternative SNP weighting for single-step genomic best linear unbiased predictor evaluation of stature in US Holsteins in the presence of selected sequence variants.J. Dairy Sci. 2019; 102 (31495612): 10012-10019
Hayes, B. J., I. M. Macleod, H. D. Daetwyler, P. J. Bowman, A. J. Chamberlian, C. J. Vander Jagt, A. Capitan, H. Pausch, P. Stothard, X. Liao, C. Schrooten, E. Mullaart, R. Fries, B. Guldbrandtsen, M. S. Lund, D. A. Boichard, R. F. Veerkamp, C. P. Vantassell, B. Gredler, T. Fruet, A. Bagnato, J. Vilkki, D. J. deKoning, E. Santus, and M. E. Goddard. 2014. Genomic Prediction from Whole Genome Sequence in Livestock: The 1000 Bull Genomes Project. Proceedings of the 10th World Congress of Genetics Applied to Livestock Production, Vancouver, Canada, 2014.
- A phasing and imputation method for pedigreed populations that results in a single-stage genomic evaluation.Genet. Sel. Evol. 2012; 44 (22462519): 9
- National genetic evaluations in dairy sheep and goats in France.in: Proc. Annual Meeting of the European Federation of Animal Science (EAAP), Stavanger, Norway. Wageningen Academic Publishers, Wageningen, the Netherlands2011: 62
- A relationship matrix including full pedigree and genomic information.J. Dairy Sci. 2009; 92 (19700729): 4656-4663
- A genome scan for milk production traits in dairy goats reveals two new mutations in Dgat1 reducing milk fat content.Sci. Rep. 2017; 7 (28500343)1872
- Genome-wide association mapping for type and mammary health traits in French dairy goats identifies a pleiotropic region on chromosome 19 in the Saanen breed.J. Dairy Sci. 2018; 101 (29573797): 5214-5226
Misztal, I., S. Tsuruta, T. Strabel, B. Auvrey, T. Druet, and D. Lee. 2002. BLUPF90 and related programs. Proceedings of the 7th World Congress on Genetics Applied to Livestock Production, Montpellier, France.
Moghaddar, N., I. M. Macleod, N. Duijvesteijn, S. Bolormaa, M. Khansefid, A. A. Swan, H. D. Daetwyler, and J. H. J. van der Werf. 2018. Genomic evaluation based on selected variants from imputed whole-genome sequence data in Australian sheep populations. Proceedings of the World Congress on Genetics Applied to Livestock Production, Auckland, New Zealand.
- Genome-wide association study of conformation and milk yield in mixed-breed dairy goats.J. Dairy Sci. 2018; 101 (29290434): 2213-2225
- Sequence- vs. chip-assisted genomic selection: Accurate biological information is advised.Genet. Sel. Evol. 2015; 47 (25956961): 43
- PLINK: A tool set for whole-genome association and population-based linkage analyses.Am. J. Hum. Genet. 2007; 81 (17701901): 559-575
- A new approach for efficient genotype imputation using information from relatives.BMC Genomics. 2014; 15 (24935670): 478
- Genome wide association analysis on semen volume and milk yield using different strategies of imputation to whole genome sequence in French dairy goats.BMC Genet. 2020; 21 (32085723): 19
- Weighted single-step genomic BLUP improves accuracy of genomic breeding values for protein content in French dairy goats: A quantitative trait influenced by a major gene.Genet. Sel. Evol. 2018; 50 (29907084): 31
- Accuracy of genomic evaluation with weighted single-step genomic best linear unbiased prediction for milk production traits, udder type traits, and somatic cell scores in French dairy goats.J. Dairy Sci. 2019; 102 (30712939): 3142-3154
- Design and characterization of a 52K SNP chip for goats.PLoS One. 2014; 9 (24465974)e86227
- Selecting sequence variants to improve genomic predictions for dairy cattle.Genet. Sel. Evol. 2017; 49 (28270096): 32
- Derivation, calculation, and use of national animal model information.J. Dairy Sci. 1991; 74 (1918547): 2737-2746
- Genome-wide association mapping including phenotypes from relatives without genotypes in a single-step (ssGWAS) for 6-week body weight in broiler chickens.Front. Genet. 2014; 5 (24904635): 134
- Genome-wide association mapping including phenotypes from relatives without genotypes.Genet. Res. (Camb). 2012; 94 (22624567): 73-83
- The comparison of regression variables.J. R. Stat. Soc. B. 1959; 21: 396-399
- Weighting strategies for single-step genomic BLUP: An iterative approach for accurate calculation of GEBV and GWAS.Front. Genet. 2016; 7 (27594861): 151
Article info
Publication history
Identification
Copyright
User license
Creative Commons Attribution – NonCommercial – NoDerivs (CC BY-NC-ND 4.0) |
Permitted
For non-commercial purposes:
- Read, print & download
- Redistribute or republish the final article
- Text & data mine
- Translate the article (private use only, not for distribution)
- Reuse portions or extracts from the article in other works
Not Permitted
- Sell or re-use for commercial purposes
- Distribute translations or adaptations of the article
Elsevier's open access license policy