Research| Volume 101, ISSUE 11, P10022-10033, November 01, 2018

# Value of the Dutch Holstein Friesian germplasm collection to increase genetic variability and improve genetic merit

Open AccessPublished:September 12, 2018

## ABSTRACT

National gene bank collections for Holstein Friesian (HF) dairy cattle were set up in the 1990s. In this study, we assessed the value of bulls from the Dutch HF germplasm collection, also known as cryobank bulls, to increase genetic variability and improve genetic merit in the current bull population (bulls born in 2010–2015). Genetic variability was defined as 1 minus the mean genomic similarity (SIMSNP) or as 1 minus the mean pedigree-based kinship (fPED). Genetic merit was defined as the mean estimated breeding value for the total merit index or for 1 of 3 subindices (yield, fertility, and udder health). Using optimal contribution selection, we minimized relatedness (maximized variability) or maximized genetic merit at restricted levels of relatedness. We compared breeding schemes with only bulls from 2010 to 2015 with schemes in which cryobank bulls were also included. When we minimized relatedness, inclusion of genotyped cryobank bulls decreased mean SIMSNP by 0.7% and inclusion of both genotyped and nongenotyped cryobank bulls decreased mean fPED by 2.6% (in absolute terms). When we maximized merit at restricted levels of relatedness, inclusion of cryobank bulls provided additional merit at any level of mean SIMSNP or mean fPED except for the total merit index at high levels of mean SIMSNP. Additional merit from cryobank bulls depended on (1) the relative emphasis on genetic variability and (2) the selection criterion. Additional merit was higher when more emphasis was put on genetic variability. For fertility, for example, it was 1.74 SD at a mean SIMSNP restriction of 64.5% and 0.37 SD at a mean SIMSNP restriction of 67.5%. Additional merit was low to nonexistent for the total merit index and higher for the subindices, especially for fertility. At a mean SIMSNP of 64.5%, for example, it was 0.60 SD for the total merit index and 1.74 SD for fertility. In conclusion, Dutch HF cryobank bulls can be used to increase genetic variability and improve genetic merit in the current population, although their value is very limited when selecting for the current total merit index. Anticipating changes in the breeding goal in the future, the germplasm collection is a valuable resource for commercial breeding populations.

## INTRODUCTION

The Holstein Friesian (HF) breed is the dominating dairy cattle breed worldwide. Despite its census size of millions of individuals, the breed has an effective population size of 18 to 115 (
• Danchin-Burge C.
• Hiemstra S.J.
• Blackburn H.
Ex situ conservation of Holstein-Friesian cattle: Comparing the Dutch, French, and US germplasm collections.
;
• Rodríguez-Ramilo S.T.
• Fernández J.
• Toro M.A.
• Hernández D.
• Villanueva B.
Genome-wide estimates of coancestry, inbreeding and effective population size in the Spanish Holstein population.
;
• Doekes H.P.
• Veerkamp R.F.
• Bijma P.
• Hiemstra S.J.
• Windig J.J.
Trends in genome-wide and region-specific genetic diversity in the Dutch-Flemish Holstein-Friesian breeding program from 1986 to 2015.
). In the early 1990s, national HF gene bank collections were established to safeguard genetic variability (
• Danchin-Burge C.
• Hiemstra S.J.
• Blackburn H.
Ex situ conservation of Holstein-Friesian cattle: Comparing the Dutch, French, and US germplasm collections.
). Since then, ex situ in vitro conservation has been used as a complementary strategy to in situ in vivo management of genetic variability in the breed (
• FAO (Food and Agriculture Organization of the United Nations)
Part 3: The state of capacities. Section D: Conservation programmes.
).
In vitro conservation has several advantages and potential uses. One advantage is that the stored material harbors genetic variation of the population at the time of sampling, which may include variation that has since been lost in vivo due to selection and drift. Material from cryobank individuals, therefore, could be used to restore or increase genetic variability in the current live population (
• Leroy G.
• Danchin-Burge C.
• Verrier E.
Impact of the use of cryobank samples in a selected cattle breed: A simulation study.
;
• Eynard S.E.
• Windig J.J.
• Hulsegge I.
• Hiemstra S.J.
• Calus Mario P.L.
The impact of using old germplasm on genetic merit and diversity—A cattle breed case study.
). In an extreme scenario in which the live population becomes highly endangered or extinct (e.g., due to a disease outbreak), the stored material could also be used to re-establish the population. Other potential uses of gene bank collections include the management of inbreeding in small populations (
• Sonesson A.K.
• Goddard M.E.
• Meuwissen T.H.
The use of frozen semen to minimize inbreeding in small populations.
;
• Shepherd R.K.
• Woolliams J.A.
Minimising inbreeding in small populations by rotational mating with frozen semen.
), the documentation of genetic trends (
• Smith C.
Use of stored frozen semen and embryos to measure genetic trends in farm livestock.
;
• Garcıa M.
• Baselga M.
Estimation of genetic response to selection in litter size of rabbits using a cryopreserved control population.
), and the introgression of specific genetic variants into live populations (e.g., introgression of the polled allele).
Recently,
• Doekes H.P.
• Veerkamp R.F.
• Bijma P.
• Hiemstra S.J.
• Windig J.J.
Trends in genome-wide and region-specific genetic diversity in the Dutch-Flemish Holstein-Friesian breeding program from 1986 to 2015.
reported a decrease in genetic variability in the Dutch-Flemish HF breeding program from 1986 through 2015, with a particularly fast decrease since the introduction of genomic selection. This recent decrease suggests that old bulls from the Dutch HF germplasm collection could be used to increase variability in the current population.
A disadvantage of old cryobank bulls is that their genetic level, measured by EBV, is generally lower than that of recently born bulls. Consequently, the use of cryobank bulls to increase genetic variability in the current population is expected to reduce genetic merit. This hypothesis, however, does not necessarily hold for all traits because not all traits currently of interest have been continuously selected for in the past. For example, HF breeding goals consisted of mainly yield and conformation traits before 2000, whereas they now also include many traits related to health, reproduction, and longevity (
• Miglior F.
• Muir B.L.
• van Doormaal B.J.
Selection indices in Holstein cattle of various countries.
;
• Cole J.B.
Symposium review: Possibilities in an age of genomics: The future of selection indices.
;
• Doekes H.P.
• Veerkamp R.F.
• Bijma P.
• Hiemstra S.J.
• Windig J.J.
Trends in genome-wide and region-specific genetic diversity in the Dutch-Flemish Holstein-Friesian breeding program from 1986 to 2015.
). When the focus of the current breeding goal would shift toward one of the latter traits, cryobank bulls might have value for the population in terms of both genetic variability and genetic merit.
The aim of this study was to assess the value of the Dutch HF germplasm collection to increase genetic variability and improve genetic merit in the current bull population. We considered 3 scenarios: (1) maximizing genetic variability, (2) maximizing genetic merit for the total merit index while maintaining variability, and (3) maximizing genetic merit for a subindex (yield, fertility, or udder health) while maintaining variability. In addition to an SNP-based assessment, we performed a pedigree-based evaluation to include bulls from the germplasm collection that had no genotype data.

## MATERIALS AND METHODS

### Germplasm Collection, Groups, and Data

The Dutch HF germplasm collection was set up in 1993 and is now managed by the Centre for Genetic Resources the Netherlands of Wageningen University and Research (Wageningen, the Netherlands). The collection consists of 5,457 HF bulls (fraction HF >87.5%, either red or black). The majority of these bulls are from progeny testing schemes of 2 commercial companies: the Dutch-Flemish cattle improvement cooperative (CRV; Arnhem, the Netherlands) and Alta Genetics (Feerwerd, the Netherlands).
In this study, we used 5,783 HF bulls (both cryobank and noncryobank). To assess the additional value of the germplasm collection to the current bull population, we defined 4 groups: very young bulls (VYB; n = 212), young bulls (YB; n = 762), cryobank bulls with genotype data (CBG; n = 2,888), and cryobank bulls with only pedigree data (CBP; n = 1,921). The VYB consisted of bulls born in 2014 to 2015 with EBV based on only genomic and parental information. The YB consisted of bulls born in 2010 to 2013 with EBV based on genomic and parental information (23% of bulls) or on genomic, parental, and daughter information (77% of bulls). The mean reliability of EBV was about 60% in the VYB and about 80% in the YB. Of the VYB and YB, 68 bulls (32%) and 551 bulls (72%), respectively, were also stored in the germplasm collection. The CBG consisted of genotyped cryobank bulls born in 1985 to 2009. The CBP consisted of cryobank bulls born in 1978 to 2015 that had pedigree but no genotype data. Figure 1 shows the number of bulls by group and year of birth.
Pedigree and genotype data were provided by CRV. The total pedigree comprised 429,981 individuals. Pedigree completeness per bull was assessed with the number of ancestral generations completely known (NCG) and the complete generation equivalent (CGE). We computed the CGE for each bull as the sum of (1/2)n over all its known ancestors, with n being the generation number of a given ancestor. Bulls with an NCG lower than 2 were excluded from the analyses (n = 29; these bulls had no genotype data and were excluded from the abovementioned group sizes). Genotyping was performed with the Illumina (San Diego, CA) BovineSNP50 BeadChip (versions v1 and v2) or CRV custom-made 60k Illumina panel (versions v1 and v2). Genotypes were imputed to 76k from the different panels following
• Druet T.
• Georges M.
A hidden Markov model combining linkage and linkage disequilibrium information for haplotype reconstruction and quantitative trait locus fine mapping.
. Prior to imputation, SNP with a call rate lower than 0.85, a minor allele frequency lower than 0.025, or a difference of more than 0.15 between observed and expected heterozygosity were discarded. We also discarded SNP with unknown position on the Btau4.0 genome assembly. After quality control and imputation, the data set consisted of 75,538 autosomal SNP per bull.

### Measures of Genetic Variability

Genetic variability was defined as 1 minus the mean relatedness in a population. We considered 2 measures of relatedness: the genomic similarity (SIMSNP) and the pedigree-based kinship (fPED).
The $SIMSNPij$ was defined as the probability that 2 alleles at a random SNP, 1 sampled from bull i and 1 from bull j, were identical by state (
• Malécot G.
Mathématiques de l'Hérédité.
). To calculate $SIMSNPij$, we first computed a genomic relationship matrix for all genotyped individuals with allele frequencies fixed to 0.5 using calc_grm (
• Calus M.P.L.
• Vandenplas J.
Calc_grm—A programme to Compute Pedigree, Genomic, and Combined Relationship Matrices.
). We then scaled the obtained relationships (Gij) to genomic similarities according to $SIMSNPij=Gij+24$ (for derivation, see Additional File 1 of
• Eynard S.E.
• Windig J.J.
• Leroy G.
• Van Binsbergen R.
• Calus M.P.
The effect of rare alleles on estimated genomic relationships from whole genome sequence data.
). Note that the mean SIMSNP in a population is equal to 1 minus the expected heterozygosity.
The $fPEDij$ was defined as the pedigree-based probability that 2 alleles at a random (imaginary) selection-free locus, 1 sampled from bull i and 1 from bull j, were identical by descent with reference to a base population (
• Falconer D.S.
• Mackay T.F.C.
Introduction to Quantitative Genetics.
). Founders in the pedigree were considered as base population. We first computed an additive genetic relationship matrix for all bulls with calc_grm (
• Calus M.P.L.
• Vandenplas J.
Calc_grm—A programme to Compute Pedigree, Genomic, and Combined Relationship Matrices.
) according to the algorithms of
• Sargolzaei M.
• Iwaisaki H.
• Colleau J.J.
A fast algorithm for computing inbreeding coefficients in large populations.
and
• Colleau J.J.
An indirect approach to the extensive calculation of relationship coefficients.
. We then obtained the $fPEDij$ as half of the additive genetic relationship.

### Measures of Genetic Merit

Genetic merit was defined as the mean EBV for a selection index. We considered 4 selection indices: 1 total merit index (NVI) and 3 subindices—namely, yield (INET), daughter fertility (FERT), and udder health (UH). All EBV were obtained from the December 2017 publication of the organization for genetic evaluation of bulls in the Netherlands and Flanders (
• CRV
Zoek stier.
). The NVI is the Dutch-Flemish total merit index that includes INET, FERT, UH, longevity, conformation, and birth traits with relative weights (based on the sum of genetic SD) of 26, 14, 14, 11, 30, and 5%, respectively (). The breeding value for INET comprised the EBV for lactose yield, fat yield, and protein yield and was calculated as INET = 0.3 × lactose yield + 2.1 × fat yield + 4.1 × protein yield (). The breeding value for FERT comprised the EBV for the interval between first and last insemination (IFL) and the calving interval and was calculated as FERT = 0.52 × (IFL – 100) + 0.52 × (calving interval – 100) + 100 (). The breeding value for UH comprised the EBV for subclinical mastitis (SCM) and clinical mastitis (CM) and was calculated as UH = 0.477 × (SCM – 100) + 0.641 × (CM – 100) + 100 (). The EBV for FERT and UH were rescaled such that the mean equaled 100 and the SD at population level was 4, whereas the NVI and INET were used on their original scales ().

### Optimal Contribution Selection

Since its introduction in the late 1990s (
• Meuwissen T.H.E.
Maximizing the response of selection with a predefined rate of inbreeding.
;
• Grundy B.
• Villanueva B.
• Woolliams J.A.
Dynamic selection procedures for constrained inbreeding and their consequences for pedigree development.
), optimal contribution selection (OCS) has become the gold standard to maximize the mean EBV in the next generation while restricting the mean relatedness to a predefined value. The restriction on relatedness is generally based on the desired rate of inbreeding (e.g., 0.5 or 1%). In addition to balancing genetic merit and variability, OCS may be used to maximize variability by minimizing relatedness irrespective of genetic gain (e.g.,
• Eynard S.E.
• Windig J.J.
• Hulsegge I.
• Hiemstra S.J.
• Calus Mario P.L.
The impact of using old germplasm on genetic merit and diversity—A cattle breed case study.
).
In this study, we used OCS to assess the value of cryobank bulls to the current bull population. We compared results of OCS schemes with only bulls from the current population with those with both bulls from the current population and cryobank bulls. More specifically, we considered the following 4 populations: (1) VYB, (2) VYB + YB, (3) VYB + YB + CBG, and (4) VYB + YB + CBG + CBP. For each population, we first ran OCS to maximize variability (i.e., minimize mean SIMSNP or mean fPED). We then maximized the mean EBV for NVI, INET, FERT, or UH while restricting the mean SIMSNP or mean fPED to predefined values. These predefined values ranged from a minimum (previously determined by minimizing SIMSNP or fPED) to a maximum of 68% for SIMSNP and 12.5% for fPED. The chosen maxima corresponded to a rate of inbreeding of about 5% when considering the VYB as the current generation. Between the minimum and maximum, we ran scenarios at intervals of 0.04% for SIMSNP and of 0.08% for fPED. Analyses were performed with Gencont software (
• Meuwissen T.H.E.
GENCONT: An operational tool for controlling inbreeding in selection and conservation schemes.
), which uses the Lagrangian multiplier approach to solve the optimization problem (
• Meuwissen T.H.E.
Maximizing the response of selection with a predefined rate of inbreeding.
).
To compare results across traits, we present the differences in merit between populations not only on the original index scales but also in standard deviations (which equaled 128.3 for NVI, 136.0 for INET, 4.7 for FERT, and 5.1 for UH). To further visualize the value of cryobank bulls, we evaluated the total contribution that was assigned to each subgroup when running OCS with all genotyped bulls (VYB + YB + CBG) for SIMSNP or with all bulls with pedigree (VYB + YB + CBG + CBP) for fPED.

## RESULTS

### Genetic Trends

Mean NVI and INET have increased continuously over the last 30 yr (Figure 2). Mean FERT and mean UH first decreased until they were included in the breeding goal around 2000. Since then, the genetic level for FERT and UH has steadily increased.

### Descriptive Statistics Across Groups

Mean pedigree completeness (NCG and CGE), mean relatedness (SIMSNP and fPED), and mean EBV (for all traits) were greatest in the VYB followed by the YB, CBG, and CBP (Table 1). Means for the CBG and CBP were relatively similar compared with means in the VYB and YB. Cryobank bulls showed most variation in EBV (i.e., greatest SD) followed by the YB and the VYB. Cryobank bulls also showed the lowest minimum EBV followed by the YB and the VYB. The maximum of the total merit index NVI was smaller in the CBG (268) and CBP (287) than in the YB (374) and VYB (380). In fact, the maxima for NVI in the CBG and CBP were less than 1 SD above the mean EBV in the VYB. For the subindices (INET, FERT, and UH), however, the maximum was similar across groups, and the maxima for the CBG and CBP were in the upper tail of the corresponding distributions in the VYB and YB (i.e., above the mean +2 SD).
Table 1Descriptive statistics for pedigree completeness (2 measures), relatedness (2 measures), and EBV (4 selection indices) by group of bulls
Item
VYB = very young bulls (n = 212); YB = young bulls (n = 762); CBG = cryobank bulls with genotype data (n = 2,888); CBP = cryobank bulls with only pedigree data (n = 1,921).
Pedigree completeness
NCG = number of completely known generations; CGE = complete generation equivalent.
Relatedness
SIMSNP = genomic similarity (excluding self-similarities); fPED = pedigree-based kinship (excluding self-kinships).
EBV
NVI = total merit index; INET = yield index; FERT = daughter fertility index; UH = udder health index.
NCGCGESIMSNP (%)fPED(%)NVIINETFERTUH
Mean
VYB7.2713.9565.857.82241.1237.5101.5105.3
YB6.6112.6765.256.91141.8139.5100.7103.4
CBG5.9210.8664.485.20−46.7−7.3898.098.5
CBP5.7110.48NA
Not applicable.
5.14−78.9−51.197.898.2
SD
VYB0.890.440.972.1146.773.72.22.9
YB1.120.551.052.3964.7100.63.63.5
CBG0.980.720.882.0599.6117.15.04.9
CBP1.070.91NA2.19101.5118.04.34.8
Minimum
VYB412.7963.134.00124369695
YB210.5962.432.23−96−1818889
CBG27.3861.920.48−413−4107680
CBP25.90NA0.19−438−4878475
Maximum
VYB914.8975.2331.43380448107112
YB914.0675.7832.38374415111114
CBG812.7175.7533.22268472113111
CBP914.06NA31.47287382114113
1 VYB = very young bulls (n = 212); YB = young bulls (n = 762); CBG = cryobank bulls with genotype data (n = 2,888); CBP = cryobank bulls with only pedigree data (n = 1,921).
2 NCG = number of completely known generations; CGE = complete generation equivalent.
3 SIMSNP = genomic similarity (excluding self-similarities); fPED = pedigree-based kinship (excluding self-kinships).
4 NVI = total merit index; INET = yield index; FERT = daughter fertility index; UH = udder health index.
5 Not applicable.

### Maximizing Genetic Variability

Genetic variability was maximized by minimizing relatedness (either mean SIMSNP or mean fPED) with OCS. Minimization of SIMSNP in the VYB decreased mean SIMSNP from the current generation (65.9%) to the next generation (65.4%) by 0.5% (Table 2). This is equivalent to a 0.5% increase in mean heterozygosity. When the YB was added to the VYB, the mean SIMSNP in the next generation further decreased by 0.9% (to 64.5%). Adding the CBG resulted in a further decrease of 0.7%. In other words, inclusion of genotyped cryobank bulls increased expected heterozygosity by 0.7% when compared with a scheme in which only VYB and YB were used. Minimization of mean fPED in the VYB decreased mean fPED from the current generation (7.8%) to the next generation (7.0%) by 0.8%. Stepwise adding the YB, CBG, and CBP resulted in further decreases of 1.8, 1.8, and 0.8%, respectively. Thus, the inclusion of genotyped and nongenotyped cryobank bulls decreased mean fPED by 2.6% when compared with the scenario in which only VYB and YB were used. Note that although in absolute terms the realized decrease in mean fPED was larger than that for mean SIMSNP, in relative terms (i.e., scaled by the noninbred part) they were quite similar. The difference between the VYB before OCS and the VYB + YB + CBG after OCS, for example, was 2.1% for SIMSNP and 4.3% for fPED in absolute terms and 6.1% for SIMSNP and 4.7% for fPED in relative terms.
Table 2Mean relatedness before and after minimizing relatedness with optimal contribution selection as well as number of selected bulls (n_sel) for 4 populations and 2 relatedness measures
Population
VYB = very young bulls (n = 212); YB = young bulls (n = 762); CBG = cryobank bulls with genotype data (n = 2,888); CBP = cryobank bulls with only pedigree data (n = 1,921).
No.Minimized relatedness measure
SIMSNP = genomic similarity; fPED = pedigree-based kinship.
SIMSNPfPED
Before (%)After (%)n_selBefore (%)After (%)n_sel
VYB21265.8565.37917.827.02127
VYB + YB97465.3364.511517.025.24232
VYB + YB + CBG3,86264.5063.782255.263.45236
VYB + YB + CBG + CBP5,783NA
Not applicable.
NANA5.162.63247
1 VYB = very young bulls (n = 212); YB = young bulls (n = 762); CBG = cryobank bulls with genotype data (n = 2,888); CBP = cryobank bulls with only pedigree data (n = 1,921).
2 SIMSNP = genomic similarity; fPED = pedigree-based kinship.
3 Not applicable.
The decrease in mean relatedness that was achieved by including additional groups of selection candidates was accompanied by an increase in the number of candidates that was selected and by a decrease in the percentage of candidates that was selected. The increase in number of selected candidates was especially apparent when moving from VYB to VYB + YB for both SIMSNP and fPED and when moving from VYB + YB to VYB + YB + CBG for SIMSNP (Table 2). Moving from the VYB to larger populations was also accompanied by a redistribution of contributions among selected bulls.
Maximizing genetic variability, irrespective of genetic merit, decreased mean EBV for all indices (Figure 3). For example, minimizing mean SIMSNP in the VYB decreased the mean NVI from the current generation (241.1) to the next generation (227.8) by 13.3 points. Stepwise adding the YB and CBG resulted in further decreases of 107.7 points (to 120.1) and 193.1 points (to −73.0), respectively. Thus, there was a clear cost in merit when selecting only for variability.

### Maximizing Genetic Merit While Maintaining Genetic Variability

The inclusion of additional groups of (old) selection candidates, such as cryobank bulls, resulted in more merit at the same level of variability (Figure 3). At a mean SIMSNP equal to the mean SIMSNP of the current VYB (65.58%), for example, maximization for INET resulted in an INET of 322.8 when using VYB, 365.6 when using VYB + YB, and 389.3 when using VYB + YB + CBG. An exception was found for scenarios in which NVI was maximized at high levels of mean SIMSNP. For these scenarios, the VYB + YB + CBG provided slightly less merit than the VYB + YB (see Discussion).
The benefit of including additional groups of (old) selection candidates, such as genotyped cryobank bulls, was greater when more emphasis was put on genetic variability. In other words, the difference between the curves in Figure 3 was greater at lower levels of relatedness. For example, the additional merit for FERT obtained by adding the CBG to the VYB + YB at mean SIMSNP levels of 64.5, 65.5, 66.5, and 67.5% was 8.1 (1.74 SD), 2.4 (0.51 SD), 1.9 (0.4 SD), and 1.7 (0.37 SD), respectively.
The benefit of including additional groups of (old) selection candidates, such as genotyped cryobank bulls, at specific levels of relatedness differed across selection indices (Figure 4). The additional merit obtained by adding the CBG to the VYB + YB at a mean SIMSNP of 64.5%, for example, was 0.60 SD (71.2 points) for NVI, 1.06 SD (144.3 points) for INET, 1.74 SD (8.1 points) for FERT, and 1.18 SD (6.0 points) for UH. For NVI, there was no additional merit of including the CBG at mean SIMSNP above 65.24%. For UH, there was no additional merit (i.e., <0.01 SD) of the CBG at mean SIMSNP above 65.80%. For INET, additional merit of the CBG was relatively stable (∼0.16 SD) for mean SIMSNP above 65%. Of the 4 indices, FERT showed the greatest additional merit for the CBG. Although the additional merit for FERT also decreased when mean SIMSNP increased, there was still benefit of including CBG at high levels of mean SIMSNP (e.g., 0.36 SD more merit at a mean SIMSNP of 67.5%).
Results for fPED were similar to those for SIMSNP (Figure 3). The VYB + YB + CBG + CBP resulted in more genetic merit compared with the VYB + YB + CBG, but this difference was present only at very low levels of mean fPED. For NVI, INET, and FERT, the additional merit of the CBP quickly decreased with increasing fPED, and for mean fPED of ≥5% it was approximately zero. For FERT, there was still a bit of additional merit at higher levels of mean fPED (∼0.13 SD).

### Contributions of Groups

When SIMSNP was minimized with all genotyped bulls (VYB + YB + CBG), 89% of the contributions were assigned to the CBG, 10% to the YB, and 1% to the VYB (Figure 5). When fPED was minimized with all bulls available (VYB + YB + CBG + CBP), 64% of the contributions were assigned to the CBP, 34% to the CBG, 2% to the YB, and 0% to the VYB.
As expected from Figure 3, the contribution of cryobank bulls (CBG and CBP) generally decreased when the restriction on relatedness became less stringent. The exact pattern differed across selection indices. For the total merit index NVI, the contribution of cryobank bulls continued to decrease with increasing levels of relatedness. At very high levels of relatedness, only bulls from the VYB (∼70%) and YB (∼30%) were selected. For INET, the contribution of cryobank bulls also decreased with increasing relatedness but remained stable (at ∼40%) for mean SIMSNP of ≥65.5% and for mean fPED of ≥8%. For FERT, about 90% of the contributions were assigned to cryobank bulls at any level of relatedness. For UH, the contribution of cryobank bulls also decreased with an increase in relatedness. This index showed a relatively high contribution of the YB. For UH, a single bull in the CBP (born in 2004) with a very high EBV (113) was assigned a contribution of about 15% in the scenarios with high mean fPED.

## DISCUSSION

### Value of the Germplasm Collection

Our primary objective was to assess the value of the Dutch HF germplasm collection to increase genetic variability and improve genetic merit in the current bull population. The selection of almost exclusively cryobank bulls when maximizing genetic variability (Figure 5) shows that diversity of the current bull population is well captured by the germplasm collection. This finding is in line with the results of
• Danchin-Burge C.
• Hiemstra S.J.
• Blackburn H.
Ex situ conservation of Holstein-Friesian cattle: Comparing the Dutch, French, and US germplasm collections.
. Results based on minimization of mean SIMSNP and mean fPED furthermore suggest that the germplasm collection can be used to increase genetic variability in the current population (Table 2). When genetic merit and genetic variability are balanced, the inclusion of cryobank bulls in addition to bulls from the current population may result in more merit at the same level of variability or, equivalently, in more variability at the same level of merit (Figure 3). We found that the benefit of using cryobank bulls depended on 2 factors: (1) the relative emphasis on variability and (2) the selection criterion (i.e., the index).
The additional merit of cryobank bulls, and the percentage of contributions assigned to cryobank bulls, was greater when more emphasis was put on genetic variability (Figure 3, Figure 4). This is not surprising because both relatedness and genetic merit have increased over time (Figure 2;
• Doekes H.P.
• Veerkamp R.F.
• Bijma P.
• Hiemstra S.J.
• Windig J.J.
Trends in genome-wide and region-specific genetic diversity in the Dutch-Flemish Holstein-Friesian breeding program from 1986 to 2015.
).
Additional merit of cryobank bulls was relatively low for the current total merit index NVI and greater for the subindices INET, UH, and FERT (Figure 3, Figure 4). For the NVI, there was almost no value in including cryobank bulls as selection candidates (except when we put a lot of emphasis on genetic variability). The limited additional merit for the NVI can be explained by the fact that the NVI has been the main index under selection in recent decades and that bulls from the current population simply have the highest NVI (Figure 2; Table 1). This finding also suggests that selection for NVI in the past has been effective, at least for the EBV. The observed difference between additional merit of cryobank bulls for the NVI on one hand and the subindices on the other reflects the principle that “single-trait selection often suffers from antagonistic correlations with traits not in the selection objective,” whereas “multiple-trait selection avoids those problems at the cost of less-than-maximal progress for individual traits” (
• Cole J.B.
Symposium review: Possibilities in an age of genomics: The future of selection indices.
). Past selection for NVI has resulted in less-than-maximal progress for the subindices. Some bulls that were assigned high contributions in this study when selecting for a subindex (e.g., yield), may not have been used so much in practice because they scored relatively low for other traits. The relatively high additional merit for FERT can be explained by the availability of cryobank bulls with high FERT, which were born before the intense selection for yield traits in the last decades of the 20th century and, thus, before the associated decrease in FERT (Figure 2) that was due to the well-known antagonistic correlation between fertility and yield traits.
Cryobank bulls with only pedigree data (i.e., CBP) were assigned up to 70% of the contributions when maximizing merit at low levels of mean fPED (Figure 5). This finding suggests that, based on pedigree, there are bulls in the CBP that are of interest with regard to genetic variability. To determine whether these bulls are also less related at the genomic level or whether they were simply selected by OCS because they had limited pedigree completeness, they would have to be genotyped.
It is important to note that the actual breeding program is more dynamic than the schemes we evaluated. First, selection in practice occurs in both sexes, whereas we considered only bulls. We believe that the use of only bulls was sufficient for the purpose of this study because genetic variability in the population is largely driven by relatedness between bulls (due to substantial use of AI). Nevertheless, selection at the cow side may offer some possibilities for management of genetic variability, especially because cows show lower relatedness levels than AI bulls (
• Doekes H.P.
• Veerkamp R.F.
• Hiemstra S.J.
• Bijma P.
• van der Beek S.
• Windig J.J.
Genomic selection and inbreeding and kinship in Dutch-Flemish Holstein Friesian cattle.
). Second, the definition of the current bull population is not straightforward. We focused on bulls from a single breeding program, whereas in practice bulls from other breeding programs (from other countries) also may be used as selection candidates. Furthermore, we used 2 groups of bulls to represent the current bull population: 1 consisting of only VYB, born in 2013 to 2015, and 1 consisting of VYB and YB, born in 2010 to 2015. In practice, mostly top bulls from the VYB will be selected for breeding together with some top bulls from earlier years. To investigate the effect of focusing on top bulls, we performed OCS using the 50 top bulls from the VYB with the highest NVI. For any mean SIMSNP above 66.3%, the NVI obtained with the top bulls was approximately identical to the NVI obtained with the entire VYB. When using only top bulls, a mean SIMSNP below 66.3% could not be realized, whereas a mean SIMSNP of 65.4% could be realized when using the entire VYB. This finding emphasizes that focusing only on top bulls further reduces genetic variability and highlights the importance of also storing lower ranked bulls in the gene bank collection.
In this study, we considered only a single generation of OCS. As shown by
• Leroy G.
• Danchin-Burge C.
• Verrier E.
Impact of the use of cryobank samples in a selected cattle breed: A simulation study.
, using cryobank bulls to increase genetic variability in a single generation will have no effect in the long term if their offspring are not selected subsequently. Genetic variability, therefore, should receive attention in subsequent generations as well. Continuous use of OCS may ensure that offspring of cryobank bulls are selected depending on the relative emphasis put on genetic variability. It would be interesting to investigate the decrease in long-term contributions of cryobank bulls considering various constraints on loss of genetic variability.

### Limit of the Lagrangian Multiplier Approach

An unexpected result was found when maximizing NVI at high levels of mean SIMSNP in the VYB + YB and VYB + YB + CBG populations. For these scenarios, the obtained NVI was greater for the VYB + YB than for the VYB + YB + CBG (although the difference was small; i.e., <0.05 SD; Figure 4). This result is theoretically impossible because all bulls in the former population were also part of the latter population. If the CBG would have provided no additional merit at all, the obtained NVI at a given mean SIMSNP for the VYB + YB + CBG should have been at least equal to that for the VYB + YB. The observed pattern, therefore, has to be an artifact of the used algorithm. A difficulty of running OCS with the Lagrangian multiplier approach is that some contributions in the obtained optimal solution may be negative (
• Meuwissen T.H.E.
Maximizing the response of selection with a predefined rate of inbreeding.
). This problem is remedied in Gencont by eliminating selection candidates with negative contributions and repeating the optimization procedure until no negative contributions remain. The drawback of this approach, however, is that some of the candidates that were eliminated in early iterations may have had a positive contribution in the true optimal solution (
• Pong-Wong R.
• Woolliams J.A.
Optimisation of contribution of candidate parents to maximise genetic gain and restricting inbreeding using semidefinite programming.
;
• Woolliams J.A.
• Berg P.
• Dagnachew B.S.
• Meuwissen T.H.E.
Genetic contributions and their optimization.
). When we compared high mean SIMSNP scenarios between the VYB + YB and the VYB + YB + CBG, we observed that some bulls with moderate to high contributions in the former population were not selected at all in the latter population. This could be due to the elimination procedure. A possibility to get closer to the true optimal solution is to remove only a single candidate per iteration (the one with the most negative contribution) instead of eliminating all candidates with negative contributions. This remedy, however, would increase computation time. Alternatively, one may consider a completely different approach to solve the optimization problem, such as semidefinite programming (
• Pong-Wong R.
• Woolliams J.A.
Optimisation of contribution of candidate parents to maximise genetic gain and restricting inbreeding using semidefinite programming.
).

### Future Perspectives for HF Germplasm Collections

The Dutch HF germplasm collection is a rather unique collection containing material from many AI bulls over a period of approximately 40 yr. In addition to the national gene bank collection, AI companies and farmers have stored germplasm over time. These companies and individual farmers, however, cannot guarantee the availability of stored material in the long term. Systematically storing genetic material in national collections, therefore, is required to safeguard the material for future use.
Various national HF collections exist across the globe. For the collections of the Netherlands, France, and the United States,
• Danchin-Burge C.
• Hiemstra S.J.
• Blackburn H.
Ex situ conservation of Holstein-Friesian cattle: Comparing the Dutch, French, and US germplasm collections.
showed that there is substantial overlap in terms of the stored (pedigree-based) variability. An interesting question is whether national collections should be combined to reduce storage redundancy. We believe that although cooperation between gene banks is important to efficiently allocate resources and ensure that global HF variability is stored, there is substantial value in having separate national collections. The main advantage of having separate collections is that material from a national collection is more readily available for local breeding programs. In addition, having separate gene bank collections is an insurance against calamities.
An important aspect of gene bank management is to determine which and how many individuals are stored in a collection. In the case of the Dutch HF, a vast majority of AI bulls have been stored in the collection over time, with generally 25 straws per bull. These preselected AI bulls will not have covered all genetic variability and genetic potential for various traits present in the national population. To optimize collections, one could also store material from bulls that are not used for AI (and use OCS to determine exactly which bulls to store). The number of straws stored per individual is important with regard to the potential use, and therefore the value, of the collection. Today, fast genetic gain in HF is realized by producing many embryos through superovulation (
• Jaton C.
• Koeck A.
• Sargolzaei M.
• Malchiodi F.
• Price C.A.
• Schenkel F.S.
• Miglior F.
Genetic analysis of superovulatory response of Holstein cows in Canada.
), which requires a lot of semen. One may question whether cryobank bulls, with a limited number of straws, can make a significant contribution in the current system. Producing 90% of all offspring in 1 generation with cryobank bulls (such as in the hypothetical FERT scenario in this study), for example, is not possible. Across multiple generations, however, gene bank material could be disseminated throughout the population.
In line with the simulation study of
• Leroy G.
• Danchin-Burge C.
• Verrier E.
Impact of the use of cryobank samples in a selected cattle breed: A simulation study.
, our results suggest that gene bank collections are mostly valuable when the aim is to increase genetic variability or when major changes in selection objectives or practices occur (and when the use of animals from other breeds is not preferred). Breeding goals for HF have changed in the past (
• Miglior F.
• Muir B.L.
• van Doormaal B.J.
Selection indices in Holstein cattle of various countries.
;
• Cole J.B.
Symposium review: Possibilities in an age of genomics: The future of selection indices.
;
• Doekes H.P.
• Veerkamp R.F.
• Bijma P.
• Hiemstra S.J.
• Windig J.J.
Trends in genome-wide and region-specific genetic diversity in the Dutch-Flemish Holstein-Friesian breeding program from 1986 to 2015.
) and are expected to change further in the future. A complete shift from the total merit index to a subindex such as FERT is very unlikely. Instead, we expect that relative weights for specific trait groups will gradually shift over time and that novel traits will be added to the breeding goal. Various factors may influence the shift in breeding goal composition, including production economics (e.g., milk quota), societal demands (e.g., animal welfare), environment (e.g., climate change), technology (e.g., mid-infrared spectroscopy), and breeding value estimation (e.g., genomic prediction). By separating phenotype recording from the selection process, genomic prediction has removed the need for large-scale phenotyping and enabled selection for novel traits that are difficult to measure. An overview of such novel traits is provided by
• Boichard D.
• Brochard M.
New phenotypes for new breeding goals in dairy cattle.
,
• Egger-Danner C.
• Cole J.
• Pryce J.
• Gengler N.
• Stock K.F.
Invited review: Overview of new traits and phenotyping strategies in dairy cattle with a focus on functional traits.
, and
• Cole J.B.
Symposium review: Possibilities in an age of genomics: The future of selection indices.
. Cryobank bulls may have relatively high EBV for novel traits because these traits have not been directly selected for in the past. Anticipating changes in the HF breeding goal in the future, the germplasm collection is a valuable resource in terms of both genetic variability and genetic merit.

## CONCLUSIONS

Bulls from the Dutch HF germplasm collection can be used to increase genetic variability in the current breeding population. When genetic merit and genetic variability are to be balanced, the benefit of including cryobank bulls as selection candidates in addition to bulls from the current population depends on (1) the relative emphasis on genetic variability and (2) the selection criterion. Additional merit from cryobank bulls is higher when more emphasis is put on variability. Additional merit from cryobank bulls is very low for the current total merit index but higher for the subindices INET, UH, and FERT (especially high for fertility). Anticipating changes in the HF breeding goal in the future, the germplasm collection is a valuable resource for commercial breeding populations in terms of both genetic variability and genetic merit.

## ACKNOWLEDGMENTS

The research leading to these results was conducted as part of the IMAGE project, which received funding from the European Union's Horizon 2020 Research and Innovation Programme under grant agreement no. 677353. In addition, the Dutch Ministry of Agriculture, Nature and Food Quality (The Hague, the Netherlands) contributed financially (KB-21-004-003). The authors acknowledge CRV (Arnhem, the Netherlands) for providing pedigree and genotype data and thank Ina Hulsegge (Animal Breeding and Genomics, Wageningen University and Research, Wageningen, the Netherlands) for technical support.

## REFERENCES

• Boichard D.
• Brochard M.
New phenotypes for new breeding goals in dairy cattle.
Animal. 2012; 6 (22436268): 544-550
• Calus M.P.L.
• Vandenplas J.
Calc_grm—A programme to Compute Pedigree, Genomic, and Combined Relationship Matrices.
Animal Breeding and Genomics Centre, Wageningen UR Livestock Research, Wageningen, the Netherlands2013
• Cole J.B.
Symposium review: Possibilities in an age of genomics: The future of selection indices.
J. Dairy Sci. 2018; 101 (29103719): 3686-3701
• Colleau J.J.
An indirect approach to the extensive calculation of relationship coefficients.
Genet. Sel. Evol. 2002; 34 (12270102): 409-421
• CRV
Zoek stier.
https://www.cooperatie-crv.nl/zoek-stier/
Date: 2018
Date accessed: January 29, 2018
• CRV
E-Hoofdstukken.
• Danchin-Burge C.
• Hiemstra S.J.
• Blackburn H.
Ex situ conservation of Holstein-Friesian cattle: Comparing the Dutch, French, and US germplasm collections.
J. Dairy Sci. 2011; 94 (21787945): 4100-4108
• Doekes H.P.
• Veerkamp R.F.
• Bijma P.
• Hiemstra S.J.
• Windig J.J.
Trends in genome-wide and region-specific genetic diversity in the Dutch-Flemish Holstein-Friesian breeding program from 1986 to 2015.
Genet. Sel. Evol. 2018; 50 (29642838): 15
• Doekes H.P.
• Veerkamp R.F.
• Hiemstra S.J.
• Bijma P.
• van der Beek S.
• Windig J.J.
Genomic selection and inbreeding and kinship in Dutch-Flemish Holstein Friesian cattle.
in: Proc. 11th World Congr. Genet. Appl. Livest. Prod., Auckland, New Zealand. Massey University, Palmerston North, New Zealand2018: 90
• Druet T.
• Georges M.
A hidden Markov model combining linkage and linkage disequilibrium information for haplotype reconstruction and quantitative trait locus fine mapping.
Genetics. 2010; 184: 789-798
• Egger-Danner C.
• Cole J.
• Pryce J.
• Gengler N.
• Stock K.F.
Invited review: Overview of new traits and phenotyping strategies in dairy cattle with a focus on functional traits.
Animal. 2015; 9 (25387784): 191-207
• Eynard S.E.
• Windig J.J.
• Hulsegge I.
• Hiemstra S.J.
• Calus Mario P.L.
The impact of using old germplasm on genetic merit and diversity—A cattle breed case study.
J. Anim. Breed. Genet. 2018; (29808552)
• Eynard S.E.
• Windig J.J.
• Leroy G.
• Van Binsbergen R.
• Calus M.P.
The effect of rare alleles on estimated genomic relationships from whole genome sequence data.
BMC Genet. 2015; 16 (25887220): 24
• Falconer D.S.
• Mackay T.F.C.
Introduction to Quantitative Genetics.
4th ed. Longman, Harlow, UK1996
• FAO (Food and Agriculture Organization of the United Nations)
Part 3: The state of capacities. Section D: Conservation programmes.
in: Pilling B.D.S.D. The Second Report on the State of the World's Animal Genetic Resources for Food and Agriculture. FAO Commission on Genetic Resources for Food and Agriculture Assessments, Rome, Italy2015: 295-307
• Garcıa M.
• Baselga M.
Estimation of genetic response to selection in litter size of rabbits using a cryopreserved control population.
Livest. Prod. Sci. 2002; 74: 45-53
• Grundy B.
• Villanueva B.
• Woolliams J.A.
Dynamic selection procedures for constrained inbreeding and their consequences for pedigree development.
Genet. Res. 1998; 72: 159-168
• Jaton C.
• Koeck A.
• Sargolzaei M.
• Malchiodi F.
• Price C.A.
• Schenkel F.S.
• Miglior F.
Genetic analysis of superovulatory response of Holstein cows in Canada.
J. Dairy Sci. 2016; 99 (26923051): 3612-3623
• Leroy G.
• Danchin-Burge C.
• Verrier E.
Impact of the use of cryobank samples in a selected cattle breed: A simulation study.
Genet. Sel. Evol. 2011; 43 (22047533): 36
• Malécot G.
Mathématiques de l'Hérédité.
Masson & Cie, Paris, France1948
• Meuwissen T.H.E.
Maximizing the response of selection with a predefined rate of inbreeding.
J. Anim. Sci. 1997; 75 (9110204): 934-940
• Meuwissen T.H.E.
GENCONT: An operational tool for controlling inbreeding in selection and conservation schemes.
in: Proc. 7th World Congr. Genet. Appl. Livest. Prod., Montpellier, France. Editions Quae, Montpellier, France2002: 769-770
• Miglior F.
• Muir B.L.
• van Doormaal B.J.
Selection indices in Holstein cattle of various countries.
J. Dairy Sci. 2005; 88 (15738259): 1255-1263
• Pong-Wong R.
• Woolliams J.A.
Optimisation of contribution of candidate parents to maximise genetic gain and restricting inbreeding using semidefinite programming.
Genet. Sel. Evol. 2007; 39 (17212945): 3-25
• Rodríguez-Ramilo S.T.
• Fernández J.
• Toro M.A.
• Hernández D.
• Villanueva B.
Genome-wide estimates of coancestry, inbreeding and effective population size in the Spanish Holstein population.
PLoS One. 2015; 10 (25880228): e0124157
• Sargolzaei M.
• Iwaisaki H.
• Colleau J.J.
A fast algorithm for computing inbreeding coefficients in large populations.
J. Anim. Breed. Genet. 2005; 122 (16191041): 325-331
• Shepherd R.K.
• Woolliams J.A.
Minimising inbreeding in small populations by rotational mating with frozen semen.
Genet. Res. 2004; 84 (15678746): 87-93
• Smith C.
Use of stored frozen semen and embryos to measure genetic trends in farm livestock.
J. Anim. Breed. Genet. 1977; 94: 119-130
• Sonesson A.K.
• Goddard M.E.
• Meuwissen T.H.
The use of frozen semen to minimize inbreeding in small populations.
Genet. Res. 2002; 80 (12448855): 27-30
• Woolliams J.A.
• Berg P.
• Dagnachew B.S.
• Meuwissen T.H.E.
Genetic contributions and their optimization.
J. Anim. Breed. Genet. 2015; 132 (25823835): 89-99