|
|
||||||||
,
,1
* Department of Genetics and Biotechnology, Faculty of Agricultural Sciences, University of Aarhus, Denmark
Agricultural Biotechnology Research Institute of Iran, Karaj, Iran
Department of Large Animal Sciences, Faculty of Life Sciences, University of Copenhagen, Denmark
1 Corresponding author: SaeidAnsari.Mahyari{at}agrsci.dk
| ABSTRACT |
|---|
|
|
|---|
Key Words: marker-assisted selection dairy cattle quantitative trait locus selective genotyping
| INTRODUCTION |
|---|
|
|
|---|
In previous studies, the selection schemes using marker information for dairy cattle were largely based on information from within families (Kashi et al., 1990; Mackinnon and Georges, 1998; Spelman and Garrick, 1998). Recently, the use of linkage disequilibrium (LD) information to locate QTL has increased (Pérez-Enciso, 2003; Meuwissen and Goddard, 2004; Uleberg and Meuwissen, 2007). The next step after fine-mapping of QTL is to use them in prediction of breeding values. There are several examples in dairy cattle with LD markers used for preselection of candidates (e.g., Dekkers, 2004).
Using DNA information in a population with LD can enhance the accuracy of identifying superior candidates. However, the cost of complete genotype information in MAS is a major limitation in a breeding scheme. Genotyping the whole population is also difficult for practical reasons in commercial dairy cattle populations. Several investigations in QTL experiments attempted to decrease genotyping costs by identifying the most informative individuals based on phenotypic information (Lander and Botstein, 1989; Darvasi and Soller, 1992), segregation analysis (Kinghorn, 1999; Macrossan and Kinghorn, 2003), or combining the phenotypic and genotypic information (Ansari-Mahyari and Berg, 2008).
The objective of this study was to compare genetic gain for several strategies of selective genotyping relative to complete genotype information under the assumption that QTL parameters are known. The 2 genotyping levels (20 and 50%) were considered and compared with genotyping all or no candidates, the latter resulting in selection being based on BLUP. This case was used as a reference. The response to selection at the QTL and polygenes for the different strategies were compared in a simulated dairy cattle breeding scheme for a low-heritability trait only expressed in females (e.g., mastitis). This trait was chosen to maximize the power to detect differences between strategies because it is a trait that is expected to favor MAS over BLUP.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Genetic Model
The base-population was simulated to generate LD between the markers and QTL according to Meuwissen and Goddard (2000), which assumed that variation in a QTL is due to a mutation that occurred 100 generations ago and increased to a certain frequency due to random drift, given a specific effective population size (200 in this study). Genotypes were simulated for 4 marker loci placed in an identified 4-cM region of a chromosome and positioned at 0, 1.5, 3.5, and 4 cM. Markers with 5 alleles of equal frequencies were assumed. The detected QTL was situated between the second and the third marker at position 2.5 cM. In the first generation of simulating LD, unique alleles were assigned to the QTL for the founders. At the end of simulating LD (generation 100), a founder allele with a frequency of 0.10 (± 0.01) was defined as the favorable QTL allele and the rest of the remaining founder alleles were defined to be the unfavorable QTL allele. Hence, the QTL was biallelic and the frequency of the favorable QTL allele was between 0.09 and 0.11 in the base population. Marker and QTL genotypes were assigned according to Mendelian inheritance. The simulation allowed for recombination within the region, according to the genetic distance between the loci. Haplotypes for base animals were sampled from the haplotypes present in generation 100. In this study, 50 base populations with the above characteristics were generated. Then, the same base-populations were used in all genotyping strategies described below.
True breeding values were simulated assuming a mixed inheritance model with an additive QTL effect. A sex-limited trait, like mastitis resistance, was considered with a heritability of 0.04 and a total phenotypic variance of 1.0 in the base population. The polygenic effect of each animal (ai) in the base population was sampled from N(0,
a2), where
a2 is the variance of the additive polygenic component, which was 0.03. The polygenic effect of later generations was obtained from ai =
asire +
adam + mi, where mi is the Mendelian sampling effect, which was sampled from N(0,
(1 – f)
a2), where f is the average inbreeding coefficient of the sire and dam of i.
It was assumed that the QTL had been identified in earlier QTL mapping experiments, and that the position and variance of the QTL was known with certainty. The variance due to the QTL was
qtl2 =2p(1–P)
2 (Falconer and Mackay, 1996), where p is the frequency of the favorable QTL allele in the base population and
is the gene substitution effect. The gene substitution effect was constant and equal to 0.236, so that
qtl2 was approximately 0.01 in the base-population, as the initial frequency of the favorable QTL allele was approximately 0.10. Given that the QTL allele frequency changed, the total genetic variance changed over time.
Population Structure
The numbers of selected candidates in the scenarios are shown in Table 1
. After simulating the base-population with LD, 40 herds of equal size (100 cows per herd) were generated. The population size was kept constant.
|
Selection of Females.
In the BLUP approach, 4,000 cows were selected based on EBV including own performance and offspring information and the best 1,000 cows were chosen as bull dams. In the MA-BLUP approach, potential bull dams (maximum 2,000 cows) were first selected for genotyping based on EBV. Again, depending on the selective genotyping strategies, all or a fraction of them were genotyped and MA-BLUP evaluation was performed including the new genotype information. Ultimately, the best 1,000 cows were selected as bull dams. In both BLUP and MA-BLUP, another 3,000 cows were selected, based on the last evaluation to keep the population size constant.
Genotyping Strategies
Several selective genotyping strategies were used under the MA-BLUP approach. In the first year of genotyping in this experiment (yr 4), it was assumed that all founder sires, which were selected for progeny testing, had already been genotyped. Additionally, from the fourth year onward, the sires of animals in the evaluation were assumed to have marker information. The main objective of the selective genotyping strategies was to maximize genetic gain given a reduced number of genotyped animals. Each strategy was replicated 50 times, from 50 different base populations. The criteria in the current study were
1. Complete Genotyping (ALL).
Each year, all 400 young bulls that were candidates to be progeny tested and 2,000 cows that were candidates to be bull dams were genotyped. These individuals were identified based on EBV from an MA-BLUP evaluation.
2. Random Genotyping (RAN).
A random proportion (20 and 50%) of the 400 young bulls and 2,000 cows were genotyped.
3. Average EBV (AVE).
A fraction of the 400 young bulls and 2,000 cows were selected for genotyping based on the estimated QTL and polygenic effects. Animals with EBV closest to the truncation point were chosen for genotyping. This strategy aimed at genotyping the animals most likely to change rank relative to the truncation point.
After an MA-BLUP evaluation, the overall haplotype effect was computed for each candidate as
, where
off is the total predicted QTL effect in the candidate, and
S1 and
S2 (
D1 and
D2) are the 2 estimated QTL effects in the sire (dam) of the candidate. If one or both of the parents were not included in the evaluation, the average haplotype effects of the grandparents were used or in case they were also not in the evaluation, the expected effects of the parental haplotypes were computed as
parent = 2x [(PQ x
Q) + (Pq x
q)], where
parent is the sum of the parental haplotype effects, PQ (Pq) is allele frequency of the favorable (unfavorable) QTL-allele in the birth year of the nongenotyped parent, and
Q (
q) is the gene substitution effect of the favorable (unfavorable) QTL allele. In this case,
parent was used for
S1 +
S2 (
D1 +
D2) if the sire (dam) was not included in the evaluation. The total genetic value in the offspring was computed as:
. Given the truncation point (t) among preselected animals, an index for each candidate was calculated to compute the absolute deviation as Index = |t –Â off|.
A proportion (20 and 50%) of candidates with the smallest index values was genotyped.
4. Combined Average-EBV and Variance-QTL (COMBINE).
Whereas the previous strategy used the sum of the estimated QTL and polygenic effects, this strategy also used the segregation variance of the detected QTL. The rationale for this strategy followed from the assumption that the Mendelian sampling variance of the potential candidates depends on the parental genotypes, therefore affecting the probability that genotyping may cause a change of the total genotypic effect relative to the truncation point. The EBV (Âoff) were computed as in the previous strategy. Mendelian sampling variance in the preselected candidates was calculated as follows. Each sire (dam) has 2 estimated haplotype effects as
S1 and
S2 (
D1 and
D2). There are 4 possible combinations of the QTL effects in offspring with equal probability as below:
![]() |
![]() |
Mendelian sampling variance was calculated for all preselected candidates by
![]() |
where
off is the mean predicted QTL effect in the offspring
. Again, if one of the parents or both were not genotyped, the mean of the haplotype effects were computed as previously described. However, in the situation neither of the parents were evaluated, the average haplotype effects of the grandparents were used. If grandparents were not in the evaluation, the 1 effects in each parent were computed as
, for the first haplotype effect and
, for the second haplotype. The
was computed as the mean of absolute differences of the estimated haplotype effects (
i,1 and
i,2) in all animals (i = 1 . . . N), which were included in the MA-BLUP evaluation by
![]() |
The
was used to compute a more accurate Mendelian segregation variance for ungenotyped parents without affecting
off. Finally, an index was calculated to identify candidates for genotyping as Index = |P – 0.5|, where P =
(Âoff, MenVar
off, t).
( Âoff, MenVar
off, t) is the cumulative normal distribution function with mean Âoff and variance MenVar
off, which is evaluated at the truncation point. Therefore, P is the probability of an animal being below the truncation point and thus not being selected. Either 20 or 50% of the individuals with the lowest value of the index were genotyped. These were the animals likely to change order relative to the truncation point when genotyped and reevaluated.
Breeding Value Evaluation
The MA-BLUP evaluation was used for selection based on phenotypic, pedigree, and marker information. For progeny tested bulls, daughter yield deviations (DYD) were sampled directly. Twice the DYD was used as the phenotype of the sires in genetic evaluations. During the first 3 yr of all genotyping strategies, animals were selected using BLUP evaluations without genotypic information of the parents.
The MA-BLUP evaluation was based on the following mixed model:
![]() |
where y = vector of records for cows or 2 x DYD for progeny-tested bulls, X = incidence matrix relating phenotypes to fixed effects, β = vector of fixed herd-year-season effects, Z = incidence matrix relating phenotypes to animals, a = vector of additive polygenic effects for each animal, W = incidence matrix relating phenotypes to random QTL effects, q = vector of random additive effects due to the QTL (haplotype effect), and e = vector of residuals. The QTL was fitted as a random effect because QTL genotypes (alleles) in the animals are unknown. The random vectors are assumed to be normally distributed and mutually independent. Specifically, a is N(0, A
a2), q is N(0, Hp
q2), and e is N(0, R
e2), where A is the additive relationship matrix, Hp is the identity by descent (IBD) matrix that contains the IBD probabilities for the QTL position (2.5 cM in this study), and R is a diagonal matrix with elements of 1 for cows and
for the bulls with DYD. The weight was calculated as the variance of twice the DYD. The IBD matrix was derived with assumption about historical population structure and clustering approach for LDLA analysis (combined linkage disequilibrium and linkage analysis) across sire family groups using the method described by Meuwissen and Goddard (2000, 2001). The variance component used for prediction of QTL effects was recalculated every year based on the frequency of the QTL alleles.
For the BLUP evaluation, the QTL effect was excluded from the model and the evaluation was based on phenotypic and pedigree information. The total EBV for BLUP and MA-BLUP evaluations were â and â + 2
, respectively. Because computation of the gametic relationship matrix was time consuming, data used in the genetic evaluation were constrained to the final 15 yr.
The results of genetic responses, QTL frequency, and rate of inbreeding from the strategies in each year were tested using standard ANOVA with the genotyping strategy method and replicate as 2 fixed independent variables. Replicate was included in the model, as the same 50 base populations were used.
| RESULTS |
|---|
|
|
|---|
|
|
|
Both genotyping levels (20 and 50%) in the candidates (young bulls and bull dams) were studied to compare the genetic gain achieved relative to the maximum gain obtainable from genotyping all candidates. As explained in Materials and Methods, in each year of selection regardless of the genotyping criteria, the number of genotyped candidates was based on the genotyping levels (20, 50, or 100%) and only nongenotyped animals were candidates for genotyping. Therefore, in the ALL strategy the numbers of genotyped heifers and cows were in most cases fewer than planned as some were already genotyped in a previous year. Figure 4
shows the percentage of cumulative numbers of genotyped animals (males, females, and both sexes) in genotyping levels of 20 and 50% of the potential candidates relative to the ALL strategy. In the first year of MAS (yr 4), the number of genotyped males was greater because of genotyping of progeny-tested bulls from previous years. On the other hand, the proportion of genotyping in females showed that more than 20 and 50% of the candidates were genotyped in this study in comparison to the ALL strategy until yr 8. This was because more ancestors needed to be genotyped with the identified genotyping levels. Consequently, on average in this study, the actual genotyping proportions relative to the ALL strategy were 32 to 35% for the 20% genotyping level and 72 to 75% for the 50% level.
|
f) was 0.72% in BLUP, whereas the MA-BLUP strategies (with 100% genotyping) resulted in a rate of inbreeding of 0.58%. Genotyping 20% of the potential candidates showed a slightly greater rate of inbreeding than 50 and 100% genotyping during yr 11 to 19 (Figure 5
f with MA-BLUP increased to a value similar to the rate of inbreeding with BLUP.
|
| DISCUSSION |
|---|
|
|
|---|
Three selective genotyping criteria were used to choose animals for genotyping from the potential candidates each year. The AVE and COMBINE strategies chose animals with the highest uncertainty on whether to be selected or not. In the RAN strategy, the animals were sampled randomly from the preselected group. All proposed genotyping strategies showed similar cumulative genetic level each year, at similar proportion of genotyping. The computational time demand was not different between the genotyping strategies using the same genotyping level and, therefore, any of them can be used in a MAS scheme based on selective genotyping.
Increased initial response from MAS is mostly due to increased frequency of the favorable allele and strong emphasis on the detected QTL (Spelman and Garrick, 1997). However, in practical breeding schemes, more than one trait or QTL is used and therefore a slower fixation of the favorable QTL allele is expected. The strategies for selective genotyping presented in this study are easily extendable to multiple QTL and multiple traits.
In this study, selective genotyping criteria were applied for both males and females across families. However, other studies of multistage MAS in dairy cattle considered only sires for genotyping (e.g., Schrooten et al., 2005). One could consider using the genotyping criteria developed in this study in the sires only. It is expected that selective genotyping of males is less superior to BLUP because associations between phenotypes and haplotypes come exclusively from older, progeny-tested sires and not from animals being both phenotyped and genotyped.
Cumulative polygenic response using BLUP evaluations were higher than for MA-BLUP schemes. However, extra polygenic response in BLUP decreased considerably compared with MA-BLUP from yr 13, when the QTL was close to fixation with MA-BLUP selection. Other studies showed less polygenic response to selection based on pedigree and marker information compared with pedigree information only (e.g., Spelman and Garrick, 1997). In contrast, Sonesson (2007) used within-family MAS and showed that polygenic genetic gain was not significantly different when using marker information.
Increased gain in MAS was due to more information in genetic evaluations compared with BLUP (e.g., Villanueva et al., 2005), which results in increased accuracy of predicted breeding values. The accuracy of predicted QTL effects increased during the first years of selection because the increasing number of genotyped offspring gave more information per haplotype. Accuracies for 50% of the potential candidates being genotyped were very close to the accuracies in the ALL strategy, whereas accuracies were lower with 20% of candidates genotyped. In BLUP and MA-BLUP scenarios, the accuracy decreased over time because of decreasing genetic variance due to the Bulmer effect (Bulmer, 1971), inbreeding, and reduced heterozygosity at the QTL (after yr 7 to 10).
Previous simulation studies investigated the benefit of using markers linked to known genes in dairy cattle breeding schemes (Meuwissen and van Arendonk, 1992; Stella et al., 2002; Schrooten et al., 2005) and showed higher genetic response. The genetic progress increased with 5 to 64% compared with BLUP selection, depending on the trait being selected and the genetic models used (Meuwissen and Goddard, 1996; Hayes and Goddard, 2003). The MAS studies assumed complete genotype information, but genotyping the whole population or even the potential candidates alone is expensive and not a practical approach for dairy cattle breeding (Dekkers and Van der Werf, 2007). For reasons mainly related to the genotyping costs, dairy cattle breeding programs based on MAS should identify and genotype the most informative breeding animals in multistage breeding programs. This simulation study attempted to clarify the usefulness of a proper genotyping criterion in a real situation of a dairy cattle breeding scheme.
Marshall et al. (2002) used several genotyping strategies and assumed an advanced breeding scheme (e.g., closed nucleus system) with complete information on pedigree and phenotypes. Based on the predicted marker genotypes with reasonable certainty (Kinghorn, 1999), Marshall et al. (2002) used an index based on genotype information with a single linked marker and EBV of 2 traits to select a fraction of the population for genotyping. Marshall et al. indicated that the gain at a detected QTL with selective genotyping was close to the gain with complete genotyping. However, in more extensive breeding schemes (e.g., sheep and beef), there is incomplete recording of pedigree relationships. In addition, in this approach it is more complicated to compute the genotype probabilities when several markers or complex pedigrees are considered. The method developed by Kinghorn (1999) to find the most informative individuals for genotyping does not take into account the phenotypic information and only uses the results from segregation analysis.
In dairy cattle breeding, it is necessary to develop more flexible selective genotyping methods for existing outbred populations. The results showed that the selective genotyping approaches could replace complete genotyping, with a minimal loss in genetic progress. In addition, the considered methods were quite simple and used the results from the previous MAS evaluation. With 20% genotyping, the total cumulative genetic level per year was not significantly different from 50% genotyping, but 20% genotyping was significantly lower than complete genotyping in yr 11 to 16. With a genotyping level of 50% (20%), 95% (89%) of the maximum response at the QTL was obtained in yr 13.
The MAS schemes in dairy cattle in France and Germany, which are based on within family selection, have been described in the literature (Weller, 2007). These schemes use linkage information and need to genotype either all young bulls and bull dams or all sires and their daughters in the German and French breeding plans, respectively. The use of LD to locate genes that affect quantitative traits has increased recently (Meuwissen and Goddard, 2004; Uleberg and Meuwissen, 2007). The classical linkage and LD analysis are complementary, and combined methods could simultaneously use linkage and LD information. Therefore, to increase the accuracy of the selection based on fine-mapped QTL and decrease genotyping costs in breeding programs, it is necessary to develop MAS across families in a population using selective genotyping methods. This study indicated that one method to decrease genotyping costs in a MAS scheme across families was genotyping the candidates close to the truncation point, when QTL effect estimation was based on both LD and linkage analysis information. Therefore, the markers that are close enough to the causative mutation and have consistent associations across families can be used in animal breeding programs (Dekkers, 2004). In addition, selection across families does not require any specific family structure (Dekkers, 2004). However, the benefits of implementing marker-linked QTL, using selective genotyping or in general for MAS, depend on the net result of extra costs, savings, and amount of genetic progress of the chosen breeding program (Schrooten et al., 2005).
Another benefit of MAS schemes over BLUP in this study was a reduction in the rate of inbreeding, as using additional marker information allows selection of high-ranking animals within families due to reduced covariances between EBV of genotyped animals within families. In a practical breeding scheme, restricting
f can be achieved through selecting parents from more families [e.g., optimum contribution selection for MAS (Villanueva et al., 2002)]. Potentially the attempts to restrict
f in MAS is expected to increase genetic progress as has been shown for BLUP selection (Villanueva et al., 2002).
The markers in this study were assumed in LD with the QTL. Additionally, the developed genotyping criteria used LD information to find the most informative animals for genotyping across families. Using dense markers in LD with a QTL (as a haplotype), such that the marker QTL associations persist across families in the population, makes the implementation of MAS much simpler, and the linkage phase for each sire family does not have to be established (Hayes and Goddard, 2003). Exceptional marker information content can be obtained using a dense marker map of the QTL region. Therefore, the haplotypes with the strongest link to the quantitative trait can be detected. Consequently, the most valuable haplotypes are determined not only by proximity to the QTL, but also by the relative information content and degree of linkage disequilibrium between the markers and QTL alleles.
In this study, LD between markers and QTL was simulated in the base populations. In the first years of selection based on marker information, the frequency of the favorable QTL allele was low (
0.10). In MAS schemes, the breeding animals are selected based on estimated effects of the QTL and polygenic effects, and therefore during the first years of selection, this could result in lower polygenic gain in MAS procedures in comparison to BLUP selection, where selection depends on EBV not directly accounting for the QTL. Consequently, total genetic gain was higher with MA-BLUP compared with BLUP evaluation.
| CONCLUSIONS |
|---|
|
|
|---|
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
Received for publication August 14, 2007. Accepted for publication December 4, 2007.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
L. D. Pedersen, A. C. Sorensen, and P. Berg Marker-assisted selection can reduce true as well as pedigree-estimated inbreeding J Dairy Sci, May 1, 2009; 92(5): 2214 - 2223. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |