
* Bavarian State Research Center for Agriculture, Institute of Animal Breeding, D-85580 Grub, Germany
Christian-Albrechts-University, Institute of Animal Breeding and Husbandry, D-24098 Kiel, Germany
1 Corresponding author: Stefan.Neuner{at}LfL.bayern.de
| ABSTRACT |
|---|
10%). A useful strategy for practical implementation is to estimate variance components in DYD models and breeding values in DYD-YD models.
Key Words: quantitative trait loci variance component, accuracy of estimated breeding value, marker-assisted best linear unbiased prediction
| INTRODUCTION |
|---|
In contrast to the assumption of nucleus breeding programs in other studies about MAS (Meuwissen and van Arendonk, 1992; Ruane and Colleau, 1996), this study assumed that MAS is added to an existing breeding program. Data from fine-mapping experiments and practical applications of MAS in cattle demonstrate that usually only a small part of the overall population is genotyped. Because only these genotyped animals provide information for QTL-specific evaluations, a 2-step approach has been developed and used to estimate QTL variance components and MA-BLUP EBV (Bennewitz et al., 2004b; Liu et al., 2004; Druet et al., 2006). In the 2-step approach, first, a standard animal model evaluation for the entire population is performed to estimate precorrected phenotypes such as yield deviations (YD) and daughter yield deviations (DYD; VanRaden and Wiggans, 1991). These estimates are then used as observations in an MA-BLUP model for the genotyped animals in the second step. The first implementations of this approach in practical MAS (Liu et al., 2004) have used only information for bulls. Consequently, no information about the EBV of the bull dam contributes to the MA-BLUP EBV of candidates for selection, which are the main focus of MAS breeding programs. This raises the question of how strongly the MA-BLUP EBV for selection candidates are affected if the dam information is not included in the evaluation model.
This study examines the combined use of DYD of bulls and YD of cows (bull dams) as observations in MA-BLUP models. Various models were examined to determine the best combination of aggregated phenotypic information (DYD, YD) and weighting factors for the estimation of QTL variance components and MA-BLUP EBV. Weighting factors are applied to account for the differing amounts of information available for the calculation of DYD. Weighting factors commonly used are the variance of DYD (Thaller et al., 2003; Bennewitz et al., 2004a), effective daughter contributions (EDC; Fikse and Banos, 2001), and daughter equivalents (DE; VanRaden and Wiggans, 1991; Druet et al., 2006).
In some situations, the overall accuracies of MA-BLUP EBV may not be superior to conventional BLUP, because the QTL effects are not large enough. Even in these cases, there will be variance within families attributable to markers, if at least one of the parents is heterozygous at the QTL. As an example for these situations, the benefit of using DYD and YD together in MA-BLUP models for genetic evaluations is shown for the selection among paternal half-sibs inheriting alternative QTL alleles.
| MATERIALS AND METHODS |
|---|
Data Generation
In the simulation, a conventional dairy cattle breeding scheme with progeny testing, overlapping generations, and use of proven bulls as sires of second crop daughters was assumed to generate the data. The time horizon for the simulation was 16 yr, which is approximately equal to the time span for availability and collection of genotypic data in real research projects. This period consisted of 2 sections. First, a base population was created, and animals were mated randomly to provide a homogeneous population. Subsequently, a breeding program with random selection for the simulated trait was applied to generate data sets. The underlying breeding program is shown in Figure 1. Continuous herd replacement was accounted for, as well as loops and restrictions for the service life of bulls. Parameters of the simulated population and base parameters of the progeny-testing program are shown in Table 1.
Analysis of Simulated Data Sets
Quantitative trait loci variance components and MA-BLUP EBV were estimated in a 2-step approach. In the first step, a classical polygenic animal model (AM) evaluation assuming the true variance components to be known was conducted for the entire population to estimate DYD for progeny-tested bulls and YD for cows. Observations in this step were phenotypic records of cows. In typical real-life situations, all animals are included in genetic evaluations of dairy cattle, but only a small fraction of animals might be genotyped at genetic markers. These animals are most likely proven bulls, bull dams, and selection candidates for progeny testing. Because only genotyped animals can provide information for the estimation of QTL variance components and MA-BLUP EBV, the second evaluation step was only applied to this genotyped subset of the population. For the current study, we assumed that marker information was available for all animals in the MA-BLUP pedigree and known without error. Observations used in step 2 were twice the DYD and YD calculated in step 1. Twice the DYD was used as phenotypic information of bulls in all MA-BLUP models in the current study, because twice the DYD contain the complete additive genetic variance, and estimates of different models can be compared directly.
The pedigree used for MA-BLUP evaluations contains all progeny-tested bulls, waiting bulls, young bulls currently used, and young bull candidates for the subsequent years. All these animals are included with 3 generations of ancestors. The number of animals for MA-BLUP calculations was therefore about 5,500 individuals, consisting of 1,200 proven bulls with DYD, 3,800 cows with YD, and 500 young bulls.
The MA-BLUP model of Fernando and Grossman (1989) was used for evaluations:
$$y_i = u_i + v_i^p + v_i^m + e_i$$

where $y_i$ = the record (YD for dams and twice the DYD for sires) of individual $i$; $u_i$ = the residual polygenic effect of individual $i$; $v_i^p$ and $v_i^m$ = the paternal and maternal gametic effects of individual $i$; and $e_i$ = the residual. Gametic effects were included in the evaluations in terms of the identical-by-descent matrix. The identical-by-descent matrices were calculated following the algorithm of Abdel-Azim and Freeman (2001).
For estimating AM EBV, DYD, and YD, the package MIX99 (Vuori et al., 2006) was used. Parameters for MA-BLUP models and MA-BLUP EBV were estimated with the ASREML package (Gilmour et al., 1995).
In addition to the AM and MA-BLUP models, EBV were calculated using a classical AM for the decreased data set. This model had only 1 predictor for the overall animal effect, and it is henceforth denoted as AM on MA-BLUP records (AM-MA). For the AM-MA and the MA-BLUP models, various combinations of phenotypic information (DYD, YD) and weighting factors were considered. The evaluations were divided into blocks A and B, which are characterized by the phenotypic information used. Block A is similar to the German Holstein MA-BLUP system (Liu et al., 2004), in which only DYD (DYD models) are used, whereas in block B, DYD and YD (DYD-YD model) are used together, similar to the French MA-BLUP system (Boichard et al., 2002). Within the blocks, different weighting factors, as described in the literature, were applied to DYD: no weighting, variance of DYD (Bennewitz et al., 2004a), EDC (Fikse and Banos, 2001; Liu et al., 2004; Szyda et al., 2005), and DE (VanRaden and Wiggans, 1991; Druet et al., 2006). Yield deviations were not weighted, because each cow had only 1 record in the current study (T. Druet, Station de Génétique Quantitative et Appliquée, Institut National de la Recherche Agronomique, Jouy-en-Josas, France, personal communication). Table 2 summarizes the different variants.
An additional weight $w_{\mathrm{add}}$ (see Appendix) was derived for DYD to combine DYD and YD in a single evaluation model by accounting for the different amounts of genetic and residual variances that DYD and YD contain. Results for using the additional weight $w_{\mathrm{add}}$ are shown for a QVR of 0.30. The analysis of benefits of MA-BLUP for the selection among paternal half-sibs was done for a simulated data set with a QVR of 0.10. Progeny of heterozygous sires were divided into 2 groups: those inheriting the favorable allele 1 and those inheriting the unfavorable allele 0. Then their simulated BV, their AM EBV, and MA-BLUP EBV were compared.
| RESULTS |
|---|
In the DYD model (block A), only the phenotypic variance changed toward the simulation parameter if the additional weight $w_{\mathrm{add}}$ was applied. Using $w_{\mathrm{add}}$, it was possible to estimate the ordinary phenotypic variance as expected. When $w_{\mathrm{add}}$ was included in the weighting factor for the DYD-YD model, the additive genetic variance was no longer overestimated, and the estimates for the genetic variances were similar to the findings for DYD models.
to DYD for the estimation of MA-BLUP EBV. The results for a QVR of 0.30 are presented in Table 6
was applied or not, and accuracies increased only slightly in block B.
was not applied. Otherwise, the variances of the EBV were in better agreement with the expectations for observed accuracies in Tables 5
If only progeny of heterozygous sires were considered, it was found that progeny inheriting the favorable QTL allele had on average 216 kg greater true BV. In MA-BLUP models their EBV were 30 and 46 kg higher without and with the use of YD, respectively. In other words, the use of YD allowed for a clearer distinction between progeny that inherited different QTL alleles from their sire. As expected, no differences were found among the progeny of nonsegregating sires.
| DISCUSSION |
|---|
A precondition for MAS is the ability to trace the transmission of QTL alleles through the pedigree. In practical research, multiple markers surrounding a QTL make it possible to infer the transmission of QTL alleles with great precision if the data are complete and polymorphic markers are chosen. In our study, only a single highly polymorphic marker was used to mimic a multiple-marker situation to decrease computational costs for data simulation and MA-BLUP evaluations. Such a marker is unlikely to exist in real life, however. To affirm the assumption that a single marker can be used to mimic multiple markers, the MPIC (Rijsdijk and Sham, 2002) for a situation of 4 equally spaced markers that bracket a QTL was calculated. Each marker had 5 alleles, and the distance between the markers was 4 cM. The MPIC for a QTL position centered between the 2 markers in the middle of the marker bracket was 0.912. Using our parameters of the single marker, the corresponding MPIC at the QTL position was 0.914.
In most cases, practical applications of MA-BLUP are based on a 2-step approach as described by Liu et al. (2004) and Druet et al. (2006). Up to now, EDC (Liu et al., 2004) and DE (Druet et al., 2006) have been used as weighting factors for MA-BLUP evaluations. One focus of the current study was to appropriately weight the daughter information in DYD and DYD-YD models. Our results show that only correct weights accounting for the varying amount of information available for the phenotypes YD and DYD ensure unbiased estimates of variance components and, therefore, high accuracies of MA-BLUP EBV. With this respect, it seems that the correct choice of the weighting factor is more important when estimating variance components, to obtain the correct estimates, than for the estimation of MA-BLUP EBV. By setting up the mixed model equations for BLUP, one can see that the correct ratios of the genetic variance components to each other and to the residual variance component are more important than to their absolute values, especially if accuracies are considered. Greatest accuracies are always obtained if the estimated variance components and ratios are closest to the simulated parameters.
Weighting is especially difficult for variance component estimation in DYD-YD models, because the scales of the 2 types of information are not identical. We showed that the estimates obtained from DYD-YD models can be substantially improved by using the additional weight $w_{\mathrm{add}}$. Otherwise, a considerable overestimation of the total additive genetic variance occurs. As a consequence, we strongly recommend correcting for the different amounts of genetic and residual variances when combining DYD and YD for variance component estimation. If variance component estimates from DYD-YD MA-BLUP models without $w_{\mathrm{add}}$ are further used for MA-BLUP DYD-YD evaluations, the variances of the EBV will be overestimated.
To validate this observation, the expected variance of the EBV can be calculated by multiplying the simulated additive genetic variance by the squared observed accuracies. Deviations between the expected variances and the observed values with correction were between 0% for bulls and 1% for young bulls. If the correction was not applied, the variance of EBV for young bulls was too large. For illustration, the expected variance of EBV for young bulls and a QVR of 0.30 was 102,253 kg² (0.627² × 260,100 kg²), but the observed value was 116,434 kg² (Table 7). This result corresponds to an overestimation of 14%.
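The check described above is plain arithmetic and can be reproduced in a few lines. The following Python sketch uses only the figures quoted in the text (accuracy 0.627, simulated additive genetic variance 260,100 kg², observed EBV variance 116,434 kg²); the function and variable names are illustrative and do not come from the original analysis.

```python
def expected_ebv_variance(accuracy, sigma2_a):
    """Expected variance of EBV = squared accuracy times additive genetic variance."""
    return accuracy ** 2 * sigma2_a


# Figures quoted in the text for young bulls and a QVR of 0.30.
ACCURACY_YOUNG_BULLS = 0.627
SIGMA2_A = 260_100          # simulated additive genetic variance, kg^2
OBSERVED_VARIANCE = 116_434  # observed EBV variance without the correction, kg^2

expected = expected_ebv_variance(ACCURACY_YOUNG_BULLS, SIGMA2_A)
overestimation = OBSERVED_VARIANCE / expected - 1.0

print(f"expected variance: {expected:,.0f} kg^2")
print(f"overestimation:    {overestimation:.1%}")
```

The same function can be applied to any animal group by swapping in its observed accuracy.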
Another advantage of EBV calculated in DYD-YD models using $w_{\mathrm{add}}$ is that they can directly be compared with EBV from an AM without additional postprocessing (standardization) of the EBV. This is important in practical applications, because the subset of results from MA-BLUP EBV must be comparable to all other EBV from the AM.
Applying 2-step approaches for MA-BLUP models always causes a loss of information. Because only a small fraction of the population is included in the analysis, several relationships among animals are not accounted for in MA-BLUP data sets. The information content for proven bulls decreases only marginally, because DYD are estimated very accurately from many daughters and parents contribute only marginally to their EBV. For cows, only (male) relationships and, in the case of DYD-YD models, their own corrected phenotypes are taken as sources of information. As a consequence, the accuracies of cows were decreased from 0.742 to 0.466 in AM-MA without YD, and the use of MA-BLUP could only compensate for a very small part of this. If YD were included in the model, the accuracy for cows again increased up to 0.704. In consequence, the loss of information due to the 2-step approach and to missing phenotypes of bull dams in DYD models has to be overcome by an additional source of information: QTL information. Our results show that with an appropriate model, the compensation of the loss of accuracy requires a QTL explaining at least 10% of the genetic variance of the trait. Analyses of MAS applied to practical breeding programs describe the increase in accuracies of young bulls (Liu et al., 2004; Druet et al., 2005). Liu et al. (2004) described the increase in accuracy of young German Holstein bulls if 2 QTL were included as random effects and DGAT1 (Grisart et al., 2002; Winter et al., 2002) as a fixed effect in MA-BLUP evaluation. Correlations increased from 0.45 in the AM-MA to 0.65 in the MA-BLUP model, mainly due to DGAT1. However, more important than comparing results of AM-MA and MA-BLUP models is the superiority of accuracies from MA-BLUP models over traditional AM. Druet et al. (2005) investigated this for the French MAS program. In the French MAS system, 40 to 50% of the variance for all milk traits is explained by 3 to 5 QTL. Accuracies for milk yield EBV of young bulls increased from 0.47 to 0.55. Results of our analysis for a 40% QTL showed an increase from 0.58 to 0.68. These differences can be explained by different heritabilities and different designs for the French breeding program and the one assumed in our study. In addition, the very favorable properties of the QTL and the markers in our study lead to a higher level of accuracies.
Further investigations are necessary regarding the phenotypic information used in DYD-YD MA-BLUP models. Daughter YD were not corrected for bull dams having a YD in DYD-YD models to avoid double counting. Because our analyses were based on random selection, and because each bull had at least 70 daughters with records, the effect of double counting was assumed to be small. Even more gain is expected if the YD of a bull dam and information of its parents and progeny is combined as phenotypic information for bull dams. Another issue that has to be noted in the context of phenotypic information used in DYD-YD MA-BLUP is the effect of preferential treatment of bull dams on MA-BLUP EBV. If there is any preferential treatment of bull dams, this problem arises in both conventional AM evaluations and MA-BLUP DYD-YD models. One approach to decrease this problem is the consideration of heterogeneous variances in the AM evaluations, which should lead to less biased YD for bull dams in MA-BLUP. In case of a preferential treatment, the superiority of DYD-YD models over DYD models will decrease. According to our results, it seems to be preferable to accept a small possible bias rather than discard the phenotypic information of bull dams completely.
The general choice of whether DYD or DYD-YD models are to be preferred depends on the intention of the research. In fine-mapping of QTL, one is especially interested in correct estimates of variance components, whereas MAS aims at an increase in accuracies for MA-BLUP BV of animals without their own phenotypic or progeny information. Therefore, DYD models should be chosen for the estimation of variance components if it is not possible to derive the correct weighting factors to combine DYD and YD correctly. One way to derive the weighting factors for the assumptions made in this study is shown in the appendix. The importance of correct parameter estimates explained by a QTL for MAS was shown by Ruane and Colleau (1996) and Spelman and van Arendonk (1997). They calculated the genetic gain with correct parameter estimates for the selection for a nonexistent QTL and for the selection with an overestimated QTL variance. Both situations of incorrect parameter estimates resulted in less genetic gain compared with the results for correct parameter estimates. To achieve an increase in accuracies for MA-EBV of animals without their own phenotypic or progeny information, MA-BLUP models for MAS should include both DYD and YD to ensure the greatest possible efficiency of selection.
In addition to our empirical results, the benefit of using YD of dams can be shown by a simple calculation. The accuracy of the EBV of a young bull candidate ($r_{YB}$) is

$$r_{YB} = \frac{1}{2}\sqrt{r_{Sire}^2 + r_{Dam}^2}.$$

A realistic accuracy for a dam EBV ($r_{Dam}$) without considering its own YD in the AM-MA model is 0.466 (Table 5). By accounting for YD, this can be considerably increased to 0.704. Assuming an accuracy for the EBV of sires ($r_{Sire}$) of 0.910, $r_{YB}$ equals 0.575 with and 0.511 without the YD of the dam, respectively. This theoretical result is in line with the observed results in our study and indicates that YD should be included in MA-BLUP models.
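The two accuracies quoted above can be reproduced with a short calculation. The sketch below assumes the usual parent-average relationship, in which the accuracy of a young bull without own or progeny information is half the square root of the sum of the squared parental accuracies; that form is an assumption here, chosen because it returns exactly the 0.575 and 0.511 reported in the text. The function name is illustrative.

```python
import math


def young_bull_accuracy(r_sire, r_dam):
    """Accuracy of a parent-average EBV (assumed form: 0.5 * sqrt(r_sire^2 + r_dam^2))."""
    return 0.5 * math.sqrt(r_sire ** 2 + r_dam ** 2)


R_SIRE = 0.910            # accuracy of the sire EBV quoted in the text
R_DAM_WITH_YD = 0.704     # dam accuracy when her own YD is used
R_DAM_WITHOUT_YD = 0.466  # dam accuracy in AM-MA without YD

with_yd = young_bull_accuracy(R_SIRE, R_DAM_WITH_YD)
without_yd = young_bull_accuracy(R_SIRE, R_DAM_WITHOUT_YD)

print(f"r_YB with dam YD:    {with_yd:.3f}")
print(f"r_YB without dam YD: {without_yd:.3f}")
```

Because the sire accuracy is already high, the gain comes entirely from the dam term, which is why the dam's YD matters for selection candidates.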
According to our results, MA-BLUP is a useful tool for the selection among half-sibs in segregating families, even with small QTL effects. For example, the accuracies of young bulls for a QVR of 0.10 were 0.595 in the AM, 0.536 in the MA-BLUP DYD model, and 0.596 in the MA-BLUP DYD-YD model. If only the accuracies are considered, a substantial decrease is observed when moving from the AM to the MA-BLUP DYD model and only a small increase when comparing AM and the MA-BLUP DYD-YD model. However, an explicit analysis of segregating families shows a better differentiation between half-sibs, because the Mendelian sampling variance within families can be partially exploited, and correlations between EBV of relatives decrease (Meuwissen and van Arendonk, 1992). This is especially true for MA-BLUP DYD-YD models. The key requirement to achieve these gains is the ability to produce enough full or half-sib progeny to replace those precluded from progeny testing due to carrying unfavorable QTL alleles (Mackinnon and Georges, 1998).
This study considered only a single trait, whereas breeding goals in dairy cattle consist of a variety of traits. Although QTL have been mapped for many of these traits, in some cases explaining approximately 50% of the genetic variation of a particular trait (Grisart et al., 2002), the fraction of the variation of the overall breeding goal explained by QTL is likely to be moderate (Schrooten et al., 2005). Nevertheless, preliminary simulation studies for MAS (Schrooten et al., 2005) show encouraging results. Even when the QTL explained only 5% of the genetic variance, MAS had a considerable effect on the genetic response.
| CONCLUSIONS |
|---|
The MA-BLUP evaluations that do not make use of phenotypic data for bull dams will only give benefits in young bull selection when the QTL explain more than 30% of the additive genetic variance. Even in DYD-YD models, data are still incomplete compared with a conventional AM evaluation. To outweigh the loss of information caused by the 2-step approach, a QVR of approximately 10% is required. With respect to genetic progress, MA-BLUP should be applied even if QTL effects are small, as long as the costs are justified, because MA-BLUP can always improve the selection within segregating families. As a consequence of the results of this study, MA-BLUP models used to estimate EBV for MAS should include DYD and YD to ensure that MAS improves selection even for moderate QTL effects. Accounting for the different genetic and residual variances in DYD and YD by the additional weight $w_{\mathrm{add}}$ further improved the results for the estimation of variance components and EBV in DYD-YD MA-BLUP models.
| APPENDIX |
|---|
Because YD are only corrected for fixed effects (VanRaden and Wiggans, 1991), they carry full additive genetic variance ($\sigma^2_A$) and full residual variance ($\sigma^2_E$): $\mathrm{var}(YD) = \sigma^2_A + \sigma^2_E$. DYD are defined as a weighted average of corrected YD of all progeny of a sire, where the correction is for all fixed effects and the breeding values of the mates of a sire (VanRaden and Wiggans, 1991). Ignoring inbreeding, assuming that all mating partners of a sire are known, and only considering 1 record per progeny, the DYD of a bull can be calculated as

$$DYD = \frac{1}{n}\sum_{i=1}^{n}\left(YD_i - \tfrac{1}{2}\,EBV_{m_i}\right),$$

where $YD_i$ = the YD of progeny $i$; $EBV_{m_i}$ = the EBV of the dam of the progeny $i$; and $n$ = the number of progeny used for the calculation of the DYD. As a consequence, a DYD calculated under the assumptions made contains the genetic variance of half the BV of the father ($\sigma^2_U$) and a residual variance component ($\sigma^2_{E^*}$) that is a function of the Mendelian sampling variance ($\sigma^2_A/2$), the residual variance ($\sigma^2_E$), and the number of progeny ($n$) that were used for the calculation of the DYD:

$$\mathrm{var}(DYD) = \sigma^2_U + \sigma^2_{E^*} = \frac{1}{4}\sigma^2_A + \frac{\tfrac{1}{2}\sigma^2_A + \sigma^2_E}{n}$$

To put DYD and YD on the same scales, both should contain full $\sigma^2_A$ and full $\sigma^2_E$. Multiplication of DYD by the factor 2 leads to full $\sigma^2_A$ in DYD:

$$\mathrm{var}(2 \times DYD) = \sigma^2_A + \frac{2\sigma^2_A + 4\sigma^2_E}{n}$$

As can be seen, $\sigma^2_{E^*}$ of $2 \times DYD$ is definitely not equal to $\sigma^2_E$. The averaging effect of denominator $n$ is usually accounted for by applying weighting factors like DE or EDC that describe the amount of phenotypic information of the progeny that was used for their calculation.
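The variance decomposition of twice the DYD can be checked numerically. The following Python sketch is an illustration under stated assumptions, not part of the original analysis: it takes var(2 × DYD) to be the full additive genetic variance plus (2 × additive variance + 4 × residual variance) / n, as follows from the definitions in this appendix, uses hypothetical variances of 1 and 3 (a heritability of 0.25), and assumes that each dam's EBV equals her true breeding value so that the correction for the mate is exact. All function names are illustrative.

```python
import random
import statistics


def var_twice_dyd(sigma2_a, sigma2_e, n):
    """Analytic variance of 2*DYD: full sigma2_a plus an averaged residual part."""
    return sigma2_a + (2.0 * sigma2_a + 4.0 * sigma2_e) / n


def simulate_twice_dyd(sigma2_a, sigma2_e, n, n_sires, seed=1):
    """Empirical variance of 2*DYD over simulated sires with n daughters each."""
    rng = random.Random(seed)
    sd_a = sigma2_a ** 0.5
    sd_ms = (sigma2_a / 2.0) ** 0.5  # Mendelian sampling standard deviation
    sd_e = sigma2_e ** 0.5
    values = []
    for _ in range(n_sires):
        a_sire = rng.gauss(0.0, sd_a)
        total = 0.0
        for _ in range(n):
            a_dam = rng.gauss(0.0, sd_a)
            yd = (0.5 * a_sire + 0.5 * a_dam
                  + rng.gauss(0.0, sd_ms) + rng.gauss(0.0, sd_e))
            # Correct the daughter's YD for her dam (assumption: EBV == true BV).
            total += yd - 0.5 * a_dam
        values.append(2.0 * total / n)
    return statistics.variance(values)


# Hypothetical parameters: sigma2_a = 1, sigma2_e = 3, 25 daughters per sire.
analytic = var_twice_dyd(1.0, 3.0, 25)
simulated = simulate_twice_dyd(1.0, 3.0, 25, n_sires=4000)
print(f"analytic var(2*DYD):  {analytic:.3f}")
print(f"simulated var(2*DYD): {simulated:.3f}")
```

The residual part shrinks with n, which is the averaging effect that DE or EDC weights are meant to capture.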
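Additionally, $(2\sigma^2_A + 4\sigma^2_E)$ of twice the DYD and $\sigma^2_E$ of the YD have to be on the same scale. Therefore, we suggest an additional weighting factor $w_{\mathrm{add}}$. Equating $(2\sigma^2_A + 4\sigma^2_E)$ and $\sigma^2_E$ and introducing $w_{\mathrm{add}}$ leads to: $\sigma^2_E = w_{\mathrm{add}} \times (2\sigma^2_A + 4\sigma^2_E)$. Solving this equation for $w_{\mathrm{add}}$ gives us the additional weight for twice the DYD:

$$w_{\mathrm{add}} = \frac{\sigma^2_E}{2\sigma^2_A + 4\sigma^2_E}$$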
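Applying the weighting factor EDC, which is in our situation directly proportional to $n$ ($w \propto n$), and the additional weight $w_{\mathrm{add}}$ leads to:

$$\frac{2\sigma^2_A + 4\sigma^2_E}{n} \times n \times \frac{\sigma^2_E}{2\sigma^2_A + 4\sigma^2_E} = \sigma^2_E$$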
The calculation of $w_{\mathrm{add}}$ requires only the knowledge of variance components. These can easily be estimated from an animal model without QTL effects.
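As a minimal sketch of how the additional weight could be computed in practice, the Python snippet below evaluates the residual variance divided by (2 × additive genetic variance + 4 × residual variance) and combines it with the number of daughters. The additive genetic variance of 260,100 kg² is the simulated value quoted in the Discussion; the heritability of 0.25 used to obtain a residual variance is a hypothetical value chosen for illustration only, and the function names are not taken from the original software.

```python
def additional_weight(sigma2_a, sigma2_e):
    """Additional weight for twice the DYD: sigma2_e / (2*sigma2_a + 4*sigma2_e)."""
    return sigma2_e / (2.0 * sigma2_a + 4.0 * sigma2_e)


def combined_weight(n, sigma2_a, sigma2_e):
    """Total weight for twice the DYD when the daughter weight is taken as n."""
    return n * additional_weight(sigma2_a, sigma2_e)


SIGMA2_A = 260_100.0  # simulated additive genetic variance (kg^2), from the text
H2 = 0.25             # hypothetical heritability, for illustration only
SIGMA2_E = SIGMA2_A * (1.0 - H2) / H2

w_add = additional_weight(SIGMA2_A, SIGMA2_E)
print(f"additional weight: {w_add:.4f}")

# With the combined weight, the residual variance of 2*DYD is rescaled to sigma2_e.
n = 70
rescaled = (2.0 * SIGMA2_A + 4.0 * SIGMA2_E) / n * combined_weight(n, SIGMA2_A, SIGMA2_E)
print(f"rescaled residual variance: {rescaled:,.0f} kg^2")
```

Because the weight depends only on the ratio of the two variances, it can equally be written in terms of heritability as (1 - h²) / (4 - 2h²), which is always below 0.25.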
| ACKNOWLEDGEMENTS |
|---|
Received for publication January 29, 2008. Accepted for publication June 30, 2008.