|
|
||||||||
Interbull Centre, Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, S-750 07, Uppsala, Sweden
Corresponding author: W. F. Fikse; e-mail: Freddy.Fikse{at}hgen.slu.se.
| ABSTRACT |
|---|
|
|
|---|
Key Words: simulation borderless evaluation multiple-trait across-country evaluation
Abbreviation key: GAMcou = international evaluation on country basis using performance records, GAMenv = international evaluation on production environment basis using performance records, MACE = multiple-trait across-country evaluation, NAMcou = national evaluation using performance records, PBV = predicted breeding value, RMSE = root mean square error, TBV = true breeding value
| INTRODUCTION |
|---|
|
|
|---|
Weigel and Rekaya (2000) and Zwald et al. (2001), both working with field data, clustered herds across country borders according to information on production systems. The lowest genetic correlation between performance in production environments was 0.81 (Weigel and Rekaya, 2000) and 0.59 (Zwald et al., 2001), indicating the presence of genotype x environment interaction. Furthermore, the number of production environments was smaller than the number of countries, and boundaries of production environments and countries did not overlap perfectly.
Consequences of similar situations as described before for the optimum design of breeding programs and genetic improvement were investigated by Banos and Smith (1991) and Lohuis and Dekkers (1998), using deterministic simulations. However, little is known about the gain in precision of predicted breeding values (PBV) when international genetic evaluations are borderless. The aim of this study was to investigate gain in precision and accuracy of estimated variance components and predicted genetic merit when individual performance records can be used instead of national breeding values and when international evaluation is across production environment rather than country.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Simulation of Data
The breeding program was a conventional progeny test scheme. Countries A and C were of equal size and comprised 20,000 cows per generation. Country B had only one-half as many cows per generation. In each generation, 200 young bulls were progeny-tested in countries A and C and 100 bulls were progeny-tested in country B, of which the best 10 and 5%, respectively, were used as proven bulls and sire of sons in the next generation. Progeny group size for young bulls was normally distributed around a mean of 80 (SD = 5). Proven bulls that had passed the progeny test received an additional 200 progeny in the next generation. One hundred elite dams in countries A and C and 50 elite dams in country B were selected as dams of sons, and each elite dam was mated to one sire of sons to produce 2 full brothers. Each cow was mated to either a proven or a young bull to produce one female offspring for the next generation. Mating of selected animals was random.
Selection of animals to produce the next generation was done within country, regardless of production environment. The criterion for selection was national breeding values predicted within country. Sires of sons and proven bulls that had been selected within country were exchanged from generation 3 forward. Country B imported 30 and 20% male genetic material from countries A and C, respectively. Each of countries A and C imported 25% male genetic material from the other.
Two different distributions of environments over countries were considered (Figure 1
). In the first situation (balanced), 50% of the observations in each country were made in each production environment. In the second situation (unbalanced), 90% of the observations in countries A and B and 10% of the observations in country C were made in the first production environment.
|
Observations were generated as the sum of a contemporary group effect, the TBV and a residual. Contemporary group size was, on average, 20 (SD = 5) with a minimum size of 5. Contemporary group effects were drawn from a normal distribution with mean zero and variance equal to 10% of the total phenotypic variance. Contemporary groups were nested within country and production environment. Cows were assigned to contemporary groups such that daughter-dam pairs were likely to produce in the same production environment.
Breeding values were predicted for each country separately in each generation after observations had been generated. A model was used that included fixed contemporary and genetic group effects as well as a random animal effect. Genetic groups for phantom parents (of base animals and imported animals) were formed on the bases of country of origin and generation of birth. Simulated variance components were used in the national genetic evaluation. The mixed model equations for breeding value prediction were set up according to the implicit representation method, which reduces memory requirements by taking advantage of the occurrence of repeated blocks in the equation system (Tier and Graser, 1991). The mixed model equations were solved with a pre-condition conjugate gradient algorithm (Strandén and Lidauer, 1999). The pre-conditioner matrix was block diagonal, and, for fixed effects, the complete diagonal block of the left hand side of the mixed models equations was used. For animals, the diagonal blocks pertaining to all equations of an animal were used. Solutions of the mixed model equations were assumed converged when the average relative difference between consecutive rounds was <106.
International Genetic Evaluation
At the end of the last generation, the complete set of generated data was used to compare 3 methods for international genetic evaluations:
Multiple-trait across-country evaluation.
A 2-step procedure was used to obtain breeding values with a multiple-trait across-country evaluation (MACE) (Figure 2
). First, national breeding values were predicted separately for all 3 countries. Next, national breeding values for sires were de-regressed and used in an across-country evaluation that treated performance in each country as separate traits.
|
![]() |
where yij is the observation on animal j in contemporary group i; cgi is the fixed effect of contemporary group i; aj is the random additive genetic effect of animal j (
, where A is the additive genetic relationship matrix for all animals based on sire and dam relationships, and
is the additive genetic variance); qjm is the contribution of group m to the genetic merit of animal j; gm is the fixed effect of group m; M is the number of groups; and eij is the random residual (
, where
is the residual variance). The mixed model equations were solved with the pre-condition conjugate gradient algorithm described previously.
Estimated variance components were used and were obtained with a similar model as for prediction of breeding values, except that sire instead of animal was used as the genetic effect. A sire model was used for estimation of variance components for computational reasons. The AI-REML implementation in the DMU software package (Jensen et al., 1997) was used. Estimates of variance components were assumed converged when the norm of the gradient of the parameter vector was <104.
De-regressed national breeding values for sires were analyzed with a multiple-trait model containing fixed country mean effects and random genetic group and sire effects (Schaeffer, 1994):
![]() |
where zij is the de-regressed national breeding value for sire j in county i; µi is the fixed effect of country i; gim is the random effect of group m in country i (g
N(0,I
Gs), where Gs contains the sire variances and covariances among countries); sij is the random additive genetic effect of sire j in country i (s
N(0,As
Gs), where As is the additive genetic relationship matrix for all bulls based on sire and maternal grandsire relationships); and
ij is the random residual (
, where Di is a diagonal matrix with weighting factors (Fikse and Banos, 2001) as elements, and
is the residual variance for country i). Genetic groups for unknown parents were defined according to country of origin, generation, and path of selection. Solutions to the MACE mixed model equations were obtained with a Gauss-Seidel iteration on data algorithm.
National breeding values were de-regressed within country (Jairath et al., 1998). Sire variances were estimated within country with an EM-REML algorithm (Sullivan, 1999). Genetic correlations were also estimated with EM-REML, using a well-connected subset of the data (Sigurdsson et al., 1996). This subset contained data from full-sib families whose members had daughters in multiple countries, similar to the current Interbull practice.
Breeding values for cows on country scales other than the country of origin were computed using conversion equations. Conversion equations were computed from MACE breeding values as a linear regression of international breeding values in the importing country on breeding values in the exporting country. The MACE breeding values for bulls that originated in the exporting country were used to estimate regression coefficients.
International evaluation on country basis using performance records.
Performance records were analyzed with a 3-trait model, where countries were treated as separate, although correlated, traits:
![]() |
where yijk is the observation on animal j in country k and contemporary group i; ajk is the random additive genetic effect of animal j in country k (a
N(0,A
Gc), where Gc contains the genetic variances and covariances among countries); and eijk is the random residual (
, where
is the residual variance for country k). Variance components were estimated from the data with a similar model, except sthat sire was used as random effect instead of animal. The procedures to estimate variance components and predict breeding values were the same as for the national genetic evaluation.
International evaluation on production environment basis using performance records.
Performance records were analyzed with a bi-variate model, where production environments were treated as separate, although correlated, traits:
![]() |
where yijl is the observation on animal j in production environment l and contemporary group i; ajl is the random additive genetic effect of animal j in production environment l (a
N(0,A
Gp), where Gp contains the genetic variances and covariances among production environments); and eijl is the random residual (
, where
is the residual variance for production environment l). Estimated variance components and PBV were obtained in a similar way as for the international evaluation on country basis using performance records (GAMcou).
Comparisons and Statistics
Bias and root mean square error (RMSE) for estimated variance components were computed for all 3 international genetic evaluation methods. True breeding values were simulated for each production environment; thus, variance components and TBV for each country were not available and had to be calculated (Table 1
). True breeding values specific to each country were obtained by weighting TBV for each production environment by the proportion of herds belonging to this production environment. The true genetic variance for each country and the true genetic correlations between countries were computed as the variance (x2) and correlation between Mendelian sampling deviations for each country. Mendelian sampling deviations on each country scale were for each animal calculated as the difference between the TBV and the parent average.
|
Lastly, the proportion of selected animals that were in common among the top 25 based on MACE, GAMcou, or GAMenv international breeding values and the top 25 based on TBV was examined.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
|
Genetic variance of production environments was underestimated by 12% (Table 3
), but estimates were less variable compared with GAMcou estimates of genetic variance (mean SD of estimates from 15 replicates was 7.5 and 9.8 for GAMenv and GAMcou, respectively). As a result, RMSE was higher for GAMenv estimates of genetic variance for production environments compared with GAMcou estimates for genetic variance for countries. The effect of the level of simulated genetic correlation between production environments was not as large for GAMenv as for GAMcou, especially for the balanced situation.
|
Genetic correlations.
Genetic correlations between performance in different countries were consistently underestimated with GAMcou (Table 4
). The magnitude of the bias in estimated genetic correlations between performance in different countries was higher for MACE than for GAMcou in most cases (Table 4
). The bias of genetic correlations estimated with MACE was always negative for the balanced situation, but, for the unbalanced situation, bias was negative only for the correlation between performance in countries A and B.
|
The true genetic correlation between countries for the balanced situation and the true genetic correlation between countries A and B (rGA,B) for the unbalanced situation was equal to unity by design. The upper bound of the parameter space for correlations is unity; hence, the distribution of the estimator will be skewed, and a downward bias was to be expected. The true values for rGA,C and rGB,C were different from unity for the unbalanced situations with genetic correlation between production environments (Table 1
). Gates et al. (1999) reported negative bias in genetic correlation estimates when populations were selected and sires and management groups were confounded. The same phenomenon could be behind the underestimation of rGA,C and rGB,C with GAMcou, as ties between countries were weak as indicated by the low number of bulls (n = 27) with daughters in countries B or C and at least one other country.
Genetic correlations between production environments estimated with GAMenv were biased downward (Table 5
). The bias was larger when the true genetic correlation between production environments was 0.70. Bias and RMSE of estimated genetic correlations between production environments were smaller than for estimated genetic correlations between countries.
|
Breeding Values
International breeding values of males predicted with GAMenv had the lowest RMSE, whether they were expressed on country scales (Table 6
) or production environment scales (Table 7
). Predicted breeding values were largely unbiased, except for the unbalanced situation with low correlation between production environments, and differences in RMSE were mainly due to differences in precision.
|
|
When the interest was to rank animals in production environments, breeding values predicted with GAMenv were more accurate than with MACE and GAMcou, especially for low values of the genetic correlation between production environments (Table 7
). Both MACE and GAMcou performed similarly, and access to performance records (GAMcou) alone was not an advantage over meta-analysis of national breeding values (MACE). The larger RMSE for the production environment 2 in both unbalanced situations reflects the lower number of animals with observations in this production environment.
The correlation between TBV, either expressed on country or production environment scale, was high (>0.90) for 3 of the 4 situations, which may explain the small differences in RMSE. For the unbalanced situation with low correlation between production environments, TBV on country and production environment scale were less well correlated, and the RMSE differed more between the international evaluation methods.
Levels of RMSE of breeding values for females were similar for the 3 methods except for the unbalanced situation with low correlation between production environments (results not shown). The observed patterns followed closely those for males described previously. Breeding values for females can be decomposed into a regression of the contemporary group deviation on the parent average. The magnitude of regression was similar in all 3 models because of the similarity of heritabilities in the 3 models. Variation in parent average was largely determined by sire breeding values because of their higher reliability, explaining why observations about RMSE for males and females were alike.
Rankings
The success of each method in identifying the top bulls differed (Figures 3
and 4
), despite the resemblance of RMSE of PBV for males among all 3 methods. Sire rankings on country bases mainly differed for the unbalanced situations (Figure 3
); however, sire rankings on production environment scales mainly differed for the situations with low correlation between production environments (Figure 4
). The TBV of the 25 best sires for GAMenv was, on average, between 0.01 and 0.25 genetic standard deviation units higher compared with the other 2 methods, depending on the situation.
|
|
The same genetic parameters were assumed in both production environments in the present study, a condition that is not likely met in practice. For example, residual variances and heritabilities have been reported to differ between herds with high and low production level (e.g., Hill et al., 1983; Meuwissen et al., 1996). The use of an average heritability in such cases may over- or underestimate the genetic trend in diverse environments, causing bias in PBV. In addition, the degree of regression of contemporary group deviations toward the parent average will be affected, which may lead to under- or overdispersed PBV and, consequently, to suboptimal rankings of animals.
For the international evaluation across production environment, it was assumed known which production environment each herd belonged to. In practice, however, statistical techniques have to be used for classification of herds, and uncertainty is associated with such procedures. Misclassification of herds probably decreases the differences between the across-production environment evaluation and the other 2 methods, but the extent should be evaluated in a sensitivity study.
The statistical models for national and international genetic evaluations were the same in this study. However, in practice, national evaluation models vary widely across country (Interbull, 2000), which would affect estimates of the genetic correlation between countries computed with MACE (Emanuelson et al., 1999). Lohuis and Dekkers (1998) showed that standardization of data collection and genetic evaluation models across countries could lead to increased genetic gains.
Using MACE, national PBV are jointly analyzed with a sire model, and effects of selection and non-random mating are only partially accounted for. As mating between animals was random in the present study, an important advantage of using an animal instead of a sire model for prediction of breeding values, the possibility to account for non-random mating (e.g., Hudson and Schaeffer, 1984), was not exploited. Evidence of non-random mating for milk production and conformation traits have been reported (Hudson and Schaeffer, 1984), and MACE could be expected to perform suboptimally under such conditions. Effects of non-random mating are at least partly accounted for in meta-analysis of national breeding values, as these breeding values are commonly predicted with an animal model (Interbull, 2000). However, in many cases, limited information is available about ancestors of imported bulls, restricting the possibility of the animal model to account for non-random mating. Furthermore, joint analysis of national breeding values using MACE considers only information on male relatives.
| CONCLUSIONS |
|---|
|
|
|---|
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
Received for publication August 6, 2002. Accepted for publication February 3, 2004.
| REFERENCES |
|---|
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |