|
|
||||||||


* Interbull Centre, Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, S-750 07, Uppsala, Sweden
Department of Animal and Dairy Science, University of Georgia, Athens 30602
Department of Dairy Science, University of Wisconsin, Madison 53706
| ABSTRACT |
|---|
|
|
|---|
Key Words: international genetic evaluation Guernsey lactation records
Abbreviation key: DIC = deviance information criterion, HC = multiple-trait herd cluster model, ST = single-trait model, SThet = single-trait model with heterogeneous residual variance across countries, MT = multiple-trait across country model, RR = reaction norm model (random regression)
| INTRODUCTION |
|---|
|
|
|---|
Access to individual performance records allows for international evaluations that can avoid some shortcomings of the use of national genetic evaluation results as input. Two procedures that consider variation in production environments between and within countries, and allow for more uniform processing of data, have recently been suggested for international genetic evaluation using performance records. The multiple-trait herd cluster model (Weigel and Rekaya, 2000) stratifies herds according to production environment based on information on environmental descriptors. Production in each herd cluster is treated as a genetically distinct trait in the genetic evaluation model. Reaction norms (Kolmodin et al., 2002), on the other hand, describes the phenotype as a function of environmental descriptors. The function that relates phenotype to environment is unique to each genotype. Hence, the response to changes in environments can vary from one genotype to the other.
Several reports have been presented on investigations of the feasibility of joint, multinational genetic evaluations. For example, a Nordic cooperation has been established with the aim of developing a joint evaluation of dairy cattle in four Nordic countries (Pedersen et al., 2001). A single-trait across-country model was adopted, but a multiple-trait across-country model was also investigated. A multiple-trait across-country model was used for a joint evaluation of milking speed records from Germany and Austria (Sprengel et al., 2001). Breeding values for only one trait (country) were published, though, since the genetic correlations between traits (countries) were high. The majority of studies on multinational genetic evaluations either assumed a single-trait across-country model or treated performance in different countries as separate, though correlated traits. However, little consideration has been given as to whether country boundaries appropriately defined production environment, and whether genotype x environment interaction was optimally accounted for.
Numerous models for international genetic evaluations that use performance records can be contemplated, however, emphasis on identifying the order of models with respect to their ability to make correct inferences has thus far been limited. The objective of this study was to estimate genetic parameters for the multiple-trait herd cluster model, the reaction norm model, and three models that treat country as production environment, using data on Guernsey cows from four countries. These five models were compared with regard to their goodness of fit and model complexity.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
Statistical Models
The comparison involved five models that differed in the way genotype x environment interaction was considered. The same three fixed effects were included in all five models, and these were herd-year-season of calving, heterosis class, and age at calving. Contemporary groups were made by clustering calvings within a herd based on the number of days between two consecutive calvings (Smith et al., 1991). Minimum contemporary group size was five, and the length of seasons was at most 7 mo. Breed designation codes of parents and grandparents were used to form heterosis classes, and resulted in four classes for animals with 100, 75, 50, and 25% of Guernsey genes, respectively.
Model ST.
The simplest model in this study was a single-trait sire model, with one sire variance and one residual variance common to the whole population. Also one set of solutions for levels of age at calving and heterosis class was estimated for the whole population:
![]() |
where y is the vector of lactation yields, fixed represents the vector of fixed effects for contemporary group, age at calving and heterosis class, sire is the vector of random sire effects, and e is the vector of residual effects.
Model SThet.
Lactation yields were analyzed with a model that in addition to a sire effect contained country-specific residual variances and fixed effects:
![]() |
where yi is the vector of lactation yields in country i (i = 1...4), fixedi is the vector of fixed effect nested within country i, and ei is the vector of residual effects for country i. Thus, for each country a separate set of estimates for the residual variance and for levels of age at calving and heterosis class was computed. One sire variance common to all countries was assumed, which means that a unity genetic correlation among countries was assumed.
Model MT.
Performance records from different countries were treated as genetically different traits in a conventional multiple-trait model. Genetic and environmental parameters were estimated specific to each country:
![]() |
where sirei is the vector of random sire effects for country i. The sire effects were assumed to be correlated among countries, but residuals were uncorrelated since each daughter produced in one country only.
Model HC.
Herds were clustered according to production environment, and lactation yields were subsequently analyzed with a multiple-trait sire model in which production in each cluster was treated as a separate trait:
![]() |
where yj is the vector of lactation yields in cluster j (j = 1...C), fixedj denotes the vector of fixed effects nested within cluster j, sirej is the vector of random sire effects for cluster j, and ej is the vector of residual effects for cluster j. This model yielded genetic parameters and fixed effects solutions specific to production environment rather than country. Herds were grouped into clusters with the K-means algorithm that minimized the within-cluster sums of squares (NAG, 2000). The distance was computed as the weighted squared difference for nine environmental descriptors. These environmental descriptors were found to be able to distinguish between production environments in the study by Fikse et al. (2003) on the same data. The weight given to each variable was based on both the genetic correlation between milk production in extreme environments (5 and 95% percentile values for herd averages of environmental descriptor) and the phenotypic correlation between herd averages for all environmental descriptors (Kinghorn, 1998). The values for genetic and phenotypic correlations were taken from the study by Fikse et al. (2003).
Model RR.
Reaction norms for sires were estimated as random regressions of lactation yields on an environmental descriptor:
![]() |
where sire(env) is a random regression on env, the herd average milk peak yield. The random regression included an intercept and linear term that were treated as genetically correlated traits. One set of estimates for fixed effects was computed for the whole population. Herd average for milk peak yield was chosen as environmental descriptor, since genotype x environment interaction was most pronounced when this variable was used as environmental descriptor in the study by Fikse et al. (2003). They compared 15 environmental descriptors for their potential to reveal genotype x environment interaction, based on the genetic correlation between milk production in "poor" and "good" environment according to each descriptor.
Implementation
A Bayesian approach via the Gibbs sampler was implemented to obtain posterior distributions of the model parameters. Bounded uniform priors were used for the fixed effects. For sire breeding values a multivariate normal prior distribution was used. The prior distributions for the residual variances for all five models and the sire variance in models ST and SThet were bounded uniform distributions. For the sire covariance matrix a conjugate prior with two degrees of belief was used leading to weakly informative prior information. A single chain with a length of 150,000 was run for all five models. Burn-in and thinning parameters were determined for each chain separately with the Gibanal program (VanKaam, 1998).
Model Comparison
Support for each model was determined with the Bayesian Deviance Information Criterion (DIC). This criterion combines both goodness of fit and model complexity. It was computed as:
![]() |
where
is the posterior expectation of the Bayesian deviance. The Bayesian deviance is computed as:
![]() |
pD is the effective number of parameters,
![]() |
where
is the posterior mean of parameters in the model. For computational details see Spiegelhalter et al. (1998).
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
|
|
Posterior mode of genetic correlations between countries obtained with the MT model were between 0.86 and 0.90, except for the correlation between CAN and ZAF (Table 4
). Genetic correlations with ZAF were estimated with much uncertainty, mainly due to the small amount of data in combination with the low number of bulls with daughters in both ZAF and one of the other three countries. Genetic correlations that involved AUS were higher in this study compared to correlations calculated from national evaluations for sires that are used by Interbull. A similar observation was made by Weigel et al. (2001) who used performance records of Holstein cows from 17 countries. The estimated genetic correlations may have been higher because the data used in this study spanned a shorter time period than did the data used by Interbull, and statistical models and number of lactations considered were different from national evaluations.
|
HC
Three clusters were identified (Table 5
). The majority of herds from CAN, USA, and ZAF were in cluster 1; this cluster consisted of mainly small herds with relatively low within-herd variation of milk production, and early peak production. The majority of the AUS herds were in the second cluster, that also contained about one fourth of the USA herds. Herds in this cluster were small as well, but peak production was somewhat later in lactation, and more variation in milk production existed within these herds. The third cluster was made up by the somewhat larger herds from USA and ZAF with a relatively high level of within-herd variation.
|
|
RR
Sire variances for the intercept and the slope of reaction norms for sires were 66,400 and 1360 kg2, respectively. The posterior mode for the genetic correlation between the intercept and the slope of reaction norms was 0.57. The residual variance, with a value of 755,900 kg2, was close to that of the ST model (Table 3
). The heritability in herds with an average level of milk peak yield was 0.33, and the estimated correlation between milk production in extreme environments for milk peak yield was 0.62.
The sire variance of 305-d milk production in each environment was computed using the sire variance-covariance matrix for the intercept and slope, as described by Kolmodin et al. (2002), and results hereof are in Figure 1
. The sire variance of 305-d milk production increased with increasing herd average for milk peak yield over the major part of environment range. Standardized peak milk yield for 95% of the herds was between -5 and 4, and in this interval the sire variance of 305-d milk production increased from 47,000 to 134,000 kg2, comparable with results from literature (e.g., Zwald et al., 2003). An unusual pattern of sire variance at the extremes of the environment range was observed, but these sire variances were estimated with more uncertainty (Figure 1
). In addition, the sire variance of 305-d milk production in each environment is a quadratic function of the value for the environmental descriptor (Kolmodin et al., 2002), and the few observations in the extreme environments had relatively little influence on the actual shape of the parabola. Other functions to describe reaction norms, in combination with transformation of the environmental descriptor, may be needed to improve the behavior of the reaction norm model at the extremes.
|
|
| DISCUSSION |
|---|
|
|
|---|
Genetic correlations between clusters were more precisely estimated than genetic correlations between countries (Tables 4
and 6
) even though the difference in the order of the genetic (co)variance matrix between model HC and MT was only one. Surprisingly, both the goodness of fit and the effective number of parameters was in favor of model MT compared with model HC, suggesting that countries rather than production environments should be used for stratification of herds. The range of phenotypic variances was much larger for countries than for clusters, probably as a result of the presence of many USA herds in all three clusters (Table 5
). Hence, phenotypic variances associated with AUS and ZAF records were probably too high with model HC, causing the performance of this model to be below expectation.
The only difference between models RR and ST was the slope of the reaction norms for sires, i.e., the random linear regression of 305-d yield on the herd average for peak milk yield. The significant slope of the reaction norm found in this study implied that the genetic variance increased with higher peak milk yield in herds, i.e., more genetic variance existed in the better environments (Figure 1
). The random regression model used in this study did not allow for heterogeneity of residual variances. However, results from models ST, SThet, and MT indicated that residual variances were heterogeneous. Relaxing this assumption may improve the goodness of fit and the comparison with the other models in this study. Other improvements worth considering are inclusion of additional environmental descriptors in the reaction norms and inclusion of linear regressions on environmental descriptors nested within fixed effects classes to allow effects to change over environments (e.g., the effect of age at calving may not be the same in herds with high and low peak milk yield).
The most basic categorization of genotype x environment interaction is to distinguish between scaling effect and reranking. The high genetic correlations between clusters indicate few rank changes of animals over clusters. In addition, genetic correlations among countries were estimated with much uncertainty, and values near unity were not unlikely. Thus, the presence of a genotype x environment interaction that causes reranking was not supported by grouping herds according to country or production environment. On the other hand, both MT and HC analysis showed that genetic and residual variances were heterogeneous over production environments. For the country analysis, the DIC value for model SThet was better than for model MT even though homogeneity of the genetic variance was assumed. Model MT was, however, handicapped by the lack of information to estimate genetic correlations among countries, whereas the assumption of homogeneous variance probably had relatively little impact due to the dominance of USA herds. A model that accounts for heterogeneous genetic and residual variances but with unity genetic correlation between performance in production environments may perform better than the models studied in the present investigation.
Explanatory effects were the same in all five models, whereas the models for national evaluation in the four countries differ considerably (Interbull, 2000). For example, the national evaluation model in USA contains an effect of registry status and a herd x sire interaction, season is not part of the contemporary group definition in ZAF, and AUS considers only two season per year. All models in this study can be modified to include country-specific explanatory effects, but variation in type and definition of explanatory effects should be kept at a minimum to avoid so-called genotype x model interactions.
National milk recording schemes in the four participating countries are organized in accordance with international standards, and unstandardized test-day observations could easily be combined into a joint genetic evaluation. Sprengel et al. (2001) conducted a binational evaluation for milking speed, and faced a situation in which the trait was recorded very differently in both countries. The multiple-trait across-country model is well suited for such a situation, however, adaptation of the other models to accommodate for differently recorded traits is less straightforward if the genetic correlation among the recorded traits deviates from unity.
The prime interest of computing international genetic evaluations is to predict performance of future offspring in any country or production environment. The criterion for model comparison used in this study summarized the goodness of fit of each model to the data. However, a better fit of the model to the data does not guarantee more accurate predictions of future records. Therefore, it is recommended to compare sire breeding values and rankings from each model, as well as to evaluate the ability of each model to predict future observations.
| CONCLUSIONS |
|---|
|
|
|---|
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
Corresponding author: F. Fikse; e-mail:
Freddy.Fikse{at}hgen.slu.se.
Received for publication March 27, 2002. Accepted for publication August 12, 2002.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
H. Hammami, B. Rekik, C. Bastin, H. Soyeurt, J. Bormann, J. Stoll, and N. Gengler Environmental sensitivity for milk yield in Luxembourg and Tunisian Holsteins by herd management level J Dairy Sci, September 1, 2009; 92(9): 4604 - 4612. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Lillehammer, B. J. Hayes, T. H. E. Meuwissen, and M. E. Goddard Gene by environment interactions for production traits in Australian dairy cattle J Dairy Sci, August 1, 2009; 92(8): 4008 - 4017. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Lillehammer, M. E. Goddard, H. Nilsen, E. Sehested, H. G. Olsen, S. Lien, and T. H. E. Meuwissen Quantitative Trait Locus-by-Environment Interaction for Milk Yield Traits on Bos taurus Autosome 6 Genetics, July 1, 2008; 179(3): 1539 - 1546. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. M. Shariati, G. Su, P. Madsen, and D. Sorensen Analysis of Milk Production Traits in Early Lactation Using a Reaction Norm Model with Unknown Covariates J Dairy Sci, December 1, 2007; 90(12): 5759 - 5766. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. R. Bryant, N. Lopez-Villalobos, J. E. Pryce, C. W. Holmes, D. L. Johnson, and D. J. Garrick Environmental Sensitivity in New Zealand Dairy Cattle J Dairy Sci, March 1, 2007; 90(3): 1538 - 1547. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Konig, G. Dietl, I. Raeder, and H. H. Swalve Genetic Relationships for Dairy Performance Between Large-Scale and Small-Scale Farm Conditions J Dairy Sci, November 1, 2005; 88(11): 4087 - 4096. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Maltecca, A. Bagnato, and K. A. Weigel Comparison of International Dairy Sire Evaluations from Meta-Analysis of National Estimated Breeding Values and Direct Analysis of Individual Animal Performance Records J Dairy Sci, August 1, 2004; 87(8): 2599 - 2605. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |