|
|
||||||||
Department of Animal Sciences, Purdue University, West Lafayette, IN 47907
1 Corresponding author: agfahey{at}purdue.edu
| ABSTRACT |
|---|
|
|
|---|
Key Words: heteroscedasticity heritability variance quartile
| INTRODUCTION |
|---|
|
|
|---|
Calus et al. (2002) suggested that grouping herds by production system or intensity of production and then determining if genotype-by-environment interaction (GxE) was present might be useful. A GxE occurs when sires ranking for a trait changes between 2 environments. A scaling effect occurs when the rank of sires in 2 environments is the same, but the production level of their progeny in one environment is lower than in the other environment. Previous work on mature-equivalent milk (MEM), protein (MEP), and fat (MEF) by Kearney et al. (2004a) did not find evidence of a large GxE between grazing (G) and confinement (C) herds. In that study, genetic correlations between both management systems for MEM, MEP, and MEF were found to be 0.89, 0.91, and 0.88, respectively. A study to determine if a GxE was present for lactation-average SCS (LSCS) did not show that a GxE existed between G herds and C herds with a genetic correlation of 0.89 (Kearney et al., 2004b). In those studies, heteroscedasticity was considered in selected methodology.
Heteroscedasticity exists for both production and conformation traits (Boldman and Freeman, 1990; Smothers et al., 1991, 1993; Van der Werf et al., 1994). Ignoring heteroscedasticity may cause a bias in genetic evaluations (Vinson, 1987; Van der Werf et al., 1994), and the bias may become more severe as selection intensity increases (Vinson, 1987). Van der Werf et al. (1994) suggested that EBV for sires are biased, because dams of sires are high producers, and therefore the variability of their production records is greater, leading to a bias in calculations. Hill (1984) suggested that ignoring heteroscedasticity could lead to selecting animals from more-variable herds, resulting in a decrease in response to selection that would be detrimental to long-term genetic improvement. Accounting for heteroscedasticity improves the accuracy of evaluations. However, some research has shown that transformation for heteroscedasticity had only a small effect on rankings of animals (Winkleman and Schaeffer, 1988; Sullivan and Schaeffer, 1989; Wiggans and VanRaden, 1991). Other studies have shown that a moderate reranking of animals occurs when heteroscedasticity is taken into account (Meinert et al., 1988; Boldman and Freeman, 1990).
Various methods have been used to reduce the effect of heteroscedasticity. A log transformation was used in the northeastern United States (Everett and Keown, 1984). Further analysis of this approach showed that with a log transformation, herds with lowest yields had the greatest number of elite cows (Mirande and Van Vleck, 1985; Short et al., 1990), suggesting an overcompensation for heterogeneity of variance. A Bayesian approach was adopted in Australia. This method regressed herd variance toward a population variance; however, a constant heritability was assumed throughout the population (Jones and Goddard, 1990). Heteroscedasticity was adjusted as a weighted mean for herd-year-region, variation in adjacent years for the same herd-parity, and region-year-parity variance in a study reported by Wiggans and VanRaden in (1991), and this method is currently used in the United States.
The first objective of this study was to determine if accounting for heteroscedasticity by using a method similar to that used by the USDA in the US national genetic evaluations affected the ability of sires PTA to predict daughter production of MEM, MEP, MEF, and LSCS in G and C herds. The second objective was to determine if accounting for heteroscedasticity affected the heritabilities of the 4 traits and to ascertain whether heteroscedasticity produced sufficient extraneous variance to mask the underlying scaling effects or GxE. The hypothesis for this study was that standardization of within-herd standard deviation would reduce extraneous variance sufficiently to unmask scaling effects and GxE for milk, fat, protein, and LSCS between the G and C environments.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Analysis of MEM, MEP, MEF, and LSCS utilized data from 366 G herds in 11 states (New York, Indiana, Michigan, Iowa, Kansas, Virginia, North Carolina, South Carolina, Mississippi, Louisiana, and Oklahoma) consisting of 72,489 records from 35,674 cows with 4,048 sires. Confinement data consisted of 117,629 records of 50,963 cows with 6,268 sires in 373 herds in 12 states (same 11 states plus Illinois) as shown in Table 1
. The LSCS records were from 366 G herds from 11 states with 68,165 records from 28,635 cows with 3,638 sires, and 373 C herds from 12 states with 108,151 records from 45,486 cows with 5,900 sires (Table 1
). There were 3,408 sires common to both the G and C herds. Seven edits were applied to the data set. The first 6 are as suggested by Kearney et al. (2004a): 1) only records after 1990 were included in the analysis; 2) cows must have at least 1 lactation; 3) sire and dam pedigree information was required for all cows; 4) sixth and later lactations were deleted from the data set; 5) records more than 4 standard deviations from the mean of MEM, MEP, MEF, and LSCS were removed from analysis; and 6) lactations must have been at least 60 d long. In addition, cows with age of calving of less than 17 mo and greater than 135 mo for lactations 1 through 5 were deleted. Pedigree information for this study was supplied by the USDA Animal Improvement Programs Laboratory (Beltsville, MD). Kearney et al. (2004a) carried out analyses on quartiles based on mean MEM; however, in the current study we are investigating the effect of heteroscedasticity on production data based on variance (MEM) quartiles. Our data set was similar to that of Kearney et al. (2004a); however, we did not include Wisconsin herds and we included updated information for all herds.
|
![]() |
where yijklm = MEM, MEP, MEF, or LSCS for the mth record for cow l, in herd i, calving in year-season j, in age-parity class k, hi = fixed effect of herd i, ysj = fixed effect of year-season of calving j, apk = fixed effect of age-parity class k, ß = linear regression coefficient of daughter MEM, MEP, MEF, or LSCS on sire PTA, PTAl = USDA-DHIA PTA for MEM, MEP, MEF, or LSCS of the sire of cow l, and eijklm = random residual.
Age-parity (apk) accounts for variation due to the age of each cow within parity and is necessary despite the use of mature-equivalent records, which are determined by use of regionally derived factors that may not entirely account for age effects within a particular herd.
Herds were sorted by within-herd variation and then divided into quartiles based on variance for MEM. The number of lactations, cows, sires, and mean age of cows and sires for each variance quartile is detailed in Table 2
.
|
To account for heteroscedasticity, yield deviations within herd for MEM, MEP, MEF, and LSCS were standardized to the population average within-herd standard deviation in 1997 (Wiggans and VanRaden, 1991). An adjusted within-herd standard deviation for MEM, MEP, MEF, and LSCS was obtained as
![]() |
where sdijk * = within-herd standard deviation adjusted toward across-herd standard deviation according to herd size; dfijk = total degrees of freedom for MEM, MEP, MEF, or LSCS for herd i, year j, and parity k; sdijk = standard deviation for herd i, year j, and parity k for MEM, MEP, MEF, or LSCS; and
Base Year = the average standard deviation for MEM, MEP, MEF, or LSCS for the base year of 1996 for parity 1 and 1997 for parity 2 and later parities.
Then, individual lactation records were standardized according to
![]() |
where tm = adjusted values of the mth record of MEM (AMEM), MEP (AMEP), MEF (AMEF), or LSCS (ALSCS); ym = mth record for MEM, MEP, MEF, or LSCS; xijk = the within-herd mean of MEM, MEP, MEF, or LSCS for herd i, year j, and parity k; sdi = the standard deviation within-herd i; and sd* = within-herd standard deviation adjusted toward across-herd standard deviation according to herd size.
The base years of 1996 and 1997 were selected because they represented the middle time point in the data set, with years in the data set ranging between 1990 and 2003. This transformation was chosen because it is similar to the transformation used in the national genetic evaluations carried out by the USDA, which matches our objective to determine if the transformation accounts for the different environments between G and C herds in the United States. Following transformation, the previously described regression equation was repeated, but with AMEM, AMEP, AMEF, and ALSCS as dependent variables.
The (co)variance component estimation package, VCE4 (Neumaier and Groeneveld, 1998), was used to calculate heritabilities of traits within environments and genetic correlations of traits between environments before and after transformation. Restricted maximum likelihood (REML) using analytical gradients was used for these procedures. A bivariate animal model was used to calculate variance components, and the same trait in different environments was considered to be a different trait. Models were fit separately for the quartiles based on within-herd variance for MEM. The algebraic form of the animal model was as follows:
![]() |
where yijklm = the mth MEM, MEP, MEF, or LSCS record for animal j in permanent environment k, herd-year-season i, and in age-parity class l either adjusted or unadjusted for heteroscedasticity; hysi = fixed effect of herd-year-season i; aj = random additive genetic effect of animal j; pek = random permanent environment effect k; apl = fixed effect of age-parity class l; and eijklm = random residual effect.
Separate numerator relationship matrices were utilized for each quartile. The number of sires with daughters in the relationship matrices and the number of other relatives in the relationship matrices is outlined in Table 2
. Unknown parent groups were assigned as in Kearney et al. (2004a). Heritabilities for each trait were calculated as
a2/
p 2 and genetic correlations were calculated as
![]() |
where
2a = estimate of additive genetic variance,
2 =
2a +
2pe +
2e;
2 pe = estimate of permanent environmental variance;
2 e = estimate of residual variance;
g = estimate of genetic correlation;
a(G)a(C) = estimate of the additive genetic covariance of traits in G and C;
2a (G) = estimate of the additive genetic variance of traits in G; and
2a (C) = estimate of the additive genetic variance of traits in C.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
|
|
Means of PTA for milk (PTAM) of G herds in variance quartiles analysis showed that PTAM increased with within-herd variance of MEM (Table 5
). This indicated that bulls with higher producing daughters were in the higher variance herds. Standard deviations of these values were similar, and the maximum PTAM values increased as quartiles fluctuated. In C herds, PTAM values fluctuated among variance quartiles. Standard deviations were similar, and the maximum PTAM values were shown to fluctuate. Fluctuation of PTAM on C herds may be an artifact of sampling of herds.
|
|
|
|
Regression coefficients of unadjusted MEM, MEP, MEF, and LSCS on sires PTA for variance quartiles are shown in Table 9
. Although regression coefficients deviated from the expected value of 1.0, all ranged from 0.466 to 1.165. Regression coefficients for variance quartiles of G herds for MEM, MEP, MEF, and LSCS were all less than 1.0, suggesting that PTA overestimated production yields and LSCS. In G herds, regression coefficients increased from the lowest quartile to the highest quartile for MEM, MEP, and MEF. The first 3 quartiles for MEP were significantly less than 1 (P < 0.01); however, the fourth quartile with a regression coefficient of 0.893 was not significantly different than 1. Regression coefficients were significantly less than 1 (P < 0.01) for all quartiles, which suggests that sires PTA for fat consistently overestimated MEF in all quartiles.
|
Regression coefficients increased in both G and C herds from the lower quartiles to higher quartiles. This suggests that current sires PTA overestimated production for herds in low quartiles and that a separate evaluation with reporting of PTA may be justifiable for herds with low phenotypic variance quartiles vs. those with high phenotypic variance. Alternatively, standardization for heteroscedasticity could be applied.
Table 10
has heritabilities, ratios of permanent environmental variation to phenotypic variation, and genetic correlations for MEM, MEP, MEF, and LSCS for variance quartiles based on MEM variance. Heritability of MEM varied between 0.15 and 0.18 for G herds, and 0.14 and 0.20 for C herds. The ratio of permanent environmental variation to phenotypic variation ranged from 0.20 to 0.23 for G herds with a lower range (0.23 to 0.27) for C herds. Values for genetic correlations were diverse. The first quartile had a correlation of 0.21 with a standard error of 0.22. The second quartile had a correlation of 0.77; the third quartile had a correlation of 0.97, whereas the fourth quartile had a genetic correlation of 0.86. Heritabilities for MEM were similar to those calculated in 2 previous studies that split data sets according to phenotypic variance (Hill et al., 1983; Lofgren et al., 1985). Production levels usually increased as phenotypic variance of the trait increased, and herds with higher production had higher heritabilities (Legates, 1962). Other studies found that milk had a higher heritability in high-variance herds compared with low-variance herds (Hill et al., 1983; Lofgren et al., 1985; de Veer et al., 1987; Van Vleck and Dong, 1988). In this study for G herds, the lowest variance quartile for milk had a heritability of 0.15 and this increased to 0.18 for the second quartile; then, a negligible drop to 0.17 was found for the third and fourth quartiles. Increasing permanent environmental variation will increase phenotypic variation, and if the additive genetic variation does not increase with permanent environmental variation, then heritability will be reduced. In C herds, no consistent trend for heritability of MEM was noticed. The heritability was similar for the first, second, and fourth quartiles. For the third quartile, heritability for MEM increased to 0.20, but this result is not readily explained.
|
Heritability for MEP was calculated to be 0.18 for the first quartile in G herds; however, this decreased steadily to 0.16 in the fourth quartile, even though protein yield increased for G herds (Table 10
). Permanent environmental variation does not seem to have a very clear effect on the heritability trend for MEP; this may be because the data set was divided according to the variance of MEM and the relationship between MEM and MEP, although strong, is not perfect. Heritabilities for MEP in C herds were highest for the first quartile with a value of 0.19; the heritability decreased to 0.16 for the second quartile, increased to 0.18 for the third quartile, and fell again to 0.16 for the fourth quartile. This occurred even though MEP yields increased in C herds as the variance quartiles increased. Changes in permanent environmental variation were small, with a range from 0.25 to 0.28 for C herds. Heritability trends such as these may be due to a relatively greater increase in phenotypic variance than additive genetic variance causing heritabilities to decrease.
Genetic correlations for MEP increased from 0.78 to 0.93 from the first to the third quartile with a decrease to 0.85 for the fourth quartile. A scaling effect may be present for the first quartile. This implies that production in the lowest quartile is largely controlled by the same genes; however, due to environmental effects, gene expression for the G herds does not reach its full potential. Correlations for MEP were similar to that of other studies (Calus et al., 2002; Boettcher et al., 2003; Kearney et al., 2004a). The other quartiles with genetic correlations
0.80 likely do not present evidence of detectable scaling effects or a GxE.
Heritabilities for MEF decreased as quartile variances increased. The heritability for MEF in the first quartile was 0.23; however, by the fourth quartile this had decreased steadily to 0.17, similar to that found by Kearney et al. (2004a). This occurs despite the increase in fat yields (Table 7
). A similar trend was observed for both heritability and permanent environmental variation for MEF in the C herds. The decrease in heritability may be explained by the increase in permanent environmental variation for MEF in G herds from the first quartile to the fourth quartile.
Compelling evidence for GxE was not observed for MEF, as the genetic correlations were all greater than 0.80. A scaling effect may be present for the second and fourth quartiles with genetic correlations of 0.84 and 0.87, respectively.
Heritability of LSCS ranged between 0.08 and 0.16 for G herds and from 0.10 to 0.14 for C herds, which is similar to previous work (Kearney et al., 2004b). Trends in both the G and C herds showed that the heritability fluctuated from one quartile to the next. A similar pattern was observed for permanent environmental variation. These trends were quite different compared with those observed for MEM, MEP, and MEF, probably because quartiles were based on yield, and yield traits had higher correlations among each other than they did with LSCS. Lactation SCS had genetic correlations that varied between 0.78 and 0.98 for the variance quartile analysis, which fluctuated in a similar way as the heritabilities and permanent environmental variance.
Regression of AMEM, AMEF, AMEP, and ALSCS on sires PTA was carried out for variance quartiles for G and C herds (Table 11
). Changes in regression coefficients were small (<0.10) when compared with the regression coefficients for MEM, MEP, MEF, and LSCS before transformation (Table 9
), and changes in standard errors were negligible. In G herds, the coefficients of regression increased for AMEM, AMEP, AMEF, and ALSCS when compared with regression coefficients for MEM, MEP, MEF, and LSCS (Table 9
) for the first 3 quartiles, but they were still less than unity. This result suggests that by reducing the level of heteroscedasticity for these traits, sires PTA overestimated production levels to a lesser extent for the first 3 quartiles. Regression coefficients for AMEM, AMEP, AMEF, and ALSCS decreased for the fourth quartile after the transformation was applied to the data. Accounting for heteroscedasticity reduced regression coefficients, thereby increasing the overestimation of sires PTA on production, when compared with the data before transformation.
|
Heritabilities, ratios of permanent environmental variation to phenotypic variation, genetic correlations, and standard errors associated with these values were calculated for AMEM, AMEP, AMEF, and ALSCS for G and C herds, and are shown in Table 12
. When compared with the corresponding values for the 4 traits before transformation (Table 10
), it can be seen that accounting for heteroscedasticity did not change heritability, permanent environmental values or their standard errors by any significant amount. There was also no trend in the changes; some values increased, some decreased, and some were unchanged. Changes in genetic correlations and their standard errors for the 4 traits between the 2 environments were small and did not change the overall result of whether a GxE was present.
|
A GxE was found for the first and second variance quartiles in G herds (Table 10
). This would suggest that sires PTA might not be adequate for G herds as progeny testing is typically carried out in C systems. One way to correct for this would be to introduce separate progeny testing for G herds; however, this would not be practical due to the expense (Weigel et al., 1999).
| CONCLUSIONS |
|---|
|
|
|---|
Regression coefficients were used to determine the adequacy of sires PTA in predicting performance of daughters. Levels of phenotypic performance in the lower variance quartiles (mostly the first and second) of G herds were inadequately predicted by the sires PTA but prediction was satisfactory for the upper quartiles of G herds and all quartiles for C herds. This may be because sires for the US dairy industry are progeny tested primarily in C systems, and their PTA values are a result of this. Graziers using the least intensive management should be advised that bull PTA for milk, fat, and protein might not be as predictive of future daughter performance as in C herds or the more intensively managed G herds. Accounting for heteroscedasticity did not resolve the issue for low-variance G herds.
Heritabilities were largely unaffected in all analyses by the incorporation of a transformation to reduce heteroscedasticity, so current heritability figures for the industry are valid for both G and C herds. Genetic correlations were not significantly <1 among higher quartiles. However, in lower quartiles there were genetic correlations less than 0.80, suggesting a biologically significant interaction. The results of these genetic correlations suggest that MEM, MEP, MEF, and LSCS in high-producing G herds are largely being controlled by the same genes as in C herds.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
Received for publication October 24, 2006. Accepted for publication April 27, 2007.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
H. Hammami, B. Rekik, H. Soyeurt, C. Bastin, J. Stoll, and N. Gengler Genotype x Environment Interaction for Milk Yield in Holsteins Using Luxembourg and Tunisian Populations J Dairy Sci, September 1, 2008; 91(9): 3661 - 3671. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |