|
|
||||||||
Department of Genetics and Biotechnology, Faculty of Agricultural Sciences, Aarhus University, DK-8830, Tjele, Denmark
1 Corresponding author: guosheng.su{at}agrsci.dk
| ABSTRACT |
|---|
|
|
|---|
Key Words: Gibbs sampler heterosis by environment interaction reaction norm model spline regression
| INTRODUCTION |
|---|
|
|
|---|
An H x E interaction is usually examined by comparing the measures of heterosis in a few distinct environments or environment clusters. In beef cattle, many previous studies have analyzed heterosis x forage (or feeding) environment interaction (Arthur et al., 1999; Brown et al., 2001; Phillips et al., 2001). In dairy cattle, Bryant et al. (2007) analyzed breed performance and heterosis in different environments including herd-average of yield (4 levels), heat load index (4 levels), herd size (3 levels), and altitude (3 levels). The traditional method is useful in situations with few levels of environments. Generally, grouping environments could have some disadvantages. First, if a continuous underlying scale exists, the grouping could be more or less arbitrary (Strandberg, 2006). Second, the grouping ignores the variation of environment effects within group. Third, it may lead to inefficient estimation of parameters.
To deal with a large number of environment levels (e.g., herd-years) in the analysis of H x E interaction, an alternative approach could be to apply reaction norm model. Reaction norm models have been used in analysis of additive genetic and environment interaction, especially in dairy cattle (Calus et al., 2002; Kolmodin et al., 2002; Oseni et al., 2004). Su et al. (2006) developed a method to estimate environmental value simultaneously with other parameters in a reaction norm model.
It is a statistical challenge to distinguish production environment effect, heterosis, and breeding value in a hybrid population, provided H x E interaction. The objectives of the present study were: a) to propose a reaction norm model to analyze H x E interaction, b) to develop a computing method for the proposed model, and c) to test the algorithm for the proposed model and method using simulated dairy cattle data. In addition, the effect of ignoring H x E interaction on predicted breeding value was investigated, based on the simulated dairy cattle data.
| MATERIALS AND METHODS |
|---|
|
|
|---|
The proposed reaction norm model is an extension of a traditional regression of heterosis on heterozygosity (h) based on the dominance theory. In a traditional regression model, heterosis is expressed as heterosis = bh.
In the situation of H x E interaction, the regression coefficients (b) are different in different environments. Assume that environments can be quantified as an environmental gradient, the regression coefficients can be described as a function of environmental values:
|
|
For example, there are 5 environments with an environmental gradient u = (10, 15, 20, 25, 30), and the regression coefficients of heterosis on heterozygosity in the 5 environments are b = (10, 12, 14, 16, 18). It can be seen that there is a linear relationship between b and u. Thus, b can be described by a function:
![]() |
In the situation of nonlinear relationship between b and u, say b = (13.0, 15.5, 17.0, 17.5, 17.0), b can be described by a function as:
![]() |
As demonstrated above, in the case of a large number of environment levels, H x E interaction can be accounted for using a reaction norm with few parameters. Therefore, an observation (yij) involved in H x E interaction can be modeled as:
|
|
Restricting to linear H x E interaction, the model is simplified as:
|
|
ui < Tk+1:
|
|
and
|
|
|
|
Let
k = β0 + βk,
k+1 = β0 + βk+1, and so on (notice that
0 = β0 and
1 = β1 in the linear reaction norm model). The model for nonlinear H x E is written as:
|
|
The model for linear H x E is:
|
|
![]() |
is a vector of regressions on the product of heterozygosity and function of environment effect (reaction norm), a is a vector of additive genetic effect, e is a vector of residuals, and X, Zu, H, and Za are incidence matrixes. The nonzero elements of the matrices H are the functions of environmental value u. In the model, u is treated as random to avoid problems with identifiability. To make description convenient, in the context, location parameters (b, u,
, and a) are expressed as fixed and random effects (which are usually used in likelihood setting), though the terms are seldom used in Bayesian settings.
The conditional distribution of y is assumed to be normal having the form:
|
|
To estimate the environmental covariates, the regression coefficients of the reaction norm, and the other parameters of the model simultaneously, a Bayesian Gibbs sampling approach was developed on the basis of the methodology reported by Su et al. (2006).
Prior Distributions and Joint Posterior Distribution
The prior distributions of b and
are assumed to be improper uniform, p(b)
constant, p(
)
constant.
The random vectors of u and a are assumed to have normal prior distributions:
|
|
|
|
The prior distributions of the variances of
and
are assumed to be scaled inverse
2 distributions:
|
|
i is degree of freedom and si is a scale parameter. The joint posterior distribution of all the parameters is
![]() |
Fully Conditional Posterior Distribution of the Location Parameters 
Let
be the vector of all location parameters except for u [i.e.,
= (b',
', a')'], the fully conditional posterior distribution of
is:
|
|
|
|
Thus, the fully conditional posterior distribution of
can be expressed as:
|
|
, given dispersion parameters and u, is multivariate normal: |
|
Letting
i denote an arbitrary element (or set of elements) of
and
–i denote the vector of
excluding
i, the fully conditional posterior distribution of
i is:
|
|
Fully Conditional Posterior Distribution of u
The density of the fully conditional posterior distribution of u is:
|
|
Reparameterization for Linear H x E Model.
For a linear H x E model, an observation y can be described as:
|
|
An alternative formulation of the reaction norm model [4] is:
|
|
1hj).
Reparameterization for Nonlinear H x E Model.
In the case of a nonlinear H x E model, the reparameterization can be done as follows:
![]() |
Thus, the reaction norm model [4] for a nonlinear H x E interaction has an alternative form as:
|
|
|
|
|
|
Without knowing the range of environmental values, it is difficult to assign a set of reasonable values for the knots using the original scale. To simplify the implementation, the current model uses predetermined knot values in units of standard deviation of environment effects, say T*, the knot values in the original scale are assigned dynamically during Gibbs sampling process as:
![]() |
u(i) is the standard deviation of u in the ith iteration.
Fully Conditional Distribution.
Assuming that
is known, for a linear reaction norm model, define:
|
|
For a nonlinear reaction norm model, define:
|
|
|
|
|
|
|
|
Fully Conditional Posterior Distribution of Dispersion Parameters
The fully conditional posterior distribution of disperse parameters are recognized as scaled inverse
2 distributions. Thus,
|
|
|
|
![]() |
i (i = e, a, u) is the degree of freedom and
Implementation of the Gibbs Sampler
The algorithm of Gibbs sampler for the proposed reaction norm model is as follows:
, C
, and r
and draw
i from
2 distributions.
The computing program for the proposed model was developed and integrated into the DMU-package (Madsen and Jensen, 2004; Madsen et al., 2006). In the actual implementation, the "iteration on data" technique, which avoids storing C
and Cu, was applied.
Simulation Studies
Data Generation.
The proposed method was evaluated using a simulation study. Data were generated according to a structure mimicking dairy cattle but simplifying the method of selection and replacement.
The data were produced in 2 steps. In the first step, data were generated without considering herd-year effect and heterosis. An observation of base generation was generated using a model:
![]() |
In the following generations, an observation was simulated as:
![]() |
Two populations were created for a period of 20 yr. The cows and young bulls were selected randomly. The proven bulls were selected on the phenotypic means of their daughters. From yr 3 and onward, approximately 25% of the cows in population I were inseminated with the semen imported from population II, but all daughters of young bulls were used for replacement. Cows did not move from one herd to another. The breeding scheme for the 2 populations is shown in Table 1.
|
|
|
![]() |
The simulation study included 4 scenarios with regard to the size of additive genetic and herd-year variation (M = moderate variation and S = small variation in the context). For each scenario, 10 replicates were generated. The parameters used in the 4 scenarios are reported in Table 2. Data for the first parity of population I were used in the current analysis.
|
Statistical Analysis
The simulated data were analyzed using 4 alternative models. Model 1 ignored H x E interaction,
|
|
bp was the regression on breed proportion (population II),
0 was the regression coefficient on breed heterozygosity (H0), u was the vector of random herd-year effect, a was the vector of additive genetic effects, and e was the vector of residuals.
Model 2 used the true curve function and true covariates (Ht) to fit heterosis, and treating herd-year (u) as random effect,
|
|
was the vector of regression coefficients on known covariates of the reaction norm [i.e., the first, second, third, and fourth column of Ht was breed heterozygosity (het), Model 3 was the same as model 2 but treating herd-year effect as fixed effect,
|
|
Model 4 (the proposed model) was the model with unknown covariates (Hu) of the reaction norm for heterosis, and treating herd-year (u) as random effect,
|
|
was the vector of spline regression coefficients on unknown covariates of the reaction norm [i.e., regressions on | RESULTS |
|---|
|
|
|---|
|
|
|
|
| DISCUSSION |
|---|
|
|
|---|
In the proposed reaction norm model, total heterosis consists of an environment-independent heterosis and an environment-dependent heterosis. The environment-dependent heterosis was described as a function of breed heterozygosity and environmental value (e.g., herd-year effect). Reaction norm model has the advantage of dealing with an infinite number of environment levels with few parameters. It is useful when phenotypes change gradually and continuously over an environmental gradient (de Jong, 1995). Although reaction norm models have been widely used to analyze additive genetic effect x environment interaction in dairy cattle (Calus et al., 2002; Kolmodin et al., 2002; Shariati et al., 2007), the present study is the first to propose a reaction norm model to analyze H x E interaction.
Various known descriptive variables (e.g., temperature, humidity, feeding level) can be used as covariates of reaction norm, but a common used environmental gradient in practical genetic evaluation is the effect of production environment. However, the effect of production environment is unknown. One approximation is to use the average phenotypic performance of the individuals in a given environment as environmental covariate (Kolmodin et al., 2002). Another approximation is to estimate effect of production environment using a simple linear model or a standard additive genetic model, and then use the estimate as proxy of environmental covariate (Calus et al., 2002; Oseni et al., 2004). These approximations could be unsatisfactory in many situations, especially when the trait of interest is under selection (Calus et al., 2004; Su et al., 2006). Su et al. (2006) developed a Bayesian reaction norm model that allows inferring environmental covariates and the other parameters in the model simultaneously. The advantage of this approach over the approximate methods has been shown in a simulation study (Su et al., 2006). Therefore the proposed model applies the methodology reported by Su et al. (2006).
The proposed model applies linear spline regression to fit nonlinear reaction form. Spline regression has the advantage of accommodating various nonlinear reaction norms including nonparametric reaction norms. One challenge in spline regression is to choose the right number and locations of the knots. Too many knots will increase complexity and the uncertainty of the estimates, whereas too few would reduce the predictive ability. Usually, the number and positions of knots in splines are chosen corresponding to the pattern of the trajectory. Knots can be dense in regions with fast changes and sparse in regions with slow changes. The strategies of choosing number and positions of knots have been reported by several studies (Biller, 2000; Zhou and Shen, 2001; Hansen and Kooperburg, 2002; Misztal, 2006). With unknown environmental covariates, it is difficult to predetermine the positions of knots. The proposed model uses predetermined values of knots in standard deviation units of environmental values. The positions of knots in original scale are assigned dynamically during the process of Gibbs sampling. The results from the simulation study show that this technique works well.
When applying the proposed model to analyze H x E interaction, attention should be paid to interpretation of the regression coefficients in relation to the scale of the covariables. In the proposed model, heterosis is described as
According to the properties of regression coefficient, in general, scaling environmental value (u) by a constant (e.g., by SD) will change βk but not β0, and scaling heterozygosity (h) by a constant (e.g., h in percentage) will change both βk and β0. Moving the origin of u (e.g., subtracting the mean) will change β0 or both βk and β0, dependent on the feature of
k(ui). However, these changes in scale of u and h will not change the prediction of heterosis. On the other hand, moving the origin of h (e.g., subtracting a constant c) will result in biased prediction of heterosis with an amount of β0c for all individuals and
which is different for individuals in different environments.
The simulation study showed that the proposed model detected H x E interaction and predicted heterosis and breeding value accurately. The accuracy of predicted heterosis was even higher than the model using true environmental values and true curve function, and treating herd-year as a random effect (Table 4). In addition, the accuracy of predicted breeding values from the model treating herd-year as fixed effect was lower than the models treating herd-year as random effect (Table 5). It has been shown that treating herd-year effect as random can increase the accuracy of selection (Visscher and Goddard, 1993) and the predictive ability of the model (Babot et al., 2003). On the other hand, in the situation of an association between sire effect and herd effect or an environmental trend, treating herd-year effect as random could lead to biased estimates of breeding values (Henderson, 1973; Visscher and Goddard, 1993; Babot et al., 2003). However, Ugarte et al. (1992) reported that even under an association between sire and herd, there is no bias in predicted breeding values in the simulated scenarios where sizes of contemporary groups are larger than 12.
Based on the simulated data, a model ignoring H x E interaction does not harm the accuracy of predicted breeding value. This can be explained from 2 aspects. First, although there was a considerable H x E interaction in the simulated populations, the variance of environment-dependent component of heterosis was small. In addition, though there was a large difference in the levels of breed heterozygosity between years, the variation within herd-year was small because sires were used randomly. In a model ignoring H x E interaction, the between-herd-year variation of H x E interaction (the major part of H x E variation) was accounted by herd-year effect. It indicates that the random error due to H x E interaction is negligible in the current simulated data.
The second reason is that in the simulated scenarios (sires were used randomly across environments), ignoring H x E interaction does not lead to systematic bias in prediction of breeding value. As shown in Table 3, the correlation between H x E effect and breeding value, and between dam and daughters H x E effect, is close to zero. The variance between sires for H x E effect, after correction for herd-year mean of H x E interaction, is extremely small. These results indicate that ignoring H x E interaction does not increase or decrease the resemblance between relatives. Therefore, the systematic error due to ignoring H x E is negligible.
However, ignoring H x E in genetic evaluation might lead to a serious problem in situations other than those mimicked by the current simulation. In the case with many close relatives with similar heterozygosities within the same herd-year, ignoring H x E could lead to a systematic error in prediction of breeding values. It would be useful to carry out further studies on the influence of ignoring H x E on prediction of breeding value, with regard to various breeding schemes and population structures.
The proposed model and method were developed to handle H x E interaction using a reaction norm with few parameters. This approach could be a good alternative to estimate H x E interaction and predict breeding value in situations where there are a large number of environment levels and the environments can be qualified as an environmental gradient, but not necessary to perform well in all situations. For example, to analyze environment-dependent heterosis in few levels of heterozygosity and few levels of environment, the traditional classified interaction model could be the best choice.
The weakness of this study was that there were some flaws in simulating data. In the simulation study, data were generated to mimic dairy cattle populations. However, the properties of the simulation were not really agreeable with real-life scenarios, due to simplifying the simulation procedure. First, the simulation did not generate history generations to create the difference in allele frequencies between 2 populations, and a certain level of inbreeding. Second, the Mendelian sampling terms for offspring were generated without considering inbreeding coefficients of parents. Third, the function to create heterosis leads to a negative heterosis in environments with a value less than –2.4 standard deviation, which could be not realistic.
| CONCLUSIONS |
|---|
|
|
|---|
| ACKNOWLEDGMENTS |
|---|
|
|
|---|
Received for publication June 30, 2008. Accepted for publication December 3, 2008.
| REFERENCES |
|---|
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |