|
|
||||||||
,


* Department of Animal and Aquacultural Sciences, Norwegian University of Life Sciences, N-1432 Ås, Norway
University of Debrecen, Centre of Agricultural Sciences, Department of Animal Sciences, H-4032 Debrecen, Hungary
Bovibank Ltd., PO Box 58, N-1431 Ås, Norway
Centre for Integrative Genetics (CIGENE), Norwegian University of Life Sciences, N-1432 Ås, Norway
|| GENO Breeding and AI Organisation, N-1432 Ås, Norway
1 Corresponding author: marie.lillehammer{at}umb.no
| ABSTRACT |
|---|
|
|
|---|
Key Words: quantitative trait loci random regression autosome genotype by environment interaction
| INTRODUCTION |
|---|
|
|
|---|
The third point explains a major difference between using a multiple-trait approach and using random regression to describe GxE. When using the correlation between environments as the test-statistic for interaction, as is done by the multiple-trait model, only re-ranking of effects between environments is defined as GxE. The random regression model searches for differences in slope between the different genotypes. There can be a large difference in slope even though the genotype effects do not show reranking within the environmental range considered. This situation, referred to as "scaling effects", is defined as GxE when a random regression model is used, but not when a multiple-trait model is used.
When random regression models have been used to describe sires BV as a function of the production level in the herd, the function (or reaction norm) has often been reported to be linear (Kolmodin et al., 2002). High correlations between the intercept and slope of the genetic effect has been reported for protein yield (Calus et al., 2002; Kolmodin et al., 2002), indicating that there are some genes affecting both slope and intercept of the trait. When the same genes affect the average effect and the environmental sensitivity of a trait, selecting for increased average production will also change the environmental sensitivity in the population.
When searching for QTL, possible interactions with the environment have usually not been taken into account (e.g., Olsen et al., 2004; Jiang and De et al., 2005; Szyda et al., 2005). Including interaction between QTL effect and environment might increase the power of QTL detection (Lund et al., 2002; Lillehammer et al., 2007). However, as in prediction of BV, a multiple-trait model or a random regression model can be used in QTL analysis. When the environmental value is a continuous variable, such as production level, a random regression model has the previously described benefits over a multiple-trait model.
The use of a random regression model to detect QTL with an environmental interaction effect requires observations on the same QTL allele over a range of different environments. The most common design for QTL detection in dairy cattle is the granddaughter design (Freyer et al., 2003). The data consists of marker information on grandsires and their sons and phenotypic information on daughters of genotyped sires. A daughter yield deviation is calculated per son as an average of precorrected records from the sons daughters. As a result of averaging over environments, the environmental effects are lost. Thus, GxE studies are based on analysis of individual cow records. Lillehammer et al. (2007) presented a method in which individual cow records were used in the QTL analyses and applied it to simulated data.
The aim of this study was to search for QTL by environment interactions in Norwegian Red cattle using random regression models. In these models, an environmental gradient is defined as the random herd-year effect, predicted in an initial analysis. The search was performed on all the autosomes except Bos taurus autosome 6 (BTA6). Several QTL for milk production have been reported on BTA6 (Khatkar et al., 2005), and a more detailed study of BTA6 is planned with a dense marker map; thus, BTA6 was dropped from the current study.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Phenotypic Data
Lactation records of milk (kg x 103 per 305-d lactation), protein (kg per 305-d lactation), and fat yields (kg per 305-d lactation) in first lactation were obtained from the GENO Breeding and AI Association (Ås, Norway). The phenotypes were precorrected for heterogeneous variance due to parity and age within parity. Corrected yields were analyzed with the official repeatability animal model, including the fixed effects of age x parity, month of calving x parity, days open x parity, and year of calving and for the random effects of herd x year, animal (genetic effect) and permanent environment of the cow. Yields from 3 lactations were included in the repeatability model analysis. Records for the subsequent QTL mapping analyses were obtained from the repeatability model by subtracting fixed and random effects from the corrected yield:
![]() |
where Y is used as the record in the QTL mapping analyses, cy is the precorrected yield (milk, fat or protein), age x par is the fixed effect of age x parity, cm x par is the fixed effect of calving month x parity, do x par is the fixed effect of days open x parity, year is the fixed effect of year of calving, hy is the random effect of herd x year, and pe is the random effect of permanent environment.
The random herd-year solution predicted by the repeatability model was used as the variable describing the production level of the herd in the subsequent QTL mapping analyses. The phenotypic data included first-lactation records on 468,565 daughters from 738 genotyped sires. Each sire had between 108 and 8,586 daughters.
Marker Data
The DNA from each animal was extracted from frozen semen samples. The IOBT730 microsatellite was chosen from the BOVMAP database (http://locus.jouy.in-ra.fr/cgi-bin/lgbc/mapping/common/main.pl?BASE=cattle; accessed 2004) Web site, and the remaining markers were chosen from the USDA MARC database (http://www.marc.usda.gov/genome/genome.html; accessed 2004). Genotypes for the 160 microsatellites were determined using the GeneMapper program, version 4.0 (Applied Biosystems, Foster City, CA). Linkage maps (Table 1
) were constructed using the CRI-MAP program, version 2.4 (Green et al., 1990) and the Haldane map function.
|
![]() | [1] |
where Yjkl = the yield deviation of milk, protein, or fat yield for daughter k of sire j, in herd-year class l, µ = the overall mean, b = overall fixed regression coefficient on hyl, hyl = random solution for herd-year class l for either milk, protein, or fat yield, sirej = random effect of sire j (intercept), which assumed that the effect comes from a N(0, A
2sire) distribution, ßj = random regression coefficient on hyl, nested within sire j; the effect was assumed to come from a N(0, A
2ß) distribution. The covariance between sire and ß effect was modeled as: Cov(sire, ß) = A
sire,ß, and ejkl = random residual.
The model with a QTL that does not interact with the environment was
![]() | [2] |
where hi1 and hi2 are the random effects of the paternal and maternal haplotypes of sire j, respectively, which assumed that the effect comes from a N(0, G
2h) distribution, where G is the IBD matrix between the haplotypes and
2h is the haplotype variance for intercept.
The model with a QTL that interacts with the environment was
![]() | [3] |
where hi1 and hi2 describe the random effects of sire js haplotypes on the intercept of the trait,
i1 and
i2 describe the random regression coefficients on hyl, nested within each sires paternal and maternal haplotypes respectively, and
i1 and
i2 were assumed to come from a N(0, G
2
) distribution, where G is the IBD matrix between the haplotypes and
2
is the haplotype variance for slope. The covariance between the intercept and slope of the QTL effect was modeled a Cov(h,
) = G
h,
.
Statistical analyses were performed with the ASREML software (Gilmour et al., 2001). Paternal and maternal haplotypes were assumed to be different classes within the same random effect.
Likelihood ratio (LR) tests were used to test for significance in 2 steps. First, the log-likelihoods of model [2] were analyzed with model [1] as the reduced model to test whether a significant QTL for intercept of the trait was segregating at the putative QTL position. Second, an LR test of model [3] was performed, with model [2] as the reduced model to test for significant QTL for slope of the trait. Each test was carried out for the position on each chromosome with the highest LR value.
Chromosome-wise thresholds were chosen to obtain a genome-wise suggestive linkage, and one false positive per genome scan was expected (Lander and Kruglyak, 1995). For each test, a chromosome-wise threshold value was calculated by the method of Piepho (2001). For the intercept test, the number of genetic effects of the putative QTL (degrees of freedom of the test) was 1 (an additive effect only). For the slope test, the number of genetic effects of the putative QTL was 2 (additive slope effect and covariance between slope and intercept).
A genome-wise false discovery rate (FDR) was calculated by dividing the number of expected false positives by the number of detected QTL. The test was conducted for 2 models and 3 traits over 28 chromosomes, giving 168 (2 x 3 x 28) tests. The expected number of false positives was 6 (2 models x 3 traits).
| RESULTS |
|---|
|
|
|---|
|
Figures 1
, 2
, and 3
show the haplotype effects of the grandsires for the QTL with significant environmental interactions. Each grandsire is represented by 2 lines in each figure, but 1 line can refer to more than 1 grandsire if they share a haplotype with a probability of more than 0.95 (Meuwissen et al., 2002).
|
|
|
Figure 2
shows the haplotype effects found on BTA7, in the bracket between BMS713 and BM6105, for milk yield and protein yield. For milk and protein yield, the estimated correlation between slope and intercept was close to unity (0.999). However, because there is no variance in the random intercept for protein yield (Figure 2
), the correlation between slope and intercept is not defined for this trait. All of the haplotype effects cross each other in one single point, indicating that there are 2 QTL alleles.
Figure 3
shows the effects found in the bracket between TGLA254 and BM121 on BTA16. In this bracket significant QTL for milk and protein yield were found. For both traits, correlation between slope and intercept of the QTL effects were <0.9.
The 2 QTL with a significant interaction effect for milk yield contributed 32.0 and 34.7% of the total sire slope variance, according to their variances (Table 2
). The 3 QTL with significant slopes for protein explained 48.7, 28.1, and 33.0% of the total sire slope variance for protein (Table 2
). Theoretically, the sum of several QTL can exceed the total genetic variance, because the variance generated by QTL can cancel each other out. However, it is well known that the variance explained by significant QTL is overestimated (Hayes and Goddard, 2001). The large proportions of the genetic variances explained by these detected QTL implies that the model overestimates the QTL variance.
To obtain a threshold corresponding to genome-wise suggestive linkage (Lander and Kruglyak, 1995), the chromosome-wise P-values could not exceed 0.0357 (1/28). With that criterion, 13 QTL were detected, considering independent QTL effects for the different traits. With an expected number of false significant effects of 6, this gives a FDR of 46%.
| DISCUSSION |
|---|
|
|
|---|
The calculated FDR of 46% indicates that there is considerable uncertainty attached to the detected QTL in the genome-wide scan conducted here. To validate these detected QTL, further research including GxE is required. Searches for QTL x environment interactions in other populations would be useful to validate the QTL. Using a dense marker map would not only serve as a validation, but also determine the positions of the QTL more accurately.
The QTL detected with model [3] only are not expected to be highly supported in the existing literature because none of the references included environmental information and interaction in their QTL models. Several of the QTL detected with model [3] only were detected in regions where no QTL have previously been reported for this trait (Khatkar et al., 2005). The possibility of detection of QTL having linear reaction norms, without assuming interactions in the analysis, depends on whether the QTL contribute to genetic variation in the average environments of the animals in the study. If the mean environment is close to the crossing point of the reaction norms, the QTL effects are much harder to detect than if the mean environment is far from the crossing point. In addition, the interaction pattern of the QTL might affect the power of detection (Lillehammer et al., 2007).
Reaction norms of BV have shown that considerable scaling effects exist, but little reranking is reported, except in extreme environments (Calus et al., 2002; Kolmodin et al., 2002). For single genes, the situation seems to be different. All the QTL detected with a significant interaction term had reranking of alleles within the environments from which the data originated (Figures 1
, 2
, and 3
), with the crossing point often close to the mean environment. This may explain why these QTL have not been fixed by strong selection for production traits. Lillehammer et al. (2007) found that the random regression method detects QTL with reranking of effects more easily than QTL with scaling effects.
Reranking among alleles makes animal breeding more complicated, because the best allele in one environment is not necessarily the best in another environment. This is a challenge if marker-assisted or gene-assisted selection is performed and the offspring are expected to produce in many different environments. However, it also raises the opportunity to breed animals especially adapted for production in specific environments.
For the QTL detected for milk and protein yield on BTA7 and BTA16, the correlation between slope and intercept was close to 1 or 1. This confirms that the same gene is affecting both slope and intercept. It also indicates that only 2 alleles are segregating in the population, or at least that 2 alleles are much more frequent than the others (Lillehammer et al., 2007).
Some of the QTL produced an estimated slope variance that could explain a large proportion of the total GxE (Table 2
). The high slope variances reported for the QTL indicate that the model overestimates the QTL x environment effects. This might happen because of a heterogeneous residual variance. The residual variance itself might change when the environment is changing, and in addition, three-fourths of the genetic variance and three-fourths of the QTL variance are included in the residual variance because a sire-model was used for these 2 terms. A random regression model that accounted for heterogeneous residual variance would be preferable (Kolmodin et al., 2002). The variances of the interaction terms are small compared with the variances in intercept. Slope variances from the genetic and QTL parameters that are included in the residual variance will therefore have little impact on the residuals. Simulations have shown that the sire model used in this study does not give a high proportion of falsely significant interaction terms (Lillehammer et al., 2007). Even though the predicted interaction effects of the QTL might be overestimated, the QTL probably have real interaction effects with the environment.
When a few genes control a big part of the total variance, marker-assisted selection can be quite efficient, especially, if markers are identified to be in strong linkage disequilibrium with the gene. The question is whether, depending on the environmental sensitivity of the QTL, the QTL represents important characteristics for selection. So far, selection has focused on changing the average production over all environments. The QTL detected with environmental interaction have small average effects and will therefore be bad candidates for selection as long as average production is the measurement for the gene effect. However, it might be desirable to change not only the average production, but also to control production over a range of environments. The QTL with a large QTL x environment interaction will be very well suited to change the environmental sensitivity in the population. Whether it is desirable to change the environmental sensitivity depends on which environments we expect to appear in the future. Some different breeding strategies are described below.
The production potential of the offspring of the selected parents will be a combination of their genetics, the environments they are given, and the interactions between genes and environments. If the future environments were known, the future production of the offspring could be predicted from the genotypes of the parents combined with the future environmental information. The situation is more complicated when the future environment is unknown and expected to be highly variable. In the long term, genetic selection for environmental sensitivity can be performed by selecting different lines specialized to different environments (Calus et al., 2005). The QTL by environmental effects detected in this study may facilitate selection for such lines.
The protein yield QTL on BTA2 and the milk QTL on BTA7 had positive correlations between slope and intercept (Table 2
). This has the practical consequence that a positive allele will be even more positive if the environment is improved. Selection for the best allele in todays environment will create a population that is not well suited for less-favorable environments. As long as the environment is expected to improve in the future, the allele with a positive slope is a good selection candidate.
The QTL for milk and protein yield on BTA16 have negative correlations between slope and intercept. This means that a positive allele in an average environment will have a lower effect in an improved environment. Hence, assuming environmental improvement over time, a currently unfavorable allele on average may be the best allele in the long term.
The protein yield QTL on BTA7 has no effect in an average environment, and represents a QTL suitable for changing the environmental sensitivity without affecting the average production. The QTL can be of great importance if the environment is changed in one direction or another, even though it seems to have no average effect. However, as environment changes, the average herd-level will change as well, and the QTL might get an average effect. This might be the case over time or when applying the results in other populations or environments than the Norwegian. Before any QTL alleles are selected for, their effects in the expected future environment(s) should be studied.
Historically, production levels have increased from both selection and environmental improvements. Further improvement of the environment will probably continue to increase production in the future. Alleles with positive slope effects will have positive effects in such improved environments. On the other hand, these alleles will make animals sensitive to poor environments. Thus, selecting for alleles with steep reaction norms is a "high risk, high output" approach.
The opposite idea is to select for flatter reaction norms in the population. The environment, as defined here, produces an average reaction norm of 1 in the population. If a lower average response to environmental changes is obtained, the measured environmental variation decreases, whereas the average reaction norm remains 1. Animals with a lower environmental sensitivity can be desirable if the future environment is uncertain, because such animals will respond less to changes in the environment. The animals having alleles described by a negative environmental sensitivity for the detected QTL will have a lower than average environmental sensitivity. Selecting for these alleles will create animals well suited to tolerate bad environments. This is thus a "low risk, low output" approach. Both the QTL detected on BTA7 and the QTL on BTA16 facilitate selection for a lower environmental sensitivity. However, selecting for the alleles with negative slopes will result in lower production in the highest producing environments. Hence, the strategy is bad if the environment is expected to improve in the future.
A third possibility is to use the information about the GxE to breed specialized animals for certain environments. This can be accomplished either by including different specialized lines in the national breeding scheme, or by making the genotypic information of the bulls available for breeders to select bulls well-suited for their specific farm environments. To establish multiple breeding schemes and select different sires for production in different environments is expensive, and has been found to be disadvantageous (Weigel et al., 1999). Regardless of which strategy is chosen, GxE should be taken into account before a specific allele is selected for.
| CONCLUSIONS |
|---|
|
|
|---|
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
Received for publication December 12, 2006. Accepted for publication March 20, 2007.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
M. Lillehammer, M. E. Goddard, H. Nilsen, E. Sehested, H. G. Olsen, S. Lien, and T. H. E. Meuwissen Quantitative Trait Locus-by-Environment Interaction for Milk Yield Traits on Bos taurus Autosome 6 Genetics, July 1, 2008; 179(3): 1539 - 1546. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |