|
|
||||||||
1 Institut für Tierzucht und Tierhaltung, Christian-Albrechts-Universität, D-24098 Kiel, Germany
2 Forschungsinstitut für die Biologie landwirtschaftlicher Nutztiere, D-18196 Dummerstorf, Germany
3 Institut für Tierzucht und Haustiergenetik der Justus-Liebig-Universität, D-35390 Gießen, Germany
4 Lehrstuhl für Tierzucht, Technische Universität München, D-85354 Freising, Germany
5 Vereinigte Informationssysteme Tierhaltung w.V., D-27283 Verden, Germany
Corresponding author: J. Bennewitz; e-mail: jbennewitz{at}tierzucht.uni-kiel.de.
| ABSTRACT |
|---|
|
|
|---|
Key Words: DGAT1 multiple allele quantitative trait locus dairy cattle
Abbreviation key: DGAT1 = acyl-CoA:diacylglycerol acyltransferase1, DYD = daughter yield deviation, K232A = lysine to alanine substitution
| INTRODUCTION |
|---|
|
|
|---|
Recently, Winter et al. (2002) and Grisart et al. (2002) independently identified the acyl-CoA:diacylglycerol acyltransferase1 (DGAT1) gene as a strong candidate gene for the QTL for milk production traits in the proximal region of chromosome 14. The DGAT1 gene encodes an enzyme that plays a central role in the synthesis of triglycerides. It catalyzes the reaction of diacylglycerol and fatty acyl-CoA to form triglycerides. Both studies mapped DGAT1 close to the region of the QTL for milk production traits on the bovine chromosome 14 mentioned above and found a nonconservative substitution of lysine by alanine (K232A) in DGAT1 caused by an adenine/adenine to guanine/cytosine dinucleotide substitution at position 10433 and 10434, respectively, in exon number VIII.
The DGAT1 effects in the New Zealand and in the Dutch Holstein dairy cattle population were estimated by Grisart et al. (2002), in the New Zealand Holstein dairy cattle population by Spelman et al. (2002), and recently, in the German Holstein dairy cattle population by Thaller et al. (2003). All authors reported a strong allele substitution effect on milk, fat, and protein yield indicating that DGAT1 is a major gene. The lysine variant increased fat yield and decreased protein and milk yield, the alanine variant affected these traits in an opposite direction compared to the lysine variant. The mode of inheritance is almost completely additive (Grisart et al., 2002), the allele frequency of the lysine variant in the German Holsteins was estimated to be 0.53 (Thaller et al., 2003).
From other major genes it is known that in general more than 2 alleles are segregating at a particular locus. For example, at the Extension (E) locus, identified as the melanocyte-stimulating hormone (MSH) receptor locus (Robbins et al., 1993) and responsible for most of the variation of cattle coat colour, three alleles are known. These are ED (dominant black), E+ (combination of black and red), and e (recessive red). At the mh locus, causing double-muscling in cattle, Georges et al. (1998) reported at least 5 different alleles with impact on the trait. However, until now only two alleles were reported at the DGAT1 locus.
In the present study, our objective was to determine whether there exists an additional source of genetic variance attributable to genes and/or QTL on chromosome 14 for the traits milk, fat and protein yield, and fat and protein percentage besides the diallelic DGAT1 effect K232A.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Genotypes and Phenotypes
The families were genotyped for 8 microsatellite markers, as well as the KIEL_E8 and DGAT1 polymorphisms. Microsatellite and KIEL_E8 genotypes were determined by automated fragment analysis (A.L.F., Amersham-Pharmacia or ABI377, Perkin-Elmer). Genotyping at the DGAT1 locus was carried out by a PCR-RFLP test based on the K232A substitution in DGAT1 (Kaupe et al., unpublished), the lysine variant was denoted as the Q allele. The genotypes were checked for their agreement with the Mendelian laws of inheritance and conflicts were solved, if possible. The marker data were stored in the ADRDB database (Reinsch, 1999). Multipoint marker maps were computed using CRIMAP (Green et al., 1990). The marker order is as follows (estimated distances from the first marker in cM estimated by the Kosambi mapping function are in parentheses): KIEL_E8 (0.0), DGAT1 (0.3), ILSTS039 (1.3), CSSM66 (8.7), RM180 (41.5), RM11 (56.5), BM4630 (57.6), RM192 (75.8), BM4513 (113.3), and BL1036 (139.8). Note that the TWOPOINT option of CRIMAP, which considers only two particular loci at a time (Green et al., 1990), estimated a recombination rate of 0% (lod score 69.42) between KIEL_E8 and DGAT1. The marker order is in agreement with previously published marker maps (e.g., Kappes et al., 1997). For a more detailed description of the markers and the genetic map see Thomsen et al. (2000).
The following 5 traits were considered: milk yield, fat yield, protein yield, fat percentage, and protein percentage of the milk. For the 3 yield traits, daughter yield deviations (DYD) were calculated from the production records of the daughters. The DYD are the average phenotypes of daughters, corrected for fixed effects and the genetic contribution of the daughters dam and were obtained as by-products from the routine national sire evaluation. The DYD were calculated for the first lactation and as a weighted average for the first 3 lactations. The latter yield trait estimates were slightly more variable than the estimates for the first lactation only. No DYD were available for the 2 percentage traits, therefore deregressed estimated breeding values were used. The deregressed values were obtained by dividing the estimated breeding value by the square of its reliability. The DYD and the estimated breeding values were taken from the routine national sire evaluation in February 2001 and in February 2003, respectively, no DYD were available from February 2003. For a more detailed description of the phenotypes and genetic parameters of the traits as used for the national sire evaluation see Table 1
.
|
![]() | (1) |
where yij is the phenotype of jth sire of the ith grandsire, gsi is the fixed effect of the ith grandsire, bik is the regression coefficient for the ith grandsire at the kth chromosomal location, tpijk is the probability of the jth sire receiving the chromosomal segment for gamete one (gamete numbers were randomly assigned) from the ith grandsire at the kth chromosomal position, and eijk is the random residual. A weighted regression was performed for the yield traits. The corresponding weights were one over the variance of the DYD. For the derivation of the formulas see appendix A. An unweighted regression was performed for the 2 percentage traits. The QTL transition probabilities were calculated for each cM and each progeny with the software BIGMAP (Reinsch, 1999), the regression was performed using the software ADRQLT (Reinsch, 1999).
The respective null hypothesis is that no QTL segregates on the chromosome 14, i.e., H0: bik = 0. The alternative hypothesis is that a QTL segregates on this chromosome, i.e., H1: bik
0. To account for multiple testing, test statistic critical values for the QTL effect were calculated for each trait separately by the use of the permutation test (Churchill and Doerge, 1994). Briefly, by shuffling the phenotypic data 10,000 times randomly while keeping the marker data constant, the genotype-phenotype association was uncoupled and, hence, after applying the mapping procedure every QTL estimate indicated a type I error by definition. The chromosomewise critical values
= 1, 5, and 10% were calculated by taking the 99th, 95th, and 90th quantile from the corresponding distribution of the test statistic, respectively. To aim a situation of a whole genome scan with 30 chromosomes, genomewise error probabilities were calculated from the corresponding chromosomewise error probabilities using the Bonferroni correction:
![]() |
Model 1 provided only information about whether a QTL is segregating for these traits in this design and was used only for comparison purposes for the following model.
The second model regressed the phenotypes on 2 regression variables. The first was the number of copies of the lysine alleles at the diallelic DGAT1 polymorphism, and the second was the QTL transition probability as described above. The following model was applied:
![]() | (2) |
where xij is the number of lysine alleles (0, 1, or 2), b1 is the corresponding regression coefficient and the remaining terms are as described in (1). The regression coefficient of the QTL transition probability represented an effect that was corrected for the diallelic DGAT1 and is henceforth denoted as a conditional QTL effect. Again, a regression was performed for each cM on the chromosome using the software ADRQLT (Reinsch, 1999) with same weights as used in the first model. From Thaller et al. (2003) we know that the diallelic DGAT1 polymorphism is significant in this experimental design for all traits investigated here. Therefore, the respective null hypothesis is that the diallelic DGAT1 polymorphism shows a significant effect, but the second regression term is not significant. That is, all the genetic variance due to this chromosome was explained by the diallelic DGAT1 polymorphism, and the null hypothesis is H0: b1
0 and bik = 0. The alternative hypothesis is that besides the 2 alleles of DGAT1 there must be another source of genetic variation attributable to this chromosome. Thus, the alternative hypothesis is H1: b1
0 and bik
0.
The position with the highest test statistic of the conditional QTL effect was taken as the estimated position of a putative QTL that goes beyond the diallelic DGAT1 effect. Chromosomewise error probabilities were calculated by the permutation test. In contrast to model (1), the QTL transition probabilities were shuffled 10,000 times while keeping the phenotypic data constant. By doing this, the marker genotype-phenotype association was uncoupled but the DGAT1-phenotype association was kept intact. After applying the QTL analysis, every estimate for the conditional QTL effect indicated a type 1 error per definition. Test statistic critical values were then calculated as described above. In addition, the estimated QTL positions from the evaluated permutations were recorded for permutation bootstrapping. Again, to focus on the situation of a whole genome scan, genomewise error probabilities were calculated from the corresponding chromosomewise error probabilities using the Bonferroni correction described above.
For the second model (2), confidence intervals for the estimated positions of putative conditional QTL were computed by permutation bootstrapping (Bennewitz et al., 2002), performing 250 bootstrap samples. The distributions of the 250 QTL position estimates from the evaluated bootstrap samples along the chromosome were corrected for the marker impact using the distributions of the conditional QTL position estimates from the corresponding evaluated permutations. From the marker corrected distributions noncentral 95% confidence intervals were computed as described by Bennewitz et al. (2002).
The third model focused only on the estimated position of DGAT1 (i.e., at the first position of the chromosome). It regressed the phenotypes on the number of lysine copies at the DGAT1 locus, on the QTL transmission probability (i.e., the conditional QTL effect) at that particular position k (k = 0 cM), and it included a third variable due to KIEL_E8. Because the diallelic marker KIEL_E8 showed a linkage disequilibrium with the QTL (Looft et al., 2001), it was possible to include the probability of each offspring j inheriting the KIEL_E8 marker allele "1" from the population via the maternal path (tp_popij) as a regression term:
![]() | (3) |
The values of tp_pop are either zero or one when it could be determined with certainty whether the individual received the marker allele "1" from the population pool. Otherwise, it was equal to 0.42 if the inheritance of the marker allele "1" could not be determined. This value was the allele frequency of the allele "1" of KIEL_E8, estimated with a maximum likelihood approach (Appendix B). The respective null hypothesis is that all the genetic variance due to this chromosomal position was explained by the diallelic DGAT1 effect (H0: b1
0 and bik = b2 = 0). The alternative hypothesis is that the 2 alleles of DGAT1 do not explain all of the genetic variance at this chromosomal position, and hence, the conditional QTL effect and/or the KIEL_E8 regression term will show significant effects as well (H1: b1
0 and bik
0 and/or b2
0). For example, a significant effect of the KIEL_E8 regression term might appear if there is at least a third allele segregating at the DGAT1 locus that is in linkage disequilibrium with one allele of KIEL_E8. A significance of the conditional QTL effect might occur if either a third allele is segregating or at least one sire is heterozygous for a second QTL that is in close linkage with DGAT1. Again, a weighted analysis was performed for the yield traits. The analysis was done using the GLM procedure of the SAS package (SAS Inst., Inc., Cary, NC).
The advantage of model 3 over model 2 is that it utilized all available information at this chromosomal region (i.e., diallelic DGAT1 polymorphism, information of genetic markers via QTL transition probability and the linkage disequilibrium of KIEL_E8 and the QTL). On the other hand, only model 2 allowed examination of all positions of this chromosome, and hence, detection of further putative conditional QTL.
Average allele effects of the lysine allele of DGAT1 were estimated first by model 2 at position 0 cM and then by a model where only DGAT1 was included besides the fixed effect of the grandsire (model not shown). Because of the applied granddaughter design, the regression coefficient of the number of lysine alleles at DGAT1 represented one half of the average allele substitution effect in both models (Falconer and MacKay, 1996) for the yield traits and the full average allele substitution effect for the 2 percentage traits. The proportion of the variance explained by the diallelic DGAT1 effect and by the diallelic DGAT1 effect together with the conditional QTL effect at position 0 cM was estimated also.
| RESULTS |
|---|
|
|
|---|
|
No sire was homozygous QQ. This could partly be explained by the fact, that two of the three great-grandsires were homozygous qq at DGAT1. The haplotype analysis of the sires revealed that the marker allele "1" ("2") of KIEL_E8 was always on the same chromosome as the DGAT1 allele Q (q). The allele frequency of the allele l of KIEL_E8 was 0.48. Around 98% of the offspring, but all sires, showed corresponding genotypes at DGAT1 and KIEL_E8 (either Q,Q and 1,1; Q,q and 1,2; or q,q and 2,2).
As expected, model 1 revealed genomewise significant effects for the QTL effect for all traits investigated. As results of this model only the plots of the test statistics along the chromosome are shown (Figures 1
to 5
).
|
|
|
|
| DISCUSSION |
|---|
|
|
|---|
A visual inspection of the test statistic plots (Figures 1
to 5
) revealed a pronounced maximum at the chromosomal position of DGAT1 (i.e., at 0 cM) for all traits, if DGAT1 was not included (model 1). Given the first explanation (at least one more allele at DGAT1) would be true, a similar shape of the test statistic plot would be expected, if DGAT1 was included (model 2). This is, however, only the case for fat percentage (Figure 4
). For fat yield, protein yield, and protein percentage, the location at which the test statistic is maximal is more distal (Figures 1
, 2
, 3
, and 5
). Additionally, the lower confidence interval bound for protein percentage is around 10 cM (Table 3
). Following this, the hypothesis of further allele(s) segregating at DGAT1 seems to be more likely for the 2 fat traits than for the 2 protein traits. For the latter 2 traits, the theory of a second QTL more distal on the chromosome would be likely. It may be possible that these putative additional QTL were superposed by the strong DGAT1 effect, and therefore, not detected yet. Note, that none of the test statistic plots produced by model 2 (DGAT1 included) showed a pronounced maximum, and the estimated confidence intervals were large.
|
|
|
|
Putative other alleles might be a result of an undetected nonconservative mutation at another position of this exon or in another exon of DGAT1. Assuming that a further nonconservative mutation in this gene exists, 4 alleles (actually, 4 different haplotypes within this gene) are possible, resulting from combinations of 2 single mutations. Four alleles are possible at position 10433/10434 assuming a combination of the 2 single mutations: an adenine/adenine to guanine/adenine substitution resulting in glutamine, and an adenine/adenine to adenine/cytosine substitution resulting in threonine. The procedure we used for scoring the DGAT1 genotypes is RFLP based. Everything other than guanine/cytosine at these particular positions would be declared as adenine/adenine.
Furthermore, conservative mutations in exons and those detected in introns and in the UTR can influence the phenotype. There is evidence for alternative splicing of DGAT1 in human (http://www.ncbi.nlm.nih.gov/IEB/Research/Acembly/av.cgi?c=Locusid&org=9606&L=8694). Therefore, silent mutations could have an impact when located at positions necessary for correct splicing (Cartegni et al., 2002, Miriami et al., 2003).
Finally, Ludwig et al. (2002) reported 5 polymorphisms in the human DGAT1 promoter that affected several traits in Turkish women (i.e., body mass index, high density lipoprotein levels and blood pressure). This study indicates that a mutation in the DGAT1 promoter might also be responsible for the additional effect. To resolve this, further sequencing effort is needed. Based on our data it was not possible to postulate more than 3 alleles, because no sire showed the DGAT1 genotype QQ. It is also possible that both the multiple segregating allele theory and the hypothesis of a second QTL are correct.
The successful mapping of this QTL by many QTL experiments might be a result of approximate equal frequencies of the 2 DGAT1 alleles, because according to the Hardy-Weinberg equilibrium one half of the sires are expected to be heterozygous under equal allele frequencies at a diallelic locus. However, the expected number of heterozygous sires might be even higher when considering the putative third allele. In this study we genotyped 8 from 16 sires as DGAT1 heterozygous, but if the multiple allele theory at this locus is correct, three additional sires are segregating for fat percentage at this locus (sires F1, F2 and F4, Table 5
).
The third hypothesis for explaining our results (interaction of DGAT1 with further genes) would be supported by detecting significant effects of the conditional QTL in DGAT1 heterozygous families. However, as mentioned above, homozygous families also revealed significant effects for the conditional QTL. Further, we applied a linear model taking the family, the DGAT1 genotype, the KIEL_E8 genotype and the interaction of these 2 loci as fixed effects into account. The results showed no significant effects for the interaction term (not shown). Of course, an interaction with other genes not in linkage with DGAT1 might exist resulting in increased or decreased effects. A statistically significant interaction of DGAT1 with other genes is not a likely explanation for the results found in this study. This is even more plausible because the effects of DGAT1 on the traits investigated were at a very high level.
| CONCLUSIONS |
|---|
|
|
|---|
| Appendix A. Estimating the Variance of the Daughter Yield Deviation |
|---|
|
|
|---|
![]() |
where
,
, and
are the additive genetic, permanent environment and residual variance of the first lactation and ntmi is the number of test milking of daughter i in the first lactation. The elements of the off-diagonal are:
![]() |
which is the genetic covariance of observations between half-sibs. According to Falconer and MacKay (1996) it is possible to factor out the phenotypic variance
. After little algebra the formula for estimating the variance of the DYD for the first lactation of a sire can be written in a more convenient form as:
![]() |
where h2 and p2 are the heritability and the permanent environment of the first lactation.
The estimation of the variance of the DYD for more than one lactation will be demonstrated in a short example. Assume a sire has 2 daughters, one with observations in the first 2 and one only in the first lactation. Further, assume that the yield deviations were not corrected for the effect of the permanent environment. The covariance matrix can be written as:
![]() |
where
,
, and
are the additive genetic, permanent environment and residual variance of lactation j, respectively,
and
are the covariance of the additive genetic and permanent environment of lactation j and j' (j
j'), respectively, and ntmij is the number of test milking of cow i in lactation j. Extending the formula to more daughters and more lactations is straightforward. In the present study, the genetic parameters for both covariance matrices were taken from Reents et al. (1995).
| Appendix B: Estimating Allele Frequencies |
|---|
|
|
|---|
![]() |
where n is the number of distinct alleles at the marker under consideration, ni is the number of the ith allele from founder animals (i.e., either from founder sires or unequivocally descending from unknown dams), nij is the number of half-sibs which share the same heterozygous genotype with their sire.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
Received for publication June 23, 2003. Accepted for publication September 14, 2003.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
S. Neuner, R. Emmerling, G. Thaller, and K.-U. Gotz Strategies for Estimating Genetic Parameters in Marker-Assisted Best Linear Unbiased Predictor Models in Dairy Cattle J Dairy Sci, November 1, 2008; 91(11): 4344 - 4354. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Naslund, W. F. Fikse, G. R. Pielberg, and A. Lunden Frequency and Effect of the Bovine Acyl-CoA:Diacylglycerol Acyltransferase 1 (DGAT1) K232A Polymorphism in Swedish Dairy Cattle J Dairy Sci, May 1, 2008; 91(5): 2127 - 2134. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. P. W. de Roos, C. Schrooten, E. Mullaart, M. P. L. Calus, and R. F. Veerkamp Breeding Value Estimation for Fat Percentage Using Dense Markers on Bos taurus Autosome 14 J Dairy Sci, October 1, 2007; 90(10): 4821 - 4829. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Szyda and J. Komisarek Statistical Modeling of Candidate Gene Effects on Milk Production Traits in Dairy Cattle J Dairy Sci, June 1, 2007; 90(6): 2971 - 2979. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Gautier, A. Capitan, S. Fritz, A. Eggen, D. Boichard, and T. Druet Characterization of the DGAT1 K232A and Variable Number of Tandem Repeat Polymorphisms in French Dairy Cattle J Dairy Sci, June 1, 2007; 90(6): 2980 - 2988. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Kaupe, H. Brandt, E-M. Prinzenberg, and G. Erhardt Joint analysis of the influence of CYP11B1 and DGAT1 genetic variation on milk production, somatic cell score, conformation, reproduction, and productive lifespan in German Holstein cattle J Anim Sci, January 1, 2007; 85(1): 11 - 21. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Sanders, J. Bennewitz, N. Reinsch, G. Thaller, E.-M. Prinzenberg, C. Kuhn, and E. Kalm Characterization of the DGAT1 mutations and the CSN1S1 promoter in the German Angeln dairy cattle population. J Dairy Sci, August 1, 2006; 89(8): 3164 - 3174. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Furbass, A. Winter, R. Fries, and C. Kuhn Alleles of the bovine DGAT1 variable number of tandem repeat associated with a milk fat QTL at chromosome 14 can stimulate gene expression. Physiol Genomics, March 13, 2006; 25(1): 116 - 120. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |