|
|
||||||||



* Faculty of Veterinary Medicine, Aristotle University of Thessaloniki, Greece
Roslin, Edinburgh, Scotland, United Kingdom
Merial Limited, Duluth, GA 30096
Merial S.A.S., Lyon, France
# Sustainable Livestock Systems Group, Scottish Agricultural College, Edinburgh, Scotland, United Kingdom
1 Corresponding author: banos{at}vet.auth.gr
| ABSTRACT |
|---|
|
|
|---|
Key Words: leptin growth hormone diacylglycerol acyltransferase (DGAT1) milk
| INTRODUCTION |
|---|
|
|
|---|
Several genes localized within QTL are thought to influence important traits in dairy cattle. Single nucleotide polymorphisms (SNP) are mostly used to genetically characterize such chromosomal regions. Leptin (LEP), leptin receptor (LEPR), growth hormone receptor (GHR), and diacylglycerol acyltransferase (DGAT1) are some pertinent examples.
The LEP gene, located on Bos taurus autosome (BTA) 4, encodes leptin, a protein hormone secreted from white adipose tissue that is involved in feed intake, energy partitioning, and metabolism of the cow (Liefers et al., 2002; Lagonigro et al., 2003). Consequently, the gene may also affect traits such as milk production, energy balance, and reproduction (Buchanan et al., 2002; Liefers et al., 2002; Silva et al., 2002). Furthermore, variation in leptin concentration during a cows pregnancy is affected by polymorphisms in the LEPR gene (Liefers et al., 2004), located on chromosome BTA3, suggesting possible associations of the latter with cow traits linked to leptin.
Polymorphisms in the GHR gene, found on chromosome BTA20, have also been linked to milk production traits (Falaki et al., 1996; Blott et al., 2003). This gene controls the function of the receptor of the growth hormone, whose key role in mammary gland development and milk production has been well established (Bauman et al., 1985). The impact of the gene on other economically important traits in dairy cattle has not been fully investigated.
The DGAT1 gene, which is located on the BTA14 chromosomal site and encodes acyl coenzyme A:diacyl-glycerol acyltransferase, is known to catalyze the last step in triglyceride synthesis (Grisart et al., 2002) and is associated with milk yield and composition (Sanders et al., 2006; Gautier et al., 2007; Szyda and Komisarek, 2007). Its association with other important cow traits merits further attention.
The objective of this study was to determine if polymorphisms in the LEP, LEPR, GHR, and DGAT1 gene loci can provide useful genetic markers for daily milk yield, feed intake, and feed conversion as well as live weight, BCS, and body energy traits in UK Holstein cows.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Blood samples were drawn from these cows to determine their genotypes on the LEP, LEPR, GHR, and DGAT1 gene loci, as described below. Cow genotypes were then matched to individual cow production files. Daily phenotypic records were collected on the farm for milk yield, feed intake, and dry matter intake. These records were available for the first 3 lactations. Measures of feed conversion to milk yield were calculated as the ratios of feed and dry matter intake over milk yield. Furthermore, weekly first-lactation live weight and BCS records were obtained. These traits were combined to produce estimates of total body energy content and cumulative effective energy balance. The first of these energy traits is a measure of the actual energy level of the cow on the day of recording and the second a measure of the change in energy status as it accumulates throughout lactation. Detailed description of these traits can be found in Banos et al. (2006). Descriptive statistics of all traits considered in the present study are shown in Table 1
.
|
One additional SNP was determined at each of the LEPR (Liefers et al., 2004), GHR (Blott et al., 2003), and DGAT1 (Grisart et al., 2002) gene loci. The SNP at the LEPR and GHR loci characterize a cytosine to thymine and a thymine to adenine base substitution, respectively, whereas a lysine to alanine amino acid change occurs at the DGAT1 locus.
All genetic analyses were conducted by Igenity (Igenity is a registered trademark of Merial in the United States and elsewhere). Briefly, blood samples were retained from all cows that contributed to the database, and DNA was extracted following a phenol-chloroform based protocol. The genotyping process used was allele-specific primer extension reactions analyzed with matrix-assisted laser desorption/ionization time-of-flight mass spectrometry. Table 2
summarizes the 9 SNP in the 4 loci.
|
A minimal set of haplotypes comprising the 6 SNP in the LEP gene locus were determined using parsimony, both by genotype observation and custom software. Parsimony, as described by Clark (1990), is based on the assignment of haplotypes to genotypes that minimizes the number of unassigned or unresolved haplotypes (termed orphans by Clark, 1990). In practice, definite and unambiguous haplotypes are identified first from homozygous individuals and then from individuals with only 1 heterozygous SNP locus (Tier, 2006). Following that, unique pairs of haplotypes are assigned, where possible, to their genotypes. In the present study, data were restricted to 491 cows with all 6 LEP SNP available. Given the animal genotype for these 6 SNP, an algorithm was developed to test every plausible pair of haplotypes against the genotypes. A putative pair of haplotypes that corresponded to the LEP genotype was assigned to each of these animals.
Linkage disequilibrium among the 6 SNP loci was derived from the haplotypes and was calculated as D(i,j) = P(i,j) – P(i)P(j), for specific alleles at SNP loci i and j, using the haplotype [P(i,j)] and allele [P(i), P(j)] frequencies for each SNP locus inferred from them. The D coefficients were then normalized by dividing them by the theoretical maximum for the observed allelic frequencies (Lewontin, 1964), where the latter was the smaller of P(i)[1 – P(j)] and [1 – P(i)]P(j) for D
0 or the larger of –P(i)P(j) and –[1 – P(i)][1 – P(j)] for D < 0. The D coefficients were also normalized by dividing them by the geometric mean of the binomial variances to obtain the final disequilibrium values. In the case of 2 biallelic loci, the latter is equivalent to a correlation coefficient defined by the quotient between D and the square root of the product of the 4 allelic frequencies, as shown in equation 1:
![]() | [1] |
where r = final value of linkage disequilibrium (Hill and Robertson, 1968). As this correlation approaches unity, the second locus becomes redundant.
Statistical Analysis
Daily milk production and feed trait records (feed and dry matter intake and their ratios over milk yield) were analyzed with the following random regression model (model 2):
![]() | [2] |
where Y = daily milk production or feed trait record; m = overall mean; ys = fixed effect of year-by-season of calving interaction (35 classes); g = fixed effect of genetic line (2 classes); f = fixed effect of diet group (2 classes); lc = fixed effect of lactation number (3 classes); ph = linear regression on percentage of Holstein (North American) genes; age = linear regression on age at calving; GEN = regression on individual SNP alleles or haplotypes expressed as 0, 1, or 2 as described below;
kbkPk(d) = fixed regression of the overall lactation curve (bk) described by fourth-degree Legendre polynomials (Pk) on DIM (d);
kakPk(d) = random regression of the individual cow effect (ak) described by third-degree Legendre polynomials (Pk) on DIM (d) (the term included pedigree genetic relationships among animals); and e = random residual.
In model 2, the SNP allelic effect was described as 0, 1, or 2, in each case corresponding to as many copies of the substitution SNP base. For example, LEP SNP C207T includes the bases cytosine and thymine. For this SNP, the GEN variable in model 2 would be assigned values 0, 1, and 2 for cows with TT, CT, and CC genotypes, respectively. This yields the effect of base C, equivalent to the allele substitution effect at this SNP locus at the observed frequency. Thus, model 2 is an allele substitution model.
The analysis was repeated for each SNP locus and trait. In a separate series of analyses, interactions between alleles in a SNP locus were also fitted to assess possible dominance effects. In yet another set of analyses, LEP haplotypes were fitted in the model, each one separately, instead of individual LEP SNP.
In model 2, the random regression corresponded to the polygenic effect of each cow. All known pedigree information was included, bringing the total number of animals in the analysis to 968.
Live weight, BCS, energy content, and cumulative effective energy balance, which henceforth will be collectively referred to as body energy traits, were analyzed with a similar model, except that lactation number was omitted (because there were only first-lactation records) and milk yield on the day of measurement was included as a covariate, to account for the phenotypic correlation between milk production and body energy.
The trajectory in these analyses was defined by week of lactation. Otherwise, the exact same analyses were conducted as for milk and feed traits. Variance components and fixed effect solutions were calculated using the ASREML software package (Gilmour et al., 2002).
In each of the above analyses, the null hypothesis was of no SNP or haplotype effect on the trait of interest. Inevitably, multiple hypothesis tests were made because several traits were analyzed. However, these traits are genetically correlated to each other, so not all analyses constitute truly independent statistical tests. To account for multiple testing, we considered the analyses of (i) milk yield and feed and dry matter intake, (ii) feed conversion, and (iii) body traits as distinct, independent hypothesis tests. A Bonferroni correction, based on the Holm-Bonferroni method (Holm, 1979), was then implemented to separately test each hypothesis. This method uses the ratio of the original significance level (e.g., 0.05) over the number of hypothesis tests as the new significance threshold. The method works sequentially starting with the test with the lowest unadjusted P-value, which it compares with 0.05/ k, where k is the number of hypotheses tests. If this hypothesis is rejected, the next unadjusted P-value is compared with 0.05/(k – 1) and so on, until a hypothesis is not rejected (Holm, 1979).
| RESULTS |
|---|
|
|
|---|
Grading-up selection favoring certain alleles and the finite number of parents (in populations undergoing sexual reproduction) may lead to departures from Hardy-Weinberg equilibrium, that is, the classical genotypic frequencies of p2, 2p(1 – p) and (1 – p)2, where p is the frequency of 1 of the 2 alleles. The only locus showing a statistically significant (P < 0.05) departure from equilibrium was the DGAT1 gene locus, with a deficit of heterozygotes in favor of both homozygotes of approximately 10% from the expected value. In contrast, the polymorphism in the GHR gene locus displayed an excess of heterozygotes, of approximately 9% over expectations, although this deviation was only suggestive, because it did not attain statistical significance (P > 0.05). These departures are relatively small and are difficult to interpret without further data and more detailed analysis fully accounting for the population structure. The estimates of allele frequencies given in Table 2
assume random mating.
Eight LEP haplotypes were identified by parsimony. The order of the 6 corresponding LEP SNP was C207T, C528T, A1457G, C963T, A252T, and C305T, according to their physical position on the gene; the first 4 SNP are located in the promoter and the last 2 in the exon 2 region. Table 3
shows the number of copies for each of 7 of the haplotypes. In total, the number of copies is twice the number of animals (490 cows with all LEP SNP identified) because each cow bears 2 haplotype copies. The remaining haplotype pertained to 1 animal and could not be precisely defined but was of the form TCXCAT, where X is most likely to be A. This cow was excluded from the analysis. For all other animals, the allocation of pairs of haplotypes was uniquely defined by the observed genotypes and the algorithm developed for this purpose.
|
Linkage disequilibrium values normalized by the geometric mean of the binomial variances ranged from –0.222 to 0.998. Such values approach a magnitude of 1 when the occurrence of the allele at an SNP locus perfectly predicts the allele at the other SNP locus. In this data set, large (>0.99) disequilibrium values occurred among 3 SNP loci: C207T, C963T, and C305T. This means that little is to be gained in genotyping more than 1 SNP from this set of loci. Furthermore, linkage disequilibrium values between these 3 SNP loci and SNP A1457G ranged from 0.765 to 0.770, suggesting that variation in the latter may add some additional information, but is also relatively highly correlated with variation at the first set of 3 SNP loci. In all other cases, linkage disequilibrium values were smaller, supporting the need to genotype the SNP to improve characterization of the LEP gene locus.
Milk and Feed Traits
Table 4
shows the genetic effects on milk production and feed traits when each SNP was fitted individually into the model. These effects refer to substituting the allele shown in Table 4
for the other allele at the corresponding SNP locus. For example, according to Table 4
, substituting one copy of base (allele) A for a copy of base T at the A252T LEP SNP was associated with a significant (P < 0.05) reduction of daily milk yield by 2.3 kg. Similarly, replacing a copy of base A by a copy of base T at the GHR SNP locus decreases the daily milk yield by 0.52 kg, although this effect was not significantly different from zero (P > 0.05).
|
When dominance was fitted in the model, LEP SNP A1457G had a significant (P < 0.05, even after the Bonferroni correction) effect of –0.88 ± 0.37 kg on daily milk yield. The estimate of the additive effect in this case was –0.65 ± 0.32 kg (P < 0.05, but not significant following Bonferroni correction) and represents half the difference between the homozygotes, whereas the dominance coefficient is the deviation from this estimate (Falconer, 1983). Results suggest nearly complete dominance of base A over G at this SNP locus, meaning that milk yield of animals with the AG genotype is much more similar to milk yield of AA animals than to that of GG animals. The estimate of the additive effect when dominance was included was very similar to the estimate from the additive-only model (Table 4
). This is probably the result of allelic frequencies being close to 0.50 in the SNP locus A1457G (Table 2
), meaning that the allelic substitution effect is independent of allele frequency. With this dominance relationship, the magnitude of the allelic substitution effect will increase (or decrease) as G increases (or decreases) in frequency. Dominance effects at other LEP SNP loci were not different from zero (P > 0.05).
The GHR effects on feed and dry matter intake also showed significant (P < 0.05) dominance, with estimated effects of 1.86 ± 0.89 and 0.61 ± 0.23 kg, respectively. However, this result was not significant after the Bonferroni correction. Corresponding allele substitution (additive) effects for base T were 2.85 ± 0.86 kg (P < 0.05, significant after Bonferroni correction) and 0.79 ± 0.22 kg (P < 0.05, significant after Bonferroni correction), slightly greater than those from the additive-only model (Table 4
). These values suggest the T allele may be partially dominant, such that cows with AT genotypes consume feed at a rate more similar to cows with TT than with AA. Because T is the most common allele (frequency of 0.80; Table 2
), the partial dominance reduces the effect of its substitution on selection and breeding.
When an additive-only model is fitted, the estimate obtained is the effect of the allele substitution on breeding value. In such a case, if there is any dominance present, the additive estimate will vary with the gene frequency. As explained earlier, when dominance is fitted explicitly, the additive regression coefficient represents an estimate of half the difference between the homozygotes and the dominance coefficient is the deviation. The additive part assumes additivity given the absence of both heterozygotes. The dominance deviation contributes to the offspring only to a certain degree, which depends on the allele frequency. Therefore, the additive effects from an additive-only model are the appropriate estimates when considering the effect of these loci on selection and breeding.
All previous results pertained to fitting each SNP separately in the model. Fitting all LEP SNP simultaneously would yield marginal effect estimates but also encounter difficulties given the high degree of linkage disequilibrium among the individual SNP loci. In light of this observation, more reliable results may be obtained from estimating substitution effects of inferred LEP gene haplotypes. This allows for some aspects of epistasis among the loci and works with causes of disequilibrium rather than trying to treat them as independent.
Estimates of haplotype substitution effects on milk and feed traits are shown in Table 5
. The CCGTTT haplotype had a significant (P < 0.05) positive impact on milk yield and feed and dry matter intake, which remained significant after the Bonferroni correction. One copy of the haplotype is associated with an additional 3.13 kg of milk being produced and 4.64 and 1.29 kg more feed and dry matter, respectively, being consumed by the cow daily. This haplotype also had the largest magnitude of effect on the 2 feed to milk ratios (in fact reducing the ratio), but estimates were not significantly different from zero. The outcome is consistent with the allele substitution analyses, because it contains the G allele for SNP A1457G and it is also the only haplotype with the T allele for SNP A252T. The TCACAC haplotype was the only other LEP haplotype that seemed associated with a trait, namely daily dry matter intake (–0.31 ± 0.16 kg), but this effect was not significant post-Bonferroni correction.
|
|
The estimates for LEP haplotypes are given in Table 7
. Nonsignificant (P > 0.05) effects were observed, but there was a suggestion for the CCGTTT haplotype to be associated with somewhat heavier animals (P = 0.10) with more body condition (P = 0.12) and more energy content (P = 0.08).
|
| DISCUSSION |
|---|
|
|
|---|
For the DGAT1 and GHR genotypes, the effects on milk yield and its components have been well established during their discovery and validation, primarily by QTL detection in granddaughter designs (Grisart et al., 2002; Blott et al., 2003). The results from the present study are, in general, consistent with previous work in dairy cattle showing that the allele responsible for lysine is usually associated with decreased milk production (Spelman et al., 2002; Thaller et al., 2003; Szyda and Komisarek, 2007). These studies also reported increasing milk fat concentration in animals carrying the lysine variant, but milk components were not included in the present analysis.
At the GHR locus, the studied mutation involved a thymine to adenine base substitution. Results of the present study suggested a positive but not significant (P = 0.19) effect of adenine on daily milk production. Blott et al. (2003) reported favorable effects of the same allele on daughter milk yield deviations of Dutch and New Zealand bulls.
For the most part, the published studies have estimated, by virtue of their design, the effect of an allele substitution and have been less able to detect dominance among the alleles. The degree of dominance was estimable in the present study, but there was no statistically significant evidence for any departure from additivity at either locus for milk yield.
A further benefit of this study in relation to DGAT1 and GHR gene loci is the availability of data on daily feed intake and body energy, which is not routinely recorded in commercial populations. Therefore, little information has been available on the associations of these loci with such traits. In the present study, the DGAT1 allele responsible for reduced milk production appeared to have a marginal positive effect on cumulative effective energy balance, suggesting that carriers are more likely to conserve energy than expend it for milk production. Furthermore, we found a significant impact of the GHR gene locus on feed and dry matter intake, with the thymine base allele being associated with increased intake and feed to milk ratios and, consequently, poorer feed conversion. There was also evidence that the thymine base allele was partially dominant in these actions. The effect of allelic substitution at the GHR SNP locus on these traits will increase as the allele frequency for adenine, which is likely to be positively associated with milk production, increases. Cows carrying this base also accumulated more energy balance over a lactation (87.3 MJ/copy). This appears to be the case when efficient conversion of feed to milk does not necessitate mobilization of the animals energy reserves, which accumulate as lactation progresses. This picture may reflect the potential impact of the allele on milk components, which may alter apparent benefits.
Polymorphism in LEPR did not affect significantly any of the traits studied here. Minimal association between this gene and milk production was reported by Szyda and Komisarek (2007) in a study of Polish Holstein cattle.
The impact of variation in the LEP gene locus on dairy production has not been established by classical genome mapping, and its study has mostly followed a candidate gene approach (Buchanan et al., 2002; Lagonigro et al., 2003). In such circumstances, mutations with significant effects may not be causal but may instead reflect linkage disequilibrium with mutations in other genes potentially at some distance from the LEP gene. In this context, 2 LEP SNP (A252T and A1457G) appeared to be associated with increased daily milk yield by 2.3 kg (in favor of thymine) and 0.7 kg (in favor of guanine), respectively. From a practical point of view, it may be tempting to dismiss the thymine base allele and the associated haplotype as being rare (3%) and therefore not worthy of further pursuit. However, it is important to recognize that the allele with the greatest potential impact will frequently be rare with its value diminishing as it becomes more and more common in the population. Commercial cost-benefit realities will determine the utility of this result. In any case, the value of individual alleles to the population may be assessed by their average effect, which is measured by the product (1 – p)
, where p is the allelic frequency and
the allele substitution effect; thus, for daily milk yield, the average effect of thymine in SNP locus A252T is (1 – 0.03) x 2.3 = 2.23 kg, whereas the average effect of guanine in A1457G is (1 – 0.48) x 0.7 = 0.36 kg.
The same LEP SNP genotypes that produced more milk were also found to consume proportionately more feed, confirming the results of Lagonigro et al. (2003). Because the increase in feed intake was proportional to that of milk yield, feed conversion remained unaffected. The effect of these genotypes on live weight and body energy traits was nonsignificant. In general, cows that do not lose body energy to support increases in milk yield may be ideal candidates for future bull mothers that will help rectify losses in cow body energy following increased yield at the population level.
Given the high degree of linkage disequilibrium identified among certain SNP at the LEP gene locus, the analysis using the haplotypes may be more reliable than individual SNP analyses. In this regard, LEP haplotype CCGTTT was identified as the one with the greatest substitution effect. This haplotype includes both favorable alleles (G and T for A1457G and A252T, respectively). The average effect of CCGTTT on daily milk yield, measured by the product (1 – p)
, is estimated to be 3.05 kg. Although this haplotype was also associated with greater feed intake, the estimates of feed to milk ratios were slightly lower (albeit not significantly different from zero). Nevertheless, LEP haplotype CCGTTT was associated with somewhat heavier (P = 0.10), better-conditioned animals (P = 0.12) with increased total body energy content (P = 0.08). This association is consistent with greater feed consumption by animals bearing this haplotype. The relative economic weight of each of these traits will determine the overall economic importance of the haplotype.
Clearly, it would be interesting to further examine the effects of haplotype CCGTTT. If it is indeed favorable, it would be important to determine whether its low frequency (2.7% in our data) is due to a recent mutation or due to selection against these alleles for some unstudied trait.
| CONCLUSIONS |
|---|
|
|
|---|
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
Received for publication December 7, 2007. Accepted for publication April 22, 2008.
| REFERENCES |
|---|
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |