|
|
||||||||
1 Institute of Agricultural Biology and Biotechnology, National Research Council, Segrate 20090, Italy
2 Dipartimento di Sanità e Benessere degli Animali, Università di Bari, Valenzano 70010 Italy
3 Centro Ricerche e Studi Agroalimentari, Fondazione Parco Tecnologico Padano, Segrate 20090, Italy
4 Dipartimento di Scienze e Tecnologie Veterinarie per la Sicurezza Alimentare, Università degli Studi di Milano, Italy
5 Associazione Nazionale Allevatori Frisona Italiana, Cremona 26100, Italy
6 Associazione Nazionale Allevatori Razza Bruna Italiana, Bussolengo 37012, Italy
Corresponding author: Paul Boettcher; e-mail: boettch{at}ibba.cnr.it.
| ABSTRACT |
|---|
|
|
|---|
Key Words: haplotype casein cattle milk production
| INTRODUCTION |
|---|
|
|
|---|
Results of studies considering the effects of single protein genes have not been consistent. Some investigations have found a positive effect for the B allele of the
-CN locus (e.g., Ng-Kwai-Hang et al., 1986; Cowan et al., 1992), but other studies have failed to corroborate this result (Ron et al., 1994; Sabour et al., 1996). This result seems to suggest that the significant effects observed in some studies were due not only to the
-CN gene, but also possibly to other genes or combination of genes physically linked to this locus. Genes that encode
S1-CN and ß-CN are also located on chromosome 6, within a region of about 200 kb that includes
-CN (Ferretti et al., 1990; Threadgill and Womack, 1990). This closeness in physical location makes it difficult to separate the effects of different CN genes (Lien et al., 1995). For this reason, several investigators have taken a different approach for estimating effects associated with different combinations of the CN alleles. Instead of analyzing effects of single alleles, these studies estimated effects of haplotypes defined by the CN genes (e.g., Braunschweig et al., 2000; Ikonen et al., 2001) and, in some cases, by other sites of polymorphism within the area of chromosome 6 linked to the CN loci (Lien et al., 1995). Various significant effects have been observed in these studies, particularly for protein content in milk.
The commonly used genotyping procedures currently applied in molecular genetic studies yield only information about the 2 alleles present at a given locus of diploid individuals. When multiple loci are genotyped, haplotypes are not known, because no information is produced regarding the linkage phase of the alleles at the different loci. Definition and assignment of haplotypes must be done in a separate step, using information about genotypes of individuals within the same family or the population to reconstruct and deduce the most probable haplotype for each individual. One common approach taken for analyses of haplotype effects (and the approach taken in the previously mentioned studies on CN haplotypes) has been to determine the most likely haplotype configuration for each individual and assume that this haplotype assignment is known without error when subsequent statistical analyses are performed. One problem with this approach is that haplotype configurations are usually impossible to determine unambiguously for some individuals. Therefore, another option for such analyses would be to estimate the probability of each haplotype configuration for each individual and then to regress phenotypes on the respective probabilities for each haplotype. Such an approach would help account for the uncertainty of haplotype reconstruction and could yield greater precision in the analysis.
A question of interest in studies on CN haplotypes is whether the effects observed are due to differences among genes within the CN complex or to genes linked physically on chromosome 6. Finding similar effects of the same haplotypes in different breeds would provide some evidence that the effects observed were due to the direct effects of the CN loci or to DNA polymorphisms within the casein complex. In contrast, differences among breeds would tend to rule out this conclusion, suggesting that differences were also associated with other factors, such as direct effects of other genes linked to the CN loci or interaction with loci in other regions of the genome. The logic behind this hypothesis is that the casein complex is rather short at 200 kb and thus may be subject to little recombination.
The objective of this study was to estimate the effects of CN haplotypes on milk production traits 2 breeds, Italian Holstein and Brown Swiss, and to compare effects of the same haplotypes in the 2 breeds.
| MATERIALS AND METHODS |
|---|
|
|
|---|
S1-, ß-, and
-CN were available for 352 Italian Holstein and 311 Brown Swiss, based on milk samples collected in 2001. Genotypes had been determined by isoelectric focusing analysis of individual milk samples according to Erhardt et al. (1998). Twenty different sires were represented in each breed. The number of daughters per sire ranged from 8 to 28 for Holsteins and from 9 to 22 for Brown Swiss. Haplotypes were reconstructed separately for each breed, using the Monte Carlo based method of Boettcher et al. (2004). This procedure involved 2 steps. In the first step, haplotype probabilities of the sires were determined, conditional upon genotypes of the offspring and allelic frequencies in the general population. Briefly, for each sire, all possible haplotype configurations were identified based on offspring genotypes. Then, for each plausible haplotype combination for that sire, random gametes were sampled for the offspring that were consistent with their respective genotypes. The complementary dam gametes were then determined and the probability of receiving this combination of sire and dam gametes was determined based on population haplotype frequencies, recombination rates, and probability theory. This process was repeated many times for each plausible sire haplotype combination and summed together within each combination to obtain a relative probability for each. Relative probabilities were converted to absolute probabilities by standardizing by the sum of all relative probabilities. In the second step, the haplotype probabilities of offspring were then directly calculated conditional upon these sire haplotype frequencies. These 2 steps were alternated iteratively until estimated population frequencies within the breed of interest converged to stable values. The final result was a set of estimated haplotype probabilities for each animal, expressed as the expected number of copies of each haplotype carried by each animal. Phenotypic records for 305-d yields of milk, fat, and protein, and percentages of fat and protein were obtained from the Italian Holstein (ANAFI, Cremona, Italy) and Brown Swiss (ANARB, Bussolengo, Italy) Associations. Data had been preadjusted for age, parity, month of calving, and region effects and deviated from contemporary group means. Preadjustment of records was expected to be much more precise than directly accounting for the various systematic nongenetic effects in the model. Adjustments had been made based on factors calculated using the national databases for each breed, which include hundreds of thousands of records, vs. only a few hundred for this study. For a similar reason, using deviations from contemporary groups in the national databases was expected to be superior to including contemporary groups in the eventual statistical model. Cattle were selected for the project based on sire, rather than herd, and most cows had many fewer herdmates in the study than present in the national databases. Holstein cows were from 51 different herds and Brown Swiss were from 77. Phenotypic data were not available for all cattle with CN genotypes, and some cows had multiple records. After keeping only those records from cattle with both types of information, the final datasets included 728 records from 347 Holstein cows and 773 records from 298 Brown Swiss cows. The Italian national evaluation for Holsteins considers only the first 3 lactations, whereas all records are eligible for the Brown Swiss, which explains why the average number of records per cow was greater for Brown Swiss. Among Holsteins, 93 cows had only a single record, and 129 had 3 records. For Brown Swiss, 77 cows had only first parity information, and 134 had >2 records. Breeds were analyzed separately.
The goal of the analysis was to estimate haplotype effects by regressing phenotypes on haplotype probabilities (expressed as expected numbers of copies of each haplotype). Because these expected values summed to exactly 2.0 for each individual, linear dependencies existed among haplotype probabilities. Thus, a standard regression approach would not have allowed for the estimation of both an overall intercept and a separate regression coefficient for each haplotype. Therefore, before the statistical analysis, the overall mean was subtracted from the record for each trait, forcing an implicit restriction that the sum of haplotype effects (weighted for individual probabilities) in the population was equal to zero. Then regression was applied without including an intercept in the model, allowing for no explicit restrictions on the fixed effects and the estimation of a regression coefficient for each haplotype. Rare haplotypes (frequency <0.005) were grouped together into a single common effect.
The model used for the statistical analyses was:
![]() | ([1]) |
where yijk was the kth phenotypic record (preadjusted effects of age, parity, herd-year-season and deviated from the population mean) for the jth daughter of the ith sire, h is the total number of haplotypes evaluated, ßa is the regression coefficient for the effect of a single haplotype a, pa:ij is the probability that daughter j of sire i carried haplotype a, si is the random effect of sire i, cij is the random effect of the jth daughter of sire i (nested within sire i), and eijk is the random residual. A sire model was chosen rather than an animal model because the procedure for selection of animals resulted in very few close maternal genetic relationships (i.e., mother-to-daughter or maternal half-sibs). Thus, an expectedly small amount of statistical precision was sacrificed for the increased operational simplicity of the sire model. Equation 1 was solved using maximum-likelihood and the MIXED procedure for SAS software (SAS Institute, Inc., Cary, NC). Variances for sire, cow, and residual effects were estimated simultaneously with the estimation of fixed effects.
The regression coefficients ßa from equation 1 allow one to calculate the expected difference in a given trait between 2 animals with different casein haplotypes. In the hypothetical situation in which 2 animals differed at only one haplotype, the expected difference in a given trait would be the difference between the solutions of the different haplotypes that the 2 animals carry. For example, consider 2 cows that are alike in all ways (including one of the 2 haplotypes) except that one cow carries hypothetical haplotype X and the other carries Y. Assuming that the respective regression coefficients for these 2 haplotypes were +100 and 200, the milk yield of the first cow would be expected to exceed that of the second by 300 kg.
To determine the statistical significance of the haplotype effects, we tested the null hypothesis that all regression coefficients were equal to zero. In addition, P values of the estimates of the effects of each haplotype were also obtained. To determine if the estimates of haplotype effects were similar across breeds, simple Pearson correlations were calculated across breeds for each trait, using solutions for the haplotypes that were present in both breeds. Only a small number of haplotypes were present in both breeds, thus the precise values of correlation coefficients between solutions were clearly not expected to have the same meaning as a correlation coefficient calculated on a very high number of observations. For example, such correlations could not be converted to meaningful regression coefficients to estimate the change in one variable associated with a unit change in the other variable. However, high and positive correlations were still expected to indicate similarity of effects across breeds. In addition, the correlations were not likely to be distributed with a normal distribution under the null hypothesis. Therefore, a permutation test was used to test for significance of the resulting correlation coefficients.
Under the null hypothesis of no similarity in solutions for the same haplotypes across breeds, one would expect the correlation of solutions across breeds to be near zero and the way in which individual haplotype solutions were paired across breeds to calculate a correlation coefficient should have no particular relationship with the magnitude of the correlation. Thus, to perform the permutation test on the correlations observed for the N haplotypes in common across breeds, we calculated the correlation coefficient for each of the N! possible ways of pairing the N different solutions for each trait across breeds. Then, for each trait, the P value was estimated as the proportion of pairing schemes that yielded greater or equal correlation coefficients than that observed when solutions were paired according to the haplotypes.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
|
s1-CN in the sample of Holstein cattle. Four alleles (A1, A2, A3, and B) were present for ß-CN, and 3
-CN alleles (A, B, and E) were observed. Among the Brown Swiss, an additional F allele was observed for
s1-CN and some cattle carried the C allele of
-CN, whereas the E allele of
-CN was not present. Fifteen different genotypes were observed among the Holsteins, with (
s1, ß,
-CN) BBA1A2AA (26%), BBA2A2AA (24%), and BBA1A2AE (12%) being the most common. Brown Swiss cows had 33 different genotypes, with the 3 most common being BBA2A2BB (28%), BBA2BAB (12%), and BBA2A2AB (9%). Based on the distributions of alleles, a maximum of 24 (2 x4 x3) haplotypes were possible among Holsteins and 36 (3 x4 x3) were possible for Brown Swiss. Following haplotype reconstruction, 9 different plausible haplotypes were observed among the Holsteins and 17 were present for the Brown Swiss. Estimated haplotype frequencies of the most common (frequency >0.005) are shown in Table 2
s1-ß-
), with an estimated frequency of 0.48. More than 75% of the haplotypes present were either BA2A or BA1A. The BA2B haplotype was the most common (0.50) among the Brown Swiss. The frequency of this haplotype was only about 5% among the Holsteins (Table 2
|
|
In Holsteins, the BA1B haplotype was associated with increased percentages of fat (0.16) and protein (0.09). Ikonen et al. (2001) reported a positive effect for this haplotype on these traits in Finnish Ayrshires. The BBA haplotype had the unusual effect of increasing fat percentage (0.11) and decreasing protein percentage (0.08), whereas the CA3A haplotype had opposite effects on the same traits. However, both of these haplotypes were rare and the estimates of their effects had high standard errors and were not significant statistically (P >0.05).
Similar to Holsteins, the BA1B haplotype was associated with increased fat and protein percentages (P <0.01) in Brown Swiss (Table 3
). This haplotype also had a negative effect on milk yield (P <0.05), which was consistent with its effect in Holsteins (Table 2
), although the effect in Holsteins was not significant. The CA2B haplotype had effects similar to BA1B, in that it was associated with significantly decreased yield and increased concentration of protein. Three other haplotypes (BA2A, BBA, and BBB) had significant (P <0.05) effects on protein percentage. In general, among haplotypes differing only at the
-CN locus, those haplotypes with the B allele had positive effects on protein percentage relative to the corresponding haplotype with the A allele, which was consistent with previous studies that had found differences between
-CN genotypes (Ng-Kwai-Hang et al., 1986; Cowan et al., 1992; Ikonen et al., 2001).
Comparing results across breeds, one similarity was that the same traits protein percentage in particular, and fat percentage to a lesser degree were affected most strongly by differences in CN haplotypes. As mentioned previously, haplotype BA1B was associated with significant increases in fat and protein percentages in both breeds. Ikonen et al. (2001) observed similar effects for this haplotype in Finnish Ayrshires, and Caroli et al. (2004) reported a positive effect of this haplotype on protein percentage in the local Reggiana cattle breed of Italy.
Six haplotypes (BA1A, BA1B, BA2A, BA2B, BBA, and BBB) were present in both breeds. Table 4
has correlation coefficients across breeds for haplotype effects for each trait. Correlations between haplotype solutions were positive for all traits and above 0.50 for all traits except protein yield. The correlation for milk yield was particularly high (0.81). According to the permutation test procedure, only 5 (<1%) of the 720 possible pairing schemes yielded higher correlation coefficients. In addition, fewer than 5% of the permutations of paired solutions yielded higher correlations for fat yield and fat percentage.
|
Although the number of animals evaluated in this study was rather low (347 Holsteins and 298 Brown Swiss for a total of slightly less than 650 cattle), the positive significance tests across breeds for multiple traits seem to suggest that genes within or near the CN region have effects on milk production traits. Assuming these effects are real, the high correlations across breeds may indicate that the CN protein sequences themselves or DNA polymorphisms within the CN complex are responsible for the effects on the various traits, rather than other genes linked to these loci. If effects were due to linked genes, similar haplotype solutions would have been observed only if the linkage phase with the CN alleles was the same across breeds, which seems unlikely. Interactions of CN loci with other polymorphisms in the CN complex may be present, which could help explain why haplotype effects ranked similarly in both breeds but were more distinct in the Brown Swiss than in Holsteins. Sites of polymorphism within noncoding regions within the CN complex have been recently identified. Damiani et al. (2000b) described a polymorphism within the short interspersed nucleotide element Bov-A2 in the second intron of
-CN. This polymorphism was found to be in linkage disequilibrium with the
-CN protein variants (Damiani et al., 2001), and had statistically significant associations with several milk production traits (Damiani et al., 2000a). More recently, polymorphisms of the
s1-CN promoter were found have significant associations with milk protein content (Prinzenberg et al., 2003). Thus, the CN polymorphism has to be considered as a whole complex where expression sequence polymorphisms could help explain the productive implications of the different CN loci and their corresponding haplotypes.
| CONCLUSIONS |
|---|
|
|
|---|
-CN. However, some caution must be used when interpreting these results, given that they were based on records from <350 animals per breed and only 6 haplotypes were present in both breeds. From a biological point of view, further investigation is required to confirm these results and to obtain more details about the precise polymorphisms that could be responsible for associations related to the CN complex. If observed effects are real, such work could increase the value of the CN complex for use in marker-assisted selection, which remains questionable based on the current work, inasmuch as the haplotypes that improved fat and protein percentage were also associated with decreased milk yield and only small effects on yield of fat and protein.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
Received for publication December 19, 2003. Accepted for publication September 2, 2004.
| REFERENCES |
|---|
|
|
|---|
-casein and ß-lactoglobulin in Holstein cattle. J. Dairy Sci. 75:10971104.[Abstract]
-casein SINE Bov-A2 and CYP21-hydroxylase in some bovine breeds. Zoot. Nutriz. Anim. 3:145148.
-casein (CSN3) gene. Anim. Genet. 31:277279.[Medline]
This article has been cited by other articles:
![]() |
A. Comin, M. Cassandro, S. Chessa, M. Ojala, R. Dal Zotto, M. De Marchi, P. Carnier, L. Gallo, G. Pagnacco, and G. Bittante Effects of Composite {beta}- and {kappa}-Casein Genotypes on Milk Coagulation, Quality, and Yield Traits in Italian Holstein Cows J Dairy Sci, October 1, 2008; 91(10): 4022 - 4027. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Caroli, S. Chessa, F. Chiatti, D. Rignanese, B. Melendez, R. Rizzi, and G. Ceriotti Short Communication: Carora Cattle Show High Variability in {alpha}s1-Casein J Dairy Sci, January 1, 2008; 91(1): 354 - 359. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Chessa, F. Chiatti, G. Ceriotti, A. Caroli, C. Consolandi, G. Pagnacco, and B. Castiglioni Development of a Single Nucleotide Polymorphism Genotyping Microarray Platform for the Identification of Bovine Milk Protein Genetic Polymorphisms J Dairy Sci, January 1, 2007; 90(1): 451 - 464. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Sacchi, S. Chessa, E. Budelli, P. Bolla, G. Ceriotti, D. Soglia, R. Rasero, E. Cauvin, and A. Caroli Casein Haplotype Structure in Five Italian Goat Breeds J Dairy Sci, April 1, 2005; 88(4): 1561 - 1568. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |