JDS
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


J. Dairy Sci. 2008. 91:1614-1627. doi:10.3168/jds.2007-0655
© 2008 American Dairy Science Association ®

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Interpretive Summary
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Lipkin, E.
Right arrow Articles by Soller, M.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Lipkin, E.
Right arrow Articles by Soller, M.

Effect of Quantitative Trait Loci for Milk Protein Percentage on Milk Protein Yield and Milk Yield in Israeli Holstein Dairy Cattle

E. Lipkin1, R. Tal-Stein, A. Friedmann and M. Soller

Department of Genetics, Alexander Silberman Institute of Life Science, Hebrew University of Jerusalem, Jerusalem 91904, Israel

1 Corresponding author: lipkin{at}vms.huji.ac.il


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 CONCLUSIONS
 Appendix 1
 ACKNOWLEDGEMENTS
 REFERENCES
 
Although numerous quantitative trait loci (QTL) mapping studies involving milk protein percent (PP), milk yield (MY), and protein yield (PY) have been carried out, there has not been any systematic evaluation of the effects of individual QTL on these 3 interrelated traits. Consequently, the aim of the present study was to investigate the effects on MY and PY of QTL for PP previously mapped in various laboratories. The study, based on selective DNA pooling of milk samples, included 10 Israeli Holstein artificial insemination bulls, each the sire of 1,800 or more milk-recorded daughters. For each sire-trait combination across the 10 sires, milk samples of the highest and lowest daughters with respect to estimated breeding values for PP, PY, and MY were collected for pooling. A total of 134 dinucleotide microsatellites distributed over 25 bovine autosomes were used. An empirical standard error for marker-QTL linkage testing was calculated based on the variation among split samples within the same tail. Threshold comparison-wise error rate P-values were set to control proportion of false positives at P = 0.10 level for declaring significant effects at the marker-trait level. Estimates of the number of true null hypotheses for each trait were obtained from the histogram of marker comparison-wise error rate P-values. Based on these estimates, effective power of the experiment at the marker-trait level was estimated as 0.75, 0.41, and 0.73 for PP, PY, and MY. The proportion of heterozygosity at the QTL was estimated as 0.46, 0.39, and 0.40, respectively. After correcting for incomplete power and proportion of false positives, it was estimated that 38.7 and 37.5% of the markers affecting PP and MY, respectively, also affected PY. Of the markers affecting PY, 68.9 and 76.5%, respectively, also affected PP and MY. Apparently, none of the significant markers affected PY exclusively, and only 6.5 and 16.0%, respectively, affected PP or MY exclusively. Thus, almost all significant markers, and by inference almost all QTL, had effects on at least 2 of the 3 traits.

Key Words: quantitative trait loci mapping • selective DNA • proportion of false positive


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 CONCLUSIONS
 Appendix 1
 ACKNOWLEDGEMENTS
 REFERENCES
 
A large number of studies have consistently documented a very high positive genetic correlation between milk yield (MY) and protein yield (PY), a moderate negative genetic correlation between MY and protein percentage (PP), and a very low, although generally positive, genetic correlation between PP and PY (Sölkner, 1989; Chauhan and Hayes, 1991; Welper and Freeman, 1992). For marker-assisted selection, it is important to know whether these relationships hold at the level of the individual QTL, or if there are QTL that have exclusive effects on PP, MY, or PY. Estimates of effects of β-CN A1 and A2 alleles and for the DGAT K232A allele on PP, MY, and PY were reported previously and are summarized in Table 1Go. Overall, directions of effects of the various alleles are consistent across populations and are consistent with the genetic correlations among the traits, namely specific alleles affect MY and PY in the same direction and PP in the opposite direction. On the other hand, estimated effects of specific alleles on the individual traits change in different populations. This may be due to sampling, to interaction of the allele with genetic background of the population, or to differences in the DNA sequence of the allele in different populations. On average, standardized effects of β-CN and DGAT alleles on PP were the same, but DGAT alleles appeared to have a greater effect on MY than β-CN alleles; the opposite was true for the effects of DGAT and β-CN alleles on PY. Thus, effects of individual alleles on the 3 traits may differ among genes. Viitala et al. (2003) used combined analysis of multiple chromosomes to map QTL affecting PP, MY, and PY in Finnish Ayrshire dairy cattle, at relatively high power. Assuming that QTL affecting PP, MY, or PY that map to within 20 cM of one another represent multiple effects of the same QTL, results by Viitala et al. (2003) can be interpreted as having identified 12 QTL affecting MY, of which 5 (45%) also affected PY, and 6 QTL affecting PP, of which only 1 affected PY (Table 2Go). Thus, the Viitala et al. (2003) study indicates that a majority of QTL affecting MY or PP did not affect PY and that effects of individual sire haplotypes on MY and PP were in opposite directions. On the basis of these results, Viitala et al. (2003) proposed that many QTL affect the volume of the liquid component of milk through primary effects on concentration of osmotic molecules (such as lactose and some of the Ca and P components). Such QTL would affect MY and PP in opposite directions and hence would be essentially neutral with respect to PY. Recently, Bagnato et al. (2008) reported a genome-wide scan of Brown Swiss dairy cattle for PP and MY using selective DNA pooling (Darvasi and Soller, 1994; Lipkin et al., 1998; Mosig et al., 2001). They identified a total of 55 QTL regions (QTLR), of which 16 affected MY exclusively and 13 affected PP exclusively. Eight of the 16 QTLR exclusively affecting MY, and 10 of the 13 exclusively affecting PP, also had reported results in 1 or more of the 3 Web cattle QTL databases (Cattle QTLdb, Hu et al., 2007; QTL Map, Khatkar et al., 2004; QTL Viewer, Polineni et al., 2006). Remarkably, all 8 QTLR in the Brown Swiss that exclusively affected MY were also found to exclusively affect MY in the databases. Similarly, all 10 QTL in the Brown Swiss that exclusively affected PP also exclusively affected PP in the databases. Following the Lander and Kruglyak (1995) validation criteria, and considering that the databases are based mainly on data obtained from Holstein populations, the completely independent replicated specificity of effects across different populations can be taken as a validation of these specific effects. Thus, in the study by Bagnato et al. (2008), an appreciable fraction of QTL appear to affect MY or PP primarily or exclusively, with little or no effect on the other trait.


View this table:
[in this window]
[in a new window]

 
Table 1. Absolute and standardized effects of β-CN/A1A1, β-CN/A2A2, and DGAT/K232A alleles on protein percentage (PP), protein yield in kilograms (PY), and milk yield in kilograms (MY) in various published studies1
 

View this table:
[in this window]
[in a new window]

 
Table 2. Quantitative trait loci location mapping results of Viitala et al. (2003) for protein percentage (PP), protein yield (PY), and milk yield (MY) according to chromosome (Bos taurus autosome, BTA) and location (cM)
 
We have previously reported extensive QTL-mapping studies for PP in the Israel Holstein dairy cattle population using selective DNA pooling of milk samples (Lipkin et al., 1998; Mosig et al., 2001). The aim of the present study, also based on selective DNA pooling, was to determine the pleiotropic effects of QTL affecting PP on the 2 closely related traits PY and MY and to develop a method to correct the observed proportion of multiple effects for incomplete power.


    MATERIALS AND METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 CONCLUSIONS
 Appendix 1
 ACKNOWLEDGEMENTS
 REFERENCES
 
Selective DNA Pooling
Large half-sib or full-sib families are routinely produced in dairy cattle populations as part of routine genetic improvement programs. The size of those families provide statistical power for QTL mapping within population, which can be accessed by use of a daughter design (Soller and Genizi, 1978; Weller et al., 1990). However, achieving the high power theoretically available with the daughter design requires genotyping very large numbers of daughters against large numbers of markers, resulting in high genotyping costs (Soller, 1994). These costs can be greatly reduced by the use of selective DNA pooling (Darvasi and Soller, 1994; Lipkin et al., 1998; Mosig et al., 2001). In this procedure, determination of linkage between a molecular marker and a QTL is based on the distribution of parental marker alleles among pooled DNA samples of the extreme high and low phenotypic groups of the offspring population. Densitometric estimates of sire allele frequencies in the pools were obtained after correction for shadow bands, as previously described (Lipkin et al., 1998; Mosig et al., 2001).

Population and Samples
The study included 10 half-sib daughter families of Israeli Holstein AI bulls, each the sire of 1,800 or more milk-recorded daughters and 134 markers. This would have required a total of almost 2 million genotyping data points if done by individual genotyping. Thus, the study was made feasible only through selective DNA pooling, which enabled it to be carried out with a total of about 25,000 genotyping data points. Herd book data and milk samples of these daughters were collected at different times, generating a series of data sets (DS), which are summarized in Table 3Go. In particular, daughters of 4 of the sires, comprising DS1 were previously sampled and mapped for PP in 1996. In 1998, for purposes of the present study, daughters of these 4 sires were independently resampled for PY and MY (DS2), and 6 additional sire families were sampled for all 3 traits (DS3). Based on EBV, lists of the highest and lowest 220 (DS1) or 230 (DS2, DS3) daughters were prepared for each sire-trait-tail combination. Milk sampling was as previously described (Lipkin et al., 1998; Mosig et al., 2001). For each sire-trait-tail combination, 2 subpools (DS1) or a single pool (DS2, DS3) were prepared in completely independent duplicate. Genotyping of individual and pooled milk samples was as described (Lipkin et al., 1998). Densitometric values for the marker and sire-marker tests of DS1 were obtained from the original data of Lipkin et al. (1998) and Mosig et al. (2001) as described in those studies. These were reanalyzed using the empirical standard error obtained in the present study. All other densitometric values were obtained on the basis of data collected in the present study.


View this table:
[in this window]
[in a new window]

 
Table 3. Data sets used in this study1
 
In addition to the above samples used for mapping the 3 traits, data from daughter samples DS4, DS5, and DS6 were used to calculate the empirical standard error (see below) but were not otherwise included in the present study.

Microsatellite Markers
A total of 134 dinucleotide microsatellites distributed over 25 bovine autosomes were used to scan chromosomal intervals indicated previously to harbor QTL affecting PP (Lipkin et al., 1998; Mosig et al., 2001). All markers were obtained from the public Web sites (USDA, http://www.marc.usda.gov/genome/genome.html; IBRP, http://www.cgd.csiro.au/cgd1.html).

Marker-QTL Linkage Tests
The test for linkage between a marker and each of the 3 study traits was carried out at 2 levels: the sire-marker-trait level, which tests for association of marker and trait at the level of the individual sire, and the marker-trait level, which tests for association of marker and trait across all sires.

Sire-Marker-Trait Combinations.
For each sire-marker-trait combination for which the sire was heterozygous at the marker, the comparison-wise error rate (CWER) Pijk-value, for the ith sire-jth marker-kth trait combination was obtained as twice the area of the normal curve from Zijk to + {infty}, where


Formula

Dijk = (DLijk – DSijk)/2 is the difference in sire-allele frequencies between the high and low daughter pools of the ith sire with respect to the jth marker and the kth trait, averaged over the long and short sire alleles (Lipkin et al., 1998; Mosig et al., 2001). DLijk is the difference in allele frequency of the long sire allele between the high and low daughter pools. DSijk is the difference in allele frequency of the short sire allele between the high and low daughter pools. DLijk and DSijk will necessarily have opposite algebraic sign; hence, the minus sign in the expression for Dijk.

Empirical SE(D').
Lipkin et al. (1998) and Mosig et al. (2001) calculated SE(D) on a priori grounds, on the assumption that it was determined by binomial sampling of sire and dam alleles and technical error of densitometric estimation of allele frequency in a pool. For the present study a novel empirical standard error was used, denoted SE(D'), based on a D-value (Dtail), obtained between 2 independent subpools at the same tail. Details on calculation of SE(D') are presented in Bagnato et al. (2008). To obtain an estimate of SE(D') for use in the present study, we reanalyzed 3 datasets (DS4, DS5, DS6; Table 3Go) for which 2 subpools were prepared for each tail. Data set 4 consisted of high and low daughters according to EBV for PP, of each of 7 Israel-Holstein sires. For each sire, the daughters comprising each tail were divided into 2 subpools. The 2 subpools in each tail were not completely independent, because the most extreme daughters within tails were assigned to the external subpools and the remainder to internal subpools (Lipkin et al., 1998; Mosig et al., 2001). Nevertheless, average Dtail was not significantly different from zero (data not shown), allowing the use of this DS to estimate SD(Dtail). Data set 5 consisted of high and low daughters according to EBV for PP of a new set of 7 sires. For each sire, the daughters in each tail were divided at random among 2 independent subpools. Data set 6 consisted of high and low daughters according to EBV for female fertility and a second set of high and low daughters according to EBV for milk SCC for 6 sires, 5 of which were included in DS5 and 1 which was collected independently. Here too, for each sire, the daughters in each tail were divided at random among 2 independent subpools.

Marker-Trait Combinations.
The CWER Pjk-value, for the jth marker-kth trait combination, was obtained as the area of the {chi}2 distribution from {chi}jk2 to + {infty}, where


Formula

The parentheses in the subscript of Zi(jk)2 and Zi(jk)2 indicate that the summation is over sires heterozygous at the jth marker, within the jth marker-kth trait combination, and si(jk) is the number of heterozygous sires for the jkth marker-trait combination (Weller et al., 1990; Lipkin et al., 1998; Mosig et al., 2001).

Proportion of True and Falsified Null Hypotheses Out of All Tests.
The total number of tests (N) represents a mixture comprised of n1 tests for which the null hypothesis is false (i.e., marker-QTL linkage is present) and n2 tests for which the null hypothesis is true (i.e., marker-QTL linkage is not present). Thus, the observed distribution of CWER P-values is a mixture of P-values representing these 2 classes of tests. On this basis, given the observed distribution of CWER P-values for the individual tests, the number of true null hypotheses (denoted n2) out of all N tests can be estimated by a number of procedures (Nettleton et al., 2006). For the present study, the histogram-based procedure first described by Mosig et al. (2001) and simplified by Nettleton et al. (2006) was used with the following algorithm:

Count the P-values in each of the H histogram intervals of width 1/H

Let yi be the count in the ith interval for i = 1,..., H,

Let xj be the mean of the yi from i = j to H,

then,

n2 = Hxj', where j'= the lowest value of j for which yj is lower than xj.

In the present study, H was set to 10 so that bin width was 0.10.

The number of false null hypotheses (denoted n1) is then obtained by subtraction, n1 = N – n2. Both n2 and n1 are useful statistics for a number of purposes, as will be shown in the following sections.

Proportion of False Positives: Setting Significance Levels for Experiment-Wise Linkage Tests.
To control for the multitest situation while retaining power, the proportion of false positive (PFP) criterion (Fernando et al., 2004), also termed aFDR (Mosig et al., 2001), was used to set CWER threshold P-values for declaration of significance at the marker-trait (PFPM) and sire-marker-trait (PFPS) levels. Following Mosig et al. (2001) and Fernando et al. (2004), PFPMj(k) for the jth marker test within the kth trait was calculated as:


Formula

where PMj(k) = the CWER P-value of the jth marker test within the kth trait, when the marker tests are ranked by their P-values from lowest to highest within a given trait; RMj(k) = the rank number of the jth marker test within the kth trait, and n2Mk = the number of true null hypotheses for marker tests within the kth trait, estimated as shown above.

Values for the ith sire-jth marker combination within the kth trait [PFPSij(k)] were obtained in an exactly analogous manner, except that at the sire-marker level, the number of tests for which null hypothesis is true (n2Sk) represents tests for which either the marker is not in linkage to a QTL or the marker is in linkage to a QTL, but the QTL is homozygous in the sire.

Significance Thresholds.
Significance thresholds were calculated separately by traits at the marker level and at the sire-marker level. At the marker level, marker-trait tests having CWER P-values corresponding to PFPM ≤ 0.10 were taken as significant. At the sire-marker level, PFP were calculated across all markers, but only sire-marker-trait tests having CWER P-values corresponding to PFP ≤ 0.20 within significant markers were taken as significant.

Estimating Power of the Test
Power of the statistical tests used to determine marker-QTL linkage for the kth trait at the given significance threshold for the marker-trait and sire- marker-trait levels (denoted VMk and VSk, respectively) were estimated as observed number of significant tests at each level (nOMk and nOSk) after correction for proportion of false positives, divided by the estimated number of false null hypotheses in the DS at that level (n1M, n1S) obtained as described above:


Formula


Formula

where the factor (1 – PFP) represents the correction for PFP.

Estimating the Proportion of Heterozygosity at the QTL
At the marker-trait level, a false null hypothesis represents a marker in linkage to a QTL. The proportion of false null hypotheses out of all marker-trait tests for a given trait (n1Mk/NMk) thus represents the proportion of markers in linkage to QTL for the kth trait (denoted QMk). At the sire-marker-trait level, the proportion of false sire-marker-trait null hypotheses out of all sire-marker-trait tests for a given trait (n1Sk/NSk, denoted QSk) represents the joint proportion of markers in link-age to QTL (QMk) and of QTL that are heterozygous in the sires (denoted Hk). It follows that QSk = QMkHk, and Hk can readily be estimated as Hk = QSk/QMk. This expression is algebraically equivalent to the corresponding expression given in Mosig et al. (2001). As pointed out by Mosig et al. (2001), QTL that are not represented in the sample by at least 1 sire heterozygous at both the marker and the QTL will not be included among the significant markers, so that QMk is an underestimate. This, in turn, leads to the Hk being overestimates. Mosig et al. (2001) also presented a procedure for correcting for the bias. However, when they applied this procedure to their data, the bias was close to 10%, and should be even lower for the present DS, because it includes a larger number of sires. Given the negligible bias, the correction procedure was not implemented in the present study.

Allele Substitution Effects
Allele substitution effects were calculated as described (Lipkin et al., 1998), based on shadow-corrected allele frequency estimates and the mean EBV of the pools (Table 4Go). The effects were calculated for all sire-marker-trait combinations defined as significant on the above PFPM and PFPS criteria.


View this table:
[in this window]
[in a new window]

 
Table 4. Mean EBV of high and low pools by sire1
 
Estimating the Proportion of QTL Affecting More Than One Trait
In an appreciable number of instances (denoted Oij), a given marker was observed to have significant effects on 2 traits, i and j. On the simplest assumption, this can be taken to reflect an underlying pleiotropic effect of the corresponding QTL in linkage to the marker on the pair of traits. It follows that the proportion of markers affecting trait i that also affect trait j is also an estimate of Qij, the proportion of QTL affecting i that also affect j.

Let OMij be the observed number of markers that affect both traits. Because the observed marker effects include false positives, the number of observed marker effects on the 2 traits that represent true effects on both traits is equal to OMij(1 – PFPM)2. On the other hand, because of incomplete power of the experiment, the actual number of markers affecting the 2 traits will be n1Mij = OMij(1 – PFPM)2/ViVj, where Vi and Vj = the power for detecting marker effects on traits i and j, respectively. The true proportion of markers affecting trait i that also affect trait j (QMij) will then be given by the proportion of n1Mij among the total number of estimated false null hypotheses involving i, (n1Mi): QMij = n1Mij/n1Mi. Similarly, the true proportion of markers affecting j that also affect i (QMji) will be given by QMji = n1Mji/n1Mj, where n1Mji and n1Mj are obtained as described above. Because the estimated numbers of false null hypotheses n1Mi and n1Mj can be different for different traits, it follows that QMij and QMji need not be equal. That is, the proportion of QTL affecting trait i that also affect trait j need not be the same as the proportion of QTL affecting j that also affect i. For example, it might be that a small fraction of QTL affecting MY also affect PY, but almost all QTL affecting PY also affect MY. On the same argument, the true proportion of markers affecting 3 traits (i, j, k) will be n1Mijk = OMijk(1 – PFPM)3/ViVjVk.

As calculated above, n1Mi represents the total number of markers that affect trait i, including those that also affect j and k, and n1Mij represents the total number of markers that affect traits i and j, including those that affect i, j, and k. From this, it follows that the number of markers {varepsilon}n1Mi, {varepsilon}n1Mij, and {varepsilon}n1Mik that exclusively affect traits i, j, or k will equal


Formula

and similarly for the other traits.

The proportion of markers affecting trait i that exclusively affect trait i, and trait combinations ij, ik, and ijk, are obtained by dividing {varepsilon}n1Mi, {varepsilon}n1Mij, {varepsilon}n1Mik, and n1Mijk, respectively, by n1Mi.

In principle, it would have been preferable to perform the mapping for each trait in an independent set of sires or daughters. This was achieved in part by the separation of daughters for the same sires in DS1 and DS2. But the logistical and analytical problems posed by extending such an approach to the entire DS seem insuperable. Had such an experiment been performed, we can expect that the proportion of QTL showing pleio-tropic effects would have been somewhat less than found in the present study.


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 CONCLUSIONS
 Appendix 1
 ACKNOWLEDGEMENTS
 REFERENCES
 
Sampling and Population Phenotypic Means
The mean size of the sire half-sib families was 2,779 (1,827 to 4,769) for PP and 2,893 (2,135 to 4,769) for PY and MY (Table 5Go). With target number of daughters in the designated tails of 220 (DS1) and 230 (DS2, DS3), the average proportion selected in each tail was about 0.08. For PY and MY, the proportion of targeted daughters actually sampled was greater in the high pools (0.83) than in the low pools (0.78), indicating higher herd retention and hence positive selection for these traits. For PP, the proportion actually sampled was the same for high and low pools (0.79).


View this table:
[in this window]
[in a new window]

 
Table 5. Total number of milk-recorded daughters and number of daughters sampled to the high and low pools by sire and trait1
 
The phenotypic means of the 2 tails for each trait are shown in Table 4Go. Mean values for the high and low pools varied among the different sires, with nonsignificant negative correlations between PP and PY: r = – 0.523 (P = 0.121) and – 0.357 (P = 0.311) for the high tail and for the low tail, respectively; significant negative correlation between PP and MY: r = – 0.867 (P = 0.001) and – 0.728 (P = 0.017) for the high and low tails, respectively; and significant positive correlation between PY and MY: r = 0.869 (P = 0.001) and 0.693 (P = 0.026) for the high and low tails, respectively. These values were as expected by the known correlations between the traits, which have been previously found in literature (Sölkner, 1989; Chauhan and Hayes, 1991; Welper and Freeman, 1992). The mean difference between high and low tails for PP was 0.209% (calculated as high tail mean, 0.192, minus low tail mean, – 0.017; range, 0.184 to 0.245%). Corresponding values for PY and MY were 28.5 kg (21.8 to 33.0 kg) and 1,074 kg (634 to 1,310 kg), respectively. With proportion selected of 0.08 to each tail, each difference is the equivalent of 3.72 standard deviation units (Falconer and Mackay, 1996). Thus, the above values for the mean difference between the high and low tails provide estimates of 0.056%, 7.66 kg, and 288.7 kg for the within-sire standard deviation of EBV values of the daughter groups for PP, PY, and MY, respectively. Assuming female EBV reliability = 1.0, the standard deviation of EBV values within sires will represent about 0.87 of the population standard deviation of EBV (Lipkin et al., 1998), these can now be estimated as 0.065%, 8.84 kg, and 333.4 kg, for PP, PY, and MY, respectively.

Empirical Standard Error
For DS4, SE(Dtail) was calculated across 1,375 sire-marker-trait-tail combinations and was found equal to 0.072 so that SE(Dnull) = 0.072/{surd}2 = 0.051. For DS5 and DS6, SE(Dtail) was calculated across 1,338 sire-marker-trait-tail combinations and was found equal to 0.073 so that again SE(Dnull) = 0.051. Independent subpools were not prepared for the samples of DS2 and DS3. Given that the same methodology was used, we took the average SE(Dnull) = 0.051 obtained in DS4, DS5, and DS6 as the SE(D) to analyze the DS2 and DS3 data as well. This value is somewhat less than the value of 0.056 reported by Bagnato et al. (2008).

The Proportion of False Null Hypotheses Among All Null Hypotheses
A total of 2,563 sire-marker-trait tests and 402 marker-trait tests were implemented in the present study. Table 6Go shows the frequency distribution in bins of width 0.10 of CWER P-values by traits, for sire marker, and for marker tests. At both levels, there was an excess of low (significant) P-values, compared with the null hypothesis expectation of 10% of total tests in each bin. This indicates a high proportion of false null hypotheses in the DS, which are generating the excess low P-values. The excess of low P-values was about twice as great at the marker level than at the sire-marker level. This is as expected, because even when a sire is tested at a marker in linkage to a QTL, a significant sire-marker effect can be obtained only if the QTL is present in a heterozygous state as well. At both marker and sire-marker levels, the excess of low P-values was similar for PP and MY but considerably less for PY. This is due to the generally smaller D-values found for PY. The mean absolute magnitude of D-values was 0.056, 0.048, and 0.057 for MY, PY, and PP, respectively.


View this table:
[in this window]
[in a new window]

 
Table 6. The distribution of comparison-wise error rate (CWER) P-values and derived statistics for tests by traits1
 
Heterozygosity at the QTL
Based on the data of Table 6Go, estimates of the proportion of false null hypotheses at the sire-marker-trait level (QS) were n1S/NS = 0.281, 0.134, and 0.282, for PP, PY, and MY, respectively. These represent the estimated proportion of markers in linkage to a heterozygous QTL at the level of the individual sire-trait combination. At the marker level, the corresponding QM estimates were n1M/NM = 0.612, 0.343, and 0.701, for PP, PY, and MY, respectively. This represents the estimated proportion of markers in linkage to QTL at the individual trait level. The estimates of the proportion of heterozygosity (Hk) at the QTL for the various traits (Qs/QM) were therefore equal to 0.46, 0.39 and 0.40, for PP, PY, and MY, respectively. Average estimated heterozygosity at the QTL was 0.42, which is very close to the value of 0.44 obtained in Mosig et al. (2001). As noted in the methods section, this is a slight overestimate and should be reduced by about 10%.

Critical CWER P-Values and Power According to PFP
Table 6Go also shows critical CWER P-values for declaration of linkage, according to PFP criteria for significance set at a level of 0.10 for marker-trait tests and 0.20 for sire-marker-trait tests within significant markers. Because PFP values are in direct proportion to n2, PP and MY had similar critical P-values, which were distinctly higher than the corresponding values for PY. Critical P-values for PP were comparable to the values reported for this trait by Mosig et al. (2001).

The Number of Significant Marker-Trait Tests
A total of 134 markers covering 25 chromosomes were tested against each of the 3 traits. Thus, there were 402 tests in all, of which 165 (41.0%) were significant at PFPM ≤ 0.1 (appendix Table IGo). Of the markers tested, 30 were not significant for any of the traits. Of the 104 significant markers, 20 were exclusively significant for PP, 2 for PY, and 27 for MY; 6 were exclusively significant for PP and PY, 36 for PP and MY, 7 for PY and MY, and 6 markers were exclusively significant for all 3 traits. Considering the traits individually, 68 markers (50.7%) on 23 chromosomes had a significant effect on PP (i.e., this includes markers significant exclusively for PP and those significant for PP and PY, PP and MY, and PP, PY, and MY), 21 markers (15.7%) on 15 chromosomes had a significant effect on PY, and 76 markers (56.7%) on 23 chromosomes had a significant effect on MY.


View this table:
[in this window]
[in a new window]

 
Table I. Marker P-values, according to chromosome (Bos taurus autosome, BTA) and location (cM)1
 
Allele Substitution Effects
Allele substitution effect of all sires significant at PFPS = 0.20 within markers significant at PFPM = 0.10 are presented in Table 7Go. The mean allele substitution effects of the 3 traits were very close in standard deviation units, 0.30, 0.33, and 0.32 for PP, PY, and MY, respectively. The relatively high allele substitution effects are probably a result of the low power of the experiment at the sire-marker level. As it has been shown previously, when power is low, allele substitution effects are biased upward. This would apply particularly to PY, which generally showed smaller effects than PP or MY. Thus, the apparently equivalent trait allele substitution effects may be a result of the lower power of PY relative to PP and MY.


View this table:
[in this window]
[in a new window]

 
Table 7. Allele substitution effect ({alpha}) of all sires significant at proportion of false positive (PFPS) ≤ 0.20 within markers significant at PFPM ≤ 0.101
 
The Estimated Proportion of QTL Affecting More Than One Trait
Table 8Go shows the number of significant marker-trait combinations by pairs of traits and the estimated proportion of QTL that affect the various trait pairs. Based on these values and on the procedures described in Materials and Methods, it can be calculated that, of the estimated 82 markers affecting PP, 8.9% exclusively affect PP, 14.8% exclusively affect PP and PY, 52.4% exclusively affect PP and MY, and 23.9% affect all 3 traits. Thus, 76.3% of markers affecting PP also affect MY. Of the estimated 46 markers affecting PY, none exclusively affect PY, 26.3% exclusively affect PY and PP, 33.9% exclusively affect PY and MY, and 42.6% affect all 3 traits. Of the estimated 94 markers affecting MY, 16.0% exclusively affect MY, 16.6% exclusively affect MY and PY, 45.7% exclusively affect MY and PP, and 20.9% affect all 3 traits. Thus, 66.6% of markers affecting MY also affect PP. Considering all 112.6 markers estimated as affecting at least one trait, 6.5% affected PP exclusively, 13.3% affected MY exclusively, none affected PY exclusively, 10.7% affected PP and PY exclusively, 13.9% affected MY and PY exclusively, 38.2% affected MY and PP, and 17.4% affected all 3 traits. Thus, 55.6% of QTL simultaneously affected MY and PP (with or without effects on PY), 17.2% affected PP without effects on MY, and 27.2% affected MY without effects on PP.


View this table:
[in this window]
[in a new window]

 
Table 8. Proportion of QTL affecting pairs of traits1
 

    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 CONCLUSIONS
 Appendix 1
 ACKNOWLEDGEMENTS
 REFERENCES
 
Empirical SE
In the course of the European Union sponsored Bov-MAS project, of which this study was a part, the use of an empirical estimate of the standard error for selective DNA pooling analysis was introduced (Bagnato et al., 2008). This was used in the present study in place of the estimate of the standard error based on a priori considerations used in previous studies (Lipkin et al., 1998; Mosig et al., 2001). The a priori approach involved a complex calculation based on many assumptions and approximations (e.g., covariance among allele frequency determinations based on densitometry of multi-allelic mixes) but which did not take into account sources of technical error that are not captured by differences among duplicated pools (such as errors in determining cell or DNA concentration of the samples making up the pools, which lead to over- or underrepresentation of an individual cow in the DNA mix of the pool). In contrast, the empirical estimate is based on the observed variation among independent subpools in the same tail. As such, it should include all sources of technical error, whether defined or not. Indeed, the empirical estimate was found equal to 0.051, compared with previous a priori estimates of SE(D) for the same DS of 0.039 to 0.043 (depending on the estimated frequency of the sire alleles among the dams of the daughters). Thus, the empirical estimate was of the same order of magnitude as the a priori estimates, indicating the general validity of the approach. It was, however, about 20% greater, supporting the view that the empirical estimate included aspects of the samples and data that were not included in the a priori calculations. Thus, the use of split pools and empirical estimates of the standard error should be considered as the standard option for statistical analysis of selective DNA pooling. It should be noted, however, that the empirical estimate is the same for all markers and pools and does not take into account frequency of the sire allele in the dam population and the number of duplicates and replicates of the pool that were genotyped, both of which may differ among markers and pools.

The Number of True and False Null Hypotheses Among All Null Hypotheses
The present study made extensive use of estimates of the number of true null hypotheses (n2) among all null hypotheses. These were obtained by a histogram-based approach first proposed by Mosig et al. (2001) using an iterative procedure and later simplified to a short algorithm by Nettleton et al. (2006). Evaluation of this procedure in simulations involving QTL mapping (Fernando et al., 2004) and DNA expression analysis (Nettleton et al., 2006) showed that the procedure was effective in estimating n2 and much simpler to implement relative to a variety of alternative procedures (Nettleton et al., 2006). The estimate of n2 is primarily of interest for application of the PFP criterion for setting statistical significance levels (Mosig et al., 2001; Fernando et al., 2004). However, once an estimate of n2 is available, the corresponding n1, an estimate of the number of false null-hypotheses is obtained by simple subtraction and can be used to estimate power of the experiment with respect to uncovering marker-QTL linkage (Mosig et al., 2001). As shown in Mosig et al. (2001), and somewhat differently in the present study, estimates of statistical power can be used within the framework of a multiple-sire daughter design to estimate proportion of heterozygosity at the QTL. In the present study, power estimates were further utilized to estimate the proportion of QTL affecting 2 or more traits.

QTL Effects on PP, MY, and PY
As far as PP is concerned, the high estimated proportion of false null hypotheses at the marker level (61.4%, Table 6Go) derives from the fact that the markers tested were for the most part chosen from QTLR previously shown to affect PP. The estimated 38.6% of true null hypotheses can be attributed to false positive results in the previous studies, chance sampling such that the new sires were homozygous at some of the QTLR identified in previous studies, and new markers added to explore the flanking regions of the previously identified QTLR, or QTLR reported from other populations. Given that the scan was concentrated on QTLR affecting PP, and that the calculations in this study indicate that three-fourths of markers affecting PP also affect MY, the proportion of false null hypotheses for MY would have been expected to be somewhat less than what was found for PP. Nevertheless, MY showed an even higher proportion of false null hypotheses (70.1%) than PP (Table 6Go). A simple explanation for this observation was not found. The proportion of false null hypotheses for PY (34.5%) was considerably less than for PP or MY. This is as expected if most of the markers affecting PP and MY are more or less balanced in their effects on the 2 traits. In this case, the net effect on PY is reduced by the negative correlated effects of MY and PP so that detectable signal strength of PY extends to fewer markers than the signal for MY or PP. The generally lower D-values observed for PY compared with MY and PP noted in the results section are attributed to this factor.

The estimated numbers of true marker-QTL linkages (n1M, Table 6Go) were 82, 46, and 94 for PP, PY, and MY, respectively, with corresponding power after correction for 10% of false positives of 0.75, 0.41, and 0.73. The power for PP is somewhat higher than the value 0.71 (uncorrected for false positives) reported by Mosig et al. (2001). The increase in power in the present study may reflect the use of 10 sires, whereas only 7 sires were surveyed in Mosig et al. (2001), and the fact that the present study was focused on QTLR found in that study.

Averaging over PP taken as trait i and MY taken as trait j, over two-thirds (0.71) of QTL affecting PP also affect MY and vice versa. The proportion of markers affecting PP that also affect PY was 0.39; the corresponding value for MY and PY was 0.37. In the reverse direction, the proportion of markers affecting PY that also affect PP was 0.69; the corresponding value for PY and MY was 0.77. Thus, about one-third of markers that affect PP or MY also affect PY, whereas about two-thirds of markers that affect PY also affect PP or MY. This is plausible. Many QTL affecting PP or MY will not affect PY because of the inverse relationship between MY and PP. In the converse direction, however, we would expect a QTL affecting PY to have effects on MY or PP or both. Thus, the results of the present study are broadly consistent with Viitala et al. (2003) but also support the presence of an appreciable body of QTL that affect MY or PP without major effects on the alternative trait (see below). These may exert their effects through other mechanisms.

Although most markers, and by implication QTL, were estimated to have effects on both MY and PP, it was also estimated that an appreciable proportion (about one-third) of markers affecting MY did not have major effects on PP and vice versa. Thus, these results imply that effects of individual QTL on MY, PP, and PY can differ. This is supported by studies of the effects of alleles at individual known genes on the 3 traits, MY, PY, and PP.


    CONCLUSIONS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 CONCLUSIONS
 Appendix 1
 ACKNOWLEDGEMENTS
 REFERENCES
 
In any experiment involving effects of a QTL on a number of traits, some of the markers will present statistical significance for a single trait only. The analytical problem is to distinguish those QTL that are truly limited in their effect to a single trait from those that affect multiple traits but because of limited experimental power of the study present statistical significance for a single trait only. In the present study, a method was developed to correct for incomplete power. Using this approach, estimates of the proportion of markers affecting PP that also affect MY (and vice versa) averaged 0.71. This agrees well with the Viitala et al. (2003) hypothesis. However, the results of the present study also indicate that there may be a proportion of almost 30% of QTL that affect either PP or MY in a major way, presumably via a mechanism other than liquid volume, with only minor effects on the other trait. This agrees very closely with the QTL mapping results of Bagnato et al. (2008) for MY and PP in Brown Swiss dairy cattle and their homologues in the Holstein breed as represented by the main QTL databases.

The estimate of 30% of QTL that affect either MY or PP without effects on the alternative trait is close to the estimate of about one-third (0.32) of QTL affecting PP or MY that also affect PY. Thus, those QTL that affect primarily PP or MY with only minor effect on the other main trait may be the QTL that have significant effects on PY. As expected from the arithmetical relationships among PP, MY, and PY, QTL affecting PY alone were absent from Viitala et al. (2003) and were estimated as absent in the corrected analysis of the present study. The high proportion of QTL estimated as affecting MY and PP with neutral effects on PY should enable PP to be maintained even when selection for PY results in an increase in MY. Assuming a diallelic QTL, average QTL heterozygosity equal to 0.40 as found in the present study implies allele frequencies in the range 0.7 or more for the more frequent QTL allele. This would be consistent with the longstanding selection for MY that has been underway in the Holstein breed for many generations and suggests that, for the most part, alleles with positive effects on MY (and associated negative effects on PP) may already be at moderately high frequency in the breed.


    Appendix 1
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 CONCLUSIONS
 Appendix 1
 ACKNOWLEDGEMENTS
 REFERENCES
 


    ACKNOWLEDGEMENTS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 CONCLUSIONS
 Appendix 1
 ACKNOWLEDGEMENTS
 REFERENCES
 
This research was supported by the FP5 program of the European Union (BovMAS project QLK5-CT-2001- 02379). We thank Jack Dekkers (Iowa State Univ., Ames) and the reviewers for useful comments.

Received for publication August 31, 2007. Accepted for publication December 18, 2007.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 CONCLUSIONS
 Appendix 1
 ACKNOWLEDGEMENTS
 REFERENCES
 


Bagnato, A., F. Schiavini, A. Rossoni, C. Maltecca, M. Dolezal, I. Medugorac, J. Sölkner, V. Russo, L. Fontanesi, A. Friedmann, M. Soller, and E. Lipkin. 2008. Quantitative trait loci affecting milk yield and protein percent in a 3-county Brown Swiss population. J. Dairy Sci. 91 :767–783.[Abstract/Free Full Text]

Chauhan, V. P., and J. F. Hayes. 1991. Genetic parameters for first milk production and composition traits for Holsteins using multivariate restricted maximum likelihood. J. Dairy Sci. 74 :603–610.[Abstract]

Darvasi, A., and M. Soller. 1994. Selective DNA pooling for determination of linkage between a molecular marker and a quantitative trait locus. Genetics 138 :1365–1373.[Abstract]

Falconer, D. S., and T. F. C. Mackay. 1996. Introduction to Quantitative Genetics. 4th ed. Longmans Green, Harlow, Essex, UK.

Fernando, R. L., D. Nettleton, B. R. Southey, J. C. M. Dekkers, M. Rothschild, and M. Soller. 2004. Controlling the proportion of false positives (PFP) in a multiple test situation. Genetics 166 :611–619.[Abstract/Free Full Text]

Freyer, G., Z. Liu, G. Erhardt, and L. Panicke. 1999. Casein polymorphism and relation between milk production traits. J. Anim. Breed. Genet. 116 :87–98.[CrossRef]

Grisart, B., W. Coppieters, F. Farnir, L. Karim, C. Ford, P. Berzi, N. Cambisano, M. Mni, S. Reid, P. Simon, R. Spelman, M. Georges, and R. Snell. 2001. Positional candidate cloning of a QTL in dairy cattle: Identification of a missense mutation in the bovine DGAT1 gene with major effect on milk yield and composition. Gen. Res. 12 :222–231.[CrossRef]

Hu, J., E. R. Fritz, and J. M. Reecy. 2007. AnimalQTLdb: A livestock QTL database tool set for positional QTL information mining and beyond. Nucleic Acids Res , 35 (Database issue):D604–D609.[CrossRef][Medline]

Khatkar, M. S., P. C. Thomson, I. Tammen, and H. W. Raadsma. 2004. Quantitative trait loci mapping in dairy cattle: Review and meta-analysis. Genet. Sel. Evol. 36 :163–190.[CrossRef][Medline]

Kühn, C., G. Thaller, A. Winter, O. R. P. Bininda-Emonds, B. Kaupe, G. Erhardt, B. Jörn, S. Manfred, and F. Ruedi. 2004. Evidence for multiple alleles at the DGAT1 locus better explains a quantitative trait locus with major effect on milk fat content in cattle. Genetics 167 :1873–1881.[Abstract/Free Full Text]

Lander, R., and L. Kruglyak. 1995. Genetic dissection of complex traits: Guidelines for interpreting and reporting linkage results. Nat. Genet. 11 :241–247.[CrossRef][Medline]

Lipkin, E., M. O. Mosig, A. Darvasi, E. Ezra, A. Shalom, A. Friedmann, and M. Soller. 1998. Quantitative trait locus mapping in dairy cattle by means of selective DNA pooling using milk samples and dinucleotide microsatellite markers: Analysis of milk protein percentage. Genetics 149 :1557–1567.[Abstract/Free Full Text]

Mosig, M. O., E. Lipkin, G. Khutoreskaya, E. Tchourzyna, M. Soller, and A. Friedmann. 2001. A whole genome scan for QTL affecting milk protein percent in Israeli-Holstein cattle, by means of selective milk DNA pooling in a daughter design, using a modified false discovery rate criterion. Genetics 157 :1683–1698.[Abstract/Free Full Text]

Nettleton, D., J. T. G. Hwang, R. A. Caldo, and R. P. Wise. 2006. Estimating the number of true null hypotheses from a histogram of P-values. J. Agric. Biol. Env. Stat. 11 :337–356.[CrossRef]

Polineni, P., P. Aragonda, S. R. Xavier, R. Furuta, and D. L. Adelson. 2006. The bovine QTL viewer: A web accessible database of bovine quantitative trait loci. BMC Bioinformatics 7 :283.[CrossRef][Medline]

Sölkner, J. 1989. Genetic relationships between level of production in different lactations, rate of maturity and longevity in a dual purpose cattle population. Livest. Prod. Sci. 23 :33–45.[CrossRef]

Soller, M. 1994. Marker assisted selection - An overview. Anim. Biotechnol. 5 :193–207.

Soller, M., and A. Genizi. 1978. The efficiency of experimental designs for detection of linkage between a marker locus and a locus affecting a quantitative trait in segregating populations. Biometrics 34 :47–55.[CrossRef]

Spelman, R. J., C. A. Ford, P. McElhinney, G. C. Gregory, and R. G. Snell. 2002. Characterization of the DGAT1 gene in the New Zealand dairy population. J. Dairy Sci. 85 :3514–3517.[Abstract/Free Full Text]

Thaller, G., W. Kramer, A. Winter, B. Kaupe, F. Erhardt, and R. Fries. 2003. Effects of DGAT1 variants on milk production traits in German cattle breeds. J. Anim. Sci. 81 :1911–1918.[Abstract/Free Full Text]

Viitala, S. M., N. F. Schulman, D. J. de Koning, K. Elo, R. Kinos, A. Virta, J. Virta, A. Mäki-Tanila, and J. H. Vilkki. 2003. Quantitative trait loci affecting milk production traits in Finnish Ayrshire dairy cattle. J. Dairy Sci. 86 :1828–1836.[Abstract/Free Full Text]

Weller, J. I., M. Golik, E. Seroussi, E. Ezra, and M. Ron. 2003. Population-wide analysis of a QTL affecting milk-fat production in the Israeli Holstein population. J. Dairy Sci. 86 :2219–2227.[Abstract/Free Full Text]

Weller, J. I., Y. Kashi, and M. Soller. 1990. Power of daughter and granddaughter designs for determining linkage between marker loci and quantitative trait loci in dairy cattle. J. Dairy Sci. 73 :2525–2537.[Abstract]

Welper, R. D., and A. E. Freeman. 1992. Genetic parameters for yield traits of Holsteins, including lactose and somatic cell score. J. Dairy Sci. 75 :1342–1348.[Abstract]


This article has been cited by other articles:


Home page
J DAIRY SCIHome page
H. Khatib, W. Huang, X. Wang, A. H. Tran, A. B. Bindrim, V. Schutzkus, R. L. Monson, and B. S. Yandell
Single gene and gene interaction effects on fertilization and embryonic survival rates in cattle
J Dairy Sci, May 1, 2009; 92(5): 2238 - 2247.
[Abstract] [Full Text] [PDF]


Home page
J DAIRY SCIHome page
E. Lipkin, A. Bagnato, and M. Soller
Expected Effects on Protein Yield of Marker-Assisted Selection at Quantitative Trait Loci Affecting Milk Yield and Milk Protein Percentage
J Dairy Sci, July 1, 2008; 91(7): 2857 - 2863.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Interpretive Summary
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Lipkin, E.
Right arrow Articles by Soller, M.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Lipkin, E.
Right arrow Articles by Soller, M.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS