|
|
||||||||




* Department of Veterinary Sciences and Technologies for Food Safety, Università degli Studi di Milano, 20133 Milano, Italy
Italian Brown Breeders Association, Loc. Ferlina 204, Bussolengo 37012, Italy
Dairy Science Department, University of Wisconsin, 1675 Observatory Dr., Madison
Department of Sustainable Agricultural Systems, BOKU—University of Natural Resources and Applied Life Sciences, Vienna, A-1180 Vienna, Austria
# Institute for Animal Breeding, Ludwig-Maximilians University, 80539 Munich, Germany
|| Dipartimento di Protezione e Valorizzazione Agroalimentare, Università di Bologna, 40010 Bologna, Italy
¶ Department of Genetics, The Hebrew University of Jerusalem, 91904 Jerusalem, Israel
1 Corresponding author: alessandro.bagnato{at}unimi.it
| ABSTRACT |
|---|
|
|
|---|
Key Words: quantitative trait loci mapping Brown Swiss productive trait selective DNA pooling
| INTRODUCTION |
|---|
|
|
|---|
Numerous studies (e.g., de Koning et al., 2001; Mosig et al., 2001; Thomsen et al., 2001; Boichard et al., 2003) have found a large number of QTL affecting productive and functional traits in dairy cattle. Khatkhar et al. (2004), in a meta-analysis, considered 45 studies in dairy cattle populations and provided an overview of QTL reported in literature. To the best of our knowledge, with the exception of 2 autochthonous French breeds (Boichard et al., 2003), and the Ayrshire breed (Velmala et al., 1999; de Koning et al., 2001), all other QTL mapping studies in dairy cattle have been implemented in the Holstein population. This is probably due to the need for large families, when using either a daughter (DD) or a granddaughter designs (GDD; Weller et al., 1990). Because QTL mapping studies are traditionally realized on a national basis, this consideration limits QTL mapping studies in the minor breeds due to the difficulty of finding large enough paternal half-sib families. For cosmopolitan minor breeds, however, such as Brown Swiss, Jersey, and Guernsey, the number and size of families on a multinational basis are sufficient to support such studies and in this way provide a basis for genomic approaches to selection in these breeds as well. Carrying out a multinational QTL mapping study based on the GDD should be quite feasible, using semen as source of DNA, and Interbull EBV (http://www-interbull.slu.se/) to bring all sires to a specific country common denominator. On the other hand, a multinational DD for daughters spread over a number of countries would be more difficult with respect to the need to adjust national daughter EBV to a common denominator. To convert different national EBV to a common basis, the best option is to use linear regression conversion formulas method used for several years to provide sire international proofs (Wilmink et al., 1986). This method was abandoned for sires when MACE (multiple across country evaluation) (Schaeffer, 1994) became available. Several studies tried to construct a common denominator genetic evaluation for dams as well, but these attempts have been largely unsuccessful (Canavesi et al., 2001). At present, the only way to obtain consistent EBV for an international female population would be to perform a borderless genetic evaluation beginning with phenotypes (Maltecca et al., 2004). This is the procedure followed by Austria and Germany for the joint genetic evaluation of the Brown Swiss breed across the 2 countries (Emmerling et al., 2002). In the recent past the breeding goals in the 3 countries were similar and sires used in the 3 populations were shared among countries so to permit here the consideration of individuals from Austria, Germany, and Italy as a single European Brown Swiss population.
A selective DNA pooling approach using milk as a source of DNA (Lipkin et al., 1993, 1998; Mosig et al., 2001) was applied in the present study. The use of selective DNA pooling in a daughter design (Darvasi and Soller, 1994) allows an efficient initial screening of marker-QTL association. In addition, because in selective DNA pooling QTL association is based on allele frequency estimation in the pools made of extreme individuals, these pools can readily be constructed without the need of international EBV for the daughters of each sire family, even if sampled in 2 or more countries. In the present study, the use of milk somatic cells as a source of DNA (Lipkin et al., 1993, 1998) allowed collection of a large number of samples through the routine milk recording testing performed by the Brown Swiss breed and milk recording associations in all 3 countries.
The objective of this work was to map QTL for milk yield (MY) and milk protein percent (PP) in the European Brown Swiss breed, utilizing a selective DNA pooling approach using milk as source of DNA and sampling individuals from Austrian, German, and Italian populations. Although Brown Swiss population size in each country is relatively small compared with the respective Holstein population, sampling jointly from all 3 countries provided a sufficient number of sire families and daughters for a comprehensive mapping study. To the best of our knowledge this represents the first QTL mapping study reported for this breed, and we believe it will serve as a paradigmatic example for other minor cosmopolitan breeds.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
Information on SCC for each official sample was made available through the milk recording agencies. In some instances a reported SCC for a particular sample was missing or the date recorded in the breed association database did not match the sampling date. In these cases, SCC was obtained directly from the collected sample. As a quality assurance, SCC values for a random cross section of 100 samples were verified by comparing the SCC information from the milk recording agencies with those obtained directly from the collected samples. The maximum observed difference was 6%, which is below the 7% tolerance accepted among instruments in official testing. Additionally, a pilot study was conducted to compare SCC obtained from fresh and frozen samples: for fresh samples SCC was recorded after sampling; for frozen-unfrozen samples SCC was measured weekly for up to 6 wk. In the frozen-unfrozen collected samples, SCC had a maximum variation of 5% with respect to the cell count reported for the original fresh sample (data not shown).
Pool Constitution
For MY and PP EBV were available for each daughter according to the national genetic evaluation procedure in Italy (Interbull, 2000) and in Austria and Germany (Emmerling et al., 2002). Daughter EBV were corrected (cEBV) by subtracting half of the dams EBV to remove the dam effect on daughter EBV and optimize the ranking of high and low individuals on their sires genetic contribution (Dolezal, 2007). The highest and lowest 200 individuals of the cEBV distribution (high and low "tails", respectively) for each sire-family-by-trait combination were identified for inclusion in the pools. Each high or low tail group was divided into 2 independent subpools each consisting of 100 randomly selected daughters (henceforth, replicate subpools), by assigning animals with even rank position to subpool-1, and those with odd rank position to subpool-2. In addition each of the replicate subpools was duplicated from the same samples in a separate independent pooling procedure, performed by different operators on a different day (henceforth, duplicate subpools). Thus, a total of 8 subpools (2 tails [high, low] x 2 replicate subpools per tail x 2 duplicate subpools per replicate subpool) were constructed for each sire-by-trait combination. With 10 sires and 2 traits, there were a total of 20 sire-by-trait combinations, giving a total of 160 subpools. Pool construction for sires sampled across Italy and Austria was proportional to the number of daughters collected in each country. This was done to control for possible differences in genetic variability and average genetic level among the different populations.
Sample Preparation
Based on the SCC, a volume of milk to attain the desired numbers of cells (20,000 per individual) was included in each subpool. The subpool was then divided into 10 pool-fractions to decrease possible risk of contamination or loss of an entire subpool due to mishandling. Each pool-fraction contained 2,000 cells from each of the 100 daughters included to guarantee approximately equal genomic contribution from each individual to the PCR. Following Lipkin et al. (1993, 1998), cell lysate was used in the PCR reaction instead of purified DNA. An aliquot of milk was diluted with saline (NaCl 0.9%) and centrifuged at 685 x g for 5 min to pellet somatic cells. Cells were resuspended in saline and recentrifuged (2 to 15 times) until a clear pellet of milk somatic cells was obtained. Somatic cells were resuspended in Tris and EDTA to a volume of 50 µL (a final concentration of 4,000 cells per µL) and incubated in thermal cycler (MJ Research PTC-200, GMI Inc., Ramsey, MN) at 100°C for 5 min, 50°C for 5 min, and again at 100°C for 5 min to lyse the cells. A commercial kit (http://www.spin.it/talent/genomix.html) was utilized to obtain DNA from semen.
Markers and Genotyping
A panel of 187 dinucleotide microsatellite markers (www.marc.usda.gov/genome/genome.html) covering the entire autosomal genome was chosen. Primers were labeled with fluorescent dye (FAM, HEX, TET) according to the standard (TAMRA350, Applied Biosystems Foster City, CA) used for fragment analysis on the ABI377 sequencer (Applied Biosystems).
The PCR reaction was optimized for each fluorescent primer starting from a standard protocol consisting of 1 µL of PCR buffer with MgCl2 (15 mM), 0.2 µL of dNTP (10 mM each), 0.3 µL of each primer (10 µM each), 2 µL of 5X TaqMaster PCR Enhancer (Eppendorf AG, Hamburg, Germany) and 0.5 U of Taq DNA polymerase (Eppendorf AG). One microliter of semen DNA 1:10 diluted or 1 µL of cell lysate (to provide a DNA average contribution of 4,000 cells per subpool) was used in each PCR reaction. Water was added to a final volume of 10 µL. Amplification was performed in MJ Research PTC-200 thermal cycler (GMI Inc.), programming 1 step of 10 min at 95°C, 35 cycles comprised of 95°C for 1 min, 54°C to 64°C for 1 min (according to specific primer annealing temperature) and 72°C for 1 min; a final step of 45 min at 72°C completed the amplification process to eliminate the half-step bands.
One-half microliter of each PCR product was added to a mix of TAMRA 350 (Applied Biosystem), blue-dextran, and formamide and separated by electrophoresis on an ABI377 automatic sequencer of Applied Biosystems using a 5% acrylamide 0.2-mm gel. Densitometric values of sires and pool genotypes (peak height) of each detectable fragment were obtained using Genescan and Genotyper softwares (Applied Biosystems). Among the initial 187 markers, 3 were homozygous in all sires, 10 did not amplify properly, and 2 systematically produced by-products that affected densitometric values. This left a total of 172 markers that were used in the genome scan, yielding an average spacing of about 20 cM, except for BTA 3, 6, 14, and 20 where a denser map (about 2 to 5 cM spacing) was used: for chromosome 6, because of the strongly confirmed presence of QTL on that chromosome (reviewed by Khatkar et al., 2004; Polineni et al., 2006; Hu et al., 2007); for chromosomes 3, 14, and 20, because they were found significant across all populations studied in the BovMAS-EU-funded project (of which this is a part) and were targeted for more detailed study. For the same reason, to confirm significance in these 3 chromosomes, individual genotyping was carried out for a set of 42 markers each with a sample of 96 individuals per tail, and results provided a correlation larger then 0.95 among pools and individual frequencies (results not shown). Further confirmation of the effectiveness of the selective DNA pooling approach of this study comes from the consistency of trait-specific results obtained in the present study and those reported in 3 Web-based cattle QTL databases (see below).
The 10 sires were genotyped for each marker, and only markers that were heterozygous in the sires were subsequently genotyped in the pools of the sire. In the initial scan, one of the duplicates of each of the 4 sub-pools per sire-trait combination was genotyped. Following the initial analysis in suggestive QTL regions, the subpools were typed again with their duplicates to confirm the putative association, by an independent set of reactions.
Estimating Frequency of Sire Alleles in the Pools
To obtain frequencies of sire marker alleles in the pools, densitometric values were corrected for shadow bands as in Lipkin et al. (1998). The relative intensity of a shadow band with respect to the main band is a linear function of the number of repeats of the dinucleotide in the microsatellites. The number of dinucleotide repeats was estimated from the genotypes of the sires of the 10 studied families, plus 1 additional Brown sire and 10 Holstein sires, according to the methodology described in Lipkin et al. (1998). Only densitometric values of homozygous individuals or of heterozygous individuals for which alleles were separated by at least 4 repeats (8 bp) were used in repeat number estimation. Holstein sires were considered only when fewer than 2 Brown sires were available for repeat number estimation. In these cases additional Brown Swiss females were later genotyped to confirm results and to test for breed difference in allele repeat number calculation. For alleles for which suitable individuals were not available, interpolation was used.
Pool densitometry and estimation of frequency of long (l) and short (s) sire alleles were performed separately for each of the 2 subpools in each tail (see Appendix for details). Consequently, denoting the 4 subpools: H1 and H2, L1 and L2, respectively (H = high tail, L = low tail), a total of 8 allele frequency estimates were obtained for each sire-marker-trait combination, denoted H1l, H2l, H1s, H2s, L1l, L2l, L1s, L2s. It should be emphasized that there is no intrinsic relationship between the 2 replicate-subpools in the H tail and the 2 replicate-subpools in the L tail, and the assignment of indices 1, 2 within tails was arbitrary.
Frequency estimates for long and short alleles in replicate-subpools in the same tails were compared and their variance calculated. If the calculated variance within sire-trait-marker-replicate-subpool-allele was greater than 0.012 (about 4 times the observed empirical variance between replicate-subpools in the same tail; see appendix for definition of empirical variance), the test was tagged as an outlier, and PCR amplification and gel run were performed again on the same samples and on the duplicates of the samples. If variance among the duplicates was reduced below the above threshold, the new values were used. If variance remained high, the frequency estimates from the individual subpools were compared with those from other subpools (replicates and duplicates) of the same sire-marker-trait-tail combination, in an attempt to identify the outlier sub-pool. When this was successful, the test was recalculated excluding the data from the outlier subpool. When it was not possible to clearly identify a specific outlier subpool, the entire test was discarded. The outlier sub-pools were examined with respect to sire, trait, and tail to identify subpools that were problematic in multiple instances. In this case, the entire subpool was eliminated from all subsequent calculations, even when individual data points of that subpool were not aberrant.
Marker-QTL Linkage Tests
The test for a linkage between a marker and each of the 2 studied traits was carried out at 2 levels: i) the sire-marker level (within sire analysis), which tests for association of marker and trait at the level of the individual sire; and ii) the marker level (across sires analysis), which tests for association of marker and trait across all sires heterozygous at a marker.
Individual Sire-Marker-Trait Combinations.
For each sire-marker-trait combination for which the sire was heterozygous at the marker, the comparison-wise error rate (CWER), Pijk, for the ith sire-jth marker-kth trait combination was obtained as twice the area of the normal curve from |Zijk| to +
, with
![]() |
where Dijk represents the difference in sire-allele frequencies between the high and low daughter pools of the ith sire with respect to the jth marker and the kth trait, and SE(D') is the empirical standard error of Dijk. The Dijk and Se(D') were estimated as described in Appendix 1.
Individual Marker-Trait Combinations.
The CWER, Pjk, for the jth marker and kth trait combination was obtained as the area of the
2 distribution from
jk2 to +
, with
![]() |
where sjk is the number of heterozygous sires tested for the jth marker, kth trait combination (Weller et al., 1990, Lipkin et al., 1998; Mosig et al., 2001).
Experimentwise Linkage Tests Proportion of False Positives.
To overcome the multiple-test situation but retain power, the proportion of false positive (PFP) criterion (Mosig et al., 2001; Fernando et al., 2004) was used to set threshold CWER levels for declaration of significance at the sire-marker-trait and marker-trait levels. To calculate the PFP, it is necessary to first estimate the number of tests for which the null hypothesis is true (denoted n2) because, by definition, a false positive can come only from among this set of tests. Following Mosig et al. (2001), n2 was estimated by iteration from the frequency distribution of the obtained P-values. This was done separately for marker level tests (n2M) or for sire-marker level tests (n2S). Using these values for n2, PFP values were calculated separately for marker level tests (PFPM) and for sire-marker level tests (PFPs) as follows:
![]() |
where PMi is the CWER P-value of the ith marker test, RMi is the rank number of the ith marker test, when the marker tests are ranked by their P-values from lowest to highest; and similarly,
![]() |
substituting the corresponding expressions for sire-marker tests in place of marker tests.
Significance Thresholds.
For each trait, marker-tests having CWER P-values corresponding to PFPM
0.10 were considered to be significant. With respect to sire-marker tests, PFP values were calculated for all sire-marker-trait tests, whether the marker was significant. However, sire-marker tests were considered significant only if they had CWER P-values corresponding to PFP
0.20 and were also within significant markers. The less stringent threshold for the sire-marker level reflects an attempt to rebalance type I and type II error, taking into consideration the weaker power curve of the individual sire x marker tests compared with the marker tests. The less stringent threshold also takes into account the prior Bayesian probability of about 0.40 that a sire will be heterozygous at the QTL, given a significant marker.
Estimating Power of the Test
Because the PFP analysis for each trait provides an estimate of the number of tests involving that trait for which the null hypothesis is true (n2M or n2S), the total number of tests for which the null hypothesis is falsified [i.e., the number of tests for the trait representing true marker-QTL association (n1M) or sire-marker-QTL (n1S) association], can be estimated as
![]() |
and
![]() |
where NM and NS are the total number of marker or sire-marker tests, respectively, for each trait. On this basis, adapting the notation of Mosig et al. (2001) to the notation of the present paper, the power of the test for a particular trait can be calculated as
![]() |
for the marker level of analysis, and as
![]() |
for the sire-marker level of analysis, where oM and oS are the observed number of effects declared significant at the marker and sire-marker levels, respectively; and (1 – PFPM) and (1 – PFPS) are the proportion of true significant effects among all declared significant effects for marker and sire-marker tests, respectively; so that oM(1 – PFPM) and oS(1 – PFPS) are the observed number of declared significant effects at the marker and sire-marker levels, respectively, corrected for the proportion of true positives.
Allele Substitution Effects
Allele substitution effects were calculated as described in Lipkin et al. (1998) and Mosig et al. (2001), based on shadow-corrected allele frequency estimates. The population standard deviation of EBV needed for this calculation was estimated from the difference in the mean EBV of the individuals in the high and low pools, as described in Appendix 2. Allele substitution effects were calculated only for sire-marker combinations defined as significant following the above PFP criterion.
Identifying QTL Regions
An attempt was made to define chromosomal regions harboring QTL (QTLR) and chromosomal regions that did not harbor QTL according to the results of the genome scan to highlight them over the entire genome. All markers and sire combinations were considered in QTLR definition, thus adding novel information with respect to significant markers only. This information allows exclusion of chromosomal regions from further QTL search (e.g., to refine QTL location) in the same breed for MY and PP. Given that interval mapping methods that would permit analysis of multiple QTL on the same chromosome with selective DNA pooling are still under development (Dekkers et al., 1999; Dolezal, 2007), to identify multiple QTL on the same chromosome we used the qualitative approach described by Mosig et al. (2001). This approach is based on the analysis of all sires by marker combinations (significant and not significant) on the chromosome. It identifies as multiple well-separated QTL on a chromosome an interval where one or more sires showed, with respect to a marker, significance for more proximal markers, with clear lack of significance for more distal markers, and the opposite was true for other sires (i.e., nonsignificant at proximal but significant at distal). On the other hand it also identified as multiple separated QTL on a chromosome an interval where one or more sires showed significance at proximal and distal markers but not at the intervening markers, and the opposite was true for the other sires (as above). We identified QTL for MY and PP separately to define trait specific-QTL regions and then combined the 2 analyses defining as a unique chromosomal region across the 2 traits those where both traits showed significance. Some of the defined regions include more than 1 flanking marker indicating a chromosomal span, whereas others are based on a single marker.
Definition of QTLR and non-QTLR on the same chromosome was based exclusively on the results of the present study and did not include a comparative approach with literature as this would require a meta-analysis as developed by Khatkar et al. (2004), which is beyond the scope of this paper.
The QTLR here defined were compared with 3 Web-based QTL maps (http://www.vetsci.usyd.edu.au/reprogen/QTL_Map/; http://www.animalgenome.org/QTLdb/cattle.html; http://bovineqtl.tamu.edu/) reporting results of published literature. The first Web QTL map is based on the meta-analysis of Khatkar et al. (2004). The second (Hu et al., 2007) and the third (Polineni et al., 2006) collect and update all publicly available QTL reported for a large variety of traits.
Comparison with Web-based QTL maps (i.e., literature) is not straightforward because the approaches used to map QTL are different, and information on QTL description is quite variable (e.g., some QTL are reported as a point location based on a single marker, some are point locations presented with a confidence interval, others do not provide a point location but only suggest a possible QTL-containing region, and so on). In any event we indicated the corresponding QTLR as previously reported if in the databases a QTL location is reported (i) within plus or minus 5 cM from our QTLR defined as an interval, or (ii) within plus or minus 10 cM of our QTLR defined by a single marker.
| RESULTS |
|---|
|
|
|---|
|
0.012. For sire B, one of the high PP subpools and one of the low MY subpools were consistently aberrant and therefore were eliminated from the subsequent analyses. For sire A, more than 50% of the tests were outliers, yet specific aberrant pools were not identified. Nevertheless, sire A was also excluded from subsequent analyses. On average, for the remaining 9 sires, including sire B, less than 15% of the tests were discarded as outliers. After the editing, among the 976 sire-marker tests initially available for MY and 970 for PP, 776 and 762, respectively, were included in the analyses for marker QTL association. It is important to note the effective manner in which the replicated subpools of each tail identified outlier values. These would have been retained in the analysis had this been based on only a single pool per tail. After elimination of outliers, the SE(D') was reduced to 0.056 and this value was used to calculate the Z-test statistic. The empirical value of 0.056 obtained for SE(D') was considerably larger than the value of 0.043 calculated according to the deterministic procedure of Lipkin et al. (1998), apparently because it accounts for sources of errors such as sample cell count in addition to those included in the deterministic procedure (technical error and binomial sampling).
The Proportion of False Null Hypotheses Among All Null Hypotheses
Across both traits, there were a total of 340 marker-trait tests and 1,538 sire-marker-trait tests (Table 3
). Both at the marker level and sire-marker level, there was a marked excess of low (significant) P-values, compared with the null-hypothesis expectation of equal numbers (i.e., 10 % of all tests) in each bin. The excess of low P-values was less evident at the sire-marker-trait level than at the marker-trait level. This is due to the fact that when a sire is tested at a marker in linkage to a QTL, a significant sire-marker effect is obtained only if the linked QTL is also in the heterozygous state and fewer data are available for each sire, compared with the data available across sires. At both the marker-trait and sire-marker-trait levels, the excess of low P-values was similar for PP and MY.
|
Critical CWER P-Values and Power of the Tests
At both test levels (marker and sire-marker) critical PFP values were similar for PP and MY, but critical P-values at the marker level were about twice those at the sire-marker level (Table 3
).
Marker Level.
There were a total of 340 tests, of which 118 were significant at PFP
0.l0 (Table 4
). Of these, 60 markers were significant for MY and 58 were significant for PP. The estimated numbers of marker-QTL linkages (Table 3
) were 86 for MY and 96 for PP. The resulting power of the marker tests (Pm), adjusted for presence of false positives, was 0.63 and 0.54 for MY and PP, respectively.
|
0.20. These can be divided into 2 groups. The first, representing 534 tests (34.7% of the total), includes tests within the 118 marker-trait combinations that were significant at PFP
0.10. Among these, 169 sire-marker tests showed significance at PFP = 0.20. The second group of tests, comprising 1,004 tests (65.3% of the total), includes tests within the 222 marker-trait combinations that were not significant at PFP
0.10. Among these, there were only 35 significant sire-marker-trait tests at PFP
0.20. The total estimated numbers of sire-marker combinations heterozygous at the QTL (n1S, listed as n1 under sire-marker in Table 3
Relationships Between the Two Traits
Of the 168 markers tested for both traits, 75 (44.65%) were not significant for either of the traits (Table 4
). Of the 93 markers significant for at least 1 trait, 36 (38.7%) were significant at PFP
0.1 for MY alone, 33 (35.5%) for PP alone, and 24 (25.8%) for both (Table 4
). Significant markers were distributed over 24 chromosomes (all except for BTA5, 15, 24, 25, 29) for MY; and 26 chromosomes (all except BTA9, 15, 18) for PP.
Of the 663 sire-marker combinations tested for both traits, 504 (76.0%) were not significant for either trait (Table 4
). Of the 159 significant sire-marker combinations, 76 (47.8%) were significant for MY, 73 (45.9%) were significant for PP, and only 10 (6.3%) were significant for both traits (Table 4
). Part of the disproportion between significant tests for both traits obtained within family and those found across families can be attributed to the lower power of the sire-marker tests, which are only about half those of the marker tests. Nevertheless, here too, the large number of tests significant for a single trait only indicates that at least some of these loci are indeed specific to a single trait. Thus, the indication from both levels of analysis is that a significant proportion of loci affects both MY and PP, but an appreciable proportion of loci affects a single trait only.
Among the 10 sire-marker combinations for which both traits were significant, 9 showed opposite effect direction, as expected from the biometrical relationships between milk yield and protein percent. However, one of the sire-marker combinations showed effect estimates of the same sign.
Significant Markers and Sire-Marker Families
Tables 5
and 6
have results for significant MY (Table 5
) and PP (Table 6
) single marker tests. When the marker was significant at PFP < 0.10 but none of the sire-marker tests was significant at the PFP < 0.20 level, an allele substitution effect was not calculated. This occurred for 5 of the 60 significant markers for MY, and for 6 of the 58 significant markers for PP. For marker Map2C on BTA19 for PP there was only a single heterozygous sire having CWER P-value of 0.0395. This was significant at the marker level (critical P-value for PFP
0.10 = 0.073), but just below significance threshold at the sire-marker level (critical P-value for PFP
0.20 = 0.036).
|
|
Defining QTLR
In the present study, a total of 55 QTLR affecting MY, PP, or both were identified (Table 7
). These QTLR were distributed across 28 of the 29 bovine autosomes (all, except for BTA15). On 10 chromosomes a single QTLR was identified and 9 chromosomes showed evidence for 2 and 3 QTLR. Of the total of 55 QTLR, 16 (29%) were for MY alone, 13 (23.6%) were for PP alone, and 26 (47.2%) were for MY and PP. The proportion of regions affecting both MY and PP is considerably higher than that found for the individual marker level and sire-marker level tests. This is due to the fact that for many of the QTLR multiple markers were tested, increasing the power of the test, so that loci had more of an opportunity to show significance for both of the traits. Nevertheless, an appreciable number of regions affecting only one of the traits remain, suggesting that at least some of these QTLR are indeed limited in their effect to only one of the traits. This has important implications for marker-assisted selection which will be considered in the Discussion section. For convenience, Table 7
also compares the QTLR identified in the present study with those reported in the 3 Web databases. This, too, will be addressed in more detail in the Discussion section.
|
| DISCUSSION |
|---|
|
|
|---|
Of the 43 QTLR found in the present study that also had reported results in one or more of the 3 Web databases, 10 affected PP only, 8 MY only, and 25 both MY and PP. Remarkably, all 10 QTLR of the present study affecting PP only and all 8 QTLR of the present study affecting MY only were also reported in the Web databases as affecting the same single trait only (see later for further discussion of these findings), whereas of the 25 QTLR affecting both traits in the present study, 11 were reported also in the databases as affecting both MY and PP, 10 were reported as affecting PP only, and 4 as affecting MY only. Nine of the 12 QTLR identified in the present study that were not represented in the databases, and 10 of the 18 QTLR that were represented in a single database only, were assigned to chromosomes BTA16 through BTA29. These are the smaller bovine chromosomes and have been less intensively studied than chromosomes BTA1 through BTA15. Thus, in some part, the lack of representation of the results of the present study in the databases may be due to paucity of reported results in the literature for these chromosomal regions. In this study QTLR affecting both MY and PP, for which effects on only one of the traits (PP or MY) were reported in the databases, were found. Part of this discrepancy can be attributed to sampling variation. However, in 2 instances (BTA13 region 1 and BTA14 region 4) all 3 databases reported effects on PP only, and the present study found effects on both PP and MY. This may indicate different effects of the alleles segregating in the Brown Swiss, compared with those segregating in the populations (close to 100% Holstein) screened for the databases, but could also represent false positives. Because the databases draw partly on the same literature, they are correlated.
Aside from the few differences outlined above, the overall concordance between the results presented in this study and those available in the Web databases is very high and indicates that most of the QTL segregating in the reference populations (mainly Holstein) from which the databases are drawn are also segregating in the Brown Swiss population. The high general correspondence in QTLR distribution between the Brown Swiss and that reported in the databases is somewhat unexpected. The origin and selection history for the Brown Swiss and the Holstein breeds differ. The Holstein breed originated in the lowlands (the Netherlands) of Europe and has been generally selected to increase milk yield, and only in the last 2 decades or so have milk quality and functional traits become a target for selection. The Brown Swiss was originally a dual-purpose Alpine breed, and only in the past 4 decades has it been selected primarily for milk production, retaining however, a strong emphasis on functional traits, with weighting about 30% in the selection indices of the Italian Brown Swiss population, and as much as 45% in the German and Austrian Brown Swiss populations.
Although we might have expected somewhat greater differentiation between QTL found in the Brown Swiss and those reported in the Web databases for the Holstein than actually observed, the present results suggest that the same genes may be involved in genetic variation in milk production in the 2 breeds. The appreciable difference in production levels between them would then be more a matter of allele frequencies at the responsible genes than differences in the genes that are segregating. This would be consistent with the fact that what really differs in the selection history of the 2 breeds is primarily the intensity of selection for the 2 traits here investigated and not the direction of selection.
Nevertheless, some of the novel QTLR reported here, and particularly the 2 above mentioned QTLR with different effects in the Web databases and the present study (BTA13 region 1, and BTA14 region 4) may represent true differences in the QTL segregating in the Brown Swiss and Holstein populations.
Specific QTLR for PP and for MY and MAS
The complete concordance of results between the present study and the databases with respect to QTLR with effects limited to MY or PP supports the assumption that these specific QTLR are, indeed, affecting only 1 of the 2 traits, PP or MY. Furthermore, of the 43 QTLR of the databases that had corresponding QTLR in the present study, 11 had effects on both MY and PP. All 11 of these QTLR corresponded to QTLR of the present study that also affected both traits. Thus, this supports the assumption that these QTLR do indeed have effects on both traits. Thus, of the 43 QTLR reported in the present study and also in the databases, 29 (10 for PP, 8 for MY, and 11 for PP and MY) show complete concordance between the present study and the databases. Considering that the present study is completely independent of the databases, these QTLR and their specific effects can be regarded as confirmed QTL, using the terminology of Lander and Kruglyak (1995). With respect to the remaining 14 QTLR for which the present study reported effects on both traits, but the databases reported effects on a single traits only, part may represent incomplete power of the experiments reported in the databases (it should be noted that all 3 databases are based mostly on the same literature, and hence are highly correlated), but some may indeed, as noted above, represent different effects in the Brown Swiss breed.
The loci that apparently affect PP only are of particular interest because they would appear to provide an avenue to increase milk protein yield through MAS without increasing milk yield and consequently without increasing functional stress on the cow. These loci should be prime candidates for high resolution mapping followed by candidate gene analysis. Similarly, the loci that affect MY without affecting PP would appear to provide loci that could act powerfully to increase overall protein yield. In contrast, the many loci that affect both MY and PP would be expected to act in opposite directions on the 2 traits, so having little net effect on protein yield.
Selective DNA Pooling – The Case of the Brown Swiss as a Paradigm for an International QTL Mapping Project in the Cosmopolitan Minor Breeds
Mapping QTL in minor cosmopolitan breeds having a different genetic makeup than the predominant breeds due to different selection criteria and environment, can lead to the detection of QTL that are not segregating in the Holstein population and paves the way to detect genes related to very important economic traits under selection in specific genomic regions.
Generally, the limited size of outbred populations structured in large half-sib families is a limiting factor for mapping QTL for economically important traits in these minor breeds. In the present experiment, an international sampling design was used to collect DNA samples of individuals from 10 sire families across national boundaries. International sampling of the largest families was carried out evaluating the family size jointly for the 3 countries involved in the project, Austria, Germany, and Italy. In Europe some autochthonous breeds in the Alps (e.g., Alpine Gray) represent a similar situation: they are located across countries, and as a population as a whole they have a sufficient numbers and family structure for a QTL mapping study based on daughter design, but not if they are considered as resource population within any single country.
One concern of mapping QTL with a DD is the possibility to have available EBV of each daughter on the same scale base. However, an international EBV service is not presently available for female populations. Even if the cosmopolitan minor breed mainly comes from the same original strain, the different selection goals may have changed the genetic background of the female populations. The selective DNA pooling design used in this project can overcome the necessity to convert phenotypes to the same scale to use them jointly in the association analyses. Pooling individuals from different genetic strains does not need the direct comparison of EBV calculated by different systems and methods. In fact, selective DNA pooling allows mixing of individuals from the trait-tail of each country distribution of the same sire family, taking a proportional number of individuals according to country origin.
This approach is based on the assumption that the different populations are at the same genetic evolutionary moment (i.e., active male reproducers and close female individuals ancestors are of the same generation), or in other words that the average genetic value of the sire family in country A is equivalent to the average genetic value in country B. This assumption deals very effectively with the reality of the international market of semen that has made the same genes available across many national populations for several decades. Calculation of EBV for the daughters on an international scale is only needed for the estimation of allele substitution effect and not to test the QTL association. This is not the case for a DD with individual genotyping, for which association is detected with multiple regression approach and interval mapping.
| CONCLUSIONS |
|---|
|
|
|---|
Even if the QTL information presently available in the Brown Swiss population can be utilized only in a within family MAS approach, we believe that the possibility for the breed organizations to use available knowledge of detected QTL should motivate them to implement a structure for a genomic selection of individuals: new expected findings in gene identification can be expected to increase the efficiency of genomic assisted selection in the near future (Meuwissen et al., 2001, Schaeffer, 2006). In the Brown Swiss population the limited size of each national breeding program may motivate the stakeholders of the selection to join their effort internationally to increase their chances for success in the application of genomic technologies. Today the stakeholders of the genetic improvement in dairy cattle have indeed the possibility to be the prime movers in the gene discovery activity and in their use for selection in livestock. The confirmed QTLR identified in the Brown Swiss and the 3 Web-based databases represent a basic resource for such gene discovery.
| APPENDIX 1. ESTIMATION OF DIJK AND SE(D') |
|---|
|
|
|---|
![]() |
where (omitting the subscripts ijk for clarity), D1 = [(H1l – L1l) – (H1s – L1s)]/2 is the difference in frequency of sire marker alleles between the high and low tails, averaged across the 2 sire marker alleles, for a single pair of arbitrarily chosen subpools, one from each tail; and
![]() |
is the difference in frequency of sire marker alleles between the high and low tails, averaged across the 2 sire marker alleles for the remaining pair of subpools, one from each tail.
The minus sign of the short alleles (H1s, L1s, H2s, L2s) in the expressions for D1 and D2 derives from the fact that in the event of marker-QTL linkage, selection will have opposite direction of effect on frequency of the alternative sire alleles in the 2 tails. In the initial scan, one of the duplicates of each of the 4 subpools per sire-trait combination was genotyped, and each Dijk was based on frequencies estimated on these 4 subpools per sire-marker-trait combination. In suggestive QTL regions, the subpools were typed again with their duplicates, producing a Dijk based on 8 subpool analyses, for a replicated and more robust comparison-wise marker test. In this case allele frequencies of each tail were obtained by averaging the corrected intensity (Lipkin et al., 1998) of the 2 duplicate subpools of each replicate subpool.
Estimating SE(D'), the Empirical SE(D)
Because pool densitometry and estimation of the frequency of the L and S sire alleles was performed separately for each of the 2 replicate subpools in each tail, it was possible to use the variance among D values estimated from the 2 replicate subpools of the same tail to provide an empirical estimate of SE(D), denoted SE(D'). The estimate of SE(D'), computed across markers and traits, was used in the present study in place of the rather complex deterministic calculation of SE(D) used in Lipkin et al. (1998) and Mosig et al. (2001). The empirical SE(D) was based on a tail D-value denoted DH for the H pools or DL for the L pools, obtained between the 2 independent subpools at the same tail for the same allele.
According to the notation in the text, calculation of DH and DL is as follows:
![]() |
and
![]() |
The DH and DL values include all of the factors causing sampling variation between high and low subpools of the same allele (e.g., between H1l and L1l; that is, all sources of technical error, binomial sampling, and other undefined factors). But because the DH, DL values are from the same tail, they do not include effects due to QTL linkage. Thus, the DH, DL values represent D values for a single pair of high-low pools under the null hypothesis: H0 = absence of marker-QTL linkage. Note, that DH and DL have exactly parallel structure to D1 and D2, except that the 2 elements separated by a minus sign come from the same tail; whereas for D1 and D2, the 2 elements separated by a minus sign come from alternative tails.
Whenever DH or DL were larger than 0.2, the pools used in their estimation were considered as outliers and eliminated from the subsequent calculations only for this specific marker (a total of 24 DH and DL values were eliminated in this manner).
Because the actual D-values used to test marker-QTL linkage for any particular sire-marker-trait combination, have the composition D = (D1 + D2)/2, the corresponding D' expression, substituting DH for D1 and DL for D2, has the composition
![]() |
In principle, SE(D') is an estimate of SE(D) under the null hypothesis. In practice, SE(D') was estimated from SE(DH,DL), the pooled standard deviation across all DH and DL values, as
![]() |
Because it is based on many more degrees of freedom, SE(D') calculated in this way was found to provide a more stable estimate of SD(D') than SE(D') calculated directly from D' values. This SE(D') was used in all subsequent calculations of Z values.
| APPENDIX 2. ESTIMATION OF EBV STANDARD DEVIATION |
|---|
|
|
|---|
i, where IP is the selection intensity corresponding to selection of the extreme P proportion of the population, and
i is the population standard deviation for trait i (Falconer and Mackay, 1996). Based on this, it is possible to estimate the within-sire population standard deviation for the studied traits, from the known proportion selected to the high and low tails (Pi), and the observed difference between the mean of the high and low tails (Ti). The within-sire standard deviation is an underestimate of the population standard deviation, because it includes only three-fourths of the genetic variance. Thus, assuming that EBV variance is completely genetic (i.e., EVB accuracy equal 1), the estimates of the within-sire standard deviation in EBV values were increased by a factor of 1/(0.75)0.5 = 1.15 to give an estimate of the population standard deviation of EBV values.
In the ideal case, when the accuracy of EBV is 1, SD of EBV will be equal to the genetic SD. When accuracy is less than 1, estimates of allele effects are expected to be regressed to a similar degree as the SD of EBV. Given the average accuracy of daughter EBV is 0.49, this undoubtedly gives a somewhat conservative estimate of the standardized allele substitution effects.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
Received for publication July 6, 2007. Accepted for publication October 5, 2007.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
H. Khatib, W. Huang, X. Wang, A. H. Tran, A. B. Bindrim, V. Schutzkus, R. L. Monson, and B. S. Yandell Single gene and gene interaction effects on fertilization and embryonic survival rates in cattle J Dairy Sci, May 1, 2009; 92(5): 2238 - 2247. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Russo, L. Fontanesi, E. Scotti, F. Beretti, R. Davoli, L. Nanni Costa, R. Virgili, and L. Buttazzoni Single nucleotide polymorphisms in several porcine cathepsin genes are associated with growth, carcass, and production traits in Italian Large White pigs J Anim Sci, December 1, 2008; 86(12): 3300 - 3314. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Lipkin, A. Bagnato, and M. Soller Expected Effects on Protein Yield of Marker-Assisted Selection at Quantitative Trait Loci Affecting Milk Yield and Milk Protein Percentage J Dairy Sci, July 1, 2008; 91(7): 2857 - 2863. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Lipkin, R. Tal-Stein, A. Friedmann, and M. Soller Effect of Quantitative Trait Loci for Milk Protein Percentage on Milk Protein Yield and Milk Yield in Israeli Holstein Dairy Cattle J Dairy Sci, April 1, 2008; 91(4): 1614 - 1627. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |