|
|
||||||||
Department of Animal Science, Iowa State University, Ames 50011
Corresponding author:
G. Abdel-Azim; e-mail:
gaazim{at}iastate.edu.
| ABSTRACT |
|---|
|
|
|---|
The effect of using QTL information on the genetic response, the frequency of the favorable QTL allele, and the accuracy of evaluation were examined. Moreover, the effect was studied in four distinct paths of selection: active sires, proven young bulls, bull dams, and first-lactation cows. Average superiority values of 4.6, 7.6, 11.7, and 1.1% for genetic response were observed over 16 yr of selection for active sires, young bulls, bull dams, and first-lactation cows, respectively. Frequency of the favorable QTL allele changed faster in bull dams than males, and was the slowest in first-lactation cows. Finally, accuracy of evaluation under the QTL-assisted selection scheme was higher than under the QTL-free selection scheme. Young bulls of the QTL-assisted selection scheme on average had 0.049 higher accuracy, and first-lactation cows had on average 0.185 higher accuracy than corresponding animals of the QTL-free selection scheme.
Abbreviation key: AS = active sires, BD = bull dams, BV = breeding value, EBV = estimated breeding value, FLC = first lactation cows, GFD = gene frequency difference, MAS = marker-assisted selection, QAS = QTL-assisted selection, QFS = QTL-free selection, YB = young bulls
Key Words: dairy cattle breeding scheme quantitative trait loci selection
| INTRODUCTION |
|---|
|
|
|---|
As a result of these recent discoveries, studying the impact that known loci might have on the efficiency of selection becomes essential. The impact of using QTL information in selection has been studied either analytically or by means of computer simulation. Lande and Thompson (1990) analyzed the utilization of marker-assisted selection (MAS) in classical schemes of individual and family selection and derived optimal indices combining phenotypic and genomic information to maximize genetic gain. Meuwissen and van Arendonk (1992) used a deterministic simulation model to test the additional genetic gain of MAS within family and found that MAS did not increase rates of genetic gain when applied to conventional progeny testing schemes. Spelman et al. (1999), also using a deterministic simulation model, found limited benefit of MAS in conventional progeny testing schemes. An increase in rate of genetic response of 1.8% over a base line scheme with no markers was found when 10% of the genetic variance was associated with the marked QTL. The percent increase became 3.7 with marked QTL variance of 20%. Both Meuwissen and van Arendonk (1992) and Spelman et al. (1999) used a deterministic model to predict the asymptotic genetic gain, ignoring both the reduction in genetic variance from inbreeding and the change in the QTL allele frequency.
Spelman and Garrick (1998) evaluated two within-family MAS schemes using stochastic simulation for a locus that explained 17% of the genetic variance. A maximum increase of 9% over a non MAS scheme was found when 40 progeny were simulated for each bull dam. Although Spelman and Garrick (1998) stochastically simulated animals, breeding values were not estimated from data; instead, estimated breeding values were simulated by simulating prediction errors. Also, genetic responses were from a single cycle of MAS, i.e., not from several generations.
Ruane and Colleau (1995) conducted a simulation study of six discrete generations of selection in a closed small population. Single-stage selection was applied to each generation after evaluating animals based on a conventional animal model and a MAS-based animal model (Fernando and Grossman, 1989). Although MAS schemes in general were found to be superior to non-MAS schemes, the study reported inferior MAS polygenic responses to polygenic responses obtained from non-MAS schemes. QTL responses from MAS schemes were, however, distinctively higher than those from non-MAS schemes. The inferior MAS polygenic response was attributed to a reduced accuracy of evaluating the polygenic effects under MAS schemes (Ruane and Colleau, 1995).
Most of the simulation studies found in the literature have used simple breeding structures and small populations with selection for a number of nonoverlapping generations, except with deterministic simulation in which more realistic population structures and overlapping generations were implemented. Also, simulated QTL were rarely estimated by statistical methods but were simulated, except for small populations (e.g., Zhang and Smith, 1993; Meuwissen and Goddard, 1996; Ruane and Colleau, 1995). In other cases, marker or QTL genotypes were directly used in selection in a separate step (e.g., Spelman and Garrick, 1998; Edwards and Page, 1994).
The objective of the current work is to develop a stochastic simulation model that is based on average parameters of the US Holstein population to evaluate the maximum superiority of a QTL-assisted selection (QAS) scheme over a QTL-free selection (QFS) scheme. QAS is based on complete information about a known QTL. Selection is performed for any number of years, 16 yr in the current study, with overlapping generations in a large population. Breeding values and genotype effects are estimated by an animal model.
| MATERIALS AND METHODS |
|---|
|
|
|---|
![]() | [1] |
and based on the simulation model, a general model for analysis can be formulated in matrix form as
![]() | [2] |
where b is a vector of fixed effects, a is a vector of random additive genetic effects, and X and Z are incidence matrices relating records in y to fixed and random effects, respectively. Further,
where
2aA is the variance covariance matrix of a with A representing the relationship matrix among animals. The diagonal matrix I
2e explains the nature of the residual variance which followed from the way they were simulated, i.e., elements of e, {ei}, were simulated as independent identically distributed N(0,
2e). Additive genetic effects, a, were simulated under the assumption of a finite number of loci controlling milk yield. Fixed effects were simulated by adding constants to the record of each individual that corresponded to its levels of fixed factors.
Values of Fixed Effects and Variances of Random Effects
To simulate a record, appropriate values for the components in equation [1] had to be devised. Kuhn et al. (1994) computed the phenotypic variance of milk records by averaging the within-parity variances reported by Hansen et al. (1983) and adding to it an extra (249 kg)2 due to parity effect. The total phenotypic value was partitioned to 40% due to fixed factors and 60% due to random factors (Hansen et al., 1983; Kuhn et al., 1994). Of the 40%, due to fixed components, the herd effect accounted for 22% and the year effect for 6%. Other fixed factors accounted for the remaining 12%. Similar proportions were used in the current simulation model.
Fixed factors considered in the current simulation were management group and parity. Herd, year, and season subclasses constituted a management group, thus there were four fixed factors whose levels made up the columns of X in [2], a column for each level. Values for herd levels were sampled as in Kuhn et al. (1994) from a normal distribution with mean 0 and variance 0.22 (913 kg)2, where (913)2 is the total phenotypic variance used in the current study. Herd effects were drawn once from the described normal distribution, then they were randomly assigned to base animals. In subsequent years of simulation, a heifer was assigned the same herd effect as her dam.
Six seasons were considered in the current simulation model. Each two consecutive months starting with January constituted the six season levels whose values in kilograms were 179, 68, –158, –318, 43, and 187, respectively. For the parity effect, a value of –249 kilograms was added to first-parity records, and a value of 249 kilograms was added to latter records (Kuhn et al., 1994). A year effect sampled from a normal distribution with mean 0 and variance 0.06(913 kg)2 was added to all records generated in this year.
Finally, of the 60%, due to random factors, the additive genetic variance accounted for h2(0.6)(913 kg)2, and the residual variance accounted for the rest, i.e., (1-h2)(0.6)(913 kg)2. In addition, the QTL in the current study explained 15% of the additive variance, hence the QTL variance was computed as h2(0.15)(0.6)(913 kg)2, and the polygenic variance was computed as h2(0.85)(0.6)(913 kg)2.
Simulating Breeding Values
Breeding value for milk yield was simulated as the aggregate effect of 40 loci in addition to one QTL. The 40 loci, that will be henceforth referred to as the polygenes, were simulated independent of each other and of the QTL. Both the polygenes and the QTL were assumed to have two alleles at each locus. The aggregate polygenic effect was computed as the sum of individual breeding values (BV) associated with each locus, and the total BV of an individual was simulated by adding its aggregate polygenic effect to the BV of the QTL that the individual carries.
Breeding values associated with single loci for base animals.
The simulation program requires known proportions for both the polygenic and the QTL additive genetic variances to simulate their corresponding BV. For a single locus of the polygenes, the following steps are taken by the program. First, compute genotypic values in the absence of dominance by taking the square root of
is the polygenic variance that is divided by 40 to obtain the portion of variance contributed by one locus of the 40, and pi is the frequency of the favorable allele at the ith locus in the base population. Because the distribution of the pi values is unknown, they were sampled from a uniform distribution with parameters 0 and 1 (Unif(0,1)). The equal-variance assumption for the polygenes was made to avoid accidental assignment of big variance portions to any of the polygenes in the case of an alternative approach that randomly assigns variance portions to the loci.
The second step is to compute BV according to Falconer (1989) as 2ai(1 – pi) and –2aipi for homozygotes and ai(1 – 2pi) for the heterozygote, where ai refers to the genotypic value associated with the ith locus of 40. The same procedures are followed to simulate the BV of the QTL according to the QTL proportion of the genetic variance. As previously mentioned, the QTL was assumed to explain 15% of the genetic variance, i.e., 49568.588 kg. Given that the frequency of the favorable QTL allele in the base population equals 0.1, the QTL genotypic value in the absence of dominance equals 353.429 kg, which is 0.39 of the phenotypic standard deviation.
Breeding values for progeny.
Genotypes of progeny are constructed from gametes sampled at random from parents genotypes. A gamete is sampled by drawing with a 0.5 probability one of the two alleles that a parent carries at each locus including the QTL. After forming progeny genotypes, the approach described earlier for simulating BV associated with single loci for base animals was followed to simulate BV for progeny.
Population Structure and Dynamics
Generally, the simulation addresses an advanced dairy breeding system that utilizes current strategies in selecting dairy sires over two major stages of selection. The first stage includes searching for elite parents to produce young bulls, and the second stage is selecting the best of the progeny tested bulls to become active.
Eighty herds were used in the current study. Herds usually vary in size with frequency that is high for moderate-size herds. This was not considered here since no effect in our model related to herd size. Table 1
summarizes the parameters used in the current simulation.
|
Generating new cow replacements was done by going through the whole list of females and checking age suitability for breeding. If a female was older than 1 yr of age, then her parity was determined; and if a draw from a Unif(0,1) was less than or equal (1 – the culling probability given parity), then she stayed. If a female stayed, she produced either a male or a female with equal probabilities. If she produced a male, a new record was generated for her, and she proceeded to the next parity. If she produced a female, she also received a new record and proceeded to the next parity; in addition, her new female offspring was simulated after picking a sire at random from the list of active sires or from the progeny-test bulls. The new female offspring was added to the list of females that the program kept and updated each year.
Bulls.
The number of young bulls tested annually was set to 21, and the number of active sires was 10. Every year, the top 50% of the active sires are mated to the top cows to produce the 21 young bulls. The process of producing a male is entirely stochastic and similar to what was described about generating female replacements. First, the best female is picked from the milking cow population. Then, based on culling probabilities given parity, the program decides whether she stays or is culled after a draw from a Unif(0,1). If she stays, she is mated to a random sire from the top 50% of the active sires. But the probability of a male is only 50%, thus a young bull is generated if a draw from a Unif(0,1) does not exceed 0.5. The next male is generated by repeating the same steps after drawing the next best suitable female.
Selection Criteria
The simulation program is designed so that selection is always based on an estimated BV (EBV) obtained differently under QAS and QFS. Codes for the simulation and estimation programs may be obtained from Abdel-Azim (2002). Selection is needed first when selecting sires of the future young bulls by selecting the elite 50% of all active sires based on their EBV. Selection is also needed when the list of active sires is set each year by selecting the top number of active sires out of all males evaluated every year. Males evaluated each year include the active sires of the previous year and the proven young bulls of the current year. Finally, selection is needed when bull dams are chosen by sorting the whole list of milking cows based on cows EBV, and the best 21 eligible females are selected. The following explains the two selection criteria that were the basis for selection throughout selection practices examined in the current study.
QTL-free selection.
This study is intended to compare QTL-assisted selection to current selection practices when implemented in dairy cattle. Current selection practices were simulated in our model as conventional two-stage selection. Active sires for each year were selected based on the animal model, [2], in which elements of the vector a represent the total additive effects of evaluated animals. Each year, after the fifth, proofs of a new set of bulls are computed based on first lactation of their daughters.
QTL-assisted selection.
It is assumed that loci affecting milk yield could be physically mapped. Genotypes of loci with large effects can be expressed as independent fixed factors in a mixed linear model, a strategy that provides unbiased estimates of single gene effects when genotypes can be identified for all individuals with observations on the trait (Kennedy et al., 1992). This could be formulated by adding a new factor pertaining to genotypes to model [2] as follows
![]() | [3] |
where Xg is an incidence matrix relating genotypes to animals with observations and g is a vector of genotype effects. Other elements of [3] are defined as in [2], except that u is a vector of additive genetic effects with the known QTL excluded, i.e., only the polygenic effects.
The selection criterion investigated here was selecting based on EBVi = ûij +
j where ûij and
j are estimates of the polygenic effect of the ith animal that carries the jth genotype and the QTL effect of the jth genotype, respectively, equation [3]. In the current simulation, all individuals were assumed to have known genotypes for a QTL segregating in the population across families.
Aspects Examined
Sixteen years of selection were simulated. This is about three cycles of selecting young bulls. In the first 5 yr, no analysis was performed because young bulls generated during the first 5 yr were still waiting for their first daughter crop to finish a lactation. In addition, only one set of active sires was simulated at random in the first year and continued to be used throughout the early 5 yr. Hence, there was no genetic improvement in the first 5 yr of selection. In the results section, we do not report selection results of the first 5 yr.
For both QAS and QFS, three aspects were addressed: cumulative genetic response, selection pressure, and accuracy of evaluation. These aspects were computed in four distinct groups of animals: active sires (AS), proven young bulls (YB), bull dams (BD), and first-lactation cows (FLC). The advantage of this to designers of breeding programs is to know the pathways of selection that may benefit the most under QAS schemes and also to shed some light on the relationships between trends of genetic response and accuracy of evaluation or selection pressure on the QTL within pathways of selection.
Cumulative genetic response is the most important aspect among the three aspects investigated in the current study. Every year, active sires and bull dams were first selected based on their EBV, and then their true breeding values were averaged. In addition, true breeding values of the proven young bulls and the first-lactation cows were averaged, as well. The average of the true breeding value was used in the current study to indicate the cumulative genetic response. Cumulative genetic responses were presented in terms of percent superiority of specific animals of a selection pathway under the QAS scheme over corresponding animals of the QFS scheme. Mean percent superiority was computed as
![]() | [4] |
where QASi and QFSi are the true breeding values averaged for any of the four concerned groups of animals, namely AS, YB, BD, and FLC of the ith replicate of 50. In early years, QASi and QFSi may differ in sign, i.e., negative response for QASi and positive for QFSi or vice versa, although this situation was rarely found in the current study. To avoid computing a negative percent superiority for the QAS scheme when QASi is greater than a negative QFSi, division was by the absolute value of QFSi which is the component |QFSi| in formula [4]. Division by the absolute value is also important if QFSi is greater than a negative QASi. In addition, the multiplication by 2 in the formula resulted from the multiplication by 100 to convert to percentages and the division by 50 to average over all replicates.
Second, the correlation coefficient between true and EBV was computed separately for AS, YB, BD, and FLC every year and taken to indicate accuracy of EBV under QAS and QFS schemes. Finally, the frequency of the favorable QTL allele was computed within each of the four groups and taken to indicate selection pressure imposed on the QTL in pathways of selection.
The two schemes, QAS and QFS, were replicated 50 times. Results for all replicates were averaged, and standard errors and estimated confidence intervals of the computed means were evaluated, when necessary. In the current simulation, mixed model equations were solved every year using the true variance components to obtain BV estimates. Given that simulation was replicated for 50 times, the equations had to be set and solved 550 times for each scheme each year. In addition, all ancestral and collateral pedigree information were used in a complete relationship matrix inverse which, in turn, produced a large number of equations over the years of simulation. This was a major computational task that limited our resources to only 50 replicates per scheme.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
Cumulative Genetic Response
Active sires.
On average of all replicates and years, AS of the QAS scheme were 4.6% higher than AS of the QFS scheme for cumulative genetic response. Table 2
summarizes percent superiority of AS of the QAS scheme over AS of the QFS scheme. Percent superiority started low, then increased and reached a maximum in the third cycle of selection, and finally started to decrease again at the end of the third cycle of selection.
|
Young bulls.
Only results of yr 10 to 16 are shown in Table 3
for the proven YB for the following reason. The YB of both QAS and QFS schemes were exactly the same until yr 9. Proven YB of yr 6, 7, 8, and 9 were born in yr 1, 2, 3, and 4, respectively, and were sampled for progeny testing at one year of age in yr 2, 3, 4, and 5, respectively. Estimated BV were not available for animals before yr 6, and selection of parents of young bulls was at random. Because seed numbers used for random number generators were the same for the QAS and QFS schemes, the same set of parents and the resulting young bulls were sampled for both schemes until yr 5.
|
The variation across the 50 replicates was greater than the corresponding variation in AS. YB are expected to have more variation than AS due to their larger number (or equivalently their smaller selection intensity) and their less accuracy of evaluation than AS. The range of variation extended from the worst group of proven YB under the QAS scheme that had an inferiority of 26.7% to the best group that had a superiority of 80.6% relative to corresponding animals of the QFS scheme.
Bull dams.
Percent superiority of BD of the QAS scheme over BD of the QFS scheme is presented in Table 04
. Percent superiority in BD was the highest among the four groups of animals studied. On average of all replicates and years, BD of the QAS scheme was 11.7% higher than BD of the QFS scheme. The superiority ranged from a minimum of 8.8% at yr 8 to a maximum of 14.5 at yr 15. The trend of starting low superiority that increased to a maximum during the last cycle of selection and then dropped was similar to what was found for AS and YB.
|
First-lactation cows.
Cows in general benefited the least from using QTL information partly because not enough time existed for parental superiority to be expressed in the cows. On average, of all replicates and years, superiority of 1.1% was found for the FLC of the QAS scheme over FLC of the QFS scheme. However, a high standard error of 1.1 for the superiority average, 1.1%, was also found. As a result, the mean value of superiority of response in FLC was not significantly different from 0. The 95% confidence interval for the mean was (–1.002, 3.180). Table 5
shows the frequency distribution of percent superiority for all datasets. The frequency distribution shows a greater number of positive percent superiority than negative. Table 5
, however, shows two outlier values: –500% and 143%. Outliers were common in FLC in yr 6 to 9 of selection because of sign differences between genetic responses obtained from QAS and QFS schemes. For example, the –500% value resulted from an average BV of –1.814 kg for the QAS scheme and 0.454 kg for the QFS scheme in yr 7. The two values are potentially 0 and could better be excluded. Generally for FLC, yr 6 to 9 were not a good indication of genetic response because their parents were randomly generated or chosen before yr 6.
|
|
|
Selection Pressure on the QTL
Measuring frequency of QTL alleles over years of selection is a possibility that stochastic simulation offers. This section discusses the change in frequency of the favorable QTL allele for the four paths of selection. In the current study, the change in gene frequency is taken to express selection pressure on the favorable QTL allele. Gene frequency is expected to be higher for animals of the QAS scheme over animals of the QFS scheme. Gene frequency of animals of the QAS scheme minus gene frequency of animals of the QFS scheme will be henceforth referred to as gene frequency difference (GFD). GFD indicates the superiority of the QAS over the QFS scheme in pushing the favorable QTL allele towards fixation.
Males.
Table 8
shows the annual change in gene frequency as an average of all replicates for males (active sires and young bulls). Gene frequency increased from 0.127 to 0.825 in 10 yr of QAS and increased from 0.125 to 0.588 in 10 yr of QFS. Maximum values show that the favorable allele was even fixed in some replicates under QAS and reached close-to-fixation values under QFS. This indicates the short time horizon needed to fix a QTL of the size used in the current simulation, which, in turn, justifies designing breeding programs to benefit from QTL information only in the short run. The results also cast a great deal of doubt about the possibility of finding a QTL of size 0.39 phenotypic standard deviation at a low frequency of the favorable allele in similarly selected populations. Future artificial intervention and mutations are still possible as ways of introducing new genes to such selected populations.
|
|
|
First-lactation cows.
Similar to genetic superiority, differences in GFD between paths are the result of genetic lag and gene flow. Frequency average of the favorable QTL allele for FLC started at 0.1 for QAS and QFS in yr 6 and increased to 0.566 and 0.453 for QAS and QFS, respectively, in yr 16. In FLC, GFD was the least among other paths but continued to increase (Figure 1
).
Accuracy of Evaluation
This section uses the correlation coefficient between true and estimated BV of proven YB to express accuracy of evaluation under the two schemes studied. Figure 2
displays accuracy for all years for both QAS and QFS schemes. Proven YB of QAS were consistently superior to corresponding animals under the QFS scheme. The difference between QAS and QFS was averaged and found to be 0.035 for yr 6 to 16 and 0.049 for yr 11 to 16, i.e., the last cycle of selection. These results indicate that under QAS, YB in the second selection stage are more accurately evaluated. This means less risk involved with promoting young bulls to active sires. Notice, however, that the difference in accuracy was small.
|
|
|
| CONCLUSIONS |
|---|
|
|
|---|
For designing breeding programs that consider QTL information, attention must be paid to improvement of distinct pathways that benefit the most from using QTL information which eventually benefits the whole population. As we showed, the advantage of QAS increases significantly and decreases at different times in the future for the four pathways of selection. Future work to test breeding programs that shorten the 5-year waiting period of young bulls and optimum schemes that employ nucleus herds with multiple ovulation and embryo transfer (MOET) is in progress.
The simulation was stochastic and allowed for studying the frequency of the favorable QTL allele by directly computing the favorable allele every year from data for the different pathways of selection. QTL allele frequency reached close-to-fixation values in BD in a short time. Also, it increased significantly in other pathways of selection. Therefore, using information about QTL equal or greater than 0.39 phenotypic standard deviation units may be beneficial only in the short run. However, the high rate of increase of the frequency of the favorable QTL allele under schemes of QFS found in the current study casts a great deal of doubt about the possibility of finding QTL or marked QTL of the size presented here, 0.39 phenotypic standard deviation units, at low frequency. Gibson (1994) showed that the favorable QTL allele reached fixation under mass selection that gave better long-term response when genotype information was ignored. Future artificial intervention and mutations as ways of introducing new genes to selected populations at low frequency are still possible.
Received for publication September 17, 2001. Accepted for publication January 29, 2002.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
G. Abdel-Azim and S. Schnell Genetic Impacts of Using Female-Sorted Semen in Commercial and Nucleus Herds J Dairy Sci, March 1, 2007; 90(3): 1554 - 1563. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Tamassia, F. Nuttinck, P. May-Panloup, P. Reynier, Y. Heyman, G. Charpigny, M. Stojkovic, S. Hiendleder, J.-P. Renard, and S. Chastant-Maillard In Vitro Embryo Production Efficiency in Cattle and Its Association with Oocyte Adenosine Triphosphate Content, Quantity of Mitochondrial DNA, and Mitochondrial DNA Haplogroup Biol Reprod, August 1, 2004; 71(2): 697 - 704. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Abdel-Azim and A. E. Freeman Effects of Including a Quantitative Trait Locus in Selection Under Different Waiting Plans of Young Bulls J Dairy Sci, February 1, 2003; 86(2): 667 - 676. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |