|
|
||||||||
Animal Improvement Programs Laboratory, Agricultural Research Service, USDA, Beltsville, MD 20705-2350
Corresponding author: Melvin Kuhn; e-mail: mkuhn{at}aipl.arsusda.gov.
| ABSTRACT |
|---|
|
|
|---|
0.91 by d 130, and phenotypic means of predicted days open were in agreement with means for final days open, indicating that the predictions were phenotypically unbiased. Comparison of mean PTA based on actual and predicted days open indicated no bias in PTA, and correlations between PTA were
0.92 by d 130. The earlier use of data increased reliability by about 5% for sires between the ages of 4 and 5 yr. The USDA began using predicted days open for records that are at least 130 DIM in national evaluations starting November, 2003.
Key Words: fertility genetic evaluation prediction
Abbreviation key: DO = days open, DPR = daughter pregnancy rate, DO_LB = days open based on last breeding
| INTRODUCTION |
|---|
|
|
|---|
Prediction of DO requires knowledge of factors affecting DO. Numerous studies have found effects of both parity and age on DO (Seykora and McDaniel, 1983; Taylor et al., 1985; Marti and Funk, 1994; Dematawewa and Berger, 1998). Virtually all research on DO has indicated differences in herds and/or regions, years, and seasons (Seykora and McDaniel, 1983; Taylor et al., 1985; Marti and Funk, 1994; Dematawewa and Berger, 1998). The effect of milk yield on DO has also been well established, although estimates of the magnitude of the association have varied. Most estimates of genetic correlation between milk yield and DO have been around 0.3 to 0.35 (Hansen et al., 1983; Seykora and McDaniel, 1983; VanRaden et al., 2004), although Dematawewa and Berger (1998) reported correlations around 0.55. Calving ease is another factor affecting DO, with more difficult calving resulting in more DO (Dematawewa and Berger, 1997). Days open is a function of days to first breeding, number of services to conception, and intervals between services. Raheja et al. (1989) reported genetic correlations between days to first breeding and DO in the range of 0.28 to 0.12 for lactations 1 through 3. Estimates from Hansen et al. (1983), however, were considerably higher, ranging from 0.8 to 1.0. Heritability estimates for DO have generally been in the range of 0.03 to 0.05 (Seykora and McDaniel, 1983; Marti and Funk, 1994; Dematawewa and Berger, 1998; VanRaden et al., 2004).
| MATERIALS AND METHODS |
|---|
|
|
|---|
age at calving
12 yr), about 3.5 million records were available, overall, for analysis. Days open were calculated as differences between breeding dates and calving dates. To be consistent with the USDAs definition of final or actual DO (VanRaden et al., 2004), DO greater than 250 were set to 250 d and those lower than 50 were set to 50 d.
Prediction Models
The two basic prediction equations considered were
![]() | ([1]) |
![]() | ([2]) |
where
is the predicted DO, CE is calving ease score (scores of 1 through 5), age is age at calving in years (e.g., 2.5 yr), DO_LB is DO based on last breeding, milk is the average of first 3 test-day milk yields, and ß1 through ß5 are regression coefficients for corresponding effects. The prediction models were fit separately for each of 9 DIM groups: 70, 90, ..., 230. Nesting within DIM was done to allow for determination of minimum DIM required before using records in progress and to provide for more accurate predictions, especially for cows not yet bred. Separate solutions for different DIM groups allows higher predicted DO for cows later in lactation and known to be not yet bred. Separate sets of solutions were also computed for cows that did not have calving ease scores and cows that did not have a breeding. There were a total 9 (DIM groups) x2 (with/without calving ease scores) x2 (with/without a breeding) = 36 sets of solutions for each prediction model.
Inclusion of DO_LB in the model reflects the uncertainty in pregnancy status associated with a last reported breeding. If it were known with certainty that the last breeding resulted in a pregnancy, then the record would be excluded from prediction.
Inclusion of milk yield in the predictor was tested as an option because of the known phenotypic and genetic correlation of milk yield with DO (VanRaden et al., 2004). However, with milk yield in the predictor, early lactation predictions could be affected more by milk yield than by fertility differences, which could result in a genetic correlation between predicted DO and milk yield that was higher than the genetic correlation between actual DO and milk yield. This, in turn, could reduce early DPR PTA for high-milk sires more than is appropriate. Genetic correlations between predicted DO and milk yield were estimated for both models to assess this concern.
Calculation of DO_LB.
The DO_LB was calculated for each DIM group as the difference between breeding date and calving date, using the last breeding prior to the cutoff point for the given DIM group. Thus, if a cow had breedings at 85, 115, and 135 d, her DO_LB was 85 for the first 2 DIM groups (70, 90), 115 for the third DIM group (110), and 135 for each DIM group thereafter. If a cow did not have a breeding by the upper end of the DIM group, then for estimation of effects in [1] and [2] she contributed to the "without breeding" subset rather than the "with breeding" subset of solutions for both models.
Other alternatives.
Numerous preliminary models were tested prior to the final selection of [1] and [2]. One preliminary model used days to first breeding instead of DO_LB, but use of DO_LB provided a higher correlation with final DO, so it was included in the final models. To improve phenotypic correlations, attempts were also made to include environmental effects such as herd, year, and season, but these all resulted in predictors with severe phenotypic bias and were thus excluded. For second and later lactations, consideration was also given to using an animals previous DO and previous number of services to conception. This increased both the phenotypic and genetic correlations with final DO, without the consequence of creating a difference between mean predicted DO and final DO (i.e., phenotypic bias). However, previous DO and previous number of services were not included in the final predictor because of concern about correlated errors between records in genetic evaluation.
Finally, for cows that had at least one breeding, an attempt was made to utilize number of days since last breeding. If 2 cows are both at 150 DIM, for example, and one was last bred at 80 d and the other at 149 d, it was conceivable that the regression on DO_LB could dominate the predictor more so for the cow that had gone over for 70 d than for the cow that had gone over for only 1 d. Thus, records were classified according to both DIM and days since last breeding, and the prediction model fit separately for each subclass. Use of days since last breeding, however, did not improve accuracy of prediction, primarily because there was little variation in DO_LB within subclass, when records were classified according to both DIM and days since last breeding. Days since last breeding added little information beyond DIM and DO_LB, so days since last breeding was not used as part of the final predictor.
Evaluation of Predictors
Phenotypic bias, phenotypic and genetic correlations, and PTA based on final versus predicted DO were used as criteria to evaluate the predictors. Means of predicted and actual DO were used to evaluate phenotypic bias within each DIM group.
The primary criteria for choice of model and DIM to begin predictions were genetic, rather than phenotypic, correlations because the objective was to develop a prediction of DO suitable for genetic evaluation. Inclusion of environmental factors such as season or region-season was not given priority consideration because, while they would increase the phenotypic correlations between predicted and final DO, they would have no impact on genetic correlations. The predictor could put them in only to have the genetic evaluation model remove them. Furthermore, season effects vary by year, and it would not be appropriate to estimate year-season effects from the incomplete data (records in progress) available for genetic evaluation because that could introduce the very bias that use of predicted records is intended to avoid. Thus, extensive attempts to include environmental effects such as season were considered a nonproductive complication and were not extensively pursued.
Phenotypic and genetic correlations.
Phenotypic and genetic correlations were estimated for each DIM group. A multiple-trait sire model was used for REML estimation of genetic correlations. The model included, in addition to the random effect of sire, only the fixed effect of herd-year-season-parity of calving. Eleven traits were included: final DO, 9 predicted DO corresponding to the 9 DIM groups, and milk yield, which was the average of the first 3 test days.
Each lactation for each cow included in the analysis had a record for each trait. Although a cow-lactation could contribute to only one set of solutions (e.g., with or without a breeding) for estimation of effects in [1] and [2], each cow-lactation had a predicted DO for each DIM group. If the cow did not have a breeding by the end of the upper limit for the DIM group, she was simply predicted using the "without a breeding" set of solutions for [1] or [2]. Approximately 670,000 records were used for estimation of correlations and calculation of phenotypic means for evaluating bias. Calving years for data included in estimation of correlations were 1995 through 1998, inclusive. In contrast, only years 1995 through 1997 were used to estimate the effects in the prediction models. Therefore, there was one additional year of data used for prediction that was not used in estimation of effects in the prediction models. A data set completely independent of that used for estimation of effects would have been desirable, but that alternative was not practical given a limited amount of data.
Comparison of PTA based on actual and predicted DO.
Two data sets, PTAy and PTA
, were formed to evaluate PTA computed using predicted DO. The 2 data sets contained the same cow-lactations, roughly 374,000 records that included calvings from 1995 through 1998. For computation of PTA, cows were required to have first lactation. In national DPR evaluations, most records would be completed, while projected DO would be mainly for current records in progress. To make the comparison between PTA based on actual vs. predicted DO as realistic as possible, PTA
contained predicted records only for the last year of data included (1998), while all previous years in PTA
(1995 through 1997) contained actual DO. There were about 127,000 predicted records in PTA
, which was roughly 34% of all records. The 1998 records accounted for more than 25% of the data, expected given 4 yr of data, partly because there were more records in the later years and partly because edits for this comparison, in particular the one requiring cows to have a first lactation, removed more cow-records from the earlier years than from the later years.
In contrast to PTA
, PTAy contained only actual DO for all years. Days open breeding values for this comparison were computed from an animal model that included the fixed effect of herd-year-season-parity and the random effects of animal and permanent environment. Criteria to compare the 2 sets of PTA were mean differences and correlations, computed separately for cows and sires. This PTA comparison was done only for DIM groups 110, 130, and 150.
Expansion and Weighting Factors
The USDA routinely uses projected records in the single-trait national genetic evaluations for production traits, SCS, and productive life (VanRaden et al., 1991; VanRaden and Klaaskate, 1993). Expansion and weighting factors are applied because variances for predicted and actual records are generally not equal (VanRaden et al., 1991).
"Expansion factors" equalize genetic variances for predicted and actual records (VanRaden et al., 1991; VanRaden and Klaaskate, 1993). These expansion factors are computed simply as the ratio of additive genetic standard deviation for actual records to additive genetic standard deviation for predicted records. Genetic variances are equalized, then, by multiplying deviations of predicted records from their management group mean by the appropriate expansion factor. Expansion factors for predicted DO records were estimated for each of the 9 DIM groups.
"Weighting factors" refer to the diagonal elements of R1, the inverse of the error variance-covariance matrix for genetic evaluation. These are computed as the ratio of error variance for actual records to error variance for predicted records. Although USDA utilizes a diagonal error variance-covariance matrix for national evaluations, the diagonal elements are not required to be equal. Thus, in contrast to the expansion factors which are used as a prior adjustment, the weighting factors are used directly in the mixed-model equations. After expansion, predicted records generally have larger error variances than actual records and get less weight than actual records, which is intuitively appealing. Weighting factors were computed for each of the 9 DIM groups.
Effects on DPR Evaluations
Although collection of more complete breeding information has been initiated by the USDA in the form of the new "format 5" [Format 5 will include information on each breeding: service sire, service number, date of service, type of mating (AI or natural service). Additionally, observed heats, synchronizations, embryo transfers, results of pregnancy exams and "do not breed designations" can also be reported.] record (VanRaden et al., 2004), the historical USDA database, which is what the national DPR evaluations are based on, contains only final DO as opposed to information on each individual breeding. For records in progress, "final" DO, in the USDA database, is DO based on last breeding. Thus, implementation cannot be tested, for example, by using actual vs. predicted DO, since only final DO is known. Nonetheless, properties of evaluations using predicted DO can still be compared to evaluations using only final DO.
The DPR evaluations using predicted DO were computed using data available at the time of the August, 2003 run. Mean sire PTA and reliabilities by birth year were compared to those for the August, 2003 run which used only final DO. Only sires designated with sampling status S (S = semen distributed to at least 40 herds with records qualifying for use in USDA genetic evaluations.) by the National Association of Animal Breeders were included in this comparison.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
|
|
|
|
|
) than for actual DO (
), in which case the ratio of error variances (
) is necessarily greater than one. In turn, the higher heritabilities for predictions in the earlier stages of lactation were likely reflecting the fact that the heritability of days to first breeding is higher than the heritability of final DO. VanRaden et al. (2004) estimated heritability for days to first breeding at 0.07, which was consistent with the 0.06 estimate of Weigel and Rekaya (2000). In contrast, final DO has a heritability of about 0.04 (VanRaden et al., 2004). The DO_LB would be more akin to days to first breeding in the early predictions than in the predictions later in lactation.
|
|
0.90, and overall correlation of sire PTA based on predicted versus final DO was
0.94. Starting at d 130 also partially circumvents the problem of weighting incomplete records more heavily than complete records, which is counter-intuitive. Weights will be slightly modified from those in Table 6
150,
170,
190,
210,
230, and
249, respectively. Records that are complete (DIM
250) but unconfirmed by a subsequent calving will be given a weight of 0.96. These values are a compromise between the values in Table 6
Records will be projected using equation [1]
, without milk yield in the predictor. Genetic correlations of predicted DO with milk yield were more consistent with the genetic correlation between final DO and milk yield without milk in the predictor than with milk in the predictor. Furthermore, this allows more readily for implementation of approximate multiple trait procedures (VanRaden, 2000), which could combine animals milk yield and DPR evaluations instead of data.
A record must meet several criteria to be predicted: 1) it must be the cows last record on file; i.e., either a terminal lactation or a record in progress, 2) 130
DIM
249 and 3) calving must have occurred on or after January 1, 2000. Records satisfying these criteria can still be excluded from prediction if 1) the herd does not report breedings or 2) the cow was culled for reproductive failure, in which case she is assigned the upper limit of 250 d for that lactation, regardless of DIM, or 3) the cow left the herd without a breeding. Only a cows last available record will be considered for projection, since DO records for lactations with a subsequent calving can be confirmed by calving interval. Herds that do not report breedings are excluded from prediction because the predictor was not developed to handle such herds. Implementation of the approximate multiple-trait procedure would have the added benefit of bringing these animals into the evaluations as well. Consideration of only calvings on or after January 1, 2000, was introduced as a means to simplify identification of herds not reporting breedings. The effect of this starting point on either PTA or reliabilities will be minimal, since most predicted records will be records in progress.
Changes in mean Holstein sire PTA, reliability, and daughter and herd number, by birth year, are given in Table 8
. The predicted DO had little effect on mean sire PTA, which is ideal. An effect on mean PTA would indicate a bias in either the previous or current PTA. Mean reliability increased 4.9% for sires born in 1998 and 4.6% for sires born in 1999, with smaller changes for birth years earlier than 1998. The biggest effect was expected for 1998, since most bulls get first-crop daughters around 5 yr of age. Use of predicted DO made more daughters available earlier for the young bulls, which was the objective. Average number of daughters increased by 20 for bulls born in 1998 and, on average, there were 14 more herds per bull when predicted records were used. For the Holsteins in the August 2003 test run, 92% of predicted records had a breeding. Ten percent of candidate records for prediction were excluded because of herds not reporting breedings.
|
| CONCLUSIONS |
|---|
|
|
|---|
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
Received for publication November 18, 2003. Accepted for publication January 20, 2004.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
R. L. Powell and H. D. Norman Major advances in genetic evaluation techniques. J Dairy Sci, April 1, 2006; 89(4): 1337 - 1348. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Gonzalez-Recio and R. Alenda Genetic Parameters for Female Fertility Traits and a Fertility Index in Spanish Dairy Cattle J Dairy Sci, September 1, 2005; 88(9): 3282 - 3289. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. R. Wiggans and R. C. Goodling Jr. Accounting for Pregnancy Diagnosis in Predicting Days Open J Dairy Sci, May 1, 2005; 88(5): 1873 - 1877. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. M. VanRaden, A. H. Sanders, M. E. Tooker, R. H. Miller, H. D. Norman, M. T. Kuhn, and G. R. Wiggans Development of a National Genetic Evaluation for Cow Fertility J Dairy Sci, July 1, 2004; 87(7): 2285 - 2292. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |