|
|
||||||||


* Animal Improvement Programs Laboratory and
Bovine Functional Genomics Laboratory, Agricultural Research Service, USDA, Beltsville, MD 20705-2350
Animal and Dairy Science Department, University of Georgia, Athens 30602
Corresponding author: C. Van Tassell; e-mail: curtvt{at}aipl.arsusda.gov.
| ABSTRACT |
|---|
|
|
|---|
Key Words: calving ease dystocia threshold model genetic evaluation
Abbreviation key: AIPL = Animal Improvement Programs Laboratory, CE = calving ease, %DBH = PTA for percentage of births that are difficult for first-calf heifers, ID = identification, MGS = maternal grandsire, S-MGS = sire-maternal grandsire
| INTRODUCTION |
|---|
|
|
|---|
Calving ease is scored on a scale of 1 (no problem or unobserved) to 5 (extreme difficulty). Each unit increase in score does not represent the same increase in difficulty, so a threshold model based on these ordered categorical data is appropriate. A threshold sire model (Djemali et al., 1987) was introduced for the US genetic evaluation in 1988 (Berger, 1994). Concern about the antagonism between direct and maternal genetic effects on dystocia (Burfening et al., 1981; Thompson et al., 1981; Manfredi et al., 1991) led to interest in adding maternal effects to the evaluation model. Development of an evaluation based on a sire-maternal grandsire (S-MGS) model was desirable to increase the accuracy of service sire evaluations by partially accounting for differences in the merit of mates and to provide a calving ease evaluation that included maternal effects. This model improvement also provides a way for dairy producers and breeding companies to include maternal calving performance in breeding programs. The main objective of this research was to implement a S-MGS threshold model for the dystocia genetic evaluation and to assess the impact on evaluations of the change from a sire only model. Because only approximately half of the CE records submitted to AIPL include maternal grandsire (MGS) identification (ID), an additional challenge was to integrate the dystocia data with the production database to make the more complete production pedigree and calving records available for cross-referencing. The use of relational databases allowed for more stringent data editing and improved data access, and in some cases, the CE records provided new information for the production database.
Relationships are described relative to the calf born. Specifically, the dam is the cow observed for dystocia, the sire is the service sire for this calving, and the MGS is the sire of the dam. However, the recorded observation is described as the calving of the dam rather than as the birth of the calf.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
Two primary tables were created for storage of dystocia data. The primary CE table contains CE data for individual calving records (herd, sire, dam, CE score, parity, calving date, multiple birth code, and data source). Sire and dam ID in this table are stored as animal keys. A unique numeric animal key is assigned to every animal in the production database, allowing multiple ID (e.g., American ID and eartag) to be easily cross-referenced. A second table retains pedigree (i.e., MGS) information for records with dam ID that are not compatible with the production database. An example of an incompatible ID would be a cow name, such as "Suzie," rather than a registration or eartag number. Fields in this table include dam and MGS ID, herd where the observation was recorded, and dam birth date.
Data Processing
The CE records with a dam ID that is in the production database are stored with the associated animal key (thus linking them to the production pedigree table). If a dam ID is present but not found in the production pedigree table, then a pedigree input record is generated and submitted to the production edit system to add the animal ID and pedigree information. If that pedigree record is accepted, animal keys are assigned for the dam and possibly for the MGS. The MGS ID and birth date are updated if not present in the production pedigree, but existing data in the production database are not modified.
Records that fail production edits or do not include valid dam ID may not be unique. In this case, unique keys are assigned, and pedigree information is loaded into the CE pedigree table. These keys, which will be referred to as CE keys, are negative so that they are easily distinguishable from those in the production data. Thus, records in the primary CE data table contain either a production (positive) key, linking them to the production database, or a CE (negative) key, linking them to the supplemental CE pedigree table. This scheme was designed to facilitate storage of pedigree information for the implementation of an S-MGS model by allowing storage of MGS ID even if dam ID is missing or ambiguous. Each record with unknown dam is assigned a unique CE key. For ID that are not already present in the production database, additional pedigree information is specifically requested from Holstein Association USA (Bratteboro, VT) and Canadian Dairy Network (Guelph, ON).
A number of data integrity edits are also imposed. An appropriate value for CE is required. Some additional fields are also required. Records included are for calvings since 1980.
Detection of Duplicate Records
The data undergo preliminary editing to remove duplicates. Based on suggestions from the National Association of Animal Breeders, records were considered duplicates if they contained the same herd, sire, calving date, parity of dam, and sex of calf. For records that originated from a single data source, matching dam was also required. For records with production dam keys, a new and more rigorous definition was applied: records with the same dam and calving dates within 6 mo were considered duplicates. An additional class of duplicates was defined for records with CE keys. Records with dam ID are considered duplicates if they have the same herd, the same (nonzero) dam key, and calving dates within 6 mo. In all duplicate edits, if CE scores disagree, then both records are discarded.
The impact of the edits was evaluated by comparing data extracted for the August 2002 evaluation with the data for the same evaluation prepared using the sire model series of programs.
Genetic Evaluation Models
Sire model.
The sire threshold model previously used for the national genetic evaluation was:
![]() |
where yijklmno = CE score, hyi = random effect of herd-year i, Seasonj = fixed effect of season j, Sexk = fixed effect of sex k, Pl = fixed effect of parity l, Ym = fixed effect of sire birth-year m, smn = random effect of sire n in sire birth year group m, and eijklmno = random residual effect. The two seasons begin in May and October. Parity effects were assigned for parities 1 and
2. Fixed birth year groups were
1982, 1983 to 1984, 1985 to 1986, 1987, 1988, ..., 1997, 1998 to 2001, and a group for bulls with unknown birth year. Inverse of the relationships matrix was calculated using only sire and MGS information for bulls represented in the data (Henderson, 1975). The pedigree information was based on the genetic evaluation distribution file for production data from AIPL. Based on research by Djemali et al. (1987), variance components used were 0.1601, 0.0381, and 1.0000 for herd-year, sire, and residual, respectively. These correspond to heritabilities of 0.13 or 0.15 if the herd-year variance is included or excluded, respectively, from the total variance.
Dam and sire breeds were required to be either Holstein or Red and White for records included in the sire model evaluation.
Sire-MGS model.
The S-MGS threshold model developed in this project and used in national genetic evaluations beginning in August 2002 was:
![]() |
where yijklmnop = CE score, hyi = random effect of herd-year i, YSj = fixed effect of year-season j, PSk = fixed effect of parity-sex k, BSl = fixed effect of sire birth year l, BMm = fixed effect of MGS birth year m, sln = random effect of sire n in sire birth year group l, mmo = random effect of MGS m in MGS birth year group o, and eijklmnop = random residual effect. The year-season groups begin in May and October. Parity effects were assigned for parities 1, 2, and
3. Birth year groups for sires were <1982, 1982 to 1983, 1984 to 1985, 1986, 1987, ..., 1996, and >1996. Birth year groups for identified MGS were <1982, 1982 to 1983, 1984 to 1985, 1986, 1987, ...,1994, and >1994. The birth year groups will be updated annually. For records without valid MGS ID, separate MGS birth year groups were assigned based on dam birth year. When dam birth years were not recorded, then dam birth year was approximated by calving year - parity - 1. Then birth year groups for MGS were <1982, 1982 to 1983, 1984 to 1985, 1986, 1987, ..., 1997, and >1997. Fixed birth year groups were used to allow for different group specifications for bulls as sires and MGS. The birth year groups for MGS with and without valid ID were separated to allow for different rates of genetic trend and because the grouping assignments based on MGS and dam birth years are not directly comparable. The (co)variance components used were those estimated by Wiggans et al. (2003): herd-year, 0.434; sire, 0.022; S-MGS, 0.009; MGS, 0.016; and residual, 1.000. These components correspond to direct heritabilities of 0.059 or 0.084 and maternal heritabilities of 0.034 or 0.048 if the herd-year variance is included or excluded, respectively, from the total variance. The estimated genetic correlation of direct and maternal components was -0.12. The estimated correlation of sire and MGS effects was 0.48. This positive association includes the genetic antagonism between direct and maternal effects but also the greater positive correlation (0.59) associated with the shared direct components (a portion of the direct or service sire effect of the MGS is included in the MGS effect).
For both models, the first threshold was constrained to be zero. This is done to ensure identifiability, because the underlying CE scale is arbitrary (Gianola and Foulley, 1983; Harville and Mee, 1984). Models parameterized with residual variance of one and first threshold of zero are often referred to as standardized threshold models.
Herd-years are considered random to avoid the extreme category problems caused when all values for a fixed effect subclass fall in the same category (Harville and Mee, 1984; Misztal et al., 1989). Of the 61,784 herd-years represented in the data, 5582 (9%) included only one calving difficulty score, and 5480 (>98%) of these included only scores of 1. This is not surprising because 76% of records included a CE score of 1, and many herd-year groups are small. The most common herd-year size (17,945 cases) was a single calving, whereas the herd-year size with the most records (45) included 83,925 (0.79%) cases. Over half of the observations occurred in herd-year of <100 records. Tables 1
and 2
provide overall distribution of CE scores and distribution of scores by parity.
|
|
An additional data restriction imposed for the S-MGS model was the exclusion of records for which the sire was over 15 yr old when the calf was born. This edit was added to exclude data from analysis in cases where the sire was likely incorrect because of typographical mistakes.
Reporting of Genetic Values
Genetic merit for CE is reported as PTA for percentage of births that are difficult for first-calf heifers (%DBH), where difficult births are those scored
4. In both the sire and S-MGS models, a fixed genetic base was implemented as a means to stabilize PTA over time.
For the sire model analysis, bulls born before 1977 were set as the base and forced to average zero on the underlying scale. The sire solution was then calculated as the sum of the solutions for the first parity and sire, with the mean of the solutions for sex of calf and season effects, all on the underlying scale. This sire solution was then converted to %DBH by calculating the probability that a random normal variable with this value as a mean and unit variance exceeds the threshold between categories of CE score 3 and 4 (Berger, 1994; Berger, 2002, personal communication).
For the S-MGS model analysis, both a sire effect (service sire CE) and a MGS effect (daughter CE) are calculated. The MGS effect is reported as a combination of direct and maternal contributions (1/4 direct effect and 1/2 maternal effect), because it is expressed only in that ratio in daughters and future offspring. For this S-MGS analysis, the fixed base was defined on the observed scale. Currently, the base is defined by bulls born in 1995 for sire effects and bulls born in 1990 for MGS effects. Both sire and MGS solutions are adjusted on the underlying scale so that the mean of those bulls on the observed scale is approximately equal to the mean observed %DBH in the appropriate offspring. Therefore, the mean service sire %DBH for bulls born in 1995 is approximately equal to the observed frequency of difficult births from first-parity cows in calves sired by those same bulls, and the mean daughter %DBH for bulls born in 1990 is approximately equal to the observed frequency of difficult births from first-parity cows that are daughters of those bulls. Because these observed frequencies are quite volatile from year to year, using a multiple-year smoothed mean (rather than a single birth year) may be preferable in the future for defining the base.
Specifically, let T3 be the threshold separating difficulty scores 3 and 4 on the observed scale,
be the solution on the underlying scale with fixed sire birth year group added to the sire solution. Then
![]() |
Setting the base for the animals represented in the appropriate group (indictated by *),
![]() |
Then,
![]() |
where F = standard normal cumulative distribution function and Z = standard normal deviate. Next, a constant (c) is added to the underlying scale to achieve the desired base:
![]() |
Then,
![]() |
Finally, substituting to obtain the final equation used to compute %DBH,
![]() |
This prediction equation, unlike most, is not a direct function of the thresholds or any fixed effect solutions. Those effects, rather, are inferred by the selection of the population used to determine the value of %DBH*.
Reliabilities were calculated using only diagonal information. This simplification results in overestimated reliabilities because it effectively assumes a reliability of one for parents. Specifically,
![]() |
where, reli,t is the reliability for bull i and trait t (i.e., sire or MGS effect), diagi,t is the diagonal element from the threshold model equations, and
t is the genetic standard deviation. This simplification ignores sire relationships and the influence of the distribution of sires within fixed and random effects. The simplification is expected to be a reasonable approximation of the true reliabilities because of the low heritabilities and the use of an S-MGS, not animal model. Improvements in this approximation are an area of current investigation.
The data editing systems and evaluations associated with the S-MGS model were compared with those for the previously used sire model.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
MGS ID
A major effort was made to improve MGS ID rate. By integrating the CE data with the production pedigree information, 1,030,408 additional MGS were identified, increasing the rate of MGS ID from 57.6 to 73.1%. Conservation of available CE data was also a consideration in database design. The creation of the CE pedigree table to store pedigree records incompatible with the production data edits (e.g., dam ID of zero or a name) preserved 2,532,527 additional pedigree records. Only 10.7% of those records contained MGS ID. Of the nearly 72% of the records with production dam keys, nearly 98% had MGS ID recorded, slightly higher than the 95.3% of production pedigrees from a comparable time period that contain sire identity. Because only 72% of the records had production dam keys, nearly 30% of the dams that were being observed for dystocia were not uniquely identified over time. However, the percentage of dams without unique ID has decreased to around 20 in recent years.
Data Characteristics
Percentage of male calves was 51.5 in records for both the sire and S-MGS models. Distributions of records across parities were also very similar for the two datasets, with 25.9 (25.6), 29.3 (29.4), and 44.8% (45.0%) first, second and third or later parities for sire model (S-MGS model) data.
Distributions of dystocia scores are shown in Table 1
. Changes in data processing did not appreciably alter the distribution of scores in the total dataset. Van Tassell and Sattler (2000) demonstrated considerable variation in these distributions between herds or when evaluated on a herd-year basis. That study identified herds in which the distribution of CE scores differed substantially from the distribution of scores in the population at large. Based on these findings, the data extraction system was designed to include a within-herd CE score distribution criteria, enabling the rejection of herds with an excessive frequency of extreme scores. At this time, however, data for all herds are included. The impact of restricting data based on herd distribution is an area for additional study. Additionally, a minimum herd size could be imposed, and MGS ID could be required.
As has been observed in previous studies (e.g., Berger, 1994), more difficult calving is observed for first parity than for later parities. Distribution of dystocia scores by parity for the S-MGS model data is shown in Table 2
. Strong evidence exists for differences in calving ease between first and later parities, whereas the difference between second and later parities is relatively small.
Genetic Evaluations
Distributions of service sire %DBH are shown in Figure 2
for the sire and S-MGS models. Differences in the distribution of the predicted genetic merit for this trait were relatively minor. Correlation between evaluations for the 33,626 bulls included in both evaluations was correspondingly high: 0.85 for the underlying evaluations and 0.82 for %DBH. Although these correlations indicate that the service sire evaluations are similar, significant changes are to be expected with correlations substantially less than one. Reliabilities from the sire and S-MGS models were quite similar as well, with a correlation of 0.81. The small difference in the mean %DBH for the two evaluations (8.58 for the sire model and 8.27 for the S-MGS model) is likely attributable to differences in the definition of the base. The distribution of difference in service sire %DBH for bulls analyzed with both models is shown in Figure 3
. Reassuringly, almost no directional bias exists, as the changes are nearly symmetrically distributed around the mean difference. Although attributing the differences between the two evaluation systems specifically to differences in data or model is impossible, a reasonable expectation that the largest changes resulted from model differences, because the vast majority of the data is common for the two systems.
|
|
|
|
11, and seven of the top 10 widely used bulls had service sire %DBH of
9. The wide use of these bulls in the cow population likely influenced the genetic trend in %DBH through bull dams in years since 1994, and in 1995 and later, the weighted mean of MGS service sire %DBH has increased. The mean daughter %DBH calculated the same way shows a flat to even slightly decreasing trend over the same period.
The comparison of distributions of service sire %DBH and daughter %DBH is shown in Figure 5
. For both traits, a %DBH of 8 is the most common. However, the distribution of daughter %DBH is more compact than the distribution of service sire %DBH, which is expected because of its smaller genetic variance. The correlation between solutions on the underlying scale for service sire and daughter effects was 56%, slightly higher than the genetic correlation of 48% used in the evaluation system. On the %DBH scale, the correlation dropped slightly to 53%. This drop could be due to the nonlinearity of the transformation from the underlying scale to the observed scale or to the granularity of the observed scale, as only whole percentages are reported for %DBH.
|
|
| CONCLUSIONS |
|---|
|
|
|---|
Received for publication December 30, 2002. Accepted for publication April 16, 2003.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
H. D. Norman, J. R. Wright, M. T. Kuhn, S. M. Hubbard, J. B. Cole, and P. M. VanRaden Genetic and environmental factors that affect gestation length in dairy cattle J Dairy Sci, May 1, 2009; 92(5): 2259 - 2269. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. D. R. N. Appuhamy, B. G. Cassell, and J. B. Cole Phenotypic and genetic relationships of common health disorders with milk and fat yield persistencies from producer-recorded health data and test-day yields J Dairy Sci, April 1, 2009; 92(4): 1785 - 1795. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. R. Wiggans, J. B. Cole, and L. L. M. Thornton Multiparity Evaluation of Calving Ease and Stillbirth with Separate Genetic Effects by Parity J Dairy Sci, August 1, 2008; 91(8): 3173 - 3178. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Heringstad, Y. M. Chang, M. Svendsen, and D. Gianola Genetic Analysis of Calving Difficulty and Stillbirth in Norwegian Red Cows J Dairy Sci, July 1, 2007; 90(7): 3500 - 3507. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. B. Cole, G. R. Wiggans, and P. M. VanRaden Genetic Evaluation of Stillbirth in United States Holsteins Using a Sire-Maternal Grandsire Threshold Model J Dairy Sci, May 1, 2007; 90(5): 2480 - 2488. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. B. Cole, G. R. Wiggans, P. M. VanRaden, and R. H. Miller Stillbirth (Co)Variance Components for a Sire-Maternal Grandsire Threshold Model and Development of a Calving Ability Index for Sire Selection J Dairy Sci, May 1, 2007; 90(5): 2489 - 2496. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. L. de Maturana, E. Ugarte, J. Komen, and J. A. M. van Arendonk Consequences of Selection for Yield Traits on Calving Ease Performance J Dairy Sci, May 1, 2007; 90(5): 2497 - 2505. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. L. de Maturana, A. Legarra, L. Varona, and E. Ugarte Analysis of Fertility and Dystocia in Holsteins Using Recursive Models to Handle Censored and Categorical Data J Dairy Sci, April 1, 2007; 90(4): 2012 - 2024. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. D. Norman, J. L. Hutchison, J. R. Wright, M. T. Kuhn, and T. J. Lawlor Selection on Yield and Fitness Traits When Culling Holsteins During the First Three Lactations J Dairy Sci, February 1, 2007; 90(2): 1008 - 1020. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Fetrow, K. V. Nordlund, and H. D. Norman Invited review: Culling: nomenclature, definitions, and recommendations. J Dairy Sci, June 1, 2006; 89(6): 1896 - 1905. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. G. Vitezica, C. R. Moreno, L. Bodin, D. Francois, F. Barillet, J. C. Brunel, and J. M. Elsen No associations between PrP genotypes and reproduction traits in INRA 401 sheep J Anim Sci, June 1, 2006; 84(6): 1317 - 1322. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. L. Powell and H. D. Norman Major advances in genetic evaluation techniques. J Dairy Sci, April 1, 2006; 89(4): 1337 - 1348. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. B. Cole, R. C. Goodling Jr., G. R. Wiggans, and P. M. VanRaden Genetic Evaluation of Calving Ease for Brown Swiss and Jersey Bulls from Purebred and Crossbred Calvings J Dairy Sci, April 1, 2005; 88(4): 1529 - 1539. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. M. VanRaden Invited Review: Selection on Net Merit to Improve Lifetime Profit J Dairy Sci, October 1, 2004; 87(10): 3125 - 3131. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. M. VanRaden, A. H. Sanders, M. E. Tooker, R. H. Miller, H. D. Norman, M. T. Kuhn, and G. R. Wiggans Development of a National Genetic Evaluation for Cow Fertility J Dairy Sci, July 1, 2004; 87(7): 2285 - 2292. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |