JDS
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Liu, Z.
Right arrow Articles by Reents, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Liu, Z.
Right arrow Articles by Reents, R.
J. Dairy Sci. 87:1896-1907
© American Dairy Science Association, 2004.

Derivation and Calculation of Approximate Reliabilities and Daughter Yield-Deviations of a Random Regression Test-Day Model for Genetic Evaluation of Dairy Cattle

Z. Liu, F. Reinhardt, A. Bünger and R. Reents

VIT, Heideweg 1, D-27283 Verden, Germany

Corresponding author: Z. Liu; e-mail: Zengting.Liu{at}vit.de.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 APPENDIX 1
 APPENDIX 2
 REFERENCES
 
Test-day milk, fat, protein yield, and somatic cell score (SCS) were analyzed separately using data from the first 3 lactations and a random regression model. Data used in the model were from Austria, Germany, and Luxembourg and from Holstein, Red, and Jersey dairy cattle. For reliability approximation, a multiple-trait effective daughter contribution (MTEDC) method was developed under general multiple trait models, including random regression test-day models, by extending the single-trait daughter equivalents concept. The MTEDC was applied to the very large dairy population, with about 15.5 million animals. The calculation of reliabilities required less computer memory than the corresponding iteration program and a significantly lower computing time equivalent to 24 rounds of iteration. A formula for daughter-yield deviations was derived for bulls under multiple-trait models. Reliability associated with daughter-yield deviations was approximated using the MTEDC method. Both the daughter-yield deviation formula and associated reliability method were verified in a simulation study using the random regression test-day model. Correlations of lactation daughter-yield deviations with estimated breeding values calculated from a routine genetic evaluation were 0.996 for all bulls and 0.95 for young bulls having only daughters with short lactations.

Key Words: daughter yield deviation • effective daughter contribution • random regression test-day model • reliability

Abbreviation key: DE = daughter equivalents, DYD = daughter yield deviations, EDC = effective daughter contribution, ENP = effective number of progeny, MTEDC = multiple trait effective daughter contribution, RRTDM = random regression test-day model


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 APPENDIX 1
 APPENDIX 2
 REFERENCES
 
Genetic evaluation of dairy cattle using test-day yields or SCS, instead of 305-d lactation records, has become a common practice in many countries (Reents et al., 1995; Strandén and Lidauer, 1999; Schaeffer et al., 2000; Haile-Mariam et al., 2001; De Roos et al., 2002; Wiggans et al., 2002). Since May 2003, a random regression test-day model (RRTDM) has been implemented for joint production-trait evaluations for Holstein, Red, and Jersey cattle from Austria, Germany, and Luxembourg (Liu et al., 2001). This evaluation includes over 10 million cows with test-day records and over 250 million equations for each trait. For the purpose of approximating reliabilities of EBV in large-scale national genetic evaluations, the daughter-equivalents (DE) concept (VanRaden and Wiggans, 1991) was shown to be accurate and efficient. In recent years, several new reliability methods were developed, e.g., information source method (Harris and Johnson, 1998) and effective number of progeny (ENP) method (Koots et al., 1997; Jamrozik et al., 2000). Reliability approximation was also extended to multiple-trait models (Graser and Tier, 1997; Strabel et al., 2001; VanRaden, 2001). Fikse and Banos (2001) applied the reliability method of Harris and Johnson (1998) in a study on alternative weighting factors for international bull evaluation, which resulted in the development of the effective daughter contribution (EDC) method for computing reliability associated with daughter information of bulls. All of the methods are closely related to the DE method. Although EDC and DE have been widely used for single-trait models, their extension to multiple-trait models, including RRTDM, is limited. The multitrait extensions have typically combined information from multiple traits at an early stage, which may lead to loss of important information.

Daughter yield deviations (DYD) of a bull are, by definition, the average of the daughters’ performance adjusted for fixed and nongenetic random effects of the daughters and genetic effects of the bull’s mates. Daughter yield deviations are not regressed and are a more independent measure of phenotypic performance of a bull’s daughters than EBV (VanRaden and Wiggans, 1991). Thanks to this property, DYD are widely used in genetic analyses, such as in mapping quantitative trait loci (Szyda et al., 2002), validating national genetic trends (Boichard et al., 1995) and a 2-step multiple-trait genetic evaluation (Ducrocq et al., 2001). For repeatability animal models applied to lactation records, VanRaden and Wiggans (1991) showed the calculation of DYD. Mrode and Swanson (2002) extended the method to an RRTDM; however, computation of reliability associated with DYD, estimable lactation of DYD, and short lactation problems were not addressed.

The objectives of this study were (1) to extend the single-trait DE concept to multiple-trait models for approximating the reliability of EBV, and (2) to develop a method for calculating DYD under general multiple-trait models and to verify the DYD method.


    MATERIALS AND METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 APPENDIX 1
 APPENDIX 2
 REFERENCES
 
Theory of the Multiple-Trait EDC Method
The general idea behind DE, EDC, or ENP methods is to express the amount of information contributed by different sources for an animal in the number of standard daughters (progeny) that would give equal reliability by treating the animal as a standard sire. A standard sire is defined as an animal without its own performance records and having both parents missing, and only his daughters (progeny) contribute to the estimation of his breeding values. Further, it is assumed that the whole population is comprised of a single half-sib family, and all daughters of the sire have their dams missing. Under the above assumptions, EBV or their reliabilities of the standard sire under a sire model and an animal model are equal. A multiple-trait animal model in a general form


([1])

can be expressed as a multiple-trait sire model for EBV of the sire


([2])

where y is a vector of records of a daughter; X, Zp, and Z are design matrices for all fixed effects (b); all nongenetic random effects (p); and additive genetic effects (a), respectively. Variance and covariance matrices are G0, P0, and R0 for the genetic, nongenetic random, and error effects, respectively. It is assumed that these random effects are not correlated with each other. Design matrix Zs is for genetic effects of the sire. (Co)variance matrices for the sire effects (as) and residual effect (es) of the sire model [2] are var(as) = Gs = 1/4G0 and var(es) = 3/4G0 + R0. For estimating the sire’s breeding values, all fixed and nongenetic random effects of the daughters are absorbed into their genetic effects that are further absorbed into the genetic effects of the sire. The absorptions result in mixed model equations


([3])

where {Psi} and {Delta} are the part of the left-hand side and right-hand side of the equations after the absorptions, respectively. Let , then a matrix R can be defined as


([4])

The matrix R can be converted back to an EDC matrix ({Psi}) with


([5])

The matrix R will be referred to as a reliability matrix. Although it does not contain reliabilities, it can be used to derive reliabilities for all traits, and can be viewed as a multivariate generalization of the scalar reliability (R2) of single-trait models. Matrix R is equal to R2 in a single-trait case, as shown from Formula [4]Go:


([6])

where /, and and are sire and residual variances of a single-trait sire model, respectively. Similarly, the matrix {Psi} provides the equivalent to the effective number of daughters (n) of single-trait models


([7])

which can be obtained by simplifying formula [5]Go to


([8])

The reliability and EDC matrices constitute the core of the multiple trait effective daughter contribution (MTEDC) method for approximating reliabilities under general multiple-trait models. The reliability matrix R is asymmetric, and it usually has nonzero off-diagonal elements. R has a row of zeros corresponding to a missing trait. In contrast to R, the EDC matrix {Psi} is symmetric, and it normally has nonzero off-diagonal elements. For a missing trait, {Psi} has a row and a column of zeros. The EDC matrices from different information sources of the same animal can be summed up together. It can be proved using formulas [4]Go and [5]Go that a symmetric EDC matrix is always guaranteed by converting a nonsymmetric reliability matrix. Additionally, the inverse of matrix IR in formula [5]Go, which equals C–1Gs, exists under all conditions. The multitrait reliability matrix R does not contain final reliability values of EBV, unless the traits are not correlated. By defining appropriate vectors vi and vj, final reliability or coreliability of any linear function of EBV can be calculated with


([9])

Approximating reliabilities for an RRTDM.
By extending the daughter-equivalents method (VanRaden and Wiggans, 1991) of single-trait models, reliability approximation using the MTEDC was developed under general multiple-trait models. The procedure of reliability approximation using MTEDC is illustrated below with an RRTDM (Liu et al., 2001):


([10])

where h represents fixed effects of herd test-date-parity milking frequency, f is the effects of fixed lactation curves, and p is the permanent environmental (p.e.) effects, and Zh and Xf are design matrices for h and f, respectively.

Reliability approximation using the MTEDC method consists of 3 steps in the sequence-computing contributions by own data, progeny, and parental average. Computing the data contribution for a cow involves absorbing all fixed effects and the nongenetic random effects into her genetic effects. By absorbing the major fixed herd test-date-parity milking frequency effect and ignoring the minor effect of fixed lactation curves (Liu et al., 2001), the left-hand side of the mixed model equations pertinent to a cow i with records becomes


where and R is the error (co)variance matrix for the records of the cow, and aii is the diagonal element of inverse of the relationship matrix for the cow. Further absorption of her p.e. effects gives with . The effective information contributed by her own test-day records is summarized in matrix Z'WZ, from which a reliability matrix due to own records (Y) is computed using


([11])

and then the reliability matrix is converted to an EDC matrix based on Formula [5]Go


([12])

It can be shown that {Psi}Y = 4Z'WZ for cows with records.

Collecting EDC from all progeny is done by processing pedigree files from the youngest to oldest animals (VanRaden and Wiggans, 1991). An animal’s reliability contributed by a progeny adjusted for mate effect (P-M) is computed as


([13])

where is the mate’s reliability calculated excluding EDC of this progeny and


([14])

with being the progeny’s reliability, including information from its records and its progeny but not from its parents. The reliability matrix RP–M is converted to an EDC matrix


for the progeny. Note that , if . For the derivation of formula [13]Go, see Appendix 1.

After the EDC from all progeny has been accumulated, pedigree files are processed from the oldest to youngest animals to collect parental contribution (VanRaden and Wiggans, 1991). To avoid double counting an animal’s contribution to its parents, the EDC provided by the animal to its parents must first be subtracted from its parental EDC. Then the resulting EDC ({Psi}*) is converted to reliability using


for computing parental contribution to the animal. Assuming that both parents are not related,


([15])

and


can be used to compute reliability and the EDC matrix provided by parental average to the animal, respectively.

When the 3 steps have been completed, the total EDC ({Psi}T) is calculated with


([16])

for each animal and converted to a total reliability matrix with


([17])

Using the total reliability matrix, the reliability value of any linear function of genetic random regression coefficient estimates can be readily derived. An example is given here to show the calculation of reliability of combined lactation EBV using the total reliability matrix RT. The breeding value on a 305-d lactation basis is defined as , and combined lactation breeding value as ucomb = w'uL = v'a, where is a 305-day lactation breeding value for lactation i, uL contains lactation breeding values of the first 3 lactations, tj contains Legendre polynomial constants (Liu et al., 2001) for DIM j, ai contains regression coefficients of genetic effects for lactation i, w contains economic weights of the first 3 lactations, and v' = w' diag {1'T,1'T,1'T}. Because var(a) = G0 and var(â) = G0RT, reliability of combined lactation EBV is calculated with


([18])

By setting the ith element of w to one and the others to zero, reliability of the ith lactation EBV can be calculated using the same formula for .

According to the MTEDC method, the relationship among animal, parents, and progeny is modeled via reliability matrix, e.g., equation [13]Go for calculating an animal’s reliability contributed by a progeny adjusted for its mate, and formula [15]Go for computing the reliability of parental average. The relationships among different information sources of the same animal are modeled via EDC matrix {Psi}, which can be summed up over information sources.

Calculating DYD
Calculation of DYD of a bull represents a process of absorbing the daughters’ genetic effects using the daughters’ records adjusted for all other effects and the bull’s mates’ breeding values. Denote a*ii as a diagonal element of the inverse of the numerator relationship matrix corresponding to daughter i of a bull, under the assumption that the daughter has no progeny, and di as the diagonal element pertinent to daughter i in inverse of the (co)variance matrix of Mendelian sampling D–1 (Mrode, 1996, p. 28). The DYD of the bull under the general multiple-trait animal model [1] is computed with


([19])

where subscript i represents daughters of the bull, n is the number of daughters with records, âd is EBV of the dam of the daughter i obtained from a complete genetic evaluation like routine national evaluation, yi is a vector of the ith daughter’s test-day yields adjusted for heterogeneous herd variances, and


See Appendix 2 for the derivation of the DYD formula [19]Go under multiple-trait models.

Formula [19]Go for calculating DYD is equivalent to the one that VanRaden and Wiggans (1991) developed for the repeatability animal model and also similar to the one by Mrode and Swanson (2002) for RRTDM. For repeatability animal models, the DYD vector q becomes a scalar that is a weighted average DYD over all lactation records. For multiple lactation/trait models, q consists of DYD for each lactation/trait. As genetic effects are modeled in regression coefficients on DIM in RRTDM, q is also expressed in the form of regression coefficients (Mrode and Swanson, 2002). With those regression coefficients, any linear function of the DYD regression coefficients can be calculated. For RRTDM, the inverse of matrix B does not always exist. For instance, when all daughters of a bull have fewer test-day records in a given lactation than the order of fit for genetic effects, DYD are not estimable for the bull. However, when at least one daughter has at least as many test-day records in a lactation as the order of fit for genetic effects, DYD of the bull are defined for this lactation. Lactation yield deviations for cows (see Appendix 2) are defined when there are at least so many test-day records available for a lactation as the order of fit for genetic effects. In case of estimable DYD, all data from daughters with defined, as well as daughters with undefined, lactation yield deviations can be used. Because of using adjusted records to derive DYD, the size of contemporary groups is not accounted for. To obtain DYD that correspond with EBV, records without contemporaries must be excluded from the DYD calculation. In case of unknown mates, âd is replaced with solutions of corresponding genetic groups. The DYD formula [19]Go for bulls and formula [B3]Go in Appendix 2 for average yield deviations of cows show how information on test-day level should be summarized to lactation level for test-day models in national and international genetic evaluations.

In contrast to the derivations by VanRaden and Wiggans (1991) and Mrode and Swanson (2002), the DYD formula [19]Go was derived from bull equations (see Appendix 2) but not from cow equations. Additionally, bulls and their mates can be inbred. Factor 2 in formula [19]Go reflects the fact that the derived DYD are expressed on an animal basis and not on a sire basis, as in VanRaden and Wiggans (1991) and Mrode and Swanson (2002). Formula [19]Go always gives zero DYD values for missing lactations/traits. The DYD Formula [19]Go developed for general multiple-trait models is applicable for any RRTDM and single-trait models.

Fixed effects, nongenetic random effects, and EBV of mates are subtracted from daughters’ yields, but are not absorbed into the daughters’ genetic effects. In DYD calculation, no reliability associated with DYD can be derived from the DYD formula [19]Go. However, the MTEDC can be implemented to compute reliability associated with DYD. The reliability calculation procedure for DYD using the MTEDC differs with the normal procedure of reliability approximation in that only the first 2 steps of the MTEDC are needed: calculation of data contribution for daughters with records and collecting EDC of daughters adjusted for mates’ contribution. The summing of the EDC of all daughters gives EDC associated with bull’s DYD q


([20])

with {Psi}P–M being the EDC matrix contributed by a daughter adjusted for her dam’s contribution. Reliability matrix associated with q is computed with


([21])

Data
Table 1Go describes the data set used in the official May 2003 genetic evaluation of production traits for Austrian, German, and Luxembourg dairy cattle (includes Holstein, Red, and Jersey). The presented DYD formula [19]Go and associated reliability formula [21]Go were applied to the data set. Besides DYD of bulls, lactation yield deviations were computed for cows with records in the application.


View this table:
[in this window]
[in a new window]
 
Table 1. Descriptive statistics of the data set for official production-trait genetic evaluation of Holstein, Red, and Jersey dairy cattle from Austria, Germany, and Luxembourg in May 2003.
 
A Verification Study on the DYD Calculation and Reliability Approximation Methods
The rationale of the verification study is that EBV of a bull obtained from DYD and associated reliabilities must be equal to those obtained from a complete mixed model equation system, including equations of genetic and nongenetic effects of the bull, his daughters, and mates. Also, approximated reliability values of the bull’s EBV must be equal to his true reliability value, derived from the exact inverse of the mixed model equations.

Pedigree and data.
A single half-sib family structure was assumed in the verification study. As parental contribution is irrelevant for validating the DYD formula and its associated reliability method, both parents of the bull were assumed to be unknown. All daughters of the bull had a known dam and performance records. All mates of the bull had both parents missing and full performance information, i.e., 3 complete lactations with 10 test-day records each. Each mate had only one progeny, namely a daughter of the bull. Between daughter-dam pairs, there was no additional genetic connection except the bull. With permanent environmental effects included in the study, correctness of the method of adjusting records for other effects and accuracy of the reliability calculation method could be examined; therefore, no fixed effects were included in the simulation. A monthly testing scheme was assumed to generate test-day records for daughters and dams, with first tests done at 10 DIM for mates or first-crop daughters and 15 DIM for second-crop daughters. A new test-day record was added every 30 d for daughters or mates.

Various scenarios were investigated for accuracy of the DYD and reliability methods. A different number of daughters was used to simulate the bull in the progeny testing program and with second-crop daughters. The daughters may have 1 to 3 lactations. Lactation progress of the daughters was simulated with a number of test-day records. Test-day yields of all mates of the simulated bull were set to zero, implying that phenotypic performance of the mates was equal to comparison-group average. All daughters’ phenotypic records on all test days were simulated as deviations from their comparison groups and expressed in phenotypic standard deviations. Test-day yield deviation at a given DIM was assumed to be 0.10 of the phenotypic standard deviation of DIM for all daughters. Parameters of test-day milk yield (Liu et al., 2000a) were used in generating and analyzing data.

Estimating a bull’s breeding values using DYD and associated reliabilities.
The solutions from the complete mixed model equation system were used in the calculation of DYD with formula [19]Go. For estimating the bull’s breeding values using his DYD, the following mixed model equations are to be solved


([22])


([23])

and


([24])

The reliability matrix of the sire’s EBV was calculated with .

For international genetic evaluations, the multitrait DYD calculated with formula [19]Go can be used as trait values. Setting up mixed-model equations of multiple across-country evaluations based on the multitrait DYD differs with the multiple-trait bull-comparison procedure based on de-regressed proofs (Schaeffer, 2001). The least squares part of the left-hand side of the mixed-model equation corresponding to a bull with daughters having performance is {Psi}b; the right-hand side of the mixed-model equation pertinent to the bull in multitrait across-country equations is {Delta}b.


    RESULTS AND DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 APPENDIX 1
 APPENDIX 2
 REFERENCES
 
Reliability Approximation Using the MTEDC
The presented MTEDC method extends the single trait DE (VanRaden and Wiggans, 1991) to general multiple-trait models, including RRTDM. Therefore, it must give identical formulas as those in VanRaden and Wiggans (1991) for single-trait models, when the MTEDC is applied to single-trait models. As shown before, formulas [4]Go and [5]Go of multiple-trait models are equivalent to formulas [6]Go and [7]Go of single-trait models, respectively. Let , , and represent a progeny’s reliability based on its own performance records and its descendants but not based on its parents, the animal’s reliability contributed by the progeny adjusted for mate of the animal, and the reliability of the mate with EDC from this progeny excluded, respectively. Formula [13]Go can then be simplified to , which is identical to derived by VanRaden and Wiggans (1991).

Using formula [13]Go, a standard daughter can be defined. For multiple-trait models, a standard daughter is characterized as a cow having one record in all traits with an infinite number of contemporaries, and the other parent missing. This definition is an extension of Interbull’s EDC definition for single-trait models but differs slightly from VanRaden and Wiggans (1991) in the assumption of the other parent. However, the difference in definitions does not have an impact on reliability because of the corresponding difference in variance ratios. For traits with repeated records, such as lactations with test-day records, one record means a complete lactation, according to MTEDC. To study the effect of the model on the value of EDC, a single trait repeatability model was applied to the lactation average SCS, and a multiple lactation RRTDM (Liu et al., 2001) was applied to test-day SCS and compared. The repeatability model was assumed to have a heritability of 0.1 and repeatability of 0.35, whereas heritability of lactation average SCS was about 0.2 for the RRTDM (Liu et al., 2000b). Table 2Go shows the EDC of an example cow in a monthly testing program during the course of lactations under both models. With only 3 tests in first lactation, the cow has 1.6 EDC for the single trait model. This means that the reliability contributed by 3 tests of her own is equal to the reliability that would be contributed by 1.6 of her standard daughters. When the cow has completed her first 3 lactations, the EDC of the repeatability model is 8.3, which is greater than the EDC of the RRTDM, which is 5.2. The phenomenon of smaller EDC of multiple-trait models than single-trait models has also been observed in Interbull’s EDC for bulls of the same age from countries using different models for national genetic evaluations. This can be explained by different definitions of standard daughter in single and multiple trait models: a standard daughter has only one complete lactation under the repeatability model but 3 complete lactations under the RRTDM.


View this table:
[in this window]
[in a new window]
 
Table 2. Comparison of effective daughter contribution (EDC) of a single- and multiple-trait model, illustrated with a cow in a monthly testing program.
 
Liu et al. (2002) compared the MTEDC and ENP (Jamrozik et al., 2000) methods for RRTDM in a simulation study. Whereas the MTEDC gave unbiased reliability estimates, ENP overestimated reliability values of bulls that had daughters with short or missing lactations. In the verification study of this research, the reliability values computed with the MTEDC were equal to true reliability values. Additionally, the MTEDC can be applied to very large populations. It has been routinely used in Germany (Liu et al., 2002) and The Netherlands (De Roos et al., 2002).

Approximating reliability using the MTEDC method does not require special handling of missing traits. In case of nonrandom distribution of relatives in contemporary groups, the loss of information must be considered in the computation of data contribution. Calculating the parental contribution to an animal requires recomputation of the animal’s contribution to its parents that was done already in the calculation of progeny contribution, and this repetition can be avoided if the animal’s contribution to its parents can be stored in the progeny contribution calculation and then be read in for the parental contribution calculation. The MTEDC makes approximations in the calculation of parental contribution by assuming that both parents are unrelated, as do all existing reliability approximation methods, to our knowledge. A problem in the calculation of progeny contribution was identified that, at the time of adjusting the contribution of a progeny to an animal for reliability of the animal’s mate, the mate may not have all its progeny processed yet. This problem may be solved by using reliability values from the previous genetic evaluation as starting values for the mate; however, this bias has limited impact due to the factor 0.25 for the mate’s reliability. We applied the MTEDC method to a maternal-effect model with correlated genetic effects for international genetic evaluations of calving traits. An extension of the MTEDC to nonlinear or threshold models needs to be developed for traits like herd life. Although the MTEDC has been proven to be fairly accurate for multiple-trait models, until now, in dairy cattle only, it should also be applicable for other livestock species, e.g., swine or beef cattle.

Calculation of DYD and Associated Reliabilities
Formula [19]Go for calculating DYD and formula [21]Go for approximating associated reliability are valid for general multiple-trait models, including RRTDM. Calculation of DYD using formula [19]Go requires estimates of all fixed effects and nongenetic random effects and EBV of mates of bulls from a genetic evaluation. The size of contemporary groups and absorption of nongenetic random effects are considered in computing data contribution of daughters in reliability calculations, and reliability values of mates of a bull are adjusted for as well. For the calculation of DYD of a bull, only records of his own daughters have to be considered; male progeny must be excluded because they do not have their own performance records for production traits. The contribution of granddaughters of the bull to his DYD must be ignored, otherwise the contribution would be double counted, as the granddaughters would contribute their data to the DYD of his sons. In DYD calculation, only the cow-to-sire path is considered. Other paths, such as son to sire, are ignored. This is because the contribution of male progeny to the bull will be accounted for in subsequent genetic analyses, e.g., international bull evaluations (Fikse and Banos, 2001), marker-assisted genetic evaluation (Szyda et al., 2002), and 2-step multiple-trait evaluations based on evaluation results from single-trait models (Ducrocq et al., 2001). Because the DYD of one bull do not affect the DYD of other bulls, DYD calculation can be done on a within-bull basis. Parental contribution to the bull is irrelevant for the calculation of the DYD of bulls.

For genetic analyses, DYD on a 305-d lactation basis are more useful than the DYD vector q expressed on a daily basis in the form of regression coefficients; therefore, DYD on a daily basis need to be converted to 305-d lactation basis. Due to the mathematical function for modeling genetic effects in RRTDM, lactation DYD cannot always be estimated. In case that there are no data available in some lactations or some lactation DYD are not estimable, only a submatrix of matrix B needs to be inverted, and the other parts of the inverse of matrix B are set to zero for obtaining a generalized inverse of B. Lactation DYD estimated from shorter lactations are more influenced by extrapolation than lactation DYD from longer lactations. To minimize the impact of extrapolation, at least 10 daughters are required to pass 120 DIM in a lactation in order to make DYD of this lactation official for a bull, which was determined in an application study using the data from a May 2003 evaluation. Similarly, lactation-yield deviations for a lactation of a cow will be publishable if her lactation passes 210 DIM or she has at least 7 tests. Computer resources required for the calculation of the DYD of bulls and lactation-yield deviations of cows and their associated reliabilities were rather limited. Total CPU was equivalent to 8 rounds of iteration for solving mixed-model equations (Liu et al., 2001) for May 2003 production trait genetic evaluations.

The verification study was conducted using Maple 6 software. Estimated breeding value and reliabilities of the bull obtained from the DYD-based mixed-model equation system were identical to those from the complete mixed-model equation system in all scenarios. The reliability values calculated using the MTEDC were equal to their true values. EBV of the daughters obtained from the 2 solving procedures for the complete mixed model equation system were also equal. The results proved that the DYD formula [19]Go and the reliability formula [21]Go are correct. A similar verification study was done for a repeatability model, and the DYD and associated reliability formulas were confirmed to be correct for the single-trait model as well. Table 3Go shows lactation DYD and EBV of the simulated bull during the course of lactation of his daughters. When all daughters had only 2 test-day records in a lactation, which was smaller than the order of fit for genetic effects, DYD of the bull could not be estimated for the lactation. Even in this case, EBV of the bull could be estimated because left-hand side and right-hand side of the DYD-based mixed-model equation system existed in spite of the inestimable DYD. For short lactations, DYD were greater than EBV, with the exception that both were nearly equal in cases of at least 100 daughters. As lactation progressed, the difference between DYD and EBV became smaller, and this was more evident with higher numbers of daughters. The DYD of missing lactations were zero, but corresponding EBV were not. Adding 1000 lactations with 1 test each from second-crop daughters caused changes in both DYD and EBV for the bull. However, these changes were much smaller than those caused by short lactations of first-crop daughters.


View this table:
[in this window]
[in a new window]
 
Table 3. Lactation daughter yield deviations (DYD) and EBV of the simulated bull, expressed in genetic standard deviations.
 
It can be seen in Table 4Go that, as lactations of daughters became longer or more complete, correlation of lactation DYD with EBV of bulls increased significantly. When the average number of test-day records of daughters reached 5, the correlation was stabilized. The DYD and EBV of later lactations were less correlated than first lactations, in particular when daughters had, on average, only 3 tests in a lactation. This can be explained by the fact that later lactation EBV were estimated from more sources of information, including daughter information of earlier lactations, whereas first lactation EBV were influenced predominantly by first-lactation daughter information and parental average only. Table 5Go indicates that correlation between lactation DYD and EBV increased slightly with the number of daughters. In Table 6Go, correlations of lactation DYD with EBV were close to unity, except for the very last birth year, where bulls tended to have fewer daughters and higher percentages of daughters with less completed lactations. Due to shorter lactations of second-crop daughters, bulls born in 1994, 1993, and 1991 to 1992, had slightly lower correlations for first, second, and third lactations, respectively. Across all birth years of bulls, correlation between lactation DYD and EBV was 0.996 for all 3 lactations. Summarizing the correlations in Tables 4Go, 5Go, and 6Go, we concluded that the DYD formula [19]Go is accurate.


View this table:
[in this window]
[in a new window]
 
Table 4. Correlation of milk-yield lactation daughter yield-deviations with EBV by average number of test-day records of daughters for Black and White Holstein bulls that had at least 10 daughters with lactation passed 120 DIM and in at least 10 herds in a May 2003 genetic evaluation.
 

View this table:
[in this window]
[in a new window]
 
Table 5. Correlation of milk-yield lactation daughter yield-deviations with EBV by number of daughters for Black and White Holstein bulls that had at least 10 daughters with lactation passed 120 DIM and in at least 10 herds in a May 2003 genetic evaluation.
 

View this table:
[in this window]
[in a new window]
 
Table 6. Correlation of milk-yield lactation daughter yield-deviations with EBV by birth year for Black and White Holstein bulls that had at least 10 daughters with lactation passed 120 DIM and in at least 10 herds in a May 2003 genetic evaluation.
 
Because the DYD formula [19]Go expresses DYD in the form of regression coefficients for RRTDM, any linear function of the regression coefficient estimates can be derived for individual bulls. For routine genetic evaluations, 305-d lactation DYD values and DYD lactation curves for bulls satisfying the requirement for official DYD are published, in addition to lactation EBV and genetic lactation curves.


    CONCLUSIONS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 APPENDIX 1
 APPENDIX 2
 REFERENCES
 
A multiple lactation RRTDM is used for genetic evaluations of milk, fat, protein yield, or SCS for joint analyses of Holstein, Red, and Jersey dairy cattle from Austria, Germany, and Luxembourg. For reliability approximation, the MTEDC was feasible for multitrait models, reasonably accurate, and applicable for very large populations (e.g., a German Holstein population with more than 11 million cows and 15 million animals). The presented DYD formula for multitrait models was verified to be correct in a simulation study. The correlations between DYD and EBV from a routine genetic evaluation were very high-0.996 for all bulls-suggesting that the DYD were highly consistent with EBV. The developed DYD and MTEDC methods can be used to compute DYD and associated reliabilities for international bull comparisons based on a multiple-trait model, a marker-assisted genetic evaluation, and validation of a national genetic evaluation system.


    APPENDIX 1
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 APPENDIX 1
 APPENDIX 2
 REFERENCES
 
Derivation of the Formula for Computing Animal’s Reliability Contributed by a Progeny Adjusted for Mate of the Animal
Formula [13]Go is used for calculating an animal’s reliability contributed by a progeny adjusted for mate of the animal. To derive the formula, we apply here a similar derivation method for general multiple-trait models as VanRaden and Wiggans (1991) did for a single-trait repeatability animal model. Let y* represent a vector of adjusted trait values of a progeny, from which true values, instead of estimates, of all fixed and nongenetic random effects have been subtracted. Let yo be y* adjusted for the effect of mate of the animal (âd)


([A1])

A sire model can be used to predict the animal’s breeding values based on this progeny’s information only


([A2])

with as being a vector of the animal’s breeding values as sire of the progeny, and {omega} is a vector of residual effects containing the progeny’s Mendelian sampling and error effects.

If y* is the only source of information of the progeny for estimating its breeding values (aP), then the corresponding mixed-model equations would be


Reliability of the progeny contributed by its own data only is computed with


([A3])

As a result of adjusting for the mate’s EBV, genetic variance of yo is reduced to


([A4])

The following equations are required in the derivation


The following (co)variances are also used in the derivation: and cov(as,yo) = ZGs where . Because


([A5])

An animal’s breeding values, based only on the information of this progeny, are estimated with (Henderson, 1984, p. 39)


([A6])

Because , and animals’s reliability contributed by the progeny and adjusted for the mate of the animal is


([A7])


([A8])

The above formula [A8]Go was derived for progeny with performance records. In case that progeny has no performance data, Z'R–1Z is replaced with an EDC matrix. Whether an EDC matrix or Z'R–1Z are considered, the same formula for RP–M can be obtained.


    APPENDIX 2
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 APPENDIX 1
 APPENDIX 2
 REFERENCES
 
Derivation of the Formula for Computing Daughter Yield Deviations of Bulls under Multiple Trait Models
Formula [19]Go is used for computing DYD of bulls under general multiple-trait models, including RRTDM. To derive the DYD formula for multiple-trait models, we apply a similar approach here as used by VanRaden and Wiggans (1991) for a single-trait model. Assuming a cow i has its own performance records and no progeny, equations of genetic effects for the cow i can be written in a general way as follows


([B1])

where EBV of the dam of the cow (âd) are obtained from a complete evaluation system, e.g., from a national genetic evaluation. ãi is EBV of the cow i that were estimated as if only her own performance and her parents made a contribution to her EBV. ãs is the EBV of the sire of the cow i that were estimated as if only his daughters contributed to his EBV. For computing the contribution of the cow to her sire’s DYD, only her performance records are considered, and the contribution of her progeny to her EBV must be ignored because her daughters’ records make a contribution to her mates’ DYD and her granddaughters to her sons. di is the diagonal element pertinent to cow i in inverse of the (co)variance matrix of Mendelian sampling D–1 (Mrode, 1996). Without its own progeny, di = . Note that the sire and his mate are allowed to be inbred. Rewriting equation [B1]Go gives EBV of the cow based on her records and parental information


([B2])

where {varepsilon}i = yiXi Zpii and

Define average yield deviations for the cow i as


([B3])

For the RRTDM, ci are expressed in the form of regression coefficients. Lactation or average-yield deviations do not exist for lactation of cow i when she has fewer tests in the lactation than the order of fit for genetic effects. Replacing {varepsilon}i in equation [B2]Go with ci gives


([B4])

Because no other sources of information are involved,


([B5])

Genetic effect equations for a bull without its own performance can be written as


([B6])

where ãs, âss, and âds represent EBV of the bull, his sire, and dam, respectively. ds is diagonal element of matrix D–1 (Mrode, 1996, p. 28) for the bull. Note that no male progeny of the bull is considered, because his sons do not have their own production records.


([B7])

where n is the number of daughters with data. Substitution of equation [B4]Go into equation [B6]Go by treating the cow i as one of n daughters of the bull gives


([B8])

Accumulating the terms involving ãs to the left side of Equation [B8]Go leads to


([B9])

Based on equation [B5]Go, the right side of equation [B9]Go can be simplified to


([B10])

Let and


([B11])

The DYD formula for general multiple-trait models is obtained as


([B12])

Received for publication July 17, 2003. Accepted for publication November 10, 2003.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 APPENDIX 1
 APPENDIX 2
 REFERENCES
 


Biochard, D., B. Bonaiti, A. Barbat, and S. Mattalia. 1995. Three methods to validate the estimation of genetic trend for dairy cattle. J. Dairy Sci. 78:431–437.[Abstract]

Fikse, W. F., and G. Banos. 2001. Weighting factors of sire daughter information in international genetic evaluations. J. Dairy Sci. 84:1759–1767.[Abstract]

De Roos, A. P. W., A. G. F. Harbers, and G. de Jong. 2002. Herd specific random regression curves in a test-day model for protein yield in dairy cattle. Pages 55–58 in Proc. 7 WCGALP, Communication #01–05, Montpellier, France.

Ducrocq, V., D. Boichard, A. Barbat, and H. Larroque. 2001. Implementation of an approximate multi-trait BLUP evaluation to combine production traits and functional traits into a total merit index. Page 2 in 52nd Annual Meeting of the European Association for Animal Production, Budapest, Hungary.

Graser, H. U., and B. Tier. 1997. Applying the concept of number of effective progeny to approximate accuracies of predictions derived from multiple-trait analyses. Pages 547–551 in Proc. Assoc. Adv. Anim. Breed. Genet., Dubbo, NSW, Australia.

Haile-Mariam, M., M. E. Goddard, and P. J. Bowman. 2001. Estimates of genetic parameters for daily somatic cell count of Australian dairy cattle. J. Dairy Sci. 84:1255–1264.[Abstract]

Harris, B., and D. Johnson. 1998. Approximate reliabilities of genetic evaluations under an animal model. J. Dairy Sci. 81:2723–2728.[Abstract]

Harville, D. A. 1997. Matrix Algebra from a Statistician’s Perspective. Springer, New York, NY.

Henderson, C. R. 1984. Applications of Linear Models in Animal Breeding. University of Guelph, Guelph, Ontario, Canada.

Jamrozik, J., L. R. Schaeffer, and G. B. Jansen. 2000. Approximate accuracies of prediction from random regression models. Livest. Prod. Sci. 66:85–92.

Koots, K. R., L. R. Schaeffer, and G. B. Jansen. 1997. Approximate accuracy of genetic evaluation under an animal model. J. Dairy Sci. 80 (Suppl. 1):226. (Abstr.)

Liu, Z., F. Reinhardt, and R. Reents. 2000a. Estimating parameters of a random regression test day model for first three lactation milk production traits using the covariance function approach. Interbull Bull. 25:74–80.

Liu, Z., F. Reinhardt, and R. Reents. 2000b. Parameter estimates of a random regression test day model for first three lactation somatic cell scores. Interbull Bull. 26:61–66.

Liu, Z., F. Reinhardt, A. Bünger, L. Dopp, and R. Reents. 2001. Application of a random regression model to genetic evaluations of test day yields and somatic cell scores in dairy cattle. Interbull Bull. 27:159–166.

Liu, Z., F. Reinhardt, and R. Reents. 2002. The multiple trait effective daughter contribution method applied to approximate reliabilities of estimated breeding values of a random regression test day model for genetic evaluation in dairy cattle. Pages 553–556 in Proc. 7WCGALP, Communication #20–15, Montpellier, France.

Mrode, R. A. 1996. Linear models for the prediction of animal breeding values. CAB International, Wallingford, UK.

Mrode, R. A., and G. J. T. Swanson. 2002. The calculation of cow and daughter yield deviations and partitioning of genetic evaluations when using a random regression model. Pages 51–54 in Proc. 7 WCGALP, Communication #01–04, Montpellier, France.

Reents, R., J. C. M. Dekkers, and L. R. Schaeffer. 1995. Genetic evaluation for somatic cell score with a test day model for multiple lactations. J. Dairy Sci. 78:2858–2870.[Abstract]

Schaeffer, L. R., J. Jamrozik, G. J. Kistemaker, and B. J. Van Doormaal. 2000. Experience with a test day model. J. Dairy Sci. 83:1135–1144.[Abstract]

Schaeffer, L. R. 2001. Multiple trait international bull comparison. Livest. Prod. Sci. 69:145–153.[Medline]

Strabel, T., I. Misztal, and J. K. Bertrand. 2001. Approximation of reliabilities for multiple-trait model with maternal effects. J. Anim. Sci. 79:833–839.[Abstract/Free Full Text]

Strandén, I., and M. Lidauer. 1999. Solving large mixed linear models using preconditioned conjugate gradient iteration. J. Dairy Sci. 82:2779–2787.[Abstract]

Szyda, J., Z. Liu, R. Maschka, F. Reinhardt, and R. Reents. 2002. Computer system for routine QTL detection and genetic evaluation under a mixed inheritance model in dairy cattle. Pages 749–750 in Proc. 7 WCGALP, Communication #28–10, Montpellier, France.

VanRaden, P. M. 2001. Methods to combine estimated breeding values obtained from separate source. J. Dairy Sci. 84 (E. Suppl.):E47–E55.[Abstract/Free Full Text]

VanRaden, P. M., and G. R. Wiggans. 1991. Derivation, calculation, and use of national animal model information. J. Dairy Sci. 74:2737–2746.[Abstract]

Wiggans, G. R., P. M. VanRaden, J. Bormann, J. C. Philpot, T. Druet, and N. Gengler. 2002. Deriving lactation yields from test-day yields adjusted for lactation stage, age, pregnancy, and herd test date. J. Dairy Sci. 85:264 (online only). http://www.adsa.org/jds/abs/2002/d0210264.htm.[Abstract]


This article has been cited by other articles:


Home page
J DAIRY SCIHome page
Z. Liu, J. Jaitner, F. Reinhardt, E. Pasman, S. Rensing, and R. Reents
Genetic Evaluation of Fertility Traits of Dairy Cattle Using a Multiple-Trait Animal Model
J Dairy Sci, November 1, 2008; 91(11): 4333 - 4343.
[Abstract] [Full Text] [PDF]


Home page
J DAIRY SCIHome page
J. Tarres, Z. Liu, V. Ducrocq, F. Reinhardt, and R. Reents
Validation of an Approximate REML Algorithm for Parameter Estimation in a Multitrait, Multiple Across-Country Evaluation Model: A Simulation Study
J Dairy Sci, October 1, 2007; 90(10): 4846 - 4855.
[Abstract] [Full Text] [PDF]


Home page
J DAIRY SCIHome page
J. Szyda and J. Komisarek
Statistical Modeling of Candidate Gene Effects on Milk Production Traits in Dairy Cattle
J Dairy Sci, June 1, 2007; 90(6): 2971 - 2979.
[Abstract] [Full Text] [PDF]


Home page
J DAIRY SCIHome page
J. Szyda, Z. Liu, F. Reinhardt, and R. Reents
Estimation of Quantitative Trait Loci Parameters for Milk Production Traits in German Holstein Dairy Cattle Population
J Dairy Sci, January 1, 2005; 88(1): 356 - 367.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services