JDS
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Journal of Dairy Science Vol. 85 No. 8 2040-2045
© 2002 by American Dairy Science Association ®
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Carvalheira, J.
Right arrow Articles by Blake, R. W.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Carvalheira, J.
Right arrow Articles by Blake, R. W.

An Autoregressive Repeatability Animal Model for Test-Day Records in Multiple Lactations

J. Carvalheira*,{dagger}, E. J. Pollak{ddagger}, R. L. Quaas{ddagger} and R. W. Blake{ddagger}

* Instituto de Ciências Biomédicas Abel Salazar and
{dagger} Centro de Estudos de Ciência Animal, Universidade do Porto, Rua do Monte-Crasto, 4485-661 Vairão, Portugal
{ddagger} Department of Animal Science, Cornell University, Ithaca, NY 14853

Corresponding author:
J. Carvalheira; e-mail:
jgc3{at}mail.icav.up.pt.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIAL AND METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
Test-day (TD) models are becoming a standard for genetic evaluation of production traits in dairy cattle. Various approaches to model covariances between TD records include random regression, autoregressive repeatability, orthogonal polynomials, and models based on character processing. The applicability of these models is mainly associated with the number of parameters to estimate, incorporation of multiple lactations, and the accuracy of correlations generated by the cow’s repeated expression of milking performance (TD yields) within and across lactations. We define and evaluate a multiple-lactation, autoregressive-repeatability model that disentangles environmental effects due to cow within and between lactations. Simulated records either included or ignored a long-term environmental effect between lactations. Our autoregressive TD animal model correctly detected presence and the absence of this effect and accurately recovered the assumed variance components and correlations underlying the data (10 parameters for three lactations). Estimates of variance components and autocorrelation coefficients were obtained using DFREML-simplex methodology. Given the value of this approach to reduce the size of residual variance components, autoregressive animal models are a preferable alternative to classical methods based on cumulative lactation yield to improve milk production in dairy cattle.

Abbreviation key: CL = confidence limit, HTD = herd-test-date, LTE = long-term environmental effect, STE = short-term environmental effect, TD = test day

Key Words: test-day animal model • dairy cattle • autoregression • genetic evaluation


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIAL AND METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
Analysis of daily yields on test days (TD) to calculate cumulative 305-d milk yield (Kachman and Everett, 1989; Van Tassell et al., 1992; Pander and Hill, 1993; Everett and Schmitz, 1994) was an important improvement in modeling dairy production to improve genetic gains in milk, which from 1960 to 1988 averaged 84 kg/yr in US Holstein cows (Freeman and Lindberg, 1993). These TD-adjusted cumulative yields contained similar amounts of genetic variation with smaller residual variances compared to unadjusted lactation records. Consequently, heritabilities for TD-adjusted yields of milk, fat, and protein were 11 to 17% larger than for unadjusted records, which portended more accurate genetic evaluations and greater genetic progress from selection based on TD-adjusted cumulative 305-d lactation records (Van Tassell et al., 1992).

Other approaches (Meyer et al., 1989; Ptak and Schaeffer, 1993; Schaeffer and Sullivan, 1994; Swalve, 1995; Carvalheira et al., 1998) directly analyzed daily yields instead of TD-adjusted cumulative 305-d lactation records to genetically evaluate sires and cows for milk production. Date of test effects may explain more of the systematic environmental variations (Ptak and Schaeffer, 1993; Swalve, 1995). More TD records per cow also promote more accurate prediction of genetic merit. Especially following advice in Quaas (1984) and Wade and Quaas (1993), Carvalheira and co-workers (1998) relaxed the usual assumption of unitary correlation between TD within cow by implementing first-order autoregressive structures within and between lactations to fit short- and long-term environmental covariances among repeated TD records. Results from this study indicated that an autoregressive TD animal model would provide larger additive genetic variance and heritability, more accurate estimates of individual genetic merit, and nearly double the theoretical annual genetic gain (200 kg) in milk for US Holstein. Consequently, our objectives were to define and evaluate a multiple lactation autoregressive repeatability model, challenging its capacity to accurately estimate the variance components and correlations underlying simulated data, which either included or ignored long-term environmental effects.


    MATERIAL AND METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIAL AND METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
Model Definition
Modeling TD data requires an expression that relates these observations to their underlying random genetic and environmental factors. Systematic environmental factors are typically treated as fixed effects. Smaller residual variances commonly result when groups of contemporary animals are represented using herd-test-dates (HTD) instead of herd-year-seasons (Kachman and Everett, 1989; Van Tassell et al., 1992; Ptak and Schaeffer, 1993; Swalve, 1995). Age at calving and the lactation of repeated daily milk yields also need to be represented. These important fixed effects (or their variants) have been explanatory variables in all TD models. The general principles outlined by Wade and Quaas (1993) were followed to incorporate a first-order autoregressive structure into mixed model methodology. Variance components and correlations were estimated with DFREML procedures (Smith and Graser, 1986; Meyer, 1988; Boldman and Van Vleck, 1991) using the simplex algorithm (Nelder and Mead, 1965) to maximize likelihood functions.

Records of daily milk yield may be depicted as repeated observations. For example, a simple repeatability TD model may be defined as:


Formula

where

yijkmnis the observed daily yield,

HTDiis the fixed effect due to cows tested in the same herd (H) and test date,

Age(H)jis the fixed effect due to the jth age class at calving within herd,

DIM(H)kis the fixed effect due to cows tested in the kth days in milk (DIM) class within herd,

amis the random effect of animal,

pmis the random effect accounting for environmental covariances among TDs within cow, and

eijkmnis the random residual term.

When nonadditive genetic components are assumed to be negligible, a simple repeatability model is an extension of the usual breeding value model, with the further assumption of an additional nongenetic covariance from repeated observations on the same animal. This covariance among residuals of repeated records on the same animal results in a structure typically assumed to be:


Formula
,

where {delta}ijis the Kronecker delta, or, in matrix notation for three repeated records


Formula

and having the phenotypic (co)variance structure


Formula

where

Formula represents the variance unique to each observation,

Formula represents the environmental covariance between pairs of records on the same animal (assumed equally correlated for all pairs), and

amm{sigma}2a represents the genetic covariance assuming unitary genetic correlation between records on the same animal, which implies that the same genes are responsible for milk production throughout the lactation.

Quaas (1984) called this the "simplistic repeatability model" because it is unlikely that all records are equally correlated regardless of adjacency. A more realistic general structure was suggested (Quaas, 1984), which would impose an autoregressive covariance structure for residuals. The simplest of these structures is a stationary first-order autoregressive process, which is applicable with equal intervals (i.e., TD records taken 1 mo apart), as shown here for three repeated observations


Formula

where

{sigma}2e represents the environmental variance, and

{rho} represents the autocorrelation with |{rho}| <1, and which involves no more parameters (two in this case) than a structure ignoring the autoregressive process. Contiguous records in an autoregressive model are equally correlated if the interval between them is constant, which yields a decaying correlation between noncontiguous records with increasingly greater separation in time.Fcan be easily factored intoLDL', whereL(L') is a lower (upper) triangular matrix andDis a diagonal matrix. This factorization is especially useful for computing the determinant ofF, e.g., for evaluating the likelihood function.

This structure is also computationally tractable because its inverse is easily obtained without formal inversion, resulting in a tridiagonal matrix, if all contiguous observations are present and equidistantly spaced:


Formula

or


Formula

However, it may create problems for interpreting environmental influences because its structure is assumed entirely autocorrelated, which precludes independent effects peculiar to each TD record. Therefore, a potentially more realistic portrayal of random environmental effects on daily milk yield may be (for three repeated observations):


Formula

where

tandrare two environmental components resulting from partitioning the environmental variance (e) withtfollowing a first-order autoregressive process in repeated TD yields andris an independent effect,

{sigma}2t represents the environmental covariance among repeated daily yields,

Frepresents the first-order autocorrelation structure associating records from each cow as defined above, and

{sigma}2r represents the residual variance common to all observations.

Thus portrayed, the two environmental components represent separate influences on each TD record. One component ({sigma}2t) comprises short-term effects (STE) within a lactation (e.g., dietary quality and intake, weather, minor injury, estrus). These positive or negative fluctuations may be canceled or reversed with time, thus imparting a pattern or structure (i.e., a stationary, first-order autoregressive process between equally spaced, monthly TD records), where the correlations between TD records decay in time. The other component({sigma}2r) comprises all other sources of unaccounted temporary variation that independently affect TD records.

Environmental effects that permanently influence subsequent lactations include body size, disease, and other health events, and physical injury. Conceptually, this is the classical definition of a permanent environmental effect, where close events in an autoregressive structure are assumed more highly correlated than distant events. These correlated long-term environmental effects (LTE) may not be separately partitioned from the STE effect (which accounts only for correlations between TD within a lactation). A repeatability TD model involving multiple lactations needs to anticipate potential covariances, including those occurring between repeated lactations. The unequal time lag between consecutive lactations (the dry period) precludes using the STE effect to also portray potential covariances between lactations. The report by Harville (1979) may have contained the earliest suggestion to model the cow’s permanent environmental effect as a first-order autoregressive process to more accurately reflect temporal effects due to cow from one lactation to the next. Factors that may cause correlation between lactations have long-term influence in the sense that they should similarly influence repeated lactations (but possibly in different degrees) with noncanceling, carryover effects. Consequently, an autoregressive structure may be a realistic approach to portray these LTE effects, which relaxes the restriction (or assumption) that covariances across lactations are equal and invariant (Harville, 1979; Quaas, 1984).

Consequently, the conceptual multiple-lactation TD animal model was defined:


Formula

where

yijkLmnis the TD record,

HTDiis the fixed effect due to measuring the milk yields of cows on the same test date in the same herd,

Agejis the fixed effect due to the jth age at calving,

DIM(H)k(L)is the fixed effect due to cows tested in the same DIM class within herd and lactation,

am is the random effect of animal,

pm(L)isthe random effect of LTE following a first-order autoregressive process across lactations,

tn(mL) is the random effect of STE nested within cow and lactation, assumed independent between lactations, and following a first-order autoregressive process within cow and between TD, and

rijkLmn is the random residual term.

The model in matrix notation is:


Formula

wherey~N(Xß,V),ßis the unknown vector of fixed effects that, with knownX, defines the mean;a,p, and t are vectors representing the random effects due to animal, LTE, and STE, which are, respectively, associated with records inybyZ,M, andQ;ris the vector of residuals; andVis the (co)variance matrix. The expectations and (co)variance for this model are (for the case ofLequal to three lactations per cow and invoking previous definitions):


Formula

and then


Formula

Formula

Formula

Formula

Formula

Formula

where q1equals the number of animals being evaluated,q2equals the number of animals having records,nequals the number of TD of a particular female, andNequals the total number of records in the analysis; {otimes} represents the Kronecker tensor product;Ais the numerator relationship matrix,Iis the identity matrix, andFis the autocorrelated block diagonal corresponding to themth cow within theLth lactation. This parameterization also permits different variances of the STE effect for multiple lactations ({sigma}2tL). Note thatQ=Iwith this design. Therefore, the corresponding mixed model equations are:


Formula

Simulated Data
Two datasets were simulated according to model assumptions to evaluate the multiple lactation autoregressive repeatability methodology, and to assess the efficiency of achieving the desired convergence. One dataset contained an LTE effect; the other one did not. The TD animal model was challenged to accurately retrieve the variance components and correlations underlying the data simulated with these different effects. Although plausible parameter estimates are not a guarantee that procedure is correct, the reverse implies incorrect or inaccurate methodology.

A pedigree file containing 50 sires and 500 cows was generated and used to simulate datasets of 15,000 records (10 TD records per cow in each of three lactations). The data were organized in groupings according to fixed effects in the model: 44 test dates, 20 DIM subclasses per lactation, and 4 age-at-calving classes. Fifty replicates of each dataset were analyzed: the one including the LTE effect had 10 parameters to estimate (six variance components and four autocorrelations), and the one ignoring it had eight parameters (five variance components and three autocorrelations). Starting values for variance components and correlations were deliberately 10% larger than the true values for every analysis. Convergence was achieved when the variance of the –2 log likelihood functions from all points defining the simplex polytope was <10–6. For each replicate, the same log likelihood (up to the fourth decimal place; Boldman et al., 1995) in each of the two last runs was obtained with three or four cold starts.


    RESULTS AND DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIAL AND METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
Results from analysis of the simulated data are summarized in Table 1Go. Each model performed acceptably well for the data possessing identical effects (emboldened values, in Table 1Go). For these scenarios, the resulting parameter estimates were close to the true values within the limits of confidence (CL) at {alpha}= 0.05. Furthermore, the model including an LTE effect was sensitive to the absence of this effect in the data that did not contain an LTE component. The small values obtained forFormula and Formulap were plausibly from a sampling effect from imposing nonzero requirements on the parameter space. However, the analytical model omitting LTE overestimated most parameters from data containing an LTE effect.


View this table:
[in this window]
[in a new window]

 
Table 1. Estimates of variance components for animal({sigma}2a),long-term environmental effects (LTE,{sigma}2p),short-term environmental effects({sigma}2t)and residual({sigma}2r),and autocorrelations ({rho}) and their 95% confidence limits (CL) using a multiple lactation, test-day animal model either including or ignoring a LTE effect to analyze data simulated to either contain or not contain an LTE effect.
 
Conceptually, the LTE effect differs from the classical permanent environmental effect under the assumption that noncontiguous lactations are less correlated than adjacent ones. Many events that permanently affect a cow’s capacity to express milk production occur during productive life (e.g., udder injury and infection), which would impart a correlation structure between lactations that decays in time. This may be why Harville (1979) and Quaas (1984) considered the autoregressive approach to be a realistic way to describe this effect. Our results indicated that the least risky approach for accurate evaluation is to use a full model including an LTE effect.

The number of parameters to estimate (variance components and correlations) may restrict the applicability of a linear mixed model for TD genetic analysis. The relatively small number of parameters considered by an autoregressive model is advantageous in this sense. The additive genetic (co)variance structure in this study and analysis of field data (Carvalheira et al., 1998; Carvalheira, 2001) was fitted with unitary correlation between TD and lactations under assumption that the same genes similarly affect milk yield expression throughout a cow’s productive life. Study is warranted to further evaluate models that would represent genetic (co)variances with autocorrelation structure.

As expected, substantial iteration was required to achieve convergence due to the number of parameters to estimate, which, as pointed out by Boldman et al. (1995), is worth considering when using DFREML with the simplex in variance component estimation. When the analytical model mismatched effects contained in the data, convergence sometimes required over 400 iterations. Therefore, besides "poor" starting values, an incorrect model may also affect convergence requirements. Table 2Go shows the average numbers of iterations required to achieve convergence for the replicates requiring fewer than 400 iterations.


View this table:
[in this window]
[in a new window]

 
Table 2. Average number of iterations1 to attain convergence in a three-lactation variance component estimation problem using DFREML-simplex methodology applied to TD animal models with and without long-term environmental effects (LTE).
 

    CONCLUSIONS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIAL AND METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
Approaches to describe covariances between TD records include random regression, autoregressive repeatability, orthogonal polynomials, and models based on character processing. The applicability of these models is mainly associated with the number of parameters to be estimated, incorporation of multiple lactations, and the accuracy of correlations generated by the cow’s repeated expression of milking performance (TD yields) within and across lactations. Results showed that an autoregressive animal model containing STE and LTE autoregressive processes accurately estimated these effects and the assumed variance components and correlations regardless of presence or absence of a long-term environmental effect due to cow. This model has been applied to field data from Italy (Carvalheira, 2001) and the United States, where "persistent environmental effects did not have major impact on the milking performance of Holstein cows" (Carvalheira et al., 1998). Variance component estimates from these studies were consistent and revealed a potential theoretical genetic gain of nearly 200 kg/yr in milk for US Holstein herds. Therefore, this analytical approach can increase the accuracy of predicted EBV and the attendant rate of genetic gain by reducing the size of residual components of variance.


    ACKNOWLEDGEMENTS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIAL AND METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 
This work contributes to the project POCTI/33162/CVT/2000 and to regional project S-284, Genetic Enhancement of Health and Survival for Dairy Cattle. It was partially funded by the Fundação para a Ciência e a Tecnologia (FCT) and FEDER (EU).

Received for publication October 19, 2001. Accepted for publication February 26, 2002.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIAL AND METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 ACKNOWLEDGEMENTS
 REFERENCES
 


Boldman, K. G., and L. D. Van Vleck. 1991. Derivative-free restricted maximum likelihood estimation in animal models with a sparse matrix solver. J. Dairy Sci. 74:4337–4343.[Abstract]

Boldman, K. G., L. A. Kriese, L. D. Van Vleck, C. P. Van Tassel, and S. D. Kachman. 1995. A manual for the use of MTDFREML. A set of programs to obtain estimates of variance and covariances (Draft). USDA, Agric. Res. Serv.

Carvalheira, J. G. V., R. W. Blake, E. J. Pollak, R. L. Quaas, and C. V. Durán-Castro. 1998. Application of an autoregressive process to estimate genetic parameters and breeding values for daily milk yield in a tropical herd of Lucerna cattle and in United States Holstein herds. J. Dairy Sci. 81:2738–2751.[Abstract]

Carvalheira, J. 2001. Application of test-day models for variance component estimation: Genetic evaluation and management information of dairy cattle in Ragusa (Italy). Proc. Int. Conf. Nutrition and Management On-Line. Consorzio Ricerca Filiera Lattiero-Casearia, Ragusa, Italy, September, 2001.

Everett, R. W., and F. Schmitz. 1994. Dairy genetics in 1994 and beyond. Animal Science Mimeo Series #170, Cornell Univ., Ithaca, NY.

Freeman, A. E., and G. L. Lindberg. 1993. Challenges to dairy cattle management: Genetic considerations. J. Dairy Sci. 76:3143–3159.[Free Full Text]

Harville, D. A. 1979. Recursive estimation using mixed linear model with autoregressive random effects. Page 157 inVariance Components and Animal Breeding. Proc. Conf. in Honor of C. R. Henderson, Cornell Univ. Ithaca, NY.

Kachman, S. D., and R. W. Everett. 1989. Test day model with individual herd correction factors. J. Dairy Sci. 72(Suppl.1):60. (Abstr.)

Meyer, K. 1988. DFREML. A set of programs to estimate variance components under individual animal model. J. Dairy Sci. 71(Suppl. 2):33 (Abstr.).

Meyer, K., H. U. Graser and K. Hammond. 1989. Estimates of genetic parameters for first lactation test day production of Australian Black and White cows. Livest. Prod. Sci. 21:177–199.

Nelder, J. A., and R. Mead. 1965. A simplex method for function minimization. Computer J. 7:308–313.

Pander, B. L., and W. G. Hill. 1993. Genetic evaluation of lactation yield from test day records on incomplete lactation. Livest. Prod. Sci. 37:23–36.

Ptak, E., and L. R. Schaeffer. 1993. Use of test day yields for genetic evaluation of dairy sires and cows. Livest. Prod. Sci. 34:23–34.

Quaas, R. L. 1984. Linear Prediction. Pages 1–76 in BLUP School Handbook. Animal Genetics and Breeding Unit, Univ. of New England, N.S.W., 2351, Australia.

Schaeffer, L. R., and B. P. Sullivan. 1994. Genetic evaluation of dairy goats using test day yields. Proc. 5th World Congr. Gen. Appl. Livest. Prod. 18:182–185.

Smith, S. P., and H. U. Graser. 1986. Estimating variance components in a class of mixed models by restricted maximum likelihood. J. Dairy Sci. 69:1156–1165.[Abstract/Free Full Text]

Swalve, H. H. 1995. The effect of test day models on the estimation of genetic parameters and breeding values for dairy yield traits. J. Dairy Sci. 78:929–938.[Abstract]

Van Tassell, C. P., R. L. Quaas, and R. W. Everett. 1992. Parameter estimates for 305-day ME records and 305-day test-day residual records. J. Dairy Sci. 75(Suppl. 1):251. (Abstr.)

Wade, K. M., and R. L. Quaas. 1993. Solutions to a system of equations involving a first-order autoregressive process. J. Dairy Sci. 76:3026–3031.[Abstract/Free Full Text]


This article has been cited by other articles:


Home page
J DAIRY SCIHome page
J. Vasconcelos, F. Santos, A. Bagnato, and J. Carvalheira
Effects of Clustering Herds with Small-Sized Contemporary Groups in Dairy Cattle Genetic Evaluations
J Dairy Sci, January 1, 2008; 91(1): 377 - 384.
[Abstract] [Full Text] [PDF]


Home page
J DAIRY SCIHome page
R. M. Sawalha, J. F. Keown, S. D. Kachman, and L. D. Van Vleck
Genetic Evaluation of Dairy Cattle with Test-Day Models with Autoregressive Covariance Structures and with a 305-d Model
J Dairy Sci, September 1, 2005; 88(9): 3346 - 3353.
[Abstract] [Full Text] [PDF]


Home page
J DAIRY SCIHome page
R. M. Sawalha, J. F. Keown, S. D. Kachman, and L. D. Van Vleck
Evaluation of Autoregressive Covariance Structures for Test-Day Records of Holstein Cows: Estimates of Parameters
J Dairy Sci, July 1, 2005; 88(7): 2632 - 2642.
[Abstract] [Full Text] [PDF]


Home page
J DAIRY SCIHome page
J. Vasconcelos, A. Martins, M. F. Petim-Batista, J. Colaco, R. W. Blake, and J. Carvalheira
Prediction of Daily and Lactation Yields of Milk, Fat, and Protein Using an Autoregressive Repeatability Test Day Model
J Dairy Sci, August 1, 2004; 87(8): 2591 - 2598.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Carvalheira, J.
Right arrow Articles by Blake, R. W.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Carvalheira, J.
Right arrow Articles by Blake, R. W.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS