|
|
||||||||
Animal Improvement Programs Laboratory, Agricultural Research Service, USDA, Beltsville, MD 20705-2350
Corresponding author: Melvin Kuhn; e-mail: mkuhn{at}aipl.arsusda.gov.
| ABSTRACT |
|---|
|
|
|---|
Estimates from the unadjusted model were biased downward; however, the 3 alternative analyses provided estimates of DD effects that were essentially unbiased, with a prior adjustment for cow effects and previous milk yield in the model providing the best results in terms of elimination of bias. Therefore, DD effects can be estimated from field data without bias from cow effects.
A designed trial with 2 groups and 10 or fewer cows/ group is noninformative and has an unacceptably high probability of leading to invalid conclusions. A minimum of 30 cows/group is considerably better and should be used whenever possible. Even with 30 cows/group, however, the power is low unless the difference between DD groups for yield is at least 1130 kg. Prior correction of 305-d, mature equivalent records for cow effects, using predicted producing abilities, could be done in designed trials to improve the statistical power of tests and accuracy of estimates.
Key Words: days dry methodology simulation
Abbreviation key: DD = days dry, PE = permanent environmental, PrevM = previous lactation milk yield.
| INTRODUCTION |
|---|
|
|
|---|
Although both designed trials and observational studies have been done in the past, much of the research on DD has been done using field data. However, the validity of using field data to determine DD effects has been recently questioned (Bachman and Schairer, 2003; Grummer and Rastani, 2004). Bachman and Schairer (2003) seemed to imply that only designed trials could be used to assess the impact of dry period length on performance. Their final conclusion, for example, was: "Importantly, additional animal trials that assign cows randomly to the dry period lengths that are being evaluated are needed to determine the optimal dry period length for the modern dairy cow in various management scenarios." The idea that studies based on field data are "biased" and basically noninformative has been promulgated in the popular press literature as well (e.g., Linderoth, 2003; Rastani and Grummer, 2003; Mohr, 2004). This criticism is unfortunate because it could potentially (and erroneously) erode producer confidence in results based on their data.
Certainly, some difficulties can be associated with the use of field data, but there are problems with designed trials as well and, more importantly, there are pronounced advantages of observational studies over designed trials. One primary advantage of observational studies, (beyond sample size) is that they have the potential to provide much more information than can be gleaned even from numerous designed trials. For example, one can feasibly determine the effects of DD on herd life or lifetime production in observational studies and examine interactions with other factors such as parity and previous lactation variables such as milk yield (PrevM), days open, or SCS. This more complete information is crucial in making appropriate recommendations for dry period length. However, few if any designed trials have done so, almost certainly because the expense, time, and effort involved would be prohibitive. The contention that field data cannot be used to determine DD effects should be examined so that this unparalleled source of information could be used to assist in the development of appropriate management recommendations and to lend guidance to further research.
The main argument put forth by Bachman and Schairer (2003), which was repeated several times in different ways throughout the paper, was that, in practice, higher producing cows tend to get shortened dry periods more frequently than do lower producing cows, which means that estimates of DD effects based on field data, at least for production, are biased by cow effects. The primary objective of this research was to determine if by using appropriate methodology, unbiased estimates of DD effects could be obtained from field data, even when higher producing cows receive shorter dry periods more frequently than do lower producing cows. Another objective was to ascertain relevant sampling properties of designed trials for estimation of DD effects.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
Model for simulation of records.
To address the simple question of whether partial confounding of cow effects with DD biases estimates of DD effects, the simulation need not be complicated. It is adequate to simulate only the basic aspects of the data, relevant to the question of interest. First-lactation milk records (y) were simulated according to the equation:
![]() | ([1.1]) |
and second and later lactations according to the equation:
![]() | ([1.2]) |
where µM was an overall mean, A was an additive genetic effect, PE a permanent environmental (PE) effect, Parity was a parity 1 vs. second and later effect, DDeffect was the effect of DD, and e was a temporary environmental (or error) effect. Parameters used for simulation are summarized in Table 1
. Breeding values for base generation animals were generated as normal random variables with mean zero. Thereafter, breeding values were generated as parent average breeding value plus a normally distributed Mendelian sampling effect. Permanent environmental effects and errors were also generated as normal random variables with mean zero. Standard deviations for A, PE, and e for simulated milk records (Table 1
) were comparable to those of Kuhn et al. (2005) for kilograms of lactation milk yield.
|
![]() | ([2]) |
where µDD was an overall mean DD, ßM was a linear regression of DD on the cows milk yield deviation from the population mean in the ith yr (µMi), and eDD was a random normal error, independent of milk yield. The overall standard deviation of 15 d for simulated DD was similar to that of Kuhn et al. (2005). The 0.2 correlation between simulated DD and PrevM used in this simulation (Table 1
) was higher than the 0.15 correlation reported by Kuhn et al. (2005) to ensure an adequate test of methodology. If the stronger relationship can be handled, then certainly the weaker one can be as well.
Once DD was calculated from equation [2], it was then categorized as indicated in Table 2
, and the corresponding DD effect, also listed in Table 2
, was added to the subsequent lactation milk record in the next year, as indicated in equation [1.2]. The objective of this part of the research was simply to determine if DD effects can be estimated without bias when DD is correlated with PrevM (i.e., cow effects). Thus, the exact magnitude used for the DD effects (Table 2
) is largely irrelevant. Nonetheless, the effects used in this simulation (Table 2
) are at least as large or larger than literature estimates (e.g., Funk et al., 1987; Makuza and McDaniel, 1996), which ensures at least an adequate test of methodology. Larger effects provide a more stringent test of methodology. The simulated DD effects (Table 2
) for the last 4 categories were set to zero to simulate no effect of DD after the minimum dry period to maximize production is met.
|
There were 1000 sires available each year with replacement of 200 sires/yr, which resulted in 2800 total sires. For the sake of introducing some selection into the simulation, the 200 bulls culled each year were randomly selected from the bottom half of bulls based on a simulated value computed as the sum of true breeding value plus a normally distributed random error with mean zero and standard deviation 335. Sires and dams of new sires were randomly selected from the top half of their respective populations and this selection was based on a similar simulated value. Each sire and dam was allowed to produce only one son/yr.
Estimation of DD Effects for Observational Data
DD as a covariate vs. categorical variable.
In all analyses tested in this study, DD effects were estimated using a categorical variable for DD rather than regression. This approach has been common practice in the past with respect to DD (e.g., Keown and Everett, 1986; Funk et al., 1987; Makuza and McDaniel, 1996) and may be justified for a number of reasons. Most research to date (Funk et al., 1987; Makuza and McDaniel, 1996) indicates that production increases with increasing DD up to a point, but after that point, additional DD add nothing to subsequent lactation yield. This relationship is somewhat difficult to model precisely with regression. The relationship is certainly not linear. A quadratic regression may be a sufficient approximation but, in principle, a quadratic is necessarily parabolic and thus would tend to "predict" a decrease in production after the maximum point. A cubic regression would allow for the flattening off after the maximum point but would predict an increase in production with long DD. Fitting DD as a covariate necessarily makes prior assumptions about the nature of the relationship between DD and milk yield and the purpose of the research should be to discover the nature of the relationship. Fitting DD as a covariate may conceal variation that would be uncovered when DD is fit as a categorical variable.
Concerning potential bias in estimates of DD effects using field data, it is clear that what needs to be corrected for are the cow effects (breeding value and PE effects). Thus, in addition to an analysis with no adjustments, 3 alternative analyses, aimed at correcting for cow effects, were tested.
Alternative I: PrevM.
The first, and simplest, alternative was to include PrevM as a linear covariate in the model for estimation of DD effects. The model equation for analysis under this alternative, was:
![]() | ([3.1]) |
where y was a milk record and DDcatg was a categorical variable for DD as defined in Table 2
. Only second and later lactations contributed to this analysis. This alternative is similar to the approach of Makuza and McDaniel (1996). Previous lactation yield has also been used in some designed trials to correct for differences in cow effects between groups (e.g., Gulay et al., 2003).
Alternative II: Prior correction for cow effects, estimated from an animal model.
Although the properties of animal models fit using BLUP are well established (Henderson, 1984), their use in DD research presents a unique aspect. Preferably, DD would be kept in the model for estimation of cow effects to ensure that the DD effect is not partially partitioned into the animal or PE effect. However, exclusion of first-lactation records can lead to bias in predicted breeding values (Pollak and Quaas, 1981; Henderson, 1982). The dilemma, then, is that first-lactation records cannot be excluded but they also have no DD associated with them. This difficulty was circumvented by assigning all first-lactation records their own unique (dummy) DD category, category 11. This method was similar to the approach of Funk et al. (1987), although they did not use genetic relationships for estimation of cow effects. This approach to estimating the cow effects was the reason for inclusion of a parity effect in the simulation. Parity 1 and DD category 11 were completely confounded and this aspect should be tested because it would occur when working with field data.
In practice, it may be desirable to examine DD effects on subsets of data, for example, by parity. Cow effects, however, need to be estimated using all data simultaneously to maximize accuracy and to avoid biases. Both of these goals can be met using a 3-step procedure for estimation of DD effects: 1) estimate cow effects from an animal model using all data, 2) preadjust the records by subtracting the estimated cow effects, and 3) estimate DD effects from the preadjusted records. This was the procedure tested in this alternative. Cow effects were estimated from the model:
![]() | ([3.2a]) |
where DDcatg was a categorical variable as defined in Table 2
, plus a category 11 for all first-lactation records. The effects A, PE, and e were fit as random effects with variance-covariance matrices: Var(A) = A
A2, Var(PE) = I
PE2, and Var(e) = I
, where A was a relationship matrix. Days dry effects were estimated from the model:
![]() | ([3.2b]) |
where Adjusted-y is the record with solutions for breeding value and PE from [3.2a] subtracted, and DDcatg is as defined in Table 2
. First-lactation records were not included for analysis in [3.2b].
Alternative III: Combination of I and II.
A third alternative was to preadjust for cow effects as in alternative II but to include PrevM in the model for analysis, i.e., in equation [3.2b]. Thus, alternative III was the same as II except the model for estimation of DD effects was:
![]() | ([3.3]) |
Evaluation of Models/Alternatives for Analysis of Simulated Observational Data
As indicated in Figure 1
, 50 replicates were run. In each replicate, DD effects were estimated using each of the 4 alternatives (no adjustment, plus alternatives I to III). The primary objective was to determine whether a given method was biased. To evaluate bias for a given alternative, estimates across the 50 replicates were averaged for that alternative and simply compared with the true DD effects to see if the estimates differed from the true values.
The amount of variation (standard deviation) among replicate estimates was also calculated for each method. This reflects the precision or standard error (i.e., the standard deviation of the sampling distribution) of each approach.
Simulation of Designed Experiments
The second part of this research was to determine the sampling properties of designed trials for estimation of DD effects. Twenty, 40, and 60 cows were generated and divided randomly and evenly between 2 DD groups. Five levels of DD effect (difference between the 2 groups) were used: 225, 450, 680, 900, and 1130. Thus, there were 3 (sample sizes) x 5 (DD effects) = 15 independent experiments, each with 1000 independent replicates.
Two records were simulated for each cow, according to the equations:
![]() |
![]() |
where DD_effect was the DD effect for the given experiment and Cow was a normally distributed cow effect with mean zero and variance equal to the sum of the variances for breeding value and PE used in the simulation of observational data. Error variance was also the same as that in Table 1
. Relationships were not generated in this simulation because cows were assigned randomly to DD groups and because an animal model was not used for estimation. Two records were generated on each cow to be able to determine the usefulness of including "previous lactation" yield in the model for analysis. Two models were used for analysis in each replicate:
![]() |
![]() |
Summary statistics for each of the 15 experiments included percentage of replicates with significant differences at the level of 0.10, mean P values, the standard deviation among replicate estimates, and a description of the distribution of estimates. The difference between groups in absolute cow effects was also summarized.
| RESULTS |
|---|
|
|
|---|
|
Standard deviations among estimates (Table 4
) were small for all analyses and there was surprisingly little difference among them. Use of PrevM was as good as adjustment for cow effects from an animal model. This, too, probably reflects large sample size. If sample size for estimation of DD effects was small but sample size for estimation of cow effects was large (e.g., if cow effects in a designed trial were corrected for by using animal model estimates from the national database), then prior correction for cow effects would likely show an advantage in standard error over the use of PrevM alone. An analysis using only second-lactation records (number of records per replicate = 308,000) was done and results (not shown) were similar to those in Tables 3
and 4
.
|
The probabilities of detecting real differences, for different magnitudes of DD effects and the 3 different sample sizes used in this simulation, are given in Table 5
. If DD effects truly exist, it would be better to flip a coin to decide whether dry period length affects milk yield than to run an experiment with only 10 cows/ group in 2 groups. The 10-cow groups did not detect a DD effect even 50% of the time until the magnitude of the effect was 1130 kg, and even then only with the enhanced analysis using PrevM. With a milk price of $0.26/kg ($12.00/cwt), the 10-cow groups would fail to detect an effect on subsequent lactation yield (1130 kg) worth $294.00/cow per lactation, 46% of the time. The lowest mean P value (Table 6
) for the 10-cow groups was only 0.19. With 20- and 30-cow groups, better results were obtained, as expected, but even with 30 cows/ group, DD effects were not detected more than 50% of the time until the magnitude of the DD effect was at least 680 kg. Mean P values dropped to about 0.20 for both 20- and 30-cow groups when the DD effect reached 680 kg. However, 30-cow groups did not reach a mean P value less than 0.10 until 900 kg, and 20-cow groups not until 1130 kg.
|
|
|
|
300 kg 53% of the time. Days dry groups differed in mean cow effects by
300 kg 37% of the time for 20-cow groups and 28% of the time for 30-cow groups. Use of mature-equivalent records for estimation of DD effects (in contrast to use of actual yields) may cloud differences in lactation yield caused by DD. Nonetheless, they may still be the best choice of production variable for designed studies because the predicted producing abilities, computed by Animal Improvement Programs Laboratory and based on a large national database, could be used to preadjust the mature-equivalent records. This, in turn, would reduce differences between groups in cow effects and, subsequently, increase power of tests and precision of estimates.
|
| DISCUSSION |
|---|
|
|
|---|
Bachman and Schairer (2003) listed 15 factors affecting milk yield and seemed to indicate that these factors were not or could not be accounted for in the analysis of field data. In fact, with the exception of bST usage, virtually all of the factors (and certainly all of the major factors) listed by Bachman and Schairer (2003) can be readily accounted for in the analysis of field data and perhaps better so than what can or has been done in designed trials. Season of calving effects, for example, would be estimated much more accurately with the larger field data sets than with the smaller experimental data sets.
One last point of contention brought up by Bachman and Schairer (2003), concerning the use of field data, was quality of data. One argument was that, in field data, "short dry-period categories are composed primarily of a nonrandom population of cows that freshened earlier than expected for various reasons." Ignoring the fact that they also argued that high-producing cows were the ones with short DD, this concern can be easily dealt with using an edit, as can abortions, which were pointed to as another possible source of corruption in field data.
The topic of "planned vs. unplanned shortened dry period" is rather questionable. First, this argument is largely speculative; it is by no means well documented that most short dry periods occurring in practice are "unplanned." Furthermore, from a certain perspective, it does not matter why a cow was dried off or how the cow was managed, only how long the dry period was. The "planned vs. unplanned" mantra necessarily implies that 1) shortened dry periods do cause a loss in subsequent lactation milk yield and 2) that there are (special?) management practices that can offset this loss, either partially or completely. The viewpoint taken in making this argument, then, is that the only correct comparison to make is the one where these management practices are used. It would be useful if these management practices were clearly specified. One could then evaluate whether such practices are likely to be implemented in practice and, subsequently, what should be estimated: the effect of DD with or without this management. The "unplanned" dry periods can be readily identified with an edit if one desires to use "planned" dry periods for comparison. Both days open and calving dates are reported to DHI. For a given cow, the reported days open from the previous lactation can be compared with the calving date in the next lactation. If they match (say, within 10 d), that would suggest that the producer knew (or should have known) when the cow was going to calve because it was, in effect, reported to DHI; if they do not match, then the record can be excluded. If a given herd has many records that do not match, the whole herd can be deleted. Furthermore, if the field data indicate that shorter DD are associated with decreases in yield, and management practices do exist that can offset these losses, then the conclusion is either that 1) farmers do not know about these management practices, or 2) they are not feasible for implementation. Such a result, in turn, would suggest that these practices (which putatively offset the losses) need to be better publicized or need to be reworked because the farmer cannot implement them in a cost-effective way.
In spite of an overstated criticism of the use of field data for determining the effects of DD on performance, Bachman and Schairer (2003) are correct in their overall implication that well-designed experiments, followed up by appropriate statistical analysis, have certain potential advantages over the use of field data. The converse, however, is also very much true. A major limitation of designed experiments will almost invariably be sample size and the ensuing lack of statistical power (ability) to detect real differences, which was clearly demonstrated in Tables 5
through 9
of this paper. Another concern related to small sample sizes is scope of inference. With only a small number of cows, typically in a single herd (and possibly a specialized research herd), located in one part of the country, it would be difficult to make confident inferences for all cows across the United States. To overcome the shortcomings of designed trials related to sample size, the experiments need to be independently repeated numerous times, in different herds, across different regions of the country.
Another serious limitation of most designed trials for determining the consequences of shortened dry periods is that they usually involve short-term investigation only. The long-term consequences of shortened dry periods need to be investigated before being recommended to producers. Even if shortened DD resulted in no loss in production in the subsequent lactation, there could be other consequences that result in shortened herd life. Designed experiments aimed at addressing this aspect, although feasible, would be costly both financially and in terms of time. Furthermore, it seems likely that the degree of control in a long-term experiment, one of the potential advantages of designed trials, would deteriorate over time.
Bachman and Schairer (2003) bring up a number of good points related to use of field data for examining the effect of DD, although most if not all of these can be handled using appropriate edits and data analysis. Designed experiments and observational studies both have advantages and disadvantages. Furthermore, the weaknesses of one are the strengths of the other. Designed trials have the potential advantage of better control over some extraneous variables and, perhaps most importantly, direct observation of cows involved. They are limited, however, in their ability to detect real differences, in their ability to ensure equal representation or proper correction for cow effects across groups, in the scope of inference that can be made, and in their ability to examine long-term consequences or interactions with other factors, all of which are the strengths of observational studies.
The objective should be to present dairy farmers, as accurately as possible, with all the consequences of varying dry period length. Both types of studies should be done (and done well) to meet that goal. An extreme viewpoint in either direction, that designed studies are too small and therefore noninformative or that observational studies are "biased" and merit little if any consideration, is contrary to accomplishing that goal.
| CONCLUSIONS |
|---|
|
|
|---|
A designed experiment with 10 or fewer cows/group is noninformative and has an unacceptably high probability of leading to erroneous conclusions. Although the use of 20 cows/group reduces this probability somewhat, an absolute minimum of 30 cows/group is considerably better and should be used whenever possible. Even with 30 cows/group, however, standard errors are large and, thus, group sizes larger than 30 would be preferable whenever feasible. With 30-cow groups, repeated trials would still be needed to obtain results in which confidence could be placed.
Previous-lactation milk yield should be included as a covariate in the analysis of data from designed experiments. Mature-equivalent production records corrected using the predicted producing abilities computed quarterly by Animal Improvement Programs Laboratory may be the best choice of trait for analysis in designed trials. Correction of records using predicted producing abilities would help eliminate cow effects and increase the accuracy of estimates, which is the greatest weakness of designed trials. If mature-equivalent yields are used, however, scrutiny of the records involved would be necessary because these records are extended to a 305-d basis, even for cows milking less than 305-d. Use of 305-d records could conceal differences in lactation yield if cows in either group failed to milk the full 305-d, as would be the case if short dry periods led to earlier dry off or earlier culling in the subsequent lactation.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
Received for publication October 20, 2004. Accepted for publication December 17, 2004.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
R. D. Watters, J. N. Guenther, A. E. Brickner, R. R. Rastani, P. M. Crump, P. W. Clark, and R. R. Grummer Effects of Dry Period Length on Milk Production and Health of Dairy Cattle J Dairy Sci, July 1, 2008; 91(7): 2595 - 2603. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Pezeshki, J. Mehrzad, G. R. Ghorbani, H. R. Rahmani, R. J. Collier, and C. Burvenich Effects of Short Dry Periods on Performance and Metabolic Status in Holstein Dairy Cows J Dairy Sci, December 1, 2007; 90(12): 5531 - 5541. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. T. Kuhn, J. L. Hutchison, and H. D. Norman Dry Period Length in US Jerseys: Characterization and Effects on Performance J Dairy Sci, April 1, 2007; 90(4): 2069 - 2081. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. T. Kuhn, J. L. Hutchison, and H. D. Norman Dry Period Length to Maximize Production Across Adjacent Lactations and Lifetime Production J Dairy Sci, May 1, 2006; 89(5): 1713 - 1722. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. S. Gulay, M. J. Hayen, H. H. Head, C. J. Wilcox, and K. C. Bachman Milk Production from Holstein Half Udders After Concurrent Thirty- and Seventy-Day Dry Periods J Dairy Sci, November 1, 2005; 88(11): 3953 - 3962. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |