|
|
||||||||
,2
* Institute of Biology and Biotechnology of Agriculture, National Research Council, Segrate 20090, Italy
Department of Dairy Science, University of Wisconsin, Madison 53706
1 Corresponding author: pjboettcher{at}gmail.com
| ABSTRACT |
|---|
|
|
|---|
Key Words: mastitis mixture model somatic cell count
| INTRODUCTION |
|---|
|
|
|---|
Current genetic evaluation procedures for SCC treat the records as homogeneous; that is, ignoring that there may be some hidden structure due to unknown disease status (Schutz, 1994). However, Detilleux and Leroy (2000) suggested that SCC could be a different trait, genetically, in infected and uninfected cows. If this were the case, considering the infection status of cattle may be of value when predicting breeding values for SCC. Unfortunately, this practice would not be straightforward, given the lack of data for mastitis incidence at the individual cow level in most countries. Even when mastitis is recorded, only data for the clinical form of the disease are obtained, whereas cattle may be infected subclinically, showing an increased SCC as the only symptom. To overcome these limitations, Detilleux and Leroy (2000) proposed the use of finite mixture models for analysis of SCC in the absence of information regarding infection status. Such models are appropriate for analysis of heterogeneous data when observations are derived from different distributions, and are particularly useful for situations (like that of SCC) in which the distribution from which a given observation arose is unknown (McLachlan and Peel, 2000). Detilleux and Leroy (2000) outlined a maximum likelihood approach for analysis of SCC with a finite mixture model. Gianola et al. (2004) refined this work and proposed algorithms for estimating parameters of interest. In addition, extensions to the model to allow for heterogeneity of variances were proposed; also, Gianola (2005) discussed issues connected with prediction of random effects in mixture models. Ødegård et al. (2003) developed a Bayesian approach for analysis of a 2-component mixture model for SCC with heterogeneous residual variances, and applied it to simulated data.
The model of Ødegård et al. (2003) considered hetero-scedasticity of variances for residual effects only, and it was extended subsequently to derive a criterion suitable for selection against putative mastitis (Ødegård et al., 2005). If SCC were a trait that differs genetically between infected and uninfected cattle, allowing for heterogeneity of genetic and permanent environmental (PE) variances would be appropriate. The goal of this study was extend the approach of Ødegård et al. (2003) to allow for heterogeneous variances of genetic and PE effects and to apply it to data on SCC collected in US Holsteins. Several models of increasing levels of complexity were compared for fit in an attempt to assess which model was most appropriate for use in genetic evaluation of SCC.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Models
Data were analyzed with a series of 5 different models of increasing order of complexity. Model 1 was a standard test-day repeatability model, similar to that used by Reents et al. (1995) for the evaluation of SCS. Fixed effects of systematic nongenetic factors and random additive genetic and PE effects were fitted. The other 4 specifications were 2-component Gaussian mixture models differing according to the type of heterogeneity of variances considered (Table 1
). All 3 variances (additive, PE, and residual) were homogeneous for model 2, whereas all variances were heterogeneous for model 5. Analyses were based on previous work of Ødegård et al. (2003), with some extensions to accommodate models 4 and 5.
|
![]() | [1] |
where y = vector of n observations for test-day SCS; ß0 = vector of fixed effects common to all records; ß1 = vector of fixed effects corresponding to observations from infected cows; I = identity matrix of order n; Mz = matrix with diagonal elements corresponding to vector z; a0 = vector of random additive genetic effects on SCS in the healthy state; a1 = vector of random additive genetic effects on SCS in the infected state; p0 = vector of random PE effects in the healthy state; p1 = vector of random PE effects in the infection state; e = vector of residual effects; and X0, X1, Za, and Zp = incidence matrices corresponding to fixed (X.) and random (Z.) effects, respectively.
The fixed effects in ß0 included 3 regression coefficients for effects of DIM on SCS, 17 effects of age at calving, and 3,361 herd-test-day effects. Regression coefficients for DIM were based on the Wilmink curve (Wilmink, 1987). Age-at-calving effects were one for each age from 20 through 36 mo. The ß1 vector included a single element, the mean difference (shift) between components 1 (healthy) and 2 (diseased) for observation n. For the nonmixture model (model 1), all elements of Mz were zero. For models with homogeneous genetic and PE variances (i.e., Models 1, 2, and 3), a0 = a1 and p0 = p1. For these models, a0
MVN(0, A
), where A is the numerator relationship matrix and
is the additive genetic variance, and p0
MVN(0, I
pe2), where I is an identity matrix of order 31,040. When genetic and PE effects were heterogeneous, expectations of a0, a1, p0, p1 were all zero and
![]() | [2] |
where
![]() |
is the variance-covariance matrix between additive genetic values under the "healthy" and "infected" statuses. Further,
![]() | [3] |
where
![]() |
is the variance-covariance matrix between corresponding PE effects. Conditionally on the breeding values and PE effects, the variance matrix of the observation vector (residual variance matrix) was expressed as
![]() | [4] |
where I is an identity matrix of order n, and
e02 and
e12 are residual variances for observations from the first and second components, respectively. For models with homogeneous residual variance, (i.e., models 1, 2, and 4) equation [4] simplifies to R = I
e02.
Bayesian Analysis
As mentioned previously, this study extended the work of Ødegård et al. (2003) by allowing for heterogeneity of animal (additive genetic and PE) and residual variances. Many aspects of the Bayesian structure of the model were very similar. Following the notation of Ødegård et al. (2003), the density of y, conditional on z, fixed and random effects, and residual variances, of the most complex model (with heterogeneity of all variances) can be expressed as
![]() | [5] |
where {zi = 0} and {zi = 1} denote the observations assigned to the first and second components, respectively, of the mixture model. In more detail,
![]() | [6] |
for the n0 observations assigned to the first component. For observations assigned to the second component (i.e., when zi = 1), the conditional density is obtained by subtracting X1ß1 to the quadratic form in equation [6], and replacing
e02 with
e12, n0 with n1, I Mz with Mz and Zaa0 and Zpp0 with Zaa1 and Zpp1, respectively.
With regard to prior distributions, widely bounded uniform priors were assigned to all fixed effects. In addition, ß1 was required to be greater than zero, to attain parameter identification. The multivariate normal distributions given above were used as priors for the random effects, conditionally on G and P for additive genetic and PE effects, respectively. Priors for G and P were inverted Wishart distributions, defined by 2 parameters, a degrees of freedom
, and a scale matrix V of the same dimension of G and P. In all cases,
was set equal to 5. For the scale matrices, diagonal elements (variances) were systematically varied from 0.1 to 2.0 in different chains to examine the influence of changing these prior values. Off diagonal elements were held constant at a small, but nonzero value (0.001). The priors for
e02 and
e12 were scale-inverted
2 distributions. For example, for
e02, the prior was
![]() | [7] |
where se02 and
are hyper-parameters corresponding to the prior variance and degrees of freedom, respectively. Experimentation was done to test for the effects of different priors for se02, and chains eventually converged to similar posterior distributions. Therefore, a fixed prior was used. For all models,
was set to 5 and
was set to 1.0 for both mixture components. The elements of the z vector were assumed to be independent and identically distributed as Bernoulli random variables, a priori, with their probabilities depending on Pm, the probability that an SCS is drawn from the "infected" status. The joint prior distribution of z was
![]() | [8] |
Finally, the prior for the mixing proportion was a beta distribution with parameters
and ß. A value of 2 was assumed for both parameters, as in the example of Ødegård et al. (2003).
Based on the sampling distribution of y and on the various prior distributions assigned, the joint posterior density of all unknown parameters was assumed to take the form
![]() | [9] |
To implement a Gibbs sampler, realizations for each parameter of interest must be drawn from their conditional posterior distributions, given the most recent values for all other parameters in the model. For an element of the ß., a., and p. vectors, the conditional posterior distribution was Gaussian. The mean was obtained by solving the mixed model equation corresponding to that element by inserting the most recent realizations for all other elements of ß., a., and p. and forming an offset of the data vector (Wang et al., 1994). The variance was equal to the most recent realization of the residual variance divided by the diagonal element of the coefficient matrix that corresponded to the element of interest (Sorensen, 1999). The conditional posterior distributions of G and P were the inverted Wishart distributions:
![]() | [10] |
where f = a or p and Vf and
f are scale and degree of freedom parameters of the corresponding prior distributions. For G
![]() | (11) |
and qf is the number of animals in the relationship matrix. For P
![]() | [12] |
and qf is the number of animals with records. The fully conditional posterior distributions of
e02 and
e12 were both scale-inverted
2 distributions. For
e02, the scale parameter was
![]() | [13] |
The scale parameter for
e12 was similar to [13], with substitutions to account for the fact that different additive genetic and PE effects were present for observations in the second mixture component:
![]() |
The elements of z assigning individual records to 1 of the 2 mixture components had mutually independent Bernoulli conditional posterior distributions. Bernoulli distributions are fully defined by a parameter p, which is the probability that a certain binary outcome will be obtained. In this case, pi was the probability that observation i would be assigned to the second (infected) component.
![]() | [14] |
where
= [ß0' ß1' a0' a1' p0' p1' The conditional densities for the yi were as presented in equation [6], using the most recent realizations of the various parameters as true values. The fully conditional distribution of the mixing parameter Pm was a beta distribution, with parameters n0 +
and n1 + ß.
The Bayesian structure of the more simple models is obtained by removing terms that refer to effects and (co)variances for the second component when the variance for a given factor is homogeneous. When the variances of genetic and PE effects were homogeneous, prior and fully conditional distributions were scale-inverted
2 distributions, rather than inverted Wishart.
The Gibbs Sampler
The Gibbs sampler was run as follows: 1) Initial values for all fixed and random effects were zero. Variances and covariances were set to the values used in the corresponding prior distributions. 2) Observations were randomly assigned to 1 of the 2 mixture components. 3) Fixed effects were sampled piecewise from univariate normal distributions in the following order: a) regressions on DIM, b) mean of the second component, c) age-at-calving, d) herd-test-day. The mean of the second component was forced to be
0, so only positive values of ß1 were accepted in the sampling process. 4) Random effects were sampled from univariate normal distributions. Genetic effects were sampled first, followed by PE effects. 5) The PE covariance matrix was sampled from an inverted-Wishart (or scale-inverted
2) distribution. 6) The genetic covariance matrix was sampled from an inverted-Wishart (or scale-inverted
2) distribution. 7) The residual variances were sampled from scale-inverted
2 distributions. Variance of the "healthy" component was sampled first. 8) Group membership variables (i.e., elements of z) were sampled from Bernoulli distributions. 9) The mixing proportion Pm was sampled from a beta distribution.
The steps for the appropriate models were repeated as needed at each chain of the Gibbs sampler. Five sampling chains of 205,000 cycles each were generated for each model. For each chain, the first 5,000 cycles were discarded as burn-in period so that a total of 1,000,000 posterior samples were available for each model. Convergence was assessed by the approach of Gelman et al. (2004), which involves calculating the square root of the sum of the within and across chain variances, divided by the variance within chains. Convergence was declared when this value was less than 1.1. Model 5 was the slowest model to converge, with a ratio of 1.09; the value of this ratio was <1.02 for all other models. Posterior distributions of (co)variances were assessed based on sampling every 20th cycle. Posterior means for breeding values were obtained by averaging realizations from every 500th cycle.
Comparison of Models
Model Fit.
The models were compared based on the deviance information criterion (DIC; Spiegelhalter et al., 2002). The DIC is one of a family of methods, including Akaikes information criterion (Akaike, 1970) and the Bayesian information criterion (Spiegelhalter et al., 2002), which consider both the fit and the complexity of a model. The DIC can be expressed as
![]() | [15] |
where D is the posterior expectation of the Bayesian deviance and pD is a measure of the effective number of parameters in the model. The pD is obtained based on the difference between the posterior mean of the deviance and the deviance evaluated at the posterior means of the parameters. The model with the lowest DIC is considered to be the most appropriate model statistically. The DIC has previously been used for evaluation of Bayesian models applied to livestock data (Rekaya et al., 2003).
The quantities needed to calculate the DIC can be obtained readily from the Gibbs sampling iteration. To obtain D, the expected Bayesian deviance, one needs to calculate the Bayesian deviance:
![]() | [16] |
every kth cycle of the Gibbs sampler (
is a vector of the realized values of all parameters of interest at that cycle of the sampling chain), and then average these values over all samples taken. The deviance evaluated at the posterior means of the parameters of interest is then calculated after the Gibbs chain is completed, by substituting
for
in equation [16] where
is a vector of posterior means of all parameters entering into the deviance. Here, D was evaluated every 2,000 cycles, whereas posterior means were calculated based on samples obtained every 500th cycle.
EBV.
In addition to comparing models for statistical appropriateness, the EBV resulting from the different models were evaluated for similarity. The posterior means of additive genetic effects (calculated by sampling every 500th cycle) were used as EBV. Two approaches were used to examine similarity of EBV. First, Pearson correlation coefficients were calculated between all pairs of the 7 sets of animal solutions (1 set of EBV from each of the 3 models (1, 2, and 3) with homogeneous genetic variance and 2 sets each from the 2 models (4 and 5) with heterogeneous genetic variance). Correlation coefficients were calculated for 2 sets of animals: 1) all animals, n = 54,143, and 2) only sires with at least 10 offspring (n = 541).
Second, to examine changes in rank, all sires with at least 10 offspring were sorted in ascending order based on each of the 7 sets of EBV. Then, the top and bottom 50 sires were identified for each set. Finally, the number of animals in common between each pair (high ranking with high ranking and low vs. low) of these sets was observed. Low numbers of mismatches were assumed to indicate high similarity among evaluation models.
| RESULTS |
|---|
|
|
|---|
|
No obvious trend was observed for genetic variance when comparing the standard model with the 4 mixture models. In model 4, the estimates of genetic and PE variances for the second component were much larger than the variances obtained by either of the other 3 mixture models. The genetic variance of the second component was 2.41 in model 4 vs. 0.52 for model 5; corresponding PE variances were 3.22 and 0.76, respectively.
Allowing for genetic and PE variances to be heterogeneous across components had a much greater effect on the analysis than did allowing for heterogeneous residual variance (because differences between models 2 and 3 were much less than between models 2 and 4). In addition, the genetic effects for the 2 components were distinctly different in model 4, and the correlation between genetic effects of the 2 components was only 0.13. In contrast, when genetic, PE, and residual variances were all allowed to be heterogeneous (model 5), the genetic effects of the 2 components essentially differed only in variance (r = 0.97).
Based on the results from Table 2
, clear differences among the models existed. However, results did not reveal per se which model was more strongly supported by the data. The DIC values (based on means across 5 sampling chains per model) for the 5 models are in Table 3
. In the light of the empirical standard errors of DIC statistics, based on between-chain variability, all models can be considered as producing statistically significantly different results, roughly. According to the DIC, model 4 was favored, by far; recall that a model with the lowest DIC is preferred. The average DIC for model 4 was 509,907 or about 6% lower than the DIC of the second-ranking model, model 5. The DIC for model 5 was about 10% lower than for models 2 and 3. As one can observe by comparing models 4 and 5, the assumption of fixed residual variance had profound effects on both the parameter estimates and the fit of the model as evaluated by DIC. The proportion of observations in the second or "infected" component was much greater than in any other model and the difference in SCS level for the 2 classes was much less than for any other model. One conclusion that can be drawn is that models 1 to 3 were underparameterized relative to model 4 (because DIC were larger) and model 5 was overparameterized (larger and more variable DIC). Table 3
also shows that any of the 4 mixture models used was superior to the standard linear model (model 1) for analysis of SCS data. The DIC of model 1 was nearly twice as large as for any of the mixture models. Boettcher et al. (2005) observed a similar advantage when applying a simple mixture model (i.e., similar to model 2) to SCS data from goats.
|
10 daughters were considered. The lowest correlations between sets of EBV were between the first and second components of model 4. At the same time, the EBV from the first component of model 4 were similar to the EBV from the other models (generally around 0.95). The highest correlations (<0.99) were between the 2 components of model 5.
|
|
| DISCUSSION |
|---|
|
|
|---|
0.90, but shuffling in order of the highest ranked sires was observed, demonstrating that practical differences would be realized with the adoption of a mixture model for genetic evaluation. Differences between the linear and mixture models may be even more marked in an analysis with all herds included, because this study included primarily data from well-managed herds with low mean SCS. The superiority of the mixture model may also increase if a more complex model is applied. The primary objective of this research was to compare models that differed according to the distributions of random effects in the models and heterogeneity of variance of effects from each component. Therefore, very simple assumptions were made about the mixing proportions and all observations had the same prior of falling in each of the 2 components, regardless of possible effects of differences in herds, ages, and genetics on this probability. Ødegård et al. (2005) have outlined an approach to accomplish this by including a "liability" variable into the mixing proportion.
The "best" mixture model (based on having the lowest DIC) was one that allowed for heterogeneous genetic and PE variance, but assumed homogeneous residual variance. Correlations between the EBV of the 2 components of the model with heterogeneous genetic effects were low, suggesting that SCS for infected and healthy cows may be different traits.
Although the statistical evidence supporting the use of mixture models is strong, questions remain about the biological ramifications of applying a mixture model, and about the precise meaning of the different EBV resulting from a mixture model with heterogeneous genetic effects. One possible way to approach this question may be to analyze a set of data for which the true infection status was known, considering SCS from healthy and infected cows as different traits, similar to the approach of Heringstad et al. (2006), and then comparing these results with those from the application of a mixture model to the same data. One might also consider a more complex model that includes more components based on the type of pathogen. Some research (M. M. Schutz, Purdue University, West Lafayette, IN; personal communication) has suggested that SCS in response to a contagious infection is under greater genetic control than SCS during an environmental infection. Of course, prospects for such studies are limited by lack of data availability. Another issue is how a genetic evaluation for SCS can be translated into a selection criterion, as discussed in Ødegård et al., (2005).
Finally, the statistical analysis of SCS and IMI should be complemented with biological research to determine whether high or low SCS is favorable in both healthy and infected states. As mentioned in the introduction, contradictory reports have been presented on the relationship between SCS in the seemingly healthy udder and subsequent susceptibility to infection. The relationships between SCS in the infected udder and elimination of the pathogen should also be examined. High SCS in this state could be beneficial, leading to a fast return to health, but a very strong immune response; high SCS could also trigger more damage to the udder and cause greater loss in milk yield.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
| FOOTNOTES |
|---|
Received for publication April 21, 2006. Accepted for publication August 25, 2006.
| REFERENCES |
|---|
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |