|
|
||||||||
,



* Department of Animal Sciences and
Department of Dairy Science, University of Wisconsin, Madison 53706
Department of Animal and Aquacultural Sciences, Norwegian University of Life Sciences, PO Box 5003, N-1432 Ås, Norway
1 Corresponding author: motta{at}calshp.cals.wisc.edu
| ABSTRACT |
|---|
|
|
|---|
Key Words: Bayesian method mastitis zero-inflated data
| INTRODUCTION |
|---|
|
|
|---|
In some situations, overdispersion may be due to an excessive number of zero counts (Lambert, 1992). In such a case, a zero-inflated Poisson (ZIP) model would be suitable. In a ZIP model, it is assumed that, with probability p, the only possible observation is 0 and, with probability 1 – p, that a Poisson random variable with parameter
is observed. Here, p is the probability of the "perfect" state (i.e., an animal is either resistant or not exposed to the disease), and
is the mean number of mastitis cases in the "imperfect" case (i.e., an animal is liable to the disease).
The main objective of this study was to extend the ZIP model to accommodate correlated genetic effects, and to use this model for genetic analysis of the number of mastitis cases. As an illustration, an application was made to data containing information on the number of cases of clinical mastitis in first-lactation Norwegian Red cows from a previously analyzed Norwegian data set from the early 1990s. Further, the ZIP model was compared with a corresponding Poisson model via a residual and predictive ability analysis.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Only records of daughters with first calving in 1990 through 1992, and from herds having at least 5 daughters of any of the test bulls, were used. For each cow, the number of cases of veterinary-treated clinical mastitis (NCM) in first lactation, from 30 d before calving to either 300 d after calving or culling, whichever occurred first, was counted. Cows having a second calving before 300 d after first calving were excluded from the analysis. The interval between treatments had to be at least 5 d, to avoid counting repeated treatments of the same case as distinct mastitis episodes. About 76% of the cows did not have a case of clinical mastitis during first lactation, whereas 15.8, 5.1, and 1.6% had 1, 2, and 3 cases, respectively. Only 315 cows had more than 3 episodes of clinical mastitis during first lactation.
Discriminating between simple Poisson and ZIP models can be done by testing the null hypothesis, H0: p = 0. Asymptotic tests proposed by Rao and Chakravarti (1956) and van den Broek (1995) were used as part of an explanatory analysis. The Rao and Chakravarti criterion was calculated as
![]() |
where n0 = 27,745 is the number of zeros in the data; n = 36,178 is the total number of observations; and
= 0.34 is the mean number of mastitis cases in the data set. The U statistic is asymptotically distributed as an N(0, 1) variable; the realized value of U = 62.59 leads to rejection of H0. The score test is given by
![]() |
where
= e–y = e–0.34 is the estimate of the probability of having no mastitis cases. The realized value of S = 3917.3 was referred to a
2 distribution with one degree of freedom, leading to rejection of the hypothesis that p is 0. The tests suggest overdispersion in the number of mastitis cases relative to the homogeneous Poisson sampling, and part of this may be caused by the observed excess of zeros. The tests are suggestive, but not conclusive, because they are based on the assumption of a homogeneous population.
An excess of zeros by level of year, month, and age at first calving was also investigated. For each level of these 3 potential explanatory variables, the observed proportion of zeros was compared against a Poisson distribution with parameter equal to the mean number of mastitis cases in the respective level, as shown in Table 1
. An excess of zeros was found for all levels. However, there were some herds and sires without extra zeros according to this criterion. For the sire with the largest frequency of daughters with mastitis, the observed and expected proportions of zeros were 0.64 and 0.84, respectively. A sire pedigree file, with a total of 437 males, was built by tracing the pedigree of the 245 sires with daughters in the data set, through sires and maternal grandsires, as far back as possible.
|
Poisson(
i) with probability 1 – p, with n = 36,178. Note that if p is equal to zero, the setting yields a Poisson distribution with mean
i. The probability of a cow being healthy is
![]() |
and the probability of a cow having k cases of mastitis is
![]() |
Note that p was assumed to be homogeneous, meaning that it is the same for all animals. Let
= (
1, . . .,
n)'. The conditional (given the fixed and random effects) log-likelihood function for the homogeneous p ZIP model is given by
![]() |
where l(
, p | ß, h, s, y) is the conditional likelihood function for the homogeneous p ZIP model. Further,
and p satisfied the models:
![]() |
where ß is a vector of effects of year of first calving (3 levels), month of first calving (12 levels), age at first calving (15 levels; Table 1
), and regression on the log of the number of days elapsed from calving to the end of first lactation (culling or d 300); h is a vector of herd effects of order 5,286; s is a vector of sire transmitting abilities of order 437; and X, W, and Z are the corresponding incidence matrices. Further,
is a vector of residuals, which was assumed to follow the multivariate normal distribution N(0,In
2), where 
2 is the residual variance representing all unaccounted for sources of variation. Part of this unaccounted for variation is because a sire model is used here, so three-fourths of the additive genetic variance was not modeled explicitly (Damgaard et al., 2006). The distributions of herd and sire effects were assumed to be independent multivariate normal with mean zero and covariance matrices I5,286
h2and A
s2, respectively, where A is a known matrix of additive genetic relationships between sires, and
h2 and
s2 are variances of herd and sire effects, respectively, on a logarithm scale. The ZIP model was compared with a Poisson model with the same structure for the Poisson parameter. Inferences about unknown parameters were based on the Bayesian approach, via a Markov chain Monte Carlo (MCMC) sampling algorithm.
Prior Distributions.
Models were developed hierarchically, with all parameters regarded as random variables following some prior distribution. The prior distribution adopted for
* was a normal distribution with appropriate mean and variance 
2. Independent bounded uniform priors were assigned to p* and to each of the elements of ß, that is, p*
U(–5, 5] and ß
U(–50, 50]. As stated above, herd and sire effects were assigned independent normal prior distributions with mean zero and variances
h2 and
s2, respectively. Scale-inverse
2 prior distributions were used for the variances of herd, sire, and residual effects, that is,
h2,
h
h
h–2,
s2
s
s
h–2 and 
2






, respectively, where the degrees of freedom parameters were set to
h =
s = 
= 5 and the scale parameters were assigned the values
h =
s = 
= 1. These values were chosen to achieve vague proper priors for the variance components.
Fully Conditional Posterior Distributions and Sampling Procedure.
The fully conditional distributions of individual parameters were deduced by taking all other parameters as fixed, and absorbing them into the integration constant of the conditional posterior distribution of interest. The fully conditional densities of 
2,
h2,
s2 are scale-inverse
2 with appropriate parameters, and the joint fully conditional density of ß, h, and s is a multivariate normal distribution with appropriate mean and (co)variance matrix (Wang et al., 1993, 1994).
The fully conditional density of each element of
* and the density of p* do not have a closed form, and are given by
![]() |
and
![]() |
respectively, where I(p*; –5, 5) means that p*
(–5, 5). Here, I(.) is the indicator function, which is equal to 1 if the condition is satisfied and 0 otherwise; X'i, Wi, and Zi respectively.
A Gibbs-Metropolis algorithm was developed, based on sampling iteratively from all fully conditional distributions. Location effects, ß, h, and s were sampled from a multivariate normal distribution with appropriate mean and (co)variance matrix, and the 3 variance components were drawn from scale-inverted
2 distributions (Wang et al., 1993, 1994). A Metropolis algorithm was used for all other parameters. A normal distribution with mean equal to
i*(t) at iteration t and an appropriate variance was used as proposal distribution to sample
i*(t+1). Similarly, a normal distribution with mean equal to p*(t) at iteration t and an appropriate variance was used as proposal distribution to sample p*(t+1). The variances of the proposal distributions were chosen to satisfy acceptance rates between 30 and 50% (Gelman et al., 2002), which restrict the acceptance rate to being neither too high nor too low, reducing the chance of a chain getting stuck or moving too slowly in the parameter space.
Convergence Diagnostics.
Visual inspection of trace plots of the MCMC run and diagnostics suggested by Gelman and Rubin (1992) were used to determine the length of the burn-in period and the total number of iterations for the Gibbs-Metropolis procedure. Two chains with overdispersed starting points were used in the method of Gelman and Rubin (1992), which is based on monitoring convergence of the iterative simulation by estimating the factor by which the scale of the current distribution for a parameter under study, say
, might be reduced if simulations were continued for an infinite amount of time. The potential scale reduction is given by
![]() |
where
![]() |
Here, W and B are estimates of the within- and between-chain variances, and J is the number of iterations. R declines to 1 as the number of iterations goes to infinity. There is an indication of convergence when R is close to 1.
Two chains starting from overdispersed values were used, with a total of 500,000 samples, including a burn-in period of 250,000 iterations. A visual inspection of the trace plots (results not shown) suggested that the 2 chains had converged after 250,000 iterations. For the Gelman and Rubin (1992) convergence criterion, the scale reduction factors (R) for the residual, herd, and sire variances were 0.9999996, 1.0001, and 1.000001, respectively; the scale reduction R for p was 1.00001. On this basis, we decided to use only one chain with a burn-in period of 250,000 iterations, and with 250,000 samples collected thereafter for calculating Monte Carlo estimates of posterior features. No thinning of samples was practiced.
Residual Analysis
In the residual analysis, a residual for the record of cow i under a given model was defined as ri = yi – 10 – E(yi|
). The posterior mean of the standardized residual was estimated as
![]() |
with i = 1, . . . , n; J is equal to the number of samples from the posterior distribution, and the expectation and variance correspond to those under model M with parameter vector
. An observation would be unusual if the posterior distribution of ri were concentrated away from zero. Under the Poisson model, E(yi|
(j)) = var(yi|
(j)) =
i. Under the ZIP model, E(yi|
(j)) = (1 – p)
i and var(yi|
(j)) = E(yi|
(j))[1 +
ip], where
i = exp(
I*) and p = exp(p*)/[1+ exp(p*)].
Posterior Predictive Assessment
The adequacy of a given statistical model may be assessed by referring the observed value of some statistic to its sampling distribution expected under this model (Sorensen and Waagepetersen, 2003). If the observed value is atypical, this would be interpreted as evidence against the model. One way of checking whether a model describes the observed data well is to adopt a measure of discrepancy, say T(.), between the observed data and predictions from the model posited (Gelman and Rubin., 1992).
Let yrep be "replicated" or simulated data from model M with parameter vector
, and y be the observed data vector. In the posterior predictive approach to model validation, one can work with T(y|
) – T(yrep|
) and check whether zero is an extreme value in the posterior predictive distribution of T(y|
) – T(yrep|
). Because the parameter vector
is unknown, its values are sampled from the posterior distribution. Likewise, yrep is obtained by sampling from the posterior predictive density
![]() |
Here, p(yrep | y,
, M) = p(yrep |
, M), because yrep (the new sample, given the parameters) is independent of y (the observed data, given the parameters).
Two different discrepancy measures were used, with each of these evaluated at the observed and replicated data. The first measure was based on a sum, across observations, of standardized residuals:
![]() |
with the expectation and variance corresponding to model M. Using the samples from the MCMC procedure, we constructed a scatter plot to measure the discrepancy betweenT1(y,
(l)) and T1(yrep(l),
(l)),where l = 1, . . ., 10,000 is the number of simulated samples. If the model fits, points are expected to fall in a circle centered at (0, 0).
The second measure of discrepancy used was based on:
![]() |
k = 0, 1, 2, . . ., which gives the proportion of cows with k cases of mastitis in a sample of size n. We examined how Dk(l) = T2,k(y) – T2,k(yrep(l)) varied over k, with l = 1, . . . , 10,000 simulated samples, each of size n. Here, T2,k(y) represents the proportion of cows with k cases of mastitis in the observed sample, and T2,k(yrep(l)) represents the proportion of cows with k cases of mastitis in the simulated sample under
=
(l).
Computations were as follows: 1) each element y rep,i (l) of the vector yrep(l) = (yrep(l),1,...,yrep,n(l)) was drawn from a ZIP distribution evaluated at the posterior sample of
i* and p* at iteration l; 2) the statistics T1(yrep(l),
(l)) and T2,k(yrep(l)) were evaluated for each realization of yrep(l). This yielded 10,000 "replicated" standardized residuals and relative frequencies of the distribution of mastitis.
Sire Evaluation
The posterior mean of the probability of producing daughters without any case of mastitis in first lactation is appealing for the genetic evaluation and ranking of sires for selection. Sire transmitting abilities in the logarithmic scale for the Poisson parameters do not lead directly to the probability of sires having daughters without mastitis infection, because the Poisson parameter depends on other effects in the model. Let
![]() |
be the expected Poisson parameter of sire j, with µ being some combination of levels of the ß vector, and
being the average of the posterior means of the herd effects. In addition, let Pj be the probability that sire j produces a daughter without mastitis. Then
![]() |
where
j = exp(
j*) and p = exp(p*)/[1+ exp(p*)]. Sire ranking could be based on the posterior mean of Pj. However, note that Pj decreases monotonically as sj increases. Hence, in a model with homogenous p, sire ranking based on posterior means of Pj and sj are identical, given p.
Correlations between sire evaluations under Poisson and ZIP models were computed by using the posterior mean of the probability of no mastitis for each of the sires as the criterion. The ß-effects were combined by setting calving year to 1992, calving month to August, age at calving to 24 mo, and number of days from first calving to the defined end of the lactation as equal to 300.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
|
(l)) (abscissa) and T1(yrep(l)
(l)) (ordinate) for the ZIP and Poisson models. Both graphs show that realizations for the observed-data statistic cluster around the coordinate (0, 0), indicating a good agreement between observed data and predictions, at least with respect to T1 (.,
(l)).
|
|
In the ZIP model, the herd and sire variances accounted for 37 and 9%, respectively, of the variability of log-Poisson parameters, indicating the relevance of these effects. Posterior distributions of the variance components and of the perfect state probability, p, under the ZIP model are given in Figure 4
. The posterior distributions of the residual and herd variances, as well as of the perfect state probability were nearly symmetric. The posterior distribution of the sire variance had a slightly longer tail to the right. This was expected, because the posterior mean of
s2 was closer to 0 than those of the residual and herd components of variance. There is no obviously useful definition of heritability in this model; hence, posterior distributions of ratios of the variance components were not assessed. However, the between-sires variance accounted for 10% of the variation in
j* parameters, indicating genetic variation for mastitis.
|
The Monte Carlo variances of residual, herd, and sire variances and of the perfect state probability were 3.3 x 10–8, 7.5 x 10–9, 8.6 x 10–8, and 6.1 x 10–9, respectively, indicating small Monte Carlo errors.
Figure 5
shows the distribution of the posterior means of sire transmitting abilities under the ZIP model. Under this model, the sire "transmitting ability" esj for the Poisson parameter was 0.31, on average, whereas Pj averaged 0.82, where Pj is the probability that sire j produces a daughter with no mastitis. The correlation between sire ranks based on posterior means of Pj under the Poisson and ZIP models was 0.98. The rank correlation between the top 10, 20, 30, or 50 out of 437 sires ranged from 0.96 to 0.98. This indicates a strong agreement between sire rankings based on ZIP and Poisson models.
|
| CONCLUSIONS |
|---|
|
|
|---|
The inferred mixture probability was 0.32, giving a weight of 0.32 to the perfect state and a weight of 0.68 to the imperfect state, represented by a Poisson model. Hence, 32% of first-lactation Norwegian Red cows are expected to be in the perfect state and not contract mastitis at all, either because of complete resistance to the disease or because they are not exposed to it.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
Received for publication December 27, 2006. Accepted for publication July 12, 2007.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
J. E. Vallimont, C. D. Dechow, C. G. Sattler, and J. S. Clay Heritability estimates associated with alternative definitions of mastitis and correlations with somatic cell score and yield J Dairy Sci, July 1, 2009; 92(7): 3402 - 3410. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Perez-Cabal, G. de los Campos, A. I. Vazquez, D. Gianola, G. J. M. Rosa, K. A. Weigel, and R. Alenda Genetic evaluation of susceptibility to clinical mastitis in Spanish Holstein cows J Dairy Sci, July 1, 2009; 92(7): 3472 - 3480. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. I. Vazquez, D. Gianola, D. Bates, K. A. Weigel, and B. Heringstad Assessment of Poisson, logit, and linear models for genetic analysis of clinical mastitis in Norwegian Red cows J Dairy Sci, February 1, 2009; 92(2): 739 - 748. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |