|
|
||||||||
Department of Animal Health, Welfare and Nutrition, Danish Institute of Agricultural Sciences, PO Box 50, DK-8830 Tjele, Denmark
1 Corresponding author: Mizeck.Chagunda{at}agrsci.dk
| ABSTRACT |
|---|
|
|
|---|
Key Words: biological model mastitis risk indicator in milk
| INTRODUCTION |
|---|
|
|
|---|
A number of mastitis detection models have been developed previously with varying degrees of accuracy and automationfor example, models based on electrical conductivity (Cond), milk yield, and milk temperature (de Mol et al., 1999; de Mol and Ouweltjes, 2001; de Mol and Woldt, 2001). However, models are currently lacking that combine time-series measurements of an indicator with other known mastitis predisposing factors of direct physiological relevance to the status of udder health throughout the lactation. The objective of the current study was to develop a biological model for early detection of mastitis for individual cows in a dairy herd based on inline real-time measurements of an indicator in milk and additional mastitis risk factors. Such a model, with focus on early mastitis detection, allows timely follow-up and thus more targeted prevention and treatment strategies. This paper presents a model using the milk enzyme L-lactate dehydrogenase (LDH) as an indicator, although in principle the model would apply to any mastitis indicator measured in milk.
| MODEL DESCRIPTION |
|---|
|
|
|---|
The outputs of the model are 1) the overall risk of acute mastitis (AcuteRisk), 2) the relative degree of chronic mastitis (ChronDeg), and 3) a calculation of when to take the next sample (days to next sample; DNS), which is designed to feed back to the sampling device. In the model, the 3 outputs are generated by separate equations. Apart from the inputs, the outputs, and the equations described hereafter, the model includes a set of constants, the values for which are given in Tables 1
and 2
. The constants are based on the literature and on expert opinion.
|
|
Model Inputs
The model is based on the following inputs: cow identification number, number of days from calving (calving date), parity, LDH, milk yield, udder characteristics, herd-level mastitis, disease recordings, and Cond where available. Not all inputs trigger a new run of the model; the model is triggered by LDH, Cond, and disease incidences. The time of occurrence associated with each of the inputs is required. Cow identification number, number of days from calving, and parity are self-evident and are not discussed further. The product of LDH and milk yield is the main component of MIBR, whereas milk yield acceleration (MYAcc), udder characteristics, herd-level mastitis, disease recordings, and Cond are the main components of ARF. The elements of the 2 overall inputs, MIBR and ARF, are described in detail in the following sections.
Elements of MIBR
The MIBR is based on measurements of LDH activity in milk (µmol of product/min per L). A strong, positive correlation has been found between LDH activity and SCC in milk samples from clinically mastitic cows (Bogin et al., 1977; Harmon, 1994). Our own study of LDH activity, measured at every milking from 197 cows for a period of 8 mo, showed a correlation of 0.8 between LDH activity and SCC in naturally occurring mastitis and 0.5 in healthy cows (Chagunda et al., 2006). An important issue for the use of milk measurements, including LDH, concerns the effects of dilution. Assuming that the amount of LDH activity (µmol/min) is a function of the amount of epithelial and somatic cell damage (Bogin et al., 1977; Kitchen et al., 1980; Kato et al., 1989), then the measured LDH activity (µmol/min per L) will depend on milk yield. In addition, mastitis usually occurs in only one quarter at a time, and the milk from that quarter is diluted by milk in the other 3 quarters. A further complication is that the ratio of the amounts of milk produced by the infected and healthy quarters is also affected by mastitis. To minimize these effects, and especially because it is envisaged that inline milk measurements will be made at the cow level rather than at the quarter level, the indicator variable in the model is the amount of LDH (µmol/min), which is the product of LDH activity (µmol/min per L) and milk yield (L).
Filtering and Smoothing the LDH Amount Time Series.
To reduce random noise in the time series, the amounts of LDH used in the biological part of the model are smoothed values. A biometric module was used to generate these smoothed values. This was an extended Kalman filter, using a local linear growth model with outliers (e.g., Smith and West, 1983). The smoothed values (called Level, µmol/min) are posterior mean estimates of the true level in this model. The assumptions made in the local linear growth model, the estimation of parameters, and the implementation of the extended Kalman filter are described in Korsgaard and Løvendahl (2002). In brief, this being a dynamic linear model for time-series data, it estimates the state of any new milk LDH value based on data from the previous time step and the current measurement. Hence, any new data point may belong to 1 of the 4 groups: normal evolution, outlier, slope change, and level change. The biometric model generates estimates of the probability that any given LDH observation belongs to one of these groups and smooths the time series accordingly. The prior probabilities of belonging to a particular group and the variances associated with the different models, as well as the parameters associated with the initial state, were estimated from a data set of daily milk LDH values collected from 100 cows over a 6-mo period. These parameters were treated as known when producing Level parameters of LDH. The extended Kalman filter generates smoothed values of varying accuracy, depending on the degree of back smoothing used. Zero-step back smoothing provides posterior mean estimates of LDH activity level, that is, estimates of LDH activity level at time t from samples up to and including time t. One-step-back smoothing provides estimates of LDH activity level at time t 1 from samples up to and including time t. Clearly, using back-smoothed values implies accepting a lag in availability of level estimates (producing a one-step-back-smoothed value requires a subsequent sample to have been taken). This process can, in principle, be extended with 2-step smoothing, 3-step smoothing, and so on back. However, this was not considered because the gains in accuracy diminish relative to the associated lag in availability. Because the optimal trade-off between time lag in availability of smoothed values and accuracy of the smoothed values may vary according to local conditions (e.g., frequency of milk sampling), the biometric module was implemented so that either zero- or one-step-back-smoothed values could be used in the biological component of the model. The rate of change in Level, LDH slope (Slope, µmol/min per d), was calculated from the difference between consecutive Level values using a 3-d rolling average with exponential weighting.
To distinguish between the short-term changes in the amount of LDH symptomatic of acute mastitis and the long-term changes associated with chronic mastitis, the underlying trend in the amount of LDH is calculated. This underlying trend in the amount of LDH is called Stable. Stable is calculated as a rolling average of Level (µmol/min) for each cow over an interval of 7 d. A default value of Stable at calving is needed to calculate the risk due to Level. The default Stable value is set low (2 µmol/min) and is diluted out as actual Level data accumulates.
Calculating MIBR.
As shown in Figure 1
, MIBR is the sum of the risk due to Level (RiskLevel) and the risk due to Slope (RiskSlope). The assumption in the model is that the greater the Slope, the greater the risk of acute mastitis. The equation for calculating the risk due to slope is
|
![]() | [1] |
where SignSlopeChan gives the sign of the slope change (current Slope previous Slope), returning a value of +1 for slope changes
0 and 1 for slope changes <0; PSlope is the probability of a slope change calculated by the extended Kalman filter in the biometric module; MaxSlope is a constant giving the value of the slope that will give a risk of 1. The term signSlopeChan x PSlope prevents RiskSlope from being increased by a high probability of a slope change when the associated change in slope is negative (i.e., PSlope only increases RiskSlope when Slope is increasing).
Risk Due to LDH Level (Level).
It is assumed that the higher the Level relative to the stable level (Stable), the greater the risk of acute mastitis. Stable is used as a baseline to facilitate the differentiation between acute and chronic mastitis; it assumes that the increase in LDH caused by an acute case is independent of the underlying stable level. The equation for the RiskLevel is
![]() | [2] |
where signLevChan gives the sign of the level change (Level LastLev), returning a value of +1 for level changes
0 and 1 for the level changes <0, PLevel is that probability of a level change calculated by the extended Kalman filter, and MaxLevel is a constant. LastLev is the value of Level from the preceding model run. Level Stable is not allowed to be negative.
Elements of ARF
When considering which factors to include as ARF for mastitis, an important criterion was that the risks accounted for by ARF were not already included in the MIBRthat is, any factor that directly affects LDH has not been included as an ARF. In the event that a potential risk factor has an effect on LDH and also an additional effect, the 2 effects are distinguished and only the additional effect of the factor is included in the ARF.
Having identified those aspects of mastitis biology (e.g., teat canal defenses, infection pressure) that should be included, a further important issue in implementing these ARF was that they should be based on measures that can reasonably be expected to be available on farm now (or in the near future). For model to be of practical use, in some cases we have used inputs that are not state-of-the-art, thereby retaining applicability. The ARF included in the current model are MY-Acc, milking duration, udder characteristics, herd mastitis level, current lactation disease history, and quarter level conductivity. Details of these factors are described in the following sections. The ARF model architecture is presented in Figure 2
.
|
![]() | [3] |
where MaxAcc is a scaling constant that gives the level of acceleration, which will return a risk of 1.
Duration of Milking.
Duration of milking (Milk-Dur) is used as an index reflecting the negative effect of machine milking on teat integrity. The underlining assumption is that the longer the duration, the higher the risk of mastitis. This assumption has been confirmed by several researchers: Machine milking can result in congestion and edema of the teat tissue, especially at the teat end, and can also influence teat diameter, penetrability of the teat canal, and teat defense mechanisms (Bramley and Dodd, 1984; Hamann and Duck, 1984). Further, machine milking can transmit pathogens, both passively from the teat skin of one cow to the teat cistern of another, and actively by dispersing them inside the udder (Hamann, 1987). In the model, the risk of mastitis due to duration of milking (Risk-MilkDur) is calculated as
![]() | [4] |
where MilkDur is the duration of milking (s), and Max-MDur, MDurR, and MDurT are constants. Given the constant values in Table 1
, RiskMilkDur increases from 0.011 to 0.24 and 0.37 as MilkDur increases from 240 to 480 and 720 s (the maximum risk, given by Max-MDur, is 0.4).
Udder Characteristics.
The purpose of these inputs is to index the cows inherent susceptibility to mastitis caused by the physical characteristics of the udder that relate to teat integrity (Hamann, 1987). Included in the model are teat length, udder depth, teat leakiness, and teat canal diameter. Teat length, udder depth and teat leakiness are handled as binary traits. Thus, the user can input short teats, low udder, and leaky teats (LeakyT). These increase the risk of mastitis attributable to udder characteristics (RiskUdd) by 0.1, 0.05, and 0.15, respectively (Table 1
). Because these characteristics are not expected to change on a short time scale, once input they are retained for the rest of the lactation. If a given cow is not recorded as having short teats (ShortT), then ShortT = 0, but if she is recorded as having ShortT, then ShortT takes the value given in Table 1
. The same applies to LowUdd and LeakyT.
Teat canal diameter is not a measure that is commonly made on farm; therefore, we have chosen to use peak milk flow rate (PeakMF) as a proxy for teat canal diameter. Teat canal diameter does not change markedly through lactation, although PeakMF varies with milk yield (Baxter et al., 1950; Giesecke et al., 1972). To minimize disturbance caused by milk yield, we chose to use the maximum recorded PeakMF within a given lactation. This is obtained by retaining the maximum recorded PeakMF as the lactation progresses. A default maximum PeakMF of 50 mL/s is assumed at the start of lactation. The retained maximum recorded PeakMF is used to calculate the associated risk. This has the practical advantage that a value is available from the start of lactation. Because this routine is vulnerable to noisy measurements, the input PeakMF used are the smoothed outputs of the Kalman filter. The PeakMF (mL/s) is converted to a risk factor (PeakMFRisk) by the following equation:
![]() | [5] |
where MaxPMF (maximum PeakMF), PMFR (PeakMF rate) and PMFT (T value for PeakMF) are constants. The greater the PeakMF, the greater the PeakMFRisk, and given the values of the constants in Table 1
, PeakM-FRisk = 0.04, 0.18, and 0.27 when PeakMF = 40, 60, and 80 mL/s, respectively.
The combined risk due to udder characteristics and PeakMF (RiskUdd and PeakMFRisk) is
![]() | [6] |
Herd Mastitis Level.
Given that mastitis can be either environmental or infectious and also that some types of mastitis pathogens are more virulent than others, the impact of mastitic cows within the herd on the health of an individual cow varies (Bramley and Dodd, 1984; Gröhn et al., 2004). However, in general, the higher is the number of mastitic cows in the herd, the higher the infection pressure on any given cow in the herd (Bramley and Dodd, 1984). The equation for the risk from herd-level mastitis (RiskHerd; see Equation 10) given here allows a general representation of this risk suitable for the situation in which no information about pathogen types is available. Examples of how this might be modified for specific pathogens are given by Østergaard et al. (2005) and Seegers et al. (2003). Risk from herd-level mastitis is derived from 3 components: the relative level of LDH in the herd (Herd-RelLDH), the potential for spread because of cow density (Spread), and the longevity of the bacteria (BacLife).
The herd mastitis level is calculated by combining the individual mastitis burdens of the cows in the herd, which are assumed to be reflected in the stable LDH amounts (Stable). The HerdRelLDH is then calculated as
![]() | [7] |
where HerdMast is the median value of Stable in the herd, and ChronRef is the expected level of LDH caused by chronic mastitis, given the parity and stage of lactation profile of the herd.
Calculating the likelihood for Spread is potentially complicated because it involves considering a number of possible interactions between healthy and infected cows, such as herd size and the proximity of cows, which itself includes stocking density, housing design, and management practices (Østergaard et al., 2005). For simplicity, we have chosen to approximate Spread using a Gompertz function. This recognizes that the underlying relationships are not linear while at the same time providing a value bounded between 0 and 1. Clearly, this aspect of the model could be substantially expanded if users deemed it worthwhile; the current function provides a point of entry for such a development. The function for Spread is
![]() | [8] |
where SpreadR and SpreadT are constants (Table 1
), and Herdsize is an index of cow density. Under normal housing conditions in a yard (a barn of fixed area), Herdsize would simply be the number of cows. If the average stocking density were changed markedly from the norm (e.g., cows going out to graze), then Herdsize could be the number of animals multiplied by a proportion reflecting the difference in average stocking density. With the constant values given in Table 1
, Herdsize of 50 and 100 result in Spread values of 0.36 and 0.87, respectively.
The longevity of bacteria is clearly pathogen specific as well as being affected by environmental factors such as local temperature and humidity. If such inputs were readily available, then BacLife could be calculated in a more sophisticated manner than that included in the present model. Given that seasonal variations generally affect most bacteria of importance in the same way (Ribeiro et al., 2001), BacLife reflects the effect of season on bacteria survival (i.e., bacteria survive longer in warm, wet conditions than in cold, dry ones). The longevity of bacteria is a cyclic function of the current date (RunTime), ranging from 0 to 1, calculated as
![]() | [9] |
where DayInYear is the number of days between runtime and newyearsday, where newyearsday is the first day of the year. Hemisphere is an offset to adjust the equation to different regions of the globe, with the northern hemisphere being 1 and the southern hemisphere being +1.
The 3 herd components are combined to give mastitis risk caused by RiskHerd, as follows:
![]() | [10] |
where HerdRelLDH is the relative level of LDH in the herd, BacLife reflects the effect of season on bacteria survival, and Spread is the potential for mastitis spread caused by cow density. The Gompertz function applied to Spread converts it to a 0 to 1 scale.
Current Lactation Disease History.
The physiological background for this is the fact that some diseases and disorders often increase the risk of mastitis (Hillerton et al., 1995; Hamann and Krömker, 1997). Several diseases have therefore been included as ARF for mastitis (DisRisk). Two classes of diseases or disorders are recognized: infections and noninfections. The infections are metritis, teat injury, and acidosis, and the noninfections are retained placenta, ketosis, and milk fever. For any given case, the severity of the disease (DisSev), ranging from 0 to 1, can be input. The severity of the disease could, for example, be grouped in 1 of 3 categories of mild (0.3), average (0.6), or severe (0.9). The constants used in the model when any of these diseases are present are given in Table 2
. It is assumed that the effect of a disease is greatest on the day of occurrence and that this effect subsequently declines as a function of days since occurrence (DisDays) to 0. The function used to calculate the risk of mastitis caused by a given disease (DisRisk) is
![]() | [11] |
where Dis- is replaced by the relevant disease name (see Table 2
). The period between the model RunTime and date of disease occurrence is DisDays. Because at any one time more than one DisRisk may be in operation, the disease effects are combined such that within each class of disease (infections vs. noninfections), the greatest DisRisk is chosen (i.e., it is assumed that within-class disease risks are not additive). The overall DisRisk is the sum of the highest DisRisk in the infections class and the highest DisRisk in the noninfections class. For each disease, the constant MaxDis gives its effect on the ARF on the day of occurrence. The constants DisR and DisT control the rate of decay of the disease effect with time from occurrence.
Quarter-Level Conductivity.
Electrical conductivity has been used as the state variable for detecting mastitis in other models (De Mol et al., 1999). It can therefore be expected to have a reasonable degree of correlation with the amount of LDH, and thus the MIBR. Despite this fact, Cond measures (when available) are utilized as part of the ARF. This is because Cond has 2 useful attributes: It is inexpensive in terms of cost per measurement, and it is measured at the quarter level. We take advantage of the fact that when it is available, it is usually recorded at every milking and thus can be used to trigger a new LDH measurement (i.e., it can influence DNS). The main use of quarter-level Cond on ARF is to identify the quarter that is mostly likely to be infected. The input of Cond in the model is an interquarter ratio of the Cond values from a milking. The interquarter ratio is the ratio between the highest and the lowest quarter value within cow and milking. Interquarter ratio has been found to perform better in classifying both clinically and subclinically mastitic cows than the absolute Cond values (Norberg et al., 2004). In recognition of the likely correlation between LDH and Cond, the risk of mastitis due to Cond (RiskCond) is given a relatively low weight in the ARF by utilizing a very high value of an adjusting constant, MaxCond, in the following equation:
![]() | [12] |
Calculating ARF.
The elements that make up the ARF, described in the previous sections, combine to give ARF as follows:
![]() | [13] |
where RiskAcc is the risk attributable to acceleration in milk yield, RiskMilkDur is the risk attributable to milking duration, RiskUdd is the risk to udder characteristics, RiskHerd is the risk attributable to herd characteristics, DisRisk is the risk of mastitis attributable to other diseases, and RiskCond is the risk attributable to conductivity.
Model Outputs
The model produces 3 outputs. These are output risk of acute mastitis (AcuteRisk), DNS, and degree of chronic mastitis (ChronDeg).
AcuteRisk.
The AcuteRisk presented to the user is generated from the combination of MIBR and ARF as follows:
![]() | [14] |
where MIBR is the indicator-based risk and ARF is the risk caused by the additional risk factors. The ARFW is a scaling factor (= 0.25; Table 1
) to weight the ARF relative to the MIBR. The current model is not based on the principle that the 2 risks (MIBR and ARF) generating AcuteRisk should add up to 1. The indicator alone should be able to generate a risk of 1 because it can be envisaged that in many situations, no additional information (and therefore no ARF) is available. The model has been designed to run under such conditions; this also reflects the principle that the indicator is the primary information source in the model. However, because there are well-known predisposing factors for mastitis, the model allows their inclusion as ARF so that susceptible cows are more readily identified when their LDH levels increase. The weighting (to 25%, i.e., ARFW = 0.25) to ARF controls the relative value of this information as opposed to the indicator (i.e., a greater proportion of AcuteRisk should come from MIBR other than ARF). Ultimately, the sum of MIBR and (ARF x ARFW) is scaled, making the value of AcuteRisk between 0 and 1.
DNS.
This parameter is designed to make the best use of opportunities afforded by automated, real-time, inline sampling technology. It is designed to feed back to the sampling system so that the frequency of milk sampling (i.e., next analysis of LDH for a particular cow) can be varied according to the calculated risk of acute mastitis. In this scenario, it is desirable for sampling frequency to be increased for a given cow when the risk of mastitis is high and vice versa. Consequently, in the model, an increased risk of acute mastitis causes the DNS to be reduced from a default value (DNSdef). This equation is
![]() | [15] |
where MIBR is the indicator-based risk, ARF is the additional risk factor; signLevChan is the sign change in the difference from the current value of Level and the previous value of Level; POutlier is the Kalman filter-derived probability that the current LDH measurement is an outlier, and ZM is an adjustment factor for Cond. Indicator-based risk and ARF are included separately in the equation rather than the combined AcuteRisk value so that if either is high, DNS decreases, regardless of their combined value. In addition to the effects of a high MIBR or ARF, if the latest LDH value has a high probability that it is a positive deviation, or outlier, from the normal time series (signLevChan x POutlier), then another sample should be taken quickly. Also, if there is a high Cond measurement, a follow-up sample should be taken quickly.
ChronDeg.
The model also estimates the degree of chronic mastitis. The importance of this lies in the effect of chronic mastitis cases on milk quality. To do this a relative ranking of the cows with respect to ChronDeg is calculated. Because ChronDeg is expected to vary with breed and parity (Harmon, 1994; Sloth et al., 2003), it is calculated relative to a breed-parity reference value to allow comparison between cows across parities. From our own analysis, Chagunda et al. (2006), the relationship between LDH and logSCC was found to be the same across breeds and parities, allowing us to make use of logSCC reference values (logSCCRef) together with reference values for milk yield (RefMY) to generate ChronRef. Degree of chronic mastitis is calculated as
![]() | [16] |
where ChronRef = SCCtoLDHcept x RefMY + SCCtoLD-Hslope x logSCCRef, and SCCtoLDHcept = 8.58 (intercept of LDH vs. logSCC regression), SCCtoLDHslope = 2.38 (slope of LDH vs. logSCC regression), and logSC-CRef is a breed-parity reference value for the absolute amount of logSCC. The regression coefficients for converting SCC to LDH were derived from Larsen (2003), who used a set of 456 milk samples with both LDH and SCC available; the values of SCC ranged from 10,000 to 16,097,000, and the R2 of the regression was 54.3%. The breed-parity RefMY is derived as
![]() | [17] |
where 305d MY is an input representing the expected yield of healthy, well-fed parity 3 cows. ParMYFac for parities 1, 2, and 3 are 0.78, 0.94, and 1, respectively (Friggens et al., 1999).
| MODEL FUNCTIONALITY |
|---|
|
|
|---|
Testing Model Logic
Test data were generated for 2 simulated data sets, one representing an acute case of mastitis and the other representing a chronic case of mastitis. Further, additional disease incidences were introduced in the chronic mastitis case to test the effect of disease on the ARF. The acute mastitis data set was called Tanja, whereas the data set representing a chronic mastitis case was called Anita.
Acute Mastitis Test Data Set (Tanja).
The description of this acute mastitis case, which had 2 episodes of increasing LDH, is shown in Figure 3a
. The data simulated 2 mastitis occurrences, the first on d 16 and the second on d 33 after calving. From d 2 to 12 after calving, the amount of LDH was low, ranging from 3 to 37 µmol/min. From the second milking of d 12 after calving, LDH increased to 243 µmol/min on d 15 after calving. This was the first episode of an LDH increase. The LDH dropped to about 90 µmol/min on d 21 after calving, and the amount stabilized at that level until d 26 after calving. From d 26 after calving, LDH increased again, this time to an even higher level of 311 µmol/min on d 31 after calving. Thereafter, LDH dropped again and stabilized around 170 µmol/min.
|
|
Model Validation Using Real Data
Cows and Milk Samples.
The data originated from a research farm at the Danish Cattle Research Center, Foulum, Denmark. For the purpose of the current analysis, full lactation profiles of 100 cows (initial data set of 76,257 records from every milking for a period from September 2003 to February 2005) were selected at random. Fifty cows had a mastitis treatment record and the other 50 cows had no mastitis treatment. The cows were from 3 breeds of Danish Holstein, Danish Red, and Danish Jersey. At the farm, all cows were milked with an automatic milking system (3 units, on average 2.3 ± 0.83 milkings/cow per d) in which the milk yield was automatically recorded. During each milking, a proportional sample of composite milk was collected automatically from each cow in 10-mL tubes. The tubes contained a concentrated solution of Bronopol (2-bromo-2-nitro-1,0.3 propanediol) to reach 200 ppm (wt/vol) in the filled tube. The automatic milk sampling system was emptied of samples in the morning and in the afternoon. Milk samples were kept at 4°C until laboratory analysis, which was done within 24 h of sampling.
Milk Analysis.
In the laboratory, samples were distributed from the 10-mL tubes to 96-well plates using a Biomek 2000 (Laboratory Automation Workstation, Beckman Coulter, Fullerton, CA) and analyzed for LDH activity in a spectrophotometer/fluorometer (FluoStar, BMG Labtech GmbH, Offenburg, Germany). L-Lactate dehydrogenase activity was analyzed by a fluorometric, kinetic method as described by Larsen (2005) at the Danish Institute of Agricultural Sciences. The accuracies in the laboratory (relative bias) obtained in the present material were 2.7 and 4.9% for low and high controls, respectively. Intra-assay precision (CV%) was 8.6 and 3.7% for low and high controls, respectively, and interassay precision was 15.8 and 10.4%, respectively. The units used in the assay were micromoles per minute per liter. Somatic cell counts were measured at a commercial laboratory (Sønderjysk Kontrolforening, Vojens, Denmark) using Fossomatic 5000 automatic equipment (Foss Electric, Hillerød, Denmark). An example of data from a randomly selected real cow is presented in Figure 5a
, with the results of the model run presented in Figure 5b
.
|
400,000 cells/mL. This is in accordance with Hillerton (1999), who pointed out that when the cell count at cow level exceeds 400,000 cells/mL, the infected quarter has a cell count of 1,000,000 cells/mL or greater, irrespective of a yield depression. A veterinarian following the same protocol as consistently as possible performed all veterinary treatments. The aforementioned definition of a clinically mastitic cow was used as the "gold standard" against which sensitivity was calculated. In this data set, repeat mastitis occurrences recorded within an 8-d period were treated as one occurrence, whereas cases occurring after a gap greater than 8 d since any previous mastitis record were treated as new mastitis cases. This was done as recommended by the IDF (1997). To concentrate on mastitic periods within the lactation, a 15-d window10 d before and 5 d after a mastitis occurrencewas created relative to the recorded mastitis occurrence. If fewer than 3 LDH records existed within the 15-d window, the case was excluded from being used in the model testing.
Model Validation.
Three main criteria for testing the model were used: the proportion of healthy cows the model identified as healthy (specificity), the proportion of clinical mastitis cases the model detected as mastitic, and an indication of how early the model detected mastitis relative to the treatment records. A threshold of 0.7 for AcuteRisk was used to classify the mastitis risk values generated by the model into negative or positive mastitis detections, allowing calculation of model specificity and sensitivity. Specificity was calculated to identify the proportion of cows the model would classify as being without mastitis (true negatives) within the group of healthy cows (false positives and true negatives). Specificity was calculated as follows:
![]() | [18] |
Sensitivity (the proportion of mastitic cows detected by the model) was calculated within the mastitic group as follows:
![]() | [19] |
Along with the values of specificity and sensitivity, predictive values were calculated as either the proportion of true positives among the apparent positives, or the proportion of true negatives among the apparent negatives. However, it is important to note that the model does not actually produce mastitis indications in terms of positive and negative indications but rather produces a continuous risk of mastitis. The threshold used here was merely for model testing purposes. To determine how early the model would detect mastitis, the first rise of AcuteRisk beyond 0.7 within each mastitis window was taken to be the earliest time at which the model detected the mastitis incidence. Other threshold values of 0.6 and 0.8 were also tested.
| RESULTS |
|---|
|
|
|---|
Results from the model runs using the DNS function, which allowed the model to self-regulate sampling frequency, are presented in Figure 3c
. When the model was run to self-regulate the sampling frequency, results were similar to those when the model was run using a full data set. Using the DNS function, the model identified the same mastitis cases as in the full data set but used only 72% of the number of "samples" in the full time series.
The results of the model runs on the data set representing a chronically mastitic cow (Anita) indicated that the model was able to identify not only acute cases of mastitis but also chronic ones. In Figure 4b
, one can see that the model initially increased the AcuteRisk as a result of the persistently increasing amount of LDH (as shown by the ChronDeg line) relative to the rising baseline (Stable) up until d 12, after which it gradually dropped, even though LDH was still increasing. Although the simulated case was not an incidence of acute mastitis, the initial rate of increase of LDH was sufficient for the model to return a high AcuteRisk value. Degree of chronic mastitis persistently increased to about d 25 after calving and remained high thereafter, indicating an increasing risk of chronic mastitis. As expected, DNS decreased with increased AcuteRisk and increased otherwise.
When there was an input of 2 diseases (metritis and teat injury) in the chronic mastitis case, the results indicated an increased AcuteRisk and a reduced DNS. On d 12 and 35 after calving (Figure 4c
), which were the days when these diseases were registered, the disease effect in both cases pushed the AcuteRisk up for some days after the disease registration. As can be noted in Figure 4c
, metritis had a short and not-so-prominent effect on AcuteRisk, whereas teat injury had a noticeably higher and more persistent effect. This was expected, because in the model constants (Table 2
), teat injury had higher values than metritis (i.e., MaxDis of 0.9 vs. 0.5 and DisR of 21 vs. 10).
The consequences of adding random variation (noise) to the LDH values are presented in Table 3
. When 1 rSD of noise was added, the model indicated an average of 1.1 extra periods in which the acute rise exceeded 0.7. The extra mastitis risk periods increased by an average of 4.7 and 8.8, with noise levels of 2 and 3 rSD, respectively. However, the model did not miss any of the 2 true mastitis incidences, with noise levels of 1 and 2 rSD. When 3 rSD of noise were added to the amount of LDH, the model missed on average 0.1 of the true mastitis cases. With increasing noise in the LDH values, there was a relatively small shift (0.4 d) in days to detection of mastitis up to 2 rSD added noise. At the highest level of noise addition (3 rSD), there was a numerically significant delay (0.8 d) in detection of true mastitis cases. The interassay precision for the low control for analysis of real LDH values in the laboratory was 15.8%. Over the equivalent range in LDH, the simulated noise from adding 1 rSD equated to a coefficient of variation of 27.4%. This implies that the model is robust to a level of noise substantially greater than expected from analysis of uncertainties in real LDH data.
|
0.7) and treatment records for the real data of naturally occurring mastitis are presented in Table 4
|
| DISCUSSION |
|---|
|
|
|---|
When random noise was introduced into the indicator values, the model was robust to an amount of variation that was greater than the noise expected from the interassay variation for the measurement used here, LDH (see also Bogin et al., 1976; Harmon, 1994). The model was stable in identifying the "true" mastitis cases, although the onset of a high mastitis risk period before a defined mastitis case increased by 0.36 d with 2 rSD added noise (Table 3
).
The results of the model runs based on real data and using LDH as the indicator in milk indicate that the model is able to detect true mastitis incidences and is also able to correctly identify cows that do not have mastitis. On average, the model detected mastitis 3.5 d earlier than the day of treatment. Using a threshold of 0.7 for the model output risk, the specificity from the current model (99%) was higher than those reported by Hillerton (2000) of 70% for temperature and 95% for a combination of Cond, temperature, and milk yield, but was similar to that reported by De Mol and Ouweltjes (2001). The sensitivity values of 82% for clinical mastitis obtained from the current model are generally higher than those reported previously. For example, De Mol and Ouweltjes (2001) reported sensitivities of between 66 and 67% from a model based on Cond. Hillerton (2000) reported sensitivities in automatic systems for the following indicators: NAGase, 60%; Cond between 70 and 80%; temperature, 50%; milk yield between 20 and 40%; and combined Cond, temperature, and milk yield between 85 and 90%. Although failure to detect about 20% of true acute cases in absolute terms and also the variation of AcuteRisk within and outside the mastitis window would be problematic in commercial situations, one major focus of the current model is on early warning and allowing a range of different strategies according to a continuous scale of risks. The gain in this case is that the model generates AcuteRisk based on repeated measurements, hence indicating a trend in the direction of real-time mastitis risk as opposed to a one-off measurement of the mastitis indicator. However, it is important to note that the test reported here was carried out on one farm and remains to be validated in the field across several farms. Nevertheless, the fact that the current results were obtained from a farm under an automatic milking system, which is normally associated with a high degree of variation, indicates that the current model offers promising prospects in the more stable environment of conventional milking systems.
The specificities and sensitivities obtained depend on the criteria applied for classifying a mastitis treatment record as "true" mastitis or not. This indicates the importance of recording precision for reference values in specificity or sensitivity testing. Further, the values obtained also depend on the value of model output risk chosen as the threshold for classifying these risks as mastitis or not. As illustrated by Norberg et al. (2004), altering the threshold allows widely varying values of specificity and sensitivity to be obtained. The limitations of this form of test are clear, especially when one considers that mastitis infection is not a binomial trait but rather a continuous one because both the infection pressure and degree of response are on continuous scales. For these reasons, we have chosen not to present the model output as "alarms" but rather to present the output as a continuous risk ranging from 0 to 1. We believe that this provides the end user with more information (which can then be collapsed into alarms should the end user wish).
The current model was illustrated using LDH as the main indicator measured in milk. Several studies have shown that the activity of LDH in individual milk samples has proved to be an important indicator of bovine mastitis (e.g., Larsen, 2005; Chagunda et al., 2006). As biosensor assays for enzymes like LDH in milk are now becoming available, they provide an opportunity for automated, real-time, inline mastitis detection. Although this is the case, within the current architecture the model could be implemented with any mastitis indicator measured in milk. The design of the model to incorporate ARF would allow several indicators to be combined. Indeed conductivity is used in this way. However, when combining information or indicators, care must be taken to avoid double counting and contraindications. Only a limited number of the risk factors for mastitis described in the literature were included in the model as ARF. There were 2 reasons for this: We were trying to achieve the simplest adequate description, and we wanted to avoid double counting caused by correlations between risk factors. Both are important for achieving robust and widely applicable models. For example, traits such as the lifetime number of mastitis incidences have previously been used as a single measure of the cows susceptibility to mastitis. However, mastitis is both an infectious and an environmental disease. Thus, to some extent we can expect the lifetime number of mastitis incidences to reflect the disease pressure placed on the cow by the environments she has been in. Lifetime number is therefore the result not only of that cows susceptibility, but also a number of other factors that are not inherent to that cow. For this reason, lifetime number was not chosen as a predictor of the cows susceptibility even though in the herd where this model was being used, it had the practical advantage of being automatically generated if the risk of mastitis output was used to generate disease incidences. Other factors that have been suggested to increase the risk of mastitis (Harmon, 1994; Peeler et al., 2000) were not included in the model. One such example is negative energy balance. It is expected that poor energy status will, among other things, increase the risk of mastitis. However, energy balance is difficult to measure in practice, and the literature evidence is not clear-cut for an effect of energy status independent of other effects with which it is correlated in epidemiological-type studies. We therefore chose instead to use MYAcc as an indicator of physiological stress (Kronfeld, 1976; Knight et al., 1999; Ingvartsen et al., 2003), which is both biologically relevant and easy to apply in practice. The list of other diseases included as mastitis risk factors is also not exhaustive. Some other diseases (e.g., lameness) have not been included in the current model. This is mainly because of a lack of information in the literature on the direct influence of such diseases on mastitis. However, the indirect influence of lameness on mastitis through milking duration and milking interval has been accommodated in the model. Further, the architecture of the model allows the inclusion of other diseases once information about their direct influence on mastitis is available.
Any model is always a trade-off between including all known effects in a complex model and including fewer, readily available inputs in a simpler model. By including a number of optional inputs of biological risk factors, we believe this model can be adapted to a wide range of circumstances. Despite making extensive use of the literature, we expect that the values of some of the constants presented in this paper will need adjusting to optimize the performance of the model. The model is not expected to perform equally well in all circumstances. In particular, differences could be expected in, for example, infection pressure when different mastitis pathogens are dominant in different localities. Further, various mastitis agents differ in their pathophysiological pathways; thus, the model may behave differently depending on the mastitis-causing agent. The results obtained could therefore differ if bacteria-specific mastitis types were considered, and the extent to which this is so should be quantified by further research. Given the built-in flexibility of the model, we expect that it could easily be extended to deal with such cases. Because the model utilizes biological "rules" and logic in its underlying architecture, we believe it is an improvement on existing mastitis models and will make a useful contribution to dairy cattle management. An indication of the costsbenefits of a system such as the one described in the current paper would be informative on the practicalities of such systems. However, this is beyond the scope of the current paper, which focuses on model description and functionality. Economic implications of such a system have been covered by Østergaard et al. (2005). The current model is one of a set, which includes reproduction (Friggens and Chagunda, 2005) and ketosis detection models (Nielsen et al., 2005).
| CONCLUSIONS |
|---|
|
|
|---|