|
|
||||||||
Department of Dairy Science, University of Wisconsin, Madison 53706
1 Corresponding author: gulisija{at}calshp.cals.wisc.edu
| ABSTRACT |
|---|
|
|
|---|
7%, and inbreeding depression seemed to stabilize at F >20% for fat and protein yield. For SCS, both parametric and LOESS fits were consistent with the absence of sizable dominance effects. Effects of inbreeding on performance seemed to be more complex than suggested by previous studies based on linear regression. Results should be interpreted with caution because the data were scarce at high levels of F.
Key Words: nonparametric regression inbreeding Jersey cattle
| INTRODUCTION |
|---|
|
|
|---|
Epistasis can also contribute to inbreeding depression (Crow and Kimura, 1970; Bulmer, 1980; Hill, 1982; Lynch, 1991), even though it is thought to play a small or negligible role (Falconer and Mackay, 1996). If dominance interactions involve 2 or more loci, the mean value can be expressed as a high-order polynomial in F (Crow and Kimura, 1970; Lynch and Walsh, 1998). However, only small departures from linearity are expected, because the high-order terms in F tend to be small, especially when F is low (Mackay, 2001). If epistatic effects are sizable, the rate of decline in mean value with respect to F can either diminish as inbreeding increases (i.e., diminishing epistasis) or accelerate with respect to F (i.e., reinforcing epistasis).
Inbreeding depression due to unfavorable recessive alleles with major effects can be effectively reduced or purged via selection against inbred individuals, or by random drift (Hedrick, 1994; Glemin, 2003; Gulisija and Crow, accepted). Thus, some of the highly inbred individuals could be removed from a population before reaching productive age. This selection process can produce nonlinearity in the relationship between a trait and the inbreeding level. Purging of deleterious alleles would be reflected by a reduced rate of decline in performance at higher levels of inbreeding (resembling diminishing epistasis).
Numerous studies have used linear-regression to estimate inbreeding depression for production in dairy cattle (e.g., Allaire and Henderson, 1965; Hudson and Van Vleck, 1984; VanRaden et al., 1992; Miglior et al., 1995b; Smith et al., 1998), but evidence of effects of inbreeding on SCC is scarce (Miglior et al., 1995a; Smith et al., 1998). Several studies in the animal breeding field have addressed the form of the relationship between quantitative traits and F. Departures from linearity at higher levels of F were indicated in dairy cattle (Hudson and Van Vleck, 1984; Miglior et al., 1992; Thompson et al., 2000b; Wall et al., 2005) and in horses (Klemetsdal, 1998). These studies suggested a faster depression rate at higher values of F, with little recovery at the highest levels of inbreeding. Although Thompson et al. (2000a) obtained similar results for 305-d milk yield, no systematic departure from linear inbreeding depression was found for fat and protein yield in US Holsteins. Previous studies suggest that the relationship between quantitative traits and the inbreeding coefficient might be more complex than would be expected from a genetic model with additive and dominance effects only.
The objective of this research was to examine the impact of inbreeding on SCS and on milk, protein, and fat yields in US Jersey cows using nonparametric regression models.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Statistical Models
A 2-stage analysis was performed. First, empirical best linear unbiased predictions (EBLUP) of residuals in models that accounted for additive genetic effects, with or without F as a covariate, were obtained. In the second stage, EBLUP were studied via a nonparametric regression method (LOESS), described later.
Inbreeding depression is expected to be a function of dominance effects and of interactions involving dominance effects. The assumption in this study was that, working with the EBLUP residuals, the analysis would become net of additive effects, which do not contribute to inbreeding depression. Although the entire analysis can be performed in a single stage, the 2-step approach conferred computational flexibility, because varying forms of inbreeding depression could be explored in a local regression context (in the second stage). The EBLUP residuals from models accounting for F were studied to assess the goodness of fit.
In the first stage, 4 univariate linear models were fitted to first-lactation SCS and milk, fat, or protein yield (305-d mature equivalent) records. Model 1 was
![]() |
where y is an n x 1 vector of SCS and milk, protein, or fat yields; X and Z are incidence matrices of order n x p and n x q, respectively; ß is a p x 1 vector of fixed effects; and u is a q x 1 vector of random additive genetic effects for all the animals in the pedigree, with q = 266,470. The ß vector included herd-year-season effects (6,406 levels, with season classes of January to April, May to August, and September to December), age at calving effects (6 levels: <617, 617 to 716, 717 to 816, 817 to 916, 917 to 1,016, or >1,016 d of age), and a regression coefficient on DIM deviation from the average (
= 263), and e is an n x 1 vector of random residuals, with the typical element eijk (corresponding to cow k calving in herd-year-season i at age j). Although yields were 305-d mature equivalent, DIM had a small but significant effect. Because there was no colinearity between F and DIM, this covariate was included in the model.
Genetic and residual effects were assumed to be mutually independent, with u
N(0,A
a2) and e
N(0,I
e2), where A is the additive relationship matrix (containing 1 + Fi in the ith diagonal position, where Fi is the inbreeding coefficient of animal i), I is an identity matrix of order equal to the number of records, and
a2 and
e2 are additive genetic and residual components of variance, respectively.
Model 2 was like model 1, but it also had a linear regression of performance on F, such that ß included a linear regression coefficient of performance on in- breeding level (Fk = inbreeding coefficient of cow k). Model 3 was like model 2, plus a quadratic regression coefficient of performance on F. Last, model 4 included a cubic effect, in addition to the first- and second-order terms.
Variance components were estimated by univariate REML with the program REMLF90 (http://nce.ads.uga.edu/~ignacy/programs.html) for each of the 4 traits. Using the REML estimates as true parameter values, mixed-model equations were solved using the program BLUPF90 (http://nce.ads.uga.edu/~ignacy/programs.html), to obtain estimates of model effects. The EBLUP of residuals were then computed as
![]() |
Local Regression Analysis with LOESS
The EBLUP residuals (êijk), which were expected to be net of effects of location parameters and of additive genetic values, were modeled via local regression using F as an explanatory variable. The objective was to assess the extent to which models 1 to 4 adequately described the relationship between performance and inbreeding.
Local regression is a nonparametric approach to fitting curves and surfaces to data based on smoothing (Cleveland and Loader, 1996). The local regression method used, LOESS (Cleveland, 1979; Cleveland and Devlin, 1988; Cleveland et al., 1992), approximates the relationship between response and explanatory variables locally by a smooth curve based on a parametric function, using locally weighted least squares. Weights are assigned such that points close (Euclidean distance) to the predictor value (F) of interest receive higher weight. Also, a subset of the data can be used in the local fit. Subsets are distinct across neighborhoods, constructed such that values of the predictor variables are close to the focal predictor value of interest. Neighborhood size is defined via a spanning parameter (f); f = 1 implies that all data points are used in the local fit, and a spanning parameter <1 denotes that only a subset (f x 100%) of the points is used in the local fit. A smaller spanning parameter allows more detailed assessment of local behavior but increases variance. Once local predictions are obtained for each given predictor, predicted values are returned, and a smoothed curve is formed. In our context, LOESS allows discovery of ranges of F values where inbreeding depression differs from what would be expected based on global fit.
As opposed to splines, which are frequently used for smoothing, LOESS is arguably more robust, because it is less influenced by "knot-to-knot" polynomial fit. As with splines, the LOESS method can also be embedded into mixed-model methodology (i.e., semiparametric models). However, LOESS and EBLUP residuals were used in this study, because a 2-step approach provided more opportunities to explore local behaviors in a flexible and computationally feasible manner.
F-Statistics were computed to test whether LOESS improved fit over first-order regression on F for EBLUP residuals from model 1. This is valid, because parametric and nonparametric models are properly nested; a linear relationship is a special case of a general, potentially nonlinear, relationship (Fox, 2005).
A LOESS analysis can be affected by outliers or by data structure, such as scarcity of observations at high values of F. In addition to regular LOESS curves, robust LOESS fits for production traits were obtained using the entire data set. In robust LOESS, which is designed to attenuate outliers, weights are assigned through an iterative process that penalizes data points that are far away from their predicted values. To avoid misleading inference due to data structure, and to explore different values of spanning parameters, medians of EBLUP residuals were computed at each value of F (1,622 distinct F values at 4 decimal digits). These medians were used for robust LOESS fit at optimal values of the spanning parameter f. Optimal values of f were assessed via cross-validation. The cross-validation function, CV(f), was the average of squared differences of values at focal data points and predictions from a fit obtained without using the focal data point (Fox, 2005). CV(f) was evaluated at f values ranging from 0.1 to 1 (by 0.1 increments), with the optimal f being the minimizer of this function.
To assess uncertainty associated with the LOESS fit, a simple bootstrap approach was used (Efron, 2005). Here, the 1,622 medians of the EBLUP residuals were resampled with replacement and 100 bootstrap samples were obtained. For each bootstrap sample, a robust LOESS curve was fitted, thus creating a family of 100 regression curves. Computations utilized the R software (R Development Core Team, 2004).
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
|
a (estimated from model 1). Somatic cell score did not exhibit detectable inbreeding depression, irrespective of the order of the polynomial fitted (P > 0.05). Similar results were reported by Smith et al. (1998) and Thompson et al. (2000b). Miglior et al. (1995a) did find an effect of 0.012 units per percentage of F, which was small. These findings suggest that for SCS, the effect of dominance is rather small or absent.
|
In the quadratic model, second-order regressions on F were positive for yield traits, suggesting a diminishing rate of inbreeding depression. Furthermore, the inbreeding depression seemed to stabilize at high values of F. This could reflect diminishing epistasis or partial purging of deleterious alleles by selection.
In the third-order model (Table 1
), coefficients corresponding to linear and quadratic terms in F were negative for yield traits, whereas cubic terms were positive. Partial recovery (perhaps due to purging of deleterious genes) was suggested for milk yield at F > 25%. A similar pattern was observed for fat and protein yields, where a diminishing rate of depression occurred at high values of F.
To examine whether models 2 to 4 fitted the data better than model 1, the EBLUP of residuals from model 1 and 3 models including F as a covariate in the explanatory structure were analyzed via simple linear and non-parametric regression. Regressions of EBLUP residuals from model 1 with inbreeding coefficient as a covariate indicated that third-order regression on F improved fit significantly over models with lesser order regression on F for yield traits (P < 0.01, Table 2
). Robust LOESS curves of EBLUP residuals from all models supported these findings and are discussed subsequently.
|
The LOESS curves for yield traits from the model ignoring F (model 1) are shown in Figure 2
. These curves, using the entire data (f = 1) with a local weighted least squares fit, were obtained using a second-degree local polynomial for both the regular and robust fits. In general, LOESS suggested nonlinear relationships between yield traits and F.
|
20%, whereas minor departures from linearity were noted for milk yield. For fat yield, inbreeding depression also seemed to stabilize at high levels of inbreeding, because no further depression was observed for F > 23%.
Once outliers were attenuated via the robust fits, milk yield did not decrease until F
9%. Inbreeding depression appeared to be less severe throughout the range of F with the robust LOESS fit. In general, regular and robust LOESS had similar forms, but with some discrepancies at the higher levels of F. For example, with robust LOESS, protein stabilized at higher levels of F than that observed with regular LOESS. Also, fat yield recovered at a smaller F (24%) with robust LOESS, which did not happen with the regular fit. Milk yield followed a similar trajectory as that observed in the regular fit.
The small discrepancies between the regular and robust LOESS analysis could be an artifact of data structure. However, it is possible that a few rare alleles with strong effects may underlie the differences. The observed curvature in the 2 analyses may indicate that animals having higher levels of F are perhaps survivors of a purging process. Selection could have removed genetic load prior to production. Thus, the interplay of selection and inbreeding can distort the picture of the inbreeding depression.
Hudson and Van Vleck (1984), Miglior et al. (1992), Thompson et al. (2000b), and Wall et al. (2005) also reported nonlinearity in inbreeding depression. Wall et al. (2005) observed a lesser effect until F = 8%, in agreement with our study. However, other studies cited seem to suggest a faster depression rate at higher values of F, with little recovery at the highest levels of inbreeding. Hudson and Van Vleck (1984) and Miglior et al. (1992) associated recovery with involuntary culling and sparse data structure. However, except for the study of Wall et al. (2005), in the cited studies pedigree data was not edited as stringently as in this study. It is possible that with stringent data editing, cows were selected with more generations of inbred ancestors since the base. Thus, it is possible that the analyzed sample consisted of animals that had, on average, more opportunity for purging than an average cow.
Miglior et al. (1994) found between-family heterogeneity in inbreeding depression for production traits in a group of Holstein cattle. Also, Gulisija et al. (2006) found between-founder heterogeneity in inbreeding depression for production traits from the sample of cows used in this study. These findings are important because the variability among cattle families implies that relatively few alleles with major effects are involved in inbreeding depression. Major-effect alleles can be effectively selected against in homozygotes, and this is the basic mechanism of purging. Gulisija and Crow (accepted) showed that the inbreeding decline due to major factors is reduced by about 12.6% because of ancestral inbreeding in the group of cows used in this study. Given that the opportunity of purging increased with F for the sample of animals used in analyses (correlation = 0.93, unpublished results), diminishing effects of F at the higher levels were expected.
Robust LOESS Analyses of Medians of EBLUP Residuals from Model 1
Results from the LOESS analysis may have been affected by data structure. For instance, while LOESS curves for yields were decreasing (Figure 2
), only 35 cows had F < 1%, and the data set was sparse for F > 15% (Figure 1
). Hence, LOESS predictions for highly inbred animals might be strongly influenced by residuals from animals with small F. To reduce possible distortions caused by an uneven data structure, medians of EBLUP residuals were computed for each unique F value, in the sense mentioned earlier.
Figure 3
gives LOESS curves obtained with f = 0.9, 0.5, and 0.9 (optimal values) and a second-degree local polynomial, along with bootstrapped LOESS curves for milk, fat, and protein yields, respectively. The robust LOESS curve for milk yield was strictly decreasing, and at a faster rate at higher inbreeding levels. The fat yield curve increased slightly at small levels of F (F < 4%), as opposed to the strictly decreasing curves found with the entire data. After a drop for F values between 8 and 15%, fat and protein yields did not show further depression at higher levels of F, but rather yields increased. However, statistical variability was very large at very small values of inbreeding, and especially at F > 15%, making it impossible to draw any definitive conclusions outside of a range of F between 5 and 15%.
|
|
Overall, the LOESS analyses indicated that additive models did not fit the data well. A parametric model including a cubic regression on F provided the best fit for all traits. Because this is an observational study (as opposed to a randomized trial), and the data were sparse at high levels of F, results from this study do not allow stronger inference.
Furthermore, before editing, most animals had pedigrees traceable until the 1950s, but only the subset used in the analyses had at least 6 generations of known pedigree. Perhaps the animals from this study accumulated their genomes, on average, through more generations of inbred ancestors from the base than an average cow in the population, thereby giving more opportunity for selection to operate against alleles with deleterious effects. Thus, results may not reflect a general population pattern.
| CONCLUSIONS |
|---|
|
|
|---|
7%, and there was no additional depression in fat yield for F > 20%, whereas protein yield seemed to stabilize at the higher levels of inbreeding. The joint effects of selection and of inbreeding depression could not be disentangled in this observational study. Cows with
6 generations of complete pedigree data may have accumulated inbreeding through a greater number of generations, possibly giving selection more chances to purge deleterious alleles. Somatic cell scores did not exhibit detectable inbreeding depression, suggesting the absence of detectable dominance. Including F as a covariate in the linear model appeared to improve fit at small values of F, but it did not alleviate the lack of fit at higher levels of inbreeding. Standardized yield residuals increased with F > 10% for models with first- and second-order regressions on F, whereas third-order regressions on inbreeding provided an adequate fit to the data. These results should be interpreted with caution because the data were scarce at high levels of F.
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
Received for publication December 20, 2005. Accepted for publication June 12, 2006.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
S. M. Parland, J. F. Kearney, M. Rath, and D. P. Berry Inbreeding Effects on Milk Production, Calving Performance, Fertility, and Conformation in Irish Holstein-Friesians J Dairy Sci, September 1, 2007; 90(9): 4411 - 4419. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |