当前位置: 首页 > 期刊 > 《基因杂志》 > 2003年第3期 > 正文
编号:10585765
cc in the Case of a Spike in the Phenotype Distribution
http://www.100md.com 《基因杂志》2003年第3期
     a Department of Biostatistics, Johns Hopkins University, Baltimore, Maryland 21205^4?&[a\, 百拇医药

    ABSTRACT^4?&[a\, 百拇医药

    A common departure from the usual normality assumption in QTL mapping concerns a spike in the phenotype distribution. For example, in measurements of tumor mass, some individuals may exhibit no tumors; in measurements of time to death after a bacterial infection, some individuals may recover from the infection and fail to die. If an appreciable portion of individuals share a common phenotype value (generally either the minimum or the maximum observed phenotype), the standard approach to QTL mapping can behave poorly. We describe several alternative approaches for QTL mapping in the case of such a spike in the phenotype distribution, including the use of a two-part parametric model and a nonparametric approach based on the Kruskal-Wallis test. The performance of the proposed procedures is assessed via computer simulation. The procedures are further illustrated with data from an intercross experiment to identify QTL contributing to variation in survival of mice following infection with Listeria monocytogenes.

    THE standard approach for mapping the genetic loci (quantitative trait loci, QTL) contributing to variation in a quantitative trait makes use of the assumption that the residual environmental variation follows a normal distribution (LANDER and BOTSTEIN 1989 ). A common departure from this assumption is to observe a spike in the phenotype distribution. For example, in , the survival time (in hours) following infection with Listeria monocytogenes is displayed, for 116 female intercross mice (from BOYARTCHUK et al. 2001 ). Approximately 30% of the mice recovered from the infection and survived to the end of the study (264 hr). Other examples include the density of metastatic tumors, with some individuals exhibiting no metastasis (HUNTER et al. 2001 ), and gallstone weight, with some individuals having no gallstones (WITTENBURG et al. 2002 ).:hw, 百拇医药

    fig.ommitted:hw, 百拇医药

    Figure 1. Histogram of survival time, following infection with Listeria monocytogenes, of 116 intercross mice. Approximately 30% of the mice recovered from the infection and survived to the end of the experiment (264 hr).

    Let us assume, without loss of generality, that the spike in the distribution is at 0 (which we call the null phenotype) and that all other phenotype values are strictly positive. QTL mapping under a normal model can work reasonably well in this situation if the proportion of individuals with the null phenotype is not large and the remainder of the phenotype distribution is not far above 0. However, when this is not the case, maximum-likelihood estimation under a normal mixture model can occasionally produce spurious LOD peaks in regions of low genotype information (e.g., widely spaced markers).uz5:|^, http://www.100md.com

    A simple method of analysis is to consider separately the binary trait, defined by whether or not an individual has the null phenotype, and the quantitative trait, for those individuals having a strictly positive phenotype. We develop a parametric, two-part model that allows us to combine these two analyses. In this single-QTL model, an individual with QTL genotype g has probability {pi} g of having a nonzero phenotype; if its phenotype is nonzero, the value is assumed to follow a normal distribution with mean µg and standard deviation (SD) .

    We also describe an extension of the Kruskal-Wallis test statistic for nonparametric interval mapping in an intercross (exactly analogous to the extension of the rank-sum test described in KRUGLYAK and LANDER 1995 , which was suitable for a backcross). While KRUGLYAK and LANDER 1995 had randomized the rank of any tied phenotypes, we assign the average rank to tied phenotypes and apply a standard correction factor. A possible advantage of the nonparametric approach is that the statistical test concerns 2 d.f., while the test derived from use of the two-part model concerns 4 d.f. and so has a larger null threshold.\jq, 百拇医药

    We illustrate the use of these procedures with data on survival time of mice, following infection with Listeria monocytogenes (BOYARTCHUK et al. 2001 ). We further study their performance via computer simulations.\jq, 百拇医药

    METHODS\jq, 百拇医药

    Consider n F2 progeny from an intercross between two inbred strains. Let yi denote the quantitative phenotype for individual i. We assume, without loss of generality, that the spike in the phenotype distribution is at 0. Let zi = 0 if yi = 0 and zi = 1 if yi > 0. Consider data on a set of genetic markers, with a known genetic map. Let mi denote the multipoint marker data for individual i.

    Conditional and binary trait analyses:'kmx(, 百拇医药

    A simple approach for QTL mapping in this situation is to first analyze the quantitative phenotype, yi, using only the individuals for which yi > 0, by standard interval mapping using a normal model (LANDER and BOTSTEIN 1989 ), and then separately analyze the binary trait zi.'kmx(, 百拇医药

    The analysis of the binary trait deserves further explanation. XU and ATCHLEY 1996 described maximum-likelihood estimation for a binary trait in the context of composite interval mapping (ZENG 1993 , ZENG 1994 ). VISSCHER et al. 1996 and MCINTYRE et al. 2001 described approximate methods for analysis of binary traits. We prefer the approach of XU and ATCHLEY 1996 . We briefly describe the special case of no marker covariates.'kmx(, 百拇医药

    We consider some fixed position in the genome as the location of a putative QTL and let gi = 1, 2, or 3, according to whether individual i has genotype AA, AB, or BB, respectively, at the QTL. Let us assume that the binary phenotypes, zi, are independent, and let {pi} j = Pr(zi = 1|gi = j). Given the marker data, mi, but not knowing the QTL genotypes gi, the zi follow mixtures of Bernoulli distributions (analogous to the mixtures of normals that arise in standard interval mapping).

    We assume that we may calculate pij = Pr(gi = j|mi), the QTL genotype probabilities, given the observed multipoint marker data. Under no crossover interference and no genotyping errors, the distribution depends only on the nearest flanking typed markers, but one may also use the approach of LINCOLN and LANDER 1992 to take account of the presence of genotyping errors.;f@v, http://www.100md.com

    The likelihood for the parameters {pi} = ({pi} j), given the observed data {(mi,zi)}, is then;f@v, http://www.100md.com

    We obtain maximum-likelihood estimates (MLEs), j, using a form of the expectation-maximization (EM) algorithm (DEMPSTER et al. 1977 ). At iteration s + 1, we have estimates of the parameters, (s). In the E-step, we calculate weights for each individual and for each genotype:;f@v, http://www.100md.com

    In the M-step, we reestimate the probabilities {pi} j as weighted proportions using the weights, w(s+1)ij:

    We begin the algorithm by taking and iterate until the estimates converge, giving the MLE, .6, http://www.100md.com

    We next calculate a LOD score for the test of H0: j . First note that the MLE, under H0, of the common probability {pi} is the overall proportion, 0 = izi/n. Letting 0 = (0, 0, 0), the LOD score is LOD = log10 {L()/L(0)}.6, http://www.100md.com

    As with standard interval mapping, the likelihood under H0 is calculated once, while the EM algorithm is performed at each position in the genome (in practice, at 1-cM steps), producing a LOD curve for each chromosome.6, http://www.100md.com

    Two-part model:6, http://www.100md.com

    The two separate analyses described above suggest the following two-part, single-QTL model. We again consider n F2 progeny and some fixed position in the genome as the location of a putative QTL. Let yi, zi, gi, and mi be defined as above, and again let pij = Pr(gi = j|mi).

    We assume that the (mi, yi, zi) are mutually independent, that Pr(zi = 1|gi = j) = j, and that yi|(gi = j, zi = 1) normal(µj, {sigma} 2). In other words, the probability that an individual with QTL genotype j has the null phenotype is 1 - {pi} j; if this individual's phenotype is nonnull, it follows a normal distribution with mean µj, depending on the QTL genotype, and with SD {sigma} , independent of genotype.(?s*&04, http://www.100md.com

    This model contains seven parameters, {theta} = ({pi} 1, {pi} 2, {pi} 3, µ1, µ2, µ3, {sigma} ). The likelihood function is(?s*&04, http://www.100md.com

    where f(y; µ, {sigma} ) is the density function for a normal distribution with mean µ and SD {sigma} .(?s*&04, http://www.100md.com

    We may again obtain MLEs with a form of the EM algorithm. Assume at iteration s + 1 we have estimates (s). In the E-step, we calculate weights for each individual and each genotype:

    In the M-step, we obtain revised estimates of the parameters according to the following equations:)!i}, 百拇医药

    We again start the algorithm by taking and iterate until the estimates converge, producing the MLEs, .)!i}, 百拇医药

    We may calculate a LOD score for the test of H0: j , µj µ. We first note that, under H0, the MLEs of the three parameters, , µ, and , are)!i}, 百拇医药

    In other words, 0 is the proportion of individuals with a positive phenotype, and 0 and 0 are the sample mean and SD, among individuals with positive phenotypes. Letting 0 = (0, 0, 0, 0, 0, 0, 0), the LOD score is LOD = log10{L()/L(0)}.)!i}, 百拇医药

    Note that in the case of complete QTL genotype information (i.e., when the putative QTL is at a marker that has been fully typed), the pij are all either 1 or 0, and the two parts of the model separate fully. As a result, the MLEs under the two-part model are exactly those obtained by the two separate analyses (the analysis of the binary trait and the conditional analysis of the quantitative trait, for those individuals with nonzero phenotype). Further, the LOD score for the two-part model is simply the sum of the LOD scores from the two separate analyses.

    Nonparametric analysis:1iu?9.$, http://www.100md.com

    KRUGLYAK and LANDER 1995 described an extension of the Wilcoxon rank-sum test for nonparametric interval mapping in a backcross. The rank-sum test is a nonparametric version of the two-sample t-test. In the case of an intercross, they suggested tests for the additive or dominant effects at a putative QTL. An alternative approach is to extend the Kruskal-Wallis test statistic, a nonparametric version of a one-way analysis of variance, for the comparison of two or more samples (e.g., see LEHMANN 1975 ). We describe such an extension below.1iu?9.$, http://www.100md.com

    Rank the phenotypes, yi, from 1, ... , n, and let Ri denote the rank for individual i. In the case of ties, use the average rank within each group of ties. We again consider some fixed position in the genome as the location of a putative QTL and let pij = Pr(gi = j|mi), the QTL genotype probabilities for individual i, given the available multipoint marker data. Whereas, in the Kruskal-Wallis test statistic, one considers the sum of the ranks within each group, here the exact assignment of individuals to QTL genotype groups is not known; rather, individual i has prior probability pij of belonging to group j. We follow the approach of KRUGLYAK and LANDER 1995 and consider the expected rank sum, Sj = ipijRi. We then consider the statistic

    where E0j and V0j are the mean and variance of Sj under the null hypothesis of no linkage, considering the pij as fixed. After some algebra, we obtain the formula6)5/., 百拇医药

    In the case that the putative QTL is at a fully typed genetic marker, the pij will all be 0 or 1, and the above statistic reduces to the Kruskal-Wallis test statistic.6)5/., 百拇医药

    KRUGLYAK and LANDER 1995 had randomized any tied phenotypes, a reasonable approach in the case of very few ties. In our application, however, a large proportion of the individuals share a common phenotype. Thus, rather than randomizing ties, we assign the average rank to each individual within a set of tied phenotypes. A standard correction for the case of ties is to use the statistic H' = H/D, where , with tk being the number of values in the kth group of ties. Note that if there are no ties, D = 1 and so H' = H. [Of course, if one uses a permutation test (CHURCHILL and DOERGE 1994 ) to obtain the genome-wide significance threshold, as we recommend, the correction factor is unnecessary.] As the nonparametric statistic H' follows, approximately, a {chi} 2 distribution under the null hypothesis of no linkage, we convert the statistic to the LOD scale by taking LOD = H'/(2 ln 10).

    EXAMPLE0, 百拇医药

    To illustrate our methods, we consider the data of BOYARTCHUK et al. 2001 , on the time to death following infection with L. monocytogenes in 116 F2 mice from an intercross between the BALB/cByJ and C57BL/6ByJ strains. The mice were typed at 133 markers, including 2 on the X chromosome. A histogram of the survival times (in hours) appears in . Note that ~ 30% of the mice recovered from the infection and survived to 264 hr.0, 百拇医药

    We applied each of the four methods described above to these data: analysis of the binary trait, survived/died ("binary"); standard interval mapping with the log time to death, with only those mice that died ("QT"); use of our two-part model ("two-part"); and the nonparametric interval-mapping method based on the Kruskal-Wallis test statistic ("NP").0, 百拇医药

    Genome-wide LOD thresholds were obtained by permutation tests (CHURCHILL and DOERGE 1994 ), using 11,000 permutation replicates. The estimated 95% genome-wide LOD thresholds for the four methods, binary, QT, two-part, and NP, were 3.54, 3.96, 4.91, and 3.27, respectively. The estimated standard errors (SEs) for these thresholds were ~ 0.02.

    Because of the large differences in the LOD thresholds for the four methods, we converted the LOD curves to a common scale, the estimated experiment-wise P values derived from the permutation tests. The results indicated evidence for QTL on chromosomes 1, 5, 13, and 15. In , the statistic -log10P for each method is displayed for these selected chromosomes.9#to#}, http://www.100md.com

    fig.ommitted9#to#}, http://www.100md.com

    Figure 2. Results of four QTL mapping methods for data on survival time following infection with Listeria monocytogenes in 116 intercross mice. The four methods are analysis of the binary trait, survived/died (binary); standard interval mapping using the log time to death for the nonsurviving individuals (QT); use of the two-part model (2-part); and nonparametric interval mapping (NP). LOD scores were converted to experiment-wise P values derived from permutation tests; -log10P is plotted as a function of genomic position. A horizontal line is plotted at -log10(0.05).9#to#}, http://www.100md.com

    The locus on chromosome 1 appears to have an effect only on the average time to death among the nonsurvivors. The locus on chromosome 5 appears to have an effect only on the chance of survival. The loci on chromosomes 13 and 15 have an effect on both the chance of survival and the average time to death among nonsurvivors. Note that the locus on chromosome 15 achieved the 5% genome-wide significance level only with the nonparametric interval-mapping method.

    SIMULATIONS!3}, 百拇医药

    To better understand the relative performance of these approaches for QTL mapping in the case of a spike in the phenotype distribution, we performed a small simulation study. We first estimated the 95% genome-wide LOD threshold for each method, in the case of 250 intercross individuals with 25% having the null phenotype and an autosomal genome modeled after the genetic map for the mouse described in ROWE et al. 1994 , consisting of 19 autosomes with total length 1300 cM. Genetic markers were equally spaced on each chromosome, with a marker spacing of 10–12 cM. (The intermarker spacing was slightly different for each chromosome, so that the chromosomal lengths could match those in the genetic maps of ROWE et al. 1994 .) A random 10% of the marker genotype data was missing. We simulated a phenotype that was independent of the marker data. Each individual had probability 25% of having a null phenotype; otherwise their phenotype was drawn from a normal distribution with mean 10 and SD 1.

    For each of 10,000 replicates, we simulated such data under the null hypothesis of no QTL, applied each of the four methods, and recorded the maximum LOD score, genome-wide, for each method. The 95th percentiles of the maximum LOD score, for the four methods, binary, QT, two-part, and NP, were 3.55, 3.53, 4.64, and 3.41, respectively. Note that the binary, QT, and NP methods have similar LOD thresholds. The LOD threshold for the two-part model is much higher, due to the fact that the corresponding statistical test concerns four free parameters, rather than two.y%, 百拇医药

    We also considered a fifth approach, in which one takes the maximum of the LOD scores from the binary and conditional quantitative trait analyses. For this approach, we used a Bonferroni correction and declared significant linkage if the LOD scores for either the binary trait analysis or the conditional quantitative trait analysis exceeded the corresponding 97.5% genome-wide LOD thresholds, which were estimated to be 3.88 and 3.86, respectively.

    To investigate the power and precision of each of these methods, we simulated data under the two-part model described above, with a single QTL located between two markers near the center of chromosome 1 (of length 103 cM). The QTL was taken to have multiplicative effect on the probabilities {pi} j and additive effect µ on the conditional means µj. The probabilities, {pi} j, were chosen so that 2 = 1 and and so that the overall proportion of individuals with positive phenotypes was {pi} 1/4 + {pi} 2/2 + {pi} 3/4 = 75%. The means were chosen so that µ1 = µ2 - µ and µ3 = µ2 + µ, with µ2 = 10. The residual SD was {sigma} = 1. We considered the values {phi} {pi} = 1, 1.5, and 2 and µ = 0, 0.4, and 0.6. (Note that = 1 and µ = 0 correspond to no QTL effect.)

    We performed 4000 simulations of 250 intercross individuals, for all pairs of effects (, µ), except for the case = 1, µ = 0. The latter corresponds to the null hypothesis of no QTL; simulations for this case were used to estimate the LOD thresholds (see above). In each case, we applied the four methods to the simulated data on chromosome 1 (containing the QTL), calculated the maximum LOD score on that chromosome, and finally calculated the power of each test, as the proportion of the simulation replicates for which the maximum LOD score exceeded the corresponding 95% genome-wide LOD threshold. The power of the fifth procedure, taking the maximum of the binary and conditional quantitative trait LOD scores, was estimated as the proportion of the 4000 replicates in which either the binary or the conditional quantitative trait LOD score exceeded its corresponding 97.5% genome-wide LOD threshold.3c+11ko, 百拇医药

    The estimated power of the procedures appears in . In and , the QTL had effect only on the probabilities, {pi} j. In these cases, the conditional analysis of the quantitative trait had no power, and the analysis of the binary trait had the greatest power. The two-part model was somewhat inferior to the binary trait analysis, but had greater power than the nonparametric method. Use of the maximum of the binary and conditional quantitative trait LOD scores (with correction for the use of two tests) had somewhat greater power than the two-part model.

    fig.ommitted3'@gj&, http://www.100md.com

    Figure 3. Estimated power (±2 SE) to detect a QTL, based on 4000 simulation replicates. An intercross with 250 individuals was simulated, with an average of 25% of individuals having the null phenotype. The alleles at the QTL have an additive effect, µ, on the phenotype means, and a multiplicative effect, , on the probability of having a positive phenotype. Five analysis methods were studied: analysis of the binary phenotype (binary), standard interval mapping using only those individuals with a positive phenotype (QT), the maximum of the binary and conditional quantitative trait analyses (max), use of the two-part model (2-part), and a nonparametric analysis (NP).3'@gj&, http://www.100md.com

    In and , the QTL had effect only on the conditional means, µj. In these cases, analysis of the binary trait had no power, and the conditional analysis of the quantitative trait had the greatest power. The results for the other methods were similar to the results in and : the two-part model was superior to the nonparametric method, but inferior to either the conditional quantitative trait analysis on its own or the maximum of the binary and conditional quantitative trait analyses.

    In , , , and , the QTL had effect on both the probabilities, {pi} j, and the conditional means, µj. In these cases, the nonparametric method was best, although the use of the two-part model was competitive; both of these approaches showed considerable gains over either of the two separate analyses and over the maximum of the two separate analyses.xds}}, 百拇医药

    contains the results on the precision of QTL localization for the four basic methods. For each method and for each setting of the parameter values ({pi} , µ), the root-mean-square (RMS) of the error in the estimated QTL location, among simulation replicates in which there was significant evidence for the presence of a QTL (i.e., in which the maximum LOD score exceeded the corresponding 95% genome-wide LOD threshold), was calculated. Results for the conditional quantitative trait analysis (QT) for and , and for the binary trait analysis for and , are not shown, since these methods have no power to detect a QTL with the corresponding parameter settings. The results in mirror those in . The methods with the highest power have the greatest precision of QTL localization (i.e., the smallest RMS error), while those with the lowest power have the lowest precision.

    fig.ommittedq|p;d*e, 百拇医药

    Figure 4. Estimated root-mean-square (RMS) error (±2 SE) of the estimated QTL location, among simulation replicates showing significant evidence for the presence of a QTL.q|p;d*e, 百拇医药

    In summary, if a QTL has an effect only on the probabilities, {pi} j, or the conditional means, µj, greatest power to detect the QTL is obtained with the separate analysis of that aspect of the data. If a QTL has an effect on both the probabilities, {pi} j, and the conditional means, µj, the nonparametric method performed best. In all cases, analysis under the two-part model (with which the data were simulated) was second place, in terms of power. Note that further simulations, with 100 rather than 250 intercross individuals and with the proportion of individuals with the null phenotype taken to be 15 or 35% rather than 25%, gave qualitatively similar results (data not shown).q|p;d*e, 百拇医药

    DISCUSSIONq|p;d*e, 百拇医药

    We have considered the problem of QTL mapping in the case of a spike in the phenotype distribution, a common departure from the usual normality assumption in standard interval mapping. Standard interval mapping works reasonably well when the spike is not too far from the rest of the phenotype distribution and contains only a small proportion of the individuals. When the spike is well separated and contains an appreciable proportion of the data, maximum-likelihood estimation under a normal mixture model has a tendency to produce spurious LOD score peaks in regions of low genotype information (e.g., widely spaced markers).

    We developed a parametric, two-part model for QTL mapping in this situation and have described an extension of the Kruskal-Wallis test statistic for nonparametric interval mapping in the case of an intercross. These approaches serve to combine the analysis of the binary trait with the conditional analysis of the quantitative trait among individuals with positive phenotype.l+t^|, 百拇医药

    The interpretation of the results of analysis with the two-part model may deserve further explanation. A QTL identified through the two-part model may influence the probability of having a nonnull phenotype or the average phenotype among individuals with positive phenotypic values or both. Inspection of the estimated QTL effects (the j and j) or of the results of the separate binary and conditional quantitative trait analyses should assist in discriminating between these cases.l+t^|, 百拇医药

    In our simulation results, most interesting was the comparison among the two-part model, the nonparametric method, and the maximum of the binary and conditional quantitative trait analyses. In the case that QTL have an effect on both the parameters {pi} j (the probability that an individual with QTL genotype j will have a positive phenotype) and µj (the conditional mean phenotype, among individuals with positive phenotype and QTL genotype j), the nonparametric approach was seen to have greater power than analysis under the two-part model; this is largely due to the fact that the genome-wide LOD threshold is considerably larger for the latter method. In the case that QTL have an effect on only the {pi} j or only the µj, the maximum of the separate analyses will have greatest power, and the nonparametric method will have the least power. Thus, analysis under the two-part model is always second best. On the other hand, the overall average power, across the eight parameter settings considered herein, was greatest for the two-part model. Further, the parametric, two-part model may be more useful in consideration of multiple-QTL models.

    Thus, while nonparametric interval mapping is a valuable general method, analysis under the two-part model may be preferred for the situation considered here. The extensions of the two-part model for use with multiple QTL (for example, by combining a logistic model for the probabilities with a linear model for the conditional means) deserve exploration.x&, 百拇医药

    The methods described in this article have been implemented in the QTL mapping software, R/qtl (), an add-on package for the general statistical software, R (IHAKA and GENTLEMAN 1996 ).x&, 百拇医药

    ACKNOWLEDGMENTSx&, 百拇医药

    The author thanks Victor Boyartchuk and William Dietrich for providing the Listeria data. This work was supported in part by a Faculty Innovation Fund grant from the Johns Hopkins Bloomberg School of Public Health.x&, 百拇医药

    Manuscript received August 1, 2002; Accepted for publication November 26, 2002.x&, 百拇医药

    LITERATURE CITEDx&, 百拇医药

    BOYARTCHUK, V. L., K. W. BROMAN, R. E. MOSHER, S. E. F. D'ORAZIO, and M. N. STARNBACH et al., 2001 Multigenic control of Listeria monocytogenes susceptibility in mice. Nat. Genet. 27:259-260.g\;, 百拇医药

    CHURCHILL, G. A. and R. W. DOERGE, 1994 Empirical threshold values for quantitative trait mapping. Genetics 138:963-971.g\;, 百拇医药

    DEMPSTER, A. P., N. M. LAIRD, and D. B. RUBIN, 1977 Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. B 39:1-38.g\;, 百拇医药

    HUNTER, K. W., K. W. BROMAN, T. LE VOYER, L. LUKES, and D. COZMA et al., 2001 Predisposition to efficient mammary tumor metastatic progression is linked to the breast cancer metastasis suppressor gene Brms1.. Cancer Res. 61:8866-8872.g\;, 百拇医药

    IHAKA, R. and R. GENTLEMAN, 1996 R: a language for data analysis and graphics. J. Comp. Graph. Stat. 5:299-314.g\;, 百拇医药

    KRUGLYAK, L. and E. S. LANDER, 1995 A nonparametric approach for mapping quantitative trait loci. Genetics 139:1421-1428.g\;, 百拇医药

    LANDER, E. S. and D. BOTSTEIN, 1989 Mapping Mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics 121:185-199.

    LEHMANN, E. L., 1975 Nonparametrics: Statistical Methods Based on Ranks. Holden-Day, San Francisco.8{xssqw, http://www.100md.com

    LINCOLN, S. E. and E. S. LANDER, 1992 Systematic detection of errors in genetic linkage data. Genomics 14:604-610.8{xssqw, http://www.100md.com

    MCINTYRE, L. M., C. J. COFFMAN, and R. W. DOERGE, 2001 Detection and localization of a single binary trait locus in experimental populations. Genet. Res. 78:79-92.8{xssqw, http://www.100md.com

    ROWE, L. B., J. H. NADEAU, R. TURNER, W. N. FRANKEL, and V. A. LETTS et al., 1994 Maps from two interspecific backcross DNA panels available as a community genetic mapping resource. Mamm. Genome 5:253-274.8{xssqw, http://www.100md.com

    VISSCHER, P. M., C. S. HALEY, and S. A. KNOTT, 1996 Mapping QTLs for binary traits in backcross and F2 populations. Genet. Res. 68:55-63.8{xssqw, http://www.100md.com

    WITTENBURG, H., F. LAMMERT, D. Q. WANG, G. A. CHURCHILL, and R. LI et al., 2002 Interacting QTLs for cholesterol gallstones and gallbladder mucin in AKR and SWR strains of mice. Physiol. Genomics 8:67-77.8{xssqw, http://www.100md.com

    XU, S. and W. R. ATCHLEY, 1996 Mapping quantitative trait loci for complex binary diseases using line crosses. Genetics 143:1417-1424.8{xssqw, http://www.100md.com

    ZENG, Z-B., 1993 Theoretical basis for separation of multiple linked gene effects in mapping quantitative trait loci. Proc. Natl. Acad. Sci. USA 90:10972-10976.8{xssqw, http://www.100md.com

    ZENG, Z-B., 1994 Precision mapping of quantitative trait loci. Genetics 136:1457-1468.(Karl W. Broman)