Linkage analysis in the presence of errors IV:: Joint pseudomarker analysis of linkage and/or linkage disequilibrium on a mixture of pedigrees and singletons when the mode of inheritance cannot be accurately specified

被引:133
作者
Göring, HHH
Terwilliger, JD
机构
[1] Columbia Univ, Dept Psychiat, New York, NY 10032 USA
[2] Columbia Univ, Dept Genet, New York, NY 10032 USA
[3] Columbia Univ, Columbia Genome Ctr, New York, NY 10032 USA
[4] New York State Psychiat Inst, New York, NY 10032 USA
关键词
D O I
10.1086/302845
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
There is a lot of confusion in the literature about the "differences" between "model-based" and "model-free" methods and about which approach is better suited for detection of the genes predisposing to complex multifactorial phenotypes. By starting from first principles, we demonstrate that the differences between the two approaches have more to do with study design than statistical analysis. When simple data structures are repeatedly ascertained, no assumptions about the genotype-phenotype relationship need to be made for the analysis to be powerful, since simple data structures admit only a small number of df. When more complicated and/or heterogeneous data structures are ascertained, however, the number of df in the underlying probability model is too large to have a powerful, truly "model-free" test. So-called "model-free" methods typically simplify the underlying probability model by implicitly assuming that, in some sense, all meioses connecting two affected individuals are informative for linkage with identical probability and that the affected individuals in a pedigree share as many disease-predisposing alleles as possible. By contrast, "model-based" methods add structure to the underlying parameter space by making assumptions about the genotype-phenotype relationship, making it possible to probabilistically assign disease-locus genotypes to all individuals in the data set on the basis of the observed phenotypes. In this study, we demonstrate the equivalence of these two approaches in a variety of situations and exploit this equivalence to develop more powerful and efficient likelihood-based analogues of "model-free" tests of linkage and/or linkage disequilibrium. Through the use of a "pseudomarker" locus to structure the space of observations, sib-pairs, triads, and singletons can be analyzed jointly, which will lead to tests that are more well-behaved, efficient, and powerful than traditional "model-free" tests such as the affected sib-pair, transmission/disequilibrium, haplotype relative risk, and case-control tests. Also described is an extension of this approach to large pedigrees, which, in practice, is equivalent to affected relative-pair analysis. The proposed methods are equally applicable to two-point and multipoint analysis (using complex-valued recombination fractions).
引用
收藏
页码:1310 / 1327
页数:18
相关论文
共 57 条
  • [1] AGARWALA R, IN PRESS DISCRETE AP
  • [2] An allele of COL9A2 associated with intervertebral disc disease
    Annunen, S
    Paassilta, P
    Lohiniva, J
    Perälä, M
    Pihlajamaa, T
    Karppinen, J
    Tervonen, O
    Kröger, H
    Lähde, S
    Vanharanta, H
    Ryhänen, L
    Göring, HHH
    Ott, J
    Prockop, DJ
    Ala-Kokko, L
    [J]. SCIENCE, 1999, 285 (5426) : 409 - 412
  • [3] [Anonymous], 1866, VERHANDLUNGEN NATURF, DOI DOI 10.5962/BHL.TITLE.61004
  • [4] BLACKWELDER W C, 1985, Genetic Epidemiology, V2, P85, DOI 10.1002/gepi.1370020109
  • [5] CHASE GA, 1977, TRANSPLANT P, V1, P167
  • [6] COTTINGHAM RW, 1993, AM J HUM GENET, V53, P252
  • [7] DAVIS S, 1997, AM J HUM GENET S, V59, pA216
  • [8] DEMOIVRE A, 1956, DOCTRINE CHANCES
  • [9] DUPUIS J, 1995, GENETICS, V140, P843
  • [10] EDWARDS JH, 1976, HGM, V3, P289