MDR and PRP: A comparison of methods for high-order genotype-phenotype associations

被引:41
作者
Bastone, L
Reilly, M
Rader, DJ
Foulkes, AS
机构
[1] Univ Penn, Sch Med, Div Biostat, Philadelphia, PA 19104 USA
[2] Univ Penn, Sch Med, Div Cardiovasc, Philadelphia, PA 19104 USA
[3] Univ Penn, Sch Med, Ctr Expt Therapeut, Philadelphia, PA 19104 USA
[4] Univ Massachusetts, Sch Publ Hlth & Hlth Sci, Dept Biostat, Amherst, MA 01003 USA
关键词
genotype; phenotype; multifactor dimensionality reduction; classification and regression trees; recursive partitioning; patterning; patterning and recursive partitioning; multi-locus; complex disease;
D O I
10.1159/000083029
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Complex diseases such as cardiovascular disease are likely due to the effects of high-order interactions among multiple genes and demographic factors. Therefore, in order to understand their underlying biological mechanisms, we need to consider simultaneously the effects of genotypes across multiple loci. Statistical methods such as multifactor dimensionality reduction (MDR), the combinatorial partitioning method (CPM), recursive partitioning (RP), and patterning and recursive partitioning (PRP) are designed to uncover complex relationships without relying on a specific model for the interaction, and are therefore well-suited to this data setting. However, the theoretical overlap among these methods and their relative merits have not been well characterized. In this paper we demonstrate mathematically that MDR is a special case of RP in which (1) patterns are used as predictors (PRP), (2) tree growth is restricted to a single split, and (3) misclassification error is used as the measure of impurity. Both approaches are applied to a case-control study assessing the effect of eleven single nucleotide polymorphisms on coronary artery calcification in people at risk for cardiovascular disease. Copyright (C) 2004 S. Karger AG, Basel.
引用
收藏
页码:82 / 92
页数:11
相关论文
共 47 条
[1]  
Aplenc R, 2003, BLOOD, V102, p222A
[2]   Diabetes and atherosclerosis - Epidemiology, pathophysiology, and management [J].
Beckman, JA ;
Creager, MA ;
Libby, P .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2002, 287 (19) :2570-2581
[3]  
Breiman L., 1998, CLASSIFICATION REGRE
[4]  
CHIPMAN H, 1996, COMPUTING SCI STAT G, P199
[5]   Multivariate regression trees: a new technique for modeling species-environment relationships [J].
De'Ath, G .
ECOLOGY, 2002, 83 (04) :1105-1117
[6]   Paraoxonase and atherosclerosis [J].
Durrington, PN ;
Mackness, B ;
Mackness, MI .
ARTERIOSCLEROSIS THROMBOSIS AND VASCULAR BIOLOGY, 2001, 21 (04) :473-480
[7]  
EXCOFFIER L, 1995, MOL BIOL EVOL, V12, P921
[8]   Combining genotype groups and recursive partitioning: an application to human immunodeficiency virus type 1 genetics data [J].
Foulkes, AS ;
De Gruttola, V ;
Hertogs, K .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2004, 53 :311-323
[9]  
FOULKES AS, 2005, IN PRESS STAT MED
[10]   PROJECTION PURSUIT ALGORITHM FOR EXPLORATORY DATA-ANALYSIS [J].
FRIEDMAN, JH ;
TUKEY, JW .
IEEE TRANSACTIONS ON COMPUTERS, 1974, C 23 (09) :881-890