Combining genotype groups and recursive partitioning: an application to human immunodeficiency virus type 1 genetics data

被引:18
作者
Foulkes, AS
De Gruttola, V
Hertogs, K
机构
[1] Univ Penn, Sch Med, Div Biostat, Philadelphia, PA 19104 USA
[2] Harvard Univ, Sch Publ Hlth, Boston, MA 02115 USA
[3] Tibotec Virco, Mechelen, Belgium
关键词
cluster analysis; genotype; human immunodeficiency virus type 1; patterning; phenotype; recursive partitioning; weighted distance;
D O I
10.1046/j.1467-9876.2003.05094.x
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Understanding the relationship between human immunodeficiency virus type 1 genotypic markers of resistance and response to therapy presents an analytic challenge because of the high dimensionality of the viral genome and the complex interactions across sites on the genome. Recursive partitioning is a natural approach to classifying patients on the basis of covariates that capture information on the variability of a response variable and is therefore well suited to handling both of these features of the problem. In the human immunodeficiency virus genetics setting, we aim to classify patients on the basis of the genotypic characteristics of their infecting viral population so that the resulting groups explain the most variability in drug susceptibility phenotype. Recursive partitioning is also informative in identifying meaningful genotypic patterns that define the resulting classes. By combining dimension reduction techniques with recursive partitioning we can arrive at different classification schemes which are potentially more informative. Furthermore, the use of weighted distance metrics in the clustering algorithm allows us to update and potentially to improve on this classification scheme. An illustration of the relative benefits of the various approaches is provided by using 2559 protease sequences and the corresponding 50% inhibitory concentrations for Indinavir.
引用
收藏
页码:311 / 323
页数:13
相关论文
共 17 条
[1]   Diversity and complexity of HIV-1 drug resistance: A bioinformatics approach to predicting phenotype from genotype [J].
Beerenwinkel, N ;
Schmidt, B ;
Walter, H ;
Kaiser, R ;
Lengauer, T ;
Hoffmann, D ;
Korn, K ;
Selbig, J .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (12) :8271-8276
[2]   Genetic correlates of in vivo viral resistance to indinavir, a human immunodeficiency virus type 1 protease inhibitor [J].
Condra, JH ;
Holder, DJ ;
Schleif, WA ;
Blahy, OM ;
Danovich, RM ;
Gabryelski, LJ ;
Graham, DJ ;
Laird, D ;
Quintero, JC ;
Rhodes, A ;
Robbins, HL ;
Roth, E ;
Shivaprakash, M ;
Yang, T ;
Chodakewitz, JA ;
Deutsch, PJ ;
Leavitt, RY ;
Massari, FE ;
Mellors, JW ;
Squires, KE ;
Steigbigel, RT ;
Teppler, H ;
Emini, EA .
JOURNAL OF VIROLOGY, 1996, 70 (12) :8270-8276
[3]  
COSMAN PC, 1994, J APPL STAT, V21, P93
[4]   Characterizing the relationship between HIV-1 genotype and phenotype: Prediction-based classification [J].
Foulkes, AS ;
De Gruttola, V .
BIOMETRICS, 2002, 58 (01) :145-156
[5]  
Hartigan J. A., 1975, CLUSTERING ALGORITHM
[6]  
JUKES TH, 1969, EVOLUTION PROTEIN MO, P21
[8]  
Kohonen T., 1997, Self-organizing Maps, V2nd ed.
[9]   A preliminary evaluation of nelfinavir mesylate, an inhibitor of human immunodeficiency virus (HIV)-1 protease, to treat HIV infection [J].
Markowitz, M ;
Conant, M ;
Hurley, A ;
Schluger, R ;
Duran, M ;
Peterkin, J ;
Chapman, S ;
Patick, A ;
Hendricks, A ;
Yuen, GJ ;
Hoskins, W ;
Clendeninn, N ;
Ho, DD .
JOURNAL OF INFECTIOUS DISEASES, 1998, 177 (06) :1533-1540
[10]  
NEI M, 1986, MOL BIOL EVOL, V3, P418