A non-parametric approach to translating gene region heterogeneity associated with phenotype into location heterogeneity

被引:3
作者
Kowalski, J [1 ]
机构
[1] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
关键词
D O I
10.1093/bioinformatics/17.9.775
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The analysis of genetic data poses statistical problems in the form of high dimensionality with small sample sizes. The construction of a composite gene region (sequence pair) heterogeneity measure is one technique for reducing the dimensionality of the problem. This approach however is not without cost, since the contribution of locations to observed gene region differences between groups becomes entangled in this summary measure. This is problematic since it is of scientific interest to identify locations that together depict phenotype. Results: A method is proposed for relating observed gene region heterogeneity back to the location level. In the spirit of a factor analysis-type setting, the approach focuses on identifying a latent variable structure among locations to explain within and between group genetic differences associated with phenotype. The method is flexible for identifying either the additive contribution from individual locations or the additive contribution from a group of locations, to observed gene region heterogeneity, depending upon the weighting scheme used in constructing a gene region heterogeneity measure. The approach is illustrated with clinical trial data, where the problem of altered HIV drug susceptibility is examined through characterizing location contributions to HIV protease gene region differences associated with a phenotypic treatment response.
引用
收藏
页码:775 / 790
页数:16
相关论文
共 31 条
[1]  
[Anonymous], 1999, APPL MULTIVARIATE AN
[2]  
Baxter J, 1999, ANTIVIR THER S, V4, P43
[3]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[4]  
Breiman L., 1984, BIOMETRICS, DOI DOI 10.2307/2530946
[5]  
COSMAN PC, 1999, VECTOR QUANTIZATION
[6]  
Everitt B., 1993, CLUSTER ANAL
[7]   The use of multiple measurements in taxonomic problems [J].
Fisher, RA .
ANNALS OF EUGENICS, 1936, 7 :179-188
[8]  
Fisher Ronald A., 1935, DESIGN EXPT
[9]  
FOULKES AS, 2000, THESIS HARV SCH PUBL
[10]   Antiretroviral drug resistance testing in adults with HIV infection -: Implications for clinical management [J].
Hirsch, MS ;
Conway, B ;
D'Aquila, RT ;
Johnson, VA ;
Brun-Vézinet, F ;
Clotet, B ;
Demeter, LM ;
Hammer, SM ;
Jacobsen, DM ;
Kuritzkes, DR ;
Loveday, C ;
Mellors, JW ;
Vella, S ;
Richman, DD .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 1998, 279 (24) :1984-1991