Distance-based reconstruction of tree models for oncogenesis

Cited by: 78
Authors
Desper, R
Jiang, F
Kallioniemi, OP
Moch, H
Papadimitriou, CH
Schäffer, AA
Affiliations
[1] NIH, Natl Ctr Biotechnol Informat, Bethesda, MD 20894 USA
[2] Univ Basel, Inst Pathol, Basel, Switzerland
[3] NHGRI, Canc Genet Branch, NIH, Bethesda, MD 20892 USA
[4] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
[5] Deutsch Krebsforschungszentrum, Abt Theoret Bioinformat, D-6900 Heidelberg, Germany
Keywords
cancer; algorithms; phylogenetic trees; comparative genomic hybridization
DOI
10.1089/10665270050514936
Chinese Library Classification number
Q5 [Biochemistry];
Subject classification code
071010 ; 081704 ;
Abstract
Comparative genomic hybridization (CGH) is a laboratory method to measure gains and losses in the copy number of chromosomal regions in tumor cells. It is hypothesized that certain DNA gains and losses are related to cancer progression and that the patterns of these changes are relevant to the clinical consequences of the cancer. It is therefore of interest to develop models that predict the occurrence of these events, as well as techniques for learning such models from CGH data. We continue our study, begun in Desper et al. (1999), of the mathematical foundations for inferring a model of tumor progression from a CGH data set. In that paper, we proposed a class of probabilistic tree models and showed that an algorithm based on maximum-weight branching in a graph correctly infers the topology of the tree, under plausible assumptions. In this paper, we extend that work in the direction of the so-called distance-based trees, in which events are leaves of the tree, in the style of models common in phylogenetics. Then we show how to reconstruct the distance-based trees using tree-fitting algorithms developed by researchers in phylogenetics. The main advantages of the distance-based models are that (1) they represent information about co-occurrences of all pairs of events, instead of just some pairs, (2) they allow quantitative predictions about which events occur early in tumor progression, and (3) they bring into play the extensive methodology and software developed in the context of phylogenetics. We illustrate the distance-based tree method, and how it complements the branching tree method, with a CGH data set for renal cancer.
Pages: 789-803
Page count: 15
Related papers
37 in total
[1]   On the approximability of numerical taxonomy (fitting distances by tree metrics) [J].
Agarwala, R ;
Bafna, V ;
Farach, M ;
Paterson, M ;
Thorup, M .
SIAM JOURNAL ON COMPUTING, 1999, 28 (03) :1073-1085
[2]   Nearly tight bounds on the learnability of evolution [J].
Ambainis, A ;
Desper, R ;
Farach, M ;
Kannan, S .
38TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, PROCEEDINGS, 1997, :524-533
[3]  
[Anonymous], 1971, STAT DECISION THEORY
[4]  
Barthelemy J.P., 1991, TREES PROXIMITY REPR
[5]   STATISTICAL-ANALYSIS OF CYTOGENETIC ABNORMALITIES IN HUMAN CANCER-CELLS [J].
BRODEUR, GM ;
TSIATIS, AA ;
WILLIAMS, DL ;
LUTHARDT, FW ;
GREEN, AA .
CANCER GENETICS AND CYTOGENETICS, 1982, 7 (02) :137-152
[6]  
BUETOW KH, 1998, AM J HUM GENET, V63, pA336
[7]  
CAVENDER JA, 1978, MATH BIOSCI, V40, P271, DOI 10.1016/0025-5564(78)90089-5
[8]   Numerical taxonomy on data: Experimental results [J].
Cohen, J ;
Farach, M .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1997, 4 (04) :547-558
[10]   Inferring tree models for oncogenesis from comparative genome hybridization data [J].
Desper, R ;
Jiang, F ;
Kallioniemi, OP ;
Moch, H ;
Papadimitriou, CH ;
Schäffer, AA .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1999, 6 (01) :37-51