NOTUNG: A program for dating gene duplications and optimizing gene family trees

被引:385
作者
Chen, K
Durand, D
Farach-Colton, M
机构
[1] Carnegie Mellon Univ, Dept Biol Sci, Pittsburgh, PA 15213 USA
[2] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
[3] Rutgers State Univ, Dept Comp Sci, Piscataway, NJ 08855 USA
关键词
gene duplication; gene family; duplication/loss models; phylogeny; evolutionary tree;
D O I
10.1089/106652700750050871
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Large scale gene duplication is a major force driving the evolution of genetic functional innovation. Whole genome duplications are widely believed to have played an important role in the evolution of the maize, yeast, and vertebrate genomes, The use of evolutionary trees to analyze:the history of gene duplication and estimate duplication times provides a powerful tool for studying this process. Many studies in the molecular evolution literature have used this approach on small data sets, using analyses performed by hand, The rapid growth of genetic sequence data will soon allow similar studies on a genomic scale, but such studies will be limited unless the analysis can be automated. Even existing data sets admit alternative hypotheses that would be too tedious to consider without automation, In this paper, we describe a program called NOTUNG that facilitates large scale analysis, using both rooted and unrooted trees, When tested on trees analyzed in the literature, NOTUNG consistently yielded results that agree with the assessments in the original publications, Thus, NOTUNG provides a basic building block for inferring duplication dates from gene trees automatically and can also be used as an exploratory analysis tool for evaluating alternative hypotheses.
引用
收藏
页码:429 / 447
页数:19
相关论文
共 33 条
[1]  
BENDER M, 2000, LATIN 00
[2]  
CHEN K, 2000, RECOMB 2000 4 ANN IN
[3]   A LEISURELY LOOK AT THE BOOTSTRAP, THE JACKKNIFE, AND CROSS-VALIDATION [J].
EFRON, B ;
GONG, G .
AMERICAN STATISTICIAN, 1983, 37 (01) :36-48
[4]   Evolutionary significance of intra-genome duplications on human chromosomes [J].
Endo, T ;
Imanishi, T ;
Gojobori, T ;
Inoko, H .
GENE, 1997, 205 (1-2) :19-27
[5]   Duplication-based measures of difference between gene and species trees [J].
Eulenstein, O ;
Mirkin, B ;
Vingron, M .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1998, 5 (01) :135-148
[6]  
EULENSTEIN O, 1996, DIMACS SERIES DISCRE, V37, P71
[7]   Life with 6000 genes [J].
Goffeau, A ;
Barrell, BG ;
Bussey, H ;
Davis, RW ;
Dujon, B ;
Feldmann, H ;
Galibert, F ;
Hoheisel, JD ;
Jacq, C ;
Johnston, M ;
Louis, EJ ;
Mewes, HW ;
Murakami, Y ;
Philippsen, P ;
Tettelin, H ;
Oliver, SG .
SCIENCE, 1996, 274 (5287) :546-&
[8]   FITTING THE GENE LINEAGE INTO ITS SPECIES LINEAGE, A PARSIMONY STRATEGY ILLUSTRATED BY CLADOGRAMS CONSTRUCTED FROM GLOBIN SEQUENCES [J].
GOODMAN, M ;
CZELUSNIAK, J ;
MOORE, GW ;
ROMEROHERRERA, AE ;
MATSUDA, G .
SYSTEMATIC ZOOLOGY, 1979, 28 (02) :132-163
[9]  
GRAUR D, 1996, NATURE
[10]   Reconstruction of ancient molecular phylogeny [J].
Guigo, R ;
Muchnik, I ;
Smith, TF .
MOLECULAR PHYLOGENETICS AND EVOLUTION, 1996, 6 (02) :189-213