Evolutionary sequence analysis of complete eukaryote genomes

被引:41
作者
Blair, JE
Shah, P
Hedges, SB [1 ]
机构
[1] Penn State Univ, NASA Astrobiol Inst, Mueller Lab 208, University Pk, PA 16802 USA
[2] Penn State Univ, Dept Biol, Mueller Lab 208, University Pk, PA 16802 USA
关键词
D O I
10.1186/1471-2105-6-53
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Gene duplication and gene loss during the evolution of eukaryotes have hindered attempts to estimate phylogenies and divergence times of species. Although current methods that identify clusters of orthologous genes in complete genomes have helped to investigate gene function and gene content, they have not been optimized for evolutionary sequence analyses requiring strict orthology and complete gene matrices. Here we adopt a relatively simple and fast genome comparison approach designed to assemble orthologs for evolutionary analysis. Our approach identifies single-copy genes representing only species divergences (panorthologs) in order to minimize potential errors caused by gene duplication. We apply this approach to complete sets of proteins from published eukaryote genomes specifically for phylogeny and time estimation. Results: Despite the conservative criterion used, 753 panorthologs ( proteins) were identified for evolutionary analysis with four genomes, resulting in a single alignment of 287,000 amino acids. With this data set, we estimate that the divergence between deuterostomes and arthropods took place in the Precambrian, approximately 400 million years before the first appearance of animals in the fossil record. Additional analyses were performed with seven, 12, and 15 eukaryote genomes resulting in similar divergence time estimates and phylogenies. Conclusion: Our results with available eukaryote genomes agree with previous results using conventional methods of sequence data assembly from genomes. They show that large sequence data sets can be generated relatively quickly and efficiently for evolutionary analyses of complete genomes.
引用
收藏
页数:10
相关论文
共 75 条
[21]   The timing of eukaryotic evolution: Does a relaxed molecular clock reconcile proteins and fossils? [J].
Douzery, EJP ;
Snell, EA ;
Bapteste, E ;
Delsuc, F ;
Philippe, H .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (43) :15386-15391
[22]   CASES IN WHICH PARSIMONY OR COMPATIBILITY METHODS WILL BE POSITIVELY MISLEADING [J].
FELSENSTEIN, J .
SYSTEMATIC ZOOLOGY, 1978, 27 (04) :401-410
[23]  
Felsenstein J., 2002, PHYLOGENY INFERENCE
[24]   Determining divergence times with a protein clock: Update and reevaluation [J].
Feng, DF ;
Cho, G ;
Doolittle, RF .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1997, 94 (24) :13028-13033
[25]   Inferring species phylogenies from multiple genes: Concatenated sequence tree versus consensus gene tree [J].
Gadagkar, SR ;
Rosenberg, MS ;
Kumar, S .
JOURNAL OF EXPERIMENTAL ZOOLOGY PART B-MOLECULAR AND DEVELOPMENTAL EVOLUTION, 2005, 304B (01) :64-74
[26]   The genome sequence of the filamentous fungus Neurospora crassa [J].
Galagan, JE ;
Calvo, SE ;
Borkovich, KA ;
Selker, EU ;
Read, ND ;
Jaffe, D ;
FitzHugh, W ;
Ma, LJ ;
Smirnov, S ;
Purcell, S ;
Rehman, B ;
Elkins, T ;
Engels, R ;
Wang, SG ;
Nielsen, CB ;
Butler, J ;
Endrizzi, M ;
Qui, DY ;
Ianakiev, P ;
Pedersen, DB ;
Nelson, MA ;
Werner-Washburne, M ;
Selitrennikoff, CP ;
Kinsey, JA ;
Braun, EL ;
Zelter, A ;
Schulte, U ;
Kothe, GO ;
Jedd, G ;
Mewes, W ;
Staben, C ;
Marcotte, E ;
Greenberg, D ;
Roy, A ;
Foley, K ;
Naylor, J ;
Stabge-Thomann, N ;
Barrett, R ;
Gnerre, S ;
Kamal, M ;
Kamvysselis, M ;
Mauceli, E ;
Bielke, C ;
Rudd, S ;
Frishman, D ;
Krystofova, S ;
Rasmussen, C ;
Metzenberg, RL ;
Perkins, DD ;
Kroken, S .
NATURE, 2003, 422 (6934) :859-868
[27]   Genome sequence of the human malaria parasite Plasmodium falciparum [J].
Gardner, MJ ;
Hall, N ;
Fung, E ;
White, O ;
Berriman, M ;
Hyman, RW ;
Carlton, JM ;
Pain, A ;
Nelson, KE ;
Bowman, S ;
Paulsen, IT ;
James, K ;
Eisen, JA ;
Rutherford, K ;
Salzberg, SL ;
Craig, A ;
Kyes, S ;
Chan, MS ;
Nene, V ;
Shallom, SJ ;
Suh, B ;
Peterson, J ;
Angiuoli, S ;
Pertea, M ;
Allen, J ;
Selengut, J ;
Haft, D ;
Mather, MW ;
Vaidya, AB ;
Martin, DMA ;
Fairlamb, AH ;
Fraunholz, MJ ;
Roos, DS ;
Ralph, SA ;
McFadden, GI ;
Cummings, LM ;
Subramanian, GM ;
Mungall, C ;
Venter, JC ;
Carucci, DJ ;
Hoffman, SL ;
Newbold, C ;
Davis, RW ;
Fraser, CM ;
Barrell, B .
NATURE, 2002, 419 (6906) :498-511
[28]   An insect molecular clock dates the origin of the insects and accords with palaeontological and biogeographic landmarks [J].
Gaunt, MW ;
Miles, MA .
MOLECULAR BIOLOGY AND EVOLUTION, 2002, 19 (05) :748-761
[29]   Genome sequence of the Brown Norway rat yields insights into mammalian evolution [J].
Gibbs, RA ;
Weinstock, GM ;
Metzker, ML ;
Muzny, DM ;
Sodergren, EJ ;
Scherer, S ;
Scott, G ;
Steffen, D ;
Worley, KC ;
Burch, PE ;
Okwuonu, G ;
Hines, S ;
Lewis, L ;
DeRamo, C ;
Delgado, O ;
Dugan-Rocha, S ;
Miner, G ;
Morgan, M ;
Hawes, A ;
Gill, R ;
Holt, RA ;
Adams, MD ;
Amanatides, PG ;
Baden-Tillson, H ;
Barnstead, M ;
Chin, S ;
Evans, CA ;
Ferriera, S ;
Fosler, C ;
Glodek, A ;
Gu, ZP ;
Jennings, D ;
Kraft, CL ;
Nguyen, T ;
Pfannkoch, CM ;
Sitter, C ;
Sutton, GG ;
Venter, JC ;
Woodage, T ;
Smith, D ;
Lee, HM ;
Gustafson, E ;
Cahill, P ;
Kana, A ;
Doucette-Stamm, L ;
Weinstock, K ;
Fechtel, K ;
Weiss, RB ;
Dunn, DM ;
Green, ED .
NATURE, 2004, 428 (6982) :493-521
[30]   Life with 6000 genes [J].
Goffeau, A ;
Barrell, BG ;
Bussey, H ;
Davis, RW ;
Dujon, B ;
Feldmann, H ;
Galibert, F ;
Hoheisel, JD ;
Jacq, C ;
Johnston, M ;
Louis, EJ ;
Mewes, HW ;
Murakami, Y ;
Philippsen, P ;
Tettelin, H ;
Oliver, SG .
SCIENCE, 1996, 274 (5287) :546-&