PhylomeDB: a database for genome-wide collections of gene phylogenies

被引:73
作者
Huerta-Cepas, Jaime [1 ]
Bueno, Anibal [1 ]
Dopazo, Joaquin [1 ]
Gabaldon, Toni [1 ]
机构
[1] Ctr Invest Principe Felipe, Bioinformat Dept, Valencia 46013, Spain
关键词
D O I
10.1093/nar/gkm899
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The complete collection of evolutionary histories of all genes in a genome, also known as phylome, constitutes a valuable source of information. The reconstruction of phylomes has been previously prevented by large demands of time and computer power, but is now feasible thanks to recent developments in computers and algorithms. To provide a publicly available repository of complete phylomes that allows researchers to access and store large-scale phylogenomic analyses, we have developed PhylomeDB. PhylomeDB is a database of complete phylomes derived for different genomes within a specific taxonomic range. All phylomes in the database are built using a high-quality phylogenetic pipeline that includes evolutionary model testing and alignment trimming phases. For each genome, PhylomeDB provides the alignments, phylogentic trees and tree-based orthology predictions for every single encoded protein. The current version of PhylomeDB includes the phylomes of Human, the yeast Saccharomyces cerevisiae and the bacterium Escherichia coli, comprising a total of 32 289 seed sequences with their corresponding alignments and 172 324 phylogenetic trees. PhylomeDB can be publicly accessed at http://phylomedb.bioinfo.cipf.es.
引用
收藏
页码:D491 / D496
页数:6
相关论文
共 17 条
[1]  
Akaike H, 1973, 2 INT S INFORM THEOR, P199, DOI 10.1007/978-1-4612-1694-0
[2]   Ensembl 2006 [J].
Birney, E. ;
Andrews, D. ;
Caccamo, M. ;
Chen, Y. ;
Clarke, L. ;
Coates, G. ;
Cox, T. ;
Cunningham, F. ;
Curwen, V. ;
Cutts, T. ;
Down, T. ;
Durbin, R. ;
Fernandez-Suarez, X. M. ;
Flicek, P. ;
Graf, S. ;
Hammond, M. ;
Herrero, J. ;
Howe, K. ;
Iyer, V. ;
Jekosch, K. ;
Kahari, A. ;
Kasprzyk, A. ;
Keefe, D. ;
Kokocinski, F. ;
Kulesha, E. ;
London, D. ;
Longden, I. ;
Melsopp, C. ;
Meidl, P. ;
Overduin, B. ;
Parker, A. ;
Proctor, G. ;
Prlic, A. ;
Rae, M. ;
Rios, D. ;
Redmond, S. ;
Schuster, M. ;
Sealy, I. ;
Searle, S. ;
Severin, J. ;
Slater, G. ;
Smedley, D. ;
Smith, J. ;
Stabenau, A. ;
Stalker, J. ;
Trevanion, S. ;
Ureta-Vidal, A. ;
Vogel, J. ;
White, S. ;
Woodwark, C. .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D556-D561
[3]   The Jalview Java']Java alignment editor [J].
Clamp, M ;
Cuff, J ;
Searle, SM ;
Barton, GJ .
BIOINFORMATICS, 2004, 20 (03) :426-427
[4]   From phylogenetics to phylogenomics:: The evolutionary relationships of insect endosymbiotic γ-proteobacteria as a test case [J].
Comas, Inaki ;
Moya, Andres ;
Gonzalez-Candelas, Fernando .
SYSTEMATIC BIOLOGY, 2007, 56 (01) :1-16
[5]   HOVERGEN - A DATABASE OF HOMOLOGOUS VERTEBRATE GENES [J].
DURET, L ;
MOUCHIROUD, D ;
GOUY, M .
NUCLEIC ACIDS RESEARCH, 1994, 22 (12) :2360-2365
[6]   MUSCLE: a multiple sequence alignment method with reduced time and space complexity [J].
Edgar, RC .
BMC BIOINFORMATICS, 2004, 5 (1) :1-19
[7]   Reconstruction of the proto-mitochondrial metabolism [J].
Gabaldón, T ;
Huynen, MA .
SCIENCE, 2003, 301 (5633) :609-609
[8]  
Gabaldón T, 2005, EVOL BIOINFORM, V1, P51
[9]   BIONJ: An improved version of the NJ algorithm based on a simple model of sequence data [J].
Gascuel, O .
MOLECULAR BIOLOGY AND EVOLUTION, 1997, 14 (07) :685-695
[10]   A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood [J].
Guindon, S ;
Gascuel, O .
SYSTEMATIC BIOLOGY, 2003, 52 (05) :696-704