PHOG: a database of supergenomes built from proteome complements

Cited by: 7
Authors
Merkeev, Igor V.
Novichkov, Pavel S.
Mironov, Andrey A.
Affiliations
[1] State Sci Ctr GosNIIGenet, Moscow 113545, Russia
[2] Natl Lib Med, Natl Ctr Biotechnol Informat, Bethesda, MD 20894 USA
[3] Moscow MV Lomonosov State Univ, Dept Bioengn & Bioinformat, Moscow 119992, Russia
DOI
10.1186/1471-2148-6-52
Chinese Library Classification
Q [Biological Sciences];
Subject classification codes
07 ; 0710 ; 09 ;
Abstract
Background: Orthologs and paralogs are widely used terms in modern comparative genomics. Existing procedures for resolving orthologous/paralogous relationships often rely on manual revision of clusters of orthologous groups and/or lack a rigorous evolutionary basis.
Description: We developed a completely automated procedure that creates clusters of orthologous groups at each node of the taxonomy tree (PHOGs, Phylogenetic Orthologous Groups). The result of this procedure is a tree of orthologous groups. Each cluster is a "supergene" represented by an "ancestral" sequence obtained from the multiple alignment of orthologous and paralogous genes. The procedure was applied to the taxonomy tree of organisms from all three domains of life. Protein complements from 50 bacterial, archaeal and eukaryotic species were used to create PHOGs at all tree nodes, and 51,367 PHOGs were obtained at the root node.
Conclusion: The PHOG database demonstrates that it is possible to automatically process any number of sequenced genomes and to reconstruct orthologous and paralogous relationships between genomes using a rigorous evolutionary approach. This database can become a very useful tool in various areas of comparative genomics.
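The idea of building groups at every node of a taxonomy tree can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the clustering criterion, so the `same_family` predicate, the `TaxonNode` class, the protein identifiers and the tiny three-species tree below are all hypothetical, and the sketch omits the multiple alignment and the "ancestral" sequence that represents each PHOG. It only shows a bottom-up (post-order) traversal in which the groups of the child nodes are pooled and merged at each parent.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, FrozenSet, List

Cluster = FrozenSet[str]  # a group of protein identifiers


@dataclass
class TaxonNode:
    """Hypothetical taxonomy node; only leaves (species) carry proteins."""
    name: str
    children: List["TaxonNode"] = field(default_factory=list)
    proteins: List[str] = field(default_factory=list)


def build_groups(
    node: TaxonNode,
    linked: Callable[[Cluster, Cluster], bool],
    out: Dict[str, List[Cluster]],
) -> List[Cluster]:
    """Post-order traversal: pool the children's groups, then merge linked ones."""
    if node.children:
        pooled: List[Cluster] = []
        for child in node.children:
            pooled.extend(build_groups(child, linked, out))
    else:
        # At a leaf, every protein starts as its own group.
        pooled = [frozenset([p]) for p in node.proteins]

    merged: List[Cluster] = []
    for cluster in pooled:
        # Absorb every existing group that is linked to the incoming one.
        hits = [m for m in merged if linked(m, cluster)]
        for m in hits:
            merged.remove(m)
            cluster = cluster | m
        merged.append(cluster)

    out[node.name] = merged
    return merged


def same_family(x: Cluster, y: Cluster) -> bool:
    """Toy stand-in for a sequence-similarity test (assumption, not from the paper):
    two groups are linked if any pair of members shares the label after '|'."""
    return any(a.split("|")[1] == b.split("|")[1] for a in x for b in y)


# Hypothetical three-species tree, for illustration only.
tree = TaxonNode("root", children=[
    TaxonNode("Bacteria", children=[
        TaxonNode("ecoli", proteins=["ecoli|recA", "ecoli|lacZ"]),
        TaxonNode("bsub", proteins=["bsub|recA"]),
    ]),
    TaxonNode("Archaea", children=[
        TaxonNode("mjan", proteins=["mjan|radA"]),
    ]),
])

groups: Dict[str, List[Cluster]] = {}
build_groups(tree, same_family, groups)
for taxon, clusters in groups.items():
    print(taxon, len(clusters))
```

In this sketch `groups` maps each taxon name to the groups formed at that node, so the entry for the root plays the role of the root-level groups mentioned in the abstract. A real pipeline would replace `same_family` with a similarity search between the representative sequences of the groups.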
Pages: 9