ProPhylo: partial phylogenetic profiling to guide protein family construction and assignment of biological process

被引:14
作者
Basu, Malay K. [1 ]
Selengut, Jeremy D. [1 ]
Haft, Daniel H. [1 ]
机构
[1] J Craig Venter Inst, Rockville, MD 20850 USA
来源
BMC BIOINFORMATICS | 2011年 / 12卷
关键词
FUNCTIONAL LINKAGES; CELLULAR PATHWAYS; DNA BACKBONE; BACTERIAL; GENOMES; SYSTEM; GENES; PHOSPHOROTHIOATION; IDENTIFICATION; PREDICTION;
D O I
10.1186/1471-2105-12-434
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Phylogenetic profiling is a technique of scoring co-occurrence between a protein family and some other trait, usually another protein family, across a set of taxonomic groups. In spite of several refinements in recent years, the technique still invites significant improvement. To be its most effective, a phylogenetic profiling algorithm must be able to examine co-occurrences among protein families whose boundaries are uncertain within large homologous protein superfamilies. Results: Partial Phylogenetic Profiling (PPP) is an iterative algorithm that scores a given taxonomic profile against the taxonomic distribution of families for all proteins in a genome. The method works through optimizing the boundary of each protein family, rather than by relying on prebuilt protein families or fixed sequence similarity thresholds. Double Partial Phylogenetic Profiling (DPPP) is a related procedure that begins with a single sequence and searches for optimal granularities for its surrounding protein family in order to generate the best query profiles for PPP. We present ProPhylo, a high-performance software package for phylogenetic profiling studies through creating individually optimized protein family boundaries. ProPhylo provides precomputed databases for immediate use and tools for manipulating the taxonomic profiles used as queries. Conclusion: ProPhylo results show universal markers of methanogenesis, a new DNA phosphorothioation-dependent restriction enzyme, and efficacy in guiding protein family construction. The software and the associated databases are freely available under the open source Perl Artistic License from ftp://ftp.jcvi.org/pub/data/ppp/.
引用
收藏
页数:14
相关论文
共 47 条
[1]   Predicting functional gene links from phylogenetic-statistical analyses of whole genomes [J].
Barker, D ;
Pagel, M .
PLOS COMPUTATIONAL BIOLOGY, 2005, 1 (01) :24-31
[2]   Constrained models of evolution lead to improved prediction of functional linkage from correlated gain and loss of genes [J].
Barker, Daniel ;
Meade, Andrew ;
Pagel, Mark .
BIOINFORMATICS, 2007, 23 (01) :14-20
[3]   MultiLoc2: integrating phylogeny and Gene Ontology terms improves subcellular protein localization prediction [J].
Blum, Torsten ;
Briesemeister, Sebastian ;
Kohlbacher, Oliver .
BMC BIOINFORMATICS, 2009, 10 :274
[4]   Use of logic relationships to decipher protein network organization [J].
Bowers, PM ;
Cokus, SJ ;
Elsenberg, D ;
Yeates, TO .
SCIENCE, 2004, 306 (5705) :2246-2249
[5]   The [Fe-Fe]-hydrogenase maturation protein HydF from Thermotoga maritima is a GTPase with an iron-sulfur cluster [J].
Brazzolotto, X ;
Rubach, JK ;
Gaillard, J ;
Gambarelli, S ;
Atta, M ;
Fontecave, M .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2006, 281 (02) :769-774
[6]   SherLoc2: A High-Accuracy Hybrid Method for Predicting Subcellular Localization of Proteins [J].
Briesemeister, Sebastian ;
Blum, Torsten ;
Brady, Scott ;
Lam, Yin ;
Kohlbacher, Oliver ;
Shatkay, Hagit .
JOURNAL OF PROTEOME RESEARCH, 2009, 8 (11) :5363-5366
[7]   Count: evolutionary analysis of phylogenetic profiles with parsimony and likelihood [J].
Csuroes, Miklos .
BIOINFORMATICS, 2010, 26 (15) :1910-1912
[8]  
Date Shailesh V., 2008, V453, P201, DOI 10.1007/978-1-60327-429-6_9
[9]   Discovery of uncharacterized cellular systems by genome-wide analysis of functional linkages [J].
Date, SV ;
Marcotte, EM .
NATURE BIOTECHNOLOGY, 2003, 21 (09) :1055-1062
[10]   MicrobesOnline: an integrated portal for comparative and functional genomics [J].
Dehal, Paramvir S. ;
Joachimiak, Marcin P. ;
Price, Morgan N. ;
Bates, John T. ;
Baumohl, Jason K. ;
Chivian, Dylan ;
Friedland, Greg D. ;
Huang, Katherine H. ;
Keller, Keith ;
Novichkov, Pavel S. ;
Dubchak, Inna L. ;
Alm, Eric J. ;
Arkin, Adam P. .
NUCLEIC ACIDS RESEARCH, 2010, 38 :D396-D400