Integration of curated databases to identify genotype-phenotype associations

Cited by: 32
Authors
Goh, Chern-Sing
Gianoulis, Tara A.
Liu, Yang
Li, Jianrong
Paccanaro, Alberto
Lussier, Yves A. [1]
Gerstein, Mark
Affiliations
[1] Yale Univ, Program Computat Biol & Bioinformat, New Haven, CT 06520 USA
[2] Columbia Univ, Dept Biomed Informat, New York, NY USA
[3] Royal Holloway Univ London, Dept Comp Sci, Egham, Surrey, England
DOI
10.1186/1471-2164-7-257
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification code
071005; 0836; 090102; 100705
Abstract
Background: The ability to rapidly characterize an unknown microorganism is critical both in responding to infectious disease and in biodefense. Doing so requires a way of anticipating an organism's phenotype from the molecules encoded by its genome. However, the link between molecular composition (i.e., genotype) and phenotype in microbes is not obvious. While several studies have addressed this challenge, none has yet proposed a large-scale method that integrates curated biological information. Here we use a systematic approach to discover genotype-phenotype associations that combines phenotypic information from a biomedical informatics database, GIDEON, with the molecular information in the National Center for Biotechnology Information's Clusters of Orthologous Groups database (NCBI COGs).
Results: By integrating the information in the two databases, we are able to correlate the presence or absence of a given protein in a microbe with its phenotype, as measured by certain morphological characteristics or by survival in a particular growth medium. At a correlation score threshold of 0.8, 66% of the associations found were confirmed by the literature; at a threshold of 0.9, 86% were positively verified.
Conclusion: Our results suggest possible phenotypic manifestations for proteins biochemically associated with sugar metabolism and electron transport. Moreover, we believe our approach can be extended to linking pathogenic phenotypes with functionally related proteins.
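The correlation step described in the Results can be illustrated with a minimal sketch. The abstract does not say which correlation measure was used, so the Pearson correlation of two binary profiles (the phi coefficient) is an assumption here. The function names, the COG labels, the phenotype labels, and the six-organism toy data are all hypothetical and are not taken from GIDEON or NCBI COGs.

```python
from math import sqrt

def phi_correlation(x, y):
    """Pearson correlation of two equal-length 0/1 profiles (phi coefficient)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant profile carries no association signal
    return cov / sqrt(var_x * var_y)

def find_associations(cog_profiles, phenotype_profiles, threshold=0.8):
    """Return (cog, phenotype, score) triples scoring at or above the threshold."""
    hits = []
    for cog, presence in cog_profiles.items():
        for phenotype, trait in phenotype_profiles.items():
            score = phi_correlation(presence, trait)
            if score >= threshold:
                hits.append((cog, phenotype, score))
    return sorted(hits, key=lambda hit: -hit[2])

# Hypothetical toy data: one 0/1 entry per organism (six organisms).
cog_profiles = {
    "COG_A": [1, 1, 1, 0, 0, 0],  # protein family present in organisms 1-3
    "COG_B": [1, 0, 1, 0, 1, 0],
}
phenotype_profiles = {
    "grows_on_medium_X": [1, 1, 1, 0, 0, 0],
    "motile": [1, 1, 0, 0, 0, 1],
}

hits = find_associations(cog_profiles, phenotype_profiles, threshold=0.8)
for cog, phenotype, score in hits:
    print(f"{cog} ~ {phenotype}: {score:.2f}")
```

In this toy data only the COG_A and grows_on_medium_X pair clears the 0.8 threshold, because the two profiles are identical across the six organisms. Raising the threshold trades the number of candidate associations for a higher expected confirmation rate, which is the pattern the abstract reports for 0.8 versus 0.9.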
Pages: 10