Global Functional Atlas of Escherichia coli Encompassing Previously Uncharacterized Proteins

Cited by: 263
Authors
Hu, Pingzhao [2]
Janga, Sarath Chandra [2, 3]
Babu, Mohan [2]
Diaz-Mejia, J. Javier [1, 2]
Butland, Gareth [2]
Yang, Wenhong [2]
Pogoutse, Oxana [2]
Guo, Xinghua [2]
Phanse, Sadhna [2]
Wong, Peter [2]
Chandran, Shamanta [2]
Christopoulos, Constantine [2]
Nazarians-Armavil, Anaies [2]
Nasseri, Negin Karimi [2]
Musso, Gabriel [2]
Ali, Mehrab [2]
Nazemof, Nazila [4, 5]
Eroukova, Veronika [4, 5]
Golshani, Ashkan [4, 5]
Paccanaro, Alberto [6]
Greenblatt, Jack F. [2]
Moreno-Hagelsieb, Gabriel [1]
Emili, Andrew [2]
Affiliations
[1] Wilfrid Laurier Univ, Dept Biol, Waterloo, ON N2L 3C5, Canada
[2] Univ Toronto, Banting & Best Dept Med Res, Terrence Donnelly Ctr Cellular & Biomol Res, Toronto, ON, Canada
[3] MRC, Mol Biol Lab, Cambridge CB2 2QH, England
[4] Carleton Univ, Dept Biol, Ottawa, ON K1S 5B6, Canada
[5] Carleton Univ, Ottawa Inst Syst Biol, Ottawa, ON K1S 5B6, Canada
[6] Univ London, Dept Comp Sci, Egham, Surrey, England
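Funding
Natural Sciences and Engineering Research Council of Canada; Biotechnology and Biological Sciences Research Council (UK); Canadian Institutes of Health Research;
Keywords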
INTERACTION MAP; POLYMYXIN RESISTANCE; INTERACTION NETWORKS; HIGH-THROUGHPUT; GENE ONTOLOGY; IDENTIFICATION; DATABASE; K-12; PREDICTION; ANNOTATION;
DOI
10.1371/journal.pbio.1000096
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010 ; 081704 ;
Abstract
One-third of the 4,225 protein-coding genes of Escherichia coli K-12 remain functionally unannotated (orphans). Many map to distant clades such as Archaea, suggesting involvement in basic prokaryotic traits, whereas others appear restricted to E. coli, including pathogenic strains. To elucidate the orphans' biological roles, we performed an extensive proteomic survey using affinity-tagged E. coli strains and generated comprehensive genomic context inferences to derive a high-confidence compendium for virtually the entire proteome consisting of 5,993 putative physical interactions and 74,776 putative functional associations, most of which are novel. Clustering of the respective probabilistic networks revealed putative orphan membership in discrete multiprotein complexes and functional modules together with annotated gene products, whereas a machine-learning strategy based on network integration implicated the orphans in specific biological processes. We provide additional experimental evidence supporting orphan participation in protein synthesis, amino acid metabolism, biofilm formation, motility, and assembly of the bacterial cell envelope. This resource provides a "systems-wide" functional blueprint of a model microbe, with insights into the biological and evolutionary significance of previously uncharacterized proteins.
Pages: 929-947
Page count: 19