Structural Annotation of Mycobacterium tuberculosis Proteome

被引:29
作者
Anand, Praveen [1 ,2 ]
Sankaran, Sandhya [1 ,2 ]
Mukherjee, Sumanta [1 ,2 ]
Yeturu, Kalidas [1 ,2 ]
Laskowski, Roman [3 ]
Bhardwaj, Anshu [5 ]
Bhagavat, Raghu [1 ,2 ]
Brahmachari, Samir K. [4 ,5 ]
Chandra, Nagasuma [1 ,2 ]
机构
[1] Indian Inst Sci, Dept Biochem, Bangalore 560012, Karnataka, India
[2] Indian Inst Sci, Bioinformat Ctr, Bangalore 560012, Karnataka, India
[3] Wellcome Trust Genome Campus, European Bioinformat Inst, Cambridge, England
[4] Inst Genom & Integrat Biol CSIR, New Delhi, India
[5] Council Ind & Sci Res, New Delhi, India
[6] CSIR, Open Source Drug Discovery OSDD Consortium, New Delhi, India
来源
PLOS ONE | 2011年 / 6卷 / 10期
关键词
LIGAND-BINDING SITES; MODELS; PREDICTION; ALGORITHM; PROTEINS; DATABASE; SEARCH; IDENTIFICATION; GENTHREADER; ALIGNMENT;
D O I
10.1371/journal.pone.0027044
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Of the similar to 4000 ORFs identified through the genome sequence of Mycobacterium tuberculosis (TB) H37Rv, experimentally determined structures are available for 312. Since knowledge of protein structures is essential to obtain a high-resolution understanding of the underlying biology, we seek to obtain a structural annotation for the genome, using computational methods. Structural models were obtained and validated for similar to 2877 ORFs, covering similar to 70% of the genome. Functional annotation of each protein was based on fold-based functional assignments and a novel binding site based ligand association. New algorithms for binding site detection and genome scale binding site comparison at the structural level, recently reported from the laboratory, were utilized. Besides these, the annotation covers detection of various sequence and sub-structural motifs and quaternary structure predictions based on the corresponding templates. The study provides an opportunity to obtain a global perspective of the fold distribution in the genome. The annotation indicates that cellular metabolism can be achieved with only 219 folds. New insights about the folds that predominate in the genome, as well as the fold-combinations that make up multi-domain proteins are also obtained. 1728 binding pockets have been associated with ligands through binding site identification and sub-structure similarity analyses. The resource (http://proline.physics.iisc.ernet.in/Tbstructuralannotation), being one of the first to be based on structure-derived functional annotations at a genome scale, is expected to be useful for better understanding of TB and for application in drug discovery. The reported annotation pipeline is fairly generic and can be applied to other genomes as well.
引用
收藏
页数:14
相关论文
共 56 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[3]  
[Anonymous], 2009, GLOB TUB CONTR SHORT
[4]  
Apweiler R, 2004, NUCLEIC ACIDS RES, V32, pD115, DOI [10.1093/nar/gkh131, 10.1093/nar/gkw1099]
[5]   An algorithm for constraint-based structural template matching: application to 3D templates with statistical analysis [J].
Barker, JA ;
Thornton, JM .
BIOINFORMATICS, 2003, 19 (13) :1644-1649
[6]   The worldwide Protein Data Bank (wwPDB): ensuring a single, uniform archive of PDB data [J].
Berman, Helen ;
Henrick, Kim ;
Nakamura, Haruki ;
Markley, John L. .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D301-D303
[7]   Protein structure homology modeling using SWISS-MODEL workspace [J].
Bordoli, Lorenza ;
Kiefer, Florian ;
Arnold, Konstantin ;
Benkert, Pascal ;
Battey, James ;
Schwede, Torsten .
NATURE PROTOCOLS, 2009, 4 (01) :1-13
[8]   A tour of structural genomics [J].
Brenner, SE .
NATURE REVIEWS GENETICS, 2001, 2 (10) :801-809
[9]   An overview of structural genomics [J].
Burley, SK .
NATURE STRUCTURAL BIOLOGY, 2000, 7 (Suppl 11) :932-934
[10]   Re-annotation of the genome sequence of Mycobacterium tuberculosis H37Rv [J].
Camus, JC ;
Pryor, MJ ;
Médigue, C ;
Cole, ST .
MICROBIOLOGY-SGM, 2002, 148 :2967-2973