Structural Annotation of Mycobacterium tuberculosis Proteome

被引:29
作者
Anand, Praveen [1 ,2 ]
Sankaran, Sandhya [1 ,2 ]
Mukherjee, Sumanta [1 ,2 ]
Yeturu, Kalidas [1 ,2 ]
Laskowski, Roman [3 ]
Bhardwaj, Anshu [5 ]
Bhagavat, Raghu [1 ,2 ]
Brahmachari, Samir K. [4 ,5 ]
Chandra, Nagasuma [1 ,2 ]
机构
[1] Indian Inst Sci, Dept Biochem, Bangalore 560012, Karnataka, India
[2] Indian Inst Sci, Bioinformat Ctr, Bangalore 560012, Karnataka, India
[3] Wellcome Trust Genome Campus, European Bioinformat Inst, Cambridge, England
[4] Inst Genom & Integrat Biol CSIR, New Delhi, India
[5] Council Ind & Sci Res, New Delhi, India
[6] CSIR, Open Source Drug Discovery OSDD Consortium, New Delhi, India
来源
PLOS ONE | 2011年 / 6卷 / 10期
关键词
LIGAND-BINDING SITES; MODELS; PREDICTION; ALGORITHM; PROTEINS; DATABASE; SEARCH; IDENTIFICATION; GENTHREADER; ALIGNMENT;
D O I
10.1371/journal.pone.0027044
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Of the similar to 4000 ORFs identified through the genome sequence of Mycobacterium tuberculosis (TB) H37Rv, experimentally determined structures are available for 312. Since knowledge of protein structures is essential to obtain a high-resolution understanding of the underlying biology, we seek to obtain a structural annotation for the genome, using computational methods. Structural models were obtained and validated for similar to 2877 ORFs, covering similar to 70% of the genome. Functional annotation of each protein was based on fold-based functional assignments and a novel binding site based ligand association. New algorithms for binding site detection and genome scale binding site comparison at the structural level, recently reported from the laboratory, were utilized. Besides these, the annotation covers detection of various sequence and sub-structural motifs and quaternary structure predictions based on the corresponding templates. The study provides an opportunity to obtain a global perspective of the fold distribution in the genome. The annotation indicates that cellular metabolism can be achieved with only 219 folds. New insights about the folds that predominate in the genome, as well as the fold-combinations that make up multi-domain proteins are also obtained. 1728 binding pockets have been associated with ligands through binding site identification and sub-structure similarity analyses. The resource (http://proline.physics.iisc.ernet.in/Tbstructuralannotation), being one of the first to be based on structure-derived functional annotations at a genome scale, is expected to be useful for better understanding of TB and for application in drug discovery. The reported annotation pipeline is fairly generic and can be applied to other genomes as well.
引用
收藏
页数:14
相关论文
共 56 条
[41]   Intraphagosomal Mycobacterium tuberculosis acquires iron from both extracellular transferrin and intracellular iron pools -: Impact of interferon-γ and hemochromatosis [J].
Olakanmi, O ;
Schlesinger, LS ;
Ahmed, A ;
Britigan, BE .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2002, 277 (51) :49727-49734
[42]   MODBASE, a database of annotated comparative protein structure models and associated resources [J].
Pieper, Ursula ;
Eswar, Narayanan ;
Webb, Ben M. ;
Eramian, David ;
Kelly, Libusha ;
Barkan, David T. ;
Carter, Hannah ;
Mankoo, Parminder ;
Karchin, Rachel ;
Marti-Renom, Marc A. ;
Davis, Fred P. ;
Sali, Andrej .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D347-D354
[43]   Recent improvements in prediction of protein structure by global optimization of a potential energy function [J].
Pillardy, A ;
Czaplewski, C ;
Liwo, A ;
Lee, J ;
Ripoll, DR ;
Kazmierkiewicz, R ;
Oldziej, S ;
Wedemeyer, WJ ;
Gibson, KD ;
Arnautova, YA ;
Saunders, J ;
Ye, YJ ;
Scheraga, HA .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (05) :2329-2333
[44]   The Catalytic Site Atlas: a resource of catalytic sites and residues identified in enzymes using structural data [J].
Porter, CT ;
Bartlett, GJ ;
Thornton, JM .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D129-D133
[45]   Protein family and fold occurrence in genomes: Power-law behaviour and evolutionary model [J].
Qian, J ;
Luscombe, NM ;
Gerstein, M .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 313 (04) :673-681
[46]   Advances in comparative protein-structure modelling [J].
Sanchez, R ;
Sali, A .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1997, 7 (02) :206-214
[47]   Statistical potential for assessment and prediction of protein structures [J].
Shen, Min-Yi ;
Sali, Andrej .
PROTEIN SCIENCE, 2006, 15 (11) :2507-2524
[48]   Protein structure alignment by incremental combinatorial extension (CE) of the optimal path [J].
Shindyalov, IN ;
Bourne, PE .
PROTEIN ENGINEERING, 1998, 11 (09) :739-747
[49]   PIC: Protein Interactions Calculator [J].
Tina, K. G. ;
Bhadra, R. ;
Srinivasan, N. .
NUCLEIC ACIDS RESEARCH, 2007, 35 :W473-W476
[50]   Kappa-alpha plot derived structural alphabet and BLOSUM-like substitution matrix for rapid search of protein structure database [J].
Tung, Chi-Hua ;
Huang, Jhang-Wei ;
Yang, Jinn-Moon .
GENOME BIOLOGY, 2007, 8 (03)