1,000 structures and more from the MCSG

Cited by: 10
Authors
Lee, David [1]
de Beer, Tjaart A. P. [2]
Laskowski, Roman A. [2]
Thornton, Janet M. [2]
Orengo, Christine A. [1]
Affiliations
[1] UCL, Dept Struct & Mol Biol, London WC1E 6BT, England
[2] European Bioinformat Inst, Cambridge CB10 1SD, England
Keywords
FUNCTION PREDICTION; PROTEIN FAMILIES; GENOMICS; DATABASE; DOMAIN; CLASSIFICATION; SEQUENCES; RESOURCE; PROGRAM; TOOLS
DOI
10.1186/1472-6807-11-2
Chinese Library Classification
Q6 [Biophysics]
Discipline classification code
071011
Abstract
Background: The Midwest Center for Structural Genomics (MCSG) is one of the large-scale centres of the Protein Structure Initiative (PSI). During the first two phases of the PSI the MCSG has solved over a thousand protein structures. A criticism of structural genomics is that target selection strategies mean that some structures are solved without having a known function and thus are of little biomedical significance. Structures of unknown function have stimulated the development of methods for function prediction from structure.

Results: We show that the MCSG has met the stated goals of the PSI and use online resources and readily available function prediction methods to provide functional annotations for more than 90% of the MCSG structures. The structure-to-function prediction method ProFunc provides likely functions for many of the MCSG structures that cannot be annotated by sequence-based methods.

Conclusions: Although the focus of the PSI was structural coverage, many of the structures solved by the MCSG can also be associated with functional classes and biological roles of possible biomedical value.
Pages: 15