PSI-2: Structural Genomics to Cover Protein Domain Family Space

Cited by: 83
Authors
Dessailly, Benoit H. [1]
Nair, Rajesh [2]
Jaroszewski, Lukasz [3]
Fajardo, J. Eduardo [4]
Kouranov, Andrei [5]
Lee, David [1]
Fiser, Andras [4]
Godzik, Adam [3]
Rost, Burkhard [6, 7]
Orengo, Christine [1]
Affiliations
[1] UCL, Dept Biol Mol & Struct, London WC1E 6BT, England
[2] US FDA, Ctr Devices & Radiol Hlth, Rockville, MD 20850 USA
[3] Burnham Inst, La Jolla, CA 92037 USA
[4] Albert Einstein Coll Med, Dept Syst & Computat Biol, Bronx, NY 10461 USA
[5] Rutgers State Univ, Dept Chem & Chem Biol, Piscataway, NJ 08854 USA
[6] Columbia Univ, Ctr Computat Biol & Bioinformat C2B2, Dept Biochem & Mol Biophys, New York, NY 10032 USA
[7] Columbia Univ, NE Struct Genom Consortium NESG, New York, NY 10032 USA
Funding
US National Institutes of Health (NIH);
Keywords
COMPLETED GENOMES; EVOLUTION; DATABASE; ALIGNMENT; SUPERFAMILIES; CLASSIFICATION; METAGENOMICS; SEQUENCES; RESOURCE; PROGRESS;
DOI
10.1016/j.str.2009.03.015
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
One major objective of structural genomics efforts, including the NIH-funded Protein Structure Initiative (PSI), has been to increase the structural coverage of protein sequence space. Here, we present the target selection strategy used during the second phase of PSI (PSI-2). This strategy, jointly devised by the bioinformatics groups associated with the PSI-2 large-scale production centers, targets representatives from large, structurally uncharacterized protein domain families, and from structurally uncharacterized subfamilies in very large and diverse families with incomplete structural coverage. These very large families are extremely diverse both structurally and functionally, and are highly overrepresented in known proteomes. On the basis of several metrics, we then discuss to what extent PSI-2, during its first 3 years, has increased the structural coverage of genomes, and contributed structural and functional novelty. Together, the results presented here suggest that PSI-2 is successfully meeting its objectives and provides useful insights into structural and functional space.
Pages: 869-881
Page count: 13