Completeness in structural genomics

被引:255
作者
Vitkup, D
Melamud, E
Moult, J
Sander, C
机构
[1] MIT, Ctr Genome Res, Cambridge, MA 02139 USA
[2] Harvard Univ, Dept Chem & Chem Biol, Cambridge, MA 02138 USA
[3] Univ Maryland, Maryland Biotechnol Inst, Ctr Adv Res Biotechnol, Rockville, MD 20850 USA
[4] Millennium Pharmaceut Inc, Cambridge, MA 02139 USA
关键词
D O I
10.1038/88640
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Structural genomics has the goal of obtaining useful, three-dimensional models of all proteins by a combination of experimental structure determination and comparative model building. We evaluate different strategies for optimizing information return on effort. The strategy that maximizes structural coverage requires about seven times fewer structure determinations compared with the strategy in which targets are selected at random, With a choice of reasonable model quality and the goal of 90% coverage, we extrapolate the estimate of the total effort of structural genomics. It would take similar to 16,000 carefully selected structure determinations to construct useful atomic models for the vast majority of all proteins. In practice, unless there is global coordination of target selection, the total effort will likely increase by a factor of three. The task can be accomplished within a decade provided that selection of targets is highly coordinated and significant funding is available.
引用
收藏
页码:559 / 566
页数:8
相关论文
共 54 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
Ashburner M, 1999, GENETICS, V153, P179
[3]  
BAIROCH A, 1996, NUCLEIC ACIDS RES, V24, P17
[4]   The Pfam protein families database [J].
Bateman, A ;
Birney, E ;
Durbin, R ;
Eddy, SR ;
Howe, KL ;
Sonnhammer, ELL .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :263-266
[5]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542
[6]  
Bourne PE, 1999, BIOINFORMATICS, V15, P715
[7]  
Brenner SE, 2000, PROTEIN SCI, V9, P197
[8]   Structural genomics: beyond the Human Genome Project [J].
Burley, SK ;
Almo, SC ;
Bonanno, JB ;
Capel, M ;
Chance, MR ;
Gaasterland, T ;
Lin, DW ;
Sali, A ;
Studier, FW ;
Swaminathan, S .
NATURE GENETICS, 1999, 23 (02) :151-157
[9]   Recent improvements of the ProDom database of protein domain families [J].
Corpet, F ;
Gouzy, J ;
Kahn, D .
NUCLEIC ACIDS RESEARCH, 1999, 27 (01) :263-267
[10]   Prediction of transmembrane alpha-helices in prokaryotic membrane proteins: the dense alignment surface method [J].
Cserzo, M ;
Wallin, E ;
Simon, I ;
vonHeijne, G ;
Elofsson, A .
PROTEIN ENGINEERING, 1997, 10 (06) :673-676