High-throughput computational and experimental techniques in structural genomics

Times cited: 47
Authors
Chance, MR [1]
Fiser, A
Sali, A
Pieper, U
Eswar, N
Xu, GP
Fajardo, JE
Radhakannan, T
Marinkovic, N
Affiliations
[1] Albert Einstein Coll Med, New York Struct Genom Res Consortium, Bronx, NY 10461 USA
[2] Albert Einstein Coll Med, Dept Physiol & Biophys, Bronx, NY 10461 USA
[3] Albert Einstein Coll Med, Dept Biochem, Bronx, NY 10461 USA
[4] Albert Einstein Coll Med, Ctr Synchrotron Biosci, Bronx, NY 10461 USA
[5] Univ Calif San Francisco, Dept Biopharmaceut Sci, San Francisco, CA 94143 USA
[6] Univ Calif San Francisco, Dept Pharmaceut Chem, San Francisco, CA 94143 USA
[7] Univ Calif San Francisco, Calif Inst Quantitat Biomed Res, San Francisco, CA 94143 USA
DOI
10.1101/gr.2537904
Chinese Library Classification (CLC)
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
Structural genomics aims to provide structural information for all possible ORF sequences through a combination of experimental and computational approaches. Access to genome sequences and cloning resources from an ever-widening array of organisms is driving high-throughput structural studies by the New York Structural Genomics Research Consortium. In this report, we outline the Consortium's progress in establishing its pipeline for structural genomics, and some of the experimental and bioinformatics efforts leading to structural annotation of proteins. The Consortium has established a pipeline for structural biology studies, automated modeling of ORF sequences using solved (template) structures, and a novel high-throughput approach (metallomics) to examining metal binding in purified protein targets. The Consortium has so far produced 493 purified proteins from >1077 expression vectors. A total of 95 have resulted in crystal structures, and 81 are deposited in the Protein Data Bank (PDB). Comparative modeling based on these structures has generated >40,000 structural models. We also initiated a high-throughput metal analysis of the purified proteins; this has determined that 10%-15% of the targets contain a stoichiometric structural or catalytic transition metal atom. The progress of the structural genomics centers in the U.S. and around the world suggests that the goal of providing useful structural information on almost all ORF domains will be realized. This projected resource will provide structural biology information important to understanding the function of most proteins of the cell.
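The pipeline counts quoted in the abstract can be read as stage-to-stage yields. The following is a minimal sketch of that arithmetic, not part of the original record: the names `PIPELINE` and `stage_yields` are hypothetical, and because the abstract gives the vector count only as a lower bound (">1077"), the first ratio is an upper bound on the true vector-to-protein yield.

```python
# Counts taken from the abstract. The vector count is a lower bound (">1077"),
# so the first yield computed below is an upper bound, not an exact value.
PIPELINE = [
    ("expression vectors", 1077),
    ("purified proteins", 493),
    ("crystal structures", 95),
    ("PDB depositions", 81),
]


def stage_yields(stages):
    """Return (from_stage, to_stage, fraction) for each consecutive pair of stages."""
    return [
        (src_name, dst_name, dst_count / src_count)
        for (src_name, src_count), (dst_name, dst_count) in zip(stages, stages[1:])
    ]


if __name__ == "__main__":
    for src, dst, frac in stage_yields(PIPELINE):
        print(f"{src} -> {dst}: {frac:.1%}")
```

Read this way, fewer than half of the expression vectors led to a purified protein, and roughly one in five purified proteins led to a crystal structure.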
Pages: 2145-2154
Page count: 10