GTOP: a database of protein structures predicted from genome sequences

被引:62
作者
Kawabata, T
Fukuchi, S
Homma, K
Ota, M
Araki, J
Ito, T
Ichiyoshi, N
Nishikawa, K
机构
[1] Natl Inst Genet, Ctr Informat Biol, Shizuoka 4118540, Japan
[2] Natl Inst Genet, DNA Data Bank Japan, Shizuoka 4118540, Japan
[3] Japan Sci & Technol Corp, Kawaguchi, Saitama 3320012, Japan
[4] Mitsubishi Res Inst Inc, Chiyoda Ku, Tokyo 1008141, Japan
关键词
D O I
10.1093/nar/30.1.294
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Large-scale genome projects generate an unprecedented number of protein sequences, most of them are experimentally uncharacterized. Predicting the 3D structures of sequences provides important clues as to their functions. We constructed the Genomes TO Protein structures and functions (GTOP) database, containing protein fold predictions of a huge number of sequences. Predictions are mainly carried out with the homology search program PSI-BLAST, currently the most popular among high-sensitivity profile search methods. GTOP also includes the results of other analyses, e.g. homology and motif search, detection of transmembrane helices and repetitive sequences. We have completed analyzing the sequences of 41 organisms, with the number of proteins exceeding 120 000 in total. GTOP uses a graphical viewer to present the analytical results of each ORF in one page in a 'color-bar' format. The assigned 3D structures are presented by Chime plug-in or RasMol. The binding sites of ligands are also included, providing functional information. The GTOP server is available at http://spock.genes.nig.ac.jp/similar togenome/gtop.html.
引用
收藏
页码:294 / 298
页数:5
相关论文
共 26 条
[11]   The PROSITE database, its status in 1999 [J].
Hofmann, K ;
Bucher, P ;
Falquet, L ;
Bairoch, A .
NUCLEIC ACIDS RESEARCH, 1999, 27 (01) :215-219
[12]   Structural/functional assignment of unknown bacteriophage T4 proteins by iterative database searches [J].
Kawabata, T ;
Arisaka, F ;
Nishikawa, K .
GENE, 2000, 259 (1-2) :223-233
[13]   HUGE: a database for human large proteins identified in the Kazusa cDNA sequencing project [J].
Kikuno, R ;
Nagase, T ;
Suyama, M ;
Waki, M ;
Hirosawa, M ;
Ohara, O .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :331-332
[14]  
MURZIN AG, 1995, J MOL BIOL, V247, P536, DOI 10.1016/S0022-2836(05)80134-2
[15]   CyanoBase, a WWW database containing the complete nucleotide sequence of the genome of Synechocystis sp. strain PCC6803 [J].
Nakamura, Y ;
Kaneko, T ;
Hirosawa, M ;
Miyajima, N ;
Tabata, S .
NUCLEIC ACIDS RESEARCH, 1998, 26 (01) :63-67
[16]   Sequence comparisons using multiple sequences detect three times as many remote homologues as pairwise methods [J].
Park, J ;
Karplus, K ;
Barrett, C ;
Hughey, R ;
Haussler, D ;
Hubbard, T ;
Chothia, C .
JOURNAL OF MOLECULAR BIOLOGY, 1998, 284 (04) :1201-1210
[17]   IMPROVED TOOLS FOR BIOLOGICAL SEQUENCE COMPARISON [J].
PEARSON, WR ;
LIPMAN, DJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1988, 85 (08) :2444-2448
[18]   Assigning protein functions by comparative genome analysis: Protein phylogenetic profiles [J].
Pellegrini, M ;
Marcotte, EM ;
Thompson, MJ ;
Eisenberg, D ;
Yeates, TO .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (08) :4285-4288
[19]   COMPARATIVE PROTEIN MODELING BY SATISFACTION OF SPATIAL RESTRAINTS [J].
SALI, A ;
BLUNDELL, TL .
JOURNAL OF MOLECULAR BIOLOGY, 1993, 234 (03) :779-815
[20]   ModBase, a database of annotated comparative protein structure models [J].
Sánchez, R ;
Pieper, U ;
Mirkovic, N ;
de Bakker, PIW ;
Wittenstein, E ;
Sali, A .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :250-253