The Swiss-Prot protein knowledgebase and ExPASy: providing the plant community with high quality proteomic data and tools

被引:56
作者
Schneider, M [1 ]
Tognolli, M [1 ]
Bairoch, A [1 ]
机构
[1] CMU, Swiss Inst Bioinformat, CH-1211 Geneva 4, Switzerland
关键词
bioinformatics; proteomics; databases; ExPASy; Swiss-Prot; TrEMBL; UniProt;
D O I
10.1016/j.plaphy.2004.10.009
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
The Swiss-Prot protein knowledgebase provides manually annotated entries for all species, but concentrates on the annotation of entries from model organisms to ensure the presence of high quality annotation of representative members of all protein families. A specific Plant Protein Annotation Program (PPAP) was started to cope with the increasing amount of data produced by the complete sequencing of plant genomes. Its main goal is the annotation of proteins from the model plant organism Arabidopsis thaliana. In addition to bibliographic references, experimental results, computed features and sometimes even contradictory conclusions, direct links to specialized databases connect amino acid sequences with the current knowledge in plant sciences. As protein families and groups of plant-specific proteins are regularly reviewed to keep up with current scientific findings, we hope that the wealth of information of Arabidopsis origin accumulated in our knowledgebase, and the numerous software tools provided on the Expert Protein Analysis System (ExPASy) web site might help to identify and reveal the function of proteins originating from other plants. Recently, a single, centralized, authoritative resource for protein sequences and functional information, UniProt, was created by joining the information contained in Swiss-Prot, Translation of the EMBL nucleotide sequence (TrEMBL), and the Protein Information Resource-Protein Sequence Database (PIR-PSD). A rising problem is that an increasing number of nucleotide sequences are not being submitted to the public databases, and thus the proteins inferred from such sequences will have difficulties finding their way to the Swiss-Prot or TrEMBL databases. (C) 2004 Elsevier SAS. All rights reserved.
引用
收藏
页码:1013 / 1021
页数:9
相关论文
共 45 条
[1]  
[Anonymous], GENOME BIOL
[2]  
Apweiler R, 2004, NUCLEIC ACIDS RES, V32, pD115, DOI [10.1093/nar/gkw1099, 10.1093/nar/gkh131]
[3]  
Apweiler R, 2001, Brief Bioinform, V2, P9, DOI 10.1093/bib/2.1.9
[4]   Analysis of the genome sequence of the flowering plant Arabidopsis thaliana [J].
Kaul, S ;
Koo, HL ;
Jenkins, J ;
Rizzo, M ;
Rooney, T ;
Tallon, LJ ;
Feldblyum, T ;
Nierman, W ;
Benito, MI ;
Lin, XY ;
Town, CD ;
Venter, JC ;
Fraser, CM ;
Tabata, S ;
Nakamura, Y ;
Kaneko, T ;
Sato, S ;
Asamizu, E ;
Kato, T ;
Kotani, H ;
Sasamoto, S ;
Ecker, JR ;
Theologis, A ;
Federspiel, NA ;
Palm, CJ ;
Osborne, BI ;
Shinn, P ;
Conway, AB ;
Vysotskaia, VS ;
Dewar, K ;
Conn, L ;
Lenz, CA ;
Kim, CJ ;
Hansen, NF ;
Liu, SX ;
Buehler, E ;
Altafi, H ;
Sakano, H ;
Dunn, P ;
Lam, B ;
Pham, PK ;
Chao, Q ;
Nguyen, M ;
Yu, GX ;
Chen, HM ;
Southwick, A ;
Lee, JM ;
Miranda, M ;
Toriumi, MJ ;
Davis, RW .
NATURE, 2000, 408 (6814) :796-815
[5]   The ENZYME database in 2000 [J].
Bairoch, A .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :304-305
[6]   Improved prediction of signal peptides: SignalP 3.0 [J].
Bendtsen, JD ;
Nielsen, H ;
von Heijne, G ;
Brunak, S .
JOURNAL OF MOLECULAR BIOLOGY, 2004, 340 (04) :783-795
[7]   The PDB data uniformity project [J].
Bhat, TN ;
Bourne, P ;
Feng, ZK ;
Gilliland, G ;
Jain, S ;
Ravichandran, V ;
Schneider, B ;
Schneider, K ;
Thanki, N ;
Weissig, H ;
Westbrook, J ;
Berman, HM .
NUCLEIC ACIDS RESEARCH, 2001, 29 (01) :214-218
[8]   THE FOCUSING POSITIONS OF POLYPEPTIDES IN IMMOBILIZED PH GRADIENTS CAN BE PREDICTED FROM THEIR AMINO-ACID-SEQUENCES [J].
BJELLQVIST, B ;
HUGHES, GJ ;
PASQUALI, C ;
PAQUET, N ;
RAVIER, F ;
SANCHEZ, JC ;
FRUTIGER, S ;
HOCHSTRASSER, D .
ELECTROPHORESIS, 1993, 14 (10) :1023-1031
[9]   The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 [J].
Boeckmann, B ;
Bairoch, A ;
Apweiler, R ;
Blatter, MC ;
Estreicher, A ;
Gasteiger, E ;
Martin, MJ ;
Michoud, K ;
O'Donovan, C ;
Phan, I ;
Pilbout, S ;
Schneider, M .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :365-370
[10]  
Cooper CA, 2001, PROTEOMICS, V1, P340, DOI 10.1002/1615-9861(200102)1:2<340::AID-PROT340>3.3.CO