PROSITE, a protein domain database for functional characterization and annotation

被引:615
作者
Sigrist, Christian J. A. [1 ]
Cerutti, Lorenzo [1 ]
de Castro, Edouard [1 ]
Langendijk-Genevaux, Petra S. [1 ]
Bulliard, Virginie [1 ]
Bairoch, Amos [1 ,2 ]
Hulo, Nicolas [1 ]
机构
[1] Ctr Med Univ Geneva, SIB, CH-1211 Geneva 4, Switzerland
[2] Univ Geneva, Struct Biol & Bioinformat Dept, CH-1211 Geneva 4, Switzerland
关键词
SEQUENCE; PRORULE;
D O I
10.1093/nar/gkp885
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
PROSITE consists of documentation entries describing protein domains, families and functional sites, as well as associated patterns and profiles to identify them. It is complemented by ProRule, a collection of rules based on profiles and patterns, which increases the discriminatory power of these profiles and patterns by providing additional information about functionally and/or structurally critical amino acids. PROSITE is largely used for the annotation of domain features of UniProtKB/Swiss- Prot entries. Among the 983 ( DNA-binding) domains, repeats and zinc fingers present in Swiss-Prot (release 57.8 of 22 September 2009), 696 (similar to 70%) are annotated with PROSITE descriptors using information from ProRule. In order to allow better functional characterization of domains, PROSITE developments focus on subfamily specific profiles and a new profile building method giving more weight to functionally important residues. Here, we describe AMSA, an annotated multiple sequence alignment format used to build a new generation of generalized profiles, the migration of ScanProsite to Vital-IT, a cluster of 633 CPUs, and the adoption of the Distributed Annotation System (DAS) to facilitate PROSITE data integration and interchange with other sources. The latest version of PROSITE (release 20.54, of 22 September 2009) contains 1308 patterns, 863 profiles and 869 ProRules. PROSITE is accessible at: http://www.expasy.org/prosite/.
引用
收藏
页码:D161 / D166
页数:6
相关论文
共 15 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
[Anonymous], The Stockholm le format
[3]   The Jalview Java']Java alignment editor [J].
Clamp, M ;
Cuff, J ;
Searle, SM ;
Barton, GJ .
BIOINFORMATICS, 2004, 20 (03) :426-427
[4]   ScanProsite: detection of PROSITE signature matches and ProRule-associated functional and structural residues in proteins [J].
de Castro, Edouard ;
Sigrist, Christian J. A. ;
Gattiker, Alexandre ;
Bulliard, Virginie ;
Langendijk-Genevaux, Petra S. ;
Gasteiger, Elisabeth ;
Bairoch, Amos ;
Hulo, Nicolas .
NUCLEIC ACIDS RESEARCH, 2006, 34 :W362-W365
[5]   The Distributed Annotation System [J].
Dowell, Robin D. ;
Jokerst, Rodney M. ;
Day, Allen ;
Eddy, Sean R. ;
Stein, Lincoln .
BMC BIOINFORMATICS, 2001, 2 (1)
[6]   The Pfam protein families database [J].
Finn, Robert D. ;
Tate, John ;
Mistry, Jaina ;
Coggill, Penny C. ;
Sammut, Stephen John ;
Hotz, Hans-Rudolf ;
Ceric, Goran ;
Forslund, Kristoffer ;
Eddy, Sean R. ;
Sonnhammer, Erik L. L. ;
Bateman, Alex .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D281-D288
[7]   ProServer: a simple, extensible Perl DAS server [J].
Finn, Robert D. ;
Stalker, James W. ;
Jackson, David K. ;
Kulesha, Eugene ;
Clements, Jody ;
Pettett, Roger .
BIOINFORMATICS, 2007, 23 (12) :1568-1570
[8]   The 20 years of PROSITE [J].
Hulo, Nicolas ;
Bairoch, Amos ;
Bulliard, Virginie ;
Cerutti, Lorenzo ;
Cuche, Beatrice A. ;
de Castro, Edouard ;
Lachaize, Corinne ;
Langendijk-Genevaux, Petra S. ;
Sigrist, Christian J. A. .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D245-D249
[9]   InterPro: the integrative protein signature database [J].
Hunter, Sarah ;
Apweiler, Rolf ;
Attwood, Teresa K. ;
Bairoch, Amos ;
Bateman, Alex ;
Binns, David ;
Bork, Peer ;
Das, Ujjwal ;
Daugherty, Louise ;
Duquenne, Lauranne ;
Finn, Robert D. ;
Gough, Julian ;
Haft, Daniel ;
Hulo, Nicolas ;
Kahn, Daniel ;
Kelly, Elizabeth ;
Laugraud, Aurelie ;
Letunic, Ivica ;
Lonsdale, David ;
Lopez, Rodrigo ;
Madera, Martin ;
Maslen, John ;
McAnulla, Craig ;
McDowall, Jennifer ;
Mistry, Jaina ;
Mitchell, Alex ;
Mulder, Nicola ;
Natale, Darren ;
Orengo, Christine ;
Quinn, Antony F. ;
Selengut, Jeremy D. ;
Sigrist, Christian J. A. ;
Thimma, Manjula ;
Thomas, Paul D. ;
Valentin, Franck ;
Wilson, Derek ;
Wu, Cathy H. ;
Yeats, Corin .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D211-D215
[10]   Dasty2, an ajax protein DAS client [J].
Jimenez, Rafael C. ;
Quinn, Antony F. ;
Garcia, Alexander ;
Labarga, Alberto ;
O'Neill, Kieran ;
Martinez, Fernando ;
Salazar, Gustavo A. ;
Hermjakob, Henning .
BIOINFORMATICS, 2008, 24 (18) :2119-2121