Predotar:: A tool for rapidly screening proteomes for N-terminal targeting sequences

被引:709
作者
Small, I
Peeters, N
Legeai, F
Lurin, C
机构
[1] UEVE, CNRS, INRA, Unite Rech Genom Vegetable, F-91057 Evry, France
[2] INRA, Genet & Ameliorat Plantes Stn, F-78026 Versailles, France
[3] INRA, Unite Rech Genom Info, Evry, France
关键词
bioinformatics predictions; neural networks; organelles; protein targeting;
D O I
10.1002/pmic.200300776
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Probably more than 25% of the proteins encoded by the nuclear genomes of multicellular eukaryotes are targeted to membrane-bound compartments by N-terminal targeting signals. The major signals are those for the endoplasmic reticulum, the mitochondria, and in plants, plastids. The most abundant of these targeted proteins are well-known and well-studied, but a large proportion remain unknown, including most of those involved in regulation of organellar gene expression or regulation of biochemical pathways. The discovery and characterization of these proteins by biochemical means will be long and difficult. An alternative method is to identify candidate organellar proteins via their characteristic N-terminal targeting sequences. We have developed a neural network-based approach (Predotar - Prediction of Organelle Targeting sequences) for identifying genes encoding these proteins amongst eukaryotic genome sequences. The power of this approach for identifying and annotating novel gene families has been illustrated by the discovery of the pentatricopeptide repeat family.
引用
收藏
页码:1581 / 1590
页数:10
相关论文
共 33 条
[1]   EMPIRICAL HYDROPHOBICITY SCALE FOR ALPHA-AMINO-ACIDS AND SOME OF ITS APPLICATIONS [J].
ABODERIN, AA .
INTERNATIONAL JOURNAL OF BIOCHEMISTRY, 1971, 2 (11) :537-&
[2]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[3]   MitoP2, an integrated database on mitochondrial proteins in yeast and man [J].
Andreoli, C ;
Prokisch, H ;
Hörtnagel, K ;
Mueller, JC ;
Münsterkötter, M ;
Scharfe, C ;
Meitinger, T .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D459-D462
[4]   Extensive feature detection of N-terminal protein sorting signals [J].
Bannai, H ;
Tamada, Y ;
Maruyama, O ;
Nakai, K ;
Miyano, S .
BIOINFORMATICS, 2002, 18 (02) :298-305
[5]  
Bauer MF, 2002, INT REV NEUROBIOL, V53, P57
[6]   The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 [J].
Boeckmann, B ;
Bairoch, A ;
Apweiler, R ;
Blatter, MC ;
Estreicher, A ;
Gasteiger, E ;
Martin, MJ ;
Michoud, K ;
O'Donovan, C ;
Phan, I ;
Pilbout, S ;
Schneider, M .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :365-370
[7]   Multiple sequence alignment with the Clustal series of programs [J].
Chenna, R ;
Sugawara, H ;
Koike, T ;
Lopez, R ;
Gibson, TJ ;
Higgins, DG ;
Thompson, JD .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3497-3500
[8]   Protein subcellular location prediction [J].
Chou, KC ;
Elrod, DW .
PROTEIN ENGINEERING, 1999, 12 (02) :107-118
[9]   CONFORMATIONAL PARAMETERS FOR AMINO-ACIDS IN HELICAL, BETA-SHEET, AND RANDOM COIL REGIONS CALCULATED FROM PROTEINS [J].
CHOU, PY ;
FASMAN, GD .
BIOCHEMISTRY, 1974, 13 (02) :211-222
[10]   Computational method to predict mitochondrially imported proteins and their targeting sequences [J].
Claros, MG ;
Vincens, P .
EUROPEAN JOURNAL OF BIOCHEMISTRY, 1996, 241 (03) :779-786