MultiLoc:: prediction of protein subcellular localization using N-terminal targeting sequences, sequence motifs and amino acid composition

被引:245
作者
Höglund, A
Dönnes, P
Blum, T
Adolph, HW
Kohlbacher, O
机构
[1] Univ Tubingen, Div Simulat Biol Syst, WSI, ZBIT, D-72076 Tubingen, Germany
[2] Univ Saarland, Dept Biochem, D-66041 Saarbrucken, Germany
[3] Univ Saarland, Ctr Bioinformat, D-66041 Saarbrucken, Germany
关键词
D O I
10.1093/bioinformatics/btl002
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Functional annotation of unknown proteins is a major goal in proteomics. A key annotation is the prediction of a protein's subcellular localization. Numerous prediction techniques have been developed, typically focusing on a single underlying biological aspect or predicting a subset of all possible localizations. An important step is taken towards emulating the protein sorting process by capturing and bringing together biologically relevant information, and addressing the clear need to improve prediction accuracy and localization coverage. Results: Here we present a novel SVM-based approach for predicting subcellular localization, which integrates N-terminal targeting sequences, amino acid composition and protein sequence motifs. We show how this approach improves the prediction based on N-terminal targeting sequences, by comparing our method TargetLoc against existing methods. Furthermore, MultiLoc performs considerably better than comparable methods predicting all major eukaryotic subcellular localizations, and shows better or comparable results to methods that are specialized on fewer localizations or for one organism.
引用
收藏
页码:1158 / 1165
页数:8
相关论文
共 40 条
[31]   A KNOWLEDGE BASE FOR PREDICTING PROTEIN LOCALIZATION SITES IN EUKARYOTIC CELLS [J].
NAKAI, K ;
KANEHISA, M .
GENOMICS, 1992, 14 (04) :897-911
[32]   Prediction of protein subcellular locations by support vector machines using compositions of amino acids and amino acid pairs [J].
Park, KJ ;
Kanehisa, M .
BIOINFORMATICS, 2003, 19 (13) :1656-1663
[33]   BIOSYNTHETIC PROTEIN-TRANSPORT AND SORTING BY THE ENDOPLASMIC-RETICULUM AND GOLGI [J].
PFEFFER, SR ;
ROTHMAN, JE .
ANNUAL REVIEW OF BIOCHEMISTRY, 1987, 56 :829-852
[34]   Using neural networks for prediction of the subcellular location of proteins [J].
Reinhardt, A ;
Hubbard, T .
NUCLEIC ACIDS RESEARCH, 1998, 26 (09) :2230-2236
[35]   Automatic prediction of protein function [J].
Rost, B ;
Liu, J ;
Nair, R ;
Wrzeszczynski, KO ;
Ofran, Y .
CELLULAR AND MOLECULAR LIFE SCIENCES, 2003, 60 (12) :2637-2650
[36]   Protein transport via amino-terminal targeting sequences: Common themes in diverse systems [J].
Rusch, SL ;
Kendall, DA .
MOLECULAR MEMBRANE BIOLOGY, 1995, 12 (04) :295-307
[37]   Predicting subcellular localization via protein motif co-occurrence [J].
Scott, MS ;
Thomas, DY ;
Hallett, MT .
GENOME RESEARCH, 2004, 14 (10A) :1957-1966
[38]   CLUSTAL-W - IMPROVING THE SENSITIVITY OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT THROUGH SEQUENCE WEIGHTING, POSITION-SPECIFIC GAP PENALTIES AND WEIGHT MATRIX CHOICE [J].
THOMPSON, JD ;
HIGGINS, DG ;
GIBSON, TJ .
NUCLEIC ACIDS RESEARCH, 1994, 22 (22) :4673-4680
[39]  
YING H, 2004, BIOINFORMATICS, V20, P21
[40]   Prediction of protein subcellular locations using Markov chain models [J].
Yuan, Z .
FEBS LETTERS, 1999, 451 (01) :23-26