Using discriminant function for prediction of subcellular location of prokaryotic proteins

被引:87
作者
Chou, KC [1 ]
Elrod, DW [1 ]
机构
[1] Pharmacia & Upjohn Inc, Comp Aided Drug Discovery, Kalamazoo, MI 49007 USA
关键词
organelles; amino-acid composition; self-consistency; jackknife; collective interaction;
D O I
10.1006/bbrc.1998.9498
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The discriminant function algorithm was introduced to predict the subcellular location of proteins in prokaryotic organisms from their amino-acid composition. The rate of correct prediction for the three possible subcellular locations of prokaryotic proteins studied by Reinhardt and Hubbard (Nucleic Acid Research, 1998, 26:2230-2236) was 90% by the self-consistency test, and 87% by the jackknife test. These rates are considerably higher than the results recently reported by them using the neural network method. Furthermore, the test procedure adopted here is also more rigorous. The core of the current algorithm is the covariance matrix, through which the collective interactions among different amino-acid components of a protein can be reflected. It is anticipated that, owing to the intimate correlation of the function of a protein with its subcellular location, the current algorithm mill become a useful tool for the systematic analysis of genome data. (C) 1998 Academic Press.
引用
收藏
页码:63 / 68
页数:6
相关论文
共 13 条
[1]   Relation between amino acid composition and cellular location of proteins [J].
Cedano, J ;
Aloy, P ;
PerezPons, JA ;
Querol, E .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 266 (03) :594-600
[2]   PREDICTION OF PROTEIN STRUCTURAL CLASSES [J].
CHOU, KC ;
ZHANG, CT .
CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1995, 30 (04) :275-349
[3]   A NOVEL-APPROACH TO PREDICTING PROTEIN STRUCTURAL CLASSES IN A (20-1)-D AMINO-ACID-COMPOSITION SPACE [J].
CHOU, KC .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1995, 21 (04) :319-344
[4]  
Chou KC, 1998, PROTEINS, V31, P97, DOI 10.1002/(SICI)1097-0134(19980401)31:1<97::AID-PROT8>3.3.CO
[5]  
2-Y
[6]  
Hart P.E., 1973, Pattern recognition and scene analysis
[7]   Complete sequence analysis of the genome of the bacterium Mycoplasma pneumoniae [J].
Himmelreich, R ;
Hilbert, H ;
Plagens, H ;
Pirkl, E ;
Li, BC ;
Herrmann, R .
NUCLEIC ACIDS RESEARCH, 1996, 24 (22) :4420-4449
[8]  
KING RD, 1996, PROTEIN STRUCTURE PR, P79
[9]  
Mahalanobis P. C., 1936, P NATL I SCI INDIA, V1, P49
[10]  
MARDIA KV, 1979, MULTIVARIATE ANAL, P322