Amino Acid Principal Component Analysis (AAPCA) and its applications in protein structural class prediction

被引:85
作者
Du, Qi-Shi
Jiang, Zhi-Qin
He, Wen-Zhang
Li, Da-Peng
Chou, Kou-Chen
机构
[1] Tianjin Univ Technol & Educ, Dept Math, Tianjin 300222, Peoples R China
[2] Tianjin Normal Univ, Dept Chem, Tianjin 300074, Peoples R China
[3] Gordon Life Sci Inst, San Diego, CA 92130 USA
基金
中国国家自然科学基金;
关键词
D O I
10.1080/07391102.2006.10507088
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The extremely complicated nature of many biological problems makes them bear the features of fuzzy sets, such as with vague, imprecise, noisy, ambiguous, or input-missing information For instance, the current data in classifying protein structural classes are typically a fuzzy set. To deal with this kind of problem, the AAPCA (Amino Acid Principal Component Analysis) approach was introduced. In the AAPCA approach the 20-dimensional amino acid composition space is reduced to an orthogonal space with fewer dimensions, and the original base functions are converted into a set of orthogonal and normalized base functions. The advantage of such an approach is that it can minimize the random errors and redundant information in protein dataset through a principal component selection, remarkably improving the success rates in predicting protein structural classes. It is anticipated that the AAPCA approach can be used to deal with many other classification problems in proteins as well.
引用
收藏
页码:635 / 640
页数:6
相关论文
共 38 条
[1]   Relation between amino acid composition and cellular location of proteins [J].
Cedano, J ;
Aloy, P ;
PerezPons, JA ;
Querol, E .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 266 (03) :594-600
[2]   Prediction of protease types in a hybridization space [J].
Chou, KC ;
Cai, YD .
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2006, 339 (03) :1015-1020
[3]   Progress in protein structural class prediction and its impact to bioinformatics and proteomics [J].
Chou, KC .
CURRENT PROTEIN & PEPTIDE SCIENCE, 2005, 6 (05) :423-436
[4]   Prediction of G-protein-coupled receptor classes [J].
Chou, KC .
JOURNAL OF PROTEOME RESEARCH, 2005, 4 (04) :1413-1418
[5]   Predicting protein localization in budding yeast [J].
Chou, KC ;
Cai, YD .
BIOINFORMATICS, 2005, 21 (07) :944-950
[6]   Using GO-PseAA predictor to predict enzyme sub-class [J].
Chou, KC ;
Cai, YD .
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2004, 325 (02) :506-509
[7]   Predicting enzyme family class in a hybridization space [J].
Chou, KC ;
Cai, YD .
PROTEIN SCIENCE, 2004, 13 (11) :2857-2863
[8]   Structural bioinformatics and its impact to biomedical science [J].
Chou, KC .
CURRENT MEDICINAL CHEMISTRY, 2004, 11 (16) :2105-2134
[9]   Predicting protein quaternary structure by pseudo amino acid composition [J].
Chou, KC ;
Cai, YD .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2003, 53 (02) :282-289
[10]  
Chou KC, 1999, PROTEINS, V34, P137, DOI 10.1002/(SICI)1097-0134(19990101)34:1<137::AID-PROT11>3.0.CO