PROFEAT: a web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence

被引:245
作者
Li, Z. R.
Lin, H. H.
Han, L. Y.
Jiang, L.
Chen, X.
Chen, Y. Z.
机构
[1] Natl Univ Singapore, Dept Computat Sci, Bioinformat & Drug Design Grp, Singapore 117543, Singapore
[2] Sichuan Univ, Coll Chem, Chengdu 610064, Peoples R China
[3] Zhejiang Univ, Dept Biotechnol, Hangzhou 310029, Peoples R China
[4] Shanghai Ctr Bioinformat Technol, Shanghai 201203, Peoples R China
关键词
D O I
10.1093/nar/gkl305
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Sequence-derived structural and physicochemical features have frequently been used in the development of statistical learning models for predicting proteins and peptides of different structural, functional and interaction profiles. PROFEAT (Protein Features) is a web server for computing commonly-used structural and physicochemical features of proteins and peptides from amino acid sequence. It computes six feature groups composed of ten features that include 51 descriptors and 1447 descriptor values. The computed features include amino acid composition, dipeptide composition, normalized Moreau-Broto autocorrelation, Moran autocorrelation, Geary autocorrelation, sequence-order-coupling number, quasi-sequence-order descriptors and the composition, transition and distribution of various structural and physicochemical properties. In addition, it can also compute previous autocorrelations descriptors based on user-defined properties. Our computational algorithms were extensively tested and the computed protein features have been used in a number of published works for predicting proteins of functional classes, protein-protein interactions and MHC-binding peptides. PROFEAT is accessible at http:// jing.cz3.nus.edu.sg/cgi-bin/prof/prof.cgi.
引用
收藏
页码:W32 / W37
页数:6
相关论文
共 53 条
[1]   Classification of nuclear receptors based on amino acid composition and dipeptide composition [J].
Bhasin, M ;
Raghava, GPS .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2004, 279 (22) :23262-23266
[2]  
Bhaskaran R., 1988, INT J PEPT PROTEIN R, V32, P242
[4]   Whole-proteome interaction mining [J].
Bock, JR ;
Gough, DA .
BIOINFORMATICS, 2003, 19 (01) :125-134
[5]   Predicting protein-protein interactions from primary structure [J].
Bock, JR ;
Gough, DA .
BIOINFORMATICS, 2001, 17 (05) :455-460
[6]  
BROTO P, 1984, EUR J MED CHEM, V19, P71
[7]   Enzyme family classification by support vector machines [J].
Cai, CZ ;
Han, LY ;
Ji, ZL ;
Chen, YZ .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 55 (01) :66-76
[8]   SVM-Prot: web-based support vector machine software for functional classification of a protein from its primary sequence [J].
Cai, CZ ;
Han, LY ;
Ji, ZL ;
Chen, X ;
Chen, YZ .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3692-3697
[9]   PROTEIN FOLDING AND THE GENETIC-CODE - AN ALTERNATIVE QUANTITATIVE MODEL [J].
CHARTON, M .
JOURNAL OF THEORETICAL BIOLOGY, 1981, 91 (01) :115-123
[10]   THE STRUCTURAL DEPENDENCE OF AMINO-ACID HYDROPHOBICITY PARAMETERS [J].
CHARTON, M ;
CHARTON, BI .
JOURNAL OF THEORETICAL BIOLOGY, 1982, 99 (04) :629-644