RISP: A web-based server for prediction of RNA-binding sites in proteins

被引:39
作者
Tong, Jing [1 ]
Jiang, Peng [1 ]
Lu, Zu-hong [1 ]
机构
[1] SE Univ, Dept Biol Sci & Med Engn, State Key Lab Bioelect, Nanjing 210096, Peoples R China
基金
中国国家自然科学基金;
关键词
protein RNA-binding site; position-specific scoring matrix; support vector machine; RISP;
D O I
10.1016/j.cmpb.2007.12.003
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Protein-RNA interactions play significant roles in a number of biological activities, such as protein synthesis, regulation of gene expression. Here we propose a hybrid RISP (RNA-interaction site prediction) method, using support vector machine (SVM) in conjunction with evolutionary information of amino acid sequences in terms of their position-specific scoring matrices (PSSMs) for prediction of RNA-binding sites. The results show that our RISP method has 72.2% net prediction (NP) (61.0% sensitivity and 83.3% specificity). When compared with previous studies, this novel method appears more accurate and better generalization abilities. RISP is freely available at http://grc.seu.edu.cn/RISP. Given a protein sequence, RISP decides whether residue in the protein is RNA-binding or not (optimal prediction), and gives the confidence value, 'high specificity' prediction and 'high sensitivity' prediction. (c) 2007 Elsevier Ireland Ltd. All rights reserved
引用
收藏
页码:148 / 153
页数:6
相关论文
共 24 条
[1]   PSSM-based prediction of DNA binding sites in proteins [J].
Ahmad, S ;
Sarai, A .
BMC BIOINFORMATICS, 2005, 6 (1)
[2]   Analysis and prediction of DNA-binding proteins and their binding residues based on composition, sequence and structural information [J].
Ahmad, S ;
Gromiha, MM ;
Sarai, A .
BIOINFORMATICS, 2004, 20 (04) :477-486
[3]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[4]  
[Anonymous], CURR OPIN STRUC BIOL
[5]   Assessing the accuracy of prediction algorithms for classification: an overview [J].
Baldi, P ;
Brunak, S ;
Chauvin, Y ;
Andersen, CAF ;
Nielsen, H .
BIOINFORMATICS, 2000, 16 (05) :412-424
[6]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[7]   The use of the area under the roc curve in the evaluation of machine learning algorithms [J].
Bradley, AP .
PATTERN RECOGNITION, 1997, 30 (07) :1145-1159
[8]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[9]   Support vector machines for predicting rRNA-, RNA-, and DNA-binding proteins from amino acid sequence [J].
Cai, YD ;
Lin, SL .
BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS, 2003, 1648 (1-2) :127-133
[10]   RNA as a small-molecule drug target: doubling the value of genomics [J].
Ecker, DJ ;
Griffey, RH .
DRUG DISCOVERY TODAY, 1999, 4 (09) :420-429