共 42 条
Improved and automated prediction of effective siRNA
被引:102
作者:
Chalk, AM
[1
]
Wahlestedt, C
[1
]
Sonnhammer, ELL
[1
]
机构:
[1] Karolinska Inst, Ctr Genom & Bioinformat, S-17177 Stockholm, Sweden
关键词:
D O I:
10.1016/j.bbrc.2004.04.181
中图分类号:
Q5 [生物化学];
Q7 [分子生物学];
学科分类号:
071010 ;
081704 ;
摘要:
Short interfering RNAs are used in functional genomics studies to knockdown a single gene in a reversible manner. The results of siRNA experiments are highly dependent on the choice of siRNA sequence. In order to evaluate siRNA design rules, we collected a database of 398 siRNAs of known efficacy from 92 genes. We used this database to evaluate previously proposed rules from smaller datasets, and to find a new set of rules that are optimal for the entire database. We also trained a regression tree with full cross-validation. It was however difficult to obtain the same precision as methods previously tested on small datasets from one or two genes. We show that those methods are overfitting as they work poorly on independent validation datasets from multiple genes. Our new design rules can predict siRNAs with efficacy greater than or equal to50% in 91% of cases, and with efficacy greater than or equal to90% in 52% of cases, which is more than a twofold improvement over random selection. Software for designing siRNAs is available online via a web server at http:// sisearch.cgb.ki.se/ or as a standalone version for high-throughput applications. (C) 2004 Elsevier Inc. All rights reserved.
引用
收藏
页码:264 / 274
页数:11
相关论文