Weighted sequence motifs as an improved seeding step in microRNA target prediction algorithms

被引:83
作者
Sætrom, O
Snove, O
Saetrom, P [1 ]
机构
[1] Interagon AS, Med Tek Ctr, NO-7489 Trondheim, Norway
[2] Norwegian Univ Sci & Technol, Dept Comp & Informat Sci, N-7034 Trondheim, Norway
关键词
miRNA target prediction; genetic programming; boosting; machine learning;
D O I
10.1261/rna.7290705
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We present a new microRNA target prediction algorithm called TargetBoost, and show that the algorithm is stable and identifies more true targets than do existing algorithms. TargetBoost uses machine learning on a set of validated microRNA targets in lower organisms to create weighted sequence motifs that capture the binding characteristics between microRNAs and their targets. Existing algorithms require candidates to have (1) near-perfect complementarity between microRNAs' 5' end and their targets; (2) relatively high thermodynamic duplex stability; (3) multiple target sites in the target's 3' UTR; and (4) evolutionary conservation of the target between species. Most algorithms use one of the two first requirements in a seeding step, and use the three others as filters to improve the method's specificity. The initial seeding step determines an algorithm's sensitivity and also influences its specificity. As all algorithms may add filters to increase the specificity, we propose that methods should be compared before such filtering. We show that TargetBoost's weighted sequence motif approach is favorable to using both the duplex stability and the sequence complementarity steps.
引用
收藏
页码:995 / 1003
页数:9
相关论文
共 43 条
[1]   MicroRNAs: Genomics, biogenesis, mechanism, and function (Reprinted from Cell, vol 116, pg 281-297, 2004) [J].
Bartel, David P. .
CELL, 2007, 131 (04) :11-29
[2]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[3]   Developmental defects by antisense-mediated inactivation of micro-RNAs 2 and 13 in Drosophila and the identification of putative target genes [J].
Boutla, A ;
Delidakis, C ;
Tabler, M .
NUCLEIC ACIDS RESEARCH, 2003, 31 (17) :4973-4980
[4]   bantam encodes a developmentally regulated microRNA that controls cell proliferation and regulates the proapoptotic gene hid in Drosophila [J].
Brennecke, J ;
Hipfner, DR ;
Stark, A ;
Russell, RB ;
Cohen, SM .
CELL, 2003, 113 (01) :25-36
[5]   Specificity of microRNA target selection in translational repression [J].
Doench, JG ;
Sharp, PA .
GENES & DEVELOPMENT, 2004, 18 (05) :504-511
[6]   siRNAs can function as miRNAs [J].
Doench, JG ;
Petersen, CP ;
Sharp, PA .
GENES & DEVELOPMENT, 2003, 17 (04) :438-442
[7]  
Enright AJ, 2004, GENOME BIOL, V5
[8]   Use of receiver operating characteristic (ROC) analysis to evaluate sequence matching [J].
Gribskov, M ;
Robinson, NL .
COMPUTERS & CHEMISTRY, 1996, 20 (01) :25-33
[9]   The microRNA Registry [J].
Griffiths-Jones, S .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D109-D111
[10]   A recursive MISD architecture for pattern matching [J].
Halaas, A ;
Svingen, B ;
Nedland, M ;
Sætrom, P ;
Snove, O ;
Birkeland, OR .
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2004, 12 (07) :727-734