The value of position-specific priors in motif discovery using MEME

被引：75

作者：

Bailey, Timothy L. ^{[1
]}

Boden, Mikael ^{[1
]}

Whitington, Tom ^{[1
]}

Machanick, Philip ^{[1
]}

机构：

[1] Univ Queensland, Inst Mol Biosci, Brisbane, Qld 4072, Australia

来源：

BMC BIOINFORMATICS | 2010年 / 11卷

关键词：

DNA; SEQUENCES; GENOME; ALGORITHM;

D O I：

10.1186/1471-2105-11-179

中图分类号：

Q5 [生物化学];

学科分类号：

071010 ; 081704 ;

摘要：

Background: Position-specific priors have been shown to be a flexible and elegant way to extend the power of Gibbs sampler-based motif discovery algorithms. Information of many types-including sequence conservation, nucleosome positioning, and negative examples-can be converted into a prior over the location of motif sites, which then guides the sequence motif discovery algorithm. This approach has been shown to confer many of the benefits of conservation-based and discriminative motif discovery approaches on Gibbs sampler-based motif discovery methods, but has not previously been studied with methods based on expectation maximization (EM). Results: We extend the popular EM-based MEME algorithm to utilize position-specific priors and demonstrate their effectiveness for discovering transcription factor (TF) motifs in yeast and mouse DNA sequences. Utilizing a discriminative, conservation-based prior dramatically improves MEME's ability to discover motifs in 156 yeast TF ChIP-chip datasets, more than doubling the number of datasets where it finds the correct motif. On these datasets, MEME using the prior has a higher success rate than eight other conservation-based motif discovery approaches. We also show that the same type of prior improves the accuracy of motifs discovered by MEME in mouse TF ChIP-seq data, and that the motifs tend to be of slightly higher quality those found by a Gibbs sampling algorithm using the same prior. Conclusions: We conclude that using position-specific priors can substantially increase the power of EM-based motif discovery algorithms such as MEME algorithm.

引用

页数：14

共 24 条

[21] Discriminative motif discovery in DNA and protein sequences using the DEME algorithm [J].

Redhead, Emma ;

Bailey, Timothy L. .

BMC BIOINFORMATICS, 2007, 8 (1)

[22] PhyloGibbs: A Gibbs sampling motif finder that incorporates phylogeny [J].

Siddharthan, Rahul ;

Siggia, Eric D. ;

van Nimwegen, Erik .

PLOS COMPUTATIONAL BIOLOGY, 2005, 1 (07) :534-556

[23] PhyME: A probabilistic algorithm for finding motifs in sets of orthologous sequences [J].

Sinha, S ;

Blanchette, M ;

Tompa, M .

BMC BIOINFORMATICS, 2004, 5 (1)

[24] Information content and free energy in DNA-protein interactions [J].

Stormo, GD .

JOURNAL OF THEORETICAL BIOLOGY, 1998, 195 (01) :135-137

← 1 2 3 →