MEME-ChIP: motif analysis of large DNA datasets

被引:1228
作者
Machanick, Philip [1 ]
Bailey, Timothy L. [1 ]
机构
[1] Univ Queensland, Inst Mol Biosci, Brisbane, Qld 4072, Australia
基金
美国国家卫生研究院;
关键词
D O I
10.1093/bioinformatics/btr189
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Advances in high-throughput sequencing have resulted in rapid growth in large, high-quality datasets including those arising from transcription factor (TF) ChIP-seq experiments. While there are many existing tools for discovering TF binding site motifs in such datasets, most web-based tools cannot directly process such large datasets. Results: The MEME-ChIP web service is designed to analyze ChIP-seq 'peak regions'-short genomic regions surrounding declared ChIP-seq 'peaks'. Given a set of genomic regions, it performs (i) ab initio motif discovery, (ii) motif enrichment analysis, (iii) motif visualization, (iv) binding affinity analysis and (v) motif identification. It runs two complementary motif discovery algorithms on the input data-MEME and DREME-and uses the motifs they discover in subsequent visualization, binding affinity and identification steps. MEME-ChIP also performs motif enrichment analysis using the AME algorithm, which can detect very low levels of enrichment of binding sites for TFs with known DNA-binding motifs. Importantly, unlike with the MEME web service, there is no restriction on the size or number of uploaded sequences, allowing very large ChIP-seq datasets to be analyzed. The analyses performed by MEME-ChIP provide the user with a varied view of the binding and regulatory activity of the ChIP-ed TF, as well as the possible involvement of other DNA-binding TFs.
引用
收藏
页码:1696 / 1697
页数:2
相关论文
共 10 条
[1]   MEME: discovering and analyzing DNA and protein sequence motifs [J].
Bailey, Timothy L. ;
Williams, Nadya ;
Misleh, Chris ;
Li, Wilfred W. .
NUCLEIC ACIDS RESEARCH, 2006, 34 :W369-W373
[2]   Combining evidence using p-values: application to sequence homology searches [J].
Bailey, TL ;
Gribskov, M .
BIOINFORMATICS, 1998, 14 (01) :48-54
[3]  
BAILEY TL, 2011, BIOINFORMAT IN PRESS
[4]   Assigning roles to DNA regulatory motifs using comparative genomics [J].
Buske, Fabian A. ;
Boden, Mikael ;
Bauer, Denis C. ;
Bailey, Timothy L. .
BIOINFORMATICS, 2010, 26 (07) :860-866
[5]   Trawler:: de novo regulatory motif discovery pipeline for chromatin immunoprecipitation [J].
Ettwiller, Laurence ;
Paten, Benedict ;
Ramialison, Mirana ;
Birney, Ewan ;
Wittbrodt, Joachim .
NATURE METHODS, 2007, 4 (07) :563-565
[6]   Quantifying similarity between motifs [J].
Gupta, Shobhit ;
Stamatoyannopoulos, John A. ;
Bailey, Timothy L. ;
Noble, William Stafford .
GENOME BIOLOGY, 2007, 8 (02)
[7]   Genome-wide identification of TAL1's functional targets: Insights into its mechanisms of action in primary erythroid cells [J].
Kassouf, Mira T. ;
Hughes, Jim R. ;
Taylor, Stephen ;
McGowan, Simon J. ;
Soneji, Shamit ;
Green, Angela L. ;
Vyas, Paresh ;
Porcher, Catherine .
GENOME RESEARCH, 2010, 20 (08) :1064-1083
[8]   Motif Enrichment Analysis: a unified framework and an evaluation on ChIP data [J].
McLeay, Robert C. ;
Bailey, Timothy L. .
BMC BIOINFORMATICS, 2010, 11
[9]   JASPAR 2010: the greatly expanded open-access database of transcription factor binding profiles [J].
Portales-Casamar, Elodie ;
Thongjuea, Supat ;
Kwon, Andrew T. ;
Arenillas, David ;
Zhao, Xiaobei ;
Valen, Eivind ;
Yusuf, Dimas ;
Lenhard, Boris ;
Wasserman, Wyeth W. ;
Sandelin, Albin .
NUCLEIC ACIDS RESEARCH, 2010, 38 :D105-D110
[10]   RSAT:: regulatory sequence analysis tools [J].
Thomas-Chollier, Morgane ;
Sand, Olivier ;
Turatsinze, Jean-Valery ;
Janky, Rekin's ;
Defrance, Matthieu ;
Vervisch, Eric ;
Brohee, Sylvain ;
van Helden, Jacques .
NUCLEIC ACIDS RESEARCH, 2008, 36 :W119-W127