An algorithm for finding protein-DNA binding sites with applications to chromatin-immunoprecipitation microarray experiments

被引:460
作者
Liu, XS
Brutlag, DL
Liu, JS
机构
[1] Harvard Univ, Dept Stat, Cambridge, MA 02138 USA
[2] Stanford Univ, Dept Biochem, Stanford, CA 94305 USA
关键词
D O I
10.1038/nbt717
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Chromatin immunoprecipitation followed by cDNA microarray hybridization (ChIP-array) has become a popular procedure for studying genome-wide protein-DNA interactions and transcription regulation. However, it can only map the probable protein-DNA interaction loci within 1-2 kilobases resolution. To pinpoint interaction sites down to the base-pair level, we introduce a computational method, Motif Discovery scan (MDscan), that examines the ChIP-array-selected sequences and searches for DNA sequence motifs representing the protein-DNA interaction sites. MDscan combines the advantages of two widely adopted motif search strategies, word enumeration(1-4) and position-specific weight matrix updating(5-9), and incorporates the ChIP-array ranking information to accelerate searches and enhance their success rates. MDscan correctly identified all the experimentally verified motifs from published ChIP-array experiments in yeast(10-13) (STE12, GAL4, RAP1, SCB, MCB, MCM1, SFF, and SWI5), and predicted two motif patterns for the differential binding of Rap1 protein in telomere regions. In our studies, the method was faster and more accurate than several established motif-finding algorithms(5,8,9). MDscan can be used to find DNA motifs not only in ChIP-array experiments but also in other experiments in which a subgroup of the sequences can be inferred to contain relatively abundant motif sites. The MDscan web server can be accessed at http:// BioProspector.stanford.edu/MDscan/.
引用
收藏
页码:835 / 839
页数:6
相关论文
共 20 条