Identifying ChIP-seq enrichment using MACS

被引:1185
作者
Feng, Jianxing [1 ]
Liu, Tao [2 ]
Qin, Bo [1 ]
Zhang, Yong [1 ]
Liu, Xiaole Shirley [2 ]
机构
[1] Tongji Univ, Sch Life Sci & Technol, Dept Bioinformat, Shanghai 200092, Peoples R China
[2] Harvard Univ, Sch Med, Dana Farber Canc Inst, Dept Biostat & Computat Biol, Boston, MA 02115 USA
基金
中国国家自然科学基金; 美国国家卫生研究院;
关键词
TRANSCRIPTION FACTOR-BINDING; HUMAN GENOME; CHROMATIN-IMMUNOPRECIPITATION; SITES; IDENTIFICATION; TECHNOLOGY; ALIGNMENT; SOFTWARE; BROWSER; LOCI;
D O I
10.1038/nprot.2012.101
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Model-based analysis of ChIP-seq (MACS) is a computational algorithm that identifies genome-wide locations of transcription/chromatin factor binding or histone modification from ChIP-seq data. MACS consists of four steps: removing redundant reads, adjusting read position, calculating peak enrichment and estimating the empirical false discovery rate (FDR). In this protocol, we provide a detailed demonstration of how to install MACS and how to use it to analyze three common types of ChIP-seq data sets with different characteristics: the sequence-specific transcription factor FoxA1, the histone modification mark H3K4me3 with sharp enrichment and the H3K36me3 mark with broad enrichment. We also explain how to interpret and visualize the results of MACS analyses. The algorithm requires similar to 3 GB of RAM and 1.5 h of computing time to analyze a ChIP-seq data set containing 30 million reads, an estimate that increases with sequence coverage. MACS is open source and is available from http://liulab.dfci.harvard.edu/MACS/.
引用
收藏
页码:1728 / 1740
页数:13
相关论文
共 41 条
[1]   Genomic Location Analysis by ChIP-Seq [J].
Barski, Artem ;
Zhao, Keji .
JOURNAL OF CELLULAR BIOCHEMISTRY, 2009, 107 (01) :11-18
[2]   High-resolution profiling of histone methylations in the human genome [J].
Barski, Artern ;
Cuddapah, Suresh ;
Cui, Kairong ;
Roh, Tae-Young ;
Schones, Dustin E. ;
Wang, Zhibin ;
Wei, Gang ;
Chepelev, Iouri ;
Zhao, Keji .
CELL, 2007, 129 (04) :823-837
[3]   Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project [J].
Birney, Ewan ;
Stamatoyannopoulos, John A. ;
Dutta, Anindya ;
Guigo, Roderic ;
Gingeras, Thomas R. ;
Margulies, Elliott H. ;
Weng, Zhiping ;
Snyder, Michael ;
Dermitzakis, Emmanouil T. ;
Stamatoyannopoulos, John A. ;
Thurman, Robert E. ;
Kuehn, Michael S. ;
Taylor, Christopher M. ;
Neph, Shane ;
Koch, Christoph M. ;
Asthana, Saurabh ;
Malhotra, Ankit ;
Adzhubei, Ivan ;
Greenbaum, Jason A. ;
Andrews, Robert M. ;
Flicek, Paul ;
Boyle, Patrick J. ;
Cao, Hua ;
Carter, Nigel P. ;
Clelland, Gayle K. ;
Davis, Sean ;
Day, Nathan ;
Dhami, Pawandeep ;
Dillon, Shane C. ;
Dorschner, Michael O. ;
Fiegler, Heike ;
Giresi, Paul G. ;
Goldy, Jeff ;
Hawrylycz, Michael ;
Haydock, Andrew ;
Humbert, Richard ;
James, Keith D. ;
Johnson, Brett E. ;
Johnson, Ericka M. ;
Frum, Tristan T. ;
Rosenzweig, Elizabeth R. ;
Karnani, Neerja ;
Lee, Kirsten ;
Lefebvre, Gregory C. ;
Navas, Patrick A. ;
Neri, Fidencio ;
Parker, Stephen C. J. ;
Sabo, Peter J. ;
Sandstrom, Richard ;
Shafer, Anthony .
NATURE, 2007, 447 (7146) :799-816
[4]  
Chen YW, 2012, NAT METHODS, V9, P609, DOI [10.1038/NMETH.1985, 10.1038/nmeth.1985]
[5]   Substantial biases in ultra-short read data sets from high-throughput DNA sequencing [J].
Dohm, Juliane C. ;
Lottaz, Claudio ;
Borodina, Tatiana ;
Himmelbauer, Heinz .
NUCLEIC ACIDS RESEARCH, 2008, 36 (16)
[6]   FindPeaks 3.1: a tool for identifying areas of enrichment from massively parallel short-read sequencing technology [J].
Fejes, Anthony P. ;
Robertson, Gordon ;
Bilenky, Mikhail ;
Varhol, Richard ;
Bainbridge, Matthew ;
Jones, Steven J. M. .
BIOINFORMATICS, 2008, 24 (15) :1729-1730
[7]   Nucleosome dynamics define transcriptional enhancers [J].
He, Housheng Hansen ;
Meyer, Clifford A. ;
Shin, Hyunjin ;
Bailey, Shannon T. ;
Wei, Gang ;
Wang, Qianben ;
Zhang, Yong ;
Xu, Kexin ;
Ni, Min ;
Lupien, Mathieu ;
Mieczkowski, Piotr ;
Lieb, Jason D. ;
Zhao, Keji ;
Brown, Myles ;
Liu, X. Shirley .
NATURE GENETICS, 2010, 42 (04) :343-U101
[8]   An integrated software system for analyzing ChIP-chip and ChIP-seq data [J].
Ji, Hongkai ;
Jiang, Hui ;
Ma, Wenxiu ;
Johnson, David S. ;
Myers, Richard M. ;
Wong, Wing H. .
NATURE BIOTECHNOLOGY, 2008, 26 (11) :1293-1300
[9]   Genome-wide mapping of in vivo protein-DNA interactions [J].
Johnson, David S. ;
Mortazavi, Ali ;
Myers, Richard M. ;
Wold, Barbara .
SCIENCE, 2007, 316 (5830) :1497-1502
[10]   Genome-wide identification of in vivo protein-DNA binding sites from ChIP-Seq data [J].
Jothi, Raja ;
Cuddapah, Suresh ;
Barski, Artem ;
Cui, Kairong ;
Zhao, Keji .
NUCLEIC ACIDS RESEARCH, 2008, 36 (16) :5221-5231