Multiplexed massively parallel SELEX for characterization of human transcription factor binding specificities

被引:313
作者
Jolma, Arttu [1 ,2 ,3 ,4 ]
Kivioja, Teemu [1 ,2 ,3 ,5 ]
Toivonen, Jarkko [5 ]
Cheng, Lu [5 ]
Wei, Gonghong [1 ,2 ,3 ]
Enge, Martin [4 ]
Taipale, Mikko [1 ,2 ,3 ]
Vaquerizas, Juan M. [6 ]
Yan, Jian [1 ,2 ,3 ]
Sillanpaa, Mikko J. [7 ]
Bonke, Martin [1 ,2 ,3 ]
Palin, Kimmo [5 ]
Talukder, Shaheynoor [8 ,9 ]
Hughes, Timothy R. [8 ,9 ]
Luscombe, Nicholas M. [6 ]
Ukkonen, Esko [5 ]
Taipale, Jussi [1 ,2 ,3 ,4 ]
机构
[1] Univ Helsinki, Dept Mol Med, Natl Publ Hlth Inst KTL, Biomedicum, FI-00014 Helsinki, Finland
[2] Univ Helsinki, Biomedicum, Genome Scale Biol Program, Inst Biomed, FI-00014 Helsinki, Finland
[3] Univ Helsinki, Biomedicum, High Throughput Ctr, FI-00014 Helsinki, Finland
[4] Karolinska Inst, Dept Biosci & Nutr, Stockholm, Sweden
[5] Univ Helsinki, Dept Comp Sci, FI-00014 Helsinki, Finland
[6] EMBL European Bioinformat Inst, Cambridge CB10 1SD, England
[7] Univ Helsinki, Dept Math & Stat, FI-00014 Helsinki, Finland
[8] Univ Toronto, Dept Mol Genet & Banting, Toronto, ON M4T 2J4, Canada
[9] Univ Toronto, Best Dept Med Res, Toronto, ON M4T 2J4, Canada
基金
芬兰科学院;
关键词
DNA-BINDING; PROTEIN; DOMAIN; GENE; RECOGNITION; AFFINITY; SELECTIVITY; SEQUENCES; PROFILES; ELEMENTS;
D O I
10.1101/gr.100552.109
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The genetic code-the binding specificity of all transfer-RNAs-defines how protein primary structure is determined by DNA sequence. DNA also dictates when and where proteins are expressed, and this information is encoded in a pattern of specific sequence motifs that are recognized by transcription factors. However, the DNA-binding specificity is only known for a small fraction of the similar to 1400 human transcription factors (TFs). We describe here a high-throughput method for analyzing transcription factor binding specificity that is based on systematic evolution of ligands by exponential enrichment (SELEX) and massively parallel sequencing. The method is optimized for analysis of large numbers of TFs in parallel through the use of affinity-tagged proteins, barcoded selection oligonucleotides, and multiplexed sequencing. Data are analyzed by a new bioinformatic platform that uses the hundreds of thousands of sequencing reads obtained to control the quality of the experiments and to generate binding motifs for the TFs. The described technology allows higher throughput and identification of much longer binding profiles than current microarray-based methods. In addition, as our method is based on proteins expressed in mammalian cells, it can also be used to characterize DNA-binding preferences of full-length proteins or proteins requiring post-translational modifications. We validate the method by determining binding specificities of 14 different classes of TFs and by confirming the specificities for NFATC1 and RFX3 using ChIP-seq. Our results reveal unexpected dimeric modes of binding for several factors that were thought to preferentially bind DNA as monomers.
引用
收藏
页码:861 / 873
页数:13
相关论文
共 50 条
[11]   The basic domain leucine zipper protein hXBP-1 preferentially binds to and transactivates CRE-like sequences containing an ACGT core [J].
Clauss, IM ;
Chu, M ;
Zhao, JL ;
Glimcher, LH .
NUCLEIC ACIDS RESEARCH, 1996, 24 (10) :1855-1864
[12]   A biophysical approach to transcription factor binding site discovery [J].
Djordjevic, M ;
Sengupta, AM ;
Shraiman, BI .
GENOME RESEARCH, 2003, 13 (11) :2381-2390
[13]  
Emery P, 1996, MOL CELL BIOL, V16, P4486
[14]   A dimer-specific function of the transcription factor NFATp [J].
Falvo, James V. ;
Lin, Charles H. ;
Tsytsykova, Alla V. ;
Hwang, Peter K. ;
Thanos, Dimitris ;
Goldfeld, Anne E. ;
Maniatis, Tom .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, 105 (50) :19637-19642
[15]   TFEB HAS DNA-BINDING AND OLIGOMERIZATION PROPERTIES OF A UNIQUE HELIX LOOP HELIX LEUCINE-ZIPPER FAMILY [J].
FISHER, DE ;
CARR, CS ;
PARENT, LA ;
SHARP, PA .
GENES & DEVELOPMENT, 1991, 5 (12A) :2342-2352
[16]   CELL-TYPE SPECIFIC ACTIVITY OF 2 GLUCOCORTICOID RESPONSIVE UNITS OF RAT TYROSINE AMINOTRANSFERASE GENE IS ASSOCIATED WITH MULTIPLE BINDING-SITES FOR C/EBP AND A NOVEL LIVER-SPECIFIC NUCLEAR FACTOR [J].
GRANGE, T ;
ROUX, J ;
RIGAUD, G ;
PICTET, R .
NUCLEIC ACIDS RESEARCH, 1991, 19 (01) :131-139
[17]   Genome-wide prediction of mammalian enhancers based on analysis of transcription-factor binding affinity [J].
Hallikas, O ;
Palin, K ;
Sinjushina, N ;
Rautiainen, R ;
Partanen, J ;
Ukkonen, E ;
Taipale, J .
CELL, 2006, 124 (01) :47-59
[18]   High-throughput assay for determining specificity and affinity of protein-DNA binding interactions [J].
Hallikas, Outi ;
Taipale, Jussi .
NATURE PROTOCOLS, 2006, 1 (01) :215-222
[19]   Footprinting: A method for determining the sequence selectivity, affinity and kinetics of DNA-binding ligands [J].
Hampshire, Andrew J. ;
Rusling, David A. ;
Broughton-Head, Victoria J. ;
Fox, Keith R. .
METHODS, 2007, 42 (02) :128-140
[20]   Histone modifications at human enhancers reflect global cell-type-specific gene expression [J].
Heintzman, Nathaniel D. ;
Hon, Gary C. ;
Hawkins, R. David ;
Kheradpour, Pouya ;
Stark, Alexander ;
Harp, Lindsey F. ;
Ye, Zhen ;
Lee, Leonard K. ;
Stuart, Rhona K. ;
Ching, Christina W. ;
Ching, Keith A. ;
Antosiewicz-Bourget, Jessica E. ;
Liu, Hui ;
Zhang, Xinmin ;
Green, Roland D. ;
Lobanenkov, Victor V. ;
Stewart, Ron ;
Thomson, James A. ;
Crawford, Gregory E. ;
Kellis, Manolis ;
Ren, Bing .
NATURE, 2009, 459 (7243) :108-112