Identifying gene regulatory elements by genome-wide recovery of DNase hypersensitive sites

被引:133
作者
Crawford, GE
Holt, IE
Mullikin, JC
Tai, D
Green, ED
Wolfsberg, TG
Collins, FS [1 ]
机构
[1] NHGRI, Genome Technol Branch, NIH, Bethesda, MD 20892 USA
[2] NHGRI, Intramural Sequencing Ctr, NIH, Bethesda, MD 20892 USA
关键词
D O I
10.1073/pnas.0307540100
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Analysis of the human genome sequence has identified approximate to25,000-30,000 protein-coding genes, but little is known about how most of these are regulated. Mapping DNase I hypersensitive (HS) sites has traditionally represented the gold-standard experimental method for identifying regulatory elements, but the labor-intensive nature of this technique has limited its application to only a small number of human genes. We have developed a protocol to generate a genome-wide library of gene regulatory sequences by cloning DNase HS sites. We generated a library of DNase HS sites from quiescent primary human CD4(+) T cells and analyzed approximate to5,600 of the resulting clones. Compared to sequences from randomly generated in silico libraries, sequences from these clones were found to map more frequently to regions of the genome known to contain regulatory elements, such as regions upstream of genes, within CpG islands, and in sequences that align between mouse and human. These cloned sites also tend to map near genes that have detectable transcripts in CD4(+) T cells, demonstrating that transcriptionally active regions of the genome are being selected. Validation of putative regulatory elements was achieved by repeated recovery of the same sequence and real-time PCR. This cloning strategy, which can be scaled up and applied to any cell line or tissue, will be useful in identifying regulatory elements controlling global expression differences that delineate tissue types, stages of development, and disease susceptibility.
引用
收藏
页码:992 / 997
页数:6
相关论文
共 22 条
[1]   Gene expression analysis by massively parallel signature sequencing (MPSS) on microbead arrays [J].
Brenner, S ;
Johnson, M ;
Bridgham, J ;
Golda, G ;
Lloyd, DH ;
Johnson, D ;
Luo, SJ ;
McCurdy, S ;
Foy, M ;
Ewan, M ;
Roth, R ;
George, D ;
Eletr, S ;
Albrecht, G ;
Vermaas, E ;
Williams, SR ;
Moon, K ;
Burcham, T ;
Pallas, M ;
DuBridge, RB ;
Kirchner, J ;
Fearon, K ;
Mao, J ;
Corcoran, K .
NATURE BIOTECHNOLOGY, 2000, 18 (06) :630-634
[2]   A vision for the future of genomics research [J].
Collins, FS ;
Green, ED ;
Guttmacher, AE ;
Guyer, MS .
NATURE, 2003, 422 (6934) :835-847
[3]   NUCLEASE HYPERSENSITIVE SITES IN CHROMATIN [J].
GROSS, DS ;
GARRARD, WT .
ANNUAL REVIEW OF BIOCHEMISTRY, 1988, 57 :159-197
[4]   THE DNASE-I SENSITIVE DOMAIN OF THE CHICKEN LYSOZYME GENE SPANS 24-KB [J].
JANTZEN, K ;
FRITTON, HP ;
IGOKEMENES, T .
NUCLEIC ACIDS RESEARCH, 1986, 14 (15) :6085-6099
[5]  
Kent WJ, 2002, GENOME RES, V12, P656, DOI [10.1101/gr.229202, 10.1101/gr.229202. Article published online before March 2002]
[6]   CPG ISLANDS AS GENE MARKERS IN THE HUMAN GENOME [J].
LARSEN, F ;
GUNDERSEN, G ;
LOPEZ, R ;
PRYDZ, H .
GENOMICS, 1992, 13 (04) :1095-1107
[7]   Promoter-specific binding of Rap1 revealed by genome-wide maps of protein-DNA association [J].
Lieb, JD ;
Liu, XL ;
Botstein, D ;
Brown, PO .
NATURE GENETICS, 2001, 28 (04) :327-334
[8]   Quantification of DNasel-sensitivity by real-time PCR:: Quantitative analysis of DNasel-hypersensitivity of the mouse β-globin LCR [J].
McArthur, M ;
Gerum, S ;
Stamatoyannopoulos, G .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 313 (01) :27-34
[9]   SSAHA: A fast search method for large DNA databases [J].
Ning, ZM ;
Cox, AJ ;
Mullikin, JC .
GENOME RESEARCH, 2001, 11 (10) :1725-1729
[10]   Genomic strategies to identify mammalian regulatory sequences [J].
Pennacchio, LA ;
Rubin, EM .
NATURE REVIEWS GENETICS, 2001, 2 (02) :100-109