Distribution and characterization of regulatory elements in the human genome

Cited by: 244
Authors
Majewski, J [1]
Ott, J [1]
Affiliations
[1] Rockefeller Univ, New York, NY 10021 USA
DOI
10.1101/gr.606402
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
The regulation of transcription and subsequent gene splicing are crucial to correct gene expression. Although a number of regulatory sequences involved in both processes are known, it is not clear how general their functions are in the genomic context, nor how the regulatory regions are distributed throughout the genome. Here we study the distribution of known mutagenic elements within human introns and exons to deduce the properties of regions essential for splicing and transcription. We show that intronic splicing regulators are generally found close to the splice sites, but may be found as far as 200 nucleotides away from the splice junctions. Similarly, sequences important for splicing may be located as far as 125 nucleotides away from the junctions, within exons. We characterize several types of simple repetitive sequences and low-complexity regions that are overrepresented close to both intron ends and are likely to play important roles in the splicing process. We show that the first introns within most genes play a particularly important regulatory role that is most likely, however, to be involved in transcription control. We also study the distribution of two known regulatory motifs, the GGG trinucleotide and the CpG dinucleotide, and deduce their respective importance to splicing and transcription regulation.
Pages: 1827-1836
Number of pages: 10