Identification and characterization of the potential promoter regions of 1031 kinds of human genes

被引:193
作者
Suzuki, Y [1 ]
Tsunoda, T
Sese, J
Taira, H
Mizushima-Sugano, J
Hata, H
Ota, T
Isogai, T
Tanaka, T
Nakamura, Y
Suyama, A
Sakaki, Y
Morishita, S
Okubo, K
Sugano, S
机构
[1] Univ Tokyo, Inst Med Sci, Dept Virol, Minato Ku, Tokyo 1088639, Japan
[2] Univ Tokyo, Inst Med Sci, Ctr Human Genome, Minato Ku, Tokyo 1088639, Japan
[3] RIKEN, Inst Phys & Chem Res, Genom Sci Ctr, Wako, Saitama 3510106, Japan
[4] Univ Tokyo, Grad Sch Frontier Sci, Dept Complex Sci & Engn, Bunkyo Ku, Tokyo 1130033, Japan
[5] NTT Corp, Commun Sci Labs, Intelligent Commun Lab, Seiko, Kyoto 6190237, Japan
[6] Helix Res Inst, Chiba 2920812, Japan
[7] Univ Tokyo, Dept Life Sci, Meguro Ku, Tokyo 1530041, Japan
[8] Osaka Univ, Inst Mol & Cell Biol, Suita, Osaka 5650871, Japan
关键词
D O I
10.1101/gr.GR-1640R
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
To understand the mechanism of transcriptional regulation, it is essential to identify and characterize the promoter, which is located proximal to the mRNA start site. To identify the promoters from the large volumes of genomic sequences, we used mRNA start sites determined by a large-scale sequencing of the cDNA libraries constructed by the "oligo-capping" method. We aligned the mRNA start sites with the genomic sequences and retrieved adjacent sequences as potential promoter regions (PPRs) for 1031 genes. The PPR sequences were searched to determine the frequencies of major promoter elements. Among 1031 PPRs, 329 (32%) contained TATA boxes, 872 (85%) contained initiators, 999 (97%) contained CC box, and 663 (64%) contained CAAT box. Furthermore, 493 (48%) PPRs were located in CpG islands. This frequency of CpG islands was reduced in TATA(+)/lnr(+) PPRs and in the PPRs of ubiquitously expressed genes. In the PPRs of the CGM2 gene, the DRA gene, and the TM30pl genes, which showed highly colon specific expression patterns, the consensus sequences of E boxes were commonly observed. The PPRs were also useful For exploring promoter SNPs.
引用
收藏
页码:677 / 684
页数:8
相关论文
共 30 条
[1]  
ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
[2]   SIZING AND MAPPING OF EARLY ADENOVIRUS MESSENGER-RNAS BY GEL-ELECTROPHORESIS OF S1 ENDONUCLEASE-DIGESTED HYBRIDS [J].
BERK, AJ ;
SHARP, PA .
CELL, 1977, 12 (03) :721-732
[3]   The essence of SNPs [J].
Brookes, AJ .
GENE, 1999, 234 (02) :177-186
[4]   WEIGHT MATRIX DESCRIPTIONS OF 4 EUKARYOTIC RNA POLYMERASE-II PROMOTER ELEMENTS DERIVED FROM 502 UNRELATED PROMOTER SEQUENCES [J].
BUCHER, P .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 212 (04) :563-578
[5]   Aberrant CpG-island methylation has non-random and tumour-type-specific patterns [J].
Costello, JF ;
Frühwald, MC ;
Smiraglia, DJ ;
Rush, LJ ;
Robertson, GP ;
Gao, X ;
Wright, FA ;
Feramisco, JD ;
Peltomäki, P ;
Lang, JC ;
Schuller, DE ;
Yu, L ;
Bloomfield, CD ;
Caligiuri, MA ;
Yates, A ;
Nishikawa, R ;
Huang, HJS ;
Petrelli, NJ ;
Zhang, XL ;
O'Dorisio, MS ;
Held, WA ;
Cavenee, WK ;
Plass, C .
NATURE GENETICS, 2000, 24 (02) :132-138
[6]   CPG ISLANDS AND GENES [J].
CROSS, SH ;
BIRD, AP .
CURRENT OPINION IN GENETICS & DEVELOPMENT, 1995, 5 (03) :309-314
[7]   CPG ISLANDS IN VERTEBRATE GENOMES [J].
GARDINERGARDEN, M ;
FROMMER, M .
JOURNAL OF MOLECULAR BIOLOGY, 1987, 196 (02) :261-282
[8]   TRANSCRIPTIONAL REGULATION OF THE CARCINOEMBRYONIC ANTIGEN GENE - IDENTIFICATION OF REGULATORY ELEMENTS AND MULTIPLE NUCLEAR FACTORS [J].
HAUCK, W ;
STANNERS, CP .
JOURNAL OF BIOLOGICAL CHEMISTRY, 1995, 270 (08) :3602-3610
[9]   TRANSCRIPTIONAL CONTROL OF THE HUMAN BILIARY GLYCOPROTEIN GENE, A CEA GENE FAMILY MEMBER DOWN-REGULATED IN COLORECTAL CARCINOMAS [J].
HAUCK, W ;
NEDELLEC, P ;
TURBIDE, C ;
STANNERS, CP ;
BARNETT, TR ;
BEAUCHEMIN, N .
EUROPEAN JOURNAL OF BIOCHEMISTRY, 1994, 223 (02) :529-541
[10]   Expanding the TRANSFAC database towards an expert system of regulatory molecular mechanisms [J].
Heinemeyer, T ;
Chen, X ;
Karas, H ;
Kel, AE ;
Kel, OV ;
Liebich, I ;
Meinhardt, T ;
Reuter, I ;
Schacherer, F ;
Wingender, E .
NUCLEIC ACIDS RESEARCH, 1999, 27 (01) :318-322