Diversification of transcriptional modulation: Large-scale identification and characterization of putative alternative promoters of human genes

被引:397
作者
Kimura, K
Wakamatsu, A
Suzuki, Y [1 ]
Ota, T
Nishikawa, T
Yamashita, R
Yamamoto, J
Sekine, M
Tsuritani, K
Wakaguri, H
Ishii, S
Sugiyama, T
Saito, K
Isono, Y
Irie, R
Kushida, N
Yoneyama, T
Otsuka, R
Kanda, K
Yokoi, T
Kondo, H
Wagatsuma, M
Murakawa, K
Ishida, S
Ishibashi, T
Takahashi-Fujii, A
Tanase, T
Nagai, K
Kikuchi, H
Nakai, K
Isogai, T
Sugano, S
机构
[1] Univ Tokyo, Grad Sch Frontier Sci, Dept Med Genome Sci, Minato Ku, Tokyo 1088639, Japan
[2] Reverse Proteom Res Inst, Kisarazu, Chiba 2920818, Japan
[3] Helix Res Inst, Kisarazu, Chiba 2920812, Japan
[4] Hitachi Ltd, Cent Res Lab, Life Sci Res Lab, Kokubunji, Tokyo 1858601, Japan
[5] Univ Tokyo, Inst Med Sci, Ctr Human Genome, Minato Ku, Tokyo 1088639, Japan
[6] Natl Inst Technol & Evaluat, Dept Biotechnol, Genome Anal Ctr, Shibuya Ku, Tokyo 1510066, Japan
[7] Hitachi Ltd, Life Sci Grp, Kawagoe, Saitama 3501165, Japan
[8] Hitachi Sci Syst Ltd, Kokubunji, Tokyo 1858601, Japan
[9] Takara Shuzo Co Ltd, Shiga 5250055, Japan
[10] Hitachi Ltd, Adv Res Lab, Kokubunji, Tokyo 1858601, Japan
关键词
D O I
10.1101/gr.4039406
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
By analyzing 1,780,295 S'-end Sequences of human full-length cDNAs derived from 164 kinds of oligo-cap cDNA libraries, we identified 269,774 independent positions of transcriptional start sites (TSSs) for 14,628 human RefSeq genes. These TSSs were clustered into 30,964 Clusters that were separated from each other by more than 500 bp and thus are very likely to Constitute Mutually distinct alternative promoters. To Our Surprise, at least 7674 (52%) human RefSeq genes were Subject to regulation by Putative alternative promoters (PAPs). Oil average, there were 3.1 PAPs per gene, with the composition of one CpG-island-containing promoter per 2.6 CpG-less promoters. In 17% of the PAP-containing loci, tissue&specific use of the PAPs was observed. The richest tissue Sources of the tissue-specific PAPs were testis and brain. It was also intriguing that the PAP-containing promoters were enriched in the genes encoding signal transduction-related proteins and were rarer in the genes encoding extracellular proteins, possibly reflecting the varied functional requirement for and the restricted expression of those categories of genes, respectively. The patterns of the first exons were highly diverse as well. Oil average, there were 7.7 different splicing types of first exons per locus partly produced by the PAPs, Suggesting that a wide variety of transcripts call be achieved by this mechanism. Our findings Suggest that use of alternate promoters and consequent alternative use of first exons Should play a pivotal role in generating the complexity required for the highly elaborated molecular systems in humans.
引用
收藏
页码:55 / 65
页数:11
相关论文
共 45 条
  • [1] The genome sequence of Drosophila melanogaster
    Adams, MD
    Celniker, SE
    Holt, RA
    Evans, CA
    Gocayne, JD
    Amanatides, PG
    Scherer, SE
    Li, PW
    Hoskins, RA
    Galle, RF
    George, RA
    Lewis, SE
    Richards, S
    Ashburner, M
    Henderson, SN
    Sutton, GG
    Wortman, JR
    Yandell, MD
    Zhang, Q
    Chen, LX
    Brandon, RC
    Rogers, YHC
    Blazej, RG
    Champe, M
    Pfeiffer, BD
    Wan, KH
    Doyle, C
    Baxter, EG
    Helt, G
    Nelson, CR
    Miklos, GLG
    Abril, JF
    Agbayani, A
    An, HJ
    Andrews-Pfannkoch, C
    Baldwin, D
    Ballew, RM
    Basu, A
    Baxendale, J
    Bayraktaroglu, L
    Beasley, EM
    Beeson, KY
    Benos, PV
    Berman, BP
    Bhandari, D
    Bolshakov, S
    Borkova, D
    Botchan, MR
    Bouck, J
    Brokstein, P
    [J]. SCIENCE, 2000, 287 (5461) : 2185 - 2195
  • [2] [Anonymous], 1998, SCIENCE, V282, P2012
  • [3] Protein diversity from alternative splicing: A challenge for bioinformatics and post-genome biology
    Black, DL
    [J]. CELL, 2000, 103 (03) : 367 - 370
  • [4] Comparative genomics: The mouse that roared
    Boguski, MS
    [J]. NATURE, 2002, 420 (6915) : 515 - 516
  • [5] Finishing the euchromatic sequence of the human genome
    Collins, FS
    Lander, ES
    Rogers, J
    Waterston, RH
    [J]. NATURE, 2004, 431 (7011) : 931 - 945
  • [6] Naturally occurring antisense: Transcriptional leakage or real overlap?
    Dahary, D
    Elroy-Stein, O
    Sorek, R
    [J]. GENOME RESEARCH, 2005, 15 (03) : 364 - 368
  • [7] Analysis of expressed sequence tags indicates 35,000 human genes
    Ewing, B
    Green, P
    [J]. NATURE GENETICS, 2000, 25 (02) : 232 - 234
  • [8] The ENCODE (ENCyclopedia of DNA elements) Project
    Feingold, EA
    Good, PJ
    Guyer, MS
    Kamholz, S
    Liefer, L
    Wetterstrand, K
    Collins, FS
    Gingeras, TR
    Kampa, D
    Sekinger, EA
    Cheng, J
    Hirsch, H
    Ghosh, S
    Zhu, Z
    Pate, S
    Piccolboni, A
    Yang, A
    Tammana, H
    Bekiranov, S
    Kapranov, P
    Harrison, R
    Church, G
    Struhl, K
    Ren, B
    Kim, TH
    Barrera, LO
    Qu, C
    Van Calcar, S
    Luna, R
    Glass, CK
    Rosenfeld, MG
    Guigo, R
    Antonarakis, SE
    Birney, E
    Brent, M
    Pachter, L
    Reymond, A
    Dermitzakis, ET
    Dewey, C
    Keefe, D
    Denoeud, F
    Lagarde, J
    Ashurst, J
    Hubbard, T
    Wesselink, JJ
    Castelo, R
    Eyras, E
    Myers, RM
    Sidow, A
    Batzoglou, S
    [J]. SCIENCE, 2004, 306 (5696) : 636 - 640
  • [9] The G-protein-coupled receptors in the human genome form five main families.: Phylogenetic analysis, paralogon groups, and fingerprints
    Fredriksson, R
    Lagerström, MC
    Lundin, LG
    Schiöth, HB
    [J]. MOLECULAR PHARMACOLOGY, 2003, 63 (06) : 1256 - 1272
  • [10] Life with 6000 genes
    Goffeau, A
    Barrell, BG
    Bussey, H
    Davis, RW
    Dujon, B
    Feldmann, H
    Galibert, F
    Hoheisel, JD
    Jacq, C
    Johnston, M
    Louis, EJ
    Mewes, HW
    Murakami, Y
    Philippsen, P
    Tettelin, H
    Oliver, SG
    [J]. SCIENCE, 1996, 274 (5287) : 546 - &