Patterns of variant polyadenylation signal usage in human genes

被引:543
作者
Beaudoing, E [1 ]
Freier, S [1 ]
Wyatt, JR [1 ]
Claverie, JM [1 ]
Gautheret, D [1 ]
机构
[1] Struct & Genet Informat Lab, CNRS, UMR 1889, F-13402 Marseille 20, France
关键词
D O I
10.1101/gr.10.7.1001
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The formation of mature mRNAs in vertebrates involves the cleavage and polyadenylation of the pre-mRNA, 10-30 nt downstream of an AAUAAA or AUUAAA signal sequence. The extensive cDNA data now available shows that these hexamers are not strictly conserved. In order to identify variant polyadenylation signals on a large scale, we compared over 8700 human 3' untranslated sequences to 157,775 polyadenylated expressed sequence tags (ESTs], used as markers of actual mRNA 3' ends. About 5600 EST-supported putative mRNA 3' ends were collected and analyzed for significant hexameric sequences. Known polyadenylation signals were found in only 73% of the 3' fragments. Ten single-base variants of the AAUAAA sequence were identified with a highly significant occurrence rate, potentially representing 14.9% of the actual polyadenylation signals. Of the mRNAs, 28.6% displayed two or more polyadenylation sites. In these mRNAs, the poly(A) sites proximal to the coding sequence tend to use variant signals more often, while the 3'-most site tends to use a canonical signal. The average number of ESTs associated with each signal type suggests that variant signals (including the common AUUAAA] are processed less efficiently than the canonical signal and could therefore be selected for regulatory purposes. However, the position of the site in the untranslated region may also play a role in polyadenylation rate.
引用
收藏
页码:1001 / 1010
页数:10
相关论文
共 63 条
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] Multiple transcripts of the murine immunoglobulin epsilon membrane locus are generated by alternative splicing and differential usage of two polyadenylation sites
    Anand, S
    Batista, FD
    Tkach, T
    Efremov, DG
    Burrone, OR
    [J]. MOLECULAR IMMUNOLOGY, 1997, 34 (02) : 175 - 183
  • [3] HIERARCHY OF POLYADENYLATION SITE USAGE BY BOVINE PAPILLOMAVIRUS IN TRANSFORMED MOUSE CELLS
    ANDREWS, EM
    DIMAIO, D
    [J]. JOURNAL OF VIROLOGY, 1993, 67 (12) : 7705 - 7710
  • [4] Presence of multiple functional polyadenylation signals and a single nucleotide polymorphism in the 3′ untranslated region of the human serotonin transporter gene
    Battersby, S
    Ogilvie, AD
    Blackwood, DHR
    Shen, SB
    Muqit, MMK
    Muir, WJ
    Teague, P
    Goodwin, GM
    Harmar, AJ
    [J]. JOURNAL OF NEUROCHEMISTRY, 1999, 72 (04) : 1384 - 1388
  • [5] DBEST - DATABASE FOR EXPRESSED SEQUENCE TAGS
    BOGUSKI, MS
    LOWE, TMJ
    TOLSTOSHEV, CM
    [J]. NATURE GENETICS, 1993, 4 (04) : 332 - 333
  • [6] AU-RICH ELEMENTS - CHARACTERIZATION AND IMPORTANCE IN MESSENGER-RNA DEGRADATION
    CHEN, CYA
    SHYU, AB
    [J]. TRENDS IN BIOCHEMICAL SCIENCES, 1995, 20 (11) : 465 - 470
  • [7] CHENG JF, 1986, J BIOL CHEM, V261, P839
  • [8] Computational methods for the identification of genes in vertebrate genomic sequences
    Claverie, JM
    [J]. HUMAN MOLECULAR GENETICS, 1997, 6 (10) : 1735 - 1744
  • [9] Mechanism and regulation of mRNA polyadenylation
    Colgan, DF
    Manley, JL
    [J]. GENES & DEVELOPMENT, 1997, 11 (21) : 2755 - 2766
  • [10] STRUCTURAL CHARACTERIZATION AND PROMOTER ACTIVITY ANALYSIS OF THE GAMMA-KAFIRIN GENE FROM SORGHUM
    DEFREITAS, FA
    YUNES, JA
    DASILVA, MJ
    ARRUDA, P
    LEITE, A
    [J]. MOLECULAR AND GENERAL GENETICS, 1994, 245 (02): : 177 - 186