Identification of alternate polyadenylation sites and analysis of their tissue distribution using EST data

被引:136
作者
Beaudoing, E [1 ]
Gautheret, D [1 ]
机构
[1] Ctr Natl Rech Sci, Inst Natl Sante & Rech Med, Ctr Immunol Marseille Luminy, Marseille 09, France
关键词
D O I
10.1101/gr.190501
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Alternate polyadenylation affects a large fraction of higher eucaryote mRNAs, producing Mature transcripts with 3 ' ends of variable length. This variation is poorly represented in the Current transcript catalog derived from whole genome sequences, mostly because Such posttranscriptional events are not detectable directly at the DNA level. Alternate polydenylation of all mRNA is better understood by comparision to EST databases. Comparing ESTs to mRNAs, however, is a difficult task subjected to the pitfalls of internal priming I presence of intron sequences, repeated elements, chimerical ESTs or matches with EST from paralogous genes. We present here a computer program that addresses these problems and displays ESTs matches to a query mRNA sequence to predict alternate polyadenylation and to suggest library-specific forms. The Output highlights effective polyadenylation Signals, Possible Sources of artifacts such as A-rich stretches in the mRNA Sequences, and allows for a direct visualization of EST libraries using color codes. Statistical biases in the distribution of alternative mRNA forms among EST libraries were systematically Sought. About 1450 human and 200 Mouse mRNAs displayed such biases, suggesting in each case a tissue- or disease-specific regulation of polyadenylation.
引用
收藏
页码:1520 / 1526
页数:7
相关论文
共 20 条
  • [1] Agresti A., 1992, STAT SCI, V7, P131, DOI DOI 10.1214/SS/1177011454
  • [2] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [3] Patterns of variant polyadenylation signal usage in human genes
    Beaudoing, E
    Freier, S
    Wyatt, JR
    Claverie, JM
    Gautheret, D
    [J]. GENOME RESEARCH, 2000, 10 (07) : 1001 - 1010
  • [4] DBEST - DATABASE FOR EXPRESSED SEQUENCE TAGS
    BOGUSKI, MS
    LOWE, TMJ
    TOLSTOSHEV, CM
    [J]. NATURE GENETICS, 1993, 4 (04) : 332 - 333
  • [5] Mechanism and regulation of mRNA polyadenylation
    Colgan, DF
    Manley, JL
    [J]. GENES & DEVELOPMENT, 1997, 11 (21) : 2755 - 2766
  • [6] The 3′ untranslated region of messenger RNA:: A molecular 'hotspot' for pathology?
    Conne, B
    Stutz, A
    Vassalli, JD
    [J]. NATURE MEDICINE, 2000, 6 (06) : 637 - 641
  • [7] Alternative poly(A) site selection in complex transcription units: Means to an end?
    EdwaldsGilbert, G
    Veraldi, KL
    Milcarek, C
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (13) : 2547 - 2561
  • [8] Alternate polyadenylation in human mRNAs: A large-scale analysis by EST clustering
    Gautheret, D
    Poirot, O
    Lopez, F
    Audic, S
    Claverie, JM
    [J]. GENOME RESEARCH, 1998, 8 (05): : 524 - 530
  • [9] In silico detection of control signals:: mRNA 3′-end-processing sequences in diverse species
    Graber, JH
    Cantor, CR
    Mohr, SC
    Smith, TF
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (24) : 14055 - 14060
  • [10] Repbase Update - a database and an electronic journal of repetitive elements
    Jurka, J
    [J]. TRENDS IN GENETICS, 2000, 16 (09) : 418 - 420