A probabilistic model of 3′ end formation in Caenorhabditis elegans

Cited by: 46
Authors
Hajarnavis, A [1]
Korf, I [1]
Durbin, R [1]
Affiliations
[1] Wellcome Trust Sanger Inst, Hinxton CB10 1SA, Cambs, England
Funding
Wellcome Trust (UK)
Keywords
DOI
10.1093/nar/gkh656
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010 ; 081704 ;
Abstract
The 3′ ends of mRNAs terminate with a poly(A) tail. This post-transcriptional modification is directed by sequence features present in the 3′-untranslated region (3′-UTR). We have undertaken a computational analysis of 3′ end formation in Caenorhabditis elegans. By aligning cDNAs that diverge from genomic sequence at the poly(A) tract, we accurately identified a large set of true cleavage sites. When many transcripts align to a particular locus, local variation of the cleavage site over a span of a few bases is frequently observed. We find that, in addition to the well-known AAUAAA motif, there are several regions with distinct nucleotide compositional biases. We propose a generalized hidden Markov model that describes sequence features in C. elegans 3′-UTRs. We find that a computer program employing this model accurately predicts experimentally observed 3′ ends even when there are multiple AAUAAA motifs and multiple cleavage sites. We have made available a complete set of polyadenylation site predictions for the C. elegans genome, including a subset of 6570 supported by aligned transcripts.
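The idea of locating a cleavage site where a cDNA diverges from the genome at its poly(A) tract can be illustrated with a minimal sketch. It assumes an ungapped alignment in which the cDNA and the genomic sequence start at the same position. The function name `find_cleavage_site` and the `min_tail` threshold are hypothetical choices made for this illustration and are not taken from the paper.

```python
from typing import Optional


def find_cleavage_site(genome: str, cdna: str, min_tail: int = 8) -> Optional[int]:
    """Return the 0-based genome index of the last base matched by the cDNA
    before it diverges into a pure poly(A) run, or None if no such tail exists.

    Assumption (not from the paper): both sequences are on the same strand and
    begin at the same coordinate, so no gapped alignment is needed.
    """
    genome = genome.upper()
    cdna = cdna.upper()

    # Walk along both sequences while they agree.
    i = 0
    while i < len(cdna) and i < len(genome) and cdna[i] == genome[i]:
        i += 1

    # Everything in the cDNA past the divergence point must be a poly(A) run.
    tail = cdna[i:]
    if i == 0 or len(tail) < min_tail or set(tail) != {"A"}:
        return None
    return i - 1


genome = "GGCTAATAAACTTGTCCATGCGTTTCCGG"
cdna = "GGCTAATAAACTTGTCCATG" + "A" * 10
print(find_cleavage_site(genome, cdna))
```

When the genomic sequence itself continues with A after the true site, the cDNA tail keeps matching the genome for those bases, so this approach reports a position downstream of the real cleavage point. In such cases the result is best read as the last possible position of the site.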
Pages: 3392-3399
Page count: 8