Statistical analysis of the exon-intron structure of higher and lower eukaryote genes

Cited by: 25
Authors
Kriventseva, EV
Gelfand, MS
Affiliations
[1] Russian Acad Sci, VA Engelhardt Mol Biol Inst, Moscow 117984, Russia
[2] State Sci Ctr Biotechnol NIIGenet, Moscow 113545, Russia
Funding
Russian Foundation for Basic Research
DOI
10.1080/07391102.1999.10508361
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Statistics of the exon-intron structure and splicing sites of several diverse eukaryotes were studied. The yeast exon-intron structures have a number of unique features. A yeast gene usually has at most one intron. The branch site is strongly conserved, whereas the polypyrimidine tract is short. Long yeast introns tend to have stronger acceptor sites. In other species the branch site is less conserved and often cannot be determined. In non-yeast samples there is an almost universal correlation between lengths of neighboring exons (all samples excluding protists) and a correlation between lengths of neighboring introns (human, drosophila, protists). On average, first introns are longer, and anomalously long introns are usually first introns in a gene. There is a universal preference for exons and exon pairs with the (total) length divisible by 3. Introns positioned between codons are preferred, whereas those positioned between the first and second positions in a codon are avoided. The choice of A or G at the third position of the intron (the donor splice sites generally prefer purines at this position) is correlated with the overall GC-composition of the gene. In all samples the dinucleotide AG is avoided in the region preceding the acceptor site.
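The phase and length statistics mentioned in the abstract can be illustrated with a minimal sketch. It assumes a gene is represented simply as a list of coding-exon lengths in nucleotides, counted from the start codon; the function names `intron_phases`, `neighbor_pair_lengths`, and `fraction_divisible_by_3` are hypothetical and are not taken from the paper, and the example lengths are made up. An intron between codons has phase 0, one between the first and second codon positions has phase 1, and one between the second and third positions has phase 2.

```python
from itertools import accumulate


def intron_phases(exon_lengths):
    """Return the phase of each intron, given coding-exon lengths in order.

    Phase = (total coding length upstream of the intron) mod 3.
    """
    # Cumulative coding length after each exon; the last exon is
    # not followed by an intron, so its total is dropped.
    cumulative = list(accumulate(exon_lengths))
    return [total % 3 for total in cumulative[:-1]]


def neighbor_pair_lengths(exon_lengths):
    """Return the total length of each pair of neighboring exons."""
    return [a + b for a, b in zip(exon_lengths, exon_lengths[1:])]


def fraction_divisible_by_3(lengths):
    """Return the share of lengths that are multiples of 3 (0.0 if empty)."""
    if not lengths:
        return 0.0
    return sum(1 for n in lengths if n % 3 == 0) / len(lengths)


# Made-up example gene with four coding exons.
exons = [120, 85, 98, 60]
phases = intron_phases(exons)
single_share = fraction_divisible_by_3(exons)
pair_share = fraction_divisible_by_3(neighbor_pair_lengths(exons))
```

Applied to a real sample, the same counts would be tallied over many genes and compared with the roughly one-third share expected if lengths were unrelated to the reading frame; this sketch does not attempt the significance tests a full analysis would need.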
Pages: 281-288
Page count: 8