HUMAN PRE-MESSENGER-RNA SPLICING SIGNALS

Cited by: 59
Author
PENOTTI, FE
Affiliation
[1] Consiglio Nazionale delle Ricerche, Servizio Informatico Area Milanese, 20131 Milano, MI
DOI
10.1016/S0022-5193(05)80436-9
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification code
07; 0710; 09
Abstract
A sample of 764 pairs of human pre-mRNA exon-intron and intron-exon boundaries, extracted from the European Molecular Biology Laboratory data bank, is analyzed to provide a species-optimized characterization of donor and acceptor sites, evaluate the information content of the two signals (found to be about 8 and 9 bits respectively) and check the independent-base approximation (which holds well) and the "GT-AG" rule (to which a few well-documented exceptions are found). No correlation is detected between the strength ("discrimination energy") of an actual donor-site signal and that of its corresponding acceptor-site counterpart, nor between that of either signal, or the cumulative strength of both, and the length of the intervening intron. The discrimination-energy distributions of the two signals are determined. Because of the large sample size and its single-species origin, the two distributions can be presumed to be representative of their underlying genomic counterparts. The size distribution of the introns shows a lower cut-off of 70 nucleotides (in essential agreement with published experimental results), and apparently no periodicities. A smaller sample of mammalian branch sites, taken from the literature, is similarly analyzed to attempt a characterization of this rather elusive signal, and provides some indication that at least part of the "long pyrimidine stretch", usually considered an integral constituent of the 3′ splice signal, may be just as strongly associated with the branch site, in agreement with recent experimental observations. The usefulness of these characterizations for splice-junction searches is assessed on a test sequence. © 1991 Academic Press Limited.
Pages: 385-420
Page count: 36
Related papers
38 in total
[1]   1ST GENOMIC SEQUENCE OF A HUMAN IG VARIABLE LAMBDA-GENE BELONGING TO SUBGROUP-I - FUNCTIONAL GENES, PSEUDOGENES AND VESTIGIAL SEQUENCES ARE INTERSPERSED IN THE IGLV LOCUS [J].
ALEXANDRE, D ;
CHUCHANA, P ;
BROCKLY, F ;
BLANCHER, A ;
LEFRANC, G ;
LEFRANC, MP .
NUCLEIC ACIDS RESEARCH, 1989, 17 (10) :3975-3975
[2]  
[Anonymous], 1986, NUMERICAL RECIPES
[3]  
BELL GI, 1980, NATURE, V284, P26, DOI 10.1038/284026a0
[4]   SELECTION OF DNA-BINDING SITES BY REGULATORY PROTEINS .2. THE BINDING-SPECIFICITY OF CYCLIC-AMP RECEPTOR PROTEIN TO RECOGNITION SITES [J].
BERG, OG ;
VONHIPPEL, PH .
JOURNAL OF MOLECULAR BIOLOGY, 1988, 200 (04) :709-723
[5]   SELECTION OF DNA-BINDING SITES BY REGULATORY PROTEINS - STATISTICAL-MECHANICAL THEORY AND APPLICATION TO OPERATORS AND PROMOTERS [J].
BERG, OG ;
VONHIPPEL, PH .
JOURNAL OF MOLECULAR BIOLOGY, 1987, 193 (04) :723-743
[6]  
BREATHNACH R, 1981, ANNU REV BIOCHEM, V50, P349, DOI 10.1146/annurev.bi.50.070181.002025
[7]   COMPARATIVE ANATOMY OF THE HUMAN APRT GENE AND ENZYME - NUCLEOTIDE-SEQUENCE DIVERGENCE AND CONSERVATION OF A NONRANDOM CPG DINUCLEOTIDE ARRANGEMENT [J].
BRODERICK, TP ;
SCHAFF, DA ;
BERTINO, AM ;
DUSH, MK ;
TISCHFIELD, JA ;
STAMBROOK, PJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1987, 84 (10) :3349-3353
[8]  
COOL DE, 1987, J BIOL CHEM, V262, P13662
[9]   NUCLEOTIDE-SEQUENCE OF THE GENE FOR HUMAN-PROTHROMBIN [J].
DEGEN, SJF ;
DAVIE, EW .
BIOCHEMISTRY, 1987, 26 (19) :6165-6177
[10]   INFORMATION-CONTENT OF CAENORHABDITIS-ELEGANS SPLICE SITE SEQUENCES VARIES WITH INTRON LENGTH [J].
FIELDS, C .
NUCLEIC ACIDS RESEARCH, 1990, 18 (06) :1509-1512