Intron-exon structures of eukaryotic model organisms

被引:323
作者
Deutsch, M [1 ]
Long, M [1 ]
机构
[1] Univ Chicago, Dept Ecol & Evolut, Chicago, IL 60637 USA
基金
美国国家科学基金会;
关键词
D O I
10.1093/nar/27.15.3219
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
To investigate the distribution of intron-exon structures of eukaryotic genes, we have constructed a general exon database comprising ail available intron-containing genes and exon databases from 10 eukaryotic model organisms: Homo sapiens, Mus musculus, Gallus gallus, Rattus norvegicus, Arabidopsis thaliana, Zea mays, Schirosaccharomyces pombe, Aspergillus, Caenorhabditis elegans and Drosophila. We purged redundant genes to avoid the possible bias brought about by redundancy in the databases. After discarding those questionable introns that do not contain correct splice sites, the final database contained 17 102 introns, 21 019 exons and 2903 independent or quasi-independent genes, On average, a eukaryotic gene contains 3.7 introns per kb protein coding region. The exon distribution peaks around 30-40 residues and most introns are 40-125 nt long. The variable intron-exon structures of the 10 model organisms reveal two interesting statistical phenomena, which cast light on some previous speculations. (i) Genome size seems to be correlated with total intron length per gene. For example, invertebrate introns are smaller than those of human genes, while yeast introns are shorter than invertebrate introns, However, this correlation is weak, suggesting that other factors besides genome size may also affect intron size. (ii) Introns smaller than 50 nt are significantly less frequent than longer introns, possibly resulting from a minimum intron size requirement for intron splicing.
引用
收藏
页码:3219 / 3228
页数:10
相关论文
共 19 条
  • [1] COMPARATIVE SEQUENCE-ANALYSIS OF THE HUMAN AND PUFFERFISH HUNTINGTONS-DISEASE GENES
    BAXENDALE, S
    ABDULLA, S
    ELGAR, G
    BUCK, D
    BERKS, M
    MICKLEM, G
    DURBIN, R
    BATES, G
    BRENNER, S
    BECK, S
    LEHRACH, H
    [J]. NATURE GENETICS, 1995, 10 (01) : 67 - 76
  • [2] DYSTROPHIN IS TRANSCRIBED IN BRAIN FROM A DISTANT UPSTREAM PROMOTER
    BOYCE, FM
    BEGGS, AH
    FEENER, C
    KUNKEL, LM
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1991, 88 (04) : 1276 - 1280
  • [3] STRUCTURAL ORGANIZATION OF THE HUMAN TYPE-VII COLLAGEN GENE (COL7A1), COMPOSED OF MORE EXONS THAN ANY PREVIOUSLY CHARACTERIZED GENE
    CHRISTIANO, AM
    HOFFMAN, GG
    CHUNGHONET, LC
    LEE, SB
    CHENG, W
    UITTO, J
    GREENSPAN, DS
    [J]. GENOMICS, 1994, 21 (01) : 169 - 179
  • [4] Biology's new Rosetta stone
    Das, S
    Yu, LH
    Gaitatzes, C
    Rogers, R
    Freeman, J
    Bienkowska, J
    Adams, RM
    Smith, TF
    Lindellen, J
    [J]. NATURE, 1997, 385 (6611) : 29 - 30
  • [5] ANALYSIS OF NONUNIFORMITY IN INTRON PHASE DISTRIBUTION
    FEDOROV, A
    SUBOCH, G
    BUJAKOV, M
    FEDOROVA, L
    [J]. NUCLEIC ACIDS RESEARCH, 1992, 20 (10) : 2553 - 2557
  • [6] The miniaturized nuclear genome of a eukaryotic endosymbiont contains genes that overlap, genes that are cotranscribed, and the smallest known spliceosomal introns
    Gilson, PR
    McFadden, GI
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (15) : 7737 - 7742
  • [7] Life with 6000 genes
    Goffeau, A
    Barrell, BG
    Bussey, H
    Davis, RW
    Dujon, B
    Feldmann, H
    Galibert, F
    Hoheisel, JD
    Jacq, C
    Johnston, M
    Louis, EJ
    Mewes, HW
    Murakami, Y
    Philippsen, P
    Tettelin, H
    Oliver, SG
    [J]. SCIENCE, 1996, 274 (5287) : 546 - &
  • [8] A SURVEY ON INTRON AND EXON LENGTHS
    HAWKINS, JD
    [J]. NUCLEIC ACIDS RESEARCH, 1988, 16 (21) : 9893 - 9908
  • [9] SMALL GENOMES FOR BETTER FLYERS
    HUGHES, AL
    HUGHES, MK
    [J]. NATURE, 1995, 377 (6548) : 391 - 391
  • [10] The yeast splice site revisited: New exon consensus from genomic analysis
    Long, MY
    deSouza, SJ
    Gilbert, W
    [J]. CELL, 1997, 91 (06) : 739 - 740