Intron distribution difference for 276 ancient and 131 modern genes suggests the existence of ancient introns

被引:41
作者
Fedorov, A
Cao, XH
Saxonov, S
de Souza, SJ
Roy, SW
Gilbert, W
机构
[1] Harvard Univ, Dept Mol & Cellular Biol, Cambridge, MA 02138 USA
[2] Ludwig Inst Canc Res, Lab Computat Biol, Sao Paulo Branch, BR-01509010 Sao Paulo, Brazil
关键词
D O I
10.1073/pnas.231491498
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Do introns delineate elements of protein tertiary structure? This issue is crucial to the debate about the role and origin of introns. We present an analysis of the full set of proteins with known three-dimensional structures that have homologs with intron positions recorded in GenBank. A computer program was generated that maps on a reference sequence the positions of all introns in homologous genes. We have applied this program to a set of 665 nonredundant protein sequences with defined three-dimensional structures in the Protein Data Bank (PDB), which yielded 8,217 introns in 407 proteins. For the subset of proteins corresponding to ancient conserved regions (ACR), we find that there is a correlation of phase-zero introns with the boundary regions of modules and no correlation for the phase-one and phase-two positions. However, for a subset of proteins without prokaryotic counterparts (131 non-ACR proteins), a set of presumably modern proteins (or proteins that have diverged extremely far from any ancestral form), we do not find any correlation of phase-zero intron positions with three-dimensional structure. Furthermore, we find an anticorrelation of phase-one intron positions with module boundaries: they actually have a preference for the interior of modules. This finding is explicable as a preference for phase-one introns to lie in glycines, between G \ G sequences, the preference for glycines being anticorrelated with the three-dimensional modules. We interpret this anticorrelation as a sign that a number of phase-one introns, and hence many modern introns, have been inserted into G \ G "protosplice" sequences.
引用
收藏
页码:13177 / 13182
页数:6
相关论文
共 22 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   Intron distribution in ancient paralogs supports random insertion and not random loss [J].
Cho, G ;
Doolittle, RF .
JOURNAL OF MOLECULAR EVOLUTION, 1997, 44 (06) :573-584
[3]   Toward a resolution of the introns early/late debate: Only phase zero introns are correlated with the structure of ancient proteins [J].
De Souza, SJ ;
Long, M ;
Kleln, RJ ;
Roy, S ;
Lin, S ;
Gilbert, W .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (09) :5094-5099
[4]   Intron positions correlate with module boundaries in ancient proteins [J].
deSouza, SJ ;
Long, M ;
Schoenbach, L ;
Roy, SW ;
Gilbert, W .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (25) :14632-14636
[5]   GENES IN PIECES - WERE THEY EVER TOGETHER [J].
DOOLITTLE, WF .
NATURE, 1978, 272 (5654) :581-582
[6]   INTRON-DEPENDENT EVOLUTION OF THE NUCLEOTIDE-BINDING DOMAINS WITHIN ALCOHOL-DEHYDROGENASE AND RELATED ENZYMES [J].
DUESTER, G ;
JORNVALL, H ;
HATFIELD, GW .
NUCLEIC ACIDS RESEARCH, 1986, 14 (05) :1931-1941
[7]   ON THE ANCIENT NATURE OF INTRONS [J].
GILBERT, W ;
GLYNIAS, M .
GENE, 1993, 135 (1-2) :137-144
[8]   THE EXON THEORY OF GENES [J].
GILBERT, W .
COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY, 1987, 52 :901-905
[10]   Divergent structures of Caenorhabditis elegans cytochrome P450 genes suggest the frequent loss and gain of introns during the evolution of nematodes [J].
Gotoh, O .
MOLECULAR BIOLOGY AND EVOLUTION, 1998, 15 (11) :1447-1459