SIMILARITY LANDSCAPES - A WAY TO DETECT MANY STRUCTURAL AND SEQUENCE MOTIFS IN BOTH INTRONS AND EXONS

被引:5
作者
HULTNER, M
SMITH, DW
WILLS, C
机构
[1] UNIV CALIF SAN DIEGO, DEPT BIOL 0116, LA JOLLA, CA 92093 USA
[2] UNIV CALIF SAN DIEGO, CTR MOLEC GENET, LA JOLLA, CA 92093 USA
关键词
FASTA; SIMILARITIES; INTRONS; EXONS; MOTIFS; LEUCINE ZIPPER; DNA BINDING REGIONS; DISTANT RESEMBLANCES;
D O I
10.1007/BF00166165
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
When investigators undertake searches of DNA databases, they normally discard large numbers of alignments that demonstrate very weak resemblances to each other, retaining only those that show statistically significant levels of resemblance. We show here that a great deal of information can be extracted from these weak alignments by examining them en masse. This is done by building three-dimensional similarity landscapes from the alignments, landscapes that reveal whether an unusual number of individually nonsignificant alignments tend to match up to a particular region of the query sequence being searched. The power of the search is increased by the use of libraries consisting entirely of introns or of exons. We show that (1) similarity landscapes with a variety of features can be generated from both intron and exon libraries, using introns or exons as query sequences; (2) the landscape features are real and not a statistical artifact; (3) well-known protein motifs used as query sequences can generate various landscape features; and (4) there is some evidence for resemblances between short regions of sequence carried by introns and exons. One possible interpretation of these results is that both introns and exons may have been built up during their evolution from short regions of sequence that as a result are now widely distributed throughout eukaryotic genomes. Such an interpretation would imply that these short regions have common ancestry. Alternatively, the wide sharing of short pieces of DNA may reflect regions with particular structural properties that have arisen through convergent evolution. The similarity-landscape approach can be used to detect such widespread structural motifs and sequence motifs in the genome that might be missed by less-global searches. It can also be used in conjunction with algorithms developed for detecting significant multiple alignments by isolating promising subsets of the databases that can be examined in more detail.
引用
收藏
页码:188 / 203
页数:16
相关论文
共 21 条
[1]   PROTEIN DATABASE SEARCHES FOR MULTIPLE ALIGNMENTS [J].
ALTSCHUL, SF ;
LIPMAN, DJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1990, 87 (14) :5509-5513
[2]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[3]   ANALYSIS OF THE HUMAN APOLIPOPROTEIN-B GENE - COMPLETE STRUCTURE OF THE B-74 REGION [J].
CARLSSON, P ;
DARNFORS, C ;
OLOFSSON, SO ;
BJURSELL, G .
GENE, 1986, 49 (01) :29-51
[4]   GENOMIC AND CDNA CLONING OF THE HUMAN C1 INHIBITOR - INTRON-EXON JUNCTIONS AND COMPARISON WITH OTHER SERPINS [J].
CARTER, PE ;
DUNBAR, B ;
FOTHERGILL, JE .
EUROPEAN JOURNAL OF BIOCHEMISTRY, 1988, 173 (01) :163-169
[5]   SEQUENCE LANDSCAPES [J].
CLIFT, B ;
HAUSSLER, D ;
MCCONNELL, R ;
SCHNEIDER, TD ;
STORMO, GD .
NUCLEIC ACIDS RESEARCH, 1986, 14 (01) :141-158
[6]   IMPROVED FREE-ENERGY PARAMETERS FOR PREDICTIONS OF RNA DUPLEX STABILITY [J].
FREIER, SM ;
KIERZEK, R ;
JAEGER, JA ;
SUGIMOTO, N ;
CARUTHERS, MH ;
NEILSON, T ;
TURNER, DH .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1986, 83 (24) :9373-9377
[7]   EXHAUSTIVE MATCHING OF THE ENTIRE PROTEIN-SEQUENCE DATABASE [J].
GONNET, GH ;
COHEN, MA ;
BENNER, SA .
SCIENCE, 1992, 256 (5062) :1443-1445
[8]   PROFILE ANALYSIS - DETECTION OF DISTANTLY RELATED PROTEINS [J].
GRIBSKOV, M ;
MCLACHLAN, AD ;
EISENBERG, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1987, 84 (13) :4355-4358
[9]  
KONOPKA AK, 1987, GENE ANAL TECH, V4, P63
[10]   THE ROLE OF THE LEUCINE ZIPPER IN THE FOS JUN INTERACTION [J].
KOUZARIDES, T ;
ZIFF, E .
NATURE, 1988, 336 (6200) :646-651