HOW TO SEARCH FOR RNA STRUCTURES THEORETICAL CONCEPTS IN EVOLUTIONARY BIOTECHNOLOGY

被引:45
作者
SCHUSTER, P
机构
[1] Institut für Molekulare Biotechnologie e.V., D-07708 Jena, Beutenbergstraße 11
关键词
APPLIED MOLECULAR EVOLUTION; DARWINS PRINCIPLE; EVOLUTIONARY BIOTECHNOLOGY; INVERSE FOLDING; RNA SECONDARY STRUCTURE; SEQUENCE SPACE; SHAPE SPACE;
D O I
10.1016/0168-1656(94)00085-Q
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The relation between RNA sequences and minimum free energy secondary structures is viewed as a mapping from sequence space into shape space. The properties of such mappings depend strongly on the ratios of the numbers of sequences and structures and, hence, substantial differences are observed between samples of structures derived from AUGC, pure AU or pure GC sequences. Statistical analysis of large samples is used to demonstrate that structures from AUGC sequences are much less sensitive to point mutations than those from sequences containing exclusively AU or GC. The frequency with which a structure is realized in sequence space is inversely proportional to some power c > 1 of the structure's frequency rank, thus following a (generalized) Zipf law. For long sequences the exponent approaches c = 1. An inverse folding algorithm is used to compute samples of sequences folding into the same secondary structure. These sequences are distributed randomly in sequence space. Common structures form extended neutral networks along which populations can migrate through the entire sequence space without changing structure. In this migration, moves of Hamming distance d = 1 and d = 2 are accepted in order to allow for base and base pair exchanges, respectively. Around any arbitrarily chosen sequence a ball that contains sequences folding into all common structures can be drawn. This ball has a diameter that is much smaller than the diameter of sequence space. Hence, only a small fraction of sequence space needs to be searched in order to find a given structure. The results derived from the mapping of sequences into structures are used to suggest a rationale for evolutionary searches on RNA structures: selection cycles with high and low mutation rates applied in alternation. Generalizations of the results to RNA 3-D structures and protein structures are discussed.
引用
收藏
页码:239 / 257
页数:19
相关论文
共 47 条
[11]   RNA SELECTION - APTAMERS ACHIEVE THE DESIRED RECOGNITION [J].
ELLINGTON, AD .
CURRENT BIOLOGY, 1994, 4 (05) :427-429
[12]   INVITRO SELECTION OF RNA MOLECULES THAT BIND SPECIFIC LIGANDS [J].
ELLINGTON, AD ;
SZOSTAK, JW .
NATURE, 1990, 346 (6287) :818-822
[13]   RNA FOLDING AND COMBINATORY LANDSCAPES [J].
FONTANA, W ;
STADLER, PF ;
BORNBERGBAUER, EG ;
GRIESMACHER, T ;
HOFACKER, IL ;
TACKER, M ;
TARAZONA, P ;
WEINBERGER, ED ;
SCHUSTER, P .
PHYSICAL REVIEW E, 1993, 47 (03) :2083-2099
[14]   STATISTICS OF RNA SECONDARY STRUCTURES [J].
FONTANA, W ;
KONINGS, DAM ;
STADLER, PF ;
SCHUSTER, P .
BIOPOLYMERS, 1993, 33 (09) :1389-1404
[15]   STATISTICS OF LANDSCAPES BASED ON FREE-ENERGIES, REPLICATION AND DEGRADATION RATE CONSTANTS OF RNA SECONDARY STRUCTURES [J].
FONTANA, W ;
GRIESMACHER, T ;
SCHNABL, W ;
STADLER, PF ;
SCHUSTER, P .
MONATSHEFTE FUR CHEMIE, 1991, 122 (10) :795-819
[16]   COMPARATIVE-STUDIES OF RNA - INFERRING HIGHER-ORDER STRUCTURE FROM PATTERNS OF SEQUENCE VARIATION [J].
GUTELL, RR .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1993, 3 (03) :313-322
[17]   ERROR DETECTING AND ERROR CORRECTING CODES [J].
HAMMING, RW .
BELL SYSTEM TECHNICAL JOURNAL, 1950, 29 (02) :147-160
[18]   FAST FOLDING AND COMPARISON OF RNA SECONDARY STRUCTURES [J].
HOFACKER, IL ;
FONTANA, W ;
STADLER, PF ;
BONHOEFFER, LS ;
TACKER, M ;
SCHUSTER, P .
MONATSHEFTE FUR CHEMIE, 1994, 125 (02) :167-188
[19]  
HOFACKER IL, 1994, IN PRESS SIAM J DISC
[20]   SELECTION OF NEW BIOLOGICAL-ACTIVITIES FROM RANDOM NUCLEOTIDE-SEQUENCES - EVOLUTIONARY AND PRACTICAL CONSIDERATIONS [J].
HORWITZ, MSZ ;
DUBE, DK ;
LOEB, LA .
GENOME, 1989, 31 (01) :112-117