AN INTERACTIVE FRAMEWORK FOR RNA SECONDARY STRUCTURE PREDICTION WITH A DYNAMICAL TREATMENT OF CONSTRAINTS

被引:41
作者
GASPIN, C
WESTHOF, E
机构
[1] INRA, SBIA, F-31326 CASTANET TOLOSAN, FRANCE
[2] CNRS, INST BIOL MOLEC & CELLULAIRE, F-67084 STRASBOURG, FRANCE
关键词
RNA; SECONDARY STRUCTURE; PREDICTION; CONSTRAINT SATISFACTION PROBLEMS; CLIQUES;
D O I
10.1006/jmbi.1995.0608
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A novel approach aiding in the prediction of RNA secondary structures is presented. Although phylogenetic methods are the most successful at deriving RNA secondary structures, the are not applicable when the number of sequences or the sequence variability is too low. Methods based on energy minimization are therefore of great interest. However, some of the suboptimal RNA secondary structures computed with classic methods are unsaturated structures, i.e. some structures are included into others. Thus, the incorporation of constraints during the process of folding is not possible, while the incorporation of constraints before the process of folding often introduces a bias into the energy function. This paper describes a new procedure which allows for the incorporation of constraints before and during the process of RNA folding. SAPSSARN is an interactive program which offers a framework, both to specify a secondary structure through a set of folding constraints and to compute all the supoptimal saturated RNA secondary structures which satisfy all the folding constraints. At the start, it relies on the computation of the probabilities of pairing of each base with all others according to McCaskill's algorithm. The constraint satisfaction formulation of the problem deals dynamically with a chosen set of folding constraints and, finally, a search algorithm computes all the suboptimal saturated secondary structures which satisfy those folding constraints. Within such a framework, it is possible to test new ideas about RNA folding and secondary structures, including pseudoknots, can be computed. The program is illustrated with RNA sequences on, which we obtained results in agreement with known structures by using a protocol which mimics the hierarchical folding of RNA molecules. (C) 1995 Academic Press Limited
引用
收藏
页码:163 / 174
页数:12
相关论文
共 35 条
  • [1] ALTMAN RB, 1994, P 2 INT C INT SYST M, P10
  • [2] THERMODYNAMIC PARAMETERS FOR LOOP FORMATION IN RNA AND DNA HAIRPIN TETRALOOPS
    ANTAO, VP
    TINOCO, I
    [J]. NUCLEIC ACIDS RESEARCH, 1992, 20 (04) : 819 - 824
  • [3] COMPUTER-AIDED PREDICTION OF RNA SECONDARY STRUCTURES
    AURON, PE
    RINDONE, WP
    VARY, CPH
    CELENTANO, JJ
    VOURNAKIS, JN
    [J]. NUCLEIC ACIDS RESEARCH, 1982, 10 (01) : 403 - 419
  • [4] BESSIERE C, 1991, PROCEEDINGS : NINTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 AND 2, P221
  • [5] BESSIERE C, 1995, P IJCAI 95, P592
  • [6] FINDING ALL CLIQUES OF AN UNDIRECTED GRAPH [H]
    BRON, C
    KERBOSCH, J
    [J]. COMMUNICATIONS OF THE ACM, 1973, 16 (09) : 575 - 577
  • [7] CEDERGREN R, 1988, COMPUT APPL BIOSCI, V4, P143
  • [8] GASPIN C, 1994, ADV MOL BIOINFORMATI, P103
  • [9] GOUY M, 1989, NUCLEIC ACID PROTEIN, P259
  • [10] FOLDING OF GROUP-I INTRONS FROM BACTERIOPHAGE-T4 INVOLVES INTERNALIZATION OF THE CATALYTIC CORE
    HEUER, TS
    CHANDRY, PS
    BELFORT, M
    CELANDER, DW
    CECH, TR
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1991, 88 (24) : 11105 - 11109