STOCHASTIC CONTEXT-FREE GRAMMARS FOR TRANSFER-RNA MODELING

被引:238
作者
SAKAKIBARA, Y
BROWN, M
HUGHEY, R
MIAN, IS
SJOLANDER, K
UNDERWOOD, RC
HAUSSLER, D
机构
[1] UNIV CALIF SANTA CRUZ, COMP & INFORMAT SCI LABS, SANTA CRUZ, CA 95064 USA
[2] UNIV CALIF SANTA CRUZ, COMP ENGN LABS, SANTA CRUZ, CA 95064 USA
[3] UNIV CALIF SANTA CRUZ, SINSHEIMER LABS, SANTA CRUZ, CA 95064 USA
关键词
D O I
10.1093/nar/22.23.5112
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Stochastic context-free grammars (SCFGs) are applied to the problems of folding, aligning and modeling families of tRNA sequences. SCFGs capture the sequences' common primary and secondary structure and generalize the hidden Markov models (HMMs) used in related work on protein and DNA. Results show that after having been trained on as few as 20 tRNA sequences from only two tRNA subfamilies (mitochondrial and cytoplasmic), the model can discern general tRNA from similar-length RNA sequences of other kinds, can find secondary structure of new tRNA sequences, and can produce multiple alignments of large sets of tRNA sequences. Our results suggest potential improvements in the alignments of the D- and T-domains in some mitochdondrial tRNAs that cannot be fit into the canonical secondary structure.
引用
收藏
页码:5112 / 5120
页数:9
相关论文
共 59 条
  • [1] RAPID SEARCHES FOR COMPLEX PATTERNS IN BIOLOGICAL MOLECULES
    ABARBANEL, RM
    WIENEKE, PR
    MANSFIELD, E
    JAFFE, DA
    BRUTLAG, DL
    [J]. NUCLEIC ACIDS RESEARCH, 1984, 12 (01) : 263 - 280
  • [2] Baker J. K., 1979, 97 M AC SOC AM, P547
  • [3] PHYLOGENETIC ANALYSIS AND EVOLUTION OF RNASE-P RNA IN PROTEOBACTERIA
    BROWN, JW
    HAAS, ES
    JAMES, BD
    HUNT, DA
    PACE, NR
    [J]. JOURNAL OF BACTERIOLOGY, 1991, 173 (12) : 3855 - 3863
  • [4] CHIU DKY, 1991, COMPUT APPL BIOSCI, V7, P347
  • [5] TURN PREDICTION IN PROTEINS USING A PATTERN-MATCHING APPROACH
    COHEN, FE
    ABARBANEL, RM
    KUNTZ, ID
    FLETTERICK, RJ
    [J]. BIOCHEMISTRY, 1986, 25 (01) : 266 - 275
  • [6] DAHLBERG JE, 1989, METHODS ENZYMOLOGY, V180
  • [7] Doolittle R. F., 1990, METHODS ENZYMOLOGY, V183
  • [8] RNA SEQUENCE-ANALYSIS USING COVARIANCE-MODELS
    EDDY, SR
    DURBIN, R
    [J]. NUCLEIC ACIDS RESEARCH, 1994, 22 (11) : 2079 - 2088
  • [9] ENGELFRIET J, 1991, LECT NOTES COMPUT SC, V532, P12, DOI 10.1007/BFb0017374
  • [10] IDENTIFYING POTENTIAL TRANSFER-RNA GENES IN GENOMIC DNA-SEQUENCES
    FICHANT, GA
    BURKS, C
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1991, 220 (03) : 659 - 671