The estimation of statistical parameters for local alignment score distributions

被引:114
作者
Altschul, SF [1 ]
Bundschuh, R
Olsen, R
Hwa, T
机构
[1] Natl Lib Med, Natl Ctr Biotechnol Informat, NIH, Bethesda, MD 20894 USA
[2] Univ Calif San Diego, Dept Phys, La Jolla, CA 92093 USA
基金
英国惠康基金;
关键词
D O I
10.1093/nar/29.2.351
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The distribution of optimal local alignment scores of random sequences plays a vital role in evaluating the statistical significance of sequence alignments, These scores can be well described by an extreme-value distribution. The distribution's parameters depend upon the scoring system employed and the random letter frequencies; in general they cannot be derived analytically, but must be estimated by curve fitting. For obtaining accurate parameter estimates, a form of the recently described 'island' method has several advantages. We describe this method in detail, and use it to investigate the functional dependence of these parameters on finite-length edge effects.
引用
收藏
页码:351 / 361
页数:11
相关论文
共 38 条
  • [1] AMINO-ACID SUBSTITUTION MATRICES FROM AN INFORMATION THEORETIC PERSPECTIVE
    ALTSCHUL, SF
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1991, 219 (03) : 555 - 565
  • [2] ALTSCHUL SF, 1986, B MATH BIOL, V48, P603, DOI 10.1016/S0092-8240(86)90010-8
  • [3] Altschul SF, 1996, METHOD ENZYMOL, V266, P460
  • [4] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [5] ALTSCHUL SF, 1986, B MATH BIOL, V48, P633, DOI 10.1016/S0092-8240(86)90012-1
  • [6] ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
  • [7] [Anonymous], 1994, Ann. Prob
  • [8] AN EXTREME VALUE THEORY FOR SEQUENCE MATCHING
    ARRATIA, R
    GORDON, L
    WATERMAN, M
    [J]. ANNALS OF STATISTICS, 1986, 14 (03) : 971 - 993
  • [9] `A PHASE TRANSITION FOR THE SCORE IN MATCHING RANDOM SEQUENCES ALLOWING DELETIONS
    Arratia, Richard
    Waterman, Michael S.
    [J]. ANNALS OF APPLIED PROBABILITY, 1994, 4 (01) : 200 - 225
  • [10] COLLINS JF, 1988, COMPUT APPL BIOSCI, V4, P67