Replica model for an unusual directed polymer in 1+1 dimensions and prediction of the extremal parameter of gapped sequence alignment statistics

Cited by: 3
Authors
Yu, YK [1]
Affiliations
[1] NIH, Natl Ctr Biotechnol Informat, Natl Lib Med, Bethesda, MD 20894 USA
[2] Florida Atlantic Univ, Dept Phys, Boca Raton, FL 33431 USA
Funding
National Science Foundation (USA)
Keywords
DOI
10.1103/PhysRevE.69.061904
Chinese Library Classification number
O35 [Fluid mechanics]; O53 [Plasma physics]
Discipline classification codes
070204; 080103; 080704
Abstract
Sequence alignment is one of the most important bioinformatics tools for modern molecular biology. The statistical characterization of gapped alignment scores has been a long-standing problem in sequence alignment research. In this paper, we provide a self-contained exposition of sequence alignment, a short review about how this problem is related to the directed polymer problem in statistical physics, and some analytical results that can be used for predicting alignment score statistics. Basically, we present two classes of solutions for the gapped alignment statistics by explicitly calculating the evolution of the few-replica partition function in 1+1 dimensions. We have obtained the conditions under which the more important extremal parameter lambda, characterizing the alignment score statistics, becomes predictable.
Pages: 061904 / 1
Page count: 31