ISSUES IN SEARCHING MOLECULAR SEQUENCE DATABASES

Cited by: 616
Authors
ALTSCHUL, SF [1]
BOGUSKI, MS [1]
GISH, W [1]
WOOTTON, JC [1]
Affiliation
[1] NIH, NATL LIB MED, NATL CTR BIOTECHNOL INFORMAT, BETHESDA, MD 20894
DOI
10.1038/ng0294-119
Chinese Library Classification (CLC) number
Q3 [Genetics];
Subject classification codes
071007 ; 090102 ;
Abstract
Sequence similarity search programs are versatile tools for the molecular biologist, frequently able to identify possible DNA coding regions and to provide clues to gene and protein structure and function. While much attention has been paid to the precise algorithms these programs employ and to their relative speeds, there is a constellation of associated issues that are equally important to realize the full potential of these methods. Here, we consider a number of these issues, including the choice of scoring systems, the statistical significance of alignments, the masking of uninformative or potentially confounding sequence regions, the nature and extent of sequence redundancy in the databases and network access to similarity search services.
Pages: 119-129
Page count: 11
Related papers
90 in total
[1]   COMPLEMENTARY-DNA SEQUENCING - EXPRESSED SEQUENCE TAGS AND HUMAN GENOME PROJECT [J].
ADAMS, MD ;
KELLEY, JM ;
GOCAYNE, JD ;
DUBNICK, M ;
POLYMEROPOULOS, MH ;
XIAO, H ;
MERRIL, CR ;
WU, A ;
OLDE, B ;
MORENO, RF ;
KERLAVAGE, AR ;
MCCOMBIE, WR ;
VENTER, JC .
SCIENCE, 1991, 252 (5013) :1651-1656
[2]   AMINO-ACID SUBSTITUTION MATRICES FROM AN INFORMATION THEORETIC PERSPECTIVE [J].
ALTSCHUL, SF .
JOURNAL OF MOLECULAR BIOLOGY, 1991, 219 (03) :555-565
[3]   A NONLINEAR MEASURE OF SUBALIGNMENT SIMILARITY AND ITS SIGNIFICANCE LEVELS [J].
ALTSCHUL, SF ;
ERICKSON, BW .
BULLETIN OF MATHEMATICAL BIOLOGY, 1986, 48 (5-6) :617-632
[4]  
ALTSCHUL SF, 1986, B MATH BIOL, V48, P603, DOI 10.1016/S0092-8240(86)90010-8
[5]   PROTEIN DATABASE SEARCHES FOR MULTIPLE ALIGNMENTS [J].
ALTSCHUL, SF ;
LIPMAN, DJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1990, 87 (14) :5509-5513
[6]   A PROTEIN ALIGNMENT SCORING SYSTEM SENSITIVE AT ALL EVOLUTIONARY DISTANCES [J].
ALTSCHUL, SF .
JOURNAL OF MOLECULAR EVOLUTION, 1993, 36 (03) :290-300
[7]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[8]   THE NUCLEOSOMAL CORE HISTONE OCTAMER AT 3.1-A RESOLUTION - A TRIPARTITE PROTEIN ASSEMBLY AND A LEFT-HANDED SUPERHELIX [J].
ARENTS, G ;
BURLINGAME, RW ;
WANG, BC ;
LOVE, WE ;
MOUDRIANAKIS, EN .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1991, 88 (22) :10148-10152
[9]   A SENSITIVE PROCEDURE TO COMPARE AMINO-ACID-SEQUENCES [J].
ARGOS, P .
JOURNAL OF MOLECULAR BIOLOGY, 1987, 193 (02) :385-396
[10]   THE ERDOS-RENYI STRONG LAW FOR PATTERN-MATCHING WITH A GIVEN PROPORTION OF MISMATCHES [J].
ARRATIA, R ;
WATERMAN, MS .
ANNALS OF PROBABILITY, 1989, 17 (03) :1152-1169