BEAUTY - AN ENHANCED BLAST-BASED SEARCH TOOL THAT INTEGRATES MULTIPLE BIOLOGICAL INFORMATION RESOURCES INTO SEQUENCE SIMILARITY SEARCH RESULTS

被引:227
作者
WORLEY, KC [1 ]
WIESE, BA [1 ]
SMITH, RF [1 ]
机构
[1] BAYLOR COLL MED, WM KECK CTR COMPUTAT BIOL, HOUSTON, TX 77030 USA
关键词
D O I
10.1101/gr.5.2.173
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
BEAUTY (BLAST enhanced alignment utility) is an enhanced version of the NCBI's BLAST data base search tool that facilitates identification of the functions of matched sequences. We have created new data bases of conserved regions and functional domains For protein sequences in NCBI's Entrez data base, and BEAUTY allows this information to be incorporated directly into BLAST search results. A Conserved Regions Data Base, containing the locations of conserved regions within Entrez protein sequences, was constructed by (1) clustering the entire data base into families, (2) aligning each family using OUT. PIMA multiple alignment program, and (3) scanning the multiple alignments to locate the conserved regions within each aligned sequence. A separate Annotated Domains Data Base was constructed by extracting the locations of all annotated domains and sites from sequences represented in the Entrez PROSITE, BLOCKS, and PRINTS data bases. BEAUTY performs a BLAST search of those Entrez sequences with conserved regions and/or annotated domains. BEAUTY then uses the information from the Conserved Regions and Annotated Domains data bases to generate, for each matched sequence, a schematic display that allows one to directly compare the relative locations of (1) the conserved regions, (2) annotated domains and sites, and (3) the locally aligned regions matched in the BLAST search. In addition, BEAUTY search results include World-Wide Web hypertext links to a number of external data bases that provide a variety of additional types of information on the function of matched sequences. This convenient integration of protein families, conserved regions, annotated domains, alignment displays, and World-Wide Web resources greatly enhances the biological informativeness of sequence similarity searches. BEAUTY searches can be performed remotely on our system using the ''BCM Search Launcher'' World-Wide Web pages (URL is [http://gc.bcm.tmc.edu:8088/search-launcher/launcher.html]).
引用
收藏
页码:173 / 184
页数:12
相关论文
共 26 条
  • [1] ISSUES IN SEARCHING MOLECULAR SEQUENCE DATABASES
    ALTSCHUL, SF
    BOGUSKI, MS
    GISH, W
    WOOTTON, JC
    [J]. NATURE GENETICS, 1994, 6 (02) : 119 - 129
  • [2] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [3] ATTWOOD TK, 1994, NUCLEIC ACIDS RES, V22, P3590
  • [4] PRINTS - A PROTEIN MOTIF FINGERPRINT DATABASE
    ATTWOOD, TK
    BECK, ME
    [J]. PROTEIN ENGINEERING, 1994, 7 (07): : 841 - 848
  • [5] THE SWISS-PROT PROTEIN-SEQUENCE DATA-BANK
    BAIROCH, A
    BOECKMANN, B
    [J]. NUCLEIC ACIDS RESEARCH, 1992, 20 : 2019 - 2022
  • [6] PROSITE - A DICTIONARY OF SITES AND PATTERNS IN PROTEINS
    BAIROCH, A
    [J]. NUCLEIC ACIDS RESEARCH, 1992, 20 : 2013 - 2018
  • [7] GENBANK
    BENSON, D
    LIPMAN, DJ
    OSTELL, J
    [J]. NUCLEIC ACIDS RESEARCH, 1993, 21 (13) : 2963 - 2965
  • [8] INFORMATION ENHANCEMENT METHODS FOR LARGE-SCALE SEQUENCE-ANALYSIS
    CLAVERIE, JM
    STATES, DJ
    [J]. COMPUTERS & CHEMISTRY, 1993, 17 (02): : 191 - 201
  • [9] POSITIONAL CLONING MOVES FROM PERDITIONAL TO TRADITIONAL
    COLLINS, FS
    [J]. NATURE GENETICS, 1995, 9 (04) : 347 - 350
  • [10] EPSTEIN JA, 1994, 2ND EL P WORLD WID W