IDENTIFYING CODING EXONS BY SIMILARITY SEARCH - ALU-DERIVED AND OTHER POTENTIALLY MISLEADING PROTEIN SEQUENCES

Cited by: 23
Author
CLAVERIE, JM
Affiliation
[1] National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda
DOI
10.1016/0888-7543(92)90321-I
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Subject classification codes
071005; 0836; 090102; 100705
Abstract
The search for significant local similarities with known protein sequences is a powerful method for interpreting anonymous cDNA sequences or locating coding exons within genomic DNA sequences at a stage where the average contig size is still very small. The BLASTx program, implemented on the National Center for Biotechnology Information server, allows a sensitive search of all putative translations of a nucleotide query sequence against all known proteins in a matter of seconds. From an analysis of the current databases, I report a set of protein sequences exhibiting high local similarity to Alu repeat or vector sequences. These entries can lead to misleading interpretations of similarity searches. During the course of this study, the protease of a human spumaretrovirus was found to have integrated the 3′ end half of the U2 snRNA. © 1992.
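The "putative translations of a nucleotide query sequence" mentioned in the abstract are conventionally the six reading frames (three per strand). The following is a minimal sketch of that translation step only, assuming the standard genetic code and an unambiguous A/C/G/T query; it is not the BLASTx implementation, it omits the similarity search itself, and the function names are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate complete codons from position 0; unknown codons become 'X'."""
    seq = seq.upper()
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def six_frame_translations(seq):
    """Map frame labels (+1..+3, -1..-3) to conceptual translations."""
    frames = {}
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(s[offset:])
    return frames


if __name__ == "__main__":
    for label, protein in six_frame_translations("ATGGCCTAA").items():
        print(label, protein)
```

Each of the six translations would then be compared against the protein database. A translated Alu repeat or vector fragment in the query can match a contaminated database entry in any of these frames, which is the source of the misleading hits the abstract describes.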
Pages: 838-841
Number of pages: 4