The closest BLAST hit is often not the nearest neighbor

Cited by: 353
Authors
Koski, LB [1 ]
Golding, GB [1 ]
Affiliations
[1] McMaster Univ, Dept Biol, Hamilton, ON L8S 4K1, Canada
Keywords
BLAST hits; nearest-neighbor;
DOI
10.1007/s002390010184
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010 ; 081704 ;
Abstract
It is well known that basing phylogenetic reconstructions on uncorrected genetic distances can lead to errors in their reconstruction. Nevertheless, it is often common practice to report simply the most similar BLAST (Altschul et al. 1997) hit in genomic reports that discuss many genes (Ruepp et al. 2000; Freiberg et al. 1997). This is because BLAST hits can provide a rapid, efficient, and concise analysis of many genes at once. These hits are often interpreted to imply that the gene is most closely related to the gene or protein in the databases that returned the closest BLAST hit. Though these two may coincide, for many genes, particularly genes with few homologs, they may not be the same. There are a number of circumstances that can account for such limitations in accuracy (Eisen 2000). We stress here that genes appearing to be the most similar based on BLAST hits are often not each other's closest relative phylogenetically. The extent to which this occurs depends on the availability of close relatives present in the databases. As an example we have chosen the analysis of the genomes of a crenarchaeote species, Aeropyrum pernix, an organism with few close relatives fully sequenced, and Escherichia coli, an organism whose closest relative, Salmonella typhimurium, is completely sequenced.
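The distinction the abstract draws between "most similar" and "closest relative" can be illustrated with a minimal sketch. This is not the authors' analysis: the tree, taxon names, and branch lengths below are invented, and path length along the tree is used as an idealized stand-in for sequence dissimilarity (a real BLAST score is not a tree distance). The assumed mechanism is unequal evolutionary rates, one possible circumstance in which the two notions diverge.

```python
# Hypothetical rooted tree: child -> (parent, branch length).
# All names and lengths are invented for illustration.
TREE = {
    "Q": ("n1", 0.10),    # query gene
    "S": ("n1", 0.60),    # true sister of Q, but fast-evolving (long branch)
    "A": ("n2", 0.10),    # slowly evolving gene in another clade
    "B": ("n2", 0.20),
    "n1": ("root", 0.05),
    "n2": ("root", 0.05),
}
LEAVES = ["Q", "S", "A", "B"]


def path_to_root(node):
    """Return [(ancestor, distance from node)], starting with the node itself."""
    path = [(node, 0.0)]
    dist = 0.0
    while node in TREE:
        parent, length = TREE[node]
        dist += length
        path.append((parent, dist))
        node = parent
    return path


def patristic_distance(a, b):
    """Sum of branch lengths on the path between leaves a and b."""
    up_a = dict(path_to_root(a))
    for ancestor, dist_b in path_to_root(b):
        if ancestor in up_a:  # first shared ancestor on the way up from b
            return up_a[ancestor] + dist_b
    raise ValueError("leaves are not in the same tree")


def most_similar(query):
    """Leaf with the smallest distance to the query (the 'top hit' analogue)."""
    others = [leaf for leaf in LEAVES if leaf != query]
    return min(others, key=lambda leaf: patristic_distance(query, leaf))


def nearest_neighbors(query):
    """Leaves sharing the query's immediate parent clade (its sister group)."""
    parent = TREE[query][0]
    return [
        leaf for leaf in LEAVES
        if leaf != query and any(anc == parent for anc, _ in path_to_root(leaf))
    ]


if __name__ == "__main__":
    print("most similar to Q:", most_similar("Q"))
    print("sister group of Q:", nearest_neighbors("Q"))
```

In this toy tree the leaf closest to Q by distance is A, while the topology places S as Q's sister, so ranking by similarity alone would pick the wrong relative.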
Pages: 540-542
Page count: 3
Related papers
14 items in total
  • [1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [2] A phylogenomic study of DNA repair genes, proteins, and processes
    Eisen, JA
    Hanawalt, PC
    [J]. MUTATION RESEARCH-DNA REPAIR, 1999, 435 (03) : 171 - 213
  • [3] Phylogenomics: Improving functional predictions for uncharacterized genes by evolutionary analysis
    Eisen, JA
    [J]. GENOME RESEARCH, 1998, 8 (03) : 163 - 167
  • [4] Horizontal gene transfer among microbial genomes: new insights from complete genome analysis
    Eisen, JA
    [J]. CURRENT OPINION IN GENETICS & DEVELOPMENT, 2000, 10 (06) : 606 - 611
  • [5] Molecular basis of symbiosis between Rhizobium and legumes
    Freiberg, C
    Fellay, R
    Bairoch, A
    Broughton, WJ
    Rosenthal, A
    Perret, X
    [J]. NATURE, 1997, 387 (6631) : 394 - 401
  • [6] GOLDING GB, 1983, MOL BIOL EVOL, V1, P125
  • [7] Molecular archaeology of the Escherichia coli genome
    Lawrence, JG
    Ochman, H
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (16) : 9413 - 9417
  • [8] Evidence for lateral gene transfer between Archaea and Bacteria from genome sequence of Thermotoga maritima
    Nelson, KE
    Clayton, RA
    Gill, SR
    Gwinn, ML
    Dodson, RJ
    Haft, DH
    Hickey, EK
    Peterson, JD
    Nelson, WC
    Ketchum, KA
    McDonald, L
    Utterback, TR
    Malek, JA
    Linher, KD
    Garrett, MM
    Stewart, AM
    Cotton, MD
    Pratt, MS
    Phillips, CA
    Richardson, D
    Heidelberg, J
    Sutton, GG
    Fleischmann, RD
    Eisen, JA
    White, O
    Salzberg, SL
    Smith, HO
    Venter, JC
    Fraser, CM
    [J]. NATURE, 1999, 399 (6734) : 323 - 329
  • [9] The mosaic nature of the eukaryotic nucleus
    Ribeiro, S
    Golding, GB
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 1998, 15 (07) : 779 - 788
  • [10] The genome sequence of the thermoacidophilic scavenger Thermoplasma acidophilum
    Ruepp, A
    Graml, W
    Santos-Martinez, ML
    Koretke, KK
    Volker, C
    Mewes, HW
    Frishman, D
    Stocker, S
    Lupas, AN
    Baumeister, W
    [J]. NATURE, 2000, 407 (6803) : 508 - 513