An approach to large scale identification of non-obvious structural similarities between proteins

被引:2
作者
Cherkasov, A [1 ]
Jones, SJM
机构
[1] Univ British Columbia, Fac Med, Dept Med, Div Infect Dis, Vancouver, BC, Canada
[2] British Columbia Canc Agcy, Genome Sci Ctr, Vancouver, BC V5Z 4E6, Canada
关键词
D O I
10.1186/1471-2105-5-61
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: A new sequence independent bioinformatics approach allowing genome-wide search for proteins with similar three dimensional structures has been developed. By utilizing the numerical output of the sequence threading it establishes putative non-obvious structural similarities between proteins. When applied to the testing set of proteins with known three dimensional structures the developed approach was able to recognize structurally similar proteins with high accuracy. Results: The method has been developed to identify pathogenic proteins with low sequence identity and high structural similarity to host analogues. Such protein structure relationships would be hypothesized to arise through convergent evolution or through ancient horizontal gene transfer events, now undetectable using current sequence alignment techniques. The pathogen proteins, which could mimic or interfere with host activities, would represent candidate virulence factors. The developed approach utilizes the numerical outputs from the sequence-structure threading. It identifies the potential structural similarity between a pair of proteins by correlating the threading scores of the corresponding two primary sequences against the library of the standard folds. This approach allowed up to 64% sensitivity and 99.9% specificity in distinguishing protein pairs with high structural similarity. Conclusion: Preliminary results obtained by comparison of the genomes of Homo sapiens and several strains of Chlamydia trachomatis have demonstrated the potential usefulness of the method in the identification of bacterial proteins with known or potential roles in virulence.
引用
收藏
页数:11
相关论文
共 27 条
  • [1] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [2] BATES A, 1996, GENOMES MOL BIOL DRU
  • [3] A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE
    BOWIE, JU
    LUTHY, R
    EISENBERG, D
    [J]. SCIENCE, 1991, 253 (5016) : 164 - 170
  • [4] Evidence that plant-like genes in Chlamydia species reflect an ancestral relationship between Chlamydiaceae, cyanobacteria, and the chloroplast
    Brinkman, FSL
    Blanchard, JL
    Cherkasov, A
    Av-Gay, Y
    Brunham, RC
    Fernandez, RC
    Finlay, BB
    Otto, SP
    Ouellette, BFF
    Keeling, PJ
    Rose, AM
    Hancock, REW
    Jones, SJM
    [J]. GENOME RESEARCH, 2002, 12 (08) : 1159 - 1167
  • [5] STATISTICS OF SEQUENCE-STRUCTURE THREADING
    BRYANT, SH
    ALTSCHUL, SF
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 1995, 5 (02) : 236 - 244
  • [6] The role of perspective in defining economic measures for the evaluation of medical technology
    Davidoff, AJ
    Powe, NR
    [J]. INTERNATIONAL JOURNAL OF TECHNOLOGY ASSESSMENT IN HEALTH CARE, 1996, 12 (01) : 9 - 21
  • [7] Multiple sequence information for threading algorithms
    Defay, TR
    Cohen, FE
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1996, 262 (02) : 314 - 323
  • [8] MUTANTS OF SALMONELLA-TYPHIMURIUM THAT CANNOT SURVIVE WITHIN THE MACROPHAGE ARE AVIRULENT
    FIELDS, PI
    SWANSON, RV
    HAIDARIS, CG
    HEFFRON, F
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1986, 83 (14) : 5189 - 5193
  • [9] INVASION AND REPLICATION OF SALMONELLA-TYPHIMURIUM IN ANIMAL-CELLS
    GAHRING, LC
    HEFFRON, F
    FINLAY, BB
    FALKOW, S
    [J]. INFECTION AND IMMUNITY, 1990, 58 (02) : 443 - 448
  • [10] SEQUENCE STRUCTURE MATCHING IN GLOBULAR-PROTEINS - APPLICATION TO SUPERSECONDARY AND TERTIARY STRUCTURE DETERMINATION
    GODZIK, A
    SKOLNICK, J
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (24) : 12098 - 12102