Comparative protein structure modeling of genes and genomes

被引:2091
作者
Martí-Renom, MA [1 ]
Stuart, AC [1 ]
Fiser, A [1 ]
Sánchez, R [1 ]
Melo, F [1 ]
Sali, A [1 ]
机构
[1] Rockefeller Univ, Pels Family Ctr Biochem & Struct Biol, Labs Mol Biophys, New York, NY 10021 USA
来源
ANNUAL REVIEW OF BIOPHYSICS AND BIOMOLECULAR STRUCTURE | 2000年 / 29卷
关键词
protein structure prediction; fold assignment; alignment; homology modeling; model evaluation; fully automated modeling; structural genomics;
D O I
10.1146/annurev.biophys.29.1.291
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Comparative modeling predicts the three-dimensional structure of a given protein sequence (target) based primarily on its alignment to one or more proteins of known structure (templates). The prediction process consists of fold assignment, target-template alignment, model building, and model evaluation. The number of protein sequences that can be modeled and the accuracy of the predictions are increasing steadily because of the growth in the number of known protein structures and because of the improvements in the modeling software. Further advances are necessary in recognizing weak sequence-structure similarities, aligning sequences with structures, modeling of rigid body shifts, distortions, loops and side chains, as well as detecting errors in a model. Despite these problems, it is currently possible to model with useful accuracy significant parts of approximately one third of all known protein sequences. The use of individual comparative models in biology is already rewarding and increasingly widespread. A major new challenge for comparative modeling is the integration of it with the torrents of data from genome sequencing projects as well as from functional and structural genomics. In particular, there is a need to develop an automated, rapid, robust, sensitive, and accurate comparative modeling pipeline applicable to whole genomes. Such large-scale modeling is likely to encourage new kinds of applications for the many resulting models, based on their large number and completeness at the level of the family organism, or functional network.
引用
收藏
页码:291 / 325
页数:35
相关论文
共 216 条
  • [1] ABOLA EE, 1987, CRYSTALLOGRAPHIC DAT, P107
  • [2] Alexandrov N. N., 1995, Pacific Symposium on Biocomputing '96, P53
  • [3] Refinement of modelled structures by knowledge-based energy profiles and secondary structure prediction:: Application to the human procarboxypeptidase A2
    Aloy, P
    Mas, JM
    Martí-Renom, MA
    Querol, E
    Avilés, FX
    Oliva, B
    [J]. JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2000, 14 (01) : 83 - 92
  • [4] ISSUES IN SEARCHING MOLECULAR SEQUENCE DATABASES
    ALTSCHUL, SF
    BOGUSKI, MS
    GISH, W
    WOOTTON, JC
    [J]. NATURE GENETICS, 1994, 6 (02) : 119 - 129
  • [5] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [6] ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
  • [7] Sequence alignment in molecular biology
    Apostolico, A
    Giancarlo, R
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 1998, 5 (02) : 173 - 196
  • [8] Homology modelling by distance geometry
    Aszodi, A
    Taylor, WR
    [J]. FOLDING & DESIGN, 1996, 1 (05): : 325 - 334
  • [9] The SWISS-PROT protein sequence data bank and its supplement TrEMBL in 1999
    Bairoch, A
    Apweiler, R
    [J]. NUCLEIC ACIDS RESEARCH, 1999, 27 (01) : 49 - 54
  • [10] KNOWLEDGE-BASED MODEL-BUILDING OF PROTEINS - CONCEPTS AND EXAMPLES
    BAJORATH, J
    STENKAMP, R
    ARUFFO, A
    [J]. PROTEIN SCIENCE, 1993, 2 (11) : 1798 - 1810