Protein domain analysis in the era of complete genomes

被引:49
作者
Copley, RR
Doerks, T
Letunic, I
Bork, P
机构
[1] European Mol Biol Lab, D-69012 Heidelberg, Germany
[2] Max Delbruck Ctr Mol Med, Berlin, Germany
关键词
protein domains; genome analysis; evolution; sequence analysis;
D O I
10.1016/S0014-5793(01)03289-6
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Domains present one of the most useful levels at which to understand protein function, and domain family-based analysis has had a profound impact on the study of individual proteins. Protein domain discovery has been progressing steadily over the past 30 years. What are the realistically achievable goals of sequence-based domain analysis, and how far off are they for the sequences encoded in eukaryotic genomes? Here we address some of the issues involved in better coverage of sequence-based domain annotation, and the integration of these results within the wider context of genomes, structures and function. (C) 2002 Federation of European Biochemical Societies. Published by Elsevier Science B.V. All rights reserved.
引用
收藏
页码:129 / 134
页数:6
相关论文
共 43 条
  • [1] Automated structure-based prediction of functional sites in proteins: Applications to assessing the validity of inheriting protein function from homology in genome annotation and to protein docking
    Aloy, P
    Querol, E
    Aviles, FX
    Sternberg, MJE
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2001, 311 (02) : 395 - 408
  • [2] Protein repeats: Structures, functions, and evolution
    Andrade, MA
    Perez-Iratxeta, C
    Ponting, CP
    [J]. JOURNAL OF STRUCTURAL BIOLOGY, 2001, 134 (2-3) : 117 - 131
  • [3] The InterPro database, an integrated documentation resource for protein families, domains and functional sites
    Apweiler, R
    Attwood, TK
    Bairoch, A
    Bateman, A
    Birney, E
    Biswas, M
    Bucher, P
    Cerutti, T
    Corpet, F
    Croning, MDR
    Durbin, R
    Falquet, L
    Fleischmann, W
    Gouzy, J
    Hermjakob, H
    Hulo, N
    Jonassen, I
    Kahn, D
    Kanapin, A
    Karavidopoulou, Y
    Lopez, R
    Marx, B
    Mulder, NJ
    Oinn, TM
    Pagni, M
    Servant, F
    Sigrist, CJA
    Zdobnov, EM
    [J]. NUCLEIC ACIDS RESEARCH, 2001, 29 (01) : 37 - 40
  • [4] Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkh121, 10.1093/nar/gkr1065]
  • [5] PROTEINS - 1000 FAMILIES FOR THE MOLECULAR BIOLOGIST
    CHOTHIA, C
    [J]. NATURE, 1992, 357 (6379) : 543 - 544
  • [6] ProDom and ProDom-CG: tools for protein domain analysis and whole genome comparisons
    Corpet, F
    Servant, F
    Gouzy, J
    Kahn, D
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 267 - 269
  • [7] DOERKS T, 2002, IN PRESS GENOME RES
  • [8] GeneRAGE: a robust algorithm for sequence clustering and domain detection
    Enright, AJ
    Ouzounis, CA
    [J]. BIOINFORMATICS, 2000, 16 (05) : 451 - 457
  • [9] Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure
    Gough, J
    Karplus, K
    Hughey, R
    Chothia, C
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2001, 313 (04) : 903 - 919
  • [10] Whole genome protein domain analysis using a new method for domain clustering
    Gouzy, J
    Corpet, F
    Kahn, D
    [J]. COMPUTERS & CHEMISTRY, 1999, 23 (3-4): : 333 - 340