The Pfam protein families database

被引:76
作者
Bateman, A
Birney, E
Durbin, R
Eddy, SR
Howe, KL
Sonnhammer, ELL
机构
[1] Sanger Ctr, Cambridge CB10 1SA, England
[2] Washington Univ, Sch Med, Dept Genet, St Louis, MO 63110 USA
[3] Karolinska Inst, Ctr Genom Res, S-17177 Stockholm, Sweden
关键词
D O I
10.1093/nar/28.1.263
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Pfam is a large collection of protein multiple sequence alignments and profile hidden Markov models. Pfam is available on the WWW in the UK at http://www.sanger.ac.uk/Software/Pfam/, in Sweden at http://www.cgr.ki.se/Pfam/ and in the US at http:// pfam.wustl.edu/. The latest version (4.3) of Pfam contains 1815 families. These Pfam families match 63% of proteins in SWISS-PROT 37 and TrEMBL 9, For complete genomes Pfam currently matches up to half of the proteins. Genomic DNA can be directly searched against the Pfam library using the Wise2 package.
引用
收藏
页码:263 / 266
页数:4
相关论文
共 14 条
  • [1] ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
  • [2] Pfam 3.1: 1313 multiple alignments and profile HMMs match the majority of proteins
    Bateman, A
    Birney, E
    Durbin, R
    Eddy, SR
    Finn, RD
    Sonnhammer, ELL
    [J]. NUCLEIC ACIDS RESEARCH, 1999, 27 (01) : 260 - 262
  • [3] PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES
    BERNSTEIN, FC
    KOETZLE, TF
    WILLIAMS, GJB
    MEYER, EF
    BRICE, MD
    RODGERS, JR
    KENNARD, O
    SHIMANOUCHI, T
    TASUMI, M
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) : 535 - 542
  • [4] BIRNEY E, 1997, ISMB, V5, P56
  • [5] Genome sequence of the nematode C-elegans:: A platform for investigating biology
    不详
    [J]. SCIENCE, 1998, 282 (5396) : 2012 - 2018
  • [6] ProDom and ProDom-CG: tools for protein domain analysis and whole genome comparisons
    Corpet, F
    Servant, F
    Gouzy, J
    Kahn, D
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 267 - 269
  • [7] Recent improvements of the ProDom database of protein domain families
    Corpet, F
    Gouzy, J
    Kahn, D
    [J]. NUCLEIC ACIDS RESEARCH, 1999, 27 (01) : 263 - 267
  • [8] Galperin M Y, 1998, In Silico Biol, V1, P55
  • [9] DICTIONARY OF PROTEIN SECONDARY STRUCTURE - PATTERN-RECOGNITION OF HYDROGEN-BONDED AND GEOMETRICAL FEATURES
    KABSCH, W
    SANDER, C
    [J]. BIOPOLYMERS, 1983, 22 (12) : 2577 - 2637
  • [10] RASMOL - BIOMOLECULAR GRAPHICS FOR ALL
    SAYLE, RA
    MILNERWHITE, EJ
    [J]. TRENDS IN BIOCHEMICAL SCIENCES, 1995, 20 (09) : 374 - 376