CATH: expanding the horizons of structure-based functional annotations for genome sequences

被引:90
作者
Sillitoe, Ian [1 ]
Dawson, Natalie [1 ]
Lewis, Tony E. [1 ]
Das, Sayoni [1 ]
Lees, Jonathan G. [1 ]
Ashford, Paul [1 ]
Tolulope, Adeyelu [1 ]
Scholes, Harry M. [1 ]
Senatorov, Ilya [1 ]
Bujan, Andra [1 ]
Rodriguez-Conde, Fatima Ceballos [1 ]
Dowling, Benjamin [1 ]
Thornton, Janet [2 ]
Orengo, Christine A. [1 ]
机构
[1] UCL, Struct & Mol Biol, London WC1E 6BT, England
[2] European Bioinformat Inst, Wellcome Trust Genome Campus, Hinxton CB10 1SD, Cambs, England
基金
英国惠康基金; 英国生物技术与生命科学研究理事会;
关键词
PROTEIN FUNCTION; LARGE-SCALE; CLASSIFICATION; EVOLUTION; RESOURCE;
D O I
10.1093/nar/gky1097
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
This article provides an update of the latest data and developments within the CATH protein structure classification database (http://www.cathdb.info). The resource provides two levels of release: CATH-B, a daily snapshot of the latest structural domain boundaries and superfamily assignments, and CATH+, which adds layers of derived data, such as predicted sequence domains, functional annotations and functional clustering (known as Functional Families or FunFams). The most recent CATH+ release (version 4.2) provides a huge update in the coverage of structural data. This release increases the number of fully- classified domains by over 40% (from 308 999 to 434 857 structural domains), corresponding to an almost two- fold increase in sequence data (from 53 million to over 95 million predicted domains) organised into 6119 superfamilies. The coverage of high-resolution, protein PDB chains that contain at least one assigned CATH domain is now 90.2% (increased from 82.3% in the previous release). A number of highly requested features have also been implemented in our web pages: allowing the user to view an alignment between their query sequence and a representative FunFam structure and providing tools that make it easier to view the full structural context (multi-domain architecture) of domains and chains.
引用
收藏
页码:D280 / D284
页数:5
相关论文
共 24 条
  • [21] Mechanism and Catalytic Site Atlas (M-CSA): a database of enzyme reaction mechanisms and active sites
    Ribeiro, Antonio J. M.
    Holliday, Gemma L.
    Furnham, Nicholas
    Tyzack, Jonathan D.
    Ferris, Katherine
    Thornton, Janet M.
    [J]. NUCLEIC ACIDS RESEARCH, 2018, 46 (D1) : D618 - D623
  • [22] New functional families (FunFams) in CATH to improve the mapping of conserved functional sites to 3D structures
    Sillitoe, Ian
    Cuff, Alison L.
    Dessailly, Benoit H.
    Dawson, Natalie L.
    Furnham, Nicholas
    Lee, David
    Lees, Jonathan G.
    Lewis, Tony E.
    Studer, Romain A.
    Rentzsch, Robert
    Yeats, Corin
    Thornton, Janet M.
    Orengo, Christine A.
    [J]. NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) : D490 - D498
  • [23] Tipton K. F., 1992, ENZYME NOMENCLATURE, V223, P1
  • [24] Scoring residue conservation
    Valdar, WSJ
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2002, 48 (02) : 227 - 241