CDD: NCBI's conserved domain database

Cited by: 2643
Authors
Marchler-Bauer, Aron [1]
Derbyshire, Myra K. [1]
Gonzales, Noreen R. [1]
Lu, Shennan [1]
Chitsaz, Farideh [1]
Geer, Lewis Y. [1]
Geer, Renata C. [1]
He, Jane [1]
Gwadz, Marc [1]
Hurwitz, David I. [1]
Lanczycki, Christopher J. [1]
Lu, Fu [1]
Marchler, Gabriele H. [1]
Song, James S. [1]
Thanki, Narmada [1]
Wang, Zhouxi [1]
Yamashita, Roxanne A. [1]
Zhang, Dachuan [1]
Zheng, Chanjuan [1]
Bryant, Stephen H. [1]
Affiliations
[1] NIH, Natl Lib Med, Natl Ctr Biotechnol Informat, Bethesda, MD 20894 USA
Funding
National Institutes of Health (USA)
Keywords
DOI
10.1093/nar/gku1221
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
NCBI's CDD, the Conserved Domain Database, enters its 15th year as a public resource for the annotation of proteins with the location of conserved domain footprints. Going forward, we strive to improve the coverage and consistency of domain annotation provided by CDD. We maintain a live search system as well as an archive of pre-computed domain annotation for sequences tracked in NCBI's Entrez protein database, which can be retrieved for single sequences or in bulk. We also maintain import procedures so that CDD contains domain models and domain definitions provided by several collections available in the public domain, as well as those produced by an in-house curation effort. The curation effort aims at increasing coverage and providing finer-grained classifications of common protein domains, for which a wealth of functional and structural data has become available. CDD curation generates alignment models of representative sequence fragments, which are in agreement with domain boundaries as observed in protein 3D structure, and which model the structurally conserved cores of domain families as well as annotate conserved features.
Pages: D222-D226
Page count: 5
Related papers
13 in total
[1]   Annotation of functional sites with the Conserved Domain Database [J].
Derbyshire, Myra K. ;
Lanczycki, Christopher J. ;
Bryant, Stephen H. ;
Marchler-Bauer, Aron .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2012,
[2]   Pfam: the protein families database [J].
Finn, Robert D. ;
Bateman, Alex ;
Clements, Jody ;
Coggill, Penelope ;
Eberhardt, Ruth Y. ;
Eddy, Sean R. ;
Heger, Andreas ;
Hetherington, Kirstie ;
Holm, Liisa ;
Mistry, Jaina ;
Sonnhammer, Erik L. L. ;
Tate, John ;
Punta, Marco .
NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) :D222-D230
[3]   CDART: Protein homology by domain architecture [J].
Geer, LY ;
Domrachev, M ;
Lipman, DJ ;
Bryant, SH .
GENOME RESEARCH, 2002, 12 (10) :1619-1623
[4]   Novel β-Propeller of the BTB-Kelch Protein Krp1 Provides a Binding Site for Lasp-1 That Is Necessary for Pseudopodial Extension [J].
Gray, Christopher H. ;
McGarry, Lynn C. ;
Spence, Heather J. ;
Riboldi-Tunnicliffe, Alan ;
Ozanne, Bradford W. .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2009, 284 (44) :30498-30507
[5]   TIGRFAMs and Genome Properties in 2013 [J].
Haft, Daniel H. ;
Selengut, Jeremy D. ;
Richter, Roland A. ;
Harkins, Derek ;
Basu, Malay K. ;
Beck, Erin .
NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) :D387-D395
[6]   The National Center for Biotechnology Information's Protein Clusters Database [J].
Klimke, William ;
Agarwala, Richa ;
Badretdin, Azat ;
Chetvernin, Slava ;
Ciufo, Stacy ;
Fedorov, Boris ;
Kiryutin, Boris ;
O'Neill, Kathleen ;
Resch, Wolfgang ;
Resenchuk, Sergei ;
Schafer, Susan ;
Tolstoy, Igor ;
Tatusova, Tatiana .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D216-D223
[7]   SMART: recent updates, new developments and status in 2015 [J].
Letunic, Ivica ;
Doerks, Tobias ;
Bork, Peer .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D257-D260
[8]   MMDB and VAST+: tracking structural similarities between macromolecular complexes [J].
Madej, Thomas ;
Lanczycki, Christopher J. ;
Zhang, Dachuan ;
Thiessen, Paul A. ;
Geer, Renata C. ;
Marchler-Bauer, Aron ;
Bryant, Stephen H. .
NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) :D297-D303
[9]   CDD: a curated Entrez database of conserved domain alignments [J].
Marchler-Bauer, A ;
Anderson, JB ;
DeWeese-Scott, C ;
Fedorova, ND ;
Geer, LY ;
He, SQ ;
Hurwitz, DI ;
Jackson, JD ;
Jacobs, AR ;
Lanczycki, CJ ;
Liebert, CA ;
Liu, CL ;
Madej, T ;
Marchler, GH ;
Mazumder, R ;
Nikolskaya, AN ;
Panchenko, AR ;
Rao, BS ;
Shoemaker, BA ;
Simonyan, V ;
Song, JS ;
Thiessen, PA ;
Vasudevan, S ;
Wang, YL ;
Yamashita, RA ;
Yin, JJ ;
Bryant, SH .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :383-387
[10]   CD-Search: protein domain annotations on the fly [J].
Marchler-Bauer, A ;
Bryant, SH .
NUCLEIC ACIDS RESEARCH, 2004, 32 :W327-W331