The Carbohydrate-Active EnZymes database (CAZy): an expert resource for Glycogenomics

被引:5103
作者
Cantarel, Brandi L.
Coutinho, Pedro M.
Rancurel, Corinne
Bernard, Thomas
Lombard, Vincent
Henrissat, Bernard [1 ]
机构
[1] Univ Aix Marseille 1, CNRS, UMR6098, F-13288 Marseille, France
关键词
ACID-SEQUENCE SIMILARITIES; GLYCOSYL HYDROLASES; CLASSIFICATION; ANNOTATION; FAMILIES; SYSTEMS;
D O I
10.1093/nar/gkn663
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
070307 [化学生物学]; 071010 [生物化学与分子生物学];
摘要
The Carbohydrate-Active Enzyme (CAZy) database is a knowledge-based resource specialized in the enzymes that build and breakdown complex carbohydrates and glycoconjugates. As of September 2008, the database describes the present knowledge on 113 glycoside hydrolase, 91 glycosyltransferase, 19 polysaccharide lyase, 15 carbohydrate esterase and 52 carbohydrate-binding module families. These families are created based on experimentally characterized proteins and are populated by sequences from public databases with significant similarity. Protein biochemical information is continuously curated based on the available literature and structural information. Over 6400 proteins have assigned EC numbers and 700 proteins have a PDB structure. The classification (i) reflects the structural features of these enzymes better than their sole substrate specificity, (ii) helps to reveal the evolutionary relationships between these enzymes and (iii) provides a convenient framework to understand mechanistic properties. This resource has been available for over 10 years to the scientific community, contributing to information dissemination and providing a transversal nomenclature to glycobiologists. More recently, this resource has been used to improve the quality of functional predictions of a number genome projects by providing expert annotation. The CAZy resource resides at URL:http://www.cazy.org/.
引用
收藏
页码:D233 / D238
页数:6
相关论文
共 19 条
[1]
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]
GenBank: update [J].
Benson, DA ;
Karsch-Mizrachi, I ;
Lipman, DJ ;
Ostell, J ;
Wheeler, DL .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D23-D26
[3]
The distribution and query systems of the RCSB protein data bank [J].
Bourne, PE ;
Addess, KJ ;
Bluhm, WF ;
Chen, L ;
Deshpande, N ;
Feng, ZK ;
Fleri, W ;
Green, R ;
Merino-Ott, JC ;
Townsend-Merino, W ;
Weissig, H ;
Westbrook, J ;
Berman, HM .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D223-D225
[4]
Coutinho PM, 1999, ROY SOC CH, P3
[5]
Couto F.M., 2003, JISBD, P747
[6]
Recent structural insights into the expanding world of carbohydrate-active enzymes [J].
Davies, GJ ;
Gloster, TM ;
Henrissat, B .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2005, 15 (06) :637-645
[7]
Protein annotation: detective work for function prediction [J].
Doerks, T ;
Bairoch, A ;
Bork, P .
TRENDS IN GENETICS, 1998, 14 (06) :248-250
[8]
Eddy S R, 1995, Proc Int Conf Intell Syst Mol Biol, V3, P114
[9]
Poplar carbohydrate-active enzymes.: Gene identification and expression analyses [J].
Geisler-Lee, J ;
Geisler, M ;
Coutinho, PM ;
Segerman, B ;
Nishikubo, N ;
Takahashi, J ;
Aspeborg, H ;
Djerbi, S ;
Master, E ;
Andersson-Gunnerås, S ;
Sundberg, B ;
Karpinski, S ;
Teeri, TT ;
Kleczkowski, LA ;
Henrissat, B ;
Mellerowicz, EJ .
PLANT PHYSIOLOGY, 2006, 140 (03) :946-962
[10]
Modeling the percolation of annotation errors in a database of protein sequences [J].
Gilks, WR ;
Audit, B ;
De Angelis, D ;
Tsoka, S ;
Ouzounis, CA .
BIOINFORMATICS, 2002, 18 (12) :1641-1649