Puzzling over orphan enzymes

被引:15
作者
Lespinet, O [1 ]
Labedan, B [1 ]
机构
[1] Univ Paris 11, CNRS, Inst Genet & Microbiol, UMR 8621, F-91405 Orsay, France
关键词
orphan enzyme; EC number; protein sequence; protein function; gene annotation; database exactitude; hidden knowledge;
D O I
10.1007/s00018-005-5520-6
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Despite the current availability of several hundreds of thousands of amino acid sequences, more than 39% of the well-defined enzyme activities (EC numbers) are not associated with any sequence in major public databases. This wide gap separating knowledge of biochemical function and sequence information is found in nearly all classes of enzymes. Thus, there is an urgent need to explore the 1525 orphan enzymes (EC numbers without associated sequences), in order to progressively bridge this unwanted gap. Improving genome annotation could unveil a significant proportion of sequenceless enzymes. Peptide mass mapping and further genome mining would be useful to identify proper sequence for enzymes found in species for which genetic tools are missing. Finally, the whole community must help major public databases to begin addressing the problem of missing or incomplete information.
引用
收藏
页码:517 / 523
页数:7
相关论文
共 35 条
[1]  
[Anonymous], 1998, A History of Molecular Biology
[2]  
[Anonymous], STANFORD ENCY PHILOS
[3]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[4]   The universal protein resource (UniProt) [J].
Bairoch, A ;
Apweiler, R ;
Wu, CH ;
Barker, WC ;
Boeckmann, B ;
Ferro, S ;
Gasteiger, E ;
Huang, HZ ;
Lopez, R ;
Magrane, M ;
Martin, MJ ;
Natale, DA ;
O'Donovan, C ;
Redaschi, N ;
Yeh, LSL .
NUCLEIC ACIDS RESEARCH, 2005, 33 :D154-D159
[5]   Arginine methylation: An emerging regulator of protein function [J].
Bedford, MT ;
Richard, S .
MOLECULAR CELL, 2005, 18 (03) :263-272
[6]   Glyceraldehyde-3-phosphate dehydrogenase, apoptosis and neurodegenerative diseases [J].
Chuang, DM ;
Hough, C ;
Senatorov, VV .
ANNUAL REVIEW OF PHARMACOLOGY AND TOXICOLOGY, 2005, 45 :269-+
[7]   Characterization of an inducible chlorophenol O-methyltransferase from Trichoderma longibrachiatum involved in the formation of chloroanisoles and determination of its role in cork taint of wines [J].
Coque, JJR ;
Alvarez-Rodríguez, ML ;
Larriba, G .
APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2003, 69 (09) :5089-5095
[8]   A unifold, mesofold, and superfold model of protein fold use [J].
Coulson, AFW ;
Moult, J .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2002, 46 (01) :61-71
[9]   Fungal trehalose phosphorylase:: kinetic mechanism, pH-dependence of the reaction and some structural properties of the enzyme from Schizophyllum commune [J].
Eis, C ;
Watkins, M ;
Prohaska, T ;
Nidetzky, B .
BIOCHEMICAL JOURNAL, 2001, 356 (03) :757-767
[10]   Complete genome sequence of the hyperthermophilic archaeon Thermococcus kodakaraensis KOD1 and comparison with Pyrococcus genomes [J].
Fukui, T ;
Atomi, H ;
Kanai, T ;
Matsumi, R ;
Fujiwara, S ;
Imanaka, T .
GENOME RESEARCH, 2005, 15 (03) :352-363