Mining fatty acid databases for detection of novel compounds in aerobic bacteria

被引:27
作者
Dawyndt, Peter
Vancanneyt, Marc
Snauwaert, Cindy
De Baets, Bernard
De Meyer, Hans
Swings, Jean
机构
[1] Univ Ghent, Microbiol Lab, Dept Biochem Physiol & Microbiol, B-9000 Ghent, Belgium
[2] Univ Ghent, Dept Appl Math Biometr & Proc Control, B-9000 Ghent, Belgium
[3] Univ Ghent, BCCM, LMG Bacteria Collect, B-9000 Ghent, Belgium
[4] Univ Ghent, Dept Appl Math & Comp Sci, B-9000 Ghent, Belgium
关键词
bacterial identification; data mining; fatty acid analysis; feature extraction; knowledge discovery in databases;
D O I
10.1016/j.mimet.2006.01.008
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
This study examines how the discriminatory power of an automated bacterial whole-cell fatty acid identification system can be significantly enhanced by exploring the vast amounts of information accumulated during 15years of routine gas chromatographic analysis of the fatty acid content of aerobic bacteria. Construction of a global peak occurrence histogram based upon a large fatty acid database is shown to serve as a highly informative tool for assessing the delineation of the naming windows used during the automatic recognition of fatty acid compounds. Along the lines of this data mining application, it is suggested that several naming windows of the Sherlock MIS TSBA50 peak naming method may need to be re-evaluated in order to fit more closely with the bulk of observed fatty acid profiles. At the same time, the global peak occurrence histogram has put forward the delineation of 32 new peak naming windows, accounting for a 26% increase in the total number of fatty acid features taken into account for bacterial identification. By scrutinizing the relationships between the newly delineated naming windows and the many taxonomic units covered within a proprietary fatty acid database, all new naming windows were proven to correspond with stable features of some specific groups of microorganisms. This latter analysis clearly underscores the impact of incorporating the new fatty acid compounds for improving the resolution of the bacterial identification system and endorses the applicability of knowledge discovery in databases within the field of microbiology. (c) 2006 Elsevier B.V All rights reserved.
引用
收藏
页码:410 / 433
页数:24
相关论文
共 39 条
[1]   CLASSIFICATION OF MICROORGANISMS BY ANALYSIS OF CHEMICAL COMPOSITION .1. FEASIBILITY OF UTILIZING GAS CHROMATOGRAPHY [J].
ABEL, K ;
DESCHMER.H ;
PETERSON, JI .
JOURNAL OF BACTERIOLOGY, 1963, 85 (05) :1039-&
[2]  
[Anonymous], 1990, USFCC NEWSL
[3]   Cutting a gordian knot: Emended classification and description of the genus Flavobacterium, emended description of the family Flavobacteriaceae, and proposal of Flavobacterium hydatis nom nov (basonym, Cytophaga aquatilis Strohl and Tait 1978) [J].
Bernardet, JF ;
Segers, P ;
Vancanneyt, M ;
Berthe, F ;
Kersters, K ;
Vandamme, P .
INTERNATIONAL JOURNAL OF SYSTEMATIC BACTERIOLOGY, 1996, 46 (01) :128-148
[4]  
Date C.J., 2003, Introduction to database systems, Veigth
[5]   Knowledge accumulation and resolution of data inconsistencies during the integration of microbial information sources [J].
Dawyndt, P ;
Vancanneyt, M ;
De Meyer, H ;
Swings, J .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (08) :1111-1126
[6]  
GARRITY GM, 2004, BERGEYS MANUAL SYSTE, DOI DOI 10.1007/BERGEYSOUTLINE200405.
[7]   Data cube: A relational aggregation operator generalizing group-by, cross-tab, and sub-totals [J].
Gray, J ;
Chaudhuri, S ;
Bosworth, A ;
Layman, A ;
Reichart, D ;
Venkatrao, M ;
Pellow, F ;
Pirahesh, H .
DATA MINING AND KNOWLEDGE DISCOVERY, 1997, 1 (01) :29-53
[8]  
Heyrman J, 2002, INT J SYST EVOL MICR, V52, P1641, DOI 10.1099/00207713-52-5-1641
[9]   The use of fatty acid methyl ester analysis (FAME) for the identification of heterotrophic bacteria present on three mural paintings showing severe damage by microorganisms [J].
Heyrman, J ;
Mergaert, J ;
Denys, R ;
Swings, J .
FEMS MICROBIOLOGY LETTERS, 1999, 181 (01) :55-62
[10]  
HUYS G, 1993, MED MICROBIOL LETT, V2, P248