A metabolome pipeline: from concept to data to knowledge

被引:115
作者
Brown, Marie [1 ]
Dunn, Warwick B. [1 ]
Ellis, David I. [1 ]
Goodacre, Royston [1 ]
Handl, Julia [1 ]
Knowles, Joshua D. [1 ]
O'Hagan, Steve [1 ]
Spasic, Irena [1 ]
Kell, Douglas B. [1 ]
机构
[1] Univ Manchester, Sch Chem, Manchester M60 1QD, Lancs, England
基金
英国生物技术与生命科学研究理事会; 英国工程与自然科学研究理事会;
关键词
metabolomics; chemometrics; data processing; databases; machine learning; genetic algorithms; genetic programming; evolutionary computing;
D O I
10.1007/s11306-005-1106-4
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Metabolomics, like other omics methods, produces huge datasets of biological variables, often accompanied by the necessary metadata. However, regardless of the form in which these are produced they are merely the ground substance for assisting us in answering biological questions. In this short tutorial review and position paper we seek to set out some of the elements of "best practice" in the optimal acquisition of such data, and in the means by which they may be turned into reliable knowledge. Many of these steps involve the solution of what amount to combinatorial optimization problems, and methods developed for these, especially those based on evolutionary computing, are proving valuable. This is done in terms of a "pipeline" that goes from the design of good experiments, through instrumental optimization, data storage and manipulation, the chemometric data processing methods in common use, and the necessary means of validation and cross-validation for giving conclusions that are credible and likely to be robust when applied in comparable circumstances to samples not used in their generation.
引用
收藏
页码:39 / 51
页数:13
相关论文
共 139 条
[1]   XML, bioinformatics and data integration [J].
Achard, F ;
Vaysseix, G ;
Barillot, E .
BIOINFORMATICS, 2001, 17 (02) :115-125
[2]  
Aharoni Asaph, 2002, OMICS A Journal of Integrative Biology, V6, P217, DOI 10.1089/15362310260256882
[3]   High-throughput classification of yeast mutants for functional genomics using metabolic footprinting [J].
Allen, J ;
Davey, HM ;
Broadhurst, D ;
Heald, JK ;
Rowland, JJ ;
Oliver, SG ;
Kell, DB .
NATURE BIOTECHNOLOGY, 2003, 21 (06) :692-696
[4]   Discrimination of modes of action of antifungal substances by use of metabolic footprinting [J].
Allen, J ;
Davey, HM ;
Broadhurst, D ;
Rowland, JJ ;
Oliver, SG ;
Kell, DB .
APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2004, 70 (10) :6157-6165
[5]  
[Anonymous], 2000, SOLVE IT MODERN HEUR
[6]  
[Anonymous], 1998, Genetic programming: an introduction
[7]  
[Anonymous], 1995, DATA ANAL CHEM
[8]  
[Anonymous], 1988, Multivariate statistics: A practical approach
[9]  
[Anonymous], 2000, Bayesian theory
[10]  
[Anonymous], 1982, CASE CONTROL STUDIES