Design and analysis of experiments with high throughput biological assay data

被引:51
作者
Rocke, DM [1 ]
机构
[1] Univ Calif Davis, Div Biostat, Davis, CA 95616 USA
关键词
gene expression; mass spectrometry; metabolomics; microarray; NMR spectroscopy;
D O I
10.1016/j.semcdb.2004.09.007
中图分类号
Q2 [细胞生物学];
学科分类号
071009 ; 090102 ;
摘要
The design and analysis of experiments using gene expression microarrays is a topic of considerable current research, and work is beginning to appear on the analysis of proteomics and metabolomics data by mass spectrometry and NMR spectroscopy. The literature in this area is evolving rapidly, and commercial software for analysis of array or proteomics data is rarely up to date, and is essentially nonexistent for metabolomics data. In this paper, I review some of the issues that should concern any biologists planning to use such high-throughput biological assay data in an experimental investigation. Technical details are kept to a minimum, and may be found in the referenced literature, as well as in the many excellent papers which space limitations prevent my describing. There are usually a number of viable options for design and analysis of such experiments, but unfortunately, there are even more non-viable ones that have been used even in the published literature. This is an area in which up-to-date knowledge of the literature is indispensable for efficient and effective design and analysis of these experiments. In general. we concentrate on relatively simple analyses, often focusing on identifying differentially expressed genes and the comparable issues in mass spectrometry and NMR spectroscopy (consistent differences in peak heights or areas for example). Complex multivariate and pattern recognition methods also need much attention, but the issues we describe in this paper must be dealt with first. The literature on analysis of proteomics and metabolomics data is as yet sparse, so the main focus of this paper will be on methods devised for analysis of gene expression data that generalize to proteomics and metabolomics, with some specific comments near the end on analysis of metabolomics data by mass spectrometry and NMR spectroscopy. (C) 2004 Elsevier Ltd. All rights reserved.
引用
收藏
页码:703 / 713
页数:11
相关论文
共 30 条
[11]   Transformation and normalization of oligonucleotide microarray data [J].
Geller, SC ;
Gregg, JP ;
Hagerman, P ;
Rocke, DM .
BIOINFORMATICS, 2003, 19 (14) :1817-1823
[12]   Diagnostics for conformity of paired quantitative measurements [J].
Hawkins, DM .
STATISTICS IN MEDICINE, 2002, 21 (13) :1913-1935
[13]  
Hoch J. C., 1996, NMR DATA PROCESSING
[14]  
HOMES E, 2001, CHEM RES TOXICOL, V14, P182
[15]  
Huber Wolfgang, 2002, Bioinformatics, V18 Suppl 1, pS96
[16]   Summaries of affymetrix GeneChip probe level data [J].
Irizarry, RA ;
Bolstad, BM ;
Collin, F ;
Cope, LM ;
Hobbs, B ;
Speed, TP .
NUCLEIC ACIDS RESEARCH, 2003, 31 (04) :e15
[17]  
KENDZIORSKI CM, 2003, POOLNG BIOL SAMPLES
[18]   Analysis of variance for gene expression microarray data [J].
Kerr, MK ;
Martin, M ;
Churchill, GA .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (06) :819-837
[19]  
MUNSON PA, 2001, GENECHIP DATA
[20]  
MUNSON PA, 2001, GENELOGIC WORKSHOP L