Gene expression data: The technology and statistical analysis

被引:30
作者
Craig, BA [1 ]
Black, MA
Doerge, RW
机构
[1] Purdue Univ, Dept Stat, W Lafayette, IN 47907 USA
[2] Univ Auckland, Dept Stat, Auckland 1, New Zealand
[3] Purdue Univ, Dept Agron, W Lafayette, IN 47907 USA
[4] Purdue Univ, Fac Computat Genom, W Lafayette, IN 47907 USA
关键词
hypothesis testing; linear models; microarrays; multiple comparisons; statistical genetics;
D O I
10.1198/1085711031256
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The desire to view the simultaneous behavior of genes affected by a stimulus at the total genome level has brought the scientific world to a new place in history. It is now commonplace to have an experiment that investigates the expression of thousands of genes across treatments and time points. Biologists are quickly understanding that in order to make sense of these data and the variation that is inherent in the experimental process, statistical models need to be employed. This article presents important aspects of the two most common microarray technologies, the spotted array and the oligonucleotide array, for the purpose of identifying common and unique features of each technology and the data produced. Statistical models are suggested, and the statistical literature reviewed, in an attempt to bring some level of simplicity to the daunting task of analyzing these data. We include two examples, each based upon one of the different technologies, suggest a statistical model, and present the results of the analyses in hopes of providing both encouragement and guidance to readers wanting to become more involved in this exciting field known as genomics.
引用
收藏
页码:1 / 28
页数:28
相关论文
共 58 条