Making sense of microarray data distributions

被引:111
作者
Hoyle, DC
Rattray, M
Jupp, R
Brass, A
机构
[1] Univ Manchester, Sch Biol Sci, Manchester M13 9PT, Lancs, England
[2] Univ Manchester, Dept Comp Sci, Manchester M13 9PL, Lancs, England
[3] Aventis Pharmaceut, Bridgewater, NJ 08807 USA
关键词
D O I
10.1093/bioinformatics/18.4.576
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Typical analysis of microarray data has focused on spot by spot comparisons within a single organism. Less analysis has been done on the comparison of the entire distribution of spot intensities between experiments and between organisms. Results: Here we show that mRNA transcription data from a wide range of organisms and measured with a range of experimental platforms show close agreement with Benford's law (Benford, Proc. Am. Phil. Soc., 78, 551-572, 1938) and Zipf's law (Zipf, The Psycho-biology of Language: an Introduction to Dynamic Philology, 1936 and Human Behaviour and the Principle of Least Effort, 1949). The distribution of the bulk of microarray spot intensities is well approximated by a log-normal with the tail of the distribution being closer to power law. The variance, c 2, of log spot intensity shows a positive correlation with genome size (in terms of number of genes) and is therefore relatively fixed within some range for a given organism. The measured value of g 2 can be significantly smaller than the expected value if the mRNA is extracted from a sample of mixed cell types. Our research demonstrates that useful biological findings may result from analyzing microarray data at the level of entire intensity distributions. Contact: david.c.hoyle@man.ac.uk.
引用
收藏
页码:576 / 584
页数:9
相关论文
共 31 条
[1]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[2]  
[Anonymous], 1949, Human behaviour and the principle of least-effort
[3]  
Benford F., 1938, P AM PHILOS SOC, P551, DOI DOI 10.2307/984802
[4]   Apoptosis signals in atopy and asthma measured with cDNA arrays [J].
Brutsche, MH ;
Brutsche, IC ;
Wood, P ;
Brass, A ;
Morrison, N ;
Rattay, M ;
Mogulkoc, N ;
Simler, N ;
Craven, M ;
Custovic, A ;
Egan, JJG ;
Woodcock, A .
CLINICAL AND EXPERIMENTAL IMMUNOLOGY, 2001, 123 (02) :181-187
[5]   Exploring the metabolic and genetic control of gene expression on a genomic scale [J].
DeRisi, JL ;
Iyer, VR ;
Brown, PO .
SCIENCE, 1997, 278 (5338) :680-686
[6]   Large-scale identification of secreted and membrane-associated gene products using DNA microarrays [J].
Diehn, M ;
Eisen, MB ;
Botstein, D ;
Brown, PO .
NATURE GENETICS, 2000, 25 (01) :58-62
[7]  
Fenton L. F., 1960, IRE T COMMUN SYST, V8, P57, DOI DOI 10.1109/TCOM.1960.1097606
[8]   Genomic expression programs in the response of yeast cells to environmental changes [J].
Gasch, AP ;
Spellman, PT ;
Kao, CM ;
Carmel-Harel, O ;
Eisen, MB ;
Storz, G ;
Botstein, D ;
Brown, PO .
MOLECULAR BIOLOGY OF THE CELL, 2000, 11 (12) :4241-4257
[9]   Hypoxia-induced gene expression profiling in the euryoxic fish Gillichthys mirabilis [J].
Gracey, AY ;
Troll, JV ;
Somero, GN .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (04) :1993-1998
[10]   A statistical derivation of the significant-digit law [J].
Hill, TP .
STATISTICAL SCIENCE, 1995, 10 (04) :354-363