A statistical problem for inference to regulatory structure from associations of gene expression measurements with microarrays

被引:28
作者
Chu, TJ [1 ]
Glymour, C
Scheines, R
Spirtes, P
机构
[1] Carnegie Mellon Univ, Dept Philosophy, Pittsburgh, PA 15213 USA
[2] Univ W Florida, Inst Human & Machine Cognit, Pensacola, FL 32514 USA
基金
美国安德鲁·梅隆基金会;
关键词
D O I
10.1093/bioinformatics/btg011
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: One approach to inferring genetic regulatory structure from microarray measurements of mRNA transcript hybridization is to estimate the associations of gene expression levels measured in repeated samples. The associations may be estimated by correlation coefficients or by conditional frequencies (for discretized measurements) or by some other statistic. Although these procedures have been successfully applied to other areas, their validity when applied to microarray measurements has yet to be tested. Results: This paper describes an elementary statistical difficulty for all such procedures, no matter whether based on Bayesian updating, conditional independence testing, or other machine learning procedures such as simulated annealing or neural net pruning. The difficulty obtains if a number of cells from a common population are aggregated in a measurement of expression levels. Although there are special cases where the conditional associations are preserved under aggregation, in general inference of genetic regulatory structure based on conditional association is unwarranted.
引用
收藏
页码:1147 / 1152
页数:6
相关论文
共 14 条
[1]  
AKutsu T., 1998, SODA'98: Proceedings of the Ninth Annual ACM-SIAM Symposium on Discrete Algorithms, P695
[2]   Genetic network inference: from co-expression clustering to reverse engineering [J].
D'haeseleer, P ;
Liang, SD ;
Somogyi, R .
BIOINFORMATICS, 2000, 16 (08) :707-726
[3]  
D'haeseleer P., 2000, THESIS U NEW MEXICO
[4]  
DANKS D, 2002, P C UNC ART INT 2001
[5]   A genomic regulatory network for development [J].
Davidson, EH ;
Rast, JP ;
Oliveri, P ;
Ransick, A ;
Calestani, C ;
Yuh, CH ;
Minokawa, T ;
Amore, G ;
Hinman, V ;
Arenas-Mena, C ;
Otim, O ;
Brown, CT ;
Livi, CB ;
Lee, PY ;
Revilla, R ;
Rust, AG ;
Pan, ZJ ;
Schilstra, MJ ;
Clarke, PJC ;
Arnone, MI ;
Rowen, L ;
Cameron, RA ;
McClay, DR ;
Hood, L ;
Bolouri, H .
SCIENCE, 2002, 295 (5560) :1669-1678
[6]  
FRIEDMAN N, 2000, RECOMB 2000
[7]  
Hartemink A.J., 2001, THESIS MIT
[8]   Integrated genomic and proteomic analyses of a systematically perturbed metabolic network [J].
Ideker, T ;
Thorsson, V ;
Ranish, JA ;
Christmas, R ;
Buhler, J ;
Eng, JK ;
Bumgarner, R ;
Goodlett, DR ;
Aebersold, R ;
Hood, L .
SCIENCE, 2001, 292 (5518) :929-934
[9]  
Liang S, 1998, Pac Symp Biocomput, P18
[10]  
SCHILSTRA M, 2002, NETBUILDER