A general framework for weighted gene co-expression network analysis

被引:4068
作者
Zhang, Bin [1 ]
Horvath, Steve [1 ]
机构
[1] Univ Calif Los Angeles, Dept Human Genet & Biostat, Los Angeles, CA 90024 USA
关键词
scale-free topology; network analysis; clustering coefficient; hierarchical organization; module; topological overlap; microarrays;
D O I
10.2202/1544-6115.1128
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Gene co-expression networks are increasingly used to explore the system-level functionality of genes. The network construction is conceptually straightforward: nodes represent genes and nodes are connected if the corresponding genes are significantly co-expressed across appropriately chosen tissue samples. In reality, it is tricky to define the connections between the nodes in such networks. An important question is whether it is biologically meaningful to encode gene co-expression using binary information (connected=1, unconnected=0). We describe a general framework for 'soft' thresholding that assigns a connection weight to each gene pair. This leads us to define the notion of a weighted gene co-expression network. For soft thresholding we propose several adjacency functions that convert the co-expression measure to a connection weight. For determining the parameters of the adjacency function, we propose a biologically motivated criterion (referred to as the scale-free topology criterion). We generalize the following important network concepts to the case of weighted networks. First, we introduce several node connectivity measures and provide empirical evidence that they can be important for predicting the biological significance of a gene. Second, we provide theoretical and empirical evidence that the 'weighted' topological overlap measure ( used to define gene modules) leads to more cohesive modules than its 'unweighted' counterpart. Third, we generalize the clustering coefficient to weighted networks. Unlike the unweighted clustering coefficient, the weighted clustering coefficient is not inversely related to the connectivity. We provide a model that shows how an inverse relationship between clustering coefficient and connectivity arises from hard thresholding. We apply our methods to simulated data, a cancer microarray data set, and a yeast microarray data set.
引用
收藏
页码:i / 43
页数:45
相关论文
共 33 条
[1]   Topology of evolving networks:: Local events and universality [J].
Albert, R ;
Barabási, AL .
PHYSICAL REVIEW LETTERS, 2000, 85 (24) :5234-5237
[2]   Error and attack tolerance of complex networks [J].
Albert, R ;
Jeong, H ;
Barabási, AL .
NATURE, 2000, 406 (6794) :378-382
[3]   Network biology:: Understanding the cell's functional organization [J].
Barabási, AL ;
Oltvai, ZN .
NATURE REVIEWS GENETICS, 2004, 5 (02) :101-U15
[4]   Scale-free networks [J].
Barabási, AL ;
Bonabeau, E .
SCIENTIFIC AMERICAN, 2003, 288 (05) :60-69
[5]   Emergence of scaling in random networks [J].
Barabási, AL ;
Albert, R .
SCIENCE, 1999, 286 (5439) :509-512
[6]   Similarities and differences in genome-wide expression data of six organisms [J].
Bergmann, S ;
Ihmels, J ;
Barkai, N .
PLOS BIOLOGY, 2004, 2 (01) :85-93
[7]   Innovation - New tools for functional mammalian cancer genetics [J].
Brummelkamp, TR ;
Bernards, R .
NATURE REVIEWS CANCER, 2003, 3 (10) :781-789
[8]  
Butte A J, 2000, Pac Symp Biocomput, P418
[9]   Gene co-expression network topology provides a framework for molecular characterization of cellular state [J].
Carter, SL ;
Brechbühler, CM ;
Griffin, M ;
Bond, AT .
BIOINFORMATICS, 2004, 20 (14) :2242-2250
[10]  
Cox D.R., 1990, ANAL SURVIVAL DATA