A Top-Performing Algorithm for the DREAM3 Gene Expression Prediction Challenge

Cited by: 6
Authors
Ruan, Jianhua [1]
Affiliations
[1] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX USA
Source
PLOS ONE | 2010 / Vol. 5 / Issue 02
Funding
National Institutes of Health (USA);
Keywords
NETWORKS;
DOI
10.1371/journal.pone.0008944
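Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification codes
07 ; 0710 ; 09 ;
Abstract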
A wealth of computational methods has been developed to address problems in systems biology, such as modeling gene expression. However, to objectively evaluate and compare such methods is notoriously difficult. The DREAM (Dialogue on Reverse Engineering Assessments and Methods) project is a community-wide effort to assess the relative strengths and weaknesses of different computational methods for a set of core problems in systems biology. This article presents a top-performing algorithm for one of the challenge problems in the third annual DREAM (DREAM3), namely the gene expression prediction challenge. In this challenge, participants are asked to predict the expression levels of a small set of genes in a yeast deletion strain, given the expression levels of all other genes in the same strain and complete gene expression data for several other yeast strains. I propose a simple k-nearest-neighbor (KNN) method to solve this problem. Despite its simplicity, this method works well for this challenge, sharing the "top performer" honor with a much more sophisticated method. I also describe several alternative, simple strategies, including a modified KNN algorithm that further improves the performance of the standard KNN method. The success of these methods suggests that complex methods attempting to integrate multiple data sets do not necessarily lead to better performance than simple yet robust methods. Furthermore, none of these top-performing methods, including the one by a different team, are based on gene regulatory networks, which seems to suggest that accurately modeling gene expression using gene regulatory networks is unfortunately still a difficult task.
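The KNN idea in the abstract can be illustrated with a minimal sketch. This is not the author's published procedure: the function name `knn_predict`, the choice of Euclidean distance, the plain averaging of neighbor values, and the toy data are all assumptions made for illustration. For each gene to be predicted, the sketch finds the k genes whose expression profiles across the other strains are closest, then averages those neighbors' measured values in the target strain.

```python
import math


def knn_predict(profiles, target_known, target_genes, k=3):
    """Predict target-strain expression for unmeasured genes (illustrative sketch).

    profiles: dict mapping gene -> list of expression values across reference strains
    target_known: dict mapping gene -> measured expression in the target strain
    target_genes: genes whose target-strain expression is to be predicted
    k: number of nearest neighbor genes to average (assumed parameter)
    """
    if not target_known:
        raise ValueError("need at least one measured gene in the target strain")
    predictions = {}
    for gene in target_genes:
        # Distance between this gene and every measured gene,
        # computed on the reference-strain profiles only.
        dists = sorted(
            (math.dist(profiles[gene], profiles[other]), other)
            for other in target_known
            if other != gene
        )
        neighbors = dists[:k]
        # Average the neighbors' measured values in the target strain.
        predictions[gene] = sum(target_known[g] for _, g in neighbors) / len(neighbors)
    return predictions


# Hypothetical toy data: three measured genes and one gene to predict.
toy_profiles = {
    "g1": [1.0, 2.0, 3.0],
    "g2": [1.1, 2.1, 2.9],
    "g3": [5.0, 5.0, 5.0],
    "gx": [1.0, 2.0, 3.1],
}
toy_known = {"g1": 2.0, "g2": 2.2, "g3": 9.0}
toy_result = knn_predict(toy_profiles, toy_known, ["gx"], k=2)
print(toy_result)
```

In the toy data, `gx` resembles `g1` and `g2` far more than `g3`, so with `k=2` the outlying value of `g3` does not influence the prediction.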
Pages: 8
Related papers
25 in total
  • [1] How to infer gene networks from expression profiles
    Bansal, Mukesh
    Belcastro, Vincenzo
    Ambesi-Impiombato, Alberto
    di Bernardo, Diego
    [J]. MOLECULAR SYSTEMS BIOLOGY, 2007, 3 (1)
  • [2] Predicting gene expression from sequence
    Beer, MA
    Tavazoie, S
    [J]. CELL, 2004, 117 (02) : 185 - 198
  • [3] Connectivity of the mutual k-nearest-neighbor graph in clustering and outlier detection
    Brito, MR
    Chavez, EL
    Quiroz, AJ
    Yukich, JE
    [J]. STATISTICS & PROBABILITY LETTERS, 1997, 35 (01) : 33 - 42
  • [4] Advanced computing for systems biology
    Burrage, Kevin
    Hood, Lindsay
    Ragan, Mark A.
    [J]. BRIEFINGS IN BIOINFORMATICS, 2006, 7 (04) : 390 - 398
  • [5] Regulatory element detection using correlation with expression
    Bussemaker, HJ
    Li, H
    Siggia, ED
    [J]. NATURE GENETICS, 2001, 27 (02) : 167 - 171
  • [6] From in vivo to in silico biology and back
    Di Ventura, Barbara
    Lemerle, Caroline
    Michalodimitrakis, Konstantinos
    Serrano, Luis
    [J]. NATURE, 2006, 443 (7111) : 527 - 533
  • [7] Saccharomyces genome database: Underlying principles and organisation
    Dwight, SS
    Balakrishnan, R
    Christie, KR
    Costanzo, MC
    Dolinski, K
    Engel, SR
    Feierbach, B
    Fisk, DG
    Hirschman, J
    Hong, EL
    Issel-Tarver, L
    Nash, RS
    Sethuraman, A
    Starr, B
    Theesfeld, CL
    Andrada, R
    Binkley, G
    Dong, Q
    Lane, C
    Schroeder, M
    Weng, S
    Botstein, D
    Cherry, JM
    [J]. BRIEFINGS IN BIOINFORMATICS, 2004, 5 (01) : 9 - 22
  • [8] Frazier Zach, 2009, V541, P535, DOI 10.1007/978-1-59745-243-4_23
  • [9] Using Bayesian networks to analyze expression data
    Friedman, N
    Linial, M
    Nachman, I
    Pe'er, D
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (3-4) : 601 - 620
  • [10] Computational methodologies for modelling, analysis and simulation of signalling networks
    Gilbert, David
    Fuss, Hendrik
    Gu, Xu
    Orton, Richard
    Robinson, Steve
    Vyshemirsky, Vladislav
    Kurth, Mary Jo
    Downes, C. Stephen
    Dubitzky, Werner
    [J]. BRIEFINGS IN BIOINFORMATICS, 2006, 7 (04) : 339 - 353