A probabilistic view of gene function

被引:84
作者
Fraser, AG
Marcotte, EM
机构
[1] Wellcome Trust Sanger Inst, Cambridge CB10 1SA, England
[2] Univ Texas, Inst Cellular & Mol Biol, Ctr Syst & Synthet Biol, Austin, TX 78712 USA
关键词
D O I
10.1038/ng1370
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Cells are controlled by the complex and dynamic actions of thousands of genes. With the sequencing of many genomes, the key problem has shifted from identifying genes to knowing what the genes do; we need a framework for expressing that knowledge. Even the most rigorous attempts to construct ontological frameworks describing gene function (e.g., the Gene Ontology project) ultimately rely on manual curation and are thus labor-intensive and subjective. But an alternative exists: the field of functional genomics is piecing together networks of gene interactions, and although these data are currently incomplete and error-prone, they provide a glimpse of a new, probabilistic view of gene function. We outline such a framework, which revolves around a statistical description of gene interactions derived from large, systematically compiled data sets. In this probabilistic view, pleiotropy is implicit, all data have errors and the definition of gene function is an iterative process that ultimately converges on the correct functions. The relationships between the genes are defined by the data, not by hand. Even this comprehensive view fails to capture key aspects of gene function, not least their dynamics in time and space, showing that there are limitations to the model that must ultimately be addressed.
引用
收藏
页码:559 / 564
页数:6
相关论文
共 58 条
  • [1] Ashburner M, 2001, GENOME RES, V11, P1425
  • [2] Gene Ontology: tool for the unification of biology
    Ashburner, M
    Ball, CA
    Blake, JA
    Botstein, D
    Butler, H
    Cherry, JM
    Davis, AP
    Dolinski, K
    Dwight, SS
    Eppig, JT
    Harris, MA
    Hill, DP
    Issel-Tarver, L
    Kasarskis, A
    Lewis, S
    Matese, JC
    Richardson, JE
    Ringwald, M
    Rubin, GM
    Sherlock, G
    [J]. NATURE GENETICS, 2000, 25 (01) : 25 - 29
  • [3] Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkr1065, 10.1093/nar/gkh121]
  • [4] ATP-DEPENDENT RECOGNITION OF EUKARYOTIC ORIGINS OF DNA-REPLICATION BY A MULTIPROTEIN COMPLEX
    BELL, SP
    STILLMAN, B
    [J]. NATURE, 1992, 357 (6374) : 128 - 134
  • [5] UV-damaged DNA-binding protein in the TFTC complex links DNA damage recognition to nucleosome acetylation
    Brand, M
    Moggs, JG
    Oulad-Abdelghani, M
    Lejeune, F
    Dilworth, FJ
    Stevenin, J
    Almouzni, G
    Tora, L
    [J]. EMBO JOURNAL, 2001, 20 (12) : 3187 - 3196
  • [6] Machine learning of functional class from phenotype data
    Clare, A
    King, RD
    [J]. BIOINFORMATICS, 2002, 18 (01) : 160 - 166
  • [7] The Xenopus Cdc6 protein is essential for the initiation of a single round of DNA replication in cell-free extracts
    Coleman, TR
    Carpenter, PB
    Dunphy, WG
    [J]. CELL, 1996, 87 (01) : 53 - 63
  • [8] Discovery of uncharacterized cellular systems by genome-wide analysis of functional linkages
    Date, SV
    Marcotte, EM
    [J]. NATURE BIOTECHNOLOGY, 2003, 21 (09) : 1055 - 1062
  • [9] Protein interactions - Two methods for assessment of the reliability of high throughput observations
    Deane, CM
    Salwinski, L
    Xenarios, I
    Eisenberg, D
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2002, 1 (05) : 349 - 356
  • [10] Architecture of the human origin recognition complex
    Dhar, SK
    Delmolino, L
    Dutta, A
    [J]. JOURNAL OF BIOLOGICAL CHEMISTRY, 2001, 276 (31) : 29067 - 29071