Gaining confidence in high-throughput protein interaction networks

Cited by: 316
Authors
Bader, JS
Chaudhuri, A
Rothberg, JM
Chant, J
Affiliations
[1] Johns Hopkins Univ, Dept Biomed Engn, Baltimore, MD 21218 USA
[2] CuraGen Corp, New Haven, CT 06511 USA
DOI
10.1038/nbt924
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
Although genome-scale technologies have benefited from statistical measures of data quality, extracting biologically relevant pathways from high-throughput proteomics data remains a challenge. Here we develop a quantitative method for evaluating proteomics data. We present a logistic regression approach that uses statistical and topological descriptors to predict the biological relevance of protein-protein interactions obtained from high-throughput screens for yeast. Other sources of information, including mRNA expression, genetic interactions and database annotations, are subsequently used to validate the model predictions without bias or cross-pollution. Novel topological statistics show hierarchical organization of the network of high-confidence interactions: protein complex interactions extend one to two links, and genetic interactions represent an even finer scale of organization. Knowledge of the maximum number of links that indicates a significant correlation between protein pairs (correlation distance) enables the integrated analysis of proteomics data with data from genetics and gene expression. The type of analysis presented will be essential for analyzing the growing amount of genomic and proteomics data in model organisms and humans.
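The logistic regression idea in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the two descriptors (how many times a pair was observed, and how many network neighbours the pair shares), the toy training data, and the plain gradient-descent fit. The sketch only shows how a logistic model turns descriptors of an interaction into a confidence score between 0 and 1.

```python
import math


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_logistic(rows, labels, lr=0.1, epochs=5000):
    """Fit weights and bias by batch gradient descent on the log-loss."""
    n = len(rows)
    weights = [0.0] * len(rows[0])
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * len(weights)
        grad_b = 0.0
        for x, y in zip(rows, labels):
            # Prediction error for this example drives the gradient.
            err = sigmoid(bias + sum(w * xi for w, xi in zip(weights, x))) - y
            for j, xi in enumerate(x):
                grad_w[j] += err * xi
            grad_b += err
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias


def confidence(x, weights, bias):
    """Score one interaction: modelled probability that it is a true positive."""
    return sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))


# Hypothetical toy data (not from the paper).
# Each row: [times the pair was observed, neighbours shared by the pair].
# Label 1 = treated as a trusted interaction, 0 = treated as unreliable.
training_rows = [[3, 4], [2, 3], [3, 2], [1, 0], [1, 1], [2, 0]]
training_labels = [1, 1, 1, 0, 0, 0]

weights, bias = fit_logistic(training_rows, training_labels)
strong = confidence([3, 4], weights, bias)
weak = confidence([1, 0], weights, bias)
```

In this toy setting `strong` comes out above 0.5 and `weak` below it. A realistic analysis would fit on a curated training set and validate against independent evidence, which the abstract describes as mRNA expression, genetic interactions and database annotations.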
Pages: 78-85
Page count: 8
Related papers
41 in total
  • [1] Mass spectrometry-based proteomics
    Aebersold, R
    Mann, M
    [J]. NATURE, 2003, 422 (6928) : 198 - 207
  • [2] An automated method for finding molecular complexes in large protein interaction networks
    Bader, GD
    Hogue, CW
    [J]. BMC BIOINFORMATICS, 2003, 4 (1)
  • [3] Analyzing yeast protein-protein interaction data obtained from different sources
    Bader, GD
    Hogue, CWV
    [J]. NATURE BIOTECHNOLOGY, 2002, 20 (10) : 991 - 997
  • [4] Greedily building protein networks with confidence
    Bader, JS
    [J]. BIOINFORMATICS, 2003, 19 (15) : 1869 - 1874
  • [5] Emergence of scaling in random networks
    Barabási, AL
    Albert, R
    [J]. SCIENCE, 1999, 286 (5439) : 509 - 512
  • [6] MAP kinase phosphatase as a locus of flexibility in a mitogen-activated protein kinase signaling network
    Bhalla, US
    Ram, PT
    Iyengar, R
    [J]. SCIENCE, 2002, 297 (5583) : 1018 - 1023
  • [7] A genome-wide transcriptional analysis of the mitotic cell cycle
    Cho, RJ
    Campbell, MJ
    Winzeler, EA
    Steinmetz, L
    Conway, A
    Wodicka, L
    Wolfsberg, TG
    Gabrielian, AE
    Landsman, D
    Lockhart, DJ
    Davis, RW
    [J]. MOLECULAR CELL, 1998, 2 (01) : 65 - 73
  • [8] The transcriptional program of sporulation in budding yeast
    Chu, S
    DeRisi, J
    Eisen, M
    Mulholland, J
    Botstein, D
    Brown, PO
    Herskowitz, I
    [J]. SCIENCE, 1998, 282 (5389) : 699 - 705
  • [9] Protein interactions - Two methods for assessment of the reliability of high throughput observations
    Deane, CM
    Salwinski, L
    Xenarios, I
    Eisenberg, D
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2002, 1 (05) : 349 - 356
  • [10] Cluster analysis and display of genome-wide expression patterns
    Eisen, MB
    Spellman, PT
    Brown, PO
    Botstein, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) : 14863 - 14868