Analysis of protein complexes through model-based biclustering of label-free quantitative AP-MS data

被引:28
作者
Choi, Hyungwon [1 ]
Kim, Sinae [2 ]
Gingras, Anne-Claude [3 ,4 ]
Nesvizhskii, Alexey I. [1 ,5 ]
机构
[1] Univ Michigan, Dept Pathol, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
[3] Mt Sinai Hosp, Samuel Lunenfeld Res Inst, Toronto, ON M5G 1X5, Canada
[4] Univ Toronto, Dept Mol Genet, Toronto, ON, Canada
[5] Univ Michigan, Ctr Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
基金
加拿大健康研究院;
关键词
clustering; mass spectrometry; protein complexes; protein-protein interaction; spectral counts; PHYSICAL INTERACTOME; INTERACTION NETWORKS; PROTEOMIC DATA; REVEALS;
D O I
10.1038/msb.2010.41
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Affinity purification followed by mass spectrometry (AP-MS) has become a common approach for identifying protein-protein interactions (PPIs) and complexes. However, data analysis and visualization often rely on generic approaches that do not take advantage of the quantitative nature of AP-MS. We present a novel computational method, nested clustering, for biclustering of label-free quantitative AP-MS data. Our approach forms bait clusters based on the similarity of quantitative interaction profiles and identifies submatrices of prey proteins showing consistent quantitative association within bait clusters. In doing so, nested clustering effectively addresses the problem of overrepresentation of interactions involving baits proteins as compared with proteins only identified as preys. The method does not require specification of the number of bait clusters, which is an advantage against existing model-based clustering methods. We illustrate the performance of the algorithm using two published intermediate scale human PPI data sets, which are representative of the AP-MS data generated from mammalian cells. We also discuss general challenges of analyzing and interpreting clustering results in the context of AP-MS data. Molecular Systems Biology 6: 385; published online 22 June 2010; doi:10.1038/msb.2010.41
引用
收藏
页数:11
相关论文
共 44 条
[1]   Mass spectrometry-based proteomics [J].
Aebersold, R ;
Mann, M .
NATURE, 2003, 422 (6928) :198-207
[2]   MIXTURES OF DIRICHLET PROCESSES WITH APPLICATIONS TO BAYESIAN NONPARAMETRIC PROBLEMS [J].
ANTONIAK, CE .
ANNALS OF STATISTICS, 1974, 2 (06) :1152-1174
[3]  
Braun P, 2009, NAT METHODS, V6, P91, DOI [10.1038/NMETH.1281, 10.1038/nmeth.1281]
[4]   A Global Protein Kinase and Phosphatase Interaction Network in Yeast [J].
Breitkreutz, Ashton ;
Choi, Hyungwon ;
Sharom, Jeffrey R. ;
Boucher, Lorrie ;
Neduva, Victor ;
Larsen, Brett ;
Lin, Zhen-Yuan ;
Breitkreutz, Bobby-Joe ;
Stark, Chris ;
Liu, Guomin ;
Ahn, Jessica ;
Dewar-Darch, Danielle ;
Reguly, Teresa ;
Tang, Xiaojing ;
Almeida, Ricardo ;
Qin, Zhaohui Steve ;
Pawson, Tony ;
Gingras, Anne-Claude ;
Nesvizhskii, Alexey I. ;
Tyers, Mike .
SCIENCE, 2010, 328 (5981) :1043-1046
[5]   Affinity-purification mass spectrometry (AP-MS) of serine/threonine phosphatases [J].
Chen, Ginny I. ;
Gingras, Anne-Claude .
METHODS, 2007, 42 (03) :298-305
[6]  
Cheng Y., 2000, Proceedings International Conference on Intelligent System,s for Molecular Biology
[7]  
ISMB. International Conference on Intelligent System, V8, P93
[8]   Significance Analysis of Spectral Count Data in Label-free Shotgun Proteomics [J].
Choi, Hyungwon ;
Fermin, Damian ;
Nesvizhskii, Alexey I. .
MOLECULAR & CELLULAR PROTEOMICS, 2008, 7 (12) :2373-2385
[9]   Toward a comprehensive atlas of the physical interactome of Saccharomyces cerevisiae [J].
Collins, Sean R. ;
Kemmeren, Patrick ;
Zhao, Xue-Chu ;
Greenblatt, Jack F. ;
Spencer, Forrest ;
Holstege, Frank C. P. ;
Weissman, Jonathan S. ;
Krogan, Nevan J. .
MOLECULAR & CELLULAR PROTEOMICS, 2007, 6 (03) :439-450
[10]  
Dudoit S, 2002, GENOME BIOL, V3