Analysis of protein complexes through model-based biclustering of label-free quantitative AP-MS data

被引:28
作者
Choi, Hyungwon [1 ]
Kim, Sinae [2 ]
Gingras, Anne-Claude [3 ,4 ]
Nesvizhskii, Alexey I. [1 ,5 ]
机构
[1] Univ Michigan, Dept Pathol, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
[3] Mt Sinai Hosp, Samuel Lunenfeld Res Inst, Toronto, ON M5G 1X5, Canada
[4] Univ Toronto, Dept Mol Genet, Toronto, ON, Canada
[5] Univ Michigan, Ctr Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
基金
加拿大健康研究院;
关键词
clustering; mass spectrometry; protein complexes; protein-protein interaction; spectral counts; PHYSICAL INTERACTOME; INTERACTION NETWORKS; PROTEOMIC DATA; REVEALS;
D O I
10.1038/msb.2010.41
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Affinity purification followed by mass spectrometry (AP-MS) has become a common approach for identifying protein-protein interactions (PPIs) and complexes. However, data analysis and visualization often rely on generic approaches that do not take advantage of the quantitative nature of AP-MS. We present a novel computational method, nested clustering, for biclustering of label-free quantitative AP-MS data. Our approach forms bait clusters based on the similarity of quantitative interaction profiles and identifies submatrices of prey proteins showing consistent quantitative association within bait clusters. In doing so, nested clustering effectively addresses the problem of overrepresentation of interactions involving baits proteins as compared with proteins only identified as preys. The method does not require specification of the number of bait clusters, which is an advantage against existing model-based clustering methods. We illustrate the performance of the algorithm using two published intermediate scale human PPI data sets, which are representative of the AP-MS data generated from mammalian cells. We also discuss general challenges of analyzing and interpreting clustering results in the context of AP-MS data. Molecular Systems Biology 6: 385; published online 22 June 2010; doi:10.1038/msb.2010.41
引用
收藏
页数:11
相关论文
共 44 条
[21]   Global landscape of protein complexes in the yeast Saccharomyces cerevisiae [J].
Krogan, NJ ;
Cagney, G ;
Yu, HY ;
Zhong, GQ ;
Guo, XH ;
Ignatchenko, A ;
Li, J ;
Pu, SY ;
Datta, N ;
Tikuisis, AP ;
Punna, T ;
Peregrín-Alvarez, JM ;
Shales, M ;
Zhang, X ;
Davey, M ;
Robinson, MD ;
Paccanaro, A ;
Bray, JE ;
Sheung, A ;
Beattie, B ;
Richards, DP ;
Canadien, V ;
Lalev, A ;
Mena, F ;
Wong, P ;
Starostine, A ;
Canete, MM ;
Vlasblom, J ;
Wu, S ;
Orsi, C ;
Collins, SR ;
Chandran, S ;
Haw, R ;
Rilstone, JJ ;
Gandi, K ;
Thompson, NJ ;
Musso, G ;
St Onge, P ;
Ghanny, S ;
Lam, MHY ;
Butland, G ;
Altaf-Ui, AM ;
Kanaya, S ;
Shilatifard, A ;
O'Shea, E ;
Weissman, JS ;
Ingles, CJ ;
Hughes, TR ;
Parkinson, J ;
Gerstein, M .
NATURE, 2006, 440 (7084) :637-643
[22]  
Lazzeroni L, 2002, STAT SINICA, V12, P61
[23]   A model for random sampling and estimation of relative protein abundance in shotgun proteomics [J].
Liu, HB ;
Sadygov, RG ;
Yates, JR .
ANALYTICAL CHEMISTRY, 2004, 76 (14) :4193-4201
[24]   Absolute protein expression profiling estimates the relative contributions of transcriptional and translational regulation [J].
Lu, Peng ;
Vogel, Christine ;
Wang, Rong ;
Yao, Xin ;
Marcotte, Edward M. .
NATURE BIOTECHNOLOGY, 2007, 25 (01) :117-124
[25]   Interpretation of shotgun proteomic data - The protein inference problem [J].
Nesvizhskii, AI ;
Aebersold, R .
MOLECULAR & CELLULAR PROTEOMICS, 2005, 4 (10) :1419-1440
[26]   Analysis and validation of proteomic data generated by tandem mass spectrometry [J].
Nesvizhskii, Alexey I. ;
Vitek, Olga ;
Aebersold, Ruedi .
NATURE METHODS, 2007, 4 (10) :787-797
[27]   Comparison of label-free methods for quantifying human proteins by shotgun proteomics [J].
Old, WM ;
Meyer-Arendt, K ;
Aveline-Wolf, L ;
Pierce, KG ;
Mendoza, A ;
Sevinsky, JR ;
Resing, KA ;
Ahn, NG .
MOLECULAR & CELLULAR PROTEOMICS, 2005, 4 (10) :1487-1502
[28]   Cluster analysis of mass spectrometry data reveals a novel component of SAGA [J].
Powell, DW ;
Weaver, CM ;
Jennings, JL ;
McAfee, KJ ;
He, Y ;
Weil, PA ;
Link, AJ .
MOLECULAR AND CELLULAR BIOLOGY, 2004, 24 (16) :7249-7259
[29]   A systematic comparison and evaluation of biclustering methods for gene expression data [J].
Prelic, A ;
Bleuler, S ;
Zimmermann, P ;
Wille, A ;
Bühlmann, P ;
Gruissem, W ;
Hennig, L ;
Thiele, L ;
Zitzler, E .
BIOINFORMATICS, 2006, 22 (09) :1122-1129
[30]   Identifying functional modules in the physical interactome of Saccharomyces cerevisiae [J].
Pu, Shuye ;
Vlasblom, Jim ;
Emili, Andrew ;
Greenblatt, Jack ;
Wodak, Shoshana J. .
PROTEOMICS, 2007, 7 (06) :944-960