Interpretation of shotgun proteomic data - The protein inference problem

Cited by: 765
Authors
Nesvizhskii, AI
Aebersold, R
Affiliations
[1] Inst Syst Biol, Seattle, WA 98103 USA
[2] Swiss Fed Inst Technol, Inst Mol Syst Biol, CH-8093 Zurich, Switzerland
DOI
10.1074/mcp.R500012-MCP200
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
The shotgun proteomic strategy based on digesting proteins into peptides and sequencing them using tandem mass spectrometry and automated database searching has become the method of choice for identifying proteins in most large scale studies. However, the peptide-centric nature of shotgun proteomics complicates the analysis and biological interpretation of the data especially in the case of higher eukaryote organisms. The same peptide sequence can be present in multiple different proteins or protein isoforms. Such shared peptides therefore can lead to ambiguities in determining the identities of sample proteins. In this article we illustrate the difficulties of interpreting shotgun proteomic data and discuss the need for common nomenclature and transparent informatic approaches. We also discuss related issues such as the state of protein sequence databases and their role in shotgun proteomic analysis, interpretation of relative peptide quantification data in the presence of multiple protein isoforms, the integration of proteomic and transcriptional data, and the development of a computational infrastructure for the integration of multiple diverse datasets.
Pages: 1419-1440
Number of pages: 22
Related papers
115 in total
  • [11] BRANDT U, 1993, J BIOL CHEM, V268, P8387
  • [12] The need for guidelines in publication of peptide and protein identification data - Working group on publication guidelines for peptide and protein identification data
    Carr, S
    Aebersold, R
    Baldwin, M
    Burlingame, A
    Clauser, K
    Nesvizhskii, A
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2004, 3 (06) : 531 - 533
  • [13] Quantitative profiling of proteins in complex mixtures using liquid chromatography and mass spectrometry
    Chelius, D
    Bondarenko, PV
    [J]. JOURNAL OF PROTEOME RESEARCH, 2002, 1 (04) : 317 - 323
  • [14] Discordant protein and mRNA expression in lung adenocarcinomas
    Chen, GA
    Gharib, TG
    Huang, CC
    Taylor, JMG
    Misek, DE
    Kardia, SLR
    Giordano, TJ
    Iannettoni, MD
    Orringer, MB
    Hanash, SM
    Beer, DG
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2002, 1 (04) : 304 - 313
  • [15] Multiple enzymatic digestion for enhanced sequence coverage of proteins in complex proteomic mixtures using capillary LC with ion trap MS/MS
    Choudhary, G
    Wu, SL
    Shieh, P
    Hancock, WS
    [J]. JOURNAL OF PROTEOME RESEARCH, 2003, 2 (01) : 59 - 67
  • [16] Choudhary JS, 2001, PROTEOMICS, V1, P651, DOI 10.1002/1615-9861(200104)1:5<651::AID-PROT651>3.0.CO;2-N
  • [18] Role of accurate mass measurement (±10 ppm) in protein identification strategies employing MS or MS/MS and database searching
    Clauser, KR
    Baker, P
    Burlingame, AL
    [J]. ANALYTICAL CHEMISTRY, 1999, 71 (14) : 2871 - 2882
  • [19] Integrating gene and protein expression data: pattern analysis and profile mining
    Cox, B
    Kislinger, T
    Emili, A
    [J]. METHODS, 2005, 35 (03) : 303 - 314
  • [20] Open source system for analyzing, validating, and storing protein identification data
    Craig, R
    Cortens, JP
    Beavis, RC
    [J]. JOURNAL OF PROTEOME RESEARCH, 2004, 3 (06) : 1234 - 1242