Interpretation of shotgun proteomic data - The protein inference problem

被引:765
作者
Nesvizhskii, AI
Aebersold, R
机构
[1] Inst Syst Biol, Seattle, WA 98103 USA
[2] Swiss Fed Inst Technol, Inst Mol Syst Biol, CH-8093 Zurich, Switzerland
关键词
D O I
10.1074/mcp.R500012-MCP200
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The shotgun proteomic strategy based on digesting proteins into peptides and sequencing them using tandem mass spectrometry and automated database searching has become the method of choice for identifying proteins in most large scale studies. However, the peptide-centric nature of shotgun proteomics complicates the analysis and biological interpretation of the data especially in the case of higher eukaryote organisms. The same peptide sequence can be present in multiple different proteins or protein isoforms. Such shared peptides therefore can lead to ambiguities in determining the identities of sample proteins. In this article we illustrate the difficulties of interpreting shotgun proteomic data and discuss the need for common nomenclature and transparent informatic approaches. We also discuss related issues such as the state of protein sequence databases and their role in shotgun proteomic analysis, interpretation of relative peptide quantification data in the presence of multiple protein isoforms, the integration of proteomic and transcriptional data, and the development of a computational infrastructure for the integration of multiple diverse datasets.
引用
收藏
页码:1419 / 1440
页数:22
相关论文
共 115 条
  • [1] Constellations in a cellular universe
    Aebersold, R
    [J]. NATURE, 2003, 422 (6928) : 115 - 116
  • [2] Mass spectrometry-based proteomics
    Aebersold, R
    Mann, M
    [J]. NATURE, 2003, 422 (6928) : 198 - 207
  • [3] In vitro and in silico processes to identify differentially expressed proteins
    Allet, N
    Barrillat, N
    Baussant, T
    Boiteau, C
    Botti, P
    Bougueleret, L
    Budin, N
    Canet, D
    Carraud, S
    Chiappe, D
    Christmann, N
    Colinge, J
    Cusin, I
    Dafflon, N
    Depresle, B
    Fasso, I
    Frauchiger, P
    Gaertner, H
    Gleizes, A
    Gonzalez-Couto, E
    Jeandenans, C
    Karmime, A
    Kowall, T
    Lagache, S
    Mahé, E
    Masselot, A
    Mattou, H
    Moniatte, M
    Niknejad, A
    Paolini, M
    Perret, F
    Pinaud, N
    Ranno, F
    Raimondi, S
    Reffas, S
    Regamey, PO
    Rey, PA
    Rodriguez-Tomé, P
    Rose, K
    Rossellat, G
    Saudrais, C
    Schmidt, C
    Villain, M
    Zwahlen, C
    [J]. PROTEOMICS, 2004, 4 (08) : 2333 - 2351
  • [4] Protein sequence databases
    Apweiler, R
    Bairoch, A
    Wu, CH
    [J]. CURRENT OPINION IN CHEMICAL BIOLOGY, 2004, 8 (01) : 76 - 80
  • [5] Protein identification by mass spectrometry - Issues to be considered
    Baldwin, MA
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2004, 3 (01) : 1 - 9
  • [6] Ensembl 2004
    Birney, E
    Andrews, D
    Bevan, P
    Caccamo, M
    Cameron, G
    Chen, Y
    Clarke, L
    Coates, G
    Cox, T
    Cuff, J
    Curwen, V
    Cutts, T
    Down, T
    Durbin, R
    Eyras, E
    Fernandez-Suarez, XM
    Gane, P
    Gibbins, B
    Gilbert, J
    Hammond, M
    Hotz, H
    Iyer, V
    Kahari, A
    Jekosch, K
    Kasprzyk, A
    Keefe, D
    Keenan, S
    Lehvaslaiho, H
    McVicker, G
    Melsopp, C
    Meidl, P
    Mongin, E
    Pettett, R
    Potter, S
    Proctor, G
    Rae, M
    Searle, S
    Slater, G
    Smedley, D
    Smith, J
    Spooner, W
    Stabenau, A
    Stalker, J
    Storey, R
    Ureta-Vidal, A
    Woodwark, C
    Clamp, M
    Hubbard, T
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : D468 - D470
  • [7] Protein diversity from alternative splicing: A challenge for bioinformatics and post-genome biology
    Black, DL
    [J]. CELL, 2000, 103 (03) : 367 - 370
  • [8] Tandem MS analysis of brain clathrin-coated vesicles reveals their critical involvement in synaptic vesicle recycling
    Blondeau, F
    Ritter, B
    Allaire, PD
    Wasiak, S
    Girard, M
    Hussain, NK
    Angers, A
    Legendre-Guillemin, V
    Roy, L
    Boismenu, D
    Kearney, RE
    Bell, AW
    Bergeron, JJM
    McPherson, PS
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (11) : 3833 - 3838
  • [9] The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003
    Boeckmann, B
    Bairoch, A
    Apweiler, R
    Blatter, MC
    Estreicher, A
    Gasteiger, E
    Martin, MJ
    Michoud, K
    O'Donovan, C
    Phan, I
    Pilbout, S
    Schneider, M
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 365 - 370
  • [10] Biomedical informatics for proteomics
    Boguski, MS
    McIntosh, MW
    [J]. NATURE, 2003, 422 (6928) : 233 - 237