Dynamic spectrum quality assessment and iterative computational analysis of shotgun proteomic data - Toward more efficient identification of post-translational modifications, sequence polymorphisms, and novel peptides

被引:151
作者
Nesvizhskii, AI
Roos, FF
Grossmann, J
Vogelzang, M
Eddes, JS
Gruissem, W
Baginsky, S
Aebersold, R
机构
[1] Inst Syst Biol, Seattle, WA 98103 USA
[2] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland
关键词
D O I
10.1074/mcp.M500319-MCP200
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In mass spectrometry-based proteomics, frequently hundreds of thousands of MS/MS spectra are collected in a single experiment. Of these, a relatively small fraction is confidently assigned to peptide sequences, whereas the majority of the spectra are not further analyzed. Spectra are not assigned to peptides for diverse reasons. These include deficiencies of the scoring schemes implemented in the database search tools, sequence variations (e.g. single nucleotide polymorphisms) or omissions in the database searched, post-translational or chemical modifications of the peptide analyzed, or the observation of sequences that are not anticipated from the genomic sequence (e.g. splice forms, somatic rearrangement, and processed proteins). To increase the amount of information that can be extracted from proteomic MS/MS datasets we developed a robust method that detects high quality spectra within the fraction of spectra unassigned by conventional sequence database searching and computes a quality score for each spectrum. We also demonstrate that iterative search strategies applied to such detected unassigned high quality spectra significantly increase the number of spectra that can be assigned from datasets and that biologically interesting new insights can be gained from existing data.
引用
收藏
页码:652 / 670
页数:19
相关论文
共 74 条
[1]   Mass spectrometry-based proteomics [J].
Aebersold, R ;
Mann, M .
NATURE, 2003, 422 (6928) :198-207
[2]  
[Anonymous], 2003, Statistical pattern recognition
[3]   Protein sequence databases [J].
Apweiler, R ;
Bairoch, A ;
Wu, CH .
CURRENT OPINION IN CHEMICAL BIOLOGY, 2004, 8 (01) :76-80
[4]   Protein identification by mass spectrometry - Issues to be considered [J].
Baldwin, MA .
MOLECULAR & CELLULAR PROTEOMICS, 2004, 3 (01) :1-9
[5]   Improving large-scale proteomics by clustering of mass spectrometry data [J].
Beer, I ;
Barnea, E ;
Ziv, T ;
Admon, A .
PROTEOMICS, 2004, 4 (04) :950-960
[6]   Automatic Quality Assessment of Peptide Tandem Mass Spectra [J].
Bern, Marshall ;
Goldberg, David ;
McDonald, W. Hayes ;
Yates, John R., III .
BIOINFORMATICS, 2004, 20 :49-54
[7]   LIME:: A new membrane raft-associated adaptor protein involved in CD4 and CD8 coreceptor signaling [J].
Brdicková, N ;
Brdicka, T ;
Angelisová, P ;
Horváth, O ;
Spicka, J ;
Hilgert, I ;
Paces, J ;
Simeoni, L ;
Kliche, S ;
Merten, C ;
Schraven, B ;
Horejsí, V .
JOURNAL OF EXPERIMENTAL MEDICINE, 2003, 198 (10) :1453-1462
[8]   The need for guidelines in publication of peptide and protein identification data - Working group on publication guidelines for peptide and protein identification data [J].
Carr, S ;
Aebersold, R ;
Baldwin, M ;
Burlingame, A ;
Clauser, K ;
Nesvizhskii, A .
MOLECULAR & CELLULAR PROTEOMICS, 2004, 3 (06) :531-533
[9]   Comprehensive analysis of a multidimensional liquid chromatography mass spectrometry dataset acquired on a quadrupole selecting, quadrupole collision cell, time-of-flight mass spectrometer - I. How much of the data is theoretically interpretable by search engines? [J].
Chalkley, RJ ;
Baker, PR ;
Hansen, KC ;
Medzihradszky, KF ;
Allen, NP ;
Rexach, M ;
Burlingame, AL .
MOLECULAR & CELLULAR PROTEOMICS, 2005, 4 (08) :1189-1193
[10]   A dynamic programming approach to de novo peptide sequencing via tandem mass spectrometry [J].
Chen, T ;
Kao, MY ;
Tepel, M ;
Rush, J ;
Church, GM .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (03) :325-337