Database searching and accounting of multiplexed precursor and product ion spectra from the data independent analysis of simple and complex peptide mixtures

被引:445
作者
Li, Guo-Zhong [2 ]
Vissers, Johannes P. C. [1 ]
Silva, Jeffrey C. [2 ]
Golick, Dan [2 ]
Gorenstein, Marc V. [2 ]
Geromanos, Scott J. [2 ]
机构
[1] Waters Corp, Manchester M22 5PP, Lancs, England
[2] Waters Corp, Milford, MA USA
关键词
Database searching; Data independent LC-MS; Multiplexed LC-MS; Parallel LC-MS; Shotgun sequencing; INDUCED DISSOCIATION SPECTRA; LARGE-SCALE PROTEOMICS; TANDEM MASS-SPECTRA; PROTEIN IDENTIFICATION; LIQUID-CHROMATOGRAPHY; SEQUENCE DATABASES; RETENTION TIMES; SPECTROMETRY; PREDICTION; MS;
D O I
10.1002/pmic.200800564
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
A novel database search algorithm is presented for the qualitative identification of proteins over a wide dynamic range, both in simple and complex biological samples. The algorithm has been designed for the analysis of data originating from data independent acquisitions, whereby multiple precursor ions are fragmented simultaneously. Measurements used by the algorithm include retention time, ion intensities, charge state, and accurate masses on both precursor and product ions from LC-MS data. The search algorithm uses an iterative process whereby each iteration incrementally increases the selectivity, specificity, and sensitivity of the overall strategy. Increased specificity is obtained by utilizing a subset database search approach, whereby for each subsequent stage of the search, only those peptides from securely identified proteins are queried. Tentative peptide and protein identifications are ranked and scored by their relative correlation to a number of models of known and empirically derived physicochemical attributes of proteins and peptides. In addition, the algorithm utilizes decoy database techniques for automatically determining the false positive identification rates. The search algorithm has been tested by comparing the search results from a four-protein mixture, the same four-protein mixture spiked into a complex biological background, and a variety of other "system" type protein digest mixtures. The method was validated independently by data dependent methods, while concurrently relying on replication and selectivity. Comparisons were also performed with other commercially and publicly available peptide fragmentation search algorithms. The presented results demonstrate the ability to correctly identify peptides and proteins from data independent acquisition strategies with high sensitivity and specificity. They also illustrate a more comprehensive analysis of the samples studied; providing approximately 20% more protein identifications, compared to a more conventional data directed approach using the same identification criteria, with a concurrent increase in both sequence coverage and the number of modified peptides.
引用
收藏
页码:1696 / 1719
页数:24
相关论文
共 76 条
[1]   Mass spectrometry in proteomics [J].
Aebersold, R ;
Goodlett, DR .
CHEMICAL REVIEWS, 2001, 101 (02) :269-295
[2]   Mass spectrometry-based proteomics [J].
Aebersold, R ;
Mann, M .
NATURE, 2003, 422 (6928) :198-207
[3]   Electron capture dissociation mass spectrometry in characterization of peptides and proteins [J].
Bakhtiar, Ray ;
Guan, Ziqiang .
BIOTECHNOLOGY LETTERS, 2006, 28 (14) :1047-1059
[4]   Using statistical models to identify factors that have a role in defining the abundance of ions produced by tandem MS [J].
Barton, Sheila J. ;
Richardson, Sylvia ;
Perkins, David N. ;
Bellahn, Inga ;
Bryant, Trevor N. ;
Whittaker, John C. .
ANALYTICAL CHEMISTRY, 2007, 79 (15) :5601-5607
[5]   A novel precursor ion discovery method on a hybrid quadrupole orthogonal acceleration time-of-flight (Q-TOF) mass spectrometer for studying protein phosphorylation [J].
Bateman, RH ;
Carruthers, R ;
Hoyes, JB ;
Jones, C ;
Langridge, JI ;
Millar, A ;
Vissers, JPC .
JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 2002, 13 (07) :792-803
[6]   Improving large-scale proteomics by clustering of mass spectrometry data [J].
Beer, I ;
Barnea, E ;
Ziv, T ;
Admon, A .
PROTEOMICS, 2004, 4 (04) :950-960
[7]  
BEYNON RJ, 2001, PROTEOLYTIC ENZYMES, P149
[8]   AMINO-ACID SEQUENCING OF PROTEINS [J].
BIEMANN, K ;
PAPAYANNOPOULOS, IA .
ACCOUNTS OF CHEMICAL RESEARCH, 1994, 27 (11) :370-378
[9]   MASS-SPECTROMETRIC DETERMINATION OF THE AMINO-ACID-SEQUENCE OF PEPTIDES AND PROTEINS [J].
BIEMANN, K ;
MARTIN, SA .
MASS SPECTROMETRY REVIEWS, 1987, 6 (01) :1-75
[10]   Differential proteomics via probabilistic peptide identification scores [J].
Colinge, J ;
Chiappe, D ;
Lagache, S ;
Moniatte, M ;
Bougueleret, L .
ANALYTICAL CHEMISTRY, 2005, 77 (02) :596-606