Development and validation of a spectral library searching method for peptide identification from MS/MS

被引:380
作者
Lam, Henry
Deutsch, Eric W.
Eddes, James S.
Eng, Jimmy K.
King, Nichole
Stein, Stephen E.
Aebersold, Ruedi
机构
[1] ETH, Inst Mol Syst Biol, CH-8093 Zurich, Switzerland
[2] Inst Syst Biol, Seattle, WA USA
[3] Fred Hutchinson Canc Res Ctr, Seattle, WA 98104 USA
[4] Natl Inst Stand & Technol, Gaithersburg, MD 20899 USA
[5] Univ Zurich, Fac Sci, Zurich, Switzerland
关键词
peptide identification by MS/MS; spectral library; spectral similarity; targeted proteomics;
D O I
10.1002/pmic.200600625
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
A notable inefficiency of shotgun proteomics experiments is the repeated rediscovery of the same identifiable peptides by sequence database searching methods, which often are time-consuming and error-prone. A more precise and efficient method, in which previously observed and identified peptide MS/MS spectra are catalogued and condensed into searchable spectral libraries to allow new identifications by spectral matching, is seen as a promising alternative. To that end, an open-source, functionally complete, high-throughput and readily extensible MS/MS spectral searching tool, SpectraST, was developed. A high-quality spectral library was constructed by combining the high-confidence identifications of millions of spectra taken from various data repositories and searched using four sequence search engines. The resulting library consists of over 30 000 spectra for Saccharomyces cerewsiae. Using this library, SpectraST vastly outperforms the sequence search engine SEQUEST in terms of speed and the ability to discriminate good and bad hits. A unique advantage of SpectraST is its full integration into the popular Trans Proteornic Pipeline suite of software, which facilitates user adoption and provides important functionalities such as peptide and protein probability assignment, quantification, and data visualization. This method of spectral library searching is especially suited for targeted proteomics applications, offering superior performance to traditional sequence searching.
引用
收藏
页码:655 / 667
页数:13
相关论文
共 47 条
[1]   Mass spectrometry in proteomics [J].
Aebersold, R ;
Goodlett, DR .
CHEMICAL REVIEWS, 2001, 101 (02) :269-295
[2]   Mass spectrometry-based proteomics [J].
Aebersold, R ;
Mann, M .
NATURE, 2003, 422 (6928) :198-207
[3]   The human plasma proteome - History, character, and diagnostic prospects [J].
Anderson, NL ;
Anderson, NG .
MOLECULAR & CELLULAR PROTEOMICS, 2002, 1 (11) :845-867
[4]   The critical evaluation of a comprehensive mass spectral library [J].
Ausloos, P ;
Clifton, CL ;
Lias, SG ;
Mikaya, AI ;
Stein, SE ;
Tchekhovskoi, DV ;
Sparkman, OD ;
Zaikin, V ;
Zhu, D .
JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 1999, 10 (04) :287-299
[5]   Using annotated peptide mass spectrum libraries for protein identification [J].
Craig, R. ;
Cortens, J. C. ;
Fenyo, D. ;
Beavis, R. C. .
JOURNAL OF PROTEOME RESEARCH, 2006, 5 (08) :1843-1849
[6]   The use of proteotypic peptide libraries for protein identification [J].
Craig, R ;
Cortens, JP ;
Beavis, RC .
RAPID COMMUNICATIONS IN MASS SPECTROMETRY, 2005, 19 (13) :1844-1850
[7]   Open source system for analyzing, validating, and storing protein identification data [J].
Craig, R ;
Cortens, JP ;
Beavis, RC .
JOURNAL OF PROTEOME RESEARCH, 2004, 3 (06) :1234-1242
[8]   TANDEM: matching proteins with tandem mass spectra [J].
Craig, R ;
Beavis, RC .
BIOINFORMATICS, 2004, 20 (09) :1466-1467
[9]   Proteomics [J].
de Hoog, CL ;
Mann, M .
ANNUAL REVIEW OF GENOMICS AND HUMAN GENETICS, 2004, 5 :267-293
[10]  
Desiere F, 2005, GENOME BIOL, V6