pNovo+: De Novo Peptide Sequencing Using Complementary HCD and ETD Tandem Mass Spectra

被引:85
作者
Chi, Hao [1 ,2 ]
Chen, Haifeng [1 ,2 ]
He, Kun [1 ,2 ]
Wu, Long [1 ,2 ]
Yang, Bing [3 ]
Sun, Rui-Xiang [1 ]
Liu, Jianyun [4 ]
Zeng, Wen-Feng [1 ,2 ]
Song, Chun-Qing [3 ]
He, Si-Min [1 ]
Dong, Meng-Qiu [3 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Grad Univ, Beijing 100049, Peoples R China
[3] Natl Inst Biol Sci, Beijing 102206, Peoples R China
[4] Beihang Univ, Beijing Key Lab Digital Media, Lab Intelligent Recognit & Image Proc, Beijing 100191, Peoples R China
关键词
tandem mass spectrometry; de novo peptide sequencing; HCD; ETD; antisymmetry restriction; k longest paths; INDUCED DISSOCIATION SPECTRA; AUTOMATED INTERPRETATION; PROTEIN IDENTIFICATION; SPECTROMETRIC DATA; SOFTWARE AID; ALGORITHM; MIXTURES; SEARCH; SEQMS; MS/MS;
D O I
10.1021/pr3006843
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
De novo peptide sequencing is the only tool for extracting peptide sequences directly from tandem mass spectrometry (MS) data without any protein database. However, neither the accuracy nor the efficiency of de novo sequencing has been satisfactory, mainly due to incomplete fragmentation information in experimental spectra. Recent advancement in MS technology has enabled acquisition of higher energy collisional dissociation (HCD) and electron transfer dissociation (ETD) spectra of the same precursor. These spectra contain complementary fragmentation information and can be collected with high resolution and high mass accuracy. Taking these advantages, we have developed a new algorithm called pNovo+, which greatly improves the accuracy and speed of de novo sequencing. On tryptic peptides, 86% of the topmost candidate sequences deduced by pNovo+ from HCD + ETD spectral pairs matched the database search results, and the success rate reached 95% if the top three candidates were included, which was much higher than using only HCD (87%) or only ETD spectra (57%). On Asp-N, Glu-C, or Elastase digested peptides, 69-87% of the HCD + ETD spectral pairs were correctly identified by pNovo+ among the topmost candidates, or 84-95% among the top three. On average, it takes pNovo+ only 0.018 s to extract the sequence from a spectrum or spectral pair on a common personal computer. This is more than three times as fast as other de novo sequencing programs. The increase of speed is mainly due to pDAG, a component algorithm of pNovo+. pDAG finds the k longest paths in a directed acyclic graph without the antisymmetry restriction. We have verified that the antisymmetry restriction is unnecessary for high resolution, high mass accuracy data. The extensive use of HCD and ETD spectral information and the pDAG algorithm make pNovo+ an excellent de novo sequencing tool.
引用
收藏
页码:615 / 625
页数:11
相关论文
共 58 条
[1]
Mass spectrometry-based proteomics [J].
Aebersold, R ;
Mann, M .
NATURE, 2003, 422 (6928) :198-207
[2]
Allmer J, 2011, EXPERT REV PROTEOMIC, V8, P645, DOI [10.1586/EPR.11.54, 10.1586/epr.11.54]
[3]
Antilope-A Lagrangian Relaxation Approach to the de novo Peptide Sequencing Problem [J].
Andreotti, Sandro ;
Klau, Gunnar W. ;
Reinert, Knut .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (02) :385-394
[4]
Shotgun protein sequencing - Assembly of peptide tandem mass spectra from mixtures of modified proteins [J].
Bandeira, Nuno ;
Clauser, Karl R. ;
Pevzner, Pavel A. .
MOLECULAR & CELLULAR PROTEOMICS, 2007, 6 (07) :1123-1134
[5]
FAST ALGORITHM FOR PEPTIDE SEQUENCING BY MASS-SPECTROSCOPY [J].
BARTELS, C .
BIOMEDICAL AND ENVIRONMENTAL MASS SPECTROMETRY, 1990, 19 (06) :363-368
[6]
De novo analysis of peptide tandem mass spectra by spectral graph partitioning [J].
Bern, M ;
Goldberg, D .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2006, 13 (02) :364-378
[7]
Lookup peaks: A hybrid of de novo sequencing and database search for protein identification by tandem mass spectrometry [J].
Bern, Marshall ;
Cai, Yuhan ;
Goldberg, David .
ANALYTICAL CHEMISTRY, 2007, 79 (04) :1393-1400
[8]
De novo peptide sequencing by tandem MS using complementary CID and electron transfer dissociation [J].
Bertsch, Andreas ;
Leinenbach, Andreas ;
Pervukhin, Anton ;
Lubeck, Markus ;
Hartmer, Ralf ;
Baessmann, Carsten ;
Elnakady, Yasser Abbas ;
Mueller, Rolf ;
Boecker, Sebastian ;
Huber, Christian G. ;
Kohlbacher, Oliver .
ELECTROPHORESIS, 2009, 30 (21) :3736-3747
[9]
Straightforward and de Novo Peptide Sequencing by MALDI-MS/MS Using a Lys-N Metalloendopeptidase [J].
Boersema, Paul J. ;
Taouatas, Nadia ;
Altelaar, A. F. Maarten ;
Gouw, Joost W. ;
Ross, Philip L. ;
Pappin, Darryl J. ;
Heck, Albert J. R. ;
Mohammed, Shabaz .
MOLECULAR & CELLULAR PROTEOMICS, 2009, 8 (04) :650-660
[10]
A dynamic programming approach to de novo peptide sequencing via tandem mass spectrometry [J].
Chen, T ;
Kao, MY ;
Tepel, M ;
Rush, J ;
Church, GM .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (03) :325-337