pFind-Alioth: A novel unrestricted database search algorithm to improve the interpretation of high-resolution MS/MS data

被引:49
作者
Chi, Hao [1 ]
He, Kun [1 ]
Yang, Bing [2 ]
Chen, Zhen [3 ]
Sun, Rui-Xiang [1 ]
Fan, Sheng-Bo [1 ]
Zhang, Kun [1 ]
Liu, Chao [1 ]
Yuan, Zuo-Fei [1 ]
Wang, Quan-Hui [3 ]
Liu, Si-Qi [3 ]
Dong, Meng-Qiu [2 ]
He, Si-Min [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
[2] Natl Inst Biol Sci, Beijing 102206, Peoples R China
[3] Chinese Acad Sci, Beijing Inst Genom, Beijing 100029, Peoples R China
基金
国家高技术研究发展计划(863计划);
关键词
Unrestricted database search; Ion index; In-depth interpretation; High resolution MS/MS; PROTEIN IDENTIFICATION; PEPTIDE IDENTIFICATION; POSTTRANSLATIONAL MODIFICATIONS; SEQUENCE DATABASES; SPECTRAL NETWORKS; TANDEM; SOFTWARE; ACCURATE; ENGINE; MODEL;
D O I
10.1016/j.jprot.2015.05.009
中图分类号
Q5 [生物化学];
学科分类号
070307 [化学生物学];
摘要
Database search is the dominant approach in high-throughput proteomic analysis. However, the interpretation rate of MS/MS spectra is very low in such a restricted mode, which is mainly due to unexpected modifications and irregular digestion types. In this study, we developed a new algorithm called Alioth, to be integrated into the search engine of pFind, for fast and accurate unrestricted database search on high-resolution MS/MS data. An ion index is constructed for both peptide precursors and fragment ions, by which arbitrary digestions and a single site of any modifications and mutations can be searched efficiently. A new re-ranking algorithm is used to distinguish the correct peptide-spectrum matches from random ones. The algorithm is tested on several HCD datasets and the interpretation rate of MS/MS spectra using Alioth is as high as 60%-80%. Peptides from semi- and non-specific digestions, as well as those with unexpected modifications or mutations, can be effectively identified using Alioth and confidently validated using other search engines. The average processing speed of Alioth is 5-10 times faster than some other unrestricted search engines and is comparable to or even faster than the restricted search algorithms tested. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:89 / 97
页数:9
相关论文
共 58 条
[1]
Mass spectrometry-based proteomics [J].
Aebersold, R ;
Mann, M .
NATURE, 2003, 422 (6928) :198-207
[2]
QuickMod: A Tool for Open Modification Spectrum Library Searches [J].
Ahrne, Erik ;
Nikitin, Frederic ;
Lisacek, Frederique ;
Mueller, Markus .
JOURNAL OF PROTEOME RESEARCH, 2011, 10 (07) :2913-2921
[3]
Spectral networks: a new approach to de novo discovery of protein sequences and posttranslational modifications [J].
Bandeira, Nuno .
BIOTECHNIQUES, 2007, 42 (06) :687-+
[4]
Protein identification by spectral networks analysis [J].
Bandeira, Nuno ;
Tsur, Dekel ;
Frank, Ari ;
Pevzner, Pavel A. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (15) :6140-6145
[5]
A complete sequence of the T tengcongensis genome [J].
Bao, QY ;
Tian, YQ ;
Li, W ;
Xu, ZY ;
Xuan, ZY ;
Hu, SN ;
Dong, W ;
Yang, J ;
Chen, YJ ;
Xue, YF ;
Xu, Y ;
Lai, XQ ;
Huang, L ;
Dong, XZ ;
Ma, YH ;
Ling, LJ ;
Tan, HR ;
Chen, RS ;
Wang, J ;
Yu, J ;
Yang, HM .
GENOME RESEARCH, 2002, 12 (05) :689-700
[6]
Blind search for post-translational modifications and amino acid substitutions using peptide mass fingerprints from two proteases [J].
Barsnes H. ;
Mikalsen S.-O. ;
Eidhammer I. .
BMC Research Notes, 1 (1)
[7]
Lookup peaks: A hybrid of de novo sequencing and database search for protein identification by tandem mass spectrometry [J].
Bern, Marshall ;
Cai, Yuhan ;
Goldberg, David .
ANALYTICAL CHEMISTRY, 2007, 79 (04) :1393-1400
[8]
Reanalysis of Tyrannosaurus rex Mass Spectra [J].
Bern, Marshall ;
Phinney, Brett S. ;
Goldberg, David .
JOURNAL OF PROTEOME RESEARCH, 2009, 8 (09) :4328-4332
[9]
Accurate and Sensitive Peptide Identification with Mascot Percolator [J].
Brosch, Markus ;
Yu, Lu ;
Hubbard, Tim ;
Choudhary, Jyoti .
JOURNAL OF PROTEOME RESEARCH, 2009, 8 (06) :3176-3181
[10]
Search engine processor: Filtering and organizing peptide spectrum matches [J].
Carvalho, Paulo C. ;
Fischer, Juliana S. G. ;
Xu, Tao ;
Cociorva, Daniel ;
Balbuena, Tiago S. ;
Valente, Richard H. ;
Perales, Jonas ;
Yates, John R., III ;
Barbosa, Valmir C. .
PROTEOMICS, 2012, 12 (07) :944-949