TagRecon: High-Throughput Mutation Identification through Sequence Tagging

被引:84
作者
Dasari, Surendra [1 ]
Chambers, Matthew C. [1 ]
Slebos, Robbert J. [2 ,3 ]
Zimmerman, Lisa J. [3 ,4 ]
Ham, Amy-Joan L. [3 ,4 ]
Tabb, David L. [1 ,3 ,4 ,5 ]
机构
[1] Vanderbilt Univ, Med Ctr, Dept Biomed Informat, Nashville, TN 37232 USA
[2] Vanderbilt Ingram Canc Ctr, Dept Canc Biol, Nashville, TN 37232 USA
[3] Vanderbilt Ingram Canc Ctr, Jim Ayers Inst Precanc Detect & Diag, Nashville, TN 37232 USA
[4] Vanderbilt Univ, Med Ctr, Dept Biochem, Nashville, TN 37232 USA
[5] Vanderbilt Univ, Med Ctr, Mass Spectrometry Res Ctr, Nashville, TN 37232 USA
关键词
mutation; bioinformatics; hydroxyproline; sequence tagging; TANDEM MASS-SPECTRA; ELEVATED MUTANT FREQUENCIES; POSTTRANSLATIONAL MODIFICATIONS; PEPTIDE IDENTIFICATION; PROTEIN MODIFICATIONS; TRANSITION MUTATIONS; SHOTGUN PROTEOMICS; SPECTROMETRY; CANCER; ALGORITHM;
D O I
10.1021/pr900850m
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Shotgun proteomics produces collections of tandem mass spectra that contain all the data needed to identify mutated peptides from clinical samples. Identifying these sequence variations, however, has not been feasible with conventional database search strategies, which require exact matches between observed and expected sequences. Searching for mutations as mass shifts on specified residues through database search can incur significant performance penalties and generate substantial false positive rates. Here we describe TagRecon, an algorithm that leverages inferred sequence tags to identify unanticipated mutations in clinical proteomic data sets. TagRecon identifies unmodified peptides as sensitively as the related MyriMatch database search engine. In both LTQ and Orbitrap data sets, TagRecon outperformed state of the art software in recognizing sequence mismatches from data sets with known variants. We developed guidelines for filtering putative mutations from clinical samples, and we applied them in an analysis of cancer cell lines and an examination of colon tissue. Mutations were found in up to 6% of identified peptides, and only a small fraction corresponded to dbSNP entries. The RKO cell line, which is DNA mismatch repair deficient, yielded more mutant peptides than the mismatch repair proficient SW480 line. Analysis of colon cancer tumor and adjacent tissue revealed hydroxyproline modifications associated with extracellular matrix degradation. These results demonstrate the value of using sequence tagging algorithms to fully interrogate clinical proteomic data sets.
引用
收藏
页码:1716 / 1726
页数:11
相关论文
共 41 条
[31]   MyriMatch: Highly accurate tandem mass spectral peptide identification by multivariate hypergeometric analysis [J].
Tabb, David L. ;
Fernando, Christopher G. ;
Chambers, Matthew C. .
JOURNAL OF PROTEOME RESEARCH, 2007, 6 (02) :654-661
[32]   GutenTag: High-throughput sequence tagging via an empirically derived fragmentation model [J].
Tabb, DL ;
Saraf, A ;
Yates, JR .
ANALYTICAL CHEMISTRY, 2003, 75 (23) :6415-6421
[33]   InsPecT: Identification of posttransiationally modified peptides from tandem mass spectra [J].
Tanner, S ;
Shu, HJ ;
Frank, A ;
Wang, LC ;
Zandi, E ;
Mumby, M ;
Pevzner, PA ;
Bafna, V .
ANALYTICAL CHEMISTRY, 2005, 77 (14) :4626-4639
[34]   Implementation and uses of automated de novo peptide sequencing by tandem mass spectrometry [J].
Taylor, JA ;
Johnson, RS .
ANALYTICAL CHEMISTRY, 2001, 73 (11) :2594-2604
[36]   Age-related changes in human crystallins determined from comparative analysis of post-translational modifications in young and aged lens: Does deamidation contribute to crystallin insolubility? [J].
Wilmarth, P. A. ;
Tanner, S. ;
Dasari, S. ;
Nagalla, S. R. ;
Riviere, M. A. ;
Bafna, V. ;
Pevzner, P. A. ;
David, L. L. .
JOURNAL OF PROTEOME RESEARCH, 2006, 5 (10) :2554-2566
[37]   COLLAGEN IN COLORECTAL-CANCER IN RELATION TO CLINICOPATHOLOGIC STAGE AND HISTOLOGIC GRADE [J].
WOBBES, T ;
HENDRIKS, T ;
DEBOER, HHM .
DISEASES OF THE COLON & RECTUM, 1988, 31 (10) :778-780
[38]   Mbd4 inactivation increases C→T transition mutations and promotes gastrointestinal tumor formation [J].
Wong, E ;
Yang, K ;
Kuraguchi, M ;
Werling, U ;
Avdievich, E ;
Fan, KH ;
Fazzari, M ;
Jin, B ;
Brown, AMC ;
Lipkin, M ;
Edelmann, W .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (23) :14937-14942
[39]  
ZEQIANG M, 2009, J PROTEOME RES, V8, P3872
[40]   Proteomic parsimony through bipartite graph analysis improves accuracy and transparency [J].
Zhang, Bing ;
Chambers, Matthew C. ;
Tabb, David L. .
JOURNAL OF PROTEOME RESEARCH, 2007, 6 (09) :3549-3557