jmzReader: A Java']Java parser library to process and visualize multiple text and XML-based mass spectrometry data formats

被引:28
作者
Griss, Johannes [1 ,2 ]
Reisinger, Florian [1 ]
Hermjakob, Henning [1 ]
Vizcaino, Juan Antonio [1 ]
机构
[1] EMBL European Bioinformat Inst, Cambridge CB10 1SD, England
[2] Med Univ Vienna, Dept Med 1, Vienna, Austria
基金
英国惠康基金;
关键词
Bioinformatics; Data standard; !text type='Java']Java[!/text; MS data processing; Proteomics standards initiative; PROTEIN IDENTIFICATION; STANDARD;
D O I
10.1002/pmic.201100578
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We here present the jmzReader library: a collection of Java application programming interfaces (APIs) to parse the most commonly used peak list and XML-based mass spectrometry (MS) data formats: DTA, MS2, MGF, PKL, mzXML, mzData, and mzML (based on the already existing API jmzML). The library is optimized to be used in conjunction with mzIdentML, the recently released standard data format for reporting protein and peptide identifications, developed by the HUPO proteomics standards initiative (PSI). mzIdentML files do not contain spectra data but contain references to different kinds of external MS data files. As a key functionality, all parsers implement a common interface that supports the various methods used by mzIdentML to reference external spectra. Thus, when developing software for mzIdentML, programmers no longer have to support multiple MS data file formats but only this one interface. The library (which includes a viewer) is open source and, together with detailed documentation, can be downloaded from .
引用
收藏
页码:795 / 798
页数:4
相关论文
共 17 条
[1]   OMSSA Parser: An open-source library to parse and extract data from OMSSA MS/MS search results [J].
Barsnes, Harald ;
Huber, Steffen ;
Sickmann, Albert ;
Eidhammer, Ingvar ;
Martens, Lennart .
PROTEOMICS, 2009, 9 (14) :3772-3774
[2]   The need for guidelines in publication of peptide and protein identification data - Working group on publication guidelines for peptide and protein identification data [J].
Carr, S ;
Aebersold, R ;
Baldwin, M ;
Burlingame, A ;
Clauser, K ;
Nesvizhskii, A .
MOLECULAR & CELLULAR PROTEOMICS, 2004, 3 (06) :531-533
[3]   thermo-msf-parser: An Open Source Java']Java Library to Parse and Visualize Thermo Proteome Discoverer msf Files [J].
Colaert, Niklaas ;
Barsnes, Harald ;
Vaudel, Marc ;
Helsens, Kenny ;
Timmerman, Evy ;
Sickmann, Albert ;
Gevaert, Kris ;
Martens, Lennart .
JOURNAL OF PROTEOME RESEARCH, 2011, 10 (08) :3840-3843
[4]   jmzML, an open-source Java']Java API for mzML, the PSI standard for MS data [J].
Cote, Richard G. ;
Reisinger, Florian ;
Martens, Lennart .
PROTEOMICS, 2010, 10 (07) :1332-1335
[5]  
Eisenacher M, 2011, METHODS MOL BIOL, V696, P161, DOI 10.1007/978-1-60761-987-1_10
[6]   A method for assessing the statistical significance of mass spectrometry-based protein identifications using general scoring schemes [J].
Fenyö, D ;
Beavis, RC .
ANALYTICAL CHEMISTRY, 2003, 75 (04) :768-774
[7]   Open mass spectrometry search algorithm [J].
Geer, LY ;
Markey, SP ;
Kowalak, JA ;
Wagner, L ;
Xu, M ;
Maynard, DM ;
Yang, XY ;
Shi, WY ;
Bryant, SH .
JOURNAL OF PROTEOME RESEARCH, 2004, 3 (05) :958-964
[8]   ProteoWizard: open source software for rapid proteomics tools development [J].
Kessner, Darren ;
Chambers, Matt ;
Burke, Robert ;
Agusand, David ;
Mallick, Parag .
BIOINFORMATICS, 2008, 24 (21) :2534-2536
[9]   MGFp: An Open Mascot Generic Format Parser Library Implementation [J].
Kirchner, Marc ;
Steen, Judith A. J. ;
Hamprecht, Fred A. ;
Steen, Hanno .
JOURNAL OF PROTEOME RESEARCH, 2010, 9 (05) :2762-2763
[10]   Probability-based validation of protein identifications using a modified SEQUEST algorithm [J].
MacCoss, MJ ;
Wu, CC ;
Yates, JR .
ANALYTICAL CHEMISTRY, 2002, 74 (21) :5593-5599