The pipeline system for Octave and Mat lab (PSOM): a lightweight scripting framework and execution engine for scientific workflows

被引:86
作者
Bellec, Pierre [1 ,2 ]
Lavoie-Courchesne, Sebastien [1 ,2 ,3 ]
Dickinson, Phil [1 ,3 ]
Lerch, Jason P. [4 ,5 ]
Zijdenbos, Alex P. [6 ]
Evans, Alan C. [3 ]
机构
[1] Inst Univ Geriatrie Montreal, Ctr Rech, Montreal, PQ H3W 1W5, Canada
[2] Univ Montreal, Dept Informat & Rech Operat, Montreal, PQ H3C 3J7, Canada
[3] McGill Univ, Montreal Neurol Inst, McConnell Brain Imaging Ctr, Montreal, PQ, Canada
[4] Hosp Sick Children, Mouse Imaging Ctr, Toronto, ON M5G 1X8, Canada
[5] Univ Toronto, Dept Med Biophys, Toronto, ON, Canada
[6] Biospective Inc, Montreal, PQ, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
pipeline; workflow; Octave; Matlab; open-source; parallel computing; high-performance computing; neuroimaging; HUMAN BRAIN; MANAGEMENT; DISCOVERY; SCIENCE; SWIFT; FMRI;
D O I
10.3389/fninf.2012.00007
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The analysis of neuroimaging databases typically Involves a large number of inter-connected steps called a pipeline. The pipeline system for Octave and Matlab (PSOM) is a flexible framework for the implementation of pipelines in the form of Octave or Matlab scripts. PSOM does not introduce new language constructs to specify the steps and structure of the workflow. All steps of analysis are instead described by a regular Matlab data structure, documenting their associated command and options, as well as their input, output, and cleaned-up files. The PSOM execution engine provides a number of automated services: (1) it executes jobs in parallel on a local computing facility as long as the dependencies between jobs allow for it and sufficient resources are available; (2) it generates a comprehensive record of the pipeline stages and the history of execution, which is detailed enough to fully reproduce the analysis; (3) if an analysis is started multiple times, it executes only the parts of the pipeline that need to be reprocessed. PSOM is distributed under an open-source MIT license and can be used without restriction for academic or commercial projects. The package has no external dependencies besides Matlab or Octave, is straightforward to install and supports of variety of operating systems (Linux, Windows, Mac). We ran several benchmark experiments on a public database including 200 subjects, using a pipeline for the preprocessing of functional magnetic resonance images (fMRI). The benchmark results showed that PSOM is a powerful solution for the analysis of large databases using local or distributed computing resources.
引用
收藏
页数:18
相关论文
共 47 条
[1]  
[Anonymous], 2006, Proc. Special Interest Group on Management of Data Conf. (SIGMOD '06), DOI [10.1145/1142473.1142574, DOI 10.1145/1142473.1142574]
[2]  
[Anonymous], 2006, 12 ANN M ORG HUM BRA
[3]  
Armstrong T. G., 2011, THESIS U CHICAGO
[4]   SPM: A history [J].
Ashburner, John .
NEUROIMAGE, 2012, 62 (02) :791-800
[5]  
Baker H. G., 1977, P 1977 S ART INT PRO
[6]  
Bellec P., 2010, P 16 INT C FUNCT MAP
[7]  
Bellec P., 2011, P 17 INT C FUNCT MAP, P2735
[8]   Multi-level bootstrap analysis of stable clusters in resting-state fMRI [J].
Bellec, Pierre ;
Rosa-Neto, Pedro ;
Lyttelton, Oliver C. ;
Benali, Habib ;
Evans, Alan C. .
NEUROIMAGE, 2010, 51 (03) :1126-1139
[9]   Bootstrap generation and evaluation of an fMRI simulation database [J].
Bellec, Pierre ;
Perlbarg, Vincent ;
Evans, Alan C. .
MAGNETIC RESONANCE IMAGING, 2009, 27 (10) :1382-1396
[10]   Toward discovery science of human brain function [J].
Biswal, Bharat B. ;
Mennes, Maarten ;
Zuo, Xi-Nian ;
Gohel, Suril ;
Kelly, Clare ;
Smith, Steve M. ;
Beckmann, Christian F. ;
Adelstein, Jonathan S. ;
Buckner, Randy L. ;
Colcombe, Stan ;
Dogonowski, Anne-Marie ;
Ernst, Monique ;
Fair, Damien ;
Hampson, Michelle ;
Hoptman, Matthew J. ;
Hyde, James S. ;
Kiviniemi, Vesa J. ;
Kotter, Rolf ;
Li, Shi-Jiang ;
Lin, Ching-Po ;
Lowe, Mark J. ;
Mackay, Clare ;
Madden, David J. ;
Madsen, Kristoffer H. ;
Margulies, Daniel S. ;
Mayberg, Helen S. ;
McMahon, Katie ;
Monk, Christopher S. ;
Mostofsky, Stewart H. ;
Nagel, Bonnie J. ;
Pekar, James J. ;
Peltier, Scott J. ;
Petersen, Steven E. ;
Riedl, Valentin ;
Rombouts, Serge A. R. B. ;
Rypma, Bart ;
Schlaggar, Bradley L. ;
Schmidt, Sein ;
Seidler, Rachael D. ;
Siegle, Greg J. ;
Sorg, Christian ;
Teng, Gao-Jun ;
Veijola, Juha ;
Villringer, Arno ;
Walter, Martin ;
Wang, Lihong ;
Weng, Xu-Chu ;
Whitfield-Gabrieli, Susan ;
Williamson, Peter ;
Windischberger, Christian .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (10) :4734-4739