Ergatis: a web interface and scalable software system for bioinformatics workflows

Cited by: 74
Authors
Orvis, Joshua [1]
Crabtree, Jonathan [1]
Galens, Kevin [1]
Gussman, Aaron [1]
Inman, Jason M. [2]
Lee, Eduardo [3]
Nampally, Sreenath [2]
Riley, David [1]
Sundaram, Jaideep P. [2, 4]
Felix, Victor [1]
Whitty, Brett [5]
Mahurkar, Anup [1]
Wortman, Jennifer [1]
White, Owen [1]
Angiuoli, Samuel V. [1, 6]
Affiliations
[1] Univ Maryland, Sch Med, Inst Genome Sci, Baltimore, MD 21201 USA
[2] J Craig Venter Inst, Rockville, MD USA
[3] Univ Calif Berkeley, Lawrence Berkeley Lab, Berkeley, CA 94720 USA
[4] Georgetown Univ, Dept Biol, Computat Genom Lab, Washington, DC 20057 USA
[5] Michigan State Univ, Dept Plant Biol, E Lansing, MI 48824 USA
[6] Univ Maryland, Ctr Bioinformat & Computat Biol, College Pk, MD 20742 USA
Funding
US National Institutes of Health;
Keywords
COMPARATIVE GENOMICS; SEQUENCE; TOOL; UNIFICATION; FRAMEWORK; RESOURCE; ONTOLOGY;
DOI
10.1093/bioinformatics/btq167
Chinese Library Classification number
Q5 [Biochemistry];
Subject classification codes
071010; 081704;
Abstract
Motivation: The growth of sequence data has been accompanied by an increasing need to analyze data on distributed computer clusters. The use of these systems for routine analysis requires scalable and robust software for data management of large datasets. Software is also needed to simplify data management and make large-scale bioinformatics analysis accessible and reproducible to a wide class of target users.
Results: We have developed a workflow management system named Ergatis that enables users to build, execute and monitor pipelines for computational analysis of genomics data. Ergatis contains preconfigured components and template pipelines for a number of common bioinformatics tasks such as prokaryotic genome annotation and genome comparisons. Outputs from many of these components can be loaded into a Chado relational database. Ergatis was designed to be accessible to a broad class of users and provides a user friendly, web-based interface. Ergatis supports high-throughput batch processing on distributed compute clusters and has been used for data management in a number of genome annotation and comparative genomics projects.
Pages: 1488-1492
Page count: 5