Genome Modeling System: A Knowledge Management Platform for Genomics

Cited by: 59
Authors
Griffith, Malachi [1 ,2 ]
Griffith, Obi L. [1 ,3 ]
Smith, Scott M. [1 ]
Ramu, Avinash [1 ]
Callaway, Matthew B. [1 ]
Brummett, Anthony M. [1 ]
Kiwala, Michael J. [1 ]
Coffman, Adam C. [1 ]
Regier, Allison A. [1 ]
Oberkfell, Ben J. [1 ]
Sanderson, Gabriel E. [1 ]
Mooney, Thomas P. [1 ]
Nutter, Nathaniel G. [1 ]
Belter, Edward A. [1 ]
Du, Feiyu [1 ]
Long, Robert L. [1 ]
Abbott, Travis E. [1 ]
Ferguson, Ian T. [1 ]
Morton, David L. [1 ]
Burnett, Mark M. [1 ]
Weible, James V. [1 ]
Peck, Joshua B. [1 ]
Dukes, Adam [1 ]
McMichael, Joshua F. [1 ]
Lolofie, Justin T. [1 ]
Derickson, Brian R. [1 ]
Hundal, Jasreet [1 ]
Skidmore, Zachary L. [1 ]
Ainscough, Benjamin J. [1 ]
Dees, Nathan D. [1 ]
Schierding, William S. [1 ]
Kandoth, Cyriac [1 ]
Kim, Kyung H. [1 ]
Lu, Charles [1 ]
Harris, Christopher C. [1 ]
Maher, Nicole [3 ]
Maher, Christopher A. [1 ,3 ,4 ]
Magrini, Vincent J. [1 ,2 ]
Abbott, Benjamin S. [1 ]
Chen, Ken [1 ]
Clark, Eric [1 ]
Das, Indraniel [1 ]
Fan, Xian [1 ]
Hawkins, Amy E. [1 ]
Hepler, Todd G. [1 ]
Wylie, Todd N. [1 ]
Leonard, Shawn M. [1 ]
Schroeder, William E. [1 ]
Shi, Xiaoqi [1 ]
Carmichael, Lynn K. [1 ]
Affiliations
[1] Washington Univ, Genome Inst, St Louis, MO 63130 USA
[2] Washington Univ, Sch Med, Dept Genet, St Louis, MO 63110 USA
[3] Washington Univ, Sch Med, Dept Med, St Louis, MO 63110 USA
[4] Washington Univ, Sch Med, Siteman Canc Ctr, St Louis, MO USA
[5] Washington Univ, Sch Med, Dept Mol Microbiol, St Louis, MO 63110 USA
Keywords
EXPRESSION ANALYSIS; CANCER; MUTATIONS; SEQUENCE; GENE; ALIGNMENT; FORMAT; TOPHAT; TUMOR; TOOL;
DOI
10.1371/journal.pcbi.1004274
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification codes
071010 ; 081704 ;
Abstract
In this work, we present the Genome Modeling System (GMS), an analysis information management system capable of executing automated genome analysis pipelines at a massive scale. The GMS framework provides detailed tracking of samples and data coupled with reliable and repeatable analysis pipelines. The GMS also serves as a platform for bioinformatics development, allowing a large team to collaborate on data analysis, or an individual researcher to leverage the work of others effectively within its data management system. Rather than separating ad-hoc analysis from rigorous, reproducible pipelines, the GMS promotes systematic integration between the two. As a demonstration of the GMS, we performed an integrated analysis of whole genome, exome and transcriptome sequencing data from a breast cancer cell line (HCC1395) and matched lymphoblastoid line (HCC1395BL). These data are available for users to test the software, complete tutorials and develop novel GMS pipeline configurations.
Pages: 21