The Sequence Analysis and Management System - SAMS-2.0: Data management and sequence analysis adapted to changing requirements from traditional sanger sequencing to ultrafast sequencing technologies

被引:23
作者
Bekel, Thomas [1 ]
Henckel, Kolja [1 ]
Kuester, Helge [2 ]
Meyer, Folker [3 ]
Runte, Virginie Mittard
Neuweger, Heiko [1 ]
Paarmann, Daniel [3 ]
Rupp, Oliver
Zakrzewski, Martha
Puehler, Alfred [4 ]
Stoye, Jens [5 ]
Goesmann, Alexander
机构
[1] Univ Bielefeld, Ctr Biotechnol CeBiTec, Int NRW Grad Sch Bioinformat & Genome Res, D-33594 Bielefeld, Germany
[2] Leibniz Univ Hannover, Inst Plant Genet, D-30419 Hannover, Germany
[3] Argonne Natl Lab, Argonne, IL 60439 USA
[4] Univ Bielefeld, Lehrstuhl Genet, D-33594 Bielefeld, Germany
[5] Univ Bielefeld, Tech Fak, AG Genominformat, D-33594 Bielefeld, Germany
关键词
Whole genome shotgun sequencing; DNA sequence quality control; cDNA sequencing; EST clustering; Ultrafast sequencing; COMPLETE GENOME SEQUENCE; TIGR GENE INDEXES; ARBUSCULAR MYCORRHIZA; EST; BACTERIUM; REVEALS; TOOL; RECONSTRUCTION; METAGENOME; INSIGHTS;
D O I
10.1016/j.jbiotec.2009.01.006
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
DNA sequencing plays a more and more important role in various fields of genetics. This includes sequencing of whole genomes, libraries of cDNA clones and probes of metagenome communities. The applied sequencing technologies evolve permanently. Willi the emergence of ultrafast sequencing technologies, a new era of DNA sequencing has recently started. Concurrently, the needs for adapted bioinformatics tools arise. Since the ability to process current datasets efficiently is essential for modern genetics, a modular bioinformatics platform providing extensive sequence analysis methods, is designated to achieve well the constantly growing requirements. The Sequence Analysis and Management System (SAMS) is a bioinformatics software platform with a database backend designed to Support the computational analysis of (1) whole genome shotgun (WGS) bacterial genome sequencing, (2) cDNA sequencing by reading expressed sequence tags (ESTs) as well as (3) sequence data obtained by Ultrafast sequencing. It provides extensive bioinformatics analysis of sequenced single reads. sequencing libraries and fragments of arbitrary DNA sequences such as assembled contigs of metagenome reads for instance. The system has been implemented to cope with several thousands of sequences, efficiently processing them and storing the results for further analysis. With the project set up, SAMS automatically recognizes the data type. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:3 / 12
页数:10
相关论文
共 47 条
[21]   CAP3: A DNA sequence assembly program [J].
Huang, XQ ;
Madan, A .
GENOME RESEARCH, 1999, 9 (09) :868-877
[22]   Exploring root symbiotic programs in the model legume Medicago truncatula using EST analysis [J].
Journet, EP ;
van Tuinen, D ;
Gouzy, J ;
Crespeau, H ;
Carreau, V ;
Farmer, MJ ;
Niebel, A ;
Schiex, T ;
Jaillon, O ;
Chatagnier, O ;
Godiard, L ;
Micheli, F ;
Kahn, D ;
Gianinazzi-Pearson, V ;
Gamas, P .
NUCLEIC ACIDS RESEARCH, 2002, 30 (24) :5579-5592
[23]   Whole genome shotgun sequencing guided-by bioinformatics pipelines -: An optimized approach for an established technique [J].
Kaiser, O ;
Bartels, D ;
Bekel, T ;
Goesmann, A ;
Kespohl, S ;
Pühler, A ;
Meyer, F .
JOURNAL OF BIOTECHNOLOGY, 2003, 106 (2-3) :121-133
[24]   From genomics to chemical genomics: new developments in KEGG [J].
Kanehisa, Minoru ;
Goto, Susumu ;
Hattori, Masahiro ;
Aoki-Kinoshita, Kiyoko F. ;
Itoh, Masumi ;
Kawashima, Shuichi ;
Katayama, Toshiaki ;
Araki, Michihiro ;
Hirakawa, Mika .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D354-D357
[25]   Complete genome of the mutualistic, N2-fixing grass endophyte Azoarcus sp strain BH72 [J].
Krause, Andrea ;
Ramakumar, Adarsh ;
Bartels, Daniela ;
Battistoni, Federico ;
Bekel, Thomas ;
Boch, Jens ;
Boehm, Melanie ;
Friedrich, Frauke ;
Hurek, Thomas ;
Krause, Lutz ;
Linke, Burkhard ;
McHardy, Alice C. ;
Sarkar, Abhijit ;
Schneiker, Susanne ;
Syed, Arshad Ali ;
Thauer, Rudolf ;
Vorhoelter, Frank-Joerg ;
Weidner, Stefan ;
Puehler, Alfred ;
Reinhold-Hurek, Barbara ;
Kaiser, Olaf ;
Goesmann, Alexander .
NATURE BIOTECHNOLOGY, 2006, 24 (11) :1385-1391
[26]   Development of bioinformatic tools to support EST-sequencing, in silico- and microarray-based transcriptome profiling in mycorrhizal symbioses [J].
Kuester, Helge ;
Becker, Anke ;
Firnhaber, Christian ;
Hohnjec, Natalija ;
Manthey, Katja ;
Perlick, Andreas M. ;
Bekel, Thomas ;
Dondrup, Michael ;
Henckel, Koja ;
Goesmann, Alexander ;
Meyer, Folker ;
Wipf, Daniel ;
Requena, Natalia ;
Hildebrandt, Ulrich ;
Hampp, Ruediger ;
Nehls, Uwe. ;
Krajinski, Franziska ;
Franken, Philipp ;
Puehler, Alfred .
PHYTOCHEMISTRY, 2007, 68 (01) :19-32
[27]  
LANDER E S, 1988, Genomics, V2, P231
[28]   ESTpass: a web-based server for processing and annotating expressed sequence tag (EST) sequences [J].
Lee, Byungwook ;
Hong, Taehui ;
Byun, Sang Jin ;
Woo, Taeha ;
Choi, Yoon Jeong .
NUCLEIC ACIDS RESEARCH, 2007, 35 :W159-W162
[29]   An optimized protocol for analysis of EST sequences [J].
Liang, F ;
Holt, I ;
Pertea, G ;
Karamycheva, S ;
Salzberg, SL ;
Quackenbush, J .
NUCLEIC ACIDS RESEARCH, 2000, 28 (18) :3657-3665
[30]   Host genes involved in nodulation preference in common bean (Phaseolus vulgaris)-Rhizobium etli symbiosis revealed -: by suppressive subtractive hybridization [J].
Meschini, Eitel Peltzer ;
Blanco, Flavio Antonio ;
Zanetti, Maria Eugenia ;
Beker, Maria Pia ;
Kuester, Helge ;
Pueher, Alfred ;
Aguilar, O. Mario .
MOLECULAR PLANT-MICROBE INTERACTIONS, 2008, 21 (04) :459-468