New computational tools for Brassica genome research

被引:14
作者
Love, CG
Batley, J
Lim, G
Robinson, AJR
Savage, D
Singh, D
Spangenberg, GC
Edwards, D [1 ]
机构
[1] La Trobe Univ, Ctr Plant Biotechnol, Primary Ind Res Victoria, Dept Primary Ind, Bundoora, Vic 3086, Australia
[2] La Trobe Univ, Ctr Plant Biotechnol, Victorian Bioinformat Consortium, Bundoora, Vic 3086, Australia
来源
COMPARATIVE AND FUNCTIONAL GENOMICS | 2004年 / 5卷 / 03期
关键词
ASTRA; EnsEMBL; molecular marker; gene ontology (GO); bacterial artificial chromosome (BAC); genome sequencing;
D O I
10.1002/cfg.394
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
With the increasing quantities of Brassica genomic data being entered into the public domain and in preparation for the complete Brassica genome sequencing effort, there is a growing requirement for the structuring and detailed bioinformatic analysis of Brassica genomic information within a user-friendly database. At the Plant Biotechnology Centre, Melbourne, Australia, we have developed a series of tools and computational pipelines to assist in the processing and structuring of genomic data, to aid its application to agricultural biotechnology research. These tools include a sequence database, ASTRA, a sequence processing pipeline incorporating annotation against GenBank, SwissProt and Arabidopsis Gene Ontology (GO) data and tools for molecular marker discovery and comparative genome analysis. All sequences are mined for simple sequence repeat (SSR) molecular markers using 'SSR primer' and mapped onto the complete Arabidopsis thaliana genome by sequence comparison. The database may be queried using a text-based search of sequence annotation or GO terms, BLAST comparison against resident sequences, or by the position of candidate orthologues within the Arabidopsis genome. Tools have also been developed and applied to the discovery of single nucleotide polymorphism (SNP) molecular markers and the in silico mapping of Brassica BAC end sequences onto the Arabidopsis genome. Planned extensions to this resource include the integration of gene expression data and the development of an EnsEMBL-based genome viewer. Copyright (C) 2004 John Wiley Sons, Ltd.
引用
收藏
页码:276 / 280
页数:5
相关论文
共 13 条
[1]  
Abajian C., 1994, Sputnik
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[4]   Redundancy based detection of sequence polymorphisms in expressed sequence tag data using autoSNP [J].
Barker, G ;
Batley, J ;
O'Sullivan, H ;
Edwards, KJ ;
Edwards, D .
BIOINFORMATICS, 2003, 19 (03) :421-422
[5]   Mining for single nucleotide polymorphisms and insertions/deletions in maize expressed sequence tag data [J].
Batley, J ;
Barker, G ;
O'Sullivan, H ;
Edwards, KJ ;
Edwards, D .
PLANT PHYSIOLOGY, 2003, 132 (01) :84-91
[6]   Base-calling of automated sequencer traces using phred.: I.: Accuracy assessment [J].
Ewing, B ;
Hillier, L ;
Wendl, MC ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :175-185
[7]   The Arabidopsis Information Resource (TAIR): a comprehensive database and web-based information retrieval, analysis, and visualization system for a model plant [J].
Huala, E ;
Dickerman, AW ;
Garcia-Hernandez, M ;
Weems, D ;
Reiser, L ;
LaFond, F ;
Hanley, D ;
Kiphart, D ;
Zhuang, MZ ;
Huang, W ;
Mueller, LA ;
Bhattacharyya, D ;
Bhaya, D ;
Sobral, BW ;
Beavis, W ;
Meinke, DW ;
Town, CD ;
Somerville, C ;
Rhee, SY .
NUCLEIC ACIDS RESEARCH, 2001, 29 (01) :102-105
[8]  
JAMES N, 2004, PLANT AN GEN 12 C, P999
[9]   Analysis of the genome sequence of the flowering plant Arabidopsis thaliana [J].
Kaul, S ;
Koo, HL ;
Jenkins, J ;
Rizzo, M ;
Rooney, T ;
Tallon, LJ ;
Feldblyum, T ;
Nierman, W ;
Benito, MI ;
Lin, XY ;
Town, CD ;
Venter, JC ;
Fraser, CM ;
Tabata, S ;
Nakamura, Y ;
Kaneko, T ;
Sato, S ;
Asamizu, E ;
Kato, T ;
Kotani, H ;
Sasamoto, S ;
Ecker, JR ;
Theologis, A ;
Federspiel, NA ;
Palm, CJ ;
Osborne, BI ;
Shinn, P ;
Conway, AB ;
Vysotskaia, VS ;
Dewar, K ;
Conn, L ;
Lenz, CA ;
Kim, CJ ;
Hansen, NF ;
Liu, SX ;
Buehler, E ;
Altafi, H ;
Sakano, H ;
Dunn, P ;
Lam, B ;
Pham, PK ;
Chao, Q ;
Nguyen, M ;
Yu, GX ;
Chen, HM ;
Southwick, A ;
Lee, JM ;
Miranda, M ;
Toriumi, MJ ;
Davis, RW .
NATURE, 2000, 408 (6814) :796-815
[10]  
Lewis Christopher T., 2003, Journal of Plant Biotechnology, V5, P197