Integrating and interrogating diverse Brassica data within an EnsEMBL structured database

Cited by: 4
Authors
Love, Christopher [1, 2]
Logan, Erica [1, 2]
Erwin, Tim [1, 2]
Kaur, Jatinder [1 ]
Lim, Geraldine A. C. [1 ]
Hopkins, Clare [1 ]
Batley, Jacqueline [1 ]
James, Nick [3 ]
May, Sean [3 ]
Spangenberg, German [1, 2]
Edwards, David [1, 2]
Affiliations
[1] La Trobe Univ, Dept Primary Ind, Plant Biotechnol Ctr, Bundoora, Vic 3086, Australia
[2] La Trobe Univ, Victorian Bioinformat Consortium, Plant Biotechnol Ctr, Bundoora, Vic 3086, Australia
[3] Univ Nottingham, Nottingham Arabidopsis Stock Ctr, Plant Sci Div, Sch Biosci, Loughborough LE12 5RD, Leics, England
Source
PROCEEDINGS OF THE JOINT MEETING OF THE FOURTEENTH CRUCIFER GENETICS WORKSHOP AND FOURTH ISHS SYMPOSIUM ON BRASSICAS | 2006 / Issue 706
Keywords
bioinformatics; SSR; SNP; EnsEMBL; microarray; database;
DOI
10.17660/ActaHortic.2006.706.7
Chinese Library Classification (CLC) number
S6 [Horticulture];
Discipline classification code
0902;
Abstract
Brassica researchers throughout the world are producing a vast quantity of data. Genomic data include gene or Expressed Sequence Tag (EST) sequences and genomic sequences from bacterial artificial chromosomes and whole genome shotgun approaches. Associated gene expression or transcriptome data are being produced using various formats of microarray, Serial Analysis of Gene Expression (SAGE) and Massively Parallel Signature Sequencing (MPSS). Molecular marker data such as Simple Sequence Repeats (SSRs) and Single Nucleotide Polymorphisms (SNPs) are providing insights into genetic structure and genetic diversity as well as inherited traits within Brassica species. Phenotypic data are also increasing in complexity through the characterisation of broad, diverse germplasm collections and the development of advanced techniques in proteomics and metabolomics. Bringing this diverse set of data together in an integrated bioinformatics platform that permits interrogation across these broad fields is a significant challenge. The most advanced genome database structure currently available uses the EnsEMBL format. EnsEMBL permits broad data integration, comparative analysis between related organisms and efficient data interrogation. We have established a Brassica-centric EnsEMBL database founded on the current Arabidopsis thaliana EnsEMBL database, incorporating tracks for Brassica genes, genomic sequences and molecular markers. This database is publicly available to the Brassica research community and can be used as the foundation of a Brassica-based EnsEMBL database on completion of the B. rapa genome sequencing under the Multinational Brassica Genome Project.
Pages: 77 / +
Page count: 4