The PEPR GeneChip data warehouse, and implementation of a dynamic time series query tool (SGQT) with graphical interface

被引:33
作者
Chen, J
Zhao, P
Massaro, D
Clerch, LB
Almon, RR
DuBois, DC
Jusko, WJ
Hoffman, EP
机构
[1] Childrens Natl Med Ctr, Ctr Med Genet, Washington, DC 20010 USA
[2] Georgetown Univ, Sch Med, Dept Pediat, Washington, DC 20007 USA
[3] Georgetown Univ, Sch Med, Dept Med, Washington, DC 20007 USA
[4] SUNY Buffalo, Dept Biol Sci, Buffalo, NY 14260 USA
[5] SUNY Buffalo, Dept Pharmaceut Sci, Buffalo, NY 14260 USA
关键词
D O I
10.1093/nar/gkh003
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Publicly accessible DNA databases (genome browsers) are rapidly accelerating post-genomic research (see http://www.genome.ucsc.edu/), with integrated genomic DNA, gene structure, EST/splicing and cross-species ortholog data. DNA databases have relatively low dimensionality; the genome is a linear code that anchors all associated data. In contrast, RNA expression and protein databases need to be able to handle very high dimensional data, with time, tissue, cell type and genes, as interrelated variables. The high dimensionality of microarray expression profile data, and the lack of a standard experimental platform have complicated the development of web-accessible databases and analytical tools. We have designed and implemented a public resource of expression profile data containing 1024 human, mouse and rat Affymetrix GeneChip expression profiles, generated in the same laboratory, and subject to the same quality and procedural controls (Public Expression Profiling Resource; PEPR). Our Oracle-based PEPR data warehouse includes a novel time series query analysis tool (SGOT), enabling dynamic generation of graphs and spreadsheets showing the action of any transcript of interest over time. In this report, we demonstrate the utility of this tool using a 27 time point, in vivo muscle regeneration series. This data warehouse and associated analysis tools provides access to multidimensional microarray data through web-based interfaces, both for download of all types of raw data for independent analysis, and also for straightforward gene-based queries. Planned implementations of PEPR will include web-based remote entry of projects adhering to quality control and standard operating procedure (QC/SOP) criteria, and automated output of alternative probe set algorithms for each project (see http://microarray.cnmcresearch.org/pgadatatable.asp).
引用
收藏
页码:D578 / D581
页数:4
相关论文
共 9 条
  • [1] ALMON RR, 2003, FUNCT INTEGR GE 0820
  • [2] Gene profiling in spinal cord injury shows role of cell cycle neuronal death
    Di Giovanni, S
    Knoblach, SM
    Brandoli, C
    Aden, SA
    Hoffman, EP
    Faden, AI
    [J]. ANNALS OF NEUROLOGY, 2003, 53 (04) : 454 - 468
  • [3] Gene Expression Omnibus: NCBI gene expression and hybridization array data repository
    Edgar, R
    Domrachev, M
    Lash, AE
    [J]. NUCLEIC ACIDS RESEARCH, 2002, 30 (01) : 207 - 210
  • [4] Skeletal muscle dictates the fibrinolytic state after exercise training in overweight men with characteristics of metabolic syndrome
    Hittel, DS
    Kraus, WE
    Hoffman, EP
    [J]. JOURNAL OF PHYSIOLOGY-LONDON, 2003, 548 (02): : 401 - 410
  • [5] HOFFMAN EP, IN PRESS LUNG REMODE
  • [6] Modeling of corticosteroid pharmacogenomics in rat liver using gene microarrays
    Jin, JY
    Almon, RR
    Dubois, DC
    Jusko, WJ
    [J]. JOURNAL OF PHARMACOLOGY AND EXPERIMENTAL THERAPEUTICS, 2003, 307 (01) : 93 - 109
  • [7] SEO J, 2003, IEEE ICME, V3, P461
  • [8] Slug is a novel downstream target of MyoD - Temporal profiling in muscle regeneration
    Zhao, P
    Iezzi, S
    Carver, E
    Dressman, D
    Girdley, T
    Sartorelli, V
    Hoffman, EP
    [J]. JOURNAL OF BIOLOGICAL CHEMISTRY, 2002, 277 (33) : 30091 - 30101
  • [9] ZHAO P, IN PRESS C R BIOL