GEOquery: a bridge between the gene expression omnibus (GEO) and BioConductor

被引:1935
作者
Sean, Davis [1 ]
Meltzer, Paul S. [1 ]
机构
[1] Natl Canc Inst, Natl Inst Hlth, Genet Branch, Bethesda, MD 20892 USA
关键词
D O I
10.1093/bioinformatics/btm254
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Microarray technology has become a standard molecular biology tool. Experimental data have been generated on a huge number of organisms, tissue types, treatment conditions and disease states. The Gene Expression Omnibus ( Barrett et al., 2005), developed by the National Center for Bioinformatics (NCBI) at the National Institutes of Health is a repository of nearly 140 000 gene expression experiments. The BioConductor project (Gentleman et al., 2004) is an open-source and open-development software project built in the R statistical programming environment (R Development core Team, 2005) for the analysis and comprehension of genomic data. The tools contained in the BioConductor project represent many state-of-the-art methods for the analysis of microarray and genemics data. We have developed a software tool that allows access to the wealth of information within GEO directly from BioConductor, eliminating many the formatting and parsing problems that have made such analyses labor-intensive in the past. The software, called GEOquery, effectively establishes a bridge between GEO and BioConductor. Easy access to GEO data from BioConductor will likely lead to new analyses of GEO data using novel and rigorous statistical and bioinformatic tools. Facilitating analyses and meta-analyses of microarray data will increase the efficiency with which biologically important conclusions can be drawn from published genomic data.
引用
收藏
页码:1846 / 1847
页数:2
相关论文
共 4 条
  • [1] Barrett T, 2005, NUCLEIC ACIDS RES, V33, pD562
  • [2] Bioconductor: open software development for computational biology and bioinformatics
    Gentleman, RC
    Carey, VJ
    Bates, DM
    Bolstad, B
    Dettling, M
    Dudoit, S
    Ellis, B
    Gautier, L
    Ge, YC
    Gentry, J
    Hornik, K
    Hothorn, T
    Huber, W
    Iacus, S
    Irizarry, R
    Leisch, F
    Li, C
    Maechler, M
    Rossini, AJ
    Sawitzki, G
    Smith, C
    Smyth, G
    Tierney, L
    Yang, JYH
    Zhang, JH
    [J]. GENOME BIOLOGY, 2004, 5 (10)
  • [3] R Core Team, 2016, R LANG ENV STAT COMP
  • [4] Smyth GK, 2005, STAT BIOL HEALTH, P397, DOI 10.1007/0-387-29362-0_23