User-friendly solutions for microarray quality control and pre-processing on ArrayAnalysis.org

被引:94
作者
Eijssen, Lars M. T. [1 ]
Jaillard, Magali [1 ]
Adriaens, Michiel E. [1 ,2 ]
Gaj, Stan [1 ,3 ]
de Groot, Philip J. [4 ]
Muller, Michael [4 ]
Evelo, Chris T. [1 ]
机构
[1] Maastricht Univ, Dept Bioinformat BiGCaT, NL-6200 MD Maastricht, Netherlands
[2] Univ Amsterdam, Acad Med Ctr, Heart Failure Res Ctr, Dept Expt Cardiol, NL-1100 DD Amsterdam, Netherlands
[3] Maastricht Univ, Dept Toxicogen, NL-6200 MD Maastricht, Netherlands
[4] Wageningen UR, Div Human Nutr, Nutr Metab & Genom Grp, NL-6700 AA Wageningen, Netherlands
关键词
GENECHIP DATA; BIOCONDUCTOR; DATABASE; INTERFACE; PACKAGE; BIOMART;
D O I
10.1093/nar/gkt293
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Quality control (QC) is crucial for any scientific method producing data. Applying adequate QC introduces new challenges in the genomics field where large amounts of data are produced with complex technologies. For DNA microarrays, specific algorithms for QC and pre-processing including normalization have been developed by the scientific community, especially for expression chips of the Affymetrix platform. Many of these have been implemented in the statistical scripting language R and are available from the Bioconductor repository. However, application is hampered by lack of integrative tools that can be used by users of any experience level. To fill this gap, we developed a freely available tool for QC and pre-processing of Affymetrix gene expression results, extending, integrating and harmonizing functionality of Bioconductor packages. The tool can be easily accessed through a wizard-like web portal at http://www.arrayanalysis.org or downloaded for local use in R. The portal provides extensive documentation, including user guides, interpretation help with real output illustrations and detailed technical documentation. It assists newcomers to the field in performing state-of-the-art QC and pre-processing while offering data analysts an integral open-source package. Providing the scientific community with this easily accessible tool will allow improving data quality and reuse and adoption of standards.
引用
收藏
页码:W71 / W76
页数:6
相关论文
共 31 条
[1]   The NuGO proof of principle study package: a collaborative research effort of the European Nutrigenomics Organisation [J].
Baccini, Michela ;
Bachmaier, Eva-Maria ;
Biggeri, Annibale ;
Boekschoten, Mark V. ;
Bouwman, Freek G. ;
Brennan, Lorraine ;
Caesar, Robert ;
Cinti, Saverio ;
Coort, Susan L. ;
Crosley, Katie ;
Daniel, Hannelore ;
Drevon, Christian A. ;
Duthie, Susan ;
Eijssen, Lars ;
Elliott, Ruan M. ;
van Erk, Marjan ;
Evelo, Chris ;
Gibney, Mike ;
Heim, Carolin ;
Horgan, Graham W. ;
Johnson, Ian T. ;
Kelder, Thomas ;
Kleemann, Robert ;
Kooistra, Teake ;
van Iersel, Martijn P. ;
Mariman, Edwin C. ;
Mayer, Claus ;
McLoughlin, Gerard ;
Mueller, Michael ;
Mulholland, Francis ;
van Ommen, Ben ;
Polley, Abigael C. ;
Pujos-Guillot, Estelle ;
Rubio-Aliaga, Isabel ;
Roche, Helen M. ;
de Roos, Baukje ;
Sailer, Manuela ;
Tonini, Giulia ;
Williams, Lynda M. ;
de Wit, Nicole .
GENES AND NUTRITION, 2008, 3 (3-4) :147-151
[2]   NCBI GEO: archive for functional genomics data sets-update [J].
Barrett, Tanya ;
Wilhite, Stephen E. ;
Ledoux, Pierre ;
Evangelista, Carlos ;
Kim, Irene F. ;
Tomashevsky, Maxim ;
Marshall, Kimberly A. ;
Phillippy, Katherine H. ;
Sherman, Patti M. ;
Holko, Michelle ;
Yefanov, Andrey ;
Lee, Hyeseung ;
Zhang, Naigong ;
Robertson, Cynthia L. ;
Serova, Nadezhda ;
Davis, Sean ;
Soboleva, Alexandra .
NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) :D991-D995
[3]   Minimum information about a microarray experiment (MIAME) - toward standards for microarray data [J].
Brazma, A ;
Hingamp, P ;
Quackenbush, J ;
Sherlock, G ;
Spellman, P ;
Stoeckert, C ;
Aach, J ;
Ansorge, W ;
Ball, CA ;
Causton, HC ;
Gaasterland, T ;
Glenisson, P ;
Holstege, FCP ;
Kim, IF ;
Markowitz, V ;
Matese, JC ;
Parkinson, H ;
Robinson, A ;
Sarkans, U ;
Schulze-Kremer, S ;
Stewart, J ;
Taylor, R ;
Vilo, J ;
Vingron, M .
NATURE GENETICS, 2001, 29 (04) :365-371
[4]   Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data [J].
Dai, MH ;
Wang, PL ;
Boyd, AD ;
Kostov, G ;
Athey, B ;
Jones, EG ;
Bunney, WE ;
Myers, RM ;
Speed, TP ;
Akil, H ;
Watson, SJ ;
Meng, F .
NUCLEIC ACIDS RESEARCH, 2005, 33 (20) :e175.1-e175.9
[5]   BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis [J].
Durinck, S ;
Moreau, Y ;
Kasprzyk, A ;
Davis, S ;
De Moor, B ;
Brazma, A ;
Huber, W .
BIOINFORMATICS, 2005, 21 (16) :3439-3440
[6]  
Fortea Jose Osorio Y., 2008, Journal of Bioinformatics and Computational Biology, V6, P317, DOI 10.1142/S0219720008003400
[7]   affy -: analysis of Affymetrix GeneChip data at the probe level [J].
Gautier, L ;
Cope, L ;
Bolstad, BM ;
Irizarry, RA .
BIOINFORMATICS, 2004, 20 (03) :307-315
[8]   Bioconductor: open software development for computational biology and bioinformatics [J].
Gentleman, RC ;
Carey, VJ ;
Bates, DM ;
Bolstad, B ;
Dettling, M ;
Dudoit, S ;
Ellis, B ;
Gautier, L ;
Ge, YC ;
Gentry, J ;
Hornik, K ;
Hothorn, T ;
Huber, W ;
Iacus, S ;
Irizarry, R ;
Leisch, F ;
Li, C ;
Maechler, M ;
Rossini, AJ ;
Sawitzki, G ;
Smith, C ;
Smyth, G ;
Tierney, L ;
Yang, JYH ;
Zhang, JH .
GENOME BIOLOGY, 2004, 5 (10)
[9]   Genevestigator. Facilitating Web-based gene-expression analysis [J].
Grennan, Aleel K. .
PLANT PHYSIOLOGY, 2006, 141 (04) :1164-1166
[10]   BioMart Central Portal-unified access to biological data [J].
Haider, Syed ;
Ballester, Benoit ;
Smedley, Damian ;
Zhang, Junjun ;
Rice, Peter ;
Kasprzyk, Arek .
NUCLEIC ACIDS RESEARCH, 2009, 37 :W23-W27