chip artifact CORRECTion (caCORRECT): A bioinformatics system for quality assurance of genomics and proteomics array data

Cited by: 23
Authors
Stokes, Todd H.
Moffitt, Richard A.
Phan, John H.
Wang, May D.
Affiliations
[1] Georgia Inst Technol, Dept Elect & Comp Engn, Dept Biomed Engn, Atlanta, GA 30332 USA
[2] Georgia Inst Technol, Winship Canc Inst, Atlanta, GA 30332 USA
[3] Emory Univ, Atlanta, GA 30332 USA
Keywords
high throughput data analysis; chip artifact correction; quality assurance; biomarker; normalization; heatmap visualization; bioinformatics; systems biology
DOI
10.1007/s10439-007-9313-y
Chinese Library Classification number
R318 [Biomedical Engineering];
Discipline classification code
0831;
Abstract
Quality assurance of high-throughput "-omics" data is a major concern for biomedical discovery and translational medicine, and is considered a top priority in bioinformatics and systems biology. Here, we report a web-based bioinformatics tool called caCORRECT for chip artifact detection, analysis, and CORRECTion, which removes the systematic artifactual noise commonly observed in microarray gene expression data. Despite the development of major databases such as GEO, ArrayExpress, caArray, and the SMD to manage and distribute microarray data to the public, reproducibility has been questioned in many cases, including high-profile papers and datasets. Based on both archived and synthetic data, we have designed caCORRECT to have several advanced features: (1) to uncover significant, correctable artifacts that affect reproducibility of experiments; (2) to improve the integrity and quality of public archives by removing artifacts; (3) to provide a universal quality score to aid users in their selection of suitable microarray data; and (4) to improve the true-positive rate of biomarker selection verified by test data. These features are expected to improve the reproducibility of microarray studies. caCORRECT is freely available at: http://caCORRECT.bme.gatech.edu.
Pages: 1068-1080
Page count: 13