Spatial normalization of array-CGH data

Cited by: 65
Authors
Neuvial, Pierre
Hupe, Philippe
Brito, Isabel
Liva, Stephane
Manie, Elodie
Brennetot, Caroline
Radvanyi, Francois
Aurias, Alain
Barillot, Emmanuel
Affiliations
[1] Inst Curie, Serv Bioinformat, F-75248 Paris 05, France
[2] Inst Curie, CNRS, UMR 144, F-75248 Paris 05, France
[3] Inst Curie, INSERM, U509, F-75248 Paris 05, France
DOI
10.1186/1471-2105-7-264
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010; 081704;
Abstract
Background: Array-based comparative genomic hybridization (array-CGH) is a recently developed technique for analyzing changes in DNA copy number. As in all microarray analyses, normalization is required to correct for experimental artifacts while preserving the true biological signal. We investigated various sources of systematic variation in array-CGH data and identified two distinct types of spatial effect of no biological relevance as the predominant experimental artifacts: continuous spatial gradients and local spatial bias. Local spatial bias affects a large proportion of arrays, and has not previously been considered in array-CGH experiments.

Results: We show that existing normalization techniques do not correct these spatial effects properly. We therefore developed an automatic method for the spatial normalization of array-CGH data. This method makes it possible to delineate and to eliminate and/or correct areas affected by spatial bias. It is based on the combination of a spatial segmentation algorithm called NEM (Neighborhood Expectation Maximization) and spatial trend estimation. We defined quality criteria for array-CGH data, demonstrating significant improvements in data quality with our method for three data sets coming from two different platforms (198, 175 and 26 BAC-arrays).

Conclusion: We have designed an automatic algorithm for the spatial normalization of BAC CGH-array data, preventing the misinterpretation of experimental artifacts as biologically relevant outliers in the genomic profile. This algorithm is implemented in the R package MANOR (MicroArray NORmalization), which is described at http://bioinfo.curie.fr/projects/manor and available from the Bioconductor site http://www.bioconductor.org. It can also be tested on the CAPweb bioinformatics platform at http://bioinfo.curie.fr/CAPweb.
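
To make the idea of a continuous spatial gradient concrete, the following is a minimal Python sketch of one generic way to estimate and subtract a smooth spatial trend from log-ratios laid out on the array grid. It is not the MANOR implementation and does not include the NEM segmentation step described in the abstract: a plain moving median is swapped in as the trend estimator, and the function names (`moving_median_trend`, `remove_spatial_trend`) and the `radius` parameter are hypothetical, chosen only for illustration. It also assumes that spots are not printed in genomic order, so that a smooth spatial pattern can reasonably be treated as artifact rather than biology.

```python
import numpy as np


def moving_median_trend(log_ratios, radius=2):
    """Estimate a smooth spatial trend (illustrative stand-in, not NEM).

    Each spot's trend value is the median of the log-ratios in a square
    window of half-width `radius` centred on it; windows are truncated
    at the array borders. NaN entries (missing spots) are ignored.
    """
    rows, cols = log_ratios.shape
    trend = np.empty((rows, cols), dtype=float)
    for r in range(rows):
        for c in range(cols):
            window = log_ratios[max(0, r - radius):r + radius + 1,
                                max(0, c - radius):c + radius + 1]
            trend[r, c] = np.nanmedian(window)
    return trend


def remove_spatial_trend(log_ratios, radius=2):
    """Subtract the estimated trend, keeping the array-wide median level."""
    trend = moving_median_trend(log_ratios, radius)
    return log_ratios - trend + np.nanmedian(log_ratios)
```

On a synthetic array built as a left-to-right gradient plus noise, calling `remove_spatial_trend(raw, radius=3)` should flatten the column-wise means. The window size trades off sensitivity to local bias against the risk of smoothing away genuine outliers, which is the kind of trade-off the paper addresses by segmenting biased areas before correcting them.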
Pages: 20