Consistent annotation of gene expression arrays

被引:15
作者
Ballester, Benoit [1 ]
Johnson, Nathan [1 ]
Proctor, Glenn [1 ]
Flicek, Paul [1 ]
机构
[1] European Bioinformat Inst EMBL EBI, Cambridge CB10 1SD, England
来源
BMC GENOMICS | 2010年 / 11卷
基金
英国惠康基金;
关键词
PROBE SETS; GENOME; ENSEMBL; IDENTIFICATION; NORMALIZATION; BIOCONDUCTOR; REDEFINITION; INFORMATION; GENERATION;
D O I
10.1186/1471-2164-11-294
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Gene expression arrays are valuable and widely used tools for biomedical research. Today's commercial arrays attempt to measure the expression level of all of the genes in the genome. Effectively translating the results from the microarray into a biological interpretation requires an accurate mapping between the probesets on the array and the genes that they are targeting. Although major array manufacturers provide annotations of their gene expression arrays, the methods used by various manufacturers are different and the annotations are difficult to keep up to date in the rapidly changing world of biological sequence databases. Results: We have created a consistent microarray annotation protocol applicable to all of the major array manufacturers. We constantly keep our annotations updated with the latest Ensembl Gene predictions, and thus cross-referenced with a large number of external biomedical sequence database identifiers. We show that these annotations are accurate and address in detail reasons for the minority of probesets that cannot be annotated. Annotations are publicly accessible through the Ensembl Genome Browser and programmatically through the Ensembl Application Programming Interface. They are also seamlessly integrated into the BioMart data-mining tool and the biomaRt package of BioConductor. Conclusions: Consistent, accurate and updated gene expression array annotations remain critical for biological research. Our annotations facilitate accurate biological interpretation of gene expression profiles.
引用
收藏
页数:14
相关论文
共 36 条
[1]   Whole-genome re-sequencing [J].
Bentley, David R. .
CURRENT OPINION IN GENETICS & DEVELOPMENT, 2006, 16 (06) :545-552
[2]   Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project [J].
Birney, Ewan ;
Stamatoyannopoulos, John A. ;
Dutta, Anindya ;
Guigo, Roderic ;
Gingeras, Thomas R. ;
Margulies, Elliott H. ;
Weng, Zhiping ;
Snyder, Michael ;
Dermitzakis, Emmanouil T. ;
Stamatoyannopoulos, John A. ;
Thurman, Robert E. ;
Kuehn, Michael S. ;
Taylor, Christopher M. ;
Neph, Shane ;
Koch, Christoph M. ;
Asthana, Saurabh ;
Malhotra, Ankit ;
Adzhubei, Ivan ;
Greenbaum, Jason A. ;
Andrews, Robert M. ;
Flicek, Paul ;
Boyle, Patrick J. ;
Cao, Hua ;
Carter, Nigel P. ;
Clelland, Gayle K. ;
Davis, Sean ;
Day, Nathan ;
Dhami, Pawandeep ;
Dillon, Shane C. ;
Dorschner, Michael O. ;
Fiegler, Heike ;
Giresi, Paul G. ;
Goldy, Jeff ;
Hawrylycz, Michael ;
Haydock, Andrew ;
Humbert, Richard ;
James, Keith D. ;
Johnson, Brett E. ;
Johnson, Ericka M. ;
Frum, Tristan T. ;
Rosenzweig, Elizabeth R. ;
Karnani, Neerja ;
Lee, Kirsten ;
Lefebvre, Gregory C. ;
Navas, Patrick A. ;
Neri, Fidencio ;
Parker, Stephen C. J. ;
Sabo, Peter J. ;
Sandstrom, Richard ;
Shafer, Anthony .
NATURE, 2007, 447 (7146) :799-816
[3]   A comparison of normalization methods for high density oligonucleotide array data based on variance and bias [J].
Bolstad, BM ;
Irizarry, RA ;
Åstrand, M ;
Speed, TP .
BIOINFORMATICS, 2003, 19 (02) :185-193
[4]   Minimum information about a microarray experiment (MIAME) - toward standards for microarray data [J].
Brazma, A ;
Hingamp, P ;
Quackenbush, J ;
Sherlock, G ;
Spellman, P ;
Stoeckert, C ;
Aach, J ;
Ansorge, W ;
Ball, CA ;
Causton, HC ;
Gaasterland, T ;
Glenisson, P ;
Holstege, FCP ;
Kim, IF ;
Markowitz, V ;
Matese, JC ;
Parkinson, H ;
Robinson, A ;
Sarkans, U ;
Schulze-Kremer, S ;
Stewart, J ;
Taylor, R ;
Vilo, J ;
Vingron, M .
NATURE GENETICS, 2001, 29 (04) :365-371
[5]   Strand selective generation of endo-siRNAs from the Na/phosphate transporter gene Slc34a1 in murine tissues [J].
Carlile, Mark ;
Swan, Daniel ;
Jackson, Kelly ;
Preston-Fayers, Keziah ;
Ballester, Benoit ;
Flicek, Paul ;
Werner, Andreas .
NUCLEIC ACIDS RESEARCH, 2009, 37 (07) :2274-2282
[6]   Redefinition of affymetrix probe sets by sequence overlap with cDNA microarray probes reduces cross-platform inconsistencies in cancer-associated gene expression measurements [J].
Carter, SL ;
Eklund, AC ;
Mecham, BH ;
Kohane, IS ;
Szallasi, Z .
BMC BIOINFORMATICS, 2005, 6 (1)
[7]  
Chalifa-Caspi Vered, 2003, Briefings in Bioinformatics, V4, P349, DOI 10.1093/bib/4.4.349
[8]   AILUN: reannotating gene expression data automatically [J].
Chen, Rong ;
Li, Li ;
Butte, Atul J. .
NATURE METHODS, 2007, 4 (11) :879-879
[9]   Lineage-Specific Biology Revealed by a Finished Genome Assembly of the Mouse [J].
Church, Deanna M. ;
Goodstadt, Leo ;
Hillier, LaDeana W. ;
Zody, Michael C. ;
Goldstein, Steve ;
She, Xinwe ;
Bult, Carol J. ;
Agarwala, Richa ;
Cherry, Joshua L. ;
DiCuccio, Michael ;
Hlavina, Wratko ;
Kapustin, Yuri ;
Meric, Peter ;
Maglott, Donna ;
Birtle, Zoe ;
Marques, Ana C. ;
Graves, Tina ;
Zhou, Shiguo ;
Teague, Brian ;
Potamousis, Konstantinos ;
Churas, Christopher ;
Place, Michael ;
Herschleb, Jill ;
Runnheim, Ron ;
Forrest, Daniel ;
Amos-Landgraf, James ;
Schwartz, David C. ;
Cheng, Ze ;
Lindblad-Toh, Kerstin ;
Eichler, Evan E. ;
Ponting, Chris P. .
PLOS BIOLOGY, 2009, 7 (05)
[10]   Finishing the euchromatic sequence of the human genome [J].
Collins, FS ;
Lander, ES ;
Rogers, J ;
Waterston, RH .
NATURE, 2004, 431 (7011) :931-945