Transcript-level annotation of Affymetrix probesets improves the interpretation of gene expression data

Cited by: 35
Authors
Yu, Hui [1]
Wang, Feng
Tu, Kang
Xie, Lu
Li, Yuan-Yuan
Li, Yi-Xue
Agrawal, Sunil
Affiliations
[1] Shanghai Ctr Bioinformat Technol, Shanghai 200235, Peoples R China
[2] Shanghai Jiao Tong Univ, Sch Life Sci & Technol, Shanghai 200240, Peoples R China
[3] Chinese Acad Sci, Shanghai Inst Biol Sci, Key Lab Syst Biol, Shanghai 200031, Peoples R China
[4] Chinese Acad Sci, Shanghai Inst Biol Sci, Bioinformat Ctr, Shanghai 200031, Peoples R China
DOI
10.1186/1471-2105-8-194
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Subject classification code
071010; 081704
Abstract
Background: The widespread use of Affymetrix microarrays across an expanding range of biological research has made probeset annotation an important issue. Standard Affymetrix probeset annotation is at the gene level, i.e. a probeset is precisely linked to a gene, and probeset intensity is interpreted as gene expression. Growing evidence that one gene may have multiple transcript variants makes it necessary to refine this gene-level annotation to the transcript level. Results: By rigorously aligning the Affymetrix probe sequences against a comprehensive pool of currently available transcript sequences, and further linking the probesets to the International Protein Index, we generated transcript-level or protein-level annotation tables for two popular Affymetrix expression arrays, the Mouse Genome 430A 2.0 Array and the Human Genome U133A Array. Applying our new annotations to re-examine existing expression data sets shows increased expression consistency among synonymous probesets and strengthened expression correlation between interacting proteins. Conclusion: By refining the standard Affymetrix annotation of microarray probesets from the gene level to the transcript level and protein level, researchers can achieve a more reliable interpretation of their experimental data, which may lead to the discovery of more profound regulatory mechanisms.
Pages: 15