Inferring global levels of alternative splicing isoforms using a generative model of microarray data

被引:46
作者
Shai, O [1 ]
Morris, QD
Blencowe, BJ
Frey, BJ
机构
[1] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON M5S 3G8, Canada
[2] Univ Toronto, Banting & Best Dept Med Res, Toronto, ON M5G 1L6, Canada
基金
加拿大自然科学与工程研究理事会; 加拿大健康研究院;
关键词
D O I
10.1093/bioinformatics/btk028
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Alternative splicing (AS) is a frequent step in metozoan gene expression whereby the exons of genes are spliced in different combinations to generate multiple isoforms of mature mRNA. AS functions to enrich an organism's proteomic complexity and regulates gene expression. Despite its importance, the mechanisms underlying AS and its regulation are not well understood, especially in the context of global gene expression patterns. We present here an algorithm referred to as the Generative model for the Alternative Splicing Array Platform (GenASAP) that can predict the levels of AS for thousands of exon skipping events using data generated from custom microarrays. GenASAP uses Bayesian learning in an unsupervised probability model to accurately predict AS levels from the microarray data. GenASAP is capable of learning the hybridization profiles of microarray data, while modeling noise processes and missing or aberrant data. GenASAP has been successfully applied to the global discovery and analysis of AS in mammalian cells and tissues. Results: GenASAP was applied to data obtained from a custom microarray designed for the monitoring of 3126 AS events in mouse cells and tissues. The microarray design included probes specific for exon body and junction sequences formed by the splicing of exons. Our results show that GenASAP provides accurate predictions for over one-third of the total events, as verified by independent RT-PCR assays. Contact: ofer@psi.toronto.edu Supplementary information: http://www.psi.toronto.edu/GenASAP.
引用
收藏
页码:606 / 613
页数:8
相关论文
共 27 条
[1]   Protein diversity from alternative splicing: A challenge for bioinformatics and post-genome biology [J].
Black, DL .
CELL, 2000, 103 (03) :367-370
[2]   Global analysis of positive and negative pre-mRNA splicing regulators in Drosophila [J].
Blanchette, M ;
Green, RE ;
Brenner, SE ;
Rio, DC .
GENES & DEVELOPMENT, 2005, 19 (11) :1306-1314
[3]  
Blencowe BJ, 2000, TRENDS BIOCHEM SCI, V25, P228
[4]   Exonic splicing enhancers: mechanism of action, diversity and role in human genetic diseases [J].
Blencowe, BJ .
TRENDS IN BIOCHEMICAL SCIENCES, 2000, 25 (03) :106-110
[5]   Listening to silence and understanding nonsense: Exonic mutations that affect splicing [J].
Cartegni, L ;
Chew, SL ;
Krainer, AR .
NATURE REVIEWS GENETICS, 2002, 3 (04) :285-298
[6]   Genomewide analysis of mRNA processing in yeast using splicing-specific microarrays [J].
Clark, TA ;
Sugnet, CW ;
Ares, M .
SCIENCE, 2002, 296 (5569) :907-910
[7]   ANOSVA: a statistical method for detecting splice variation from expression data [J].
Cline, MS ;
Blume, J ;
Cawley, S ;
Clark, TA ;
Hu, JS ;
Lu, G ;
Salomonis, N ;
Wang, H ;
Williams, A .
BIOINFORMATICS, 2005, 21 :I107-I115
[8]  
COZMAN F, 1995, P 5 WORKSH ART INT S, P161
[9]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[10]   Variance-stabilizing transformations for two-color microarrays [J].
Durbin, BP ;
Rocke, DM .
BIOINFORMATICS, 2004, 20 (05) :660-U190