Model-based detection of alternative splicing signals

被引:19
作者
Barash, Yoseph [1 ,2 ,3 ]
Blencowe, Benjamin J. [1 ,3 ,4 ]
Frey, Brendan J. [1 ,2 ,3 ]
机构
[1] Univ Toronto, Banting & Best Dept Med Res, Toronto, ON M5S 1A1, Canada
[2] Univ Toronto, Dept Elect & Comp Engn, Toronto, ON M5S 1A1, Canada
[3] Univ Toronto, Donnelly Ctr Cellular & Biomol Res, Toronto, ON M5S 1A1, Canada
[4] Univ Toronto, Dept Mol Genet, Toronto, ON M5S 1A1, Canada
基金
加拿大创新基金会;
关键词
TRACT BINDING-PROTEIN; REGULATORY ELEMENTS; GENE-EXPRESSION; MICROARRAY DATA; NETWORKS; MODULES; CODE;
D O I
10.1093/bioinformatics/btq200
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Transcripts from similar to 95% of human multi-exon genes are subject to alternative splicing (AS). The growing interest in AS is propelled by its prominent contribution to transcriptome and proteome complexity and the role of aberrant AS in numerous diseases. Recent technological advances enable thousands of exons to be simultaneously profiled across diverse cell types and cellular conditions, but require accurate identification of condition-specific splicing changes. It is necessary to accurately identify such splicing changes to elucidate the underlying regulatory programs or link the splicing changes to specific diseases. Results: We present a probabilistic model tailored for high-throughput AS data, where observed isoform levels are explained as combinations of condition-specific AS signals. According to our formulation, given an AS dataset our tasks are to detect common signals in the data and identify the exons relevant to each signal. Our model can incorporate prior knowledge about underlying AS signals, measurement quality and gene expression level effects. Using a large-scale multi-tissue AS dataset, we demonstrate the advantage of our method over standard alternative approaches. In addition, we describe newly found tissue-specific AS signals which were verified experimentally, and discuss associated regulatory features. Contact: yoseph@psi.utoronto.ca; frey@psi.utoronto.ca Supplementary information: Supplementary data are available at Bioinformatics online.
引用
收藏
页码:i325 / i333
页数:9
相关论文
共 33 条
[1]  
[Anonymous], 1996, The em algorithm for mixtures of factor analyzers
[2]  
Ashiya M, 1997, RNA, V3, P996
[3]   Independent factor analysis [J].
Attias, H .
NEURAL COMPUTATION, 1999, 11 (04) :803-851
[4]   Computational discovery of gene modules and regulatory networks [J].
Bar-Joseph, Z ;
Gerber, GK ;
Lee, TI ;
Rinaldi, NJ ;
Yoo, JY ;
Robert, F ;
Gordon, DB ;
Fraenkel, E ;
Jaakkola, TS ;
Young, RA ;
Gifford, DK .
NATURE BIOTECHNOLOGY, 2003, 21 (11) :1337-1342
[5]  
BARASH Y, 2010, NATURE, V464, P7294
[6]   Predicting gene expression from sequence [J].
Beer, MA ;
Tavazoie, S .
CELL, 2004, 117 (02) :185-198
[7]   MicroRNAs regulate the expression of the alternative splicing factor nPTB during muscle development [J].
Boutz, Paul L. ;
Chawla, Geetanjali ;
Stoilov, Peter ;
Black, Douglas L. .
GENES & DEVELOPMENT, 2007, 21 (01) :71-84
[8]  
Candes E. J., 2009, ROBUST PRINCIPAL COM
[9]   Expression of 24,426 human alternative splicing events and predicted cis regulation in 48 tissues and cell lines [J].
Castle, John C. ;
Zhang, Chaolin ;
Shah, Jyoti K. ;
Kulkarni, Amit V. ;
Kalsotra, Auinash ;
Cooper, Thomas A. ;
Johnson, Jason M. .
NATURE GENETICS, 2008, 40 (12) :1416-1425
[10]   The polypyrimidine tract binding protein binds upstream of neural cell-specific c-src exon N1 to repress the splicing of the intron downstream [J].
Chan, RCC ;
Black, DL .
MOLECULAR AND CELLULAR BIOLOGY, 1997, 17 (08) :4667-4676