Observations on novel splice junctions from RNA sequencing data

被引:17
作者
Wang, Likun [1 ,2 ]
Wang, Xiaowo [2 ]
Wang, Xi [2 ]
Liang, Yanchun [1 ]
Zhang, Xuegong [2 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Peoples R China
[2] Tsinghua Univ, MOE Key Lab, Bioinformat & Bioinformat Div, TNLIST Dept Automat, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
RNA sequencing; Alternative splicing; Splice junction detection; Sequencing depth; Differential expression; SINGLE; TRANSCRIPTOME; GENOME;
D O I
10.1016/j.bbrc.2011.05.005
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
High-throughput RNA sequencing (RNA-seq) technology provides a revolutionary approach to studying splicing events de nova. However, identifying splice junctions with high sensitivity and specificity remains a challenge. In the present study, we proposed a new tool named SeqSaw to detect splice junctions with or without the canonical GT-AG splicing signal. SeqSaw was applied to two ENCODE RNA-seq datasets and also compared with two existing methods. It was shown that the proposed method obtained better results on finding novel splice junctions. Experiments also revealed that the current sequencing depth has not yet reached saturation to detect novel transcripts. Moreover, by comparing the number of supporting reads, we demonstrated that many un-annotated splicing events can be tissue specific. (C) 2011 Elsevier Inc. All rights reserved.
引用
收藏
页码:299 / 303
页数:5
相关论文
共 27 条
[1]   RAPID CDNA SEQUENCING (EXPRESSED SEQUENCE TAGS) FROM A DIRECTIONALLY CLONED HUMAN INFANT BRAIN CDNA LIBRARY [J].
ADAMS, MD ;
SOARES, MB ;
KERLAVAGE, AR ;
FIELDS, C ;
VENTER, JC .
NATURE GENETICS, 1993, 4 (04) :373-386
[2]   Detection of splice junctions from paired-end RNA-seq data by SpliceMap [J].
Au, Kin Fai ;
Jiang, Hui ;
Lin, Lan ;
Xing, Yi ;
Wong, Wing Hung .
NUCLEIC ACIDS RESEARCH, 2010, 38 (14) :4570-4578
[3]   Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project [J].
Birney, Ewan ;
Stamatoyannopoulos, John A. ;
Dutta, Anindya ;
Guigo, Roderic ;
Gingeras, Thomas R. ;
Margulies, Elliott H. ;
Weng, Zhiping ;
Snyder, Michael ;
Dermitzakis, Emmanouil T. ;
Stamatoyannopoulos, John A. ;
Thurman, Robert E. ;
Kuehn, Michael S. ;
Taylor, Christopher M. ;
Neph, Shane ;
Koch, Christoph M. ;
Asthana, Saurabh ;
Malhotra, Ankit ;
Adzhubei, Ivan ;
Greenbaum, Jason A. ;
Andrews, Robert M. ;
Flicek, Paul ;
Boyle, Patrick J. ;
Cao, Hua ;
Carter, Nigel P. ;
Clelland, Gayle K. ;
Davis, Sean ;
Day, Nathan ;
Dhami, Pawandeep ;
Dillon, Shane C. ;
Dorschner, Michael O. ;
Fiegler, Heike ;
Giresi, Paul G. ;
Goldy, Jeff ;
Hawrylycz, Michael ;
Haydock, Andrew ;
Humbert, Richard ;
James, Keith D. ;
Johnson, Brett E. ;
Johnson, Ericka M. ;
Frum, Tristan T. ;
Rosenzweig, Elizabeth R. ;
Karnani, Neerja ;
Lee, Kirsten ;
Lefebvre, Gregory C. ;
Navas, Patrick A. ;
Neri, Fidencio ;
Parker, Stephen C. J. ;
Sabo, Peter J. ;
Sandstrom, Richard ;
Shafer, Anthony .
NATURE, 2007, 447 (7146) :799-816
[4]   DBEST - DATABASE FOR EXPRESSED SEQUENCE TAGS [J].
BOGUSKI, MS ;
LOWE, TMJ ;
TOLSTOSHEV, CM .
NATURE GENETICS, 1993, 4 (04) :332-333
[5]   Analysis of canonical and non-canonical splice sites in mammalian genomes [J].
Burset, M ;
Seledtsov, IA ;
Solovyev, VV .
NUCLEIC ACIDS RESEARCH, 2000, 28 (21) :4364-4375
[6]   The Ensembl automatic gene annotation system [J].
Curwen, V ;
Eyras, E ;
Andrews, TD ;
Clarke, L ;
Mongin, E ;
Searle, SMJ ;
Clamp, M .
GENOME RESEARCH, 2004, 14 (05) :942-950
[7]   Real-Time DNA Sequencing from Single Polymerase Molecules [J].
Eid, John ;
Fehr, Adrian ;
Gray, Jeremy ;
Luong, Khai ;
Lyle, John ;
Otto, Geoff ;
Peluso, Paul ;
Rank, David ;
Baybayan, Primo ;
Bettman, Brad ;
Bibillo, Arkadiusz ;
Bjornson, Keith ;
Chaudhuri, Bidhan ;
Christians, Frederick ;
Cicero, Ronald ;
Clark, Sonya ;
Dalal, Ravindra ;
deWinter, Alex ;
Dixon, John ;
Foquet, Mathieu ;
Gaertner, Alfred ;
Hardenbol, Paul ;
Heiner, Cheryl ;
Hester, Kevin ;
Holden, David ;
Kearns, Gregory ;
Kong, Xiangxu ;
Kuse, Ronald ;
Lacroix, Yves ;
Lin, Steven ;
Lundquist, Paul ;
Ma, Congcong ;
Marks, Patrick ;
Maxham, Mark ;
Murphy, Devon ;
Park, Insil ;
Pham, Thang ;
Phillips, Michael ;
Roy, Joy ;
Sebra, Robert ;
Shen, Gene ;
Sorenson, Jon ;
Tomaney, Austin ;
Travers, Kevin ;
Trulson, Mark ;
Vieceli, John ;
Wegener, Jeffrey ;
Wu, Dawn ;
Yang, Alicia ;
Zaccarin, Denis .
SCIENCE, 2009, 323 (5910) :133-138
[8]   Genome-wide mapping of alternative splicing in Arabidopsis thaliana [J].
Filichkin, Sergei A. ;
Priest, Henry D. ;
Givan, Scott A. ;
Shen, Rongkun ;
Bryant, Douglas W. ;
Fox, Samuel E. ;
Wong, Weng-Keen ;
Mockler, Todd C. .
GENOME RESEARCH, 2010, 20 (01) :45-58
[9]   Single-molecule DNA sequencing of a viral genome [J].
Harris, Timothy D. ;
Buzby, Phillip R. ;
Babcock, Hazen ;
Beer, Eric ;
Bowers, Jayson ;
Braslavsky, Ido ;
Causey, Marie ;
Colonell, Jennifer ;
DiMeo, James ;
Efcavitch, J. William ;
Giladi, Eldar ;
Gill, Jaime ;
Healy, John ;
Jarosz, Mirna ;
Lapen, Dan ;
Moulton, Keith ;
Quake, Stephen R. ;
Steinmann, Kathleen ;
Thayer, Edward ;
Tyurina, Anastasia ;
Ward, Rebecca ;
Weiss, Howard ;
Xie, Zheng .
SCIENCE, 2008, 320 (5872) :106-109
[10]   GENCODE: producing a reference annotation for ENCODE [J].
Harrow, Jennifer ;
Denoeud, France ;
Frankish, Adam ;
Reymond, Alexandre ;
Chen, Chao-Kung ;
Chrast, Jacqueline ;
Lagarde, Julien ;
Gilbert, James Gr ;
Storey, Roy ;
Swarbreck, David ;
Rossier, Colette ;
Ucla, Catherine ;
Hubbard, Tim ;
Antonarakis, Stylianos E. ;
Guigo, Roderic .
GENOME BIOLOGY, 2006, 7 (Suppl 1)