Discovering chimeric transcripts in paired-end RNA-seq data by using EricScript

被引:142
作者
Benelli, Matteo [1 ,2 ]
Pescucci, Chiara [2 ]
Marseglia, Giuseppina [2 ]
Severgnini, Marco [3 ]
Torricelli, Francesca [1 ,2 ]
Magi, Alberto [2 ,4 ]
机构
[1] Careggi Univ Hosp, Dept Lab, Diagnost Genet Unit, I-50134 Florence, Italy
[2] Univ Florence, Ctr Study Complex Dynam, I-50019 Florence, Italy
[3] Natl Res Council CNR, Inst Biomed Technol ITB, I-20090 Segrate, Italy
[4] Univ Florence, Dept Med & Surg Crit Care, I-50134 Florence, Italy
关键词
GENE FUSIONS; CANCER; IDENTIFICATION; ALIGNMENT;
D O I
10.1093/bioinformatics/bts617
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
MOTIVATION: The discovery of novel gene fusions can lead to a better comprehension of cancer progression and development. The emergence of deep sequencing of trancriptome, known as RNA-seq, has opened many opportunities for the identification of this class of genomic alterations, leading to the discovery of novel chimeric transcripts in melanomas, breast cancers and lymphomas. Nowadays, few computational approaches have been developed for the detection of chimeric transcripts. Although all of these computational methods show good sensitivity, much work remains to reduce the huge number of false-positive calls that arises from this analysis. RESULTS: We proposed a novel computational framework, named chimEric tranScript detection algorithm (EricScript), for the identification of gene fusion products in paired-end RNA-seq data. Our simulation study on synthetic data demonstrates that EricScript enables to achieve higher sensitivity and specificity than existing methods with noticeably lower running times. We also applied our method to publicly available RNA-seq tumour datasets, and we showed its capability in rediscovering known gene fusions.
引用
收藏
页码:3232 / 3239
页数:8
相关论文
共 27 条
[1]   Integrative analysis of the melanoma transcriptome [J].
Berger, Michael F. ;
Levin, Joshua Z. ;
Vijayendran, Krishna ;
Sivachenko, Andrey ;
Adiconis, Xian ;
Maguire, Jared ;
Johnson, Laura A. ;
Robinson, James ;
Verhaak, Roel G. ;
Sougnez, Carrie ;
Onofrio, Robert C. ;
Ziaugra, Liuda ;
Cibulskis, Kristian ;
Laine, Elisabeth ;
Barretina, Jordi ;
Winckler, Wendy ;
Fisher, David E. ;
Getz, Gad ;
Meyerson, Matthew ;
Jaffe, David B. ;
Gabriel, Stacey B. ;
Lander, Eric S. ;
Dummer, Reinhard ;
Gnirke, Andreas ;
Nusbaum, Chad ;
Garraway, Levi A. .
GENOME RESEARCH, 2010, 20 (04) :413-427
[2]   Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses [J].
Cabili, Moran N. ;
Trapnell, Cole ;
Goff, Loyal ;
Koziol, Magdalena ;
Tazon-Vega, Barbara ;
Regev, Aviv ;
Rinn, John L. .
GENES & DEVELOPMENT, 2011, 25 (18) :1915-1927
[3]   Identification of fusion genes in breast cancer by paired-end RNA-sequencing [J].
Edgren, Henrik ;
Murumagi, Astrid ;
Kangaspeska, Sara ;
Nicorici, Daniel ;
Hongisto, Vesa ;
Kleivi, Kristine ;
Rye, Inga H. ;
Nyberg, Sandra ;
Wolf, Maija ;
Borresen-Dale, Anne-Lise ;
Kallioniemi, Olli .
GENOME BIOLOGY, 2011, 12 (01)
[4]   FusionMap: detecting fusion genes from next-generation sequencing data at base-pair resolution [J].
Ge, Huanying ;
Liu, Kejun ;
Juan, Todd ;
Fang, Fang ;
Newman, Matthew ;
Hoeck, Wolfgang .
BIOINFORMATICS, 2011, 27 (14) :1922-1928
[5]   ChimeraScan: a tool for identifying chimeric transcription in sequencing data [J].
Iyer, Matthew K. ;
Chinnaiyan, Arul M. ;
Maher, Christopher A. .
BIOINFORMATICS, 2011, 27 (20) :2903-2904
[6]   Statistical inferences for isoform expression in RNA-Seq [J].
Jiang, Hui ;
Wong, Wing Hung .
BIOINFORMATICS, 2009, 25 (08) :1026-1032
[7]  
Kent WJ, 2002, GENOME RES, V12, P656, DOI [10.1101/gr.229202, 10.1101/gr.229202. Article published online before March 2002]
[8]   TopHat-Fusion: an algorithm for discovery of novel fusion transcripts [J].
Kim, Daehwan ;
Salzberg, Steven L. .
GENOME BIOLOGY, 2011, 12 (08)
[9]   Sensitive gene fusion detection using ambiguously mapping RNA-Seq read pairs [J].
Kinsella, Marcus ;
Harismendy, Olivier ;
Nakano, Masakazu ;
Frazer, Kelly A. ;
Bafna, Vineet .
BIOINFORMATICS, 2011, 27 (08) :1068-1075
[10]  
Li H, 2009, BIOINFORMATICS, V25, P1094, DOI [10.1093/bioinformatics/btp324, 10.1093/bioinformatics/btp100]