FusionMap: detecting fusion genes from next-generation sequencing data at base-pair resolution

Cited by: 199
Authors
Ge, Huanying [1]
Liu, Kejun [2]
Juan, Todd [3]
Fang, Fang [4]
Newman, Matthew [2]
Hoeck, Wolfgang [1]
Affiliations
[1] Amgen Inc, Res & Dev Informat, Thousand Oaks, CA 91320 USA
[2] OmicSoft Corp, Morrisville, NC 27560 USA
[3] Amgen Inc, Prot Sci, Thousand Oaks, CA 91320 USA
[4] Univ So Calif, Los Angeles, CA 90089 USA
Keywords
SPLICE JUNCTIONS; COPY NUMBER; GENOME; IDENTIFICATION; VARIANTS
DOI
10.1093/bioinformatics/btr310
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Motivation: Next-generation sequencing technology generates high-throughput data, which allows us to detect fusion genes at both transcript and genomic levels. To detect fusion genes, current bioinformatics tools rely heavily on paired-end approaches and overlook the importance of reads that span fusion junctions. Thus, there is a need to develop an efficient aligner that detects fusion events by accurately mapping these junction-spanning single reads, particularly as reads get longer with improvements in sequencing technology.
Results: We present a novel method, FusionMap, which aligns fusion reads directly to the genome without prior knowledge of potential fusion regions. FusionMap can detect fusion events in both single- and paired-end datasets from either RNA-Seq or gDNA-Seq studies and characterize fusion junctions at base-pair resolution. We showed that FusionMap achieved high sensitivity and specificity in fusion detection on two simulated RNA-Seq datasets, which contained 75 nt paired-end reads. FusionMap achieved substantially higher sensitivity and specificity than the paired-end approach when the inner distance between read pairs was small. Using FusionMap to characterize fusion genes in the K562 chronic myeloid leukemia cell line, we further demonstrated its accuracy in fusion detection in both single-end RNA-Seq and gDNA-Seq datasets. These combined results show that FusionMap provides an accurate and systematic solution to detecting fusion events through junction-spanning reads.
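To make the idea of a junction-spanning read concrete, the following toy sketch locates a breakpoint at base-pair resolution by trying every split of a read into a prefix and a suffix. It is an illustration only and is not FusionMap's algorithm: the function name `find_fusion_breakpoint`, the `min_anchor` parameter, the two short reference strings and the exact-match requirement are all assumptions made for this example, whereas a real aligner would index the genome and tolerate mismatches.

```python
def find_fusion_breakpoint(read, ref_a, ref_b, min_anchor=5):
    """Toy split-read search (hypothetical helper, not FusionMap's method).

    Returns (split, pos_a, pos_b) for the leftmost split at which
    read[:split] occurs exactly in ref_a and read[split:] occurs exactly
    in ref_b, or None if no such split exists. Each side of the read must
    be at least min_anchor bases long.
    """
    for split in range(min_anchor, len(read) - min_anchor + 1):
        pos_a = ref_a.find(read[:split])
        if pos_a == -1:
            # If this prefix is absent, every longer prefix is absent too.
            break
        pos_b = ref_b.find(read[split:])
        if pos_b != -1:
            return split, pos_a, pos_b
    return None


# Made-up reference sequences standing in for two partner genes.
ref_a = "ACGTACGTTAGCCGATAGGCTA"
ref_b = "TTGACCATGGCATCGATCGGAT"

# A simulated junction-spanning read: 8 bases from ref_a, then 8 from ref_b.
fusion_read = ref_a[8:16] + ref_b[5:13]
# A read that lies entirely inside ref_a and spans no junction.
normal_read = ref_a[0:16]

fusion_hit = find_fusion_breakpoint(fusion_read, ref_a, ref_b)
normal_hit = find_fusion_breakpoint(normal_read, ref_a, ref_b)
print(fusion_hit)
print(normal_hit)
```

In this sketch the returned `split` is the offset within the read at which the sequence switches from one reference to the other, so the junction falls between `ref_a[pos_a + split - 1]` and `ref_b[pos_b]`. When the bases around a junction are shared by both references, several splits can be valid, and this version simply reports the leftmost one.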
Pages: 1922-1928
Number of pages: 7