ABRA: improved coding indel detection via assembly-based realignment

被引:113
作者
Mose, Lisle E. [1 ]
Wilkerson, Matthew D. [1 ,2 ]
Hayes, D. Neil [1 ,3 ]
Perou, Charles M. [1 ,2 ]
Parker, Joel S. [1 ,2 ]
机构
[1] Univ N Carolina, Lineberger Comprehens Canc Ctr, Chapel Hill, NC 27599 USA
[2] Univ N Carolina, Dept Genet, Chapel Hill, NC 27599 USA
[3] Univ N Carolina, Dept Internal Med, Div Med Oncol, Multidisciplinary Thorac Oncol Program, Chapel Hill, NC 27599 USA
关键词
ALIGNMENT; DISCOVERY;
D O I
10.1093/bioinformatics/btu376
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Variant detection from next-generation sequencing (NGS) data is an increasingly vital aspect of disease diagnosis, treatment and research. Commonly used NGS-variant analysis tools generally rely on accurately mapped short reads to identify somatic variants and germline genotypes. Existing NGS read mappers have difficulty accurately mapping short reads containing complex variation (i.e. more than a single base change), thus making identification of such variants difficult or impossible. Insertions and deletions (indels) in particular have been an area of great difficulty. Indels are frequent and can have substantial impact on function, which makes their detection all the more imperative. Results: We present ABRA, an assembly-based realigner, which uses an efficient and flexible localized de novo assembly followed by global realignment to more accurately remap reads. This results in enhanced performance for indel detection as well as improved accuracy in variant allele frequency estimation.
引用
收藏
页码:2813 / 2815
页数:3
相关论文
共 18 条
[1]   Dindel: Accurate indel calls from short-read data [J].
Albers, Cornelis A. ;
Lunter, Gerton ;
MacArthur, Daniel G. ;
McVean, Gilean ;
Ouwehand, Willem H. ;
Durbin, Richard .
GENOME RESEARCH, 2011, 21 (06) :961-973
[2]  
[Anonymous], 2012, ARXIV12073907V2
[3]   Computational Techniques for Human Genome Resequencing Using Mated Gapped Reads [J].
Carnevali, Paolo ;
Baccash, Jonathan ;
Halpern, Aaron L. ;
Nazarenko, Igor ;
Nilsen, Geoffrey B. ;
Pant, Krishna P. ;
Ebert, Jessica C. ;
Brownley, Anushka ;
Morenzoni, Matt ;
Karpinchyk, Vitali ;
Martin, Bruce ;
Ballinger, Dennis G. ;
Drmanac, Radoje .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2012, 19 (03) :279-292
[4]   TIGRA: A targeted iterative graph routing assembler for breakpoint assembly [J].
Chen, Ken ;
Chen, Lei ;
Fan, Xian ;
Wallis, John ;
Ding, Li ;
Weinstock, George .
GENOME RESEARCH, 2014, 24 (02) :310-317
[5]   A framework for variation discovery and genotyping using next-generation DNA sequencing data [J].
DePristo, Mark A. ;
Banks, Eric ;
Poplin, Ryan ;
Garimella, Kiran V. ;
Maguire, Jared R. ;
Hartl, Christopher ;
Philippakis, Anthony A. ;
del Angel, Guillermo ;
Rivas, Manuel A. ;
Hanna, Matt ;
McKenna, Aaron ;
Fennell, Tim J. ;
Kernytsky, Andrew M. ;
Sivachenko, Andrey Y. ;
Cibulskis, Kristian ;
Gabriel, Stacey B. ;
Altshuler, David ;
Daly, Mark J. .
NATURE GENETICS, 2011, 43 (05) :491-+
[6]   Development and validation of a clinical cancer genomic profiling test based on massively parallel DNA sequencing [J].
Frampton, Garrett M. ;
Fichtenholtz, Alex ;
Otto, Geoff A. ;
Wang, Kai ;
Downing, Sean R. ;
He, Jie ;
Schnall-Levin, Michael ;
White, Jared ;
Sanford, Eric M. ;
An, Peter ;
Sun, James ;
Juhn, Frank ;
Brennan, Kristina ;
Iwanik, Kiel ;
Maillet, Ashley ;
Buell, Jamie ;
White, Emily ;
Zhao, Mandy ;
Balasubramanian, Sohail ;
Terzic, Selmira ;
Richards, Tina ;
Banning, Vera ;
Garcia, Lazaro ;
Mahoney, Kristen ;
Zwirko, Zac ;
Donahue, Amy ;
Beltran, Himisha ;
Mosquera, Juan Miguel ;
Rubin, Mark A. ;
Dogan, Snjezana ;
Hedvat, Cyrus V. ;
Berger, Michael F. ;
Pusztai, Lajos ;
Lechner, Matthias ;
Boshoff, Chris ;
Jarosz, Mirna ;
Vietz, Christine ;
Parker, Alex ;
Miller, Vincent A. ;
Ross, Jeffrey S. ;
Curran, John ;
Cronin, Maureen T. ;
Stephens, Philip J. ;
Lipson, Doron ;
Yelensky, Roman .
NATURE BIOTECHNOLOGY, 2013, 31 (11) :1023-+
[7]   Improved variant discovery through local re-alignment of short-read next-generation sequencing data using SRMA [J].
Homer, Nils ;
Nelson, Stanley F. .
GENOME BIOLOGY, 2010, 11 (10)
[8]   De novo assembly and genotyping of variants using colored de Bruijn graphs [J].
Iqbal, Zamin ;
Caccamo, Mario ;
Turner, Isaac ;
Flicek, Paul ;
McVean, Gil .
NATURE GENETICS, 2012, 44 (02) :226-232
[9]  
Kent WJ, 2002, GENOME RES, V12, P656, DOI [10.1101/gr.229202, 10.1101/gr.229202. Article published online before March 2002]
[10]   Comprehensive molecular portraits of human breast tumours [J].
Koboldt, Daniel C. ;
Fulton, Robert S. ;
McLellan, Michael D. ;
Schmidt, Heather ;
Kalicki-Veizer, Joelle ;
McMichael, Joshua F. ;
Fulton, Lucinda L. ;
Dooling, David J. ;
Ding, Li ;
Mardis, Elaine R. ;
Wilson, Richard K. ;
Ally, Adrian ;
Balasundaram, Miruna ;
Butterfield, Yaron S. N. ;
Carlsen, Rebecca ;
Carter, Candace ;
Chu, Andy ;
Chuah, Eric ;
Chun, Hye-Jung E. ;
Coope, Robin J. N. ;
Dhalla, Noreen ;
Guin, Ranabir ;
Hirst, Carrie ;
Hirst, Martin ;
Holt, Robert A. ;
Lee, Darlene ;
Li, Haiyan I. ;
Mayo, Michael ;
Moore, Richard A. ;
Mungall, Andrew J. ;
Pleasance, Erin ;
Robertson, A. Gordon ;
Schein, Jacqueline E. ;
Shafiei, Arash ;
Sipahimalani, Payal ;
Slobodan, Jared R. ;
Stoll, Dominik ;
Tam, Angela ;
Thiessen, Nina ;
Varhol, Richard J. ;
Wye, Natasja ;
Zeng, Thomas ;
Zhao, Yongjun ;
Birol, Inanc ;
Jones, Steven J. M. ;
Marra, Marco A. ;
Cherniack, Andrew D. ;
Saksena, Gordon ;
Onofrio, Robert C. ;
Pho, Nam H. .
NATURE, 2012, 490 (7418) :61-70