Reliable Identification of Genomic Variants from RNA-Seq Data

被引:240
作者
Piskol, Robert [1 ]
Ramaswami, Gokul [1 ]
Li, Jin Billy [1 ]
机构
[1] Stanford Univ, Dept Genet, Stanford, CA 94305 USA
基金
美国国家卫生研究院;
关键词
ACCURATE IDENTIFICATION; MUTATIONAL EVOLUTION; SPLICE JUNCTIONS; READ ALIGNMENT; TRANSCRIPTOME; DNA; POLYMORPHISM; MECHANISMS; EXPRESSION; LANDSCAPE;
D O I
10.1016/j.ajhg.2013.08.008
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Identifying genomic variation is a crucial step for unraveling the relationship between genotype and phenotype and can yield important insights into human diseases. Prevailing methods rely on cost-intensive whole-genome sequencing (WGS) or whole-exome sequencing (WES) approaches while the identification of genomic variants from often existing RNA sequencing (RNA-seq) data remains a challenge because of the intrinsic complexity in the transcriptome. Here, we present a highly accurate approach termed SNPiR to identify SNPs in RNA-seq data. We applied SNPiR to RNA-seq data of samples for which WGS and WES data are also available and achieved high specificity and sensitivity. Of the SNPs called from the RNA-seq data, >98% were also identified by WGS or WES. Over 70% of all expressed coding variants were identified from RNA-seq, and comparable numbers of exonic variants were identified in RNA-seq and WES. Despite our method's limitation in detecting variants in expressed regions only, our results demonstrate that SNPiR outperforms current state-of-the-art approaches for variant detection from RNA-seq data and offers a cost-effective and reliable alternative for SNP discovery.
引用
收藏
页码:641 / 651
页数:11
相关论文
共 53 条
[1]   The common PPARγ Pro12Ala polymorphism is associated with decreased risk of type 2 diabetes [J].
Altshuler, D ;
Hirschhorn, JN ;
Klannemark, M ;
Lindgren, CM ;
Vohl, MC ;
Nemesh, J ;
Lane, CR ;
Schaffner, SF ;
Bolk, S ;
Brewer, C ;
Tuomi, T ;
Gaudet, D ;
Hudson, TJ ;
Daly, M ;
Groop, L ;
Lander, ES .
NATURE GENETICS, 2000, 26 (01) :76-80
[2]   A map of human genome variation from population-scale sequencing [J].
Altshuler, David ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Collins, Francis S. ;
De la Vega, Francisco M. ;
Donnelly, Peter ;
Egholm, Michael ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Knoppers, Bartha M. ;
Lander, Eric S. ;
Lehrach, Hans ;
Mardis, Elaine R. ;
McVean, Gil A. ;
Nickerson, DebbieA. ;
Peltonen, Leena ;
Schafer, Alan J. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Deiros, David ;
Metzker, Mike ;
Muzny, Donna ;
Reid, Jeff ;
Wheeler, David ;
Wang, Jun ;
Li, Jingxiang ;
Jian, Min ;
Li, Guoqing ;
Li, Ruiqiang ;
Liang, Huiqing ;
Tian, Geng ;
Wang, Bo ;
Wang, Jian ;
Wang, Wei ;
Yang, Huanming ;
Zhang, Xiuqing ;
Zheng, Huisong ;
Lander, Eric S. ;
Altshuler, David L. ;
Ambrogio, Lauren ;
Bloom, Toby ;
Cibulskis, Kristian ;
Fennell, Tim J. ;
Gabriel, Stacey B. .
NATURE, 2010, 467 (7319) :1061-1073
[3]   Integrating common and rare genetic variation in diverse human populations [J].
Altshuler, David M. ;
Gibbs, Richard A. ;
Peltonen, Leena ;
Dermitzakis, Emmanouil ;
Schaffner, Stephen F. ;
Yu, Fuli ;
Bonnen, Penelope E. ;
de Bakker, Paul I. W. ;
Deloukas, Panos ;
Gabriel, Stacey B. ;
Gwilliam, Rhian ;
Hunt, Sarah ;
Inouye, Michael ;
Jia, Xiaoming ;
Palotie, Aarno ;
Parkin, Melissa ;
Whittaker, Pamela ;
Chang, Kyle ;
Hawes, Alicia ;
Lewis, Lora R. ;
Ren, Yanru ;
Wheeler, David ;
Muzny, Donna Marie ;
Barnes, Chris ;
Darvishi, Katayoon ;
Hurles, Matthew ;
Korn, Joshua M. ;
Kristiansson, Kati ;
Lee, Charles ;
McCarroll, Steven A. ;
Nemesh, James ;
Keinan, Alon ;
Montgomery, Stephen B. ;
Pollack, Samuela ;
Price, Alkes L. ;
Soranzo, Nicole ;
Gonzaga-Jauregui, Claudia ;
Anttila, Verneri ;
Brodeur, Wendy ;
Daly, Mark J. ;
Leslie, Stephen ;
McVean, Gil ;
Moutsianas, Loukas ;
Nguyen, Huy ;
Zhang, Qingrun ;
Ghori, Mohammed J. R. ;
McGinnis, Ralph ;
McLaren, William ;
Takeuchi, Fumihiko ;
Grossman, Sharon R. .
NATURE, 2010, 467 (7311) :52-58
[4]   Detection of splice junctions from paired-end RNA-seq data by SpliceMap [J].
Au, Kin Fai ;
Jiang, Hui ;
Lin, Lan ;
Xing, Yi ;
Wong, Wing Hung .
NUCLEIC ACIDS RESEARCH, 2010, 38 (14) :4570-4578
[5]   Accurate identification of A-to-I RNA editing in human by transcriptome sequencing [J].
Bahn, Jae Hoon ;
Lee, Jae-Hyung ;
Li, Gang ;
Greer, Christopher ;
Peng, Guangdun ;
Xiao, Xinshu .
GENOME RESEARCH, 2012, 22 (01) :142-150
[6]   Accurate whole human genome sequencing using reversible terminator chemistry [J].
Bentley, David R. ;
Balasubramanian, Shankar ;
Swerdlow, Harold P. ;
Smith, Geoffrey P. ;
Milton, John ;
Brown, Clive G. ;
Hall, Kevin P. ;
Evers, Dirk J. ;
Barnes, Colin L. ;
Bignell, Helen R. ;
Boutell, Jonathan M. ;
Bryant, Jason ;
Carter, Richard J. ;
Cheetham, R. Keira ;
Cox, Anthony J. ;
Ellis, Darren J. ;
Flatbush, Michael R. ;
Gormley, Niall A. ;
Humphray, Sean J. ;
Irving, Leslie J. ;
Karbelashvili, Mirian S. ;
Kirk, Scott M. ;
Li, Heng ;
Liu, Xiaohai ;
Maisinger, Klaus S. ;
Murray, Lisa J. ;
Obradovic, Bojan ;
Ost, Tobias ;
Parkinson, Michael L. ;
Pratt, Mark R. ;
Rasolonjatovo, Isabelle M. J. ;
Reed, Mark T. ;
Rigatti, Roberto ;
Rodighiero, Chiara ;
Ross, Mark T. ;
Sabot, Andrea ;
Sankar, Subramanian V. ;
Scally, Aylwyn ;
Schroth, Gary P. ;
Smith, Mark E. ;
Smith, Vincent P. ;
Spiridou, Anastassia ;
Torrance, Peta E. ;
Tzonev, Svilen S. ;
Vermaas, Eric H. ;
Walter, Klaudia ;
Wu, Xiaolin ;
Zhang, Lu ;
Alam, Mohammed D. ;
Anastasi, Carole .
NATURE, 2008, 456 (7218) :53-59
[7]   Initial genome sequencing and analysis of multiple myeloma [J].
Chapman, Michael A. ;
Lawrence, Michael S. ;
Keats, Jonathan J. ;
Cibulskis, Kristian ;
Sougnez, Carrie ;
Schinzel, Anna C. ;
Harview, Christina L. ;
Brunet, Jean-Philippe ;
Ahmann, Gregory J. ;
Adli, Mazhar ;
Anderson, Kenneth C. ;
Ardlie, Kristin G. ;
Auclair, Daniel ;
Baker, Angela ;
Bergsagel, P. Leif ;
Bernstein, Bradley E. ;
Drier, Yotam ;
Fonseca, Rafael ;
Gabriel, Stacey B. ;
Hofmeister, Craig C. ;
Jagannath, Sundar ;
Jakubowiak, Andrzej J. ;
Krishnan, Amrita ;
Levy, Joan ;
Liefeld, Ted ;
Lonial, Sagar ;
Mahan, Scott ;
Mfuko, Bunmi ;
Monti, Stefano ;
Perkins, Louise M. ;
Onofrio, Robb ;
Pugh, Trevor J. ;
Rajkumar, S. Vincent ;
Ramos, Alex H. ;
Siegel, David S. ;
Sivachenko, Andrey ;
Stewart, A. Keith ;
Trudel, Suzanne ;
Vij, Ravi ;
Voet, Douglas ;
Winckler, Wendy ;
Zimmerman, Todd ;
Carpten, John ;
Trent, Jeff ;
Hahn, William C. ;
Garraway, Levi A. ;
Meyerson, Matthew ;
Lander, Eric S. ;
Getz, Gad ;
Golub, Todd R. .
NATURE, 2011, 471 (7339) :467-472
[8]   RNASEQR-a streamlined and accurate RNA-seq sequence analysis program [J].
Chen, Leslie Y. ;
Wei, Kuo-Chen ;
Huang, Abner C. -Y. ;
Wang, Kai ;
Huang, Chiung-Yin ;
Yi, Danielle ;
Tang, Chuan Yi ;
Galas, David J. ;
Hood, Leroy E. .
NUCLEIC ACIDS RESEARCH, 2012, 40 (06) :e42
[9]   Personal Omics Profiling Reveals Dynamic Molecular and Medical Phenotypes [J].
Chen, Rui ;
Mias, George I. ;
Li-Pook-Than, Jennifer ;
Jiang, Lihua ;
Lam, Hugo Y. K. ;
Chen, Rong ;
Miriami, Elana ;
Karczewski, Konrad J. ;
Hariharan, Manoj ;
Dewey, Frederick E. ;
Cheng, Yong ;
Clark, Michael J. ;
Im, Hogune ;
Habegger, Lukas ;
Balasubramanian, Suganthi ;
O'Huallachain, Maeve ;
Dudley, Joel T. ;
Hillenmeyer, Sara ;
Haraksingh, Rajini ;
Sharon, Donald ;
Euskirchen, Ghia ;
Lacroute, Phil ;
Bettinger, Keith ;
Boyle, Alan P. ;
Kasowski, Maya ;
Grubert, Fabian ;
Seki, Scott ;
Garcia, Marco ;
Whirl-Carrillo, Michelle ;
Gallardo, Mercedes ;
Blasco, Maria A. ;
Greenberg, Peter L. ;
Snyder, Phyllis ;
Klein, Teri E. ;
Altman, Russ B. ;
Butte, Atul J. ;
Ashley, Euan A. ;
Gerstein, Mark ;
Nadeau, Kari C. ;
Tang, Hua ;
Snyder, Michael .
CELL, 2012, 148 (06) :1293-1307
[10]   Mechanisms and consequences of widespread random monoallelic expression [J].
Chess, Andrew .
NATURE REVIEWS GENETICS, 2012, 13 (06) :421-428