Transcriptome genetics using second generation sequencing in a Caucasian population

被引:618
作者
Montgomery, Stephen B. [1 ,2 ]
Sammeth, Micha [3 ]
Gutierrez-Arcelus, Maria [1 ]
Lach, Radoslaw P. [2 ]
Ingle, Catherine [2 ]
Nisbett, James [2 ]
Guigo, Roderic [3 ]
Dermitzakis, Emmanouil T. [1 ,2 ]
机构
[1] Univ Geneva, Sch Med, Dept Genet Med & Dev, CH-1211 Geneva, Switzerland
[2] Wellcome Trust Sanger Inst, Cambridge CB10 1HH, England
[3] Univ Pompeu Fabra, Ctr Genom Regulat, Barcelona 08003, Catalonia, Spain
基金
英国惠康基金; 瑞士国家科学基金会;
关键词
RNA-SEQ; REGULATORY VARIATION; EXPRESSION; GENOME; NUCLEOTIDE; RESOLUTION;
D O I
10.1038/nature08903
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Gene expression is an important phenotype that informs about genetic and environmental effects on cellular state. Many studies have previously identified genetic variants for gene expression phenotypes using custom and commercially available microarrays(1-5). Second generation sequencing technologies are now providing unprecedented access to the fine structure of the transcriptome(6-14). We have sequenced the mRNA fraction of the transcriptome in 60 extended HapMap individuals of European descent and have combined these data with genetic variants from the HapMap3 project(15). We have quantified exon abundance based on read depth and have also developed methods to quantify whole transcript abundance. We have found that approximately 10 million reads of sequencing can provide access to the same dynamic range as arrays with better quantification of alternative and highly abundant transcripts. Correlation with SNPs (small nucleotide polymorphisms) leads to a larger discovery of eQTLs (expression quantitative trait loci) than with arrays. We also detect a substantial number of variants that influence the structure of mature transcripts indicating variants responsible for alternative splicing. Finally, measures of allele-specific expression allowed the identification of rare eQTLs and allelic differences in transcript structure. This analysis shows that high throughput sequencing technologies reveal new properties of genetic effects on the transcriptome and allow the exploration of genetic effects in cellular processes.
引用
收藏
页码:773 / U151
页数:7
相关论文
共 34 条
[1]  
Ahuja R. K., 1993, Network flows: theory, algorithms, and applications
[2]   Stem cell transcriptome profiling via massive-scale mRNA sequencing [J].
Cloonan, Nicole ;
Forrest, Alistair R. R. ;
Kolle, Gabriel ;
Gardiner, Brooke B. A. ;
Faulkner, Geoffrey J. ;
Brown, Mellissa K. ;
Taylor, Darrin F. ;
Steptoe, Anita L. ;
Wani, Shivangi ;
Bethel, Graeme ;
Robertson, Alan J. ;
Perkins, Andrew C. ;
Bruce, Stephen J. ;
Lee, Clarence C. ;
Ranade, Swati S. ;
Peckham, Heather E. ;
Manning, Jonathan M. ;
McKernan, Kevin J. ;
Grimmond, Sean M. .
NATURE METHODS, 2008, 5 (07) :613-619
[3]  
Cormen T., 2001, Introduction to Algorithms, P770
[4]   Genetics of gene expression and its effect on disease [J].
Emilsson, Valur ;
Thorleifsson, Gudmar ;
Zhang, Bin ;
Leonardson, Amy S. ;
Zink, Florian ;
Zhu, Jun ;
Carlson, Sonia ;
Helgason, Agnar ;
Walters, G. Bragi ;
Gunnarsdottir, Steinunn ;
Mouy, Magali ;
Steinthorsdottir, Valgerdur ;
Eiriksdottir, Gudrun H. ;
Bjornsdottir, Gyda ;
Reynisdottir, Inga ;
Gudbjartsson, Daniel ;
Helgadottir, Anna ;
Jonasdottir, Aslaug ;
Jonasdottir, Adalbjorg ;
Styrkarsdottir, Unnur ;
Gretarsdottir, Solveig ;
Magnusson, Kristinn P. ;
Stefansson, Hreinn ;
Fossdal, Ragnheidur ;
Kristjansson, Kristleifur ;
Gislason, Hjortur G. ;
Stefansson, Tryggvi ;
Leifsson, Bjorn G. ;
Thorsteinsdottir, Unnur ;
Lamb, John R. ;
Gulcher, Jeffrey R. ;
Reitman, Marc L. ;
Kong, Augustine ;
Schadt, Eric E. ;
Stefansson, Kari .
NATURE, 2008, 452 (7186) :423-U2
[5]   A second generation human haplotype map of over 3.1 million SNPs [J].
Frazer, Kelly A. ;
Ballinger, Dennis G. ;
Cox, David R. ;
Hinds, David A. ;
Stuve, Laura L. ;
Gibbs, Richard A. ;
Belmont, John W. ;
Boudreau, Andrew ;
Hardenbol, Paul ;
Leal, Suzanne M. ;
Pasternak, Shiran ;
Wheeler, David A. ;
Willis, Thomas D. ;
Yu, Fuli ;
Yang, Huanming ;
Zeng, Changqing ;
Gao, Yang ;
Hu, Haoran ;
Hu, Weitao ;
Li, Chaohua ;
Lin, Wei ;
Liu, Siqi ;
Pan, Hao ;
Tang, Xiaoli ;
Wang, Jian ;
Wang, Wei ;
Yu, Jun ;
Zhang, Bo ;
Zhang, Qingrun ;
Zhao, Hongbin ;
Zhao, Hui ;
Zhou, Jun ;
Gabriel, Stacey B. ;
Barry, Rachel ;
Blumenstiel, Brendan ;
Camargo, Amy ;
Defelice, Matthew ;
Faggart, Maura ;
Goyette, Mary ;
Gupta, Supriya ;
Moore, Jamie ;
Nguyen, Huy ;
Onofrio, Robert C. ;
Parkin, Melissa ;
Roy, Jessica ;
Stahl, Erich ;
Winchester, Ellen ;
Ziaugra, Liuda ;
Altshuler, David ;
Shen, Yan .
NATURE, 2007, 449 (7164) :851-U3
[6]   Discovery of expression QTLs using large-scale transcriptional profiling in human lymphocytes [J].
Goering, Harald H. H. ;
Curran, Joanne E. ;
Johnson, Matthew P. ;
Dyer, Thomas D. ;
Charlesworth, Jac ;
Cole, Shelley A. ;
Jowett, Jeremy B. M. ;
Abraham, Lawrence J. ;
Rainwater, David L. ;
Comuzzie, Anthony G. ;
Mahaney, Michael C. ;
Almasy, Laura ;
MacCluer, Jean W. ;
Kissebah, Ahmed H. ;
Collier, Gregory R. ;
Moses, Eric K. ;
Blangero, John .
NATURE GENETICS, 2007, 39 (10) :1208-1216
[7]  
HILLER D, 2009, BIOINFORMATICS, P93901
[8]   Ensembl 2009 [J].
Hubbard, T. J. P. ;
Aken, B. L. ;
Ayling, S. ;
Ballester, B. ;
Beal, K. ;
Bragin, E. ;
Brent, S. ;
Chen, Y. ;
Clapham, P. ;
Clarke, L. ;
Coates, G. ;
Fairley, S. ;
Fitzgerald, S. ;
Fernandez-Banet, J. ;
Gordon, L. ;
Graf, S. ;
Haider, S. ;
Hammond, M. ;
Holland, R. ;
Howe, K. ;
Jenkinson, A. ;
Johnson, N. ;
Kahari, A. ;
Keefe, D. ;
Keenan, S. ;
Kinsella, R. ;
Kokocinski, F. ;
Kulesha, E. ;
Lawson, D. ;
Longden, I. ;
Megy, K. ;
Meidl, P. ;
Overduin, B. ;
Parker, A. ;
Pritchard, B. ;
Rios, D. ;
Schuster, M. ;
Slater, G. ;
Smedley, D. ;
Spooner, W. ;
Spudich, G. ;
Trevanion, S. ;
Vilella, A. ;
Vogel, J. ;
White, S. ;
Wilder, S. ;
Zadissa, A. ;
Birney, E. ;
Cunningham, F. ;
Curwen, V. .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D690-D697
[9]   Mapping short DNA sequencing reads and calling variants using mapping quality scores [J].
Li, Heng ;
Ruan, Jue ;
Durbin, Richard .
GENOME RESEARCH, 2008, 18 (11) :1851-1858
[10]   The Sequence Alignment/Map format and SAMtools [J].
Li, Heng ;
Handsaker, Bob ;
Wysoker, Alec ;
Fennell, Tim ;
Ruan, Jue ;
Homer, Nils ;
Marth, Gabor ;
Abecasis, Goncalo ;
Durbin, Richard .
BIOINFORMATICS, 2009, 25 (16) :2078-2079