A uniform survey of allele-specific binding and expression over 1000-Genomes-Project individuals

被引:52
作者
Chen, Jieming [1 ,2 ]
Rozowsky, Joel [1 ,3 ]
Galeev, Timur R. [1 ,3 ]
Harmanci, Arif [1 ,3 ]
Kitchen, Robert [1 ,3 ]
Bedford, Jason [1 ]
Abyzov, Alexej [1 ,3 ,6 ]
Kong, Yong [4 ,5 ]
Regan, Lynne [1 ,2 ,3 ]
Gerstein, Mark [1 ,2 ,3 ,4 ]
机构
[1] Yale Univ, Program Computat Biol & Bioinformat, New Haven, CT 06520 USA
[2] Yale Univ, Integrated Grad Program Phys & Engn Biol, New Haven, CT 06520 USA
[3] Yale Univ, Dept Mol Biophys & Biochem, POB 6666, New Haven, CT 06520 USA
[4] Yale Univ, Dept Comp Sci, POB 2158, New Haven, CT 06520 USA
[5] Yale Univ, Keck Biotechnol Resource Lab, New Haven, CT 06511 USA
[6] Mayo Clin, Dept Hlth Sci Res, Ctr Individualized Med, Rochester, MN 55905 USA
关键词
MONOALLELIC GENE-EXPRESSION; GENOME; TRANSCRIPTION; FRAMEWORK; VARIANTS; GENOTYPE; BIAS; IDENTIFICATION; ANNOTATION; LANDSCAPE;
D O I
10.1038/ncomms11101
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Large-scale sequencing in the 1000 Genomes Project has revealed multitudes of single nucleotide variants (SNVs). Here, we provide insights into the functional effect of these variants using allele-specific behaviour. This can be assessed for an individual by mapping ChIP-seq and RNA-seq reads to a personal genome, and then measuring 'allelic imbalances' between the numbers of reads mapped to the paternal and maternal chromosomes. We annotate variants associated with allele-specific binding and expression in 382 individuals by uniformly processing 1,263 functional genomics data sets, developing approaches to reduce the heterogeneity between data sets due to overdispersion and mapping bias. Since many allelic variants are rare, aggregation across multiple individuals is necessary to identify broadly applicable 'allelic elements'. We also found SNVs for which we can anticipate allelic imbalance from the disruption of a binding motif. Our results serve as an allele-specific annotation for the 1000 Genomes variant catalogue and are distributed as an online resource (alleledb.gersteinlab.org).
引用
收藏
页数:13
相关论文
共 70 条
[41]   Transcriptome genetics using second generation sequencing in a Caucasian population [J].
Montgomery, Stephen B. ;
Sammeth, Micha ;
Gutierrez-Arcelus, Maria ;
Lach, Radoslaw P. ;
Ingle, Catherine ;
Nisbett, James ;
Guigo, Roderic ;
Dermitzakis, Emmanouil T. .
NATURE, 2010, 464 (7289) :773-U151
[42]   A census of mammalian imprinting [J].
Morison, IM ;
Ramsay, JP ;
Spencer, HG .
TRENDS IN GENETICS, 2005, 21 (08) :457-465
[43]   The imprinted gene and parent-of-origin effect database [J].
Morison, IM ;
Paton, CJ ;
Cleverley, SD .
NUCLEIC ACIDS RESEARCH, 2001, 29 (01) :275-276
[44]   Implementing a successful data-management framework: the UK10K managed access model [J].
Muddyman, Dawn ;
Smee, Carol ;
Griffin, Heather ;
Kaye, Jane .
GENOME MEDICINE, 2013, 5
[45]  
Olender Tsviya, 2013, Methods Mol Biol, V1003, P23, DOI 10.1007/978-1-62703-377-0_2
[46]   Allelic mapping bias in RNA-sequencing is not a major confounder in eQTL studies [J].
Panousis, Nikolaos I. ;
Gutierrez-Arcelus, Maria ;
Dermitzakis, Emmanouil T. ;
Lappalainen, Tuuli .
GENOME BIOLOGY, 2014, 15 (09) :467
[47]   Accurate whole-genome sequencing and haplotyping from 10 to 20 human cells [J].
Peters, Brock A. ;
Kermani, Bahram G. ;
Sparks, Andrew B. ;
Alferov, Oleg ;
Hong, Peter ;
Alexeev, Andrei ;
Jiang, Yuan ;
Dahl, Fredrik ;
Tang, Y. Tom ;
Haas, Juergen ;
Robasky, Kimberly ;
Zaranek, Alexander Wait ;
Lee, Je-Hyuk ;
Ball, Madeleine Price ;
Peterson, Joseph E. ;
Perazich, Helena ;
Yeung, George ;
Liu, Jia ;
Chen, Linsu ;
Kennemer, Michael I. ;
Pothuraju, Kaliprasad ;
Konvicka, Karel ;
Tsoupko-Sitnikov, Mike ;
Pant, Krishna P. ;
Ebert, Jessica C. ;
Nilsen, Geoffrey B. ;
Baccash, Jonathan ;
Halpern, Aaron L. ;
Church, George M. ;
Drmanac, Radoje .
NATURE, 2012, 487 (7406) :190-195
[48]   Understanding mechanisms underlying human gene expression variation with RNA sequencing [J].
Pickrell, Joseph K. ;
Marioni, John C. ;
Pai, Athma A. ;
Degner, Jacob F. ;
Engelhardt, Barbara E. ;
Nkadori, Everlyne ;
Veyrieras, Jean-Baptiste ;
Stephens, Matthew ;
Gilad, Yoav ;
Pritchard, Jonathan K. .
NATURE, 2010, 464 (7289) :768-772
[49]   A genome-wide approach to identifying novel-imprinted genes [J].
Pollard, Katherine S. ;
Serre, David ;
Wang, Xu ;
Tao, Heng ;
Grundberg, Elin ;
Hudson, Thomas J. ;
Clark, Andrew G. ;
Frazer, Kelly .
HUMAN GENETICS, 2008, 122 (06) :625-634
[50]   Pooled Association Tests for Rare Variants in Exon-Resequencing Studies [J].
Price, Alkes L. ;
Kryukov, Gregory V. ;
de Bakker, Paul I. W. ;
Purcell, Shaun M. ;
Staples, Jeff ;
Wei, Lee-Jen ;
Sunyaev, Shamil R. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2010, 86 (06) :832-838