共 31 条
Whole-genome haplotyping using long reads and statistical methods
被引:124
作者:
Kuleshov, Volodymyr
[1
,2
,3
]
Xie, Dan
[3
]
Chen, Rui
[3
]
Pushkarev, Dmitry
[2
]
Ma, Zhihai
[3
]
Blauwkamp, Tim
[2
]
Kertesz, Michael
[2
]
Snyder, Michael
[3
]
机构:
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[2] Illumina Inc, San Diego, CA USA
[3] Stanford Univ, Sch Med, Dept Genet, Stanford, CA 94305 USA
基金:
美国国家卫生研究院;
关键词:
IMPRINTED GENES;
SEQUENCE;
AMPLIFICATION;
POLYMORPHISM;
D O I:
10.1038/nbt.2833
中图分类号:
Q81 [生物工程学(生物技术)];
Q93 [微生物学];
学科分类号:
071005 ;
0836 ;
090102 ;
100705 ;
摘要:
The rapid growth of sequencing technologies has greatly contributed to our understanding of human genetics. Yet, despite this growth, mainstream technologies have not been fully able to resolve the diploid nature of the human genome. Here we describe statistically aided, long-read haplotyping (SLRH), a rapid, accurate method that uses a statistical algorithm to take advantage of the partially phased information contained in long genomic fragments analyzed by short-read sequencing. For a human sample, as little as 30 Gbp of additional sequencing data are needed to phase genotypes identified by 50x coverage whole-genome sequencing. Using SLRH, we phase 99% of single-nucleotide variants in three human genomes into long haplotype blocks 0.2-1 Mbp in length. We apply our method to determine allele-specific methylation patterns in a human genome and identify hundreds of differentially methylated regions that were previously unknown. SLRH should facilitate population-scale haplotyping of human genomes.
引用
收藏
页码:261 / 266
页数:6
相关论文