PEMer: a computational framework with simulation-based error models for inferring genomic structural variants from massive paired-end sequencing data

被引:173
作者
Korbel, Jan O. [1 ,2 ,3 ]
Abyzov, Alexej [3 ]
Mu, Xinmeng Jasmine [4 ]
Carriero, Nicholas [5 ]
Cayting, Philip [3 ]
Zhang, Zhengdong [3 ]
Snyder, Michael [3 ,4 ]
Gerstein, Mark B. [3 ,4 ,5 ,6 ]
机构
[1] European Mol Biol Lab, Gene Express Unit, D-69117 Heidelberg, Germany
[2] EMBL EBI, EMBL Outstat Hinxton, Cambridge CB10 1SA, England
[3] Yale Univ, Dept Mol Biophys & Biochem, New Haven, CT 06520 USA
[4] Yale Univ, Dept Mol Cellular & Dev Biol, New Haven, CT 06520 USA
[5] Yale Univ, Dept Comp Sci, New Haven, CT 06511 USA
[6] Yale Univ, Program Computat Biol & Bioinformat, New Haven, CT 06520 USA
来源
GENOME BIOLOGY | 2009年 / 10卷 / 02期
关键词
COPY-NUMBER VARIATION; FINE-SCALE; RESOLUTION; IDENTIFICATION; TOOL; POLYMORPHISM; ELEMENTS; SNPS; MAP;
D O I
10.1186/gb-2009-10-2-r23
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Personal-genomics endeavors, such as the 1000 Genomes project, are generating maps of genomic structural variants by analyzing ends of massively sequenced genome fragments. To process these we developed Paired-End Mapper (PEMer; http://sv.gersteinlab.org/pemer). This comprises an analysis pipeline, compatible with several next-generation sequencing platforms; simulation-based error models, yielding confidence-values for each structural variant; and a back-end database. The simulations demonstrated high structural variant reconstruction efficiency for PEMer's coverage-adjusted multi-cutoff scoring-strategy and showed its relative insensitivity to base-calling errors.
引用
收藏
页数:14
相关论文
共 47 条
  • [41] Relative impact of nucleotide and copy number variation on gene expression phenotypes
    Stranger, Barbara E.
    Forrest, Matthew S.
    Dunning, Mark
    Ingle, Catherine E.
    Beazley, Claude
    Thorne, Natalie
    Redon, Richard
    Bird, Christine P.
    de Grassi, Anna
    Lee, Charles
    Tyler-Smith, Chris
    Carter, Nigel
    Scherer, Stephen W.
    Tavare, Simon
    Deloukas, Panagiotis
    Hurles, Matthew E.
    Dermitzakis, Emmanouil T.
    [J]. SCIENCE, 2007, 315 (5813) : 848 - 853
  • [42] Fine-scale structural variation of the human genome
    Tuzun, E
    Sharp, AJ
    Bailey, JA
    Kaul, R
    Morrison, VA
    Pertz, LM
    Haugen, E
    Hayden, H
    Albertson, D
    Pinkel, D
    Olson, MV
    Eichler, EE
    [J]. NATURE GENETICS, 2005, 37 (07) : 727 - 732
  • [43] High-resolution mapping of DNA copy alterations in human chromosome 22 using high-density tiling oligonucleotide arrays
    Urban, AE
    Korbel, JO
    Selzer, R
    Richmond, T
    Hacker, A
    Popescu, GV
    Clubells, JF
    Green, R
    Emanuel, BS
    Gerstein, MB
    Weissman, SM
    Snyder, M
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (12) : 4534 - 4539
  • [44] Rare structural variants disrupt multiple genes in neurodevelopmental pathways in schizophrenia
    Walsh, Tom
    McClellan, Jon M.
    McCarthy, Shane E.
    Addington, Anjene M.
    Pierce, Sarah B.
    Cooper, Greg M.
    Nord, Alex S.
    Kusenda, Mary
    Malhotra, Dheeraj
    Bhandari, Abhishek
    Stray, Sunday M.
    Rippey, Caitlin F.
    Roccanova, Patricia
    Makarov, Vlad
    Lakshmi, B.
    Findling, Robert L.
    Sikich, Linmarie
    Stromberg, Thomas
    Merriman, Barry
    Gogtay, Nitin
    Butler, Philip
    Eckstrand, Kristen
    Noory, Laila
    Gochman, Peter
    Long, Robert
    Chen, Zugen
    Davis, Sean
    Baker, Carl
    Eichler, Evan E.
    Meltzer, Paul S.
    Nelson, Stanley F.
    Singleton, Andrew B.
    Lee, Ming K.
    Rapoport, Judith L.
    King, Mary-Claire
    Sebat, Jonathan
    [J]. SCIENCE, 2008, 320 (5875) : 539 - 543
  • [45] The complete genome of an individual by massively parallel DNA sequencing
    Wheeler, David A.
    Srinivasan, Maithreyan
    Egholm, Michael
    Shen, Yufeng
    Chen, Lei
    McGuire, Amy
    He, Wen
    Chen, Yi-Ju
    Makhijani, Vinod
    Roth, G. Thomas
    Gomes, Xavier
    Tartaro, Karrie
    Niazi, Faheem
    Turcotte, Cynthia L.
    Irzyk, Gerard P.
    Lupski, James R.
    Chinault, Craig
    Song, Xing-zhi
    Liu, Yue
    Yuan, Ye
    Nazareth, Lynne
    Qin, Xiang
    Muzny, Donna M.
    Margulies, Marcel
    Weinstock, George M.
    Gibbs, Richard A.
    Rothberg, Jonathan M.
    [J]. NATURE, 2008, 452 (7189) : 872 - U5
  • [46] A greedy algorithm for aligning DNA sequences
    Zhang, Z
    Schwartz, S
    Wagner, L
    Miller, W
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (1-2) : 203 - 214
  • [47] PEMER PACKAGE