Extraction and annotation of human mitochondrial genomes from 1000 Genomes Whole Exome Sequencing data

被引:52
作者
Diroma, Maria Angela [1 ]
Calabrese, Claudia [2 ]
Simone, Domenico [1 ,3 ]
Santorsola, Mariangela [1 ,4 ]
Calabrese, Francesco Maria [1 ]
Gasparre, Giuseppe [2 ]
Attimonelli, Marcella [1 ]
机构
[1] Univ Bari, Dept Biosci Biotechnol & Biopharmaceut, I-70126 Bari, Italy
[2] Univ Bologna, Dept Med & Surg Sci, I-40138 Bologna, Italy
[3] Univ Calgary, Dept Biol Sci, Calgary, AB T2N 1N4, Canada
[4] Univ Sannio, Dept Sci, I-82100 Benevento, Italy
关键词
DNA HETEROPLASMY; DIRECT REPEATS; CONTROL REGION; CELL-LINE; DISEASE; MUTATIONS; TREE;
D O I
10.1186/1471-2164-15-S3-S2
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 [微生物学]; 090105 [作物生产系统与生态工程];
摘要
Background: Whole Exome Sequencing (WES) is one of the most used and cost-effective next generation technologies that allows sequencing of all nuclear exons. Off-target regions may be captured if they present high sequence similarity with baits. Bioinformatics tools have been optimized to retrieve a large amount of WES off-target mitochondrial DNA (mtDNA), by exploiting the aspecificity of probes, partially overlapping to Nuclear mitochondrial Sequences (NumtS). The 1000 Genomes project represents one of the widest resources to extract mtDNA sequences from WES data, considering the large effort the scientific community is undertaking to reconstruct human population history using mtDNA as marker, and the involvement of mtDNA in pathology. Results: A previously published pipeline aimed at assembling mitochondrial genomes from off-target WES reads and further improved to detect insertions and deletions (indels) and heteroplasmy in a dataset of 1242 samples from the 1000 Genomes project, enabled to obtain a nearly complete mitochondrial genome from 943 samples (76% analyzed exomes). The robustness of our computational strategy was highlighted by the reduction of reads amount recognized as mitochondrial in the original annotation produced by the Consortium, due to NumtS filtering. An accurate survey was carried out on 1242 individuals. 215 indels, mostly heteroplasmic, and 3407 single base variants were mapped. A homogeneous mismatches distribution was observed along the whole mitochondrial genome, while a lower frequency of indels was found within protein-coding regions, where frameshift mutations may be deleterious. The majority of indels and mismatches found were not previously annotated in mitochondrial databases since conventional sequencing methods were limited to homoplasmy or quasi-homoplasmy detection. Intriguingly, upon filtering out non haplogroup-defining variants, we detected a widespread population occurrence of rare events predicted to be damaging. Eventually, samples were stratified into blood-and lymphoblastoid-derived to detect possibly different trends of mutability in the two datasets, an analysis which did not yield significant discordances. Conclusions: To the best of our knowledge, this is likely the most extended population-scale mitochondrial genotyping in humans enriched with the estimation of heteroplasmies.
引用
收藏
页数:15
相关论文
共 43 条
[1]
A method and server for predicting damaging missense mutations [J].
Adzhubei, Ivan A. ;
Schmidt, Steffen ;
Peshkin, Leonid ;
Ramensky, Vasily E. ;
Gerasimova, Anna ;
Bork, Peer ;
Kondrashov, Alexey S. ;
Sunyaev, Shamil R. .
NATURE METHODS, 2010, 7 (04) :248-249
[2]
An integrated map of genetic variation from 1,092 human genomes [J].
Altshuler, David M. ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Donnelly, Peter ;
Eichler, Evan E. ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Green, Eric D. ;
Hurles, Matthew E. ;
Knoppers, Bartha M. ;
Korbel, Jan O. ;
Lander, Eric S. ;
Lee, Charles ;
Lehrach, Hans ;
Mardis, Elaine R. ;
Marth, Gabor T. ;
McVean, Gil A. ;
Nickerson, Deborah A. ;
Schmidt, Jeanette P. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Dinh, Huyen ;
Kovar, Christie ;
Lee, Sandra ;
Lewis, Lora ;
Muzny, Donna ;
Reid, Jeff ;
Wang, Min ;
Wang, Jun ;
Fang, Xiaodong ;
Guo, Xiaosen ;
Jian, Min ;
Jiang, Hui ;
Jin, Xin ;
Li, Guoqing ;
Li, Jingxiang ;
Li, Yingrui ;
Li, Zhuo ;
Liu, Xiao ;
Lu, Yao ;
Ma, Xuedi ;
Su, Zhe ;
Tai, Shuaishuai ;
Tang, Meifang .
NATURE, 2012, 491 (7422) :56-65
[3]
Exome sequencing as a tool for Mendelian disease gene discovery [J].
Bamshad, Michael J. ;
Ng, Sarah B. ;
Bigham, Abigail W. ;
Tabor, Holly K. ;
Emond, Mary J. ;
Nickerson, Deborah A. ;
Shendure, Jay .
NATURE REVIEWS GENETICS, 2011, 12 (11) :745-755
[4]
Consistent treatment of length variants in the human mtDNA control region: a reappraisal [J].
Bandelt, H. -J. ;
Parson, W. .
INTERNATIONAL JOURNAL OF LEGAL MEDICINE, 2008, 122 (01) :11-21
[5]
A "Copernican" Reassessment of the Human Mitochondrial DNA Tree from its Root [J].
Behar, Doron M. ;
van Oven, Mannis ;
Rosset, Saharon ;
Metspalu, Malt ;
Loogvali, Eva-Liis ;
Silva, Nuno M. ;
Kivisild, Toomas ;
Torroni, Antonio ;
Villems, Richard .
AMERICAN JOURNAL OF HUMAN GENETICS, 2012, 90 (04) :675-684
[6]
Functional Annotations Improve the Predictive Score of Human Disease-Related Mutations in Proteins [J].
Calabrese, Remo ;
Capriotti, Emidio ;
Fariselli, Piero ;
Martelli, Pier Luigi ;
Casadio, Rita .
HUMAN MUTATION, 2009, 30 (08) :1237-1244
[7]
Mitochondrial DNA G10398A polymorphism and invasive breast cancer in African-American women [J].
Canter, JA ;
Kallianpur, AR ;
Parl, FF ;
Millikan, RC .
CANCER RESEARCH, 2005, 65 (17) :8028-8033
[8]
Mitochondrial genetics [J].
Chinnery, Patrick Francis ;
Hudson, Gavin .
BRITISH MEDICAL BULLETIN, 2013, 106 (01) :135-159
[9]
HETEROPLASMY IN THE CONTROL REGION OF HUMAN MITOCHONDRIAL-DNA [J].
COMAS, D ;
PAABO, S ;
BERTRANPETIT, J .
GENOME RESEARCH, 1995, 5 (01) :89-90
[10]
Variation in genome-wide mutation rates within and between human families [J].
Conrad, Donald F. ;
Keebler, Jonathan E. M. ;
DePristo, Mark A. ;
Lindsay, Sarah J. ;
Zhang, Yujun ;
Casals, Ferran ;
Idaghdour, Youssef ;
Hartl, Chris L. ;
Torroja, Carlos ;
Garimella, Kiran V. ;
Zilversmit, Martine ;
Cartwright, Reed ;
Rouleau, Guy A. ;
Daly, Mark ;
Stone, Eric A. ;
Hurles, Matthew E. ;
Awadalla, Philip .
NATURE GENETICS, 2011, 43 (07) :712-U137