Mapping the Human Reference Genome's Missing Sequence by Three-Way Admixture in Latino Genomes

被引:26
作者
Genovese, Giulio [1 ,2 ,3 ]
Handsaker, Robert E. [2 ,3 ]
Li, Heng [2 ,3 ]
Kenny, Eimear E. [4 ,5 ,6 ,7 ,8 ]
McCarroll, Steven A. [1 ,2 ,3 ]
机构
[1] Broad Inst MIT & Harvard, Stanley Ctr Psychiat Res, Cambridge, MA 02142 USA
[2] Broad Inst MIT & Harvard, Program Med & Populat Genet, Cambridge, MA 02142 USA
[3] Harvard Univ, Sch Med, Dept Genet, Boston, MA 02115 USA
[4] Stanford Univ, Dept Genet, Stanford, CA 94305 USA
[5] Icahn Sch Med Mt Sinai, Dept Genet & Genom Sci, New York, NY 10029 USA
[6] Icahn Sch Med Mt Sinai, Charles Bronfman Inst Personalized Med, New York, NY 10029 USA
[7] Icahn Sch Med Mt Sinai, Ctr Stat Genet, New York, NY 10029 USA
[8] Icahn Sch Med Mt Sinai, Inst Genom & Multiscale Biol, New York, NY 10029 USA
关键词
COPY NUMBER; DNA PRIMASE; DISCOVERY; POLYMORPHISM; CHROMOSOMES; ASSEMBLIES; DIVERSITY; EVOLUTION; GENES; MAPS;
D O I
10.1016/j.ajhg.2013.07.002
中图分类号
Q3 [遗传学];
学科分类号
071007 [遗传学];
摘要
A principal obstacle to completing maps and analyses of the human genome involves the genome's "inaccessible" regions: sequences (often euchromatic and containing genes) that are isolated from the rest of the euchromatic genome by heterochromatin and other repeat-rich sequence. We describe a way to localize these sequences by using ancestry linkage disequilibrium in populations that derive ancestry from at least three continents, as is the case for Latinos. We used this approach to map the genomic locations of almost 20 mega-bases of sequence unlocalized or missing from the current human genome reference (NCBI Genome GRCh37)-a substantial fraction of the human genome's remaining unmapped sequence. We show that the genomic locations of most sequences that originated from fosmids and larger clones can be admixture mapped in this way, by using publicly available whole-genome sequence data. Genome assembly efforts and future builds of the human genome reference will be strongly informed by this localization of genes and other euchromatic sequences that are embedded within highly repetitive pericentromeric regions.
引用
收藏
页码:411 / 421
页数:11
相关论文
共 44 条
[1]
Limitations of next-generation genome sequence assembly [J].
Alkan, Can ;
Sajjadian, Saba ;
Eichler, Evan E. .
NATURE METHODS, 2011, 8 (01) :61-65
[2]
[Anonymous], 2012, Nature
[3]
Fast and accurate inference of local ancestry in Latino populations [J].
Baran, Yael ;
Pasaniuc, Bogdan ;
Sankararaman, Sriram ;
Torgerson, Dara G. ;
Gignoux, Christopher ;
Eng, Celeste ;
Rodriguez-Cintron, William ;
Chapela, Rocio ;
Ford, Jean G. ;
Avila, Pedro C. ;
Rodriguez-Santana, Jose ;
Burchard, Esteban Gonzalez ;
Halperin, Eran .
BIOINFORMATICS, 2012, 28 (10) :1359-1367
[4]
Benson DA, 2013, NUCLEIC ACIDS RES, V41, pD36, DOI [10.1093/nar/gkn723, 10.1093/nar/gkp1024, 10.1093/nar/gkw1070, 10.1093/nar/gkr1202, 10.1093/nar/gkx1094, 10.1093/nar/gkl986, 10.1093/nar/gkq1079, 10.1093/nar/gks1195, 10.1093/nar/gkg057]
[5]
Genome-wide patterns of population structure and admixture among Hispanic/Latino populations [J].
Bryc, Katarzyna ;
Velez, Christopher ;
Karafet, Tatiana ;
Moreno-Estrada, Andres ;
Reynolds, Andy ;
Auton, Adam ;
Hammer, Michael ;
Bustamante, Carlos D. ;
Ostrer, Harry .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 :8954-8961
[6]
HOMOLOGOUS ALPHA-SATELLITE SEQUENCES ON HUMAN ACROCENTRIC CHROMOSOMES WITH SELECTIVITY FOR CHROMOSOMES 13, 14 AND 21 - IMPLICATIONS FOR RECOMBINATION BETWEEN NONHOMOLOGUES AND ROBERTSONIAN TRANSLOCATIONS [J].
CHOO, KH ;
VISSEL, B ;
BROWN, R ;
FILBY, RG ;
EARLE, E .
NUCLEIC ACIDS RESEARCH, 1988, 16 (04) :1273-1283
[7]
EVOLUTION OF ALPHA-SATELLITE DNA ON HUMAN ACROCENTRIC CHROMOSOMES [J].
CHOO, KH ;
VISSEL, B ;
EARLE, E .
GENOMICS, 1989, 5 (02) :332-344
[8]
Population stratification confounds genetic association studies among Latinos [J].
Choudhry, S ;
Coyle, NE ;
Tang, H ;
Salari, K ;
Lind, D ;
Clark, SL ;
Tsai, HJ ;
Naqvi, M ;
Phong, A ;
Ung, N ;
Matallana, H ;
Avila, PC ;
Casal, J ;
Torres, A ;
Nazario, S ;
Castro, R ;
Battle, NC ;
Perez-Stable, EJ ;
Kwok, PY ;
Sheppard, D ;
Shriver, MD ;
Rodriguez-Cintron, W ;
Risch, N ;
Ziv, E ;
Burchard, EG .
HUMAN GENETICS, 2006, 118 (05) :652-664
[9]
Lack of genomic imprinting of DNA primase, polypeptide 2 (PRIM2) in human term placenta and white blood cells [J].
Chung, Jaewook ;
Tsai, Shengdar ;
James, Andra H. ;
Thames, Betty H. ;
Shytle, Stephanie ;
Piedrahita, Jorge A. .
EPIGENETICS, 2012, 7 (05) :429-431
[10]
Modernizing Reference Genome Assemblies [J].
Church, Deanna M. ;
Schneider, Valerie A. ;
Graves, Tina ;
Auger, Katherine ;
Cunningham, Fiona ;
Bouk, Nathan ;
Chen, Hsiu-Chuan ;
Agarwala, Richa ;
McLaren, William M. ;
Ritchie, Graham R. S. ;
Albracht, Derek ;
Kremitzki, Milinn ;
Rock, Susan ;
Kotkiewicz, Holland ;
Kremitzki, Colin ;
Wollam, Aye ;
Trani, Lee ;
Fulton, Lucinda ;
Fulton, Robert ;
Matthews, Lucy ;
Whitehead, Siobhan ;
Chow, Will ;
Torrance, James ;
Dunn, Matthew ;
Harden, Glenn ;
Threadgold, Glen ;
Wood, Jonathan ;
Collins, Joanna ;
Heath, Paul ;
Griffiths, Guy ;
Pelan, Sarah ;
Grafham, Darren ;
Eichler, Evan E. ;
Weinstock, George ;
Mardis, Elaine R. ;
Wilson, Richard K. ;
Howe, Kerstin ;
Flicek, Paul ;
Hubbard, Tim .
PLOS BIOLOGY, 2011, 9 (07)