Quantitative analysis of population-scale family trees with millions of relatives

Cited by: 132
Authors
Kaplanis, Joanna [1, 2]
Gordon, Assaf [1, 2]
Shor, Tal [3, 4]
Weissbrod, Omer [5]
Geiger, Dan [4]
Wahl, Mary [1, 2, 6]
Gershovits, Michael [2]
Markus, Barak [2]
Sheikh, Mona [2]
Gymrek, Melissa [1, 2, 7, 8, 9]
Bhatia, Gaurav [10, 11]
MacArthur, Daniel G. [7, 9, 10]
Price, Alkes L. [10, 11, 12]
Erlich, Yaniv [1, 2, 3, 13, 14]
Affiliations
[1] New York Genome Ctr, New York, NY 10013 USA
[2] Whitehead Inst Biomed Res, 9 Cambridge Ctr, Cambridge, MA 02142 USA
[3] MyHeritage, IL-6037606 Or Yehuda, Israel
[4] Technion Israel Inst Technol, Comp Sci Dept, IL-3200003 Haifa, Israel
[5] Weizmann Inst Sci, Comp Sci Dept, IL-7610001 Rehovot, Israel
[6] Harvard Univ, Dept Mol & Cellular Biol, Cambridge, MA 02138 USA
[7] Harvard Med Sch, Boston, MA 02115 USA
[8] Harvard MIT Program Hlth Sci & Technol, Cambridge, MA 02142 USA
[9] Massachusetts Gen Hosp, Analyt & Translat Genet Unit, Boston, MA 02114 USA
[10] Broad Inst MIT & Harvard, Program Med & Populat Genet, Cambridge, MA 02142 USA
[11] Harvard Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[12] Harvard Sch Publ Hlth, Dept Epidemiol, Boston, MA USA
[13] Columbia Univ, Dept Comp Sci, Fu Fdn Sch Engn, New York, NY 10027 USA
[14] Columbia Univ, Dept Syst Biol, Ctr Computat Biol & Bioinformat, New York, NY 10027 USA
Keywords
GENE INTERACTIONS; HUMAN LONGEVITY; COMPLEX TRAITS; LIFE-SPAN; HERITABILITY; ASSOCIATION; HISTORY; FERTILITY; EPISTASIS; ERA;
DOI
10.1126/science.aam9309
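Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09
Abstract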
Family trees have vast applications in fields as diverse as genetics, anthropology, and economics. However, the collection of extended family trees is tedious and usually relies on resources with limited geographical scope and complex data usage restrictions. We collected 86 million profiles from publicly available online data shared by genealogy enthusiasts. After extensive cleaning and validation, we obtained population-scale family trees, including a single pedigree of 13 million individuals. We leveraged the data to partition the genetic architecture of human longevity and to provide insights into the geographical dispersion of families. We also report a simple digital procedure to overlay other data sets with our resource.
Pages: 171-175
Page count: 5