Toward high-throughput genotyping: Dynamic and automatic software for manipulating large-scale genotype data using fluorescently labeled dinucleotide markers

被引:52
作者
Li, JL
Deng, HY
Lai, DB
Xu, FH
Chen, J
Gao, GM
Recker, RR
Deng, HW [1 ]
机构
[1] Creighton Univ, Osteoporosis Res Ctr, Omaha, NE 68131 USA
[2] Creighton Univ, Dept Math & Comp Sci, Omaha, NE 68131 USA
[3] Creighton Univ, Dept Biomed Sci, Omaha, NE 68131 USA
[4] Boys Town Natl Res Hosp, Ctr Hereditary Commun Disorders, Omaha, NE 68131 USA
[5] Hunan Normal Univ, Coll Life Sci, Lab Mol & Stat Genet, Changsha 410081, Peoples R China
关键词
D O I
10.1101/gr.159701
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
To efficiently manipulate large amounts of genotype data generated with fluorescently labeled dinucleotide markers, we developed a Microsoft Access database management system, named GenoDB. GenoDB offers several advantages. First, it accommodates the dynamic nature of the accumulations of genotype data during the genotyping process; some data need to be confirmed or replaced by repeat lab procedures. By using GenoDB, the raw genotype data can be imported easily and continuously and incorporated into the database during the genotyping process that may continue over an extended period of time in large projects. Second, almost all of the procedures are automatic, including autocomparison of the raw data read by different technicians from the same gel, autoadjustment among the allele fragment-size data from cross-runs or cross-platforms, autobinning of alleles, and autocompilation of genotype data for suitable programs to perform inheritance check in pedigrees. Third, GenoDB provides functions to track electrophoresis gel files to locate gel or sample sources for any resultant genotype data, which is extremely helpful for double-checking consistency of raw and final data and for directing repeat experiments. In addition, the user-friendly graphic interface of GenoDB renders processing of large amounts of data much less labor-intensive. Furthermore, GenoDB has built-in mechanisms to detect some genotyping errors and to assess the quality of genotype data that then are summarized in the statistic reports automatically generated by GenoDB. The GenoDB can easily handle >500,000 genotype data entries, a number more than sufficient for typical whole-genome linkage studies. The modules and programs we developed for the GenoDB can be extended to other database platforms, such as Microsoft SQL server, if the capability to handle still greater quantities of genotype data simultaneously is desired.
引用
收藏
页码:1304 / 1314
页数:11
相关论文
共 28 条
  • [1] Multipoint quantitative-trait linkage analysis in general pedigrees
    Almasy, L
    Blangero, J
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 1998, 62 (05) : 1198 - 1211
  • [2] PhenoDB: An integrated client/server database for linkage and population genetics
    Cheung, KH
    Nadkarni, P
    Silverstein, S
    Kidd, JR
    Pakstis, AJ
    Miller, P
    Kidd, KK
    [J]. COMPUTERS AND BIOMEDICAL RESEARCH, 1996, 29 (04): : 327 - 337
  • [3] *CYB, CYB 2000 TUT INTR TR
  • [4] Genetic determination of Colles' fracture and differential bone mass in women with and without Colles' fracture
    Deng, HW
    Chen, WM
    Recker, S
    Stegman, MR
    Li, JL
    Davies, KM
    Zhou, Y
    Deng, HY
    Heaney, R
    Recker, RR
    [J]. JOURNAL OF BONE AND MINERAL RESEARCH, 2000, 15 (07) : 1243 - 1252
  • [5] Deng HW, 2000, GENET EPIDEMIOL, V19, P160, DOI 10.1002/1098-2272(200009)19:2<160::AID-GEPI4>3.0.CO
  • [6] 2-H
  • [7] Deng HW, 1998, J CLIN DENSITOM, V1, P339, DOI 10.1385/JCD:1:4:339
  • [8] DENG HW, 2001, IN PRESS J CLIN ENDO
  • [9] DENG HW, 2001, UNPUB NAT GENET
  • [10] Ghosh S, 1996, ANNU REV MED, V47, P333