Computerized polymorphic marker identification: Experimental validation and a predicted human polymorphism catalog

被引:50
作者
Fondon, JW
Mele, GM
Brezinschek, RI
Cummings, D
Pande, A
Wren, J
O'Brien, KM
Kupfer, KC
Wei, MH
Lerman, M
Minna, JD
Garner, HR
机构
[1] Univ Texas, SW Med Ctr, McDermott Ctr Human Growth & Dev, Dallas, TX 75235 USA
[2] Univ Texas, SW Med Ctr, Ctr Biomed Invent, Dallas, TX 75235 USA
[3] Univ Texas, SW Med Ctr, Hamon Ctr Therapeut Oncol Res, Dallas, TX 75235 USA
[4] NCI, Frederick Canc Res & Dev Ctr, Immunobiol Lab, Frederick, MD 21702 USA
关键词
D O I
10.1073/pnas.95.13.7514
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
A computational system for the prediction of polymorphic loci directly and efficiently from human genomic sequence was developed and verified. A suite of programs, collectively called POMPOUS (polymorphic marker prediction of ubiquitous simple sequences) detects tandem repeats ranging from dinucleotides up to 250 mers, scores them according to predicted level of polymorphism, and designs appropriate flanking primers for PCR amplification. This approach was validated on an approximately 750-kilobase region of human chromosome 3p213, involved in lung and breast carcinoma homozygous deletions. Target DNA from 36 paired B lymphoblastoid and lung cancer lines was amplified and allelotyped for 33 loci predicted by POMPOUS to be variable in repeat size,We found that among those 36 predominately Caucasian individuals 22 of the 33 (67%) predicted loci were polymorphic with an average heterozygosity of 0.42, Allele loss in this region was found in 27/36 (75%) of the tumor lines using these markers. POMPOUS provides the genetic researcher with an additional tool for the rapid and efficient identification of polymorphic markers, and through a World Wide Web site, investigators can use POMPOUS to identify polymorphic markers for their research. A catalog of 13,261 potential polymorphic markers and associated primer sets has been created from the analysis of 141,779,504 base pairs of human genomic sequence in GenBank. This data is available on our Web site (pompous.swmed.edu) and will be updated periodically as GenBank is expanded and algorithm accuracy is improved.
引用
收藏
页码:7514 / 7519
页数:6
相关论文
共 39 条
  • [1] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [2] Andreassen R, 1996, AM J HUM GENET, V59, P360
  • [3] Biology and applications of human minisatellite loci
    Armour, John A. L.
    Jeffreys, Alec J.
    [J]. CURRENT OPINION IN GENETICS & DEVELOPMENT, 1992, 2 (06) : 850 - 856
  • [4] A METHOD FOR FAST DATABASE SEARCH FOR ALL K-NUCLEOTIDE REPEATS
    BENSON, G
    WATERMAN, MS
    [J]. NUCLEIC ACIDS RESEARCH, 1994, 22 (22) : 4828 - 4836
  • [5] METHODS AND ALGORITHMS FOR STATISTICAL-ANALYSIS OF PROTEIN SEQUENCES
    BRENDEL, V
    BUCHER, P
    NOURBAKHSH, IR
    BLAISDELL, BE
    KARLIN, S
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (06) : 2002 - 2006
  • [6] BLAZE (TM) - AN IMPLEMENTATION OF THE SMITH-WATERMAN SEQUENCE COMPARISON ALGORITHM ON A MASSIVELY-PARALLEL COMPUTER
    BRUTLAG, DL
    DAUTRICOURT, JP
    DIAZ, R
    FIER, J
    MOXON, B
    STAMM, R
    [J]. COMPUTERS & CHEMISTRY, 1993, 17 (02): : 203 - 207
  • [7] FREQUENCY AND POLYMORPHISM OF SIMPLE SEQUENCE REPEATS IN A CONTIGUOUS 685-KB DNA-SEQUENCE CONTAINING THE HUMAN T-CELL RECEPTOR BETA-CHAIN GENE-COMPLEX
    CHARMLEY, P
    CONCANNON, P
    HOOD, L
    ROWEN, L
    [J]. GENOMICS, 1995, 29 (03) : 760 - 765
  • [8] MATS - A RAPID AND EFFICIENT METHOD FOR THE DEVELOPMENT OF MICROSATELLITE MARKERS FROM YACS
    CHEN, H
    PULIDO, JC
    DUYK, GM
    [J]. GENOMICS, 1995, 25 (01) : 1 - 8
  • [9] Chung GTY, 1995, ONCOGENE, V11, P2591
  • [10] INFORMATION ENHANCEMENT METHODS FOR LARGE-SCALE SEQUENCE-ANALYSIS
    CLAVERIE, JM
    STATES, DJ
    [J]. COMPUTERS & CHEMISTRY, 1993, 17 (02): : 191 - 201