Computerized polymorphic marker identification: Experimental validation and a predicted human polymorphism catalog

被引:50
作者
Fondon, JW
Mele, GM
Brezinschek, RI
Cummings, D
Pande, A
Wren, J
O'Brien, KM
Kupfer, KC
Wei, MH
Lerman, M
Minna, JD
Garner, HR
机构
[1] Univ Texas, SW Med Ctr, McDermott Ctr Human Growth & Dev, Dallas, TX 75235 USA
[2] Univ Texas, SW Med Ctr, Ctr Biomed Invent, Dallas, TX 75235 USA
[3] Univ Texas, SW Med Ctr, Hamon Ctr Therapeut Oncol Res, Dallas, TX 75235 USA
[4] NCI, Frederick Canc Res & Dev Ctr, Immunobiol Lab, Frederick, MD 21702 USA
关键词
D O I
10.1073/pnas.95.13.7514
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
A computational system for the prediction of polymorphic loci directly and efficiently from human genomic sequence was developed and verified. A suite of programs, collectively called POMPOUS (polymorphic marker prediction of ubiquitous simple sequences) detects tandem repeats ranging from dinucleotides up to 250 mers, scores them according to predicted level of polymorphism, and designs appropriate flanking primers for PCR amplification. This approach was validated on an approximately 750-kilobase region of human chromosome 3p213, involved in lung and breast carcinoma homozygous deletions. Target DNA from 36 paired B lymphoblastoid and lung cancer lines was amplified and allelotyped for 33 loci predicted by POMPOUS to be variable in repeat size,We found that among those 36 predominately Caucasian individuals 22 of the 33 (67%) predicted loci were polymorphic with an average heterozygosity of 0.42, Allele loss in this region was found in 27/36 (75%) of the tumor lines using these markers. POMPOUS provides the genetic researcher with an additional tool for the rapid and efficient identification of polymorphic markers, and through a World Wide Web site, investigators can use POMPOUS to identify polymorphic markers for their research. A catalog of 13,261 potential polymorphic markers and associated primer sets has been created from the analysis of 141,779,504 base pairs of human genomic sequence in GenBank. This data is available on our Web site (pompous.swmed.edu) and will be updated periodically as GenBank is expanded and algorithm accuracy is improved.
引用
收藏
页码:7514 / 7519
页数:6
相关论文
共 39 条
[31]   A COLLECTION OF TRINUCLEOTIDE AND TETRANUCLEOTIDE REPEAT MARKERS USED TO GENERATE HIGH-QUALITY, HIGH-RESOLUTION HUMAN GENOME-WIDE LINKAGE MAPS [J].
SHEFFIELD, VC ;
WEBER, JL ;
BUETOW, KH ;
MURRAY, JC ;
EVEN, DA ;
WILES, K ;
GASTIER, JM ;
PULIDO, JC ;
YANDAVA, C ;
SUNDEN, SL ;
MATTES, G ;
BUSINGA, T ;
MCCLAIN, A ;
BECK, J ;
SCHERPIER, T ;
GILLIAM, J ;
ZHONG, J ;
DUYK, GM .
HUMAN MOLECULAR GENETICS, 1995, 4 (10) :1837-1844
[32]  
SHPAER E, 1996, METHODS MOL BIOL SEQ
[33]   The Staden sequence analysis package [J].
Staden, R .
MOLECULAR BIOTECHNOLOGY, 1996, 5 (03) :233-241
[34]  
Todd S, 1997, CANCER RES, V57, P1344
[35]   POPULATION-GENETICS OF TRINUCLEOTIDE REPEAT POLYMORPHISMS [J].
WATKINS, WS ;
BAMSHAD, M ;
JORDE, LB .
HUMAN MOLECULAR GENETICS, 1995, 4 (09) :1485-1491
[36]   INFORMATIVENESS OF HUMAN (DC-DA)N.(DG-DT)N POLYMORPHISMS [J].
WEBER, JL .
GENOMICS, 1990, 7 (04) :524-530
[37]  
Wei MH, 1996, CANCER RES, V56, P1487
[38]   A 2ND-GENERATION LINKAGE MAP OF THE HUMAN GENOME [J].
WEISSENBACH, J ;
GYAPAY, G ;
DIB, C ;
VIGNAL, A ;
MORISSETTE, J ;
MILLASSEAU, P ;
VAYSSEIX, G ;
LATHROP, M .
NATURE, 1992, 359 (6398) :794-801
[39]   STATISTICS OF LOCAL COMPLEXITY IN AMINO-ACID-SEQUENCES AND SEQUENCE DATABASES [J].
WOOTTON, JC ;
FEDERHEN, S .
COMPUTERS & CHEMISTRY, 1993, 17 (02) :149-163