Overlapping genomic sequences: A treasure trove of single-nucleotide polymorphisms

被引:110
作者
Taillon-Miller, P
Gu, ZJ
Li, Q
Hillier, L
Kwok, PY [1 ]
机构
[1] Washington Univ, Sch Med, Div Dermatol, St Louis, MO 63110 USA
[2] Washington Univ, Sch Med, Genome Sequencing Ctr, St Louis, MO 63110 USA
来源
GENOME RESEARCH | 1998年 / 8卷 / 07期
关键词
D O I
10.1101/gr.8.7.748
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
dAn efficient strategy to develop a dense set of single-nucleotide polymorphism [SNP] markers is to take advantage of the human genome sequencing effort currently under way. Our approach is based oil the fact that bacterial artificial chromosomes (BACs) and PI-based artificial chromosomes (PACs) used ill long-range sequencing projects come from diploid libraries. If the overlapping clones sequenced are from different lineages, one is comparing the sequences from 2 homologous chromosomes in the overlapping region. We have analyzed in detail every SNP identified while sequencing three sets of overlapping clones found on chromosome 5p15.2, 7q21-7q22, and 13q12-13q13. In the 200.6 kb of DNA sequence analyzed in these overlaps, 153 SNPs were identified. Computer analysis for repetitive elements and suitability for STS development yielded 44 STSs containing 68 SNPs for further study. All 68 SNPs were confirmed to be present in at least one of the three (Caucasian, African-American, Hispanic) populations studied. Furthermore, 42 of the SNPs tested (62%) were informative in at least one population, 32 (47%) were informative in two or more populations, and 23 (34%) were informative in all three populations. These results clearly indicate that developing SNP markers from overlapping genomic sequence is highly efficient and cost effective, requiring only the two simple steps of developing STSs around the known SNPs and characterizing them in the appropriate populations.
引用
收藏
页码:748 / 754
页数:7
相关论文
共 13 条
[1]   A new DNA sequence assembly program [J].
Bonfield, JK ;
Smith, KF ;
Staden, R .
NUCLEIC ACIDS RESEARCH, 1995, 23 (24) :4992-4999
[2]   A NEW 5-YEAR PLAN FOR THE UNITED-STATES HUMAN GENOME PROJECT [J].
COLLINS, F ;
GALAS, D .
SCIENCE, 1993, 262 (5130) :43-46
[3]   Variations on a theme: Cataloging human DNA sequence variation [J].
Collins, FS ;
Guyer, MS ;
Chakravarti, A .
SCIENCE, 1997, 278 (5343) :1580-1581
[4]   AN ESTIMATE OF UNIQUE DNA-SEQUENCE HETEROZYGOSITY IN THE HUMAN GENOME [J].
COOPER, DN ;
SMITH, BA ;
COOKE, HJ ;
NIEMANN, S ;
SCHMIDTKE, J .
HUMAN GENETICS, 1985, 69 (03) :201-205
[5]   Base-calling of automated sequencer traces using phred.: I.: Accuracy assessment [J].
Ewing, B ;
Hillier, L ;
Wendl, MC ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :175-185
[6]  
Hillier L, 1991, PCR Methods Appl, V1, P124
[7]  
Kimura Motoo., 1985, The Neutral Theory of Molecular Evolution
[8]   The use of a genetic map of biallelic markers in linkage studies [J].
Kruglyak, L .
NATURE GENETICS, 1997, 17 (01) :21-24
[9]   COMPARATIVE-ANALYSIS OF HUMAN DNA VARIATIONS BY FLUORESCENCE-BASED SEQUENCING OF PCR PRODUCTS [J].
KWOK, PY ;
CARLSON, C ;
YAGER, TD ;
ANKENER, W ;
NICKERSON, DA .
GENOMICS, 1994, 23 (01) :138-144
[10]   Increasing the information content of STS-based genome maps: Identifying polymorphisms in mapped STSs [J].
Kwok, PY ;
Deng, Q ;
Zakeri, H ;
Taylor, SL ;
Nickerson, DA .
GENOMICS, 1996, 31 (01) :123-126