SNPs on human chromosomes 21 and 22 - analysis in terms of protein features and pseudogenes

Cited by: 12
Authors
Balasubramanian, S [1]
Harrison, P [1]
Hegyi, H [1]
Bertone, P [1]
Luscombe, N [1]
Echols, N [1]
McGarvey, P [1]
Zhang, ZL [1]
Gerstein, M [1]
Affiliation
[1] Yale Univ, Dept Mol Biophys & Biochem, New Haven, CT 06520 USA
Keywords
nonsynonymous; pseudogenes; secondary structure; SNP; synonymous;
DOI
10.1517/14622416.3.3.393
Chinese Library Classification number
R9 [Pharmacy];
Discipline classification code
1007;
Abstract
SNPs are useful for genome-wide mapping and the study of disease genes. Previous studies have focused on SNPs in specific genes or on SNPs pooled from a variety of different sources. Here, a systematic approach to the analysis of SNPs in relation to various features on a genome-wide scale, with emphasis on protein features and pseudogenes, is presented. We have performed a comprehensive analysis of 39,408 SNPs on human chromosomes 21 and 22 from The SNP Consortium (TSC) database, where SNPs are obtained by random sequencing using consistent and uniform methods. Our study indicates that the occurrence of SNPs is lowest in exons and higher in repeats, introns and pseudogenes. Moreover, in comparing genes and pseudogenes, we find that the SNP density is higher in pseudogenes and that the ratio of nonsynonymous to synonymous changes is also much higher. These observations may be explained by the increased rate of SNP accumulation in pseudogenes, which presumably are not under selective pressure. We have also performed secondary structure prediction on all coding regions and found that there is no preferential distribution of SNPs in α-helices, β-sheets or coils. This could imply that protein structures, in general, can tolerate a wide range of substitutions. Tables relating to our results are available from http://genecensus.org/pseudogene.
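The nonsynonymous-to-synonymous comparison mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names `classify_snp` and `ns_to_s_ratio`, the input format (reference codon, 0-based position within the codon, alternate base), and the choice to count stop-codon changes as nonsynonymous are all assumptions made for illustration. The codon table is the standard genetic code.

```python
from itertools import product

# Standard genetic code, codons enumerated with base order T, C, A, G.
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def classify_snp(ref_codon, pos, alt_base):
    """Label a single-base change in a codon (hypothetical helper).

    ref_codon: three-letter reference codon, e.g. "GAA"
    pos:       0-based position of the SNP within the codon
    alt_base:  alternate base observed at that position
    Assumption: a change to or from a stop codon counts as nonsynonymous.
    """
    ref_codon = ref_codon.upper()
    alt_codon = ref_codon[:pos] + alt_base.upper() + ref_codon[pos + 1:]
    same_residue = CODON_TABLE[ref_codon] == CODON_TABLE[alt_codon]
    return "synonymous" if same_residue else "nonsynonymous"


def ns_to_s_ratio(snps):
    """Ratio of nonsynonymous to synonymous SNP counts (hypothetical helper).

    snps: iterable of (ref_codon, pos, alt_base) tuples.
    Returns None when there are no synonymous SNPs to divide by.
    """
    labels = [classify_snp(*snp) for snp in snps]
    n_syn = labels.count("synonymous")
    n_nonsyn = labels.count("nonsynonymous")
    return n_nonsyn / n_syn if n_syn else None


# Toy input, not data from the paper.
example = [("GAA", 2, "G"), ("GAA", 0, "A"), ("CTT", 2, "C")]
ratio = ns_to_s_ratio(example)
```

Computing this ratio separately for SNPs in genes and in pseudogenes would give the kind of comparison the abstract describes. A raw count ratio like this one does not normalize by the number of possible synonymous and nonsynonymous sites.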
Pages: 393-402
Number of pages: 10