Large-scale sequencing of the CD33-related Siglec gene cluster in five mammalian species reveals rapid evolution by multiple mechanisms

被引:137
作者
Angata, T
Margulies, EH
Green, ED
Varki, A
机构
[1] Univ Calif San Diego, Glycobiol Res & Training Ctr, La Jolla, CA 92093 USA
[2] Univ Calif San Diego, Dept Med, La Jolla, CA 92093 USA
[3] Univ Calif San Diego, Dept Cellular & Mol Med, La Jolla, CA 92093 USA
[4] NHGRI, Genome Technol Branch, NIH, Bethesda, MD 20892 USA
[5] NHGRI, Intramural Sequencing Ctr, NIH, Bethesda, MD 20892 USA
关键词
D O I
10.1073/pnas.0404833101
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Siglecs are a recently discovered family of animal lectins that belong to the Ig superfamily and recognize sialic acids (Sias). CD33-related Siglecs (CD33rSigiecs) are a subgroup with as-yet-unknown functions, characterized by sequence homology, expression on innate immune cells, conserved cytosolic tyrosine-based signaling motifs, and a clustered localization of their genes. To better understand the biology and evolution of CD33rSiglecs, we sequenced and compared the CD33rSiglec gene cluster from multiple mammalian species. Within the sequenced region, the segments containing CD33rSiglec genes showed a lower degree of sequence conservation. In contrast to the adjacent conserved kallikrein-like genes, the CD33rSiglec genes showed extensive species differences, including expansions of gene subsets; gene deletions, including one human-specific loss of a novel functional primate Siglec (Siglec-13); exon shuffling, generating hybrid genes; accelerated accumulation of nonsynonymous substitutions in the Sia-recognition domain; and multiple instances of mutations of an arginine residue essential for Sia recognition in otherwise intact Siglecs. Nonsynonymous differences between human and chimpanzee orthologs showed uneven distribution between the two 13 sheets of the Sia-recognition domain, suggesting biased mutation accumulation. These data indicate that CD33rSiglec genes are undergoing rapid evolution via multiple genetic mechanisms, possibly due to an evolutionary "arms race" between hosts and pathogens involving Sia recognition. These studies, which reflect one of the most complete comparative sequence analyses of a rapidly evolving gene cluster, provide a clearer picture of the ortholog status of CD33rSiglecs among primates and rodents and also facilitate rational recommendations regarding their nomenclature.
引用
收藏
页码:13251 / 13256
页数:6
相关论文
共 37 条
  • [11] Siglecs, sialic acids and innate immunity
    Crocker, PR
    Varki, A
    [J]. TRENDS IN IMMUNOLOGY, 2001, 22 (06) : 337 - 342
  • [12] Comparative genomics of the MHC: Glimpses into the evolution of the adaptive immune system
    Flajnik, MF
    Kasahara, M
    [J]. IMMUNITY, 2001, 15 (03) : 351 - 362
  • [13] Human-specific regulation of α2-6-linked sialic acids
    Gagneux, P
    Cheriyan, M
    Hurtado-Ziola, N
    van der Linden, ECMB
    Anderson, D
    McClure, H
    Varki, A
    Varki, NM
    [J]. JOURNAL OF BIOLOGICAL CHEMISTRY, 2003, 278 (48) : 48245 - 48250
  • [14] Evolutionary considerations in relating oligosaccharide diversity to biological function
    Gagneux, P
    Varki, A
    [J]. GLYCOBIOLOGY, 1999, 9 (08) : 747 - 755
  • [15] Strategies for the systematic sequencing of complex genomes
    Green, ED
    [J]. NATURE REVIEWS GENETICS, 2001, 2 (08) : 573 - 583
  • [16] Natural selection and the diversification of vertebrate immune effectors
    Hughes, AL
    [J]. IMMUNOLOGICAL REVIEWS, 2002, 190 (01) : 161 - 168
  • [17] Multiple sequence alignment with Clustal x
    Jeanmougin, F
    Thompson, JD
    Gouy, M
    Higgins, DG
    Gibson, TJ
    [J]. TRENDS IN BIOCHEMICAL SCIENCES, 1998, 23 (10) : 403 - 405
  • [18] Initial sequencing and analysis of the human genome
    Lander, ES
    Int Human Genome Sequencing Consortium
    Linton, LM
    Birren, B
    Nusbaum, C
    Zody, MC
    Baldwin, J
    Devon, K
    Dewar, K
    Doyle, M
    FitzHugh, W
    Funke, R
    Gage, D
    Harris, K
    Heaford, A
    Howland, J
    Kann, L
    Lehoczky, J
    LeVine, R
    McEwan, P
    McKernan, K
    Meldrim, J
    Mesirov, JP
    Miranda, C
    Morris, W
    Naylor, J
    Raymond, C
    Rosetti, M
    Santos, R
    Sheridan, A
    Sougnez, C
    Stange-Thomann, N
    Stojanovic, N
    Subramanian, A
    Wyman, D
    Rogers, J
    Sulston, J
    Ainscough, R
    Beck, S
    Bentley, D
    Burton, J
    Clee, C
    Carter, N
    Coulson, A
    Deadman, R
    Deloukas, P
    Dunham, A
    Dunham, I
    Durbin, R
    French, L
    [J]. NATURE, 2001, 409 (6822) : 860 - 921
  • [19] Identification and characterization of multi-species conserved sequences
    Margulies, EH
    Blanchette, M
    Haussler, D
    Green, ED
    [J]. GENOME RESEARCH, 2003, 13 (12) : 2507 - 2518
  • [20] Crystal structure of the N-terminal domain of sialoadhesin in complex with 3′ sialyllactose at 1.85 Å resolution
    May, AP
    Robinson, RC
    Vinson, M
    Crocker, PR
    Jones, EY
    [J]. MOLECULAR CELL, 1998, 1 (05) : 719 - 728