Manual annotation and analysis of the defensin gene cluster in the C57BL/6J mouse reference genome

被引:36
作者
Amid, Clara [1 ]
Rehaume, Linda M. [2 ]
Brown, Kelly L. [2 ,3 ]
Gilbert, James G. R. [1 ]
Dougan, Gordon [1 ]
Hancock, Robert E. W. [2 ]
Harrow, Jennifer L. [1 ]
机构
[1] Wellcome Trust Sanger Inst, Hinxton CB10 1SA, Cambs, England
[2] Univ British Columbia, Ctr Microbial Dis & Immun Res, Vancouver, BC V6T 1Z4, Canada
[3] Gothenburg Univ, Dept Rheumatol & Inflammat Res, S-41346 Gothenburg, Sweden
来源
BMC GENOMICS | 2009年 / 10卷
基金
美国国家卫生研究院; 英国惠康基金;
关键词
CYSTEINE-RICH PEPTIDES; ANTIMICROBIAL PEPTIDES; RAPID EVOLUTION; ALPHA-DEFENSINS; BETA-DEFENSINS; TRANSCRIPTION; CRYPTDIN; FAMILY; EXPRESSION; DIVERSITY;
D O I
10.1186/1471-2164-10-606
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Host defense peptides are a critical component of the innate immune system. Human alpha-and beta-defensin genes are subject to copy number variation (CNV) and historically the organization of mouse alpha-defensin genes has been poorly defined. Here we present the first full manual genomic annotation of the mouse defensin region on Chromosome 8 of the reference strain C57BL/6J, and the analysis of the orthologous regions of the human and rat genomes. Problems were identified with the reference assemblies of all three genomes. Defensins have been studied for over two decades and their naming has become a critical issue due to incorrect identification of defensin genes derived from different mouse strains and the duplicated nature of this region. Results: The defensin gene cluster region on mouse Chromosome 8 A2 contains 98 gene loci: 53 are likely active defensin genes and 22 defensin pseudogenes. Several TATA box motifs were found for human and mouse defensin genes that likely impact gene expression. Three novel defensin genes belonging to the Cryptdin Related Sequences (CRS) family were identified. All additional mouse defensin loci on Chromosomes 1, 2 and 14 were annotated and unusual splice variants identified. Comparison of the mouse alpha-defensins in the three main mouse reference gene sets Ensembl, Mouse Genome Informatics (MGI), and NCBI RefSeq reveals significant inconsistencies in annotation and nomenclature. We are collaborating with the Mouse Genome Nomenclature Committee (MGNC) to establish a standardized naming scheme for alpha-defensins. Conclusions: Prior to this analysis, there was no reliable reference gene set available for the mouse strain C57BL/6J defensin genes, demonstrating that manual intervention is still critical for the annotation of complex gene families and heavily duplicated regions. Accurate gene annotation is facilitated by the annotation of pseudogenes and regulatory elements. Manually curated gene models will be incorporated into the Ensembl and Consensus Coding Sequence (CCDS) reference sets. Elucidation of the genomic structure of this complex gene cluster on the mouse reference sequence, and adoption of a clear and unambiguous naming scheme, will provide a valuable tool to support studies on the evolution, regulatory mechanisms and biological functions of defensins in vivo.
引用
收藏
页数:13
相关论文
共 57 条
[1]   Allelic recombination between distinct genomic locations generates copy number diversity in human β-defensins [J].
Abu Bakar, Suhaili ;
Hollox, Edward J. ;
Armour, John A. L. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (03) :853-858
[2]  
[Anonymous], UCSC Genome Browser LiftOver Utility
[3]   Tandem repeats finder: a program to analyze DNA sequences [J].
Benson, G .
NUCLEIC ACIDS RESEARCH, 1999, 27 (02) :573-580
[4]   GeneWise and genomewise [J].
Birney, E ;
Clamp, M ;
Durbin, R .
GENOME RESEARCH, 2004, 14 (05) :988-995
[5]   The Mouse Genome Database genotypes::phenotypes [J].
Blake, Judith A. ;
Bult, Carol J. ;
Eppig, Janan T. ;
Kadin, James A. ;
Richardson, Joel E. .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D712-D719
[6]   Prediction of complete gene structures in human genomic DNA [J].
Burge, C ;
Karlin, S .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 268 (01) :78-94
[7]   Lineage-Specific Biology Revealed by a Finished Genome Assembly of the Mouse [J].
Church, Deanna M. ;
Goodstadt, Leo ;
Hillier, LaDeana W. ;
Zody, Michael C. ;
Goldstein, Steve ;
She, Xinwe ;
Bult, Carol J. ;
Agarwala, Richa ;
Cherry, Joshua L. ;
DiCuccio, Michael ;
Hlavina, Wratko ;
Kapustin, Yuri ;
Meric, Peter ;
Maglott, Donna ;
Birtle, Zoe ;
Marques, Ana C. ;
Graves, Tina ;
Zhou, Shiguo ;
Teague, Brian ;
Potamousis, Konstantinos ;
Churas, Christopher ;
Place, Michael ;
Herschleb, Jill ;
Runnheim, Ron ;
Forrest, Daniel ;
Amos-Landgraf, James ;
Schwartz, David C. ;
Cheng, Ze ;
Lindblad-Toh, Kerstin ;
Eichler, Evan E. ;
Ponting, Chris P. .
PLOS BIOLOGY, 2009, 7 (05)
[8]   Cancer-specific loss of β-defensin 1 in renal and prostatic carcinomas [J].
Donald, CD ;
Sun, CQ ;
Lim, SD ;
Macoska, J ;
Cohen, C ;
Amin, MB ;
Young, AN ;
Ganz, TA ;
Marshall, FF ;
Petros, JA .
LABORATORY INVESTIGATION, 2003, 83 (04) :501-505
[9]   Computational detection and location of transcription start sites in mammalian genomic DNA [J].
Down, TA ;
Hubbard, TJP .
GENOME RESEARCH, 2002, 12 (03) :458-461
[10]   MOUSE NEUTROPHILS LACK DEFENSINS [J].
EISENHAUER, PB ;
LEHRER, RI .
INFECTION AND IMMUNITY, 1992, 60 (08) :3446-3447