Criteria for gene identification and features of genome organization: analysis of 6.5 Mb of DNA sequence from human chromosome 21

被引:17
作者
Slavov, D
Hattori, M
Sakaki, Y
Rosenthal, A
Shimizu, N
Minoshima, S
Kudoh, J
Yaspo, ML
Ramser, J
Reinhardt, R
Reimer, C
Clancy, K
Rynditch, A
Gardiner, K
机构
[1] Eleanor Roosevelt Inst Canc Res, Denver, CO 80206 USA
[2] Riken Kitasato Univ, Wako, Saitama, Japan
[3] RIKEN, Human Genome Res Grp, Genome Sci Ctr, Wako, Saitama, Japan
[4] Jena Sequencing Ctr, D-07745 Jena, Germany
[5] Inst Mol Biotechnol Jena, D-07745 Jena, Germany
[6] Keio Univ, Sch Med, Dept Mol Biol, Tokyo 1608582, Japan
[7] Max Planck Inst Mol Genet, Berlin, Germany
[8] Keio Univ, Sequencing Ctr, Berlin, Germany
关键词
Down syndrome; gene identification; genome organization; human chromosome 21; sequence analysis;
D O I
10.1016/S0378-1119(00)00089-5
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
To establish criteria for and the limitations of novel gene identification, to identify novel genes of potential relevance to Down Syndrome and to investigate features of genome organization, 6.5 Mb of DNA sequence, dispersed throughout the long arm of human chromosome 21, have been annotated computationally and experimentally. Exon prediction with four programs, protein and EST database searches, two-sequence BLAST searches and CpG island characterization identified 41 genes with known or new protein homologies. Features of these genes suggested criteria for prediction of novel genes (those lacking any protein homology) with the following characteristics: (1) exon + EST genes: genes with excellent patterns of predicted exons and one or more matches in dbEST; (2) exon-EST genes: genes with good patterns of predicted exons and no matches in dbEST; (3) EST-exon genes: genes without any patterns of reliable exon prediction but with matches in dbEST; and (4) isolated CpG island genes: genes consisting of strong CpG islands that are apparently unique sequences and found in regions lacking any consistent exon predictions within > 50 kb. In total, 41 novel gene models were predicted, and for a subset of these, RT-PCR experiments helped to verify and refine the models, and were used to assess expression in early development and in adult brain regions of potential relevance to Down syndrome. Results suggest generally low and/or restricted patterns of expression, and also reveal examples of complex alternative processing, especially in brain, that may have important implications for regulation of protein function. Analysis of complete gene structures of the known genes identified a number of very large introns, a number of very short intergenic distances, and at least one potentially bi-directional promoter. At least 3/4 of known genes and 1/2 of predicted genes are associated with CpG islands. For novel genes, three cases of overlapping genes are predicted. Results of these analyses illustrate some of the complexities inherent in mammalian genome organization and some of the limitations of current sequence analysis technologies. They also doubled the number of potential genes within the region. (C) 2000 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:215 / 232
页数:18
相关论文
共 28 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   10 years of genomics, chromosome 21, and Down syndrome [J].
Antonarakis, SE .
GENOMICS, 1998, 51 (01) :1-16
[3]   The human genome: Organization and evolutionary history [J].
Bernardi, G .
ANNUAL REVIEW OF GENETICS, 1995, 29 :445-476
[4]   Evaluation of gene structure prediction programs [J].
Burset, M ;
Guigo, R .
GENOMICS, 1996, 34 (03) :353-367
[5]   Cloning of 559 potential exons of genes of human chromosome 21 by exon trapping [J].
Chen, H ;
Chrast, R ;
Rossier, C ;
Morris, MA ;
Lalioti, MD ;
Antonarakis, SE .
GENOME RESEARCH, 1996, 6 (08) :747-760
[6]   ISOLATION AND MAPPING OF HUMAN-CHROMOSOME-21 CDNA - PROGRESS IN CONSTRUCTING A CHROMOSOME-21 EXPRESSION MAP [J].
CHENG, JF ;
BOYARTCHUK, V ;
ZHU, YW .
GENOMICS, 1994, 23 (01) :75-84
[7]   SINGLE-STEP METHOD OF RNA ISOLATION BY ACID GUANIDINIUM THIOCYANATE PHENOL CHLOROFORM EXTRACTION [J].
CHOMCZYNSKI, P ;
SACCHI, N .
ANALYTICAL BIOCHEMISTRY, 1987, 162 (01) :156-159
[8]   Computational methods for the identification of genes in vertebrate genomic sequences [J].
Claverie, JM .
HUMAN MOLECULAR GENETICS, 1997, 6 (10) :1735-1744
[9]   Transcriptional map of the 2.5-Mb CBR-ERG region of chromosome 21 involved in Down syndrome [J].
Dahmane, N ;
Ghezala, GA ;
Gosset, P ;
Chamoun, Z ;
Dufresne-Zacharia, MC ;
Lopes, C ;
Rabatel, N ;
Gassanova-Maugenre, S ;
Chettouh, Z ;
Abramowski, V ;
Fayet, E ;
Yaspo, ML ;
Korn, B ;
Blouin, JL ;
Lehrach, H ;
Poutska, A ;
Antonarakis, SE ;
Sinet, PM ;
Créau, N ;
Delabar, JM .
GENOMICS, 1998, 48 (01) :12-23
[10]   A physical map of 30,000 human genes [J].
Deloukas, P ;
Schuler, GD ;
Gyapay, G ;
Beasley, EM ;
Soderlund, C ;
Rodriguez-Tomé, P ;
Hui, L ;
Matise, TC ;
McKusick, KB ;
Beckmann, JS ;
Bentolila, S ;
Bihoreau, MT ;
Birren, BB ;
Browne, J ;
Butler, A ;
Castle, AB ;
Chiannilkulchai, N ;
Clee, C ;
Day, PJR ;
Dehejia, A ;
Dibling, T ;
Drouot, N ;
Duprat, S ;
Fizames, C ;
Fox, S ;
Gelling, S ;
Green, L ;
Harrison, P ;
Hocking, R ;
Holloway, E ;
Hunt, S ;
Keil, S ;
Lijnzaad, P ;
Louis-Dit-Sully, C ;
Ma, J ;
Mendis, A ;
Miller, J ;
Morissette, J ;
Muselet, D ;
Nusbaum, HC ;
Peck, A ;
Rozen, S ;
Simon, D ;
Slonim, DK ;
Staples, R ;
Stein, LD ;
Stewart, EA ;
Suchard, MA ;
Thangarajah, T ;
Vega-Czarny, N .
SCIENCE, 1998, 282 (5389) :744-746