Nineteen additional unpredicted transcripts from human chromosome 21

被引:40
作者
Reymond, A
Camargo, AA
Deutsch, S
Stevenson, BJ
Parmigiani, RB
Ucla, C
Bettoni, F
Rossier, C
Lyle, R
Guipponi, M
de Souza, S
Iseli, C
Jongeneel, CV
Bucher, P
Simpson, AJG
Antonarakis, SE [1 ]
机构
[1] Univ Geneva, Sch Med, Div Med Genet, CH-1211 Geneva, Switzerland
[2] Ludwig Inst Canc Res, BR-01509010 Sao Paulo, Brazil
[3] Swiss Inst Bioinformat, CH-1066 Epalinges, Switzerland
[4] Ludwig Inst Canc Res, Off Informat Technol, CH-1066 Epalinges, Switzerland
[5] Swiss Inst Expt Canc Res, CH-1066 Epalinges, Switzerland
基金
巴西圣保罗研究基金会;
关键词
human chromosome 21; transcription map; genomic sequences annotation; gene prediction; Down syndrome; trisomy; 21; expression pattern; Acedb;
D O I
10.1006/geno.2002.6781
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
The identification of all human chromosome 21 (HC21) genes is a necessary step in understanding the molecular pathogenesis of trisomy 21 (Down syndrome). The first analysis of the sequence of 21q included 127 previously characterized genes and predicted an additional 98 novel anonymous genes. Recently we evaluated the quality of this annotation by characterizing a set of HC21 open reading frames (C21orfs) identified by mapping spliced expressed sequence tags (ESTs) and predicted genes (PREDs), identified only in silico. This study underscored the limitations of in silico-only gene prediction, as many PREDs were incorrectly predicted. To refine the HC21 annotation, we have developed a reliable algorithm to extract and stringently map sequences that contain bona fide 3' transcript ends to the genome. We then created a specific 21q graphical display allowing an integrated view of the data that incorporates new ESTs as well as features such as CpG islands, repeats, and gene predictions. Using these tools we identified 27 new putative genes. To validate these, we sequenced previously cloned cDNAs and carried out RT-PCR, 5'- and 3'-RACE procedures, and comparative mapping. These approaches substantiated 19 new transcripts, thus increasing the HC21 gene count by 9.5%. These transcripts were likely not previously identified because they are small and encode small proteins. We also identified four transcriptional units that are spliced but contain no obvious open reading frame. The HC21 data presented here further emphasize that current gene prediction algorithms miss a substantial number of transcripts that nevertheless can be identified using a combination of experimental approaches and multiple refined algorithms.
引用
收藏
页码:824 / 832
页数:9
相关论文
共 50 条
[1]   An autoimmune disease, APECED, caused by mutations in a novel gene featuring two PHD-type zinc-finger domains [J].
Aaltonen, J ;
Bjorses, P ;
Perheentupa, J ;
HorelliKuitunen, N ;
Palotie, A ;
Peltonen, L ;
Lee, YS ;
Francis, F ;
Hennig, S ;
Thiel, C ;
Lehrach, H ;
Yaspo, ML .
NATURE GENETICS, 1997, 17 (04) :399-403
[2]   The genome sequence of Drosophila melanogaster [J].
Adams, MD ;
Celniker, SE ;
Holt, RA ;
Evans, CA ;
Gocayne, JD ;
Amanatides, PG ;
Scherer, SE ;
Li, PW ;
Hoskins, RA ;
Galle, RF ;
George, RA ;
Lewis, SE ;
Richards, S ;
Ashburner, M ;
Henderson, SN ;
Sutton, GG ;
Wortman, JR ;
Yandell, MD ;
Zhang, Q ;
Chen, LX ;
Brandon, RC ;
Rogers, YHC ;
Blazej, RG ;
Champe, M ;
Pfeiffer, BD ;
Wan, KH ;
Doyle, C ;
Baxter, EG ;
Helt, G ;
Nelson, CR ;
Miklos, GLG ;
Abril, JF ;
Agbayani, A ;
An, HJ ;
Andrews-Pfannkoch, C ;
Baldwin, D ;
Ballew, RM ;
Basu, A ;
Baxendale, J ;
Bayraktaroglu, L ;
Beasley, EM ;
Beeson, KY ;
Benos, PV ;
Berman, BP ;
Bhandari, D ;
Bolshakov, S ;
Borkova, D ;
Botchan, MR ;
Bouck, J ;
Brokstein, P .
SCIENCE, 2000, 287 (5461) :2185-2195
[3]  
[Anonymous], 2000, Nature
[4]   Chromosome 21: from sequence to applications [J].
Antonarakis, SE .
CURRENT OPINION IN GENETICS & DEVELOPMENT, 2001, 11 (03) :241-246
[5]   DBEST - DATABASE FOR EXPRESSED SEQUENCE TAGS [J].
BOGUSKI, MS ;
LOWE, TMJ ;
TOLSTOSHEV, CM .
NATURE GENETICS, 1993, 4 (04) :332-333
[6]   A flexible motif search technique based on generalized profiles [J].
Bucher, P ;
Karplus, K ;
Moeri, N ;
Hofmann, K .
COMPUTERS & CHEMISTRY, 1996, 20 (01) :3-23
[7]   Prediction of complete gene structures in human genomic DNA [J].
Burge, C ;
Karlin, S .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 268 (01) :78-94
[8]   Genome sequence of the nematode C-elegans:: A platform for investigating biology [J].
不详 .
SCIENCE, 1998, 282 (5396) :2012-2018
[9]   The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome [J].
Camargo, AA ;
Samaia, HPB ;
Dias-Neto, E ;
Simao, DF ;
Migotto, IA ;
Briones, MRS ;
Costa, FF ;
Nagai, MA ;
Verjovski-Almeida, S ;
Zago, MA ;
Andrade, LEC ;
Carrer, H ;
El-Dorry, HFA ;
Espreafico, EM ;
Habr-Gama, A ;
Giannella-Neto, D ;
Goldman, GH ;
Gruber, A ;
Hackel, C ;
Kimura, ET ;
Maciel, RMB ;
Marie, SKN ;
Martins, EAL ;
Nóbrega, MP ;
Paçó-Larson, ML ;
Pardini, MIMC ;
Pereira, GG ;
Pesquero, JB ;
Rodrigues, V ;
Rogatto, SR ;
da Silva, IDCG ;
Sogayar, MC ;
Sonati, MDF ;
Tajara, EH ;
Valentini, SR ;
Alberto, FL ;
Amaral, MEJ ;
Aneas, I ;
Arnaldi, LAT ;
de Assis, AM ;
Bengtson, MH ;
Bergamo, NA ;
Bombonato, V ;
de Camargo, MER ;
Canevari, RA ;
Carraro, DM ;
Cerutti, JM ;
Corrêa, MLC ;
Corrêa, RFR ;
Costa, MCR .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (21) :12103-12108
[10]   A newly identified locus for usher syndrome type I, USH1E, maps to chromosome 21q21 [J].
Chaib, H ;
Kaplan, J ;
Gerber, S ;
Vincent, C ;
Ayadi, H ;
Slim, R ;
Munnich, A ;
Weissenbach, J ;
Petit, C .
HUMAN MOLECULAR GENETICS, 1997, 6 (01) :27-31